Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP019390 | Brucella sp. 09RB8910 chromosome 1, complete sequence | 3 CRISPRs | WYL,csa3 | 0 | 3 | 6 | 0 |
NZ_CP019391 | Brucella sp. 09RB8910 chromosome 2, complete sequence | 1 CRISPR | DEDDh,cas3,csa3 | 0 | 0 | 2 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP019390_1 | 310310-310422 | Orphan |
NA
Consensus repeat of NZ_CP019390_1
|
2 spacers
spacers of NZ_CP019390_1
>1.1|310330|27|NZ_CP019390|PILER-CR TAAATTCCCCTTCCAAAGCAAATCAAT >1.2|310377|26|NZ_CP019390|PILER-CR GATTGGCGCACCCCATGGGGACGCGC |
CRISPR arrays and Neighbor proteins around NZ_CP019390_1
The CRISPR arrays of NZ_CP019390_1 >merge|NZ_CP019390|1|310310-310422|PILER-CR TGAACAAAGTCATACGACTTTAAATTCCCCTTCCAAAGCAAATCAATTAAACAAAGTCATATGACTTGATTGGCGCACCCCATGGGGACGCGCTGAACGAAGTCATATGACTT >NZ_CP019390|1|1|310310-310422|PILER-CR TGAACAAAGTCATACGACTT TAAATTCCCCTTCCAAAGCAAATCAAT TAAACAAAGTCATATGACTT GATTGGCGCACCCCATGGGGACGCGC TGAACGAAGTCATATGACTT
>NZ_CP019390.1|WP_076770650.1|309603_310305_+|hypothetical-protein MRINSAIMRIVKVGMFSSCVSPAHNFMRMEQGKKEQQDWVKAVLSHLGITATELARRTGVAPSTIHKPLNDPEFPGMISSRTIHKIAEAAGLRPMEFPGRARAFNDKETEPYVFDEPEANPSFERAIRDLIAQRNGRSAWRIRSYALEISGILPGDVLIVDQSPQPKPNDIVAVELKDWSSGRTENVFRLYLPPYVITNSLREGAQKPLPVDGSDVSIRGVVDVVMRQPRVAH >NZ_CP019390.1|WP_076770649.1|309372_309573_-|helix-turn-helix-transcriptional-regulator MRTFDQIEQLRKHYKITRKQLYERAGIHKETWRRTAQSKTSPNVKTLQDLSEAIDTLISERGDAHA >NZ_CP019390.1|WP_154146686.1|309224_309380_-|hypothetical-protein MLDLVVMISGMAMIALGCGWMADRKLWVGLSCFLYAALVFVVEVFVFAVKW >NZ_CP019390.1|WP_154146684.1|308871_309219_-|hypothetical-protein MTERLTILHSEKRTEAPKSLENIRVVLQHEDNSRISVATMAADTSLATAEVTFHLSESVWIAERIIAGDMRVATRPGLARILAGAVVALAKIGCAAGALEEVTDVEEVSDDKQSD >NZ_CP019390.1|WP_076770647.1|308526_308820_+|hypothetical-protein MNIFNRLQGFKVSHNLTNEEYDRLLMVVGGAGMAAKAVLTVSMAGLPITVDSAVETCRHWMDPNSPATPVFLEAVGAHVNEMLATAEAISPPSKGRA >NZ_CP019390.1|WP_076770646.1|307947_308496_-|hypothetical-protein MTADLAIELQKIEEIRRENGIAQYHVERAAMLCNGYYSLLITGAKRPRMGTLNALRLALRRLIVTPEADTSPQSAFCNMAIRAAIALLCEARGLNAEKIQNSIASKRATQSPEWLEAARVRRDAWALVSNAFGISGSDLARAAGVSKAAISLALRAVEDARDDKEFDREMERLERALTGGGW >NZ_CP019390.1|WP_154144636.1|307459_307948_-|hypothetical-protein MTDEKNVLPFKVSEHQREDRNFYSTGLDGKLKTVSPDELVKEKVPSWPFLFLAINPPPQLGDKGRKERAASIISLTCERTPSASPSKFGLVRDTQLDSASKAASSDISLPAIRHVRTALFAYVATATVSMKWLRSFWSILCTISSLFVGLCSCRESDRRGKR >NZ_CP019390.1|WP_076770644.1|306731_307463_-|hypothetical-protein MNRRLISTAVRARRFEPATSLDFFPTAPWATRAFCEHVMPVVSPYGDRFKATAWDPACGMLHMSRVLEEYFERSVYASDIFDYGQGVSTFDFLDRSVAPLSVDWIITNPPFKCGAQFVERALELAGVGVAMLVRTQFLETVDRYRTLFQKRPADLIAQFVERVPMHRGRWVVNGKTATSYCWLVWLRHREHARSRAAVHFGQGFIWIPPCRQKLTRPDDVLRFGGCVDIPKRHKAWRPERMAA >NZ_CP019390.1|WP_154146682.1|306525_306735_-|hypothetical-protein MKKTVQLSAKVGMCEGLGAGFAIDDDAWEEFAQDLVNFGTAVFSADTVDGFVRLSIENDFWKRDPEAKR >NZ_CP019390.1|WP_076770642.1|305404_306529_-|toprim-domain-containing-protein 
MNGRGDLQEIKEMLKDRAQDVCEKLLPNGRRDGNLYVSHDPVQGDYDHKPALKVRLRGNIGGWEQFRGGAEGARKDIIGLVEYVLRTDTKGALAWARDFLGLRSMSRQEREAMQMAASAKAKKRQKEDHDRRLWRIKKADELFSQRTMQIHPDRLDPPALRHALSYFRGRDIPLEEVTHLPAETFRFSPQTEWWSGAKFQTEGARRFKVSPGPMFPAVHSAMRSQHGIVTACHCTFLDPLEPVKAPVEPPKLIFGEASGAVVEVSYGQFGDPFWLSRQAGLVIICEGVETALSLAIAAPEARVWAGGSITNMGNAPVWLDCVGAIILARDNNHGNPTAQKQIQQTIDKLESWGKPLTVINSHLGDDFNDLMKGE >NZ_CP019390.1|WP_154146688.1|310538_310814_+|hypothetical-protein MLSIETASIEEVADAIGRSEIWLKRNWNKFNAKHGFPRPIPGSDWKWPRRAVELWLIAGGVIMRAANSNEASADLISLQRQAHHERYGVVS >NZ_CP019390.1|WP_154146690.1|310810_311044_+|hypothetical-protein MTAKGSISLNHLEIDEHFAMSDYEPYHDYEPDDDFSDELDTGDACGRWINGQLGEHCTLAGTEFCDWECPYSGGGNE >NZ_CP019390.1|WP_076770652.1|311036_311324_+|hypothetical-protein MNSTSQPALIDPRELTPTERKLLREVATYKFYRRRNGWAIPGQGNKVSLKSVDNLYRKHLISDRSDCLRLTGAGQMVLAVMQERERAKTERKAAQ >NZ_CP019390.1|WP_076770653.1|311320_311581_+|DUF2312-domain-containing-protein MSDDITSEAQTIAVGQLRAFIERIERLQEEKKTIGDDINEVYAELKGSGFDSKVVRTIIRLRKKEDHERQEEDAILQLYMDALGMS >NZ_CP019390.1|WP_076770654.1|311814_313140_+|hypothetical-protein MAKKANPHPHVSWRDGRPRFQPGKELRAMGYSGKDLRHEDGRWFTRGEAVDWSTNFQQELAGKRKPAPSVPRTAPVRYAGYTVENLIADWTNPKRNPKWKLNGTRSYSPSTMNDYRDKMNVLMQHDGELWVSDVRSLDRPTCRNLFDSLWSQRGLATAKGVLLTLSSAISWGMLRGHVKLLDNPATKLQMESPEPRVRFGTRPEILALIAAADRIGRPEIGDMIVLAVWSGQRQGDRRELIDKGLMNNRRIFKQSKTGEIVAVMQAPELERRLEASKKRREAAKIKDPRVILDEKLWKSFSKRHYHEIFSKVRDAAIIGVVDVSATQKNFPSVKAMNVALKRAYEKHRKGEAVYDVRIVYEISPLISLHDLQDLDFRDTSVTWMALAGATIPEIISVTGHTAESATRILRHYLARHPEMADSAIRKMVSWYEADGETEIGY >NZ_CP019390.1|WP_076770655.1|313253_313727_+|hypothetical-protein MKQHCVIETEDIGRGKVRLHFHIDIQIDQLQPLPREIIDDAFTCLRHLLMFKPDGETPVRRMIESALNDNEDSRLLLRLAGIRVVSDFATNEGFIVASAGVGVNRLFDGSAWGEGLHRHALSSLPGARPTGPLNFAEGKAFRGTFLPARLIYDAVHY >NZ_CP019390.1|WP_076770656.1|314208_315882_+|L-lactate-permease 
MPWNQVYDPLGSMFWSTLLAALPIVVLLGGIGIFHIKAHIAAILGLITALLIAVIGFGMPADMAGATAVYGAAYGLLPIGWIILNVIFLYRLTEQTGQFNILRDSIAGITPDRRLQLLFIAFSFGAFFEGAAGFGTPVAVTAAMLMGLGFAPLPAAGLSLIANTAPVAYGALGTPVIALSAVTGIDLLQLSGMIGHQLPFFSAIVPFWLIWAFAGRKGMIEVWPALLVAGVSFAVPQYVVSNFHGPWLVDVIAAICSMAALAAFLRFWQPRRIWTSTGKEGEEANAPVQPRHSHSTGAVFRAWLPWLVLSVFVFLWGTPQIRTWLDSLWIWKMQVPHLHNLVFKVPPVVAEAHSEAAIFTLNLLSATGTGILLSAVVAGFILGFNPLKLVKEYFKTAYVVRFSLITISAMLALGYVTRYSGTDATLGLAFAQTGWVYPFFGAMLGWLGVALTGSDTASNVLFGGLQKITAEQLGLSPVLMAAANSSGGVMGKMIDAQSIVVASTATQWYGHESKILRYVFFHSIALACLVGVLVLAQAYVPPFTSLVPAETLPLVAH >NZ_CP019390.1|WP_002963859.1|316163_316319_+|hypothetical-protein MAIQTVSDELYRRHEREWAEFRNAVEASETNQKQQGQASGKTSCDSKTGNQ >NZ_CP019390.1|WP_076770657.1|316480_317185_+|SDR-family-oxidoreductase MTIQKVAIITAGGSGMGAASARRLAQDGFAVAILSSSGKGEALAKELGGIGVTGSNQSNDDLQKLVDQALEKWGRIDVLVNSAGHGPRAPILEITDEDWHKGMDTYFLNAVRPARLVVPAMQKQKSGVIINISTAWAFEPSAMFPTSAVFRAGLASFTKIFADTYAAENIRMNNVLPGWIDSLPATEERRESVPMQRYGKSEEIAATVSFLASDGAAYITGQNLRVDGGLTRSV >NZ_CP019390.1|WP_076770658.1|317257_318043_+|hypothetical-protein MAARKRSSSSRSPKRSSRRKSGSGSSTPILALGAILALGAFSLWSTAQHKSPQAAFSALFQRPAPKPAPAATAKAENKPAPGKTADTKAPARDYAAAVGPVPRPAAPVAPAPQKPVQTAAVAAPKPPRPSGSVVASVTPTAPHAMPPRGVNTPANSPSVIYARAKLTIHKNAWDRSPAIATIEKGREMRSYGKTGRWHRVVVPSTNIIGWVHEDQLIGGRNKPDSATLITGSVAKKAPSPAPTPAHEQSHILPPKAVGAKK |
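The segmented array record for NZ_CP019390_1 lists the array as alternating repeat and spacer sequences, and the spacer headers carry a start coordinate and a length. The following Python sketch shows how those spacer coordinates could be recovered from the segments alone. It assumes that the segments alternate repeat, spacer, repeat, and so on, and that coordinates are 1-based and inclusive; the function name `spacer_coordinates` and the constant `SEGMENTS_1` are hypothetical names introduced for illustration, not part of the source page.

```python
from typing import List, Tuple


def spacer_coordinates(array_start: int, segments: List[str]) -> List[Tuple[int, int, str]]:
    """Return (start, end, sequence) for each spacer, 1-based inclusive.

    Assumes `segments` alternates repeat, spacer, repeat, ... and that
    `array_start` is the coordinate of the first base of the first repeat.
    """
    coords = []
    pos = array_start
    for index, segment in enumerate(segments):
        if index % 2 == 1:  # odd positions hold spacers under the assumption above
            coords.append((pos, pos + len(segment) - 1, segment))
        pos += len(segment)
    return coords


# Segments of NZ_CP019390_1 as listed in the record (array 310310-310422).
SEGMENTS_1 = [
    "TGAACAAAGTCATACGACTT",
    "TAAATTCCCCTTCCAAAGCAAATCAAT",
    "TAAACAAAGTCATATGACTT",
    "GATTGGCGCACCCCATGGGGACGCGC",
    "TGAACGAAGTCATATGACTT",
]

for start, end, seq in spacer_coordinates(310310, SEGMENTS_1):
    print(f"{start}-{end}\t{len(seq)}\t{seq}")
```

Under these assumptions the computed starts (310330 and 310377) and lengths (27 and 26) agree with the spacer headers 1.1 and 1.2 of this array.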
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP019390_2 | 1044062-1044152 | Orphan |
NA
Consensus repeat of NZ_CP019390_2
|
1 spacer
spacers of NZ_CP019390_2
>2.1|1044090|35|NZ_CP019390|CRISPRCasFinder ACTCTTTGTTTTTACGCATTATCCAACGCAAAACT |
CRISPR arrays and Neighbor proteins around NZ_CP019390_2
The CRISPR arrays of NZ_CP019390_2 >merge|NZ_CP019390|2|1044062-1044152|CRISPRCasFinder GCTTCGCACTTTTGCTGGAAATGCTCTAACTCTTTGTTTTTACGCATTATCCAACGCAAAACTGCTTCGCACTTTTGCTGGAAATGCTCTA >NZ_CP019390|2|1|1044062-1044152|CRISPRCasFinder GCTTCGCACTTTTGCTGGAAATGCTCTA ACTCTTTGTTTTTACGCATTATCCAACGCAAAACT GCTTCGCACTTTTGCTGGAAATGCTCTA
>NZ_CP019390.1|WP_008935088.1|1043282_1043936_-|RNA-binding-protein MMVRVRTKQRFRVGQQEMNDRTCIVTRESGSADDMIRFVAGPDGSVVPDLKRTLPGRGCWVKAERRLVDEAVKRKLFARALKEGVTPQADLGALVDQLLTKSALGSLALARKAGAVVTGSTKVDQAIRTGAAAMVLHAREAAADGVRKLDQARRAVVHLEGPEIPSFTLFSGEEMDLAFGGGNVIHAAVLEGKAAAGFVKRALLLHRYRGESASNLD >NZ_CP019390.1|WP_076771020.1|1040265_1043145_-|translation-initiation-factor-IF-2 MSDKTNDDKTLSVNPKKTLTLKRPGVEQSTVRQNFSHGRTKAVVVETKKRKFSRPDEKPEVEAAAAPKPAAPAAAPQQAPSSAPVSASAAQASAPQPAPVKAPATKAPAAPSAPVTKPHVAQQRPVHQRPGGQQAQRPRPADRSGMVLNTLSRSEMDARRRALEEAQIREVEERARAVEEAKRRAEEDARRAKEREESARRQAEEEARLKAEAEARRKAEEEAAKRMPQPEARSERRDDARPAPYGARPQQAGRPQGGRPQPAGRPQQGSPRPAPIIADAAPIAGKPLPQSQLRKPGQSDDDDDRRSGAARRGVAAKPEVRAPKVVKGEDDRRRGKLTLTSNLEEEGRSRSLSAMRRRQEKFKRSQMQETREKISREVTIPETITLQELAQRMAERSVDIIKYLMKQGQMMKPGDVIDADTAQLIAEEFGHTVKRVAESDVEEGIFDVADNESAMVSRPPVVTIMGHVDHGKTSLLDAIRHANVVSGEAGGITQHIGAYQVVQNGQKITFIDTPGHAAFTAMRARGAQATDIAILVVAADDSVMPQTIESINHAKAAGVPIIVAINKIDKPAADPQKVRTALLQHEVFVESMGGEVLDVEVSAKNKINLDKLLDAVLLQAEMLDLKADPDRTAEGVVIEAQLDRGRGSVATVLIQKGTLHPGDILVAGSEWGRVRALVNDRGEHVKEAGPAMPVEILGLQGTPQAGDRFAVVANEAKAREIAEYRQRLARDKAVARQSGARGSLEQMMNQLQVSGTKEFPLVIKGDVQGSIEAITNALDKLGTDEVRARIVHSGAGGITESDVSLAEASNAAIIGFNVRANKQARDSAEQQGIEIRYYNIIYDLIDDVKAAMSGLLSPERRETFLGNAEILEVFNITKVGKVAGCRVTEGKVERGAGVRLIRDNVVIHEGKLKTLKRFKDEVAEVPSGQECGMAFENYDDIRAGDVIEAFRVEHVSRTL >NZ_CP019390.1|WP_076771890.1|1039547_1040000_-|30S-ribosome-binding-factor-RbfA MARSHDMKGSGGLSQRQLRVGEQVRHALAQVLQRGEIRDDLIERTVISVSEVRMSPDLKIATCFITPLGSADPQAVIKALASHAKFIRGRVAPSLAQMKYMPEFRFRPDTSFDNFSKIDALLRSPEVARDLSHDDDEDGGADEAPRNGDE >NZ_CP019390.1|WP_076771019.1|1038569_1039544_-|tRNA-pseudouridine(55)-synthase-TruB MARRGKKKGRPISGWVIFDKPKGMGSTEAVSKIKWLFSAEKAGHAGTLDPLASGMLPIALGEATKTVPYVMDGTKVYRFTVTWGEERSTDDLEGQPTKTSDKRPSREEVEALLPDYTGVISQVPPQFSAIKIDGERAYDLAREGETVEIPAREVEIDRLEIVGFPDADRTEFEVECSKGTYVRSLARDMGRDLGCYGHISDLRRVEVAPFTDEDMVTLARLEAVWPPLPPKDEDGNVIEPAPWRDFSALDALVIDTGAALDCLPQVPLSDDQAQRVRLGNPVILRGRDAPLEADEACVTTRGKLLAIGYIEHGQFKPKRVFTAG 
>NZ_CP019390.1|WP_002965230.1|1038129_1038399_-|30S-ribosomal-protein-S15 MSITAERKQALIKEYATKEGDTGSPEVQVAVLSERIANLTEHFKGHKNDNHSRRGLLKLVSQRRRLLDYVKGVDHARYQALITRLGLRR >NZ_CP019390.1|WP_002965231.1|1035701_1037846_-|polyribonucleotide-nucleotidyltransferase MFNTHKVEIEWGGRPLTLETGKIARQADGAVLATYGETAVLATVVSAKEPKPGQDFFPLTVNYQEKTYAAGKIPGGYFKREGRPSENETLVSRLIDRPIRPLFVDGYKNDTQVVITVLQHDLENNPDILSMVAASAALTISGVPFMGPISGARVGYIDGEYVLNPNIDEMPESKLDLVVAGTSEAVLMVESEAQELPEDVMLGAVMFGHKSFQPVIDAIIKLAEVAAKEPRDFQPEDLSELEAKVLAVVENDLREAYKITEKQARYAAVDAAKAKAKEHFFPEGVEETEMSAEQFATIFKHLQAKIVRWNILDTGNRIDGRDLSTVRPIVSEVGILPRTHGSALFTRGETQAIVVATLGTGEDEQMIDALTGTYKESFMLHYNFPPYSVGETGRMGSPGRREIGHGKLAWRAIHPMLPAAEQFPYTIRAVSEITESNGSSSMATVCGTSLALMDAGVPIVRPVAGIAMGLIKEGERFAVLSDILGDEDHLGDMDFKVAGTEFGITSLQMDIKIDGITEEIMKVALEQAKGGRVHILGEMAKAISSSRAELGEFAPRIEVMNIPTDKIRDVIGSGGKVIREIVEKTGAKINIEDDGTVKIASSNGKEIEAAKKWIHSIVAEPEVGEIYEGTVVKTADFGAFVNFFGPRDGLVHISQLAADRVAKTTDVVKEGQKVWVKLMGFDERGKVRLSMKVVDQETGKEIVAEKKKEEVDAE >NZ_CP019390.1|WP_008935091.1|1034567_1035590_-|class-I-SAM-dependent-methyltransferase MTTPAQKTLFLPFEQGILDMPDPGQSFLACGLAADRLLEPEWKQALTCLQPWRPDWLALQKEGFHAEPRLATDRNFSGGLLLLGKHRGRNEAWFAQLLARVQPGGWIVVSGDKKLGIDSFRKWAGNIAEISDRMSKNHAVVFWLRRPDDLDEAFIADLKPLAADIEGGFRTEPGMFSHGAIDKGSALLARHMEKIVFGNVADLGAGWGYLAAQCLKYADRIKNIDLYEADYEALEAARGNLERLGASIPISFNWFDVTSEKIAGIYDTVIMNPPFHEGRVTDVSLGQSFIAAAASRLKPGGRLLVVANRQLPYELTLKGLFKTVTLLEEAEGFKIFDAKK >NZ_CP019390.1|WP_002965233.1|1033701_1034520_+|enoyl-ACP-reductase-FabI MEGLMQGKRGLIMGVANNHSLAWGIAKQLAAQGAELAFTYQGDALGKRVKPLAEQVGSDFVLPCDVEDIATVDAVFEEIEKKWGGLDFLVHAIGFSDKTELKGRYADVTTRENFSRTMVISAYSFTEVAQRAEKLMKDGGSILTLTYGGSTRTIPNYNVMGVAKAALEAMVRYLAADYGPQGIRVNAISAGPVRTLAGAGIGDARAIFSYQRRNSPLRRTVDIDDVGKSAVYLLSDLSSGVTGEIHFVDSGYNIVSMPTLEELKSSDSERGE >NZ_CP019390.1|WP_059243166.1|1032463_1033687_+|beta-ketoacyl-ACP-synthase-I 
MRRVVVTGMGIVSSIGNNTEEVTASLREAKSGISRAEEYAELGFRCQVHGAPDIDIESLVDRRAMRFHGRGTAWNHIAMDQAIADAGLTEEEVSNERTGIIMGSGGPSTRTIVDSADITREKGPKRVGPFAVPKAMSSTASATLATFFKIKGINYSISSACATSNHCIGNAYEMIQYGKQDRMFAGGCEDLDWTLSVLFDAMGAMSSKYNDTPSTASRAYDKNRDGFVIAGGAGVLVLEDLETALARGAKIYGEIVGYGATSDGYDMVAPSGEGAIRCMKMALSTVTSKIDYINPHATSTPAGDAPEIEAIRQIFGAGDACPPIAATKSLTGHSLGATGVQEAIYSLLMMQNNFICESAHIEELDPAFADMPIVRKRIDNVQLNTVLSNSFGFGGTNATLVFQRYQG >NZ_CP019390.1|WP_002968051.1|1031781_1032300_+|3-hydroxyacyl-[acyl-carrier-protein]-dehydratase-FabA MAEQKSSYGYEELLACGRGEMFGPGNAQLPLPPMLMIHRITEISETGGAFDKGYIRAEYDVRPDDWYFPCHFQGNPIMPGCLGLDGMWQLTGFFLGWLGEPGRGMALSTGEVKFKGMVRPHTKLLEYGIDFKRVMRGRLVLGTADGWLKADGELIYQATDLRVGLSKEGSAQ >NZ_CP019390.1|WP_179947192.1|1044330_1046655_-|transcription-termination/antitermination-protein-NusA MTEQVQANETETPVAVADERIIRETGIDAKVAGIVEPVINTLGFRLVRVRLSGLNGQTLQIMAERPDGTMTVDDCELVSRTVAPVLDVEDPISGKYHLEISSPGIDRPLVRKSDFSDWAGHIAKVETSIVHEGRKKFRGRIVVGEADSVTIESDQISYGNEPVVRIPFDLISDARLVLTDDLIRDALRKDKALREGRIPGDDLGQSRKKPHLPKRKRRNKFQASAQEKAGCDTKERPMAVSANRLELLQIADAVAREKSIDREIVLAAMADAIQKAARSRYGQESNIRADINAKSGEIKLQRLLEVVENVEDYATQISLFMARDRNPDAQVGDFIADQLPPMDFGRIAAQSAKQVIVQKVREAERDRQYDEYKDRVGEIVNGTVKRVEYGNVIVDLGRGEAIVRRDELIPREAFRYGDRIRAYVYDVRREQRGPQIFLSRTHPQFMAKLFTMEVPEIYDGIIEIKSVARDPGSRAKIAVVSRDASIDPVGACVGMRGSRVQAVVAELQGEKIDIIPWSPDAASFIVNALQPAEVAKVVLDEDAERIEVVVPNDQLSLAIGRRGQNVRLASQLTGWDIDILTEDEESERRQKEFAERSNLFMEALNVDEMVGQVLASEGFASVEELAYVDAGEISSIDGFDEDTAGEIQDRAREYLERIEAEQDARRKELGVADELRELPGMTTAMLVAVGEDGVKTMEDFAGYAVDDLVGWRERKDGDTINHSGVLTPFDLSRVDAEQMVLAARLKAGWITEEELAAASEDVEAEEAGDEEAAS >NZ_CP019390.1|WP_004684581.1|1046822_1047524_-|tRNA-(guanosine(46)-N7)-methyltransferase-TrmB MIDENHPMRAAGNFFGRRHGKPLRPHQSNLFEDLLPRLKLDLATPAPQDLRSLFEAPVETVRMEIGFGGGEHLHHESGRYPQSGFIGVEPFINGMAKMLAALDQAPRPNLRLYDEDATAVLDWLPDASLAGIDLFYPDPWHKRRHWKRRFVSDANLDRFARVLKPGAKFRFASDIEHYVNWTLQHCRRHAAFDWQAESPADWNDAYEGWPGTRYEAKAFREGRRAAYLTFIRR >NZ_CP019390.1|WP_042972263.1|1047532_1048798_-|methionine-adenosyltransferase 
MSRSSYLFTSESVSEGHPDKVCDRISDEIVDMIYKEARRTGVDPWSVRIACETLATTNRVVIAGEVRVPETFLKKNKDGSIAHDAAGHPLINPSRFRSAARKAIREIGYEQDGFNWRTVKVDVLLHPQSADIAQGVDNAADRQGEEGAGDQGIMFGYACRETPDLMPAPIYYSHKILEKLAEARHKGEGDAGKLGPDAKSQVTVRYENGKAAEVTQIVLSTQHLDASWDSRKVRSVVEPYIREALGDLPIAENCNWYINPTGKFVIGGPDGDAGLTGRKIIVDTYGGAAPHGGGAFSGKDTTKVDRSAAYAARYLAKNVVAAGLADRCTIQLSYAIGVAQPLSVYVDLHGTGKVAESAVEDALRKVMDLSPTGIRKHLDLNKPIYAKTSSYGHFGRKPGRDGSFSWEKTDLIKALKAAVSA >NZ_CP019390.1|WP_002965221.1|1049024_1049438_-|helix-turn-helix-transcriptional-regulator MIENKKKPNPIDMHVGSRIRLRRNMLGLSQEKLGENLGITFQQIQKYEKGTNRVGASRLQAISSILNVPVSFFFEDAPGSGSSGGDGFAEDNEATYVVDFLNSNEGVQLTRAFTKISDPKVRRKIIDLVKSLAADAE >NZ_CP019390.1|WP_076771021.1|1049584_1051183_-|apolipoprotein-N-acyltransferase MIARLAGRIILLSGWRRALAAFLSGAFATLTQPPFDIFVAGFVSFPILVWLIDGAIARTDAGPLRRLLPAAKVGWWFGFGYFVSGLWWIGTALLVDADQFAWALPLAVLGLPAFLALFYAFAAMIARLLWSDGLGRILALAFGFALAEWLRTFIFTGFPWNLIGYAAMPVPLLMQSVAVLGLVGMSALAVFVFAAPALLTGGHFARTGIGLAIFLALAHVGFGAWTLSRAPAIVDENGPLAVRIVQPSIAQAMKWDNAERRAIFDKLVGLTEEAPAEGKPRPDVIVWPETAIPYILESTPQALAHIGDALQEGQVLLAGAVREEKGADGGEPRYYNSIYTIDDRGRIKGTADKVHLVPFGEYLPFESFLRGLGLQEVVEMPGGFTAGTIRHALAVKDGRSFLPLICYEAIFPDELGYEGARASAIINVTNDAWYGDTPGPYQHFRQAQLRAVEQGLPLIRAANNGLSAIVDTYGRITGSLALDAVGVVDSYLPSPRDPFWGRPPGWIQTVLILLTLLAASVGLILYSRRRFH >NZ_CP019390.1|WP_002968046.1|1051179_1052298_-|HlyC/CorC-family-transporter MAEQTLHPPSAGDNRGETQDSEGQSTQRSSVTEKRSLLSNIFPFMRARQASSLREDLADALSSATSEDGAAFSPEEKAMLHNILRLREIRVEDVMIPRADVEAVEITTPLWEVLELFEKSGHSRMPVYAETLDDPRGMIHIRDVLNYITRQARQKARRRTTAKSAATEVAPKFDMSRIDLAKTIGELNLMRKVLFVPPSMMASGLMARMQASHIQMALVIDEYGGTDGLVSLEDIVEMVVGDIEDEHDDEEIMIAEDADGVFVVDARADLEELAARIGPGFAVGEHGEDVDTVGGLIFSVLGRIPVRGEVVQAIPGYEFHVLEVDPRRVKKVRIVPLSAADRRRQQRAVVSARPEDHAADAAHAAQNSKEAG >NZ_CP019390.1|WP_156884120.1|1052300_1052807_-|rRNA-maturation-RNase-YbeY MSDNAIHIDIMIEAGNWPDEASLESLVSKSVAAAWNSLGLKSATSELSVVFTDDASIQVLNGEWRGKDKPTNVLSFPAFPVKAGSQPGPMLGDIVIARETVEREAKEEGKPIENHLSHLVVHGFLHLLGYDHETDEEAEVMEAREREILHALAIPDPYAVSDEDINND 
>NZ_CP019390.1|WP_002968045.1|1052942_1054004_-|PhoH-family-protein MSATEKLKSAKPTNQTQKSPTTGASDMAHIVLTFDNNRLASALYGQFDENLARIEQKLGVDVRSKGNQLSIRGEPTATEQARRALDHLYETLQKGHELTVSDVDGALRMAIAADDQLTLPTMENKGKLSAAQISTRKKTIFARTPTQDAYMRALDRSELVFGVGPAGTGKTYLAVAHAAMLLERGLVERIILSRPAVEAGERLGFLPGDMKEKVDPYLRPLYDALYDMMPAEKVERAITAGVIEIAPLAFMRGRTLAHSAVILDEAQNTTSMQMKMFLTRLGEGSRMIVTGDPSQIDLPPGQKSGLVEALRVLDDVEGVIKVRFTEKDVVRHPLVAAIVGAYDRDGKQHAGPE >NZ_CP019390.1|WP_076771022.1|1054162_1055566_-|tRNA-(N6-isopentenyl-adenosine(37)-C2)-methylthiotransferase-MiaB MSDDTTQIEPAMAQETSPRANTRKVFVKTYGCQMNVYDSQRMADSLAAEGYVATDTPDDADLVLLNTCHIREKASEKLYSALGRLRKMKDARAADGKELTIGVAGCVAQAEGQEILRRAPNVDLVIGPQTYHRLPNALARVRGGEKVVETDYAIEDKFEHLPAPRREETRKRGVSAFLTVQEGCDKFCTFCVVPYTRGSEVSRSVKQIVAEAERLADSGVRELTLLGQNVNAWHGEGEDGREWGLGELLFRLARIPGIARLRYTTSHPRDMDDSLIAAHRDLRQLMPYLHLPVQSGSDRILKAMNRRHKADEYLRLIERIRDVRPDMALSGDFIVGFPGETDQDFEDTMQLVREVHYAQAYSFKYSPRPGTPGADLDDHVEEAVKDERLQRLQALLSAQQYAFQDSMIGRAMDVLLEKPGREAGQMVGRSPWLLPVIIDDNKDRVGDIIHVKIVSTGTNSLIAQKLA >NZ_CP019390.1|WP_002967046.1|1055623_1056427_-|1-acyl-sn-glycerol-3-phosphate-acyltransferase MIGTIRIFLVVAAMVALSLSLIPFQYLFLKLKNGWKRRLPNFFHRIVARLFGFRIRTVGKLHEGCPLLLVSNHTSWSDIVVLSAVGQVSFIAKSEVRDWPVFGMFAVLQRTVFVERARRGKTVHQTSEIANRLIAGDAMVLFAEGTTSDGNRVLPFKTALFGAAHAAIREAGVAEVAVQPVAIAYTRVHGMAMGRYFRPLVSWPGDVELMPHLKGILREGAIDVEVRFGEPVFVTAETDRKALARTMENRVRALLQSALLGREIPEA |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP019390_3 | 1354302-1354540 | Orphan |
NA
Consensus repeat of NZ_CP019390_3
|
3 spacers
spacers of NZ_CP019390_3
>3.1|1354352|13|NZ_CP019390|PILER-CR GAATGCCAAGGGC >3.2|1354415|13|NZ_CP019390|PILER-CR CAATGCCAAGGGC >3.3|1354478|13|NZ_CP019390|PILER-CR CAATGCCAAAGGT |
CRISPR arrays and Neighbor proteins around NZ_CP019390_3
The CRISPR arrays of NZ_CP019390_3 >merge|NZ_CP019390|3|1354302-1354540|PILER-CR GTTGCCGATAACAAGACTGCCATTGGCCAGAATAGTGGCCGGATCGACACGAATGCCAAGGGCGTTGCTGACAACCGGGCTGCCATTGGCCAGAACAGTGGCCGGATCGACGCCAATGCCAAGGGCGTTGCCGACAACAAGACTGCCATTGGCCGGAACAGTGGCCGGATCGACGCCAATGCCAAAGGTGTTGCCGACAACAAGACTGCCATTGGCCAGAATAGTGGCCGGATCGACAC >NZ_CP019390|3|2|1354302-1354540|PILER-CR GTTGCCGATAACAAGACTGCCATTGGCCAGAATAGTGGCCGGATCGACAC GAATGCCAAGGGC GTTGCTGACAACCGGGCTGCCATTGGCCAGAACAGTGGCCGGATCGACGC CAATGCCAAGGGC GTTGCCGACAACAAGACTGCCATTGGCCGGAACAGTGGCCGGATCGACGC CAATGCCAAAGGT GTTGCCGACAACAAGACTGCCATTGGCCAGAATAGTGGCCGGATCGACAC
>NZ_CP019390.1|WP_076771171.1|1352898_1353456_+|GNAT-family-N-acetyltransferase MNDICTIARDKVVVRALEKGDLQRYRAIRLNALQRAPMSFGSTFEEENAYSDTIFARRLEQVDGNAIFGAFHGEELLGIAGHHRHERRTERHRGTLASVYVEPQARGLKLGEALVQKVIDHAARHVVVLDARVVATNEAAKRIYYALGFKTCGVERKALLVQGQYLDQKLIYIDFSDPAWKDKLG >NZ_CP019390.1|WP_076771944.1|1352194_1352845_+|GNAT-family-N-acetyltransferase MVVEEESYEASCCEREADYADASPGLEKELGYVLDCPILVTERLVLRPPHAEDVDAISYLANNARVSSMLARMPHPYTRENALDFVGRVRKGEMGNCIYAITQAETGIFMGCCGIHPYKHGEGLEIGYWLGEPYWGHGFATEAAHALIDLAFRATGIGRLHVSCRASNGGSRRVIHKCGFQFSGMGMADSLAAGNVPVEHYVLDRRTWIGLRSWHS >NZ_CP019390.1|WP_002964927.1|1351804_1352074_+|50S-ribosomal-protein-L27 MAHKKAGGSSRNGRDSESKRLGVKKFGGEAVLAGNIIVRQRGTKWHPGANVGLGKDHTIFATVNGSVSFRTKANGRTYVSVNPIAEAAE >NZ_CP019390.1|WP_076771170.1|1351350_1351779_+|50S-ribosomal-protein-L21 MFAVIKTGGKQYRVAANDLIKVEKVAGEAGDIVEFAEVLMVGSTIGAPTVAGALVTAEVVEQGRGRKVIAFKKRRRQNSKRTRGHRQELTTIRISEILTDGAKPSNKAAEKKAPKADAAEGEATKPKKAAPKKAAAKAESAE >NZ_CP019390.1|WP_076771169.1|1349611_1349950_+|AzlD-domain-containing-protein MSTELWIAVIVIGVLTYITRAVPFLMSLAGNTAPAARAWLSALGPCLLSAMATVVFLDGFQTSIQIGKIIPFLVGSVFASVSMSVRPDPGIGTVAGVTGWWLAASIGSVPTF >NZ_CP019390.1|WP_076771168.1|1348937_1349612_+|AzlC-family-ABC-transporter-permease MTERWRQFFHGARSAIPIILGYLPVAFAFGTAASGQGLALLDSTAISALMFSGANQAFFLSAVASGLPTIAIVAICAVASLRHILYGFVLRLRLAGGLASRLAFAFGLTDEVFATVLNATEKSKPDGGWIFGLAFFAWISWVAATFFGAWMGNILQAQFLQLSDALHFALPALFLGLVWVSTSARNVIPMVAAAVIAAMFLCLNLPALAIPGAASAALLARRGD >NZ_CP019390.1|WP_076771941.1|1347012_1348941_+|helix-turn-helix-domain-containing-protein 
MNLARLRKRRGLTLDGLAELSSISRAAISALENGAGNPRLETLWSLANALGIEFGELVGARNDVEVVEADGISVRLIDRQTRPRTVEAFLLDLPANAKRHADAHVHGVSENVVVLSGAIAVGPLSTPMLLHAGQSHQFAADVPHIYSSGAEPSRAIVTIIYPEDDTALTSEDQELEWPVGKDEWANVRAQLNRARIEVQNGYAHSRITFKSTPEPLQSAIRLIEDELATRSGIAETAKVFVTGNRTPAIATFYRTTQMRPLPINEQLATPLITNCRELANAAITPWLAKKVDADGLHAKSQNSTHIIEAALAAEVLTRLGRPTVPTGISQKQVTPKQSPLMDRMFEDRIDVDAYEAYELVHPAYARQVLAVAETLPVFATKSDQTILDVGTGSGLPLQMLLELRPELHVVAIDPSEIANVHLSRRFADDSRVQAVQASIIDYRPADYLFDAAVSIGASHHLDTKQFLSSIHECLAAEGVLVIADEMLAPFRDRRERNLALVTHHLWYILDTLFDLPASSSEAERAVCDILKQGLPLAMSLALSGRSEAATRQVRETFKAATDIDLGNALVAREAAFNRFHLLELQALVAGLDYEVEQKTYPARFVSLAESSGFSLLQHRRIYATQGDGSYDAGTHLFVMVKR >NZ_CP019390.1|WP_179947168.1|1346331_1346826_-|ISAs1-family-transposase MTGPSPCSLTLLDHFSALSAPRQRWKVACPLEEILLVVLCATISGMEGFVETKLWVEHRLEFLRRFLPFERGIPSHDTLNDVINALEPALFKECFTNWVEALRERDGDIIAIDGKTSRRSHDRGKGRGPLHMVSAWATRQRLVLDSKPARKKVLSQILIMLKAA >NZ_CP019390.1|WP_076771166.1|1344048_1344723_+|hypothetical-protein MTDLIHIHEDDWGMRNLFPLAAFSEVKEDIARSATAAEKHQDASGFGYTDVYLMEPPSISYADVGLLVSDAESVLLPILPRVQHFRATSFQGMKSGKHDPYGTYQDDTSCFGLGRHCYLKLDKKGPLVEGIWFGLDTDDADAIGRLRMAIEAIDALVPSVIADYFLDISGPVGADGVLDSYFEAFQLQHRKAKQAVQEFQAKYQRQENMLDKLRKLVAFLGRFR >NZ_CP019390.1|WP_076771165.1|1334058_1336014_-|M23-family-metallopeptidase MKELGPNNLDPGDEPPLSVGGRRRPPDRREVSARWLAGTFLTGVTSCMLIGIALFAALDGREQLATPPEILARNEMPGITEDDTATKGGRLISMVSQQKTRDRRRFDLSTMQRVGEREVIRTKPFEFVRMALAVDHPTNRKYPPFNAMTIFSEGPAAPQPSDAGQIYGAKVESEVSLRVVDFPISTATFDASSDLTVDEVEKVVRDTGGLLTDGDVQVASLHYVDPARFGGNDSPFALSPPLGVKITQENVSIAPRSDDEQANEGFSEELIPFRQTADIAQALEDAGYTGEDASNMAEALAKLMNSPRLKQGSVLRVGTESHDGNDRIVRASIYNRTTHLVTVALNDRQQYVPSDEPEETPLLQTAFDGNAAPAAIRGNLPSVYDGISHAALAYGMTETMREQLVKMLASDVDLQARLSPSDAIDAFFSLPDDPEKPENDSQLLYVAATFGGTTRKFYRYQAPDGSVDYYNEDGKSAKQFLLRNPVPNGIFRSPFGMRRHPILGYTRMHTGVDWAAPRGTPIIAAGNGVVEKAGWSNGYGNQTLIRHANGYVTSYSHQNAIARGITPGARVRQGQVIGYVGSTGLSTGPHLHYELIVNGTKVDALRIRLPDNKALKGKEFEAFKQERDRIDTLLNSDENGTKLASNGSAKS >NZ_CP019390.1|WP_083699653.1|1354889_1355768_+|YadA-like-family-protein 
MGRNSGRIDTNAKGVADNRAAIGRNSGRIDTNAKGVADNKAAIGRNSSRIDTNAKGVADNRAAIGRNSGRIDANAKGVADNRAAIGRNSGRIDTNAKGVADNRAAIGRNSGRIDTNAKGVADNKTAIGRNSGRIDTNAKGVADNRAAIGRNSGRIDTNTKGVADNRAAINQNRGRINANAAGVASNRAAIRQNSAAISALGQRVDGLQGQINSARKEARAGAANAAALSGLRYDNRPGKVSIATGVGGFKGSTALAAGIGYTSKNENARYNVSVAYNEAGTSWNAGASFTLN >NZ_CP019390.1|WP_009364205.1|1356023_1357049_+|GTPase-ObgE MKFLDQAKIYIRSGNGGAGAVSFRREKFLEFGGPDGGDGGRGGDVWVEAVDGLNTLIDYRYQQHFKAKTGMHGMGRNMTGGKGDDVVLRVPVGTQIFEEDNETLICDITEVGQRYRLAKGGNGGFGNLHFTTSTNRAPRRANPGQEGIERTIWLRLKLIADAGLVGLPNAGKSTFLASVTAAKPKIADYPFTTLHPNLGVARIDGREFVIADIPGLIEGASEGVGLGDRFLGHVERTRVLLHLVSAQEEDVAKAYQVIRGELEAYEHGLADKPEIVALSQVDTLDPETRKAKVKALKKACGREPLLLSAVSHEGLNDTLRQLARIIDLSRAEEAGTAQAEE >NZ_CP019390.1|WP_076771172.1|1357076_1358213_+|glutamate-5-kinase MLKKLKDYRRIVVKIGSGLLVDRATGLKREWLESLGQDIAALQHAGVEVLVVSSGAIALGRTVLGLPKKALKLEESQAAAAAGQIALAKAYADVLGGHGIKSGQILVTLSDTEERRRYLNARATIETLLKLKAVPIINENDTVATTEIRYGDNDRLAARVATMMGADLLILLSDIDGLYTAPPHKNPDAQFLPFVETITPQIEAMAGAAASELSRGGMKTKLDAGKIANAAGTAMIITSGTRFGPLSAIDRGERATLFEAAHAPVNAWKTWISGNLEPAGRLTVDAGAVKALKSGKSLLPAGVKDVDGDFERGDTVAVMNEDGREIARGLIAYDAADARKVAGHKSDEISAILGYDARAAMIHRNDLVVRAASDAKAA >NZ_CP019390.1|WP_076771173.1|1358222_1359506_+|glutamate-5-semialdehyde-dehydrogenase MLVKADMTKDIAQVMAEVGRKAKAAAAPLSIATSEQKNKALNAAGDAILEARADILEANRLDLANAEKNGMAASFVDRLTLNEARIDAIAEGIRAIAALPDPVGEVIAEWDRPNGLHIERVRTPLGVIGVIYESRPNVTADAGALCLKAGNAVILRGGSDSAHSSAAIHKALVKGLEAASLPADAIQIVPVTDRAAVGEMLKGLGGAIDVIVPRGGKSLVARVQSEARVPVFAHLEGICHLYIDKSADLEMARRIALDAKMRRTGICGAAETLLVDRAVASTHLAPILGDLAAGGCEIRGSAEVLALYPAAKPATEEDWSTEYLDAIISVALVDGISGAINHINRYSSHHTEAIVAEDAQTVARFFNEIDSAILLHNASTQFADGGEFGMGAEIGIATGKMHARGPVGVEQLTSFKYRVRGSGQVRG >NZ_CP019390.1|WP_070997100.1|1359505_1360180_+|nicotinate-nucleotide-adenylyltransferase MKFGFGLSALKEQYPGVDAHYLRMPHVEKGMTVGLFGGSFNPPHGGHALVAEIAIRRLKLDQLWWMVTPGNPLKDSRELAPLSERLRLSEEVAEDPRIKVTALEAAFHVRYTADTLALIRNANPDVYFVWVMGADNLASFHRWQRWREIAQNFPIAIIDRPGSTLSYLSSRMAQTFSDSRLDERYAPVLARRMPPAWTFIHGPRSSLSSTALRKVQSKKAPSKK 
>NZ_CP019390.1|WP_002972331.1|1360342_1360714_+|ribosome-silencing-factor MNETNLVSATFDAALASLENSKAESIIPIDIRGRSTIGDYMIVASGRSHRHVTAVADHLVQALREAGCKEMRVEGLESGDWVLIDTGDIIVHIFRPEVRDFYNLEKIWLDDDFGDERAPGLVH >NZ_CP019390.1|WP_076771174.1|1360732_1361257_+|23S-rRNA-(pseudouridine(1915)-N(3))-methyltransferase-RlmH MRVSVFAVGRMKSGPERELVERYLDRFAKAGPPLGLEFAGVSEIPESRGQTAQLRKAEEAQRIHEALDNAKSGGAKSGGTSSGGAALILLDERGKTLGSEAFAATVGHMRDDGKRQLIVAIGGPDGHDPALRSRADLVLALGELTWPHQIARILIAEQLYRAATILAGHPYHRS >NZ_CP019390.1|WP_076771175.1|1361374_1362274_+|hypothetical-protein MSRPTIITLSSIPPRFGLLKPTLLSLLSQRLKAEEVRLYIPHKYRRFPDWDGRLPEVPAGITIVRCDEDLGPATKVLPAARDLKGRDVDILFCDDDKIYDANWHQRFKAEAARKPGHCIIEDGETFPDIADAGRPADRLPRSRWKPKDFKYRMKRIASLFLYKPNRGTPGYVDRISGFAGVLVHPDWFDELFYDIPDIMWTVDDPWISGHLERRGIPIWMIGRNSRRAEGKASGVSALLNHVEDGQDRVDADLLVIDYFRQHYGIWQKTEMIDEKATLRTASMREMMRRRKAELGLIDP >NZ_CP019390.1|WP_004688741.1|1363724_1365053_+|S41-family-peptidase MIRKLSLLFAGALLGASAMVMVQGAPASTAFAAGKDSDVYKELALFGDIFERVRAQYVTPPDDKKLIESAINGMLTSLDPHSSYLNPEAAQDMRVQTKGEFGGLGIEVTMDNDLVKVIAPIDDTPASKAGVLSGDLITKIDGQEVRGLSLTDAVDKMRGEVGAPIELTILRKGADKPITLKINRAIIKVKAVRSRVENDVGYLRIISFTEQTSEDLKKAIKDIQEKVPADKLKGYVLDLRLNPGGLLDQAVAVSDAFLDKGEIVSTRGRDPQDVTRFDARKGDLTNGKPLIVLINGGSASASEIVAGALQDHRRATVLGTQSFGKGSVQTIIPLGENGSLRLTTALYYTPSGKSIQGKGITPDIKVDQPLPPELKGEDVVRGESELKGHIKGNAEDASGSGSSAYVPPDPKDDLQLNEALKLLRGEVANAAFPPDPKKGVLN >NZ_CP019390.1|WP_004688740.1|1365270_1365807_+|RNA-pyrophosphohydrolase MSKHKGPTGAMVDPESLPYRPCVGLMVLNKAGLVWAGRRIVIPGDEMDGATQLWQMPQGGIDKGEDPAQAALRELYEETGMTSVSLLEEASDWINYDLPPHLVGLALKGKYRGQTQKWFAYRFEGDESEIAINPPPGGHTAEFDCWEWKPMADLPNLIVPFKRKVYEQVVATFRHLAA |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP019390_1 | 1.1|310330|27|NZ_CP019390|PILER-CR | 310330-310356 | 27 | MF370964 | Pseudoalteromonas phage SL20, complete genome | 18053-18079 | 4 | 0.852 |
NZ_CP019390_1 | 1.2|310377|26|NZ_CP019390|PILER-CR | 310377-310402 | 26 | NZ_CP039340 | Ralstonia solanacearum strain UW386 plasmid pUW386, complete sequence | 534180-534205 | 4 | 0.846 |
NZ_CP019390_2 | 2.1|1044090|35|NZ_CP019390|CRISPRCasFinder | 1044090-1044124 | 35 | MW091529 | Bacteriophage sp. 103231, partial genome | 17779-17813 | 4 | 0.886 |
NZ_CP019390_1 | 1.1|310330|27|NZ_CP019390|PILER-CR | 310330-310356 | 27 | NC_006298 | Histophilus somni 129PT plasmid pHS129, complete sequence | 4433-4459 | 5 | 0.815 |
NZ_CP019390_1 | 1.2|310377|26|NZ_CP019390|PILER-CR | 310377-310402 | 26 | NZ_CP015737 | Shinella sp. HZN7 plasmid pShin-01, complete sequence | 448519-448544 | 5 | 0.808 |
NZ_CP019390_1 | 1.2|310377|26|NZ_CP019390|PILER-CR | 310377-310402 | 26 | NZ_CP029209 | Nitratireductor sp. OM-1 plasmid pOM-1, complete sequence | 427107-427132 | 5 | 0.808 |
NZ_CP019390_1 | 1.2|310377|26|NZ_CP019390|PILER-CR | 310377-310402 | 26 | CP000662 | Rhodobacter sphaeroides ATCC 17025 plasmid pRSPA01, complete sequence | 469166-469191 | 5 | 0.808 |
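Each `Spacer_Info` value in the table above packs several pipe-separated fields into one string, and the `Spacer_region` and `Spacer_length` columns appear to follow from it. The Python sketch below parses such a string and recomputes the region. The field meanings (spacer ID, start, length, contig, detection tool) are inferred from the table rather than documented by the source, and `SpacerInfo` and `parse_spacer_info` are hypothetical names used only for illustration.

```python
from typing import NamedTuple


class SpacerInfo(NamedTuple):
    """Fields of a Spacer_Info string; the meanings are inferred, not documented."""
    spacer_id: str
    start: int
    length: int
    contig: str
    tool: str

    @property
    def region(self) -> str:
        # 1-based inclusive interval, matching the Spacer_region column.
        return f"{self.start}-{self.start + self.length - 1}"


def parse_spacer_info(text: str) -> SpacerInfo:
    """Parse e.g. '1.1|310330|27|NZ_CP019390|PILER-CR' (a leading '>' is tolerated)."""
    spacer_id, start, length, contig, tool = text.strip().lstrip(">").split("|")
    return SpacerInfo(spacer_id, int(start), int(length), contig, tool)


info = parse_spacer_info("2.1|1044090|35|NZ_CP019390|CRISPRCasFinder")
print(info.region, info.length, info.tool)
```

A named tuple keeps the parsed record immutable and lets the region be derived on demand instead of being stored twice.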
1. spacer 1.1|310330|27|NZ_CP019390|PILER-CR matches MF370964 (Pseudoalteromonas phage SL20, complete genome), position: 18053-18079, mismatch: 4, identity: 0.852
CRISPR spacer  taaattccccttccaaagcaaatcaat
Protospacer    ttaattctccttccaaagcaaatccgt
Match          * *****.**************** .*
2. spacer 1.2|310377|26|NZ_CP019390|PILER-CR matches NZ_CP039340 (Ralstonia solanacearum strain UW386 plasmid pUW386, complete sequence), position: 534180-534205, mismatch: 4, identity: 0.846
CRISPR spacer  gattggcgcaccccatggggacgcgc
Protospacer    acttggcgcaccccttgggggcgcgc
Match          . ************ *****.*****
3. spacer 2.1|1044090|35|NZ_CP019390|CRISPRCasFinder matches MW091529 (Bacteriophage sp. 103231, partial genome), position: 17779-17813, mismatch: 4, identity: 0.886
CRISPR spacer  actctttgtttttacgcattatccaacgcaaaact
Protospacer    tctcttggtttttacgcattatccgacgcaaaacc
Match           ***** *****************.*********.
4. spacer 1.1|310330|27|NZ_CP019390|PILER-CR matches NC_006298 (Histophilus somni 129PT plasmid pHS129, complete sequence), position: 4433-4459, mismatch: 5, identity: 0.815
CRISPR spacer  taaattccccttccaaagcaaatcaat
Protospacer    gatattccccttccaaagcaaaactag
Match           * ******************* * *
5. spacer 1.2|310377|26|NZ_CP019390|PILER-CR matches NZ_CP015737 (Shinella sp. HZN7 plasmid pShin-01, complete sequence), position: 448519-448544, mismatch: 5, identity: 0.808
CRISPR spacer  gattggcgcaccccatggggacgcgc
Protospacer    cgatggcgcgccccatggcgacgcgc
Match           . ******.******** *******
6. spacer 1.2|310377|26|NZ_CP019390|PILER-CR matches NZ_CP029209 (Nitratireductor sp. OM-1 plasmid pOM-1, complete sequence), position: 427107-427132, mismatch: 5, identity: 0.808
CRISPR spacer  gattggcgcaccccatggggacgcgc
Protospacer    cgatggcgcgccccatggcgacgcgc
Match           . ******.******** *******
7. spacer 1.2|310377|26|NZ_CP019390|PILER-CR matches CP000662 (Rhodobacter sphaeroides ATCC 17025 plasmid pRSPA01, complete sequence), position: 469166-469191, mismatch: 5, identity: 0.808
CRISPR spacer  gattggcgcaccccatggggacgcgc
Protospacer    cgatggcgcgccccatggcgacgcgc
Match           . ******.******** *******
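The mismatch counts and identity values reported for the hits above are consistent with identity = (spacer length - mismatches) / spacer length, rounded to three decimals. The match line appears to use `*` for identical bases, `.` for transitions (a/g or c/t) and a blank for other substitutions; that reading is inferred from the listed alignments, not stated by the source. A minimal Python sketch under those assumptions, with hypothetical function names:

```python
from typing import Tuple

# Purine-purine and pyrimidine-pyrimidine substitutions (transitions).
TRANSITIONS = {frozenset("ag"), frozenset("ct")}


def match_line(spacer: str, protospacer: str) -> str:
    """Build the per-base match string: '*' identical, '.' transition, ' ' otherwise."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have equal length (ungapped comparison)")
    symbols = []
    for a, b in zip(spacer.lower(), protospacer.lower()):
        if a == b:
            symbols.append("*")
        elif frozenset((a, b)) in TRANSITIONS:
            symbols.append(".")
        else:
            symbols.append(" ")
    return "".join(symbols)


def mismatch_and_identity(spacer: str, protospacer: str) -> Tuple[int, float]:
    """Return (mismatch count, identity rounded to three decimals)."""
    line = match_line(spacer, protospacer)
    mismatches = len(line) - line.count("*")
    return mismatches, round((len(line) - mismatches) / len(line), 3)


spacer = "taaattccccttccaaagcaaatcaat"
protospacer = "ttaattctccttccaaagcaaatccgt"
print(match_line(spacer, protospacer))
print(mismatch_and_identity(spacer, protospacer))
```

For the first listed hit (spacer 1.1 against MF370964) this gives 4 mismatches and an identity of 0.852, matching the reported values. The comparison is ungapped, which fits the listed alignments but would not handle indels.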
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
86022 : 100601
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP019390|86022:100601|DBSCAN-SWA CTTACACCGAAAGACGCACGGCGCCGAAATCGGATGGGTTCACCGCCGTCCCGGTGGCGTGACCGATGAAGGTATTGCCGGATGCTACGGTCGACAGGTTTTTGTCGGCGGCAACATAATAAACCTTGGCACCCACGGCCCACGCCTGCGCAGAAACCTTGGGCAAGGCAAAGACGCCCTTGGTTGCGATTTCAACGTCCTCGCCAGCTTTTGCGGAGAACTGGGCAACGCCAAACAGATCGCCGACAACAACCAGGTCACCAGAATTTACGTCAGCCGGAGCCGGAACGGTGACGCTATCACCGGGCTGAATATAGTTTTTCATGGGAGTTTTTCCTTATGAGGGATCAAAACAGATAGGACGGCGGGTCAAGCTGGCCCGCCAGCGCTCGATCAGGCTCCAGCGTTTTTGTAGCCGAAGCGATAGTCTGTCGCGCCACAGCCGAAGTCGTGCTCGACCGACATGCTGAAACCCTGACGCCCGAACGGCTCATCCATGCGCACGCGCGGTGCCTCATAACCTTCCAGATAGCCCCAACGGTAGTTCGAGCCTGCAGCCGGGTCAGAGAACAGATGCCAGGCATTACCCTTGATCTGGGTGCTTTCGATCAGCTCAAGCCTGCCGGAGAAAATGTTGACATTCGCCACTGTCGCAGGCGTGATCGAGGTCAGAAGCTTTTCGGCTTCGGTAAGCTTATCCGGCCCGACAAGCATGATGCGTGGCGAATTGGAGAGCAACGGATTTCCATCCTTGCTTTCCTGCTTGCCCATAGCCGTGCGGCCAAGTCCGACACTATCGACAGTGATGGCAGAAGCCGTAGCGGCGAGGTTCTTATGGTCTGCGTGGAAGACAGACTTTCCGTCCGCGAGGTTCCCGTTGAAGGCACCCGCATAGAAAGTGACTTCCTCGAACAGCGCCACCGACGCACCATAGCTGGTCAGAAGCTCAGAGATCGCACCCAGATCGTCGTTGATCAGCATTTGGCGGCTGATGTTGAGCGCGATGGCATAGCTAAATGCCTGCACCTGTTCCTTGCCTTCACCGAACGAGCCGTATTTGATCTCGCCGTTTTCCAGCACCTTTTTCAGAAGCGGGAAATCACCGACCTTGACGGTGGTATCGGGACGGAAATCACGGAAGTTTCGCTTGCGTGCGAAACGTTTGAACGTCGGCTGGGCAAGCGCATAGCGCTGTTCGAGAGTCCGGTTGACAGCACCTTCGAAGATCGCCGGGAAATCAGAGGTCGAATGCGAAGCGCGCGTAAACACATCGTCGATGTCACGGGCATTCATCATGCGACGACCGCGATAGTTGACGCTCTCGGCAGCCAGATCGACAAGGCCCATGCCCATATATTGGCGAGCAGCTGCAGAAGGGCCAGCCTGCGGCGTCGGTGCACCGAGGCCGTATGCCAGCGCCTCAATCTGGGCAGATCGACGAGTGACCGCCTCATCATGAACCACATCGACACGCACACGGCTGTCGGTCGGGCTTTGGCGCTCATTCGAAACCATGTGATCGAGAAGCAAGCTGCGGAACTGCTCGACCGGTGTGCCGGCACGAACATGCTCGCGACCGAGATCCGGGAAACCGGAACGGGTAGCCAGATCTTCAATAATGCTGGACCGCTCACGCTCGGCGCGCACGCCCTCATCGACAGCGGCTCGCACCGACGGATCAACTGGCGGCGCGTTGCGCTGCTCGTTTTCCATCCGGGTTATATCCGCACGGACCTGGTCGGCCTCAGCCAGAATACCCGCGTGTTCCTGCTCGATTGCCCGAACCGCATCTTCATCCAGATCATCGGTAATTCGGGCGCGGGTAGCTTCCGCACGGTCGGTGATTTCTTTAAGCTTCGAACGCAGACCAAGCAAGGCAACGTTGGCACCGATCAGGTGCATTCCGTCTGGCTGTACGAAAGCACG
GTAATCGAGGGATGCTGCATGCGATGGATCTGCGGCGAACAACGCCATAGCCAGACCGACGCAGATAAATGCGGCGACGGTCGCGAAAATGTAAGCACCCTTTTTCATGGTGTGCGCTTCCTTCTATGTACCGGGAAAAACAAGCGCCGTCGCCCTGCAACCCCGGAGAATTGCAGGCGGCAAACTGAAAATCTGTGACAGCTGAATTAAGCGAAGCGGCGAATAGCTTCCGCCATTCTCATCCGCGCCGATGCCGCAAACGTCGAGACGGGCTTGCGGTCGATCAAAAGCGGGAACGTATCCGTGTGATTGCGAACCTGCGCGCCTGGATCTGCCGGAACCGTCACGAACGAAATTTCGTTCGGCGTCCAGCGTTCGACGAACACCTTTTCGACCTCGCCCTTTTTCTGGGCTTCCTCAATCCGGATCTTGTCGATGGAATAGCCGACAGACACATTCTTGATAATGTCATCGGAGACCAGACCGAACATGCGGTCGGCGCGTTCGTCGATCCCGGCTTTTGGAAAACGGATCGTGGCTTTACCCTCACCACCTTCAATCCATGCACGCTCCACAACGGCAACCTGCGAATATGTTGACCAAGTCGAATGGCTATCGAGGACTGGAGCGCCTGCATTCATGCGTGACAGATCAATCGCTTTGTCGCTCACAACAAGGATTTCATCGAACTGAACTATGGTGTCCCATCCCGCATATCGCTTACGGCGAACCTTTGCACCTGTTGTCCAGACAAGATTCAGAGTACGTGTCTCGGCATCGACACCGGAGGGCAACAGCCGCACCTCCTGCATCTGCATGGGCAGACTGTCCGGCATCTTGCGCAGGTTAAGTTTCGTCATCGTTTTCATCCTTCTCAGGAGGATCGTCAGTATCGACGGGCTGCTGGACCTGTCCGGCCTGCGATATGCGGCGCGGATCGCTGTCGAGGATAATGCCGCGCTTATCGAGCTTGGCATTGTCCGACGCTATTTCACCCAGCACGTCATCAATGTTTTCACCCGTCTCGGCGATTGCCGATGAGAGCGATCGGAACCCGGCGCGAACTTCCTTGATCCTGGCATTCACATCTTTCAGCGGGTCCGCCGAATAGAAACGCGGCGGCGACCATTCGACCGCCACTGTCGGCGTGCTGATTTTTCCTGCAAGATATGCTGCCTCGCAAAACCAATCCCACATCGGCTGCAACAGCATCGGTATGATAATCTGCCATTGCAGCATGGAGATCATGCGCCGGAATCCCTCCAGGCCGATCTTGCTCGACGAGTAATTAACCTTGTCGAGCCGACCCGTCATCAGTGCATAGGGAACACGCCAACCAGCCGAGATCGTATGCAGCATCGAAACTTTATAAGGATCGTAGCTGTCGGTTACGGCAGGCTGAGAGAATTTCATATCTCGACCGCCGACCGCATTATAGAACATACCGGGTGCGAACTTCTCAACACGCTGCCCGTGCACATTGTAAATGCCCGGCTTGGTCGCATCGCCATCATTGCCCGTCAGCGGCATTCCAAGCGTGTCGTCGATATCGCCGCCCGTCATCACACCTACAATGCATGATTCAAGACGCTTTCTGACAAGTTCGGACTGCTCGTACTCGGCAAGGTCAAAAGTATCGTCCATTGCTGGCGTTCCCCATGGAACGCCCCTGACCTGCGTACGCTGTTTCTCGAAGACATGCGCAATATCAGCTGCCGGAACTGGTTTCGAAACGACGGTCGATTGCGGATCGAAAAAGCTGTTTCCGGGGTGAGATCCAAACATCCAGTAGGCGCGCTTGCGGCCTATCGCATCGAACTCGATCCCCTGAATTGCCTTACCTCCGCCCGACAGCACCCCTTCCTTGGTACTGTCGAGCAGATCCGCTTCAACGACCTGCAACTGTAGTGGAACCGGCAAGCCGTCTTCTAGACGACGACGACGGCGGCGTACAATACCGTCACCACTTTCAAACATCTCTCGAACCGTCAGGGCAACTA
TGCCGTTGAAGTCGAGATCGCCATCAGCATCGCAAACCTTGCTCCATTCCTGGAACAGTTTGATGGCTTCCTTGTTTTTCGAGCGCGGAATAATGCCGTCGCCGATGGCATGGCTGACCAGCTCGGAAACAGCTTTCGCCGCATAGGGATTGTTGCGCACCAGATCCCGCATACGGTCGCGAAGCTTGCGACCGGCGCGCGCGATCTCGGTATCTGCCGAAGTCGATTTCGCGCGACGGCCTGATTTCAGGCGATTTGTTTCCGCACCGGAATAGCTGCGCTGGACAATCTCCATTGCAGCGCGGTGACGAACACGGCGCAAGCCTGCCTCCGGCGACACATAGCCGATGGCCTTATCAAGAATGTTCCCGATGCCCATTAGTCTAACGCCGCCAGAATGGTGCGAGGGCCACGCGATTTACGGGCCCGAAGATCCGCAAGCGCCTCCCGCATATCTTTCAAAGAATGATATTCCACCTCACGGCGGGTGCCGCCTGAGTGGAATATGACTTTCTTCGCGCCGGTCAGAATGGCTTCCTCTAGCGCTGCGATCTGATCGTCTATTGACGCCATGACTATAACCACTCCGAAGGTGCGATCTGTGGAAGGGATGAAACAGGTTTGATTTCCGGTGGCGTCGACAACTCGCCCTCACGGTGCGCCCAGTTCGCATTCACCATCTGGCGAGCGGCGAAGGCATAAACCGTGCAGTCGAGAGCTTCGTGTCGTCGCCCCGGCACCGGAACGAATTGCCGGACCGTTTGTCCGCGCGAATATTTGACTACCAGTTGCTCGCCCACGAGCTGCTCAAACCAGACATCAGGCAGAGATTTCGAAAAACGCATGGAACTCGCACGCGCCAGCCGACCGAAAATGTGGCTCTTGATGCCATCGACACCGACGATGAACAGCTTGCCGCCCTTCACTGTCGATTTGGATTTCTCAATCCAGGGCCGGTTGCCGCCCACGCCCTTGATCGCAAACACGCGCCTGCGGAACCGTGGAAAAGCATAACGATAGACAGTTTCCATCGTTTCGCCGTCCGAGCTATCGATGCAGGTAGCATCGACCTTGATCTTGCCGCCGAGCGGATGATCCCATTGCGTGCCGAGCGCGACATCCAATTCCGACCAGGTCGTGTGATCGTCATATCGACCCCAAATCACTTCATGTCCGAGGGCGTAAGGAATCCCCTCTTTGTCCCAACCTATGAAGGTGATTTCCAATCGATCATCCTGCACGTCAACGCCTGCGGTGATGATGAGAACCTGCACAGGAATGCCGGTCGTGGCCGTCGCATCGTCATCGTCTGGAGTTTCCGCAACCAGGCTGAAATCTTCGGCTCGGCTTGCAAGTTCGATATCATCCAGCTCGTCGGTGTTTTCCTTCCAGCCTTGCGCCAGGATCGTGTTGATGAACGTTTGAAGCTTCGACGGATCATTCTTCGCGCCGACAAACTCTTTCGCCAATCGCCCCCAAGATGCATTCGGCAGCAAGGATATCAGGGCGTTCATCCGAAACCCTGCATGGTCCTTGACTTCCGGCTTCAATGCCCGCCAGCGACCATTCGCAACCATGCCGGGCTTGTGGCGCTCATCGATAACCGAGCCGCACTCACGGCAAACATAATAGGCCTTCTCCGGCTCACCTTCCGGCCATTGAATGTCGGACCACTGGATCTCGTGGAAGTGGCCGCACTCAGGGCAAGGCACCTCATAAATTCGCTTGTCAGACTGTTCGTAGGATTGCAGCACATGGCTTGTTTCCTCATAAACCGGCGTCGACCCCATAACGATCTTGCGATCCGCAAACGACAGGGTGCGGCGCTCGGCGAGCAGGATCGGCGACCCTTCCTTGGTCGCCGACATGCCGTCCGCTTCGTCGATAAACAGGATGCGCACATTATGGCGGCGCAGGTTGCGCGGTGCTTTCGCAGCAATGACTTTCAGAAAGCCGCCTGGAAAGCGCCGGGAAAGCAGGGTGTTTCTGCCGCCCTCATC
CACATCGCCGGTCAACAATCCATTGAGCGCGGGCGAGGCATCAAAGATCGGCTCGACATCCGAAACCATATAGTCGCGGCAGTCGGCCTCAGTCGGCAGGAGCGAAAGGATCGGCGACGGATCGTTCGAACAGAAACTGGCCATCGCGCTTGTCAGAAGCGTCGTGAAGCCGACGCGCACCGGCTTCACCAGTGTGACACGCTCAATCGCGCTGTCGCCGATAGCGTCCGCAATCTCGCGCTGCGGTGGCCAGAGCCTGACCCGACCAGTGAGGGACGATACACCCTCTGGGAGATAAACTGTCTGCTCAATCCAATCCGAGAGCTTCAGTTTCGGTGGCGGCGTCAGCGCTTCCCACACCGCCCGCCGCAAGATCGCCAGAGCCGCTGCCATACTCCCAACCCTGTGCAAGAACACAATTTGTAAAGACCTGCATAAACGCCACGCGCCCTTTGCAGGCTAAGAACTCACGTGCCAAGCGGTCCCACGACGCATTCGGTAAAGTTGAAATAAGTTGATTTGCTCGAAAACCGGCATGATTCTTGATCTCTGGTCGTAGCGCCCGCCACCGGCCACCAGCAATCATCTCGATTTTGAAGCGCTCATTGATAGCATTCCCGCATTCGGGGCAGACGTAATGTGCCTTTTCCGGTTCACCTTCCGGCCACTGTATGTCACTCCACCCGATTTCATGAAAGTGACCACATTCAGGGCATGGAACCTCATAGATTCGTCTATCCGACAATTCGTACGAGCGCAGAACATTACTTGTTTTCGAAAACGTGGGTGTGCCGCTAATAACGATCTTGCGATTTTTGAAACTCAGAGTGCGCTTTTCTGCCAGTTCGATAGGGCATCCTTCGGCATTATTTGCGATCCCGTCAGCTTCATCAACGAAAAGAAATCGAACGGCATCACGTCCTACACGCGAAATCCTACTGCTATCGATAATATCAAGGTGGTTTTGTAGGTGGGAAAGCCTATAGCGACGGCCATTTGAGAAGCGTCCGGCCTCACCTCTCCAATGCACTGCTACCTGCATATTCTCATCAAGAAGACTTCTGAGTTCAAAGTCTACGAACTCGGAAACATCACAAGTGGGCAAGAGTATGAGAGCTGATACAGGACTGATGCTCCCCTGCACCAGGAGCGAACTTGTAAGCAGGGTGGTCAGTCCGAACCGGACAGGTTTGACGATTGTAACGCGCTCGATGGCATCATCACCTATCGCGTTTGCAACTTGCCTCAATGGCGGATGCATACGGATGGGACCCGGCGACGGGTACCACGACGGTAGATGGATTACCTTTTCAATCTGATCGGACAGGGATGTCATTTTCAGCTTTCTCCATCGCCGATTGATCGGCACTCTCCACGACACTTTCCTTCTCACCAAGCTCTGTAAGTGCCGTGCGAATTTCCTCGTCGATCAGGTCGACATCGAAAGTGGTTAGGTGAGGCAGCATCTGGCGGCATCTGGATGGAACCGCCATCATCACGTTTCGGACCCGTCGGGCTATCGAAACCCATTCGTTGCGAACTTCGGCAATCGGGACCAGTTCCTTGCGCATTGCAGCATTGCGCAATGCTGCCTGGTCGGCCTGCTCTCGCGCCAGTCTGGCGCGTTCTGTTGCGAGCGTATCGACATTGTCACCGCCTCGCCCCGCCGCGATCCCGCGCAGGTGCTCGCAATAAAGCTGGACGGACTTGCGCAGATCGAAGCGATTCCGACCTGTCTTCACGATGATTCCACGCTCGACGTAATCGGATATTGCCCGCTTCGAAACGCCCAGGATCTCTGCGAGATCCGCCGCGGTTATCTCAGCATCATTTTCAGGCTGCTGGTCATCCTCAACCGGCTCTTGCTCCGGCAAAAGCGGCTGCGCAGCCTGATCGACATGTTTCTTGTGCGATTTGGCAGCAAAGCTGGGGCTGACATTGAACTTTTCCGCTGCCTGCCGAACGGTATGGCCTTCCTCAATGAAGGCTATTACCTGCTG
GCGCAGCTCGTCTGAATACCCTTTGGCCATGTGATTCCGATTCCATTCAAAGGGCGGTGGAATCCCCCTGTAAAAATTCGCAGAGACCGAAATCCCGCAGTCGAGCGCACCCGCTCCGGCGATAGACCCGGAAAAGGACCCAATGGAGGGGGATGGGGTCAGGATCGGTCGGTCGGGTGCGCCCGCTCCTCAATCCGCTGCCGGTCCAGGCGTTCACCCGAACGATCTCCTAAGGCACTAGTTTTTTGAGAGCAGCGTCGACACGCTGTTGCAGCAATGGTGCTGCCGTTCTCTCAAATGCCGATCTGGTTGCGCCCTTGGTCATTTCCATCGGGATGAAAACGCCGGACCGGGTATAGGTGATCTTGGTTCCAGACTTGTTCAGCCTATGGAAAGCATGGCCGTAAAACTTTGGAACATTGACGCGACCGGGAAAGCGTCCGCCCTTGAGAAACGATCCAGCGAACAACTTCCGCTGCCCGAATGGCTTTGCGGAAACGCCTGCACGCATCTCGCGAGGCGAAAGGTATTTCAGACGAATATCGCCGCCTCGCGTCACCATTTCATAGGAAAGCTTGCCAGGTCGTGCAACGCCGGGATCCCCGACTGCCTTGACGATTGTCTTGCGCGCCAACCCCGTTTGCTTGGTCAGGTTTCGAATAACCTGTGTTTTTGCCCGGTTGCCGACCTGATTGACGATGCGAGGAAGAACTTTCGGAAAGCGCGAATTGAGAACCGCAATCCTCGATCCGAACAACGACAGGTGCTTGTCAGCCCACTTGGCCGTAATTGTTGCCATGGTAGGAAACCTCGATCACTTGCAGGCGCTGCTGCTGAAATAGTCATCCATCATGCAGGAATGCTTTGCGCATCCACTCAACGCGAAGATAACCAGCAACAGGGCAACCAGGACAAACAGTCGATTGTCGATGGTTCTGATCATTTCCCAACCTTTCAACAAAACCGTTTATCCTAAGGACGAACGAAAAGAGCGCCTTGCGGCGCTCATCATTCGGTAAAATCTGGACATAGCTTACGCACTGGCCCTGAATCGATTGCTGCCATTCTGGCAGTCAGGGCGGGGTCCGAGCGCGACCACCTCAGGAACTAGTCCCCACGTTTTACCGTGTATCGAGGTTCACTATTCATACCGGATCATCCCGTGAGCAGATTACGCTCACAAAGGTTCGAGAATTGCAATAGGCACAGTCATTGCCACTGGCCTGCCCATAATCGAAACCTCAATCACAACGAGACCGTTGCCCTTTGTTCCACCCGAAACCAGTTCTGCACGGCAACCGGCAAAAGGACCATCGGCAACACGTGCCCATTTCACCCCGATAAATTTCCGGTGAAAATGCTCGTAATCATATTTACCATCTTCGGCTTTCGCTTTGAAAAGATATACCTTTTCGGCACTGACCAGAAATGGCGCTTCATATCCACCAAGGATCGAAACGACGTGATCGAAACTCAACAGACCGGCAAGGCATTCGTTCAAAACTGCACAGCGTACCAGCACGTAACCATTCATGACGGGCTGCTGTTTCGCCGGAATGACTCGATGCTGCCGGCGAACAGTCGGCCCCATTTTCATGGGGACAAGCACTTCAATATTTTCCCTATCGAGAGCATCACGCACCGAAATCTCGCGCCCTGACTTCACCTGAAGGACCAACCACGGAGAATCATCACTCACGCGATTCGCTGCCGCCTCTCTCATTCGAGATACCCTGTGACGCTCGGCAAGCACCTTGTCGATAGCACAAGCCTGCTCGAATGTTGGCTGTCTGTTGATAGCATCTGCAATCTGCTTTGCGTCAATTGCCATCATTTTCACCCAATCCCCTCAGTGCGATCTCAAAACCGTTCAAACCATCAGGCCCACCAGCCGGGAAGTACGCCCACTCGACATTGCCGGGATCGGGAAACCATGGCCAACCTTGCTGACTGTGGAAATCGGCCCATGCTTTCCACTCATCGCCGCCAACACGAACCTG
TACGAGCAAATCCTTGATCGCCTGCAGGCGAGCAGGAACAAGCGCACCACGCCCACCGGCTGCACGCTCGAAAAGTTCGTTGACTGCCGGAAACCCTTGCTTGGCTTGCTTGTCGAGCAACAGATATTCCTCGGAATAGCGCCCGCTCTCCACCAAACCGCGTTCGATCTGGGTAAGGCCAACAACACGAGTGGGGCCGTTCAGCAAAAGCTCATAGACCCTCGCGCCCCACATCTTGCCCAACGGCGCTGCCTGTGCAGATCCGGTCCGCTCGACCACCGTCTTGGCTGGCAGCTTTTCCCAGCGCCGTTCGCGAAGGTAGACCGCATAGGAGCATACCAGCTTCCTGCCCGTCGCCTTTGCAGCTTCGACGTAGCGTGCAGCCTCGTCGACTGCCGCTTGACGTTCTTCTGGCGTCAGAGACAACCAGACGCGAAAAGCCTCAGGCTCGCTATCGGATATCGCTGTAGGCCAACCATGGAAACCACGCTTGAACGAACGCTCGACAGATTTCCGGCTTTCCCTTCCATCGTCATCGCTCTCGCGCGCTCTCTCACCTAATGGTTTATCTAATGGTTCTATTACGGTTTGGGTGACACCGTGACACCCGTCGGCGTCGTCAATGTCACCCGTCGGTGCTTCCGTTGTCACGGGTGACACCATGACACCCGTCGGCAGCAGGTTTTCCCCCTGTAACTTGTCCAATGCCCGCATATCAAAATCGTACCGCGTGGCCTGCCCCGGTCGCCCGCCGCCCTCAGCCACGACAATCAGCAACCCCTCATCGACGAAATCGCGCAACAGGCGCTGCACGGTTCGCTCGGATAGCTCAGTTTCAAGCGCAAGCCGTCCAACTGTAGGCCAGATCCCGCGCCCGTCATCGTCAGCGAAATCCGCCAGACGGATCGCGAGCATCTTGCGGCTGGGAGACCCTAATTGCGCCTTGAACAGGCGTGACATGACAGCAATGCTCATTGTGCTGCCTCCAGATACTCAGCGGCAGGCAAACGCCACCACTGGTCAGCCATAGCGGCTGCAATTCCCGTAAAAAACCGGCTGCGCTCGCGCCAACGGTCTGGACCTGGCGGCATACGGTGCACGCGCGCTTCCCTGCCCTCGACAATGTCGGTCGCCGTCAGCGGCGGAAGGTTGCGCAGCCAGAAGCAGGTACGCTTGACCTCGCCATGGCCAAATTGCCAAGGCTGGACGCTCTGGGCGGGCGGCGCATAGTTTGTGATGCGCGCCTTGGCATGCTTGTGCATGACAGGATTTTCAACGCAAACGCGGTGAATGGGTGCATTCCAGAACGTCGAAAACAGTTCAGCCGCTTCATCCAGCTCGCGCCAAATCTGTTCGACCGTCTTGCCGCGCGGAGGCACCGTCAGCCAGCGAACGCCGGAGTTGCAAAGCCGTGTGCATGGCGGGTGCGCGACAATCAACAGATCCCAGCCATCGTGCAGCAGATCCCGCGCATCGCCGACAATATGACGGTTTGTCCAGTCCTCAGCAGGGAGAAGATCGCACGACCATGCATCATGTCCCGCGTCGAGGAATGCATTTCGCACCGTGCCGGAAAACTCACACGCCACAAGAACGCGCAGAGGCTTTACCGTGTCGTAATGTATCGTCATTGCCCCACCCTCAGCCACGCCTCAAAATCGGCGCGCAAATCGCACCACCGATCCGCAGCGGCAGCATCGCTATTCAGTTCTTTTCTTGATTTGACCCGCAGCACGGCACGTAAGGCATCTGCGGCAGTATCAGTCGTCAAAGGCCCCTCAGCGCCGTGCCGTTCTTCGAGATAGATCCGGAAGGAGGCCTGATCGCATTTCATGGCAGCTTCCGCCGCGAAGTCCTTCGAAACACGCCGCTTTTGCTGAACCGGCGCGGCTTTACGTGATGCCGCAATTGCGCGATCGACAAGCCCCAGCAAAAACCCGACCATATCCGGCGCGCTGACCAGAAAATCGATTTCATCCGGCGTTGCACCGGGATGAAAA
TTCGCAATTTCGATCAACTCGCCATTACAAGTTTTAGCTTCGACAAACGTCCTGTTGTCGACACAGCAAAGCTGCCAGTGTGCGCCGTCGAGCGCTTTAAACCGGTCCCTGATTTTATGCAGATGTTCGGCATCGGAGGTCATATCGCCTCCAGCCATTCGATAATGTCGATGCCGCAGTTAAGCGCCAGCTGTCGTTCCGCATTCGCCCCCTTGGACGCCCGCCAACCTGGCAGCAACACTATCGTATCGGCCTCCAGACAGATGAAGTTGCAATACGACGCGAACGCCTGCCGGATCGGGAAAAGCTCCGGTGGCCCCTTATGGGGATATTCGGCAGGGTTATAGACGCGATGCCCCGCCAGGCGGAGGGCAGCGGCAGCCCTACGAAAAGCTGGATAGTTGAACTCTGGCAATCCCGTCATCGGCCCGGAAAGGTAAATAATGCGCGGGCTTCCCAATGTCGCGATAAAGCAATTGGAGCACATTATGCGGCCTCCACGGCGGCGGCAGGCGCTTCATAGCCCCATACATCCCAGCCGGGACGCTTGCGGCGCGCATTCAGTTCCAGTTTTGGGAGATCTGGATAAAATTTCTCGATCTGCTCGGCGAAATATTCCGGCTTGGCGGAATGCTCGCCCTTCTTTTCGACATAGATCGTTGGCGGCAGCATCTCCGGCAACGGACAGGCCACCTCGCCGCGCCTGCCGATGAGCAGCAGCTCATGCCGGTCCCTGCCCCAATAGCCGGTGCCGATATCGACCTTGTCCCAAACCCAGTGATGCACATAGGTGAAGCCGCAAGCCTCCATGACCCGCAGCGAATCCGGCAGCATCGGGTTTGTCGCCCACAGAAACAGCACAGCCGGATGACTGCCGCCGATCAGCTCGACCATCTGGGCGACAATTTCGTCGGTGGTCATCGTCGGATAGTGATTCTCGGCACTTTTTTCGCGCCCGGTCACTTCCGAATGAACCTGAAATTTCCAAGCCGGATCGGCATAATAGACCGGATAGAGCCGATCCAGCTTTGCCGGCGCCATGGCCTTGCCGCGTTCGGCAGTCAGCGCCATTTCGGTCAGCCTGACAGCATGGCGCACTTTCTGTTGCTGTGCGCGAATACTTTTGTTCTCGACCTTGATGCGTTTTTCTTCCGCAAGCGCGCCCTCGACCCACGCAATTTGGGCAACGTGCGACGGCAGAGCCTTGAGGCGATCCAGCGTCACACCATTATCGAACCGAGTGCCGCGCAGCTTGTCGAGAGCCGCCTTGCAGATCTTTTCGCCACGATCAGCATCCCGCCTGACCGCCCGCTCGGATTTCCCCGATAATTCCGCCGTTGCTGCAACAAAGCTTTTACGCTCTTGCCGGTCGATCAAGTGGCCAACCTGGCCACTTGATTTCCTGTCGCCGCCATGCGCGGTTTCAGGATATTTCTGCAAATACAGTTCCTTGCGCCGGAAAACGAACATGGCGCGATCAGCTGGAGTCAGCTCTGCACGGGCAAGGTTTTCGTCAATTTCCCACAGCTCGGCATCGAGCGCGCTTTCCCTTCGCACAAATGCCGGGATTTCCTCCCAACCAAGCGCCAGCGCAGCGGCCAGCCGGTGGGCGCCGGCCGACAGGATATAGCGTGGCACATCCTTCCGTCTGCTATCATTCACCCGAACCGTGATCGGCGTCCGCAAACCCAGATCAGCAAAGGATGGCTTCAACGCCTCGACTTTCGCCGGATCCACATCGCGCAGGCGCTTGCCGGTGTCGATATCGGATATCCGGATCATTTCAGATACTAGCGCGTCCATGTCACCGCCCCGCAAGAAACAGAAAGAAGATGAGAAAAGCCGCGACTGGTGACTCAGCAACCAACGCGGCACAAACAATCAGCGCGACCTCGCGCGGCTGGAAACGGGCGGTCATGGGCCGATCCGCCGAAGCGGCTCCCTGCCATGCAGCCTGCGAAAATCGTCGCGCAGTTCGACCAGCTTTTCCCAGCGCATGA
CCTGCCAGCGGCCATCGAGCCGCCTCACGGCAAAGCGATTTCCCTTGCGCCGCACCTCGACGCCGAACCCCTGCAACTCATATTGCGTGGCATCAAAACCGGACATGACGCCGGATTCAAAGCGACGAGGCCCGCCATGCGATTCCATCCATTGATCGACAAGCGAGGAAACAGCGGTCATTTTCCTGCCTCCAGACGCTTCAACACACCCTCAAGCGCCCGGATCGCCTCGCGCACTTCCTTGGTGATCCCCCGCTTCTCACCGGCGTCAATGCGCCCATCCTCCAGGGCCGTAACAATGGATTTGGAAACGTCCATCGCTTCGGACATGACGCGGTGCGCATCCATTTCGGTAAGTGTGGCATTGTCAGACGACCTGCCTGCCGACGACGATGCCGGAATCAATTCGTATCCGAGCAGCCCGGCTGCGGCCTTTATGATCGTCGGCGTTTGCGCGCGCCGGTCGACCTCGACCGCGACATCTATGGGCATGAAGCTGTCGCCATGTTCCTCGCCGAACGATGCGTACTTGGAAAGTGTCGACACGCCGACACGGGTAAAGGGCACGATGCAGGAAATGCCACCCGACAGCATGTATGCGCCATCGGTGGCAGACTTGAGAGAACGCTGTTCTTGGTCGGAAATTGTGCGCAC
Protein sequences of DBSCAN-SWA_1 >NZ_CP019390|86022:100601|98086_98434_-|WP_076770510.1|DBSCAN-SWA MCSNCFIATLGSPRIIYLSGPMTGLPEFNYPAFRRAAAALRLAGHRVYNPAEYPHKGPPELFPIRQAFASYCNFICLEADTIVLLPGWRASKGANAERQLALNCGIDIIEWLEAI >NZ_CP019390|86022:100601|86022_86346_-|WP_076770498.1|DBSCAN-SWA MKNYIQPGDSVTVPAPADVNSGDLVVVGDLFGVAQFSAKAGEDVEIATKGVFALPKVSAQAWAVGAKVYYVAADKNLSTVASGNTFIGHATGTAVNPSDFGAVRLSV >NZ_CP019390|86022:100601|90325_90520_-|WP_041545160.1|DBSCAN-SWA MASIDDQIAALEEAILTGAKKVIFHSGGTRREVEYHSLKDMREALADLRARKSRGPRTILAALD >NZ_CP019390|86022:100601|100124_100601_-|WP_076770513.1|DBSCAN-SWA MRTISDQEQRSLKSATDGAYMLSGGISCIVPFTRVGVSTLSKYASFGEEHGDSFMPIDVAVEVDRRAQTPTIIKAAAGLLGYELIPASSSAGRSSDNATLTEMDAHRVMSEAMDVSKSIVTALEDGRIDAGEKRGITKEVREAIRALEGVLKRLEAGK >NZ_CP019390|86022:100601|96918_97578_-|WP_076770508.1|DBSCAN-SWA MTIHYDTVKPLRVLVACEFSGTVRNAFLDAGHDAWSCDLLPAEDWTNRHIVGDARDLLHDGWDLLIVAHPPCTRLCNSGVRWLTVPPRGKTVEQIWRELDEAAELFSTFWNAPIHRVCVENPVMHKHAKARITNYAPPAQSVQPWQFGHGEVKRTCFWLRNLPPLTATDIVEGREARVHRMPPGPDRWRERSRFFTGIAAAMADQWWRLPAAEYLEAAQ >NZ_CP019390|86022:100601|86414_88055_-|WP_076770499.1|DBSCAN-SWA MKKGAYIFATVAAFICVGLAMALFAADPSHAASLDYRAFVQPDGMHLIGANVALLGLRSKLKEITDRAEATRARITDDLDEDAVRAIEQEHAGILAEADQVRADITRMENEQRNAPPVDPSVRAAVDEGVRAERERSSIIEDLATRSGFPDLGREHVRAGTPVEQFRSLLLDHMVSNERQSPTDSRVRVDVVHDEAVTRRSAQIEALAYGLGAPTPQAGPSAAARQYMGMGLVDLAAESVNYRGRRMMNARDIDDVFTRASHSTSDFPAIFEGAVNRTLEQRYALAQPTFKRFARKRNFRDFRPDTTVKVGDFPLLKKVLENGEIKYGSFGEGKEQVQAFSYAIALNISRQMLINDDLGAISELLTSYGASVALFEEVTFYAGAFNGNLADGKSVFHADHKNLAATASAITVDSVGLGRTAMGKQESKDGNPLLSNSPRIMLVGPDKLTEAEKLLTSITPATVANVNIFSGRLELIESTQIKGNAWHLFSDPAAGSNYRWGYLEGYEAPRVRMDEPFGRQGFSMSVEHDFGCGATDYRFGYKNAGA >NZ_CP019390|86022:100601|98433_99750_-|WP_076770511.1|DBSCAN-SWA 
MDALVSEMIRISDIDTGKRLRDVDPAKVEALKPSFADLGLRTPITVRVNDSRRKDVPRYILSAGAHRLAAALALGWEEIPAFVRRESALDAELWEIDENLARAELTPADRAMFVFRRKELYLQKYPETAHGGDRKSSGQVGHLIDRQERKSFVAATAELSGKSERAVRRDADRGEKICKAALDKLRGTRFDNGVTLDRLKALPSHVAQIAWVEGALAEEKRIKVENKSIRAQQQKVRHAVRLTEMALTAERGKAMAPAKLDRLYPVYYADPAWKFQVHSEVTGREKSAENHYPTMTTDEIVAQMVELIGGSHPAVLFLWATNPMLPDSLRVMEACGFTYVHHWVWDKVDIGTGYWGRDRHELLLIGRRGEVACPLPEMLPPTIYVEKKGEHSAKPEYFAEQIEKFYPDLPKLELNARRKRPGWDVWGYEAPAAAVEAA >NZ_CP019390|86022:100601|92254_93283_-|WP_076770503.1|terminase|DBSCAN-SWA MTSLSDQIEKVIHLPSWYPSPGPIRMHPPLRQVANAIGDDAIERVTIVKPVRFGLTTLLTSSLLVQGSISPVSALILLPTCDVSEFVDFELRSLLDENMQVAVHWRGEAGRFSNGRRYRLSHLQNHLDIIDSSRISRVGRDAVRFLFVDEADGIANNAEGCPIELAEKRTLSFKNRKIVISGTPTFSKTSNVLRSYELSDRRIYEVPCPECGHFHEIGWSDIQWPEGEPEKAHYVCPECGNAINERFKIEMIAGGRWRALRPEIKNHAGFRANQLISTLPNASWDRLAREFLACKGRVAFMQVFTNCVLAQGWEYGSGSGDLAAGGVGSADAATETEALGLD >NZ_CP019390|86022:100601|90522_92340_-|WP_076770502.1|terminase|DBSCAN-SWA MAAALAILRRAVWEALTPPPKLKLSDWIEQTVYLPEGVSSLTGRVRLWPPQREIADAIGDSAIERVTLVKPVRVGFTTLLTSAMASFCSNDPSPILSLLPTEADCRDYMVSDVEPIFDASPALNGLLTGDVDEGGRNTLLSRRFPGGFLKVIAAKAPRNLRRHNVRILFIDEADGMSATKEGSPILLAERRTLSFADRKIVMGSTPVYEETSHVLQSYEQSDKRIYEVPCPECGHFHEIQWSDIQWPEGEPEKAYYVCRECGSVIDERHKPGMVANGRWRALKPEVKDHAGFRMNALISLLPNASWGRLAKEFVGAKNDPSKLQTFINTILAQGWKENTDELDDIELASRAEDFSLVAETPDDDDATATTGIPVQVLIITAGVDVQDDRLEITFIGWDKEGIPYALGHEVIWGRYDDHTTWSELDVALGTQWDHPLGGKIKVDATCIDSSDGETMETVYRYAFPRFRRRVFAIKGVGGNRPWIEKSKSTVKGGKLFIVGVDGIKSHIFGRLARASSMRFSKSLPDVWFEQLVGEQLVVKYSRGQTVRQFVPVPGRRHEALDCTVYAFAARQMVNANWAHREGELSTPPEIKPVSSLPQIAPSEWL >NZ_CP019390|86022:100601|94176_94746_-|WP_076770505.1|DBSCAN-SWA MATITAKWADKHLSLFGSRIAVLNSRFPKVLPRIVNQVGNRAKTQVIRNLTKQTGLARKTIVKAVGDPGVARPGKLSYEMVTRGGDIRLKYLSPREMRAGVSAKPFGQRKLFAGSFLKGGRFPGRVNVPKFYGHAFHRLNKSGTKITYTRSGVFIPMEMTKGATRSAFERTAAPLLQQRVDAALKKLVP >NZ_CP019390|86022:100601|97574_98090_-|WP_083699576.1|DBSCAN-SWA 
MTSDAEHLHKIRDRFKALDGAHWQLCCVDNRTFVEAKTCNGELIEIANFHPGATPDEIDFLVSAPDMVGFLLGLVDRAIAASRKAAPVQQKRRVSKDFAAEAAMKCDQASFRIYLEERHGAEGPLTTDTAADALRAVLRVKSRKELNSDAAAADRWCDLRADFEAWLRVGQ >NZ_CP019390|86022:100601|95124_95781_-|WP_083699574.1|DBSCAN-SWA MMAIDAKQIADAINRQPTFEQACAIDKVLAERHRVSRMREAAANRVSDDSPWLVLQVKSGREISVRDALDRENIEVLVPMKMGPTVRRQHRVIPAKQQPVMNGYVLVRCAVLNECLAGLLSFDHVVSILGGYEAPFLVSAEKVYLFKAKAEDGKYDYEHFHRKFIGVKWARVADGPFAGCRAELVSGGTKGNGLVVIEVSIMGRPVAMTVPIAILEPL >NZ_CP019390|86022:100601|93257_93977_-|WP_076770504.1|terminase|DBSCAN-SWA MAKGYSDELRQQVIAFIEEGHTVRQAAEKFNVSPSFAAKSHKKHVDQAAQPLLPEQEPVEDDQQPENDAEITAADLAEILGVSKRAISDYVERGIIVKTGRNRFDLRKSVQLYCEHLRGIAAGRGGDNVDTLATERARLAREQADQAALRNAAMRKELVPIAEVRNEWVSIARRVRNVMMAVPSRCRQMLPHLTTFDVDLIDEEIRTALTELGEKESVVESADQSAMEKAENDIPVRSD >NZ_CP019390|86022:100601|95767_96922_-|WP_076770507.1|DBSCAN-SWA MSIAVMSRLFKAQLGSPSRKMLAIRLADFADDDGRGIWPTVGRLALETELSERTVQRLLRDFVDEGLLIVVAEGGGRPGQATRYDFDMRALDKLQGENLLPTGVMVSPVTTEAPTGDIDDADGCHGVTQTVIEPLDKPLGERARESDDDGRESRKSVERSFKRGFHGWPTAISDSEPEAFRVWLSLTPEERQAAVDEAARYVEAAKATGRKLVCSYAVYLRERRWEKLPAKTVVERTGSAQAAPLGKMWGARVYELLLNGPTRVVGLTQIERGLVESGRYSEEYLLLDKQAKQGFPAVNELFERAAGGRGALVPARLQAIKDLLVQVRVGGDEWKAWADFHSQQGWPWFPDPGNVEWAYFPAGGPDGLNGFEIALRGLGENDGN >NZ_CP019390|86022:100601|99861_100128_-|WP_076770512.1|DBSCAN-SWA MTAVSSLVDQWMESHGGPRRFESGVMSGFDATQYELQGFGVEVRRKGNRFAVRRLDGRWQVMRWEKLVELRDDFRRLHGREPLRRIGP >NZ_CP019390|86022:100601|88793_90326_-|WP_076770501.1|portal|DBSCAN-SWA MGIGNILDKAIGYVSPEAGLRRVRHRAAMEIVQRSYSGAETNRLKSGRRAKSTSADTEIARAGRKLRDRMRDLVRNNPYAAKAVSELVSHAIGDGIIPRSKNKEAIKLFQEWSKVCDADGDLDFNGIVALTVREMFESGDGIVRRRRRRLEDGLPVPLQLQVVEADLLDSTKEGVLSGGGKAIQGIEFDAIGRKRAYWMFGSHPGNSFFDPQSTVVSKPVPAADIAHVFEKQRTQVRGVPWGTPAMDDTFDLAEYEQSELVRKRLESCIVGVMTGGDIDDTLGMPLTGNDGDATKPGIYNVHGQRVEKFAPGMFYNAVGGRDMKFSQPAVTDSYDPYKVSMLHTISAGWRVPYALMTGRLDKVNYSSSKIGLEGFRRMISMLQWQIIIPMLLQPMWDWFCEAAYLAGKISTPTVAVEWSPPRFYSADPLKDVNARIKEVRAGFRSLSSAIAETGENIDDVLGEIASDNAKLDKRGIILDSDPRRISQAGQVQQPVDTDDPPEKDENDDET 
>NZ_CP019390|86022:100601|88153_88807_-|WP_076770500.1|DBSCAN-SWA MTKLNLRKMPDSLPMQMQEVRLLPSGVDAETRTLNLVWTTGAKVRRKRYAGWDTIVQFDEILVVSDKAIDLSRMNAGAPVLDSHSTWSTYSQVAVVERAWIEGGEGKATIRFPKAGIDERADRMFGLVSDDIIKNVSVGYSIDKIRIEEAQKKGEVEKVFVERWTPNEISFVTVPADPGAQVRNHTDTFPLLIDRKPVSTFAASARMRMAEAIRRFA |
17 | Stenotrophomonas_phage(28.57%) | portal,terminase | NA |
Homologous phage analysis in the prophage region
Colored proteins are bacterial proteins whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
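The keyword-based flagging described above can be reproduced from the protein headers on this page. This is a minimal sketch, not the database's own code: it assumes the header layout seen in the records here (`contig|region|start_end_strand|accession|[keyword]|tool`, with the keyword field present only for flagged proteins), and `parse_header` and `PHAGE_KEYWORDS` are hypothetical names.

```python
# Keywords listed in the prophage annotation note on this page.
PHAGE_KEYWORDS = {
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
}


def parse_header(header: str) -> dict:
    """Split a protein header into its fields (layout is an assumption)."""
    fields = header.lstrip(">").strip().split("|")
    contig, region, coords, accession = fields[:4]
    # Anything between the accession and the trailing tool name is treated
    # as an optional key-protein keyword.
    extra = fields[4:-1]
    keyword = extra[0] if extra and extra[0] in PHAGE_KEYWORDS else None
    start, end, strand = coords.split("_")
    return {
        "contig": contig,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "accession": accession,
        "keyword": keyword,
    }


flagged = parse_header(
    ">NZ_CP019390|86022:100601|92254_93283_-|WP_076770503.1|terminase|DBSCAN-SWA"
)
plain = parse_header(
    ">NZ_CP019390|86022:100601|98086_98434_-|WP_076770510.1|DBSCAN-SWA"
)
print(flagged["keyword"], plain["keyword"])
```

Collecting the non-empty `keyword` values over all proteins of a region would give a Key_proteins entry such as `portal,terminase`, assuming that column is built this way.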
DBSCAN-SWA_2 |
116259 : 128177
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP019390|116259:128177|DBSCAN-SWA TTCAGCTTTCCAGATTAAGCGATTGCGCAAGCTCCGCATCCACCGCGTCACCGGCATCGGTTTCGCGCGGCTTGAGACCGAACTGAACAAGCAGCGGTGCTGCGATGAAGATTGAAGAATAACTCGCCACAATGATACCGACACTGAGCGCCAGCGCAAACATGCGAATTTCCGAGCCGCCAAAGGCATAAAGCGGAACATGTGCCAGGAAGGTGACGAAAGAGGTCAACAGCGTTCGCGACAGGGTCTGGTTGATCGAGGCATCGATGATCGCAGGCAATGGCGCACTCTTGTAGCGGCGAAGGTTCTCGCGCACCCGGTCATAGATCACCACCGTATCGTTCAGCGAATAGCCGATGATCGTGAGGATTGCCGCCACGCTCCACAAATTGAATTCCATGCGGAAAACGATGAACATGCCTGAAAGAATGACGACATCATGCAGTGTGGAAAGCACCGCGCCCAGTGCGAGCTGCCAGCGGAAGCGGAACCAGACATAGATGAAGATGCCAATCAGCGACAGGATGACAGCGAGTACGCCTGCACGCGACAATTGTTCCGATACGGTCGGCCCGACCACATCGACCCGCTGGAAGGAATAATCCTGCTCGAATTCACCGCGAAGCTTGACCGCAACCGTCTGCTCCGCATCATCGCCCACTTCCTGACTGCCGATAATGACAAGAGCCGAGCGCGGCGATTTCGCAGGCAGAACGCGGGCGCTATCGATATTCAGCTCCGCGAGCCGCTCGTTGATATCTTCCAGATTGGCATCGCCATTGCGCGCCTGTAGTTCGACCATCGAACCGCCACGGAAATCGATGCCGTAGTTGAAGCCGATATTGACGAAAAGCGCGACCACGATTGCGCAGGCCAGCACCGAAATGCCGAGCGTTACGAACTGGAGGCGCATGAAGGGAATATGGGTGACGGTTGGCACCAGTTTCAGGCGACGCTTGGGCACTTCCTTCGGCTTGGCGGTGCGCACCCATTGGGCGATCAGCAGGCGCGTGAAAGTGAGCGTGGTGAAAAGCGTCGTGCCAATGCCGATTGCGACCGTGAGCGCAAAACCATGCACCGTACCCGACCCCAGAAGGAACAGAACGAGTGCGGCGATAAGCGTGGTAAGATTGGCATCCACAATGGTGGAAAGCGCGCGATAAAAGCCCGATTCCATCGCCTGCACGACGGAATAGCCCTTGCGGCGATCCTCGCGCACGCGCTCATAGATCAGAATATGCGCATCGACCGCAAGGCCGATGGTCAAGACGAGACCTGCAATGCTCGCAAGGCTTATCGAAGCGCCAATGAGCGACAGAACAGCCGTCAGGATGATGATATTGACCGCAAGCGCAACCAGGGCGATGACGCCCAGAATGCCGTAGGAAAGCACCATGAAGAGGCCGACCACCAGTGCCGCCAGAAGGGCCGCAAGCACTGCCGCACTCGCATAATCCTCACCGAGCGCGGAAGCGATTGTGCGCTCTTCAAGCACAGTCACCGCTTGTGGCAAGGCGCCAGAGCGCAGAACCACGGCCATATTATTGGCGGCCTGTAAATCGAATGCGCCTTCGATCTGCAATTCACTGGTGTCGAGCGGACCGGAAACCGTCGGCGCGGAAACCACCTGATTATCAACGACGATGGCGAAGGAATTCTCATTGCCCTGCGCCGTCAGGTCGGCCAGACGCCGGCGTCCGTTGTCGTCCAGTGTCAGCGTGATGACCGGCTGGCCATCGTCGGCGGAAATACTCGCCTTGGCATCCGTGATGTCATGCCCGGTGAGAATAGGTGTTTTTTTCAGAAGATAACCAACCGGGGGATCATCAAATGAATAGACGATTTCGCTATCCGCAGGCGGTGTGCCGCGAATGGCGTCATCCGGCGACATGGTGTCGTCCATGGCGCGGAAGGAAAGATTTCCG
CGAATGGTAAGAATATCTTTGAGAAGCTGCGCATCGTAAAGGCCCGGAACCTCCACGCGAATCTGATTGCGCCCCTCTCCCTCGACAACGGGATTGCCATAGCCAAGCTCTTCCAGGCGCTGCCGCATGATATTGGCAGTCGTCGCCAGATCGGTCTTGCCTGCATTCTGCACCTGAAGAATAAGCCGCGAGCCGCCCGAAAGGTCGAGCCCGAGCGATACCTGTTTCTTGGGCAGAAAATCCGGCAGATTTGCCAGCGTTTCACGCGAGAAAAAATTTGGAGATGCGATGATAAGGCTGACAAGGACCGCCAGCCATATCAGTGCAGATTTCCAGCGTGAAAAATAGAGCATGGAATGCAAGTCCGTTTCAGCTTGGATGAAGCATTGCCGGAGTTCACCCAAAGGGAAAATCTTCCGGCTATTTGCACGCAAACGCAATTTTGCGCGCGTGCTGCATAATGAACGGGCTCAGGTTATTTAAACCTTACCGGAGCCCGCCTGCTCTTTCAAAAAAAAGAGATTATTTGTTCTTGTTGTCGGCGACAGGTTCGCCCTTCACCCGCACATCCATCAAGGTGGCGCGCACAACGCGAATACGAACGCCATCGGCAATTTCGAGTTCGAGTTCGTTGTCATCGACAACCTTGAGAACCTTGCCGACGATGCCGCCACCCGTCACGACCGTATCGCCACGACGCACGGAGTTCAGCATTTCCTGACGCTTCTTCATCTGGGTCCGCTGCGGGCGGATGATGAGAAAGTACATGATGACGAAGATCAGGATGAACGGCAGGATGCTCATGAGCATATCTGGTCCAACAACGCTGCCGGATGCTTGAGCGAAAGCCGGTGTTACGAACATTAGGAACTCCTCTGAAGTCACGAAGACAAATTATTAACGGCATTACAATGGTTGGTGCTTCAATTGCAACCGGCGATGAACCCGCCTGCGCCAAGGTCGCACACTTTTCGCTTCCGCGTTAGCATGTTAAGTCGTTACCCCTACCATCAGGGGCCTGAAATGACGACCGAAGATATACTCAGCCAGAAACTGGACCGTTTGATAGCCGCGCTGGAACGCATTGCACCGCCGGAAGCAAAAACACCCGATCTCGACGCCGCAGACTGTTTCGTCTGGGCCCCGGAGCGGCTTGTTCTCGATCCCGTCAGCCGTGTGAACCGCGTGGATATCGCGTTGATCCGTGGCGTAGATTTGGTACGTGACCAGCTTGTGGACAATACCCGCCGCTTCGCAAAAGGTTTTCCCGCCAATAATGTTCTGCTTTGGGGCGCGCGCGGCATGGGCAAATCATCGCTCGTAAAGGCCGCACAGGCGAGCGTGAACGCCGAGTTTCCGGATATAACACCGCTGAAGCTTGTCGAAATTCATCGTGAGGATATCGACAGCCTGCCGGTTTTGATGAATCTCATCAAGGAAACGCGCCATCGTTTCATCCTGTTCTGCGACGACCTGTCCTTCGATCATGACGATACATCGTATAAATCGCTGAAAGCAGCACTTGAAGGCGGCGTGGAAGGACGGCCCGACAATGTGATTTTCTACGCAACATCGAATCGTCGCCACCTGATGCCGCGCGACATGATCGACAATGAACGCTCCACGGCGATCAACCCTTCAGAAGCCATCGAGGAAAAGGTTTCACTGTCGGACCGCTTCGGCCTGTGGCTCGGCTTTCATAAATGCAGCCAGGACGATTATCTTGCCATGATCGACGGCTATGCAGCGTATTTCAAACTCGACTATGCCCGCGACCAGATGCACAGGGAAGCCCTGGAATGGTCCACCACCCGCGGCAACCGTTCGGGCCGTGTTGCGTGGCAATATATTCAGGATCTGGCAGGCAGGCTTGGAAAAACCATGCAATGAAAAAGGCGGGCCCCTGCCCGCCTTTTTCGTTTTTTCCTGCTGAAACTGTTATGATTCCAGATATTTGGTCGGATTGACCGGCGCCGAGTTCTTGCGCACCTCGAAATGCA
GCTTCGGCGACTTGGCGTTGCCGCTCATGCCCGACTTGGCGATTTCCTCGCCACGGCGAACCTTCTGGCCGCGCTGCACCATGATCTGGCTGTTATGGCCATAGACGGTCACAAGGCCATTGTCGTGGCGGATCAGAACGGTCTGGCCAAACTCCTTCAAACCATCGCCCGCATAGATCACAACACCGTTTTCGGCGGCTTTGACCGGCGTGCCTTCCGGCACCATAATGTCGATACCGTCGCTGACCGAGGTGCCCTCACGCTGGCCGAAGCTTGCCAGAATGCGCCCACGAACCGGCCAGCGCATCTGCGAGATGCCGGTCGAGGATGGTGCTGCGGCCTGATCCTTTTCCGCGTCCTCGATTACCTTGTTGCTGGCCTGCGGCGGCGTGTAAGGCTTTACCTCCGCACCTCCATTGGCCGGCGCGCTGGCTGCTTTGGCCGGATTTGCCGGCTGCGGCGTGATTGCGGCCACCTGCGTCGGTGCACCTGCGGCAGCAGACGGAATAACGAGCGACTGCCCGACGCGAATGGCGCCACTGGTCAGGCCGTTAGCCGCCTTCAACTGGTCGACAGGGACATTGTGCTTCTTTGCGATGGAGAACAGCGAATCCCCGCTCTTCACGACGTAAGCACCCCCCGCCGATGGCGGGGTTGCGATAGCGCCGCCAGCCGCCGCCATATTGGTCGGCGAAGATTTCTTGCCGTTGACAGTAGGTGCTTGCGGAACGCCAGCGATATTGTTGTCCATCGGGCGCGGCGCGCCCTGCCCCGTCGCCGGAAGCTGGCCAAGAACCTTTTCCTTCGCGCCATTCACCGTATTATGCGTAACGGTGGCGGCACTTGCGACCCTGGTTTCAGCGGCATTGACGGTATTATTGACGCGGGATGCCGCCATATCCTGGGCTTGATCGACCTGATTGCGCATCTGCGTCGATGCCGCAGCAACAGGCGCACCCAGAGGTGCTGACGATACCGGCGGCAAGGAATTGCGCTGGACCATTCCGCTTGAAACCGGCGGGGCCGAAGCCACGGGGGCAGACGCATAAACATTGCCTGCGGGTTGCTGCGCGACGGGCTGACTGGACGTAGAACCAGTAAAAATGCCGTCTGTGAAGCGCATCGTATCAGCACTGCACCCGGCTCCAAAACCGGCAATCAGAACGATTGCGACATTCCGCAGGAGACGTTCGGACGTATGCTGCAAAATTGGTAAACGCATGTTCAACTCGTCCATACTCGAAACTCACTAAGACGATTAAAACGCGTTAATGTTACTCTCTGGTTAAGAATGCCCCGAAACTAAAAAAAGATTCACCAAATAAACACCATAAGCGCAAGAATGACGCGCAAACAGCCACTTCCGGCCCGCAAATGCGAAGCTGGCGAGGCTGCCATTGCTCAAAATCAATGCAACTGAAGCCGTTCCGACAAAAGCGCGAAGCGGCTTTTTTTGGAATCATCCTCAAACAAAATCTTGGAGCGGGATGATGGTTGGACTTAAATTCATCCCGTTTTAGAGCGCGTTTCGATCTGATTGAATCAGATCGGCGCTCTAATCCTTTGTTTTGACGCGCATCTTTTCCGAAAACCGTTTCACACTTTTCGAGATGCGCTCTAAAGAACGGAAGACGTGCCTTCGATGAACGGCTGATATCGAACCGGCATGAGATCTTCCTGTTCAAAACGGCTTCCAACCTTTGAAATCCGCGTCATGATCTGGCGTCCATCGCCAGGGCCGATCGGGGCTATCAGGACGCCATGGGTGGCGAGCAGTTCAACGAAATGGCGCGGCACCTCATCGCATGCGAGCCAGATGACAATACGGTCAAACGGCCCGCCCGGCATACCGTGGCGCCCGTCTGTATGTTTCACCATGATATTCTCGCGCTTCAGCGAAACGAATTGCTGGAGAGCGTGGTCGCAGAGTTTTCGATACCGTTCCACCGTCGTTACACGGCCGGACAGCAAGGACATAACGGCGGCGGTAAAGCCGGAGC
CGGTGCCGATTTCCAGAACCCGATGGCCGGGCTCAAGCTTCAGGGCGGAAATGACGCGCGCCTGATCGTCTATGCCTTCCATATATTCACCGCAATCAAGCGGCGCGGTTCGCGGGCTATAGGCAAGATGCGACCATGCCGCCGCCAGAAAGCTCTGGCGCGGTGTTGCTTCAATTGCCGCAAAAAGTTGCGGATCATCAATGCTGTGCCCGCGCATCCGCAGAACAAAGGATGCAAATCCCTCCCGGTCCGAAAGCCGCGGGCGTTCAGACGTTGCCTGCCTCATGCTTCCACTCCAAGCGCCGCGCCCAGTTCTGCACGAACCTTATGAGCGGTCAGATCAAGGTGGAGTGGCGTCACTGAAATGCAACCCGAACGGATGGCAGCAATATCGCTATCGTCGGCAACCGGAGCCTTGCCGCGACCGAAATGCAGCCAGAAATAAGGGAAACCACGTCCATCGCGGCGCTCGTCAAGGCGCGCATCATGGCTAAGCTTGCCTTGTGCCGTGACGCGCACGCCCTTCACTTCTTCCGGAGCGCAATTCGGGAAATTGAGGTTCAACAGCACGCCTTCCGGCCAGCCCGCCTCCATCAGCCTCCCGATAAGCTCAGGCGCATGGGCTTCCGCCGTTTCCCACGGCACGATCCGGCGATCGCCCGCATATTCATATTCCTGCGACAAAGCGATGGCTCGCACACCAAGCAATGTCCCCTCCATCGCACCGGCAACCGTGCCCGAATAGGTCACATCGTCGGCCATGTTCGCCCCGGAATTGACGCCGGAGAGGACGAGATCGGGCGCGCCCGGCAATACATGGCGCACCCCCATGATGACGCAATCGGTCGGGGTGCCGCGCAGGGCGAAATGACGGGCATCGATCTGGCGAAGGCGAAGCGGCTCCGACAGTGTCAGTGAGTGGGCAAGCCCACTCTGGTCCGTTTCAGGGGCCACCACCCACACATCGTCGGAGAGCTTGCGTGCAATCCGCTCCAGAACAGCGAGGCCTTCAGCGTGGATACCGTCATCGTTCGTCAGCAGAATACGCAATTTGTCACTCCTTCGCCGAAATGGATAAGACACTTAAGACACTAGAGCGGTTCCAGTTAAAATGTAGTCGTTGAAACTGCTCTCTCTTTGTTCTTTCGCATGTCCCCGAAACCGGTTCCCACTTTTGGGGACATGTTATAATTCCAGATCAAGCGGCTTTTTCGATCCGCGTGAGGCCGCCCATATATGGCTGTAATACTTCAGGAATATGAATACTGCCGTCTTCCTGCTGGTAATTTTCCATAACCGCAATCAGCGCGCGCCCGACAGCAACGCCCGACCCGTTGAGGGTGTGCACGAAGCGCGTGGATTTCTCGCCTTCCGGGCGATAGCGGGCATTCATGCGGCGGCCCTGGAAATCACCGCAGGTCGAACAGCTTGAAATTTCGCGATAGGTGTTCTGCCCCGGCAACCAGACCTCGATATCATAGGTCCGCTGTGCGCCAAAGCCCATGTCGCCCGTGCAAAGCACAACGGTACGGAACGGCAGGCCCAGCCGCTTCAGCACTTCTTCCGCGCAAGCCGTCATGCGCTCATGTTCGGCAACGGAGCTTTCCGCATCGGTGATCGATACCATCTCCACTTTCAGGAACTGATGCTGGCGCAACATGCCGCGCGTATCGCGCCCAGCCGACCCCGCTTCCGAACGAAAGCACGGGGTGAGCGCCGTGAAGCGCAGCGGCAGCCCCTTCATATCGACAATTTCTTCGGCAACCAGATTGGTAAGCGGCACCTCCGCCGTCGGGATCAGCCAGCGGCCATCCGTCGTGCGAAAAAGATCTTCCGAAAACTTCGGCAATTGCCCCGTGCCATAGACCGCTTCGTCGCGCACCATCAGCGGCGGCATGACTTCGGTATAACCGTGTTCTGTCGTGTGAAGATCGAGCATGAACTGGCCAAGCGCACGCTCAAGACGGGCGAGCGGGCCTTTCAGCACCGTAAAGCGCGCA
CCGGCAAGCTTGGCCGCGCGCTCGAAATCCATATATCCAAGCGCCTCGCCCAGCTCAAAATGCTCTTTCGGCTGAAAGGAGAAATTGTGCGGGGTGCCGATGCGGCGCAGCTCAACATTGTCGCTTTCGTCCTTGCCGAGCGGCACATCATCAAGCGGAATATTGGGAATGGTGGATAATGCGTCGCTCAGTTCCTTGCTGAGGCGGCGCTCGTCTTCTTCCGCATGAGCGAGAAAATCTTTCAGTTCGCCCACTTCGGCCTTCAGCTTTTCAGCCGTGCCCATGTCCTTTGCGGCCATGGCCTTGCCGATTTCCTTTGAGGCGGCATTGCGGCGCTCCTGTGCGGCCTGCACCTTGCCGACATGCTCGCGGCGCTTTTCATCCAGCGCAATCAGTTCGGACGAAAGCGGAGCAGCCCCACGCTTTGCGAGCGCCTTGTCGAGGGTTTCCGGGTTTTCGCGAATCCATTTGATGTCGAGCATGGAAAAAAGCCATTTCGTGAAATTGAACAGAAGCAAGGCTGAACGATCTTCAGCCCCAAAGATGCCTGACGCCAGATCAGGTGGAGGAAGCGTTGTTATCAGCGTCGGCAGATGCCTGCGCCTCATCCCGCTTCTTCTCGATCATGCGCGCCAGAAAGATCGAAATCTCGTAAAGAAGGATCGTCGGCAAGGCAAGACCGATCTGGCTCGCCGGGTCCGGCGGAGTCAGCACCGCAGCCGCGACGAAGGCGATGACGATTGCATATTTGCGCTTGTCCTTCAGCCCCGCCGAAGTCACCAGCCCCACACGCGCCATGAGGCTCGTCACCACCGGCAACTGGAAGACCAGGCCAAAAGCAAATATGAGCGTCATGATGAGGCTCAGATATTCCGACACTTTCGGCAGAAGCGAAATCTGGACCTCGCCGCTGCCGCCGGTCTGCTGCATGGCGAGGAAGAACCACATAACCATGGGCGTGAAAAAGAAATAGACGAGCGCGCCGCCAATCAGGAACAGAATGGGCGACGCGATCAGGAACGGCAGGAATGCGGTGCGTTCGTGCTTGTAGAGCCCCGGAGCCACGAATTTATAAATCTGCGCGGCGATGACCGGGAAAGCCAATACAATGCCGCCGAACATGGCCACCTTCACCTGCGTGAAGAAGAATTCCTGAGGTGCGGTATAGATCAATTCAGCCTTGGAGCGGTCCATGCCGGCCCAATCGATGGCCCATTGATACGGCACCACAAGCAGGTTGAAGAGCTGTTTTGCGAAAGCAAAGCAGAAAATGAATGCCACGAAAAAAGCCAGGATAGCCCAGATAAGGCGGCGGCGCAGTTCGATCAGGTGTTCAAGCAGAGGCGCTGCGCTCTGTTCGATTTCATCCTCGTCCCGGTTCACGCTTTGGTTCCTGTCTTCTTTGTGGTCTTTTTAACCGGCGTTGCGGTCTTGTCTGCCGTCGGCTTGGGGGTAGCTCCGGTTTTTTTGGCAGCAGTCTTTGCCGTCGTCGGTTTCGGCCCGGCTTTTGCAGCCGGACGCGGTGATGTTTTCTTAGGCTTGGCGGGTTCTTCGGGCGCGGTGACCGCTGGTGCGGGAACAGGCGTTCCGTCCGGCTCAACCGGCGTCGTAACCTCACCCACCTTGTTCTCGGTGACTGGCGACATTGACGTTGCGGACTGGAGACCAGACCGCAAATCCTCGCCAGCACTGCGAATCGGGTCAAAAACCTGTGTCAGCCTTGAGCGCGGATCAAGGCTTCTGGCTTCGTCGATGATGGTCTTGACGTCTTCAAGTTCCGCCTCTTTCAAGGCCTCGTTGAATTGATGGCGAAACTCGTTGGCGGTGGTGCGCATGCGTGCAGTCGCCTTGCCGAAGGCGCGAAGCATTTTCGGCAAATCCTTGGGACCGACCACCACAATCATGACAATTGCGATAATCAGCAGTTCAGACCAAGCGATATCGAACATAATTTGATACCTTGCGCTCTGCGCGCGCATCCCGTCTCTTGGCGAAAAGCC
GCACTGCCCACAAACCTGCCATGCGCGTTTTCAGCCCATGGCAGTTCCATCCCGGAAGGATCAGGACTTGGTGGTTTTCTTGACGTCCTTGACGGGTTCTTCCGCTTTGGCGTCGATCGTACGCGGATCTTCCTTGGCGTCTTCGTCAGCCATGCCCTGCTTGAAATTCTTGATACCCTTGGCGACATCGCCCATCAGCTCGGGGATCTTGCCGCGGCCGAACAGAAGAAGCACAACCGCCAGAACGATCAGCCAGTGCCAGATGGAAAAGCTACCCATATTATTCCTCTCAGTGCCGCCCGAGGCGCGGCATATGCCTGCTATCTCCGATACGATTTAAGCGCTTTCAACAAATCTTTCAAACAGAAGTGTGATGGTGAACGGCTTCAAACCGGATTAATTCGTCGCAGGCAGAAATTTTGTTCTATTCTCCCCTGGGTGCAAGCAAACCCAGCCCCTCCAGATCGATATCCTCCAGCGGGTCCTCCCCTTCGGTCAGCTCGTCCGGGTCGATATTGGGGATCGGTACGGCAAAACTGGAAGGAATGCGCGCCGAAAGAAGTCCTGCGCCGCGCAACTCCTCAAGACCGGGCAGATCGCGGATTTCCGGCAGGCCAAAATGGTCGAGAAAAGCGTCAGTGGTGCCATATGTTACCGGGCGCCCTGGCGTGCGCCTGCGCCCGCGCAGCTTGATCCAGCCGGTTTCCATCAAGACATCGAGCGTCCCCTTGGAGGTTTCCACGCCGCGAATATCCTCAAGTTCGGCGCGCGTCACCGGCTGGTGATAGGCAATGATGGCAAGCACCTCCATGGCCGCGCGCGAAAGCTTGCGCTGCTGAACAGTCTCGCGGTTCATGATGAAGGCGAGGTCTGGCGCGGTGCGAAACGCCCAGCCACTGCCCACCTTCACGAAATGCACGCCCCTGCCCTCGTAAACTTTCTGGAGATGGTTCAAAACCGGAGCAATATCCACATTGGCGGGAAGCCGCTCGGCAAGTGCGCGCTCGCCAACAGGCTGCGAAGACGCAAAAACAATCGCCTCCACAATGCGGGCAAGCTCGGCAAGCGTCACCGGCGAGGCAGCCCCCGCCTGCTCTTCTTCCCCAACGCCTTCCATATCCATCAAATCGCTGCGCTCTGCTTCAGGCATTTTCGTCCTCATCGAATTCATCGAGTTCGCGGGTCGCGCGCATATAGATCGGCTCGAACGGAGCGTTCTGGCGTACTTCAAGCTTGCCTTCGCGCACCAGCTCAAGGCTTGCGGCGAAAGAACTGGCAAGCGCCGACGCCCTCTCCTGCGGAGAAAGTGCATAATCGATCAAAAAACGGTCCAGCGAAACCCAGTCGCCCACCGCGCCCATCAGGCGCACGAGCGCCGTGCGTGCCTCCTTGAGGGACCAGACGCTGCGTTTTTCTATCTGCACCTGGGAAACCGCCTGGCGCTGGCGCTGCGACGCATAGGCGCTAAGCAGATCGTAAAGCGTTGCGGAAAACCGGCTGGCGCGATCCACCACCACCATTTCCGGCATGCCGCGCGGGAAAACATCGCGGCCAAGCCGATGACGGTTGACGAGTGCCGCCGCCGCATCGCGCATGGCTTCAAGCCGTTTCAACCGGAATTGCAGGGAGGCAACGAGTTCCTCGCCCGTAGCGCCATCGTCGCCCTGCTGCTTCGGGATCAGCAGCTTGGATTTCAGATAGGCAAGCCATGCCGCCATGACGAGATAATCGGCGGCAAGCTCCAGACGCAGCGCGCGCGCCTGCTCCACGAAACCGAGATATTGTTCGGCAAGCGCCAGCACGGAAATGCGCGCAAGATCGACGCGCTGGTTACGTGCCAGATGCAGGAGAAGGTCGAGCGGACCTTCAAAGCCCTGCACATCGATCAGCAGTGACGGCTCGCCTGCGCCTCGCCCGGCCTCATTTTGCCACAGGGTATCCATCGGCACGCGTGTGCCGTCTTTTCCACCTGTGTGTGCATCCGATGCTGCCAA
Protein sequences of DBSCAN-SWA_2 >NZ_CP019390|116259:128177|119206_120073_+|WP_076770535.1|DBSCAN-SWA MTTEDILSQKLDRLIAALERIAPPEAKTPDLDAADCFVWAPERLVLDPVSRVNRVDIALIRGVDLVRDQLVDNTRRFAKGFPANNVLLWGARGMGKSSLVKAAQASVNAEFPDITPLKLVEIHREDIDSLPVLMNLIKETRHRFILFCDDLSFDHDDTSYKSLKAALEGGVEGRPDNVIFYATSNRRHLMPRDMIDNERSTAINPSEAIEEKVSLSDRFGLWLGFHKCSQDDYLAMIDGYAAYFKLDYARDQMHREALEWSTTRGNRSGRVAWQYIQDLAGRLGKTMQ >NZ_CP019390|116259:128177|126607_127333_-|WP_076770537.1|DBSCAN-SWA MPEAERSDLMDMEGVGEEEQAGAASPVTLAELARIVEAIVFASSQPVGERALAERLPANVDIAPVLNHLQKVYEGRGVHFVKVGSGWAFRTAPDLAFIMNRETVQQRKLSRAAMEVLAIIAYHQPVTRAELEDIRGVETSKGTLDVLMETGWIKLRGRRRTPGRPVTYGTTDAFLDHFGLPEIRDLPGLEELRGAGLLSARIPSSFAVPIPNIDPDELTEGEDPLEDIDLEGLGLLAPRGE >NZ_CP019390|116259:128177|120121_121420_-|WP_076770536.1|DBSCAN-SWA MDELNMRLPILQHTSERLLRNVAIVLIAGFGAGCSADTMRFTDGIFTGSTSSQPVAQQPAGNVYASAPVASAPPVSSGMVQRNSLPPVSSAPLGAPVAAASTQMRNQVDQAQDMAASRVNNTVNAAETRVASAATVTHNTVNGAKEKVLGQLPATGQGAPRPMDNNIAGVPQAPTVNGKKSSPTNMAAAGGAIATPPSAGGAYVVKSGDSLFSIAKKHNVPVDQLKAANGLTSGAIRVGQSLVIPSAAAGAPTQVAAITPQPANPAKAASAPANGGAEVKPYTPPQASNKVIEDAEKDQAAAPSSTGISQMRWPVRGRILASFGQREGTSVSDGIDIMVPEGTPVKAAENGVVIYAGDGLKEFGQTVLIRHDNGLVTVYGHNSQIMVQRGQKVRRGEEIAKSGMSGNAKSPKLHFEVRKNSAPVNPTKYLES >NZ_CP019390|116259:128177|125562_126132_-|WP_076771752.1|DBSCAN-SWA MFDIAWSELLIIAIVMIVVVGPKDLPKMLRAFGKATARMRTTANEFRHQFNEALKEAELEDVKTIIDEARSLDPRSRLTQVFDPIRSAGEDLRSGLQSATSMSPVTENKVGEVTTPVEPDGTPVPAPAVTAPEEPAKPKKTSPRPAAKAGPKPTTAKTAAKKTGATPKPTADKTATPVKKTTKKTGTKA >NZ_CP019390|116259:128177|118705_119047_-|WP_002964021.1|DBSCAN-SWA MFVTPAFAQASGSVVGPDMLMSILPFILIFVIMYFLIIRPQRTQMKKRQEMLNSVRRGDTVVTGGGIVGKVLKVVDDNELELEIADGVRIRVVRATLMDVRVKGEPVADNKNK >NZ_CP019390|116259:128177|124741_125566_-|WP_002964014.1|DBSCAN-SWA MNRDEDEIEQSAAPLLEHLIELRRRLIWAILAFFVAFIFCFAFAKQLFNLLVVPYQWAIDWAGMDRSKAELIYTAPQEFFFTQVKVAMFGGIVLAFPVIAAQIYKFVAPGLYKHERTAFLPFLIASPILFLIGGALVYFFFTPMVMWFFLAMQQTGGSGEVQISLLPKVSEYLSLIMTLIFAFGLVFQLPVVTSLMARVGLVTSAGLKDKRKYAIVIAFVAAAVLTPPDPASQIGLALPTILLYEISIFLARMIEKKRDEAQASADADNNASST 
>NZ_CP019390|116259:128177|126243_126462_-|WP_002964012.1|DBSCAN-SWA MGSFSIWHWLIVLAVVLLLFGRGKIPELMGDVAKGIKNFKQGMADEDAKEDPRTIDAKAEEPVKDVKKTTKS >NZ_CP019390|116259:128177|116259_118587_-|WP_076770534.1|DBSCAN-SWA MGELRQCFIQAETDLHSMLYFSRWKSALIWLAVLVSLIIASPNFFSRETLANLPDFLPKKQVSLGLDLSGGSRLILQVQNAGKTDLATTANIMRQRLEELGYGNPVVEGEGRNQIRVEVPGLYDAQLLKDILTIRGNLSFRAMDDTMSPDDAIRGTPPADSEIVYSFDDPPVGYLLKKTPILTGHDITDAKASISADDGQPVITLTLDDNGRRRLADLTAQGNENSFAIVVDNQVVSAPTVSGPLDTSELQIEGAFDLQAANNMAVVLRSGALPQAVTVLEERTIASALGEDYASAAVLAALLAALVVGLFMVLSYGILGVIALVALAVNIIILTAVLSLIGASISLASIAGLVLTIGLAVDAHILIYERVREDRRKGYSVVQAMESGFYRALSTIVDANLTTLIAALVLFLLGSGTVHGFALTVAIGIGTTLFTTLTFTRLLIAQWVRTAKPKEVPKRRLKLVPTVTHIPFMRLQFVTLGISVLACAIVVALFVNIGFNYGIDFRGGSMVELQARNGDANLEDINERLAELNIDSARVLPAKSPRSALVIIGSQEVGDDAEQTVAVKLRGEFEQDYSFQRVDVVGPTVSEQLSRAGVLAVILSLIGIFIYVWFRFRWQLALGAVLSTLHDVVILSGMFIVFRMEFNLWSVAAILTIIGYSLNDTVVIYDRVRENLRRYKSAPLPAIIDASINQTLSRTLLTSFVTFLAHVPLYAFGGSEIRMFALALSVGIIVASYSSIFIAAPLLVQFGLKPRETDAGDAVDAELAQSLNLES >NZ_CP019390|116259:128177|123381_124665_-|WP_076771750.1|tRNA|DBSCAN-SWA MLDIKWIRENPETLDKALAKRGAAPLSSELIALDEKRREHVGKVQAAQERRNAASKEIGKAMAAKDMGTAEKLKAEVGELKDFLAHAEEDERRLSKELSDALSTIPNIPLDDVPLGKDESDNVELRRIGTPHNFSFQPKEHFELGEALGYMDFERAAKLAGARFTVLKGPLARLERALGQFMLDLHTTEHGYTEVMPPLMVRDEAVYGTGQLPKFSEDLFRTTDGRWLIPTAEVPLTNLVAEEIVDMKGLPLRFTALTPCFRSEAGSAGRDTRGMLRQHQFLKVEMVSITDAESSVAEHERMTACAEEVLKRLGLPFRTVVLCTGDMGFGAQRTYDIEVWLPGQNTYREISSCSTCGDFQGRRMNARYRPEGEKSTRFVHTLNGSGVAVGRALIAVMENYQQEDGSIHIPEVLQPYMGGLTRIEKAA >NZ_CP019390|116259:128177|121800_122469_-|WP_004683704.1|DBSCAN-SWA MRQATSERPRLSDREGFASFVLRMRGHSIDDPQLFAAIEATPRQSFLAAAWSHLAYSPRTAPLDCGEYMEGIDDQARVISALKLEPGHRVLEIGTGSGFTAAVMSLLSGRVTTVERYRKLCDHALQQFVSLKRENIMVKHTDGRHGMPGGPFDRIVIWLACDEVPRHFVELLATHGVLIAPIGPGDGRQIMTRISKVGSRFEQEDLMPVRYQPFIEGTSSVL >NZ_CP019390|116259:128177|122465_123233_-|WP_004683703.1|DBSCAN-SWA 
MRILLTNDDGIHAEGLAVLERIARKLSDDVWVVAPETDQSGLAHSLTLSEPLRLRQIDARHFALRGTPTDCVIMGVRHVLPGAPDLVLSGVNSGANMADDVTYSGTVAGAMEGTLLGVRAIALSQEYEYAGDRRIVPWETAEAHAPELIGRLMEAGWPEGVLLNLNFPNCAPEEVKGVRVTAQGKLSHDARLDERRDGRGFPYFWLHFGRGKAPVADDSDIAAIRSGCISVTPLHLDLTAHKVRAELGAALGVEA >NZ_CP019390|116259:128177|127325_128177_-|WP_076770538.1|DBSCAN-SWA MAASDAHTGGKDGTRVPMDTLWQNEAGRGAGEPSLLIDVQGFEGPLDLLLHLARNQRVDLARISVLALAEQYLGFVEQARALRLELAADYLVMAAWLAYLKSKLLIPKQQGDDGATGEELVASLQFRLKRLEAMRDAAAALVNRHRLGRDVFPRGMPEMVVVDRASRFSATLYDLLSAYASQRQRQAVSQVQIEKRSVWSLKEARTALVRLMGAVGDWVSLDRFLIDYALSPQERASALASSFAASLELVREGKLEVRQNAPFEPIYMRATRELDEFDEDENA |
12 | uncultured_Mediterranean_phage(90.0%) | tRNA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_3 |
390079 : 436805
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP019390|390079:436805|DBSCAN-SWA AGTGATCATCGTTGGGGAAACGAAGATCGATACGGGCGACAAATATGCGCCGATTATCGATTACAATCTGAATTATATATCCGGCAAAAATCCAAAGCACCGGCTCGTCGAGCACTATTCGGTGGCGGAGCTGACGGCAAAATATATCAATATCCTCTGGGATGATGGATCGCATAAATATAATGTAAGGTCGTTCCTCGGCGAGATTGACGAGATTCTGAAAGGCGCACGTTTTTCAGGTTTTGATCAGGAAATGCTCGATTCCATCATCGGCACGCTTCGCGAACGCGGCAACAGCAATGCAACCATCAATAGAAAGATGGCTGCGCTGAGCAAGCTGCTGCGAAAGGCGCACAAGATGGGGGATATCTTCAATCTTCCGGAGTTTATCCGGCAGAAAGAGCGCGTGGGGCGCATTCGATTCCTGGAACATAAGGAAGAGAAGCGATTGTTCGCCGCAATAAAGTCGCGCTGCGAAGACAGCTATCGCCTATCGGTCTTTCTCGTGGATACGGGTTGCCGTCTTGGCGAAGCAATCGGCCTCACATGGAATGATATTCAGGAACAACGGGTTACGTTCTGGGTCACCAAATCCAATCGCAGCCGCACCGTTCCCCTCACCCGGCGTGCACGAAAAGCATCCCATATTCCGCATGAGAGGCTAAAAGGCCCCTTCTCCATGCTCAATCAGGTTCGGTTTCGCCAAATCTGGAACGAAGCGAAGGCCGAAGTTGGTTTGGGAGCGGATGACCAAATCGTTCCGCACATTCTACGCCATACATGTGCGTCGCGACTGGTGCGTGGGGGTATCGACATCCGCCGGGTTCAGATGTGGCTTGGTCACCAGACCTTGCAGATGACGATGCGCTACGCGCATCTGGCAACACATGATCTCGATTCCTGCGTTAAGGTGCTCGAAATTCATTAGAGCATTTTCGAGCCAAAAGTGCGAAGCGGTTTTGCGTCGGATAATGCGACGAAACAAAGAAATGAAACGGTTCCAACGCATCGAAGTGGGATCAGAAATCAATCCAGTGAACTGATTTGCCCGCGTAGGCATTTCACATTTTTGGCTCGAAAATGCTCAAGGCCACAACAAAAAGCGCCAAGCTGTGGCGACTTTTGAAACGGGCTTTGAGCCAACAGGCCTTGCGGATAATGAGGGACTTCCCGCCTGCAGGACGTTATCTGGAATAGCCCCGTGGCAGAGAAAAACGGCAAGGCAACTTCCCCTGAACTCGCGCAGAGCACCAAAGGATATTCTCATTATCCTTATAGAACCGGCTGTTGCAGGCTGATGTGCGTTAAAACGATTCCAGCAAGATTGACACGTTGGAGCCGGTTTGCATTTTGGGTTCGAAAAGCCTTTAATGCCGCCGAATAGCTGGCCGGGTTGGCCGGGCTTTCAATGCGCCTTTACTGTCAGATTGAAACAGACCGGGATATCGCCCATGGCACAACCTCGCCTCACCCCCCTTGTCGAAAGCCTGCCCTCAACTGTCCCCTTTATCGGACCAGAAACATTAGAATTACAGCGCGGTAAGCCCTTCAAGGCGCGGATCGGCGCCAATGAAAGCAGCTTTGGTCCAGCGCCTTCTGTCATCGAGGCCATGCGGAATGAAGCCACCGAAGTTTGGAAATATGGCGATCCGGAAAATTATGCGTTGCGCCACGCCATTGCAGCCCATCACGGCCTGAAGGCTGAGCACATCATGCCCGGCGCTGGCGTCGATGCGTTGCTTGGCCTCATCGTCCGTCAATATGTGCAGCAGGGCGACAAGGTCATCAACTCGCTCGGCGGTTATCCGACTTTCAATTATCATGTCGCAGGCTATGGCGGGCAGCTCGTCACCGTTCCTTATCGCGATGACAAACCGGACCTCGACGCCCTTATCGATGCGGCTGCAAGGGAAAAACCAGCGCTC
CTCTATATCGCCAATCCCGACAACCCCATGGGAACATGGCATGAGGGGGCCGATATCCAGTCCTTCATTGAGCGCCTTCCGGAAACAACATTATTAATTCTCGACGAGGCCTATTGCGAAACGGCTCCGGCATCGGCATTTCCACCTTTCGAGACGGATCGTCCGAATGTTCTTCGGATGCGTACATTCTCCAAGGCCTATGGGCTTGCAGGCATCCGCTGCGGCTATGCGGTGGGGAACTCGGCGGCAATAAAAACCTTCGACAAAGTCCGCGATCATTTCGCAGTCAGCCGCATGGCGCAGGCTGCCGCCATCGCAGCTTTGAAAGATCAGGCTTATTTGCACGAGGTCGTGGGCAAGATTTGCGCCGGGCGTGATCGTATTGCGGCCATTGCTGAAGCCAATGGCCTCCAGGCCGTTGCATCAGCGACCAATTTTGTTGCAATCGATTGCGGCAGGGGGAAGGATTTTGCGCAGGCAGTACTCAACGGATTGATTTCGCGTGATATTTTTGTGCGCAAGCCGGGCACGCCCGTGCTGGACCGCTGCATCCGTGTGAGTGTGGGCGTGAAAGAACAACTTGACCAATTTGAAGCGGCATTTCCCGAAGCACTTGAAGAAGCGCGCAAGATTTACGCCGCCAACGCAGAAAACACCTGATCGTGGACAATATTAAAATGCCTTCAATCAACGGCCAGCCACGCAGCGTCCTGTTCGGCACACTCGCCGGACTGTGTGGTGCATTGGGCATAGCATCCTACGCCGGAGCTGCCCATATGGGCGAAAGCCATCTTGGCACGATCGCCCCTCTCCTTCTGGCCCACGCGCCGGCCCTGCTTTTCCTGTCACTCATCAGCCCCGTCAGCCGTGCGGCACGGATCGGCGGCGCGATTCTGGTCGTCGGGCTTGCCCTGTTTTGCGGTGATCTTTTCATGCGCGACATGACCGGAGATCGCCTGTTTCCTTTCGCCGCGCCCACGGGCGGAAGTCTCATGATCCTGGGTTGGCTTTGTCTCGGCTGTAGTGGCTGGTCTTCGGCCAATGCCAAATGAAAAAGGGGCCAGAACGGCCCCTTTCTCATTCACATGAAGTGCTTTTATGAATTGCCCTTCAAACGTGCATTGCAAAGGCTGTAATAGCCGCCGCCCTTTTGAACCCACTTCAGGCCGCCCAAAGCATTGGCATCCTTGAGCGCATGATACTGGTCGAGGCAGGTGTGCATCCGTGCCTTGCCTGCAGATTCTTTTGAATATTTCGCGGAAACCGCCGTCGGGAACTTCACGCCCTTGGGGGCCGCGACGCTTGGAGCAACAGGTTCAGAATCGCCATCCGTGCTCAACGCCACCGGATCCGCGCCGGGGCCGCATTCTGCTTTACGGAAATCGTTCCACTTCATGCCATTGTCGGTACCGGCGTCTTTCGCCGCCTGATATTTCGCGCTGCACTGCTTCATGGTGAGGCTCTTGGCGCCATCATTACTGGCAGGCGCGGCGGCCTTCGCGGCTTTCTTGGTCGCAGGCGCAGCAGCAGGTGCTGCCGCCGGAGCTGACGCAGCATCATCACCACACTGAGCCTTGCGGAAATCGTTCCATTTCATGTTGCCAAGTGTGCCAGCATCTTTCGCAGCCTGATATTTCGTGCTGCATTCTTTCATCGTCAGCGCACTTGCCGGCGAGGAAAAGGCAATAGCGGCAGCACCCATGATGAAAGCGGTACCGGCCATAGCTGTGATGGTTCGAATAGACATCTCGTTACTCCCTGTCGAACGCGGCTTCTCGATCGCTCGGCATCTTCCCAACCCCGCGACCTTTCTAATATATAGAAACCGAAGATAACATGTTTGCTCTCTACCGGTCCGGTTATCTGCCAAGTTTGCGCCCGGCAAAGTTGACTGTCAATCATCTGCCCAATATTACTTTGGGGTCCAATACAAATTTTCTAACGGGCATTCTTTGCGCGTTATGGCTGCGTGATTGCCATGAAGTAAAGATAGAGTAATT
CTCTATCCGTGCACGATCTTTATTCGGTAAAAGTTAACATTCATGGTCGGCCCGGCTGCACCAGCGAATCATGACCATCGCCGGCAGCAGCGTCGGGAAACCGAACCTTGCAACAAAACCGGGCGAGATAACGCACTGCGCGTCATCTGAAACAGCTTCGTCCCGCAAGCGTTTTTTTAGCCCACATGTCGCACGAAAGAAATTAACAGGAAGAACCTGAAAAATGGCGACAAACCACCAATGCGGGCATGAAAATAACAATAATGCTAGCCGTGCGCGCAGCACTTCTGCCTGTTATATTACCAGTGTTTCCCAAGACCTGACGCTGAGCCTGATGATGTAAAGCGTTATGAACCAACATGAAAGCCGGGTTTTCCCGGCTTTTGTGCGTCGTGCCGGTGGCTGCAGAGCGGATCCGGTGAAGACGGCATCTTTGGAACCGCTCTATTTCGTTGTTTTTACGCATTATCCAACGCAAAGCCGCTTCGCATTTTTGCTGGAAATGCTCTGGAGTTTGAAATTGGCCGATATCGCAGATAGCGCCCCTTCGCGTGCCGATGCATCCAGCCCGATCAGTGTCGATCGGCACCGCCTCTATGAAGATGCGATTGCGATGCTGATTGGCACATCCTTCATCGCCCTTGGCATAACGCTTTACAGCCATGCCATGCTGATGACCGGCAGCACAGCCGGCATCGCCCTGCTGATCCATTATGCGACCGGCACGGGATTCGGCCTTCTTTATTTCCTGATAAACCTGCCCTTTTATTATTTCGCGGTGCGCCGCATGGGCTGGGCCTTCACGATCAGGACTTTCGCAGCCGTGGCCCTATTGTCCGGGTTCACCCGCCTCATGCCGCTGAACGTAGATTTTACAAGCATCAACCCACTTTTCGCGGCCCTGATGGGCGGAACGCTGATGGGCATGGGCGTACTGGCGCTTTTCCGCCATCGTTCCGGCGTCGGCGGCGTCAATATTCTGGCGCTCTATCTTCAGGATGCCTATGGAATTCGCGCAGGATGGTTCCAGCTTGGCCTCGATGTGCTCATCATGCTGGCTTCGCTGTTTTTCATTCCGTGGGAAAACATGGTTCTCTCACTGGTCGGCGCGGTCGCGATGAATGTCATCATCGCGATCAATCATAAACCGGGGCGCTATATTGGCATAAGCTGACAAAAAAGCCGCCATCGAAGGCGGCTTTTTTCATTCCGAACAGTTCGATACCAGACAGTTAATGAGCTGTCATCCATTCGCGGGCGCAGCGCAGGGCAGCGGCCAGTTCGCTTTTGCTCATCGTTGCGGCAACCTGATTGCGCATCCGTTCCGCCTTCTGGTTGCCGCGGATGGCAGCAATGTTGAGCCACTTGTGCGCCTCGATCACATCGATCTCGCAATCGCGGCCGATCGCATATTTCATGCCCATTTCCAGAAGGATACGATCCTGCGCCGCAGGGTTTTCGTTGTGGTTGAAGCTCTGAAACTGTGCCATTTTTCTTGTCCCTGTTAAATCCCGAGAACCGTCCGGTCTTTCTTTTGACGCTGGCTCCTGCCGCTTCAAAACCGCCGGATCTGTCGTTCCGTTCATGATCCTGAAGATAGGGGAAAGGGCTGAAATGCACGTTAAACCAGATGCTTAACATCAAGCCAACAAAATCCGAACAAGGAGTAAACTTGGGATTTTGTTAAACATCCAAGCCTCACGGACTATTGGGTTTTTCTGAAAATTTTCGATTTGGACGCAAGGCCGGCGGAAACCGGATGATAACCATAGGGCGCATTGACGTGGTGAGAATTGGCGCCAAGCGCGCACTCCAGACTGCCATGATGAAGCTGAAAATTGCCCCTTCCGCTTCCCCGGAAACTCGTTTACGTTTACGTTTGCGTAAGCTGATGAGGGGCGCTGAACGCATGCTCCTGCAAAGGCTGAAACGCATGAATAGCGGCTTCATCGCCGCCCCGACAATTCGGGATTCGCGGCATTTTGGGAGGAACGAATGAT
CCGCACACTAATTCTTGGAGTGACGCTCGCCGCTGGTTTTGCAGCACCTGCATTCGCCGACGAAGCCATAGTCGGCACGTGGAAACGCCCGAACGGAACGCTTATCAGCTATGCCGCCTGCGGCGCCAACAAGTTCTGCGGCACGGTGATGACGGGCGAATACAAGGGCAAGTCCATCGGCACGATGTCCGGCAAGGACGGCAATTACAAGGGCGAAGTGAACAAGCTTGACGAAGGCAAGACCTATTCGGGCAAAGCCAGCGTCAAGGGCAACACGCTTTCGCTTTCGGGCTGCGTCATGGGCGGCCTGATCTGCAAGAGCGAAAGCCTTGCCCGGCAATAGAGCACTGGCAGAAACAGCAAAGGGCGGCCACAAGCCGCCCTTTTGATACCATATACGATCAACGCGCCTTCTGGCCGAAGAGAATTTTCTGCTCCTCTTCCTTATTGGCTGCGGCAGCAGCGCGCTCACGCTCTTCCTTGCCAACCAGCAAGCCGCGCTGCACGGCAGGCCGGGCGTTCATGGTTTCATGCCACCGCTTGAGATTTGCGAAATCATCGAGATTCTGTTCGTAATTCTTGTAGGCGTTGACCCAACCGATGCAGGCCATATCCGCGATCGAATATTCATCGGCGATGTAATCGCGGCCTTCAAGGCGGCGGTTCAAGACGCCATAGAGGCGGTTGACCTCATTCGTATAACGGTCGATGCCATACTGGATTTTCTCCGGCGCATAGATGCGGAAATGCCCAGCCTGGCCGGACATCGGGCCAAGCCCGCCCATCTGCCACATCAGCCATTCTTCCACCGCAACGCGCTTGCGCTCGTCAGTCGGATAAAACTTGCCGAATTTACGGCCCAGATATTGCAGGATCGCACCGGATTCAAACACCGAGATCGGTTCGCCGCCCGGCCCTTCGGGATCGACGATGGCGGGCATACGATTGTTCGGTGCAATTTTCAGGAAACCTGGTTCAAACTGATCGCCCTTGCCGATATTGATATATTTCACCGCGTAAGGGACGCCCAGTTCCTCCAGCATGATGCTGATCTTGAAGCCATTCGGCGTCGGCCAATAATAAAGTTCAATAGGCTTGGTCTGTTCAGCCATGGGTCTTCTCCCGCTTTCTTGGCTTTGTCCATGAATCCAATAAGCGTTAGCATAGGCCGGAAGAAAACAGGTACAAGTTAACCCGGCATCAATATTTTTGAACGCGGCATGAACAGCGCCCTCAGGGAAAGTTCAGGTACGCAGCCGTAAGCTCCAATGCGAAAGCAAACAGGTTTCGAGGATTGGGGACAATCATGCTGCTTCAGCGGTTTACAATAGTCTGCCTAATGATCGCGGGTTTTGCCGCCTTTGCCAGCATTCTGACATACCGGACATACGATATTTATGCCGACCAGCCGGTTGTCGACGCTGTCGCCTTGGCCCATATGCACGATCTTGCGGATGGAAAGACACGTCAATGAGAGCCGGAGCACCTTCCATTGCCGCCATTGGCCTGCGCATATTCCTCTTCACCGCCATTTTTACAGGCCCTGCCGCGGCTTTCGCCTATACGCAGATCGGCAACCATGCAGAGATGAACGGGGCCGGTCTGGTTCTTTATATAAGCCTGCAAAGAAGTGCATAATCAGATCATGCCACCGTCATGGAGTTTGCTATACTAGGTCATGACGCAACGAATCGAACATCCATTTTTGACTGATATTAAAACCCCTCCGCCGATGGAGCCGGAAGTTTTTAGCGACGCGCAGGCAGCCGTGGCCGCCTTGTGCAAACTTTATGAGCGCAACACCGCTTTTTTGCGCTCTGCTTTTGAAAAAGTTGCCCGCGGCGAAATTGCGCCGCAGCGTTATCGTGCATTCTATCCCGAAATATGCCTTTCCACGTCGAGCTTCGCCCATGTGGATTCACGTCTGGCATATGGCCATGTCTCGACGCCCGGCGACTATTCGGCAACCGTCACCCGGCCTGATCTTTTCGGGCATTATC
TGCGCGAACAGATCCGCCTCCTGATGCGCAATCATGGCGTGACCGTCACGGTTCGCGAATCCTCGACGCCTATTCCCATCCATTTCGCCTTCAAGGAAGGCGCATATGTCGAAGCCTCCGTCGCCAGCGCCTTTACCCATCCGCTGCGCGACCTTTTCGATGTGCCGGATCTGGCCGCAACAGACGACAAGATCGTCAATGCGGATTTCGAGCCTGCCCCCGGCGAGCCGATGCCGCTTGCCCCCTTCACGGCGCAACGGATCGACTATTCGCTGCATCGCCTTTCCCATTACACCGCAACCAGCCCCAGCCATTTCCAGAACTTCGTCCTTTTTACCAATTACCAGTTCTATATGGACGAATTCTGCGCCTATGCCCGCCAGTTGATGGCGGAGGGCGGCGGCGGTTATGACCAGTTTGTCGAACCCGGCAACATTGTCACGCGCGCTGGCGAAACCGCGCCCAGCACCGGAAACCCGTTACAGCGCCTGCCGCAGATGCCTGCCTACCACTTGCAGAAGGCAGGCCATGGCGGCATCACCATGGTCAATATCGGCGTGGGGCCTTCCAACGCCAAAACAATAACCGACCATATCGCCGTGCTGCGCCCCCATGCGTGGCTGATGCTTGGCCACTGCGCGGGCCTGCGTAACAGCCAGCAGCTTGGCGATTATGTGCTGGCCCATGCCTATATGCGCGAGGACCATGTGCTGGACGATGACCTCCCGGTCTGGGTGCCGCTTCCAGCCCTTGCTGAAATTCAGGTAGCGCTGGAAGAAGCAGTAGAGGAAATTACCGGCCTTCAGGGCTATGATCTCAAGCAGATCATGCGCACCGGAACGGTTGCCACGATCGATAACCGCAATTGGGAGCTGCGCGACCAGCGCGGGCCGGTCCAGCGTCTGTCGCAGGCGCGGGCCGTAGCGCTCGACATGGAATCGGCCACGATTGCGGCCAATGGTTTCCGTTTCCGCGTTCCCTACGGCACCTTGCTCTGCGTCTCCGACAAGCCGCTGCATGGCGAATTAAAGCTGCCTGGCATGGCAACGGAATTCTACAAGCGACAGGTGGCACAGCATTTGCGCATCGGCATCCGCGCCATGGAAAAGATCGCTTCCATGCCGGATGAAAGGCTGCATTCCCGCAAGCTGCGCAGCTTCTACGAAACGGCCTTCCAGTGAGGTTAGGGGAGTAGGGGAATAAGGGAGTAAGGGAATAGGGTTTCTGGCTGTCCGGCCCTCTGTTTCCGGCACGGCTCCAACATACTGCCTTACTGCCTTACTGCCCTATTCCCTTACTCCCTTATTCCCCTACCCCATTCATTTTCTCGCCGTAATCCGGCACTACTGCCGCCTCATGTCAAAAAGCCGCCTGAACCTGCCCTTCCTGATCCTTGTCGCCGCCACGCTGGTTTCGATCCCGCTTGTGCTGAGCTTCCTGAACAGCCTTCATCCGGCGTTCGACACCTTCTCACATTTACGCATCCATCTGGCCGTGCTGATGGGGCTGCTGGCGCTGCCGCTGCTTTTCACGAAACTGCGGCGCGAAGGCGCGATGGTCCTGCTGCTCGCCGTTTTTGCAATTGCCGTCACGCCCCATGTTTTCCCCGCGAGTGAAGATGCCCATGCAGGCGAAGCAGCACAGCCTCACTATCGCCTTTTGCAAATGAACCTTCGCTTCGACAATGGCTCACCGGAACAGGCACTCTCCCTCATCGCCCATATCCGCCCCGATGTCGTCACCCTGGAAGAAGTATCATCCATGTGGCGGGAGAAGTTCGGCCATATTGCATCCGCCTATCCCTACAGCATTTTCTGCCCTCATCCCGGCGCGGTTTTCGGGGTGGCAATCCTGTCGCGCCGACCCTTCATCGCTGACAGCACCCCTGCCTGCGACCCGAAAGGCATGATGGCCGTCGCCTCGGTCGATTTCGGTGGCCGCCCCGTGGATGTGGCCGCACTGCATCTCCATTGGCCTTGGCCTTTTCAGCAGAGCGAGCA
GATCGAGGCCCTTTCGGAGCAGTTTCGCGGACTGTCGGAAAATGCAATCCTGAGCGGCGATCTGAACGCAACACCATGGAGCGCGACTACAAAGCGCATCGCGGAACTCGCTGCCATGACACCTGCCCCGCCGACCGGCCCAACATGGCTTTATCGCCGCCTGCCCGCTTCACTGCGCTTCGCCGGATTGCCTATCGACCAGACCTTCGCGAAAGGCCGGGTCGCGATATCGAAGATTACCCGGCAACAGCCCATCGGCTCTGACCATCTGCCCGTCCTGGTGGAATTTTCCATCATTCCCCAGCCGGAAACTGTCACATCCTAACAAATAAGGCGGCATAGAAGCCGCCTTATTCAGTCAAATTACAATTCTCGTTCCGGTCCTTCACCGACATTTTCCCAACGGCAGTCGAGACGGACACGGTCTTAAAATGTGAACATTACATGTTCACATAGACCGGTCCTTCGCCGGCATTTTCCCGACGGCGGTCGAGACGGACACGGTCTCAAAATGTGAACATTACATGTTCACATAGACCGGTCCTTCGCCGACATCTTCCCAGCGGCGGTCGAGACGGACACGGTCTTAAAATGTGAACATTTACATGTTCACATAGACCGGTCCTTCGCCGCCTTGTGGCGGAACCCAGTTGATATTCTGGTTCGGGTCCTTGATGTCGCAGGTCTTGCAGTGGACGCAGTTCTGCGCATTGATGACGAAACGCACATCCTTGACGCCCGGATCAGCAGCGGCATTGCCGTCCGCATCCACCCATTCATAAACACCAGCCGGGCAATAGCGTGTCGACGGTCCTGCGAAAACATCGTGTTCCGAAGTCTTCTGCAATTCCATGTCACGCACTTGCAAATGCACCGGCTCGTTTTCGTCGTGGTTCGTGTTCGACAGGAACACAGACGACAGACGATCAAAAGTCAGCACACCATCCGGCTTCGGATAGTCGATCTTCTTGTAGTTCGCAGCCGGTTCAAGCGCCTGCGCATCGGTCTTGCCGTGCTTCATCGTGCCGAAGAACGAGAAGCCGAAAAGCTGGTTGGTCCACATATCCAGGCCGCCCAGAGCAATGCCGATAGCGGTGCCGAATTTCGACCAGAGCGGCTTGACGTTGCGCACGCGCTTCAAGTCCTTGCCAATGGCGCTCGCCCGCCAGCTATTCTCGATCTCGATCGGCTCGTCATTGGCACGGCCAGCGGCAATCGCCTCGGCAATCTTGTCGGCGGCCAGAATGCCGGACAGGATCGCATTGTGGCTGCCCTTGATGCGCGGCACATTGACAAAACCGGCCGAGCAGCCGATGAGCGCGCCGCCGGGGAAGGAAAGCTTCGGCACGGATTGCCAGCCACCCTCGGTAATTGCGCGCGCGCCATAGGAAAGGCGCTTGCCGCCCTCGAACGTATCGCGGATGGCCGGATGCGTCTTGAAACGCTGGAATTCCTCGAATGGCGAAAGATAGGGGTTCTTGTAATTGAGGTGCAGCACAAAGCCCACCGCAACCATATTGTCCTCAAGATGATAGAGGAACGAACCGCCGCCGGTTTTCATGTCCAGCGGCCAGCCGAACGAATGCTGCACGAGGCCCGGCTTGTGCTTCGACGGATCAACCTGCCAAAGCTCCTTCAGGCCGATGCCGAACTTGGCTGGTTCGCGGCCCTCATCAAGCTTGAACTTCGCAATGAGCTGCTTGGCGAGTGAACCGCGCGCGCCCTCGCCAATCAGCACATATTTACCGAGCAGCGCCATGCCGCGCGTATAGTTCGGGCCATGTGTGCCGTCGCGTTCGACGCCCATGTCGCCCGTCGCGACGCCGATCACCGCGCCCTCGTCATTATAGAGTACTTCGGTCGCAGCGAAGCCCGGATAGATTTCCACGCCCAGCTCCTCGGCCTTCGTGCCGAGCCAGCGGCAGACATTACCGAGCGAAACGATGTAATTGCCGTGGTTGTTCATCAGCGACGGCATGGCAAAATTGGGCAGGCGCACG
GAACCGGCAGGCCCCAGCACCAGGAAATGATCCGCCGTCACAGGCGTTTTGAAAGGATGGCCTTCCTCCTCGCGCCAGCCGGGCAGAAGCTGGTCGATACCGACCGGGTCCACCACCGCACCCGACAAGATATGCGCGCCGACTTCCCCACCCTTTTCCAACACGACAACGGAAAGTTCGGGGTTTATCTGTTTGAAACGAATCGCCGCTGCAAGACCGGCAGGGCCTGCGCCGACAATCACGACGTCGAATTCCATGCTCTCGCGTTCGGGAAGCTCGTTCGCTTCAGACATTCATCCCTCCGTCCGGTATGCCATCATCTCCTCGCTATGCCAGCGCCAATGATGCATCCAATTGTTTTTCCGCATGATCCTATCACAAACGCCAGTGAACGGGTCAAAGGCGTCTGGAAGATCAGTCTGTCCTGCATTCCGCCATACCCTCTCGCAGAGTTGGAGACTGCCGCATATGCGTCCAATTGTCCAAAAATGGCATCATACAGCTACATTTTTATTTTACCTTAACGTAAACGTCAAATTTGGCCGATTTAGATTGCTGCCATCACGATTTGGCGAAAGAATGTTCTTCTTATGCCAACAATCGGCTGTATGAACCGGAAAATTGGCCCATGCCCTTGCATTCGGCGCCAAAATTCGCCACCTCAGACGGTGAGGTGCAAGAATGACCGATGGCCGGGAAACGCTGGATATAGAGGGCCTGCTTCGCTTTTATGCGGAAGCGGGTGTCGATGTGCCGCTTTGCGAAACGCCGATTGACCGTTTTGCGGCCGCCACACATCCTGCCCCCGCACAAAGCCGGATGCAGGCCGCAGCACAACAGCCGGAATCAAACCCGGCGCAGGCCCGCGAGGAACGGGCGCGTACCGTTGCCGTACCCTCCCCTTCCAGCAAGCCGGTGCAAGCTGCAATGGACCTGCCCGACAATGCGCAGATCGCGCTTGCCCGCGAGGCTGCTTCGCAGGCCGAAACACTGGAAGAACTGCGCGAAAAGCTTGCAGCCTTCGACGGCTGCAACCTGAAATTCACCGCCAAGAACCTCTGCTTTGCCGATGGCGACCCTTCGTCCGATATCATGTTCATCGGTGAAGCGCCGGGGCGGGACGAGGATATGGAAGGGCTGCCTTTCGTCGGCAAATCCGGCCAGCTTCTGAACCGCATGATCGAGGCTATCGGCCTCAAGCGTGAAGAGGTTTATATTGCCAACACGATTCCGTGGCGGCCACCGGGTAATCGGGCGCCCACACCGCTTGAAACGGAGCTATGCCGCCCCTTCATCGAACGGCAGATAGAGCTGGCCGCACCAAAAGTGCTGGTGGCGCTTGGCGGTCCAGCCGGCAAGGCGCTGACCGGCGCAGCGGAGGGCATATTGCGCCTGCGCGGCAACTGGAAAATCCACCGCACGCCGACAGGCATGGAAATCCCCGTGATGCCGACATTGCATCCGGCCTATCTTCTGCGCACCCCGGCACAGAAACGTTTCGCCTGGCGCGACTTTCTGGCGGTGAAACTCAAATTGGCCGAATTGCGTGGTTGATATTGCCCGCATAGAGCGATAGCACGCGCGCATCATCACAGCCTGGTGCCGCAAACCCGGCTCAATTCGTTGTTTTCATGCGTATCAATCCGCCACTTGTCAAATCACGCCAAAGCGACAAGTAGCTTTTGAATTCCTGACCCATTGTGGAGTCCGTCATGGCGGGACAGCTTCTGCCCATAGCCGCGCTGCTGGCAAGCACATTTCTCATGCTGCTCGCAGGCGGCCTTGCCGGTATTCTCCTTCCGCTTCGCGGTGGCATGGAAGGCTGGTCCACGACGACAATCGGCTGGATGGGGACAAGCTATTCGCTGGCCTTCACCATCGGCTGCATTTTCATTCCGCATCTGGTGCGCCGCGTCGGCCATGTGCGTGTCTTTTCGGCGCTTCTGACGCTGCTGTCCATGGCGCTGCTGTTCCATGCACTGGTGGTCAACCCGGCCGCG
TGGATGATCTTTCGCGGCATTGCGGGCTTTTCCCTCGCCGGTTCCTACATGATTATCGAAAGCTGGCTGAACGAGCGCGTGACCAATGAATCGCGCGGCATGATCTTCTCGATCTATATGATCATCACCATGGTCGGGCTGCTTCTGGGGCAGTATATCCTACCCTTCGGCAATGCCGCCACGCAGACACTCTTCATCATCTGCGCCATTATCTATGCCAGCGCGCTTCTGCCCACAGCACTTTCAAGCGCCCAGTCACCCAACCCGCTGACACAGGTTTCGCTGGACCTCAAAGGGCTTTACCGCCGCTCGCCCGCCGCAGTTGTCGGCTCCTTCATCGCGGGCATCGTGGCAGGTACATGGAATTTCCTCGCCGCCATCTATGGCGAGATGAACGGGCTTTCCACTTTCGGCATCGCCACCATGCTTGCCTCGGCGATGATCGGCGGCGCAATCTTCCAGTACCCGCTTGGGCGCGCTTCCGATTTCGTCGACCGCCGCTATATGATGATCCTTGCAGGCGCCATCGGCTTCATCCTGTCGTTCATCATGGTGCTGTTTCACCCGACCTCGCCCTATACGCTCTATGCGATGATGTTTCTGTTCGGCTCGGTGGTCTTTCCGATCTATAGCCTGAACGTCGCCCATGCGAACGACTATGCCGATGCCAGCGAATTCGTGAAGATTTCCGGCGGGCTGCTCATCGTCTATGGGGTTGGCAGCGTGCTTGGGCCAGCAATTTCAGGCCCGCTGATGGATGTCATCGGCGCGAACGGATTTTTCGTCACCATGGCGATTGCCTATTGCATCTATGGGCTGCACGCCTGGTGGCGCATCTATCGGCTGGAGCGCCCCGCAATCAGCGACCAGAAGACGGAATTCAAATTCCACACGCCCGACGGCCAATCGACACCGGAAACCATGCAGCTTGACCCGCGTGTGGAAGGCACATCGGGGCAGGCGCAATAAGACCCTATTGCGCCTGCGGGGCATTAACCTTGTGCGGCGCCAGCGAAAGCTGGTGCTCGCGGTAGATGATAAAGCCTCCGGCAGCCATCACGATGCCGCTGCCGATCAACATTTCAAAGGTCGGTATATCGCCGAAGAGCATGAAGCCGATCACCAGCCCCAATAGCATCGAAGTATATTCAAACGGCGCAATCGTAGACATGGGTGCATGGCGATAGCATTCCGTGAGCAGAATTTGCCCGATGCCGCCCGCAAAGCCCGCCCCCACCAGCATGGCAAGCTGTGACCAGTTCGGCACAACCCAGCCGAACGGCAGGCTCACCAGCGCGATCACGCTTGCCGATATGGAAAAATAGATCACGATGGTCGGCGTACGTTCGGTCTGGACCAGACGGCGCACCAGCATCATGGCGACAGCCGACATGACCGCGGCCCCAAGCGCGGCCAGCGCACCCACCGCTTCGTCGCGCCCGACACTTCCGGCAGAAAACAGGGTCAGGCGTGGCCAGATGATAATCATGACGCCGAACAGCCCGATCAGCACCGCACTCCAGCGGTAGAAGCGCACCACTTCGTGCAGGATGGCCGCCCCCAGAATAACGGTAATCAGGGGTGAGGCATAATTGATGGCGATGGCCTCCGGCAAAGGCAGCTTTGTCAGCGCGAAGAAGCTCATCGACATGGAACACACACCCACCAGCCCGCGCCAGAAATGGCTGAAGCCGTGGCGGGTAGAGAAAACCCCGCCAAGCTGCCCGCGCCAGCCCAGATAGAGCAGGATGGCGAAAATTGCGAAGAAGGAGCGGAAGAAGATCAATTGGCCGACCGGAACACCTTCCGCCGCCTTGAGCAAGGTGGACATGGCGACGAAAACAGCAACGGAAGCAATTTTGAGGCCGATGCCCAGCATCGGATTCATTTCAAGCCTGGCCGCATCGGCGGCTGCGCGCGTGTGGGCGTTCACGACGCTATTCCTTAAAAGTAAGCGGTGCGAATCTCCGGCTGAAAATATTTCAGCCAGACCCCTATTT
ATTAGACGTGTTTTTGTGGCCCCACCAGTAGGGCAAAGCTTAACAGCGAAATTTATTTGGCTCAGTAAAGCTCGCACTCACCCCGTAGTTTTGCTGGAATTGCCCTGAAAAGCCGGTTACGACTCCAGCACCCAGGAGTTCAAAGAAAGAGTTTCGCAATGCGTACTGAAACCGGCCATACTTTCCGCCTCGAAGATTATCGCCAGACGCCTTACGCCATCCCCGAAACGAAACTCGACTTCACACTGGAGCCGGAAAAAACCATCGTGCGCGCAACGCTCACCATAGAGCGCCGCCCCGATACGCCCGCCGGTACGCCGCTCGTTCTCCACGGTGACGCATTGAAGCTCGTGAGCCTTGCCATCGACGGCAAGGCGCTTTCCGACAACAGCTTTTCGGCCACGCCCGACCAGTTGGCCATCAGCGATCTTCCGAAAGATGCGCGCTTCACCTTGCAGATCGTGACCGAGGTGAACCCAACAGCCAATCGCCAGCTCTCCGGCCTTTACCGCTCGAGCGGCGTCTATTGCACCCAATGCGAGGCGGAAGGCTTTCGTCGCATCACCTATTTTTACGACCGCCCGGACGTGCTGTCGGTCTATACGGTGCGCATCGATGCCGACCGCAAAGCCGCTCCCATCCTGCTTTCAAACGGCAATCCTGTCGAAAGCGGCATGGTGGAGGGCCATCCGGAACGGCATTTTACCGTCTGGCACGACCCACATCCAAAACCCTCCTATCTTTTCGCGCTCGTCGCCGGTTCGCTCGGCGTAGTGAAAGACCACTTTACAACCCGATCTGGACGGCCCGTCGATCTCGCCATCCATGTGGAACATGGCAAGGAGGGCCGCGCGCTTTATGCGATGGACGCGCTAAAACGCTCCATGAAATGGGACGAGGAAAAATTCGGCCGCGAATATGACCTTAACGTTTTCAATATCGTCGCCGTCTCCGATTTCAACATGGGCGCGATGGAGAACAAGGGCCTCAATATCTTCAACGACAAATATGTGCTGGCCGATCCTGAAACCGCGACCGATGCGGATTATGCCGGCATCGAAGCCGTTATCGCACATGAATATTTCCACAACTGGACCGGCAACCGCATCACCTGCCGCGACTGGTTCCAACTATGCCTCAAGGAAGGCCTGACGGTTTATCGCGATCACGAATTTTCCGCCGACCAGCGCTCGCGCCCTGTCAAGCGCATTGCGGAGGTGAAAATCCTGAAAGCACAGCAATTCCCGGAGGATGCTGGCCCGCTTGCCCATCCGGTGCGCCCTCGCGAATATCGCGAGATCAATAATTTCTACACGGCAACCGTCTATGAAAAAGGTTCGGAAGTCGTTCGCATGATCCGCACCATCATCGGACCGGAGCTGTTCCGCAAGGGCATGGACCTCTATTTCGAGCGCCATGATGGCGATGCGGCGACCATCGAGGATTTCATCCAGGTTTTTGCCGATGTTTCCGGGCAGGATTTCTCGCAATTCGCGCTCTGGTACGACCAGGCCGGCACACCGAAGGTGGAACCCGGGTTCCATCATGACGCAGCCGCGAAGACATTCACGATCAAGCTGGAACAGTCACTTGCGCCGACACCTGGCCAGTCGATCAGGAAGCCCATGCATATACCCATTGCCTTCGGCCTGATCGGGCCGGACGGTAAGGACATGCAGCCCTCGTCGGTGGAAGGCGGCGAGGTGCGCGACGGCGTAATCCATTTGCGCCGCCCGTCCGAAACCATCGTCTTCCATGGCATCGAGGCCCGGCCCGTGCCATCGCTGCTGCGCGGCTTTTCGGCGCCGGTCAATCTCGCCGCGCCTCTCACGGCGGAAGACCGGATTTTCCTTGCCCTGAACGATAGCGATCCCGTTGCGCGCTGGCAGGCGATGAACAGCATTTTCTCTGCCGCCCTTCTGGATGGCGCCAAGCGTGTGCGCGGCGGGCACCAGCCGGAAACCGATCCGAAGATCGTCGCGCTGGCCGGAAAGGTCGCC
TTCGATGAAATGCTGGACCCGGCTTTCCGGGCGCTTTGCCTGACGCTGCCGAGCGAAAGCGATATTGCGCGCGAAATGGGTAACAATGTCGATCCCGACGCAATCCTCGCCAGCCGCAACCATCTCATTGCAGCAATCGCTTCAGCCTATGCCGATGGATTTGCCGGGCTCTATGACACGCTGAAGCAGGAAGGGGCGTTTTCACCCGATGCGGCCCCGGCGGGAAAGCGTGCCTTGCGTAGCGCCCTTCTCGATTATCTCAGCGTTCAGGAAAAGAGCCCTGAACGCGCAGAAAGGCAATTTGTCGAAGCCGACAACATGACGGACCGCGCCACGGCGCTGGCCGTTCTGGTCCATCGTTTTGGCGATAGCGGCGAAGCCCGGCAGGCGCTTGCAACCTTCGAGCAAAAATTCGGCCAGGATGCGCTCGTGATGGACAAGTGGTTCATCGTGCAGGCGACACGCCCCGGCGAAACGGCCCTTGAAGCAGTCAGGGAACTGACCCGCCATCCGCTCTTTTCTCTCGACAATCCAAATCGCGTGCGCGCGCTCATCGGCGCATTTACGGCTTCCAACCCGACCGGGTTCAACCGTCGGGATGGTGCTGCCTATGGTTTCCTCGCCGATACGCTTCTGACCATTGATCCGAAAAACCCGCAGCTTTCCGCACGGCTTTTGACGGCAATGCGCTCATGGCGGTCGCTGGAAGAGGTGCGGCGCGAATATGCCCGCGCGGCACTCGCGCGCATTGCAGGTGCAGGCAAACTCTCCACCGATCTGCGCGACATCATCGACCGCACGCTCGCCTGATTCGCAAAACCTGAAAGTCGAGCGATGATTCGCCGATCCGGCCCGCGCCAGCGCGCCGGATCGTTGCACTTGGCGCGCCCGCCCGCACACCCTGTTCATCCTTCCGGCTTTCATGAGTCTTTTCATGGCGTTAAAGGCTGCACCTGCCAAACTGATTCCACTTAAACTCTTGAGATTCATGAGAAAACGGGCAATGTAACAAAGCGATTTTTACGCCTCGTTAACGAATTCACTGGACAGGGCGAATCACTTTTGATTCATTGAATGCATTCGGAAGTGGCGTAATCCGAATCATATTTTGAACCAGGATTTCAGTTCATACAGTTCTGCGTAGAAGGGTGTTTCATGCGCTTCGCAGGGCTGTTCCATAGATTCAAGAGGGGCCGAGAGATGGCGAGCACCGACGCGTATGGCGCGCCGGCGGGGACGCATTGCGAATCCGTCCGGAAAAAGAGGAAAGGCAGGCTTTCGGGCCATGTCAGCCTTCTCGCGGGGCCGGCTTATAGCAAATTCATCATCATAGAACCCATCCTGCGCCGCCTCGTCCCGGCACTCATCATCATTTTCCTGATTATTCTCGGCGTGGCCCGCGTTTTCTCGCTTTTGGCCTGGCGCGACGATATCGAATTGCAGCACAAGGCTGCTCTTTCCGGCGCGACGGCGCATCTGGCGCAGATGATCGAGCGTGTCGCCAACGGGATCGAAACAGGCGCACAGCTTTCCGCCAAGGATTTGCAGGATGCCATGACGGAGCTTCGCTCGCGCGGCCTTACATCATCCGGCATGACCATCGCCATCGTGGATGCGCAATCCATGATTAAAGCCGCATCCGGCCCGGCCGGGATCGCTGGCAGCCAGATCGACACCATTCTGGGGGATGCACAGCCGCTGTTCCTTTTTGCCGAGCGCGCCGGTGTGCTCAGGGTTGTGTTGCAGGGCGAAGCCGCTTTCGGCGCGCTGGCCAAGCCAATGACCGCGCCCTATTCCATCATTGCGGTCGAACCGGAAAGCACCATCTTTGCCAAGTGGAAAAGGGCCGTATCGCTCAATGTGACGCTTTTTGCCGGCACGATCGGCGTGATGTTCGCAATTCTCTATGCCTATTTCAGCCAAGCCGCGCGGGCACGCGAGGCCGACGATCTCTCCGGGCAGATCCAGCGCCGCATCGACATGGCGCTGGCGCGCGGGCG
TTGCGGCCTTTGGGATTGGGACATGGCGCGCGGGCGCATCTACTGGTCACGCTCCATGTATGAAATGCTGGGCTATGAAGCGCAGGATGCCGTGCTGCCCTTTGGCGATGTGGCGGCGATCATCAATGAGGAAGATGGCGATCTCTACTCCATCGCCGAACAGGCGGCGGCTGGCGATATTTCACATGTGGACCGGGTCTTCCGTATGCGCCACGCGGATGGTTCATGGGTCTGGATGCGTGTTCGCGCCGAAATCGCCAGCGAAGGCGACCTTCATCTGGTCGGCATCGCCTTCGATGTCAGCGAGCAGCATCGCTTCGCGCAGCAGACCGCCGAAGCCGACATGCGCATTCGTGAGGCAATCGAGAATATTTCGGAAGCCTTCGTTCTATGGGACGCGAATAACCGCCTGGTGATGGCGAATTCCAAGTTCAGCGAATATGCGGGCCTGCCGGTCTGGACGCTGAAACCGGGCGTCCCACGTAACGAAGTGGACGCGCATACCCGCCCCTTCACCTTCGAGCGCCGCATGGCAAACGAACACAACCGCGCAGGCGGCCAAACTTTCGAGCGGCAGTTGAGCGACGGGCGCTGGCTACAGGTCAATGAACGGCGCACACAGGATGGCGGCATGGTCTCCATCGGCACGGACATTACCCAACTCAAGCTGCATCAGGAGCGCCTTGTGGATAGCGAGCGCCGCCTGATGGCGACGGTTCACGATCTTTCCGTCGCCCGCAAGGGTGAGCGCGACCGGGTGCGCGAGCTTTCTGAACTGGCGCGCAAATACAGCCTTGAAAAGGAACGCGCAGAAGCGGCCAACCGGGCCAAATCGGAATTCCTCGCCAATATGTCTCACGAGTTGCGCACGCCGCTCAATGCAATCATCGGCTTCTCGGAAATGATTCAGGCAGGCACGTTCGGCCCGCTGGGTTCCGACCGCTATGAGGAATATATCAACGACATTCACACCAGCGGCAACTTCCTGCTCAACGTCATCAATGACATTCTGGATATGTCGAAGATTGAGGCCGGACATTTCTCGCTTGATCGCGAGGAAATCGATCTCTGCCCGCTCATCAATGAAACGGTGCGGATAATCTCGCTCCAGGCCGAAGAGAAGAACATCGCGGTCGAAACACGTATCGAAGACGCGATGGAGCTTTATGCAGACCGCCGCGCGATCAAGCAGGTGCTCATCAACCTTCTCTCCAATGCGGTGAAGTTCACCTCTTATGGTGGCCGGATCACAGTGCGCGCCCGCAAGACCGGTGCCGCCCTGTTCATGACCATTCAGGATACCGGCGTCGGCATTCCGAAATCCGCACTGCGCAAGATCGGCCAGCCCTTCGAGCAGGTGGAAAACCAGTTCACCAAAACTCATACCGGTTCGGGGCTTGGCCTTGCCATCTCCCGCTCCCTGGCGGAGTTGCATGGCGGCTGGCTGCGCATTCGATCCACCGAGAGGGTCGGCACGGTGGTTTCAGTCTGCATCCCGGATCGCAATCCCGCGCCCAATGCAGGACACGACGCCCGGACCCACGCTGCTTAGAGCATTTTCGAGCCAAAAGTGCGAAACGGTTTTCGCTCGAAAATGCTCTTAGCCTCGGCCGGTCTGTCACAAACGTAATATTAAATAATTACACAAGATATACATAGTGCAATCATATTTGATCGCATCACGCATAATGACACGGGCTAAAAAGGGTGGCATATCAGACAGAAATCCTTTGATGGTGTAAGAGCGGACCATAAAGGTGTCATCTGATAAGTTGAAGGGCGGTCATCATGCAAAAAATCGCAAGTGCGCTGAGGCAAGCGAATTAATGATATTGCCCGTATTCGTCATCAATATGGCCTCGCAGCCTGCCGCCTATAAGACCGTCGCAGCCTCCATTGAAGCTTACGGGCAGGGTTTCCAGCCTCACAGGATTGATGCGGTTAATGGGCATACAGCGACACAGCGCATTGGCATTGACGATGCACGTTTTGATGCGATC
AATGGCCGTGAAATGCTGCCCGGTGAATACGGGTGTTATCGCAGCCATTTGAAGGCATTGGAAAGCTTCTTATCCGACGGCTCCCCTTACGGCCTCATTCTGGAAGATGATGTGGTTTTTACTGAAACCACATCATCGCGCATTCATGACATCATTAAAAGCCTGCCTGATTTCGACGTCGTGAAACTGGTTAATCATCGTTCACCCTTATTCATGAGCCTGCTTGAAACAGATGCAGGTGACAGGATCGGCCGCGCCATTCATGGCCCCCAAGGATCTGCCGCCGCCTATCTCGTCAGCAGAGAAGGCGCCCGGAAGCTTTTATCCGCACTATCGACCATGGAACTGCCGTGGGACGTTGCCATGGAGCGATTCTGGCATCACAAAGCCCGGCTGTTCAGCAGCGATGAAAACATCCTCGCTTTTTCTTCTCACAGCGAAATCTCAAATATTTCCGATCAGAATTCAGGTTACGATGAGGCCAAGCACCCTTGGTATAAGCGCTTGAGAACGTCATTATTTCGCACTTTTGATTATTATGTGCGTGTTCACCATACATTATTGCAACCTCAAAATCCCGATGGCAGCAGCATGAAAAGCCAGTCCGGAGCCTATAGGCTGCCCGGAATTTCATTAACTGGCGAACTGATTGCCGCCATCAGCTTGCTGATTTTCATGTCTACGGTATGGGTAGAGACGGACGCCTATAGATATATAGCCCTCGGTTTTGTGGTGGCTGCATTGATCCGTTATGCCCGCACCGACTTCTGGAAATACGAAAAACCGATGGTCGGCTGGGCCGGGTTAATTTGCGTGGCATGGACATTTTATGTCCTGGCGAGGTTCGCATATATCTATCTGTTCTACCCGGAAATGGGCACCGGCTCGGCAGAGGGCATATATCTTTTTCCGCTTTTCTACCCGACATTGGGGTTTGCGTTACTGCTTTTTATCCGACGGCCATTTCTTATTGCGGTCGCCTTCATGGCGATCAGCCTCGTAATTCTCATATTCGGCTTCCACTATGATCTGTCATGGAACGAACGAGCCGTTACGTTGCTCCAGCATAATCCGATCCATGCGGCTGTCAGCAGTGGCTTTATCGCCCTATGCGCAATGGCTTTTGGCATTCACACATTGAATCGCAACACGCTCGATACCAGAGCGCGCGTCGTTTTGTGCCTGCTCGCGCTTGCTACTTTTATTGCGGCCCTGATTGCAATCTACAGCCTCTATTCAAAAGGAGTCTGGCTCGCAATGGCAATTGCATTTCCGACCTTTGTGGTCCTTGTTGCGCTGACAGATAAAAGCCAGACCTCACGCATGGCTGCACTGGTGTGCATTCTCATTGGTTTGTTGAGTGTGTTTGCGGGAGAACATATCCTGCAACGCGTCGGCGGCAATACTGCCAATACATCCTGGGAATTGTTATCGGACCTCAAGACGGGCGATAACATCATGCAGGATTTCGACAAAGCCATCAAAAACCCGGAAACAGGCCTGAGCGAGCGCGAACGTCTGATGATATGGGCCAACACACTGCATATCTGGCATAAGAATCCGATATTTGGCGCAGGCGTTTCGTGGCTTCACTATTGGGAAAAGCGCCCTTATCAACAAACCGACTTCACCCTGCTCCACAATGGATATCTGGAAATTGCCATCCGCTACGGATTTCTGGGCTTGCTGTTCTATGGCGTTTTGACGATCTGGGCGGTTCGGTGCACATGGCAAGCCACGCGAGCAGGTCTCATCGACAGTGCTGCCTTTCAATGCTACGTCGCAACACTGGTATTTTTTGCAGTGACGATCTTGTCAAACTCCAATGTTCGTCTGGCAATAGGAGAATCCTATATGGCACTGGCATTCGGCTTTGCCTTTTATTGCCAGTACCTTCTGCAACAACACAACAGACAATACCCGCGCACCTATTTCTAGCGCGGTTCCACTTTTACCGCGTCTTTTCCTCAATCAGCGCCATGTCCGACAAGCTTGTCAAA
AGCAGCCCTGACAAGCCTGTAATGTTCCAGCAACTCATATTCTATGCGTTCAATATCCGGCAATCCGGCAACGCGGCACAGGAGCTCACGCATACCGGGCGGAAATTGATCCAGCCCTGCGCTATCGTTCAGGCAAAGCCGTATCGCCTGGCTAAGATTGGTGTAGAAGCGATGCGCCTCCACAAGGCCATCCACCATTGCAGGGTCTGCAAAGGATGGATCGAGGTTTGCCAGCACTTCTTCGGTTGCAAATGGACGCGGCGTTTTCTTCACATACCCGGCAAGGGTTGCAAATTGGGCGATAAACTCCAGATCGATAATGCCGCCGGGCTTCAGCTTTAAGTCCCAATCATCCCGCGGCGGCTTTTCCTGCGCGATGAGTTCCCTCATCTCGCGCACATCCCCGGCAAGTTTCCGCACATCGCGCGGCATTGCGAGAACGTCCTCTATATCCGCCTTGATGCGAGCAATAAAAGCCTCATCGCCATGGATGGGCCGCGCGCGGGTCAGCGCCATATGTTCCCAGGTCCACGCATCGTTGCGCTGATATTTGCCGAAAGCTTCGATATGGGTTGCGACAGGGCCTTTGTTGCCCGACGGGCGCAGCCGCATATCCACCTCATAAAGCACACCTTCCGCCGTCGGGGCCGAAAGGGCCGCGATGAGGCGCTGTGTCAGGCGAATATAATATTGTGAAGGCGCAAGCGGCTTTTCACCATCGGATTCCTCGGCATCCTTGTCATGAGCGTAAAGCAGGATCAGGTCCACATCCGAGCCCGCCGTCAGTTCACGACTGCCAAGCTTGCCCATGGCGAGCAGCGCCACTTTCGCGCCCTTCACCTTGCCGTGGCGGCGTTGCAATTCGGCTTCCACCGCCTCCAGCGCCCTACCAACCATAAGTTCGGCAAGATCGGAAAAGGCCTGTCCGGCACGCACGCCATTGATCGCTCCCGTCAACAGGCGGATGCCGATGAGGAAGCGATGTTCGGCAGCAAAAATACGCAGCCTGTCCAGTACTTCCTCGAAATCCGTGGCGCTGCCCAGAAATGCCCGCAGGCGTTCTTCGAGATAGGCACGCGTTGGCACTTCTGAAAAAATGGCCGGATCGAGCAACCCATCAAAAACATGCGGGTTGCGTGTGATGATGTCCGCCAGCCGCGGGGCCGCGCTCATGATCATCACGAGGAGGTTCAAAAGCCGGGGATTGGATTGCAGCAGGCTGAAAAGCTGAATACCGGCGGGCAAGCCCTGCAAAAAACCGTCGAAGCGCAGAAGCGATTCATCCGCCCGCTTGGTTTCTGCAAAGGCTTTGAGAAGTGCGGGCGTCAGTTCAGTGAGGCGCTCCCGTGCTTCCGCCGATTGCGTGGCGCGATAACGCCCGAAATGCCATGTGCGGATCACGCGGCAGATATCGCTTGGGCGCTCGTAGCCCATGGCAGAAAGCGTTTCCAGCGTGCCCGGATCATCCACATCGCCGGTAAAAACAAGGTTGCCGCTCGCCGCGCCTAGTTCCGGCGCCTGCTCGAACAGCGCCGCATACTGCTTTTCCACCACCTTGAGCGCGGCGAGGAATATTTCGGAAAATTCCGCCGGGTCGGCATAACCCATCATATGGGAAACGCGGGCAAACCCTTCATCATCTTCAGGCAGGATATGGGTCTGCTCGTCCGCAATCATCTGGATACGGTGTTCGACATCGCGGAGAAACCAATATTCCTGCGCCAGCGCATCGCGCGCCTGTTGCGTTATCCATCCCCGTTCGGCAAGCCGCGCCAGCATCGGCACAGTCTGGTTGCCGCGCAGTTCGGGAAAGCGCCCGCCCGCAATCAATTGCTGCGTCTGGACAAAAAATTCGATCTCCCGGATACCGCCCCGGCCAAGCTTCACATTATGCCCGCGCACGGCAATATCGCCGTGGCCCTTATGGGCGTGAATCTGGCGCTTGATCGAATGGACATCGGCAATTGCCGCATAGTCGAGATATTTGCGCCAGACATAGG
GCGACAGTTCCGCCAAAATCTGTTTGCCGGACAAGCGATCTCCGGCAACGGGCCGCGCCTTTATCATGGCGGCGCGCTCCCAGTTCTGGCCACGCCCCTCATAATAATGCAGCGCAGCGCCAACCGGAATGGCAAGCGGCGTGGAACCCGGATCAGGCCGCAGACGCAGATCGACACGGAAGACGTAACCATCGCCGGTGCGGTCCTGCAAGATGCGCACCAGCCGCCGCGTCAGCCGCGAAAACGTATCAACACATTCATAGGGATCGCCGATAGCAGGCTTGGTTTCATCAATGAAAACAATCAGATCTATATCGGAAGAATAGTTGAGCTCGCGCGCGCCGAACTTGCCCATGCCAAGAACGATCCAGCCACAATCCTTTTCCGGATTGCTGCGATCCGGCAGATTGATCCTGCCAGCCGCGTCGGCATCGAGCAATAGAAAGCGGACGGCTGCACCTGTGCAGGCTTCCGCAAGGCCGGTCAGCCGGTCGGTGGTTGTTTCCGTATTGAAAATGCGCGCCAGATCGCAAAGCGCAATCAGCACATGGGCCTCACGCTTTAACTGGCGAAGGCTCGTCATCAGTTCGCTTTCGCTGACACCCGCGATGGTCCCGCTGGCGGAAATTTCGTCCAGAATAGCCTCAAGTGCGCTTTCCGGCGTTGCGGAAACGATACGATCCAGAATGCGCGGCTGGCGCGTCAGCGCCTCGCGGATGAAAGGCGAAAGATCGAGGATGGCTGAAAGAAAATCCACCGCCTTTTTCCGGCCAAGCAGCGCCACAACGTCGGCAAGCTCTTCCTCGCGGGCACGGGCTTCCAGATCAGCCAGAAAGGCAGATGCCCTTTCAGGATCAAGCGGTATCAATGCACAAAGGTTTCTCTCGAAAAACAGTGCCTTTGCGTTTTCAACCGTCATGATCCCCTCAATATCTGCCGTTTAGAGCGCGTTTCGATCTGAATGAATCAGATCGGCGCTCTAATCCTTTGTTTTGACGCGCATCCCGAAAACCGTTTCACACTTTTCGGGATGCGCTCTAGCCAACCTCGCGATGCGGCAGCGGAAATTCCAGAACGGCGCGAAGGCCGGGTCCGTTGTCCTCAAGACGAAGCGCGCCACCGTGCAACTTCATAACCGCCTTGGCAAGACTTAAACCCAGGCCAGACCCCGGCTGTGTGCGGCTTTCTTCAAGGCGCACGAAACGCTCGGTCGCATGGTCACGTTTGTCGGCGGGGATGCCGGGACCATTATCGGCCACAACGATGCGGACCCATTGCGCATCCTTTTCCATCAAAAGCGTGACCGTCGCCGTGCGTCCTTCGCCGCCCGCATATTTGATCGCATTGTCGACCAGATTGGACACGGTCTGGCCAACCAGTTCGCGGTTGATGTGCAGGGCTACATCATCGAGCGCACCAAGCGTCAAGGTAACACCCGCATCCTCTGCCACCGGCTCATACATTTCCGCAACATCGCGTATGATCGGGGCAACAGGCATATCGTCGAGGTTTTCAGATGAATAACCGGCTTCAAGCCGCGAGATCATCAAAATGGCATTGAACGTGCGAATAAGCTGGTCGGATTCGCCGATGATATCTTCCAGCGCAGCGCGATATTCCGGCTCTACCTTCTCACCGCCAAGCGCCTCCTCAGCGCGGTTTCGCAGCCGCGTGAGCGGCGTTTTCAGGTCATGCGCAATATTGTCGGAGACCTGCTTCAGCCCTTCGTTCAATTCCAGAATGCGTGCCAGCATGACATTGAGATTGCCCGACAGCCGGTCGAATTCGTCGCCGGAGCCGTTGACGGGAAGCCTGCCGGTCAGATCGCCATCCATGATGCGTTGCGATGCGCGCGACACATCGTCGATGCGTTTGAGCGCGCGCCGCCCCACGAAGAGCCAGATCAAAAGCGCACCCACGCCCATGATGCCAAGCGCCAGCACCAGCGAGTTGCGTATCAGATCGCGAAATCGCTCAGGTTCGCCCAGATCCCGCCCGAC
GAGCAGCCGCATTCCATTCGGCAGGGCAATCACCACGGCAATGGCGCGGTGCTCCACCTGCGGCGCCTGTTCGCCGTAACGGCGATAGGTAAAGGCACGTTCGATAATACCGTCCGTGTTGAGCACGCCCGGTTCAACGCTTTCCACATTACCGGCAAGAATACGGCCCGTAGGGTCGGCGACAAGATAGAGATAGGCACCCGGCTGGCGCGAGCGATAATCAATGGTTCGCACAAGTTGCGGAATACCGCCGCGCGCATAGCTTTTGCCGATGCTCGCAACTTCCTCGCCCAGCGCCTGCTGGGTCTGCCCGGCCAGAATGGAAGCCGAAAGGTTGGTCATGTAAAAGACAAGTGCAACCGCGCCCACTGCAAAGAGCAGGAGATAAAGCGCGGAAAGCCGCGCCGCGGTGGTGCGCATGAGTGCGGAAAACCGGCTCATCATTCCGCCCTTGCGGCAGCGGATTGCTTGCCCCGGCCCGCCTTCAGCATATAGCCAGCCCCGCGCACTGTATGAAGCAGCGGCTCGTCAAAACCCTTTTCAATCTTGGAGCGCAGCCGCGAAATGTGAACATCGATCACATTGGTCTGCGGGTCGAAGTGGTAATCCCAGACGTTTTCAAGCAGCATGGTGCGGGTGACGACCTGCCCGGCATGGCGCATGAGATATTCGAGAAGACGAAATTCTCGCGGTTGAAGCGTAATATCAACACTCTGGCGGCGCGCGGTATGCGCAAGGCGGTCAAGCTCCAGATCGCCGACGCGGTAGATCGTATCCGCCTCGCGCGGGCTTGAGCGGCGCTGCAAGACCTCGACACGCGCCAGAAGTTCGGAAAAAGCATAGGGCTTGGTGAGATAATCATCGCCCCCAGCGCGCAGGCCGGTCACGCGGTCATCCACCTCGCCAAGCGCCGACAGGATCAGGACCGGCGTTTCCATGCCCTTGGCGCGCAGGCCCGCCACAACAGAAAGCCCGTCGCGTTTGGGCAGCATACGGTCCACCACCAGCACATCGTAATTGCCGTTTTCGGCAAGCGCATAGCCGGTTTCGCCGTCGCCCGCGATATCGGCCGAATGCCCTGCCTCGGCAAAGGCCTTTTCCAGATAACGGGCCGCTTCGCGGTCATCTTCGATAACGAGAATTTTCATGGCGCTACTATAGGTTCCGTTCAGCGATAAGGCGATGCCAGGCAGTTACCCGCCTGGCATCGGAATTCGATCAGTTTCTCTGCGAGGCGCCGATCATTCCTGATTGATCGGCAGCGCCACGAAGCGGCTCTGATCATTGCTCTGCAATTGCAGCAGCACCGCCTTACGGCCTGACTTTTCGGCTGCCGTGATGGCCTTATTGATATCACTAGCGGTCTTTACCGTCTGGTTGTTGACGCTCACGATCACATCGCCGGAGCGGATGCCACGGTCAGCCGCATCGCTGTCCGGGTCCACATCGGTAACGACCACGCCCTTACCGTCTTCAGACGGAACGACAGTCAAGCCGTAGGAGTCGAGCGTTTCACCCTGTCCGCCGCCATTGTCGTTGGACTGGCTGCCGCTCTTCCCCTTGTCATTGGGCATGGCAGCAATCGTGACGTTGATTTCCTCGGCCTTGTTCTTGCGCCAGACGGTCAGGGCAGCCTTTTCACCAGGGGCAATATTGGCAACCTTGCGCGCCAGGTCACGCGGGTCCTGAACCGTTTCGCCATTGACAGCGGTAATCACATCGCCCGCCTTGATGCCAGCCTTGGCAGCCGGGCCATCATCCTGCGGCGAGGCCACGATCGCACCCTTTTCCTCGGCAAGACCGAGCGAAGCGGCGATATCCTTGGTCACAGGCTGTATCTGGACGCCGATCCAGCCGCGCTCGACGGAACCCTTCTTGATGAGCTGGTCCACGACCTGCTTGGCGGTGGAGGACGGAATTGCAAAAGCAATGCCCACGCTGCCGCCGGATGGCGAGAAGATGGCGGTGTTGATGCCGATGACTTCGCCGGAAAGGTCGAA
AGCCGGACCACCGGAATTGCCCTTGTTCACGGCAGCGTCAATCTGGATGAAATCGTCATAGGGGCCTGCGCCAATGTCACGGCCACGGGCCGAAACGATACCGGAAGTCACCGTGCCGCCAAGACCGAACGGATTGCCAACTGCGACAACCCAATCACCGACGCGCACCTTATTATCGTCGCCAAAGGCGACATAGACGAACTTGCGCTTCGGAGCGTTGATTTTCAACACGGCCAGGTCCGTGCGCGGATCAGCACCAATCAGCTTGGCATCAAGTTCGGTGCCGTCGTCCAGCACGACGGTATAGGCATCGCCATCGGAAACGACATGGTTGTTGGTAACGACATAGCCATCTTCGGAAATGACAAAGCCCGATCCTTGTGCAACAGGGCGTTCATGGCCCGGGCGCGGCTTGTTGGCCTTGCCGCGACGGTTATCGGAGCGTGAATCGCCACGCGGCTCCATACCAAAATCACGGAAAAATCGCTTCAGCGGATGACCGTCCGGCAACTGGTCGAAGCCGGGAGGGCCGAAGAACTGCGGACCACGGTTGGAAGCCTCCTGCACGTCCTTCTTGACGCGGACGCTGACGACCGCAGGGCGAACCTTTTCCACCAGATCGGCAAAGCCAGCCTGCGGCGGCGGCGTCACATGCACCGCTTCGGCGCGGGCTTCGTTCAGCGCACCAAGCGGGCCGGTTACGACGAATGCGCCGGCAAGCGCTGCGGAAAGCGCGACGGCGGCAACGCCTTTGCGATAGTTGGAAATCCTGGCTCTGGACATCTGTTTCTCCTTGCGAGCAGTAATCCGGGCCAGGAATTTCTGGCCTCGTAATTTTCTATGAGAGCAAGATAGTTAGAGCTACCTTACCGTTCAATTTCCGCAAGATGAAAGTTTCGTAAGGTTTCCAGAGGTTTTGTCGGTCTATTTTTCGTCGAGCAGGCGGGAAAGTTCCGCTTTTTCGCTTTCGGAAAGGGGCTGTGGGGCCGTGCCAGCAGCGCGCCGGCGCCGGAAAGCAACAAGAAGTGCGCCGCCGCCAATCAGGAGAATAATAACAGGAAAGCCCCAGAGCAGCGCCGTTTGCGCGTTGAAACGGGGTTTGAGAAGAACGAATTCGCCATAGCGATCGACGACGAAATCAATCACCTGCCCGTCCGTATCGCCTTTTGTCAGGCGTTCACGCACCAGAATGCGCAGATCGCGCGCAAGCTCCGCATTGGAATCATCGATAGATTCGTTCTGGCAGACCATGCAGCGCAGCTCGGCGGAAATTTCACGGGCACGCTTTTCAAGCTTCGGATCGGCAAGCACTTCGTCGGGGTTGACGGCAAGTGCCGCCGTGGCCTGTAAGGCGAATGTTGCGCTGATAAGCAGGGCCCGGAAAAGATTGCTCTTTTTCATGCGACAGCTTCCGCTTCCTTCGAGGCGGGCCTGCGCACCTTGGAAGGCGCCCCAACCCTGAGGCGGCGGTCGGCCAGCGAGAACAAGCCACCCAGCATCATGACAAGCGCACCGTACCAGATGAGCGTGACCAGCGGCTTCCACCAGATGCGCACGACCACCGCGCCATTGCCCGGCTCGTCACCAAGGGCGACATAGACCTGGCTGAACCAGAGCGTCTTGATACCGGATTCGGTCGTGGGCATCTGGCGCGCAGGGAAGAACCGCTTGGACGGTTCGATCACGGCCAGATCGCGGCCACTGGAATCAAGCAGCGTGAAGGCGCCCCTGTTCTCGGTGAAGTTGGAACCCGTGATGGGGCGCAGTCCTTCAAAACGCAACGTGTAATTCTGGACTTTGGCGGTCCCGCCGGGCTGCATGACGAGCACGTTTTCGGTGCCGAATGTCGTGACGCTGACAATACCGAGAAGCGTCAGACCAAGTCCGATATGCGCAAGCGACGTTCCGAAGACGGAGCGCGGCAGGCCCTTGAAGCGCGCGAATACCTTACCGGCAGATACCTTGCCAATACCAGCCTTGAGAACAAGATCGGTCAGGCT
GCCGAAAATGAGCCATGCCGCAAGACCGATGCCGAGAGCAGCCAGAACCGAATGCGCGGAGGTTCGCCAGAGCATGATGCCGACAACGGCCAGCGACAATGCAAAAGCCGTCATCAGGCGCTGCCCGACACCATAAAGATCACCGCGTTTCCATGCGAGCAGCGGGCCGAAAGGCACGGCAAACAGGAGCGGCACCATCAGCGGACCGAAGGTCATGTTGAAGAAGGGTGCACCAACGGAAATCTTTTCGCCGGTCGTAACTTCAAGAAGCAACGGATAGAGTGTTCCGATGAGAACGGTTGCGGCGGCAGTCGTCAGAAAGAGATTGTTGAAGACCAGCGCGCCTTCGCGGGAAATCGGATGGAAAATGCCGCCTGCGCTCAGGCTTTGGACGCGCAGCGCGAAGAGCGACAGCGAGCCGCCAATGAAAAGCGCGAGAATGCCCAGAATGAACAGGCCGCGCCCCGGATCGGTCGCGAAACTGTGCACGGAGGTCAGCACGCCGGAGCGCACAAGGAAAGTGCCAAGCAATGACAGCGAGAAGGTGAGAATGGCGAGAAGCACCGTCCAGATCTTGAGCGCCGAGCGCTTCTCCATGACGATGGCGGAATGAAGCAGCGCCGTCCCGACCAGCCATGGCATGAGCGATGCATTTTCAACCGGATCCCAGAACCACCAGCCGCCCCAACCCAGTTCATAATAGGCCCAGTAAGAACCCATTGCGATACCGCCGGTGAGGAACATCCAGGCCATGAGCGCCCATGGGCGCACCCAGCGTGCCCATGCCGCATCAAGACGGCCTTCGAGAAGCGCCGCAACAGCAAAGGAGAAACAGACAGAAAAGCCGACATAGCCCAGATAAAGAAGCGGCGGATGGATGGCGAGGCCGATATCCTGAAGGACCGGGTTGAGATCGCCGCCTTCCATGGGAGCCGGGAAAATGCGGGTGAACGGATTGGACGTGAAGATGATGAATGCGAGAAAGGCGGTGCCAATCCAGCCCTGCACGGCCAGAACATTGGCACGCAGCGTTTCAGGCAGATTGCCGGAAAAGGCTGCAACCAGCGCGCTGAAAAGGGTGAGGATGAACACCCAAAGCAGCATGGAGCCTTCATGATTGCCCCAGACACCGGTGATCTTGTAGAGAAGCGGCTTTTGCGAATGGGAGTTCTCGACCACGTTAAGGACCGAGAAATCGGAAACGACATAGGCGTGGATCAGAGCGGCGGATGCCAGAACGATCAGAGCGAAGACTGCAAGCGCCGTTGGCACGGCCACTGCCATCAGTTGCGCATCGCGGCGATGCGCGCCGACCACCGGCACGATGGATTGAACAATCGACAGCGCAAGCGCCAGCACCAGAGCAAAATGGCCGATCTCGACACTCATATGCAGTCTCCCCGGCCCATCATTTGCCCTCCCATACGCCCTTTTTCTTCAGGCTGTCGGCAAGGTCTTTGGGAACATAGTTCTCGTCATGCTTGGCAAGCACATTATCAGCGCGGAACAGGCCATCGGAGCCGAAACGCCCTTCCGCCACCACGCCCTGCCCCTCACGGAACAGATCAGGCGGAATGCCTTCAAACACCACTTTCACGGTCTTGATCGTATCGGTGACGGTAAAGCGCAGCTCGCTGCCCGTGCGGCTGACGGAGCCTTCCTCGACAAGGCCACCAAGGCGGAAACGCGCGCCGGACGTCATATCCCGTTCAGTAAGATCAGCCGGGGTGCGGAAGAAACGGATATCCTGATTGAACGCCGTCAGCATCAGCCCGACCGCAACAGCCAGCACGGCCAGCGCACCGCCAATCAGGAAAAGGCGCTTGCGCTTTCTCTGGCTTACCGTGCGGGCAAAGCCGCCCTTGCCTTTGGGATTACGTGCATTTTGTTCGGCGGTCGCGCTCATTCTTGTGCAGTCCCCACGTCCAGTCCAAGTGTGGTGGCGAAGCTTTGAAGCTCGGTCCGGTTTTCACCCTGAAGAGCCTTCATGCCGCGAGCCA
GCGCATCCTGCGCATCATTGCGGCGGTTGAGGATCATATAGGAGCGGACCAGCCGCTTCCAGCCATCGATATCCCCGCCATTCTGGCGAAGTGTTTCATCGAGACGTTGAACCATGCCTTCCACCATCGCCTGCCGATCTCCGGCGCTAAGCGCGGAAGCCGCCTCGACATCTTCGGCGGTCGGACCTTTCGCCTCCGCCTGTTTGGCGCTCGCGGGGTCACGAAGGATCGCAATGGTTTTCTCAAGCTTGCCGCGCCAAGGTGCATCCGCCGGAGCCTTGTCCAGAAAGGCCTGAAGGCGATCAGCAGCACGATCAGGATGGCCATCCTGCATTTCGCCCTGCGCAAGATAATATTGCGGACGAATATCATCGGGGCTGAGTTCGGCTGCTTTCTTGAACAGCTTCTCGGCCTCGGCCGTAACCGTGCCGCCGGAAGCAGCCGTCAAAGCCTCGCCCAGTCCAAGAATTCGCGCAAGGTTTTCGCCAGCGATCCGGATGGATGTGCGATAGGCGCTAACCGCGTCGGAAGCCCGCCCAAGCCGCAGATAGATCGGCGCCAGCACATCCCAACCCCGCACATCACCCGGATTCTGTGCCAGATGCGCTTCAGCACGCGCAATGAGGTCGGCGACGGAACTGCGATCCGGTGCGGCTGCAAGCCGTGGCGCGAGCGGCATGGAGGGCATGTCGGGCGCACCGAACAGCGGATAGATGCCCCAGGCGACCAGCGGCACGGCGAGAACTGCAACAAACGCCAGCACACGGCCGGAACGGCCCTGCCCGCCATCAGCCGTTGCGGCCATGGCAGCCTTTTCGGCATTGAGGATGCGGCGGGAGATTTCGATGCGCGCCTGTTCAGCACTTTGCAGATCGATCATCCCGCGCGCAACGTCTGCCTCAACTTCGCGAAGCTGGTCGCGATAGACCTCAAGGTCGTTTTTTTCGGCGGGCAAAGCGGCCTGCCTGCGCCGCGTCAGCGGCAACAGGACAGCCAGCGTGGCTGCGAAGGTTAAAAGTGCTGCTACAAGCCAGAATTCCATGGCTGCACACTTAGGCGTTGCCCGGAGAAAAACCAACCGGCTACCTGCACTAAGTTGGGGCCTAGGGCAATTCGCCGCAAACCACCTTTGGCGCCTGCAATCAGGTCAGCGGTGTCCAACTACCATCGGGATTGCGGCAAGCCGTGCCACGCACCGTCTGCTGATCCCCGCCAATGGTGAAACTATGCGAATATTGGCGGCAGTTCTGCGAGCCGACCTGATAGGGTTGCGCCGCCGTCACGTCGCCAGCGTTTGATCCGGCTCCGCTCCACAAAACCGATTTACCCGCTGGCGAATATTCAAGCGCGCGATATTCAGCTTCCAGCGCCTTCCTGCGATCGGCCGCACTCAACTGACTGGCCGAATTGCCAAGCAGGCCGTTGCCAAGCGAAGCGAGCAGGTTCGTTTCCGGCTTCTGCGATGAGCCGCCCAGCGATGGAAAACCGCTCCCCTTGCCGCCGCCCGTCGTGCCGCAGGCCGACAGTACCAGCGACATGACAAACATCATCGATATAACTGGAACGGGGCGAGAAAACCTGGAAACTATCATCATAACAAACCGGCCTTCATAACAATCAGCACGCATACAGTCGCATCATGCGACATTTGTTTTACCTTCTAATCCGCTTTCGGCCAAAGACAACCGTCAATCTTGCGTCAACGGCAACACGACCCGGACAGCCAATCCGCCAAGCGCACCCTTGCCAAGATGCAGACTGCCGCCATATTCGCGCACCGTATCCTGAACAATGGCGAGCCCCAGACCCGTTCCGGGCTTTGTTTCATCGACACGGCTGCCGCGTTTAAGGGCTGCTTCAATCTTGTCTGCCTCAAGCCCCGGCCCATCGTCCTCAATCACGATTTCAAATTGCCTTTGCTCGCCGGCGGCTGCGGCAAGGCGGATATTGATGCATTTGCGCCCCCATTTACCGCCATTTTCGAGAAGATTGCCGATGAT
TTCCTCCAGATCTTCCCGTTCGCCTGCAAAGACAGCACTAGGAAGATCATTCCTGAAGGAAATATTGAAAGTTGGATGCAATTTTGCCGTCACCCGCTGCAACCGTTCCAGAACGGGCGTGACGGGTGTGCGGAAAACCACGCTATCGCGCTGTGCTGCGATGCGCGCGCGTTGCAGATAATGCTGGATTTGCACCTGCATGGCCTCGCTCTGTTCCTGCACGATGCGTCCGGGAGCACCGCCCATGGCGCGGGCTTCGTTAACCAGAACGGAGAGCGGCGTTTTCAGCGAATGGGCGAGATTACCGACCTGGGTTCGCGACCGCTCCATGATGCGGCGATTGTTTTCTATGAGCGCATTCATTTCCCGTGCCAGCGGCGCGATTTCCAGCGGCAAGGTTGCATCCAGCCTGGACGAGCGCCCTTCGCGGATATCGGCAAGCGCCTGACGAACCTTATCCAGCGGGCGCAGGCCGAAAAGAATGACAGCGGCATTGATGAGAATGCTGCCAATGCCGAACACGCCAAGATAGACAAGTAGCCGCGCCCGGAAATTTGCTATTTCGTTGAGTACTTCGCTAAGATTGCCCATGACGCGAAAGCGCGCGACACGGTTGGAATTATCCAGCACCACTTCGGTTTCGACGATGGAAAGCTCTTCATTGTCGAGGCCCGGCAAGGTGTAGCTGCGCATGAAGGAGCTGTCGAAGGGGGCTTGTGATACGGGCATTTCCGGCACGATGCGCCCCACCAGCGAAGGCGACTCAAGCTTGCCGGTCAGATTGGGCGTTACCGGATCGACAGACCAGTACCAGCCCGAAAGCGGGCTCGAATAGCGCAGGTCCCCCAGTTCCGGGCGCCCCTGTAGCGTGCCCTCGCCCGAGACGCTGACCGCCCCGACGAGGCTGAAGAGATGCGCGGTCAAAAGCCGCTCGAAATTGTTTCGCGCAGCCTCGCCATAGAGTGAACTGATGAAAGTGGCGACCACGACCAGCGCCACAATGACCCATAACGTCGAAAGTGTGACGACACGGACGGCGAGCGAGCGCCAGGCGGGAAAGACGCGAGGAAGCTTCAGCTGCCGCTCCCTTTTGCTGCACCCTCGCCGCCCGAGCGCATACGATAGCCCATGCCGCGAACGGTCTCGATGAGATCAACGCCCATTTTCTTGCGCAGGCGCCCGACGAAAACTTCAATCGTATTGGAATCGCGGTCGAAATCCTGATCGTAAAGATGTTCCACAAGCTCGGTGCGCGAAACCACTTCGTCCATATGGTGCATCATATAGGACAGGAGGCGATATTCATGCGAGGTCAGCTTGAGCGCCACGCCATCGATACTCGCCTTCGACGTCTTTGTATCGAGATGCAGCGGCCCACAGACGAATTCTGACGATGCATGACCGGCAGCACGGCGGATCAGGGCGCGCAATCGGGCCAGAACCTCTTCAATATGGAAAGGCTTTGCCACATAATCATCCGCACCCGCATCGATGCCAGCCACCTTGTCGCTCCAGCGATCACGCGCGGTCAGCATGAGAACAGGCATGGTGCGCCCGCTGCGCCGCCAGCGTTCCACAACGCTGATACCGTCCATCTGCGGCAGGCCGATATCCAGCACCACCGCATCATAAGGTTCTGTATCGCCCAGATAATGGCCTTCTTCGCCGTCATAGGCGCTGTCAACGACATAACCTGCGGCAAGCATTGCTTCGGAAAGCTGGCGGTTCAGGTCCTTGTCGTCTTCAACAATCAGGATACGCAAGCGGACAGTCTCCCGTCAAAAGCTACAACGATAGCCCAATAGAGCGGCTACAGCCGATCCATCAATAAAGAGAATGCTTTATTGAGCTGGAACCGCCACTTCGACGCGGCGCGGACGTTCACCATTGCGACCGGGTATCAGCACGACGATAACGCACATAGCCCTGCCATTCTGCATGGTCGGGGTAGCCTTGGCGAGCTGACCGCCCTGCTGGGCGGCCACCTGTTCGCCAACCGC
CGTGCAATCCCCGGCAGTGGCGATCAGAAGGTTGGGCTTCTGCGGCGCAGCCATCGGCAAGGCACCGGCATTGACCGGCAACAGGCCGACACTAACCGCCAGCAGCGCGAAAACTTTGAGAGCAGGGTTCTGTTTCATCATGCCGCCTCATATAGCACCCGAGAGCTGAACGATGCATGAACAACAGTTTCTTCTCCCGGGCCTAAAGGAAATTTTTCAAATTTGACGTTAGGCTGTAGTGACGAGGTTTATGCCGGACAAGTATCAAACGTCAAATTCAATCCACCAGGTTCATAAAATTATAAAATGCGGGATGTGGCGCCAATGCGGCCAATTACCGTAACGATACCCGCAATGGCCGTGGCCAGTTGCAACAAAACATCCGTCATGTTGCCCTGATCGATCATATCCGTCGCCACGCCGAACAGGCCCGCCACTGAAAGGAACAGTGCAACCAGCCCTGCCCAGACCGTGCGCGAAAGATACCATGATTTATTGGCTGTCATATTGTTTTCTCCGATTTTTTTAAAGCATGATGAAGTCAGGCCGCCATCCACTCCAAGTCCTTGTTTAAGCGTGATCTTGTTGCGAAAGCCGCTCATAGTTTTCAAGATCGCGCCCTGATTTGCCCGAAGGGTTCGGGTCGACATCCAGAACCGCAAAATCGCCGGGCCCTGTTTTTGCCCCGATCATCGCCACGCAGAACCGGAATGCATCTTCTCCCAGTTCCGCCCGGCGCTCCGCAAAACCGTAATTCCAGGCGGGCGCCGACACCTGTTCGCTCCGCACCAGAGTGTCGTCCCGCCAGATCTCCACGCGATACGCCTCCCGGTCCTCACCAAGCGGTATATCCTCGCCAAGCCAGCTATCGGCATCGATCCGCCCGCGCCTGATCCATGTAAAGCTCATGTCACCATCGAGGGATCGCATGACACCGAGATGCACCGGACTGAGCGGGCGAAGCGCACGCAAGCCGCCGCTCTGGCGCACCGTTTCAAAGAATTCATCCGAAAATGCCTTGCCCGCAGCGCCCACGCGCCAGCCAAGCTCCAGTCCAAGTTCGGATGCTTGCAGGCCTGCACTCACAACCCCGCTATCCAGCACGATGAATGGCGTTTCAACCGGCTTGTCATCCAAAGCCGCCGCCTCCGTTCCCAACTGGCCGCGCAGCAACTTGCCAAGCCGCCAGCGGTTTAGTCCTACTTCCTCCGCATCGAGAAACTGGAAGATTTCCCATTGGCCATCCGGTGATCTGAGCAGGCCGGTATTGGCTCCATTCAGAACCTGCACCAATGGCCGCGACTGCAATTCGCCGGAATAGAGCACCACTTCCACGAATTGCCCCTCAATCAGCCGCCCGCTCGGCCCGCCTGCAAGCGGAGCCGTCAACTCGCCCATGATGGCCCGTTCGCCAATGAGTGCACGTTCCGCAAATCCGTCATTCGACGGCGAGGCATAGACTGCAACGCCGCGCCAGGGCTTGGCATGGCAGGCGATGCGGAACTGCGCCGCCGGGTCTTCAGCTCCCGGCCATAGTGGCAGATCGACGAAATGAAAGACCGGCTTCATATCCAGTGCCGGGCCGCCCGGCGGGCGTGGCGGTGTTTCCCCCTTGTCAGCGAACACAAGGTTTGGCGCGAGCGCGGCAGCCGTGACGGCACGCGTTGCACCGTCTTCCAGTGCCGTCACAACATAATCGCGCTGTCCGTCCAGCACACCGAGGCGAACCCTATCCCCCACATGCAATGCTGCATTGGACCATGGCAGCGAGAAGCTTGCCGTGCGCCGCTCGGCATGGCGGCGGGCCATCCATGCTTCAGCAAGCGCCGTCGCCTGGCCGTTTTCCATCGAACCGGAAAGGCTGAGGCTTTCCGTGCCCTGCCCCACCTCGCGGCGCGCCGAAGCCCCGACGATCTGAAAATCGCGCAGCGGGTCATTGCAGTACAATTCAGCGGTGGAGGGCAGGTCACCCTGATCCTCGACCACCACCGTCAGCGCCT
CACCTTCCTCGGGTTGTGCAAACTCACCCAGTTCCAGCGCTGCTTCGGCACGGGTGATATTCCTGAAAACAAACTGCCCGGCCCGCTCATAAACATGCACTCCAAACACATTCAGAAGCGGCTCCAAAACACCGCGCGCGCTTGATGGTTCTGACACTATGAAACCGGAAAGATGCCCCTCCACACCGATGCAATCTGCTTCGGGAAGGCCGAAATCCCTGAGGATCGCCGCGATCAGCTCATCCAGAGCAATGCCGCTGATACGCCCGTTAAGCCAATGGCCAAGCCGCCAGTTGGCCGTGTCGCCCCATATATCCTGCCCCAGCGGAAATTCCGGGAAAGGCCGGGTATCCCACGCCCAGAGATAAATGCGCTCCATATCGAGCATCGGCCCGCCATAGACCGGCGAGAGCGGGTTTTCGCCCTGCCAGTGCCGGTAATGAGCACGCAGAAAGCGATCCATTGCGGCATCGGACCGCGACCCGTTGGAAAAATAGGGGGCGGCATTTTCCGAGGATTTCGGATCGGGAAAGACATTGGGCTGGTTCGGCCCCTTGTCTACCGCCGGGCAGCCAAGCTCGGTAAACCAGAGGGGCTTGGATTGCGGCACCCATGCGGTCGACGCAGCCACCTCCACACCATCGATACGGTTATAATGCCTGTTGCTCCACCAGCCATGGAGATCCTTGTAGCGATAGACCCAGGGTTTGGCCGCAAGCCCATCGGTGATTGGTGTGCGCCTGCGGGCCAGCCGGTCCTCGGCACTGGCATAATACCAGTCGTAGCCCTCGCCGGAATTGACGCTGCGGCTTAGTCCTGCAAGATCGTAAGGCGTTTTGAACCCATCAGGATTGCCTTCGGAAAAATCGCTGTCGCGCCAGTCGGCCAGCGGCATGTAATTGTCGATACCGATGGCGTCGATGGCCGGATGCGCCCAAAGCGGATCGAGATGGAAAAAGAGATCGCCCGTTCCATCCGGCGCCTGATAGCCGAAATATTCCGACCAGTCCGCACCATAAGTAATCCGGCAGCCAGCACCAAGCTTGCCGCGCATTTCAGCGGCGAGCGCGCAAAGATGCGAAACGAAGGGGAAGCTGTCGCGCCCGTCACGAATGCTGGTGAGCCCGCGTAACTCCGACCCAATCAGGAAAGCATCGACCCCACCGGCCCGTAGCGCCAGATCGGCGCAATGGTTGAGAAAGCGGCGATAGCCCCATTCCCCATTCACGAAGGCTGCGGCCTGTTCACCGGCCGCAGGCGTCCCGTCTGGCGAGCCTTCCTGCCCGATTGCCGGATGGCAGGTGATGCGTCCGCGCCATGGATAGGCAGGCTGGCCAATACCGCCATAGGGCGAAGGAAGCTGATTTTCCTTCGGCACATCCATCATAATGAAGGGATAAAGCGTCACGCCAAGGCCGCGCGCCTTTGCATCGCGAATAGAGGCGATCACGCTCGCATCCGAAGGCGTGCCGCCATAAGCCGCTCCCTCCCCGCTCATGGAGATCAGGTGTGCTGCACCGCGCGACACGTTCTCCACCTTCCATGTCGTGCTCGGCTTGCGGACAGAGAGGCTCGTAACACCGGGGCGGATACGGCATTGCCCGGCCCGCAGGTCATCCCCGAACCAGGGCAGCACAATGGCCACATGGCGCAGGCCGGGGCAGAGCGCCTGCAACTCATCCAGCGCTGCCGTCCAGTCGCTGCGGGCGCGAATAGCGTTCCGGTTGATCCATCGTTTTTCGCCGGGTACGGGTTCGTCGCTGACCGTATCGGGCGAAAGGCCGAATTCGGTAGAACCGGGGATGAGCGCCACGGCGCGCATGTTCCGTGCCACCTCGCCCACCGGGCGCATGACCTCGAACTGGAACTGCGGAAGACGGTTGCCGAACGTGTCGAGCGGAATGCGCTCGAAAACCACATAGGCCGTGCCGCGATAGGCAGGCGCATTGCCCGTTCCCTGCTTCGCCTCGATCAGCGGATCGGGGCTCTGGGTGGCCGTG
CCGCAATAGACACGCATATCGATCTCGGTCAGGTCAAGCTCCTGCCCGTCCGCCCAGACGCGGCGAATGCCAGCAATCTCGCCTTCCGCCACAGCATAGGCCGCATTGCCGAAATAGCTGTAATTGGTGACTTTGGGGCCACCTTTGCCGCCCTGACGGGTGGTGGTTTTGCGCTCTTCAAAGCGTGTTGCCCAGATCAGTGTACCCGAAACCCGCGCCGTGCCATAAATGAAGGGAAGTGCGCCACCTTCTTCCGCCGTCGCCACGCGCCCGCCATTCAGCCGCGCGCCTTCGATATGACGGGTGGAATTGAGAAGCGCGTTATCGACGGCATAACCGCCCATCGCACCAAGCCCAGCGCCAATGGCGGCCCCCACCGGCCCGAATATGCCGCCAACAGCAGCACCCACGGCCTGCAAAACAACTGTTGCCATGAATCAGACCTTTGGTTCAGGAAAAAGAAAAATGCCTGCCATGCGCCTGCGCCATTGCGGCACCAGCGCCGAGGCCAGCACGCCATGGCCCTGATAGGCATGGATGAAACGGCCCTCGCGCGCCATGATCCCCATATGCTTGGCCGCAAAGCCCGGTTTCCAGCGAAACACCAGAAGATCACCCGGCTGCGGCGCGTGTTCCTCACGCCGCACCATATACCGCGCCGCCGCCTCCAGCATGGGGTCGCCCTGCGAAACCTCCGCCCAGTCGGGCGCGTAGACGCCCGGATTTTCCGGCTCCACGCCGTAAAGCGCCCGCCAGATGCCACGCACCAGCCCCAGACAGTCGCAACTCACGCCAAGCGTGGAGGCGCCGTGCCGATAAGGCGTCCCGATCCACCGGTACGCCTCGGCAAGAACCCGTTCGGCAATCATCATGGAACGAGAACGCCCCCATCGTAATCATTCGTGCTGTTGACATAGGCATAGGCGGCGTCATTGCCGGGCAGATGCGGAAAGCCGCGAAAATTGACGCCGTTGGCGAATTTCGCCTTGCAGGTGGCGAAGCTCTTGTCGCACCCGCAGACAAGCCGGAAAGCATCACCCGCAGCCACCGGCAAGATCATCGGCTCGCCAAGTTGCAGGCTTGCCCCCGCATGGCCGACAACACGGACCGCCCGTCCCCGATTGGCCCCGCTGGTCCATGCAAGACGGCCTTCGGAAAACCAACCCGGCGCAAAGCCGTCAAGGCCCGCCACATCAAGCCGTGTGCCTTCGGCGGCAAGTACTGTTCCCTGCGCAAAAAAACGGGGATCATCCGTATCGATGCCGCAGCGTTTATCCCCCAGCATCGCATCGCAGTGGCGCAGAATACGCCGCCCGCGAACCGCATCGAAGGCTGCGGCAACACCCTTCAACTCCATCACGAAGCGGCTGCCCGAGCGGCTGATTTTGCCCGCCGTCCAGCGCCGCAACAGCATGTGCTGATCCGGCTCGTCCCAGTTGACGAGAAAAGCTTCGATCGTGGCGCCATCGTAGCGGCCCTGCTCGATATCCTCGTCGCTGATCTGCGTGGACGAGAGAACCCCCTCCACCTCGCCGCCAGCAATGCCGAGGCCAAGGGCGGTTGAGGCCTCGCTGCTGTTCAATCCCGTCAGCGGATCGCATATCACCTGATCGACGGTCAGGGGCGCATCATGGTCGGTGAACCCTAAAACAGCGCCGTTGAGCCGTCTTATAAGCCAGGCAAAACAATGTGTTGTCACCTCACCTTGCAAATGCGATTCAAGCGCGGGCGGAACCGGGATCATCTCTTCACCTCGACAATGGGAATGGATGGAATTTCGCCTGCCTGGAACGAGGCTATGCTGGCGGTGAGACGGTCCGTATCGAAGCGCGCAGGCACGTCGAACAGGAAGCCGGATGTGACAGGCACATCCCTTGCCGGTAGATAATCCGGCGTAAAGGTCACAATCCCGGTCAGTGGATCGACGGTAAATGCCTCGCCTTCCTGCACCTTCACACCATCGACACCGATGACCACCGAACCCGGCACGGGAAGCGTTA
TTGGACGGTCATAGCTTTCATATTTCTTGCGAAGCTGAAAGTGCACCGCCACTCCATCGCCGGTTCCAAGCGGCTGATCGAATGCCGATAGGGGTGCCTTGCCGGTCGCGGATGAAAAATCGAACGGATCGCGAAAACGAAAAGCGTGCAGGGAACCGCGCCGCGCCTCGAAAAAGGCGAGGACCGTCTGCAAGTCGTCCAGCGAGCGCAGCCCCGTTCCGGCATCAAAATGCCGCCGAGAATGTGCCCAGCGGGCGTTGCGCTTTTCCAGACCGGAGGTCAGCGTCACAATCTCGTTGCGCCATTCCGGCCCTCCTGTCGCACCAAACGATACGCCAAGGGGAAAGCGCACATCGTGAAAGGCTTCGACCATGTTCAAAGCCTCCGCGCGCCGCGCCGCACAGCGCCTGCCAGCATGGTGGCAAGCTGTGCTTCGGACTTACGGAAGGAGGACGCATCGGGCGAGGTCATGTTGAACACGACCTGAACCGGTTTGCTGCCACCGCCGGTGGCAATGCCAAGGCGCCCGTCGCTGCCGCGCGCAAGCGGCAAAATGGCCTCGGCGCCCGCTTCACCGGTCAGTCCCAACGAGCCGTTGCCCATGCCGAAATAGGTAGGGCTTGAAACCACCCCCCCCTTGGCAAAGGGCATGATACCGCGAATGCCGCTAAGCAGGCCGCCCATCATGGAAGAGGTCAGGCCCTGAAGCGGCTGGAGGCCCGCCGAGAGAGCCGTGCCCGCAAGGCTCGATGCAAGCCCGCGCAGCACGTCTTCCAGCCCCTTGCCGGATGTGATCGCGCCTTTCAGCGCGGAATTGAGGCTGTTGCCGAAGCTGGACGAGCGTTTTTCAAGGTCGCTCAAGGCGCGGTCAAAGGCGCTCGTATCCGCGTTGACGGATACGGTTACAGTTTCATCTGTCATGAATTTACCTGTCGGGGAAAGCAAGCATCAGCGCGTCGAGCGTCTGGCGCGAGGGAGCATCGAGCACAGGGGCCAGGGGGCTGATCGCCGCTGAAAGTTCGCGTGGCGTCATCGACCAGAAAGCCTGCGGGGTGAGCCGCAGCAAACCGAAACCCGCATACATCGCCTCGTCCCATGGGAAAGGCCGTACAGGCGAAGGTTTCGATTCAACTGCGGCACTCAAGGGTTTGGCGCAGAATCCTTTTCAGGCGTTCCGAAAGTAACCGTCAGCAGCGAAGACACGATACGGGCAAAGCCTGCCACGCCGCCCTCTGCCCGCATGTCGGCCACATCCTCTTCGCTCACCGTATGTCCGCCACCGCGAAGCCCCGCGCAGATAATGCGCTGCATATCCCGCGCCGAAAGCCGGCCCGAGGAAAAGCGCGCCACAAGAGCGGAGAGATTGTCTGTCTCGAAAGCCGATTCCAGCTCCGCCAGCGCGCCCAGCGTCAGGCAGAGCGTCCAGTCGCGGCCATCCAGTCTGGCGGCGACTTCGCCGCGGTGGCGATTGGCCATCATCATATCGCCTCTCCGAAGGTAATCAGGCTTGCCGATTCCAGCGCGATTTCAAACGTCACCTCGGCATCGTGATTGCCGCCATATTCCAGTGCGGTGATCTGGAACGGCCCGCTGATGGTGCCAAAATCCGGCAGAACGATCTGCCAGTCGCGAATTTCGCCATCGAAGAAAATACGCCTTATCAGGGCATCCGAGGCCGCATCCTTGAAGATGCCGGAACCGCTGACCGAAGCACGCTGCACACCGCTGCCCGCCAGCAATTGGCGCCAGCGCCCGGCAGCATCGGCATCCGTCACATCGACGGTTTCGGCATTGAACGCGATGCGCTTGGTGCGCAGCCCCGCACAGGTTTCAAACGTGCCATCATCGCGCACCGTTTTAAGCAAGATATCCTTGCCTCTTTGAGCGGCCATATAAATCTCCTTATCCCATACCAATCGTCAGGGGCAGCTATCAGGCGGGTTCCGTCACGGCGCGATAGCGCATCGTGCCCAGATAGCTGCCAAGACCGTCTGTATTGC
GCGCCAGAACTTCGGTCAGCATCAGGTTGACGAGCCTGTGCCCCTCGATCTCGACCGGCTTTTCATCAAGCCGCATTGCAATCTTCGCTGCGATATCGAGCACCCGCTTGCGTCCGCTTTCCCTGGCCCATATCTGAATGTTGAGAAAGTGTTCGCCGCCCTCTTCACTGGCCGTACTCCAGTTGCGGCACATGGTTTCGCCAAGCGTTACATAGGGAAACGGTGTGCGCGCCGGCACATGGTCGTAGACCCGCTCTCCACCAAGGGTTTCAATCAGTTCGCCGTCATTCTTCAGAGCCTCAAACAGGGCTTTCTGGAGTGCTGCCGCGCCAGTCTTCATGCTTGCCCCCGCTTGCATCCTTGACATTCCGATTTTGTTCAATGGCAATCGCCTCACCCGCTGCCAGCGCCTTCCAGCGCAGGGCGCGAACGAGGCTTTCAAAGGTCAGTGTCATGGCAATATTCATCGCCCCTCCTCCCGTGCCAGACAGATGAGATAACGCCCGCTTTCATCCGGGTCGTGCACCGAGCGGAGTGCGAAAATGCGCCCCGCCTTGCGCAAGCGCTTATCGGTCGAAATGTCCTCACGGAAACGCATGAGAATGCGGTGCGTCACTTCCGGCTGCGGCCTTGCGCCGAAATCTTTCTGCGTGGTTGAAACCGGTTCGATCCGCCCCCAGACCATGGCAATTTCCGACCATGTTTCCGCATAACCGCCCATTCCATCCGGCAGAGGCTGCATCGCTTCCAGCGCCAACTCGGAAGTGAGCTGGCCCGGATCGATGAAAAGAACGTTGTTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP019390|390079:436805|400604_402296_-|WP_009364526.1|DBSCAN-SWA MSEANELPERESMEFDVVIVGAGPAGLAAAIRFKQINPELSVVVLEKGGEVGAHILSGAVVDPVGIDQLLPGWREEEGHPFKTPVTADHFLVLGPAGSVRLPNFAMPSLMNNHGNYIVSLGNVCRWLGTKAEELGVEIYPGFAATEVLYNDEGAVIGVATGDMGVERDGTHGPNYTRGMALLGKYVLIGEGARGSLAKQLIAKFKLDEGREPAKFGIGLKELWQVDPSKHKPGLVQHSFGWPLDMKTGGGSFLYHLEDNMVAVGFVLHLNYKNPYLSPFEEFQRFKTHPAIRDTFEGGKRLSYGARAITEGGWQSVPKLSFPGGALIGCSAGFVNVPRIKGSHNAILSGILAADKIAEAIAAGRANDEPIEIENSWRASAIGKDLKRVRNVKPLWSKFGTAIGIALGGLDMWTNQLFGFSFFGTMKHGKTDAQALEPAANYKKIDYPKPDGVLTFDRLSSVFLSNTNHDENEPVHLQVRDMELQKTSEHDVFAGPSTRYCPAGVYEWVDADGNAAADPGVKDVRFVINAQNCVHCKTCDIKDPNQNINWVPPQGGEGPVYVNM >NZ_CP019390|390079:436805|404976_405936_-|WP_002963767.1|DBSCAN-SWA MNAHTRAAADAARLEMNPMLGIGLKIASVAVFVAMSTLLKAAEGVPVGQLIFFRSFFAIFAILLYLGWRGQLGGVFSTRHGFSHFWRGLVGVCSMSMSFFALTKLPLPEAIAINYASPLITVILGAAILHEVVRFYRWSAVLIGLFGVMIIIWPRLTLFSAGSVGRDEAVGALAALGAAVMSAVAMMLVRRLVQTERTPTIVIYFSISASVIALVSLPFGWVVPNWSQLAMLVGAGFAGGIGQILLTECYRHAPMSTIAPFEYTSMLLGLVIGFMLFGDIPTFEMLIGSGIVMAAGGFIIYREHQLSLAPHKVNAPQAQ >NZ_CP019390|390079:436805|423905_425045_-|WP_076770713.1|DBSCAN-SWA MEFWLVAALLTFAATLAVLLPLTRRRQAALPAEKNDLEVYRDQLREVEADVARGMIDLQSAEQARIEISRRILNAEKAAMAATADGGQGRSGRVLAFVAVLAVPLVAWGIYPLFGAPDMPSMPLAPRLAAAPDRSSVADLIARAEAHLAQNPGDVRGWDVLAPIYLRLGRASDAVSAYRTSIRIAGENLARILGLGEALTAASGGTVTAEAEKLFKKAAELSPDDIRPQYYLAQGEMQDGHPDRAADRLQAFLDKAPADAPWRGKLEKTIAILRDPASAKQAEAKGPTAEDVEAASALSAGDRQAMVEGMVQRLDETLRQNGGDIDGWKRLVRSYMILNRRNDAQDALARGMKALQGENRTELQSFATTLGLDVGTAQE >NZ_CP019390|390079:436805|436466_436805_-|WP_004688127.1|head|DBSCAN-SWA MNNVLFIDPGQLTSELALEAMQPLPDGMGGYAETWSEIAMVWGRIEPVSTTQKDFGARPQPEVTHRILMRFREDISTDKRLRKAGRIFALRSVHDPDESGRYLICLAREEGR >NZ_CP019390|390079:436805|391529_392663_+|WP_076770701.1|DBSCAN-SWA 
MAQPRLTPLVESLPSTVPFIGPETLELQRGKPFKARIGANESSFGPAPSVIEAMRNEATEVWKYGDPENYALRHAIAAHHGLKAEHIMPGAGVDALLGLIVRQYVQQGDKVINSLGGYPTFNYHVAGYGGQLVTVPYRDDKPDLDALIDAAAREKPALLYIANPDNPMGTWHEGADIQSFIERLPETTLLILDEAYCETAPASAFPPFETDRPNVLRMRTFSKAYGLAGIRCGYAVGNSAAIKTFDKVRDHFAVSRMAQAAAIAALKDQAYLHEVVGKICAGRDRIAAIAEANGLQAVASATNFVAIDCGRGKDFAQAVLNGLISRDIFVRKPGTPVLDRCIRVSVGVKEQLDQFEAAFPEALEEARKIYAANAENT >NZ_CP019390|390079:436805|397473_397641_+|WP_008508600.1|DBSCAN-SWA MRAGAPSIAAIGLRIFLFTAIFTGPAAAFAYTQIGNHAEMNGAGLVLYISLQRSA >NZ_CP019390|390079:436805|396404_397115_-|WP_009364522.1|DBSCAN-SWA MAEQTKPIELYYWPTPNGFKISIMLEELGVPYAVKYINIGKGDQFEPGFLKIAPNNRMPAIVDPEGPGGEPISVFESGAILQYLGRKFGKFYPTDERKRVAVEEWLMWQMGGLGPMSGQAGHFRIYAPEKIQYGIDRYTNEVNRLYGVLNRRLEGRDYIADEYSIADMACIGWVNAYKNYEQNLDDFANLKRWHETMNARPAVQRGLLVGKEERERAAAAANKEEEQKILFGQKAR >NZ_CP019390|390079:436805|432440_432875_-|WP_076770718.1|DBSCAN-SWA MMIAERVLAEAYRWIGTPYRHGASTLGVSCDCLGLVRGIWRALYGVEPENPGVYAPDWAEVSQGDPMLEAAARYMVRREEHAPQPGDLLVFRWKPGFAAKHMGIMAREGRFIHAYQGHGVLASALVPQWRRRMAGIFLFPEPKV >NZ_CP019390|390079:436805|428573_432437_-|WP_076770717.1|tail|DBSCAN-SWA 
MATVVLQAVGAAVGGIFGPVGAAIGAGLGAMGGYAVDNALLNSTRHIEGARLNGGRVATAEEGGALPFIYGTARVSGTLIWATRFEERKTTTRQGGKGGPKVTNYSYFGNAAYAVAEGEIAGIRRVWADGQELDLTEIDMRVYCGTATQSPDPLIEAKQGTGNAPAYRGTAYVVFERIPLDTFGNRLPQFQFEVMRPVGEVARNMRAVALIPGSTEFGLSPDTVSDEPVPGEKRWINRNAIRARSDWTAALDELQALCPGLRHVAIVLPWFGDDLRAGQCRIRPGVTSLSVRKPSTTWKVENVSRGAAHLISMSGEGAAYGGTPSDASVIASIRDAKARGLGVTLYPFIMMDVPKENQLPSPYGGIGQPAYPWRGRITCHPAIGQEGSPDGTPAAGEQAAAFVNGEWGYRRFLNHCADLALRAGGVDAFLIGSELRGLTSIRDGRDSFPFVSHLCALAAEMRGKLGAGCRITYGADWSEYFGYQAPDGTGDLFFHLDPLWAHPAIDAIGIDNYMPLADWRDSDFSEGNPDGFKTPYDLAGLSRSVNSGEGYDWYYASAEDRLARRRTPITDGLAAKPWVYRYKDLHGWWSNRHYNRIDGVEVAASTAWVPQSKPLWFTELGCPAVDKGPNQPNVFPDPKSSENAAPYFSNGSRSDAAMDRFLRAHYRHWQGENPLSPVYGGPMLDMERIYLWAWDTRPFPEFPLGQDIWGDTANWRLGHWLNGRISGIALDELIAAILRDFGLPEADCIGVEGHLSGFIVSEPSSARGVLEPLLNVFGVHVYERAGQFVFRNITRAEAALELGEFAQPEEGEALTVVVEDQGDLPSTAELYCNDPLRDFQIVGASARREVGQGTESLSLSGSMENGQATALAEAWMARRHAERRTASFSLPWSNAALHVGDRVRLGVLDGQRDYVVTALEDGATRAVTAAALAPNLVFADKGETPPRPPGGPALDMKPVFHFVDLPLWPGAEDPAAQFRIACHAKPWRGVAVYASPSNDGFAERALIGERAIMGELTAPLAGGPSGRLIEGQFVEVVLYSGELQSRPLVQVLNGANTGLLRSPDGQWEIFQFLDAEEVGLNRWRLGKLLRGQLGTEAAALDDKPVETPFIVLDSGVVSAGLQASELGLELGWRVGAAGKAFSDEFFETVRQSGGLRALRPLSPVHLGVMRSLDGDMSFTWIRRGRIDADSWLGEDIPLGEDREAYRVEIWRDDTLVRSEQVSAPAWNYGFAERRAELGEDAFRFCVAMIGAKTGPGDFAVLDVDPNPSGKSGRDLENYERLSQQDHA >NZ_CP019390|390079:436805|420927_421404_-|WP_059242733.1|DBSCAN-SWA MKKSNLFRALLISATFALQATAALAVNPDEVLADPKLEKRAREISAELRCMVCQNESIDDSNAELARDLRILVRERLTKGDTDGQVIDFVVDRYGEFVLLKPRFNAQTALLWGFPVIILLIGGGALLVAFRRRRAAGTAPQPLSESEKAELSRLLDEK >NZ_CP019390|390079:436805|392680_393055_+|WP_076771786.1|DBSCAN-SWA MPSINGQPRSVLFGTLAGLCGALGIASYAGAAHMGESHLGTIAPLLLAHAPALLFLSLISPVSRAARIGGAILVVGLALFCGDLFMRDMTGDRLFPFAAPTGGSLMILGWLCLGCSGWSSANAK >NZ_CP019390|390079:436805|418443_419151_-|WP_002963761.1|DBSCAN-SWA 
MKILVIEDDREAARYLEKAFAEAGHSADIAGDGETGYALAENGNYDVLVVDRMLPKRDGLSVVAGLRAKGMETPVLILSALGEVDDRVTGLRAGGDDYLTKPYAFSELLARVEVLQRRSSPREADTIYRVGDLELDRLAHTARRQSVDITLQPREFRLLEYLMRHAGQVVTRTMLLENVWDYHFDPQTNVIDVHISRLRSKIEKGFDEPLLHTVRGAGYMLKAGRGKQSAAARAE >NZ_CP019390|390079:436805|428302_428509_-|WP_008508561.1|DBSCAN-SWA MTANKSWYLSRTVWAGLVALFLSVAGLFGVATDMIDQGNMTDVLLQLATAIAGIVTVIGRIGATSRIL >NZ_CP019390|390079:436805|432871_433747_-|WP_076770719.1|DBSCAN-SWA MIPVPPALESHLQGEVTTHCFAWLIRRLNGAVLGFTDHDAPLTVDQVICDPLTGLNSSEASTALGLGIAGGEVEGVLSSTQISDEDIEQGRYDGATIEAFLVNWDEPDQHMLLRRWTAGKISRSGSRFVMELKGVAAAFDAVRGRRILRHCDAMLGDKRCGIDTDDPRFFAQGTVLAAEGTRLDVAGLDGFAPGWFSEGRLAWTSGANRGRAVRVVGHAGASLQLGEPMILPVAAGDAFRLVCGCDKSFATCKAKFANGVNFRGFPHLPGNDAAYAYVNSTNDYDGGVLVP >NZ_CP019390|390079:436805|394523_395177_+|WP_076771788.1|DBSCAN-SWA MADIADSAPSRADASSPISVDRHRLYEDAIAMLIGTSFIALGITLYSHAMLMTGSTAGIALLIHYATGTGFGLLYFLINLPFYYFAVRRMGWAFTIRTFAAVALLSGFTRLMPLNVDFTSINPLFAALMGGTLMGMGVLALFRHRSGVGGVNILALYLQDAYGIRAGWFQLGLDVLIMLASLFFIPWENMVLSLVGAVAMNVIIAINHKPGRYIGIS >NZ_CP019390|390079:436805|411829_413941_+|WP_076770708.1|DBSCAN-SWA MILPVFVINMASQPAAYKTVAASIEAYGQGFQPHRIDAVNGHTATQRIGIDDARFDAINGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSSRIHDIIKSLPDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDVAMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYKRLRTSLFRTFDYYVRVHHTLLQPQNPDGSSMKSQSGAYRLPGISLTGELIAAISLLIFMSTVWVETDAYRYIALGFVVAALIRYARTDFWKYEKPMVGWAGLICVAWTFYVLARFAYIYLFYPEMGTGSAEGIYLFPLFYPTLGFALLLFIRRPFLIAVAFMAISLVILIFGFHYDLSWNERAVTLLQHNPIHAAVSSGFIALCAMAFGIHTLNRNTLDTRARVVLCLLALATFIAALIAIYSLYSKGVWLAMAIAFPTFVVLVALTDKSQTSRMAALVCILIGLLSVFAGEHILQRVGGNTANTSWELLSDLKTGDNIMQDFDKAIKNPETGLSERERLMIWANTLHIWHKNPIFGAGVSWLHYWEKRPYQQTDFTLLHNGYLEIAIRYGFLGLLFYGVLTIWAVRCTWQATRAGLIDSAAFQCYVATLVFFAVTILSNSNVRLAIGESYMALAFGFAFYCQYLLQQHNRQYPRTYF >NZ_CP019390|390079:436805|395998_396346_+|WP_002963776.1|DBSCAN-SWA MIRTLILGVTLAAGFAAPAFADEAIVGTWKRPNGTLISYAACGANKFCGTVMTGEYKGKSIGTMSGKDGNYKGEVNKLDEGKTYSGKASVKGNTLSLSGCVMGGLICKSESLARQ 
>NZ_CP019390|390079:436805|423411_423909_-|WP_076770712.1|DBSCAN-SWA MSATAEQNARNPKGKGGFARTVSQRKRKRLFLIGGALAVLAVAVGLMLTAFNQDIRFFRTPADLTERDMTSGARFRLGGLVEEGSVSRTGSELRFTVTDTIKTVKVVFEGIPPDLFREGQGVVAEGRFGSDGLFRADNVLAKHDENYVPKDLADSLKKKGVWEGK >NZ_CP019390|390079:436805|413970_416922_-|WP_076770709.1|DBSCAN-SWA MTVENAKALFFERNLCALIPLDPERASAFLADLEARAREEELADVVALLGRKKAVDFLSAILDLSPFIREALTRQPRILDRIVSATPESALEAILDEISASGTIAGVSESELMTSLRQLKREAHVLIALCDLARIFNTETTTDRLTGLAEACTGAAVRFLLLDADAAGRINLPDRSNPEKDCGWIVLGMGKFGARELNYSSDIDLIVFIDETKPAIGDPYECVDTFSRLTRRLVRILQDRTGDGYVFRVDLRLRPDPGSTPLAIPVGAALHYYEGRGQNWERAAMIKARPVAGDRLSGKQILAELSPYVWRKYLDYAAIADVHSIKRQIHAHKGHGDIAVRGHNVKLGRGGIREIEFFVQTQQLIAGGRFPELRGNQTVPMLARLAERGWITQQARDALAQEYWFLRDVEHRIQMIADEQTHILPEDDEGFARVSHMMGYADPAEFSEIFLAALKVVEKQYAALFEQAPELGAASGNLVFTGDVDDPGTLETLSAMGYERPSDICRVIRTWHFGRYRATQSAEARERLTELTPALLKAFAETKRADESLLRFDGFLQGLPAGIQLFSLLQSNPRLLNLLVMIMSAAPRLADIITRNPHVFDGLLDPAIFSEVPTRAYLEERLRAFLGSATDFEEVLDRLRIFAAEHRFLIGIRLLTGAINGVRAGQAFSDLAELMVGRALEAVEAELQRRHGKVKGAKVALLAMGKLGSRELTAGSDVDLILLYAHDKDAEESDGEKPLAPSQYYIRLTQRLIAALSAPTAEGVLYEVDMRLRPSGNKGPVATHIEAFGKYQRNDAWTWEHMALTRARPIHGDEAFIARIKADIEDVLAMPRDVRKLAGDVREMRELIAQEKPPRDDWDLKLKPGGIIDLEFIAQFATLAGYVKKTPRPFATEEVLANLDPSFADPAMVDGLVEAHRFYTNLSQAIRLCLNDSAGLDQFPPGMRELLCRVAGLPDIERIEYELLEHYRLVRAAFDKLVGHGAD >NZ_CP019390|390079:436805|435481_435895_-|WP_008935501.1|tail|DBSCAN-SWA MAAQRGKDILLKTVRDDGTFETCAGLRTKRIAFNAETVDVTDADAAGRWRQLLAGSGVQRASVSGSGIFKDAASDALIRRIFFDGEIRDWQIVLPDFGTISGPFQITALEYGGNHDAEVTFEIALESASLITFGEAI >NZ_CP019390|390079:436805|390079_391006_+|WP_076770700.1|integrase|DBSCAN-SWA MIIVGETKIDTGDKYAPIIDYNLNYISGKNPKHRLVEHYSVAELTAKYINILWDDGSHKYNVRSFLGEIDEILKGARFSGFDQEMLDSIIGTLRERGNSNATINRKMAALSKLLRKAHKMGDIFNLPEFIRQKERVGRIRFLEHKEEKRLFAAIKSRCEDSYRLSVFLVDTGCRLGEAIGLTWNDIQEQRVTFWVTKSNRSRTVPLTRRARKASHIPHERLKGPFSMLNQVRFRQIWNEAKAEVGLGADDQIVPHILRHTCASRLVRGGIDIRRVQMWLGHQTLQMTMRYAHLATHDLDSCVKVLEIH >NZ_CP019390|390079:436805|421400_423392_-|WP_076770711.1|DBSCAN-SWA 
MSVEIGHFALVLALALSIVQSIVPVVGAHRRDAQLMAVAVPTALAVFALIVLASAALIHAYVVSDFSVLNVVENSHSQKPLLYKITGVWGNHEGSMLLWVFILTLFSALVAAFSGNLPETLRANVLAVQGWIGTAFLAFIIFTSNPFTRIFPAPMEGGDLNPVLQDIGLAIHPPLLYLGYVGFSVCFSFAVAALLEGRLDAAWARWVRPWALMAWMFLTGGIAMGSYWAYYELGWGGWWFWDPVENASLMPWLVGTALLHSAIVMEKRSALKIWTVLLAILTFSLSLLGTFLVRSGVLTSVHSFATDPGRGLFILGILALFIGGSLSLFALRVQSLSAGGIFHPISREGALVFNNLFLTTAAATVLIGTLYPLLLEVTTGEKISVGAPFFNMTFGPLMVPLLFAVPFGPLLAWKRGDLYGVGQRLMTAFALSLAVVGIMLWRTSAHSVLAALGIGLAAWLIFGSLTDLVLKAGIGKVSAGKVFARFKGLPRSVFGTSLAHIGLGLTLLGIVSVTTFGTENVLVMQPGGTAKVQNYTLRFEGLRPITGSNFTENRGAFTLLDSSGRDLAVIEPSKRFFPARQMPTTESGIKTLWFSQVYVALGDEPGNGAVVVRIWWKPLVTLIWYGALVMMLGGLFSLADRRLRVGAPSKVRRPASKEAEAVA >NZ_CP019390|390079:436805|436302_436470_-|WP_004688128.1|DBSCAN-SWA MNIAMTLTFESLVRALRWKALAAGEAIAIEQNRNVKDASGGKHEDWRGSTPESPV >NZ_CP019390|390079:436805|434378_434924_-|WP_076770721.1|tail|DBSCAN-SWA MTDETVTVSVNADTSAFDRALSDLEKRSSSFGNSLNSALKGAITSGKGLEDVLRGLASSLAGTALSAGLQPLQGLTSSMMGGLLSGIRGIMPFAKGGVVSSPTYFGMGNGSLGLTGEAGAEAILPLARGSDGRLGIATGGGSKPVQVVFNMTSPDASSFRKSEAQLATMLAGAVRRGARRL >NZ_CP019390|390079:436805|395235_395493_-|WP_004683353.1|DBSCAN-SWA MAQFQSFNHNENPAAQDRILLEMGMKYAIGRDCEIDVIEAHKWLNIAAIRGNQKAERMRNQVAATMSKSELAAALRCAREWMTAH >NZ_CP019390|390079:436805|433743_434376_-|WP_076770720.1|DBSCAN-SWA MVEAFHDVRFPLGVSFGATGGPEWRNEIVTLTSGLEKRNARWAHSRRHFDAGTGLRSLDDLQTVLAFFEARRGSLHAFRFRDPFDFSSATGKAPLSAFDQPLGTGDGVAVHFQLRKKYESYDRPITLPVPGSVVIGVDGVKVQEGEAFTVDPLTGIVTFTPDYLPARDVPVTSGFLFDVPARFDTDRLTASIASFQAGEIPSIPIVEVKR >NZ_CP019390|390079:436805|395762_396002_+|WP_076770703.1|DBSCAN-SWA MITIGRIDVVRIGAKRALQTAMMKLKIAPSASPETRLRLRLRKLMRGAERMLLQRLKRMNSGFIAAPTIRDSRHFGRNE >NZ_CP019390|390079:436805|409203_411555_+|WP_076770707.1|DBSCAN-SWA 
MASTDAYGAPAGTHCESVRKKRKGRLSGHVSLLAGPAYSKFIIIEPILRRLVPALIIIFLIILGVARVFSLLAWRDDIELQHKAALSGATAHLAQMIERVANGIETGAQLSAKDLQDAMTELRSRGLTSSGMTIAIVDAQSMIKAASGPAGIAGSQIDTILGDAQPLFLFAERAGVLRVVLQGEAAFGALAKPMTAPYSIIAVEPESTIFAKWKRAVSLNVTLFAGTIGVMFAILYAYFSQAARAREADDLSGQIQRRIDMALARGRCGLWDWDMARGRIYWSRSMYEMLGYEAQDAVLPFGDVAAIINEEDGDLYSIAEQAAAGDISHVDRVFRMRHADGSWVWMRVRAEIASEGDLHLVGIAFDVSEQHRFAQQTAEADMRIREAIENISEAFVLWDANNRLVMANSKFSEYAGLPVWTLKPGVPRNEVDAHTRPFTFERRMANEHNRAGGQTFERQLSDGRWLQVNERRTQDGGMVSIGTDITQLKLHQERLVDSERRLMATVHDLSVARKGERDRVRELSELARKYSLEKERAEAANRAKSEFLANMSHELRTPLNAIIGFSEMIQAGTFGPLGSDRYEEYINDIHTSGNFLLNVINDILDMSKIEAGHFSLDREEIDLCPLINETVRIISLQAEEKNIAVETRIEDAMELYADRRAIKQVLINLLSNAVKFTSYGGRITVRARKTGAALFMTIQDTGVGIPKSALRKIGQPFEQVENQFTKTHTGSGLGLAISRSLAELHGGWLRIRSTERVGTVVSVCIPDRNPAPNAGHDARTHAA >NZ_CP019390|390079:436805|435935_436343_-|WP_076770722.1|DBSCAN-SWA MKTGAAALQKALFEALKNDGELIETLGGERVYDHVPARTPFPYVTLGETMCRNWSTASEEGGEHFLNIQIWARESGRKRVLDIAAKIAMRLDEKPVEIEGHRLVNLMLTEVLARNTDGLGSYLGTMRYRAVTEPA >NZ_CP019390|390079:436805|393099_393750_-|WP_076770702.1|DBSCAN-SWA MSIRTITAMAGTAFIMGAAAIAFSSPASALTMKECSTKYQAAKDAGTLGNMKWNDFRKAQCGDDAASAPAAAPAAAPATKKAAKAAAPASNDGAKSLTMKQCSAKYQAAKDAGTDNGMKWNDFRKAECGPGADPVALSTDGDSEPVAPSVAAPKGVKFPTAVSAKYSKESAGKARMHTCLDQYHALKDANALGGLKWVQKGGGYYSLCNARLKGNS >NZ_CP019390|390079:436805|427076_427766_-|WP_076770716.1|DBSCAN-SWA MRILIVEDDKDLNRQLSEAMLAAGYVVDSAYDGEEGHYLGDTEPYDAVVLDIGLPQMDGISVVERWRRSGRTMPVLMLTARDRWSDKVAGIDAGADDYVAKPFHIEEVLARLRALIRRAAGHASSEFVCGPLHLDTKTSKASIDGVALKLTSHEYRLLSYMMHHMDEVVSRTELVEHLYDQDFDRDSNTIEVFVGRLRKKMGVDLIETVRGMGYRMRSGGEGAAKGSGS >NZ_CP019390|390079:436805|419244_420786_-|WP_076770710.1|protease|DBSCAN-SWA 
MSRARISNYRKGVAAVALSAALAGAFVVTGPLGALNEARAEAVHVTPPPQAGFADLVEKVRPAVVSVRVKKDVQEASNRGPQFFGPPGFDQLPDGHPLKRFFRDFGMEPRGDSRSDNRRGKANKPRPGHERPVAQGSGFVISEDGYVVTNNHVVSDGDAYTVVLDDGTELDAKLIGADPRTDLAVLKINAPKRKFVYVAFGDDNKVRVGDWVVAVGNPFGLGGTVTSGIVSARGRDIGAGPYDDFIQIDAAVNKGNSGGPAFDLSGEVIGINTAIFSPSGGSVGIAFAIPSSTAKQVVDQLIKKGSVERGWIGVQIQPVTKDIAASLGLAEEKGAIVASPQDDGPAAKAGIKAGDVITAVNGETVQDPRDLARKVANIAPGEKAALTVWRKNKAEEINVTIAAMPNDKGKSGSQSNDNGGGQGETLDSYGLTVVPSEDGKGVVVTDVDPDSDAADRGIRSGDVIVSVNNQTVKTASDINKAITAAEKSGRKAVLLQLQSNDQSRFVALPINQE >NZ_CP019390|390079:436805|417040_418447_-|WP_025199586.1|DBSCAN-SWA MMSRFSALMRTTAARLSALYLLLFAVGAVALVFYMTNLSASILAGQTQQALGEEVASIGKSYARGGIPQLVRTIDYRSRQPGAYLYLVADPTGRILAGNVESVEPGVLNTDGIIERAFTYRRYGEQAPQVEHRAIAVVIALPNGMRLLVGRDLGEPERFRDLIRNSLVLALGIMGVGALLIWLFVGRRALKRIDDVSRASQRIMDGDLTGRLPVNGSGDEFDRLSGNLNVMLARILELNEGLKQVSDNIAHDLKTPLTRLRNRAEEALGGEKVEPEYRAALEDIIGESDQLIRTFNAILMISRLEAGYSSENLDDMPVAPIIRDVAEMYEPVAEDAGVTLTLGALDDVALHINRELVGQTVSNLVDNAIKYAGGEGRTATVTLLMEKDAQWVRIVVADNGPGIPADKRDHATERFVRLEESRTQPGSGLGLSLAKAVMKLHGGALRLEDNGPGLRAVLEFPLPHREVG >NZ_CP019390|390079:436805|425691_427002_-|WP_076770715.1|DBSCAN-SWA MALVVVATFISSLYGEAARNNFERLLTAHLFSLVGAVSVSGEGTLQGRPELGDLRYSSPLSGWYWSVDPVTPNLTGKLESPSLVGRIVPEMPVSQAPFDSSFMRSYTLPGLDNEELSIVETEVVLDNSNRVARFRVMGNLSEVLNEIANFRARLLVYLGVFGIGSILINAAVILFGLRPLDKVRQALADIREGRSSRLDATLPLEIAPLAREMNALIENNRRIMERSRTQVGNLAHSLKTPLSVLVNEARAMGGAPGRIVQEQSEAMQVQIQHYLQRARIAAQRDSVVFRTPVTPVLERLQRVTAKLHPTFNISFRNDLPSAVFAGEREDLEEIIGNLLENGGKWGRKCINIRLAAAAGEQRQFEIVIEDDGPGLEADKIEAALKRGSRVDETKPGTGLGLAIVQDTVREYGGSLHLGKGALGGLAVRVVLPLTQD >NZ_CP019390|390079:436805|402684_403557_+|WP_009364527.1|DBSCAN-SWA MTDGRETLDIEGLLRFYAEAGVDVPLCETPIDRFAAATHPAPAQSRMQAAAQQPESNPAQAREERARTVAVPSPSSKPVQAAMDLPDNAQIALAREAASQAETLEELREKLAAFDGCNLKFTAKNLCFADGDPSSDIMFIGEAPGRDEDMEGLPFVGKSGQLLNRMIEAIGLKREEVYIANTIPWRPPGNRAPTPLETELCRPFIERQIELAAPKVLVALGGPAGKALTGAAEGILRLRGNWKIHRTPTGMEIPVMPTLHPAYLLRTPAQKRFAWRDFLAVKLKLAELRG >NZ_CP019390|390079:436805|427844_428141_-|WP_002971526.1|DBSCAN-SWA 
MKQNPALKVFALLAVSVGLLPVNAGALPMAAPQKPNLLIATAGDCTAVGEQVAAQQGGQLAKATPTMQNGRAMCVIVVLIPGRNGERPRRVEVAVPAQ >NZ_CP019390|390079:436805|425145_425598_-|WP_076770714.1|DBSCAN-SWA MMIVSRFSRPVPVISMMFVMSLVLSACGTTGGGKGSGFPSLGGSSQKPETNLLASLGNGLLGNSASQLSAADRRKALEAEYRALEYSPAGKSVLWSGAGSNAGDVTAAQPYQVGSQNCRQYSHSFTIGGDQQTVRGTACRNPDGSWTPLT >NZ_CP019390|390079:436805|399359_400328_+|WP_154144614.1|DBSCAN-SWA MSKSRLNLPFLILVAATLVSIPLVLSFLNSLHPAFDTFSHLRIHLAVLMGLLALPLLFTKLRREGAMVLLLAVFAIAVTPHVFPASEDAHAGEAAQPHYRLLQMNLRFDNGSPEQALSLIAHIRPDVVTLEEVSSMWREKFGHIASAYPYSIFCPHPGAVFGVAILSRRPFIADSTPACDPKGMMAVASVDFGGRPVDVAALHLHWPWPFQQSEQIEALSEQFRGLSENAILSGDLNATPWSATTKRIAELAAMTPAPPTGPTWLYRRLPASLRFAGLPIDQTFAKGRVAISKITRQQPIGSDHLPVLVEFSIIPQPETVTS >NZ_CP019390|390079:436805|403715_404972_+|WP_076770705.1|DBSCAN-SWA MAGQLLPIAALLASTFLMLLAGGLAGILLPLRGGMEGWSTTTIGWMGTSYSLAFTIGCIFIPHLVRRVGHVRVFSALLTLLSMALLFHALVVNPAAWMIFRGIAGFSLAGSYMIIESWLNERVTNESRGMIFSIYMIITMVGLLLGQYILPFGNAATQTLFIICAIIYASALLPTALSSAQSPNPLTQVSLDLKGLYRRSPAAVVGSFIAGIVAGTWNFLAAIYGEMNGLSTFGIATMLASAMIGGAIFQYPLGRASDFVDRRYMMILAGAIGFILSFIMVLFHPTSPYTLYAMMFLFGSVVFPIYSLNVAHANDYADASEFVKISGGLLIVYGVGSVLGPAISGPLMDVIGANGFFVTMAIAYCIYGLHAWWRIYRLERPAISDQKTEFKFHTPDGQSTPETMQLDPRVEGTSGQAQ >NZ_CP019390|390079:436805|435143_435485_-|WP_070997847.1|DBSCAN-SWA MMMANRHRGEVAARLDGRDWTLCLTLGALAELESAFETDNLSALVARFSSGRLSARDMQRIICAGLRGGGHTVSEEDVADMRAEGGVAGFARIVSSLLTVTFGTPEKDSAPNP >NZ_CP019390|390079:436805|406161_408813_+|WP_076770706.1|DBSCAN-SWA 
MRTETGHTFRLEDYRQTPYAIPETKLDFTLEPEKTIVRATLTIERRPDTPAGTPLVLHGDALKLVSLAIDGKALSDNSFSATPDQLAISDLPKDARFTLQIVTEVNPTANRQLSGLYRSSGVYCTQCEAEGFRRITYFYDRPDVLSVYTVRIDADRKAAPILLSNGNPVESGMVEGHPERHFTVWHDPHPKPSYLFALVAGSLGVVKDHFTTRSGRPVDLAIHVEHGKEGRALYAMDALKRSMKWDEEKFGREYDLNVFNIVAVSDFNMGAMENKGLNIFNDKYVLADPETATDADYAGIEAVIAHEYFHNWTGNRITCRDWFQLCLKEGLTVYRDHEFSADQRSRPVKRIAEVKILKAQQFPEDAGPLAHPVRPREYREINNFYTATVYEKGSEVVRMIRTIIGPELFRKGMDLYFERHDGDAATIEDFIQVFADVSGQDFSQFALWYDQAGTPKVEPGFHHDAAAKTFTIKLEQSLAPTPGQSIRKPMHIPIAFGLIGPDGKDMQPSSVEGGEVRDGVIHLRRPSETIVFHGIEARPVPSLLRGFSAPVNLAAPLTAEDRIFLALNDSDPVARWQAMNSIFSAALLDGAKRVRGGHQPETDPKIVALAGKVAFDEMLDPAFRALCLTLPSESDIAREMGNNVDPDAILASRNHLIAAIASAYADGFAGLYDTLKQEGAFSPDAAPAGKRALRSALLDYLSVQEKSPERAERQFVEADNMTDRATALAVLVHRFGDSGEARQALATFEQKFGQDALVMDKWFIVQATRPGETALEAVRELTRHPLFSLDNPNRVRALIGAFTASNPTGFNRRDGAAYGFLADTLLTIDPKNPQLSARLLTAMRSWRSLEEVRREYARAALARIAGAGKLSTDLRDIIDRTLA >NZ_CP019390|390079:436805|397681_399184_+|WP_008508597.1|DBSCAN-SWA MTQRIEHPFLTDIKTPPPMEPEVFSDAQAAVAALCKLYERNTAFLRSAFEKVARGEIAPQRYRAFYPEICLSTSSFAHVDSRLAYGHVSTPGDYSATVTRPDLFGHYLREQIRLLMRNHGVTVTVRESSTPIPIHFAFKEGAYVEASVASAFTHPLRDLFDVPDLAATDDKIVNADFEPAPGEPMPLAPFTAQRIDYSLHRLSHYTATSPSHFQNFVLFTNYQFYMDEFCAYARQLMAEGGGGYDQFVEPGNIVTRAGETAPSTGNPLQRLPQMPAYHLQKAGHGGITMVNIGVGPSNAKTITDHIAVLRPHAWLMLGHCAGLRNSQQLGDYVLAHAYMREDHVLDDDLPVWVPLPALAEIQVALEEAVEEITGLQGYDLKQIMRTGTVATIDNRNWELRDQRGPVQRLSQARAVALDMESATIAANGFRFRVPYGTLLCVSDKPLHGELKLPGMATEFYKRQVAQHLRIGIRAMEKIASMPDERLHSRKLRSFYETAFQ |
42 | Rhodobacter_phage(37.5%) | tail,integrase,protease,head | attL 380829:380844|attR 391021:391036 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
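The keyword rule described above can be sketched in a few lines of Python. This is only an illustration under assumptions: the function name `phage_keywords_in` is hypothetical, the keyword tuple copies the examples listed in the note (the full list used by the site may be longer), and plain case-insensitive substring matching is assumed, which may differ from how the original annotation pipeline matches keywords.

```python
# Keywords copied from the note above; the site's full list may differ.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keywords_in(annotation: str) -> list[str]:
    """Return the phage-related keywords found in a protein annotation.

    Assumption: case-insensitive substring matching. This is a simplification
    and can over-match (for example, 'tail' inside an unrelated word).
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


if __name__ == "__main__":
    for label in ("baseplate assembly protein", "hypothetical protein"):
        hits = phage_keywords_in(label)
        status = "colored" if hits else "not colored"
        print(f"{label}: {status} {hits}")
```

A protein would be treated as colored whenever the returned list is non-empty; the summary rows in this record (for example `tail,integrase,protease,head`) look like the union of such hits across a region, though that is an inference from the table rather than something the page states.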
DBSCAN-SWA_4 |
1520986 : 1528235
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP019390|1520986:1528235|DBSCAN-SWA ATTAAGGAGCGTTGCTGATGATCAGTTCCCGCACCTGTTTTCCCTTTCCTCCTGAAACCGAATAGGTCAGCTCAGCACCTTCGATCTGAAAGCCAGCGAATATGGTTCGTATCTGTGGAACATCATTGATCGACAGGACGAACCGTCCCTTGATGCGGTCAAGGCGCTCGGCAAGAATTTCGAACTGGTCCTGACTGAATAGCTCCTTCCCGTAAACACCTTCCGTGCCCCAATAAGGCGGGTCGAGATAGAACAACGTTTCTGGCCTATCATACCGATCAATAAACGCCTGCCAGTCCAGGTTCTCGATAACCACCCCAGCAAGCCGTTCGTGAACCTCCTGGAGTATAGGTGCGAGCGTTGTCAGGTTGAATCTTGATGATCCGCTGTAATTGATGCCGAATGATCGTCCGGCCACCTTTCCTCCAAACGTCAGGCGCTGAAGGTAGATAAAGCGCGCAGCCCTCTCCAGATCGGTTAGTGTATCGGGATCGGTGGCTGTCAGCCGTTCGAAGTCCGCTCGGCTCGTGATCTGGAACCGGAGCGTATCCATGAGCTGCGGATAATGACGCTGAAGAATGCGAAACAGGTTCGCGACGTCTTTTGAACGGTCATTGATGAACTCGGCGCGTGGCGCTGCGATGCGCCTGAAAAACACCCCGCCCATGCCGACGAACGGCTCAGCATACAGTGAATGCGATATAGCATTAATCTTCTGGCAGATGCGCTTTGCAAGCTGGCGCTTCCCACCGATATAGGCTGCGGGCGGCGAAACAGGATCGACGGTTGTGAAATCGGCCTTCTGGCTCATGTACGTTCCTAAGAGAATCGGTCACAAACCAATTGCCCGAAAGGGCACGGATGCGACGGTTATCTCGGATGGCTGTCGGGCGGGTTCTATCTTGCCGGATCGGCCCGCCGCCACGAATTGACGTGGCCGTCCCTACTTCCGTTTTGGTGCCGGAGGTTTTCCCCAATTGTCAGTTGCTGACCAAAGGTGAATACCATCCATGGACACCACGACCGGCCCATCAAGCTCAGTCGAAATAGTGCACTTCACCTCCGCTTTACGCTGCTGATGAAATTCGATCTCCGATCGTGTGAGTATTTCGCGATCACGCAAGGTCAGATCGCCAACCAGCCTTTGGCCGGTCAGATCAGGCCAATATTCGCCAGCACCAAGCTTCCACAGGTGGCGCATAGGCAAACTGTTCGCGACATGGCTATGCGGCGGATAAACCTCAATCGCCACTGCATCAGGTCCGAAATGTTCGTTCTTCAACTCCTGAAGCTCATCCCATGTGATCGAGCCGTCATGTTCGATCGATAGCAGGCCGCGCAACTCGTCCAGAAGAACGCCAATAACTGCACTCATAGTTCACCTCAGAGCTTTGCGGCCTGTCGCCAGAAATTGTCCTTCTGCTCGATTGTCCATTGCATGGCTTGCCGAACGATTTCCGCGAGAGGGTTGGTACGATTGAAATTCTGGGCACCCTTCACCAGCATCTGGGCATTGAACTTGTCATCATCCGTGGGGAGCTGATCAATGATCGCCTGTAGAGGCGCTGGAATTGCGCCGTCCAGTGCATCAAGCGCCTCTTGTCTGGTGATGATCTCAAGAACGGCCAACTGCTGAAAAAACTGGCGCCGGCTGATTTCATCTGGAACCAGCTCAGGTTGCGGCTCTGGCTCGACAGGTAGAGCGAACTCTTCACCGTCGTATAGCCAGCCACTCTCCACATCTTCATCACAAGGCGTTGGAGCTAATTCAACTGCAAAAACATCAAGCGGATCGATTACATCATCAAGTTCCACGACATCGATGACGCTGCCCTGGGATATCAGTGCATATCTCATGTTCTCATACCTCAGTACTGGAGAATAATTCGGCCATCGAAGCCAGATCCACCGGCAGCGTTGGTG
CATGCGCCAGAACCGCCACCGCCCGGTGCGTATCCAAGATAGCCGATGTCATTGATGTGAACGTTGGAGGATGGCGTTCCGAATGCACCGCCTCCGGCTGGGGCGATCCACGCACCCTCCGCGATGCGGTAGCCGATCGAACCGCCTTGGCCCGAAAAATTCGCACTGCCACCAAATCCGCCGCCTGGGTCACCACCTGATACTCCGCCGCCCGAAGTGGAGTAACCACCGCCTTTGCCGCCCGTAGCTGATATGCCACCGACACTCGATGTGCCGCCATTCCCGCCTGTTTGAGGGGATGCGCCACCAGCTCCGCCAGCGCCTCCCGCACCGACGACGAGTGTCAGCGATTGTCCAGGCGTTACCGCGACCCATCCCTCAGAGTAGCCGCATCCGCCGCCTCCACGACCGACAGCGTTATTGTTTGCAGCCCCGCCACCACCGCCGCCACCGCCCCATACGCGCGCAAAGACCCAATAAACTCCGGTCGGGACAATGAAAGTGTAAGTTCCAGCGGCTGTGTAGGATTGCGTATGCCTTGCTGGCATTGAACTCACGTAGCTGCCTGCGATCTGCCACGCCGCACCGTCATAGTTCAATTCAACAATTGCCCCTGACAGGAGGTCGAGTGCTTCCGTCGCGCCGCCATCAGCCTTGCGAATAGGCTTGGCTCCAAGGCCGTTGACATTGAGCGTTGCCGGACCGCTGTTTGTCTGAGCGATCTTCACCCGGATAGCCATCCCAACGGCCAGCGATTGCGGAATTGGTGATAGGGTGGCTGTCAGGGCATTAGCGCTGCCGCCTGCAACCGCATAGCTCCATTGGCTTCCCTGAACGTCGAGAGCGAGTTTTTCGATGTATGTTCCCGCGATCTTCTCGAATACTCGACCATCGGGAAGACTGACCCCATGGCCGTCTTTCGTATTCACAATTCGCCACGATCTTCCGGTCCATTCAGCAAGCTTCTGCTGGTTACCAGCCCAAGCACCACTTGCCCCGGCAGGGATCACATATGCATCGCCCAAGACCGCATCGTTTGGCGGAGCTGTTATGGTCAGCGAAATGACCGGCAACCAGGCGGCACGGTTTGCAAAACCAGGAGCGGCGATCGATTGCAGCGCTTCCCAAAGCTGTTGCTGGTTTTGAGGGTCAAGGGCGAAGCCCGAATTCTCGATGACAGCGCAGATTTCCTCCTGCACATCATTCAGAATTTTGTCAGTGACTTCGGTTCCTGGAATACCCGCTGCAGCATTCTGCGAGCGAAACCCACGTCTGCCACCGCCAATATCGACCCAATCCGTGCCGTTGACGCGATCCATGTCAGTTCTCCACGTATGAGAAAACAAGTTGAGTGTGCGCGGGCTTCAGGCGGCGAAGCTCGCATTCGATATCGCTCATCTCGAAGCCACCGAGTGGCTGACCGGCCGTGTTGACGCCTGCACGGAATATCCATTCCGAGATAAGCTGAAGCTTTACCCGCCACGTGAATTGTTCGCCTTCGGCAATCAGAGCCTGACCGGCCCGAAGCACTCCCGCCTTGGAAGGCCAGAACTCTTCAATTTCGATGGTATGTCCGAGACTTGCCGCCATCTTGACGAAGTACGGAATGCTTGCACCGCCCTTGGCGATCCAGCGCTGGTGCGCCCGGCGCTGGCGCTGTTCGAGCGTCTGATTACTAAGATCGCGACCGCAAGGATCGGGGCCAAGAACGCGCTCGAAATCAGGCAACAGCGCATTGGCCGTGCGCGGATCAATTTCGTTCATCAGGCTTTCGGCGTCGGCCTCGGCCTGGACGAGAACCTTTGCAATGCTATCAAGAATCGCATCGAGAACGCCTTCACGCTTCCCGAGCGCAAATCCGCGTGGAAGCTTACCGATGAGACTGGCGAGGATGGTTGACTGCGGACGGGTCATAGCGGGTCCTCAAACATGATCTCGCCTGGAAGCGGATACTGGTCTCGGTCCAGTGTGAACGGCACCGAAGGCGAGATCAGATCGTG
GGCGTATTCGCCCGAGGCGGCTGAGATCGCCTCGGAAATCCGTGACGGTTCAATACGGGCACCGATCGGACTTTCGTTCTGGTCGTCGTTCGCATCGCCGATCGACGCGATGAAGGCCGCATACGCCTCCTGCACGGCTGCGCGTGTGGCGACCTTGTCTGGTCTCACGCGGACCGTGATCGGGATCGCCCGCATTTCGGCAGGCACAATGACTACATGGGCTGTAACCGGCCGAACGCCTGAAGACGAACCGGGAGCGCCCAAATAGCTGAGCATCTCAGCCATTTCCGATTCAGTCGGCGCACGGGCAGAAGTGCCATCTTTCATGGCGACAATGACGCCGACAGAACCACGGCCGATCCAGTCGGTTTCCGGTTTTACGGCACGAACCGCGAATTTTTCGCGCAACCAGGTCGGAAAGTCGAAGCCAGCGCCGCCATGCGGGCGCTGGCGGATGTAGGCCATGGTCGCGTCGGCCAGTTCCGCTGGTGTTTCCGCTTCCGCGCCGCCTGCAATTCCTTCGGCCGCGACTGCAATACGATTGATCTCAGGAAAGGCATTGACCGTGCGCAGCCTGATACCAGCTTCCAGATTGCCCGTTGGTCCTGCAATAGAAGCGATGACCGAGACAGAAGCGCCGCCGTTTGGACCTATGATCGCGGTTTCCGTAGTTTTAAAGATCGTGCCATCAGAACTAGCGATTTCAAGATCGGCAGGAATGGGTGTTCCTGCCGCGCCCTCAATATCCAGCCTGCCGACAGCGAATGTGGCCGGACGGGCTACAATGCCCCAGATATCGGCATGACGCTGGACAAACTCGTCTTCGGCCGTATCAACAAAGTACTGTCTGCCCCACCATGCGACATGATCGTGGATCTCGCGCGCTTCCAGTGCGACGGCACGGCCAATCATGGCGAGCATGCCGCGAGCCGAACGAACGGCACGGGAAATCGCAAGCGGATCAACAAGCGGCCGCACAACGGAAATGCTGAACTCCATGGCCGAGGCGATGCGTTCAGCGATGGTCTTTGCGGATGGAACGGGCCAAGGCATCAGGCAGTCCTCCGCCCGGAAATAGCGGTGTCATCGACAAGGACGCGCCAGCCGAGCATTTGCGGCGCGACCCATACGGTCTCGATCTCGGCTGGAATGCCGGTGTCGGAGGTCACCCATTCAACGCTTTCGGCAAGCCAGCTCTGGTAAAGAAGGCGGGTTGTTTCGGTTTCTTTAGCGCGGTCAAGCAACCAGCAGCGCGAACCGATACGCTCACCATAGGGATCGAGCGCGTCGGCAGCAGCACCGCGCCGGACATCGATGCCGGACCCTGTCAGAAATTGTGAGCGGCCCTCCGGCAGCGGATCGTCGGGATTGGCACGGCGATCAAGGCCGACAGAAAGAAGAACCGGTGTGATTGGGGTTTCATCAATGACGAGATCACCATCCGCGCCGATCTCCAGATCAGCGCGACGGGTTTCCGGGTCATAGATGAGTGCCACATCGTAAAACATGCCCCGGTTCTATCGCGCGCGCGCGAAAACGATCATGCCCGCCGAAGCGGGCACGAAACTGATTATCCTTGCGGAACGCCGGTGGTACTGTTTCCGGGTGAGACGCCGATATGAACATGGGTTGAGCCGATGTTCTTGCCGTCATGAGTGACGGTTCCTCCCTCGATGGCAACGCCACCGGCTGATATTGTCACGGTGACGCCGCCAACCTTCAGCACGATCGACGCTCCAGCCTGAACGCTGATGGTACCGTCAGCGCCGACAAGTATTCCGTCGCCATGCTGATTATAAAGCGCAGTTTCACCGGGCTTCAAACCGCCCATACGCGCCGATGGATTTCCGACCGGCAAGAGAACAATGTCATCCTCGTTGCCACCAATGGCGACGGCGAGTGCAAGTGCGCCGTCTTCAGGTGCCGAGGTCGCCAACCCGTAAGGCTGCATGATCTCGACCTTGTCGCGCCAGATGCCGGGGGCAACTTCCACGGAAGCG
GTCTGGGTTTCGCCATCATCGTTGATGTTTTTCAGAACGACGCGGCGGACAATCCCACGAACTTTGCTGGCAGTTTCATGATCCATGGTCCACCTCACAATGCCGATGCGGTGCCGTCCAGCGGCCCGCCCGATCCCTTGCCCTTGCTCTTTTTGCCGCTCTTCTTACGTCTCTTAACGTTCTTTCGACGGCCCTTTACAGGCTTGTTGTCGAAGGCTTCCGGCGACGTGACGGCAATCTCTGTTTCGCAGCCACTGTCCTCCTGCTGCAGGAACGTGACGCGGGAAATCAGCATGTCGCGAAACACATCCTGAAAGGAATCGGAGACCTCGACCATTTCATTGACCCGCCACAGACGCCCGTTTGCCTTGTAACCATGAACGCGATAGGAGATTTCCTCACTCTCGCCGCGCTTGGTGCGCATACGCCAGTCGGCTTCATCCTTGCAGCCCTTGTCGTCGGCCTTGGAGCGGGCCAGATGGACGATCGGGCGATAGCGCCTGATTTCGTCATCGGTCGCCTCGCCACTGGCGACAACGCCGCGCCGCTCGCGTTCGGTGGCCGAACCGTCGGTTGCTTGCCTGTCTTCCGGTTTTACCGGAGCGCTGCCGCCCAGAAGCGGTGCAGCGCGGCCGTCACGAACGGTTGCTGCCTTTTCCGATTGCCCGCGCACAATAACCTTGGAATGGCGGTCCTTATGGGTGAACTGGCCCGAAGAGGCTTTCACGTTCCCCGGCAGCGAAAGTGCTGCCGGAGCGCGATTGGCTCCGGTCCGGGTGATGACGACACCGCCGACGCCATCCGACATGACCAGGGCATGGCGCTGGCGCGTACCCTTGTCGATGGCACTCAGGCCCGTTTCGGAAAGATCGATACCGTAGCGCGGGAATGCATCGCCAGTATCGATCTCGGAACGGACGGAAAGCCCGAACGGCTCTGCAATGCGCTTGACCGCTTCTTCCAGCTTCACGTTGTTGAACTCGGACGGACCAGTCGGCGCGGCCGTGCTGTCGACCAGATCGCCCGCCTTGTCCTTGCCAGATATCGAGACCATGGCGCGCTCTTCATCGATATCGGGTGAAACAGTTTCGATATAGCCTTTCAGGACGAGCTGATCCTCGACATAAGCTTCCGCTTCCATACCAGGTTTCAGTTTGAAGACTGCATTTGCTGGCGATGCAAAATCGAAGGTGGAAAGCGCCCGGCTATAATCGCGCAGCTCGAAGCTGAAAGACCCGCTGAAATCCTTGAGGTCGCGGGTGATGTTGGCATTCGTCCACTGGTCGAATATCTGCCCGTTCACCTTCAGCCAGATTGAGCGCGCCAT
Protein sequences of DBSCAN-SWA_4 >NZ_CP019390|1520986:1528235|1520986_1521796_-|WP_076771236.1|DBSCAN-SWA MSQKADFTTVDPVSPPAAYIGGKRQLAKRICQKINAISHSLYAEPFVGMGGVFFRRIAAPRAEFINDRSKDVANLFRILQRHYPQLMDTLRFQITSRADFERLTATDPDTLTDLERAARFIYLQRLTFGGKVAGRSFGINYSGSSRFNLTTLAPILQEVHERLAGVVIENLDWQAFIDRYDRPETLFYLDPPYWGTEGVYGKELFSQDQFEILAERLDRIKGRFVLSINDVPQIRTIFAGFQIEGAELTYSVSGGKGKQVRELIISNAP >NZ_CP019390|1520986:1528235|1522368_1522842_-|WP_076771237.1|DBSCAN-SWA MRYALISQGSVIDVVELDDVIDPLDVFAVELAPTPCDEDVESGWLYDGEEFALPVEPEPQPELVPDEISRRQFFQQLAVLEIITRQEALDALDGAIPAPLQAIIDQLPTDDDKFNAQMLVKGAQNFNRTNPLAEIVRQAMQWTIEQKDNFWRQAAKL >NZ_CP019390|1520986:1528235|1527002_1528235_-|WP_076771243.1|DBSCAN-SWA MARSIWLKVNGQIFDQWTNANITRDLKDFSGSFSFELRDYSRALSTFDFASPANAVFKLKPGMEAEAYVEDQLVLKGYIETVSPDIDEERAMVSISGKDKAGDLVDSTAAPTGPSEFNNVKLEEAVKRIAEPFGLSVRSEIDTGDAFPRYGIDLSETGLSAIDKGTRQRHALVMSDGVGGVVITRTGANRAPAALSLPGNVKASSGQFTHKDRHSKVIVRGQSEKAATVRDGRAAPLLGGSAPVKPEDRQATDGSATERERRGVVASGEATDDEIRRYRPIVHLARSKADDKGCKDEADWRMRTKRGESEEISYRVHGYKANGRLWRVNEMVEVSDSFQDVFRDMLISRVTFLQQEDSGCETEIAVTSPEAFDNKPVKGRRKNVKRRKKSGKKSKGKGSGGPLDGTASAL >NZ_CP019390|1520986:1528235|1525960_1526416_-|WP_076771241.1|DBSCAN-SWA MFYDVALIYDPETRRADLEIGADGDLVIDETPITPVLLSVGLDRRANPDDPLPEGRSQFLTGSGIDVRRGAAADALDPYGERIGSRCWLLDRAKETETTRLLYQSWLAESVEWVTSDTGIPAEIETVWVAPQMLGWRVLVDDTAISGRRTA >NZ_CP019390|1520986:1528235|1524228_1524822_-|WP_076771239.1|DBSCAN-SWA MTRPQSTILASLIGKLPRGFALGKREGVLDAILDSIAKVLVQAEADAESLMNEIDPRTANALLPDFERVLGPDPCGRDLSNQTLEQRQRRAHQRWIAKGGASIPYFVKMAASLGHTIEIEEFWPSKAGVLRAGQALIAEGEQFTWRVKLQLISEWIFRAGVNTAGQPLGGFEMSDIECELRRLKPAHTQLVFSYVEN >NZ_CP019390|1520986:1528235|1522853_1524227_-|WP_076771238.1|DBSCAN-SWA 
MDRVNGTDWVDIGGGRRGFRSQNAAAGIPGTEVTDKILNDVQEEICAVIENSGFALDPQNQQQLWEALQSIAAPGFANRAAWLPVISLTITAPPNDAVLGDAYVIPAGASGAWAGNQQKLAEWTGRSWRIVNTKDGHGVSLPDGRVFEKIAGTYIEKLALDVQGSQWSYAVAGGSANALTATLSPIPQSLAVGMAIRVKIAQTNSGPATLNVNGLGAKPIRKADGGATEALDLLSGAIVELNYDGAAWQIAGSYVSSMPARHTQSYTAAGTYTFIVPTGVYWVFARVWGGGGGGGGAANNNAVGRGGGGCGYSEGWVAVTPGQSLTLVVGAGGAGGAGGASPQTGGNGGTSSVGGISATGGKGGGYSTSGGGVSGGDPGGGFGGSANFSGQGGSIGYRIAEGAWIAPAGGGAFGTPSSNVHINDIGYLGYAPGGGGSGACTNAAGGSGFDGRIILQY >NZ_CP019390|1520986:1528235|1524818_1525961_-|WP_076771240.1|plate|DBSCAN-SWA MPWPVPSAKTIAERIASAMEFSISVVRPLVDPLAISRAVRSARGMLAMIGRAVALEAREIHDHVAWWGRQYFVDTAEDEFVQRHADIWGIVARPATFAVGRLDIEGAAGTPIPADLEIASSDGTIFKTTETAIIGPNGGASVSVIASIAGPTGNLEAGIRLRTVNAFPEINRIAVAAEGIAGGAEAETPAELADATMAYIRQRPHGGAGFDFPTWLREKFAVRAVKPETDWIGRGSVGVIVAMKDGTSARAPTESEMAEMLSYLGAPGSSSGVRPVTAHVVIVPAEMRAIPITVRVRPDKVATRAAVQEAYAAFIASIGDANDDQNESPIGARIEPSRISEAISAASGEYAHDLISPSVPFTLDRDQYPLPGEIMFEDPL >NZ_CP019390|1520986:1528235|1521928_1522360_-|WP_179947171.1|DBSCAN-SWA MSAVIGVLLDELRGLLSIEHDGSITWDELQELKNEHFGPDAVAIEVYPPHSHVANSLPMRHLWKLGAGEYWPDLTGQRLVGDLTLRDREILTRSEIEFHQQRKAEVKCTISTELDGPVVVSMDGIHLWSATDNWGKPPAPKRK >NZ_CP019390|1520986:1528235|1526478_1526994_-|WP_076771242.1|plate|DBSCAN-SWA MDHETASKVRGIVRRVVLKNINDDGETQTASVEVAPGIWRDKVEIMQPYGLATSAPEDGALALAVAIGGNEDDIVLLPVGNPSARMGGLKPGETALYNQHGDGILVGADGTISVQAGASIVLKVGGVTVTISAGGVAIEGGTVTHDGKNIGSTHVHIGVSPGNSTTGVPQG |
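The protein FASTA headers in the prophage blocks above appear to follow a fixed pipe-delimited layout: contig, region `start:end`, gene `start_end_strand`, protein accession, an optional phage keyword, and the literal tag `DBSCAN-SWA`. The following is a minimal parsing sketch under that assumption; the layout is inferred from the headers in this record rather than from any documented specification, and the names `ProteinHeader` and `parse_header` are hypothetical.

```python
from typing import NamedTuple, Optional


class ProteinHeader(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    start: int
    end: int
    strand: str
    accession: str
    keyword: Optional[str]  # e.g. 'plate' or 'protease'; None when absent


def parse_header(header: str) -> ProteinHeader:
    """Parse one protein header line (layout inferred from this record)."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6) or fields[-1] != "DBSCAN-SWA":
        raise ValueError(f"unexpected header layout: {header!r}")
    contig, region, coords, accession = fields[:4]
    # The keyword field is present only for proteins flagged as phage-related.
    keyword = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    start, end, strand = coords.split("_")
    return ProteinHeader(contig, region_start, region_end,
                         int(start), int(end), strand, accession, keyword)


if __name__ == "__main__":
    example = (">NZ_CP019390|1520986:1528235|1524818_1525961_-|"
               "WP_076771240.1|plate|DBSCAN-SWA")
    print(parse_header(example))
```

Splitting on `|` first and on `_` only inside the coordinate field keeps accessions such as `WP_076771240.1` and contig names such as `NZ_CP019390` intact, since both contain underscores.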
9 | Ochrobactrum_phage(42.86%) | plate | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1611906 : 1647321
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP019390|1611906:1647321|DBSCAN-SWA GATGACAAGGCGCGATGAGTATGAGTCGGCCGATGACGCGTTGTGGGCGATGCTAAAGGCTGGGGCTGACGGCGATAAAGCTCGGGCCCTCTATGATGCCGCCATCAAGCGAGCAGAAGCCATCGGAATTTCATATGTGCCAGCCGATAGGCTGCTTTCGTTTACAGATGAGGCGTTGGCGGCCCGACTAAACCTCGTAACTGGCAATCCGGAGGAAGATGCGGCCGCAGTGGGTGCGGCAAGCATCCCGTCAGTGTCTGTGACGCAGGCCCTGAAAATCTACTTCGATGAGATCACGCCCGATGAGCTGACGGGTAAGAGCGAAATTCAGAAAAAGCGTTGGCGTGCGCATAAGCAGCGGGCGATCGATCATTTTGTGAAGATTGTCTCGGATAAAGCAATCGCTGACATAACCCGCGAGGACGCGCAGAAATTCTACAAGGTTTGGCTACAAATGATAACGAAGCCAGCCAAAGGGAAGCAGCCGATATCCGCCAGCATGGGCAATCGCATGATGGGTGGTATGCGCGTGCTATTTTCCGAATACTTCAAGCACATGGGCAATAGGGATCGGCCAAACCCTTTCCGCGATCTTAGTTTTGCGGAGAAAGTTGAAAAGTCGCGACCGCCTATCCCAACGGATATCATCCAAGGGAAGTTCCTGACCTATGGTCCTCTCGTCAGCTTAAACGAAGAGGCTCGCGGTATCGTCCTGGCGATGATTGAAACAGGCTGCCGACCAAGTGAACTTTGCAACATTACGGCCGAGCACATATTTCTTGCCGACAAGGTTCCGCATATCCTGATTGCGCCTCGAAAAGACGCTGCAGATCCGCGTGAGATTAAAACCGCTTCGTCTGTTCGCAAGCTGCCGTTGGTCGGCATAGCGCATGAGGTTTTTAAGAAACATCGGAATGGCTTCCCTCGTTACAAAAACAAGGAAGACACGCTATCGGCGACGCTGAATAAATATTTCAAGGATAACGAGCTTTTTCCGAAGGGTGCCGGCTATACCGTCTATTCGCTTCGCCACTCATTCGAGGATCGCATGAAAGAAGCGGGCCTGGACGATGAACTGCGCCGAATGCTGATGGGTCATACAGTTGACCGCCCACGATATGGCACGGGCGGTTCTTTGGAATGGCGAAAAGAGCAAATGGAAAAATTCACGCTGCCGTTCGATTCATCCGTGATTTAATGCGGTCGCGGATATTATCGTTCGCAGCGATGTTGCGTTGCATGCCTTCAACACGTTCATAAAGTGGCAGGAGATTCTGTTCAGTCTCAGGGAAGTACGCAATGACACCGGCAATGGCGTCTAGCCACCGCTCAGTGTCCGCGTAGGTGTATGCCGCCATCCTCACTCCCTTTCCCGCAGTGCGGCGCGGCCATGACTTGCGTATTTCGACGCGATTGAAAGGCACATATGCGCCAGCCAGCTCAGCTTGATTTGGTCGCCGCTTGTTATGATGACGTGCCAGCGGTCTGTCGGTATGTTTCGGCATAGATGCAGTCTGCCGATATCCGCAAACCTCTTGGCGACCTCATCGGTAAATGCGACATTCCCAGAAATGACGTTCCGTATAATGGCGCGGTCCAGACGGCTCATTCGTAACCGCCTTTCAGGGCTTGGCGACCGGCTTCTTCATCCAGTCGGTCCAGCCTTTCTCTGACGCTATCTGTGAAGCGGAAAACGCTTTCAGGGTCAGCTTCGCGCTCTCTGTCTTCAACAAAGTCGAGCAATTCAGCAGCCCAATCCGATGCATGCTTAGCGATATAAGCGCACCTTTGTGCGAAACGTGCCTCTGTCAGCTTCATTCTGCCTGTTCCCCGAGTGCGGAGGCGGCGAGGGAGGCGAGTATCTCATTCGCGGATTTCCCTACGTCTGCGAAGGAATACCTCCAGCCCAACTTGTTTAGACGG
TCATATGCGATACGCAGCGCTTCCGCCTTTGTTCTGTCGCCATGGGCTACCTTCGTAATCGGTTCCTTACTAGGCATTGCTGGCCTCCGGGGCTGGGTAGCAAACGCCTTCGACATTATGCAGCGGTGCTTCGAAAATAAGCGTCTTGCCGTCATAGACCCGGACGCGCTCCATGTAGCAGTGCATGCCCGGCAGATCGTCGGTGACAGTGATGCGGGATACTTTCCCGACGATTTTCCACTGGCGATCAATGAGGCGTCGATAGTCGTGGCCGACTTCGTAATAGAAGCCGTCTTCGCCATGCTCGCCCTGCGGAATGCCACTTACTGAGAATGCTTCACGAATGTTCATGGCTTGCTGGCCTCCTTTGCGCGCAAGAGGGCGATGCAGATCGCCAGCGGGAATGTTGGCGCTTGCCCTTGGATAGTTTGTGAGCCGTCAGATATTTCGACGGTCCAGCCCATCGGCATGACGTCGTTATCCAATCCGCTCGCCCGGATGTAATCCTGCCCGATATGTTCGAAGGTCCACCCCGGCAACACCTTCTCAGCCAGCGCGAGAGCGGCGTCTACAGAGGCGGTGTATTCCGGCGACTTATGGGATTTGTTATCACCAGTGGCGAACCAACCGGGCACATGAGTGAACATTGCCCCTTCGTTTGGCCTGACCACGTAAGCATCTGGATCACCTGAGAAAAGAACCGCAATCTCCGCATCCACTTCCCTGTCAGGCGCGTCTAGCTTGGAGAGGCGGGTAATGAGGTCGGTCATCTCTTCGCGTCCTTCTTTTTCGGCCACGGGTAAACAGGCGCGGGAATGTCTGGATGATCAATGGGCTTGCTCGATAGAAAGACCACATCAGCAGCGCCAGCAGACAGCCCGACGCCGCTATGTGGTCCCTTCGACCTCTTAAACAATTCAGCAGCAGGCTCATCAGATTGACGGATAAGCCATTGCTCATAGTGCCCGTCGCCATACCAGCCAGACAGCGGACGCCCGAACCGATCAAGGCAATATTCCAGCACAACCGAAAATGACGCATCGGTCAGAATATATGTGGTCTGGTGATATCCACGGCCCTCGGTGGCATCGAAAGAGCTTCTGACAATGTAGCCCTTGAGATTATCGAGGCGCTTAGCGTTCGCCGCAAGTTCGCGCTCATGAGCCTGACAGGCTTCAACCGTGTCGAAAACCTTGCCATCATCAGAGATAAAGACTTCGCGTTTCTCGGCTTTCATGAACGCGCCTCCTGTAGAATGCGCCGCACTTCGTCGGCGGCTTCCTTATCGGTTGGTTTGCGACCAAGCCGAGTTTCAAGCTTTCCGTAAATCGTGTGAGGACCGGCATTGTGCCATGTGGTGGTGACGATCATGACGGCTTCCCTCCCAGCACGGCGCGGGCTTTGCGGCTTGCCTTGGCGTAAGGGCTTCTCGAACGGTCGCGTTCAATTTTCCGAAGCGCTTTCAGGTCATATTCTGAAAGGTCCATCGCTTTGACAGCCGCTCGCGTGACGCCAAGCTCAGACAACGTGCAATGCGCAATTGTCGGTTCATCATGGAGATTGACGATATTGCAGCAGGCCAGCACATAGCCGTGCTCAAAAGCAGACTTTTCAGCAGCCTCGAGCTTGGCTTCGAGGTCAATCGCCCTAAATTCAGCATCGATTTTCTCATTAACGAGATCGGCGGTTCCGTCCTGAAATGTTTTCAGGTCAGCCTCCAACTCCTTAACCCGCGTAGTCAGCGCCGCGTTGTCGGCTTCGGCCTTCTCAGCGCGTTCCAAGAGCTTTGCCGCTGTCTTCTGCATGGTCAGGAAGTTGTCACGGGCAGCATCGCGGTTACGCTCGGCAATCTCCTTCTCCGCCCGTTCCGCCGCCAATAACTCCCGCATGTGATGCTCGAAAAACCAGGCGCGACCGCCGCCCTTATAATTTTCCGGAATCGGATGAGTAACGAGAAACTGGTCAATTTCCTTTGGGTGAACTGACCAAAATTTATAGGC
AAGCTCCGTCACCAGTCCCGTATCTGTAGCGGCAGGCGCGGGGCGGGTGTTCGCGGGTTTCAAACGCGGGTCGCCATCAGGGCAACATGTCAGGCAAACCGATCCTCTCATGTTGCGTGTTCCGCACACATAGCAGATGTCACTCGCCATGACGGTCGCCTCCTGATGGGGTGATAGGGGCGCAATAGGCATAGGCAGAACGGCGACTTGATCCGGGGTTGTCTACGTCAAAACGCAGATGATCGAGAGTGAGAAGGCAATCTTCGCGGGTCATTGGTTGTGTGCGTTCATAAATACCGCAGCCCATAGCTGAAATCCCAGCGCAGATAATAAGGATGTACATCACACCGCCCTCCGCAGCCGACGGCGGTTGTCATTGGCTGGGCTGCGACGCTGGCGTGGAGCACCGTAGCGGAACCGAACGCCAGTAATGCGCGCCAGTTCGAAAGCGGTGTCGCGCAGCGTCTCAGCTTCGGAGTGATAGCCAAGCGCGCGCACTGCCTTTGAGATCGACAGGACCTGCGCAGCGGTCGTGCCTTGTTGCTGGAATTCGACAGCGGTCAGGATGGGCGTCATAGCGGCGTGCGTGATGGATCTGGACATAGGGTCTCCTCGTGTTTGGTGGCTGGTGGTAAGTGTGCGCCGGCTGTGCTTACGCGGCAGCCCTGCGCCATTCGAGATACAGGTGCTTCACAAGATGCTTTTCGACATAGCGATGCGCGCGGTTCGCTGCTGCGGCCCTCACGCGGACATTCTTTCCCATCACGCCATCCCCAATCCAAAGCCGCACACAGCAACGCAGAAGGCCACAACGACGGTCATCTCGACCGCTTCGCGTGCCGCATAGCGCAGCCACGGCTGTGGTCGGCCATGTTTCTTGGCGCGATAATCGGGACGGCGCATGGGCTCGACATTACCGGCCTTAGGTGCGTCGGTTTCCGTTTCGAACTCGTCGTCGGCGAGCATTTCCAGGAGCGCGCGGTTCATGCTGCAGCCCTCCGGTCTTTTGCCGGTATGTTGTCGTTCGCGACAGGCTCAAGGTAGTAAAAGCCCTGATTGCCGCGTCCGCAGCCTTGCTTCGGAATTTTCCATCCATGGCGTGGCAGCATCTTGCGCAGGCCAACCATAGTTGTTCTGATGACGCCGACCGCGTTTTCTGGGCCGCCATTTGGATCAAATGCATAAACGTTATCGACAAGATCACGCATGTGCATACGACGGGGGTAGACGTCGGACAGGGCCTCTATGATTGCCAGTTCGGTGCGTCGGAAATGGACTGTGCCTAGTTCCTCATATGGATCGCGCTCCATTTTAAGCAGCCTCCGCCAGCTCTACCGGCGTGCGGCAGGCCACGGCGCCGCTCGTCTGGAAAACTTCGAATGTCTCGCCGGGGCAAAGGGCAGCGAGACGCGTCGCCTCGGCCACGGCCTGCTCAAACGAGCCGTGTTCGTATGGCATGGTGGTGAAGACGCCAACGCGGCCGGTCTTCTTGCCGCGGCGGAATACAAAGAATCCGCCGCCGATAATTTCATTCGGGCGAGGCTTTGAGTTTCTTCTTCTCGGTGCTGTTGCTGACATTTGGGTTTCTCCTCGTGTTTTGGTGGTTGGGTCAGCAGATCAAGCTGGTGAGGCTGTCTTCGTGCTGTTGAGGGGAATGTAATGGGATAAATCACATTAGTCAATGGGAAATTTCCCAAGTGACGAAAAATGCGTGATGGGGTAAAACCCATTTATGAAACAAGTCACTGTAGACAAAGATCGTTTCCGGGCCGCGATGAAGAAAGGCGACTGGAGTATGAAGGCCCTATCGCTGGAAGCTGGGATGGGGGAAACTTTCGTACGCGACATCTTGGATCGCGGTGCCGTTCCAAAAATCGACTCCCTGAAGTCGGTCGCTAACATTCTGAATACCACTGTCGGGTATCTTATCGGGGAAGAACCGGGAGTAACTTCAGTGGTAGGGCGAGTCGGGGCCGATACGAGCGGCGAAATAGTCTATG
GAACTGGCGACGGCGGGTTCGGGGATGTCATTATCCCACCTGGCGCTGGGCCAGAGTCAGTAGCAGTTGAAGTGCACGGCTACTCTATGGGAACTTTCGTCGATGGGGCTTTGATCTTTTATAGCGATCAGAAGTTAGCCCCGTCCGACGAAATGCTTGGCGATATGGTTGTCGTGGGACTGAGCGACGGTCGTGCGCTGCTGAAGCGGCTCACGCGTGGATCTCGCCCAGGTCTTTACGATCTGGAATCGCTAAATGGCCCGACCATGCGGGACCAAGAAGTCGTTTGGGCTGCTGACATTGAATCTATCGTGCCGCCGCGACAGGCTCGACGAATACGCATCTGACATTTCTATAGTGATAAATCACAATGCCGCGCTTGCGGCATTTTTTATGTCCATCTGATAAAATGGGAAAAATCACATTGACAAATGGGATTAATCCCCATATAGTCCAATCATCAACCGACGCAATAGACGTCGGGATCACAAAATGGCAGGCAACAGAATGAACATCCTCCCACGGATTTTCGCCCTTAATAGATAGAAACCATTTTTCGAGGAGAAGCGCCATGATGAACACAGCCGAAAGAAGCCGCATCGCTTCCACGCCACAAGGACCGTAGATTTCAAAAGGGACACTGCCGACTGAAACGTCGGCGGTGCTTTCGGCGTGGCCAGAGAAGCCGCCGGACACCAAGAAGCGAATGCCACGTCGAAAGCAAATATGAAGCAGGCTGTTATGGGTCGGCACCACCCGTTACCCCAGCCTGCCCGTACGTCGCGGTTAACACGAGGAGGGCCGAAGCCGCTTCTACGCCGCGATTTCCTTACCGACACCACTGCCGGTCAACGAGACAGATGCCGTTAGCCGCGGCGCTTATCAACCATCACCAACGGACACGAGGAGACATGCAGCATTCATCCAGACACACACTGCAGGCAATTGCCGCCGCGCACCGTAAACAAGGAGCGTCATTCGGGAAGATCGCTGAACTGATGGGCATTACCAGGGGCCACGCATGGTCGCTGCTTTCGGAGAGGTCGCCCACGCTACCGCCACCAAGCCCAACAGAGAAGACCGTTGTGCGGCGCACAACTTTTAATGGCGGATATTCGGGAGGATGCATCGACATTTATGTCTCGTTGCCCCGGATCACCATTCTGGACGGGCCTTTTGCGGGCACAGTCCACTAGCCTTACGAGGCAGCGCAATTGCTGAAAGGCAGGCCGACCGCGAGGATGACGGGGCCGACGACTAACCTCCCGACGAGGAGGCTTTATTGAAAACGAAATACACACGAACTGGCGAGCGGGACATGACAAACCGCAAGCCTTACCGGACGGCTGCGCAAAAAGCAGAGGCACGCGCGAACGCCGTGCTTCGGAATGGGACGCACGTCTCGAACGCGCCAGTCACTTTCCACCGCGCACCGAAAAGAGGTGCCGCATGACCTGCGAATGCGGTGAATGCTGGGATTTGCCCGGCGAGATCGTCGTCCATCGGCTTTGGAAATGGAAGGGCATCATTATCGAGGAGCGCGACAGCTTTCGCTGGCTGACTGTGCGTTTCATGATTCCCGGCACCGGCCTTGTGCAGCTTGAGGTCTCGCGCTTCGAAGTCGAGCCCGATTTTGAAGAGGACGGCGGTGGTGTCGAGGCTGACAAGCCTGAAGATGACAACGTCATTCCGGTTGATTTCACCAAAAAAGAGAAACTGACGAAAAACACCAAGACGAGGGGAGTAGCGTGATGGCTGACAAGCAAACAGTGAAGGTTGGCGACAGAATTACGGCCAAATTCGATGATTCCTGCAATGACTACAGGGCCGGTAATACTTTTTACGTCCAAGAGATTGATGAACAAGACGGTGAGACGATTGTCGCCTTCACAGACAATGTTGGCGACAAACGTCATCGCCGCGTGAGCGAATTCACCGTGGAACACGCGCCCGTTGCGGAAGCAACTGGCAAGCCTGCTTTTAAGGTGGGTGATCGGGTG
CGCTTGATAAAGGACGGGTTATCCACGACCGGAGCGGTTGGAAAGTTAGCCACGATTAAATCGTGGTCTGGCGGGAAGGTCGTGGACAATGGCCAATATCTGCTGAATATCGACGGACCTGTCGATTATGAAACGCTTGCGTTCCAGCCACAGTATACTCGTGCAAGCCCAGAATGTTTTGAACTGCTTGCGCCATCCCTCACCATCGAAACCGGCAAGTTCTATCGTACCCGCGATGGGCGCAAGGTTGGGCCGATCGACCATAATGGCTTTGGCGTCTACGGTGCGCCGGAGTTCCCGGGTCACTGGTACGAAAATGGACTGAGTTATTCGGACAACACGCAAAGTTTGACTGACCTCATCGCCGAATGGGTCGACGAGCCAGCCAGCAATGACAATGCGCCTGCGACCGCACCCGCAATTGTCGCTCTGATCGAAGGCGGGCAACCCAAGCCGTCAGAGCGGCCGAAGGTTCATAAAAGCGAACAGGCCGCCACTGATGAAGCTGAACGGCTTGCCGTCAAATATCCGGGCCAGAAGTTCGGCGTGTTCGTTCTCGCTGACTCGCGCATTGCCGATGTCGTTATTCGGAGGGCCGCATGACCTCCACCACGTATAGCCACACGCGCAACTTCCAGCCTAAAGACTACGCGGAGGGTGACTCATTCTACGAGCCGGAAACCACGCTCGGCCTTGGTGATCGCTTTCTATGGGGTTTGGCAGTCATTGCTGCGCTCGCTCTTACGGTCGGCTTCTACGGATGGGTGCTGGCATGAACGTCACCACAAAAGCCACAGCTGACCTGCCGTACATCGATCCCGGGCGGAAGCCCGGTGTCGGACGCATCGGGCAGTCCTTGGCGCTTGCAGCGTTCGCGCTGGCAATCGCCACGACAATTGCAGCGTTCCTGTTCTGGAACCTACTGCTTCCGTTCTATGGGCTGCTGTATCTGTGGGGTGCGCACTGATGCCCCTCAGACCGCCACGCGACTACTGCGCAGCTGCCCTCACAAGTCCACCGGCGTGGCTTGTCGGCTGGCTCATACTTGCGGCTGTCATCGCCGCAATCGCCGTTACCCACCACACCTACTGAACACGAGGAGACCTATGGCTCTCAACTGGAACGATGGTATCCCAGACGATACCAATGAAAACGAACCGGCGTTTTGTGTCTTCTACGCTGGCGCGAAGTCAGGGAAGACGACCCTAGCAAGCGAGTTTCCTTCGCCACTTTATATCCGCACGGGAAAGGGTGAACGCGCACCAGCCGGAGTAACAATGAAGTCTTTCGGCGTGTCTGAATCCTATCGCGATGTGATGGATCAGGCTGACTGGATGCTGGACGCGCAGCATGATCGCAAGACCTTCATCCTGGATTCCGCTGATGGGCTGGAACAGCATATCTTTGCCGAGGTATGCGCCAAAAACAAGGTGGCCAGCATTGAGGATATTCCATACGGCAAGGGATACACGCAGGCGTCAGAAATCTGGCACGAGTTCATCGCCAAGGTGATGCAGCTAAAAGAAGCTGGGTTCTATGTGGTCGTGATCGCGCACGTGAAGTCCAAGACTGTTCCCGGCGTCACGACAGACAGTTACCCCCGATATATGCCAAATCTGCGTGATGACGCTGTCGGTATCGTTGTGGACGCAGCAGATCTTATCGGGTTCCTGCATCAGCGCGTTTCCATTCGCAAGGAAGATGTCGGGTTCAATAAGAAAAATACACGCGGCGAAGGCGGCGGTGACATGTTGATTGCCGTGCAGGAACGACCTGGTTTCATTGCTGGCAACAGATACGACATCGAAAAGCCGACGCTGCCTTTCAAGCGCGGTGAGGGGTTCAAGGTGCTGGACTATTACTTCCAGCGCAATTCGTTGGCGCCCGACAATGACAATGCGTCGGAGCGGCAAGAAGAGGCCGCTTAACGGTGCTGCACGATAACGACAATGAGCTTCCGCGCACGCGTGCGGAGGCCAAGCTGACCGGAGCAACGCATT
ACTTTACGGGCAAGCCGTGCAAGCATGGGCACATTGCTGAGCGATCCACGAGCGATGCTGTTTGCCAAGAATGCAATCGGGAAAGATCAAGGGAATTTGCTCGCAAAAACCCGGAACTAAAAAAGCAAAAAGACCGGGAATATTATTGGTCTGACCCTGAAGCCCGTAGAGAAAGCGCGCGTATTTATGCGGCAGCCAATGCAGAAGCAGCCAGGGTGCGTGCATCTGAATGGCGGCTAGCGAACCCTGAGCGTGCTGCGCACAATGACCGAATTAAGCGAGCAAGAAAGCGCGGCGCTGGAGGCAGCCATACGCTGAAGGAGATTGCAGGTCTTCTTAAAAAGCAGAATTACCGATGCGTTTATTGTCGGGCACCAATCCGAAAAAAGAAGAACCGCCACGTTGACCATATTATGCCTCTCAAACTTGGCGGTTCGAATGACATAACCAACATCCAATTGCTGTGCCCGACTTGCAATATGTCAAAGAAGGCCAGCCACCCAGTCGATTACGCCCGGCGTATCGGGTTGCTTGTTTAACCACTCCACCAACAACACGAGGAAATTACAGATGGCAAAACTAGCCAGCAGATTTGATGCGACTGCCCACGATACGGAGCAGCGGGACTACGAAGAGCTGCCGAACGGCGATTACGAACTGGAAATCGAGGCATCGGAGGTCAAGGAAGGCGCTAACGGTACCGGCCTTAAGACAACGATGACGGTTCTTCGCCCTGACGAATATCAGGGCCGCAAGGTCTTCAATTTCTACAATCTGGAACACAAGAACGCGCAGGCGCAAGAGATCGGCCAGCGTCAGTTCGCGAGCCTTTGCCGGGCAATTGGTGTTTCGGAAGTTGAGGATTCCGAAGAACTGCACTTCAAGGCGTTCACGGCAAAGATCGGCCTCGGAAAGGCTTCGAAAAACAAGGAAACGGGGCAGGAATACCCGGCTCGCGCGGAGATCAAGAAGTACTACTTCCCCGACGAAGGTAACGTTCCCCAGCCTTCGATCGACGCCAACCAGCCTGTAGCACAGGCTCGCCCTGCCAATGACAACCGACCGGCTGCGGCAAACAGCAATAAGACTGCTCCAGCGGCTGCTGCGGCAGTCAAGAAGCGACCTTGGGGTTAAGCTAAACAACAGGCGCGGCCACCAACCGCGCCTTCCACCACCGAACACGAGGAGACTTTGATGAGAGTCAGCATTGACCGCTCACAGCTCGCGCACGCCTTGGCGACCGTCAACCGTGCCATCGAAAGCCGCAATTCCATTCCTATTCTCGCCAACGTGCTCTTGGCGGTAGAGGTCGGCCAGTTGCGTCTCACTGGCACCGATCTGGACGTTGAGATAACCACCAGTCTGCCGGTGCTCGACTGTCAACCGGGCAGCGTAACTGTTCCAGGCAAGATGCTTGCGGACATCGCAAAGCGCGCAACGAGCGACATTACCCTTGAACTGGATGGAGGCCGCCTTACGGTCGCATCTGGCCGTAGCCGTTACAAGCTTGATGTCTTGCCCGCCGAAGACTTTCCGTCCTTCAGCGCAGGGAAGTTCGACACGACGCTTGAACTCGATCTGGCAGCGCTTGTTGCGCCGTGTGTGCACTGCATCTCGACGGACGAGACCCGTTATTACTTGGCTGGCGTTTATCTCCATGCTGTTGACGGCCGCTTGGTTGCTGTCGCAACCGACGGGCACCGGCTGATGCGAAACACAGGTCCGGAAGGCACCTTGGATTACGGCGTGATCCTGCCGCGCAAGCTGGTAGGCCTAGTGCCAAAAGGCGCTGTTACCGTTGAACTGTCGCAAAATAAAGTGCGCGTCACGTCTGGCTCAACGGTTATCACGAGCAAGCTGATCGACGGGACTTTCCCCGACTATGTGCGTGTCATTCCAACCGGCAATAGCAACGTGCTTACCGTTGACAGACAGGCGCTCATGAAGGCGGTCGAGCGTGTCGCCGCTGTTGCGGACGACAAATCGCGAGCGGTGAAATT
CTCCGTTGGCGATGTGCTGCGGCTGATGCTGGCTGACAAGGCCAGCGACGAAGTTGAGGCGACGTTCGAAGGCGAGCCATTGGAAATCGGCTTTAACGCCCGATACGTCAACGACATGCTTGGCGCGTTGGATGAACCGAATGTGCGCTTTGCTCTCGGCGACGCAGGCATGCCTGCTGTCGTCAAAGGTGATGGCGAGTGGACCGCGGTCTTGATGCCGCTACGGGTGTAGGGGATGGCAGTGACAATCCCGAACCCACGATCTTCAACGCCATTGAATATGCGCTGCGCCATGAAGGTGTGACCGAAATCGCGTTTTCAGAAGATGGCGAATACGAAGTCGAAATCCACGAGGCGTCCAGCTTGATGCCGTTCGTCAAATGCCTGTTGCGCGAGTTGGAGGTGATTACGTGACGTTGACGCAAAAGCGTGTGCGAGAGGTACTTCATTATAATCCGTGGACGGGGATTTTCACATGGCGACGTCGTCAAGACTTGCCCGCCAACTGGAACGAAAGATGGGCTGGTAAAGAAGCGGGTTCCGTCAATGATCGCGGGTATATTGTGATTAGAGTCGAATACAAGCGATACCGGGCGCATAGGCTCGCGTTCCTCTACATGAAAAATTACATGCCAGATGAAGTCGATCATATCGATCATTGTCGAGGAAATAACGCCTTCAGCAACCTGCGTGACGCTAACCGATCTACCAATGGAAAAAATGTATCGATGCATTCTACGAACACTAGCGGTGTTACTGGAGTTCATTGGGACAATAAGAGGGGGAAATGGGTTGCCCAAATTACGGTCGATGGGGAAGTGAAGTACTTGGGTGGGTATAACAGTATTGATGCTGCGAAGCATGTACGGATGGCGGCCAATGACCAATTTAACTTCCACAGTAATCACGGTAAACCAGAATATATTGTTGGTGCGGCGTAATGGCAGCGCTCCCAAAAGCCGAAAGCAGCACGGTCCGCGCCATTTATCAAGCTTACGAGGCCCAGGCTAAATCCTGGGATTCGTGGGGCATCAGTGTGGGCGAGGCAGGCACGGAATGCGACCGCGCGCTTTGGTATGGTTTCCGATGGGTGTCGGCGCACGAGGTTCATTCAGGCCGTCAGCTTCGGTTGTTCGCCACCGGCAATATCGAAGAAGATCGCTTAGTCGCCGACCTCGAAAGCATTGGCGTCGATGTTTACGGGCAGCAGGACAAAATCAGGCTGGTCTCGGGTTTCGTCCGCGGTAAGTGCGACGGCAAGGCAATGGGCGTCCCCGAAGCGCCGAAAACCGAGCATTTGTTAGAGTTCAAGTCGAGCAACGAGAAGGGCATCAAGGAACTACAGAAGCACGGCTGCCAGAAATCCAAACCACTTCACTACGCCCAGTGCCAGCTTGGAATGCAGGCTTTTGGATTGACGCGCTGCTTGTATCTGGCGTCGTGCAAGAACACCGATACGCTTTATGCCGAGCGTATCGAATACGATGTCGAATTCTGCCTTCGACTGCTGGCACGCTGCGAACGCATCGTGTTTTCGGACGAGCCGCCCAGTCGCATCAGCGAAGATCCGGAGTTCTTCGGCTGCATGTTCTGCAAACATCGTGGCGTCTGCCATGAAGGCGTGCAGCCGCGCGTGAACTGCCGCACCTGCCTTCATGTCCAGCCTGAACATGGCGGTGACTGCCACATGTCATGCGCGCGATGGAGCAAGCCCTTGTCGATCGACGAACAACGCGACGGCTGCCCGGCACATCTCTACCTGCCGGGGCTGATAAGTGGCGAGCAGATTGATGTCGATGAGGAGAGGGAGACAGTTACGTATCGACTGGCGACGGGTGAGATTTGGGTGGATGGGGTCAACGACAATAAGAAGGTAGCATAATGCAGCCGCGATACTATCAGAACGAAGCGACAGATTCTGTTTTTGACTACTGGGTCGAGGAGCCGGGGCACCCATTAGTGGACATGGCGACAGGCACCGGTAAGTCTATGACGCTCGC
CATGATTTTCCAGCGCTTATACACCGGCTGGCCTGATATGCGGCTGTGCTGCTGCACTCATGTCGTGGAACTGGTCGAGGGCAACTTCAAAGAACTGCTTGGTATCGCGCCATTTGCCCCGCTTGGGATTTACGCTTCAGCACTTGGTCGCAGGGATGCAAGGGCGCAGATACTCTTTGCGCAGTTGCAGACCGTATATAGCAAGGCGGCGCAGATCGGGCACGTCGATGTACTGGCAATTGATGAAGTTCACTTAGTGCCGAATGATGCAAACACCATGTATCGGCAATTCATCGACGCGTTGTTGGCAATCAATCCCGACATGAAGATCGTTGGTTTGTCGGCAACGCCTTATCGCCTTGATAGCGGACGGCTCGATGAGGGTGACGATCGACTGTTCGATAAGGTCGTCTATACATACGGCATACGGCAGGGCATTGACGACGGATATCTTTCGCCAGTCACGTCAAAGCCCACTGAAACCAAACAAGACACTTCGCATGTTCCGATGCGCGGGAATGATCTTGCGAAGGGCGCATTGCAAGATGCAGTTGATCGGGACGATCTCAATGGCCGCATTCTTGAAGAGGTTTTTGACACGGAAGGGCAACGCAGGACAGCACTATTCTTCTGCGCCGGCGTGAAGCACGCCACAAACGTGCGTGACATGGTTCGCTCAATGGGAAAGTCCTGCGAAGTTCTTTCTGGTAATACGCCACGGGCTGAAAGACGCAACATTATCGAGGCATGCAAGGCGGGCGAGATATGGGGCATCACCAACGATAACGTCATGTCTACTGGAACTAATGTGCCGCGCATTGACCTCATCGTGGATATGGCGCGCACCAAATCGGCAAGCCGTTATGTGCAGCGCGTTGGTCGCGGAACTCGCGTGCTTTATCCGCCACGCTTTGATCCAGAAGCAGTTGGGCCGGAAGAACGCCGGGCAGCTATTGCTGGCTACTTGAAGCCTAACTGTCGTTACATGGACTTCGCTGGCAACGTCGCAGAGCACGGCCCCGTCGATATGATCGAACCGCGCAAGCCGGCGAAGGGTGACGGTCAAGCCCCGATCAAGGTTTGCCCGACCTGCAATGAGCAACTGCATGCTTCATTGCGCATTTGCTGGTGTTGCGGCCATGAGTTTGAGTTCGACGAAACGCCTAAACTGCAAAGCCACGCAACTGACGCGCCAATTGTCAGCGTTGCAACGCCCGAGACGAGGGAAGTTACGCGCCGCACCTTCGCATACCACGAAGGCAAGGGCGGCAAGCAGGACAGTGTGAAGGTGTCCTACTGGGTAGGCATGTCACCGATCAACGAATGGCTCGGCCCTGCGCATACCGGCTTCTTTAAGTCGAAGTCAGACAGGTGGTGGCGAAAGCACGGCGGTCAGGCACCGTTCCCGAAAACCGTTCTGGAATTCATGGAGCGCCAGAATGAGCTGCTGCCCACTGGTGAAATCGTTGTGAAACCGAACGGCAAATACTGGGAAGTGGTCGACGCCATTGCGGGCGTTGCGAATGACAACACACCGGAGGCGAGCAACGACAATGTGCCGGCATCAAACTATGGTCGCGTATCTGCCGGGCTGGCTGAATTATTGGACGACGAAATACCATTTTGACGCCCCGTTAACGCGATGCGCTGTGAGGCGTGTTAACTTACCTACCGCTGCTTATCGCAGCAACACCAACCACCAAAACACGAGGAGCAATAATGACGGACAATTACGACCCGTATAACCCGGCGCCTATTGGTCACAATAGACCGCCGCTTACTGCGTACGAGACCATCAAACAGGAAATCGAAGACCTGTTCGATGAGGCTAAGAATTTCGCGGACGGCGAAGCGATCGACAATCAGCAACTGGCCGACGCGATCACCGAACTGCACGATAAGCTGCATGAGGCCGGGAAGCGCGCCGACGAGGCCCGCAAGGACGAAGCAAAGCCACATGACGACGCAAAGGCTGAGATACAAGACCGATACAACAAACT
GATCGGCAATACGAAGTCAGTCAAAGGAAAGGTCGTTCTTGGCAAGGAAACGTTGCAGGCCTTGCTGACGCCATGGCGCAATAAGCTTGTTGCCGAAAAGGAAGCTGTCGCGCGCGCTGCGCGTGAAGAGGCTGACCGCATTGCGCGTGAAGCGCAGGAAGCCATGCGAGCCAGCGCAGGCAATCTCGAAGAGCGTGAGAAAGCCGAAGAGCTGCTTGCTGAGGCGAAGAAAGCCGACCGTTGGGCGAAGCGCGAAGACCGCTCAGCAACCACGGGCACGGGGCTGCGCACTATCTGGCACTGCAAGTTGGAAGATGAAGGCAAGGCACTCGACTGGGCCTATGGCCGTGCGCCAGAGCGCTTCAAGGAACTCGTCCAGTCTATGGCCGAAGAAACCGTGCGCGCCGGGATGCGTCAGGTGCCGGGCTTTAAGGTCTGGGATGAGCGGGTGGCGCGATGAATACGACAGCAGCACCAGTGACGACGCCGGTCAATCCAGCAGATCTATCTGCCGTTATACGGCATGCCTTTATTGCGGGGCGAGGCAGGACAGAAGGGATGCGTCTGTCTCCCGACGATATCGACGCATGGGTTGATTACGACCCATCAGGCAATCCGGCTTATCAGCGCATAATTACTGCGCTTTTTGGATGGTGAGGGCGGCGTGAGCTACGCACCGTGGATGGCTATGCTGGGCCCTGACGACGAGCCAGACTACCACGCCTATTGGCAAGAAACGGAATCGCTAGCGGACGCAATGATTGACGTCGCCACCTACGATGACTGGTTTCCAGACATCCCGATTGCTGCCGTGCAGCAAGGCGGGGTATAAAGAATTATAAATATTACAGGGCGGGGGGGGGATTATGTCATCGTTAAAGCACAATAGGCCAAGACTACGTCTTTCGCATAACGATTATAGAGAAAACAATAAGGAACACACCTGGGTAAATGGTGGCCTATCATTAAACAAAAGACATTTGAAATCCAGAAAAGAAGGGGACCGATTTAAGTCGGCCTGTCTTGCTGTATTAGAAAGCGGAAAAGAGTTGCCATCTCCGTCAAAAATAGTGCGAAAAAGGATACATAGAGCCGTCAAAGGAAGTTCACAAGCACACATATTGTTGTGGCTTCGTACGGTGTCGTAACACACCACACCACCCCGCCAGCCACCAACTGGCGGGTTACCACCACGAAACACGAGGAGCAGATATGTCCTACGAGGATCTATTGGCGCGTAAAAGCGCTGATGTGCCTTTGCGCGGACTTTCAAACATACCCAAGTTGCACGACGGCATGTTCGCTTATCAGCGCGACGTGACTGAATTCCTTCTTGGTGTGGGCGGAGGTGCCGCATTCCTTGATACCGGGCTTGGGAAGAGCTTTGTTGCTCTCGAATGGGCCCGGGTAGTCGCTGAGCATACGGGCAAGCCTGTCCTAATGCTTGCCCCGCTCGCAGTTGCGCCGCAACACGTCAGGGAAGCCAAAAAGTTCGGATATGACGCTGCGCAGGTTGTGCGCTCCCGCGATGAGGTCGGCCCGGGGATTAACGTCACCAACTATGCCAAGATCGACCACTTTGACGCTGACGCGTTCGGCGGTGTGGTCTTGGATGAATCCAGTATCATCAAGAACTTCACTGGGCAGACAACGCGTAAGATGATGGCCATGTGGTCGGGCACACTCTATCGGCTTGCCTGTACGGCGACGCCAGCGCCGAACGATCATATGGAGTTAGGCCAGCATTCGCAGTTTCTGGGAGTAATGCAATCGAATGAAATGCTGACCCGCTGGTTTATTGCCGACCAGACCAATATGGGTCGCTACCGCTTGAAAGGTCACGCTGTTAAGCCATTCTGGAATTGGGTGGCGAGCTGGGCCCGGTGCATATCAAAGCCATCTGACCTTGGCTACTCGGATGATGGATTTGAGCTGCCGCCATTGGAAACGTTTAAGCACGAAATTCGAGCTGATGTATCCGTCGATGCCGGTGACCTTCT
TTTCAGAATTCCTGATACGAGCGCGACCGCTATTCACAAGGAAAAGCGACTGACTGCCGATGCGAGGGCCGAAGCCATTGCGGAGCAAGTCAATTCCGAAAATGGAGAGCCTTGGATCGTTTGGTGCGATACCGATTACGAGGCTGACGCGCTGACCAGCCGCATTCCTGGAGCCGTCGAGGTACGGGGCTCTATGTCCGACGCGGTAAAAGAGGAAAGGCTCGTTGGGTTCAGTGAAGGAAATATCAGGGTACTTGTCAGTAAACCGTCAATAGCTGGCTTCGGCCTTAACTGGCAACATTGCGCGCGGATGGCATTTGTCGGCCTCTCCTTCAGTTATGAAGCTTATTATCAGGCCGTCCGGCGCTGTTATCGTTTTGGTCAGAAACGCCCTGTTCATGTTCACATTGCACTTGCCGACACCGAACGCGCAATCTGGGACACGGTCAACCGCAAGAGCGGTGATCATGAACAGATGAAGCGCGAGATGTACGCCGCCATGCGTCGAGCGCACGAAAAGCGACAAGTCAAAATCGACTACCAACCAACAAAGCCAATTGCCTTGCCATCATGGCTCAAAGGAGCGTCAGCATGACCTTTGTTCTCGATCAATCATCCGGTGAAAACTGGGCGGCCTATAACGCCGATTGTGTGCCTTTTGCAGCCGGTCTTCCTGATAGTTCGATTGATTTCAGCGTTTATTCCCCGCCATTTTCGTCACTTTATATCTATTCCGAAAGCGTTGCCGACATGGGCAACTGCGCATCCGATGACGAATTTTTCGAGCAATATCGTTACCTAGTGCGCGAAAAGCTGCGGGTCACTCGACCGGGGCGACTTACCGCCATCCACGTCAAAGACCTGGTTTACTACCAAAACAGCAGTGAACGTGGCACGGCCGGACTGCGCCCTTTTTCTGACGACTGCACCCGGCTGCACATTGATGAAGGATGGGATTTCCATTCGCGAATTACGATTTGGCGCGATCCGGTGCGAGAGATGCAAAAGACCAAGGCGCACGGCCTGCTTTGGAAGACGTTGCGCGCCGACAGCACTTTCAGCCGTATGGGACTCCCCGAATATCTGCTGGTTTTCAGAAAGTGGGCCAAACCAGGTGAGGAAGTCGTACCGGTAACTCACACCAAGGAGAGCTTTCCAGTCGAAGAATGGCAGGATCATGCTTCGCCAGTTTGGAATTTCAGTAAGCAGGATTTGCCAGAAACTGATGTACTCAACGTCAAGGTTGCGAGATCTGATAAGGACGAAAAGCATCTCTGCCCCATGCCTTTGAATATCACTAAGCGCGCGTTGCGCATGTGGTCCAATCACGGCGACAGGGTGTTTTCACCTTTTATGGGGATTGGCTCGGAAGGGTACGTTTCGCTTCAGGAAGGACGGCGCTTTGTTGGAACAGAACTCAACCCGAATTACTTCCGCCAGGCAGTGAAGAATCTGACCGAAGCCGCAGCTTACAGCGAAGCACCTACGCTTTTCGTATCCAACGACAACAACCCGGAGGCTGCAAATGCAGCCTGACACCGACATCTGTCACGTCTGCTACCGCCACGCCATCGGCCTCGGCGTGCAAGCAGACCGCGAGCCGGTGCGCTGGCTGTGCAAGGAATGCGCCGACATTGCCGAGCATATTCGGCATCGCCGTCGCCTGGACCCGTACGAGCTGCGTGCGCTTGATACCGGTGTCGAGGCGGTTGGGGAATACTTAGTGTCCATCCAAAAGACCGACCTTAAGGAAATGGACGAACTGGAAGCGCGCATGCTGGTGCGCGCCGCTTGGGAAGGCTGCGGGCGAGGGATGCGCGCGGCCCTTAGTGAAGCTCCATTCTGAGGCAGCCATGACAGCCTATTACAACGAGTTTGACCCGAAAGCGGCCGCTTGGATGCGCGAGTTAATCAAGGCTGGCCACATCGCACCGGGAGATGTTGATGAGCGTTCAATTGTCGATATTCGACCTTCCGACCTTATCGGATACACACAATGC
CACTTCTTCGCCGGGATCGGCGTCTGGTCATATGCGCTCCGTCGAGCGGGATGGCCGGATGACCGCCCAGTCTGGACCGGCTCCTGCCCTTGTCAGCCTTTCAGCGCGGCAGGCAAAGGAGCAGGGTTTACTGACGAGCGGCACCTATGGCCGCACTTCCACTGGCTTATTGACAACTGCCGCCCTCCAGTCGTCTTTGGTGAACAGGTTGCGAGCAAGGACGGCCTTGGCTGGCTCGACCTTGTACAAGCTGACCTGGAAGGATCGGGCTACGCCAGCGGGGCGGTCGATACCTGCGCTGCGGGCTTCGGTGCGCCGCACGTCAGACAGCGGCTGTATTGGGTGGCCCACGCCTACGACAAGGGATCACAAGGACGGTGCGGAGTGCCAGAACGTACAGCTGAACGCGCTGTTGGGTCGGGTGGCTTGGCTGACGGGTTGGCCGACAGCGCAAGCTTCGGACGGGTCGGGCGGCGGTCAGGCGGCGAGGGCCATGAACTCGGAGAGATCGAACGATCTGAACGACTTCGCGATGCTAGCTGGCTGGCCGACACCCATGGCGGGGACACCTGCGCAGAACGGCAACAACATGGCGGGGAACAACGACAGCAGCCGCAAAACTGTGGAATGCTGCATCACGGATCAACCAGCCCGACTAACGGCCACTGGCGAGATGCTGACTGGCTCTTCTGCCGGGATGGAAAGTGGCGGCCAGTTGAACCCGGCACATTCCCGCTGGCTCATGGGGTTGCCGCCCGAGTGGGACGATTGCGCGGTTACGGCAATGCAATCGTTGCGCCCGCAGCGCAAGCGTTCATCGAAGCTTACCTAGGCACCGAGGTTGTGGCCGCCAACGATAACTATCTCAGCAGGCCTGACGTCGCTCCTCGAGCGAGTTGAGCAAATCGCTCAGTTCCTTGGCATCGCGGTAGTTCATTCCTACCGCCTGCCGTACGCCCATTTCGACTGGCTTTTGAACTTTCTGATCAACCACTGACCAGGTGTCGTTAATTCCGTTTGACGCTTTGTATCGTCCGCCGATGTACCGCCCACCCTTCAATGTTGATCTGCTCATCCCAGTCCTTTCAAGGTTCGAATCGATTGAAATCTATGATTACAGACATGGTAAACACATCGCAAGTTGCTGCCGTCGCAACCGACCCCATGCTCGACGGTCACTGCAGCGTAGCTGCTTCGCGCTGCGCATAAAGATAATTCAGCAAGTCGGCCAAATCGTCGGCTTGCTCCCACTCAAGGCCGATCTGTTTGATGCCGTTAACTTCCGCGGCGTCGCCCGTCCAGACATCAATTACAGCCCATACGCCGTCATACATTTCCAATGTGTCGTACCGAGCTTCTGCCATGGCTGTTTCTCCATCAATAGGAATTTCATGGAATATACTATCAAAGCCATCCCTACAGAATACGCTGGCGTGCAGTTCCGTTCACGTACCGAAGCCCGTTGGGCTGCATTTTTCGATCTCGTCGGGTTGAAATGGGATTACGAGCCGCTTGATCTTGAAAGGTGGGCGCCTGATTTCATCCTGAGAACGTCGCTGACGAATGTTCTGGTTGAGGTTAAGCCGGTAGACCTGACGGCCTATATTGACGCTGTAAATAGAGGCGGCGACGATGTCGCCCAGTTATCTTCCTACGATAAAGCACTGGCTCATTCCCGAAAGCACCAGGTTCTTTTACTCGGCATAGCGCCCTTGGAAATGCAGGGCGCGACTTTGCCGATCGGCATCCATACCACGCCGCCACGCGGGGCAGAATACTCGTTTGACGACATGCAGGACGCGCTAACGGTTGGAGATACATCGTTCGTAACGGACGCATGGCGCAAAGCCGGTAGCGTCACGCAATGGACAGTTGACGAACCGGATTTATCCACATCGCAGATCGTCAGCCGCGCACTTAATCGCGCGCATCGCAAAGCCGAACAAAAGAGGGCGGCATGAACATGACTCTTGCATCTGATCTGTCTCGAGATCTCGCGC
TTGCCTATATCGCTGCTGATATCCCGGTTTTCCCATGTCGGGCGAAAGACGAAGAAACCAACGAATTTGACGAAGAAACCGGCGAAATCGTTGTTCTGAAAGCCAAAACACCTCTTTTGAGTAACGGGTTCAAGGGCGCAACGAAAAACCTACGCGTAACCAATATTCTGTGGGATCGTAATCCAGGTGCTATGGTGGGCATCCCAACCGGCGAACAGTTGGGCGCATGGGTCCTCGACGTCGATGTGCACAAGGACGAGAACGGCGAAATTATCGACGGATTCGAAACGCTCTCTGCCCTTGAAGATAAGTTCGGCCCACTCCCCAAAACTGCCACAGCCCGTACTGCGGGCGGCGGAGAACATCGTTATTTCAAATATGTGCCGGGCGTTCGCAACCGGGGCAGACTTGGCGCTGGGCTGGATGTTCGTGGATCGGGTGGTTACGTCGTCGCTCCCGGCAGCATTATGGACGATGGTCGCGCCTATAAATGGGTCGACTATTCCGGCCCTGGTCTGCCCCCTTTGGCAGACGCCCCGCAATGGCTTTTGGACCTCGTTTTGCCAAAGGAGCCAATATCGATCGGTGCGGATTACACCTATGACCGCGGGAGCAACGACGCTTATATCGACCGTGCAATTCAACTGGAGCTCGAAGAAACTGCGTCTGTGCCAATGGGTGCTGGGCGCAATAATCGTTTGAATGCAGCGGCATTTTCTCTTGGTACGCTCGTCGGCGCTGGAGCTTTGCCAGAACATGAGGCGCGCCAATTGTTGCAAGATGTTGCGCGTGGATGGGGGCGTGATTGGGTCAAGTGCTGTAAGACGATCGAAAACGGTTTGTCTGCTGGCATGCGCCAACCACGACAAATCCCAGAGCGGTCATTCTACGATGATGACAGCACCCCACCGGTTAGTGTGACTGGCCTTATTGAGAAATATCGCAATCGTCACGACGACGATGTAACTGATAGCGACCATCGAGCGGATGCAGACGAAGCAGTTCCAGAGTCCGACGACGACGATGTTCCTGACTACAAGTTGGAAGCTGTCGCCGATCTGGAAAGCCTGACATATCCGGGCGGTTTGGTTGAAGACATGATTGACTGGATCGTATCGAGCGCGGAACAGCCATCTCGCACGCTTGCCATGGCTGCTGTGTTGCCATTGCTGGCGTCGTTGGCTGGTGCTCGATATTCGACAGGTTCTCGCGATACACGTCCAAACCTGTACACTGTGGCGCTGGCGGAATCGGGCTTCGGCAAGGAACACGCCCGATCACAGATCAAGCGTATACTCATGGCCGATCAAGGCATATTCGATGCTTATAGTGGCCCTGCGCGCATCATGTCGGCGTCAGCATTGCGTGAGGTGCTTGAGAAACACTCATCGGTTAATTGTCAGATCGATGAATTCGGCGGTTTCATACGCGATATCACAGACCGAAAGGCAGGCAGTCATCAGCGGGCAATTTCGACGGATCTGCGAGACTATTACTCGGCATCGTCAACCTTCTTCGAAGGCGCGGCCTATCGTGGTGTTCCTCCGAAGAGAATTTATAACCCCACCCTATGCATTCACGGCACTTCGACGCCAGAGCAATTTTGGTCGGCCCTCAGTAGCGCAAGCGCCGAGGACGGGTTGCTGCCGCGCCTTATCCTGTTCCATGTCACAGGCGAAAAGCCTTACGCTGTAAAACCGTCCCGTGATGTTCGCGAAGTGCCATATCTTCTGATGGAGCGTATGGCATCGGTAGCTGGCATTAACGTGGCTGCGAAGCGTGGCAATCTGTCTGGGATGAATATTCAAGTGCCGGCGTATGGTGAAAACAAGCCGTACATCGTTCAATGGACACCAGACGCAACCGCTCTTTTCAGGTCGGTCAAGGATTCGATTGACGCTCGAGAAAAGATGCTGGCATCAGAAGCTCGACCGTTTGCACGACGCATCATTGAAAACGCGATCAAACTTGCGCTGATCGTTGCGGTTGGGAAGGACCCG
ACGGAACCGGTCATAACCGAAACTGATTTCGAATGGGCTTCATGCGTAGCCTGGACGTGCGCAGCAACAATGATCGCCGAAGTGACAGAACGTCTGGCCGACAATGACCGTGAGGCGAATTATAAGCGCATCGTGGGGTTGATCCGTAAGGCAGGCACCAAAGGCATCACGGAAAGTCGTCTGTTCGATCGTTGTAAGGCGATCGAGGGGCGTCGCCGCGAAGAGATATTGAAGGAGCTTTTTCACACCGGGAAGGTGATAAAACAAGATGCGAAATCCAAGCGAGGGAGACCGGCAAATCGACTTGTCTGGATGGACTGACACGACGGGGCTTCGGCCCCGTTTTTTTTGGTTTTGATAGGACGATTAAAAATCATCCACCGCGGCAATTCTGTCCATGGATTAAATTCGACTGCCGAATTCTGTCCCGGATTTTAATCGGTTCTGGATGGATAAAATTCAGGCCGAAAAGATCAATAAAATCAACAGGTTAATCTCTCTATATATATTAAAATCCATTCATCCACGTATTATATAATAAGTACCTTTTTATATAGATTCAGGGGGTCTGTATATAAAGGGGTTGCAGAATGGACAATTAATGTGACCGCTGTTTTTAGCCGCTCCGTGTCGGCCATTAACACGACATTTATTGAGGCATTCCACACTCCTCTTGCCGCTACCAACGGCACACCGCAATCAACACGAGGAGCTCACATGGCACGCAGCCGCACACGCGCGCCTTCGTCTACGACAACCACCCAGATCACACGCATCAACGGCGCTCGCGTAAAGATCACCACCAAGGCTGGCAAGGTGACGACAAAGCCAGCCTTGCCGCTCGAATGGGAACTACAGGCGGCGCAGGTTTCCGCATTGCGCCGCCTGCCGCAATACCAGCGCCAGTTCCTGTTGGCCGGAGACATGAACGCCAGTAAGCGTGGGCCAAGGGCTCAGGCCCAGGCAATCGCAACGGGAATGACCAGCGGCGAACCTGACCTCCGCATCTATGGCGAATACGGTCGGTTGCTGCTGATCGAGAACAAGGTCGGGCAGGGAAGACTGTCGCCAGCCCAGAAAGACCGCCACGCGGCCCTACAGCGGCTCGGCTACACGGTTCTGGTCATTCGGGCCACCACGACGACAGAAGCCGCTGAGCGGGCCATTACGGCGGTTCTGGAGTGGCTGGCACAAGAGAAGGGGAAAGCAGCATGAAGAATACAAGACATGGATCTCTCGCAGAACAGTTGAAGGCACTTATGGCGTATCGCAATCGACCGGAAGGTCAGCAAAAACCATTACAGACGAACTGGTCTGTTGTGCCCGGCGCGAATGACAATGACCCGGAGGAAGTTGCCGACATGCGTTATGAGCGAGACTGGCGACAAACACCGTCAGTGCAAGCCATCATGCAGAATGTAGCTACTGACGATATCGAGAAGAATGAGAGTGGACAGACCGTCCGCATAGGAAAGCTGCGATTCAGTGACGGTAACCAGACTGAAGTCGGATATGTGCTCGGCATAGACGGTGAAGTTATTCAGGCCGACATACGCATGCCGACCGGCGCAATGCTCGGCATGAAAGATAAGCCAGATCGAGCGTCGGGCGGCGGAGTAGACCCGAAGGATACCAAGGCCAGCAATCACTATTTTGAAGATATGCTTGGGACACTGCCGCATCGATATATTCCATCCGGCAAGCGCCGAAATGGTACGGATTACAGCACTGAAGAATCCGCCCGAACTCTCGCAGATGCCTACGCAAACACCGACATGGAGAAGGTTACGTTCACACGATATCCAAAAGGGTTGCCGTGTGGCTCGCCCAAGGTGGCCGATAGTTTCCTCGGTATGAGAAAGACGACGTGTGCCGGTGGAGGAGACGAAGCGTGGGAAGATACACTGTCTGCGATGATCGATCGTGATCTGTGGTTTGAAGCGCTTCAAGAGTTGAAGGATAGGGATCGCGACGTTCTGGATGCCGCGTTGGAGGCCAAG
ACATATACAGATGTTGGCGTGGCTGCTGGTCAGAGACCAAAATACGCCACTTACAACGGCGGAGGAAGGCGCGCGCTACTGGACGCAAACGATAACTTGGCAGCCGCAATCAAAAAATATGCTGCTTAGTGTTTAAATTCCTCGATCTCGTGCGGAGTATAGTGAAGGGGTTCAACCGCTATGCGGTTGCCCCGCACTGTTCCGTGCGCGAGGCGACGGACGCTCGGTCATGTTGCAGTTGGGTGCAACCGCTGAACCGGGCGTAACTTTCCAAAACAGCGCCCAAACCTCTGCTGTCGGCTTGTCCGGTATTATGTATGGGCCTTCTCATTCCAGCGCCGTTTCTCCTCCGGCTGCTGGTTCGGCGGGTTGAGCCTATTGCGGTAGGCTCCCCGCCATCTGATTTCATGCGGAGTGGAGAAGTGGTCATCTCGTCTGGCTCATAACCAGAATATCGTCGGTTCGAATCCGACCTGCCGCAACCAATCGACCTGTTCTAGGGCTTGGCCCGCGAGATCGCCTTTGCGGTCGCAGGTCAACCAATTGCCCGTGTAGCTCAGATGGTAGAGCAGCCGCCTTGTAAGCGGATGGTCCGGGGTTCAATTCCTCGCTAGGGCACCAATCAACGAAACCTGACACGCCTATGCTCGCATGCGCATCAGTCGGGTTGCTTTCAAAAACGAGGAGAGAAATATGCAGACAGAACACTTCGATACTTATGCCGTCGCCAAAGCATTTAACGAATGGATGCGTAGTTACACGGAAGATCCATCGCAGTTCGACCATGAATGGCAAACGGTAACATCCTTCTTGAAGCAGACAGACGAAGGAGTTGAGCCTGACTATGGCAAGTCTTGTGCTGCCTATCTTGCGAAGCTGCTAGATGAGGCCAGCTAACGCTGACCAACGCCCTTGGAAGAGCTGGTACAAGCTGGCCCGATGGGAGCGCAGACGACAAGAACTGTTCGCAAAGCAACCGCTGTGCGTCAAATGCCTTGAGCGCGAAGAGGTAACGGTAGCCGACACAGCTGATCACGTGGTGCCACATCGTGGAGACCCAGACCTATTCTGGCATGGCGAGCTTCAGCCCCTATGCGCCTCATGCCATAGCCGACTGAAACAGCGAGAAGAGTTGGGACAGGACGTCGTTCGGTTCGGATCCGATGGGTGGCCGGTCGGTTGACGACCCCCGGGGCATCGAAAAGTCCAAAGGCGCTGCAGCCCCGGACCGGCGGGGACCCACAGCGCACGCATCCGCAATTGAAAATATGACCCCATAAGGATTTCATTCCATGGCAAAGCCGAGAAATCCCCTCGGCAAGGCCAAAGTCGAGGGGCGGGACAAGAAAGACCCACAGCGCTTCAAAAACCGCGCAGACCCAGCCGCAAACGGCCCGCTTGGCGCTCCTCCCGTCTGGTTGAGGGACAACACCGATATCAAGGCGAAGTCAGCCTGGAAGTTGTTCGCGAAAGAGCTGCCGTGGCTGAATGAATCACACCGCACGCTTGTCGGCATGGCCTCAACTATTCAGGGTCGCATCATGGCCGGACAGGAAGTTGGTGTGCAGGCGATGAACTTGCTTCGACAGATGCTTGGCCAAATGGGCGCGACGCCTGCGGATGCCTCCAAGGTGGCAACGCCAGACGAGGGCGAAGAAAAGGATGATCTGCTTGACTGATATGCCTGCGCTTGAGCGTGTGAGCGCTTATGCGCAAGCTGTCATTGATGGCAAAGAAGTCGCTGGCCCTCACGTTCGCAATGCGTGTCGCCGCCATTTCGATGATCTCGAACATGGTCACGAGCGCGGGCTGTATTGGGACGACGATGCTGCCGACCGCGTGTTTCGGTTCTTCGAAGGTCGGCTCAAGCTTTCCGAAGGCCAGTTCGAAGGCAAGCCTTTCAAGCTGCATGCCTCGCAGGCTTTCAAGCTGGGTTCGCTGTTCGGCTGGAAACGTGCCGACGGATCGCGCCGCTTTCGCCGTGCTTACATCGAAGAAGG
CAAGGGCAACGGTAAGTCACCGTTTGCTGGCGGTGTCGGTCTGTACGGACTAATCGCCGACAAGGAGGCTGGCGCCCAGATATATGCGGCGGCTGCCAAGAAAGAACAGGCGGGAATTCTCTTTCAGGACGCCGTCAAAATGGTGCGCGCCGCTCCTGCTCTGGTCGAGCGGTTGAAATTCAGCGGCGGTATCGGGCGCGAGTTCAATATCGCGCATCACAAGTCGCAATCGTTTTTTCGTCCGATCTCGAAGGATTCCGGCAAGTCTGGCTCTGGTCCGCGACCGCACTTCGCGCTTTGCGACGAGGTGCACGAACATCCCGACCGCTCGACCATGGAAATGCTGGAGCGCGGCTTCAAATTTCGTCGTCAGCCTCTGCTGTTGATGATTACGAACTCGGGCAGCGACAGAAACAGCATTTGCTGGGAAGAGCACGAGCACGCCGTCAAGGTGGCTGCTGGTACGCAAACGCCGGATGAGGATTTTACCTATGTCGGTGAGGTGATCGACGACACGACGTTTTCCTACGTCTGCGCGCTGGACAAGGGCGACGATCCGCTTAAGGACGAAACCTGCTGGAAGAAGGCGAATCCGCTTCTCGGCGTTATTCTGACGCAGGAATATCTGGCCGGTGTTGTCGCTCAAGCGAAGCAAATGCCAGGCAAACTGAACGGCATTTTGCGCTTGCACTTCTGCTGCTGGACCGATGCTGACAAGGCATGGATGCCACGCGAGACCGTTGAAGGCGTAATGGATGACTTCGACCCCGAAGTCGAACACGCTGACAAGCCGGTTTTCATGGGCGTCGACCTGTCGGGCAGCAAGGATATGACGGTTCTTGCGTGCGTTGTGCCGACTGGCTTCAAGGAAATGGAGCGGGAAGACGGATCTACCGTCAATCTGCCGACGTTCGATGCATGGGTTGAGGCCTGGACACCTGCCGATACACTCGAAGCGCGGGAGCAGGCTGACAAGGCACCCTATGCGCTCTGGGTAAAACAGGGCTGGTTGAATGCCCCGCCGGGCAAGCGAATTCGATATGACTTCGTTGCCTCGCGGGTGCAGCAAATCGATCAGTCCTTTGACATTCAAGCCATCGCCTACGACCGCTACGCTTATGACAAGTTCCGCGAGGAAGTCGCAGCGCTCGGGTTGGACATTGAACATGTCGCACATCCGCAGGGTGGCAAGGTTCGGGCACGTCCCGAGCCAGCAAAGGTAGAAGCGGCGAAAGCTGCTGGCTTACCGCTGCCTCAAGGGTTGTGGATGCCGGGTTCAGTTCTGGCGCTCGAGGATATGATTATCGACGGTCGCATTCGTATGCGGCGCAATCCGGTGCTCATGACCGCCCTGATGGGTGCCACCTTCGACCATGACCCGCAAGACAATCGGTGGTTTGTCAAAACGAAGGCATCGGTGCGCATTGACGCTGCTGTCGCTCTGGCAATGGCTGTCGGTGTGGCGATGGATACACCGATCGAGCCAGCCGACATCGACGACTTCGTCAACAACATGATCACCATAACCTGGTAGGAGTGCCCATGGGCCTTTTGACTTGGGTCGGGAAGCCTTTCGGGCTTCTTTCCGGCCCATGGCGCGCATTCTTTGGAATGTCGACGACAAGCGGCGAGACGGTCACATATGAACACGCCATGCAGCTTGATGCCGTCTGGGCGTGTGTGAACCTGATTTCGAATGCCGTGAAAACGCTGCCCTGCAATGTTTACAAGGGCGACGGCGTTGACGTCGACCGTGAGAATCCGCTGTACGAACTGCTACACGACTTGCCGAATCTGGATGACAGTGCGTCCGATTTCTGGGGCATGGCTGCCCTTTGCCTTTGCCTTGACGGCAATTTCTTCGCCGAAAAGAAGAAAAATGGCGACCGGCTGGTAGCGCTGAACCCGTTCAATCCGCTTTGCGTCGATGTAAAGCGCGATGACCGGAACAACCGCTACTACGAAGTCACCGAGCAGTACAAAAACGGCAAGAAGGGCGG
CGTTCGAAAAATCCGCGAAGAAGACATGCTTCATGTCCGCGGATTGGTCATGCCCGGCTGCGATCGCGGCCTTTCGCCTATCGCCGCACAGCGCAATGTGATCGGCAACGCCATGGCCGGCGAAAAGACGTCGGGCCGTATGTTCAAGAACGGCATGATGGCTTCGGTCGTCTTATCGTCGGATCAGGTTCTGAAGGCCGATCAGCGCAAGCAGATTGCGGAGTCGTTGCAAGCATTTGCCGGCGCCGATAAGGCAGGCGGGATCGCGGTGCTGGAGGCGGGCCTAAAACCGTCGCAGATCACCATCAATCCAAAAGATGCGCAAATGCTTGAGACGCGACAGTACAGCGTCGAGCAAATCTGCCGCATTTTCGGTGTGCCGCCGGTCATGATTGGCCATGCCGCGAATGGCACGACGACATGGGGCAGCGGGATCGAACAATTGATCCTGCAGTTCACTAAGACGTGCCTTACGCCCATGCTCAGAAGCATTGAATCCGCGATCTACCGCGACTTGCTTGATGCAAAGACCCGCAAAACGACCGTCGTTAAGTTCAATATGGAAGGCCTGTTGCGTGGCGATAGTCAGGCGAGGGCAGAGTTCCTGCAGAAGATGGTCCAGAACGGCATTTACACGCCGAATGAGGCCAGAGCTTACGAGAACAAGCCAAAGATGGATGGCGGCAACGAGCTGATCGTCAACGGCACCATGCAGCCTCTGTCCATGGTCGGACACAACGGCGGCCCACCGCTGGATGATGCACAGCCAAGCGCTGGATAAGGGAAATTCATGAAATTCGAACACATTTTGACGGCCTTCGAGGCCGAACCGTGGGCGATTCAGCGCGAAAAACTGGCCGTTCTGGCTGATGTTCTTGCGGCACGTGTGGCGGGCGACAAGCTCGTCACACCTGAATTTGCAGCGGCTGTTTCCGACGCTCGCGCCAAGGAAATTGCTGAAATTGACGGCAAGGTCGCAGTGATCCCGGTTTATGGCGTATTGGCCGACCGAATGGACCTGTTTTCCGCGATGAGCGGCGGCACTTCTTATGCCGGCATCAAGCGCCAGCTGCACAAGGCACTGTCCAACGAGGATGTGAAGGCCGTTGTTCTTGATATTGATAGTCCTGGCGGCTCGGTACCGGGCACGGACGAACTCGCAACGGAAATTCGTAAGCTGCGTGGCGATAAAAAGCCGATCATAGCGCAGGTTAACTCGCTGGCTGCGAGCGCTGCCTACTGGATCGCGTCATCTGCCGACGAAATCGTTGTTACGCCGTCCGGGCGGGCAGGTTCGATCGGTGTCTATACGGCGCACGACGATATCTCTGCCGCGTTGGAAAAGGCTGGCGTCAAGCGAACCTACATTTCGGCCGGCAAGCACAAAGTCGAAGGCAACGAAACCGAACCGCTCGGCAAGGACACGCTGGCCTACATTCAGGACAGCGTAAACCGCTCCTACGGCCGGTTTTTGCAGAGCGTTGCCGATGGGCGTGGCATCACGAAATCCAAAGTCGAAGACGGATTTGGTCAGGGAAGGGTGTTCTATGCAGAAGCGCTCATGGACAGAGGAATGGCAGACCGTATTGCCACACTTGACGAGACCTTGGCCCGACTGGGCGCGAACACCGAGCCGGAATACGTACGCCGCGTAAAGGCGTCCAACGCCGCAAAGGCAGAAGCCGCGCAATTGTTGGCCAGCAAGATGGCCTCCGGCGAAGAAATCACAAAACGCGAATTCGAGAACGGGATCAGGGGACTGATCGGCTTGTCGAACTCGGAGGCGGAGCGAGCCGCATCGCTCTACTTCAAGGAACATCAGGGGGAACCTGATGCTGATGCGGAAAACGCCGCTGTTTCGGCGGCCCTGGAACGGCTTTTGGCCGAAACACGCACTTTCACAATTTAGTATCAGGAGGACATATGTCCGAAGTTTCACTTGCCGAGAAGATCGGCGAGCTTGGCCAGTCTTTGGCTTCTATCAAGGAAAAGGTCGGCAA
TCTTGCGGCCGACTTCACCACGCAGCTCCAGCAGCACGGAACCGTTTCAACCGAGCTGACCGGCAAAGTCGATAAGGCATTGTCTGAACTCGGCGACACCACGACCCGTATTAGCGAACTGGAAAAGCGCGCCGCACGTGAACGCGATGATGTCGCGCAGGGGCCGCAGGACGTCGGCGATATCGTTGTAGCGTCTGAAAAGTTCAAGTCGACCGACGTATCTGGCGCATGGCGCGGCTCGATCCGTGTTGGTATGGAACGCGCTGACATCACGTCCGGCAATACCACGGTTGGCGCCGGTCGTTCGGCCGGAACCTCGCTTGTCCCTGGACAGCGCGTGCCAGGCATTATTGCCCCGCCTAATCGCCAACTGACGATCCGCGACCTTATTGCTCCGGGCCAGACCTCGGCTGCAAGTGTCGAGTTCGTCAAGGAAACCGGCTTTACGAACAGCGCAGCGCCAGTCGCTGAAGGCACTCAGAAGCCGAAGTCTGACCTGACCTTTGATATGGAAACCACGCCTGTTCGCACTCTGGCCCATATCTTCAAGGCAAGCCGTCAGATCCTCGATGACGCTCCGGGCCTTGCAAGCTATATCAACGCTCGCGGCACGTACGGGCTCAAGTTCGTTGAAGAAGGCCAGCTTCTAAACGGTGACGGTACTGGTCAGAACCTGCATGGCATTCTCCCGCAGGCATCGGCCTTCGCTCCAGCCTTCACTCCGGAGAACGAAACGGCAATCGACCGCCTCCGACTGGCAATCCTGCAGGTCATTTTGGCCGAGTATCCGGCGAGCGGTTTCGTTCTGCACCCGACGGACTGGACCAAGATCGAGTTGACCAAGGATCTTGGCGGCAACTACATCGTCGGCAATGCCCAGTCGCCGATCGGTCCGTCGCTGTGGAATCTGCCGGTCGTCCAGACGCAGGCAATTTCTGCTGGCAAGTTCCTGACCGGTGCGTTCAATCTCGGCGCGCAGATCTTCGACCGAATGGGCGTCGAAGTGCTTCTCTCCAGCGAGAACGACAAGGACTTCGAGAACAACATGTTCACGATCCGTATCGAAGAACGCCTTGCACTGGCGGTTTACCGTCCAGAGGCCTTCGTGACCGGCGACGTCAATCCGCCTGTAACTCCTTAATCGTTGATGGGGCGCTTCGGCGCCCCTTTTCACGAGGAAATCATGAAAATCAAAGCGCTTAAAACACTTGTCGGCAATTACGGCCGATTGGATGAAGGCATGGTCGCCGATCTGCCGAACTGGCAAGCCGGCCCTCTTCTGGCGCTTGGTTACGTCGAGAAGTTTACGGAGGTTGGCGATGGCCGACACGAAGACACGCAAGCGCCGGGTGGCGAGCTACATCGGAGCGGGAATCGTCGATCCAAATCCGGCTCCCGAGCCAGAGCCGGAGCCTGAAACGCCATCGGAGGGTGGTGGCGATGGCACTGGTTGACCTCGAACTGCTGAAGAAACACCTTCGCGTGTTTCATGATGACGAAGATGCTGAGCTTGAAGTCTATCTGGCTGCGGCAGAGGCAATCGTCATCGAATGTGTCGACCGGGAGATCGTGGCTACCGGCGCGACGCCTACCTTGCCGGATGGCATTGAGTTAACCCCGCCCATCACGGCAGCAATTCTGTTGGTCGCGGCTGATCTGTACGAGAACCGAGAACCTGACATGAAAGCCGAAGGCAACGCCGTTCTGCCACGTCACGTTCGGGCGCTGCTGGCGCCATATCGGGTTTGGCGCACTCTTCTGGTGGAAGAATAATGCCATGGCTCAACTTCACAGCCACTTACGACTTCATCCCAAAGCCTGCGGTAACGATCAGATACCCGGCAGGGTACATCGGGCTGGTGACCACGCCTTGCGCTAACCGCGCCGTTGCTGCCGGCAAAGCCGAGCGGCTTCCAACTCCCACAAAAGACGAGGCTGAAGCATGGCGAAGCGCAAAAGGCCAGTCTATCCCGACATGACGTGTGAGGTATGTGGGTGTC
AGACTCCTCGGCGTCACAATAGGCAAAAATACTGCGTCGAATGTAGCGAGAAGGTGATAATTGCCGCAGAAAGAGCCAGACCAAAGCGGGCAATCAAGTTAACAAATTGCATAACTTGCAACCGCATCGTTCAATTTAGATATGCCCCTCCGAAATTTTGCCCTGCATGCAAGGTTGAAGATAAGCGTGCACGTGACAGGGTAAATCAGAAGAAATACCGTCAAAGCGAAAAGTCTAAGCAGCGAGAACGTGAAAGATCTGCGATACGAAATAAAACGCCGGAAGTGCGTCAATATCGCCGAGAATATGAGCAATCTCGGAAGGATGCTGATCCCAAATTTGCTTTAGGCTTTAGAATGAGAACGCTCATTCGTGATTCACTCAGGCGACAAAAAGGAGGGGAATCATGGATAGAGATGGTAGACTTTACTGTCTCCCAACTCAAAGCTCACCTCGAGCGACAATTCTTACCGGGGATGAGTTGGGCGAATAGAAAAAAGTGGCACATCGATCATATCCGCCCGATTGCGTCGTTTAATTATGACGGCCCCGAACATCCCGATTTCAAAGCCTGTTGGGCGCTGACCAATCTTCGGCCGATGTGGGCGAAAGACAACCAATCGAAGGGTGCCAAACAGGTTTACCTTATATAGAGGGGCTAGAATGTCAGCGGGCCAACTCAATCAAAAAATCACATTCCAGCGGCGTGAGGTTATTGAAGATCCTTTCGGCGGAACTCGCGGTGGGTGGGTCGACCAGTTCACTGTTTCGGGAAGGCTGGAACCGCGATACGGCAGCAATGCAGAAAGCGTTATGGCTGCGCGAATGCAGTCCATGCAGCCGTATAACCTGACGATCCGTGGCAGCACTGCGGCAAGGCAGGTGACAGCGTCTTGGCGGGTCTATGACGCTCGGGCGGGGAAGACCGGGGATAAGCCAAACCGCGTGTTTGGCATTAAGACCGTCGTCAATCCGGACGAACGCAATGCCTATTTAGAAATGCTCGTCGTCGAAGGCGAGGAAACGTAATGGCGGTAAAGACCAAAGGTCTGGATCGCCTGCAGATCAAGTTAAAGAAGTTCCCGGAAGTTGCTGAAAAGCTTGTCCGCGCTGCTATGGAACAGGCGGCCGACGAAATCGTTGCGATGATGAAGCGTCTCGTCCCGGTTGATAATGGCGATCTCCGAGATAGCATCGGGTGGACGTGGGGCACAGCCCCAAAATATAGCCAGCGCATTGGCAGCGTTAAGTCGAATGACGGCAAGCTGACAATTACGATATACGCCGGCAATTCAAAGGTGCGCTATGCACATCTTGTTGAATGGGGGAGCGCACCGCACGTCAACGGCGGCATGTATCCAGGTACTTTCAATCCCGGGGCAAAGGCACAGCCGTTCTTCTACGTCTCGTGGCGAGCCAAACGGCGAAGTGCGCGGGCGAGGGTATCTCGCGCTATTACCAAGGCAGCCAAACAGATAGCGGCGGATCGCTAA
Protein sequences of DBSCAN-SWA_5 >NZ_CP019390|1611906:1647321|1633219_1633789_+|WP_076771318.1|DBSCAN-SWA MEYTIKAIPTEYAGVQFRSRTEARWAAFFDLVGLKWDYEPLDLERWAPDFILRTSLTNVLVEVKPVDLTAYIDAVNRGGDDVAQLSSYDKALAHSRKHQVLLLGIAPLEMQGATLPIGIHTTPPRGAEYSFDDMQDALTVGDTSFVTDAWRKAGSVTQWTVDEPDLSTSQIVSRALNRAHRKAEQKRAA >NZ_CP019390|1611906:1647321|1620588_1620795_+|WP_076771306.1|DBSCAN-SWA MGAGMNVTTKATADLPYIDPGRKPGVGRIGQSLALAAFALAIATTIAAFLFWNLLLPFYGLLYLWGAH >NZ_CP019390|1611906:1647321|1631681_1632728_+|WP_076771316.1|DBSCAN-SWA MTAYYNEFDPKAAAWMRELIKAGHIAPGDVDERSIVDIRPSDLIGYTQCHFFAGIGVWSYALRRAGWPDDRPVWTGSCPCQPFSAAGKGAGFTDERHLWPHFHWLIDNCRPPVVFGEQVASKDGLGWLDLVQADLEGSGYASGAVDTCAAGFGAPHVRQRLYWVAHAYDKGSQGRCGVPERTAERAVGSGGLADGLADSASFGRVGRRSGGEGHELGEIERSERLRDASWLADTHGGDTCAERQQHGGEQRQQPQNCGMLHHGSTSPTNGHWRDADWLFCRDGKWRPVEPGTFPLAHGVAARVGRLRGYGNAIVAPAAQAFIEAYLGTEVVAANDNYLSRPDVAPRAS >NZ_CP019390|1611906:1647321|1614175_1614598_-|WP_076771296.1|DBSCAN-SWA MTDLITRLSKLDAPDREVDAEIAVLFSGDPDAYVVRPNEGAMFTHVPGWFATGDNKSHKSPEYTASVDAALALAEKVLPGWTFEHIGQDYIRASGLDNDVMPMGWTVEISDGSQTIQGQAPTFPLAICIALLRAKEASKP >NZ_CP019390|1611906:1647321|1614594_1615065_-|WP_076771297.1|DBSCAN-SWA MKAEKREVFISDDGKVFDTVEACQAHERELAANAKRLDNLKGYIVRSSFDATEGRGYHQTTYILTDASFSVVLEYCLDRFGRPLSGWYGDGHYEQWLIRQSDEPAAELFKRSKGPHSGVGLSAGAADVVFLSSKPIDHPDIPAPVYPWPKKKDAKR >NZ_CP019390|1611906:1647321|1633785_1636149_+|WP_076771319.1|DBSCAN-SWA 
MNMTLASDLSRDLALAYIAADIPVFPCRAKDEETNEFDEETGEIVVLKAKTPLLSNGFKGATKNLRVTNILWDRNPGAMVGIPTGEQLGAWVLDVDVHKDENGEIIDGFETLSALEDKFGPLPKTATARTAGGGEHRYFKYVPGVRNRGRLGAGLDVRGSGGYVVAPGSIMDDGRAYKWVDYSGPGLPPLADAPQWLLDLVLPKEPISIGADYTYDRGSNDAYIDRAIQLELEETASVPMGAGRNNRLNAAAFSLGTLVGAGALPEHEARQLLQDVARGWGRDWVKCCKTIENGLSAGMRQPRQIPERSFYDDDSTPPVSVTGLIEKYRNRHDDDVTDSDHRADADEAVPESDDDDVPDYKLEAVADLESLTYPGGLVEDMIDWIVSSAEQPSRTLAMAAVLPLLASLAGARYSTGSRDTRPNLYTVALAESGFGKEHARSQIKRILMADQGIFDAYSGPARIMSASALREVLEKHSSVNCQIDEFGGFIRDITDRKAGSHQRAISTDLRDYYSASSTFFEGAAYRGVPPKRIYNPTLCIHGTSTPEQFWSALSSASAEDGLLPRLILFHVTGEKPYAVKPSRDVREVPYLLMERMASVAGINVAAKRGNLSGMNIQVPAYGENKPYIVQWTPDATALFRSVKDSIDAREKMLASEARPFARRIIENAIKLALIVAVGKDPTEPVITETDFEWASCVAWTCAATMIAEVTERLADNDREANYKRIVGLIRKAGTKGITESRLFDRCKAIEGRRREEILKELFHTGKVIKQDAKSKRGRPANRLVWMD >NZ_CP019390|1611906:1647321|1616811_1617138_-|WP_154146644.1|DBSCAN-SWA MERDPYEELGTVHFRRTELAIIEALSDVYPRRMHMRDLVDNVYAFDPNGGPENAVGVIRTTMVGLRKMLPRHGWKIPKQGCGRGNQGFYYLEPVANDNIPAKDRRAAA >NZ_CP019390|1611906:1647321|1622142_1622367_+|WP_076771988.1|DBSCAN-SWA MKEIAGLLKKQNYRCVYCRAPIRKKKNRHVDHIMPLKLGGSNDITNIQLLCPTCNMSKKASHPVDYARRIGLLV >NZ_CP019390|1611906:1647321|1615195_1615978_-|WP_154144882.1|DBSCAN-SWA MASDICYVCGTRNMRGSVCLTCCPDGDPRLKPANTRPAPAATDTGLVTELAYKFWSVHPKEIDQFLVTHPIPENYKGGGRAWFFEHHMRELLAAERAEKEIAERNRDAARDNFLTMQKTAAKLLERAEKAEADNAALTTRVKELEADLKTFQDGTADLVNEKIDAEFRAIDLEAKLEAAEKSAFEHGYVLACCNIVNLHDEPTIAHCTLSELGVTRAAVKAMDLSEYDLKALRKIERDRSRSPYAKASRKARAVLGGKPS >NZ_CP019390|1611906:1647321|1630419_1631364_+|WP_076771314.1|DBSCAN-SWA MTFVLDQSSGENWAAYNADCVPFAAGLPDSSIDFSVYSPPFSSLYIYSESVADMGNCASDDEFFEQYRYLVREKLRVTRPGRLTAIHVKDLVYYQNSSERGTAGLRPFSDDCTRLHIDEGWDFHSRITIWRDPVREMQKTKAHGLLWKTLRADSTFSRMGLPEYLLVFRKWAKPGEEVVPVTHTKESFPVEEWQDHASPVWNFSKQDLPETDVLNVKVARSDKDEKHLCPMPLNITKRALRMWSNHGDRVFSPFMGIGSEGYVSLQEGRRFVGTELNPNYFRQAVKNLTEAAAYSEAPTLFVSNDNNPEAANAA >NZ_CP019390|1611906:1647321|1631353_1631674_+|WP_076771315.1|DBSCAN-SWA 
MQPDTDICHVCYRHAIGLGVQADREPVRWLCKECADIAEHIRHRRRLDPYELRALDTGVEAVGEYLVSIQKTDLKEMDELEARMLVRAAWEGCGRGMRAALSEAPF >NZ_CP019390|1611906:1647321|1613897_1614179_-|WP_076771295.1|DBSCAN-SWA MNIREAFSVSGIPQGEHGEDGFYYEVGHDYRRLIDRQWKIVGKVSRITVTDDLPGMHCYMERVRVYDGKTLIFEAPLHNVEGVCYPAPEASNA >NZ_CP019390|1611906:1647321|1613510_1613723_-|WP_076771293.1|DBSCAN-SWA MKLTEARFAQRCAYIAKHASDWAAELLDFVEDREREADPESVFRFTDSVRERLDRLDEEAGRQALKGGYE >NZ_CP019390|1611906:1647321|1613071_1613263_-|WP_179947174.1|DBSCAN-SWA MAAYTYADTERWLDAIAGVIAYFPETEQNLLPLYERVEGMQRNIAANDNIRDRIKSRMNRTAA >NZ_CP019390|1611906:1647321|1618134_1618584_-|WP_179947175.1|DBSCAN-SWA MVPTHNSLLHICFRRGIRFLVSGGFSGHAESTADVSVGSVPFEIYGPCGVEAMRLLSAVFIMALLLEKWFLSIKGENPWEDVHSVACHFVIPTSIASVDDWTIWGLIPFVNVIFPILSDGHKKCRKRGIVIYHYRNVRCVFVEPVAAAR >NZ_CP019390|1611906:1647321|1619277_1619580_+|WP_076771304.1|DBSCAN-SWA MTCECGECWDLPGEIVVHRLWKWKGIIIEERDSFRWLTVRFMIPGTGLVQLEVSRFEVEPDFEEDGGGVEADKPEDDNVIPVDFTKKEKLTKNTKTRGVA >NZ_CP019390|1611906:1647321|1632693_1632903_-|WP_025091060.1|DBSCAN-SWA MSRSTLKGGRYIGGRYKASNGINDTWSVVDQKVQKPVEMGVRQAVGMNYRDAKELSDLLNSLEERRQAC >NZ_CP019390|1611906:1647321|1613265_1613514_-|WP_076771292.1|DBSCAN-SWA MSRLDRAIIRNVISGNVAFTDEVAKRFADIGRLHLCRNIPTDRWHVIITSGDQIKLSWLAHMCLSIASKYASHGRAALRERE >NZ_CP019390|1611906:1647321|1617139_1617406_-|WP_076771302.1|DBSCAN-SWA MSATAPRRRNSKPRPNEIIGGGFFVFRRGKKTGRVGVFTTMPYEHGSFEQAVAEATRLAALCPGETFEVFQTSGAVACRTPVELAEAA >NZ_CP019390|1611906:1647321|1617560_1618175_+|WP_076771303.1|DBSCAN-SWA MKQVTVDKDRFRAAMKKGDWSMKALSLEAGMGETFVRDILDRGAVPKIDSLKSVANILNTTVGYLIGEEPGVTSVVGRVGADTSGEIVYGTGDGGFGDVIIPPGAGPESVAVEVHGYSMGTFVDGALIFYSDQKLAPSDEMLGDMVVVGLSDGRALLKRLTRGSRPGLYDLESLNGPTMRDQEVVWAADIESIVPPRQARRIRI >NZ_CP019390|1611906:1647321|1646491_1646857_+|WP_076771331.1|head,tail|DBSCAN-SWA MSAGQLNQKITFQRREVIEDPFGGTRGGWVDQFTVSGRLEPRYGSNAESVMAARMQSMQPYNLTIRGSTAARQVTASWRVYDARAGKTGDKPNRVFGIKTVVNPDERNAYLEMLVVEGEET >NZ_CP019390|1611906:1647321|1624238_1624769_+|WP_179947177.1|DBSCAN-SWA 
MTLTQKRVREVLHYNPWTGIFTWRRRQDLPANWNERWAGKEAGSVNDRGYIVIRVEYKRYRAHRLAFLYMKNYMPDEVDHIDHCRGNNAFSNLRDANRSTNGKNVSMHSTNTSGVTGVHWDNKRGKWVAQITVDGEVKYLGGYNSIDAAKHVRMAANDQFNFHSNHGKPEYIVGAA >NZ_CP019390|1611906:1647321|1638711_1639011_+|WP_076771322.1|DBSCAN-SWA MRPANADQRPWKSWYKLARWERRRQELFAKQPLCVKCLEREEVTVADTADHVVPHRGDPDLFWHGELQPLCASCHSRLKQREELGQDVVRFGSDGWPVG >NZ_CP019390|1611906:1647321|1633003_1633192_-|WP_076771317.1|DBSCAN-SWA MAEARYDTLEMYDGVWAVIDVWTGDAAEVNGIKQIGLEWEQADDLADLLNYLYAQREAATLQ >NZ_CP019390|1611906:1647321|1627547_1628288_+|WP_076771312.1|DBSCAN-SWA MTDNYDPYNPAPIGHNRPPLTAYETIKQEIEDLFDEAKNFADGEAIDNQQLADAITELHDKLHEAGKRADEARKDEAKPHDDAKAEIQDRYNKLIGNTKSVKGKVVLGKETLQALLTPWRNKLVAEKEAVARAAREEADRIAREAQEAMRASAGNLEEREKAEELLAEAKKADRWAKREDRSATTGTGLRTIWHCKLEDEGKALDWAYGRAPERFKELVQSMAEETVRAGMRQVPGFKVWDERVAR >NZ_CP019390|1611906:1647321|1622398_1622962_+|WP_076771307.1|DBSCAN-SWA MAKLASRFDATAHDTEQRDYEELPNGDYELEIEASEVKEGANGTGLKTTMTVLRPDEYQGRKVFNFYNLEHKNAQAQEIGQRQFASLCRAIGVSEVEDSEELHFKAFTAKIGLGKASKNKETGQEYPARAEIKKYYFPDEGNVPQPSIDANQPVAQARPANDNRPAAANSNKTAPAAAAAVKKRPWG >NZ_CP019390|1611906:1647321|1620427_1620604_+|WP_154144888.1|DBSCAN-SWA MTSTTYSHTRNFQPKDYAEGDSFYEPETTLGLGDRFLWGLAVIAALALTVGFYGWVLA >NZ_CP019390|1611906:1647321|1616590_1616815_-|WP_076771300.1|DBSCAN-SWA MNRALLEMLADDEFETETDAPKAGNVEPMRRPDYRAKKHGRPQPWLRYAAREAVEMTVVVAFCVAVCGFGLGMA >NZ_CP019390|1611906:1647321|1624768_1625710_+|WP_076771310.1|DBSCAN-SWA MAALPKAESSTVRAIYQAYEAQAKSWDSWGISVGEAGTECDRALWYGFRWVSAHEVHSGRQLRLFATGNIEEDRLVADLESIGVDVYGQQDKIRLVSGFVRGKCDGKAMGVPEAPKTEHLLEFKSSNEKGIKELQKHGCQKSKPLHYAQCQLGMQAFGLTRCLYLASCKNTDTLYAERIEYDVEFCLRLLARCERIVFSDEPPSRISEDPEFFGCMFCKHRGVCHEGVQPRVNCRTCLHVQPEHGGDCHMSCARWSKPLSIDEQRDGCPAHLYLPGLISGEQIDVDEERETVTYRLATGEIWVDGVNDNKKVA >NZ_CP019390|1611906:1647321|1643751_1644969_+|WP_076771326.1|capsid|DBSCAN-SWA 
MSEVSLAEKIGELGQSLASIKEKVGNLAADFTTQLQQHGTVSTELTGKVDKALSELGDTTTRISELEKRAARERDDVAQGPQDVGDIVVASEKFKSTDVSGAWRGSIRVGMERADITSGNTTVGAGRSAGTSLVPGQRVPGIIAPPNRQLTIRDLIAPGQTSAASVEFVKETGFTNSAAPVAEGTQKPKSDLTFDMETTPVRTLAHIFKASRQILDDAPGLASYINARGTYGLKFVEEGQLLNGDGTGQNLHGILPQASAFAPAFTPENETAIDRLRLAILQVILAEYPASGFVLHPTDWTKIELTKDLGGNYIVGNAQSPIGPSLWNLPVVQTQAISAGKFLTGAFNLGAQIFDRMGVEVLLSSENDKDFENNMFTIRIEERLALAVYRPEAFVTGDVNPPVTP >NZ_CP019390|1611906:1647321|1639508_1641362_+|WP_076771324.1|terminase|DBSCAN-SWA MPALERVSAYAQAVIDGKEVAGPHVRNACRRHFDDLEHGHERGLYWDDDAADRVFRFFEGRLKLSEGQFEGKPFKLHASQAFKLGSLFGWKRADGSRRFRRAYIEEGKGNGKSPFAGGVGLYGLIADKEAGAQIYAAAAKKEQAGILFQDAVKMVRAAPALVERLKFSGGIGREFNIAHHKSQSFFRPISKDSGKSGSGPRPHFALCDEVHEHPDRSTMEMLERGFKFRRQPLLLMITNSGSDRNSICWEEHEHAVKVAAGTQTPDEDFTYVGEVIDDTTFSYVCALDKGDDPLKDETCWKKANPLLGVILTQEYLAGVVAQAKQMPGKLNGILRLHFCCWTDADKAWMPRETVEGVMDDFDPEVEHADKPVFMGVDLSGSKDMTVLACVVPTGFKEMEREDGSTVNLPTFDAWVEAWTPADTLEAREQADKAPYALWVKQGWLNAPPGKRIRYDFVASRVQQIDQSFDIQAIAYDRYAYDKFREEVAALGLDIEHVAHPQGGKVRARPEPAKVEAAKAAGLPLPQGLWMPGSVLALEDMIIDGRIRMRRNPVLMTALMGATFDHDPQDNRWFVKTKASVRIDAAVALAMAVGVAMDTPIEPADIDDFVNNMITITW >NZ_CP019390|1611906:1647321|1624026_1624242_+|WP_156884116.1|DBSCAN-SWA MDRGLDAATGVGDGSDNPEPTIFNAIEYALRHEGVTEIAFSEDGEYEVEIHEASSLMPFVKCLLRELEVIT >NZ_CP019390|1611906:1647321|1637039_1637957_+|WP_076771321.1|DBSCAN-SWA MKNTRHGSLAEQLKALMAYRNRPEGQQKPLQTNWSVVPGANDNDPEEVADMRYERDWRQTPSVQAIMQNVATDDIEKNESGQTVRIGKLRFSDGNQTEVGYVLGIDGEVIQADIRMPTGAMLGMKDKPDRASGGGVDPKDTKASNHYFEDMLGTLPHRYIPSGKRRNGTDYSTEESARTLADAYANTDMEKVTFTRYPKGLPCGSPKVADSFLGMRKTTCAGGGDEAWEDTLSAMIDRDLWFEALQELKDRDRDVLDAALEAKTYTDVGVAAGQRPKYATYNGGGRRALLDANDNLAAAIKKYAA >NZ_CP019390|1611906:1647321|1638521_1638725_+|WP_076771990.1|DBSCAN-SWA MQTEHFDTYAVAKAFNEWMRSYTEDPSQFDHEWQTVTSFLKQTDEGVEPDYGKSCAAYLAKLLDEAS >NZ_CP019390|1611906:1647321|1616170_1616431_-|WP_076771299.1|DBSCAN-SWA MSRSITHAAMTPILTAVEFQQQGTTAAQVLSISKAVRALGYHSEAETLRDTAFELARITGVRFRYGAPRQRRSPANDNRRRLRRAV 
>NZ_CP019390|1611906:1647321|1623022_1624060_+|WP_076771308.1|DBSCAN-SWA MRVSIDRSQLAHALATVNRAIESRNSIPILANVLLAVEVGQLRLTGTDLDVEITTSLPVLDCQPGSVTVPGKMLADIAKRATSDITLELDGGRLTVASGRSRYKLDVLPAEDFPSFSAGKFDTTLELDLAALVAPCVHCISTDETRYYLAGVYLHAVDGRLVAVATDGHRLMRNTGPEGTLDYGVILPRKLVGLVPKGAVTVELSQNKVRVTSGSTVITSKLIDGTFPDYVRVIPTGNSNVLTVDRQALMKAVERVAAVADDKSRAVKFSVGDVLRLMLADKASDEVEATFEGEPLEIGFNARYVNDMLGALDEPNVRFALGDAGMPAVVKGDGEWTAVLMPLRV >NZ_CP019390|1611906:1647321|1628284_1628485_+|WP_154144892.1|DBSCAN-SWA MNTTAAPVTTPVNPADLSAVIRHAFIAGRGRTEGMRLSPDDIDAWVDYDPSGNPAYQRIITALFGW >NZ_CP019390|1611906:1647321|1641370_1642609_+|WP_076771991.1|portal|DBSCAN-SWA MGLLTWVGKPFGLLSGPWRAFFGMSTTSGETVTYEHAMQLDAVWACVNLISNAVKTLPCNVYKGDGVDVDRENPLYELLHDLPNLDDSASDFWGMAALCLCLDGNFFAEKKKNGDRLVALNPFNPLCVDVKRDDRNNRYYEVTEQYKNGKKGGVRKIREEDMLHVRGLVMPGCDRGLSPIAAQRNVIGNAMAGEKTSGRMFKNGMMASVVLSSDQVLKADQRKQIAESLQAFAGADKAGGIAVLEAGLKPSQITINPKDAQMLETRQYSVEQICRIFGVPPVMIGHAANGTTTWGSGIEQLILQFTKTCLTPMLRSIESAIYRDLLDAKTRKTTVVKFNMEGLLRGDSQARAEFLQKMVQNGIYTPNEARAYENKPKMDGGNELIVNGTMQPLSMVGHNGGPPLDDAQPSAG >NZ_CP019390|1611906:1647321|1642618_1643737_+|WP_076771325.1|DBSCAN-SWA MKFEHILTAFEAEPWAIQREKLAVLADVLAARVAGDKLVTPEFAAAVSDARAKEIAEIDGKVAVIPVYGVLADRMDLFSAMSGGTSYAGIKRQLHKALSNEDVKAVVLDIDSPGGSVPGTDELATEIRKLRGDKKPIIAQVNSLAASAAYWIASSADEIVVTPSGRAGSIGVYTAHDDISAALEKAGVKRTYISAGKHKVEGNETEPLGKDTLAYIQDSVNRSYGRFLQSVADGRGITKSKVEDGFGQGRVFYAEALMDRGMADRIATLDETLARLGANTEPEYVRRVKASNAAKAEAAQLLASKMASGEEITKREFENGIRGLIGLSNSEAERAASLYFKEHQGEPDADAENAAVSAALERLLAETRTFTI >NZ_CP019390|1611906:1647321|1639120_1639507_+|WP_076771323.1|DBSCAN-SWA MAKPRNPLGKAKVEGRDKKDPQRFKNRADPAANGPLGAPPVWLRDNTDIKAKSAWKLFAKELPWLNESHRTLVGMASTIQGRIMAGQEVGVQAMNLLRQMLGQMGATPADASKVATPDEGEEKDDLLD >NZ_CP019390|1611906:1647321|1620934_1621756_+|WP_179947176.1|DBSCAN-SWA 
MALNWNDGIPDDTNENEPAFCVFYAGAKSGKTTLASEFPSPLYIRTGKGERAPAGVTMKSFGVSESYRDVMDQADWMLDAQHDRKTFILDSADGLEQHIFAEVCAKNKVASIEDIPYGKGYTQASEIWHEFIAKVMQLKEAGFYVVVIAHVKSKTVPGVTTDSYPRYMPNLRDDAVGIVVDAADLIGFLHQRVSIRKEDVGFNKKNTRGEGGGDMLIAVQERPGFIAGNRYDIEKPTLPFKRGEGFKVLDYYFQRNSLAPDNDNASERQEEAA >NZ_CP019390|1611906:1647321|1611906_1613103_+|WP_083699655.1|integrase|DBSCAN-SWA MTRRDEYESADDALWAMLKAGADGDKARALYDAAIKRAEAIGISYVPADRLLSFTDEALAARLNLVTGNPEEDAAAVGAASIPSVSVTQALKIYFDEITPDELTGKSEIQKKRWRAHKQRAIDHFVKIVSDKAIADITREDAQKFYKVWLQMITKPAKGKQPISASMGNRMMGGMRVLFSEYFKHMGNRDRPNPFRDLSFAEKVEKSRPPIPTDIIQGKFLTYGPLVSLNEEARGIVLAMIETGCRPSELCNITAEHIFLADKVPHILIAPRKDAADPREIKTASSVRKLPLVGIAHEVFKKHRNGFPRYKNKEDTLSATLNKYFKDNELFPKGAGYTVYSLRHSFEDRMKEAGLDDELRRMLMGHTVDRPRYGTGGSLEWRKEQMEKFTLPFDSSVI >NZ_CP019390|1611906:1647321|1636545_1637043_+|WP_076771320.1|DBSCAN-SWA MARSRTRAPSSTTTTQITRINGARVKITTKAGKVTTKPALPLEWELQAAQVSALRRLPQYQRQFLLAGDMNASKRGPRAQAQAIATGMTSGEPDLRIYGEYGRLLLIENKVGQGRLSPAQKDRHAALQRLGYTVLVIRATTTTEAAERAITAVLEWLAQEKGKAA >NZ_CP019390|1611906:1647321|1646856_1647321_+|WP_076771332.1|DBSCAN-SWA MAVKTKGLDRLQIKLKKFPEVAEKLVRAAMEQAADEIVAMMKRLVPVDNGDLRDSIGWTWGTAPKYSQRIGSVKSNDGKLTITIYAGNSKVRYAHLVEWGSAPHVNGGMYPGTFNPGAKAQPFFYVSWRAKRRSARARVSRAITKAAKQIAADR >NZ_CP019390|1611906:1647321|1615061_1615199_-|WP_154144880.1|DBSCAN-SWA MIVTTTWHNAGPHTIYGKLETRLGRKPTDKEAADEVRRILQEARS >NZ_CP019390|1611906:1647321|1645268_1645601_+|WP_076771328.1|head,tail|DBSCAN-SWA MALVDLELLKKHLRVFHDDEDAELEVYLAAAEAIVIECVDREIVATGATPTLPDGIELTPPITAAILLVAADLYENREPDMKAEGNAVLPRHVRALLAPYRVWRTLLVEE >NZ_CP019390|1611906:1647321|1625709_1627455_+|WP_076771311.1|DBSCAN-SWA 
MQPRYYQNEATDSVFDYWVEEPGHPLVDMATGTGKSMTLAMIFQRLYTGWPDMRLCCCTHVVELVEGNFKELLGIAPFAPLGIYASALGRRDARAQILFAQLQTVYSKAAQIGHVDVLAIDEVHLVPNDANTMYRQFIDALLAINPDMKIVGLSATPYRLDSGRLDEGDDRLFDKVVYTYGIRQGIDDGYLSPVTSKPTETKQDTSHVPMRGNDLAKGALQDAVDRDDLNGRILEEVFDTEGQRRTALFFCAGVKHATNVRDMVRSMGKSCEVLSGNTPRAERRNIIEACKAGEIWGITNDNVMSTGTNVPRIDLIVDMARTKSASRYVQRVGRGTRVLYPPRFDPEAVGPEERRAAIAGYLKPNCRYMDFAGNVAEHGPVDMIEPRKPAKGDGQAPIKVCPTCNEQLHASLRICWCCGHEFEFDETPKLQSHATDAPIVSVATPETREVTRRTFAYHEGKGGKQDSVKVSYWVGMSPINEWLGPAHTGFFKSKSDRWWRKHGGQAPFPKTVLEFMERQNELLPTGEIVVKPNGKYWEVVDAIAGVANDNTPEASNDNVPASNYGRVSAGLAELLDDEIPF >NZ_CP019390|1611906:1647321|1619579_1620431_+|WP_076771305.1|DBSCAN-SWA MADKQTVKVGDRITAKFDDSCNDYRAGNTFYVQEIDEQDGETIVAFTDNVGDKRHRRVSEFTVEHAPVAEATGKPAFKVGDRVRLIKDGLSTTGAVGKLATIKSWSGGKVVDNGQYLLNIDGPVDYETLAFQPQYTRASPECFELLAPSLTIETGKFYRTRDGRKVGPIDHNGFGVYGAPEFPGHWYENGLSYSDNTQSLTDLIAEWVDEPASNDNAPATAPAIVALIEGGQPKPSERPKVHKSEQAATDEAERLAVKYPGQKFGVFVLADSRIADVVIRRAA >NZ_CP019390|1611906:1647321|1629040_1630423_+|WP_076771313.1|DBSCAN-SWA MSYEDLLARKSADVPLRGLSNIPKLHDGMFAYQRDVTEFLLGVGGGAAFLDTGLGKSFVALEWARVVAEHTGKPVLMLAPLAVAPQHVREAKKFGYDAAQVVRSRDEVGPGINVTNYAKIDHFDADAFGGVVLDESSIIKNFTGQTTRKMMAMWSGTLYRLACTATPAPNDHMELGQHSQFLGVMQSNEMLTRWFIADQTNMGRYRLKGHAVKPFWNWVASWARCISKPSDLGYSDDGFELPPLETFKHEIRADVSVDAGDLLFRIPDTSATAIHKEKRLTADARAEAIAEQVNSENGEPWIVWCDTDYEADALTSRIPGAVEVRGSMSDAVKEERLVGFSEGNIRVLVSKPSIAGFGLNWQHCARMAFVGLSFSYEAYYQAVRRCYRFGQKRPVHVHIALADTERAIWDTVNRKSGDHEQMKREMYAAMRRAHEKRQVKIDYQPTKPIALPSWLKGASA |
49 | Rhizobium_phage(40.0%) | portal,capsid,terminase,tail,integrase,head | attL 1595782:1595798|attR 1633188:1633204 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_6 |
1662246 : 1672021
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP019390|1662246:1672021|DBSCAN-SWA CATGAATAAAACAACGTTCTTCGCGTATGCGAGGCGCGCGCCTTTTGGCGGGCGTCTTTCGCAGGCGCAGGTCGACGGCACGTCGGCAATTCTGGCCGAAGCAGAGCGCCGAGGCCTGCCGGATGAGCAGACCGCTTATGTGCTCGCCACGGCATTCCACGAGACCGGCGGCAAGATGCAGCCGATCGAGGAAAACCTCAACTATACCAGTGCGGCACGCATCCGGCAGGTCTGGCCGTCGCGGTTTGCTTCTGTTGCCGCCGCACAGCCTTATGTGCGTGATCCGCAGGCATTGGCTAACAAGGTATATGGCGGCCGCATGGGCAATACCGGTGCGAACGATGGCTGGCTATACCGCGGTCGCGGGCTGGTGCAGATCACAGGTCGCGACAACTACAAGAAATACGGCATAGGCGATGCTCCAGAAAAGGCGCTGGAGGACAGCACGGCCGTCCGAATCCTGTTTGACGGGATGATCAACGGCAAGTTTACCGGCAAGCGGTTGGCCGACTACGTCGGTGGCGGCAAGGAAGATGCGGTTGGCGCCCGCGCTATCGTCAACGGCAGCGACAAGGCCAGCCTGATTGCTGGTTATTACCGCAACTTCCTCGACAGCATCGTGGCGGCTCGCGAAATGAAACCCTCCGTAACCGAAGACGCCAAGCCTGACGATGTGCCGCTCCTGCAGGACAAGACCGTGCGGACGATCGTCGCAGGCACGGGTGGCACGCTTGTCACTGGCCTTATCGGCGCTGTGAGCAACCCATGGGCGTTCGCAACGGTGGCGCTCCTGCTGGTCGCAGTAGGCGCGGGCTTCTGGCTCTGGAAGAGCGGCAGGCTCGAACTGAAGAGGGCGGCGGTGTGAGCCTCACAAGGATTGCCATTGAATATGACAGCGACGCGGGAACGGCCACGGTGCGGATCGATAACGGCTCGCAGCAGTGGGACCATGCAAAGCTCATGGTATGCGATGCCGTCGAGACGCGCGACGGCTATCTGCTGCCGCTCACAGGGCAGCAACGCATGCTGATTTTGACAGGAGTGCCAACATGACCTGGCTCCTAACACTCCGCTCCAAGATCACAGGCTGGGCCGTGGCAATCGCTGCGGCCCTTGCGATTCTGGCGGGCGCTTACCTCAAGGGCAGGGCGGACAACGCCACAAGCGCCACCGCTGACCGGCTGAAGGCTGCCAACAAAGCAAGGAAAATCGAAGATGAAACGAGCAAGCTTGGCGGCGGTGATGTTGACGCTGCTCTGTCTCGGTGGGTGCGTGACAGCCGGTAGCTACTGCGACGTTGCCCGGCCTGTCCGCCCGAGCGTCGAGGACAGCCTGACCGAGGGTACGAAGCGCCAGATCCTCGCGGAGAACACAAAACTGGAAAAGCTGTGCGGGGTGAAGCCGTGAGCAGCCTGATATTACGCCTGGATATGGGTAGCGCCTCACAAGCTCTATCGGTCCTCTCCGACGTCTCTAATCGACCTCCTGAGTTTGGCGGACGCCTCCTCGGCCTTCTTAATTCCGGTGAAGAGCTTTTCTTTATCAATGACGAAGTCAGAACTGCACCCGGTACAGGTGACCTTGTCGCTGGTTTTAAGCCATCCGATAGTCGTCTCAATCTGGTTTCTGCATTTAGGGCAAGTCACCCCGATAGTCTGATTGTCAAACATCTCCCCCATCCCTTTAGATCAGACGATTGCAACATAACTTCCGAGGGTGTGCAATGACCGGTGCTGAAATCATGGCCGTCGTCGGCTTTATCGTAATGTTGATGGGTTTTCTGTTCGGGCTTTGGAAGTACGTCGAAAGCCAGATCGCAAAAGCTGAGGCGCGCAACGCGGCGAAAGCGGATGCTGCAACGGCTCTCGCTAGCCTGACGCGCCAAGAGCTTTCCGACTACAAGCTGCGCGCGGCTGAGACGTT
CGCCACGAAGGCGGGCATGCAGGAGCAAACCTCGCAGATCATGCGCGCTATCGAAAGCGTGGCGCATCGCATTGACGGGCTTACCGAGCGGATTGATAACATCATGCAGGCGAGGACGACACGGACCAGACCATAGGCGGCCTTCAGGCCGCTATTTTTTTATTTGCCCATTAACACCACATTTATCGAGACATTCCATCCTTCCCTTTGCGCCGCTCACCACGGCTACCAACCAAACACGAGGAGACTGTATGTCCCATGACAGACAGGGCGCGGGTGCGCGCCTTTCACACGAAGAACTCCTGCGCCGCGCGGAGGCTTACCGCGAGCACGGCACACTGGTTAAGGCTGCAGCTGCGCTTGGCATAAAGAAGTCGGCATTTCACGACAGCATCAAGCGGGCGGCGGAGTTGGGGCTGTTGGGCACGAGTCCTGTCTTGCCGGGCTTTCGTATCGCCAAGGTTAGCAATACGCCGAACGGCGACTTCATCCAGCAGCGTCCCGACCTTGGCGATCAATTCAATGTTCCGGATGGGCATTCGGTCAAGGGCGTTTCCGCGCTTGTCGATGCTCAAGGCCGGCTTATGCAGCAGTGGGTTAAAACCCGCGAGGAGCCGTCGGCGGTGGATATCGCCGAAACCCTCAAAGCCGCATTCGAAGGCTGGCAGCCAGCAGCGAAACCGCAGCCCGCGCCGACCGTTGCAAACACTGACCTCCTTACGCTAACGCCGTTAGCTGATTTGCATCTCGGCCTTTTTTCTTGGGGCAAAGAGACCGGTATTAACTGGGATCTAGAAATTGGCGAGAAAGTCATTGGTGAGGCAATCGAGGATCTTGTAGCCAGAACGCCGCCGAGCGGAGAGGCCATCGTGCTTGGAGGAGGCGATTTGCTTCACAGCGATAATAATGAGAACAAGACGGCTCGATCTGGCAACGTTCTGCAGGTTGACGGCCGCTATCAGAAAGTCCTCATGGCCGCGTGCCGTCTTATCGTGAAGTCGGTCGATGCCAACCTTCGCCAGCATTCGCGCGTGACTGTCCGCATCCTGCCCGGTAATCACGACGAGCACGCTTCTGTGGCCGTCGCATATTTCCTGCTGGCCTGGTATCGCAACGAACCGCGCGTCACCGTTGATGTCGATCCTTCGTTGTTCTTCTGGTTCCGCTTCGGTGCAGTCCTCCTCGGCGCAACGCATGGCCATACGGTCAAGTTAAAGGACATGGCCAGCATCATGGCACATCGTCGGGCCGAGGACTGGGGCGCGACAAAGCACCGATATATTCACGGATTTCACATTCACCATTCGAGCAAGTTCGCCACAGAAGGCAACGGCGTGATTTCAGAATCGCACCAGACGCCGACACCTCAGGATGCATGGCATTTCGGCTCTGGCTTCCTGTCTGGGCGCTCAATGCAGGCGATCAGTTATCACCGGCTGTTCGGTGAGATCAGCCGCGTCCGTGTCGCGATGATGGACGGCGCACAGACGAAGTATCAGGCCGCGAATGATAACTTGGCGAGCGAGAGGAGGGTGGCTTGAAAGACATGAAACCGATCCCGATTGCCGCCGCCAAGAGCATTGCGAGTGAGTATGGTTATGATCAGGTCGTAATATTTGCTCGCCGCTGTCACGATAGCCCACTGCCGCATGGCGAACACATGACGACATACGGCAAGACGAGGGAACATTGCGATGTTGCCGCTCGCATGGGTGATGCCCTGAAAAAGTTCATGGGATGGGCGGTACGATGAGATGCGAATATTGCGGCAAAGATGGCCATCCTTATACCTGCTGCCCGTCGAACGGCACAGCAAATAACCTGCGCTGTAGCTACTGCGGCGGTCGCGATCATAACTATGAGGCGTGCACGAAGCATTGGGGCGGCGGCAAGTTGCCCGGTGCCATTCGCCTTAAATAATCACCGCGCCGCCCACCAAGCGGCGCTTTCACCACGAAACACGAGGAGAGAAGTATGGAACTGCACCAGCTTTACGGCGT
GCATGGACCGGGCGATGAATGGTCGCAAGAAGACCGCGCCGCGCGAGCGGCGGTTGAAGGGCGGCAGATGAGGGCAGGTGGGAAGGTTATAAGTCACCTGCCAAAACCTTATAACGACAATGTGCCAGTTGCCAAGCAACCGGCACGTAAAGGCGACTGGCTGCAAACGTATAGCGGTCGGCAATTCTGGCCCTTGGACCCGCGCGCCGATGAGGTGTTTATCGAAGACGTTGCGCACGCGCTGTCGATGCAATGCCGATATGCTGGTCATTGCTTGCGATTCTATAGCGTCGCCGAACACAGCGTTCTGCTGGCCCGCCACGTATCGCCAGAAAGCCGCCTATGGGCATTGCTGCACGACGCCAGCGAAGCCTACCTCGTCGATGTGCCTCGACCGGTGAAGCCCTTTCTGCCGGGTTACAAACAGGCCGAGAACGCCGTCATGGCGGCGATTTGCGAGAGGTTCAACCTACCGCATGAAATGCCGGCGGAGGTGAAAGCTGCCGACCGAGCTATTATTGGCGATGAGCGCTCCAATATGGCAGCCTGCGTTGCCGAATGGTATGCGACGGGGCCGGGCATCGGGGCGCAACTACAATATTGGTCGCCTGAGAAGGCAAAGGCGCAGTTTCTGCATGAGTTCAGGAAATTGACAGCATGGAGGACGGCGGCGTGACCATACCACAAGACATTATGGCAATTGCCCGAACGGTAGCAGACAGAGACGTGAACTTCGGATCGTACGAGGTTAACACCTTGGCTATCGCCCGCGCCATCCTCGCAGAGCGTCAGCGTTGCGCGGATGTGGCTGGATCGTACGGCACAATTCCGGGTTATCCCGACGGGAAAACCGGTGCATTTCGCAAGCATCGCCACCGAATTAGGCAAGCCATTCTCGCAGGAGAAACCGCATGACCCCTTTCAAAGTTGGCGATCAGGTCGCTTGCATCGATAGCACTGTCGGCTTCGAGCAGTTCATCGAGATCAAGGAAGGCGAAGTCTACACCATAAGCTGGATCGGCCCATTCGAGCATTACACGCAGGGCATCTACATCGGCGTTCGTCTGAAGGGCGTTGATCGAGGCATCTGTCCGCAGTTCGGTTATGACAATCCGCCATTCGCCGCACGCCGGTTTCGGCCGCTTGTTCGCGATAAGTTGTCGGCGCTGCGCGGTCTGCTTGCAGGCGGGCCTGTGACTGAGAAGTTCGAGGAGCCGAAGCGGAAGGTCAGGGAGGAAGTATGAATTTCGCCGTCGACAACACAGCGCCTATCTATGCCGACCTGACTTTTGCGCCCGTCACATCTGACGGCGGCAGCACGAGCTATTACGAATTGCCAGCCGAGGCGACCGAACTAAACGACCTGATAGAACACAAGGGCATGTCCTTTGCGCTCGGCAACATCTTCAAGGCTTGCTATCGGTTTGGGGAAAAAGACGCGGCCAGCCGGATGTATGATCTGAACAAAATCATATATTTCGCGGAGAGACTTAAGAAAGTAGAGGAGAGAAAAGCGGCCTAAAAAAGAACCCCGCAGCGTGATCGCGCGGGGCAAAATAGAAGGTTGTCATATGCCTGCGGTCAAAATTTTATTCAGATGAGACAAAAAGTAAAGCCCCGACGCCCTTCCCGGAGTCGTCGGGCCGCGCTTGGCGTAGTGTGCCTGACCTAGCTTCGCGCCATCATCCAAAGGCGTTTCACATAACCTCGGATGATCTCAATGCCACTGACAGGGCCGGATTGTCATCCCTGCAGCTTTTGCAGCTTTTACGAATGATTTTCGGGCTGTGTTTGGCAGACCTTCGAGATCGGCAAGGCATGCCTGCAAGGCTTCCTCGTATTCCTTGCCGTGGGCGTTGTGTGGCCAGTCGTTCAATAGACAGCGCGCAGCAGTCGGCGTGTCTGGCACAACGCGGTATTTTCCAATTCCATAAAGTTCAACGTCTACTGGTTTTCCCCAAGCCATATCTCCCTCCCAAAGAAATTGCCCTGCCGAGGGA
CGAGCTAGGCAGGGCTGCGCTACCGACGGCTCTCGCAGTAGATGGCGCTTAGTTTTGCTCTTGCTACGAGTAAACTCGGGCGCGAAATATTTGTTCCATCACGATAACTCGTCATTCCTCCGGGTCTGTGCCCATTTCTCCCATCTGGACGCCCTTGGCCATACTTGCGATAGCACGAGCTATTTCGATGCGATCCCATCCAGCCTCTTGCGCATCGTCAATCAGGCCGAACAGCCCGTCGGCCAAGGCTTCCTGACAATCCATAAATCTGTCTGCGTAGTTTCCGTCTTTCTTCGGCCCTTTCATTCCATCCTCCCTAGAACAAACTTGCCTGCGCTTCCTTGTTGTCATTGCTTGGTGTGAGGTCGATCAGATCAGCATCGGGTAGGGGCTTTTGCATTTCCTTGGCTTCATCCCATGGAGCGCGTAGCCAAGCGTCGATTTCTTCCGTGGTGCGGAGAATGACCGGCATAGCCTTCGGGTGAACCGACTTCACGACAGCGTTTGGCTCTGTGGTTAGAAATCCGAAGATATCGACCTCCACCGGACCTTCCTTCTTCTTCCTCACACCTTTCCAGCTTGTCCAGATGCCAGCGAATGCAAAGAGCGGCTTCTCTTCATTGAGAGCGAACCAGTGAAGCGGCTTGCGCTTCGTCTTCGGGTCTGGCTCCTGTCCGTATTCGGAGAACGATGTAGCAGGAACCACGCAGCGGCTTTCAACACCTTGCCAGCGCCGCCAGTGAGGCGAGGTCAGATTGCGGATATTGGTCACGCCGCTATCCGCTTCACCCTTCACATACACTGGCGGTGTCGGCATGCCCCAGCGGAGCATTGCGAGCTCTGGTTCGTCGTCTTTGATGTTTCGCAAAACCGGGGCTGGATAGTCGGGATATACGTCAAGTTGCGGATCAACCCGGTTCGTCACGTCACCGAACTTCGGGAACAGGCGGCGCATAGCCTCATGTGTCGTCGTGATATTATACAGATTGCACATGCGCGCCTCCTCGTTGGGGAGAGAATAGCGCGGTTATTTGCGCCGTCCAGCTAATGCATCCTCACCTTCCTGCTTATGAGCGCGACAAAGCCAAAGCTGGCCGTTCGACAATTTATAGCCAAAGGTTCCCCATTCCTTGCAGCCTTTGGTATCGCACCAGTGTTCGAACAGGCTTCCAGCCTTCGCCACGTGTGCGTTGTCGTTCTTATATCCCGCCATTTATCACCTCGGCATTTTGGTAATTCCAAATTGCGCATAGCCTTTTGTCGTGCAGACCTTGCAGCGCATGCGCCTGTGTAGGTCGACGAACCAGGTATGCGTTCCGTATTTGCGTAGAAGCATCTCTCTATCGACTGCGCCGATGTGCCCGCACTGGCAACAGAACCCGTAGAGCTCATACCATTTGAACAGGTCCATGATCCGGGTCGATACCGGCATTTCGGTCAGGTAGGGTGGACGCTGTTTCATCGGTCAAAGTACGTCTCCCACGGCGTGGATTTCTTGTCTGCCGGGTCGTAGGGTACGCCGCCATATAGCTGGATGAACTCTCGCTGGCCTTCTTCGGTCCGGAACATTGCTACACTGAACAGTGTGATATTGTGTCGAGCGCTGCTCCATAATCGATAACCGCCAAGCCGCTCACGATCGTCCAGCATTTGAACGAGATTCGTTCGGTGCCATTCAGTCAAGAACAGCACGACCTGAAACGGATATTCCTTATTGACGAGCGTCTTTGGCGGCTCGCCACGCGATCTCCCGCTCATTTTACGAACACCGTCGGCTTCCAGCCTCGAGTATAACCATGGCTGACAGCCATGCTGGCTATAGCGATCTCTTTGACGAGAAAATCCCGATCTTTGAGCAGTGTTTCTATCGCAGCCCGCACATCGCCCTTGTGATGGGCGAGAACCAATTCAATTTCATTATCGTACTCATTATCTTGCGCAACTGCACTCATTGTTCTGCTCCACTGAAGCAGACACACACGCCACCAAGCATGTCTGGA
TTTTATCAGCGCCGCCTGCTGATGTTCCTAAAATGTTCTCATCTGAGGTGAGAGTCAATTGCGATTTTGAAGCTGTGGAAATCCGGTGAATTGTCTGTGGATTTCCTGTGCTTTGTCGCGCTCAACTAACGACCGTCTTCTGCCAAATGCCGTCGCCAAAAGGCTTGCCACCATGCCGACGCATGAAAGCGCTCATGCACTTTTCAGTAGGGAAACTGAATACGCAGAAATCAATCCCGGACTTGCGGGCCATGCGATGAGGCAAGCGTGCGCCAAGACGGTCGGCGTCGCGGTGAACTTCGCGACGAGCAAACTTCAAAGCATAAGCTTCGGGTAGAGCGACCTGGAACGTTTTCATTATGCTGCCCTCTTAAATAGATCGACAGGCTGGCCATGCGTTACGGCGAGACGCGAGAGTTCGACTTCTCTAACGAGGAAGTCGCGGTCTTTCAAAAGCGCCTCAATAGCCGCGTGCACATCTCCACCGTGATAAGCGAGAACTTCCTCGATTTCGGCATCATAGCTTTGGACAAGAGCGTTCATAGGTTTCTCCTGTTTTCTACAGATGTTCCTATTTTGTTCTCATTTTTGATTGCAGTCAAGGCGCTATAATGGCGTACATCAACGTGCAACAATTGGTGCTACTAACTGTGTAACAAACTGTGCGACATTTTCGGATATGTAACGAAAATAACGTAATAAAAACAATATCTAAAGAATCGGTTTTATACCAACCGACCGTACCATAGCCTATAAGCCATCTAACTGAAATTATGTGGAAATAACGATAATTTGCGTTATTTGTCGGCGCTCGTTCGAGCTTTGGCGCACGAATTTGTACCGCATATTATGCAACAAAATGCGATACAAGCTGTGCTATTATACCTTTATAGCTGGTGCGCTCTCGTGTTGCTCGCTGACAGGGACGCGCGTGCGCCAATTGCCCGCATAAGCCAAAAAATACACGATCTGGCCCAGGCGATAACAAAGCGTGCGCACTTTATGCTGCTGCCATTAATTGACTAGAATTCAAAGGCCTCCTAATGAATGACCATTCCAAGGGGGGCTTTATGAAGTGTTATCTTATTAATCTCGATAAGAGCCGGGATCGCCTCGAATTCATGGCGTCTCAGTTTGACCGCCTCGGCGCACAATTCGAGCGAGTGGAAGCCGTAAATGGACGAGCCATGTCGCCGCTTGAACTGGCTTCCTTTACTCAAATAAGTAAAGAATGGCCCGCTCCTTTGTCGCCTGCCGAAATTGGCTGTTTTCTTTCTCATCGCAAATGCCTTGAAAAGATTGCTGCCGGCGAAGATGCCTATGCGGCAGTCTTTGAAGATGACATTCGATTGAGCCAGGGTTCCTCGCGGTTTTTGGCTTCAGATCATTGGATACCCAAGCAAGCGGATATCGTCAAAATCGACGCTTACGGACATGAGGTATTGATTTCCAACCCTGTTAAAAATGAAGGCCCGTATTCGATTTCCCGGCTGCGCTCGCGACACTTGCAGACGGGCGGATACGTTGTTTCGCGCGAAGCAGCTCGCAAGCTCCTTCCATTGATGGAAAAAGCCTCAGCGCCGGTAGACCATTTCTTGTTCGACCCCAATGATGGGCCATTCAACGATTTTGAAATTTATCAGATTTCCCCTGCAATTTGCCGCCAGTCGGGAATGGAAAGCACAATCGGTCAGAATCGGCGCCCCAAACAGCGCCCCTCTCTTTTGGGCTTGGTCTGGCGTGAGGCTAAAAGATTGGTTATGCGGACCCGACGCAATCTAAAAGGCTTCATAACCAACGTCACCAAAACAGGCCGGTGGGGCTCCATTCCATTTGATCGAGATATTGCGTAG
Protein sequences of DBSCAN-SWA_6 >NZ_CP019390|1662246:1672021|1663106_1663298_+|WP_076771350.1|DBSCAN-SWA MSLTRIAIEYDSDAGTATVRIDNGSQQWDHAKLMVCDAVETRDGYLLPLTGQQRMLILTGVPT >NZ_CP019390|1662246:1672021|1664419_1665709_+|WP_076771353.1|DBSCAN-SWA MSHDRQGAGARLSHEELLRRAEAYREHGTLVKAAAALGIKKSAFHDSIKRAAELGLLGTSPVLPGFRIAKVSNTPNGDFIQQRPDLGDQFNVPDGHSVKGVSALVDAQGRLMQQWVKTREEPSAVDIAETLKAAFEGWQPAAKPQPAPTVANTDLLTLTPLADLHLGLFSWGKETGINWDLEIGEKVIGEAIEDLVARTPPSGEAIVLGGGDLLHSDNNENKTARSGNVLQVDGRYQKVLMAACRLIVKSVDANLRQHSRVTVRILPGNHDEHASVAVAYFLLAWYRNEPRVTVDVDPSLFFWFRFGAVLLGATHGHTVKLKDMASIMAHRRAEDWGATKHRYIHGFHIHHSSKFATEGNGVISESHQTPTPQDAWHFGSGFLSGRSMQAISYHRLFGEISRVRVAMMDGAQTKYQAANDNLASERRVA >NZ_CP019390|1662246:1672021|1669626_1669926_-|WP_076771363.1|DBSCAN-SWA MSGRSRGEPPKTLVNKEYPFQVVLFLTEWHRTNLVQMLDDRERLGGYRLWSSARHNITLFSVAMFRTEEGQREFIQLYGGVPYDPADKKSTPWETYFDR >NZ_CP019390|1662246:1672021|1670525_1670711_-|WP_029926953.1|DBSCAN-SWA MNALVQSYDAEIEEVLAYHGGDVHAAIEALLKDRDFLVREVELSRLAVTHGQPVDLFKRAA >NZ_CP019390|1662246:1672021|1665714_1665921_+|WP_076771354.1|DBSCAN-SWA MKPIPIAAAKSIASEYGYDQVVIFARRCHDSPLPHGEHMTTYGKTREHCDVAARMGDALKKFMGWAVR >NZ_CP019390|1662246:1672021|1662246_1663110_+|WP_076771349.1|DBSCAN-SWA MNKTTFFAYARRAPFGGRLSQAQVDGTSAILAEAERRGLPDEQTAYVLATAFHETGGKMQPIEENLNYTSAARIRQVWPSRFASVAAAQPYVRDPQALANKVYGGRMGNTGANDGWLYRGRGLVQITGRDNYKKYGIGDAPEKALEDSTAVRILFDGMINGKFTGKRLADYVGGGKEDAVGARAIVNGSDKASLIAGYYRNFLDSIVAAREMKPSVTEDAKPDDVPLLQDKTVRTIVAGTGGTLVTGLIGAVSNPWAFATVALLLVAVGAGFWLWKSGRLELKRAAV >NZ_CP019390|1662246:1672021|1666235_1666844_+|WP_154144917.1|DBSCAN-SWA MRAGGKVISHLPKPYNDNVPVAKQPARKGDWLQTYSGRQFWPLDPRADEVFIEDVAHALSMQCRYAGHCLRFYSVAEHSVLLARHVSPESRLWALLHDASEAYLVDVPRPVKPFLPGYKQAENAVMAAICERFNLPHEMPAEVKAADRAIIGDERSNMAACVAEWYATGPGIGAQLQYWSPEKAKAQFLHEFRKLTAWRTAA >NZ_CP019390|1662246:1672021|1668279_1668474_-|WP_076771359.1|DBSCAN-SWA MKGPKKDGNYADRFMDCQEALADGLFGLIDDAQEAGWDRIEIARAIASMAKGVQMGEMGTDPEE >NZ_CP019390|1662246:1672021|1663294_1663531_+|WP_076771351.1|DBSCAN-SWA 
MTWLLTLRSKITGWAVAIAAALAILAGAYLKGRADNATSATADRLKAANKARKIEDETSKLGGGDVDAALSRWVRDSR >NZ_CP019390|1662246:1672021|1663968_1664304_+|WP_076771352.1|DBSCAN-SWA MTGAEIMAVVGFIVMLMGFLFGLWKYVESQIAKAEARNAAKADAATALASLTRQELSDYKLRAAETFATKAGMQEQTSQIMRAIESVAHRIDGLTERIDNIMQARTTRTRP >NZ_CP019390|1662246:1672021|1669922_1670120_-|WP_076771364.1|DBSCAN-SWA MSAVAQDNEYDNEIELVLAHHKGDVRAAIETLLKDRDFLVKEIAIASMAVSHGYTRGWKPTVFVK >NZ_CP019390|1662246:1672021|1667885_1668134_-|WP_076771358.1|DBSCAN-SWA MAWGKPVDVELYGIGKYRVVPDTPTAARCLLNDWPHNAHGKEYEEALQACLADLEGLPNTARKSFVKAAKAAGMTIRPCQWH >NZ_CP019390|1662246:1672021|1670289_1670526_-|WP_029926951.1|DBSCAN-SWA MKTFQVALPEAYALKFARREVHRDADRLGARLPHRMARKSGIDFCVFSFPTEKCMSAFMRRHGGKPFGDGIWQKTVVS >NZ_CP019390|1662246:1672021|1667079_1667412_+|WP_076771356.1|DBSCAN-SWA MTPFKVGDQVACIDSTVGFEQFIEIKEGEVYTISWIGPFEHYTQGIYIGVRLKGVDRGICPQFGYDNPPFAARRFRPLVRDKLSALRGLLAGGPVTEKFEEPKRKVREEV >NZ_CP019390|1662246:1672021|1668484_1669162_-|WP_076771360.1|DBSCAN-SWA MCNLYNITTTHEAMRRLFPKFGDVTNRVDPQLDVYPDYPAPVLRNIKDDEPELAMLRWGMPTPPVYVKGEADSGVTNIRNLTSPHWRRWQGVESRCVVPATSFSEYGQEPDPKTKRKPLHWFALNEEKPLFAFAGIWTSWKGVRKKKEGPVEVDIFGFLTTEPNAVVKSVHPKAMPVILRTTEEIDAWLRAPWDEAKEMQKPLPDADLIDLTPSNDNKEAQASLF >NZ_CP019390|1662246:1672021|1671238_1672021_+|WP_076772003.1|DBSCAN-SWA MKCYLINLDKSRDRLEFMASQFDRLGAQFERVEAVNGRAMSPLELASFTQISKEWPAPLSPAEIGCFLSHRKCLEKIAAGEDAYAAVFEDDIRLSQGSSRFLASDHWIPKQADIVKIDAYGHEVLISNPVKNEGPYSISRLRSRHLQTGGYVVSREAARKLLPLMEKASAPVDHFLFDPNDGPFNDFEIYQISPAICRQSGMESTIGQNRRPKQRPSLLGLVWREAKRLVMRTRRNLKGFITNVTKTGRWGSIPFDRDIA |
16 | Sinorhizobium_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' and 'tRNA').
|
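
The keyword rule described in the caption above can be sketched as follows. This is a minimal illustration, not the database's own implementation: the page does not say how annotations are matched, so case-insensitive substring matching is an assumption, and the helper name `phage_keywords_in` is hypothetical. The keyword list is copied from the caption.

```python
# Keywords copied from the caption; the matching strategy below is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keywords_in(annotation):
    """Return the phage-related keywords found in a protein annotation.

    Matching is a case-insensitive substring test (assumed, not documented).
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


def is_phage_related(annotation):
    """True if the annotation contains at least one phage-related keyword."""
    return bool(phage_keywords_in(annotation))


if __name__ == "__main__":
    for name in ("transposase", "hypothetical-protein", "flagellar-hook-protein-FlgE"):
        print(name, phage_keywords_in(name))
```

Plain substring matching is deliberately simple and can over-match (for example, "template" contains "plate"), so a stricter word-boundary test may be preferable in practice.
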
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP019391_1 | 127054-127196 | Orphan |
NA
Consensus repeat of NZ_CP019391_1
|
1 spacer
spacers of NZ_CP019391_1
>1.1|127102|47|NZ_CP019391|CRISPRCasFinder CAGAGCGGGTTCCTGTTTTAACGGAACCGCTCTAACTATTTGTTTTG |
CRISPR arrays and Neighbor proteins around NZ_CP019391_1
The CRISPR arrays of NZ_CP019391_1 >merge|NZ_CP019391|1|127054-127196|CRISPRCasFinder TCGCATTTTCCAACGCAAAACCGCTTCACACTTTTGCTGGAAATGCTCCAGAGCGGGTTCCTGTTTTAACGGAACCGCTCTAACTATTTGTTTTGTCGCATTTTCCAACGCAAAACCGTTTCACACTTTTGCTGGAAATGCTC >NZ_CP019391|1|1|127054-127196|CRISPRCasFinder TCGCATTTTCCAACGCAAAACCGCTTCACACTTTTGCTGGAAATGCTC CAGAGCGGGTTCCTGTTTTAACGGAACCGCTCTAACTATTTGTTTTG TCGCATTTTCCAACGCAAAACCGTTTCACACTTTTGCTGGAAATGCTC
>NZ_CP019391.1|WP_076772240.1|124881_126999_-|flagellar-biosynthesis-protein-FlhA MSVAKQEADVQQAAAKPFTKGKFLTGSDIGLAVGIIIILTVLFLPVPAVVLDIGLAFSIAFSVLILMVALWIQRPLDFSAFPTVLLIATMMRLSLNIATTRVILTHGNEGYLAAGHVIHGFSQFVMGGDFVIGLVVFAILIIVNFLVITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIDEKEAQRRRRELEEESSFFGSMDGASKFVRGDAIAGLIITAVNIFGGIVIGATRHGMDISQAADVFTKLSVGDGLVTQIPALIVSLAAGLLVSKGGTRGSADQAIFGQLGAYPKALLIAALLLFILGVMPGLPAFPFFLLGGAMAFVGIAVPRRQARQREADAAEAGKKQREAEEQERNSVKASLETNQIELCLGKQLSARLIASQEELAHRVNKMRRKFAQEYGFVIPEIKVTDDIALPPKSYRIKIHGTAVASHELRVGEILVVLGERPVPSVPGEEVREPAFGMRAYSVPETFTADLRREGYMTVDNLSVLLTHLSEIVRNNLAQLLSYKDMRILLDRLGPEYRKLLEDICPAHISYSGLQAVLKLLLAERISIRNLHLILEAIAEIAPLVRRPEMIVEHVRMRMAQQICGDLSDNGVLNVLRLGNRWDLVFHQSLKRDAKGEIVEFDIDPRLLEQFGTEASAAIRKHFDNGERFVLVSSPEARPYIRMIIERLFATLPVLSHVEIARGVEVKSLGAIS >NZ_CP019391.1|WP_076772237.1|124098_124866_-|flagellar-type-III-secretion-system-protein-FliR MPVTQLPLNEIILAAVLAFCRIGACLMLMPGISSARIPLQVRLFIALACSFAVLPLVLQRIVTAVDAGQPWTLFRLMAGEMFIGALIGILAHIYFWALQFMANMLAMAVGYSGSPADSVTENEPQATLATIVTFSALFLFFVTDMHLEIFRALLNSYVAIPVDGRFRPDAAMIDFTDALSAAFLATLRIAAPFIVFAILVNFAIGLVNKLTPAIPVYFISMPFVLGGGLLVVYFILPELLRFFTHETATELRELF >NZ_CP019391.1|WP_015799984.1|123683_124091_-|hypothetical-protein MAQAGTRNLRKLVELQKLGCARHEAALAIANARKSALDEERAALIAMQDRRYDANALDIDPSLVIRRLETNAVEMQQVESRLELARKALLKEQRRVELLQDRLNDAQADRERRELASLIEEFVSRKTSDESQKRS >NZ_CP019391.1|WP_069714511.1|123025_123616_-|rod-binding-protein MAINPPSDLVMDVARAADPQAYRMAAERLNAASGVAAPMGGSAATGLTRDNFGSFSENLAAGVSVRPDAQSAASPAYRKFEAFMLQSFVQSMFTSDTTATFGKGIAGEYWKSMMAEAMANKMADGGGVGIARLLEEQAARNRRAEAPATLALGDVIDTLDVNAGEKAVSKDIIHGLERKLIQKQLGNNNSTDRAHG >NZ_CP019391.1|WP_076772235.1|122408_122903_-|flagellar-protein-FlgN MIKDQKMTETSSENSLPPEDLLIVAASCEPEQEHVREPALVRDAALKPVMRAIERLEDVIETETRLLLEGGNPDLAEINARKSRGLYDFNKAIKKAADTAEPATMKGLQPFLDRLKQKLERNCEALQLHLRAVGELADLIRGALETQEADGTYNMQSARLGHAR >NZ_CP019391.1|WP_076772233.1|121884_122412_-|hypothetical-protein 
MIRIIVIGLWICAVAFGSLFLAVNRDVSASAVVEAMPGGFGGVDYVKTDVMSVPIISNGAVTGYVVAQLVYTVDSNIHKKLTVPLEYFISDEIFHKFYGSYSDTKSVEKVSFEDVRSSIINDLNARFPEPVIKDLLVEQFNYISSEEIRTMNMRAHAQPSGRNRKADAEPAAQSE >NZ_CP019391.1|WP_002966657.1|121561_121897_+|PilZ-domain-containing-protein MARVKPPQERRREERQRTRLRSGKIVNLDGRFVVECQFIDIAPHGAKIRVREALYLPERFWLFDDHYARALLARLAWRKGREFGVEFIIDPTVIPLDEERLAHLAGKYYSL >NZ_CP019391.1|WP_076772231.1|120997_121435_+|transcriptional-repressor MTTHHHHHAPRDLTRNQTLVFDVLSRADGPLSAYTILDQLRDDGFRAPLQVYRALEKLLDYGLIHRLESLNAFVACAHPQCHQQGLVAFAICEKCGQVTEFSDAAIENLVTAWSVQNGFKSRKTTLELRGICEACDDHKSASRKV >NZ_CP019391.1|WP_076772229.1|120140_121001_+|metal-ABC-transporter-permease MTMNTIITKGTSRVLDDFFTRAIIAGIGLALTTGPLGCFIIWRRMAYFGDTMAHSALLGVALALIFDINLMVGVFAVAVAISAILLLLQRRHTLSADSLLGILSHATLSLGLVLMAFMTWVRVDLLSFLFGDILAVSRIDIAFIYGGGALILAVLAWLWRPLLAATVSEDIARAEGMNPALSRIIFMLLLAIVIAIAMKIVGILLITSLLIIPAATARRFASTPEQMAVLASLIGAAGVIGGLYGSIHFDTPSGPSIVVAALAIFILSLLPLNKGRETSPKRIAGQ >NZ_CP019391.1|WP_076772226.1|119290_120187_+|metal-ABC-transporter-ATP-binding-protein MDKKSSHPAGAARDILIELKNAGVYRDGRWLVRNVDLSVERGEIVTLIGPNGAGKSTAAKMALHILKPDEGMVSHKPGLRIGYVPQKINIDRTLPLSVERLMTLTGPLPRKEIDAALEAVGIAHLAKAETAHLSGGEFQRALMARALARKPDIMVLDEPVQGVDFSGEAALYELIARLRDDTGCGVLLISHDLHLVMAATDRVICLNGHVCCSGTPRDVTSSPEYVRLFGSRAVGPLAVYEHHHDHTHLPDGRVLYADGTTTDPIAGSTRGPRGHCHVEDGHHHDHEHHHHEGDQPRA >NZ_CP019391.1|WP_002966650.1|127228_127495_-|flagellar-biosynthetic-protein-FliQ MNEADALDIVNSAIWTVLTASGPAVLAAMLAGIGIALFQALTQIQEMTLTFVPKIIVIFVVLALTAPFVGAQINAFTLLAYSRIEKGF >NZ_CP019391.1|WP_002966649.1|127506_127920_-|flagellar-hook-assembly-protein-FlgD MTTTSPVGSNTTNSASTASNSTSAANKASVDYDSFLKLLVTQMQNQDPTQPMDPTQYVSQLATFSNVEQSVQMNSKLETLIANTSLTQAEGWIGRTLTNADGSISGVVKSVTIQSSGMLAELEDGKTLTIGEGVRIS >NZ_CP019391.1|WP_004680988.1|127916_128375_-|flagellar-biosynthesis-repressor-FlbT MAANSKTAIRLSLRAGERIFINGAVLRADRKVSLELLNDATFLLENHVLQPEDTTTPLRQLYFAAQMMLIEPAMREQAGATFAQMLKGMFATFKDAEILNALKLVDELVHNGRVFEALKTIRAQYPREAELMGAQPVVWPVTKSGKSAGANP 
>NZ_CP019391.1|WP_002966647.1|128376_128721_-|flagellar-biosynthesis-regulator-FlaF MYQLRYEDVMNDDMASAKERERMLFDRSIAMLEAARANGAESREGIDAAYFTSKLWTTIIDDLGSEENALPKELRAAIISIGIFVLKEIERIRQGESNDYATLIEITQSIRDGL >NZ_CP019391.1|WP_076772241.1|128852_129899_-|flagellar-hook-associated-family-protein MKAQSISTYGATSALRALVAKNKAEMVKAQQEATTGTVFDVGLSLGSRTGQTVSLRKEYDRLSVLTDMNKLVQQRMTATQTAAGKIIENTQNFLGDLAGANNSGETAKTVAKSARSMLDSVTGLLNTSFNGEYIFAGVNTDVKPISDYADGSTAQNAVRQAFQDHFGFAMDDPQVANISGDEMKAFLEGDFAEQFNDANWAANWSDASDTRIKSRISPTETADTSISANADGFRKTVMSAVMVAEFADIGLTASAFDALTTQALQITTQAVTETTAEQTTLGLAQSRTEAATTRIAAQQKILNQSVLNLEEVDPYDAATRVNALKTQIETSYSLTVQLQNMSLLNYLR >NZ_CP019391.1|WP_076772244.1|129904_131359_-|flagellar-hook-associated-protein-FlgK MSLSSALLTAKSSLAATSKQTSVVSRNISGAKDADYSRRTASLVSGPYGSLYVGISRSADEAMFNRYIQSNSAASASSTLAGGLDRLSALYSADNYSGSPSGLIGDLRDALQTYAASPSNSALGDSVVSVAQSLANALNDGTRQVQSLRNDADREIADSVANINDLLAKFEKANQDVVGGTRMGRDVSDYLDQRDALLKQLSGEIGITTMMRGDNDMVIFAENGVTLFETTARKVTFEQSTALTPGVAGKAVTVDGVPLSHDTFDQPFGTGRLSGLLQLRDQIAPQYQMQLDEIARGLVTVFAESDQTGSSPDQTGLFSWSGSPAIPGAGLSAGIAGTIGVSVPFIASEGGSALLLRDGGANGANYKYNAQGAAGFSDRLRALNEAFSEPMVFDAAAGISSSSSLTSYSASSLGWLEGKRQKANSEFTYNGTVASQADFALSNATGVDIDTEMALLLDLEHSYQASSRVLTTVSAMLDDLLNAV >NZ_CP019391.1|WP_076772247.1|131483_132674_-|flagellar-hook-protein-FlgE MSLYGMMRTGVSGMNAQANRLSTVADNIANASTVGYKRAETQFSSLVLPSTAGQYNSGSVLTDVRYGISDQGGIRSTSSTTDLAIDGNGYFVVQGPGGSTYLTRAGSFVPDKNGDLVNSAGYYLLGAGADEAAGGLTVAGLNVVNVNAAALPAEGSTAGDFTVNLPSTDQAPAAGEYNHKTSLISYNDKGEKITLDVYFTKTGADEWNVSVKNAADGVEIGTTVLNFDPTTGDLVSGGNVAVNLGAYGGQTLNLNLGGSTQRAGDYTISQAVINGQAPSSIKGVDVGNDGAVVAVYENGTQKVLYRIPLANVASPDRMTVVSGNVFLPSAESGDVRLGFPQGDGMGKIMSGTLEESNADIAQELTDMIEAQRSYTANSKVFQTGSELMDVLVNLKR >NZ_CP019391.1|WP_002966643.1|133082_133766_-|response-regulator-transcription-factor MIVVVDDRDMVTEGYSSWFGREGITTTGFTPTDFDEWVESVPEQDIMAIEAFLIGECADQHRLPARIRERCKAPVIAVNDRPSLEHTLELFQSGVDDVVRKPVHVREILARINAIRRRAGASATSGADGTQLGPIRVFSDGRDPQINGIDFPLPRRERRILEYLIANRGRRLNKVQIFSAIYGIFDSEVEENVVESHISKLRKKLRGQLGFDPIDSKRFLGYCINIE 
>NZ_CP019391.1|WP_076772252.1|134092_134665_-|lytic-transglycosylase-domain-containing-protein MAVALLISVGAVWLSDVPLAQAENICEREMHRASARYGVPLGILYAVGLTETGRKNSLQPYAMNIEGRAEFPPSQSAAIRRFGEVRAEGAKLIDLGCMQINYHYHSQAFPSVAAMLMPSLNVDYAARFLKRLREREGNWTMAVARYHAGPNNDPAQKRYVCRVMANMVAAGFGNWTPQARSFCFSRDGLL >NZ_CP019391.1|WP_076772253.1|134618_135893_-|flagellar-hook-length-control-protein-FliK MSVDMLLTASGRLASLARNSSTQARVAGDEQDGTADPATLFGALLEKPQGKVDASGKDEAEAKSDEKEAGEGSGQKPVAVFGLPQNLLSLASALPGQLHGEGGFSVEEKAGNPARAAVSEPGLADTIDPAGIDKAAAEFAGDDEKRPLRDLPRVKAPQTNGDPAIRAGQVSGEAALDISGDIATNKPAEGAAVDVAGVKAPKPATDHEPAKGMDVRADLLPQQVAAKSTVPAEAKAAAGAPGAARIADIEVVSERSFGTVKTLQIRLDPVELGAVTARIRVAGDGVEVHLVADKSHAAEMLAADRSMIEKALKVAGVGDDTKISVTVADRNAQGAAQHVAAAQNAGQQQASAQQQGHQLASNMQQQGSEGRGGEAQAQFMSGRSGGEGGRNGESGQAGREHANSQAKPENGRGAPHIGGRGLVV |
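
The coordinates reported for NZ_CP019391_1 above can be cross-checked with a short script. This is a sketch under stated assumptions: the spacer header fields are read as spacer index, start position, length, contig and detection tool (inferred from the values, not documented on the page), and the space-separated segments of the array record are assumed to alternate repeat, spacer, repeat. The function names are hypothetical.

```python
# Values copied from the NZ_CP019391_1 record (array location 127054-127196).
ARRAY_START = 127054
ARRAY_END = 127196
SEGMENTS = [
    "TCGCATTTTCCAACGCAAAACCGCTTCACACTTTTGCTGGAAATGCTC",  # repeat
    "CAGAGCGGGTTCCTGTTTTAACGGAACCGCTCTAACTATTTGTTTTG",   # spacer
    "TCGCATTTTCCAACGCAAAACCGTTTCACACTTTTGCTGGAAATGCTC",  # repeat
]
SPACER_HEADER = ">1.1|127102|47|NZ_CP019391|CRISPRCasFinder"


def parse_spacer_header(header):
    """Split a spacer header into named fields (field meanings are assumed)."""
    index, start, length, contig, tool = header.lstrip(">").split("|")
    return {
        "index": index,
        "start": int(start),
        "length": int(length),
        "contig": contig,
        "tool": tool,
    }


def locate_spacers(array_start, segments):
    """Return (start, length) for each spacer, assuming repeat/spacer alternation."""
    position = array_start
    spacers = []
    for i, segment in enumerate(segments):
        if i % 2 == 1:  # odd-indexed segments are taken to be spacers
            spacers.append((position, len(segment)))
        position += len(segment)
    return spacers


header = parse_spacer_header(SPACER_HEADER)
spacers = locate_spacers(ARRAY_START, SEGMENTS)
total_length = sum(len(segment) for segment in SEGMENTS)

if __name__ == "__main__":
    print("header:", header)
    print("computed spacers:", spacers)
    print("array span:", ARRAY_START, "-", ARRAY_START + total_length - 1)
```

Under these assumptions the computed spacer position and length agree with the header, and the summed segment lengths reproduce the reported array end coordinate when the range is treated as inclusive.
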
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
645148 : 652062
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP019391|645148:652062|DBSCAN-SWA CTTGAGCAGACGAAGCCTTACAGATGAGCAATGGAACCGGATCGAAGCATATCTTCCGGGGCGAGTTGGTACGCCCGGCCGCAGTGGCGTCGATAACCGATTATTTGTCGACGCCATCTTGTGGATGGCTGCCAATGCAGCGCACTGGCGCGATCTGCCTGCGACCTTCGGCAAATGGACAGCGGTTCATGCCCGCTTTCGGCGCTGGTCGCACGCCGGTGTATGGGAAAGGCTTTTCCATGCCCTGGCTGATACGCCGGACTTTGAATATGTCCTCATTGACAGCACCATATCGAAAGTCCACGCAGATGCGGCGGGCGCAAAAGGGGGGCTGAAGCTGCCTGCATCGGTCGCTCGCGTGGGGGATTGACGACCAAGCTGCATGCTGTTGTCGATGCTATCGGCCTACCGCTGCGAATAAAGCCAACACCCGGCCATTATGGTGACTGTCCGCAAGCTTCAAGCCTTCTATCCGGCTTAAAGGGTGTGGGGCATGTCATTGCTGACGCTGCCTATGATGCCGATCACTTAAGGGCCTTCATTGCCAGCGATCTCAAGGCAACGGCTCAGATAAAGGCCAATCCAACACGTTCCAGTGTTCCAACAATCGACTGGAGGCTGTACAAGGAGCGCCATCAGATTGAATGCTTTTTTAACAAGTTGAAACGCTATCGTCGCATTGCGCTGCGATGCGAGAAAACATTGACCGCATTCATGGGGTTCGTCCATCTCGCATGCGCTATGATCTGGTTACGTTGAATGCAGACAGACCCTAGAGCGCGTTTCGATCTGATTGAATCAGATCGGCGCTCTAAACAACCAGTATAACATACCAATATAAAAGCCCGGAATTAACGTTCCGGGCTATATCGTTTCTCGCGTTGCAGCTATCTGTTCAGCTGAAGGCAAGCAATTGCGCGCGCCGCACGCCATGCATCAACATATCGAAATGATCGTGATCCTTGGCGACAATCAGTTGCATAGGGAGTGTTGGATTGGCTTCCGAAATCCTTTGCGCAGCCTCAGTGATGTTGTCTACCATTGCCCGCTTTTCCAGATGCGCTCGCCGGCGGTCATTCAAAGGTTGCGAAGAATATTGCTCGTCTTGCCCAACGGTAATGACCAGTTTTTCACACCGGGCGATAGGTGGCAGCGCGCGTAATTGCCTGAGCACCATCGCCTCGCTGAACCAGAGCGAAGGACTTGCGGCCCAATAACGCTCGAAATGACCGGGATGATTCAGCAAGACATAGATAGCAAAAAGGCCGCCATATGAATGGCCGAAGAGCGTTTGCTGCGTGAGGTCGAGAGGGAATTTCGTCGCCAGCAGAGGCCGTAATTCCTCGGCTATAAAACACAGGAAGTCCGAGGCTTTGCCGAATTCTTCAGAAACATGGTCGCCAGTCGCACTGTCCGGTACGGGGGTGTAATTGCGGGCGCGGGCTGCGGGATTGTGAAGCGGGCTACCCGGAAAGCCGATTGCGACGATCAGGCCATCGGCTTCTCTGCCACCGACCAGCGAGCCGGTGGCGAAATTGCGCGTAGGCCGACTATAGGTTAGAGGAAAACTTGTTTCGGCATCGAGCATCCACAAAACGGGATAGCCGTTTTCTGGAACAGCCCCATCGGGAACGCGTACCAGAACCTGATATGTTTCACCGGTACATTTTGAATTGACCTCGAAGGACTGCGATCCAGGCAGGTAAACGGGTTGCCATTCACTGGTCTTCAAGGTCATTATCTGTCTCTTATCAATGAGATATGTTAGGCTACGAATTGTCTATACGCTTCAAAATAGCCATAACACAAGCTGTAGCGTCGCCTAAATCCGCTATATCCGCCAGAAGAGGTCACTGCGGGCGAGATCGAACAATTGCATGTTGCTGAGCGCGTGATGTGGTTCGGGCGGTCAATTTAAATTTGAT
GTCAAAGGCCCGCCCTGCCATATGTGTAACCCGCAATATCATATGGCAGTGACCGGGATAGTCCGGGATTAAGTGGGATTTCGCGATTTAGAGCGGAAAAACCCCGGAGAACCGGGGCTTTCGAGAAATCGGCGAACCGCCTATCGCAAAACAGCGGCGCAAGATCAAATGGCAGGGCGAGCCTTTGACATCAAATTTAAATAGACCGCCCGAACCACATTACGCGCCCAGCAACATGCAATTGCTCGATCTCACCTGCAGTGACCTCTTCTGGCGGATACAGTGGATTATCGCTTATCAACTGTAGAGTGCCATTGAGACGGCTGTGAACACGCTTCACCAACAAAATATCGCCGTAGACAATCACATAGATCGTATTGTCCTGAATATAATTGATAGAAGTATCTACGAGCAACACGTCACCGTTACGGATCGTCGGCTCCATCGAGTCGCCTTTGGCCGTTAGTACGCGAGCGAACGAAGGATTTATCTTTCTCGCTCTGAGCCAACTTTCCTGAAATGCAAGATAATCAATAGCTTGCTCGTTATGTGCAAGGGCTCCTGGTCCGGCTGATGCCTCAATATCTAATCGAGGCACCAGTGTGAAACCCGGAATATCTTCATCCGCGATCTGCAGATGATTGTTGAGGTGCGCTTCCTGCTTATCTCGCCCTGTCGCGAGCCAATCTAGCGATTTGCCAGCCGCATAGGTCAACGACGCGATACTGGCAAAATTAGGCTTGGCCTTGCCATCCCGCCAGCGGCGGATTTGTTCGTCTGTGACGCCCGCAATCTCACCGGCTCGAACTGTGGTGCCAATTATCCGAATCACCTCTGCCAGCCGATCACGAGCATCTTCATCAAACGAAAAGTCGGGAGTGGGAACTCCTGGCCCACTTTGATGCTCGGATATCATTTGCCACCAACCTATTGTTGTTAAGCGACTTTCCGTACTCATCGAATTAGTGGCTTCAAAATCAAGCGGGAACCACTTTTACGTTGATCGCCAACTAATAAGTTGCTACATTGGTCGTGTCTCTAGTTTGAACACATTGCAGCCGACCGCCCCTGCCAGGGCAAGCGACCAAAGGAGTTGTCATGACCACCAAAAAGTGGGACCGCCACGAAATTCTCGCGGAAATCAGACGCAGAGGAATGACCCTGACAGGCATTGCCCGTGATGCAGGTCTCTACGCCAGTGCTTGCCGCGCTGGCATGATCGGTGCCAGTCGCCCAGGCGCAGAAGCAATTGCCGCCGCACTCCAAGTCCCTTTCCGCGATCTCTTTCCCGACAGCTACACACGCGGCCGTCACGACGAGTTGCACACTACCAGCAACAAGAGTTGCAACGGAAGGGCAAAAACTCACCCGCGCGCTGACGGTACACAGTCAGCCGCTTAGGGATTCCGTCACGGCTGCTCAACAACCAGTTCCTCGGAGGGAAAGCCAATCCTCCGTACTGAGTTCGTCAGGGGCCAAGTGCCGTCTGAAGCGCGCAGGCCATGCGATCCGCCTGTCACTTTGCCTGATCGCGCGTTGTCGGCGAAAGAGTTGACACCGCTCTGGGGCAAGGATCGCGCCGCTCTTGCCGCCTGCGAACAACGGCGCGGTGCAGTCATTGCCGCTATCGACGCGGTGCCGGTGCCAGCGGAGCGGCCGAAGTGATGGAGGCGGGCAACAAGGCACATGAGCTTGCTGATCTGCGCGCCGAGCAAGAGCGGGACGCGGGAATCGCTGCCGCGTCGGCTGCATTGATCGGAGACGGATCGGACATCTGCGTCCGCTGTGGCGAAGAGATCGAGCCGGAGCGGCTTGCAGCCCTGCCATCGGCGCGCCGGTGTGTGGACTGTCAGAGCAAACTTGAACGCGAGCAGTCCAGGGGGCGTCGTTAATGCTGGAGAATATCTCACTCGCCCAGATCGGGCAGCTCATCAATTTCTTGCTGGCGGCTGCGGCCTTCCTCGGCGTGGTCAGCGGCTATATCGGCAAGGGTGCGAAAG
AGATATCGGCCAAGGTTTCGGATCACGAGCACCGACTGTCGAAGGTCGAGAATGATCTGTCGCACATGCCGAACACTGAAACCGTCCATCAGCTCCAGCTCGCGATCACCGAGATCAAGGGGCAAATGGGCATTATGGCGAAGTCGTCGGAGGCGACCGAACGGACGACACGCCGTGTTGAAGAATTTTTGATGCAGAAAGGACGTTAGGAATGACGGCGGGTTACAAGGATTTTGTCGATCAGAATGTACGCCTCATCATCCTGAAGGTGCTCGCGATGGAGACGAATGCCAGTCTGAACGACAGCCTGCTTGAACGCGAGCTGGAGGTGTTCGGCTACAAACGCACCCGCGAATATCTGCGCAATCAGATGCGCTGGCTTGAAACCGAAGCTGGGGCTGTACGCATTAGTGCCGCAGGCACAGCCCTTATCGCCACATTGACCAGGACAGGCCGCGATCATGTCGAGCGTCGTCTCGTCCTCGAAGGCATCCAGCGACCGGGCGACGTGGAGTGATGAGCCATGACAGAGGATACGCAATCCTACACAGCCGCTGGAACTTACACTTTCATTGTCCCGACCGGAGTTTATTGGGTCTTTGCGCGCGTATGGGGCGGTGGCGGCGGTGGTGGCGGGGCTGCAAACAATAACGCTGTCGGTCGTGGAGGCGGCGGATGCGGCTACTCTGAGGGATGGGTCGCGGTAATGCCTGGACAATCGCTGACACTCGTCGTCGGTGCGGGGGGCGCTGGTGGAGCTGGTGGCGCATCCCCTCAAACAGGCGGGAATGGCGGCACATCGAGTGTCGGCGGCATATCAGCTACGGGCGGCAAAGGCGGTGGTTACTCCACTTCGGGCGGCGGAGTATCAGGTGGTGACCCAGGCGGCGGATTTGGTGGCAGTGCGAATTTTTCGGGCCAAGGCGGTTCGATCGGCTACCGCATCGCGGAGGGTGCGTGGATCGCCCCAGCCGGAGGCGGTGCATTCGGAACGCCATCCTCCAACGTTCACATCAATGACATCGGCTATCTTGGATACGCACCGGGCGGTGGCGGTTCTGGCGCATGCACCAACGCTGCCGGTGGATCTGGCTTCGATGGCCGAATTATTCTCCAGTACTGAGGTATGAGAACATGAGATATGCACTGATATCCCAGGGCAGCGTCATCGATGTCGTGGAACTTGATGATGTAATCGATCCGCTTGATGTTTTTGCAGTTGAATTAGCTCCAACGCCTTGTGATGAAGATGTGGAGAGTGGCTGGCTATACGACGGTGAAGAGTTCGCTCTACCTGTCGAGCCAGAGCCGCAACCTGAGCTGGTTCCAGATGAAATCAGCCGGCGCCAGTTTTTTCAGCAGTTGGCCGTTCTTGAGATCATCACCAGACAAGAGGCGCTTGATGCACTGGACGGCGCAATTCCAGCGCCTCTACAGGCGATCATTGATCAGCTCCCCACGGATGATGACAAGTTCAATGCCCAGATGCTGGTGAAGGGTGCCCAGAATTTCAATCGTACCAACCCTCTCGCGGAAATCGTTCGGCAAGCCATGCAATGGACAATCGAGCAGAAGGACAATTTCTGGCGACAGGCCGCAAAGCTCTGAGGTGAACTATGAGTGCAGTTATTGGCGTTCTTCTGGACGAGTTGCGCGGCCTGCTATCGATCGAACATGACGGCTCGATCACATGGGATGAGCTTCAGGAGTTGAAGAACGAACATTTCGGACCTGATGCAGTGGCGATTGAGGTTTATCCGCCGCATAGCCATGTCGCGAACAGTTTGCCTATGCGCCACCTGTGGAAGCTTGGTGCTGGCGAATATTGGCCTGATCTGACCGGCCAAAGGCTGGTTGGCGATCTGACCTTGCGTGATCGCGAAATACTCACACGATCGGAGATCGAATTTCATCAGCAGCGTAAAGCGGAGGTGAAGTGCACTATTTCGACTGAGCTTGATGGGCCGGTCGTGGTGTCCATGGATGGTATTCACCTTTGG
TCAGCAACTGACAATTGGGGAAAACCTCCGGCACCAAAACGGAAGTAGGGACGGCCACGTCAATTCGTGGCGGCGGGCCGATCCGGCAAGATAGAACCCGCCCGACAGCCATCCGAGATAACCGTCGCATCCGTGCCCTTTCGGGCAATTGGTTTGTGACCGATTCTCTTAGGAACGTACATGAGCCAGAAGGCCGATTTCACAACCGTCGATCCTGTTTCGCCGCCCGCAGCCTATATCGGTGGGAAGCGCCAGCTTGCAAAGCGCATCTGCCAGAAGATTAATGCTATATCGCATTCACTGTATGCTGAGCCGTTCGTCGGCATGGGCGGGGTGTTTTTCAGGCGCATCGCAGCGCCACGCGCCGAGTTCATCAATGACCGTTCAAAAGACGTCGCGAACCTGTTTCGCATTCTTCAGCGTCATTATCCGCAGCTCATGGATACGCTCCGGTTCCAGATCACGAGCCGAGCGGACTTCGAACGGCTGACAGCCACCGATCCCGATACACTAACCGATCTGGAGAGGGCTGCGCGCTTTATCTACCTTCAGCGCCTGACGTTTGGAGGAAAGGTGGCCGGACGATCATTCGGCATCAATTACAGCGGATCATCAAGATTCAACCTGACAACGCTCGCACCTATACTCCAGGAGGTTCACGAACGGCTTGCTGGGGTGGTTATCGAGAACCTGGACTGGCAGGCGTTTATTGATCGGTATGATAGGCCAGAAACGTTGTTCTATCTCGACCCGCCTTATTGGGGCACGGAAGGTGTTTACGGGAAGGAGTTATTTAGTCAGGACCAGTTCGAAATTCTTGCCGAGCGCCTTGACCGCATCAAGGGACGGTTCGTCCTGTCGATCAATGATGTTCCACAGATACGAACCATCTTCGCTGGCTTTCAGATCGAAGGTGCTGAGCTGACCTATTCGGTTTCAGGAGGAAAGGGAAAACAGGTGCGGGAACTGATCATCAGCAACGCTCCTTAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP019391|645148:652062|649283_649589_+|WP_076772836.1|DBSCAN-SWA MTAGYKDFVDQNVRLIILKVLAMETNASLNDSLLERELEVFGYKRTREYLRNQMRWLETEAGAVRISAAGTALIATLTRTGRDHVERRLVLEGIQRPGDVE >NZ_CP019391|645148:652062|647268_647517_-|WP_076773855.1|DBSCAN-SWA MEPTIRNGDVLLVDTSINYIQDNTIYVIVYGDILLVKRVHSRLNGTLQLISDNPLYPPEEVTAGEIEQLHVAGRVMWFGRSI >NZ_CP019391|645148:652062|646042_646885_-|WP_076772830.1|DBSCAN-SWA MTLKTSEWQPVYLPGSQSFEVNSKCTGETYQVLVRVPDGAVPENGYPVLWMLDAETSFPLTYSRPTRNFATGSLVGGREADGLIVAIGFPGSPLHNPAARARNYTPVPDSATGDHVSEEFGKASDFLCFIAEELRPLLATKFPLDLTQQTLFGHSYGGLFAIYVLLNHPGHFERYWAASPSLWFSEAMVLRQLRALPPIARCEKLVITVGQDEQYSSQPLNDRRRAHLEKRAMVDNITEAAQRISEANPTLPMQLIVAKDHDHFDMLMHGVRRAQLLAFS >NZ_CP019391|645148:652062|650206_650680_+|WP_076771237.1|DBSCAN-SWA MRYALISQGSVIDVVELDDVIDPLDVFAVELAPTPCDEDVESGWLYDGEEFALPVEPEPQPELVPDEISRRQFFQQLAVLEIITRQEALDALDGAIPAPLQAIIDQLPTDDDKFNAQMLVKGAQNFNRTNPLAEIVRQAMQWTIEQKDNFWRQAAKL >NZ_CP019391|645148:652062|650688_651120_+|WP_179947171.1|DBSCAN-SWA MSAVIGVLLDELRGLLSIEHDGSITWDELQELKNEHFGPDAVAIEVYPPHSHVANSLPMRHLWKLGAGEYWPDLTGQRLVGDLTLRDREILTRSEIEFHQQRKAEVKCTISTELDGPVVVSMDGIHLWSATDNWGKPPAPKRK >NZ_CP019391|645148:652062|648170_648473_+|WP_076772832.1|DBSCAN-SWA MTTKKWDRHEILAEIRRRGMTLTGIARDAGLYASACRAGMIGASRPGAEAIAAALQVPFRDLFPDSYTRGRHDELHTTSNKSCNGRAKTHPRADGTQSAA >NZ_CP019391|645148:652062|648521_648737_+|WP_076772834.1|DBSCAN-SWA MLRTEFVRGQVPSEARRPCDPPVTLPDRALSAKELTPLWGKDRAALAACEQRRGAVIAAIDAVPVPAERPK >NZ_CP019391|645148:652062|648963_649281_+|WP_083699691.1|DBSCAN-SWA MLENISLAQIGQLINFLLAAAAFLGVVSGYIGKGAKEISAKVSDHEHRLSKVENDLSHMPNTETVHQLQLAITEIKGQMGIMAKSSEATERTTRRVEEFLMQKGR >NZ_CP019391|645148:652062|648736_648964_+|WP_076773857.1|DBSCAN-SWA MEAGNKAHELADLRAEQERDAGIAAASAALIGDGSDICVRCGEEIEPERLAALPSARRCVDCQSKLEREQSRGRR >NZ_CP019391|645148:652062|645148_645905_+|WP_099686906.1|transposase|DBSCAN-SWA 
MSRRSLTDEQWNRIEAYLPGRVGTPGRSGVDNRLFVDAILWMAANAAHWRDLPATFGKWTAVHARFRRWSHAGVWERLFHALADTPDFEYVLIDSTISKVHADAAGAKGGPEAACIGRSRGGLTTKLHAVVDAIGLPLRIKPTPGHYGDCPQASSLLSGLKGVGHVIADAAYDADHLRAFIASDLKATAQIKANPTRSSVPTIDWRLYKERHQIECFFNKLKRYRRIALRCEKTLTAFMGFVHLACAMIWLR >NZ_CP019391|645148:652062|651252_652062_+|WP_076771236.1|DBSCAN-SWA MSQKADFTTVDPVSPPAAYIGGKRQLAKRICQKINAISHSLYAEPFVGMGGVFFRRIAAPRAEFINDRSKDVANLFRILQRHYPQLMDTLRFQITSRADFERLTATDPDTLTDLERAARFIYLQRLTFGGKVAGRSFGINYSGSSRFNLTTLAPILQEVHERLAGVVIENLDWQAFIDRYDRPETLFYLDPPYWGTEGVYGKELFSQDQFEILAERLDRIKGRFVLSINDVPQIRTIFAGFQIEGAELTYSVSGGKGKQVRELIISNAP >NZ_CP019391|645148:652062|649595_650195_+|WP_156884125.1|DBSCAN-SWA MTEDTQSYTAAGTYTFIVPTGVYWVFARVWGGGGGGGGAANNNAVGRGGGGCGYSEGWVAVMPGQSLTLVVGAGGAGGAGGASPQTGGNGGTSSVGGISATGGKGGGYSTSGGGVSGGDPGGGFGGSANFSGQGGSIGYRIAEGAWIAPAGGGAFGTPSSNVHINDIGYLGYAPGGGGSGACTNAAGGSGFDGRIILQY |
12 | Ochrobactrum_phage(77.78%) | transposase | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation contains a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
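The keyword rule in the caption above can be illustrated with a minimal Python sketch. The keyword list is copied from the caption; the matching strategy (case-insensitive, whole-word) and the function name `phage_keywords` are assumptions made for illustration, not the documented behavior of the tool that produced this page.

```python
import re
from typing import List

# Keywords copied from the caption; everything else here is a hypothetical sketch.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)

# Assumption: whole-word, case-insensitive matching, so that "template"
# does not count as a hit for "plate".
_PATTERN = re.compile(r"\b(" + "|".join(PHAGE_KEYWORDS) + r")\b", re.IGNORECASE)


def phage_keywords(annotation: str) -> List[str]:
    """Return the sorted, lowercased phage-related keywords found in an annotation."""
    return sorted({m.group(1).lower() for m in _PATTERN.finditer(annotation)})


if __name__ == "__main__":
    for text in ("transposase", "hypothetical-protein", "tail-fiber protein"):
        print(text, "->", phage_keywords(text))
```

Under these assumptions, an annotation such as `transposase` or `integrase` is flagged, while `hypothetical-protein` is not.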
DBSCAN-SWA_2 |
918955 : 925147
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP019391|918955:925147|DBSCAN-SWA GTCAATTCAGCCAATTCACCAAGACTACCGTTCGGGTCCCCGGCGTGGTCAGTTCGCGCCATGCCGCTTCACTGGTGGCGTTCTTGACCTTCTGGGGTATCGTTGTGCTCGGAAGAAGCGCACCGCGCCACACGGCGTTCTTTCGGGGCCATTCAAGCGCCTGCTCTACCGTCGCGGGTTCACCGATGAACCGCTGGCCGTACGTGCCATCCAGCCATGTCGTACCGCGTCGCAAAGCCTGTTCAATTTCTGGATCGGTCTTGTCGGCCAGAACATATCCGACCTTCCCGCAATAGGCCTTAAACTCATTCAGATCGACGTAACTGCCAGCGTCAGGATCGCCGGGAGTGACGACAAGTGCCATGACTATTCATCTCCAAAACAAATCTGACGCTTGGCGCGCTCCATCAGAATTAGCGTTTCGCCTGCGTTGGCAGAGCCACTCACCCAGAACGAGCCATCTTCAAGTTGCCCAAGCACAGCTACGGGATGCCACTTCGATGCCTTCAGCGGTCGGCTTCTGAAGCCAGGTCATCTGCGATGGTTCGTAGCGGATCGGGCCGAAGCCGCCGGCGATCTTGACCGAAGAGTGCTTCTGCTGGGTCAGATCGGTCGCGGCTGCTGCGGCTTGTGCCGCATACCGATCAACACGACGCTGTGCTGAGTGAATGGATGCAAAGAACGACTCCTGAAGGAAATCGCCCTCAAAACCATCCGTCGTGAGACGGATTGCGCCGTTGGAGGCTGCATTGAACTTCTGCACCATCTGCGCCAGCGTTTCGATGATCGCGGGCATGAAGTACTTGTTGAATACCTGCATCTGGGAAAGAGACATGGAAAGCTCCTATGCTGTGTCTCGGGATTGTTGGTTGGATGGACTTCCCGCCCGCTTGATGCCGCTTCTCATCCCGAGACAGCAGCAGAGAGGTTTCAGGGCGATCAGCCCTTCAGTTCAGGGAATTTGGCTGCGATAGCCGCCGCTCGTTCTTCACGGGTTCCACCGAAATTACCGCGTTGCGGAGGATTACCGATACCTTACCGTTCGCCAGCAGCTCTGATGCCTTAACCCAGACACTCTCAGGCTTGGTCTTGTCACCCACATTGTATGCTCTCCGGTACGCTTCCGATGCATTCCCGGTTTCCACATAGGCGCGGGCAAAGGCCTCTTGTTTCGGCGTGAGACTCACGGTTGTATTCCCGGATGATATTTTATGGAGGGGTCATGATTTTTGTTCGAGATACTGTTTCAGGGTATCGGCTCGGCACAATCAATCTTCGGTGTCCAGCATGCTCTCATAAAGGTGCTTTTGCATCACTAGGACATGATATTGTTGATGCTGTATGGAACGAATTAGATACCAACGGCAACAAAACTTACTCAAACATCCGAGCCGGCCTCAGACACTGTCCAAATGAGAAGTGTAGTGCGCTCGTGTTCTTCGTTGACCATGACGGTGATATGACGGTGTACCCTCCAGAAGTGATCGACTTCGATAGCTCAAATCTTCCGCCACGTATTCTAGCGTCTCTAGAGGAATCCATTAAGGCGCATGCCGCTGGGTGCTTCCGTGCATCTGCTTTGATGGTTAGGCGCGTTCTAGAGGAACTTTGCGAAGATAAGCAGGCCAAAGGCAATAACCTTATGCTTCGTCTACAGGCGCTCAGTTCGACAGTCATTATACCAAAAGAGTTGCTTGACGCTGCGGACGAACTTCGGCTCCTCGGCAATGATGCAGCGCACATCGAGGCGAAGACATATGACAATATAGGCGCTCAAGAATGTGAAATTGCCATCGAACTTACAAAAGAGCTTCTAAAAGCGGTTTATCAGTACACAAGCCTCGTCTCGAAATTGAGGGCACTCAAAAAGTCATAGACCCAGCCTGGATAAGCTTTGGCATTAAACGGCAAACTCAATCTCATGC
TTTGCCAGATCAACGACCGTCTCCCCGTCATGGCTGCGAAGGGTGCCGCCGGTGACGGTGCCGATGTTGGCGGAATTGATCCATAGCTTTCCCTCGTGAAAAATCAGAGGCTGGGTGATCTCTGGATTGAAGTCGGATCCGGTCGGCTTCTCCGCCCTTGGAAGCGCCATTGCCGGAACGGCAGCAGCAACAGGAGCGACGCCGAGGAATTTGAGGAATGCGCGTCTCTTCATTATTTTCCCCTTGCGGTGGCTATCCCGAGCACAACCAACTGCATCAAGATCACAATGCCGATGAAGTGCCAGAACGATTGGAAGGCGAATTCAAGGAACGCCATCACACGACGCCGTTGCTTGCTGTAAGCCTGCCCTCGGCAATCTGGCGCTGACGGCGAGCCACTTTACGAGCGCCGTTTGGCTTGTACGTGCGGCCGCCAAGGCGGGTGCGTTCTCGCCGCCATTCCGTATCTCTACGGCTCGATCCGGTTAATCCGCCTCGCGCCTCGCATTCCGTAGCCCTCATGTCTCGCGCTACCCGGATAAGTGGGGCGGTACTGCATGAGGGCGGATATCTGCGAGGAACAGCCACGAACGAACCGCAACCTTCATGGCTTTCACCGCATCGCGTTCCGTTGAGGCATTGCCTCGAATAAAAAAGCCGCCCTGCGAACAAGGCGGCTCTATATATAATCAAATATCAAATTAGAACTTGTAAGCGACGCCCAATCGGATGTCGTGTGTCTTAAACTTGTTGCGAGCCTCAACGCTGAGATCGCCATCAACTACACTGAAATCCTTATGACCGTAGTCGGTATAACGATATTCAAGACGTACGATGACATTGTCGGTTGCCGCGTAATCAACACCAGCGCCAGCGGTCCAGCCAGTCATAGTCTTGCTCTGAGAAACCCCAATGCTCTCTATACCGTCCGTGAGTGAAACTGAATTCTTAACGTTTCCGAAGGCCACACCACCGGCAATGTATGGCATCCATCGATCCATTGCGACGCCCATGCGAGCGCGAACAGCGCCTGACCAGCGGAGCTTGCTGTCAACACTAGTGGTGAGATCAAGCTCTTGATCGGTGTCACTGTGGCTAGCATCTAGATCATTATAGGTAATGTCACCGTCAACACCGAGAACGAAGTTGTTCCCCATGTCAAAGTTATAACCGGCATATAGACCGCCGAGGAAACCGTCTGGCTTTACTCGGATAGATGTGTCTTCATCGCTCAGCGTCGAACGGCCCCAACCATAACCGATCTGGCCGCCGAGATAGGCGCCATTCCAGGTGAAGGTCGGAGCAACAACAACCGGTGCAGGTTCCTGTTCAAGGACGGCGTCAGCAGCCTTTGCGCCAGTCGCAGCAACGAGAACGACAGTTGATGCAAGAAGAAGCAATTTAATGTTCACGGTTCCCTCCAGACATTCCAATATTCATTGATCTCTTAATTATATCATCCGAGTGAAATGTCTGTAGTTTTTCAGCAACACTTGCAACCATAACGCTCATACTGAACGCCACACATTTCATAGTAACCAAACAGTAAAAAAGCGGCCCGAAAGCCGCTGTAAATCTCTCCGTAGGCGCAAATCACCTACCATATTCAGGCTGACTAAATGTCATTCGTTAGTCAAGGGCGCTGGTTCTACGTTCTTCGGAATACCGTAGAAATCCACGAGAACATCCAACCCTGTGACGAGAAACTTGACCATATGCGGAAGATGCATGCCGGTATCTTCGTCAAGGATGCACACGCGACGGATAACGCTGGCCACCGGTCGGCCGGATTGATCCACCTTGCGGATTTGTTCACCCAGATCAAGTATACGTTCACGCGCCTGCTCTGCGGCTTCCGGCTTATCAATGCCAATCCCGCGCACCCGTGTCATGTCATGCGCTCTGGCTGATGGATACGGAATGCCGCTGAGAGCATAGTACCGGCAATAGTCCTCCCCGTACCGGAGGCCCGCCAGGTAGTGAGCGTAGCTTATCCGATC
ACCAAAAAGGATATGCAGACGCCCCAGAACGCTACCTGCTTCTGGCCGCTCTGCGATGCTGCGCGTTAGACCGGCCTTCATACGCGCTGCAATCACAACCATCTTGGTTTCTGCTTCGGTTTCATTGGTTCTGCGCATGACCGCGGATTTACGCCTGCTCTTGCGTCCGTTTGGCTCCCTTTCTGCCGCTGGCAACAACGGCCTGCCCTTCTTGAGCTTTCGTTTGCTCGCCTTTGCAATCGGCCCTTCTGCGAGGATTGCCATACGCTTGGCGAACCACATCGGATCGTATTTCGCCACACCAGGCTTCGGATTGAACGTGGTTTTCTGCTCTGGAGAAGAAATCGTGACGACCCGTTTCCACCGCTTTTCCGAAAGGTAGCGGCTGAAACGGCAGATCGTCTTGCGCCCACCCTTCCTGCTTTGATCGACATAGGTGTCGAGCCAGCGTAGAGCGTCAGAGCGGTCGGCTTCGGAAAGGCTCTCCCATGCCGTCAACGCTTCGCTTTCACTGTCGCTGACGTAGGAACAGATCGTTTCAAACATCGCCAAAGGTGTCGATGCATACCGGGTTCATACCGATGGCCATGCGACCGCGACTGCGAACGATCTTGAGAAGTTTATCGCCTTTCATGGCGCAGGATCGAAAGCCGTCCTTTGCCTATCGATCCAGATGTTTACAGGATTTCGTGTTTCTGATTTGGCTGTCCTTGGACCTCAGCACAGGAAGAACGATGAATTCCGTCTTCGCCTGTTCAAGAACCGCAATCGGACGCCGGTTGATATCGTAATCCCAATTCATCCGATCCTGTTGGCCGTTCTGGATCAACATAAGCTGGTGGGGATGACGTATTTACAAACCGAGTTCGGGAAACCATTTTCGGTGAAAGGTTTGGGCAATCGCATCTCTGATTGGTTCCGACAGGCTGGACTACCTCATCTCACTTCACATTCAGTTCGGAAAGGTTTGGCGACAGACCAAGCCCATAATGAAGCGACCGATAATATGCTGGAAGCTATGTTCGGATGGAAAGACGCGAAGACGTCGAAAATCTATACGAGAAATGCTGAGCGTGCACGTTTGGCAAGAGCAGCGGTTGCCAAGATCAATTGGGACGGTATAGGACAGAAGCTGCTAAGCTATGGAGAGGCCAAATAGGTGTGGCATGGTGTGGCACATCACATTGAAATATATTGGCAATTTATAAAGCATAGTGTGGCAATATTTCCGAATAAGCAATTGATACAAAATAAAATACGCACTCCGGGCGAGGCCTCCAAACTTTTATCATTTGAAATCAATAGTATATCAAGCTGGCGTCAAAACCGGCATGACATGCCTTGACGCATCATGCGTTCAAATTGATTCCCATAAATTACAACGGATTACAAAAAAGTCGGGTGAACAGCCTAGTATGGCGAATCCGAAATTCGCATCATTTAGAGCATTTTCCAACGCATCAAAACGGGATCAGAAATCATCCCGGTGAACTGATTTCCTCGCAGAGGCCACTCCCACGTTCGGGAGACCCGCTCTAAACATCATGACTACCTTTGACTTTTCTTTCGGTAGAGTTGTTCATCCGCGATTGACAGGACTTCCTCGAATATCTGCCCTGTCTTACAGGAGATGCGGCCGACTGAAACAGTCAAAGAACTTCCAACATTGGGCCTTTTCTCCCGCAGCAGCCCGTCTGCTCTCAAACTGATGTCCGAAAGAGCGCTATCGACGTCGTTGATCGCTCCCTTCGGTACGAATGCACAGAATTCGTCACCACCGATGCGAGCAATCAGGCTATCGCTGGATAAACAATCTTCCAATGCTTGGGCTACCGCGATTATCGCATCGTCGCCGGCTGGATGGCCATATTGGTCATTGATCTGCTTCAGATAATCAACGTCTACAATTAAAAACCAACCACGAACACCCGCCTTATTTGCGGCATTAAACTGCTGGATGAAGCTCCGACGGTTAAGCAGGCCCGTTAGCGGATCGACAC
TCGCCACCCGCGCCAGCTCATTGGCTCTTTTGACCGCGGTCCGATAAGACCGCTCCAGCTCTTCTAATCGTGAAAACCAGAAAACTCCCAACGGCATTGCAATCAGAAACGGCAACAGCAACCTGACCAAGATCGTAACCGTGTCAGATTTCACTCCAAGCGCGAAACGAATCCCGCTCGACAACGATATCGATATGATCGTTGCTATCACGGCTACAATTACCGTGCGCCTCCAAACATTACCCGACGGTAATTGCAAACCAGCCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP019391|918955:925147|921580_922291_-|WP_076773882.1|DBSCAN-SWA MNIKLLLLASTVVLVAATGAKAADAVLEQEPAPVVVAPTFTWNGAYLGGQIGYGWGRSTLSDEDTSIRVKPDGFLGGLYAGYNFDMGNNFVLGVDGDITYNDLDASHSDTDQELDLTTSVDSKLRWSGAVRARMGVAMDRWMPYIAGGVAFGNVKNSVSLTDGIESIGVSQSKTMTGWTAGAGVDYAATDNVIVRLEYRYTDYGHKDFSVVDGDLSVEARNKFKTHDIRLGVAYKF >NZ_CP019391|918955:925147|924427_925147_-|WP_076773160.1|DBSCAN-SWA MAGLQLPSGNVWRRTVIVAVIATIISISLSSGIRFALGVKSDTVTILVRLLLPFLIAMPLGVFWFSRLEELERSYRTAVKRANELARVASVDPLTGLLNRRSFIQQFNAANKAGVRGWFLIVDVDYLKQINDQYGHPAGDDAIIAVAQALEDCLSSDSLIARIGGDEFCAFVPKGAINDVDSALSDISLRADGLLREKRPNVGSSLTVSVGRISCKTGQIFEEVLSIADEQLYRKKSQR >NZ_CP019391|918955:925147|920176_920830_+|WP_076773881.1|DBSCAN-SWA MIFVRDTVSGYRLGTINLRCPACSHKGAFASLGHDIVDAVWNELDTNGNKTYSNIRAGLRHCPNEKCSALVFFVDHDGDMTVYPPEVIDFDSSNLPPRILASLEESIKAHAAGCFRASALMVRRVLEELCEDKQAKGNNLMLRLQALSSTVIIPKELLDAADELRLLGNDAAHIEAKTYDNIGAQECEIAIELTKELLKAVYQYTSLVSKLRALKKS >NZ_CP019391|918955:925147|920854_921112_-|WP_076773153.1|DBSCAN-SWA MKRRAFLKFLGVAPVAAAVPAMALPRAEKPTGSDFNPEITQPLIFHEGKLWINSANIGTVTGGTLRSHDGETVVDLAKHEIEFAV >NZ_CP019391|918955:925147|921096_921267_+|WP_154144825.1|DBSCAN-SWA MRVSSLFSPCGGYPEHNQLHQDHNADEVPERLEGEFKERHHTTPLLAVSLPSAIWR >NZ_CP019391|918955:925147|918955_919318_-|WP_076773149.1|DBSCAN-SWA MALVVTPGDPDAGSYVDLNEFKAYCGKVGYVLADKTDPEIEQALRRGTTWLDGTYGQRFIGEPATVEQALEWPRKNAVWRGALLPSTTIPQKVKNATSEAAWRELTTPGTRTVVLVNWLN >NZ_CP019391|918955:925147|919901_920141_-|WP_076773151.1|terminase|DBSCAN-SWA MSLTPKQEAFARAYVETGNASEAYRRAYNVGDKTKPESVWVKASELLANGKVSVILRNAVISVEPVKNERRLSQPNSLN >NZ_CP019391|918955:925147|923411_924038_+|WP_083699695.1|integrase|DBSCAN-SWA MVSNIAKGVDAYRVHTDGHATATANDLEKFIAFHGAGSKAVLCLSIQMFTGFRVSDLAVLGPQHRKNDEFRLRLFKNRNRTPVDIVIPIHPILLAVLDQHKLVGMTYLQTEFGKPFSVKGLGNRISDWFRQAGLPHLTSHSVRKGLATDQAHNEATDNMLEAMFGWKDAKTSKIYTRNAERARLARAAVAKINWDGIGQKLLSYGEAK >NZ_CP019391|918955:925147|922501_923425_-|WP_076773155.1|DBSCAN-SWA 
MFETICSYVSDSESEALTAWESLSEADRSDALRWLDTYVDQSRKGGRKTICRFSRYLSEKRWKRVVTISSPEQKTTFNPKPGVAKYDPMWFAKRMAILAEGPIAKASKRKLKKGRPLLPAAEREPNGRKSRRKSAVMRRTNETEAETKMVVIAARMKAGLTRSIAERPEAGSVLGRLHILFGDRISYAHYLAGLRYGEDYCRYYALSGIPYPSARAHDMTRVRGIGIDKPEAAEQARERILDLGEQIRKVDQSGRPVASVIRRVCILDEDTGMHLPHMVKFLVTGLDVLVDFYGIPKNVEPAPLTNE |
9 | Pseudomonas_phage(16.67%) | integrase,terminase | attL 911533:911553|attR 930108:930128 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation contains a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
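The protein headers in the prophage listings above appear to follow a pipe-delimited pattern: contig, region `start:end`, protein location `start_end_strand`, protein accession, an optional annotation, and the tool name. The sketch below parses that layout in Python. The field order is inferred only from the headers shown on this page, and the names `ProphageProtein` and `parse_header` are hypothetical.

```python
from typing import NamedTuple, Optional


class ProphageProtein(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    start: int
    end: int
    strand: str
    accession: str
    annotation: Optional[str]  # present only in six-field headers
    tool: str


def parse_header(header: str) -> ProphageProtein:
    """Parse one header line, assuming the five- or six-field layout seen above."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")
    contig, region, location, accession = fields[:4]
    annotation = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    start, end, strand = location.split("_")
    return ProphageProtein(contig, region_start, region_end,
                           int(start), int(end), strand,
                           accession, annotation, fields[-1])


if __name__ == "__main__":
    p = parse_header(
        ">NZ_CP019391|918955:925147|919901_920141_-|WP_076773151.1|terminase|DBSCAN-SWA"
    )
    # Check that the protein lies inside the reported prophage region.
    inside = p.region_start <= p.start and p.end <= p.region_end
    print(p.accession, p.annotation, p.strand, inside)
```

A parser like this makes it straightforward to confirm that each listed protein falls inside its prophage region, or to collect the annotated proteins (here `terminase` and `integrase`) for a region.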
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|