Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP022549 | Sphingorhabdus sp. YGSMI21 plasmid unnamed, complete sequence | 1 crisprs | csa3 | 0 | 1 | 0 | 0 |
NZ_CP022548 | Sphingorhabdus sp. YGSMI21 chromosome, complete genome | 2 crisprs | DEDDh,WYL,DinG,csa3 | 0 | 0 | 3 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP022549_1 | 18917-18990 | Orphan |
NA
Consensus repeat of NZ_CP022549_1
|
1 spacers
spacers of NZ_CP022549_1
>1.1|18941|26|NZ_CP022549|CRISPRCasFinder CTGGACCGCGAATCCAGTGGCGAAAA |
CRISPR arrays and Neighbor proteins around NZ_CP022549_1
The CRISPR arrays of NZ_CP022549_1 >merge|NZ_CP022549|1|18917-18990|CRISPRCasFinder AAACTTTCTTCGAGGAAAGTTTTTCTGGACCGCGAATCCAGTGGCGAAAAATACTTTCTTCGAGGAAAGTTTTT >NZ_CP022549|1|1|18917-18990|CRISPRCasFinder AAACTTTCTTCGAGGAAAGTTTTT CTGGACCGCGAATCCAGTGGCGAAAA ATACTTTCTTCGAGGAAAGTTTTT
>NZ_CP022549.1|WP_100095772.1|17665_18913_-|replication-initiator-protein-A MARTKGVRGRRAVGEQFDLFLPYIADLNFRDQREMMERPFFSLAKTRRMKPIDYKSPDGKLWVHVSANPDYGMATIWDADILIYCASVLADMARRGQNDIPRTLHIMPYDLLRAIGRPTTGRAYELLGQGLDRLVSTTVKTNIRAENRREATFSWLDGWTQLVDEKTERSRGMTLELSNWFWEGVLMKGGVLSIDRAYFDISGGRERWLYKVARKHAGGAGDLGFAISMPTLFEKSGAEGQYRRFKFEILKLAEKNGLPGYDLIIEDVERGEPKLRMRKQGDHEPVKLASAGQGSETPKAASPENPVRTKPAAAAPKSDNHVTKTGALDEPVIDARTMIKRTLTGLSTAATRGYMTDETIELLRSECPGWDLYRLHADFETWVNGSEDRTPVDWQRAFIGFVRRHNKKHGHEIRR >NZ_CP022549.1|WP_100095771.1|16688_17120_-|DUF2384-domain-containing-protein MDSQDYSMNQLLGVSSDGEDLLSLALHIEEGLPVEAIDHLVAIVAPGNISLKYRFLPKPTLARRKKSAKKILTSEEGNKLARVAKVMRFALEIYGDIEKARTFLMKPHIMLGNRIPLEIVLATGPGADSVINILGRAAYGGGV >NZ_CP022549.1|WP_100095770.1|16209_16692_-|RES-domain-containing-protein MSAQHTDQILTAYRIGALNGKHPLYGDEGARFYPGRWNKWSSPIVYCSEHYSTQILEKLVLANAVMPPDLHFITIIIPKGICYEKFQTAAHPGWDGKSQRICQEFGDKWVRERRSAILFVPSIPARFETNILINLAHDASKAITYERAKPISWDDQLYRS >NZ_CP022549.1|WP_100095769.1|11705_15917_-|methylase MPIDPYSISHPKASILLSAGVNLSSKLDKPLTRDMLIKHLAAATGTSMVQFTIRDAFEMLECAIVAHIQRDEKAASQQLPSLIATYPDLPTHTVRSELQVRRQQFSTPAPLALFAQVRAALTPDDFVLEPSAGTGLLATEALRLGATLRLNELDTCRRALLTGLYPGAAITNCDGAEIAARWQGRAPTIVLMNPPFSSNADGYEDHFTALRHLHSALKVLSTGGRLVAILPSWIDRPGRNAASLAKMITGASIVERFTLGEGCFAKHGTNIATTLLVLDKLPDLPMAHNIHIAALAELTAQPKPLPRREITPPNKTRSPTKSLFGGFVRPAEPVQPIPAHRPVEDAITPVVYTALAEPAALGEQVGDYLPYRPARMIFEQAGEHPTALVESLAMGSVAPPKPHYRPMLPTRIVKDRLLSVAQLETVAYAGQAHADWLTGNIDDENKGEVIEQRIRKGYFLGDGTGAGKGRQIAATILDNWLKGYRRHIWVSKNATLLEDARRDWTAVGGIAADIQPLKNWPLGTNVAMSEGVLFVPYATLQSQREDSSRLTQILDWAGPDFEGVIAFDEAHAMGGVAGGETARGKTKGSEQGIAGVTLQDKLPAARVLYASATGASNVNNLAYAVRLGLWGPNTAFPTRDLFIEEIAAGGIAAMEVVARDLKSMGLYAARALSFAGVEYEMLEHILSAEQIETYDRYAEAWSIIHQNMEAALEESGVIDAVSGDTLNAGAKGAARSRFESVKQRFFQQLLMSLKLPSLFPAIETHIADEMAIIIQLVSTGEAMLDRRLADHDENGTDELDIDLSPREYLFDYLTRAFPTRQMQTYIDLEGEMRSQPMQDEHGNPVHCADAIARRDACLEQLGAMPPISSALDAIITRFGEDKVAEITGRSRRLSTASDGRQLVQRRSARSNAAETDAFMEDRKQILIFSDAGGTGRSYHASLDVPNQRRRVHFLLEPGWRADAAIQGLGRTHRTHQASSPLFRPVTTDCRGERRFISTIARRLDSLGALTRGQRQTGGQNLFDPADNLESDYAVSALNAWFRLLHSGKLKSASFQDFQKRTGLELEGEGGELRMDLPPIQRWLNRILALPIAMQNGVFDEFLGLVEQRVAKAREAGTLDLGVETIAVQTLEIVSERLLRTDRCGATTNLAELAITRKRFVRSNDDIDFRRSWDQSAEPMKNDKSGKVALMVSQRDSLLDDGTTVRRFKLIRPSRIDHIDDQALAESNWRKIHDTEFAKLWTVEADEARDTLENDTLFIATGLLLPVWRKIPSKMLSVVRIAAADGRSILGRVVDAGDLGALCQGLGLEAPKLTPNAMIESARSGARVPVNGLDELTLKTSRVAGAQRLEIIGAHAARLGWYKSRGCFTEIIAYKTRLFMPDTQAVTILEAICAESQTERTKVA >NZ_CP022549.1|WP_100095768.1|9401_11465_-|ParB/RepB/Spo0J-family-partition-protein MTIQIIPLNKLRLSLHNVRKSGAETNLDWLVANIEAKGVLQNLIGGTAKKKGFYDIFAGGRRLRALNILAASGKIAKNHGIPVMVSDESAEGIAETSLAENFLREKMTPTDECQAFLHFLGTEGDIDAVAKRFGQTRRFVEGRLRLATLAEPIFDILAVGEMTMEVAKAYAATADQAKQLAVFEEVKGSWLENNAHEIRKRILGASIPASHPVALLVGEDAYIKAGGRVARDLFTEVEAAEWLDGDLAVDLASTLLQEAAAIHAEKSGIGTVVPLLAKGSTWTDREELMQARLDREPLSEAAEARIAEIEAEHAQIEQLFESTEMDDEELDAINERVAKLEREEEELRDTEVLIDEERKASLTQFMVIGEDGKPRVETNYWEVPQARKLGDGDGHKDPKKVAAKVQGLSGVLADELSIARRDILALHVASNPAIALDLAIFTLADRAVGKYSEHGCSLEVARRNDPNVRGNLPDSLAGAALGEIRNQLDTTWAEHSTAAERFEAFRALEEETRATWLAVCMASSMEASLGGKGLQHSNPFHDHLGQLLDIDVAAWWRPSAGGYFARVRKAGMQRALDAIGGPVLAAKYASSKVAEMADACEKLCDGSAITEPEIREAGIAWLPAAMRFDPNQVEETGADGEDAPHDEESEGGAAGEKNNHPAGEDAESGSEGEEDSNSDAATTSQNA >NZ_CP022549.1|WP_100095767.1|8324_9305_-|DUF1738-domain-containing-protein MKTDISATITATIIAKLEAGTKPWVKPWTGQPISRPLRHCGSAYRGINTLILWMAAEERGYISPYWMTYRQAIILKGQVRKGEKSSQAVFYKTLVTKDGTADKESIDDNAGGSGDEGQTRRLLRQYAVFNAEQIDGLPAHFYPIPKPPQKIPESIHRPRLEAIFAKIPAAVRHNGNQAYYNRARDEIVLPAIDQFPNFEAYFAVRAHETAHWTGAKRRLDRTFGKRFGDAAYAMEELIADIAAAILGATLGLPEAQLDNHASYLAHWLKVLKSDKTAILTAASKADEAVNFILGFAQEGPCRIVGREPQLVAACSPSRPKSTVMTC >NZ_CP022549.1|WP_100095766.1|7831_8314_-|hypothetical-protein MANYFTHISFKLAVTREEAEQFVSVIASAASIEDGNGPLLTPEIEKAFNTDSQSAEQNFCEIMDDLIFGIECIFNETSSTLTIFDSDGAPNLSALGQCLQCLYPEKLPLGFVYADTCDKARADGFGGGYFVITGDTISQQTLAQMLSNDLTALAETSGVS >NZ_CP022549.1|WP_100095765.1|7644_7842_-|hypothetical-protein MSADQRTAGHVATLENAYRVATRVAEATNQDMQVSATGKPERPFIAEPANTGTQRVIARILADRD >NZ_CP022549.1|WP_123906366.1|7407_7620_-|hypothetical-protein MRVTTYKKRSCEAIGVPIPVRDLHRCKVSHCNNTGAYSVLVIVAAVDYLIDISQHDLQILADVKANKPSA >NZ_CP022549.1|WP_100095764.1|6828_7332_-|hypothetical-protein MADRCSASITIGGKLAAADLPRLLTAISDEGLSTEFDGPKFELKELVSGEPLLLCAHEVSWGTFTELEAFCRTHKLAYTRWTGQCAGVWGCFRTVYRGAAETREGDDAVDEYDVSEDDQILIGEQLARTLGSFRAIADHFARAAFAVPPLVIVASGPEQTEPIAPQA >NZ_CP022549.1|WP_100095773.1|19025_20090_-|ParB/RepB/Spo0J-family-partition-protein MAGGKQKDYLADLLADDQPVSADEAASIEPSATPAETSGRLARGGGMALLGRESALARVASGDVRQVTQLRLDPTRVRIWKGNARIQERLNEENVRDLIDTIIAEGGQKVPAIVRRVNDDAVYDYEVIAGTRRHFAISWLRTNSYPDMTFLAQVADLDDEAAFRLADIENRARKDVSEIERARNYASALEQHYDGKQRRMAERLKLSEGWLSKMLKVATLPDIVLAAFANLSALTLNPAYKLSQALADERRHKSIIGAAQRVTEENGQRAQDGQPPLGGTVVLTRLMAAGEADKQRPKLYEGISRHNRPALSVTTASRQGLTIKVHAASGADEDEMVALFREALASHEYKTEVE >NZ_CP022549.1|WP_100095774.1|20150_21347_-|AAA-family-ATPase MNSEVNRIAEMAEAGERMIERLRKKAFLPEARKGLAVRYGIAEAAQLLGCSTNRIRMAETDGRLPPPPPTKNGRRPGYSIEDLLNMRQVLDASPARAPLDQPAIIAVQNFKGGVGKSTVTTHLAHYFGVLGYRVLVVDCDSQATTTTLFGFNPHFNITREQTLYPYLSIEPTQTDLLYAVQRTPWPNVDLIPSNLELFDVEYELAASGADGGSVLAARFRKLKSGLLDLAQQYDIVLLDPPPALGTISLAVMQAANALLIPLAATTPDFCSTVQFLSMMEQVTEQIRQAGIDVSYDFVRLICSKFDSNDPSHAMVQQIMEQSFGPALLPVPILESAEISHAALRMMTVYELERPIGTPRTHKRCRANLDQALGQVEQLVRRNWGVTSPASAAEVVNAT >NZ_CP022549.1|WP_100095775.1|21689_22193_-|helix-turn-helix-transcriptional-regulator MDYREDRQYLEEQEELSRPQPKIRITPQKFLPLIIEVMARGNIKSSDLVKRTGISKSKISRLLSEQRQIDNAALYLIFDALGIDPMRALLAVGRFGQWEQYFDLDVEIIADLIDVLPSFLSKARSESIRTSISRPGAIVLAERLSEMIAKNDRETEKRQLERPIAGI >NZ_CP022549.1|WP_123906367.1|22390_22573_+|hypothetical-protein MSKPWSDSHDQVAVLTNNIANVPTFVAKMEIIVATAAKTDAKIGNIVANLTTIIDSHGKK >NZ_CP022549.1|WP_100095896.1|22690_23014_+|hypothetical-protein MNIASQCGPIPLASIVKAVAVTAVVAVAGSGVAIAGADATFNTALTSFTGFLEGSGGKIITVLSLAGGIVGLASGRFSLGQVAIPVGVGVGAGTGVPIVTAAVTATI >NZ_CP022549.1|WP_100095776.1|23070_23358_+|type-IV-conjugative-transfer-system-protein-TraL MADKYAIPARLDDPELIGFWTLDEFIAMVLPFIWGVLTQHILIGIILGFGCWWGLRKAKAGRAASWLLHLAYWYLPGGFVGLRATPPSFLRLMAG >NZ_CP022549.1|WP_100095777.1|23363_23939_+|conjugal-transfer-protein-TraE MDLAFSHTQSQRVLKQRNMLLVACLVLAALAAILGIAAGSRDREVVLQPVLRTPLTLSSSGVSREYLEAVTRDAAVLTLNRTPQSLDYWMNAILEMVHPSAYGEVKADLLKIMEDQRGSSIAQFFTMESLKVDTEGLSSEVSGVLHTMVGRQEVSAVQKTFHYGWVYNGLSLKLVQFGMVEKVEPKKAVSS >NZ_CP022549.1|WP_100095778.1|23935_24739_+|type-F-conjugative-transfer-system-secretin-TraK MNTVFWCCGIACGSILASRIHSISTRSLGLGLIGASSIALSSPALADQTLMASDNARVDCIASSRDLTRISLVGDEIASVSKLQTGNPNEDFSVVNEPVRGDIYVSVPEGFSPKVLSFFATSKKGYVYKFACRLSGEEAQQVFVANPAIAGEQTGDSRIVAAADPQEQAVKLVQAMYTSSIPEGYRMRQPVRASVRVGDLKVRMIAEYRGTEFTGKVIRIENQGSELLVIADDVVAPSNATAVSIAEPNLAPGASTTAYIVQPAGEN >NZ_CP022549.1|WP_100095779.1|24738_26064_+|conjugal-transfer-protein-TraB MKFMDRFRKQKITEDTPDVGEDEVSPVGHAMSGNDDVQRKQRLLLGGVALTALVGGAWWVLDSSVSEEELANSGVKEVQVSTNDMVNKNMSEQEWIARSENRFESTDNRLRTVDGQQAQLAAMQEEMAALRGQNSAMSSDGQRVLSAYQTENETLKSQLAAAQSAPPAAGGPNGLYGPSSAALYQQPSGPGAAPLPAPSPREVKTVAFSNGPGGNAVRAERGTTVFSDSPDYLPANSFATARVIVGVDASAGVSSQTDPLPVVLRITGPARSVADKGRVLTTKLQGCLVNGAARGDLSAEKVYVKLQRMTCAQPGGRYAVSDVKGFIAYAGKSGVRGRVVSREGGLVTQAFLAGLAGGFGRGFSANTDSVLQGSNITVNGKRDKLGLGEIAQGGLGEGVSTAADTVSKYLIERAEQYQPVIEMPTGIDVEIVFLDGAYVRN >NZ_CP022549.1|WP_100095780.1|26081_26930_+|DsbC-family-protein MAKYNWVIGASLVSVLIAGTAFAATQSANATVEAALKTRLPKTEFAKVDCNVIGSICEVTAGKSIFYVDKQARYLIVGHVFDMETRQDLTSARLLEINPEALLGGASQISDDGSDELGSAQAGTPVARDYPMPKSRIKAGAGRTERVSLASLPSSGAIEWGSGKTKVTVFTDLRCGYCRALTQQLETMDVRVVERPISVLGTRDLSNRVYCAKDRPRALHAAYAGNIPESAKCDTSGLDANERFARENGFTGTPVIVRSDGAVHHGYLPKDRLLAWLKGASS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP022549_1 | 1.1|18941|26|NZ_CP022549|CRISPRCasFinder | 18941-18966 | 26 | NZ_CP022549 | Sphingorhabdus sp. YGSMI21 plasmid unnamed, complete sequence | 18941-18966 | 0 | 1.0 |
1. spacer 1.1|18941|26|NZ_CP022549|CRISPRCasFinder matches to NZ_CP022549 (Sphingorhabdus sp. YGSMI21 plasmid unnamed, complete sequence) position: , mismatch: 0, identity: 1.0
ctggaccgcgaatccagtggcgaaaa CRISPR spacer ctggaccgcgaatccagtggcgaaaa Protospacer **************************
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP022548_1 | 2034684-2034772 | Orphan |
NA
Consensus repeat of NZ_CP022548_1
|
1 spacers
spacers of NZ_CP022548_1
>1.1|2034707|43|NZ_CP022548|CRISPRCasFinder TCGGGGGTGGGGATATTCTGTGTGTCCGGCTGGCCCACCCCTG |
CRISPR arrays and Neighbor proteins around NZ_CP022548_1
The CRISPR arrays of NZ_CP022548_1 >merge|NZ_CP022548|1|2034684-2034772|CRISPRCasFinder TCCCCTCCCGCTTGCGGGAGGGGTCGGGGGTGGGGATATTCTGTGTGTCCGGCTGGCCCACCCCTGGCCCCTCCCGCTTGCGGGAGGGG >NZ_CP022548|1|1|2034684-2034772|CRISPRCasFinder TCCCCTCCCGCTTGCGGGAGGGG TCGGGGGTGGGGATATTCTGTGTGTCCGGCTGGCCCACCCCTG GCCCCTCCCGCTTGCGGGAGGGG
>NZ_CP022548.1|WP_100093877.1|2032216_2032552_-|phosphotyrosine-protein-phosphatase MSGTAANLLFVCSRNRLRSPTAEAVLNSVDRVTAISAGTNNDAETPLSGDLILWADSIYVMETGHRKKITQKFGPLLKDKPVRNLAIPDNYTFMDPDLVAIIKSRFPEYFA >NZ_CP022548.1|WP_100093876.1|2031848_2032220_+|DUF4345-domain-containing-protein MAAFRILSFLIGFIPLWFGVNGLLFGAAEHMGGDAFSTAMDNQYRYLSGVYIGVALMIFYSAGEIRKRAKLFRFAMLFFFIGGCARAVSYLTVGPPPTEQFAAMLVELLSPLLLLWQARVLRY >NZ_CP022548.1|WP_100093875.1|2029989_2031456_+|cytochrome-P450 MEWLADINESSYHRLKFQIRSTPVATIAAKPHGYPFQESANPHWVRRPSEEELAHIPGEKGLPVLGCTLDLLKDPGGFGRRMFEQYGSVYRAFSFGGWSVNMLGADATELILFNKDKIFSSEQGWGPILNNLFPRGLMLMDFERHRADRKALSVAFKPGPMKHHCQVLGEGIAQRVGEWSGTEFNFYPAIKSLSLDTAASAILGIPWGPEADKINKAFVDEVQASVGVARKPIPFTKMWKGVKAREYLLDYFTPQVKERRAKGGEDIFTQICLATHENGELLTEGEVVDHMNFLMMAAHDTITSSATSLIYLLAKHPEWQDKVREECLAVAPAGTTLSMEQLGKLELTEMAFKEALRLIPPVPSMPRRAVKEFTFRNYTIPAGTSVGVSPSFVHKMEKHWPNPDKFDPLRFTPENSVGRHKYAWVPFGGGAHMCLGLHFAYMQIKILMHAMLTQNRVLLQGGPDYEPKWQVFPIPQPKDGLPVRLEHL >NZ_CP022548.1|WP_100093874.1|2029143_2029887_-|hypothetical-protein MTNLLTARLLKSAVAAGALLSLGGCMYGGGAYGDGYVNGRGYDCDPYAPFDDYYACDYGYGFANIGYGGGWYDQYYYPGYGIYIFDRGGRRLAMRDNHRRYWARQRAAYGGHHARDRRDRDGRADRRGDRRDSGYDGRDRTRDPDRNSSRRQRDSDDRDNRGTRNDRRNRTGTSGTKAGGTDQAAPSRDRGDRGSRTDRPPASGAEQPQARPQPAPRADSPPRSTRADPIRNSRETSRRGRKPLSDD >NZ_CP022548.1|WP_100093873.1|2027897_2029031_-|acyl-CoA-desaturase MATAFSDIQPDEYTPSEQHPQHGVIIGGPAARAKRIEYGILLSSMVLGSLGALYWTIFHATGWVEWSAFLFGFALINMGVGIGYHRYFTHRAFEAGPKMRLAIGIMAQMACQASVLKWAADHRRHHAFADGVGDVHSPYVDGHGKNMSKWKGLGIAHFGWLFDNTTSDMKVYGKGLIDDPQVLWCHKHRWSIAIFSSIVLPALWALAFGGPEHIIGTILIGGFFRNFFFLNFVMGVNSIGHVFGSQRFETKDGSRNNWFMAWMTFGDGWHNNHHQHPRAASSQIAWWEFDLNGQIIYLWEKMGLVWNVQRAPAYIRNEKGEWVQKKPKVATAKFDEPEVMSADDGGILFPADKPCSRRTSSAISNPPSEGPRTSPLM >NZ_CP022548.1|WP_100093872.1|2027432_2027906_+|type-II-secretion-system-protein-M MIDNAKNWFAALSQRERILVAIAGLLLAGLVGYYGIARPMFGAMTAAEEGYVEAVERQARIETKVAALSQPVDGQVAKFSGAIDAFVSQSAGETGIAVASVTPQSGNRVNMVVESAKPTALFGWLARIEREGVAIESLSVNPAGAGTVSATLTLRLH >NZ_CP022548.1|WP_100093871.1|2026308_2027436_+|hypothetical-protein MNPDLLLIAFPAADDEALHWWHVSGGAVRAAGCDPDPVLASGAYGQDAVQADLSVIALLPSHLTSIKDHDAISGANESQTLAAATLAARAQSLESDKVHIAATIDGEGGAVTASVGRDILSAGLVRLQALDIDPDAILPSGWLLPVAGDDQAVAADFGFDRVMRAGHIIAVDDPALREFLFGDRTVEQITGDAFDEMLAGAGRKTDLNFRSGPFAKKTKRAMDAKQKRRLGWMVAALVIVSLAIPLMQLAKYHWAANAADEAALASAAAIVGDQQDPESAERALDQRLIAENRGNIMFPVPASALFSALQQVPGVSIVRLDYGENGIVSATLSAVRNEDINPALLAIQKAGFLITATPRTDATGSAQADITVRAP >NZ_CP022548.1|WP_164089107.1|2025822_2026296_+|type-II-secretion-system-major-pseudopilin-GspG MTTEIEHPDTVSAPEPKTQKKRNGFTLVELMVVIFILGLLTTIVVINVLPSQDRAMVEKARADIATLGQALEMYRLDNLAYPGSSDGLQALVTAPPSLATTARYRKGGYIKKLPDDPWGRPYQYDNPGRQGPGYDLYSLGADGAPGGEDDNADIYAE >NZ_CP022548.1|WP_100095542.1|2024578_2025802_+|type-II-secretion-system-inner-membrane-protein-GspF MPEFAYIAIDPKGREKTGRLAAASDDAARAKLVERNFYVVKMEAAQGSIGNQAKASAGLFGPKKLSSKELTLFTRQLSSLAQVSPLEEALRLIGNQNEKPHVQQRIATVHSGVIEGQRLSEAMRREPKSFPPLYRAMIAAGESSGTLPDITERLADLLERQAELRGKLIGTLAYPSVLALVSVIVVALLMIAVVPKVVEQFDDVNQQLPLVTRIVIGISNFLADWWWAIAIVIGAILLIGWRALKEPQLKYRFDAFLLRLPFFGRLLRDLHAARLARTLSTMVASRLPIIEGLRLTTDTIHNSVLRKASEDMVEAIRGGGSLSTALRNTGVFPPMLVYMTSSGEASGQLDDMLARAAGYLEREFDNFSSTALSLLEPIIIVALGGLVALVILAILLPILQLQNLAGL >NZ_CP022548.1|WP_100093869.1|2023038_2024586_+|type-II-secretion-system-ATPase-GspE MEIARGTEAVVSEADGGLGALAAVRDLPYAYARDQGVLIQDRSGERLSLALREGADPLALLEVRRYLALPFDVERVDGPAFEKLLSAHYAMDGSAAAMAGDMALAGEGLDDIAGDIPSAEDLLDSADDAPTIRLINGIIAEAARQGASDIHIEPYETGLVVRMRVDGVLTEKLRMPPHVAGVIVNRIKVMARLDIAERRIPQDGRISLTLGGKLLDVRVSTLPNRASERVVMRILDKESAGISLDLLGMSPNTHAILTDALSEPNGIILVTGPTGSGKTTTLYAGLKQLNDGSRNILTVEDPVEYAIEGIGQTQVNAKVGLTFAAGLRAILRQDPDVVMIGEIRDRETAEIAVQASLTGHLVLSTVHTNDAVGAITRMRDMKVEPFLLASTLRAVVAQRLVRKLCEHCRTPIQADGSLASLLGFDNGTVVYRETGCEHCNQTGFQGRIGVFEAIRIDDTLRKLINDGGDEAAITAHAFRNQPHLGAAARALVRDGLTTPEEAIRISRRESDAVDA >NZ_CP022548.1|WP_100093879.1|2035778_2036129_+|hypothetical-protein MNNIEILVEPKSKSHVGWFVHLFSWNKQRRDLIAHRLKLKSRTGGKFTLSSEAKLPNGTYGLRLHVIESGNKASITVVEKPPISYPMGAAWPMTVEVPAQFTQDSDTWFFEHGDEA >NZ_CP022548.1|WP_100093880.1|2036125_2037241_+|hypothetical-protein MRFVNIFNTVSVLLATVPMISISQDADAQTLRGPYAVEEGPPAPSDRFDTITRGVSTSIDISEKEAKASASIGGVLTADRDTGGSDQKQDAFFWKLGLDAPVGGTTDLLDPKLDVLDNQVSLTGSLTWKRFSSSLSDLTDPRFQAYVRAAIEACEEQKKKDCEAHRRVPASVFVQDHLPEVHRRLSSAIYDPFYAVSVKGSVGVKEFDYILPGAFTDEDDTKISYSLAIAGVIYPSDTVSAWKLEAEYGNAFEAAEEGIVCRTVVTDPAEDCKSAAPSGPTSKETLVVRAEYRRFFMLSDVNKGIGIAPTGSVDFLSGDFGLEFPIFYKVGSKSPVLPGIKLGYTKDASKPNNKDEDFTVAFFIKTSFNLN >NZ_CP022548.1|WP_123906297.1|2037255_2039286_-|hypothetical-protein MTELREIDVLIVGGGFSTMPLIRELDASGIGWLMVSPMMPIWQQLEAADALDFDLVSSLQSSVYSFELVEMLREQGEDFSDGFPTAREFYAIHKKYAARYADRIHHGLVDRIDNHADHSIVHLESGEQFRAKHVVVATAFRRKMNANLKQITVDESFAGKSVAVTSTGDSSNLLIAKLVAYGAKVHLVSNGFIILDKMFATFSPFDDGPRFVPLDQLECHNFSEISKWSYRAFIDGGYIHGLIHHRFAQLFDRNSLGVRHPKSIRPHKDIRHFFKAKAPVENGHIAIKYWPIDVYKLYCEETLEQKIADGYLINDLPFFLEHGHVKLWDKAATTIDHEAMTLSEDGESASFDMLIDGDQEIPAVPEIRAHGESGVEVFAYKPREQFMGVTSPGLQNIYTIGFTRPFTGGLNNITEMQCLLVHRMIADPDFRDSIRTNIDQRIADYNATYYTKRPEKKTDHLVWYGTFTDEVAKLLDIRPARADMPGKKGLMRYYMFPNNVFRFREKGRYAVDGVDKLIEHTSKQYHDYKVLALLVIRYPFFELLALATILLAPVPWWVKIPAAIIHNRLPFTSTLVGKFGLPTRESKAIFNYRKAISFPVLAYPLVAAVVWAAAGMDAAFAFSAGLLAYVYAMIHLGTAKGWNRKFFCDMKSKRSPEMVGFFERYRAVFGRVRSNH >NZ_CP022548.1|WP_100095544.1|2039738_2040407_-|guanylate-kinase MADNIASQNIDNDHRELNRRGLLFVLSSPSGAGKSTLAKMLLEADDEIAMSVSVTTRPKRPGEEHGKDYYFVSGDQFEEMVADQSFLEYATVFGNRYGTPSVPVNQSLEAGQDVLFDIDWQGTQQLYQRAGQDVVRVFILPPSLGELRNRLESRNTDAPEIIESRMQRAASEISHWDGYDYVLINDDLDACFTKVKQILATERMRRARQTGLIGFVRDLMAE >NZ_CP022548.1|WP_100093882.1|2040524_2041643_+|AI-2E-family-transporter MIQGRISTIFYAIGITIMIGWLFYVGQDIILPIVTAIILLQILYAASATVARIPGLGQTPLWLRATLALIAVLAMLFAMTRMLVASLQNLVPSLPVYQRNLENLFAAWFPMPADESIGRPEVSELMASPSAIFDRGTEAVAKAISEFATRFFTEIDIAALAQSVLSSLTSFGGFVFITILYASFLLSEITGFPEKVRRAFGSDSDSTDTLNMITRINHDIGSYLATKTLINIILGVVCWVILLILGVEYAALWAIIIGLLNYIPYIGSFVGVAFPALFAVAQFGTLTVPLIATGLMTTAQVIIGNVVEPKMLSKSVNLSPMVVLVSLAIWSSLWGIPGAILAVPFTTILMIVLARRPGTRPIAALLSSDGNI >NZ_CP022548.1|WP_164089109.1|2041639_2042920_-|MFS-transporter MATTAQTGHSPRKFYGWTNVGLFFFIQFAASGFVYFAYSVVFPVMVETMNWNRGSASIAQSVALITLGLSYPLTGYLLSRSGVRRTVTTGLLVMLAGLLLLVFGVTELWHWILVWGVVMGLSFGLTGPICAQTAMISWFSIKRSTTIGIVMTGGALGGAMAQPALAEMMERFDSWRAAWTIAAAMVVIALVATRFIINRPSDIGQFPDNIDPARAVSEEQAHHTRPRTYRTDHAWTMREIIRKPTLYLMMVISVGYLGTFFALLNHGVLHLTDNGLKGLEAAGIFGLLILGSGLARIPAGWLGDQFELRWTIFGSIALMALALAGFWQGQEIWLLSLMGMLFGAGYGSLLVLLPAAIGNYFGERAFPIINSLFAPVVLPFAAAAPAGAGFVFEATGSYDFAFAGAIILLAAGMLAAFCLKPPATPV >NZ_CP022548.1|WP_100093884.1|2043217_2043646_-|hypothetical-protein MTIFKFDKHGFLWEPNLHDRWVSRLILNSEQCIEIDFTSSHKSSAVCLSFVDVDDFQSVIRGPRIFTELFVSGNGTGTENDEISIMESLVPKEFGKQIIDSYKKRENSEKQKTFLVLAMMDGYVSFWFTEMFFDDSSPTTLN >NZ_CP022548.1|WP_100093885.1|2043827_2044442_+|DUF2585-domain-containing-protein MIKWPLRISKTSALIAFILFAGFALILFGMGRPPICSCGTVKLWQGVVQSSENSQHLADWYSMSHIIHGFVFFGLGHLLRKRLPKLFPLGVILALSILVEGAWEVLENSPVIIDRYREATISYGYEGDSIINSMADIAWMIIGFFLASRLPWKATLAIAVIFELFTGYMIRDNLALNILMLTVPNEAVKDWQAAGTGYWLTGKT >NZ_CP022548.1|WP_100093886.1|2044424_2044718_-|hypothetical-protein MSLIRNIFTAAVGYEARKHTRGHGLRNFGLGMLAARLATRSVPGALLVGGALIAKKLYDDKKEAEALPASEAVIDIEGRPVPVDSETTADPALRLPG >NZ_CP022548.1|WP_100093887.1|2044733_2046128_-|class-II-fumarate-hydratase MSGTRTETDSLGEIEVPADAYWGAQTQRSIENFPFPRQERMPIRLIHALAAVKQAAARVNGIHGLEPEIAAAIEKAANAVRTGEYDNQFPLTIWQTGSGTQSNMNVNEVIAGVANEAIYGTRGGKYPVHPNDHVNRGQSSNDSFPTAMHVAAARALEVELFPALEQLHDALAEKAQKWKSIVKIGRTHLQDATPLTLGQEFSGYAAQLVAARDRIESVLPRLLQLAQGGTAVGTGLNAPENFGENIAKEISKITGLKFTSAPNKFAELAAHDTMVELSGALNTLAVSLTKIANDIRLLGSGPRSGLGELDLPANEPGSSIMPGKVNPTQCEMLTMVAAQVIGNHQAVTVGGLQGHLELNVFKPLIGANVLRSIEIISIGMSSFSERCVEGLEANEARIKELVDNSLMLVTALAPVIGYDKAAAIAKYAHKTGQTLRASALELGLVDAETYDTHVRPENMVGNHS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP022548_2 | 2924474-2924587 | Orphan |
NA
Consensus repeat of NZ_CP022548_2
|
1 spacers
spacers of NZ_CP022548_2
>2.1|2924502|58|NZ_CP022548|CRISPRCasFinder GTCGCGACCGGGAGTCGTGGTTCGACAGGTTCACCATGAACGGAATTCTCGCCGCCCT |
CRISPR arrays and Neighbor proteins around NZ_CP022548_2
The CRISPR arrays of NZ_CP022548_2 >merge|NZ_CP022548|2|2924474-2924587|CRISPRCasFinder CCGTTCGTCCTGAGCTTGTCGAAGGACCGTCGCGACCGGGAGTCGTGGTTCGACAGGTTCACCATGAACGGAATTCTCGCCGCCCTCCGTTCGTCCTGAGCCTGTCGAAGGACC >NZ_CP022548|2|2|2924474-2924587|CRISPRCasFinder CCGTTCGTCCTGAGCTTGTCGAAGGACC GTCGCGACCGGGAGTCGTGGTTCGACAGGTTCACCATGAACGGAATTCTCGCCGCCCT CCGTTCGTCCTGAGCCTGTCGAAGGACC
>NZ_CP022548.1|WP_100094576.1|2923597_2924263_+|Crp/Fnr-family-transcriptional-regulator MNAPAPVAGTGFVQGLPERIRLDLLARGQERAFARGQIIQQRGDEAKEFWYIESGSVQVGRYGLDGRLTLFALLGAGETFGELAFMGEFPRTVDAIAGSDCRLIRIGKSEFRSLMDADPAVTGLLLKTMALTVQDAFDLVEFSRSLSVPQRLALALSQLCGDQGNGARLTVTQQELADLVGVSRVSLGKALTRLQGGGLIEPGYGFVTIKDGAGLRDLAGG >NZ_CP022548.1|WP_164089232.1|2922651_2923509_-|sterol-desaturase-family-protein MMRPELLAITGIFLVFGLLEFFRTRLFSKPDQTRDDALVEIIGSFVLLVITQPGIILAAQFILGNVAPQWQGALAGIPLIAAIALFLVFDDMMQYFWHRLSHSVPFLYNLHRSHHNARYMSVRLVYRNNIFYYLTMPSIWFSGALIYLGLGWVYVGYIIVKMAVIVGAHSDVAWDAPLYKIKWLSPVMWLVERTISTPATHHAHHGRHKSDGVTHYKGNFGNLLFFWDVLFGTAHITRRYPESYGVENLAPATLGQQLAWPIFPENKEGGHGVKELAAVEATAKV >NZ_CP022548.1|WP_100094574.1|2919931_2922244_+|DNA-ligase-(NAD(+))-LigA MTTKPISEADAANELMRLARKIAHHNKLYHAEDTPEISDADYDALVRRNNELEEAFPHLVRDDSPNKLVGAAVAASPLSKVKHEQRMMSLDNGFSDEDIAEWLARVRRFLNLDEGAPVAVTAEDKIDGLSCSLRYEKGELVLAATRGDGQVGEDVTANVRMIADIPERLYPREGGDPSPEGAAEARAGDGLLPSQEHSSPPVPDLFEIRGEVYMAKADFAGLNERLMDEGRADAEAKGVDFDPEKIRQFANPRNAAAGSLRQKDASVTAKRPLKFWAHGWGVHSEVPGETALDVMKALESWGVPVSPLVKRFETLDDMLSHYRHIEAERADLPYDIDGVVYKVDRLDWQQRLGFVAKAPRWALAHKFPAEQAQTTLESIDIQVGRTGKLTPVGRLKPVTVGGVVVSNVTLHNRDEVERLGVRPGDRVVIQRAGDVIPQVVENLTRDAKREPYHFPDHCPECGSEAVAEEGEVDVRCTGGLICPAQRFQRLVHFVSRVALDIDGLGEKSIAEFIEAGWLESPADIFRLRDHRAEILAREGWQDKSVDNLLAAVEKNRQPDAARLLFGLGIRHIGTVTAKDLLKNFRTLPKLREVAEAAQESEKYRSSRAQSRDGEGADAGVSTSLDTNGEAASETTGTEAGDAAWADITSIDGVGPTVAQALADFFHEPHNVAVWEDLLNEVSPPDYIVETRDSAVAGKTVVFTGKLETMSRDEAKAQAEALGAKASGSVSAKTDLVVAGPGAGSKAKKAAELGIEVIDEAAWAEIVAAAG >NZ_CP022548.1|WP_100094573.1|2919447_2919942_+|hypothetical-protein MTTTQGPRDKKNIILLLQSRDPDDFSETALCEWMDHDDAEVRDWATFTLGVQTEKDSEAIRQALLLRACDSDFDTVSEALLGLARRRDRRAIPILLDRLCSDNVGELDVQAAGILADKLFVAPLQSLLDWWDEDTGLLERALKQCCGESAEDGSLDDWVTANDD >NZ_CP022548.1|WP_100094572.1|2917783_2919448_+|DNA-repair-protein-RecN MLQSLSIRDIVLIEALDLEFGSGLTVLTGETGAGKSILLDSLGLALGNRADSGLVRQGTEKAQVTASFEPPAAGSRLAATLAENDIDIEPGEPLIIKRSVKADGGSRAFINDQPCSAALLRAVGGQLIEIHGQHDGRGLINPKGHRALLDIFAGVEGVAVREAFDDWQAAKSALEQAKQEQEAAENDRDYLEHSVAELSKFAPQPGEEQELALERATMMKGEKLTGDLEAIRAAFDGSNGGLASLRSAARRLDMIAEDHALLKEALESLDRAIIDATDAEDKLNAAADALSFDPQRLDDMETRLFDLRGLARKHRVEPDKLSDLLQDLQKRLAAISDGGRTLEDLEVSLAGARKSYIAAAESLRAKRSKAAKALDKAVAGELAPLKLDAARFETLLEEMPEDHWNKSGMDRIEFLISTNPGAPLAPLGKIASGGELSRFILALKVALAEEYGLKTIIFDEVDRGVGGAVASAIGDRLSRLSRGAQLLVVTHSPQVASKGDAHMLINKSRSGTVSRTDVSALDNEGRRQEIARMLSGADVTDEARAQADRLLEGV >NZ_CP022548.1|WP_100094571.1|2916940_2917726_+|outer-membrane-protein-assembly-factor-BamD MTKHIFKWAVIGLSATALSGCAMFGGSGGGRGTDTRYVARDVDTLYNAAKSRLDRGEYKIAAALFDEVERQHPYSPWARRAQQMSAFSYYLSGDYSKATESARRFLSIHPGNKDAPYAYYLIALCYYEQIGDVTRDQKITEQALATLGEVARRYPNSRYAADAKLKIDLVNDHLAGKEMEIGRFYQRRGKWLAATIRFRTVIEKYETTTHAPEALMRLTESYLALGVPVEAKKTAAVLGANYPGSKWYERAYELVNKHGAT >NZ_CP022548.1|WP_100094570.1|2914467_2916747_+|polyribonucleotide-nucleotidyltransferase MFDVKKVEMEWGGQTLTLETGRIARQADGAVLATLGETVVLCAVTAAKSVREGQDFFPLTVHYQEKFSAAGRIPGGFFKREGRATEKETLTSRLIDRPCRPLFPEGFYNEINVICQVMSYDGVNEPDILAMVAASAALTISGVPFMGPIGAARVGYKEGEYQLNPSMEEVAEGELDLVVAATHGAVMMVESEAQELSEEVMLGAVQFAHEASKQVAGLIIDLAEQAAKEPWKLDAGDDNSEIKEDLRKLVGKDIAAAYKKTDKSERSDALNAVREKAKEKYADADGQTQMTAGKMVKKLEAEIVRGAIIKDGTRIDGRKLDQVRPIEAMVGLLPRTHGSALFTRGETQAIVTTTLGTKDAEQMIDGLDGLSYTNFMLHYNFPPYSVGEVGRFGFTSRRETGHGKLAFRALRPVLPTVEEFPYTIRVLSDITESNGSSSMATVCGGSLAMMDAGVPLARPVSGIAMGLILEGEEFAVLSDILGDEDHLGDMDFKVAGTEKGITSLQMDIKVAGITQEIMKSALEQASGGRAHILGEMSKALSSVRTEMSEHAPRIETLQINKEKIRDVIGTGGKVIREIVAETGAKVDIDDEGLIKVSSSDKSQIDAAIAWIKGIVEEAEVGKVYDGKVVNLVDFGAFVNFMGGKDGLVHVSEIKNERVEKVSDELSEGQEVKVKVLEIDQRGKVRLSMRVVDQETGEELEDTRPPRESKPRGPRRDGGGGRGRDGGGRGRDGGGRGRNNDGPKNEGGGDIPLPSSITED >NZ_CP022548.1|WP_067200048.1|2914023_2914293_+|30S-ribosomal-protein-S15 MTITAEKKQELIQKFGQTKGDTGSPEVQVAVLTERIVNLTEHFKGHHKDNHSRRGLLMMVNKRRSLLDYLRKKDVERYAKLIKDLGLRK >NZ_CP022548.1|WP_100094569.1|2912991_2913999_+|tRNA-pseudouridine(55)-synthase-TruB MGEAGVRPHGWIILDKPLELGSTQAVGAVKRNLREAGLLGKGKNKLKVGHGGTLDPLATGVLPIALGEATKLCGRMLDASKIYEFTIGFGTETETLDVEGAVTETSGHRPTLAEVEAILPQFSGAIEQIPPKYSALKIDGQRAYDLARAGVDVEMKLRGVTIYALGIRGVDSSIRHPELVSGSPEVLKQVQDDEPLTEITLTANVSKGTYIRSLARDIARALGTFGHVTMLRRTKAGPFTLKQAISLDKLNEIGKGADIKEYLLPLEAGLDDIPAFDLDPDQARMLRQGLTLDEQDLIGNPAVNGLFLATENEGSPVALAEIVDGTVKVVRGFNM >NZ_CP022548.1|WP_100095658.1|2912299_2913010_+|class-I-SAM-dependent-methyltransferase MTGDINGWARSAEAWIASIGEAGDWTRRTLLDQVMLERVARHSGTALDIGCGEGRFVRMLKEQGWSAVGIDPVERLIAEAGKHDPDGDYKIGFGEDLAFEDESFDLVVSYLSLIDIEDFRVAIREMTRVTKPGGSLLIANLTGYFTAGKWVRGLDGQHKRFVIDNYHQERASREQWLGIDILNWHRPLSAYLQEFLANGLILRHFDEPTPPDQNDPKSDRYSRVPGFVVMEWEKPA >NZ_CP022548.1|WP_100094578.1|2924618_2925320_-|TIGR02117-family-protein MATTALKWLKRMVVASGAILCAYLLAALAGSLLPANQHWQSPEDGIELFIETNGLHSGIVMPIWSDVHDWTPLVRPEHLADPSYYGSHILVGWGHEGVYRNTRQWRDLRANDAISAIFGSDDVLVHVYHLKYPQAYPYYRRPLKVSEAEYRKIASAIEARFALDDQYRSQPSPGYGQDDLFYQANGHYSAFYTCNNWTSDILQQAGIRTGRWTPFQGGVMRWVPGNSRVNRSH >NZ_CP022548.1|WP_100094579.1|2925349_2926246_+|phenylalanine-4-monooxygenase MLDTPETETHVYDKPPAHANADWTIDQDWQRYSAEDHETWDILFARQQKMLPGRAAQAFLDGIDILKLSRPGIPDFEELNRILMDATGWQVVAVPGLVPDDVFFDHLANRRFVSGNFIRRRDQLDYLQEPDIFHDVFGHVPMLADKRFGDYMAAYGRGGLRALKFGTLKQLARLYWYTVEFGLIQEAEGLRIYGAGIVSSYGESVFALDDPSPNRILFDLERVLRTEYRIDDYQQNYFVIPSLDELLRVTVETDFKPLYDKIAGLPDIRIAEIVEGDVVLTEGTQDYARSQAGEATVV >NZ_CP022548.1|WP_100094580.1|2926330_2927326_-|50S-ribosomal-protein-L11-methyltransferase MDSPLSGTWKVSIPCTRAEARTLAADNLFISSSDSTPTIVASEVDAKRPDEWMIDVYCEEMPDPEMVENLLQLSPSAEAAGFQPIVTQLGDEDWVTLSQQGLEPLRAGRFHVHTSSDQPSGNPKLTNICIDAGQAFGTGHHETTMGCLQSLDRLKKTGHRFRNIADVGTGTGLLAFAAHHLWRDAKIIASDIDPVAVTVTGENARLNDIPLGNGRSQIRLVQSNGLDHPALRQRAPFDLIIANILAGPLVALAPQIAAASGAGTVLLLAGLLSRQKAEVVRAYIREGFRLKHSRLENEWPCLTLVKAKKYGWTRKPHRKKSPLASDYFGEC >NZ_CP022548.1|WP_100094581.1|2927341_2929222_-|ABC-F-family-ATP-binding-cassette-domain-containing-protein MLNFNNITVRLGGTVILDGATAALPPGARVGMIGRNGAGKSTLMKVIAGMLEPDDGSVDMPKDARMGYIAQEAPAGQDSPLDTVLAADTERAALLEESETATDPHRIGEIHERMNMIDAHSAPARAAKILIGLGFDEAAQQQPMDSFSGGWRMRVALAALLFSQPELLLLDEPSNHLDLEATMWLENFLTSYQATILVISHERDLLNNVVDHILHLQHGTTTLYPGGYDAFEKQRAERQAQQASAREKQIAQREKLESFVSRWGAKAHSAKQAQSRVKALARMEPIAAAIDDPSLSFHFPSPAELKPPLITLDDAAVGYGDNIILSGLNMRLDPDDRTALLGRNGNGKTTLARLLASQLEPKKGAINKSGKLTVGYFTQYQVEELDTSDTPLEHMTRLMKDAKPGAVRAQLGRFGFSGDKATIKVGKLSGGERARLALAMITRDAPHLLILDEPTNHLDVDAREALVHALNEYSGAVILVSHDRHMLELTADRLVMVDSGKAEEFAGTLKDYTDFVLGKNQPGSSGGKSGSGNRKEERRAAAERRKQQGELRKTVSAAEKEMKVLAEEISAIDRAMFDPASAAAKHADLSMTELMKLRAEHQDRLDKAEARWMEASDRLEQENVNA >NZ_CP022548.1|WP_100094582.1|2929365_2930733_+|MgtC/SapB-family-protein MPGRLSLRLDDDDKIRISPRDDALAMLTEFSGEWLIYRDLAIALAAGLLIGVERGWTMRDENKGARVAGIRTFGLIAGLGGLVALISQSLNLVIAAILLTGVVAMLSISHFKTIHEPDGRSITNFVVSLLTLCLGLVAGSGYPALAMAGAAIVTGLLSMRSELHGLMRRLGPTDINVMIRFALIALAVLPFLPDRDFGPYEAFNLQQLWFVVILVTGLSFAGYVANRIFGAAHGTVVTAVIGGAYSSTAVTASLAQRLRGEETAGRTLSAGIALASSVMYIRVLLLTFLLARFAFVPLAMILAPAALVAAIIGLLLLRRSGAARSDESVETRNPIRLLPAFFFALTVAAMSLAARWAEAKYGSSGIAYLVLIVGSFDVDAAIITLGALPEETIMSDLAGFVLAMTVLANMLFKTGVAGFTAGWKKGRAAVAALAASSLVLAIAGLVSFVAFDIRF >NZ_CP022548.1|WP_100094583.1|2930740_2931421_-|spermidine-synthase MIAREHLDTAPIPGGDELRLFRRALKGADEYSIMLGRIELMNSRLSGSEEALATLACERLAVKNPRILIGGYGMGFTLRAALSVLPPQAHVEVAELIPAIVQWARGPMADLSQKCLEDSRVTVTIGDVANVIGKAERDYDAILLDVDNGPDGLTHDANDRLYSMAGLATAKKALRSNGILAVWSSEGDNRFTARLKKSGFAVEVKTVSARSNGKGAKHTIWLAANR >NZ_CP022548.1|WP_100094584.1|2931421_2932201_-|hypothetical-protein MTEETTKKKPNWLVWLFAAFGLAGVAAVIFIAIMIANIGGGGREERAPLAVADGETKYSMGQVIDLRGNDLSALTIISGDRNAGCGSFSSKSYRENITHNLIFFDRNTRKSRKLLNDNRGKVIAAVFLPDQETGFPLAIGDGMNDATNVAEAAAQAAMEAAGIVGAEERDQFKRRMPLKNYMAIVALQDGETMKNSLLLGRLSDGKQVMTLDGVETVRRFWILSPTEVGMIVQRNGEIFHHIANFESLSITQSTKIEVN >NZ_CP022548.1|WP_100094585.1|2932213_2932927_-|SDR-family-oxidoreductase MSNKKTHILVTGASRGIGEAIVDSLGMDNVTVIGQSTAGGDDLIAADFTQDGSAPLLWQQALDRLDGRIDVLVNNAGLFQPNPLAADDWLESWNRTMTINLTASAELCRFAVLHFQQRGEPGRIVNIASRAAYRGDSPDHWHYAASKGGMVAMTKTIARAYAKDGILAFAICPGFTMTGMAEDYLESRGGDKLLADIPLGRVALPEEIASIARYCALDAPPSMTGAVLDANGASYVR >NZ_CP022548.1|WP_100094586.1|2933128_2933542_+|succinate-dehydrogenase,-cytochrome-b556-subunit MSGVSGSASNRPLSPHLQVYKWGPHMLVSILHRATGDAMALVGVPLLVWWLYAIAAGPESYDYFLSWFNWGYIGYIVLVGMSFAFFEHMYSGLRHFVLDAGAGYELQTNKMWSVAVPLAAIITTGAMWLYIASKNIL >NZ_CP022548.1|WP_100094587.1|2933557_2933947_+|succinate-dehydrogenase,-hydrophobic-membrane-anchor-protein MGSGTNIGRVRGLGSAQEGAHHWIVQRVTAVSNLLLVLWLVFSLVRLPQYGHELMVEWMSSPLVAVPMMLMLVSIFWHLRLGLQVLIEDYVPDHGVKFGLIVLLNFFAIGGAALGIFSVAKIAFTGVVA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
8013 : 64224
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP022548|8013:64224|DBSCAN-SWA ATCAGAGTCCTTCGAGGAACGCGAGGAGGCCATCATCGGGCCGGTATCGGCCTGTCGGCACCTGCGGAGCAGCGACCCGAGCGAGTGCCTTTTCCTTCATTCGCATATCGGCGTGGATGTATATCTGGGTGGTTGCGACCGACTCATGTCCGAGCCAGAGAGCGATCACCGCTTGATCGACGCCATGATGCAGGAGGTCCATCGCGGTGCTGTGGCGCAGCGTATGCGGTGTGACACGCTTGGTCCCGATGCTCGGGCACGCGCGCGATGCCGTGAGGCAGTGCTTGCGGACCAGATGTTCAAGTGCATCACGGCTCAGCCGCTCACCCCGGATCGACGGGAACAGCGGGCTGTTATCGTCACAGCGCTCGCTCATCCATGCCGCCAGCAGCTTGGCGGTGTCGCGGCGCAGCGGTGTGCATCGCTCCTTCCGGCCCTTGCCCATGCAACGGATATACGCGCCCGTACCAAGCGTCAGATCGCCGCACTTGAGGTTCACCAGCTCGGACGCCCTGAGACCCGTCTGAACCGCGAGCAGGAGCAGCGCATGATCACGCCGTCCGGCCCACGTCGCACGATCAGGAGCCGCCAGCAATGCCGCGATCTCCGGTGTATCGAGGAACGTCACCGTGCGTTTCACATAGCGTTTGCTGGGCATCGCCAGCACGCGCTGACAGTGGAGCAGCCAGGTCGGATCGGTCATCGCGACATAGCGGAAGAACGAGCGGATCGCGGCAAGCCGGATGTTGCGGCTGCGGGCGCTGTTGCCGCGCACGGTCTCGCAATGAACGAGGAAGTCGGCAACCAGATCGGCGTCGATATCCTCGACCGCGAGCTTGACCGGCGGCTTACCGCAGCGCGCACCCGCGAACCGCAACAGCAGGCGGAACGTGTCGCGATAGCTTGCAACCGTATGAGGGCTGGCTTCCATCTGGGTGCACAGCCGGTCAGTGAAGAAGCGCTGGATCAGCGCAGGCAAGGTTGCAGCGCTCATTGGTCCGTCTCCCCGCAAGAGGTTGCCCGCGCCATCGCCAGATCGAGCAGTTCGGGCACCGCTTCGAGATACCAGAACGTGTTGTCGGGGTCGGTATGGCCCAGATAAGTGGTCAGCCGGATCATCTCGCGGGCCGGATCCTTGCCGGTACGGTACCAGTTTATCATTGTGCGCACCGCAAAGCTGTGGCGCAGATCATGGATACGCGGTCCCCGGCCATGCCGACCATATCGCTGATGCGCTCGCAAGCCGATATGCTGGCATACCTGCGCAAAGTTGTAGCGCGTGCCGCAATCGGTCGGCCTTGCTCCCTTGCAGTTAACGAAGAAGGATGTCGGGACACTTCCCAGCAACCGGTCGCGCTCGGCAGCATAGGCGACGAGCCTTGTCACCACCGTGGGATCAAGGGGAAGCAGCCGCTCCTTACCGAGTTTTCCGCGCCGAACGCGCAGCACACCGTTCGCAAGATCATCATGGTCGAGCGCAAGCGCTTCGCTGATCCTGAGACCCGTCACCGCAATAAGCCCGAACAGGGTCGAGAAGGTCAGTCCGCGCATACCGTAGATCGAGGGCAGCGCCCCCGCAGCCGCGATGATCGATGTGATCTCGGCCTCGCTGTATATATACGGGCGCGAGCGCCCGGTATGGCCCGGCAGCAGCCCACGCGGCGGGACTTCATGGGCGGGATCAATGTTGCTCAGCCACTGCGCGAACAGCCGCACCTTGCCAAGCCGGGCCGAACGTGTCGATGTGCTGACGTCAGGCAGGCTTGCATCCCAGCGCAGGAACAGCGCCGTATCAATGCGCGCAGCGCCTTCCTGATCGGTAAACCGGGCGAACCGGCGCAGTACCCGTTCATCGGTGCGAAGGTCATAACCGAGGCTGCGCCTAACGCCAAGATAGCGGTCCAGTTGGGATGTGACATTCATTGTCCGCCTCCTGCAACCGGCCATGGCTGCGCGATGGACCTGAGGCCCTCAATATCAAGCCGAGCATATATCATGGTCGATGATCGCGAGCGGTGTCGCAGCATATCGCCCACCTCGTCGAGCGATGCACCAGCATTGATGAGCCGGGTCGCAAGGCTATGACGCAGGAGGTGCGATCCCACATAAGGTGTGACCGGCTTCTGCCCCGTGGCCGCCAGTGCATCCTTGAGGATAGCGTTGGCGATCTGGCCGTCCTTGAACCCGCGATGCGGCGCACGGTGCGTGACGAACATCGTTCGGCAGGTCGTCGGCCCACGCTCCTCACGAAGATACCGGCTCAACGCTTCGCCAACCTCGGCTGTTATTGGCAGCCTGTCATGCAGCCCGCCCTTGCCGCGCACGAGCAGTTCGCCCGCTCGCCAGTCTATATCATCAAGCTGGACCTTGATAATCTCGGAAGCGCGCAGCCCCAACCGCGCCATGAGCAGCAACATTGCATAATCGCGTGCGCCATGACGGGGATTGCTGTGCACCGAAGCGAGAATGGCCTCGACTGCACCGGGCGACAGGTGCCGGGGCAGCCGCGCATTCCAGCGCTGGGCCGTCTTTGGTATACTCAAGGCCAGATTTGCCGAGGTCGCCCCGCACGCGAACAGATATTGGAAGAAGGTCCGCAAATGGGTCGTCACTGTCTTGTCGCGATATGGCGTCCTGCGCGCCAATATGTGCTGCACAAAGTTGGTAGTGTCGGCGGCGCGAAGGCGCGTCAGGTCGATCATTCGGCTCCCGAAGCGGTGATCGAGGAACCGGTTGGCGAAGCGAAGCGTGTGGTAGATGGTGCGCGGACTGAGGCCTCGCTGCCTTACAAGATAGTCCTCAAAGTCCATCAGCAACGCTGCTCGTGTGCGCTGTATCTCGTTAATAGATTCAGTCATAATCATCATACTCCATATTGTTGGAGCAGCAAAGATAATGCCGAACGCCCTTGGCGGTATCATACTGAAAGCCGCCTTTCTTCGATGCGATGCGGCATAATCCCGGTTACGGCATTATGACCAAGTTCATGCAGGGCCGTGCGGTAATAGTCGATCTGGTGGGTGAAGGCCGGTTGCGGAGGAACCTGCACAAAATCCGCGCCAACATTATAAAAGGCTTTGGTCCCGCCCGTCCGAAAGTCCGCTGCTGTCGCGGCAATCAAAGCCTCGGCCTGACCGTGCAATTCGCGCTCCGGTAAGGGCGCGGGTTCGGCGGTCAGCCGCTCGGGCAAGCCATCGCACTGCGCGGCGTTGAACACGGTAAAGCGTTTGAGAAATGGAATCGAGCGAGGCGCACCTCCATCGCCAGCGCCGCCCTCGCCTTGCTGTTTTCTGTCCTCGTCGGTCGTAAAGCGGTTAGCGTAAAAAACCGTCCGGCCCTTCTCGCCCTTGCGAACGCATCCGCCAGCGGCCAAGGCTTGCCGGAATGTCAGCCAGTCCTGCGAAGGATAGTCGCCATCAATGACCGCGCCCCAGAGAATCAGAATATTGATGCCCGAATATTGTCGCCCTGAAATCGCGTTGCGCGGCAAGCCGGTCACGGCGTTGCCGCTGTTCCAAGGCTTCACCCAAGGAAAAATCCCTTCCTCCAGCTGCGCGATAATCTGCGCGGTCACTTCGTCATAAAGCGACGCCTTGGGCGCAGCACTGCCGCTACGGCGGGTCTTGCGCGCTGCGCGCGAAGCGCGCTTTGCTTGTCTTGTCCTGCCCGTGCCGCTATCCGAAACCCTAGAATTTTCATGCACCATAACCGCCTCCGTCACCCCAAAAATGCCGCAACCGGCCCCGGAGAGACCCCATGGAGATCAGACCGCGCGCTCGGACCAAAATCCTCGCACGCGCCTGACCCAGGGAGGGGGGTGGGCGGCGAAGAACGACCGGAAAGGCCCGCTGATTGAGGCAGGCTGCACCCGAAGGGCCGGAACGAAGAGGAGGTAGTGAGCGAAGCGAGCATTGCGGCATGCCGCAGCGGGCCTAGAAGGGAGTGCCGCCCAACCCGCGACCTGGATCAGGCCTAAAACAACGTCATCGCGCAGCGATGGCAACACCGCGCCTTCGAATCCGCTTCGTGGTTCAACAAGTCGCGGCGCCACAAAACCACGAAAGGAAGCGAAGGATGAAAAATGAACATGTTGTAGCAGACGCAATTGACGTTGAATGCGCACCGGCGGTGCAAACCACGCTTGGTGCAATTTTTGTGTCGATGGAGCTCAGCCGTTCAACCTGGCTGATCACATCGCTGCTGCCGGGCGACGGCGAGCGGATGTCGAAATGCACCTTGCGTGCTGGCGACATTGGCGGCCTGATGGAGTGATTGTCCAACCTCAGGCAAAAAGCGCTGGCGCGAACCGGTCAGACCTTTCCATTCGTCGTGATCCAAGAAGCCGGTCTCGATGGCTTCTGGATCCACCGGGTGCTCGAACAGGAAGGCATCGAAAGCCATGTTGTCGATGCCGCATCGATCGCAGCATCCCGTCGGCGTCGTCGGGTTAAGACGGACCGGATCGACGGCGAGATATTGCTGCGCAGCTTGCTCGCGTTCAAACGTGGCGATCCGCGCGTTTGCGCTATGGTGCGTCCGCCATCGCCCGAAGACGAGGATCGCCGCCGCAATAGCCGCGAGCGCAAAACGCTAATTGCCGAGCGTGTCAAACTCGTCAACCGGATCAAGGGCTTGCTGTTTGCCCAAGGAATCACCGGCTTCGAGCCCCTCAAACGGGATCGTCGTGTACGCTTTGACGAGCTCTTTACAGGCGATGGGCGCGAATTGCCACCGCATGCCAAGGCAGAGATCAGCCGGGCGCTCGACCGTATTGAGCTGATACTTGAGCAGATGAAAGCTATTGAGGCGGCGCGCGATGCGCTCAGTACTGAAGCAGCCCATTCGCCAGAAGGCGTATCGGGTACCACCATGCTGCGCCGGTTGAAAGGGATCGGCCCCGACTTTGCCGAAGTGCTCTGGGCTGAAGGCCTGTATCGGCATTTTGATAATCGACGACAGCTCGCATCCTATGCGGGCCTCACGCCAACCCCTTGGCAAAGTGGGTCGGTCAGCAATGAGCAGGGCGTGTCCAAGTCAGGTAATCCGCGCCTGCGCACGATCATGGTGCAGCTATCGTGGTTCTGGCTATTGCATCAGCCGGACTCGGCATTGTCGCGCTGGTTCCACGAACGGGTCAAACTCGACGGCGGGCGTCGCCGCAAACCGGCGATTATCGCCTTGGCCCGAAAACTTCTGATCGCGCTGTGGAAATATGTGCGCGAGGGCCTTGTGATCGAAGGAGCCATAATGAGCCAAGCATAGCCAATTGCAAATTTACCCCATTAGCCGGGGATGATCAAACCCGGCTGATCCAGGGTAGGCGAACCTCCACCCCGCATGGCTTAAAATGCCGCCGGAGAGAATGGTCCCGTCTTCCCTGAGCCTGACCCGACGAATTCGGGGGCTTGGTGCAGCCATTGCGAATGGCGACCGCATGTGAGGTTGGACTGGCAGGAACTGCCATGCCGTTATGAGCAGGCTCGGACCTTGGATATATTTTGAAAGGACACGGCAATGACAAGCAGGCGTTGACATTTAGTTCTCCATGTGAGGCGGGGTGGGCGGCAAGAGCGACCAGAAAGGCCCGCATCAAGCGGTCAGGCGCACCCGAAGGGATGAAACAAAGAGAAATACCCGGTCCTGAAAGGCCGGGTTGCGCCTGAGCGCGGCGGGCCTAGCCGGGAGCGCCGCCCACCCCGACTCGTCGGAGCCGGAAAACAACGCCGGCGCGTCCGCGCCGTCCGCAAAACCTGCTCGTGCCACAAATCGCTGGAGCGGCCAGCGGCATTGCTAAAAACCAAAAAACCGCACCCTGGAAGGAGGTGCAAATTGCCAATCATCGCTTGCGCAAACGGTCGGTCGGGGCGTGTCGTCCGGCCGGATTGCGCTTTAAACTATGACCACCGGTCCAGAGTGCGTGCGGGGCAAAGAAAAAAGCCGCGCCCTTAGGAGCAGGCCGCACGGCCTGCGGAACAGAGCGCGTCATCAGCCCCGCTCATTTGCCCGGGGCGTTGCGGGGTGTCCCCGCAGAAGGGGTAGCCGGAGACAGCGAAGCGGCTCCGGGTCAGGGGAAGGCTTTCCCCTCAAATCATGATATTATAGTTTTCTCTCATTGCAAAGCTATCAATTTCGGCACGGTGCATGTTATCTAACAACATGGTTGACCTCCTCCCCGCCATTGACATTGCCGTTCCAATCGAGCTCTCTATTTTTTTCGAACGCATAGCGGAAATTGGAAATTCATCGGGGAATTTTAAAGTCGAGCATTTTAAAGGTAAAGAAGCCGGACCGGATTTGGAGGTTGTGAACTTTCGGTCTGAAAAGGACAGCCAACATTCAGGGCTCGGTTTCCAATTGATTGCACGAAAGGATATTCCCCAGCGCGTGTTGCTCGAAGTGCGGGCGAGAAGGTGGAATCCGAATCCACCAACCCGCGCGGCATATATCGATGCTGCAAGATTGCTGGCAGGTCCATTACTAAAAACATACAATCAGACCAACTCCGCACGGATCCGCCTGAGGATAGAACAGGCCGGACAGGGTCGATTTTCGCCGTCAAAGCGCACTGCCGCACTGCTGGAACACTTTACTGCGTTGGCGAACACATCCTCCCTGCATCCGCTCGACTGGAAGCGTTTTTATACGCTCATAAAAGAAAGTCGTCAGAAAATTCCCGAAAATGAACTCAGATCAATATTGGTTCGCAGAGGCTTTCCAAGCGAAACTGCCGAATACCTTGCCAACATCTATAAACATTTGTGGAAATTTAGGCGGTTCAAATAGACTGCTTTGAGATGTGATGCGCGCATAGTTGCATTATCGCGAGCGTTTGTTCACGATCAAGTCATGACAGAAATCACCATCTCCATTCCCGACTTCGGTAGCATGACCGATGATGAAGTGAGTAGCCACCCACTCGCATGGGGACGAGACGGCAAGTGGAGTCGTCAAACGCGATGGCTATTGTCGCATTTCGGGACCCATCGCCTTGAGTTAAATGAAGCATGGGCCATGACATCACCGGTTTGGCAATGCCCCTGCTGCCAGCGATACAAACCTGAAATTGCAAAGCTCACAGATCAAGGCGTGCTGCATTGCCAACTTGACCATCATCATGATCATCTTGGCGATCTAGCTGGGGAAATCCTGCGAGCGGCAACCTGTCATGACATTCCGGATGAACTCGTACGGGTACGCAAGCGTGCGTGTAACGCAGCTCTATTGCTCATAGAGCGCTTCGCAGAAACATTGATCTGCAACGATTGCAATGCGGCTGATGCGGCGATGAAGGCGGCGCTGGGTCACCGGGTAGATCGCCATTTCAGCTTCAGTCCATTGGAAATAGCAAATTTCATTTTGCCAAAGCCCAACAGTTCTCACGAGCCCGATAACGCCCGGGGGCTTGAAATTTGGACGCACGCTCAGGCGCAGTTTTCCGAGCGACTGAAATTTGCAGAACAACTGGCAAAAAGGATCACCGCAGGACTCCATGACCGTGAACGCTTAAACTACACTGGCCTCGATTCCGGCTATGACGACGCAAAACTCCTCTTTAAACTTGCGGCTGATCAAGGAGGGCCGCGGAACCACCCAAGGGGAATTGGTGATGCCCTTCTAGCCCGATCGCGGTCTACAGCGGGGCGCTTCTCAAATCCCAAGAAATTGTCTGCAGCCCCTTTTCGTATTCCAACTGCAGAAGAGTTCCAATTGTTAGATCAGACGAAAGCGAACTCGTCTCCATGGCGCAATTCTGGGGCGAACTGGAGCTGTGCAGCCTGCTCGCGAAGCAAGTTCGAAATCACACGCTTGTCCAAAAAAGGCAAATGGATGGCACTCATCATGGAACTAAACGATTTTGAAGAAGAGACCAATCCCATTTCACTACAACGCCGATCTTTTCATTACGATCTGCCACTAATCTTTGGGCAATGTAAGCGGATTACGGTTTGCCAGGATTGTCGTCAGATAGTGACCGACGCCAAAACTTTGGTGCCAAGCGTTATCGAAGATTGCATGCCCATAGATGCTATCAGGAAGCTGGCGCAGGACCCCAAGCCGCACCAAGGGCATGTCATCGATAGGCCCGAAATCATTCGCGTCGTGGAAGCCAACACCCAGTGGGCCAAGGCTGCAGAGGATTTTTGGATTCACCGCGATCATGCAAACGACATCGCATTTCATCAATTGCGGTTGGTTCGTAACACCGGATTATCCGACAGTGCTGCAAGGCGGCAATTGATCCCCGAATTGGTTGCGGCAAAGAAATTACCGGGATTTGAATCGGACGAATGGTTTGATTGGCTGATTGCTGAGAGTAAGCGCTCGTTTTAGCTGGAGGCAGCGCCGCAGCTGTGGTGAAGCCTTTCTTCCTATTAGGCCGAACAGATTTATCAAGAAATCAAAATGCTGCGTTCCAATGTGCACCATCACTAGTAAGACTTCGCTTTCATCCATTGATCACGGCGATCGGTTATTTTTACACCGGAACCGCAAAGAAGCTTGCATCGCGTGATTCGTCGGCGCTATGCAACGTCGATGAGTATGCCAAAATTGAAAGATGTTGATTTGCGTATTGCTGCACATGAGCGTCTTTTTGCGCGTGCCAATCAATGTCCAGACACACTCGTTATTGATGAACTCGGCCTCTCTCACGGTGCTTGCAGGGTCGATATCGCTGTAATAAATGGACATATTCGAGCCTATGAAATCAAGGCTGAAGCGGATAACTTGCTTCGCCTCCCTCGCCAGGTAGAAGCTTATAGCGAAGTCGTTGATGCTGCCTCATTGATTGTGACAGAACACCATCTTGATGCCGCCGTTAAACTTGTTCCAGAATGGTGGGGAGTTATTCTTGCGGAACGCCGCAAAACTGGCGATGTTGCTTTTCGCCGCATGCGTAAAGAATATATAAATCGCGGAGCATTGCCGCTTACATTGGTGAAGCTGTTGTGGCGAACCGAAGTAGCTGATTTACTTCGGCAATATGAAATACCAGAGAAAGAACTGCGTGCGCCCCGCGCGATCTTATATGAACATCTCGTCTCAATCTTGCCGCGACGAACTTTAGGCCGAACTGTCCGAGAAACGCTCAAATCCCGTAAAACGTGGCGAGATCGTGCACGACCTTTGTAATATGGTGGTTAGTTCCGACCCACCGCCAGGTTGTCAGGTTCCCAGTATTTTCAGTTCCAACCTGACATCCGGCAACATAGTTGCTTCCGGGCGAAAAAGCAGCGCCGAGATAATAGGCCGAGCCTGTAATGTGGCCACTGCAAGTGCGATATTGGCCAAATCCATTGTCACGAACATTGTTGCCTTTAGCGATTAGCCATCCATCATCAACCGTATAGCGTATGGTAGCGGAAGGTTTTAGCAAGCGCATGTCGCCTTGCGCAATTGTCGGTGCTGAAATGGCATAATCGCCAAATCCAGGTCGCCTAATGTTTGGGCCAAGTGCACCCAAAAGTGCTTTGTAAAGCAACCACTCGCGTCGTGGCCACAATTGTAATGGACCGGTCACCGATCCCATTGAATCCGGAAATGACGCTCCCACTATCGTCAGACTACGCGCCGAGTTAAAAATAGCGGCACCGGATAAGGCTGCAGAGATCAAAGCTATCAAGTCATCCTGAGGATCAAAATTTGGAGATTCCAAATCTAGCACAATATCTAGCTCGTTGGGTCCAATTCCAATAGTTTCACAAAGCAGATATACATTGGTGTCAAAGTCAGGATCAATCGCATCTTCGAGCGAACAGCGAAGCGCCGCCCCATGGCCAGACCATGTATGGATTTCATGAACAGCGTGTTGGTAAGCATCATCACGGTCCAATCCAGTCACAGGAATGAGCGAAGCGTTGAGCGTTGTACATTCATCAAACAGATGGGCCAACGGATGTCGACCATCTGCCATTCGGGTTGCTGGCGGAAGTAATCCGCAATCTAGCAATGCGTGCCGAGAACCCCATTTTTTTTGAATTCGTGTAGCAAAAGAAACAACGTGTTCGTCGATGGTTTTTGACGGAGTTTGCGTTTCGAAATCAAAGTCCGGCGGCGTGACCTCAATTAGAGGGACAATCCTATCTTTTTGCGCGTCCTCCAAGCGTAGGAGAGCCTGATATTCTCCCTGCCGCCACTTCAAAACTGGCATATAACAATGTTGATTTATCGGCATAATTCTATCCTTACAATCCAAATTGACTACGAACGGCTCCTGCCCCGACCAGAGTTCGTAACAACGTATAGTCACCGGCTTCCCGCCGAACACGGCCAGCGATAATCGCAGAGCCCACGCCGAACCGTTTTGCATCTGCAAATACAGCCTTCTCTGTGCGAGTGAATCTCGACACAGCTATTTTCCACTTATCACCGGGCAACAACGCTTCCTGCGCGAACAGATCCGCTTCATCCTCAACATCGTTATTTGCCGGAGCTTCTGTATCGTCGAAGATTGCCGCGAACTTTCCGGTCCCGATATGCAAAACCAAATGGCCAATTTCATGAAATAGCGTGAACCAAAAGTTATCGAGCCGGTCATGCCGCAAAGTTAAAGCAATAATCACATAATCATCCCACGAACTTAGTGCCGCACCATCAAGCAAGGTTCCGGGCAAATGCCGCTCGATCACTAAGGAAATTCCGATTTGCCGGAGATAATCATGCACCCTGCTCGGGCCATTTTCATCTAGCGAAAGCGAAACAAGGCCAGCAAGCCAAGTTTCATCAACTCTATTCAAATCAAATACAACCGACGGTGGATTACGTTCGGCGAGCATCCGAACACGCGCCTCCCAAGCGGCGATTGCCGCCTCGTGGACCTGGCCAGATGATCGCACTGATTTGCGGTGAAATGCCGCTGCAGAATTATTGGCTTGGACACCTTGAAGAAATGCAGGGACAAGGTCTGCAGCAGACTTGCGGGCGTGAGCTAACGTACCCGAGAAATCTTCAAACCAGCCACGTTTATACATTTCTGCGATCGGAAAAGATTTCCAGGATTCTGGATCAACACTTTCCATACGCCCATCGCCGATGAGAGCTGCTCGTTCTCTGATTTGCACACCCAAAGCGTCCGATATTTCAACAAGACGATCAAGGCTTGCAGACCGGTAACGCTCAGCTTCGTAGCGCTGAATTTGCTGTTCTTTGAAACCCAGAAAGTCCGCCAAATCCTTTTGGCTCATTCCTCGCGCGATCCGTGCGTGGATGAGAATGTCCGGAAGTTCTCCAAGACTTTCAGCTTCAAATTCTCCGATTTGACCGGAACGGAGGGCATCATAGAAGTTAACATCCTCTTCCATTTCATCAATTTGACTTTCAAGCGCTTCGCGCTGTGCGGCAATCAACATTGGATGCAATTCGGCGTTTTCGCCGCTGCCTATCGCTTCTAAGCCCAATTTCATTTTATCGATGAGTATCTTGCTCGACCGATATTGCTTTTCATTGGTAATCATGGCGTCATTCCTGATTGTGGAGCGAGCGAAACCCGCAATATACCTTTGGTGAGGCCGGAATATTTCTCGACTTGGAAAAAGTCCAAGAAAGTTGTCCCGAGGCCACTATCAGCACCGGCGGGAAACATTTCACCGAGGTATTTTGCTTTTTGGGCCGCCCGTTTGTTTACAAAGGTCAACAAAACCGGATCAAGTTGTCCGAAATCTACACCCTTGTGGTCCCAGCAGCCGTCAAAGTCGTCTGGGTGAGCCTTATCCGTTGTAAAGCTGCCATCAATATAAACCGTAGCACAGCCAGCTAAAGCAAGAGCATCCGCAGCACGTTGAAATCCATCAAATAGCCAACGTCTGTGTGGCGTTGTCGCGAATCGGCTCTCTACTTCTTGCATTGTTACATCATGAATACCCGGCGGTAACACTGGATATGGAGATGGTGAGCCAAGGTCGATCAAGTCCGGTATCATGTACACAACAATATTGTTGTGTTAGATAAATGTCAAGTATGAATTGATATCGTAATATTAGCTGATGGGAGGTATTGCTTTAGACACGAGGGGGCATATTGCTCGCGTCAAGTCGACGGTGGATCAACGACACGATCGTCAGTAAATCCTCAGCGTCCGCTTTGCTCATTTCCCAACGGATACGCGCCTCATGCGCGGTCGGATTACGGAACATTCCGAACGCACCTTTGACGAGATTAGCGAAACCGCTTTGCTCACTCTTCTCACTCACTGTACGACGAGCGTTTATCGCAAGCAGAGGTGGATTGCCGCCAAACGCGCGATCAATGAGCGTGCCGCCGTCATCGATCAGGCCCGTCTTGCTACGCACTTTGTCAGCGACGCTTTTCACCGCCTCCTGAACCGCGTGAAAATAGTTGTCCACGAGTAGCTCCGCCCGGCAGAAGGCGAGCACGTCGGGATGAACACCGCGGCCCTCTAGATCCGCTCTCAGATTTTGCGCTCGCCGCTGGGCCTCGGGTAATGTGGTTGCGGAGCTAACGGTTCTCAGGACACCTGTTTCAGCAACGAGAACCCCCGCAAATGCGAGCGCCTGATTTAAAGCTGCCCGCATCGGTTCATATCTGGCAGGCTCTCGGCTGTAACGCGCCGGGCTCATCGCCAACCGAATAAACTCCAGAACATGGGTCCGGTTACGCTTACTATTCTGGCTCTCAGCAAAAGCATTGTAGATACGATGCCGCTTGGTCATCTTTCCAGGATCGATCATCTTTGCCGAGCGGATGAGCAGTTCGATCTCCGGGTTGGTGAGCCCTTCAACAGTGTCGCCCAGCGCACTGGCAATCGCTTCCAGCTCAGCTTGGGTGAAAAGTTTCATTCGGATTTTAAACCTTCTACCTCGGTAAGACCGTCAAAGCTGGCGTCATACTGTCGTGATCGGCGCGGAAGGGATACGATGAGCTCGTCAGTATTTAGCAATCGCTTCTTCAACAGAAAACTCCCATGCTGCTCGTGTATCTCGAAAAACTGTACGGCGCGATCCCGATCTTCGAGCGCCTTCACAACGGCAAACTTCACCGAGGGCGTGCAGCGCGTGCCGAATATTACCTCTTCAAGTTCTAATGGTGAACTGTGCTCACCGCGCATCCCTATCAACCTCCATTCCCGCTCATAGCCCCAAGGTTTTGCCTTCTTGAGTAGTACGGATTCGTCGACCTTCTGACGAGCGGCCTCGTCGCCGTCGAGCATTGCAAAAACGGAACTTGCCTCTACCAAACGGCTGCCGCCATATTGGATTTTATATACGTCATTAGCGGATGCATCCGGAATCGAATACCCAAGGCAAAGACCTCGGTGCTGATCGCCGTAATGGCTCCACATGAGTGGGCAGTTTGCACGCTCCGCCAGCGAGAAGACGCCTTTGTTGTAACGCCGTAAAAGTTCCTCCTGTAGATAATAGCGAAGTAGGTGTTGTGTGGGATCATCAATCTCATAATCGGGATTAGTCGCATTGTAGCGAATATCGGCGAGTAACTGGTCAGCCTTCTTTCGGCTCTGCCGAGCGATATGATTGATGGTTTTAGGTCCACGGTATTTGATCGACTTCGCGGCGGCGGACATCTCTGCCGTCGAACGTTGTTCGATGAGCTGAGACAGGATCCCCTCCAACGCATCGTTGTCAATATCAACGTTAAGAGTGGGTTTCGTATCTAAGGGATCATTGAAGGTGGTTGGATCGGCTAAATAGACGACGTCCTCGACCAACATGGTGAGTGTGAGATTACTAAAGCTGCGATACTTGTAAAGCCGCCTTGGCCGTTTGCCTTTAATCATGTGAGCGCAGCCCTAGGTTTTCTCTCGGAGCTCATCCGAAGTGTTGCTTTTCATATATGGTTACAACCTCTGATCCGTCCCAAACATAACAAAGATGTCGCTCTTGCCGACAGTTGGACAAGAGTTTTGGTCGAAACACTGCTGCGCATTCCCCGACATCATGTCTTACGCTTTGATAGATTATTCCATCTGACCCCGCTTTTCGTAGCTCAATTGCAAGGATTTGAGACGCACTATAATTATCGGGATCATAATACCCTATAGCGGCATCATGCATTTCCCGAATGTCGTGTAGATTTGCATCAAGATCGACCGCATAAACTCGCATATCCAATTCCTGCGGAGGCTCTTCGGTAGCCAGCATAAACTTTACGCGGTGATACCTGGTTTCTGCAACTGCTGTTTCGATACTGCCGCCTACATAGAAGACCCCGAATGTCCCATCAGTAAAACGATCTCCATCGGGATTGAGGTGGGTGAATGCGGCCATGATCGGCGAGGAACCCGGTCCGGATACACGATCCTCTAACGGCACAAGACTCAAATCACCGGCTTCCTCTCGAAGCCGATCATTGGTCAGTGCTTCGACAGCATAAATTGCTTCAAGGTCGTCTGGATCGGTAACCTCCTCGAACAGGTTTATTGGAGGAAACCTGCTGGGTATTATGCGATAACATGGATGCCAATTGACGGCGGTAACCGATATATTCACGGATATCCACGCTGTGCGTCGATATATTGGCGTACCACGAAGAGATCAGCCACATTTCCAGAAACCAAACGATTAAGTGCCGGTTCGCCGCCAAAGAGAGGGGCCTCATTGGGTTTTCGCACCCACTCATCAGCAGATTTTGGCAGCAAAACTTTCAAGCCTTTGTAGATCCCCAATATATAAGATATCCGTTCAAGGGCATCTTTCGATATTGCAGCCACGCCGCCCTTCTTCCAGGACTGAAGCGTCGACCGGCTATCAAGTCCCAATATATTCATTTGCTCCTGTTCACTGAGCTTCCAAGCATCAGCCACACGAAAAAAGGTGCGCAAAGCTGGACCGGTCAAGTCTTTGCGACTCTTGGCCTTGGCTGTTGAAACATTGGACATACGATACCTCCTTGGAATCATATGTGCATATTTTATACACAATTGTCAAGTTATGTGTATTTTATAGACAATCAACCTTTCCGCTAGACAACCACTGTTTTGCAAACGCATTTGTTCTCGCATCAATACTGGCAATGATCGTTTCAGAAATCCGTTCCTGCTGCTTGCGAAGATGGACACGCAACAACTTATCGCGAGTAATATAGCCCGCATTCACACCGCGCAGCGAATGGTTCATCAACAGGTGGATATCGAGTTCGGAAATCTCCGCTGCCTGAGCCAATGTTCGGTACGTCTGTCGCAGATCATTACCCCATTTCGAGAGAACAGATCGAGCTTCTTTGTGCTCGATCATATGCCCAGATTCACTTTCGGCCGGAAAAAGCCAATATTCTGCCTGCACTGGAAAGAGCATCCGACCTATCCGCATGATGCGTATTATGCACCGGATCATCATCCGCGACATGGGGATGTCAAAGGCCCTGTCTGCACCACCTTTGGGCTTTGGGATATGCAATATTCTCCGGCCAAAATCGATATGCTCACTTCTAACAGCCTTGAGTGCTGTCGGACGCGAACCAGTCAACAACGTCAACAGATGAAATTCACGGCGCAGCGGCTTATCCAGTTTGTAAAGCTGCTCAAACCAATCGCTCAGATCATCTTGACCCATGCCCGTGTTTCGCCGCTGCTCCGGATTCCAATCAACTGCGAGAACGGGGTTAAATGATGGTAAATCTCGATTGCCCTTCAAGGCATGGTTATAAATGGCACGCAGCGCCTTCATAGTGCTATTTGCTATGTAGGGACCGTTATTCGAAGAGATGTCTTCATGCCGAATGATGACTAAATCCGGCCGCCTGCCTAATTTCGATAGCGGCCAATCAAGCCAATCTGCCAGTAGGCGCTCCAAATAATATTGATAATTTTCGATCGTGCGTTCTGCCCGCCCCTTTCGTACCATATGAGCGATGCGGTAGCGCTCCCAAGCCATGCGTAATGTAATAGCCCCAGGCCTCAGCCTACTTACATCTCCAGGCTTTTCTCCCTTTGAAAACTTTACCAGCAACTCTTTGGCATTGGTTCGTGCATCACGAGCTGAAATTTCGTTGAATTCACCCAGTTTTTTTTGGACCGAGAATTCGCGGCATCCGTCGCGCCAATATTCTCCTTGAACCATGTAGGATTTTCTACGCTTGCCAATACGGACAAAAAAGCCCTTCAGCTCTACATCGCGCACATTGTATTGACCGCTTTGAGCGAGTGGCATGCGAGCAATTGACTTCTCGGTAAGCAGCATAACTTCACTTTTCATAGAACCTCCACACTTTTCCGTATCTGGCCATCACACTGATTTTCAGAAACAACATTAGTCTGGGTTTAGTCTGGGGAAAGCAATTTCAAGAGTGTGCTGGTGCAACAACGCTATGATACGCTATCCTACTATAACTCATTAACAAATAATATATTTTTGTAGAACTACTTGTCAATATGGAAACCACAGGAAACTCTAGATCAAGCTCATAATTAGTTCTGGGGGTGTGGGGGTCGGAGGTTCGAATCCTCTCGTCCCGACCAGTTTTCAGCCATTTTTTAAAATGCTGAATTCTTATATCACTGCTGTAAGTGCAATGTAAGTGCAGATTTTCTGTTTTTGTTCTCCAAAGCATCACCATCATCCCGGGGATCATCATCGGAGGAATATTTGCGCCAACCAGATAGGTTCTTCAAATATGAGATCGCCTGAATTGGAATAGTGCCAAACATCACCATCCGGACACTTGCCGCAAATACTAACTTTTAATCAGTAGGTCGCTGGTTCGAACCCAGCAGGGCTCACCATAATACGTTGAAGATAAACGATTTTCTGACAGAATGATATTCGAAGTTGCAATATTTGCAACTTGGGTAGCATTTTGGGTAGCAAATGGAAGAAATTCCAATATTCGAAGAACTGATGATGGACCAGTTCTTTCAAAGGTAGGGAATACAACAGCGTTGATTGCTAAACCAACGAACCATCCAACCCAAAACAGTGATAAATGTAAAAACGATTTCCGGTAGCGGCGCCTGCTACCATTTCTCGTCGCGAGTAAACTCGACTGCGGTCGCGTCAAGATCTTTCGCCAATTGATTGAACTCACGATATGCTCCAAGGCGCTCTAGACCATCAAGTCGCTGATCGCCGGAAAGGCGGTCCATGCGGGTTCGAATTAGCGCCGCCACAATTTTCAATGTTGCTGGATCTGGCTTGGTCATAGTTGCGTTGTATCACGATTCAAATTGCGATGCGAACAAGAGCTTTAGGTGTTCAGCGAAACAACACCAGAGGGCAGACGAATAAGATGTGTCATGGTAATCTTTTTTTCTGACAAAAATAAACACTGATTGAACAATCCTGGCTAAAGATGATCGCGTTATCCTGAGCAGTCGATGCTGGGGGGGATCGGAGGTTCGAATCCTCTCGTCCCGGCCAGTTTTCAGCCATTTTTTCAAAATGCTGAATTCTTATATCACCGCTGTAAGTGCAATGTAAGTGCGGATACTCTCTTTTTGTTCTCATCGCACCCTATGGATTTCACGCTCATTCTAGAACATGGTCAAAAAGATCAACCAAATAGGTTCAATTGAAAACAATTTCAATTGAGCGATCTTGGCATGACAAATCAGGCTTTCCTATACTATTTTCAGAAAGTGGCAGTCGCACGTCAGGCTTGAACGACCTGCTGGTCGATCACACCACATGGGGATGAGCCGTCAATTCCGCGCCCTTCGATGCGAATAGGGTCACCAAAAGACAGGAACGGCGTTGTGGCTAGGCCCTTGTCGCGGATCACCTCGATCGCACGAACCTCTGAAATACAGCTTGATCCGACTTCTGTATAGTTTGAATCGGAAACGGTACCCGACTCAATAATCGTTGCCGCTACCAGATCGCGCGTCAATGCAGCATGAGCAACCAGTCCCTGGAAACCAAAATCCATGGCTGCACCAAAACGTTCACCGTACAAGTCAATTTATAAGTCGTGTCAAACACGCCATCTTTCTAGACTTCACCCAGTTCATCGGGTGTCACCGCAAAGGGCACCATCGAACAGGCCGGTTCTGCCTGCACCCAACCAAAGCCGGTTGCCATTTCTATCGGAGCAATCGCGCGCAAAGAGCAGTCGTTGATTTTGACGATTAGTCTGATGTGTTTCGTTGCGTCCGTAGCGCATGTTCCCATTGGAACATGATCTACAATAACACCGAACTCGCCTTCAAAATCAATACCATGTTCGATAGTGGGCAACGACACATCTTCGGTCGGGCCGAAAAACCGATCCGAACACCCTGATACATGAGCGGTTTGTCGGTATTGATTGGATCCAGACCAAACGCGACTTGCATAAGGTCTCCATGAGTTGAAAAGGCAGAGCCGTCGAGCCATTGCCAGCTGCGCGGCATCAGCGCAGCACAGTCGGTGGTCGCAAACTCTTCACCTTCCCCCGAAAAAAGGCGATCAAGTGCTTTGGCAACGCTCGGCTGAACCACTTGCCAGTCACCGAAAACAGCGAGCAAATTGGTTGCCGGACCATCGACCCGAAAAACAGCGCCTCCTGCATCACCGTCCGCTCGCCCATCATCGCAGCATCTCCCGCCTCTACAGGAATGCTGAATCGCATCTCTACAATGCTGACAAGACCGAGTTTTTCAACACAACCCGCCCTCTTGTCGGTCGGTCGACCACCTCTTCGGTATTCTTACAAGCAAACACAATCCGCGCTGGGAGATTCTCGGTAGTCTCGGGCGCTATGCCGTCCAACGGCTACATGGCGCAGTGTAGATCATATCCGCTGTATATAAGGATGAGGATATCTACTCTATCCACGTCGTACATCCCGTTAAGCAACTTCTAGCCGCGGATGACTGCCTGATTTATGCTCCGGCAGGAGTCGATCGCTAGCAACGCCAATAACTGGCTTATACCAAAAAAGGGAGCGAGTGGCCCCAGCCACTCGCTCCAGCTGATCCTATGGGAGAGAAGGATCAGAACCAGTTAGATTTTGGGATCTGACGTGTTCAGCACACCCTTAGATTCCCAACCATTTTTTCCATCGCGGTTCGTATGATGGTCAAGTCTGCTTTGGAAATGCCTTTGAACATCTTCTGATAAACGGCGTTGGCTTCTTTCCGAAGTTCCGGAATAATGTCGATTGCCTTTGGACGCAGGAATATCCGCCAGACACGCCGGTCATCTTCTGCGGACCGACGTTCGACAAATTCCAGTTTTTCAAGATGATCAATCGTCAATCCGATCGTAGCTCGTTCCAGTTCTAGGCAAACTGCCAGCTCGGTTTGCGTCATTCCCTCGGTCTTGATCAGATAGGCCAATGTTCTCCACTGGGTGCGGGACAAGCCGAACTGGGCAATCGATTCATCAAATGACTTGCGCAGGACTCGCGGTACTTCCTCCAGCAAAAACGCAAACTCATCGGTCTCATTCAGGAAGCCGTCTTGCGCGTTTGCGCTCATTGTAGCTGTTTCGGTTGTCATGATAGTGACCATATCGTAAATTCAATATAATATAAAGCCCTTTACTGCATAAATGCCGTCAAGGCGACACGCTCCCGCGATTACTCCATGTAGGTTCACGGGTTGGGGCTGAAGCCAGCTTTGTTTGGGTTAGTATGCTTGTCGAGATAATTGAGGATCACGTCGTCGGTGATATTGCCTGAGGTGGTCGAGAAATAGCCTCGTTGCCAGAAGCGCTGGCCCCAGTAGCGTTTGCGGATATGCTCGAACTCCTGCTGTATCTTGCGCGATGAGCGTCCCTTTGCCCGGCGCACGAAGTCGCTGACTGCAATATGGGGCGGGATCTCGACAAACATATGCACATGATCGCGTGACAGAACGCCATGGATGATCTTCACGCCCAGTTCGCTGCAGACCTGCCGGATGATCTCCCGCACCCGCAGGCGGACCTCCCCATGCAGCACCTTGAAGCGGTATTTTGGAGTCCGGACAATATGATAATGATGATGAAAAGCCGTGTGGCTCCCGCATGAATATGGCATGAGCACAACCCTCGAGAAAATGACTTCGTCTGCGAGCGGGTCATTTTCTGGAGGATAGTGCTGAGCAAACAATAACGGCTGAAGCCTTCTACCAGCTATAAGCTGGTGAGTTTCCTTCAAAGGCGAGACATTAAATAACTTTACTGTTTGGGGCGCCGGAAGCTAGGGGGTAGTCCCATGTATTGCGAAACAGTCAGCATTGAAGGAGCTGGAGGCTTGACCCTCACAGCCGAAGCCAACGGCGAGAGTGGTGCGATGCCCGTCCTGCTCGCTCATGGTGGTGGGCAAACACGTCGTGCATGGAAGAACGTGGTCGGCGACCTTGCGGAGGCGGGTTTCCACGCAATCGCCATCGACATGCGGGGCCACGGCGACAGCGAATGGGCAGCCGATGGCGCTTACGAAACACACGATTTCGCGTCCGATCTGGTCGCCATCGTATCACGGATGGAGCGCAAGCTTGCGCTGGTCGGGGCGTCGCTGGGCGGAATGGCCGGTATACTCGCCGAAGGGGATTTGGCGCCCGGCAGCTTCGCTTCCCTGACACTTGTCGACATCGCACCGCAGATGGAAGTAAGCGGGGTCAACCGCATTGTCGGATTTATGGAAGAGCATATCGAAACCGGTTTCGCCTCGCCCGAAGAGGCGTCGGAAGCGATCGCGCGTTATATGCCCCTTCGACGCAAGCGGTCGAGCGGCGAAGGGCTGCGTAACTATCTACGACACAAAGCCGATGGGCGGTATTACTGGCATTGGGATCCAGCCTTTATCCTTAGCAGCAGGACGGTCAGCAAACGCGATGAAGGGCGCCATCAACGCCAATTTGAAGCATTAAGCCAGGCGACCCAGAACCTCACCTTGCCGGTCCATCTCATTCGCGGCGGTTCGAGCGACCTCGTTTCCGAAGACGCGGTGGAGCATCTACGCGAACTTGCGCCGCATGCGGAATATACCGACATCGCCGGCGCCACTCACATGGTTGTGGGCGATGCGAACGATTCGTTTTCTCACGCCATTCTCGATTTCCTGAAACGACATCACTCCATGCGGGGGCGAGAAACATGAGCGAGGCGCCAGCGCGAATAGATCGGGCACGCATAGAGGAAATTCTCCGTATCGCGCCTTTCCATGCGTGGCTCGATCTGAAGGTTAGGTCGCTGACTCCGCAGCGGCTGGAGCTCGAAATGCCATGGCGCGACGAAATCGTTTCCAATCCTATAATCGGATCGGCCCATGGCGGCGTACTTGCGTCTCTGATCGACCTGACCGGTTTCTACGCCCTCATTGCACAGGGCACGAAAGTAAAGGCTACTGCCGATCTGCGGGTCGATTATCATCGCCCGGCAACCTCAGGCCCGCTCGTCGCAACCGGTTTGATTGTGAAAGTCGGTCAGCAAATTTCAGTGGCCGAAACCAGCGTTACCGGGCCGAATGAAAAATTGCTCGCTAGCGGGCGCGGTGCCTATATATGCGGTGACCTGTAGGGTCGGCATCTAGGATGGCGCCTGCCATCCCAGGCCGAGCGATTTTTGCACCGCAATCCACGCAAGAGTAAGGGCAGCTCTGCCCCGGGCCAAGTCCGTATTGGCCTTCTCCTGCTCGCGCAAAGTCCGGTTGAGATCGATGCGCGAGATCGCCCCCGACTCAAAGCGCTGCCGGTTGAGATCCGCCGATTTATCGGCATGTCGCTTGACCTGGATCAAAGCCGCCAGCGCAGCGCGCTGCTGCGCGAAACGGGCTAAGGCGCGCTCGGCATCCTGCAAAGCTGCAAGCACGGTCTGCCGGTAATTTGCAGCAGCCTCGCGGCGAGCCGCACCTGCCCGGTCGATCGACGCGTCGACCCTTCCGAAATCAAGGAAATTCCATTGCAGGCGCGGAATCGCAATGGCAGAAAACTCGCCTACGTCGAAGATATCTTCCGGCGAGGTTCCCCCCAGCCCGAGGATGCCCATGAACGAGAGTTTCGGAAACTTCGCAGCCTCGGCCACACCGACCCGCGCCGTCGCGGCGGCCAAGGTGCGTTCAGCGGCGCGGATATCGGGCCGCCGCGCGATGAGACTAGCGGGATCACCTACCGCCACTCTTTCGGGCGGCAGGGGGATCTCGCCAATCGGTGCAACTTCGAGAGAAGGCGCTCCCGGAGCGCGCCCGGTCAGAACGGCGAGCGCATCGAGCAACACGGCCTCATCTGCTTCGGCCTCGGCCAGTTGCGACTTGAGCAACTCCAGTTCTGCATTAGCATTTCCGACCGGAAACAGCGGTAGGGTACCTTGCTGATAACGCTGATAGGTCAGAACGAGTATCTGCTCCTGCAACTCGCTTTGCAGGCGGTACCGCACGACCCGCTGCTGTGCCTCGCGCAGGTTCACATAGGCGTTGGCGACGTCTGCAGTTAGCTGGACTCGCGCATCCTCGGCATTCGCTACGGCAGCGGCCAACTGTGCATTGGCTGCCTCGATCCGCCGCCCGGTCCCGCCACCAAAATCGATCTCCCAATTGGCGTTGAGGCCGGCATTATAAAAACTGATGGTATCGTCTTCCTCTTCTGGCGGCATTCCGGGCGTCCCTGGAGGAAGTGGCGGACCGCCCTGAATGTCTAAACCCGGCAGACGGCCCTGCACGACCGTTGCTTGGGTGCCAAGCGATGGCAGACGGCCTGCATTTTCCTGCGCGAGGGATGCGCGGGACTGCGCGATGCGCGCACCCACCGCCTCCAGCGAAGGATTATCGGCGAGCGCCGCCTCGATCAGTCGATCGAGTCCGGGATCGTCGAGTAGAATCCACCACTCGGCGAGTTCCGGCACTGTGCTGCTCACTTTGTCGCCGGCCCGGATGAAACCCCCGGCTCCGCCGGCGGCAGCCAACTCGGGCGGCCCCGCATAGTCTGGGCCGGCGGTGCATCCTGCCAGAGTGGCGGCAATAAGAAGAAAAGGAATTGATTTTCGGATCATGTCGGTATCAGTGCATCGAGAGGGAAACGTTCTTGGGCAGCGGCTTGAGGAACACGACAAGCGGCACCGTTATCACCGTCAACAGACCCATCACGAAGAACACGTCGTTATAGGTCATAACAAAGGCTTCACGTTGGATGGTCCGGTCCAGCACGGCCATTGCTTCATCCATACCGCCGAAGGTCCGCGCCAGCGAATCCAGGTAGCTCTGCAGCGAGACGCTGTTTGCATTCAGAGTTTCTTCCATCCGGCGGCTGTGATGCCAGGTGCGCTGGTCCTGGAACGAGGCGAGGCCCGCTAATGCGAATGACCCCCCCAGATTGCGCGCGGCATTGAAGATGCCCGAAGCATCGCCGGCTTCGTCCTTCGCCACCGAGGCAACAGTCGCCTGGTTGAGGAACATCATCGCCAGGATCATACCCACCCCGCGTATTAGCTGGCTTTCGGTAAACACTGCGCCACCGGATTCGGCTGTAAGGCTGGTGCTCACAAAACAGCTAAGGGCCATCAAGAGCATACCAGCCCCTACTGCGATGCGGATATCAACGCGACGGATCATGAAGGGTAGAACCGGCATCAACAGGAACGCCGGCACACCCATCCAGAAGATTACCAGCCCGGTCTGGAAGGCGTTGTAATCAGATATAATCGCCAGGAACTGTGGGATAACGTAGATCGAGCCATACATCACCATCCCCAACGCGAGCGCCATCACAGCGACGCTCCCGAATTGCCGACTTAAGAGTAGCCGGAGTTTAAGAACGGGTTTGGAGGCGTACAATTGACCGTAGATGAGGGACGCGAACCCGATTACAGTGATCAGGGTAAGCCATCGGATGAGAGAGGAATCGAACCATTCCTCGCGGTGGCCTTCCTCCAGCACGACCGTCAGTCCACCAAGCCCGAGAATCATGCCAAGGATACCGGCCCAGTCTGCCTGGGTCAGATATTCCCAGTTCGGCTTTTCGTGTGGCAAACCTACGAATAGGAGCAAGAGGAGGGTCGCGCAGACCGGGACATTGACAAAGAAGGCGTAGTGCCAACTAAGATTCTCAGTCAACCATCCGCCGATCAGCGGCCCCATGACCGGACCTAAGACGACGGTCATGCCGAACAGCGCCATACCGATGGGTTGCTGGTGGGGAGGCAGCCGCTTCGCCACGATCGTCATCGCAGTCGGAATCAGTACGCCGCCCATAAATCCCTGACCTGTACGACCGATAATCATAGTCGTCAGATCGGTCGCGATCCCGCATAGTATCGAGAAGCCTGTAAACGCAGTCACGGCGATGATGAGCAGCGTACGCAAGCCAAACAGGCGCTCCAGCCAGGCGCTGAGCGGGATGACGACAATCTCGGCGACGAGAAAGGAGGTAGCGATCCAGGTGCCCTCGGTACCGGTCGCCCCGATTTCGCCCTGAATGACGGGAAGCGCGGAATTGACGATTGAGATGTCCAAGGTCGCCAGCATAGCGCCGAGCGCGCCCGCCACCACCGCGACCCAGGCAGTGACGTCGGCATTCTGCGGCTTGGTTGGAGCACCGGTCGGGCCAGCTTGGGCAGGCGCTGCCTCGGTCATTCGGCAGCGCTCGAAATCTCTTCGAGTTCGCCGACGGCACCGCGGGTGTCGATGGTAGCGACGACCGACATGCCGGGCACGAGCAGGCGGCGCACTTCGGGTGCTGCGTCGATGGATATGCGGACCGGGATACGCTGGACGATCTTGGTGAAGTTCCCGGTCGCGTTTTCTGGTGGCAAAATCGAAAATTCGGCCCCGGTTCCGGGCGAGATGCTGTCCACCGTTCCAGCAATCTTGCCATCTGGCAATGCATCCACTTCGAGTCTGACCGGCTGACCCGGTCGCACGAGACCGACCTGCGTTTCCTTGAAATTGGCGGTAACGTAGATCTGGCTCACCGGCACCAGAGTCATGAGCCGCTGGCCGGGCTGGACGAACTGCCCGACCCGGACCGACAGGTCACCAATGCGCCCTGCCCTGCTCGCTTCCAGCCGGGTCGAAGAGACATTGAGGTCGGCAACATCGAGCTGCGCGCGAGCAGCATCGGCCTGTGCCCGGGTCTGCTCGATCTGCTCAAACAACGTGGCGCGGCGCCCTGTGGCGGCCGCGACCGCAGCCTGCGCGGAGGCAAGTTCGGCCCGAGCCTGCTGCGCTTGTGTCACGTACTGGTCGAGTTTTTCGCGCGGTTCCGCCCCGCTGGCGGCGAGCGGGCGATAGCGGTTCACCTGTTTGTTGGCAAGGTCTAGCGTCGCGCGCGCGACGGCCAATTGCGCGCGCGCCTGGCGGATCGCGGCGTTCTGTTCGCTCACCTGCGAGCGGATCGTGTCGGCACCGGCCACCGTGGCCGCGATCTGGGCGCGTGCCTGAGCGGCTTGGGCGCGGTAATCACGCAGGTCGAGCTGGACCAGTGCTTCGCCGCGGGCAACCTGTTGGTTTTCCGAAACCAGCACCGCATCGACGTATCCGGCCACCTTGGAAGAGACCACCACACTGTCCGCTGCAACGTAGGCATTGTCGGTTGACTGCATGTATTGCCCGTAGGTAACATAACGATAATACCACCAGGCACCTGCAAGAACGGCCACGATCACCGCCGCGAGGATGAACAGGCGTAGTTTCCTGTTGGAACGCGATCCCGCAGCCGGTTCATTGTCGCCCTCTACAGCTTCGGAAATATCCTTCATTTGTCCGCTCACTCTATCGATCCCGATTCATTTAACGGGTGAGTTAGAGATAGCCCTACAGATCGTAAAGCACTTTATTATATTTGTTTGGTGTTTCATACCAAGTTGGCTAAAGCAATCCCATGGCCATCTATGAACTCCAAGTGCGCGAATCGAAAGACCGTCGGGAGATAATCACTGACGATGATCGGATATCCAAGAACCTTGAAAAACTCCCGGACGCAACGGACTGAGCCAGATGCCCGGAGATTCGGAAAAACTCATAGGCGACCTTTCGCGATTGGCGGAACTCGGTCGAATTTTTGCCCGTCACGGTGCAATGAATCTGGGTAATGCCCTTGGAATTGCACCTTTTCCGGAAGAAGATGTCAGTCCAGAGCACTTGCGACCGAAATCAATGGTTGCCTTGCTGCGCGATATTGGCCCGATCGGTGTGAAACTCGGCCAACTTCTCGCAACGCGCGGCGATGTGTTCTCTGACGAGTGGATAGACGCCTTGGCCAGTCTTCAGGACCGGGTCGAGCCCCTGCCGTTTGACGCGATCGAGCCTGTTCTCGTCGCCAGTTGGGGCGAGAAGTGGGAGGAAGATTTCACGGCATTCGAGCGCGATCCGATGGCCTCCGCCTCTATCGCCCAAACCTATACGGCAACCTTGCTGAATAGCAGTGACGTGATCGTGAAGGTTCGACGGTCGGGCACTGCGGCCAGACTGGAAGCAGACATGCGCATACTGGTGCGGCTGGCGAAGCTATCCGAGCGGCGGTCGAGCGACATCGCCCGCTATCGTCCAGTCGAGTTCCTCCAGACCTTCGGCAAGAACCTGGCCCGCTAGATGGACCTCGCTGCAGAAGCGCGGGCCTGCGAGCGGATCGGCGGCTACCTTGAAACGCTGGGAGCGAAGTCCCCGGCGATACACTGGCATCTCACCGGCCTGCGGGTGAATGTGCTGGAACGCCTGTGTTGCATCGACCGGTTGAGATCGCACCGCATCGCAGATCGACCGTGTGGCGGAGAAGGCGCGAAACATCGTATTTGCCAAACGCTATGCCGATGCTGTGCTGCGGATGATGATCCTCAATGGCAAATTCCATGGCGACCCCCATCCCGGTAATATTTTCCGGATGGATGCTGACCAGGTTGGCTTCATCGATTTCGGCGCGGTCGGTTTTCTCAGCGGAGCGAGTCGGCAGGAAATCGTCCGTCTGATCCTCGCAATCGCCGACGAACAGGCGGAGGACGTTGCCGATCTGCTGCTCGACTGGGCCGGCAATCCGGCGGTGGATCGCGACCAGCTCGCCCTCGATCTGGACCATATCATCGAAGAATTTCGCGGCACGGTGCTGTCGGGGATCGAATTATCGGCGCTTTTCGACCGGGTTTTCGGCCTTCTGCGCGAATACAGCCTCGTCCTGCCGCCCGATCTTGCCATCCTGCTGCGCACACTGCTGACGACCGAAGGCCCTGTCCTCGCGTTCGCGCCAGACTACAACATTGCCGAAGAGACCAGACCCATCCTTATTGAACTGCTGTCGGAACGGTTCTCGTTCGGTGCGACCCGCAGCCAGCTCACCAAGCTGCGCCGCCGACTGTTCGGTCTTTCGGCTTCCCTGCCAGACCTAATCGGCAATGCCACCATTATCGCCCGATCTGGCTCGATTCCCGTGACCATCGACCCGACCAGTCGTGGGTGCTGGCAGGGATCACCCTAGCGCTCGCACTGATCGCATTGGTCCGGAAATGGAATTGAACCAGCCGACTGCCCATGTCTTAAAGCCAACACAAATAAGCGAAGATCAAACCGGAGAGAGAAATATGACCAGCCCGCAACCCGTCGATCCCGTCCCCGCAGCCACCATGTTGTTGCTGCGCGACGAGCCGGAATTCCAAGTGCTGATGGTAAAGCGGCACCACCAGATCGACTTTGCCTCCGGCGCGCTGGTCTTTCCGGGTGGGAAACCAGCTACGGGTGATGATGCTGCGCAATGGGCCGACCATTGCGCCGGCTGGGACACACTTGAACCGACACAGCGGACCCTGCGGATCGCCGCAGTTCGCGAAGCCTATGAGGAAGCGGGCATCCTGCTGGCCGATAAAAGCGATGGCAGCGCTTTCGAGGGTTCCTGCGACCCGGCAAGCCGCAAGGCGGTCGAACAAGGCGAACTCGCGTTCCTCGACGTGGTTCGTGAGGCCGATGTGCAATTGCGGCTGGACAGGCTTAGCGATTTCGCTCGGTGAATCACACCCACCTTCATGAACAAGCGCTTCGACACTTGGTTCTACGTCGCCCGCGCTCCCGAACGCCAGATCGCGGCGTGCGATGGCTACGAAACCGTGGATGCCGAATGGTTATCACCTGGCGGCGCACTGGCTATGGGCGAAGCCGGGGAGCGGACGATTATCTTTCCCACCAGGCTGAACCTGGAACTGCTGGCCCAGGCCAGCAGTGCGCAGGACTGCGTCGAACGTGCCGAAGCGCGCTTGATCGCGCCAGTCCTCCCGAAAATCGTTCAGCGTGACGGGAAGAACATTCTCACCATTCCCGAGAGTGCGGGTTATGGGGCAGCCAGTGAAGTAGTAGGTTGACGAGATTGCATTCCCGATAGCTCCTGAACGAATGGGTGTTCGAATGAAGGGCAAACAGTTCCGCCTATTCGCAGGCCGTCCAGCCCGGGCTGTTCTGATTGGACTCGGGGGCGGACAGAATTGACGGGGGAAGAATCAACCTGCTGAGAGGAAGGCAGAGGTTGGCAGCAAGTCGGTGCCGAACTCGCTTACGAAAGAAGTAGGCCCGAATGCATTTAAACCTCCACTGTGCTGTTCGACGGACAGTCTCAGCACCTGCGAAAAATGATCGGGCGGATGAGACGACCGCTCGGATCGTGGGCGCGAGAGTTGCGTGCCCACATGCCGCTTAGCTAGCCTTCCTCGCGCAGCTAGCGGCAGGCGAAAGCTCAGCGCGGTATTGAAGCGCTAAGAAGCGAGCATCTGGTTGAACCGCACGTGCTGCTTCAGGTAGTCAAGCGTGCTGTCGAACAATGCCTGCGCCAAATCGAGCATCTCGGCAGCAGCCTGCCACTGCCTCCATATGCCTTGGGTCCAGTGGGCGTGCCCAGCCCCGAGCACCAATATGACTCACGCCATTTTGTATGAAGAGGCGTTTCGTTTGTCGGAATTTTCAGGCCTGCTAGCAAGGTTGACCTTGCTCTTGAGTTCTTGCGGGAATTTTTCGCTAAGGAATTCGTGTACTTCTGTGCGAAACGCGTCGTCGTCCGCGTGGGTGTTCTCTTTCAGCTGGGTCGCCATCATGGTCTCCATCTGAGGATCAGGGTTGCTTCTCCATCGATTGGGCGAGGATTGCGAGATCTATCTCGATGGATCAATATTGCTGTTCGGGCACGTGAAGCACTAGGCCGTCCAGCATCGGCTTCATCGGAATCTGGCACGCGAGGCGGCTGTATTCTGTAGTGCCTTCGGCGAACTGCAGCAGTTCGGCTTCCATCCCGTCTTCCTGCAGCCGACCGACCTTGTCGATCCATTCGGGGTCGACATGAACGTGACAGGTCGCGCAGGCGCAAACGCCTCCGCAATCGCCGTCGATACCCGGCACGTCGTTGAACAATGCCGCTTCGCGCGCGGTTTCGCCTTTGTCGATCTCGACTTCCCGGCGCTCTCCGTCGCTCGCGACGAACGTTACCTTTACCATTTCTATCTCCGCAATATTTGGTTCGGACAATCAGCGGGCATGCAAACGCACCGGCAGATTGGTGATTCCCCGCACCAGATTTGAAAACAGGCGCTCAGGCTCGCCGGTCACTTCGACCCTTGAGAAACGCTTGTGAATCTCTTCCCAGATGATACGCAGTTGCAGTTCGGCGAGGCGGCTGCCCATGCAGCGATGAATTCCATAGCCAAAAGAGAGGTGATGGCGCGGATTTTTGCGGTCGATAATGAATTCGTCCGCTCGGTCGATGACGCTGCCGTCGCGGTTGCCGGAGAGATACCACATCACGACCTTGTCGCCCTTTTTAATCTGTTTGCCGCCGATCTCCCAATCCTGGAGCGCAGTGCGGCGCATATGGGTCAGCGGCGTCTGCCACCGGATGATTTCCGGCACCATGCTGCCAATGAGCGAAGGGTCGTCCGACAGCCTGCGATATTGATCGGGGTTCTCATTAAGAGCCAGCACGCCTCCGCTGATCGAATTGCGGGTCGTGTCGTTGCCGCCCACGATGAGCAGCAGAAGGTTACCCAGAAATTCCAGGAAGGGCATGTCCCTTGTCGCCGGGGAGTGCGCCATAATGGAGATCAGGTCGTTCTTTGGCTCCTCATTGATGCGCTGTTCCCACAGCCCCTTGAAATACATTGCGCACTCGATGAGCTCTTCGCGCCTCGCCTCGTAGGATTCGACGATGCCAGTTTCCGGCGCAGCCGTTGTGATGTCCGACCAGCGGGTGAGCTTGCGCCGTTCTTCCCAGGGAAAGTCGAACAACGTCGCGAGCGTCATAGTGGTCAATTCGATCGAAACCTTGTCGACCCAGTCAATCTCCTCACCGACCGGCAGGTCGTCGAGAATCTTGGCGGCACGTTGGCGAATGATCGGTTCCAGCTGCTGCAAGTTGGACGGCGCGACCGAGGGCGTCACAGCCTTTCGCTGCTGATCGTGCTTGGGTGGATCCATGGCAATGAACATTTCCAGGTCAAGCGCACCCTCAACCGAATGCATGTCCTGAATCGCGATACCGCCAAGCTTCGCCTCGGACGAGAACACCTTGTGGTTGGTATCGACCGCCATGATGTCGTCGAACTTGGTGATCGACCAATAAGGGCCGACATAGCTGTCGCGGCAGTAATGAACCGGATCTTCCTTGCGCAGCCGATCAAAGAACAATCCGATGGTATCGTTCTGGAACAGGCTTGGCCGGGCAACGTCGATCTCTTCAATCGGAATCGCGGCGATCTTGTCTGCAGTATCCATTTCCATGATCTGGGTATTCATATCCTGTCTTCCTCTCCCGCAAGCAATCCTGCCCTAATCTTACAAAAATAATGGTCTAGCCGATTCTTGTCAAGAAAGCTTATAATATTTTCTACTTTTTACCCCACCCTATCGTTTCAAGGCCATTGACAGCGGGCATTGAACGCTTTTAAATGTTATACGACTTTAGCCAAGGGGCAACGATGCAGCCGCAGGTCGAAACAGTGAAGAAAACCAAGCGCGCCTTATCGCTCGCCCAGAACATCGTGCGCGACATCGAGGCGGGGGCGCATTCTCCTGGGGACCGGCTGCCGCACGAGGACGAGATGCTGGCGCGCTACGAGGTTGCCCGTGCGACCTTGCGCGAAGCCTTGCGCTTCCTGGAACTCCAGGGCGTGATCCACCTGCAACTGGGGCGCGGAGGCGGGCCGGTGGTGGCACGCCCGCAGACTGGTGATTTCGCCAATAGCCTGTCGCTGATCCTGCATTTCATGGAAGCGGACCTGCGCGGATTGCTGGAACTGCGCGAGGCGATTGCGCCCGATGTCGCTGCCTATGCCGCGCTGCGCGCGACCACGGGCGACCTGAGTGCACTTGCCGATTGCTTCAAAGAACTGGAGCGCAATGAAGCCGACAATAATTTCGAGGAACTGAACCGCCGTTTTCACGACCTGCTTGGCTGGGCAAGCGGCAATCCGCTCTTTGGCTTGCTAACCTCGACGATGCATTTGCTGACCCGCGAATTCTCGAACTCGCTGGGGTATTCGGCACAAGAGCGAGCGGTCCAACTACGATTTTTGCGCAGCGTTCTCGAATCGGTGCGCACTGGCGACCAGGCAGCTGCGCGGCAGGCGATGAGCCGCCTCGTCTCGGGCTCGGCATCCTATCTTGCGGAACGCAGCCCGGAACTGGTGTCCCAACGGGTCAAATGGGGGCAAATTTAGCAGTTGTAACCAATTGAAGCGTGCCAAAAGGCATCCATGCGGAGAGGAAGAGAAAATGGCACTGAGAAGCCGAATTTGTGAAATGCTGGGAATCGAATATCCGATCCTGCTGGCCGGCATGGGCGGTGCAAGCGTTCCGGCACTGGCCGCGGCAGTATCGAATGCGGGCGGGTTGGGCGTGCTGGGTGCGGCGGCGTGTTCGCCTGAACAGCTGCGCGACTGGATTCGTCAGACCCGCGAGCTGACCGACAAGCCCTTCGGCGTCGATACGCTGCTGCCCGCCTCGGTCACGCGCGGATCCGCACCGCAGAGCGGTGGCGCACCCGAAAATCCGATGGAACTGCTGGGCGAATACCAGCAGTTCACCCGCGATTTCATGGACCGCGAAGGCCTTCCTCAAGTTGATACTGAGGCGGCCATGCGGGCGGCTGGCGCCCCCGAAATGGGGAAAGGCGGGCCTCAGCTCTTCTCGAAGGAGTTCTTTGAAGCGCAGATGGAAGTGGTGATCGAGGAAAAAGTGTCCGTCTATGCTGCGGGTCTTGGCAATCCCGAACCGTGGATGGACCGCCTGCATGCCAATGGCACCAAGGTCATGGCGGTGATCGGCAAGGTTAAACATGCCGAGCAAGTGGTCGGTTCGGGCATCGACATGATTGTGGCGCAGGGTCACGATGGTGGCGGTCACAATTCGCCGATCGGCACCATCTCGCTCATCCCGCAAGTGGTCGATGCGGTAGGCGATCGCGTTCCCGTCATCGGTGCCGGCGGCATTTCGGATGGCCGCGGCGTTGCTGCCGCGATGATGCTGGGCGCCGAGGGCGCATGGATCGGAACGGCGTTCCTCGCGACCGAGGAAGCGGGTATCGAACACTTCCAGAAAGAAGCGATCACCGAAAGCGGCGAGGACGACACGGTGGTCAGCCGCTCTCTTACCGGCAAGCCTGCGCGCATGATTCGCAACAGATGGGCCGATGCATGGGTTGAGGCGGGCAAAGAGCCCCTCCCCATGCCGTACCAATCGATGATTTCCGGGCCGGTAATGGCCTCCGGCATCAAGGCGGAGCGAAAAGACGTTATTCCCGGTTTCGCAGGTCAGGGGATCGGACTGATCGACGCGATCCGTCCCGCCTCCGAGGTCATGCAGGATCTCGTCACGGGGGCCGAAGAGGCGCTTTCATCCGCCAAGTCCTACTCGTGAACCGCGACGGAGGTATTGTCATGAACGAAGTACCAAACTTGCCGCAAGGGCACGAATTGCGCCGCGGCATTCGCCCTGGAGAAGCCTTTTCGGCAATGCGCCGGCTGATAGCGGACAAGGAAGACACTTCGCAGGTATTTCGCATCGTCCAGGCGCTATCGGGCAATTCCTACTACCGCAACTTCCGCCGCTTCGCCGCCTCGCCGCAAGGGCAGACGATCCTGACGGAACGGGCCGATTTGCTCGGCACGCTCAGCGACCGGGAGCGTCTGGCGCGCTGCGCGCCGGGGACGCTTGGCCGCGCTTATCTCGATTTCGTCTATGGCGAGGGACTGACCGCTCAGGGCTTGGCCGAGGCGAGCGAGGCAAGCGGCATGGCAGAATTCAGCGATCCTGCGGTCACGCTCTATCGCCAACGTCTGCGCGATTCGCACGATCTGTTCCATGTGGTGACCGGTTATGGCCGCGATGCATTGGGCGAATTGTGCGTACTCTCTTTTGGCAATGCACAATTCTACAATCACGGCATCACCTTCATCGTTGCGGTCGGTATTCCCAAACTGCTGGCGGAGCAATGGCAGCTCCCCGTCGCGCGCGCCGCGTTTGAGGCGTGGCGACTGGGTCGCAAGGCAGCGGACCTGACCACTTTCTATTGGGAAAGGTACCTCGATCACCCGCTTGAGGAAGTGCGCCGCGATCTGAACCTCCAGCCCCCGGCAGTATATGTAGGCGTGCGTGATCTTTCACAGCGACTGGAGCGTGAGTTTCAGGCGCGCCGGCAAGGCGAGCTGCAGGCATGAACGGTCCCGCCGCCCTAAGCTTCAGCGACAGGTCCGCTATCACTGGCGTTGGCGAAACCGCTTTCGTCAAAGGCACCGAACGCACCGCAGTGGACATGATGCTGGAAGCATCGCGCCGCGCCATCGCGGATGCCGGGCTGAAACCTTCCGATATCGATGGCATGGTGCCGCCGCCGATCTATACGACGTCAGAGGAAATGGCCGCCAATCTGGGTATCGATGTGCTGCGCTATGCCGCCACAGTGCATATGGGCGGGGCCAGCCCGACTACGGCGCTGCAAAACGCGGCAATGGCGATCGCCAGCGGCCTGTGCGACCATGTGCTCATAACGCTGGGCTGGAACGGATATTCGGCGCTCAGGCCCAAGCCAGGTGCGCCGCCGACGCGGCCGATGAACATGAACACGCTGACCAACACGGTCAAAGGCTATTACAGTCCCTATGGCGTGTTTTTGCCGGTGCAAATGTATGCCTGGCTCGCGACCCGCCATTCAAAGCTTTACGGCGTCGGGCCTGACGCAATGGCGGCCGTCGCGCTGGCCTGCCGCCGCCATGCGCAGATGAATCCGCAGGCCTTTACCTATGGGCGCGAGCTGGACGCGGAAACCTATCACTCGGCGCGGTTGATTTCCGAACCTTTCCGCCTTTACGATTGCTGCCTGGAAACCGACGGAGCCTGCGCGGTCGTCGTCTCGCGGATGGACCGGGCAAAAGACATGCCGCATGTCCCCGTCAGCATTGCGGGCGCAGCCGAGGGACATCCCTATCCGGCCGACGACATTCCTTCCCGGCCCGATCCGTTCAAAATCGGCCTCAGCTATGCCGCCCCGCGCGCCTTCGACATGGCGGGCGTTCGCCGCGAGGACATGGATTTCCTGCAGATCTACGACTGCTTTACTTATGTCGTCCTGCTACAGCTCGAAGCGCTGGGTTATTGCGAACCGGGCGGACAGGGTGAATTTGTCGCCGACGGGCAGATCGAGCTGGGCGGGCGCTATCCGGTCAACACCCATGGCGGCCTGCTGAGCGAAGCGCATGTCTGGGGCCTCAACCACGTGGTCGAGGCGGTGCGTCAACTGCGCCACGATTGCGGCGAACGCCAGGTCGAAGGCGCGCAGACCGGGCTGGTCACCGGCTGGGGCGACTTGGGCGATGGCAGTATCGCTATCCTGAGGAGGTTTGCATGAGCGACGAGAAGAATCTTCCGCCCAAAATGCGTGCCTCGGCGCAGAAGGCGCCGCCAAAGCCGCAACCGCGCCCGCAGGATCCGGTGGAACAGGAATTCTGGAACCGTTGCCAGGACGGCAATCTCTATTTCCAGCGCTGCGGCGAATGTGGCACCTTCCGCCACCTGCCGCGCTACATGTGCGCAAAATGCGGCTCGCCCGACTTTTCTTGGGAGCAAAGCAGCGGAAAAGGTACGCTGTTTTCCTGGACGGTCACACATCAGGCTCTGCACCCTGCCTTTGCCGCTGACATTCCGTTCGTCTCGGCGGTGGTGGAACTGGAAGAAGGCGTCCGCATGGCCACGCGTCTGATCGATTGCGACCGCGACAAGCTAGAGCTGGACCTTCCGGTAACGCTCGATTTCGAACTGATCGGCGCGGATTTCCGCCTGCCGGTTTTCCGCCCCGATACGAACCTACCCCACGAAGGATAAGCACGAAATGGATCTCGAATATGGACCCGAATATGACGCTTTCTGCGAGGAAGTGCGACAATTCTTAAAGACGTACGGGAATCGTGCGCCAGCCGAGCAGGGCCGGGCCGCCCGCCCCTGTCCCGAGGCGGTCGAGTGGCAAAAGCTGCTGATCGAACATGGCTATACCGCGCGCACGATCCCCACGGAATACGGGGGTTACGGGGCCGAGCCCGACATTCTCAAATCGCGAATCATCGCCGAGGAATTCGCGCGCGCCGGCATTCCCGGCGGACTTGCCAACCAAGGCATATCGATGCTGGTCCCTACACTGCTGGAACTGGGAACTGAGGCACAGAAACGCCAGTGGGTCGAGCCGACGCTGAAAGGCGAGGTCGTATGGTGCCAGGGCTATTCCGAGCCCGGCGCGGGATCGGACCTTGCCAGCCTGAAAACCAGCGCGCGGATCGAAAACGGCGAATTCGTGATCAACGGCCAGAAGATCTGGACCAGCACGGCCAAGCAGGCGGACATGATCTTCTGCCTGGTCAGGACCGAACCTGACGCGCCCAAGCACGGCGGCATTTCCTATCTGATATTCTCGATGGATACGCCGGGGATTGAGGTGCGTCCGCTCAAGACCATGACCGGTCACGCCGAATTTAACGAGACGTTCTTCACCGACGTGCGGGTGCCGATGGACCAGATCGTCGGCCAGCGTGGTCAAGGCTGGTTCGTCGCCAATGCGACACTGGGTCACGAACGCGGCATGCTGGGCGATCCCGATGCGCTGGAAAACCGATTTCAGGCACTGGTCAAGCTAATGCAGAAAGAACGTGTCGGCCAAGGCCGCGCGATCGACAATGCCGTGCTGCGCGACCGGTTGGCGGCGCTTCAGGCCGAAGTTGCAGCGATGAAATACAACGGAATGCGTATCCTTTCGGACAATCTGAAGGGCGAACCCGGCGGCATGGCCAAGCTGATTGTCAAGCTGCAAAGCTGCGAACTGGCGCATCAGATATCGGCTCTGGCGATCGACGCCATGGGCGAGATCGGCATCCTTTATCACGGTAGCAATCGCGAGCGCGAAGGAGGCGCGTGGCAGTGGAACTACATGTTCCAGCTGGGCCTTATCATCGGTGGAGGTACCGCGCAGATCCAGAAGAACATCATCGCCGAACGCGGTCTCGATATGCCGCGCGAACCTAAGCTGGCCAAACCTGCGCCGGCCGAACAGGGCGCCGCCGTCGCGGCCCGCCAGAAACAGGGAGCCGCCTGATGGATTTCGGACTGTCGCAAGACCAGCAGATGCTGCGCGATACAGTCGCGCGATGCCTCGCCGACACCTGCCCGCTCGACCATGTGCGTGAGTGCGCCGAAGGCGAAGCTTCTTACAGCGAAAAGGTGCAGTCTGCACTGAGCGAACTGGGCATCACAGGAATGATGGTTCCCGAACAGCATGGCGGCCTCGGTATGACCTTGCTAGATGCAGCTATTGTGGCCGAACAGCTTGCCTATGCGGTCGCTCCGGTGCCGTTTCTCGCCAGCCATGTGCTTGCACCCATCGCGCTGGCGCAAGCCGGCAACGAGCAGCAGCAGGCTGAGTGGTTGCCTCAGATCGCCAGCGGCAAAGCGCAGATCGCCGTTGCCATTGCCGAGACTATTGAGGCGCGCGACGGGGCGGGCGTAACCTGCAACAGCGGCAAGCTGGACGGCACGGCGCTGTTCGCACTCGATTTCACCGGCGCCGAGGCCTTCCTGGTGGCCGATACCGCGGGGCGGATGCACTTCGTCCCCGCCGATGCACCGGGGCTGGAGAAAATCCCGCTGACGACGATCGACCGCACCCGCAGCGTAGGGGAAATGCGGTTCGAGACCGTTGCGGCCGAACCACTGGCCAACGATGGCGGAGCGACTGCGGCACGGCTGCGCGATGCGGGCCGGGTGATCCTGGCCGCTGACAGCCTCGGCGCGGGTCAGGCCATGATTGAGAAGGCGGTCGATTATGCCGGACAGCGCGAACAATTCGGGCGGATCATTGGCAGCTTTCAGGCGGTCAAGCATATGTGCGCGGAAATGGCAGCCCGTCACGAACCATGCCGCTCCCTGATTTGGTACGCAGCGCATGCTTTCGACGCGGCGTCGGATGAAGCCACGCTCGCCGCGTGCCATGCGAAATCGCATACCGGCGAGGTGTATCGCTTCGTCGCGCGCACTTCCACCGAAGTGCATGGCGGGATGGGCTTCACCGACCTGCTTGGGCTCCATTATTGGTTCAAGCGGATCGGTTTCGATCGCCAGGTTCTTGGCGGCCCCGAAGCGGTGAGAGCCGAAGCAGCAGCGCGGCAGGGGTGGACCCGGGCAGCGTGATTTAAGTAGCTGCGGGGACGCCGCGAGACAAGCGGCGCCCCGCCAAGAGATGAGATGGAAACGTGATCGACTGGAACCTTGGCGACATTCTTGATGCTATCGAACCGGCCATGCCGAAGGATGCGCCCGCGCTGATCCATGGCGACCGGATCATCACATGGCCGGAAATGTCAGTCCGTTCGAACAATCTCGCGCGCAACCTGCGGGAACGCGGTGCCGTCGACGGAGCGAAGGTGGCGTTCTACATGCGAAAACGTCCCGAATACGGCGAGCTGATGGCGGCCTGCTTCAAGGGTCGGCTCACGCATGTGAACATCAATTATCGCTACGTGCCCGAAGAAGTATTCTACATCTTCGACGATTCCGACAGCGAGGTGATCGTCTACAGCTCCGAATTTCGCGACTATATCCTTGAGCTGAAGGACCGGCTTGAGAAGGTCCACACTTTCGTCGAGATCGGTGATGCGTCGGAAATCGCCCCTTTCGCCGTTCCCTATGAACATCTCACGACGCAGGGAGACGGTTCGCCTCTAGGCATCGAGCGCTCACCAGGCGATCTGCTGTTCATCTATACCGGCGGCACCACCGGCATGCCCAAGGGCGTGATGTGGCGGCACGACGACATGCGCAAGGCGCAGCTCGACGCGCAAAAGCTGCTCGGCCCGGTGCCGCAGTCGCACGAAGAAAACGTCGCGCTGATCAAGAGCCAGGGGCCGGGACGCCGCACCCTCCCCTCCTGCCCGCTGATGCACGGCACAGGCTTTATCACCGGGATCGGCGCGCTGATGTCGGGCGGTGCAATCGTTACACTCTCCGACCCGTCATTCGATGCCGAAGAGTTGTGGGAGACGGTCGAGAAACACAAGGTCGAGAGTATCGCAATCGTGGGTGACGCTTTCGCCAAGCCGATGCTTCGCGCCCTCGACGAACATCCCGGCCGCTGGGATACCAGCAGCCTCGTGTCGATCATCTCGTCCGGCGTGATGTGGAGCAAGGAGGTCAAGGCCGGTCTTTGCAAGCACATTTCTCAAGTCATCCTGATGGACAGTTTCGGCGCTTCCGAAGGCCTCGGCTACGGCCTTTCGGTCACCACCGCGCAAGGCGGCACCAACACCGCCAAATTCGGCATCGGCGAGTTCTGCGACGTGTTCGACGAGAACGACCAGACGGTCGAACCGGGCAGCGGCGTGCCCGGCTTTATCGCGCGCAAGGGTGCGATCCCGGTCGGCTATTACAAGGACCCCGAAAAGTCGGCCAAGACTTTCAAGACAATCGACGGTGTGCGTTACTCGATACCGGGCGACTGGTGCCACGTGGAGACCGATGGCAGCCTGACCCTGCTCGGCCGGGGCAGCGTGTGCATCAACACCGCTGGTGAAAAGGTCTATCCCGAAGAAGTCGAAGAAGTGCTGAAGACCCACCCCGCCATCGCCGATGCGCTGGTGGTCGGCGTGCCGGACGAAAAATGGGGTCAAGCGGTAACCGCCGTGGTTCATCTCGACAAGCAGGCTGAATTTGATGAGCAAGCTGTAAAGGATCACGTCCGCCAGCAGCTGGCGGGATATAAAACGCCCAAGGCGATCCATCCTACCGACACGCCTCTGCGCGCCTCGAACGGAAAGGCGGATTACGCAGCGGCAAAGAAAATTGCGGAAGGCAGCAGGGCGGCAGCGTGAGCGGCAAGACCACGCCAGACTATGACGCGGTGATCGTCGGTGCCGGTTTCAGCGGCATCTACCTGCTTCACAAACTGCGCGAGGCCGGGTTCAATATCCTGCTGGTCGATGCTGCCGCCCAGCCCGGCGGCATCTGGTACTGGAACCGCTATCCGGGCGCGCGGGTCGATTCGCAGGTTCCGCTTTACGAATTCTCACTCCCAGATATATGGCAGACATGGTCTTGGACCGAGCGCTTTCCCGGATGGGAAGAGCTGCGCACCTATTTTCGTCATGTCTGCGATACCCTCGATCTGTGGCCGCACATGCGTTTGGGCACGAGGGTCGAAAGCGCGCGCTTCGCCGAAAAGGCATCGCTCTGGCGCCTGCATCTGGACGGTGGGGACACGGTGACAGCGCGGTTTCTCCTGCCGGCTCTCGGCTTCGCGTCGAAGCCTTATGTCCCCGACATTCCCGGCCTCGACACCTTCGCTGGCGAATGGTGCCACACCGCGCGCTGGCCTCAGGAAGGCATTGATCTGGCAGGCCGCAAGATTTGCCTGATCGGCACCGGCGCCAGCGGTGTGCAGGTGGCACAGGAGGCAGCAAAATCAGCAGACCGGCTGACACTGTTTCAGAGGACGCCGATCCTGGCCCTTCCCATGCGCCAGGAGACAATGACCGAGGAAGGGCAGGTGTGCGAAAAGCAGGGGTATCCGGCCATTTTCGAACAACGCAAGCAGACCAGCGGGGGTTTTGAATACCAGTCGCTGGAAAAATCGGCACTTGAGGTTAGCGATGAGGAGCGCACCGCACATTTCGAGCATCTCTGGCAGCGCGGCGGTCTAAAATTTTGGTATCACAATTTTGCGGATATGCTCACCAACCGCGAAGCAAACCGCTACGCTTATGATTTCTGGCGCGACAAAGTCCGGGAGCGGATCGTCGATCCGGCGCTTGCGGAAAAACTTGCTCCTGCCGAGCCGCCCCACCCCTTCGGCACCAAGCGGCCATCGCTGGAACAGAATTATTACGAGATTTTTGCGCAGGAAAATGTCGCGCTCGTCGATCTGAAGGAAACGCCGATCGTCCGGATCAGCGCGGACCGTATCGAAACTCAGGATGGGGCGTTTGACTGCGATCTGATCGTCTTTGCAACAGGCTTCGATGCCGGACGCGGCGGACTGATCGACATGAATATAACCGGGCTGGGCGGGCTCGGTTTGGCACAGGCCTGGAACGATGGCTTGCGCGCCTATCTGGGCATGTCCGTGACGGGCTTTCCCAATATGCTGTTTTCTTACGGTCCCCTGAGCCCATCAGGTTTTGCGAACGGCCCAACGAGCGCTGAGATACAAGGCGACTGGATTTGCGACTTTCTGATATGGCTGAGGCAAGGCGGGATCAAGCTCTTCGATGCCGAACCGCAGGCCGAGACGACCTGGACCGAAATAGTCGCCCAAGCTGGCGCGATGACACTCTTTCCCGAAGCGGACTCCTGGTACATGGGCGCAAACATCCCGAGCAAGAAGCGCCAGCTTATCAATTTCCCAAGCGTGGTTGGCTATGCCGCCGTATGCGACGACGTAGCGCAAAATGCCTATCGCGGATTCGCCACCTAAAAACTCACCAACAAGGAACCTTTCGCGCAATGAATGAAATGACCCCTGTAAACAACCTTCCGACTGACATCGATTCCTTGTGGGCCGGGGACGGTTCGCACCTGCCCGAATGGTTCATAGCAGCACTGAACGTGCCGCGCGAAGAAGCTTATGTAGAGATTTCCGGTGCCAAGGTGCACTATCTCCGCTGGGGCAGCCCCGATAAGCCCAAATTGCTAATGACACATGGCTTCCTCGCCCATGCCCGCTGCTTCGCTTTTATTGCGCCGTTCTTCGCGGAAGAATATGATGTCGTGGCATTCGATTTGGCGGGTATGGGGGACAGCGAAATGCGGGGGGCAGCTGACACGGCGGCCCGCGGGAGGGAATTCACCGAGGTTGCCGAGGCCCTGAACATGTTTGCGGATGGCCAAAAGCCAACGATCATCGCGCACAGCTTCGGTTCGGGTGCCGCGCTGACAGCCGTGACCCAGTCGCCCGACGCCTTTGCTGGAGTGGTCGTATGCGACCTTATGATCATGCGCCCTGAATTGCTTGAAAAATACTGGAAGAACGATCGGTCGAGCCCGGGATCAGGGAATCCGGACAAGCCCAACAAGCGCTATCCCAGCTATGAAGCAGCGCGTGCGCGCTACATCCTATCGCCGCCTCAGCCCGCCGAGGAGCCATTCCTGCTAGATTATATTGCGTATCATTCGCTGAAGCATAAAAGAGGCAGTTGGACGTGGAAATTCTCACCAGAGGTTTTCCGGCGGAGCAATAGGCCCGAGGAGTGGCTGAACATGGGTAAGCGGCTGGTGCAAGCGCCGGGCCGCAAAGCAATCGTGCACGGCGGTAACAGTCTGCTCTTCTCACAGGATTCTGCCGATTACATCCGCGAGATGGGTGGAAACGACATCCCGATAATAGCCATACCGGAAGCCCGCCACCACCTGATGCTCGATCAGCCATTGGCCTTTGTCGCAGCTCTCAGGAGCGTTCTTTCTTTTTGGAAAAGCTCCCACGAACGCTAACCGCAATGTTAGAATGACCTGTCATATCTTCGATGAATTGGGTGGTCGAGAATACCAAATATGAATTTTGATACCTTGATTTGAAGCTAGGTGTTTAGTCTAGGCATTTGATGAGTTGGATTAAGTGTAAATCTGTTTGGTTCAGTTGTCCACATTTTGCAGATGTATTCGAAGGGTGTGAGTCTCTTGAGGGTCTTCAGCCGCCGACCATAATTGTAGGCGTCAATGAAGTCGTTGAGATGCGCGGTCAGCTGGGCATGGCTGTCATAATTGTCAACGGCGGAGTAAAATCAGGCCATATGGCGGCGCAAAAGTAGACCAGTTTTTGGTTGAGCGTGATGGGTGCAAAATGGCGGCGGCCAGTCAGCCGCGCTCCCCAAATAGCTGGCGGTTGACTGGCCGCTTTGGCCCGTCAGGGCCAAATCTGGATTGGCTGGATCAGGTGGCGGCTTTTGCCTTCCGGGCACGGCTTTGGCCGAGTCGATAGCTGTCGCCATTCATCTCGAGGATGTTGACGTGGTGGGTCAGCCGATCGAGCAGTGCTCCGGTCAGACGTTCGGTGCCGAAGGTTTCGGTCCATTCGTCGAATGGCAAATTGCTGGTTATCATCGTGGAGCCGCGCTCATAGCGCTGCGAGATCAGTTCGAACAAGAGCTCTGCACCGGTCTTGGACAATGGCACAAAGCCCAACTCGTCGATGATGAGCAGCTTGACCGCGGCCAGCTGCTTTTGCAGGCGGAGCAGACGGCGCTCGTCTCGGGCTTCCATCATCTCGTTGACCAGCGTCGCGGCGGTGGTGAAGCTGACCGAGAGGCCCTTCTGGCAGGCAGCCAGCCCAAGTCCGAGGGCGACATGGGTCTTGCCTGTGCCGCTCGGGCCAAGGGCGATGACATTCTCGCGCCGGTCGATCCATTCGCAGCGGGCCAGTTCCAGCACCTGCATCTTGTTCAGCTTTGGGATCGCCTTGAAGTCGAAGCTGTCGAGGCTCTTGGTGGCCGGGAATTTCGCGGCCTTGATACGGCGTTCGATCATTCTGCGCTCGCGATCGATCAGCTCCAGCTCGACCAGACGTCCCAGGAACTGGACATGATCCAGCCCCTCGGCGGCGCACTGGCGCGCGAGCTTTTCATACTCGCGCAGGAATGTCGGCAGCTTCAGCGTCTTCAGATGATCGGCCAGCAGGATCTTCGGCGCGTCGGTCATTCCGCCGGCTCCGTCATCAGGGCCATGTAGCTGGCGGCCGATGTGGTCTCGACATTGGCGCGCGGCAGGTAGGGATAGACATCGAGGTCGAGCTTCGGCGGCCGTTTCTCGACATGGCAGAGAACAAGGTGCTTGACGGCATCAAAGCCAATTGCCCCCAATTGCAGGGCCTTCTTCACGGCGGCATGCAGATCGTCGATGTCGAAGGTCTCGAGCAGGCGCAAAACCTGCACGTACTCGCGGCGTCCGGCCTTGATCATCCGCGCCTCCATCAGGCGGCGCAGTGTTGCGAACTCGGGCGGCAGGCTCCATTCAGCCAAGGGAGCTGCTTGGTCCAGGGCGTTGATCTTGCGCTCGATCAGCGGAAGGTAATGCAGCGGATCGAAGACCATGTCCTCGCGGTCGTAGCAGCGGGGATGGCGTGTGATGACTTCGCCGCTGCAGCCAATAACCACCTCGTCGACATAGCCCCTGATCCAGACGTCGCGGTGGCCGTAAGCGACCGGTACCGAGTAATCGTTGGTTTTGTAGCGCACCAGCGCCTGCGAACTGACCCTTCCGGTTGCCTGGTCACAGGCATCGAACGGTGCCGCAGGAAGGCCGGTCATCGCCGCCAGATCCCGCGCGAGGCGTTGACCGATCGTCTCGGACTGGCCCCGCAGGACATCCGCCTGACGTTCGCGGCACTGCTCCTCCAAATAGGCGTTGAACGCCTCCCAGCTCGCAAACCGCGGGATCGGTACCATGAAGTTGCGGCGGGAATAGCCGACCAGCCCTTCCGCGTTACCCTTGTCGTTGCCCTTGCCGGGGCGGCCGTAGCGATCGCGGAACAGGTAGTGCGACAGGAACCCGCTGAAGAGCGTGGCCCGCTTACGCGTTCCGTTCGGAAGAATCTTCGCAACCAGGCAGCGGTCGTTGTCGTAGAGGATCGATATCGGCACCTTGCCGAAGAACGCGAAGGCGTGGACGTGGCCATCAACCCAGGCTTCCGCCGTTGCCGCCGGATAGGCCCGCACAAAGCAGGCATCACTGTGCGGCAGGTCCATGACGAAGAAATGCGCCTTCTGCTCGACACCGCCAATGATGACAACCGCCTCGCCGAAATCGGCCTGGGCATGGCCGGGCCGATGGACCAGCGGCACGAACATCTCGCGACCGCGCCGTTCGCGCTCCCGCATGTAGTCCTTCACGATCGTGTAGCCGCCGGTGAACCCGTGTTCATCGCGGAGCCGGTCGAACACCCGCTTCGCTGTGTGCCGCTGCTTGCGATGAACCCCCAGATCGCCGTCCAGCCAGCCGTCGATAATCCCAGTGAAGCCATCCAGCTTCGGTCGCTTGACCGGTGCCGATCGCCGGTAACCCGGCGGAACCGAGAACGCCATCATCTTGGCTACACTGTCGCGCGAGATGTTGAAATGCCGCGCTGCCTCCCGCTGGCTCATGCCCTCAGCACAGGCCAGCCGAACCTTCCGATATAAATACACAGTAAAAATCCTCCCGCCCTCCCTGTCGCCAGAAAGGGTAAAGGTGGACGACTTTTACACCGCCCGCAGCAAGCTCATCCCGCCGCTACCGTGGCCGACTTTTGCACCGCCGTTCTCACGCGTGAATATGGCATAAGCACTATCCTCCAGAAAATGACTTCGCCTGCGAGCCGGTCATTTTCTGGAGGATAGTGCTGAGCAAACAATAACGGCTGAAGTCTTCTACCAGCTATAAGCTGGTGAGTTTCCTTCAAAGGTGAGACATTAAAATTACAAAATATGGCCCTGCCGAGAAATCGGCAGGGCTTTTTTGTAGCAAAATTGCATCAATTTGCCTGTGTATCGTGCTAAAATGGAGCAAAACTAGATGTTTCACGATTTGAGCCTGGACTTAATCGAGCGATTAGTTAAGTAATAACGGAATGGGGGGGGGGGGTGCGTAAAAATATCACTCTCCGATAAAATGTTGGGATTTGGGAGAGAGAAATGCGTAAATCGTTATTATTAGCCGCCACAGCTATATCTACAATGTCCGTACCTGCCATGGCGCAGGACCGTGATGATGGTACCATTGATAGCAATGTCATTATCGTGACCGCGCAGAGGCAGGCACAAAGTGCTCAGGATGTACCGATTGCGGTTTCCGCATTTTCCGGGGCAGCCTTGGAAGCCCAACAGATCGAGAACAGTTCCGATCTCCAGCTAACGCTCCCGAACATCACCTTTACCAAAACCAATTTTGTCAGTTCCAGTTTTACAATTCGCGGGATCGGCGATCTTTGTGTCGGCGCGTCGTGTGATCAGGCAACCGCCATTCACCTGAACGATTCGCCTCTTTTCGCGACGCGACTTTTTGAAACAGAATTTTTCGATCTCGAACGCGTCGAAGTGCTGCGCGGTCCGCAAGGCACGTTGTTCGGACGCAATGCGACCGCGGGTGTGGTCAATGTTGTCACTGCCAAGCCACAAATGGGAGAGTTCAAGGCATCCGGCGATGCCGAATATGGCAATTATAACGCGATCAAAGTCAAGGGCATGGTCAACATTCCCATCGGTGACAATATCGCATTCCGTGGTGCCGGCGTTTATGTAAAACGCGATGGTTATACCACCAATCTTAACGGCGGCCCTGATCTTGATGACCGGGATATGTATTCGGTGCGTGGTTCGCTGCGTTTCGAACCAACTGCGGATACGACGATTGACCTTTATGCTTCATATTTCCGCGAAGATGACAATCGTATGCGGATTCAGAAGCAATATTGTCAACGTGACCCTACAGGTATTTTGGGTTGTCTGAACAGTTCTCGCAATGCAGAATCGTTCAACGCGAACTCGACTATTGCGGCTACATTATCGTCGCGCGAATTCCTTGCAACTCAGGGTATTCCAACGGCTTTTGCTCTCGGTAGTCTTTACGGTCCAGACCAATATGCAGGTGTTTCCGTTCCCGCGGATCCGCGGACGGTCAACACGGCCTTCACGCCGGAATATTTTGCCAGCGAGTTGACGTTCCAGGGCAAGATCGAACATGACTTCGGGCCGATCAGCGCGCAACTTTCCGGTACCTACCAGAAGGTCAAGCTGGATTCTCGGCAGGACTATAACAACAATATTGGCCGTAGGGATATTTATGCAACTGGATTAAACACGCTTGCTGCAGCTGCAGCTGGTGCGATCCCAGGTCTTCCTGCAGCTTATTTTGCACCTCTTGCTTCTGCGATCATTCCGGATGGACCGAACGGCGTGCTTTGTACTTCAGATACTGATACCACCGGTCTCGGGGTATTTGGGGGCAACAGCATTTGCGACGCAACACCGCTTCAATTTGACCGGTCGAATCTGGATAACAGCAGTTGGTCCGTCGAGGCGATCATCTCAAGCGATCTGGATGGTCCGTTCAACTTCCTGGTCGGCGGTATTTATGCCGACTCTCATCTGACCGAGAACAGCTATTATGTGAACGCCTTCGGGCTCGATTATGGGGCAGGTTTGCTCGGTAGCTTCATTTCGCTTGCCGATGGCTTGCCGCCTTCATTCCTCGGCACGCCATATTATCGCAACCATTCGGATGATCTGACGGTCAAATCCTATGGCCTGTTCGGCGAAGTCTATTTCGATATCAGCGACAAGCTGAAACTGACGGGTGGACTGCGTTACAACAACGACAAGAAAAAAGTTAGAGCGCGTACAACGCTGGCAGACTTCCTTGTGCCATATAGTCAAACAACCGATGCGTTTGAATCACCATTCGTCGGATCTTTGGATGCCGATCCAGGTATACCCGGTAATCAACTTTTCCAGAATCGGGAGGTCAAGTTCAATGAAATCACTGGCCGTGCGGTGCTTGATTATAAAATCACCGACGACAATCTGATCTATGCATCCTATTCTCGCGGTTATAAGTCAGGTGGTATAAATCCGCCACTGCAGCCAATTTTCGCAGCTCCGGAATCCTTCCGGCCCGAGCAGGTCGATGCGTTTGAAATCGGATCGAAAAATACATTTGGCGATGGGGCTCTGCAACTCAACCTGACTGCATTCTACTATAAATATAAGGGTTTGCAGCTCAGCAAGATTGTTGCTCGGACCGCTGTAAACGAAAATATCGATGCCGATATTTACGGTGCGGAAGTCGAGGCAGTCATTCGTCCTGATCCGGACTGGATGATCAACATGGGCTTCAGCTATCTGCACACCAAGGTCAAGGGCGATACGTTTAACAGCGATCCGCGCGATTTTGGCGGAGGCCGGTCGGATGCGGTCATCATCCAGGATATTACCAATGCATCCAATTGTGCGGTGGCTTCCACGTCCGGAAATGCAGCTGGTGTAAATGCTTTTGTCAACACGATAAATGGTGCGATCAATGCCGGTCTGGTGCCGGGACTTGCACCTGGAGCAGGCCTGCAACCAACCACGGCATTCCCTGCTGATGGTGGCATTGCTTCAACCGGTGCCTTTGGTATTTGCGCTGTGCTGGACGCAGCGGCCCAAGGAGCTTTTGCTGGTGCAGGCGTTGTTCCGGCAGCCTTTGGCGGGATCGAGTATTTCTCTGCCGGTGTTCCCAAGAACATTCGTGGGAACCAGTTGCCACAGGCACCTCAGCTGAAATTCTCCACTGGCGTGCAATATACCATGAACTTTGACAATGGCATGAGTCTGGTCCCGCGTGTTGACCTTGCTTATACCGGTGAAAGTTTCGGCAGTATCTTCAACGGCAATGTCAACCGGATCAAAGGTTATGCGCAAGCAAACGCCCAGATTCAGTTGAACGGTACGGATGACCGCTGGTATGTTCGTGGATTTGTCCAGAATATTTTCGACAGCAACTCGGAGACCGGACTATATGTCACCGATCAGTCGTCGGGCTTGTACACGAATATCTTCTCACTTGAGCCACGCCGCTATGGTATTGGAGCTGGCTTCAAGTTCTGATCTGAAGCGACCATGATATGAAGAAAGCCCTGCCGGGAAACCGGCGGGGCTTTTTTTGATCATCTAAGATGCCTTTTGGCGAAAGCGCGACCTGATCCGGATTTTAGATATATCTCGAAATTGGCAGCCATTTTGCGATCAGGAAAGCCGTGCATGGAGACTAACTCCCAAGGGCGAGGCGTGCTGTAAGCGTGGCAAGGACCTTCCAATTACCCACAAGTCGTCGCACGTGTGATTCCATTGGCCGCGCGGACCTTGCTCACGTGGAATCGGGTATGATTCATCCTCTGGCACCTGAAATAGCTGACGGGGTGCACAAGGGGGAATTCTGGTTTCGACGCAGTGAGATGGGCCGATGGGGAAGTTGATATCGGCGCATTTCGCGACAAGCGTCTGGGCGAGCGGCTTAGAACGATGCTTGCGCAGATGGCAGGAGCGATTGGCGCACCGATCCCGATGGCCTGTCAGGACTGGGCCAATACCAAGGCAGCTTACCGCTTCCTGTCTAACGGCTCAGTGAACGAAGGCGAAATCCTCGCCGGCCATTTTCAAGCAACGCGAGCGCGTGTTGGAACGCTTGAAGGGCCAATCCTCGTTCTACAGGACACCACTGAATTCTCCTATCAACGCCGCGCGCCGGAAAAAATCGGTGCAATCGGCCTGGCACCCAGCCGACGCGATGAAAATGGCAGGCTGCGACTTCATACGGTCTGTGGCCTGCTCATGCATTCTAGCCTTGCGATCACGACCGAGGGATTGCCGCTGGGTCTGACAGCAGCGAAATTCTGGACCCGCACCAAATTCAAGGGCACCAACGCGCTGAAACGAAGGATCAATCCTACGCGTGTGCCGAACGAGGAGAAGGAAAGCTACCGCTGGCTCGAGAACATGCGTCAGTCGACCGCTATGCTGGGCGAGTCCGAGCGACTGGTCCACATCGGCGACCGGGAGAACGACATTTACGAATTCTTTTGCGAAGCGCAGGCACTCGGCACACATTTTCTGGTCCGCACCTGTGTCGATCGTCTGGCCGGTGACGGTGACCACACCATTGCCGACGAAATGAGCGAGGTTTCCGTGCAGGGCATGCAACGGGTCGTCATCGACAAGGACGATCATGCCGACATTGAGTTACGCTATAGACGGATCCAGGTGTTGCCACCGATCGGCAAGCAAAAGCGCTATCCGTCTCTCAACCTTACCATCCTGCATGCCCGTGAGTGTAGAGCACCTGAAGGGCGGGCCCCCTATCGATTGGAGGCTTATCACTGATCTGCCGGTAACAACGCCTGCCGAAGCGATTGAAAAGCTCGACTGGTACGCCCAACGCTGGAAGATCGAGCTTTTCCACAAGATACTAAAATCCGGCTGTCGGGCTGAGGATGCACGGCTCCGGACTGCCGATCGACTTGTCAATCTGATCGCAATATTCTGCATCCTGTCATGGCGCATCTTCTGGATGACCATGATCAATCGCGCCGCGCCTTCCGCATCACCCAGGCTCGCGCTGACCGATGACGAAATCGTCCTGATCGATCATCTTGTGGCTTCCAGAAAGAAGGTGCCCAGGCTCAGAACCATATCCGATTACCTGTACGAAATCGCGCGGCTCGGCGGCTATCTCGCTCGCGCCAATGACCCGTCTCCAGGTAACATTGTGATCTGGCGAGGATGGACCCGCCTCATGGACATCAAAATCGGTGCCAACGCCATCCGTAAGACTTGTGGGTAATTGGAAGGTCCTCGCCACGCTTACGTGGGTGTGGCCCCTCAATACGTATCAATGTTGGGCAAGTGCACGCGGCCTCACATGGGCCGTCGGCATCCCGTACAAGCAGAAAGTCTATCCCGCCGACGTGGCGATGATTTTTCCCGTCGCCGGGCGTGGTCGCCCCCGCAAGCGGCACATTCCCGACGCTACATCGGTATCGGCAAAGGCGATGTTGGAGGTAGCAAAGTGGCGCAAGGTCAGTTGGCGGCGCGGAACCAAGGGGCTGCTCTCGGCACGCTTCGCCGCCGTTCGCGTGCGAGTTGCCGATGGCCCGCCTCAACGCATCCGGAACAGTGGCGCGCAACATCTGCCCGGCGAGGAAGTGTGGCTGGTCGGCGCACATCGCTCAACCGGCGAGCGCAAATACTACCTCCCCAACCTGTCCTCCGACACGCCGATCAAGCCGCTTGCCGGAGCCATCAAGGCCCGTTGGGTCTGCGAACAGGGACATCAGCAACTCAAGGAAGAGCTCGGCCTCGATCACTTCGAAGGCCGATCATGGAAGGGGCTTCACCGACATGCGCTGATGTCGATGATTGCCTTGGCCTTCCTGCAATCCCGCCTCCTCAAGCAGGCCAAGGGGGAAAAAAGAATCGCCGGGTCGCCACCACAGCCGAGCCTGCCGACGATCAGGCAGGCCATCATTGAACGCCTCGCCAAACCTCCTGATCTGACGTGTCCACAGTGCGGACACTGCATCTCTCATCGCTCGCTCACAAATCTGCCAAAGTAGTGTTAGAACTGGCCACGCAATGTGATGCCATAGGTTCTTGGTTCCGCAAGGAAAGCCGAATAGGTCTGCTGCGATGCGACAAATGGCGTGTTGAACGCAACCTGCGAATAACCAACATTGAAGACATTTTGTGCCCAGCCTTCGATCGCCCATAGATCATCCGGACCACGGATACCGATACGCGCATTCACCAGCGTGAACCCGTCCTGTTCCTTGCCATACAGCAAATCCGAACCGGTGTTATAGTCGCTGGTCGTACGAGCGTTCACATAGAATAGACCGGTCAGCCCGCTATTTCCGATTGGCGGGGTATAGGACAATGACGCCGTAGCAGTGATTTCAGGCGCGTTGGACAGATTGTCGCCTGGCAGCAGGCGCAAAGCCGGATCGAGTGGAGAGCCAGTATCACGGCCAACCAGATTATTTTCATAGCTGGTATCGGTATAAATCAGGCCCATCGAGATATTCACGTCTCTGATCGGGTTGATCGACGCTTCCAGTTCGACACCCTGCGCGACAACGCCATATTTGACGTTATCCGGGGTACAATTGCCCGTTGCGCCGCTGGCATCGCGATCCGCACCACCGAGATCATCGCTACACGCGTTTACATTCTGAACCACATAAGCGGAGCCATTGAACGTATTCAGCTGGAAATTGTCAAACTGCTGACGGAATGCCGCCACGCTCAGCGAAAATTCACGGGTCGAATATTTTGCACCGATTTCAAATGCATCTACCGTTTCCTGGTCAAACTGGAGATTGGCGGTGCTGCCCGTCACTGCCGGATTGCCCAAAGTTGAACTGGTCAGCGCAGACCGGTCCAGGTTGAAACCACCAGCTTTATAACCGCGCGAGTAGCTACCATAAATCAGCAGGTCGTCGGTTGGCTTGTAGGACAAAATGGCTGTGCCGGTAAATTCGCTCTCGCTCCGGCTATCGGCCAGCGATACGCCATCCAGTTCAGAGGTTGAGTTGCCCTGACAGGACAGACCGATCAGACCGCCTGCCAATGCACCAAGTCCTCCGCTAAGTAAGGGAGTGAACAATGCCCGCTGGATCGGACACATCGTATTGTCGTTGGTGAAGGCTGCGTTAAAATCCTTGGTTTCATTGGTATAGCGCAGTCCCAAAGTCAGATCGAGCTTGTCGGTAACGTGGAAGATATTGTGGGTGAAAAAAGCAAAATTTTCGCTTTTCTGATTATATGTGTCGAGCACCGTTCCCTTGTCATTGATCTGAGCCAGATTGTCCAGACCGGCAAAAATCAACGGTGTCGCTGCACCGAATGCGCCAGCACCTCCATTTGCGGCTTGCAATGCAGCCCTGCCGCCAGCGCTTAGACAGCCTGTGCTGGTTGGAGATGCAAGGGCCGGGTTAACGGCATTGACGACACGGCATGGTGCGAAGGCACCATATTGCGAACCAAAGCGCAGATTATCTCGTACCTCAAGCTTTTCGTTGGCGTAAAAACCGCCGACCAACCAATCCAGCTTGTCGTCGAACGCCGAACCCTGCAGTCGCAGTTCTTGCGTAAAGGTTTTAAACTCACGAGCACCGGCATTTTCATTCGGCTCGCGATAAAGGATATCTACCTGTGTGTAATCGGTATCACTACCCTGAAAGTTGGAATATTCACGATATCCGGTGATCGACGTGAAGTTCATCTCACCCAGTTCGGCATTGAGTTCAACCGAAGCGCCAAAATCCTCGGTTTCACCGGCATATGAACGACCGGGTGTGACATAGATATCCCGGTTGAACGTACTTTGCGTAAGCGCGTTGCTGTTTTGACCGAGCGCCAACAGCACCGGTATGATCGGATTCGCCGTGTCGGTAAGAGCGGGCGCACCAGGCTGTGGAAGAGCGAAAGGATCAAGTCCCGGGCTTACTCTTGCCCGCTGCGCAAATTCAGGCTGGACGAACGTCGCTGCACAGCAGGATTCGTCTTTCTTTGAATAGTCACCAATCACCCGGACCGTCAGTCCATCCTTTGGCTCAAACAACAGCTGGCCACGCACAAGAAACCGGTCTTTGTTATTGACGTCGGTTCCGTTTACAACATCATTGTAAAAGCCGTCGCGCTTCAAATAAACGCCGTCGACGCGCGCAGCAATCGTATCGCTCAACGGGCCATTAATGCCCCCTTCCAGACGTATCTGGTTGTAATTGCCGTAGGATGCTGCGGCATGTCCTGAGAATGTAAATTCAGGGGCGGCAGTGGTAATGCTGATCATACCGGCAGAAGAGTTGCGGCCACCCAATGTTCCTTGCGGACCCCGCAGAACTTCGATCCGATCAATCGGGCCAAGTTCGCTGAGCGCATTACCACTACGCGAACGATAGACACCATCGATGAACACGGCCACGGAGCTTTCCAGTCCGGGGTTGTCACCCACGGTACCGATACCGCGAATACGGGCAGAGCCATTGGCTTCGTTTCCGGTCGATGACACAAGCAATGAAGGCGCAAGCTGGTTGAGCTCACGAATGTCGGTTGCCCCACTCTTCTGCAATTGTTCCGCGTTAATGGCCGAAACAGCAATAGGAACATCGGACAAAGCTTGCGACCGGCCCTGTGCGGTCACCACGATCACATTATTATCGACGGCACCCTCATCACTTCCGTCAACTTGGTCGGCGGCATCCGATTGCGTCTGTGCATAAACGGGGGCGGTGGTGATTGCTGCAAGTGCAAATGCGCACGCAGATAGATTCAACGCTAATTTCATAACATGCTCTCCTGAAAATTTATTTTTTATCCTCATTAATGTATAGCCTAGCCCGTCACATTTGATAGCCAAATCTCGCATCAATGCACCGTTTTCTGCAAACTGTGGCTTTATTGTCAGCCTATGCGGCACTTTTGGCGCAACAAATGAGCAGCCGATTTGCTTCCAACATCCAATGGCCCGCATCTTCTGGAAACAGAGAATAATTAATAAGAGCAGGATGACGTAGCGTCATAGCTGGCGCGGCCTGATCTCCCCGCATTTTGGATGGTTTCCAGGCAAATTTTTGGCCGTCCCAAACACTGCTTGGACCGAAACCGGGTCAATCACTATTTATCGGCCGTCGAGAGCGCAACGCCTGCGCCAGAGTCCCCTCATCGAGATAGTCCAGCTCACCGCCCACCGGTATCCCGTGCGACAGCTGACTGAGTCTCACGGGATAATTTTCCAATCGCTCGGCGATATAATGGGCCGTCGTCTGACCCTCCAAAGTGGCATTCATCGCCAGAACGACTTCGTCAATCCCGCCCTGCTCGACGCGCTTGATCAGCTTGTCGATCGCAATGTCTTCGGGCCGCACGCCGTCCAGTGCTGACAACCGCCCGCCAAGCACATGGAATTTTCCGGGAAACAGCCGCGGTGGGGCATAATGGGTAAAAGGAGTCCCACTATGTGACGACGCAAAGAGCCGACGATCCCGGCAGAGTTACTGGATCAACTATTGGCATGATCGGATGCAGCTTCTGCGCTGGATCAGGGAGGGTTGCTGGATTCGTTGAAGAAGGCGCTCGCCGAACGGGCGTTGAATGCCGAGATGGATCATCATCTGGGTGGCGATGAACAGGTAGGCAACAGCCGCAATGGCTATGGCCGCAAGCGCGTCATCACCGATAGCAGCAAAATCGAGATCGAGGTGCCGCGCGACCGCGAGGGCAGCTTTGATCCGCAGTTGATTGCTAAATACCAGCGCCGGTTCCCTGGTTTTGATGAGAAGATTATCTCAATGAATGCGCGCGGGATGAGCACACGAGAGATCACCGGGCATCTGCGCGCCCTGTACGGCATCGACGTATCGCCTGACCTGATCTCGACTGTCACCGACGGCGAGCATCGCGCCGTGCCGGGGGCACGGGAAGACGCGCAGCGCTGGAAGAAGTTACTGCCTGGCAGCAGCGGCCGCTTGATCCGGCCTATCCACTGGTGTTTTTCGACGCCATTCGCGTCAAGATCCGCGATGAAGGCATGGTTCGCAGCAAAGCTATTCATATCGCGCTTGGCGTCCGCGCTGATGGCCGCAAAGAGGTTCTCGGCCTGTGGATTGAACAAAATGAAGGTGCCAAATTCTGGTTGCGCGTTATGAACGAGCTTAAAAACCGGGGCACCGAGGATATCATGCTGGCAGTCGTTGATGGTCTCAAGGGCTTTCCCGATGCGATAACGGCGGTATTTCCCGAAGCCGTCGTGCAGTGAGAACGGCGGTGCAAAAGTCGGCCACGGTAGCGGCGGGATGAGCTTGCTGCGGGCGGTGTAAAAGTCGTCCACCTTTACCCTTTCTGGCGACAGGGAGGGCGGGAGGATTTTTACTGTGTATTTATATCGGAAGGTTCGGCTGGCCTGTGCTGAGGGCATGAGCCAGCGGGAGGCAGCGCGGCATTTCAACATCTCGCGCGACAGTGTAGCCAAGATGATGGCGTTCTCGGTTCCGCCGGGTTACCGGCGATCGGCACCGGTCAAGCGACCGAAGCTGGATGGCTTCACTGGGATTATCGACGGCTGGCTGGACGGCGATCTGGGGGTTCATCGCAAGCAGCGGCACACAGCGAAGCGGGTGTTCGACCGGCTCCGCGATGAACACGGGTTCACCGGCGGCTACACGATCGTGAAGGACTACATGCGGGAGCGCGAACGGCGCGGTCGCGAGATGTTCGTGCCGCTGGTCCATCGGCCCGGCCATGCCCAGGCCGATTTCGGCGAGGCGGTTGTCATCATTGGCGGTGTCGAGCAGAAGGCGCATTTCTTCGTCATGGACCTGCCGCACAGTGATGCCTGCTTTGTGCGGGCCTATCCGGCGGCAACGGCGGAAGCCTGGGTTGATGGCCACGTCCACGCCTTCGCGTTCTTCGGCAAGGTGCCGATATCGATCCTCTACGACAACGACCGCTGCCTGGTTGCGAAGATTCTTCCGAACGGAACGCGTAAGCGGGCCACGCTCTTCAGCGGGTTCCTGTCGCACTACCTGTTCCGCGATCGCTACGGCCGCCCCGGCAAGGGCAACGACAAGGGTAACGCGGAAGGGCTGGTCGGCTATTCCCGCCGCAACTTCATGGTACCGATCCCGCGGTTTGCGAGCTGGGAGGCGTTCAACGCCTATTTGGAGGAGCAGTGCCGCGAACGTCAGGCGGATGTCCTGCGGGGCCAGTCCGAGACGATCGGTCAACGCCTCGCGCGGGATCTGGCGGCGATGACCGGCCTTCCTGCGGCACCGTTCGATGCCTGTGACCAGGCAACCGGAAGGGTCAGTTCGCAGGCGCTGGTGCGCTACAAAACCAACGATTACTCGGTACCGGTCGCTTACGGCCACCGCGACGTCTGGATCAGGGGCTATGTCGACGAGGTGGTTATTGGCTGCAGCGGCGAAGTCATCACACGCCATCCCCGCTGCTACGACCGCGAGGACATGGTCTTCGATCCGCTGCATTACCTTCCGCTGATCGAGCGCAAGATCAACGCCCTGGACCAAGCAGCTCCCTTGGCTGAATGGAGCCTGCCGCCCGAGTTCGCAACACTGCGCCGCCTGATGGAGGCGCGGATGATCAAGGCCGGACGCCGCGAGTACGTGCAGGTTTTGCGCCTGCTCGAGACCTTCGACATCGACGATCTGCATGCCGCCGTGAAGAAGGCCCTGCAATTGGGGGCAATTGGCTTTGATGCCGTCAAGCACCTTGTTCTCTGCCATGTCGAGAAACGGCCGCCGAAGCTCGACCTCGATGTCTATCCCTACCTGCCGCGCGCCAATGTCGAGACCACATCGGCCGCCAGCTACATGGCCCTGATGACGGAGCCGGCGGAATGA
Protein sequences of DBSCAN-SWA_1 >NZ_CP022548|8013:64224|42674_43151_+|WP_100092241.1|DBSCAN-SWA MSDEKNLPPKMRASAQKAPPKPQPRPQDPVEQEFWNRCQDGNLYFQRCGECGTFRHLPRYMCAKCGSPDFSWEQSSGKGTLFSWTVTHQALHPAFAADIPFVSAVVELEEGVRMATRLIDCDRDKLELDLPVTLDFELIGADFRLPVFRPDTNLPHEG >NZ_CP022548|8013:64224|43158_44409_+|WP_100092242.1|DBSCAN-SWA MDLEYGPEYDAFCEEVRQFLKTYGNRAPAEQGRAARPCPEAVEWQKLLIEHGYTARTIPTEYGGYGAEPDILKSRIIAEEFARAGIPGGLANQGISMLVPTLLELGTEAQKRQWVEPTLKGEVVWCQGYSEPGAGSDLASLKTSARIENGEFVINGQKIWTSTAKQADMIFCLVRTEPDAPKHGGISYLIFSMDTPGIEVRPLKTMTGHAEFNETFFTDVRVPMDQIVGQRGQGWFVANATLGHERGMLGDPDALENRFQALVKLMQKERVGQGRAIDNAVLRDRLAALQAEVAAMKYNGMRILSDNLKGEPGGMAKLIVKLQSCELAHQISALAIDAMGEIGILYHGSNREREGGAWQWNYMFQLGLIIGGGTAQIQKNIIAERGLDMPREPKLAKPAPAEQGAAVAARQKQGAA >NZ_CP022548|8013:64224|12086_12284_+|WP_103143265.1|DBSCAN-SWA MKNEHVVADAIDVECAPAVQTTLGAIFVSMELSRSTWLITSLLPGDGERMSKCTLRAGDIGGLME >NZ_CP022548|8013:64224|30878_32450_-|WP_100092232.1|DBSCAN-SWA MTEAAPAQAGPTGAPTKPQNADVTAWVAVVAGALGAMLATLDISIVNSALPVIQGEIGATGTEGTWIATSFLVAEIVVIPLSAWLERLFGLRTLLIIAVTAFTGFSILCGIATDLTTMIIGRTGQGFMGGVLIPTAMTIVAKRLPPHQQPIGMALFGMTVVLGPVMGPLIGGWLTENLSWHYAFFVNVPVCATLLLLLFVGLPHEKPNWEYLTQADWAGILGMILGLGGLTVVLEEGHREEWFDSSLIRWLTLITVIGFASLIYGQLYASKPVLKLRLLLSRQFGSVAVMALALGMVMYGSIYVIPQFLAIISDYNAFQTGLVIFWMGVPAFLLMPVLPFMIRRVDIRIAVGAGMLLMALSCFVSTSLTAESGGAVFTESQLIRGVGMILAMMFLNQATVASVAKDEAGDASGIFNAARNLGGSFALAGLASFQDQRTWHHSRRMEETLNANSVSLQSYLDSLARTFGGMDEAMAVLDRTIQREAFVMTYNDVFFVMGLLTVITVPLVVFLKPLPKNVSLSMH >NZ_CP022548|8013:64224|18196_19465_-|WP_100092218.1|DBSCAN-SWA MITNEKQYRSSKILIDKMKLGLEAIGSGENAELHPMLIAAQREALESQIDEMEEDVNFYDALRSGQIGEFEAESLGELPDILIHARIARGMSQKDLADFLGFKEQQIQRYEAERYRSASLDRLVEISDALGVQIRERAALIGDGRMESVDPESWKSFPIAEMYKRGWFEDFSGTLAHARKSAADLVPAFLQGVQANNSAAAFHRKSVRSSGQVHEAAIAAWEARVRMLAERNPPSVVFDLNRVDETWLAGLVSLSLDENGPSRVHDYLRQIGISLVIERHLPGTLLDGAALSSWDDYVIIALTLRHDRLDNFWFTLFHEIGHLVLHIGTGKFAAIFDDTEAPANNDVEDEADLFAQEALLPGDKWKIAVSRFTRTEKAVFADAKRFGVGSAIIAGRVRREAGDYTLLRTLVGAGAVRSQFGL >NZ_CP022548|8013:64224|20008_20806_-|WP_100092220.1|DBSCAN-SWA MKLFTQAELEAIASALGDTVEGLTNPEIELLIRSAKMIDPGKMTKRHRIYNAFAESQNSKRNRTHVLEFIRLAMSPARYSREPARYEPMRAALNQALAFAGVLVAETGVLRTVSSATTLPEAQRRAQNLRADLEGRGVHPDVLAFCRAELLVDNYFHAVQEAVKSVADKVRSKTGLIDDGGTLIDRAFGGNPPLLAINARRTVSEKSEQSGFANLVKGAFGMFRNPTAHEARIRWEMSKADAEDLLTIVSLIHRRLDASNMPPRV >NZ_CP022548|8013:64224|9002_9935_-|WP_100092212.1|integrase|DBSCAN-SWA MNVTSQLDRYLGVRRSLGYDLRTDERVLRRFARFTDQEGAARIDTALFLRWDASLPDVSTSTRSARLGKVRLFAQWLSNIDPAHEVPPRGLLPGHTGRSRPYIYSEAEITSIIAAAGALPSIYGMRGLTFSTLFGLIAVTGLRISEALALDHDDLANGVLRVRRGKLGKERLLPLDPTVVTRLVAYAAERDRLLGSVPTSFFVNCKGARPTDCGTRYNFAQVCQHIGLRAHQRYGRHGRGPRIHDLRHSFAVRTMINWYRTGKDPAREMIRLTTYLGHTDPDNTFWYLEAVPELLDLAMARATSCGETDQ >NZ_CP022548|8013:64224|62736_64224_+|WP_123906195.1|transposase|DBSCAN-SWA MYLYRKVRLACAEGMSQREAARHFNISRDSVAKMMAFSVPPGYRRSAPVKRPKLDGFTGIIDGWLDGDLGVHRKQRHTAKRVFDRLRDEHGFTGGYTIVKDYMRERERRGREMFVPLVHRPGHAQADFGEAVVIIGGVEQKAHFFVMDLPHSDACFVRAYPAATAEAWVDGHVHAFAFFGKVPISILYDNDRCLVAKILPNGTRKRATLFSGFLSHYLFRDRYGRPGKGNDKGNAEGLVGYSRRNFMVPIPRFASWEAFNAYLEEQCRERQADVLRGQSETIGQRLARDLAAMTGLPAAPFDACDQATGRVSSQALVRYKTNDYSVPVAYGHRDVWIRGYVDEVVIGCSGEVITRHPRCYDREDMVFDPLHYLPLIERKINALDQAAPLAEWSLPPEFATLRRLMEARMIKAGRREYVQVLRLLETFDIDDLHAAVKKALQLGAIGFDAVKHLVLCHVEKRPPKLDLDVYPYLPRANVETTSAASYMALMTEPAE >NZ_CP022548|8013:64224|24936_25122_-|WP_100092225.1|DBSCAN-SWA MTKPDPATLKIVAALIRTRMDRLSGDQRLDGLERLGAYREFNQLAKDLDATAVEFTRDEKW >NZ_CP022548|8013:64224|39548_40691_+|WP_100092238.1|DBSCAN-SWA MALRSRICEMLGIEYPILLAGMGGASVPALAAAVSNAGGLGVLGAAACSPEQLRDWIRQTRELTDKPFGVDTLLPASVTRGSAPQSGGAPENPMELLGEYQQFTRDFMDREGLPQVDTEAAMRAAGAPEMGKGGPQLFSKEFFEAQMEVVIEEKVSVYAAGLGNPEPWMDRLHANGTKVMAVIGKVKHAEQVVGSGIDMIVAQGHDGGGHNSPIGTISLIPQVVDAVGDRVPVIGAGGISDGRGVAAAMMLGAEGAWIGTAFLATEEAGIEHFQKEAITESGEDDTVVSRSLTGKPARMIRNRWADAWVEAGKEPLPMPYQSMISGPVMASGIKAERKDVIPGFAGQGIGLIDAIRPASEVMQDLVTGAEEALSSAKSYS >NZ_CP022548|8013:64224|8013_9006_-|WP_100092211.1|integrase|DBSCAN-SWA MSAATLPALIQRFFTDRLCTQMEASPHTVASYRDTFRLLLRFAGARCGKPPVKLAVEDIDADLVADFLVHCETVRGNSARSRNIRLAAIRSFFRYVAMTDPTWLLHCQRVLAMPSKRYVKRTVTFLDTPEIAALLAAPDRATWAGRRDHALLLLAVQTGLRASELVNLKCGDLTLGTGAYIRCMGKGRKERCTPLRRDTAKLLAAWMSERCDDNSPLFPSIRGERLSRDALEHLVRKHCLTASRACPSIGTKRVTPHTLRHSTAMDLLHHGVDQAVIALWLGHESVATTQIYIHADMRMKEKALARVAAPQVPTGRYRPDDGLLAFLEGL >NZ_CP022548|8013:64224|12284_13274_+|WP_103143266.1|transposase|DBSCAN-SWA MSNLRQKALARTGQTFPFVVIQEAGLDGFWIHRVLEQEGIESHVVDAASIAASRRRRRVKTDRIDGEILLRSLLAFKRGDPRVCAMVRPPSPEDEDRRRNSRERKTLIAERVKLVNRIKGLLFAQGITGFEPLKRDRRVRFDELFTGDGRELPPHAKAEISRALDRIELILEQMKAIEAARDALSTEAAHSPEGVSGTTMLRRLKGIGPDFAEVLWAEGLYRHFDNRRQLASYAGLTPTPWQSGSVSNEQGVSKSGNPRLRTIMVQLSWFWLLHQPDSALSRWFHERVKLDGGRRRKPAIIALARKLLIALWKYVREGLVIEGAIMSQA >NZ_CP022548|8013:64224|47172_48777_+|WP_100092245.1|DBSCAN-SWA MSGKTTPDYDAVIVGAGFSGIYLLHKLREAGFNILLVDAAAQPGGIWYWNRYPGARVDSQVPLYEFSLPDIWQTWSWTERFPGWEELRTYFRHVCDTLDLWPHMRLGTRVESARFAEKASLWRLHLDGGDTVTARFLLPALGFASKPYVPDIPGLDTFAGEWCHTARWPQEGIDLAGRKICLIGTGASGVQVAQEAAKSADRLTLFQRTPILALPMRQETMTEEGQVCEKQGYPAIFEQRKQTSGGFEYQSLEKSALEVSDEERTAHFEHLWQRGGLKFWYHNFADMLTNREANRYAYDFWRDKVRERIVDPALAEKLAPAEPPHPFGTKRPSLEQNYYEIFAQENVALVDLKETPIVRISADRIETQDGAFDCDLIVFATGFDAGRGGLIDMNITGLGGLGLAQAWNDGLRAYLGMSVTGFPNMLFSYGPLSPSGFANGPTSAEIQGDWICDFLIWLRQGGIKLFDAEPQAETTWTEIVAQAGAMTLFPEADSWYMGANIPSKKRQLINFPSVVGYAAVCDDVAQNAYRGFAT >NZ_CP022548|8013:64224|41487_42678_+|WP_100092240.1|DBSCAN-SWA MNGPAALSFSDRSAITGVGETAFVKGTERTAVDMMLEASRRAIADAGLKPSDIDGMVPPPIYTTSEEMAANLGIDVLRYAATVHMGGASPTTALQNAAMAIASGLCDHVLITLGWNGYSALRPKPGAPPTRPMNMNTLTNTVKGYYSPYGVFLPVQMYAWLATRHSKLYGVGPDAMAAVALACRRHAQMNPQAFTYGRELDAETYHSARLISEPFRLYDCCLETDGACAVVVSRMDRAKDMPHVPVSIAGAAEGHPYPADDIPSRPDPFKIGLSYAAPRAFDMAGVRREDMDFLQIYDCFTYVVLLQLEALGYCEPGGQGEFVADGQIELGGRYPVNTHGGLLSEAHVWGLNHVVEAVRQLRHDCGERQVEGAQTGLVTGWGDLGDGSIAILRRFA >NZ_CP022548|8013:64224|58421_61151_-|WP_100092250.1|DBSCAN-SWA MKLALNLSACAFALAAITTAPVYAQTQSDAADQVDGSDEGAVDNNVIVVTAQGRSQALSDVPIAVSAINAEQLQKSGATDIRELNQLAPSLLVSSTGNEANGSARIRGIGTVGDNPGLESSVAVFIDGVYRSRSGNALSELGPIDRIEVLRGPQGTLGGRNSSAGMISITTAAPEFTFSGHAAASYGNYNQIRLEGGINGPLSDTIAARVDGVYLKRDGFYNDVVNGTDVNNKDRFLVRGQLLFEPKDGLTVRVIGDYSKKDESCCAATFVQPEFAQRARVSPGLDPFALPQPGAPALTDTANPIIPVLLALGQNSNALTQSTFNRDIYVTPGRSYAGETEDFGASVELNAELGEMNFTSITGYREYSNFQGSDTDYTQVDILYREPNENAGAREFKTFTQELRLQGSAFDDKLDWLVGGFYANEKLEVRDNLRFGSQYGAFAPCRVVNAVNPALASPTSTGCLSAGGRAALQAANGGAGAFGAATPLIFAGLDNLAQINDKGTVLDTYNQKSENFAFFTHNIFHVTDKLDLTLGLRYTNETKDFNAAFTNDNTMCPIQRALFTPLLSGGLGALAGGLIGLSCQGNSTSELDGVSLADSRSESEFTGTAILSYKPTDDLLIYGSYSRGYKAGGFNLDRSALTSSTLGNPAVTGSTANLQFDQETVDAFEIGAKYSTREFSLSVAAFRQQFDNFQLNTFNGSAYVVQNVNACSDDLGGADRDASGATGNCTPDNVKYGVVAQGVELEASINPIRDVNISMGLIYTDTSYENNLVGRDTGSPLDPALRLLPGDNLSNAPEITATASLSYTPPIGNSGLTGLFYVNARTTSDYNTGSDLLYGKEQDGFTLVNARIGIRGPDDLWAIEGWAQNVFNVGYSQVAFNTPFVASQQTYSAFLAEPRTYGITLRGQF >NZ_CP022548|8013:64224|21793_22474_-|WP_100092222.1|DBSCAN-SWA MNISVTAVNWHPCYRIIPSRFPPINLFEEVTDPDDLEAIYAVEALTNDRLREEAGDLSLVPLEDRVSGPGSSPIMAAFTHLNPDGDRFTDGTFGVFYVGGSIETAVAETRYHRVKFMLATEEPPQELDMRVYAVDLDANLHDIREMHDAAIGYYDPDNYSASQILAIELRKAGSDGIIYQSVRHDVGECAAVFRPKLLSNCRQERHLCYVWDGSEVVTIYEKQHFG >NZ_CP022548|8013:64224|52963_55915_+|WP_100092249.1|DBSCAN-SWA MRKSLLLAATAISTMSVPAMAQDRDDGTIDSNVIIVTAQRQAQSAQDVPIAVSAFSGAALEAQQIENSSDLQLTLPNITFTKTNFVSSSFTIRGIGDLCVGASCDQATAIHLNDSPLFATRLFETEFFDLERVEVLRGPQGTLFGRNATAGVVNVVTAKPQMGEFKASGDAEYGNYNAIKVKGMVNIPIGDNIAFRGAGVYVKRDGYTTNLNGGPDLDDRDMYSVRGSLRFEPTADTTIDLYASYFREDDNRMRIQKQYCQRDPTGILGCLNSSRNAESFNANSTIAATLSSREFLATQGIPTAFALGSLYGPDQYAGVSVPADPRTVNTAFTPEYFASELTFQGKIEHDFGPISAQLSGTYQKVKLDSRQDYNNNIGRRDIYATGLNTLAAAAAGAIPGLPAAYFAPLASAIIPDGPNGVLCTSDTDTTGLGVFGGNSICDATPLQFDRSNLDNSSWSVEAIISSDLDGPFNFLVGGIYADSHLTENSYYVNAFGLDYGAGLLGSFISLADGLPPSFLGTPYYRNHSDDLTVKSYGLFGEVYFDISDKLKLTGGLRYNNDKKKVRARTTLADFLVPYSQTTDAFESPFVGSLDADPGIPGNQLFQNREVKFNEITGRAVLDYKITDDNLIYASYSRGYKSGGINPPLQPIFAAPESFRPEQVDAFEIGSKNTFGDGALQLNLTAFYYKYKGLQLSKIVARTAVNENIDADIYGAEVEAVIRPDPDWMINMGFSYLHTKVKGDTFNSDPRDFGGGRSDAVIIQDITNASNCAVASTSGNAAGVNAFVNTINGAINAGLVPGLAPGAGLQPTTAFPADGGIASTGAFGICAVLDAAAQGAFAGAGVVPAAFGGIEYFSAGVPKNIRGNQLPQAPQLKFSTGVQYTMNFDNGMSLVPRVDLAYTGESFGSIFNGNVNRIKGYAQANAQIQLNGTDDRWYVRGFVQNIFDSNSETGLYVTDQSSGLYTNIFSLEPRRYGIGAGFKF >NZ_CP022548|8013:64224|28097_28985_+|WP_100092229.1|DBSCAN-SWA MYCETVSIEGAGGLTLTAEANGESGAMPVLLAHGGGQTRRAWKNVVGDLAEAGFHAIAIDMRGHGDSEWAADGAYETHDFASDLVAIVSRMERKLALVGASLGGMAGILAEGDLAPGSFASLTLVDIAPQMEVSGVNRIVGFMEEHIETGFASPEEASEAIARYMPLRRKRSSGEGLRNYLRHKADGRYYWHWDPAFILSSRTVSKRDEGRHQRQFEALSQATQNLTLPVHLIRGGSSDLVSEDAVEHLRELAPHAEYTDIAGATHMVVGDANDSFSHAILDFLKRHHSMRGRET >NZ_CP022548|8013:64224|40786_41491_+|WP_100092239.1|DBSCAN-SWA MRRLIADKEDTSQVFRIVQALSGNSYYRNFRRFAASPQGQTILTERADLLGTLSDRERLARCAPGTLGRAYLDFVYGEGLTAQGLAEASEASGMAEFSDPAVTLYRQRLRDSHDLFHVVTGYGRDALGELCVLSFGNAQFYNHGITFIVAVGIPKLLAEQWQLPVARAAFEAWRLGRKAADLTTFYWERYLDHPLEEVRRDLNLQPPAVYVGVRDLSQRLEREFQARRQGELQA >NZ_CP022548|8013:64224|50989_52477_-|WP_123906195.1|transposase|DBSCAN-SWA MYLYRKVRLACAEGMSQREAARHFNISRDSVAKMMAFSVPPGYRRSAPVKRPKLDGFTGIIDGWLDGDLGVHRKQRHTAKRVFDRLRDEHGFTGGYTIVKDYMRERERRGREMFVPLVHRPGHAQADFGEAVVIIGGVEQKAHFFVMDLPHSDACFVRAYPAATAEAWVDGHVHAFAFFGKVPISILYDNDRCLVAKILPNGTRKRATLFSGFLSHYLFRDRYGRPGKGNDKGNAEGLVGYSRRNFMVPIPRFASWEAFNAYLEEQCRERQADVLRGQSETIGQRLARDLAAMTGLPAAPFDACDQATGRVSSQALVRYKTNDYSVPVAYGHRDVWIRGYVDEVVIGCSGEVITRHPRCYDREDMVFDPLHYLPLIERKINALDQAAPLAEWSLPPEFATLRRLMEARMIKAGRREYVQVLRLLETFDIDDLHAAVKKALQLGAIGFDAVKHLVLCHVEKRPPKLDLDVYPYLPRANVETTSAASYMALMTEPAE >NZ_CP022548|8013:64224|44408_45500_+|WP_100092243.1|DBSCAN-SWA MDFGLSQDQQMLRDTVARCLADTCPLDHVRECAEGEASYSEKVQSALSELGITGMMVPEQHGGLGMTLLDAAIVAEQLAYAVAPVPFLASHVLAPIALAQAGNEQQQAEWLPQIASGKAQIAVAIAETIEARDGAGVTCNSGKLDGTALFALDFTGAEAFLVADTAGRMHFVPADAPGLEKIPLTTIDRTRSVGEMRFETVAAEPLANDGGATAARLRDAGRVILAADSLGAGQAMIEKAVDYAGQREQFGRIIGSFQAVKHMCAEMAARHEPCRSLIWYAAHAFDAASDEATLAACHAKSHTGEVYRFVARTSTEVHGGMGFTDLLGLHYWFKRIGFDRQVLGGPEAVRAEAAARQGWTRAA >NZ_CP022548|8013:64224|22924_24178_-|WP_100092224.1|integrase|DBSCAN-SWA MKSEVMLLTEKSIARMPLAQSGQYNVRDVELKGFFVRIGKRRKSYMVQGEYWRDGCREFSVQKKLGEFNEISARDARTNAKELLVKFSKGEKPGDVSRLRPGAITLRMAWERYRIAHMVRKGRAERTIENYQYYLERLLADWLDWPLSKLGRRPDLVIIRHEDISSNNGPYIANSTMKALRAIYNHALKGNRDLPSFNPVLAVDWNPEQRRNTGMGQDDLSDWFEQLYKLDKPLRREFHLLTLLTGSRPTALKAVRSEHIDFGRRILHIPKPKGGADRAFDIPMSRMMIRCIIRIMRIGRMLFPVQAEYWLFPAESESGHMIEHKEARSVLSKWGNDLRQTYRTLAQAAEISELDIHLLMNHSLRGVNAGYITRDKLLRVHLRKQQERISETIIASIDARTNAFAKQWLSSGKVDCL >NZ_CP022548|8013:64224|29413_30871_-|WP_100092231.1|DBSCAN-SWA MIRKSIPFLLIAATLAGCTAGPDYAGPPELAAAGGAGGFIRAGDKVSSTVPELAEWWILLDDPGLDRLIEAALADNPSLEAVGARIAQSRASLAQENAGRLPSLGTQATVVQGRLPGLDIQGGPPLPPGTPGMPPEEEDDTISFYNAGLNANWEIDFGGGTGRRIEAANAQLAAAVANAEDARVQLTADVANAYVNLREAQQRVVRYRLQSELQEQILVLTYQRYQQGTLPLFPVGNANAELELLKSQLAEAEADEAVLLDALAVLTGRAPGAPSLEVAPIGEIPLPPERVAVGDPASLIARRPDIRAAERTLAAATARVGVAEAAKFPKLSFMGILGLGGTSPEDIFDVGEFSAIAIPRLQWNFLDFGRVDASIDRAGAARREAAANYRQTVLAALQDAERALARFAQQRAALAALIQVKRHADKSADLNRQRFESGAISRIDLNRTLREQEKANTDLARGRAALTLAWIAVQKSLGLGWQAPS >NZ_CP022548|8013:64224|10929_11718_-|WP_100092213.1|DBSCAN-SWA MVHENSRVSDSGTGRTRQAKRASRAARKTRRSGSAAPKASLYDEVTAQIIAQLEEGIFPWVKPWNSGNAVTGLPRNAISGRQYSGINILILWGAVIDGDYPSQDWLTFRQALAAGGCVRKGEKGRTVFYANRFTTDEDRKQQGEGGAGDGGAPRSIPFLKRFTVFNAAQCDGLPERLTAEPAPLPERELHGQAEALIAATAADFRTGGTKAFYNVGADFVQVPPQPAFTHQIDYYRTALHELGHNAVTGIMPHRIEERRLSV >NZ_CP022548|8013:64224|20802_21762_-|WP_100092221.1|DBSCAN-SWA MIKGKRPRRLYKYRSFSNLTLTMLVEDVVYLADPTTFNDPLDTKPTLNVDIDNDALEGILSQLIEQRSTAEMSAAAKSIKYRGPKTINHIARQSRKKADQLLADIRYNATNPDYEIDDPTQHLLRYYLQEELLRRYNKGVFSLAERANCPLMWSHYGDQHRGLCLGYSIPDASANDVYKIQYGGSRLVEASSVFAMLDGDEAARQKVDESVLLKKAKPWGYEREWRLIGMRGEHSSPLELEEVIFGTRCTPSVKFAVVKALEDRDRAVQFFEIHEQHGSFLLKKRLLNTDELIVSLPRRSRQYDASFDGLTEVEGLKSE >NZ_CP022548|8013:64224|36706_36880_-|WP_164088927.1|DBSCAN-SWA MMATQLKENTHADDDAFRTEVHEFLSEKFPQELKSKVNLASRPENSDKRNASSYKMA >NZ_CP022548|8013:64224|22470_22863_-|WP_100092223.1|DBSCAN-SWA MSNVSTAKAKSRKDLTGPALRTFFRVADAWKLSEQEQMNILGLDSRSTLQSWKKGGVAAISKDALERISYILGIYKGLKVLLPKSADEWVRKPNEAPLFGGEPALNRLVSGNVADLFVVRQYIDAQRGYP >NZ_CP022548|8013:64224|14153_14795_+|WP_100092214.1|DBSCAN-SWA MLSNNMVDLLPAIDIAVPIELSIFFERIAEIGNSSGNFKVEHFKGKEAGPDLEVVNFRSEKDSQHSGLGFQLIARKDIPQRVLLEVRARRWNPNPPTRAAYIDAARLLAGPLLKTYNQTNSARIRLRIEQAGQGRFSPSKRTAALLEHFTALANTSSLHPLDWKRFYTLIKESRQKIPENELRSILVRRGFPSETAEYLANIYKHLWKFRRFK >NZ_CP022548|8013:64224|37307_38570_-|WP_100092236.1|DBSCAN-SWA MNTQIMEMDTADKIAAIPIEEIDVARPSLFQNDTIGLFFDRLRKEDPVHYCRDSYVGPYWSITKFDDIMAVDTNHKVFSSEAKLGGIAIQDMHSVEGALDLEMFIAMDPPKHDQQRKAVTPSVAPSNLQQLEPIIRQRAAKILDDLPVGEEIDWVDKVSIELTTMTLATLFDFPWEERRKLTRWSDITTAAPETGIVESYEARREELIECAMYFKGLWEQRINEEPKNDLISIMAHSPATRDMPFLEFLGNLLLLIVGGNDTTRNSISGGVLALNENPDQYRRLSDDPSLIGSMVPEIIRWQTPLTHMRRTALQDWEIGGKQIKKGDKVVMWYLSGNRDGSVIDRADEFIIDRKNPRHHLSFGYGIHRCMGSRLAELQLRIIWEEIHKRFSRVEVTGEPERLFSNLVRGITNLPVRLHAR >NZ_CP022548|8013:64224|14858_16343_+|WP_123906193.1|DBSCAN-SWA MTEITISIPDFGSMTDDEVSSHPLAWGRDGKWSRQTRWLLSHFGTHRLELNEAWAMTSPVWQCPCCQRYKPEIAKLTDQGVLHCQLDHHHDHLGDLAGEILRAATCHDIPDELVRVRKRACNAALLLIERFAETLICNDCNAADAAMKAALGHRVDRHFSFSPLEIANFILPKPNSSHEPDNARGLEIWTHAQAQFSERLKFAEQLAKRITAGLHDRERLNYTGLDSGYDDAKLLFKLAADQGGPRNHPRGIGDALLARSRSTAGRFSNPKKLSAAPFRIPTAEEFQLLDQTKANSSPWRNSGANWSCAACSRSKFEITRLSKKGKWMALIMELNDFEEETNPISLQRRSFHYDLPLIFGQCKRITVCQDCRQIVTDAKTLVPSVIEDCMPIDAIRKLAQDPKPHQGHVIDRPEIIRVVEANTQWAKAAEDFWIHRDHANDIAFHQLRLVRNTGLSDSAARRQLIPELVAAKKLPGFESDEWFDWLIAESKRSF >NZ_CP022548|8013:64224|9931_10870_-|WP_100095342.1|integrase|DBSCAN-SWA MTESINEIQRTRAALLMDFEDYLVRQRGLSPRTIYHTLRFANRFLDHRFGSRMIDLTRLRAADTTNFVQHILARRTPYRDKTVTTHLRTFFQYLFACGATSANLALSIPKTAQRWNARLPRHLSPGAVEAILASVHSNPRHGARDYAMLLLMARLGLRASEIIKVQLDDIDWRAGELLVRGKGGLHDRLPITAEVGEALSRYLREERGPTTCRTMFVTHRAPHRGFKDGQIANAILKDALAATGQKPVTPYVGSHLLRHSLATRLINAGASLDEVGDMLRHRSRSSTMIYARLDIEGLRSIAQPWPVAGGGQ >NZ_CP022548|8013:64224|45562_47176_+|WP_100092244.1|DBSCAN-SWA MIDWNLGDILDAIEPAMPKDAPALIHGDRIITWPEMSVRSNNLARNLRERGAVDGAKVAFYMRKRPEYGELMAACFKGRLTHVNINYRYVPEEVFYIFDDSDSEVIVYSSEFRDYILELKDRLEKVHTFVEIGDASEIAPFAVPYEHLTTQGDGSPLGIERSPGDLLFIYTGGTTGMPKGVMWRHDDMRKAQLDAQKLLGPVPQSHEENVALIKSQGPGRRTLPSCPLMHGTGFITGIGALMSGGAIVTLSDPSFDAEELWETVEKHKVESIAIVGDAFAKPMLRALDEHPGRWDTSSLVSIISSGVMWSKEVKAGLCKHISQVILMDSFGASEGLGYGLSVTTAQGGTNTAKFGIGEFCDVFDENDQTVEPGSGVPGFIARKGAIPVGYYKDPEKSAKTFKTIDGVRYSIPGDWCHVETDGSLTLLGRGSVCINTAGEKVYPEEVEEVLKTHPAIADALVVGVPDEKWGQAVTAVVHLDKQAEFDEQAVKDHVRQQLAGYKTPKAIHPTDTPLRASNGKADYAAAKKIAEGSRAAA >NZ_CP022548|8013:64224|16547_17144_+|WP_123906194.1|DBSCAN-SWA MSMPKLKDVDLRIAAHERLFARANQCPDTLVIDELGLSHGACRVDIAVINGHIRAYEIKAEADNLLRLPRQVEAYSEVVDAASLIVTEHHLDAAVKLVPEWWGVILAERRKTGDVAFRRMRKEYINRGALPLTLVKLLWRTEVADLLRQYEIPEKELRAPRAILYEHLVSILPRRTLGRTVRETLKSRKTWRDRARPL >NZ_CP022548|8013:64224|36950_37277_-|WP_100092235.1|DBSCAN-SWA MVKVTFVASDGERREVEIDKGETAREAALFNDVPGIDGDCGGVCACATCHVHVDPEWIDKVGRLQEDGMEAELLQFAEGTTEYSRLACQIPMKPMLDGLVLHVPEQQY >NZ_CP022548|8013:64224|50228_50993_-|WP_100092247.1|DBSCAN-SWA MTDAPKILLADHLKTLKLPTFLREYEKLARQCAAEGLDHVQFLGRLVELELIDRERRMIERRIKAAKFPATKSLDSFDFKAIPKLNKMQVLELARCEWIDRRENVIALGPSGTGKTHVALGLGLAACQKGLSVSFTTAATLVNEMMEARDERRLLRLQKQLAAVKLLIIDELGFVPLSKTGAELLFELISQRYERGSTMITSNLPFDEWTETFGTERLTGALLDRLTHHVNILEMNGDSYRLGQSRARKAKAAT >NZ_CP022548|8013:64224|26925_27378_-|WP_100092227.1|DBSCAN-SWA MSANAQDGFLNETDEFAFLLEEVPRVLRKSFDESIAQFGLSRTQWRTLAYLIKTEGMTQTELAVCLELERATIGLTIDHLEKLEFVERRSAEDDRRVWRIFLRPKAIDIIPELRKEANAVYQKMFKGISKADLTIIRTAMEKMVGNLRVC >NZ_CP022548|8013:64224|38752_39493_+|WP_100092237.1|DBSCAN-SWA MQPQVETVKKTKRALSLAQNIVRDIEAGAHSPGDRLPHEDEMLARYEVARATLREALRFLELQGVIHLQLGRGGGPVVARPQTGDFANSLSLILHFMEADLRGLLELREAIAPDVAAYAALRATTGDLSALADCFKELERNEADNNFEELNRRFHDLLGWASGNPLFGLLTSTMHLLTREFSNSLGYSAQERAVQLRFLRSVLESVRTGDQAAARQAMSRLVSGSASYLAERSPELVSQRVKWGQI >NZ_CP022548|8013:64224|17100_18186_-|WP_164088925.1|DBSCAN-SWA MPINQHCYMPVLKWRQGEYQALLRLEDAQKDRIVPLIEVTPPDFDFETQTPSKTIDEHVVSFATRIQKKWGSRHALLDCGLLPPATRMADGRHPLAHLFDECTTLNASLIPVTGLDRDDAYQHAVHEIHTWSGHGAALRCSLEDAIDPDFDTNVYLLCETIGIGPNELDIVLDLESPNFDPQDDLIALISAALSGAAIFNSARSLTIVGASFPDSMGSVTGPLQLWPRREWLLYKALLGALGPNIRRPGFGDYAISAPTIAQGDMRLLKPSATIRYTVDDGWLIAKGNNVRDNGFGQYRTCSGHITGSAYYLGAAFSPGSNYVAGCQVGTENTGNLTTWRWVGTNHHITKVVHDLATFYGI >NZ_CP022548|8013:64224|28981_29404_+|WP_100092230.1|DBSCAN-SWA MSEAPARIDRARIEEILRIAPFHAWLDLKVRSLTPQRLELEMPWRDEIVSNPIIGSAHGGVLASLIDLTGFYALIAQGTKVKATADLRVDYHRPATSGPLVATGLIVKVGQQISVAETSVTGPNEKLLASGRGAYICGDL >NZ_CP022548|8013:64224|32446_33574_-|WP_100092233.1|DBSCAN-SWA MKDISEAVEGDNEPAAGSRSNRKLRLFILAAVIVAVLAGAWWYYRYVTYGQYMQSTDNAYVAADSVVVSSKVAGYVDAVLVSENQQVARGEALVQLDLRDYRAQAAQARAQIAATVAGADTIRSQVSEQNAAIRQARAQLAVARATLDLANKQVNRYRPLAASGAEPREKLDQYVTQAQQARAELASAQAAVAAATGRRATLFEQIEQTRAQADAARAQLDVADLNVSSTRLEASRAGRIGDLSVRVGQFVQPGQRLMTLVPVSQIYVTANFKETQVGLVRPGQPVRLEVDALPDGKIAGTVDSISPGTGAEFSILPPENATGNFTKIVQRIPVRISIDAAPEVRRLLVPGMSVVATIDTRGAVGELEEISSAAE >NZ_CP022548|8013:64224|48806_49790_+|WP_100092246.1|DBSCAN-SWA MNEMTPVNNLPTDIDSLWAGDGSHLPEWFIAALNVPREEAYVEISGAKVHYLRWGSPDKPKLLMTHGFLAHARCFAFIAPFFAEEYDVVAFDLAGMGDSEMRGAADTAARGREFTEVAEALNMFADGQKPTIIAHSFGSGAALTAVTQSPDAFAGVVVCDLMIMRPELLEKYWKNDRSSPGSGNPDKPNKRYPSYEAARARYILSPPQPAEEPFLLDYIAYHSLKHKRGSWTWKFSPEVFRRSNRPEEWLNMGKRLVQAPGRKAIVHGGNSLLFSQDSADYIREMGGNDIPIIAIPEARHHLMLDQPLAFVAALRSVLSFWKSSHER >NZ_CP022548|8013:64224|27494_27920_-|WP_100092228.1|transposase|DBSCAN-SWA MPYSCGSHTAFHHHYHIVRTPKYRFKVLHGEVRLRVREIIRQVCSELGVKIIHGVLSRDHVHMFVEIPPHIAVSDFVRRAKGRSSRKIQQEFEHIRKRYWGQRFWQRGYFSTTSGNITDDVILNYLDKHTNPNKAGFSPNP >NZ_CP022548|8013:64224|19461_19929_-|WP_100092219.1|DBSCAN-SWA MIPDLIDLGSPSPYPVLPPGIHDVTMQEVESRFATTPHRRWLFDGFQRAADALALAGCATVYIDGSFTTDKAHPDDFDGCWDHKGVDFGQLDPVLLTFVNKRAAQKAKYLGEMFPAGADSGLGTTFLDFFQVEKYSGLTKGILRVSLAPQSGMTP |
43 | Bacillus_phage(16.67%) | transposase,integrase | attL 19735:19750|attR 66948:66963 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1449445 : 1474624
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP022548|1449445:1474624|DBSCAN-SWA TTCAACGGATGAGGTTGGCGCCAATTTTGAGGTCCATGAGGCGGGACCACCCTCGCCAGATTACGATGTTGCCCGGGGGCGGATCATGAGCTCGAGCGAGATAACCGCCGAGCCGTGCAATCTGGGTTACGTAATCGGCGAGGGTTCTGCCCATAAGCGCTTTCTTTCGGGAAGCCACGAGGCCTTCAATCAGGGCGATTTCAACGTCGGTGAGCACGAGCCTGGGTGATGCGGCAGGCGCGGCGCGGTTGATCATGGTCACCCAGAACAGTCGCCAGGACAGGATGCAGAATATCGCAATCAGATTGGCAAGGCGTTCGGCGGTTCGGAGCCGAGCATCCTCGGCCCGACAACCGGATTTGAGTATCTTGTGGAAAAGCTCGATCTTCCAACGCTGAGCATACCAGTCGAGCTTTTCAATCGCTTCGGCTGGCGTTGTAACCGGCAGATCAGTGATAAGCCGCCAGTCGATAGGGGGCCGACCTTCAGGTGCTCCACATTCACGGGCATGCAGGACGGTCAGGCTCAGGGACGGATAGCGTTTTTGCTTGCGGATCGGTGGCAACACCTGGATGCGCCGATAGCGTAGCTCGATGTCGGCATGATCGTCCTTGTTGATGGTGACCCGGTGCATTCCCTGGACAGCGACCTCGCTCATTTCGTCGGCGATGGTGTGACCACCGTCACCGGCCAGACGATCGACACAGGTGCGGACCAGAAAATGCGTGCCGGCCGACTGCGCCTCGCAGAAAAATTCGTAAATGTCGTTCTCGCGGTCGCCGATATGGACCAGCCGCCCCGGCTTGCCCAACAGGGCCGTCGACTGACGCATATTCTCGAGCCAGCGGTAGCTCTCCTTCTCTTCAATCGGCACCCGGGTGGGATTGATCCGGCGCTTCAGTGCGTTGGTGCCCTTGAACCTGTTGTGGGTCCAGAATTTTGCCGCCGTCAGCCCAAGCGGCAATCCTTCGGGCGTGACCGCAAGGCTGGAATGCATGAGCAGCCCGCAGACCGTATGAAGCCGCAGCCGACCATTTTCATCGCGTCGGCTGGGAGCCAGGCCAATAGCGCCAATCCGTTTGGGATCGCGCCGTTGATAGGAAAATTCAGTGGTGTCCTGTAAAATCAGGATAAACCCGTCAAGCGCAGCAACGCGCGCCTGGGTCGCCTGGAAATGACCAGCGAGGATATTGCCTTCGTTCACCGCGTTATTCGACAAAAAGCGGTAGGCCGCCTTGGTATTCGCCCAGTCCTGACAGGCCATCGGGATCGGCGCGCCAATCGCTGCTGCCATCTGCGCAAGCATCGTATGCAGCCGCTCGCCCAGTCGCTTGTCGCGAAATGCGACGATATCAACTTCACCATCAGCCCATTTGGCTGCATCAAAACCGGAATCTTTCACTGCGCACCCCGCCAACTATTTCAGGTGCCAGAGAATGAATCATGCCCGATTCCAGGCGAGCAACGATCATGCGATGAACGGAATCACGCCAACGACGACTTGTGGGTAATTGAAAGGTCCTCGCCACGCTTACTCCATATCAACGTCGCCTTTATAGCTTAGCAGCTGGCAAAATTCCTGCTGGTCAGAGCGGTGAGTTTTTCCCGTCAGGGGTTTGATTGGCTTAGATTGACCGATTGAGGAGCGTTAATCGGGATATGGATCACCTGAGTCGCAGCGGCATCTGCAAAAGGTTGCGTTGATAACGTGATGAGACAATTTGACTAAGCGCGGCGCGCTCTCCAGAATTGTCCCGATAGCTTTGCGCGAACGCGAATTTTTAGGCGCGCATAGTCATCGGGCAAGACATAAGCAGCAGGCACGACCAACAGAGTCAAAAAGGTCGAGGTAGTCAAGCCGCCTATAATGATAAAGCCCATAGCATTGCGCCATTCGGCTCCATCGGACTGAGCAAAAGCAACCGGCATCATGCCGAAAATTGCAGCCAGGGCCGTCATCAACACCGGCCGGAGCCGCTCGGGTGCCGCCTTTTGCATTGCATCACCTGCGCTCATACCGCCTGTCCGATATTCATTTGCAAGATCAACAAGAAGAATCCCGTTTTTCATCACGATCCCCATCAGGGCAATCATTCCGATTTGGGCGAACATACTCAATTCCTGATTGGAAAGCCAAAGTGCAATAAACGCCCCGGAAAAAGAAAGGGGAGCCGTAAGCATGATCAGGACAGGTTGTCCGAAGCTGTTAAATTGGCTGGCGAGGACAATGTATAGGGCCACAAAGGCAAGCGCGAAGGCCAGCAATATGGCAGAGGTTGTGTCTGCCAGTGTCCGCGCCATACCCTCCATTTTGGTGGACATTCCTTCAGGTGGGCGATGCTTGGCCAAAATCTTATCAACCTGAGCTACAGCGACGCTCAATGCTACGCCGGGCGGCGTGTTTGCCTTGACTGTGATTTGCCTCGCTCGGTCCACGCGATCAATCTGAGCTGGGCTGTCGGCAACTTCAATATCTGCCACCGCCGCTAGGTCGACCAAACTCCCGTTACCTGCCCGAACCGGAAGTCTGGCGATGTCATTTAACTTTTGCCGTTTATCTTCATCCGCTCGCACTCTAACGTCATAGCGTCTGCCGTCGGCTTCAAATGTTCCCGCATCACTCCCGCCAATTAAAGTTCGCGATGCTGCTGCGACGTCCCGTGCCGTCACCCCTAAATCTGCGGCCCGATTGCGGTCTAACCTGATCTGCAATTCCGGCCTGCCGCCTTCGTAGGTAGATGCAACATCCACCAATTCCGGGATATTGCGCATTTCCTCAGCGATGATGTTCGCGTAATCGTTGATGCTCGTCAGGTCAGACCCGGAAATTATAAGCTCTATCTGAGCCGATCCCACGCCGGCCCCGGACACCCATGGAACTTCCACGGCGGAAATATCCTTGGCCTCGGGTAGCGCCTTTGCAAGAATTTCGCGTGCCTGGTCCATGATGGCACCCTGTTTGACTGATCGACCCTGCTTGGGTGTGAGATCAACATAAAAGTCCAGCAAATTGGTTTTTTGATTTGTTCCTGCCCCTATGGTTACGAAAACATATTCTACATCTTCAATATTGCGTAGCGCTTCGTCGGCTCGTAGCGACGCCTCTTTAGCTGTCGCAATCCCAGTTCCCAGCGGAAGTTCAACACTTGCAAGAAACTCTGATCTGTCCGTGAGGGGCATGAACGTGGCGGGCACCATTGCTGCGAAAAGACCGCCGATTAAAAGACTTATGATCGCTCCCAAAAAAACGAGATATCGTCTTTTGATAGACCAACCTACAAGCTTTTCGTAACGCTTCCGCATGGTCTCGTGGAAGGTTTCAATTTTACCCAGCCAGCCTGCTTTTTCTTTCTCCGATTTCAGGAACCGCGATGCAAGCATCGGCGACAGGGTAAGCGCCACCAGCAATGAGACGCTGACCGAAAATACAATCGCCAAGCCATATTGAAAGAAGAAGCGGCCAACGATCCCTTCCATGAAGGCAATCGGCACAAACACTGCCAATGTGGCAAATGTCCCAGCAAGCACCGCCAATGCGACCCGCTTGGTTGCTTTTGGTGCCGCCTCCATTGGATCAACGCCATCATCGACATCGCTTTGCACAGCTTCAACAACGACGATAGCGTCATCAACAAGCAATCCAATTGCCACCGTCAACGCCAATAATGTCATGAAATTAAGTGTAAAATCGAACGCAGCAAAGGCAGCGAACGTTGCAATAACGGAAGTCGGAATGGCCAGCATAACAATAAAAGTTGCCCGCCAGCTGAGCAGAAAAAAGAATGTGACAAAAACAACCAATACCACTGCGATCATCAGATCAAACAGCACGTCACCAATGGCCGATTCGATGAAACGCGCTGTTTCTCTGGTGACAACAATTTCAACCCCGCTCGGCAGCTGGTCCTTGATCTGCTCGATTTCCGCTTTGATTGCTCTGGTCATTGCAACGGTATTGCTTCCAGATTGCTTGCGAACCTCAAGCACAACCCCGGGGCGACCATTTAATTGGGCATAGCTGGTTTCATCCTCGACCGAGTCTTCCACGCGACCGATATCGCGGACCCGTGTGACCTGTCCGTTGGGCCGATAAGCTACCGCGAGATTACCAAATTCCGAGGCTGTATTGGCTTCCGCCAATGTTCTGGTTCCATATTGCTTGCTGCGACCATCAATCTCAAGACGGCCGCCCGGCAACTCGGCATTTTCGTTTCTCAAGGCGGCAACGACATTATCCGTGCTAATAGCGCTTGCTCGCATTTTTGCCGGATCAAGCCAGACCCGCATTTCTCTTTCCCGCCCACCTACAATTGAGATAGAACCTACACCGCGAATACGCTGAAGCCTCTCTTTAACGACCTCATCTGCAAAGACCGTGAGTTCTCGGACCGGCGCATCTCCTGCTATCAAGAGCGACAGTATCGGCGCCGCGTCCGGATCAAGCTTTTCGACAATAGGCGCGTCCGATTCTTCCGGCAAATTGCTTAATATGCTGGCCACCCGGTCCCTGACATCCTGAGCCTTCACATCGGCCTCTTCCGACAACTCAAACTCCAACACCACCTGACTGACACCTTCGGCGCTGACTGAGCGCATTTGCCGAATGCCCGAAACGCTGTTGACCTCTTCCTCGATAACGTCGGTTATTTCAGTCTCAATCGTATCTGGAGACGCACCAGGCAAAGTCGTTGTTACCGAAACATAAGGAAATTCGATTTTGGGGAACAGATCAACACCTAATCTCCCAAAGCTCAACAGGCCCAGAACGACTAGCGAAGCAATCACCATCGTCGCAAAAACGGGACGCCGGATCGAGACATCAGAAATCCACATCAGCTTACCGTGCAGGCTTTCTTTGCTTGCGCTCTACTTTTTGCCCATCGCGCAAAGTGGATGGTGGGTTTTGGATGATTTTTTCGCCGGCCCGGACGCCTTGTGTGACCTTGACCCGTTCGAAATCTATGCTTTCGACCTTTACATTCCTTCGGCGTGCGATGCCTTTCTCAAACAGGAATATATATGGAGCCGCGCTATCACCTTTAACAGCCGATCTTGGAAGAATGATTGCAGATTCTGGCGGCGTCTCGATTAGCGCCCGCACACCAAGTCCGCTACTGATCCGATAGTTCGGGTTTCTTATCGGCAATCGCAATTCGACTGTTCGGGATTCCGGGTCAACCCGGTCATTGATGATGAACACAAAGCTGTCGAAAGGCTCATCAAAGCCATCAATATAAACTTTTGCCTTCTGGTTGAGCTTGAAATCATCAACCCGGGCCTGCGGAGCACTGACGATTGCTGCCACTATTTCTAATTCCTGTAGTTGCAAGGCTGCAGATTGTTCACCCATCGAAAAGCGATTGTTGAGATAGGACCCCTCGTCGACCAATCTGGCTGTAACGACGCCGTTATATGGAGCGCGGGCAACAGTATCGCGCAGCGCTTGTTTGGCCGTGGCCAGTAGCGATTGCGATTGCGATACCTGCGCTTTCGCGACGACCAGGTCTGTTTCAACCCTTTCGACCTGTGCTTTCGATACAAAGCCTCGCGGGGCAAGCGCAATTACGCGGTCATATTGGCGCTGGGCCTGAATAGCCTGTGCTTCCGTCAGGCGTAGCGCCGCTGATGCTTCCGCAACCCGTCTTTTATAATCTGCTTGTCTGATCTGGAACAGCGCTTGGCCCTTCTTCACCCGGTCACCGACCGTCACAAACATGGTCTCCAGCGGTCCTTCGACGAGAGCGCCAATTGCACTGCTTTGCTTGGCCGCAATCGTGCCAAAAACTTCTACGGGTTCGGCCAAGGGTTCCAGTTTCGTCGTTAAAATGTCCACTTCGAGTGGTATGTTCTTTCGAATGCCGTCTACATCGACATTTTCCGTATCCTCGATTGCACAGCCCGAAACCAAGCCCAGCATGACTAAAGCCAAGATGATTATGATTATGGAATGGCCCGCCTCAAACGAAGCTGTCATTATGGGCTGAGGTACAAAATGAGGAGATTGTCATGAACAACACTAACATTGCTGTTTTGGGCGTGGATCTAGGCAAGAACATTTGCAGCCTCGCCGGTTTCGATATCAGCGGACAGTTAATATTCCGCAAGCGCATGCGGCCCGATACGATTCCAAAATTTACAGAGCGACTCTCAACATGCATCGTTGCAATGGAGGCCTGTTGCGGTGCTCATCATCTGGGCCGTTTAATTGCCCGGCAAGGACACGAGATCCGACTGATGTCGCCGGAATACGTGCGGCCATATGTTAAAGCTCAGAAGAACGATGAACGCGATGCCGAAGCTATTGCTGAAGCTGCTTCGCGCCCGACAATGCGCTTTGTTGAACTTAAGAGCGCCGAACAGCTCGACATGCAGACGCTGCACCGTTCGCGTTCACGGCTTGTGGGTACACGCACGATGCTAATAAATCAGATGCGAGCCATTCTGTTAGAGCGGGGGCTCAGATTTCCGCAAGGCAGGCGCAAGCTTGAGCTGGCGATTGATGCGATGCTGGGCGAACCGGCTACCGGCATCAGTCAAAGAATATTCGAACTAATAGCGGATATGCGAGATGAATGGAGAGAACTCGATGGCCGTATTGCCATTCTGGACCGTGAGTTTGCTCAATGTGTCCGGTCAGATCCGGATACTCAGCGCCTGACTTCTGTTCCGGGTATCGGACCGTTGACAGCAACAGCACTGGTTGCAGCGATTGGCAATGCCAGTTCTTTTACGCGTGCCCGTGATCTTAGCGCATGGCTTGGCCTCGTTCCAAAACAACGAACCACCGGCGGCAAGCCTAAGCTTCTGGGCATAAGCAGGCGGGGTAACATCTATTTGCGAACACTGTTTATCCACGGTGCGCGAGCTGCTTTACCGTGGCTGGCGAAAAGCGAGACCCAATTGGGCTTTTGGTTGCGCGGGCTGCTTGCCCGGTCTCACCGCAACACTGCAGTTGTAGCGCTTGCCAACAAACTGGTGCGTATCGCGTGGGCAACCTTGCGTCACCAAGAAGCATATCGGAATGCGCCCGCCAAATCTGTGGCATAAATATCAAGGAGTTACAATCTATCGAGAGATACGCTTTTGCGAGTGGAAAGTGAGATGGCCGAACGGTTGATCGGCGGACCGGACACCTGGTTAAAAAAATGGCTCTTCGAAGCCGACATGCTTATGAGGACCGGGCCGCGCGAATATCCATCTTGGCGACGATCAGATGATCGATATGCCGCATACGTTTACGCAGACCGTGCGATTGCTATCAAAAAATCTACTATTGCAAAGAGGCGGGCCATACGTTTTTTGACGACCATTTACGTGCCCAAATAGGTCCCGCGACCGCTAGCCACCAAGCGATCATCATCGTCGATAATGCGCGTGTCTGCGGTACTTATGGTTTTTCCAAGCTTCACGACCTGCCCAATTGCACGAAGCGGGCCGTTGGTGGCTGCGCGATGATAATCTACCCTCAGATCCACGGTCGCCCTGGCGACACCTCCTGCTGCAATAATCGCATAAAGCCCGGTTAAATCGATCAGAGATGCCAGTATGCCGCCGTGCGCCGCTCCAATCAACGGATTAGATACAAGTTCGTCCCGCCACGGCATTTCCAGTTCCAAACAATCGCTGTCTTGGCGAACGATTTTCAGACCTAACCAGCGATGGAAAGGCGCAATCTGTATCGCGTCAGCTAGATTTTCCGGATTGTGTTTCATGCTGCAGCACCGCGCTTGTGGGTCGCATTCAAGAAAGAGAGGATCGCCGAGCAAAATGCATCGTTGCGGTCACCGGCGACCATATGGCCGGCATCTGCAATGTCTGTGAAATGCGCGCTGGGAACCAGCGTTTGGAAATGAGCTACGGCCTCTTCAGAGACCATGTCGCTGGAACCTCCGCGTATGAGATGAACCGGCAGTGTCAATTTCGCGGCCGCCGCACTCAATCTGTCGAAGCCATTGTCTTTTGCCTCGGTAGCGTCGTCACGAGCATGCGTCACATGGTGAATAAAATTGGGGTCCCAATGCCAATAATATCGCCCGTCCTCCCGCTTGCGCAAATATCGATCAAGCTTGCCTGTGCCGGTGCGTTTCTTTCGTTGTGGCATATATTCGGCGATAATTTCTGCAGCTTCTTCAGGGGAGGAAAATCCGTCTGTCATATGGGCCTGCATGAACCCCACCACTCTTGACACTCCGACGGCTTCCATCTTTGGCGCAATATCAACCAAGGTGAGTGACTTAAAGCTTGCGGGGGCCAATTCCCCGGCTGCAATCATACCCGCTATTCCGCCCAGTGATGCTCCGACCATCGCTGGGGGGTTATCGAGTTGATTGGAAATAGCGATCAGATCGCGAGCAAAATCGCGCATATCGTAGGCGCCATCTGGTGCCCATTCACTGTCCCCATGACCACGCATATCGATAGCCGTCGCGCGGTAGCCAGCATTGGCGAGTTCATCCAGCACCTTTGCCCAGGCATGACGGGTTTGCCCGCCGCCATGCGCAAGCATTACCGGAAAGCCGGTTTTTGGCCCAGTCGTGCTTGCCTTAATCTTCAAGCCATTGTGGCCCGTAAAGGTGCTCTCTTCATGATCCATACCGATTTTATAGACTGTATATGGACAGACAGTAAAGGGCTTTATATTATATTAAATTTACGATATGGTCACTATGATGACAACCGAAACAGCTACAATGAGTGCCAACGCGCAAGACGGGTTCCTCAATGAGACCGATGAGTTTGCGTTTTTGCTGGAGGAAGTGCCCCGAGTTCTGCGCAAGGCATTTGATGAATCGATTGCTCAGTTCGGCCTGTCCCGCACACAGTGGAGAACATTGGCCTATCTGATCAAGACCGAGGGCATGACACAAACCGAGTTGGCAGCCTGCCTTGAACTTGAACGGGCTACTATCGGATTGACGATTGATCATCTGGAAAAACTGGATTTTGTCGAACGTCGGGCGGCAGAAAGTGACCGGCGTGTCTGGCGGATATTCCTGCGTCCAAAGGCAATCGATATTATTCCGGAACTCCGGAAAGAAGCTGACGCCGTTTATAAGAAAATGTTCAAAGGCATATCCAGCGCAAACATAGCCATCATACGAACCGCGCTGGAAAAAATGGTTGGGAATCTAAGGGTGTTGAACCCATCAGATCCCGAAATTTAACTGGTTCTGATCCCTCTCTCCTGTTGGATCAGTATGGGCAAGTGGCGGGGGCCGCTTGCCCTTTTTTTACTGTCCAGGTCAATTAATTTCATAATTATCGGACGGTGAAATATACCGTTGATACCTGCCGGGGCACCAATCAAATATGCGCTAGCGCACAAAAGTTGCTATGGCTGTGGCGAGCGGTTTCTCTGGTTTATGGAAACAGGTGGCGCGGACAAAGCTCGCGGCCTTGCCTTCACGTTCAAAACAGCTGTTCGTGAATAGATCGCGCGAACCAATGGCTGGACGCACATAACGAATGTCGATCGACTCTAGCTGTTGGCCGATCGAACTTTCACCACCTGCAACACGGCCTTGGCATGCAGCCATCAGCGCCGCAGCAACCGCGCCCCCATGGACAATATTTCCGCTCTCCCAGCCCAAAACAGCCCGGTTTCCTGCTACAATTTGTGGCGAGTCTGGAGCGCCCGATAATCCCAAACTGGTGGTCATCGGTCCGGGCATTGCTTGAGCATCGAAATCACCGACCAAATTATAGTCGGCCGGAGCGCCCCCCGGAAAGCTCCCCAAACGAAACAGTCCTGTTGCTGTTCCGATGATATCTTTGGACGAGTTTGTTACGCTACCTGTTGCAGTGGCCGATGTTTTTCCAACCATCACCGCTTCGGCCGTCAGCGTCAACTCATCCCCCATTGAAGATCCTGATGAAAGTCCGATCCGCAAATCAAAAGTTGAAATTCCGATTTCTGTACCAATCAAACCTTGCAACGATTCGGACAATCCTTGGTCGATAAAACCCATGAGCGCCAGCGGATCAAGGTTTGGATTTCCCGGTGCTTTGGCAATCCGCCTATTGGCTTTCATTCGACTGATCGAACGCCCGCTGCTGCTTTCCAGCAATTCCATTCCGAGCGTCAATTGGTAAGGGGTTACAAGTCGAACCCGCATTTTCTCACCCGTCTGTTCATCTGGCTTAGATTGCGCAGTCATCCAATTATCTTCCAATAAGGTATGGTAAACAAGACATACTCAAAATTCCTTAGCCAGGCATAAGCTGCAAACAAAGGACTCCGTTCGCCGGAGAGGCGAAAAATTATGTCCGCAAAAACCCGAATGGTGCAAGGTAATCATGATTTTGAAGGATAAGGTCCAATGATTGCATTTACAATCCTGACCAACATCCCCTTTGTCATTTGGTCATGTTAATAGGAGGTGGCCGCGGTGCCGGTTCGGCTTGCCGAAAGTTTCGCAGCGTTGCTCATGCCACCCATCAGAATTTCACCGTTGCTTCGATGCCATATGTGCGCGGTGTTCCCCGGTTAAGATAATCGAGTCCGAAGATGTCTATGTTCAACCCGTAGCTATAATATACCTTGTTGGTCAGGTTCTTGGCCCACAGGCTCGCTGAAAAATTATCCTTCTGAAATGTCAACCGGCTATTGACGAGCCAGTAGCCAGGGTTTCCGCATGTGATCGCCGGACCTGCCGGACGAACACTACCAGGTGCACCTGGGCGATCACACGGCGTTTGGCCGTAGTCGCCAAACGGATCGAAGAAATATTTCCCCGTATACGCAGCATCGCCGCGCAGACTTAGTTTGCCTGCATCAGTGTCAAAAACGTCCCAATCGAACCCTGCAGAAAACGTTTCGCGCGGAGCGTTGGGGAACGGATTTCCATTGACATTGCGGGTCGGGCTTCGAGGATCGGTAGGATCGACAACATTTCCTTGATATTTACTGCGCAACAAGCTGACCGATGCGTCAAAGCGCAGATTATCGGTAGCTATGATTGCGAGTTCCGCCTCACCGCCATAGAGTTTGCCATCGGCGCTGCGAACAAACGTCGTCGCTCCAATCACCTGAGTGACCTGATGGTTGCTGTAGTCGTAGTAAAAACCGGCTAGATTAAGCTGGACGCGCCGATCAAACAGGAGCGTTTTCAGACCGCCTTCATATGCGTTTATCTGCTCCGGTTCAACATAATAGACCTGATCGGTTCCCTGATACGCTAACCCATTAAAACTGCCGCTGCGATATCCTCGGCTGAAATTGACATATCCCAACACATCGTCGGTGAAATCATAACTGACATTGATCCGCCCCGTGACTCGGCTTTCTTTTTCGCGTTGCTCAAGCGGAGGTTGGTTTGGGTTGAAGGGGAAGGTATAGGGAATCGTACTTGCTCGCTCGGTTCCCCCCAAATCAACCAGCACTGTTCGCCCGTTGAGGTAGTTCACTTTGTCTTTGGTGTAACGCAAGCCGACACTGACCGTCAGTTCGTTGGTCACATCATAACTGGCATCAGCATAAATGGCAGCGGAAGGACGTTCGACGGTGAAATTCTGTTCACCGGTTATTGGTCCAAATGGGGGTGCGCCAGCTGCCCGACACGCAGCAGAAAGGGCTCCACCGAAGCCGCCGCCAGCCGAATTATCAATCTGGATATCGGTTTGGAGGGCAACAAGGGATCTCGCATCAAGGAACCCGTTGGGATTCCCGGTTGCAACCGGTGCACAGCTCGATGCAAGAGTGGGGTCAAGGGCTGGATTGACCAAAAATGCCGGGACAATCGCAGCAGCCTGAGGTGCTGAAACAGGTGCGTTAAAGAACGAGTCGCCAATGCCGAAACTGCGCAACACTGGATCCAGAACACCAAAGAAATCGATCCCGTTCCGCGTTTTGATTTTATCTACACCATAATAAAGTCCAGCGATCAGATTGAAACGGTCGTCCTTGTAAGCAAAGCGCAGATCCTGATTGAAATTTTTGCTCGTTGAATTGAAACGGATCGCGCAAACATCATTCGGTGAACCGTCACAGTCAAACGGAGAAATGCTGTAATTCCCAGTATCATAACCGGTAATCGATGTAACCGAGAGGCTGTCACTAATCTCGTAGGCGACGTTTAGCGCCAGACCCTTCGACAGAGTGTAGTAGTTTCCGCCCGTATCGGCAGATACCTCGTCGCGGTTTAGCGCGCGCCCCCCGTTCTGGGCGGGATCGAAACGTGAATAACCCAAAACATCACGGCCCTCTGCATATTGACCGAACGCATACGGGTTGGGTGCAACCGGATTATCCCGGGAGATATATCCCTTCAAGTTGATATCCAGTGTATCAGTTGGCTTGAAACGGACCGACAGGCGGCCAGCAATCGAATTTGTCGTGCCGGTATCGCGGTTCTGAACGGGATTGAATTGCCAGCCATCGCCCTTGGCAAAGGTTCCCGCAAAGCGGATACCAAGCACATCAGGAACCAGAGTTGTTTCTATCGCACCTTGGATTGTCTTTGTGTCGTAATTACCGTAGCCAACCGAAAGCGTTCCAGTTGTATCAACAAGTTCGGGTTTGTTTGAAAAAAAGCTGATGGCACCGCCGGTTGTGTTCCGCCCATAAAGCGTTCCTTGCGGCCCACGCAACACCTCAACGCGTTCCAGATCGTAAAGCTGCTGACCGTGGCTGGCACGGAAGCTCTGGCAAATATATACTTTACTTAGCAGGTCAATCTTGATATAAGGCTGTATCACAAGGAAAACAGCCAATGACGAACCTAACAGACCCCATTTTTCACAACGAAAAAGCAGCCGAGAAACATATCGAAGTATCGCGCTGGAACGGAGAGCCTTTTTGCGCTCACTGTGGTTCAACCAACGTCACCCGTATGAAGGGCAAAACGCAGCGCGGAATGTTCCAGTGCAACGATTGCCGCGACAAGTTCACAGTCCGCACCGGCACCGTCATGGAACGCTCCCATGTTCCCCTTCATAAATGGCTTCTGGCCATGCACCTGCTGGCATCCAGCAAGAAAGGCATTTCAGCCAGCCAGATCGCCCGCAACATCGGTGTGACCTATAAAACCGCCTGGTTCCTTTGCCACCGCATCCGCGAAGCAATGGACGGTGCCAACGGTAATGGCCCTCTAGGCGGTCCTAACCGCGTTGTGGAAGCCGATGAAACCTTTGTAGGCGGCAAGGCCAAGAACCGTGCGCACCGCAAGCCTCGCGACAAGCAGCCCGTTGTCGCACTGGTTGACCGTGAAGGCCATGTTCGCAGCTTCCATGTTGCCAACGTCAATGCAAAAGACCTTCGCAACCTTATCGTTACCAATGTTCACCGCGACAGCCACCTTATGACTGACGAAGCAACCGTATATACCCGCGTAGGACGCGAATTTGCCGGTCACAGTGTAGTCAATCACTCCGCTAAGAAATATGTCACCACGGGCGGCTTCAAGCACTCCAACACCGCTGAAAACTTCTTTTCTATCTTCAAACGCGGCGTGATCGGTGTCTACCACCATATGAGCGAAGCGCACCTTGGCCGTTATACCAAGGAATTCGATTTCCGTTACAACACCCGCGACATTACAGACGGTGAGCGCGCCGCCGTTGCCCTTAAGGGCATCGAAGGCAAGCGTCTGACCTATCGGCGGACTGACAAACTCGCCGCCTAAATGGACGGTTGAAATTCGCCAGATCGGCGAGAAGCGTAAACGCTATTTTGATTGAGACTCTTTACCCTTTGTCTTGCTAGACTTGTGGGGCTTTGGCGGAGTCGAAAGTAGCTTTTGCAACGTGGCCTCGCGCCGCGCCTCTGTCTCCGCTTCGGAGTATGTTTCATCTTTTTCTTGCCGAGTATTCAAAATGACCTCCATATCGCCCCTAACCAGCCCTGTATATGAGATACCCCTCTCAGACGAACAACTTATAATTTTAGGAAGGGTTGCTGTTGTTTGGGGGCATATTGTGTTCAAACTTGACGAACTTCTCCGCGATCTAATGAGCCTAAACACGGTGGATGATTTAGCGACCTACTCAACAAAAAATTTAAAAGCCAAGTTAGCAGATTTATGGCGAGAAATCTCGAAGCCTGAAAATGCAGGAAACAGAAGCCAGTTAATTGAAATACATCACTCTATCAGCGCCCTTTCGTCTGACAGAAACATCTGCTTTCATGGCTTGTGGGGCTACACATGGTGTCCCGCCTCAGAATCTTGGAAGGAAATCAGCCAGGGCTATGCCCGAGAAGAACCTTTCTTTTCTGATTCCTTAACTGAATTACACAACAAAATGATATTATCATCTAAGCTTCTGGCTGATGCGAAATGGCGCAAAGAGGTAGGTAACGATCCTCCAGAGACGAGAAATCGACGACAAATTTGGGGGCACCGCCCGCCGATTGAATCCGACCCCCATCCGCCTGAAAGAAAGATTCGATAATCTTAATTACAGCCTCGCCCACTGAATACGCTGGCGTATTAAATTTATAAAACGCGCCAATACCCGCCTCAATCATAGCGTCCGTAATCTCAATTTTAGCGCCGGCCTGTCCAGACATCTATGAATCCGATTCTCTGCTAAGTAAAGTATATAATTGCCGAAGCTCTGGTAAACTTCGTCAACATACACACCGACAGGCGAAGCGGTTGAAGCATTAAATTCATTAGCCACTGAAACGCCGCGCAGTGCGAAGTTCGGTTGGGTGCGGCCATAAGGCGTGGTGATCTGAAGACTTGGTACAAATCCTTGCAAATCTGATGTTTCATCTACGCCGCGAGATTCCAGTTCCTTTGCCGTCAATGCGGACACAGCGACGGGAACATCTTGAAGCGCTTGTGACCGCTTTTGCGCAGTCACAACAATTACCTCAAAACCACTGGTATCTTGTTCTTTGGTTTGGGCCAAGGCAGTTCCCGATAGAGCTATTTGCAATACAAATAACGCGCAACCTCCAGCCAAATATGTTTTGTTCATGTTCTTCCTCCCTATTTTTCGGACTTTTTTTATTTTTGGTTTTCCAGAATAAACTCTGCAGCGCGCTCGGCGATCATCGCAACCGGGGCATTGGTGTTGCCGGATATTAGATTGGGCATAACGGAGGCATCGGCGATCCAGATATTTTGAATGCCGCGAACCTTTAATTTTTCGTCACACACAGCGGTTTCATCATTCCCCATCCGGCATGTCCCAACCGGGTGATAGACTGTGTCTGCGCGCGCCTTTATCGCATCAATGAGTTCTGAATCTCCGCTCATGCCCTTTGTAAAAAGCTCAGTGCCGCGAATGGACTTAAGCGCGTCTGATTCCATAATATCGCGCACCAGCTTGAAGCCGGACAACAGGGTTTTAATATCACCATCTGTCGCGAGAAAATTTGGATCGATGACAGGAGCGGCAAAAGGGTCGGCCGAACCGAGTGTCACGGAACCCCTGCTCTCGGGTCGCAGCACGCAAACATGGCAGCTCATCCCTCCGCCCAAATGGATCTTGCGAGAATGATCATCAATAATGCCGTGGACGAAGTGCATCTGGACATCTGGTCGATTGCCACGACCTTTGGTGGATAGAAAAGCACCGCCTTCCGCCGCGTTGCTGGTAAGCGGACCACGACCATTACGGATATAGTCGGGAATCGACGCACCAATACGCGCGACACCGCCAGGGCTGACAGCAAATAAATCAGGACTTTTGGCGCGATATACCGTGACATGATCAGGATGGTCCTGAAGATTCTGGCCAATTTCCGGTATGTTAAACCGGACGGGTATAGAATGGCGTCGCAGTTCGGACTCCGGCCCAACGCCCGATACCATCAGAAGGTGCGGCGTGCCGAACGCACCGCTCGTCAGCACAACGCCGGCATCAGCGCCAAGTGTCATAGACCGTTTGTTGCAAAGCACCTCCAGGCCCGTCGCTCGTTCACCATCCAATACAATGCGAAGCGCCTTTGTTCGTGTCATCACCGTAAGGTTCGGTCGATCAAGGATTGGTCGCAGATACCCGCGCGCTACACTCCAACGCCGGCCATCCTTTTGGTTTACCTGATAACGCCCAACGCCTTCTTGTTCGGCTCCGTTAAAATCGTTTGTTGTTTTGTGTTGTTGTTCCGCTGCAGCCTCGATGAAGAGATCGGTAACGGGGTTGGGTGAACGGTGATCTGCTACGTTAAGCGGTCCCGACACGCCGTGATATTGATCTTCACCGCGTTCATTGCATTCCGACATCTTGAAATAAGGCAGCACATCATCATAGGACCAGCCCGGCGCACTGTCCGCCGCCCAAGCGTCATAATCCTCGGCCTGGCCGCGGATATAGCACATCGCATTGATCGAGCTTGATCCACCAAGCACACGGCCACGCGGCTGATAACCTTTTCGGCCATTGAGTCCCGGCTGCGGAACAGTTTGATAGCCGTAATTATTGATTTTGGTCGGCAACATGGCAGCCATGCCGAGCGGCACGTTGACCAGCATCTGATCGTCATCTCCGCCAGCTTCAATCAAGCAGACCTGTATCGCCGGGTTCGCTGAAAGACGGTTAGCCACTACGCATCCGCCTGAACCTGCCCCAACGATTATATAGTCGAAATTATTTTTCCGGCTGGCCCCGGTCATTGAACGGAAATCAATGCTGCGCGAGAGGCAAGCGAACGATCAGTCCGTCAAGCTCTTCCGTAATCGATATTTGGCAGGCCAGACGGCTATATTCATCGGCTCCTTCCACAAATTCGAGCATATCGCTCTCACTGCCGTCCGAAGCGTGACCGGTTTTTTCAATCCACGCCGCGTCGATATGAACATGGCAGGTGCCGCAAGCTGCACATCCGCCGCAATCGCCATCGATGCCCGGGACTCCGTTGTCTCGGGCTGCCTCCATGACTGATGAATTGGCATTTACATTGACAAGATGGTCGTCACCTTGAGCGGTAATAAACTTTACTAATGGCATTGGTATTCCTTCACTCTTATCGCGCGTGAACGCGGACCATCATTTTGGTGTAACCTCGGACAAAATTCGACTGAACACGTTCGGGCTCACCGATCACTTCAATGCGGTCGAAGCGATTGAGTATTTCCTCCCACAAAATTCGCAGTTGCATTTCGGCAAGCCGATTACCAACGCATCGGTGGATGCCGAACCCGAAGGATAAATGCTGGCGAGGGTTCGGGCGGTCGATGATAAATGAGTCAGGCCGATCAATGACTTCATCATCACGATTTCCCGAAAAATACCACATGACGACTTTGTCACCTTTGCGGATGGTTTTACCGCCAACAACCGAATCTTCCTTTGCTGTACGCCGCATGTGCGCCAAGGGTGTCTGCCAGCGGATAATTTCCGGCACCATGCTTTCAATTAGCGCTGGAGAGGCCCTCAATTTGGCCATCTGGTCCGGGTTCTGATGCAGCGCAAGAACCCCACCCGTCATCGAATTTCGCGTTGTGTCATTGCCGCCCACAATCAGCAGCAGCACGTTGCCGAGATATTCTTCCGGTGTCATGTTTCGGGTCGCCGGGGAATGGGCCAACATTGAAATAAGGTCATTGCCTGGATCCTCGTTTACCCGTTCATTCCAAAGCGTCTGGAAATATTGCGCACATTCCATTAATTCATCTTTGCGCTGCGCCCATGAATCAACGACGCCCCCGCCAGGTGCAGCTGTCGTCACATCCGACCAGCGGGTAAGCTTGCGGCGATCTTCAAAAGGAAAATCAAAGAGAGTGGCCAGCATCTGGGTGGTAATTTCGATCGAGACCCGATCAACCCAGTCAAATTCTTCTTCTATTGGCAGGCTATCCAATACGGCCCCGACACGCTCGCGAATGGTGCTTTCCAGAAGAGCAAGGTTACCGGGAGCTACGATGGGGCTGACGGCTTTTCGCTGTTCGTCATGGCGCGGTCGGTCCATCGCAATGAAATTAGGAAGGTCAAGCCCCTCTTCACCACCACCTTTTTGAATATTGTCATCGATGATGATGCCGCCGAAGCCTGCCGACGACGAAAAGATGTCATGGTGGGTATCAACATGCATGATGTCTTTATATTTGGTGATCGACCAATAGGGCCCGTAAGCACTCTCAGCGCAATAGTGGACAGGATCCTCTTCCCGCAGACGTTTAAAATACGTGCCAGCCGAATCGTCCTTGAAAAGTGCTGGTGAGCTTACGTCGATTTTATCAAGTGGTATCGCATACGCAGCTTCAGTGCCGGTTCGTGTCATGCTGGCCATCCTTTCCTGGTGATATATGATTTTTATTGAGTATAGTATCATCTACAAATGAGTATAATGTCAACAAGCAAAATCTGTTGCGATTTTTTATGATGACCGGCACATAATAATCCCAGATTTAAGCGGTGATGTCGGCCTTCAATCAGGAGCCTAATATATTGAAATACAAACACGATTTGGGAAAAAGGCGCCTGAAGCCGGACGAGAGGCGCAAACAGTTGACACTCTGCGCGCTTAAAGCCTTTGCGGAAAATGGTGTCGCTCGTGCCACTCATTCTCACGTCGCGAAGCTGGCCGGAGTTGCGATACCGACCGTGCATTCCTATTTTAGATCACGTGAGGATCTTGAGTCAGCGGTGCTCTGCGAAGTTGAAAACTACCTGATCAAACTCGTCACTGATTCGCTAAGTGGAAAAAAATCCGTGGAAGAAGCGCTTACCACGCTGGCAATGCGGTTTGCCAACGACGCGACCACGAAATCTGATATCATTAAGGTTTGGTTAGACTGGAGCACCGGAGTTCGCGCGGGCGTATGGCCCCGATATCTTGCGTTGTTGGACAAGTTACACGCGATAACTCGGCCGGTCTTTCTGCGTGGCAAACGTGAAGGTATACTGAGCGAAAATCTCAATGTGAAAGCAGCAACCCGATTGTTCATAGGCGGCGGACATACTCTCGCGCTGATGCAGTTTGCCAAAGTCCCCAGCCGGGATCTAGCGATTTACATGGATCAATTTATACAGAGCTTGATGAACATCGGAAAGCCTTGATCAACCGACAAAACCAGCGATACGCAAGATTGAGATGAACTTCCTCGGAGAGTCTTCGTTCCGAACGGATGTGGTTCTTCAGTGCGACCATTTATTGTGTCGGCTCTAAGCGACCTTCCTTTTGCTCATTGCTTCGCTCGGTAGATTCGGAAACTGCATCAACTTGATATGGCAGGCGCTTTGGGCCGGGCTGATCAGAAAGCCCTGGCCCAGGTTGCAGCCGAGTTGTATCAGGAGATTGCGTTGCTCTTCCGTCTCGACGCCTTCCGCTATGACCTCGATGCCAAGATTGCTGGCCATGTTGATCAGCGCCTGGACGATCAGCCTCGACTGGTGCTCGGCGGTGATCGACTTGATGAAGCTCTTGTCTATTTTGACCGCGTCTATGGGGAAATCCCTGAGATGCGACAGCGATGAAAAGCCTGTTCCGAAATCGTCCAGCGCGATGCGCATTCCCGCTGCGCTCAGTTCGGAGAGAACATTCTGCACTGTGGTTGTATCATTGACCATCAGCATGGTTTCGGTAACCTCGAGCGTGATTTTCGCGGGACTGATTTTGACCTCATTCAACCTCGACAATAAATTGTCCGCAAAATCGCGGCTCAACAGGTCCGCTTCGGTTGCATTGATGCTGACAAATTGCAGATCTGGCTGGGACTGCTCGATTTCTGCCAGTTCCCTGTACACGCTGGACAACATGCGCTCGCCAATCTCGCGAGACAGGATCGGATCCAGGATTGCCGGCAGAACCTGCGATGCGGTAATCTGCTCGCCGTTCGAGACGCTCAATCGCATCAGCGCTTCCAGTCCGACCAACCGGCTTGTCCTGAGATCTATAATGGGCTGGTAGCCGGCGAAGATGCGGTCACGATCCAGAGCTTCCCGGACTTTCGCGATCGCCGCTTGCCGGTGATGGTTCGCTTGCTCCAGTTCGGGTACATATGCGTGTACGCGTCCGGGTTCGCGCTCTTTTCCGTGATAAAGCGCCAGATCAGCCTTGCGAATAATCTCTCTGGCGATGGATCGCTCGTCGATCTCGACCAGACCGCACGTCGCGCTGACTCCCATCTTGCGTCCTGAAATTTCGAACTGACGTTCCACGCTGACAAGTATTTCATCACCGAGCTGCTTCGCCGCCTGCAAGTCAATGTCCGGCGCAGTCAGGATCACAAATTCATCGCCACCCCATCTTGCGACAATCGCCCCGTGGGGAAGCGCGTCCACTATGCGGGAGCTGATCTTCTCCAGAAAAACATCGCCGATCAAATGTCCGAACGTATCGTTGATGTCCTTGAAGCCATCCAGATCAAGCATCAGCAAGAAGGCGTTCGAACCGGTCTTCTTGAGCTTCCCAAGCGTGTTCTGCAGAAAGCGATCCAGGGCATTGCGATTGTAGAGATCGGTCAGGGCATCGCGGTCCGCGGCGCGCTGCAGGGCGAGGCGCGTGTGATGGGATTCCGTTATATCCTGGATGATCCCCACCAGGCGCGCAGGACTTTGTTGGTCTGCCTCCAGATATTCGCCCATGGCCTTCACCCGCTTGAACGTACCGTTCGCCGCGTTCAAGCCTGTTTCAAATTCAAAGGAGTCCTTATGCTTTTTTCCATTTTCGATCGCCTCTGTTGCTCCTTGCAGATCTTCCTGAGCGCAGTAGAGAACTGCTTCTTCAATAGAAGTTGGTACATTTTCCGGAGAACCAAAAATCGCAAATGCTTCGTCCGACCAGGTCAGTTCACGAGTCTCCTCGTTCCACTCCCATGAACCAATTTTTGCAATTTTCTCCGCCTGCTTGAATATCCCATTGGCTCGGTTGAGCCGGCTGCTTCGCGCATCCAGCTGGGAGTTCAGATAGGAGGTGACGGAAGCCTGAGAGTGCGCTCTTATCAGATCTTCAATGGTCGCTGCGAGGATTTCCAGCTTGAGGATGTGGTCATCGTCGAAACGGCCTGCGCGCTGATCCGCAATGCACAGGGTACCGATCATTGTTCCGTCGGGACTATTGATGGGTGCGCCGAGATAGGAACGGATCGCCGGTGCATGACGGATGAGCGGATTATCGGAGAATCGCTGGTCTGCCAACGCATCGGGCACGAGCATACTGCATCCAGCCTCAATTGCATGGTTACAGAAGCAATGTTCGCGCGGCGTTTCGCGGACATCCACACCGACCGAGGACAGGAACCACTGGCGGTGCTCATCAACCAGCGAAATGAGAGAGATGGAGCAATCGAAAATATCTGCACATAGTTCGGTAATACGATCAAAGGAAGCCTGACGGGGAGTGTCCAGGATATGAAGCTGGTGCAACGATTCTACCCGCTGGAAATCTTCCCGATTGATCGGCTTCATGTCTGCGGCCCTCTTCATAAACCAATGCGACTTTTAAGCGCGAATATCGCTTTCGAATTTGCCGAGCGCCATTTCTCTGTGGCCAAACAATGAAACGCTGGCCACCGTTCTATTTGCATCGTTCAAGGGAAAAACGCTCTAGGAAGAATTAATTAACTTTCCATTAAATTACCGTCATGTATCGGCTCGACAATCCATCCGTGTCGTCATCGCGCAACATAATCGTCGAGCACCGTCGACAGATAGCGCAGGCGGCGCACGATGCGGGGCATCATGCCAAAGCCGAGACAGCCCCAGCCAATGATTCTGAATTAGGTAAAATCGGTCTGCCGCATCTCGTTCGCCCGCGTAGTCTGGGTATGGAAGTGATTGGCAGCCTTGATCACGACATAGGCAGGGCTGGTAATTAGATCATGGGCTTTCAGTAAGCGGTATACCGTCGCCTCCGATACGAAGTAGCGCTTCTCATCGGTGAAGCGCACGGCCAGCTCCCGCGGGCTCAGCTCGGACTGCTCCAGCGCCAGTTCGATGATCTGTTCATGGATGTTTGCTGGGATCCGGTTCCACACCCGGCTCGGGACAGATGGCCGGTCAGCCAGCGCTTCAGGCCCACTTTCCAGCTACTGGTCCAGATCAACCCCTTGCGGGCCAGGCGTTTTGCCCAGGCTACAGGAACACTTGCCAAAACCGACCGGATCGATGCCGGGGTCCTTGCCCGCATGGCGGCCACGTTCCAGCCTGATGTCCGGCCTATCAAAAGCCCGGAACTGGCTGGGTTGGCCGAGCTCATGAACGGCCGCGATGGCTTGGTCAGGGATCGTACCGCCCTCAAGAACCGCGAGAAGAACCTGCTGCTGCCGCTGCTCAAACGCCAGGTAAAGGCACGTCTCGAGCAAATCGCACGGCACATCAATGCCATCGATATACAGGCGCAGGCGCTTATCGCTGCAGATCTCGCGCTTGCGCGCAAGCGCGAAATTATCGCCAGCATCAAAGGCCTGGGACCAATCACGGCCGCACAGTTGATCGCTACCATGCCTGAGCTGGGCTCACTGGAAAATAAACAGGCAGCTGCCCTTGCCGGGCTCGCTCCAATTACCCGGCAATCAGGCCAATGGGCCGGCAAGGCACCCATCCATGCAGGCAGAGCCAATGTCCGCCGGTCGCTCTACATGCCCACCCTTGTCGCTGCTCGCTTCAATCTCGACATCAAGGCAAAATACTTTCATCTCATCAGCATCGGAAAGCTGGCAAAGGTCGCCATCACTGCCGTCATGCGAAAGCTCATCGTCATGGCAAACGCACTACTCAAAGCCAACCGGCTATGGAAGGAATCTATGGCTTGACCATCACGGATACTTGTGTTTGGCGTCATCCACTAGGATGGATAGGCAAGGAATTCACCGACAGGCGGAGCCCTTTTTTTAGATTTTTGTGGGTTTACTTTTCCCTGCGGGCCGGAGTACCTCGCGCCAACGAATTTCATCATCGTTTGACCGCAAGATGCGGCGAACGGATTGATCCGAAGAAAGGAAATTCATGTTTAACACCCCCAAAATTATTCTGGCATCGACCCTGTTGGCGATGGTACCGGCATGCTCGTCGGAACTGAGCAAAGAGGAACAGGAAATCCTCGACGCGCGAATCGCCGAAGACACGAAGCCGACCAATATGATCGCCGTTCTGCAAGACGATCCCGATCTCAGCACCGCGAGCATTTTGGTCGGTTTGTCCGGTGTTGGCGCTGAATTGCAAGATAATGGTCCGTTCACCGCATTTGCCGCAACCAATGATGCGTTCAACAAAATGGATGCGAAGCGGCTGAGCGAGCTGCTGTCGGTGGATAACAAGGAAGAGCTGGAGACCATCGCGAAATTTGGTCTGGTCAAAGGCAGCATGACCTCCGCAGATATCGGAAAGGCCATTGCCGGTGGCGGTGGCAGCGCGAGCATCACCACCGTGCAAGGTGGCGTGATCAAGGCGACCATGGACGGTGATACGATCATTCTCCAAGATGGTGCCGGCAACAAGGCGAATGTGACCCAAGCGGATGTGAAGTCCAGTAATGGAACGCTACATGTAGTCGACAATCTGCTGATGCCGGAATAAGTCGGCGACGGAATAGACAAGGAAGGCCCTGCGCGATTGCGGGGCCTTTTTTGTTGCTGTTCCCCGTGAGCATTCAATAGACAGACTGGCTGCATAAAATAAGTAAGTGTTTTGGTTTCATCGTAGTGACGAAGGAACGAAGCATGAAACCAAAACACTCCAGCGCAAAAAAGCCCGCAGAGCGGGTGGTCAAGGACATACGCCGTGCGACCCGCCAGCATTTCTCTGCCGAAGACAAGATACGCATCGTGATCGACGGCCTGCGCGGTGATGACAGCATTGCCGAGCTGTGCCGCAGGGAGGGCATCGCCCAAAGCCTGTATTACACCTGGTCGAAGGAGTTCATGGAAGCGGGCAAGCGCCGACTTGCTGGCGACACGGCGCGTGCTGCCACATCGGATGAGGTCAAAGACCTGCGCCGTGAAGCTGGCGCGTTGAAGGAATGCGTTGCTGATCTGACATTGGAAAACCGTCTCCTCAAAAAACGTATGGCCCGCCTCTTTGCAATAGTAGATTTTTTGATAGCAATCGCACGGTCTGCGTAAACGTATGCGGCATATCGATCATCTGATCGTCGCCAAGATGGATATTCGCGCGGCCCGGTCCTCATAAGCATGTCGGCTTCGAAGAGCCATTTTTTTAACCAGGTGTCCGGTCCGCCGATCAACCGTTCGGCCATCTCACTTTCCACTCGCAAAAGCGTATCTCTCGATAGATTGTAACTCCTTGATATTTATGCCACAGATTTGGCGGGCGCATTCCGATATGCTTCTTGGTGACGCAAGGTTGCCCACGCGATACGCACCAGTTTGTTGGCAAGCGCTACAACTGCAGTGTTGCGGTGAGACCGGGCAAGCAGCCCGCGCAACCAAAAGCCCAATTGGGTCTCGCTTTTCGCCAGCCACGGTAAAGCAGCTCGCGCACCGTGGATAAACAGTGTTCGCAAATAGATGTTACCCCGCCTGCTTATGCCCAGAAGCTTAGGCTTGCCGCCGGTGGTTCGTTGTTTTGGAACGAGGCCAAGCCATGCGCTAAGATCACGGGCACGCGTAAAAGAACTGGCATTGCCAATCGCTGCAACCAGTGCTGTTGCTGTCAACGGTCCGATACCCGGAACAGAAGTCAGGCGCTGAGTATCCGGATCTGACCGGACACATTGAGCAAACTCACGGTCCAGAATGGCAATACGGCCATCGAGTTCTCTCCATTCATCTCGCATATCCGCTATTAGTTCGAATATTCTTTGACTGATGCCGGTAGCCGGTTCGCCCAGCATCGCATCAATCGCCAGCTCAAGCTTGCGCCTGCCTTGCGGAAATCTGAGCCCCCGCTCTAACAGAATGGCTCGCATCTGATTTATTAGCATCGTGCGTGTACCCACAAGCCGTGAACGCGAACGGTGCAGCGTCTGCATGTCGAGCTGTTCGGCGCTCTTAAGTTCAACAAAGCGCATTGTCGGGCGCGAAGCAGCTTCAGCAATAGCTTCGGCATCGCGTTCATCGTTCTTCTGAGCTTTAACATATGGCCGCACGTATTCCGGCGACATCAGTCGGATCTCGTGTCCTTGCCGGGCAATTAAACGGCCCAGATGATGAGCACCGCAACAGGCCTCCATTGCAACGATGCATGTTGAGAGTCGCTCTGTAAATTTTGGAATCGTATCGGGCCGCATGCGCTTGCGGAATATTAACTGTCCGCTGATATCGAAACCGGCGAGGCTGCAAATGTTCTTGCCTAGATCCACGCCCAAAACAGCAATGTTAGTGTTGTTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP022548|1449445:1474624|1466356_1467622_-|WP_100093411.1|DBSCAN-SWA MASMTRTGTEAAYAIPLDKIDVSSPALFKDDSAGTYFKRLREEDPVHYCAESAYGPYWSITKYKDIMHVDTHHDIFSSSAGFGGIIIDDNIQKGGGEEGLDLPNFIAMDRPRHDEQRKAVSPIVAPGNLALLESTIRERVGAVLDSLPIEEEFDWVDRVSIEITTQMLATLFDFPFEDRRKLTRWSDVTTAAPGGGVVDSWAQRKDELMECAQYFQTLWNERVNEDPGNDLISMLAHSPATRNMTPEEYLGNVLLLIVGGNDTTRNSMTGGVLALHQNPDQMAKLRASPALIESMVPEIIRWQTPLAHMRRTAKEDSVVGGKTIRKGDKVVMWYFSGNRDDEVIDRPDSFIIDRPNPRQHLSFGFGIHRCVGNRLAEMQLRILWEEILNRFDRIEVIGEPERVQSNFVRGYTKMMVRVHAR >NZ_CP022548|1449445:1474624|1467780_1468392_+|WP_164089022.1|DBSCAN-SWA MKYKHDLGKRRLKPDERRKQLTLCALKAFAENGVARATHSHVAKLAGVAIPTVHSYFRSREDLESAVLCEVENYLIKLVTDSLSGKKSVEEALTTLAMRFANDATTKSDIIKVWLDWSTGVRAGVWPRYLALLDKLHAITRPVFLRGKREGILSENLNVKAATRLFIGGGHTLALMQFAKVPSRDLAIYMDQFIQSLMNIGKP >NZ_CP022548|1449445:1474624|1473583_1474624_-|WP_100092283.1|transposase|DBSCAN-SWA MNNTNIAVLGVDLGKNICSLAGFDISGQLIFRKRMRPDTIPKFTERLSTCIVAMEACCGAHHLGRLIARQGHEIRLMSPEYVRPYVKAQKNDERDAEAIAEAASRPTMRFVELKSAEQLDMQTLHRSRSRLVGTRTMLINQMRAILLERGLRFPQGRRKLELAIDAMLGEPATGISQRIFELIADMRDEWRELDGRIAILDREFAQCVRSDPDTQRLTSVPGIGPLTATALVAAIGNASSFTRARDLSAWLGLVPKQRTTGGKPKLLGISRRGNIYLRTLFIHGARAALPWLAKSETQLGFWLRGLLARSHRNTAVVALANKLVRIAWATLRHQEAYRNAPAKSVA >NZ_CP022548|1449445:1474624|1471052_1471361_-|WP_100093414.1|DBSCAN-SWA MESGPEALADRPSVPSRVWNRIPANIHEQIIELALEQSELSPRELAVRFTDEKRYFVSEATVYRLLKAHDLITSPAYVVIKAANHFHTQTTRANEMRQTDFT >NZ_CP022548|1449445:1474624|1464395_1466006_-|WP_100093410.1|holin|DBSCAN-SWA MTGASRKNNFDYIIVGAGSGGCVVANRLSANPAIQVCLIEAGGDDDQMLVNVPLGMAAMLPTKINNYGYQTVPQPGLNGRKGYQPRGRVLGGSSSINAMCYIRGQAEDYDAWAADSAPGWSYDDVLPYFKMSECNERGEDQYHGVSGPLNVADHRSPNPVTDLFIEAAAEQQHKTTNDFNGAEQEGVGRYQVNQKDGRRWSVARGYLRPILDRPNLTVMTRTKALRIVLDGERATGLEVLCNKRSMTLGADAGVVLTSGAFGTPHLLMVSGVGPESELRRHSIPVRFNIPEIGQNLQDHPDHVTVYRAKSPDLFAVSPGGVARIGASIPDYIRNGRGPLTSNAAEGGAFLSTKGRGNRPDVQMHFVHGIIDDHSRKIHLGGGMSCHVCVLRPESRGSVTLGSADPFAAPVIDPNFLATDGDIKTLLSGFKLVRDIMESDALKSIRGTELFTKGMSGDSELIDAIKARADTVYHPVGTCRMGNDETAVCDEKLKVRGIQNIWIADASVMPNLISGNTNAPVAMIAERAAEFILENQK >NZ_CP022548|1449445:1474624|1466016_1466340_-|WP_089132075.1|DBSCAN-SWA MPLVKFITAQGDDHLVNVNANSSVMEAARDNGVPGIDGDCGGCAACGTCHVHIDAAWIEKTGHASDGSESDMLEFVEGADEYSRLACQISITEELDGLIVRLPLAQH >NZ_CP022548|1449445:1474624|1471382_1472087_+|WP_164089026.1|transposase|DBSCAN-SWA MRARRFAQATGTLAKTDRIDAGVLARMAATFQPDVRPIKSPELAGLAELMNGRDGLVRDRTALKNREKNLLLPLLKRQVKARLEQIARHINAIDIQAQALIAADLALARKREIIASIKGLGPITAAQLIATMPELGSLENKQAAALAGLAPITRQSGQWAGKAPIHAGRANVRRSLYMPTLVAARFNLDIKAKYFHLISIGKLAKVAITAVMRKLIVMANALLKANRLWKESMA >NZ_CP022548|1449445:1474624|1457152_1458037_-|WP_100093403.1|DBSCAN-SWA MDHEESTFTGHNGLKIKASTTGPKTGFPVMLAHGGGQTRHAWAKVLDELANAGYRATAIDMRGHGDSEWAPDGAYDMRDFARDLIAISNQLDNPPAMVGASLGGIAGMIAAGELAPASFKSLTLVDIAPKMEAVGVSRVVGFMQAHMTDGFSSPEEAAEIIAEYMPQRKKRTGTGKLDRYLRKREDGRYYWHWDPNFIHHVTHARDDATEAKDNGFDRLSAAAAKLTLPVHLIRGGSSDMVSEEAVAHFQTLVPSAHFTDIADAGHMVAGDRNDAFCSAILSFLNATHKRGAAA >NZ_CP022548|1449445:1474624|1451169_1454277_-|WP_100093399.1|DBSCAN-SWA MWISDVSIRRPVFATMVIASLVVLGLLSFGRLGVDLFPKIEFPYVSVTTTLPGASPDTIETEITDVIEEEVNSVSGIRQMRSVSAEGVSQVVLEFELSEEADVKAQDVRDRVASILSNLPEESDAPIVEKLDPDAAPILSLLIAGDAPVRELTVFADEVVKERLQRIRGVGSISIVGGREREMRVWLDPAKMRASAISTDNVVAALRNENAELPGGRLEIDGRSKQYGTRTLAEANTASEFGNLAVAYRPNGQVTRVRDIGRVEDSVEDETSYAQLNGRPGVVLEVRKQSGSNTVAMTRAIKAEIEQIKDQLPSGVEIVVTRETARFIESAIGDVLFDLMIAVVLVVFVTFFFLLSWRATFIVMLAIPTSVIATFAAFAAFDFTLNFMTLLALTVAIGLLVDDAIVVVEAVQSDVDDGVDPMEAAPKATKRVALAVLAGTFATLAVFVPIAFMEGIVGRFFFQYGLAIVFSVSVSLLVALTLSPMLASRFLKSEKEKAGWLGKIETFHETMRKRYEKLVGWSIKRRYLVFLGAIISLLIGGLFAAMVPATFMPLTDRSEFLASVELPLGTGIATAKEASLRADEALRNIEDVEYVFVTIGAGTNQKTNLLDFYVDLTPKQGRSVKQGAIMDQAREILAKALPEAKDISAVEVPWVSGAGVGSAQIELIISGSDLTSINDYANIIAEEMRNIPELVDVASTYEGGRPELQIRLDRNRAADLGVTARDVAAASRTLIGGSDAGTFEADGRRYDVRVRADEDKRQKLNDIARLPVRAGNGSLVDLAAVADIEVADSPAQIDRVDRARQITVKANTPPGVALSVAVAQVDKILAKHRPPEGMSTKMEGMARTLADTTSAILLAFALAFVALYIVLASQFNSFGQPVLIMLTAPLSFSGAFIALWLSNQELSMFAQIGMIALMGIVMKNGILLVDLANEYRTGGMSAGDAMQKAAPERLRPVLMTALAAIFGMMPVAFAQSDGAEWRNAMGFIIIGGLTTSTFLTLLVVPAAYVLPDDYARLKIRVRAKLSGQFWRARRA >NZ_CP022548|1449445:1474624|1454281_1455361_-|WP_164089020.1|DBSCAN-SWA MLGLVSGCAIEDTENVDVDGIRKNIPLEVDILTTKLEPLAEPVEVFGTIAAKQSSAIGALVEGPLETMFVTVGDRVKKGQALFQIRQADYKRRVAEASAALRLTEAQAIQAQRQYDRVIALAPRGFVSKAQVERVETDLVVAKAQVSQSQSLLATAKQALRDTVARAPYNGVVTARLVDEGSYLNNRFSMGEQSAALQLQELEIVAAIVSAPQARVDDFKLNQKAKVYIDGFDEPFDSFVFIINDRVDPESRTVELRLPIRNPNYRISSGLGVRALIETPPESAIILPRSAVKGDSAAPYIFLFEKGIARRRNVKVESIDFERVKVTQGVRAGEKIIQNPPSTLRDGQKVERKQRKPAR >NZ_CP022548|1449445:1474624|1455450_1456491_+|WP_100092283.1|transposase|DBSCAN-SWA MNNTNIAVLGVDLGKNICSLAGFDISGQLIFRKRMRPDTIPKFTERLSTCIVAMEACCGAHHLGRLIARQGHEIRLMSPEYVRPYVKAQKNDERDAEAIAEAASRPTMRFVELKSAEQLDMQTLHRSRSRLVGTRTMLINQMRAILLERGLRFPQGRRKLELAIDAMLGEPATGISQRIFELIADMRDEWRELDGRIAILDREFAQCVRSDPDTQRLTSVPGIGPLTATALVAAIGNASSFTRARDLSAWLGLVPKQRTTGGKPKLLGISRRGNIYLRTLFIHGARAALPWLAKSETQLGFWLRGLLARSHRNTAVVALANKLVRIAWATLRHQEAYRNAPAKSVA >NZ_CP022548|1449445:1474624|1463904_1464366_-|WP_100093409.1|DBSCAN-SWA MNKTYLAGGCALFVLQIALSGTALAQTKEQDTSGFEVIVVTAQKRSQALQDVPVAVSALTAKELESRGVDETSDLQGFVPSLQITTPYGRTQPNFALRGVSVANEFNASTASPVGVYVDEVYQSFGNYILYLAENRIHRCLDRPALKLRLRTL >NZ_CP022548|1449445:1474624|1459881_1462260_-|WP_100093406.1|DBSCAN-SWA MNHSERKKALRSSAILRYVSRLLFRCEKWGLLGSSLAVFLVIQPYIKIDLLSKVYICQSFRASHGQQLYDLERVEVLRGPQGTLYGRNTTGGAISFFSNKPELVDTTGTLSVGYGNYDTKTIQGAIETTLVPDVLGIRFAGTFAKGDGWQFNPVQNRDTGTTNSIAGRLSVRFKPTDTLDINLKGYISRDNPVAPNPYAFGQYAEGRDVLGYSRFDPAQNGGRALNRDEVSADTGGNYYTLSKGLALNVAYEISDSLSVTSITGYDTGNYSISPFDCDGSPNDVCAIRFNSTSKNFNQDLRFAYKDDRFNLIAGLYYGVDKIKTRNGIDFFGVLDPVLRSFGIGDSFFNAPVSAPQAAAIVPAFLVNPALDPTLASSCAPVATGNPNGFLDARSLVALQTDIQIDNSAGGGFGGALSAACRAAGAPPFGPITGEQNFTVERPSAAIYADASYDVTNELTVSVGLRYTKDKVNYLNGRTVLVDLGGTERASTIPYTFPFNPNQPPLEQREKESRVTGRINVSYDFTDDVLGYVNFSRGYRSGSFNGLAYQGTDQVYYVEPEQINAYEGGLKTLLFDRRVQLNLAGFYYDYSNHQVTQVIGATTFVRSADGKLYGGEAELAIIATDNLRFDASVSLLRSKYQGNVVDPTDPRSPTRNVNGNPFPNAPRETFSAGFDWDVFDTDAGKLSLRGDAAYTGKYFFDPFGDYGQTPCDRPGAPGSVRPAGPAITCGNPGYWLVNSRLTFQKDNFSASLWAKNLTNKVYYSYGLNIDIFGLDYLNRGTPRTYGIEATVKF >NZ_CP022548|1449445:1474624|1463358_1463832_+|WP_123906268.1|DBSCAN-SWA MFKLDELLRDLMSLNTVDDLATYSTKNLKAKLADLWREISKPENAGNRSQLIEIHHSISALSSDRNICFHGLWGYTWCPASESWKEISQGYAREEPFFSDSLTELHNKMILSSKLLADAKWRKEVGNDPPETRNRRQIWGHRPPIESDPHPPERKIR >NZ_CP022548|1449445:1474624|1458134_1458608_+|WP_100093404.1|DBSCAN-SWA MSANAQDGFLNETDEFAFLLEEVPRVLRKAFDESIAQFGLSRTQWRTLAYLIKTEGMTQTELAACLELERATIGLTIDHLEKLDFVERRAAESDRRVWRIFLRPKAIDIIPELRKEADAVYKKMFKGISSANIAIIRTALEKMVGNLRVLNPSDPEI >NZ_CP022548|1449445:1474624|1472280_1472850_+|WP_100093416.1|DBSCAN-SWA MFNTPKIILASTLLAMVPACSSELSKEEQEILDARIAEDTKPTNMIAVLQDDPDLSTASILVGLSGVGAELQDNGPFTAFAATNDAFNKMDAKRLSELLSVDNKEELETIAKFGLVKGSMTSADIGKAIAGGGGSASITTVQGGVIKATMDGDTIILQDGAGNKANVTQADVKSSNGTLHVVDNLLMPE >NZ_CP022548|1449445:1474624|1458758_1459601_-|WP_123906267.1|DBSCAN-SWA MTAQSKPDEQTGEKMRVRLVTPYQLTLGMELLESSSGRSISRMKANRRIAKAPGNPNLDPLALMGFIDQGLSESLQGLIGTEIGISTFDLRIGLSSGSSMGDELTLTAEAVMVGKTSATATGSVTNSSKDIIGTATGLFRLGSFPGGAPADYNLVGDFDAQAMPGPMTTSLGLSGAPDSPQIVAGNRAVLGWESGNIVHGGAVAAALMAACQGRVAGGESSIGQQLESIDIRYVRPAIGSRDLFTNSCFEREGKAASFVRATCFHKPEKPLATAIATFVR >NZ_CP022548|1449445:1474624|1468497_1470741_-|WP_164089024.1|DBSCAN-SWA MKPINREDFQRVESLHQLHILDTPRQASFDRITELCADIFDCSISLISLVDEHRQWFLSSVGVDVRETPREHCFCNHAIEAGCSMLVPDALADQRFSDNPLIRHAPAIRSYLGAPINSPDGTMIGTLCIADQRAGRFDDDHILKLEILAATIEDLIRAHSQASVTSYLNSQLDARSSRLNRANGIFKQAEKIAKIGSWEWNEETRELTWSDEAFAIFGSPENVPTSIEEAVLYCAQEDLQGATEAIENGKKHKDSFEFETGLNAANGTFKRVKAMGEYLEADQQSPARLVGIIQDITESHHTRLALQRAADRDALTDLYNRNALDRFLQNTLGKLKKTGSNAFLLMLDLDGFKDINDTFGHLIGDVFLEKISSRIVDALPHGAIVARWGGDEFVILTAPDIDLQAAKQLGDEILVSVERQFEISGRKMGVSATCGLVEIDERSIAREIIRKADLALYHGKEREPGRVHAYVPELEQANHHRQAAIAKVREALDRDRIFAGYQPIIDLRTSRLVGLEALMRLSVSNGEQITASQVLPAILDPILSREIGERMLSSVYRELAEIEQSQPDLQFVSINATEADLLSRDFADNLLSRLNEVKISPAKITLEVTETMLMVNDTTTVQNVLSELSAAGMRIALDDFGTGFSSLSHLRDFPIDAVKIDKSFIKSITAEHQSRLIVQALINMASNLGIEVIAEGVETEEQRNLLIQLGCNLGQGFLISPAQSACHIKLMQFPNLPSEAMSKRKVA >NZ_CP022548|1449445:1474624|1462154_1463066_+|WP_100093407.1|transposase|DBSCAN-SWA MTNLTDPIFHNEKAAEKHIEVSRWNGEPFCAHCGSTNVTRMKGKTQRGMFQCNDCRDKFTVRTGTVMERSHVPLHKWLLAMHLLASSKKGISASQIARNIGVTYKTAWFLCHRIREAMDGANGNGPLGGPNRVVEADETFVGGKAKNRAHRKPRDKQPVVALVDREGHVRSFHVANVNAKDLRNLIVTNVHRDSHLMTDEATVYTRVGREFAGHSVVNHSAKKYVTTGGFKHSNTAENFFSIFKRGVIGVYHHMSEAHLGRYTKEFDFRYNTRDITDGERAAVALKGIEGKRLTYRRTDKLAA >NZ_CP022548|1449445:1474624|1456754_1457156_-|WP_100093402.1|DBSCAN-SWA MKHNPENLADAIQIAPFHRWLGLKIVRQDSDCLELEMPWRDELVSNPLIGAAHGGILASLIDLTGLYAIIAAGGVARATVDLRVDYHRAATNGPLRAIGQVVKLGKTISTADTRIIDDDDRLVASGRGTYLGT >NZ_CP022548|1449445:1474624|1449445_1450846_-|WP_100093398.1|transposase|DBSCAN-SWA MKDSGFDAAKWADGEVDIVAFRDKRLGERLHTMLAQMAAAIGAPIPMACQDWANTKAAYRFLSNNAVNEGNILAGHFQATQARVAALDGFILILQDTTEFSYQRRDPKRIGAIGLAPSRRDENGRLRLHTVCGLLMHSSLAVTPEGLPLGLTAAKFWTHNRFKGTNALKRRINPTRVPIEEKESYRWLENMRQSTALLGKPGRLVHIGDRENDIYEFFCEAQSAGTHFLVRTCVDRLAGDGGHTIADEMSEVAVQGMHRVTINKDDHADIELRYRRIQVLPPIRKQKRYPSLSLTVLHARECGAPEGRPPIDWRLITDLPVTTPAEAIEKLDWYAQRWKIELFHKILKSGCRAEDARLRTAERLANLIAIFCILSWRLFWVTMINRAAPAASPRLVLTDVEIALIEGLVASRKKALMGRTLADYVTQIARLGGYLARAHDPPPGNIVIWRGWSRLMDLKIGANLIR |
21 | Wolbachia_phage(50.0%) | holin,transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1792506 : 1799652
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP022548|1792506:1799652|DBSCAN-SWA TATGACATTCTGGGAAAATATCGCGCTCGCCTTCAAGGGCGGGGGTGGACCATCGCGGCCGCCTTTGGGGCGGTCTTATATCGGCACCTATGGTGGCGCGGCTCTTTCAGGGGACGCGCCCTTTTCCTATGAGGGCCGGGTGCGTGAGGCTTATGTTGAGAATGCCATTGCCCAGCGAGCAGTGCGGATTGTCGCGGAGGGTGTGGGCGGCGCGCCGTTGTTGCCGCTGGATGACAAGGTCGTGGCGCTGGTCGGTGGACCGGGCACGAGCCAGTCCTTGCTGGAGACCATTGCCGCGCATCTGTTGCTCCATGGCAATGCTTATGTGCAGGTGATCCGCGGCGACCAGAATCAGCCGGCGGAACTTTATGCGCTGCGCCCCGACCGGATCACCGTCGAACCCGATGCCAAGGGCTGGCCGGTCGCCTATAATTACCGTGCTGGCGAACATCGGACGCGCTTTGCCGCGCGGGATGCAATGGGTCGTCCGGCGATCATTCACCTCAAGGGCTTCCACCCGACCGATGACCATTATGGACATGGTTGCCTTGGCGCGGCGGCGAAAGCGGTGGCGGTGCATAATGCGGCGGCGAAGTGGAACAAGGCGATATTGGACAATGCGGCGCGGCCATCGGGCGCTTTGGTCTATGATCCGGGTGCCGATGGGTCGGCGCTGACCGGGGAGCAGTTCGACCGGCTGAAGGCCGAAATGGATGCGAGCTTTGCCGGTTCCGGAAATGCCGGGCGGCCGATGCTGCTCGAGGGCGGTCTGAAATGGCAGTCGATGAGCATGACCCCTGCGGACATGGATTTTGTCGCGTTGAAAGAGGCGGCGGCGCGGGAGATTGCGCTCGCTTTTGGCGTGCCGCCGATGTTGCTCGGTTTGCCTGGCGATAATAGTTACGCCAATTATCGCGAGGCCAACCGGGCGCTGTGGCGGCTGACGATCTTGCCGCTGGCGGGGAAGATATTGGACGGGCTGTCGGGGGCGCTGGGTGCGTGGTGGCCGGATGCGAAGCTGGCGGTGGATCGCGACCAGATCCCGGCGCTGTCGGAGGATCGCGAGCGGCTGTGGAAGCAGGTGTGCGAGGCGGATTTTCTGTCGCCGGAAGAGAAACGGGCGATGCTGGGGGTGTAATTTTGTCTCGCGGCAGTTCGCTGCGCGAACGCCTCCGACGGGGCGGGCTTTCGCCCGGCGCGCGGTCGCGCTTGCCTCGGCTTCGCTCGGTAGCAGGACATCAACCGAGCGTAGCGAGGCAAGGCCGACTGGCCGCCGCGCCTTATTGGCGCGAAAGCCAAGGGCGCGGATGCGGCCGCCCGGCGTTTGAGGTTAAAACGGAGTTTTATACAAAATGCAAAACAACGAAATGCTCGCCCGCCTGATGGCGCAGGCGGAAGGCGATGGTGCCGATCTGGTGACGCTGCGCGCGATTGTCGAGGAGGCGACGGACAGCGGCGCGGTGCGGGTGCTCGACCGGTTGGGGCTGTCTGATCCCGGTGCCGAGGATGATATTGATGAATTGCGCGAGCTGCTTCGCGCCTGGCGCGACGCGAAGGCGAGCGCGTGGAAAGCGGCGATCCGCTGGATAATTCGCGGAGCGCTGGCACTGCTGCTGGTCGGCATCGCGATGCGCTTGGGTCTGGGACACCTGATATCGTGAAGAAACTGGATCAAATAGGTCCCTTCCCTGAAGGGGATGGGCCAAGAGATATTCGCTTCGCCGGCTATGCCGCGATCTTCAACCGGATCGACAAGGGCGGCGATATCATCCGGCCGGGAGCGTTCGGCGATCTGGCGGACGGGCAATCCTTGCCGTTGCTCTGGCAACATGATCCGCGCCAGCAGATCGGCCGCGTCGACTATGTGCGCGAGGACCGGCGCGGACTGCGGGTGATCGGGACCATATCGACCGCGACGCGGGCGGGGCGGGATGCGGTGGCCGGTCTGGCGAGCGGCGCTCTCGGGGGGTTGAGTTTCGGCTATCGGGTGAACCGTTCCTCCGGTCAGAAGCCGCGCGAGCTTCTGGATCTCGACGTAGCAGAAATTTCGCTGGTGACATTTCCGATGCAGGGACTGGCGCAGGTTCACCGGGTTGAACGAGATGCCGATTTATCTTGATCGTGTCCTTCTCAAGAGTATTAGTAATGCAATACACTGCGGAGAAAGATATATGAAAGCCCTTTTTGCGACCCTGCTGGCGCTTTGCTTTGCGGCTCAGCCGGCTCTGGCGGCGGACACCACCGGTCAAGCGCCTGAACCCGTGCGGGTCATGGTGCTGGGTGTCTATCATTTTGCCAATCCCGGAGCCGATCTGAACAATGCCAAGGTTGATGATGTGCTGACACCGCAGCGGCAAAAGGAGCTTGAGGCTCTGGCCGAGACCTTGAAGACATTCCAACCCACTGTTGTCGCGGTCGAAGCGTCGGCCGAGCCGCCTTATGCGGATACAGGCTATAGTGGTTTCAAACCGGAGGATCTGACCAAGGAACGCAACGAGGTGGTGCAAATGGGTTATCGGGTCGCCCATGCCGCCGGCATTGAAAGAGTCTATGCGATTGATGAACAGCCGTCAGAGGGAGAACCGGACTATTTTCCCTATGGCAGCGTCCAGCAACAGGCCGAGGAAACCGGCGAGGCCGAACGCCTCAAGATCATGTCGGATTTCGGTGCGATGATGGCCAGGTTTGAAGAGGAGCAGAAGAGCAAGTCCATTCCCGAGTTGCTCATGTTCTGGAATGGCGACACACTGCCGGACGATTTCTACTGGAACATCATGACCATCGGGCAGGGCGAAAAACAGACGGGCGCCGAGCTCGCGGCCTACTGGTTTATGCGCAACGCGAAAATATTCAACAAGCTGGTGCAGGTGACCCAGCCGGGTGACCGGGTAATCCTGATCTTTGGCAGCGGCCACCGGGCCTGGCTGCGCGAAATGGTCGAGAAAACACAAGGCTACGAGCTGGAGCCGGTGATGCCCTATTTGCAGCAGGCTGCTGGCGCCCTGTCCGAATAGAGCCGGACAATTTGAAAACCCCGAGCCGCCCCTCACCGGGCGGCTTTTTTATTGCCCACAGGAAAGGAAAGACATGACAGATTCTCCCCAATTCGAAACCAAGGCCGACCCGCTTGAAGCGTCCTTTGACGCGGTGCTGATGGCCGAAGATGTCGCCAGCCAGAGCGACCAGATCAAATCGCTGCGCACCGATGTCGATGGCCTGAAGGTGCAGATGAGCGATATTTCGAAGGCTTCGGCCCGGCCTGTGCTGGCCGGAAACATCGATGGTGCGAAGGGAATGCCCTCTTCGGCGGCCGCGCAGGATTTTGTCGCCAAATATCTCCGGCGCGGAGACCAGAGCGGTGTCGAGCTGAAAAGCTTTTCCGGTGCCTCCGGACCCGAGGGCGGTTTTGCCGTGCCGCAGGAAATTGATGCGCTGATCGGGGCGACGCTCAAGGATATCTCGCCAATACGCTCCATCGCAACGGTCGTGCAGACCGGCACGGCGGGCTATCGCAAGCTGGTGACCACCGGCGGCACGCCCTCCGGCTGGGTCAGCGAAACGGCAGGGCGTCCGGAAACCGATACGCCGGATTTCAACGAGATCGCGCCGCCGAGCGGTGAGCTGTACGCCAATCCGGCAGCCTCTCAGGCGATGCTTGATGATGCGGCTTTTGACGTCGAATCCTGGCTGGCGGACGAAATTGCCCGCGAATTTGCCCAGGCGGAAGGCGCCGCCTTTGTCGGTGGCTCCGGCGTCAACCAGCCGCGCGGCTTTCTCAACGCGACGGTGACCGACGAGAGTGATGACGTGCGGGCGTTCGGGTCTTTGCAATATGTGCCGTCGGGCGCGTCTGGCAGTTTCGACAGCGAAGATGTGCTGGTCGATCTGGTCCACACGCTGCGTCCCGCCTACCGGCAGGGCGCATCCTTCGTGATGAACAGCTCGACGCTGGCGCATATTCGCAAGTTCAAGACGGCGGACGGTGCCTTTCTGTGGCAGCCTTCGCTCGCCAGTGGCCAGCCCGCGACCCTGCTCGGCTATCCGGTGGTCGAGGCGGAAGACATGCCCGATATCGCGGCGGACAGCCTGTCGATTGCCTTCGGTAATTTCCGCGCCGGCTATCTGATCGCCGAACGCAGCGCGACCAGCATCTTGCGCGATCCGTTCACCAACAAGCCGTTTGTCCATTTCTACGCGACCAAGCGGGTTGGCGGACAGATCATGAATTCGGAAGCGATCAAGCTGATGCAGTTCAGCGCTTCCTGACCCCTTGCTGCGCTTCGGCGCAGCGCGCCCGTGCCGGTTGCTCCCCCTCTCGTTCCGGCGCGGGCGCATATTTCTAAACCCAAGTGAAAGGATGCCGCGACCGTGAGCTTCCCCATTGCAGACTGGCCGGACCTACCGGCAGCGCTGATCGCAGAGGTCAGGGATTTTGTCCGGATTGATCATCAGGCCGATGATGCCGCCATCGACGCGTTTCTGCGCAGCGCGGCGTCCTTGTGCGAGGATTTTACCGGGCAGATGCTGATTGTCCGGTCGGTGACAGATATGTTGCCGGCGCGGCGTGAGTGGAAGAAGCTGAAGCGTCTGCCGGTGCAGTCGATTGTTTCGGTCGAGGCGGTGGGCGCGGACGGGATCGCGGCGGCGTTTGCGGTCGAGGACTATGCGCTCGACATCGACAGCGACGGTATCGGCTGGATCAGGCTGCACCGGAGTGATGGCGGCTCCCGGGTCCGCGTGACCTATAACGCCGGTCTGGCGACAGATTGGGATGAACTGCCCGCGAGCCTGCGCCAGGGGATCGTGCGGATGGCGGGCTATCTCTACGCCAATCGCGACGGTGTCGATGCGGGCGGCCCGCCGAGTGCGGTGACCGCTTTGTGGCGACCCTTTCGCCGGATGAGGATCGGGTGATGGGACAGGAATTTTCCGGCATTTTGCGCGAACGCATATCGATTGAACGGCAGAGTGTCGGGCGCGATGCGCTGGGCTCGGCCGAACCGCAATATCTTACCGTGGGCGTCTTCTGGGCGGCTGCCGAAGCCTTGCACGGCGGAACGGCCAGCGAAGCGGAGAGCCGCTCGGCGATGCCGCGCTGGCGGTTTACCTTGCGCGAAACCCAGGTGATCAAGCCGGGCGACCGGCTGGTCTGGGGGGACCGGATAATGACCGTTTCAAGCGTGATTCTGGAGCACCGTCTGGTCCCGAAAACCATATTGCAGGCGGAAGAGACAAGATGATGGAAAAATTGCAGAAGCGCGGGGAGGCGATTGCCGAACAGCGGCTGGCTCGCGCCAAATCCGAGATCAAATCTGTGCTTGTGGAAGAGTTGCCTGCCGATGTGCGGGTTACAGAAACTGGCGAAGGAGTGCGGGTGGAAGCCCGGCGGCTGAAACAGCGGCTGATCGGGAATAGCAGCCTGCGCGATGTCGCTTTTCTGATGCGGGCCGTACGATGAGCAGCGCGCTGGAAGCGGTACAGCAGCAGCTGGTGACGCAGCTGAACGGGCATGGGCCATTGATGGATTTGATCAGCGGCATATTTGACGGGCCGCCGCCGCGTGCCGAATTTCCCTATATCGCGCTGGCCACCGGGGCCTCGCTCGACTGGAGCCACAAGGGCGGTGTCGGCCGCGAGTTGAGCCTTGCGCTGACCGTCCACGATGACGGCGAGACGGCGGCGCGCCTGCACCGGGTGATGGCGCTGGTCGAGGAGGCGCTCGAGCCGGGACTGGATGATCCGGCTAGCTGGCAGATCGTCACTTTTGATTTCCGTCGCACGCGTGTTCTGCGCAGCGCGGTCAGCCCGTGGAGCGGGCTAGTCGAATATCGGGCGAGGGTTTTGAGGACGTAGTGCCCCTGCGCAGGCAGGGAGATGTTTCTATTTTAACCTCAGGCGCCGGGCGGGCGCATCCGCGCCCTTGGCTTTCGCGCCGTATGGCGCGGCGGACGCCATTAGTTTTTTGGCGCCCTTGCCTCGCTGCGCTCGGTTAGAAACTTACCACCAGAGGGAGCACCAAACCGACGCGCAGCGGAGGCAAGCGCGACCGCGCGCCGGGCGAAAGCCCGCCCCGTTGGAGGCGTTCGCGTAGCGAACTGCCGCGAGACAAAAGAAATACAACAGGAAGGAACCAGAATATGGCAGCAGAAAAAGGCAGCGCCTTTCTCCTGAAAATCGGCGATGGCGAGAGCCCCGTCAGCTACACGACCATCGCGGGTCTGCGGACCACACAGATGTCGATCAATGGCGAACCGGTAGCGATCACCAGTAAGGACAGCGGCGGCTGGCGGCAGTTGCTGTCGGGTGCGGGAGTTCGGTCTGTGTCCGTGTCAGGGGCTGGCGTGTTCACCGGCTCGGACGCGGAGATGCGGATCAAGAATCATGCGCTGGGCGGCATCATCGACGCCTATGAACTCAGCTTCGAGGGCGGCGAGCGGATGCAGGGCGATTTTCTCGTCGCGCGGCTGGATTATAGCGGCGATTATAACGGCGAGCGCAGTTACACGCTGAGCTTGGAAAGCTCCGGAGCGGTCGCCAGTGTCTGAGCGGCGCGCCAACGCCCTGCGCGGCGAGGCAGAGATTGTGATTGAGGGCACGCGCTTGATCCTGCGGCCCCGCTTTGCCGCCCTGGTCGCGGCCGAGGATGAGCTGGGATCGCTGTTCGAACTGGTTGAACGGGCCGCTAATGGACGGCTTTTGTTGTCGGAAATCGTCACGCTGTTCTGGCATGTTGCGCGCGATCGACCCGCGCAATTGACCCGCGACCAGCTGGGCGAGGGGATGATGAAGCTGGGGCTGGCCGGGGTGACGCCGGCGCTAAAGATCCTGCTGAAGCAGATATTGTCGGGCGGCGATGCGTGAGGCGTGTCCCCGCGAAGGCGGGGATCCATCTCCCATTGGTTCGGCAGACGATCTGTTCCAAGATGGACCCCCGCCTTCGCGGGGGCACAATGATTTTTCATCTTCGGCACAAAGATTATCCGGCACAGTCTCCGCCGTCTTCGGCTGGACACCCGACCAGTTCTGGCACGCGACGCCCGCCGAATTGGCAACCATATTTTCGGTCTTCGCCGGCAGCGGGCCCGGTCAGGCTCCACTCGGCTCCGAACAATTTGAAAAACTGAAAAAGGCTTTCCCCGATGGATGA
Protein sequences of DBSCAN-SWA_3 >NZ_CP022548|1792506:1799652|1795592_1796771_+|WP_100093675.1|capsid|DBSCAN-SWA MTDSPQFETKADPLEASFDAVLMAEDVASQSDQIKSLRTDVDGLKVQMSDISKASARPVLAGNIDGAKGMPSSAAAQDFVAKYLRRGDQSGVELKSFSGASGPEGGFAVPQEIDALIGATLKDISPIRSIATVVQTGTAGYRKLVTTGGTPSGWVSETAGRPETDTPDFNEIAPPSGELYANPAASQAMLDDAAFDVESWLADEIAREFAQAEGAAFVGGSGVNQPRGFLNATVTDESDDVRAFGSLQYVPSGASGSFDSEDVLVDLVHTLRPAYRQGASFVMNSSTLAHIRKFKTADGAFLWQPSLASGQPATLLGYPVVEAEDMPDIAADSLSIAFGNFRAGYLIAERSATSILRDPFTNKPFVHFYATKRVGGQIMNSEAIKLMQFSAS >NZ_CP022548|1792506:1799652|1792506_1793643_+|WP_100093672.1|portal|DBSCAN-SWA MTFWENIALAFKGGGGPSRPPLGRSYIGTYGGAALSGDAPFSYEGRVREAYVENAIAQRAVRIVAEGVGGAPLLPLDDKVVALVGGPGTSQSLLETIAAHLLLHGNAYVQVIRGDQNQPAELYALRPDRITVEPDAKGWPVAYNYRAGEHRTRFAARDAMGRPAIIHLKGFHPTDDHYGHGCLGAAAKAVAVHNAAAKWNKAILDNAARPSGALVYDPGADGSALTGEQFDRLKAEMDASFAGSGNAGRPMLLEGGLKWQSMSMTPADMDFVALKEAAAREIALAFGVPPMLLGLPGDNSYANYREANRALWRLTILPLAGKILDGLSGALGAWWPDAKLAVDRDQIPALSEDRERLWKQVCEADFLSPEEKRAMLGV >NZ_CP022548|1792506:1799652|1797418_1797745_+|WP_100093676.1|head,tail|DBSCAN-SWA MGQEFSGILRERISIERQSVGRDALGSAEPQYLTVGVFWAAAEALHGGTASEAESRSAMPRWRFTLRETQVIKPGDRLVWGDRIMTVSSVILEHRLVPKTILQAEETR >NZ_CP022548|1792506:1799652|1794676_1795519_+|WP_100093674.1|DBSCAN-SWA MKALFATLLALCFAAQPALAADTTGQAPEPVRVMVLGVYHFANPGADLNNAKVDDVLTPQRQKELEALAETLKTFQPTVVAVEASAEPPYADTGYSGFKPEDLTKERNEVVQMGYRVAHAAGIERVYAIDEQPSEGEPDYFPYGSVQQQAEETGEAERLKIMSDFGAMMARFEEEQKSKSIPELLMFWNGDTLPDDFYWNIMTIGQGEKQTGAELAAYWFMRNAKIFNKLVQVTQPGDRVILIFGSGHRAWLREMVEKTQGYELEPVMPYLQQAAGALSE >NZ_CP022548|1792506:1799652|1794180_1794624_+|WP_100095495.1|head,protease|DBSCAN-SWA MGPFPEGDGPRDIRFAGYAAIFNRIDKGGDIIRPGAFGDLADGQSLPLLWQHDPRQQIGRVDYVREDRRGLRVIGTISTATRAGRDAVAGLASGALGGLSFGYRVNRSSGQKPRELLDLDVAEISLVTFPMQGLAQVHRVERDADLS >NZ_CP022548|1792506:1799652|1797741_1797963_+|WP_100093677.1|DBSCAN-SWA MMEKLQKRGEAIAEQRLARAKSEIKSVLVEELPADVRVTETGEGVRVEARRLKQRLIGNSSLRDVAFLMRAVR >NZ_CP022548|1792506:1799652|1793857_1794166_+|WP_100093673.1|DBSCAN-SWA MQNNEMLARLMAQAEGDGADLVTLRAIVEEATDSGAVRVLDRLGLSDPGAEDDIDELRELLRAWRDAKASAWKAAIRWIIRGALALLLVGIAMRLGLGHLIS >NZ_CP022548|1792506:1799652|1797959_1798358_+|WP_100093678.1|DBSCAN-SWA MSSALEAVQQQLVTQLNGHGPLMDLISGIFDGPPPRAEFPYIALATGASLDWSHKGGVGRELSLALTVHDDGETAARLHRVMALVEEALEPGLDDPASWQIVTFDFRRTRVLRSAVSPWSGLVEYRARVLRT >NZ_CP022548|1792506:1799652|1799358_1799652_+|WP_100093681.1|tail|DBSCAN-SWA MREACPREGGDPSPIGSADDLFQDGPPPSRGHNDFSSSAQRLSGTVSAVFGWTPDQFWHATPAELATIFSVFAGSGPGQAPLGSEQFEKLKKAFPDG >NZ_CP022548|1792506:1799652|1796885_1797419_+|WP_100095496.1|DBSCAN-SWA MADWPDLPAALIAEVRDFVRIDHQADDAAIDAFLRSAASLCEDFTGQMLIVRSVTDMLPARREWKKLKRLPVQSIVSVEAVGADGIAAAFAVEDYALDIDSDGIGWIRLHRSDGGSRVRVTYNAGLATDWDELPASLRQGIVRMAGYLYANRDGVDAGGPPSAVTALWRPFRRMRIG >NZ_CP022548|1792506:1799652|1798642_1799050_+|WP_100093679.1|tail|DBSCAN-SWA MAAEKGSAFLLKIGDGESPVSYTTIAGLRTTQMSINGEPVAITSKDSGGWRQLLSGAGVRSVSVSGAGVFTGSDAEMRIKNHALGGIIDAYELSFEGGERMQGDFLVARLDYSGDYNGERSYTLSLESSGAVASV >NZ_CP022548|1792506:1799652|1799042_1799366_+|WP_100093680.1|DBSCAN-SWA MSERRANALRGEAEIVIEGTRLILRPRFAALVAAEDELGSLFELVERAANGRLLLSEIVTLFWHVARDRPAQLTRDQLGEGMMKLGLAGVTPALKILLKQILSGGDA |
12 | Geobacillus_phage(25.0%) | portal,capsid,protease,head,tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|