Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP034445 | Mesorhizobium sp. M2A.F.Ca.ET.043.02.1.1 chromosome, complete genome | 5 crisprs | cas3,csa3,WYL,DEDDh | 1 | 3 | 4 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034445_2 | 1984299-1984384 | Orphan |
NA
Consensus repeat of CP034445_2
|
1 spacers
spacers of CP034445_2
>2.1|1984328|28|CP034445|CRISPRCasFinder AAAATTAGGTCCCCACCAGCGACGAGGC |
CRISPR arrays and Neighbor proteins around CP034445_2
The CRISPR arrays of CP034445_2 >merge|CP034445|2|1984299-1984384|CRISPRCasFinder CTTTGCCGAGGAGCTTACGGGTGGGGCAAAAAATTAGGTCCCCACCAGCGACGAGGCCTTTGCCGAGGAGCTTAAGGGTGGGGCAA >CP034445|2|2|1984299-1984384|CRISPRCasFinder CTTTGCCGAGGAGCTTACGGGTGGGGCAA AAAATTAGGTCCCCACCAGCGACGAGGC CTTTGCCGAGGAGCTTAAGGGTGGGGCAA
>CP034445.1|AZO03293.1|1983562_1984219_-|TetR/AcrR-family-transcriptional-regulator MDVSRQQAVAKDEVAQAPAVEKGARARTRRLMLETATRLMQSGMTPSVSEVAEAAEVSRATAYRYFPSQAALVQAVVDEGLGPILTWKSASDDAERRVADLFATAMPRIEAFEATFKAALKLSLDQWARRQAGTLGSEPAFTRGHRVDLLKDAIAPLKGRLKPRQFRRLAQALSLVFGVEVVTVLKDIWGLDSAEMMSVAQWAAGALVRAAMAESGPK >CP034445.1|AZO03292.1|1981852_1983169_-|carbohydrate-ABC-transporter-substrate-binding-protein MRKIVTSVLAGVGLALACGTSVHAQEKSLTIFWAEWDPANYLQELGNEYEKETGVKITVETTPWSDFQTKAFTEFNAHGDAYDLVVGDSQWLGAGSTGGHYVDLSDFFNKHKLNDVMAPATVKYYAEYPGNSGKYWAIPLEGDAVGWSYRKDWFEDPKEKEAFKAKYGYDLDVPKDFKALRDIAEFFYRPDQKKYGIAIYTDNSYDAMAMGFENALFSYGGELGDYTTYKVDGIVNSDKAVAALDAYKELYKFTPPGWAKSFFVEDNQAITENLAAMSMNYFAFFPSLINEASNPNAKNTGFFANPPGPNGDQFAALGGQGISIVSYSQKQDEAMKFLEWFIKDETQKKWAALGGYTCSAAVLKSAEFQNATPYNKAFYETMFKVKDFWAVPEYAELLQQLNQRVYPYMIGGQGTAKETLDALAADWNATFKKYGRVK >CP034445.1|AZO03291.1|1980877_1981783_-|sugar-ABC-transporter-permease MATTMITTLDRSSRAAARGLSDISIRNLFIIPTILFLIVFNIFPLIYSLGYSFTDFRASTNAPANFVGLQNYRDLLNDPYVWSNFAITAKYVIISVAGQLFVGFGVAMLLNRDIPMKGLLTTLLLLPMMLSMAVVGLFWKLLYDPSFGIINYALGLGTFEWLADPKMALYAVALTDIWMWSPFVMLLSLAGLSAVPKHLYEAAAIDRAGSFYTFFRITLPLVAPILMIAVIFRTMEAFKTFDLAYILTSQPTAELISIRLYKMAFQEWQTGRSCAMAYIVLIMVLAITNIYVKYLNKVKER >CP034445.1|AZO03290.1|1979916_1980876_-|carbohydrate-ABC-transporter-permease MAAVRTSSEVAFNRIAIVGVLVVTIIFLAPIYWIASTAFKPRNLATTIPPTIVFQPEISPFVKLFTKRSQLRSPPTPEEYAAAPWWERVVFDGGEKIVRDGKGAVQWSGYPNRFMNSLIVAITSTVLAVGMGTFTAYGFSRFRVKGEADLLFFILSTRMLPPVVVAIPMFLMYRAVGLNDSHLGLIILYTAFNLSFSVWLMKGFIDEIPKEYEEAALVDGYTRMEAFFKIVLPEAATGIAATAVFCFITAWNEYAFALIMTNRRAQTAPPFIPSQVGAGLPDWTVIAAGTFLFLLPVAIFTFLLRNHLLRGMSFGAIRK >CP034445.1|AZO03289.1|1979683_1979920_-|hypothetical-protein MSFRAINEKYLEPGSQILMVLGIVALCQPWNMPLHTYGVTIILIGLIGFNVTSKIPREEQAGTGHGADQQAHNAGAQH >CP034445.1|AZO03288.1|1978583_1979687_-|ABC-transporter-ATP-binding-protein MTQIELRGVQKFFGAVQVIKDLNLKIDDNEFIVLLGQSGCGKTTTLRAIAGLETIDQGDILIDGKPVQHLKAADRDIAMVFQSFSLYPHMNVYENIAFPLRATRKSRSEIDTEVRSVAKTLQITDLINKKPSALSGGDMQRVAIGRALVRRPKAMLMDEPIGALDAKLREEMRAEIKRLHIKQGSTSIYVTHDQIEAMSLADRIVIMHEGVLQQVGTPDEVYSHPANLFVAQFVGSPVMNVAEAKVSENASAVSVTVGDAAAGFEFPRALLSQLNGHAGDQFTLGVRPEGVLVRREAAEGFLPVETQIVEPLGSFDIVDLKVGSSMLRARTKSGFVSGAGEKVYVRIDPSQTHFFDAASGKALGVRL >CP034445.1|AZO03287.1|1977480_1978584_-|ABC-transporter-ATP-binding-protein MAHIQLKNISKTFGNHTALSNLNLDIADGEFFVLLGETGAGKTTTLRMVAGLEKPTEGQVFIDGVDVADWGAAERDIALVLQQYSLYPRYTVRQNLEFPLKPKIRRLPDAEIKDRVARAARTLRIEHLLDRKTDRLSGGEMQRVSIGRAIVRKPRVFLMDEPLSALDAKLREALRTELKNLQMQLGATFLFVTHDQIEAMSMGDKVGVLNHGRIVQAGTPHEIYNNPRDTYVASFVGSPPMNLIDGKLVNDRAVMAPVNFELPLSTGAKSFGGGRTSGATDGRPLVFGIRPEDVHLESGAPVEARVHDVENHGVEKILTLRVGDTMLRATVPARTDIAIEQAVRFAWNPDKVVLFDKGSGVSLRHAG >CP034445.1|AZO03286.1|1976300_1977383_+|Tat-pathway-signal-protein MNRRTMLVGAGAALVAAGTGVAGWRSAVGSMAQYEVFAAGLRDRLTPDLGAIVRYATLAANSHNTQPWRFQLEGHAIEIRPDLQRRTPIVDPDDHHLYVSLGCAAANLMLAAAATGRTGEASLTADGNGIRYDYLMGEAKADPLANAIPKRQSTRAEYDGRATPAADLVELERAAAIPGVSLALVTDQGRMKQVRDLVLAGNEDQMNDPAFMHELKQWIRFNPRSAMARGDGLFSAASGSPVLPSGLGRIALDRLFSAAQENEKYARQIDSSAGVAIFFAERPDHDHWVRVGQACQRFALAATSLGLKLAFINQPVEVARLRADLAGIVGETRRPDIVMRFGYGPALPFSPRRPVASVVL >CP034445.1|AZO03285.1|1975863_1976211_+|GFA-family-protein MLYKGSCHCGKVAFEVKGEIGGAVRCNCSICARKGALLWAVPHEKLSLVAWGDDLGRYTFGKAQIAHRFCRTCGIHPFAEDVGESGERTAYININCLDDVDVAGIEVFEFDGRAA >CP034445.1|AZO03284.1|1974446_1975742_-|amidase MQSVRDRLETILSRLANRAAEERVYTKLYAAAARAAADASDARRKAGVSLGPLDGRIVSIKDLFDVAGEPTTAGSLILAGTAPATRDAAIVARLRRAGAVIVGKTNMTEFAFTAIGDNLHYGTPGNAADASLIPGGSSSGAGVAVGEGTSDISIGSDTGGSVRIPASLNGIVGFKPTAGRVPLTGAFPLSMTLDSIGPLARSVADCAIADAIMAADEPAALQPVPLATLRIGIPRGVLFGETQAEVAEAFEACIDRIGQAGAGAVDLPIDDLIAEMRAATRRGTIASMEGAEVHADWLASGASVPVDPHVTGPLSRALSVPASAYIRTIRRRGELATAMAERLEAVDVLALPTVPIVAPSIAAMAGDEALRDRTEGLLLRNTQVANQFDLCAISLPMPGTKLPAGLMLVGKHGHDRRLLAIAAVVEALLGR >CP034445.1|AZO03294.1|1984431_1985628_+|UPF0261-family-protein MKRIYVVGTADTKGEELAFLADAVAAAGGAVVRVDIGTRGATVPVDIPASEVAAHHPKGAGAVLGIDDRGAAVAGMGVAFAGFIRSRDDIAGMIGIGGGGGTSIVTAGMRALPLGLPKIMVSTLASGDTAPYVDVSDIIMMPSVTDMAGLNRLSRVVLHNAAQAIAGMAAKPAPIAAGKPALGLTMFGVTTPCVTAIVERLRADYDCMVFHATGTGGRSMEKLADSGLLAGVLDITTTEVCDLLFGGVLPATEDRFGAIARTKLPYVGSVGALDMVNFWAPPTIPDKYRGRLFYEHNPNVTLMRTTADECRRIGEWIGDRLARCDGPVRFLIPEKGVSALDIEGRAFFDAEADAALFDAIERTIEPTKDRTVTRLPLHINDPAFAKAAAEAFLDIARK >CP034445.1|AZO03295.1|1985639_1986479_+|phosphoenolpyruvate-hydrolase-family-protein MAAIPRKTILEKFRRMIADGVPIVGGGAGTGLSAKAEEAGGIDLIIIYNSGRYRMAGRGSAAGLLAYGNANEIVKEMAYEVLPVVKKTPVLAGVNGTDPFVIMPLLLSELKTMGFSGVQNFPTVGLFDGTMRQSFEETSMGFGLEVDMIAEAHQLDLLTTPYVFNPDEARAMTRAGADIVVAHMGVTTGGSIGATSAKTLDACVKEIDAIADAARSVREDVILLCHGGPISMPDDARYILERCEGLHGFYGASSMERLPAEAAIARQTADFKAIMKRKG >CP034445.1|AZO03296.1|1986483_1986900_+|cupin-domain-containing-protein MADKSKVFVYPKDVSAFGFDWGKLSLTVAPEVNGAERFSGGVVDLPSGKGHTRHNHPGAEEIIFVISGHGEQMVEDAKGNPVVAKVGPGCTIYVPESRFHSTLNTGDQPMQLFVVYSPTGPELVLRELPDFKLLPAGT >CP034445.1|AZO03297.1|1987074_1988268_-|aminopeptidase-P-family-protein MNIAPGKTEVSSIPFDQARVDRLMEEAGIDVLFATSKHNTQYLLGGYKFIFFAAMDAIGHSRYLPIVLYEKGGPEHAAYIGNKMEGGEHQNHPFWTPTLHAACWGTLDAANLAVEHLRQIGKSAARIGIEPGFLPSDAYMLIRKALPDAKLIDATGMLETMRATKTEAELEQLRIASELITDSMLATIAWAREGTSKTDIIERLRREETNRGAHFEYCLLTLGSSHNRAASSQAWKKGEVMSIDSGGNYHGYIGDLCRMGVLGEPDAELEDLLAEVEAVQQAAFSKVKAGTLGGDMIAHAEGVLKASKVAAYTDFFAHGMGLITHEAPFLMTNHPVAYEGTYAAKPLEKNMVLSVETTMLHPTRGFIKLEDTVAVTDSGYVMFGDRGRGWNRGGAAA >CP034445.1|AZO03298.1|1988602_1990213_+|ABC-transporter-substrate-binding-protein MSELTISRRGLLAGSALLLASNALPTIGFAQTPKKGGRLVLAADSEPRNLNPAIVASNGVFFISSKIVETLAEASFDAKDGLQPRLALSWEGAADGLSVSFKLRDGVRWHDGKPFTSADVAFSALQIWKPLQNLGRTVFKDLEAVDTPDELTAVFKFAKPTPFQLIRNALPALSSVVPKHVYENGKIEDNPANNAPVGTGPFKFAEYKAGQYYRLTRNDAYWGKDEPYLDEIVYQVLPDRTSAAAALEAEEIQLAAFSAVPLADLNRISKVPGLKVITKGYEGLTYQLVVEINHRRKELADLKARQAIAHAIDKDFVVKTIFLGYAATATGPVPKNDPQFYTADVPTYPFDVAKANALLDEAGYKRADDGKRFALKLLPAPYFNETKQFGDYLRQALAAIGIDAQIVNNDSAAHIKAVYTDHAFDLAVGPPVFRGDPAISTTILVQSGIPDGVPFSNQGGYKNAELDALIVKASETLDTSARTELYKEFQKKVAADLPLINVAEWSFISVARDTVGNIANNPRWAVSNWADTYLES >CP034445.1|AZO03299.1|1990336_1991323_+|ABC-transporter-permease MTRALTLLRRRLVGSLFVLLIVVIGSFLLLEAAPGDAVDAYIVSTGGDAGMIELLRHRWGLDQSELTRLANYLWALLHLDLGQSVTFSRPIRDVILERLPTTLVLMGSATALSFGLGSALGIYAGAAPGSFRDRFLSIGSLALYAVPGFWLGLVLIVVFAVDLRWLPIGGIETIASGKTGFSRAADIATHLVLPVSALGFIYLALYLRMMRAGMAEAWRQDFVLAARARGLPRRRVVLAHVARNALLPLVTMLGLQSAQMLGGSVVIESVFAVPGLGRLAQEAVAGRDTPLLLGIILVSAVLVVVINLLVDLAYAVLDPRVGAGEASA >CP034445.1|AZO03300.1|1991319_1992150_+|ABC-transporter-permease MNRLRRFLRTPEAIAGASILALLVVMALAAPMLFPGDPQAIAGPALLPPFQDWRLPLGTDRLGRDVLAELFHGARTSLAVGLAAAAAALLIGAVVGTLAGFAGGLIDEVLMRITDAFQTVPSFLLALAFVSIVGPSLGAVVAAIALSAWTGPARVARAEVLSIRERDYVAGARVIGMHPLEIAFREVLPNALPPVLALSSVIVAAAILTEAALSFLGLGDPNRVTWGGMIAEGRTVLRTAPFLSIVPGVALVLTVLGVHLAGEGVVESTAVRRSLS >CP034445.1|AZO07364.1|1992230_1993148_+|ABC-transporter-ATP-binding-protein MDVLEGERLAIIGESGSGKSTLALAVAGLLARGAEIGGRMDWSLPARVGAKAPLSGLPAISPSRGEIGSSVAGSHLATLKIGEGGDHDLISSPGEMSGRTKRGAKDRQPSAPATSHPLLGRDIGFVFQDPSSSLDPVMPVGKQIAEVARTHLDLTWREAIAKAKTLLERVRLPNPDATLHAYPHQLSGGQKQRVAIAAAIAAGPKLLIADEATSALDTIVQAEIVALIRRLVTEDGMTLIFVSHDIALAASLANRIAALRHGELVELGETSQIVNAPQHAYTRALLDAHLGLDAEPPLDRQGISA >CP034445.1|AZO03301.1|1993144_1993921_+|ABC-transporter-ATP-binding-protein MILLAVSNLTKRYHRGGKAFAAVDDVSFEIGPAETLALAGPSGSGKSTLARLVLRLVEPDTGRVDFEGGDFLALSGAALRARRARLQMVFQDPLAAFNPRATVARVLDDPLRIHNIAPRAARPRRIAALLERVGLDTGLAARAIHEISGGQRQRVAIARAIATRPSLIVLDEAVSALDVSVRGQILELLLDLQRRERIAYLFISHDLGVIRAVAHRVIILDAGRIAESGDARAVIANPQSPVGKALVEAAPRLNRNRP >CP034445.1|AZO03302.1|1993985_1994492_+|hypothetical-protein MTDPEAARAEKLSRELDSAFRNRADLYRLFLDELTAELGAEQAEAVMIRSIEKRGREVAAAAFAGFGPNDAPAIGEAFLAVSPDGGRMYPTDIERGPDHIAFKVKRCPLKDAWVEAGVGEEKLATLCRIAGAFDRGLFEATGVRFANVTWTPGHGSGCCHIALTNRDA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034445_3 | 2433807-2433945 | Orphan |
NA
Consensus repeat of CP034445_3
|
1 spacers
spacers of CP034445_3
>3.1|2433848|57|CP034445|CRISPRCasFinder CTCAGTTCATCGTTGGAATGACGAACTCCGCGCCGTCCTTGATGCCCGCCAGCGCGC |
CRISPR arrays and Neighbor proteins around CP034445_3
The CRISPR arrays of CP034445_3 >merge|CP034445|3|2433807-2433945|CRISPRCasFinder CCGCAGCCGAAGGCGAGGACGCGCAATCAGGAAACCGTCCGCTCAGTTCATCGTTGGAATGACGAACTCCGCGCCGTCCTTGATGCCCGCCAGCGCGCCCGCAGCCGAAGGCGAGGACGCGCAAGCAGGAAACCGTCCG >CP034445|3|3|2433807-2433945|CRISPRCasFinder CCGCAGCCGAAGGCGAGGACGCGCAATCAGGAAACCGTCCG CTCAGTTCATCGTTGGAATGACGAACTCCGCGCCGTCCTTGATGCCCGCCAGCGCGC CCGCAGCCGAAGGCGAGGACGCGCAAGCAGGAAACCGTCCG
>CP034445.1|AZO03681.1|2433171_2433474_-|hypothetical-protein MTRFVVAAMLATAAITLGAAPASAAAQCAARADIIKALGDKFHETEAGRGLINPNVVLEIFVSDQGSWTVLASDTKGQSCVLSVGEGWDSPTIRAAMPGA >CP034445.1|AZO03680.1|2432282_2432660_-|MarR-family-transcriptional-regulator MTRTDKKVRYLPESEPELLTPSSGAIGVENVLRILEGRWKLVILFHLFGGKVLRFSDLERAIPAISQKMLIQQLRQMEADGIVRRIVHHQVPPKVEYCLTDWGQALCPALDALLKWAARKEPAEG >CP034445.1|AZO03679.1|2431699_2432017_-|hypothetical-protein MARDFSFKWIMLVAVGALMAISAVPARAQIICGGHNYLVARLAEAFEEKRLGYGVAGQVAIFEVFVSASGTWTILMTDVKGQSCILAAGEGWEDTLATAVGQPGG >CP034445.1|AZO07390.1|2431012_2431672_+|GntR-family-transcriptional-regulator MISADANAPFEPAATKAYRALEHMIVTLELAPSSFVTEGALIDRLGLGRTPVREAIQRLAWEGLLDIRPRAGIAVAPLHAGDWLRVLDARRGIEVVLARSAARFVTREAADLFHEAALAMQKAVISGNVLAFIQADKALDEALAIAADNPFAARVAAPLQSHSRRFWFRYKADTGLAESAEHHVALIRSILDGDEEGAAKDAKKLMALLRGHAEVAATR >CP034445.1|AZO03678.1|2429963_2430899_-|dihydrodipicolinate-synthase-family-protein MWTGVFPAVTTKFTADDRLDHAEMERCYSLQMEAGCDGIIVCGSLGEGPMLSPDEKIEVLKTAQKVAGKKPVLLTVNEPGTREAASIAKRAAREGADGLMVVPSPIYHTNPEETVAALRAVAEAGDLPVMIYSNRLAYRVDVTVDLMEELATDKRFVAIKESSDDIRRSTEIINRFGDRYDLFTGVDNLAFEALSVGAIGWVAGLVTAFPRETVAIYQLMRQGRREEALAIYRWFRPLLDLDVSTYLVQNIKLAEVLAIGTNDRMRMPRQPLSGERRKAVEKVVRDALAVRPTLPSLQTSPRSTDRLVAAE >CP034445.1|AZO07389.1|2428640_2429864_-|FAD-binding-oxidoreductase MAIIGGGIIGICAATLLAEAGRSVIVFDRTGVCEETSSGNAAAFAFSDVLPLAHKGMIRQLPKWLADPLGPLAIPPAYLPKLLPWLIRFWRAGAAKHYETSLATQAGMMKLAEAEWMGLLDRSGTRPMLREDGSLEFYESEAEFRASLRGWAERQRFGIGFRHVEGEEMAALQPGLSPRFVKGTFVPGWKTVADPKLLGKAVWTYAEKLGARFEHARVERVESGANGPTIVLADGTRRTASKLVIAAGAWSHLLAKNLGERIPLETERGYNTTLPASAFDVKRQLIFSGHGFVITPLQTGLRVGGAVELGGIDRPPNFARSKAMLEKAKRFLPGLDPSGGREWMGYRPSLPDSLPVIGAARAPNVYYAFGHGHLGLTQSAATGRLIRDLILGQTPPLDLTPFRAQRF >CP034445.1|AZO03677.1|2427563_2428565_-|4-hydroxyproline-epimerase MAKKSFFCIDGHTCGNPVRLVAGGGPLLQGSTMMERRAHFLAEYDWIRTGLMFEPRGHDVMSGSILYPPTRDDCDIAILFIETSGCLPMCGHGTIGTVTMAIEHGLVKPKTPGVLRLDTPAGLVIAEYKQVGEYVEEVRITNVPSFLYAEGLTVECPVLGEISVDVAYGGNFYAIVEPQKNYRDMADYTAGDLIAWSPVVRQRLNEKYSFVHPENPGINRLSHMLWTGKPRNAEADARNAVFYGDKAIDRSPCGTGTSARMAQLHAKGKLKEGDSFVHESIIGSLFKGRVEKEVSVAGKPAIIPSIGGWARMTGLNTIFIDDRDPFAHGFIVT >CP034445.1|AZO03676.1|2426650_2427532_+|LysR-family-transcriptional-regulator MDYLPLNAIRAFEATARHLSFSAAGEELHVTHPAISHQIRRLEEWLGVALFHRDARKVRLTEAGLILQASASAALAELGATCRRIRRSAAQASLSVGCIPSIASRWLVPRLSDFTARHPEIAIRVAYAKAEDRLEDDHDILITLGADPSPHVTSLKLFSRISRPACSPHYLARKGRLETAAAIAAADLLHDETRQGWQEWFSKSGVEERDVGSGPVFADFNILATAVIAGHGVALCPVEVFREELRRGDLVVLSDVSTDDDKGYFLTMSAQPSSAEARFAEWFRDQVSVKAEA >CP034445.1|AZO03675.1|2426084_2426471_-|ectoine-synthase MFTRQLADVEKTDFFVDWGNGTSHRLLTSHDGMGFTICHTVVRAGSESRLQYRRHLEACYCISGTGEVEDMTGTVHRVEPGTVYVLDAHDDHFLRADSAGDMVLVSVFNPPLKGTEKHNLNGEGGSAY >CP034445.1|AZO03674.1|2424764_2425667_-|branched-chain-amino-acid-ABC-transporter-permease MQYFVQQLINGLTLGSIYGLIAIGYTMVYGIIGMINFAHGDIFMVGAFTALIVFLILGALFYSVPVVVALLIMMIVAMLLTSLYNWTIEKVAYRPLRGSFRLAPLITAIGMSIALSNFVQVTQGPRNKPIPPLVSQVYTIDGISISLKQIIIVIVTIALLAVFWYLVNRTALGRAQRACEQDRKMAALLGIDVDRTISITFIMGAALAAVAGTLFLMYYGVVVFSDGFVPGVKAFTAAVLGGIGSLPGAVLGGLLIGFIESMWSAYFSIDYKDVAAFSILAIVLIFLPSGILGRPEVEKV >CP034445.1|AZO03682.1|2433946_2435443_-|CoA-acylating-methylmalonate-semialdehyde-dehydrogenase MIEYGHFIGGKRVAGTSGRKQDVMQPMDGSVRGTVALASQAELRAAVENAKAAQPKWAATNPQRRVRVLMKFLELVARDYDELADILAREHGKTIADARGDIQRGLEVVEVCIGAPHMMKGEYTDGAGPGIDVYSMRQPLGVVAGITPFNFPAMIPLWKIAPAIACGNAFILKPSERDPGVPLRIAELFIEAGLPEGVLNVVNGDKEVVDAILDDPDIKAIGFVGSTPIAHYIYSRGTASGKRVQCFGGAKNHMIIMPDADMDQTVDALIGAGYGSAGERCMAISVAVPVGNDTANRLMEKLVPRVESLKVGPSTDSSADFGPLVTAQALERVKGYVDIGVKEGANLVVDGRGFKMQGYENGYYMGGCLFDNVTADMRIYKEEIFGPVLSVVRAPRYEDAIKLANDHEMGNGVAIFTRDGDAARDFASRVQVGMVGVNVPIPVPIAYYTFGGWKASSFGDLNQHGPDAFRFYTKTKTVTSRWPSGIKDGAEFVIPTMN >CP034445.1|AZO03683.1|2435609_2435837_+|hypothetical-protein MSSFAPPSVLPDISPTWGEISSFAAGTISCNAGDWRKPARHPISPLVGEMSGRTEGGATERGLGCYHQDHPHELG >CP034445.1|AZO03684.1|2435823_2436705_+|LysR-family-transcriptional-regulator MNWDDVRIFLAVARAGQILGAAKRLELNHATVSRRIAALEEALRTKLFRRLTTGSELTPAGERFLDIAERMEGDMIAARSTIAGEGDDVSGTVRIGAPDGFGVAFLAKRLGELTAQHRELTIQLVPVPRSFSLSRREADIAITVERPTEGRLVAGKLVDYTLGLFASRAYAEANGLPKTPAELARHTLIGYVPDLIVSPSLDYAAEFSPEWRTSFAISSALGQAEAVRSGAGIGILHTFVARSMPELVPVDIVAPIRRAYWLVYHESVRPLRRVQLVANFITKAVERERGLFA >CP034445.1|AZO03685.1|2436701_2437274_-|hypothetical-protein MRLFPVAATAYLACAAAALAADKTFHDREHGFSLTYPEEWASETAFDNTIRLKLKSGEKGLTCRVSQNAYDPTAPDNPPDVRAFMEKDWQMSNWQTTVGVAYQSASFSQDRLAHFADGYPVRIADMDFHYADDNVSFYGHSRIALTLRSSHYGFIDCGVTGDSADEATRKWAPLADQADKIVSSFVLDAD >CP034445.1|AZO03686.1|2437507_2437948_+|hypothetical-protein MRAVLAILPLLFLSDCANPWAKVPEAELPKPVRYAMSRPSPFVIGNYCGPGTRTGDLSARPVDRLDAACRVHDACYIARHNHCDCDGELVASARKIRDDRTAPRKMRNEAELLIATFAFPVCRIFPQGFLPPRDPAELKAVSGAAG >CP034445.1|AZO03687.1|2437944_2440107_+|phosphatidylserine/phosphatidylglycerophosphate/-cardiolipin-synthase-family-protein MSLGSAVAKALLGLALTVLAGCAGMPGHSACAFFPGDDACDRLPVSADSTGAGIATAHYFASDGNSAYEKRFQALLAQSRLDALDRANGTGRFGRDWRDVQNSGFHATYPALQGLARPLDDTELAPTPGKPGQAHFAGRWRMSRQTLYLDAAARPSAPKTFSFSGLQARSVEIVLRQAGQQPVDIGGTCNGRLAIRAPGRSMTVAAAAPFHLSLPVAETSVSLFPDQALTRCDLRVGSALAPAGAPLTLLREETADPWITALDSRYDRCPVPDPAGMEELDRVFYASRWLSQTCALPLGSPTLLRKSRDGFNAKVEALLGKRLPDSAFDKADPELPLDFSHAPKLRLIYLSSLEFKADFSGRVMERLIRHHAALGTKVRILVTDVLEREKDDAMLHRLASEFPNVELQEYRWQADHGAPFDEQISQLHKTHHVKMLATLAEQPGRSRVIIGGRNIHDGFLFHRPVDLTRYPDLEQYGKTDGFSLNYYSNWSDFDMEIADPATVETLAAHLSTIWLRDADTNLSRPFSIPVRSRAAPRGVARHFISVPYEDGHALEAYFVELIDTAEHRIEIVNPYLNLTPDIARAFDGALARGVKIDVVGRIDLKGDIGGRFLTALNKLFVEKYGDRIDIREFKAPDVVLHSKIMMIDERLVAISSVNLNNRSFFHDSENGMVVLDPAFYARMKPIYQDYVAHSRPVATNVTIGWAYRLLFSDNWVREAF >CP034445.1|AZO03688.1|2440150_2441413_-|D-amino-acid-dehydrogenase MKVIVLGGGVIGVTTAYFLTEAGHEVTVYDRQPGPALETSFANAGEVSPGYASPWAGPGVPIKAVKWLLTKYGPLVVRPAFDPNMWTWLLKMLRNCTAERYALNKSRMVPLAEYSRDTLKALREATGITYDERAKGTLQLFRKQKQLDGTGGDVEVLKKYGVPYEILDRDGCVAAEPALAAVRDKLVGGLRLPGDETGDCKMFTDKLAELCVARGVTFEYGSTIRKVVRNRNRLTNVLTDSGWKSADAFVMALGSYSAQFMRKLGRPIPVYPVKGYSITVPITNAEAAPVSTVMDETYKVAITRLGDRIRVGGTAEISGYDLRLHESRRRTLEHSVGDLFPGGGDLKAASFWCGLRPMTPDGPPLVGLSEVANLYLNTGHGTLGWTMACGSAKVLADIMSNRVPEINARDLSPERYLKPI >CP034445.1|AZO03689.1|2441501_2442650_-|alanine-racemase MSDAEVTSLQTVAANGSTVSEAAAGAILTIDLGAIRENYRRLKARLGGVRCAGVLKADGYGLGAAQVASALAKEGCDIFFVALPDEGIALRNAIGPGPDIFVLNGLPPGSEPEAQAAGLCPVVNSAVQLKAWRQAARSAGRSLPAAIQVDSGMARLGMAPTDVEAVAGEAGAFDGIDIRFVMSHLARADEPQQAANEKQRHEFDRLRKMLPAAPASLANSSGIFLGPAYHYDLARPGAALYGVNPTPHEANPMLPVIRLEAKVAQTREIGAGTGIGYGHTHQADGPLRLATISLGYGDGWHRRAASAAWFEGVRLPFVGRVSMDSIILDISALPAGRLGEGDLVELIGPSQSVDDAAGHAGTIGYEILTSLGTRFHRRYVGA >CP034445.1|AZO03690.1|2443033_2443549_-|RNA-pyrophosphohydrolase MAKTIDPETLPYRPCVGLMILNRAGLVWVGHRIAEPDSEFAGTTQLWQMPQGGIDKGEEPIEAAGRELYEETGMRSVSLLAEAPRWINYDLPPHLVGVAFKGRYRGQTQKWFAYRFEGDESEIAINPPPGGHTAEFDEWAWRPMRELPELIVPFKRKVYEQVVAAFQHLTR >CP034445.1|AZO03691.1|2443590_2444781_-|divergent-polysaccharide-deacetylase-family-protein MADIGKDIERPLGQKLQPKRRASRGISGGTLAAVLAVLAVIGVSGAIALRDKPFRKPQDVAVSTPKVVAAPTTPAPPAAPAPVAAATPQASTPMKSGGPQIIHVQTEEGDSPPKAAIVIRDPSTLGQNLKVAHIPDGALIEASETGPLPMRSADGRRPFDVYARPWSGARGARVAIVIGGLAVSQTGTQAAIAKLPAEVTLAFAPQGNSIGRWMQAARQSGHEIVMQVPLEPFDYPNVNPGRNTLTVSASADENLKSLHWALSRTTNYTGVMNYMGARFSADASAMEPFMAELGKRGLAYIDDGSSSRSVAPDLALKDGMPFVAGDMAIDAVQDRGEILKKLDSLEATARAKGSAVGIGSAFDITVDTVTSWIAEAKKRGIEIVPISAVAIDPQKG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034445_4 | 3331289-3331633 | Orphan |
NA
Consensus repeat of CP034445_4
|
6 spacers
spacers of CP034445_4
>4.1|3331310|24|CP034445|CRT GGCAATGGCAATGTCGGCGTCGGC >4.2|3331355|24|CP034445|CRT GCCAATACCGGCGCCTACAACGGC >4.3|3331400|18|CP034445|CRT TTCGGCTTCGGCAACGGC >4.4|3331439|51|CP034445|CRT ATGGGCGCCTTCAACGGCAACTATAACGGCAACCTGAATGCCGGCGTCGCG >4.5|3331511|57|CP034445|CRT GGCAATTTCGGCCTCGGCAACGGCAACGTGAACGGCAACGGGAACTGGGGCTTCGGA >4.6|3331589|24|CP034445|CRT GGCAACTGGGGCGCCGGCAACGGC |
CRISPR arrays and Neighbor proteins around CP034445_4
The CRISPR arrays of CP034445_4 >merge|CP034445|4|3331289-3331633|CRT ATCGGCAGCGGCAATGGCAACGGCAATGGCAATGTCGGCGTCGGCAACGGCAACGGCAATGGCAACGCCAATACCGGCGCCTACAACGGCAACTTCAACGGCAATGGCAACTTCGGCTTCGGCAACGGCAACGGCAACGGCAATGCGAATATGGGCGCCTTCAACGGCAACTATAACGGCAACCTGAATGCCGGCGTCGCGAACGGCAATGGCAACGGCAACGGCAATTTCGGCCTCGGCAACGGCAACGTGAACGGCAACGGGAACTGGGGCTTCGGAAACGGCAATGCCAACGGCAACGGCAACTGGGGCGCCGGCAACGGCAACGGAAACGGCAACGCCAAC >CP034445|4|1|3331289-3331633|CRT ATCGGCAGCGGCAATGGCAAC GGCAATGGCAATGTCGGCGTCGGC AACGGCAACGGCAATGGCAAC GCCAATACCGGCGCCTACAACGGC AACTTCAACGGCAATGGCAAC TTCGGCTTCGGCAACGGC AACGGCAACGGCAATGCGAAT ATGGGCGCCTTCAACGGCAACTATAACGGCAACCTGAATGCCGGCGTCGCG AACGGCAATGGCAACGGCAAC GGCAATTTCGGCCTCGGCAACGGCAACGTGAACGGCAACGGGAACTGGGGCTTCGGA AACGGCAATGCCAACGGCAAC GGCAACTGGGGCGCCGGCAACGGC AACGGAAACGGCAACGCCAAC
>CP034445.1|AZO04380.1|3330264_3331086_-|sensor-histidine-kinase MSGKVEAIRKKLTGIKASGGCILLGHSCLFSPTRGWLATLATAMGRERLMWAGMTTGKDDRALRRRLARRLIAVQEEQRLRLSRELHDDLGQMLASVALELHNVRAGTQEMDGRLERAAMLIDRLSAKVHDAAWNLRPADLDRLGLRASVEDLATMLCSQLGIPCEMDLDALSNPLPAETALTLYRVAQEALTNIGKHAQPSRVSVTAHIQDDRMRLTIEDDGRGFDGRADPGPGHLGLAGMRERLALVGGELTVESATMKGTTVYADVPLAH >CP034445.1|AZO07442.1|3329258_3329861_-|response-regulator-transcription-factor MLSGVGAMINSQDDLTVVGLAGNAEDALEGIGKTLPDIAVVDISLPGDSGVVLIERLGERFPEVSCIALTAHEDPGCLRQVLAAGGRGFVVKRSTATDLLQAIRCVLAGDNYVDPALAARILSPRSAPQGDPGNLSQRERSVIQLVALGYSNKEISSRLNLSIKTIETYRTRASEKLELRSRAAIVRFAHSNGWLAELPV >CP034445.1|AZO04379.1|3328300_3329146_+|LOG-family-protein MTPMEKAGWTPLPHSDEDLERAKSVPDTPQTRAETYRLAWNDPDFMTRRELRAVRLQLELLKPEMILAERGIRSTVILFGGARLPEPGGEAWAAKNETQKKNLEANSKYYEEARKFARLCSQQSATSYYREYVVVTGGGPGVMEAGNRGADDVGAPSIGLNIVLPHEQAPNAYVTPELCFNFHYFAIRKMHFVMRAKAVAVFPGGFGTMDEFFETLTLIQTGRMERVPVILFGKAFWRRVIDLDFLAEQGTISPGDQDIIDFVDTAEEAWEIIRRFYKLGE >CP034445.1|AZO04378.1|3327302_3328157_-|2,3,4,5-tetrahydropyridine-2,6-dicarboxylate-N-succinyltransferase MSKPDLASLETTIEKAFEERDTISTATRGETRDAIQSALDLLDRGVARVAERRDDGTWHVNQWLKKAVLLSFRLNPMEIIKGGPGQAVWWDKVPSKFDGWSAVDFEKAGFRAVPSSIVRRSAYVAPGAVLMPSFVNVGAYVDSGTMVDTWASVGSCAQIGKNVHLSGGVGIGGVLEPMQAGPTIIEDNCFIGARSEVVEGCIVREGSVLGMGVFLGQSTKIVDRATGEVFYGEVPPNSVVVAGSMPGKNFPNNEPGPSLYCAVIVKRVDAKTRSKTSINELLRD >CP034445.1|AZO04377.1|3326804_3327194_-|DUF805-domain-containing-protein MTERTRLSERSQFLWLFFSLSGRLSPVAYALAGVLLVLVQFFPFYQFARVELNTTASQNWAMICWGVLLISAWATFAVTAKRFHDFGKPTYFALISLIIGPILLIILSCFRSDPGPNRYGRRTNAPADS >CP034445.1|AZO04376.1|3326095_3326797_-|hypothetical-protein MRYRALDPQLIIETAERLEGRIGERFPDAGLRGVAAELVSLSRDLAKAARELETPIWWLRGVIIAAFIAGVAVFLFVGTILPLDRISGADDAVQSMQGIEATINTVILAALGLLALVRTEERIKRKKVFRQLHGLRSLIHVIDMHQLTKDPAALSADFKPTAHSPARMTNAADLARYLDYCSEMLSIAGKVAALFAQSVNDDVVIDGVNDIENLSSNLSRKIWQKITLIEGRR >CP034445.1|AZO04375.1|3324985_3326020_+|2-dehydropantoate-2-reductase MASETSTIAIAGAGSIGCYVGGCLALAGRKVVFLGRGRVVEAMRESGLRVSDLDGRDRRIEAQAISATVDPAIALADADVILVTVKSGATGEMAKLIAAHGRPDAVVVSLQNGVDNADRLRDALPGRRVLTGMVMFNVVQSADGELPFRIHRASQGEVMIDDGVDGLAELLDVDGLAVEARADMKAVQWSKLLMNLNNALVALSDLPLASQLADRIWRVILAAQIDEALAAMRAAGIAPARITGLPPALLPKVLRLPDWLFGLLARRMLAIGPQARSSMWDDLKRGRPTEIDELQGAVIRLARQAGIPAPMNERVAALVRQAEAEKRGPPGLGPDAVSAIPGKV >CP034445.1|AZO04374.1|3323610_3324810_-|succinyl-diaminopimelate-desuccinylase MTLPTDPAANLAALIRCPSVTPAEGGALAALGDMLKPLGFSVERPVFSEDGTPDIENLYARRSGNGPHLMFAGHTDVVPVGDESAWTHPPFAAEIAKGEMYGRGAVDMKGGIACFIAAVARHVEKNGGPKGSVSLLITGDEEGPAINGTTKLLDWAAAKGEKWDASIVGEPTNPDTLGDMIKIGRRGSLSGTVTVNGRQGHAAYPQLADNPVRGLMSLVDALLHPVFDKGTKDFQPTNLEVTSIDVGNPATNVIPAKATATFNIRFNDTWDAETIQAEIHNRLDQAAGRKKYRPGKKTPVDYELVWRDRPSHVFLTRDDRLIDTLSSSVKAVVGRTPALSTSGGTSDARFIKDYCPVVEFGLVGKSMHMVDERVALADLETLTQIYERFIEDWFAKGLS >CP034445.1|AZO07441.1|3323002_3323608_-|transporter MLSADETYASLVGAWRLMFGKADGLRLLDLSADGFWNSFYAIVVAAPALIVGWVGIANEIGDPEAFAGRLGMLIRLATVDIGSWVLPLVALALVAPRAGIGGRFVHYVVASNWASAIIAWLMLPSALLRLFLPSTSEISSLVSLFLFALSALLTWRMTNASIGKGAAVGTAVFVGMFIASLLVLFGLQALLGIDIPDSTTG >CP034445.1|AZO04373.1|3322217_3323006_+|tRNA-pseudouridine(38-40)-synthase-TruA MPRFRLDIEYDGSQFAGWQHQVDQPSVQQAIEQAIEKFCGEDVRIRAAGRTDAGVHATAQVAHVDLAKAWPGDKVRDAVNAHLQAVGARVAILKAAVVADDFDARFSATGRHYLYRILNRRAPPALEKGKVWWVPKRLDAEAMHEAAKVLLGKHDFTTFRSTQCQAESPVRTLDRLEVSRDGDFIEVRSSARSFLHNQVRSMVGSLRRVGDGSWGAADLKAALEARDRAACGQVAPPDGLFLVGVDYPETPGKFTIAGDADT >CP034445.1|AZO04382.1|3331869_3332316_-|hypothetical-protein MISGRSSAIPPTVEVAVEGAGIAVAVERPAVEVAIAGADVAVAVIFAGVAVAVEGADVAVAARRRGVVVAVERADIAGTAVEGADIAGAAVLVAEITAEVAERIHAPDVRADSLAVFADRRLDGRLAEGLAARRCVGLRCGCAQQPQA >CP034445.1|AZO04383.1|3332835_3333543_-|pyrimidine-5'-nucleotidase MTMTPDPSRFAHVTDWVFDLDNTLYPHHSNLFAQIDVKMTAYVGELLTLSRDDARKLQKELYLEYGTTLNGLMKRHGIDPDDFLEKVHDIDYSRLVPDPVLGAAIRQLPGRKFIFTNGDRRHAERTARQLGILEHFDAIFDIVAAGLNPKPERQTYERFAELHSVTGHNAVMFEDLARNLAVPKSLGMTTVLVVPRNFEPTFSEIWERDPANDDDVDFVTDDLAGFLTTIVDVVA >CP034445.1|AZO04384.1|3333692_3334454_+|GGDEF-domain-containing-protein MSRIFLKAATVAFASVAVSLLLTLIVVPAIGFPMSRTIWLASTLCPLVLAWAASAGSFWHSDRLQNAHRELARAHAQLAAAHRRLSEKASRDDMTGMLNRETFFAALDGSRRKSDRGALLIIDADHFKRINDSYGHLTGDEALLLIAGAIERGVRNGDVLGRIGGEEFAAFLIGAGEREAKHVAERIRREVELIRFRPSDGRTVPLTVSIGGISCGEGATVSDLMRAADRRLYEAKNRGRNLTILDRELPEAA >CP034445.1|AZO04385.1|3334899_3336339_+|GMC-family-oxidoreductase MIFDSYEAYKQAGFKPKACILGSGPAGTTIARKLGAAGIPVVVLEAGSREFSDESQDFYRGKTVGDFYFDLDITRLRFMGGSSNHWAGWCRVLDSQDFEPKAWAPDTGWPISRADIEPYLGEVHDILELPDFRPDVPVSEDICWVQLIKSPAVRFGEKFADELDRSKNIAVVLNTYATELTGDGKRVTGAKLWSNGQVAGAFSADYFVTCTGGLENSRLLLWSNERSNGGVVPNAAALGRYWMEHPTFEGGNAILASYSEFEVDASNEAFFSPTLAAMERLQIMNFGIRLIESPYPNVKKLIADLACTAPNMAEWMSSQLDQRLRCAAQLYVAWEQAPLASNQVELSKTDVDHAGVPRIELHWKKSPLERRTLLEGLKLFGTTLAQKNLGRVRIDDWISNGGDYPTNEETAGHHHMGGTRMGTDVFKSVVDANCKVHGMDNLYVGGSSVFCTSGQCNPTTTITALACRLGEHLGKLIAV >CP034445.1|AZO04386.1|3336490_3336913_+|MarR-family-transcriptional-regulator MRWQDATQAYDEAVGARLGLIAAERHCLGLLYAGPQSAGAVAAATGLTPAAVTALIDRLEARGYVTRARSLEDRRKVVIEATELTRELSERYYGTIAREGEKLVASFSDAELATVLRFINAALDLQSEQLARIKAEPNKA >CP034445.1|AZO04387.1|3337048_3337882_-|phenylalanine-4-monooxygenase MTISVADYAAECAAQGLRGDYSVCRGDFTVAQGYDYSAEEQAVWRTLCDRQTKLTQKLAHRSYLDGVAALGLLDRIPDFDAVSEKLSKLTGWEIVAVPGLIPAGPFFDHLANRRFPVTNWLRTKKELDYIVEPDMFHDFFGHVPILTQPVFADFMQMYGEKAEDMIALGGDEMITRLYWYSAEYGLIQEPGQPVKAFGAGLMSSFTELQFAVESKDAHHVRFDLETVMRTGYEIDKFQRAYFVLPSFDALRDAFANGDLAGIVARFKGRPALDPSMV >CP034445.1|AZO04388.1|3338021_3338498_+|Lrp/AsnC-family-transcriptional-regulator MPQFDEFEIRMLDILQRDGRKPVSELAQEIGLSTTPCARRFEALQETGIIKGFAAVLSRRAVGLTVEVFIQVRLVSHSDGSPESFIAAVQRMDEVSSCWTMTGDHDFLLHVMVPSVDDLNAFVMHRLMRLPGVRDVHTQLVLQNIKGPGHVPLSHLRK >CP034445.1|AZO04389.1|3338609_3340160_+|magnesium-protoporphyrin-IX-monomethyl-ester-anaerobic-oxidative-cyclase MNIVLINPPHTAIGSRVPDDHLPPLGLLAIGGPLIDSGHQVRLVDAEFGPMSLAVLVDDALCGDPDLILIGHSGSTSAHPTALKIAEMIKARAPGVIVIYGGVFPTYHWRDILTATDVFDFIVRGEGEATATALVEAIEMRQPVGSVAGIAYRDDLGRPVATQAAMTIADLDAWRVGWELIDHRRYSYWGGKRAVVMQFSRGCPHLCNYCGQRGFWTRWRHRDPVKFAREIAWLHREHGVELVNLADENPTSSKKAWRAFLHAMIAENVPVLIVGSTRADDIVRDADILHLYRKAGVIRWLLGMENTDEATLSLIRKGGSTKSDREAIRLLRRHGILSMATWVAGFEDETLSDLWRGFRQLIAYDPDQIQALYVTPHRWTPFFRVAADRKVIQKDVRLWDYKHQVLAMTRLKPWMLFFAVKLIEVAVQSRPKALARILFHPDPEQRHSMRWYTKMGRRVWFREVWGFLARDRRVTDGPTLAEFWGAPQDAEEESMIVRRPVRKPAAIIEDQRRLAG >CP034445.1|AZO04390.1|3340203_3341097_-|acetylglutamate-kinase MTDVAANAEMQAALLSRALPYMQRYENKTVVVKYGGHAMGDHELGKAFARDIALLKQSGVNPIVVHGGGPQIGAMLTKMGIESKFEGGLRVTDQKTVEIVEMVLAGSINKEIVALINAEGEWAIGLCGKDGNMVFAEKARKTMIDPDSNIERVLDLGFVGEPVEVDRTLLDLLARSEMIPVLAPVAPGRDGHTYNINADTFAGAIAGACRASRLLFLTDVPGVLDKNKKLIDELTVAEAKALIKDGTVSGGMIPKVETCIEAIERGVEGVVILNGKTPHSVLLELFTEHGAGTLIVP >CP034445.1|AZO04391.1|3341652_3342192_+|sigma-70-family-RNA-polymerase-sigma-factor MAPQDISKLIVRTSMKDRAAFDLLYKQTSAKLFGVCLRILRDRGEAEEALQEVFVKIWTKADRFAVSDLSPISWLVAIARNHAIDRIRARRSPSANIDAALDVADPTPGPEAMAVAGGEAERIHHCLDELEQDRAAAVRGAYLSGESYAELAERFKVPLNTMRTWLRRSLLRLRECLER |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034445_5 | 5336462-5336556 | Orphan |
NA
Consensus repeat of CP034445_5
|
1 spacers
spacers of CP034445_5
>5.1|5336495|29|CP034445|CRISPRCasFinder GCATTTCGCCCAAAAGTGCGCAGCGGTTC |
CRISPR arrays and Neighbor proteins around CP034445_5
The CRISPR arrays of CP034445_5 >merge|CP034445|5|5336462-5336556|CRISPRCasFinder TGGGACAACGACATGCATCAAGAAAGTTTTTGAGCATTTCGCCCAAAAGTGCGCAGCGGTTCTGGGACAACGACATGCATCAAGAAAGTTTTTGA >CP034445|5|4|5336462-5336556|CRISPRCasFinder TGGGACAACGACATGCATCAAGAAAGTTTTTGA GCATTTCGCCCAAAAGTGCGCAGCGGTTC TGGGACAACGACATGCATCAAGAAAGTTTTTGA
>CP034445.1|AZO06009.1|5335455_5336364_-|MBL-fold-metallo-hydrolase MALEFDTSFDPAYGRAVTVAPDVLRITAGNPSPFTFHGTNSYLVGRDTLAVIDPGPEDDAHLETLLTAIAGRPVSHIFVSHTHRDHSPLAARLKERTGAPTLAEGPHRPARPLRIGEVNPLDASADLAFVPDIALANNALTEGDGWAIRTMLTPGHTANHAVFALEGTGILFSADHVMAWSTSIVAPPDGAMADYMSSLDRLLAREDRLLLPGHGGPVTAPQRFMRGLKTHRKMRERAILERIRAGDRTIREMVAAIYRDTDPRLHGAAGLSVLAHLEDLAARGLIATEGDPAIDGIFSPAG >CP034445.1|AZO06008.1|5334669_5335455_+|DUF1499-domain-containing-protein MASIPERQTSRAAGWSRRIGAFSLVLLLTAVAGYRLGFVETPPFLWVLAVVALLAALALLLAGLGLSRVWSFGDRGGRDLSVGALLALLVLAPYGVAVYWATIYPPLRDISTDLDDPPVLDTSDRTSDMNELSPPTPGEQSLQADTYPLVSGHSYNLPFETVVDAVETVLDRRDWQLTAPYPDINGQSEATINALAKGFVLGLPADVAIRVTDDGEQVIVDMRSASRYGRYDLGDNAARITDFLGELDQEVAGQVGAAPAE >CP034445.1|AZO06007.1|5332959_5334588_+|fatty-acid--CoA-ligase MLGLMQEWPLLCHKLIDHAERQHGSREVVSRSIEGPIVRTTFAEIHRRSLKAAQRLERDGFVLGDRIATLAWNTTRHIEAWYGIMGIGAIYHTLNPRLFPEQIAWIMNNAEDKAIFVDLTFVPLLEKIAGAVKSLRRVIVLTDQAHMPQTSLANAVAYEEWLAEADGDFAWKTFDENTAAGMCYTSGTTGDPKGVLYSHRSNVLHAMIAAMPDAMGLSTRDVVLPVVPMFHANAWGLGQSGPMIGTKLVMPGCKMDGASIYELLDTEKVTFSAAVPTVWMMLLQYLEETGKKLPYLNKVVIGGASCPRAITAKFQDDYDVQVVHAWGMTEMSPLGTLCTLKPQYQTLTGEARLDVQGKQGFPPFGVEMKVTDDDNNALPWDGKTFGHLKVRGPAVARAYYGGAGAEQFDADGWFDTGDVAHIDASGYMQITDRAKDVIKSGGEWISTIDLENLAVGHPDVAEAAAIGIPHSKWGERPLLVVVRKPGREPSKGDILTFMSGKVAKWWMPDDVAFVGEIPHTATGKIQKTSLRAQFKDYRLPTD >CP034445.1|AZO06006.1|5332411_5332768_-|hypothetical-protein MDAQLDEKALESTLAESLDDLAPDIKTVSEDEFAEVVGGALEAVGGTLLFKMRVENGENGEHVAAACIGDAGNRQFLLLTLPTSGGALKVETAARSTNPVAGIAAAYAGLMDAFKTAA >CP034445.1|AZO06005.1|5331818_5332208_-|DUF427-domain-containing-protein MDKPANPAPGFQRNPDKVITVEPYRGSVTVRAGDTVIARSTRAKVLSEPPYPAAFYIPFDDIDFSKLAGTEHSTHCPYKGDASYWSVQAAGEAGTNAMWAYEHPIDEMTEIRNHGAFYTSKVTVEAEPG >CP034445.1|AZO07582.1|5330936_5331680_+|polysaccharide-deacetylase MGLLRNSLCVLALSLLAASAGNAATLLEPTLHLKTQAPEGRGRVALTLDACGGKTDTRILSALVDNRIPATIFVTGIWLKRNAAAAEIMRAHPDLFELENHGGRHIPAVDTPRKIYGISSAGSPDAVQAEVESGAAALASAGGPAPKWFRGATAEYSPSAIAMIRKLGFKIAGFSVNGDGGSLLGAKETARRIGAAKDGDVIIAHINQPTHAAGEGVVQGLLALKQKGMTFVRLDDADGIGNDGTTD >CP034445.1|AZO06004.1|5329687_5330791_+|glycosyltransferase-family-9-protein MLQGPFAGVRSILILQTKYIGDLVLASTLAKNLRIAYPEARIVFLCEARFAGFLTAHGIADETVAFKRSSARGTVMQRGLELYHVLRKLRRLACDMTIDLTDSKTTHFVAMALNARIRVGYFPTERRLGRFERQSANVRAKPFGYGERHFLYRYLSPLEALGEDLRVRVPSIEPLPSETARVLTLLDSCGLRRKAFLVVHAGASFPGRRWQPERFAAAVDTISSETGLSVVLVGGPDEAQANDHIVAAVKAPVVNLVGKLSLETLMALLKEARLFLGNESGPMHMAAAAGTPVVGLYGLTNPVAWGPVGAPSISLRPPMPCECVAADMCHRTNPAKAFCVWRLEVDTVAEAVRELLARTDRPIKTAV >CP034445.1|AZO06003.1|5328372_5329476_+|glycosyltransferase-family-9-protein MLQIAPTPFRSILVLQTKFIGDIVLASALANNLQLAYPGVRIVFLCETHLAGFLTAHGIAAEAIPLSRARMRGMPFERGRELFRVVRELRRRRFDMTIDITDSKTSRLISGLVNAPVRVGYSPTERPLRWHERQPANVRMKPFGFGKKHFLYRYLSPLEALGVDLRVKAPAIRPLPFETTRVLALLARHHILPNAFVAVHAGASFAGRRWQPERFAEAIDRIAGETGLRVVLVGGPDENEATDRIVAAAKTPVVNLAGALRLETLLALLRQARLFLGNESGPMHMAAAAGTPVVGLFGLTNPVRWAPVGVPSISLRPSVPCDCVGGDLCRRTDPSKACCVWRLEVDPVVEAVLELLARTEAVLEAAV >CP034445.1|AZO06002.1|5325546_5328231_+|pyruvate,-phosphate-dikinase MTKWVYTFGDGAAEGRAGDRNLLGGKGANLAEMCSLGLPVPPGFTITTEVCNAYYANGRAYPDGLEADVVAALDHIGRITGRRFGDPSKLLLVSVRSGARASMPGMMDTVLNLGLNDETVEALAADSGDARFAYDSYRRFIQMYSDVVMGLDHEVFEEILEDQKASLGHELDTELTAAEWQGVISLYKAKVEEELGKPFPQDPHEQLWGAIGAVFSSWMNSRAITYRRLHDIPESWGTAVNVQAMVFGNMGDTSATGVAFTRNPSTGDRQLYGEFLVNAQGEDVVAGIRTPQNITEAARIAAGSDKPSLQKLMPDAFQAFVDISDRLEKHYRDMQDLEFTIERGKLWMLQTRSGKRTAKAALKIAVEMARDGLITKEEAVARIDPASLDQLLHPTIDPKAARDVIGMGLPASPGAATGEIVFSSADAEDARAQGRKAILVRIETSPEDIHGMHAAEGILTTRGGMTSHAAVVARGMGKPCVSGAGSLRVDYKAGTLVSMGQTFRKGDVITIDGGNGQVLKGAVAMLQPELSGDFAAIMEWADAARRMKVRTNAETPLDARMARSFGAEGIGLCRTEHMFFDGDRIVAMREMILADTEKDRRSALDKLLPMQRSDFLELFEIMAGLPVTIRLLDPPLHEFLPKTEAELAEVASAMNVSADKLRQRTEALHEFNPMLGHRGCRLAVSYPEIAEMQARAIFEAAVEAGRKAGALVVPEIMVPLVGLVKELEYVKARIDAVAQSVMQETGTKIDYLTGTMIELPRAAIRAHVIAEAAEFFSFGTNDLTQTTFGISRDDAASFLETYRQKGIIEQDPFVSLDVDGVGELVRIAAEKGKATRPGIKLGICGEHGGDPASIRFCEEVGLDYVSCSPYRVPIARLAAAQAAVAAAKGAAKRA >CP034445.1|AZO06001.1|5324987_5325332_-|VOC-family-protein MKLNGKLDYLELPATGGTLDSVKSFYSAAFSWSFTDYGPTYSAFAEGLDGGFQADAGEAPAKPLPVLYSENLEETLDAVESAGGTIVKPIFSFPGGRRFHFTDPAGNELAVWGH >CP034445.1|AZO06010.1|5336603_5337194_+|biotin-transporter-BioY MAIATTMRPLVSLTLPERGAARLATQLFLALAGTLLLTLSAKTKVVLGPVDISLQTLAVLLIASAFGLRLAVATLLLYLAEGAFGLPVFQGTPEKGIGIAYMLGATGGYLAGFVVMAAIAGWAADRGWDRSPFKLFGAMLTAEVVMMAMGFAWLAMLIGPEKSWQFGVLPFIAGDLIKVALAASLVPAVWALLKRG >CP034445.1|AZO06011.1|5337413_5338304_+|glucose-1-phosphate-thymidylyltransferase MKGIILAGGSGTRLYPLTLAVSKQILPIYDKPMIYYPLSVLMLAGIRQILVISTPRDLPVFQALLGDGSEFGLELCYAEQAEPNGLAEAFIIGRDFIGKDSVSMILGDNIYFGGGLSQLCGEAAARDGGASVFAYYVDDPERYGVVSFDKVTGRALTIEEKPQKPKSNWAVTGLYFYDNNVVDIASTIRPSARGELEITAVNNVYLERGELHVHRLGRGYAWLDTGTHDSLLEASSFVRTIEHRQGIKIACPEEIALEQRWISADEVLDRAARLGKNEYAAYLRRRVADLMEDQGA >CP034445.1|AZO06012.1|5338296_5338851_+|dTDP-4-dehydrorhamnose-3,5-epimerase MLEVRTLGLEDVLEIVPKRHGDARGYFCETWNAERFAQAGIDLTFVQDNHSYSAAAGVLRGLHYQLPPRAQDKLLRVVRGSVFDVVVDIRRSSPTFGKWSALEVSAEKGNQILVPKGYAHGFVTLVPDTEILYKVTDTYSPEHDRSIRFDDPAIGIAWPSLAGDFQLSDKDRKAPPLSEAEVFA >CP034445.1|AZO07583.1|5338865_5339999_+|dTDP-glucose-4,6-dehydratase MNFLVTGGAGFIGSAVCRHLCANPAYRVTNLDKLTYAGNLASLRTIENAHNYRFEQGDICDERAVLEILRRDDIDIVMNLAAESHVDRSIDGPGAFIETNIVGTYRILNAALEYWRGLSEERKAGFRFHHVSTDEVFGDLPFDGGMFVEETPYAPSSPYSASKAASDHLVRAWHETYGLPVVLSNCSNNYGPYHFPEKLIPLVILNALDEKPLPVYGAGANVRDWLFVEDHARALELVATKGRPGESYNVGGNSERTNLGVVEAICDLLDVRRPRAAGKRYRDLITFVTDRPGHDRRYAIDASKIARDLGWAPRENFDSGLARTVDWYLDNKWWWGPIREQRYAGERLGHAAAANSQSGPVAGNPARTAAESRKVTT >CP034445.1|AZO06013.1|5339995_5340904_+|dTDP-4-dehydrorhamnose-reductase MRLAVTGREGQVAASLVEAARGRDGVEVVAVGRPGLDLAQPDTVLAALEAARPDIVVSAAAYTAVDQAEDDKDLAFVVNATGAGKVAEAAARLGVPVIHLSTDYVFDGAKDGAYVETDATAPLGVYGASKLVGEEAVAAANPRHIILRTAWVYSPFGRNFVKTMLRLAADRDEIAVVADQWGNPTSALDIADAILHAAARLRDDRNFAAFGVYHLAGEGDTNWSSFARHILDTSRALGGPYAKVRDIATADYPTRARRPQNSRLSSAKFEGVFGWRARQWREATGTVVSRLQSGATEATAAL >CP034445.1|AZO06014.1|5340908_5341667_+|hypothetical-protein MNMISNTSTLWFRLAQFDIKLRYRNSTFGPLWITLNTAIFAASVGFLYAVLLNQPIREYLHHLATSIVLWQFLSATVVEGAESVINAHDLILNTKMSPASCIFRCVTKNFMILLHNLIVVLFTMLIAYPNVGLNIPMFVIGVVLLIAHATWISSIVSVISVRFRDVPLITASAMQLLFILSPILWTAKVLPSESLFLVLNPITYMIDAARTPILNGGTDYTSVLVSAAIALLGSLAAYALYRRTQHRIPYWL >CP034445.1|AZO06015.1|5341695_5342421_+|ABC-transporter-ATP-binding-protein MSQPSIQLTDVSVRIPIWSAPANRSLKQAALRFTTGGRLLAGDSGRFEVAALDGVSLDLQPGTRLGLCGHNGAGKTTLLRVLAGILKPTSGTAAITGDTAVLIKPSMGLSPELTGREFIRLQSLMAGISPREVEQQIDSIIDFSELGDFIDLPVRTYSAGMQTRLSFSAVTAYPTDIVLLDEGLGTGDESFQKKARERMDHWLNNAAIIVLASHSDALIRSMCHSAVYLEKGRIMREEIFS >CP034445.1|AZO06016.1|5342420_5343899_+|hypothetical-protein MRRQVIAALAGAGSMLFGHQELKQSYHKVFDPNIKLQSYKSRLAYLCLFAAHRLGLLADLQSSYLAGLPLIPAIAPPNHVYDRDFLNLRLNRVYDLPDVEIDKRRPQTINVLVPAFDFKSISAGFFGVFQMALFLRKTGVNVRLVLFDNFYFNLPEFKEKVLKYPGMERIFDELEVEYIGERREPLRVSEYDSAVATVWYSAYFAQKINLAIGKDKFIYLIQDYEALFYPANSLHAIADRTYEMNYHAIFSSESLMRFFVENDVGGIKSRNMSYTFFNNACSANLLPKEIFIHRNRSKEKKKIVFYSRPVVDRNMFELTALALSTAFRSGIFDPDEWDCVGMGLGEGVVELLPGVRSVSLPRMDLTTYIEEVASFDICLTLMASPHPSMIPMDLAASGCVVVTNTFKTKTESYLQSLSGNIIPAAPGLGEIVAALELAKFKSLDLEERYRLAKTMRYPRNWDQSLTSRHLNFLKRHVRAMASEKLAGERKTA >CP034445.1|AZO06017.1|5343959_5344655_+|hypothetical-protein MVTSHRAISLEPPNAERTLIVVGCGRGGTSLVAGAAIILGVPMGADSDSVNHEDVELVNAAQGRDVHGRPTSPVADSVTENLRRLIQERNDSNSLWGWKDPSADLYLEKVSTEVRNPLVVFVNRDMAAIAQSEFNKMQYSIEQAYEQALHRFSRYWSLLQKLQWPTLLVSYERAALDPEALLCEMADFIGLDQPTKQQREAVKRFAATRDYQAIPQSSTILEAPIIRSETA >CP034445.1|AZO06018.1|5344676_5345441_+|sulfotransferase MRNGRLGFVVAGVQKAGTSALFTYLTRHPSLLPPRRKEIHFFDDETGVDWISPDYERLHSFFPSDDSERIAFEATPISIFWPNALERIAEYNPEIQIILIFRDPIERAWSHWRMEMSRGADNVPFSYAIRNGRARLNGFARNHPAWRTYSYVERGLYGAQISNLLRLFHPSKVLLLRSNDLKRDPAGVLALIARFLQISPFPIKEEIAEHVGGGAYVPPSKEDIAYLRDFYQNDMELFVDTACVNVADWPTYWR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034445_6 | 6124771-6124854 | Orphan |
NA
Consensus repeat of CP034445_6
|
1 spacers
spacers of CP034445_6
>6.1|6124795|36|CP034445|CRISPRCasFinder CTAACGACATGCACTCTTCAAACAAGTGCATGTCGG |
CRISPR arrays and Neighbor proteins around CP034445_6
The CRISPR arrays of CP034445_6 >merge|CP034445|6|6124771-6124854|CRISPRCasFinder CCAAAAGTGTGCGGCGGTTTTGGGCTAACGACATGCACTCTTCAAACAAGTGCATGTCGGCCAAAAGTGTGCAGCGGTTTTGGG >CP034445|6|5|6124771-6124854|CRISPRCasFinder CCAAAAGTGTGCGGCGGTTTTGGG CTAACGACATGCACTCTTCAAACAAGTGCATGTCGG CCAAAAGTGTGCAGCGGTTTTGGG
>CP034445.1|AZO07653.1|6121795_6124759_+|isoleucine--tRNA-ligase MTDTAETIDYSKTLYLPQTDFPMRAGLPEKEPGMVKRWQDMDLYRKLREEAAGREKFVLHDGPPYANGNIHIGHALNKILKDVINRSFQMRGYDANYVPGWDCHGLPIEWKIEEQYRAKGKNKDEVPVNEFRRECRDFAADWIKVQGSEFQRLGVIGDFDNPYTTMAYHAESRIAGELLKFAMSGQLYRGSKPVMWSVVERTALAEAEVEYQDYESDTIWVKFPVASLAQPVAGAAPALDGAALDLVEAHVVIWTTTPWTIPGNRAVSYSPRIAYGLYEVTAAENAFGPQPGEKLIFADALAEECAAKAKVTLNRLHSVSAEQLGKITLSHPFKGLGGGYEFPVPMVAGEHVTDDAGTGFVHTAPGHGREDFDAWTEAAADLRARGVDTTIPFTVDDAGFFTKDAPGFGPGREGGAARVIDDNGKKGNANQAVIDELIKRNALFARGRLKHSYPHSWRSKKPVIFRNTPQWFVYMDKDLGDGTTLRSRALKAIDDTRFVPAAGQNRIRAMIEERPDWVLSRQRAWGVPIAVFADEDGNVLKDEAVNQRIMEAFEKEGADAWFADGAKERFLGNHDASKWHQVMDILDVWFDSGSTHVFTLEDRPDLKWPADVYLEGSDQHRGWFHSSLLESCGTRGRAPYEAVITHGFTMDEEGRKMSKSLGNTVVPQDVIKQSGADILRLWVVTTDYWEDQRLGKNVLQTNIDAYRKLRNTIRWMLGTLAHDDGKDVPVEAMPELERLMLHRLSELDEVVRQGYDAFEFKRITRALLDFMVVELSAFYFDIRKDALYCDGPSSLRRRAAVQVVRHLFECLVKWLAPMLPFTTEEAWLDRHREAVSVHLDQFPEIPQNWRNEALAEKWRKVRQVRRVVTGALEIARAEKLIGSSLEAVPVVTLDDAALEAAIADVDMAEMAITSDLVIKHGKPPEGAFTLDDVKGVAVVVEKAEDRGLTKCARSWRYTADVGQDPEFPDVSARDAAVLHELKALGRL >CP034445.1|AZO06733.1|6120677_6121475_+|MipA/OmpV-family-protein MRIVGTIPLAVAMGLFAASIAQAGEGSWISGDWYLTLGATGLVAPNFEGGKKYMFSAQPIISLGKVGPEARFTSRNDNISLALVDDGSVRAGLTGKFLFHRTSKDELQGLDPVRFGGEVGGFFEFYPLDWLRARAELRHGVRSHNGFVADIAADAFYDITPSVRISGGPRVTFATANYFDAYYGVNATEAAASGLSEYHPGGGVKSTGLGGAITWKVTEPMTASVFTEYSRLMGPAADSSLVKERGDRNQWTFGVSTTYRFNFTM >CP034445.1|AZO06732.1|6119452_6120568_+|glycosyltransferase MRICCAISALFMNGGLQRDCLNISDRLVARGHSVTILTTRQVGHVESSATIKVHPARVLSNPGTEFALGKTLLSARRDFDCVVGFNKMAGLDIYYAGDPPYFASKLGWWRPFSPRFIRQQKLESMIFAPSSAVQIIALSEHQAALYRDFWRTEESRIHVVGPTLDPKRRRPELLAGERNEIRDRFGLPRQAVIALSIANRMKVKGLDRAVKALQPFPDIHWLVAGLRKDSEEEIRLRSLIAKCGMSNRVILLGIQEDIPAVVVASDFMLHPARLENTGTVILEALANGLPVIASAACGYAKYVEASGAGLVVANHDDPEIWRQAVRKAGDTAVRVQWHQAALAYGSSNPLTGGLEAACDLIESLRRNRHVG >CP034445.1|AZO06731.1|6118250_6119183_-|hypothetical-protein MPKALREYNWRQWLRLQPLTHAVKTARYRKIDRRFLSLPAKACDIQGTRDAARGRDVLLTIAFGNAQCLDLQVRLIRGLVRHDLHIVADNSVSEAAADENRQVCAAYGASYVRLPANPWTVKNPSRSHGAALNWMWHNILKPAAPAAFGFLDQDIFPTQPCDPFAPLQHVAFYGDLRRAGARWYLWAGYCFFRFDAVARKPLDFGLDWFAGLDTGGANWEVLYRDVDPNALPQRPITAFAALPGVELRQAYLRMAGNLAARSRPRRRSLVEGEKARGRFTLAGTHPSGPAQIGATMRLAVDTVIELCDGT >CP034445.1|AZO06730.1|6117202_6118261_-|glycosyltransferase MELESRPQAPRTQTGRFNGHSVTPDYRKDVLLPTVSVIIPVRDGAPYLGKALQSILDQDLADIEVIVVDDGSSDDSAAIATSLGEADGRVKVVANRGKGIVDALNMGIALARAPLVARMDGDDISLPDRLRLQVACFDADADLCLLGTAGLQIDRDGKPLRPIEVPIGNEPLRSALTGYNPMLHPTVMFRTEAVRRLGGYRRAFTYAEDYDLWLRLSEAGKLANMDARLVKLRSHPGQVSRVKEDQQKAAAALARQSALLRRSRAEPFDANGQPAEAIGTFLAWRAATGTGISSLERRDIELLLRTTGIPFATTCKLLLLAAVGAPSFRTLALGPRLAWRRMIARPAKMRPN >CP034445.1|AZO06729.1|6115752_6117189_-|glycosyltransferase-family-1-protein MRLVVDLTSMARWLGPPVGLVRVQRHYSAAAAAFTASDVAFTVFDPVDRVLRQVSPQLAAEIIAGDISCDMHFYPDPARVKVQFYDRWPNWARQALMAIIRPRRILITEIGAFTRHNPDSSLVPLLERIENWLMKPNERRLVARDDGSRVRIVPLRTVAGNEYEPAPGDHLLLMNNDWSHTDIGVISRACCSAGARLIVLLNDIIPIQFPDWYKRHDVKRFTDYVALAITLADRFILTSKRVTADMEEHAAGLGLHLPDLKLVPLGCDSARKSKAGGPLPDGLETGRYILFVSTIEPRKNHSLLMDVWRRLVQDGTVAHSGMKLVFVGRSGWMVDGVLERLHSHPDYGQSLIHLDNVDDGTLSLLYQDSAFCVYPSLYEGYGLPPVEALAYGKALIASTGGAIPEVVGPFGLCLDPLDVEGWEKAMREWITSPEVRTSYEAKAGEFTARSWEQAGRETLEATLAPFPDKVTPARERAP >CP034445.1|AZO06728.1|6114959_6115514_-|HdeD-family-acid-resistance-protein MTLPSDALRDAIGQTRDKWGWFVALGVLLLIFGGIAFGNLFIATVASVYVVGWLMLMAGIIEIIHAFGVKTWGRFFYWLLSGLLYAVAGFFAFDNPLLASAVLTLLLAIALIASGLLRSWVAFSHRPEQGWGWLLAAGIITILLGLMIAMGWPVNSLWVLGIFLAIDLVFQGWSFIAIGLALKR >CP034445.1|AZO06727.1|6113964_6114948_+|bifunctional-riboflavin-kinase/FAD-synthetase MTQAFKRLSTAAPLPAHLRGGVVAIGNFDGVHRGHQAVLERALAEAGRNGVPALVLTFEPHPRKVFRPQVPLFVLTPPPMKARLLAGLGFAALVEQPFTRDFASLSAEAFVTDVLEKNLGIHHAVTGFDFHFGKDRQGGPAFLMAAGERHGFGVTLVDAFRDEGAEVVSSSRIRGLLAEGKVEEAAGLLGYRFTVEAEVIGGQQLGRTLGFPTANMRLSSEAALREGIYAVRFRRADGTLHDGVASFGRRPTVDDNGAPLLETYVFDFSGDLYGETCEVSFFGFLRPELKFDGLDALVAQMKTDEAEARALLAGVRPLSQLDAEIAF >CP034445.1|AZO06726.1|6113083_6113932_+|TIGR01459-family-HAD-type-hydrolase MADSPDIIASLDDLAGRYAAILCDVWGVVHNGEWHFPAAAAALARARAANVPVVLITNSPRRSADVIAQMKVIGVPADACDRVVTSGDVTRDLIADGPRRIFHIGPERDFTLYDGLDIDLVEEFEASGVVCTGLYDDEVEKPADYAELLQRLRARNLQFICANPDILVERGERTIWCAGALARDYAQLGGRTLIAGKPFAPIYHVAMKEVAGLLGRAVERSEVLAIGDGMMTDVKGAADNGFDVLYVSGGIHAREHGDDPARLAAFLEKHGYRPVAVIPRLQ >CP034445.1|AZO06725.1|6110902_6112801_-|hypothetical-protein MIRDFFRDRRGNYALMTVITMIPLMGGVAIAVDYTELIRQKQETLNALDAAGIATAQQIVAGASDADAKSYAKTFFEANLRHVLPANTALTVTLPNNNAGGGTLVLEAALQYKPYFLPAAVALLGGTPGQTTVNFSARSEIRLKNTLEVSLVLDNSGSMSDISPTGGQQRIALLKTAATELVNMLAGQADLMRQIDRPVQFSLVPFSASVNVGPANKDKAWMDLDGISPIHHENFDWTKMSKTASGDPNKYIDKVGDAYWKRGSGWGAGQNTPMTRFKLYEEMTATTRTCTKKRSNGSCQTYSTPTTGQYEAWKGCVEARPYPYNVDDTTPTASKPASLFVPMFAPDEAGNLWTDSTRTSTNSWGYSNNWWIDSSDGLSVPQRQADMRKYFLTKPYDASTVSADDGPNAGCTTSPITPLQDVTTTAGKQTILGAIDAMTPTGNTNVPEGLAWGWRTLSSNEPFTEGRDNNERGNDKVVIVLTDGANTYSSVNDSSYANNRSTYAAYGYTGLAYLGSGSVTRLFMNTSSAVGKSTYTDANYTAALDEQMQTLCANAKANNIIVMTVSLDLSIQKTAEKKAISALTACASDSRFRRDPTDPSKPAKLFWNSTGATLSDDFKAIGSELSNLRIVS >CP034445.1|AZO06734.1|6125066_6125741_+|hypothetical-protein MTDRTFARAALVAPLVVSAIALSGCMSSPTYGTDKTAAAQLFDDVSGAASITPKRRTPIDYKPRPDLVKPAPGQKQNLPPPQESIETASTEWPESPEARRARIRADATAHQNDPNYQPEAVEDVQTDPASVKKAMADSASSHPPRWSPDDSSATRTAEIQRRLAEQKQGDPTTRKYLSEPPLAYRQPSDAAPQNELGEDEYKKERRLKAQAEGKKGGWFDWLGL >CP034445.1|AZO06735.1|6125807_6126269_-|nucleoside-deaminase MKRPDFMALALEEAEAAAGRGEVPVGAVVVSGNTVIAKAGNRTRELADPTAHAEMLAIREACRKLASERLTGHDLYVTLEPCAMCAGAISFARLRRLYFGAADEKGGAVVNGTRFFASPICHHAPDIYPGIGESEAALILKDFFRGKRNGGEW >CP034445.1|AZO06736.1|6126449_6128402_+|pseudouridine-synthase MDDDKKSPRGPRKGGSKPAGPRGGKPAHGAKKPFPKRERPIAAEGERNLKRYQPRETGPREAGAERGDRSFRKGPPRDGKPFEKREGRKPFAPRGDRPMAEGERPARDFKRGYKPREMGEGAERGDRPFRKGPPRDGKPFEKREGRKPFVPRGDRPVAAEGERGERRFDRPKRDFGDRPARDFADRPKRDFSDRPRGAGKPEGGFKPRPRPSEAAPEAGERIAKRLARAGIASRRDAEELIAAGRVKVNGKVLDSPAFNVSATDVIHLDGTEIPPIERTRLFLFHKPAGVVTTNRDPEGRKTVFDVLPSDLPRLMTIGRLDINTEGLLLLTNDGGLSRVLELPATGWLRRYRARVHGKVEESALAGLREGIAVDGVFYGSIEASLDREQGTNAWLTLGLREGKNREVKNILGALGLDVTRLIRISYGPFQLEDLPEGHVLEIKGRVLREQLGERLIEEAGANFDAEIQKPFSNKPVRGGGPRREEADRPKFTRDGDRRPIGEGGLIKARKRREDSRDEALSKLSTKPDRAFGERGAKSDRGGFGDKPRGGFGDKPRGKKSEREQRPIEPPGQRKANVWMAPGARPIGKGRAEADAARAAEAKARKAPFKPGGKGKPGAKPFGKPRGERPEGDGGNRPRGPKRGGDADRRR >CP034445.1|AZO06737.1|6128382_6128937_+|16S-rRNA-(guanine(966)-N(2))-methyltransferase-RsmD MRIVGGEFRGRPLATPKSNAIRPTTDRTREAVFNVLAHRYADKLEGGRVLDLFAGTGALGLEALSRGASYCVFIEESTEGRGLIRENVEAYGLTGRTKIFRRDATHLGEAGTISPFGLIFADPPYGKGLGERALRSAKDGGWLLPGALCVVEEAASAAFDPGAGFSVMDERNYGETVIRFIEAG >CP034445.1|AZO06738.1|6129031_6130393_+|insulinase-family-protein MTPTRIALRTALLVGTLAIVASAPARAADDGDVKDFLLDNGMEVVVIPDHRAPIVTHMVWYKIGSADEPAGKSGIAHFFEHLMFKATTHHAAGEFDRAVSEIGGSNNAFTSYDYTAFHETVPPSALEQMMGFEADRMRNLILTDDVIKTERDVILEERRSRIDSNPQAVLDEEVDATLWQNQPYRIPVIGWMQEMEQLNRPDAKAFYDNYYRPNNAVLIVAGDVEPDAVKAMAERTYGKVARGPDLRPRIRPVEPEQNTRRTVTLTDARVSVPSFSTQWVVPSYHTAKPGEAEALDLLAEILGGGNRSRLYQELVVKQGIASDAAAYFQGTMLDDTNFTVYGAPRGDAKLADVEAAVDAEIARIVKDGVSDDELERAKTRYVRSMIFARDKQDDMANMYGSTLATGGNVKDVQEWPGRIRKVTADEVKAVAARYLVLEHSTTGYLLPQQQAGN >CP034445.1|AZO07654.1|6130401_6131790_+|insulinase-family-protein MAPSHTSTLPSPAREEGGHIYRVVATLCFALFFLLLPALAARAEMNIQEVKSKKGITAWLVEDHSIPLIAIRFVFDGGSAQDPAGKEGLVNLMTGLFDEGAGGLDSDAFQQKLDDAGAEMSFQAARDGTYGSMRMLSDQKDEAFGLLKLAVNSPRFDQAPIDRIRAQVLSGILANERDPNTVAQQRWLRAIYGEHPYSRSDQGTKGSLTSITADDIRAFHKANFARGGLHVAVVGDIDAATLGKKLDDVFGDLPERQTLAPVSDVTPKLGQQLAVNYDLPQTSLQLAWPGVKRSDPNFFATVLMNEILGGSTFTSRLFSEVREKRGLAYGVSSDLVDNEHSHALLVTTATRSDRAAETLSIVRQVVKDMAENGPTEEELAIKKYMIGAYAINNLDSSASIAATLVELQVDNLGIDYMKRRAALINAVTLAEVKAAAKKLLSADPAVMVIGPPLVQVAGGGKG >CP034445.1|AZO06739.1|6131786_6132644_+|patatin-like-phospholipase-family-protein MSPTFGVAFGGGGARGLAHIHIIEALDELGIKPVAIAGSSIGSIMGAGMASGMTGAEIHGYARSILGSRAEVAARMWRSRPGTIAEAMQGGIRVGQFNIERILKAFLPEPIPRTFDALKIPLKVTATDYFGHKLAVLAEGELHSALAASAAIPAVFRPVVRNGCLLIDGGIYNPVPFDLLEKDADIIIAIDVVGAPSDAERKHPTTVDLMYGASQLMMQSIIANKLQQSRPDILVRPKVSKYRVLDFLKIEALMAETAEIKDEVKRAVEKAVARHGGKRGKKKVV >CP034445.1|AZO07655.1|6133466_6134723_-|porin-family-protein MSAAHAADVVQEQAAPGFNWSGVYVGFGVGAGANVHKLSSDFLPGSSLNGIGGEGIYGQATVGYDYMVSQRFLLGGLIDAHVGTIKTSLDVGALGGLSADLKETYGFDVGVRAGYLLTPSTLGYVLGGYAWQKYKLDTNAGFGMDWDQGGYFVGAGVETAINSNWTLKGEYRYTRFGTKDNLLSQFGLPDGALNLDTSRHTFEVAASYRFNANDGGAASFETPSYNWTGFYVGGGLGAGAVVHQIEVPPADVKVNGLGGEGVFGEASLGYDQDMGSWVVGGLVDARLSGIKSKLELGGLLGGGVDSISLNTDYGFDVLGRIGMKVNEATLAYALAGYSWQHFKLDAPAPLDVDWGSSGFSVGAGLETAVSDKMTVGIEYRYSQFAKEDFSDEFPILSGLVTSTPSFHTVRIDAKYKFN >CP034445.1|AZO06740.1|6135298_6136183_+|restriction-endonuclease MAFGVFIHRTDSIYDDSPAEQYQFPRQYLRRVEACVGDWIIYYEPSKVTETRGYFAMAKVQQVIPDLSAPDMYLALIEPGTYLDFVNPVPFSGADGLVERGLLNNEGRISGRAQSAVRALSPADFNRIIDLAFDASEVVPRVEETGFQEEQAPFQFEQSRDRANYIGSRIVRDRIFRRIVLRAYDERCAITGLKLINGGGRAEVSAAHIRPVERNGPDVINNGIALSGTAHWMFDRGLISLSDDLEILISRQVNDVDSVQGFINKTRRALLPSRLSERPHPRFLQWHREHCFKQ >CP034445.1|AZO06741.1|6136303_6137509_+|hypothetical-protein MPIKSKDGGYPVRFSVPRQRWSKIFRTQDELRAFLTSEIKHWQEYNQPRDFPLPGGSQGLAFLDPLTTAIFNEAMSTGPHPEDALKILEDRGALLSEGLYGRFLSKLRAEKPKLYPGAVAAIAATLGPHNAWPDQSGRQPFPWAVWLSGLAAILELVPSSLIKPEADQLTAVVSDALAHRDETEEMKHAFETWSGKTQSDTQAEIDRFRQSAADAAKFTQAQLTQALADSSTRILELEDKVRKRLVLEAPTTYWSKKANGHVAIAFGFGALFLLGLGSGIYWLTHYGVDLVADAHQRIVGNVQDPGLLALVPLAFITLPTLAFAWLLRHVSRVIVQNLALGADARLRGTIATTYSALTVDQAATPAELAIVFNALFRPVDGSTHSEIAPPNLADLMELTKK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | CP034445.1 | 100944-100972 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | CP034445.1 | 2937059-2937087 | 2 | 0.931 |
1. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to position: 100944-100972, mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccagaagtgcgcagcggttc Protospacer **** *******.****************
2. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to position: 2937059-2937087, mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccagaagtgcgcagcggttc Protospacer **** *******.****************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP034445_4 | 4.6|3331589|24|CP034445|CRT | 3331589-3331612 | 24 | NZ_CP022999 | Rhizobium sp. 11515TR strain 10195 plasmid p11515TR-A, complete sequence | 979180-979203 | 2 | 0.917 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP050092 | Rhizobium leguminosarum bv. trifolii strain 22B plasmid pRL22b6, complete sequence | 77392-77420 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP050104 | Rhizobium leguminosarum bv. trifolii strain 4B plasmid pRL4b2, complete sequence | 345032-345060 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NC_019849 | Sinorhizobium meliloti GR4 plasmid pRmeGR4d, complete sequence | 499073-499101 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP019586 | Sinorhizobium meliloti strain CCMM B554 (FSM-MA) plasmid pSymB, complete sequence | 1204633-1204661 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP025507 | Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvA, complete sequence | 47807-47835 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP009146 | Sinorhizobium meliloti strain RMO17 plasmid pSymB, complete sequence | 494097-494125 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP050109 | Rhizobium leguminosarum bv. trifolii strain 3B plasmid pRL3b2, complete sequence | 345032-345060 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP048281 | Rhizobium leguminosarum bv. viciae 248 plasmid pRle248e, complete sequence | 63204-63232 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP030761 | Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed1, complete sequence | 237277-237305 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP021828 | Sinorhizobium meliloti strain KH35c plasmid psymB, complete sequence | 805730-805758 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP022666 | Rhizobium leguminosarum bv. viciae strain BIHB 1217 plasmid pPR1, complete sequence | 761927-761955 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP021831 | Sinorhizobium meliloti strain HM006 plasmid psymB, complete sequence | 747195-747223 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP021218 | Sinorhizobium meliloti RU11/001 plasmid pSymB, complete sequence | 639956-639984 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NC_009620 | Sinorhizobium medicae WSM419 plasmid pSMED01, complete sequence | 997609-997637 | 2 | 0.931 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP050086 | Rhizobium leguminosarum bv. trifolii strain 23B plasmid pRL23b7, complete sequence | 359057-359085 | 2 | 0.931 |
CP034445_4 | 4.1|3331310|24|CP034445|CRT | 3331310-3331333 | 24 | NZ_CP013436 | Burkholderia latens strain AU17928 isolate AU17928 plasmid pAU17928, complete sequence | 44120-44143 | 3 | 0.875 |
CP034445_4 | 4.1|3331310|24|CP034445|CRT | 3331310-3331333 | 24 | NZ_CP013436 | Burkholderia latens strain AU17928 isolate AU17928 plasmid pAU17928, complete sequence | 4346-4369 | 3 | 0.875 |
CP034445_4 | 4.1|3331310|24|CP034445|CRT | 3331310-3331333 | 24 | NZ_CP006373 | Aureimonas sp. AU20 plasmid pAU20f, complete sequence | 12631-12654 | 3 | 0.875 |
CP034445_4 | 4.1|3331310|24|CP034445|CRT | 3331310-3331333 | 24 | MH539650 | Siphoviridae sp. isolate ctcf5, complete genome | 38977-39000 | 3 | 0.875 |
CP034445_4 | 4.6|3331589|24|CP034445|CRT | 3331589-3331612 | 24 | MH697579 | Mycobacterium phage Cane17, complete genome | 60679-60702 | 3 | 0.875 |
CP034445_4 | 4.6|3331589|24|CP034445|CRT | 3331589-3331612 | 24 | MK359359 | Mycobacterium phage Colt, complete genome | 61275-61298 | 3 | 0.875 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP050081 | Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b5, complete sequence | 237158-237186 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP050099 | Rhizobium leguminosarum bv. trifolii strain 9B plasmid pRL9b5, complete sequence | 27785-27813 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NC_012586 | Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence | 174995-175023 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NC_012586 | Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence | 243467-243495 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NC_012586 | Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence | 1231300-1231328 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NC_012586 | Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence | 1528410-1528438 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP025013 | Rhizobium leguminosarum strain Norway plasmid pRLN1, complete sequence | 287467-287495 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP025014 | Rhizobium leguminosarum strain Norway plasmid pRLN2, complete sequence | 4947-4975 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP025014 | Rhizobium leguminosarum strain Norway plasmid pRLN2, complete sequence | 27343-27371 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP049731 | Rhizobium leguminosarum strain A1 plasmid pRL12, complete sequence | 233828-233856 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP049732 | Rhizobium leguminosarum strain A1 plasmid pRL11, complete sequence | 404201-404229 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP053441 | Rhizobium leguminosarum bv. trifolii strain CC275e plasmid pRltCC275eE, complete sequence | 27780-27808 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP013634 | Rhizobium sp. N324 plasmid pRspN324d, complete sequence | 349340-349368 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP018230 | Rhizobium leguminosarum strain Vaf-108 plasmid unnamed2, complete sequence | 399791-399819 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP013110 | Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence | 606865-606893 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP013110 | Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence | 742711-742739 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP013110 | Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence | 1924354-1924382 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP013110 | Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence | 2176432-2176460 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP029452 | Sinorhizobium fredii CCBAU 25509 plasmid pSF25509b, complete sequence | 348752-348780 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NC_012858 | Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132502, complete sequence | 586807-586835 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NC_011366 | Rhizobium leguminosarum bv. trifolii WSM2304 plasmid pRLG202, complete sequence | 41460-41488 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NC_011366 | Rhizobium leguminosarum bv. trifolii WSM2304 plasmid pRLG202, complete sequence | 277940-277968 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NC_011368 | Rhizobium leguminosarum bv. trifolii WSM2304 plasmid pRLG201, complete sequence | 1146179-1146207 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP054022 | Rhizobium sp. JKLM12A2 plasmid pPR12A201, complete sequence | 109175-109203 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP054023 | Rhizobium sp. JKLM12A2 plasmid pPR12A202, complete sequence | 271254-271282 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NC_012848 | Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132501, complete sequence | 680338-680366 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP024310 | Sinorhizobium fredii strain NXT3 plasmid pSfreNXT3c, complete sequence | 1698070-1698098 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP032695 | Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525c, complete sequence | 2199411-2199439 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP013054 | Sinorhizobium americanum CCGM7 plasmid C, complete sequence | 1553123-1553151 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP013054 | Sinorhizobium americanum CCGM7 plasmid C, complete sequence | 1642415-1642443 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP013054 | Sinorhizobium americanum CCGM7 plasmid C, complete sequence | 1742963-1742991 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP022566 | Rhizobium leguminosarum bv. viciae strain BIHB 1148 plasmid pSK02, complete sequence | 272085-272113 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP053206 | Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1D, complete sequence | 233800-233828 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP053208 | Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1B, complete sequence | 355860-355888 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP035000 | Rhizobium acidisoli strain FH23 plasmid pRapFH23b, complete sequence | 102067-102095 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP029232 | Sinorhizobium fredii CCBAU 45436 plasmid pSF45436b, complete sequence | 30321-30349 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP029232 | Sinorhizobium fredii CCBAU 45436 plasmid pSF45436b, complete sequence | 179903-179931 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP029232 | Sinorhizobium fredii CCBAU 45436 plasmid pSF45436b, complete sequence | 1334310-1334338 | 3 | 0.897 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP013110 | Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence | 2094555-2094583 | 4 | 0.862 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP032695 | Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525c, complete sequence | 1718305-1718333 | 4 | 0.862 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP015881 | Ensifer adhaerens strain Casida A plasmid pCasidaAA, complete sequence | 333399-333427 | 4 | 0.862 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP054033 | Rhizobium sp. JKLM13E plasmid pPR13E02, complete sequence | 430939-430967 | 4 | 0.862 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP023068 | Ensifer sojae CCBAU 05684 plasmid pSJ05684b, complete sequence | 1258985-1259013 | 5 | 0.828 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP021033 | Rhizobium sp. NXC14 plasmid pRspNXC14c, complete sequence | 273279-273307 | 5 | 0.828 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP032695 | Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525c, complete sequence | 2127702-2127730 | 7 | 0.759 |
CP034445_5 | 5.1|5336495|29|CP034445|CRISPRCasFinder | 5336495-5336523 | 29 | NZ_CP006990 | Rhizobium sp. IE4771 plasmid pRetIE4771d, complete sequence | 288654-288682 | 7 | 0.759 |
1. spacer 4.6|3331589|24|CP034445|CRT matches to NZ_CP022999 (Rhizobium sp. 11515TR strain 10195 plasmid p11515TR-A, complete sequence) position: , mismatch: 2, identity: 0.917
ggcaactggggcgccggcaacggc CRISPR spacer ggcaaatggagcgccggcaacggc Protospacer ***** ***.**************
2. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP050092 (Rhizobium leguminosarum bv. trifolii strain 22B plasmid pRL22b6, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
3. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP050104 (Rhizobium leguminosarum bv. trifolii strain 4B plasmid pRL4b2, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
4. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NC_019849 (Sinorhizobium meliloti GR4 plasmid pRmeGR4d, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
5. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP019586 (Sinorhizobium meliloti strain CCMM B554 (FSM-MA) plasmid pSymB, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
6. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP025507 (Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvA, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
7. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP009146 (Sinorhizobium meliloti strain RMO17 plasmid pSymB, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
8. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP050109 (Rhizobium leguminosarum bv. trifolii strain 3B plasmid pRL3b2, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
9. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP048281 (Rhizobium leguminosarum bv. viciae 248 plasmid pRle248e, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatatcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
10. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP030761 (Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed1, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
11. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP021828 (Sinorhizobium meliloti strain KH35c plasmid psymB, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
12. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP022666 (Rhizobium leguminosarum bv. viciae strain BIHB 1217 plasmid pPR1, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
13. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP021831 (Sinorhizobium meliloti strain HM006 plasmid psymB, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
14. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP021218 (Sinorhizobium meliloti RU11/001 plasmid pSymB, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
15. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NC_009620 (Sinorhizobium medicae WSM419 plasmid pSMED01, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
16. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP050086 (Rhizobium leguminosarum bv. trifolii strain 23B plasmid pRL23b7, complete sequence) position: , mismatch: 2, identity: 0.931
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgcgcagcggttt Protospacer **** ***********************.
17. spacer 4.1|3331310|24|CP034445|CRT matches to NZ_CP013436 (Burkholderia latens strain AU17928 isolate AU17928 plasmid pAU17928, complete sequence) position: , mismatch: 3, identity: 0.875
ggcaatggcaatgtcggcgtcggc CRISPR spacer tgcaacggcaatgtcggcggcggc Protospacer ****.************* ****
18. spacer 4.1|3331310|24|CP034445|CRT matches to NZ_CP013436 (Burkholderia latens strain AU17928 isolate AU17928 plasmid pAU17928, complete sequence) position: , mismatch: 3, identity: 0.875
ggcaatggcaatgtcggcgtcggc CRISPR spacer tgcaacggcaatgtcggcggcggc Protospacer ****.************* ****
19. spacer 4.1|3331310|24|CP034445|CRT matches to NZ_CP006373 (Aureimonas sp. AU20 plasmid pAU20f, complete sequence) position: , mismatch: 3, identity: 0.875
ggcaatggcaatgtcggcgtcggc CRISPR spacer ggcaatggcaatggcggcatcgga Protospacer ************* ****.****
20. spacer 4.1|3331310|24|CP034445|CRT matches to MH539650 (Siphoviridae sp. isolate ctcf5, complete genome) position: , mismatch: 3, identity: 0.875
ggcaatggcaatgtcggcgtcggc CRISPR spacer tgcaatggcaatgttggcgtccgc Protospacer *************.****** **
21. spacer 4.6|3331589|24|CP034445|CRT matches to MH697579 (Mycobacterium phage Cane17, complete genome) position: , mismatch: 3, identity: 0.875
ggcaactggggcgccggcaacggc CRISPR spacer ggcaactggggcgccagcaacccc Protospacer ***************.***** *
22. spacer 4.6|3331589|24|CP034445|CRT matches to MK359359 (Mycobacterium phage Colt, complete genome) position: , mismatch: 3, identity: 0.875
ggcaactggggcgccggcaacggc CRISPR spacer ggcaactggggcgccagcaacccc Protospacer ***************.***** *
23. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP050081 (Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b5, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcgcaaaagtgcgcagcggttt Protospacer **** **** ******************.
24. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP050099 (Rhizobium leguminosarum bv. trifolii strain 9B plasmid pRL9b5, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
25. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NC_012586 (Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
26. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NC_012586 (Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
27. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NC_012586 (Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
28. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NC_012586 (Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer acatgtcgcccaaaagtgtgcagcggttc Protospacer .*** *************.**********
29. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP025013 (Rhizobium leguminosarum strain Norway plasmid pRLN1, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcgcaaaagtgcgcagcggttt Protospacer **** **** ******************.
30. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP025014 (Rhizobium leguminosarum strain Norway plasmid pRLN2, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
31. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP025014 (Rhizobium leguminosarum strain Norway plasmid pRLN2, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
32. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP049731 (Rhizobium leguminosarum strain A1 plasmid pRL12, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcgcaaaagtgcgcagcggttt Protospacer **** **** ******************.
33. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP049732 (Rhizobium leguminosarum strain A1 plasmid pRL11, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
34. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP053441 (Rhizobium leguminosarum bv. trifolii strain CC275e plasmid pRltCC275eE, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
35. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP013634 (Rhizobium sp. N324 plasmid pRspN324d, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcgcaaaagtgcgcagcggttt Protospacer **** **** ******************.
36. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP018230 (Rhizobium leguminosarum strain Vaf-108 plasmid unnamed2, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgggcagcggttt Protospacer **** ************* *********.
37. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP013110 (Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
38. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP013110 (Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
39. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP013110 (Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
40. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP013110 (Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
41. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP029452 (Sinorhizobium fredii CCBAU 25509 plasmid pSF25509b, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
42. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NC_012858 (Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132502, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
43. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NC_011366 (Rhizobium leguminosarum bv. trifolii WSM2304 plasmid pRLG202, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcgcaaaagtgcgcagcggttt Protospacer **** **** ******************.
44. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NC_011366 (Rhizobium leguminosarum bv. trifolii WSM2304 plasmid pRLG202, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
45. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NC_011368 (Rhizobium leguminosarum bv. trifolii WSM2304 plasmid pRLG201, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcgcaaaagtgcgcagcggttt Protospacer **** **** ******************.
46. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP054022 (Rhizobium sp. JKLM12A2 plasmid pPR12A201, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcgcaaaagtgcgcagcggttt Protospacer **** **** ******************.
47. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP054023 (Rhizobium sp. JKLM12A2 plasmid pPR12A202, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgggcagcggttt Protospacer **** ************* *********.
48. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NC_012848 (Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132501, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcgcaaaagtgcgcagcggttt Protospacer **** **** ******************.
49. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP024310 (Sinorhizobium fredii strain NXT3 plasmid pSfreNXT3c, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
50. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP032695 (Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525c, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcgcaaaagtgcgcagcggttt Protospacer **** **** ******************.
51. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP013054 (Sinorhizobium americanum CCGM7 plasmid C, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
52. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP013054 (Sinorhizobium americanum CCGM7 plasmid C, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
53. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP013054 (Sinorhizobium americanum CCGM7 plasmid C, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
54. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP022566 (Rhizobium leguminosarum bv. viciae strain BIHB 1148 plasmid pSK02, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
55. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP053206 (Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1D, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcgcaaaagtgcgcagcggttt Protospacer **** **** ******************.
56. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP053208 (Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1B, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
57. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP035000 (Rhizobium acidisoli strain FH23 plasmid pRapFH23b, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcgcaaaagtgcgcagcggttt Protospacer **** **** ******************.
58. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP029232 (Sinorhizobium fredii CCBAU 45436 plasmid pSF45436b, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
59. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP029232 (Sinorhizobium fredii CCBAU 45436 plasmid pSF45436b, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
60. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP029232 (Sinorhizobium fredii CCBAU 45436 plasmid pSF45436b, complete sequence) position: , mismatch: 3, identity: 0.897
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcccaaaagtgtgcagcggttt Protospacer **** *************.*********.
61. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP013110 (Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence) position: , mismatch: 4, identity: 0.862
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer ggatgtcgcccaaaagtgtgcagcggttt Protospacer * ** *************.*********.
62. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP032695 (Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525c, complete sequence) position: , mismatch: 4, identity: 0.862
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer ccatgtcgcgcaaaagtgcgcagcggttt Protospacer *** **** ******************.
63. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP015881 (Ensifer adhaerens strain Casida A plasmid pCasidaAA, complete sequence) position: , mismatch: 4, identity: 0.862
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcgcaaaagtgcgcagcggctt Protospacer **** **** ****************.*.
64. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP054033 (Rhizobium sp. JKLM13E plasmid pPR13E02, complete sequence) position: , mismatch: 4, identity: 0.862
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gtatgtcgcccaaaagtgggcagcggttt Protospacer *.** ************* *********.
65. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP023068 (Ensifer sojae CCBAU 05684 plasmid pSJ05684b, complete sequence) position: , mismatch: 5, identity: 0.828
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer tcatgtcgcccaaaagtgtgcagcgggtt Protospacer *** *************.******* *.
66. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP021033 (Rhizobium sp. NXC14 plasmid pRspNXC14c, complete sequence) position: , mismatch: 5, identity: 0.828
-gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer agca-gtcgcgcaaaagtgtgcagcggttt Protospacer *** **** ********.*********.
67. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP032695 (Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525c, complete sequence) position: , mismatch: 7, identity: 0.759
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer gcatgtcgcgcaaaagtgcgcagtgcgca Protospacer **** **** *************.* .
68. spacer 5.1|5336495|29|CP034445|CRISPRCasFinder matches to NZ_CP006990 (Rhizobium sp. IE4771 plasmid pRetIE4771d, complete sequence) position: , mismatch: 7, identity: 0.759
gcatttcgcccaaaagtgcgcagcggttc CRISPR spacer ataagtcgcgcaaaagtgtgcagcggttt Protospacer ..* **** ********.*********.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
431107 : 439481
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP034445|431107:439481|DBSCAN-SWA TCTATTCGAGATAGCCGGACGGATCGACCGGCGCCGAGTTCTTGCGCACTTCAAAGTGCAGCTTGGGCGAGTCGGTGGTGCCGCTCATGCCGGACAGCGCGATCTCCTGGCCGCGCTTGACCTTCTGGCCGCGCTGCACCTCGATCGAGCTTGCATGGCCGTAGACGGTGACCAGGCCGTTCTCGTGCCGCACCAGCACGGTGTTGCCGAACTCCTTGAGACCGTCGCCGGCATAGATGACGACGCCGTTCTCGGCCGCCTTGATCGGCGTTCCTTCCGGCACCGCGATGTCGACGCCGTCCTTGCCGGAGCCGAAGCTGGAGATCACGCGGCCGCGCACCGGCCAGCGCATCTTGCCGATGCCGGTCGCGTCGGGCGCCTCGGCATTGTCGTCCTCGGCCTGCTGGATGACCTTGGCATCCTTCTTCGGCGGCGTGTAGGAGGCGAGCGTCTCGGTGGTCTTCGCCGGCGGCGGCGTCGTCGCCGTCGTCACCGGATCGACCGCGGGCTTGGTGGTGGCCGCCGGCTTGGCAGCCGCCACGACGGTGCCGCCAGCCGGAACCTTGAGCGTCTGTCCGATCTTGAGCAAGCCGTCCTGCATGCCGTTGGCCTGCTTCAGCGTCGTGACGCTGACGCCCGTCTTCCTGGCGATCGACGACAGCGAGTCGCCCTGCTGGACAGTATAGGTGCCTCCGGCGCCGGCGGGCTTTGCCTCGGCAACCTGCGGCGCGGGCTTGGGGTTCTTGGAGCCCGTGGCCGCAGCCGACGCATCGACCTGAGCGGCCGCCTTGCCGTCCTTGACCTTCGGCTGCTGCGGCAGCACCGCGACCTTTTCCGGCGCGGGCTGCTTCGGCGTGTTCGCCGGCTTGCCTTCGGCAACCTTCGGCTCCGCCTTGCTCGAATAGGCATAAGCGGGGATGACGATCTTCTGGCCGGACCGGATGCCCTTCTTCGGATCGATGCCGTTGACCTTGGCTATAGCATCGGCCGGCACGTGGTAGTGCGCGGCCAGGCCTGAAAGCGTCTCGCCGTCCCGGACAACGATTTCGGTCGCATGTGGCGCAGCACTTGCGTTGCGGACCGCGTCAGGCTCGGCGGTCTTGAACGGTTTTGCCGGCGTGACCGTGCCGGTGGCCGTCCTGTCGAGATGCGGTGCCGGCGCAAGCGCCGGAGCCGGCGCGAGGGCGGGAGCCGTAGCGGTACGCATCGGCTTTGCCGTAGCCGGCGCCGAGGGCGGCGGCAATTGCTGGGTCGTGACCGGCTCGAGGCTGGAGCGGCTAACCGACTGGGTGTGCGTGCCATCGACCGGCGCAGGCGCGACCGTGTCGCCCGGATAGGGCTGGTCGACATTCTGCTTGTTGATGATGGCGCGCTGATTGTTGGTGGATGACGTGAAGACATCATCGACACCGTTGAAGCGCATCGACTGGGAACTGCACCCGGCCGCCGCGCCGGCAATCATCAGCACAGCGCAACCGCGCGCCAGATTGCGTCTGTTTGCTTTCAAGTGATTGAATTGCATCGCACTAACCCGCACAGACCCGGTACAAACTGGCCCTGATTAAAGCGCGTTAATGTTACTGCCGGGTTAACCCCTTAGAATCCGACGAAAATTTTCTTAAAACATTTCGGGCCGCTGGACGGGGCGGAGGTTCAGGTCAGCCCGAGTCGAGAATGGTGGTGTCCGAGAACCGGAGCGGAGCGTATTATCAGTACGTGAGCACCGGAAGCGCAGGAGACCGCCTTTCGCAGGCCGGGCTCACCTGAATATCGGCCCCGCCCCTAGATCACCGCGGCGACGCTGCGCAGGATCGGCTGCAGCCTGACCATGCCGATGTCCTCGCGCTCGAAGCGGCTGCCGACCTTGGTGAGCTTGGCCAGCACCTGCTCGCCCTCCTCCGGGCCGATTGGCGCGATGACGATGCCGCCGCTCGACAATTGGTCGAGCAGGAAGCGCGGCAGGCTGTCGAAGGCCGCCCAGGCGACGATGCGGTCGAACGGCCCTTCATTGGGCAGGCCGTTGGAACCGTCGGCCTGGCGGACGATGACGTTGCTGATGGCAAGCGCCTCGAAGCGCTGCTTGGCCTGCTCGGTGAGCGTCTTGTAGCGGTCGATGGTGATGACGCGCGCGGCAAGCCGCGACATGACCGCTGCCGTGTAGCCCGAGCCGGTGCCGATCTCGAGCACGCGGTTGCCCGGCTCGATGTGCAGCGCGGCAATCACGGCTGCCTGCAAATCGGCGCCCTCGATCGCCTCGCCGCATTCGATCGGCAGCATGCCGTCCGACCAGGCGATCGAATGGAACTGGGCCGCCAGGAAACCGCGCCGCGGCGTCGCCTCGAAGGCGGCGACCAGTGCCTTGGGCGCCGTGCCCCTGCCCCGCAGCCTGAGCAGGAAAGCGGCGAAACCTTCGCGGTCGTCGATGCCTGTGTTCATGCCAAAACCACGTTCATGCCAATGCCTTGTTCATGACAGCGCCTTGGTCAGCTGGTCGCGGAGTTCATGGGCGGTGAGGTCGAGCTGCAGTGGCGTCACCGACACCAGCCGGTTGCGCAGCGCGTAGAGGTCCGTGCCCTTCTTGCCTTCGACCGGCTCGCGGCCGAAGCGCAGCCAGTAGTAAGGCAGGCCGCGTCCGTCGCGCCGCTCATCGACCCAGAGGCTGTGCACCAGCTTGCCCTGCGAGGTGACGACGGTGCCCTCCACTTCGTCGGGAGCGCAGTTCGGAAAATTGACGTTGAGCAGCACGCCGTCCGGCAGCGGCGTTTCCACCAGTCGCTTCAACAGCGCCGGCGCCAGCGACTCGGTCGTCTCGTAAGGAACCACGCGATCCTCGCCGACATAGGAATAAGCCTGGCTGACCGCGATCGAGCGCACGCCGAGCAGCGCGCCTTCCATGGCGCCTGCAACGGTGCCGGAATAGGTGACATCATCAGCAATGTTGGCGCCGGAATTGACGCCGGACAGGATCAGGTCCGGCGGTCCGGGCAGGATCTTCTTCGTACCCATGATGACGCAGTCGGTCGGCGTGCCGCGCACCGCGTAATGTTTCTCGCCGATTTTGCGCAGCCGAAGCGGCTCCGAAATCGACAGCGAATGCGCGTAACCGGACTGGTCCTGCTCGGGCGCCACCACCCACACGTCGTCCGACAGCGTGCGGGCGATGCGTTCGAGCGACGCCAGCCCTTCGGCGTGGATGCCGTCATCGTTAGTCAGCAGTATGCGCATCAAGTCACTTCGCTTCGATCTTGGCGAGACCACCCATGTAAGGCCGCAGCACTTCAGGAATGGTTACGCTGCCATCCTCATTCTGGTAGTTTTCGATGACAGCTATGAGAGCGCGGCCGACGGCGGTACCCGAACCATTGAGCGTGTGAACGAAGCGGTTGCCCCTGCCGTCCTTGTCCTTGTAGCGGGCATCCATGCGCCGTGCCTGGAAGTCGCCGCAGACCGAGCAGGACGAGATTTCGCGATAGGCGTTCTGCCCCGGCAGCCAGACCTCGATGTCGTAGGTCTTGCGCGCGCCGAAACCCATATCGCCGGTGCAGAGCGTGACGGTGCGGAACGGCAGGCCGAGCCGCTTCAGCACCTCTTCGGCGCATTGCGTCATGCGCTCATGCTCGGCGATCGAGCTTTCCTGGTCGGTGATCGAGACCAACTCGACCTTGTAGAACTGATGCTGGCGCAGCATGCCGCGCGTGTCGCGGCCGGCCGAGCCCGCTTCCGAGCGGAAGCACGGCGTCAGCGCCGTGTAGCGCAGAGGCAGCTTTTCATGCGCGGTGATCTCCTCGCGCACGAGGTTGGTGAGCGGCACTTCCGCTGTGGGGATCAGGCCAAGCCGACCGTCGCCATGCGGCGTGAAGAACAGGTCTTCCTCGAATTTCGGCAGCTGGTTGGTGCCGAAAAGCACCTCGTCGCGCACCATCAGCGGCGGAATGACCTCTTCGTAGCCGTGCTCGGTCGTGTGCAGGTCGAGCATGAACTGGCCGATCGCGCGCTCCATGCGGGCCAATTGGCTTTTCAGCACGGTGAAGCGCGCGCCGGACAATTTCGCCGCCCGCTCGAAATCCATCATGCCGAGCGCTTCGCCGATCTCGAAATGCTCCTTCATCCAGTTCGGCCGCGTCGGCACCTCGCCGACGATGCGCTTGACGACATTGTCGTGCTCGTCCTTGCCGACCGGCACGTCCTCCAGCGGCACGTTGGGCAGCACCGCAAGCGCGTCGTTCAGCGCCTTGTCGAGCTCGCGCTCGCGCGCCTCGCCGTTCTGGATGAACGCCTTGATGTCGCTGACTTCGCCCTTGAGCTTCTCGGCAAGCGCTGCATCGCCCGAACGCATGGCGTTGCCGATTTCCTTCGAGGCGGCGTTGCGGCGCTCCTGCTTGACCTGCAATTCGCTGAGATGCGAGCGCCGCGCCTCGTCCTTCGCAATCAGATCGTCGACCATGGACTGCGCCTCGTCAGCCGACCACGAGCGCTTCACCAGCGCCTCGACAAGGGCCTTCGGGTTGTCGCGAATCCATTTGATATCAAGCATGGTCGATCCTCAGAAGTAAGGACCGAGGGCGTAAACCGCATCGGCTATCGATGCAAGACGGCGGATAGGCGCGCCGCCTCGATTTCCCTGCCTTGCCGCCGCTCAGGTGGACGGCGCGTCCGGCGTCTTGTCGCCCGCGACAGAGCTGTCCGCATTCGCCTGCTCGCGCGCCAGCCGGTCCCGTGCCTGGGCGCGCTCGATCACGCGCGATGACCAGATGGCGACCTCGTAGAGAATGATCGTCGGGATGGCGAGGCCGCACTGGCTCATCGGATCGGGCGGCGTCAGCACCGCTGCGACCACGAAGGAAAGGACGATTGCCCATTTGCGCTTCTCGGCCAGCGCCTGCGACGACAACAGGCCGACGCGCGTCAAAAGGCTGGTCACCACCGGCAGCTGGAACACCAAGCCGAAGGAGAAGATCAGCGTCATGATCAGGCTGAGATATTCCGAAACCTTCGGCAGCAGCGAAATCTGCACCTGGTCGTTGGTGCCGACCTGCTGCATGGCCAGGAAGAACCACATCACCATCGGGGTGAAGAAGAAATAGACCAGCGAGGCGCCCATCAGGAACAAGATCGGCGACGCGATCAGGAACGGCAGGAAGGCGCTGCGCTCATTCTTGTAGAGGCCCGGCGCGATGAACTTGTAGATCTGCGTGGCGATCAGCGGGAAGGCGATCACCATGCCGCCGAACATGGCAAGCTTCACCTGGGTGAAGAAGAACTCCTGCGGCGCGGTATAGATCAGCTCGACCTTGTGCGGGTCGAGGCCTGCCCATTGCGTTGCCCATTTGAAGGGGATGACCAACAGGTTGAACAGGCGCTTGGCGAAAAAGAAGCAGACGAGGAAGGCGACGAAGAAGCCGCCCAGCGACCAGATCAGGCGGCGGCGCAACTCGATGAGATGCTCGATCAGCGGCGCCGACGATTTCTCGATTTCTTCCCGCTCCTTGTCCGATACGCTCACTTGGCGGCTCCCGCCGTCTTCTTGGTGTCAGCCGTCTTCTTGGAGGCAGGCTTCCTGGCCGCGGCAGGCTTTGCCTCAGCCTTGGGCGCGGCCTTTGCGGCAGCCGTTGCGGCTTTGGCCGGCGCCGAGGATTTGGCCAGCGCCTTTGCAGCCGATGCCTTCGATGCCGCTGCCGTTTTCGCCACAGCGTCGGCCGTCTTGGCCTTAGTCACCGCAGTCTTTTTCGCCGGTTTGGGCGCCACAGGCGCATCCGAAACCGAGGCGGTGACCGAGGCGTCCGTCATGGCCGGGAAGGTCGGAGCGGCCGGGGCGGGCTCGGTTGCGCCGACGCCGGGCATGTCCGTCGCGCCGTTCTTGAGCGGCTCGGCGGGCTGCGGCGTGTCTGCCGCCGCCGCGGCAGGATCGGCGGCGGGCTTCGGCTTCATCATCGTGTCGACGCCGGCCCGCACATCGGCTGCGGCCTGCTCGAACGGATTGAGCTGCTTGCGCACCTCGTTCATCGGATTGAGGCTGCGAAGACTGTCGATCGAACTCTTGACGTCGTCGAGCTCGGCTTCCTTCAGCGCCTCGTTGAACTGCTTCTGGAAGTCGCCGGCCATGGCGCGCAGTTTCGCCGCGGTGCGGCCGAAGGTGCGCAGCATGTTGGGCAAATCCTTGGGCCCGACGACCACGATCATGACGACCGCGATCACCAGCAGTTCGCTCCAACCGACTTCGAACATGACAATCTATTCCGACGAGAACACGCCGCCCGCATATGCGAAGCGGTTTTGGTACAACGGCATGCTCAAACCAACAATCGGATCAGCTCTTGCTGACCTTTTCCTTGCCCGGCGAAACAGTCTCGTCGGCGCGGTGCTCGACGGTACGCTTGTCGTCAACGTCGTCGTCAGCCATGCCCTTCTTGAAGCTCTTGATGCCCTTGGCCATGTCGCCCATCAGCTCGGGGATCTTGCCGCGGCCGAACACCAGAAGCACGATGACCAGCACGATCATCCAGTGCCAAATCGAAAACGAACCCATAACGATCTCTCTCGAATGTTTTCCCTCGGCGATGATCTATGCGTTTTCGCTCGGGGATTCAAACACAACTGCAATGAAGTTGATAAAATGGCTAGCGGATGTCACGCACTTTCGACGTGAGCGCCTTGCTACGAATCCGCGCAGGCCTTGCTATTAGACCAGTTGTGCCGGTTCCCTAAAAGCATGTCTCCCGAAAGTGGGAACCGGTTTCGGGATAAAGACATGCGTAAAATCAAAAACCTAAAGCGCGTGGAGCGAACCTGAAAAGAACCTGAAAGATCGCGACGCGCTTTAGGTCACTCTTCCGTGACGCGCGGCGTCAGCAGCCCGAGCTCCTCGAGGTCGATATCGGTCAGCTCATCCTCGTCCTCGGCGAGCGCGTCGGAATCGGCTGGCGGCACCGGCATCGAGAAATTGGCCGGCATGCGCGTCGAGAGCAGGCCGGCGCCCTTCAGCTCTTCCATACCGGGCAGATCGCGGATCTCCTCCAGCGCGAAATGATCGAGGAAGGCGTCGGTGGTGCCATAGGTCACCGGCCGTCCCGGCGTGCGGCGGCGGCCGCGCATCCTGACCCATTCCGTTTCGAGCAGCGTGTCGAGCGTGCCTTTCGACGTCTCCACGCCTCGGATGTCTTCGATTTCGGCGCGCGTGACGGGCTGGTGATAGGCGATGATCGCCAGCACTTCGAGCGCCGCGCGCGACAGCTTCCGTTGCTGAACGGAGTCCCGGCTCATCAGGAAAGCCAGATCGCCTGCGGTGCGGAAGGCCCAGGCATCGCCGACCCGCACCAGGTTGACGCCGCGCTTGGCATAGATCTGCTGAAGATCGGCCATCGCGGCGGCGATGTTGACGCCTTCAGGCAGGCGCGCGGCAAGCTGCTTTTCGCTGACCGGCTCGGCGCTGGCGAAAACGATCGCCTCGGCCATGCGCACCGCTTCCGACAGCTGCAGCCGCTCGGCGGGGTTTTCGAGTGAACCCTGCTCGGCCATCTCGTCTTCCGGCTCATCCTCAACCCTGAACGGGATGACTGAGGCGTTGGCGCGTTCACTCATGATACCACCTCGACTGCCTTTGCACCTTGCGCGCGGCTGCGCAGATAGATCGGCGCGAACACCTGGTCCTGCCGCACTTCCAGTTTGCCCTCGCGCACCATTTCGAGGGTCGCCGCGAACGAACTGGCCATCGCCGTGCGCTTCTCTTCCGGCGCCGCCAGATACTCGATCAGGAAACTGTCGAGCGCCGTCCAGTCGCCGACCGCGCCGATCAGCCGCGCCAGCACCTCGCGCGCATCCTTCAGCGACCACACGGCGCGCCGGGCGATCGTCACATTGTTGATGGCCTGCCGCTGGCGCTGCTGCGCGTAGGCGGTGAGCAGATCGTAAAGCGAGGCCGAAAAGCTGTTGCGCTTCTCGATGATGACCATTTCCGGCATGCCGCGCGCGAAGACGTCGCGGCCGAGGCGATTGCGGTTGACCAGCCGCGCCGAGGCGTCGCGCATGGCTTCCAGCCGCTTCAGCCGGAATTGCAGCACCGCCGCCAGTTCCTCGCCGCTTTCGCCCTCCTCGCCCGGCTGTTTCGGGATCAGGAGCTTCGATTTCAGGAAGGCGAGCCACGCCGCCATCACCAGATAGTCGGCGGCAAGCTCCAACCTCAGCGCCCGCACCTTCTCGATGAAGGCCAGATACTGCTCGGCCAAAGCCAGGATCGAAATGCGTGCCAGATCGACCTTCTGGTTGCGGGCAAGGTGCAGGAGAAGATCGAGCGGACCCTCGAAACCGGCCACGTCGACGACCAGCAATGGGTCGCCGGTCAAGCGCGAATCGTCGTTCTCGGCCCACAGACGGTCCAT
Protein sequences of DBSCAN-SWA_1 >CP034445|431107:439481|437413_437632_-|AZO01994.1|DBSCAN-SWA MGSFSIWHWMIVLVIVLLVFGRGKIPELMGDMAKGIKSFKKGMADDDVDDKRTVEHRADETVSPGKEKVSKS >CP034445|431107:439481|436605_437331_-|AZO01993.1|DBSCAN-SWA MFEVGWSELLVIAVVMIVVVGPKDLPNMLRTFGRTAAKLRAMAGDFQKQFNEALKEAELDDVKSSIDSLRSLNPMNEVRKQLNPFEQAAADVRAGVDTMMKPKPAADPAAAAADTPQPAEPLKNGATDMPGVGATEPAPAAPTFPAMTDASVTASVSDAPVAPKPAKKTAVTKAKTADAVAKTAAASKASAAKALAKSSAPAKAATAAAKAAPKAEAKPAAARKPASKKTADTKKTAGAAK >CP034445|431107:439481|438680_439481_-|AZO07276.1|DBSCAN-SWA MDRLWAENDDSRLTGDPLLVVDVAGFEGPLDLLLHLARNQKVDLARISILALAEQYLAFIEKVRALRLELAADYLVMAAWLAFLKSKLLIPKQPGEEGESGEELAAVLQFRLKRLEAMRDASARLVNRNRLGRDVFARGMPEMVIIEKRNSFSASLYDLLTAYAQQRQRQAINNVTIARRAVWSLKDAREVLARLIGAVGDWTALDSFLIEYLAAPEEKRTAMASSFAATLEMVREGKLEVRQDQVFAPIYLRSRAQGAKAVEVVS >CP034445|431107:439481|434335_435640_-|AZO01991.1|tRNA|DBSCAN-SWA MLDIKWIRDNPKALVEALVKRSWSADEAQSMVDDLIAKDEARRSHLSELQVKQERRNAASKEIGNAMRSGDAALAEKLKGEVSDIKAFIQNGEARERELDKALNDALAVLPNVPLEDVPVGKDEHDNVVKRIVGEVPTRPNWMKEHFEIGEALGMMDFERAAKLSGARFTVLKSQLARMERAIGQFMLDLHTTEHGYEEVIPPLMVRDEVLFGTNQLPKFEEDLFFTPHGDGRLGLIPTAEVPLTNLVREEITAHEKLPLRYTALTPCFRSEAGSAGRDTRGMLRQHQFYKVELVSITDQESSIAEHERMTQCAEEVLKRLGLPFRTVTLCTGDMGFGARKTYDIEVWLPGQNAYREISSCSVCGDFQARRMDARYKDKDGRGNRFVHTLNGSGTAVGRALIAVIENYQNEDGSVTIPEVLRPYMGGLAKIEAK >CP034445|431107:439481|437928_438684_-|AZO01995.1|DBSCAN-SWA MSERANASVIPFRVEDEPEDEMAEQGSLENPAERLQLSEAVRMAEAIVFASAEPVSEKQLAARLPEGVNIAAAMADLQQIYAKRGVNLVRVGDAWAFRTAGDLAFLMSRDSVQQRKLSRAALEVLAIIAYHQPVTRAEIEDIRGVETSKGTLDTLLETEWVRMRGRRRTPGRPVTYGTTDAFLDHFALEEIRDLPGMEELKGAGLLSTRMPANFSMPVPPADSDALAEDEDELTDIDLEELGLLTPRVTEE >CP034445|431107:439481|433572_434331_-|AZO01990.1|DBSCAN-SWA MRILLTNDDGIHAEGLASLERIARTLSDDVWVVAPEQDQSGYAHSLSISEPLRLRKIGEKHYAVRGTPTDCVIMGTKKILPGPPDLILSGVNSGANIADDVTYSGTVAGAMEGALLGVRSIAVSQAYSYVGEDRVVPYETTESLAPALLKRLVETPLPDGVLLNVNFPNCAPDEVEGTVVTSQGKLVHSLWVDERRDGRGLPYYWLRFGREPVEGKKGTDLYALRNRLVSVTPLQLDLTAHELRDQLTKALS >CP034445|431107:439481|435742_436609_-|AZO01992.1|DBSCAN-SWA MSVSDKEREEIEKSSAPLIEHLIELRRRLIWSLGGFFVAFLVCFFFAKRLFNLLVIPFKWATQWAGLDPHKVELIYTAPQEFFFTQVKLAMFGGMVIAFPLIATQIYKFIAPGLYKNERSAFLPFLIASPILFLMGASLVYFFFTPMVMWFFLAMQQVGTNDQVQISLLPKVSEYLSLIMTLIFSFGLVFQLPVVTSLLTRVGLLSSQALAEKRKWAIVLSFVVAAVLTPPDPMSQCGLAIPTIILYEVAIWSSRVIERAQARDRLAREQANADSSVAGDKTPDAPST >CP034445|431107:439481|431107_432628_-|AZO01988.1|DBSCAN-SWA MQFNHLKANRRNLARGCAVLMIAGAAAGCSSQSMRFNGVDDVFTSSTNNQRAIINKQNVDQPYPGDTVAPAPVDGTHTQSVSRSSLEPVTTQQLPPPSAPATAKPMRTATAPALAPAPALAPAPHLDRTATGTVTPAKPFKTAEPDAVRNASAAPHATEIVVRDGETLSGLAAHYHVPADAIAKVNGIDPKKGIRSGQKIVIPAYAYSSKAEPKVAEGKPANTPKQPAPEKVAVLPQQPKVKDGKAAAQVDASAAATGSKNPKPAPQVAEAKPAGAGGTYTVQQGDSLSSIARKTGVSVTTLKQANGMQDGLLKIGQTLKVPAGGTVVAAAKPAATTKPAVDPVTTATTPPPAKTTETLASYTPPKKDAKVIQQAEDDNAEAPDATGIGKMRWPVRGRVISSFGSGKDGVDIAVPEGTPIKAAENGVVIYAGDGLKEFGNTVLVRHENGLVTVYGHASSIEVQRGQKVKRGQEIALSGMSGTTDSPKLHFEVRKNSAPVDPSGYLE >CP034445|431107:439481|432888_433542_-|AZO01989.1|DBSCAN-SWA MNTGIDDREGFAAFLLRLRGRGTAPKALVAAFEATPRRGFLAAQFHSIAWSDGMLPIECGEAIEGADLQAAVIAALHIEPGNRVLEIGTGSGYTAAVMSRLAARVITIDRYKTLTEQAKQRFEALAISNVIVRQADGSNGLPNEGPFDRIVAWAAFDSLPRFLLDQLSSGGIVIAPIGPEEGEQVLAKLTKVGSRFEREDIGMVRLQPILRSVAAVI |
9 | uncultured_Mediterranean_phage(85.71%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
3154863 : 3212569
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP034445|3154863:3212569|DBSCAN-SWA CTCACGCCACGGACCTTCGCGCGCCAAGCTCGGCATTGCGGACATGCGCGACGATGATGGCAAGCGCCGCCGGCGTCATGCCTTCCATGCGTTGCGCGTCGGCGATCGACCGCGGCTGGCGCGTCTTCATCTTCTGCTTTAACTCGTTCGACAGGCCTGGCACATCCGAGAAGTCGATCTCCTCCGGGATGAGCCTCGACTCTTCATGCCTGATCTGCGCCACGTCCGCCTGCTGGCGATCGAGGTAAACCGAATATTTCGCTTCCGTTTCAAGCCGCTCGGCTGTCTTGGCGTCAAGCGCGGCAAAACGCGGCTCGACACGAGTAAGCCAGGCCATGTCGACGCCCGGATGCGCCAGCAGTTCATAGGCCGAGCGGCGCACGCCGTCCTTGTTGATCTCCAGCCCGTGCCGCGCCGCCTCGTTCGGCGTCCAGGCTACCGACTTCGCTAAGATACGAGCCTCATCGAGCCTCTGGATCATCCCGCCGAAACGCCTCAGCCGCTCGGGCGACGCGATGCCCAGCTTCCCGGCCAGCGGCGTCAGCCGCTCGTCGGCGTTATCGGCGCGCAGCGAAAGCCGGAACTCGGCACGCGAGGTGAACATCCGATAGGGCTCTGCGATACCGCGGCTGGTGAGGTCATCGACCATCACGCCGATATAGGCTTCGGTCCGGCTGAGCACGAATTCGTCGCCGCCGGCCGCCTTGCGCGCCGCGTTGATGCCGGCGAGCAGGCCTTGCGCGCCGGCTTCTTCATAACCGGTCGTGCCGTTGATTTGCCCGGCGAGGTACAGGCCGCCCACTCGCTTCGTCTCCAGCGTCGTCTTCAGTTCGCGCGGGTCGACATGGTCGTATTCGATCGCATAGCCCGGCTGCAGCATCGTCGCCTTCTCCAGCCCGGGAATTGTCTTCAGGATCTCGAGCTGAACGTCCTCCGGCAGCGAGGTCGAGATGCCGTTCGGATAGACCGTGTCGTCATCGAGACCTTCCGGCTCCAGGAAAATCTGATGGCCCTCGCGGTCACCGAACTTGACGATCTTGTCCTCGATAGATGGGCAATAGCGCGGGCCGACGCCTTCGATCGAGCCGGAATACATGGCCGAGCGGCCGAGATTGGCGCGGATCAGTTCATGCGTTGCCGGCATGGTGCGGGTGATGCCGCAATGGATCTGCGGGTTGGCGATAGCGTCCGTCATCAACGAGAACGGCACCGGATCTTCGTCTGCGGCCTGGCTTTCCAGCGAGGCCCAGTCGATGGTGCGGCCGTCGAGGCGAGGGGGCGTTCCGGTCTTCAGGCGCCCGAGCTGGAAACCCGCGCGCGCCATCGTCGCCGACAGGCCGTGGCTCGCCTGCTCGTTCATGCGGCCGGCGACGATCTTCTTCTCGCCGATATGGATCAGGCCGCGCAGAAAGGTGCCGGTGGTCAGCACCAAGGCGCCGCAAGCCAACCGGCGGTCATCGGAAATCTGAACGGCAGCGATTCGACCGTCCATGATTTCGAAGTCGAGAGCCTCGCCTTCGATCACGTCGAGATTGTTCTGCTGCCGAATCGCTTCCTGCATGGCCAGCCGATAGAGCTTGCGGTCGGCCTGGGTGCGCGGGCCGCGCACGGCCGGCCCCTTGCGGCGATTGAGCAGGCGGAACTGGATGCCGGCGGCGTCGGCGATGCGGCCCATCAACCCATCCATGGCGTCGATCTCGCGAAGGAGATGGCCCTTGCCGAGACCGCCGATCGCGGGATTGCAGGACATGACGCCGATCGTATCGAAGCGAAGCGTCACCAGCGCGGTCCTGGCGCCGGCGCGCGCGGCAGCACTCGCTGCCTCGCAGCCGGCATGGCCGCCGCCCACCACCACAACATCATAGTGATCGGTCATTTTTCCAAAAATCCGCTTTCAGAGATGGACAGACACATGTCCCTGCCACCTCGAGCTGTCAAGGATGAAGACCGGAGCCGTTTCACGTGAAACATTAATCCGGCCCAATTGCGTATTGTTTCACGTGAGTCACTGCCGGACAGAGGCCGCTCCATGTTTCACGTGAATCATTTACCAATGCAGAACTGCGAAAAGATGACGTCGAGCAGATCCTCTACATCGACGGCACCGACGATCCGCCCGAGGCGCTCCGCCGCCAACCGCAATTCCTCGGCGCGCAACTCTTGGCTTTGCCCTGACAGCGCCGCCCTCAGGAAATCCATCGCCTCTTGCAGCAGCTCGACATGGCGCATTCTGGAGGGCAGGACGTCACCGGCCTCGCCAACCGCGGCGGCGGCGCGGCTTCCGATCTCGTCGAGAAGGCACGCCAGTCCGCTGCCGTCCCTAGAAGAAATCATCAGGTCGTAGGCGCCACCGCCCGCATCGGCGATGTCCGACTTCGTGCCGATCCGCAGCAGCGGAGCCCCAGCGGGGACTCCTCCGACAGGTACCGGATTGGCCATATCCTCTAACAGCAGGACGAGATCGGCCACGTCAGCCTTGGCCCGCGCCCTTTCGATGCCGATCGATTCCACCTTACCCGCGGCGTCGCGCAGCCCGGCGGTATCGACAAGCCGTACCTTCAGCCCGTTGAGGTCCATGACGACCTCGAGCAGATCGCGGGTGGTGCCCGGTTCGTCGGTGACGATCGCCGCCTCGCGCCGCGCAAAGGCGTTGAACAGACTGGACTTGCCGGCATTCGGCGCGCCGAGGATCACGACCTCAAAACCGTCACGGATGATCTCCGCTGCCTTGAACCCCTCGACGTGCCGTTCGATCTCGCCAATCATCGCTGTCACGTCTGACCAGACCGCCTCCGAAACGGAACCCGGCACGTCCTCCTCGTCGGCGAAGTCGATCTCCGCCTCGATCATCGCCCGCGCATGGATCAGTCGGCGGCGCCAGCCTGAATAGAGCTCGCTCTGCGCGCCTTCGGCGTTGCGCAAGGCAAAGCGACGCTGCGCCTCGGTCTCGGCATTGACCAGATCGGCCAAGGCCTCCGTCTCGACCAGATCCAGCCTGCCGTTGAGAAAGGCGCGCCTAGTGAACTCCCCGGGCTCGGCATGTCTGACGCCTTCGAAGCCGGTGATCCTTTCCAGCATCTTGGCGGCGACCGCGCGGCTGCCATGCACCTGGAACTCGGCGACATCCTCGCCGGTGAAGCTGCCGGGGCCCGGAAAGAACAGCACGAGCCCGTGGTCGATCGTCGAGCCATCCGCCGCCCTGATCGTCCGTAAATTGGTAAACCGATCCTGAACCGGCCCGGCGATCGTTTCGACCACGAATCGAGTTTTCGGCCCGGAAATACGGATAACGGCGATGCCTGCGGGGAGGCGGCCGCTGGAGAGCGCGACGATCGAATCCTTCGAAGCCATTGGCGAAACCATTTGCCGGACCTGCCGAACCGCCTACATTAAGAGGTAGAGCGTGTCGGCCGCTTCACCTTTCGGCATCACGCTCTGACCTCTGCCTCAAACCACGCCGAGCCGCTCATCGTGACCTTGCCCACGCGAAACCTGCTTGCCGAAGAAGCCAGCCCCTATCTGCGGCAGCACAGCGACAATCCCGTCCATTGGCGCGGCTGGTCGCGGGCCGCCCTTGCCGAAGCCAAGGAACTGGGCCGGCCGATCCTGCTCTCCATCGGCTACGCCGCCTGCCACTGGTGCCATGTCATGGCCCATGAAAGCTTCGAGAACGACGCCGTCGCGGCCGTCATGAACCGGCTCTACGTGAATATCAAGGTCGATCGCGAGGAGCGGCCGGACATCGACCAGATCTACATGGCTGCGCTCCATGCCATGGGCGAGCAGGGCGGCTGGCCGCTGACCATGTTCCTGACGCCCGACGGCAAGCCGTTCTGGGGCGGCACCTATTTTCCGCGCGAGGCGCGTTACGGACGACCGGGCTTCATCCAGGTGCTGGAAGCAGTCGACAAGGCCTGGCGGGAAAAGCAGCAGAGCCTCGCGGAAAGCGCCGACGGGCTGGCCGTCCATGTCGAATCCCGGCTTGCCGGAACAAAGGGCAAGGCGGTGCTCGACCGCGATACGCTCAGCGATCTCTCCGGCCGCATCGACGGCATGATCGACCGCGATCTCGGCGGCTTGAAGGGCGCGCCAAAATTCCCGAATGCGCCCTTCATGCATACGCTTTGGCTGTCCTGGCTGCGCGACGGCCAGGTGGACCACCGCGATGCCGTGCTCACCAGTCTCGAAAAGATGCTTGCCGGCGGCATCTACGATCATGTCGGCGGCGGCCTCAGCCGCTATTCCACCGACGCCGAATGGCTGGTGCCGCATTTCGAAAAAATGCTCTATGACAATGCCCAGCTCATCCGCCTGTGCAACTGGGCCTATGCTGCCACCGGCAAAGGCTTGTTCCGGCTGCGAATCGAAGAGACCGTCGCCTGGCTGCTGCGCGAGATGCGTGTCGAAGGCGGCGCCTTCGCCGCCAGCCTCGACGCCGACAGCGACGGCGAGGAGGGGCTCTTCTACACGTGGAGCCGCGACGAGATCGAGGCCGCGCTGGGCGACGACGCCCCGACTTTCTTTCAGTATTTCGAGCTGGCCGCACCGCATGGCTGGGAAGGCAAGCCGATCGTCCGCCAAAGCGAAATGCAGCAAAGCAAAGGCATTGCCGACTACGCCAATCTTGCGCCTTTGAAAGCGAAGCTGCTGGCTTTCCGCGAACAGCGGGTCAGGCCGGGCCGTGACGGCAAGGTGCTCACCGACTGGAACGGTCTGATGATCGCAGCACTCGCCGAAGCCGGCCGCTCCCTTGGCAGGCGCGACTGGATCGATGCCGCGGCAAAAACCTTCACCCATATTGTCGAGGCCAGCCTGGACGGCCGTCTGCCGCATTCCATGCTCGGCGCGAAGACGCTTTTTCCCGCACTGTCCAGCGATTACGCCGCCATGACCAATGCCGCGATTGCGTTGTTCGAGGCAACCGGCGACCCGGCCTATTCCGATCACGCCAGGCTGTTCATCGGCGAGCTCGATCGCTGGCACCTGGATGCGGAAAAGACCGGCTACTGGCTGACGGCGTCGGATAGCGGCGACGTGCCGATCCGCATCCGCGGCGATGTCGACGAAGCAATTCCCTCGGCCACCAGCCAGATCGTCGAGGCGTTGGTTCGGTTGTCTTCGCTCACCGGCGACCTCGAACTCGGGGAGAAGGCCTGGGCGGCCGCCGAGCACGCCATGGGCCGCGCATCCCAGCAGGCCTACGGCCAGGCCGGCATCGTCAATGCCTGTGCCCTGGCGCTCGAACCGCTGAAGCTTGTCATCATAGATGATCCAGGATCTTCAAGCCTCGTTCCGGTCGCGAATCGGAATCCCGACCCGCGCCGGGTCGACATCGTCGTGCCGATCGGCACGGAAACGAATCGACCCTTGCTGCCGGGCGGCGTGCTGCCGCCGACGGACAAGCCAAGCGCCTGGTTCTGCAGCGGCCAGCTGTGCCTGCCGGCGGTGACCGATGCACAGGAGTTGGAGAAGTTGCTGAGGCGATAGCGCCTTTTCCTTCTCCCCTTGTGGGAGAAGGTGTCGCCGAAGGCGACGGATGAGGGGTGCTCCAGATGCCACCAGCGTCTCACTCCGCTGGAACACCCCTCATCCGTCTCGGCGCTGCGCGCCGATCCACCTTCTCCCACAAGGAGGGCTTTGCACGGAACTGAGTTGGTGATTCCGTATGTCCGGCATGAAGGGGAGATGATTCGGATGGTCGATCGCGTGCTTGGACAACTGAGCCTTGCGGATGGGCTGGTGTGGTCGGATGTGACGGAGCTTGACCAGATCTCGCGAGTGATCGACTGGGTGCCGATCAAGGCGCTTCTTGGTCAGCGCAGCGGACCAGGACGGGGCAACATTAGCTATCCGGCTGAGGCCTTGCTGCGCTGTCTGCTGCTTGGCGTGTGGAACAATCTCAGCGATCCTTCGCTGGAGGCACAACTGCGCGACCGGCTGTCGTTCCGCCGCTTTGCCGGTTTCAGCCTGTCGGACCCCACGCCGGACCATTCGACGTTGTGGCGTTTCCGCGAGGAGCTGAAGTGCGATGGGCTGATCGACCGGGTGTTTTATGAGATCACGCGGCAGCTTGAGCAGAAGGGGCTGATCGTCAAGCGCGGCACGTTGATTGATGCCTCCTTCATGCAGGCCGCCGCGCGCCCGCCGGCGAAGCCGGAGGCGACGGCGCGGCCATCGGTTGATGGCGAGCACGTTGGGGACGCAAGGGCAACAAGACCGTGTTCGGCTATAAGGTGCACAACGGCGTCGACGATGCTCATACGCTGATCCGGCGCATGGACTTCACGGATGCATCGGTCACCGATACCGAGCCGGCCGACGGCCTGATCATCGGGGACGAGAAGGCGGCCTATGGCGACCAAGCCTACTACACCCATGCCCGCCATGCCCGGCTGAAGCAGGCCGGCATCAAGGACCGGCTGATGCACAGAGCCAACAAGCACCATCCGCTGACGCCGCGTCAGAAGCAGCGCAACCGGCTGATCTCGAAGGTGCGGGCGGCAGTGGAGCGGCCGTTTGCAGTGTTCAAGCAGCGCTACGGCATGCGGCGACTGCGCTTCTTCAACCTTGCGACCAACCGGACGCAGTGCATGCTTGCCGGCTGCGGCTACAATCTGCAAAGAGCGGCCGCAGTCCTCTTCCCAAAGAGGAAGCCGGCATGAGGCGCAGTCCGCCCAAAATCCACTGCCATGGCGGCCTGAGGCCCCAAGAGCGAGGCAAAATAACCCGCCAAAGCGGCCAGAGCGCCTTTCAAAAAACGCCCATCCGCCTCGTGCGCTAAATATCGACTCCCATGCAAAACCCTCACAAGGGGAGAAGGGGGACCGCACTCCCCATCACGTATTCATCGAATCGAAGAAGTCCGCGTTGGTCTTGGTCTGCTTGAGCTTGTCGATGAGGAACTCGATCGCGTCGGTCGTGCCCATCGGCGCCAGGATGCGGCGCAGCACGAAGATCTTCTGCAGGTCCGATCGCGGCACCAGCAGGTCTTCCTTGCGGGTACCAGACTTGAGGATGTCGATCGCCGGATAGATGCGCTTGTCGGCCACCTTGCGGTCGAGCTGGATTTCGCAGTTGCCGGTGCCCTTGAACTCTTCGAAGATGACTTCGTCCATGCGGCTGCCGGTGTCGATCAGCGCGGTGGCGATAATGGTCAGCGAACCGCCTTCTTCGATATTGCGGGCCGCACCAAAGAAACGCTTCGGGCGTTGCAGCGCGTTGGCGTCGACACCGCCGGTCAGCACCTTGCCGGATGACGGCACGACCGTGTTGTAGGCGCGGCCGAGGCGGGTGATCGAATCGAGCAGGATGACGACGTCGCGACCATGCTCAACCAGGCGCTTGGCCTTCTCGATCACCATTTCGGCGACCTGCACGTGACGCGCGGCTGGCTCGTCGAAGGTGGACGAGATGACCTCGCCCTTCACCGAGCGTTGCATGTCGGTCACTTCCTCGGGACGCTCGTCGATCAGAAGCACGATCAGGTAGCATTCCGGATGATTGGCGGTGATCGAATGCGCGATGTTCTGCAACAGCACCGTCTTGCCGGTGCGCGGCTGCGCGTTGATCAGCGCGCGCTGGCCCTTGCCGATCGGCGCCACCAGGTCAATCACGCGCGGCGAGATGTCCTTGGTGGGGGGAGTGTCGACCTCCATCTTCAGCCGCGAGGTCGGGTAGAGCGGCGTCAGATTGTCGAAGTGGATCTTGTGTCGGATTTTTTCCGGATCGTCGAAATTGATGGTGTTGACCTTGAGCAGCGCGAAATAGCGCTCGCCCTCTTTCGGGCTGCGGATCGGTCCTTCGACCGTGTCGCCGGTCTTGAGCGAGAAGCGCCGGATTTGCGACGGCGAGATATAGATGTCGTCGGGGCCGGGCAGGTAGTTTGCGTTGGCCGAGCGCAGGAAGCCGAAGCCGTCCTGCAGCACCTCCACCACGCCATCACCGATGATCTCGATGTCCTGCGCGGCGAGCTTCTTCAGGATCGCGAACATCAGCTCCTGCTTGCGCATGACGCTGGCGTTCTCCACCTCGAGCGTTTCGGCGTAAGCGATCAGCTCCGGCGGCTTCTTGTTCTTGAACTCTGCGAGTTTCATTTCTTGCATTAGATAGACTCTGGGTAGGCTTTTGGGAGAAAATCGGCTGAATACGACAAATGCCGGGCGCGGCCGAAAGCGAGTAAAGGGGGCGGCGCATGACGGGAAGGAAGACCCGTCTTTTATCGTCGGTCGCCCGAAAGAGCAAGCCGCCTTTTGCTGGGCTTCTTCTGGTTTTGGGGGGAGCCTAGAACGGCTTCACGACGACCAGCATGACGATGACGATCATCAGCAATGTGGGGATCTCGTTGACGATCCGCCAGTGCCTGGCCGGCTTTTCGTTCTTATCCTCGGCGAATTTCCTGACCGCGCCGGCGAGATAGCCATGCAGACCCGACAGGACGAGCACCAGCGCGATCTTGGTGTGCAGCCAGCCGCCCTGGAAGCCGAAGCCTTTCCAGGCCAGCCAAAGGCCGAACACCCAGGTCACGATCATCGCCGGATTGATGATCCCCCTGAGCAGCCGGCGCTCCATCACCTTGAAGGTCTCGGACTGCACCGAGCCTTTCTCGGCATCGACATGATAGACGAACAGCCGCGGCAGATAGAGCATGCCCGCCATCCAGGCGATCACCGCGATCACATGGATGGCCTTCGCCCAGAGATAGAAGCCGTCGCCAGCCACCAAATAGAGAAGGGCCGTCGCCGCCACGAGAACGACAATGCCGATCACCATGCGCTTCATCGCCTGGCCGGTGGTGTTCTCGTTGCTCACATTGCTCATCGGTGGCTCCTCACCATTTTCACCATCGCTTCGACATGCGCCACCGGCGTTTCCGGCGTGATGCCATGGCCGAGATTGAAGATCAGCGGCCCGCCGCCCAGCGTCTTCAGGATGGCATCGACGCCATCGGCCAGCGCCTTGCCGCCGGCGACCAGCCGCAGCGGATCGAGATTGCCCTGCACCGCGCCTTCGCGCTGCAGCTCCTTCGCCATGGACAAAGGCACGGTCCAGTCGAGGCCGAGACCGGCAATGCCGGTCTTCTTCCTGTAGTCGCGATAGCGTTCGCCGGCGCCCTTCGGAAAACCGATCACCGGCACGTCGGGATGAACCGCCTTCACCTGCCGCACGATCTCAGCCACCGGCTCGACACAGAAAGCCTCGAAGGAGGGCTCGTCCAGCACACCCGACCAGGAATCGAAAATCTGCACCGCATCGGCGCCGGCTTCGATCTGGCGGATGAGATAGGCAGCCGAATGATCGGCCAGCGTCTTCAAGAGCCTCGCGAAGGCCTCGGGCTCGCGATAGGCGAACAGCCGCGCCGGTCCCTGGTCGGGCGTGCCGTGCCCGGCGATCATATAGGTCGCCACCGTCCATGGCGCGCCGCAGAAGCCGAGCAGCGTCGTCTCGTCGGGCAATTTCGCGCGCAGCCGGCGCACGGTCTCGTAGACCGGCTCGAGATTCACGTGAAACATCTCGCCGCTCAACGCCGCAATCTCGGCCGCCGAGATCGGCTTCAGGACCGGCCCGCGGCCTTCCTCGAAACGGACATCCCGCCCCAGCGCATTGGGCACAACGAGAATGTCGGAGAACAGGATCGAGGCGTCGAAGCCGAAGCGTTCGATCGGCTGCAGCGTCACCTCGACGGCGAGGTCCGGATCATAGCAGAGATCGAGGAAGGATCCGGCCCGTCTCCTGGTCTCGCGATACTCCGGAAGATAGCGGCCTGCCTGGCGCATCATCCAGAGCGGCGGCGGTGAAACCGTCTCGCCCTTGAGGACGTCGAGCACGATCCGTTTCCCAGCCATCCAGCGCCCGCCTCTCTCTACAGCGCCGCGCGTCCTAAGACGCGCAAAGGACGCTGTAGCACTTTCATCTAGCGCATGATCCTTTCCGAACCTCCGGTTCGGGGTCATGCGCGGTCTCTTAAATCAAATATCTATTTTAAAAGAGTCTTCTGATTCTAAGAGTCTGTTGGTTGTGATGATTGCGACCTGTCCAGCCTGCGGCTCGAAAAGGCCAGTCAGCATATTGTCGCGTACTTGTTTTCGCAAACTGGAGGGATCTGGGGACAAGGCCGATTCAAGCTTTCGAGTCAAAAGCTTGGCCGGTCCGATCCGATGTCCTGGTTGTGGATGAAATCGGCCCGGAAAAGTTGATCCCCAGTTTTGTCCCCAGCGCCGCGACAAAGCCTACAGCTGGTCGAATTGTGGACAAGCGGCCATCTGATACAGTCGCTTCCCTACCCAAGGCCCGCCCATCCAGCCTTGTCCACAATTGCCCACAGCCTGGACCCACCCGTTGTGAACAAACCTCAGAGCTTCTTCCACCTGCATCTGATCTCCGACGCCACCGGCGAGACGCTGCTGGCCGCCGGCCGGGCGGCGTCAGCGCAATACAAGGACGCGCGCGCCATCGAACATATCTATCCGCTGATCCGAACCGAGAAGCAGGTCGCCAAGGTCTTCGACGACATCGAGGAGGAGCCCGGCATCATCCTCTACACCGTGGTCGATCAGAAGCTCGCGCGCTCGATCGACGAGCGCTGCGCGGCCATGGGCCTGCCTTGCGTCTCGGTGCTGGAGCCGGTGCTCGCCGTCTTCCAATCCTATCTCGGCACGCCCGCCGGTCGCCGCGTCGGCGCCCAGCACGTGCTCGACGCCGAATATTTCCGCCGCATCGACGCGTTGAACTTCACCATGGAGCATGATGACGGCCAGCTGCCGGCCAATATGGACGATGCCGATATCGTGCTGATCGGCATCTCGCGCACCTCGAAGACGCCGACCTCGATCTATCTCGCCAATCGCGGTATCAAGACAGCCAACATCCCGATCGTGCTCGGCGTGCCGCTGCCCGAGAGCCTGATCGCCGCCAAGACGCCGCTGATCGTTGGGCTGATCGCGACGGCCGAGCGTATCTCGCATGTGAGGCAGAACCGCATCCTCGGCAACAGCGCCGCCTTCGTGCCGACCGACTATGTCGATCGCGCCGCGATCAACGAAGAGCTCGCCTATGCGCGCCAGCTCTGCACCAGGCATGGCTGGCCGATGATCGACGTCAGCCGCCGCTCCATCGAGGAAACGGCGGCGGCAATCGTTGCCCTGCGGGGCAAGACGCGGTAACGAAGGTTGGCCGAGACGCCGGCTGCTTCGTCATCCTAGGGCGGAGCAAGGAGCGCAGCGACGCGGCGTAGACCCTAGGATCCATGCCATGACGTCGAAGTATTGCAACGGTGTAGAATTCTGCTCCGCTGCGCCCCTCGGCTGAGGTTACGGCATGGATCCTAGGGTCCTCGCAACGGAGCTTCGCTCCTGCTTCGCCCTAGGATGACGAAGTTCGTAGGGCCTTGGCCATTCTCCAAAGTTTGTGCTGGACCGCCGGACAAACGATGGGTCAACAGCGTCAGGGAACTAGTGCATGTCCGAAAAGATCATCCTCGCCTCCGGCAGCCCGTTCCGAAAGACCATGCTCGTCAATGCCGGCCTCGACATCGAGGCGGTGCCGGCAAATGTCGACGAGCGCGCCCTCGAGGCTCCGCTGAAAGACAGCGGCGTCTCGCCGGAAGACGTCGCCTCGATCCTGGCCGAGGCCAAGGCGACTGAGGTCAGTGAGCGGAGGCCCGGCTCCCTGGTGCTCGGCTGTGACCAGACGCTGTCGCTGGGCGACGAGGTCTTTCACAAGCCGGCCGACATGGAAGGCGCGCGCCGCCACCTGCTCGCGCTCTCGGGCAAGACGCATCAGCTGAACAGCGCGGCCGTGCTTTGCCGCGACGGCGAAGTGCTGTGGCGTCATGTCGGCATTGCCAACCTCACCATGCGCAAGCTCGACCCCGCCTTCATCGGCCGGCATCTGGCGCGCGTCGGCGCCAAGGCCCTGGGGAGCGTCGGCGCCTATCAGGTGGAGGGCGAAGGCATCCAGCTGTTCGAGAAGATCGAGGGTGATTATTTCACCATCGTCGGCCTGCCGTTGCTGCCGGTCCTGAAGGAGCTGCGCGCGCTGGGAGCCATCGATGGCTGAGAAAAAGGCCTTCGTTACCGGACATCCGATCGCCCATTCGCGCTCGCCGAAGATCCACGGCTATTGGCTCAATAAATACGGCATCGACGGCAGCTACCAGGCGATCGACGTCGCGCCGGCGGATTTCACCGATTTCCTCAAATCTCTCGGCGAGAACGGCTACCGCGGCGGCAACGTCACCATTCCCCACAAGGAAGCCGCCTTCGCCGGCGTTGCCCGCCGCGACCATGCGGCTGATGAAATCGGCGCCGTCAACACGCTGTGGTTCGAGGACGGCGTTCTCTGGGGTGGCAATACGGACGGCTACGGCTTTGCCGCCAACCTCGATGACCATGCGCCTGGATGGGCGGACAATGGGCCGGCCGTGGTGCTGGGCGCCGGCGGCGCCTCCCGCGCCGTCATCCACGCGCTGAAAGAGCGTGGCATCAAGGACATCCGCATCGTCAATCGGACGCTGGCGAGAGCCGAGGAACTCAGCCGCCATTTTGGCCCTGGCGTCTCGGCGCATGGCGCAGTCGGCGAGCTGCTTGCCGATGCCTGCCTGCTGATCAACACCACGGCGCTCGGCATGCATGGCAACGAAACCCTCGTCGCCGATCCAGCCGGCTTGCCGGATCACGCCATCGTCACCGACATCGTCTATGTGCCGTTGGAAACGCCTTTGCTTGCCGCCGCCAGGGCGCGGCGGTTGAAGACGATCGACGGGCTCGGCATGCTGCTGCATCAGGCGGTGCCCGGCTTCGAACGCTGGTTCGGCAGGAAGCCGGAAGTCACGTCCGAGCTGCGGAGCATGATCGTTGCCGACATCGAGGGCCACTGATGATCGTGCTCGGCCTCACCGGATCGATCGGCATGGGCAAGTCAGCGACGGCGAAAATGTTCGCCGAGGCCGGCGTGCCAGTGCATGATTCGGACGAGACGGTGCATCGCCTCTATGCCGGCAAGGCAGCGCCTTTGGTCGAGGCGGCCTTCCCGGGCACGACCGAGGCAGGCGTGGTCGACCGCGTGAAGCTGGCGAGCCAAGTGCTCGGCGATCCCGCCACGCTGAAGAAGCTCGAGTCGATCATTCATCCGCTGGTGCGCGCCGATGCCGATGCGTTTCTGGCGAGGCATCGCGCCGTCGGCGCGCCGCTCGCCGTGCTCGACATCCCGCTTCTGTTCGAGACCGGCGGCCGCAACCGCGTCGACAAGGTCGTGGTCGTCACCGCTTCGCCCGAGATCCAGCGCGAGCGCGTGCTCGCCAGGCCTGGCATGAGCGAGGAGAAGTTTTTGTCGATCCTCGCCAAGCAGGTGCCCGACGCCGAAAAGCGCCGCCAGGCCGATTTCATCATCGACACCGGAAATGGCTTCGAGGCGGCGCGGAGGGCGGTTGAGGCCGTCATCGGCGAATTGACGGGGGATAAGTCCGGCCGGGATGGTTCCTGACGTCCGCTGACGGCAGTCCCCATTGCCATTTTGTTCTGGTTTGTGATTCTGCTCCTTGGAGCGGATTGATTCGCATGCGTGAGATCATCTTCGATACGGAAACCACCGGCCTCGATTTGCGCGAAGACCGCATCATCGAGCTCGGCGGCGTCGAGCTGGTCAACCGTTTCCCGACCGGCCGCACCTTCCACAAATTCATCAACCCGCAGGGCCGCGCCATCCATGCCGAGGCGCAAGCCGTGCATGGCATCAGCGCCGCTGACCTGGTCGGCAAGCCGACCTTCGCCGAAATCGTCGATGAGTGGCTGGCCTTCACCGACGGCGCCAAACTGATCGCCCACAACGCCACCTTCGACCTCGGCTTCCTCAACCTCGAATACAGCCGGCTCGACCATCCGGCCATCGATCCCGGCCGCATCATCGACACCCTCGCTTTGGCGCGCCGCAAGCATCCGATGGGCCCGAACTCGCTCGATGCGCTCTGCCGCCGCTACGGCATCGACAACACCCGCCGCACCAAGCACGGCGCGCTGCTCGACTCCGAGCTGCTGGCCGAAGTCTATATCGAGCTCATCGGCGGCAAGCAGGCGGCGCTGGTGCTGGAGGCCGTGTCGGTGCAGATGAACGGCGCCGGCGAGGTGGCCGATATCGACATCTCCGTCGGCGCGCGGCCCATCGCCCTGCCGCCGCGTCTGAGCGAAGCGGACCGCGCGGCGCATGCCGGGCTGGTCTCGACGCTCGGCGAAAAGGCGCTGTGGCTTAAGGTGGCGGTGGGGTGAGCGGGCGCTTGTGGCGCTGATCCGCGCGATTGGCCGAATCCCTCCGCGACGTCGTCATACTCGGGCTTGACCCGAGTATCCATGCCGTGACCTCCGCCGCAGGGTGCAACGGTGCAGAATTCTATAACCGTTGCAACGCGTTAGCGTCACGGCATGGATTCTAGGGTCTGCGCGCGTCGCTTCGCTCCTTGCTCCGCCCTAGAATGACGACCGTCCGGAGGTTTTCGCCAATCTCCGGCATCTCCAACAAAAAAAGCCCGGCACGAAGCCGGGCTCAGCCCGCCTCAGCTCACCTGGACCTTGCTAGCGGCCTGGTCCTCGGCGATGCGCTGCTGGAACATCTGGGCGAAATCGATCGGGTCGAGCATCAGCGGCGGGAAGCCGCCATTGCGGGTGGCATCCGAGATGATCTGGCGGGCGAAGGGGAACAGGAGCCGCGGGCATTCGATGAACAGGATCGGCAGCATGTGCTCCTGCGGGAAGCCCGAGATGGCGAAGACGCCGCCATAGACCAGCTCGACATTGAACAGCACTTCCTGGTCGAAGGAGGCCTTGGCGTTCAAGGTCAGGTTGACGTCGAACTGCTTGTCCGAAAGCGGATTGGCATTGACGTTGACATTGATGGCGATGCCGGGAGCCTTGTCGCGGCCGCGCAGCGAGTTCGGCGCACCGGGACTTTCGAAGGAAAGATCCTTCACATACTGGGCAAGCACATTGAGCGAGGGCTGGTTCTGGGTGCCGTTGCCGTTGGCGGCGCCGGTCGCGGCGTCATCGTTGCTGGCCATGAGCCTTTAAGTCCTTCTGCCTGGGCAGCATTGCTGACCGGATTGAAATCGCGGGTTCGCTACCATGGCCCGCCGGACAAAACAAGTTTTGCCGCTTTATCGGGCAAAGGCCCTGGGACTAATCGTTTTTCAGCCGCCGCCAGGGCGAATTGTGATCCGGACGGTTCGGATAGTCGTCGCGCGAATAGTCGCTGTCGTCGAGGTCGATGACGGTCTCGCGCCGGCGCGCCGAAAAGCCGCTCGAGAAGCTGGTCGCCATGACGATGCGGCTCTTGAACAGGCGCCAGGCGAAGTCGCGCACCGGCGGCAGCAGCAGGAGAATGGCGAAGATATCGGTGATGAAGCCAGGGATGATCAAAAGGATCGCCGCCAGCACGATCATCGCGCCATGCGCGAGCTGCCGGCTTGGATCATGGCCGGCATCCATCTCGGCCCGCACCCGCGTCATGACGCCGAAACCCTGGTGCCTGAGCAGCAGGCTGCCGGCGACGCTGGACAGGATAACCAGCCCGACCGTCGCCAAGGCGCCGATCTCGCGGCCGACGACCACGAAGCCGGCGATCTCCAGCAGCGGCAGCAGGAGCAGGAACAGCGGGATGAAGGAAATGCGCAAAGTCTCGACCGATCGCTTTCTTCTTTTGGGCGCTCGCGGACAATCGCCGCTGGCCTTGGCGCCTTTAGATAGGTATGGCTGCCTTTGATTTGAATGATTGCGCCTGCCGTTCTATATGCTTTGAGTGTTGAACGCTATGCGACCGCGGTCCCCGAGGGGGGTCACGCCGGTCGGGCAAGAACAGGGTTCGAGTTGGCGGAAAATATGGGTTTCTTCGACTTCGGCACGATTTTCTTTCTGATTGCGGCGGTTGTGATCTTTTTCCAGCTGCGCAACGTGCTCGGACGTCGCACCGGCAACGAGCGCCCGCCCTTCGATCCCTATTCGGCGAGCCGCACGCGCGAGGCCGATGCGGCGCAGAAGCCGGAAAATGTCGTGTCGCTGCCACGCAAGCGCGCACCGGGCGAATCCTCAGCCGAGACCTATGCCGCGATCGACGCCTTCGCCAAGCCGGACACCGATCTCAACAAAGGCCTGCGCACGATCAAGGACAACGATCCGTCCTTCGAACCCAAAACCTTCGTCGACGGCGCCAAAATGGCCTATGAGATGATCGTCATGGCCTATGCCGACGGCGACCGTAAGACGCTGAAGAATCTCCTGTCGCGCGAAGTCTATGACGGGTTCGTCGCCGCCATCGGCGAGCGTGAGGCGAAGTCAGAGAAGATCCAGTCCTCCTTCGTCGGCATCGACAAAGCCGACATCGTCGCCGCGGAGATGAAGGGCTCCGAAGCGCATATCACGCTGCGCGTCGTTTCCGAGCTGATCTCGGCGACCCGCGACAAGGCCGGCGCCGTCATCGACGGCGATCCGGAAACGGTGGCCGAGGTCAAGGATGTCTGGACCTTCGCCCGCGACACACGCTCGCGCGACCCGAACTGGAAGCTCGTCGCCACCGAAGAAGAAGATTGAGATCAAGCGGCGGGGCTCGGTCGTGCCGCTCTCTCCCATACTCAGCGAAAAATCTTTTGATGACCTGCCCGGCTGGGGCGAGGACGACCATGCTGCGGCCTTCGCGGCCTTTCGCCGCTCGGCCCTTCATGCGCCCGTTAAACCCTATCGCACCGGCTCGCTCGGCGTCGATTTCAATGCCTTTGCCGAGGCCTATGCTGAGGCCCGCGCCGTTTCGGCGCCGAATCGGTCCGAGGCGCGGTCCTTCTTCGAGCGGCATTTCGTGCCGATGCTGGTGAGCGCCGAAAACGGCTCGCGGCTAGTCACCGGCTTCTACGAGCCGGAGGTCGAGGCCTCGCCGGTCAGGACGGAGCGGTTCGCCGTGCCGCTGCTGTCCCGCCCCGCCGATCTGATCGACATCGACGACGGCAACCGGCCGGCCGGCATGGATCCTTATCTGGCCTTCGCCCGTCGGACGGATAATGCTCCGGTCGAATATTTCGACCGCGGCGAAATCGAGCGCGGTGCGCTCTTAGGCCAAAATCTCGAGATTGCCTGGCTCGCCGAAAAGGTCGATGCCTTCTTCATCCATGTGCAGGGCGCAGCTAGGCTGAAGATGACCGACGGCAGGCTCGCCCGCGTCACCTATGCCGCGAAATCAGGACAGCGCTTCATTGGCCCGGGCAAGATTCTGAGCGAGCTCGGCGAGATCCCGCTGGAAAAGGTGACGATGCAGTCGATCCGCGCCTGGTTCAAGGCGCATCCTAGCCGCGTCGACGAGATCCTCTGGCAGAACCGCTCCTACATCTTCTTCCGCGAGGCTGCAGTCGACGATGCCGCGCTTGGGCCGATCGCCGCCGCCAAGGTGCCGCTGACGCCGGGGCGCTCGGTCGCGGTCGATCGCCTGCTGCACACATTCGGCACGCCCTTCTATATCGACGCGCCGACGCTCACCGCCTTCGATGAAAAACCGTTCCGGCGGCTGATGATCGCGCAGGACACCGGCTCGGCCATCACCGGGCCGGCGCGCGGCGACCTCTTCGCCGGCTCGGGCGACGCCGCCGGCGAGATCGCCGGCGTGGTCCGCAATGCGGCCGATTTCTATGCGTTGGTTCCGCGTGGGCTTCTGAATGGAGCGACCCGGTGAGCCGCCGCGACGACCATCTCAGCGACGACGACCGCATCCTGTGGAACCTGGTGGCGCGTTCGGCCCGGCCGTTGAAACGCAAGGCCGCGGTCGAGATTCCCGAAATTATCGAGCCCAAGCCGACGCCGGCCGCCGCTTCCGTCAACGGTGTCCCGGCCGTCGCCGCGAAACCGAAGACGCCGCATGTCTCGCATTCGCTCGACGATCAGACGCTGCACAAGCTGAAAAAAGGCCGGCTGCCGATCGAAGGCCGCGTCGACCTGCACGGCATGACGCAGGACGAGGCCTATTCGCTGCTGCTCGCCTTCCTGCATCGGGCGCATGCCGGCGGCATCCGCTATGTTCTGATCATCACCGGCAAGGGTTCGTCCTCGGGCGGCGACGGCATCCTGCGCCGCGCCGTACCGGCCTGGCTGTCGACCCCGGCCTTCCGCCATCTCGTCTCCAGCCACGATCACGCCGCCCGCAACCATGGCGGCTCGGGCGCGCTCTATGTGCGGCTGCGGCGGACGCGCCCATGAGGGCCCGGCGATGAGCGGCTCGGCAATGAGGGCCCGGCGATGAGCAGGTCGACATGACTCCGTTCGGCGAGAAGCTCAGGGCGCTGCGCGCCGAGCGCGGCGTCAGCCAGAAAGCTATGGCCGAGGCGATCGGCGTCAGCGCCGCCTATCTGTCGGCGCTCGAGCACGGCCGCCGCGGCGCGCCGACCTGGACGCTGATCCAGAAGATCATCGGCTATTTCAACATCATCTGGGACGATGCCGAGGACCTCGCCCGCCTTGCCGAGAGCTCGCACCCGAGGGTCAGGATCGACACGTCCGGCCTGTCGCCGGCGGCCACTGAACTGGCCAATCTTTTGGCCGAAAGCATCGAGAGGCTGGATGAGGCGGAATTGCGCCGCCTCAGCACGTCGATCCGCGCCGCGCTCGCACGACGTAGCTAACAAGCGTCAACGGGCGAGGTCGGCGCTGCCCCTCATCTGGCTGCCGCCATCTTCTCCCCGCGAACGGGGAGAAGGGCACTGTGCCGCTGCTTTCGCCAATCACCAGCGTCGGAAATTGGCGCTAAGAAATTGGCGCTAAGGTTGCTGCCAGCCTCTTTCTCCCCGTTTTACGGGGAGAAATGCCCGTCAGGGCAATGAGGGGCGGCGCCAACCTGCCATATCCGGATACTCACGATGAAACCGCGTTGGACGAGCGCGATCCGCTTTTCGCTTCGCGCATCTGTAAATGAAATCAGAACTTGCCGGCGCCGGCGCCATTGGAGCTGCCCCGACCGCCCTTGCCTTCATGGCCTTGATGATCGCGGCCGCCATTGCCGCCGCCCAGGCAGTTCGCGGAACCGACGCTGCAGCAGCGGCCGCTCCACAGATCGTTGAGGTTGGTCGCGGCGCAGCTTTTGTAGGTCTTGTCCGCCAGGGCCGATCCGGTCAGGGCCGAAATCGCGACGGCGGCAAGAAGGGTGTTGCGCAGGAAAGAAGTCATTTGAGTTCTCCATGGTTGGATTTGCCATTCACGGCTGTGTGTCGCCGGCCGAACGGGAATGGTTCATGGAGTTGTGCGATTTTTTCGGAGGCCCGTTGCTCTCGTCATCCACGGGCGGAGCGGGAGCGAAGCGGACGCGTAGACCCGAGGATCCTGTGTGTTGGTTCTGGTGTATGGTTGAGGTGGGAAGGAGACCGAGTGGATCGTGGTCGTCCTTTCGGACAGGCCGTGGCATGGCCCGGTCCCTTCCCACCGGTGAACCATCAACCTGAACAGTTGCACGGGGCTGTTCGAAGCAGACCCCGCACCACAGGAAGAGGCGTGGACATGATAGCCCAGAACTATGTCGGCTGCGACATCTCCAAGCAGTGGCTCGACTTTTTTGATGAGACAAGCGGCCGTCTTCGGCGCATCGACAACCAGGCCGATGCGATCGCCGCCTATGTGGCAGGCCTTGATCCCAGTCGGGACTTCGTCGTCATGGAGGCGACAGGTGTCCATGACCGGCTGCTGCGCCATGCGCTGGCCAAGGCCGGCGTGCCGTTCTCGCGGCACAACCCGGCGCACACCCACCACTATTCGAAGTCGACGCGCCGGCGCGCCAAGACCGATCCTCTCGATGCCAGGATGCTGAGCGACTATGGCCGTCGCTATAACCCTGAGGCCGAGCCGGCGCCGAGCGAAGAAGTCGAGCGGCTTCAATCGCTTGCCGGCCACCGCGATCATTTGGTCGAGATGCGGGCAGCGCTGAAGAAGCACCTCGCCGAGGCTTTCGAGGAGGCCGTCATTGCCGATCTGGAAGAGACGATCGCCTTCCTCGACACGCGTATCAAGGCGTTCGAGAGCCAGATTGCCGAAGTCATTCGTCAAAACCAGGACACCGCTCGCGACCATGCGCTGATGGTCTCCGTGCCCGGTGTCTCCAATGTCGGCGCGCTCTCTTTGCTGGCGCTCCTGCCCGAGCTCGGCCAGCGCTCACCCAAGGCGATCGCCGCGCTCGCCGGCCTTGCCCCGTTCGACAACAAGAGCGGCAAGCTCAGCCGCAGGAGCCAGATCCAGGGCGGGCGATCACGCGTGCGTCGCGCCCTCTATATGGCTGCCCTCACCGCCATCAGAACCTGTGAGCGCTTCAAGACCTTCTATACCGCACTTGCTGCCCGCTCAGGGTCCAAGAAACTCGCCATCATCGCCGTCGCAAGGAAGCTCCTGGTCGTCCTCAACGCCATCATGCGCGACAAAACCGCATTCGCGTGAACACAGTTGCCATGCCGTTACGGCAAGGCGTTCCAACGGTTGCAGAATTCCGCACCGTTGCACTCTGCGGCAAGGGTCACGGCATGGATACTCGGGTCAGGCCCGAGTATGACGACGTCACGTAGGGATTCGGCCAATCTCCAAGGGTGGCGCCAGAAGCTCAACCCACCGCCACCCTCAGCCACGGGGCCTTTTCGCCGAGCGTCCAAACCAGCCCGGCATGCGCCGCGCGGTTCACCCTCCGCACAGGGAGGTAGAGCGCCTCACGAAGAAAGGCAGGATGCCCAGTCCCTACTGCACCGAGCCGACCAGCACGGCCTGGCCGTCGCGGATCGAGACCCAGCGTCCGGTGTTGTCGATCGCCTGGCGCTTGAGGAAGCGGTAGCCGGTCGCGTCCCAGGCTCTGACCTTGTCGGTCAGATTGTCGAGCACGTAGTCGCCCTTGTCGGTGCGCACCGTCAGCACCGAATGGCCTTCGCCATCAGGCTTGCGCACGACGGTGATCAGGAGGTCGGCGAGCGACATGCCCATGCGATAAAGCTGGCGGCGCTTTTCCAGCACATAGTCCTCGCAATCGCCATAGCCGTCATCCGGATAGGCCCAGACTTCATCCTTGCCGTAGTGATCGAGATCGCTCAACGGCTTGACGGCGGCGTTGACCTTGGCGCTGACGCTGACCAGCTTGCGCCACAGGACGTCCGTCATCCTGGCCGGCTCGAGATCGGCCGGATGGATGTTGCATTCGTTCGGACGCGCCTTGCAGAAATCATAGTGCCCGATCGGCTGCGAGGTCAGGCCGCCTGTGGTCATTGCGCCCGCCGCCCAGGCGGGTACCGCCCAGTGAGCCGAAATCGCCAGCCCTGCGACTGAACACAAACGCAGAATCCGCGTCATCGCTGAGGAAGTCATTGCCCCCGCCGCCCTGTTCTTTATTAACGAAACGTTAAAGGCGATTTACAGTCCGTGTCAATTAGAGACCGCGGCCAGTTTGACGGCATGGTTACCTGGACGATCCCGTAGCGGCACTTTCGCAACAGAGACACCGTCGCAACACCAGCGGCCGCCGTCTGGCGGCGGAACGGAAGTTCGCTCTGGCGCGGTCCGCGGCGTTCAGCCTTTCGCGGAACGGACCAGTTTGGGCCTCGTCGCCACACGCTGCTGGCGGCCATAGCTGGAAATGAAGTCGGCGATGCGCGGCACGATCTCGGAGCGGAAGCGCGAGCCGTTGAAGACGCCGTAATGGCCGACGGCCGGCTGCACGTAATGGACATGCTTGTCGGCGGGAATGTTGACGCACAGATCATGCGCGGCCTGGGTCTGGCCGAGCCCGGAAATGTCGTCGTTCTCGCCCTCGATGGTGAGCAGCGCGACGTTGCGGATCGCTGCGCAGTCGACCGTCTCGCCGCGATGCGTCATCTCGCCCTTCGGTAAAGCGTGGCGCACGAACACGGTGTCGACCGTCTGCAGGTAGAATTCCGCCGTCAGGTCCATCACCGCCAGATATTCGTCATAGAAGTCGCGGTGCTTTTCGGCGCTGTCCCCGTCATGCTTCACAAGGTGCATGAAGAAGTCCTTGTGGGCGATGATGTGGCGGTCGAGATTCATGCTCATGAAGCCGGAGAGCTGCAGGAAGCCCGGATAGACCACGCGCCCGAAGCCCGGCACCGGCCAGGGCGCCTGCATGATGACATTGTCGCGGAACCAGTCGATGCCCTTTTCCTTGGCCAGGAGATTGACCGCGGTGGGGTTGCGGCGCGTGTCGACCGGCCCGCCCATCAGCGTCATGGTGGAGGGCACGAAGGGATCGCCGCGCCTGTCCATCAGCGCCACCGCCGCCAGCACCGGCACCGAAGGCTGGCAGACGGCCATGACATGGGTGTCGGGGCCGAGCGCATGGAACATCTCGATGACGTAGTCGATATAGTCGTCGAGATCGAAGCTGCCTTGCGCCACCGGCACCATGCGGGCGTCGACCCAGTCGGTGATGTGGACGTCGGCGTGCGGAAGCATCGCCTCGACCGTGCCGCGCAGCAGCGTCGCGTAGTGGCCCGACATCGGTGCCACGATCAAAAGCTTCGGGTCAGGCTTGCGCCCCGCGGGCAGCGCGCGCTCGAAGCGCACCAGATTGCAGAACGGCTTCGACCAGACCGTCTTTTCGGTGACGTCGACGCTCTTCCAGTCGACCACGGTCTTGTCCAGGCCGAATTGCGGCTTGCCGTAGCGGCGGGTGGTGCGCTCGAACAACTCGGCGGTCGCCGCGACCGAGCGGCCTAAGGGGGTGTGGGAGATCGGGTTGAGCGGGTTGGAGTAGAAGAGACGCACCGCGTCGGCATACAAGCGCGCGGGCTGCAGCGCCGCATGGTTCATTTCGTAGAGCTGGTAGAACATCGAGAACCCCGTTGCCGTCCGTCGACCAAAGGACGACGAACAGCGCCGAGCCTTGAATGTCTCCCGGCGAGGTTAACAAGATATTGTTGCGATGCAATACGTCATTTTGTCTGACGAATGCTTCTTTGGTTCAACCTGTTGGAAATCAGGCTTGAGCGGGCGAGCCGGCTCACTGTGCGGTGCCGAAATGTGAAGGATATGTGTCCGGCCTCTCTTACCGGAGACCGGCCATGATCGCCGGACCGCAGCGGTCGCCTTGGATCATGGTCCGATAATGGAGAGATCAGACGCGCGCCATGCAGACGGCGGTGTGGCCCGAACCGATGAAGCCCAGCTGTCCGGCTGCGCCGCGGGACAGATCCAGCACGCGACCCTTCACGAACGGACCGCGATCGTTGATGCGCACGACGACGCTGCGGCCGTTGTTCTTGTTGATGACCCTGAGCTTGGTGCCGAAAGGCAGCGTGCGGTGGGCGGCGGTCAGTGCCGAGGGGTTCATGCGTTCCCCTGAGGCGGTTCTGGAATGCAGCGCATACCAGGAAGCGCGGCCGCATTGCGCGGCGGCGGAGGAGGCGGACGAGGCGCCAGCCATCAGGCCAGCCGCGATCGTTACCGCGACAAGCGCGGTCTGGTTGGTTCTCTTCATGCTTTGGGGGTGGCTCCGTTGATCGAATAGCCTCGCCTAAGTGCCGAATGAGGCGGGAAATGCGGCAGCCTTCTGGCGGTTCCATGGCGAAGGCACACGTTTGGAGTCGCCGCGAAACAGTTTGTCATGAGATTCTTGGAGCCCGAGAGGCGTATTCTTGATGAATATCGGACCACTCTCCGGGACGTTCCGGACAAGATCGGTCCTCAAGATTGGAAGTACAGAGGAGCCGGACTATTCCGATTCCTCCGAGTCTTTTGTTGCGTCCCTGTCACCGCTCGAAATCGACTTCCGGAGAGGATCCAGCCGATGCGATTGATCAGGTCTGTGCTGGCGCTGATTGTGCTGGGCGTGGGCGCGCTATGGTCGCTGCAGGGGCTCGGCCTGGTCGGCGGCAGCTTCATGACCGGCCAGACACGATGGCTCTATATCGGCCTCGTCACCATGCTGGCCGGCATCGTCGGACTGCGCTGGGCGAGCCAGTCGCGCGTTTGATCTTACGATATGATTTCGTGATTTGGGACCACGGCCGGCGCCGCGCTCAAGCCCGGTTGAACAGGTGATCCAGGATGGTGCCCTCGATATGGGCAAGCTCCGCCGAACCGTGCCGGCGGGGCCGGGCCACAGGCACGTCGAGGTCGAGGGCGATCCTGCCGGCGCCGATCACCACGACGCGGTCGGCGAGCGCGACGGCTTCGCCGACGTCGTGCGTCACCAGCACGGCGGTGAACTTCTGGTCGAGCCAAATCCGTTCCAGAAGCTGCTGCATCTCGATGCGGGTCAGCGCGTCCAGCGCGCCGAGCGGCTCGTCCAGCGCCAGGATCTGCGGCTGGCCGACCAGGGCGCGGGCAAGCGCGACGCGCTGCTTTTGACCGCCCGACAGCACCGAGGGCCATTCGTCGGCGCGGTCGGCGAGCCCGACCTCCTCCAGCATGGCAAGCGCCCGCTGCCTGGCTTCGGTGCCGCCGGCGATGCCGGTCAGCCCGATCTCGACATTCTTCACCACGCTCGCCCAGGGCAGCAGCCGCGGCTCCTGGAACATGAAGCGCGTGCGGCTGTGACCCTCCTCGGCGGCGCCAAAGGTCAGCGAGCCGCTGCTCGGGCGGTCGAGGCCGGCCAGCAGCCGAAGCAGCGTGCTCTTGCCGCAGCCGCTTTTGCCGATGATGGCCAGGAACTGCCCGCCCCGCACGTCGAGGCTGATGCCGTCGAGCACGATCTTGTCGCCGAAGCGTTTCCCCACCTCCTTGAAGGCGAAGGCCCGGCGGCCCTCGACCGAGCCGATCGTCCGGGCTTGCGCCGGCGTGGCGACGAGGGAGCGTGCTGCGGTGAGGGTGGCGGCTGGCATCTTGTTCTCACCCTTTCTGGAAGGCCGGATGCCAGCCGAGCGACAGGCGCTCGAGCAGGCGGGCCAGGCTGTCGGCGAGCTTGCCGAGCAGCGCGTAGATCAGGATCGACAGCACCACGACGTCGATCAGCAGGAACTCGCGCGCCTGCATGGCCATGTAGCCGAGTCCGGAACTGGCCGAGATCGTTTCGGCGACGATCAGGGTCAGCCACATGATGCCCAGCGCATAGCGCAGGCCGACAAAGATCGAAGGCAAAGCGCCAGGCAGGATCACCCGGAAGAACAAGGCGCGCCGGTCGAGCCCGTAGACCCGGCCCATCTCGACCAGTTGCGGATCGACGCTCTGGATGCCGAGCAGCGTGTTGACATAGATCGGGAAGAAGACGCCGAGCGCCACCAGGAACAGCTTGGCCTCTTCGTCGATGCCGAACCACAGGATAACCAGCGGGATCAGCGCCAGATGCGGGATGTTGCGGATCATCTGCAGCGTCGTGTCGGTCAGGCCGCGGCTCAGCGCCGACAATCCGTTGGCGAGGCCGAGCGCGAAGCCGATCGAGCCGCCTATGGCGAAGCCGGACAGCGCCCGCAGGGTGGAGACGCCGATGTTGCGGATCAGCTCGCCTGACAATGTCAGCCGCCAGAAGGCCCCGGCCACCGCGCTCGGCGCCGGCAGCACATTGGCCGGGATCAGGCCCGCGCGCGCCGCGGCTTCCCAGCCGGCGATGATTGCCGCCGGCAGCAGCCAGCCGGCCACGCCGTTGCGGGCAAGGCGCGTGGCCAGGCTCATGACGCCGCCTGCAGGCGGTTGGCGCCGTGGAAGCCGACCGAGAATTCGTTGGCGATATCCTTGTGCGCCCGTGGCCGCTGCGTGCCCAGGCCGAGCCTCGGGAACAACAGCTCGGCGACGCGATAGGCCTCCTCGAGATGCGGGTAGCCGGAGCCGATGATGGTGTCGATGCCGATCGCCTGGTATTCGCCGATGCGCTCGGCGATCTGCTCCGGCGTGCCGACCAGCGCGGTGCCGGCGCCGCCGCGCACCAGGCCGACGCCGGCCCAAAGGTTCGGCGACACGACAAGCCGGTCGCGCCGGCCGCCATGCAGTTCGGCCATGCGCCGCTGCCCGACCGAATCCATCTCCTTGAGGAAACGCGCCTGCGCATTCTCTATCTGCGCGTCGGTGACGTGGCTGATCAGCCGCTCGGCGGCGCGCCAGGCCTCGTCCTCGGTCTCGCGCACGATGAAATGCAGCCTGATGCCGAAACGCAGCTTGCGGCCGCGCCGCGCCGCTTTTTCACGCGCCGAAGCGATCTTTGCGGCAACCTGCGCCGGCGGCTCGCCCCAGGTGAGATACATGTCGACGAGGTCGGCCGCGAGCTCCTGTCCGGCCTCGGACGAGCCGCCGAAATAGAGCGGCGGCCGCTCCTGCAGCGGCAAGAGGTCGAGGCGGCCGTTCTCGACGCGGTAGTGTTTGCCTTCGAAATTCACCCGCTCGCCCGAGACCAGTCCGCGCCAGATGGTGAGGAATTCCTGCGCCTGCGCATAGCGCTCGTCATGCGGCAGAAACACGCCGTCGCCGGCAAGCTCGGTCGGGTTGCCGCCGACCACGACATTGAGCAGCAGGCGGCCGTTGCTCAGCCGGTCGAGCGCCGCGGTCTGGCGTGCCGCGAAGGTCGGCAGCGTCACGCCGGGTCTCAGCGCGACCAGGAACTTCAGCCTTTCCGTCAGCGTCGCGAGCCCGGTGGCGGTGATCCAGGAATCCTCGCAATTCTGCCCGGTGGGCAAAAGCACGCCGGGGAAGCCCAGCCGGTCGACCGCCTGCGCGATCTCCTTGAAGTAGCCGAATTCAGGCGGCCGCTGCTGCTTCTCGGTGCCGAGATAGGAGCCGTCGCCATGCGTCGGGATGAACCAGAAGAAATCGAGCGGGCTGGCTGAAGCGGTCATGGCTGAACCCTGAGAGAAGACAGGAGAAACTACGGCGGCGCCGGGCGTTCGCCGCCCGGCGCCGTGAAAGAGATCAGTTGCCCGGCGCGGTCCAGACGGCGTCGGAGATCCGCACCGCTTTCGGGATCAGCCCGAGCTTGAAGAAGCGGTCGGCCGTCGCCTGCTGGCCGGCGACGATCTCGGCGGTGATCGGATAGATGCCGAACTTGGTGCGTTTGGCGGCGATCGTCTGCGCGTCGAGCGGCACGCCGGTCACCTCATGCAGCGCCTCGGCCACCTTGTCGCGGTTCTGGTCCGCCCATTTCGCCGCGTCGCCGAGCGCGGCGATCGTCGTCGAGACGAAATCCGGATGCGCCTTCGCGAAATCCTTGTTGGCGAGGAAATAGGTGTTGACCTGGAGCACGTCGATGGAGCGGGCAAGCACCCGCGGCTGGTAGCGCGTTTCAGCGATGGCGAAGAACGGATCCCACACCGCCCATGCATCGATCTGGTCGCTGGCGAAGGCGGCCGCCGCATCGGCGGGGCTGAGGTAGACGGGCGTGACCTGGTCGAAGGGGATGCCGGCTTTCTCCAGGGCGGCAACGACCAGATTGTGCGCGCTGGTGCCCTTGCCGACGCCGATGCGCTTGCCCTTGAGATCGGCCAGCGACTGGATCGGCGAGGCCGGCTTGACGAAGATCGCCTCGCCCTCGCCATTGGAGGGCAGCGCCGCCGCATAGACGATGTTGGCTCCGGCCGACTGGCCGAAGATCGGCGGCGCGTCGCCAGTCCAGCCGACATCGATGGCGCCGACATTCAAGGCTTCCACCAGCGGCGGGCCGGCGGTGAACTCGACCCACTTCACCGTCACGCCCTTGTCGGCCAGCGCCTTTTCGATGATCTGCTGCTGCCTGGCGATCACCGGCAGGCCGGTCTTCTGGTAGCCGATGTTGAACGCCTTGAGGTCCTGCGCGGCGGCGGGGGATGTCGTCCCAAAGGATGCGGCAAAGGCCGCCGCTGTAAGCAGGCCGACGCCGAATGCGCGTCTCGTCAGATTGAACATGACCCTCTCCTTCGTTGGTCCCGAACTGCCGGGCGATTGGAATAAGGTCAGGCTAGGTAAATCTATAAAGCTAGTAGAGAAACGATCTTGCGAGTTCGGCGGATTTCATGGAATTATTTTCTGCGCGTAACCCTGGGTGTCGTCATCCTAGTTGAGACCGCACAGGGATTTTGATCGGTCGGCCCTGTTTGCGTTGGCGGGCGGCATGCGTTGGAGATTGGCCGGAACGCCCTTCCTGATCGTCATTCTAGGGCGGAGCAGGAGCGAAGCTCCGTCGCGAAGACCCTAGAATCCGTGCCGTTACCTTCGCCACGGCGTGCAACGGAGCAGAATTCTGCTCCGCTGCGCTCTTCGACTGAGGTTACGGCATGGCAACTGTGTTCACGCGAATGCGGTTTTGTCGCGCATGATGGCGTTGAGGACGACCAGGAGCTTCCTTGCGACGGCGATGATGGCGAGTTTCTTGGACCCTGAGCGGGCAGCAAGTGCGGTATAGAAGGTCTTGAAGCGCTCACAGGTTCTGATGGCGGTGAGGGCAGCCATATAGAGGGCGCGGCGCACGCGTGATCGCCCGCCCTGGATCTGGCTCCTGCGGCTGAGCTTGCCGCTCTTGTTGTCGAACGGGGCAAGGCCGGCGAGCGCGGCGATCGCCTTGGGTGAGCGCTGGCCGAGCTCGGGCAGGAGCGCCAGCAAAGAGAGCGCGCCGACATTGGAGACACCGGGCACGGAGACCATCAGCGCATGGTCGCGAGCGGTGTCCTGGTTTTGACGAATGACTTCGGCAATCTGGCTCTCGAACGCCTTGATACGCGTGTCGAGGAAGGCGATCGTCTCTTCCAGATCGGCAATGACGGCCTCCTCGAAAGCCTCGGCGAGGTGCTTCTTCAGCGCTGCCCGCATCTCGACCAAATGATCGCGGTGGCCGGCAAGCGATTGAAGCCGCTCGACTTCTTCGCTCGGCGCCGGCTCGGCCTCAGGGTTATAGCGACGGCCATAGTCGCTCAGCATCCTGGCATCGAGAGGATCGGTCTTGGCGCGCCGGCGCGTCGACTTCGAATAGTGGTGGGTGTGCGCCGGGTTGTGCCGCGAGAACGGCACGCCGGCCTTGGCCAGCGCATGGCGCAGCAGCCGGTCATGGACACCTGTCGCCTCCATGACGACGAAGTCCCGACTGGGATCAAGGCCTGCCACATAGGCGGCGATCGCATCGGCCTGGTTGTCGATGCGCCGAAGACGGCCGCTTGTCTCATCAAAAAAGTCGAGCCACTGCTTGGAGATGTCGCAGCCGACATAGTTCTGGGCTATCATGTCCACGCCTCTTCCTGTGGTGCGGGGTCTGCTTCGAACAGCCCCGTGCAACTGTTCAGGTTGATGGTTCACCGGTGGGAAGGGACCGGGCCATGCCACGGCCTGTCCGAAAGGACGACCACGATCCACTCGGTCTCCTTCCCACCTCAACCATACACCAGAACCAACACACAGGATTCCAGGGTCTGCGCGCGTCGCTTCGCTCCTTGCTCCGCCATAGAATGACGAGGTTGGGAGGGCGGCTTCAACCAATCGACGCGGCGCAGACATTAGGATGACGAAGTCGCCAGCGCTGGCGTCCCCCACTTCCCGATCATTGCGACCAGTACTGCCAGACCCAGTAGAAGCTCTCGTCGCGCGGCGGCAGGCGATCGCGGTCGCCGGCAAGCAGCACCGGCGTCGGCGAATTCGGTTCGTCTGCCTGCTGTTTGTCGCTCGGCGGCAGACGCAACAGCCATGTGCCGACATGCGCAAACAGCACTGCGGTGGAATGGGTTATCGTCCTTCTCCTTCGCTTGAGCTCATTCCCGTCCCGCCCAGGGCACCAGCACCCGCTCGATCCGGCGGATGACGAATTCGAGCGCAAAAGCGATGATCGCGATCACCAGTATGCCCATCACCACCACGTCGGTGACCAGGAATTGCGCCGCCGACTGGATCATGAAGCCGAGCCCCCGCGTCGCCGCGACCAGCTCGGCTGCGACCAGCGTCGACCAACCGGCGCCGAGCGCGATGCGCAGGCCGGTGAGGATCGAAGGCAGGGCGCTGGGCAGGATCACGTGGCGGATGACCTGGGCGCGCGTGGCGCCGAGCGACCGCGCCGCGTCGACACGCTCGCGCGAAACGCCGCGCACGCCGGCGGCGGTCGACAGCGCCACCGGCGCCAGCATGGCGATGGCGATCACCAGGATCTTCGACGGCTCGCCGATGCCGAACCAGATGATGATCAGCGGCAGATAGGCGAGCGGCGGAATCGGACGCAGGAATTCGAGCAGCGGGTCGAACACGCCGCGCCCGATCCGGCTGATACCGATGGCGAGGCCGACCGGCACGCCGACGAGGATGGCCGCGATCAGCGCCGCAAACACACGCCAGAGCGACGCGAGGATGTGCTGGAGCAATGTGGCATCGACGAAACCGTCGCGGACAACGATGACGAACTTTGCCCAGACCGCGCCGGGCGAGGGCAGGAAAACCGGCGACACCAGCTGCAGCGAGGACGCCGCCGTCCAGGCAGCGAGCAGCACGAGGATGGTGATCGCGCTGACCAGCCTGGCTGAAATTACCGCCCGTCTTGGCCGCGGCCGCGACCATGGCACGGAAAGCCCGTCGGTCTCGCCGGCCTTCGCGGCGGGCCTTTCGCTGTAAGTGCTCACGCTCATGCCGCCTCTCCTTCGAAGATCGAGTCGGTCAGATCCGCGCGGGCCGCGGCGAAGCCGGCATCGGCCTTGATCGCGCGGATCGGCTCGCCGGCGGCGTAGCGGCGGCCGAAATTCGTTTCGAAAGTCCGCACCACGTGCCCTGGACCGGGCGCCAACACGACGATGCGGGTCGCCAGCACCAGCGCCTCCTCGATGCCGTGCGTCACCATCAGCACGCCGACTTGGGCCGCTGCCCACAGGTCGAGCAGCGTCGTCTGCATGCGCTCACGCGTCAGCGCGTCGAGCGCGCCGAGCGGTTCGTCGAGCAGAAGGAATTCGGGTTCTGCGGCCAAGGCTCGGGCAAGGCCGACGCGCTGGCGCATGCCGCCGGAAAGCTCCCAGATCCGCTTGTCGCCGGCGTCGGCGAGCTTGACCAGCGACAGCAATTCGTCGGCGCGGCGTGCTCTGCTATCGGCCGGGACGCCGCGCAGCCTAAGCGCGAAGGCCACATTTTCCCGCGCCGTCAGCCAGGGGAACAGCGCATCGTTCTGGAACACCACGGCGCGATCGGATCCGGGCCTGGTGATTGGCTTGCCGCCCACCGTGGCCAGGCCGCCCGCGGGCTCGACCAGGCCGGCGGCGACGTTGAGCAGCGAGGTCTTGCCGCAGCCGGAACGGCCGACCAGCACGACGAAATCGCCCTTCGCAATGTCGAGCGAAACCCGCTCGACCGCCGGCACCAGTTGGCCGTCATAATGGACGCTGATCTTGTCGAGGGTGAGATGCGTCATGCGGGCCTCTGCTGGATCAGGTGTTGGATCCTGGGGCCTGCCCCGAAGCCGCGCCTCGTAAGCGACTCCGGGACAGGCAAGCGCAAGGCCAGGGAAGGAGGCCGCGCTTCCGAACAATCCCCCGGAAAGGCGGTCATCCTTCCGGGAATTGTGTGGACAGGGCTCAGTTCGAGGCGAGCGCCTCGCTGGCATATTTCGCCGTCACATATTTCGAATAGTCGGGCAGCACGGCATCGACCTTGCCCTGCTCCTTGAGGAAGGCGGACGTCGCCGCGACGGCCTTGACGGTCGCGCCGCCGAGGAACTTGTCCGAAGCCTGCTCTTCGAGCGACGGGAAGACATAGCCCTTGAGCAGCTCCGGCACCTCTTCCAGCTTGGCGCCGGTGAGCTTGGCGATCTTGCCGGCTTCCGGCGAGGACACCGACCAGGCCTCCGGCTTGGCGAGGAACTGGGCATAGGCTTCGCCGGTCACCTTGACAAAGTCGCGCACGGCTTCGGGGTGCTTTTCGGCAAAATCGGTGCGCACGATCCAGGCGTCGAAGGTCGGCGCGCCCCAGGCGGCCACTTGCGAAGAGTCCAGCACTACCTTGCCGGTCGTCTTCACCTGGCCGAGCGCCGGGTCCCAGACATAGGCCGCGTCGATATCGCCGCGCGCGAAGGCGGCGGCGATCTCCGGCGGCCTCAGATTGAGGATCTGCACCGACTTCGGATCGACATTCTCATGCTTCAGCGCCGCGAGCAGGCTGTAATGCGTGGTCGAGACGAACGGCACGGCGACCTTCTTGCCGGCCAGATCCGCGACCTTCTCGATGCCGGAGCCGTTGCGGGCGACCAGCGCCTCCGACGGACCGATCAGCCCGACGACGAAGATGGTCTGGATCGGTAGCTCGCGGCTGGCTGCCGCGGCCAGCGGGCTCGAGCCGACATAGCCGATATCGACCGAGCCGGAGGCGATCGCGGCGATGACGTCGGCGCCGGAATCGAACTTGCGCCAGTCGATCGCCGCCTTGGTCGCCTTTTCATAGGCGCCGTCCGCCTGCGGCACTTTCGAAGGCTCGACCACCGTCTGGTAGCCGATGGTGATCTTGAGATCCTCGGCGCTGGCAGGGCCGAGCAGGGCGGCGCCGGCAAGAGCGGCGGCCGACGAAAGCAACAGAATGCGTCGTGAAAGGATTGACATTGAAAGGCTCCATAAACCCCTGAAAACTGGATTGCCCTTCTATCGTCCGTTAATTCTATCAGGTTTGTAGAGTTAAATCCTTCCAACGAAGGGCGCTTTGTTGGAAAAGTGTCGCCGGGTAACGGCGGAACGGGCGCAAAAGGACGGTCGCGGGGTTGGTGCGGGTGGACGCGGCCGCCAGCCGCGCTCTTCGGTGCCCTGGGTGGTTGGCAAAGCCGTCCTTCCAGTCGTCATTCTAGGGCGGAGCAAGGAGCGAAGCGACGCGCGCAGACCCTAGAATCCTGTGTGTTGGTTCTGGTGTATGGTTGAGGTGGGAAGGAGACCGAGTGGATCGTGGTCGTCCTTTCGGACAGGCCGTGGCATGGCCCGGTCCCTTCCCACCGGTGAACCATCAACCTGAACAGTTGCACGGGGCTGTTCGAAGCAGACCCCGCACCACAGGAAGAGGCGTGGACATGATAGCCCAGAACTATGTCGGCTGCGACATCTCCAAGCAGTGGCTCGACTTTTTTGATGAGACAAGCGGCCGTCTTCGGCGCATCGACAACCAGGCCGATGCGATCGCCGCCTATGTGGCAGGCCTTGATCCCAGTCGGGACTTCGTCGTCATGGAGGCGACAGGTGTCCATGACCGGCTGCTGCGCCATGCGCTGGCCAAGGCCGGCGTGCCGTTCTCGCGGCACAACCCGGCGCACACCCACCACTATTCGAAGTCGACGCGCCGGCGCGCCAAGACCGATCCTCTCGATGCCAGGATGCTGAGCGACTATGGCCGTCGCTATAACCCTGAGGCCGAGCCGGCGCCGAGCGAAGAAGTCGAGCGGCTTCAATCGCTTGCCGGCCACCGCGATCATTTGGTCGAGATGCGGGCAGCGCTGAAGAAGCACCTCGCCGAGGCTTTCGAGGAGGCCGTCATTGCCGATCTGGAAGAGACGATCGCCTTCCTCGACACGCGTATCAAGGCGTTCGAGAGCCAGATTGCCGAAGTCATTCGTCAAAACCAGGACACCGCTCGCGACCATGCGCTGATGGTCTCCGTGCCCGGTGTCTCCAATGTCGGCGCGCTCTCTTTGCTGGCGCTCCTGCCCGAGCTCGGCCAGCGCTCACCCAAGGCGATCGCCGCGCTCGCCGGCCTTGCCCCGTTCGACAACAAGAGCGGCAAGCTCAGCCGCAGGAGCCAGATCCAGGGCGGGCGATCACGCGTGCGTCGCGCCCTCTATATGGCTGCCCTCACCGCCATCAGAACCTGTGAGCGCTTCAAGACCTTCTATACCGCACTTGCTGCCCGCTCAGGGTCCAAGAAACTCGCCATCATCGCCGTCGCAAGGAAGCTCCTGGTCGTCCTCAACGCCATCATGCGCGACAAAACCGCATTCGCGTGAACACAGTTGCCATGCCGTGAAATAGAAGAGATGCTGCGGTGCAGGCCGATCTTTCGGCGGGGGTAGATAAGGGTTCTATTCTCTACCGTTGCACCTCTCACGAAGGTAACGGCATGGATTCTAGGGTCTACGCGCCGCTTCGCGTCGCTCCGCCCTAGAATGACGAAGTCGTTGGATTGGCGCTCCCTTTCAAAGATACCAAAGCGGACTCTCCGGCCTAGAATGACGACAGCGGATGCGGCGCCCTTCCACAATCGCGGCAGCCTGAACGAAAATCCTCCTGTCACCGGATTTCAGCAAATCCATGCTTCAAATCGCCGGCGAATTGAAACTCGAGTTTCGTGCCCAGCCCTCTTAGTATAGTAATCTTATCAACATTGGGAGTGAGAGAACGATGCGCAACCTTCTGAAAACCGTTCCGGCGATCGTTCGTGGCCTGCGGCCGAGGCCGCCCGCGGCGCTCGCCGCGCGCTGATCGCCCCGTTTCGGCTCGCAACCATGGTTGGACGCGGCAACTCCATCATCATTGTCGGCGGCGGCGCCAGCGGCGTTGTGCTGGCCGTGCATTTGCTGATGTCACCAAACCCCGATCTGCGCGTCACGCTGATCGAGAAGCGGCCGCATTTCGGCCAGGGCATGGCCTATTCCACTCTCCTGTCGGCGCATGTGCTCAACGTGAAAGCCTCCGGGATGAGCGCCTATGCCGACGACCCCACCCATTTCGCACGCTGGGTGCTGGAGCGCGGCTTCGCCAAGCCCGACCAGGGGCCATTCTATGCGCCGCGCAGCCTCTATGCGAGCTATCTGAGGGAACTGCTCGACGATCTGATGGAACGTGAGCGCGAGACCGGCCGGTTGCGGCTGATTGCCGAGGAGAGCCTGTCGATCTCGCCGACGTCGGCTGGTGTCGAAGTGGTGCTCGCCAACGGCACCAGCGTCGTTGGCCATCTGGCCGTGCTTGCCACCGGCCATGACGAGCAGCCCGGCGCCTTCCAGGGCCACGCCATCCGCATGGGAACGGAGGCCGACACCGCGCTCGATCCGCAAGCCGGTGTCTTGCTGCTCGGCACGGGCCTCAGCATGGTCGACGCCTTCCTGTCGCTGGAACAGCGCGGTCATCGCGGCCCGATCGTCGCGGTATCGCGGCGCGGCCTGCTGCCGTCGCCGCATCGCAAGGGCAACCCGATCAAGCTCGACATCGCCGATATCCCGCTCGGCACCGAGCTTTCCTATTTCGTCGGCTGGTTCCGCAACCTGATCCGCGAGACCCAGAAGGCCGGCGGCGACTGGCGCGACGTGGTCGACGGGTTGCGGCCTTTCAACCAAACGATCTGGCAGAACTGGCCGTCCTCGGCCAAGCGCCGCTTCGTCGAGCATACCAAGGCCTGGTGGGACATCCACCGCCACCGCATGGCGCCGGAAGTCTACGCGCGCGTCACCGAGGCGGTCGGGTCGGGCCGCATCCGCCTCGTCGCCGGCAGGGTGGTGAATGTCGAGGCGAACGGCAGCTTCACCGTGAAAATCCAGCCGCGCGGCACACAGGATGTCGAGACCCTGCAGGTCGCCCGTCTTTATGATTGCATGGGCATCGCCCGCGATATTTCGAGGACGTCGAACGGCGTCGTCCGCTCGCTGATCGAGCGCGGCGTAGCGCGGCCGGATCCGCTGCGCCTCGGACTCGACGTCACCGCGAAATGCGAGCTGATCGCGGCCGACGGCACGGTGTCGTCGAAGCTGCTCGCAGTCGGGCCGCTGACGCGCGGCACCTTCTTCGAGATCGACGCCATTCCCGACATTCGCGTGCAGTGCGCGAAGCTGAGCAAGCAGCTGCTGGGGTGACATTTTTGTTGAGGTCGGCACTCTACGGCGCCCCCTCTGGCCTGCCGGCCATCTCCTCCACAAGTGGGGAGATTGGCTGTCGCGTAGGCTTTCGCCAATCGCCGACATTGCAGAAGAAGGCGCTGCGGCGGAGCTGCTGATCTCCCCCCAAGTGGGGGAGATGTCCGGCAGGACAGAGGGGGGCGCTGTCCCGCCAACTGCACCAGCAGAGGCGATAGAGCGCCCTCCGTGAAATTCCATTCCTCCACTCGCCGCCGGATTGGCACGATCTATCTTCTTTCCGGATGAAAAGGGCTGGAAAAGACAGGGCGCGCGCTCCGCCAAGCCGGAATGGATTCCGGCGGCTTGATGCGCGAGATTGCCGACATCCGCCTCGAACTGCTTCGATGGAGCACTCAAGATGAGCGAGCAGACAAAAGCGCCCGGCGGCGCGGACACCAACCTGTCGAGGCTTCGCCCGCCTTACGCTTCGGTGCTCGACCTGATCGGCCAGACGCCGATCGTCGAGCTGACCAAATTCGACATCGGCAAATGCCGGCTGTTCATCAAGCTCGAAAGCCAGAACCCCGGCGGCTCGATCAAGGACCGCATCGCTTTGTCGATGATCGCAGCGGCCGAGAAGTCGGGCGCGCTGAAGCGCGGGGGCACGATCGTCGAGGCCACGGCAGGCAACACCGGCCTCGGCCTCGCCCAGGTCGGCATTCCGAAAGGCTATCGCATCATCCTGGTCGTGCCCGACAAGATGTCGCGCGAGAAGATCCAGCATCTGCGGGCTTTGGGCGCCGAGGTGCGCATGACGCGCTCCGATGTCGGCAAGGGCCACCCCGAATATTACCAGGACATGGCCGAGAAGATCGCTTCCGAGCTGCCCGGCGCGTTCTATGCCAACCAGTTCGCCAACCCGGCCAATCCGCTGGCGCATGAGACGACGACTGGCCCGGAAATCTTTTGGCAGCTCGACGGCGATGTCGACGCGGTGGTGGTCGGTGTCGGCTCCGGCGGCACGCTCACTGGCCTTGGCCGCTTTTTCGCGAAACATTCACCGAAGACCGAGATGGTGCTCGCAGATCCGGTCGGCTCCGTGCTGGCGCCGCTGATCAAGACCGGCAAGATGGAGGAGGCGGGCAGCTGGACGGTCGAGGGCATCGGCGAGGATTTCGTGCCGCCCAATGCCGATCTCTCGCTGGTGAAAAAGGCCTATTCCATCCCCGACAAGCAGAGCATGCTGGCGGTGCGCGATCTCTTGTCCAAGGAAGGCATCCTCGCCGGTTCGTCCTCGGGCACGCTGTTGTCGGCAGCGCTTCGCTATTGCCGCGAGCAGACGGCGCCCAAGCGCGTCGTGACCTTCGTCTGCGACAGCGGCAACAAGTACCTGTCGAAAGTCTTCGACGATTTCTGGCTGGCCGAGCAGGGCCTTGCCGAGCAGGAGCAGCATGGCGATCTGCGCGACCTCGTCATGCGTTCGCACCGTACCGGCGACACCGTCTGGGTCGGGCCGGAGGAAAGCCTGCTCAACGCATATGGCCGCATGCGCCGTTCCGACGTCTCGCAATTGCCGGTGCTGGATCAGGGCAGGCTGGTCGGCATCGTCGACGAGAGCGACATCCTGGCCAAGGTCGACGGCCCCTATGACGGCCGCTGGGACCGCTTCAACGCGCCGGTGCGCACGGCCATGACGTCGAACCTGCACACGCTGCAGGCCAACCAGACACTGGACGCGCTGCTGCCTGTCTTCGACCGCAACGAGGTCGCCATCGTCTTCGATGGCGAGGAGTTCATTGGCCTGATAACCCGTATCGACCTGATCAACCATCTGAGGCGCCGCGCAAAATGACTTCAAACGGCAAGAACCGCCTGGCCTTTTCCACCCGCACCATCCATGGCGGCCAGAGCCACGACCCGACGACCGGCGCGGTGATGGTGCCGATCTACGCCACCTCCACCTATGGCCAGCAGTCGCCCGGCGTGCACAAGGGATTTGAGTACGCGCGCAGCCAGAACCCGACGCGCTTCGCCTTCGAGCGCGCGGTCGCCGACCTTGAAAGCGGCACGAAAGCCTTCGCCTTCGCCTCCGGCCTAGCGGCGATCTCGACGGTGCTGGAGCTGCTCGATTCCGGCGCCCATATCGTCGCCACCGACGACATCTATGGCGGCTCGTTCCGGCTGATGGAGCGGGTGCGCAAACGTTCCGCGGGCTTGCAGGTCAGCTTCGCCGATTTCACCGACCTGGCCGCGGTCGAAGCCGCGATCCGGCCGGACACGAAGCTGCTCTGGGTCGAGACGCCGACCAACCCGCTGCTGCGCATCGTCGACCTCGAAGCTCTCGCGGCGCTTGCAAAGCGTAAGGGTCTGCTTACCGTTGCCGACAATACTTTTTGCAGCCCCTATATCCAGCGCCCGCTGGAGCTCGGCATCGACATCGTCGTGCATTCAACGACGAAATATTTGAACGGCCATTCCGACATGGTCGGCGGCGTGGCGGTCGTCGGCGACAACAAGGAACTCGCCGACCAGCTGAAATTCCTGCAGAACGCCATCGGCGCCATCTCCGGCCCCTTCGACAGTTTTTTGGCCCTGCGCGGCATCAAGACTTTGGCGCTGAGGATGGAGCGCCACTCGGCCAATGGCCTGAAAATAGCGCAATGGCTGGAAACGCGAAAAGATGTCCGCCGGGTCATCTATCCAGGCCTCGCCAGCCACCCGCAGCATTCCATCGCCGTCCAGCAGATGCACGCCTTCGGCGGTATGATCTCGGTCGACCTCGACCGCGACCTGGCGGGAACGAAACGCTTCCTCGAACGCACCCAGCTTTTCACGCTGGCCGAAAGCCTGGGCGGCGTCGAAAGCCTGATCGAGCACCCGGCGCTGATGACGCATGGCTCGATCCCGGCGGAGAAACGCGGCGCCATCGGGATTACCGATTCGCTGGTGCGGCTCTCCTGCGGCATCGAGGACGGCGATGATCTGGTCGCCGATCTGGAGCAGGCGCTGGCGCACTGAGCAATTCCAGGAAAAGTGTGAGCGGTTAGGCTCGGCATCTTCGCCGTAGCCTTCCGCCCGGGAATTGCGACAAGCAAGACAAACAAAAAGATAGAGCGCTTCGCCGTTTCCATGAAACGGTGAAACGCTTTAAGCGCAACTGAGTGCACTTTGCGATTTCAGAACGTATCATAGAAGCGCCCGCCGATTGGCCGGCGGCGCCGGGATAGGAACTGATGACACAAGGACGCGGCAGTGGCGAAAAATCGGCGGCTACGCGTGCCGTCCGCGCGGCCGACAGCGTCGAGCGTGTCTATGCCCGGATCAAGGACTTCGCGATCGACTATCGTTTCCGGCCGGGCGAGCGGATCAACGAGGTCGAGCTCGCCGCCGAGCTGGCGGTCAGTCGCACGCCGGTGCGCGCCGCGCTCAATCGGCTGGAGCGCGATGGTTTTGTCACCTCCGTGCCGAACAAGGGCTTTTTCGCCCGCGAGCTGACGCCGGAGGCGGTGCGCGATCTCTACGAGCTTCGCGCGGCGATCGAGCGTGCCGCCTTCGTGCTTGCCTGCGAGCGGGCGAGCGATGCCGAGATCGAGGCGGCAGCCGGGACCTGGACAAGACACAGCAGCCCGGAAGACGAGGGCTCATGGGCGAAGATTGCCGTCGCCGACGAGAGCTTCCACATGGGCCTGACGCGATTGTCGAAGAATGCGCAGATGATAAGCGCGCTCGAAGGTCTCTGCTCGCGTATCCGGTTCTTCAGATGCATTGATCTCGAGTCTTCGTCGCGTCGCGAACGCACCTATCGGGAGCATGACGCGATCATTCGCGCCCTGCGCCGCCGCGACGCCGCGGGCGGAGCGAGCCTGCTCGAAAAGCACATCACGCTGAGCTCCGCGCACGCCATCGAAGTGGCCGCGCGCGGCCTGGCGCGGCTTTTCCCGGAGACCGCGGCGTAGAGCTTTCGCCGAAAACCTCAACGCCACGATCCGCCGGCATCTCAAAAGACGGCTATTTTGCAGGCTGCCCCTTGTGGATCTCGCGATAGATTCTAAAGTGTACTCACTTTGCGATCTCCGGAGGCGTAATGCGCGTGATCGTATTGGGTGGCGGCGTCGTCGGCGTCACCACCGCCTACCAGCTGCAGAAGGACGGGCACGAGGTCGTCATCCTCGAGCGCCAGCAACAGGTCGCGGCTGAGACCAGCTGGGGCAATGCCGGCATGATCGCGCCGGGGCATTCCTTCGTCTGGTCCTCGCCCAGGGCGCCGATGATCCTGCTGAAATCGCTGGTGCTGAAGGATCAGGCGCTGCGCTTCAGGCTTTCGGCCGATCCGAGGCTCTACAGCTGGTCGTGGCTGTTCCTGACGGAATGCACGGCCGAGAAGGCCAGGCGCAACACGCTGCTCAAGCACCGGCTCGCCGTCTATTCGCAATCGGTGCTGCAAGAAGTGGTCGCCGACGAGGCGATCGACTACGACCGCAACGATCGCGGCATCCTTTATTTCTATCGCAGCCAGCAGGCGCTCGACAAAGGCGTCGAGCATATGCGGCTGCTGGAATCCGACGGCCAACTGATCAAGGTGCTCGACCGCGACGCCATCGTCGCGCTCGATCCCTCGCTGGCGTCGGCCAGGGAAAAGATCGCCGGCGGCATTCATTGCCCGACCGACGAGACCGGCGACCCGGCGAAATTCACCCGGGCGCTGGCCGCCAAGGTGGTCGGTCGCGGCGGCGAGATTCGCACCGGCACCACAATCACCGGCATCGAGACGTCGGGCGACGGCGTCGCGCAGGTGATGACGGACAAGGGCGCGGTCAAGGGCGACGCCTATGTGTTGGCGCTCGGCTCCTACAGCCCGCTGATCGCCAGGACGATCGGGCTAAGCCTGCCGATCTATCCGGTCAAGGGCTATTCGCTGACCATCCCGATCGGCAACCGCCCGGCGCCGCCGACCATCGCCGCGATCGACGAGCACAATCTGGTCGCCGTCTCGCGCTTCGGCGACCGGCTGCGCGTCACCGCCACCGCCGAATTCGCCGGCTACGACACCAGCCACAAGCCGGCCGATTTCGCCTTCATGAAGGGCGTGACCGAGGAGCTTTATCCGGAAGGCGCCGATTACGACCGCGCCGAGATGTGGGCGGGCCTCAGGCCGATGACGCCGAACAACCTGCCCGAATTCGGGCAGCGGCGCCTACGCAACCTCTACCTCAACACCGGGCACGGCCATATCGGCTGGACCATGTCGCACGGCTCGGCCCGCATCACCGCCGACCTGATCGCCGGCCGCAAGCCGGCCATTTCGATGGATGGACTTTTGAACTGAACGCATGATCCCCGACGGGATCCTCCAGGAAAGGGAATGCCATGTCTGTCAGCGCGGCCGCTACCCGCCCGCAATCTTCTGGACCGGTGGACCTTGGCGTCCTTCCATCGGAGATCGATGGCGGGCTCGCCTCGCGCGCGGCGATCGGCCTGGCGATCCTTGCCACCGATCAGACGCTGGAGCACGAGTTCCGCGCGCTGGTCCGGATTCCGGGCGTCGCCTTCTACGAGGCGCGCCTGTTCAACGACAATGACATCACGCCGGACACTTTGCGCGCCATCGGCCCGCGCATCGCCCCCACCGTCGATCTCATCCTGCCCAGCATCCCGCTCGACGTCGTCGGCTTCGGCTGCACTTCGGCCACCGTGACGCTTGGCGAGGAAGCCGTCTTCGCCGAAATCCGCAAGGCGCGGCCGGGCGTCGCCTGCACCACGCCGGTCACCGGCGCGCTGGCCGCCTTCAAGGCGCTGGGCGCGAAGGGCATCGGCCTGCTCACCCCTTACGCGCCCGAGATCAACCAGGGCCTGGTGCGCTACTTCACCGGCCGCGGCCTCGACATCGCGGCCGTCGCGACCTTCGACCGCCGCGACGACCGCGAGGCGGCCCGCATCTCGCTCGCCTCGATCGAGGCTGCCGCCGAGCGCATGACTGAAGTGCCCGGCGTCGATGCGATCTTCATCTCCTGCACCAGCCTGCGCGTCGCCGAGGCGGTCGCCGACCTGGAGCGGCGCATCGGCATTCCCGTCACCTCCTCCAACCACGCCATGGCCTGGCATTGCCTGCGCCTCGCCGGCATCGACGACGTGGTGCCGGCGGGTGGCAGGCTGTTCGCGCTGCCTGCACGCTGATCAGCGCGGCGAACTTTTCAGGCGATAGGAAAGCGCATCCGGCATCCAATCGCATAGGGCGCTCCCCTCTCCGTCTCGGCTTCGCCGAGCCACCTCTCCCCGCCTCTGGCGGGGCGAGGAACCCAGGCTTGCGAAGGCCGTGGCTTTGGCGATTGGCATTTCCTCGCCCCCACAAAGTGGGGGGTCCGAAGGACGGGCGAGACCCGTGGCTCGCCCCGGCACGGTGGATCGGCGCGCAGCGCCGAGACGGAGAGGGGCCAGCGCTGCCATGGGCGATTGATCTCAGGCAGCCGCAGACGTCACTTGCTCTCCCATGGCTGTGGGGCGTTCATGAGCCTGGATAGCTTCTCATTGGATTTGGCCGGCGCATCGAGAAGCGCCCGGAACTCCGCATGGACCTTCGGCGGAACCTTGATGCGGTTCTCGAGTGTCTCACCAAACGGAGGCGCGTCTGTGCGGTCGTTCTTCATGACTTCAACATAGCACATCGTCGCCCATTAACGGAATCGGAAGGAGATGTTTAACGCTTCCTCTCTATCTCCCGCCCCGGGTGGCGCAGACAGAGCGAGCATGATGAACGACAGCCCCGAAAAGCGAAACGAATGGCGTGCCATCTTCTACGTTCAGGCCCGCATCGCCATCCTGTTTGTGCCGGTTGTGCTTAGCCTGCTGCTCATCGGCATTTTTGCCGGCAACGAGCGCAAATCCGTGCCCGACGCCATCGACCCGACTGTGACGGGCTCGGTGCGGTAAAGCGCGGCCCGGAACGGCTGGACTCAAACAAACTTCGGGATCGGCCCGTTCTGGGGCGTCGTATCGCCCTCAAGGTCGACGAAAACCTCGTCGACGAAGCACCAGGCCCAGCCTTCCGGCGGGTCGTAGCCCTCGATGATCGGATGCTGCGTGGCACGGAAATGTATCGTCGCGTGGCGGTTGGGCGAGTCGTCGCAGCAGCCGACATGGCCGCAGGTGCGGCAGAGCCTGAGATGCACCCACCAGGATCCGCTCTTCAGGCATTCCTCGCAGCCCAGCGCGCTCGGCGTGACCTTTTTGATGTCCCTGGTGTGCCTGCATTCGTCCATCGCGGGTTCCTCGCCGGTTCGGGCTGATTTATTCGTTTGCGGGGGCGCCGTTCCGCGCCAGATAGGCATGTAGCGCCGCGACCACCTGCGCGCCTTCGCCGACGGCGGCTGCGACGCGCTTGGTCGAGCCGCAGCGCACGTCGCCAATGGCGAAGACGCCGCTGCGGCTGGTTTCCATCAGCCCGTGCCCCGGCGTCGTGTCCGGTCCGGTCCGCACAAAGCCCTTCCCATCCAGCGCCACATTGCAGTTCGCCAGCCAGTCGGTGTTCGGATCGGCGCCGATGAACAGGAAGAGGTGGTGGATGGCGCGCTCCGTCTCCTCGCCGGTCACCCGGTTGCGCCAGCGCAGCCGCTCGAGATTGCCGTCATGGCCTTCCAGCGCCACAACCTCGGTCTGCGTCAGCACCTCGATGTTGGGCTGCGCCTTGATGCGCTCGACCAGATAGCGCGACATGGTGGCATCGAGGCTGTCGCGGCGCGCCAGGAGCGTCACCTTGCGCACCTGGCTCGCCAGATAGACGGCCGCCTGTCCGGCCGAATTGCCGGCGCCGACCAGCGCCACCTCCTGGCTCTGGCAAAGCCTGGCCTCGATCGGCGAGGCCCAGTAATGCACCGAGGTGCCCTCGAACTGCGCCAGGTTGGCGATATCGAGGCGGCGGTAACGCGCGCCGCTGGCGATCACCACCGCGCGTGAGCGAACGCTCTCGCCATCGCCGATGGCAAGCCTGTAGCGGGCGCCGTCGCCGGCGTCGTCGAGCAGCTTCGCCTCGTCTGGGATCACCATCTCCACACCGAATTTCTGCGCCTGGTTGTAGGCGCGCGCCATCAGCGCCATGCCGGTGATGCCGGTCGGGAAGCCGAGATAGTTCTCGATTCGCGCCGAGGCACCGGCCTGTCCGCCGAAGGCGCGGCAGTCGAGCACGATGGTCGACAGTCCTTCCGAGGCCGCATAGACGGCCGCCGCCAGCCCCGCCGGCCCGGCGCCGACGATCGCCACGTCATAGAGCTTGTCGGCATCGATCGGCCGTAGCAGGCCGACGCAGCGGGCAAGCTCGTTCTCGCCCGGATTGTGCATCAGCTTGCCGTTCGGGCAGAGCACGATAGGCAGATGGTGCGGATCGACATGGAACCGCTCGATCAGCGTCTTGGCGCAAGGGTCGGTGTCGGAATCGAGCGCGCGATGAGGAAGCCCGCTGCGCCGCAGAAACCCTTGCAGCCGCAGTACGTCGCCATTGCCTGGAGGGCCGATCACCACCGGCCCGCTGGCGCCGCTTTCGAGCAGGCCGACGCGGCGCAGGATAAGCGCCCGCATCACGCGCTCGCCGAGATTGGCCTCCTGCACCATCAGGTCGCGCAGCCGCTGCGAGGGGATGACGATCGCCTCGACCGGCTCGACCGCCTCGGCATTGACCAGCGACGGCCGGTTCGACAGCTGCGCCAGCTCGCCGATGAAATTGCCGGCGCCATGGGTGACGATCGGCTCAGGCTGACCGAGCCCGCCGGCCTGGGTGATGTCGACCTTGCCCGACAGGATGAGGATCACGCCCGGCGAAATCTTGCCGGCCGTCACGATATGCTCGCCGGCGGCATAGGCGCGCGCCTCGCCGAAGCGCCGCATGCGCTCGATATCCGCCTCCGAGAGGATCGGGAACATCTGGTCGCGGCGGGCGGCGATGGTCGGGCTGACGGAAGGGGCCATGGCGCGACCGTAGCGTCATGGCCGCCGGGTTTGAAGCGGTTTGTTCGAGGACTGGCCGGAAGGGGAGGTTGGGGCGGCACCCTATGCGTCGTCATCCTCGGGCTTGACCCGAGGATCCATGCCGTGACCTTTGTCGGAGAACGCAGCAGAGCAGAATTCTGGAGCGCCGCGCCTCGACGTCACGGCATGGATCCCAGGGTCTCCGCGACGGAGCTTCGCTCCTGCTCCGCCCTGGGATGACGAAGTTCCGAGGTCAAAGGCTAATCTTGAACGTCGTGATCGCTGGCGGATAAGGCCGATCCCGTTGGTAGTCGGGAAAAGATGGTGCCCAGAAGAGGACTCGAACCTCCACTCCTTGCGGAACACGGACCTGAACCGTGCGCGTCTACCAATTCCGCCATCTGGGCTGGTGGGCGCTCATGTAAGCGGGCAAACGATATGTGTCAACGCGCTTTTTTCGTCGATGTCTGCGCTGCCCTCATCGCCTTCCTGCAACGCTGGCGATTGGCGAAACCGGCGGCGGGAGCTTCCCTCTCCCCGCCACTATACGGGGAGAGGATGCCGGCAGGCAGGTGAGGGGCGGCGCCGACGCTGGCCAAGTGGGGTTCCGGCTCTTCTCTGGAGACCGGACGAGATAATCGTCGAAGCCGGCGCTGCCCCTCATCCGCCTGCCGGCACCTTCTCCCCGTGAACGGGGAGAAGGAAGCTGCTTCTCACCCCTCCAGCACTTCCTTCTTCGCCACGGTGGAATCGGCGTTGAGCTTGTAGATCACCGGCACGCCGGTGCCGAGCTCGAGCTTGACGATCTCCTCGCCGCTCTTGCCGTCCAGCGCCATGATCAGCGCGCGCAGCGAATTGCCGTGCGCCGCCACCAGCACCGTCTCGCCGCGCAGCACATGCGGCTGCACCTCGTGCAGATAATAGGGCCACACGCGCGCGCCGGTGTCCTTCAGGCTTTCGCCGCCGGGCGGCGCGATGTCGTAGGAACGGCGCCAGATATGTACCTGCTCCTCCCCCCATTTCTTGCGCGCGTCGTCCTTGTTGAGGCCGGAAAGGTCGCCATAGTCGCGCTCGTTGAGCGCTTGGTCGCGGATCGTCTTGAGCTCGCTCTGGCCGACGACGTCGAGGATCAGCTGGCAGGTCTTCTGTGCCCGCTGCAGCGCCGACGTATAGGCGGTGTCGAATTTCAGGCCGCGCGCCTTGAGCTTTTCGCCGGCCGCGAGCGCCTCGGCGGTGCCCTGCTCGGTCAGGCCGACATCACGCCAGCCGGTGAACAGGTTCTTCAGGTTCCATTCGCTCTGGCCATGGCGCACGAGCACGAGAGTTCCCGACATGTTTGCTCCTTTTGAATTTTACGGAAGAGACGCCTCAGCTCAGGCCGAGCACGTCCCGCATGGAATAAAGCCCGGGCTTCTTGCCGCGCGCCCAGAGCGCCGCCTTGACCGCGCCGCGGGCGAAGATCGCGCGGTCCTCGGCATGGTGGGAGAGCGTGATGCGCTCGCCGGTGCCGGCAAGGATGACGCTGTGGTCGCCGACGACGGAGCCGCCGCGCAGAGTGGCGAAGCCGATCGAGCCTGCCTTGCGCACGCCGGTATGGCCGTCGCGCACCCGCACGCTGTTGTCGGCAAGGTCGATGCCGCGTCCCTTGGCCGCCGCCTCGCCGAGCAGCAGCGCCGTGCCGGACGGCGCGTCGACCTTGTGGCGGTGGTGCATTTCCAGGATCTCGATGTCGAAATCCTCCGGGTCGAGCGCCTTAGCGGCCTGCTCGACCAGCACCGCCAAAAGGTTGACGCCGAGGCTCATATTGCCGGATTTGACGATCGTGGCATGGCGCGCGGCGGCGGCGATCTTGGCATCGTCGTCGGCTGAGCAGCCGGTGGTGCCGATGACATGGACGATGCGCGCCTGCGCGGCATAGCCGGCGAATTCGACGCTCGCCGCCGGCGCCGTGAAATCGAGCACGCCGTCTGCCCTGGCGAAGGCCGGCAGCGGGTCGTCGACGATCGGCACGTTGATGATGCCGATGCCGGCAAGCTCGCCGGCGTCCTTTCCGAGATGGGGGGAGTCCGGCCGCTCGATCGCGGCGGCGACGCGTGCGCCGGGCATGGTGTGGATGGCGCGGATCAGCGTCTGGCCCATGCGGCCGGCGGCGCCCACCACGACCAGGCCCATATCGCCGGCTTCACTCATGCGCTTCTCCTGACGGTCTTCCTGGCGCGGCTGCGCGGCGCCGGAGTTTGGCCGGAAGTCTGGCTTGCCTTGGCGTCGTCGACAAGCCCCAGCCCCTTCTTGCCCTGGATCCGGTGGAAGCGGGCATAGATGCCGTGCGGGTCGGCCATCAGCGTCGCATGCGTGCCTTCCTCGACCAGCCTGCCCTCTTCCAGCACGATGATATGGTCGGCATTGACCACCGTCGACAGCCGGTGCGCGATGACGATCGTCGTGCGGCCTTCCATGACATGGGTCAGCGCCTCCTGGACACGCGCCTCGGCCTCGTTGTCGAGCGCCGAGGTCGCTTCGTCGAGCAGCAGGATCGGCGCCTGGCGCACGATGGCGCGCGCGATCGAGACGCGCTGGCGCTGGCCGCCGGAAAGCGTCGAGCCGCCTTCGCCGACCGGTGTGTCGTAGCCTTGCGGCTGCTGGCGGATGAACTCGTCGGCCGCCGCGAGTTTCGCCGCCTGCTCGATCTCGGCGTCGGTGGCCGACAGCCGGCCGAAGCGGATGTTGTCGCGGATCGTGCCCTCGAACAGATAAGGCGCCTGCGCCACATAGGCGATCGATTGGCGCAGCGAATGCTTGGTCACCTTGGCGATGTCCTGGCCGTCGACCTCGATCGAGCCCTTGTCGACGTTGTAGAAGCGCTGCAGCAGCGCCACCAGTGTCGACTTGCCGGCGCCCGACGCGCCGACGATGGCCGTCACCTTGCCGGCCGCGGCGGTAAAGCTCAGGTCCCGCAGCACCGGCGTGTCGGCATTGTAGCCGAACGTCACATTGTTGAAGCGCACCTCCCCGGTGGTGACCTTCGCCTCGACCGCGTCGGGCGCGTCGCCCTGCTTCGGCTCGAGGTCCAGCAGCTCGTAGATCATGCGCGCGTTGACCAGGGCGCGTTCCATGCCGACCTGCGTGCGCGCCAGGCGCCGCGCCGGGTCGTAGGCCAGGATCAGCGCGGTGATGAAGGAGAACACCGCGCCCGGTGGCTGCCCGAGCACCAGCGCCCGATAGCCGGAATAGGCAAGCACGGCGGTGACGGCGAGGCCGCCCAGGATTTCGGAAATCGGCGACAGCCGCTCCGCGACGCGGGCGATCTTGTTGTTACGCTGCTCGGCCGTGTCGGCCATGATGCCGATGCGGCGCGCCAGCTCGTCCTCCAGGGTGAAGGCCTTGACGATGGCGATGCCCTGCGTGGCCTCCTGCACCGACCCGTTCAGCCGCGAGTTGATCAGCACGGATTCGCGGTTGATCTTGCGCAGGCGGCGGGTGATGTAGACGACGGCCCAGATCAGCGGCGGCCCGATCAGCAGCGAGCTCAGCGACAGCACCGGATCCTGGTAGATCATCACGCCGACGAGCGCGACGAGCGAGACCGCGTCGCGGCTGATGGAGGTCAGCGTCAGCGACAGGAGATCGCGGATGCCGCCGACATTCTCGTTCACCTGAGCCGCCAGCCGGCCGGAACGGGTCTCGTTGAAGAAGTCGACGCCGAGCTTCATCAGATGGTCGAAGCTGCGCTTCTGGTAGCGGGCGACCAGATTGTTGCCGATCCTGGCCAGCGCGACCGCCTGGCCGTAGCCGGCGAAGCCGCGCAGCACGGAGGCGGCCATGAAACCGGCGCAGATCCAGACGATCATGTCGCTGCGCCGTTCGTAGAAGATCTGGTTGATCATCGGCGCCATGATCCACGCCGTGAAGGCGGTGGTGCCGGAAACAACCAGCAGGCAGGCGACGGCGACGACATAGGTCCACCGATATTCCTTGCCGTTTTCGGCCAGGATGCGGCGCAGCACCGCGCTGACCTCGGTGGGCTGGACTTTAAGTTTGAGCGAAGTTTGAACGGTCAAATCAGGGGCGCTTCGTGGGTTTTCGTCGGACAGGGCTGCCTCTCGGCACCCCTCTTAGCGCCCCGGCTACGGCTTGCCTATGGCGAAACTTCGTCGCGCCGGCCGATCGCCGGCCCGAATCCGGGCGCGATCAGGCGCGCCAGTGCCGCCCCGCCGTGTTGACGCCGAACAGCGACGGCGTCCTGGCATAGGCCGCAAGCCCGAGCAGCGCCGCCAGCGGATGGGTGATGACATAGACCGGCATTTCGCGCATCAGCGCGCTGTGCGGCGCCTTGTCCTCGAAGGCGGCGCGGAAATTGCCTTGCTTCAGCGCCGGCACGATCTTCTGCGCGATGCCGCCGGTGAGGAACACACCGCCCCGGCTCATGAACACCAGCGCCAGGTCACCCGCGGTGCGGCCGAGGCAGGTGACGAACAGTTCCAGCGCCTCCTCGGCGATCGGGTCGGATTTCGCAAGCGCCGCCGCGGTGATCTCGGCCGGCGTGGTGAAGGGCGCATGCCTAGCATCGGCCTTGGCCACGGCGCGATAGACATTGACCAGCCCGCGCCCGCACAGGATCTGCTCGCCGGAGATGCGGCCTTCGAGCTTGTCGATATGCGGGAACACCTCGAAGTCGCGCGGCGTGCGCGGGCCGATATCCATATGCCCGCCCTCGCCCGGCACCGGGATCCAGTGGTCGAGCGCATAGATCAGTCCGGCGACGCCAAGGCCGGTGCCGGGGCCGAGCACGACGCGGCTCGCATTGGGTTCCGGCGTGCCGCCGCCGACCTTTTCCATATGCTCCTCGCCGAGCGCCACGACCGCCAGCGCCTGCGCCTCGAAATCGTTGAGCACCACGACCTCGGTCAGGCCCAGATTGGCGATCATCTGCCGGGGCTTCACGACCCAGGGGCAGTTGGTGAGCGGAATCTCGTCGCCGTCGACCGGCCCGGCGATCGCCAGCACCGCCGAATTCGGCTGGATGGACGAGCGGTCGAGCACCGCCGCCTGGATCGCGTCGTCGATGGTCTTGAAATTCGCCGTCTGGACGATCTGCGGCTCGGTCGCCTCGGAATTAGCGTCGAGCACGATCGAGAAACGCGCATTGGTGCCGCCAATGTCGCCGATCAGGACCGGAAACCGCAGTCCCTTCTCGTCTTCGCCTGCCATGCTCTGATCGTCCTCGTTGCGGCGCGGCTGCGCCGTTTCAATCGATTGACTAGCGCATGACCCCGGAAAGGGGAAGCCCGCTTCGGCGTCAGTTCCCCGGTCTAGGCCTTGGCAGCGGAATGCCGATCTTGGGCGTCGGCGCGAAAGCGCTCGCCGCCGCCGGCATCGCGGCATTGCCGCTGGAGATCGTCTCCGCCGGCTCGGTCGGCTGCGGTTCCGCCACGGCGACCGCGGTGGCGCTGCCGCCCTGGTAGGTCACGGCCGCCAGCGACGGCGCATCCGTCGCGGCGAGCACCGCCTGGCACTCTTTCGGCAGATTGGCCATCGTCATCAGGTCGCGGGCCTTGGGCGCATCGGGGTTCTTGTTGGGCCGCCACGGCTCCTCGGTGAACCACCAGGCCAGCGGCTTGCCGCAGCCGTCGTCAGCGGGCGTCGCCTCCTGCCCCTTGCAACCGGGCGAGCCCGGCTGGCAGCCGATGCGCATGTGGAAGTGATAGTCGTGGCCCCAGAACGGCCGGATCTTGCGCAGCCAGGAACGGTCGCCGGTGACGGTGTCGCAAAGCTCCTTCTTGATGCCGGGATTGACCAGGATGCGCTCGACTTCCGGATAGCTCGCGGCGCGCTTGAGCAGCCTTGTGTGCGCCGGCGTCCACAGCGCGTCCTTCACCAGATGCGTCTTCTCGTCGACCATCAGCGTGGCGCTCATCGATTCGCGCTGGGCCATGCTCAGCGGCCGCTTCGGCATCGGCGTCAGCCAGATGTCGGCGTCGAGCCCGATCTGATGCGAGGCATGGCCGGTCATCATCGGTCCGCCGCGCGGCTGCGAGACGTCGCCGACCAGAAGCCCCGGCCAGCCGTCGGCTCCCGCGTCGCGCGACAGCTTTTCGATCAGCGCGATCATGGCCGGATGGCCCCAGCGCCGGTTGCGCGAGGGGCGCATCACTTCCCAGGTCGGACCTTCCATCGGCAGGGCGACGCCGCCGGCGAAGCAGCCCTTGGAGTAGAAACCGAAGGATTGCGCGGGCACCACCGCAGGCAGTTTCTTGGTGCCGAACAGGTCCTTGGCGCGCGGTTCGGCCGAAATCGCCGCCGCGACCAGCGCAGCCAAGGTCACCAGCGCCAGCGCGGCGGTCAGGAACGGCTTCCTGTCGGGCAGCGATCGCAAATTCATCAGGCCGGATCCCTCTCCAGTCAATCCGTTTGCGGCACGGTCAGGAACGAATCATACCACGCCGCGCGTCCCGCATCGATCTTCAGCGCCTCATCGGGCTGCGCGGCGCTCCGTCCCGCCAGTTGCCCTCGGCCCACATCCGCTCCAGCGCCACCCGGTAGTTCGGGAATCGGAAGCGATAGCCGGCCGCCTTGATCGCCTTGTTGGCGACCCGCTTGTTCTCGCCATAGAAAGAACGCGCCATGGGCGAAAGTTGGGCGGTCTCGAAGGGAATTTCCGGCGGCGGCTCGACGCCCATCAGCCCGGCGGCGTAGGCCACGACATCCTGCGGCGGCGCCGGCTCGTCGTCGGTGACATTGAAGATGCCGCCGAGGTTGCCGCCGGCGAGGTGCCAGAGCGCGCCGGCAATGTCGTCGCAATGTATGCGGTTGAACACCTGCCCCGGCTTGACCAGCCGCCGCGCCGTGACCTCCTCGAGATTGGCCAGCGCGTTGCGGCCGGGCCCGTAGATTCCGGAAAGCCGCAGCACGGCCGCCGGCTTGCCGATCTCGCGGCCAAGCGCCAGCCACTCCTGTTCGGCCGCCACCCGCATCACCGAACGCTTCGACACCGGCCGGCAGTCGCTGGTCTCGTCCACCCAGGCGCCGCCATGGTCGCCATAGACGCCCACGGTCGACAGATAGCCGATCCATTCCAGCGCCGGCATCTTTTCCCGCAGCGCATCGCCGGCGGCCTTCAGCACCGGATCGCCGGCCTCGTCGGGGGCGACCGAGACGATGAGATGCGTGGTCCTGGCAAGCGCCTCGCCGAGCTCGGGCGACAGCGCGCCGTCGAATTGCAGCGGCTCGATGCCGGCCGATCGCAGCGCCTCGAACTTTTCCGGCGCCCGGGTCGTGCCGGAGATCGGCGCGTGCTGGGCGTTGGCCCGCGCGAAAGCCTTGCCGGAATAGCCGGCGCCAAAGATAAAGAAGCGTCTTTCGCTCATCGATGAACCTCTACGGATGCCAAGCCCGCCAGCCATTCTTCGCGAACCGCCGCATCGGTCTCGGTCCTCGAAGCCATGGCGGCAAGCTCGCCAAATTCGCGGTCGGGGAGCAGTCGCGACAGGCCCCAGATTGCCGCACCCCGCACCAGCGGCGAGGCGTCGCCGAGCAGGCCACGCACAATGGGCGCCAGCAACGGGTCGCCCGAATTGCCGGCGGCGATCAGCACGTTGCGCACAAAACGGTCGCGGCCGATGCGCTTGATCGGTGAGCCAGAGAAGAAGGCGCGGAACGCCTCGTCATCGAGCGCGAGCAGGTCCGAGAGCTTCGGCTCGCGCAAATCCTCGCGTGCGGCAAGCTTCGCCTCCGAGGCCGCGCGGGCGAATTTGTTCCACGGGCAGGCGGCCAGGCAGTCGTCGCAGCCATAGATGCGGTTGCCGATCTTGTCGCGGAATTCGCGCGGGATCGGCCCCTTGTTCTCGATGGTCAGGTAGGAGATGCAGCGCCGCGCATCGAGCCGGTAGGGCGCCGGAAACGCATCGGTCGGGCAGGCGTCGAGGCAGGCGCGGCAGGAGCCGCAATGGTCGATCTCGGGCTTGTCCGGCTCCAGCTCGGCCGCGGTGAAGATCGTGCCGAGGAACAGCCAGGAGCCATGCGCGCGGCTGACCAGGTTCGTGTGCTTGCCCTGCCAGCCGAGCCCTGCGGCTTCCGCCAGCGGCTTTTCCATCACCGGCGCCGTGTCGACGAACACCTTTACGTCGCCGCCCGCCTTGGCCACGATCTTGCCGGCGATCTCCTTCAGCCGGCCCTTCATCACGTCATGATAGTCGCGGTTTTGCGCATAGACCGAGATCGCGCCGCGGTCGGGTCTTGCGAGGATGTCACGTGGATCCTGGTCGGGGCCGTAATTCATCGCCAGCACGACGATAGAGCGCACCTCCGGCCACAGCGTCGAGGGCTCAGCCCGGCGCTCAAGCGTCTCGGCGATCCACCCCATCGAGCCGTGAAAACCGTCGGCGACGAATTCGGCCAAGCGTGCGGGCGCCAGCGGGATCGCGTCCGGACGTGTCACGGCGACCGCCTCGAAGCCGGCGCGGCGCGCCTCCGCCTCGATCAGCGCGCGCAAGGTTTCAGAAGTCGAGGTCCGCATAATGCGACACCGGCGACAGGCCGCGCACCCGGTCGGAGAGCAGCGGCCGGAACGACGGCCGCGATTTCACCCGCGTGTACCATTCGCGCGCGGCGCTGTGTTCGCGCCAGTCGATCTCGCCGAGATAATCGAGCACCGAAAGCGTCGCCGCCGCGGCGAGGTCGGCATAGGTCACCTTGTTGCCGGCCAGCCAATGGCGCGTGCCGGCCAGCCAATTGGTGTATTTCATGTGCTGGCGGATGTTGGCGCGCGCCGCCCGGATCGCCGCCGAATCGGGCGAGCCGCCGCCGGCGGTTTCCGGCATCACCGGCTTCAGCACGCGCTCGCGCACCAGGTGGCGGGTGACCTCGCTTTCGGCCTTCGCCAGATACCAGTCGGTCAGCCGGCGGATTTCGGCGCGCTGCATCGGATCCTCGGCGAACAGCCTTTTGTCGCGCTTGAGCACGCCGCGCGTCTCGTCGAGATATTCGGCGATCACCATGGCGCCGACGATCGGCACGTCGCCTTCGGCCAGCAGGATCGGCAGCGTACCGGCCGGGTTCAGCGCCAGGAACTCCTTGCGCCGCGTCCACGGCTTTTCCTCGATCAGCGCCAGCTCCTCGCCATACTCGCCAAAGGCGAGGCGGACGAACCGGCAGGTGGCGAACATGGGATGATGGAAAAGCGTCAGCATGGTTCCGCGATGATAGGGCGCACTGGCGAATCGGTGTCAGCGCCGGTAAGTCTTGGCGGCCCATCATGCGGCCGTCACGGTGTTGCGACCTATAGGGGGACTTCCGCGCCGTGACAAGCAAGCCGTTGTCCCAGCTATCCCCGAAATCCTGAGGTTGCCATGGAAAGCCAGACCATCGTCGAAGCGCTGTTGCTTGGGCTTTTGGAGGGCCTGACCGAGTTCATCCCGGTGTCCTCGACCGGCCACATCCTGCTTGCCGGCCATTTCCTTGGCTTCCACTCCACCGGCAAGGCCTTCGAGATCCTGATCCAGCTCGGCGCGATCCTGGCCATTTTGAGTGTCTATTTCGGCAAGCTCTGGCAGATGCTGATCAAGCTGCCCAGCGACCCGCAGACTAGGCATTTCGTCATCGGCATCCTGATCGCCTTCCTGCCGGCCGCGGTCATCGGCGCCGTAGGTCACGACTTCATCAAAAACTATCTGTTCGAATCGCCAAAACTGATCTGCAGCATGCTGATCATTGGCGGCGTGGTGCTGCTCGTCGTCGACCGCATCAACTTCAAGCCGGTGCATCACGACGTCGAGCGCTTCCCGTTGAGCGTCTATCTCAAGATCGGCCTGTTCCAGTGCCTGTCGCTGATCCCGGGCACCTCGCGTTCCGGCTCGACCATCGTCGGCGCGCTGCTGATGGGCGTCGACAAGCGCGCGGCGGCGGAATTCTCCTTCTTCCTCGCCATGCCTACCATGGTCGGCGCCTTCGCCTTCGACCTCTTCAAGAACCGCAACGTGCTGACTTCGGCCGACCTGCCGATCATTTCCGTCGGCTTCATCGCCGCCTTCGTCGCCGCGCTGATCGTCGTCCGCTTCCTGCTCGATTACGTCTCGCGCAAAGGCTACGCGCTGTTCGGCTGGTGGCGGCTTGCGGTCGGGGTAGCGGGGCTGGCGGCGCTGATGGTTTGGGGGTGAGAACCCGCCCGCAGGATATGGCAACCACACACCGCGCTCCCCCTCACCCAGCCTCCGCTGACCTCTCCCCGAGGGGAGAGGAGACTGTCGGCGTCGGCGGAGAAGGTGGCCGCGAAGCGGCCGGATGAGGGGGCCAGCGCTGCCATATGCGATTGTCTACCCATTATAGCGGGATAAGCCCGCTCAATTCCTCCCATAAAACGCGTTCTCGACATGCACCGGCCAACGCCAGGGCAGCACGGCCACCATGTCGAAACGCATCGAAAGCCGCCCGTAATCCGGCTGCCGCGACAGCCACAGATCTGCGGCGCCCTCGATGCGGCGCTCCGATTCATGGCCGATCGCTTCCATCGCCTCGATCAGCGTGCGGCGCGCCTTGACCTCGACGAACAGCACGAGATCGCCGCGCCGCGCGATCAGGTCGATCTCGCCGAGCCTTGTGCGATGGCGGCGGGCAAGGATGCGGTAGCCTTTCAGCATCAGCGCCAGCGCCGCCAGCCATTCGCCGCGATGACCGCGCCGATAGGCCTTGCGGCGATGGCCGACCGTGCGCTCAGCCACCGTCGGCTCCTTTTTGCTCGACCATGATCCTATCCAAAACCGGCTTCCACTTTTTGGGATCATGGTCCGTCCTTGAGTTCCAAGAGCCGCCGGTAGAGCGCCTGTTTCTGGACGCCGGTCATCTTCGCCGCTTCCGATGCCGCCTTCGAAGCCGGCATTTCGGCCGCCAGCGACAACAGGAGCCGGTCGATATCCGCCGGCTGCTCTTCCGCTGCTTCGGGCGGGCCGACGCAAATGACGATCTCGCCCTTCGGCGTATCGGCGGCGGCATAGTGGCCGGCCAGCTCGGTCAGCGTGCCGGTGCGCATCTCCTCGAAAGCCTTGGTCAGCTCGCGCCCGATCGCGGCCTTCCGCGTGCCCCCCAGTGCCTCGACCATGGCGGCAAGCGTCTCGGCCAGCCGTCTGGGCGATTCGAAGAAGATCAGCGTCGCCGGCACCGCCTTGAAGGTTTCGAGCTTGGTCACTCGTTGCCCTGCCTTCACCGGCAGGAAGCCGGCGAACAGGAAAGCATCGGATGGCAGGCCGGAAGCCGTCAGCGCCGCAAGCGCTGCCGACGGTCCAGGAATCGGCACGACGCGGATGCCCTGCTCCAGCGCCTCGCCGACTAGCCGGTAACCGGGATCGGAAACCAAGGGCGTGCCGGCGTCGGAGATCAGCGCCACGCTCTGCCCGGCCGCCAGCGCCTCGATCAGCTTCGGCCCGGCCTCTTGCGCATTGTGCTCGTGATAGGCGGTGGTGCGGCGGCGGATGCCGTAGCGTTCGAGCAGCACGCGCGAGACGCGCGTGTCCTCGCAGGCGACGATGTCGGCTGCGGCCAGCGTCTCCAGCGCGCGCAGCGTGATATCAGCCAGATTGCCGATCGGCGTCGCCACCAGATAGAGCGCCGGCTGGAGCGCCCGCGCCGCGATCTCGGTCTGTCCGATCAGGTAGCTGCTTCTGTCGCCGGTCAAGGATAACCCTCCGTTCACCCTGTTTGGCATGGCTGACGGCCTGTTGCAAAACGTGAGGTCGGGTTACGAAATCGCCATGCCTTTGCCACAGTGCGGAACGAAACCCACAGCGGCGAATTTAACCCCTGATTCCCCGGGAGGAGGCAGGACCGAACGATGCCCAATCGCTTCGACATCTTCATCAGCCGACTTGAAAGCAAGACCGCCCGCGAGGGCATCCCTTCACACCCCATCGCCGCCCAGAGCGGCGTCCGCCGCGACAAGAGCGAGCAGAAGCAGGCCCGCCCCGATGACGCGCATGGCGAGAAGCACCGCGGCAAGCACCGCCAGAAGCACGACGCTTAAGAGCGTCCCCGACCTTCGCGAGTGGCGAAGACCTCTCTCGCTTCTTCATTCCAGGGCGGAGCGAGGAGCGAAGCGATGCGCACAGACCCTGCAATCCATGCCGTTACATCGAAGTGCCACTAACGGTGCAGAATTCTGGACCGCAACACTCTTCGCATAGGTCACGGCATGGCAACTGTGTTCACGCGAATGCGGTTTTGTCGCGCATGATGGCGTTGAGGACGACCAGGAGCTTCCTTGCGACGGCGATGATGGCGAGTTTCTTGGACCCTGAGCGGGCAGCAAGTGCGGTATAGAAGGTCTTGAAGCGCTCACAGGTTCTGATGGCGGTGAGGGCAGCCATATAGAGGGCGCGACGCACGCGTGATCGCCCGCCCTGGATCTGGCTCCTGCGGCTGAGCTTGCCGCTCTTGTTGTCGAACGGGGCAAGGCCGGCGAGCGCGGCGATCGCCTTGGGTGAGCGCTGGCCGAGCTCGGGCAGGAGCGCCAGCAAAGAGAGCGCGCCGACATTGGAGACACCGGGCACGGAGACCATCAGCGCATGGTCGCGAGCGGTGTCCTGGTTTTGACGAATGACTTCGGCAATCTGGCTCTCGAACGCCTTGATACGCGTGTCGAGGAAGGCGATCGTCTCTTCCAGATCGGCAATGACGGCCTCCTCGAAAGCCTCGGCGAGGTGCTTCTTCAGCGCTGCCCGCATCTCGACCAAATGATCGCGGTGGCCGGCAAGCGATTGAAGCCGCTCGACTTCTTCGCTCGGCGCCGGCTCGGCCTCAGGGTTATAGCGACGGCCATAGTCGCTCAGCATCCTGGCATCGAGAGGATCGGTCTTGGCGCGCCGGCGCGTCGACTTCGAATAGTGGTGGGTGTGCGCCGGGTTGTGCCGCGAGAACGGCACGCCGGCCTTGGCCAGCGCATGGCGCAGCAGCCGGTCATGGACACCTGTCGCCTCCATGACGACGAAGTCCCGACTGGGATCAAGGCCTGCCACATAGGCGGCGATCGCATCGGCCTGGTTGTCGATGCGCCGAAGACGGCCGCTTGTCTCATCAAAAAAGTCGAGCCACTGCTTGGAGATGTCGCAGCCGACATAGTTCTGGGCTATCAT
Protein sequences of DBSCAN-SWA_2 >CP034445|3154863:3212569|3194583_3195825_+|AZO04253.1|DBSCAN-SWA MRVIVLGGGVVGVTTAYQLQKDGHEVVILERQQQVAAETSWGNAGMIAPGHSFVWSSPRAPMILLKSLVLKDQALRFRLSADPRLYSWSWLFLTECTAEKARRNTLLKHRLAVYSQSVLQEVVADEAIDYDRNDRGILYFYRSQQALDKGVEHMRLLESDGQLIKVLDRDAIVALDPSLASAREKIAGGIHCPTDETGDPAKFTRALAAKVVGRGGEIRTGTTITGIETSGDGVAQVMTDKGAVKGDAYVLALGSYSPLIARTIGLSLPIYPVKGYSLTIPIGNRPAPPTIAAIDEHNLVAVSRFGDRLRVTATAEFAGYDTSHKPADFAFMKGVTEELYPEGADYDRAEMWAGLRPMTPNNLPEFGQRRLRNLYLNTGHGHIGWTMSHGSARITADLIAGRKPAISMDGLLN >CP034445|3154863:3212569|3158333_3160349_+|AZO04217.1|DBSCAN-SWA MTLPTRNLLAEEASPYLRQHSDNPVHWRGWSRAALAEAKELGRPILLSIGYAACHWCHVMAHESFENDAVAAVMNRLYVNIKVDREERPDIDQIYMAALHAMGEQGGWPLTMFLTPDGKPFWGGTYFPREARYGRPGFIQVLEAVDKAWREKQQSLAESADGLAVHVESRLAGTKGKAVLDRDTLSDLSGRIDGMIDRDLGGLKGAPKFPNAPFMHTLWLSWLRDGQVDHRDAVLTSLEKMLAGGIYDHVGGGLSRYSTDAEWLVPHFEKMLYDNAQLIRLCNWAYAATGKGLFRLRIEETVAWLLREMRVEGGAFAASLDADSDGEEGLFYTWSRDEIEAALGDDAPTFFQYFELAAPHGWEGKPIVRQSEMQQSKGIADYANLAPLKAKLLAFREQRVRPGRDGKVLTDWNGLMIAALAEAGRSLGRRDWIDAAAKTFTHIVEASLDGRLPHSMLGAKTLFPALSSDYAAMTNAAIALFEATGDPAYSDHARLFIGELDRWHLDAEKTGYWLTASDSGDVPIRIRGDVDEAIPSATSQIVEALVRLSSLTGDLELGEKAWAAAEHAMGRASQQAYGQAGIVNACALALEPLKLVIIDDPGSSSLVPVANRNPDPRRVDIVVPIGTETNRPLLPGGVLPPTDKPSAWFCSGQLCLPAVTDAQELEKLLRR >CP034445|3154863:3212569|3187751_3188678_+|AZO04248.1|transposase|DBSCAN-SWA MIAQNYVGCDISKQWLDFFDETSGRLRRIDNQADAIAAYVAGLDPSRDFVVMEATGVHDRLLRHALAKAGVPFSRHNPAHTHHYSKSTRRRAKTDPLDARMLSDYGRRYNPEAEPAPSEEVERLQSLAGHRDHLVEMRAALKKHLAEAFEEAVIADLEETIAFLDTRIKAFESQIAEVIRQNQDTARDHALMVSVPGVSNVGALSLLALLPELGQRSPKAIAALAGLAPFDNKSGKLSRRSQIQGGRSRVRRALYMAALTAIRTCERFKTFYTALAARSGSKKLAIIAVARKLLVVLNAIMRDKTAFA >CP034445|3154863:3212569|3183037_3184234_-|AZO04244.1|transposase|DBSCAN-SWA MSAPRRLVEAALPTSSFYGGARSEATRADPGILCVGSGVWLRWEGDRVDRGRPFGQAVAWPGPFPPVNHQPEQLHGAVRSRPRTTGRGVDMIAQNYVGCDISKQWLDFFDETSGRLRRIDNQADAIAAYVAGLDPSRDFVVMEATGVHDRLLRHALAKAGVPFSRHNPAHTHHYSKSTRRRAKTDPLDARMLSDYGRRYNPEAEPAPSEEVERLQSLAGHRDHLVEMRAALKKHLAEAFEEAVIADLEETIAFLDTRIKAFESQIAEVIRQNQDTARDHALMVSVPGVSNVGALSLLALLPELGQRSPKAIAALAGLAPFDNKSGKLSRRSQIQGGRSRVRRALYMAALTAIRTCERFKTFYTALAARSGSKKLAIIAVARKLLVVLNAIMRDKTAFA >CP034445|3154863:3212569|3171452_3172556_+|AZO04231.1|DBSCAN-SWA MPLSPILSEKSFDDLPGWGEDDHAAAFAAFRRSALHAPVKPYRTGSLGVDFNAFAEAYAEARAVSAPNRSEARSFFERHFVPMLVSAENGSRLVTGFYEPEVEASPVRTERFAVPLLSRPADLIDIDDGNRPAGMDPYLAFARRTDNAPVEYFDRGEIERGALLGQNLEIAWLAEKVDAFFIHVQGAARLKMTDGRLARVTYAAKSGQRFIGPGKILSELGEIPLEKVTMQSIRAWFKAHPSRVDEILWQNRSYIFFREAAVDDAALGPIAAAKVPLTPGRSVAVDRLLHTFGTPFYIDAPTLTAFDEKPFRRLMIAQDTGSAITGPARGDLFAGSGDAAGEIAGVVRNAADFYALVPRGLLNGATR >CP034445|3154863:3212569|3166316_3166916_+|AZO04224.1|DBSCAN-SWA MSEKIILASGSPFRKTMLVNAGLDIEAVPANVDERALEAPLKDSGVSPEDVASILAEAKATEVSERRPGSLVLGCDQTLSLGDEVFHKPADMEGARRHLLALSGKTHQLNSAAVLCRDGEVLWRHVGIANLTMRKLDPAFIGRHLARVGAKALGSVGAYQVEGEGIQLFEKIEGDYFTIVGLPLLPVLKELRALGAIDG >CP034445|3154863:3212569|3207946_3208639_-|AZO04265.1|DBSCAN-SWA MLTLFHHPMFATCRFVRLAFGEYGEELALIEEKPWTRRKEFLALNPAGTLPILLAEGDVPIVGAMVIAEYLDETRGVLKRDKRLFAEDPMQRAEIRRLTDWYLAKAESEVTRHLVRERVLKPVMPETAGGGSPDSAAIRAARANIRQHMKYTNWLAGTRHWLAGNKVTYADLAAAATLSVLDYLGEIDWREHSAAREWYTRVKSRPSFRPLLSDRVRGLSPVSHYADLDF >CP034445|3154863:3212569|3190518_3190749_+|AZO07437.1|DBSCAN-SWA MFVEVGTLRRPLWPAGHLLHKWGDWLSRRLSPIADIAEEGAAAELLISPQVGEMSGRTEGGAVPPTAPAEAIERPP >CP034445|3154863:3212569|3176404_3177682_-|AZO04237.1|DBSCAN-SWA MFYQLYEMNHAALQPARLYADAVRLFYSNPLNPISHTPLGRSVAATAELFERTTRRYGKPQFGLDKTVVDWKSVDVTEKTVWSKPFCNLVRFERALPAGRKPDPKLLIVAPMSGHYATLLRGTVEAMLPHADVHITDWVDARMVPVAQGSFDLDDYIDYVIEMFHALGPDTHVMAVCQPSVPVLAAVALMDRRGDPFVPSTMTLMGGPVDTRRNPTAVNLLAKEKGIDWFRDNVIMQAPWPVPGFGRVVYPGFLQLSGFMSMNLDRHIIAHKDFFMHLVKHDGDSAEKHRDFYDEYLAVMDLTAEFYLQTVDTVFVRHALPKGEMTHRGETVDCAAIRNVALLTIEGENDDISGLGQTQAAHDLCVNIPADKHVHYVQPAVGHYGVFNGSRFRSEIVPRIADFISSYGRQQRVATRPKLVRSAKG >CP034445|3154863:3212569|3211272_3211461_+|AZO04269.1|DBSCAN-SWA MPNRFDIFISRLESKTAREGIPSHPIAAQSGVRRDKSEQKQARPDDAHGEKHRGKHRQKHDA >CP034445|3154863:3212569|3195866_3196673_+|AZO04254.1|DBSCAN-SWA MSVSAAATRPQSSGPVDLGVLPSEIDGGLASRAAIGLAILATDQTLEHEFRALVRIPGVAFYEARLFNDNDITPDTLRAIGPRIAPTVDLILPSIPLDVVGFGCTSATVTLGEEAVFAEIRKARPGVACTTPVTGALAAFKALGAKGIGLLTPYAPEINQGLVRYFTGRGLDIAAVATFDRRDDREAARISLASIEAAAERMTEVPGVDAIFISCTSLRVAEAVADLERRIGIPVTSSNHAMAWHCLRLAGIDDVVPAGGRLFALPAR >CP034445|3154863:3212569|3174367_3175294_+|AZO04235.1|transposase|DBSCAN-SWA MIAQNYVGCDISKQWLDFFDETSGRLRRIDNQADAIAAYVAGLDPSRDFVVMEATGVHDRLLRHALAKAGVPFSRHNPAHTHHYSKSTRRRAKTDPLDARMLSDYGRRYNPEAEPAPSEEVERLQSLAGHRDHLVEMRAALKKHLAEAFEEAVIADLEETIAFLDTRIKAFESQIAEVIRQNQDTARDHALMVSVPGVSNVGALSLLALLPELGQRSPKAIAALAGLAPFDNKSGKLSRRSQIQGGRSRVRRALYMAALTAIRTCERFKTFYTALAARSGSKKLAIIAVARKLLVVLNAIMRDKTAFA >CP034445|3154863:3212569|3177965_3178328_-|AZO04238.1|DBSCAN-SWA MKRTNQTALVAVTIAAGLMAGASSASSAAAQCGRASWYALHSRTASGERMNPSALTAAHRTLPFGTKLRVINKNNGRSVVVRINDRGPFVKGRVLDLSRGAAGQLGFIGSGHTAVCMARV >CP034445|3154863:3212569|3154863_3156738_-|AZO04215.1|tRNA|DBSCAN-SWA MTDHYDVVVVGGGHAGCEAASAAARAGARTALVTLRFDTIGVMSCNPAIGGLGKGHLLREIDAMDGLMGRIADAAGIQFRLLNRRKGPAVRGPRTQADRKLYRLAMQEAIRQQNNLDVIEGEALDFEIMDGRIAAVQISDDRRLACGALVLTTGTFLRGLIHIGEKKIVAGRMNEQASHGLSATMARAGFQLGRLKTGTPPRLDGRTIDWASLESQAADEDPVPFSLMTDAIANPQIHCGITRTMPATHELIRANLGRSAMYSGSIEGVGPRYCPSIEDKIVKFGDREGHQIFLEPEGLDDDTVYPNGISTSLPEDVQLEILKTIPGLEKATMLQPGYAIEYDHVDPRELKTTLETKRVGGLYLAGQINGTTGYEEAGAQGLLAGINAARKAAGGDEFVLSRTEAYIGVMVDDLTSRGIAEPYRMFTSRAEFRLSLRADNADERLTPLAGKLGIASPERLRRFGGMIQRLDEARILAKSVAWTPNEAARHGLEINKDGVRRSAYELLAHPGVDMAWLTRVEPRFAALDAKTAERLETEAKYSVYLDRQQADVAQIRHEESRLIPEEIDFSDVPGLSNELKQKMKTRQPRSIADAQRMEGMTPAALAIIVAHVRNAELGARRSVA >CP034445|3154863:3212569|3163675_3164704_-|AZO04222.1|DBSCAN-SWA MAGKRIVLDVLKGETVSPPPLWMMRQAGRYLPEYRETRRRAGSFLDLCYDPDLAVEVTLQPIERFGFDASILFSDILVVPNALGRDVRFEEGRGPVLKPISAAEIAALSGEMFHVNLEPVYETVRRLRAKLPDETTLLGFCGAPWTVATYMIAGHGTPDQGPARLFAYREPEAFARLLKTLADHSAAYLIRQIEAGADAVQIFDSWSGVLDEPSFEAFCVEPVAEIVRQVKAVHPDVPVIGFPKGAGERYRDYRKKTGIAGLGLDWTVPLSMAKELQREGAVQGNLDPLRLVAGGKALADGVDAILKTLGGGPLIFNLGHGITPETPVAHVEAMVKMVRSHR >CP034445|3154863:3212569|3190917_3192351_+|AZO04250.1|DBSCAN-SWA MSEQTKAPGGADTNLSRLRPPYASVLDLIGQTPIVELTKFDIGKCRLFIKLESQNPGGSIKDRIALSMIAAAEKSGALKRGGTIVEATAGNTGLGLAQVGIPKGYRIILVVPDKMSREKIQHLRALGAEVRMTRSDVGKGHPEYYQDMAEKIASELPGAFYANQFANPANPLAHETTTGPEIFWQLDGDVDAVVVGVGSGGTLTGLGRFFAKHSPKTEMVLADPVGSVLAPLIKTGKMEEAGSWTVEGIGEDFVPPNADLSLVKKAYSIPDKQSMLAVRDLLSKEGILAGSSSGTLLSAALRYCREQTAPKRVVTFVCDSGNKYLSKVFDDFWLAEQGLAEQEQHGDLRDLVMRSHRTGDTVWVGPEESLLNAYGRMRRSDVSQLPVLDQGRLVGIVDESDILAKVDGPYDGRWDRFNAPVRTAMTSNLHTLQANQTLDALLPVFDRNEVAIVFDGEEFIGLITRIDLINHLRRRAK >CP034445|3154863:3212569|3193732_3194455_+|AZO04252.1|DBSCAN-SWA MTQGRGSGEKSAATRAVRAADSVERVYARIKDFAIDYRFRPGERINEVELAAELAVSRTPVRAALNRLERDGFVTSVPNKGFFARELTPEAVRDLYELRAAIERAAFVLACERASDAEIEAAAGTWTRHSSPEDEGSWAKIAVADESFHMGLTRLSKNAQMISALEGLCSRIRFFRCIDLESSSRRERTYREHDAIIRALRRRDAAGGASLLEKHITLSSAHAIEVAARGLARLFPETAA >CP034445|3154863:3212569|3172552_3173077_+|AZO04232.1|DBSCAN-SWA MSRRDDHLSDDDRILWNLVARSARPLKRKAAVEIPEIIEPKPTPAAASVNGVPAVAAKPKTPHVSHSLDDQTLHKLKKGRLPIEGRVDLHGMTQDEAYSLLLAFLHRAHAGGIRYVLIITGKGSSSGGDGILRRAVPAWLSTPAFRHLVSSHDHAARNHGGSGALYVRLRRTRP >CP034445|3154863:3212569|3166908_3167736_+|AZO04225.1|DBSCAN-SWA MAEKKAFVTGHPIAHSRSPKIHGYWLNKYGIDGSYQAIDVAPADFTDFLKSLGENGYRGGNVTIPHKEAAFAGVARRDHAADEIGAVNTLWFEDGVLWGGNTDGYGFAANLDDHAPGWADNGPAVVLGAGGASRAVIHALKERGIKDIRIVNRTLARAEELSRHFGPGVSAHGAVGELLADACLLINTTALGMHGNETLVADPAGLPDHAIVTDIVYVPLETPLLAAARARRLKTIDGLGMLLHQAVPGFERWFGRKPEVTSELRSMIVADIEGH >CP034445|3154863:3212569|3200822_3201644_-|AZO04259.1|DBSCAN-SWA MSEAGDMGLVVVGAAGRMGQTLIRAIHTMPGARVAAAIERPDSPHLGKDAGELAGIGIINVPIVDDPLPAFARADGVLDFTAPAASVEFAGYAAQARIVHVIGTTGCSADDDAKIAAAARHATIVKSGNMSLGVNLLAVLVEQAAKALDPEDFDIEILEMHHRHKVDAPSGTALLLGEAAAKGRGIDLADNSVRVRDGHTGVRKAGSIGFATLRGGSVVGDHSVILAGTGERITLSHHAEDRAIFARGAVKAALWARGKKPGLYSMRDVLGLS >CP034445|3154863:3212569|3170722_3171430_+|AZO04230.1|DBSCAN-SWA MGFFDFGTIFFLIAAVVIFFQLRNVLGRRTGNERPPFDPYSASRTREADAAQKPENVVSLPRKRAPGESSAETYAAIDAFAKPDTDLNKGLRTIKDNDPSFEPKTFVDGAKMAYEMIVMAYADGDRKTLKNLLSREVYDGFVAAIGEREAKSEKIQSSFVGIDKADIVAAEMKGSEAHITLRVVSELISATRDKAGAVIDGDPETVAEVKDVWTFARDTRSRDPNWKLVATEEED >CP034445|3154863:3212569|3169404_3169905_-|AZO04228.1|DBSCAN-SWA MASNDDAATGAANGNGTQNQPSLNVLAQYVKDLSFESPGAPNSLRGRDKAPGIAINVNVNANPLSDKQFDVNLTLNAKASFDQEVLFNVELVYGGVFAISGFPQEHMLPILFIECPRLLFPFARQIISDATRNGGFPPLMLDPIDFAQMFQQRIAEDQAASKVQVS >CP034445|3154863:3212569|3211642_3212569_-|AZO04270.1|transposase|DBSCAN-SWA MIAQNYVGCDISKQWLDFFDETSGRLRRIDNQADAIAAYVAGLDPSRDFVVMEATGVHDRLLRHALAKAGVPFSRHNPAHTHHYSKSTRRRAKTDPLDARMLSDYGRRYNPEAEPAPSEEVERLQSLAGHRDHLVEMRAALKKHLAEAFEEAVIADLEETIAFLDTRIKAFESQIAEVIRQNQDTARDHALMVSVPGVSNVGALSLLALLPELGQRSPKAIAALAGLAPFDNKSGKLSRRSQIQGGRSRVRRALYMAALTAIRTCERFKTFYTALAARSGSKKLAIIAVARKLLVVLNAIMRDKTAFA >CP034445|3154863:3212569|3208798_3209605_+|AZO04266.1|DBSCAN-SWA MESQTIVEALLLGLLEGLTEFIPVSSTGHILLAGHFLGFHSTGKAFEILIQLGAILAILSVYFGKLWQMLIKLPSDPQTRHFVIGILIAFLPAAVIGAVGHDFIKNYLFESPKLICSMLIIGGVVLLVVDRINFKPVHHDVERFPLSVYLKIGLFQCLSLIPGTSRSGSTIVGALLMGVDKRAAAEFSFFLAMPTMVGAFAFDLFKNRNVLTSADLPIISVGFIAAFVAALIVVRFLLDYVSRKGYALFGWWRLAVGVAGLAALMVWG >CP034445|3154863:3212569|3180456_3181614_-|AZO04242.1|DBSCAN-SWA MTASASPLDFFWFIPTHGDGSYLGTEKQQRPPEFGYFKEIAQAVDRLGFPGVLLPTGQNCEDSWITATGLATLTERLKFLVALRPGVTLPTFAARQTAALDRLSNGRLLLNVVVGGNPTELAGDGVFLPHDERYAQAQEFLTIWRGLVSGERVNFEGKHYRVENGRLDLLPLQERPPLYFGGSSEAGQELAADLVDMYLTWGEPPAQVAAKIASAREKAARRGRKLRFGIRLHFIVRETEDEAWRAAERLISHVTDAQIENAQARFLKEMDSVGQRRMAELHGGRRDRLVVSPNLWAGVGLVRGGAGTALVGTPEQIAERIGEYQAIGIDTIIGSGYPHLEEAYRVAELLFPRLGLGTQRPRAHKDIANEFSVGFHGANRLQAAS >CP034445|3154863:3212569|3192347_3193517_+|AZO04251.1|DBSCAN-SWA MTSNGKNRLAFSTRTIHGGQSHDPTTGAVMVPIYATSTYGQQSPGVHKGFEYARSQNPTRFAFERAVADLESGTKAFAFASGLAAISTVLELLDSGAHIVATDDIYGGSFRLMERVRKRSAGLQVSFADFTDLAAVEAAIRPDTKLLWVETPTNPLLRIVDLEALAALAKRKGLLTVADNTFCSPYIQRPLELGIDIVVHSTTKYLNGHSDMVGGVAVVGDNKELADQLKFLQNAIGAISGPFDSFLALRGIKTLALRMERHSANGLKIAQWLETRKDVRRVIYPGLASHPQHSIAVQQMHAFGGMISVDLDRDLAGTKRFLERTQLFTLAESLGGVESLIEHPALMTHGSIPAEKRGAIGITDSLVRLSCGIEDGDDLVADLEQALAH >CP034445|3154863:3212569|3185342_3186116_-|AZO04246.1|DBSCAN-SWA MTHLTLDKISVHYDGQLVPAVERVSLDIAKGDFVVLVGRSGCGKTSLLNVAAGLVEPAGGLATVGGKPITRPGSDRAVVFQNDALFPWLTARENVAFALRLRGVPADSRARRADELLSLVKLADAGDKRIWELSGGMRQRVGLARALAAEPEFLLLDEPLGALDALTRERMQTTLLDLWAAAQVGVLMVTHGIEEALVLATRIVVLAPGPGHVVRTFETNFGRRYAAGEPIRAIKADAGFAAARADLTDSIFEGEAA >CP034445|3154863:3212569|3179680_3180460_-|AZO04241.1|DBSCAN-SWA MSLATRLARNGVAGWLLPAAIIAGWEAAARAGLIPANVLPAPSAVAGAFWRLTLSGELIRNIGVSTLRALSGFAIGGSIGFALGLANGLSALSRGLTDTTLQMIRNIPHLALIPLVILWFGIDEEAKLFLVALGVFFPIYVNTLLGIQSVDPQLVEMGRVYGLDRRALFFRVILPGALPSIFVGLRYALGIMWLTLIVAETISASSGLGYMAMQAREFLLIDVVVLSILIYALLGKLADSLARLLERLSLGWHPAFQKG >CP034445|3154863:3212569|3173130_3173499_+|AZO04233.1|DBSCAN-SWA MTPFGEKLRALRAERGVSQKAMAEAIGVSAAYLSALEHGRRGAPTWTLIQKIIGYFNIIWDDAEDLARLAESSHPRVRIDTSGLSPAATELANLLAESIERLDEAELRRLSTSIRAALARRS >CP034445|3154863:3212569|3161080_3161524_+|AZO04219.1|transposase|DBSCAN-SWA MFGYKVHNGVDDAHTLIRRMDFTDASVTDTEPADGLIIGDEKAAYGDQAYYTHARHARLKQAGIKDRLMHRANKHHPLTPRQKQRNRLISKVRAAVERPFAVFKQRYGMRRLRFFNLATNRTQCMLAGCGYNLQRAAAVLFPKRKPA >CP034445|3154863:3212569|3181687_3182656_-|AZO04243.1|DBSCAN-SWA MFNLTRRAFGVGLLTAAAFAASFGTTSPAAAQDLKAFNIGYQKTGLPVIARQQQIIEKALADKGVTVKWVEFTAGPPLVEALNVGAIDVGWTGDAPPIFGQSAGANIVYAAALPSNGEGEAIFVKPASPIQSLADLKGKRIGVGKGTSAHNLVVAALEKAGIPFDQVTPVYLSPADAAAAFASDQIDAWAVWDPFFAIAETRYQPRVLARSIDVLQVNTYFLANKDFAKAHPDFVSTTIAALGDAAKWADQNRDKVAEALHEVTGVPLDAQTIAAKRTKFGIYPITAEIVAGQQATADRFFKLGLIPKAVRISDAVWTAPGN >CP034445|3154863:3212569|3175585_3176203_-|AZO04236.1|DBSCAN-SWA MTSSAMTRILRLCSVAGLAISAHWAVPAWAAGAMTTGGLTSQPIGHYDFCKARPNECNIHPADLEPARMTDVLWRKLVSVSAKVNAAVKPLSDLDHYGKDEVWAYPDDGYGDCEDYVLEKRRQLYRMGMSLADLLITVVRKPDGEGHSVLTVRTDKGDYVLDNLTDKVRAWDATGYRFLKRQAIDNTGRWVSIRDGQAVLVGSVQ >CP034445|3154863:3212569|3165199_3166021_+|AZO04223.1|DBSCAN-SWA MNKPQSFFHLHLISDATGETLLAAGRAASAQYKDARAIEHIYPLIRTEKQVAKVFDDIEEEPGIILYTVVDQKLARSIDERCAAMGLPCVSVLEPVLAVFQSYLGTPAGRRVGAQHVLDAEYFRRIDALNFTMEHDDGQLPANMDDADIVLIGISRTSKTPTSIYLANRGIKTANIPIVLGVPLPESLIAAKTPLIVGLIATAERISHVRQNRILGNSAAFVPTDYVDRAAINEELAYARQLCTRHGWPMIDVSRRSIEETAAAIVALRGKTR >CP034445|3154863:3212569|3210225_3211146_-|AZO04268.1|DBSCAN-SWA MPNRVNGGLSLTGDRSSYLIGQTEIAARALQPALYLVATPIGNLADITLRALETLAAADIVACEDTRVSRVLLERYGIRRRTTAYHEHNAQEAGPKLIEALAAGQSVALISDAGTPLVSDPGYRLVGEALEQGIRVVPIPGPSAALAALTASGLPSDAFLFAGFLPVKAGQRVTKLETFKAVPATLIFFESPRRLAETLAAMVEALGGTRKAAIGRELTKAFEEMRTGTLTELAGHYAAADTPKGEIVICVGPPEAAEEQPADIDRLLLSLAAEMPASKAASEAAKMTGVQKQALYRRLLELKDGP >CP034445|3154863:3212569|3161698_3162964_-|AZO04220.1|DBSCAN-SWA MQEMKLAEFKNKKPPELIAYAETLEVENASVMRKQELMFAILKKLAAQDIEIIGDGVVEVLQDGFGFLRSANANYLPGPDDIYISPSQIRRFSLKTGDTVEGPIRSPKEGERYFALLKVNTINFDDPEKIRHKIHFDNLTPLYPTSRLKMEVDTPPTKDISPRVIDLVAPIGKGQRALINAQPRTGKTVLLQNIAHSITANHPECYLIVLLIDERPEEVTDMQRSVKGEVISSTFDEPAARHVQVAEMVIEKAKRLVEHGRDVVILLDSITRLGRAYNTVVPSSGKVLTGGVDANALQRPKRFFGAARNIEEGGSLTIIATALIDTGSRMDEVIFEEFKGTGNCEIQLDRKVADKRIYPAIDILKSGTRKEDLLVPRSDLQKIFVLRRILAPMGTTDAIEFLIDKLKQTKTNADFFDSMNT >CP034445|3154863:3212569|3167735_3168341_+|AZO04226.1|DBSCAN-SWA MIVLGLTGSIGMGKSATAKMFAEAGVPVHDSDETVHRLYAGKAAPLVEAAFPGTTEAGVVDRVKLASQVLGDPATLKKLESIIHPLVRADADAFLARHRAVGAPLAVLDIPLLFETGGRNRVDKVVVVTASPEIQRERVLARPGMSEEKFLSILAKQVPDAEKRRQADFIIDTGNGFEAARRAVEAVIGELTGDKSGRDGS >CP034445|3154863:3212569|3186279_3187296_-|AZO04247.1|DBSCAN-SWA MSILSRRILLLSSAAALAGAALLGPASAEDLKITIGYQTVVEPSKVPQADGAYEKATKAAIDWRKFDSGADVIAAIASGSVDIGYVGSSPLAAAASRELPIQTIFVVGLIGPSEALVARNGSGIEKVADLAGKKVAVPFVSTTHYSLLAALKHENVDPKSVQILNLRPPEIAAAFARGDIDAAYVWDPALGQVKTTGKVVLDSSQVAAWGAPTFDAWIVRTDFAEKHPEAVRDFVKVTGEAYAQFLAKPEAWSVSSPEAGKIAKLTGAKLEEVPELLKGYVFPSLEEQASDKFLGGATVKAVAATSAFLKEQGKVDAVLPDYSKYVTAKYASEALASN >CP034445|3154863:3212569|3168415_3169120_+|AZO04227.1|DBSCAN-SWA MREIIFDTETTGLDLREDRIIELGGVELVNRFPTGRTFHKFINPQGRAIHAEAQAVHGISAADLVGKPTFAEIVDEWLAFTDGAKLIAHNATFDLGFLNLEYSRLDHPAIDPGRIIDTLALARRKHPMGPNSLDALCRRYGIDNTRRTKHGALLDSELLAEVYIELIGGKQAALVLEAVSVQMNGAGEVADIDISVGARPIALPPRLSEADRAAHAGLVSTLGEKALWLKVAVG >CP034445|3154863:3212569|3178637_3178823_+|AZO04239.1|DBSCAN-SWA MRLIRSVLALIVLGVGALWSLQGLGLVGGSFMTGQTRWLYIGLVTMLAGIVGLRWASQSRV >CP034445|3154863:3212569|3201640_3203512_-|AZO04260.1|DBSCAN-SWA MTVQTSLKLKVQPTEVSAVLRRILAENGKEYRWTYVVAVACLLVVSGTTAFTAWIMAPMINQIFYERRSDMIVWICAGFMAASVLRGFAGYGQAVALARIGNNLVARYQKRSFDHLMKLGVDFFNETRSGRLAAQVNENVGGIRDLLSLTLTSISRDAVSLVALVGVMIYQDPVLSLSSLLIGPPLIWAVVYITRRLRKINRESVLINSRLNGSVQEATQGIAIVKAFTLEDELARRIGIMADTAEQRNNKIARVAERLSPISEILGGLAVTAVLAYSGYRALVLGQPPGAVFSFITALILAYDPARRLARTQVGMERALVNARMIYELLDLEPKQGDAPDAVEAKVTTGEVRFNNVTFGYNADTPVLRDLSFTAAAGKVTAIVGASGAGKSTLVALLQRFYNVDKGSIEVDGQDIAKVTKHSLRQSIAYVAQAPYLFEGTIRDNIRFGRLSATDAEIEQAAKLAAADEFIRQQPQGYDTPVGEGGSTLSGGQRQRVSIARAIVRQAPILLLDEATSALDNEAEARVQEALTHVMEGRTTIVIAHRLSTVVNADHIIVLEEGRLVEEGTHATLMADPHGIYARFHRIQGKKGLGLVDDAKASQTSGQTPAPRSRARKTVRRSA >CP034445|3154863:3212569|3206814_3207966_-|AZO04264.1|tRNA|DBSCAN-SWA MRTSTSETLRALIEAEARRAGFEAVAVTRPDAIPLAPARLAEFVADGFHGSMGWIAETLERRAEPSTLWPEVRSIVVLAMNYGPDQDPRDILARPDRGAISVYAQNRDYHDVMKGRLKEIAGKIVAKAGGDVKVFVDTAPVMEKPLAEAAGLGWQGKHTNLVSRAHGSWLFLGTIFTAAELEPDKPEIDHCGSCRACLDACPTDAFPAPYRLDARRCISYLTIENKGPIPREFRDKIGNRIYGCDDCLAACPWNKFARAASEAKLAAREDLREPKLSDLLALDDEAFRAFFSGSPIKRIGRDRFVRNVLIAAGNSGDPLLAPIVRGLLGDASPLVRGAAIWGLSRLLPDREFGELAAMASRTETDAAVREEWLAGLASVEVHR >CP034445|3154863:3212569|3178869_3179673_-|AZO04240.1|DBSCAN-SWA MPAATLTAARSLVATPAQARTIGSVEGRRAFAFKEVGKRFGDKIVLDGISLDVRGGQFLAIIGKSGCGKSTLLRLLAGLDRPSSGSLTFGAAEEGHSRTRFMFQEPRLLPWASVVKNVEIGLTGIAGGTEARQRALAMLEEVGLADRADEWPSVLSGGQKQRVALARALVGQPQILALDEPLGALDALTRIEMQQLLERIWLDQKFTAVLVTHDVGEAVALADRVVVIGAGRIALDLDVPVARPRRHGSAELAHIEGTILDHLFNRA >CP034445|3154863:3212569|3160517_3161129_+|AZO04218.1|transposase|DBSCAN-SWA MIPYVRHEGEMIRMVDRVLGQLSLADGLVWSDVTELDQISRVIDWVPIKALLGQRSGPGRGNISYPAEALLRCLLLGVWNNLSDPSLEAQLRDRLSFRRFAGFSLSDPTPDHSTLWRFREELKCDGLIDRVFYEITRQLEQKGLIVKRGTLIDASFMQAAARPPAKPEATARPSVDGEHVGDARATRPCSAIRCTTASTMLIR >CP034445|3154863:3212569|3184277_3184463_-|AZO07436.1|DBSCAN-SWA MTHSTAVLFAHVGTWLLRLPPSDKQQADEPNSPTPVLLAGDRDRLPPRDESFYWVWQYWSQ >CP034445|3154863:3212569|3189176_3190517_+|AZO04249.1|DBSCAN-SWA MVGRGNSIIIVGGGASGVVLAVHLLMSPNPDLRVTLIEKRPHFGQGMAYSTLLSAHVLNVKASGMSAYADDPTHFARWVLERGFAKPDQGPFYAPRSLYASYLRELLDDLMERERETGRLRLIAEESLSISPTSAGVEVVLANGTSVVGHLAVLATGHDEQPGAFQGHAIRMGTEADTALDPQAGVLLLGTGLSMVDAFLSLEQRGHRGPIVAVSRRGLLPSPHRKGNPIKLDIADIPLGTELSYFVGWFRNLIRETQKAGGDWRDVVDGLRPFNQTIWQNWPSSAKRRFVEHTKAWWDIHRHRMAPEVYARVTEAVGSGRIRLVAGRVVNVEANGSFTVKIQPRGTQDVETLQVARLYDCMGIARDISRTSNGVVRSLIERGVARPDPLRLGLDVTAKCELIAADGTVSSKLLAVGPLTRGTFFEIDAIPDIRVQCAKLSKQLLG >CP034445|3154863:3212569|3173791_3174040_-|AZO04234.1|DBSCAN-SWA MTSFLRNTLLAAVAISALTGSALADKTYKSCAATNLNDLWSGRCCSVGSANCLGGGNGGRDHQGHEGKGGRGSSNGAGAGKF >CP034445|3154863:3212569|3205915_3206818_-|AZO04263.1|DBSCAN-SWA MSERRFFIFGAGYSGKAFARANAQHAPISGTTRAPEKFEALRSAGIEPLQFDGALSPELGEALARTTHLIVSVAPDEAGDPVLKAAGDALREKMPALEWIGYLSTVGVYGDHGGAWVDETSDCRPVSKRSVMRVAAEQEWLALGREIGKPAAVLRLSGIYGPGRNALANLEEVTARRLVKPGQVFNRIHCDDIAGALWHLAGGNLGGIFNVTDDEPAPPQDVVAYAAGLMGVEPPPEIPFETAQLSPMARSFYGENKRVANKAIKAAGYRFRFPNYRVALERMWAEGNWRDGAPRSPMRR >CP034445|3154863:3212569|3209788_3210229_-|AZO04267.1|DBSCAN-SWA MIPKSGSRFWIGSWSSKKEPTVAERTVGHRRKAYRRGHRGEWLAALALMLKGYRILARRHRTRLGEIDLIARRGDLVLFVEVKARRTLIEAMEAIGHESERRIEGAADLWLSRQPDYGRLSMRFDMVAVLPWRWPVHVENAFYGRN >CP034445|3154863:3212569|3197243_3197426_+|AZO04255.1|DBSCAN-SWA MMNDSPEKRNEWRAIFYVQARIAILFVPVVLSLLLIGIFAGNERKSVPDAIDPTVTGSVR >CP034445|3154863:3212569|3200167_3200788_-|AZO04258.1|DBSCAN-SWA MSGTLVLVRHGQSEWNLKNLFTGWRDVGLTEQGTAEALAAGEKLKARGLKFDTAYTSALQRAQKTCQLILDVVGQSELKTIRDQALNERDYGDLSGLNKDDARKKWGEEQVHIWRRSYDIAPPGGESLKDTGARVWPYYLHEVQPHVLRGETVLVAAHGNSLRALIMALDGKSGEEIVKLELGTGVPVIYKLNADSTVAKKEVLEG >CP034445|3154863:3212569|3197783_3199454_-|AZO04257.1|DBSCAN-SWA MAPSVSPTIAARRDQMFPILSEADIERMRRFGEARAYAAGEHIVTAGKISPGVILILSGKVDITQAGGLGQPEPIVTHGAGNFIGELAQLSNRPSLVNAEAVEPVEAIVIPSQRLRDLMVQEANLGERVMRALILRRVGLLESGASGPVVIGPPGNGDVLRLQGFLRRSGLPHRALDSDTDPCAKTLIERFHVDPHHLPIVLCPNGKLMHNPGENELARCVGLLRPIDADKLYDVAIVGAGPAGLAAAVYAASEGLSTIVLDCRAFGGQAGASARIENYLGFPTGITGMALMARAYNQAQKFGVEMVIPDEAKLLDDAGDGARYRLAIGDGESVRSRAVVIASGARYRRLDIANLAQFEGTSVHYWASPIEARLCQSQEVALVGAGNSAGQAAVYLASQVRKVTLLARRDSLDATMSRYLVERIKAQPNIEVLTQTEVVALEGHDGNLERLRWRNRVTGEETERAIHHLFLFIGADPNTDWLANCNVALDGKGFVRTGPDTTPGHGLMETSRSGVFAIGDVRCGSTKRVAAAVGEGAQVVAALHAYLARNGAPANE >CP034445|3154863:3212569|3163142_3163679_-|AZO04221.1|DBSCAN-SWA MSNVSNENTTGQAMKRMVIGIVVLVAATALLYLVAGDGFYLWAKAIHVIAVIAWMAGMLYLPRLFVYHVDAEKGSVQSETFKVMERRLLRGIINPAMIVTWVFGLWLAWKGFGFQGGWLHTKIALVLVLSGLHGYLAGAVRKFAEDKNEKPARHWRIVNEIPTLLMIVIVMLVVVKPF >CP034445|3154863:3212569|3184485_3185346_-|AZO04245.1|DBSCAN-SWA MSVSTYSERPAAKAGETDGLSVPWSRPRPRRAVISARLVSAITILVLLAAWTAASSLQLVSPVFLPSPGAVWAKFVIVVRDGFVDATLLQHILASLWRVFAALIAAILVGVPVGLAIGISRIGRGVFDPLLEFLRPIPPLAYLPLIIIWFGIGEPSKILVIAIAMLAPVALSTAAGVRGVSRERVDAARSLGATRAQVIRHVILPSALPSILTGLRIALGAGWSTLVAAELVAATRGLGFMIQSAAQFLVTDVVVMGILVIAIIAFALEFVIRRIERVLVPWAGRE >CP034445|3154863:3212569|3156905_3158213_-|AZO04216.1|tRNA|DBSCAN-SWA MASKDSIVALSSGRLPAGIAVIRISGPKTRFVVETIAGPVQDRFTNLRTIRAADGSTIDHGLVLFFPGPGSFTGEDVAEFQVHGSRAVAAKMLERITGFEGVRHAEPGEFTRRAFLNGRLDLVETEALADLVNAETEAQRRFALRNAEGAQSELYSGWRRRLIHARAMIEAEIDFADEEDVPGSVSEAVWSDVTAMIGEIERHVEGFKAAEIIRDGFEVVILGAPNAGKSSLFNAFARREAAIVTDEPGTTRDLLEVVMDLNGLKVRLVDTAGLRDAAGKVESIGIERARAKADVADLVLLLEDMANPVPVGGVPAGAPLLRIGTKSDIADAGGGAYDLMISSRDGSGLACLLDEIGSRAAAAVGEAGDVLPSRMRHVELLQEAMDFLRAALSGQSQELRAEELRLAAERLGRIVGAVDVEDLLDVIFSQFCIGK >CP034445|3154863:3212569|3203642_3204662_-|AZO04261.1|DBSCAN-SWA MAGEDEKGLRFPVLIGDIGGTNARFSIVLDANSEATEPQIVQTANFKTIDDAIQAAVLDRSSIQPNSAVLAIAGPVDGDEIPLTNCPWVVKPRQMIANLGLTEVVVLNDFEAQALAVVALGEEHMEKVGGGTPEPNASRVVLGPGTGLGVAGLIYALDHWIPVPGEGGHMDIGPRTPRDFEVFPHIDKLEGRISGEQILCGRGLVNVYRAVAKADARHAPFTTPAEITAAALAKSDPIAEEALELFVTCLGRTAGDLALVFMSRGGVFLTGGIAQKIVPALKQGNFRAAFEDKAPHSALMREMPVYVITHPLAALLGLAAYARTPSLFGVNTAGRHWRA >CP034445|3154863:3212569|3170023_3170518_-|AZO04229.1|DBSCAN-SWA MRISFIPLFLLLLPLLEIAGFVVVGREIGALATVGLVILSSVAGSLLLRHQGFGVMTRVRAEMDAGHDPSRQLAHGAMIVLAAILLIIPGFITDIFAILLLLPPVRDFAWRLFKSRIVMATSFSSGFSARRRETVIDLDDSDYSRDDYPNRPDHNSPWRRLKND >CP034445|3154863:3212569|3197449_3197755_-|AZO04256.1|DBSCAN-SWA MDECRHTRDIKKVTPSALGCEECLKSGSWWVHLRLCRTCGHVGCCDDSPNRHATIHFRATQHPIIEGYDPPEGWAWCFVDEVFVDLEGDTTPQNGPIPKFV >CP034445|3154863:3212569|3204750_3205833_-|AZO04262.1|DBSCAN-SWA MNLRSLPDRKPFLTAALALVTLAALVAAAISAEPRAKDLFGTKKLPAVVPAQSFGFYSKGCFAGGVALPMEGPTWEVMRPSRNRRWGHPAMIALIEKLSRDAGADGWPGLLVGDVSQPRGGPMMTGHASHQIGLDADIWLTPMPKRPLSMAQRESMSATLMVDEKTHLVKDALWTPAHTRLLKRAASYPEVERILVNPGIKKELCDTVTGDRSWLRKIRPFWGHDYHFHMRIGCQPGSPGCKGQEATPADDGCGKPLAWWFTEEPWRPNKNPDAPKARDLMTMANLPKECQAVLAATDAPSLAAVTYQGGSATAVAVAEPQPTEPAETISSGNAAMPAAASAFAPTPKIGIPLPRPRPGN |
58 | Bacillus_virus(33.33%) | transposase,tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
3532281 : 3544784
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP034445|3532281:3544784|DBSCAN-SWA GTCAGCCGCTTGCCGATTTCCGCTTTGATCCCCGTGCTTTAGCGGCTGTCGCCTTGGTGGCTTGTACGCGTCCTGTACGGAGGGCGGCTTGTACCTTGCCGGTCGCATCCTCGGCGTGAGTGTAGGTTTTCTGCATCAGCGTCAAGTCTGCCCAGCGGCCGGCCTTGGCCACCGTGCGGGCGTCGAGCTTCTGCCTGACAAGCATTTCGGTGGCGAAGCCATGCCGGCCGGCGGCGTGGGGCATGATCTCGCTGATGCCGGCTTTCTTGCATGCGGTCTTCCATGCCTTGTAGACGCCGTCCTTGTTGGCGTAGCCGAACACGCGCGCTTCCTTGACGCGCGTCTTGCCGTGCGGACCTTTGCGCGGGCGGCGCGGCGGAAGGTTCGCCAGCTCGACAACCAGGTCCATCGGGATATCGACCCATTGCGCTTCCATGCCCTTGGCCTCGGGCATCCAGACGCGGGCGTTCTGCAGGTCGAGGTCGGCCGGCCTGAGCGCGACGGCCTGGCCGATGCGGGCGCCCGTCACGAACATGAAATGCGCCATGGCAGCCTGGTAGCGGTTCGCCTTGGTGCGGAAGGCGAGCAGCCATTCCCATGAGCCCGGCGTCTTCTCGACGCGGCTCTTCCTGCCGCGGGCTCGGTCCTGTTTCAGCCGCTCGGCTTTCGGATAGGCCTTGATCCGGATCGGCGGGCATTTGCCGAGATCGTGCGCGTTGTTGATGACCGCGCGCACTGGCGTCACGACGTGGCGCTGCCAGGTGTCGGTCGAGCCCATGGGATAGAGCGTGGGGCCGAGGTTGCGCACCTCCTGCGGCATGATCGAGGCGCAGGCGCGGTCCTCGAGGTGCGGCAGGATCTTCGCAAGGTCGCCGGCTTCCATCGGGTTCGCGGGATAGAGCAGCACCGCGTCGGCGAAGGTCAGCGCCCTTTCGTCTCCAACGAGATCACGTTTGATCTCCTTGAGTTCGAAGGCCGCGACTGCCGCTTGGGCGTCCCTTTCTGGCGCTGTGAAAGGTATGCCAGAGCTTTGTCGGTAGTATTTGCCGCCCGGTATGCGGTCGATTCGGCCTTTGAACCACCACCATCCGTCGCGTTCGTAGAGTTCGAGGGGCAAGTCTGCGGCTCCTGAAACAGTCGGTCGATATGTTCCGGCAGAAGCATCATCACTTTGCCGAGCATCCGGCAAGCCCCGAGCCGGCGCGCGTTCTCGCGGATCGTCCTGGCGGACAGATGCACGCCGCGCTCCGCTTTGAAGATTTTGACCCATTCCTCGGGCGTCTTGCCGTTTTCGAGAATGAGGCTGGTCACGGCTTCGCCCTCCGGATGGCGTTGCCGCTGGTGACCATGACGAAGCCGTCGGCGAATTCGACACGGATCGAGTTGAAGCGCGGCGACGCCGGCTGGCCGAAGCCCGGCAGCGCGAATGTTCCCGCCGCCGCCTTGCTGCGCGCAGTAATAGCGCAGGGCTGTCCTTTGCGGCTGAACCGGTCCCAACGGTATATGTAGGTCATAGCAGTGACCCCTGCGCGATCGGTGCCGGCGTTCCTGGCGGCAGCATCCATCGCGCCGGCTTGGGCACGTCCGCCGCGTCCATTGGCTTCCACTCAAAGAATCCGAGTGCGCCTGATGCCGGAACGAAATCGACGGCGCGGACGTCGGCGAGCACCAGGGCGCGCGGTCCGAAGAACCACGGGCTGTCGTGCTCCTTGACGATGTCGACAACGGTTGCCGCGCCGACGATGCCGCCGCGGGGGAGATCGGCCGGCGCCGGGCAGTCATGGCCGAGTGAGCGAAAGAGATGCGCCGCGTCTTCGTACTCGGCGCGCGTCATGCCTTGGGCGGCATGCACGGCGAACTCGCCCCGGAATTTCAGCGCTGGATTGGGCGCGCGCCAACTGCGGTTTTCGACGGGCTTCCAGCCCATGACGATGGCATAGGCCCAAGGCTGGCGGATGGAGAGGGCGAGGCGCGGAATCAAAGCAGTGTCCTCTGTCTCGGGTCGGCGGGCTTAGACGCGTCGCCGAAGCCCTTGGCGCGCAGCTCGGCGATCTGCTTCAGCTTTTCCTCGTCGCCCTTGCGCCTGGCTTCGAATTCGAGCGGGCCATACTTGTCTGTCTCGATCGCGCCGGCGGCGACGGCCAGCGCGCGCTTGGACTGCGCGATGTCGAAATGCTCCCAGGATGCATCCATGCCGGGCAGGCCGGTGCCGGCCGGGCGCTGGAACCATTTGCGCTGGACGCCGATACGGTCGGCCATGGCGAACAGCTCGGCGCGCGTGTCGGCCCACATGTGGCACATGACCATCTTGCCGAAGGGCGCGCGCATGTTGTCGACGTAGACACTCATTGCGCCAGCACCTTGCGAAACTTGGGCATGCCGTTGTGCTCGACGCCGCCGGCGTCGCGCTTGCTGATGCGTTTGGCGCTGCTAAAGTGACGGTTATGAACACGACCCACGCATACCTTGCCGGAATTCTCGATGCGGATGGGTTCATCACGATCCAACGGACGACCAAAAACCGCGTGGCTCCCAGCGGGCACAAGTGGTCGCCGACTTACTACACCGCCAAGGTTGGGATTGCTGGAACTCGCGACGCGCCACACAAGCTCGCGCTGGAGGTATTTGGTGGATCGATCACAACCTATCACCCCGCCAATGCCAAACACCGAACGACTTACCAATGGAACATTTCCGGGCCGAAGGCGGCGTTCGTGCTCCGAGCGGTGTTGCCATATCTGCGGGTCAAGCACCGGCAAGCAGAACTCGCGCTTGAGCTCGTAGAGCTGGTTCACGTGCAGTTCGAGGAGATAAAGCGGACGCTTAAGCCGCCTTACAGCGTGCCGCCGGCAATGACCGCCGCGCGTCATGCTCTTTATGAAGCGGTGATTGGCTTGAACGAACCTAGAAACAGGGGACATGGTCGCCTCTTGGAAACTCGTTCCATTCACGATCATCGAGAAGACGACCGCTCGCCTTCGATCTTACCCCGCCCCATTGCTTGAAGAAGAACGGCACGCCGGCGGCTGCACATTGATCGCGGATCGAACGCGCCCATGCGGGATGCATCGGACGGGCATCCGGTCCGCTCTCGCCGCCGACGATGATCCAGTCGAGGCTTGGGAGTTCGGGCGGCGTTATCTGCGTGCGGGTATTCAAGCTGCATTCAGCTGGCGATCGCGCGAGATAGCCGCCGTGGGTCACGAGACCGCGAAGCCCGTCAATCCAGGGCTCGCCCTGGGGGTTGTATTGCCGAAGATTTCTGAAATCGATCGGCCCAAGCAACGGCTCGGCGGAGACGAAGCGGATGGCTGCCGGCGTCGCCAGCAACTCGGGTATTCGCTCGTCGGCCTCCTGCTGTCGCTCGGCCGAGACGCCGAGCCAGACGTTCGGCAGCGGCCATTGGTCGAGATAGACGCGAAGTCCGGGCGGCGCCTGACGCTCGTCCATGCCGGGCGCAATCAGCACGACCTGAATCTGCCGCTCGATTACCAGATTGCAGACGAGGTCATAGACGCGATGCTTCGTCGCCGGGTCGGAAAGATAGCGCCGCATCCGGTCCGCGCGCTTCGTCAACGGCTGATAGGTATGATGCGGCGTGACGGCCATCACCGCGAAGGTCTGGTCGATCCACTCTTCGGGCACATCCTCATGGAACAGGTCGCCCATGGAGTTGACGAAGATGCGGCGCGGCTCTTTCCAGCGCAGCGGCTGGGTCAGGATCCGCTCGGGCGCCAGCGCCAGCTTTCCGGTCCAGACGGCATTACCGTTCACCTTCTTCGTGGTGCCGGCGTAGTGCGGCGCGCCGCCCATGGCCTCAATCCGCGCCGCCATCTTCATGGCGTAGCAATGCGTGCAGCCGGCGGTGGCGATCGAGCACCCGACAATCGGATTCCACGTCGCGTCGGTCCATTCGATTGGCGATTTGTCAGCCATCAGCGCAGCCCCAAATAGAGCAAGCCAACGCCGATCACGGCGGTGGCGGTCATCAGGCCGATCAGGATGCGGCGCGCGGCTTTCGATTCAGCCGGCCGGACGTCGATGCAGCCGTCTCCGCACAGGCAATCCGGCCAGCGGCGGCAGATAGTCAGGTGTTCGCGCTGGACCTTCATGGCAGCAACCAGCCGAGCGCGATGAAGGGCGTGGCGAAGGCGATCAGGCAGGCGGTGATGAGGCCTGCCGGGCCGTCGCCGCCGATGGCGGCGAGGAGCTTGGCGGGATCGATGGCGCGATCAAGGTCGGCCAGGATCTCGGACGGGCGGCGCTGCGCCATGAGCTGGCGCGAAATCCACGCCTCGTGCTCGGGCGATCGCGCGGCCTCGGCGATCAGCTTTTCGGCGACGGTCGGCATGGCGGTCAGGCCACGCGGCGGACGGCGCGCATCTTGGCCATGTCGGCCGCGTCCAAGCCGTGTCTTTCGATCTCGGCACGGGTGAAGCTGGAATGCTCGGCCAGGCTGTCGGCCGTCACGGCGATGCCGTTCATGCCCATTTCGAGCATCTTGTCGGCCATGCTCTCGACAATCGATTTCTTTGCGAGAGCCGCCTTGGTCGGCTTCGGCATGGTTTCGTGCAGGGCATAGTGAAGCATTGGTCTGTCTCCGCTCGCGGGATGCGAGTGGGGACGGATAATATGCGCAAACGGATAACTGTCAATAAAAGAAAATGCGCAAACGGATAATATCAATCCGTCAAACGGATAACGGCGGGTTCCTTGGCTATGCCGATTTCAGGCGCAATTGGCCCTGAAGACCGCCCTGCCGGTCAACCTTGCGCAGGACCGAAGGCGGCAACACCGCGAAAATTTCGCCGATCCAGTCCACCGAGACTTCCTCGATGGGCGCCGCGTTCCACGAGATCAGGTTCACGCCTCCGCCTGAGCCTCGCATGATGGTCTTGATGAACCGACGACCATCCGATGTGCGGACTGCCGCTTCCTCGCCATAGAAGGATTCCAGCGGCTTCTTCTGATCCCGATAAACCACAATGATGGTGCCGTCCTTGAAGACCGGCAGCATCGAAACGCCCCGGACCTTGAAAGCTATCATTTCGTCCGGCAATGGGAATGGTATGCTTACCTGGTCGAGCCCTTCCGGAGGCACCTGCTCGAAGTCAGGTTCTACTTCGGCGCCTGCGCCGAGATACCCCATAAGTGGGATTTCCGAGGCGTCAGAAGCGGTGGGGCGCTCGTCGACGAGGTTTTCGAAAGTTTCATTGATCGCATCGCGCCTATGACCTTCTGGCTCGGATCCGGCGAGCCAGCGATTGACCGTCGATTGTGACACCTGGAAATGCTCAGCCAGCTTCTGCTGCTTCCAGCCGGTGGCTTTCATGATGGCGCGGATTTTCTCTTCGATCGTCATGGCGCCGACGCTGCCATGTCGTTCCCCAGAACCAAAATCCGCTTGCGCATAAAATTGCGCTTGCAAACTATCCGTTTGCGCATAATATCCGCGCTCATGAACACGATGCGTCATATCCGGACATCTGTTTTCAGGGTCACACAGGCCGAGTTCGGCAGGCTAGCCGGCGTCGGGCAGGCCACCGTTTCTCGTTGGGAGGGTGGCGTCGCGCCGTCGCTTGAGGAAATGCAGGCGATTCGCAAGGCAGCCTTCGAAAGGGGCATCGACTGGGATGATCGGCTGTTCTTCGAGGCGCCGGAAAACAGCGAGGAGGCTGCATGAGCCTGCTCGCTTCCGCCAGCTTCAACGGGCGGCTCGTGCCGCGTGAAGTCGGCACCGCCGATTACGAATTTCCCGCGCCGGCTTCCCAAGAGCCTGACCGCGCGACCGCCGGGGCGATTGTTCGCGCCTCGGCGGTTCCTTTCACCGAACCCTGTGGCGATGCCCCGCTCCAGGTGAAGACCGCGCCGGCTGACGCCACCCCCGGAACCGGCGCGGTCACGCTTTCCGAAGCCTCGCGCTTCATCCTGTCGCTGATCGCCGCCCATGAGGGCCGCTCGACAAGCGACATCCTGGCCGAGCTTATCGCTGCCGCCGCTGACGCGATCGGGATTTCCGCGCTGCTCTCGCGCGGCGCCGACGACATTGGAGATATCGCCGACCTGCCGGGCTACGCCCGCAAGGCGGCCAACCGATTCCGGAGCGGCGGACCATGAGCACATCCTGTTGCGGGCCTCCTGTGAGCTGACGCCCGCACACTGACTCCACAAACCCTTTCCCACCACGGGAAACGACGCCGGGTTTTCCCGGCGCGGGAAACCTTTTGCGACTGACGAGGAGGCCACAATGGTACCGAACGCGAATGCCCGCCACTTCCTGCTCAAGGCCAAGCAGCGCGACCTGATCACCGCCGCCGGCGGCATCGAGCGCGCGGCCGATATCTGTTCCTATAGCAAGTCGCAGGTTGGCCGCTGGGCCAATGCGGACGCGCCGGAGCTGATGCCGCTCGACGCGGTGTTCGCGATCGAGGAGGAATGCGGCCGGTTCGACATGAGCGAGGCCATCTGTGCGGCGCGTGGCCGGCGCTTCGCCGACGTCGAGGCGGTGGCCGCGAACGGCTCGGTCATGGCGGCACATGCGGAAGCGGTCGTGCGCATGGGCGAGTTGATGAGCGAGGGCGCGCTCGCCTTCGCCGACGGCCAGTTGACGCCGGCCGAAAGCGCGCAGATCGACCGCTCGCTCGCCAAGGTGAACAACGCCATCGCCGAATATCGCAAGGTGCTTGCCGGCGCGAAGGCCGCCGGCGGGCTGAAAGTCGTGGGCGAATAGATGTCCTCGCGCCACTTCCCCTCCGGCCGGATCCAGCATGGCGACCGCACCGTCGCCACGTCGGTGATCGAGTGCAAGTGCGGGGTCGTGGCGCATTTCAAGCAGACCGGCGCCACGCGCAAGCCGCCGGAAGCCGCCGAACAATATTTCCGGGCGCATGGCTGGACGGTCGGCGCGCGCGCCACCGCCGACCGCTGCCCGGACTGCACGACGCGTCTCTCAAAACCAGTTCTGAAGGTGGTTCCCATGGAAGCCAAGCCGGAAGCGCCGCGCGAGATGTCGCGCGAGGACCGCCGCATCATCTTTGCCAAGGTCGACGAACTCTACCTCGACGATAAGACCGGCTACGCCGCGCCTTGGACCGATGCCGCTGTAGCCCGCGACCTCGGCGTGCCGCGCGCCTGGGTGGCGCAGGTGCGCGAGGAGCTGTTCGGGCCGGAAGGCTCGAACGCCGAGTTCGACGACTTCCTCGCCAAGGCGGCGCCCGTCATCGCCGACATGAAGAACCTTTGCCGCTCGGCCACCGCGCAGCTCGAAGAGGCGCGCCGGCTGTCCGAACGGGTCGACGAGCTTGAGCGCATCGCCCGCCGCGTCGAGCGCGAGATCGGCAAGGCTTCGTAACCCCCAACCCTCGGAGAGGAAAACCATGAGTGTCCTTCGTTCATTCCGGCAGATGATCGGGCTGCTTTCGCGGGGCGATTTCTCGCGCCACTGCGACAAGCTTCTCGGCGAAGCGATCGAGGCGCTGGAAGCCTCGCCGGCCGACAAGTGCCAGGCCAAGATCACAGTGACGATCACGCTCGATTACGAGCTTGGCCGCATCGACGTGAAGGCCGATGCCAAGTCGAAACTGCCCGACACCGTCAAGTTCATGAAGACGCCGTTCTGGTCGATCGACGGCGCGCTGTCGGTCGAGCACCCGAACCAGATCGACATGTTCCCCGCCCGGAAAGTCCACGACGCCGACGACGAGGACGAAGAGCGGGAAACCGCCTGACCTTTCCACGCCTGCAACACAGCCAAACCCTGAAAGGAATTTCGCATGTCCGATAAAGATTCACCGCTCAATGCCCATGGCATCGAACTGATCAAGGCCCTGGCCGCAGAGGCTGCCTCGGCTTCGACCGTATCCATCACCACTGGCGGCCTCGGCGAAGGCCTGCCATCCAGCGTTCCGCTCGCCTTCGATCGGAAGAGCCAAGCGTTCAAGTCGCTGAAAGGCCTGATCGAGGAATTCCGCCAGGCGCCGGACCGGCGCAAGGGGACCGCCACGGTCGAGACGTTGGCCAGCTTCATCGAATTGACGAAGCGGCATCAGGACGAGCATTCCGCGCTGTTTGGCAAGACGATGTGGCCGGACCCGAAGATCACGGCGGTGCTCGACTATGACATGGAAGGCGTGCCGGCGCGCAACCGCTCGCATCGCATCGTCTATGCCTTCCCGCTGACGGAAGAGTTCAAGTCGTGGGTCGGCGCCAACGCCAAGCCGATGGACCAGGAAATCTTCGCCGCCTTCCTCGAAGAGCATGCGGCGGAACTGGCCGCGCCGACCGATGGCGAGGTCTCCGAGTACGAGCGGCTGTTCAAGGAAAAGATGGCGACGCCATCGGAAGTCATCGCGCTGTCGCGGCATCTGGAAGTGTTCGTCGCCGCCCGCGCCAAGCAGGGCATCCGTCTGCAGAGCGGCGAGCGCACCGTGGAGTTCGCCGAAGAGCATATGAACGCCAAGGGCGAGGCTATCGTCGTCCCCGGCATCTTCATGGTGTCGGTGCCGGCCTTCCTTGACGGCGACGCGGTGCGCATCCCGGCGCGGCTGCGCTACCGCGTCGGCGGCGGCAAGGTGACCTGGTTCTATCAGCTCTACCGCTGGGAGTTCTATCTGCGGGAGCAGGTGGGCCATGACCTCAAGCGCGCCTCGGAAGAAACCGGCCTGCCTGCCTTCGAGGGTGCGCCTGAAGAGGGCTCGCCGGCCTGATGCCGAAGGCGGTCGCCCTCGCTGCAGCCAAGCCGCTGCCGTCTCTTGTGAAAGAGGCGGCGGCGGCGCTTTCGCGCGCCACGTCCGCCGCGGAAGTGCTCGACGCACGCGACAAGGCCGTCGCCGTCTATGACGCCGCCAAGCGCGCGGCGCGGCTCGGCAAGGCGAAGAAAGCGCATGACGAGCTGATCGCCGCCACCTATCGCGTGCAGGCCGACGCGCTGGAAATCGAGAGCCAGGCCAAGCGCCGGCTCGCCGACGAATATGATGCCGCGCAGCCGGCCGGCGCCGCCAAGGGCGGACGCCCGAAAACCGTTCCGGACGGGAACGGTTTCACGGCAAAGCAGGCGGGGCTCACCCGCAAAGAGATTCACGACGCGCGCCAGATGCGCAGCGCGATTGCCCGCAACCCCGCGATCGTGCGCGAGGCGCTGGACGATATTCTTGAGAGCGGCGACGAGCCGACGCGGGCAGCGCTGAAGCGCGCCATTGCGCCGGCGGTGAAGACCATTCGAGCCGAGGCGCAGGCCGAGAAGAAGGAACGGCGCAACGCTCGCGAGGTCGTGCTTGCGGCCGGCTACAAGGAACTGCCGGCGAAGAAATACGGCGTCATCTACGCCGACCCGGAATGGCAGTTCGATCCCTATTCGGCCGAGACCGGCATGGATCGCGCCGCCGACAATCATTACCCGACCAGCGATCTTCTGACGCTGATGAAGCGCGATGTCGGCGCGCTGGCCGCCAAGGATTGCGTGCTGTTCCTGTGGGCGACGGTGCCGATGCTGATCGAGGCCATTTGCGTGCTCGACGCATGGGGCTTCTGCTGGATCGAGCGCGACCCGAACACCGGCTATCTCGCGCCGAACAAGGCGCATGCCCGCTACGTCTCGCACTGGGCGTGGCTGAAGAACCGTGTCGGCACCGGCTATTGGACGCGCGGCAAGCACGAGATCCTGATCATCGCCACGCGCGGCAAACCGGTCGCGCCGGCCATGGGCGAGCAGCTCGAAAGCTGGTGCGACGACATGGCGATCGAAGCCGATGTCGGCCGGCATTCGGCAAAGCCCGACGTGTTCGCCGCGTGGATCGAAAAGCACTGGCCGAACACGCCGAAGATTGAACTCAACGCGCGCCAGGCGCGGCCCGGCTGGGACCGTTGGGGCAACGAATCCGAAATGGTGGCAGCATGAACGCGATCCTTCAGGTCGATCAGGATCCGTATCTCGATTTCCTGCGCCGCAAGATGCAGCTCGCCAAGGCGGACGGGTTCGATGTCGAGCCGGAGGAGATCAACCCGGCCGGCGCGCCGCATTGCCGCGCCATCGTGCGCTGGGCGCTGAAGGGCGGCAGCCGCGCGATCTTCGCATCGTTCGGTCTGCACAAGACTTTCATGCAGATCGAGCTGATGCGGCTGGTCGGGAAGTTCGTGCCCGGCTTGCGCCTGATCGTGATCCCGCTCGGCGTCCGGCATGAGTTCTTCGACGAGGCGAAGGAGCGCTTCCAGGGCGAGTACGGCGTAACGCTGAAGTTCATACGCTCTGATGCCGAGATAGATGGCGAAGATACGATCTACCTGACCAATTACGAGAGCATCCTAGCCGGCAAGGTCGACGCCTCTCGTTTCGTCGCAGCCTCGCTCGACGAGGCGGCGGTGCTGCGCGGCTACGGCACCAAGACCTTCCAGACCTTCCTGCCGCTTTTCCAGCCGGTGCGCTTCAAGTTCGTCGCCACCGCGACGCCGTCGCCGAACCGCACCAAGGAGCTGATCCACTATGCCGGCTTCCTCGGCGTGATGGATACCGGCCAGGCGCTGACACGCTTCTTCCAGCGCAACTCGGAATCGGCCGGCGACCTGACGTTGTTCCCGCACAAGGAAGACGAGTTCTGGCTGTGGGTGCACAGCTGGGCCGTGTTCCTGCAATCGCCGGCCGACCTCGGCTTTCCCGACGACGGCTATGTGCTGCCGGCGATGACGGTGAACTGGCACGAGGTGCCGATCGACCATTCGACGGCCGGCTATGACCGCGACGGACAGGGGCTGCTTATCCGCAACACGGCGCTTGGCGTGACGCAGGCGAGTGCCGCCAAGCGCGACAGCCTGCCCGCGCGCATCGCCAAGATGTCGGAGCTGATCGCGCAGGATCCTGAAGCGCATAGCGTGCTCTGGCACGACCTCGAGGATGAGCGGCGCGCGATCGAGGCGGCGGTGCCGGGCGTGCGCTCGATCTACGGCTCGCAATCGATCGACGCCAATGAGGAGAACGCGGTCGGCTTCAAGAACGGCAAGTTCCGGCACCTTGCGACCAAGCCGGAGATGTCAGGCGCCGGCAACAATTTCCAGAAGCATTGCCATTGGGCGATCTTCGTCGGGATCGGCTTCAAGTTCCACGACTTCATCCAGGCGGTGCACCGCATCGTGCGCTTCGGCCAAACGAGCGAATGCCGCATCGACATCATCTATTCGGAAGCCGAGCGCGAGGTGCGTCGCAACCTCGAAGGCAAATGGGCCGAGCATGAGCGGCTGATGGCGCGCATGGCCGAGATCATCCGCCGCTATGGGCTCGACGGCCTGCCGCTGGACGACGTGCTGCAGCGCTCGATCGGCGTGGCACGGCGCGAGGAGCGCGGCGAGAACTTCTGCATCGCTCACAATGACGCGGTTCTAGAAGCACGGCGCACGGCCGATGCGTCGATCGGCGAGATCATCACCTCGATCCCGTTCGCCAACCACTATGAGTACACGGCGAGCTACAACGATTTCGGCCACACCGACGACAACGGTCATTTCTGGGCGCAGATGGATTTCCTGACGCCGGAACTGCTCCGCATCCTGAAGCCCGGCCGGCTCGCCTGCATCCATGTGAAGGACCGCGTGCTGTTCGGCTCGGTGACCGGCGAAGGCGTGCCAACGGTCTCGCCCTTCCATGCCGAAGCGATCTTCCACTATCTGAAGCACGGTTTTCAGTACATCGGCATGATCACCGTCGTCACCGATGTGGTGAAGGAAAACAACCAGACCTACCGCCTGACCTATTCCGAGATGATGAAGGACGCCACCAAGATGGGCGTCGGCTGTCCGGAATATGTGCTGCTCTTCCGCCGGCCGCAGAGCGACCTCTCGCGCGGCTATGCCGACGAGCCCGTGGTGCATGACAAGCCGCTGGTGAGCACCGCCGACGGCGGCACGACCAGGTGGAAAGACGGCGACCGGCGCGCGCAGGTGCCGGGCAGCGGCTACACGCTGGCGCGCTGGCAGCTCGACGCGCATGCCTTCTGGCCGTCGTCGGGCGACCGGCTGCTGACTACCGACGAGCTGGTGCGGCTTGGGCCGAAGCCGCTGCGCGAACTGTTCCAGCGCTCGTTCGAGGGCGCGATCTACGATTTCGAAAAGCATGTGCAGCTCGGTGAGGAACTGGCGGCCCGTGATGCTCTGTCCAAAACCTACATGACGCTCGATCCGCGCTCGGGCGATCCGGGCGTGTGGCACGATGTGGTGCGCATGCGCACGCTGAATGGCGAGCAGGCCTTCCGCAACCTGGAGAAGCATGTCTGCCCGCTGCAGTTCGACATCGTCGATCGGCTGATAGACCGGTACTCCAATGCCGGCGACATCATCTACGACCCGTTCGGCGGGCTGATGACCGTGCCCTACCGCGCCATCCTGAAAGGACGCCGCGGGCAGGCTTCGGAGCTCAGCGAGACCTACTTCCGCGACGGCCTGCGCTACTGCCAGGAGGCGGAGCGCAAGCGCGCCATCCCGACGATGTTCGACATGCTTGGCTTGGAGGCGGCTGAATAA
Protein sequences of DBSCAN-SWA_3 >CP034445|3532281:3544784|3532281_3533550_-|AZO04551.1|DBSCAN-SWA MGQNLQSGARRASVRQDDPRERAPARGLPDARQSDDASAGTYRPTVSGAADLPLELYERDGWWWFKGRIDRIPGGKYYRQSSGIPFTAPERDAQAAVAAFELKEIKRDLVGDERALTFADAVLLYPANPMEAGDLAKILPHLEDRACASIMPQEVRNLGPTLYPMGSTDTWQRHVVTPVRAVINNAHDLGKCPPIRIKAYPKAERLKQDRARGRKSRVEKTPGSWEWLLAFRTKANRYQAAMAHFMFVTGARIGQAVALRPADLDLQNARVWMPEAKGMEAQWVDIPMDLVVELANLPPRRPRKGPHGKTRVKEARVFGYANKDGVYKAWKTACKKAGISEIMPHAAGRHGFATEMLVRQKLDARTVAKAGRWADLTLMQKTYTHAEDATGKVQAALRTGRVQATKATAAKARGSKRKSASG >CP034445|3532281:3544784|3536997_3537642_-|AZO04558.1|DBSCAN-SWA MTIEEKIRAIMKATGWKQQKLAEHFQVSQSTVNRWLAGSEPEGHRRDAINETFENLVDERPTASDASEIPLMGYLGAGAEVEPDFEQVPPEGLDQVSIPFPLPDEMIAFKVRGVSMLPVFKDGTIIVVYRDQKKPLESFYGEEAAVRTSDGRRFIKTIMRGSGGGVNLISWNAAPIEEVSVDWIGEIFAVLPPSVLRKVDRQGGLQGQLRLKSA >CP034445|3532281:3544784|3534257_3534629_-|AZO04553.1|DBSCAN-SWA MSVYVDNMRAPFGKMVMCHMWADTRAELFAMADRIGVQRKWFQRPAGTGLPGMDASWEHFDIAQSKRALAVAAGAIETDKYGPLEFEARRKGDEEKLKQIAELRAKGFGDASKPADPRQRTLL >CP034445|3532281:3544784|3540052_3540985_+|AZO04563.1|DBSCAN-SWA MSDKDSPLNAHGIELIKALAAEAASASTVSITTGGLGEGLPSSVPLAFDRKSQAFKSLKGLIEEFRQAPDRRKGTATVETLASFIELTKRHQDEHSALFGKTMWPDPKITAVLDYDMEGVPARNRSHRIVYAFPLTEEFKSWVGANAKPMDQEIFAAFLEEHAAELAAPTDGEVSEYERLFKEKMATPSEVIALSRHLEVFVAARAKQGIRLQSGERTVEFAEEHMNAKGEAIVVPGIFMVSVPAFLDGDAVRIPARLRYRVGGGKVTWFYQLYRWEFYLREQVGHDLKRASEETGLPAFEGAPEEGSPA >CP034445|3532281:3544784|3539656_3540007_+|AZO04562.1|DBSCAN-SWA MSVLRSFRQMIGLLSRGDFSRHCDKLLGEAIEALEASPADKCQAKITVTITLDYELGRIDVKADAKSKLPDTVKFMKTPFWSIDGALSVEHPNQIDMFPARKVHDADDEDEERETA >CP034445|3532281:3544784|3539073_3539631_+|AZO07461.1|DBSCAN-SWA MIECKCGVVAHFKQTGATRKPPEAAEQYFRAHGWTVGARATADRCPDCTTRLSKPVLKVVPMEAKPEAPREMSREDRRIIFAKVDELYLDDKTGYAAPWTDAAVARDLGVPRAWVAQVREELFGPEGSNAEFDDFLAKAAPVIADMKNLCRSATAQLEEARRLSERVDELERIARRVEREIGKAS >CP034445|3532281:3544784|3537959_3538397_+|AZO04560.1|DBSCAN-SWA MSLLASASFNGRLVPREVGTADYEFPAPASQEPDRATAGAIVRASAVPFTEPCGDAPLQVKTAPADATPGTGAVTLSEASRFILSLIAAHEGRSTSDILAELIAAAADAIGISALLSRGADDIGDIADLPGYARKAANRFRSGGP >CP034445|3532281:3544784|3542171_3544784_+|AZO04565.1|DBSCAN-SWA MNAILQVDQDPYLDFLRRKMQLAKADGFDVEPEEINPAGAPHCRAIVRWALKGGSRAIFASFGLHKTFMQIELMRLVGKFVPGLRLIVIPLGVRHEFFDEAKERFQGEYGVTLKFIRSDAEIDGEDTIYLTNYESILAGKVDASRFVAASLDEAAVLRGYGTKTFQTFLPLFQPVRFKFVATATPSPNRTKELIHYAGFLGVMDTGQALTRFFQRNSESAGDLTLFPHKEDEFWLWVHSWAVFLQSPADLGFPDDGYVLPAMTVNWHEVPIDHSTAGYDRDGQGLLIRNTALGVTQASAAKRDSLPARIAKMSELIAQDPEAHSVLWHDLEDERRAIEAAVPGVRSIYGSQSIDANEENAVGFKNGKFRHLATKPEMSGAGNNFQKHCHWAIFVGIGFKFHDFIQAVHRIVRFGQTSECRIDIIYSEAEREVRRNLEGKWAEHERLMARMAEIIRRYGLDGLPLDDVLQRSIGVARREERGENFCIAHNDAVLEARRTADASIGEIITSIPFANHYEYTASYNDFGHTDDNGHFWAQMDFLTPELLRILKPGRLACIHVKDRVLFGSVTGEGVPTVSPFHAEAIFHYLKHGFQYIGMITVVTDVVKENNQTYRLTYSEMMKDATKMGVGCPEYVLLFRRPQSDLSRGYADEPVVHDKPLVSTADGGTTRWKDGDRRAQVPGSGYTLARWQLDAHAFWPSSGDRLLTTDELVRLGPKPLRELFQRSFEGAIYDFEKHVQLGEELAARDALSKTYMTLDPRSGDPGVWHDVVRMRTLNGEQAFRNLEKHVCPLQFDIVDRLIDRYSNAGDIIYDPFGGLMTVPYRAILKGRRGQASELSETYFRDGLRYCQEAERKRAIPTMFDMLGLEAAE >CP034445|3532281:3544784|3536636_3536870_-|AZO04557.1|DBSCAN-SWA MLHYALHETMPKPTKAALAKKSIVESMADKMLEMGMNGIAVTADSLAEHSSFTRAEIERHGLDAADMAKMRAVRRVA >CP034445|3532281:3544784|3534625_3535201_-|AZO04554.1|DBSCAN-SWA MSPVSRFVQANHRFIKSMTRGGHCRRHAVRRLKRPLYLLELHVNQLYELKREFCLPVLDPQIWQHRSEHERRLRPGNVPLVSRSVFGIGGVIGCDRSTKYLQRELVWRVASSSNPNLGGVVSRRPLVPAGSHAVFGRPLDRDEPIRIENSGKVCVGRVHNRHFSSAKRISKRDAGGVEHNGMPKFRKVLAQ >CP034445|3532281:3544784|3537570_3537963_+|AZO04559.1|DBSCAN-SWA MLSQLLLLPAGGFHDGADFLFDRHGADAAMSFPRTKIRLRIKLRLQTIRLRIISALMNTMRHIRTSVFRVTQAEFGRLAGVGQATVSRWEGGVAPSLEEMQAIRKAAFERGIDWDDRLFFEAPENSEEAA >CP034445|3532281:3544784|3533587_3533794_-|AZO04552.1|DBSCAN-SWA MTYIYRWDRFSRKGQPCAITARSKAAAGTFALPGFGQPASPRFNSIRVEFADGFVMVTSGNAIRRAKP >CP034445|3532281:3544784|3536388_3536631_-|AZO04556.1|DBSCAN-SWA MPTVAEKLIAEAARSPEHEAWISRQLMAQRRPSEILADLDRAIDPAKLLAAIGGDGPAGLITACLIAFATPFIALGWLLP >CP034445|3532281:3544784|3538527_3539010_+|AZO04561.1|DBSCAN-SWA MVPNANARHFLLKAKQRDLITAAGGIERAADICSYSKSQVGRWANADAPELMPLDAVFAIEEECGRFDMSEAICAARGRRFADVEAVAANGSVMAAHAEAVVRMGELMSEGALAFADGQLTPAESAQIDRSLAKVNNAIAEYRKVLAGAKAAGGLKVVGE >CP034445|3532281:3544784|3535184_3536216_-|AZO04555.1|DBSCAN-SWA MADKSPIEWTDATWNPIVGCSIATAGCTHCYAMKMAARIEAMGGAPHYAGTTKKVNGNAVWTGKLALAPERILTQPLRWKEPRRIFVNSMGDLFHEDVPEEWIDQTFAVMAVTPHHTYQPLTKRADRMRRYLSDPATKHRVYDLVCNLVIERQIQVVLIAPGMDERQAPPGLRVYLDQWPLPNVWLGVSAERQQEADERIPELLATPAAIRFVSAEPLLGPIDFRNLRQYNPQGEPWIDGLRGLVTHGGYLARSPAECSLNTRTQITPPELPSLDWIIVGGESGPDARPMHPAWARSIRDQCAAAGVPFFFKQWGGVRSKASGRLLDDREWNEFPRGDHVPCF >CP034445|3532281:3544784|3533790_3534207_-|AZO07460.1|DBSCAN-SWA MGWKPVENRSWRAPNPALKFRGEFAVHAAQGMTRAEYEDAAHLFRSLGHDCPAPADLPRGGIVGAATVVDIVKEHDSPWFFGPRALVLADVRAVDFVPASGALGFFEWKPMDAADVPKPARWMLPPGTPAPIAQGSLL >CP034445|3532281:3544784|3540984_3542175_+|AZO04564.1|DBSCAN-SWA MPKAVALAAAKPLPSLVKEAAAALSRATSAAEVLDARDKAVAVYDAAKRAARLGKAKKAHDELIAATYRVQADALEIESQAKRRLADEYDAAQPAGAAKGGRPKTVPDGNGFTAKQAGLTRKEIHDARQMRSAIARNPAIVREALDDILESGDEPTRAALKRAIAPAVKTIRAEAQAEKKERRNAREVVLAAGYKELPAKKYGVIYADPEWQFDPYSAETGMDRAADNHYPTSDLLTLMKRDVGALAAKDCVLFLWATVPMLIEAICVLDAWGFCWIERDPNTGYLAPNKAHARYVSHWAWLKNRVGTGYWTRGKHEILIIATRGKPVAPAMGEQLESWCDDMAIEADVGRHSAKPDVFAAWIEKHWPNTPKIELNARQARPGWDRWGNESEMVAA |
17 | Sinorhizobium_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
3547902 : 3566414
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP034445|3547902:3566414|DBSCAN-SWA TATGACCGGGCCGCGTCTCTCAATCATCCCGGCGCGCGCGGCGACCGACAGGGCGCTGAAACCGCGCGACCTGCAGGTGCTGTGCGTGCTCGGGCGCCACACCGACGATCTCGGCTGGTGTCGCAAGAGCCAGGTGAAGATGGCCGACGAGATGGGCTGCGCACGCGCAACCGTGTTCGAGGCGATCGAGCGGTTGGTCAAGGCCGGCTATCTCGAGCGCTATGTGCAGGAAGAGCAGAACGGGCGCGACAGCCCGCATGTCTATCGCGTCATCCTCGATCCGAAGCACGCCGATCCGGGCAGCGTACGCGAGGCCGATTCCGACGCGGATGACCCCTGCCGGCAGGCCGGCACCCCTGCCGGTATATCGGCACCCCCTGCCGGTCCAGAACCGGCACCCCCTGCCGGTCCAGAACCGGCACCTAAGAACGACCCTCTTAGAACGACCCACCAGAACGAATCCGAGAAAGAGCGCACGCGAGAGGAAGAAAAGGCAATCGAGCGCTGGCTGAAGAAGGCGCATCCGAACTGGCCGAGCTACATCAGCGACAGCGGGCCGAAGGCGCTGGCCGAGGCGCGCAAACTGACCGAGGCGGAGCGCGAGGTGGCGGCCGACCGCATGGGCGATTACGTGGCCAGCGCCAAGGTCGGCGGCCGGACAGTGATCTGCACCTTCGCCGTCTACCTCGCCGAGAAGCGCTGGGAGAAGCTGCCGCCAAAGGCTGCGCCGGCTGTCGAGGCCGACGACTATGCGCCGCCCTTCGGACCGGTGTGGATGGCGTTTGTGCTGGGCCACATGCTGGACGGCCCGACCAATCCTGATGCGACTGCTTTCGCGGAGCGCTGGCCGTATCTGTTCAAGCTGTTCGGCCTTGCGCGCGCCGGGCGCGGCTATCGCTTCGGCGAGCGCTGCCATGCGCTGAAAGGCCAGATGGTCGCCGTACCCGTCGACATGCCGCTGTGGAACGACTGGAAGGCCTGTTTCCAGCAACGCGGCTGGCCTTGGCTGCCTGACCCCGGCCGCGTGGCCTACTTCCCGGTTGGCGGACCGGACGGGCTGGAAGCATTCGAAAAGGCATGGCGAGGAAAGCAAGATGATGGCGATCGGCACCAAGCGGCTGAGTGACGGTGAGCGTGCCTGGGTCGATCCATCGACTGGTGAGATCATCAACGTCGACCGCGCCTGGCAGAAGAGCGACAAGCGAATCGCATTGCGGCGGCGAGAGCAGGCGCTGCTCGCCGCGGCTGGAATGGATGGCCCGGAAGCGCATTGGTACGTGCTCCGGGTCGAGGATCGAGCCGACATTGCTGTGGATAAGTCGCTTGAAGATGCCAATGTCGAGCGCGTCATGCTGCACACGGAAGCCGAGCCGAAGCGTCGTGGCGGGCGCAGACACCAGTGTCTTGAGCCTGTCCGCGTGCCGTCATTCCCGGGCTACATCTTCGTGAAAGTGGTGTCCTGCCCGGTGACATGGGCAGGGCTTCGCACCATCGCCGGTGTCCTTGGTCCGATTGGTGGATGCGAGAACCCGAGCCCGGTGAGCGAGCAGGAAATCGTGAAGTTTCAAGCGCGTATCGAGAACGACCCAGACGCAATCGCCGTGCTGACCAACGCGCTGAAGGCAGGCGACAAGGTGTCGATCGACAGCGGACCGTTCGCTGGCTTCGAGACTGTCGTGCTGCTGCTGAAGGACAAGCATCGCCTGACGGTCGAGACCAGCGTGTTCGGTCGCACGGTGCCGCTCGACGTCGACCTTGCCAACGTCACGAAACTGGATTAGCGAGTATGCCCCAGGACGAGCTGAAAGAGTCGCACGCCGAAAAGGCGGACCGCGCCATCAGCCCCAGAGGAAGCAACCCAGCTTCCTCGCAATGGAAAAGATTCCCGACAATGCCGCATATGCCGAAGACCTTTCGGCCAAGCTACCTGCCATCACGTCAGGAGCAGCAGCGCCAGCATGACAGCCGGCGCGACCAGACCGAGCAGCGTGCCTGGTATAAGACCGCACGATGGCAGAAGCTGAAGCAACGCGTTCATGTTCGCGATCTCTTCCGATGCCAGGAGACAGGCGTGCTGTGCTCGGGCAAGTATCCGGCCGACGACAGTCCGGTCGCCGACCACATCGTCGAACCCGATGGCGACCCAGCGCTGTTCTGGGATGAGAACAACATCCGGACGGTCTCGAAGGCCTACCACGACGGCGAACGCCAGCGCGAGCAGATCGCCGAGCGGGGCCGGGGGGTGGGTCGAAAGTCCAGAGCCCCCACCCGCTAGACCGGCGCCTCCCCAAAGTTTTTTTGCGCGCAGGTTTTCGGATTTATTTTTTTGTTGGAGCTGAACCATGGGCCGCAGGGGACCGAAGCCGGAACCTGCCAGCGTCAAGCTTGCGAAGGGGAAGTCTGGTCGCCGTCCTATCGGCGTCGAGCCTTCGGCCGAAGAGCAGGCGAAGACCGTCGCGACCAGTGGCACGGTCGAGCCGCCGGCATGGCTGAAGGGTGAAGCCCTCGACGTTTGGAACCGGCTGGCGCCGCGCGTCATCGGCATGAAGTTGCTGTTCCCGATCGACGCCGAGACCTTCGGCCGCTACTGCCGCAACTTCGCCCGCTGGCTGAAGATGCAGCAGCGTCTCGACGAGATGGGCGAGATTTACGAGATCGAGACGGCCAGCGGAAAGGTGCGCCGCGCCGATCCGTCGTTCATCATCGGCTATCACCTTGAGCGTCAGCTTGTGACGGCCGAAGCCAACTTCGGCCTCAACCCGGCAGAGCGTCAGCGCCTGTTCGCAGCTCGCGCCGCGGGATCGACCGGCGATCTGTTCTCCGGCATGGGCTCGGCGCCGGCCGCGACTGGCGCCGACAAGACGCCGGCTTCACCGGCTCCGGCCAAGCCGGCCGCCAAAGGTTCGGCGGTAGGATTCCTGCAGTAGGGCCATGCCGGTCAAGGCAGGTCGAGCAGCGCACCGGCTCGGACCGCGCAACCGCCCGCTCGGCGTTGGCGCAGATGCGAAGTGGTGCAAGGACCGGCAAGCGTGGTGTGCCGGCGAGTATTGGTTCGATGAGGTAGCGGCCGACAAGGCGGTCGCGTTCTTTCCGACGCATCTGTGCTTCACCAAGGGCGAATGGGCAGGACGCCCGTTCGAACTGGAGCCCTGGCAGGCCAACGACATCGTCCGGCCCGTCTTCGGCTGGAAGCGCCCCGACGGCACGCGTCGATATCGCCGCGTCTATGTCTGGGTGCCGCGCAAGAACGGCAAGACAGAACTCGCTGCCGGCATCGCGCTCCTAGTGCTGCTCGGCGATGGCGAACTCGGCGGCGAAGTCTATTCGATCGCCTCGCATGAAGGTCAGGCGCGGCTGGTGTTCAACCAAGCCGCCACGATGGCGGGCAAGTCAGAGACGCTCAGCAATGATCTCGTCTGCCTGAAATCGTCGATCTATTGCGCGGCGCTCAATGCCGGTTTCAAGCCGCTGTCGGGCAAGGCCGAAGGCAAGCATGGTTTCTCGGCGTCGGGACTGATCGGCGACGAAATCCACGAATGGGTGTCGGGCGACCTCTACCAGTTCGTGCACGATTCGGAAGACGCCCGGCGCCAGCCGCTGGAATTCCTGATCTCCACCGCCGGCAAGAAAGGAACCTACGGCGAGGAAGTCTGGGACGAGTGTCAGAAGATTCTCGACGGCACATTTGAGGATCCCGAAACGCTCGTCATCGTCTATGCCGCGGACCCGGAAGACGATTGGCAATCGGAAGAGACCTGGCACAAGGCCAATCCGAACCTCGGCGTCTCGAAGAAGTTGGACACGATGCGCACCAATGCGCGCCGTGCCCGCCAGTTGCCGCGTCTCGAAAACCACTTCAAGAACTATCACCTCAATCTCTGGACGGAACAGGCCGTCCGCTGGCTGCCGATCGACGCGGTTGACGACGAGGGGAAAAAGTTCGGCTGGGACCATTGCGCCGGCCCGGTCGGCTGGAAAGAGCTTGAGGCGAAGCTTGCGCACAAGCGCTGTTTCGGTGGCCTCGATCTTTCGTCGGTTGTCGACCTTTCGGCGCTGATCTGGTGGTTCCCGATCCAGCAAGGGCTTCCGGCGCCGGCGCTGCTGGCGCGGTTCTTCAAGCCTGCCGCGCTGATCAAGGAACACGCCAAGCGCGACAAGCTGCCCTACGAGAAGTGGGTGAACGAAGGCGCGATCATCGCTACACCGGGCAACGTCGTCGATTACGCCTTCATTCAGGAACAGATTTACCGCGATGCCGAGAAATTCCGCATCGCGCATGTCGGCAACCACGAACTTGAGGCCGGACAGGGCGGCCTCGCAATCGACCGCTGGAACGCGACCGAGACGGCGGTGAAGCTCCAACAGGAAGGCCTGCCGGTCGTGCTCTTCGGCCAGGGCTTTGCCTCAATGTCGCCGCCGGCCAAGGAACTGGAACGCCTTGTTCTTTGCAATGGCTTCCATCATGGCGGCCACCCGGTGCTGCGCCGCCATGCCCAGGTTGTCGCTGTCGAAACGGACGCCGCCGACAACATCAAGCCGGCGAAGAACAAATCGACCGAGCGGGTGGACGGCATCATCGGCACGACCATGGCGATCGGCATCGCCATCAGAGACAAGGGCGAGGACGCTGGCATGGATGATTACTTCAAATCCCTGACAGGTGCGGCTTGAAACTCTTTCGCAAGATGGCCGATGTCGTGGTTCGGGCGCTCAGCGTCAGGCAGCCGGACGGCTGGTATCCGGCCGGACAGATCGGCGACGCCGGCGAGCCTGTCACTGGCAGCAACGCGCTCGCCCTCTCCGCCGTCTGGGGCTGCGTCAATCTCCTGGCCGGCACTATCTCGTCATTGCCGTTGATGGTCTACCGGACCTTAGCGGACGGCACGCGTGAAGTTGCCAAGGACCATCCCCTTTATCGGCTGCTGCACGACAGTCCGAATTTCGATCAGACCGCCGTCGACTTCTGGGACTTCATGGCATCATCAATCGAGCTTTGGGGCAATGCCTACGCCCGGCAGCTCCGAGGCTCGACCGCTCTCGCTGCGCTGGTCCCGGTTCGTCCCGATCTCGTCTCTGTTCGCCGTCTCACCAACGGCACGCTCGAATATCGCTGGTCCTATGAGGGCCGTGCATATGTCGAGACCGACCAGACGATGCTGCACATCCGCGGCCCTGGCGGCGATCCGCTCGGCGGCATGTCGACGCTGCATTTCGGCCGTCATGCCTTTTCCCTCGCCCGTGCGACCGACAAGGCAGCCGGGAAAACCTTCGCCAATGGTTTGCGGCCGTCAGGCGGTCTGAAGTTCGCCAATTGGCTGAACAAGGACCAGCGTGACATCGCCAGGTCCGAGATCGCCGACAAGATCGGCGCCGACAATGCCGGCAAGCCGATCATCCTTGAGGGCGGTACCGAGTGGGTCCAGTTCCAGCTCAAGCCGGAAGACGCGCAGATGCTCGAATCGCGCGCGTTCTCGGTCGAGGAAATCTGCCGCTTCTTCGGCGTGCCGCCGGTCATGATCGGCCATACGTCGAAGACGACGAGCTGGCCGACGGGCGTCGAGCAACAGGGCTTGATCCTTCAGAAGTTCACGTTGCGCCGCCGGTTGAAGCGCATCGAACAGGCGCTCGAAAAGCAATTGCTGACGCCGAAAGACAGGGCGGACGGCATCTCGATCGAGTTCAACCTTGAAGGTCTGTTGCGCGCCGACAGCGCCGGCCGCTCCGCCTTCTACAAGGACATGACCTCGATCGGCGCGATGACCATCAACGAGGTGCGCATGCGCGAGAACATGCCGAAGGTCGAGGGCGGGGACGTGCCGCGTATGCAGATGCAGAACATCCCCATCACGGCTGATCCGGCCGACCAACAGAACCTGATCGGAGGTCCGAAATGAAGACGAAGGATTTCGCCCTGCAGGTCAAGGACCTGTCGGATGACGGCACGTTCGAGGGCTACGGCTCGATCTTCGGCAACGTCGATTCCTACGGCGAAAAGGTGATGCCTGGCGCCTTCGTCGAAAGCCTCGCCCGCCACAAGCGCGAGGGCAGCAATGTGCTGATGCTGTGGAACCACGACATCTATCAGCCGATTGGCGTCTGGGAAGACCTGGCGGAAGACGCCAAGGGTCTGTGGGGCAAGGGCCGTTTCCTGCTCGACATCCAGCGCGCCCGCGAGGTTCACACGCTCGCCAAGAACAAGGCCATCGGCGGCCTGTCGATCGGCTACAAGGAAGAGGAAGCCGACCAGGAACAGGCTGTGCGCCTCCTGAAGAAGCTGAGCCTCTATGAGATCAGCCCGGTCACCTTCCCGGCGAACCGCCGGGCGCGGATCGAGGCGGTCAAGTCCGAGCGCATTGATGAATTCGCGCGCCGGCTGCGCGACGGCGATCCCATGCCCGTGAAAGATTTCGAGGACATCCTGCGCGAGGCAGGCGTCCCCAAAGCCATGGCCGTACAGATCGCCTCTGTCGGCTATGCGAAGGCCATTCGGAGCGAGTCCGAGGGCAGCAAGGCGAAAGAACCGGCCGCATTCCTGCAGGCGCTCCTGCGCGGCTGATCACTCACCAACTGTTCAAAGGACATTGCTATGAAACTGAACATCGCCACGATGCTGGCGGTAGCGACCATCGCTGCGCTCGGCATTCTCACCGCCGGATCCGCGCATGCCGCCTTCGGCGTCGTCGATCCTTCCGCCCTGGCCGGACTCGCGCCCTTCGCTGCGGTCGGTGCTGGCGGCATTGCCCGCCTCGCTGCCTCGACCATCGGTCCGCGCATCTTCTTCGACAAGCCCAACGACGGAACCGGTGGCGACGTCGTCGACGTCGAGAAACTGGCGAAGGAGGTCAAGGAGAAGTTCCAGAAGGCGCTCGACGACGTCAAGGCCATCGCCGAGGAAGCCCTTGGCAAGGCCAAGGCCGGCGAGGACCTGACCAAGTCCACCAAGGAAAAGGCGGACCAGGCCCTCACCAAGATGAACGGTCTGAACGAGCAGCTTACCACGCTCGAGCAGAAGATCGCCCGCGAAGGTCCGAAGGACGATCAGAAGGCAAAGTCGTTCGGCGAGCAGTTTGTCGAGGCCGAGAGCGTGAAGTCTTGGCTCGGCAGCTCGCCCAGCAAGGGCAAGACCGACCTGCGCATTAAGGCAACGCTGACTTCCGCCACGACCGACACCGCCGGCGCGGTTGGCGACGCCATCGCGCCGACCCGCCTGCCCGGTATTCTGCCGCTCCCACAGCGCCGCCTGACCGTTCGCGACCTGATCTCGACCGGCCGCATGGATGGCAACACACTGGAATACGTCAAGGAAACGGGCTTCACCAACAATGCCGGCATGGTGGCGGAAGGCGCCTCCAAGCCGTCTTCGGACATCAAGCTCGATCTGATGACGACGTCGGCAAAAGTCATCGCGCACTGGATGAAGGCCTCGCGCCAGGTGCTCGAAGACATCTCGCAGCTCCGTTCCATGATCGACCAGCGTCTGATCTATGGTCTTGCCTACAAGGAAGAGTCCCAGATTTTGAACGGTGACGGCACCGGCCAGAATCTCAACGGCATCATCCCGCAGGCGACCGCATATGCGGCGCCGATTTCACTGGCGGACATAACGATGATCGATGTGCTTCGCTTGGCGATGTTGCAGGCGGCGCTGGCTGAGTACCCGGCCACCGGCCATGTCCTCAACCCGATCGACTGGACTTATATCGAGACCCAGAAGGACGATATCGGCCGGTACATCATCGGCAATCCGCAGGGCTCGATCACGCCTACCTTGTGGGGCCTGCCGGTCGTGCAGACGCAGTCGATGACGGTTCGCAAATTCCTGACCGGTGCGTTCAAGCTCGGCGCGCAGATTTTCGACCGTTGGGATGCGCGTGTCGAGGCTGGTTTCGAGAACGACGACTTCACCAAGAACCTCATCACGATCCTTGGCGAAGAGCGCGTCGCGCTCGCCGTCTATCGTCCGGAATCCTTCATCTACGGTGATTTCGACACCGCGCTCGCGGCCTGAGTTCGGCTGATCAAGGCGGGCGGCGTCAGCCGCCCGCTCTATCAACCGAAGGAGGTCGCAATGACTGAAAAGCTGCAATGGAAAGTCCTGCGGGGCCATGAAGGCGACCGCGTCTACAAGGAAGGGGACACCCGCGAGGGCACCCGGGCCGAGCTTGGCCACCTCGAGGGCAAGACGCTGGAACTGATTGGCCCGGTGAAGGCTGAGAAGGCCGAGCCGGCGCCGAAGAACAAGGCCGAGGGCGCTGCGCCCGCGAACAAGGCCGCGAAACGGCGGAAATAGCTCCGTCGGCACCAGGCGCCTTCTCCCAAGGCCGCCGCAACTCCCGAGAGGAAAACACCATGCGTCGCTACAAGGTAGCCATCACCACTGCTGCCGACGGCAGCGCCACGGCGTACACGCCGCGGCTGTCCGGCAAGATTCACAGCATCCAGTATGTGAAGACCGATTTTGCCAACGGCGTCGGCTTCACCGTCACGTCGGAAGCGACGGGCGAGAGCATCTGGGCGGAAGCCAATGTGAACGCGTCCGCGGTGCGCTATCCGCGCGCGCCTACTCATTCGCAGGCCGGCGCCGCGGCTCTCTTTGCCGCTGGCGGCACCGCCGTGCAGGACAAGATCGGTGTCGCCAACGATCGTGTGAAGATCGTCGTCGCGGCTGGCGGCAACGCCAAGTCCGGCACCGTTCACGTGTTGGTCGACTGATCATGCTCGCACCTGTCCGCATCGAGGATCCGGAAGACACGCCCGTCTCGCTCGACGATGTGAAGCTGTATTGCCGCGTCGATTTCGACGATGACGACGTGCTTCTGGAAGCGTTCCTCGATGCGGCCACAGCGCATTTGGAAAAGGTTCTCGATATCGCTCTGGTGACCCAGACCTGGCGGCAGGATTTCGACAGCTTCAAATGCCTGCGCCTGGTCAAGGGACGCGCGCAGAGTGAAGGCCTGATCGTTGAGTATTGGGACGCCGATAACGCCGAGCAGACGCTTCCCGGCCCGACCTATCGGATGCTTGTCGACAGCGTCGGCGCATATGTCGCGATCGCACCTGGCGCAGCATGGCCGCAGGTCTACTCCCGCCCCGATGCGGTGTCAGTCTTCTATGTCGCCGGCAGCGCGCCGGAAGATGTGCCGGCGGCGTTGAAGGCGGCGATCCGGATGCATGTCGCTCACCTCTACCAGAACCGCGAGGCGGTCACCGTCGACGCGGTCTCGAACTTCCTACCGCTCGGTTACGAGGCTCTGATCTGGCCCTTCAAAAAGCCGGGCCTATAGCCGCTCACCTCTCTCAAACCGGAGAACTCTCCCATGGCCGACCTTGTCATCACCGCTGCCAGCGTTGTCGCTGGCGCGAATGCCGAAACCGAAACCGGCGCCGCCGGCGAGGCGATCACCGCCGGGCAGGCGGTTTATCGCTCCAGCACCACGAAGAAGCTCATGAAGGCGGATTCGAACGGCGCCAGCGCCGAGATCCGTACCCCAATCGGCATCGCGCTCAATGGCGGCGCGCTCGATCAGCCGATCAAGTTCCAGAAGAGCGGCGACATCACGATTGGCGCGGCCCTGACGCCGGGCCTCGCCTATTACCTGTCGGATACGCCCGGCGGCATCTGCCCGGTCGCCGACATCGGGGCGGGCGAATATGTCTGTCTGATCGGTCTGGCATCGTCGGCGAGCGTGCTGGCGCTCGACATCCGCTACACGGGCGTGTCGAACTGACATGTGGGTCCGCTTCACCGCCGACTTCGACTGGAAGCCGACGCCGCAATCCGTCATCGCCTACAAGGCGGGGATGGAGCAGAACGTCACGCGCGCCTGCGCGGATGCCGCGCTTTCGGCGGACAAGGCGGTGAAATGCTCCAAGCCGCGCAAGGGTGCCGACGATGGCCAGGCCTGACGCCGGCTCATTGCGCGAGCGCGTCGCCTTCGATGCGCGTTCCGAGATCGACGACGGCTACGGCAACACCGTCGCCGGCGATTTCGAGGAACGCTTCCAGTGCCGCGCCGATTTCCGCAGCAGAGGCGGCTCCGAAGCCGTTCTGGCGGCGCGCCTCGAGGGCCGCAACACGTTCGGCGTCTATGTGCGCTCGTCCTCGCAGTCTCGCCGCCTGACGACCGATTGGCAAATGCGTGATGTTAGAACCGGCACGGTCTACGCGGTCATCGCCGTCGACATCATCACCGATCCAGCGTGGGTCTATCTCACAGTGCAATCAGGCGTGGCGGCATGAGAACACAGGCCAAGTGGATGGGGCGTGAAGCGCTCTATCGGCGTCTCAATCAGCTGCTTCCGAATGTCGAAAAGGAAGTTGCCGTAGAACAGCTCGAGGGCGCGAAGGAACTTGCCAACAGGATCCGGCCGCGCGCGCCGCGTGATACCGGGCATTACGCCGATACCATCCAGGCGGATCGCCTCGCCAACCGACCGGGAGAACGGCGGGTCGGCGGCAAGGTCAAGACGATGTCGACTGGCAACGCGCTGAACGCGACCAAGGATCCGAACGCGACCGGAATATTCGCGGAATTTATCTGGCGATTCCTCGAATTCGGCACAGTCAAGATGGCGCCACGCCCCCATATCTTCCCGACCTACCGGGCGTATCGCAAGCGGCTCCGCCGCCGCATCGCCGGCGCCGTCAATAGAGCCGTTCGCAAGGCAAAGGCCGACGAACAGCGTCTGATGGACCAGAATGAAGAGCTGATCGAAGCTATCGAGAGGTCTAGATGACATCGCCAAGCCTCGAACTGCAGGGCGCGATCGTGGCGCGGCTGAAAGGCGTGGTCGGAGTAACGTCGCTCATCGGGCAGCGGGTCTACGACACCGTTCCCGACAAACCGACCTTTCCCTATGTGACGATCGGCCCGTTCGATGAACTTTCAGACGATGCCGATTGCATCAAAGGCTTCAACATCGCCTTCGACATCAACGTGTGGTCGCGCGCTGTCGGGCTCCCCGAGGCGGAAAAGATCAGCGACGCGGTCCGGATAGGCATTCTCGAGCCGGAACTGATCCTTGCCGACAACGCGCTGGTCTATCTGCAGCACCGGCAGACAGTCTTTTCGCGCGACCCGGACGGACTGACGAACCGGGCTCGCATGAGCTTCGAGGCTTTCGCGGAACAACCGTGAACAGGGTATCCGACTTTTAACAGGCAGGTGGCGCGCCTGCCGGCATCCATGCCGCGCCGATCAACCCACATGGAGACGATACGATGGCATCCCCCACGACAATCAAGGGCGGCAAGGTCAAGGTCATGCTCGGCGACGGCGCGACCCCGGTAGAAAATTTCGCCGTTCCCTGCGGCTTCACGTCCCGCTCGGTCACGCTGACAAAGGCCCTCAACGAGTTCCAGCTTCCCGACTGCGACGATCCCGACAAGGTCGACTGGCTCGGCCGCGATGCGACCTCGCTGTCGATGGCGATCAGCGGCGAGGGTGTGCTCGCCTCCGAGTCGGTCGAAACCTGGCTCAACGCTTGGGAGAGCGTCGATTCGGTCCACGTGAAGGTCATCTGGGAGTTCCCGACCAAGACTATCACCTGGACCGGCTCGATGCATGTCGAGACTTTCGCGGCCACGGCGCCGAACGCGCAGCGCGTCACCGCGAACGTGTCGCTGCAGTCCGACGGCGCCATGACTCGGGTGGTTTCGTGAGCCGAGACGCCTCGACCACCCAGACCTTCGCAAATGACGACTACGTCTTCCGCATGGGCTGGGGCGAGTTGGAGCAGCTTCAGGAGGCCTGTGACGCCGGTCCCTATGTGGTGCTCGACCGGCTTGTGTCCGGCCGCTGGCGCATGGGCGACATTTCCAACGTCATCCGCCTTGGCCTGATCGGCGGCGGAGCGGAACCCGTCAAGGCGCTGAAGCTGGTCCGTGCCTATGTGCAGGACCGGCCGCCGCTGGAAAATCTCGTGCTTGCGCAGCTCGTCCTCGGCGCGGCGGTAGCCGGCGCGCCAGAGGAGGACGTCGGAAAAAAATCAGAGGCTCCGGATCAGAGCGATCCGATGAACTCCCAAACGGAAAGTTCCGGTTCGGAGCCATCTATGGCAACGGAGCCGTCCTCGGCTTCACCCCGCAAGAAGTCCGCCGAATGAGCATGTGGCAATACCTCGCCGCGCTCGAAGGCTGGCAGAAGGCCAACGATCCGGACGGGGACAAGGCCCTCACGTCCAAGGAAGTCGACGAACTCTGGGACTGGATCAGGGAATAGCGATCAAATGGCCGGCGATACCGAAGACCTAATCCTGTCGATCTCTGCCGACACGAAGCAGATACAGCGCGCGCTGCAACGCCTGACCGGGGACACGCGCGCGACGACGACGGCCATCCAGCAGCAGTTCGACGATCTCGGCAACCGCACGGCCGGCGCCTTCGACAATACCGCGACGCGCGCCAGGCAGTCGTTCCGGGTCATCCAGGGCGGCGCCAAGGACATCCAGACCGCCATGAAGGCATCGTCCTTCCAGACGGCGAACCTCGGCGCACAGCTGCAGGACATCGCCGTGCAGCTGAAGGGGGGAGCCTCGCCGCTCACGATCGCACTGCAGCAGGGCACCCAGATCAACCAGGTGCTTGGCCAGGCTGGTGCCGCCGGCGCTGTCAAGGCGCTTGGCGGTGCCTTCACGTCGCTGATCAACCCAGTGTCCCTCGCGACCATCGCCACAATCGCTCTCGTCGGCTACGCCGTCGAATATTTCTCGGAGGTTGTGAGCGGTGGCGACAAGAGCGCCGAGACGCTGAAACAGGAAGCCGAGCTGCTCGACAGGGTCGCGCAGAAATGGGGCGATGCCCTGCCTGCCGTCAAGGCTTATGCCGACGAGCGCAAGCGCGCGCTTGAAGAAGGCGAGATCCGGCAGGCCACGCAACTGACGATCGACGAGCAGTTTAAGGAAGCCCGGGAAACGGTCAAAACCCTGACCGTCGATATCGCCGACCTGGTGAGCCAGCTTCGGCTTGCCGGCGCGCCGAATGAGGAAATCGTCGCCGTCCAGCGCGCGTTCACTGCGCTGCAGAAGGCGGTGGAGGAGGGCAAGGATTCGACCAAGGAGAACAAGGCGCTGCTGGAAGCGCTATCGACGGCCTATTTCAACCGTGGCGTACCGGCGGCCGATGCCTTCGCCCGCAAGGTCCACGAAATCGCCGACGGCTTCGCCGCTGTCGCCAAACAGGCCGACGAGGCGCGCAAGGCGCAGGATGAGGCGTTGAACGCGGAACGGTTTCGCGGCTTCAACGGGCCGCCCGGCGCGTTTACCAACCCGCAAGGCATCCACCTTCCAGCCTCCGCCCCTACGCCGGATCAGCGTCCGTCCTTCGAGGATGTCGGCCAGTCGATCAACAGCCTGAATTCAGCGATCGACGCGTTCGTGCGGCGCGTCGATCGTGCCGAAGGCCGAGGCGATAATCCGAATTCGAGCGCGTCGGGCGTCGGCCAGTTCATCGAGAGCACCTGGCTCAACCTGTTTAAGAAATACTACCCGCAACAGGCAGACAGCATGTCCCGCGACGCCATCCTGGCGCTGCGCGACAATGCGGATGTCTCTTACGACCTGATCCGCAAATATGCCGCCGAGAACGCCAAGGTGTTGCAGGATGCTGGCGTGCATGTCGACGAGGCGGCGCTGCAGCTCGCACACTTCCTTGGCGCCGGCGACGCTGCCAAGGTGCTTAATGCCGCGCCTGGCACGCCGCTCGCCGGGCTGATCTCGCAGGCGTCAATCAAGGCCAACCCGACCATCCTCGGCGGCGGTCGCACGGTCGACGATGCCATCGCCTATGCGCAGCGGCGGGCCGGCGCCAGCGTCGGTAGTTCGGCAAAGACGCCGAGCGACATCTTCCAGGGCAGCATGGATGACATCCAGCGCCGTATCGATCTGCTCAACGCCGAGGCGCAAGCGCAGGCCGGCCTAAACCCGCTGGTGAACGATTACGGCTTCGCGCTCGACAAGGCCAAGATCAAGCAGCAACTCCTCAACGACGCCGCCAAAGCTGGCGTGGAGGTGACGCCAGAGCTTGCGGCCAAGATAGACGAACTCGCCGGCAACTATGCCAAGGCATCTTCCAGCGCCGATGCCTTCAAGGCCAGCCAGCAGAAGATCCTCGACCAGCAGCGCGAGTTGAACGACTTCGGCCGTGGCGTGCTCGGCGGCATCATCGACGATCTGCGCGCGGGCAAGGATGCCGGCGAGATCTTCGCCAACGTACTGAACAAGATAGCCGACAAGCTGGAAGACATGGCGTTGAACGCTCTCTTCCCTTCAGGCGGCGGCGGACTGTTCAGTGGCCTCTTTGGCGGCGGTGGTGGCCTGCTCGGCGGTCTCCTTATCCCCGGCATCCTGCATTCCGGCGGCGTGGCCGGCGTGGACGGCTATGGCCATGGTCGCGCCGTCGCGGCATCCACGTTCGCCGGCGCCAGGCGTATGCATGGCGGCGGCGTGGCCGGAGGCCTTCAGCCCGGCGAGGTGCCGGCTATCCTGCAGAAAGGCGAAGTGGTGCTGCCGCGCGGCGCGCGCGCCGGCGGCTCCAGCGATACCGTGCGCGTCGTGCTTCAGGATGACAGCGGGCGCATGTCCGACATCGCCGACCAGCGCATCAAGACGCATTCCGGAACGATCGTCGATGTTGCCGTGCAGCGCAGCACCAAGGCGGTCCGCAACGGCATGCCGGGCTATCTCGCCGAAGCGCAAAGCCGGTCTTTGTAAGGGGCTTCCATGACCATCCGTTGGCCGTGCGAGGTCCTGGTTCCGCGGGACGTCGCCTTCGATCTTGCGCCGCGCTCGCTTGCCGGGCCGGCATCCGTCAATGGCGCGACACAGGTCGTGTCGAGCGATGCCGGCATCTGGAAGGCGACCTATAGCTCGATCGTCGTCAACAATCGCAATGCCGTGCTGGCGCACCGGGCCATCAGCACGCTCCTCGAGGGGCGCTTGGGCTCGATCCTCGTCCCTCTCTGCCGCGGTTATCAGCCGGTCCCAGACGGCGCGGTTGCCGCTGGGCTCTACGACCAGGTTCCACATAGCGACGATGCCTTGTTCGACGATGGAACCGGCTATGTCGGCGAAGTCATCGACGTCGTCGCGGCGGCGCCGGCGGCGCTGCGTGCGACCACGATGACAGTGACGGTCGGCTACGCAGGCGCCATACAGCCGGGTCAGCATTTCTCGCTCGGCGAACGGCTCTATAGGATCCGGACCTTCGATCAGAAAACCGGCACCATGACTTTCCGGCCGCCGCTACGCGAGCCTGTGCTGTCCGGCGACCGGCTGGAATTCGACGATCCGGTCTGCCGCATGCGGCTCGCAAGCGACGACAGCATGGACCTGCAGTTGAGCCTGCGCCGCTTCGGCACCCCGACCGTCCAGTTCGTAGAGGACGTCTGATGTCGGCTGATTTCTTCACTGCCGAGCAATTGGCGCTGCTTACGGCGGGCCGGGTCTATGTCAGCATCCTGGTCAAGTTCGACTTCGCCTCCGGTCCGGAATATGCCTGGAACGGCAACACCAAGCTTACGGTAAACGGCAACACCTATCTGCCGATGTATGGCGCCGCACAGATCGAAGGGCTTGGGCTTTCTGGCAGCGGCGCGAGCGACAGCGTCACCGTCTCGGTCGACGGTCTGCCCGACCAGGCGCTCGGTTTCCTGGCGAAGGCCTTGGAGGATACGCCGCTCATCGACCAGCAGTTGATGACGGTCTACCTGCAGCTCTTCGATTCCGAATGGCAGACGGTTGGCAATCCAATTCCGATCTTCTGGGGGTTCATGCAGCCGCCCAAGGTCGGCCGGACCGAAATGCAGGATGATCAAGGCGCGATCCAGTCGATTGCCATCGTTGCTGAGAATGCCTTCTTCAATCGATCGCGCCCGCCCTATGGGCGCTACACCGACCGCGACCAGCAGGCGCGCTCGCCCGGCGACAAGTTCTTCGGCTTCGTCAGCTCGATCCTGATGAAGACCGTCACCTATCCTGACTATTGAGGCGCGACGTGACCAAGGCGGAACGATTGCGAGCGGTGCGTGACTACATCACCGCCGAGATGCAGCGCCCTTATGAGAAGGGCGTCACCGACTGCGGCGGCACGATCGACCGCTGGGTCAGGTTTCTATCAGGTGTCTCGCCGGTTAACGCCTTCGGCAGGCAGCTGCGCAATGCGCAGGACGCGGCCGAATGGATGGTCGTGCCGCAGATGTTCGTTGTCATGGTCAATCGCGCTGCGCGCGCAGGAGGATTCAAAAAGACGACCATGCCGATCGTCGGTGATGTCGGCCTAGTGTTTCCGGACAAGGGCTTGATCTGTCCTGCCATCCATGCCGGCGACTGCTGGTTTTCGCGGCATGAGACTGGTGCGCTTGCCGTGCCGATCGACAGATTCTGGAAGGCTTGGTCGGTATGAGCTACATCGGGAACAATCTGGACCCGACAGACCAGAGGTCGATCTTTGCGCGAGCGCGCATGGGTTTGGAAAGCCTTGTTTTGTTGGCACTCACATCGGTTGGCGTCAGCGGCGCGGCTCTCGGCATCGCCGTCGGCGCCATCGTCGGTGCCGTCCAGATCGGCGTCGCCATCGGCCTTTCTTATCTCGCCAGCTCGCTTTTCAGGCCTGACCCGCCGAAGCCGCAGGATGTGCAGACCTCGGTCAAGAACCCCGTTGCGCCGCGCGTGCGGCACTATGGGCGGGTCAAGGCCTCCGGTCCGTGGGTATTCGTCGAGAACAAGAGAGGCCTGCTGTTCAAGGTCATTGCCCTTGGGACCGGCCGGCTCGATGCGATCGAGGAGTATTGGATCGACGACAATCTGGCGACCCTAAGCGGCGTCACTGTTACATCGGCGCCATACAACGGCAAGGCGTTCATCCGTTCCCGTCTCGGCCTGCCCACTGAAACGGCCTATAGCGATCTCACGGGCAATTTTCCCGAGTGGGATTCCAATCACCGGGGAGACGGAATCTCCTCTCTCTATGCGGTCCAGGCGTCGGTGGCGTCGGACAAGATCACGGAAGTATTCCCGAACCTCTCCAATACGCTTTACCGAGTGGTCGCGCGGGGATCGATCGTCCACAATCTCGGCACCGGCACCGACATCTGGTCCGAGAACGCCGCCGACATTATCCGCGACTACATGCTCCATGCAGACGGCATGCGTCTGCCTGCAAGCATCGTCGACACGCCTATCGCCGCGGCGGGATGGCTTGCTGCTTACAACCGCGCCGCCGAGGCAGTCCCGCTGAAAGCCGGCGGAACGGAAAAGCGCTATCGCCTCTGGGGATCGTACCAACTAAGCGAGCGGCCGGCCGACGTGTTGTCGCGCATGACGGCAGCTTGCGACGGTCGGCTTGTGCCGACCTCGGATGGGGGCCTCACGCTGGATATCGGCACTTGGGAAGAGCCGACCGTCATTCTTGACGCCGACGCGATTACGGGGTTTTCCGAGGTAGCGCGCGGGCGCGACGTGCTGACCACGGCCAACATCATTCGCGCGACCTTCACCTCGCCGTTCCACGACTACCAGTCTACCGACGCCGACCAATGGATCGACGAGGATGACGTCGCGCTACGCGGCGAGATCCCGCAGGACACGCCCTTCAATATGGCCCCGTCGCATGGGCAGTGCCGGCGCCTCATGAAGCTCGCCGCCTATCGCGCCAATCCATCATGGGTGGGCGTCTTCCAGTGCAATCTGCGCGGTCTGGCGGCGTTCGGGAAACGCTTCGTACGGCTCACCTATCCGCTGTTCGGCATCGACGAAGTCTTTGAAGTCCAGGACTTTCGCTTCAACATCGCCGATGGCGGGATCCTGACCGGCGTATCGCTTCAGGTGCAGTCGATGCCGTCGCAAGCCTATGATTGGGATGCCGCCGCCGAGGAGGGCACCGCGCCGATCTCGGAAGAGACGACGGTCGACAAAACCATACCCCTGCCGACCGGCTTCTCCTTCGGAGTGTCGCGCATTACCGTCGGTAGCCAACAGGTCCCTTACGGAGTGTTGGCCTTCGATGCCTCGCCTTCTGATGCTCTGAAGATCCAGGGGCAATACCGTAAGGTTGGAGCCACCGATTGGCTGGTTGTACCGATCGCCGATGATGCAACGTCTGCAAATACGCAGGCGCTGTCAGATGGTGTGCAATACGAAGCGCAAGTGCGCTTCGTTACGCTGACTGGGCGCGAAGGTGCCTGGACGTCGCCAAGCTTGAAGGTGACGCCGGTTGCCGACCCTACCGCGCCAGGCGTGGTCACCTCGGTTAGCAAGACCGGCGGCACTGGCCAGGTGACGCTTAACTGGACGGCACCGAACAGCGCCAATTACACCGCTGCCAATATCCGCCGCAACACGGTGAATACCGAGGGCTCCGCGGCGCTTGTCCGCACCGAGTACGGTCCGCCGTCGACGGCGGATAGCTACGTGGACGGCGGCTTGGCGGCGGGCACGTACTACTACTGGATCAAGGCCGCCAACGCATCCGGCGTCGAATCGGCCAGCGTGGCCACCGGCTCGGTGACCGTTACCTGA
Protein sequences of DBSCAN-SWA_4 >CP034445|3547902:3566414|3564359_3566414_+|AZO07465.1|DBSCAN-SWA MGLESLVLLALTSVGVSGAALGIAVGAIVGAVQIGVAIGLSYLASSLFRPDPPKPQDVQTSVKNPVAPRVRHYGRVKASGPWVFVENKRGLLFKVIALGTGRLDAIEEYWIDDNLATLSGVTVTSAPYNGKAFIRSRLGLPTETAYSDLTGNFPEWDSNHRGDGISSLYAVQASVASDKITEVFPNLSNTLYRVVARGSIVHNLGTGTDIWSENAADIIRDYMLHADGMRLPASIVDTPIAAAGWLAAYNRAAEAVPLKAGGTEKRYRLWGSYQLSERPADVLSRMTAACDGRLVPTSDGGLTLDIGTWEEPTVILDADAITGFSEVARGRDVLTTANIIRATFTSPFHDYQSTDADQWIDEDDVALRGEIPQDTPFNMAPSHGQCRRLMKLAAYRANPSWVGVFQCNLRGLAAFGKRFVRLTYPLFGIDEVFEVQDFRFNIADGGILTGVSLQVQSMPSQAYDWDAAAEEGTAPISEETTVDKTIPLPTGFSFGVSRITVGSQQVPYGVLAFDASPSDALKIQGQYRKVGATDWLVVPIADDATSANTQALSDGVQYEAQVRFVTLTGREGAWTSPSLKVTPVADPTAPGVVTSVSKTGGTGQVTLNWTAPNSANYTAANIRRNTVNTEGSAALVRTEYGPPSTADSYVDGGLAAGTYYYWIKAANASGVESASVATGSVTVT >CP034445|3547902:3566414|3556250_3556613_+|AZO04579.1|DBSCAN-SWA MRRYKVAITTAADGSATAYTPRLSGKIHSIQYVKTDFANGVGFTVTSEATGESIWAEANVNASAVRYPRAPTHSQAGAAALFAAGGTAVQDKIGVANDRVKIVVAAGGNAKSGTVHVLVD >CP034445|3547902:3566414|3558634_3559039_+|AZO04584.1|DBSCAN-SWA MTSPSLELQGAIVARLKGVVGVTSLIGQRVYDTVPDKPTFPYVTIGPFDELSDDADCIKGFNIAFDINVWSRAVGLPEAEKISDAVRIGILEPELILADNALVYLQHRQTVFSRDPDGLTNRARMSFEAFAEQP >CP034445|3547902:3566414|3555969_3556191_+|AZO04578.1|DBSCAN-SWA MTEKLQWKVLRGHEGDRVYKEGDTREGTRAELGHLEGKTLELIGPVKAEKAEPAPKNKAEGAAPANKAAKRRK >CP034445|3547902:3566414|3563892_3564300_+|AZO04590.1|DBSCAN-SWA MTKAERLRAVRDYITAEMQRPYEKGVTDCGGTIDRWVRFLSGVSPVNAFGRQLRNAQDAAEWMVVPQMFVVMVNRAARAGGFKKTTMPIVGDVGLVFPDKGLICPAIHAGDCWFSRHETGALAVPIDRFWKAWSV >CP034445|3547902:3566414|3557218_3557629_+|AZO04581.1|DBSCAN-SWA MADLVITAASVVAGANAETETGAAGEAITAGQAVYRSSTTKKLMKADSNGASAEIRTPIGIALNGGALDQPIKFQKSGDITIGAALTPGLAYYLSDTPGGICPVADIGAGEYVCLIGLASSASVLALDIRYTGVSN >CP034445|3547902:3566414|3556615_3557185_+|AZO04580.1|DBSCAN-SWA MLAPVRIEDPEDTPVSLDDVKLYCRVDFDDDDVLLEAFLDAATAHLEKVLDIALVTQTWRQDFDSFKCLRLVKGRAQSEGLIVEYWDADNAEQTLPGPTYRMLVDSVGAYVAIAPGAAWPQVYSRPDAVSVFYVAGSAPEDVPAALKAAIRMHVAHLYQNREAVTVDAVSNFLPLGYEALIWPFKKPGL >CP034445|3547902:3566414|3549828_3550176_+|AZO07463.1|DBSCAN-SWA MPSRQEQQRQHDSRRDQTEQRAWYKTARWQKLKQRVHVRDLFRCQETGVLCSGKYPADDSPVADHIVEPDGDPALFWDENNIRTVSKAYHDGERQREQIAERGRGVGRKSRAPTR >CP034445|3547902:3566414|3559122_3559563_+|AZO04585.1|DBSCAN-SWA MASPTTIKGGKVKVMLGDGATPVENFAVPCGFTSRSVTLTKALNEFQLPDCDDPDKVDWLGRDATSLSMAISGEGVLASESVETWLNAWESVDSVHVKVIWEFPTKTITWTGSMHVETFAATAPNAQRVTANVSLQSDGAMTRVVS >CP034445|3547902:3566414|3550243_3550828_+|AZO04574.1|terminase|DBSCAN-SWA MGRRGPKPEPASVKLAKGKSGRRPIGVEPSAEEQAKTVATSGTVEPPAWLKGEALDVWNRLAPRVIGMKLLFPIDAETFGRYCRNFARWLKMQQRLDEMGEIYEIETASGKVRRADPSFIIGYHLERQLVTAEANFGLNPAERQRLFAARAAGSTGDLFSGMGSAPAATGADKTPASPAPAKPAAKGSAVGFLQ >CP034445|3547902:3566414|3563287_3563884_+|AZO04589.1|DBSCAN-SWA MSADFFTAEQLALLTAGRVYVSILVKFDFASGPEYAWNGNTKLTVNGNTYLPMYGAAQIEGLGLSGSGASDSVTVSVDGLPDQALGFLAKALEDTPLIDQQLMTVYLQLFDSEWQTVGNPIPIFWGFMQPPKVGRTEMQDDQGAIQSIAIVAENAFFNRSRPPYGRYTDRDQQARSPGDKFFGFVSSILMKTVTYPDY >CP034445|3547902:3566414|3558137_3558638_+|AZO04583.1|DBSCAN-SWA MRTQAKWMGREALYRRLNQLLPNVEKEVAVEQLEGAKELANRIRPRAPRDTGHYADTIQADRLANRPGERRVGGKVKTMSTGNALNATKDPNATGIFAEFIWRFLEFGTVKMAPRPHIFPTYRAYRKRLRRRIAGAVNRAVRKAKADEQRLMDQNEELIEAIERSR >CP034445|3547902:3566414|3549133_3549682_+|AZO07462.1|DBSCAN-SWA MLAAAGMDGPEAHWYVLRVEDRADIAVDKSLEDANVERVMLHTEAEPKRRGGRRHQCLEPVRVPSFPGYIFVKVVSCPVTWAGLRTIAGVLGPIGGCENPSPVSEQEIVKFQARIENDPDAIAVLTNALKAGDKVSIDSGPFAGFETVVLLLKDKHRLTVETSVFGRTVPLDVDLANVTKLD >CP034445|3547902:3566414|3547902_3549027_+|AZO04573.1|DBSCAN-SWA MTGPRLSIIPARAATDRALKPRDLQVLCVLGRHTDDLGWCRKSQVKMADEMGCARATVFEAIERLVKAGYLERYVQEEQNGRDSPHVYRVILDPKHADPGSVREADSDADDPCRQAGTPAGISAPPAGPEPAPPAGPEPAPKNDPLRTTHQNESEKERTREEEKAIERWLKKAHPNWPSYISDSGPKALAEARKLTEAEREVAADRMGDYVASAKVGGRTVICTFAVYLAEKRWEKLPPKAAPAVEADDYAPPFGPVWMAFVLGHMLDGPTNPDATAFAERWPYLFKLFGLARAGRGYRFGERCHALKGQMVAVPVDMPLWNDWKACFQQRGWPWLPDPGRVAYFPVGGPDGLEAFEKAWRGKQDDGDRHQAAE >CP034445|3547902:3566414|3557793_3558141_+|AZO04582.1|head,tail|DBSCAN-SWA MARPDAGSLRERVAFDARSEIDDGYGNTVAGDFEERFQCRADFRSRGGSEAVLAARLEGRNTFGVYVRSSSQSRRLTTDWQMRDVRTGTVYAVIAVDIITDPAWVYLTVQSGVAA >CP034445|3547902:3566414|3550832_3552572_+|AZO04575.1|terminase|DBSCAN-SWA MPVKAGRAAHRLGPRNRPLGVGADAKWCKDRQAWCAGEYWFDEVAADKAVAFFPTHLCFTKGEWAGRPFELEPWQANDIVRPVFGWKRPDGTRRYRRVYVWVPRKNGKTELAAGIALLVLLGDGELGGEVYSIASHEGQARLVFNQAATMAGKSETLSNDLVCLKSSIYCAALNAGFKPLSGKAEGKHGFSASGLIGDEIHEWVSGDLYQFVHDSEDARRQPLEFLISTAGKKGTYGEEVWDECQKILDGTFEDPETLVIVYAADPEDDWQSEETWHKANPNLGVSKKLDTMRTNARRARQLPRLENHFKNYHLNLWTEQAVRWLPIDAVDDEGKKFGWDHCAGPVGWKELEAKLAHKRCFGGLDLSSVVDLSALIWWFPIQQGLPAPALLARFFKPAALIKEHAKRDKLPYEKWVNEGAIIATPGNVVDYAFIQEQIYRDAEKFRIAHVGNHELEAGQGGLAIDRWNATETAVKLQQEGLPVVLFGQGFASMSPPAKELERLVLCNGFHHGGHPVLRRHAQVVAVETDAADNIKPAKNKSTERVDGIIGTTMAIGIAIRDKGEDAGMDDYFKSLTGAA >CP034445|3547902:3566414|3559559_3560006_+|AZO04586.1|DBSCAN-SWA MSRDASTTQTFANDDYVFRMGWGELEQLQEACDAGPYVVLDRLVSGRWRMGDISNVIRLGLIGGGAEPVKALKLVRAYVQDRPPLENLVLAQLVLGAAVAGAPEEDVGKKSEAPDQSDPMNSQTESSGSEPSMATEPSSASPRKKSAE >CP034445|3547902:3566414|3553791_3554457_+|AZO04577.1|head,protease|DBSCAN-SWA MKTKDFALQVKDLSDDGTFEGYGSIFGNVDSYGEKVMPGAFVESLARHKREGSNVLMLWNHDIYQPIGVWEDLAEDAKGLWGKGRFLLDIQRAREVHTLAKNKAIGGLSIGYKEEEADQEQAVRLLKKLSLYEISPVTFPANRRARIEAVKSERIDEFARRLRDGDPMPVKDFEDILREAGVPKAMAVQIASVGYAKAIRSESEGSKAKEPAAFLQALLRG >CP034445|3547902:3566414|3552568_3553795_+|AZO04576.1|portal|DBSCAN-SWA MKLFRKMADVVVRALSVRQPDGWYPAGQIGDAGEPVTGSNALALSAVWGCVNLLAGTISSLPLMVYRTLADGTREVAKDHPLYRLLHDSPNFDQTAVDFWDFMASSIELWGNAYARQLRGSTALAALVPVRPDLVSVRRLTNGTLEYRWSYEGRAYVETDQTMLHIRGPGGDPLGGMSTLHFGRHAFSLARATDKAAGKTFANGLRPSGGLKFANWLNKDQRDIARSEIADKIGADNAGKPIILEGGTEWVQFQLKPEDAQMLESRAFSVEEICRFFGVPPVMIGHTSKTTSWPTGVEQQGLILQKFTLRRRLKRIEQALEKQLLTPKDRADGISIEFNLEGLLRADSAGRSAFYKDMTSIGAMTINEVRMRENMPKVEGGDVPRMQMQNIPITADPADQQNLIGGPK >CP034445|3547902:3566414|3554508_3555909_+|AZO07464.1|capsid|DBSCAN-SWA MLAVATIAALGILTAGSAHAAFGVVDPSALAGLAPFAAVGAGGIARLAASTIGPRIFFDKPNDGTGGDVVDVEKLAKEVKEKFQKALDDVKAIAEEALGKAKAGEDLTKSTKEKADQALTKMNGLNEQLTTLEQKIAREGPKDDQKAKSFGEQFVEAESVKSWLGSSPSKGKTDLRIKATLTSATTDTAGAVGDAIAPTRLPGILPLPQRRLTVRDLISTGRMDGNTLEYVKETGFTNNAGMVAEGASKPSSDIKLDLMTTSAKVIAHWMKASRQVLEDISQLRSMIDQRLIYGLAYKEESQILNGDGTGQNLNGIIPQATAYAAPISLADITMIDVLRLAMLQAALAEYPATGHVLNPIDWTYIETQKDDIGRYIIGNPQGSITPTLWGLPVVQTQSMTVRKFLTGAFKLGAQIFDRWDARVEAGFENDDFTKNLITILGEERVALAVYRPESFIYGDFDTALAA >CP034445|3547902:3566414|3562619_3563288_+|AZO04588.1|DBSCAN-SWA MTIRWPCEVLVPRDVAFDLAPRSLAGPASVNGATQVVSSDAGIWKATYSSIVVNNRNAVLAHRAISTLLEGRLGSILVPLCRGYQPVPDGAVAAGLYDQVPHSDDALFDDGTGYVGEVIDVVAAAPAALRATTMTVTVGYAGAIQPGQHFSLGERLYRIRTFDQKTGTMTFRPPLREPVLSGDRLEFDDPVCRMRLASDDSMDLQLSLRRFGTPTVQFVEDV >CP034445|3547902:3566414|3560129_3562610_+|AZO04587.1|DBSCAN-SWA MAGDTEDLILSISADTKQIQRALQRLTGDTRATTTAIQQQFDDLGNRTAGAFDNTATRARQSFRVIQGGAKDIQTAMKASSFQTANLGAQLQDIAVQLKGGASPLTIALQQGTQINQVLGQAGAAGAVKALGGAFTSLINPVSLATIATIALVGYAVEYFSEVVSGGDKSAETLKQEAELLDRVAQKWGDALPAVKAYADERKRALEEGEIRQATQLTIDEQFKEARETVKTLTVDIADLVSQLRLAGAPNEEIVAVQRAFTALQKAVEEGKDSTKENKALLEALSTAYFNRGVPAADAFARKVHEIADGFAAVAKQADEARKAQDEALNAERFRGFNGPPGAFTNPQGIHLPASAPTPDQRPSFEDVGQSINSLNSAIDAFVRRVDRAEGRGDNPNSSASGVGQFIESTWLNLFKKYYPQQADSMSRDAILALRDNADVSYDLIRKYAAENAKVLQDAGVHVDEAALQLAHFLGAGDAAKVLNAAPGTPLAGLISQASIKANPTILGGGRTVDDAIAYAQRRAGASVGSSAKTPSDIFQGSMDDIQRRIDLLNAEAQAQAGLNPLVNDYGFALDKAKIKQQLLNDAAKAGVEVTPELAAKIDELAGNYAKASSSADAFKASQQKILDQQRELNDFGRGVLGGIIDDLRAGKDAGEIFANVLNKIADKLEDMALNALFPSGGGGLFSGLFGGGGGLLGGLLIPGILHSGGVAGVDGYGHGRAVAASTFAGARRMHGGGVAGGLQPGEVPAILQKGEVVLPRGARAGGSSDTVRVVLQDDSGRMSDIADQRIKTHSGTIVDVAVQRSTKAVRNGMPGYLAEAQSRSL |
22 | Brucella_phage(30.77%) | protease,terminase,portal,head,tail,capsid | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|