Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP039734 | Sulfurospirillum sp. ACSDCE chromosome, complete genome | 3 crisprs | cas3,DEDDh,cas2,cas1,WYL,csx20,csx1,csm3gr7,csx19,csx10gr5,cas10,cas6,cas4,csa3 | 0 | 9 | 3 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP039734_1 | 599208-600003 | TypeIII |
NA
Consensus repeat of NZ_CP039734_1
|
11 spacers
spacers of NZ_CP039734_1
>1.1|599243|34|NZ_CP039734|CRISPRCasFinder,CRT CAAATCAAAAAGTAGAAAAAGCTATGGTTTATCC >1.2|599312|34|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR AATCTGCTGATGTTCTTTTTGAAGAACAAGAAAT >1.3|599381|34|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR CAGAATTTATGGGTTACAGTAACCACAATACTAT >1.4|599450|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR TGGTGATGCTTGGGATTGCACGCTTGATGAGAT >1.5|599518|34|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR TCATTCAGAAAACTTAAAAGATTCTGATGGTTGG >1.6|599587|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR TAAAATTTATTAAAAGGAAATAAAATGATTAAA >1.7|599655|34|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR TCATGAACCTTTTCCTAAGCTAAAAATTCTTCTA >1.8|599724|35|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR AAAGTAAATGTAAAAGAAGATGGTAGAAAATCTCA >1.9|599794|34|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR CGAAAATGCGTCTAGTTATTTTAGCTGTTTTGAC >1.10|599863|37|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR ATCATTAAATTATTAAAAGGAGTTCCAATGTTATCTT >1.11|599935|34|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR AGAAACATTACAATGGTTTATTACAAATCAATCA |
cas2,cas1,WYL,csx20 |
CRISPR arrays and Neighbor proteins around NZ_CP039734_1
The CRISPR arrays of NZ_CP039734_1 >merge|NZ_CP039734|1|599208-600003|CRISPRCasFinder,CRT,PILER-CR AAGCAAATCCCCTTATAATCGGGTCAGTATGTAATCAAATCAAAAAGTAGAAAAAGCTATGGTTTATCCGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATAATCTGCTGATGTTCTTTTTGAAGAACAAGAAATGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATCAGAATTTATGGGTTACAGTAACCACAATACTATGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATTGGTGATGCTTGGGATTGCACGCTTGATGAGATGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATTCATTCAGAAAACTTAAAAGATTCTGATGGTTGGGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATTAAAATTTATTAAAAGGAAATAAAATGATTAAAGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATTCATGAACCTTTTCCTAAGCTAAAAATTCTTCTAGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATAAAGTAAATGTAAAAGAAGATGGTAGAAAATCTCAGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATCGAAAATGCGTCTAGTTATTTTAGCTGTTTTGACGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATATCATTAAATTATTAAAAGGAGTTCCAATGTTATCTTGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATAGAAACATTACAATGGTTTATTACAAATCAATCAGTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT >NZ_CP039734|1|1|599208-600003|CRISPRCasFinder AAGCAAATCCCCTTATAATCGGGTCAGTATGTAAT CAAATCAAAAAGTAGAAAAAGCTATGGTTTATCC GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT AATCTGCTGATGTTCTTTTTGAAGAACAAGAAAT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CAGAATTTATGGGTTACAGTAACCACAATACTAT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TGGTGATGCTTGGGATTGCACGCTTGATGAGAT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TCATTCAGAAAACTTAAAAGATTCTGATGGTTGG GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TAAAATTTATTAAAAGGAAATAAAATGATTAAA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TCATGAACCTTTTCCTAAGCTAAAAATTCTTCTA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT AAAGTAAATGTAAAAGAAGATGGTAGAAAATCTCA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CGAAAATGCGTCTAGTTATTTTAGCTGTTTTGAC GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT ATCATTAAATTATTAAAAGGAGTTCCAATGTTATCTT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT AGAAACATTACAATGGTTTATTACAAATCAATCA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT >NZ_CP039734|1|1|599208-600003|CRT AAGCAAATCCCCTTATAATCGGGTCAGTATGTAAT CAAATCAAAAAGTAGAAAAAGCTATGGTTTATCC GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT AATCTGCTGATGTTCTTTTTGAAGAACAAGAAAT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CAGAATTTATGGGTTACAGTAACCACAATACTAT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TGGTGATGCTTGGGATTGCACGCTTGATGAGAT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TCATTCAGAAAACTTAAAAGATTCTGATGGTTGG GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TAAAATTTATTAAAAGGAAATAAAATGATTAAA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TCATGAACCTTTTCCTAAGCTAAAAATTCTTCTA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT AAAGTAAATGTAAAAGAAGATGGTAGAAAATCTCA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CGAAAATGCGTCTAGTTATTTTAGCTGTTTTGAC GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT ATCATTAAATTATTAAAAGGAGTTCCAATGTTATCTT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT AGAAACATTACAATGGTTTATTACAAATCAATCA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT >NZ_CP039734|1|1|599277-600003|PILER-CR GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT AATCTGCTGATGTTCTTTTTGAAGAACAAGAAAT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CAGAATTTATGGGTTACAGTAACCACAATACTAT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TGGTGATGCTTGGGATTGCACGCTTGATGAGAT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TCATTCAGAAAACTTAAAAGATTCTGATGGTTGG GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TAAAATTTATTAAAAGGAAATAAAATGATTAAA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TCATGAACCTTTTCCTAAGCTAAAAATTCTTCTA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT AAAGTAAATGTAAAAGAAGATGGTAGAAAATCTCA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CGAAAATGCGTCTAGTTATTTTAGCTGTTTTGAC GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT ATCATTAAATTATTAAAAGGAGTTCCAATGTTATCTT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT AGAAACATTACAATGGTTTATTACAAATCAATCA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT
>NZ_CP039734.1|WP_167749315.1|598290_598785_-|porin-family-protein MTFIQRFLVGGVAIATFSSPIFAEDFYYDLHVGLSHLSINDGGDGANYTIGYGVNRIFDNKVFFGVSFDLETATISDTHIYGLNGDLKLGYNIWDKLNVYAIGGYKLQDLDGTSGYGLGYGAGMEYPITQHIITAIEYKTYHMSRSDYKDYDFDTIGLNLKYRF >NZ_CP039734.1|WP_167749314.1|597336_597561_-|helix-turn-helix-domain-containing-protein MDVELLVQKLLEKYGYMLTREEVAEVLNISISTVDRKRKQYPKLFPPFKKIGTGNRAPVKFPVHGVAQYLVQIQ >NZ_CP039734.1|WP_167749313.1|596558_597128_-|aminodeoxychorismate/anthranilate-synthase-component-II MVLMIDNYDSFTYNIVQYCLELGADLKVIRNDELSVEEIEALHPEKIIISPGPATPNEAGVSLEVIKYFGGKIPILGICLGHQAIGQAFGGKVVRAKRMMHGKTSITKQLHNSCLFDGLPETFTTTRYHSLTVEQEGLPDVVVPTAYSTDDHEIMALQIKDKQIYGVQFHPESILSEHGHAILNNFLKL >NZ_CP039734.1|WP_167749312.1|595371_596562_-|glycosyltransferase-family-39-protein MRERLFLLLILVISSALLTYGAQSISISYDEAYTFFNGNDLVHYLAVYSTQLLGQNDFALRLPFILLHLTSIILLYKIGKLFLKKRIDRIVSVAIYAFLPGVNGVALLVNSGIVVIFFSLLFTYLYLKEWKVASHIVLIICLFVDNSFAIFYIALFVYALMKRKTDLLIVTLILFSASMYLYGFDTGGKPRGYFIDTLGVYAIVFSPLLFLYFVYAMYRILIKEEKNLLWYISFFSLVVSLLLSLRQKLLLEDFAPFVVLSVPLMVKVFFNSYRVRLPAFRKLHTFFFILVLITLFINTMLSFFNKPLYAFMNEPSKHFAINYNIARELANELKARDIHKVITKDDKMALRLKFYNIERGGAYKLMNQKEIEEGFEQIDIAYYGKTVRTFYLYRIN >NZ_CP039734.1|WP_167749311.1|594687_595359_-|type-II-secretion-system-protein MKKAFTMLELVFVIVVVGILSYFVSTGFQRNPLREAADQLVSHIRYTQHLAMMDDKFSLTDASWALGRWQLYFSNNTGSDDQWAYTIFSDWKAGHTGNPDMGEVAVNPLNSSQYLTGGTSGTNIIHYSDQSATKELNIGHKYGIKDITFSGGCRSNVKYIHFDYLGRPMNSLSTNPYELPATGWHKLLTTQCKITLCDKDCTDGSAQKVSIAIEAETGYTHIL >NZ_CP039734.1|WP_167749310.1|582406_583297_-|DMT-family-transporter MKKNQSKAYFFAISAVLLWSTVASAFKLSLAYFDALNLLLYASFFSLCVFSCAMGYQRKFYILFLYSKKTYFKLALLGLINPFIYYLVLFHAYELLPAQEAQPINYTWALTLTYLSVFILKQKISVYDFLAGLICYFGVLIISTHGDLWGFSFYSLTGVFLALFSTVLWSLYWIYNTKLHVDPLVGLFINFLFGVPAILFYALITSHPLVFNIHGFLGSAYVGIFEMGITFILWLQAMKLSTNTAKIANLIFISPFLSLVFISIFVGEVILFSTYIGLIFIIIGLLLQQRVKKVDR >NZ_CP039734.1|WP_167749309.1|582017_582410_-|thioredoxin-fold-domain-containing-protein MKKWICVLIIFLLGTLSLHADFLEAERKKALNEKKLILLSVTKEFCPYCIKMEKDVFENALYRNQIEKKYLHVTINRENPELPQALHVKYFPTNLILSPKDLKIIDDFAGYIEPVSFIELLDEVYEQEFK >NZ_CP039734.1|WP_167749308.1|581146_582025_+|nucleoside-recognition-protein MSFSLQRSLQTSLKSSWTILKLIVPIYILADILFYYNLLSYITFIFKPLVALLGLPQETALSIVSGLFLNLYAAIAFAAPLGLSPKEWTVLAVFLGIAHALIVETEIMKRLGISRVYSILLRLCAGLLVGGLTSKLPQSWFSNELSQEAMVPSHPLYHSLSDLLQNSLYESLSLSLKVILLVTTLIFVLDFIKSLHMIEKHSQKVNSGFSILVGVILGITYGAGILIAEYEKGILQKREILFIGTYLMICHAIIEDTLLFVIFGANLWIMVGLRLTFATLIAYLVLKYTKIT >NZ_CP039734.1|WP_167749307.1|580083_580623_+|antirestriction-protein-ArdA MLEIFITDLCAYNNGFLIGKWITLPLSGKELYMAISEVLSEGEYACKSDSTHEEIFITDYSWKGKSIFDVEEYDSPWDLNDDVGKLFELSVAQQKAVAFLLSEQFTYDMDDAIQRSDDVIIHENQTLEDVAYCLLQECYELDKLPPIIANHIDYEGVARELDYDGNYFEVDGDVFEYCG >NZ_CP039734.1|WP_167749306.1|579513_580014_+|hypothetical-protein MGKVSEHYRQQQETAQAEILEIDYKTGEIILSADAKDTELIKTTKVKAFNLLARTDFVQINGVWEAKRDALIKILSSLPLSYSWHIKEAEMTTAYSKILGVLTITTGSLSRQAESFGICELSELKGNGGMHFMNARAETRALKRAIETLFGSVINYFVVTYMDKAA >NZ_CP039734.1|WP_167749316.1|600190_600457_-|CRISPR-associated-endonuclease-Cas2 MKNYLLCYDISDKKRLAKLAKLLEKEAFRIQNSIFLLLEPTSHEVDILVQKIEQCIDKAHDDVRLYTIKSNGFHFGSATNLDEPFLLI >NZ_CP039734.1|WP_167749317.1|600453_600669_-|hypothetical-protein MLQVNLKGSYIEVSDGVLSQIYGIAQLRSLYLHKEIDLNIATAYELSKYLDIFFINARGMILARYERIQTV >NZ_CP039734.1|WP_167749318.1|600677_601412_-|CRISPR-associated-endonuclease-Cas1 MTANANVPILYLTKDSKQFALTLPAMAKNGELKALQYANLSNNLSIAKKLLFDKFTTHKASLEHFDITISIDETLTHLALAQSIEEVMGIEGAFAKRYFAHYFSLFEKSLTKGFRSKNPPLDPINAMLSYIYTLSYYAITAKLYMRGFDPSLSYLHTPLRSHFALSSDLLEPLRASINCFVAELFLKQILHAEDFTCKNGVYLKYDTRRQLWTHLKPFMNSLNPQINRQIVMLKKNLEKNDALL >NZ_CP039734.1|WP_167749319.1|601531_602644_+|IS21-family-transposase MLKKGEIKMIKKFLAEGFSKSAIARKLGISRETVRRYANLPDDYIPHINRPPVINSVDPYLPHIAKMLETAEQQKSEIPLTVIYEEIKKLGYDGSLRWLQQVILRYELRARAKLDEPIIRFETKPAQQMQVDWVEFPKDNLSAFVATMGYSRASYVEYVNNEKIETLIGCHMNAFAYFGGVPKECLYDNMKTVILSRNDYGKGDHRFNPLFADFAKHCGFSIKVCKPYRAKTKGKVERFNHYLRYNFHNGLRVRLSMKHYTLTLDNANAEVLKWLDNTANKRIHQTTLQMPFELLAQEQLQLLPVPKAYQGIHPKALIESVAKKYSPINSHKDLEKLYIPNRDIQCYDEFIPMVANIILPVGFYGGALWS >NZ_CP039734.1|WP_025344704.1|602634_603453_+|ATP-binding-protein MELDTSIDELCKELKLSIIGEKYHDIASMAAKENWQYTQFLEEVLRVEVDNRLGRSKNMLTKLAGFPVIKTLEQFDYTFSVGVNRKQIEELSSLIFVKKYENIILLGESGVGKTHLAIALALKAVQHRYKVRFTTISELLSNANRAKKEKKYDSFLKSIASPSVLVIDEIGYFNMSKEEANHFFQIISKRYEKSSTIFTSNLVFSKWVQVFAGDKIVTTAILDRVLHHSHIINIQGDSYRLKEKKQTGVLHSEIYKFEAKSSNIEGQNQEVV >NZ_CP039734.1|WP_167749320.1|603477_603669_-|hypothetical-protein MNTIIIDKDKTEVTYKASKLYTAGQSIPIKLVDMLVITDSVCIDTKSIIQIANVKRSAELVSL >NZ_CP039734.1|WP_167749321.1|603665_603935_-|hypothetical-protein MMHNYLLCYDIFNGKRLYKIRKISYPFALGGQKSALEMPLSRKEAKVLLTTLSPHSAPEDKINLIEIEEHPLYIGKSIDVIFEEGMIIL >NZ_CP039734.1|WP_167749322.1|603924_604176_-|hypothetical-protein MRQWINFSLVHLTYKIKLYYALHKVFYTLLVLMIYQLLVQESILTLKNGIAICVTLILSMLCEHYYKRYLAQYYEARAQNNDA >NZ_CP039734.1|WP_167749323.1|604508_605513_-|WYL-domain-containing-protein MKSTTTEKKIIHIFTLMQKLYEGEELYPQNERILDELGVNERTLRRYLEDIHRLYGDILVSEKKQKYLHGKKVSVYRVPNKEQDISKTLRFFLEESNELSWILTLINENNPRFLKQLSLSEKEAIEQAIAQDKEVFLFKSNPFENLQDEHERLFSQLKIAVKHHEYRTIIYRYDNEETVESVKCLKLIFTNNNWYLAIETANEELRLLRIAFIKEVRYSAKTTYQKHVLAKYRYYFESMQNAMSLQGVALKTAVLKASPSIRRYFLKEMKPFFASQKFIEAPSDGSVIFSVDYTQPIEILPFVKQWLPELEILEPKELRILYKTELQKALAQQG >NZ_CP039734.1|WP_167749324.1|605512_605896_-|hypothetical-protein MIKKLFVLMNHEMLPSQISQAHEVLGIEKIISLNDTNWSSFDPDVSSIITALASYKSRLLKDASRGDYLLVQGDFGATYHMVCFAKKLGLTPLYATTKRVATQKMVDGSVVTQREFLHVRFREYEDA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP039734_2 | 616302-616815 | TypeIII |
NA
Consensus repeat of NZ_CP039734_2
|
7 spacers
spacers of NZ_CP039734_2
>2.1|616337|32|NZ_CP039734|CRT GTAACAGAAGTTGAGCATACAACTGCATATTA >2.2|616404|35|NZ_CP039734|CRT,PILER-CR TGGTTTTTTCTATGCCAATCTCTCTTCTTCTGGTA >2.3|616474|34|NZ_CP039734|CRT,PILER-CR TTATCAAGGAGAATAAATGATTGTTTCATTTATG >2.4|616543|34|NZ_CP039734|CRT,PILER-CR TGGATCAGCAGCAGAATAAAACAATCTTTTTAAG >2.5|616612|34|NZ_CP039734|CRT,PILER-CR CTGAAGCGGTAATGGAGAATTTGGAAGGTCTCTC >2.6|616681|33|NZ_CP039734|CRT,PILER-CR CAAAAGACGCAAGATCTGCAGAAATTTTAAAAA >2.7|616749|31|NZ_CP039734|CRT,PILER-CR CAAAATGCTATCGTACAACGATCCAATTGAA >2.8|616405|34|NZ_CP039734|CRISPRCasFinder GGTTTTTTCTATGCCAATCTCTCTTCTTCTGGTA >2.9|616475|33|NZ_CP039734|CRISPRCasFinder TATCAAGGAGAATAAATGATTGTTTCATTTATG >2.10|616544|33|NZ_CP039734|CRISPRCasFinder GGATCAGCAGCAGAATAAAACAATCTTTTTAAG >2.11|616613|33|NZ_CP039734|CRISPRCasFinder TGAAGCGGTAATGGAGAATTTGGAAGGTCTCTC >2.12|616682|32|NZ_CP039734|CRISPRCasFinder AAAAGACGCAAGATCTGCAGAAATTTTAAAAA >2.13|616750|30|NZ_CP039734|CRISPRCasFinder AAAATGCTATCGTACAACGATCCAATTGAA |
cas6,cas10,csm3gr7,csx10gr5,csx19,csx1,csx20 |
CRISPR arrays and Neighbor proteins around NZ_CP039734_2
The CRISPR arrays of NZ_CP039734_2 >merge|NZ_CP039734|2|616302-616815|CRT,PILER-CR,CRISPRCasFinder AGCAATGATGATGTTAAATCGGGTCAATATGTAATGTAACAGAAGTTGAGCATACAACTGCATATTAGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATTGGTTTTTTCTATGCCAATCTCTCTTCTTCTGGTAGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATTTATCAAGGAGAATAAATGATTGTTTCATTTATGGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATTGGATCAGCAGCAGAATAAAACAATCTTTTTAAGGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATCTGAAGCGGTAATGGAGAATTTGGAAGGTCTCTCGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATCAAAAGACGCAAGATCTGCAGAAATTTTAAAAAGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATCAAAATGCTATCGTACAACGATCCAATTGAAGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATA >NZ_CP039734|2|2|616302-616814|CRT AGCAATGATGATGTTAAATCGGGTCAATATGTAAT GTAACAGAAGTTGAGCATACAACTGCATATTA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TGGTTTTTTCTATGCCAATCTCTCTTCTTCTGGTA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TTATCAAGGAGAATAAATGATTGTTTCATTTATG GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TGGATCAGCAGCAGAATAAAACAATCTTTTTAAG GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CTGAAGCGGTAATGGAGAATTTGGAAGGTCTCTC GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CAAAAGACGCAAGATCTGCAGAAATTTTAAAAA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CAAAATGCTATCGTACAACGATCCAATTGAA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT >NZ_CP039734|2|2|616369-616814|PILER-CR GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TGGTTTTTTCTATGCCAATCTCTCTTCTTCTGGTA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TTATCAAGGAGAATAAATGATTGTTTCATTTATG GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TGGATCAGCAGCAGAATAAAACAATCTTTTTAAG GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CTGAAGCGGTAATGGAGAATTTGGAAGGTCTCTC GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CAAAAGACGCAAGATCTGCAGAAATTTTAAAAA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CAAAATGCTATCGTACAACGATCCAATTGAA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT >NZ_CP039734|2|2|616369-616815|CRISPRCasFinder GTTTCAATCCCCTTATAATCGGGTCAGTATGTAATT GGTTTTTTCTATGCCAATCTCTCTTCTTCTGGTA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAATT TATCAAGGAGAATAAATGATTGTTTCATTTATG GTTTCAATCCCCTTATAATCGGGTCAGTATGTAATT GGATCAGCAGCAGAATAAAACAATCTTTTTAAG GTTTCAATCCCCTTATAATCGGGTCAGTATGTAATC TGAAGCGGTAATGGAGAATTTGGAAGGTCTCTC GTTTCAATCCCCTTATAATCGGGTCAGTATGTAATC AAAAGACGCAAGATCTGCAGAAATTTTAAAAA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAATC AAAATGCTATCGTACAACGATCCAATTGAA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAATA
>NZ_CP039734.1|WP_167749333.1|615332_616160_-|CRISPR-system-precrRNA-processing-endoribonuclease-RAMP-protein-Cas6 MLRFIKIDITFSSSFTTSLDFLGSTLRGAFGVALKAVSCLNKTQECQGCFATQNCLYYDFFEIKNRAHAYRFSKPLHENNYNFSLYLFEAACEKLPYVLSALHTMFGRIGVGWNRHLIPIERIVCNGVVVSSGKNFDLSQIVVDVFETKEYFPDITLHFLTPLRIKSNNALLHTTPTLEQILSSIFNRHNELKGIPLAKLPFTPRYKELRTHLRFQELSRYSNRQESTMHLGGMMGTIDYEDVDEHSYALLKLGELLGVGKQTVFGLGEIKVGHQ >NZ_CP039734.1|WP_167749332.1|613884_615276_-|hypothetical-protein MAKYLYGASIQGIQEFIFKTNQLQEIIGASEIVRSLEGEFTAFANVADENILLNAAGNIKALFDDEENLRNVVKNFPKKIMQKAYGITLSQAVVDMTKAEYDTYEKAINALEKKLKTQRNKPSIPLDISINLMELAPKTARPVAKIEKKGNDEIRMDVSTAQKYQANPTNKGQLSDLKNSKGKIAIIHADGNGLGALIPTLSNIPVFSKGLDKATHEAYEEAKKVIQSGKIRKIILGGDDMSVICDANDALAFTRTFLEKFETLTYMHTGHKLTACAGIAFCNEKYPFHYAVSLAEALCGATKKHAKNIDKKLAPSSLMFHNIQSSNFQSWEKFIEDELTIRNDHDVIRCDFGPYYVAHVGQPSIEDLIDVVKVYQQEGSPISRLREWLGELSKSHHYAKDLLDRINENSAQNDKKALDTLNLKLKKVHPALGTQTMIIPKDGFDKTLIYDVLQILSVTEAKA >NZ_CP039734.1|WP_167749331.1|613360_613888_-|CRISPR-associated-protein MTLRYQLKFYDYWHIGSGLSAGARLDSTVIKDQNDLPYVAGKIIKGLSREMAETLDDADFVNTCFGNNGIEMGKCYFSNAQLLETTAKHICSLNLQSNLYDIIASTKIDETNGVAEDNSLREIEVVVPLELCGEIRDLPENYKEAMTRSLKMIKRMGLNRNRGLGRCEFFVEENQ >NZ_CP039734.1|WP_167749330.1|611978_613364_-|hypothetical-protein MKELVFHVEFLSDIVLPATSNTEGKIDHLDFIAGSNFLGMVAQKYDDFKDSFTVFHSGKVRFGDAHILKNEKLPTYKVPYSYVHEKLENERIYNHHTLTPDDFKTLGQLKQLRSGYITKEKELVFIDYSYSQKSAYDKENRKSLDSTMYGYKAIKKGSMWQLVVKVDGSIVPEDEKLMVETLQTSSRLGKSKSAEYGQVKITHLADQARENISEKDLPLNEVILYCNSRVALLDESGNPTYELKHLCEGLKEEQIAYEKTQIRTSTFTPYNTKRQTKDYERVCIKKGSVIVLKDISDAQLKQIQQGVGAYLSEGFGEIIINPSFLMEKGFTFASNEPEEVKKDERQKITQTFTDSTIQFLVNRHNTTIDTLRIAKNVADFIKNHKAEFKNINSSQWGNIRSIASSNQNDFIEKIKGYIGSGSKKWETHQVETLVKAMDHDAINKQTFIKLLAMKMGGKNDE >NZ_CP039734.1|WP_167749329.1|610732_611989_-|CRISPR-associated-protein MMNKRYRAHIVIEAQTPLKMGSNAMDFLQDSPVQKDWNGLPMILGTSIAGVLRKEFHGDKAADVFGLNNGSKIIISNALLIADTQGMVYEDLCVQKSDFLKLFENLPLREHTKITSKGTTQKGAKFDEEVVFKGTRFKFSIEFIDNNKELFMELIGLLRSSNFRLGGGSTKGFGKFKMIEIEYGLFDIEKYSSSLNKPLGGDKECNLETITCKDYTPYILQLKPDDFFMFGSGFSDNDADMTPVYEQVIDYEKRALSEKKVLIPASSIKGALAHRTTYHYNKLHGNTIEAGNGVDSVSTLFGAAKNSKQNIDGAKGKILITDCFKNDHGKTKTFDHVSIDRFTGGAMEGALFQEKTVANDEKWYEIELLVHSDIQGKELEAFELALKDVTTGMLPLGGATTKGHGIFLGTITKNGVAL >NZ_CP039734.1|WP_167749328.1|610349_610736_-|TIGR04423-family-type-III-CRISPR-associated-protein MKKNQIEIIEYINTLKGYEGYIQMSDAPIKDIWTTPSTITFSPQNGFIYEAHFFNGKDSIAIRQMNDVWFVDETKDVSLIDTQIYDAKQSLKIKMAQVWEEENDLLCENIPVLKLKKVVFAGFAGDRK >NZ_CP039734.1|WP_167749327.1|608289_610353_-|TIGR03986-family-CRISPR-associated-RAMP-protein MITAPYNFVPLNKEVFYPSWSEDVSHDIPFEDSESGEIDITITAKSPIFIRNHSDERDKPSEEFCQYNGEYYIPSTTVKGMVRSVLEIISFSKMNPDMVDDKTYSIRDLRNRDLYMNRMKPQEIFCGWLKKVGNDYKIEDCGVVGRISQDNIHKDFGSKFKAQQGGFINKPDFKTAKYKYDLLKKLNIELTQKFDFSKEAQGKKIYTKGTKLEGTLVLTGQASARKDNGRMGDGKIYEFVFFNAIGEITLSKETMENFKFAYFDGRKTEPKESPDWKYWKEKLANGQKVPVFFQKEMRKDSNGQNQTIIKHFGLSYLYKLPYNGSIHDGIFKSHFEKKLDLAQTLFGYVNNDTALKGRVQFSHFKAIDKTKIVKLAPRTENLGTPRASFYPFYIFQEDEKLYSTFMNDNFVLAGRKRYPIHKNNPDESKISYNPTSSIGTSFKPLKEGVIFKGKMRYHNLKKCEIGALLSALIFHSTPNTFHNVGLGKSLGYGKINVAIHYPNKEQYLQEFETLMKENINNWMSSNILKELLTMATPQDNQRNSQLSYMELADFAQTKNDKINGYLKNYSKLDNIVSVTFTSPSPQKQQAIQVATSDIISKTKMRKTITEAWQRVFGIFYHPHQIEEFLKGAFQTTPTDQQQIYLKLKDNHSFIELIRMIHQYNNGHLNDQEVGKLYLEVLNMRSYK >NZ_CP039734.1|WP_167749326.1|607510_608293_-|hypothetical-protein MTRFHKREAQKIVASFIGSNTLSIRGALIAIILMIIASWLPDGLIEVVAGVKEEDKVKVFCQLLISSGLLVWLMFFIKNLSKSYVPTVDIEIEPSPQASKVLILFLSANPKVHEVLHVNSIDELERNNFRMPLLAINHHKTRLKKVIVICSEDSQNIYEPFKNLVQKLLPTCDISMIEQTPSVIDFEDGNAVYELLETLYQDLAKAKYKHKEIMIDITGGTKVVTLASAIFAIPNDKELEYVSTSDYQIKTYDIRYSEAN >NZ_CP039734.1|WP_167749325.1|605965_607495_-|CRISPR-associated-DxTHG-motif-protein MTIISICGMIGTTKPNPNEKSFIQKTDADKAVYDVDLSLQYLIAPPKETYINMLPLLVDTFAKEHQIIALATASAEKIQKEVLSFEGLDVTKCSFEFIDDTAFEAYFSTVNQLLREHDEVIIDLSHGFRHLPLLTLVSLLVNHLKSPEKIKHILFAQEVIPSKHYKIIDLNEYLDIATIAYALVSFKDNYTIARSIVLRTDRYKPLLEILRDFSQHILSNAIQTLFERELPRQIREKIEELDSDPHVAALKELLSGIRLHLINLEAISKKADYERYYKIGKLLLEKGYLLNAITIINEALPLYIQNILHSKKLLCIPSGTDAYHVTKSMMDFIEKGKRDSTLMSEEIDAYFVCSNKVVFDAFSSLQQKMRQLRNDFAHAYGENAHETITSTLEMLFATFHTLVFEQNLFATIKPSDRTSPPCQYDYTLFELKANTMFLKLFPFVLFKNVFEEKRILKLYQKEIPPQWNIPKNMSEKQHRLIDILYHYFEHKNDPKEARHYMDAFYQDFK >NZ_CP039734.1|WP_167749324.1|605512_605896_-|hypothetical-protein MIKKLFVLMNHEMLPSQISQAHEVLGIEKIISLNDTNWSSFDPDVSSIITALASYKSRLLKDASRGDYLLVQGDFGATYHMVCFAKKLGLTPLYATTKRVATQKMVDGSVVTQREFLHVRFREYEDA >NZ_CP039734.1|WP_167749334.1|617884_618817_-|hypothetical-protein MKPIVVLPLIAAALFLGGCATTPSQPNPSSLQQATSLKEYIALKRGGETINYNVIAEREHQSIIEYRTYDEAMNVADARDIMEVLDDAKGYCESIKGKSVYGDKAIQALNARPTLLSLDYVSYKSAIREQGLGKYEGFYQCLSPNDGFSIVSMKDNVELHQSAILGNKRDLKETYSRFYLIQHDKAQSLGLKTWLKGTKYAQISSKYTSFEDLLDLEKNSSVFPWRYERITGAQKYCTYHGGELFVSNALTQFKPITMDEYLFMRLETMNPPTINVFMNQETFTCKNSANPAQSFTLIHTDKQLEYKKGE >NZ_CP039734.1|WP_167749335.1|618827_619136_-|hypothetical-protein MLSMIRDYLIMVLVKLVIAFVIFGLCLLGIFIAYWVFCDYTRATMQIHEIMINDFTKVVLGFISLLCLFQDVSTGNNNGNAKKFYKKTEKYFEMYDTAGIKW >NZ_CP039734.1|WP_167749336.1|619148_619403_-|hypothetical-protein MKKVLLALIMVVGCLFASEGQKSVTTCRFEARQIGAFADSPMTCSGDFKKSSTTQELYGDGWRLINSYTQNNNVYLVFEKNKNL >NZ_CP039734.1|WP_167749337.1|620000_621242_+|MBL-fold-metallo-hydrolase MTLTIHKGTNEIGGSCIELSTQSTTILFDYGTPLNLESTKLDFKNKKIDAIVISHPHQDHFGEITMVETTIPIYCGKLSKELMNATKLFTGQGLLANKFHHFEAWKSFQIGDITITPYLVDHSAVDAYAFLVEYDGKKVIYSGDFRANGRKSKLFENMLTQKKLKNADVLLMEGTMLQRNNEEFPTEISVENKIVETLKNTEVITFMIGSSQNIDSLVSAYRACKKAEKIFVIDMYTAWILEKMSSVSASIPTMDWKNVFVLKSYGGSYYEKIKKNRDYFGDFQYRLFSNVILLDDIQKEPSRYYVKISPWHIEKLLKKLDTSSANIIYSQWLGYLKPEFSDKKTVDLFKKLQENHNWVYAHTSGHADLESLKKFSEALSPKALVPIHTEHKDAFCKHFENTVVLEDGMPFTI >NZ_CP039734.1|WP_167749338.1|621258_621807_+|hypothetical-protein MQETNEKLILTFENVYKENSYRGYQIDDLTDFKSESIDVEMEDIKEDIIANIKAYLNIENNTSSFIYNDIETILEGDFLLDYTIETVDVVFHDHYTYDDYKHDNSLYIDMYEDHCLKIENKYEELDHDISQEQMDVFEDALHEETQEFIDNTSSYIANEGAYFISNYYYTVDVVITLTKSIF >NZ_CP039734.1|WP_167749339.1|621828_622485_+|hypothetical-protein MEFLEKLMQVIVDEGIQTPKAQVERYLSPILGLFLEEILKKTFHKEYQMIVPEFPIRKGTIAKSVGSEQSESNQSTNIDYLMYNQTENKFVFIELKTDSKSFKPSQRKIYEDLKCVAKDKNNIFGQLLYDDLEKILSKSTSKDKYKYLKTKWNDSMSAINDMEIIYIVPAKTGLKEEVGREDENKLCVLYFNDLPVELSLFSEEWKIILEYLKKLDMN >NZ_CP039734.1|WP_167749340.1|622532_622898_-|hypothetical-protein MKKNHFTNILVLVAVFFVFTGCAGKDMKKFGGEFALVNQTGEPISTAVTITLGVTIYGIGALVDSSREEEQKPQQQETIFLTENNQSMNESNTSQTSYFDSGIIHTNNVNHVDVMNGTPPQ >NZ_CP039734.1|WP_167749341.1|622894_623335_-|hypothetical-protein MFNDFIVHINNALGASNSKVIVIALVIFLYVSIGWFINSYGERHYQYPVVSENWNYILFLAACIGVAIFLGILISSGTGLLSGEEVMKLIVFGILVGFIALYVRIYRNTSLLFATFSYIYMVTFSIILITIAFILAMASSRRDDQN >NZ_CP039734.1|WP_167749342.1|624226_625144_-|SPFH/Band-7/PHB-domain-protein MDSSTGLSLSFFIILLLAGGYLLYQMVRIVPQGEEWIIERLGKFHTVLKPGLNFLIPVFDKVQMRLNTKELIQQMKAQEVITKDNAVVIISAVVFYKISDPAKAVYSIDNFELAVANMAATTLRSVIGNMELDTALSGREIIKSSVSEKISDHLEQWGLSLTAVEVQDIRPSDTLQEAMEKQAAAQREKKALIMKAEGEKQAAITKAEGFKQSLILEAEGKYEASKKEAEAKVALANGDKEAMVVIASQIKVGDAASYLLAQRYIDSVMHLGNSNNSKVVFIPTDLKHSLEGATGGLGTIFSQVK >NZ_CP039734.1|WP_167749343.1|625153_625594_-|NfeD-family-protein MPWYIWGIFGICAIVFEVASPTFFAGFIGVGFFGSAVLSYFQPNSLIWQILIALVGMFVGSFIFKRQKMGDTTSSKLGQSDEFIGIRGKVECDLKEGIQGSVMLSSPVLGSTQWKAVSENGVEIPNAATVKIVATHGTYLVVKQII |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP039734_3 | 623442-623821 | TypeIII |
NA
Consensus repeat of NZ_CP039734_3
|
5 spacers
spacers of NZ_CP039734_3
>3.1|623477|35|NZ_CP039734|CRT TAAGATCTTCCACTGTTAGAAAAAACCATCGAAAA >3.2|623547|35|NZ_CP039734|CRT,CRISPRCasFinder TTAAAAAAGACGGTAAGAGTAAAAGTTTAGCTTTT >3.3|623617|33|NZ_CP039734|CRT,CRISPRCasFinder ATATCTATCTAGAGAATTAATCAATTTCATCGA >3.4|623685|34|NZ_CP039734|CRT,CRISPRCasFinder AAAAAACTAAAGAGGTTAAATTAGAGCGAAATAG >3.5|623754|33|NZ_CP039734|CRT,CRISPRCasFinder CAAAGACAGGAAGGAAAATCGATATACACACAG |
cas6,cas10 |
CRISPR arrays and Neighbor proteins around NZ_CP039734_3
The CRISPR arrays of NZ_CP039734_3 >merge|NZ_CP039734|3|623442-623821|CRT,CRISPRCasFinder GCAAATCCATAAAAATTATTGGGTCAGTATGTAATTAAGATCTTCCACTGTTAGAAAAAACCATCGAAAAGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATTTAAAAAAGACGGTAAGAGTAAAAGTTTAGCTTTTGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATATATCTATCTAGAGAATTAATCAATTTCATCGAGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATAAAAAACTAAAGAGGTTAAATTAGAGCGAAATAGGTTTCAATCCCCTTATAATCGGGTCAGTATGTAATCAAAGACAGGAAGGAAAATCGATATACACACAGGTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT >NZ_CP039734|3|3|623442-623821|CRT GCAAATCCATAAAAATTATTGGGTCAGTATGTAAT TAAGATCTTCCACTGTTAGAAAAAACCATCGAAAA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TTAAAAAAGACGGTAAGAGTAAAAGTTTAGCTTTT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT ATATCTATCTAGAGAATTAATCAATTTCATCGA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT AAAAAACTAAAGAGGTTAAATTAGAGCGAAATAG GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CAAAGACAGGAAGGAAAATCGATATACACACAG GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT >NZ_CP039734|3|3|623512-623821|CRISPRCasFinder GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT TTAAAAAAGACGGTAAGAGTAAAAGTTTAGCTTTT GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT ATATCTATCTAGAGAATTAATCAATTTCATCGA GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT AAAAAACTAAAGAGGTTAAATTAGAGCGAAATAG GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT CAAAGACAGGAAGGAAAATCGATATACACACAG GTTTCAATCCCCTTATAATCGGGTCAGTATGTAAT
>NZ_CP039734.1|WP_167749341.1|622894_623335_-|hypothetical-protein MFNDFIVHINNALGASNSKVIVIALVIFLYVSIGWFINSYGERHYQYPVVSENWNYILFLAACIGVAIFLGILISSGTGLLSGEEVMKLIVFGILVGFIALYVRIYRNTSLLFATFSYIYMVTFSIILITIAFILAMASSRRDDQN >NZ_CP039734.1|WP_167749340.1|622532_622898_-|hypothetical-protein MKKNHFTNILVLVAVFFVFTGCAGKDMKKFGGEFALVNQTGEPISTAVTITLGVTIYGIGALVDSSREEEQKPQQQETIFLTENNQSMNESNTSQTSYFDSGIIHTNNVNHVDVMNGTPPQ >NZ_CP039734.1|WP_167749339.1|621828_622485_+|hypothetical-protein MEFLEKLMQVIVDEGIQTPKAQVERYLSPILGLFLEEILKKTFHKEYQMIVPEFPIRKGTIAKSVGSEQSESNQSTNIDYLMYNQTENKFVFIELKTDSKSFKPSQRKIYEDLKCVAKDKNNIFGQLLYDDLEKILSKSTSKDKYKYLKTKWNDSMSAINDMEIIYIVPAKTGLKEEVGREDENKLCVLYFNDLPVELSLFSEEWKIILEYLKKLDMN >NZ_CP039734.1|WP_167749338.1|621258_621807_+|hypothetical-protein MQETNEKLILTFENVYKENSYRGYQIDDLTDFKSESIDVEMEDIKEDIIANIKAYLNIENNTSSFIYNDIETILEGDFLLDYTIETVDVVFHDHYTYDDYKHDNSLYIDMYEDHCLKIENKYEELDHDISQEQMDVFEDALHEETQEFIDNTSSYIANEGAYFISNYYYTVDVVITLTKSIF >NZ_CP039734.1|WP_167749337.1|620000_621242_+|MBL-fold-metallo-hydrolase MTLTIHKGTNEIGGSCIELSTQSTTILFDYGTPLNLESTKLDFKNKKIDAIVISHPHQDHFGEITMVETTIPIYCGKLSKELMNATKLFTGQGLLANKFHHFEAWKSFQIGDITITPYLVDHSAVDAYAFLVEYDGKKVIYSGDFRANGRKSKLFENMLTQKKLKNADVLLMEGTMLQRNNEEFPTEISVENKIVETLKNTEVITFMIGSSQNIDSLVSAYRACKKAEKIFVIDMYTAWILEKMSSVSASIPTMDWKNVFVLKSYGGSYYEKIKKNRDYFGDFQYRLFSNVILLDDIQKEPSRYYVKISPWHIEKLLKKLDTSSANIIYSQWLGYLKPEFSDKKTVDLFKKLQENHNWVYAHTSGHADLESLKKFSEALSPKALVPIHTEHKDAFCKHFENTVVLEDGMPFTI >NZ_CP039734.1|WP_167749336.1|619148_619403_-|hypothetical-protein MKKVLLALIMVVGCLFASEGQKSVTTCRFEARQIGAFADSPMTCSGDFKKSSTTQELYGDGWRLINSYTQNNNVYLVFEKNKNL >NZ_CP039734.1|WP_167749335.1|618827_619136_-|hypothetical-protein MLSMIRDYLIMVLVKLVIAFVIFGLCLLGIFIAYWVFCDYTRATMQIHEIMINDFTKVVLGFISLLCLFQDVSTGNNNGNAKKFYKKTEKYFEMYDTAGIKW >NZ_CP039734.1|WP_167749334.1|617884_618817_-|hypothetical-protein MKPIVVLPLIAAALFLGGCATTPSQPNPSSLQQATSLKEYIALKRGGETINYNVIAEREHQSIIEYRTYDEAMNVADARDIMEVLDDAKGYCESIKGKSVYGDKAIQALNARPTLLSLDYVSYKSAIREQGLGKYEGFYQCLSPNDGFSIVSMKDNVELHQSAILGNKRDLKETYSRFYLIQHDKAQSLGLKTWLKGTKYAQISSKYTSFEDLLDLEKNSSVFPWRYERITGAQKYCTYHGGELFVSNALTQFKPITMDEYLFMRLETMNPPTINVFMNQETFTCKNSANPAQSFTLIHTDKQLEYKKGE >NZ_CP039734.1|WP_167749333.1|615332_616160_-|CRISPR-system-precrRNA-processing-endoribonuclease-RAMP-protein-Cas6 MLRFIKIDITFSSSFTTSLDFLGSTLRGAFGVALKAVSCLNKTQECQGCFATQNCLYYDFFEIKNRAHAYRFSKPLHENNYNFSLYLFEAACEKLPYVLSALHTMFGRIGVGWNRHLIPIERIVCNGVVVSSGKNFDLSQIVVDVFETKEYFPDITLHFLTPLRIKSNNALLHTTPTLEQILSSIFNRHNELKGIPLAKLPFTPRYKELRTHLRFQELSRYSNRQESTMHLGGMMGTIDYEDVDEHSYALLKLGELLGVGKQTVFGLGEIKVGHQ >NZ_CP039734.1|WP_167749332.1|613884_615276_-|hypothetical-protein MAKYLYGASIQGIQEFIFKTNQLQEIIGASEIVRSLEGEFTAFANVADENILLNAAGNIKALFDDEENLRNVVKNFPKKIMQKAYGITLSQAVVDMTKAEYDTYEKAINALEKKLKTQRNKPSIPLDISINLMELAPKTARPVAKIEKKGNDEIRMDVSTAQKYQANPTNKGQLSDLKNSKGKIAIIHADGNGLGALIPTLSNIPVFSKGLDKATHEAYEEAKKVIQSGKIRKIILGGDDMSVICDANDALAFTRTFLEKFETLTYMHTGHKLTACAGIAFCNEKYPFHYAVSLAEALCGATKKHAKNIDKKLAPSSLMFHNIQSSNFQSWEKFIEDELTIRNDHDVIRCDFGPYYVAHVGQPSIEDLIDVVKVYQQEGSPISRLREWLGELSKSHHYAKDLLDRINENSAQNDKKALDTLNLKLKKVHPALGTQTMIIPKDGFDKTLIYDVLQILSVTEAKA >NZ_CP039734.1|WP_167749342.1|624226_625144_-|SPFH/Band-7/PHB-domain-protein MDSSTGLSLSFFIILLLAGGYLLYQMVRIVPQGEEWIIERLGKFHTVLKPGLNFLIPVFDKVQMRLNTKELIQQMKAQEVITKDNAVVIISAVVFYKISDPAKAVYSIDNFELAVANMAATTLRSVIGNMELDTALSGREIIKSSVSEKISDHLEQWGLSLTAVEVQDIRPSDTLQEAMEKQAAAQREKKALIMKAEGEKQAAITKAEGFKQSLILEAEGKYEASKKEAEAKVALANGDKEAMVVIASQIKVGDAASYLLAQRYIDSVMHLGNSNNSKVVFIPTDLKHSLEGATGGLGTIFSQVK >NZ_CP039734.1|WP_167749343.1|625153_625594_-|NfeD-family-protein MPWYIWGIFGICAIVFEVASPTFFAGFIGVGFFGSAVLSYFQPNSLIWQILIALVGMFVGSFIFKRQKMGDTTSSKLGQSDEFIGIRGKVECDLKEGIQGSVMLSSPVLGSTQWKAVSENGVEIPNAATVKIVATHGTYLVVKQII >NZ_CP039734.1|WP_167749344.1|625620_626019_-|DUF805-domain-containing-protein MFSNYFSELKKTFVYKGRTSRKSFWLFAVMHTILLLSVLGLIFLSDALAKGDTKSIFSTIEAISSILLIPLYLFPTLSITTRRFHDLNMSGWHQLYSFLPYIGGLILFIYMCYKSVDENNRYNIVDESAIAL >NZ_CP039734.1|WP_167749345.1|626152_626815_-|hypothetical-protein MSKKKNIIDCSLLDMHKYSKDAFQYLYDKLTDKGFKRAYVAKDRPINAVTGILWDITQEDDEVTKIIEGMDKIDNIKAESSKSRLEKIANTWIKKAYREYFVNGYKISRSLFVKFTKNIFEPKSEENRKQLEAASKKLFDKTYNTYRNRQHRITNSDKFNELVARYPKLNIDKAYRYAIVSGKFNLNADDIEEFEYILKFALSKKEDSNKKNQYYMLKKV >NZ_CP039734.1|WP_167749346.1|626825_627941_-|site-specific-integrase MARVKSKSNVGVYKEVLDNGDTSFYYTYKDIDGKKRWVKVGLKSNGYSERDAVVQRRKTMIELEDLEEPLYIKKNKFQEIITLNQLATYYFNEKSDMKNHRDAYLKYMHQISPVFGLDNILTLRAEKIQRFKQKLIEKGYRAASVNYYLAQLKAIINYAVFTDRINMVNPCSKVKLLALNNQRKRILSENEIELLLKSLLHKQKAYLFVLIAIFTGARPKAILNLQCKNIDFSFNKITFIAMKNRPSYSVAIHSRLREALQQWVEYLNPEEYLFYRENPMLDKSSHMSYIGIKKQTQPTMDLLFNQGLQVNDRINKVTLYTLRHSFGSLLSKRGANAFVIKELMNHAKIDTTDRYVKVTNQEAKKYIDAIL >NZ_CP039734.1|WP_167749347.1|628487_629750_-|MFS-transporter MSAFYLLKTKRFAPLFTVQFLGAFNDNIFKNTLAILVTFHAGSWTSLPLEVLAPLIGAIFIVPFFLFSGLAGALADKYDKAALTRLVKLLEFGLMSIATLGFYMHWFSLLLVVVFGMGLHSTLFGPIKYAIIPQHMGENELVTANALVESGTFGAILLGTLGGGLIAASAYGGIIAGIMGMAIALIGYVCSRSIPTAPSLNAQMSLSYNIFSQTAQTLKLSYANKTVFLSIIAISWFWLYGALLLSQFPAFVKIVLEGDETTVTLLLSLFTVGIGVGSFLCEKLSHHTIRPSLIILGALGMAFFGIDFALTSSSFVPIERLFSSITFWHILVDLTLIALFGGLFSVPLYAIMQSQSDPSFRSRIIAANNILNALFMVVGAVLTMVLLDASWRIPEVFLSAAIGTGIIATWIAWVVYKRID >NZ_CP039734.1|WP_167749348.1|629746_631882_-|bifunctional-acyl-ACP--phospholipid-O-acyltransferase/long-chain-fatty-acid--ACP-ligase MIKIALRIILRFLFKLRVQGHFTANKNEPMLIIANHQSFLDGLILGVMLPVSPVFIINTQIAKNPFVRFFLMLADHLTVDPSNPMAIKAVIRLVESGRPVVIFPEGRITTTGSLMKIYEGSAFVAVKTNAMVVPVMIQGATFSRLSRMPKTFPHRLFPQITLHYCTPTKLTVSHEGTSHERRLLSGEAMRHLMQECSFDARPANDLFGTFLDAIETYGKNKAIVEDIKQVEYTYAQLLKMALGLGRLLSPITQKDEAVGVLMPTAVASLALVLGLSGMKRVPAMLNFTAGVDGLQSACTAAEIKTIVTSRAFVEQAKLAPKLEALQGVRIVYLEELKTSMRLVDKLWLMLYAIHFPRLVANSQDEKESAVILFTSGSEGKPKGVVLSHEALLANIAQISSIVDFSTEDKMLNALPIFHSFGLTAGSLLPIFRGMHLFMYPSPLHYRVIPEIAYDRSCTILLGTSTFLHNYARHAHPYDFYRVRYVIAGAEKLSENVRELWFEKFGLRIFEGYGATETAPVIAVNTPMAYKKGTVGQILPGIESKLVPVAGIEDGGILHVKGANVMSGYLRAENPGVLEKPTSEAGEGWYNTGDIVSIDEAGFVQIKGRVKRFAKIAGEMISLESVEKLATLTSSGFLHASSSIPDVARGEAIVLFTTDKNLNREALQKSAQSNGYPEIAVPRKIVHLETLPLLGTGKTDYVTLKSMASEIA >NZ_CP039734.1|WP_096047123.1|631883_632192_-|winged-helix-turn-helix-transcriptional-regulator MQTQYAKDLPLEWKPMSAIFMALGDAHRQRILLLFEDGERLNAGQIAEVSPLARTTVSHHLKILHQAEVLLSEKIGKEVWFWVNKPLLEATFSNVLNYLHGN >NZ_CP039734.1|WP_167749349.1|632252_633278_-|amidohydrolase-family-protein MKTIDTHVHLLSSEVSFNRIYDKVAVRFFAKRFGIDANALAQEPYKAYTDALMNSVKHSEHIEKIVLFGVDVKVDDEGNVLHKDKTVCASNEDVAALYAQFPELIIPFFSINPKRPNALELIEKYHALGFKGAKFLQNYWGVDTREARYRPYFEKLAALDLPLIIHVGSESSVHSVKTCESIEMLRQPLEVGVKVICAHMALSYEPRHIFKALSSNPKRFNEDYFTLLEMLKTHDNLYADVSALLTPVRAKVLRHLSTQSDIHPKLLFGSDYPVPFTTVWNSYDIALLKRICIAQERNVFDRYVKAMLVYFPANHPIYNNYRKVLCLEENTLHVKSQTDIF >NZ_CP039734.1|WP_167749350.1|633354_634704_-|triphosphoribosyl-dephospho-CoA-synthase-CitG MIDQPTSLDAILRAKEERAWKQKELLSRHPLASLISLTINIPSLIKLSHEAVVVHEIAHQALLEMIENEGIELLACESKQPSTGAESFFTCKADAKTLKALTCKLENSHPLGRLMDIDVLDLTGNILSRSTLGLSKRRCFICEEEAKLCARAQKHTYMELNAHIKHLVEKHAFAHSIALWCERAMQTEVELTPKPGLVDQANSGAHHDMDIHTFYASIRAIKPFVTQWIETAQIDAHEDAKQSFVRLREIGIACEKAMFEATSNVNTHKGMIFCLAVFCGAMGRLKGCDQRFTCKNLQAQMQALCANLVEDDLLHVKPNSAGARFFYETGSSGIRGIAQSGFAIIFETSLPFFQACKEEEGEAVALKRTLLFLMSLLEDSTLWSRGGMAGLEYAKTKAKALLHVKPNAQNLDIHLKALDEDMISKNLSPGGSADLLAMTWLMAHIVKDF |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP039734_1 | 1.6|599587|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR | 599587-599619 | 33 | HG422566 | Cardinium endosymbiont cBtQ1 of Bemisia tabaci plasmid pCHV, complete sequence | 1588-1620 | 5 | 0.848 |
NZ_CP039734_2 | 2.9|616475|33|NZ_CP039734|CRISPRCasFinder | 616475-616507 | 33 | MF036691 | Serratia phage CBH8, complete genome | 44348-44380 | 6 | 0.818 |
NZ_CP039734_2 | 2.9|616475|33|NZ_CP039734|CRISPRCasFinder | 616475-616507 | 33 | MF036690 | Serratia phage CHI14, complete genome | 44348-44380 | 6 | 0.818 |
NZ_CP039734_2 | 2.9|616475|33|NZ_CP039734|CRISPRCasFinder | 616475-616507 | 33 | MF036692 | Serratia phage X20, complete genome | 44191-44223 | 6 | 0.818 |
NZ_CP039734_1 | 1.1|599243|34|NZ_CP039734|CRISPRCasFinder,CRT | 599243-599276 | 34 | NZ_CP029750 | Staphylococcus aureus strain Smith plasmid pSS41, complete sequence | 10171-10204 | 7 | 0.794 |
NZ_CP039734_1 | 1.6|599587|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR | 599587-599619 | 33 | NZ_CP013332 | Fusobacterium hwasookii ChDC F174 plasmid unnamed1, complete sequence | 27232-27264 | 7 | 0.788 |
NZ_CP039734_1 | 1.6|599587|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR | 599587-599619 | 33 | KX229736 | Campylobacter phage PC5, complete genome | 91578-91610 | 7 | 0.788 |
NZ_CP039734_1 | 1.6|599587|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR | 599587-599619 | 33 | KX879627 | Campylobacter phage vB_CjeM_Los1, complete genome | 7646-7678 | 7 | 0.788 |
NZ_CP039734_2 | 2.1|616337|32|NZ_CP039734|CRT | 616337-616368 | 32 | MN694477 | Marine virus AFVG_250M1149, complete genome | 26205-26236 | 7 | 0.781 |
NZ_CP039734_2 | 2.1|616337|32|NZ_CP039734|CRT | 616337-616368 | 32 | MN693738 | Marine virus AFVG_250M1148, complete genome | 25989-26020 | 7 | 0.781 |
NZ_CP039734_2 | 2.3|616474|34|NZ_CP039734|CRT,PILER-CR | 616474-616507 | 34 | MF036691 | Serratia phage CBH8, complete genome | 44348-44381 | 7 | 0.794 |
NZ_CP039734_2 | 2.3|616474|34|NZ_CP039734|CRT,PILER-CR | 616474-616507 | 34 | MF036690 | Serratia phage CHI14, complete genome | 44348-44381 | 7 | 0.794 |
NZ_CP039734_2 | 2.3|616474|34|NZ_CP039734|CRT,PILER-CR | 616474-616507 | 34 | MF036692 | Serratia phage X20, complete genome | 44191-44224 | 7 | 0.794 |
NZ_CP039734_1 | 1.6|599587|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR | 599587-599619 | 33 | MN693026 | Marine virus AFVG_117M72, complete genome | 10033-10065 | 8 | 0.758 |
NZ_CP039734_1 | 1.6|599587|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR | 599587-599619 | 33 | NC_013168 | Rippkaea orientalis PCC 8802 plasmid pP880204, complete sequence | 5340-5372 | 8 | 0.758 |
NZ_CP039734_2 | 2.12|616682|32|NZ_CP039734|CRISPRCasFinder | 616682-616713 | 32 | NC_019298 | Rahnella sp. WMR121 plasmid pHW121, complete sequence | 740-771 | 8 | 0.75 |
NZ_CP039734_2 | 2.12|616682|32|NZ_CP039734|CRISPRCasFinder | 616682-616713 | 32 | NZ_LN907828 | Erwinia gerundensis isolate E_g_EM595 plasmid pEM01, complete sequence | 23475-23506 | 8 | 0.75 |
NZ_CP039734_1 | 1.8|599724|35|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR | 599724-599758 | 35 | AP014488 | Uncultured Mediterranean phage uvMED isolate uvMED-GF-C31A-MedDCM-OCT-S45-C60, *** SEQUENCING IN PROGRESS ***, 7 ordered pieces | 26553-26587 | 9 | 0.743 |
NZ_CP039734_1 | 1.8|599724|35|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR | 599724-599758 | 35 | AP014487 | Uncultured Mediterranean phage uvMED isolate uvMED-GF-C31A-MedDCM-OCT-S34-C76, *** SEQUENCING IN PROGRESS ***, 2 ordered pieces | 21621-21655 | 9 | 0.743 |
NZ_CP039734_1 | 1.8|599724|35|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR | 599724-599758 | 35 | AP014093 | Uncultured Mediterranean phage uvMED isolate uvMED-GF-C31A-MedDCM-OCT-S29-C126, *** SEQUENCING IN PROGRESS *** | 6592-6626 | 9 | 0.743 |
NZ_CP039734_2 | 2.3|616474|34|NZ_CP039734|CRT,PILER-CR | 616474-616507 | 34 | NZ_CP023318 | Bacillus megaterium strain A plasmid p1, complete sequence | 133390-133423 | 9 | 0.735 |
NZ_CP039734_2 | 2.3|616474|34|NZ_CP039734|CRT,PILER-CR | 616474-616507 | 34 | NZ_CP024036 | Bacillus aryabhattai strain K13 plasmid unnamed1 | 76394-76427 | 9 | 0.735 |
NZ_CP039734_2 | 2.6|616681|33|NZ_CP039734|CRT,PILER-CR | 616681-616713 | 33 | NC_019298 | Rahnella sp. WMR121 plasmid pHW121, complete sequence | 739-771 | 9 | 0.727 |
NZ_CP039734_2 | 2.9|616475|33|NZ_CP039734|CRISPRCasFinder | 616475-616507 | 33 | NZ_CP023318 | Bacillus megaterium strain A plasmid p1, complete sequence | 133390-133422 | 9 | 0.727 |
NZ_CP039734_2 | 2.9|616475|33|NZ_CP039734|CRISPRCasFinder | 616475-616507 | 33 | NZ_CP024036 | Bacillus aryabhattai strain K13 plasmid unnamed1 | 76395-76427 | 9 | 0.727 |
NZ_CP039734_3 | 3.3|623617|33|NZ_CP039734|CRT,CRISPRCasFinder | 623617-623649 | 33 | MN480762 | Streptococcus salivarius strain NU10 plasmid pSsal-NU10, complete sequence | 184756-184788 | 9 | 0.727 |
NZ_CP039734_1 | 1.1|599243|34|NZ_CP039734|CRISPRCasFinder,CRT | 599243-599276 | 34 | NZ_CP039853 | Salinimonas sp. KX18D6 plasmid plas12, complete sequence | 11567-11600 | 10 | 0.706 |
NZ_CP039734_1 | 1.1|599243|34|NZ_CP039734|CRISPRCasFinder,CRT | 599243-599276 | 34 | NC_011311 | Aliivibrio salmonicida LFI1238 plasmid pVSAL840, complete sequence | 11772-11805 | 11 | 0.676 |
1. spacer 1.6|599587|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR matches to HG422566 (Cardinium endosymbiont cBtQ1 of Bemisia tabaci plasmid pCHV, complete sequence) position: , mismatch: 5, identity: 0.848
taaaatttattaaaaggaaat-aaaatgattaaa CRISPR spacer gaaattttattaaaagtaaataaaaatgagtaa- Protospacer *** *********** **** ******* ***
2. spacer 2.9|616475|33|NZ_CP039734|CRISPRCasFinder matches to MF036691 (Serratia phage CBH8, complete genome) position: , mismatch: 6, identity: 0.818
tatcaaggagaataaatgattgtttcatttatg CRISPR spacer tatcaaggtgaataaatgattgttacagataaa Protospacer ******** *************** ** ** .
3. spacer 2.9|616475|33|NZ_CP039734|CRISPRCasFinder matches to MF036690 (Serratia phage CHI14, complete genome) position: , mismatch: 6, identity: 0.818
tatcaaggagaataaatgattgtttcatttatg CRISPR spacer tatcaaggtgaataaatgattgttacagataaa Protospacer ******** *************** ** ** .
4. spacer 2.9|616475|33|NZ_CP039734|CRISPRCasFinder matches to MF036692 (Serratia phage X20, complete genome) position: , mismatch: 6, identity: 0.818
tatcaaggagaataaatgattgtttcatttatg CRISPR spacer tatcaaggtgaataaatgattgttacagataaa Protospacer ******** *************** ** ** .
5. spacer 1.1|599243|34|NZ_CP039734|CRISPRCasFinder,CRT matches to NZ_CP029750 (Staphylococcus aureus strain Smith plasmid pSS41, complete sequence) position: , mismatch: 7, identity: 0.794
caaatcaaaaagtagaaaaagctatggtttatcc CRISPR spacer caaatcaaaaaggagataaagctattgagtctgc Protospacer ************ *** ******** * * * *
6. spacer 1.6|599587|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP013332 (Fusobacterium hwasookii ChDC F174 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.788
taaaatttattaaaaggaaataaaatgattaaa CRISPR spacer aaaaatttattaaaaggaaatacaataaaacca Protospacer ********************* ***.* *
7. spacer 1.6|599587|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR matches to KX229736 (Campylobacter phage PC5, complete genome) position: , mismatch: 7, identity: 0.788
taaaatttattaaaaggaaataaaatgattaaa CRISPR spacer tgaaatttatttaaaggaaagaaaatggcttta Protospacer *.********* ******** ******..* *
8. spacer 1.6|599587|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR matches to KX879627 (Campylobacter phage vB_CjeM_Los1, complete genome) position: , mismatch: 7, identity: 0.788
taaaatttattaaaaggaaataaaatgattaaa CRISPR spacer tgaaatttatttaaaggaaagaaaatggcttta Protospacer *.********* ******** ******..* *
9. spacer 2.1|616337|32|NZ_CP039734|CRT matches to MN694477 (Marine virus AFVG_250M1149, complete genome) position: , mismatch: 7, identity: 0.781
--gtaacagaagttgagcatacaactgcatatta CRISPR spacer tggcgac--aagttgataatacaactgcatattt Protospacer *..** ******* ***************
10. spacer 2.1|616337|32|NZ_CP039734|CRT matches to MN693738 (Marine virus AFVG_250M1148, complete genome) position: , mismatch: 7, identity: 0.781
--gtaacagaagttgagcatacaactgcatatta CRISPR spacer tggcgac--aagttgataatacaactgcatattt Protospacer *..** ******* ***************
11. spacer 2.3|616474|34|NZ_CP039734|CRT,PILER-CR matches to MF036691 (Serratia phage CBH8, complete genome) position: , mismatch: 7, identity: 0.794
ttatcaaggagaataaatgattgtttcatttatg CRISPR spacer ctatcaaggtgaataaatgattgttacagataaa Protospacer .******** *************** ** ** .
12. spacer 2.3|616474|34|NZ_CP039734|CRT,PILER-CR matches to MF036690 (Serratia phage CHI14, complete genome) position: , mismatch: 7, identity: 0.794
ttatcaaggagaataaatgattgtttcatttatg CRISPR spacer ctatcaaggtgaataaatgattgttacagataaa Protospacer .******** *************** ** ** .
13. spacer 2.3|616474|34|NZ_CP039734|CRT,PILER-CR matches to MF036692 (Serratia phage X20, complete genome) position: , mismatch: 7, identity: 0.794
ttatcaaggagaataaatgattgtttcatttatg CRISPR spacer ctatcaaggtgaataaatgattgttacagataaa Protospacer .******** *************** ** ** .
14. spacer 1.6|599587|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR matches to MN693026 (Marine virus AFVG_117M72, complete genome) position: , mismatch: 8, identity: 0.758
taaaatttattaaaaggaaat-aaaatgattaaa CRISPR spacer caaaatttattacaaggaaatcaaagcggacaa- Protospacer .*********** ******** ***..*. .**
15. spacer 1.6|599587|33|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR matches to NC_013168 (Rippkaea orientalis PCC 8802 plasmid pP880204, complete sequence) position: , mismatch: 8, identity: 0.758
taaaatttattaaaaggaaataaaatgattaaa CRISPR spacer gaaaatttattacaagaaaataaaaatttgtaa Protospacer *********** ***.******** * **
16. spacer 2.12|616682|32|NZ_CP039734|CRISPRCasFinder matches to NC_019298 (Rahnella sp. WMR121 plasmid pHW121, complete sequence) position: , mismatch: 8, identity: 0.75
aaaagacgcaagatctgcagaaattttaaaaa CRISPR spacer tatgaaggcaatttctgcagaaattttaaaat Protospacer * ..* **** ******************
17. spacer 2.12|616682|32|NZ_CP039734|CRISPRCasFinder matches to NZ_LN907828 (Erwinia gerundensis isolate E_g_EM595 plasmid pEM01, complete sequence) position: , mismatch: 8, identity: 0.75
aaaagacgcaagatctgcagaaattttaaaaa CRISPR spacer gaagtccgcaggatctgcagtaattttaagca Protospacer .**. ****.********* ********. *
18. spacer 1.8|599724|35|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR matches to AP014488 (Uncultured Mediterranean phage uvMED isolate uvMED-GF-C31A-MedDCM-OCT-S45-C60, *** SEQUENCING IN PROGRESS ***, 7 ordered pieces) position: , mismatch: 9, identity: 0.743
aaagtaaatgtaaaagaagatggtagaaaatctca CRISPR spacer gaattatatgtaaaagaagatggtagtataggaaa Protospacer .** ** ******************* * * *
19. spacer 1.8|599724|35|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR matches to AP014487 (Uncultured Mediterranean phage uvMED isolate uvMED-GF-C31A-MedDCM-OCT-S34-C76, *** SEQUENCING IN PROGRESS ***, 2 ordered pieces) position: , mismatch: 9, identity: 0.743
aaagtaaatgtaaaagaagatggtagaaaatctca CRISPR spacer gaattatatgtaaaagaagatggtagtataggaaa Protospacer .** ** ******************* * * *
20. spacer 1.8|599724|35|NZ_CP039734|CRISPRCasFinder,CRT,PILER-CR matches to AP014093 (Uncultured Mediterranean phage uvMED isolate uvMED-GF-C31A-MedDCM-OCT-S29-C126, *** SEQUENCING IN PROGRESS ***) position: , mismatch: 9, identity: 0.743
aaagtaaatgtaaaagaagatggtagaaaatctca CRISPR spacer gaattatatgtaaaagaagatggtagtataggaaa Protospacer .** ** ******************* * * *
21. spacer 2.3|616474|34|NZ_CP039734|CRT,PILER-CR matches to NZ_CP023318 (Bacillus megaterium strain A plasmid p1, complete sequence) position: , mismatch: 9, identity: 0.735
ttatcaag-----gagaataaatgattgtttcatttatg CRISPR spacer -----aagagtttaagaatgaatgattgtttcctttatc Protospacer *** .*****.************ *****
22. spacer 2.3|616474|34|NZ_CP039734|CRT,PILER-CR matches to NZ_CP024036 (Bacillus aryabhattai strain K13 plasmid unnamed1) position: , mismatch: 9, identity: 0.735
ttatcaag-----gagaataaatgattgtttcatttatg CRISPR spacer -----aagagtttaagaatgaatgattgtttcctttatc Protospacer *** .*****.************ *****
23. spacer 2.6|616681|33|NZ_CP039734|CRT,PILER-CR matches to NC_019298 (Rahnella sp. WMR121 plasmid pHW121, complete sequence) position: , mismatch: 9, identity: 0.727
caaaagacgcaagatctgcagaaattttaaaaa CRISPR spacer atatgaaggcaatttctgcagaaattttaaaat Protospacer * ..* **** ******************
24. spacer 2.9|616475|33|NZ_CP039734|CRISPRCasFinder matches to NZ_CP023318 (Bacillus megaterium strain A plasmid p1, complete sequence) position: , mismatch: 9, identity: 0.727
tatcaagg----agaataaatgattgtttcatttatg CRISPR spacer ----agagtttaagaatgaatgattgtttcctttatc Protospacer *..* *****.************ *****
25. spacer 2.9|616475|33|NZ_CP039734|CRISPRCasFinder matches to NZ_CP024036 (Bacillus aryabhattai strain K13 plasmid unnamed1) position: , mismatch: 9, identity: 0.727
tatcaagg----agaataaatgattgtttcatttatg CRISPR spacer ----agagtttaagaatgaatgattgtttcctttatc Protospacer *..* *****.************ *****
26. spacer 3.3|623617|33|NZ_CP039734|CRT,CRISPRCasFinder matches to MN480762 (Streptococcus salivarius strain NU10 plasmid pSsal-NU10, complete sequence) position: , mismatch: 9, identity: 0.727
atatctatctagagaattaatcaatttcatcga CRISPR spacer aatagtatctagagaatacatcaatttcatatc Protospacer * ************ ***********
27. spacer 1.1|599243|34|NZ_CP039734|CRISPRCasFinder,CRT matches to NZ_CP039853 (Salinimonas sp. KX18D6 plasmid plas12, complete sequence) position: , mismatch: 10, identity: 0.706
caaatcaaaaagtagaaaaagctatggtttatcc CRISPR spacer gaaatcaaaatgaagaaaaagctatggcgagatg Protospacer ********* * **************. . .
28. spacer 1.1|599243|34|NZ_CP039734|CRISPRCasFinder,CRT matches to NC_011311 (Aliivibrio salmonicida LFI1238 plasmid pVSAL840, complete sequence) position: , mismatch: 11, identity: 0.676
caaatcaaaaagtagaaaaagc--tatggtttatcc CRISPR spacer acaaccaaaaagtagaaaaagcgatgcgactcaa-- Protospacer **.***************** *..*..*.*
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
768621 : 780062
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP039734|768621:780062|DBSCAN-SWA ATCAATTAGTTTGGCTTCTAATAAAATATACTATTTTTTCAATCATATACCCAATTGCATCATCGCTCATACCAGGGTATAAACCAATCCAAAAACTATCACTCATGATTTTGTCTGTATTGGGAAGTAAACCTATTACTCTATAATCTTTATCTTTTTCAAGTGACTCAAATAGAGGATGTCGTAGCATATTGCCCGCAAAAAGATTACGCGTTTGGATGTTGTTATTTTCAAGGTATTCGACCAGTTCATTGCGTGAAAATTTCACATTATCTTTTATGGTCATCATAAAGCCAAACCAACTTGGATCGCTCTGCGGTTGTGCTTCTACCAAGATTAATTCTGGAACATCTTTGAATCCATTATAAAGTTTTTTAAAATTCACTTTTCTTTTTTCAACGAAACTTGGAAATTTCTCCAGTTGTGCGCATCCAACAGCAGCTTGCATATCGGACACTTTGAGGTTAAAACCAAAATGGCTATAGACATATTTATGGTCATACCCTTTAGGAAGGCTTCCAAACTGTTGCGTAAAGCGACATCCACAGGTATTATCCACACCACTCTCGCACCAACAGTCTCTTCCCCAATCACGCATGGAAAGCATAATCTTTTTAAGAAGAGGATTGTCTGTGTATGTTGCTCCGCCCTCACCCATTGTCATGTGATGTGGTGGGTAAAAACTACTCGTTCCTATATCGCCCCATGTTCCTGTTGGCTTACCTTCATACGTTGAGCCAAGCGCATCACAATTGTCTTCAATGAGCCATAAATTGTGTTTATCGCAAAACGCTTTCACAACCTTAATGTTAAAAGGATTTCCTAACGTGTGGGCTATCATCACCGCTTTTGTTTTGGAACTCAGTGCTTGTTCTAACTGCGTCACGTCAATGTTAAAGTGTGTCAGTTCCATATCAACAAACACTGGCACGGCACCGTATTGAACAATAGGTGCTACGGTTGTTGGAAAGCCAGCGGCGACCGTGATAACTTCATCGCCACGTTTGACTTGCCGCTCTTTTAGAAGTGGTGAAGTAAGTGCATAAAAAGCCAACAAATTGGCAGAACTGCCACTATTGACCAAGAATGCCCATCTTACATGTAAAAACTTTGCTAACTCTTTTTCAAATTTTTTAGAATAATCCCCATAGGTCAACCAAAAATCAAGCGAACTATCGACAAGGTTCATCATCTCTTTTTCATCAAAGACACGCCCTGCATAATTAACGCGACTCTCTCCCGCAACAAACGGTCTCGTTTGATTTTTTTTATGAACGAGTTCATAATATTGTTTTGTTTTTTCTAAAATCTCTTGCTTGAGTTGTTCTTCTTTGCTCATTATTTGCCTTTCCACATTGTATCTATTGCTTTTAGATACGGATCTTCTTTTATGGTTTGTATATCTAATTTATCTAAATTTTCTTGAAAATAACATCTCATCTTTGCTATCGCATTTTTGTGCGATGTGAACTGAAAATCGCCAAACTCTTTTAAAAGACGCTCATTATTGGACGTGTATTCGTTATTAAGCCCTTCATGAATGACTCTTATATCGGAATAAAAATCACCGACTTCATTCACTAAATTTGCTAATGTAATTAAATCTACTTTTGTGCCTGTCGTGACGTTATAGATTTTCTCTTTAGTATCGTTATAGATAAAAAAGTCGATTATTTTAACCAAATCATCACTGTAAATATAGTCAAAATAGACATTCTTATTGATCGTTATAGGCAAATGCAATAAGTTTTTAACAATGGCATTTGAAATAAACTTATAGCGATAGTTTTCATACTCACCGTATGCTCCAAAAATCCGTAACTGTACAATGTTCTCAGACTTCTCAATAAAATGCGATGTAATGGATTTATAAAATCCATATTCATCTAAGGGCAACGCTTGTAAATAATCATCTTCTTTTGCTTCGATAATGGGCTTTTGCTTACTATACTCGGCACCACTCCCCAAAAGAGATGATTTTCTTTACATTTTTTTCGTGTTTAGCGATATTGAAAAAAATACGTAAATTGTACTCTGTCACATTTTTCATATCAGCAGTATCGCGGCCTCCACCACGATTGGCAGTATGGACAATCACATCAATTTTATTTTTCTAAAATATATCTATCAACTGCTTTTTCATCACTTAAATCAAGCTCTTTACTAGATGGTATAAAAAGTGTGTCATGATGATGCCCGAGAAGATACTCTTTTAGATGAGAGCCAATAAAGCCGCTAGAACCTGTAATGAAAATATTCATAAATTAGTGTTCCAATGGTTTAATAATCATTTGGGCGTTGAACTCTTCTCTTTCTAAAAAAGGATACATATCTTCAATAGGTTTATTGACAATAAGCTTTGGCTCAACCGTTGTCATTTGCTCTTGTAGACAGATATTAATGATTTCTGCCTCGTCTGATAGGAGTGTTTCGTGAATCTTTTGAGAAATCTCCTCTATGGTCGAGATCGTACTGCTTTTAAGTCCATAGGCGTTTGCTATGCTTGCAAAATTTGGGACACTGTAATCTTTTTGTGTGCCAATGTATCTCTGTTCAAAATAAATTTCTTGAAATTGCCTCACCATACCAAGGTTTGCATTATTCATGATAAATACTTTAATGGGTAAATTTCTTCTTTTAATGACTTCTAATTCTTGAATATTCATCTGAAATCCACCATCACCCACAATCACAAGAGCTCTTTTTTTTGTGCCAATCGTTGCACCGATGGCTGTCGGTAAAGCAAAGCCCATGGCACCCATACCGCCAGAAAAAAGAACACGTTGATCTGCTTTGGTCTCAAAGGACTGGGCGACCCACATTTGATGTTGCCCCACATCAACACAAATCATGTCATGATCAGAGGAAGATTGAGCAATCCATTGAATGATTTGATTCGGTACTTTTGCTTTGCCATCAATGCCAGTTGTGGATGAATAGTTTTGCTTATACGTTACCAGTTTTTCAAGCCACTGACCTAAATTAAGTTTAAAATCATAGTGCGATAGTGTGTTGATAAAATCAGCAATATCACAATGCAAAGTGACATCTGCCTTGATTTTAGAGTCTAACTCATTGACATCTATATCAACGTGTATGATTTTTGCTGCTCTTGCAAACGTTTTCAAATCGGTACCCGTTTGTCGTGTATCAAGGCGAGAGCCTAAGACAATAATTAAATCAGCATTGGCTAAAGCTAAATTGCTGTATCGATTCCCGTAAGAGCCAATGAGTCCAAAATTATATTTATAATCGTCCTTAACAACATCTTTTCCCATTAGAGAGTACACAACAGGCAAATTGGATTTTTCTAATAAGCGATGAATGGCTTCTGGTGCCAAGGATGAAAGGCGGCTACCTCCACCGACAAGTATAATAGGACGCTGAGATACCCTCAACATTTCTACAATTTTTTCAAAATCAATCTTATCACCCTGAATACGCATCATTTTATATTCATCACTCTCAAAAAAAGATTTTTGTTCGGATGGATTGAAATCAGTCCGTTGAATATTCATTGGGATATCAATAAGTACAGGACCTTTTCTGCCATTTTGTGTTAAAAAATACGCTTTTTCAAGTTCATAACGAAGGTTTTGAATCATATCAATCATAACCGCATATTTTGTAATGGGCTTAACGATACTGACAATATCGGTTTCTTGAAACCCTATTTGCCTTACAGGCGTATCGTATTTGTATTCATACGTATTGACCTGTCCTGTAATAAAGAGAGTAGAAATTGAGTCAAAAAAGCAACTTCCAATAGGTGTTATAAGATTGGTTGCGCCTGGTCCACTTGTGGCTGTTGCCACACCTGTTTTCCCACTCACACGGGCATATCCCTCAGCGGCAAACCCTGCTCCTTGTTCATGAATGGTATTGACGATTTCAATACCTGTATTTTTATCAAAAGAATCATACAAATGAGCAACTGCACCGCCAATATACCCAAAAGCTTTATCAATGCCTTGATCAACTAAAAATTGCACAATATAATCAGATGCTTTCATTTGATAGTCCATGCTATATTTTTTTCTTTTGCATCAACAATGTAATTTTCCAAATCTTTTTGGCTTAGAATTTCATGGGTTTCATAAAACGCTTTATACCATTTAACGGTCTTCTCAAATGTATCCTCGCTATCCCACACATCTTTCCAATGAAGCTTGATATGGGCTTTTGAGCAGTCTAATTTTAAAAGATTGGCTTCATGAAGCTGGTTTGGATCACGGTTTATTTCATACGCAATCGCATCCCAATGCTTCTTTACATGTAAAACCACTTCTTCGACGCAAATGCTTCCCTCATCACTGGGTCCAAAGTTCCATGCTTCTGCAAATTCAACTCGTTCTTCAAGCAGTTTTTGCCCGACCTGTAAATAGCCACTGAGTGGTTCAAGGACATGTTGCCACGGTCTGGTCGCTTTTGGGTTTCGGATACTTACTTTTTTACCTTGACTGACGGAGAGCATAATATCGCTCATCAATCTATCTTGCGCCCAGTCTCCACCACCAATGACATTTCCAGCTCGACAAGTTGCTAAAAGCGTTTGATGCGATTTTTTGTAGTCGTTTGTATTAAAATAAGAATTGCGATATGAAGTTGCGAGTAAATCTGCACATCCTTTTGAAGCGCTGTAAGGGTCATACCCGCCCATTGGGTCATTCTCTCGGTATCCCCAAATCCACTCTTTATTTTCATAGGCTTTATCACTGGTGATGTTGACAATGGCTTTCACCTGATGTTTGCGACATGCCTCAAAAACTTTAAGGGTACCCATGACATTGGTTTCATAGGTTTCGATAGGATTGGCATAAGAAGGTCTGACCAACGCTTGAGCAGCCAAATGAAAAACAATATCTGGCTTATAGGTTGCAAAAGTTTTATCTAACGTTTCTAAGTCTCGTATATCACCGATGATCGACGCGATATCAAGATCAAGCAATGCTAAATGATTGGGGTGGGTGGGAGCTTCCAAAGAATAGCCCACGACCTTAGCACCCATCTGCTTGAGCCAATACACAAGCCATGAGCCTTTAAAGCCTGTATGTCCTGTGACTAAGACAGTTTTATCTTTGTAGATACCACCAAAAAGATGTTGCATTACCAAACTTTCCATGGCGCTTTGTTCTCTTTCCAGAGTTTATTGAGTTTTTGGTTATCACGAAGTGTATCCATGGGTTGCCAAAAACCATCATGTTTATAAGCAAACATTTCACCATCTTTGGCTAAATTTTGTAAGGGAGCTTGTTCAAAAATACAATTTTCATCTTCAATAAGATAATCAAATACTTTAGGTTCACATACAAAAAAGCCACCATTAACCCAACCCGCTTCTGTCTTTGGTTTCTCTATAAAACTGTTAATATTCATCGTTTCATCAATATCAAGATTACCAAACCGCGCCTCTGGCTGAATTGCAGACATTGTAAGCGCTTTTCCATGCTTTTGATGAAAGGCTACCGTTTTTGCAATATCAATATCACTCACGCCATCTCCATAGGTCAATAAAAAAGACTCGTCTCCTATATACTTTTGAGCACGCTTAATACGACCTCCGGTCATCGTATCAAGGCCGGTATCCACAAGGGTGACTTTCCATGGCTCACTGGTATTATTATGCACTTCCATGCTATTAGTTTGAAGATCAATGGTTATATCACTTTGGTGCAAAAAATAATTTGCAAAATACTCTTTAATGTAATAGCCTTTATAACCAAGAAGCACGACAAATTCATTAAAACCATAATGGGAATAGATCTTCATAATATGCCATAAAATTGGCTTGCCACCAATTTCCACCATCGGTTTTGGTTTTATATCGGTCTCTTCTGCTATTCGTGTTCCATATCCACCTGCGAGTAATACGACTTTCATTGTATCTCCATAAATAACTTATTCATTATTAACGTTTATTTTATACAAACATTCTTCCAAACTATCTTTCCAGTAAGAAATTTCTATGCCAAACTCATTTTTGATTTTTGATTTGTTTAAAAGAGAGTAGTGAGGCCTTGTTGCAGGTGTTGGGTACTGCGATGTTTCAATAGGATTGATTTTACATGTAAGCTTTGCCATCTGCATAATCTCTCGTGCAAAATCATACCAACTGGCAACACCTTCATTAGAATAATTATAAATCTCCGTTTGGGTATTTTTGATTCTTGGCAGAATATCTAAAATCACTTTCGCTAAATCTTTTGCATAAGTAGGACTCCCTACTTGATCGTAAATAACACCCAAAGAGTCTTTTTCTTTACCTAAACGAAGCATCGTTTTGACGAAGTTAGCACCATGACTACTGTAAACCCACGATGTTCGAATAATGACGCTATTGGGGAGCTTACCATGTAAAAGCGCATTCTCTCCACCCAACTTCGTTTTACCATAAACCGATTGAGGATTGGTCTTATCTGTTTCGCCATAAGGTTTATGATTAGTTCCATCAAAAACATAATCCGTTGAGATATGAATCAGTTTGATATGCTTTTCTTTTGCAATGCTTGCTAAATGCTCTACTGCTTGATGATTGATTTTATCGGCTAAAGCTTCTTCACTTTCAGCTTTATCGACTGCGGTATAAGCCGCACAGTTAATAATGGCATGAATAGTATTTTTTTCAACAAATGATTCAATAATGTGTTTATCCGTAATATCAAGTTGATCTTTACATGTAAAGAAAAATGTATAGGGGAAATGGGGAGAAAGTGCTTGTATCTCGCTTCCTAATTGACCGTTTGCACCTGTCACTAAGATATTAAGCATAGTAATTTACGCCATATTCAAAAAGGTCATTGGTCTCTGCAAGTTTAGGTTGCTTGGTATCTTTTGCGGAGAGTTGAAACAAATCTGCACTCATTTGCCAATCAATACCCAGTTTTGGATCATTAAATGAAATTCCTCGGTCACACTCAGGGGCATAGTAGTTATCGACTTTATAGGTAAAGGTACAGGTTTCACTCAGCACTACAAAACCATGAGCAAAGCCTCTTGGAATAAACATCTGTTTTTTATTTTTTCCATTAAGTTCTACAGCTACATACTTTCCAAATGTGGGACTCCCTACACGAATATCTACAGCAACATCCAGAACTCGTCCATCAATGACACGAACTAATTTTGATTGGGCAAAAGGTGCAAGTTGATAATGAAGTCCACGCAAAACACCATGATGACTTCGTGATTCATTATCTTGACAAAAATTCACTTTGAAACCTACAAATGCTTCAAACATATCTTGTCTAAAGGTCTCTACAAAGTACCCTCGCTCATCACCATGTACTTTGGGCTCTATTATGATCACGTCTGGTATAGAGGTAGGCGTAAATATCATCTCTTACTTCTTTCTTGTGTCATTTTTTATTTTTATATCTCTTCTTTTATTCTCATATAGATAGGAAACCGTGGTTTACCGCCTTTAGTCATCTCTTGAAATTTATAAGTAATCTTTGCACCAATGGCAGGAGGATTTTTACGTTCCGCATCACTAAAGCCTGAACCGATGTCAAACATCACGCCATCATCCGTTTTACATGTAAGCGAACCAAAGCTATTTTTGTACTTGCCTTCGCCTTTGTGATGCTCCACCACTTCACACTCTGCATCTTGAAAACTTTTCACTTTCAGGCTATTAGGATCACGCTTAACGACATATTTGGTATCGGGATTGCGCACGACAACACCCTCTCCACCGCCTTTTTCAACTTCTGTAAGAAAACGTTTCAGATCATCATTGTCTTTACATGTAAACTGTTTTGCCACTTTGAGGTAGTCGGCTTGATTGAGTTTAAGATAATACTCCAACTTTACTAAACGCTGGATCAAACCACCATTTTCATTCGGCACATCAAAGGCGTAAAAAGCGATGTTTTTCCATCCATCATGCGGCTCTTTTTTCTTCACAATAGAGATAATAGTTTCAAAATCACCCCGTTTACTCCAAAGCTCTCCATCCACAGCAAAAGGAGGAAATCCTTCGGTAAACCAAGCAGGAGCCGCAAGTTCAACGCCACCGCGTGAGATCAGTTTTTGCCCATCCCAGTACGCCCGCACTCCATCCATTTTTTCACTCATGAGCCACCCTGAAACGTCTTGTCCATCCCATTCTTGTAACAACATCAATTCAGGCTTAGCTCCAAACATTAAACTTGTTAACAGGAGCCAAGAGAGCATTAATCGCTTCATTTATGCCTCTGTTGTGTATTTTCTCAAATACCACTGAATCGTTTTAAGAATGCCGCTTTCAAAGTTTTCTTCTGCTTCCCAACCTAGTTTTGTCTCAATTTTGGTGGCATCAATGGCGTAACGTCTATCATGCCCTGCTCTATCTTCCACAAAGGTAATCTGCTCTTTATACGAAGTGGCTTTTGGTTTTAGGGTATCGAGTATCTCACAAATCTTATGAGCGATATAAAGGTTATCTCGCTCATTGCGTCCACCAATATTATAGGTTTCTCCTGAGTTTCCTTCATGGAAGACCAAATCAATGCCTTTGCAGTGATCTAGCACATAGAGCCAATCACGGATATTTTTTCCATCACCGTAAATGGGGATTTTTTGATTGGACAATGCTTTGCGTATGATCGTAGGAATCAGTTTTTCATCGTGTTGTTTAGGTCCATAGTTGTTGGAACAGTTGGTGATGACCGTATTCATTCCATACGTATGATGATAGGCGCGTACGATCATATCACTGCTTGCTTTGGAAGCTGAATAGGGAGAGTTGGGTGCATAGCTTGTGTTTTCTGTAAAAAGTCCCGTAGCGCCAAGCGTTCCATAGACTTCATCGGTTGAGATATGATGGAAACGACACTCCTCATAGCCCTCTTTGAAGATGAAAGGTTTTTCCATCCAGGTTTTATACGCCACATCAATGAGCGTAAAGGTTCCATTGACATTGGTTTCGATGAACACGCCTGGGTTTTTGATGGAGTTATCCACATGCGACTCTGCTGCAAAGTGAATCACGCCTTGGATATCATATTTTTCAAACAGAGACTCGACTAGAGCCCTATCACAAATATCCCCTTGTACAAAAGTATAACGATCACTTTTTTCTACCTCTTTGAGGTTATCTAAATTACCAGCATAGGTTAAAAGATCCAAATTGATCAGATGGTACTCTGAATACTTTTCCAAAAAATACGGTACAAAGTTAGACCCAATAAACCCTGCACAACCGGTGACTAAAATACATTTTTGACTCATTTAACCAATTCCTTGAGGTATTCTCCATAACCGTTTTTACTCAGAGGTTTTGCAATCTCCAAGACTTGCTCTTTGGTAATCCAGCCATAATTATAAGCGATCTCCTCCAAGCAAGCAATTTTAAAACTCTGTCGCTTTTCAATCGTTTGCACAAACATGCCAGCTTCCAATAAACTGTCATGTGTTCCCGTATCAAGCCAAGCAAAACCACGTCCTAAAACTTCAACTTGTAAATCGCCTCGTTTAAGATAGGCTTCATTTACAGAAGTGATTTCAAGCTCACCTCGATCACTTGGTTTAACCGCTTTGGCGATCTCAATCACACTGTTATCGTAGAAATAAAGTCCCGTTACGGCAAAAGGTGATTTGGGGTTTTTAGGTTTTTCTTCAATGCTGATTGCTTTTTGATTTTCATCAAACTCGACGACACCAAAACGTTGTGGGTCTTTGACCTGATACCCAAAAACAATCGCTCCCGATTTTAATTGAGCCGAGCTTTGAAGCAGTGGTGTAAAGCCTTGACCGTAAAAAATATTATCTCCCAAGATCAAACAAACGCTATCATCGCCGATAAACTCTTCACCTAAAAGAAACGCTTGTGCCAATCCATCAGGACTGGGTTGAATTTTATACGAGAGTGAAATTCCCCAATGAGAGCCATCGCCAAAAAGATCTTCAAATTTGCCAATATCTTGAGGGGTTGAGATAATCAAGATCTCACGAATTCCAGCTAGCATCAAAACAGAGAGCGGATAATAGATCATTGGTTTATCATAAATCGGCAACAACTGTTTGCTAATCGTTTGGGTAACAGGATAAAGTCTGGTTCCAGACCCTCCTGCTAGGATGATGCCTTTCATGTTAGTGAAGCTCTGCTTTAGCGTACGCGATCAGTTCGTTCAGTTTTTTCTCGTACAATTTCGCGTTATCCGCATTGGTCGATTCAAAACGTGTGACTAAAACGGGTGTTGTATTGCTCGCTCGCACCAAGCCCCAACCATCGTTAAAAATGACTCTTACGCCGTCAACATCAACAATCTCTTTAATAGCAGGAAAATCTTTAGGAGGATTTTTCAAAAGCTCTTTGACTTTTTCAACAAGAGGAAACTTATCGCTTTCATTGGTCTCAACTTTAAGTTCTTCAGTAGAATAAACGGTTGGTAATTTTGCGATCTCAGCATCCACATCCAAACCATTTTTAACCATTTCAATCATACGAAGCGTGGCGTAAATCGCATCATCGTAACCAAAATAACGGTCATTAAAGAAAAGATGTCCGCTCACTTCCGCTGCAAAATCGGCGTTCGTTTTAGCGATCATCACTTTAAGGTTACTGTGACCTGTTTTATACATAATCGCATGACCTCGTTTGTTAATCTCATCGTACATCACTTGAGAACATTTCACTTCACCAATGACCGTTGGGTTTTTCATAGCGCTGGAAAACAAAATCGCCATAATGTCGCCTTTGACATTGTTGTGCTTGGTCAAAAACGCCAATCTATCGGCATCCCCATCATACGCAAACCCAAGCGCATACTCGCCTTCAAGCTCTTTTTTAATGTCTTTGAGGTTTTTCTCAACGGTTGGGTCAGGATGGTGGTTAGGAAATGTACCATCGGGGTCTGTAAATAAACCTTTACATGTAAAACCTAAAGCTTTAAATACATCTTGCACCACAACGCCTGCAACGCCGTTTCCACAGTCATAAACAAACGGTTGCTTGAAGCCTTTAAGGTGGTCAAACTCTTTGATCAAAAAGGCGATATAACGCTCTTTCGCATTAATAAACGTTGAGCTTAAGTCATCTTCGATTTCCATCGCGTAATTTTGCATAATCTCGCGACCCAGTGCGTAAATGTCTTCGCCAAAAAAAGGCTTTTTATCGAGGGTTACTTTAAAACCATTGTATTCACTAGGATTGTGAGAACCCGTGATCATAATGGAAGCATTAGGTGTTACGCCATCGAAATTTTGAAAATTACTAAAGTAATTCACCGGTGTCGCAACGAGTCCCATGTTAAGCACGTTACAGCCAGCTTTGTTGAGTCCGCTAGTAAGATACTCACACAAAATCGGTGAATGTGAACGCGCGTCATACCCTATCGCAACATTTTTACCCACTTTAGCAATCTGTTTTCCTAAAAAATACCCAATAAGTTTAACCGTTTGCTCGTTCAGCTCTTTTTCGTAAATACCTCTGATGTCATACTCTCTAAATATCGCTTTCAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP039734|768621:780062|774518_775388_-|WP_096047223.1|DBSCAN-SWA MLNILVTGANGQLGSEIQALSPHFPYTFFFTCKDQLDITDKHIIESFVEKNTIHAIINCAAYTAVDKAESEEALADKINHQAVEHLASIAKEKHIKLIHISTDYVFDGTNHKPYGETDKTNPQSVYGKTKLGGENALLHGKLPNSVIIRTSWVYSSHGANFVKTMLRLGKEKDSLGVIYDQVGSPTYAKDLAKVILDILPRIKNTQTEIYNYSNEGVASWYDFAREIMQMAKLTCKINPIETSQYPTPATRPHYSLLNKSKIKNEFGIEISYWKDSLEECLYKINVNNE >NZ_CP039734|768621:780062|777826_778690_-|WP_167749443.1|DBSCAN-SWA MKGIILAGGSGTRLYPVTQTISKQLLPIYDKPMIYYPLSVLMLAGIREILIISTPQDIGKFEDLFGDGSHWGISLSYKIQPSPDGLAQAFLLGEEFIGDDSVCLILGDNIFYGQGFTPLLQSSAQLKSGAIVFGYQVKDPQRFGVVEFDENQKAISIEEKPKNPKSPFAVTGLYFYDNSVIEIAKAVKPSDRGELEITSVNEAYLKRGDLQVEVLGRGFAWLDTGTHDSLLEAGMFVQTIEKRQSFKIACLEEIAYNYGWITKEQVLEIAKPLSKNGYGEYLKELVK >NZ_CP039734|768621:780062|778691_780062_-|WP_096047228.1|DBSCAN-SWA MKAIFREYDIRGIYEKELNEQTVKLIGYFLGKQIAKVGKNVAIGYDARSHSPILCEYLTSGLNKAGCNVLNMGLVATPVNYFSNFQNFDGVTPNASIMITGSHNPSEYNGFKVTLDKKPFFGEDIYALGREIMQNYAMEIEDDLSSTFINAKERYIAFLIKEFDHLKGFKQPFVYDCGNGVAGVVVQDVFKALGFTCKGLFTDPDGTFPNHHPDPTVEKNLKDIKKELEGEYALGFAYDGDADRLAFLTKHNNVKGDIMAILFSSAMKNPTVIGEVKCSQVMYDEINKRGHAIMYKTGHSNLKVMIAKTNADFAAEVSGHLFFNDRYFGYDDAIYATLRMIEMVKNGLDVDAEIAKLPTVYSTEELKVETNESDKFPLVEKVKELLKNPPKDFPAIKEIVDVDGVRVIFNDGWGLVRASNTTPVLVTRFESTNADNAKLYEKKLNELIAYAKAELH >NZ_CP039734|768621:780062|773726_774500_-|WP_096047222.1|DBSCAN-SWA MKVVLLAGGYGTRIAEETDIKPKPMVEIGGKPILWHIMKIYSHYGFNEFVVLLGYKGYYIKEYFANYFLHQSDITIDLQTNSMEVHNNTSEPWKVTLVDTGLDTMTGGRIKRAQKYIGDESFLLTYGDGVSDIDIAKTVAFHQKHGKALTMSAIQPEARFGNLDIDETMNINSFIEKPKTEAGWVNGGFFVCEPKVFDYLIEDENCIFEQAPLQNLAKDGEMFAYKHDGFWQPMDTLRDNQKLNKLWKENKAPWKVW >NZ_CP039734|768621:780062|772629_773727_-|WP_167750624.1|DBSCAN-SWA MQHLFGGIYKDKTVLVTGHTGFKGSWLVYWLKQMGAKVVGYSLEAPTHPNHLALLDLDIASIIGDIRDLETLDKTFATYKPDIVFHLAAQALVRPSYANPIETYETNVMGTLKVFEACRKHQVKAIVNITSDKAYENKEWIWGYRENDPMGGYDPYSASKGCADLLATSYRNSYFNTNDYKKSHQTLLATCRAGNVIGGGDWAQDRLMSDIMLSVSQGKKVSIRNPKATRPWQHVLEPLSGYLQVGQKLLEERVEFAEAWNFGPSDEGSICVEEVVLHVKKHWDAIAYEINRDPNQLHEANLLKLDCSKAHIKLHWKDVWDSEDTFEKTVKWYKAFYETHEILSQKDLENYIVDAKEKNIAWTIK >NZ_CP039734|768621:780062|768621_769959_-|WP_167749440.1|DBSCAN-SWA MSKEEQLKQEILEKTKQYYELVHKKNQTRPFVAGESRVNYAGRVFDEKEMMNLVDSSLDFWLTYGDYSKKFEKELAKFLHVRWAFLVNSGSSANLLAFYALTSPLLKERQVKRGDEVITVAAGFPTTVAPIVQYGAVPVFVDMELTHFNIDVTQLEQALSSKTKAVMIAHTLGNPFNIKVVKAFCDKHNLWLIEDNCDALGSTYEGKPTGTWGDIGTSSFYPPHHMTMGEGGATYTDNPLLKKIMLSMRDWGRDCWCESGVDNTCGCRFTQQFGSLPKGYDHKYVYSHFGFNLKVSDMQAAVGCAQLEKFPSFVEKRKVNFKKLYNGFKDVPELILVEAQPQSDPSWFGFMMTIKDNVKFSRNELVEYLENNNIQTRNLFAGNMLRHPLFESLEKDKDYRVIGLLPNTDKIMSDSFWIGLYPGMSDDAIGYMIEKIVYFIRSQTN >NZ_CP039734|768621:780062|776807_777830_-|WP_167749442.1|DBSCAN-SWA MSQKCILVTGCAGFIGSNFVPYFLEKYSEYHLINLDLLTYAGNLDNLKEVEKSDRYTFVQGDICDRALVESLFEKYDIQGVIHFAAESHVDNSIKNPGVFIETNVNGTFTLIDVAYKTWMEKPFIFKEGYEECRFHHISTDEVYGTLGATGLFTENTSYAPNSPYSASKASSDMIVRAYHHTYGMNTVITNCSNNYGPKQHDEKLIPTIIRKALSNQKIPIYGDGKNIRDWLYVLDHCKGIDLVFHEGNSGETYNIGGRNERDNLYIAHKICEILDTLKPKATSYKEQITFVEDRAGHDRRYAIDATKIETKLGWEAEENFESGILKTIQWYLRKYTTEA >NZ_CP039734|768621:780062|775988_776807_-|WP_167749441.1|DBSCAN-SWA MKRLMLSWLLLTSLMFGAKPELMLLQEWDGQDVSGWLMSEKMDGVRAYWDGQKLISRGGVELAAPAWFTEGFPPFAVDGELWSKRGDFETIISIVKKKEPHDGWKNIAFYAFDVPNENGGLIQRLVKLEYYLKLNQADYLKVAKQFTCKDNDDLKRFLTEVEKGGGEGVVVRNPDTKYVVKRDPNSLKVKSFQDAECEVVEHHKGEGKYKNSFGSLTCKTDDGVMFDIGSGFSDAERKNPPAIGAKITYKFQEMTKGGKPRFPIYMRIKEEI >NZ_CP039734|768621:780062|775380_775956_-|WP_096047224.1|DBSCAN-SWA MIFTPTSIPDVIIIEPKVHGDERGYFVETFRQDMFEAFVGFKVNFCQDNESRSHHGVLRGLHYQLAPFAQSKLVRVIDGRVLDVAVDIRVGSPTFGKYVAVELNGKNKKQMFIPRGFAHGFVVLSETCTFTYKVDNYYAPECDRGISFNDPKLGIDWQMSADLFQLSAKDTKQPKLAETNDLFEYGVNYYA >NZ_CP039734|768621:780062|770884_772633_-|WP_096047221.1|DBSCAN-SWA MKASDYIVQFLVDQGIDKAFGYIGGAVAHLYDSFDKNTGIEIVNTIHEQGAGFAAEGYARVSGKTGVATATSGPGATNLITPIGSCFFDSISTLFITGQVNTYEYKYDTPVRQIGFQETDIVSIVKPITKYAVMIDMIQNLRYELEKAYFLTQNGRKGPVLIDIPMNIQRTDFNPSEQKSFFESDEYKMMRIQGDKIDFEKIVEMLRVSQRPIILVGGGSRLSSLAPEAIHRLLEKSNLPVVYSLMGKDVVKDDYKYNFGLIGSYGNRYSNLALANADLIIVLGSRLDTRQTGTDLKTFARAAKIIHVDIDVNELDSKIKADVTLHCDIADFINTLSHYDFKLNLGQWLEKLVTYKQNYSSTTGIDGKAKVPNQIIQWIAQSSSDHDMICVDVGQHQMWVAQSFETKADQRVLFSGGMGAMGFALPTAIGATIGTKKRALVIVGDGGFQMNIQELEVIKRRNLPIKVFIMNNANLGMVRQFQEIYFEQRYIGTQKDYSVPNFASIANAYGLKSSTISTIEEISQKIHETLLSDEAEIINICLQEQMTTVEPKLIVNKPIEDMYPFLEREEFNAQMIIKPLEH |
10 | Escherichia_phage(22.22%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
891162 : 898814
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP039734|891162:898814|DBSCAN-SWA GTCACCTACCTTTCGCGTAGGTCTCCCATCCTGCTTTTCGTAGCACACATGCAGGACATTCGCCACAACCATAGCCCCATGCATGCTTTTTATCGTGGACACCATTATAGCATGTATGTGACTCATCGATGACGAGTTCTAAGACACCCTCTTGGTGTGAAAGTTCAAAGGTTTCAGCTTTTGTGAGGTGCATAAGCGGGTAATGGAAAGTGATGTTGGCTTCGCTTCCAAGATTAAGGGCAAGTTCAAGCGCCTTTACAAAAGGCTCTCTACAATCAGGATACCCTGAATAATCGGTCTGATTGACTCCAATGATGATATGCTCAATACCTTGTTTTTGCGCAAATGCGTGCGCAAGTGTAAAAAAGATCGCGTTGCGATTGGGAACAAAGGAAGCAGGGAGATTGGTATGCGTTCTATGATGTGCGCCAATATCTTGCGTACCATCAATTAAAGCCGAATCATTCAGTTGTGAAAAGGCATCTAAGCTGAGCAGTGTATTTTTTACATGTAACGCTTTGGCGATCTTCTCAGCTTGCGCAATTTCCACGCGATGCTTTTGGCCATAATCAAACGTAATGCTTTCAACACTATCAAAACGATTCTTTGCCCAACCCAGACAAGTCGTACTGTCTTGTCCACCACTAAAGACCACTAATGCTTTGGAAGCAATTTTTCCCATTATTCCACTCCTAAAAACTTGTGCAACTGCACACTCAAGATAAATTGTGGATGTTCTTTGACAAACTCCACACAAAAAGCAACATTTTCTTTGTTGGGTCGATCCATCTCATTTTGAGGCTGAATGAAGATGGGTTTGTAGCTTGAAAATTCAAGTATTTTTTCAACTGCAGACTCTTTGCTCACCACAAATTTAATCTCATCGTAGCCATATTTTTCAATTTTATCCCAATCTTTTGGGCTATAGGTCACCCAGTTAGCACTCGCAATGTTGGAAAAATTATAGCCATTCGTTTCGACAGAAACAGCATACATATACGCTTGCAAATAATCAATAAAACCATTAAGATTATAAATACTCGGCTCGCCACCTGTAATGATGACATTCATGGCAGGATAAGGCTTAATACGTGTTAACACCTCATCAAAACTAAGACTTTCATATTCTCCTTTGTGCAAGAACTCATCACAAAAGGAACAGGTGAGATTGCACCCATACAGTCGAATAAAAATAGAAGGAACGCCCACATGTGCACCTTCTCCTTGAATAGAATAAAATATATCGACGACTTTTAGCATGCCACCCTTTTCCTTAATAATGTTTCAAAACATTATTATCTATTGTGTAACATTATAATATGAAATGGATTAGAAGATTATTTGAAAAAGTGCCTCTTGTTTTAAGATTAACAAGAGGCGTGTAGGTTTACATGTAAGCTTTTTAGTGATGTGATTTGATAATTAGAATGGTCATTCGGATGTTGATCATAATAAATTTTATCACTTGCCAAATCACACAGGTTCTTCTAAAAAGTGTAAACTTTGAAGGAATCGGTGTAAAAGCATTTTGTGAATACGGTGTTAATTTCATACTATTCTCCTTTGTTTTCATTTTTTGTCACGCCTTTAGAGCAAAGCTCTAAAGACTACGTTAACACTGAGTTTTGCTTGGCGCAATGCCCGTGCATCTTCGATGCTTTTAAGTTCTTATGGAACTTTGTCTCGCACTCAAAGCAAAGCTTCTCATGCTACGCTAACGCACTTTATTCTGGGATAATGGACCAAAAAGGTTTCATTTTGGCTTTCCAGATAAATGCATGGTGTAAAAGATGCTTAACCCAATGGGCAGCAAGACCAATTTCACCAAAAGTACCGCCAAAATCTCTACCATAATCAGGATATTTTTTATAATTCGGAATAATCGGATACATGGTCATCGCAACAGCGGTTCCTGAGAACAACCCTTTACCAGCACTCGCAACACACGCTGCACCCATTTCCGCCATACACGCGGTATGGGTTGGTGCACTCGCACCCCCTATCATATCAACGATGCTTGCAGCAACCGCTTTTCCCATCATCGCACTAGGCATTCCTGTTCGTGGAGGTGTTGGATTGATCGGTGTGCCATTAGGAGATTTCATCGGTTTAGAGATGATATGCGGTGGTGCAAAGGCAATACCTGCCGCAAAAATATTTTTATATTTAGGATTTTGACATGTTTTTGGCCAATCAGACGCATCCCATGTTTCATACGGAGCTGTTGCTTTTGAGTAATCCCCATCAACAAACATAAAACCACCCGCATTGAACATAGAAGAGGTAATGTCTTCTCCATTTTTATCGTATGCTTTCATACCAGCCCCTGCAAATGGAGGAATGAGCATAGAAAAGTCATACGTCTCTTCTTTCATTTCGCCTGCAAGATTTTCGTAATGGACTTTGTCTTTTTCAACTTTATTGACATGCGCTCTCGTGATCCATTTGACATCACGCTCGGCGTATAAAGATTCTGCAAAGACTTTGCCACTGGTGATGTAACCACCCATTTTAAGGTGCATACCGCCTACACCAAAGTCACCGAGTTCATATTCATTAGAGATATAAGTGATGGTCGCTTTGTCACGAACACCCTCAGCACGCAAGCGATGATCGACGTTAAACGTGTACTCAAACGCTGCACCTTGGCAGGTACACATTCCATGACCTGTACCAATCAAAATCTTTTTCTCTTCACCTTTTTTTAATGCTTCGATAATCTTTTCAAGCTCATCTGCGGCATGTTTAGCATGTGAAGCCGTACAAACGGAAACAGTAAAGCCATTGTCCGGTCCTAAACCTTCTGTTGCTGCAAAATTGAGCCGTGGTCCTGTGGCATTGATGAGATAATCATACGTGATTTCCTCTTTTTCGCCCTCTTTATTTGCACCAGTGTGTTCTATGGTAACAAAATTTTTACTATTTTCCACTGTGCCCTCAGGGTGAATAGTAAGTGCTTTTGCTTGAATATAGGTAATGCCCGCTTTTGCATAAATGGGTGCCAAATCAAACAGAACTTCCTCTTGATCCATTTGACCAACACCTACCCAAATGTTCGATGGAATCCAATTCCATTTGCTATTAGGGGTTACAACCACAACTTCATGAGTTCGATTGAGCCATTTTCGCGCAAACTGTGCGGCGGTATGCCCTGCAACACCACCACCCAAAATAACAACTTTAGCCATCAATTATCCTTATAAAAAATGTAAAACTTTTTGATTGTATAACAAGAAAAGATAAAATAAAATTGTACTATTACAAATCCGAAGTTATTTAAGGTTTAAAAAAGGTTTACTTTTTTTGACAGCTATATTTTGGTATCATTCCTAAGATATTTTTCGATGTAAAAGGGTATAACCAATGTTATGGCAAATCAGTAAAGAATTCGATTTTTGTTATGGGCACCGCGTTTGGTCACAAGAGCTGGATGCTGAATTTTCCCTCAGTGGGTGCTTAGCGTGCCGTCACTTGCATGGACATCAAGGCAAAATTATCGTTTTTTTACAAAGTAATGAACTCAAAAATGGTATGGTGACAGACTTTCATCATCTGAACTGGTTTAAACTCTTTTTAGACAATACCTTAGATCATAAATTCATCATTGACATTCACGACCCACTCTTTGCGACCTTATTGCCTCATTTTGCTGACAAACAGAATTTACTTTCCCATGAAAGTGGTTATAAAACACCCGATCTCTCTTTTATAGCGCATGAGCCAAATTATCTTGTAGAAATGTATGAGGGGTATATTATCGTTGATTTTGTGCCAACCAGTGAGAACATCTCAACGTGGCTGCTTCAAATCATTGCGAAAAAGATGAGTCGCTTGGGAGTGGAAGTCTCGCATGTAGAATTTTTAGAAACCCCAAAAAGCCGAAGTATCGTTTACAACCATTAGCGTTCTACCCGAACGCTGAACGTAGCCTCTAGAGCAAAGCTCTAAAGACTACGTTAACACTGGGTTTTCTTTGAAGTCAAGCCCGCGCACTTTCAGTGCTTTTAAGCCTTTGGATAGACCTTTTCGAGTTTTTTATAGAGCGCATCCACAGTGGGAGCAGACTCGCCATGAAGCAATGAAAGCATCACTTCGTTATCTTCTTTACCATTCCACACTTTGATGTAATGTCCCTCTTTGTAGCCATGATCTTGGCGGAATTGGTTAAGCACGTTTTTAGCCACATAGAATTTGTACAGCACTTTAAGGTTGATGCCACATTTTCTTGAAAGTGTAAAATACTCTTTCAACAGTCCATCAAAAAGATCCATTTCAAAACCTGTCGTATCGTGAATGATGCTCTCGATGTCATTGACCAGTTCCATTGGTACAGCCATACCGATAGAGAGTGGTTCTTTGGTAAATGCTTCAAATCCCTGAACATCCAACACATCTAAAACAAGTTGTTCAATATCGCCTCTGTTGTTCGTTTTATAATCTTCCAATAACAGGCTCATAATAAAATGCCAAATATCAACGATTTCAACGGTGACATTGTCCCAATCGGTCGGTTTATTAATGTTTTTCCAATGTTTCCAACTAAAACTATCAATGAGTTCCGCACACTCCATATAAATACATCGCTTCCAGTTAATCATGCGGTTATGTTTGGTATAGCCATTTTCCCAGCCAATACCATTGGTTTCGTCGTTCAGTTTTTGCTGTAACGCAAACATTTGCGTTAAATAATCTTTACTTGTCATTCTATTCCTTCGATCTTAAAAGGGTTTCATTATACTTTTTTGCGCCATAAAAGTCAAAATGGATTCACTGCCTTTGCGATATCCAAGCTCAAAACAAGCATCTATTTGTCTAAAAGTAAACAATCCAAAATCATTGAGTGTGGGATCGCTAATGCACAGATCACTCTGCTCTATCTGCTGTTTCGATGAAGCCAAAATGGAGAGATAAATAGCGCGCTCAAAACTGGAAAAAAAGTTACTTTTTTGCTTTACATGTAAAGGAAACAAATTAACACTCACCACAGGATAAGGAAGCTCTAAAAGTGGTGCTACAGGCAGGTTATCCATAAAACCACCATCAATAAGCGTGTAATTATCATAAGTGATGGGGCGAAAGATAGGAATGAGCGCGGAGCTTGCGATGGCAAGGTTAATTGTATTGCCATGGCTAAAGCGGACGATTTCGCCATGAGGGAGATCAACACATGTCATAAAAGTAGGAATGCTCATCTGCTCTAAACGCTCAATAGGGGCGATCTCTTTCAGAATAGCTGCTTTTTCATTGATGCGAAGAAGTCCCTTGCGAAAATAGTTAAAATGAAACACTTTGCGAAAGGCTCTGCTTTTAACGATACGCAACAGATCAAAAGCACTGACACCTGAACCCACACCGGCGGCAATCACCGAACCAATGCTACAGCCTGAAACAGCGGCTATCTCCACATTGTTTCGCTCCATTGCAGCAATGACACCCAGATGAAATGCCCCCCTTGCCGCACCGCCCGAAAGGGCTAAGGATATTTTCAATGTCCCAGCCATCGTATAATCCTAAATTCACCCTCTAAGGTTTCGACTATTGCGGTGCAAGACTCAACCCAATCGCCACAATTCAGGTACTTAATCCCCTCAATATCGCGAATCTCCGCCTTATGAATATGCCCGCAAATCACCCCATCGTAGTTATTGCGTTTGGCATGCTCGCTGAGAATATGCTCAAAATCGGTGATAAAAGAGATGGAGCTTTTGACATTGTCTTTGACGTATTTGGAGAGTGACCAGTGGCTGTGATAGCGCATTTTCTTGCGAAACCAACCAATAAGTTGGTTAACATTGAGCAAAAGGTCGTACCCCAAATCACCCAAGATGGCAAGCCACCGTTTGGTCATCGTAACCGAGTCAAAAAAGTCACCGTGCGTAATGAAAAATCGCTCGTTGTTGAGACTTGTATAATCCACTTCATCGACAACCGCGATGCGATCGCCCAACCCTAACGGCAAAAAGGAACGTAAAAAGTCATCGTGATTGCCGGTGATGTAAAAAACATTCGTCCCTTTTCGGGCTTTTCGTAAAATCTTTTGAATCACATCGGAGTGCGACTGTGCCCATTTGATTTTACGCTTAATCGCCCAACCATCGATCACGTCACCCACTAAATAGAGATTTTCACTGTTTGTAAATTTTAAAAAATCCAAAAGCTCCTCCGCTTGTGAGAAACGCGTTCCCAAGTGAAGGTCGGAGATAAAGATCGAGCGAAAGGCTATCGGGTCTTCATCCCCTGAAAATTCGTATTTGGTCATTTGTGTTTGTCGCCGTCCCCAAAAATATCATCCGCAAAGTGTGAGAACTTACCTCGTACCGATTTATCCAACTCTATCGCAAAGAGTGAGATGTGTGTCTGCTCTTTTTTGGTCTGTTTGAGCAGTGAGGTCAAACCGTTGTTAAAGCGGTTTAAGTACATGTTATCAATCTGTCGGTTTGAGCCAATCACAATCGCTTTACAGTTATTATCCAGACGGGACAAGATGAGCTGCGTGGTATTGTTGGAGCTGTTTTGCCACTCATCTAAAATCACAATGGCATTGGAAAGCGTTCGTCCTCTCGCCTCACCCGGCCAGAGTTTTTCGATATTGTACTTGGTGATCATCTCTTGTACTTTTTTCTCAATCGCCACTTGCGGTTCTTGGGAGTTGTCACGTTTTTTCATCTTCTTTTTAGCAATAAATTCCAATGTGTCATAGAGCGCCATATTGTAAATCCTAAACTTCTCATCATTGCCAGAGAGGTACCCGACATCCGCACCCTTGTCAACACTTTCAATGGAATTGCGCACATAGACGATCTTCTCGTAACTCCCTTTAGTGACCAAGCGCATCGCCGCAACAAAAGCCATCAAGGTTTTACCGCTACCTGCGCGTGCATCGATGACATGAATGTCGTACATATTGGAAAGCAGTGCTTTCATAAAAAATTTTTGTTTGAGGTTGATCGGTTTAATTTCCAAGCCTCTAAAGTCCAACTCTTCATTGAGCATATGAATAATGCCACCTGGAGAGATAATAGCATGTTCTGAATTGCCATCAGAACTGATAAATTCGTAGCAGTAGTTTCCACTTTTATATTCAGGGTCATACTCGCTGATCAGCTTTTTATCGAGTGTGTTGAAAAGGGCTGAATCAACAGGAAGCTGTTTAACAAACTCAAACTCTGGCACATCGCTTCGATCTTCTCTCAAAGACTCTACGGTGACATTGTAAAAAAGTCCAAACATACGCGCATACACATCTAATGAGAGAAAAATAACTTTATAATCAGGATAATACTCTTGAGCACCCGCAGCCACTTCAAGGATACGTTTGTCGTTAGATTCATTAATAAATTGCGAGTCTATATCGGACGTGTAGATCGTCTTGGAAAAAAGATGGATGGAAATATCTTCTTCGTACTGCATCTTGACAACGCGGAAGGGCTCCCCACTGTCCACTTCAATGACTTTACACGAAGCCAGCATCCTAGCAAAGCTTCTGGCTTGGTACCCAAGTTCTGTGAAGTTCTTTTTAAAATCTTCCAGCTCTATAAGCACCGTTTCTGGGATCACAATGATGTTGTTGCCGCCATCACACAACTCTTTGATAAAATTGGTATTGTGCAAGATAATATTGGTATCGAGCACATAAACTTTGTTGACTGACAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP039734|891162:898814|895165_895864_-|WP_087439311.1|DBSCAN-SWA MTSKDYLTQMFALQQKLNDETNGIGWENGYTKHNRMINWKRCIYMECAELIDSFSWKHWKNINKPTDWDNVTVEIVDIWHFIMSLLLEDYKTNNRGDIEQLVLDVLDVQGFEAFTKEPLSIGMAVPMELVNDIESIIHDTTGFEMDLFDGLLKEYFTLSRKCGINLKVLYKFYVAKNVLNQFRQDHGYKEGHYIKVWNGKEDNEVMLSLLHGESAPTVDALYKKLEKVYPKA >NZ_CP039734|891162:898814|892563_892713_-|WP_162494884.1|DBSCAN-SWA MKLTPYSQNAFTPIPSKFTLFRRTCVIWQVIKFIMINIRMTILIIKSHH >NZ_CP039734|891162:898814|891162_891843_-|WP_096047310.1|DBSCAN-SWA MGKIASKALVVFSGGQDSTTCLGWAKNRFDSVESITFDYGQKHRVEIAQAEKIAKALHVKNTLLSLDAFSQLNDSALIDGTQDIGAHHRTHTNLPASFVPNRNAIFFTLAHAFAQKQGIEHIIIGVNQTDYSGYPDCREPFVKALELALNLGSEANITFHYPLMHLTKAETFELSHQEGVLELVIDESHTCYNGVHDKKHAWGYGCGECPACVLRKAGWETYAKGR >NZ_CP039734|891162:898814|895879_896662_-|WP_096047311.1|DBSCAN-SWA MAGTLKISLALSGGAARGAFHLGVIAAMERNNVEIAAVSGCSIGSVIAAGVGSGVSAFDLLRIVKSRAFRKVFHFNYFRKGLLRINEKAAILKEIAPIERLEQMSIPTFMTCVDLPHGEIVRFSHGNTINLAIASSALIPIFRPITYDNYTLIDGGFMDNLPVAPLLELPYPVVSVNLFPLHVKQKSNFFSSFERAIYLSILASSKQQIEQSDLCISDPTLNDFGLFTFRQIDACFELGYRKGSESILTFMAQKSIMKPF >NZ_CP039734|891162:898814|891842_892421_-|WP_087439308.1|DBSCAN-SWA MLKVVDIFYSIQGEGAHVGVPSIFIRLYGCNLTCSFCDEFLHKGEYESLSFDEVLTRIKPYPAMNVIITGGEPSIYNLNGFIDYLQAYMYAVSVETNGYNFSNIASANWVTYSPKDWDKIEKYGYDEIKFVVSKESAVEKILEFSSYKPIFIQPQNEMDRPNKENVAFCVEFVKEHPQFILSVQLHKFLGVE >NZ_CP039734|891162:898814|894524_895064_+|WP_087439310.1|DBSCAN-SWA MLWQISKEFDFCYGHRVWSQELDAEFSLSGCLACRHLHGHQGKIIVFLQSNELKNGMVTDFHHLNWFKLFLDNTLDHKFIIDIHDPLFATLLPHFADKQNLLSHESGYKTPDLSFIAHEPNYLVEMYEGYIIVDFVPTSENISTWLLQIIAKKMSRLGVEVSHVEFLETPKSRSIVYNH >NZ_CP039734|891162:898814|896646_897420_-|WP_096047312.1|DBSCAN-SWA MTKYEFSGDEDPIAFRSIFISDLHLGTRFSQAEELLDFLKFTNSENLYLVGDVIDGWAIKRKIKWAQSHSDVIQKILRKARKGTNVFYITGNHDDFLRSFLPLGLGDRIAVVDEVDYTSLNNERFFITHGDFFDSVTMTKRWLAILGDLGYDLLLNVNQLIGWFRKKMRYHSHWSLSKYVKDNVKSSISFITDFEHILSEHAKRNNYDGVICGHIHKAEIRDIEGIKYLNCGDWVESCTAIVETLEGEFRIIRWLGH >NZ_CP039734|891162:898814|897416_898814_-|WP_096047313.1|DBSCAN-SWA MSVNKVYVLDTNIILHNTNFIKELCDGGNNIIVIPETVLIELEDFKKNFTELGYQARSFARMLASCKVIEVDSGEPFRVVKMQYEEDISIHLFSKTIYTSDIDSQFINESNDKRILEVAAGAQEYYPDYKVIFLSLDVYARMFGLFYNVTVESLREDRSDVPEFEFVKQLPVDSALFNTLDKKLISEYDPEYKSGNYCYEFISSDGNSEHAIISPGGIIHMLNEELDFRGLEIKPINLKQKFFMKALLSNMYDIHVIDARAGSGKTLMAFVAAMRLVTKGSYEKIVYVRNSIESVDKGADVGYLSGNDEKFRIYNMALYDTLEFIAKKKMKKRDNSQEPQVAIEKKVQEMITKYNIEKLWPGEARGRTLSNAIVILDEWQNSSNNTTQLILSRLDNNCKAIVIGSNRQIDNMYLNRFNNGLTSLLKQTKKEQTHISLFAIELDKSVRGKFSHFADDIFGDGDKHK >NZ_CP039734|891162:898814|892885_894349_-|WP_167749514.1|DBSCAN-SWA MAKVVILGGGVAGHTAAQFARKWLNRTHEVVVVTPNSKWNWIPSNIWVGVGQMDQEEVLFDLAPIYAKAGITYIQAKALTIHPEGTVENSKNFVTIEHTGANKEGEKEEITYDYLINATGPRLNFAATEGLGPDNGFTVSVCTASHAKHAADELEKIIEALKKGEEKKILIGTGHGMCTCQGAAFEYTFNVDHRLRAEGVRDKATITYISNEYELGDFGVGGMHLKMGGYITSGKVFAESLYAERDVKWITRAHVNKVEKDKVHYENLAGEMKEETYDFSMLIPPFAGAGMKAYDKNGEDITSSMFNAGGFMFVDGDYSKATAPYETWDASDWPKTCQNPKYKNIFAAGIAFAPPHIISKPMKSPNGTPINPTPPRTGMPSAMMGKAVAASIVDMIGGASAPTHTACMAEMGAACVASAGKGLFSGTAVAMTMYPIIPNYKKYPDYGRDFGGTFGEIGLAAHWVKHLLHHAFIWKAKMKPFWSIIPE |
9 | Campylobacter_virus(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
2670786 : 2729793
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP039734|2670786:2729793|DBSCAN-SWA CTCAATCAATCTTAAAGCCATTGTGAAGAGATCCTAGAGCTCCTTTTGATAAAGCATCAATATAATTAGCTGATTTTTCAAGATCATGAGTTACATAAATTGAGGAAGTAGTGTTGACAGAAGAGTGATTTAAACATCTTGAAACTTCATAAATCGGCATTTTGAATTCATTTATATTTATCGAAGCAAAAGAGTGTCTCAAGGTATAAGGAACAACAGAACGTTCTCTTTGAATCTTATTACCGTATTGATCAAAATTGAAAAGTTGATTTAAAACCCTAGAAATCGAAGCTGGAAATTTTGAAAGCGCTTTATCTGTTGTCTCATGGAAAAATAGATAATCATTTTGATTTCTGATTCGACCGTCCAGATATTCGTTCACTGCTCGTTGACACTCTTCATTAAAAATGCTTTCATAATACATTTTTCTTTTGAAATCAAAAAGATTAATGCGATTGGTAGAAAAATAAAGATCTTTTATCCTTAATTTGATAACAGAATCAGGTCTTGCGCCTGTTAAAAGACAAATGAGGCAAATAACATAATCTCTTTTATTCTTCTTTTGTTTCAACACTTCAAAAAGTGTATTACATTCGTGAAGAGTTAAATATCGTGTTCTCGGTTTAAATTCCTTTTTCAAAGGAATCAGCTTCTTATCTATTTTAAGAAAAGGGTTTTCACTCTTTGGATAATTAAAATGTTTAGTAACTGTCAGCATAATGTCAATGTTTAGTTTCATTGATTTGGGCATATAAGCTTCACCATTATTCTTTTTTGCGGTCAAGATTACTTTATCAATAACTTCAGGAGTAATTCGAGAGAGTGGAAAGTTAATATACTTCTTAAAATTTTCATTGTTGAGCCAGAAATTTAGCCTGTTCCTATCTTTTCTTAAATTTATAGCTGTTTTATATCTTTTGTTCTTAACATCAACATCTTTTTCCTTAGAAACTTCGACATCTTTATCTGCAAGATATTTTTCACACATAACTTTTAAAGATGTTATCTTGTTTTTTCTTGCGATTTCAAGTTCGTCTTTAGAGGTTGCACTTCTGAAATTACGGGCTATGTCATATTTAATATTTCTTAAAATTTCAGCTGCTTTTGATTCGGTCATTCCACTTGATTTTCGACCAACATTTTCTTGCTTAACATTTTTACCAAGCTTGTATTTTATGATGTAAGAAATATCATCATTTTTTAGTTTTTGGTAATAAACACTAGGCGTCTTTGTTTTGATAAGTTCAGCCATTTGAAAACTCCTTTAAAAAAAAGAAAGTATATCAAAAAAAAGCACAACTAAAAGCACAACCTTTTTGTCTAAAGATGTCTTTAAGTGTCGTTGTTTTGATGAATTCTTAATTGAAAGATTATTCCAAGATGCGTTAAAAAACCCTAAAATAGGGATCTCCATTCAATAAAGGACACTTTAAGAATGGAGCGGGTGACGGGAATCGGACCCGCTACTTTCAGCTTGGGAAGCTAACACTCTACCAATGAGTTACACCCGCATTTTAGAGGTGACATTATAGCATATCAAACTGATGTATACTTTTACATGTAAAAATTTTAGTTTGAAAGTGGAGTATAATGTTTGAGCATTAAATGGCAAAGGAGCTAAAATGACACACGGTATAAACGTTAAAAAGGCATTGAAGCAGCAACAAAATGAACTCGATGATTTTACAATTTACAGTATGCTCTCCAAGTCTGATAAAAATGGAGCCAATCAAACCATTTTTCACAAAATTGCTGAAGAAGAGAAAAGACATTACCTCTATCTCAAAACCTATACGAATCAAGAACAACGACCTCGTCCTCACGTTGTTTTCTTTTATCTTTTACTTTCCAAAATTGTGGGTATCTCTTTTACGCTCAAATTTTTAGAGAAACGTGAAGAGGGTGCTAAAGCATTTTATCAAGAGCTTATCGCCATCGATCCTAAAGCAGAAGGTATTTTTGAACAGGAAATGCACCATGAAATAGAGCTCATTGACATGTTGCATGACAAAAAGCTCCTCTATGCCGGTGCTATTGTTTTAGGAATGAACGATGCCTTGGTTGAGTTAACAGGAACACTCAGTGGTATTGCTTTGGCATTTGATCGAAGTATTGTGGTAGGTGTTACGGGGCTTATTATGGGTATTGCAGCAGCACTGTCGATGGCAGGTTCTGCCTATTTGGAATCCAAAGAAAATATAGGGGATGAAGTCAAACCCTTGACGTATGCTCTTTATACAGGCATTTCGTATATTTTGACAACAGCACTGCTTGTTGCGCCTTTTTTTATCATAAATCAGATATCTGTAGCGATTATTTGGATGTTTATTGGAGCAATATTAACCATTTTTTTATACAATTTTTATATTTCGGTCGCAAAAGACTTGTCGTTTTGGCTACGTGTGCGTGAAATGTCTTATATTACCTTTGGTGTTGCGCTGATCTCATTTGGTATTGGTTATGTGGTTAAACACTATTTTGGCATTGAAATATAAGATGAATGATAGAATTTTGATGTAATTTTAAAAGAGAGATTTTTTTAAAACTTGCCTGTGCCATAATGACTCATATGAGATAAAGGAGTTACTATGGATTATAGAATTGAAAAAGATACTATGGGTGAGATCAAAGTTCCTAATGAGCGTTACTGGGGAGCTCAAACTGAGAGAAGTTTGGAAAATTTTAAAATAGGTACAGAAAAGATGCCAAAAGAGCTTATCCGAGCCTTTGCTCTTTTAAAACGCTCTTTAGCAACGGTTAATCAAAAACTGAAAAAACTGGATGGCACAAAAGCAGAGGCGATCGTTCAAGCGTGTGATGAAATTCTTGCTGGCAAATTTGATGGTGAATTTCCTCTTGCCATCTAGCAAACAGGTAGTGGTACACAGACCAATATGAACCTAAACGAAGTTATCGCTAACCGTGCAACTGAGATTTTAGGAGGAGATTTCCGAAAAGAAAAACTCATCCATCCTAATGATCATGTCAATATGTCTCAAAGCTCAAACGATACGTTTCCAACAGCCATGCACATTGCCAGTGTTATTGAAATAGAAGAGCAACTGCTTCCTTCACTTGAAAAGCTCAAGGAAGCACTACAGGCGAAAGAAAATGAGTTTGCTGGCATCATTAAGATTGGTCGAACACATTTACAAGATGCAACACCCCTGACGCTTGGACAAGAGTTTAGCGGGTATCGAAGTATGTTAGAACACTCTTCTTCGCACATTTTACAAGCACTGGAATCTTTACGTGAACTTGCGATTGGAGGCACAGCTGTAGGAACAGGTATTAATGCGCACCCCAAACTCAGCGAATTTGTCAGTGAAGAGCTGAGCAAACTGACAGGGAAACATTTTGTCTCAGCTCCCAATAAATTTCATGCTTTAACATCCCATGATGCCCTTGTCTTTGCAAGTGGTGCCAATAAAGGCTTAGCAGCAAACTTGATGAAAATTGCCAATGACATTAGGTGGTTGGCTTCAGGTCCAAGATGTGGCATCGGTGAACTCTCTATTCCTGAAAATGAACCAGGAAGTTCCATTATGCCCGGTAAAGTCAATCCGACTCAAGCCGAAGCGGTAACGATGGTCGCCTGTCAAGTATTTGGTGCTGACACTGCTATTGCCTTTGGTGCAAGTCAAGGAAACTTTGAACTCAATGTCTTTAAACCAGTTATTATCCTCAATTTTTTGCAACAAGTCAGGCTTTTGAGCGATGTGATGGAGTCATTTAGAATACATTGTGTGGAAGGTATTGAAGCCAATAGCGAAAAGATTGCGTTCAATCTTCACAATTCATTAATGCTCGTCACGGCACTTAATCCCTATATCGGCTACGAAAATGCAGCAAAAGTCGCCAAATTAGCGCATAAAGAGCATTCAAGTCTTAAAGAGGCATGTGTGAAACTTTCACTTTTGACACCAGAAGAGTTTGATCGTTATGTCATTCCTTCTGAAATGATCCATCCTAAAGCGTAAAAAATGTAGAGCGTAGTAACTTCTTTTTGCTACTACGCTTTACATGTAATAGTTTTGTGACTACGGTATGCCTTTTGCATAAACTCCTTTAAATTTTTAAGGAGTGATTATGGAAACAATCATCAAAGGTGTCAGCAAACCATTATTGAAAAATGAGCGGAAAATAGGCTCAGAGGCTCCTGCAATCATTTTAGAAATGCTCAATGGTGAGATGAAAGTCATCGGTATGATGGCAACCAAAGTGCAAGTGATGATCACCTTACCTTTTCAAAATTCATTGAGTAAAGAGCTTTTAAACATCATTGAGAAATACCAAGAACAAGCATTTATTTATCTTATCTGTAGTACCAAATTAGAAGAAGAGGTCAATATAGAAAATTCATCTGTTGATTTTGTAGAATTTTCAAAGAAATTTGGGGTTTATATTGATGAAACACTCTGTGCAAAATCTCTTTTTATCATCAACAAAGATGGTCAATTTGTCTATAAAGAGATCACCAAGGATGTTGAAGATGCATTTGATCTTGAGATGTTTGAGACAAAGCTTGATGAAGCGATTCATTTCAAGAAAAAAGGGCACGTACATGAGACTTGGATGGGCGCATGAGTGAGCAAACTTTACTTTGGCTTTCAGCTAATGGCTATGATGCCAATGATTTAAATTTCGTTGGTAAGTATGGTAACTCTGCACTGATGAAAGCAGTGCGGGAAGCCAATATTTCCGTTACCAAAGAGCTCATCGAAGCGGGAGTGGATCTGGAACTTAAAAATATTGATGGCAATACCGCCATTTGGAATGCCTGTTTTGGTGGGGACTTCACCTGTGTGGAACTTTTGGTAAAAGCGGGTATCCAACTGGATAATCAAAATGACAATGGTGTAACGGCATTGATGTACTGTGCCAGTTCAGGGAAAGAGGAAATGACAAAGCTTTTGTTGGCATCTCATGCTGATACAACCATCGCGAATTTAGATGGCTTTAAAGCAATTGATCTCGCGAGCACACCGACTATTTATAAGATGCTAAAAGCAAGTATTCACTAAAAAATATCTTTACATGTAAAGATGGTTGGCTCACTGTTGAGTGAGGACTTTTTTTTCAAAAATGCCCCAACCACACTCAAGCCAGAGACGTGCACTTTGATTAAATAGTTCACTTTGTAGGGTTGCTACTTTAGCTTTTGTACTTGCCAAGTTGGCAATAGCACGGCTAAGTTCATCGGCACTGATGAGTTGGTTATCAAAACGTCCTTGTGTAAGGTTGGTATAGGATTCACTGGCTTTTTCTTCGAGTTTTGCGCTTTGAAGCTCGGCATTAAGTGTTTTAATCTCAAGCTCTGTATTTTCCACTTCAAGGGCGACTTGTTGCTTGTATTCTTCTAAATTGAAAAAAGCAGACATCTTAGAAGCGCGTGCTGCATCAATGGTATGTGTATCACTAAAGCCATTAAATAGATTCCATGAAGCACTCAATCCTACATAACTTTTATCAGCATTGGTATAGCCATCACCATTGAGCTCGAGTGAGTCACCTTGCCCCTTCAAAACACCCACCATCGCAATGGTGGGATAATTTTTACTTTTGGCAAGCTCAACACTGCTTTGTGCGACGTCAATCGCTTTTGCCATGACATGTAAATCTTCACGTTCATTTAAGGCGATCTCTTTTGCTCCATCACCATTGGGTATTTCAAAGGTTTGAAGGGTATTTGCTTGAAGCGTTTCTATTTTGGTATTGACGATCAGTGAGAGTTGGTTGAGAAGCTGTTTTTTTTGATTTTGATGGTGAAGCAGTTGCGCTTCAATATCATATTTTTTAGCTTCGATCGCGTAGAGTTCGCTTTGAGCCAAAAGTCCATTGGTGTAAAATCCTTTGGCTTTTTGATATGCTTGATTAATGGCAAGCTCTGAGCTTTTTAAAGCGTCAATAACTCTCTCTTCAGCGATAATAGCGCTAAAAAGTTGCGTGACATGCATGGCAAGATTTCGCTTCAGATTGGTCAGTTTAAGCATTGCTTGCTCATTTTCTAAGCGGGCTTTATCAATGGTTGCTGAAATAGCAAAACCCGTAAAAAGTGGATAACTTAAAATGAGTGCACCCTCAAGATGTCTACGTGTCCCCACATCGGCTTTGGTAACAGGAAAAGAGGGAAGATGGAGCGTCATGTTAGGAATTTCATTAAACTCAATGGCGCTCAGTGTAGCATCCAAGGCTGGAAGGTTTTTCCCTAGTGCTGCTTGATAGAGTGATTCTGAAGCACTTTCAAGTTCTTTAGCACTTTTATAGCTATTACTTTTCTCCAACAAATTTAAAAGTTCCGTGTATGTTTGTGCACAAAGACACAAAGGAAATAAAAGTAGCAGTAAACGTCGCATGAACCGTCCTTGCGAAACATCATGATTATAGTGTTAATAGTATTGCTAATAGCGTTAAATTGCTCTTAATTTAGGTAAAAATATACCAAAGAAGTTATATAATAGTTTACTCTTCAACCAAGGAGCGTGAATGTTAGAAAAACTCAAACAGTATCGTTTAGGTGTGGCTATTTTAGCTTTGGTCTTAATTGCTTCGGGACTTATTATCCATAAACTCATTGCTCCAACGCTTGCAGCCAATTTAGTTCAAGGTTCTGGACGTATGGATGGTGATTTGATAAATTTAAATGCCAAGTATGCAGGGCGTATTTCTACTTTAAACATTCAAGAAGGGCAACACGTTGAGCTAGGTCAAAACATAGCCGTCCTTGCAAGCGAGGAATATGAAGCGCAAAAGGCACAAATAGAAGCCCAAATAAGTGCACGAACACAAGAGTTAAACGCCAAAGAGACAGAGCTTGAAATTGCTTCCAAAACCATTCCTGAAACACTCTCAAAAGCCAAAGACAATCTCTCTATTAAACAACATCAACGCACAGAACTTGATAAAAATATCGCCTCACAAACCAGTATTTTAGCGCAAGACAAGCACGATCTTGAGCGTATGAAAAATTTGTTTGAACACAATCTCATCGAAAAGCGTCAGGTAGAAACCGCAGTGCTCAAATTCCAAACCAGTGGCGATTTACTGGCAGGATTACAACAAAAAAGAGAGCAACTCAGTCTTGCAATCAATGTTGCCAACAGCGAGCTCATCGAAGCAACAGCGCATCAAAAAACATTACGTGCCTTGGAACAAGGTATCGAGGCACTCAAGTCATCCCTGAAAGCGTTAGAAGCGTCTAAAACTCAAATCGAAGCGATCTTACATGAGATGATATTGCGTTCTTCTGTGAATGGCGTTGTCGTGGAAAAAATTGCCAATCAAGGTGAAGTCATCGGGGCTGGAAGTGTGGTTGCTACCTTGCTTGATCCAAACTCCCTTTATCTAAAAATATTTGTCGATACCAAACAAAATGGAAACCTTCAGGTTGGCAATGATGCCGTCATCTTTTTAGATGGAAAACCCAATGAACCCATCCAAGCTAAAGTCGTGCGCATTGAGCAAAAAGCGGAGTTTACTCCCAAAGAGGTCAGTGTTCCGAGTGATCGCATTCAAAGGGTTTTTGCTTTACATGTAAAACCCCTTTCACCACAACCAACGCTCAAACTGGGCATTCCTGCTGTTGGTGTTGTCTCCATGGATGGCAAAGGGTTACCAAAGAGTTTAAACAACGTTCCTGAATGAGTCTAGTATGGAATTAAACGTTCATAACGTCAGTGTCCATCATAAAAAACGTTTAGGCATTGCTGGGGCAAATTTATCTGCACAAGAGGGTGAGATCATCGGCTTTATTGGTGCGGATGGTGCTGGTAAAAGTTCTCTTATTCATGCTATAGCAGGGGTTATTCCATTTGAAGGTGATGTCACATTTAACGGAGTGACATACCATTCACCTAAAGAGGCTGAACCGCTTAAAGCCTCCATCGGACTGATGCCTCAGGGTATTGGGTTGGTGCTGTATGAACTTTTAACCATTGATGAGCATCTTCGTTTTTTTGCCAATATTCACAATATCTATCAAGACAGTGTCTTTGAAGCCTACAAGCTACGTCTTTTAAAAATGGCTGGTCTGGATGCCTTTCAAGATCGTCAAGCAGGAAAACTGAGTGGTGGTATGATGCAAAAGCTCTCACTGATCTGTACACTCTTACACCGTCCCAAACTACTCCTTTTGGATGAGCCGACCACAGGTGTTGATCCATTGAGCCGTATTGAGCTCTGGGAAATTCTTGATGAGATACGCAAGTCTGAAGGAACAATCTCTATCATCAGCACTGCGTACATGCAAGAAGCTGCTAAGATGGATAGAATCTTACTCTTTGACGAGCAAGAGATCATTGCGCAAGGTACTTCAAGTGAATTGATAGATTCTGTTCGATCAATGAGTTATGTTGAGGGTATCACAACAGAAGAACCTTGCATTCATACCTTACATGCGACTTATTGTCTGAGTGCTTTAGAGGAAAGACATATCGAACCGAGTCTTGAAGCGCTCTTTTTTGTTAATGCCCTTCAAAAAGGCAGAAAAATGCCCCTCATTGAGATTACCAATAAAGAAAAAACCATTGATTTACCCGATATTGTTATGGAAGCCAAAGGACTCACCAAAATTTTTGGTAGTTTTATTGCCAATGAAAATGTCGATATGACACTTCATAAAGGTGAAATTTTAGGACTTTTGGGTGCAAATGGTGCAGGAAAAACGACATTTATCAAAATGCTTTTAGGACTTTACCCGATTGATGGAGGAGAGTTAACGCTTCTTGGAAAATCCATTCAAACGCAAGAAGATAGACAAGCACTGAAAGCGAGCATTGGTTATGTGAGTCAACATTTTGCACTTTACAATGATATGAGTGTTGAGGAAAATCTACTGTATGCTGCATCGATGCGGGGTATTACAAACGATGTTGCCAAAAGCCGTATTGCACGTTATGCGTTAGAACTTGGATTTGATGAATTTCTAAACTCGATGCCTCAAGAACTGCCTCTAGGTATCAATCAGCGCTTTTCGATCGCTTCCGCACTCTTGCATGAACCAGTTATTTTATTTTTAGATGAACCTACTTCTGGGGTTGATTCCATTGCAAGAGCAAAATTTTGGGAGCTATTAAAGCTTCTTAAAGAGCGCTGGGAAATCGCTATTTTGATCACCACCCATTATATGAGTGAAGCCGAATTTTGTGATCGTGTCGTATTGTTGCGTCAAGGCAAAAAGATCGCCGATCATACGATTGCGGAGTTTTACGCCAAACACCCTAATGCGCAAAGTTTTGAGGAGATTTTCTTGGAGTATTACCGATGAAATTGGGTGTCATCAAGGCCTATGTGCTCAAAGAGCTTACGGAGATTGTACGATCTCGCCTGATTATTATGGTTTATTTGTTACCTACGATGGTATTGGTGCTCTTTGGGTATGGTATTCGAATGGAAGTAACGGGTGCACGTACACTTATTATTGACAACGATCAGAGTCATTATTCACAGCTACTTGTGAGTAAGTTTGAACACTCTAAATACTTTGATTCCACTATTGAAAAACGAGGTGAAGCCAAGGCACTTGATGAAATTCATAAAGCTAAAAGTGACATTCTCATCATCATCCCTGAGAGCTTTGAGAAGCGGCTTTTACATGGGCAATCTACCCAAATCGGTGTCTTTGTGGATGCCGCTTTCCCGATGCGAGGCTCAACAATGGAGAGTTATGTCAAAGGTGTTGTCCTAGATGCTGCCAGTCAGATGTTGGAGCGCTTAGGAGGCAATGCTCCAACGATTAGCATTAACCAGAGGACACTCTTTAACCAAGCAATGCGTGATGAAGATGCTATCGTTCCAGGACTTATTGGGCTTGTTTTACTCATTGCTCCAGCTATTTTGGCTGCCCTTTTGATCGTTAAAGAGAAAGAGCGAGGGACTATTTTTAATTTTTATGCATCACCACTTTCGAAAGGGGAGTTTATTGCGGCTAAGCTGATTCCTGTTTTTTTACTGCATTCGATCAATATTTTTATCCTTTTTCTTTGGGCAACATATCTGTTTGAAGTGCCGTTTCGAGGTAGTTTCTTGCTCTATTGGCTCACGTCTGAGCTCTATTTGATGATTAGTCTCTCCATTGGGATGCTGATTTCTATCGTAACCAGTACGCAAATTGTAGCGGTGGTTTTAACGGTCATTGTGACCATCATTCCTGGATTTTTGTATTCAGGTATTTTGATGCCGATCTCTTCAATGATTGGTGTTTCACGCTATGAAGCGCACATTTTTCCGGTGATGTACTACAACCATATTCTTTACGATGTCTTTTTAGTGGGAGAAGGGCTGGCTTCGTCTAAAACTGTAATGTACATTGTTATTTTGGCCTTTTATGCTTTTTTTATGCTGACTCTTGGAAGTTTTTTACTTAAAAAGGAGCTCAAATGAAAATATTTTGGGCTGTTGTTGGGAAAGAGTTACTGAGCTTTATACGTTCATGGCAGTTGGTTTTTGTTGTGCTTTACGCCTTTAGTTTTGAAGTTTACATTGCTGGCAGTGGCATTGAACTTAAACCTCGTAATATTGCCGTTGGGTATATAGACAGTAGCGGAGGAGGACTCAGTCAGAAGTTTTTGAGTTACTTCCATGCGCCTGAATTTTTAGAACCCGTTTTATTTGAGTCACAAGAAAAACTCTCTCAAGCGGTATTTGATAAAGAGATCATGGTAGGCTTAGTATTTGATGATACGTTTGAGCAAAATTTTCGTAAAAAACATGCGACTACTTTGAATGTTTTACTTGATGCCACAGCAGCATCACAAGCCTTTACAGCTCTAAGTTATTTGCAAAATATCGCGATTAATTTCACAGCACGTTCCTTTCCTGTGGAACTTGTGACCCACAAACTCTTTAATGAAAACGCCGATAACCACACTTTTATGGCGCTGACAGAACTGCTCTCGATCACAACGCTTTTATCGGTCATTTTAACCGCCGTCGTGTTTGTGAAAGAGAAAGAGGAGGGGACATGGGACATTATGCTTTTGATGCCAGTGAATGCAAAAATTATCATTTTAGCAAAGTCTTTTTCACAGGTGATTATTGTGATGGTGGGCATTGTTATCTCGGTGGGCTTTGTGATTTTTGGAGTGTTTAATACGCCAATTAATGGTTCGTTTTTTGCCTTTTTACTGCTCAGTTTTCTCTATGCTTTTACAGGCGCTGGTATTGGACTTTTCATTGCAGCGATTGCTAAAGATGTCATGCAAGTCGCACAACTTGCCATCGTCATCATGCTTCCTCTCATTTTTCTCAGCGGTGCGTGGACGCCCATTTATGCGATGCATCCGTTGTTGCAGAAGTTCTCACTCATCTCGCCACTGCGGTATTACATCGAAGGAACGGAGAGTATTTTCTTTCGTGGAACACCTGTTTTGGAGCTTTACCCGTATTTCTTGGGGGTAACAGTAGTCGGGAGCGTGCTTTATTTTATTGGTTTTCGTAAGATTGGGCGATTGTTTTGATGTTACTGAAACTTGTCGATTACCATTGGCAGAATCTTTTTACATGTAAAGATTAATGTAAGAAATTTTTGATACCTGTAAAAATGAGCTGAGCTGAGAGTGCAGAAAGAACCAGTCCTGTTACTTTACTAAAGACAACAAGTCCCGTGCGCCCAATGAGCCTTTCGATATGCCCTGAGAGATACAATAATAACCCAATGCTAAAAATAGCCACTAAAAGTGCGCTTGAGCCTAAGATAAGATCTTCAAAACCATCCATATCGGCTCCCATAACCATTAAAGCACCCACGGTTCCAGGTCCTACGGTAACTGGTATTGCCAAAGGCACAACGGCATGTTTTAAGATATCGGCTTTACATGTTGGTTCTGCATTAATGTCTTTTTGAACCAAATCAACAGCGGTTAAAAAAAGTAGAGCACCAGCTCCAATACGAAATGCATCAAGGGTAATGCCAAAGAGTTCAAAAATATATTTTCCGAAAAAAAGAATAATAAGACATGTAATAGAAATAGCCAATGTCACTTTAACTGCAAGGCGCTTTTTATCGCTCTCATCAATTCCTTTTGTCATCGACAAAAAGATGGTCGTAACAAAAAATGGTGTCATAATAAAAAAGAATTTGACATAAATTGCAAAAAAAGTTGAGAGGTGAATCATGTATGCTCCTTAAAAAGAGTTGGCATTATAGCACTTTTTCAGTTCCTCTTTGGCAGAATAGGGGCATGAAAAAAGAGATCCTTTACCTAACCGAATACTTAGCTAAAAGTGAAAGCGAGCAAGAGCGAACTTTTTATGCACTATTGATTCAAAACCTTGCCGACTTAGAGGTGTACTCACCGACTAAATTGACACAAGCGCAGATTGCCTCTTTGATGTCACGACAAGGATTGAGCGTTCCTTCTAGTTTTAAAGAGGGAATTCAAGCACTCGATACTTTATTTGAATCATTCATTCCAAAGCCACTTCAAGAGGCAAAAAAAACTCTTTTTATGACTCTTCTTCATGCAAACTTTCCTAAAAAGAAAGGTTTTTTGAGTGTCTCTTTAGAGCTTTTTCTCTCACAACTAGAACCTGTTGAAATGAGCATCTATGAGAGCCTATTAGCCTATGTTGCTGGACTCAACCGTGCTCTTGCTCTTTTCTTTATCTTAGGTAAAGAAGATACTCAAAACTTTACACCTGAGCGTTTAGTTGCTTTTGGAGAATCTTTACATGGTAAACTTTTAGCATTTCTTTTTAATGAAGAAGAGACGGCACTTTTGAATCAAGGTCTTAAAGAATTACTGGGTGTTTACCTCAGTTTATATGGCAAATATCTTTACATGTAAATCGAAAACAGTCAATTTTAGTCTTTTGCTTACGTTATAATTGTCATTATAAATTCTAAAGGTTTAGAAAATGAATGAAGAAGAAAAAGAACTCAGTTGCGCTGAATGTGGAACACTGAATTGCCATAAACATGATAGTAGATATCCTAAATTTTGTTTAACAACCAATGTTGATGAGCAGATGCTAGAAGAATCTCTTGCCTGTTATAAAGAAGAAGAGGGAATGGATCGCAAAATTGCACTGGCCGCTGCTGATATTGAAGGTAAATACTATGGACAATTAACACGTGTTGAAGAAATATTAGCCTTTGCAAGACGCATTGGTGCAAAGAAAATTGGTATTGCATCCTGTGTTGGCTTAGCTGCCGAGTCCAAAATTTTTGCTGAGATTCTTAAAGTCAATGGATTTGATGTTTTTATGGCAATTTGTAAAGTAGGTTCTCGTGATAAATGCGATATTGGGCTTGAAGAAGAGCAAAAAATTCGCCCAAACACGTTTGAGCCGATGTGTAATCCTATTCTTCAAGCCAAATACCTCAATAAAGCAAAGACAGATCTGAATGTGATTATGGGGTTGTGTGTTGGACATGACTCATTGTTTATCAAATATGCGAAAGCAACCACAACGTATCTTGTAGTCAAAGATCGTGTTTTGGGGCACAATCCTATTGCCGCATTGCATTTAACACAGACGTACTATAAAAAACTTTTAACACCTAAAGCGTATTAATTGGTTATACTCTATCAATAAAGAGAATTGAAAGGCGGAAAATACAAAATGTATTTTTGGATAATGATGATATTTTTCGTACCATTTTCTCTTTTTGGTGTTGAACTATCGTTAGATGAAAATACTTATCTTAAAAAACTAGGCACTGTTAACGTCTGTGTTGATCCCGATTGGGAACCTTTTGAGATGATAGATCAAAAAGGGAATTATACAGGTATTGGTGCTGATCTTTTGCATCTGGTTGCCCAACGCATTGGGCTTAAGATTACTGTTTTGCCGACAAAAGATTGGGACGAGAGTATTGCGTATTCCAAAGCAGGTAAATGTCAAATCATTAGCTTTCTCAATCAATCTCCTTACCGAGATACATGGCTTTTGTTTACAAAACCACATTTTAGTGATCCCAATGTTTTTATTACTCGAGAAGAGCACTCTTTCATTGGTGATCCTCATGATTTGGTCAATGAAAGCATTGTTTTTCCCACAGGAACAGCGATGGAAGAGCTTGTTCGCACAGAGTATCCAAATCTCAACATTATAACCACGCACTCTGAAATGGATGCTTTCCAACTGGTCTCCAATAAAAAAGCAGACATCGCAATGCGCTCACTCATTGTGGCAGCGTATACACTGAAAAAAGAAGGAATGTTTAATCTCAAAATTGCAGGGCAATTACCCGATTATATCAATAAAATGCACATGGGAGTCATACAAAGTGAACCGATGCTTCGGGACATTTTAGACAAAGGTATTGCAACCATTAGTGCTGAAGATCGAGCGAATATTGTCAATAAATATGTGGCAATTAAAGCTCAAACCGTTTACGATTATAGCTTACTCTTGAAAATTGTTTTTGGGTTTATGATTTTGGGTCTATTGTTTCTATGGCGTTATTATGAGCTCAAAAAATATACTAAAGAGTTATTGTACCTTTCAGAAACAGACATTCTTACCAAAATGTACAATCGAATGAAGATCGAAAAAGAGTTGGTTATGCAAGTTGAGCGTGCTAAAGCCATGAAATATTCTTTTTCTATATTATTGATTGATTTTGATTTTTTTAAGATCATCAATGACACCTTTGGTCATCCTATTGGTGACAAAGTCTTGATTGAAATGGCAGACCTTATTAAGCGAAGTATTCGCTCTGATGATAGGATTGGACGTTGGGGTGGAGAAGAGTTTTTAGTGCTTTGTCCACAGAGTAATGAAGATGAAGCACTAAATATTGCCAGACGTATTCAAATGGCAATTCATACAGGTGTTTTTTCGACCCATAAACATCATACGGTTAGTATTGGTATACGAACATTAACAGATGAAGACACACCTTATACGCTAATATCACATGCTGATGACGCACTGTATAAAGCAAAAAATACAGGTCGCGATACAATTTGTTGTTCTTCTTCTAGCACATCTATTTAAATAAAATACTTCCCAAACGAATCATGTTAGAACCACAGGCAATTGCTAATTCAAAATCTCCACTCATACCCATTGAACAATATTTCGCACCATAATTTTGGAGTGATTCAAAGATTTTATGCGTGGTTTCAAAGCTCTTTTGAATAACAGCGCACTCTTCGGTATGTGCACCAATACTCATGACACCTTTAAGCTGAAGGTGTTTACATGTAAGCACGATTTGTTCATACACTTCGATTGCTTGCTCAGGCAGAACACCCGCTTTTTGTTCTTCATAGGCACTGTTGATTTGAAGCAGAACGTTCATGGTTTTATTCTTTACATGTAAACGTTTGTCGATCTCTTGGGCAAGCTCTAAAGAGCTTAGAGAGTGCATCAATGATGGCTCTAAATCGATGAGTTGGTTGATTTTGTTGGTTTGTAATCGTCCTATAAAATGCCATTCAAGTGGGAGGCGTGAAAGTGCATGAACTTTGTCACTCATATCTTGAATTTTATTTTCACCAAAACAACGTTGACCTGCATGGTACATCGCTTCAATCATAGAAGGATCAGCGCTTTTACTTGCTGCAACAATCTTAACAATCAAGTGTTGATCGACACTCAAACGCGCTTTTTCAACGCGTGTGAGAATATCATCCAAAGTATTTACAAAATTTTTAGTTTCCATTGGTTAGCCTGTAAATATCATTAAAAATACTCAAAGCCATAAGGCTGAGTAAAAATATCCATCCCATTGACGTCATAGCCGTAAGTACACGCTCACTGGGTGCTTTTTTGGTCAACATTTCGTAGGCATTAAACATAATGTGTCCACCATCAAGGGCAGGAATTGGTAAAAGATTGAGAACGCCAAGGTTAACAGAAATAAGTGCCGTCAATGCAAAAAGCGCCACAAGACCAGCAGCGCTCGCCTCTGAAGTGACTTGAACAATCGAGATGATACCCCCTAACTCTTTAGGTGAAACAACCCCTTCAATGAGTTTTTGAAGACTGGTTAAAATAAGTGTGGTTGCCTTTATGGTTTGTTCATAAGCAAATCCTGGAAGTTCACTGATGCTATAAACCATTTCAATCGTTTTACCGCTTGGGAGAACACCAATCATTTTTTTCTGTTTTGTCTCTCCAAACATATTTTTGTATTCACTAATTTTTGGTGACAAAACAATCGTTTGTACGCTTCCTGCTCGCTCAACTTTCATTTGCATTGAACCACTATTTTTTTGAATGAGTATACTCACTTCATCCCAAGTTTCAATGAGTTCGTTATTAATCATTACAATGCGGTCATTCTCTTGGAGACCTGCTTCTAAAGCAGGGGAGTTTGGACTAATTTTACCAATAATTGGAGCAAATTTTGTTACTCCCATCGTCCCTACAGCAAGAAAAAGCAAGAAAGCAAGTAAGAAGTTCGCAAAGGGACCTGCAAAAAGGATAATAATACGTTTCCATGGTGCTTTGGTTGTATAACTGTCAGCATCATAACTGACTTTTGTTGGATCACGATCGTCTTGTCCCTTCATCTGCACATAGCCACCTAACGGAATGAGACTTAGACAATACTCTGTATGACCAACGATTTTTGAAAAGATCTTTTTTCCAAATCCAATGCTGAAAACTTCAACGTGAACACCAAAAAATCGAGCGGCAAGAAAATGTCCTAATTCATGAAAAAAAATGAGAAACGAGAGAACCAATATTGAGGTCAGTGTACCCATCAGTTGCTCTCTATTTTTGTGTAGCCAATGATATATTCATAACCAGAGTACAATGTAAGTCCAACAGCAACCCATAAAAGTAACTCAGCATAAGGCCAGTTCATCATTAAAAAGCCAATGGCAATCATCTGAAAAACAGTTTTGACTTTACCCGCCATACTCGCAGCAATATTCTTACCTTCACCCATTGCCGCTACACGAAGTCCTGTAATAAAAAATTCACGTGTTAGGATAAGAAAAATGGCCCAAGGGTTAGCCCGATCAATCATCATCAGTCCCAAAAATGCCGCTAAGGTAAGCATTTTATCAGCAAGTGGATCAAGAATAGCACCCAGTTGTGTTTTCTGATTCCAATTACGTGCAATATAGCCATCAAAAAAATCTGTTGCGCTCGCAATAACAAAAATAAGTGCAGCAAAGTAATCTAACCAACTTACATGTAAGCCCTTAAACAAGGGTAAATCACGATTGACTAAAAGAATAAACATAAGGGGTGCTAAGCCAATACGCATAGAGGCTAGAAGGTTTGGAAGATTCATCATTTAAACGTTGTACCACCATCAATCAGCATGGTGTGCCCTGTAATCCATGATGCCTTATCTGAACAAAGGAATAAACAAGCTCCAGCTAAGTCTTGTGGTTGCCCCATACGACCTAAAGGTGAAAGTTTAGCGGTGATATCACGAACCTCTTCATAGTTAGTAAAAGCTCTTAAAGCATCGGTTTCGATAGGCCCACCACTCACAGCATTGACACGAATACCTTTACAACCAAGTTCCGCTGCAGCATAACGAACCATTGTCTCAACAGCCGCTTTTGCGGTACCATGACCTGCATAATTTTCAATATAAACGAGATTTCCAGTAGATGAAAGAGAGATAATACTTCCCCCTCCAATTTTTTCCATACGTTTAGCCGCTTCTTGTGCACCCACGACAAAAGCATTGACCGTTGCTGTAAAGATGTTGTTAATACCACGAGGTTTAAGTTTCATAAACTTGGTATATCCACCCACAACTGGTCGACCAGAAATGATCGCGTTAGAGATAAAAAAGTCTACACGATCAAAATCTTTGTCAATTTCCAAGAAAAGATCTTTATAGGTCTCTGGTTCTAAAATATTGAGAGGATATGCTTTGGCTTTAATGCCAAATTTTTCTTCTAAATCTTTCACTTGCTCTTGTGCTAACGCTTCATTTGAATTGTACGTAAATGCAATATTCACACCTGCTTGGGCAAATTCATGAACAATGGCTTGACCGATACCTCTGGTACCGCCGCTAATGACAAGTGTTTTTCCCTTCATTAAAATCCTTTAATCGTATATTTTTTAAGTGTATTTTCAATGCGTTTCATATTGTCGTTGCTTGGAGCACACAGAGGCAAACGATATTCCAGTGTTTCAAGCAAACCAGCAATATACATTGCCGCTTTTATAGGAATAGGATTACTTTCGCAGAAAAGTGTTTTATTGATCTCATACAGTGAATCATTAATGGCTTTTGCCCCATGAAAATCACCTTTAAGCGCTAAGTGTGTAAGCTCTGCAATTTGGTCAGGTAGAATATTTGATGTGACGGAAATTACACCTGAGCCTCCATTGGAAAGAATAGGGTAATTGATCGCATCTTCACCGCTAAAGACAGAGAGTTGCGGCTGATGTGCGAGTAAATCTACACAACGATCAATCGAGCCTGTTGCCTCTTTGACACCAAAAATATTAGGACAAGCACCAAAAAGTCTAAAGATGGTATCGGGGAGTAAATCACATCCTGTACGTCCTGGGACATTGTACAACAACACAGGAATATCAACAGAACCTGCAATCGCTTTATAATGTTGAAAAAGACCTTCTTGTGTTGGTTTGTTATAGTATGGTGTTACAGAAAGAATAGCATGCGCTCCATGTGCTTGTGCAAATTTAGCCAATCCAATGGCTTCATGGGTTGCATTACTTCCAGCGCCTGCAAGAACTTTGACAGCACTATGTGAGCAAACATCGACTGCTATTTCAATACAACGGCGGTGTTCATCATGATCTAGTGTTGCACTTTCACCTGTTGTTCCAACAGGGACAACAACATCTATGCCATTTTTAATCTGTCTTTCTATGAGTTTTGCATATTGAACTTCATCGAGTTTTCCATTTTTAAAGGGTGTGATAAGAGCGGTCATCGCTCCTTGTAACGCCTGTTGCATTTTAATAATCCTTTCTTAATATAACTGTGGTAGATGATTGTGGAACTAAATATTTATTGACAACTTTAATTATATCTTCTTTTTTGAGCTTGTTAATCCCCTCTTCATAGTTCAAAAGAGGTGTAATGTCACCTTTTGCAAAATAGGAACCAAAAAGGGTTGCAAGATCGCTAGAACTTTCTAAATTATGAATAAAATCAGCTTTTGTATTGATCTTAATCTTTTCAATATCTTTATCACTGACATCGTCTTTTTTAAGGCTATCAATAATTTTGAGTATTTCAGTTTCAACACTTTCTGCTTTTACACCTGGATTACAGACCGCTAAAAATAAAAAGACAGAGGGATCTTTTGCTTCCATTGCATAGCCATAGATTTGATTGACCAGTTTCTTTTCATCTACTAAAATACGGTGTAGCTTGCTACTTTTTCCACTGCTTAAAAGCTCACTTAAAGCAGAAAGTGCCACTTGGTCTTCGTGTAGAAAATTTGGAATTTTATAGGCAATAGCCACCATTTCAACTTCACTCTCTTTTTTAATAAAAAGACGTTTTGCACCATCTTGTTGTGGTTCAACTTGATGATGTACCTTTGGAAGAGGCGTAGAATTTTTAATGTCACCAAAATATTTTGTAACATTTTTGAAAACATCTTCGGGTTCAATGTCACCCGCAACCACAACAATGGCATTTGAAGGTTGATAGTATTTTGAGTGAAAACTACGAATATCTTCAATGCTCCAATTTTTTATATCATCTATAAAACCAATGGGTGTCCAATGGTATGGATGGTAGGTAAAGGCATTATTGAAAAGTCTAAAGTAGAGATAACCAATGGGTGAATTATCCGTTCTCCATAGCCTCTCTTCTAGAACAACATTACGTTCTGGTTGAAATTCTTCATCAGTGAGTTTAAGATTTTGCATCAGTTCTGCAAAAAGCTCCAATGATTTTGGTAAATTTTTGGATGAACTTTTGATGAAGTAATGCGTGTAATCAAAACCTGTTGATGCGTTATTCACGCCACCAAAGCCTTTAACAATTTCATCAAATTCACCGGTCTTAAGGTTCTTAGTCGATTTAAAATTAAGGTGTTCAAGCATGTGGGCTATACCACTTTTACCCATAATCTCATTGCCGCTACCGACTTTATAAAAAATATCTGTCGTAATAACATCGCTTTTATTATGCATGGGAATGACAACAATTTGAAGTCCATTCTCCAATGTTTTAGTAAAGTGCTCAGGTAGAGAGTTAGCCATTAATGCTCCTAAAAATAGTAAAAATAGTAGTATCTGTTTCATTGTTTATCATCTCCTATCGCTTCCTATAACTTCACTAATATGATTAAATCCATCTTTTTCCATGCGCTCCAAAATACCAAGATTGATTGCACGGTTAAGAGAAGGTCCTTTAAAAATGAATGCCGAATAAATTTGAATCAAACTTGCTCCCGCTTTCAGTCTTCGGTAGGCTTCATCCGCTGAATCAATACCCCCAACTGAAATGAGTGTCGTTTTGCCAAAAAACTCTTTCGCGAGAGCTTCAAACATGACAAAACTTTTTTCTGTTAGCACTTTGCCGCTCAGACCACCAAAGTCTCTTGCATTGGGAATCAAAGAATAGTCAACGGTTGTGTTGGTTGCTACGATTCCCGCAGCTCCATTTTCAAGGGCCGTTGAGCTTACATGTAAAGCATCATTGATCTCAAGATCGGGTGCAATTTTAAGAAGTACCGGTTTTAAGGTTAACTCTTTTGCCATCACAAAAAGATCTTTAATAAATTGCTCATTCTGCAAATCTCGAAGACCTGGAGTATTCGGAGAAGAGATATTGATGACCAAATAATCGCTTAAATCTTTAAAGCGTTTAATTAACTTTTCGTAGTCTTTGAGTGCATTTTCAACCGTAGTCGTTTTGTTTTTTCCTATATTGGCACCTAAAGGTGTTGCAAAGGGGTAAAGTGACTCTAAGCGCGTGCCGACTTTATACATACCTTCATTGTTAAATCCCATTGCATTTTGAATGGATTCTTGTTCTACGTAGCGAAATAACCTTGGTTTTGCATTGCCATTTTGTGGCTCAGGCGTAATCGTTCCATACTCAATATGACCAAATCCAAGAGCGGTAAGCATTTGAATCATCGTTGCATTTTTATCAAAACCTGCACCAAGACCCAGAGGATTTAAAAATGTTTTACCAAAAATAGTTTGTTCCAAACGTTTATCATGAATAAAAAAATGCTCAGCCAATGGTGAAAGAATAAAAGGAGCATAGGTGGCACCATTCTTAAAGAAAAATTCTGCAATATGGTGGGCATTCTCAGGAGAGAAGTTAAACATGAATTTTTTCATCAGGCCATAATAATCAAACATCTTCTATTTCCTTCAATATAAAGTACGCGATTATACTCCACTAAAGATAAAAGATGGTTGATTTTAGCCCTTTTCTAGTGCTGTAAACCTTTGAGCTTAAGATACTCTTGGGAGTTAGCCAATAGCTCCTCGTCACTTCCAATGGCACAAATTTTACCATGTTTAAGAAGCGCAATTTTATCTGCTTTTTTGATGGTACTCAGGCGATGGGCAATGACAAAAGTAATTTTATCTTTGATGAGATTTTCAATCGCCTCCGTAATTTTTTGCTCACTTTGAGAATCAAGTGCCGAGGTTGCTTCATCTAAAATAAGCACTTGAGGATTGGTATAAATGGCACGGGCTATGGCAATACGTTGCCTCTGTCCTCCTGAAAGGTTCGTTCCAAATTCATCCAAATGCGTATGAATACCTTCAGGAAGGTTTTGAATAAATTCGTAAGCATTTGCTTTTTTCAGAGCATCAATCACGTTATTTTCATTGATTTCTTTACCATAAGCAACATTGGCAGCAACACTATCATGAAAAATATAAACACGCTGTGTTACCATGGCAATATTGTGACGTAAATCGTGAAGATCAAAAGATTTAAGATCAATGCCATTGATACAAACACTTCCTGATGTTGGATCATAAAAACGCATCAGCATATTCATTAAGGAGCTTTTCCCTCCACCACTATCACCAATGAGAGCTATCATTTCGCCCGATTTTGCTTCAAGACTCACATTCTGTAAAGCTTCTTTTTCGCCATAACAGAGAGAGACATTATGAAAAGAAATCGTATCGATTTTAGAAGGTAACATTTTTGTTCCAACGGGGATAGATGAGCGTTGATCTAGTAAAAAGAAAATACGTTCACTGGCAACTAATGCATCTTGCATTTTGTTATAAAGTCCTGATATTCTCTTAATAGGTGTGTAAAGCATAAAAAGAGCTGTTAAAAAAGAGAAAAATGCTCCAACACTCATACCTCCTTCAATAACCTCTTTACCACCTACGAGAATCACTACGGCAACACCAATAGAACCTAATGTTTCCATTACAGGACTGACAAGTTCATTGACTTTAACCGATTTCATCGTCAGTTTGAAGTAACGCTCATTGTCTTTTTTGAAAAGTTCGTGCTCGTATGTTTGTGCATTATTGGCTTGAATAATCTCAATATTGTTAAAAATTTCACTGAGTTTTGCTGTGATATCCGAAACTTTTTCTTGCGATTGACGTGAGACTTTTTTCATTTTTTTAGAAAGCACACTCAAAGGGTAGATTGCTAAAGGCATAATAATGAGTGCATAAAAAGCAAGTTCAGGACTTTGGTAGATGACAACACCAATCAAGCCAAAAATAGTCAGCGTTTGGCTTAGAAATTCAGGAATGAGATTGGAGACAACGGTTCTAACACGCTCTACATCATTCGTATTACGACTGATCAGCTCGCCCGTTCGGTACTCATGAAAAAAGGAGAGGTCAAGTTTTAAAAGATTTTCCAAAATCATGTCACGAAAGCGTTTGATAATATCCTGGCCAATATAGGCGGTATAGTACGCTTGTGTATAACGACCTGCTTCTTTGAGTGCATAAACAGCGATAATCGCATAGGGCAAAAGTTCAAGCATCTCTTTATCTTTGGCAATGAAAATTTCATCCAAAAGAGGCTTTACAAGATACGCTGAATAGGCAGTTCCTCCACTGGAGAGAAGCATTCCAAAAATCGCTAAGAGAAAATAGGGGATATAGTCCTTAAAAAAAGGAGAAAAACGGGCTAATACACCTTTAATACCGAGCATTTAGTTCACCTTTTCCCACTGTGTCCCCATCGCTGTATCCATGAGTTGTATCTGCTTGGACTCTAGTAGTGTGCGAATTTCATCTGCTTTTGCAAAATCTTTGGCTTTTTTGGCAACTATTCGGTTTTCAATCAATGCTTCAATCTCCGCTTTTTCAGCGTCACTAACACCGATTTGAAAGTACATGTAAGGATTGTAAAGTCCAATGCCTAGGAGCAATTCAATCCATTGTAAATTGGCATGTGTGGTTGCTTTGAGTGCTTTATCTTTAGGATTAAGATCGAGCATTTCATTCGCTGAAGTGATCATCTCATCCAAACTTGCAAGGGCTTTGGAAATATTAAGATCATCACTGAGTGCTTCCAGCATTGCCTCTTTAAAAGGAGCCGAGACTTCACTTGGTATGTTTTCAAAAACTCTCTTTTTCAATCGATAGAGTTTATCAAGGCGTTTTTTGGTGTTTAATAGATCTTCTTCGGAAAAATTGAAATTCGCACGATAATGTGTTGCAAGCAAGTAAAAGCGAAGCACTTCGCCACTGTAAATGGCAAGTGCATCTTTTAAAAAGAAACTATTTCCTAGTGATTTACTCATTTTTTCGCCGCTAATGGTGACAAAGCCATTGTGCATCCAGTATTTGGCGAGTTCTTGATGACTTTTACATCTGGTTTGTGCCGCTTCATTTTCGTGGTGTGGGAAGAGAAGATCGGCACCTCCTGCATGAATATCGATCTGAAAATTGCCACTGCTCGCAAGATGTTTTTCAATCATCGCAGAACACTCAATATGCCATCCCGGGCGTCCTACACCAAATGGGGCAGAGTAGCTAGGTTCATTGGCTTTTGAGAATTTCCACAATGCAAAATCTTTTTGATCTTTTTTCTCTTCTTTTTGCTCGACGCGTGATTGACTCGCATCTTCTTCAATGTTGCGGTGACTGAGGCTAAGGTATTTGGCATCTTTTGAGGTATCAAAATAGATCCCATCACTGGTTGCATAAGCGACTTTTTTGCGAAGCATTTCATCCACAAACGAAATGATTTCCGACACCGTCTCCGTCGCCTTTGGTTCAATGTCCGCATCCTTTACATGTAAAGCATGCATTTCATTTTTGTAGCGCTCAATGTAAAAAGAGGTAATCTCCTCAAGTGACTTTCCACTCTCTTTCATTTTGTTAATAATTTTATCATCAATATCAGTGAAGTTCTTAACAAACGTTACGTGATAACCAAGAGCAATAAATACACGACGTAAAAGATCAAAAGCAATAGCACTTCTTGCGTGTCCTAAATGAGCATCATCATATACAGTTGGACCGCAGACATAAATTTTAACTTCATTCTCTTGAATAGGAATAAACGATACTTTTTTCTTCTGTACTGAATCATAAATAAACATGCATAAGTTCCTTTACATAGAGTAATACACCCCATTCTACAACAAGGAGTATGACTAAAATAGCTAATTTTTTAGTATATAGTATAGCAAAAAACTTTCCTAGCCCAAATGATCGGAGCGTAAAAAAGAGTAGAACAAACGCTGAAAGCGATCCAGCAAGAGCAAGGCCTGCAGCGCCCATAGGCTTGATAAGGCTAAAGGAAAAGATTAGGTTGGCAATGAGTGAATACATTGCGATAATGGCCGCTTCTTTTTGTCTCATCCCTGCATAAAGCCAAAGGGAGAAGAGTTTTGCTAAACCAAAAGGGATCAGTCCTATCATATACATTGCTAAAACAAAACCTGTATTAGCTGTATCTTGAGTCGAAAAAGAACCATGTTGAAACAGTAATTTAACAATTTCTTCACTGAGGATAAAACCACCCAATGTTGATGTTGTGAGTAAAAACGCTAAAATCCAAAAGCCTTGAGAAAGTAAATTGGAAGCTTCTTCCTCTTTATTGGCTTTGAGGAGTCGGCTAATTTTTGGAAAAATTCCTGTGGTGAGTGCTATCGCAAAAAGAGCCAATGGAAGTTGAAAAATACGGTTCCCATAGTAGAGATAACTAATACTACCTGTCACTAAAAAGCTTGCCAGTGTTGTATCGACAAAAGAGACGATTTGTGGTGTTGAATTTCCAATAACAGAGTGCCAAAAAGAGCGATTAAAGTTTTTGTTGCTCTCTTCTAATGCCGCATCTCTCTTATGACGATACTTAAAACCCAGCGCTAAAAATTTAAAAAAACGCTCTTTCTTAAGTGCAAAAAGATGGGCAATCACTTGCAATACGCCACCCACTAAAACACCATAGCTTAAAGCATAAACGATCGTTTTACGATCATACCCTTGAAAAAGAAGGAGGGCACCTATCATACCGATGTTAAGGAGTGCTGTTGAAAATGCTGAAACGGCAAAATGGTGCTTGTATTGAAGCAATGACCCAAAAAGGGTGACACAGAAAATAAGAGGTAAATAGTAAAAGTTAATAGCAACAAAAGGTGCTGAGAGTGCTACCGTTTCATCATCAAAGCCAAAGGCAATTATTTTGGCAAAAGATTCGCTAAAAAGTGTAACAATCAATGAGAAAATAATAAGAAAAAGTAAAAAGCGTGAGAAAATAGTATAGGTAAAAAGTGCCTTACGTGATGTTTTAATAAAACTCGGGATAAAGCTCTGTGTAAAAGCGCCTTCTGCAAAAATACGGCGAAAAAGATTAGGAAATTTAAACGCCACGAAAAAAATATCGCTATAGATATTTGCTCCTAAAATTGAAGCACTCAGTATATCGCGAATAAAGCCAAATATACGAGAAACAAGGGTTCCAATACTGTTTGTGAAAAATGATTTAATAAGCATTGCGTATCTTATCAAAAAATGGTTTAAACAATATTTTTATCTATTTTTCGCTAAGATTTCCTAACATTATAAGCAACATTTGTCGATATTAGAGCAAAATTGAGAGTTTATTCAAAAAACAGAGGTGCACATTGGGGCTATTTGATAAGATTAAAGGTCATAATACGGATACGGGTACACAAAATGATGGTGACGTGTCTGAATTTAAATCTATTGTTATAGACACTATTAACGTCATCAAAGAACTAAAAAATGTTGCTATTGCAAGTCATCTGAAGCCTTCTGAACTCTCTTTTAAGCTTCTACGTACAACAACGTACTACAGTGATGAAAAGAGTGAAAATAATGAGATGAATGAAGAAGAGTTAAAGCTTTTATCGGATGATAACTTTTTACTCAATCCCAATCTCAAACTAACACAGCATTATCGTGTTGAAATTTACAAAATTGCCGATCAAGAAGAAGATCATACGATTCTTCCTGATATAACACTCAGTGGAAATAAAACTTTAACCAAGATTATTGCAATGGTTGCAAAAAACCATGATGTCAAATATACCTCAAAACTTGAAGAAAAAATAATCGAAGATATTCAGATCAAAAAGATTAAAGCAGGTATTTTAGTAGGTATTCGCGATCAGAATATGTACAAAGAAGTCAAAAAAATTGTAGCTAATATTCGGGTGAATGGAATCATCGATCAAAATCAGACCTTTGTTGTCTGTCAAGGCGTGGATGAGATACCTTCCATCAATGATGATCTTATTTATCATTATAAGAAGAAAATCAATGCTAAAAGTACCGATGGCAAAATTGACTATGCTAAACGAGGATACGTATTAGCTGTCGATAAAGATGAATGTATTATTGAATATATTAAACCGCAGCTTGGGACTCCTGGACGTAACTGTAGAGGAGCATTTTTGCCAGTGAAAGAGCCTCGCAAGTCAAATGATACACCCATTGCCATTACGGCTAATCTCGTTAAAAAAGAGAGCGAAACCAGTATCAAATATATTGCAAATCGCGGTGGATATGTCAATTTTGACAAAGGCACTTATGATATACAAGATCAAATGGAGATCAACGAAATCAGCTTTAGATCCACGGGTTCTATCGATGCAAGTTTGGGATCAAACATCAAAATTAATATCAAAGAAAGTGATATTCTTAAAGATGCCATAGGTGCTGGTATGAGTGTTGAAACTTCCGAAGTGCATGTTCAAGGCAACATTGGAAGTGGTGCTAAAATTAAAGCAAAGATTGCTGAAATTGGTGGACAAACACATCAGAGTGCCTATATTGAGGCGGATAAAATTATTATTTCAGTTCATCGTGGTGAAGCCAATGGTCAAGACATCGAAATTGATCGTTTGGAAGGTGGAAAAGTCATTGGTCATACGGTGCATGTTAAACAGATGATTGGTGGCGAAATTATTGCCAATAGCGTTAAAATTGACAATCTTCTTTCCAATGCAAAAATAACAGCGTGTGATCTTATCGAAATAACAGAACTCAAAGGCAATAACAATAAACTTATTATTGATCCAAGTGTCACAAAAGAGTTCAATGAAATGATTGATACGATTAATGCTAAGATTGAAAAACTTGAAGAAGAGCTCAAAGCCTATCCAAGGCAACTCAGTTCTAAAAAAGAGTTTATCGATAAAAATAAACCTATGGCAGAGATGGTTAAAGATAAGATTATGGAGCTTAAACGTAATGGCGTTGAACCGCCTATGACACTTTTTGCTAAGATCAAAGATTTTCAAGAAAAAGTGATTGACTATAATACGTTTTTGCAGACCTTTAAAGATAAAAAAGAGGAGTTGCAAGAGTATCGTAAAGAGCTCAATCAAGTCCAAAATAAAGTGTTCTCTGCCAAAATCATTAACCATTCTGCTTGGAAAGAGTTTAATGAAGTACGATTTAGACTCATTTCTCCACCCAAAGATATTACCTACAATCCAAAAGAGCATGAAATTGTACGTGAAATTACGCTCAAAGATATGGGCAATGGTGAGCATAGAGTAATGCGTTCCGCGGAGTATAGTAGCAAATGATTGTGGGTATTGAAGGAAAAGTTGTTAAAAAAGAGGTGACGTTTGTCCATATTAAAACAGCTGCAGGACTAACGTATAAAGTGTTTGTTTCATTATCGTGTCTTGGAAAAATCAGCAGTGAAATAATCTCTTTACATGTAAGCCAAATTATCAGAGAAGATCAGCACAGTTTGTATGGTTTTATCGACGAAAATGAGAAAAAAGTATTTGACACACTTATTAAACTAAATGGTATTGGACCATCAACGGCTTTAGCGGTTTGTTCGACGCTCAGTCCCGATGATTTTGCACAAGCCTTGGTGAGTCAAAATGTTCAAGCTTTTCAAAAAGTTCCTGGCATTGGACCCAAGAGTGCCAAACGTATTTTAGTAGAACTGAGTGATTTTTCGTTGCAACTTAGTTCTGATGAGCATAACTCGAGTAGTATGATTGAAGCCTCTTTAGCCCTTGAAAGTCTTGGATTTAAAAAAGAGATGATTAAAAAAGCATTGAGTACCTGTCAAGGGGTAGATACTCAAACACTTATCAAAGAAGCTCTTAGAAAGCTTAGTTAAATAAAGGATGAAAATGACTATAGCCGTTTTGTTTGGCGCCCAAAGTTTTGAGCATGAGATTAGCGTGGTAAGTGCCATTGCTCTCAAAAAAGTGCTAAAAAGTGACATTGTTTATATTTTTTGTGATTATTACCGTAATTTTTATTTGATTCCAACCGATAAAATTACTTCAAAACGTTTTAGCAGTGGTGAATATAAAAAAGATAAACTGCTTTATCTTAAGCAAGGTGGTTTTTACGCTAAAAAGATGTTGGGTGAAGAGAAAATAATCTTCGATGTCATGATTAACCTCGTGCATGGCATGGACGGAGAAGATGGAAAACTCAGTTCTATGCTTGATTTCTTTAGTGTACCCTATATTGGGCCTCGGATGGAAGGTAGTTGTATCAGTTATAACAAACTTTTTACAAAGCTGTATGCCAAGGAAGTGGGTGTTAATGTTTTAGACTACCAAGTGCTTCGTAAAGGAAGTGGTGAATCAATTAAAATTGCTTATCCATTTATCGTCAAGCCTCTACGACTAGGAAGTTCCATTGGAATAGGGATTGTTAAAGAAGAAAAAGAGCTTGCATACGCACTTGATGTTGCTTTTGAATTTGATGATAGTGTCCTAATAGAACCGTTCATAAGTGGTGTAAAAGAGTACAATCTTGCAGGTTGTAAGACCGATATATTCCACTTTTCTATTATAGAAGAGCCTCAAAAAGAAGAATTTTTAGATTTCGATAAAAAATATTTAGATTTTTCTCGTACAAAAAGAGTGAATGAGGCTACACTAGATATCAAAGCAGAAGAGGGAATTCGAGACGCATTTATGAAGCTTTATGATCCACTTTTTTTAGGTGCACTTATTCGTTGTGATTTTTTTGTGATTGATGGAATGACGTATCTTAATGAAATCAACCCAATCCCTGGTAGTATGGCCAATTATCTTTTTGATGATTTTGACAGAATAATTAAAAATCTTTCAAAATGGTTGCCAAAAAGTATCACGATTCCAAAAGAATACCGCTATATCAACTCCATTCAAGCAGCTAAAGGTAAGTAAAAGTCTTGTTTATGTAGGATATTACATAAAATTTTGCACAAAGGCTCGAATCGATGACCTATGCTAAAAACGAGATAATGACTGCAACAGACATGGTGCGCAACTTTAGTTCTGTGTTGGGTAGTATTACCAAAGGGAAAAGTAAACGTGTGGTCATCGTGAAAAATAACCGTTTTGAAGCGGTTATGATCACGGTGGATGAATACGAAAAAATGAGTGAAGCCGTTAATATTTTGGAAAAAATCTATGCCAACACGAAAAAGAAGAGTGATGGCTAAGAAAGAGATCGTATTTCAAAATACTTCTTACGAATTATCGTATGAGTTGTTAAACCAAAACCAACCGCAAACCATTCTTTTTTTACATGGTTGGGGCAGTAATAAAGAGATTATGAAGCAAGCGTTTGGGAAGACGTTTTCTCAGTACCAACATCTCTATCTTGATCTTCCTGGTTTTGGGCACTCTTCAATTCATGATGTCATCACTACAGGAACTTACTCAGACATTGTATCTGTCTTCTTAAAAGCGTTACATGTAAAACCTCTTATCATTGTGGGACACTCGTATGGTGGTAAAGTGGCAACATTGCTGCAACCAGAGGTCTTGGTTTTGCTCTCAAGCGCTGGTATTGTTCCACCAAAATCTTTGAAAGTCAAACTAAAAATAGCGCTTTTCAAACTGCTAAAACCCTTTGCTCCACGCTCTTTTTACCGCTTTTTTGCGACCAAAGATGTGGAAGGGATGAGCCAAACCATGTACGAAATTATCAAGCGGGTCGTCAATGAAGACTTTAGTGAGCAATTTCTTACATGTAAAGCAAAAACGTTTCTGTTTTGGGGCAAAGAGGATAGTGCAATGCCACTTTTCTGTGGAGAAAAGATGCACTCTCTCATTAAGGGAAGTCACTTTTACCCGATGGAGGGTGATCATTTCTTTTTTATGAATCAAGCAAAACAGATCGAAAAGACACTTGGGGAGTTTGGATTTTGAGTACTTATAGATTGATTGTAACAGGACGCGTACAAGGTGTTAATTTTCGCCGATTTGTTGTTGATATAGCGCATGCATTGAATTATGTGGGGTATGTCAAAAATAGTGCCGATGGCAGTGTTGAAGTTGTGATTAACTCTGCTTATGAGGAAGATTTGGAATTTTTTATTAGTAAACTCTATGATGGTTCAATGTTTTCAGATGTTCAAGACGTTACATGTCAGAAGATTGAAAGTATGATCTTTGATGATTTTGAGAAGAGATAAAGAGCATGGAAATAATGGTTACAAGCATCACTCACCTCTGTTTTATTTTAGGGCTTAGCTACTACTTTATAATAGCAATGCAATGGTACAGTTATAGGCTAGAGCGTATTTTATTTCACTACAATCGTTATGATTGGCACGTGTTCTACTTTTTAGTTCCTTTAGTAGGTTACTACCTTCTAAACGGTGTTGTACTTTCTCTTTTTGTTGCACTTTTTTTGATCTCCCTTTTCATATGGCAGAAAAAAATGGACAAAAAACTTGTTTGGACAGCACGAGTAAAGCGATTTTTTCTCTTTTTAGTTTTAGCGACTCTTTTTCAAGATCTTTTGTGTACGGTATTGGTTGCATCATGCTTGAAGTTAGGTGTCATTATTCCTTTAATGGTGGCACAAATAGCAAGTATGTTCTATGAAAAGATGCTTTTTCTTAGCTTTAAAAAAGAGGCTCAGAAAAAATTGATGGCAAACAGTGCGTTAAAAGTAGTTGCCATTACAGCGAGTTATGGTAAAACCAGTATTAAAAACTTTTTAGCACAAATTCTTTCCACAAAGTTTAATGTCTATAAAACACCACGTAGTGTCAATACAATAGGTGGCATTATTAAGGATATTAACAACGATTTGCCTGAACAATGTGATGTTTATATTGTTGAAGCGGGTGCTCGAGCTCGTGGAGATATTGATGAAATTGCGAGACTTGTCAATCCACACATTGCCGTTGTTGGGTGTATTGGCGAGCAACATATTGAGTATTTTAAAACATTGGAAAATATCCGAAATACAAAGATGGAACTCCTTCACTCCTCTAGGCTTGAAAAAGCATTTGTGCATGAAAGTACCAATGTCAAAGGATCAGAATCTATTCTTTCTTTTGGCGCTGAGCTTAGTGATGTTGAGGCTTCTTTGCAAGGGCTAAGTTTTTCAATGCTCCTAAATGGCGTAAAAGAGAGTTTTACATGTAAACTTTTGGGTGCTTTTAATGCGATTAATATAGCAGCAGCCATTCACGTTGCACGCACACTAGGGCTCTCTATTGAAGAGATTAGGTCTGCCGTATCGCATTTAGAAGGGGTTGAGCATCGTCTTCAAAAAATTGAAGCAGGTGGAAAGCTCATTATTGATGATAGTTTTAATGGTAATTTAGAAGGCATGTTAAGTTCCTATAATCTGGTTTCACAGCATCAAGGGCGTAAAGTGATTATTACGCCAGGTATTGTTGAAAGTACCGAAGAAGCGAATAGGATACTTGCCAAAAAAATTGATGATGTCTTTGATCTTGTCATGATTACAGGTAAAATAAACGTTACTATTTTACACGATAACATTCATAAAGCTCAAAAAATTATCATTTCCGATAAATCAAAGCTCCAAGAGACCTTATCAGAACAGACATATGCAGGAGATGTGATACTTTTCTCCAATGATGCACCAACTTTTCTCTAATAATCGTATCCTTTTTATATTCCTATTATCTGTTGTAAAGTATGATGGGCAATCAATTTAAAAGGAGTGAGATGAAAAGTAAGAGTTTGTATATCTCATCGCTTGCACCTGCCGCAGGCAGTTTGATTGTAGCGATGGGCATTATGGAACTATTAAAAGGGCGTCTTGGTAAGGTTGCTTTTTTTAGACCGGTTATTTTAGATGCGAATGAAGTTGATAAAGACATTGATTTCATGCTGGAATATTATGCCCTTAAAATGGACTATAACGCTACTTATGGTTACACTGTTCATGAAGTTGAAAGTTTAATTGCTGAAAATAAATACAATGAAGTTTTGGAAAATCTTATTGATAAATTCAAAATTTTAGAGAGCCAGTATGACTTTGTACTCATTGAAGGACTTAACCAAGCGAATTTTTCACAAACTCTTGATTTTGATATTAATCTTTCAATCGCTAAAAACTTGAGTAGCCCTTTTATCAGTGTTTTAAAAGGTAAGCAAAAAAGTGTTAAAGAGGTATTGGATGAAATATCTATCGAGGCAGATGCCATTAAGGGTGCTGGATGCCAACATTTTGCGACATTTGTCAATCGTTTAGGTGACCAAGAGGTTCAAGAGCTTAAAGAGCTCAATCGCGCTAAACCAATTCAAAATGTTCCTGTCTATTTTTTACCTGAGGTGCCAGAGCTTGACACTCCGACTGTTGCTGAGATAAAAAATAAGCTAGGTTGTTCCCATATCTATGGTGAAGAAAAAGATTTACGTAGAGTTGTCAAGCAGAGTAAAATCGCTGCAATGAAACTTGATAATTTCTTAGAATATATTGAAGATGGTGACTTGGTCATAACTTCAGGGGATAGATCAGATATTATCGTAGGATGTCTTAGTACCGTATTTTCTAATAATTATCCAAATATCTCCGGCATTTTATTGACTGCAGGGATGATGCCCCATAAGTCCATTAACAAACTTATTGCTGGGTTTAAAGACCTTTCCATTCCTATTTTAAGTGTCGACAATGGTACTTTTGATACTGCTGTCAATGTCTCCAATGTTCCAGCGACAATTACACCACAAAGTGTACGCAAAATTGCATTGGCTATGGGACTTTTCTCTTCAAATGTCAATATTGAAGAAATCGAAAAAAGTATTGATACTGAATCCACCACGAGTTCTATTACTCCAATTATGTTTGAGTATGCTCTTTTTGAACGCGCAAGGAGAAATCGTAAAAAAATTCTTCTTCCTGAGAGTAATGATGAGCGTATTCTTCGAGCAACGGAGATTTTATTACGTCGTGATGTAGCTGATATTATCCTTTTAGGTGTTGAAGAAGAAGTACGTCGAAAGAGTGCAACACTTGGACTTGATATAAGTAAAGCGACTATCATTGATCCTTTAACATCACCTCTAATGGAAGAGTTTGTCACTTCTTTCTATGAAATGCGTAAAGCCAAAGGATTGAGCTTAGATGTTGCTCGTGATAGTATGATGATGAAAAATTATTTTGGAACAATGATGGTTTACCTAGGTTATGCCGATGGCATGGTCTCAGGTGCGATTCATACAACCCAAGAGACGATTCGTCCAGCACTTCAGATTATCAAAACCAAACCAGGTATTTCCATTGTTTCAAGCCTTTTCTTTATGTGTTTAGATACAAGAGTCTTAGTTTACGGTGATTGTGCCGTCAATCAAGATCCAAATGCGGAAGAGTTGGCTCAAATTGCTATTTCTTCTGCAGATACGGCTAAGATATTTGGTATATCTCCAAAAATTGCAATGCTCTCGTATTCTACAGGTGATTCAGGGAAGGGTGAAGAGGTTGAAAAAGTGCGTTTAGCCACTAAAATCGTTAAAGAAACACGACCTGATCTTTTGGTAGAAGGACCTATTCAATACGACGCAGCCATTGATCCTATTGTTGCTAAAACAAAATTACCGAATAGTAAAGTCGCTGGTGAAGCCACCATCTTCATCTTTCCAGATCTTAATACTGGTAACAACACCTATAAAGCGGTTCAAAGAAGTTCAGGTGCTGTTGCTATTGGCCCTGTACTCCAGGGACTTCGAAAACCCGTTAATGATCTCAGTCGAGGTTGTTTAGTTCCAGATATTGTCAATACCGTCGCCATTACAGCTATTCAAGCACAAACCAATGATGGAGCTAATAAGTGAAAATTTTAGTCTTAAATGCGGGTAGTTCTTCTGTCAAATATCAACTCTTTAATATGGCCAATAATGAGGTTTTGGCCAGTGGAGTGATTGAGCAAATTGGCGAAAAAGAGTCAATGGCTAAAATAAAATATAAAAAGCCAGCTGGTGATGAGCAAAAAAGAGAAGAAAAATGCTCCATCCACGATCATGATGCAGCCTTAACATGGATGAGTGAGGCATTGATCCAATCAGGTGTTATTCACAATCTTAATGACCTTGATGGGATTGGTCATCGTGTTGTTCAAGGTGGTTCTTCGTTTCAGGAACCAGCAATGGTTGATGACTATGTGATGTCAGAGATTGAGCGTTTAATTCCACTTGGACCATTGCACAATCCAGGTCATCTTGCCGGTATGAAAGTTTCGGTACATCAAAGCCCTAATGTTCCCCAAGTGGCTGTTTTTGATACCGCATTTCACTCGACATTACCTAACTATGCTTACATGTATGCTATTCCTTACAAATACTATGAGGAATTACGCATTAGACGTTATGGCTTTCATGGAACTTCGCACTATTACGTTACCAAAGTAGCTGCAAAATATTTGAAGCAAGACATCAATACACTTAATGCCATTACGCTTCATTTAGGAAATGGTGCAAGTGTTGCAGCGATTGAAAATGGTCAAAGTGTTGATACATCAATGGGCTTAACACCACTGGAAGGGCTCATTATGGGAACACGAAGTGGTGACCTTGATCCAGCTATTCTCTTCTATTTAGCACGTAAACGTGGACTTACTCTTGATGAACTGGATAAAATGCTCAATAAAGAGAGTGGATTAAAAGGAATATGCGGAAGTAATGATATGCGTGAAATTACACGTATGGCAGAGGAGGGTGATGAAAGAGCACAGCTTGCCTGTGATATGTTCAATTATCGCCTTAAAAAATACATTGGTTCTTATTCTGCTGTTCTTGGGCGGGTAGATTGTATTGTTTTTACAGGAGGTATTGGCGAAAATGCAAATGATGTTCGCCTCAAGTCTTGTGAAAAATTGGAGAATTTTGGCATCAAAATAGACCCAATACTCAATAGTGTTCGATCAAGTGAAATTAGAACGATTAGTGCAGATGATAGCAAAGTAAAAGTTTTGGTTATTCCAACCAATGAAGAGCTTGAAATTGCAATAGAAACTTTGGAAATGATCCAAAAGCATCACTCGTAAAATATGTGTTTACATGTAAAGATTTTCTTTACATGTAAACCTTCTTTTTTAGCTCAAAAGTTCTTTCGCAATCGTATTAATACGTTTACCATCAGCTACGCCTTCACATTCAGCAAGAACTCCAGCCATGATCTTACCAATTTCTTTTAAACTCGTTGCTCCAGTTGCAAGAATGTGCTTTTGAATGATGACTTTAAGACTTTCATCACTGAGTTGCTGCGGTAAATAACTTTTTAAAATCATTGCTTCTGCCAACTCTTTTTCATAAAGATCTTCTCTTCCTGCATCTTTAAAAGCAGATAAGGCATCTTCGCGTTGCTTGAGTGATTTTTGGATAATTTTTACAATATCAGAATCACTAAGCTCTTTACGTTCATCGACTTCAATTTGTTTAAGTGCACTCATCAAAAAACGGATAACATCACGTTTAAAAGTATCTTTGGTTTTCATCGCATCTTTTAGATCATCTTGAAGTTTTGACTTGAGTTCTGACATCTTTTTTCCTTAGAACATAATTTACTTTAGAGGTGAGTATTTTATAGTATAATAGTAAAAAATAGAATAAAACGGAGTCGTATGAGAGTTTTTACATGTAGCTTTGTAGCCGCTCTTTTTTTGGCAGGCTGCTCAGTACATCCAGCTGATCCAAAAATTAGTATGAAAGCACCTGTATATGTTGATGAAACGCCTTCCAAAGTGAATGAAGTGATGCCAACAAATCCAGGTAGCCTTTTTGGACAAGGTGACAACCCTTTATTTGCCGATCTTAAAGCAATGCATGTTAATGACGTTGTAACTGTTACCATTACTGAAAAAACAGCACAAACATCTACGGGTAAAAAAGCTTTAACAAAACAAAGTAGTGATTCTCTTGGTGCTGGTATTACCACTGCTGCAGGTGGTGGTGTTTTAGGTACAGTATCAAAAAATTTAAATGATGTTGGTAATATTGGTTTTACGACAGGATCCAATAACTCTTTTACAGGCAATGGAAGCAATACTCGTAACGAAACCTTTAGTACGACTATTTCGGCACGTGTCATCAAGATTTTAAATAATGGACATTACTTTATTGAGGGTAGTCGCGAATTATTAATTAATGGTGAAAAGCAAATTATTCAAGTGAGTGGCGTTATTCGACCTTATGACATTGATAAAAATAACAACATTGATTCAAAATACATTGCGGATGCAAAAATTCTTTACAAAACAGAAGGTGATATAGATCAAACTACGACTAAGCCATGGGGTGCCAAATTTATGGAAACCATTTGGCCATTTTAATGTAACTATACTAAAGGATATTTTTTGGAAAACAGTAACCCACCTAAGTTTATTGAAAGAGACAAAATTTTTAAGGCAAAAGATATTATTGTTGCACTTAAATATTTTGGTGTCAGTTTTGATAAACTTAAAACAAATACACCCAATCGTGCTCGTGCCATCGTTCTAGGCTATAAAGCATGGCGATTAGGATTGAATGAAACACAATTACGTTCTGTCATTGAACGCAAAATTGATGATAAAGAAATTATTGAGATTCTTGAATATAAAGAAAAAAAAAGTATTCGAAGTTGGAGTATTTTTACTAAAATTAAAGAAGATGACTATAAAATTAAAGTTGAACGTTTATGGTGTAAAAAGCTAGGAGCACTTTGCTTAATTGCAAAAATAGGGCAAAAAGAACTTATTACGCTCGCATGCGAAACATTCAAAGATCAATTAGATTGCACAATTCCTAAAGAGTTTTAAACCCAATTTGCCTTACATGTAAAACCATATAAGGCAAAATTCTCTCTTTTATGATTTAACAGCTTTTGCCATATCCCACATAGGCATGAAAATACCTAAGGCCATCAAAAGTACCATAGCTGCAATAAAGCCCAAAAGAATAGGCTCAATATAACTTGCGATATTGTCGATAATTTGGCTAAAACGTGATTTGAAATAAAGGGTGACTTTTTCCAGCATTTTATCCAGAGTACCACTTTGTTCACCTGCTTGAATCATTTGAATAAGCATACCTTCAAAAAGTCCTGTATCACGAAATGATTCTGTTAAAGAAATACCTCTTTGTACTGAGATCTTAACACTAGAAAGTTTCTTTTTAAGATGGGTGTTTTCTAACGTTAATAAAGCTGTATCGAGCGCATCTGCAATAGGAATACCTGCACGAATAAGTTCCGTAAAAACAAGGCAAAAACGGCTCAGTGTTGCATAAAAAATAATATTTCCAATTAAATAGACCTTAAGAATATAGATATCAAATTTTTTCTTAAAGTCTTCATTATTTTTGAGTAAATACTGAAATAGTAGAATAGTGCCAATAATGCCAGATAGAAGGTAGAGACCATAATGATTAATAAGGTTTTCCATAAAAAGAAGGATTTTGGTTGGTAGGGGCAATTCCGCTTTAAGTTGAGCAAAAATTTCTTTAAATTTTGGGACAACATAGATCATAAGTATTGAAAAAGCAACAGCAATTGCAATAACAACCGTAATAGGGTATCGCATCGCTTTTTTAAACTTTTGTCTATTTTCTTCAATTTCTTCAAGAATTTCTGAAAGTTTTTCTAAAGATTCTGCCATATTTCCTGTACTCTCACCTAACTCAACCATAGCAAGGGTCACGTCACCTACTTCATTGCGATATGTCATCAATGATTGCGTAAGGCTAAGTCCAGAGTTTAGATCATCATTAACGCTGTTAAATATTTCCTTCAGTGTCTTGTCTACGGTAGCATTTGCAACTTCTTTAACACTATCATGAATAGAAATACCAGCATTGGTCATAACACTGAGCTGTCTCATTGCAGCAATAAGGTTTGGCATTTTTATTTTTTTGCGAAAGAGAGCATTCATGATTTGATCTTTTAATTCACCTAACTGATCTTCAAGGGGTGCCGAAGTTTCTTTAATATTTAAAATTACGCCTGGTATTTTTAATTTTGCTATTGATATTGCATCATTTCGTGTGGGCGATTTGACAACGGTTTTAGTCTTTTGCCCTTTGAATAGGTAGTTTACTTCAAAATATTTCATTCTTTTGCCACCCTTATGATTTCATCTAGCGTTGTAATGCCATGTGCAGCTCGGGTAATACCATCTTGGAACATATCGACAAAGCCTTCTTTGATTGCCTGCTCTTTAATATCACTTTTACTGCCACCTTGTGCAATCATACTTGAAATTTTTTCACTTATAGGAAGAATTTCAGAGATCATTTCACGCCCTAAATAGCCTGTTTGTGAACATTTTTCACAACCATTATTTTTATAGAATTGAAAATTTTCAGGAAGCATATCTTTAATTTCATCATGAGCAGTTTTTGGTAGTGTATATTTTGTTTTGCAATAAGGGCACAATTTACGTACTAGCCTTTGTGCTTCAATCGCAATTAATGATCCACTAATTAAATACGGTTCAATTCCCATATCTACAACTCTTGTAACGGCACTGATTGAATCATTTGTGTGAAGCGTTGAAAAAACCAAGTGACCTGTAAGTGCTGCTTGCACTGCAATACGAAGGGTTTCAGTGTCACGAATCTCACCAATCATAATAATATCTGGATCTTGTCTTAAGATAGAACGTAATGCCGTTGCAAACGTTAAATTAGCCTTTTCATTAACTTGTACCTGTTGCGTTAAATTGAGTTGGTACTCCACTGGGTCTTCGACGGTAATAATTTTACTTTGCACACTTTTAATAGCATTTAAAGCCGCATAAAGCGTTGTGGTTTTACCACTTCCTGTTGGCCCTGTAACTAAGATAATACCATAGGGAGACTTCATAGCTTGTGCAAACTTAATGTAGTTATTAGGATGCATACCAAGATCTTCAATTTTAATCATCACTTTAGATTTATCTAAAATACGAATTACAATGGATTCTCCATTAATTGTAGGCAATGCAGAGATCCTGAAATCATACTCACGGCCTAGAATCGTAGCTGAAAAACGACCATCTTGAGGCTTTCTCTTTTCAGCAATATCCATATTGGAAAGCAACTTCATCCGTGAGCCCAGAGGAGGGTATATATCTTTATCAAAAATGAATGTTTCTGTGAGCATTCCATCAATACGGCTACGAACAATGCAATTGTTCTCTGTTGGTTCAATGTGAATATCACTAGCACGTGCCAAAATAGATGTTTTAAGGATAATTTCTATGAGTTTTAAAATCCCAGAAGATTCTTGTGGATTTTCAGCAGCACTGGAAGTAATCTCTTTGCGAATTTCGCCAATGAGTCCTTTGATACTTTCACCTAACTCCATTTTAATCAGGTATTTATCAATTTGTGTAGGCTCTGCAATGATAACTTTGAGTAGTTTTCGTGGAAAAATTCTTTGAACGCCTTCTTGAGCATTGATGTCCAATGGATCTTTTAATGCAACAAAAATATTAATTTCATCTTCTCTAATAGGGAGTGCTTTAAATTTCTTGAGTTGAGCAAAAGGAAGTTTTGAGGCAAGACGATAATCAATGTCTATTGAATCAAGATCAAGGTACTCATAATTAAGATTTTTAGCCAAAGATTGTAAAAACTTTTCACTGTCAATTGCAAAATTAGCGTTAACATCTTCTAAATTTAGATACCCTTTTTTAAAAAATTCAATGACTAAAATAAGAAGATCTTCTTTTGTAAAAAAATTATTTTCTAAAAGAATTTCACCTAATACTTTATGATTATTCTGTTGTGTTTCTTCTCTTATTTTAGCAGCACTATGTTCATCAATTATACGGTTGTGAATAAGATATGTTAAAAGTGTTGTTGTAATATCATTCATTTTTTACCACTATTTATAAATTTTTCTATTGTACCATTTATTAAAAAAAGAAAACAAATTCATTTAGTCAATAACTTCACCTACACTAATTTGATTCAGTAGACTTTGAGCTGCTTTTGATTTATTACTTTTTAGGTAGGCTTGAAGCGCAAGGACAGCATCTTCTTTATGACCAAGTTTAACTTTAGATTTAGCAAACCAAATCCAACTTTTTTCACTATCAGGATCTGCATTGTTGGCCATAAGAGCCCATTTATTACATTCCGTATAGTTTTTCATTGCATAATATTCTTCTGCGAGCATTAATGCAAAAGTAATGTTGTGTGTTTTCTCAAATTTATCTTTAAGATATTGTACAGAATTTACCTCATGCGTCTCAATTTTTATAAGACCCTTTGGTTTTGGTTCATCTAGTAGAGGAGGGGGCAAAAGAGCTGTGTCAACTTTGTCCTCTTTATTGCGATAAAAATTTTCATCATCAATAGTTGGCATTTTACGCATTAAAACTTTATTGTCAAGTTCTTCTTCTTGAATACCAAAATTTGTTTTTTTCTCAGGAGTTTCTGGTATATAAGAGGACTTTTTTTCTGCTTTATTTCTATTAATAGTTGGAAGTTGGAGAAAAAGTGTTTGATTTTCTTGTTCATTCATCACAATAGGTGGAGTATATTTCTTCACACCAATATTCACATCAGGAACATTTAAGGGCTGTTCTTGAATACTAGTACTGTTAGTATCAAGAGGTTTTATTGTTTCTGCTGTTGTTTGAACATCAGATAATGACGTATTGGTATCTGTCCCATGAATAATCGTAGGATAAGAGTAAATGGCAATAGCCCCTATTAGTAAAATAATGACACTAATAATGATGTAATGTATTCTTTGTTTAAGTCTATATTTGAAGACTCTTTTTTCGAGCTCCATAATTTCTACAGCACTAAGCATGGATCATTCCTAGACTTATACCTGCCATTTCAATATATTTTACATGTAAAGCATTTGTTTTGATAAGGGAGGGTTGATGCATATCGTAATATTCCAAAATTTCAAAAAGTTTGTACATTAGTTTATTAATAGTTCTCAAATTTCCATCCGTAACTTTTAAAATAAGTTTATAGTGCTTATCTCTAAAAAGGTTGAGATATTCAAAGCGATTATGGAAGAGGAGTTTTTTTTCAATATATGTTTTAATTTCATTCTTTGAAGCATTGTCAATTTCAATTGTTTCCCAAATACGTGTCTGAAAATAGTCTTTAGCTAAAACATCTTCTTTTTCTGTTTTATGAACCGTAAATAAAAATTTAAATAAACGCGAATCTGCCATCAGACGAATTTTTTCAATCAAATCTGTTGGATAAAGCTGTGCTTCATCAATTAATACGGTAATAGATTGATCTGTAGATGTAATATGCTTGCTTAAGAGTGCTAAAAATTCATTGTAATTTGAAAATCCAGGAGATTTCTTTGCAAAAATATGCTCAAACAATGCATCAATAAAAACTTTTTCTTCAAAAAAAGGTCTTGGAAAAAAAATAATCCGTTTTTTTGTTCTTAAATCATTAAAAATTTTTTGGAGTAAAAAAGTTTTACCCGTTCCTGGCTTGCCATAAAAAAGGATTAACTTTAGAGGTTTATCTAAAGTCTGTACCAACTTATCATACGTTGTTGCAGACTTATCAAGATTGACATAATCAAAAACTTCACCATCAACAAAGATAGTTTTAAGATCACTGTAATGGCTATTCATTTATCTTTTGACTGTACCCAAGATCTTTAAGTGTTGCTTTATCTGTGCCTTTTGCTCCGATAATACGTGGTGTGATTACAAAAACAAGTTCATTGCTAGACAATGTATCAGCCGAATGTTTAAAGGCTTCTCCCAGCAGAGGAATACTACTGAGTAGCGGTACACTCGTATCTTGTTTAGATTTATTATTTGTTATAAGACCACCTAAAATAATCGTTGAGCCATCTTTAACTTTTACAACTGTTGAGAGTTTTTTCTCAGAAGTATCTGGTGCAATAACTCTAGGGTCAGTCGTTTTAACATCATCTGCTGTGTATTTAAACGTACTAACAGATGGATTTATCCTAAGAATAATTTCATTGTCTTCAGAAATTTCAGGAGTAATATTCAAAAGAACACCAATAAAAATAGAATAATTAGTATAGGTTGTTGCCAATGTAGCGGTTCCAGTTGTGCTAGCTGCATTGGTTGTTTCAGGAACACGGTAATTAACATTATCACCAATAGTAATGAGTGCTTGTTGATTATTCATTGTTAAGACTTTTGGACTAGATACAACTTTTGTTTCACCACTTGAACCTAGAAAATCAATCAGTCCTGCAATAGAAAAAGATACGTCATTGACAACTGTTAAATTATTGAGATTGCTTGATGCATTTGCAGTACCAGTACTGGCTGAATTATATAAAGTATTTGGATTGCTAGCAGATAATGTTCCTTGATTGCTGTTGTTAAAAATAGAGTTACTACCAATATTTAAAGAAAATTTACTCCAATCAACACCTGATGTATTACTATTATTTAATAATACTGAGACAATTGAAACATCAATTAACACTTGCCTATGAAGTCTCTCTTTAAGTACATCTATATATTCACTGACACGGTCTAATTGCCTTTTTGTAGCTGTAACAGTAATCATACCTGCATTGATATTGATAATAGGTGATTGTGCTTTATACGTTTCACCTCCCGTATTTAAAACGGAGGTAAGTTCTGTTGCAATAGTTTTCCAAAAATCAAAAGATTCATTTGAAGAAATTGTATTTTGACCTTGTGTTGCTGTATTTTTTGTGCCACTGGTACTTGTTTCAGTCGGTGTAGCATCGACTGAAGCATTGATAACCGCTTTACCTTCTCGAATTGATGAAATGTAATCTACTTTAAAAGATTTTGTTGATAAAGCTGAAATCTTTAAATAGTTCTTATCATAAGTATAAAAAAGATCATTATCTTGAATAATTATTTGAAAAACTTCATCCAAAGAGAGATTTTTGATATTAACCCCATTAAGATTTTTAGCAAGCACCTTTTCTGCTTCAGTATCTTTGACAACAATACTAAAATCACAAACTTCTGCAAGCTCGCCTAAAAGTTCTAACCCTGTTGCTTTGTTATTTGTTTTAATGTTAAATGTACGATACTCACATGATGAACGATTATTTTTTTCTGCTGCATGTAAAGGCAAAAGAGCAATCGTGAAAACTGCAACCAATGCAATAATTTTACTTGGAAAATATTTTAATATTGCTATCATCTTTTGTCCTTATTACAAGTTCTTTTTTTTCAATTTCATTCTGAAGAATAACACTATTGCGTGTAATTTTTATTAGCATATAAGATCCTATAAGATCATTTTTTTTGTACCAATTACCATTTATTTTTGCTTTTTGATCAAAAGTAGCTTCTAAAACATAAGTAGATGCTTGTGCGGTTGCATTGTTTTCACTCTCATTTTGTTCTGAAGAAAGAATAATAAAAGGATTTTCGAGCTTATCAATCATAATCGAATCAGCACCTGATCTTCTTTCCGCAATTTTTTCAAAAATTTTATCATACTCTTTGACTTCACTGCTCGCAGGCAATTTATCTTGTGCTTCTATAACAGTAAACATAAGCAATAATCCTAAAATAGGCTTCATTAAAGTTTTCAATATTTCATCCCCCAAACAGCAATATTGAATTGACCCTCAATATCTTTTTGACCTGTACAGTTGAGCTCATAAATATCAACAACCAATTCACTTTCTTCGATAGCATTCATATACTTCATTGTATTGGCGAAAGAACCATTAAAAGAAACTTTTAGAGTTAAAATTTGCTCTATTTTTTGAATACTTGGCTCGTTGATTTTATTTTCAATAACTTTAATACGAACAGAATATTTTTGTGCCAATTGGCTAATTGTATTAAGAAATTTTGCCCAATTTTCATCATTGAATAACAGATAGGAGAGATCTTTGAGTTTATTATCTACGTAATCATTCATATAAATTGTTTTTTCTAAAAGGAGTTTAGAATTTTCAATATCTGACTTAACTTTTTTAATTAAAAAAGTTGCATCTCCATCTCTCGATACAGAAGTCAAGTAAGCTTGTTCATCATGAAGCTTCTTTTCAATGTCTTTTGCACTTCGCATTGACTGATTCAATATTTTTTCTGTTACAGGATAAAGATAGCTGTAGAAAATAAATCCAATGACGACAAAGGCCATCGCAAAAATTAAGTAAATTTCGTTATTTTTTTTATCTTTAAAAAAAAGATCAATTTTATCTAAAAGTGAATCAATACTATTCATTGCTCAATCCTATAGAGATTTTACTGGTATAGAGTTTAATCTTGTCATCCTGTATAATCTTTTCCGTATTGACTTTATATTTTTTTAGAGCTGTTAAATCTTTAATAAATTCTGTAATCTTTTTTTCATTTTTATTTCGAACAAAAAAATCAAGTTGCTGGTTTTTGAACTCAATTGCCTCGACTTTACAACCATTTTGATTTGAAAGCTGAAATATTTCTAGTAGCATTAGTGCTTTCATAGGATATGATATTTTTTTATTATAAATTTCACTTAAAAGCTTTTTTCTAAATTCAAACTTCGTTGTTTCATTCTTAACCAAACCATCAACTTTTTCTTTCTCAGTTTTTAAAAGAGATAGTTGTTGCCTGATATCAGATGTCTGTTTGTATAACTCATTATACTCACTTGTCTGTTTTACAATTATCAAACTTAAAAAAGTATGGTATGCAAATTGATAAGCAGGATATGCGAGACTAATAATAACACTTGCTGCCATAATGCCAAGAAATTTTCCTGAAGCTCTGTATTTAAGAGGAGGGGGTCTTTTATAAATGGAAAAATTAAGATTATCATCTGGATTTTCAATATAAAGTTGTGCACTAAGCATCATCAAAATATGAATTTGATCAATATACCACTCTTTGGAATTAATCGCTATACTAAAATTAAATTCATATGACTCTAATCCGAGATAGCTTTTCCCATATTCTTCAATACCCGAAAAAGTACCAATCTCTGATCCTAAATAGATTTTATCAATAAAATCTATATTGTAAGAACGTTTTGTAAAAACTAATACATCATTAATATAAAGGAATATCTCACCAAAAAGCTGCATTAAGCTTTGTTGATATTGGCTATTGGTAGCTCTTAATCCCTCGTTAGTAAGAAGTTTATAGAAATCTTCTTCATCAACACGCTCGCCGATCAGTTCACAAAATTTTTCATTAATCTCTTTTAATGAATAATGAAGTGATTTAGAATAAACATATTCACCATTTTTGTAAATTGCTAAAAAAGCATCTGTCTTTTGGAAATAAACAAAACAGTGTGTTCCATCTGCTTCAATAAAATTTTTGCGATATAAAGCTTTAATAAGAAAAGGTGCAGAGGTTACATAATCAATATAACGTGTTTTTTCTTTAATGGGTGTCAATTTTGTATAAATAAGTGCTGCATCAATAATAAAAACATTAAAAGAGCGATTTTTAGAATCTTTTGATTCTGTTTCAATATAACTTATAGAGTATTCGATAGCAGAATCAAGGGCTAATTCATCATATACTTTTATTTCAATAGCATCTTTAAGATCGCTATCAGGAATATTCCGACTTACATCAATAGTAGCACTAATAACATCACGCGTTTGAAGATATGAGGTATAAAACGCATTCTTATCACTTTTTTCTAGTTTGTTAAGTTTGATTTCATTTTTTGAATAGGTATAAGAGACAAGTGATAAAGGGTCTATTGATACAATTTCTGAGCTAAAACGTCTATCCTCAGTTACGGCCATTTTAAAAACCCCTTTTTAAATAATTATATCTCTAGATAACGATGATATTACAGTAACTTTATTCTACTATATAACCAATTTTCTTCAACATTTCTTTATCTTGACTCCAACCCTGCTTCACGACAACAACAAGTTTCAGAAAGACACGTTTTTGTGACAAGGATTCAATGAGCAAACGAGCGCTAGAACCGATTCTTTTGATCGTTTGACCTTCTTTACCAATCACAATACCTTTTTGACTTTTTTTATCAACAATAATAGTAGCATAAATATTATCGATTGTCTCTTTTTCGTCTACCTTATCAATGATTACATCCGCAAAATAGGGGATTTCATCACTCGTATTTTCAAAAATAGCCTCTCGAATCAACTCTTTATAAATATCACGTGAACGCTCTGTTGTAAGAATTTCAGGGTCATAAAGATAAGGATGTTCTGGCATGTATTTAGAGATTGCTTCTAAAAGATAGGATTGTGAGATTCCTTTTTTAACTGAAACAGGAATGAGTGCTACAAATTTATCTTGATAGGCTTGATATTCTGTAATCTTATCAAAAAGTGCTGTTTGAGAAACACTGTCTGTTTTTGTAAGGAGAACCATGTGTGGAATATTGTTTTTATTGAGGGATAAAAAATCTACATAGTGCTGAAGCGCATCGCTTGCTGGAGCTAAAAAGAGAATAAGATCACAATCACCCATCGCTTTCATTGCTTCTTCAAGCATAAATTGGTTTAACAAACGTTCTTGTTCATGAATACCTGGCGTGTCAATAAAAATAATTTGTGTATTTTCGTGCATTGCAATGATGTTTAAACGTTTTCGTGTTGCATTGGCTTTGTGTGAAACCATCGCCAGTTTTTCACCTACAAGCCAGTTTAATAGGGAACTTTTCCCCGCATTAGGTCTGCCAATAACGGCAACATACCCTGTTTTTGTTTTTTCGCTCATTTCTCTTCTTTGGTTATAAAATATAGCGACTACTATCTTCGCTTTCTACGATGGCATCAAGTTTTGCATGGACGAGTTCTTTGGTTACATGTAATGTCTCACCTTCGTGCTCATCGGCTGTATAACTGATATCTTCTAATACTTTTTCTATAATGGTATGCAATCTTCTTGCTCCAATATCTTCTGTTTTTTCATTGGTAATTTGTGCAATTTTTGCAATGGAAGCAATAGCATCATCATCAAAAACAAGTTCAACACCTTCAGTTTTTAAAAGGGCTTGGTATTGGCGCAAAAGCGAGTTTTTGGGTTGTGTTAAAATTTGGTAGAGAACTTCTTCTGTCAATGAACTGAGCTCAACGCGCAGTGGAAAACGTCCTTGAAGCTCAGGAATAAGATCACTGGGTTTACTCAAATGAAAAGCACCTGCTGAAATAAAGAGAATATGGTCGGTTTTAATGCTACCGTATTTGGTATTGACATCACTGCCTTCAACAATAGGCAAAAGATCGCGTTGAACACCCTCTTTGCTTGGATCGCTTCGGTGAGATTGTGAAGAGCTTACCGCGATTTTATCAATCTCATCAATAAAAATAATACCACCATTTTGAGCACGCTCTCTGGCTTCACTTTTGACAGCTTCCATGTCCAGAAGTTTTTCGCTTGCTTCAATTTTAAGTGCCTCTTTGGCATCTTTGACTTTCATCTCTTTTTTAATATTACCCTGACCCGCACCCAAGATTTTAATAAAAGACTCTTGCACTTTAATCATTTCAGGTGGTAAGGATGAATCCCCTAAATCATTATTGTTTTGAGAAATCTCCACTTCAATTTTCAGATCATCGAGTTCGCCATCACGAAGTCTTTGTCGCATCTTTTCATAACTTTTTTCATAGTCAGCTTTTTTCTCTTCACTTACGCCTTTTGGAAGTGGTGGAAGCAGTTTTTCAATGATAGCTTTTTCTACATGTAAAGCAATATCATCTTGATTTTTCTCTTTGTGCTCTGCTTTGACAAGGTTAATGGACGCCATCATAAGATCACGTACCATTGACTCAACATCACGTCCCACAAAACCTACTTCGGTGTATTTGCTGGCTTCTACTTTGACAAAAGGAAGCGAGAGCATTTTTGCCATACGACGTGCTATTTCTGTTTTTCCTACACCTGTTGAGCCAATCATTAAAATATTTTTTGGCATCACTTCTTCTGCCATTTCACCTTCAAGCTTCAATCTCCGATATCGATTTCGAAGGGCAATAGCAATAGATTTTTTTGCATTAAATTGTCCGATGATATACTCATCTAAGTAAGCAACGGTTTGTTTGGGTGTTAAATTCATGCTTCTTTACTCTCTAAAACATATGTTTTGATGTTGTCGTTAGTATAAATACAAAGCTCACTGGCAATTTTCAAACTTTCTTTTACGATTGTTTCTTCATCAAGATTGGTATGGCGATCCAATGCACGTGCAGCTGATATGGCGTAGTTTCCACCACTGCCAATCGCTGCAATTTTGCCATCTTCTGGTTCAACCACATCTCCATTACCACTTAAAATAAAAATATGCTCACAATCTAAAACAATCATCATCGCTTCTAAACGACGCAACATTTTATCTTTACGCCACTCTTTTGAAAACTCAATCACTGATTTGTACAAATCACCTTTTTTTTGTTCCAAAATACCTTCGAACATATCAAAAAGGTTAAACGCATCCGCTGTGCTTCCTGCAAATCCTGCTAGGATTTTGCCACTAAAGAGTTTACGAATTTTTGTGGCATTGTTTTTAAGCACGGTATTGCCAAAAGTGACCTGCCCATCACCGCCAATAACTGCTTTCTTATCACCGCGACACGCAAGGATAGTAGTTGCTTCAAACATAGAATTTATTCTCCAAGAATAATAAGTGTTAAGGTAGCGTGAATACCGTGCCCTAACTTAATACTCACACTAAAATTGCCCGTTGTTTTGATCGCTTGCTCAAGTTCAACAGTTTTTTTATCAATTTCAATTTTATACTGCTCTTTAAGTTCGTGTGCAATTTCATCTTTGGTAACGGCGCCAAAAAGACTTCCGTTTGCACCCAATTTACGTTTAACCGTAAGTTTAAGTTCACCAAGTTTTTTTTCAATGGATTTAAGATTTGCAATTTCTTCAGCTTCTGCTGCTGCTTTTTTACGTTGATCTGATTCATATTTTCGCATGACTTCGTTGGTCGCTAAAAGCGCAAACCCTTTACCAATGAGGAAGTTTTGACCATAACCGTCTTTTACCTCTTTGATTTCGCCTTTTTTACCTAAATCTTTGACATCTTTAATCAATAATACTTTCATTTTTTTCCTTTTATCATGATCGAATATTTTTGGGAAATGGGGCATTTTTCAGTTTTTCACCGTGAAAGGCATCAATGCCGCCACTGAGATGGGCAATATTTGAAAAGCCCATTCTTCGTAAAATAAATATCATTTGTGCTGTGCGCCCACCGGTATGGCAGTAAAAAATCAAAAGCTTACTCGCAAGTTTTTTAAGTTCATCCATGTGTTGATGAATGGTTGAAGTCGGTAATAGCATGTCTGTACCTTTGATACAAGACAGTGAGAACTCATACATCTCTCGAATATCGACCAGCACAAAATCAAGTTTTTTTTGTGCTCTTAAATTCAGCATTACATGTAACTCTTCACCACTGACTTCACAGCTTCTCGTTACTGTTTTAACTAAATTCATCATGGCTTCTTCACTCCATTCATCATTGTTTGGCTTATCCACTTTGGTTTAAGCTTTGCCTTGATTTCTTCTGTTTTTACCACTAATGATAAAGCGTAGCGCGTTCAGCTTAATAAAGCCTGCAGCGTCTTTTTGGTTATACACTTCATCCTCTTCAAAGGTGCAGAATGCTTCGCTGAAGAGATTATTGGTTTTAGAATCTCTACCGATGACTGAAACATTCCCTTTGTAGAGTTTAAGTTTAACCGTGCCATTGACGGTCTCTTGCGATTTATCAATCGCTGCTTGCATCATCTCTCGCTCTGGTGAAAACCAAAATCCGTTGTAAATGGTTTTGGCATACGTTGGCATAATTTCGTCTTTAAAGTGAGCCGCTTCACGATCCAGTGTAATGCTTTCAATGGCACGGTGTGCACGAAGCATAATTGTACCACCCGGTGTTTCGTAACACCCTCTACTTTTCATACCGACAAAACGGTTTTCAACGATGTCAATACGACCAATGCCGTGTTTGCAACCCAGTTCATTCAGTTTTGCCAGCATCGTTGCGGGACTGAGTTTAACACCATTGAGTGCGACTGGATCACCTTTTTCATAGGTAATTTCAATGACTTCAGACTGGTCAGGTGCTTTTTCAGGGCTTACTGTCCAGCGCCACATATCTTCTTCTGGCTCTGCCGCTGGATTTTCCAAGACAAGTCCCTCATAAGAAATATGAAGAAGGTTTGCATCCATAGAGTAGGGTGATTTACCTGGTTTTTTCTCAATCACGATACCGTTTTTCTCTGCATAGGCAAGGAGTTTTTCACGAGAGTTCAAATCCCATTCTCTCCACGGTGCAATGATGACAAGGTCTGAATTCATACTCAAATATCCCATCTCAAAACGGACTTGATCATTTCCTTTACCCGTAGCACCATGACTGACACCGTCAGCTCCCACCATCGCAGCAATCTCTGCTTGACGTTTAGCAATTAAAGGTCTAGCAATAGAGGTGCCTAAAAGGTATTCGCCCTCGTAAATGGCATTCGCTCTAAACATAGGGAATACATAATCTTTTACAAACTCTTCTTTTAAATCTTCAATAAAAATATTTTCTGGTTTAATGCCCAAAGAGATAGCTTTCTTACGCGCTGGCTCTAACTCTTCGCCTTGACCGATGTCGGCTGTAAAAGTGACAACTTCACATTTATACTCATCTTGGAGCCATTTTAAAATAATACTGGTATCGAGCCCACCTGAATAGGCTAAAACTACTTTTTTGACACTTTTTTTCATAATACGAATCCTCACAGAATTTATAGAATTGCGTGATGGTAGCGCAACTTTGTAAAAACTTTGCTTATGCTTTAGGGGAGACTTTACATGTAAAGATAAAAAAGCAAGTTTAAATATTCTTTTGATAGAATGGGGATTAAATCCTAAGGATGGGGTATGCGAGTCGATAAATTTTTAAACAGTGTCAATATTACGAAACGTCGCGCAATATCGGAAGATATGTGCAAAAATGGCGTTGTCTGCATCAACAGTGTGGTTGTTAAGCCTGCAAAAGATGTCAAAGTCGGTGATATTATTACGATTAATTATCTTGAAAAAACAGTCAAATACGAGGTTTTGCAAATTCCTGAAGCCAAAACCATTCCAAAAACGAAACAAAACGAATACGTAAGGGAAGTGTGATGAATTACCAAGAGGCCTATCAACAGTTTAATGCGCTTTTTGAAAATGAGCTGAGCCCAGAAGCAGCAGCACAATTTTTAGTTGAGCTTTACGAGCGAGGTGAGAGTTTTGAGGAGATCGCCGCAGCTGCAAATGTTATGCGCGAGCACAGTGTTAAACTGGACATCCCGGAGCATTTGAAAAGAGAACTTATTGATATTGTTGGAACGGGTGGCGATAAAAGTGGCACGTTTAACATCTCCACAACAACGTCCATTGTTTTAGCCACTCTTGGTTGTAAAGTTGCCAAACACGGCAGCGGTTCTGCCACTTCACTCTCTGGCAGTGCCGATGTACTTAAAGCCTTAGGACTCAACCTTAGTTTAACACCCGAAAAACAGATTAAAATGCTCGAAGGATGTGGTTTTGTCTTTATGTTTGCGATGAATCACCATCCATGCATGAAGCATATTATGCCCATTCGTAGAAGTCTCTCACATCGAACCATTTTCAATATGTTAGGACCCCTTGCCAATCCTGCAAGTGCTCAAAAACAGATGGTGGGAGTGTTCCATGTGGATTACATTGATCGTTTTAGTCAAGCATTAAGGGAGCTAGGCACAACAAAGAGCATGGTGGTAAGTTCTCTGGATGGTTTGGATGAAGTGAGTATTACGGCACCAACACGTTACACGATGATTGAAAACAAAATCATTACAGAAGGTGAAATCAATCCAGAAGCGTTTGGTTTTACCTTTGCTCCACTTGAGGCAATTAAAGGTGGTGATAGCATTCAAAATGCTGAAATTACGCGTGCGATTTTACGAGGCGATGAAAAGGGTGCAAAACTCGATGTTGTCCTTTTAAATGGGGCATGCGCTTTAATGATCGATGGAAAAGCAAGAGATATGCAAGAAGGAATTGAGCTCATGCGAGATGCCATTGAAAGCAAAAAAGCATGGGATAAACTGGGCGAAATTATTAAGCTCTCTTATCTTATATGAGTCAATGTTATGAAAAAGTGTGCGATTTAGCGCACTTAACTGCTTTTGCCGACGAGATAAAAAGTAAACTTGGAGATTCAGGCGTACTCTTATTACGAGGGAATCTCGCTAGCGGAAAGACCGCTTTTGTTAAAGCGTTTGCTAAAATATTAGGTATAGAAGAAGCGATCTCATCGCCAACCTTTTCGATTTTACATGAGTATGATGAAAAACTATTTCACTACGATATTTACCAATGCGGAAGCAATGGTTTTTTACAAAGTGGATTGATCGAAAAATTAGATGCTGAAGGTTACCATCTCATTGAATGGGGTGATGCTGAATTTGAAAAATTACTACACCATTTTGGTGTTGACTATAGTACAATAGATATTGAGACAATGGATTTAAAACGCAATTATAAGGTTCACATTAATGCATACGCTTAAAGTTCAAGAATTACAAAAAGTGATTAAAAAAACGGAAATTATTAAAGGAGTTTCCCTTGATGTTCAAAGTGGTGAAGTTGTAGGTCTTTTAGGACCCAATGGTGCAGGTAAAACAACAATGTTTTATATGATTTGTGGTCTTATCCCTCCAAGCTCTGGCGTTGTTTTTTTAGATAATCAAGATGTCACTCAAATACCTTTACATGTAAGAGCAAAACTGGGTATTGGTTACCTTCCTCAAGAATCAAGTATTTTTAAAGATTTGAGTGTGGAAGAGAACATTATGCTTGCTGCTGAAATTGTTTATCCCAATAAAGAAAATGCAATGAAGCGCGTGGAAGAGCTTTTAAATTTGCTCAATATTGAACCGATTCGTAAACGAAATGGCGTCAGTCTGAGCGGTGGTGAAAGACGCCGTTGTGAAATTGCACGTTCCTTGGTCTTAAAACCCAAATTTCTTCTTCTTGATGAACCTTTTGCGGGTGTCGATCCTATCGCTGTATCAGACATTCAAGGTATTGTACAAGAGCTTGCAAAATTGGATATTGGTGTTTTGATTACGGACCATAATGTACGCGAGACCCTCGCCATTTGTGATAGGGCATATGTACTTAAAGATGGAGCTCTTTTGGCGAGCGGTAGCAGTGAAGAAGTTGCTCAAAACAAACTGGTGAAGACCTATTATCTTGGTGAAGATTTCAGGTTTTAACCCACAAGGAGATTGGTTTTGAAACTTAGGGTTTCTAGCACACAAACAACCAAACAAAAGTTCTCCTCAACTCTTAGAGGTTGGCTTCCTATACTTCAAGCCAATTTAGATAGTTTGGTGGAAACACTGGAACCCTTTGTGCAAGAAAATCCTTTCATCAGTGTTAAATCAGGTTCAGAAACACCCGATAAACGTTTTGAAAAGAAAAGCTTTTTCTCCGAAGTTGCAAAAACATCGGTTTCTGATACCATTGAAGCACTGACTTTAGATAAAAAATCGCTCTATCAAGTCCTTAATGAGCAGGTAAATCCACCCCTTTTCCCTACTGAAAAATCACAACGAATTGCGTATGAAATTATTGAAAATATTAATTCAGAAGGATATTTTGAAGCTCAAGCATTGAGTGAGATTGCAAAAAAGTTGGATGTTAAAATCGAAGACGTGGAAAAGATAAGACAACGCTTTGCTTACCTTGAGCCTTTGGGCATTGGGGCACTTGATTTAAAAGAGACCTTTTTATTTCAACTCCAAGACCTTTCTTTAGAGAATGAACTCTATGACATGGTTGAAATGCTGATTATCAACTTTGATACTATCGAATCGTTCGGCAAAGAACCCCTGTTTCATGATGCCCTCGCTATTATTAAGCGGTTTCACAATCCACCCGCAATCGATTTTATGGAAGATGAGAAAGAGGTTATCCCTGATATTTTTATCTATAATCTTGAAGGTGCGATTGAAGTGAGACTCAATGATGCTTATTATCCTGAAGTGATACTTGATACGGAAGGGCTGGATGTTGATCATAGTTTTGTTTCTCAAAAAATTAAAGATGCTAAAGATCTTATTGATGCTCTTGAGATGCGTAAAGCAACGCTTTATAAAATCGGATTGATGATCGTAGAATATCAGTATGATTTTTTCTTTGGCAAAGCGATAAAACCGATGAAACTTAAAGACTTAGCCGATGATCTAGGGCGAAATCCTTCGACTATTTCACGTGCCATTGCGGGAAAATACCTTTCCTGTAGCCGTGGTGTGATTCCTTTAAAACAGTTTTTCGCAACGGCATTGGAAGAAGATATTTCCAATAGCGCTATTAAAGAATATATGATTGAACTTGTTAAAAATGAGAGTAAAATAAAACCTCTGAGTGATATTAAACTTTTAGAACTGATAGAAGGAAAATTTAACATCAAAATGGTGCGAAGAACCATTACCAAATACCGACAACAGTTTAATATCGCAAGCTCATCCGAGCGTAAAAAACTCTACACTCTTCAATTTTAAGCACTTTTTGCATCGGCATGTCTGAAGTGCATTTTAAGCATTTCATAGGCATTTTGAACCTCTTGAAACTTCGCAACATACTTTACATGTAAATCGCCATCGTAAAGCCCACAACTATCCGGATGATATTTTTTAACCAGTGTAAGATAATTGCTACGAATCATATCAAAGGAGTCTTCCATCTTACATCCAAGAACACCAAAATACTCTTCAAGTAAAGAGAAGAGGGCATTATGTCGGCTTCGTTTAGCACGTTTGGATACAAAACTCTTTTTATAGGCTAAAAAATCGTCCATAGAATAAGAAAACTCTACAAAACACCCAAAAAGCTCCTTTTGTGCCAAGAGTTTCTCTAAAAGACATACCGTAATCTCTGAGTTTGGATAGAGCGTAATGGTGCCATTCTTGGGGCGAATTCTGACTAAATGGTCTTTAAAATAGCTTTTCAGGTAGGACGCAAAAAGCGTGTTATGTGAACCCATATAAAAGACAACTGCATAATTGTCTTCAATGCAGAGTTGAATGTTCATTTTTTGAAGGACTTGATTGGATTTTAGAAGCGAAATCTTGATACTTTTATCGATTGTATTTTCGATGATACGTGCTTGTTCCACATTACCCGTTTTTCGTAGATAAATTTTACTGATCAATTTTAAAAAGTAACGGCGTTGCACTAACTCATCTTGCTCTTTAAAAATAATCATCTTATCTTTGCGTCCAATCTTCTTTTGAAAGTTTTTATCGGCAATATTTTGTAAAAAATAAAAGGTTTTTGAATCTTCTTCTATCGTAAGAGCTAGCATATCATGAGAGAGAGAAAGGTGCATCTATCCATCCTTTTGTATCAAGTTATTTACCTTGAAGCAAAAAGAATTCCAAGTTTTAAAACGTTATATCCAAATGGCATCTTCATCTAAATTTGGATTGCCTTGAAGCGTCTGATAAGGGGCATAGGATGCTTTATATTTAAGGCTTTGACACTCTTTGACGTAATAACCAAGATAGATCCATTCAAGATCATACTCTTTTGCCAAAATAATCTGCTCGTAGATGGAAAGTCGGCCTAAAGAGAGTTTTTCAAAATCGGGATCGTAGTAAAAATAGATTGAAGAGATACCATCATCTAAGAAGTCAATGAGATCAACACCAATAAGCTTTTCACCCTGAAAATAAAGCACCTCTTTGCCAAAATTATGTGCGCCACTTACATAGAGTTCGTGGTAACTTTGTGGCTTCAGGTTGTAGTATTGCCAGCCTCGTTTTTGTTCCATATGCCGATGATATTTGTCATAAAGTTCAAGATGTTCCGTGCTAATTGTTGGTGCTTGAATGACATAACGAATACCTTCTGCTTTTTTGAAAACACGCTTAGCCGAACGTGAAAAATTGTAATTTTTAACATCAATACGAAGGCTTAAACATAATTTACACTTTTGACATTGTGGTCTTGAGTAGTAGTTTCCAAAACGTCTCCAACCGCGTTCAATCAGTTCTTGATTGAGCTCCATTGTTGCATTTTCAATGTACTTATATTCCATACGTGTTTTACAGTCATCAAGGTATGAACATTTCGTTTCTAATGTTGAAAATTCAATAATGCGTGAAAATGATCTCATTGATTAGTCTAGTTTTCTAACTCCAGAGTTTTGCATAAACTTTTCAAATTGGTCATGAAGTTTTTTTTCTTTTTCTTGCTTTTGTTCTATAATAAGTTTTTCTTGAACATTTTTTTCTTCTTGACTTATTTTCTCTTTGAGTAATTTAAGATCATCAAATAAAATTCCTTGCATTGCATATGCCTTTACATGTAAAAATAAACAAGGTTAGATTATAGCAAAATTTCGTGTGAAGAAGTTGCAATACATCTTAACAATGTTTAATTGATTGACAAGCGAAGCACAGAGATCGCTAGTTATTACATTGAGACGTATCGTTGATGTCTTTAGTGAAGAAAAGCGCAAATTGCTTATTGAACATTCAAGTGGGCAACTTCATAACCTGATTCAGTTTGTTTTTTACAGCGGACTGAGAGCAGGGGAGATAATAGGCCTTAAGTGGGACAATATAGACTTTGACAAAAACAAGATTGATGTCTATATGAGAGTCCGAAAAGGTACTGTTGATCTGCCAAAAGGCGATAAAATCCGTTTGATTGATCTGTTACCGCAAGCGAAGCAAGCGCTTCTTGTACAAAGACAGCTAACAGGTCTTTCGACCTTTGTGTTTCATTCAAGAGAAGGTAAACCCTATTTTAGTGAAACAAGTATTACGCAATCTATTCAAGAGTTGTGTAAAAAGCTAGGGATTGAAAGTGGTGGTGGATTACAAAAGATGAGGCGAACGCATAACACAATGCTTAAGCAATGTGGTTTACCTCTTGATTGGATTCCTCATCAAATGGGTCATGAAACGGATGAAGTTAATCGAAATCATTATACGGGAACTATTACGGTAGATGTAAGTAAAATTATTGCGTAACTGACATTTACTGACACTCGTACTGACACCGTGTATATTGTAGGTTGGGAATATCGTATTTTAGGGATTTAGGGTGGTACTGAGGCCGGACTAAAATATATGCTTCAACTACAGTAATATCAGTCTTTAATTATTTTAATGTGGACAAGTACTGTCAAAAGTACTGTTATTTTAAAGTGTTTCAACTATCAGCTTATGCGTGATCTAGTTTTTAGGATGGGAACATAATTGTTATATTATTTTAATAGTAAAATACAAAAATCACTTCATTCAGTTAAAACAATATAATCTGCGATTCGTTTTATAACCCCAAATTAAATACTTTTTCTGAACCAATACTCATTTAACTATAAAATATTTTAATTAATAAAAAAAATTGAAGTATTTAAAACTTATCTACAATTTCCCTACAACTTAAATGTATATTTACCATGAAAATATATAAATTTAGATTAATAATGATGAAAAATAGTTAGAAGGAGGATTATATAAGTTCGGTTAGTTATTGTTTTGTTGACTTACTGTTTGTAGTTTGCTCTCTTTAATTTTAGTGTTTTACTATTGATTAAGAGTATTGGTGACGTCTTTTTGTTTTGCTGTATGTTACCAATTGTTAAAAAACGAACAAGTAACTTTTTTTATGGATCTGTGGTAGTTACAATTATCATAGGTCTTCATGTTAGCGTTAATAAATAGACATCTTGCAAAACTCATTTGTATTCCACAGCAAAACCACGGTCAAAGTTGATTAATCCACATATCAAATTAAAGCGTAGATTGAATCGTTTTCTTCTGTTTCTGTATTTTTGGGTGAGTATCTGAAATGTTTTGATTTTAGCATTCACATGTTCAACACATATTCTAGCACTTGCTTTTTGTCTGTTTTCCTCTTTTTGCTCCTCACTCAAAGGATGAAGTTTGGAGGCTTTATGGGGAATTTGACAATGGCTATGTTCTTTGGCAATACCCAGATATCCCAAATCAACATAAACACAGGTCTTGGGCATCAAGGGGAGATTAGATTCTTGAAACAGTCTAAAATCATGTGTCGTACCTTTAGCGGTATGCACACACATAATCCTCTCCTCTTTATCAATCACAATCTGTCCTTTAAGGGTATGACGCTTTTGCTTACCTGAGTAGTACTCTCTTTGCTTTTTTTGGGTCTTTGACAAGGCGTTTCTGTCGCATCAATGACAACAAAAGAGAGGGCAACATCACTTTTATAGAGTTCTCTCTTGCTTGGCAATGAAAATCTACCAGACTTAATCAACACTTCTTCTACTTCACATACAATCCTTGAAGCCGTTGATTCACTCACACCATAATCAAATCCAATATGTTCAAGGGTGCGATACTCACGATAATAGCCAAGCATCAAAAGTACTTTATTTTCTGGTGAAAGAGATCTCCTCCCTCCTACACCTAATCCTTTTTTACGATTCTCATCGTATTGCCTAAACACCTCTATCATTGCGTTAAATGTCTCTTCATTAACGCCAATATGGCGCTTAAAATCTTTGGGTTTATGCTTTTGCATTTGTTCATATTTTTTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP039734|2670786:2729793|2674874_2675372_+|WP_087438593.1|DBSCAN-SWA METIIKGVSKPLLKNERKIGSEAPAIILEMLNGEMKVIGMMATKVQVMITLPFQNSLSKELLNIIEKYQEQAFIYLICSTKLEEEVNIENSSVDFVEFSKKFGVYIDETLCAKSLFIINKDGQFVYKEITKDVEDAFDLEMFETKLDEAIHFKKKGHVHETWMGA >NZ_CP039734|2670786:2729793|2678436_2680050_+|WP_167750571.1|DBSCAN-SWA MELNVHNVSVHHKKRLGIAGANLSAQEGEIIGFIGADGAGKSSLIHAIAGVIPFEGDVTFNGVTYHSPKEAEPLKASIGLMPQGIGLVLYELLTIDEHLRFFANIHNIYQDSVFEAYKLRLLKMAGLDAFQDRQAGKLSGGMMQKLSLICTLLHRPKLLLLDEPTTGVDPLSRIELWEILDEIRKSEGTISIISTAYMQEAAKMDRILLFDEQEIIAQGTSSELIDSVRSMSYVEGITTEEPCIHTLHATYCLSALEERHIEPSLEALFFVNALQKGRKMPLIEITNKEKTIDLPDIVMEAKGLTKIFGSFIANENVDMTLHKGEILGLLGANGAGKTTFIKMLLGLYPIDGGELTLLGKSIQTQEDRQALKASIGYVSQHFALYNDMSVEENLLYAASMRGITNDVAKSRIARYALELGFDEFLNSMPQELPLGINQRFSIASALLHEPVILFLDEPTSGVDSIARAKFWELLKLLKERWEIAILITTHYMSEAEFCDRVVLLRQGKKIADHTIAEFYAKHPNAQSFEEIFLEYYR >NZ_CP039734|2670786:2729793|2727677_2728235_+|WP_167750598.1|integrase|DBSCAN-SWA MRRIVDVFSEEKRKLLIEHSSGQLHNLIQFVFYSGLRAGEIIGLKWDNIDFDKNKIDVYMRVRKGTVDLPKGDKIRLIDLLPQAKQALLVQRQLTGLSTFVFHSREGKPYFSETSITQSIQELCKKLGIESGGGLQKMRRTHNTMLKQCGLPLDWIPHQMGHETDEVNRNHYTGTITVDVSKIIA >NZ_CP039734|2670786:2729793|2701529_2702969_+|WP_167750581.1|DBSCAN-SWA MEIMVTSITHLCFILGLSYYFIIAMQWYSYRLERILFHYNRYDWHVFYFLVPLVGYYLLNGVVLSLFVALFLISLFIWQKKMDKKLVWTARVKRFFLFLVLATLFQDLLCTVLVASCLKLGVIIPLMVAQIASMFYEKMLFLSFKKEAQKKLMANSALKVVAITASYGKTSIKNFLAQILSTKFNVYKTPRSVNTIGGIIKDINNDLPEQCDVYIVEAGARARGDIDEIARLVNPHIAVVGCIGEQHIEYFKTLENIRNTKMELLHSSRLEKAFVHESTNVKGSESILSFGAELSDVEASLQGLSFSMLLNGVKESFTCKLLGAFNAINIAAAIHVARTLGLSIEEIRSAVSHLEGVEHRLQKIEAGGKLIIDDSFNGNLEGMLSSYNLVSQHQGRKVIITPGIVESTEEANRILAKKIDDVFDLVMITGKINVTILHDNIHKAQKIIISDKSKLQETLSEQTYAGDVILFSNDAPTFL >NZ_CP039734|2670786:2729793|2693809_2695207_-|WP_088437348.1|tRNA|DBSCAN-SWA MFIYDSVQKKKVSFIPIQENEVKIYVCGPTVYDDAHLGHARSAIAFDLLRRVFIALGYHVTFVKNFTDIDDKIINKMKESGKSLEEITSFYIERYKNEMHALHVKDADIEPKATETVSEIISFVDEMLRKKVAYATSDGIYFDTSKDAKYLSLSHRNIEEDASQSRVEQKEEKKDQKDFALWKFSKANEPSYSAPFGVGRPGWHIECSAMIEKHLASSGNFQIDIHAGGADLLFPHHENEAAQTRCKSHQELAKYWMHNGFVTISGEKMSKSLGNSFFLKDALAIYSGEVLRFYLLATHYRANFNFSEEDLLNTKKRLDKLYRLKKRVFENIPSEVSAPFKEAMLEALSDDLNISKALASLDEMITSANEMLDLNPKDKALKATTHANLQWIELLLGIGLYNPYMYFQIGVSDAEKAEIEALIENRIVAKKAKDFAKADEIRTLLESKQIQLMDTAMGTQWEKVN >NZ_CP039734|2670786:2729793|2696734_2698666_+|WP_167750577.1|DBSCAN-SWA MGLFDKIKGHNTDTGTQNDGDVSEFKSIVIDTINVIKELKNVAIASHLKPSELSFKLLRTTTYYSDEKSENNEMNEEELKLLSDDNFLLNPNLKLTQHYRVEIYKIADQEEDHTILPDITLSGNKTLTKIIAMVAKNHDVKYTSKLEEKIIEDIQIKKIKAGILVGIRDQNMYKEVKKIVANIRVNGIIDQNQTFVVCQGVDEIPSINDDLIYHYKKKINAKSTDGKIDYAKRGYVLAVDKDECIIEYIKPQLGTPGRNCRGAFLPVKEPRKSNDTPIAITANLVKKESETSIKYIANRGGYVNFDKGTYDIQDQMEINEISFRSTGSIDASLGSNIKINIKESDILKDAIGAGMSVETSEVHVQGNIGSGAKIKAKIAEIGGQTHQSAYIEADKIIISVHRGEANGQDIEIDRLEGGKVIGHTVHVKQMIGGEIIANSVKIDNLLSNAKITACDLIEITELKGNNNKLIIDPSVTKEFNEMIDTINAKIEKLEEELKAYPRQLSSKKEFIDKNKPMAEMVKDKIMELKRNGVEPPMTLFAKIKDFQEKVIDYNTFLQTFKDKKEELQEYRKELNQVQNKVFSAKIINHSAWKEFNEVRFRLISPPKDITYNPKEHEIVREITLKDMGNGEHRVMRSAEYSSK >NZ_CP039734|2670786:2729793|2683640_2684300_+|WP_088437345.1|DBSCAN-SWA MNEEEKELSCAECGTLNCHKHDSRYPKFCLTTNVDEQMLEESLACYKEEEGMDRKIALAAADIEGKYYGQLTRVEEILAFARRIGAKKIGIASCVGLAAESKIFAEILKVNGFDVFMAICKVGSRDKCDIGLEEEQKIRPNTFEPMCNPILQAKYLNKAKTDLNVIMGLCVGHDSLFIKYAKATTTYLVVKDRVLGHNPIAALHLTQTYYKKLLTPKAY >NZ_CP039734|2670786:2729793|2712886_2714431_-|WP_096046632.1|DBSCAN-SWA MIAILKYFPSKIIALVAVFTIALLPLHAAEKNNRSSCEYRTFNIKTNNKATGLELLGELAEVCDFSIVVKDTEAEKVLAKNLNGVNIKNLSLDEVFQIIIQDNDLFYTYDKNYLKISALSTKSFKVDYISSIREGKAVINASVDATPTETSTSGTKNTATQGQNTISSNESFDFWKTIATELTSVLNTGGETYKAQSPIININAGMITVTATKRQLDRVSEYIDVLKERLHRQVLIDVSIVSVLLNNSNTSGVDWSKFSLNIGSNSIFNNSNQGTLSASNPNTLYNSASTGTANASSNLNNLTVVNDVSFSIAGLIDFLGSSGETKVVSSPKVLTMNNQQALITIGDNVNYRVPETTNAASTTGTATLATTYTNYSIFIGVLLNITPEISEDNEIILRINPSVSTFKYTADDVKTTDPRVIAPDTSEKKLSTVVKVKDGSTIILGGLITNNKSKQDTSVPLLSSIPLLGEAFKHSADTLSSNELVFVITPRIIGAKGTDKATLKDLGYSQKINE >NZ_CP039734|2670786:2729793|2675368_2675809_+|WP_087438594.1|DBSCAN-SWA MSEQTLLWLSANGYDANDLNFVGKYGNSALMKAVREANISVTKELIEAGVDLELKNIDGNTAIWNACFGGDFTCVELLVKAGIQLDNQNDNGVTALMYCASSGKEEMTKLLLASHADTTIANLDGFKAIDLASTPTIYKMLKASIH >NZ_CP039734|2670786:2729793|2719826_2720273_-|WP_167750591.1|DBSCAN-SWA MKVLLIKDVKDLGKKGEIKEVKDGYGQNFLIGKGFALLATNEVMRKYESDQRKKAAAEAEEIANLKSIEKKLGELKLTVKRKLGANGSLFGAVTKDEIAHELKEQYKIEIDKKTVELEQAIKTTGNFSVSIKLGHGIHATLTLIILGE >NZ_CP039734|2670786:2729793|2705145_2706357_+|WP_167750583.1|DBSCAN-SWA MKILVLNAGSSSVKYQLFNMANNEVLASGVIEQIGEKESMAKIKYKKPAGDEQKREEKCSIHDHDAALTWMSEALIQSGVIHNLNDLDGIGHRVVQGGSSFQEPAMVDDYVMSEIERLIPLGPLHNPGHLAGMKVSVHQSPNVPQVAVFDTAFHSTLPNYAYMYAIPYKYYEELRIRRYGFHGTSHYYVTKVAAKYLKQDINTLNAITLHLGNGASVAAIENGQSVDTSMGLTPLEGLIMGTRSGDLDPAILFYLARKRGLTLDELDKMLNKESGLKGICGSNDMREITRMAEEGDERAQLACDMFNYRLKKYIGSYSAVLGRVDCIVFTGGIGENANDVRLKSCEKLENFGIKIDPILNSVRSSEIRTISADDSKVKVLVIPTNEELEIAIETLEMIQKHHS >NZ_CP039734|2670786:2729793|2701254_2701524_+|WP_087438613.1|DBSCAN-SWA MSTYRLIVTGRVQGVNFRRFVVDIAHALNYVGYVKNSADGSVEVVINSAYEEDLEFFISKLYDGSMFSDVQDVTCQKIESMIFDDFEKR >NZ_CP039734|2670786:2729793|2723744_2724467_+|WP_096046645.1|DBSCAN-SWA MHTLKVQELQKVIKKTEIIKGVSLDVQSGEVVGLLGPNGAGKTTMFYMICGLIPPSSGVVFLDNQDVTQIPLHVRAKLGIGYLPQESSIFKDLSVEENIMLAAEIVYPNKENAMKRVEELLNLLNIEPIRKRNGVSLSGGERRRCEIARSLVLKPKFLLLDEPFAGVDPIAVSDIQGIVQELAKLDIGVLITDHNVRETLAICDRAYVLKDGALLASGSSEEVAQNKLVKTYYLGEDFRF >NZ_CP039734|2670786:2729793|2685720_2686398_-|WP_087438600.1|DBSCAN-SWA METKNFVNTLDDILTRVEKARLSVDQHLIVKIVAASKSADPSMIEAMYHAGQRCFGENKIQDMSDKVHALSRLPLEWHFIGRLQTNKINQLIDLEPSLMHSLSSLELAQEIDKRLHVKNKTMNVLLQINSAYEEQKAGVLPEQAIEVYEQIVLTCKHLQLKGVMSIGAHTEECAVIQKSFETTHKIFESLQNYGAKYCSMGMSGDFELAIACGSNMIRLGSILFK >NZ_CP039734|2670786:2729793|2686387_2687446_-|WP_087438601.1|protease|DBSCAN-SWA MGTLTSILVLSFLIFFHELGHFLAARFFGVHVEVFSIGFGKKIFSKIVGHTEYCLSLIPLGGYVQMKGQDDRDPTKVSYDADSYTTKAPWKRIIILFAGPFANFLLAFLLFLAVGTMGVTKFAPIIGKISPNSPALEAGLQENDRIVMINNELIETWDEVSILIQKNSGSMQMKVERAGSVQTIVLSPKISEYKNMFGETKQKKMIGVLPSGKTIEMVYSISELPGFAYEQTIKATTLILTSLQKLIEGVVSPKELGGIISIVQVTSEASAAGLVALFALTALISVNLGVLNLLPIPALDGGHIMFNAYEMLTKKAPSERVLTAMTSMGWIFLLSLMALSIFNDIYRLTNGN >NZ_CP039734|2670786:2729793|2727376_2727547_-|WP_167750597.1|DBSCAN-SWA MQGILFDDLKLLKEKISQEEKNVQEKLIIEQKQEKEKKLHDQFEKFMQNSGVRKLD >NZ_CP039734|2670786:2729793|2712084_2712894_-|WP_096046631.1|DBSCAN-SWA MNSHYSDLKTIFVDGEVFDYVNLDKSATTYDKLVQTLDKPLKLILFYGKPGTGKTFLLQKIFNDLRTKKRIIFFPRPFFEEKVFIDALFEHIFAKKSPGFSNYNEFLALLSKHITSTDQSITVLIDEAQLYPTDLIEKIRLMADSRLFKFLFTVHKTEKEDVLAKDYFQTRIWETIEIDNASKNEIKTYIEKKLLFHNRFEYLNLFRDKHYKLILKVTDGNLRTINKLMYKLFEILEYYDMHQPSLIKTNALHVKYIEMAGISLGMIHA >NZ_CP039734|2670786:2729793|2723326_2723758_+|WP_167750595.1|tRNA|DBSCAN-SWA MSQCYEKVCDLAHLTAFADEIKSKLGDSGVLLLRGNLASGKTAFVKAFAKILGIEEAISSPTFSILHEYDEKLFHYDIYQCGSNGFLQSGLIEKLDAEGYHLIEWGDAEFEKLLHHFGVDYSTIDIETMDLKRNYKVHINAYA >NZ_CP039734|2670786:2729793|2717953_2719279_-|WP_167750589.1|DBSCAN-SWA MNLTPKQTVAYLDEYIIGQFNAKKSIAIALRNRYRRLKLEGEMAEEVMPKNILMIGSTGVGKTEIARRMAKMLSLPFVKVEASKYTEVGFVGRDVESMVRDLMMASINLVKAEHKEKNQDDIALHVEKAIIEKLLPPLPKGVSEEKKADYEKSYEKMRQRLRDGELDDLKIEVEISQNNNDLGDSSLPPEMIKVQESFIKILGAGQGNIKKEMKVKDAKEALKIEASEKLLDMEAVKSEARERAQNGGIIFIDEIDKIAVSSSQSHRSDPSKEGVQRDLLPIVEGSDVNTKYGSIKTDHILFISAGAFHLSKPSDLIPELQGRFPLRVELSSLTEEVLYQILTQPKNSLLRQYQALLKTEGVELVFDDDAIASIAKIAQITNEKTEDIGARRLHTIIEKVLEDISYTADEHEGETLHVTKELVHAKLDAIVESEDSSRYIL >NZ_CP039734|2670786:2729793|2714399_2714828_-|WP_096046633.1|DBSCAN-SWA MKTLMKPILGLLLMFTVIEAQDKLPASSEVKEYDKIFEKIAERRSGADSIMIDKLENPFIILSSEQNESENNATAQASTYVLEATFDQKAKINGNWYKKNDLIGSYMLIKITRNSVILQNEIEKKELVIRTKDDSNIKIFSK >NZ_CP039734|2670786:2729793|2703040_2705149_+|WP_167750582.1|DBSCAN-SWA MKSKSLYISSLAPAAGSLIVAMGIMELLKGRLGKVAFFRPVILDANEVDKDIDFMLEYYALKMDYNATYGYTVHEVESLIAENKYNEVLENLIDKFKILESQYDFVLIEGLNQANFSQTLDFDINLSIAKNLSSPFISVLKGKQKSVKEVLDEISIEADAIKGAGCQHFATFVNRLGDQEVQELKELNRAKPIQNVPVYFLPEVPELDTPTVAEIKNKLGCSHIYGEEKDLRRVVKQSKIAAMKLDNFLEYIEDGDLVITSGDRSDIIVGCLSTVFSNNYPNISGILLTAGMMPHKSINKLIAGFKDLSIPILSVDNGTFDTAVNVSNVPATITPQSVRKIALAMGLFSSNVNIEEIEKSIDTESTTSSITPIMFEYALFERARRNRKKILLPESNDERILRATEILLRRDVADIILLGVEEEVRRKSATLGLDISKATIIDPLTSPLMEEFVTSFYEMRKAKGLSLDVARDSMMMKNYFGTMMVYLGYADGMVSGAIHTTQETIRPALQIIKTKPGISIVSSLFFMCLDTRVLVYGDCAVNQDPNAEELAQIAISSADTAKIFGISPKIAMLSYSTGDSGKGEEVEKVRLATKIVKETRPDLLVEGPIQYDAAIDPIVAKTKLPNSKVAGEATIFIFPDLNTGNNTYKAVQRSSGAVAIGPVLQGLRKPVNDLSRGCLVPDIVNTVAITAIQAQTNDGANK >NZ_CP039734|2670786:2729793|2675839_2677141_-|WP_087438595.1|DBSCAN-SWA MRRLLLLLFPLCLCAQTYTELLNLLEKSNSYKSAKELESASESLYQAALGKNLPALDATLSAIEFNEIPNMTLHLPSFPVTKADVGTRRHLEGALILSYPLFTGFAISATIDKARLENEQAMLKLTNLKRNLAMHVTQLFSAIIAEERVIDALKSSELAINQAYQKAKGFYTNGLLAQSELYAIEAKKYDIEAQLLHHQNQKKQLLNQLSLIVNTKIETLQANTLQTFEIPNGDGAKEIALNEREDLHVMAKAIDVAQSSVELAKSKNYPTIAMVGVLKGQGDSLELNGDGYTNADKSYVGLSASWNLFNGFSDTHTIDAARASKMSAFFNLEEYKQQVALEVENTELEIKTLNAELQSAKLEEKASESYTNLTQGRFDNQLISADELSRAIANLASTKAKVATLQSELFNQSARLWLECGWGIFEKKVLTQQ >NZ_CP039734|2670786:2729793|2684363_2685728_+|WP_167750574.1|DBSCAN-SWA MMIFFVPFSLFGVELSLDENTYLKKLGTVNVCVDPDWEPFEMIDQKGNYTGIGADLLHLVAQRIGLKITVLPTKDWDESIAYSKAGKCQIISFLNQSPYRDTWLLFTKPHFSDPNVFITREEHSFIGDPHDLVNESIVFPTGTAMEELVRTEYPNLNIITTHSEMDAFQLVSNKKADIAMRSLIVAAYTLKKEGMFNLKIAGQLPDYINKMHMGVIQSEPMLRDILDKGIATISAEDRANIVNKYVAIKAQTVYDYSLLLKIVFGFMILGLLFLWRYYELKKYTKELLYLSETDILTKMYNRMKIEKELVMQVERAKAMKYSFSILLIDFDFFKIINDTFGHPIGDKVLIEMADLIKRSIRSDDRIGRWGGEEFLVLCPQSNEDEALNIARRIQMAIHTGVFSTHKHHTVSIGIRTLTDEDTPYTLISHADDALYKAKNTGRDTICCSSSSTSI >NZ_CP039734|2670786:2729793|2687445_2687988_-|WP_167750575.1|DBSCAN-SWA MMNLPNLLASMRIGLAPLMFILLVNRDLPLFKGLHVSWLDYFAALIFVIASATDFFDGYIARNWNQKTQLGAILDPLADKMLTLAAFLGLMMIDRANPWAIFLILTREFFITGLRVAAMGEGKNIAASMAGKVKTVFQMIAIGFLMMNWPYAELLLWVAVGLTLYSGYEYIIGYTKIESN >NZ_CP039734|2670786:2729793|2720286_2720709_-|WP_167750592.1|DBSCAN-SWA MDKPNNDEWSEEAMMNLVKTVTRSCEVSGEELHVMLNLRAQKKLDFVLVDIREMYEFSLSCIKGTDMLLPTSTIHQHMDELKKLASKLLIFYCHTGGRTAQMIFILRRMGFSNIAHLSGGIDAFHGEKLKNAPFPKNIRS >NZ_CP039734|2670786:2729793|2672407_2673280_+|WP_087438591.1|DBSCAN-SWA MTHGINVKKALKQQQNELDDFTIYSMLSKSDKNGANQTIFHKIAEEEKRHYLYLKTYTNQEQRPRPHVVFFYLLLSKIVGISFTLKFLEKREEGAKAFYQELIAIDPKAEGIFEQEMHHEIELIDMLHDKKLLYAGAIVLGMNDALVELTGTLSGIALAFDRSIVVGVTGLIMGIAAALSMAGSAYLESKENIGDEVKPLTYALYTGISYILTTALLVAPFFIINQISVAIIWMFIGAILTIFLYNFYISVAKDLSFWLRVREMSYITFGVALISFGIGYVVKHYFGIEI >NZ_CP039734|2670786:2729793|2708157_2709399_-|WP_087438619.1|DBSCAN-SWA MKYFEVNYLFKGQKTKTVVKSPTRNDAISIAKLKIPGVILNIKETSAPLEDQLGELKDQIMNALFRKKIKMPNLIAAMRQLSVMTNAGISIHDSVKEVANATVDKTLKEIFNSVNDDLNSGLSLTQSLMTYRNEVGDVTLAMVELGESTGNMAESLEKLSEILEEIEENRQKFKKAMRYPITVVIAIAVAFSILMIYVVPKFKEIFAQLKAELPLPTKILLFMENLINHYGLYLLSGIIGTILLFQYLLKNNEDFKKKFDIYILKVYLIGNIIFYATLSRFCLVFTELIRAGIPIADALDTALLTLENTHLKKKLSSVKISVQRGISLTESFRDTGLFEGMLIQMIQAGEQSGTLDKMLEKVTLYFKSRFSQIIDNIASYIEPILLGFIAAMVLLMALGIFMPMWDMAKAVKS >NZ_CP039734|2670786:2729793|2719275_2719821_-|WP_167750590.1|protease|DBSCAN-SWA MFEATTILACRGDKKAVIGGDGQVTFGNTVLKNNATKIRKLFSGKILAGFAGSTADAFNLFDMFEGILEQKKGDLYKSVIEFSKEWRKDKMLRRLEAMMIVLDCEHIFILSGNGDVVEPEDGKIAAIGSGGNYAISAARALDRHTNLDEETIVKESLKIASELCIYTNDNIKTYVLESKEA >NZ_CP039734|2670786:2729793|2670786_2672037_-|WP_167750569.1|integrase|DBSCAN-SWA MAELIKTKTPSVYYQKLKNDDISYIIKYKLGKNVKQENVGRKSSGMTESKAAEILRNIKYDIARNFRSATSKDELEIARKNKITSLKVMCEKYLADKDVEVSKEKDVDVKNKRYKTAINLRKDRNRLNFWLNNENFKKYINFPLSRITPEVIDKVILTAKKNNGEAYMPKSMKLNIDIMLTVTKHFNYPKSENPFLKIDKKLIPLKKEFKPRTRYLTLHECNTLFEVLKQKKNKRDYVICLICLLTGARPDSVIKLRIKDLYFSTNRINLFDFKRKMYYESIFNEECQRAVNEYLDGRIRNQNDYLFFHETTDKALSKFPASISRVLNQLFNFDQYGNKIQRERSVVPYTLRHSFASININEFKMPIYEVSRCLNHSSVNTTSSIYVTHDLEKSANYIDALSKGALGSLHNGFKID >NZ_CP039734|2670786:2729793|2706405_2706852_-|WP_167750584.1|DBSCAN-SWA MSELKSKLQDDLKDAMKTKDTFKRDVIRFLMSALKQIEVDERKELSDSDIVKIIQKSLKQREDALSAFKDAGREDLYEKELAEAMILKSYLPQQLSDESLKVIIQKHILATGATSLKEIGKIMAGVLAECEGVADGKRINTIAKELLS >NZ_CP039734|2670786:2729793|2706933_2707641_+|WP_167750585.1|DBSCAN-SWA MRVFTCSFVAALFLAGCSVHPADPKISMKAPVYVDETPSKVNEVMPTNPGSLFGQGDNPLFADLKAMHVNDVVTVTITEKTAQTSTGKKALTKQSSDSLGAGITTAAGGGVLGTVSKNLNDVGNIGFTTGSNNSFTGNGSNTRNETFSTTISARVIKILNNGHYFIEGSRELLINGEKQIIQVSGVIRPYDIDKNNNIDSKYIADAKILYKTEGDIDQTTTKPWGAKFMETIWPF >NZ_CP039734|2670786:2729793|2726647_2727373_-|WP_096046648.1|DBSCAN-SWA MRSFSRIIEFSTLETKCSYLDDCKTRMEYKYIENATMELNQELIERGWRRFGNYYSRPQCQKCKLCLSLRIDVKNYNFSRSAKRVFKKAEGIRYVIQAPTISTEHLELYDKYHRHMEQKRGWQYYNLKPQSYHELYVSGAHNFGKEVLYFQGEKLIGVDLIDFLDDGISSIYFYYDPDFEKLSLGRLSIYEQIILAKEYDLEWIYLGYYVKECQSLKYKASYAPYQTLQGNPNLDEDAIWI >NZ_CP039734|2670786:2729793|2695193_2696603_-|WP_088437349.1|DBSCAN-SWA MLIKSFFTNSIGTLVSRIFGFIRDILSASILGANIYSDIFFVAFKFPNLFRRIFAEGAFTQSFIPSFIKTSRKALFTYTIFSRFLLFLIIFSLIVTLFSESFAKIIAFGFDDETVALSAPFVAINFYYLPLIFCVTLFGSLLQYKHHFAVSAFSTALLNIGMIGALLLFQGYDRKTIVYALSYGVLVGGVLQVIAHLFALKKERFFKFLALGFKYRHKRDAALEESNKNFNRSFWHSVIGNSTPQIVSFVDTTLASFLVTGSISYLYYGNRIFQLPLALFAIALTTGIFPKISRLLKANKEEEASNLLSQGFWILAFLLTTSTLGGFILSEEIVKLLFQHGSFSTQDTANTGFVLAMYMIGLIPFGLAKLFSLWLYAGMRQKEAAIIAMYSLIANLIFSFSLIKPMGAAGLALAGSLSAFVLLFFTLRSFGLGKFFAILYTKKLAILVILLVVEWGVLLYVKELMHVYL >NZ_CP039734|2670786:2729793|2700321_2700546_+|WP_087438612.1|DBSCAN-SWA MTYAKNEIMTATDMVRNFSSVLGSITKGKSKRVVIVKNNRFEAVMITVDEYEKMSEAVNILEKIYANTKKKSDG >NZ_CP039734|2670786:2729793|2681161_2682241_+|WP_167750573.1|DBSCAN-SWA MKIFWAVVGKELLSFIRSWQLVFVVLYAFSFEVYIAGSGIELKPRNIAVGYIDSSGGGLSQKFLSYFHAPEFLEPVLFESQEKLSQAVFDKEIMVGLVFDDTFEQNFRKKHATTLNVLLDATAASQAFTALSYLQNIAINFTARSFPVELVTHKLFNENADNHTFMALTELLSITTLLSVILTAVVFVKEKEEGTWDIMLLMPVNAKIIILAKSFSQVIIVMVGIVISVGFVIFGVFNTPINGSFFAFLLLSFLYAFTGAGIGLFIAAIAKDVMQVAQLAIVIMLPLIFLSGAWTPIYAMHPLLQKFSLISPLRYYIEGTESIFFRGTPVLELYPYFLGVTVVGSVLYFIGFRKIGRLF >NZ_CP039734|2670786:2729793|2682964_2683570_+|WP_087439857.1|DBSCAN-SWA MKKEILYLTEYLAKSESEQERTFYALLIQNLADLEVYSPTKLTQAQIASLMSRQGLSVPSSFKEGIQALDTLFESFIPKPLQEAKKTLFMTLLHANFPKKKGFLSVSLELFLSQLEPVEMSIYESLLAYVAGLNRALALFFILGKEDTQNFTPERLVAFGESLHGKLLAFLFNEEETALLNQGLKELLGVYLSLYGKYLYM >NZ_CP039734|2670786:2729793|2698662_2699220_+|WP_167750578.1|DBSCAN-SWA MIVGIEGKVVKKEVTFVHIKTAAGLTYKVFVSLSCLGKISSEIISLHVSQIIREDQHSLYGFIDENEKKVFDTLIKLNGIGPSTALAVCSTLSPDDFAQALVSQNVQAFQKVPGIGPKSAKRILVELSDFSLQLSSDEHNSSSMIEASLALESLGFKKEMIKKALSTCQGVDTQTLIKEALRKLS >NZ_CP039734|2670786:2729793|2711210_2712092_-|WP_167750586.1|DBSCAN-SWA MLSAVEIMELEKRVFKYRLKQRIHYIIISVIILLIGAIAIYSYPTIIHGTDTNTSLSDVQTTAETIKPLDTNSTSIQEQPLNVPDVNIGVKKYTPPIVMNEQENQTLFLQLPTINRNKAEKKSSYIPETPEKKTNFGIQEEELDNKVLMRKMPTIDDENFYRNKEDKVDTALLPPPLLDEPKPKGLIKIETHEVNSVQYLKDKFEKTHNITFALMLAEEYYAMKNYTECNKWALMANNADPDSEKSWIWFAKSKVKLGHKEDAVLALQAYLKSNKSKAAQSLLNQISVGEVID >NZ_CP039734|2670786:2729793|2709395_2711147_-|WP_087438620.1|DBSCAN-SWA MNDITTTLLTYLIHNRIIDEHSAAKIREETQQNNHKVLGEILLENNFFTKEDLLILVIEFFKKGYLNLEDVNANFAIDSEKFLQSLAKNLNYEYLDLDSIDIDYRLASKLPFAQLKKFKALPIREDEINIFVALKDPLDINAQEGVQRIFPRKLLKVIIAEPTQIDKYLIKMELGESIKGLIGEIRKEITSSAAENPQESSGILKLIEIILKTSILARASDIHIEPTENNCIVRSRIDGMLTETFIFDKDIYPPLGSRMKLLSNMDIAEKRKPQDGRFSATILGREYDFRISALPTINGESIVIRILDKSKVMIKIEDLGMHPNNYIKFAQAMKSPYGIILVTGPTGSGKTTTLYAALNAIKSVQSKIITVEDPVEYQLNLTQQVQVNEKANLTFATALRSILRQDPDIIMIGEIRDTETLRIAVQAALTGHLVFSTLHTNDSISAVTRVVDMGIEPYLISGSLIAIEAQRLVRKLCPYCKTKYTLPKTAHDEIKDMLPENFQFYKNNGCEKCSQTGYLGREMISEILPISEKISSMIAQGGSKSDIKEQAIKEGFVDMFQDGITRAAHGITTLDEIIRVAKE >NZ_CP039734|2670786:2729793|2720715_2721945_-|WP_096046641.1|DBSCAN-SWA MKKSVKKVVLAYSGGLDTSIILKWLQDEYKCEVVTFTADIGQGEELEPARKKAISLGIKPENIFIEDLKEEFVKDYVFPMFRANAIYEGEYLLGTSIARPLIAKRQAEIAAMVGADGVSHGATGKGNDQVRFEMGYLSMNSDLVIIAPWREWDLNSREKLLAYAEKNGIVIEKKPGKSPYSMDANLLHISYEGLVLENPAAEPEEDMWRWTVSPEKAPDQSEVIEITYEKGDPVALNGVKLSPATMLAKLNELGCKHGIGRIDIVENRFVGMKSRGCYETPGGTIMLRAHRAIESITLDREAAHFKDEIMPTYAKTIYNGFWFSPEREMMQAAIDKSQETVNGTVKLKLYKGNVSVIGRDSKTNNLFSEAFCTFEEDEVYNQKDAAGFIKLNALRFIISGKNRRNQGKA >NZ_CP039734|2670786:2729793|2700538_2701258_+|WP_167750580.1|DBSCAN-SWA MAKKEIVFQNTSYELSYELLNQNQPQTILFLHGWGSNKEIMKQAFGKTFSQYQHLYLDLPGFGHSSIHDVITTGTYSDIVSVFLKALHVKPLIIVGHSYGGKVATLLQPEVLVLLSSAGIVPPKSLKVKLKIALFKLLKPFAPRSFYRFFATKDVEGMSQTMYEIIKRVVNEDFSEQFLTCKAKTFLFWGKEDSAMPLFCGEKMHSLIKGSHFYPMEGDHFFFMNQAKQIEKTLGEFGF >NZ_CP039734|2670786:2729793|2707665_2708109_+|WP_087438618.1|DBSCAN-SWA MENSNPPKFIERDKIFKAKDIIVALKYFGVSFDKLKTNTPNRARAIVLGYKAWRLGLNETQLRSVIERKIDDKEIIEILEYKEKKSIRSWSIFTKIKEDDYKIKVERLWCKKLGALCLIAKIGQKELITLACETFKDQLDCTIPKEF >NZ_CP039734|2670786:2729793|2687984_2688752_-|WP_087438603.1|DBSCAN-SWA MKGKTLVISGGTRGIGQAIVHEFAQAGVNIAFTYNSNEALAQEQVKDLEEKFGIKAKAYPLNILEPETYKDLFLEIDKDFDRVDFFISNAIISGRPVVGGYTKFMKLKPRGINNIFTATVNAFVVGAQEAAKRMEKIGGGSIISLSSTGNLVYIENYAGHGTAKAAVETMVRYAAAELGCKGIRVNAVSGGPIETDALRAFTNYEEVRDITAKLSPLGRMGQPQDLAGACLFLCSDKASWITGHTMLIDGGTTFK >NZ_CP039734|2670786:2729793|2722346_2723330_+|WP_167750594.1|DBSCAN-SWA MNYQEAYQQFNALFENELSPEAAAQFLVELYERGESFEEIAAAANVMREHSVKLDIPEHLKRELIDIVGTGGDKSGTFNISTTTSIVLATLGCKVAKHGSGSATSLSGSADVLKALGLNLSLTPEKQIKMLEGCGFVFMFAMNHHPCMKHIMPIRRSLSHRTIFNMLGPLANPASAQKQMVGVFHVDYIDRFSQALRELGTTKSMVVSSLDGLDEVSITAPTRYTMIENKIITEGEINPEAFGFTFAPLEAIKGGDSIQNAEITRAILRGDEKGAKLDVVLLNGACALMIDGKARDMQEGIELMRDAIESKKAWDKLGEIIKLSYLI >NZ_CP039734|2670786:2729793|2715464_2716991_-|WP_167750587.1|DBSCAN-SWA MAVTEDRRFSSEIVSIDPLSLVSYTYSKNEIKLNKLEKSDKNAFYTSYLQTRDVISATIDVSRNIPDSDLKDAIEIKVYDELALDSAIEYSISYIETESKDSKNRSFNVFIIDAALIYTKLTPIKEKTRYIDYVTSAPFLIKALYRKNFIEADGTHCFVYFQKTDAFLAIYKNGEYVYSKSLHYSLKEINEKFCELIGERVDEEDFYKLLTNEGLRATNSQYQQSLMQLFGEIFLYINDVLVFTKRSYNIDFIDKIYLGSEIGTFSGIEEYGKSYLGLESYEFNFSIAINSKEWYIDQIHILMMLSAQLYIENPDDNLNFSIYKRPPPLKYRASGKFLGIMAASVIISLAYPAYQFAYHTFLSLIIVKQTSEYNELYKQTSDIRQQLSLLKTEKEKVDGLVKNETTKFEFRKKLLSEIYNKKISYPMKALMLLEIFQLSNQNGCKVEAIEFKNQQLDFFVRNKNEKKITEFIKDLTALKKYKVNTEKIIQDDKIKLYTSKISIGLSNE >NZ_CP039734|2670786:2729793|2699233_2700268_+|WP_167750579.1|DBSCAN-SWA MTIAVLFGAQSFEHEISVVSAIALKKVLKSDIVYIFCDYYRNFYLIPTDKITSKRFSSGEYKKDKLLYLKQGGFYAKKMLGEEKIIFDVMINLVHGMDGEDGKLSSMLDFFSVPYIGPRMEGSCISYNKLFTKLYAKEVGVNVLDYQVLRKGSGESIKIAYPFIVKPLRLGSSIGIGIVKEEKELAYALDVAFEFDDSVLIEPFISGVKEYNLAGCKTDIFHFSIIEEPQKEEFLDFDKKYLDFSRTKRVNEATLDIKAEEGIRDAFMKLYDPLFLGALIRCDFFVIDGMTYLNEINPIPGSMANYLFDDFDRIIKNLSKWLPKSITIPKEYRYINSIQAAKGK >NZ_CP039734|2670786:2729793|2728945_2729410_-|WP_084613060.1|transposase|DBSCAN-SWA MSKTQKKQREYYSGKQKRHTLKGQIVIDKEERIMCVHTAKGTTHDFRLFQESNLPLMPKTCVYVDLGYLGIAKEHSHCQIPHKASKLHPLSEEQKEENRQKASARICVEHVNAKIKTFQILTQKYRNRRKRFNLRFNLICGLINFDRGFAVEYK >NZ_CP039734|2670786:2729793|2680046_2681165_+|WP_167750572.1|DBSCAN-SWA MKLGVIKAYVLKELTEIVRSRLIIMVYLLPTMVLVLFGYGIRMEVTGARTLIIDNDQSHYSQLLVSKFEHSKYFDSTIEKRGEAKALDEIHKAKSDILIIIPESFEKRLLHGQSTQIGVFVDAAFPMRGSTMESYVKGVVLDAASQMLERLGGNAPTISINQRTLFNQAMRDEDAIVPGLIGLVLLIAPAILAALLIVKEKERGTIFNFYASPLSKGEFIAAKLIPVFLLHSINIFILFLWATYLFEVPFRGSFLLYWLTSELYLMISLSIGMLISIVTSTQIVAVVLTVIVTIIPGFLYSGILMPISSMIGVSRYEAHIFPVMYYNHILYDVFLVGEGLASSKTVMYIVILAFYAFFMLTLGSFLLKKELK >NZ_CP039734|2670786:2729793|2729331_2729793_-|WP_025343414.1|transposase|DBSCAN-SWA MKKYEQMQKHKPKDFKRHIGVNEETFNAMIEVFRQYDENRKKGLGVGGRRSLSPENKVLLMLGYYREYRTLEHIGFDYGVSESTASRIVCEVEEVLIKSGRFSLPSKRELYKSDVALSFVVIDATETPCQRPKKSKESTTQVSKSVIPLKDRL >NZ_CP039734|2670786:2729793|2717049_2717940_-|WP_167750588.1|DBSCAN-SWA MSEKTKTGYVAVIGRPNAGKSSLLNWLVGEKLAMVSHKANATRKRLNIIAMHENTQIIFIDTPGIHEQERLLNQFMLEEAMKAMGDCDLILFLAPASDALQHYVDFLSLNKNNIPHMVLLTKTDSVSQTALFDKITEYQAYQDKFVALIPVSVKKGISQSYLLEAISKYMPEHPYLYDPEILTTERSRDIYKELIREAIFENTSDEIPYFADVIIDKVDEKETIDNIYATIIVDKKSQKGIVIGKEGQTIKRIGSSARLLIESLSQKRVFLKLVVVVKQGWSQDKEMLKKIGYIVE >NZ_CP039734|2670786:2729793|2725753_2726584_-|WP_096046647.1|DBSCAN-SWA MHLSLSHDMLALTIEEDSKTFYFLQNIADKNFQKKIGRKDKMIIFKEQDELVQRRYFLKLISKIYLRKTGNVEQARIIENTIDKSIKISLLKSNQVLQKMNIQLCIEDNYAVVFYMGSHNTLFASYLKSYFKDHLVRIRPKNGTITLYPNSEITVCLLEKLLAQKELFGCFVEFSYSMDDFLAYKKSFVSKRAKRSRHNALFSLLEEYFGVLGCKMEDSFDMIRSNYLTLVKKYHPDSCGLYDGDLHVKYVAKFQEVQNAYEMLKMHFRHADAKSA >NZ_CP039734|2670786:2729793|2689646_2690948_-|WP_096046614.1|DBSCAN-SWA MKQILLFLLFLGALMANSLPEHFTKTLENGLQIVVIPMHNKSDVITTDIFYKVGSGNEIMGKSGIAHMLEHLNFKSTKNLKTGEFDEIVKGFGGVNNASTGFDYTHYFIKSSSKNLPKSLELFAELMQNLKLTDEEFQPERNVVLEERLWRTDNSPIGYLYFRLFNNAFTYHPYHWTPIGFIDDIKNWSIEDIRSFHSKYYQPSNAIVVVAGDIEPEDVFKNVTKYFGDIKNSTPLPKVHHQVEPQQDGAKRLFIKKESEVEMVAIAYKIPNFLHEDQVALSALSELLSSGKSSKLHRILVDEKKLVNQIYGYAMEAKDPSVFLFLAVCNPGVKAESVETEILKIIDSLKKDDVSDKDIEKIKINTKADFIHNLESSSDLATLFGSYFAKGDITPLLNYEEGINKLKKEDIIKVVNKYLVPQSSTTVILRKDY >NZ_CP039734|2670786:2729793|2692093_2693809_-|WP_088437347.1|DBSCAN-SWA MLGIKGVLARFSPFFKDYIPYFLLAIFGMLLSSGGTAYSAYLVKPLLDEIFIAKDKEMLELLPYAIIAVYALKEAGRYTQAYYTAYIGQDIIKRFRDMILENLLKLDLSFFHEYRTGELISRNTNDVERVRTVVSNLIPEFLSQTLTIFGLIGVVIYQSPELAFYALIIMPLAIYPLSVLSKKMKKVSRQSQEKVSDITAKLSEIFNNIEIIQANNAQTYEHELFKKDNERYFKLTMKSVKVNELVSPVMETLGSIGVAVVILVGGKEVIEGGMSVGAFFSFLTALFMLYTPIKRISGLYNKMQDALVASERIFFLLDQRSSIPVGTKMLPSKIDTISFHNVSLCYGEKEALQNVSLEAKSGEMIALIGDSGGGKSSLMNMLMRFYDPTSGSVCINGIDLKSFDLHDLRHNIAMVTQRVYIFHDSVAANVAYGKEINENNVIDALKKANAYEFIQNLPEGIHTHLDEFGTNLSGGQRQRIAIARAIYTNPQVLILDEATSALDSQSEQKITEAIENLIKDKITFVIAHRLSTIKKADKIALLKHGKICAIGSDEELLANSQEYLKLKGLQH >NZ_CP039734|2670786:2729793|2724485_2725757_+|WP_167750596.1|DBSCAN-SWA MKLRVSSTQTTKQKFSSTLRGWLPILQANLDSLVETLEPFVQENPFISVKSGSETPDKRFEKKSFFSEVAKTSVSDTIEALTLDKKSLYQVLNEQVNPPLFPTEKSQRIAYEIIENINSEGYFEAQALSEIAKKLDVKIEDVEKIRQRFAYLEPLGIGALDLKETFLFQLQDLSLENELYDMVEMLIINFDTIESFGKEPLFHDALAIIKRFHNPPAIDFMEDEKEVIPDIFIYNLEGAIEVRLNDAYYPEVILDTEGLDVDHSFVSQKIKDAKDLIDALEMRKATLYKIGLMIVEYQYDFFFGKAIKPMKLKDLADDLGRNPSTISRAIAGKYLSCSRGVIPLKQFFATALEEDISNSAIKEYMIELVKNESKIKPLSDIKLLELIEGKFNIKMVRRTITKYRQQFNIASSSERKKLYTLQF >NZ_CP039734|2670786:2729793|2688751_2689645_-|WP_087438604.1|DBSCAN-SWA MQQALQGAMTALITPFKNGKLDEVQYAKLIERQIKNGIDVVVPVGTTGESATLDHDEHRRCIEIAVDVCSHSAVKVLAGAGSNATHEAIGLAKFAQAHGAHAILSVTPYYNKPTQEGLFQHYKAIAGSVDIPVLLYNVPGRTGCDLLPDTIFRLFGACPNIFGVKEATGSIDRCVDLLAHQPQLSVFSGEDAINYPILSNGGSGVISVTSNILPDQIAELTHLALKGDFHGAKAINDSLYEINKTLFCESNPIPIKAAMYIAGLLETLEYRLPLCAPSNDNMKRIENTLKKYTIKGF >NZ_CP039734|2670786:2729793|2677271_2678429_+|WP_167750570.1|DBSCAN-SWA MLEKLKQYRLGVAILALVLIASGLIIHKLIAPTLAANLVQGSGRMDGDLINLNAKYAGRISTLNIQEGQHVELGQNIAVLASEEYEAQKAQIEAQISARTQELNAKETELEIASKTIPETLSKAKDNLSIKQHQRTELDKNIASQTSILAQDKHDLERMKNLFEHNLIEKRQVETAVLKFQTSGDLLAGLQQKREQLSLAINVANSELIEATAHQKTLRALEQGIEALKSSLKALEASKTQIEAILHEMILRSSVNGVVVEKIANQGEVIGAGSVVATLLDPNSLYLKIFVDTKQNGNLQVGNDAVIFLDGKPNEPIQAKVVRIEQKAEFTPKEVSVPSDRIQRVFALHVKPLSPQPTLKLGIPAVGVVSMDGKGLPKSLNNVPE >NZ_CP039734|2670786:2729793|2682293_2682899_-|WP_088437344.1|DBSCAN-SWA MIHLSTFFAIYVKFFFIMTPFFVTTIFLSMTKGIDESDKKRLAVKVTLAISITCLIILFFGKYIFELFGITLDAFRIGAGALLFLTAVDLVQKDINAEPTCKADILKHAVVPLAIPVTVGPGTVGALMVMGADMDGFEDLILGSSALLVAIFSIGLLLYLSGHIERLIGRTGLVVFSKVTGLVLSALSAQLIFTGIKNFLH >NZ_CP039734|2670786:2729793|2690954_2692019_-|WP_167750576.1|DBSCAN-SWA MFDYYGLMKKFMFNFSPENAHHIAEFFFKNGATYAPFILSPLAEHFFIHDKRLEQTIFGKTFLNPLGLGAGFDKNATMIQMLTALGFGHIEYGTITPEPQNGNAKPRLFRYVEQESIQNAMGFNNEGMYKVGTRLESLYPFATPLGANIGKNKTTTVENALKDYEKLIKRFKDLSDYLVINISSPNTPGLRDLQNEQFIKDLFVMAKELTLKPVLLKIAPDLEINDALHVSSTALENGAAGIVATNTTVDYSLIPNARDFGGLSGKVLTEKSFVMFEALAKEFFGKTTLISVGGIDSADEAYRRLKAGASLIQIYSAFIFKGPSLNRAINLGILERMEKDGFNHISEVIGSDRR >NZ_CP039734|2670786:2729793|2722101_2722347_+|WP_167750593.1|DBSCAN-SWA MRVDKFLNSVNITKRRAISEDMCKNGVVCINSVVVKPAKDVKVGDIITINYLEKTVKYEVLQIPEAKTIPKTKQNEYVREV >NZ_CP039734|2670786:2729793|2714824_2715472_-|WP_096046634.1|DBSCAN-SWA MNSIDSLLDKIDLFFKDKKNNEIYLIFAMAFVVIGFIFYSYLYPVTEKILNQSMRSAKDIEKKLHDEQAYLTSVSRDGDATFLIKKVKSDIENSKLLLEKTIYMNDYVDNKLKDLSYLLFNDENWAKFLNTISQLAQKYSVRIKVIENKINEPSIQKIEQILTLKVSFNGSFANTMKYMNAIEESELVVDIYELNCTGQKDIEGQFNIAVWGMKY |
60 | Bacillus_phage(18.18%) | protease,integrase,tRNA,transposase | attL 2668073:2668095|attR 2729827:2729849 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|