Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_020515 | Bibersteinia trehalosi USDA-ARS-USMARC-192, complete sequence | 5 crisprs | cas14j,csa3,DEDDh,cas3,cas2,cas1,cas9,cas5,cas8c,cas7,cas4,DinG | 3 | 28 | 4 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_020515_1 | 843484-844575 | TypeII |
NA
Consensus repeat of NC_020515_1
|
16 spacers
spacers of NC_020515_1
>1.1|843520|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT GCCGATTTTAAATTCCATCTCAAGCTTTTC >1.2|843586|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT AAAGTCCCACGTCAAAATAATACTTGACAA >1.3|843652|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT ATGCTGACAAATTATTAGGCGTATGGCAAC >1.4|843718|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT GGGCAGTTGCAAGACATGTATCAAAATCTT >1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT CACAATCAAAAGCGATTGTTGATGATTCAA >1.6|843850|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT CGTACATTCGGTTATACATCAACGCTTAAA >1.7|843916|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT CCTCTTTGAGATGTTCCACGAACCACAACG >1.8|843982|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT CAGTGTATTCGCATTGGAAAGCGTAAAAGA >1.9|844048|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT TATTTTCTGTACCACAACCTTGCCTTGCTT >1.10|844114|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT CGTACATTCGGTTATACATCAACGCTTAAA >1.11|844180|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT CCTCTTTGAGATGTTCCACGAACCACAACG >1.12|844246|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT CAGTGTATTCGCATTGGAAAGCGTAAAAGA >1.13|844312|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT TATTTTCTGTACCACAACCTTGCCTTGCTT >1.14|844378|30|NC_020515|CRISPRCasFinder,CRT CGAAGTAAAAATCATTGGTTATGTAGGGCA >1.15|844444|30|NC_020515|CRISPRCasFinder,CRT AAGTAAATATTACACAGGAATTATGGGAGA >1.16|844510|30|NC_020515|CRISPRCasFinder,CRT CCGGCTCGGTGATTTGAGCAATGAGGTAAT |
cas2,cas1,cas9 |
CRISPR arrays and Neighbor proteins around NC_020515_1
The CRISPR arrays of NC_020515_1 >merge|NC_020515|1|843484-844575|PILER-CR,CRISPRCasFinder,CRT GTTGTAGCTCCCTTTTTCATTTCGCAGTGCTACAATGCCGATTTTAAATTCCATCTCAAGCTTTTCGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATAAAGTCCCACGTCAAAATAATACTTGACAAGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATATGCTGACAAATTATTAGGCGTATGGCAACGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATGGGCAGTTGCAAGACATGTATCAAAATCTTGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATCACAATCAAAAGCGATTGTTGATGATTCAAGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATCGTACATTCGGTTATACATCAACGCTTAAAGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATCCTCTTTGAGATGTTCCACGAACCACAACGGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATCAGTGTATTCGCATTGGAAAGCGTAAAAGAGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATTATTTTCTGTACCACAACCTTGCCTTGCTTGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATCGTACATTCGGTTATACATCAACGCTTAAAGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATCCTCTTTGAGATGTTCCACGAACCACAACGGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATCAGTGTATTCGCATTGGAAAGCGTAAAAGAGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATTATTTTCTGTACCACAACCTTGCCTTGCTTGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATCGAAGTAAAAATCATTGGTTATGTAGGGCAGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATAAGTAAATATTACACAGGAATTATGGGAGAGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAATCCGGCTCGGTGATTTGAGCAATGAGGTAATGTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT >NC_020515|1|1|843484-844377|PILER-CR GTTGTAGCTCCCTTTTTCATTTCGCAGTGCTACAAT GCCGATTTTAAATTCCATCTCAAGCTTTTC GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT AAAGTCCCACGTCAAAATAATACTTGACAA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT ATGCTGACAAATTATTAGGCGTATGGCAAC GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT GGGCAGTTGCAAGACATGTATCAAAATCTT GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CACAATCAAAAGCGATTGTTGATGATTCAA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CGTACATTCGGTTATACATCAACGCTTAAA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CCTCTTTGAGATGTTCCACGAACCACAACG GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CAGTGTATTCGCATTGGAAAGCGTAAAAGA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT TATTTTCTGTACCACAACCTTGCCTTGCTT GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CGTACATTCGGTTATACATCAACGCTTAAA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CCTCTTTGAGATGTTCCACGAACCACAACG GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CAGTGTATTCGCATTGGAAAGCGTAAAAGA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT TATTTTCTGTACCACAACCTTGCCTTGCTT GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT >NC_020515|1|1|843484-844575|CRISPRCasFinder GTTGTAGCTCCCTTTTTCATTTCGCAGTGCTACAAT GCCGATTTTAAATTCCATCTCAAGCTTTTC GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT AAAGTCCCACGTCAAAATAATACTTGACAA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT ATGCTGACAAATTATTAGGCGTATGGCAAC GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT GGGCAGTTGCAAGACATGTATCAAAATCTT GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CACAATCAAAAGCGATTGTTGATGATTCAA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CGTACATTCGGTTATACATCAACGCTTAAA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CCTCTTTGAGATGTTCCACGAACCACAACG GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CAGTGTATTCGCATTGGAAAGCGTAAAAGA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT TATTTTCTGTACCACAACCTTGCCTTGCTT GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CGTACATTCGGTTATACATCAACGCTTAAA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CCTCTTTGAGATGTTCCACGAACCACAACG GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CAGTGTATTCGCATTGGAAAGCGTAAAAGA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT TATTTTCTGTACCACAACCTTGCCTTGCTT GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CGAAGTAAAAATCATTGGTTATGTAGGGCA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT AAGTAAATATTACACAGGAATTATGGGAGA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CCGGCTCGGTGATTTGAGCAATGAGGTAAT GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT >NC_020515|1|1|843484-844575|CRT GTTGTAGCTCCCTTTTTCATTTCGCAGTGCTACAAT GCCGATTTTAAATTCCATCTCAAGCTTTTC GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT AAAGTCCCACGTCAAAATAATACTTGACAA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT ATGCTGACAAATTATTAGGCGTATGGCAAC GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT GGGCAGTTGCAAGACATGTATCAAAATCTT GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CACAATCAAAAGCGATTGTTGATGATTCAA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CGTACATTCGGTTATACATCAACGCTTAAA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CCTCTTTGAGATGTTCCACGAACCACAACG GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CAGTGTATTCGCATTGGAAAGCGTAAAAGA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT TATTTTCTGTACCACAACCTTGCCTTGCTT GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CGTACATTCGGTTATACATCAACGCTTAAA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CCTCTTTGAGATGTTCCACGAACCACAACG GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CAGTGTATTCGCATTGGAAAGCGTAAAAGA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT TATTTTCTGTACCACAACCTTGCCTTGCTT GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CGAAGTAAAAATCATTGGTTATGTAGGGCA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT AAGTAAATATTACACAGGAATTATGGGAGA GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT CCGGCTCGGTGATTTGAGCAATGAGGTAAT GTTGTAGCTCCCTTTCTCATTTCGCAGTGCTACAAT
>NC_020515.1|WP_015432199.1|842071_843472_+|PLP-dependent-aminotransferase-family-protein MVTYTFKKNNIPLYEQLYCFIKLDIEQGDIIAGEKLPSKRAFAKHLGISVMTVETAYQQLVAEGYLSAKAKQGFFVNPLNLPKTSTSRSVFTETSGSKPIENSKKWQADLTNSQTSAENFPFSVWTKLVREVLKHHQSALMERAESGGVLMLRQAIAKHLHDFRGMNVSPAQIIVGAGTEYLYGLLVQLLGLDKTYALPDPSYDKLHKIFQSYGLNHISISMDYATNVLKNVNVLHTSPSHHFPTGLVMPIAKRYELLAWAAEKADRYIIEDDYDSEFRFVGQPIPALQSIDMLGKVIYMNTFSKTLSSTVRIAYMVLPPELLTRFHQQLGFYASTVSNFEQYVLAEFIQQGYFEKHINRMRAYYQKKRDHLLFSLKNSPLAEKITIKEENAGLHFIVQFHTELSDEQILQSANEQGIKMVSLARYYQDKSQAPKNAFVVGYSNLADKQVDEVVRWLKNIFSRRVL >NC_020515.1|WP_025328973.1|841101_841974_-|pyridoxal-5'-phosphate-synthase-lyase-subunit-PdxS MTHRYDLNKQLAQMLKGGVIMDVTTPEQARIAEEAGACAVMALEKIPADIRAAGGVSRMSDPKMIKSIQEAVSIPVMAKVRIGHFTEAQILQAIEIDYIDESEVLSPADDTFHINKREFDVPFVCGAKDLGEALRRINEGAAMIRSKGEPGTGDVVQAVRHLRKIKQEIARVASLSTDELYHAAKELQVPFDLIQYVHQHKKLPVVTFAAGGVATPADAALMMQLGAEGVFVGSGIFKSGDPAKRARAIVQAVTNYNDAKLLAELSEDLGEAMVGINEQEIELLMAARGI >NC_020515.1|WP_015432197.1|840535_841102_-|pyridoxal-5'-phosphate-synthase-glutaminase-subunit-PdxT MKIAILALQGAFAEHADKLKQLGVESVEIRQLADLNQDFDGLILPGGESTVQGKLLRELGLFEPLRQKILNGLPTLGTCAGLILLAEKLANDDKQHFATLPVTVKRNAYGRQLGSFLTESEVKHIGKIPLPFIRAPYVESIGEGMEILAEVSGNIVGVKCKNQIGISFHPEVSDDLRFHRYFVEMCKN >NC_020515.1|WP_015432196.1|839731_840493_+|DNA-binding-transcriptional-repressor MKPIERQKQILDYLSQHGRTDVEVLAEYFKLTGATIRKDLTVLEQQNKVLRTYGSVVILQDEVFDASIDQKNHINLLQKQKIGQKASELINDGDSIIMDAGSTVLQMIPHLVKFDNLTVMTNSLHIINGITQLKKNYNLMISGGTYRERSASFHGYFAESAFNDSTFDTLFIGTDGFDLEVGLTTFNEIYGVSSAMCRAAKKIVVLADSTKFGRKSPNIVCGLEKIDVVISDNQLSEEMKERIEQKGIQVIIV >NC_020515.1|WP_015432195.1|839060_839705_+|hexitol-phosphatase-HxpB MQIKAVIFDMDGVIIDSEPMWAEAQIKTLHALGQQITEQDCEHLTRGKRIDQIAHIWIERYQLNANAEEVANQILRYAYEAILAQGCAMEGLYPLLDLLQKKNIPLALATSSAPMIIEAVFNKLNLWDYFRVQCSANDEAYGKPHPAVYLTAVQKLNVNINDCLVIEDSVTGLIAAKAAGLQTVIVNPNYADPRFSLADKRVDSLSKLMATFSY >NC_020515.1|WP_025328974.1|838642_838999_+|hypothetical-protein MSSTAILIGFAVCMWLLQILLGWRQIRLFNQAYAEIAKKGKVLVGRNEGRFTPKAVIVLAVDNHNIVQECLTMQGFSVFAKPAFSTVLTGKSLTEIQPEQAFPNNKALQNALKIALIR >NC_020515.1|WP_015432193.1|837814_838594_+|sorbitol-6-phosphate-dehydrogenase MKKVAVVVGGGQTLGAFLSEGLADSGYRVVVADLNGENAQAVAGIINGKYGAENAIGVQVDATNETSVEAMAKATDEAFGRVDLVVYSAGTAKAAKITDFDLKDFDLSVKVNLTGYFLSAKHFSRLMIRDGIKGRIIQINSKSGKVGSKHNSGYSAAKFGGVGLTQSLALDLAEHGITVHSLMLGNLLKSPMFQSLIPQYAKKLGIPESEVEQVYIDKVPLKRGCDYQDVLNVLRFYASEQAAYCTGQSINITGGQVMF >NC_020515.1|WP_015432192.1|837435_837801_+|PTS-glucitol/sorbitol-transporter-subunit-IIA MSVIYQVVVEQIGDFAQDALQDNMLIMFKSGAPADVVDYCFVHSHDDLKQPLAVGGELQINAKRYPITAVGEVASENLAQLGHITLFFDGASEAQFPGSIHLQGDVPNEISVGSEFVFLNN >NC_020515.1|WP_015432191.1|836427_837426_+|PTS-glucitol/sorbitol-transporter-subunit-IIB MSKVIYIEKGNGGWGGPLSIPVVEGKKIVYVTGGTRPAIVDRLVELTGWEAVDGFKDGEPPQEEIGVAIIDCGGVLRCGLYPKRRIPTINIHTTGKSGPLAQFIVEDIYVSAVKPNNIHVKDDDVQAVQSTENSAKNSENPTAYREYDSSKKITEQSDGLLAKIGTGMGSVVAVFYQAGRETIETVLKTILPFMAFVSALIGIIMASGIGDLIAHALTPLATNPLGLVTLALICSFPLLSPFLGPGAVIAQVIGVLVGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSMAEAKQDTVRVGVPSVLVSRFLTGAPTVLVAWLVSGFIYS >NC_020515.1|WP_015432190.1|835844_836399_+|PTS-glucitol/sorbitol-transporter-subunit-IIC MIESITKGAEWFIGLFQKGGEVFVGMVTGILPLLISLLVVMNALIYFIGQERIERLAQRSSGNPFSRYFLLPLIGTFVFCNPMTLSLGRFLPERYKPSYYAAASYSCHSMNGLFPHINPGELFVYLGIAQGLTTLGLPLGPLAVSYFLVGLFTNFFRGWITDFTTRIFEKRMGIQLEREVHLAK >NC_020515.1|WP_015432200.1|844833_845160_-|CRISPR-associated-endonuclease-Cas2 MSEATFMRIIVFFDLPVTTKAKRKAANQFRQFLLKDGYQMLQLSVYTRIVRGRDSLEKHNKRLTAHLPEEGSVRCLEITEKQFTSMLLLVGELKPQEEKVNANQLLLF >NC_020515.1|WP_015432201.1|845152_846067_-|type-II-CRISPR-associated-endonuclease-Cas1 MSWRSILISNGGKLSLRQNQMVIWQEEQEFCVPLEDIAVIVIEHRETVITTPLLSALALNGITLLTCDEQFIPCGQWLPFGQYHRQLKTLKLQLEMSQPLKKQLWQVIVQQKIRNQAFVLVQTKRLDMAEKLQHLAKRVKSGDKENLEAQAALIYFQTAFGSDFRRWQENAINAHLNYAYTVLRSAVARSLVLYGWLPTLGLFHHSELNPFNLADDMIEPFRPLVDLMVWQLWQDDKLADSLTPHNKQKLVGLLHYQMRFQDQTFSTLAAIDRTIGSLQNAISQKDPSLLKLPEILPLKEHQYE >NC_020515.1|WP_015432202.1|846119_849257_-|type-II-CRISPR-RNA-guided-endonuclease-Cas9 MKPRNLNYILGLDLGIASVGWSVVEIDENEYPIRLIDVGVRTFERAEVPKTGESLALARRLARSTRRLIRRRAFRLLKAKRLLKHHQIVNAEELTQLPNQCWELRVKGLDSLLSNTEWAAVLLHLLKHRGYLSQRKNEAQNADKELGKLREGMDNNSKLLLENNYRTPADIAVKKFAVEEGHMRNQRGAYTHTFNRLDILAEMQLLFKTQRELGSTYANGELEQAFCELLLWQKQALNRSQMLSLVGKCTFEKEEKRAAKASYSAERFVAIQKLQNLRILENGEERGLFDSEFSLLLENAYNLKSGLTYKQVRKILSLSENAIFKGLPYLSDDLEKPEKTQFLAFKFYHQLADILKNNGFSDEWQKLSQEPTLLDKLGTELSLCKEENEFIAQFNGELPEAMLSTLFNHTNFDKFIHISLKALNNILPLMEQGNDYTKAWRKVYPEPTKKDEKTLPPIPADEIRNPVVLRSLSQARKVINAVIRLYGSPARIHIETARELGKSYDDRQKIKKQQDKNSDERDQAVKKFLEECPNFANKVKGKDILKIRLYINQDGKCLYSGKPLDPHRLLEIGYVEIDHALPFSRTWDDSQNNKVLVLANENQNKGNQTPFEWLGKDEHQWALFVARVNGCRFPYAKKQRILTKKLDEQGFLKRNLNDTRYVSAYLMKHIKENLHLVGKGSDKVFASNGQVTNFLRRCWGLEKKREEGDRHHALDAIVVACSTASMRQKITLFKKYQRWNLKTGKHIDQETGEIIPLHFPAPWDFFRQEVMIRIFSEMPQEDLIMQLPDRPQANHEFVQPLFVSRAPSRKMSGQGHEAKLRSARMLEATGKSIKKEFLTDLTIKNLEHMVNKEREPELYQALVEHFKKYGDKPKEKFFKKGGVEVKSIRISKTQNKSVNLGNKTIADNGDIVRTDLFLKNKKYYFVNIYAWQVSKGILPKETTTGNILDNSYEFQFSLFKNDLLEIPHPKNENDSILAYFIRPDDERRWILKFHDNAKIPDIYGKKDETSIRLSIQGQKFIKKYQVDELGKNIRPCRPTKRQGVR >NC_020515.1|WP_025267174.1|849446_850034_-|response-regulator-transcription-factor MIIHILDDEESILDAMSFLLAPLGIEIQTWQSSVDFLAQADLHQQGVLLLDIRMPLPDGQQVHQQLREVQSTLAVVIMTAHGDVPMAVAELKKGAVDFLQKPASFEQLKQAITQVKTVSEQAVKIREISQNYAKLTEKERNLVPLIMQGFTNKQIADHLAISVRTVEVHRANVMEKMQAESLAELVQKLGLLPTP >NC_020515.1|WP_015432204.1|850159_851815_-|PhnD/SsuA/transferrin-family-substrate-binding-protein MKRLFLFFLFLSHTVLAETWHIGILAQRGETYTRTHWQPWVDWLNEQFPNERFELVPLGLGEANERAELDFLLTNQAQFFYLSRQNVRWLATLGSPFTENGEQGAVGSSIWVRADSHYRQLSDLKNQTISAVDNDAFGGFLLGLYQFHQAGMQQNRDFSVQFSGFPVENALALLAEKQVEAAIVPVCLLEELEKEGKFKRSDFRLILQNPQAQGCLASTPLLPNWSLAAMENVPNELAVQFATRLLNSHNPDLPRWTLPFSSAQADHILRELYRHPQQKSLWATVLDWVRLNKFGLLAVALFILLNLVALRYQVYRKSKALQQAHRKMQQYQQELTRADRLALLGEMTTGFAHELKQPLSAVRMYAEGLKSQNSNPYQQRILDKLIAQVDRAVKTMQSIRDWVQNRPSGEPQAVILNQLIANVIEFVAVENRQNAQISLIADRTFRLNLHATVLEQVLTNCLLNALQAGASEITVRLQAVENGLEIAIEDNGGGFSPAQLEFPFVPFRTDKPHGLGLGLVLCQRLMQSLNGRIVLTNGEKGARVSLFIPDD >NC_020515.1|WP_025267173.1|851947_852904_+|transcriptional-regulator MSLSMIELAMQILSCSQKELANKLSVSPSQITKWKKGEYMSFDMENKVRKLLDIGDLDPNVILAFGHISYAQKWQKIITQLAESANDNGETGFNVAPLIDETDLILSNLVDSLRMIGYKFPLSFPPELENDYDHYIDWSDSFIETIYSNEFCSLIYSIFCSFADVYSFYEAYISELDTQLSMENPEFSNLIVDIEANLLNLAIIKADTILPELTGFQKLKRETEKDYREWLQQLKLYAFKSNIPLRAELMHLIDDEHDNLGVEAEAEYLGFNDNRLHPDIYMNELLVGMRLLHQVLPVILDKLEIKDFDIDNKSLRKF >NC_020515.1|WP_015432206.1|853076_853802_+|tetrathionate-reductase-subunit-TtrB MDLSKRSFLKELSALTVGASFVPLQSAQAFMPARREGDENKRYAMLIDLRKCIGCQACTVSCLVENATPLHSFRTTVRQYEITNGTQVANNVVLPRLCNHCDQPPCVPVCPVQATFQRKDGVVVINNEQCIGCGYCVQACPYDARFINEETKTADKCTFCTHRLEAGLLPACVESCVGGARVIGDLNDPKSQISQLYQTHKDDLKVLKPEAGTVPHVFYVGLIEAFVSKIDGQPMLWTGEA >NC_020515.1|WP_015432207.1|853803_854901_+|polysulfide-reductase-NrfD MIREVLVEPQHIVWLPWIVHYFFFVGVAATAVFTAVLFAKKQRQNACVLNVGSANGPKGASEQCSRVKPTACELAAVTVALIGSIVAPVALTADLHQPSRILHFYTDFAWWSPMAWGAMILPLFSVAVAGYFVLALAHHTQPNLPKWLAWLQFPILKNQDLLWAFRLFAALTAVGIIGYTVLETYQTGTRILWHSAWLLPIMLFSAWAVALGLTQVISQFLLPLAGEVPEQPSGAGGKICLILTALSIIGLAFSSETAQRDFALLFNGSITAYLVGIFWLIALVCNFSAKNHRLQWLGVLALIAFGWLLRWVLVIQVQTIAKTNALQNPYHFDWTAVDGGLGIVSILGLAVLVTVGVGQIISLTT >NC_020515.1|WP_015432208.1|855079_856297_+|ATP-binding-protein MINRPNYLQQLKPFINTPLIKVMTGIRRSGKSTVMKLLREELISQGIAEQQIIHINFESFAFSDFKTADKLYMLVKEKILTTDKYYLLLDEIQEVSEWEKAVNAFMVDFNLDLYITGSNSHLLSSELSTYLAGRYVEIPIFTLSFQEFLDFKTAYSPETSSNPTALFHEYLRKGGFPMVHTANYEAETVYKIVQDIYASVILRDTVQRHKIRDVELLERIVKYAFDNIGNTFSGKNVADYFKSQQRKIDLNTVYNYLKALESAFILHRVERFDIKGKEILKTQEKFYLGDVSLLYATMGFRTSLISGILENLVYLELKRRGYQVYIGKLDKQEIDFIAQKQNEKIYIQVAYKLESEETVKREFSPLQEIADNYPKYVITMDELWKENLGGVEHIYTTDFFMRDAL >NC_020515.1|WP_015432209.1|856293_860289_+|hypothetical-protein MSFYIDRTLYNSDNQEIKESELISLIPKVIIILAEPGAGKSYLLNNISQQLGVQKKTANIFIHLPVEKTKLLIIDAFDELVRIDSSSVTKSLVMAEKTSAEKIILSSRSSEWSESYSQTCKELFREAPLLLYLKPFNQQEQKEIFNSYYPEENAEIFLQEAGKIELAPLLSNPLFLRLFSKSYIENNKIFENRYSAFKIAIEGLARESNPSHSSSLPGNKKVELVEDVFCKLILSGSEGISTSDFSSKHLFPNIKNLNSDTNIFQILSTQFFTPGENQGQHRPIHKIVVEYCAGRYLANKLTSSTPLSLNKILSIIAPNNIVRDELRGLFSWIAVLSENQQIQEKLINLDPYAILANGDPSLLLPRSKRILLSRLKDVNQKDPYFTRHDIWRNFSISDFFSDDMIDELKELLSEKYRSGNLQRVLLKLLLESSVVSKVVDELRTIICDTNPKYKELSIRRLAGECLLKVKDYPIKDVWAQLISEKNSVSLSIASEIMNLSDRSYFDINDYITFLNICSELYPTNYEIERVFGKRYFIRLFIQELELDLTIKLLDSLSNNLACTCQKKYDCECRVGISKIIGMLLDRYFELSSKFNMSKIWEWIKNLYFQNGISKEQSLSVKTLSENDELRQGIIKLAFENLWVREEIHNMKFYHFEWHSHSGLNFQYQDLRFIIDLSFETNNIELWKYFIARHDFYNKEKKPNLLRRYMRNQALQKPAFLMAWYRINQESKEFYQRNTLKYNCKKSIRNINRKQRKSLARNIKYIQQNRELIENGSHWDSLLSFSYTFLQEPEKIAEKFGDEQLVRNALYNCINYIQSNIPNLTDLAKLHSDSSVLNIEIILYASCIEIFTREGTLKSLPKDILEALYTNFSVYLIGMEHNIRRQLKNETEKSLFNNIEAIEKFINDYIEPQLSNANYKNSPVCWLEKDLFKQFQKKYALEWLIKYPHLKLPEIEKLMSMAIQSGNRESLKLLVKKRCDELVSGANNQQNGGQKTQRDFWYIHAFYLLDKDYESYWKELTKDPNSIFILEKDLGIFGDYKDWLALSPEKIELILDTFWQEYPEVELSDIYGSESPDDEKAYRFLTNVIFFLGKNESDNTITVIDRLLNTPEFINLHSDLKSIRYNYLKKTVMQSFQAPKPEQIVKLIERDEIISVENLRAVIIEELQSYQDDLNGHDITTKSIFYNGNMELTNRVDENTATQYIAERLRRLENRQIIVSREHYMKDDKRCDIVLSKLIEGNRKIVPIEVKGQWHSEVYSAFENQLNRLYSIHPDSDGQGIYLVLWFGSDEKIAGVKRHGINSAQELYWKVYEKISEELRNRIDLFVLDLSS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_020515_2 | 950110-950541 | Orphan |
NA
Consensus repeat of NC_020515_2
|
6 spacers
spacers of NC_020515_2
>2.1|950142|34|NC_020515|CRT TACTTGAGGGGGTAACGGTATGCAAAACCATTAA >2.2|950208|34|NC_020515|CRT,PILER-CR,CRISPRCasFinder GATTACCTGCTACACGATTTACCCAACCTTTGCC >2.3|950274|35|NC_020515|CRT,PILER-CR,CRISPRCasFinder GCACGAACTCCGCCGTGTTCTTTCATAAAGTTGCG >2.4|950341|35|NC_020515|CRT,PILER-CR,CRISPRCasFinder ATTTCTGAAACAGGGATTTGCGTTTCATTCCATTT >2.5|950408|35|NC_020515|CRT,PILER-CR,CRISPRCasFinder TTGCTGCCGTCTCTAAATAGCCACTATCAGCAAAT >2.6|950475|35|NC_020515|CRT,PILER-CR,CRISPRCasFinder CAAGCAACACGATTTATTTGTAATACCGCTCTGTC |
CRISPR arrays and Neighbor proteins around NC_020515_2
The CRISPR arrays of NC_020515_2 >merge|NC_020515|2|950110-950541|CRT,PILER-CR,CRISPRCasFinder AAATAGAATGGCAGCTACCCGTAGGTAGCTGCTACTTGAGGGGGTAACGGTATGCAAAACCATTAAGTTTCAACTCTCAGCCACCTACAGGTAGCTGCGATTACCTGCTACACGATTTACCCAACCTTTGCCGTTTCAACTCTCAGCCACCTACAGGTAGCTGCGCACGAACTCCGCCGTGTTCTTTCATAAAGTTGCGGTTTCAACTCTCAGCCACCTACAGGTAGCTGCATTTCTGAAACAGGGATTTGCGTTTCATTCCATTTGTTTCAACTCTCAGCCACCTACAGGTAGCTGCTTGCTGCCGTCTCTAAATAGCCACTATCAGCAAATGTTTCAACTCTCAGCCACCTACAGGTAGCTGCCAAGCAACACGATTTATTTGTAATACCGCTCTGTCGTTTCAACTCTCAGCCACCTACAGGTAGCTGC >NC_020515|2|2|950110-950541|CRT AAATAGAATGGCAGCTACCCGTAGGTAGCTGC TACTTGAGGGGGTAACGGTATGCAAAACCATTAA GTTTCAACTCTCAGCCACCTACAGGTAGCTGC GATTACCTGCTACACGATTTACCCAACCTTTGCC GTTTCAACTCTCAGCCACCTACAGGTAGCTGC GCACGAACTCCGCCGTGTTCTTTCATAAAGTTGCG GTTTCAACTCTCAGCCACCTACAGGTAGCTGC ATTTCTGAAACAGGGATTTGCGTTTCATTCCATTT GTTTCAACTCTCAGCCACCTACAGGTAGCTGC TTGCTGCCGTCTCTAAATAGCCACTATCAGCAAAT GTTTCAACTCTCAGCCACCTACAGGTAGCTGC CAAGCAACACGATTTATTTGTAATACCGCTCTGTC GTTTCAACTCTCAGCCACCTACAGGTAGCTGC >NC_020515|2|2|950176-950541|PILER-CR GTTTCAACTCTCAGCCACCTACAGGTAGCTGC GATTACCTGCTACACGATTTACCCAACCTTTGCC GTTTCAACTCTCAGCCACCTACAGGTAGCTGC GCACGAACTCCGCCGTGTTCTTTCATAAAGTTGCG GTTTCAACTCTCAGCCACCTACAGGTAGCTGC ATTTCTGAAACAGGGATTTGCGTTTCATTCCATTT GTTTCAACTCTCAGCCACCTACAGGTAGCTGC TTGCTGCCGTCTCTAAATAGCCACTATCAGCAAAT GTTTCAACTCTCAGCCACCTACAGGTAGCTGC CAAGCAACACGATTTATTTGTAATACCGCTCTGTC GTTTCAACTCTCAGCCACCTACAGGTAGCTGC >NC_020515|2|2|950176-950541|CRISPRCasFinder GTTTCAACTCTCAGCCACCTACAGGTAGCTGC GATTACCTGCTACACGATTTACCCAACCTTTGCC GTTTCAACTCTCAGCCACCTACAGGTAGCTGC GCACGAACTCCGCCGTGTTCTTTCATAAAGTTGCG GTTTCAACTCTCAGCCACCTACAGGTAGCTGC ATTTCTGAAACAGGGATTTGCGTTTCATTCCATTT GTTTCAACTCTCAGCCACCTACAGGTAGCTGC TTGCTGCCGTCTCTAAATAGCCACTATCAGCAAAT GTTTCAACTCTCAGCCACCTACAGGTAGCTGC CAAGCAACACGATTTATTTGTAATACCGCTCTGTC GTTTCAACTCTCAGCCACCTACAGGTAGCTGC
>NC_020515.1|WP_015432295.1|948686_950093_+|YcjX-family-protein MFNRIQNKVTHFVQRGFDNHIRLAVTGLSRSGKTAFITSFVDQLLHIQPEKNAHLNLFAAARNGQILSVKRIAQGDPTVPRFEYDRNRACFEQEEPKWCPSTTGISEIRLAIRYRNRSSFSRIFKETSTLYLDIFDYPGEWLLDLPLMSQSFKEWSQAQQVVHTGERAKLAQAWLNEVKKLDLSAVADENRLADLSDIYTAYLLQCKQAGMQYIQPGRFVLPNAERGAPVYQFFPLLDLSEQEWQSLENSAVNSVFHTLKKRYRQYQDKIVKPFYKDYFSQFDRQVILADCLTPLNHSQQAFIEMKIGLQQLFKHFHYGNRSLFHRLFSSNIDKLLFAATKADHITSDQLPNLESLMRQLVQEGGRHAEFDGIETAYQAISAIRATEAVTVSENGQSFKAIRGVRTKDKRQVTQFAGSVPNRLPNSDFWQNHTFDFDQFEPRKIDFDQALPHLRMDSVLQFLLGDLFE >NC_020515.1|WP_015432294.1|947759_948599_-|CYTH-domain-containing-protein MQDEIEIKIMLLPENIALIKQWLTQQPIQKYQRQTLGNTYFDTPELFFAKAQMGLRVRTKNNQHEITLKMKGDIVGGLHIRPEYNLDLPNSQPDFKRLVSHYNLQIANSDAIAENLQATFSTDFERESWLLNYQHSQIEIALDMGIIKNRFGEEPICEVEFELKQGNLADLFALIQNMPKRDGMWLSSLSKAQRGYFVGRMDKIAKEIEKLSACHLDNMAEVERYQVQQQMADFLRLSPEATILRSQLGLEHIPLGDIFDYLTSARYLDQQLSHMQQRC >NC_020515.1|WP_015432293.1|947243_947708_+|hypothetical-protein MKKLLALASIATVGALTVSVQAVAQTAEIPRAYLSVMDMSGKVAKNTGNQIYSVSNSNLQLCWAASGIPLEPANLNKVTELFIAPSVKAKFVKPGATIKVENERSAITSMMGSADGKMIQTCWRFDQNDPLGKYKLRLNVNDIQFDDLVFELVK >NC_020515.1|WP_015432292.1|947087_947243_+|YoaH-family-protein MDNLLLNLTHEQQQQAVEKIQQLMQQGVSSGEAIAIVAQELRHTYSKNSEN >NC_020515.1|WP_025267143.1|946341_947070_+|tRNA-pseudouridine(65)-synthase-TruC MTLDILYRDESLIAINKPAGMLVHRSWLDKHETVFAMQTLRDQIGQHVFPIHRLDRPTSGVLLFALNAEMARQMSEQFEQHQLMKSYLAVVRGYLNGEARIDYPLKVKLDKIADKFSTAKEAQQAVTDYKHLAGIEMPYPAGKYQTARYSLVQLWPQTGRKHQLRRHMKHLFHPIMGDTNYGDLHQNRALTENTGCDRLFLHANSLQFTHPDTLQKIMINAPLDHQWQQLFLQFGWNFPQFF >NC_020515.1|WP_025267144.1|945983_946292_+|YqcC-family-protein MKTQVRVQLDRLQVVLHRYQLWETEAPSTEKLASTQPFALDTLTATQWLQWIFIPRMHALLDANAELPTNFAVSPYLEESLKNERYLAELVQPIVEIEKLLK >NC_020515.1|WP_015432289.1|944516_945932_-|bifunctional-indole-3-glycerol-phosphate-synthase-TrpC/phosphoribosylanthranilate-isomerase-TrpF MQNQPTILQKIVKDKALWVANAEQQFPLSLFQAQLQPSDRDFYAALAKGSHQQPVYILECKKASPSKGLIRAEFDLDAIAQVYKHYASVISVLTDEQYFQGNFHFISQVRNQVSQPILCKDFMISSYQVYLARYHHADAILLMLSVVDDPTYRQLSDLAHSLGMGVLTETSNEQEFERALALGAKVIGVNNRNLHDLSIDMNRIIALVAKYRDQIPADVRLISESGIYDHSQVKSISQSAHGFLIGSSLMGSHDLNNAVRAVIFGENKICGLTRPQDVQAAYANGALYGGLIFAEKSVRALSLRQAQELVVQALLRFVGVFQNQAVEFVVKIAKQLELYAVQLHGNEDELYIAQLAEQLEGNVQIWKAISIDVDAQRFDFADNPLIQRYILDSKTTNQQGGTGKTFNWALIPEKLKSKAILAGGINLENLEQALQVGCLGVDLNSGLETAKGIKHADKIAQAFQLIRLHAK >NC_020515.1|WP_015432288.1|943474_944233_+|molybdopterin-synthase-adenylyltransferase-MoeB MELSDQEMLRYNRQIVLKNVDFDGQEKLKASRVLVVGVGGLGCAASQYLASGGIGHLTLVDFDSVSLSNLQRQILHTDATIGEPKVFSAQQRLQQLNPHIEIKPIHAELSELQWQTLIPEYDVVLDCTDNVNIRNLLNQICFQHKIPLVSGSAIRFEGQLSVFRYTDNEPCYQCLSTLFGENILSCVEAGVIAPIVGVVGSLQALECIKVLLGIGKTLSGKLLMIDGLNFSVREMKLPKQPHCQICKNFLES >NC_020515.1|WP_015432287.1|942225_943455_+|molybdopterin-molybdotransferase-MoeA MSLLSLSSALENLLTCLPMPNQFETIALHEAANRVLAEDVFSPINVPNFDNSAMDGYAISLQNFVENQPLAVIGKAFAGNPFSGKIQSGQCVRIMTGAKIPENTDAVVMQEDTIIRDDGTMMITKPVKLGANIRRVGEDVAQGSLVLAKGSQLNVSSLPLLASLGIAEVKVFPKVKVAILSTGDELVSVGEPLNEGQIYDTNRFTVRLMLEKLNCEILDFGTLPDNPEIFERTFVQAQRQADVLITSGGVSVGEADFTKTVIEKLGKIDFWKIAMKPGKPFAFGKLEKAWFFGLPGNPVSALVTFYQLAQPALMKLAGFSAEKIANFSPKLTACAAVSMKKAVGRQDFQRGFFYADENGQLVVKTVGTQGSHIFSAFNESNCFIVLEQERGNVEVGERVVIQPFNLLLS >NC_020515.1|WP_025267145.1|941589_942210_+|MarC-family-protein MFDSLVVQFVVLWAVIDPVGSIPVYLAKTIGLSPDDRRKIARNATLIAAGILLFFLVLGQWLLEAMQIPLSAFQIAGGLVLLLFALTMIFGQSKPDQEIKMKSSLSELAVYPLAVPSIASPGAMMAVVLLTDNHRYNLLEQAITGGIMLAVVAITYVLLLLANHIQKYIGNAGAAIISRVMGLILSAVAVNNILVGLRDFVQQAAL >NC_020515.1|WP_015432296.1|950876_952001_-|alanine-racemase MKPATATLSGKNLRHNMQLIKTLAPHSKHCAVAKANAYGQGLHHLVRNLNDLVDGFCVARIKEALAIQESGYEGKILLLEGFFDREELLKTVSRRFDTVVHCIEQLELLEQVSAEWQTEQAKGFWKRKAKIYFPITVWLKIDTGMHRLGIHPEQIAEFHQRLTACALVEKVNFVSHFSRADEPDCGYTEKQIAIFEQATKGYEGERSISASNGILYWQQAHYDWVRPGIIMHGISPHTHPITSLGFKPVMKFASSLIAIRSHKAGEPVGYGGAWVAEQDTKIGVIAVGYGDGYPRNAPPGTPVFINGRRVPIVGRVSMDMMTVDLGINSNDKVGDEAELWGENLLIEEVANAMGVINYELITKLTPRVLFEYLD >NC_020515.1|WP_015432297.1|952124_953048_+|SPFH/Band-7/PHB-domain-protein MFGLDFSLLPILFVLLIVFTLSSTIKIVPQGYHWTVERFGRYTKTLTPGLNIVVPFIERIGRKINMMEQVLDIPSQEVISKDNASVAIDAVCFVQVVEARRAAYEVNNLEDAIVNLTMTNMRTVLGSMDLDDMLSQRDLINGKLLTIVDEATNIWGVKVTRIEIRDVRPPRELVEAMNAQMKAERNKRADILEAEGIRQAEILRAEGEKQSRILKAEGERQEAFLQAEARERAAEAEAKATQMVSEAIAKGDTTAINYFIAQKYTEALKDIGSADNSKVVLMPLEAGNLIGSVAGIAELLKSNKSSS >NC_020515.1|WP_015432298.1|953084_953522_+|NfeD-family-protein MDWLLNWAGWLSLGFLLLALELIVPGVFIMWWGLAALILAAVSALLPNLEPAYQVTIFAVLAITFSLVWWKYQHGKDQQDDEHSSLNSREHAMIGARGVIVEILENGIARGKFDDTTWRVIGENLRIGDSVQVFRVEGITLFVKK >NC_020515.1|WP_015432301.1|957816_958269_-|DUF441-domain-containing-protein MTLQFNAVALLLVVLIILGFISQNSAVTISAAVLLIMQQTLLSKFIPFVDQYGLKIGIIILTIGVLSPLVSGRITLPELSQLLNWKMALSIVAGVLVAWLGGRGVNLMGSQPVLVTGLLIGTVIGVAFLKGVPVGPLIAAGILSLVLGKS >NC_020515.1|WP_015432302.1|958329_958599_-|hypothetical-protein MDSTMSLRFDKLRFVKRLQEANQTPEMAEAFADALDGALEQSQSPLATKADLQLELEKLKNEINTTIFKAITLNITILGFLMAMMKFIN >NC_020515.1|WP_015432303.1|958695_959271_-|nucleotide-exchange-factor-GrpE MTNQTEKEPVEQEIVEETVEQAVETEQENANVEIDPLDAANARIAELEAYIAEADAREQDIALRARAEIENVRRRAEQDVEKAHKFALEKFSKELLNVVDNLERGLQALEGAEESVKSGVELTHKGLVSTLAQFGVEAVGVVGEAFNPDLHQAISMQPAEGIEANHISVVLQKGYTLQGRVIRPAMVMVAG >NC_020515.1|WP_015432304.1|959431_961198_+|aspartate--tRNA-ligase MMRSHYCGALNRSNVGEQVTLSGWVHRVRNLGRFIFMQIRDREGIVQVFFDEKDEALFKAASALRNEACVQIKGEVIARDESQINPDMATGEIEVLVRELKVYNNAEVLPLDFNQNNTEEQRLKYRYLDLRRPEMAEKLKTRAKITSFVRRFMDDHGFLDIETPMLTKATPEGARDYLVPSRVHKGKFYALPQSPQLFKQLLMMSGFDRYYQIVKCFRDEDLRADRQPEFTQIDVETSFMTAEEVRSTMEEMIHGLWLDRLNVDLGKFPMMTWQEAMTRFGSDKPDLRNPLELVDVADILKNVEFKVFAEPANSPDGRVTVLRVPNGASLTRKQIDEYTQFVGIYGAKGLAWAKINDVNAGLEGVQSPVAKFLNEEVINALIERTQAQTGDILFFGADKWQTVTDSMGALRLKVGRDLELTDLTAWKPLWVVDFPMFERDEEGNLSAMHHPFTSPKDFTPEQLAADPTNAVANAYDMVINGYEVGGGSVRIYDPKMQQTVFNILGINEEEQREKFGFLLDALKFGTPPHAGLAFGLDRLTMLITGTENIRDVIAFPKTTAAACLMTEAPSFANPQALEELAIRTIPQE >NC_020515.1|WP_015432305.1|961312_962227_+|glutaminase MNYQTIISTIYQRIRAEENGGELAMYIPELANISPDKFGVAYFDLKDSTIGVGDYQEKFSIQSIVKVLSLVFAYKHLGDSIWKRVNVEPSGTSFNSLLQLETDCGIPRNPFINAGAIVICDILLSLFENPKQAFLDFVRDLANNSNIHYSEKVAESEKAVGYRNFALCYYIKSFGNIQNDPNEVLDFYFHICSIEMSCEEIAYAFSFLANDGVKLHDNQQVLNKSQIKRTNALMQTCGFYDESGEFAFRVGLPGKSGVGGGIVAIMPNHHCITVWSPKLNEKGNSYRGMKFLEEFTTQTKISVF >NC_020515.1|WP_025267140.1|962278_962566_-|ribosome-associated-translation-inhibitor-RaiA MTINISSKQMDVTPAIRSHIEDRLAKLNKWHTQLINPHFMIHKLPNEYEVEASIGTPIGDLFAKAKHEDLYQAINEVEVKLEGQLVKLKEKKEHR >NC_020515.1|WP_015432307.1|962742_963684_+|LpxL/LpxP-family-Kdo(2)-lipid-IV(A)-lauroyl/palmitoleoyl-acyltransferasee MAAEKSLPPFQMTFLHPKFWGLWLGLGLFRLMLCLPYPVLVKIGLGLGKLFGSLGFGKKRIRIAKKNLELCFPEYSEAQIQQILAKNIQSVGMAIIETGMAWFWSDKRILKWSKIEGLEHLKNPPQGTGIIFVGVHFLTLELGARIVGLHHQGIGVYRPNDNPLLDWIQFRGRVRSNKAMLDRKDLRGMIKVLRAGETIWYAPDHDYGRKNSVYVPFFAFPTACTTAGTRMLLRSAPNSIVVPFTPMRNEDFSGYTVKISPMVDFGDCDDEISTATKMNKVVEQEIMQAQSQYMWLHRRFKHLPDGTDGKLYS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_020515_3 | 956232-956348 | Orphan |
NA
Consensus repeat of NC_020515_3
|
1 spacers
spacers of NC_020515_3
>3.1|956266|49|NC_020515|CRISPRCasFinder CTCACGCTCGCGCGTCGTCGTACGTCGTACGTACGTACGTAGTACGTTA |
CRISPR arrays and Neighbor proteins around NC_020515_3
The CRISPR arrays of NC_020515_3 >merge|NC_020515|3|956232-956348|CRISPRCasFinder AGTGCCATCATTAGAATGGTTTTCCCCCCCCCCTCTCACGCTCGCGCGTCGTCGTACGTCGTACGTACGTACGTAGTACGTTAAGTGCCATCATTAGAATGGTTTTCCCCCCTCCCT >NC_020515|3|3|956232-956348|CRISPRCasFinder AGTGCCATCATTAGAATGGTTTTCCCCCCCCCCT CTCACGCTCGCGCGTCGTCGTACGTCGTACGTACGTACGTAGTACGTTA AGTGCCATCATTAGAATGGTTTTCCCCCCTCCCT
>NC_020515.1|WP_015432298.1|953084_953522_+|NfeD-family-protein MDWLLNWAGWLSLGFLLLALELIVPGVFIMWWGLAALILAAVSALLPNLEPAYQVTIFAVLAITFSLVWWKYQHGKDQQDDEHSSLNSREHAMIGARGVIVEILENGIARGKFDDTTWRVIGENLRIGDSVQVFRVEGITLFVKK >NC_020515.1|WP_015432297.1|952124_953048_+|SPFH/Band-7/PHB-domain-protein MFGLDFSLLPILFVLLIVFTLSSTIKIVPQGYHWTVERFGRYTKTLTPGLNIVVPFIERIGRKINMMEQVLDIPSQEVISKDNASVAIDAVCFVQVVEARRAAYEVNNLEDAIVNLTMTNMRTVLGSMDLDDMLSQRDLINGKLLTIVDEATNIWGVKVTRIEIRDVRPPRELVEAMNAQMKAERNKRADILEAEGIRQAEILRAEGEKQSRILKAEGERQEAFLQAEARERAAEAEAKATQMVSEAIAKGDTTAINYFIAQKYTEALKDIGSADNSKVVLMPLEAGNLIGSVAGIAELLKSNKSSS >NC_020515.1|WP_015432296.1|950876_952001_-|alanine-racemase MKPATATLSGKNLRHNMQLIKTLAPHSKHCAVAKANAYGQGLHHLVRNLNDLVDGFCVARIKEALAIQESGYEGKILLLEGFFDREELLKTVSRRFDTVVHCIEQLELLEQVSAEWQTEQAKGFWKRKAKIYFPITVWLKIDTGMHRLGIHPEQIAEFHQRLTACALVEKVNFVSHFSRADEPDCGYTEKQIAIFEQATKGYEGERSISASNGILYWQQAHYDWVRPGIIMHGISPHTHPITSLGFKPVMKFASSLIAIRSHKAGEPVGYGGAWVAEQDTKIGVIAVGYGDGYPRNAPPGTPVFINGRRVPIVGRVSMDMMTVDLGINSNDKVGDEAELWGENLLIEEVANAMGVINYELITKLTPRVLFEYLD >NC_020515.1|WP_015432295.1|948686_950093_+|YcjX-family-protein MFNRIQNKVTHFVQRGFDNHIRLAVTGLSRSGKTAFITSFVDQLLHIQPEKNAHLNLFAAARNGQILSVKRIAQGDPTVPRFEYDRNRACFEQEEPKWCPSTTGISEIRLAIRYRNRSSFSRIFKETSTLYLDIFDYPGEWLLDLPLMSQSFKEWSQAQQVVHTGERAKLAQAWLNEVKKLDLSAVADENRLADLSDIYTAYLLQCKQAGMQYIQPGRFVLPNAERGAPVYQFFPLLDLSEQEWQSLENSAVNSVFHTLKKRYRQYQDKIVKPFYKDYFSQFDRQVILADCLTPLNHSQQAFIEMKIGLQQLFKHFHYGNRSLFHRLFSSNIDKLLFAATKADHITSDQLPNLESLMRQLVQEGGRHAEFDGIETAYQAISAIRATEAVTVSENGQSFKAIRGVRTKDKRQVTQFAGSVPNRLPNSDFWQNHTFDFDQFEPRKIDFDQALPHLRMDSVLQFLLGDLFE >NC_020515.1|WP_015432294.1|947759_948599_-|CYTH-domain-containing-protein MQDEIEIKIMLLPENIALIKQWLTQQPIQKYQRQTLGNTYFDTPELFFAKAQMGLRVRTKNNQHEITLKMKGDIVGGLHIRPEYNLDLPNSQPDFKRLVSHYNLQIANSDAIAENLQATFSTDFERESWLLNYQHSQIEIALDMGIIKNRFGEEPICEVEFELKQGNLADLFALIQNMPKRDGMWLSSLSKAQRGYFVGRMDKIAKEIEKLSACHLDNMAEVERYQVQQQMADFLRLSPEATILRSQLGLEHIPLGDIFDYLTSARYLDQQLSHMQQRC >NC_020515.1|WP_015432293.1|947243_947708_+|hypothetical-protein MKKLLALASIATVGALTVSVQAVAQTAEIPRAYLSVMDMSGKVAKNTGNQIYSVSNSNLQLCWAASGIPLEPANLNKVTELFIAPSVKAKFVKPGATIKVENERSAITSMMGSADGKMIQTCWRFDQNDPLGKYKLRLNVNDIQFDDLVFELVK >NC_020515.1|WP_015432292.1|947087_947243_+|YoaH-family-protein MDNLLLNLTHEQQQQAVEKIQQLMQQGVSSGEAIAIVAQELRHTYSKNSEN >NC_020515.1|WP_025267143.1|946341_947070_+|tRNA-pseudouridine(65)-synthase-TruC MTLDILYRDESLIAINKPAGMLVHRSWLDKHETVFAMQTLRDQIGQHVFPIHRLDRPTSGVLLFALNAEMARQMSEQFEQHQLMKSYLAVVRGYLNGEARIDYPLKVKLDKIADKFSTAKEAQQAVTDYKHLAGIEMPYPAGKYQTARYSLVQLWPQTGRKHQLRRHMKHLFHPIMGDTNYGDLHQNRALTENTGCDRLFLHANSLQFTHPDTLQKIMINAPLDHQWQQLFLQFGWNFPQFF >NC_020515.1|WP_025267144.1|945983_946292_+|YqcC-family-protein MKTQVRVQLDRLQVVLHRYQLWETEAPSTEKLASTQPFALDTLTATQWLQWIFIPRMHALLDANAELPTNFAVSPYLEESLKNERYLAELVQPIVEIEKLLK >NC_020515.1|WP_015432289.1|944516_945932_-|bifunctional-indole-3-glycerol-phosphate-synthase-TrpC/phosphoribosylanthranilate-isomerase-TrpF MQNQPTILQKIVKDKALWVANAEQQFPLSLFQAQLQPSDRDFYAALAKGSHQQPVYILECKKASPSKGLIRAEFDLDAIAQVYKHYASVISVLTDEQYFQGNFHFISQVRNQVSQPILCKDFMISSYQVYLARYHHADAILLMLSVVDDPTYRQLSDLAHSLGMGVLTETSNEQEFERALALGAKVIGVNNRNLHDLSIDMNRIIALVAKYRDQIPADVRLISESGIYDHSQVKSISQSAHGFLIGSSLMGSHDLNNAVRAVIFGENKICGLTRPQDVQAAYANGALYGGLIFAEKSVRALSLRQAQELVVQALLRFVGVFQNQAVEFVVKIAKQLELYAVQLHGNEDELYIAQLAEQLEGNVQIWKAISIDVDAQRFDFADNPLIQRYILDSKTTNQQGGTGKTFNWALIPEKLKSKAILAGGINLENLEQALQVGCLGVDLNSGLETAKGIKHADKIAQAFQLIRLHAK >NC_020515.1|WP_015432301.1|957816_958269_-|DUF441-domain-containing-protein MTLQFNAVALLLVVLIILGFISQNSAVTISAAVLLIMQQTLLSKFIPFVDQYGLKIGIIILTIGVLSPLVSGRITLPELSQLLNWKMALSIVAGVLVAWLGGRGVNLMGSQPVLVTGLLIGTVIGVAFLKGVPVGPLIAAGILSLVLGKS >NC_020515.1|WP_015432302.1|958329_958599_-|hypothetical-protein MDSTMSLRFDKLRFVKRLQEANQTPEMAEAFADALDGALEQSQSPLATKADLQLELEKLKNEINTTIFKAITLNITILGFLMAMMKFIN >NC_020515.1|WP_015432303.1|958695_959271_-|nucleotide-exchange-factor-GrpE MTNQTEKEPVEQEIVEETVEQAVETEQENANVEIDPLDAANARIAELEAYIAEADAREQDIALRARAEIENVRRRAEQDVEKAHKFALEKFSKELLNVVDNLERGLQALEGAEESVKSGVELTHKGLVSTLAQFGVEAVGVVGEAFNPDLHQAISMQPAEGIEANHISVVLQKGYTLQGRVIRPAMVMVAG >NC_020515.1|WP_015432304.1|959431_961198_+|aspartate--tRNA-ligase MMRSHYCGALNRSNVGEQVTLSGWVHRVRNLGRFIFMQIRDREGIVQVFFDEKDEALFKAASALRNEACVQIKGEVIARDESQINPDMATGEIEVLVRELKVYNNAEVLPLDFNQNNTEEQRLKYRYLDLRRPEMAEKLKTRAKITSFVRRFMDDHGFLDIETPMLTKATPEGARDYLVPSRVHKGKFYALPQSPQLFKQLLMMSGFDRYYQIVKCFRDEDLRADRQPEFTQIDVETSFMTAEEVRSTMEEMIHGLWLDRLNVDLGKFPMMTWQEAMTRFGSDKPDLRNPLELVDVADILKNVEFKVFAEPANSPDGRVTVLRVPNGASLTRKQIDEYTQFVGIYGAKGLAWAKINDVNAGLEGVQSPVAKFLNEEVINALIERTQAQTGDILFFGADKWQTVTDSMGALRLKVGRDLELTDLTAWKPLWVVDFPMFERDEEGNLSAMHHPFTSPKDFTPEQLAADPTNAVANAYDMVINGYEVGGGSVRIYDPKMQQTVFNILGINEEEQREKFGFLLDALKFGTPPHAGLAFGLDRLTMLITGTENIRDVIAFPKTTAAACLMTEAPSFANPQALEELAIRTIPQE >NC_020515.1|WP_015432305.1|961312_962227_+|glutaminase MNYQTIISTIYQRIRAEENGGELAMYIPELANISPDKFGVAYFDLKDSTIGVGDYQEKFSIQSIVKVLSLVFAYKHLGDSIWKRVNVEPSGTSFNSLLQLETDCGIPRNPFINAGAIVICDILLSLFENPKQAFLDFVRDLANNSNIHYSEKVAESEKAVGYRNFALCYYIKSFGNIQNDPNEVLDFYFHICSIEMSCEEIAYAFSFLANDGVKLHDNQQVLNKSQIKRTNALMQTCGFYDESGEFAFRVGLPGKSGVGGGIVAIMPNHHCITVWSPKLNEKGNSYRGMKFLEEFTTQTKISVF >NC_020515.1|WP_025267140.1|962278_962566_-|ribosome-associated-translation-inhibitor-RaiA MTINISSKQMDVTPAIRSHIEDRLAKLNKWHTQLINPHFMIHKLPNEYEVEASIGTPIGDLFAKAKHEDLYQAINEVEVKLEGQLVKLKEKKEHR >NC_020515.1|WP_015432307.1|962742_963684_+|LpxL/LpxP-family-Kdo(2)-lipid-IV(A)-lauroyl/palmitoleoyl-acyltransferasee MAAEKSLPPFQMTFLHPKFWGLWLGLGLFRLMLCLPYPVLVKIGLGLGKLFGSLGFGKKRIRIAKKNLELCFPEYSEAQIQQILAKNIQSVGMAIIETGMAWFWSDKRILKWSKIEGLEHLKNPPQGTGIIFVGVHFLTLELGARIVGLHHQGIGVYRPNDNPLLDWIQFRGRVRSNKAMLDRKDLRGMIKVLRAGETIWYAPDHDYGRKNSVYVPFFAFPTACTTAGTRMLLRSAPNSIVVPFTPMRNEDFSGYTVKISPMVDFGDCDDEISTATKMNKVVEQEIMQAQSQYMWLHRRFKHLPDGTDGKLYS >NC_020515.1|WP_015432308.1|963771_964368_+|RdgB/HAM1-family-non-canonical-purine-NTP-pyrophosphatase MTKQKIVLATGNKGKVKEMADVLADFGFEVVAQSEFGIESPEETGLTFVENALIKARYAAQMTGLPAIADDSGLAVDALGGEPGLYSARYAGVDGDDAANRQKLLTEMANVADENRTAKFVSCIVMLQHATDPTPKIAIGECFGTILNEERGENGFGYDSLFFYPPKNCSFAELETVEKKKISHRAIALQSLKQQLQK >NC_020515.1|WP_015432309.1|964378_964696_+|YbjQ-family-protein MIITTTPTIEGKQISEYKGLVFGEVVVGANIIRDFFAGITDIIGGRSGAYESKLNAARKEALKELEFEARKAGANAVVGVSFDYQTLGTKDMFVVAATGTAVVVQ >NC_020515.1|WP_015432310.1|964695_965160_+|YcgN-family-cysteine-cluster-protein MLQKNSQNLPLEPNFWQKKSLLEMNETEWEALCDGCGKCCYRKYIQGRGKRERLYYTRVACNLLDVETGKCTNYPNRFKIECDCTKLTKKNLPDFGWLPQTCAYRLLYEGKPLPDWHPLISKDAHSVKTAGMLIPNGIHEKDVIDWFEFVIDEI |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_020515_4 | 1552928-1554892 | TypeI |
NA
Consensus repeat of NC_020515_4
|
29 spacers
spacers of NC_020515_4
>4.1|1552960|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT AGGTTAAATCTAGACGAGATAATGGAGATTGCCAA >4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT TGTTAAAAATAAACCCTGCACGGGGCAGGGTCGG >4.3|1553093|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT ACCGAAACCGAAATCTGGGCAAGTGAGAGCGACA >4.4|1553159|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT ACATTAGGAACGCAAGACCAAGCCTCATTACAAGGC >4.5|1553227|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT TTGCAGAGGATAGAAATTTCCGCACCCATATTTTCG >4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT TTGATATTATTGATAATATGGAAAAAGAGATGAC >4.7|1553361|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT ACTCGAAGCCTTAGAGCCGGATTTTGTGCCAATC >4.8|1553427|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT AAATGAAAGCGTATAAATCTCGCCACTTTGCAAT >4.9|1553493|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT TGGCGTGGTAAGTACCGATAACATTCCCGAAGAC >4.10|1553559|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT TATTACAACATTTGACAATCAAACATTATGGGAG >4.11|1553625|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT CAAAGGTGCTATTGTACTTAGTAGCAAGTCAATTC >4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT TAAGCAATTATAAAGAAAGAATAAAAGACGTTGC >4.13|1553758|37|NC_020515|PILER-CR,CRISPRCasFinder,CRT ATCACCGTCCAACTTACCTTTGTTGAATGCTTGTGAG >4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT TAATATGCCCTTGCATAAATTCCACTTTGCCGTG >4.15|1553893|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT CTGCCGAGTAATAAGCCAAGCAGAATTTCAAGCA >4.16|1553959|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT TTGTGAACATCGAGCAAAATATCGGTCATTGCAC >4.17|1554025|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT ATCATAAGTGGCTTTACGAGTGGCTCTTGGCTCTT >4.18|1554092|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT CCTAGCCCCCAAGCACCTTTGGCAATAGTCTGTG >4.19|1554158|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT GAGATTTAGTAATTGCTCCATTGAAACGTGGAGA >4.20|1554224|37|NC_020515|PILER-CR,CRISPRCasFinder,CRT GTGCCGAGACTTGCCGGTGTATCGGTCACAGCTAAAC >4.21|1554293|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT ATAGCCGCCCAGCTCCTTAATCTTATCCAGCGACAT >4.22|1554361|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT AAATTATGCCTTAATAATACTTTAAGTTTTTAAAA >4.23|1554428|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT TCTTCTGAGCTTTCCAGGCATTAAAACCTTGCTCA >4.24|1554495|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT TCAGGGTCGCATAACTCCACTTTGCAGCGATGTT >4.25|1554561|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT AGTGAATATACCGCTCAGCGTGCGTTAGAAAATC >4.26|1554627|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT GCCGTTTAGATAACCTTGTCAGGCTTGGCACGATT >4.27|1554694|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT ACTATTTTTCACGTTATAGAGATCGTATTAGCAA >4.28|1554760|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT ATTACCCTGCCAAATTATCGGTTGGCTGAGTTCG >4.29|1554826|35|NC_020515|CRISPRCasFinder,CRT TCATCGGGCTTTAAGGTCAAGGCATTATCCACTAA |
cas3,cas5,cas8c,cas7,cas4,cas1,cas2 |
CRISPR arrays and Neighbor proteins around NC_020515_4
The CRISPR arrays of NC_020515_4 >merge|NC_020515|4|1552928-1554892|PILER-CR,CRISPRCasFinder,CRT CCAGCTACGTACGCGTAGCTGTGAGTTGAAACAGGTTAAATCTAGACGAGATAATGGAGATTGCCAACCAGCTACGTACGCGTAGCTGTGAGTTGAAACTGTTAAAAATAAACCCTGCACGGGGCAGGGTCGGCCAGCTACGTACGCGTAGCTGTGAGTTGAAACACCGAAACCGAAATCTGGGCAAGTGAGAGCGACACCAGCTACGTACGCGTAGCTGTGAGTTGAAACACATTAGGAACGCAAGACCAAGCCTCATTACAAGGCCCAGCTACGTACGCGTAGCTGTGAGTTGAAACTTGCAGAGGATAGAAATTTCCGCACCCATATTTTCGCCAGCTACGTACGCGTAGCTGTGAGTTGAAACTTGATATTATTGATAATATGGAAAAAGAGATGACCCAGCTACGTACGCGTAGCTGTGAGTTGAAACACTCGAAGCCTTAGAGCCGGATTTTGTGCCAATCCCAGCTACGTACGCGTAGCTGTGAGTTGAAACAAATGAAAGCGTATAAATCTCGCCACTTTGCAATCCAGCTACGTACGCGTAGCTGTGAGTTGAAACTGGCGTGGTAAGTACCGATAACATTCCCGAAGACCCAGCTACGTACGCGTAGCTGTGAGTTGAAACTATTACAACATTTGACAATCAAACATTATGGGAGCCAGCTACGTACGCGTAGCTGTGAGTTGAAACCAAAGGTGCTATTGTACTTAGTAGCAAGTCAATTCCCAGCTACGTACGCGTAGCTGTGAGTTGAAACTAAGCAATTATAAAGAAAGAATAAAAGACGTTGCCCAGCTACGTACGCGTAGCTGTGAGTTGAAACATCACCGTCCAACTTACCTTTGTTGAATGCTTGTGAGCCAGCTACGTACGCGTAGCTGTGAGTTGAAACTAATATGCCCTTGCATAAATTCCACTTTGCCGTGCCAGCTACGTACGCGTAGCTGTGAGTTGAAACCTGCCGAGTAATAAGCCAAGCAGAATTTCAAGCACCAGCTACGTACGCGTAGCTGTGAGTTGAAACTTGTGAACATCGAGCAAAATATCGGTCATTGCACCCAGCTACGTACGCGTAGCTGTGAGTTGAAACATCATAAGTGGCTTTACGAGTGGCTCTTGGCTCTTCCAGCTACGTACGCGTAGCTGTGAGTTGAAACCCTAGCCCCCAAGCACCTTTGGCAATAGTCTGTGCCAGCTACGTACGCGTAGCTGTGAGTTGAAACGAGATTTAGTAATTGCTCCATTGAAACGTGGAGACCAGCTACGTACGCGTAGCTGTGAGTTGAAACGTGCCGAGACTTGCCGGTGTATCGGTCACAGCTAAACCCAGCTACGTACGCGTAGCTGTGAGTTGAAACATAGCCGCCCAGCTCCTTAATCTTATCCAGCGACATCCAGCTACGTACGCGTAGCTGTGAGTTGAAACAAATTATGCCTTAATAATACTTTAAGTTTTTAAAACCAGCTACGTACGCGTAGCTGTGAGTTGAAACTCTTCTGAGCTTTCCAGGCATTAAAACCTTGCTCACCAGCTACGTACGCGTAGCTGTGAGTTGAAACTCAGGGTCGCATAACTCCACTTTGCAGCGATGTTCCAGCTACGTACGCGTAGCTGTGAGTTGAAACAGTGAATATACCGCTCAGCGTGCGTTAGAAAATCCCAGCTACGTACGCGTAGCTGTGAGTTGAAACGCCGTTTAGATAACCTTGTCAGGCTTGGCACGATTCCAGCTACGTACGCGTAGCTGTGAGTTGAAACACTATTTTTCACGTTATAGAGATCGTATTAGCAACCAGCTACGTACGCGTAGCTGTGAGTTGAAACATTACCCTGCCAAATTATCGGTTGGCTGAGTTCGCCAGCTACGTACGCGTAGCTGTGAGTTGAAACTCATCGGGCTTTAAGGTCAAGGCATTATCCACTAACCAGCTACGTACGCGTAGCTGGTGAGCTAAAA >NC_020515|4|3|1552928-1554825|PILER-CR CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC AGGTTAAATCTAGACGAGATAATGGAGATTGCCAA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TGTTAAAAATAAACCCTGCACGGGGCAGGGTCGG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ACCGAAACCGAAATCTGGGCAAGTGAGAGCGACA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ACATTAGGAACGCAAGACCAAGCCTCATTACAAGGC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TTGCAGAGGATAGAAATTTCCGCACCCATATTTTCG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TTGATATTATTGATAATATGGAAAAAGAGATGAC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ACTCGAAGCCTTAGAGCCGGATTTTGTGCCAATC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC AAATGAAAGCGTATAAATCTCGCCACTTTGCAAT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TGGCGTGGTAAGTACCGATAACATTCCCGAAGAC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TATTACAACATTTGACAATCAAACATTATGGGAG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC CAAAGGTGCTATTGTACTTAGTAGCAAGTCAATTC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TAAGCAATTATAAAGAAAGAATAAAAGACGTTGC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ATCACCGTCCAACTTACCTTTGTTGAATGCTTGTGAG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TAATATGCCCTTGCATAAATTCCACTTTGCCGTG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC CTGCCGAGTAATAAGCCAAGCAGAATTTCAAGCA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TTGTGAACATCGAGCAAAATATCGGTCATTGCAC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ATCATAAGTGGCTTTACGAGTGGCTCTTGGCTCTT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC CCTAGCCCCCAAGCACCTTTGGCAATAGTCTGTG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC GAGATTTAGTAATTGCTCCATTGAAACGTGGAGA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC GTGCCGAGACTTGCCGGTGTATCGGTCACAGCTAAAC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ATAGCCGCCCAGCTCCTTAATCTTATCCAGCGACAT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC AAATTATGCCTTAATAATACTTTAAGTTTTTAAAA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TCTTCTGAGCTTTCCAGGCATTAAAACCTTGCTCA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TCAGGGTCGCATAACTCCACTTTGCAGCGATGTT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC AGTGAATATACCGCTCAGCGTGCGTTAGAAAATC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC GCCGTTTAGATAACCTTGTCAGGCTTGGCACGATT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ACTATTTTTCACGTTATAGAGATCGTATTAGCAA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ATTACCCTGCCAAATTATCGGTTGGCTGAGTTCG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC >NC_020515|4|4|1552928-1554892|CRISPRCasFinder CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC AGGTTAAATCTAGACGAGATAATGGAGATTGCCAA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TGTTAAAAATAAACCCTGCACGGGGCAGGGTCGG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ACCGAAACCGAAATCTGGGCAAGTGAGAGCGACA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ACATTAGGAACGCAAGACCAAGCCTCATTACAAGGC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TTGCAGAGGATAGAAATTTCCGCACCCATATTTTCG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TTGATATTATTGATAATATGGAAAAAGAGATGAC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ACTCGAAGCCTTAGAGCCGGATTTTGTGCCAATC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC AAATGAAAGCGTATAAATCTCGCCACTTTGCAAT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TGGCGTGGTAAGTACCGATAACATTCCCGAAGAC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TATTACAACATTTGACAATCAAACATTATGGGAG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC CAAAGGTGCTATTGTACTTAGTAGCAAGTCAATTC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TAAGCAATTATAAAGAAAGAATAAAAGACGTTGC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ATCACCGTCCAACTTACCTTTGTTGAATGCTTGTGAG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TAATATGCCCTTGCATAAATTCCACTTTGCCGTG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC CTGCCGAGTAATAAGCCAAGCAGAATTTCAAGCA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TTGTGAACATCGAGCAAAATATCGGTCATTGCAC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ATCATAAGTGGCTTTACGAGTGGCTCTTGGCTCTT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC CCTAGCCCCCAAGCACCTTTGGCAATAGTCTGTG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC GAGATTTAGTAATTGCTCCATTGAAACGTGGAGA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC GTGCCGAGACTTGCCGGTGTATCGGTCACAGCTAAAC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ATAGCCGCCCAGCTCCTTAATCTTATCCAGCGACAT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC AAATTATGCCTTAATAATACTTTAAGTTTTTAAAA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TCTTCTGAGCTTTCCAGGCATTAAAACCTTGCTCA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TCAGGGTCGCATAACTCCACTTTGCAGCGATGTT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC AGTGAATATACCGCTCAGCGTGCGTTAGAAAATC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC GCCGTTTAGATAACCTTGTCAGGCTTGGCACGATT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ACTATTTTTCACGTTATAGAGATCGTATTAGCAA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ATTACCCTGCCAAATTATCGGTTGGCTGAGTTCG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TCATCGGGCTTTAAGGTCAAGGCATTATCCACTAA CCAGCTACGTACGCGTAGCTGGTGAGCTAAAA >NC_020515|4|3|1552928-1554892|CRT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC AGGTTAAATCTAGACGAGATAATGGAGATTGCCAA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TGTTAAAAATAAACCCTGCACGGGGCAGGGTCGG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ACCGAAACCGAAATCTGGGCAAGTGAGAGCGACA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ACATTAGGAACGCAAGACCAAGCCTCATTACAAGGC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TTGCAGAGGATAGAAATTTCCGCACCCATATTTTCG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TTGATATTATTGATAATATGGAAAAAGAGATGAC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ACTCGAAGCCTTAGAGCCGGATTTTGTGCCAATC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC AAATGAAAGCGTATAAATCTCGCCACTTTGCAAT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TGGCGTGGTAAGTACCGATAACATTCCCGAAGAC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TATTACAACATTTGACAATCAAACATTATGGGAG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC CAAAGGTGCTATTGTACTTAGTAGCAAGTCAATTC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TAAGCAATTATAAAGAAAGAATAAAAGACGTTGC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ATCACCGTCCAACTTACCTTTGTTGAATGCTTGTGAG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TAATATGCCCTTGCATAAATTCCACTTTGCCGTG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC CTGCCGAGTAATAAGCCAAGCAGAATTTCAAGCA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TTGTGAACATCGAGCAAAATATCGGTCATTGCAC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ATCATAAGTGGCTTTACGAGTGGCTCTTGGCTCTT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC CCTAGCCCCCAAGCACCTTTGGCAATAGTCTGTG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC GAGATTTAGTAATTGCTCCATTGAAACGTGGAGA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC GTGCCGAGACTTGCCGGTGTATCGGTCACAGCTAAAC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ATAGCCGCCCAGCTCCTTAATCTTATCCAGCGACAT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC AAATTATGCCTTAATAATACTTTAAGTTTTTAAAA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TCTTCTGAGCTTTCCAGGCATTAAAACCTTGCTCA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TCAGGGTCGCATAACTCCACTTTGCAGCGATGTT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC AGTGAATATACCGCTCAGCGTGCGTTAGAAAATC CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC GCCGTTTAGATAACCTTGTCAGGCTTGGCACGATT CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ACTATTTTTCACGTTATAGAGATCGTATTAGCAA CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC ATTACCCTGCCAAATTATCGGTTGGCTGAGTTCG CCAGCTACGTACGCGTAGCTGTGAGTTGAAAC TCATCGGGCTTTAAGGTCAAGGCATTATCCACTAA CCAGCTACGTACGCGTAGCTGGTGAGCTAAAA
>NC_020515.1|WP_025266960.1|1551140_1552658_+|ribosome-biogenesis-GTPase-Der MTPVVALVGRPNVGKSTLFNRLTRTRDALVADFPGLTRDRKYGQANIAGHDFIVIDTGGIDGTEEGVEEKMAEQSLLAIEEADVVLFLVDARAGLVPADIGIAQYLRQRDKTTVVVANKTDGIDADSHIAEFYQLGLGDVEPIAAAQGRGVTQLIEQVLAPLAEKIEEQAVENAENSANTTEEQDEWENNFDFENEEDTALLDEALEESEEESDKNIKIAIVGRPNVGKSTLTNRILGEDRVVVYDMPGTTRDSIYIPMERDGQQYTIIDTAGVRKRGKVHLAVEKFSVIKTLQAIQDANVVLLTIDARDGVSDQDLSLLGFILNAGKSLVIVVNKWDGLSQDIKDNVKSELDRRLDFIDFARVHFISALHGSGVGNLFDSIQEAYACATKKMTTAMLTRILQMATDEHQPPLVNGRRVKLKYAHPGGYNPPIIVIHGNQIERLPDSYKRYLSNYYRKSLKIIGSPIRVLFQEGNNPFAGKRNKLTPSQLRKRKRLMKFIKKNRK >NC_020515.1|WP_015432820.1|1549507_1550821_+|glutamyl-tRNA-reductase MTILALGINHKTASVSLREKVAFVESKRQLAFEQISQQNLAESAVILSTCNRTELYFHQADIPPQEDHPENIAWRERCFQWFAEIHQLDHNELRQCIYFKQNMDTARHLMEVACGLDSLILGEPQILGQVKQAYQDSEYFYHQQGKSISTNLSRLFQKTFSTAKRVRSETEIGASAVSVAYAACGLARQIFDDFAKLRFLLVGAGETIELVARYLIQHGAQNLMVANRTHIRAEMLAEKLETPMQILSLSALQVGLNQADVVISSTGSPDLLISKEMVETAQKQRRFDPMLLIDIAVPRDIDEKAGELDSVYAYSVDDLQHIIQQNLAQRQQAAEQAKEIVEQECKDFFAWLKQQQSSQLIKHYRQNAEEIRLDLLEKARNALEQGQDSEKILQELSYKLMNQLLHAPTSALQNLAKDGNVKGLQRFSQALKLDDIN >NC_020515.1|WP_015432819.1|1546998_1549209_-|GTP-diphosphokinase MVAIRHSHQLDPNNFELASWSAGLKMSPVTFDELQTAWRYAEEKLDTEQLHLMWVGLEMVEILHGLNMDDDSLVAAMLFPLVKHNIADLAQIKEQFGNGVKNLVKGVLEMENIRQLNANNASDLQIDNIRRMLLAMVDDFRCVVIKLAERIVYLRDTEHHSEEDLVLAAKECSHIYAPLANRLGIGQLKWELEDYSFRALHPQDYRQIAKFDLAERRLDREQFIADFVAHLTACIGEEIDNVQVYGRPKHIYSIWKKMQKKNLRFDQLFDIRAVRIIVQNLEECYTALSIVHSHYKHLPEHFDDYIADPKPNGYQSLHTVVLGKGDKPIEVQIRTQKMHDDAELGVAAHWKYKEGAGAGRSGYEEKIVWLRKLLAWQNDIADSGEMVDDLRSQVFDDRVYVFTPKGEVIDLPSNATPLDFAYSIHSEIGHRCIGAKVAGKIVPFTYILQMGDQVEIITQKNPNPSRDWLNPSQGFVNTPRARSKIIAWFKKLDREKNLPIGKEMLESEMVKHQFSLKQIEDYALPRYNLKQLDDLYAAIGGGDIKLNNLMNYLQGKLVKTSAEQADEAILKHMAHKAQHTQTKTGRAGAIIVDGVGNLMHHIARCCQPIPGDKIVGYITQGRGISIHRADCEQLFDLQSSSPERVVDAEWGGNFTSGFSLVIRVIANDRNGLLRDVSAIMANEKVNVIGVASRTDIKRSIATIDIEVELNNIELLDKLLKRIMQLDDVIEAKRLSN >NC_020515.1|WP_015432818.1|1546069_1546990_-|glycosyltransferase MTSPKISFIIPIYNTAIYLSECIESILTQRVELEIILVDDGSTDDSLTICLNYVKKYSFITLVHSQNKGQSAARNKAINLAQGKYIYFIDSDDYITGDHFPEIIRVADQYGVDMIRLQAEKVAQLTGKRLAIPTLKANNNVNQGYLLSGKETLSLMVQQTWIPAICWTLIRREFLLKHQLNFIEGIKAEDQLFYLQLLTIDPNATLIELPFWVYCYRIRPNSITTTINPAYFYDHFRMIELINQYFEQHNLLSDESIYHDGKHIVLNLCRTAFNMLNKFPPEVRHECENYLTQNWQNLTNIWNYFK >NC_020515.1|WP_015432817.1|1544796_1545987_-|1-deoxy-D-xylulose-5-phosphate-reductoisomerase MKKLVILGSTGSIGKSTLSVVKHNPEKYAVFALVGGKNVALMTEQAVQFRPEFVAMDDENAAKQLAQNLKQANVNCEVVAGQKAICELAAHPEVDQVMAAIVGAAGLLPTLSAVQAGKTVLLANKESLVTCGQLFIDEAKKSGAKLLPVDSEHNAIFQSLPPEAQEKVGFCPLAELGVSKIILTGSGGPFRIKPLAEFSAITPEQAVAHPNWSMGKKISVDSATMMNKGLEYIEARWLFNASADEMEIIIHPQSIIHSMVRYIDGSVIAQMGNPDMRTPIAHTMAYPDRIHAGVAPLDFFQLKELTFIEPDFVRYPNLKLAMDAFTEGQYATTAMNAANEVAVDAFLNGRIRFTDIVAVNRATVENITPIAVREIADVLHIDKLAREVAQQQIFQC >NC_020515.1|WP_015432816.1|1544111_1544723_+|hypothetical-protein MVLIFTGLYYFFGELISILFSDVGLSSYTKHTTIELLLPKDFVVNEKSGSLTPLFYTMQLAFQLQAWNFILGYIICVTYITGQKLRLLGVLFSLLFSIGSSIISASQGGHYSTFGYLQNLGFEVTFLIGNLAMVAIGFAIDNNHIKRFKYYSIIAGLIGLSCIISTVFITTAYTPWLERISIYSLMIWEISLGFAVLKAMESK >NC_020515.1|WP_015432815.1|1542381_1543791_+|amino-acid-permease MSNKKIGLISLTALVLSSMIGSGIFSLPQNMAEVAGAEALLIGWGITGVGIIFLGLSFFFISRLRPDLDGGIYTYAREGFGELVGFMSAWGYWLCATIGIVGYLVVAFEGIGTFTDSETNIIFGQGNTLAAFIGASIIVWLVHILVASGVKEAASVNLVATIVKVFPLVLFIGLAIWYFSPNTFTQDIQATSLNNGVSDQVKNTMLITLWVFTGVEGASVLSAHARKKSDVGLATVLGIIIALVLYVAITVLSLGILPRETIANMSNPSMAGLLEAMIGSSGKIIITLCLIVSVLASYVSWTMYSAEVPYRGAKNGAFPKILDKLNANDVPINSLWFTGFVVQFCLFLVLLTGKSYNALLLISTSMILVPYFLIGAYLLKLAIQQKAKWYIQLTGFIASLYGLWIVYAAGIDYLLLSVLLYVPGIGLFLYSRRQQQKAPLTTVEKVILTIIALLFIWAVYHSFTQVNWE >NC_020515.1|WP_015432814.1|1541454_1542369_+|homocysteine-S-methyltransferase-family-protein MTTHITILDGGMGRELARVGAPFKQPEWSALALYEAPPAVIRVHQDFIHAGAEVITTNSYAVVPFHIGEKRFQADGFTLAKLSGQLAKQAVDTSTNSATDSKKVKIAGSLPPLFGSYRFDLFQADQVERVARPLIDGLSAYADFWLFETQSHSQEVLSVIPFLPHDNRPIWVSFTLQDECLTDTPYLRSGERVVDAVKAVLEQGVQAVLFNCCQPEVIEQAIIAAKQVIGKKAVQLGAYANAFPPQSKEATANDGLDEIRTDLDPNAYLAWAQKWRNAGATIIGGCCGITPAHIQVLAQHLNSN >NC_020515.1|WP_025266962.1|1540020_1541175_-|methionine-adenosyltransferase MAINLFTSESVSEGHPDKIADQISDAVLDEILRQDPKARVACETYVKTGMALVGGEITTSAWVDVENLTRQVICDIGYTHSEMGFDGHSCAVLNAIGKQSSDINQGVDRENPLDQGAGDQGIMFGYATNETEVLMPAPITYAHRLMEQQAKVRKSGKLNWLRPDAKSQLTFAYENNKIVGIDAVVLSTQHAEDVDQKTVYEGVMEEIIKPILPSEWLSQNTKFFINPTGRFVIGGPMGDCGLTGRKIIVDTYGGAARHGGGAFSGKDPSKVDRSAAYAARYVAKNIVAAGLADRCEIQLSYAIGVAEPTSIMVETFGTGKVANDVLVKLIYQFFDLRPYGLIKMLDLIRPIYRETAAYGHFGREHFPWEKTDKAAELREAAGLK >NC_020515.1|WP_015432812.1|1538619_1539774_+|hypothetical-protein MKKLSLTLVSLLSTSLFAQIQLSPFPMQAIGKAAQLAVSDKDELFIINTQGELWQATPIMNKLSDGFSTQIAPSVAYNRVAGADKQGNFMLWTAKQLYTSTIPLAKQAGMYPLAFATIAVSKQGKQHKLVRIKTKGTQAEITAMASTEVLPDAQPMQIDFKHSAPNQGHIAILAKPDNSTYLHGVLGDAIEAAEVQYLERHTLEPLAEGLSMKGLVFEANRFEHFATNNGAKLVSVMSGNGEGGRTVLIGEQNGKLVLEQSSSSLPNNRWQSPFVFNRKLYAVQMPHLRGKLVEYTPQGAKLAEHSMQDGFSNHRYGEYETNLAASASHFAVLPLRDYRHIAILDSQGQLQTLAQTLPAEIQKTRASKDSVYLLLENGQIWLAQ >NC_020515.1|WP_025328951.1|1555072_1557535_+|CRISPR-associated-helicase/endonuclease-Cas3 MSKTEFIAHVRKSNKQLQSVSNHLLETASIARTLAAKLDLADAGELLGLMHDFGKYSKKFQTYIRHVTGILTYADLDSEDENNGGDHSTAGAQWIYGRLRKLGAAKNADGKIIGIGELVGQILGLCIASHHGAGLIDCLSPEGSEKPKWRERFDKDDKLTHLSECEKNADAVIINRAEELVGIDLVRLVDKPIRAILNQKEIPFKLKEFYLGCLTRFLFSCLIDADRINTSDFENERQKEIRNLTNTPNWQKAIDKLESHLSGFSIKYPIDGIRREISESCLERSTDQQGIYTLTVPTGGGKTLSSLRYALHHAKLHNLDRIIYIIPYTSIIDQNAQAVRKILGEEWVLEHHSNIEPDQQTWQNKLLSENWDKPIVFTTMVQFLDAWFGSGTRGVRHIHAMTKSVLIFDEIQTLPIKCVHLFCNVLNWLTHFGKSSAVLCTATQPLLNSLKNPHLGQIQLADNAELIGNQFKIRELFDKLSRVEVNYCPQTGGYSLENAGEFLLEQFGQYSSCLFIVNTKKWAQDLYRYCQNRNLPQEALFHLSTNQCSAHRKTIFDKIKARLNNKEPVICISTQLIEAGVDISMACVIRALGGLDSIAQAAGRCNRHGENKGKGQVYVLNLQEPNLESVLPDIYIGQQQSERVFNDFEGQDILQPNAMSQYFDYYFYNRSNEMGYSLPNNYSGTLLDWLSDNAQNTYVPKNNQRKTVFPLLMQSFKSAGKLFQTIDAPTQAVIVPYENAKELIATLCGTDDNEKKYKALSQAQRYSVNVFPNVWKKLQENEAIQETQLGSGIFYLKDRHYTEEYGLSIEETGNLTFYDL >NC_020515.1|WP_015432824.1|1557546_1558287_+|type-I-C-CRISPR-associated-protein-Cas5 MSNENTFRSRLFSFRVWGRQALFTDPITKIGGEKFTYPVPTYEALKGILRSIYWKPTLIWHISRIRVMKPIQTQAKSTKPLDWNGGNTLAIYTFLHNVEYQVEAYFTWNMHWEELAGDRNVGKHTAIIERMLERGGRQDIFLGTRDCQGYIAPCQFGEGEGFYDKVDEPIDFGLMFHSFGYPEETGNHELISRFWQANMQKGVIKFPAVSDGELKTRFIKKMKPFKPFKRGENVKAVEEEAKELEL >NC_020515.1|WP_025328950.1|1558283_1560275_+|type-I-C-CRISPR-associated-protein-Cas8c/Csd1 MSWMQKLYRTYEAALQKASNLSEEPLTPIGHTQQNAHIVIVLNGDGEFRTAQVMPPKTAIMLPATESSENRTSGEAPHPLADKIQYVARDYSAYGGEKKAYFQGYLNQLQAWCDSAASHPKVSAVLHYVKKGKVVEDLITAGVFQLGADGKVLSKWVEKGDAPAIFSTLPKTKGEIEFGSALVCWRVEIKGDPQSDTWTDVTVQQSWIDYLALADSQTGFCFIQGKESPVSNMHPAKLRHTGDKAKLISSNDTAGYTFRGRFETAEEAASISTEVSAKAHSALRWLISRQGIRNGEQVTVAWAISGEKVPSPLQDPFDECYDYDLEEISAVENNVESEMPSETRGKIDHSVDLGKNAAEMIKKKYQGYKAKLKAHEQISLLMLDSATPGRMALTYYQEFLPADYFANLDAWIDDFSWYQRHSIETKNGKKNDKRLVWAIVPPSPFAIGNAVYSKSLSDSLKKQLYARLLPVIAGGKSVPIPYDLVQQSFQVACNPHGCENWEWQRNIGVACALYKGWRARHHNESERRTYDMSLDKENRSRDYLYGRLLAVAENIEAYALYLAGEKRSTNAERYMPKFANRPFYTWRNIEIALKPYQERLRNHNKDTGSQALAEITDLFVTEDYTNDSPLSAEFLLGYHCQKMEIARQLAELTAKKSKTTETE >NC_020515.1|WP_015432826.1|1560304_1561147_+|type-I-C-CRISPR-associated-protein-Cas7/Csd2 MSLTKKIDFALIISVKNANPNGDPLNGNRPRTDFHGFGEITDVCLKRKIRDRLQDAGESIFVQSDEKKTDSMTSLANRAKDKDVGLGSDAFNAKKSSRDETAKKACKKWLDVRSFGQVFAFGKSDDGAGVSIAVRGPVTIHSAFSVAPVSVTSTQITKSVSGEGDGSKKSSDTMGMKHRVDGGVYVAYGAMSPQLAERTGFSDSDAEKIKSVLTKLFEGDASSARPEGSMQVVKLIWWEHNCKSGQYSSAKVHSSLKVNADGSYELNALDSLIPQEIDGF >NC_020515.1|WP_015432827.1|1561147_1561810_+|CRISPR-associated-protein-Cas4 MLSVLQKTEQNQSLVTEDKQLIVPLSALQHYAFCPRQCALIYNEQAWAENYLTAQGQALHERVDSGEPETRKGVRFERTVHVAAEKLGISGILDLVERDLKTGELKPVEYKRGKPKPEPMDEIQLCAQALCLEEMTGQTINEGALWYMQTRHRVPVVFSDGLRQATLDTIAQVRALLISGKTPLPEYGKHCKACSLVEICQPKLLEKDKSAGYVKGVFEE >NC_020515.1|WP_015432828.1|1562357_1563371_+|type-I-C-CRISPR-associated-endonuclease-Cas1 MRKLQNTLYITTQGSYLHKERETLVVEQDRKKVAQLPVHSIGHIFCFGNVLVSPFLMGFCGENNVNLAFFTENGRYLGRLQGRQNGNVLLRRAQYKKSETNPEPVARNIIAAKIQASKRVLQRRLRNHGECEPVEQAVTALNMSLKQLQKADNLDLIRGIEGDAASRYFGVFQHLLSEQCEFHFDGRNRRPPRDGVNALLSFLYSIVGKDISGALQGVGLDPQIGFLHADRPGRDSLAQDILEEFRAWWVDRMVLSLINRGQIKPNDFITESGGAVMLKPEVRKLLFQTLQAKKQEKIIHPFLGEEVEIGLLPYIQAMLLARYLRGDLAEYPPFLMR >NC_020515.1|WP_015432829.1|1563415_1563709_+|CRISPR-associated-endonuclease-Cas2 MMMLITYDISFDDAEGQKRLRRIAKHCLDYGIRAQYSVFECDVTPDQWVKLKQKLLDTYNPETDSLRFYHLGSKWRNKVEHHGTKKAVDIFKDTLIL >NC_020515.1|WP_015432832.1|1565309_1566737_+|bifunctional-D-glycero-beta-D-manno-heptose-7-phosphate-kinase/D-glycero-beta-D-manno-heptose-1-phosphate-adenylyltransferase-HldE MMMHYSSQFNHAKVLVLGDVMLDRYWFGSTNRISPEAPVPVVKVQENEERAGGAANVAMNIAALNVPVTLHGLVGQDDAGSALDKLLNSHQIQNHCVALDSHPTITKLRILSRHQQLLRLDFEEGFHNVDSSELLAKLSSEITAYGALILSDYGKGTLNDVQKMIQIARQANVPILIDPKGTDFERYRGATLLTPNMSEFEAVVGHCATEDDIVHKGLKMIADFELSALLVTRSEKGMTLLRPNFEPFHLPTQAREVYDVTGAGDTVISVLATAIADGRNLEEACYIANAAAGVVVGKLGTSTVSPSELEQAIHQRTETGFGVVSEAELKQIVQQSKARGEKIVMTNGCFDILHPGHVSYLENARKLGDRLIVAVNTDNSVKRLKGENRPINDLASRMAVLAGLASVDWVVPFDEDTPQRLIGEILPNLLVKGGDYKVEEIAGHQEVLANGGEVRVLNFENGCSTTNVIKKIQSL >NC_020515.1|WP_015432833.1|1566798_1567740_+|peptidylprolyl-isomerase MKFISLKSLFVATFALFAVSQIHAVEERVVASVDGHPIMQSQVLKTLGKRKNTEANRKAATDDLINDFLVQRAIQQSGIKVNTAYVDQVIENMVVQNGITYGQFLDYLDYNNISLNQYRQQIAHQILMDNVKQQAIGQSIRVEPQDVQSLATKMLEEAKTNGKLKTITALQHRVSHILIKTNPILNDAQAKAKLNSIVADIKAGKISFEDAARANSVDYASGAEGGDLGWNFLDAYDKTFAQTAQKSKLGVISAPFKSQFGWHVLKVTDTRQSDRTEDAYFQRAYEQLFDKQAQDASKDWVKALKNRAEIKYY >NC_020515.1|WP_015432834.1|1567839_1568706_+|16S-rRNA-(adenine(1518)-N(6)/adenine(1519)-N(6))--dimethyltransferase-RsmA MSSNSKKHLGHTARKRFGQNFLHDMNVIHNIVAAINPKKDQFLLEIGPGLGALTEPVAEQVEQLTVVELDRDLAERLRHHPFLHHKLTIIEQDALRFNFREYFESLNLPEGQGVRVFGNLPYNISTPLMFHLFKFHDLVQDMHFMLQKEVVKRLCAAPNSKAYGRLTIMAQYYCQVMPVLEVPPNAFKPAPKVDSAVVRLVPYKTLPYPVKDIYWLNRVTTQAFNQRRKTLRNALSTLFTPEQLEALNIDLNARAENLAIADYTRLANWLCDNPPAAGKIEIIENDVE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_020515_5 | 1563910-1565142 | TypeI |
NA
Consensus repeat of NC_020515_5
|
18 spacers
spacers of NC_020515_5
>5.1|1563942|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT AAAATTGCGTGGACGGACAATCCGGCGTGGATTTTG >5.2|1564010|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT TAAAATTATGTTACTACACTTGGTTGACGTAACAT >5.3|1564077|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT GCTTCTGAATATGATGCTCAAGAATGGCTAAATGC >5.4|1564144|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT ACAGTCATAACGTTGGTGCAGCACAAAGCCGTTAT >5.5|1564211|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT TTGCCAGTTTAAGTAATACCATTTCCACAGCAGCT >5.6|1564278|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT GTGTTTTACCAGGATTTCTTTTGCTAGTGTCTCAG >5.7|1564345|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT TTTCGCACGTTGTGGCGATAAAGCAGGAAATAAA >5.8|1564411|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT TAGTTCTTTACTTTAATGAACTAAAAGTACATAA >5.9|1564477|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT GTACGTAGCTGGATACAAAAGGCGCAACCGTTAGA >5.10|1564544|33|NC_020515|PILER-CR,CRISPRCasFinder,CRT TTTTATGTACACTAAACGTATAATGAAGTCAAT >5.11|1564609|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT TAAATAACGGAGCAAATCATCAAGACAACTATCAA >5.12|1564676|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT AAAATGACGTGGTCGGAGCAATGCCGAAAAGCCAA >5.13|1564743|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT ATCTATTGAAGGCTGATAATATGCTTGACTCTGA >5.14|1564809|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT AATTAAGTACCGTACGTTGATGACGGAACTTGAAG >5.15|1564876|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT GCTATGGGGGATTCGTCAATTTCAATTTTATGGTG >5.16|1564943|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT CATTGCAACCTGCCATTCACTACGAAATCTACAT >5.17|1565009|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT CGATAAGCCGGTCAAATTACCGAAATGCTCCAAT >5.18|1565075|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT CTTGCGAGTGTTGATACGGTCAATTAGCGAGAAGTT |
cas2,cas1,cas4,cas7,cas8c,cas5,cas3 |
CRISPR arrays and Neighbor proteins around NC_020515_5
The CRISPR arrays of NC_020515_5 >merge|NC_020515|5|1563910-1565142|PILER-CR,CRISPRCasFinder,CRT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAACAAAATTGCGTGGACGGACAATCCGGCGTGGATTTTGGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACTAAAATTATGTTACTACACTTGGTTGACGTAACATGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACGCTTCTGAATATGATGCTCAAGAATGGCTAAATGCGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACACAGTCATAACGTTGGTGCAGCACAAAGCCGTTATGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACTTGCCAGTTTAAGTAATACCATTTCCACAGCAGCTGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACGTGTTTTACCAGGATTTCTTTTGCTAGTGTCTCAGGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACTTTCGCACGTTGTGGCGATAAAGCAGGAAATAAAGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACTAGTTCTTTACTTTAATGAACTAAAAGTACATAAGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACGTACGTAGCTGGATACAAAAGGCGCAACCGTTAGAGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACTTTTATGTACACTAAACGTATAATGAAGTCAATGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACTAAATAACGGAGCAAATCATCAAGACAACTATCAAGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACAAAATGACGTGGTCGGAGCAATGCCGAAAAGCCAAGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACATCTATTGAAGGCTGATAATATGCTTGACTCTGAGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACAATTAAGTACCGTACGTTGATGACGGAACTTGAAGGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACGCTATGGGGGATTCGTCAATTTCAATTTTATGGTGGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACCATTGCAACCTGCCATTCACTACGAAATCTACATGCAGCCACCTTCGGGTGGCTGTGAGTTGAAACCGATAAGCCGGTCAAATTACCGAAATGCTCCAATGCAGCCACCTCCGGGTGGCTGTGAGTTGAAACCTTGCGAGTGTTGATACGGTCAATTAGCGAGAAGTTGCAGCCACCTGCGAGTGGCTGTGAGTTGAAAC >NC_020515|5|4|1563910-1565142|PILER-CR GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC AAAATTGCGTGGACGGACAATCCGGCGTGGATTTTG GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TAAAATTATGTTACTACACTTGGTTGACGTAACAT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC GCTTCTGAATATGATGCTCAAGAATGGCTAAATGC GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC ACAGTCATAACGTTGGTGCAGCACAAAGCCGTTAT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TTGCCAGTTTAAGTAATACCATTTCCACAGCAGCT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC GTGTTTTACCAGGATTTCTTTTGCTAGTGTCTCAG GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TTTCGCACGTTGTGGCGATAAAGCAGGAAATAAA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TAGTTCTTTACTTTAATGAACTAAAAGTACATAA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC GTACGTAGCTGGATACAAAAGGCGCAACCGTTAGA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TTTTATGTACACTAAACGTATAATGAAGTCAAT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TAAATAACGGAGCAAATCATCAAGACAACTATCAA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC AAAATGACGTGGTCGGAGCAATGCCGAAAAGCCAA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC ATCTATTGAAGGCTGATAATATGCTTGACTCTGA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC AATTAAGTACCGTACGTTGATGACGGAACTTGAAG GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC GCTATGGGGGATTCGTCAATTTCAATTTTATGGTG GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC CATTGCAACCTGCCATTCACTACGAAATCTACAT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC CGATAAGCCGGTCAAATTACCGAAATGCTCCAAT GCAGCCACCTCCGGGTGGCTGTGAGTTGAAAC CTTGCGAGTGTTGATACGGTCAATTAGCGAGAAGTT GCAGCCACCTGCGAGTGGCTGTGAGTTGAAAC >NC_020515|5|5|1563910-1565142|CRISPRCasFinder GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC AAAATTGCGTGGACGGACAATCCGGCGTGGATTTTG GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TAAAATTATGTTACTACACTTGGTTGACGTAACAT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC GCTTCTGAATATGATGCTCAAGAATGGCTAAATGC GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC ACAGTCATAACGTTGGTGCAGCACAAAGCCGTTAT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TTGCCAGTTTAAGTAATACCATTTCCACAGCAGCT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC GTGTTTTACCAGGATTTCTTTTGCTAGTGTCTCAG GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TTTCGCACGTTGTGGCGATAAAGCAGGAAATAAA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TAGTTCTTTACTTTAATGAACTAAAAGTACATAA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC GTACGTAGCTGGATACAAAAGGCGCAACCGTTAGA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TTTTATGTACACTAAACGTATAATGAAGTCAAT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TAAATAACGGAGCAAATCATCAAGACAACTATCAA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC AAAATGACGTGGTCGGAGCAATGCCGAAAAGCCAA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC ATCTATTGAAGGCTGATAATATGCTTGACTCTGA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC AATTAAGTACCGTACGTTGATGACGGAACTTGAAG GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC GCTATGGGGGATTCGTCAATTTCAATTTTATGGTG GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC CATTGCAACCTGCCATTCACTACGAAATCTACAT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC CGATAAGCCGGTCAAATTACCGAAATGCTCCAAT GCAGCCACCTCCGGGTGGCTGTGAGTTGAAAC CTTGCGAGTGTTGATACGGTCAATTAGCGAGAAGTT GCAGCCACCTGCGAGTGGCTGTGAGTTGAAAC >NC_020515|5|4|1563910-1565142|CRT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC AAAATTGCGTGGACGGACAATCCGGCGTGGATTTTG GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TAAAATTATGTTACTACACTTGGTTGACGTAACAT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC GCTTCTGAATATGATGCTCAAGAATGGCTAAATGC GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC ACAGTCATAACGTTGGTGCAGCACAAAGCCGTTAT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TTGCCAGTTTAAGTAATACCATTTCCACAGCAGCT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC GTGTTTTACCAGGATTTCTTTTGCTAGTGTCTCAG GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TTTCGCACGTTGTGGCGATAAAGCAGGAAATAAA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TAGTTCTTTACTTTAATGAACTAAAAGTACATAA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC GTACGTAGCTGGATACAAAAGGCGCAACCGTTAGA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TTTTATGTACACTAAACGTATAATGAAGTCAAT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC TAAATAACGGAGCAAATCATCAAGACAACTATCAA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC AAAATGACGTGGTCGGAGCAATGCCGAAAAGCCAA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC ATCTATTGAAGGCTGATAATATGCTTGACTCTGA GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC AATTAAGTACCGTACGTTGATGACGGAACTTGAAG GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC GCTATGGGGGATTCGTCAATTTCAATTTTATGGTG GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC CATTGCAACCTGCCATTCACTACGAAATCTACAT GCAGCCACCTTCGGGTGGCTGTGAGTTGAAAC CGATAAGCCGGTCAAATTACCGAAATGCTCCAAT GCAGCCACCTCCGGGTGGCTGTGAGTTGAAAC CTTGCGAGTGTTGATACGGTCAATTAGCGAGAAGTT GCAGCCACCTGCGAGTGGCTGTGAGTTGAAAC
>NC_020515.1|WP_015432829.1|1563415_1563709_+|CRISPR-associated-endonuclease-Cas2 MMMLITYDISFDDAEGQKRLRRIAKHCLDYGIRAQYSVFECDVTPDQWVKLKQKLLDTYNPETDSLRFYHLGSKWRNKVEHHGTKKAVDIFKDTLIL >NC_020515.1|WP_015432828.1|1562357_1563371_+|type-I-C-CRISPR-associated-endonuclease-Cas1 MRKLQNTLYITTQGSYLHKERETLVVEQDRKKVAQLPVHSIGHIFCFGNVLVSPFLMGFCGENNVNLAFFTENGRYLGRLQGRQNGNVLLRRAQYKKSETNPEPVARNIIAAKIQASKRVLQRRLRNHGECEPVEQAVTALNMSLKQLQKADNLDLIRGIEGDAASRYFGVFQHLLSEQCEFHFDGRNRRPPRDGVNALLSFLYSIVGKDISGALQGVGLDPQIGFLHADRPGRDSLAQDILEEFRAWWVDRMVLSLINRGQIKPNDFITESGGAVMLKPEVRKLLFQTLQAKKQEKIIHPFLGEEVEIGLLPYIQAMLLARYLRGDLAEYPPFLMR >NC_020515.1|WP_015432827.1|1561147_1561810_+|CRISPR-associated-protein-Cas4 MLSVLQKTEQNQSLVTEDKQLIVPLSALQHYAFCPRQCALIYNEQAWAENYLTAQGQALHERVDSGEPETRKGVRFERTVHVAAEKLGISGILDLVERDLKTGELKPVEYKRGKPKPEPMDEIQLCAQALCLEEMTGQTINEGALWYMQTRHRVPVVFSDGLRQATLDTIAQVRALLISGKTPLPEYGKHCKACSLVEICQPKLLEKDKSAGYVKGVFEE >NC_020515.1|WP_015432826.1|1560304_1561147_+|type-I-C-CRISPR-associated-protein-Cas7/Csd2 MSLTKKIDFALIISVKNANPNGDPLNGNRPRTDFHGFGEITDVCLKRKIRDRLQDAGESIFVQSDEKKTDSMTSLANRAKDKDVGLGSDAFNAKKSSRDETAKKACKKWLDVRSFGQVFAFGKSDDGAGVSIAVRGPVTIHSAFSVAPVSVTSTQITKSVSGEGDGSKKSSDTMGMKHRVDGGVYVAYGAMSPQLAERTGFSDSDAEKIKSVLTKLFEGDASSARPEGSMQVVKLIWWEHNCKSGQYSSAKVHSSLKVNADGSYELNALDSLIPQEIDGF >NC_020515.1|WP_025328950.1|1558283_1560275_+|type-I-C-CRISPR-associated-protein-Cas8c/Csd1 MSWMQKLYRTYEAALQKASNLSEEPLTPIGHTQQNAHIVIVLNGDGEFRTAQVMPPKTAIMLPATESSENRTSGEAPHPLADKIQYVARDYSAYGGEKKAYFQGYLNQLQAWCDSAASHPKVSAVLHYVKKGKVVEDLITAGVFQLGADGKVLSKWVEKGDAPAIFSTLPKTKGEIEFGSALVCWRVEIKGDPQSDTWTDVTVQQSWIDYLALADSQTGFCFIQGKESPVSNMHPAKLRHTGDKAKLISSNDTAGYTFRGRFETAEEAASISTEVSAKAHSALRWLISRQGIRNGEQVTVAWAISGEKVPSPLQDPFDECYDYDLEEISAVENNVESEMPSETRGKIDHSVDLGKNAAEMIKKKYQGYKAKLKAHEQISLLMLDSATPGRMALTYYQEFLPADYFANLDAWIDDFSWYQRHSIETKNGKKNDKRLVWAIVPPSPFAIGNAVYSKSLSDSLKKQLYARLLPVIAGGKSVPIPYDLVQQSFQVACNPHGCENWEWQRNIGVACALYKGWRARHHNESERRTYDMSLDKENRSRDYLYGRLLAVAENIEAYALYLAGEKRSTNAERYMPKFANRPFYTWRNIEIALKPYQERLRNHNKDTGSQALAEITDLFVTEDYTNDSPLSAEFLLGYHCQKMEIARQLAELTAKKSKTTETE >NC_020515.1|WP_015432824.1|1557546_1558287_+|type-I-C-CRISPR-associated-protein-Cas5 MSNENTFRSRLFSFRVWGRQALFTDPITKIGGEKFTYPVPTYEALKGILRSIYWKPTLIWHISRIRVMKPIQTQAKSTKPLDWNGGNTLAIYTFLHNVEYQVEAYFTWNMHWEELAGDRNVGKHTAIIERMLERGGRQDIFLGTRDCQGYIAPCQFGEGEGFYDKVDEPIDFGLMFHSFGYPEETGNHELISRFWQANMQKGVIKFPAVSDGELKTRFIKKMKPFKPFKRGENVKAVEEEAKELEL >NC_020515.1|WP_025328951.1|1555072_1557535_+|CRISPR-associated-helicase/endonuclease-Cas3 MSKTEFIAHVRKSNKQLQSVSNHLLETASIARTLAAKLDLADAGELLGLMHDFGKYSKKFQTYIRHVTGILTYADLDSEDENNGGDHSTAGAQWIYGRLRKLGAAKNADGKIIGIGELVGQILGLCIASHHGAGLIDCLSPEGSEKPKWRERFDKDDKLTHLSECEKNADAVIINRAEELVGIDLVRLVDKPIRAILNQKEIPFKLKEFYLGCLTRFLFSCLIDADRINTSDFENERQKEIRNLTNTPNWQKAIDKLESHLSGFSIKYPIDGIRREISESCLERSTDQQGIYTLTVPTGGGKTLSSLRYALHHAKLHNLDRIIYIIPYTSIIDQNAQAVRKILGEEWVLEHHSNIEPDQQTWQNKLLSENWDKPIVFTTMVQFLDAWFGSGTRGVRHIHAMTKSVLIFDEIQTLPIKCVHLFCNVLNWLTHFGKSSAVLCTATQPLLNSLKNPHLGQIQLADNAELIGNQFKIRELFDKLSRVEVNYCPQTGGYSLENAGEFLLEQFGQYSSCLFIVNTKKWAQDLYRYCQNRNLPQEALFHLSTNQCSAHRKTIFDKIKARLNNKEPVICISTQLIEAGVDISMACVIRALGGLDSIAQAAGRCNRHGENKGKGQVYVLNLQEPNLESVLPDIYIGQQQSERVFNDFEGQDILQPNAMSQYFDYYFYNRSNEMGYSLPNNYSGTLLDWLSDNAQNTYVPKNNQRKTVFPLLMQSFKSAGKLFQTIDAPTQAVIVPYENAKELIATLCGTDDNEKKYKALSQAQRYSVNVFPNVWKKLQENEAIQETQLGSGIFYLKDRHYTEEYGLSIEETGNLTFYDL >NC_020515.1|WP_025266960.1|1551140_1552658_+|ribosome-biogenesis-GTPase-Der MTPVVALVGRPNVGKSTLFNRLTRTRDALVADFPGLTRDRKYGQANIAGHDFIVIDTGGIDGTEEGVEEKMAEQSLLAIEEADVVLFLVDARAGLVPADIGIAQYLRQRDKTTVVVANKTDGIDADSHIAEFYQLGLGDVEPIAAAQGRGVTQLIEQVLAPLAEKIEEQAVENAENSANTTEEQDEWENNFDFENEEDTALLDEALEESEEESDKNIKIAIVGRPNVGKSTLTNRILGEDRVVVYDMPGTTRDSIYIPMERDGQQYTIIDTAGVRKRGKVHLAVEKFSVIKTLQAIQDANVVLLTIDARDGVSDQDLSLLGFILNAGKSLVIVVNKWDGLSQDIKDNVKSELDRRLDFIDFARVHFISALHGSGVGNLFDSIQEAYACATKKMTTAMLTRILQMATDEHQPPLVNGRRVKLKYAHPGGYNPPIIVIHGNQIERLPDSYKRYLSNYYRKSLKIIGSPIRVLFQEGNNPFAGKRNKLTPSQLRKRKRLMKFIKKNRK >NC_020515.1|WP_015432820.1|1549507_1550821_+|glutamyl-tRNA-reductase MTILALGINHKTASVSLREKVAFVESKRQLAFEQISQQNLAESAVILSTCNRTELYFHQADIPPQEDHPENIAWRERCFQWFAEIHQLDHNELRQCIYFKQNMDTARHLMEVACGLDSLILGEPQILGQVKQAYQDSEYFYHQQGKSISTNLSRLFQKTFSTAKRVRSETEIGASAVSVAYAACGLARQIFDDFAKLRFLLVGAGETIELVARYLIQHGAQNLMVANRTHIRAEMLAEKLETPMQILSLSALQVGLNQADVVISSTGSPDLLISKEMVETAQKQRRFDPMLLIDIAVPRDIDEKAGELDSVYAYSVDDLQHIIQQNLAQRQQAAEQAKEIVEQECKDFFAWLKQQQSSQLIKHYRQNAEEIRLDLLEKARNALEQGQDSEKILQELSYKLMNQLLHAPTSALQNLAKDGNVKGLQRFSQALKLDDIN >NC_020515.1|WP_015432819.1|1546998_1549209_-|GTP-diphosphokinase MVAIRHSHQLDPNNFELASWSAGLKMSPVTFDELQTAWRYAEEKLDTEQLHLMWVGLEMVEILHGLNMDDDSLVAAMLFPLVKHNIADLAQIKEQFGNGVKNLVKGVLEMENIRQLNANNASDLQIDNIRRMLLAMVDDFRCVVIKLAERIVYLRDTEHHSEEDLVLAAKECSHIYAPLANRLGIGQLKWELEDYSFRALHPQDYRQIAKFDLAERRLDREQFIADFVAHLTACIGEEIDNVQVYGRPKHIYSIWKKMQKKNLRFDQLFDIRAVRIIVQNLEECYTALSIVHSHYKHLPEHFDDYIADPKPNGYQSLHTVVLGKGDKPIEVQIRTQKMHDDAELGVAAHWKYKEGAGAGRSGYEEKIVWLRKLLAWQNDIADSGEMVDDLRSQVFDDRVYVFTPKGEVIDLPSNATPLDFAYSIHSEIGHRCIGAKVAGKIVPFTYILQMGDQVEIITQKNPNPSRDWLNPSQGFVNTPRARSKIIAWFKKLDREKNLPIGKEMLESEMVKHQFSLKQIEDYALPRYNLKQLDDLYAAIGGGDIKLNNLMNYLQGKLVKTSAEQADEAILKHMAHKAQHTQTKTGRAGAIIVDGVGNLMHHIARCCQPIPGDKIVGYITQGRGISIHRADCEQLFDLQSSSPERVVDAEWGGNFTSGFSLVIRVIANDRNGLLRDVSAIMANEKVNVIGVASRTDIKRSIATIDIEVELNNIELLDKLLKRIMQLDDVIEAKRLSN >NC_020515.1|WP_015432832.1|1565309_1566737_+|bifunctional-D-glycero-beta-D-manno-heptose-7-phosphate-kinase/D-glycero-beta-D-manno-heptose-1-phosphate-adenylyltransferase-HldE MMMHYSSQFNHAKVLVLGDVMLDRYWFGSTNRISPEAPVPVVKVQENEERAGGAANVAMNIAALNVPVTLHGLVGQDDAGSALDKLLNSHQIQNHCVALDSHPTITKLRILSRHQQLLRLDFEEGFHNVDSSELLAKLSSEITAYGALILSDYGKGTLNDVQKMIQIARQANVPILIDPKGTDFERYRGATLLTPNMSEFEAVVGHCATEDDIVHKGLKMIADFELSALLVTRSEKGMTLLRPNFEPFHLPTQAREVYDVTGAGDTVISVLATAIADGRNLEEACYIANAAAGVVVGKLGTSTVSPSELEQAIHQRTETGFGVVSEAELKQIVQQSKARGEKIVMTNGCFDILHPGHVSYLENARKLGDRLIVAVNTDNSVKRLKGENRPINDLASRMAVLAGLASVDWVVPFDEDTPQRLIGEILPNLLVKGGDYKVEEIAGHQEVLANGGEVRVLNFENGCSTTNVIKKIQSL >NC_020515.1|WP_015432833.1|1566798_1567740_+|peptidylprolyl-isomerase MKFISLKSLFVATFALFAVSQIHAVEERVVASVDGHPIMQSQVLKTLGKRKNTEANRKAATDDLINDFLVQRAIQQSGIKVNTAYVDQVIENMVVQNGITYGQFLDYLDYNNISLNQYRQQIAHQILMDNVKQQAIGQSIRVEPQDVQSLATKMLEEAKTNGKLKTITALQHRVSHILIKTNPILNDAQAKAKLNSIVADIKAGKISFEDAARANSVDYASGAEGGDLGWNFLDAYDKTFAQTAQKSKLGVISAPFKSQFGWHVLKVTDTRQSDRTEDAYFQRAYEQLFDKQAQDASKDWVKALKNRAEIKYY >NC_020515.1|WP_015432834.1|1567839_1568706_+|16S-rRNA-(adenine(1518)-N(6)/adenine(1519)-N(6))--dimethyltransferase-RsmA MSSNSKKHLGHTARKRFGQNFLHDMNVIHNIVAAINPKKDQFLLEIGPGLGALTEPVAEQVEQLTVVELDRDLAERLRHHPFLHHKLTIIEQDALRFNFREYFESLNLPEGQGVRVFGNLPYNISTPLMFHLFKFHDLVQDMHFMLQKEVVKRLCAAPNSKAYGRLTIMAQYYCQVMPVLEVPPNAFKPAPKVDSAVVRLVPYKTLPYPVKDIYWLNRVTTQAFNQRRKTLRNALSTLFTPEQLEALNIDLNARAENLAIADYTRLANWLCDNPPAAGKIEIIENDVE >NC_020515.1|WP_015432835.1|1568797_1570483_-|long-chain-fatty-acid--CoA-ligase-FadD MEKIWFENYPPNAERIIDVEPYESLVEMFEKAVQRHPDLAAYINMGQVLTYRKLEERSRAFAAYLQNELRLEKGDRIALMMPNLLQYPIALFGALRAGLVVVNVNPLYTPRELEHQLNDSGAKAIVVVSNFAATLEKIVFNTAVKHVILTRMGDQLSFGKRTLVNFVVKYVKKLVPKYKLPHAVSFREALSIGKQRQYVRPTIYQDDLAFLQYTGGTTGVAKGAMLSHRNMVANIMQAKWVAYPLTQARQNRLAVIALPLYHVFALSANCLLFIELGVTGLLITNPRDIPGFVKELKKYPVMAITGVNTLFNALLNNEHFSEADFSNLKLSIGGGAAIQRSVADRWHKATGCHIIEGYGMTECSPLISATRNDSIEYSGSIGVPVPNTDIRVVDDAGNDVPVGERGELWVKGPQVMRGYWQRPDETAEVLKDGWMATGDIVTFGEDLNLRIVDRKKDMIIVSGFNVYPNEIEDVVALHPKVNEVVAVGIPSEKSGESIKVYVTKKDESLTREELRNHCRQHLTGYKIPRDIEFRDDLPKSNVGKILRRVLRDEEIARMEKS >NC_020515.1|WP_015432836.1|1570492_1570918_-|SoxR-reducing-system-RseC-family-protein MMLEQALVLRYQNGIATIQAFAKSGCGGCAAEGCGTKSLSALVGEKRAPQFDIAVSQHLNSGDQIEIGITENHLLLSVFWLYAVPLFVLIASTLLFSMWFANELVIAGLILCSTLVAFISIKKIIKRQIINGLNPIFVRKL >NC_020515.1|WP_015432837.1|1570929_1571889_-|MucB/RseB-C-terminal-domain-containing-protein MIKKSSLLALLSVWCFSLVVRAETMATPLSYLVAMSQAQQQANYEQFYLFQEGRSPESWRYRHVHWDNQQYAQLLSLDGSREEFLQQDNLVGYFGDFQPFSLQTNKILDNRPMVLYGDFNRLEGYSFIDMGKDRIANRVARQIRIVPKDEFRYQYRLWIDEESKLLLKSELLDREHNVLELFRVINLRLDDQLLDMVDAIRPLILPPMIPSKAPMNSDNLSWQPKWLPRGFRLQSVAREQLPDGEEVDSQLYSDGLFSFTIYLSDSKELPLNEHTWQDGKTTVYTLSLAQKDLVLVGEIPLTTARHILQNIKIKQPLEK >NC_020515.1|WP_015432838.1|1572023_1572413_-|hypothetical-protein MKFPSKLALISSALLLSACALTPEQKAAQEAKRLRAEQALQVKLARQCDTEAAQLLHQQFNPPLSQTEQQKQEFEQRYAEKIGQPMFQACYKLALENYKAQEELEYMRQRYYWDDYPRWGWHRFCYSCW >NC_020515.1|WP_015432839.1|1572416_1575320_-|bifunctional-[glutamate--ammonia-ligase]-adenylyl-L-tyrosine-phosphorylase/[glutamate--ammonia-ligase]-adenylyltransferase MESLLFQSAEQKLQTLFSSQRIPDILQNSAQIAPLVKAIAMSDFVYTTLQNQPELLTKWLEMPPTEQHCEHYSTRLHQLLDSVETEEELHSTLRHFRHRELAALSYLQSNNPHLVQVVFEKLSELAEALIINARDWLFTRLCQDYGTPMNEQGEVQELIIIGMGKLGGRELNFSSDIDLIFAYPDMGETTGGRKPMENSKFFTRLGQRLIQALDQITEDGFVYRTDMRLRPFGESGALVLSFTAMEDYYQEQGRDWERYAMIKGRILGENLQNHNHRYLSQMLRPFVYRRYLDFSAIQSLREMKLKISREVARRGLTENIKLGAGGIREIEFIVQAFQMMRGGRDKILQQRSLLKVLPHLAELKLLSNEQVAQLQQAYLFLRLVENSLQAIEDKQTQTLPHDEKEREILIYLTKQYLASTAKENVHSWQDFLAVLAQHQKNVRAIFNELVGEEDESEKSDEKQTYAAWRDILHYQITLEELIVNLRAYTVQEKDYAEIFQHLSTILQEWVKRPIGVRGRDVLRQLMPRVVDQIFSQQDYLVLLPRILKIIDQIVTRTTYLELLLEKEQILPQLLSLCGKSVMIAEQIARFPILLDELIVQKSLTRVIGLDEYPAALQEYLMRIPEEDEEALMDSLRQFKQSQILRIAAADILGVLPVMKISDHLTYLAEAIIAVVVKLAWQSVARRFGVPGHLQDAAQDFVVVGYGKLGGIELGYNSDLDLVFLHNAPENSETQGGKKSISAHQFYLKLAQKINGIFNLNTAAGVLYEVDMRLRPSGEAGLLVSTFEAYDFYQKNEAWTWESQALVRARAVYGSPELRQKFARIRQETLCQKRASGQLSEEICKMRSKMHMHFAKNQSDVFHLKNDRGGITDIEFIAQFLVLNYAAYYPEMAVWSDNVRIFDSAIACGILSAEQGELLKQCYICLRNRVHQLNLLNQESYVVKTEFATEREIVCQIWNNLFSSLKQE >NC_020515.1|WP_025289530.1|1575366_1575663_-|YciI-family-protein MYYVIFAQDLPNSLEKRLSVRDKHLARLQALQAEERLLTAGPNPAVDSSTPGEAGFSGSTVIAKFPSLEAAKEWASQDPYVEAGVYGDVIVKPFIKVF >NC_020515.1|WP_015432841.1|1575683_1576130_-|acyl-CoA-thioester-hydrolase-YciA MTESNERPHGSLVLRTLAMPADTNANGDIFGGWLMSQMDLGGAILAKEIAKGRVVTVCVDKMVFLTPVSIGDVVCCYGSCTRVGRSSMEVKVEIWIKKVYDGTGRRTKVTEAHFTYVAVGEDKKPRPIPRENNPELDQALALIERHSN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NC_020515_1 | 1.4|843718|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843718-843747 | 30 | NC_020515.1 | 1395473-1395502 | 0 | 1.0 |
NC_020515_1 | 1.16|844510|30|NC_020515|CRISPRCasFinder,CRT | 844510-844539 | 30 | NC_020515.1 | 2057273-2057302 | 0 | 1.0 |
NC_020515_4 | 4.15|1553893|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553893-1553926 | 34 | NC_020515.1 | 1146994-1147027 | 0 | 1.0 |
1. spacer 1.4|843718|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to position: 1395473-1395502, mismatch: 0, identity: 1.0
gggcagttgcaagacatgtatcaaaatctt CRISPR spacer gggcagttgcaagacatgtatcaaaatctt Protospacer ******************************
2. spacer 1.16|844510|30|NC_020515|CRISPRCasFinder,CRT matches to position: 2057273-2057302, mismatch: 0, identity: 1.0
ccggctcggtgatttgagcaatgaggtaat CRISPR spacer ccggctcggtgatttgagcaatgaggtaat Protospacer ******************************
3. spacer 4.15|1553893|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to position: 1146994-1147027, mismatch: 0, identity: 1.0
ctgccgagtaataagccaagcagaatttcaagca CRISPR spacer ctgccgagtaataagccaagcagaatttcaagca Protospacer **********************************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_020515_4 | 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553827-1553860 | 34 | NC_008201 | Mannheimia phage phiMHaA1, complete genome | 25331-25364 | 1 | 0.971 |
NC_020515_4 | 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553827-1553860 | 34 | DQ426905 | Bacteriophage phi-MhaA1-BAA410, complete genome | 25406-25439 | 1 | 0.971 |
NC_020515_4 | 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553827-1553860 | 34 | KP137432 | Mannheimia phage vB_MhM_535AP1, complete genome | 25371-25404 | 1 | 0.971 |
NC_020515_4 | 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553827-1553860 | 34 | JN255163 | Mannheimia phage vB_MhM_1152AP, complete genome | 25525-25558 | 1 | 0.971 |
NC_020515_4 | 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553827-1553860 | 34 | KP137438 | Mannheimia phage vB_MhM_2256AP1, complete genome | 25732-25765 | 1 | 0.971 |
NC_020515_4 | 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553827-1553860 | 34 | NC_047750 | Mannheimia phage vB_MhM_1127AP1, complete genome | 26560-26593 | 1 | 0.971 |
NC_020515_4 | 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553827-1553860 | 34 | NC_028898 | Mannheimia phage vB_MhM_587AP1, complete genome | 26568-26601 | 1 | 0.971 |
NC_020515_4 | 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553827-1553860 | 34 | DQ426904 | Bacteriophage phi-MhaA1-PHL101, complete genome | 25331-25364 | 1 | 0.971 |
NC_020515_4 | 4.21|1554293|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1554293-1554328 | 36 | NC_028743 | Mannheimia phage vB_MhS_587AP2, complete genome | 42889-42924 | 1 | 0.972 |
NC_020515_1 | 1.7|843916|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843916-843945 | 30 | NC_007206 | Haemophilus influenzae biotype aegyptius plasmid pF1947, complete sequence | 27680-27709 | 2 | 0.933 |
NC_020515_1 | 1.11|844180|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 844180-844209 | 30 | NC_007206 | Haemophilus influenzae biotype aegyptius plasmid pF1947, complete sequence | 27680-27709 | 2 | 0.933 |
NC_020515_2 | 2.1|950142|34|NC_020515|CRT | 950142-950175 | 34 | NZ_CP054197 | Glaesserella parasuis strain YHP170504 plasmid unnamed1, complete sequence | 21958-21991 | 2 | 0.941 |
NC_020515_1 | 1.9|844048|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 844048-844077 | 30 | NZ_CP054197 | Glaesserella parasuis strain YHP170504 plasmid unnamed1, complete sequence | 36646-36675 | 3 | 0.9 |
NC_020515_1 | 1.13|844312|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 844312-844341 | 30 | NZ_CP054197 | Glaesserella parasuis strain YHP170504 plasmid unnamed1, complete sequence | 36646-36675 | 3 | 0.9 |
NC_020515_5 | 5.14|1564809|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1564809-1564843 | 35 | NC_028766 | Mannheimia phage vB_MhM_3927AP2, complete genome | 11699-11733 | 3 | 0.914 |
NC_020515_2 | 2.1|950142|34|NC_020515|CRT | 950142-950175 | 34 | NC_021724 | Aggregatibacter actinomycetemcomitans plasmid pS23A, complete sequence | 9066-9099 | 4 | 0.882 |
NC_020515_2 | 2.1|950142|34|NC_020515|CRT | 950142-950175 | 34 | NC_021724 | Aggregatibacter actinomycetemcomitans plasmid pS23A, complete sequence | 5656-5689 | 4 | 0.882 |
NC_020515_2 | 2.1|950142|34|NC_020515|CRT | 950142-950175 | 34 | NC_002579 | Aggregatibacter actinomycetemcomitans plasmid pVT745, complete sequence | 6923-6956 | 4 | 0.882 |
NC_020515_2 | 2.1|950142|34|NC_020515|CRT | 950142-950175 | 34 | GQ866235 | Aggregatibacter actinomycetemcomitans strain D11S-1 plasmid S57, complete sequence | 8534-8567 | 4 | 0.882 |
NC_020515_2 | 2.1|950142|34|NC_020515|CRT | 950142-950175 | 34 | GQ866235 | Aggregatibacter actinomycetemcomitans strain D11S-1 plasmid S57, complete sequence | 4686-4719 | 4 | 0.882 |
NC_020515_2 | 2.4|950341|35|NC_020515|CRT,PILER-CR,CRISPRCasFinder | 950341-950375 | 35 | NZ_CP045829 | Escherichia coli strain AUSMDU00014361 plasmid pAUSMDU00014361_02, complete sequence | 75334-75368 | 5 | 0.857 |
NC_020515_1 | 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843784-843813 | 30 | MN694671 | Marine virus AFVG_250M145, complete genome | 25900-25929 | 6 | 0.8 |
NC_020515_1 | 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843784-843813 | 30 | MN694753 | Marine virus AFVG_250M144, complete genome | 28198-28227 | 6 | 0.8 |
NC_020515_1 | 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843784-843813 | 30 | MN694231 | Marine virus AFVG_250M143, complete genome | 18426-18455 | 6 | 0.8 |
NC_020515_1 | 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843784-843813 | 30 | MN693981 | Marine virus AFVG_250M146, complete genome | 16038-16067 | 6 | 0.8 |
NC_020515_2 | 2.1|950142|34|NC_020515|CRT | 950142-950175 | 34 | NZ_KX753679 | Pasteurella multocida strain RCAD0259 plasmid pRCADGH-2, complete sequence | 24884-24917 | 6 | 0.824 |
NC_020515_2 | 2.4|950341|35|NC_020515|CRT,PILER-CR,CRISPRCasFinder | 950341-950375 | 35 | MK422450 | Klebsiella phage ST13-OXA48phi12.4, complete genome | 33739-33773 | 6 | 0.829 |
NC_020515_5 | 5.1|1563942|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1563942-1563977 | 36 | KM389287 | UNVERIFIED: Escherichia phage Phi06_2987 S clone contig00001 genomic sequence | 12565-12600 | 6 | 0.833 |
NC_020515_5 | 5.1|1563942|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1563942-1563977 | 36 | NZ_AP019820 | Enterobacter hormaechei subsp. hoffmannii strain OIPH-N069 plasmid pN069_3, complete sequence | 3539-3574 | 6 | 0.833 |
NC_020515_1 | 1.1|843520|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843520-843549 | 30 | NZ_KX853854 | Enterococcus faecium strain A120 plasmid pEMA120, complete sequence | 26737-26766 | 7 | 0.767 |
NC_020515_1 | 1.1|843520|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843520-843549 | 30 | NC_016967 | Enterococcus faecium plasmid pZB18, complete sequence | 42283-42312 | 7 | 0.767 |
NC_020515_1 | 1.1|843520|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843520-843549 | 30 | NZ_KX976485 | Enterococcus avium strain 19081 plasmid pEA19081, complete sequence | 49676-49705 | 7 | 0.767 |
NC_020515_1 | 1.1|843520|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843520-843549 | 30 | MG592433 | Vibrio phage 1.052.A._10N.286.46.C3, partial genome | 40892-40921 | 7 | 0.767 |
NC_020515_1 | 1.3|843652|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843652-843681 | 30 | NZ_CP026267 | Aminobacter sp. MSH1 plasmid pBAM2, complete sequence | 3769-3798 | 7 | 0.767 |
NC_020515_1 | 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843784-843813 | 30 | NZ_CP025078 | Enterococcus faecium strain LS170308 plasmid unnamed, complete sequence | 12559-12588 | 7 | 0.767 |
NC_020515_1 | 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843784-843813 | 30 | NZ_CP015437 | Anoxybacillus sp. B7M1 plasmid unnamed, complete sequence | 43728-43757 | 7 | 0.767 |
NC_020515_1 | 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843784-843813 | 30 | MG969411 | UNVERIFIED: Salmonella phage GE_vB_MG, complete genome | 53917-53946 | 7 | 0.767 |
NC_020515_2 | 2.1|950142|34|NC_020515|CRT | 950142-950175 | 34 | NZ_KX753679 | Pasteurella multocida strain RCAD0259 plasmid pRCADGH-2, complete sequence | 26889-26922 | 7 | 0.794 |
NC_020515_2 | 2.4|950341|35|NC_020515|CRT,PILER-CR,CRISPRCasFinder | 950341-950375 | 35 | CP051275 | Salmonella phage SW-37, complete genome | 23834-23868 | 7 | 0.8 |
NC_020515_4 | 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553295-1553328 | 34 | NZ_CP020439 | Streptococcus equinus strain FDAARGOS_251 plasmid unamed1 sequence | 177770-177803 | 7 | 0.794 |
NC_020515_4 | 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553295-1553328 | 34 | NZ_CP018188 | Streptococcus salivarius strain ICDC2 plasmid, complete sequence | 60391-60424 | 7 | 0.794 |
NC_020515_5 | 5.1|1563942|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1563942-1563977 | 36 | FJ982340 | Burkholderia phage KS9, complete genome | 29821-29856 | 7 | 0.806 |
NC_020515_1 | 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843784-843813 | 30 | NZ_CP045339 | Vibrio sp. THAF190c plasmid pTHAF190c_a, complete sequence | 415673-415702 | 8 | 0.733 |
NC_020515_1 | 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843784-843813 | 30 | MK448712 | Streptococcus phage Javan237, complete genome | 26250-26279 | 8 | 0.733 |
NC_020515_1 | 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843784-843813 | 30 | MK448880 | Streptococcus phage Javan238, complete genome | 26250-26279 | 8 | 0.733 |
NC_020515_1 | 1.14|844378|30|NC_020515|CRISPRCasFinder,CRT | 844378-844407 | 30 | MT234670 | Pseudanabaena phage PA-SR01, complete genome | 106347-106376 | 8 | 0.733 |
NC_020515_1 | 1.15|844444|30|NC_020515|CRISPRCasFinder,CRT | 844444-844473 | 30 | NZ_CP022536 | Spiroplasma corruscae strain EC-1 plasmid unnamed, complete sequence | 16419-16448 | 8 | 0.733 |
NC_020515_4 | 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553295-1553328 | 34 | NC_048107 | Staphylococcus phage Pabna, complete genome | 13011-13044 | 8 | 0.765 |
NC_020515_4 | 4.8|1553427|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553427-1553460 | 34 | NC_041925 | Proteus phage VB_PmiS-Isfahan, complete genome | 29323-29356 | 8 | 0.765 |
NC_020515_4 | 4.8|1553427|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553427-1553460 | 34 | MN840487 | Proteus phage 2207-N35, complete genome | 17007-17040 | 8 | 0.765 |
NC_020515_4 | 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553692-1553725 | 34 | MN694439 | Marine virus AFVG_250M296, complete genome | 4136-4169 | 8 | 0.765 |
NC_020515_4 | 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553692-1553725 | 34 | MN694282 | Marine virus AFVG_250M297, complete genome | 4153-4186 | 8 | 0.765 |
NC_020515_4 | 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553692-1553725 | 34 | GQ443085 | Clostridium phage CP26F, complete genome | 37097-37130 | 8 | 0.765 |
NC_020515_4 | 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553692-1553725 | 34 | JF767210 | Clostridium phage phi9O, complete genome | 37503-37536 | 8 | 0.765 |
NC_020515_4 | 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553692-1553725 | 34 | NC_011318 | Clostridium phage 39-O, complete genome | 36601-36634 | 8 | 0.765 |
NC_020515_4 | 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553692-1553725 | 34 | NC_019496 | Clostridium phage phiCP26F, complete genome | 37097-37130 | 8 | 0.765 |
NC_020515_4 | 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553692-1553725 | 34 | NC_015917 | Borreliella bissettii DN127 plasmid lp28-4, complete sequence | 12351-12384 | 8 | 0.765 |
NC_020515_4 | 4.20|1554224|37|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1554224-1554260 | 37 | NZ_CP045223 | Achromobacter xylosoxidans strain DN002 plasmid unnamed | 120350-120386 | 8 | 0.784 |
NC_020515_5 | 5.1|1563942|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1563942-1563977 | 36 | KT367887 | Klebsiella phage vB_Kp3, complete genome | 30636-30671 | 8 | 0.778 |
NC_020515_5 | 5.1|1563942|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1563942-1563977 | 36 | MT075871 | Klebsiella phage vB_KleS-HSE3, complete genome | 29442-29477 | 8 | 0.778 |
NC_020515_1 | 1.4|843718|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843718-843747 | 30 | MK250029 | Prevotella phage Lak-C1, complete genome | 260871-260900 | 9 | 0.7 |
NC_020515_1 | 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843784-843813 | 30 | MN693046 | Marine virus AFVG_25M413, complete genome | 14633-14662 | 9 | 0.7 |
NC_020515_1 | 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843784-843813 | 30 | MN693008 | Marine virus AFVG_117M9, complete genome | 14614-14643 | 9 | 0.7 |
NC_020515_1 | 1.8|843982|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 843982-844011 | 30 | AJ783769 | Sulfolobus tengchongensis spindle-shaped virus STSV1 complete genome | 21133-21162 | 9 | 0.7 |
NC_020515_1 | 1.12|844246|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 844246-844275 | 30 | AJ783769 | Sulfolobus tengchongensis spindle-shaped virus STSV1 complete genome | 21133-21162 | 9 | 0.7 |
NC_020515_1 | 1.14|844378|30|NC_020515|CRISPRCasFinder,CRT | 844378-844407 | 30 | NZ_CP032532 | Bacillus megaterium NCT-2 plasmid pNCT2_4, complete sequence | 96830-96859 | 9 | 0.7 |
NC_020515_2 | 2.4|950341|35|NC_020515|CRT,PILER-CR,CRISPRCasFinder | 950341-950375 | 35 | MT028491 | Ochrobactrum phage vB_OspM_OC, complete genome | 25031-25065 | 9 | 0.743 |
NC_020515_4 | 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553295-1553328 | 34 | NZ_CP010312 | Geoalkalibacter subterraneus strain Red1 plasmid pGSUB1, complete sequence | 161523-161556 | 9 | 0.735 |
NC_020515_4 | 4.10|1553559|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553559-1553592 | 34 | MN694724 | Marine virus AFVG_250M441, complete genome | 4883-4916 | 9 | 0.735 |
NC_020515_4 | 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553692-1553725 | 34 | NC_011251 | Borrelia duttonii Ly plasmid pl41, complete sequence | 348-381 | 9 | 0.735 |
NC_020515_5 | 5.15|1564876|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1564876-1564910 | 35 | NZ_CP013487 | Vibrio alginolyticus strain ATCC 33787 plasmid pMBL287, complete sequence | 109315-109349 | 9 | 0.743 |
NC_020515_4 | 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553027-1553060 | 34 | NC_041977 | Citrobacter phage Mordin, complete genome | 10862-10895 | 10 | 0.706 |
NC_020515_4 | 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553027-1553060 | 34 | NC_028247 | Citrobacter phage Michonne, complete genome | 81118-81151 | 10 | 0.706 |
NC_020515_4 | 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553027-1553060 | 34 | MF158044 | Shigella phage Sf18, complete genome | 49096-49129 | 10 | 0.706 |
NC_020515_4 | 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553027-1553060 | 34 | MF158040 | Shigella phage Sf13, complete genome | 33142-33175 | 10 | 0.706 |
NC_020515_4 | 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553027-1553060 | 34 | KM236239 | Citrobacter phage Moogle, complete genome | 80508-80541 | 10 | 0.706 |
NC_020515_4 | 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553027-1553060 | 34 | MF158041 | Shigella phage Sf15, complete genome | 1130-1163 | 10 | 0.706 |
NC_020515_4 | 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553027-1553060 | 34 | MH920362 | Citrobacter phage Maleficent, complete genome | 81154-81187 | 10 | 0.706 |
NC_020515_4 | 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553027-1553060 | 34 | KY654690 | Citrobacter phage Mijalis, complete genome | 80507-80540 | 10 | 0.706 |
NC_020515_4 | 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553027-1553060 | 34 | MF327003 | Shigella phage Sf14, complete genome | 83779-83812 | 10 | 0.706 |
NC_020515_4 | 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553027-1553060 | 34 | MF327005 | Shigella phage Sf19, complete genome | 86019-86052 | 10 | 0.706 |
NC_020515_4 | 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553295-1553328 | 34 | MN694255 | Marine virus AFVG_250M362, complete genome | 32274-32307 | 10 | 0.706 |
NC_020515_4 | 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553295-1553328 | 34 | NC_013544 | Lactobacillus paracasei subsp. paracasei plasmid pCD02, complete sequence | 4533-4566 | 10 | 0.706 |
NC_020515_4 | 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553295-1553328 | 34 | KC013023 | Leuconostoc phage phiLN04, complete genome | 22507-22540 | 10 | 0.706 |
NC_020515_4 | 4.8|1553427|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553427-1553460 | 34 | MH586730 | Salmonella phage Solent, complete genome | 33662-33695 | 10 | 0.706 |
NC_020515_4 | 4.8|1553427|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553427-1553460 | 34 | NC_047786 | Salmonella phage vB_SenS_Sasha, complete genome | 30544-30577 | 10 | 0.706 |
NC_020515_4 | 4.8|1553427|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553427-1553460 | 34 | KY002061 | Salmonella phage vB_SenS_Sergei, complete genome | 33476-33509 | 10 | 0.706 |
NC_020515_4 | 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553692-1553725 | 34 | NZ_CP024238 | Escherichia coli O15:H11 strain 90-9272 plasmid unnamed | 166303-166336 | 10 | 0.706 |
NC_020515_4 | 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553692-1553725 | 34 | NC_013507 | Escherichia coli ETEC H10407 plasmid pEntH10407, complete sequence | 38240-38273 | 10 | 0.706 |
NC_020515_4 | 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553692-1553725 | 34 | NZ_CP024249 | Escherichia coli O182:H21 strain D181 plasmid unnamed1, complete sequence | 6097-6130 | 10 | 0.706 |
NC_020515_4 | 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553692-1553725 | 34 | NC_017722 | Escherichia coli ETEC H10407 plasmid p666, complete sequence | 65168-65201 | 10 | 0.706 |
NC_020515_4 | 4.22|1554361|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1554361-1554395 | 35 | KT588073 | Acinetobacter phage Ab105-3phi, partial genome | 57805-57839 | 10 | 0.714 |
NC_020515_4 | 4.23|1554428|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1554428-1554462 | 35 | NZ_CP017773 | Paenibacillus crassostreae strain LPB0068 plasmid pPC03, complete sequence | 23790-23824 | 10 | 0.714 |
NC_020515_4 | 4.8|1553427|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1553427-1553460 | 34 | NZ_CP033124 | Acinetobacter wuhouensis strain WCHAW010062 plasmid p4_010062, complete sequence | 9550-9583 | 11 | 0.676 |
NC_020515_5 | 5.6|1564278|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1564278-1564312 | 35 | NC_048765 | Vibrio phage VAP7, complete genome | 99485-99519 | 11 | 0.686 |
NC_020515_5 | 5.14|1564809|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT | 1564809-1564843 | 35 | NZ_CP044084 | Pseudomonas luteola strain FDAARGOS_637 plasmid unnamed1, complete sequence | 520031-520065 | 12 | 0.657 |
1. spacer 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_008201 (Mannheimia phage phiMHaA1, complete genome) position: , mismatch: 1, identity: 0.971
taatatgcccttgcataaattccactttgccgtg CRISPR spacer taatatgcccttgcataaattccactttaccgtg Protospacer ****************************.*****
2. spacer 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to DQ426905 (Bacteriophage phi-MhaA1-BAA410, complete genome) position: , mismatch: 1, identity: 0.971
taatatgcccttgcataaattccactttgccgtg CRISPR spacer taatatgcccttgcataaattccactttaccgtg Protospacer ****************************.*****
3. spacer 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to KP137432 (Mannheimia phage vB_MhM_535AP1, complete genome) position: , mismatch: 1, identity: 0.971
taatatgcccttgcataaattccactttgccgtg CRISPR spacer taatatgcccttgcataaattccactttaccgtg Protospacer ****************************.*****
4. spacer 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to JN255163 (Mannheimia phage vB_MhM_1152AP, complete genome) position: , mismatch: 1, identity: 0.971
taatatgcccttgcataaattccactttgccgtg CRISPR spacer taatatgcccttgcataaattccactttaccgtg Protospacer ****************************.*****
5. spacer 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to KP137438 (Mannheimia phage vB_MhM_2256AP1, complete genome) position: , mismatch: 1, identity: 0.971
taatatgcccttgcataaattccactttgccgtg CRISPR spacer taatatgcccttgcataaattccactttaccgtg Protospacer ****************************.*****
6. spacer 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_047750 (Mannheimia phage vB_MhM_1127AP1, complete genome) position: , mismatch: 1, identity: 0.971
taatatgcccttgcataaattccactttgccgtg CRISPR spacer taatatgcccttgcataaattccactttaccgtg Protospacer ****************************.*****
7. spacer 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_028898 (Mannheimia phage vB_MhM_587AP1, complete genome) position: , mismatch: 1, identity: 0.971
taatatgcccttgcataaattccactttgccgtg CRISPR spacer taatatgcccttgcataaattccactttaccgtg Protospacer ****************************.*****
8. spacer 4.14|1553827|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to DQ426904 (Bacteriophage phi-MhaA1-PHL101, complete genome) position: , mismatch: 1, identity: 0.971
taatatgcccttgcataaattccactttgccgtg CRISPR spacer taatatgcccttgcataaattccactttaccgtg Protospacer ****************************.*****
9. spacer 4.21|1554293|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_028743 (Mannheimia phage vB_MhS_587AP2, complete genome) position: , mismatch: 1, identity: 0.972
atagccgcccagctccttaatcttatccagcgacat CRISPR spacer atagccgcccagctccttaatcttatccagcggcat Protospacer ********************************.***
10. spacer 1.7|843916|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_007206 (Haemophilus influenzae biotype aegyptius plasmid pF1947, complete sequence) position: , mismatch: 2, identity: 0.933
cctctttgagatgttccacgaaccacaacg CRISPR spacer cctctttgagttgctccacgaaccacaacg Protospacer ********** **.****************
11. spacer 1.11|844180|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_007206 (Haemophilus influenzae biotype aegyptius plasmid pF1947, complete sequence) position: , mismatch: 2, identity: 0.933
cctctttgagatgttccacgaaccacaacg CRISPR spacer cctctttgagttgctccacgaaccacaacg Protospacer ********** **.****************
12. spacer 2.1|950142|34|NC_020515|CRT matches to NZ_CP054197 (Glaesserella parasuis strain YHP170504 plasmid unnamed1, complete sequence) position: , mismatch: 2, identity: 0.941
tacttgagggggtaacggtatgcaaaaccattaa CRISPR spacer cgcttgagggggtaacggtatgcaaaaccattaa Protospacer ..********************************
13. spacer 1.9|844048|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP054197 (Glaesserella parasuis strain YHP170504 plasmid unnamed1, complete sequence) position: , mismatch: 3, identity: 0.9
tattttctgtaccacaaccttgccttgctt CRISPR spacer tattttctgtaccgcaaccttgtcttgcct Protospacer *************.********.*****.*
14. spacer 1.13|844312|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP054197 (Glaesserella parasuis strain YHP170504 plasmid unnamed1, complete sequence) position: , mismatch: 3, identity: 0.9
tattttctgtaccacaaccttgccttgctt CRISPR spacer tattttctgtaccgcaaccttgtcttgcct Protospacer *************.********.*****.*
15. spacer 5.14|1564809|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_028766 (Mannheimia phage vB_MhM_3927AP2, complete genome) position: , mismatch: 3, identity: 0.914
aattaagtaccgtacgttgatgacggaacttgaag CRISPR spacer aattaagtaccgcacgttgatgacggagcttgagg Protospacer ************.**************.*****.*
16. spacer 2.1|950142|34|NC_020515|CRT matches to NC_021724 (Aggregatibacter actinomycetemcomitans plasmid pS23A, complete sequence) position: , mismatch: 4, identity: 0.882
tacttgagggggtaacggtatgcaaaaccattaa CRISPR spacer caaatgagggggtaacggtatgctaaaccattaa Protospacer .* ******************* **********
17. spacer 2.1|950142|34|NC_020515|CRT matches to NC_021724 (Aggregatibacter actinomycetemcomitans plasmid pS23A, complete sequence) position: , mismatch: 4, identity: 0.882
tacttgagggggtaacggtatgcaaaaccattaa CRISPR spacer caaatgagggggtaacggtatgctaaaccattaa Protospacer .* ******************* **********
18. spacer 2.1|950142|34|NC_020515|CRT matches to NC_002579 (Aggregatibacter actinomycetemcomitans plasmid pVT745, complete sequence) position: , mismatch: 4, identity: 0.882
tacttgagggggtaacggtatgcaaaaccattaa CRISPR spacer aaaatgagggggtaacggtatgctaaaccattaa Protospacer * ******************* **********
19. spacer 2.1|950142|34|NC_020515|CRT matches to GQ866235 (Aggregatibacter actinomycetemcomitans strain D11S-1 plasmid S57, complete sequence) position: , mismatch: 4, identity: 0.882
tacttgagggggtaacggtatgcaaaaccattaa CRISPR spacer caaatgagggggtaacggtatgctaaaccattaa Protospacer .* ******************* **********
20. spacer 2.1|950142|34|NC_020515|CRT matches to GQ866235 (Aggregatibacter actinomycetemcomitans strain D11S-1 plasmid S57, complete sequence) position: , mismatch: 4, identity: 0.882
tacttgagggggtaacggtatgcaaaaccattaa CRISPR spacer caaatgagggggtaacggtatgctaaaccattaa Protospacer .* ******************* **********
21. spacer 2.4|950341|35|NC_020515|CRT,PILER-CR,CRISPRCasFinder matches to NZ_CP045829 (Escherichia coli strain AUSMDU00014361 plasmid pAUSMDU00014361_02, complete sequence) position: , mismatch: 5, identity: 0.857
atttctgaaacagggatttgcgtttcattccattt CRISPR spacer atttggcgaacaggtatttgcgtttcattccattt Protospacer **** .****** ********************
22. spacer 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MN694671 (Marine virus AFVG_250M145, complete genome) position: , mismatch: 6, identity: 0.8
cacaatc--aaaagcgattgttgatgattcaa CRISPR spacer --ccattaaaaaagagattgttaatgattcaa Protospacer * **. ***** *******.*********
23. spacer 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MN694753 (Marine virus AFVG_250M144, complete genome) position: , mismatch: 6, identity: 0.8
cacaatc--aaaagcgattgttgatgattcaa CRISPR spacer --ccattaaaaaagagattgttaatgattcaa Protospacer * **. ***** *******.*********
24. spacer 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MN694231 (Marine virus AFVG_250M143, complete genome) position: , mismatch: 6, identity: 0.8
cacaatc--aaaagcgattgttgatgattcaa CRISPR spacer --ccattaaaaaagagattgttaatgattcaa Protospacer * **. ***** *******.*********
25. spacer 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MN693981 (Marine virus AFVG_250M146, complete genome) position: , mismatch: 6, identity: 0.8
cacaatc--aaaagcgattgttgatgattcaa CRISPR spacer --ccattaaaaaagagattgttaatgattcaa Protospacer * **. ***** *******.*********
26. spacer 2.1|950142|34|NC_020515|CRT matches to NZ_KX753679 (Pasteurella multocida strain RCAD0259 plasmid pRCADGH-2, complete sequence) position: , mismatch: 6, identity: 0.824
tacttgagggggtaacggtatgcaaaaccattaa CRISPR spacer aaatgaggggggtaacggtatgctaaaccattaa Protospacer * * ..**************** **********
27. spacer 2.4|950341|35|NC_020515|CRT,PILER-CR,CRISPRCasFinder matches to MK422450 (Klebsiella phage ST13-OXA48phi12.4, complete genome) position: , mismatch: 6, identity: 0.829
atttctgaaacagggatttgcgtttcattccattt CRISPR spacer atttggctaacggggatttgggtttcattccattt Protospacer **** ***.******** **************
28. spacer 5.1|1563942|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to KM389287 (UNVERIFIED: Escherichia phage Phi06_2987 S clone contig00001 genomic sequence) position: , mismatch: 6, identity: 0.833
aaaattgcgtggacggacaatccggcgtggattttg CRISPR spacer aagtgggcatggacggacaatccggcgtggattttt Protospacer **. **.**************************
29. spacer 5.1|1563942|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_AP019820 (Enterobacter hormaechei subsp. hoffmannii strain OIPH-N069 plasmid pN069_3, complete sequence) position: , mismatch: 6, identity: 0.833
aaaattgcgtggacggacaatccggcgtggattttg CRISPR spacer aaatgggcgtggacagacaatccggcctggattttt Protospacer *** ********.*********** ********
30. spacer 1.1|843520|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KX853854 (Enterococcus faecium strain A120 plasmid pEMA120, complete sequence) position: , mismatch: 7, identity: 0.767
gccgattttaaattccatctcaagcttttc CRISPR spacer gccaattttaaattccatttcaagtatggt Protospacer ***.**************.*****. * .
31. spacer 1.1|843520|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_016967 (Enterococcus faecium plasmid pZB18, complete sequence) position: , mismatch: 7, identity: 0.767
gccgattttaaattccatctcaagcttttc CRISPR spacer gccaattttaaattccatttcaagtatggt Protospacer ***.**************.*****. * .
32. spacer 1.1|843520|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KX976485 (Enterococcus avium strain 19081 plasmid pEA19081, complete sequence) position: , mismatch: 7, identity: 0.767
gccgattttaaattccatctcaagcttttc CRISPR spacer gccaattttaaattccatttcaagtatggt Protospacer ***.**************.*****. * .
33. spacer 1.1|843520|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MG592433 (Vibrio phage 1.052.A._10N.286.46.C3, partial genome) position: , mismatch: 7, identity: 0.767
gccgattttaaattccatctcaagcttttc CRISPR spacer ccctccttaaaattccatctaaagcttttg Protospacer ** .** *********** ********
34. spacer 1.3|843652|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP026267 (Aminobacter sp. MSH1 plasmid pBAM2, complete sequence) position: , mismatch: 7, identity: 0.767
atgctgacaaattattaggcgtatggcaac CRISPR spacer tggctgacaaattattaggcatattgtcat Protospacer ******************.*** *. *.
35. spacer 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP025078 (Enterococcus faecium strain LS170308 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
cacaatcaaaagcgattgttgatgattcaa CRISPR spacer acaaatcacaagtgattgttgatgataaaa Protospacer ***** ***.************* **
36. spacer 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015437 (Anoxybacillus sp. B7M1 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.767
cacaatcaaaagcgattgttgatgattcaa CRISPR spacer aagagcgaaaagcgcttgttgaagattcaa Protospacer * *.. ******* ******* *******
37. spacer 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MG969411 (UNVERIFIED: Salmonella phage GE_vB_MG, complete genome) position: , mismatch: 7, identity: 0.767
cacaatcaaaagcgattgttgatgattcaa CRISPR spacer cacaatcaaacacgattgttgataagctca Protospacer ********** .***********.* .. *
38. spacer 2.1|950142|34|NC_020515|CRT matches to NZ_KX753679 (Pasteurella multocida strain RCAD0259 plasmid pRCADGH-2, complete sequence) position: , mismatch: 7, identity: 0.794
tacttgagggggtaacggtatgcaaaaccattaa CRISPR spacer aaatgaggggggtaacagtatgctaaaccattaa Protospacer * * ..*********.****** **********
39. spacer 2.4|950341|35|NC_020515|CRT,PILER-CR,CRISPRCasFinder matches to CP051275 (Salmonella phage SW-37, complete genome) position: , mismatch: 7, identity: 0.8
atttctgaaacagggatttgcgtttcattccattt CRISPR spacer atctggctaaccgggatctgcgtttcattccattt Protospacer **.* *** *****.*****************
40. spacer 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP020439 (Streptococcus equinus strain FDAARGOS_251 plasmid unamed1 sequence) position: , mismatch: 7, identity: 0.794
ttgatattattgataatatggaaaaag-agatgac CRISPR spacer aatttattattgataatgtgaaaaaagaagatga- Protospacer *************.**.****** ******
41. spacer 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP018188 (Streptococcus salivarius strain ICDC2 plasmid, complete sequence) position: , mismatch: 7, identity: 0.794
ttgatattattgataatatggaaaaag-agatgac CRISPR spacer aatttattattgataatgtgaaaaaagaagatga- Protospacer *************.**.****** ******
42. spacer 5.1|1563942|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to FJ982340 (Burkholderia phage KS9, complete genome) position: , mismatch: 7, identity: 0.806
aaaattgcgtggacggacaatccggcgtggattttg CRISPR spacer aagccggcgtggacgaacaacccggcgtggattttc Protospacer **. . *********.****.**************
43. spacer 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP045339 (Vibrio sp. THAF190c plasmid pTHAF190c_a, complete sequence) position: , mismatch: 8, identity: 0.733
cacaatcaaaagcgattgttgatgattcaa CRISPR spacer ttcaatcaaaagcgattggtgttgaaatca Protospacer . **************** ** *** . *
44. spacer 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MK448712 (Streptococcus phage Javan237, complete genome) position: , mismatch: 8, identity: 0.733
cacaatcaaaagcgattgttgatgattcaa CRISPR spacer aacaatcaaaagcgactgtggatgccatca Protospacer **************.*** **** . . *
45. spacer 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MK448880 (Streptococcus phage Javan238, complete genome) position: , mismatch: 8, identity: 0.733
cacaatcaaaagcgattgttgatgattcaa CRISPR spacer aacaatcaaaagcgactgtggatgccatca Protospacer **************.*** **** . . *
46. spacer 1.14|844378|30|NC_020515|CRISPRCasFinder,CRT matches to MT234670 (Pseudanabaena phage PA-SR01, complete genome) position: , mismatch: 8, identity: 0.733
cgaagtaaaaatcattggttatgtagggca CRISPR spacer tacagtaaaaatcaatggtcatgtagctct Protospacer .. *********** ****.****** *
47. spacer 1.15|844444|30|NC_020515|CRISPRCasFinder,CRT matches to NZ_CP022536 (Spiroplasma corruscae strain EC-1 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.733
aagtaaatattacacaggaattatgggaga CRISPR spacer taggaaatattacacaggtattattcaact Protospacer ** ************** ***** .*
48. spacer 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_048107 (Staphylococcus phage Pabna, complete genome) position: , mismatch: 8, identity: 0.765
ttgatattattgataatatggaaaaagagatgac CRISPR spacer aaaatcacattgataatgtggaaaaagaaatgac Protospacer .** .*********.**********.*****
49. spacer 4.8|1553427|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_041925 (Proteus phage VB_PmiS-Isfahan, complete genome) position: , mismatch: 8, identity: 0.765
aaatgaaagcgtataaatctcgccactttgcaat CRISPR spacer gagtcacagcgtataaatctcgccattttacagc Protospacer .*.* * ******************.***.**..
50. spacer 4.8|1553427|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MN840487 (Proteus phage 2207-N35, complete genome) position: , mismatch: 8, identity: 0.765
aaatgaaagcgtataaatctcgccactttgcaat CRISPR spacer gagtcacagcgtataaatctcgccattttacagc Protospacer .*.* * ******************.***.**..
51. spacer 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MN694439 (Marine virus AFVG_250M296, complete genome) position: , mismatch: 8, identity: 0.765
taagcaattataaagaaagaataaaagacgttgc CRISPR spacer gtaaaatttctaaagaaagaataaaagatgttga Protospacer *. * ** ******************.****
52. spacer 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MN694282 (Marine virus AFVG_250M297, complete genome) position: , mismatch: 8, identity: 0.765
taagcaattataaagaaagaataaaagacgttgc CRISPR spacer gtaaaatttctaaagaaagaataaaagatgttga Protospacer *. * ** ******************.****
53. spacer 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to GQ443085 (Clostridium phage CP26F, complete genome) position: , mismatch: 8, identity: 0.765
taagcaattataaagaaagaataaaagacgttgc CRISPR spacer agagaaggtataaagaaagaacaaaagaagttgt Protospacer .** *. *************.****** ****.
54. spacer 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to JF767210 (Clostridium phage phi9O, complete genome) position: , mismatch: 8, identity: 0.765
taagcaattataaagaaagaataaaagacgttgc CRISPR spacer agagaaggtataaagaaagaacaaaagaagttgt Protospacer .** *. *************.****** ****.
55. spacer 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_011318 (Clostridium phage 39-O, complete genome) position: , mismatch: 8, identity: 0.765
taagcaattataaagaaagaataaaagacgttgc CRISPR spacer agagaaggtataaagaaagaacaaaagaagttgt Protospacer .** *. *************.****** ****.
56. spacer 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_019496 (Clostridium phage phiCP26F, complete genome) position: , mismatch: 8, identity: 0.765
taagcaattataaagaaagaataaaagacgttgc CRISPR spacer agagaaggtataaagaaagaacaaaagaagttgt Protospacer .** *. *************.****** ****.
57. spacer 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_015917 (Borreliella bissettii DN127 plasmid lp28-4, complete sequence) position: , mismatch: 8, identity: 0.765
taagcaattataaagaaagaataaaagacgttgc CRISPR spacer ttagaaattataaagaaagaaaaaaataaaatgg Protospacer * ** **************** **** * . **
58. spacer 4.20|1554224|37|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP045223 (Achromobacter xylosoxidans strain DN002 plasmid unnamed) position: , mismatch: 8, identity: 0.784
gtgccgagacttgccggtgtatcggtc---acagctaaac CRISPR spacer gtggcgagaattgccggtgtatcggtctggacggcct--- Protospacer *** ***** ***************** **.**.
59. spacer 5.1|1563942|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to KT367887 (Klebsiella phage vB_Kp3, complete genome) position: , mismatch: 8, identity: 0.778
aaaattgcgtggacggacaatccggcgtggattttg CRISPR spacer aaacaggcgtggacggataacccggcgtggatcgtc Protospacer *** ***********.**.***********. *
60. spacer 5.1|1563942|36|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MT075871 (Klebsiella phage vB_KleS-HSE3, complete genome) position: , mismatch: 8, identity: 0.778
aaaattgcgtggacggacaatccggcgtggattttg CRISPR spacer aaacaagcgtggactgacaacccggcgtggatcgtt Protospacer *** ******** *****.***********. *
61. spacer 1.4|843718|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MK250029 (Prevotella phage Lak-C1, complete genome) position: , mismatch: 9, identity: 0.7
gggcagttgcaagacatgtatcaaaatctt CRISPR spacer tattctcttcaagacatgtttcaaaatctt Protospacer . . .* ********** **********
62. spacer 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MN693046 (Marine virus AFVG_25M413, complete genome) position: , mismatch: 9, identity: 0.7
cacaatcaaaagcgattgttgatgattcaa CRISPR spacer ctttcattaaagcgattgtggatgattcat Protospacer * . . *********** *********
63. spacer 1.5|843784|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MN693008 (Marine virus AFVG_117M9, complete genome) position: , mismatch: 9, identity: 0.7
cacaatcaaaagcgattgttgatgattcaa CRISPR spacer ctttcattaaagcgattgtggatgattcat Protospacer * . . *********** *********
64. spacer 1.8|843982|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to AJ783769 (Sulfolobus tengchongensis spindle-shaped virus STSV1 complete genome) position: , mismatch: 9, identity: 0.7
cagtgtattcgcattggaaagcgtaaaaga CRISPR spacer aaagagatacgcattggaaagcgtaaatat Protospacer *. . ** ****************** .
65. spacer 1.12|844246|30|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to AJ783769 (Sulfolobus tengchongensis spindle-shaped virus STSV1 complete genome) position: , mismatch: 9, identity: 0.7
cagtgtattcgcattggaaagcgtaaaaga CRISPR spacer aaagagatacgcattggaaagcgtaaatat Protospacer *. . ** ****************** .
66. spacer 1.14|844378|30|NC_020515|CRISPRCasFinder,CRT matches to NZ_CP032532 (Bacillus megaterium NCT-2 plasmid pNCT2_4, complete sequence) position: , mismatch: 9, identity: 0.7
cgaagtaaaaatcattggttatgtagggca CRISPR spacer tgaagtaaaaatcattgattattttaccgt Protospacer .****************.**** * .
67. spacer 2.4|950341|35|NC_020515|CRT,PILER-CR,CRISPRCasFinder matches to MT028491 (Ochrobactrum phage vB_OspM_OC, complete genome) position: , mismatch: 9, identity: 0.743
atttctgaaacagggatttgcgtttcattccattt CRISPR spacer atttctgaaacagggaattgcttttgctcgccagt Protospacer **************** **** *** *. * *
68. spacer 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP010312 (Geoalkalibacter subterraneus strain Red1 plasmid pGSUB1, complete sequence) position: , mismatch: 9, identity: 0.735
ttgatattattgataatatggaaaaagagatgac CRISPR spacer gtcgcgtgattgattatttggaaaaagagatgtc Protospacer * ...* ****** ** ************** *
69. spacer 4.10|1553559|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MN694724 (Marine virus AFVG_250M441, complete genome) position: , mismatch: 9, identity: 0.735
tattacaacatttgacaatcaaacattatgggag CRISPR spacer tggtcaaacatttgatagtcaaacattatgctac Protospacer *. * *********.*.************ *
70. spacer 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_011251 (Borrelia duttonii Ly plasmid pl41, complete sequence) position: , mismatch: 9, identity: 0.735
taagcaattataaagaaagaataaaagacgttgc CRISPR spacer tgttatacaataaagaaagaataaaagattttgc Protospacer *. *. *******************. ****
71. spacer 5.15|1564876|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP013487 (Vibrio alginolyticus strain ATCC 33787 plasmid pMBL287, complete sequence) position: , mismatch: 9, identity: 0.743
gctatggg--ggattcgtcaatttcaattttatggtg CRISPR spacer --tgtgaatcaaattcgtcaatttcaatttattggtg Protospacer *.**.. ..****************** *****
72. spacer 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_041977 (Citrobacter phage Mordin, complete genome) position: , mismatch: 10, identity: 0.706
tgttaaaaataaaccctgcacggggcagggtcgg CRISPR spacer atttaaaaataaaccttgtacggggctttttaga Protospacer *************.**.******* * *.
73. spacer 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_028247 (Citrobacter phage Michonne, complete genome) position: , mismatch: 10, identity: 0.706
tgttaaaaataaaccctgcacggggcagggtcgg CRISPR spacer atttaaaaataaaccttgtacggggctttttaga Protospacer *************.**.******* * *.
74. spacer 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MF158044 (Shigella phage Sf18, complete genome) position: , mismatch: 10, identity: 0.706
tgttaaaaataaaccctgcacggggcagggtcgg CRISPR spacer atttaaaaataaaccttgtacggggctttctaga Protospacer *************.**.******* * *.
75. spacer 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MF158040 (Shigella phage Sf13, complete genome) position: , mismatch: 10, identity: 0.706
tgttaaaaataaaccctgcacggggcagggtcgg CRISPR spacer atttaaaaataaaccttgtacggggctttttaga Protospacer *************.**.******* * *.
76. spacer 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to KM236239 (Citrobacter phage Moogle, complete genome) position: , mismatch: 10, identity: 0.706
tgttaaaaataaaccctgcacggggcagggtcgg CRISPR spacer atttaaaaataaaccttgtacggggctttttaga Protospacer *************.**.******* * *.
77. spacer 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MF158041 (Shigella phage Sf15, complete genome) position: , mismatch: 10, identity: 0.706
tgttaaaaataaaccctgcacggggcagggtcgg CRISPR spacer atttaaaaataaaccttgtacggggctttttaga Protospacer *************.**.******* * *.
78. spacer 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MH920362 (Citrobacter phage Maleficent, complete genome) position: , mismatch: 10, identity: 0.706
tgttaaaaataaaccctgcacggggcagggtcgg CRISPR spacer atttaaaaataaaccttgtacggggctttttaga Protospacer *************.**.******* * *.
79. spacer 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to KY654690 (Citrobacter phage Mijalis, complete genome) position: , mismatch: 10, identity: 0.706
tgttaaaaataaaccctgcacggggcagggtcgg CRISPR spacer atttaaaaataaaccttgtacggggctttttaga Protospacer *************.**.******* * *.
80. spacer 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MF327003 (Shigella phage Sf14, complete genome) position: , mismatch: 10, identity: 0.706
tgttaaaaataaaccctgcacggggcagggtcgg CRISPR spacer atttaaaaataaaccttgtacggggctttttaga Protospacer *************.**.******* * *.
81. spacer 4.2|1553027|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MF327005 (Shigella phage Sf19, complete genome) position: , mismatch: 10, identity: 0.706
tgttaaaaataaaccctgcacggggcagggtcgg CRISPR spacer atttaaaaataaaccttgtacggggctttttaga Protospacer *************.**.******* * *.
82. spacer 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MN694255 (Marine virus AFVG_250M362, complete genome) position: , mismatch: 10, identity: 0.706
ttgatattattgataatatggaaaaagagatgac CRISPR spacer ataatattattgataatattgataaagtaggaat Protospacer *.**************** ** **** .. .*.
83. spacer 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_013544 (Lactobacillus paracasei subsp. paracasei plasmid pCD02, complete sequence) position: , mismatch: 10, identity: 0.706
ttgatattattgataatatggaaaaagagatgac CRISPR spacer ttgatagtattgataatatagaaattaaaccgca Protospacer ****** ************.**** .*. .*
84. spacer 4.6|1553295|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to KC013023 (Leuconostoc phage phiLN04, complete genome) position: , mismatch: 10, identity: 0.706
ttgatattattgataatatggaaaaagagatgac CRISPR spacer agattattattgataatattgaaaaggaaaataa Protospacer . *************** *****.**.* *
85. spacer 4.8|1553427|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to MH586730 (Salmonella phage Solent, complete genome) position: , mismatch: 10, identity: 0.706
aaatgaaagcgtataaatctcgccactttgcaat CRISPR spacer tcgtgatagcgtataaatcgcgccacttaatgtt Protospacer .*** ************ ******** ... *
86. spacer 4.8|1553427|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_047786 (Salmonella phage vB_SenS_Sasha, complete genome) position: , mismatch: 10, identity: 0.706
aaatgaaagcgtataaatctcgccactttgcaat CRISPR spacer tcgtgatagcgtataaatcgcgccacttaatgtt Protospacer .*** ************ ******** ... *
87. spacer 4.8|1553427|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to KY002061 (Salmonella phage vB_SenS_Sergei, complete genome) position: , mismatch: 10, identity: 0.706
aaatgaaagcgtataaatctcgccactttgcaat CRISPR spacer tcgtgatagcgtataaatcgcgccacttaatgtt Protospacer .*** ************ ******** ... *
88. spacer 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP024238 (Escherichia coli O15:H11 strain 90-9272 plasmid unnamed) position: , mismatch: 10, identity: 0.706
taagcaattataaagaaagaat-aaaagacgttgc CRISPR spacer accgcaattataatgaaagaatcaaaggataata- Protospacer ********** ******** ***.**.. *.
89. spacer 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_013507 (Escherichia coli ETEC H10407 plasmid pEntH10407, complete sequence) position: , mismatch: 10, identity: 0.706
taagcaattataaagaaagaat-aaaagacgttgc CRISPR spacer accgcaattataatgaaagaatcaaaggataata- Protospacer ********** ******** ***.**.. *.
90. spacer 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP024249 (Escherichia coli O182:H21 strain D181 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.706
taagcaattataaagaaagaat-aaaagacgttgc CRISPR spacer accgcaattataatgaaagaatcaaaggataata- Protospacer ********** ******** ***.**.. *.
91. spacer 4.12|1553692|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_017722 (Escherichia coli ETEC H10407 plasmid p666, complete sequence) position: , mismatch: 10, identity: 0.706
taagcaattataaagaaagaat-aaaagacgttgc CRISPR spacer accgcaattataatgaaagaatcaaaggataata- Protospacer ********** ******** ***.**.. *.
92. spacer 4.22|1554361|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to KT588073 (Acinetobacter phage Ab105-3phi, partial genome) position: , mismatch: 10, identity: 0.714
aaattatgccttaataatactttaagtttttaaaa CRISPR spacer ttgctttcatttaataatactttaacttttaaaaa Protospacer ..* * .*************** **** ****
93. spacer 4.23|1554428|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP017773 (Paenibacillus crassostreae strain LPB0068 plasmid pPC03, complete sequence) position: , mismatch: 10, identity: 0.714
tcttctgagctttccaggcattaaaaccttgctca CRISPR spacer gccctagatccttccaggcattaaaaccttgaacc Protospacer *... ** *.******************** *
94. spacer 4.8|1553427|34|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP033124 (Acinetobacter wuhouensis strain WCHAW010062 plasmid p4_010062, complete sequence) position: , mismatch: 11, identity: 0.676
aaatgaaagcgtataaatctcgccactttgcaat CRISPR spacer gcgccaaagcgtacaaatctggccactttttatc Protospacer . .. ********.****** ******** .* .
95. spacer 5.6|1564278|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NC_048765 (Vibrio phage VAP7, complete genome) position: , mismatch: 11, identity: 0.686
gtgttttaccaggatttcttttgctagtgtctcag CRISPR spacer gatacgagccaggatttcttttgcttgtttctctt Protospacer * . .***************** ** ****
96. spacer 5.14|1564809|35|NC_020515|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP044084 (Pseudomonas luteola strain FDAARGOS_637 plasmid unnamed1, complete sequence) position: , mismatch: 12, identity: 0.657
aattaagtaccgtacgttgatgacggaacttgaag CRISPR spacer tgagaagtaccgttcgttgatgacgaaaccaccct Protospacer . ********* ***********.***.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1202953 : 1209983
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_020515|1202953:1209983|DBSCAN-SWA ATTAGTGTCTAAAATGACGAATCCCTGTTTTCACCATAGCGATATTGTGTGCGTTACACACGGCAACCGAATCCGCATCTTTAATTGAGCCACCTGGTTGAATAACAGCGGTTACACCATAGTCTTTTGCCATTTCCATCGTATCGCCCATTGGGAAAAACGCATCGCTAGCAAGCACTAACAGTTGTTCGTTAAATAATGGATGCGCTTTGGCTTGTTCAAGGGCGATATGCGCCGCACCCACACGATTCATTTGCCCAGCACCGATACCAAGTGTCATATAACTATTACCCACGACAATCGCATTACTTTTCACGTGTTTGCACACACGCATGGTGAAGTTAAGTGCGTGAAGTTCTGCGGTATTTGGTGTGCGTTCGGTCATCAATTCCCATTCTGCTGGTAGGCTGTTTGGGGTGTCTTTATGTAAGTCGTTTGAGTGGTCTGCACTTTGCTCTAAAATGCCGCCGTTTACAGACACAAATTCACGCTCGTAAGAGCGATCTTTTGCCGAGAAATCCAACGTCATTAAGCGGATATTTTTCTTTTGCGAAAGTAGTGCAAAAGCTTCATCATCAAAATGTGGTGCGATGATGATTTCTAAGAAAATGTTGTGCAATTTCTCTGCAAGCGTAAGAGTAACAGGGCGGTTTAATACGATAATACCACCGAAGATAGACACAGGATCGGCTGCATAGCAACGGTCAAAGGCTGTATTGATCTCATCGCCAATTGCCACGCCACAAGGGTTCATATGCTTAACGGCAACTGCACAAGGTTCATCAAATTCTCGGGCAATTTTGATTGCGGCATCGGCATCTTTGATATTGTTGTAAGAAAGCGCTTTGCCGTGTAATTGGGTCGCATTTGCTAATGAAAATTGAGGGGCGAACACACTCGTGTAGAAATTCGCTTGTTGATGGCTATTTTCGCCATAGCGTAAAGTCTGTTTGTTAGAATAAGCAAGGGTTAAGTGATCCCATTCATTTTTCGGGAAGTCCACTTCGTTGTTAGCCTGTTCACGTGCAAGGTATTCGCCAATTAAGCTATCGTAATGGGCGGTTAATTGATACACCTTACGAGCTAAGTATTGACGGAAAGCAAGACTGGTTTCGCCATTTTGTTCTAATTGAGCGATTAAGGTTGGGTAGTCCGCAGGATCGGTTACCACCGTAACAGAACGATGATTTTTTGCCGCACTGCGTAGCATTGAAGGGCCACCAATATCAATGTTTTCAATGGCTAAATCAAAGGTACAATCAGGGCGTTCAATGGTCTGACGGAAAGGGTAGAGATTGACAATCACATAATCAATATGGGCAATTTGATGTTCGGTCATCGCTTTCTGGTGTTCGGCATTGTCTCGCACGCCAAGCAAACCACCGTGAATTTTAGGGTGAAGTGTTTTTACTCTACCGTCCATCATTTCAGGAAATCCGGTATAGCTATCTACACTTTGCACAGGAATACCTGCATCTTTAATTGTTTGGAAAGTCCCGCCAGTGGAAAGTAAGATAATGCCTTGTTTAACAAGTGCATTGGCTACATCTAACACGCCAGTTTTGTCAGAAAGGCTAAGAAGTGCGTATTGTTGAGTCATTAAAAATTCCTGTGTATCAATTTGAATGAATGGGCGTATGCAATACGCCCCTACAATTTATCGTTGTGAAACAATCTGCTGAATCGTTTTCGGATAAAGCTGATGTTCAATTTTATGGATTTCATTGGCAAGTTGATCTTGCGTCCAATTTGGATCGATGGCTAAGGCTTGTTGCGCAATGATTTCGCCAGTATCTACACCGCTATCTACCCAGTGAATAGTTACGCCTGTTTGAGAAACACCAGCACGGTAAGCTTCACCAATAGCATCTACTGTCCCACCAAAAGCAGGAAGGAGAGAAGGGTGAATGTTGATAATTTGGTTAGGATAGGCATCAAGCATCGTATTGCCTAAGATCCGCATAAATCCTGCTAAAACCACCAAATTGATCTGTTTGGGTTGTAAATAGGCAACGACTTGTTGTTCCCATTCAGTGCGGCTCATACCTTTACGAGCAAGTACGAAACAATCAATATTGGCAAGTTTGGCACGCTCAATCACATAGGCTTGCGGTTGGTCGCAAACAATGCAGGCAACTTTTGCAGAAATTTCACCATTTTGTACCGCTTGAATAATCGCTTGTACATTTGAGCCGTTACCAGAGGCGAAAAGGGCTAGGTTCATTTGGGGTATTATCTCCTGTTATTCCCTCTCCCTTTGGGGAGAGGGTGGAGCAAATTCGTAGAATTTGCGGAGGGAGAGGGGATAGTACTCAACTACTCTCTCTCCTAATTGCATAGTTCTACTACTATACCATATCTCTAAGCTAGTCAGTAGATAACCCTCTCTCTGCCAAAATTGCTAAACGCAATTTTGCCTGTCTCTCTCCCTAGAAGGGAGAGAGTATAAATATTATTTAATTAAACGCACCGTCGCAGTTTCATCTGCAACGACTTTACCCACCACTTTCGCAGTCGGCACTTGTTTTAGCACGCTATCCACATTTTCCGCAGGCACAGCAATCACCATACCTAAGCCCATATTGAACACTTGATACATTTCATCTTCTGCGATGTTGCCGAGTTCTTGAATGGTTTGGAACACTAATGGAATATCCCACGAGCCAAGTTTAATTTCGGCACATAAGCCGTCTGCGAACATTCTTGGCACATTTTCATAAAAACCACCGCCAGTGATATGAGAAATGCCGTGAATTAAGCCTTGTTGCACCAGAGGTAATACCTCTTTTACATAAATTTTGGTAGGGGTGAGTAGCGTGTCAATGAGTGTTCTTCCATCTGCAAGTTTATCGTCAAATTGCCAGTTATGGTCTTTGAAGAAGACTTTACGCACCAGTGAAAAACCATTTGAATGCACACCTGAACTTGGCAAACCGATGAGTACATCGCCTGCAATCACTTGGTTACGGTCAAAAAGTTTCGCTTTATCGCCAATCCCCACACAGAAACCTGCAAGATCAAATTCATCTTCTTTATAGAGATCAGGCATTTCAGCGGTTTCGCCACCAATTAACGCAGCACCTGCTTGTACACAACCTTCGGCAACGCCAGCCACCATTTGCTCAATCTTAGCAGGATCATTTTTGCCTACGGCTAAGTAATCTAAGAAAAAGAGAGGCTCTGCACCTTGTGCTAACACATCGTTTACGCACATTGCCACACAATCAATCCCAACGGTATTTAAAATACCTGATTGTTGGGCAAGCAAAATTTTTGTACCGACACCGTCTGTGCCACTGACTAAAACAGGCTCTTCAAATTTGTGCCCTGCAAGGGAAAATAAGCCTCCAAAACCACCGATTTGGGAAAGCACTTCGGGGCGGTTGGTGCGTTCAATATGGCGTTTCATTCGTTTAACGGCTTCGTAGCCTTGTTCAATGTTCACACCAGAAGCACTGTATGCGTTACTCATCGACATTTTCTCCGTTTAATATTTGGCGTTGGACTAGGGTTAATTGGCTCTCTAATTGAGCTTGATAATCGCATAATTCAGCGGGGTATTCTCCGGTGAAGGCGGCAACACATAGGCCGTTATTTGGTGCATCAAATTTTGTACCAATTCCTTCAATTAGACCTTGCACAGAAAGGAAACCTAAGCTATCACAGCCGAGATATTCCACCATTTCACTATGTGTTTTATTCGCGGCGAGCAATTCGCCACTGGTACTCATATCAATCCCATAGAAATTGGGGAAACGAAATGGCGGACTAGCAATGCGTAGATGAATCTCTTTTGCCCCTGCTTCACGCAGTAACTGTACAATGCGCTTACAAGTTGTACCGCGTACGATAGAATCATCTACTAAAACGATAGATTTTCCACGCACCACACTGTTTACGGCAGAAAGTTTTTTACGTACTCCTTGTTCTCGTAAGGCTTGAGTAGGCGCAATAAAGGTTCGTGCGACGTATTGGTTTTTGATCAACCCCATTTCATAAGGCAAACCTGATTCTTCTGCATAGCCACTTGCGGCAGAAAGTGAAGAGTTCGGCACTCCAACGACCATGTCCGCAAGCGGAGCAGGGCACTCCCGCGCTAATACCTTACCAGTTTCTTTACGTACAGAATGGACATTAATACCGGCAATATCAGAATCAGGGCGAGCAAAATAGATATATTCCATGGCTTCAATTGCTACTTGCGTTTCATGGGTATATTGCTCAAGCCGAATACCATCATTGTTAATGACGACATATTGACCAGCATGGACGTTACCCACAAATTTTGCCCCGATAGTATCAAGTGCGCAGGTTTCACTTGCTAGCACATATGCACCATTAGATAACTGCCCAATAGCAAGAGGGCGGAATGCATTTCTATCTACAGCGCCCATTAAGCCATCATTGGTTAAAATGACATAGTTAAAACCACCACGCAGCTGAGCAAGCGATTCTGCAAGTTTTTGCTCAAAACCAACCGCTTGACTTTGACGGATTAAATGGATTAACACCTCGCTATCACTGGTAGCATGAAAGACTGCACCGTTTTGTTCTAATTGCTTACGTAACGTTTTAGCATTAGTGATATTACCATTATGCGCAATTGCTATGTTTTGATCATGAAAGCGAAATAAAAATGGTTGCACATTCACTAAATCATCTGAACCGCCACAAGTCGCATATCGAACATGACCAATGCCATTATTGCCCTTTAATGTGCCAAGGACATTGGGATCTTCAAATACCTGGCTGAGTAAACCAATACCACGATATTGATGAAAGTCTGCTTTTTGCGAAGTAACAATCCCGGCGCCTTCTTGCCCACGGTGTTGTAGTGAATGTAAGCCATAATAGACAATTTGGTTAGCATCTTCGTGTCCCCAAACACCGAAGATACCGCATTCTTCATTTAATGAGCGAGAGTGTTCATATTCGTATTCAAACATAAAGTTTCCGTAGGGGCGAACCGATATGTTCGCCCAGATATTTTACTGATGTTTATTTGGGCGGGAACATTAGGCCGCCCCTACAATAATTAAAGAGAAATATGTTTAGCTAAACGCTCATATACTTCTTGATAAACAGGAATAATATCGCCCAAATCACGGCGGAAGTTGTCTTTATCTAATTTTGCTTGGGTGGTTTTATCCCATAAGCGGCTGGTATCTGGCGAGATTTCATCGCCTAAAATAATCTGCCCGTCACTGGTTTTACCAAATTCAAATTTAATATCCACGAGGATAATGCCAATCGCATCAAACAATTCAACCAATTTCTCATTCACTTGGAGAGAGAGTTTTTCAATTTCTTGCAGTTCGTCCATTGTCGCAATGCCGAGTGCTGAAAGTTGGCTGTTATTTACCATTGGGTCGCCTAAAGCATCATCTTTATAACAAAATTCTACGATAGGGTGGTTAAGTGTTTTCCCCTCTTCCAACCCCATGCGCTTTGCAAATGAACCCGCCACACGGTTACGCACAATGGCTTCAACTGGCACAATAGTTAATTTGCGTACAATTTGGGAGGTTTCATCAACTTGTTTAACGAATTGGGTGGGAATCTGTTGGGCGTTGAGCCATTGGAAGATAAGTGAAGAGATTTTGTTGTTTAGTACACCTTTGCCTTCAATCTGTGCTTTTTTTTCGCCGTTAAAGGCGGTCGCTTGGTTAGAATAGACGACGAGCAGTTCGTCAGGATTATCGGTAGTGTAAAGGGCTTTTGCTTTCCCTTCGTAGAGTTTTTGCATAAGAAGTTACTCCAGTAGTGAAATGTCTGACGTGATTCTATCACGCAAACGGTTGCGCACTCAAGCAATCGTTTGCCTTATTTTTTCATTGAGCAATAAAGACGTTTATGGAAGAATAGCCGTACAGTGAGCACAATTCTTTAACAAGCGGTATAAAAATACCCCAAATTTACTAAAGGAAGACATAATGCAATATATCAACATCGCTAATGACGGCGTAAAAACCCTTTCTCCTTACCAAGCAGGCAAGCCTATTGAAGAATTAGAACGAGAGTTAGGTATTTCAAATATTATCAAACTTGCCTCAAACGAAAATCCATTTGGTTTCCCTGAATCTGCTAAGCAAGCAATCATCAATCAACTCAACGAGTTAACTCGTTATCCTGATGCCAACGGCTTTGAATTGAAAGCAACAATTGCCAAAAAATTTGGTGTGAAAGCAGAGCAGATTACTCTCGGTAATGGTTCAAATGACCTCCTTGAGCTTTTTGCCCACACTTTTGCAGATCAAGACGATGAAATTATTTATTCGCAATATGCGTTTGTGGTGTATCCACTGGTAACCAAAGCAATCAATGCCACAGCTCGTGAAATTCCTGCTAAAAATTGGGGGCATGATTTGGAGGCTTTTTTGGCGGCAATTAATGAAAAAACCAAATTAATTTTTATCGCTAACCCAAATAATCCAACAGGCAATTTCTTGACCCATAATGAACTTGATGCTTTCTTGGCAAAAGTACCTGGAAAGGTCTTGGTAGTGCTTGATGAAGCTTACACAGAATTTACCGCAGAACAAGAGCGGGTGGACTCTTTCGCATTAAGTGCAAAATATCCAAATTTAATCATTTCTCGCTCGCTCTCAAAAGCTTATGGTTTGGCTGGACTAAGAATTGGCTATGCGGTCTCGAATCCTGAAATTGCCGATCTCCTCAACCGTGTTCGTCAGCCATTTAACTGCAACAGCCTTGCATTAGCAGCAGCGCAAGCGGTCTTAAATGACGATGAATTTGTCAAAAAAGTGGCAGAAAATAACCGCTTGGAAATGGCTCGCTATGAGGCATTTTGCACAGCTCAAGGATTAGAATTTATTCCATCAAAAGGCAATTTCATCACAATTGATTTTAAGCGTCCCGCTCAAGCGATCTATGAAGATCTTCTGCGAGAAGGGGTTATTGTTCGTCCAATTGCAGGTTACGGAATGCCAAATCAGCTTCGGGTAAGTATTGGATTACCAGAAGAAAACGATCGGTTTTTTAGTGCTTTGTTGAAAGTATTGAATAAATAA
Protein sequences of DBSCAN-SWA_1 >NC_020515|1202953:1209983|1204609_1205176_-|WP_015432526.1|DBSCAN-SWA MNLALFASGNGSNVQAIIQAVQNGEISAKVACIVCDQPQAYVIERAKLANIDCFVLARKGMSRTEWEQQVVAYLQPKQINLVVLAGFMRILGNTMLDAYPNQIINIHPSLLPAFGGTVDAIGEAYRAGVSQTGVTIHWVDSGVDTGEIIAQQALAIDPNWTQDQLANEIHKIEHQLYPKTIQQIVSQR >NC_020515|1202953:1209983|1206416_1207898_-|WP_015432528.1|DBSCAN-SWA MFEYEYEHSRSLNEECGIFGVWGHEDANQIVYYGLHSLQHRGQEGAGIVTSQKADFHQYRGIGLLSQVFEDPNVLGTLKGNNGIGHVRYATCGGSDDLVNVQPFLFRFHDQNIAIAHNGNITNAKTLRKQLEQNGAVFHATSDSEVLIHLIRQSQAVGFEQKLAESLAQLRGGFNYVILTNDGLMGAVDRNAFRPLAIGQLSNGAYVLASETCALDTIGAKFVGNVHAGQYVVINNDGIRLEQYTHETQVAIEAMEYIYFARPDSDIAGINVHSVRKETGKVLARECPAPLADMVVGVPNSSLSAASGYAEESGLPYEMGLIKNQYVARTFIAPTQALREQGVRKKLSAVNSVVRGKSIVLVDDSIVRGTTCKRIVQLLREAGAKEIHLRIASPPFRFPNFYGIDMSTSGELLAANKTHSEMVEYLGCDSLGFLSVQGLIEGIGTKFDAPNNGLCVAAFTGEYPAELCDYQAQLESQLTLVQRQILNGENVDE >NC_020515|1202953:1209983|1202953_1204552_-|WP_015432525.1|DBSCAN-SWA MTQQYALLSLSDKTGVLDVANALVKQGIILLSTGGTFQTIKDAGIPVQSVDSYTGFPEMMDGRVKTLHPKIHGGLLGVRDNAEHQKAMTEHQIAHIDYVIVNLYPFRQTIERPDCTFDLAIENIDIGGPSMLRSAAKNHRSVTVVTDPADYPTLIAQLEQNGETSLAFRQYLARKVYQLTAHYDSLIGEYLAREQANNEVDFPKNEWDHLTLAYSNKQTLRYGENSHQQANFYTSVFAPQFSLANATQLHGKALSYNNIKDADAAIKIAREFDEPCAVAVKHMNPCGVAIGDEINTAFDRCYAADPVSIFGGIIVLNRPVTLTLAEKLHNIFLEIIIAPHFDDEAFALLSQKKNIRLMTLDFSAKDRSYEREFVSVNGGILEQSADHSNDLHKDTPNSLPAEWELMTERTPNTAELHALNFTMRVCKHVKSNAIVVGNSYMTLGIGAGQMNRVGAAHIALEQAKAHPLFNEQLLVLASDAFFPMGDTMEMAKDYGVTAVIQPGGSIKDADSVAVCNAHNIAMVKTGIRHFRH >NC_020515|1202953:1209983|1205404_1206424_-|WP_015432527.1|DBSCAN-SWA MSNAYSASGVNIEQGYEAVKRMKRHIERTNRPEVLSQIGGFGGLFSLAGHKFEEPVLVSGTDGVGTKILLAQQSGILNTVGIDCVAMCVNDVLAQGAEPLFFLDYLAVGKNDPAKIEQMVAGVAEGCVQAGAALIGGETAEMPDLYKEDEFDLAGFCVGIGDKAKLFDRNQVIAGDVLIGLPSSGVHSNGFSLVRKVFFKDHNWQFDDKLADGRTLIDTLLTPTKIYVKEVLPLVQQGLIHGISHITGGGFYENVPRMFADGLCAEIKLGSWDIPLVFQTIQELGNIAEDEMYQVFNMGLGMVIAVPAENVDSVLKQVPTAKVVGKVVADETATVRLIK >NC_020515|1202953:1209983|1208885_1209983_+|WP_015432530.1|DBSCAN-SWA MQYINIANDGVKTLSPYQAGKPIEELERELGISNIIKLASNENPFGFPESAKQAIINQLNELTRYPDANGFELKATIAKKFGVKAEQITLGNGSNDLLELFAHTFADQDDEIIYSQYAFVVYPLVTKAINATAREIPAKNWGHDLEAFLAAINEKTKLIFIANPNNPTGNFLTHNELDAFLAKVPGKVLVVLDEAYTEFTAEQERVDSFALSAKYPNLIISRSLSKAYGLAGLRIGYAVSNPEIADLLNRVRQPFNCNSLALAAAQAVLNDDEFVKKVAENNRLEMARYEAFCTAQGLEFIPSKGNFITIDFKRPAQAIYEDLLREGVIVRPIAGYGMPNQLRVSIGLPEENDRFFSALLKVLNK >NC_020515|1202953:1209983|1207987_1208698_-|WP_015432529.1|DBSCAN-SWA MQKLYEGKAKALYTTDNPDELLVVYSNQATAFNGEKKAQIEGKGVLNNKISSLIFQWLNAQQIPTQFVKQVDETSQIVRKLTIVPVEAIVRNRVAGSFAKRMGLEEGKTLNHPIVEFCYKDDALGDPMVNNSQLSALGIATMDELQEIEKLSLQVNEKLVELFDAIGIILVDIKFEFGKTSDGQIILGDEISPDTSRLWDKTTQAKLDKDNFRRDLGDIIPVYQEVYERLAKHISL |
6 | Synechococcus_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1338111 : 1352806
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_020515|1338111:1352806|DBSCAN-SWA TTCATTCTGCCACAGCACGGATTGTCGATTCATCAATTTCTGCTTCAAAAGCCTGTTGTAACAGCTTTAATTTATCATCAGCAAAAAGTGACACTCTAGCAGCTTGAGTTAGCTCCTCAAAAATAGCACGCCGAATTTCTAAAGGTGTTTTATACCGGTCATCTTCGCCAATAATCAGTTCTAAATCCAACCCTTTCGCTTTGAGAGCATCAGCCAGATTTTTACGCGATTCATCATTATTCAAGCGAGACATTGCAGGCTTTAACACTAATTTAATGTAATCATCTCGTTTTTCTGCCATGTAACTATTTAAAGCAAGCTGACGAGAAAAACCACCTAAATTCAGTTCATTAGCCAATTTTGCCCATGCATCTTGTTGGCAAGAGAGTTCAATGGTTTTTTTGACTAATTCTGGTGTACGTTCTTGCAAAATCGCTTGTTTAATTTCCGAAGGATTTGCTGTTTCCTGCTGGTTTTCCAATTCAGGATTTAACCATTGCCAACGATATTCTTCATCGCTGGTTACTTGTTCTTGAGTTGGGGCTCGGGGAAGATCGATTTGACGTTTTTTATCTTGGCTAGAAAGCAGCTTAGCAAAGCTTTGTTCCACATTTTTCATTGCTTGCTTAGCAGAACTTGCTTTGACTAAATTAGGTGTGGATATTTTTACCGCTTGCTCAGACTTTTTTTTTTCACTTTCCTGTTGCTGTGCTTTAAGCTGGCGGCTCGCTTGCAATGCTTGCATAGCAGCGCTGACACCACCACTTACTGCTGTCTGCACAGTAGGCGTAGAACTCATATTGGAAGCACTATTTGCAGAATTTCGGTCATTTTGTACCGCTTGCGAAGTCACACTAGCTTGCTGAGCCACTAACTTATCACGTAACTGTGCGATTTGCTGCCCACTATTTACTGCTGCTGAAGTGACTGGCACTGTGCCAGTAGCAACAATCGGACTTTTATCCGCATTTTTAGGATGAAATGCTAAGGCTCGTAACATTGTCATTTCAATGCCACGGCGAGGTTCAGGTGCATATGGCAACTCTTTTTTGCCGCTTAGCATCACTTGATAGAAAAATTGTACATCTTCAGGTGATATGCTACGAGCAAGAAAATGCAGTGGTGTTTCCTCTCCATCTTTATTTGGCAGTAGCTGCAACATCGCAATTTGGTGTAATGTTTCAGCAACATCACTAAGTAATTGCTGCCAATCAACTCCCCTTTCCGCTACATCTTGCACAATTTGCATTGTTTTCTCACCATCCGCAAGAGCAAGTGCTTGCACTAATTCTAATGGCTGATTTTCGTCAAGCAACCCTAGCATTTGGCTAACAATCGCTAAGGTAATATTCGCATTGCTTAGCGCTATAGCCTGATCAGTTAAACTCAAACTATCTCGAATACTACCCTGTGCCGCTTTCGCTAAACGCTCTAACGCTGGAAATTCAAAAGGAATTTGCTCTTCACCTAGCACAAATTCCAGATGTTTACGAATCTGACTTTGGTCTAAAGCACGCAGATGAAACTGCATACAACGAGAAAGAATTGTGATCGGTAGTTTTTGCGGATCAGTGGTTGCTAATAAAAATTTGACGTACTCTGGCGGTTCTTCTAAGGTTTTCAGTAGAGCATTAAAACTACTACGAGATAGCATATGCACTTCATCGATCAGGTAAACCTTAAAACGCCCAACAGTTGGCTTATATTGCACATTATCTAATAGCTCTCGCGTATCTTCCACTTTAGTACGAGAGGCAGCATCAATTTCAATTAAATCAATGAAACGTCCTTCTTCAATCGCTTTGCAATTGGCACATTCACCACAAGGATCGGCACTAATCCCTTGTTCGCAATTTAGTCCTTTGGCAAATAAACGAGCAATTGAGGTTTTGCCCACACCACGGGTGCCTGAAAAAAGATAAGCATGATGTAAACGGTTTTCTCGCAAACTATTTTCAAGGGCTGATAAAACATGTTGCTGACCAACTACTTGACTAAAACGCTGTGGTCGCCATTTACGGGCAAGAACCTGATAAGACATAATGATAAATAAAATTGATAATGAAGAATTGATAGTTACTGGCGGAGAATTGAAAAATCGAAATAAACAACGTTCTACGATAAAAAAGAATTGAAAATGTTCTTAAGTGCCCACTTTCAATTCTTACCCAATCCTCAATTCACTTAGTGTCCTTCAAATGTCACTAAACTGAATGATTTCACACCCTGTTTTTCTAAACGTGCTGCACCACCTAACTCAGGCAGTTCAATTACAAAAGCAGCATCTTCTACTGTACCGCCCAAACGGCGAATTAAATTTACTGTCGCTTCTACTGTGCCGCCAGTGGCAAGTAGATCATCAATAACCAATACTTTATCACCTGCTTGAACCGCATCAATGTGCATTTCTAAGGTATCTTGACCGTATTCTAATTCATAAGACTGTGCAATCGTTTCACGGGGTAATTTTCTTGGCTTACGCACTAATTCAAAGGGTAATCCCAAGGCAAGCGCAACAGGTGCCCCAAAAATAAAGCCGCGAGATTCTGTACCCACGATTTTGGTAATGCCTTTATCTTTAAATTCAGCCACAATCGCATCAATAGATGCTTGGAAAGCCGCCGGAACTTCTAATAATGAAGTAATATCACGGAAAATAATTCCTGCTTGCGGATAATCAGGAATGGATTTAATTGAAGATTTAATCAAATCTAATTGGCTCATAAATTGACCTATTTTTATTAGTAAAAAACCGAAAGAATTTAGCAAAATTCCCAAAAATTACAAGTAAAAATCGCTGTTTTGCAAAAAATTTATGGTTTTTCACCGCTTGTTGTCTGCATTTTCACAATAAACTTTTTGGGCAGAATAAATAATGCGGGAATAACAGTGGCTGCCATTAATCCAAAACTTACCTGTGGTGAAACCAGATAAAGTTTACCCGCAATAAAGGTCAATGCGGCCATTGCCCCACAGTTTGCTACACTAAAATAGAGCGCTTGCAATTTTGCTGAATGAGCCATCGGTTGAGTAGCGATGTAACGGATCATCGCATAGTGTGAAATCCCAAATGTCGCCGTATGCAATAATTGAACTAAAACTAAGCAAGAAAACTCAACTGTACTTGCTAATAGCCCCCAACGAATCATTGCACCAGCAATTGCAACCAGCACTAATGTACGAATCCCCATATTGCCAAATAAACGTTTTGAGAAATAGAAAAATAATACTTCTGCTAAAACTGAGAGCCCCCAAAGCAAACTCGCCTGTTGGGTAGAAATGCCATTGCTCGTCCAATGCATTGTGCTGTAGGCATAATAAGCAGCGTGTGATGCCTGTACTAATGAAACCGCAATAATCATCCGCACTGTAGTACGATTAGAGAGCAATTGCCGATAACTAATTTCTTGGTGAGTCTGATTCTCCCTCACACTTTTAAATGTTTGACTTGGCTGAAAAGAGATGCCAATCCCAAGTAAAATCAGCCACAATGCTAAAATGGTGATAATCGCATCCTGCCCCGTCCATTCCAGCAAAAAGCCCGTTGAAACCGAGCTTACCACAAATGCAAGTGAACCAAATAATCGACTTTTACCATAATCCAACATATTTTGCTGTTGCCAATTTGCTGCAATCGTATCTGCAATTGGCATAGCACCTCCGTTTACAATATGGAAGAACGCGAGCAGAGGGACAAAAAACCAAATAGATGAAACACCCCATGCCATTCCTATCACAAGCACCACAGCACCCCATGTTAAAAAACGGCTCAACGGTAAAAGCTGATTAGGTTCTTTAATTTTTTTAGAAAAAAACATCGCACCAACTAAACGAAAGAGATACCCCAGCGCGATCAAGGTTCCTATGGTATCCACATTATATTGATGATGATTTAACCAAACCGGCAAAAATGGCACGAGTACACCAAATGCCGCATAAAAGCCAAAAAAGTTAAATGATGACCACTGAAAAGGTGTTAATTTCATAGCGATTTTTCCATTTCCTCTGCACGCAAAATTGCGGCATTCATTGCTTCCTCCACAAGCTTATCTAACTGATACTGTTCAAATACGGCTAGAGCTTGTGCTGTTGTTCCGCCTTTTGAGGTTACATTTTCCCGTAACTGAGATAATGGTAGCTCAGGATTTTCTTGCACCATCTTTGCGGCGCCTAATGCAACCGATTGAATCAATTCTCTCGCATCTTGCTGGCTAAACCCCATCGCCACGGCAGATTTTTGCATCGCTTCCATAAAACGAAAAAAATAGGCAGGGCTCGATCCTGTAATGGCAATGATTTGGTTAATTTGGCTCTCTTGTGTCACCCAATAACATTTCCCCACAGCATTCAGCAGGCTTTCAACAAACTGGCAAGCGCTTGAATTAACCGTTTTTTCTGCAAAAAGCCCCGTCATCCCTTCGCCAACTAACGCTGGCGTATTTGGCATTGCACGAATAATCTGGTTTGCTGTTGGTAACAATTGGGTTAAGCGTGCTACCGAAATGCCAGCTGCAATTGACAAGAGCCATTTTTCAGTAAAATCAATCTTAGCTAATTCTTCACAGACCTCCGCCATCACCTGCGGCTTTATGGCTAAAACCACACAATCTGCCCATTTGACGGCTTGACAATTCGAAAAAGCGATCTGCACCCCCAGTTGCTGTAATTCAGCCAATCTCGCCTGATTACTCTTATTACAAGCCGCGATCAAGTCTGCAGGATAGCCACTGCGAATAAGCCCTTTAATGATCGCAAACGCCATATTACCAGCCCCGATAAAGGCGATTTTTTTACTCAGCATAGTCTGTTCCTTTTGGTGAAATGGGTTTTACGGTAGAATAGTCGTTATTCTACGCCACACTTAAGGATTACCCAAATGTTCTGGTTCAAAAATATTATGATCTATCGCTTAACCAGCGCGTTAAATTTAAATCATAGCGAATTAGAAGCTCAACTTGTACCGCAAAAATTTAGCCCTTGCTCGCAACACGATTTGCAAAAATTTGGTTGGTTTGCACCGCTTGTAGGCAGTGAAATGCTGCATTTTTCGCAAGGCAACCAATTTCTACTTATTTCACATAAAGAAGAAAAAATATTGCCGGCTAATGTGATAAAAAAAACCTGTGAAGAGCGCATTGCCGTGTTAGAAGAAAAAGAACAACGTAAATTAAAGAAAACGGAAAAACAGGCAATTAAAGATGATGTGATAGCTACCCTACTGCCGCGCGCTTTTAGTAAACATCAATCAACCGCAATTTGGCTTGATTTGGACAACCAGCTTATTTATGTCGATGCGGGCTCAGCTAAACGTGCAGAAGCTACTTTAGCTCTACTGCGCAAATCACTCGGTTCATTACCTGTAGTGCCGTTGTCATTTGTATTAAGCCCAAGCGAAGTGATGACAAATTGGGTCGCCAAAGGCCATACACCTAGTTGGCTCACCCTACTTGAAGAAGCGGAATTAAAAGATTTCGAGAATGACAGTATTATCCGCTGTAAACGACAAGATTTAGAAAGCGAGGAGATCGCCAACCATTTACAAGCGGGTAAATTTATCACTAAACTGGCTATTGATTGGGAAACCCATTTTTCCTGCGTGCTCAATGAAGATGCGACCCTTACTCGCCTTAAATTTGCCGACGAAGTGCGGGAGAAAAATGATGACATTCTCAAAGAGGACAAAGCTCAACGCTTTGATGCTGATTTCTTGTTAATGACAGAAGAGTTACGCCTATTTAGCGCAAAATTAAGTAACGAATTTGGTGGAGTGAAACAGTAAATGCAAATCTACCTTGTCGGCGGGGCGGTACGAGACCAACTGCTTGGTTTACCAATTAAAGATCGTGATTGGCTGGTAGTAGGCGCAACCCCTACCGAACTGCTTGCTCTCGGCTATCAGCAAGTCGGCAATGACTTCCCTGTTTTTCTACACCCTAAAACAAAAGAAGAATATGCTCTCGCTCGCACTGAGCGAAAGAATGGCAAAGGCTATAATGGCTTCATTTGTGATTTTTCTCCTGAGATTACTTTAGAGCAGGATTTACTACGCCGTGATCTAACCATCAATGCTATTGCTCAAGATGAACATGGTACACTCTATGATCCATATCAGGGCGTCGCAGATTTAAAACAGCGTATATTACGCCACGTCTCTGCCGCATTTAGTGAAGACCCACTACGGGTTCTACGTGTCGCGCGTTTTGCTGCTCGGCTACATACCTTTGGGTTTAAGATTGCACCAGAGACACTCAAGCTTATGCGGCAAATCAGCGCATCTGGCGAATTAGAAAATCTCACACCGGAACGAGTTTGGCTAGAAACACAAAAAGCCTTTAATTGTGATAATCCGCAAGTTTATTTCCATATTTTACGCTTAGTTGGCGCATTAAAAGTATTATTCCCCGAGATCGATGCATTATTTGGTGTACCACAGCCAGAAAAACACCATCCTGAAATTGACAGTGGCTTACATACATTAATGGTTCTAGCTCAAGCAAAAAAGCTCTCTGCTAAAGCGACAGATTCTGAAAGTGTTTTATTTGCAGCCCTCTGCCACGATCTCGGTAAAGCACTTAGTCCTAAGGATAATCTGCCTCACCATTATGGGCATGAGGTGAATGGCATCACCCCCACAGTGAACCTTGCCAATCGCCTAAAAATTCCGAATCATTGCAAAGAATTTGCTCTATTGGTCACAGAATTTCACTCCCACTGCCACAAAATGACAGAGCTTCGACCTGAAACGGTGATTAAACTGTTCAATAAATTGGATGTATGGCGTAAACCCGAACGTTTTTTTGACTACCTCTTAGTTTGTGAGGCTGATAGTAAAGGGCGTTTAGGTTTTGAAGATCGTCCATACCCACAAGCTGAGCGAGCCAAATCCTATTATCATGCTGCAATGCAAGTTGATGTACAACAGGTCATTCAAGATGGTTTTGAAAAACAGGCTATCCGACAAGAATTAGATCGTCGCCGAATGTTAGCCATTAGGCAAACTAAAGCAGAGAATGAATAATTTGTATGCTTCACAACCACTAACAAGCGGTTGATTTACTAAGAAAATATGCAAAAGCCGAGTAAGCAACCTTACTCGGCTTTGTTTCATGCCAGATACTCGTTACTGCTTCGCCACAATCTCGCCATTTTCTACGTCAATCACCACCGCTTTATTCGGCAGTAGTTTTCCGCTGAGAATTTGGCTTGCCAATGGATTTTCCAGTTCTTGCTGAATCGCTCGTTTAAGCGGTCTTGCGCCAAAGAGTGGAGCAAAGCCTGCTTCACCAATATGATCCAGGGCTGCATCGCTCACACTCACTTCGTAGCCTCGCTCCGCTAAGCGTTTAATTAAACGCGCCAATTGAATACGCGCGATTGAACGGATATGCTCTTTGCCTAAACTGTGGAACATCACGGTTTCATCAATACGGTTGATAAATTCTGGGCGGAAATGTTGTCCAACCACTTCCATCACCATCTCTTTCACCGCATCATAGCCTTTATGGGCGTTTTCTTGAATAAGATGTGAGCCTAAGTTAGAGGTCATAATCACCACCGTGTTGCGGAAATCCACCGTTCTACCCTGCCCATCGGTTAAACGCCCATCATCAAGCACTTGTAACAGAATGTTGAACACATCAGGGTGAGCTTTTTCCACTTCATCAAGCAGAATCACCGAATATGGACGGCGGCGAACCGCTTCGGTGAGATAGCCCCCTTCTTCATAGCCCACATAACCCGGAGGCGCACCTACCAAGCGAGAAACGCTGTGTTTTTCCATAAATTCCGACATATCAATGCGAACCATCGCATCTTCACTATCAAACAAGAAATTGGCGAGCGTTTTAGAAAGCTCGGTTTTCCCCACACCAGTCGGCCCTAAAAAGAGGAATGAACCAATCGGGCGATTTGGATCTGAAAGCCCTGCTCGGCTACGGCGAATGGCATTTGCCACCGCATCTACCGCTTCATTTTGCCCAATGACACGTGCGTGTAACACCGTTTCCATTTGTAGCAATTTCTCTTTTTCGCCTTCCATCATTTTGGCAACAGGAATACCCGTTGCACGAGAAAGCACTTCGGCAATTTCTTCATCGGTTACTTTGGTGCGTAGCAGTTGGTTTTCTGCATTTTTTTCGCCATTTTCTACCGCTTGCAGCTGTTTTTCCAAGTTTGGAATCACACCGTATTGCAATTCCGACATTTTTGCCAAATCGCCTTCACGGCGTGCTTGCTCCATTTTAATACGCGCATTTTCCAGCTCTTCTTTGATATGCTGCGTGCCGTGCACTGCGGATTTCTCTGCTTTCCAAACCTCTTCTAATTCCGCGTATTCACGCTCTTTTTCGGCAAGCTCGGTATTAAGTTTCTCTAAACGCTGACGGCTTGCCTCATCATCTTCTTTTTGCAGTGCTTGTTGCTCTAATTTAAGCTGAATAATACGGCGATCTAATCTGTCTAGCGGTTCTGGTTTAGAATCAATTTCCATACGTAAACTACTTGCCGCTTCATCAATTAAATCAATCGCTTTATCCGGCAACTGGCGGTCTGAAATATAGCGGTGTGAAAGGGTTGCCGCTGCTACGATAGCAGGGTCGGTAATTTGTACGTGGTGATGGATTTCATAACGCTCTTTCAAACCACGCAAAATCGCAATAGTATCTTCCACACTCGGCTCATCCACCAACACTTTTTGGAAACGGCGTTCTAATGCCGCATCTTTTTCAATATATTGGCGATATTCATCTAACGTAGTTGCCCCCACACAGTGTAATTCGCCACGCGCAAGGCTTGGTTTGAGCAAGTTACCTGCGTCCATCGCTCCATCGGTTTTACCTGCCCCCACCATTGTGTGAATTTCATCAATAAACAGAATGATTGAGCCTTCTTCTTTGCTGATTTCATTCAATACCGCTTTCAAACGCTCTTCAAACTCGCCACGATATTTCGCCCCAGCAATCAATGCCCCCATATCCAAAGAAAGCACTTTTTTATGTTTCAACCCTTCCGGCACTTCACCATTCACAATGCGTTGTGCCAAACCTTCCACAATCGCCGTTTTCCCCACACCCGGTTCACCAATTAACACGGGATTGTTTTTGGTACGGCGTTGTAACACTTGAATGGTACGGCGAATCTCTTCATCACGACCAATAACAGGATCTAATTTGCCATTTCTTGCTCGTTCGGTTAAATCAATCGTATATTTACTTAACGCTTGACGAGTCTCTTCTGCATTTTGGCTATTCACTTTTTCTCCTCCCCGAATTTGGTTAATCGCTTGTTCAACGTTGGCTTTACTTAGCCCTAAATTTTTCAATAATTTGCCTAAATCGCCACTATCGTCCAATGCCGCAAGTACAAACAATTCAGAGGAAATAAAACTATCGCCCATCTGTTGTGCCAGTTTATCGCATTGATTTAATAGACGAATTAAGGCTTGAGAGGGCTGCGTGGTTGCCCCTTGTACTTTCGGTAAACGGTTGATAAGGTTTTCCAACTCATTTTGTAATTGCGCAGGCGATACACCAAGTGTGGTAACAAGCGGTGAAATTGAACCATCATTTTGCAAAAGTAAACTTTGCATCAAATGCACTGGCTCGATGTAATTGTGATCACGCCCTACTGCCAAAGATTGGGCGCTAGCAATGGCTTCTTGAAATTTGGAAGTGAATTTATCAATGTTCATCATTTACTCCTAAAATCGTTATTCAGCCTTATGCTGAATTTCAACGAGAACATAAATGGAGTTTAGAGAGGAAATTTCAAGCATTTTTTGTAAAAATTTCTTTACAGCCTAAAAACCTCGACCCATTGAGGGTTCAGATCGCTAATATTTCCAATGCAAATCTAACAAGTTGAGCTAATAGATTATCAGCAAAAAATTAAGCTTAAAAATAAACAAAAAGAAGTGAAAATGAATCGATAATTGGCAACTTTATGCCAATTATCCTTTGAATAGATGATTTATAGAAAGCCCTTAGAGGCAAGTAGAAAATAAGAATCTCTAAAATACGAAAACTAGCATCTCTGCTTCAGAAAACAGTTTCACACGCCGAGCTCTAAACTCGACATAAGTTTAGAGAACGTATCCCTATGAGTCATGATCAATCCGACTCGAAACCGCATTCTGCTGATTTTTATATTTTGCGTCTTTACGAGCATTATATGGACGTTCAGCATAGCCACTTAAAATTTCAAAACTTAATGCACCAATCGCCATATTAGGCCGTAATGCCAAAGGTAACTTACCTGCATTAAAAAATTCCAGTACAATTTTGCCTTCCCAACCGGGATCAATGCGGTGTGCCGTAACATGCACCATTAAGCCAAGACGGGCTAAAGAAGAACGCCCATCTAACCAGCCTACAATGTTGGCAGGTAATTTAACACTTTCTAAAGTAGTAGCTAAAGCAAGTTCCCCTGGGTGTAAGAAAAACGCGTCACCTTCTGCGATAATAATTTCTTCACTCATCACGCGCTCAAGCTGTGCAGACATTTCTTCTCTAGGACCACTTAAATCAATATATGGCGTTGAGTGCTCGCGAAACACTCTAAAAGAGTTGCCTAAACGCACATCAATAGTTGCGCCATTAATTTTTTCATTTGAAGGACGAGGCGTTAAGCTGATAATGCCTTTATCTAAGTAACGCTCAATATCGCTATCACATAAACGCATAACTATTCCTATTTTTTACCAAGTAACTCTTTAATTTGTGCTTTAAGAATATTAATGGCTACACGATTTTTACCACCTCGTGGCACCACAATATCCGCGTACTGTTTCGAAGGCTGCACAAACTGTAAAAACATTGGGCGAACGGTAGTTCGATACTGATTAAGCACTGATTCTAATGTACGTCCACGCTCTTCCATATCCCGTTGTAAACGACGAATAAAACAGATATCTAATGGCGCATCAACAAAAATAGACATATCAATTTCATTACGAATATGTTCATCTGTAAGCAACAAAATCCCTTCCAGAATAATCACTTTTTTCGGAGAAAAGTAAGTGACTGTTGCTTTGCGATTATGCTCAGCGTAATCATATTCTGGAATCTCAACCGACTGCCCCTGCTTTAACTGCTTTAAATGTTGGACTAATAATGCGTGATCCATAGAATTAGGGTGATCGTAATTGGTTTTTATCCGTTCTTCCATCACCATGTGGGATTGATCTTTATAATAAGCATCCTCAGAAATAATCCCTATATCTTCCTGACCTAATTCTGCTTTTAATTCCTTATAGATTGTCGATGCAATTAAACTCTTACCGGATGCGGATGCACCTGCAATCGCAATAACAATGCAGGATTTTTCTGTTGGATTAGACATATTTACCCCAAGCTGACTAGAATGCTATATTATAACGTAAAATACTCAAATTTTTCATCCTTTTAACGTTCTCGGAGCAATTTATGCATTCATTCTTACACGGTTTTGTGGTCTGCTTTGGGCTGATTGTTTCAATTGGCGCACAAAATGCTTTTTTACTCAAACAAGGTATCCTCAAGCAGCACGTATTTTGGGTTGCATTTATCTGTTTTTTCTGCGATGTAGTGCTGATGACATTGGGGGTTTTAGGGCTTGGTTCTATCATTGCACAGTCACCATTATCTAGCTTAATACTCGCTCTAGTTGGCGCTATTTTTCTTTTTACTTATGGCAGTCGTTCATTTATTGCCGCCTATCGAGGGATTGGCGCACTCCAAGCTGAACAAGGCAGAAAAGTTAGTTTGAAACGAGCAATAGCTATTACTCTCGCAATTACACTACTTAATCCACATGTATATATTGATACTGTTGTTATTGTCGGTGGAATTGGAGGAACTTTAACTTTAGAGCAAAAAATCCAATTTTTAAGTGGTGCTCTGATCTGCTCTTTTCTCTGGTTTTTTGGTGTAGGTTATGGCGCAGGACTATTATCTGGCTATTTTGCACAACGACGCACATGGCAAATTTTAGATGCTATTACAGGCTTAATTATGTATGCCATTGCATTGAGTTTATTACTCTATGCCATAGATCTTGGGTATAAATTATCATAATTAAGCTATCGGAATACGCTAAATAAACAGAAATTTTTCCGGATGTTCAATTCTAATTAAATGTCATTACTAAAAAAGAGTTCCCTACTTACCGTGAGCAGGGAACTCTTTTAAAAAGGTTTACTAAAGTTAGCGATGTGGTTAGATTGTACGCTTAACTTCAACCACTTCAAAGACCTCAATTTGGTCGCCGACTTTTACATCGTTATAGTTTTTCACACCAATACCACATTCCATACCATTACGTACCTCTGATACGTCATCTTTGAAACGGCGCAGCGATTCTAATTCCCCTTCAAAGATAACCACATTATCACGTAATACACGGATTGGATTATTACGTTTAACCACCCCTTCGGTTACCATACAACCTGCAATCGCACCAAATTTCGGATGGCGGAATACATCACGTACTTCTGCCAAGCCAATAATTTCTTGTTTAAACTCAGGTTGTAGCATACCGCTCATTGCCGCTTTAATTTCATTGAGCAATTCGTAGATGATTGAATAGTAGCGTAAATCAATGTTTTCTGTTTCAATCACACGGCGAGCGGTTGCATCGGCACGCACGTTAAAGCCAAGAATAATTGCACTTGATGCAGCTGCAAGGGTTGCATCGGTTTCGGTGATACCACCTACACCAGAACCCACTACTTTCACTTTCACTTCATCGGTTGAAAGTTCGTGTAATGATTGAACAATCGCTTCCACAGAGCCTTGTACGTCTGCTTTCACAATAACATTCAATTCCGCTACATCGCCTTCAGCCATATTGCTAAACATATTTTCCAGTTTCGCTTTTTGCTGACGAGCTAATTTCACTTCACGGAATTTACCTTGACGATACAATGCAACTTCACGAGCTTTCTTCTCATCACGTACCACAGTTGCTTCATCACCTGCCGCTGGCACACCTGACAGACCTAATACTTCCACAGGGATAGAAGGGCCTGCTGATTCAATGTCTTTACCATTTTCATCACGCATCGCACGCACACGACCATATTCAAAGCCACAAAGTACGATGTCGCCTTTGTTTAGCGTACCTGATTGAACCAAGATAGTTGCTACAGGACCACGACCTTTATCTAAGTAAGATTCAATCACGACACCACTTGCCATACCATCTTTCACTGCGGTTAATTCAAGCACTTCAGATTGCAGGATAATCGCTTCTAATAAGTCATCAATCCCCATCCCCTTTTTCGCAGAAACAGGAACAAATTGCACATCACCACCAAATTTCTCCGAAATGACATCGTGTTGTAATAACTCTTGCTCTACACGCTCTGGATTAGCTTCTGGCTTATCAATTTTGTTTACTGCAACTACTAATGGTGCCCCTGCCGCTTTGGCGTGCTGAATCGCTTCAATAGTTTGTGGCATCACACCATCGTCCGCTGCAACCACAAGTACTACAATATCCGTTGCTTTCGCACCACGCGCACGCATAGAAGTAAAGGCTGCGTGTCCCGGTGTATCTAAGAAGGTAATCATCTTACCATCATCAGTTTCCACATGGTATGCACCGATGTGCTGAGTAATACCCCCTGCTTCACCAGCCGCCACTTTCGCTTTACGGATATAGTCAAGCAATGAAGTTTTACCATGGTCAACGTGCCCCATAATCGTCACCACTGGCGCACGAGTCACTTTTTCTGCCGCGGTATCACGATCACCTAATACCGCTTCTTCCAACTCATTTTCTTTGCGTAAGATAACAGTGTACCCCATCTCTTCTGCAACAAGCTGTGCGGTTTCTTGGTCAATCACTTGGTTGATGGTGACCATTTCTCCCATTTTCATCATTGTTTTGATCACTTCGGTGGCTTTCACAGCCATTTTTTCCGCAAGATCTTTTACGCTGATGGTTTCACCAATGACTACATCAGATTTTGCAACTTGTACCGGTTTTGTGAACGCTTGCTGTAATGCAGAACCTTTTTTGCCCTGCTTACCTTTACCTTTGGCATCTTTACTATTACGACGGTCATTACGCTCATTACGCTCTTCATCTCGTCCGCCTTTTTTGCCTTTCGCATTCGCCTTATTACCGCCTCGGTTACGGTTATTTTCACTACGGCGATCTGAATCACGATCCGCTTCACGTGCATAACTAGAACTAAAACGCTCATCTTCAAAATCATCATTATCTTCGATAATGGTTTCTTCACGCTCTTGTTCTGCTAAAAGGCGAGCATTTTCAGCCGCTCGTTTTGCCTCCATCTCGGCTTTTTCACGAGCTAATTCTTCTTGCTTACGGCGTAATTCAGCTTCTTCTTTTCTTTTTGCTTCTTTCTCTGCATCAACTGCTTTTTCGGCTTTCATAGCGACTGCTTTCGCTTTTTCTTCTGCTGCTTTCTTAGCTTGAGCCTCTGCTGCAGCTTTTTCTTGAGCGGCTTTTTCTTGAGCGGCTTTTTCTTGAGCGGCTTTTTCTGCTTGCGCTTTTGCCTCAGCTTCCGCTTTCAATTTTGCTTCAAGCGCTGCTTTTTTGGCTGCTTCAGTATCCACTTTACGTGATTTACGTACTTCTACCTGTACTTCTTTTGACTTACCGCCCGCGGTCGTGCCGCTTACTGTCGTTTTAGTACGACGCTGTAAACTCAGTTTCTTTGGTGCTTCTGTTTTAATCTCTTCACTCATAAT
Protein sequences of DBSCAN-SWA_2 >NC_020515|1338111:1352806|1338111_1340151_-|WP_015432625.1|DBSCAN-SWA MSYQVLARKWRPQRFSQVVGQQHVLSALENSLRENRLHHAYLFSGTRGVGKTSIARLFAKGLNCEQGISADPCGECANCKAIEEGRFIDLIEIDAASRTKVEDTRELLDNVQYKPTVGRFKVYLIDEVHMLSRSSFNALLKTLEEPPEYVKFLLATTDPQKLPITILSRCMQFHLRALDQSQIRKHLEFVLGEEQIPFEFPALERLAKAAQGSIRDSLSLTDQAIALSNANITLAIVSQMLGLLDENQPLELVQALALADGEKTMQIVQDVAERGVDWQQLLSDVAETLHQIAMLQLLPNKDGEETPLHFLARSISPEDVQFFYQVMLSGKKELPYAPEPRRGIEMTMLRALAFHPKNADKSPIVATGTVPVTSAAVNSGQQIAQLRDKLVAQQASVTSQAVQNDRNSANSASNMSSTPTVQTAVSGGVSAAMQALQASRQLKAQQQESEKKKSEQAVKISTPNLVKASSAKQAMKNVEQSFAKLLSSQDKKRQIDLPRAPTQEQVTSDEEYRWQWLNPELENQQETANPSEIKQAILQERTPELVKKTIELSCQQDAWAKLANELNLGGFSRQLALNSYMAEKRDDYIKLVLKPAMSRLNNDESRKNLADALKAKGLDLELIIGEDDRYKTPLEIRRAIFEELTQAARVSLFADDKLKLLQQAFEAEIDESTIRAVAE >NC_020515|1338111:1352806|1348796_1349447_-|WP_015432634.1|DBSCAN-SWA MSNPTEKSCIVIAIAGASASGKSLIASTIYKELKAELGQEDIGIISEDAYYKDQSHMVMEERIKTNYDHPNSMDHALLVQHLKQLKQGQSVEIPEYDYAEHNRKATVTYFSPKKVIILEGILLLTDEHIRNEIDMSIFVDAPLDICFIRRLQRDMEERGRTLESVLNQYRTTVRPMFLQFVQPSKQYADIVVPRGGKNRVAINILKAQIKELLGKK >NC_020515|1338111:1352806|1342986_1343889_+|WP_015432629.1|DBSCAN-SWA MFWFKNIMIYRLTSALNLNHSELEAQLVPQKFSPCSQHDLQKFGWFAPLVGSEMLHFSQGNQFLLISHKEEKILPANVIKKTCEERIAVLEEKEQRKLKKTEKQAIKDDVIATLLPRAFSKHQSTAIWLDLDNQLIYVDAGSAKRAEATLALLRKSLGSLPVVPLSFVLSPSEVMTNWVAKGHTPSWLTLLEEAELKDFENDSIIRCKRQDLESEEIANHLQAGKFITKLAIDWETHFSCVLNEDATLTRLKFADEVREKNDDILKEDKAQRFDADFLLMTEELRLFSAKLSNEFGGVKQ >NC_020515|1338111:1352806|1348203_1348788_-|WP_015432633.1|DBSCAN-SWA MRLCDSDIERYLDKGIISLTPRPSNEKINGATIDVRLGNSFRVFREHSTPYIDLSGPREEMSAQLERVMSEEIIIAEGDAFFLHPGELALATTLESVKLPANIVGWLDGRSSLARLGLMVHVTAHRIDPGWEGKIVLEFFNAGKLPLALRPNMAIGALSFEILSGYAERPYNARKDAKYKNQQNAVSSRIDHDS >NC_020515|1338111:1352806|1349530_1350160_+|WP_015432635.1|DBSCAN-SWA MHSFLHGFVVCFGLIVSIGAQNAFLLKQGILKQHVFWVAFICFFCDVVLMTLGVLGLGSIIAQSPLSSLILALVGAIFLFTYGSRSFIAAYRGIGALQAEQGRKVSLKRAIAITLAITLLNPHVYIDTVVIVGGIGGTLTLEQKIQFLSGALICSFLWFFGVGYGAGLLSGYFAQRRTWQILDAITGLIMYAIALSLLLYAIDLGYKLS >NC_020515|1338111:1352806|1340294_1340834_-|WP_025267013.1|DBSCAN-SWA MSQLDLIKSSIKSIPDYPQAGIIFRDITSLLEVPAAFQASIDAIVAEFKDKGITKIVGTESRGFIFGAPVALALGLPFELVRKPRKLPRETIAQSYELEYGQDTLEMHIDAVQAGDKVLVIDDLLATGGTVEATVNLIRRLGGTVEDAAFVIELPELGGAARLEKQGVKSFSLVTFEGH >NC_020515|1338111:1352806|1340923_1342096_-|WP_015432627.1|DBSCAN-SWA MKLTPFQWSSFNFFGFYAAFGVLVPFLPVWLNHHQYNVDTIGTLIALGYLFRLVGAMFFSKKIKEPNQLLPLSRFLTWGAVVLVIGMAWGVSSIWFFVPLLAFFHIVNGGAMPIADTIAANWQQQNMLDYGKSRLFGSLAFVVSSVSTGFLLEWTGQDAIITILALWLILLGIGISFQPSQTFKSVRENQTHQEISYRQLLSNRTTVRMIIAVSLVQASHAAYYAYSTMHWTSNGISTQQASLLWGLSVLAEVLFFYFSKRLFGNMGIRTLVLVAIAGAMIRWGLLASTVEFSCLVLVQLLHTATFGISHYAMIRYIATQPMAHSAKLQALYFSVANCGAMAALTFIAGKLYLVSPQVSFGLMAATVIPALFILPKKFIVKMQTTSGEKP >NC_020515|1338111:1352806|1350301_1352806_-|WP_155800264.1|DBSCAN-SWA MMSEEIKTEAPKKLSLQRRTKTTVSGTTAGGKSKEVQVEVRKSRKVDTEAAKKAALEAKLKAEAEAKAQAEKAAQEKAAQEKAAQEKAAAEAQAKKAAEEKAKAVAMKAEKAVDAEKEAKRKEEAELRRKQEELAREKAEMEAKRAAENARLLAEQEREETIIEDNDDFEDERFSSSYAREADRDSDRRSENNRNRGGNKANAKGKKGGRDEERNERNDRRNSKDAKGKGKQGKKGSALQQAFTKPVQVAKSDVVIGETISVKDLAEKMAVKATEVIKTMMKMGEMVTINQVIDQETAQLVAEEMGYTVILRKENELEEAVLGDRDTAAEKVTRAPVVTIMGHVDHGKTSLLDYIRKAKVAAGEAGGITQHIGAYHVETDDGKMITFLDTPGHAAFTSMRARGAKATDIVVLVVAADDGVMPQTIEAIQHAKAAGAPLVVAVNKIDKPEANPERVEQELLQHDVISEKFGGDVQFVPVSAKKGMGIDDLLEAIILQSEVLELTAVKDGMASGVVIESYLDKGRGPVATILVQSGTLNKGDIVLCGFEYGRVRAMRDENGKDIESAGPSIPVEVLGLSGVPAAGDEATVVRDEKKAREVALYRQGKFREVKLARQQKAKLENMFSNMAEGDVAELNVIVKADVQGSVEAIVQSLHELSTDEVKVKVVGSGVGGITETDATLAAASSAIILGFNVRADATARRVIETENIDLRYYSIIYELLNEIKAAMSGMLQPEFKQEIIGLAEVRDVFRHPKFGAIAGCMVTEGVVKRNNPIRVLRDNVVIFEGELESLRRFKDDVSEVRNGMECGIGVKNYNDVKVGDQIEVFEVVEVKRTI >NC_020515|1338111:1352806|1342092_1342911_-|WP_015432628.1|DBSCAN-SWA MLSKKIAFIGAGNMAFAIIKGLIRSGYPADLIAACNKSNQARLAELQQLGVQIAFSNCQAVKWADCVVLAIKPQVMAEVCEELAKIDFTEKWLLSIAAGISVARLTQLLPTANQIIRAMPNTPALVGEGMTGLFAEKTVNSSACQFVESLLNAVGKCYWVTQESQINQIIAITGSSPAYFFRFMEAMQKSAVAMGFSQQDARELIQSVALGAAKMVQENPELPLSQLRENVTSKGGTTAQALAVFEQYQLDKLVEEAMNAAILRAEEMEKSL >NC_020515|1338111:1352806|1345230_1347798_-|WP_025328958.1|DBSCAN-SWA MNIDKFTSKFQEAIASAQSLAVGRDHNYIEPVHLMQSLLLQNDGSISPLVTTLGVSPAQLQNELENLINRLPKVQGATTQPSQALIRLLNQCDKLAQQMGDSFISSELFVLAALDDSGDLGKLLKNLGLSKANVEQAINQIRGGEKVNSQNAEETRQALSKYTIDLTERARNGKLDPVIGRDEEIRRTIQVLQRRTKNNPVLIGEPGVGKTAIVEGLAQRIVNGEVPEGLKHKKVLSLDMGALIAGAKYRGEFEERLKAVLNEISKEEGSIILFIDEIHTMVGAGKTDGAMDAGNLLKPSLARGELHCVGATTLDEYRQYIEKDAALERRFQKVLVDEPSVEDTIAILRGLKERYEIHHHVQITDPAIVAAATLSHRYISDRQLPDKAIDLIDEAASSLRMEIDSKPEPLDRLDRRIIQLKLEQQALQKEDDEASRQRLEKLNTELAEKEREYAELEEVWKAEKSAVHGTQHIKEELENARIKMEQARREGDLAKMSELQYGVIPNLEKQLQAVENGEKNAENQLLRTKVTDEEIAEVLSRATGIPVAKMMEGEKEKLLQMETVLHARVIGQNEAVDAVANAIRRSRAGLSDPNRPIGSFLFLGPTGVGKTELSKTLANFLFDSEDAMVRIDMSEFMEKHSVSRLVGAPPGYVGYEEGGYLTEAVRRRPYSVILLDEVEKAHPDVFNILLQVLDDGRLTDGQGRTVDFRNTVVIMTSNLGSHLIQENAHKGYDAVKEMVMEVVGQHFRPEFINRIDETVMFHSLGKEHIRSIARIQLARLIKRLAERGYEVSVSDAALDHIGEAGFAPLFGARPLKRAIQQELENPLASQILSGKLLPNKAVVIDVENGEIVAKQ >NC_020515|1338111:1352806|1343889_1345128_+|WP_015432630.1|DBSCAN-SWA MQIYLVGGAVRDQLLGLPIKDRDWLVVGATPTELLALGYQQVGNDFPVFLHPKTKEEYALARTERKNGKGYNGFICDFSPEITLEQDLLRRDLTINAIAQDEHGTLYDPYQGVADLKQRILRHVSAAFSEDPLRVLRVARFAARLHTFGFKIAPETLKLMRQISASGELENLTPERVWLETQKAFNCDNPQVYFHILRLVGALKVLFPEIDALFGVPQPEKHHPEIDSGLHTLMVLAQAKKLSAKATDSESVLFAALCHDLGKALSPKDNLPHHYGHEVNGITPTVNLANRLKIPNHCKEFALLVTEFHSHCHKMTELRPETVIKLFNKLDVWRKPERFFDYLLVCEADSKGRLGFEDRPYPQAERAKSYYHAAMQVDVQQVIQDGFEKQAIRQELDRRRMLAIRQTKAENE |
11 | Bacteriophage(11.11%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1852002 : 1903953
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_020515|1852002:1903953|DBSCAN-SWA GTTATTTACCGATACAAAATGAGCTAAAAATATTGCCAAGAAGATCATCAGAGGTGAATTGTCCAGTGATTTCACTCAAGGCATTCTGCACCATTCTTAACTCCTCTGCAAGCAACTCACCAGCCATAAATTGAGTCAGCTGAGTATGCCCCCGCTCCAAATGCTCCGCGGCAGTTTCTAATGCAACAAGGTGACGACGACGCGCGATGAAGCCACCTTCTGTTGAACTTTGATAGCCCATAGATTTTTTTAAATGCTCTCGCAATAATTCCACTCCCACTTTAGTTTGCGCGGATAAGCGGATAAGCGTGAAATTATCTTGTTGTTCAACACCTTCTTGTTCGCCCGATAAATCTACTTTATTACGAATTACCGTAACAGGCATTTTTACTGGCAATTTCGCTAAGAAATCAGCCCACTCTTGTTGAAATTGGTCAGCTGCCGACAGGGTACTATCGATCATCAATAAAACGTGATCTGCTTGTCCAATTTCATCCCATGCGCGCTGAATACCAATTTTCTCAACTTCATCACTTGCTTCACGTAAACCCGCTGTATCAATAATATGCAACGGCATTCCATCAATATGAATATGCTCACGCAAAACATCACGGGTAGTCCCAGCAATATCTGTCACAATAGCAGCTTCCCGTCCAGCTAAAGCATTTAAGAGGCTTGATTTACCCGCATTCGGTTTACCCGCAATCACGACTTTCATACCTTCACGTAAAATCGAGCCCTGCTTCGCTTCTTTACGAACGCCATCAAGCTGAGCGATAATTTCATTGAGTTTTGCTTCAATTTTGCCATCGGCAAGAAAATCAATCTCTTCATCAGGGAAGTCAATAGCAGCTTCTACGTAAGTTCGTAGGTAGATCACACTATCTACCAATTCATTGACTTTATTGGAAAATTCACCCTGCAACGATTTTAATGCTGAGCGGGCAGCCTGCTCAGAGGTAGCATCAATCAAATCAGCAATGGCTTCTGCTTGAGCAAGATCAAGCTTATCATTTAAGAATGCTTGCTCAGAAAACTCACCTGCTCTGGCAATTCGAATGCCCTCAATTTTTAAAATACGCTGTAACAAGAGATCCAAAATAATCTGCCCACCATGCCCTTGTAGCTCTAGCACATCTTCACCAGTAAAAGAATTTGGTGCTTTAAAAAAGAGAGCAATACCTTGATCTAACACAGTGCCATCTTGATCTTTAAAGGGCAGATAATTTGCCATTCGCGGTTTAAGTGTCTTACCTAGAACTTCCTGCGCAACCTTCTCCGCTAACGGCCCTGAAATACGTAAAATCCCAACACCTCCACGCCCGATCGGCGTTGCTTGGGCAACAATTGTTTCTTTCATTTTCTTCTCTTAATTTGCAGATTTTGGCTCAATATGAACCGCTTGTAATCCAGCAAGTTGCTCATGCATGGCAGCTTGAATTTGTTCAAGCATTGCATAACGTTTACGATACATTGAGCGCTTATTGCTTGGAATATCACTTAAATCTTTATTATTTAATGCAAGTGGTAAAAGCCAGTACTCCGCCAACGAAGCACCGCTTTCACTGAAAATAGCATCGTAATCAACGACATAACGTTTACTTTGAATTAAACGAGATTTATTCTGATATTTCTGTGGAATACCAAGCAATGTTGAATAACCTAATACTCTAGTTAATGCTTTCATACATTCCACCATTAAATACGCAGGGCGTAATCCATGGCATTTTTTCGTCAGTTTCTTCACCAACTCTTTTGAACCATCAAAATTTGGCCCCTGAATAACGGCAATCAACAAACTGTCTGCTAACTTACCAAAAGTGAGCACATAAATAAGTTGGTTCGTCTCGCTATGACGTAATTCTAGTGCCCAGAACCCTTCCATAGGCTGCTGCTCATTAATGTTCAAATAAAGTGAAAATTCAGGTACAACTTCACCAAAGCTTATCGACTGCTGCCATAAACTTTTAGGGAGAAAGGTTAGATTATCTACCATCGCTTGATAACGTTGCTTAGCATTAAAACGTTTATCCAAAAAACGATGTACAAGTGGGTAGCTATAATTTGCTCTCGTATTCAGGAAAGCCGCAAGATCTGGATTTTGATTCACATATTGCTCGAATGCCTGCACCGAACGTAAATTCATTAATGAACGTAAGCGAAAACGTAACCGTTTGAAACGATAAGATTTATTTCCTCTATCCGGATAAATAGCATTCGCACAAGGCCATTGATAAAGATTATTTTGCATTTGGGCGAACTAGATGATTTAAACAAAGTCGTTAGTTTAACAAAAATCTCTGCTTTTGGCTAAAACGCATATACAGAAGCTCGCTGTCAGCTTATAAACATCTACAAATACATTCGCATTAAGCCTGCTACAAGCCATTGCCTATACTAAAAAAATGCCGACTATATGTCGGCATTTTCTCTATAGGATTTATTCTCCAAACTTACTCACACCTTCCATATGTGGCAAGCTGTGGGCAATACCTTTGTGGCAATCAATACAGGTTTTACCTTCTTGTTCAGCCATAGCATGCATTTTCGCCGCAACACCTTTTTGCTGAGTAAAATCCATATCCGCGAAGTTATGACAGTTACGACACTCTTGTGAGTTATTCGCTTTCATTCTTGCCCATTCACGTTGTGCCATTTCTAAGCGATGCGCTTCAAATTTTTCTTTGGTATCTACTTTACCGGTGAAATGCGCCCAAACCTCTAACTGTGCACGGAATTTACGCGTCCATTTTGGGATAAATTCATGTGGTAAATGACAATCAGCACAATCTGCTTTCACGCCACTACGGTTCATAAAGTGTGGTGAAGCACGGTATTCGGGCACCACATCATTCATATGGCAGCCTGAGCAAAATTCTTCGGTATTAGTATATTCCAATGCCGTATTCGTCCCGGCCCAGAAGAATATGCCGCCTAATGCCGAGAGAATAATCACCAATCCTACCGCCATTTTGCTTGGGCTTTTCGCCCATTTCCAGAGTCTACAAAATAATCCTTTCATTATTTACCCCCGAAGGCTTTGGTGGACTGAAATTTATTCTCCACAATTGGTTTAACGTCTGCTTGTTCTACATGACACTGTAAACAGAAATAACGGCGTGGTGAAGAATCACCAGTCACATTACCATCACGATCTTGAAAGTGCGTTGGGCTAATGCGCGGAGCACCTGTTGTTCGATAATTTTCAACACCATGACAATTTAAACACTGGTTAGTGTTTTTAGTTACTTGATAACCTTTCACGCTATGTGGGATCATTGGTGGTTGGTTTACAAAAGTAAGCTGATGCTTACCACTATCTTTAGGCATATTATGCCACTCAGGGGAAATTGCCTCTGCAGCCCCTTCTAATGAAGATGGAATATTATTTGCCATCGCAATTGGCGATAAAGCAACAAGTAAAAAGGCAAGATATTTTTTCATAATTCACTCCGTTTCTAGGGAAATATGGCTGCTAAAAACTCAGTCATCTATTTCAACACACCTCAGGCATAAGTCATTGAGCTTGTCGAAATAAGCGTTCCCTACATTATTAAAATTTAAATAATTCTCTTCTCTTTCGCAAAGCGGTTGGAAAAAGCAAAAACATTTTCAGCACACACATCAATGCAGCGTCCACAGCTAATACAATCTTTGGCAAGCACAATCGTGCTATCTTCGGCTTTACCATGCAATGGTGAACGTAGCACCTGTGGTTCTGGGCAAACATTGTAACAATCCATACAGCGGTCGCATTTTTCACGATCAACGACTTTCACTCGTACAATGCTTTTTGCCCCAATTACGCCATAAATAGCACCAATTGGACAAAGATGCCCACACCAACCATGTTCAGCGACTAGTAAGTCAAACAAGAATACGGCTAACACTAACCAAAGTGTTGCACCTAAGCCATAGACAAAAACTCGTCCCAATGCAGCAACTGGATTGATCCACTCCCAAATTAAGTTACCTGTGACTGCTGAGCCAATAAGCACAACTGCCAAAATCACATAACGTAAGTTACGGGAGATTTTTAACGTTTGGCGAATGCCTAACTTACGACGTAGCCATGCAGCAGCGTCAGTCACTATATTCAATGGACAAATCCAACTACAGAACGCTTTACTTGCCAGCAAGGCGTAAGCCACAATCACAATCACGGCCCCTAGCAACGTTTTCCACTCAGGCAAATAACCTGTCATTAAACTCTCTGCGGTAATCAGAGGATCACTCATTGGAATTAAGTCAAACAGCATGCTGCCACTATAATTACCTTTGAGAATCCAGACATTCCACATCGGACCGCTTAAAAACATCAAGATGACGGAAAGCTGGCTTAAACGGCGGAGAATTAGCCATTTATTCGCTCGCCATAGCCCAAGTTTTTCTCGTGCTTCTTTGCCTGCCTCTTTTGGGGAATTTGCTACTGCCATTATTTCCCTCCTAGCTTAGACGGCACATTGTCTAAATCAATTGGATTAAGAATCGGTGCTTCTACGGTAGTACGGTTTGGCACATAGTCCATCGTTGCACGGCTTGGCGTAGAAACCTTCACACCCGGTTTCACTTCCATAAATTGATAGACTGGTTGATTTTGCCCTTCTGGCATACGCGCCTCAAAGGCAGGACGTAAACCGTCTGGATGGTAGTCTTCCAATAATGCTTTACCTGCATTCTGTTTTTCTTTCCAACCTAAACGATAATGTCTGCCGAGCAAGCCTTTGGCAAGATCCATTGGCAGCACTTTGATCGCAGCTTCTTCCAGTACACAAGCCTGCTCACATTTACCACAACCGGTACAAGCATCGCTATGCACAGTTGGGATCAATTTCGCATGAATCTGCGTGCGATCATTATGTACACGCTCAAGAGTGATCGCTTTATCAATCAATGGACAAACACGGTAGCATACATCACAACGCAAACCTTGCCAGTTCAAGCAAGTTTCGTGATCTAGCAATACCGCTAGCCCCATTCTTGCTTGATCGATCTCTTCTAATCCGCCTAATGCTCCACTTGGGCAAGCATTCATACACGGAATATCAGGACACATTTCGCAAGGTTTATCACGCGCAGTAAAATAAGGCGTACCGGCTTCCACTGGCGATAATAAACTTGCCAAATGCAACATATCATATGGGCAAGCCTGAACACATTGTCCACAACGAGTACAAGCGGCAGAAAATTCTTTATCATCTGCGATTGCACCTGGTGGCCGAAGTGCCACACCTTCTCGGGCAAGACTTTGATTTTGTTGCATCGCCAACACAATGCCCACACCACACACACCAGCTGCGGTACGACCGACATCTTTTAAAAATTGACGACGATTTGGATTTAGTTTCACGATAGCCTTCTTTTTGCAAAATTTTGGCTGAAATTGACCGCTTGTATTGCCCCTCTCCCTCCGAAAAATCCTACGGATTTTTCTCCACCCTATCCTCTAAAGGGAGAGGGATAGGCTTGCGGTGTGTTCAGCACTGCAAGTCAGGGAGAGAGGAACGTTCAATTATGCCTTAACCACCTTAACCGCACATTTTTTGAAATCGGTTTCAAAGGAGATTGGGTCGGTTGCATCAAGGGTCACTTTGTTTGCTAACTGACCAGCGTCAAAGAAGGTAGTATAAATTAAGCCTTCAGGGCATTTATTACGACCACGAGTATCTAAGTGGGAGATCACTTCACCACGACGAGAGATAACCTTCACTTTGTCACCGTGACGCAGCCCACGTTTCTGTGCATCCGTTGGGTGCATCCACACAAGGTTATTTGGGAAAGCTTTATGTAACTCTGGTACACGGCGAGTCATTGTCCCTGTATGCCAATGTTCAAGTACGCGTCCAGTACAAAGCCATAAATCGTACTCTTCATCTGGTGATTCTGCCGGCTCTTCATAAGGTACGCCAAGAATAACAGCCTTACCATCTGCATTGCCGTAGAAATTCACGCCTTCACCTGCTTTCACATATGGGTCATAACCTTCGCGATAACGCCACAAAGTTTCTTTACCATCGACTACAGGCCAACGTAAACCACGTACTTGGTGATAGGTTTCAAAGTCAGCTAAGTCATGTCCATGCCCACGCCCAAATTGTGCATACTCTTCAAAGAGACCTTTTTGGAGGTAGAAGCCAAAATGCTCTGCTTCATCATTGAGGTATTCAGGAATGTCGCTAGGTACTTGGAATTTGTTTACTTCACCATTTAAGTAAAGGATTTCATAAAGCGTTTTGCCACGATATTCTGGCACTTTCGCCATAATCTCGTCACTCCATACTTCTTCAGCTTTGAAGTATTTAGAAAATTCTACTAATTGCCATAAGTCTGAACGAGATTCTCCCGGCCCTTTTACCATTTGACGCCACGCTTGAGTACGACGTTCTGCATTACCATACGCCCCCTCTTTTTCAACCCACATTGAGGTTGGGAGAATTAAGTCGGCGGTTAATGCTGAAGCGGTTGGATATGGGTCAGAAACGACAATGAAGTTTTCAGGGTTACGCCAACCTGGTAAACGCTCCTGGTTGATATTCGGACCTGCTTGAACATTGTTGGTACACATTGTCCATAGGAAGTTGAGTTTACCGTCTTTTAATGCACGGTCTTGACCCACTGCATGATAGCCTAATTCAGAAGAGATAAAGCCTTCTGGTAATTTCCACGCTTTTTCTACGATTTCACGGTGTTTTGGATTTGCTACCACCATATCCGCTGGCAAGCGATGGATAAAGGTTCCAACTTCACGAGCAGTACCACAAGCAGATGGCTGACCGGTTAATGAGAATGGCCCATTACCAGGTAATGCAATTTTACCTGTTAATAAGTGAACGTTATAAATCATATGGTTCACCCACACGCCACGAGTATGTTGGTTAAATCCCATAGTCCAGAATGACACCACTTTCTGGTTCGGATCGGCGTAGATTTTCGCAAGTGTTTCTAATTGATCTTTCGGTACGCCAGAAATGCGATGAGCCTCTTCTAATGTATAAGGTGCAACGATTTTCTTAAATTCTTCAAAATCACTGTCATACATTTTGCCAGCTGTTTTTGCATTTGCTGCTTTTTGTTGTAATGGATGCTCTGGACGTAAACCATAACCAATGTCAGTTTCACCACGTTTGAATTTAGTGTGTTTATTGACGAAATCCCAGTTTACTTTATCGTTCTGAATAATGTAATTTGCTATGTAGTTTAGGATCGCAAGGTCTGATTGTGGTCTAAAGATAATTGGCGTATCCGCTAACTCAAATGAACGGTGCTCATAGGTTGAAAGTACAGCCACTTTACATTCTTTATTAGAAAGTACGCGATCAGAAATACGACTCCATAAAATTGGGTGCATTTCTGCCATATTTGAACCCCAAAGCACAAAGTTTTCAGCATGTTCAATATCATTGTAACAGCCCATCGGCTCATCCATACCGAAAGTACGCATAAACGCAACCGCTGCTGATGCCATACAGTGACGTGCATTCGGGTCAATCGTATTAGAACGGAATCCCCCTTTCCAAAGTTTTACTTTAGCGTAGCCTTCATAAATAGTAGATTGACCAGATGAGAACATACCTACGGCATTTGGGCCTTTCTCTTTAATGATAGATTTCACTTTGTCTGCCATAATGGTAAAAGCTTGATCCCAAGAAATTGGTGTAAACTCACCTTCTTTGTCAAACTTCCCATTTTTCATACGCAACATTGGTGAAGTTAAACGGTCTTCGGCGTACATAATTTTGGCAAGGAAGTAACCTTTGATACAGTTTAAACCACGGTTCACCTCTGCATCTGGGTCACCCTGCGTTGCCACCACTCGGCCGTTTTGTGTCCCCACTAAAACAGAGCAGCCGGTCCCGCAATAACGGCATGCAGCCTTGTCCCATTTGATTTTGTTATCATCGGCATAGACATTCTTAATTGGAATGGCGATACCTGCCGCCGCTGCCGCCGCTGCGGCAGCATTGGCTTTCATAAAGTCTCGACGATTGAGTTCCATAGTGTTCCCCACTTGTAAATAAATTAGTTAAATTACCCCTTCTCTCTTTGAGGAGAGGGAGGAGAAAATTTGTAGAATTTTCAGAGGGAGGAGAGAATATGCCTAACCCTCTCTCTGCTCAAAATAACCTAACGTTATCCGGAGCTGTCACGCTACCTAAAAGGGAGAGTGTGTTAATTAATTTCATCTCGTTGGCTATAAATTAATGAAACCGCCATCACTCCTTCCAAAGTTTGTAGGCTTTCCATTTTACTCACCAACGCTTTTTCTTTATCTGCTTCCATCACCACAACCAACTTACCGTCATCCGGTTTTTCACCGTGGATTTCGGTGTATGGCATTGTCGCTAATTTATCTTTTACCTCATTGATATATTCAGGCCTTACCTGTAATACAAGGCTTGCCACATACCAATTTCCTAACTCTGGATTAAAATTATTCTGCATTCTTTCTTCCCTCTTGCAATACTTGAATTGCATCACTTGGGCAACTTGCTAAACAAGCTCCGCAGCCATTACAGTTGTCTAAATTTAATATCGGTTTTGCAATACCGCCAACTGTTCGTTTAAAACGAATGGCTCGCTGTTCACAACTATCTTCGCAGGCTCGGCATTCGACTCCGATGTGAGTTAAACAGCCTTGCTGAATTTCTATCTTATGCTCCCATGCTATTTCACTTGTTACGCGAAAAACACCTTGTTGGCAAACATTTACACAGGCAGCACAAAAGCTACACTCTTTACGCCCGACACTAAAATCCACTTCGGGATAACCACCCGCACCACGTTTTAAAATCTGCATCTCGCAAGCATTAATACAAGCATCACAAGCGGTACATTTTTGTAAAAAATTTACTAAATCTGCCCAAGGGGGACGAATCGCATTCTGCCCTTGGCGTTTCACCTGATCAGTCTTTAATGTATTTAAAAACTGTCCGCGTAAAAATTGACGGCGAGGTAAGGTTTGTTCGGTGAAATCAGCCAAAATAGTTTAGTAAAATAGTAGGATATTAATGGATATTGGATTTTAACAATTTGGCAAAGAGCATAAAATTGTTAAAACAAACTAATAGGAGGTTTCTTGATATTAATCAATAAAAACAATGAGTTGAAAATAGGAATAGCTTTTATTCTCTCTGTTTTGAATAAAAAAAGACCGCTTATTTTATTCGATAAACGGTCTTAGACTTAACAGATTTTTTACGCTTTTTTAACCTCAACCTTGACGAGCTTTAAAACGTGGATTCGTTTTACAAATCACGTAAAGCTTGCCTTTACGGCGCACAATTTTGCAGTCTGGGTGACGGCTTTTGGCAGATTTCAATGAATTCAATACCTGCATATTTTCTCCTTATTTACGAGTTAAACTCGCAATACCTTTAAATTTCTCATTGAATTTGCTCGCACGACCTTCGTTATTAGCTTCACGGCGCTTACCAGTATAAACAGGGTGAGAAGCCGATGAAGTATCTAAGCTGTAAAGTGGATACTCTTTTCCATCGGTCCATACCATTGTTTTGTTAGTATTTGCACAAGAACGAATAACCCAGCCTTGTTGTACGCTGGCATCATAAAACAATACTTCTCTGTAGTTTTCAGGGTGAATGCCCTTTTTCATATTTATCTCCTTGAATAACCATAATTAGAAAGTGGGATTATTTTGCCTCAGTTAAATCACAAAAGCTACAATTTTTTTATGGTTTTTAGCTAAATTTTATTAAAAGGCAATAAATAATCTAATTTTTGCGATCTAAATCGAAGTTTAGCGAGGGAAAATATGCTAGGGTATCCACTTTAGGTCCCTAGATAATCTAAATTAACGATTACTAGCCTTAACAGTTTTTGTGAATGACAATAAGGATAACACATGGCTCAACAACCCTTTTTAATTGCTCCTTCCATTCTTTCCGCTGATCTTGCTCGTCTTGGTGACGATGTAAGAGATGTTCTCAATGCAGGCGCAGATTTGATCCATTTTGATGTGATGGACAACCACTATGTACCTAATCTCACCTTCGGTCCAGCTATTTGCAAAGCATTACGTGATTACGGTATTGAGTGTCCTATTGATGTGCATTTAATGGCAAAACCTGTTGATCGTCTCATTCCTGATTTTGCTAAAGCCGGCGCGAATTACATAACTTTTCATCCTGAAGCAACAGAACATATCGACCGCTCATTACAGCTTATCCGAGATAACGGTTGCAAATCTGGTTTAGTCTTTAATCCTGCTACCCCATTACATTACTTAGATTATGTGATGGATAAAGTTGATGTAATTTTATTAATGTCGGTAAACCCTGGCTTTGGAGGCCAAGCATTTATTCCTTCTACGTTAGATAAATTACGCGAAGTACGTAAACGCATTGATGAAAGTGGTTATAACATCCGTCTGGAAGTCGATGGCGGCGTGAAAGTCAATAATATTGCGGATATTGCACAAGCAGGGGCAGATATGTTCGTTGCTGGCTCTGCAATTTTTGATCAAGCAGATTATCAAGCTATCATTAGCCAAATGCGTCAAGAATTAGCAAAAATAAAAAAATAGATAAGCCATCAACAAGCGGTTAATTTTGTATAAAAAAATGCAAAGCATTTACGCTTTGCATTTTTTGTTTTTCTAGGCTAGCTCAAAGAGCTCACCAAGAGCTGCACCTTAAGTCGCATTTTAGCATAACGTATAAGGTGCAGCTGTATATTAATTATTGGCCATAGGTTTCGCCTAATTTCGCTTTATGCTTGCCTTCTTCAATCATGGTGATGAATGTCAATAATACAGCGAGAACACCCGCACCAATCATGACATAGAAACCGCCGTCCCAGCCGTAGGTTTCTGCAGCCCAACCTACTACGGCAGAAGCTGCTACTGTTCCACCAAGATATCCGAATAAGCCAGTAAAGCCAGCTGCGGTTCCTGCAGCTTTTTTCGGTGCAAGTTCAAGAGCGTGTAAACCAATGAGCATCACAGGTCCATAGATTAAGAAGCCTATGAGGGTCATTAATAAGAAGTCCATTAATTGATATGGGTTTTCATACCAAGCGGCATAGCTTGCCAATTCTGCTTCTGGTGTTGCAGGGTTGAGCCAATAAGCTACAACTGCTGCAGTGGTTAAGATCATAAAGATAAAGCCTGTTAAACCACGTTTACCTTTGAAGACTTTGTCTGATACCCAACCGCAAAGTAATGTACCTGGAATTGCAGCTAATTCATAAATGGTATAAGCCCAAGCTGTACCTTTGATATTAAAATGTTTCACTTCGCTTAAATATACCGGTGACCATTTCAATACACCATAACGAATGAGATATACAAAGACGTTAGCAATAGCGATGTACCACAACAGGCGGTTTTTCAACACATAGTCCACGAAAATTTGTTTGGTGCTAAGATCGTGTTCTGCAGTTTTTTCATCGTAGTTATCCGGATAATCATTACGCCATTTTTCGACCGGAGGTAAACCGCAAGATTGTGGAGTATCACGCATGACAAAGTACACAGGAATTGCACAGATCATCGCAGCAATACCAGGGAAGTAAAGTGATTGTTGCCATACATCTTTTGCCGTTGCTTCCACGCCGTGAGTACTATAGAAAAGCGCACTAGCAAGTAGAACCATTGCCCCTGGCATCATACCGCCTAAGTTATGTGCAGTATTCCAAACAGAAACTATTGTTCCACGTTCAGATTTAGACCACCAGTGTACCATTGTACGCCCACACGGTGGCCAGCCCATACCTTGGAACCAACCGTTTAGGAAGATCATAATCCACATAATGGTAATACCAGAAGTTGCCCATGGGAATAACCCCATTAAAGTCATACATAGACCAGAAAGCAATAAACCAAAAGGTAAGAAAACACGAGGGTTTGAGCGATCTGACATCCCTGCCATAACGAATTTAGATAATCCATAAGCAAGACCTGCAGCAGAGCCAATAACCCCTAGTTCTGCTTTAGAGTAAAGACCAGCTTCAATTAAACCTGGTTGAGCTAAATCGAAGTTAGCACGCACAAAATAATAAGCCGCATAACCAAAGAAAATCCCCGCAAACACCTGCCAACGCAGCCGTTTATAGGTTGAATCAATTTTATCTGCGGGCAGCTCCGCAATATGTGGAGCTGGTTTGAATGGACCAAACATAAGTAATCTCCTATCTTCTATTTATATAAATGTAGTAAATTCGTTAAACTCAAATCAATATACGTTAATTATTTCTTTAAAAACTCCACCCCCGTATCAGGAAAATCAGTGAACACACCTGTAGCACCGGCTTTATTTAATAAGGCATCATACATTTCGTCCACATTCGTGAAAAATTCAGGTAACGCATCTTTACGCACGGTGTATGGATGTAGCTCCATTTTATATTTTTCTAGCTCTTTCACGAGCGGCGTATAAACAATGTTACCCACTTTAGATTGCTTATCATCAATTAGCATATACCAACCAGGACCTACACCATCTGCATATTTAGCGACTTCAGCCATTGCGCCGTCCTCAAACATCCAGTCATAGTTGTAATTAACCCAGTTGCCATCTTTATCTTGTTCTTCTGTTTCATGCCAATCAGTATAAGCGACTAACTGAACCAATTTTAAATCCATACCCAATTGCGGCAACAATTCGGTTTTAATTCGTTTTAGTTCATTAAAATCAAAGGTTTGCAGATAAACTGGCGCATCTTTATTATCATAGCCGTATTTTTTCAGCACTTTCAGTGTTTCAAGTGCAATATCCTTGCCCTCTTTATGATGTAACCATGGCGCTTTAATTTCAGGGTAAATACCTATTCTTTTCCCTGTCGATTTTTCTAGCCCCTGAATAAACTCCAGCTCTTCTTCTAACGTATGAAGCACAAAATGTGATTTCCACATTGGAAAACGATTAGGATAAACTTGCACTTGTTTACCCTCTTTCATTTCAAAATTTTCTGTCATTTCGAGTGATTTAATCTCATCTAGAGTGAAATCCACCACATAATAACGGCCATCTGCGCGTTTTCGCTCAGGAAATTTTTTAGCCACATCAGTCAAACCATCTAAAAAATGGTCATGAATGACAATAAGACGATTGTCTTTCGTCATAGCTAAATCTTGTTCTAAATAATCTGCTCGTTGTGCAAAAGCCAACGCTTTTGATTCTAGCGTATGTTCAGGTAAATAGCCGCTCGCACCACGATGCGCGATAATGAGTTTTTCACTAGACGCTTGCGCTGCGATAGAGCCAAACATTGCCATACCTAACACTAGTCCAGACATCAACTTGCTTAATTTCATTGTTATTACTCCTTCATAAATTAGGTGGCGCATATCATAGAAGATTGCATCAACAAATGCATATTTATTGTGAAATTTTGTGAGCCATATCACACTATTTTTTTAATTTATTTTTCAAAAAAAATCATAACGTGATCTCATTCACACTTTTACACTCTTTTTCAACCAAAATATTGAGCGCATTCGAACATTAAGCATAAAATACACCCAATTTATTTGTTTTACTTTCATTTGTGGGAGGTTGTATGGTCTTATCACCTCAACTTTATAAAAATGCGGGCGACTTTTCGCCAATCAGTACTGATGTCATTATCATCGGTGGTGGGGCAACGGGTGCGGGAATGGCACGCGATTGTGCGTTGCGTGGCATAGATTGTATCCTACTCGAAAGGCGCGATATCGCCACGGGTGCAACAGGGCGCAACCACGGGCTATTACACAGCGGTGGGCGCTACGCAGTAAATGACAGAGAATCTGCCGAAGAATGTATTAAAGAAAACCAAATCCTACGCCGTGTTGCATCTCACTGTATTGAGGAAACTGAGGGCTTATTTATTACGTTGCCTGAAGATGATCTCAACTATCAAAAAACCTTTATTGATGCTTGCAATGCATCAGGTATTGAAGCGGTGGCTATTGAACCTGATCTTGCAAAACGCATGGAGCCTTCGGTCAATCCTTCCCTAATTGGTGCGGTTGTTGTGCCTGATGGTTCTATTGATCCTTTCCGTTTAACCGCTGCCAATATGCTGGATGCCACCGAGCGTGGGGCAAAAGTCTTTACCTATTGCGAAGTAAAAGGATTAATTCGTGAAGGGGGTAAAGTCATTGGTGTGAAAGCATATGATCATAAAAATCGGGTAGAACGCCAATTTTTTGCGCCGATTGTGGTAAATGCTGGTGGAATTTGGGGACAAGGTATTGCCGAATATGCGGATTTAAAAATCCGAATGTTCCCAGCCAAAGGGGCATTACTGGTCATGGGGCATCGTATCAATAATATGGTGATCAACCGCTGCCGTAAGCCTGCCGATGCCGATATTCTTGTGCCAGGGGACACGATTTGTGTGATTGGTACAACCTCTGATCGTATCCCTTACGATCAAATTGATAATATGGTCGTAACTCCAGAAGAGGTCGATATTCTCTTCCGCGAAGGGGAAAAACTTGCCCCAAGCTTACGCCATACACGGGTGTTACGCGCCTACGCAGGCGTACGCCCATTAGTCGCGACAGATGATGATCCATCTGGACGGAATGTGAGTCGTGGTATTGTATTACTCGATCACCAAGAACGTGATGGCTTAGAAGGCTTTATTACCATCACGGGAGGTAAGTTGATGACCTACCGTTTAATGGCGGAATGGGCAACTGACTTAGTTTGTAAAAAATTAGGCAAAACCGAGCGCTGTACCACCCATGAGCGTCCACTCCCTGGCTCTGATGAACCAAGAGTCGAAACCAATCGTAAAATCACCTCATTACCAAATACATTACGCTACTCTGCAGTTTATCGCCACGGCGCACGCACACCAAAAATGTTGGAAAACGAGCGTTTAGATAAATCGCTGGTTTGTGAATGTGAAGCGGTGACTGCTGGCGAAGTTCGCTATGCAGTAGAAGAACTGAGCGTAAATAACCTTATCGATTTACGCCGCCGTACCCGCGTAGGAATGGGCACATGCCAAGCTGAGCTTTGTGCCTGCCGAGCCGCTGGATTAATGAGCCGTTTTAAAGCGGCTACACCACGGCAATCTACAGTGCAATTAGCTTCTTTTATGGAAGAGCGTTGGCGCGGAATTGAACCAATAGCATGGGGAGAAGCCGTACGAGAAGCTGAATTTAGCAGTTGGATCTATTACAGCTTGCTAGGGTTAAATGATGTGAAACCGCTTGAAAATCAAGCGCAACAGGGGACAGACGACAATGAATTTTGATGTAGTCATTATTGGCGGCGGATTGGCCGGACTCACTTGCGGCATCCGCCTACAACAACAAGGTAAACGCTGTGTGATCGTCAATAATGGTCAAGCAGCAATGGATTTTTCATCTGGTGCATTTGGCTTGTTAGGCGAAACAAGCGGAACAAAAATTGCAAAATTTGACGAAAAACAGACCGCTTGTTTAAGTGAAAATCACCCATATCGCGTGCTTGGCTTTGCGCAAAGCTTAGCGATGGCACAGCAATTTGAACGTGATTTTGCCAAACCGTTAGCCCTGAGTGGATCTAGCGCCCATAACCATTGGCGTGTAACGCCTCTCGGAGGGTTACGCCCTGCTTGGCTTTCACCAGAAAATTCACCAATGTTAGGCTGGACAGAAAATTTTGCCTATCGCAAGTTGGCTATTTTGGGTATTGAAGGTTATCACGATTACCAAGCAGAGCTGTTTGCCGATAATCTAAAACAGCAACCCACTTTTGCTCAATGCGAGATCATCACAGATTATCTACATTTGCCAGAATTGGACGAACTCCGCCAAAGTGGACGTGAATTCCGCAGCGTACATATTTCTCAACGTCTAGAACAACAAATTGCATTTGATGCATTAGTGCGCGAAATTCGTCAGCGTGCTCAAGGCGCTGATGCGATATTCCTGCCAGCCTGTTTTGGTATTGATCACGATGAGCTGTTCCAACGTTTACAACAAGCCAGCCAAGCAACGCTGTTTGAACTCCCAACTTTACCTCCTTCACTGCTTGGTATTCGCCAACGCAAAGCCTTACGCCAGCTATTTGAATGTGCAGGTGGCGTGATGATTAACGGCGATAAAGCCGAACGGGCTGAAGTCGATGAAAATGGTAAAATTTTGCGGATTTTCACCCGCTTGCACGCTGAACACGGCTTATCAGCTACCCATTTTGTGCTTGCTTCTGGCAGTTTTTTCAGTGGTGGATTGAGTTCCGAATTTGATCGCATTTGCGATCCGCTTTTTGGTGCAGATATTCAAGGTTTAGGTGAATTTAATCCACAAGATCGCTTGTCTTGGACTGCGCATCGTTTTAGCGCTACTCAGCCGTATCAAAGTGCAGGAGTGATTATCAATGCCCATTGCCAAGTGCGTAAAAATGGGGAATTTCTGCCGAATCTCTATGCAGCAGGCAGTGTAATTGGTGGCTATAACGGCATCGCTGAAAATTGCGGTTCCGGTGTTGCCGTAGTGACCGCCTTAACAGTGGCACAACAGATCGGGGGGCAATAATATGAGCAATATTCAACAATTAATTGAACAAGCCAAACAACAGAGCCATCAGCCTCTACCTATGCATGGCTTTGATGAATCGTTTGAAAGTTGCATAAAATGTACCGCTTGCACTGCCGTCTGTCCTGTATCTCGAGTTAATCCAGCCTATGCTGGCCCGAAACAGTCCGGCCCAGATGGTGAAAGACTGCGCCTAAAAAGTGCAGAAATGTACGATGAGGCATTGAAATATTGTACAAACTGCAAACGTTGTGAAATTGCTTGTCCATCGGATGTTAAAATCGGCGATATTATCGTGCGTGCCCGTAATCGTTTTGTCGAGCGACAAAATAAACCGCTTATGCATAAACTGCGTGATGCGGTGCTGAGTAATACCGATATTATGGGCAGACTCAATACGCCGTTTGCACCGATTGTTAATACTATCACTGGCTTAAAAGTTACCCGTTTTTTACTGGATAAAACTTTAAATGTTAGCCGTCATCGCACATTGCCTAAATATTCATTTGGTTCATTTCGCAACTGGTATTTAAAAAAAGAAGCCGAGCGCCAAGCGTTCTTCCGAGAGAAAGTCGCTTATTATCACGGTTGTTATGTGAATTATAACAATCCAGAGCTGGGTAAAGATCTGATTACATTGCTTAATGCGATGGACATTGGTGTGGTGTTGCTGGAAAAAGAGAAATGTTGTGGCTTACCGCTTTCTGTAAACGGTTTTCCGGAAAGAGCCAAAAAACAGGCAGCGTTCAATATTGCACAAATTGAGCAAACTATTGATAAGCGGGCACTAGAAGTAGTGGGAACATCTAGCAGCTGTACGATGAACTTACGCGATGAATATCATCATATCCTAGGGATGGATAATGCGAAAATTCGTCCGCACATCAGCATTATCACTAAATTCTTATCACGTAAATTTGCAGAAGGTAGAGTACCGACATTCCGCTCAATGCCATTGCGGGTGGCGTATCACACCGCGTGTCACGTAGAAAAAGCCGGTTGGGCACCTTATACGCTCGATCTGTTGCGACAAATCCCAGACTTAGAAGTAGTGACATTGCCAAGCCAATGCTGTGGTATTGCCGGAACTTACGGTTTTAAATCGGAAAACTACGACAGTGCGCAGGCTATTGGTAAATCGCTGTTTGAGCATATTAATGGCGGAGGGTTCGATTTTGTCATTTCAGAATGTGAAACCTGTAAATGGCAAATTGATATGTCTAGCCAAGTCACCTGCTTACATCCTGTGACCTTGCTCGCTATGGCGTTGAAAAAATAACCTAACAAATACCTAGCTAAGAATAAGGAGGTGCCACCTCGCAAGCGCTTGAATTTCAACAAAAAATTGCAAAACTTTAGGGCGTATTGATATTTAAAATCGGCCGCCAAACGCCTTATTTGAAATAGTTGTACGCCCTAACCTCTCTTCCATTTCTATTGTGAGATCCCATAAAAACCAAGAAAAAAGAATAATAGGGATTTGGCGGAATTTTTCGGGAATAAGTGGGACAGAAGTATTGAAAACAGGGGCTTTGCGGTGCATTGCCCCTTTTTTGTTATCAGAAAAAGGAAAAATGATGAATTGCGAAAATTTTGGCAAAATCGCAAAAATGAACCATTAAAAACACGAAGTGAAAGTGGGTGTTGTGGGGGTGTTTAAATAGCGTTTAAATCATCTCGTGTCCGATGTACACCACCTGCCCAATCACTTCAAAATCTAACGCATCGTCAAACATCACATCAACAGGGCTGTATAGCTCTTTATTATCGCTAATCAAACGAATGCCGCCGATAATACCTTGCACACGCTTAACCCAAAGCTGGTCGTTTTGGCGGAATACATAGATCTTACCGTCTTTGGGTTGAGTGGAAGCCCTGTTTATCAGCAACATATCACCATCGCTAATCGTCGGGTACATAGAATCCCCCGAAGCGGTAATAAACATAAGACGGTTAAGATATAAGCCTCTCACTTCAAGCCAACGCTTACTTAGCCCGATATAGTCATCAGGGGCATACACTTCATTATTAAACGCCCCAAAGCCAGCAGAGGCTTGCACATCATAAAATGGCACACGTTCCATATCATCGGCGGTTTGGGCTGCAACTAATGCTCTAGGCTCTTCTCTTGCCATAGTTTGAGCAAAGCCTAAAGCCTTTTGTACGCTTTCAGGCAAAGAGCTGTAGTGGTACTCAACAGCCCCACCTTGTACACCATCTCTACTTTTCTTTTTCCACTCTTGTGTTCGTGCCTTTTTATTGATCCCCTGTGGGCTTGTTGGAAGCCCTTCTATTCCAGCCAACTCATTTGCTGAAAACCACTCTTTTAAGTTTCCCATAATGCAACCTTTCAGAAACTCAGTTTCTAAAAACAAATTATTCTTTAACTTATTGATTTTTAAAGCCATTGAGAGAGTTCTGCAAAAAACTTGCAAAAATCTTTTAGAAACTATTGAGTTTCTGAAAAGTTAAGTGTATAGTTTCCAAAGTTTCCAAAGAATAATTCTTATGTATAAGGATAGCACATAATGACAACGAGCAACAAAAAACAAGATATGCACCGAGCAGATATTGTTGCAGCAGTAAGAAAAGCTGGAACAACTCTTGCGAAGCTATCAACGGAAGCGGGTTTACACCCCAGAACCTTAAATAACGCACTCGAAAGAAAATATCCGAAAGGAGAATCGATTATTGCGAATGCGATTGGAAAAACTCCACAGGAGATCTGGCCATCTCGTTACGAGTAAGGAGTTCATCATGAAAGAGTGGTTCTCAGCAAATGAGCTATCGGGAATTACTGGAATGCCTAGTTCGCCACAAGGCGTAAACAAAAAGGCACAAAGAGAAGGTTTCCAAAAACAACAAAAAATAGGGACACAGGGTAAGGCATACGAATACCACATTACCAGCCTACCCGCCGAAACCCAACAAGCACTCCGGCTGGAAGAAGCCCGTGCACTAAGTGCCAAAAGTTTCGTGCCGGAAGTTCGACCCAACACCGCCCTTTGGGCTGAGTTTGATAACGAAAGTGGGGCGAAGAAGGCGAAAGCCGAAGCCAAGTGCCGAGCAGTGATGGCACTGAAAAATGAACTGCAATTTAGTCCGATAGAGCGTGCGCTGGTGGAAGTCGCCAACCGCTTTGAAATATCGGAAGGCTCTTTAAAAAACTGGTATTACAAAGTTAAGGCGCACCCGGAGAGCGATTGGCTGGCGTTGTTACTCAACCGCTCAGGCAAGAGCAAAATCAAGGAGAAGGCGGCGTTTACTTCGGAAGCGTGGCTCTACTTTAAAGGGGATTATCTCCGCTCCAATGCCCCAAGTTTTGCGACCTGTTACTACCGGTTACAGCTGGCAGCGAAAGCGAACGGCTGGGAAATCCCGAGTAAGAACACGGTGTTGCGCCGCTTAAATGCGGAAGTGGATACCTTAACCCAAACGCTGATGCGTAAGGGCGAATATGCAGTGCAGAGCCTGTTCCCGCATCAGGTGCGGACGGTGGAGCATTTGGCGGCACTGGAAATCATCAATGGTGATGGTTATCAGCATAACGTGTGGGTGCATTGGAACGAAGACGATCCAGATGCGAAGCCGATTCGGCCGAAAACGTGGTACTGGCAAGACGTTCGCACCCGCCGAATTTTGGCGTATGTGGTGGACGACTCCGAGAATGCCGACCAGCTACGAATGAGCCTGAAAATCCTGTTGGAGAAGTTTGGGTTGCCGAAGCAACTCACGCTGGATAACACGGTGGCGGCAAGTAATAAGCAGTTGAGCGGTTACAGCAAGAACCGCAAACGGTTTAAACAGGTGGCTGGTTCAGAAATTGACCCGGCAACCGGCAAACCGCGCGAAGTGAAGGGCATTTTTGAGATGCTCGGTCTCAAAATCTCACGCACCGACATCATTTGCGGGCGTGGTAACGGGCAGGCGAAACCGATTGAGCGGTGCTTTCAGGAGCTGGAAGAGTTGATTGATAAGCACCGTGATTTCCAAGGTTACTACACGGGGAGCGACCCGGACAGCCAGCCGGACGATTACCAATATAAGGTGGGCGTGGATAAAGCGACCTTTTTGAAACGGGTGGAAGAAGGCATCAAAGCCTATAACGCTCGCCCGAACCGACGTAATGAAATCTGCCAAGGCAGATACAGTTGCGACGAAGTCTGGGCAAGGGATTTTGCTCTAACAACGGTGGCGAAGCCGACTGTCAGCCAGCTCTCAATGCTGATGATGGTGTCGGAAAGCACGAAGTTGGAGAAGAAATACGGGCTGGCAAACGGGGCATTTAGGCTGAAAGCCGGTGGTGCGAAGTTTAGCGGGGGCTATAACCGCTACGCAGCCGAAGAGCTGATTGGCTCAAAGCTGGATTATGTGGTGGTGCGTTTCGACCCTTATCACTTGCACGATGATGTGTATGTATTTGACACGCAAGACCGCTTTTTGTGCAAGGCGAAATGTATCGACAACATTGCGTTTGATGATACGGAAAAAGCCCGTCAGCACAAACGCGCGAAAACACAGATGGTGAAAGCGGTGAAAGCACAGGCGAAGGCGGTTGAACGGATGAACGCTATCGAAATGGCGGCAGTCTCGCCGGAGTTGGAAGAGGTGGCGGAAGTGAAGCCGATACTGAAACCGCTCTATGGCTTTGACGGCAGTGCTGCATTAAAACCGCAAGCAGTCGAGCTAGACGATGAAGACGAGCAAGAAAGCCGTTTTGCCAAAGGCGTGCAATGGCTCAAAGCCACTATGGCGAAATAAACAAGCGGTCGGATTTTGCAAAAAATCGACCAAAACTGACCGCTTACCAACCTAGATTTAAACAGAGTTTAAACAACATTTAAACGGGAGTAAACAATGACACCAATCGAACAAATCAAACAAATTTTAGATGACGGCGTGATTAGCCAAGCCAAGTTAGCCAAAGAAGCCGGCATTAACCCGGGTGCATTAAGTAGCTATCTAAAAGGCAACTATGCCGGCAATAGTGAGAACTTAGAGCAGGCACTGAGCCAGTGGCTTGCCCGCCGTGAAACCAAACAGCAACGCTTTGTGCAAGCACCGGATTTTATCCAAACCGCGACGGCAACCCAAATCCACAACGCGTTTGAGTTTGCCCGCATTCTCGGCACGATTGCCACCGTTTATGGCATGAGTGGTGCAGGCAAAACCCGTGCAGCGCAAGAGTTTAAACGTAATAACCAAAACGTGTGGATTGTGACTGCCAGCCCATCCCGCTCAACACTGAGCGAAATATTGTACGAAATGGCGTTAGAAATCGGCTTAACCGATGCACCACGTCGCAGTGGGATGCTCTCTCGCCTGATTATGAAAAAGCTGACCGGCACACAAGGCTTGATGATTATTGATGAAGCCGACCACTTGCCTTATCAGGCGTTAGAAGAGATTCGCATTTTGCAGGAAGAAAGCGGGATTGGCTTTGTGCTGATTGGTAATGACAAGGTTTACACCCGTATGCGTGGGGCAACGCATCAAGCCCATGAATTTGCCCGCCTTTGGTCGCGTATCAGCAAGCACGTCAGCATCCAAAAATGCAAGAAAAACGATGTGGTGGCGATTGCCAATGCGTGGGGACTGGATACCAACGACCAAGAGATGATGAGCCTATTATCGGAAATTGGTGCAGGCGGTGGTGGCTTGCGTAGCTTAACCCAAACGCTACGCCTTGCCGGTATCCACGCCAAAGGGCAAGACAGCGTAATTACACGCGATCTAGTTTTAGCCGCACAAGCAGAATTAGGGGGTAAAAATGGATAAACGTATTGAAATCGACCGCCCACTGAGCCACGACAATGCGGTGATGTATGGGCAGGTGGTGAGTTTGGAGCTGGCCATTTTAGCGTGTAACCAGCTTGGCATTGATGTTGAACGAGTGGATTACACCGACTGGCGCAGACCTTGCCTGATTGTAAAAGCCAATGCGGTGACTCAGCAGATGCTACGCCAAGGCAAAGCCTTTAACTACGGCAGCCGTGTGCAGAACGGAATCCGAGTGTATTTAAACCACGCCATTGTAAACGGCGTAAAAATCAAGTGGGAATCATCCGATTATCGTCATTAACCAACAGGAGAACCAGTATGGACGACACAAACGACACCACATTACTTATCGGCTCTTACGCCATTACACCACCGCTGACGTGGGAAGAGATCGAAGACGGCAAGGAATATGCCGTCATTGATACCACCGATGCAGGTGACTGGTGTGTGAAGTTAGAAAAATTTAACAAAGACAACAGTTATTTTTGGACATTGGTACAGTCTGCACGAGTCTTTGAGAACACCGTCGAAGCAGGTGAATTTATTGAAGCATTAACCACATTAGGAGGAAAACATGGCCGCTAAAACCCGCGTAAAACAACCGGCAAAACTGCGTTTTACTGAACAAGCACAGGTGCAGAGCGCAATTAAAGAGATCGGAGACTTAACGCGTGAGCATACACGCCTAACTACGCTGATGAATGATGAAATCACGGCGATTACTGAGCGTTATACGCCTCAGCTCAACCGCCTGAGCGAAGAGCAGAAACCGCTACAAGATGCGGTGCAGGAATACTGCGAAGCGCACCGTGACGAACTCACCGACTTCGGGAAAACCAAAACCGCAAACCTTATCACCGGTGAAGTGAGCTGGCGAACCCGTCCGCCGTCGGTGTCGGTGCGTAATGCAGAAGGCGTGCTGGAAAACTTACAGAAACTCGGCTTTGACCGCTTTATTCGTACGAAACAAGAAATCAACAAAGATGCCATGCTCGCTGAGCCGGACATCGCCAAAGGGATTGCCGGTGTGACGATTAAGCAGGGTGTAGAAGATTTTGTGATTAAACCGTTTGAGGCGGAGGTGTGATGGACATTTTATTTTATGTCGTGCTTGCTGTTTTTCTCATCCCGATTGCTCTCGCATTATCGTTTGTGGTGATTGGCTTACTTATTGGAGCATTCAAAGTAATTTTCGATTAAAGCCTATTTAAACGCTCTTTAAATCCCGATTTGAGGGGCGTTTGTAATGTGTTTTAAATCAACAAGGAGTGCTTATGACAAAAGCAAAAGAAACTGCCGAAGCTGCCGCTCGAGCAGAACGATTCGGCAATTATACGCAAGCGGCGGATCTATGGAATAAAGCGGCAAAAGCAGCCGTGAATGCCAAGCAGCAGGAATGGTGCCGAAACCGCAGTGATTTTTGTGATCGAATGGCGGAACGGCCGTTTTAGGGGGAATGATGAGCGAATTACATCGAGACCTACTGGAAGCGCAATTATGTGAAGCGATTGTGCAACTGCAACAAGCTCAAACGGCGCTACAAAATAACCAATTTATCCACGCCTCAATCTATGTTAGTAACGTGCAAAACCAGCTACCACAGATGAGACAAAAATTAACGCAACTACATTAGGAGATACAAATGGCTATATCAGCAGAGCAATGGAAAGAAATTAAACAAGAACTTGATGGATTGATAGGGAAAGTTGAATTTCGGTACAAAGAGCATCAACTTACGGTGCAAGTGGAAAAGGTTAAACGTTCTTTAGAATTATGTGTGTATGTCGATGGAAAAATTAAGAGGGAATGGATAGATGAAACACACGAGCTGCGTCCTTTTTTAGAAGAAGTATGGTATCGAAAAGAGAAGTATCTTTTCAGTGCCAAACTGCGGAAAGAGTATAAAGGCTTTTTGAGCAAAAAAGAACTCAATAAAAAAAATGTTGTTTTTTTACCAGTATTTCCTTCTCCGACTGCGTTGATCCGCCAATATAAAAAACTGGATGGCTTAGCGTTAGTTGAAATCGGATTTGGAAGACTTCTTGATGATAAGGAGGAAATTATGTAACCGATAACTTCTCCACCGCCCCGAAACCCGCGTTCAAAGAAAATTGCATAAGCTCGGGGCGGTAGGTTTTAAAGAGCATCAAAAAGCGACTGCAAGCGGTGGTTTTTTGATGTTTTTTTACAACGGAGAACCGATATGGAGAAACCCAAACTGATCCAACTCGTCAAAATCGGGCAAAACCAGCTGAATATGTGCGATGAAGATTACCGCACAATGCTGCAACGGCTGACGAATAAAAAGAGTGCCACTAAACTCACGGTGGTGGAGTTGCATAAAGTGATCCATGAGCTGCAACAAAAAGGCGCGAAAATCACGCTATTTGCACGAAAAAAAGCGAAGCCGAGCGATTACAGCCCTGCTACCGGCGAACGTCCGGTCAAAAGCGAAATTACCCACAAAATCCGCGCGGTGTGGATTGCAATGGGCAAAGCCGGTATGTTACGAGATAGTAGCGAGAAAGCCTTGAATATCTATGCGCGTAAGGTATTTAAACACCGCTCACCAATGTTGCTGAATGTAGGAGCATTGGATGATAGAGAAGCCACGCAGTTGCTGGAAATGTTGAAAAAATGGCAAAAACGAGTAGAAAAAGAAAGGGGGAATGAATGAAGCTATGTCGATGCCCAATCTGCCACAGTGATCTGCACCTTGAAGCCCTGATTGAAGACGATGCCGGACGGGAATTACTGGGTAAAATCAGCCAGTTAGACAAAGGCTGTGCCTCTCCACTGGTTGCCTACTTAGGCTTATTTAAACCGGCAAAAAGCAATCTGAGCAACAGCCGAGCCTTAAAACTTTTTAACGAAGTGCTGGAACTGTTTGAACCGTCAAAATTGCTGGCGCATTGTTTATCGGAAACCGTGCAAGCGGTACGCAAAAAACGCCTGAACGGGCAGAAAGCTGAACCGCTCACCAATCATAACTATCTCAAATCGGTGTACGATACGCAAAAAGCCACTTTTACCACGCATCCGGCAACCCGTGCTGAAACGCCGAAAGCCAAAATGCAAGAAGATAAAACACGCACTGCGATTGAGTATATTGAGCGTTATGCACTAGCGGGGCAACTGGATTATGTGAAGCACCAGCCTGAGTATCAAATTTGGTTGAATTACAAGAACCAAAAAGAACAAAAAAACAGCAGTTAGATTTTAAGCAAAAATAGAAAATACCTTGTGCAACAAGGCGTTATCGACAAAAAAACAGAAACTCCCTAAAAGTTTTTTTCTGCGTCATATCCAAATTATTATAATTGTTTGGATAAGCATTTGAAGATTTTAGGGAGTTTTTTTATGCAGAGTGAGGTACAGCAAGAGCTATTTGACGGGGAACATGCAGAAATTGGTCAATTATTCGATCAGCTAGACCACATCCCTGAAAGTGAAGTACATAATCGTTGGCCACATTTGTTGGTGGAAGTGATTGACGTAATGCAGGCTGAGTTGCAACGCCAACAATTTGCAGAAAATAGTGCCAAATTGACCGCTTGTAAGTTGGCTGGTGTGATTGCCCATTATTTCGGAGGTAAATCCTTTTACTTGCCGGCGGGAGACAAAATCAAAGAAGCACTGCGTGATGTGCAAATCTATCGTGATTTTGATGGTAAGAACGTGCCGGATTTAGTCAAAAAATACCGATTGTCAGAAAGTACAATTTATGCGATCTTACGCCAACAGCGTTCGCTTCAACGGAAGCGGCATCAGATGGATTTGTTTAATTCATAGGTTTGGTTATGGCATTTAAATTTAGAAAGAGAATTAAAATTGCTCCAGGGATAACACTTAATCTAAGTAAAAAAGGGGTTAGTACAACTATCGGGGGAAAAGGAGCTTCCGTAAATATCGGGAAAAAAGGAACATATTTAAATACTAGTATTCCTGGTACTGGTTTCTATGATAGAAAACGATTAGATGTTCCAAATCAAGAGCAAAGTAACAGCGATGAAGAGACTAATTTTAGTTGGTCTTTCCCCAAATTAGAAGAGGGAGAAACATTTTCAGGATTATCGTTTAAAGAGAAGTGTATTTGGGTTTTAGTTCTGTGTTTACACCTTTTAGCTTGTTTATTACATAGTATAGGATTACTCATCACTTTATCTTGGAACTTAATAATATTCGTTTGTCTTGCTTTTATTTTTTATGTGATTTTTAAGTTGATGTTTTAAATGAGACATCGCAAACTTATTTAATTATTGCATAAGTTAGACTCCCACTATCAACCAATCTTGATAGTGGGAGTTTTTTTATGTCTTTACCTATCTTAAAAATCGTTGTGCATTGCTCGGCAACTCGCAACGGTAAATCTCTAAAACAATCCGGCAAAAGTGCGGCTCAAGTGATTGATGGCTGGCATAAGCAGCGTGGTTTTAAGCGTTCGGCCGGTGCAATCAAATCTTTTAATTCTCATCTGCCTCATCTCGGTTATCACTTTGTGATTGATGCGGACGGCACAGTCGAAACCGGTCGTCAAGTCGGTGAAATCGGGGCGCACGTGCGTGGGCATAACTCAAATTCTGTCGGCATTTGCTTGGTCGGCGGTATTACCGCAGAAGGCAAAAATCACGGTCAATATACAGAGCAACAGTGGCACGCCTTACATCAATTATTGCGCCAATTAGAAGCAAAACACCGTAAAGCCAAGATTTATGGGCATCGAGATTTATCACCAGACAAAAATGGCGATGGCTCTATCACACCGAATGAGTGGCTTAAAGATTGCCCTTGCTTTGATGTGTGGAGTTGGTTGGATAGCGAGCAGGTTGTGAATCTTGAGCATTTATTTGAGGTAAAAAATGGCATTTAAAGAGTTGATTAGCAATGCAGACGGGCGATTATCCACTACCGCCTTTATCCAATTTTTCGGGGCGTTGCTGATGGCCATTATTTTGGCGTATTCGGTCTATTTAGACCGTAGCAACGTGGGGGAATTATTTACCGTGTTTGCACTGTTTTGTGGTGGCCAAGTTGCGACCAAAGGCTTTGCGAACGCATTAGGGCGAGGAAAAGAATGATGGTGTTAGATCAAATTATTCCGCTGGTCGCATTAGCCGGTTCGGTTGCCAGTTTTATTGGCTATAAGAGCTGGCAACTGGCCAAAGAGCGTAAAGCTAACCAAAAATTAAGTGAGCAGAATCAGCAACTTCAAGCCGAAAAAGCCGTTGCTGAGGCTCAGGTGAAAAATCATCAAGTGAGAAAACAAAATGAAGAAAACATTAGTGGCATTAGCCGTGGCAGCATTATTGCCGAGTTGCACAAAAACGGTGACTTACGAGGTGACGAATAGCAGTTGTGCCGGTTTTAGTTTGATTAAAGCAAGCCGTCAAGATACAACCGAAACACTACGTCAGGTCTTAATCCATAACACGACCTACCGACAAATCTGTGATGTTGGTGAGAAAAACGAGAAACAAAATGAGTGATGATGTAGATCGTATCAATGAGCGTGAAGAACAGCTCCTTGCTTTACAGCTTGCGCCACATTTAACACAAAAACTCTCTGATGATGAAGTTGAGCTGATTGCGCTTGAAGGGCGCGATTGTATTGAGTGTGGTTTACCTATCCCTATGCAACGGCTGAGAGCCGTGCCGCTTGCGGTGCGTTGTATCTGCTGCCAACAGGATTATGAGGACAGCAAATAATGATTGAAGTATTTGAGGTGATTAAAGCCCATTGGGGCATTATTTTGACCCTCACCGGGCTACTTGCCTCGGTATTTTGGCTGAAACTCGACAGCCGTTACGCCAAGAAGAATGACATCGGAAAACTGCTTGAAGTTGCTCAAAACCACGAAGGACGTTTAAGTGGATTGGAAACTAAAGTGGATAATTTACCGACAGCAGTTGATATGGAGCGGTTAAAAACCCTAGTCACCGATGTGAAAGGCGATACTAAGGCAACCGGTAAACAGGTTGATAGTATTAGCCACCAGTTAGGATTATTGATTGAAGCAAAATTAAAGGAATAGCAATGGCATTAAAAGAGCTATTAACTCAAGACCAACGCCTCGTTATTTTACGGTCGCTTGCAGAGGCAGGTTATGACGCAAACGAGTCGATTTTAAACGATTGCTTGGATTTGTACGGTCACGATATTAGCCGTGATTTAGTCCGTACTCACTTGTGCTGGTTGGAAGAGCAAGGCTTACTAACACTTGAGCGTTTAAAAGGCGGTTATATGGTGGCAAGCATTACCCAACGTGGTTTGGATGTTGCACAAGGGCGTACTAAAGTGGATGGCGTAAAACCTCCTCGTCCTAAGATTTAAACGATTTTTAAACGAAATTTAAGGAGCGTTTAAATGGCAGAAAAAAACACCCGTGGGCGTGCGAGCAAAGTCGATTTGCTGCCACCGAATATCAAAACTCAGCTTGCGATGATGTTGCGTGACAAAACATTTTCGCAGGCTGAAATTTTAGCCGAAATTAACGATCTGATTCGTGATTGCGGATTAGATGAAAGCTACTGTTTAAGCAAAACCGGGTTAAATCGTTATGCCTCACGAATGGAGCAAATGGGGGCGAAAATCCGTCAGTCTCGTGAAATCGCAGAGATTTGGACAAAACAATTTGGTGAAGCCCCGCAGTCAGACATCGGCAAAATGTTGATGGAAATCGTCAAAAACATTGCCTTTGAAACCTCGCTCGGACTAAGTGAGAATGGTCAAGCCGACCCGAAATCTATCGCCCTGCTCTCATCGGCTGTGCAGCGTTTAGAGCAGGCAGAAAGTTTGAGCTTTAAGCGAGAGCAAGCGATTCGTAAGGAAGTGGCTCAACAGGCGGCAGAAACGGCGGAAAAAGTAGTGGTGCAAGCGGGGTTATCTGCGGAAACGGTACGAACCATTAAAGAACAGATTCTGGGGATTGCCTGATGTCATTAGTAAATGAACGCCCATTGAATGAACTATCGCAAGAATGCCAAGATTTCTTGGATTGTATCCATGTTTTTAACCCAAATGAGTTACTGCTTGGATATCAGAAGCGTTGGATTGCAGATGAAAGCCAACTAAAAATTGTAGAAAAATCTCGCCGTACCGGTTTAACTTGGGCAGAGGCTGCTGATAATGCACTAATTGCAAGCACTCGGAAATCTGATGGTGGTTGCAATGTATTTTATATCGGGTCTAACAAAGAAATGGCACGAGAATACATCGATGCTGTTGCTATGTGGGCAAAAGCATTTAATTACGCAGCAAGCGAAATCCAAGAAGAAGTTTTAACGGATGAGGAGGAAGGTAAAGATATTTTAACCTATGTAATCTACTTTGCCTCAGGTTTTAAAGTTAAAGCTCTCTCATCAAATCCAACAAATCTGCGTGGTATGCAAGGTGTAGTAGTCATTGATGAAGCAGGATTCCATAAATATCTTGCAGAAGTATTGAAAGCCGCTCTTGCACTCACTATGTGGGGGGCTAAAGTTAGAATTATCTCAACACATAATGGTGTTGATAATCTATTCAACCAACTTATTTTAGATAGTCGTGCCGGTCGTAAAAAATACTCTGTTCAAACAATTACACTCGATGATGCCTGTGCAGACGGGCTATATAAGCGGATTTGTCAGGTAACTAAACAAGATTGGACACAAGAAAAAGAGAATGAATGGAAAGCAGACCTGCTTAGAAATACAGCAACCGAAGATGATGCCCTTGAAGAGTATTATTGTGTGCCTAAACGCAGCTCCGGTGGCTATATCCCTCGCCCATTGGTTGATCGTGCGGCGGATGAAAGCAATGTGATTGTACGTTTTGAGTGCGATGATAAATTTATCACTTACTCAGATGTTGAACGTGAAACATTGGCATTGGAATGGCTTTTGAAAGAAGTCTTGCCACAGCTTGAACAGCTCAACCCGGATTACCGCCACAGTTTCGGCGTGGACTTTGCCCGCAGTGGCGATTTAAGTGTATTTGCGGTTTGTGCTTGCTTGCCAAGTACTGAACGCCGCTTGGCTTTAACCCTTGAAATCCGCAACTGCCCATACGACCAGCAAAAGCAAATTATGTTGTTTGTACTGGCGAATATGCCGAGATTTATCGGTTCTGCCTTTGACTCCACCGGTAACGGCGGCTATTTGGCTGAGAGTGCCTTGTTGCGTTATGGCTCGTCTATGGTGGAAACCGTCCACTTAAATGATAAATGGTATCGGGAGTGGATGCCGAAATATAAGGCGTTGTACGAATCAGACTTAATCAGCATTCCAAAGGACGAAGAAACCATCTTAGACCAAGGGCATATTGTGGTGATTAACGGTGTGCCGAAAATTGATAAAACACGCAGCCAAGGCAAAACCGGCAAACGCCACGGGGATAGTGCTGTGGCTTACTGTATGGCAGTACGAGCAAGTTATATGACCGGTGGGGAGATTGATTTTATTCCTTTACCAAGTAAACACGAAATCAATGATGACGATGATTTACCTCGTTCAGATTGGGACATTTAAACAATGAGAAGTAGCACAATTTTAGATATTCACGGCAATCCGTTTCGGTTCGAGGAGTCTGTGCAAACGGAAAACGAAAGCCGTTTAATGCAACTGCAACACCACTACAGCGAGCATCCGGCAAGCGGCTTAACACCGGCAAAAGCGGCTCGTATTTTGCGTGAGGCGGAGCAAGGTGATTTGATTGCACAATCGGAACTTGCCGAAGATATGGAAGAAAAAGACACCCACTTGCAGTCCGAGCTGGGGAAACGTCGTGGTGCAATTACTGCCGTAGAGTGGCGTATTCGCCCACCGGCAAATGCGAGCGCAGCCGAACAGCGTGATTGTGAGATGATTGAAGAAATCTTGCGTGATGCCGTATGGCTGGATGACTGTATTTTTGATGCCAGCGATGCCATTTTAAAAGGCTTTTCTTGCCAAGAAATTGAATGGGAAAGCGGTTTAATTGGTGGATTAAAGCTCATTAAAAACGTGCATTGGCGTGACCCTGCATGGTTTATGACCCCGACATTAGAACGTAATACGTTACGCCTGCGTGATGGCTCGGCACAGGGTGTAGAAATGCAGCAGTTTGGCTGGATTAAACATATCGCACGAGCCAAAACGGGTTATTTAAGCCGTATCGGTTTGGTGCGCACCTTAGTGTGGCCGTTTTTATTTAAGAATTATTCCCTGCGTGATTTTGCTGAGTTTTTGGAGATTTACGGCTTACCGCTCCGCTTAGGTAAATATCCAGAAGGTGCGGGAGATAAAGAGAAACAGACACTTCTGCGGGCAGTAATGAGCATTGGACACAATGCCGGTGGGATTATCCCTCGTGGCATGGAAATTGAGTTCCAAAAGGCGGCAGAAGGCTCAGAATCAACCTTTATGGCGATGATTGAATGGGCTGAAAAGACGATGAGCAAAGCCATTTTAGGCGGCACGCTCACATCTCAAGCTGATGGGGTTACAGCAACCAACGCCCTTGGCAATGTGCATAACGACGTGCGCAAAGAAGTTCGCAATGCGGACTTAAAACGCTTGGCTGCAACATTGACCCGTGATTTAGTTTATCCGTTGTATGCACTCAACTGTAAATCATACAACGATGCCCGTCGTATCCCTCGCTTTGAATTTGATGTGGCAGAAAGTGAAGACTTGAATGCCTTTGCGGACGGTTTGAATAAGCTGGTTGATATTGGTTTTAAAATTCCAAAACAGTGGGCACACGACAAATTACAAGTGCCGATTGCAGCGGAAGATGAAGAGATTTTGGTAAAAAATACGCAAAATCCGACCGCTTACTTGTCTGCGCGTGCAGATAAAAAAATTGCGGTACTCTCTGCAACGCCTGATCCTGATTATTTAATTGAACAGCTTGAACCAACGGTTGAAGAGTATCAAGAGATTATCGACCCAATGCTAAAACCGGTGGTGGAAGCACTTGAAAAAGGTGGCTATGAGTTTGCACAGGAACGCCTTGCAACCCTTTACGCCGAAATGGATGATAGTGAGCTAGAAAAGCTCCTTACTCGTGCAATCTTTGTGAGTGAATTACTGGGAAAAGCCAATGCCAAACGATAACGCATTAGATATGGGCTATGTGCTACGGCTTGAGCCGGAATTAGCGGTGGATTATCTGCGAGCCAAAGGGGTGAATATCACTTGGGACTGGCACGAGCAGCTTGAAGCCGCACACGCCAGAGCCTTTACCGTTGCAAAAGCGACTCGTGCAGAAGTGTTAGATACGCTTCGCTGGGCAACGGAAAAAGCCATCGCAGAAGGTACACCGGAGCAGGAATATATTAAAAACCTTGAACCTATGCTTAAAGAGTTGGGCTGGTGGGGCAAGACGGTCGATGAAAACGGAAAGACTGTGCAGCTTGGCAGCCCGCGCCGCTTAAAAACGATTTTGCGTACCAACAAATCGACCTCTTATCACGCTGCCCGTTACGCGGAGCAGATGGCAAATGTGGATGAACAACCCTATTGGCAATATGTGGCGGTAAAAGACAGCCGTACCCGTGCCAGTCATTTAGCTTTGCACGGTAAAGTCTATCGGGCTGATGATCCTATTTGGCAAACGATGTACCCACCGAATGATTGGGGCTGCCGTTGCCGAGTACGAGCATTAAGCGAATTTGCTCTGAAAAAGCAAGGACTGAACGTCTCAGATAGTGCAGGCAGAATCTCCGAAGAAACGGCGATTGCAGGTGTGAATAAAGACACCAGTGAAGAAATCCGCACAACGGTGAGCCGAATTAAAACCGACCAAGGCGAAATGAAAGTGGGTGCAGGTTGGAATTATAATGTCGGCTCTGCGGCATTTGGGACAGATGTGGCTGTTATCCGTAAACTGCGCCAAGTTAAAAATCGAGAATTACGTCAGCAGACAATTCAGGCGATTAACGATAACCCTATTCGGCATAAACTCTTTGAGCAGTGGGTAAAATCTAATTTAGGCAAGCGTGGTGCGAGTGCAAGATATATGTCTGCGGGGTTAGTCACCACAGAGATTGCGGAAAAGGTTGCGGAGTTATCAGGGCAGGAAAAAGCATCAGAACTGGTCTTGGTAATGACCGAGAAACGCTTAGAACATGCTAATAGTGATAAGCATCACCAAACCGGTGTAGGGCTAACGGCTGATGAATATGCCAGTATCTCTCGAATTATTGCTAACCCCGGAGCTGTGATTTGGGATAGTGAACGTGGGCATAATAACTTGATATATCTAAATCAGGATAAAACTATCAAAGTTATTGTTGATGCACCGAGTAAAGATAAACTAAAACCAACTGAAAAAGTAGATGCTGTGATCAACGCTTATAGAGTAGATTATGCAGAGGTGTTAAATAAGATAAAATCAGGCGTGTATAAAATTGTGAAGTAACTATGGGCTTGGCGAGTATCGAAGTCGCACAGAGCGTTTAGTTAAACGCTGTCCTACCAATTAGACAACAAGCCCATATAGTTTCGAGTAATTTAACGCTAATTTTTAAAGGATGTCAAATGGATTTAGAATTTAAATTCGACACGACCGAAATTCAAAATAAGTTTAAAAAACTTGCTCAGGTGGTTGATGGGCGTGAAATTACCCGTAAAGTCGCGAATGTCTTACTACAAGAAGCGGAAGCGGCGTTTGATAATGAAAAGTCGTCTGAAGGCGAGCCTTGGGCAAAACTCAATCAAGACTATAAAAAGAGACGTTATGATAAAGGCTACACCGGCAATATTTTGCAGGTGACCGGTGATTTGGTTAAAAGCCTGAACATTGACTATGGTGACAGTTTTGCGGTGATTGGTGCCGCTGAACCCTATGGGCAGTATCACCAAATGGGAACAAGTAAAATGCCGGCACGTCCATTCCTTGGTTTAGGTGATGATGGTGTAGCAGAAATTAAAGCTATTTTACATCGTGAATTATCGAAAATTATGCAGTCTTGATGTAAAATCGCAAAAAACGCCACAAACGCTTTCTAAGCCTATTTTGTTTTCAATACGCCTTGTGTTTCGATAAAAATTATTTAAACGCTCTCAGAGCGATTTAAACGGCATTTAAACGGTATCTTATTATTTATTTCCATTCTGTTTTTCAGCCTCCGAGAAATCGGGGGCTTTTTCATCTCTGATACATCACAAACCGACCTTTGCTTTGTTTTTGTCATTATGACGCTATGAAAGCAAACAAACACCCCTTAGCGGTATTAACCGCACAGCTTACAAGTCCTGATGGTTGGCAACAACTTTTGCCGAAAGGTAAATTTCGTGCAAGAGACGGTCGCCCGGCTGATGTACCACATTGGTATCTTGATGCAGAGATTGCCAAGCGTTTAATTCACAGAGCGAAAACCCTCCGACAAGACATTCTAGTCGATTACGACCACGCTACCTTGCTCAAAGCCAAGAAAGGCGATGATGCCGGGAATGTGGTGGCTGCCGGTTGGTTCAACAATGTGGAAATGCAGTGGTTTGATGACGATGAACGACAGGGTTTATACATTAAGCCCCGTTGGACGCCTAAGGCTTATCAGCAAATTAAGGACGGTGAGTTTGCTTTTTTGTCTGCCGTTTTCCCTTATGACGATAACGGTGAACCGATTGAGATCCGAATGGCAGCCCTAACCAATGACCCCGGTATTACCGGTATGCAACGGTTAGCCGTGCTTTCGGCAGTAACAAACCAGCAGGAGACAGCTCAAATGGGCAAATTGCGTACATTACTTAGTAAGCTCGGCATTGAGATTGCCGAGGGAACTGAAATCACGGATGAACAGGCAGAGGCTGCATTAAAAGCGTTAGATACGCTGCAAACAGATAAAGCCACCGCAGAAAATCAAGTTGTAGCACTGAGTGCAAAAGAGGTGGATTTAACCGCCTATGTACCTAAATCCACTTACGATGCGGTAGTCGCAAAAGTGGCAGTGTTATCGGCAAAAAATGATGAAGTGGAAATCGACAACACGATTACCAAAGCTCGCAATGAAGGTCGTGTGATTGAAGCTGAGGTTGAATACCTAAAAGGCTTCGGCAAACAGCAAGGTGTTGCGGCATTATCTGCAATGTTGGCACAACGCCCACAACTTGCCGTGCTGTCAGCACAGCAAACAGAAACCACGAAAGTGGAAAAACAGGTGAAAGGCGAAGCAGTATTAAGTGCTGCTGATAAAGAAGCAGCTCGTTTGCTTGGCATTAACGAGACGGATTTCGCAAAAGAATTGGAGGCTAAATAATGGCAAATGTAACCCCTGAACTCGTCAAAGCACTATTTGTTGGTTTTGGCAAAAATTTTAAAGAAGGTTTGGCGAAAGCGCCGAGTCAATATACCAAGATTGCAACTGTTACTAAATCTACAACCAAAAGTAATACCTATGCCTGGTTAGGTCAAATGCCTAAATTAGTCGAATGGATTGGTAAGCGTGCTGTAACAGCAATCCAGTCACACGGCTATTCAATCGAAAATAAAGATTGGGCAGATTCTATCGAAATTAAAAAGACTGATATTGAAGACGATAATGTTGGCGTTTATAGCCCGTTACTTGAAGAGCTTGGGCGAGCTGCCGGTGAACAGCCAGATGAATTAGTGTTTGGTGCATTAAAAGATGGCTTTAAAACAGCCTGTTATGATGGTCAGTATTTCTTTGACTCAGATCACCCAGTAGGCCGAAATGTTGATGGTACCAATCCGATCTCTGTTAGCAATATTACTGATGACGGTACTGGTGTCACAGATGAAAATGCCTGGTATTTACTTGATTGTTCTCGTTCGTTAAAACCAATTATTTTCCAAGAACGAAAAGCAGCTACGCCCGCACAGATGACTGACGAAACTGCCCAAAAAGTTTTCGAAGAAAATGTTTATACCTACGGCGTAGATTCGCGCTGTAATGTTGGGTATGGTTTCTGGCAAATGGCTCACGCTGTTAAAGGAAAATTAACCGCAGAAAACTTATGGAAAGCAATTTCAGCAATGCGAGCGGTGCGTGGCGATGGGGATAAACGCCTAGGTATTCGCCCAACTACCTTAGTTGTACCGACCTCATTGGAAGAAGAGGCAATTAAACTTTTAGAGCGAGAATTAAGAGTTGAAGATGGCGTAGCTATTGATAACGAATTCAAGAAAATGAACATCGAATTAGTAGTAGCCGATTATCTCTAACCTACAAGCGGTCGATTTTACTCTGTTTTTTGCAAAATCGACCCTGTTTAAACCTGATTTAAAGAGGATTTAAATGCAATGGACAATACTCAATTATTTAGTGCGGTGGTGCAAAACAAAATTAAAGACGGTTATCGCCGTGCTGGGATTAGCTTGGCAAAAGGTGAAAACGTATTGCCGTCAATCACCGAAACCCAGCTTAAACAGTTGCAAGCAGACCCACGCTTGGTGGTTACCCAAACCGAACAAGCAAGCCTGCAAAATGGTGGCAAAGGGTTATCTCAACACAGTCCGGATGACGGTGGCAAATCGAATTTGGACGGTGGCGTGGTACCAGCCAATTTAACGGTGGAACAGCTCAAAGCCAAACTCACCGAGTTAGGTGTTGAGTTTAAATCTGATGCGTTAAAAGCGGAACTGGTGGCTCTACTTACCGCTGTATTGAAACCAAAAGAAGGTGAGTAATGCCCTACGCAACCCCTGAGAGCCTGATCAAGCGTTACACGCTGGATGTATTATTAAGCATTGCGCGTAATGATGAACGGCAGCTGGATGAGCCAAAAGTTTATGAGGCATTGGAAGATGCGTCACAGACGATTGACAGCTACTTAGCCGGTCGTTATCGCTTACCGCTCAATGCGGTGCCATCGGTGCTAGAACGTCATTGTTGTTACATTGCCCGCTATTTTTTAGAGAAAAACCGAGCTACAGATCAAGCTCGCTTGGACTATGAAGATAGTATTCGGTTTTTAGAGAAAGTGGCTAGTGGAGCGATTTCACTCGGTTTGTCCGATATGGACGAACCGGTAGAAACAGACAATAGTGCGGTGATGGAGAGCAATGGCTCGGTATGGTCACGCGAACGGTCGAAGGGGTTTATCTGATGAGCATTATTGCCCAAACCAATGAGGCATTGATTGCCAAAATTAAGGCCTTGTGTGGCGACTATCTCAAAGAGGTCGAGACGCATCCGGGACAATGGGATGACAGCTCAGTGCGGCGTATTGTGCGTAACCCACCTGCGGTGTATATCGCGTGGCTTGGGCAGCAACCCAGCAAAAACAGCCATACCGTGACAGCTCGCTGGGGGGTATTTGTGGTTGCCGAAGTGTTGAACGGGCAACGCAATGACAGCATCGGGATTTATCAGGTGGTTGAGACATTGACTGCCGGTATCCATCAACAACGCATTGAGCCGTCAGGGATGTTTACGCTGCAATCTGTGCAAAATCTATGGAGTGATACCCAAAGCGGTATGGGCGTAGCGGTTTATGGCATGTACTTTAATGCTGACCAGCCCCTTGCTCACCAAATTGACGAGGATACGTTGTGCGATTTTAAGGTGTACGACCATACGTTTAACCAAGATAACGACCCACACACGATTGACGGAAAAACCCGTCTTACCTTGACCTTACCAACCCAATCATCCGAATAAGGAGAGGTATGCAAACCTTTAAAATTAAACCTAAAGCAGGATTGATTATTCGCGATCCTGAAACGTTTGAGCAGCTAAACGCCAAAGGCGAAGAGAAACCCCAAAGCGGTTACTGGTTGAAACATTTGAAAAATGGTGATGTGGAGTTAGTTGAGGCTAAATCCACCGAAAAACGAAAAAATAGTACGGAGAGCGCATAATGGCGATTTCTTTTAATGATATTCCGTCAGCCCTGCGTGTGCCATTGGCATATATTGAGTTTGACAATAGCAAAGCGGTGAGCGGCACACCGTCAGTCCTGCACAAAGTGCTGATGCTCGGCACAAAATTATCCAGCGGTACAGCAGAAGCTGGTCAAGCAGTACGTGTTACTGCTTACTCACAAGCTAAAACCTTATTCGGTCGTGGCTCGCAATTAGCTGAAATGGTGAAAACCTTTAAGGCACACAACAATATGCTGGATTTATGGTGCTTACCGCTCGATGAGGCCAAAAGTGGTGCAAAAGCGACAGGCACTCTGACTTTATCCGGTACAGCAACACAAGCCGGTACGTTAAGTGTAATGATTGCCGGCACAAACTACAAACAAGCGGTATCCAGTGGCGATACTGCGGCAACCCTTGCAACTAAACTGCAAAAATTGATTGCTGCAGACCAAGATGTGCCAGTGACGGCAACTGTCTCTGGTGAGTCAATTACACTCACTTGTCGCTTTAACGGCGAGACAGGTAATGAGATTGATGTACGTTGTAACTATTACAGCGGCGAAGTGTTGCCTGCTGGCATCTCGGTAAACATTACGCCGATGCAAAACGGGTCGGTCAATCCCAATATGGCGGAAGCCATTACCGGCTTTGGTGCGGAATGGTGGAATTATCTCGTCAATCCGTTTACGGATACCGAAAGCCTGAATTTGCTGCGCACAGAGCTAGTGACCCGCTGGGGGCCGCTAAAACAGATCGATGGTATCTGTTTTATGGCAAAACGTGGCACGCACGCAGAAGCGGCTACCTTTGCCGAACAGCGCAACGATTATCTGTTTAGTGTACTAGCAACCAACAAAGCACCACAGCCGGCCTATATTTGGGCATCTGCTTATGCGGCGGTGGTCGCTGGCTCGCTATCTATTGACCCTGCTCGCCCTGTGCAGACATTAACCATGGATTTATTGCCACCGGCAATGAGCGACCGCTGGGACTTACCAGAGCGCAACACCTTGCTGTATAGCGGCAACAGCACTTACATCGTCAATGCCAACAATCAGCCACAGGTTGAGGCGGCGATTACGATGTATCGTAAAAATGCGTTTGGTGATAACGATGAGAGTTATCTCTACGTAGAGACGATTGGCACACTCAGCTATATCCGCTACGCCATCCGCTCTCGCATTACGCAAAAATACCCGCGCCACAAGTTAGCAAATGACGGCACACGTATCGGACCTGGACAAGCGATTGTCACGCCGAAAATCATCCGCAACGAACTGTTGGCACTGTTCACTGAGCTTGAATCCGCAGGCTTGGTCGAAGATTTCGAGCAGTTTAAGCAAACATTGCTGGTTGAACGCGATGCAAACAATCCATGCCGTGTAAATGTGTTATCTAACGAGAACTTAGTCAATCAGTTCCGCATTTACGCCCATGCAATCCAATTTGTTTTATAGGAGCAAACAATGGCAACAAAATTCCAAGGAACGGCAACTATCCGTTTTAATGGCAAGGAGTACCCAACAGATAACGATGGTTCGCTGGATGTCGGTGGCAAAGAACGTGAAACTGTGAAAGGCTCTCAGGTGTATGGTTTCTCAGAGAAACCAAAGGAAGCTACTGTAGATGTGACGGTATTTAACTGTGAAGAAACTGATGTGATGGAACTCAAAAATATGACGAATGCTACCGTTGAATTTGAGACAGATGTCGGTCAAACCTATCTCTTGCCAAATGCGTGGGCGGTTGAAACGGGCACACTAAGTGCAGACGGCAAAATTAAAGTCAAAATGGCAGCAGTTGAATGTAAGCGGGTGTAAAAATGGAACTGATGTTAAAAACAGGTTTACGCTTTGGTGATGAACCTCAAACCGTGGTGACTTTGCGTGAATTAACCACAGGTGATTTACTAGATGCAGAAGTGGCCGCAGAACGAATGGTAATGTCGCCGGACGGTGTGCCGGTATTGGTGAAATCACCGGCACTTTTTGGCTATGAGCTGATACGCCGTCAGATTGCCTCTATCGGTAAAATTCAAGGGCCGATTTCGATGAAAATGTTACGGTCGATGACCTCGGAAGATTTACAGCTTATTTCAGTTTATGCTGAAACTTGGGAGGCGACCAAAGCCCAACAGGTGGTCGAGCGGGGGCGATTGGATGCAGCAGGTGGAGAAGCTGGAAAAGACCTGTCTGCTGTTAGCTAAACATTATCAGAGCAGCCCTGAGTGGCTGCTTTCCAGACCTATTTTAAACTTGCCACGCTACATCAAGTACATAAATTCAGGAGGGGAAGATGGCAAATAATTCAACTTCGTTTTATGTCAATTTAGCGGGTAATGTCTCTTCACAAGCGTCCAAGTTTGGCAATTCGTTATCGGCGATGGCAAATAAGGGCGTATCTAATATGGCTAAGCTCAGCAGTTCGATTACAAAAGTTGGCTCGGGTTTAGCTTCGCTCTCACAAAAAATCAATAATGTTGGCAATGTTGCACTGCCAGTCATTGGCGTTGGTGTCGGAGCCGGGGCTGCGATGGTAAGCAAGTCGATGATCCGCGTTGCTGCCGATTTTGAGATGGCCAATATCCGAATGAAGCAGACGTTTGGTAAGCGTGGCGATGAGGCAATGGCGTGGCTTAAAAAGTTTGCTACCGATACGCCTATGGCATTTGGTGACGTACAAGATGCAGCAATGCAGATGATGACAGCCGGTATCGACCCAATGAACGGCTCTCTACAAGCACTTGTGGACTGGAACGCCAAAGTTGGTGGTAGCACAGAAAATCTGAATGCTTATATCTCCTCATTTGCCAAAATGAAAATCAAGGGCAAGATGTCCTGGGAAGATATCCAACCACTGCTTGAACGCAATGTGCCCGTACTGAAAATGCTTGCTGAAGCCACTGGTAATAAATATACCGAAAAGCAGATTATGAAAATGATCCAAGAAGGCAAAATGCAGGGGGCAGCTCTTGACGCACTTTGGAAGCAGATGGGGAAAAATGCCAAAGGTGCAGCCAAAGAACAGATGAAGACCTGGGATGGCTTAGTCTCTAACTTAGGGGATACTTGGGTTGCGATGCAGGCTCAGTTTATGGGGCACGGTGCTTTTGACAGTCTAAAAGCTGAGCTAGGGAGTTTCTTGGAATGGCTAAACAGCAAGATTGATGACGGCACACTGGATGCATTTGCCAAAACCGTGAGCGAAACATTAACCGAAGCCTTGAAAGATCTCAAAGAGATGGCAACCGCTGTACAGCCAACTTTGGAAAAGATTGGCTCAGTGATGGAATGGGTCTCCGAAAAAGCCGGTGGTTACGGCAATATCGCTAAGTTTGTCGGGGGCTTTTATGTAGCCAATAAAATTGCGAATTTAGGCGTTACCAAAAAGATAGCCGGTGTAGGCTGGGGCACAACCAAATGGGTCGGCAGTAAATTCCGCCGAAACCCTAAAGGTGGTGCTGGGGCTGCAATGGAGACCGCAGGCTTATTAGGTGGTGTTGCCGGTGTGACGCCTGTCTATGTCACCAATATGCCAATGGTGGCAAACGGCTTGGGTGGTGGGTACATCGGACAGGAGCCAAATAGCAAAAAAACGAATAAAAAACTACCTAAAACACCGAAAGCGTTGCCGGGTGTTGCGGTAGCAACGACTGTGGCTGCCAATGCGACTCAGGCAACAGTGAATAAAGGGATTACACAAGCGGTCGGAAATACCGTGAAATCTGCAAGCCAAGCAATCAGTACTACTGCACATACCGCAACAGCTGCGGTTAGCCGTACCGCTGCTCGTGCTGTGCCGTATCTCAATGTAGCAGCAACCGCAGTAGAAGGGGCAATGGTGCTAATGGATAACCAAGCTAGCACGCAAGACAAATCGGAGGCGATTGGCTCTATTGCCGGTGCAACGGCTGGGGCGATTGTCGGGCAAGCCTTAATCCCAATTCCAGTGGTAGGGGCAGCGGTCGGATCTTATGTGGGTAGCTGGCTTGGTGAGTGGTTAGGCTCGGAAGTGGGCGAATATCTCTCTAATCCGGAGCCGATTAAAAACGAGCTTAACGGCACAATTCAAGTGGCAGTGAAAGCCTCAGAGCATTTAATTGCAACGGCCACGGCAAGCAAAGTGCAAACGAATCAGAAACAGGACAATATGAATATCGCCGTACAAATGGGCACTCTTGGCCCGGGCGTGGGGATGTGGTAGATGAGTAAGATAACCGGTAAAGGGAGCTTTCGCGGTGTTCCCTTTTTAATTGAGGATGAGCAAGGCCAGAATGGCGGACGACGCATTGTTACGCACGAATATCCGTTGCGGAATGACGGATTGACGGATGATTTAGGCAAGCGTATGCGTAATTACTCTGTCAGCTGTTTAGTGATTGGTGATGATCATATTCAGCAAGCAGAAGCATTAGTGGATGCACTTGAAGCTGACGGAGCCGGTACTCTAAAACACCCTTATTTCGGCACAATTGAGGTGTGTGTTGATGATTATCGGTTACGTCATTCCACATCTCATCAGCGTATTACCCGCTTTGATATTAACTTTGTTCCGGCACAAGAGAACAATGCACCGGAAATTACCGAAGATACGGCCTATTCGGTACTCCAAGAGTATCAGTCGGTGTTGGATAGTCTTGCAGAAGAGTTTGCTGACTCTATTGCCAATGTGTCCGGTTTTATCGACTCAATGGTGGATAACCCGTTATTCCGATTGGCTGATACCACAACCGGCTTTATTGCGACCGTTTTTGAGGGCGTTGCCAATACGGTTAGTGGCTTGACCGAAATGAAAGACAAAGCCTTGTCGATTAAAAATAACCTATATGGCTTGCTACTTACGCCGAAAGTATTGGCAAAAGAACTTCAAGACTTAACTCGTCTGAACGTTAAAAGCACGGTCAATGCTCAACGCCAATTTGTGCAGCATATTGTCATCACTGACTCTATTGATACTGCACTGAGTAACTTAACCAGCGGTAAATTGGAGATCACAAAAAGCACACTGGATGAGATGGTGGCTGCTAAAACCAACAATGTGAGCGAAACAGATATTTTAAGCCGTCAGTTTAGCAACTTGCACGAGCAAGAAGTATTTGACGCTCTGATGAATAAAACGACTTTTTTGCTCAAACGTTTGGTGCTCTCTACCCTTGCAGTGGAGTATGGCAAAGCTATCTCTGATGCAGTGACGGAGTCGGTTGCACAAAAGAGTGTGACCGAAGATACCGTTGCGGGCTTGATTGAGTCAAAAGCCGATGTAAAACGTTATATCGCGGATGTGGATGCACAGTTGGAAAATGTGATCTTAGATAATGCAGACGCAGAACAGTGGACGAGTTATCAAGCCCTTGAAGCCTATCGCTTAACGCTGCTCAAAGACTTACGTGTCCGCGGTGAGCGATTGGCAAATGCAAGCGAAATCACGCTGAAAGATACTTATCCCGCCGTATTGCTGGAATATCGGCACACCGGTAATGCGAAAAGCTGGAAGCGTTTGGCATTGCGTAACGGTATCTCCCACCCACTCTTTTGCTTAGGCGGGACAACTATTGAGGTATTGCAATAATGACAACCCCACAAGCCAACATTGAGCTCTATTTAAACGGCAAAATTTTTTCCGGCTGGAAAACAATTAACGTCCAACGTTCGCTCGAATCAATGAGCGGGCGTTTTGATTTGGGCGTGGCAGTACGTCCCACTGATGATATGTCCGGACTTGCTGCCGGCTCAGCATTGGTGCTTAAAATCGGTGGGCAGCCAATCATCACCGGCTATTTAGATGAGCGAAAGCAAAGTATTGATGGGGCAAATAAAACCATCCTAATCAGCGGCCGAGACAAAACCTGTGATTTGGTGGACTGTGCCATTATCCACAACAGCTACCAGTTTAAAAATCAAACCGCTAAGCAAATTGCGGAGGCAATTTGTAAACCTTTTGGCATTAACGTGGTGTGGTCAGTCAATACGCCCGAAGCGAATGAACGCATCCCGGTTTGGCAAGTCGAGCCTGGAGAAACGGCGTTTGATAATCTGAGTAAGATCGCACGACATAAAGGCGTATTAGTCACATCTGATGTCAATGGTAACTTGGTTTTTACCGAACCAAGCACCAAGCACGCCGGCGAATTAACACTTGGCGTGAATTTGTTGGAGTTGGAGCAAACTGATAGTTGGCATCAACGTTTTTCGCTCTACCGTGTCATTGGCGATGCCGAACAAGGCGGAGAGAAAGGCGATGTAGAGACCAAAAACAAAGCTACAAGCGGTAAAAACTCGGCAAAATCTGACAAAAAAGGGAAAGCAGAAAAAGATAATGTGACAGAGTTTAAAGAGTTTATTGGGAGCAATGAATGAGCGCAAGTGGTTTAAAAGTTGAAGTGACTGATAGTGAGATTAAACGCTATCGCCCTACTATCATTATTGCTGATGACAATATGACAGGAGCAAGCGGTTATCAGCGTGCCGATTGGGAGCGTAAACGCCGTGCGGCAGAAGGTACAAAAGAGACAGCCAAAGTGCGTGGGTGGTTTAAGCCTGACGGCTCATTGTGGCTACCCAACGAGATAGTTGTATTGGATGCACCTTTATTTGGGATCAACAAAGTTGAGCGTTTAGTGGTTGATTGTACCTACACACTTGATGAGAGCGGTATGCTAACCGTAATGACGCTAATGCACCGTGATGCGTTTGATGAGCCCGCTGATGAGACATTAGATGATGTGGATGATGCAAGCGGTTCAAAAAGCACTAAAAAAGGCAAATCTACTGGCAAGAAAAAAGGTGCAACGAAAAAAGCGAAGAAATCCGATAAAGACAATGTCGCCGAATTTACCGGTTTCATTAAATAGAATTTAAAGGGGATTTATGCAGGCTTTAAATCGTATGATTGCTCCAATTAAACGGGGATTACAGTTACTTGTGAGCCGTGCCGTGGTGTCTGTGGTCAATGATGCTTACGCTCGGCAAAATTTACAACTCCGTTTGCAATCTGACGAGGTGGCTGATGATGTGGAGCGTTTCCAAAATTATGGACACTATTCCGTACCCAAAGCCGGTGAAGCGATTGTGGTATCAGTTGGAGGTAAACGTTCGCATTTGGTTGCGGTGGTGGTTGATGATAAGAGTGTTCGCCCTGCTGGCTTGATTGCAGGCGACTCAGTATTGTACCATTTAGAGGGTCATCATCTCCGCCTGACTGAAAACGGCGAAGCCATCTTGTCTTGTAAAAAATTAGTGATTGAGACTGAAACGCTAGGCTGCTCTGCAACAGAAATTACGTTTGATAGTCCACAAACCACCTTTACCGGCGATGTGGATATTATGGGAATATCAACAGCAGCAGATCATCAATCTGGCGGAATCAGTGGAAAAGACCACGACCACGAGCAAAAAGTAGGTAAACCTGTTTCCGTCTAAGGGAGCAGAGTTTGTCGGATTTAGCTTTACAATGGCGTGACGGTGAGGGCGACTTAGTTTTAGATAACGAGTCGCTTTTGCTTGATGATACCTTAACTAATGCCATTATCATCAGCCTATTCACTGACTTGCGTGTCGGTAACGAGCGTGGCTGGTGGGGTGATTCTTACAACACTGATGACTATCAAATGGGCTCAAAATTATGGACTTTGAGCCGGTCTAAACAACTCCCAGAAATCCTTGATGATGCCCAGCGCTATGCTGAGCAAGCCTTAAAATGGATGATTGCTGATGGTGTGGTGCGCAGTTATCAAGTGGTTGCATCTAACCCTAAACATGCTGTCCTGCTGTTAGAGATTTCCGTGGTGTTGCCTGATGGCAGCACCGAGCAACGAACCTTTAACGCAAGCTGGAGTGTGTGATGGCATATCAATCCCCAACCTTATCCACCCTTATCCGACAAGGCGAACAGCAATTCCAGCATCGTTTCCCATCGCTCAAACGCAATAACGTGCTCACGGTAATAAACCGCATTTGTGCGGCATTAAGTGCCGGTGAGCATATGCACCTTGATTGGCTCGCACGGCAAATTATCCCGACCACCGCGGAAGAAGAATACCTGATTGAATACTGCCTTTACAAAGGCATTGTTCGCAAGCAAGCGACTAAGGCCTCTGGCGTGATTACCATTACTGCAGCCCGAGAATCAACCATTCCAGCAGACACAGTATTTGAGGATAGCGTAACGGGGCTTTCCTTTGTTACCACGGCAGAAAATATCGTGAGTGCCGGAAACAGTGAGATTGCTGTGCTGTGCGAAACAGAAGGGGCAGAAGGCAATTTAGCTGTTGGCACATCATTAGCTCTCACCTCTGCGATTTTAGGAGTGCAATCGACTGCCAAAGTCAAAGCGATGACAGGTGGTGCAGATATTGAGCCACTGTCTCGTCTATTGGCTCGTTTGATTTACCGAGTACAAAATCCGCCAGCTGGCGGTGCTCCGCACGATTATGTGCGTTGGGCCACGGAAGTCGCAGGTGTGACTCGGGCATGGTGTTTTCCGCGTTATTTAGGCGGTGGCTCTGTTGGTGTGGCATTTGCTTGTGACGACCGTGATGACATTTTGCCAACTGCGGAAGATATTGAGCGGGTCAAAGCCTACATCAGTGGGCACAAGAACGAAGCAACCGGACAATTTGAAGGGATGCCTGCAAATGTAGAGCTTTATGTTTTTGCACCACAATTTCAAACAGTCAATTTCTCGGTGCGTATCTTACCTGATACCGCAACACTACGGCAGGCAGTCCGTAAGAGCCTGCAAGCTTATCTGTCAAATGCTGGTGTTGGTGCGTTGCTCTACCTCTCTCAAATTCGGGCCGCAGTATCAAATACTGCCGGAGAAGTGGATAACAGCGTGATTTATCCGGCTAATGATGTGCAGTTGCTCAGTGATCACATCCCTACTTTGGGAGAGATTACATGGCTATGACAAAAATGCAGTATTTAGATGCTGCGGTAAAACTTCTTCCCGTTGGACTCGCCTGGAAACGGGCTTTAGATAGTCACCTTGCCAAAGTGCTTGCAGTGCGATGTGACCAGCTCGTAACAGTAAACACACAAGCTCATGATTTAATTAAAGAGCGAATGCCAGGACAAGCTACGCTATTGCTTGAAGAGTGGGAAGGTTTTTTGGGCTTGCCTGAAGCCGGACGACAAATAGTTGGTAAAAGTATTACTGAACGACAGGCTCAGGTTAAAGAAAAAGAAGAAGAGCTTGGATCAAGTAGCAAAATTTACTTGGAAGAAGCAGCCAAACGAGCAGGCTATCAAATTGAGATTGTGAACTACTACCCGCACCACTGCTTGCGGGATTGTCTGTATCCGCTTTACGAATATGAAAACGCTTGGCGTATTTTCATTTATACCGAAAGTTTGCCAACCAATCCGACTGATGATGTCGATAAAAATGAAGAATTACAAGAGATCTTAAAACGCTACTGCAACGCAGACGTGGAGATGGTTTTTATTTATAAGGACACCAAATAATGTACTCATTGGACAATAAATCTGGCATTAAGAATATGCCCCCAATCCCGGAGACTTTTAGTGACACGCCGTTGTGGTTTACCGAAGGGCGTGACGGCAATTCGCCGAGCTATCCAGGAGCACATTGGTTTAACATTGTGCAGGCAGAATTGCTGAATGTGCTGAAAGAAGCGGGAATTGAGCCTGAGAAGAAGTCGTTGGATCAGTTATGGCAGGCAATTCAGACTATCAACCGTCGCCGACCATCTCTTACTAAAAAGCGAATTAAACTAGACATCCCTTTAGTTTTTGACGGGTACGAAAGTGAGTTATCGGCAGCTAATTTAACGGATGCAACAGGCTATATCTATCCAGGCTCTTTTGCGATTGATGAACAAACAAATGAGCTGGTTATTTTGTATGGCGGTAGCTGGGACAGAGCTCCAATGTATCTTGTTGCTCGGGATTTTGATACTGGCGAGCAAAAATGGTGGGTTAAACTCAATACAACATCAATCGGTGAAGGTATTAGTATTAACTACGACTATGGTTCTCGAAAAGCGTTTATTGCTGGTCGTCAAGATGGTTTTCTGAACGAGTTTGATTTATCAAACATTACAAGCGGCACAACGCTTGATATTACGGCAAGTTATAACGTAGGCGTCTATAATCAGTTTAGTTATGACAATGGCATTTGGGCATTTGAACGCAATGCTCCTTTTATTGCTGGATTCATCGCTCGAAACACGATTGATTTTTATGATAAGAATTTCAACTTGCTCAACTCTACATCATTACCGATGTGGTCTAGTGGTTACGTCACAAAGACAACCAATGACTATGCAAAATACTTGCATAAGCGTCAAGGATTCGCACTCAAAGGCGATAAATTGTACTGTGCTTTTGGTGGAGCTCACGATAACAACACACCTGCGGTATGCACCGAATATCAAGGCACGAAGATTTTCAATTTAGCTGGTGATTGTTTAGAAGAGGCAATGTTAGAACCTATTGCTATGCGAAAGATTCTGACTAAGCATCTAGGCAAACAATCAGAGCTTAGACGAATTGAAAATGAAGGTATTGTGGTTACTTCAAAAGGCGAAGTTTATACACTTTACATTTATCATTCGCGGGCAACATCATTCGAAACGCGTAAGAAAGAAGGCATTGTCATCTTTCAAGAGCTAACAGATGCGGGCGATTGTGTAGATTATTCAGCGGCACTCACATACACCGCTCAACCTGATTTTATGAGCCTACGCCGTATGCCACGTGGAACAAGTGGCAAAATGATCGACCCTTTAACGGGCAAGGAAATCACTCAGATGAGTGAGATTTTTAAGTTCTTGAGAGAACTAAATATATCTGACGTATTATTTGATACAGGCGGTTTTACCAACATTACAGATATTGATGGCGAGTTGTTAAAAAGTGGTTTACTTGTTCGAATCGTTAATCAAAACACCGTAATGTATGTAGAAATTACAAGCCGTAACCACTCAACACTTCATGAATTGCCGTCTTCTTATGCAGCAACGTTAGCGGCAGACGGGAAAACGTGGACAAAAAATAAATGCGACTTGTCTATTGGTGGGGATTTAGTATTCGGTAAAAACCCTGATGGGAAATCAACCATTCTGGCTAGATTGAGTACGAGAAATTACACCAAAGGCAAGAACATTCTTTTTGCAGATGTTCAAAGTTCAGAAACGAACAATAATGCTTTCATCGGCGGTGGCTCTAGCCTTTATGAAGGAGTTAATCAGCTCAGATTTTTTACTGCGGAAAACAAAGGTGAAGTGGGAACTGCTCGTTGGGCGATTTTGAATAATGGGCATTTCATTCCTTGGGGGAATGGAGTTTACGAGATTGGAGCTAAAACAAATCGACTAAAAAGACTTTATACTCAAGATATATCCATTGCCAAAGATGACTCAAGTCAGGCATTGATTCGCATTGCAAATGCTTTAAGAGAAATCACAATAAATGTTTCTGCTAGCGGAAATGCGGGAATTTGGGACAACAATTTAGCTAAATGGTTGTTAGTTGCTGGCGATGATGGAGTATTGAAAGCAGGTACAGCACCAGTGATTACTGCGATAGCAAATGAGCTAATTACTGCGGGTTGGTTTAAGTCTCAATTTACTGCATCTTTGTCAGGGAATGGCTGGCAAAAGCTACCGAGCGGCTTAATTCTTCAATGGGGAACATATAACGCTAATGTGGAAAACACATTCAACTTTCCGATTGCGTTTACTCGAGAATGTTTTGCGGTAATTCCTGTTGATTATAATACTTCAGGCTCAAACTTAGTTGATATCACAGGCACAAATAAAACAGCAACATCATTCCAGATTTTATCTCAAGGTGGTGATATTGGTGCATTTTCGATGGTCGCAATAGGAGTATAAATGCAATATTTTTATGATTTATCTCAAAAAACTTTTTTAGTACAAGGTATTCATAATATTCCTCAGTCAGCTATACCAGTACATGAAAAAGACTATCAGCTCTTAATTGATGGAAGATCTAAAGGACGAGAAATTGTCTTAATGGGTAAAACTTTAACTCTTACAACTCCACGTCCATCAGCTTATCATGAGTGGGACGGCACAAAGTGGGATATTGAGCAATCTCAACAGGCGATTAAGCGGGCTAAAGAAATTGCGAGAATGCGAGAGACAATTAATGTCTTTCGAGACCAGAAAATCAATGGCGGTGTATATGTTGAGAGTGTTGGTAGGTGGATTGATACTGATGCTACAGCAGAACGCAATCTACTTAGCGTAAAAAGTAGCTTTGACTTATTTGGTGATTCGGTTGGCGAAATTGCGTGGACTTGTGCTGATAACTCAACCTTGATGATTGATAAATCCAAATTAATGCTAATTTGGCAGGCTCTCATGCAAGCCAAAACCAGTAATCACGCGAACGCTCTCCGCCATAAAACTGCCGTAGAACAGTCGGAAAATCCGTTAGAGTATGATTACTCTGGCGGTTGGAGTAAAACGTATCAGGATTTTTTAGTGGAGCAAGGAAATGAGTAAAGTGTATTTAGCTCTCTATAAAGGTTCGGGCGGTGGCCTTTATGACTGTTTTACCGATTGGCTTATCCGCAAAATCACAAAAGGCACATATTCCCATTGCGAAATTGCCGTGCAAAAAAGTGAAATCAAAGATCATTATCGCCGTGAGGAGTGGTTTGAATGCTACACCTCAAGCCCTCGTGATGGTGGAGTAAGACAAAAAGTTATTCCACTTGATGACGGCAAATGGGATTTGATTGAGCTTCCAAATCTAACGGAGCAAGAGGTAAAAGCCTATTATGAGCAAACCAAAGGCAAACCTTACGACTGGCGAGGTGTATTAGGGATTGCATTTGGTATCAAGCAAAAACAGGATAAGTATTTCTGTTCTGAGTGGTGCTTTAACTTAATCAAGCAAAGTAACGAGGGCTGGCGATTCAGCCCAAATCAATTAGCGACAATTTTCAAAAAGGGATAAAAAATGAAGAAAACTTTAACCGCATTAGCTGTTGCTTCTTTAGCTTCTGCAACTCAGACTAAACAGCAAGCAAGCAAGCAAGCAAGCAAGCAAGCAAGCAAGCAAGCAAGCAAGGAGTGTGAAATGGCAAAGGTATTTAAACAAGCCCCACTCCCATTTATCGGTCAAAAACGGATGTTTTTAAAACACTTTGAGCAAGTGCTGGCACATATTCCTGATGATGGTAATGGTTGGACTATCGTTGATGTGTTTGGTGGTAGTGGATTGCTTTCGCATACCGCCAAACGGCTCAAACCCAAAGCCCGTGTAATTTACAACGATTACGATAATTACAGTGAGCGTTTACAGCACATTGATGATATTAACCGGCTACGCCGCATTATCGCCGATTTAATGGCTGACACACCTAAATACAAGCGGTTGGATAATGCCAAAAAATTGCAAATTATTGAAGCGATTGAGGCATTTCAGGGCTATAAAGACCTGCATATTTTATGCAGTTGGTTGGCATTTAGTGGTCAGCAAGTTAGCTCTTTTGATGAGCTGTACAAACAAAATTTCTGGCATTGTATCCGCCAAAGCGATTACCTAACCGCAGATGGCTATTTAGACGGAGTGGAGATTGTGCGAGAGTCGTTTCATCAACTTGTACCACGCTTTACAGGGCAACCGAATACGCTGTTAGTGCTTGACCCACCGTATCTCTGCACACACCAAGAGAGCTACAAGCAAGAGCGTTATTTTGATTTGGTGGATTTCCTACGCCTGATTCATCTAACCAAACCGCCTTACGTCTTTTTCAGCTCTACCAAAAGCGAGTTTGTCCGCTTTATTGATGCAATGGTTGAGGACAAATGGGACAACTGGCAGGCTTTTGATGATGCACAGCGTATAGTAGTGCAAACCTCAGCTAGTTATAACGGCAAGTATGAGGATAATATGGTTTATAAATTCTAA
Protein sequences of DBSCAN-SWA_3 >NC_020515|1852002:1903953|1880620_1881190_+|WP_015433143.1|DBSCAN-SWA MAEKNTRGRASKVDLLPPNIKTQLAMMLRDKTFSQAEILAEINDLIRDCGLDESYCLSKTGLNRYASRMEQMGAKIRQSREIAEIWTKQFGEAPQSDIGKMLMEIVKNIAFETSLGLSENGQADPKSIALLSSAVQRLEQAESLSFKREQAIRKEVAQQAAETAEKVVVQAGLSAETVRTIKEQILGIA >NC_020515|1852002:1903953|1879333_1879606_+|WP_025328929.1|DBSCAN-SWA MVLDQIIPLVALAGSVASFIGYKSWQLAKERKANQKLSEQNQQLQAEKAVAEAQVKNHQVRKQNEENISGISRGSIIAELHKNGDLRGDE >NC_020515|1852002:1903953|1879961_1880288_+|WP_015433141.1|DBSCAN-SWA MIEVFEVIKAHWGIILTLTGLLASVFWLKLDSRYAKKNDIGKLLEVAQNHEGRLSGLETKVDNLPTAVDMERLKTLVTDVKGDTKATGKQVDSISHQLGLLIEAKLKE >NC_020515|1852002:1903953|1853370_1854252_-|WP_015433105.1|DBSCAN-SWA MQNNLYQWPCANAIYPDRGNKSYRFKRLRFRLRSLMNLRSVQAFEQYVNQNPDLAAFLNTRANYSYPLVHRFLDKRFNAKQRYQAMVDNLTFLPKSLWQQSISFGEVVPEFSLYLNINEQQPMEGFWALELRHSETNQLIYVLTFGKLADSLLIAVIQGPNFDGSKELVKKLTKKCHGLRPAYLMVECMKALTRVLGYSTLLGIPQKYQNKSRLIQSKRYVVDYDAIFSESGASLAEYWLLPLALNNKDLSDIPSNKRSMYRKRYAMLEQIQAAMHEQLAGLQAVHIEPKSAN >NC_020515|1852002:1903953|1886420_1887476_+|WP_015433148.1|DBSCAN-SWA MKANKHPLAVLTAQLTSPDGWQQLLPKGKFRARDGRPADVPHWYLDAEIAKRLIHRAKTLRQDILVDYDHATLLKAKKGDDAGNVVAAGWFNNVEMQWFDDDERQGLYIKPRWTPKAYQQIKDGEFAFLSAVFPYDDNGEPIEIRMAALTNDPGITGMQRLAVLSAVTNQQETAQMGKLRTLLSKLGIEIAEGTEITDEQAEAALKALDTLQTDKATAENQVVALSAKEVDLTAYVPKSTYDAVVAKVAVLSAKNDEVEIDNTITKARNEGRVIEAEVEYLKGFGKQQGVAALSAMLAQRPQLAVLSAQQTETTKVEKQVKGEAVLSAADKEAARLLGINETDFAKELEAK >NC_020515|1852002:1903953|1864206_1865274_-|WP_025289718.1|DBSCAN-SWA MKLSKLMSGLVLGMAMFGSIAAQASSEKLIIAHRGASGYLPEHTLESKALAFAQRADYLEQDLAMTKDNRLIVIHDHFLDGLTDVAKKFPERKRADGRYYVVDFTLDEIKSLEMTENFEMKEGKQVQVYPNRFPMWKSHFVLHTLEEELEFIQGLEKSTGKRIGIYPEIKAPWLHHKEGKDIALETLKVLKKYGYDNKDAPVYLQTFDFNELKRIKTELLPQLGMDLKLVQLVAYTDWHETEEQDKDGNWVNYNYDWMFEDGAMAEVAKYADGVGPGWYMLIDDKQSKVGNIVYTPLVKELEKYKMELHPYTVRKDALPEFFTNVDEMYDALLNKAGATGVFTDFPDTGVEFLKK >NC_020515|1852002:1903953|1895613_1896402_+|WP_015433159.1|tail|DBSCAN-SWA MTTPQANIELYLNGKIFSGWKTINVQRSLESMSGRFDLGVAVRPTDDMSGLAAGSALVLKIGGQPIITGYLDERKQSIDGANKTILISGRDKTCDLVDCAIIHNSYQFKNQTAKQIAEAICKPFGINVVWSVNTPEANERIPVWQVEPGETAFDNLSKIARHKGVLVTSDVNGNLVFTEPSTKHAGELTLGVNLLELEQTDSWHQRFSLYRVIGDAEQGGEKGDVETKNKATSGKNSAKSDKKGKAEKDNVTEFKEFIGSNE >NC_020515|1852002:1903953|1860432_1860981_-|WP_015433112.1|DBSCAN-SWA MADFTEQTLPRRQFLRGQFLNTLKTDQVKRQGQNAIRPPWADLVNFLQKCTACDACINACEMQILKRGAGGYPEVDFSVGRKECSFCAACVNVCQQGVFRVTSEIAWEHKIEIQQGCLTHIGVECRACEDSCEQRAIRFKRTVGGIAKPILNLDNCNGCGACLASCPSDAIQVLQEGRKNAE >NC_020515|1852002:1903953|1885755_1886190_+|WP_015433147.1|DBSCAN-SWA MDLEFKFDTTEIQNKFKKLAQVVDGREITRKVANVLLQEAEAAFDNEKSSEGEPWAKLNQDYKKRRYDKGYTGNILQVTGDLVKSLNIDYGDSFAVIGAAEPYGQYHQMGTSKMPARPFLGLGDDGVAEIKAILHRELSKIMQS >NC_020515|1852002:1903953|1855561_1856437_-|WP_015433108.1|DBSCAN-SWA MAVANSPKEAGKEAREKLGLWRANKWLILRRLSQLSVILMFLSGPMWNVWILKGNYSGSMLFDLIPMSDPLITAESLMTGYLPEWKTLLGAVIVIVAYALLASKAFCSWICPLNIVTDAAAWLRRKLGIRQTLKISRNLRYVILAVVLIGSAVTGNLIWEWINPVAALGRVFVYGLGATLWLVLAVFLFDLLVAEHGWCGHLCPIGAIYGVIGAKSIVRVKVVDREKCDRCMDCYNVCPEPQVLRSPLHGKAEDSTIVLAKDCISCGRCIDVCAENVFAFSNRFAKEKRII >NC_020515|1852002:1903953|1870142_1870814_-|WP_025328931.1|DBSCAN-SWA MGNLKEWFSANELAGIEGLPTSPQGINKKARTQEWKKKSRDGVQGGAVEYHYSSLPESVQKALGFAQTMAREEPRALVAAQTADDMERVPFYDVQASAGFGAFNNEVYAPDDYIGLSKRWLEVRGLYLNRLMFITASGDSMYPTISDGDMLLINRASTQPKDGKIYVFRQNDQLWVKRVQGIIGGIRLISDNKELYSPVDVMFDDALDFEVIGQVVYIGHEMI >NC_020515|1852002:1903953|1868476_1869754_+|WP_015433120.1|DBSCAN-SWA MSNIQQLIEQAKQQSHQPLPMHGFDESFESCIKCTACTAVCPVSRVNPAYAGPKQSGPDGERLRLKSAEMYDEALKYCTNCKRCEIACPSDVKIGDIIVRARNRFVERQNKPLMHKLRDAVLSNTDIMGRLNTPFAPIVNTITGLKVTRFLLDKTLNVSRHRTLPKYSFGSFRNWYLKKEAERQAFFREKVAYYHGCYVNYNNPELGKDLITLLNAMDIGVVLLEKEKCCGLPLSVNGFPERAKKQAAFNIAQIEQTIDKRALEVVGTSSSCTMNLRDEYHHILGMDNAKIRPHISIITKFLSRKFAEGRVPTFRSMPLRVAYHTACHVEKAGWAPYTLDLLRQIPDLEVVTLPSQCCGIAGTYGFKSENYDSAQAIGKSLFEHINGGGFDFVISECETCKWQIDMSSQVTCLHPVTLLAMALKK >NC_020515|1852002:1903953|1871232_1873236_+|WP_015433124.1|transposase|DBSCAN-SWA MKEWFSANELSGITGMPSSPQGVNKKAQREGFQKQQKIGTQGKAYEYHITSLPAETQQALRLEEARALSAKSFVPEVRPNTALWAEFDNESGAKKAKAEAKCRAVMALKNELQFSPIERALVEVANRFEISEGSLKNWYYKVKAHPESDWLALLLNRSGKSKIKEKAAFTSEAWLYFKGDYLRSNAPSFATCYYRLQLAAKANGWEIPSKNTVLRRLNAEVDTLTQTLMRKGEYAVQSLFPHQVRTVEHLAALEIINGDGYQHNVWVHWNEDDPDAKPIRPKTWYWQDVRTRRILAYVVDDSENADQLRMSLKILLEKFGLPKQLTLDNTVAASNKQLSGYSKNRKRFKQVAGSEIDPATGKPREVKGIFEMLGLKISRTDIICGRGNGQAKPIERCFQELEELIDKHRDFQGYYTGSDPDSQPDDYQYKVGVDKATFLKRVEEGIKAYNARPNRRNEICQGRYSCDEVWARDFALTTVAKPTVSQLSMLMMVSESTKLEKKYGLANGAFRLKAGGAKFSGGYNRYAAEELIGSKLDYVVVRFDPYHLHDDVYVFDTQDRFLCKAKCIDNIAFDDTEKARQHKRAKTQMVKAVKAQAKAVERMNAIEMAAVSPELEEVAEVKPILKPLYGFDGSAALKPQAVELDDEDEQESRFAKGVQWLKATMAK >NC_020515|1852002:1903953|1861212_1861338_-|WP_015433113.1|DBSCAN-SWA MQVLNSLKSAKSRHPDCKIVRRKGKLYVICKTNPRFKARQG >NC_020515|1852002:1903953|1855022_1855445_-|WP_015433107.1|DBSCAN-SWA MKKYLAFLLVALSPIAMANNIPSSLEGAAEAISPEWHNMPKDSGKHQLTFVNQPPMIPHSVKGYQVTKNTNQCLNCHGVENYRTTGAPRISPTHFQDRDGNVTGDSSPRRYFCLQCHVEQADVKPIVENKFQSTKAFGGK >NC_020515|1852002:1903953|1875712_1875889_+|WP_015433131.1|DBSCAN-SWA MMSELHRDLLEAQLCEAIVQLQQAQTALQNNQFIHASIYVSNVQNQLPQMRQKLTQLH >NC_020515|1852002:1903953|1896398_1896896_+|WP_015433160.1|DBSCAN-SWA MSASGLKVEVTDSEIKRYRPTIIIADDNMTGASGYQRADWERKRRAAEGTKETAKVRGWFKPDGSLWLPNEIVVLDAPLFGINKVERLVVDCTYTLDESGMLTVMTLMHRDAFDEPADETLDDVDDASGSKSTKKGKSTGKKKGATKKAKKSDKDNVAEFTGFIK >NC_020515|1852002:1903953|1873332_1874253_+|WP_015433125.1|DBSCAN-SWA MTPIEQIKQILDDGVISQAKLAKEAGINPGALSSYLKGNYAGNSENLEQALSQWLARRETKQQRFVQAPDFIQTATATQIHNAFEFARILGTIATVYGMSGAGKTRAAQEFKRNNQNVWIVTASPSRSTLSEILYEMALEIGLTDAPRRSGMLSRLIMKKLTGTQGLMIIDEADHLPYQALEEIRILQEESGIGFVLIGNDKVYTRMRGATHQAHEFARLWSRISKHVSIQKCKKNDVVAIANAWGLDTNDQEMMSLLSEIGAGGGGLRSLTQTLRLAGIHAKGQDSVITRDLVLAAQAELGGKNG >NC_020515|1852002:1903953|1876459_1876933_+|WP_015433133.1|DBSCAN-SWA MEKPKLIQLVKIGQNQLNMCDEDYRTMLQRLTNKKSATKLTVVELHKVIHELQQKGAKITLFARKKAKPSDYSPATGERPVKSEITHKIRAVWIAMGKAGMLRDSSEKALNIYARKVFKHRSPMLLNVGALDDREATQLLEMLKKWQKRVEKERGNE >NC_020515|1852002:1903953|1882764_1884330_+|WP_025328928.1|DBSCAN-SWA MRSSTILDIHGNPFRFEESVQTENESRLMQLQHHYSEHPASGLTPAKAARILREAEQGDLIAQSELAEDMEEKDTHLQSELGKRRGAITAVEWRIRPPANASAAEQRDCEMIEEILRDAVWLDDCIFDASDAILKGFSCQEIEWESGLIGGLKLIKNVHWRDPAWFMTPTLERNTLRLRDGSAQGVEMQQFGWIKHIARAKTGYLSRIGLVRTLVWPFLFKNYSLRDFAEFLEIYGLPLRLGKYPEGAGDKEKQTLLRAVMSIGHNAGGIIPRGMEIEFQKAAEGSESTFMAMIEWAEKTMSKAILGGTLTSQADGVTATNALGNVHNDVRKEVRNADLKRLAATLTRDLVYPLYALNCKSYNDARRIPRFEFDVAESEDLNAFADGLNKLVDIGFKIPKQWAHDKLQVPIAAEDEEILVKNTQNPTAYLSARADKKIAVLSATPDPDYLIEQLEPTVEEYQEIIDPMLKPVVEALEKGGYEFAQERLATLYAEMDDSELEKLLTRAIFVSELLGKANAKR >NC_020515|1852002:1903953|1902531_1902996_+|WP_025328925.1|DBSCAN-SWA MSKVYLALYKGSGGGLYDCFTDWLIRKITKGTYSHCEIAVQKSEIKDHYRREEWFECYTSSPRDGGVRQKVIPLDDGKWDLIELPNLTEQEVKAYYEQTKGKPYDWRGVLGIAFGIKQKQDKYFCSEWCFNLIKQSNEGWRFSPNQLATIFKKG >NC_020515|1852002:1903953|1874574_1874841_+|WP_015433127.1|DBSCAN-SWA MDDTNDTTLLIGSYAITPPLTWEEIEDGKEYAVIDTTDAGDWCVKLEKFNKDNSYFWTLVQSARVFENTVEAGEFIEALTTLGGKHGR >NC_020515|1852002:1903953|1889848_1890040_+|WP_015433153.1|DBSCAN-SWA MQTFKIKPKAGLIIRDPETFEQLNAKGEEKPQSGYWLKHLKNGDVELVEAKSTEKRKNSTESA >NC_020515|1852002:1903953|1879115_1879334_+|WP_015433137.1|DBSCAN-SWA MAFKELISNADGRLSTTAFIQFFGALLMAIILAYSVYLDRSNVGELFTVFALFCGGQVATKGFANALGRGKE >NC_020515|1852002:1903953|1903119_1903953_+|WP_025328924.1|DBSCAN-SWA MAKVFKQAPLPFIGQKRMFLKHFEQVLAHIPDDGNGWTIVDVFGGSGLLSHTAKRLKPKARVIYNDYDNYSERLQHIDDINRLRRIIADLMADTPKYKRLDNAKKLQIIEAIEAFQGYKDLHILCSWLAFSGQQVSSFDELYKQNFWHCIRQSDYLTADGYLDGVEIVRESFHQLVPRFTGQPNTLLVLDPPYLCTHQESYKQERYFDLVDFLRLIHLTKPPYVFFSSTKSEFVRFIDAMVEDKWDNWQAFDDAQRIVVQTSASYNGKYEDNMVYKF >NC_020515|1852002:1903953|1899509_1901903_+|WP_025328926.1|DBSCAN-SWA MYSLDNKSGIKNMPPIPETFSDTPLWFTEGRDGNSPSYPGAHWFNIVQAELLNVLKEAGIEPEKKSLDQLWQAIQTINRRRPSLTKKRIKLDIPLVFDGYESELSAANLTDATGYIYPGSFAIDEQTNELVILYGGSWDRAPMYLVARDFDTGEQKWWVKLNTTSIGEGISINYDYGSRKAFIAGRQDGFLNEFDLSNITSGTTLDITASYNVGVYNQFSYDNGIWAFERNAPFIAGFIARNTIDFYDKNFNLLNSTSLPMWSSGYVTKTTNDYAKYLHKRQGFALKGDKLYCAFGGAHDNNTPAVCTEYQGTKIFNLAGDCLEEAMLEPIAMRKILTKHLGKQSELRRIENEGIVVTSKGEVYTLYIYHSRATSFETRKKEGIVIFQELTDAGDCVDYSAALTYTAQPDFMSLRRMPRGTSGKMIDPLTGKEITQMSEIFKFLRELNISDVLFDTGGFTNITDIDGELLKSGLLVRIVNQNTVMYVEITSRNHSTLHELPSSYAATLAADGKTWTKNKCDLSIGGDLVFGKNPDGKSTILARLSTRNYTKGKNILFADVQSSETNNNAFIGGGSSLYEGVNQLRFFTAENKGEVGTARWAILNNGHFIPWGNGVYEIGAKTNRLKRLYTQDISIAKDDSSQALIRIANALREITINVSASGNAGIWDNNLAKWLLVAGDDGVLKAGTAPVITAIANELITAGWFKSQFTASLSGNGWQKLPSGLILQWGTYNANVENTFNFPIAFTRECFAVIPVDYNTSGSNLVDITGTNKTATSFQILSQGGDIGAFSMVAIGV >NC_020515|1852002:1903953|1879734_1879962_+|WP_015433140.1|DBSCAN-SWA MSDDVDRINEREEQLLALQLAPHLTQKLSDDEVELIALEGRDCIECGLPIPMQRLRAVPLAVRCICCQQDYEDSK >NC_020515|1852002:1903953|1898943_1899510_+|WP_015433164.1|DBSCAN-SWA MAMTKMQYLDAAVKLLPVGLAWKRALDSHLAKVLAVRCDQLVTVNTQAHDLIKERMPGQATLLLEEWEGFLGLPEAGRQIVGKSITERQAQVKEKEEELGSSSKIYLEEAAKRAGYQIEIVNYYPHHCLRDCLYPLYEYENAWRIFIYTESLPTNPTDDVDKNEELQEILKRYCNADVEMVFIYKDTK >NC_020515|1852002:1903953|1875898_1876324_+|WP_015433132.1|DBSCAN-SWA MAISAEQWKEIKQELDGLIGKVEFRYKEHQLTVQVEKVKRSLELCVYVDGKIKREWIDETHELRPFLEEVWYRKEKYLFSAKLRKEYKGFLSKKELNKKNVVFLPVFPSPTALIRQYKKLDGLALVEIGFGRLLDDKEEIM >NC_020515|1852002:1903953|1862698_1864138_-|WP_015433116.1|DBSCAN-SWA MFGPFKPAPHIAELPADKIDSTYKRLRWQVFAGIFFGYAAYYFVRANFDLAQPGLIEAGLYSKAELGVIGSAAGLAYGLSKFVMAGMSDRSNPRVFLPFGLLLSGLCMTLMGLFPWATSGITIMWIMIFLNGWFQGMGWPPCGRTMVHWWSKSERGTIVSVWNTAHNLGGMMPGAMVLLASALFYSTHGVEATAKDVWQQSLYFPGIAAMICAIPVYFVMRDTPQSCGLPPVEKWRNDYPDNYDEKTAEHDLSTKQIFVDYVLKNRLLWYIAIANVFVYLIRYGVLKWSPVYLSEVKHFNIKGTAWAYTIYELAAIPGTLLCGWVSDKVFKGKRGLTGFIFMILTTAAVVAYWLNPATPEAELASYAAWYENPYQLMDFLLMTLIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGTVAASAVVGWAAETYGWDGGFYVMIGAGVLAVLLTFITMIEEGKHKAKLGETYGQ >NC_020515|1852002:1903953|1861863_1862544_+|WP_015433115.1|DBSCAN-SWA MAQQPFLIAPSILSADLARLGDDVRDVLNAGADLIHFDVMDNHYVPNLTFGPAICKALRDYGIECPIDVHLMAKPVDRLIPDFAKAGANYITFHPEATEHIDRSLQLIRDNGCKSGLVFNPATPLHYLDYVMDKVDVILLMSVNPGFGGQAFIPSTLDKLREVRKRIDESGYNIRLEVDGGVKVNNIADIAQAGADMFVAGSAIFDQADYQAIISQMRQELAKIKK >NC_020515|1852002:1903953|1874245_1874557_+|WP_015433126.1|DBSCAN-SWA MDKRIEIDRPLSHDNAVMYGQVVSLELAILACNQLGIDVERVDYTDWRRPCLIVKANAVTQQMLRQGKAFNYGSRVQNGIRVYLNHAIVNGVKIKWESSDYRH >NC_020515|1852002:1903953|1894249_1895614_+|WP_015433158.1|DBSCAN-SWA MSKITGKGSFRGVPFLIEDEQGQNGGRRIVTHEYPLRNDGLTDDLGKRMRNYSVSCLVIGDDHIQQAEALVDALEADGAGTLKHPYFGTIEVCVDDYRLRHSTSHQRITRFDINFVPAQENNAPEITEDTAYSVLQEYQSVLDSLAEEFADSIANVSGFIDSMVDNPLFRLADTTTGFIATVFEGVANTVSGLTEMKDKALSIKNNLYGLLLTPKVLAKELQDLTRLNVKSTVNAQRQFVQHIVITDSIDTALSNLTSGKLEITKSTLDEMVAAKTNNVSETDILSRQFSNLHEQEVFDALMNKTTFLLKRLVLSTLAVEYGKAISDAVTESVAQKSVTEDTVAGLIESKADVKRYIADVDAQLENVILDNADAEQWTSYQALEAYRLTLLKDLRVRGERLANASEITLKDTYPAVLLEYRHTGNAKSWKRLALRNGISHPLFCLGGTTIEVLQ >NC_020515|1852002:1903953|1878056_1878488_+|WP_025328930.1|DBSCAN-SWA MAFKFRKRIKIAPGITLNLSKKGVSTTIGGKGASVNIGKKGTYLNTSIPGTGFYDRKRLDVPNQEQSNSDEETNFSWSFPKLEEGETFSGLSFKEKCIWVLVLCLHLLACLLHSIGLLITLSWNLIIFVCLAFIFYVIFKLMF >NC_020515|1852002:1903953|1891868_1892252_+|WP_015433156.1|tail|DBSCAN-SWA MELMLKTGLRFGDEPQTVVTLRELTTGDLLDAEVAAERMVMSPDGVPVLVKSPALFGYELIRRQIASIGKIQGPISMKMLRSMTSEDLQLISVYAETWEATKAQQVVERGRLDAAGGEAGKDLSAVS >NC_020515|1852002:1903953|1857513_1859997_-|WP_015433110.1|DBSCAN-SWA MELNRRDFMKANAAAAAAAAAGIAIPIKNVYADDNKIKWDKAACRYCGTGCSVLVGTQNGRVVATQGDPDAEVNRGLNCIKGYFLAKIMYAEDRLTSPMLRMKNGKFDKEGEFTPISWDQAFTIMADKVKSIIKEKGPNAVGMFSSGQSTIYEGYAKVKLWKGGFRSNTIDPNARHCMASAAVAFMRTFGMDEPMGCYNDIEHAENFVLWGSNMAEMHPILWSRISDRVLSNKECKVAVLSTYEHRSFELADTPIIFRPQSDLAILNYIANYIIQNDKVNWDFVNKHTKFKRGETDIGYGLRPEHPLQQKAANAKTAGKMYDSDFEEFKKIVAPYTLEEAHRISGVPKDQLETLAKIYADPNQKVVSFWTMGFNQHTRGVWVNHMIYNVHLLTGKIALPGNGPFSLTGQPSACGTAREVGTFIHRLPADMVVANPKHREIVEKAWKLPEGFISSELGYHAVGQDRALKDGKLNFLWTMCTNNVQAGPNINQERLPGWRNPENFIVVSDPYPTASALTADLILPTSMWVEKEGAYGNAERRTQAWRQMVKGPGESRSDLWQLVEFSKYFKAEEVWSDEIMAKVPEYRGKTLYEILYLNGEVNKFQVPSDIPEYLNDEAEHFGFYLQKGLFEEYAQFGRGHGHDLADFETYHQVRGLRWPVVDGKETLWRYREGYDPYVKAGEGVNFYGNADGKAVILGVPYEEPAESPDEEYDLWLCTGRVLEHWHTGTMTRRVPELHKAFPNNLVWMHPTDAQKRGLRHGDKVKVISRRGEVISHLDTRGRNKCPEGLIYTTFFDAGQLANKVTLDATDPISFETDFKKCAVKVVKA >NC_020515|1852002:1903953|1876929_1877472_+|WP_015433134.1|DBSCAN-SWA MKLCRCPICHSDLHLEALIEDDAGRELLGKISQLDKGCASPLVAYLGLFKPAKSNLSNSRALKLFNEVLELFEPSKLLAHCLSETVQAVRKKRLNGQKAEPLTNHNYLKSVYDTQKATFTTHPATRAETPKAKMQEDKTRTAIEYIERYALAGQLDYVKHQPEYQIWLNYKNQKEQKNSS >NC_020515|1852002:1903953|1871003_1871222_+|WP_015433123.1|DBSCAN-SWA MTTSNKKQDMHRADIVAAVRKAGTTLAKLSTEAGLHPRTLNNALERKYPKGESIIANAIGKTPQEIWPSRYE >NC_020515|1852002:1903953|1901903_1902539_+|WP_015433166.1|DBSCAN-SWA MQYFYDLSQKTFLVQGIHNIPQSAIPVHEKDYQLLIDGRSKGREIVLMGKTLTLTTPRPSAYHEWDGTKWDIEQSQQAIKRAKEIARMRETINVFRDQKINGGVYVESVGRWIDTDATAERNLLSVKSSFDLFGDSVGEIAWTCADNSTLMIDKSKLMLIWQALMQAKTSNHANALRHKTAVEQSENPLEYDYSGGWSKTYQDFLVEQGNE >NC_020515|1852002:1903953|1865519_1867211_+|WP_015433118.1|DBSCAN-SWA MVLSPQLYKNAGDFSPISTDVIIIGGGATGAGMARDCALRGIDCILLERRDIATGATGRNHGLLHSGGRYAVNDRESAEECIKENQILRRVASHCIEETEGLFITLPEDDLNYQKTFIDACNASGIEAVAIEPDLAKRMEPSVNPSLIGAVVVPDGSIDPFRLTAANMLDATERGAKVFTYCEVKGLIREGGKVIGVKAYDHKNRVERQFFAPIVVNAGGIWGQGIAEYADLKIRMFPAKGALLVMGHRINNMVINRCRKPADADILVPGDTICVIGTTSDRIPYDQIDNMVVTPEEVDILFREGEKLAPSLRHTRVLRAYAGVRPLVATDDDPSGRNVSRGIVLLDHQERDGLEGFITITGGKLMTYRLMAEWATDLVCKKLGKTERCTTHERPLPGSDEPRVETNRKITSLPNTLRYSAVYRHGARTPKMLENERLDKSLVCECEAVTAGEVRYAVEELSVNNLIDLRRRTRVGMGTCQAELCACRAAGLMSRFKAATPRQSTVQLASFMEERWRGIEPIAWGEAVREAEFSSWIYYSLLGLNDVKPLENQAQQGTDDNEF >NC_020515|1852002:1903953|1854441_1855023_-|WP_015433106.1|DBSCAN-SWA MKGLFCRLWKWAKSPSKMAVGLVIILSALGGIFFWAGTNTALEYTNTEEFCSGCHMNDVVPEYRASPHFMNRSGVKADCADCHLPHEFIPKWTRKFRAQLEVWAHFTGKVDTKEKFEAHRLEMAQREWARMKANNSQECRNCHNFADMDFTQQKGVAAKMHAMAEQEGKTCIDCHKGIAHSLPHMEGVSKFGE >NC_020515|1852002:1903953|1897475_1897886_+|WP_015433162.1|DBSCAN-SWA MSDLALQWRDGEGDLVLDNESLLLDDTLTNAIIISLFTDLRVGNERGWWGDSYNTDDYQMGSKLWTLSRSKQLPEILDDAQRYAEQALKWMIADGVVRSYQVVASNPKHAVLLLEISVVLPDGSTEQRTFNASWSV >NC_020515|1852002:1903953|1856436_1857351_-|WP_015433109.1|DBSCAN-SWA MKLNPNRRQFLKDVGRTAAGVCGVGIVLAMQQNQSLAREGVALRPPGAIADDKEFSAACTRCGQCVQACPYDMLHLASLLSPVEAGTPYFTARDKPCEMCPDIPCMNACPSGALGGLEEIDQARMGLAVLLDHETCLNWQGLRCDVCYRVCPLIDKAITLERVHNDRTQIHAKLIPTVHSDACTGCGKCEQACVLEEAAIKVLPMDLAKGLLGRHYRLGWKEKQNAGKALLEDYHPDGLRPAFEARMPEGQNQPVYQFMEVKPGVKVSTPSRATMDYVPNRTTVEAPILNPIDLDNVPSKLGGK >NC_020515|1852002:1903953|1897885_1898953_+|WP_015433163.1|plate|DBSCAN-SWA MAYQSPTLSTLIRQGEQQFQHRFPSLKRNNVLTVINRICAALSAGEHMHLDWLARQIIPTTAEEEYLIEYCLYKGIVRKQATKASGVITITAARESTIPADTVFEDSVTGLSFVTTAENIVSAGNSEIAVLCETEGAEGNLAVGTSLALTSAILGVQSTAKVKAMTGGADIEPLSRLLARLIYRVQNPPAGGAPHDYVRWATEVAGVTRAWCFPRYLGGGSVGVAFACDDRDDILPTAEDIERVKAYISGHKNEATGQFEGMPANVELYVFAPQFQTVNFSVRILPDTATLRQAVRKSLQAYLSNAGVGALLYLSQIRAAVSNTAGEVDNSVIYPANDVQLLSDHIPTLGEITWL >NC_020515|1852002:1903953|1888866_1889286_+|WP_015433151.1|DBSCAN-SWA MPYATPESLIKRYTLDVLLSIARNDERQLDEPKVYEALEDASQTIDSYLAGRYRLPLNAVPSVLERHCCYIARYFLEKNRATDQARLDYEDSIRFLEKVASGAISLGLSDMDEPVETDNSAVMESNGSVWSRERSKGFI >NC_020515|1852002:1903953|1861347_1861614_-|WP_015433114.1|DBSCAN-SWA MKKGIHPENYREVLFYDASVQQGWVIRSCANTNKTMVWTDGKEYPLYSLDTSSASHPVYTGKRREANNEGRASKFNEKFKGIASLTRK >NC_020515|1852002:1903953|1852002_1853361_-|WP_015433104.1|tRNA|DBSCAN-SWA MKETIVAQATPIGRGGVGILRISGPLAEKVAQEVLGKTLKPRMANYLPFKDQDGTVLDQGIALFFKAPNSFTGEDVLELQGHGGQIILDLLLQRILKIEGIRIARAGEFSEQAFLNDKLDLAQAEAIADLIDATSEQAARSALKSLQGEFSNKVNELVDSVIYLRTYVEAAIDFPDEEIDFLADGKIEAKLNEIIAQLDGVRKEAKQGSILREGMKVVIAGKPNAGKSSLLNALAGREAAIVTDIAGTTRDVLREHIHIDGMPLHIIDTAGLREASDEVEKIGIQRAWDEIGQADHVLLMIDSTLSAADQFQQEWADFLAKLPVKMPVTVIRNKVDLSGEQEGVEQQDNFTLIRLSAQTKVGVELLREHLKKSMGYQSSTEGGFIARRRHLVALETAAEHLERGHTQLTQFMAGELLAEELRMVQNALSEITGQFTSDDLLGNIFSSFCIGK >NC_020515|1852002:1903953|1880290_1880587_+|WP_015433142.1|DBSCAN-SWA MALKELLTQDQRLVILRSLAEAGYDANESILNDCLDLYGHDISRDLVRTHLCWLEEQGLLTLERLKGGYMVASITQRGLDVAQGRTKVDGVKPPRPKI >NC_020515|1852002:1903953|1896912_1897464_+|WP_015433161.1|plate|DBSCAN-SWA MQALNRMIAPIKRGLQLLVSRAVVSVVNDAYARQNLQLRLQSDEVADDVERFQNYGHYSVPKAGEAIVVSVGGKRSHLVAVVVDDKSVRPAGLIAGDSVLYHLEGHHLRLTENGEAILSCKKLVIETETLGCSATEITFDSPQTTFTGDVDIMGISTAADHQSGGISGKDHDHEQKVGKPVSV >NC_020515|1852002:1903953|1889285_1889840_+|WP_025328927.1|DBSCAN-SWA MSIIAQTNEALIAKIKALCGDYLKEVETHPGQWDDSSVRRIVRNPPAVYIAWLGQQPSKNSHTVTARWGVFVVAEVLNGQRNDSIGIYQVVETLTAGIHQQRIEPSGMFTLQSVQNLWSDTQSGMGVAVYGMYFNADQPLAHQIDEDTLCDFKVYDHTFNQDNDPHTIDGKTRLTLTLPTQSSE >NC_020515|1852002:1903953|1860170_1860443_-|WP_025266826.1|DBSCAN-SWA MQNNFNPELGNWYVASLVLQVRPEYINEVKDKLATMPYTEIHGEKPDDGKLVVVMEADKEKALVSKMESLQTLEGVMAVSLIYSQRDEIN >NC_020515|1852002:1903953|1875530_1875707_+|WP_015433130.1|DBSCAN-SWA MTKAKETAEAAARAERFGNYTQAADLWNKAAKAAVNAKQQEWCRNRSDFCDRMAERPF >NC_020515|1852002:1903953|1881189_1882761_+|WP_015433144.1|DBSCAN-SWA MSLVNERPLNELSQECQDFLDCIHVFNPNELLLGYQKRWIADESQLKIVEKSRRTGLTWAEAADNALIASTRKSDGGCNVFYIGSNKEMAREYIDAVAMWAKAFNYAASEIQEEVLTDEEEGKDILTYVIYFASGFKVKALSSNPTNLRGMQGVVVIDEAGFHKYLAEVLKAALALTMWGAKVRIISTHNGVDNLFNQLILDSRAGRKKYSVQTITLDDACADGLYKRICQVTKQDWTQEKENEWKADLLRNTATEDDALEEYYCVPKRSSGGYIPRPLVDRAADESNVIVRFECDDKFITYSDVERETLALEWLLKEVLPQLEQLNPDYRHSFGVDFARSGDLSVFAVCACLPSTERRLALTLEIRNCPYDQQKQIMLFVLANMPRFIGSAFDSTGNGGYLAESALLRYGSSMVETVHLNDKWYREWMPKYKALYESDLISIPKDEETILDQGHIVVINGVPKIDKTRSQGKTGKRHGDSAVAYCMAVRASYMTGGEIDFIPLPSKHEINDDDDLPRSDWDI >NC_020515|1852002:1903953|1890039_1891503_+|WP_015433154.1|tail|DBSCAN-SWA MAISFNDIPSALRVPLAYIEFDNSKAVSGTPSVLHKVLMLGTKLSSGTAEAGQAVRVTAYSQAKTLFGRGSQLAEMVKTFKAHNNMLDLWCLPLDEAKSGAKATGTLTLSGTATQAGTLSVMIAGTNYKQAVSSGDTAATLATKLQKLIAADQDVPVTATVSGESITLTCRFNGETGNEIDVRCNYYSGEVLPAGISVNITPMQNGSVNPNMAEAITGFGAEWWNYLVNPFTDTESLNLLRTELVTRWGPLKQIDGICFMAKRGTHAEAATFAEQRNDYLFSVLATNKAPQPAYIWASAYAAVVAGSLSIDPARPVQTLTMDLLPPAMSDRWDLPERNTLLYSGNSTYIVNANNQPQVEAAITMYRKNAFGDNDESYLYVETIGTLSYIRYAIRSRITQKYPRHKLANDGTRIGPGQAIVTPKIIRNELLALFTELESAGLVEDFEQFKQTLLVERDANNPCRVNVLSNENLVNQFRIYAHAIQFVL >NC_020515|1852002:1903953|1867200_1868475_+|WP_015433119.1|DBSCAN-SWA MNFDVVIIGGGLAGLTCGIRLQQQGKRCVIVNNGQAAMDFSSGAFGLLGETSGTKIAKFDEKQTACLSENHPYRVLGFAQSLAMAQQFERDFAKPLALSGSSAHNHWRVTPLGGLRPAWLSPENSPMLGWTENFAYRKLAILGIEGYHDYQAELFADNLKQQPTFAQCEIITDYLHLPELDELRQSGREFRSVHISQRLEQQIAFDALVREIRQRAQGADAIFLPACFGIDHDELFQRLQQASQATLFELPTLPPSLLGIRQRKALRQLFECAGGVMINGDKAERAEVDENGKILRIFTRLHAEHGLSATHFVLASGSFFSGGLSSEFDRICDPLFGADIQGLGEFNPQDRLSWTAHRFSATQPYQSAGVIINAHCQVRKNGEFLPNLYAAGSVIGGYNGIAENCGSGVAVVTALTVAQQIGGQ >NC_020515|1852002:1903953|1891512_1891866_+|WP_015433155.1|tail|DBSCAN-SWA MATKFQGTATIRFNGKEYPTDNDGSLDVGGKERETVKGSQVYGFSEKPKEATVDVTVFNCEETDVMELKNMTNATVEFETDVGQTYLLPNAWAVETGTLSADGKIKVKMAAVECKRV >NC_020515|1852002:1903953|1877616_1878048_+|WP_015433135.1|DBSCAN-SWA MQSEVQQELFDGEHAEIGQLFDQLDHIPESEVHNRWPHLLVEVIDVMQAELQRQQFAENSAKLTACKLAGVIAHYFGGKSFYLPAGDKIKEALRDVQIYRDFDGKNVPDLVKKYRLSESTIYAILRQQRSLQRKRHQMDLFNS >NC_020515|1852002:1903953|1874830_1875343_+|WP_015433128.1|DBSCAN-SWA MAAKTRVKQPAKLRFTEQAQVQSAIKEIGDLTREHTRLTTLMNDEITAITERYTPQLNRLSEEQKPLQDAVQEYCEAHRDELTDFGKTKTANLITGEVSWRTRPPSVSVRNAEGVLENLQKLGFDRFIRTKQEINKDAMLAEPDIAKGIAGVTIKQGVEDFVIKPFEAEV >NC_020515|1852002:1903953|1884316_1885636_+|WP_015433146.1|head|DBSCAN-SWA MPNDNALDMGYVLRLEPELAVDYLRAKGVNITWDWHEQLEAAHARAFTVAKATRAEVLDTLRWATEKAIAEGTPEQEYIKNLEPMLKELGWWGKTVDENGKTVQLGSPRRLKTILRTNKSTSYHAARYAEQMANVDEQPYWQYVAVKDSRTRASHLALHGKVYRADDPIWQTMYPPNDWGCRCRVRALSEFALKKQGLNVSDSAGRISEETAIAGVNKDTSEEIRTTVSRIKTDQGEMKVGAGWNYNVGSAAFGTDVAVIRKLRQVKNRELRQQTIQAINDNPIRHKLFEQWVKSNLGKRGASARYMSAGLVTTEIAEKVAELSGQEKASELVLVMTEKRLEHANSDKHHQTGVGLTADEYASISRIIANPGAVIWDSERGHNNLIYLNQDKTIKVIVDAPSKDKLKPTEKVDAVINAYRVDYAEVLNKIKSGVYKIVK >NC_020515|1852002:1903953|1887475_1888402_+|WP_015433149.1|head|DBSCAN-SWA MANVTPELVKALFVGFGKNFKEGLAKAPSQYTKIATVTKSTTKSNTYAWLGQMPKLVEWIGKRAVTAIQSHGYSIENKDWADSIEIKKTDIEDDNVGVYSPLLEELGRAAGEQPDELVFGALKDGFKTACYDGQYFFDSDHPVGRNVDGTNPISVSNITDDGTGVTDENAWYLLDCSRSLKPIIFQERKAATPAQMTDETAQKVFEENVYTYGVDSRCNVGYGFWQMAHAVKGKLTAENLWKAISAMRAVRGDGDKRLGIRPTTLVVPTSLEEEAIKLLERELRVEDGVAIDNEFKKMNIELVVADYL >NC_020515|1852002:1903953|1878568_1879126_+|WP_015433136.1|DBSCAN-SWA MSLPILKIVVHCSATRNGKSLKQSGKSAAQVIDGWHKQRGFKRSAGAIKSFNSHLPHLGYHFVIDADGTVETGRQVGEIGAHVRGHNSNSVGICLVGGITAEGKNHGQYTEQQWHALHQLLRQLEAKHRKAKIYGHRDLSPDKNGDGSITPNEWLKDCPCFDVWSWLDSEQVVNLEHLFEVKNGI >NC_020515|1852002:1903953|1888480_1888867_+|WP_015433150.1|DBSCAN-SWA MDNTQLFSAVVQNKIKDGYRRAGISLAKGENVLPSITETQLKQLQADPRLVVTQTEQASLQNGGKGLSQHSPDDGGKSNLDGGVVPANLTVEQLKAKLTELGVEFKSDALKAELVALLTAVLKPKEGE >NC_020515|1852002:1903953|1892341_1894249_+|WP_015433157.1|DBSCAN-SWA MANNSTSFYVNLAGNVSSQASKFGNSLSAMANKGVSNMAKLSSSITKVGSGLASLSQKINNVGNVALPVIGVGVGAGAAMVSKSMIRVAADFEMANIRMKQTFGKRGDEAMAWLKKFATDTPMAFGDVQDAAMQMMTAGIDPMNGSLQALVDWNAKVGGSTENLNAYISSFAKMKIKGKMSWEDIQPLLERNVPVLKMLAEATGNKYTEKQIMKMIQEGKMQGAALDALWKQMGKNAKGAAKEQMKTWDGLVSNLGDTWVAMQAQFMGHGAFDSLKAELGSFLEWLNSKIDDGTLDAFAKTVSETLTEALKDLKEMATAVQPTLEKIGSVMEWVSEKAGGYGNIAKFVGGFYVANKIANLGVTKKIAGVGWGTTKWVGSKFRRNPKGGAGAAMETAGLLGGVAGVTPVYVTNMPMVANGLGGGYIGQEPNSKKTNKKLPKTPKALPGVAVATTVAANATQATVNKGITQAVGNTVKSASQAISTTAHTATAAVSRTAARAVPYLNVAATAVEGAMVLMDNQASTQDKSEAIGSIAGATAGAIVGQALIPIPVVGAAVGSYVGSWLGEWLGSEVGEYLSNPEPIKNELNGTIQVAVKASEHLIATATASKVQTNQKQDNMNIAVQMGTLGPGVGMW |
63 | Shigella_phage(32.26%) | head,tRNA,transposase,tail,plate | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2075340 : 2080725
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_020515|2075340:2080725|DBSCAN-SWA TTTAAGAATTAACGACACCTAAATAATGGTGATAAGATTGTAATACGGTATCCGTAAAGCCAATCTCTTTTACACACCCTTTATCTAACCATATTGCATTTTGACACAGCTCTTTCACTTGATCCGCCGTATGAGAAACAAATAGTAATGTAGTACCATCAGCTAACATAGCTTCCATTCGTTTTTGACATTTTTGCTGAAAAGCAATATCTCCAACCGCTAAAACCTCATCAACAATAAGAATTTCTGGTTTTACAACTGTGGCAATAGAAAAACCCAAACGTGCAGCCATACCTGACGAGAAATTTTTAATTGGCATATCTAAAAAATCTTTCAGCTCTGCAAATTCAACTATTTCATCAAATTTTTCTTGAATAAATTTTTTACTGTATCCCAGTATCACACCATTTAGATAAATATTTTCCCTAGCGGTCAGGTCCCAATTAAAACCTGCTCCTAATTCAATCAAAGGAGAAATATTTCCAGTAATATTAACATTGCCTCTAGAGGGCTTTAATATACCACAAATTAATTTTAATAAAGTAGACTTACCTGAGCCATTTGTACCGATCAACCCCCAAGACTCCCCTTTTTTTACTTGGAAGTTAATATCTTTTAAAGCATAAAATTCTTCAAACATCAGCTCTTTTTTTAACAGCTTTATTACATATTCTTTCAAGCCAGTTACATTTTCCGAAGACTTATTAAAAATAACTGAAGCATTACTTACATCAATTACAATATCTTTATCCATAATTATATATATAAAATAAACTCATCCTGTTTTCTACTAAAAACAAACAACCCTATAATGAATACCAAAACTGAAATTGTTAAACCACAAAATATATCAAACATACATAAGGCATTACCATATAAAACTATCTCTCTAAATTGAAAAAGATAATGATACATAGGGTTTAAAGCTGTATATAAGTTACGGTAACTATCAGGAATAATACTCACTGGATAGAAAATTGGCGTTAGATACATCCACATTGTAACAAATACATTCCAGAGATATTGTATATCTCGAAAAAACACAGAAATAGTAGATAGAACGAAACTTAGTCCTAAACTAAAAATATATAACTGTACTATAACAATAGGGAAAAACAATAAACTCCAGCTAATTTCTACATCGCTAAATATAACAACAATAAATAATGCGATCAAAGAAAATAAAAAATTCACTAATGCACTAGTGACTTTTGATAGCGTAAATATATATTTAGGTAAATAGGTCTTCTTTAATAAAGGTGCATTTTCAACAATAGCATTTGCAGAGCTATTTGTTGCTTCTGCCATAAAGCCAAAAATAACTTGACCTGCAAGAAGATAAACTGGGAAATTGGGAATATCAAAACGGAAAATATTAGAGAATACAACCACCAATACCATCATCATTAATAATGGGTTAAGAATACTCCAAACATAGCCCAAATAACTACGGCGATATTTGAGTTTTATATCTTTTACGACCAATTGTTCTAAAAGTTCACCATACTTCCAAGATGAAATTAATTTAGATAATAACATTCGTTTTATTCCTAAAATGTTACAGCATCCTGTAAACTTTTCCCAGCCAAATCTTTCGCAGACAAGCTAGGCTCACCTTCTAAATGCCATTCAATTCCCACAGAAGGATCATTCCAAATTAAGGAATGCTCCGCTTTTGGATTGTAATAATCGGTGCATTTATATACAAATTCTGCCTCATCACTTAACACATAAAAACCGTGTGCAAAGCCTTCGGGTACCCAAAGTTGGCGTTTATTTTCAGCTGATAAAATTTCAGCAACATATTGCCCAAAAGTTGGTGAGCTTTTGCGTAAATCCACTGCCACATCAAGCACTGCTCCACTCACTACTCGGACTAATTTGCCCTGCGTATTTTCTGTTTGGTAATGTAAACCACGTAATACACCTTTTACGGATTTTGAATGATTTTCTTGTACGAAAGTGCGATTTGCCACATTTTGCTTAAACCATTCATCTCGGAAAGTTTCCATAAAAAAGCCTCGTTCATCACCGAAAACTTGTGGCTCTAATAATTTAACTTCTGGGATTTTCGTTTCAATAATTTTCATTGTTTTTCCTGATAATCTTTTAAATGCAACAAGGCTTTACGCCAATCACTTGGTTTAATACCAAACGCCTGTTGGATTTTCGTTAAATCCAAACGAGAATTTGCCGGGCGTTTCGCAGGTGTTGGATAATCACTAGTTGAAATGGCATTCACAAGCGGTGCTTTTTCTAGCATTTTTTGCAAAATGGCTTGTTGGAAAATTTCTTCGGCAAAGCCGTGCCAGCTGATGTAAGGCTGTCCGCTAAAATGGTATATGCCAAACGCATTATGTTTGCCTGTTAAAATTTGGTTGGCAATCTCAATCAATGCCGCTGCAATATCCCCTGCATAGGTTGGCGCCCCAATTTGATCGGCTACTACACCAAGTGTTTCACGTTCCTTACCTAATCTCAACATAGTTTTGACAAAGTTATGCCCGTGTTCACCAAACACCCAAGCAGTACGTAGTATCACCGATTTAGGGTTTGCATTTAATACCGCTATTTCACCCGCTTGTTTGGTTCTGCCATAAACACCTTGCGGATCAACACGATCATCTTCTGTGTAAGCTTGGTTGCCTTGCCCATTAAACACATAATCAGTGGAAATATGTAGCATTAACGCACCAATCTCCGCACTCGCTTCGGCAAGATATTTTGCCCCATTAACATTGATCGCTTCAGATAATGCTGTTTCACTTTCCGCACGATCGACCGCTGTATGAGCTGCAGCGTTAATAATCACATCAGGTTGAAATTCTGCAACCGTACGAAATACCGCCTCACGATCGGTAATATCTAATTTATCCCGATCTACGGCTAATACTTCTGCTTTACCTTGCAATTGCTGCGTTAAGCAAAATCCGACTTGCCCATTTGCACCTGTAATCAAAAATTTTGCCATTATTCGTTTCCTTTGATTAAGCGTAACAAATACTGACCATACTCATTTTTGCTCATTGGTTTGGCTATTTGTTCAACTTGTTCCGATGAAAGCCAGCCATTTCGCCACGCAATTTCTTCTAAACATGCTACCTGTAAATTCTGAATGTTTTGCACCGTACGCACAAATGAAGAAGCTTCGTGAAGGCTGGCGTGCGTGCCTGTATCTAACCAAGCAAAGCCACGCCCTAAAATTTGCACATTAAGCGAGCCATCATTGAGATACATTTCATTTAAAGTGGTAATTTCTAGCTCTCCACGTGCCGAAGGTTTAACTTGTTTGGCAAATTCCACCACACGATTATCGTAAAAATATAAGCCTGTTACTGCCCAATCGGATTTTGGTTGGCTTGGTTTTTCTTCAATCGAAACTGCACGGTAGTTCTCATCAAATTCCACGACACCAAAACGCTCAGGATCTTTGACTTGATAGCCAAATACGGTTGCTCTGCCTTGTTCTGCATCGGCTTTGGCATTGTGTAGTGTTTGAGTAAAACTTTGCCCATAGAAAATGTTATCACCTAATACCAAGCAAACACTCTCATCACCGATAAATTCTTCGCCAATTAAAAATGCCTGTGCTAAACCATCAGGGCTAGGTTGAATAGCATATTCCAATTTAATACCGAAATCAGCACCATCACCCAGTAAACGGCGGAAACTGGCGTTATCTTCAGGGGTAGTAATAATGAGTATTTCACGGATTCCGGCTAGCATTAACACGGAAAGCGGATAATAGATCATCGGTTTATCGTAAACAGGCAAAAGCTGTTTAGATACGCCACGAGTAATCGGATAAAGGCGAGTGCCAGAACCGCCTGCGAGAATAATCCCTTTCATTTTTCCCCCTATTTTGTGCCTAAACGTTCCATATTGTATGAACCGTCTAACACACGCTTCCACCAGTTTTCATTGGCTAAATACCATTCCACCGTTTTACGAATACCGCTTTCAAAAGTTTCTTGCGGTGTCCAGCCCAATTCACGCCCAATTTTTGCTGCATCAATCGCATAACGCACATCGTGTCCTGGTCGGTCTGTTACATAGGTAATCAAATCTTCGTATTTTGCGACCCCTGCTGGTTTATTTGGTACAAGTTCTTCCAATAAGGCACAGATGGTACGCACTACTTCAATGTTCGCTTTTTCATTGTGTCCGCCAATGTTGTAGGTTTCACCTACCGCCCCTTCGGTAACCACTTTATACAAGGCTCGTGCGTGATCTTCGACAAACAACCAATCACGAATTTGCATACCATTGCCGTAAACAGGCAATTTTTTACCGCTGATTGCATTTAAAATCATCAATGGAATCAATTTTTCGGGGAAGTGGAAAGGCCCATAATTATTAGAACAGTTGGTTACAATCGTTGGCAAACCATAAGTGCGTAACCACGCACGCACTAAATGATCGCTAGATGCTTTGGACGCAGAATAAGGGCTACTCGGGGCATAAGGCGTGGTTTCGGTGAATAGATCGTCCGTCCCTTCTAAATCGCCATAAACTTCATCGGTTGAAATATGGTGGAAACGGAATGCGGATTTTTTATCGTCTGGTAAGCCATTCCAGTAGTTACGAGCAGCTTCTAATAATGTGTAAGTACCAACGATATTGGTTTCAATAAATGCCGCTGGTCCATCAATAGAGCGATCTACGTGGCTTTCAGCCGCCAAATGCATGACTGCATCAGGCTGATATTGAGTAAACACACGATCTAACTCTGCACGATTACAAATATCTACCTGTTCAAAATGATAACGAGGACTATTAGCCACACTTTCAAGCGATTCCAAATTACCTGCATAGGTCAATTTATCTAGATTGATAACTGTATCTTGGGTGTGGTTAATAATATGGCGCACCACCGCAGAGCCAATAAAGCCCGCACCGCCTGTGATGAGGATTTTCATAATTGCCTTTTGAAAAGTAAAAACTTTTTAGAATTATAACAGAAAGGAAAACTTAATCTATCATTGCCCTAGGCAAACAATTATTAGATAATAACCTAATTTATAAATCTTGATCTAAAACAAAAATGAACGATATCAACATTCCTCTGACTTTCACCGATGCGGCAGCAAACAAGGTGAAGTTCCTTATCGAAGGCGAAGATAATCCAAACCTACGTTTACGTGTTTATATCACTGGCGGTGGCTGTAGTGGTTTCCAATATGGCTTTACATTCGATGATCAAATTAATGACGGTGATTTGACCGTAGAAAATCAAAATGTTGGCTTGGTGGTCGATCCAATGAGCTTACAATATTTGATTGGGGCAACAGTGGATTATGTCGATGGCTTAGAAGGTTCCCGTTTTGTGGTACAAAATCCAAATGCCAGTTCGACCTGCGGCTGTGGCTCTTCGTTCAGCATCTAA
Protein sequences of DBSCAN-SWA_4 >NC_020515|2075340:2080725|2075340_2076093_-|WP_015433339.1|DBSCAN-SWA MDKDIVIDVSNASVIFNKSSENVTGLKEYVIKLLKKELMFEEFYALKDINFQVKKGESWGLIGTNGSGKSTLLKLICGILKPSRGNVNITGNISPLIELGAGFNWDLTARENIYLNGVILGYSKKFIQEKFDEIVEFAELKDFLDMPIKNFSSGMAARLGFSIATVVKPEILIVDEVLAVGDIAFQQKCQKRMEAMLADGTTLLFVSHTADQVKELCQNAIWLDKGCVKEIGFTDTVLQSYHHYLGVVNS >NC_020515|2075340:2080725|2076095_2076878_-|WP_015433340.1|DBSCAN-SWA MLLSKLISSWKYGELLEQLVVKDIKLKYRRSYLGYVWSILNPLLMMMVLVVVFSNIFRFDIPNFPVYLLAGQVIFGFMAEATNSSANAIVENAPLLKKTYLPKYIFTLSKVTSALVNFLFSLIALFIVVIFSDVEISWSLLFFPIVIVQLYIFSLGLSFVLSTISVFFRDIQYLWNVFVTMWMYLTPIFYPVSIIPDSYRNLYTALNPMYHYLFQFREIVLYGNALCMFDIFCGLTISVLVFIIGLFVFSRKQDEFILYI >NC_020515|2075340:2080725|2077425_2078310_-|WP_015433342.1|DBSCAN-SWA MAKFLITGANGQVGFCLTQQLQGKAEVLAVDRDKLDITDREAVFRTVAEFQPDVIINAAAHTAVDRAESETALSEAINVNGAKYLAEASAEIGALMLHISTDYVFNGQGNQAYTEDDRVDPQGVYGRTKQAGEIAVLNANPKSVILRTAWVFGEHGHNFVKTMLRLGKERETLGVVADQIGAPTYAGDIAAALIEIANQILTGKHNAFGIYHFSGQPYISWHGFAEEIFQQAILQKMLEKAPLVNAISTSDYPTPAKRPANSRLDLTKIQQAFGIKPSDWRKALLHLKDYQEKQ >NC_020515|2075340:2080725|2078309_2079188_-|WP_015433343.1|DBSCAN-SWA MKGIILAGGSGTRLYPITRGVSKQLLPVYDKPMIYYPLSVLMLAGIREILIITTPEDNASFRRLLGDGADFGIKLEYAIQPSPDGLAQAFLIGEEFIGDESVCLVLGDNIFYGQSFTQTLHNAKADAEQGRATVFGYQVKDPERFGVVEFDENYRAVSIEEKPSQPKSDWAVTGLYFYDNRVVEFAKQVKPSARGELEITTLNEMYLNDGSLNVQILGRGFAWLDTGTHASLHEASSFVRTVQNIQNLQVACLEEIAWRNGWLSSEQVEQIAKPMSKNEYGQYLLRLIKGNE >NC_020515|2075340:2080725|2076889_2077429_-|WP_015433341.1|DBSCAN-SWA MKIIETKIPEVKLLEPQVFGDERGFFMETFRDEWFKQNVANRTFVQENHSKSVKGVLRGLHYQTENTQGKLVRVVSGAVLDVAVDLRKSSPTFGQYVAEILSAENKRQLWVPEGFAHGFYVLSDEAEFVYKCTDYYNPKAEHSLIWNDPSVGIEWHLEGEPSLSAKDLAGKSLQDAVTF >NC_020515|2075340:2080725|2079196_2080258_-|WP_015433344.1|DBSCAN-SWA MKILITGGAGFIGSAVVRHIINHTQDTVINLDKLTYAGNLESLESVANSPRYHFEQVDICNRAELDRVFTQYQPDAVMHLAAESHVDRSIDGPAAFIETNIVGTYTLLEAARNYWNGLPDDKKSAFRFHHISTDEVYGDLEGTDDLFTETTPYAPSSPYSASKASSDHLVRAWLRTYGLPTIVTNCSNNYGPFHFPEKLIPLMILNAISGKKLPVYGNGMQIRDWLFVEDHARALYKVVTEGAVGETYNIGGHNEKANIEVVRTICALLEELVPNKPAGVAKYEDLITYVTDRPGHDVRYAIDAAKIGRELGWTPQETFESGIRKTVEWYLANENWWKRVLDGSYNMERLGTK >NC_020515|2075340:2080725|2080383_2080725_+|WP_015433345.1|DBSCAN-SWA MNDINIPLTFTDAAANKVKFLIEGEDNPNLRLRVYITGGGCSGFQYGFTFDDQINDGDLTVENQNVGLVVDPMSLQYLIGATVDYVDGLEGSRFVVQNPNASSTCGCGSSFSI |
7 | Enterobacteria_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|