Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP022383 | Capnocytophaga sputigena strain H4486 chromosome, complete genome | 3 crisprs | DEDDh,cas3,cas2,cas9,PD-DExK,WYL,csa3 | 0 | 5 | 2 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP022383_1 | 1573979-1574629 | TypeII |
NA
Consensus repeat of NZ_CP022383_1
|
8 spacers
spacers of NZ_CP022383_1
>1.1|1574025|102|NZ_CP022383|CRISPRCasFinder TTAAAACCTTTGTAAATTTGCAGCGTGTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAACCGCTGGCAAGTCCATACAATGCACGGCTAC >1.2|1574173|30|NZ_CP022383|CRISPRCasFinder TTTATAAAGCAGTACAAACCCTTTGCGCTT >1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR CTATGGAAAACACTTACTTTTTTAAAGCCA >1.4|1574325|30|NZ_CP022383|CRISPRCasFinder,PILER-CR GATTTTTGCCTTACTTATGACCAACAAAGA >1.5|1574401|30|NZ_CP022383|CRISPRCasFinder,PILER-CR GACCCATTAGAGACCCACCAGAACCCAAGC >1.6|1574477|30|NZ_CP022383|CRISPRCasFinder,PILER-CR AGATTAGACAATATTCTGCACCATTATATG >1.7|1574553|31|NZ_CP022383|CRISPRCasFinder,PILER-CR ATTGCCTTCCAAGGCGTTCATACTCTATGAG >1.8|1574011|40|NZ_CP022383|CRT CGATTGATAACAACTTAAAACCTTTGTAAATTTGCAGCGT >1.9|1574083|44|NZ_CP022383|CRT TGATTGATAACAACCGCTGGCAAGTCCATACAATGCACGGCTAC >1.10|1574159|44|NZ_CP022383|CRT CGATTGATAACAACTTTATAAAGCAGTACAAACCCTTTGCGCTT >1.11|1574235|44|NZ_CP022383|CRT CGATTGATAACAACCTATGGAAAACACTTACTTTTTTAAAGCCA >1.12|1574311|44|NZ_CP022383|CRT TGATTGATAACAACGATTTTTGCCTTACTTATGACCAACAAAGA >1.13|1574387|44|NZ_CP022383|CRT TGATTGATAACAACGACCCATTAGAGACCCACCAGAACCCAAGC >1.14|1574463|44|NZ_CP022383|CRT TGATTGATAACAACAGATTAGACAATATTCTGCACCATTATATG >1.15|1574539|45|NZ_CP022383|CRT TGATTGATAACAACATTGCCTTCCAAGGCGTTCATACTCTATGAG |
cas2,cas9 |
CRISPR arrays and Neighbor proteins around NZ_CP022383_1
The CRISPR arrays of NZ_CP022383_1 >merge|NZ_CP022383|1|1573979-1574629|CRISPRCasFinder,CRT,PILER-CR GTTGTGAATTGCTTTCAAATTTTGTAGTTTTGCGATTGATAACAACTTAAAACCTTTGTAAATTTGCAGCGTGTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAACCGCTGGCAAGTCCATACAATGCACGGCTACGTTGTGAATTGCTTTCAAATTTTGTAGTTTTGCGATTGATAACAACTTTATAAAGCAGTACAAACCCTTTGCGCTTGTTGTGAATTGCTTTCAAATTTTGTAGTTTTGCGATTGATAACAACCTATGGAAAACACTTACTTTTTTAAAGCCAGTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAACGATTTTTGCCTTACTTATGACCAACAAAGAGTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAACGACCCATTAGAGACCCACCAGAACCCAAGCGTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAACAGATTAGACAATATTCTGCACCATTATATGGTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAACATTGCCTTCCAAGGCGTTCATACTCTATGAGGTTGTGAATTGATTTCAAATTTTGTAGTTTTGTGATTGATAACAAC >NZ_CP022383|1|1|1573979-1574629|CRISPRCasFinder GTTGTGAATTGCTTTCAAATTTTGTAGTTTTGCGATTGATAACAAC TTAAAACCTTTGTAAATTTGCAGCGTGTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAACCGCTGGCAAGTCCATACAATGCACGGCTAC GTTGTGAATTGCTTTCAAATTTTGTAGTTTTGCGATTGATAACAAC TTTATAAAGCAGTACAAACCCTTTGCGCTT GTTGTGAATTGCTTTCAAATTTTGTAGTTTTGCGATTGATAACAAC CTATGGAAAACACTTACTTTTTTAAAGCCA GTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAAC GATTTTTGCCTTACTTATGACCAACAAAGA GTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAAC GACCCATTAGAGACCCACCAGAACCCAAGC GTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAAC AGATTAGACAATATTCTGCACCATTATATG GTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAAC ATTGCCTTCCAAGGCGTTCATACTCTATGAG GTTGTGAATTGATTTCAAATTTTGTAGTTTTGTGATTGATAACAAC >NZ_CP022383|1|1|1573979-1574615|CRT GTTGTGAATTGCTTTCAAATTTTGTAGTTTTG CGATTGATAACAACTTAAAACCTTTGTAAATTTGCAGCGT GTTGTGAATTGCTTTCAAATTTTGTAGTTTTG TGATTGATAACAACCGCTGGCAAGTCCATACAATGCACGGCTAC GTTGTGAATTGCTTTCAAATTTTGTAGTTTTG CGATTGATAACAACTTTATAAAGCAGTACAAACCCTTTGCGCTT GTTGTGAATTGCTTTCAAATTTTGTAGTTTTG CGATTGATAACAACCTATGGAAAACACTTACTTTTTTAAAGCCA GTTGTGAATTGCTTTCAAATTTTGTAGTTTTG TGATTGATAACAACGATTTTTGCCTTACTTATGACCAACAAAGA GTTGTGAATTGCTTTCAAATTTTGTAGTTTTG TGATTGATAACAACGACCCATTAGAGACCCACCAGAACCCAAGC GTTGTGAATTGCTTTCAAATTTTGTAGTTTTG TGATTGATAACAACAGATTAGACAATATTCTGCACCATTATATG GTTGTGAATTGCTTTCAAATTTTGTAGTTTTG TGATTGATAACAACATTGCCTTCCAAGGCGTTCATACTCTATGAG GTTGTGAATTGATTTCAAATTTTGTAGTTTTG >NZ_CP022383|1|1|1574203-1574629|PILER-CR GTTGTGAATTGCTTTCAAATTTTGTAGTTTTGCGATTGATAACAAC CTATGGAAAACACTTACTTTTTTAAAGCCA GTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAAC GATTTTTGCCTTACTTATGACCAACAAAGA GTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAAC GACCCATTAGAGACCCACCAGAACCCAAGC GTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAAC AGATTAGACAATATTCTGCACCATTATATG GTTGTGAATTGCTTTCAAATTTTGTAGTTTTGTGATTGATAACAAC ATTGCCTTCCAAGGCGTTCATACTCTATGAG GTTGTGAATTGATTTCAAATTTTGTAGTTTTGTGATTGATAACAAC
>NZ_CP022383.1|WP_009388479.1|1573409_1573847_+|hypothetical-protein MKKSVQIFLELISAIALYLFINSIAYGDHLVTLPLYLTGVVIMSYLFVRERPQECLQSALRIVLPFTGLMLIAGSLPDAHFSLALIYIVSTGAAFLLGMLIATVSKTYQKIGLSALTFLLVVAVRLSFSAEFQYLIGVHWYTLGS >NZ_CP022383.1|WP_002678556.1|1572298_1572550_-|GlsB/YeaQ/YmgE-family-stress-response-membrane-protein MGIIATLIIGAIAGWLGGTIYKGSGLGLIGNIIIGILGSGVGYWLLGNVFHISLGEGWIGAILTGAIGAIVILFLINLVFKKK >NZ_CP022383.1|WP_002678420.1|1571848_1572226_+|hypothetical-protein MKKIIALVSVICLSLVAFSCSKNDDPADNDLFVGTYKGSISYLNGQEKKAFDNGSVTVVKTGNDYYFRFSDGIEDLKGVSFKKEGDNTLINVDLQDGLKVIRINASSLYILYTTDGKTWTANAKR >NZ_CP022383.1|WP_095901366.1|1571477_1571726_+|hypothetical-protein MNNKTYSSFEEIDNEIRLLRLEKEIDKLSLTQQVSASAESLSPKNLLANTWVSLLFNNKRWINIALGYGMQLLVKRYLNKNR >NZ_CP022383.1|WP_095901365.1|1571110_1571488_+|hypothetical-protein MAIYSIKETFVEVPTKANSAFNSQFAYYKLVLFRFIAKSSYGLITFFIYAFASLLILFFLSLAGAYAIGEALGSNGLGFAIVGAFYILLSLVVLLFRKKLIERSLLRKLSEIYFKTDPEEEENEQ >NZ_CP022383.1|WP_095901364.1|1570779_1571106_+|YtxH-domain-containing-protein MSKTANAVLGLAVGTAIGVGLGILFAPDEGKNTRKKIKDSLRDKSDELKEQLDNLTENVREKSLELKGSLEEKVDRLFSKASNKSEDVISLLEKKLASLKEEAAKVKK >NZ_CP022383.1|WP_095901363.1|1569869_1570403_-|DUF3127-domain-containing-protein MEIQGRIIKINPTQTVGQNGFQRRDVVIMTEEQYPQYIPFDFVQEKCVLLDNFQEGQTVRISFNIRGREWVNPQGETKYIVNLQGWRIENAEIAQNYPPQGYGQPAYPPQGYAQPQYQQAPPQYQQVAPQYQQQVPPQYPPQMPQQQFQSTPQQAFGTPPVQNAPVAPIEPEDDLPF >NZ_CP022383.1|WP_095901362.1|1569228_1569867_-|leucyl/phenylalanyl-tRNA--protein-transferase MTYLINDETPFPSPETASEEGIVAFGGNLTPARLVEAYSQGIFPWYNEGDPVLWWCPDPRFVLFPEKLHISKNMRKLLSKAPYRVTYNRCFTEVMQQCATVSRKDQHGTWIHPELIEAYTTLHQQGIAHSVEVWQDETLVGGLYGLQMGKIFCGESMFSKQPNASQYGFITFLQAHPHIALVDCQIHSEYLESLGAEEIPRATFLKLLTINR >NZ_CP022383.1|WP_095898360.1|1568612_1569101_-|hypothetical-protein MAILRLNKNAKRIIIVIMVITAIIIARIIISNYYENQKVELSKKCFNDSNIGFYHKEFSFYFPEELELQGAQILQIHNQDTIVIDYRILGHNIVINSPKNLKSEDIIKIILKDAVFTLRDFRNGPRFGGGRVFLGCFLQECVINNRKKICDDAGIFMFFSTL >NZ_CP022383.1|WP_095901361.1|1567262_1568303_-|aminotransferase-class-V-fold-PLP-dependent-enzyme MKISFKNDYAEGAHPMILEALMRTNAVQQAGYGNDSYSAAAKELIKNKIENPEAQIFFVSGGTQANMLVIGTLLRTYQSVIAPETGHIADNETGAIEAVGHKIHLAPATNGKITPESITDFATRYTNYPHQVQPRLVYITNATEIGTIYTLAELKAIYKCCQAHNLLLFMDGARLGSALNAEDNDIQWKDLARYTDVFYIGATKNGGLLGEAIVFNKPALADGFEYYLKQQGALLAKGRLLGLQFLTLFENGLYDALALHANKQMKRIKEAFVANGYTLLSDTCTNQLFPILTHTQIAALAEQFDFYQWQKVDETHTAIRLISSWATTDAQVDTLIQAIQLLSVAK >NZ_CP022383.1|WP_095901369.1|1574745_1575087_-|CRISPR-associated-endonuclease-Cas2 MMIADRYNAYRIMWVLVLYDLPTETKKMREEAQNFRDKLIKDGFTLFQFSMYIRHCPSRENAQVHINRIKQNLPQYGNIAIMCLTDKQFGDMEFFYCQRETAAPETYVQLELF >NZ_CP022383.1|WP_095901370.1|1575284_1579565_-|type-II-CRISPR-RNA-guided-endonuclease-Cas9 MKNILGLDLGTTSIGFAHIVEDENKEKSEIKELGVRIVSLTTDEQSDFEKGKSITTNANRTLKHGARLNLDRYQQRRKYLIDLLQKANLITSSSILAENGKNTTHSTWQLRAKAVTERIEKEEFARVLLAINKKRGYKSSRKAKTEDEGQAIDGMAIAKRLYDENLTLGQLSLQLLQQNKKLLPDFYRSDLQREFDLVWNFQKQFYPDILTDSFYKELQGKGKDATSKAFSKRYHFDTAENKGSKESVRLQAYQWRAEAMSKQLSKEEVAYVLTEINNNLNNASGYLGAISDRSKELYFNRQTVGQYLYAQLQENRHNSLKNKVFYRQDYLDEFERIWETQASFHKELTEELKKQIRDVVIFYQRKLKSQKGLISFCEFESKEIEIEKDGKTITKSIGARVVPKSSPLFQEFKIWQILNNVTCKRKGIRKKKSSAKSTELDLFNEPSQTIFSLDMECKQLLFEELNLKGDLKSDKVLKLLGYSPQEWEINYNQLEGNRTQKALYEAYLKIVEMEAHDVKDILQIKSAKDDWSLDESPLSASEIKEKIKDIFQALGICTKILYFDPLLPVKEFEKQDSYQLWHLLYSYESDDSTSGNETLYRILEKKYAFKREHARILANVALQDDYGSLSTKAIRKIYPYIKENQYSKACELAGYKHSKLSLTTEELEARELKNIIPLLKKNALRNPVVEKILNQMINVVNALIEKNSERDAEGKITKYFHFDEIRIELARELKKNAQKRYEMTQNINKAKLEHQKISEILQKQFGIKNPTKSDIIRYRLYQELEHNGYKELYTNAPIARDMLFSKNIEIEHIVPKARVFDDSFSNKTLTFHRINSDKGESTAFDYITSLNSEEELNQYLTRVENAYKTKSISPAKYKNLLKKASEIGDGFINRDLRDTQYIAKKAKEILFQVTKNVLSTSGSITDRLREDWGLVDVMKELNMPKYQSLGLTEVEERKDGNKVTVIKNWTKRNDHRHHAMDALTVAFTKPSYIQYLNHLNARKDENNKNYSVILAIEEKETIKVPTNNGKNKRVFIEPIPNFRQVAKKHLEEIFISHKAKNKVVTKNVNKPAGTDKQQITLTPRGQLHKETVYGKYQYYVNKEEKIGVKFDKETIAKVSNPLYREALLKRLQANDNDPKKAFTGKNALSKKPIYLDEAKTKTLPEKVNLTYLEEDFSIRKDISPDNFKDLKSIEKVIDQGVKRILIKRLQAYGNDPKKAFVDLEKNPIWLNKEKGIAIKRVTISGVNNAQPLHIGKDHLGKTTLNKEGKEIPVDYVSTGNNHHVAIYRDKEGNLQEQIVSFFEAVVRTQQGLPIIDKTYKQEEGWQFLFTMKQNEMFVFPNATTGFNPAEIDLLDPKNKKQISPNLFRVQKIATKDYFFRHHLETNVETDNVLKNTTWKREGLSGLKDIVKVRINHLGDIVSIGEY >NZ_CP022383.1|WP_095901371.1|1585422_1587237_-|DNA-binding-protein MKLNLLKPTAIALLSLFAITSCVKDDDYEIPNPNGEKPLPPFSGKVVTFDVATAQATNTVVTYGADEAIEGYVISSDEAGNFYQKIYIQNEDKTKGVTVAINKTGLYTDFPLGAKVQLRLKGLTSQINNGGVDFGSGIFQANNGRTSVGRMSEAIAKNHLFDKGGVRKTLAELAKADASINTLKVEANVNQLITLKGVHFKTADVGKEMHQKTNDARQGTDYTLTDAQGNTIPFRTSRYAKFKDEKVPAGTLDITGVLTKFGQNWQFMISNYADIVVVSGGTSTGTQTNTTVETIEASTATAASFVEGKKVKLHGNLTVEGTKAYILFSDGTKIQLYTRNFKNISNENKTNLKENGKEVTVTGTFGKFNDTLQISYEQDSDLVWGASNNNPNPPATVETVEASTATAASFVEGKKVKLHGNLTVEGTKSYIVFSDGTKIQLYIKNFKNLSKESKDNLKINGKEVTVTGTFGKFNNTLQIAYEQDSDLVWGASNNNPNPQPPATVTELDASTATAADFQLNKVVKLTGTIKIINNRSTIVFTTDNTEIQLTTKGYGSLPDDFKTKISTEGKKVIVKGTFIEFKDKKKGTVTKQLKYDSIDDVQFP >NZ_CP022383.1|WP_002682272.1|1587288_1587975_-|hypothetical-protein MNKLSIFRKGALALLLATTALVGCSKSNESNSDTEFQGVPYMLLTTNKPVGSNISLTIVAKEADKKDVWIDLNNNGVFDTGSDVRLTAATTTYKLQGNTVRVYGKVTTFKCDRNEVTSLDVTRNAVLERLDMSHNKIKEVNLHNNTQLTYLKVSNLPLVGLDLTNNKKLTELRFNLTFSGEKLKKCASTMHENGGGTVYMNPRITDDPAEKQTMQTLTNKKWKIGNCQ >NZ_CP022383.1|WP_095901372.1|1588026_1590744_-|TonB-dependent-receptor MFKKFYLFILLLMSASAFAQWRVEGTVVDELSKKPLEGVKVHVNASAWQVTTNGKGKYALSLPEGEYLIIYSLNGYTRQEQLVSVGGEKITQELPVVSLTVDVVQESEQMAVINESELEDDESSADAMSGLLQSSQDVFMRRAAFDFSSAFFKPRGYDSKDVTVLINGIPMNRFENGRAQWNNWGGLNDVTRNQEYSNGLSKSDYTFGGLMGSTYINIRPSLNRAGLRLTSSASNRSYTGRLMATYNSGVLRNGLSFMVSASRRFAPQGSWVDGTLYNAYAFSAAVEYQLNDHSSFNLLGMFSPVKRGKSSPLTREALDLFGYQYNPYWGRQMGDKRNSRNRVISEPIFVLSYNYLKNSTRLNIDLGYQFGEIGNTRIAYANAQNPEPNYYKNMPSYYLNQNSGPDFAAAENQKKDILENPQLNWEKLYYANYNNTDKHSTFIVSNDINRERTLSANVNFAMPIHDFIKWTSGVVYRNISSDNFAEIDDLLGGQYFMNYDYFENKPYDANEADMKKQKGDKWNYFYSLKSNVGEAFSQLEFTFKKAELFIAGRYHYTDVQREGKYNYPLYSDSYGKGAMQVQNGVSTKAGITYALTGRHLLQLNVGYFNTPQSLRNIYANIRNSNRLLPNLKNELAYSADASYIFRMPYLKGRLTGFFTEIENTAETNFFYTETAITDEIDKDFVAQTVDGIQKRHFGLEFGAEAQLHPTVKLTAAAAIGQYTYINNPSLYISAGDVNKAIDEVKMKNYHVPSGPQQAYSLGLEYRNPKYWWVGATANFLAQNYVSLSALNRTSQFFIDPATKATYSNIDFDKARQLLKQERLDDVFLVNLVGGKSWRVKKTYINLMASVNNLFNTKFLSGGFEQSRTANYGRMLQDNAQGVPSFGNRYFVGYGRTYMLNLAVSL >NZ_CP022383.1|WP_095901373.1|1591082_1593254_-|reprolysin-family-zinc-metalloprotease MKHIIYLFLYLSFTTAQAQWNITLPNGETLNLQWQERENSQQNKGIHTFVGYNQNQFVATLVVHSNKEASGSLQWGGITYQLSGSQQGKLSAKERFRHNPNAKCGIDTHQHKPLFPSPQNSPTARPITTTTSLMPNDPEGILYLYRLAVLVDYHDFAHTFGSDITQVKDFLLNLETFLNEVYVRDIGLKFSIVADNRLIIQEAAKQLYKQKSRRDIIENSTEKINEFIGDKQYDIGIVIAPGTDATLSGLAFFSGGFRLVRKGGASAIAENATIAHEIGHLFGADHTFKNVYSGNSLYTEPRYGQSLMGYSNNFPDGAFFSLPTAYQIRSGIVNRSYFKDSQRTQLVNRNGNDVSNFNYAYGIKTESSFPTIDRTKLQETYTIPKDTYFQFRIKATSPNNLPIYYTAQLTSKAGVNDPKFLTRKGKTEGNPITFQTQYSDLGGFIEYTRPNAKGEHLFWVATSNPAPQRFVNYDMVAVKVNIADGKTFAITNGMNDEYQGGDKIALHWQVDPNFFDSNSKVRILLSDDFGKTFKYTLVESTENDGTCEVTLPNIEIGTVEWGKQPKIQLPAGVIKVEVIDHIAFAITNVAPYKISNGKSVPNGGFKVKKKTGTPTPTEDSKPQQEPEKNIVIYNGVSTENANNYFTVEGADDNSPIHLLIFDEMGLKVYENEHYGKNGDYFRGNAKAKGFIGNNKALHGTYFYIVRYSKHGKKEQQKGFLYVR >NZ_CP022383.1|WP_095901374.1|1593371_1594334_-|elongation-factor-Ts MANITAADVNKLRQATGAGMMDCKAALIEAEGDFDKAIEVLRKKGQKVAAKRADRDSKEGAAIAQVNANHTVGAIISLNCETDFVAKNADFIALAKDLADLALTVNSKDELLALNYKGITVAEKLLEQTGVIGEKIELGAFEKVEAPFVGSYIHHGNKIAAIVGLSSAIADATEVSKNLAMQIAAMGAETLSYKDFSAEYIAKETEARIAIIEKENEELHRLGKPLKHVPQYVSQAQLTPEVLAKAEADAKAELKAEGKPEQIWDKILPGKIQRFISDNTTLDQEKALLDQVYIYDESKKVSEFVKSKNIEITAFKRVAL >NZ_CP022383.1|WP_016478130.1|1594457_1595183_-|30S-ribosomal-protein-S2 MANNIEIKDLLDAGVHFGHLTRKWNPNMAPYIYMERNGIHVINLYKTAAKIEEAAEALKKIVASGKKVLFVATKKQAKDVVAEKATSVKMPYITERWPGGMLTNFVTIRKAVKKMSSIDKMKKDGTFDTLSKREKLHVERMREKLEKNLGSIADMSRLPGALFVVDTLREHIAVKEAQKLNIPIFAMVDTNSDPREVDFAIPANDDASKAIEKVLSYITEAVNEGLSGRTAEANKEAEAAE >NZ_CP022383.1|WP_095901375.1|1595379_1595766_-|30S-ribosomal-protein-S9 MEVIHTIGRRKTAVARIYLKAGNGTITVNKRELNNYFTTPTLQYKVKQALTLVGAEDAYDVKVNVYGGGITGQAEAVRLAIARALCELNAENRTVLKPEGLLTRDPRMVERKKFGQKKARKRFQFSKR >NZ_CP022383.1|WP_002681881.1|1595765_1596221_-|50S-ribosomal-protein-L13 MDTLSYKTISANKATANKKWVLVDADGQTLGRLASKVAKLLRGKYKPDFTPHVDCGDNVIVINAEKINLTGNKWEDKTYLRHTGYPGGQRTTGVKQLLEKHPERIIEKAVKGMLPKTKLGAAVLRNLKVYAGTEHKQEAQQPVTINLNDLK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP022383_2 | 1962177-1962264 | Orphan |
NA
Consensus repeat of NZ_CP022383_2
|
1 spacers
spacers of NZ_CP022383_2
>2.1|1962206|30|NZ_CP022383|CRISPRCasFinder GGGGTAGGGGGGAGGTTTTATCATAGAGGC |
PD-DExK |
CRISPR arrays and Neighbor proteins around NZ_CP022383_2
The CRISPR arrays of NZ_CP022383_2 >merge|NZ_CP022383|2|1962177-1962264|CRISPRCasFinder TTGCGTCGCTACATTCCCCCTCTTTGAGGGGGGTAGGGGGGAGGTTTTATCATAGAGGCTTGCGTCACTACATTCCCCCTCTTTGAGG >NZ_CP022383|2|2|1962177-1962264|CRISPRCasFinder TTGCGTCGCTACATTCCCCCTCTTTGAGG GGGGTAGGGGGGAGGTTTTATCATAGAGGC TTGCGTCACTACATTCCCCCTCTTTGAGG
>NZ_CP022383.1|WP_095901636.1|1961479_1962151_+|16S-rRNA-(cytidine(1402)-2'-O)-methyltransferase MSKLYLIPTPIGNLEDITLRALRLLKEVDIILAEDTRTSSKLLKHYDIHTPMQSYHLFNEHKVVDSWVQRIKGGTTIALITDAGTPAISDPGFLLTRACIAEGVAVECLPGATAFVPALVNSGLPNDRFTFEGFLPDKKGRQTRLSQLATENKTMIFYVSPHKLLKTLTDFITTFGADRPASLSRELTKLHEETQRGTLQELLDFYKDKNVKGEIVMIVSGNK >NZ_CP022383.1|WP_095901635.1|1960406_1961480_+|hypothetical-protein MKEALEHDLIILATEILANEGEWNLAQLNKQAHKITEKITILTFVEKYYQTLGASEDRMYRTMRKVSDFIDDNRQEDLFDIEVEASEVQPITAPKTAAKETPKEEPATFVAMEPAPITKEEKTAPAEKPTPAEKPVAKKTLKEEQYPEPHWALPVNRSKTEEVAVPVEKVEEVRKVEVVEKPKPKETPAVAFEIEQPAIEQPTIEQPTIEKPEIDESELSKSAIKQLADFIQQPEYTDEERILKQTPSLEEFISQSKHTVFDKKDADEEVKPAQSLNDKFGKTAQIGLNDKLAFVQKLFFGSESEYNKVVKHIADLHSMQDAVIYIEQEVKPTYNYWKGKEEYEQRFLDLTLKRFEV >NZ_CP022383.1|WP_002680356.1|1959488_1960379_+|fructose-bisphosphate-aldolase MNKEQFEQMKNSKGFIAALDQSGGSTPKALKLYGVTEDQYSNEEQMFDLIHQMRTRIIKSPAFSHNKIIGAILFEQTMDRQIDGKYTADYLWDEKHVVPFLKVDKGLEPLDSDGVQLMKPISGLAELLSRANERHIFGTKMRSVVKKASPKGIARVVQQQFEVAKQIIAAGLVPIIEPEVDINIPDKSPAEAILRDEVRKQLDALPSTANVMLKLSLPTINNLYKEFTQHPRVVRVVALSGGYPREKANAILAENKGVIASFSRALTEGLSATQTDEQFNNALAHTVEGIYEASIK >NZ_CP022383.1|WP_095901634.1|1958459_1959251_+|superoxide-dismutase MKLKNAITLSAVALATAFTSGNLYAQKKKSKTKKAQEETVIGVQRGSNYGDPKDVKAEAGAFAIVPLKYKYSSLEEYIDEETMFIHFSKHYVAYLNNLNKAVAGTPQAQMSIEELLRTLDMNNATLRNNAGGYFNHNMYFAGMSRSGGSPKGALAAAIDRDFGSFENFKKAFAEAGAKRFGSGWAWLVVNNGKLQVVSTANQDNPLMPGLGVSGIPVLAMDVWEHAYYLKYQNRRGDYINNFFNVIDWDVAEQLYQKAIATKP >NZ_CP022383.1|WP_095901633.1|1957611_1958418_+|toxin-antitoxin-system-YwqK-family-antitoxin MKFLFLSISLLFVSTSFAQKLLPERDVRQQVFAGEKFFKLYTTDEPLDGTYKVVMSGGAYYEATFVEGRINGAYKRYNGEGKLTAEEHYKNGKNDGTWKYYNDKGKLISLKNYKDGKPWGKWQLWDDDAEWLQSESDYKGEKNYTLRIYYDNGKLSEEHTYKEGKEDGSQKYYQREGYLSKELLKENGELVLYKTYHSNGKLYSEMPYKDGKKNGKSVIYYPNGKITEEGTYKNNRKEGVWKYYNQAGRLTSEETYDHDNVVKTVNYN >NZ_CP022383.1|WP_095901632.1|1955086_1957558_+|glycoside-hydrolase-family-2-protein MNKINLLLLLSVVATSWAQQQQTLQKGWKFTREDKPAFSQTNYNDAKWQRVTIPHDWAIYGPFDMENDIQRTAIKQDGQKAAIEHTGRTGGLPFVGVGWYRTQFNVPELTSDKQVFVQFDGAMSNPEVFVNGQKAGEWHNGYNTFFLDITPYVKANNNTLAVRLNNLTQMSRWYPGAGLYRNVHIITKNKTHIPIWGVQITTPEITNNFAKVVVNTEFVADKKTSIAAETVIFNNQGERVAYANTKATPYTTDKISAELYIDNPRLWDIGKPYLYKAVTKLYEGDTVKDEVTTTFGVRSIELKPNDGLYLNGRRIKIQGVCMHHDLGALGAAVNESAIRRQISIMQDMGVNAIRTSHNMPAPEYVRLADEMGMLLAVESFDEWAIPKVDNGYHLYFKEWAEKDLTNLVKHYRNNPSVLMWFIGNEVEEQSVESGSQVARYLQDIIKKYDTTRPVSNGMDRPHDVLKNNMAATMQLMGFNYRPFKYKEAYRKLPQQLILGSETASTVSSRGVYKFPVERKSMAKYPDMQSSSYDVEHCGWSNLPEDDWIHQEDLPYTVGEFIWTGFDYLGEPTPYYVEWPSHSSYFGAVDLAGLPKDRFYLYRSHWNKEDETLHILPHWNWEGREGETTPIFVYTNYPSAELFINGKSQGKRSKDLSIKLEEEEKDGNPSDLNRQKRYRLMWMDTKYEPGIVKVVAYDANGKAVAEKEIHTAGKPYALRLSTEHKTELTPNSKDLAFITVEAVDKDGNLCPTVNDLVTFTVKGAGFYKAGANGDPTCTDQFHLPKMHLFNGKLVMMVQAGDKSGIINIEAKTKKLKGKIDIMVR >NZ_CP022383.1|WP_095901631.1|1954562_1955009_+|hypothetical-protein MKKLINVTFALLLLCSAVTLVSCGKDDDKGSGKEYPEGTRELTQNGITAKIDQERWGGAGGNTVLYVNLLLTKSNANTKPTGRVYLQARTTDGEILQAWSNGELGKGAEFGSRWIGNNHHLEFSFSLLNGKQMKANSVTISKIEVYSN >NZ_CP022383.1|WP_095901630.1|1953513_1954518_+|BtrH-N-terminal-domain-containing-protein MATIDFKHRQSAHCENGVASNLLYFNGVQLSEPMVFGIGSGLLFFYFPWIKVNEAPAISYRTMPGHIFSKVAKRLGFKVKRQKFSSPEKAQQALDDNLAKGIPTGLQVGVFNLPYFPDEYRFHFNAHNLVVYGKEDGKYLISDPVIPHVTTLTPEELTKVRFPSGVLAPKGHLYYPTHLPKELSLEKAIWKGIKQTCSTMLAPVPIVGVAGIKTVSKDILKWYRKKGAKTTNHYLAQLIRMQEEVGTGGAGFRFIYGAFLQEAGKLLKNDALIELSKEITDIGDAWRDFAVAIARVYKNRSTQADVYNALSQQLYAIAVREEAFFKKLKLVSKK >NZ_CP022383.1|WP_095901629.1|1952558_1953422_+|M48-family-metalloprotease MNPFFLYIIYVIAIFALKIFVAYLSFKIDKNKKYAFTDIFIWKDELFDSAVVSLIFILMMLFFSHIQFTNAMAVIMLALINSYYFLIVPLRALFQKKKYLKNEEIESFLRNEGYSYRIRIIKGKIENAFATGVFPFTKTILIGEPLCEKMTDNELKGVVFHEIGHLKLGHLYKMFFLNMVSSIILVFLYSFSTDIIESQHYRDTIMEPVIVALVGGIYGVIAFILIPYFFQRRLEYQADAFAVRKVGAEQYVQTLEKLNEISENKMMKGSVTHPSLKDRIKNAYNTR >NZ_CP022383.1|WP_095901628.1|1950235_1952200_-|1,4-alpha-glucan-branching-enzyme MNDPLLLPYLPIIQGRHQHFINTLHRVQGDASRLADACNSHLYYGLHRNNTEWVLREWAPNATAIFLLCDSNDWQKNNHYSFTKLNDQDWELRLPANILRHEMLYKLLVEWEGGSGERIPSHTTRAVQDDYTKVFSAQVWCPDHPYHWQHPRPKAAPHPLIYEAHIGMSTEHQRVSTFIEFRLYVLPRIAALGYNTIQLMAIQEHPYYGSFGYQVANFFAVSSRFGTPEELKELIDAAHGLGIRVLLDIVHSHSVSNEAEGLSLFDGTDYLYFHRGERGKHPAWDSRCFDYGKPQVLNFLLSNCKYWLEEFRFDGFRFDGVTSMIYYDHGLGKAFTDYSFYYDGNEDNDALVYLTMANQLIHELYPEALTIAEEMSGLPGLASPISEQGMGFDYKLSMGIPDYWIKLLKEVPDEQWHVGDIYYELTNKRAEERTISYAESHDQALVGDKTIFFRLTDKEVYTGMSVFDHNLVIDRAMALHKMIRLITLATAGGGYLAFMGNEWGHPEWIDFPRQGNNWSYAHARRLWSLVDNPDLKFKYLNAFDSAMIHFAAESKFLDREPRVLVRDIERQLLIFERSGYLFVFSFNPTTSYTDYQFDVPAGKYITVLNTDNPAFGGDNRIDESVEHFTQYTGKENLLSLYIPARIGMVLKLAD >NZ_CP022383.1|WP_095901637.1|1962492_1962870_+|endonuclease-domain-containing-protein MNNLIPYNPKLREFARFLRNNSTFPEILLWKEIKNKSLGVEFKRQVPILEYIVDFYCQELKLVIEIDGHIHDFRYVEDKNRQNQMEKYGLTFIRFSNEEIKTNMFSVVLSLESKIEELKREKNIL >NZ_CP022383.1|WP_095901638.1|1962897_1963533_+|tRNA-(adenosine(37)-N6)-threonylcarbamoyltransferase-complex-dimerization-subunit-type-1-TsaB MLLLSIETSGLNCSVALSKDNRILAEKSENAGKFTHSENLHLFIEAVMQQASMPLSALDAIAVSAGAGSYTGLRIGIATAKGLCFALGKPLIAIPTLQILAHQAKANCIIPMLDARRMEVYSAVFNSQYEFVTPTEAKILDEHSYQDELNKGKVTFLGDGSNKFAAICKHPNAVFIPDAFPQAKDMIPFAMTKYNAQDFEDIAYFEPTYLK >NZ_CP022383.1|WP_095901639.1|1963611_1964523_+|1,4-dihydroxy-2-naphthoate-octaprenyltransferase MKAITLIKAARLRTLPLSVAGIITGSALAYRNDPSAFSWQIFAWALLTTILFQVLSNFANDYGDGVKGTDNEHRLGPQRALQSGAISRSQLKRVVLITAGLSILSALILIYLAFGSENLLYLVIFFVLGLLCVAAAIKYTVGSSAYGYRGLGDLFVFVFFGLVSVVGSEFLYSQSLQWITFLPAISIGLLSVAVLNLNNMRDYENDQRSGKHTLVVKMGISLAKYYHYYLVVLAMVALLSFSVLRLQMRWELLYLIAFLPLVVHLLSVKRNNELQLLDKELKVVALSTFLLSVLFFIGIYISK >NZ_CP022383.1|WP_095901640.1|1964560_1965241_+|metal-dependent-hydrolase MEITFYGHASIGIKIGETHLLIDPFISENPLASDIDINQIKADYILITTAGQDHTWDVEKIAKRTKAQIISNFEIARYFFNLGIENSHPMDLGGTFNFPFGSVKMVSALHSSTFENGSDGGSACGFVIKTADKTVYIAGSTALHFDMQLIPVRYKLDLAVLPIGGNFTMDVEDAIMASDFVKCNKILGYHYNTMSHIKIDKDAAVRSFKAKGKELILLGVGQIVVV >NZ_CP022383.1|WP_095901641.1|1965340_1966249_-|hypothetical-protein MRRVSLLLSAVGLLVACNKNDYNEEPELIYFADKIHEEILAVGSTETTTTDYEFKYRGVGLVTSESVKIFTPSTITRTLTYETSYDGSFPKTTLYKVGGTTVNTTTYDQYIDRGSIKKKTIVETGKENLPTVEEYEYYNRNTLSKYTSAKKTATNTTVYTVKEYTWGGRNELNVKTSVYTQTGNTTTSATVVFTDKYLLNYNETVREHTRTENNQTVVITYTYDTKTNPRYLQFSHRMTHPDFFLQEGYGRNNITKKTVKYPNVAGKDYEEETAYEYFKNEYPLKAVIKRNGAVVGSKEFTY >NZ_CP022383.1|WP_095901642.1|1966431_1968078_+|Ig-like-domain-containing-protein MKHRLPILLSLLIALTLWQCARRGSPTGGPKDVTPPVLLQSVPKSGAMKFHGKKIRLQFDEFVVTKDVRKQLIISPPMKTFPIISPTSASKWLEINIVDTLKPNTTYVLNFGNSIQDYNEGNPISDFKYVFSTGSTLDSLWYEGDIADAIAPKPDNFVTVMLYPYNEQYNDSLVYKTQPTYVTNTLDSLDSFVLEYLKEGKYKLVALKEKSPNYIFNPKDDKIAFMDEPITVADLKGNPQGFYPKLRLFKEVPPYKAHRPTQAAGNRVQFGFEGKTDSVKIKPISHVTPDFKYIISKDPKKDTLNFWYTPKQKDSLVFTIAKQQKIDTFKVRLKEMKNDSLQVENLFSGDLPMHKDFGFKSNIPIVKTDPSKLKVFAGKDSTAVAVPFKTRLEPNNLEFYLSFDKKNDETYRIEALPNALTDFFGHTNDTLKVTLRTKKLEEVAILKMHLNVAENTLQYPVILQLTNEKATEVLREIYIEKPLTEYLFENVTPGKFRLRLIEDKNKNHQWDTGSFLEHRQPERVWYLPKEIELRANWEVEENWQITNP >NZ_CP022383.1|WP_095901643.1|1968128_1970915_+|DNA-polymerase-I MKKRLFLLDAYALIFRGYYAFIKNPRINSKGLNTSAILGFMNSLFEVIKKEKPDHLAVAFDKGGSVARTELFADYKANREETPEAIRLAIPYIHEILKALHIPIIEKEGYEADDIIGTLSKQAEAQNYQVYMVTPDKDYAQLVSDNIFMYRPARSGNDIEIWGVKEVQEKFEVQNPLQVIDFLGMMGDAVDNIPGLPGVGEKTAKKLLADFGSMENLLDNTDKLKGKLRENIENNKEKGILSKQLATIMLDVPVIFDEADFAMSQPDFEAVGKIFDELEFRQLLQNFLKTYQPEAVVQTPPKAASGQLDLFSQGDLFAQPQLVSDKKNSSNTPHFYQLADTPMAQQLLLQSLLQQTEVCFDTETTNVDALEAELVGISFCWAAHKGHYLPFPKEKEAATKLIEQFRPFFENEQIAKVGQNLKYDLKVLQNYGVEVQGALFDTMIAHYLLNPDMRHNMDILSETYLNYTPIAIESLIGKGKAQRSMRTVPIDEVKEYAVEDADVTWQLKNVFQSQMPQVNAQKIYTDLEAPLIKVLAAMEREGVNLDVDFLKEYSKTLDADIAQLETAIAQQAGETFNLASPKQLGDILFEKLKIDSKPKKTKTGQYATSEEVLAPFAAKHKIVADILEWRQVQKLKSTYVDALPLEVNKNTGRVHTTYMQTVAATGRLSSNNPNLQNIPIRTPRGQQVRKAFIARNPDYVLLSADYSQIELRIIAALSEEHNMIESFLRNEDIHRSTAAKVFNVPLEEVTREQRSHAKTVNFGIIYGVSAFGLSAQTDLSRSESKELIDTYYATYPNLRNYINKQIEFAREHGYVETILGRRRYLPDIHSHNQVVRGGAERNAVNAPIQGSAADIIKIAMIRIHNQLQAQKLQTRMLLQVHDELVFDVPKTELELVKPLIKDAMENAFTLAVPLVADLGVGDNWLDAH >NZ_CP022383.1|WP_095901644.1|1970974_1972060_+|DUF1016-family-protein MKEDFQHIIQLIQNAKERVYVKANSELVALYFNVGNILSDKISKGVWGDKTMTALADFIHTKMPNLSGFNRRGLYRMKQFYEIHDVNSEVFKLWEELVFSIERAEKQDVLIVSPPVTQFQKHQFYMENVLLKVSWSQHLDIFSKLKKPEEILFYLLETITEKWTRAELQRQLKTATYERTLLAKQATSPIIQKMDNIQTLFKDPYVFEFLDLPDNHSEKEFEKAIVLNLQKFILEIGKGFTYMGEQYRLQVGNKDYYTDLLFYHRDLQCLVLFELKIQDFEPEFLGKLNFYLEALDRDVKRPHENPSIGILLCKGKDTEVVEYSLARNPSPTIIAEYQTKLIDKQLLINKLNQLLAIFQNP >NZ_CP022383.1|WP_095901645.1|1972065_1972590_+|methyltransferase MYKVLPKKRYQKTLSILKRYAPQGSTILDIGVENPFSTIMKEEGYTVLNTTGQDLDFEANTLQNFQADFTTALEILEHLVNPLAVLQNIPTDKILITVPLRLWFAKAYRNTADPRDCHYHEFEDWQLDMLVEKAGFRIVYREKWTHPVKKLGLRPLLRYFTNRYYAVVAERINK >NZ_CP022383.1|WP_095901646.1|1972618_1973902_+|Na+/H+-antiporter-NhaC-family-protein MTKKNSIVSIIPLLIFVAIFLGSGIYFDNFYSLPAPIIALFAVVVALLIYRAPLNQKIALFFRGAGNSNVLQMCVIALLAGAFASVAKASGSIDSIVNMGMYYISPEYFPVGIFVIASFLSFATGTSVGTIMTLSPIVFNLAFESHSDTALIGAALLCGAMFGDNLSLISDTTIAATQSLGCKMSDKMKTNAQIALPVALLSAVILFFIGNPGATTFTHSEAYHTFNPLLILPYVAVVVISLFGVNVFVSLFLGVVFSGVMGLVYGKFSFLDLTQHTYKGFMDMADIFFLYFIIGGLAFLVERFGGIQFLMNLVAKRIKSGASALLGMGFLVTVADVCVANNTVAILIVSKISKRIAEQFQIPLRNAASVLDIFSCYAQGLIPYGAQVIALYQFAHTMDYLSLVGYSVYLHLLLIATLVYIQVRGKK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP022383_3 | 2974219-2974849 | Orphan |
NA
Consensus repeat of NZ_CP022383_3
|
9 spacers
spacers of NZ_CP022383_3
>3.1|2974244|41|NZ_CP022383|CRT CAAGTATTGGGAAAGGTGCTTTTTCTCAATCTGGATTAACA >3.2|2974310|44|NZ_CP022383|CRT TAACCATAGGAGCAGGTGCTTTCTCATACTGCGAATCTTTGAAG >3.3|2974379|41|NZ_CP022383|CRT TAAATATTGAAGGTTCAGCTTTTTCTGGATCTGGATTGACC >3.4|2974445|44|NZ_CP022383|CRT CAAAAATTGAAGATTGGACTTTTTCATATTGTAGTGATTTACAA >3.5|2974514|44|NZ_CP022383|CRT AAAGTATTGGAGAAAGAGCTTTTGAAAGATGCAGGAAATTAACA >3.6|2974583|41|NZ_CP022383|CRT CAAGTATTGGGGAGAGTGCTTTTTCTTATTCTGGATTAACC >3.7|2974649|41|NZ_CP022383|CRT CAGATATAGGTAAAACAGCTTTTGAATATTGCCATTTAGGC >3.8|2974715|41|NZ_CP022383|CRT TAAATATCGGAGAGGGAGCTTTTTCTTATTCTGGATTAACT >3.9|2974781|44|NZ_CP022383|CRT CAAGAATTATGAAAGACACTTTTAAGGGCTGTGGCTTAATGACA |
CRISPR arrays and Neighbor proteins around NZ_CP022383_3
The CRISPR arrays of NZ_CP022383_3 >merge|NZ_CP022383|3|2974219-2974849|CRT TCTATTACAATCCCTAATAGTGTAACAAGTATTGGGAAAGGTGCTTTTTCTCAATCTGGATTAACATCTATTACTATTCCTAATAGTGTAATAACCATAGGAGCAGGTGCTTTCTCATACTGCGAATCTTTGAAGTCTATTACTATCCCAAATAGTGTAATAAATATTGAAGGTTCAGCTTTTTCTGGATCTGGATTGACCTCTATTACTATCCCTAATAGTGTAACAAAAATTGAAGATTGGACTTTTTCATATTGTAGTGATTTACAATTTGTTACGATTCCTGATGGTATAAAAAGTATTGGAGAAAGAGCTTTTGAAAGATGCAGGAAATTAACATCTATTACAATTCCTAATAGTGTAACAAGTATTGGGGAGAGTGCTTTTTCTTATTCTGGATTAACCTCTATTAACATTCCTAATAGTGTAACAGATATAGGTAAAACAGCTTTTGAATATTGCCATTTAGGCGCTATATCTATGCCTAATAGTGTAATAAATATCGGAGAGGGAGCTTTTTCTTATTCTGGATTAACTTCTATTAACATTCCTAATAGTGTAACAAGAATTATGAAAGACACTTTTAAGGGCTGTGGCTTAATGACATCCATCGTTATTCCTAATAATGTTA >NZ_CP022383|3|2|2974219-2974849|CRT TCTATTACAATCCCTAATAGTGTAA CAAGTATTGGGAAAGGTGCTTTTTCTCAATCTGGATTAACA TCTATTACTATTCCTAATAGTGTAA TAACCATAGGAGCAGGTGCTTTCTCATACTGCGAATCTTTGAAG TCTATTACTATCCCAAATAGTGTAA TAAATATTGAAGGTTCAGCTTTTTCTGGATCTGGATTGACC TCTATTACTATCCCTAATAGTGTAA CAAAAATTGAAGATTGGACTTTTTCATATTGTAGTGATTTACAA TTTGTTACGATTCCTGATGGTATAA AAAGTATTGGAGAAAGAGCTTTTGAAAGATGCAGGAAATTAACA TCTATTACAATTCCTAATAGTGTAA CAAGTATTGGGGAGAGTGCTTTTTCTTATTCTGGATTAACC TCTATTAACATTCCTAATAGTGTAA CAGATATAGGTAAAACAGCTTTTGAATATTGCCATTTAGGC GCTATATCTATGCCTAATAGTGTAA TAAATATCGGAGAGGGAGCTTTTTCTTATTCTGGATTAACT TCTATTAACATTCCTAATAGTGTAA CAAGAATTATGAAAGACACTTTTAAGGGCTGTGGCTTAATGACA TCCATCGTTATTCCTAATAATGTTA
>NZ_CP022383.1|WP_095902304.1|2972491_2973082_+|collagen-like-protein MKRSIIITVFCACVAFLFWQCNKESVVEKVEYQRGNIVHYGSGAPSSTIGEIGDYYFDLSVSDLYGAKTESGWGDPVSLKGDRGERGEQGEQGEKGEKGEAGDKGEKGDKGAQGDKGLTGDKGEQGDKGLVGGKGEKGEKGDEGAPTTRIHLGEGEPRSYIGKEGDWYINTRTGVLYGPKKINGIVGKKLYFQILK >NZ_CP022383.1|WP_002678703.1|2971054_2971270_-|DUF2892-domain-containing-protein MKRNISTLDKNIRLLIIVITAILGYFNEFSITIASVLSGVSILLLVTILINFSPIYALLGISTYKPKEKNS >NZ_CP022383.1|WP_095902303.1|2970631_2971051_-|rRNA-maturation-RNase-YbeY MTSFFNETPFVFPYNKNSVKCWVKSVAKCEGKEAGNINYIFCDDEYLHKINVEYLQHDTLTDIITFDYTEGKVLHSDIYISVERVKENAEIFKVPFQRELLRVLAHGLLHLCGYKDKTPKDSALMRQKEEEMMLLFDQL >NZ_CP022383.1|WP_095902302.1|2969203_2970619_+|oligosaccharide-flippase-family-protein MLRKLFKDTIIYGIATVLPRVLTLLLTRLYVNKLDTADFGIYSGLFVYLILGNVLLSYGMETAFFRFMNKGEQKKKVQSTALTSLTISSLFFLLIAYLLRHFIASWLNYDVQYIAFAIYILVLDALVVIPFAWLRNKGKAKLYAMVKIGNTALNLALNIYYLNYLSAENLATNGGVYYIFLANVIASLATFIVLLPIYLKIRFKFYYPLWKEMMIYAFPVLLAGIAFAVNEGFDRVFLRMLLPADTADGTIGIYSACYKMGVFMNLFVTAYKLGIEPFFFSSAQDKNAPKTYARITEYFIVCGGFILLFITVFTDVFKLILIPNKAYWDALWIVPIILLANLCLGIYHSLSVWYKVTDRTSFGAIVSLIGMAITVVFNFALIPWLSYKGAALATLITYAAMMSISYYYGQKYYPIPYKKRKIKIFLGMSVLFSFINFYAFGNSIYIGILFLLIYGYLASFAIKLKALRSKA >NZ_CP022383.1|WP_095902301.1|2967826_2969191_+|DNA-repair-protein-RadA MAKLKTAYFCQNCGAQYSKWQGQCYTCKQWNTIVEEVVQKDTAPAWQEKESPIKNKAPKYIPISQIDTQQEARLNSNNHELNNVLGGGIVPGSVTLLGGEPGIGKSTLLLQIALNLPYRTLYVSGEESEKQIKMRADRIPHKLDNCFILTETKTQNIFQQIKELLPEIVIIDSIQTLQSDYIEASAGSISQIRECTSELIKFAKTTHTPVILIGHITKDGQIAGPKILEHMVDTVLQFEGDRNHVYRILRSLKNRFGSTSELGIYEMLSNGLREVNNPSEILISKTDETISGTSIAATLEGMRPLMIEIQALVSTAVYGTPQRSTTGFNAKRLNMLLAVLEKRAGFRLGAKDVFLNITGGITIDDPATDLGVAMAILSSNEDIAIDKDVCFAGEVGLGGEIRPVQRVEQRITEAEKLGFNTIFVSKYNKIALKNTKIKVIKVAKIEDAIQELFG >NZ_CP022383.1|WP_095902300.1|2966939_2967818_+|glucose-1-phosphate-thymidylyltransferase-RfbA MKGIILAGGSGTRLYPITKGVSKQLLPIYDKPMIYYPLSVLMLSGIREILVISTPQDLPGFERLLGDGSDFGIRLSYAEQPSPDGLAQAFIIGEEFIGDDDVCLVLGDNIFYGQSFSKMLSQAVENVTKERKATVFGYYVKDPERYGVAEFDNAGNVLSIEEKPAHPKSNYAVVGLYFYPNKVVKVAKNIKPSARGELEITTVNQVFLNDGELKVQLLGRGFAWLDTGTHDSLSEASNFVETLEKRQGLKISCLEEIAYRKGWITAEKLQELAKPMLKNQYGQYLLQIINNG >NZ_CP022383.1|WP_095902299.1|2965746_2966919_+|metallophosphoesterase MKKTYITFLAFLCCFVLGNNWLQAQELVPILSQARLHKDNSTSGFDGPYIFYTDKGIVVKQVGEKKGVVAPSIQTFTKDIKGKKFTAQLSEKESFSFKIKKELKNETAVYNMPEKLIAISDIEGEFEAFKQFLIANGVMNTKYQWKYGKGHLVTVGDFFDRGLWVTQTLWLIYHLEQQAEKAGGKVHFILGNHDLMNMNNDFRYVRKKYFQNASLLQDEYLHFYKPNTELGRWLATKNIVEKIGDYVFVHAGISIEIANLGLTVQELNDKARDYYFDNLKARKRKDSLYSILYQFGISPTWYRGWGKQTIDITEAETILERWQVGKFVIGHTLHSEVTYLMNKHVIDLDVAHAKGVVQGLLIENGNEYKVDKQGNKTAIIENASIPEDDD >NZ_CP022383.1|WP_095902298.1|2964479_2965613_-|glycosyl-hydrolase-family-5 MNKLVSLVAFLWLIIACEKSGEDNDPSTQTSTTQSTTTQTPTSPPITPPTPPQPPQNNIPYDWDFYLEPSRLNAVKKEPNVELKKLYYQVYGTPCGGWYDGMSPNSEKNNGWLKNFVEGAERAHKTPIVVLYGIPERDCGSFSKGGHPNAASYKGWIDRVSAIIGQRRAVVIIEPDAINYCGHKKGSAKYNERAELLRYAAEKLNNNNPNVASYIHAGNGPLVTNNTEAVATAIIDGGLKYMRGFALNVSGLGGTAEEQAAAEKFVTYLATKGFKNVHYVIDTGRSGINRPKHQNANAPYNSCNNFNAALGPRSTTKTTGAHADAYLWINGGGGSDGECNMGAPAAGQPYPEYTRALINNAIRVKSIQILELPNDLK >NZ_CP022383.1|WP_095902297.1|2963868_2964462_+|M15-family-metallopeptidase MKYILAFFVTFIATAQQTDFVLLKSLSSDFVFDMKYATPDNFLKQAVYECGECYLRKKTAEALVKANEEFKTLGYRIKLFDCYRPLEVQKKMWKILPGTHYVANPAKGSKHNRGAAVDLTLVDKDGKELDMGTPFDFFGEKAHHTCTTLPKKVLENRKLLKDILNKYNFKSIYSEWWHYEYRPEMQSKVENFQWQCE >NZ_CP022383.1|WP_002678807.1|2962797_2963730_+|malate-dehydrogenase MKVTVVGAGAVGASCAEYIAIKDFASEVVLIDIKEGFAEGKAMDLMQTASLNGFDTRITGVTNDYSKTAGSDVAVITSGIPRKPGMTREELIGTNANIVKSVVEQLVKYSPNVIVIVVSNPMDTMAYLVHKATKLPKNHIIGMGGALDSARFKYRLAEALNSPISDVDGMVIAAHSDSGMLPLTRLASYRGVPVTEFLSADRLNQVAEDTKVGGATLTKLLGTSAWYAPGAAVSALVQAIACDQKKLFPCSVLLEGEYGQKDVCVGVPVIIGRDGVEKIVEVKLNDAEKAKFAESTEAVREVNKALASVL >NZ_CP022383.1|WP_095902306.1|2975410_2975938_+|hypothetical-protein MKVIHIQGAQNTGKTTLCNRIDEWLLNNRIYELQQLISSRKSGQNSSIKEIIKIEVIKKILPHNDFYAVYDIQFSTGGNKRVIINTLSDCNTITEFENFYNNNKNNGYDVFITTIRNGNTPQKVIKEFYEKELKSINLPTKPNGIINLNKNPITNVVESYAMLYYEFISLFEKLI >NZ_CP022383.1|WP_095902307.1|2976152_2978093_-|TonB-dependent-receptor MQKFYIAAFIFLYSSLIFAQNDTLQNDLINLKEVIVTATRAQKNLKNVPITVQVVTAEDIRKSQATNFQSFLETEFAGINFTYDGGMPNINMMGFGGKYVLFLIDGERMAGETFDNIDYNRINLEDIERIEIIKGASSSLYGSNALGGVINIITKNAKKSFEGDLSYQYETIESHKANIGIGSKQRWGNIRLTSFYNFREPYLLKDREPLITYKNGVAEPSPKSELNIAGFTTYGVTPKLSFNLSPKTDLTLTPNYYFSERNQGDRDAERKRERYYNYTMSAKLNTALIESKKLSLSGAFDRYDKFNYFPLLKEKEKNYENTLARAGAQYNQSLWTKHSLVAGAEIFSDELLSFRFNETGTEAKENAQNYTLFTQQEWTLSPAFTLVTGVRIDYHSLFKEHFTYRLSGMLKVESFTFRGGFSSGFRSPTLKELYTNWFHPWGGGFQIIGNKDLKPESSNNFNFSVDFDSPKVNITAMTQLSLMKDKIDYRWTVASDTIRYINFKDKTQIISSELAASYRPTQALRFKASYAYYYISNSANENRPHTLTLKAEYIPKHNALYIPNIVFSGKYVSATRIHETDTSNNQLYTYYEPYSIWQLQVFSKLPYHLTLTAGIDNLFNYITPTTSFYSSISKGRTYSLGLKWNY >NZ_CP022383.1|WP_095902308.1|2978162_2978876_-|hypothetical-protein MRLQFLAPAIIAGLMMFTTSCSKDDKKTEAPKLPYKEQKVDASSYDKWVYFSFENGVVTSTTATPTTTDWDIAFNRYSVRTNSGTSGSGNGGALTTNETDWDKVATASSTASFTVDGSIYVFERGQNNQGGLGTTNASRVISGGHGENFWNMISRMPNVAKIDKSSIVHNNGWLTMDYKPNGKQLAPSYTYNNWVYIVKTPAGKFVKIQLTDYKNAKDETGYITFKYQIANDKNEFK >NZ_CP022383.1|WP_002678688.1|2978975_2979959_-|polyprenyl-synthetase-family-protein MKITAQIKEPIREEMELFEQKFRNSMSSKVALLNRITYYIVNRKGKQMRPMFVFLVAKMLSDDSKVSERTYRGASVIELIHTATLVHDDVVDDSNKRRGFFSINALWKNKIAVLVGDYLLSKGLLLSIDNGDFDLLRIISVAVREMSEGELLQIEKARRLDIVEDVYYEIIRQKTATLIAACCAMGACSVQPEQPELIEKMRLFGEYIGMAFQIKDDLFDYTEDAIGKPTGIDIKEQKMTLPLIYVLNTCTEKEKQWLINSVKKYNKDKKRVKEVISFVKTHNGLEYATAKMKEFQQKALNILNDFPASSYKEALTLMVNYVIDRKI >NZ_CP022383.1|WP_095902309.1|2980009_2981977_-|ATP-dependent-metallopeptidase-FtsH/Yme1/Tma-family-protein MSDNKSNGNRFSAYWLYGVLLLILIGFNYYSGASFWSQPKEIPQSKFEEYLSNGDVAKIIILNRKEANVYLTEDALKKEEHKEVKPTGNSLMQGANAGEVPQYRFELGDLSNFENRFDQIVKDNNLSTTRENKTQQNVIGDLLFTLLPIIVVIGLWIFVMRRMAGAGGPGGIFSIGKSKARMFDEKKETRVTFQDVAGLEGAKEEVQEIVDFLKNPDKYTSLGGKIPRGALLVGPPGTGKTLLAKAVAGEAQVPFFSLSGSDFVEMFVGVGASRVRDLFKQAKEKSPSIIFIDEIDAIGRARGKNAMTGANDERENTLNQLLTEMDGFGSHTNVIVLAATNRADILDKALMRAGRFDRQIYVELPNLNERKQIFQVHLRPIKTAETLDIDFLAKQTPGFSGADIANVCNEAALIAARKGKTAVSKQEFLDAVDRIVGGLEKKTKILTPEEREAIAFHEAGHATVSWLLEHAAPLVKVTIVPRGQSLGAAWYLPEERQIVRTEQILDEMCAALGGRAAEKVVFNKISTGALSDLEKVTKQARAMVTIYGLNEKIGNLTYYDSAQSDYSFTKPYSEKTAHLIDEEISKIIEEQYQRAIDILTENKDKLTTLANLLLDREVIFRDDLENIFGKRKFKDAHNSNEEETPLKTENTSPTE >NZ_CP022383.1|WP_095902310.1|2981981_2982353_-|ribosome-silencing-factor MNKEISSNQLISNIIAGIEKVKGTDITIMDLREVENTVCDYFILCNGSSNTQVSAISGAIQKIVSQADGHQHPWHVEGEANAEWILIDYVDVVVHIFQTKVREHYDLEGLWGDAKFINLETNY >NZ_CP022383.1|WP_002678914.1|2982542_2983331_-|hypothetical-protein MAQKVDITVEEKLRALYDLQLIDSKIDELQYTRGELPLEVSDLEDEVEGLKKRHAKFKEELDALQAGIAEKKATIEEAKALIKKYTTQQKNVRNNREYTSLSKEIEFQELEVQLAEKRIKEGKAQVDQKKEAIKQVDERKAQRDAHLKHKKAELNSVLAETEKEEAFLKEKAEEYAAKIEDRLLKAYQRIRSSVMNKLAVVPIQRGASGGSFFTIPPQVQVEIASRKKIITDEHSGRILVDAALAEEEKEKMEALFKSFKAS >NZ_CP022383.1|WP_095902311.1|2983333_2984431_-|Nif3-like-dinuclear-metal-center-hexameric-protein MKITDIIQELEKLAPLQYAEGFDNVGLLVGDANAEVKGVLITLDTLEAVVDEAIAKKCNLIVSFHPIIFSGLKSLTGKNYVERVVMKAIQHQIAIYSMHTALDNQFLGVNASICNRLELQNRRILIPQPHTIQKLITYVPKSDAENLRKALFVAGAGNIGNYAECSFNLEGKGTYKGNEESHPTIGEPNVFHTEDETQIGVIFPKHLQRQILQALRQNHPYEEVAFEIYTLENEHQHIGMGMIGELNKAMSEKVFLAYLKERMQVSVVRHSALLGKDVKKVAVLGGSGAFAIENAKRAKADVYITADLKYHEFFKAEGQILLADIGHFESEQYIKSLLFDYLSKIFPTFALSISNVDTNPIKYYS >NZ_CP022383.1|WP_002678860.1|2984427_2984976_-|protein-tyrosine-phosphatase MKKTLLICFILFALLPLWSQHKATLIHADANLYKVDSLLYRSEQLVTEDKAIIKNIPIKSIVNLRYFTRSGDKKIFNASDNIKLINHPLLTWRIKAPEIAQTLKIIREHQKQGAVLIHCYHGADRTGIMVAMYRIIYHNWTIEQAKKEMLNGPYGYHSVWKNLEALFTESTVKEVRKHLGMQ >NZ_CP022383.1|WP_095902312.1|2984972_2985635_-|RNA-methyltransferase MNDKLLSYLERFITEERKERFLEIIAQRTNHFTVAMEDVFQMHNTSAVVRTCEVFGVQQAHSIEGRFGKRLDAKIAMGAQKWVDVFRYNDTQSCIDSLRAQGYQIVATTPHKDAYFLNDFDITKKSAFFFGTEKEGLSAQVLSQADTFLKIPMVGFTESLNISVAVAIVLQQLTDKLRRSEVAWQLSDNERLNTLIHWTKQSIRNVDDVLTRYEEIKNTL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP022383_1 | 1.6|1574477|30|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574477-1574506 | 30 | NC_031020 | Morganella phage vB_MmoM_MP1, complete genome | 126618-126647 | 6 | 0.8 |
NZ_CP022383_1 | 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574249-1574278 | 30 | MF001355 | Proteus phage PM2, partial genome | 130383-130412 | 7 | 0.767 |
NZ_CP022383_1 | 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574249-1574278 | 30 | KP890823 | Proteus phage vB_PmiM_Pm5461, complete genome | 142786-142815 | 7 | 0.767 |
NZ_CP022383_1 | 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574249-1574278 | 30 | NC_019763 | Oscillatoria nigro-viridis PCC 7112 plasmid pOSC7112.01, complete sequence | 74100-74129 | 7 | 0.767 |
NZ_CP022383_1 | 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574249-1574278 | 30 | NC_019763 | Oscillatoria nigro-viridis PCC 7112 plasmid pOSC7112.01, complete sequence | 124467-124496 | 7 | 0.767 |
NZ_CP022383_1 | 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574249-1574278 | 30 | NC_019731 | Oscillatoria nigro-viridis PCC 7112 plasmid pOSC7112.03, complete sequence | 12069-12098 | 7 | 0.767 |
NZ_CP022383_1 | 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574249-1574278 | 30 | NC_019731 | Oscillatoria nigro-viridis PCC 7112 plasmid pOSC7112.03, complete sequence | 102215-102244 | 7 | 0.767 |
NZ_CP022383_1 | 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574249-1574278 | 30 | NC_017725 | Borreliella garinii BgVir plasmid cp26, complete sequence | 8969-8998 | 7 | 0.767 |
NZ_CP022383_1 | 1.4|1574325|30|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574325-1574354 | 30 | NZ_CP011023 | Bacillus pumilus strain SH-B9 plasmid pSHB9, complete sequence | 60389-60418 | 7 | 0.767 |
NZ_CP022383_1 | 1.2|1574173|30|NZ_CP022383|CRISPRCasFinder | 1574173-1574202 | 30 | NZ_CP013139 | Pseudoalteromonas sp. Bsw20308 plasmid pPBSW1, complete sequence | 502235-502264 | 8 | 0.733 |
NZ_CP022383_1 | 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574249-1574278 | 30 | NZ_CP039041 | Piscirickettsia salmonis strain Psal-072 plasmid unnamed1, complete sequence | 134970-134999 | 8 | 0.733 |
NZ_CP022383_1 | 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574249-1574278 | 30 | NC_006128 | Borreliella bavariensis PBi plasmid cp26, complete sequence | 8983-9012 | 9 | 0.7 |
NZ_CP022383_1 | 1.7|1574553|31|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574553-1574583 | 31 | KF114880 | Leptospira phage LinZ_10, complete genome | 86241-86271 | 9 | 0.71 |
NZ_CP022383_1 | 1.7|1574553|31|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574553-1574583 | 31 | KF114879 | Leptospira phage LbrZ_5399, complete genome | 32076-32106 | 9 | 0.71 |
NZ_CP022383_1 | 1.7|1574553|31|NZ_CP022383|CRISPRCasFinder,PILER-CR | 1574553-1574583 | 31 | KX656785 | UNVERIFIED: Leptospira phage Lin_34, complete genome | 21420-21450 | 9 | 0.71 |
1. spacer 1.6|1574477|30|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to NC_031020 (Morganella phage vB_MmoM_MP1, complete genome) position: , mismatch: 6, identity: 0.8
agattagacaatattctgcaccat-tatatg CRISPR spacer tatttagacaataatctgcaccatctattt- Protospacer . ********** ********** *** *
2. spacer 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to MF001355 (Proteus phage PM2, partial genome) position: , mismatch: 7, identity: 0.767
ctatggaaaacacttacttttttaaagcca CRISPR spacer ttagggaaaccacttacttttttaacttta Protospacer .** ***** *************** ..*
3. spacer 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to KP890823 (Proteus phage vB_PmiM_Pm5461, complete genome) position: , mismatch: 7, identity: 0.767
ctatggaaaacacttacttttttaaagcca CRISPR spacer ttagggaaaccacttacttttttaacttta Protospacer .** ***** *************** ..*
4. spacer 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to NC_019763 (Oscillatoria nigro-viridis PCC 7112 plasmid pOSC7112.01, complete sequence) position: , mismatch: 7, identity: 0.767
ctatggaaaacacttacttttttaaagcca CRISPR spacer cgcaggaaaacagttacttttataaagact Protospacer * ******** ******** ***** *
5. spacer 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to NC_019763 (Oscillatoria nigro-viridis PCC 7112 plasmid pOSC7112.01, complete sequence) position: , mismatch: 7, identity: 0.767
ctatggaaaacacttacttttttaaagcca CRISPR spacer cgcaggaaaacagttacttttataaagact Protospacer * ******** ******** ***** *
6. spacer 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to NC_019731 (Oscillatoria nigro-viridis PCC 7112 plasmid pOSC7112.03, complete sequence) position: , mismatch: 7, identity: 0.767
ctatggaaaacacttacttttttaaagcca CRISPR spacer cgcaggaaaacagttacttttataaagact Protospacer * ******** ******** ***** *
7. spacer 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to NC_019731 (Oscillatoria nigro-viridis PCC 7112 plasmid pOSC7112.03, complete sequence) position: , mismatch: 7, identity: 0.767
ctatggaaaacacttacttttttaaagcca CRISPR spacer cgcaggaaaacagttacttttataaagact Protospacer * ******** ******** ***** *
8. spacer 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to NC_017725 (Borreliella garinii BgVir plasmid cp26, complete sequence) position: , mismatch: 7, identity: 0.767
ctatggaaaacacttacttttttaaagcca CRISPR spacer taacagaaaacacttccttttttaaatcta Protospacer . *..********** ********** *.*
9. spacer 1.4|1574325|30|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to NZ_CP011023 (Bacillus pumilus strain SH-B9 plasmid pSHB9, complete sequence) position: , mismatch: 7, identity: 0.767
gatttttgccttacttatgaccaacaaaga CRISPR spacer gattttttccttactcatgaccagctgtaa Protospacer ******* *******.*******.* . .*
10. spacer 1.2|1574173|30|NZ_CP022383|CRISPRCasFinder matches to NZ_CP013139 (Pseudoalteromonas sp. Bsw20308 plasmid pPBSW1, complete sequence) position: , mismatch: 8, identity: 0.733
tttataaagcagtacaaaccctttgcgctt CRISPR spacer gttataaaccagtacaaagcctttaatgta Protospacer ******* ********* *****. *
11. spacer 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to NZ_CP039041 (Piscirickettsia salmonis strain Psal-072 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.733
ctatggaaaacacttacttttttaaagcca CRISPR spacer atcactaaaatgcttacttttttaaagcct Protospacer * ****..*****************
12. spacer 1.3|1574249|30|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to NC_006128 (Borreliella bavariensis PBi plasmid cp26, complete sequence) position: , mismatch: 9, identity: 0.7
ctatggaaaacacttacttttttaaagcca CRISPR spacer taacagaaaacacttccttttttaaatttg Protospacer . *..********** ********** ...
13. spacer 1.7|1574553|31|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to KF114880 (Leptospira phage LinZ_10, complete genome) position: , mismatch: 9, identity: 0.71
attgccttccaaggcgttcatactctatgag CRISPR spacer aaaaatttctaaggcgttcatactctctgga Protospacer * . .***.**************** **..
14. spacer 1.7|1574553|31|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to KF114879 (Leptospira phage LbrZ_5399, complete genome) position: , mismatch: 9, identity: 0.71
attgccttccaaggcgttcatactctatgag CRISPR spacer aaaaatttctaaggcgttcatactctctgga Protospacer * . .***.**************** **..
15. spacer 1.7|1574553|31|NZ_CP022383|CRISPRCasFinder,PILER-CR matches to KX656785 (UNVERIFIED: Leptospira phage Lin_34, complete genome) position: , mismatch: 9, identity: 0.71
attgccttccaaggcgttcatactctatgag CRISPR spacer aaaaatttctaaggcgttcatactctctgga Protospacer * . .***.**************** **..
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1410304 : 1428953
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP022383|1410304:1428953|DBSCAN-SWA TTTAGTTCACGGTTAATTTTGGTTTTAAGAATAGATTTTGCTTTAATACATATCTCTGGATAACATCTTCAGGAAGATCAAAAGCATTTGCTAAATCATCATAAGTATACTCAAGTTCTTTTAAGTGTAATTTTACAGCCATATCAAATGTTATAGCCTCATCTATACTCACACTACCTTTTTCATTTTTCTTCTCTCCTATTCTGCTCAGTTCAATGTTGAAATATTTATATTTATTTTGATTAATAACCCCTAATGAGTATGCGCGCCTGATAATTGAAGCTTTTGAGGTTAGCCAATAGCTTTTTAATGCGCTTAGACTTGAAAGTTTGAGCCCCTCTAATGAGTTCCTTATCGCTCTTTCAGGCATTAGAAATTCAGAAGCAAAATCGTTAGCTTCTTGCTCTTTGTCTCTAATGTTTGGCACTGGGAAAGCTGTGTGCATTATCAAATGCCCTAATTCGTGGGCTAATGTAAAACGTTTCCTATCATTAGGAAGTTTTTTATTCAACACTATTACTGGAAATCCTTTTTTAGTAAATAACGATATACCATCAAACTTTTCATTAGCATTTAGTTCATAAATGATAATTCCTTTATCCTCTATAATTCTAAAAATGTCCTCAATAGGTTCATTGTCGAATATTCTAAAATCCTTGCGTGTGAATTGAGCAATCTCTTCTGGAGTATATCCTTCTTCAATATCTAAGGTTTTAAGTGAGAAATCGGGATAATCAATAGAATTTGACATTTCGTCAATAATATAACCGATGAACGTACAAGATGTTTCAAAGTCTTGAATAATTGATTTTGGAATGGTGTTTTTTTTACGATAGTTAGCTGTTTCTAATTCTACTGATATTTTGCGTTCAAAAAAACCCTTTGGGAATTGCAACACGTTAAATATCTTTTCCAAAATCTCATCGGATAAACCACCAAGTCCTTTTTCAAACTTGGATAAATTAGATTGTGAAAGCCCTTGCACCGCTTTTGACAATTTTGTTTGCGTCCACCCTCTGTATTCCCTTGCAAGGGTAAGCTGATTGTGATTAACTTTCATAGATGTTTTTGTTATTAGTCGTTAGTTTTTGTAAAAAAGGTAGTGAGATTTGTCGGTTCAACAATAATACTAAATTGCCTTTTTACGTTTAGATTGCTCTTTAACTTTAGGTAGCAATCTTCCAGTTTTATTAGGCGTTAATATAGTAGGAGCATTTAATGATGTTAGTTCATCTTTACGAAGTCTCCATTTTATTTTTTCCTCATCTATATAGACAAAATGAGGATTTATCAAATCACCTGACTTGCTTTTCTCCCAACCGAAAAACACGATAGGATTTTCGTAATCTGTGGGGTCAAAAAGTTGTGTCTGCATTTGGTTTATAATGGACTCATTGGCATTTGTTTTTATATTCATAGGCATACCTTTTTTGTTTAACTTTTTAAACAGAAAAGAGTGTCCGCAGAATGTTATTACAAATCGTTTGTATTTCCAAAACTTCCATTTTTCAGGAAAAACTTTTTGGAGTTCACCTAATAAACAAGTTTGAAAAACGCTTGCTTCAAAGCCTCTACTTCTTGATTGTGGAGGGAAAAGTTTCATAGTTAAACTAAACTTTTCTTTAGCGTTTTCAAAAGCATTAAAGAACTTAATAAGCCCTTTTTCAGTACATTCAATAATAAATGTATCTTTTTTTGCGAGCTTTCGCTTGCAGGATTGGACAAAAGTTTGTAATTTTGTCGCGGGTAAATAATTGTTTATTATCATAACTAAAAAAATTAAGACAAATCCCACTACCTAAAATATATCTGCTGCAACAGATATATTTTTTACGGTGCAAATATATAAAAAATAAATGGAATTGTATATATTTTTTTGATTTTTTTGTATTTTATTTTTGGATACACTTCTAATTTATTGATAATAAAAATATTAAACTCTTCTTATCACTATCATTTTTTCTTCAATGATTATTGATTTTAAATATATTCCTACTTTTGCCCAAAATAAAACTAATATACCAATGAAAAGAATTATATTATTACTGATTGCCCTGCTCGCTATGGGGTGCTCGAAAGAAGAAAAAGAAGAAGATTTTAGCCAATATAAATTAAATGTACCTGATTGGTTAATAGGAGATTATGAATACTCTTCATGGGGTATAACTTACGATTTCGGGTTTTCAAAGAATAATTACTATTTTTCCCATGAAGATAAAAGGAATTTTTTCGAAATGTTTAAGAGCCGTTTAGTGAAGGAAGGAGAATATTCTTATTGGAATTATAAAGTGTATTACTTTATAAGCTATGCAACACAGACAAAAAAGTATTTCAAATATACTTTTGAGATGAAAGATAAAAAATGCTTTTTTGAATTTAATGGCACAACTTACAACCTTTGTAATGAAGAAAATAAAAATGATAGAGATATTAGGCGTATATATGAAGAAGTAACGGAGTATGGAGCTACAATTAAAAAAATATATGATGATGAGTATACTTATAAAAAAGTAAAATAAAAAAAGCCCTCGTAACGAGGGCTTTTTTTTATTGATAATCAGCAACTTGATAAGTTTAACAAAATTTTAACTCTATAAACACTTGACAAAATGAAAAACTATTGTAATTTTGCCTTGTCTAATCAGTAGCAAGAACTGTATAGAAATCTTGCAAATAAAAAAATAACTCTATTTCATAGATTATCTTTTAGCCCAAAAAAGGTGTAGTATAGCAGTAATGTTATACAATCAAAAAGCGATTGCTACTGATTAGACAACACCCACTTTTTGGGCTTTTTTCTATTGAAACATTAATTTAATTATTTATAAATGTCTAATCAGATTAAAAATGTTTCTATGGTGAATAAAAGTAGCTGTAAAAACACGCCCAGCAGTGCGAAAACTGCCCACACTTCACTTTTAGAAATCCTACCGAAAGTAGAACACATCGGAATGGAAGTAGAAAACAAGATTTGTAATCTTTCTAAGACTAAACGTTATTTGTTTAGCGATTTAACCGAATTTGTTGATAAGTTGCCTGATGATAACTTAAAAAACAATCTGTTTGATATTGTTTGGCAAATTCAAACACTTGATGATTGTATTTCTGAATGTATGAAAGCTGAAGACTTCTATAATTTAGATAATTTCATCTATTATGCTAAAAAAATATTAACTCAAAAAAGCGCATAAGTTATGAAAGAGTTAATAAAAATCACTGAACAAAATGGCAATCAAGTAGTGTCTATGAAAGACCTCTATACATTTTTAGAAGTTAGAGAAAATTGGACAGACTGGACTAAAAGAATGTTAGGGTATGGTTTTGACGAAAATATTGATTATGAGGCGGTTTCGGTTTTCAGACATCACCCTAACGGCATAGGTGGTACAACTGTAAAAGACTACGCCCTTACCCTTGATTGCGCCAAAGAGATTGCAATGTTGCAACGTTCTGAAAGAGGCAAAATGGCACGACAATATTTTATTGAGTGCGAAAAACAATTAAGAAGCGGTAAGTTTGCGTTACCTACTACCTATAAAGAGGCTTTACAGTCTCTTTTAGAAGAAGTAGAAGCTAAAGAGCGACTACAAGCGCAAAACGACTTACAACGTATCGAGTTGCAAAAACAATCCCCAAAGGTAGCGTATTATGAAGATGTTTTAACTTCTAAAAGCACCTACAACGCTAACCAAATTGCAAAAGAACTTGGTATGAGTGCTGTAACGCTCAATAAGAGGCTACACGAATTAAAAGTACAGTACAAGCAAGGCGGTCAATGGTTGTTGTATCATCATCACCAAGATAAAGGCTACACAAAAACGGTTACCCATACTTATACAGATAGTCAAGGCGAGACCCGTACAAGTTCCTCCACTGTTTGGACTGAAAAAGGTCGTGAGTTTATACACTCAATAATTCAATAATAATAAAGCCCCTTAATTGGGGCTTTTTTATGTTCTTATCTTAATGCCTTTCGTAGTAAGCTCATTAATGCCAGCCTTCATATTAGCAATGTCCGTCTCTATCTTGTTTAGCTTATAGGTATTAGCTTCTATACCCGCAAGATGTCTTAGTTGTTGAGCAGCATTGGTTTGCATAGATTGGTGCATTTCTCTAATGAAGTTAGCAGTCTGTAAGGCAGCATTCTTTATCTCAGCACTTAACTGGGTTTGTAACCTAAATTGTCCATTCAATTCTTCTGCACTATCCTGACTCATTCGTGCAAAACCTTTTTCAACTGCTTGGCGTTTCTCATTCAGAAAGTCAAATCCCATACTACTACTCATATTGTTCCAATCTTTTAGAAACTGTTGCATTTCACCTATTTTCCCTTTCATAGCATTTCCAAACTCACCTACAATTTGAGAAGATTGACGTGCAAAGTCTTCACTACCTCTGTTACGTTTAGCAGCTTCTGTTATTCGTTCTTGTAGTCTTTTAAAATGGTCAGCAACAAAAAGTTCATATGCAAGCTGTTTTCCTAATTTACCTATTACATTTCCTACTGTTTTAGCAAAACTTTCAAAAGCATTCTCTCCTTTTTGTAATGAATTATATACACTATCAATTATAGAATTACCTAATTCCCCAAAAGTGTTTTTTGTGTATTCTTCAAACTTTTTTTGAGCTTCTTGTGCTCTTTCATAAGCATTAACAATTTCTTGCAGTTTTTCTTTACCCGTTCCAACGAATTCTTGGTGTTCTAACAGCTTCTTTGCAAGATTTATATCCAACCCATCAACTCCTTTTGCCAATTCAGGATAAACCTTTAATATACTATTCCACTCAGTGACTGTTTTTTTCCACCAAGCCCACCCTTTTGTGTGGCTACCAGTTACTATGCCAATATTATCTAAGTCGCTAATTTTTGCACTCTCTAATGTTTTCCTCCACTCTTTTGCTTTTTCATTGATACTATTACCATAATAAGAGCTCCATCGTTCTTTATTCTGTTTTATTGTAAAATCTATTTCCTCTTTTGAATAAGCTGGTCGTATATTGACACTATTTTTAAACTCGTCCCATTGTTTTTTGTATTCTTGTAAGTATCCTATAGCATTAGACAACTCCTTAGTTCCAAAAATAGAAGAGTTTTCTTTCTGTAATAAGCGCTCTTGGTATAAGGCTTGGTTATACGATTCTTGAATTGCAAGTTTAGATTTTGATATTTGCAACATTCTATTTTCGTGTTCTAATTTAGCTTTATTTGCTATTTCATTACCGCTGGCAAATAACCCTATAAACGCACCAACAGCAGCTCCCCAACCTTTACCAACTGAATATCCCATTTGTGCAAAAGATAGTGTTTTGTTTAATATGCTACTAACCTTTTGCATATTCTCTCCTAATGACTTTAAAGCAGCATTACCTGTACTTTGTCCCAATCGCTCAAATTCTTGTCCTAACTGTCCAAATTGCCCAGTAATTGATTGAGCAGATGACAACATACCGTTGAACGCTTCCTGCCATTCAGCGGTATTGGGTTTGGCTTTGAATAGATTTTTGATGTTTGTACCGAGTTTGCCAAAAACGGTATCGCTACGCTCGGCTGTCTCTCTTGTTTGTTCGAGTTGCTGTTTGAGATTGGTTATAAACTCTACATTGGCGTTATCGTCCATATTGAGCACCTTTGCTAACTCGTCAATTTCAGATTCTGCCTCTGCTATGGTTTGGCGTATTTCCTTGACTGTCTTTTTGCGTAGGTTGTCGAACAATTTAGCAATAGCTGTACCCTCTTTTTTGTAGAGTATATCTAACTTTTTGAGTTCACGAGCCTTTTCGTCTTGCGCTTTTTTGACTTGTGGCGCATCTGCACCTAATTTAGCTTGTAGGGCGGCTATATCGGCATTGTATTTCTCCTCGATAGCTTTGCGCTGGTCGGTATAGGTTTGATACTTTTCTAACAAGTCCTTATACACTTGTTCCTGCTGCATACGTTGGTACTCAGCATTATCAGCTAAAAGCACTTTTTCATTTTCAGCAAGGCGGGCTTTTTCAGCATTGATGGCTTCGGTATTGGTGTCAAAATCTTTTCCTTTTTTCCATTTGCCTTGTGCTTCGGCTTTTTGTTTTTCGGTTTCGATGAATGCGGCTAACTGGTCTTCTGAACGCCTTCTGATTTCCTCTTCTTGCTTGTCGTATTCCAATTGTATGATAGCAAGACGTTTGTCCGCCCCGTCTTGCATTATCTTAATGCGGGATTCTTCACGAGCAAAAAGATCATCTTGGATTTGTCGGTTGTGGTCTCTTTGGGCTTTTTCGGTGTCGAACTCTGGGAGAGAGTTTTTGATGGTTTTAGATGTGTTTTTTTTGTTTTGTACGTTGTACTCACCTTTTAGCTTTTTGTCTACCAACTCTTTTTCCTCTTGGAGTTTTTTTAATTCATCTTTATCATTTTGTGTTTGTCCTTCCTTAGACTGTATCGCACTTATTTTACTTGTTAATTCCTTTTGTTTAGCAAGTAGTTCATTACGTGTATCTAATATTTTATTTCGTTGTTTATCAGCTTCAGCTAATTGATTTGATTTACTAATAATTAGACCTAAATCGGCATCTGAAAAGTTTTCATAACCAGTAAAACCAAAAGGTGTTGTTTTTCGCAAATCCTGTTTTTCACCCTCCCTTAAAAGGTTTTGATTTTTGGCTTGTTTAGCATTGTGCCTACGATTGTATTCCTCAATCATCAGCTTGCGCTCTTTTGCTTGTTCATCAGCTGACAATTCTGCTATATCATCAATTCGTTTTAAAGTAGATTGCCAAGCGTGCTGTTTGGTAGCTTTGTCAATCTGAATATCAATAGCTTGTATTTGTTCTTTGTAATTGACTATATCCACTGGATTAGCTGCCCCACTTAACCTTGCTTGCAGCCCTCTCTTTTGTACTTGCAAACGCTCAATATAGTCTTTATCCATTTTTAGGTCTTTTTCTTTTTGAGCATTACTAACTTCTTTAAGAGCTTTGGCAATGTTGCGAATAAGTTCCTCTTCTGTTTTGTATTTGCTGAATATATCAGGATATATATCTTTTAACCTATTGAGAGCGTTTATACGTTGTCCTTTGGCTGCATTTTCGTCTTTTACTACCTCAATGAGTTTTTCAATTTCATTACGTTCATCTTCTAATACCTTTTTTTGGCGTTCTTGTTCGTCGTTGAATGCTTTTTGTGCTTGTTCGGCTACTGTTGCTTCTTTTTTAAACATTACCATATAGGAAACCAACCCTACCAATGCAGCAGCGGCAAGGGCATAAGGATTAGCAAGCATTGTAAGATTAAGGAGTTTTTGGGCTTTTTCAACAAGTACCAACCACGTATAATGAGCCATTTCGGCAACGGTCATTCCTGCTGTACGCGCTGCTACTACTTGTTGTACGGCGGCTGTAGCAATGAGTGCTGCTCGATATGTTCCATAAGAAACAATAAGCCCCGCGATGAGTTTGCCAATAGTTTCATAGTTTTCTACCAAAAAAGATACGCCTTTTATTGCTCCCGAAACAATACCCTCGCTCGCTTTGCCTATCTCATTAAGCATTTGGTCAAAATTATCTTTGAGGTTGGATATTTGCCCGCCTAATGATTTGCTTTGCTCTGCCATTAGATTATAGAATAGACCGCCCTCATTAGTCATATTCTTAATAACGGCTTGTATTTCGGCAAAACCTATTTTGCCCGCACTAACCATATCTTTGATTTCGGTTTCGCTCTTACCTACAACCTTACTCAATTCAGCTATGATAGGAATACCTGCATTCATAAACTGGTATAGGTCATTGGTCATTAACTTGCCTTGTGCTTTGACTTGCCCATATACGTGAATGAGTTGCCCCATAGGTACACCTAATCCTGAAGCTACATCGCCCATACGACGGAGGGTTTCGGTTACCTCTTCAGCGGGTACTTGAAAGGCTAATAGCTTTTTTGCTCCCTCAGATACTTCTTCCAATCCGAAAGGGGTTTTAGCAGCAAGGTCGGTGAGTTGCGCCATTAGTTCGTTGGCTTTCTCCTTGCTTTTGAGCATAGTGCCAAAAGATATTTCGAGTTGCTGAAATTGTGATCGTACGGCTACCATTTGGCTAATGAATGATTGCGCGCCTTGTAGTGTAAAATAGGCGGTTGCACCTTTGAGAAGGGTTTGCCATACATCGGCTTGCTTTTTGCCTTCTTCAACGGCTTTGCGTGTCATTTGCTCGAATTGCTTTTTGATAGCCTCGACATCTTTTTGTATCTGTGATTGGTCGGCTCTTACTTGGAATAATAGAGCCCCGTCTTGTGGTTGCATAGTTAAATATTTGCGGATTTTATTTTTGATAGGAAATCACCATAGCTGGTGCGTTTTTCTGATTTCTGAGGTGCTTTTTTAGTATCTTTATCCTTATCGAAATCATAAGAGGGGATAACGGCACTATAAAGCATTACATTGGCATAGCTTATTTCTTTTAGCACGTAGTCAAAGGTTAGCCCGTACTGCTTGGCGAAAGAGCCTACAAGTCCCCAGATGCTGTCGTTTCGTTCTCCACTTCCTTCGTCGGCTTGGTTATCATCATTCCTTTGAGGGAAGTGGTAATGACGAAAAAAGCGCGTATATCTATTTGCACTAACACTTTAAAGAATGCGGATGACACTTCGGTAATGGGGGTATTAATGAGTTTTTTTGCCAGTATTTCGCCTTTGGTTACGTTCTTTTCTCTTCTCCAAAACTGCCATTTAGGATAAGTAACTATTTCAGTAAATTTTTTGCCTAATAGGATTGCAGATATAGCCCACGCTATATTCTCATATTCTTCGGCATTGTGTATGATTGATCCGAATATATTCCCCTCACTAATAGTGTCGGTGGGTATTTTGCTGATGTACTTTGAAGCCCTTACGAGGGTAAAAATAGAGGGCGGAGCGACTTTATACGCTTCGCCCCCAATGGTTACCGTTGTAGGTTCTTCAAGTAGGGTTTGTGCTACTTGTTCTTCCATAGGTTACGCTACTTTTTCAATTGATAAAAATCCTTTACCACCATTAAGGATAGTGATTTCTACTTCTACATTGTAGCCACTATCTTCTGTGTAGGTGAGTTTACCACTTACAGAGCAGTAGAACATATCAATCTTTTCTGCTCCTGACACTTTTGGAACAATAGAAAAAGAAAACTTCTTAGTAGAGACAAAAGACTTGATAATGAGTTTGTCTCCTGACTCTTCAATATCCCAAATTTCAGAAAGCAATGCCTTGTTAAGGTTTTTAACGGTGCATTTTACTTTCACTACAGGTTCGCCTTTCATTTGGTCGATGATTTCACCGCCAATGGCTGTCCATTTTAACTCTTTACCGTCCTCTGTCTCAAAAGAAAAACTATCTTCTTTGACAATACCTAACGTTTTGAGTACTGTACCCATTGCACCTCCTGCCCCTGGCGCACCAAATTTAAATTCTAATTTGCCCCAAGCGGTGGCGTTGTTATCTATAAATGCCATAATCTTTAATTATTAAATGTGTTATATCTGAATTTTACTTTTGCGTTGATGAAAAACTGCTTAATATCTGTGTCCTCAAAGGTTTGTATCATCTGATGAAGTTGCAACTTGTAATTGTGTAGGGATGTTTTAGCTTCTTCAATGATAGGCATTAAAGCACCCTCGATAGCTTCACAACGTACAAAGTTTTTCCTATACTGATTATCGTTATTTTTGACCGTAGGGACAAAGATATTGATGTTAATCACCCCCGTTTGATATTGACCGTCTAACCCAGTAAGGAATGATATTACACAATCCTCTTTCAGGGAGTTCAAAGGGCGTACACCACTACGGTAGGTTTGCCCATTGATAAGGGGATTTATCTTATCCTTAAAGTATTTGTATATGTCGGCTTCTATTTGTGAGGCTGTTTTTTTCATTGCGATAATGCTTTTAGGAGTTTAGGCACTTCTTTTTCGGCTAATAATTCAGCTGATGAAAGTACATTGTAATTGCGTGCTTCTACATAAGCAGCGTACTTCATTCCTGCTACTACTACCAGTACAAAACCTTTTGGGTATTGAGATATTACCTTATTGATGAACGTTTCGCCCTCTTTTTGTCCATTACCTCCTGACTTAGTGATTTTAAAACCTCCTTTTTCAATAGCTTTGCCGTCTTGTAGTACTACATAGCCTATTGATGAACGAAGGTTACCCGTTTGGTCTTGATAGCTACCATTTGTCCGTGCTTCATTGATACACATTTCTCCTACATACTTCAATATGCGTATTACTTTTTGGTGATACTTTTCTATTTTCTCACGCAATATACGTTCTATATCGTTGGAATTGAATTGTGGTGTTATCATACGAATATACGGCAATGAAAGTAATCTCTTGAAAATCGTATTACTTGCTTTTCGAGGCGAATATTTCCCTCTACATCTACTACTTGCAAGGTAGTACCCGCTTCTATTTTGGGAGTATCTTTAGGAGCATAGATAGTAGCGGTACATTCAAATATTTGACCATCTACTTTGCTTATCTTTTGCCCTGCTCCTGCTATCTCATCACGACATACGCCTATCTCTTGCCACTCAATAGGGTCGCTTGGATAGGTAGGTATACCATTTTCATCAATAGTAGGGTTTTGTGATACTTTCACCTTTAATAGGTACGGGTATATTTTCATTTCCTTGCAGTATTTTAGAATAAGTTGGTAATATCTCTTACAGTGGCTTTGACTTCTAACAAATTATCTCTACCGAGTTGCTTACAAAGGAGATTGTAAAAAGCGGTTATAGCTGATTTGTCGTAAGAGAAAGATAAACCACCTTCAGAAAAGGACACTGGGCGCAATAAGAGTTCAGGAATGAGGTTGTAGAAAAACAATTTTGTCTTTCGTTCGTTCTCCTCATTGAACTCATCAGAAAGCCCCAATCCTACTCGTTGCATTTTGGCAATGAGTAGGGTGGTGGGGTATTCCACGTTCCATAGTTTCAGTTTCTCATCTATGTACGCTTGTGCGGTCATCTTAGAACTTTGTTTTGATGATGAGTTTGCGCTTAGAGTCGTTCAATACTGGAGTAGCGAACGCTGTAGCTTTTGTAGATACTGATATAGGGTCTTGATGCCCAAAAGTATTTACCAAAATGAAGCTATCCTTAATAGATTTGCTCATCACATCGGCAAAGTCCATTGTGAACTCGGTGGTAGTGGTGTATTGAGTAGTACCCAACAATGCTGAAGTGGAGAATAATATGTTACCCTCTTCCCAACCATTAGCCACAGTTACTTCTCCGTTTTTGCCCTCAAAGCTGATAAAAGACTCCCATACTTTGATAATAGGCAATCCATGTTCAGCAAGTTCGGCATTAAGTTGATCCAAACGTACATCAGGCAAAATGGTAGTAGCGTTGATAGGAACGCCTAACACAAAAGCACGTGTGTTTTTGTTTTTCAATACCTGATTGAGAGTTGCACGGCTCATAGTAATAGTAGCATAACTATACCCTTTGCCTTTGGCTTCCTCTTGGTATTTTTCGATTTCCTCTATAGGGTTAGCATCAGCATCTGCCCATTTCTTGAGTGCGTTTTGTGTTTTTACTTTGAAGTCTACCGATACATTCACCACTCCACCATTATTGGTAGCGGTAGTTTTGTATTTACCAGTAGATACAAGTTGTTTAGCCATCCACTCCATACGAGCATTGATACCGTCAATACAAAAACGAGGGTCTTCGTATATCTTATCAATAAGCTGGTTTTTGATACCTGCATTAGTAGGGTTCGCATTTACCGCATAACGGAGTTGCTGAATGGTTAGGAGGTCTTTTTCGTTCAAATCGCGGGCGATTTCTACTTTTGGTATTTCGCCTTTGATGTTTTCCACGAACTCGCGCCCTTTGCGTGGTGCTTTTGAGCCAATAGCCACGATGTCCGCCATTATTTTAGCACCATCAGCCCCTTCGATATTAGAATAGGTAAGATAAGGATTGAACTTCAATGGGAAATATTCGCGGTAGCGCAAATCTCCCAAAGGGTACGCTTGAATAATAGCATTCATATTAGCCTGAGAGAACTCGGTAATAATGTTGTTTGCGTTGATATTCATCTGCTTTTAATTTTTAAGTTATTAAATGAATGAGATACGAGGCAAAGCGGTGCGTAGGAATGCCACGCCTGCTTTTTCTTTGTCGGGTAGCGCGTCTTTGCGTGCTGTTCCTGCCATAACGACTGCTACAAGTGGCATATCGTCAATGACTACATCGTGAGCGGTTAACCCCAATGCTCCTGCGGTATTGGATTGTGAAAGGTCTTCTTTTACAACCTTGAAAGTACCATTAGTGTCAGGCATTACAAGCGTTCCTGCAGGAATAACTCCATCGGTAAAGCGAGCCTTAGCAGTGGTAGGGTCTATATATACCCCGCCAGGGTAGGTAACATCCAACTGGTCAAATACGACTATTTGGCGACCTGCTTTTTCTGAAATTTTAACTTCGTTCATAAGTGTTTACTGTTTATTGAAAGTTTCATTAATATACGCTTGTACATCTGCTGATACGCCATTAGCATCTGTACCACCTCCTATAATAGGTCTTGAGTGTGAAGAAAGCCCTGCATTAGCTTGGGTCTGTAAAAACGCTTGTTCATCGGCTTTGAGTTCATTTACAAAGGCATCCATTTCGGTATCGTCTTTGAAAGTTCGCCCTAAGTGGTGTTTGTAGAATGTTTCTGATACCCCCTGCGTTTTGAGTTGGTTTAGGAAACGTTCTTTAGCACTTTGCTGCTGCTTTTCAGCTTGAAAGGCAGCAATGGTTTCATTTTGTTTTTTTACAGATTCCAAAAGATCCTTTGCCCACTCTGGCACTTCATCAGGTTTAGGATCTGTGGAAGGAGTAGGTGGGTTTTGAGGATTTGGATTAGATTTAGCCCTCATTTCTTCGAGTTCTTTCTCTAATTTTTTGCGAGCCTCTTCAGCTTTTGTAAGGCTGGTTCGCCCTTTGTCGGCTACTGATTGCAATAGCTTAACTTCTTCCTCAACTCCTTTGACGGCGTTTTCGATTTCGCTTTCTTCTTTAACCGCTGTAGCCAATCGGGTGGCTATAGCTTTTAAAACTGACTCTTCCAACCCCAAGTGCGCATACTTGGTTTTGAGTGATTGTAATAATTTATCTACCATAGATGTACAATATTTTTTGTTTTTGCAAAGGTACGGAGGGGCGTTGTAGATTGTATATTTGCGATTTAGGAAAAAGTTAGTAATTATTTAGTAATACAAAAACGCCCCTATAAAGAGGCGTTTTCGGTGTTAAACTAAGAATATATTCACTTTAAAAAGCGTTTAAGTTTGTTTCGTATAAAGTAAAAGGCTATCAATAGTACTACGATAATAGCTATAAGGTATAAATAGGAACTTTTTACGTTTTTTGTTTTATGAGAAAAAGCCGTTTCCGAGTGCCTTTGTGCTATAAAATAAGTGTTAGCCTTAGTTATATTATCAAGAGTAGTATTCGCCACTATTTGGCTATTGGATAGGTTGCTTTTAGTCGTAATCTTCACCTTTCCACCACTTACCCTTATAGTTTCATTATCGCCGTCGCGAATGCGATAATATACTAACTCTTTGCTATTACCCACGATATCCTTATCGCTCTCTACTGTTACCTCGTACTCTTGTGATGCGTGTGTATCGAGTTGCAAGGTTTGAGTATTTTGTTGAAAAAAAGCCATACTATCCTTGTACTTTATAATACGCTCTTTTTGTACCTGCTTTTGCTCGGTAGCAGCTACCTCTTTGCGTGTCCTGCAACCTATCAAGGTGAGAAACGCTAATAATAACATTACTATTCTACTCATAACTCTCTAACATTTTGATAATTTTCTTTAAACTATCTGCATAGTTAGTAGCGGTAGCATACCCTGCTTTTGCTACTTCTTCGGCAAACTTGTAAGGGTCAGCTTTCACAAGCAACGCTTTAGCATATCGCTTGTTTCTGAAGAATAATTCTGCGTGGTCAGTAAAACATTCTTCGGGCGTGTCGTACTTTCTGAACCAGTCTTTAACTTCATACTTGTATTTACCACTCGGTAATTGATATATAGACATCACTTGAGGGAACTTATATCCTAAGTTTGGAGCATTAAGTACTTCTGTAGTTTTAAACAATTGTTTTTTGTTAGCAGGCGTGTTGCTAACAAGGTTTTTAGGTACTTTTATACCAAAAAAATTATTACCAACGCCACGTTCTCCCCAACCACTTTCCAATGCAGCTTGCGCCAAGATGAAGAGGTGAGAGATACCCGTTTTTTTTTCGCTTTCCAAAGCAAAAGGTTTGTACTGTTTTACAAATTCTTTTGGTGTCATTGTTATTCGTTATTAGAGGTTTGAGATGTTTCGGACTGTTCAGCTTTTTCATTCATATAATTAGAGATGGTTTTAGCGACTTCCTCCAAGTTCTCACGATTGATAAACACTTGCTGAACAACTTGTCCTGCGCGGTCAAACCGCACTTTGTCTTCGGCTTTTTCGCGTATTGATTTGATTTCTATCAGGCATAGTACTATTGCCATAAAGAAAGTGATAAAAGGAAATAGCCACAATGAGGTTTGGTAATAGATTTCTAAGTACCAAGATAGCAAGCCGTACATACTATCCACAATCGTACAAGCAATCAGGATGTTGTAGTATTGCGCCATCTTGCTAATGGTACGCCTATATCCGTAGGAGTTGCGCGCAATACCCAAACGTTTAGCCTTGCGCACACCACTCCAAAGGTCGGCGAATATCATAAGGAGTACGAGAATGTAGATACCGAGTAGTATCCATAGGATTACAAAGATTTTTTCCATTGATTACTGTTTGTTTTTAGGTTTTTCGGGTTTTTCTGTGCCGTTGATGATAGCGGTACAGTTTTCTTGAATTTGTTTGTATAGTTCAATGTCTGAGGGTTGGAAGTTTGAGTTTTGGACATTGAAGTCGTTAGCAGTAACTGTACCTTGTAAATAGGGATAACCACCATCTTGCTGACGAGTTGCTGAAAATGCTACGGCAATAGGATTGGTTTCATTTTCAAATTCATAGGAGTACATAACAGTTACTCCTTGCACAATTTCTTGTGCAGTAATTCGGGTTGTTTTTTGAATGATTTGCATATTAATTGAATTATTAATGATTTATAATTTTTTTTACTTTTATGATTGCTTAGCGATTCATAAATATTATCCTGTATATTCTATATTTGTAATTATGCCATTAACTATAGTTATTCTGTGATTGCTAATTTCGTGAGTATTTGAATATCCTTTTTGTCCATGCATTCTTATTTCACCATAGAAGTTTGTATTTCCTCTTATTTTAACATCTCCCTCTAACACATCAAGTGCTACAGATTCTTCTTGCCCCCATATTGCTTTTAATATAAGGGCAGTGCTTTTTCGCCCTCCTCTACTTTCTAACTTCATTGCGGAATGTGTAGTGTCGCCAAATCCTGATGTATATACATCTATAGCTGATTTATCTACTGTTTTAAATATTTCAGGGTCATTTATTCGCACTTGTGTTGTACGGCTGTCTTTTCCTCTACCCATAGCCCTTATAAGACCTTCTGATGCAATAGTAAGTCCGTTTGCTTTTAAAGAAGTTTCGCTTGCACTTTCTATTTTAAAATTTCCTATTTGTCCCTTTGAAGCATATATACTTCCATCATCTTGTACTCTAAAAGGGGCTCTTTCTTTGTCTCTATAGTTAGCACCAGCAAAGAAACGAATGGACTCACCCGATAACCCCGCCCCATTAATACCTGCGTTGCCTCCTAATGTATTTCCAACAGTTAAAGCACCAGTAGTAATTGTATTTTTTACTACTTCTGTACCATTGGTATAATCAGCACCTTTGCTAAACATACCATTGATAAACTTAACATTTGCTTTTTCGGCTTCTGTGAGGTTCATTGCGTTTTTATCAATGATACCTAAATCTACCATTGTATCCCATACATCTTCAGGAGCTGGTGTCCAGTCGGTGGGTTTGTTTCCACGTTCAAGTTTAGCCCATTCTATGGTATTATTAGTACCGTTTCCACTTAATACATATATAATTAAATTAGACGGAGATTCATTTGTTTTCCAAAAAAAAGAACTTGTAAACACTCCATTTATTTTAGTAAGAGAAGTTAGATAAGTACCTCCATTATATACCTCAATCCAGCTGCCTGCATATCCAAATTGCGCTTTTATAGACAGAGTTACTAAATCTCCTTCTTTTAGTTTTTCTGTTAAGTAGTATGATGCTATATTGTAATTATTGTTGGTGATTTTTTGACTGCTATTGCGTAATAGATTCCTTCCCCCAATATTCAACTCATTCACTTTTTGCTGTGCAAATGTTTTAGTCTCTTGGAGTTTCAATTGGAGTTGTTGTATTTGTCTTTGCTCTGCTTCTGTAATTTTGCCGTCGGCTGCTGCTATAGCTTGTGTTTTGGTGAGTTCTGATTGTGCTCGTGCGTATGCTTCGGTAGCGGTTTTTGCAGTAGCAATAGCTTGTGTACGGGCTTGCTGCTCTCCTTGTACCTGCTGATTGCTGTATTGTTTCAATCTATTCTCCAATGAAAGCAAATCAGGGCTTACAAGTTGTTTTATCTCGGTTTTATTACCATCTGTTATTTTAAGATTTGCTTTGATTTCTATATGGTCATCAAAGAGATGTATATACTGCTGTCCGTTCCCTGATGTTATTTTGTCAGTTTTGATTTGTCCACCAGTGATTTCTGTAAATCCATTGAGTTTAGCAATACCACGATCTCCTTCATACTCTGAATTGACAGTGGCGTATAGGAAATGATAATATCCTGCTTCTTGTTCTATATCTATTTTGTTTTCGGATAGAATAAATTCAGCGGTTTCATCGTTTTTGCTTGCTTTAATATATAGGTAGTAGGTTTTGGCTTTATCGTCCAAACGCCCTGATACGAAAGCAGGAATATTCCAATACTTATAGCTATTAGCATCACGATTAGGATTTATATCAGTAGTACCAAGGGTGAAATGCTTGAGACAGCCGCTACCTGCATTGATTTGCTTGGTGTTTTTATCGAAATAGAGTGTGTGAGGTGCTTTTATAGGGTTTGTTTTTGAGACTACAAAATCGAACTGGGTGGACTTGTTGCCTATAAGTGCCATCATTGTTTGTACGGTGGCAGGGACGATGCTTTTGGTATACTCAGGAAAGGCTGCTTCTATTTGCTTGATAGTTTCTTGAGCATCACGCCAGCTTCTTTTGGTTAATGATTGTGTGCGCTTGTTGAGTTCTCCAAAATATACTTCTTGATTTTGGAGTTTGCGCATTTCGGAGGCAAAAGAGTGTCCTTGTACTTTGTTGGATAGCTCTATTTGTGGGCTGTAGGGGTTATTTACATACTCTTTTAGTCCTACAATGCGAATAGGCACGGGGGTGCGCTGAAATTCTGTGTCTGAAAAGTTAATATAAGCACCCATTTTGAGGCGACCTCCTACATTAGCCCAGTTCTTTTTAGCCCATATACCATCTAAATCACCAGTAAATGTAAAGAGGTCGGTGCGGTTTTCGTATAGGTATTTGCACGCTTCTTTCATCATCTCCCAGCTTGCACCTGATTTTGTAGCGTTGTCGCTGATGTAGGCGTTAGGCATTTGCATATTGTATACGGAATATTCGTCGCCTATATTGGGGCGGAATATATCGTTAGGCATAGTAACACCGTCTTCTTCTTTAGGAACAAGTTGGAATCGTTTTTCGGCGTGGTTATAGTTGGACACTTCAAACTCGCGCCCTGATAGCATACCGCTTTCAAAGTAGATAAGCATTTTTTCGCCTTTGATTTGCATTGCATTGAAATCGAGGGCTTGTGGTATGGAATCGTCAAATATATCGTAGAAGTGTTTATCTATATCTACTGCAAAAAAACCTGATACCGTACCTTTGCGTTTGGGGTATATGTGTGAGAGGTCGAGGCTTTGCTCATTTACAAATCCGTTATTTTGGGCGTTCTTGATTGTTATCGATAGCCCTTTGTCATCTGAAACGAATGTTACCCCTTCGTATGTGTATTCTTGTGATTTAGGTAGTAACAATTCTTTATTGCCATACTTGGAACGATCAATATTACGGTCGCCTCCTTGTGCATATAAGCGAGTAATACGACTTTGTTCGGTAGTGCGACTTACACCCGTTTTGAAGCCTTTTCCCTTGCCGTATTGAAGTGGTAGGGGATTGTTTTTAAAGTACTCTACCTTATGCAAATGAATGGTTTTACCTATGATTTCGTATTCGGTTTCAAAGGCTTTGGCTATCATTTCCAATGCTTCGAGGCAGTTGTTATGGTTGTAAGAAACGAGTTTTTCAGAGGCTTCTATACAATTACCTACTTGCCAACCGCTATCTATCATATTGAGGCAATCGACAAGGATTTGCACGTGATAGCGAGGTGAGGCGGTGAAAGGGAATTTTAGGGTTTTATCGTTGGGGTTGCGAAATTTGTAGTTTTTGAGGTTTGCGCCCTCGCTGTCCATAGTGAGGGTGTATTCAAAATTTCGAGTGTTATGTTTTACGATTTTAGCAGGTTGGTTAAGAGTATAACGCTCATTAGCAAACTCGCACCACGCACCAGTTGGAATGTCGGTATAGGTGGATAGTGAGAAATATAAGGTAAGCGTATGCTCGCCCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP022383|1410304:1428953|1413497_1414226_+|WP_095901241.1|DBSCAN-SWA MKELIKITEQNGNQVVSMKDLYTFLEVRENWTDWTKRMLGYGFDENIDYEAVSVFRHHPNGIGGTTVKDYALTLDCAKEIAMLQRSERGKMARQYFIECEKQLRSGKFALPTTYKEALQSLLEEVEAKERLQAQNDLQRIELQKQSPKVAYYEDVLTSKSTYNANQIAKELGMSAVTLNKRLHELKVQYKQGGQWLLYHHHQDKGYTKTVTHTYTDSQGETRTSSSTVWTEKGREFIHSIIQ >NZ_CP022383|1410304:1428953|1414253_1418579_-|WP_095901242.1|tail|DBSCAN-SWA MQPQDGALLFQVRADQSQIQKDVEAIKKQFEQMTRKAVEEGKKQADVWQTLLKGATAYFTLQGAQSFISQMVAVRSQFQQLEISFGTMLKSKEKANELMAQLTDLAAKTPFGLEEVSEGAKKLLAFQVPAEEVTETLRRMGDVASGLGVPMGQLIHVYGQVKAQGKLMTNDLYQFMNAGIPIIAELSKVVGKSETEIKDMVSAGKIGFAEIQAVIKNMTNEGGLFYNLMAEQSKSLGGQISNLKDNFDQMLNEIGKASEGIVSGAIKGVSFLVENYETIGKLIAGLIVSYGTYRAALIATAAVQQVVAARTAGMTVAEMAHYTWLVLVEKAQKLLNLTMLANPYALAAAALVGLVSYMVMFKKEATVAEQAQKAFNDEQERQKKVLEDERNEIEKLIEVVKDENAAKGQRINALNRLKDIYPDIFSKYKTEEELIRNIAKALKEVSNAQKEKDLKMDKDYIERLQVQKRGLQARLSGAANPVDIVNYKEQIQAIDIQIDKATKQHAWQSTLKRIDDIAELSADEQAKERKLMIEEYNRRHNAKQAKNQNLLREGEKQDLRKTTPFGFTGYENFSDADLGLIISKSNQLAEADKQRNKILDTRNELLAKQKELTSKISAIQSKEGQTQNDKDELKKLQEEKELVDKKLKGEYNVQNKKNTSKTIKNSLPEFDTEKAQRDHNRQIQDDLFAREESRIKIMQDGADKRLAIIQLEYDKQEEEIRRRSEDQLAAFIETEKQKAEAQGKWKKGKDFDTNTEAINAEKARLAENEKVLLADNAEYQRMQQEQVYKDLLEKYQTYTDQRKAIEEKYNADIAALQAKLGADAPQVKKAQDEKARELKKLDILYKKEGTAIAKLFDNLRKKTVKEIRQTIAEAESEIDELAKVLNMDDNANVEFITNLKQQLEQTRETAERSDTVFGKLGTNIKNLFKAKPNTAEWQEAFNGMLSSAQSITGQFGQLGQEFERLGQSTGNAALKSLGENMQKVSSILNKTLSFAQMGYSVGKGWGAAVGAFIGLFASGNEIANKAKLEHENRMLQISKSKLAIQESYNQALYQERLLQKENSSIFGTKELSNAIGYLQEYKKQWDEFKNSVNIRPAYSKEEIDFTIKQNKERWSSYYGNSINEKAKEWRKTLESAKISDLDNIGIVTGSHTKGWAWWKKTVTEWNSILKVYPELAKGVDGLDINLAKKLLEHQEFVGTGKEKLQEIVNAYERAQEAQKKFEEYTKNTFGELGNSIIDSVYNSLQKGENAFESFAKTVGNVIGKLGKQLAYELFVADHFKRLQERITEAAKRNRGSEDFARQSSQIVGEFGNAMKGKIGEMQQFLKDWNNMSSSMGFDFLNEKRQAVEKGFARMSQDSAEELNGQFRLQTQLSAEIKNAALQTANFIREMHQSMQTNAAQQLRHLAGIEANTYKLNKIETDIANMKAGINELTTKGIKIRT >NZ_CP022383|1410304:1428953|1412327_1412822_+|WP_157909497.1|DBSCAN-SWA MKRIILLLIALLAMGCSKEEKEEDFSQYKLNVPDWLIGDYEYSSWGITYDFGFSKNNYYFSHEDKRNFFEMFKSRLVKEGEYSYWNYKVYYFISYATQTKKYFKYTFEMKDKKCFFEFNGTTYNLCNEENKNDRDIRRIYEEVTEYGATIKKIYDDEYTYKKVK >NZ_CP022383|1410304:1428953|1413131_1413494_+|WP_095901240.1|DBSCAN-SWA MSNQIKNVSMVNKSSCKNTPSSAKTAHTSLLEILPKVEHIGMEVENKICNLSKTKRYLFSDLTEFVDKLPDDNLKNNLFDIVWQIQTLDDCISECMKAEDFYNLDNFIYYAKKILTQKSA >NZ_CP022383|1410304:1428953|1410304_1411363_-|WP_095901238.1|DBSCAN-SWA MKVNHNQLTLAREYRGWTQTKLSKAVQGLSQSNLSKFEKGLGGLSDEILEKIFNVLQFPKGFFERKISVELETANYRKKNTIPKSIIQDFETSCTFIGYIIDEMSNSIDYPDFSLKTLDIEEGYTPEEIAQFTRKDFRIFDNEPIEDIFRIIEDKGIIIYELNANEKFDGISLFTKKGFPVIVLNKKLPNDRKRFTLAHELGHLIMHTAFPVPNIRDKEQEANDFASEFLMPERAIRNSLEGLKLSSLSALKSYWLTSKASIIRRAYSLGVINQNKYKYFNIELSRIGEKKNEKGSVSIDEAITFDMAVKLHLKELEYTYDDLANAFDLPEDVIQRYVLKQNLFLKPKLTVN >NZ_CP022383|1410304:1428953|1422384_1422756_-|WP_095901250.1|DBSCAN-SWA MNEVKISEKAGRQIVVFDQLDVTYPGGVYIDPTTAKARFTDGVIPAGTLVMPDTNGTFKVVKEDLSQSNTAGALGLTAHDVVIDDMPLVAVVMAGTARKDALPDKEKAGVAFLRTALPRISFI >NZ_CP022383|1410304:1428953|1418581_1418743_-|WP_157909498.1|DBSCAN-SWA MLKEISYANVMLYSAVIPSYDFDKDKDTKKAPQKSEKRTSYGDFLSKIKSANI >NZ_CP022383|1410304:1428953|1421283_1422363_-|WP_095901249.1|capsid|DBSCAN-SWA MNINANNIITEFSQANMNAIIQAYPLGDLRYREYFPLKFNPYLTYSNIEGADGAKIMADIVAIGSKAPRKGREFVENIKGEIPKVEIARDLNEKDLLTIQQLRYAVNANPTNAGIKNQLIDKIYEDPRFCIDGINARMEWMAKQLVSTGKYKTTATNNGGVVNVSVDFKVKTQNALKKWADADANPIEEIEKYQEEAKGKGYSYATITMSRATLNQVLKNKNTRAFVLGVPINATTILPDVRLDQLNAELAEHGLPIIKVWESFISFEGKNGEVTVANGWEEGNILFSTSALLGTTQYTTTTEFTMDFADVMSKSIKDSFILVNTFGHQDPISVSTKATAFATPVLNDSKRKLIIKTKF >NZ_CP022383|1410304:1428953|1419270_1419765_-|WP_095901244.1|DBSCAN-SWA MAFIDNNATAWGKLEFKFGAPGAGGAMGTVLKTLGIVKEDSFSFETEDGKELKWTAIGGEIIDQMKGEPVVKVKCTVKNLNKALLSEIWDIEESGDKLIIKSFVSTKKFSFSIVPKVSGAEKIDMFYCSVSGKLTYTEDSGYNVEVEITILNGGKGFLSIEKVA >NZ_CP022383|1410304:1428953|1418781_1419267_-|WP_095901243.1|DBSCAN-SWA MEEQVAQTLLEEPTTVTIGGEAYKVAPPSIFTLVRASKYISKIPTDTISEGNIFGSIIHNAEEYENIAWAISAILLGKKFTEIVTYPKWQFWRREKNVTKGEILAKKLINTPITEVSSAFFKVLVQIDIRAFFVITTSLKGMMITKPTKEVENETTASGDL >NZ_CP022383|1410304:1428953|1420614_1420941_-|WP_095901247.1|DBSCAN-SWA MKIYPYLLKVKVSQNPTIDENGIPTYPSDPIEWQEIGVCRDEIAGAGQKISKVDGQIFECTATIYAPKDTPKIEAGTTLQVVDVEGNIRLEKQVIRFSRDYFHCRIFV >NZ_CP022383|1410304:1428953|1420183_1420618_-|WP_095901246.1|DBSCAN-SWA MITPQFNSNDIERILREKIEKYHQKVIRILKYVGEMCINEARTNGSYQDQTGNLRSSIGYVVLQDGKAIEKGGFKITKSGGNGQKEGETFINKVISQYPKGFVLVVVAGMKYAAYVEARNYNVLSSAELLAEKEVPKLLKALSQ >NZ_CP022383|1410304:1428953|1424100_1424616_-|WP_095901253.1|DBSCAN-SWA MTPKEFVKQYKPFALESEKKTGISHLFILAQAALESGWGERGVGNNFFGIKVPKNLVSNTPANKKQLFKTTEVLNAPNLGYKFPQVMSIYQLPSGKYKYEVKDWFRKYDTPEECFTDHAELFFRNKRYAKALLVKADPYKFAEEVAKAGYATATNYADSLKKIIKMLESYE >NZ_CP022383|1410304:1428953|1422762_1423431_-|WP_095901251.1|DBSCAN-SWA MVDKLLQSLKTKYAHLGLEESVLKAIATRLATAVKEESEIENAVKGVEEEVKLLQSVADKGRTSLTKAEEARKKLEKELEEMRAKSNPNPQNPPTPSTDPKPDEVPEWAKDLLESVKKQNETIAAFQAEKQQQSAKERFLNQLKTQGVSETFYKHHLGRTFKDDTEMDAFVNELKADEQAFLQTQANAGLSSHSRPIIGGGTDANGVSADVQAYINETFNKQ >NZ_CP022383|1410304:1428953|1419770_1420187_-|WP_095901245.1|DBSCAN-SWA MKKTASQIEADIYKYFKDKINPLINGQTYRSGVRPLNSLKEDCVISFLTGLDGQYQTGVININIFVPTVKNNDNQYRKNFVRCEAIEGALMPIIEEAKTSLHNYKLQLHQMIQTFEDTDIKQFFINAKVKFRYNTFNN >NZ_CP022383|1410304:1428953|1420955_1421282_-|WP_095901248.1|DBSCAN-SWA MTAQAYIDEKLKLWNVEYPTTLLIAKMQRVGLGLSDEFNEENERKTKLFFYNLIPELLLRPVSFSEGGLSFSYDKSAITAFYNLLCKQLGRDNLLEVKATVRDITNLF >NZ_CP022383|1410304:1428953|1411432_1412071_-|WP_095902413.1|DBSCAN-SWA MIINNYLPATKLQTFVQSCKRKLAKKDTFIIECTEKGLIKFFNAFENAKEKFSLTMKLFPPQSRSRGFEASVFQTCLLGELQKVFPEKWKFWKYKRFVITFCGHSFLFKKLNKKGMPMNIKTNANESIINQMQTQLFDPTDYENPIVFFGWEKSKSGDLINPHFVYIDEEKIKWRLRKDELTSLNAPTILTPNKTGRLLPKVKEQSKRKKAI >NZ_CP022383|1410304:1428953|1423577_1424108_-|WP_095901252.1|DBSCAN-SWA MSRIVMLLLAFLTLIGCRTRKEVAATEQKQVQKERIIKYKDSMAFFQQNTQTLQLDTHASQEYEVTVESDKDIVGNSKELVYYRIRDGDNETIRVSGGKVKITTKSNLSNSQIVANTTLDNITKANTYFIAQRHSETAFSHKTKNVKSSYLYLIAIIVVLLIAFYFIRNKLKRFLK >NZ_CP022383|1410304:1428953|1425104_1425404_-|WP_095901255.1|DBSCAN-SWA MQIIQKTTRITAQEIVQGVTVMYSYEFENETNPIAVAFSATRQQDGGYPYLQGTVTANDFNVQNSNFQPSDIELYKQIQENCTAIINGTEKPEKPKNKQ >NZ_CP022383|1410304:1428953|1425470_1428953_-|WP_095902414.1|DBSCAN-SWA MGEHTLTLYFSLSTYTDIPTGAWCEFANERYTLNQPAKIVKHNTRNFEYTLTMDSEGANLKNYKFRNPNDKTLKFPFTASPRYHVQILVDCLNMIDSGWQVGNCIEASEKLVSYNHNNCLEALEMIAKAFETEYEIIGKTIHLHKVEYFKNNPLPLQYGKGKGFKTGVSRTTEQSRITRLYAQGGDRNIDRSKYGNKELLLPKSQEYTYEGVTFVSDDKGLSITIKNAQNNGFVNEQSLDLSHIYPKRKGTVSGFFAVDIDKHFYDIFDDSIPQALDFNAMQIKGEKMLIYFESGMLSGREFEVSNYNHAEKRFQLVPKEEDGVTMPNDIFRPNIGDEYSVYNMQMPNAYISDNATKSGASWEMMKEACKYLYENRTDLFTFTGDLDGIWAKKNWANVGGRLKMGAYINFSDTEFQRTPVPIRIVGLKEYVNNPYSPQIELSNKVQGHSFASEMRKLQNQEVYFGELNKRTQSLTKRSWRDAQETIKQIEAAFPEYTKSIVPATVQTMMALIGNKSTQFDFVVSKTNPIKAPHTLYFDKNTKQINAGSGCLKHFTLGTTDINPNRDANSYKYWNIPAFVSGRLDDKAKTYYLYIKASKNDETAEFILSENKIDIEQEAGYYHFLYATVNSEYEGDRGIAKLNGFTEITGGQIKTDKITSGNGQQYIHLFDDHIEIKANLKITDGNKTEIKQLVSPDLLSLENRLKQYSNQQVQGEQQARTQAIATAKTATEAYARAQSELTKTQAIAAADGKITEAEQRQIQQLQLKLQETKTFAQQKVNELNIGGRNLLRNSSQKITNNNYNIASYYLTEKLKEGDLVTLSIKAQFGYAGSWIEVYNGGTYLTSLTKINGVFTSSFFWKTNESPSNLIIYVLSGNGTNNTIEWAKLERGNKPTDWTPAPEDVWDTMVDLGIIDKNAMNLTEAEKANVKFINGMFSKGADYTNGTEVVKNTITTGALTVGNTLGGNAGINGAGLSGESIRFFAGANYRDKERAPFRVQDDGSIYASKGQIGNFKIESASETSLKANGLTIASEGLIRAMGRGKDSRTTQVRINDPEIFKTVDKSAIDVYTSGFGDTTHSAMKLESRGGRKSTALILKAIWGQEESVALDVLEGDVKIRGNTNFYGEIRMHGQKGYSNTHEISNHRITIVNGIITNIEYTG >NZ_CP022383|1410304:1428953|1424618_1425101_-|WP_095901254.1|DBSCAN-SWA MEKIFVILWILLGIYILVLLMIFADLWSGVRKAKRLGIARNSYGYRRTISKMAQYYNILIACTIVDSMYGLLSWYLEIYYQTSLWLFPFITFFMAIVLCLIEIKSIREKAEDKVRFDRAGQVVQQVFINRENLEEVAKTISNYMNEKAEQSETSQTSNNE |
21 | Riemerella_phage(66.67%) | tail,capsid | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
2610996 : 2647150
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP022383|2610996:2647150|DBSCAN-SWA TATGAAAAAAGTAAAACGCTCAACCTTTAAGGTATTGTTTTACCTTAAAAAGAATGCCCCAAAGAAGAACGGTAAGGTTGCTATTATGGGACGTATTACCATTGACAATCAGGTAGCTCAGTTTAGTACCAAACTTGAAATCCTTCCACAGAAATGGGATTTGAAGTACGGCAGAGTAACAGGTAAAACAGAGGAAGCTACCCAACTTAATCGCAATCTGGAAGAGATTCGCTCACGTATTATTACTCATTACGAGGAACTGATGAAGTACGAGGGAGTGGTTACAGCTCAGAAGCTAAAAGCTACTTTTCTAGGCATAGGAGTAATGGAAGATTCCTTGCTAAAAGTCTATGAGAAGTTTAAAGAAGACTTTGCCTTGATGGTAGAAAAAGGTGTTAGAAGTTACAGCACTCTTAACAAATATGAGAATGTATATACTCATTTGAGCGAGTTTATCCAGTACAAATACCGCAGAAGTGATTTTTCTTTCAAAGAACTTACAGAAGACTTTATCAACGATTTTGACTTCTATCTTAGAGTTAATAAAAGTCTTACTCACAACACAATATGGGTTTATATGATGCCCCTTTGTAAAATGGTAGAAATAGCTATAGACAAGGGTATCATCTATCGCAATCCTTTTAAGAATTACATAAGCTCTATGGAGGAGAAAGACAGAGGCTATTTGCTTAGAGAGGAAGTAGAAACCCTTCTGCAATATCACCCTAAGAGTGCTTCTATTGAATTAGTACGAGACCTATTTGTGTTTAGTTGCTTTACAGGCTTCTCTTACATTGACATCAAGCAGCTCAAGAAAAGTCACTTACAATCGTTCTTTGACGGCAACAAATGGCTTATCAAACGTCGTCAGAAATCCGATGTGCCCTGTAATGTTCGTCTATTGGATATAGCCGAGAAAATCATCGAAAAATACGACGGCACAACTCGCACAGATACTTTATTCCCAGTACCTTCAAACGCTAATTGCAATCTTCTGATAAAGAAAATGATGAAAGATTGCAACATCATTCGCGAAAAACCTATCTCTTTTCACTGGGCACGCCATACCTTTGGCACTTTGTTTTTAACAGAAGGGGTTCCCTTGGAAAGTGTTAGTAAGATGATGGGACATAAGAATATCAAAACTACCCAGATTTATGCTAAAATCACCAATGAGAAAATCAGCAAGGATATGGAAATTGCTGCCGAACGCCTAAAGAACTTAAAAATAGGATAGCTTATAATTAACATATATTCGATGTAAGTGAATATATGTTAATTTATAATTTATAATAATTATACATATTCTTAATGTTAAAAAATGGAAAGAGAATTAGTAATAGTAGTGTAAATCCCTAATGACTTATATCAATTTAGTAGGTTAATACTAAAATAGCACTGTTATTAATAATAAAGTTTTTGATTTCTAGTAAATTTATTGTATTTTTGTCTCGCGGCATAGTTTTTTTTGTTTTTTTGACACTACAAAAGTAATGATTTTCAATGTGATTTTCTAATAAACTATGCTACTTTTTCACTAAAACAGTAAGTTATTTACTTGCTAAACTTAAATTTCTATTTATGACCAAACAAGAAAAATTAATAGGTATTCATCCAGATAATACAGATCCTTTTGGGAAATGGCTTAAAGTAAATGATCTACCCGATTCTGGTACAAAATGTCATAGAGAACTAACAGAATCTACTGCTATAGATAATGATTTAATAGAATGGATGGCAAGAAAAATTATCAATCATCACTACACTCAATCTAGGATAGATAAACTAAAACAAAAATATAAAAGTTTAGGCTATGCAAAATATGCTGCGCAACATAGAAAACTGCCAATTGAAGACAAAGTAAAAAAAGGTAATGCAACTGAAATTCTTTTGACTGATTATATCCAGACAGCACGGGGAAAGGAATTTATCAAATTTTTCAAGTTAAGATATAATCCAAACGTTGATCAAGCGATAAAAGGCGATGATGTATTAATGGTCGACTTATTTGAAGAAAACGGAAATGAAAAAATAAAAATATATTTAGGGGAATCTAAATTCCGAAAAGCGCCATCAAAGGATGTTGTCGAAGATATTGCTAATTCTCTGAGTAAAGACACCTTACCTTTATCATATACTTTTCTAGTAGAAGAAATTGCAAAAACGGATGAAATACTTGCAGAAAAATTAGATGATTACATTGTACAGGACGTAAAAGATAGAGGTGATTTGATTTATGCAGGATTGCTGTTGAGCAATACCGATACTTCAAGAACTGTCGAAAGGCATCTGAATAGTGATAACTCTAACTTAGTATTCATTTCAGTTGGCATTGATAATCCAGAAAGATTTATTGAATCCGTATTTGAAAGAGCAGAAGAACTAATTGCTAATCCTGATTTGTTATGAATATAGAACATATTGAAGAAAGTTATCGGGCATTAGAATCAGATTCTACTCTACAAAATTTGATTGCCCAAGCTAATGCAAGATATATCCTTTATAATACAAGGGAATCCCAACAGAATTTCCCTCGATATACGATTAAGGATGAACAACTAAATATTCTTGCGTTCAAATATCTGAATATTGGCTGTAACTATTTTGAACATCACGACTATACAAAAGCTGCGAGCTCTTTAGAAAAAGGAGCTTTAGTACTTGAATACATACATGGATCGATTAATACCGAGACAAAGAACAAGAGTTTATTTTGTCTGATTTCTGCGCTTGCTTATTATGTTTGTTTTCAGTATTCAAAATCATTTATACTTATACGCAAATCTCAAAGCAACACAGAAATAGCCTCTTTGATATCTCTATTTCTCAATAGAGAATTCAAACAATTGCAAAAGAATATAGATCGAATTATTACAGATCCTACCTATAGTGATAAATATTTGGCTGAAGATTTTAAGGAAGATAATGCTAAGAAAGTCTATGAAATCACAATTGCAAAAGCAATAAATAATTATGTCCAATTCTATCACACAGGAGATAATGCATTTTTAGAGACAGCAAAACGAAATCTTACTAATCTTCAAGAAATAGCTGAAATAAGGAGAGAACCCGATATTTGGTGGGTAATTCGACTATTACTTCTCATCATAGATGGTTTTAGAGAATCTTCGTTATGGCATGTATTAGGACGTTATTTCAATATCAAAGATAAATTTTTTTTAAAATATATACAATCATTAGTATATAAAGATCGTAGTATCACAGAGTTATTCCTTACACAAAGGAACTCCTTACCTAAAGTTTTAAACAGAGAGCAAAATGGAAGTATTGTCAGTATCCCAACAAGTAGTGGCAAAACAAGAATTGGTGAAATTGCAATTTTGAACTGCTTGTCAAATAATCCTGATGCAAAAATTCTTTTCATTGCCCCTTTCAGATCGTTAGCCTATGAAATAGAAAATTCACTTGATGAAATTTTTAATAATCTTGATATTTCAGTTTCACATTTGTATGGGGGTAGTCTATTCAGTAAACTTGATGAAAAAGTCATAGAAGAATCCTCTGTAATTGTAGTAACACCAGAAAAAGCTAAAGCTCTATTAAGAAGTAATACAGATATACTTTCAAGCATTAAGTTGGTAATTGTAGATGAAGGACATTTACTTGGAGCAGACAAAAGACTTATCGTAAATGAAATGTTTTATGAAGAACTGAGATACCATGTTAGTATTAATGGTGGTAAATTTTTGCTTTTATCAGCAGTTTTACCAAATGCAGAAGATTTATCTGAATGGCTTACCGATTCTTCCGATAATGTGTATAAAGAGAACTGGCGACCTTCTGATGAAAGAATTGGAGTAATGGAGTGGAATGGGAGGGCTGTTAACTTAAAATGGAAAAGTAATGATACAGAAAGAAATTCGTTTAACCCTAATTTTATTGTACGACAAGAATTACCCAAGAAGCCACGACAAACAAAAACTCATTATTTTCCAGAAAACAAGAACCAGGCTATTGCGGTCACTGCATATAAATTAAGGAACTTTGGTCCTGTATTAATATTCGTTGGTGCTAAAAAATCTGTATTTACTATAGCAAGAGAATATGAAAAATGTATTGAGACTGAAGATGATACATTTAGGTTTAAAAACGAAGCTAATTGGGAAGCTTTCCAATTGGCCTGTGTCGAATCTTATGGAGAAGATTCTGAATGGCTCAAATTTGCAAAAATGGGGATTTTTTGCCATAATGCAGACCTACTTTCAGATGTTCGACTTCCATTAGAACGACTAATGAGAAGTGAAAAACCCAGAGTAATTATTGCAACCTCAACATTGGGGCAGGGGGTCAATCTTGGTGTGTCTACTGTCATTTTTTCAACTTTGTATCAGGCGGGGGAACCAATTACCAAAAGAGATTTTTGGAATATTGCAGGTAGAGCTGGGCGTGCATTTGTAGATCATGAAGCTTAATCGGCAAAACTAAATGTTGAAAAAAAATAGGGTATGGCTTCCAATTTTTGGCAGGAAAACTCCCTTCAAAAAAGGATAGATTTAAACACTTTTTAAACACCCTTTAAAAAGTGTAAAAACACGGATGAACTCCGTGTTTTTATTATATAGGGTAGTTGGCTGTAAGGATTTCTATACGCTTCTTTCCTGTGCTATTGCTGCTGCCTAAATGCATTGACACTTCTTTTTGATACCAGCCGTGTGTTTGGGTGTAGTGGGTGAGTTCATTATTATGGTAGGAGCTTAATAGAAACTTGCCTTTGAGGGTAGAAAGGGTGGCTAACAACTCGTTAAAATGCTCTTGCTCATAGCCTCCGTAGTGACCTTGTTTAGCTCCTACATAGGGTGGATCTATGTAGTGGAAGGTGTCGGGGGTGTCGTGGTGGGTGAGGACTTCGGTGGCATCGTTGTTATCTATTTGGACGCCTCGAAGGCGAGCGGAGTAGGTGTCGGTGAAGTTGGTAATTTTGTTGTTGAGGGCAGATACGTTCTTGCTGTTGGTTGTGATACGGCAGTTGCCAACTTGGTTAGAGTAACCGCAGTTAGTAGCGTACCAAAATGCCCACGCTTGTTGTACTTCGGTAAAGGCAAAAGGGGCATGATAGATTACAAGGGCGGCTTTATAGGCTTCTCGGCTAATTATTGAACGTTCTATAAGTGTTTTGAGCTCGTCAAAGCGGGTTTGCAGCACTTTGTAGAAGGTGTAGACATTGGCGTTGAAGTCGTTGATGATTTCGGTTTTTACCTTTGGTTTAGCCCAAAAGACTGCCCCGCCTCCGAAAAAGGCTTCGGTGTAGACGGTATGTTCGGGGATAAGGGGCAGTATGTGGGGTAACATTGTTTGTTTGCCTCCGTAATAGCTTATGGGCGTGCGTTGCCAAGTGGTGGATACTGGTTTCATTGCTTCTGTGGTAACATTAATAGATCGGGTTTGTAAATGTCCTTTACTTCGGTACTTGGTTCAAAGGTATCTACATCTTTGCGTTGGGGTGTTTTGGTGTATATGCCATTGGCTACTTCGGTGAGGGCTTTGTCCATTAGGCGTATGCCCATTGGTAGTAGCTCTTTTTGCCATAGTTCCTTTGCCGCTTCTTTTGGCGATTTAGCGTATAGCTTGGGCGGTATCCAACACCAATCTTGGCATAGTATATCGCCTCTATCTATACCTGCGTTTAGCCAATAAACACTTCCGCCAGCTACTATGTCACGCATTCGGATTGCCCACTCAATGGCTGAACGCCCTCGATGTCGGGGTAAAAGGCTGGGGTGATAGCCTATCCACCCTAAGTGGGTTTTGTAGCGGGTTCGCTTGCCTATATAGTCGAATGAATGAACAGTTATACCTAAATCTACCCCAGAGGGCATAGTGTCGTAGGTAAGCATCCCCGCAGGTAATATAGGTATGTTGTGTAGCTTTGCTAAACGCCCGATATACTTATCGTCTAAGGGGCAACATACGCCTACTACTTCATAGCCTTTGGTAATACATAAGGATAGTATTTCCTGCCCGAAATACTTTTGTCCGCTGATAAATACTTTAAACTTTTGCTTCATTTTTATTGTCTTCTGTTGTTGTTTCTGTAATTTTTCCTAAGTATTTAAAGCCTTGTACGGCTCTAAAATGCCCTCCATAGCCTGCCACCATTACCTTTTCACCCCCGATAGGTTTGCAGGTTTTTATCATTGAAGCTCGGCTTCGGGCTTTGTTATCTCCGTGCAATTTAGCAGAAGTTTGTTCCCATTTATTAGAATGGCGAAGGTAGTTGCACAGTTGGGGGTGTGAGGTATGAAAAAAAGTGTGTAGTTTGCGATTACACCGCCCGTTGCCCTCCAAATGGTACTGCATTACAAAGTTGAGAAATTGAGTACCTACGCCCGCTCCTTGCCATTCGGGCATTACTACTAATCGGGTAGCACGGTAGGCATTGGCTGTGAATAGAGGAGTGACAGCAACGTGGCAAACAAGTTCACCATTGACTGTACCCACAAAGTATTCGGCGCAAGGAGGGTGCGGTAGGTCTAAATAGTAATGTTCTTTAAAAAATCGCCAGTAACTACCGTTTGCCTTCCAAACTTGGAGTTCAATAGGGGGTCGTTTTTGGACTTTTTTTTTACTTCTGATACTCTCGTATCATACACCCAATCAGGTTGCAACCATTCGATTATATCGTAATGACAAGATAACAGAATAATTTGTCGATTAGGCACGCGTCTCCACGCTTTAGCAAATGCCGAAGCCCCTATTTTGGCGATTTGGCGGTCGATTACGGAGGTAAATTCATCTACTATTACCTTGTCGGGGGCATCACAGATGAGGCGCGCTAAACCCGCACGAAACTGCTCGCCATTACTGAGGACTTTGAATGGACGCAACCAAGCGGGTACATCGCCGAGCCCTACGGCTGAAAGTGCGGAGGTTACTTCGTTCATTGACTTGTTAGGGGTAATATCCTCAATAATGGGTAGGTTCGGGTTCCACCCTTCGGCGAGGTTGGTTATACCGCTATCCCATATTTGTTTGCCTATGGAGGTTTTACCGCTTCCAGAAGGACCTACAATAAGCCCTATTTGCCAACCTTCATCTTCTATGGGTAGGTTGGCGGTGTGTTCCCACGTGTGCCCGTTTTCGGCATTGAAAAGGGACTTTACTTTTTCGGCACGAAAGGTTTTGAAGTTTTCGCTGGTGTGTTTGATTTTGATTTCCATTATACGCTTACGACTTTAAGGTTAGTAAATCCCATCTTTTGGAGTTTCTCGAATAGTTCTTTTTGTTCTTGTTCGCTACCTACTTTGATGATGATAGCGTGCTGTTCTTTGTACTTGAATTTTGCCATTTGTTATTTTGTTTTTGTAATTCAGAAAATAGTTGTACTTTTGTGGTCTCTCACATCATTAAACATAAAGAAAACACGCAGGTACAGAAGACATATTGTCCTCCGTAGCCTGCGTGGTTATGTTTAAAATGATGTGAGAGTTATTTTAAAGCGGAGGACATTTTTTTACTGCTTGACCTCCTTTTTAGCAGTGCTTTAAAAGCGTTTTAAAAGCTGTTTAAATCTTCCACCGAAACGGCTTATATTTCCAAAGGATAAATACTAATACAGCGAGTAGCAAGAGCCAAAGGGTATGCCTTACGGGGCTACTTTTCAATTGTTTGCTTACTTGCTTAACTTGAGTGTATGCGTGTTTTTGTACTTCGGCTTTAGTTCTTGTGTATGAATTATTATAAAGGGTACTATCAGCCTGCTGTAGGCTCTTAGAATGGGTGCTTGTAACTCGTAGCTTAACCTTTCCGTTGAGTACTCTTATAACTTCATTATCGCCGTCACGAATGCGGGTGTAGATAAGTTCACGTGGTTTGCCTACACTATCGGTGAGGCTTTCGAGTTCGAGTTCAAAGGACTGGTCGGACTGGTGGGACAAGTCTGTTTTGCGACCTTCATAGGCAAAAAGCTGTGAACTATCCTTGTAATGGATAAAATGCTCTTTTTGAATTTGCTTTTGCTCGGTAGTGGTAACCTTGCGGGTACGGCAACCTACTAAAGCAAGGAACGCTAATAATAATAGGGTTAATTTTCTCATTTGCTAATGTTTTTGTATTCGTCTTTTGCGTTGAAACAAGGGCAAGCTTTAGCTACTCCTGGGAAGTCACGGTGTCCTAAGATTTCAGCTTGTGGATATAGGACTTTAAGCTCGGTGAGGAGCTTTTTTAAAGCTTCTTTCTGTGCCACTGTGCGGGTATCTTTGGGTTGGAGGGTGTTTTTGTCAATACCTCCAATGTAGCAAATGCCGATGCTGTCCTTGTTGTGTCCTTCTACGTGGGCAGGGACTTTATTGACGTCTCTGCCCTCTTCAACCGTGCCGTTTAGGCGTACAATGTAGTTGTAACCTATCTCATTAAAACCTCTTTGTTTGTGCCATAGGTCGATATCTTTGACGGTGTGGTCTCTGCCCTCTGGTGTAGCGGAGCAGTGTACTACGAGGTAGCGGATGTTGCGTGTGCTTTTTTTCATTGGGTTATTAGAGTATTAAGGTGAATAATATAGTAAGGGTTATGGCTATTGCCAAAGGGTTTACCCATAATACCCATTGGGCATTGTAGCTTTTAGGCTCTGGAGTTACACGCCTTTGAAAGGCTTCATACTGCCAACGTTGGATATCGTCAAGCAGGGGAAAGTCTTTGTCTGTAAGCGGGCAAAAGTGAAAGTACCCAAAGCCAAAGAAACAAGCTACTGCAAGCAAGGGCAAGAGTATGTATAGCCAGCTATAAAGCTCGGCACAAACAATAAGTCCTCCAATGAGTATTAAGGGTAATATGATGTTGGCAGAGCGGGTAAAACTTCTTTTTTTACCTGCAAATGGCACTATATAACTGAGTGCAAATAGTTTGATGATATGTTTTCGCATATATTTTTTCATATTCTTTTAGATTTTAAGGATAACCAGCTGGGCGACTTTCTAAAGATGAAATCCTATTTTCTAAATCGGCTACTTTATTATACATATTTTGAAAGTCTTCAAATAAAGTAGTCAGGCTATTTCCATTAACATTTACGCTTTGAGCACGAATATTTATACTATCCTGCGCAGTTAATGATATTGTTTTATATCCAGATATATTAACATTCATTGAATCATCACCACGAAAATTTACATCTTGAGCACGAACATCAAAGAATTGTACAATTTGTTGTCCGAGACTACCTCTTAAATAGATACCCTGATTGGTTTCTAAGACAATATTAACTCCAGATACTATATCCACTCCTTTTCCAAAAAATTTGAGAAGTTCTTCACTTGAGTGGATGTATAGAGATTTAGAGGAAATATCCATCCTGCCTAAAGGATTAGATATGGATATCTGTGAATTTTCAAGAAAACTAACGAGTTGTTTTAATTTTTCAAAAAATTCGTCTTTATCCATTTTTCCATTTATTATTGTTGCCAAATCCTTGTTGTTCTTAACTTGGGTAACAATTTCTTGAAGAGTATCAAGGGAGGTGTCGTCTACGCTTAGGGTAGTCTCTACTTGTCCCATTTTGGTTTGCAGTCCGTCAATAGCCTCTTTCAGCTGTTGTCCTGTGCCTTCATATCCGCCTTTGGGTAACAAGCCCGATACATCAGTAGGCTGCAAGTCCTCTAACTTCTGCTTGAGCTCGTTGGTGAAGTCATTAGATGATAAACCTTTGCCTTCCTCTTTGTTTACTTTTTTGTCGATGAGCTCTTGTAGCTTGGTATTAGATTTTAGAAGAGTAACGATTTCCTGCAAGGTGTCCAAGTTTACATCATCTACTTGTAAGATGGTATGAATGGCTTGTATTTGGTTTTTCAACTCATCGAAGAGGGCACGGTGGGCATTGGTGTCATTGATGTGATTGAGCAGCTGCCCTGCTGAGGCGGTGCTCTCAATTGCTCTGCTAAGCCCCTCAATGTTGCTCATTGGAATTTGTTCGCTTTTATGCCAAAAACTGTCAATCCAAGCAGCGAAATGTTCTTGCGCGGGTTTCATTAAGTTAGAAAACCACTTTTTTAATGTCTTTTTTGGTGTCATATTACGTTTGTTTTTAGTGTTTATTCGTTAAGAAGGTTAGAATCCTACATACTCTATGAATTGCACCACGCGGTAAGGTGGCATATTGTTGTGGGGCTGGTCGCCACCAACTGAGGAGGTATTTTGATTATAATAAGCATCATATGTACCACTATTCCAATTACGCCCTCCTAAAAGCCCTCCTCCTCCATAACGATTGTAGAGACTCTCACTACCTTGTTGGTGATTGTGAGCAGGCATTTCCTCAATAGTGAGTTTGTGAGAACGTTCGCCTCCTTGTTTAAGGATCTGATTAAGACCATAGTCTTGAACATCTTCGGGTTTCTTAACGTAGTCAGGGTCAAGACCTATCGGCATTCTACCTCGTAAGTTTACGTATTCTCTCCAGCCTGCGGGTATTTCTGATGCAGGTTTACCCCATAAGGCGATGAGTCCAATAGGCACGGCTTGCTTTTGTTTTTTGAGTTTTTCGACTTCGTCTTTTAACTCTTTCAACGCTTTGTTTTCGGCTTTATTTTTGCCTAATTTTTGTAGGTTAGTAACGCGTTGAAAGTCTTCCCAATTGTAAGTCTTTTCGGGGGTAGAGCGACCAAAAGCGGCTGTGCGGATGTTTTCTAAGGGACGGAGGAAACCGTCCTCAAAGGTTACTTCGTTGGTTTCCTCTTTGATGATAATGGTATCACCTTTCGCACCTCCTTCAAAGGGGAAAAGTTCTCCATTAATAAAGACAGTGCCAGGGGTGATGGTGTTGCCTATCTCCTCGCAACCTGAGATAATTGCCTTATTGCCTGCCATACTTCCTAAGCTATTGAAGAGGCGGTAGCTGTTCTGCATAAAGGCAAGGAACGCCACATCAAAAGGATAGCCTGCGTTGTGTTCTGTATTTATTGTATTCATAGTCCTCTCCCCGTTCCCCTCCCCGAAAGGGAGGGGCAATCCGCACGGGGTAACGGTTTTAGTTATGATTTATCTCCCATCGTTTGCCCGCGAGCTTGTAGAAGTTCACGAGGGCTTCGAGTTTATATTTGTCGTATTCTAAACCTTGTGGCAATACCACTATAAAGTCCACACCTCCGTCGATATAATCGCCTCGTTGGTAGAGAAAGACTTTGCCTAAGTACAAAGGCTTATTAGCACTGCGGGGATAGATATACAACCTTTGTTTCTGCCTGCCATCTTCTATACGTATGCGCCGTTGTTCGTCGTCAAACTCATCATTAAGGGCTTTGCGCAGGTAGCATACTTGGCTGTTATGAGCGAGGTTGTACAAGTCAGCTTCGCGAGCTCGCTGAAAGTCGTACAGCAGTTTGTGAAATGGCGTTGCCAGCATTCTTAACCACGCCACCAACTTTGGCTTTCGCAAAAAAGTGGGGGTAAGCAGCACGAGCAGTTTGTCGATATTAAGGTTATACATTGCTGATATAAGTGATGTCGTTGAAGTTATCAATGGTAAAATAGCCTGCGGTGGGTATCTTGCTTATTTCAATGGTTTCAAAAGCTCCGTAGTCTCCACCGCTGGTGATGTTCTTACTTTGTGCCAATACCAAATGCGGTATTTTCACTCCTTCGGCTTGTTGTAGCGCATCAATAAGGTGTGCTAATACGAGCTCACCATTGAATGGCAACCTTTTTAAGTAGCTTTTTATAGCCTCTTCTACTGGCTTGGTAGCGTGAATGATACTTTGTCCGTTACTATCTAATACCAAAGGATCATAGACGATTTTCATTTGCAGATGTAGTATATCGGGTTGATAATTTACTACCGATAGGCGTACGCCCGCGTCTTTTATCTCTTGCAAGTAGGCTTCAAATGATTGCTTTTGGGCATCGGTGATTGGTTGGAGTGTGTCGCCCTGTTCACCTGCTATCTTCACTATCAAACGACCTTCGTTTTTACTTTCTATTACCGCCGAGTACTTTACTATCTTACTTGCTTCTATCTGTTCCTCCGTGTATCCTTGGTTATTGAACTTATCGCTGTCGGGCAATAAATCAAACCCATACTGAAAGGCAAGGGCTTTGCTGCGATACCAACGTGCGGTGTGGGGTTTGAGTTCGGCAAGGCGTTTGTCAATATCCGATCTATGCTGGTCGAATAGCTTCTCTAAGCCCCATATTGCCACCGCTATAATATAGACCCACAAGCGCCATATAGCTACTTTGGAGGTGCTATTGAGGCTTTCCAATGCAGGCTCTTGTGCTTTGGCTTGCAGGATAAGGTTTTGTATCTCTTGAATGCTTCGTGCCATAGTTATTGTTGTGTTACTACAAAGTCTAAGTTAATCGCCCAAATACTAATACCCTCAAGGCGTTTAGCAACTTTCTCGTCTTCCTTAGAAAAGGCAGTTGCGGGCTGTAAGTTCTTTGCGATGTAGTAGTTTAGTATATCTTTATTGCTAAATACCTCTGTAGGTAATACTAAGGTTTTGCCCGCTTGCACATCATCAGTGATGTTAATAGAGTTGGCTTCGGCAAACTCAAAGACGCTTTCTATTGTGCCTGTATGCTGTAGAGCGAGGTCTAATAGTGACTGATTATGTAGGGCTGTTATTGTCATCTAATTCAAAAGTCTTATAGAATTTTTTATTAATAATCTTGAGCAGCACCTTTGCAAATCGAAATCCTAAACAATCTAAGTTCTCCAAGAGACTCACCACGAGTTGCCATATAATAGCTATGAGAACTACCCAATACAACCAATGAAAGGGGTCGAACTCAAAACCTCCAAGACTTGGAAACTCTACATTAGCCGAGAAGGTATGCAATATATAAATAGGTACAAGATAGGTGGCTATTTTCAGGAGCATACGCCCAAACTTGCGACTCTCGTGCTTCTCGCCCCTCTTTCGGGAGGCTTGTACCCCAGTGATCCACTCAAAGATAAGTAACACTACATAAGCGGTTAGGAATAAATGGTTGAAACCAAATAAGAAGTGCACGGTGGCAAATAGTAATGATAGTATAACGTCCATTTTGATAAATAGCATTGAAAAGGTGTGACCAAAGGATGAGCGTAAGAATTCGTTAGAGTCCCTAAAGCCAAAGCCTTGTAAGATGTAGTTGAGTTTTATCATATTATTATTTTGTTTTTAACTTATTGTACCCGTTCCAGTACTGTTAGTAGCACCAGTTTGGGCGGTGGCTGTACCTGCCGTGCTTACAGATATACCTGCGGCTACGGTTACCTCGCCACTCTTAACAAAGTCATCAATAAGAGAGGCTAAGCGTTCGGCATACTCTTCCATACTTGCATCGGTTTTGCGCTGCATATCTTGTTGCAGGCGGATAATGCCTTGTTGAAGGGCTTGTTTGTTTAGTGCCATAGTGCGTTTATAGGTTTATTTTTAACTCTTCATAGCCTTTTGAGAGGTCTATGGTAGCATTCTTGTAATCGTCGTATTCTAATTGTATTTTTAGGTCTCGTTTAAAGGCTAAAGGGTTAGTGTTTGTTTTCAGGTAGTTCTCTACACCAAACCCCATAAAAGGAAACTCTTTAAATTCCCCTTTCTGTGCTTCCACTATATGCTTTATGTGTTGCATATCCGAGGACTCAATTACAAAATCACCCTCCTCAATTACTAAATTGCCTACAGTATCTAACAACAAGTCTTTTCGTGCCATATCATTCTAATAACTGATTAATTTTACTTTCTAACTCACTGAACTTTGCCACGTTGTTAGGTGAAAAATTACCAACACCTGCAGGGGTTTGTATCACTACCGTTTTAAGCTCGGTAAGCCACTCACTCAATAACTGTTTGAGGTTAGCAAAATTGTTTTTTATCTGTATCTTCCCTTTAGCTATCTTCACTACTGCACCATCAATCTTTACTTCTATACTATCAATCTCGGTATAACCAACTATACAGGTTTCGGAGGGTTCGCCCTCAACCTCCAAACACAGCACCTGCGAACCTATCTTAGGCACTATGTTTAAACAGTTTTCAAATACCCCTTGAACAGCGTTTAAACGCACATCTAACAGCAGAGGTAAATCTTCCCTCTCCACCTCACAGGTATTCCCCTCGATACGACTCACCACACCTACAGAGGTAACCTGCTTTTTGCGATGATTTAGGGTAGTAATTGCGGTTGTAAGGGCTTGTTCCATAGTTCTGTAGTGTCTGTGCTATTAGTGTTTGAGAGTTTCATTGACAGCTTGCTTTTGCGCTTAAAGCCGTCTTGTGCGTTGAGCAAGGTAGTTACGCTTTCTAACAAATACAGTCCGTCGCGGTGTTTGTCGGGATAGTTAGGATCGGTAAGGGCTATAGTATCGCCCACTTGAGTACGTGGGTAGCCAAAACCCTCTAAAGTCCCCTCGTAACCGTCGAATACTGAACTGTTATAAGTCTTTTCGGTAAAGGCTTTTAGCTCTTCCAATGTAAGGTTGGTAGGAGCGTGCAAAGTGCGTTCGCCTCCTCCCTCTTCCCCATATTGGTAGCTTACTTTTTTAGAAGTACCTTTTTGTGAGCTCTCAGCTTTCAATAGTACCTTGCGTTCATTCTTAGTTTTGTACTTTAAATCTTTACTTTGTCTAAAGTTCTTATCAAAGATAAAGTGATGTATCACCTTTGATTTAAAATCTATTTTAAGCCCTGCAATAAGCTTTTTTTCTCTAAAAGAACAATGAACACCATACTGTTTTTTTAGCTCTTCCAGCACTTTATAAGGCGAGGAACGCTCTATCATTAGTTTGCCGAGTTGCATATCCAACACCTCCGTTTCATAGTCGGGGGCAATGTCTTTAAGTAACTGCTTCAAACTTACCGAAGCGTACGTTTTATTAATAAGGGGCTTGTTTTTCAGTTGGTACATCTCATCTTCACAAGTGAGTAACAGCGGTATATCTGCCCCTATTTGAGTGATATATCCCTCAAACTCGGTAAAATAGTCACCGTTGTAACCAGCTTCAATATGAATGCTATCGCCTACCTTTATCAGCTCTAACAAGTTCTTGCGTTCAATGCTAAAACTCTGCCCGTCCTTGCGGGTGTTCTTAAACTCACGAGGTAACTCTACCTTTGCTGTAGTGGTAAGCAGTTCTATACTATTGGCAATCTCTATTTGCTTTACTGCGCTGAACTGTATCTTGCCAGCTACTGTTATACGAATATTGATATTTAAATAACTACTTCCCATCACCTTCCAATAAGTTAAACGTTACCTCTTTTACACTTTTAGCACTCAACGTATAGGCAACAGTATCACTATAACCTTCTTTAGGATTAATAGTGATAGAATCTATATAAATGCTATCAATACCCTTGTCATAAAACTGTGCTCCAACTACTTTAATAATATCATTGTGTTCAAAAAGAGTGACTATTTGCTGTATTTGACTATCGGGGTAATTGTGGTTTTCTATATCCACCAAAATACCTTGAATGGTAATCTCCCATTCGTTGGTATTCCAACGCTCTACAATAGTACTACCATTAGTCTCAGTTTCAATGAGTTTCTTTGAGCGTGAGAAGGAAAGGATAGGAGGCGGAGCAAAAACAGTAGATTGTTCGCCTCCTATAAAGCTGTTAAACACAAGGCGAGTATTTTCGTACTCCATTGTAATCTCCTCGAAATTGGTAGCCTCGCCAAAGGTCTCCACTTGGTACTTGTTATCCTCTTTGGTAATCACTACTTGGCTCATACCCTCAGAAGATAACACTATTCCCAAAGCCCTCCCATAGCGAGAAGCCAAATCTAATACGATAGATTGTCCGTTTTCCATTTGTTACAAGTCTTGTTTTTTAACACCTAATATTCCTTGTTCTGCCAGCCATTGAATCTGTGCCCACTTCATAGCCCACGTTTCATCGTCCAAATCTTCGGGGAAGGGGATATGAAGGTAGTAACTTATCAAGGCATCAACTTTAAAGTACAAATCGCCTGTTTCCCTGTAGTTTAGAGCTCTATTAAGTTCTAAACAGTTCCAAACTTTCCCTGTCTAATAGGTATCAATTCACCAATCAGACTTGCTGAGGCATAGAATAGTCCATCATCGGCAAGTACTTCCTCTTTGTTGGTAACCAAACAAGCCTTTACCAATATTTCCTGTGCCTTCTTAGGGTCCTGATTTAAGTATTTTAGGTATTGCCCTACCACGTTGCGAGAGGGTACTACTGCTAATACTTCCAGCTCTTCTGTGCCGTTGTCATCTATTGGCAAGATAAGCGATTTTAGCTTGTCGCCATATTCTTTTTTGAGGCTTGTTTTTACCTCTTCACTTACTTTTTTTATCATAGTTTTTTTAGTTATTATAAGTTATGCTACCCTTAGCTTTACTGATAGGGCAAACAAATCGTATTGTTTTTCGAGTCCCATATCTCCAGTAACCTCTCGCCCTTCGTTTTTAAACTTTGCTACAATCTTATCCACTACTATCTCGTTGAACTCATTTACAAACTCAACTGTGATAGTAAAAGGCTTTATTTTCAATAGTCCACCCGAAACACGTTCCAAAGGGGCTATTTCGTGCATTGGAACAGTCATAGATGCTGAAGGTGTAATCTTGCCCATTGACCAGCTTGTAGGTTCCGCTCCCAAAGTATGGTTCAGCTGGTGTTCCTGCTCATTGCCATAACTAATACTCTTTACATTGATAGGAATACCATTAATTTGTACCCTCACATCAGCTGAGTCATAAGCTTTTCCGTTTCTGTTTATATCTGCCATTATGCTTGTGTTTTAAGGTTAATCGTTCCTTTAATCTCACCAATACTTCCTTTTGGCACTACTACAAACGATATTTTAAGCACCTTTTCTACTACAAGGTCACTATCCTTATCTATGGTAGTTTTGCCATACGAAATCTCACCATTGGCAAACATACGTTCCAATACGCTGTCGCCAATATCTTCCAAAGCTACAATTGTTGCAGGGCGCATTTTGCCTTTCTCGTCAAGTTCCCAATCAGTTTTGATTTTAGGCAGGTAGGCTGTGCGCAAACCTCGTGAGGCTTTGTCCATAATACGTCCGTAGGTTATAGAGTGCTCGTTCATATTGTGATGGCTATCTACCACTACAGGCGTACAAGTGTGGTCATTGTTAATGCGTACTCCCGCAATACCTGCGTAGGTAATACCAAAAATGTAGCCCTTATCTTCAAGGGTTTGTAAATCGTCAAACGCATCTACAATAGTAGTATATGAACTGAGTGCGGGTTCTATCCATACTCCTTGTGTAGCATCGGTAAGATTAAATAGCTCGTTGTTGCCTATGTTCTGTTGTACAAGGGCTTTTGAGCATACTCCAAGCACAGTGCCTACATCGGCATACTTTTGTGCCTTACCCTCCTTACTTTTTGCGTAGCTGTAATCTTGTCCTATTACTACTGATACTTTGGTAGCGTTAAGGTTAGGAAGCTCTCTGAGGTTAGCCGTACTGCTGGCTGTACCTCCGTACCCGTAGCCCTCCAATAACACTTGGCAAGGCATAAAATTGTTATACGCCCATTCTGCTAAGCCTTGTGCTTTAGCAATCGCGTTATACACCTCTTGAGGCAAGCCGTTGAGCATAGTGTACTGCTCGTCACTATCGCTATTGATAGCAATCGCCAGCTGCCGTATCTCGCCTTTGGCATACACCAGCAGCTTCTTAGCCTTTGTTTCGCATACCTCTGGCATTTTGCTGTTTTGGGCTACCAACATTAGGTGTAGGGGAGTTCCCTCGCCAGCCATTCGGTAGAACTCTGTAATATGTCGCAATACGTTCACCTGCTTGTTGTCTTCAGTTATTCCCAGCTTAGTAGCATCTTTCACGTTGTAAAGCGTGGTAGGGGTATCCCATTCTAAGCCTGTAGGTTTAGGGGCAGAAATAATAAGCCCGCTAATATTATCACCTGTGCTAATAGTGTTAGCACCCAATGCTCCTTTACTGATAACAACTCCTTTTAAATTACTCATCGCCTTCTGTATTTAAAGGTTCAACATTTTCCTTTGTTTCTTCTTTCAAAACGCCCTTACGGGTAAGAGTTTCAATCTTCTTAGTATCCTCTACGCTATTCTGTGCGTAGTCTATCTTTGTGAAAAACTCACCTTTAGGGTTTAGGTACAATCTTTGAAGCTGGGGTTCAGCTTCAAAGATTTGTTTTGCGGTTTCTAACTGACTTTTATTTGCCATAATATTTCCTTCTTTTAAGGTTACTCATTAGCTGATACCAAAGCCCCAAAACCGTACTCTTGACGTTTGTCGCATAACCCCCAAGTATGCAGTCTAACTTCTGCTGTAGGGCGTTTGCTTCTTGTATCCTGGCGCATTGGTTTAGTGAGTACATTCACACCCTCAATATGGTACACCGTATTAGGAGCATAGAAGAAGATTGATGAGCTTTGGTCGCCCACTACCTTTTTAGCTCCCATCGATTTTAGTTCACCATTTTGCCCATATAGTGGAGTTGTAGTGTTCTCAAAAATCTGCAATTCAAAGAAGCGTTTTAGCTCTCCTGTATTGCGGTCAATTTCCAAATCGCGATAGTGATTTGTGTTAGCTCTATCGTGAATAAGATCAGCCTTGTGCTCGTTAGAAAGCACCAAGTAATAGGCTGCTTTATTGTTCAAGTTCAATGCAGTGATATGTTTAAACAGGAACTCACTCAAGTCGTTATAGGTTAGTCGTTTTCTGCCGTTCACTACCTCTCCTGTAGTACGAAGTACAGGCATTGCACCATCCACATGTTTTTTAGGTGCAAGCTTGTGAATAGCATAATCACGCACTCCAATTCTGAACATATTGCTATGCTCTTTACGAATAGCCGACTCTTTGTCAAAAGCCATAGCGCGCAATTCTTCATCAGTGTATTCCGTAGGGGTAGTGTCAAGCGCATCCCACGCTACAAATGTTTTCTTACCTTCAGTTTTCTTAGGGGTAAAATCAACCGTAGCATTCACTACAAATTCTACATTCCCGATGAGCTTATTGAACTTGATACCGTCTTTATCAATAGCACTGGGGTTAGGGCGTTGCAACACACTGACAAAAGCATCGTTGTAGTTGCGAAAATCTTCCAATAATTGAGGCTCAACATATTGTTGTAGCCATAGTCCGTCTTCTAATGCTGGCATTACTTTTTGTATTTAGCGTTAAACAATTCTTTAAACCTTTCGGGCTGGTCTACCGATAGCTTTTCAAGTCCCTTAGGATCTTCTTTTTGCCATTGGTCAAAATCCCACGAAGCACGTGCCCCAGTGTTACTGCCACCACTTTGCAATAAAGAAGAGATGTTAGGAGCTTGTGTAGCTTGTTTGCCTCCTGCTACAGCTGTATTTTCTAATACGGTGATGAGGGCTTCTACTCCCGAAGTTTCTGCGATTTTTTCATAGACAGCCTTTTGGGCTTCTGTGATTTTTCCACTTTTAACCGCACCCTCAACTACCGTAATAATTTGTGCTTGTTTAAAAGTGTTAAGAGCTTTTTCAGCTTTTTCACGAGCTTCTTTTTCATTGGTAATGCGCTCCTGAATAGCTTGTATCACAGCCGTTTCTGAACTTTCCTCAGTAATACCAGAAAGCGAAAGAGCTTGTACTAACGATTGAATTAATACTGATTTCATATTCTTATCTAAAATATTTACTGTTTTAAGCGTTGCAAATAGCCCCGCATACATATTATAAACATCCTGTTCACGCATTGCATTGACATCTTCAATAGGTAGCAAAGTAGCTGTTTGTGCAGGAATAACATCAGTTACAAAACCTAAGCGTTTAGCTTCTTTAGCATCGAACCAGTTGTCGCCTACTAACCATTTTTCAACCTCTTTAGCCGACTTACCTGTACGGGCAGAAAGTTTTTCTACAAAATTCTTTTCAATAGAGCGAAGTAGTTTAGCCTGCTTTTCAAAAGAATCGGCATCGCCATTAGAATAGGACGCGGGGGCGTGTAACATTATATATCCGTTTTCTACAATACTTACTTTCTTTGCTGATAATATGATAATCGCCCCCATACTCGCCGCTATTCCATCAATTACAATATGTATAGAAGATGCTGATTTATTCAGTGCGTTATATATTAGGTTTCCGTCAAACACACTCCCCCCTGGTGTATGTAAATGGATAGTAATTTCCGAATAGTCTCGCTCCAATCGAGCAAACTCTTCTAAGAAGTATCTACCATCACCCTCCCATATAGTACCATAAGCGGTAAGTGTATTTTTTTGCGTTCTAATAATCATTTTATTGTTATAGCGTTGTTGCGTTCATTATTTTGGTGCAAAAATATAGGGGCTTTTTCAGCTATAAAAAGATAATTGCAATGGTTGCAACTTTATTCATTGAGGAAAGGAAAAGAGCCTACTTTTGCCCTAAAAATCTGATATAAAATGGCACAAAAGCGCACTAATAACAGTACTTTAATGGAATTGGCTCGCCGTATGTTTGTAGAAGAAGGTATGACGGCTAAAGCTATTGCGGGCACTATTGAAGTAACAGAACAGACTATTGGCAAATGGCGCAAAGGGGTAGGTACTAACGCTATATCGTGGGACGAACAGCGTTCACAATATTTGTCTGCTCCTCACAATATCAAAAAGAATTTAGCTAAGGAACTCACTCGCTTGGTTGAGGGAGGAGAGGCTACCTTAGATATGGGTGCTATCAATTCGGCTATTAAAGCTATACAATCAATGACTGATGAGACTTCTGTAGAAACAGTATATAGTGTGTTTAAAGAATTTGATAGCTGGATGAGTGAGCAAGATCCTGAAATGGCTGTCGCCTTTTTGGAATGGCACAAACTATACTTATTACACAAAGCACAAAATCAATAAACTATGGCAGAAGGAAAATTGACAAAGGCAATGGAGAAGCTCCTGCAAGACTATGACCAGCATTGCAGAGGAGTGGAACAAAAAACTACTTCGGGCTTAGACTTTTATGAAGCTCCCTCTGAACGTAGAAAGAAACGTCTTGCCTTAGAAAAAGACTATACTACTTGGTTTGAGTATATGTTTCCACAATATGCTGAAGTTCCTTGTGCGTGGTTTCACAAGAAGATCGCTAAGCTACTGATTGAAAACGATGTGATAAGCCTGCTTGCTGAAATATATCGTTCGGGAGCAAAATCGGTACATTTGGACTTAGGTATTCCAATGTTCTTATATGTAACGGGCAAGCTGAAGTTTATGCTATTAGTAGGACAAACAGAAGATAAAGCAAAGAAGCTTATTTCGGATATACAGAGCCAACTCACTCATAACCAACGTTTTATTCACTACTACGGCAAAAAATTTAAGTTTGGTGATTGGGCAGATGGCGACTTTACCACTACCGATGGAGCTAAGTTTATGGCTATGGGTGCGGGACAGTCGCCTCGTGGTTTGCGTGAAGGCAACCAACGCCCTGACTATATAGTGATTGATGATGTGGATACTGCCCAGCGTTGCAAAAATGATGAACTATCCAAAAAGCTATTCGACTGGGCTTGGGAAGACTTAAAAGGCACTTTTAACGAGGGAGGCAAGTATAGGCGTTTTGTGGTTGCTAATAACAATTTTCATAAGAACACACTTATCAATCAGCTAAAAGAAGAGTTTGCTATTATCAATAAAAAAGCTAAGGAGTATGGTTTTGCCCAAACGCACCATATAGTAAGCGTGCCTGCGGTAAAATCGTTAGAAACCTTTGAGCCTAATTGGGGTGAAAAAACATCGGCTGAGTATTGGCGAGAAAAATACCACTCTACACCTTACCGCTCGTTTATGCGAGAGTACATGCACGTACATATAGTGGAAGGTAGTATCTTTAAGAATGAGCAAATTCAGTACAAAGAACGCCTGCGCTACTCACAATACGATGCTCTTTGTTTTTACGGCGACTTGTCGTACAAAGATGCAGGCGACTTCAAGGCTATGCTTTTAGTAGGCAAGGTAGGGAGAGAATACCACGTATTGCTGGCGTATGTGCGCCAAACCTCCCGCAACAACGTTGCCCGCTGGCTGTATGAAACCGCGCTACACGAAAACCTACTCAAGTACAACATCGCCTACTATATTGAGGGGCTCTTTGCTCAAGACGAATTTATAAGTGATTTTGACGAGGTAGGCGACACCTACGGCTTCTATATACCTGTGCAAGCCGACAAAGATAGCAAAGGCAATAAGTTTGACCGCATTGAAAGTATGGCAGGCTATTTTGAACGCGGTAACATATTTTTTAACAAAGTCCTACAAAACTCACCCGACTTTGTAGAACTCATCAACCAGATATTGGCATTCCAAAAAGGTTCAGGGGCTCACGACGATGCCCCTGATGCCCTACAAAGTGCCATTGCTAAGCTCAACGCCCTTGCAATACTCAATGCTACTCCTGCCAAAACCATAAGCCGAAAGGAAATTCTAAAAGACAAACAAAACAGATACTGATATGTTTCTAACCACAGAAGATTACACAGCCCTTATCCGTAATGAGATAAAAGATATACTGCTTGAAAATTACAGCGAGGCAAAGCTACGTGTCGCCCAGCAAATGGCTATCGACCAAGTGAAAAACTACCTATCGGGGCGTTACGATGTAGCCGAGATATTTAGCAAGGAGGGCACAGAACGCAACGCCCATATCGTAATGCTCACGCTGGATTGTACCTTGTATCACCTCTATACCTCCACTGTGCCTAAACGTATGCCTGAAATACGCTCGGTGCGCTACCAAGATGCTATTGACTGGCTCAAAGCCGTTGGCAGTGGTGAAATATCAGCCAACCTGCCTCTTATCAAAAGCCAAGACGGACAACAGCTATTAGGTATAAAAATACAATCAAAATACGCCCCCTCATCCAATAAGTGGTAGCCCGTAGCACAGGTAGGCAGTTTTCTTATTACTGTACTACTGTTTAAATACCGTTTAAACGCTAAAAATACACCCTTAAAATTACAATACAATGAAATTATTAGGTTATCAATTTTCATTAACAAAAACGCCCAAAAATAAAACCAACGAACACTCAGTACGTGGCAATGCCCGCACCAACCCTGATGTGATACAATTCGTGCAGTCGTTCAAAGATGCTTCCCGCAAGGATATAGCCAAATGGCGTAGTGCCTTAAGTATGGCTCTTCACCCAGAAACCCCCAAGAACACCGCACTTTATGACCTTATAGACGATTTGCTAACCGATGGGCACTTGCAGTCGCAAATACAAATGCGCAAGATGAGTACCCTTAACACCGATTTTTATCTCATCAATCGCAAAACGGGTGAGATAGATGAGGAGGCTACTTTCGTATTCCAACAACAATGGTTTTACGAATTTTTAAGCATCGCCTTAGACAGCATTCTATTTGGGGCTACCCTTGTTGAGTTTAGTTCCTTTGAAGGTGAAAAAATTCGGTTCAACACTCTGTCCCGCAGGCACGTTATCCCAGTGTTAGGACGTATCCTACCTGATGTAACTAAAGAAGACTACATCAACTACCGAGACGAGTACTACGCCCCTTGGCTGCTGCAAATAGGTAAATCCGACGATTTAGGGCTCATCAATAACATTGTACCTAACCTAATATGGAAACGCAATGTAGCGCAATCGTGGGCAGAGTTCTGCGAAAAATTCGGTATGCCTCTCATTACTGCTACTTCCAATTCTACCAACAGTGATGTAGTGGACAAGGTAAACCAAATGCTATTAGACTTAGGCGAAGCAGGCGTGGCTACCTTCCCACAAGGCACCAGTATCAACTTCCAAGAAGCTAATCGCACCGATGCCTACAACGTGTATATGCAGTTTATGCAAGCCAACACCAATGAGATAAGCAAACAGTTGGTAGGCTCAACAATGCTGTCCGACCAAGGTACGAACCGAAGCCAAACCGAAGTACACGAACGTTCGCTCGACTTTAAAATAGCCCAAGCCGACAAACGTTTTATTCAGTTTGTGGTAAATGACCAACTCATCCCCTTATTGCGCCTGCAAGGTTACAAGCTATCGGACGAGGTATTCTTTGAGTTCAAAACAGCCGAGCAGGAAATAAACCTATCTGAGATGTGGAATATCACTAATGGGCTCATTGCTAACGGCTACCAAGTAGAAACCGAATGGATTTCCAAAACCTTTAACATACCTATTGAGAGCGAGGGAAAGCCCCAGCCCCTTAATGAAATTACCGCTTCATTAAGGGGAGCAGACGAGGGCGAACGCTATCCCTTTAGCTGTACCTGCGGGCAACATACGGCTTCACTTGGCAAAACCATACGCACCGTACTGGCAAAACTCACCGATAAGCTGATAAGCAAAGTGTATCACAAAAAAGACACCTTGCCCGAATACGCCCAAATGGTAGTTGCTGAAGGTGTTGCCCTTAGTGAGGCATTGCGCGACAACTTTCCTACCATTAGCCCGTATACGGGTCCAGACCAACTGTGCCTACAGCTAATGGAGTACAACCTCTTTGAGTTTGCAGCAGGCAAAACCGAAAGCCGCCTTGCTTCAATGAAAAAGCTATTAGTAGACGAAAATAACCAAATACGCTCCTTTAGCGACTTTAAAGAGTTGTGCCAAAAAGAGGTAGAGAAGTTCAATAAAAAATGGTTAGAAGCTGAGTACAACCTATCTATAGCTGTAGGGCAAAACTCCGCTCAGTACTTGCGCTTTATGGCAGAGAAAGACACCGTTACCTCCTTTGTAAAATACCAAACCGCAGGCGACGACAAAGTGCGTGAGGCACACAAAGTGCTCAACGGCAAAATATTCAACCTATCCGATAAGGAAGCAATGGATTTGTATCCGCCTAACGGCTATGGCTGCCGTTGCGAAATGGTGCAAGTGTTAGGCGATCAAAAAGGCAAAGTAACCAAGGGTAGAGAAGCTAAAATAATGCTGGAGGGTACAGATAATAAATACAAAGGTTCTCAGTTTGAAATCAATCGTGGCGACCTCAAGCAGGTGTTTACCAAACAGCAGTTTTACAGTGATACTAAGGGACTACCCAAAAAACTCAACGAAATGACTTTTGATAAATATGGACTACCCTCTTGGGAAGCGTTTAAACAGCATTTAAAACCTCTTAAATTGGATAGCACCATTACCGAGAAAAACCTCCACGAGCTATTTAAACCTTTTGAGAAAAATACCTATATGGGCTTTGAAGACTATTTAGGCAGAAAACTTACCTTGCAGAAGAGTGTTTTTGATACCCATACACAAGGGAAATACCTTAATGAAAATGAGCTAAGACACCAGTTATTCCCTTTTGTTAAGGAGATTTTAAAGAATCCCGATGAGGTATGGCATTTTGATTATAAAGGTGATGCTAAGAAGTTCCAACCACGTTACATCAAGTTCTATCAAGATAGAGTACTTATAGTAGATTGCGACTTGAATACAGAACAACAATCACTTACCATTAACACGTGGTACAGTATGAAAGCCCCCGAGAAATTTATTCGCAAAGGGCTAAAAATAAAATAAGAACTCAAAAACTTAGACCGTGTAAACCCTTTGCAGAGCCGTCTCTATTCCCGCATCCCGAAGGTAGCGTTATTAATGCAAGTCTCTGAGTCCCTAATATTGTACCGCAAAAGTATAAACAATTTTTGAAATAACAAAATAACAATGGCAACAACATCAAAATTAACCCTACTTATTGATTTAAGCCAACGCTTGTTTAACAACGGACTTACCCAAATGTCGAACCGCTTTCGCCAGCACGTACAGCAAATGCGCAACAGCTACCGCGATTTTACTAACCAAATACCAATGCTCGGCAATCTTATGGATACCCTCTCCAACAAATGGGTGCTCTTGGGGGCAAGCATAGTAGCGGTAGGTACAGGGCTGGTGCGGGCTACCTCCTTAGCTAACGATTGGCACAAACAAATGGCAGAGATTAACGTAACTGCCGAGCTGAGTAAAGAGGAACTTGGCAAACTATCTAATAAATTGTTGGATATAGGTACTAAAAACGTCGCCCCTCTGGAGGAAGTTCCTAAAGCCTTCTCACGTATCATTTCGGCAGGGCTTGATGTAAACCAATCAATGCAAGCCCTTGAACCCACCTTGCAAGCCGCTAAAGCAGGTTTCACCGATATAGAAACCGTAGCCAGTGCAGGGATTGCCACAATGATGTCGTCAGGAGAGGATATTAACAAAGTGTACGATGTGCTGTTTGCAACCGTAAAAGAGGGGAATGCCGAGTTTAAAGATATAGCCAACTATATGCCTAAGCTCACGCCATTAGCCAAAGGCTTAGGGTACCAACTATCCGAAACAGCAGGTGCCTTTGCCTCCCTTACCACAAAGCTAAGTGCCGAACAGTCTACAACCGCCTTACAGGGTATTATCCGCTCACTTTCTGAAGAGCGTATCGCCTTAGGACAAATGGACAAAAGTGGCAATTGGAAAAGTGGTTTTAAAGCATTGGGTATAGATATACACGACACTACAGGGAAAATAAAACCCTTGGTAGAAATTATAGGAATGCTCAACGACAAAATGGCAGGGCTATCCGACAAACAGCGTATGGAACAATTTGGCAAATTAGGGCTTGACCAAATGAGCACAATGGGCTTCCAAACCCTTATGCAAGATATGGAAGGACTCCAAAAAGCTACTGAAGCAACAGCTAATTCACAAGGTACGCTCGGCAAAGCCTACACCGATTCACTCACCCCTTTAGAACAATGGGGCATTGCACAAAACCAACTCAAAGGCACAATGATAAAAATAGGCGAGGCTATACTACCTATGCTATCCAAAGCTATTGAGTACATCACTCCTCTTTTTGAGTGGATATACAAGAACGTAGATTGGCTTATCCCTGTATTTGGTACATTTGCGGGCGTATTAGGCGTAGTTACTGTAGCCACATGGGCGTGGAATGTAGCCCTCGCTGCAAACCCCATAGGGCTTCTTATTGCAGGCATAGCAGCACTTATTGCTCTCGTAGTAGTAGCAATAAAGAAATTCGACCAATGGGGTGCAGGTATGTTAGCCCTACTTGGTCCCATAGGTTGGCTCATCAATGGTATTAAAACCATTTACGACCATTGGCAAAGTATCAAAAAAGCCTTTACCGATGGCGGTATTTTGGAGGGACTCAAACGCATAGGGCTCGTGTTGTTAGACACTATACTAAAGCCTATCCAACAATTGTTAGAATTGCTTTCTAATATCCCTGGTCTCGAAAGCTTAGCGGGTAAAGGTGCTGACTATATCAAAGAGCTTAGGGAGTCAATGAATACAGTTACCGATGGAGAAAAACAAAAAGAGGAAGAGCCTGAAAAAGAAACCAAACCCGAAAACCCATTCAGCTTTACCAATACCGCAGGAGCCACAGCGGGGGCAACACCAACAACAATGAACGCCAATACCCAATTAGGCTCGCAGGTAAGCAAGGTAACAGGCGATGCTACCCAAACCAAAAACATAACCATTACTTTTGAAGCACTAAGCAAAGGTGATATAAAGGTGAGCAATGCCGAAGGACTGACTTGGCAACAAGTAGAAGAACGTTTTACTGATATGCTCCTCAGAGTAGTGCGAAACGCTGAACTATCATAATATGGACTTACAAGTAAACACCGACTTATTCAACCGACTGCAACGCCTTACACAACGAACCTTCCTACAGCGAATGGTGAATGAAGCAGGCGTAATAGCTGTGAACTTCTCAAAGGACAGATTTCGGTTAAAGAATTGGATAGACAAGACTGCCGAAAAATGGCAAGCACGCAAACGCCCCAATAGGGGTTCATTACTGTTGCGCACAGGTCGCCTGAAAAGGTCTATACGCAAAATAGCTTCAGGCGACTACTATGTGGTAGTAGGAACTGATGTACCCTACGCCCAACTGCACAACGAGGGAGGTACTGTGAATAAAGTAGCACAGGTGAAAGCTCACACTCGCAAAGTAGTGATACGGCAAAGAAGTGTAAATCGCAGGGGAAACGCTACTACACGGGTTATAGGTAGTAAAATAGTGAACGTGCGAGCCCATAATCGCAAAATGAACCTTACAATGCCCAAACGCCAGTTTTTAGGCGAAAGCGAACTACTAATGAGGCGTATTGAAATGCACATAAACCGAGAACTTAACAAAGAATTGCAATGAAAGATTTTTATAACAAACTACACCAAGTATTTGAGCAAGAAGCTACTAAAGATTTATACCGAAGCAAGGGTATTGTACCCATTCAGTACATTGACTTCTACGCAGGGCAAGACTATAACGACAACCTTTTTGAGGCGCATATATTCCCTGCCTTATTAGTGCAATGGCAAATAGCCTACACCGATAACTACGAAGCAGTAGCAACCCTGACTTTTAGGCTGTGTTATGAACAGCTAAGAGACCTTTCCTCTTTAGGACAAAACAAAGTTGAGGGGCTCAAGTTCTTAGACTTTATTAATATTACCGATAGCATACTAAAAACTGTTGAAACACCCAGCACAGGCAAGCTACACCTTATAAATGAAAGCCTTAGCATTGAAGATACGGTAGTAGATGTATTCACCCTTACTTACCAATGCAGTTATTGTGGAAAACAAAAATCCCCACAAACCAAAGGCTTGCGGGGTGATTTTGAAAGAGTGGAGCTAACAGCGAAACTTAAGAGCCGTTTTTAGTTATTTGGAAGTGGGTACATCTTTGATGTTACCCACTTTTTGATTTTCTATGTAGATATTGCCCCAATAGTCTCTTCTTACATTTAAACGGTATTTCACAAAACTCTTTTTGTACTTATCATCTGTAAGAATGCTGCGCATAAACTTCTCTGTCATTTTCTTATAGGCATCGGGTATGGCATCGTCTTCATACACAAATCGGTTTGTTTCACGAGTTACTACTATAAGCCCTGAATCATCTTCCTTTTTGTTTTCTGAACCAGCATATACCTTAAAAACCTCACCTCTTTTTCTAAAAGGCTCATTGTAGCTTTGAGCACTTCCAACTAAAGGCAGTGTTAATAGTGCTATGTAAAATAACTTTTTCATTTTTCTAATGATACTCTTAAAGTGTTTGTTTCGGAGTTGTAAACAGCTTCTGATTTTCCTTCTAATTTAATAGAATAAACCTTAGAGCTTCCTCTTTGAACAGTACCCTCAAAACCTATTTCTTCATTGAACTTGTTAGGTGTAAGATAGATAGGTTCGAAAGAGGTGTTGCTAATATTTTCAAATTCTGTTTTGGAGATGTTAATACCATTTATTTTCCATTTATAATTCTTAGGTTCACCATATTTAGCTATGAGCTCTTCTTTAGAAGATTTACTAAACTCTTGTACAATATCTTTAGTAGAGATTTTAGCAGTATTAGTACAAGCCGATAATACAATTATAAATATTAAAGCTATATTTTTCATATATCATATATTTTTTGGCAAAATTAAGAGTTTTATCCATAATCTCAAAATAAACGATTCGCTTTTTTTAGTTTGCGTTTAAAGTCGTCTAAAGGAGTTTCATTTTTTTGCTCTTGGTAGCGGAATTCGGTGTGTTCGCGTTGGCGTTCATCGTCTATGTATTGTAGGCGTTCTTTGTCGTATTCTCTAAAAAATCTAAGCACTTTATCTATACCTAATCGTTCGTAAAATTCGCCGTATTCACCCGATAGTATGCGTTTGAATATAAATGATATTTCGGTGAGTTTTAAGTAACCGTAATCATTCATTATTTGGCTACTGCAAAGGCTTATTTGGTCTTCGCTCATTGGGCGACTTACGGCTAACATTTCGTTTAGATATACGAGCCATAACATTATATAACTTTCACAAGCTGTTGCGCCATAGTCTCTTCGTATGCTACTAATGGAGGGGGTGGGTAGGTTGATTGCTTCGGCTATGGTTTTGAGTTTGTAGCTGTGCTTCATACAGTTATTGGGTGAATATACCCTCAAGAATCTTTCGTTTGAAATCAGTGCTGTAAGTTTGTTTTGCACTACTGTTACCTCGTTTTGCATTTTGTAGAATTTTGTTGAGTTGTGAATTGATGTATTTTAAATCGGTGTTTCGTTGGTGAAACTCATCCATCTTCTGCCAATTGCCCAATAGGTATTGCCACGTAGAAAGGGCTTCAATGTCGTTGGCTGATACTTGTTGCAGGTAGCTAATGATTTGCTTGAGGGCTTTGCCGTCTGCACCAGTGAACTTAGGAGGGAAGCCGTACAGGCGTTTGTAAAAGGCAAACCATTCGTCTAAGAAGTTTCCGTATAGACTTACAGGGTCTGGCACATCCTCACGGTAGGACACACTGCCGTTCCATTGCTCTTGGTAGCGTTGTATATCTTCCTCTTGTGGAGGTAGGATAGCTCCGAGTTGTTGGTATTGCTGGGTGTTAAGCCCTCCTCTCTTGATTTCTATTTTGCAAAGCTCACCTTTCTTATAGGTGAGCTTTAGGAGGGTATGGGTACGGTGTATGGTTACGGTGTAGGTCATTTTTTTTATATTTTAATTAAGTAAGCGGGCATAAGGATAAAATTGCTAAACTTAAACCCAAATAGGTGAATTTTGCCATCTACGAATATCGCAAAGGAGATGTTAAGGGGCTTGTATCGTGGGAACTCTTCATTAAGTTCTTTAGCTTTATCAATGATGTATTGCTTTATTTTGGTTAATTCATCTGCCCGATATAGTTCTCCGCTCATTCCTCTGAGAAAACAATAAAATTGCTCTTGCAACTCGTTCCTTGTTTGTATACCATCGCTAAAATAGCAGTAATAATGTGTGGGTGTTTCTTTCATTTTAAATAGTTTTTAAAGGGTTATTCTAAATATAGACCTGTGGTTACTTGTTGGGTGTATTTGCCTCCTTCGACTCCATAAAGGATAGAGTATCTTGTTATTTCCTCTTCGGTAAGAGGGCGGGAGGTTTTTTGTTCAGGATGATAGATGCCTGAACGAACTACATACAACTTGAAAAATGCTTCTTGCATTTGTTCGCGGCGTTTGTTTTTAGTTTTTGTAGTTCGACAAAGATTAACTACTGGTAAGCCGTGTTTGCGCCATTGTTGGTTTAGGTGGGGTTTGAAATAGCCGTAGGCACTATCTAACATTACCCAATCTAAGTAGGGCATTTTGATTGCTATTTCTTTTACATTACTTCCTGTAATGCGATAAGCTTGGTACTTTTTATCCTTAAAAAAGTAGTCAATTAGTTGTACAAACAACCATTTATCTAATTCGGAGGCGTACTTAAAGTAATACTCTTTTTCGTCCAAGTTATTAAGCTCGTCTTCTGTTATGTTGTACTTTTTGAGTAGTTTTCTGAGCAACTTTTCAGCTGATTGCTGCTCTCCTGCTACACCTCGTTTTACGAGTTCGTAGACTTTTGCAATTTTTTCTTTTACTTTGTCGTTCATATTGTAATTGTTTTTTAGTAATTTGCTTGATTTTAGGCTTCCCACTGCTCTTTTGTTAATTGCTTGCCACAGTCCTTGCAGAATAAGGCGGTTACTTCTACGGTACAGTAGAGGGCAAGGGTGCGGAGCTCTTTATGCTTGTGGGGGCAACTTTTCAATTTGCTAATGAGCCAATTTGTTAATTTTCTAATTCTTTTCATTGCTTTGCAGTTGTTTTAGTTCTCGTTTTGCGGGTGTGCCTAAGTAGGTGTGAAAGGTACGGTAGCATATATGGTATTTTGGGTGTATGTGCTGCCAAAATATTTCCTTATAAGTGAGCCCTACTTTGTGATACAAGTGCAGGGTAAGCTGTTGTATCTCTATAATTTTTGTTAAAAGATTGATTTTATTGTATGCCATAACTGAAAATTTATTACTTTTGCGGTTGGTATTGGCAAAAGTCCTGTCTACTTTCAGTAGGCGGGATTTTTCTTTTTAGTTCATTGAAGGTTCGGGTATTCCAAAAAAGAAGTGCCATTGCTTGCTTCCTACTTTTAGAGCGGCTTTTTCAAACTTGGTGCGCATACTTTTGAATTGGCGTACTAATTCAGGAAGTTCGTCTAAGGTGTAGTCGCGAAGGAACTTTTTTAAGGGGCTTCGTTCTTTCATAAACTTATTGAAGAGAGCCCAGTTGTTTTGCTTTAGTATGCCCATTGCTTGTGCATCGGCAAGGATGATGGAGCGCAGTCGTTTTACTTCGGTTTCGTTCAGCAAATCGGTAGCGATAGTTTGGTAGTTGGGTGTGAAGCAAAAACGCTGATATAGGGCGTTTAGTTCCTCATCGGTAGCTTCTAAGAGCTGACAGGTGCGGTGTGTTTCGATCCATACGGCTTGTTCTAAGGCTTTAGGGCTTAGTTTTTGGCGTAGGGTTATTATCATATCATCTGTTTGCATATTGTAATTATTTTAAGGTTTGTGCCCTTTTTGAGCCCTACCTCGTGGATAGGGACTCTTTGGGCTTTTGTCCCGCTTAGGGGCTCGAACCCTAATGCCTGCCTGTGCGGGTATCAGCACTTAGATAGCCGAAAATTGTAGCAGGATGTTTTTCCATTGCCCGCGTGAGTCGCGTTCGTAGAAGCGGATATAGTCCTTTGAATGGTTGTACTGATAGCTTTCGCGGAATAGCTCGCAGGCGCGGGTGAAATTCTCATCGGCGAAGGTGTGTTCGTATTGATAGAGCTTCTGAATGTTGTCGTGATCAAGCTCGCCTTTTTTGCGTTCTAAGAGGGATAGGATGAACTCCTTAGTGGCTTTGTCGCCTTGGTAACGGCTTTCAATGAAATCAAAGATGTATTTTTCGGCTTCGGTGGAGCGTTCATCATACGAACCTTTTCCTTGCTTGTTGTACTCCACTTTGAAGTTTTGAAATTCTATTTTGAAGTTGCCTTTACCTTCAGCAAAACGACCGCTGTACTCTTTAAGCAGGTCACCAAGGGTTTCCATAGTTTCAAAGGCGTGTGCTTTGAAGTCTTTGAGCTGGGTGCTGATGTCTTTAGCAATGGTGAGAAGGCTTACTACGGCATCGGCTTTCATTGCTTCGTAGGCTTCACGCTTTTGCTCGCGTTCTTTTTTTTCTAATTCTTGTGCTTGTTTGATTAGGGTAGCACGTTCTTCAGGGCTGAGTGTTGTCAAATCTACTGTCATTTTATTTAGTATTTAGTTATTAGTCTTTAGTTGTTAGTTAGCTGGCTAAAAGCTCGCGTTTGATTACTCGTTTGATACGGCGGAAGCTCTCAACTACTTTGATAGAGTCGCCCCCGATGGTTGCTACTGTGGGTTCGCATTCTTTGAAGAGGCGTTCAAAGAGAGTTTTACTTTTGTCGGTACTGGTTGGGGGGAGGTTTCTGTCTTTGAGTCCATTGCTCTCGCAGATACTTTTAAAGTCTTTAAAGGTAGCGCCTATAAGGTGTATAAACTTGCGCCCGAAGCGGTCGTCGAGTTCGTCAAAGCCGAGTTTGTTGTATTTTACGCCTGCTTTAATAGTTTTTTCGAGGTTATCAGTACCACTAATCACTACGCCCATTTTGTCTTCCATTTCGTTGTACAAGGTGATGAACCAGCGGAGAGCAGAGGGTTTGAGCTTGTCGGCTTCATCAACTATAAGTAGGGGGTGTTTGCCTGTGCGTTGGGCAAAGAATTGTATTACTTTTTGTCCTAATACATCGACGGTAGTGTAGCCACTGTCCTGCTTGATACCTAACACTTTGCAGAGTTCTACAAGAAACTCTCGGCGTGCCCACTCGCGTGCTTGGATGTAGAATACGTTACTGCCTGCGTATAGGTTGGCAAATGTGGTGAGGGCTGTGGTTTTGCCACTTCCTGCTTTATGACTTATGGGAATAAAGAGGGAGGCGTTTTTGGCATCGGATAGCACGCTGAATACCATTCGGTAGTTGGTAGTTTCGGTTATTTGCCACTGCTCTGTGGTGTTGATGTCAAGGGCTTGAGCTACTTTTTGCCATAGTTCGGCTTTGATGAGCTCCCAGTTGTGGTTGAGCATTTGGGAGATAGTTGCGGAGCTTACTTCACATTTAGTAGCTACTTTGTTTTGACTACCGAGACGACTTACTTCGTCTTTAATAGCTTGTAAGATTTGTTCTTTTTGTAAATCAGTCATCTTCTTTTAGTTAAAGGGTAAGAGATAAGAGGTAAGACCTTTTACCACTTACAGGTTTATAAATCAGTTAAAATTGTAATCGTGTGAGGTTAGAAGCGTCTAATTCTTCGGTAAAATATTCACTTCCTACGGCTTTTTTAAGAGGTTCAGCTCCCCTTGCTATATGGTTGCTTTCTTGGCTTTTTAGAAAAGCTTCCTCTGTTGCGTTGTAAGCTTGTTTTTCGCTATACGCACCCATTAGTAGGTTTTCATCAGCAAGAGCGGTAAGACGTTGTAGTTCCGCTTCTTTACGGCGTTGCAATTCATTTTCACGGGCGCGGGCTTCGGATAGCCTGCCGAGCTCGGCTGTAGGACCGTGGCGTTGTATTTGCTCAAAGAGTTGCGCCTCGCAAAGGGGCACTAATAGGTTGCCGTGAGCTTCCCATAGATACACAGTATTGGAACTGAGTACATCAAAGGTCATTACTACTTTTTTGCCTGTATAGTTAGCTATGATGTCAAAATCATCAACGGATAGCTGGTAATAGAACTCTACTTTATCTATTTCGGTGCGTATGAGTCCGTTGTTTTTAAGAGTTATTTCTTTTTTGCGATGGAAAAGCATTGAGATACGTGCTGGATTTACATCAATCACGTTGTTTTTCATTGCTTTCTCGTGTAGTTCTTTAGGTGTTTCGGTTATATTAGCGTATTTGCGCGAGTAGGTGCAATAAGGGGTATTGCGCCAGCCCTCAATAAGGTTTTCAATTTCGTTATAGGCTTTTAGGTAGTCGAAACCTTCTGTTTTAGCTTCTTTTTTTACTTCGGCTAAGAACTCTGCGCTTCGGTGTGCTGAAAGTCTGCGAGATTGAACGCCTTCTCCATAGTAATATTTGTTACCCATAAGGGTTACACTTTGGAATGTACCAAACCAGCGTTCTACTTTTGCTTTGCCGTTGGGTTCGTGGGTGATATTCAGTTGTACACCTAAGGCTTCGAGGCGGGCGAAGAGTTCTTTTATCTCTTCGGTATTATGACCAGGAAAGCGGTCGGCAGTAAGACTATAGGGCAGGTAGCCTGTATATTCTACTGCCATACGGAGGGCACGCATATAGGCTTGTTTGTTCTCAGCATAAGCGATGTCATAGCCTAAGATGTCGCCACTGTGTACATCGCGTACTACTATAGCAAATAGGAAGCGTTCCGCTTTGTTACCTTTCTCATCAACTGTTTGATGCGCAATGAGGTTCACACGGCTGGCATCTATCTCCCAACAGTCGCCTGCACAAACAGCATTTTTAAAGGGTATGTAACCACTGTACATCTGTGCTTTGCGAGAGCCATAGCCGAAGCGTTTTTCAGCGGTGAGATACTTGGTTTTGGGCAGTTCAAAGATATTTTGTCCAAACCAACGGCGTGAGGGTTTTTCCTTACCAAATCGCTCGCATAACTCCCATACATAACGAATAATGAACTCGTTGCTATTGTTGAGCCCCATTGCGCGGAGTTGCATTACCCAGCTGAATACCTCTTGGTCGGTGTATTGTTCAGCATTTTTATTGCCCGTACGGGGTAGGTCTATAAGATCTACAATACAATGGTCGGTAGTGCGGAGTATTTCCACTTTTTCTTTGAGGCGTTGGTAGTTATGTGGTATATATTGGAGTTCCATTTTGCTAAGAATGGGGCTCAAGTCCTTATATAGGGCGTTCTCGGTACCTCCGTAGGTGTCAAGGTTATCCAAACAAAAGTCTAACACGGCACAGGCTTTGGCTAGGGCTACACGGCGGGTAACATCTACCTTTGTGTAATACTCTAAATACTGTGGGTAGGCTTTGTTTAAATAGTGTTTAAACGCTGTTTCGAGGTTTGTTTCTTTGCGTTCTGCCATAGCCTGCTCGTACTGGGTGAGCAGGGTTTGGGCATCTCCAAAGCGGGCACGGTAGTTTTGCGGGGCACGGTTGGGAATGTTGCTCAAGCAGTAGTAAAACTGCCCTTGTGTTTTTGCCCAACGCCACGATTTGCCACTATCGGGCATAAACTCTTTGGCTTTGGCGAGGTCGCAGGGGCGGATGGTCTTTTTATAGTTTACCCTACCTACTTTTCTGAAATAAGATTCCCCTATTTCACAAACCTCTATAACAAGGCGTTCAGAGAGCCACAGCGTTTGTTCGCCTGCTTCTGTTTTACGGATAAGTATGTCGCCTTGTTTGAAATTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP022383|2610996:2647150|2639045_2639564_+|WP_095902108.1|DBSCAN-SWA MKDFYNKLHQVFEQEATKDLYRSKGIVPIQYIDFYAGQDYNDNLFEAHIFPALLVQWQIAYTDNYEAVATLTFRLCYEQLRDLSSLGQNKVEGLKFLDFINITDSILKTVETPSTGKLHLINESLSIEDTVVDVFTLTYQCSYCGKQKSPQTKGLRGDFERVELTAKLKSRF >NZ_CP022383|2610996:2647150|2624348_2624639_-|WP_095902093.1|DBSCAN-SWA MARKDLLLDTVGNLVIEEGDFVIESSDMQHIKHIVEAQKGEFKEFPFMGFGVENYLKTNTNPLAFKRDLKIQLEYDDYKNATIDLSKGYEELKINL >NZ_CP022383|2610996:2647150|2640345_2640807_-|WP_095902111.1|DBSCAN-SWA MKHSYKLKTIAEAINLPTPSISSIRRDYGATACESYIMLWLVYLNEMLAVSRPMSEDQISLCSSQIMNDYGYLKLTEISFIFKRILSGEYGEFYERLGIDKVLRFFREYDKERLQYIDDERQREHTEFRYQEQKNETPLDDFKRKLKKANRLF >NZ_CP022383|2610996:2647150|2643347_2643977_-|WP_095902117.1|DBSCAN-SWA MTVDLTTLSPEERATLIKQAQELEKKEREQKREAYEAMKADAVVSLLTIAKDISTQLKDFKAHAFETMETLGDLLKEYSGRFAEGKGNFKIEFQNFKVEYNKQGKGSYDERSTEAEKYIFDFIESRYQGDKATKEFILSLLERKKGELDHDNIQKLYQYEHTFADENFTRACELFRESYQYNHSKDYIRFYERDSRGQWKNILLQFSAI >NZ_CP022383|2610996:2647150|2616364_2617021_-|WP_095902084.1|tRNA|DBSCAN-SWA MKQKFKVFISGQKYFGQEILSLCITKGYEVVGVCCPLDDKYIGRLAKLHNIPILPAGMLTYDTMPSGVDLGITVHSFDYIGKRTRYKTHLGWIGYHPSLLPRHRGRSAIEWAIRMRDIVAGGSVYWLNAGIDRGDILCQDWCWIPPKLYAKSPKEAAKELWQKELLPMGIRLMDKALTEVANGIYTKTPQRKDVDTFEPSTEVKDIYKPDLLMLPQKQ >NZ_CP022383|2610996:2647150|2619487_2619874_-|WP_095902452.1|DBSCAN-SWA MRKHIIKLFALSYIVPFAGKKRSFTRSANIILPLILIGGLIVCAELYSWLYILLPLLAVACFFGFGYFHFCPLTDKDFPLLDDIQRWQYEAFQRRVTPEPKSYNAQWVLWVNPLAIAITLTILFTLIL >NZ_CP022383|2610996:2647150|2627686_2628916_-|WP_095902099.1|DBSCAN-SWA MSNLKGVVISKGALGANTISTGDNISGLIISAPKPTGLEWDTPTTLYNVKDATKLGITEDNKQVNVLRHITEFYRMAGEGTPLHLMLVAQNSKMPEVCETKAKKLLVYAKGEIRQLAIAINSDSDEQYTMLNGLPQEVYNAIAKAQGLAEWAYNNFMPCQVLLEGYGYGGTASSTANLRELPNLNATKVSVVIGQDYSYAKSKEGKAQKYADVGTVLGVCSKALVQQNIGNNELFNLTDATQGVWIEPALSSYTTIVDAFDDLQTLEDKGYIFGITYAGIAGVRINNDHTCTPVVVDSHHNMNEHSITYGRIMDKASRGLRTAYLPKIKTDWELDEKGKMRPATIVALEDIGDSVLERMFANGEISYGKTTIDKDSDLVVEKVLKISFVVVPKGSIGEIKGTINLKTQA >NZ_CP022383|2610996:2647150|2642768_2643227_-|WP_095902116.1|DBSCAN-SWA MQTDDMIITLRQKLSPKALEQAVWIETHRTCQLLEATDEELNALYQRFCFTPNYQTIATDLLNETEVKRLRSIILADAQAMGILKQNNWALFNKFMKERSPLKKFLRDYTLDELPELVRQFKSMRTKFEKAALKVGSKQWHFFFGIPEPSMN >NZ_CP022383|2610996:2647150|2641377_2641677_-|WP_157909521.1|DBSCAN-SWA MKETPTHYYCYFSDGIQTRNELQEQFYCFLRGMSGELYRADELTKIKQYIIDKAKELNEEFPRYKPLNISFAIFVDGKIHLFGFKFSNFILMPAYLIKI >NZ_CP022383|2610996:2647150|2622424_2623267_-|WP_095902089.1|DBSCAN-SWA MARSIQEIQNLILQAKAQEPALESLNSTSKVAIWRLWVYIIAVAIWGLEKLFDQHRSDIDKRLAELKPHTARWYRSKALAFQYGFDLLPDSDKFNNQGYTEEQIEASKIVKYSAVIESKNEGRLIVKIAGEQGDTLQPITDAQKQSFEAYLQEIKDAGVRLSVVNYQPDILHLQMKIVYDPLVLDSNGQSIIHATKPVEEAIKSYLKRLPFNGELVLAHLIDALQQAEGVKIPHLVLAQSKNITSGGDYGAFETIEISKIPTAGYFTIDNFNDITYISNV >NZ_CP022383|2610996:2647150|2629153_2630074_-|WP_095902101.1|DBSCAN-SWA MPALEDGLWLQQYVEPQLLEDFRNYNDAFVSVLQRPNPSAIDKDGIKFNKLIGNVEFVVNATVDFTPKKTEGKKTFVAWDALDTTPTEYTDEELRAMAFDKESAIRKEHSNMFRIGVRDYAIHKLAPKKHVDGAMPVLRTTGEVVNGRKRLTYNDLSEFLFKHITALNLNNKAAYYLVLSNEHKADLIHDRANTNHYRDLEIDRNTGELKRFFELQIFENTTTPLYGQNGELKSMGAKKVVGDQSSSIFFYAPNTVYHIEGVNVLTKPMRQDTRSKRPTAEVRLHTWGLCDKRQEYGFGALVSANE >NZ_CP022383|2610996:2647150|2628908_2629133_-|WP_095902100.1|DBSCAN-SWA MANKSQLETAKQIFEAEPQLQRLYLNPKGEFFTKIDYAQNSVEDTKKIETLTRKGVLKEETKENVEPLNTEGDE >NZ_CP022383|2610996:2647150|2636552_2638499_+|WP_095902106.1|tail|DBSCAN-SWA MATTSKLTLLIDLSQRLFNNGLTQMSNRFRQHVQQMRNSYRDFTNQIPMLGNLMDTLSNKWVLLGASIVAVGTGLVRATSLANDWHKQMAEINVTAELSKEELGKLSNKLLDIGTKNVAPLEEVPKAFSRIISAGLDVNQSMQALEPTLQAAKAGFTDIETVASAGIATMMSSGEDINKVYDVLFATVKEGNAEFKDIANYMPKLTPLAKGLGYQLSETAGAFASLTTKLSAEQSTTALQGIIRSLSEERIALGQMDKSGNWKSGFKALGIDIHDTTGKIKPLVEIIGMLNDKMAGLSDKQRMEQFGKLGLDQMSTMGFQTLMQDMEGLQKATEATANSQGTLGKAYTDSLTPLEQWGIAQNQLKGTMIKIGEAILPMLSKAIEYITPLFEWIYKNVDWLIPVFGTFAGVLGVVTVATWAWNVALAANPIGLLIAGIAALIALVVVAIKKFDQWGAGMLALLGPIGWLINGIKTIYDHWQSIKKAFTDGGILEGLKRIGLVLLDTILKPIQQLLELLSNIPGLESLAGKGADYIKELRESMNTVTDGEKQKEEEPEKETKPENPFSFTNTAGATAGATPTTMNANTQLGSQVSKVTGDATQTKNITITFEALSKGDIKVSNAEGLTWQQVEERFTDMLLRVVRNAELS >NZ_CP022383|2610996:2647150|2633371_2633794_+|WP_095902104.1|DBSCAN-SWA MFLTTEDYTALIRNEIKDILLENYSEAKLRVAQQMAIDQVKNYLSGRYDVAEIFSKEGTERNAHIVMLTLDCTLYHLYTSTVPKRMPEIRSVRYQDAIDWLKAVGSGEISANLPLIKSQDGQQLLGIKIQSKYAPSSNKW >NZ_CP022383|2610996:2647150|2627276_2627687_-|WP_095902098.1|DBSCAN-SWA MADINRNGKAYDSADVRVQINGIPINVKSISYGNEQEHQLNHTLGAEPTSWSMGKITPSASMTVPMHEIAPLERVSGGLLKIKPFTITVEFVNEFNEIVVDKIVAKFKNEGREVTGDMGLEKQYDLFALSVKLRVA >NZ_CP022383|2610996:2647150|2618518_2619049_-|WP_095902085.1|DBSCAN-SWA MRKLTLLLLAFLALVGCRTRKVTTTEQKQIQKEHFIHYKDSSQLFAYEGRKTDLSHQSDQSFELELESLTDSVGKPRELIYTRIRDGDNEVIRVLNGKVKLRVTSTHSKSLQQADSTLYNNSYTRTKAEVQKHAYTQVKQVSKQLKSSPVRHTLWLLLLAVLVFILWKYKPFRWKI >NZ_CP022383|2610996:2647150|2615570_2616368_-|WP_095902083.1|DBSCAN-SWA MKPVSTTWQRTPISYYGGKQTMLPHILPLIPEHTVYTEAFFGGGAVFWAKPKVKTEIINDFNANVYTFYKVLQTRFDELKTLIERSIISREAYKAALVIYHAPFAFTEVQQAWAFWYATNCGYSNQVGNCRITTNSKNVSALNNKITNFTDTYSARLRGVQIDNNDATEVLTHHDTPDTFHYIDPPYVGAKQGHYGGYEQEHFNELLATLSTLKGKFLLSSYHNNELTHYTQTHGWYQKEVSMHLGSSNSTGKKRIEILTANYPI >NZ_CP022383|2610996:2647150|2645017_2647150_-|WP_157909523.1|integrase,transposase|DBSCAN-SWA MNFKQGDILIRKTEAGEQTLWLSERLVIEVCEIGESYFRKVGRVNYKKTIRPCDLAKAKEFMPDSGKSWRWAKTQGQFYYCLSNIPNRAPQNYRARFGDAQTLLTQYEQAMAERKETNLETAFKHYLNKAYPQYLEYYTKVDVTRRVALAKACAVLDFCLDNLDTYGGTENALYKDLSPILSKMELQYIPHNYQRLKEKVEILRTTDHCIVDLIDLPRTGNKNAEQYTDQEVFSWVMQLRAMGLNNSNEFIIRYVWELCERFGKEKPSRRWFGQNIFELPKTKYLTAEKRFGYGSRKAQMYSGYIPFKNAVCAGDCWEIDASRVNLIAHQTVDEKGNKAERFLFAIVVRDVHSGDILGYDIAYAENKQAYMRALRMAVEYTGYLPYSLTADRFPGHNTEEIKELFARLEALGVQLNITHEPNGKAKVERWFGTFQSVTLMGNKYYYGEGVQSRRLSAHRSAEFLAEVKKEAKTEGFDYLKAYNEIENLIEGWRNTPYCTYSRKYANITETPKELHEKAMKNNVIDVNPARISMLFHRKKEITLKNNGLIRTEIDKVEFYYQLSVDDFDIIANYTGKKVVMTFDVLSSNTVYLWEAHGNLLVPLCEAQLFEQIQRHGPTAELGRLSEARARENELQRRKEAELQRLTALADENLLMGAYSEKQAYNATEEAFLKSQESNHIARGAEPLKKAVGSEYFTEELDASNLTRLQF >NZ_CP022383|2610996:2647150|2639929_2640301_-|WP_095902110.1|DBSCAN-SWA MKNIALIFIIVLSACTNTAKISTKDIVQEFSKSSKEELIAKYGEPKNYKWKINGINISKTEFENISNTSFEPIYLTPNKFNEEIGFEGTVQRGSSKVYSIKLEGKSEAVYNSETNTLRVSLEK >NZ_CP022383|2610996:2647150|2619045_2619480_-|WP_095902086.1|DBSCAN-SWA MKKSTRNIRYLVVHCSATPEGRDHTVKDIDLWHKQRGFNEIGYNYIVRLNGTVEEGRDVNKVPAHVEGHNKDSIGICYIGGIDKNTLQPKDTRTVAQKEALKKLLTELKVLYPQAEILGHRDFPGVAKACPCFNAKDEYKNISK >NZ_CP022383|2610996:2647150|2612576_2613404_+|WP_095902082.1|DBSCAN-SWA MTKQEKLIGIHPDNTDPFGKWLKVNDLPDSGTKCHRELTESTAIDNDLIEWMARKIINHHYTQSRIDKLKQKYKSLGYAKYAAQHRKLPIEDKVKKGNATEILLTDYIQTARGKEFIKFFKLRYNPNVDQAIKGDDVLMVDLFEENGNEKIKIYLGESKFRKAPSKDVVEDIANSLSKDTLPLSYTFLVEEIAKTDEILAEKLDDYIVQDVKDRGDLIYAGLLLSNTDTSRTVERHLNSDNSNLVFISVGIDNPERFIESVFERAEELIANPDLL >NZ_CP022383|2610996:2647150|2623269_2623575_-|WP_095902090.1|DBSCAN-SWA MTITALHNQSLLDLALQHTGTIESVFEFAEANSINITDDVQAGKTLVLPTEVFSNKDILNYYIAKNLQPATAFSKEDEKVAKRLEGISIWAINLDFVVTQQ >NZ_CP022383|2610996:2647150|2624107_2624341_-|WP_095902092.1|DBSCAN-SWA MALNKQALQQGIIRLQQDMQRKTDASMEEYAERLASLIDDFVKSGEVTVAAGISVSTAGTATAQTGATNSTGTGTIS >NZ_CP022383|2610996:2647150|2644014_2644950_-|WP_095902118.1|DBSCAN-SWA MTDLQKEQILQAIKDEVSRLGSQNKVATKCEVSSATISQMLNHNWELIKAELWQKVAQALDINTTEQWQITETTNYRMVFSVLSDAKNASLFIPISHKAGSGKTTALTTFANLYAGSNVFYIQAREWARREFLVELCKVLGIKQDSGYTTVDVLGQKVIQFFAQRTGKHPLLIVDEADKLKPSALRWFITLYNEMEDKMGVVISGTDNLEKTIKAGVKYNKLGFDELDDRFGRKFIHLIGATFKDFKSICESNGLKDRNLPPTSTDKSKTLFERLFKECEPTVATIGGDSIKVVESFRRIKRVIKRELLAS >NZ_CP022383|2610996:2647150|2631330_2631777_+|WP_095902103.1|DBSCAN-SWA MAQKRTNNSTLMELARRMFVEEGMTAKAIAGTIEVTEQTIGKWRKGVGTNAISWDEQRSQYLSAPHNIKKNLAKELTRLVEGGEATLDMGAINSAIKAIQSMTDETSVETVYSVFKEFDSWMSEQDPEMAVAFLEWHKLYLLHKAQNQ >NZ_CP022383|2610996:2647150|2624640_2625129_-|WP_095902094.1|DBSCAN-SWA MEQALTTAITTLNHRKKQVTSVGVVSRIEGNTCEVEREDLPLLLDVRLNAVQGVFENCLNIVPKIGSQVLCLEVEGEPSETCIVGYTEIDSIEVKIDGAVVKIAKGKIQIKNNFANLKQLLSEWLTELKTVVIQTPAGVGNFSPNNVAKFSELESKINQLLE >NZ_CP022383|2610996:2647150|2621973_2622432_-|WP_095902088.1|DBSCAN-SWA MYNLNIDKLLVLLTPTFLRKPKLVAWLRMLATPFHKLLYDFQRAREADLYNLAHNSQVCYLRKALNDEFDDEQRRIRIEDGRQKQRLYIYPRSANKPLYLGKVFLYQRGDYIDGGVDFIVVLPQGLEYDKYKLEALVNFYKLAGKRWEINHN >NZ_CP022383|2610996:2647150|2631807_2633370_+|WP_095902453.1|terminase|DBSCAN-SWA MEKLLQDYDQHCRGVEQKTTSGLDFYEAPSERRKKRLALEKDYTTWFEYMFPQYAEVPCAWFHKKIAKLLIENDVISLLAEIYRSGAKSVHLDLGIPMFLYVTGKLKFMLLVGQTEDKAKKLISDIQSQLTHNQRFIHYYGKKFKFGDWADGDFTTTDGAKFMAMGAGQSPRGLREGNQRPDYIVIDDVDTAQRCKNDELSKKLFDWAWEDLKGTFNEGGKYRRFVVANNNFHKNTLINQLKEEFAIINKKAKEYGFAQTHHIVSVPAVKSLETFEPNWGEKTSAEYWREKYHSTPYRSFMREYMHVHIVEGSIFKNEQIQYKERLRYSQYDALCFYGDLSYKDAGDFKAMLLVGKVGREYHVLLAYVRQTSRNNVARWLYETALHENLLKYNIAYYIEGLFAQDEFISDFDEVGDTYGFYIPVQADKDSKGNKFDRIESMAGYFERGNIFFNKVLQNSPDFVELINQILAFQKGSGAHDDAPDALQSAIAKLNALAILNATPAKTISRKEILKDKQNRY >NZ_CP022383|2610996:2647150|2633885_2636408_+|WP_095902105.1|DBSCAN-SWA MKLLGYQFSLTKTPKNKTNEHSVRGNARTNPDVIQFVQSFKDASRKDIAKWRSALSMALHPETPKNTALYDLIDDLLTDGHLQSQIQMRKMSTLNTDFYLINRKTGEIDEEATFVFQQQWFYEFLSIALDSILFGATLVEFSSFEGEKIRFNTLSRRHVIPVLGRILPDVTKEDYINYRDEYYAPWLLQIGKSDDLGLINNIVPNLIWKRNVAQSWAEFCEKFGMPLITATSNSTNSDVVDKVNQMLLDLGEAGVATFPQGTSINFQEANRTDAYNVYMQFMQANTNEISKQLVGSTMLSDQGTNRSQTEVHERSLDFKIAQADKRFIQFVVNDQLIPLLRLQGYKLSDEVFFEFKTAEQEINLSEMWNITNGLIANGYQVETEWISKTFNIPIESEGKPQPLNEITASLRGADEGERYPFSCTCGQHTASLGKTIRTVLAKLTDKLISKVYHKKDTLPEYAQMVVAEGVALSEALRDNFPTISPYTGPDQLCLQLMEYNLFEFAAGKTESRLASMKKLLVDENNQIRSFSDFKELCQKEVEKFNKKWLEAEYNLSIAVGQNSAQYLRFMAEKDTVTSFVKYQTAGDDKVREAHKVLNGKIFNLSDKEAMDLYPPNGYGCRCEMVQVLGDQKGKVTKGREAKIMLEGTDNKYKGSQFEINRGDLKQVFTKQQFYSDTKGLPKKLNEMTFDKYGLPSWEAFKQHLKPLKLDSTITEKNLHELFKPFEKNTYMGFEDYLGRKLTLQKSVFDTHTQGKYLNENELRHQLFPFVKEILKNPDEVWHFDYKGDAKKFQPRYIKFYQDRVLIVDCDLNTEQQSLTINTWYSMKAPEKFIRKGLKIK >NZ_CP022383|2610996:2647150|2638500_2639049_+|WP_095902107.1|DBSCAN-SWA MDLQVNTDLFNRLQRLTQRTFLQRMVNEAGVIAVNFSKDRFRLKNWIDKTAEKWQARKRPNRGSLLLRTGRLKRSIRKIASGDYYVVVGTDVPYAQLHNEGGTVNKVAQVKAHTRKVVIRQRSVNRRGNATTRVIGSKIVNVRAHNRKMNLTMPKRQFLGESELLMRRIEMHINRELNKELQ >NZ_CP022383|2610996:2647150|2640811_2641372_-|WP_095902112.1|DBSCAN-SWA MTYTVTIHRTHTLLKLTYKKGELCKIEIKRGGLNTQQYQQLGAILPPQEEDIQRYQEQWNGSVSYREDVPDPVSLYGNFLDEWFAFYKRLYGFPPKFTGADGKALKQIISYLQQVSANDIEALSTWQYLLGNWQKMDEFHQRNTDLKYINSQLNKILQNAKRGNSSAKQTYSTDFKRKILEGIFTQ >NZ_CP022383|2610996:2647150|2626146_2626743_-|WP_095902096.1|DBSCAN-SWA MENGQSIVLDLASRYGRALGIVLSSEGMSQVVITKEDNKYQVETFGEATNFEEITMEYENTRLVFNSFIGGEQSTVFAPPPILSFSRSKKLIETETNGSTIVERWNTNEWEITIQGILVDIENHNYPDSQIQQIVTLFEHNDIIKVVGAQFYDKGIDSIYIDSITINPKEGYSDTVAYTLSAKSVKEVTFNLLEGDGK >NZ_CP022383|2610996:2647150|2626934_2627255_-|WP_095902097.1|DBSCAN-SWA MIKKVSEEVKTSLKKEYGDKLKSLILPIDDNGTEELEVLAVVPSRNVVGQYLKYLNQDPKKAQEILVKACLVTNKEEVLADDGLFYASASLIGELIPIRQGKFGTV >NZ_CP022383|2610996:2647150|2610996_2612232_+|WP_095902081.1|integrase|DBSCAN-SWA MKKVKRSTFKVLFYLKKNAPKKNGKVAIMGRITIDNQVAQFSTKLEILPQKWDLKYGRVTGKTEEATQLNRNLEEIRSRIITHYEELMKYEGVVTAQKLKATFLGIGVMEDSLLKVYEKFKEDFALMVEKGVRSYSTLNKYENVYTHLSEFIQYKYRRSDFSFKELTEDFINDFDFYLRVNKSLTHNTIWVYMMPLCKMVEIAIDKGIIYRNPFKNYISSMEEKDRGYLLREEVETLLQYHPKSASIELVRDLFVFSCFTGFSYIDIKQLKKSHLQSFFDGNKWLIKRRQKSDVPCNVRLLDIAEKIIEKYDGTTRTDTLFPVPSNANCNLLIKKMMKDCNIIREKPISFHWARHTFGTLFLTEGVPLESVSKMMGHKNIKTTQIYAKITNEKISKDMEIAAERLKNLKIG >NZ_CP022383|2610996:2647150|2621054_2621915_-|WP_095902087.1|DBSCAN-SWA MNTINTEHNAGYPFDVAFLAFMQNSYRLFNSLGSMAGNKAIISGCEEIGNTITPGTVFINGELFPFEGGAKGDTIIIKEETNEVTFEDGFLRPLENIRTAAFGRSTPEKTYNWEDFQRVTNLQKLGKNKAENKALKELKDEVEKLKKQKQAVPIGLIALWGKPASEIPAGWREYVNLRGRMPIGLDPDYVKKPEDVQDYGLNQILKQGGERSHKLTIEEMPAHNHQQGSESLYNRYGGGGLLGGRNWNSGTYDAYYNQNTSSVGGDQPHNNMPPYRVVQFIEYVGF >NZ_CP022383|2610996:2647150|2623552_2624092_-|WP_095902091.1|holin|DBSCAN-SWA MIKLNYILQGFGFRDSNEFLRSSFGHTFSMLFIKMDVILSLLFATVHFLFGFNHLFLTAYVVLLIFEWITGVQASRKRGEKHESRKFGRMLLKIATYLVPIYILHTFSANVEFPSLGGFEFDPFHWLYWVVLIAIIWQLVVSLLENLDCLGFRFAKVLLKIINKKFYKTFELDDNNSPT >NZ_CP022383|2610996:2647150|2630073_2631183_-|WP_095902102.1|protease|DBSCAN-SWA MIIRTQKNTLTAYGTIWEGDGRYFLEEFARLERDYSEITIHLHTPGGSVFDGNLIYNALNKSASSIHIVIDGIAASMGAIIILSAKKVSIVENGYIMLHAPASYSNGDADSFEKQAKLLRSIEKNFVEKLSARTGKSAKEVEKWLVGDNWFDAKEAKRLGFVTDVIPAQTATLLPIEDVNAMREQDVYNMYAGLFATLKTVNILDKNMKSVLIQSLVQALSLSGITEESSETAVIQAIQERITNEKEAREKAEKALNTFKQAQIITVVEGAVKSGKITEAQKAVYEKIAETSGVEALITVLENTAVAGGKQATQAPNISSLLQSGGSNTGARASWDFDQWQKEDPKGLEKLSVDQPERFKELFNAKYKK >NZ_CP022383|2610996:2647150|2626746_2626896_-|WP_157909520.1|DBSCAN-SWA MYFKVDALISYYLHIPFPEDLDDETWAMKWAQIQWLAEQGILGVKKQDL >NZ_CP022383|2610996:2647150|2639564_2639933_-|WP_095902109.1|DBSCAN-SWA MKKLFYIALLTLPLVGSAQSYNEPFRKRGEVFKVYAGSENKKEDDSGLIVVTRETNRFVYEDDAIPDAYKKMTEKFMRSILTDDKYKKSFVKYRLNVRRDYWGNIYIENQKVGNIKDVPTSK >NZ_CP022383|2610996:2647150|2641697_2642294_-|WP_095902114.1|DBSCAN-SWA MNDKVKEKIAKVYELVKRGVAGEQQSAEKLLRKLLKKYNITEDELNNLDEKEYYFKYASELDKWLFVQLIDYFFKDKKYQAYRITGSNVKEIAIKMPYLDWVMLDSAYGYFKPHLNQQWRKHGLPVVNLCRTTKTKNKRREQMQEAFFKLYVVRSGIYHPEQKTSRPLTEEEITRYSILYGVEGGKYTQQVTTGLYLE >NZ_CP022383|2610996:2647150|2642480_2642693_-|WP_095902115.1|DBSCAN-SWA MAYNKINLLTKIIEIQQLTLHLYHKVGLTYKEIFWQHIHPKYHICYRTFHTYLGTPAKRELKQLQSNEKN >NZ_CP022383|2610996:2647150|2625092_2626157_-|WP_095902095.1|DBSCAN-SWA MGSSYLNINIRITVAGKIQFSAVKQIEIANSIELLTTTAKVELPREFKNTRKDGQSFSIERKNLLELIKVGDSIHIEAGYNGDYFTEFEGYITQIGADIPLLLTCEDEMYQLKNKPLINKTYASVSLKQLLKDIAPDYETEVLDMQLGKLMIERSSPYKVLEELKKQYGVHCSFREKKLIAGLKIDFKSKVIHHFIFDKNFRQSKDLKYKTKNERKVLLKAESSQKGTSKKVSYQYGEEGGGERTLHAPTNLTLEELKAFTEKTYNSSVFDGYEGTLEGFGYPRTQVGDTIALTDPNYPDKHRDGLYLLESVTTLLNAQDGFKRKSKLSMKLSNTNSTDTTELWNKPLQPQLLP >NZ_CP022383|2610996:2647150|2642326_2642494_-|WP_157909522.1|DBSCAN-SWA MKRIRKLTNWLISKLKSCPHKHKELRTLALYCTVEVTALFCKDCGKQLTKEQWEA |
43 | unidentified_phage(25.0%) | transposase,holin,integrase,protease,tRNA,tail,terminase | attL 2639404:2639419|attR 2648877:2648892 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|