Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP031655 | Escherichia coli strain UK_Dog_Liverpool plasmid pCARB35_02, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP031654 | Escherichia coli strain UK_Dog_Liverpool plasmid pCARB35_01, complete sequence | 0 crisprs | NA | 0 | 0 | 1 | 0 |
CP031657 | Escherichia coli strain UK_Dog_Liverpool plasmid pCARB35_04, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP031656 | Escherichia coli strain UK_Dog_Liverpool plasmid pCARB35_03, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP031653 | Escherichia coli strain UK_Dog_Liverpool chromosome, complete genome | 10 crisprs | RT,csa3,PD-DExK,cas5,cas6e,cas1,cas2,cas3,DEDDh,c2c9_V-U4,DinG | 0 | 15 | 9 | 0 |
CP031658 | Escherichia coli strain UK_Dog_Liverpool plasmid pCARB35_05, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
0 : 89325
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP031654|0:89325|DBSCAN-SWA
Protein sequences of DBSCAN-SWA_1 >CP031654|0:89325|49919_50360_+|AXP29294.1|DBSCAN-SWA MATLSDTIKPNKTYLEAVLRTALLGKTEDEYVDFFLSGLRGRLLKNPRLYRSYGPYWPEIKKLLLERGYGNFGRLVDRDVRKIYRYDRPALTLIAATLYSQERFDNGQIYSAWHLLPVPEEVDDQDYEFESYDLEVEALAQAGEKT >CP031654|0:89325|29661_30072_+|AXP29278.1|tail|DBSCAN-SWA MNTSYAVIENGMVVNVIVWDGEAEFTVPDNQQLINISDISEQPGIGWVYSDGGFTAPPTPERSHDELVADAEQKKKSLIDTAMASISLIQLKLQAGRNLTQAETTRLNAVLDYIDAVTATDTSTAPDVNWPAFPEA >CP031654|0:89325|86758_87043_-|AXP29340.1|DBSCAN-SWA MQMELISRKEFDSRVTSGELDNLQAIKVKEGFCLIGNQSGTNRVFMLRRTDLKPFVWKNEIGPSSYAQTRGCLNLAFFYKNELSVVDIQGLQHV >CP031654|0:89325|62295_62517_-|AXP29313.1|DBSCAN-SWA MQSINFRTARGNLSEVLNNVEAGEEVEITRRGREPAVIVSKATFEAYKKAALDAEFASLFDTLDSTNKELVNR >CP031654|0:89325|12900_13935_-|AXP29265.1|DBSCAN-SWA MKTPLVTRNEIVEAIALHTACMPTREIPGAIANYFMITRRFYTRTDKAVINRLLIAEIRDYLIEQGRLRYATVAAEMRKEAHRMTGNNLNVEKPAPVASATPAPALNVIPNTGDTIDSQTLLKMVNEARKLCSEKPVRNNDFIARVKDELEGETYEIFVGQKNGAEIDIITMTYKQALRVAARESKAVRRSLIDKLEELQQANSPTPSIPQTLPEALRLAAELAEQKMQLEQQLVAAAPKVDFADRVSVANGILIGNFAKVVGLKQNALFSWLRQNGILMAFGARKNVPRQQYINAGYFTVKEVVLDDENGYQIRLTPQLTGKGQQWLTRKLLDAGLLKPVAIG >CP031654|0:89325|57356_58841_-|AXP29307.1|terminase|DBSCAN-SWA MARSCVTDPRWRELVALYRYDWIAAADVLFGKTPTWQQDEIIESTQQDGSWTSVTSGHGTGKSDMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFPWLSKYFILTETSFFEVTGKGVWTILIKSCRPGNEEALAGEHADHLLYIIDEASGVSDKAFSVITGALTGKDNRILLLSQPTRPSGYFYDSHHRLAIRPGNPDGLFTAIILNSEESPLVDAKFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVKIAKGWGWVACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFAECNPERFPNITIAIDGDGLGKSTADLMYERYGITVQRIRWGKKMHSREDKSLYFDMRAFANIQAAEAVKSGRMRLDKGAATIEEASKIPVGINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLANYVPQDEVLSVEDEAQVDEALAWLNE >CP031654|0:89325|2631_3633_+|AXP29254.1|DBSCAN-SWA MSKKNRPTIGRTLNPSILSGFDSSSSSGDRVEQVFKLSTGRQATFIEEVIPPNQVESDTFVDQHNNGRDQASLTPKSLKSIRSTIKHQQFYPAIGVRRATGKIEILDGSRRRASAILENVGLRVLVTDQEINVQEAQNLAKDVQTALQHSIREIGLRLMRMKNDGMSQKDIAAKEGLSQAKVTRALQAASAPEELVALFPVQSELTFSDYKTLCAVGDEMGNKNLEFDQLIQNISPEINDILSIEEMAEDEVKNKILRLITKEASLLTDKGSKDKSVVTELWKFEDKDRFARKRVKGRAFSYEFNRLSKELQEELDRMIGHILRKSLDKKPKP >CP031654|0:89325|67494_68757_-|AXP29317.1|DBSCAN-SWA MSTSAQKQSIENVSIPDVLNAGIPAIIQNIRAAQRRVSCDDLTARFFDNAVQSAEMLHAQLIDVYNAEADSHNSLVDAAENMQLDLGLKGKEIEELQLEIEHLKRQQQDAIDDATHDANQRADNAERISIELETKLNEMTAMVELRNSQISTLKSQYKEIMKLDPFNLEKRYNKAKSERQELRKQVADLNQQLKKTIKDASEARVAFANKKAEVTALVNENAKFATLKKEMYGITERRFPASKLHPTLGQISFFPRLLAYGISSPKEFNNERPYIVSKLDFAYQFCCDMGYAIDIRINEWLMPNFQPLAIFREFQPEGWVEFFHELICKEMESRRPELVRRVEWAQEVMLADAELPFEPEFIDDLATKGLHTLFDVVTRRHEQLVVELGLEETAARRLLDVCYARSDAWEKENGGTIYVR >CP031654|0:89325|7224_8040_+|AXP29257.1|DBSCAN-SWA MSKNFFQSGAFLGNGLSRFALNSDPVQLMESARASAEPPTDPVINNNPEPAAQTNDNVPSAPAPEQILEGKDGKEWTVEQAHQMILEAANRSAMQNALSDAADAVFSWADSGDLTFDSLDGFVQAIAGISDDDDSEVTEEQDDAYNEAWANVADFLAACGVDDDLIEALADDEDDDAAADVGASIAGLDSDDRDELEAAFVVAGTSDEMLTEAFKKVVRNGEIKLIRKRLRKKRLTAAQKSALKKARRKAQTGAAKLARKKSMKLRRKRLG >CP031654|0:89325|31595_32198_+|AXP29282.1|DBSCAN-SWA MTKNKYATVDFDQVNEKGLKSLIAAINKTGVTVIEVDSSNRATTKDGVKVKTAKLVLNDGQILAIQVNDTGDISSVKLNGKAIPNAQSPDIKTLGTVMGQAARKNSAKFQKSLIAKAKRVANPVDKKPAVKSNFQRLQEAKQRNAQVVAAYKSAQNSVSFNQQQITDLRAKLDKETGRLNNEKARNGELKRRLKQLKAGN >CP031654|0:89325|24961_29662_+|AXP29277.1|DBSCAN-SWA MNDVTVVTSVTYPSPESLALVADVQYHEPYLSAALNRKFRGIVDPGFYAGFFPKPGGGMNLLITSVDGDKTAGAASVNIGEFYQVTIQQRKDISLALSAGKKYAIVLKGRYLLGEDSYQVNTASHIHAAEFVARTYTDSYQLGDGELLVCTVNIPAGVSAITKEMIDVSDRIDLAIGIEISDSVTSTRSDVAASSLAVKKAYDLAKSKYTAQDASTTQKGLVQLSSATNSDSETMAATPKAVKSVKELADTKAPIESPSLTGTPTAPTAAQGTNSTQIANTAFVKAAITALINGAPGTLDTLKEIAAAINNDPNFSTTINNALALKAPLASPALTGIPTAPTAAQGTNNTQIATTAYVRAAISALVGSSPEALDTLNELAAALGNDPNFATTMTNALAGKQPLDATLTALAGLATGANKLPYFTGTDTVSQTDLTSVGRDILAKTSTLAVIQYLGLREIGTSGEKIPLLSTANTWSSQQTFKGKTAFSAAATFSAGIAGAIEPEKIGDQTVDLNNLTISSDVGAIKYYYCPTFGGGANITNKPDGVNGNFLLRVESTRKVSASDYANMQTLISNDTKRIYVRFVVNGSWAAWSQVVVSGWGQDVSVKSLSAVALSGSLTGNASTATKLQTARTIGGVSFDGSANIDLPGVNKAGNQSTTGNAATATKLQTARTINGVKFDGSANISIPTITSRGRVTALTDTTQGAATGLQMYEAYNNSYPTAYGNVLHMKGASAAGEGELLIGWSGTSGAHAPVFIRSRRDHTDAAWSAWAQVYTSRDSIPGVNATGNQNTTGNAATATKLQTARTIGGVSFDGTANINLPGVNVAGNQNTSGNAATATKLQTARTINGVSFDGSKNIELTPRSIGTINSITMSFSGGAGWFKLATVTMPQASSVVYISLIGSSGYNVNSPMQAGISELVLRAGNGNPKGLTGALWRRTSVGFTNFAWVNTSGDTYDVYVEIGNFATGVNIQWDYTSNASVTIHTSPSYTANKPTGLTDGTVYVIYSSHIKPTATDVGALPITGGNLNGGLTATGEIISKSANGLRIAYGNYGFFIRNDGSNTYFMLTNSGNSLGTYNNLRPLIINNANGTVTIGNGLNVTGGINGSLNGNAATATKLQTARTIGGVSFDGSANIDLPGVNKAGNQSTTGNAATATKLQTARTIGGVSFDGSVNIDLPGVNKTGNQSTTGNAATATKLLTARTINGVSFDGSANISLSPANIGCPASPTGWLKTGNNGESITTAQLVTLLQNNGAFNTKAWFARCAWSYATSASIPDSETGCGIIPLAGAVIEVFSNNTDNYTIRITTATTTSVSGALTNAEFIYVFNVSGSTSYSPGWRRAYNTKNKPTTTDLGLSDESGYVGRLISTRVFTSSGTYIPTPGTKRLRVTITGGGGGGGGCKATSNNETFFGAGGGAGGTIISIMTPTQNSYPVTIGAGGAGGVSATNGTRGGNSVFASLIAPGGAGGGKVGVTNTNGGNGGVPSTGDIRITGGDGGDGQSGNISVSGEGGTSHWGGGGRAGAGGGVIGKAYGSGGGGAYDAGYSGTSMTGGKGASGICIIEEFA >CP031654|0:89325|39471_40029_+|AXP29289.1|DBSCAN-SWA MKGKTAAGGGAICAIAVMITIVMGNGNVRTNQAGLELIGNAEGCRRDPYMCPAGVWTDGIGNTHGVTPGVRKTDQQIAADWEKNILIAERCINQHFRGKDMPDNAFSAMTSAAFNMGCNSLRTYYSKARGMRVETSIHKWAQKGEWVNMCNHLPDFVNSNGVPLRGLKIRREKERQLCLTGLVNE >CP031654|0:89325|38814_39303_-|AXP29288.1|DBSCAN-SWA MAQRGVNKVILIGTLGQDPEIRYIPNGGAVGRLSIATNESWRDKQTGQQKEQTEWHRVVLFGKLAEIASEYLRKGSQVYIEGKLKTRKWTDDAGVERYTTEIIVSQGGTMQMIGARRDDSQSSNGWGQSNQPQNHQQYSGGGKPQSNANNEPPMDFDDDIPF >CP031654|0:89325|40320_41340_-|AXP29291.1|head|DBSCAN-SWA MTDVLKTVTDRFCLYSNARKGRQNGRQYVLSAVKAMLESKETQEGLRLGELFGYYGHGRRQLTGKLEVPETSVIMVEGRPVVIDNVPACRTVAISVDDNGIVTHTQEILNTEPGKIVAAMIESRAGGWSWATGGRESGKIAVTTSFHGVDYVTTPNYISLDHPASAGMFESADSKSLLAESLAAHGYSDESVQAVISHYGKMAELEMMVEATERTAELETALLESQGRHLEAMAKIADAEARIALLEETAGIRDDVLAAMQDELDNLPIFVSAAQKDAFRLKEPGDAKIVATLFESLIKVGARNLPVTKKIKEVPQASNVQAPRETSIISFNNSINPFK >CP031654|0:89325|74361_75090_+|AXP29328.1|tail|DBSCAN-SWA MAGFFDDMFEDTEPSQQVTGDNLPDTESDPDIPGEGSELIEEEDIDAEIETDGVNVGNIVDPVEDNHLPNLDHGLLSDSGVRHRYQGHAVFNNLVRMDWLKAIKLDPDSFDAVLYRAIPYRNKNAPETAPEIIEPNQRIYDYQDPELITALDCPDEMDAFYALYDGSDNTGISDSALILRLAAVNVPVGSMLEWLEQLSDGTTIRRFWYIHKIFNYGTARVGSLFYCVPSRAFEGNFIGDSE >CP031654|0:89325|56098_56194_+|AXP29304.1|DBSCAN-SWA MGETVGQSFRLCVVNTVRHNYGFHSTNHRRR >CP031654|0:89325|60660_61704_-|AXP29310.1|DBSCAN-SWA MKAVITPFVQKELGLATFKVDQEVRKLVEAGRKFIMEPVPRELIEHMEDGLVVTEQTMATNEALQPFFNSDELFRRIGGIDALVAWLRRKEGQCQAADRSWCDNHIVHAERDNSAVLLCWHHDNHYRMRGFNELKETLHNNRVNWILDVARQEMGLSNSHDLSIQELCWWAFMRNMMHLMPEEVCRISINKMKATPQDSGPLKEADIRPYDDRATAYVQMMEERAAPMRAKVCPVDVDSDPGMAHFKIPKLQSLKLPEYMDFVASRPCCGCGAAGAGAHITPYIVRHSRLCAHDIYAIPLCQSCQRDIERDRDNWEKTHGRLAMHQRLFFDYALGVGVITSHSSSVR >CP031654|0:89325|10188_10893_-|AXP29262.1|DBSCAN-SWA MLNRRTFNVFCDESCHLLNDHNKVMVLGALWCPGSITKKIARDIKELKLKHNLKPDFEIKWTKVSASKVEFYLDVVDYFFSNPALRFRGVVVPDKEQLDHARFHQDHNTFYYKMFFYVLKNIIESNNTYNIYLDIKDTLGIEKIEKLRGVLHNDRYDYNHESINRIQHIRSHEVQQLQLTDLFIGALGYVHRGMNSNAGKIQVINRIKSHTNRELLKSTLPTESKFNIFVWEAR >CP031654|0:89325|61731_61911_-|AXP29311.1|DBSCAN-SWA MARKYNKLSREALKMLLDGVSRRKVKQYLVGKQIGARTAIAVLCRQEMVVLKQRMPGSR >CP031654|0:89325|50663_51173_-|AXP29296.1|DBSCAN-SWA MNIYIACALTHVPREIFHEYSNWIHSLAKGLSQNNNVKYALINSDPELSKRPESNKSRLCYIWDRDMVEKSDVIIAECSFPSTGLGIELQIAEQKNIPVIICYKDYGINKTKTIEYVNPDETTHNLQVGEGFISHMVLGLPNILDVILCKDIDNTCRKLKILLDMINHN >CP031654|0:89325|70430_71057_-|AXP29321.1|DBSCAN-SWA MFPKNKFRGSDRVIIVGSGPSAANFVAPRGVPIIAVNGAIDWLNRASYFFTLDPSPDNMRRVGRGRRRRGVCYCMALPDVKEREVRDGILCFRRVAERGTEPKNTNSPEWWAWRWSAHFGLCEDENEIASGNSAYGALNLAFHIGFKHVALVGVDATQELRVHSGGTPKNLSHLPLLFQSAREQIDVVSCGKMGGIPQMTLKEWLKNT >CP031654|0:89325|58840_60034_-|AXP29308.1|terminase|DBSCAN-SWA MTWDDHKKNFARLARDGGYTIAQYAAEFNLNPNTARRYLRAFKEDTRTTDSRKPNKPVRKPLKSMIIDHSNDQHAGDHISAEIAEKQRVNAVVSAAVENAKRQNKRINDRSDDHDVITRAHRTLRDRLERDTLDDDGERFEFEAGDYLIDNVEARKAARAMLRRSGADVLETTLLEKSLSHLLMLENARDTCIRLVQEMRDQQKDDDEGTPPEYRIASMLNSCSAQISSLINTIYSIRNNYRKESREAEKHALSMGQAGIVKLAYERKRENNWSVLEAAEFIEAHGGKVPPLMLEQIKADLRAPKTNTDDEENQTASGAPSLEDLDKIARERAASRRADAALWIEHRREEIADIVDTGGYGDVDAEGISNEAWLEQDLDEDEEEDEEVTRKLYGDDD >CP031654|0:89325|87504_88293_+|AXP29342.1|DBSCAN-SWA MLEKDYQLSAYKKLAAAGGMKTPGAITSARNSANTAKLLAEELTGLILDTIVYPDTITSYVSTIRTTTTGLTNIGELTTKHADLLAGYADLSMLLQLDIGWDVYCRANEREVSELPISIAIGDVNITKSLEDAVNALNTSSLVAAMGEINQTLNTGSGSSSGSGSGGGTATPPPALTEEQIESLKVATEQFGVVFNQTTAPTTALLQQYERANESANVAITAYNHAIGTALAEASANKVSTASAVAALVPDSVLDELNKAAQ >CP031654|0:89325|14772_15285_+|AXP29267.1|DBSCAN-SWA MSKKYTLCALVVSAILLSGCQSSGADYAADVYDTAQLNSKQETKTVNIISVLPAKVKVDNKANKEAAQTFGAVLGAVAGGVAGYNVKGTSTLGAVAGGTGGAALGAAAGSLVSDKTIVEGVSLTYKEGTKVFTSTQVGKACQFTTGLAVLISTKDNETRIQPNATCPEKK >CP031654|0:89325|30490_30772_+|AXP29279.1|DBSCAN-SWA MFDLTIFFITILGGVHSFLNGVREKRYEASCRQLMAECIAAVLAGFIGMYFAEYKGMDESLQNCVTIICSINNKLILEKSQRIIDSYLNRNAS >CP031654|0:89325|43118_49886_+|AXP29293.1|DBSCAN-SWA MNKLSMGVFRCSSVSEILKYIRAITSHRAPIKYGVEKVEGKSYDRLRREANQKAIDLLNSLVDGATLTDEQRQILAGYTGEGGIGGSVSEYYTPKPIAEGVWEIMKLYGADVGNTLEPSAGTGVFNETKPVGTVMTATEISSVSGRINQLLHPEDSVQISPFEQLAVSTPNDSFDHVVGNVPFGGRDNTRNIDKPYAEETDMGSYFMLRMLDKIKPGGFMCVIVPPSIVSGSNMKRLRLRLSRKAEFLGAHRLPTGTFDANGTSTVVDVVLMRKHPAEMAEKIPLVDESTLESANVLWPTFISGKWFEKDGRRFVHGTQEKGFQGRIEVRADGQIDNQALKAKLIHRFESRIDWSLLDMAEPSPTADVVDEGEMRLINGVWQKYAGGRWIEADAGKELKIEVASYGADSWEALQRNLTTTEGRLGMTFTQMANVRDKYTTSISDDMVQLVDWINSQPEKYRERLYRGAMIGRMLIEYQDMKAAGHSAEQIEQQRLSLVSRLQAEIDRFGNPGRGPIAKLSGSGARAWFAFRGAIKLDGTISDELTGKLVTHDSSASYDSTSYQDTLRYLYSDLTRDPIQLDDFRLAFTGELPASDDELLNLLASTPGIAVSPYGGIVPFARATSGDINEIVAPKQEFLATLPDGPVKNNVLNQLAAIEEKRIKTPAENIRFKLNSRWFDRSVILEFLQENGYPDLRYVQSVQLEGDEMVSDTYHGGDGLFVGHRYGVVQRKDKETGEIRYEWDRKSGENATGFPAQLEKYLNGARIGGKDSATANGYREQMALLEDQFNKWIKTHDRYDELVAKYNDVFNSNIPYEHSGDPLGLKGLSGKRQPFDYQNSEVRRLSEDGRGILGFGTGLGKTTTALALEAFNYENGRSTRTAYVVPKSVLENWYYEAKEFLSEEAFSNYLFVGLDVLMDGDQIRQVPVLDENGKPVLGTDGTPVMRDALKLADEATITARMNAIPHSNYRAVVFTKEQYARIPLRDDTVDEHAQDMLYDFVAAGRVASAMDSDSHRKEAARRRVLSEYSDTGTEKAEKYPYFEDMGFDSVIADEGHNYRNSYKNGREASQLAYLPTSAVAQSARDMAIKNAYLMKKNGGRGPVLLTATPVVNTPIDAYNMLSHVLPKEYWQNMGIYGPDDFVKFFGKTRLETVQKISGEVEEKMALMGFENLDALRGIFHRWVTLKTAEDVKDTVEIPELDEHQQDAPLTEEQLAAYEELRQQAEAAAKANNGVTTSVNEDGVIEHEKARPIFSIIRDMDRVCTDMDLYYRRITYRFLPEYADAVQQLADSLPKQATSEDDDSDDSITQQSQYSLIDKGEFIQLQVPEAFEQEVNKRLARFGIDEQTVTHPVTPKYAKLIATLKEFFPEGKQIIFTDEKTQHQKLKRIICNALNLEPSKVGILNAQTVAEAGKTGKKLKAVKPPKELPDEPTDAQIAKYNEQMALYDAYIAQQNEMSLGGLEKIAADFQEGRTPIIICNKKAEVGINLHRGTTDIHHLTLPWTPASIAQRNGRGARVGSNRASVRVHYYCGKGSFDEYRLKTLKRKAGWISDILRSDKSEMENADANDMIEMQMYTAKDDGERLAMMQVQMDKAKAAKRARQKEQATIDLQNYIKAQHAAGEDVEVLTAELERSKAELEKTTAEVAKFKQAAMAKAADNADWKARWGSVHHTDRMLLAQYRASLKSAIQRKANISQAISRYEKLLNRTQKAATDIKRLRPLVEDAINKGILDVDPDLVNHANEFLVIGDRSWRVGQYYDCAGDIVRIKSLDFDSQRADVEIIFTLRGTKSGNWDVKTLDKQVDVTPDEDAVMQKISGGVSIAGINDIISCDDFYRFQQRGMIKITDSYGVQTTESGYSIDFVGTYTDPLKHAVYPDRRDGALKSSIAKWVLGMMSEGNNRQIRSAETFLVELFGSNYGDVIASYGDTLSPEAIQEKIADAIAKMPEKTSQGATRNGDSELEVTNAIFGTHEFRASDYEITTAQFGTIGIYSNKAEIKQAMDAANARIAAEREANLNHAVAALTQSWVTAIREAATTGKITPAIADVVNAGSKFMDAYQMDAVQLPSAYGQLSYRMTYNLVSMFSDLAILGLVDLNEVTPELLSMRKNHVEILQRINTVLAGRTDEEKQADADRINLALGNITEEEIAARNEKQEELSSIQGDATSIAQSLGLNYRVSTADLKMMYAPKFAAGEVFGLQEASGMKGVLFRAKDAIKAKFGARWLPAKAKNSDFPGNWWIIETKHNVADVLAVIQQYA >CP031654|0:89325|61915_62296_-|AXP29312.1|DBSCAN-SWA MRHISPEELIALHDANISRYGGLPGMSDPGRAEAIIGRVQARVAYEEITDLFEVSATYLVATARGHIFNDANKRTALNSALLFLRRNGVQVFDSPELADLTVGAATGEISVSSVADTLRRLYGSAE >CP031654|0:89325|81456_81648_+|AXP29345.1|DBSCAN-SWA MNPLQENALTCYVLRFVELTAGDRAAPDWTYFSVMLMPENQTVMVGAELRHRVVASPAKRALL >CP031654|0:89325|72361_72607_-|AXP29325.1|DBSCAN-SWA MAERVDDAELSMNQLEALKDMAIDNIRKQAQVVSQVFTGKCRYCNEPIESGIYCDAECAQWHREEQAAKQRKYGMRPAGFD >CP031654|0:89325|34115_34481_+|AXP29284.1|DBSCAN-SWA MTLSAIELMDLSDKLDSLMSKAATASGMELLDISDEIDQIMQQMGYGASGGGSGEEKQPSVHDGVPKLVADFLADKFVDQSTDAFIGTLQDLSQYVGTYIDLDQVKQHTAAWIAANIKEAA >CP031654|0:89325|60119_60572_-|AXP29309.1|DBSCAN-SWA MLLNWQGRHFMEINHSRITSYEIADYMIRTKSLLSAKELAAILEKEYPHLDVDKRDVYLRLKAIAVSKYSSVLIDDSTRPRRFQIHSLNPEFFRRSRAPRRFDEKLQNELYMTQDEKERREHQPWVMARQLFNKVARQHRHYGNATSARI >CP031654|0:89325|80179_80395_+|AXP29335.1|DBSCAN-SWA MQTLALHPGVLNVRYNRHTQITPNDMSHHQKIRINAPDSSRLSHRIARMVNPFTRETTNGGLHIPATWFVA >CP031654|0:89325|73140_74358_+|AXP29327.1|tail|DBSCAN-SWA MPTEYARDNLGRYQTDGLSAKDFNKVFDLIRKQQRQNRRNARRTLTPRIMGMRNRELEAFLSLGKKKDGTYFTPEDIRSFNTSRQAHKTKFKSTVPGITYAQLVAQSTSIDIKRANNKVSDGTGIKAATFLGLKHNLALISVNASDESVHQHHRVRIRFEEWDKAVEDIAEDGAKKARIAADLCKGRVSFDCDCGRHQYWYRYMATAGNYAVAPPKEYAFPKIRNPDLTGVACKHVLHAMTRFQSPTWHKAIIIALEKAAEQVAFGDDKRKTTTYFKGELAKSLARNRTTTTDQAKAAREYELYLKSQDALGKKLRAKDSATDNVRRLLKKARTTANRKNAELKASRVREAQARAEADALKKALQTQANNLIKFFMSQGIDKAAATAQARSILETQINEARKRKG >CP031654|0:89325|30839_31169_+|AXP29280.1|holin|DBSCAN-SWA MLDTQELAPVAIAFLLSVIGGIGTFLMDVRDGRQSGNLLGLVTEIFVAVTAGAVAYLLGQHEGWELSITYLMVTIASNNGHEVISGMKRVNIDSILNVLTSLVKKGGGK >CP031654|0:89325|78522_78825_-|AXP29333.1|DBSCAN-SWA MQRKLTKRNKNWLSDMLKKANRNHMYLNDWLSIKGNLSDAKMIDRHVARYGVSLVLEKAELVFSEYYSIPQISSKGKICGYVLKHKSKLDELLVREKETQ >CP031654|0:89325|8668_9178_+|AXP29259.1|plate|DBSCAN-SWA MGHNNTKGNRKFIKGRYTANAAKGERLVSSEFQLTFAGHEDISVLVRTSQIPEMTREDVEDYGPNGVKFNQHGPIRNSGEIQVQCVETIEGDILQFIKDRIAAKDYVDITMAATPESKSSGVNAVTKAATTIEMLDCKIYSDAIDFSTEDVTAAVRPSLRIVYNWIEWD >CP031654|0:89325|50356_50605_+|AXP29295.1|DBSCAN-SWA MKKRYYTVKHGTLRALQEFADKHNVEVRREGGSKALRMYRPDGKWRTVVDFKTNSVPQGVRDRAFEEWEQIIIDNALLLNAD >CP031654|0:89325|15908_16475_+|AXP29269.1|DBSCAN-SWA MTPRQLLEDVKSRFTPLIADEPALLESLLRKALGTYQDRAGHIKRIRFTDQTCKSLACPADFLALVSVTDHTGDLVYSDVYDGNIELEDTHRAVYPLNVSYLANLRDMDLDNGEVPPEIIGLLSDYLEVLIAIPNTDRLRRISIAGKLDASNLSDENTLYQRKLDLEEKMSATRAIIPGIVLFSSMLK >CP031654|0:89325|9261_9558_+|AXP29260.1|DBSCAN-SWA MEIDFSYSPETIERRFEIIGCITISDEHYWVLYDANTWLCALAECQPSLCVGEGAFRHKVLATLEVNTLRYWCVEILSDNKELHLLLLNKCASLRRKA >CP031654|0:89325|3858_5565_+|AXP29255.1|DBSCAN-SWA MSNLREYQNRIADIAKRSKAVLGWASTAQFGTDNQFIKDDAARAASILEAARKDPIFAGISDNATAQIATAWASALADYAATHKSMPRPEILASCHQTLENCLIESTRNSMDATNKAMLESVAAEMMSVSDGVMRLPLFLAMILPVQLGAATADACTFIPVTRDKSEIYEIFNVAGSSFGSYAIGDVLDMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDVEGQNVPIRKGRTNIYVNRIKSVVDNGSGTLLHTFNNKAGEQITVTCSLNYNVGQIALSFSKAPDKGTEIAIEVEINIEAAPELIPLINHEMKSYTLFPNQFVIAAEHTVQAAYEAQREFGLDLGSLQFRTLKEYLSHEQDMLRLRIMIWRTLATDSFDIALPANQSFDVWATIIRGKFQTVYRGIIERIKSSGAMGMYAGADAASFFKQLPKDFFQPAEDYIQTPYVHYIGTLFGNVKVFEVPEGICTNLTADGIQFSPMDVLCYVRDENPGKAGFVTGDAVPAVPFQHPTTPALVNRTTLWGSAINDMHPRNGADYFTRVTLTMAKNGGINFLTGNMIDAGDSE >CP031654|0:89325|56159_56369_+|AXP29305.1|DBSCAN-SWA MAFIPPTIDDVRHCSNALSVDPAETDAARAIAEHYSKISNQEYRITQDDLDDLTDTIEYLMATNQPDSQ >CP031654|0:89325|69058_69760_-|AXP29319.1|DBSCAN-SWA MKIALVLRSGGDYNASDVQWLVNQLPKDYEIICLTDLKCLHVPGVKVIPLINQWQKCRGWWAKIELFRPDITDDLFYLDLDTVIAGDIRLILENPPTSFTMLRDFYHPQYRGSGALWIPNSVKAHIWSSFWQDPEGWISRCVTTECWGDQGFLRKVMGDDTPAFQDLYPGWFVSYKADVVEPGSKYASARYSRGNGALPKDCRIIFFHGKPRPREVSEDWLPLISSFFERESE >CP031654|0:89325|62589_62979_-|AXP29314.1|DBSCAN-SWA MGFPSPAADYVESRISLDQQLIRHPSATYFMRAADSHHREGILQGALLVVDSSLTPVDGSLLVCAMEGEYRIKRYRKYPRQHLEDLSTGKKEALPVDDDGYTGSNAVFGVITHVINDARSGEFDDCPVI >CP031654|0:89325|70954_71617_-|AXP29322.1|DBSCAN-SWA MLAWRLNLQLRMNIPPRATRTVFCVGSGPSLTREDCAAIEKTGCSIIAVNNSWQMFDDIYALYAGDLSWWKQYGSTIPGGRFRKVTANLAAAKSFSLEYRRYCGPAEGVNSGAQAISLAAESGAEVVVLVGYDCSLQNGLHWHGAHPQALRNPTQVSISKWQQQFLDTRKKHADLHILNASRSSAIQCFPRINLEAVIALLSSAVAQAPQTLLRRAECRL >CP031654|0:89325|17111_17993_+|AXP29271.1|DBSCAN-SWA MLLPLFPLPSRPTELIQFRQPNIADAMRFNSITPEEQEQQTTAYLKALLAEPAKHDPLTWTAQDRITALWWIFTGSRETPVETFTYTCKHCGKEHYYDCDMNALAEDIQVLEVEPFIDDIEVSVEGVPYQWRIVPLDGWAMEMLEMRRAALPPEDDAEFKEAIVDLRFWEFAYQCELYNDVSGTREDQAERRYETIKRMAIDTEFMKLAAHIRLAHEKLEHGLPCYIDKGEMRLRLPPHKCPNQDKKESTEGAYTRLWVPFRATDFIPQVGIEKLSDLSVQPGFVWGYTDSGR >CP031654|0:89325|32199_34119_+|AXP29283.1|DBSCAN-SWA MEQFNINKGVTIKPGLDVLPPPVTDDEYRALMAGEDRYLMTESNTLEEIEATFFYDTPIHWCATDLLEAISSTRLQLHRTMQAFVRALNQKLNGTGISAGSDKTGDVAQNGARAIGGAEIGRARNVNGLPVLPAIIPLSDGQTISILFHSPTAENRITNSDTLVAFQFLLNKKDVTHTVAPMSGRDMTLAQVTMKLANLAEKNSAKFQRAQKKKKALVDEITQLQADSDQKEDAMSDLADQVAAVEGQKADLEQKINAVASEADSLYQENERLQAEIDQLNRTGGRDTIAPAGMTGGHSRALTDRLASIKNRMHMDGEATLSNGASMKQFIGDGEGYIQLTDPDGSVYMIKAKSIQGVDMADAIGKLFKAYKAGNVSEYLVQPEEHKPENVEPEPAEDTGSSSPEPEVSVGAYRYALQMRPAAPGAIPEGNKAILPRPDEGDPYYEYARYGIATYDTPLSDQQMSEYDLKLLPREDSFDFLAKTLTNGPFGKYAQKALELATSSPDEFRVMLKTQFQKTFPNIAFPGGAGIEKMVQSMINALQAEVGEITQPEPVPAQPDETVSEADAEANKAIEYLNSVMDMQSTDMAEIRNARGNVREAIAALQAAGRFEENEELVNGAARHLADLLVAIQKAGVAA >CP031654|0:89325|9724_10189_-|AXP29261.1|DBSCAN-SWA MLQMPDLLYFNGSWQEYIDDVYDVVREDILISNITFKGLPVRLRYSPEYDGKEFGFWHLVSEGKKEEERIPDLERCKRIRWIAHMIRNYNHCDISCWSERRGPTEEWVIWNECENYVVVLSARRDYWLLKTAYVVTYDSKIRTLKQSRKRALGT >CP031654|0:89325|53134_53695_-|AXP29299.1|DBSCAN-SWA MKTIEQKIEQHRKWQKAARERAIARQREKLADPAWRESQYQKMRNTIDRRIAKQKERPPASKTRKSAVKIKSRGLKGRTPTAEERRIANALGALPCIACYMHGVISEEVSLHHISGRTAPGCHKKQLPLCRWHHQHAAPAEVREKYPWLVPVHADGVVGGKKEFTLLNKSEMELLADAYEMANIMH >CP031654|0:89325|67067_67262_-|AXP29316.1|DBSCAN-SWA MTPILPYTADDVLELAKIALAPCKDGNAMTLIDFLVKELSKGAGWPDGKDFWVEPPIGRTCIFQ >CP031654|0:89325|13931_14153_-|AXP29266.1|DBSCAN-SWA MVNANPCARQEFIWRFYSCKKHHYHFVIAATEDEARSQLPDGPCIFTARFSTNSRNSLSYWSLPFSADVQGGL >CP031654|0:89325|0_861_+|AXP29252.1|DBSCAN-SWA MNQSFISDILYADIESKAKELTVNSNNTVQPVALMRLGVFVPKPSKSKGESKEIDATKAFSQLEIAKAEGYDDIKITGPRLDMDTDFKTWIGVIYAFSKYGLSSNTIQLSFQEFAKACGFPSKRLDAKLRLTIHESLGRLRNKGIAFKRGKDAKGGYQTGLLKVGRFDADLDLIELEADSKLWELFQLDYRVLLQHHALRALPKKEAAQAIYTFIESLPQNPLPLSFARIRERLALQSAVGEQNRIIKKAIEQLKTIGYLDCSIEKKGRESFVIVHSRNPKLKLPE >CP031654|0:89325|24515_24950_+|AXP29276.1|tail|DBSCAN-SWA MSDVSTNLYKSQLLDYYYQRRAESSINKGSRFLISKAVFGTSSLVTKKGDGTYEIGELPKAFDLAELTSKFCTINLVPTYSGGIITVRMDLDQSQLQKGKNYPFNTLVVLDNENKPIAIICVQEDSLYVGKTYTAVMAINTTTA >CP031654|0:89325|37470_37788_+|AXP29286.1|DBSCAN-SWA MQIKIAAPLGGDAIIEFDDNEEVSGRLSIISGDITEDMIAEAIAGANPNSYMGFVNTLDAPASDVLRTLHLYAGWFVDWPAVEGGDEDDDDDDDDFGDHVDQIVY >CP031654|0:89325|75863_76880_+|AXP29330.1|tail|DBSCAN-SWA MATKTTTAPETDSKRTQLFLQSVSIGQNEIPREMIVGCTYVEPGELSGPQLMLMVRDSTAYVVNKLGVKFGTILTVSLGDPEGHGGILFSEEFFVLKAPRKDDTVLIYAFSNPVRLLKVPSTSAQYFVDKPPSAVVSSLAPGLKVNADSFRKTSTYHLNVGEKPTKVLQEIARDTGSMCWASRGTINFKSMEKMANAAPSLTYESANPNTSGFTISQFNILNADYEYQRRHNYRMASYDMTKGVVYSGNQEDPIKFTSNPDPTALANYNKFILPRLDMLVEGNAALTPGTTLKIVVHNTAGDGELDESIPDKMIVMSVTHFEDRFRFVSRAQLGVVNG >CP031654|0:89325|72753_73131_+|AXP29326.1|DBSCAN-SWA MATSITTTQSTRQYPLSRYDDRNIADPILRAELRKEVMLMCESNDKNLTIYYVLPDEQYRPDLLAYRMWGIAELRWVVTLAAGLEDESQGMTVGKKLKLPPATWIREMIRHFQYDGQVIGTLSIA >CP031654|0:89325|71558_71714_-|AXP29323.1|DBSCAN-SWA MSSKVNYESLASVMPRNEQETDAVVDPVIAEMNARLEAEFAAENEHTTQGD >CP031654|0:89325|11061_11907_-|AXP29263.1|DBSCAN-SWA MLAKVTFLSCITMSDFTFSGYELACFVTHSGLSRSAGHILSQCANLAATTSEYFIHKPHRLIAAETGYSQSTVVRAFREAVNKGILSVEIVIGDHHERRANLYRFTPSFLAFAQQAKNALIESKLKISSAATKVKAVLAKTLALFNFLSTPPCQNDTPSPCQDDVAIKNKKSQIKKTKRSVSGGAGTTRLKNLTSWIAEAKAKADNLRLSKKRAQKHEFKQKVEAASRKYAFLKNKRSPDIGGISNFDNLPHCMTVNEALNAVLAKNKDNEQWGIPAGFRG >CP031654|0:89325|31165_31609_+|AXP29281.1|lysis|DBSCAN-SWA MIGWGVCVLALALADRCLLKRKDITHLELGDVEIKPGFIRVPFKYRSKFPFLRGATVRYWIRDVQKPTTVIEGEQRCLTSAEQGENSEWLYIPTEYMGIGDRLWHFNVMVTHGDSFINPLYRIFPVTQQIRRSYVINLAQDVSDDEK >CP031654|0:89325|23600_24437_+|AXP29275.1|tail|DBSCAN-SWA MQRSWFNNRLTSAKQKSLLYKSLADLVQSMMDTFVDPWLERITNRKSIFSMSKEDLETRTNELGQFFTIRTSNSSSVPMLLQQRLDEIHFKGTERPINQTIYREFNGISVLWDPIYAPVDLERHPYGTVLIPESTLETTGGTFGEMFLTSRGMISIPINDLARTMGITGTIDQSAITEEILRKFNQFVKPLLPLHIVFDGLTLYLSVVVNEQADMITLNEISDTEKAYCWFETSDTTSLTGVTSISAPITATPGGTIVKATPTFDRTRADDLLLDSDA >CP031654|0:89325|11936_12737_-|AXP29264.1|DBSCAN-SWA MQQTFNADMNISNLHQNVDPSTTLPVICGVEITTDCAGRYNLNALHRASGLGAHKAPAQWLRTLSAKQLIEELEKETMQNCIVSFEGRGGGTFAHELLAVEYAGWISPAFRLKVNQTFIDYRAGRLQPAIPQSLPEALRLAADLAEQKQRLEQKMLMDAPKVEFAERVATASGVLIGNYAKVLGLGQNYLFTWLRDNGILIATGERRNVPKQEYISRGYFTLKETVIDTSNGSRISFTTRITGKGQQWLMKRLLDAGVLVPVAATR >CP031654|0:89325|40164_40341_+|AXP29290.1|holin|DBSCAN-SWA MPDIFEHGREIDAAERNRFRLSTPRGAQIYGTQAKSIIVNENPGTRRGYFWWLFKRID >CP031654|0:89325|55966_56080_+|AXP29303.1|DBSCAN-SWA MCLQRCRAAQKEMSKQGNLIQRHYSYALLILHTAQCC >CP031654|0:89325|51172_52213_-|AXP29297.1|DBSCAN-SWA MMNIKPLKIRNKVMKPHKALLNPDLTARNVLTYSHWDFVELWLKRNEKDEALFYWEQAKVFNQAANGLPNQSAPLLHYYSFMNAVKALLSSRNINFKQHHGVARGENTAQIDNLSDIKVKIKNEGILPSFSKYIDGTSHEGVYNIKNLFSGLPYIHRTYCLTYDVTDDIFIPLIDAEFVLNESDNSLFFTAILSKDFRFDTVFDILPPAFELFEREKYKIKSVESLLDFNENQSSNMPQLTEFHQKIRRDLYYINGAETLWYLKRKNNQSEHTSVSISPLVITLAAMHRLSELCRYDPLKLRQLLNEKENWIIAEFIQQSPSQFIDAISSEITGHQFLIPNVRSAS >CP031654|0:89325|1418_2615_+|AXP29253.1|DBSCAN-SWA MSDSSQLHKVAQRANRMLNVLTEQVQLQKDELHANEFYQVYAKAALAKLPLLTRANVDYAVSEMEEKGYVFDKRPAGSSMKYAMSIQNIIDIYEHRGVPKYRDRYSEAYVIFISNLKGGVSKTVSTVSLAHAMRAHPHLLMEDLRILVIDLDPQSSATMFLSHKHSIGIVNATSAQAMLQNVSREELLEEFIVPSVVPGVDVMPASIDDAFIASDWRELCNEHLPGQNIHAVLKENVIDKLKSDYDFILVDSGPHLDAFLKNALASANILFTPLPPATVDFHSSLKYVARLPELVKLISDEGCECQLATNIGFMSKLSNKADHKYCHSLAKEVFGGDMLDVVLPRLDGFERCGESFDTVISANPATYVGSADALKNARIAAEDFAKAVFDRIEFIRSN >CP031654|0:89325|87316_87496_+|AXP29341.1|DBSCAN-SWA MLVDGKQYAQHTDGNSTHGGQAISTRAWFTVNGKGIVCVGDPVSCGSTVASGDGLVQVS >CP031654|0:89325|18074_21815_+|AXP29272.1|DBSCAN-SWA MERKNANIDDIIRTVETASAKELEELAGIREAVEDLKGERVATVDPVSRSVSALNRTIENSRPDFVTNAPSVDPIVDAIKRLNLGDVSRIREDKVTNREQQAAPTAHNPPNRRREAITEDVKAQRLETVKLARDLKGERVATVDPVSRSVSALNRTIENSRPDFVANVPSVDPIVDAMKRLNLGDVSRVVQEGIAQQEQQAKSTTPKGKKRRRKAIPEDIKAQRTEAAEHAREMFDQKGGAQKSQNQRDARGRFIGKLGSKAATEDARAERAEKARRKEDDERLNAESGLLKKLSKVAEGIGNPSETRAVDALGYAVAGPLWAAGKELGGISKEVGGSLNGARKSIADVIRGNDDNSRRKGFFRRKSQNSADVVQVNTQKRTVQELQEQTSEIKEGNDKILSALDQIAKNTGKKKGGLLSKLFSLLGKGAGGVASLLMGRGMLKKAGALAFGALGAKKLVGMLRGGGKKTIAHEGGDLAARAAGKLGLKAVGKGALRAIPLVGTVAGGIYDAVTGWNDTEAQRRAFGLKSGQDPSFQQKAAYTLANVLDMGGLVSGISSAIGEVLKSLGFEDIGNMLQSFSTESIAQAIDSGITNLETYISNLGDTISTKFEDYTAKIGDAVSAWFSDTSNKLLEKLDAIKDFFTVDNLKQVFSDAIDSAIDFIKNPGKHIKEAAGNIWDGVKNLPGKALDAAVDAVKNTPAAMIVSKIPNPIGEANAKEITPELKAPVNSEANAKEIAPELKAPVNSHQETSDSKTESDAKQTNIATRVINAALDTAKDSNKTVKQTANQIINANAVETGNSALQKIDKAIGQNSSSSSSLNTTGTRNDIQKAADTYNNGNLYVKVGSLGAEGKANLDKLAPYFAELENKYGLPEGTLYAIAATESGGNPYAKSQTGALGMFQFTGIAREETGLAEGESFDPVKSAEAAALLMSKYLKQANGDLNEAITAYNAGFGTINKWKKGTGDLSKENREYAIKVNTHRARYLGGEIYTPGAGAQGGAQYGVRGPLPDNAVIDQSTGLAFTPGDSPFEKGGLVDKIGNAVGVNDLVNKFMNGRGMRREVVQGTLEERARGKGTATAAGNVYVDTPMPVEEVRPVASNSSYFDQLGAQMGIDGLFDKLRNSPGMRKNNATEPASTSKVTTAANDLQQPTGRMQIDGQVISDLGGSGAKPTMQLADNTVSLDGETKRLFAQMTSLLARIEEHTKDSAKGQGTVVKVSTPQPGVMRTVPLSIDDPLMNDYARVD >CP031654|0:89325|15288_15828_+|AXP29268.1|DBSCAN-SWA MNKIILFLIFSTFSVGTALANSLQSQIAAIAQAENEGRAKEQQAEDARKELIRQQAQAERIRREKAASAAAAREKQRVAAENERRAKREAELANDKKRDQAYEDELRKLQLESMKLELQAKAARVQRENDFIEQELKERAAKTDVIQSEADANRNISTGSKDLLQSEGKAREKKASSWW >CP031654|0:89325|5625_7215_+|AXP29256.1|DBSCAN-SWA MSQYSIQQSLGNASSVAVSPINADATLSTGVALNSSLWAGIGVFARGKPFTVLAVTESNYEDVLGEPLKPSSGSQFEPIRHVYEAIQQTSGYVVRAVPDDAKFPIIMFDESGEPAYSALPYGSEIELDSGEAFAIYVDDGDPCISPTRELTIETATADSASNERFILKLTQTTSLGVVTTLETHTVSLAEEAKDDMGRLCYLPTALEARSKYLRAVVNEDLISTAKVTNKKSLAFTGGTNGDQSKISTAAYLRAVKVLNNAPYMYTAVLGLGCYDNAAITALGKICADRLIDGFFDVKPTLRYAEALTAVEGTGLLGTDYVSCSVYHYPFSCKDKWTQSRVVFGLSGAAYAAKARGVKKNSDVGGWHYSPAGEERAVIARASIQPLYPEDTPDEEAMVKGRLNKVSVGTSGQMIIDDALTCCTQDNYLHFQHVPSLMNAISRFFVQLARQMKHSPDGITAAGLTKGMTKLLDRFVASGALVAPRDPDADGTEPYVLKVTQAEFDKWEVVWACCPTGVARRIQGVPLLIK >CP031654|0:89325|64315_66172_-|AXP29315.1|DBSCAN-SWA MSNIKYRKDIDGLRAVAILPVLLFHVGYSGFSGGYVGVDIFFVISGYLITKILINDINNGTYSLLTFYERRIRRIIPALTCVILFVLIASPLFLAPDNYSFLPKEIIGTLLFASNIVSFLKSGYFSTDAEQRPLLHTWSLGIEEQFYIISPIVLFLSFKYFKSRVGLILSLFAIVSFAFSVILTKNHPTASFYLIPTRYWELSFGALAAAGVFKKAKGRRQNEVLSILGLLLILFSIFTFTSKTVFPGYAALLPVLGATLIILNAEDTLVGKMLALKPLVFIGVISYSLYLWHWPLVVFSHDKYIIDLNLSREMLVVLSILIAWFSTRFIEAPFRNKQSYDRTRIFKYSSVAYSLLFLTSLAIWPLKGWTDRLSDEKAYILSSTKDYSPVRDKCHFSSGVPETTQYCILGVKDIEPSLFVWGDSHGAEISYALSKLTSVYTATYSACPPVVGFTSTERPECQAHNMRVLDFILNNKKIKNVVLAANYNKYEGDEKYSGFVKGFENTVKKLTDGGKRVTVLDQIPSPGVNVPNDLANAKFIVNKSFAYDDTTFKKIEFENGVNIFHFEKYLCDRDSCSMMYENYPVLFDDNHLSLTVAKIMAPHIYEMISDNRKENERF >CP031654|0:89325|71780_72359_-|AXP29324.1|DBSCAN-SWA MLRFTEEEFQAFSERRNKGRSRPKTKKDPFLSLAPVKEVSPHAKALAALAKNPDLRDGNCEHFEQVFIFDYFERKHPDIYELLHATPNGGKRSKSTAGKMKAEGQKKGYPDMSLDKACGIYHGMRIELKEPNGKAPTKEQIAWMRRLREEGYYVVLAYGAEQAITAILEYISLKKGEAIEHVLNGDKWLYAA >CP031654|0:89325|88932_89325_+|AXP29344.1|DBSCAN-SWA MSDKVTVKQTINKATSIYKIEHITVGKPGSEQYRHAFELADQLGLKHPDCIEHVFPTYADEQCTHVLTEEDFFSTEEREGVDRCIGVICSSVSYELFPNVHEDGGIGYQFLYEGDELKCYEHGLLIESIE >CP031654|0:89325|37817_38612_-|AXP29287.1|DBSCAN-SWA MTAQNTKTIQYRLRNGQSVEVTINNDGVPGEKVSISDLAIEKTIMCHLGFTEEVSKKHGVAIWRTMDTGMRRFITARTPGMTMMDLIQIAPLFECEPLDVFSNPAICQQLYGEMKLAVTPIVLHEGSLAGVWKVERISSYMPFHIHVNGVITGENQPVSVTKSDLNRAILEASCRVIGLGKQSYVSFPAGPEGPAEILIMDADLLWQIQFLIGKSIIRAEELDQYITCTMTDEVKSVAIANARNLCRAALTELQENTTEEVESD >CP031654|0:89325|41332_43042_-|AXP29292.1|portal|DBSCAN-SWA MADNKITLSSVRKALAGVFKDNGERDNILLSALAVHGGSGYLFSRAGAPVQLSGFLGGKPGDSGMAGDGLVDGSRFIFDEVQLPEDRLQRYPLLEEMAVYSTIATALNIHITHALSFDKKTGQTFSIVPVHNGNDSDYDAAQALCGELMNDIGRTINKEVAGWAFIMSVFGVAYVRPYAKEGIGITSFECSYYTLPSFIKEFEVSGNLAGFSGDYLKDASGKMVFADPWAIIPMKIPYWRPKSNLMPVHTGHKAFSLLDNPEERTPIETQNYGTSLLEYAYEPYMNLRSAIRSLKATRFNASKIDRIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANNMPTVTNTLLPIMGDGKGQMTIDTQTIQADINGIEDILTYMRQLAAALGLDYTLLGWADQMSGGLGEGGFLRTAIQAAMRASWIQQGVEEFIQRAIDIHLAFKYGKVYPEGDRPYKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFSDVLEIDEKISEALVNELKAKSEDDDHLMDSIIKTPPQELAQILESVFKEGNEND >CP031654|0:89325|21814_22171_+|AXP29273.1|DBSCAN-SWA MANNNEIDPLLTLELSGVKTYESQEEAWGARLYEWLNTYQGEVYGDPSWGNVLPQFKHEPTNLSHVQIAVEAMLLQKLTVDLPDIPISGLSVAEGDAFDKLKISIRIRDITITQDVVL >CP031654|0:89325|53941_54253_-|AXP29300.1|DBSCAN-SWA MRKNFNIDGKYVVLSVSTNIQSPAVIVTVKLSDRMPDIDSISVAFPVRSMRSAEHFVMNATEEEARRGFAKVMSEFGEFLGHVDKALSISSARSKALTASMMK >CP031654|0:89325|16485_17097_+|AXP29270.1|tail|DBSCAN-SWA MGLNVASVKSYVSSALTTTLFGSGVGEREVGKLTSIIMNKMLFAQGWQFSVEVDGLEGADFFAKDITYHDYSIEYETIKIGGGNILQPTERSPGQITMMVRDTVDGLVLDWFKTAKSRVINPDGTGNIPSKYLLNVRIYRLLSSGLTKLENEMTVFPVTTGDVTYARDQVTEFKSFPMTFALHSTFNQSSSSLASLLGFSFSL >CP031654|0:89325|68758_68977_-|AXP29318.1|DBSCAN-SWA MWPFRRKYHYWLIAFVTPTGGIRHVITRYRNKRLTLARILQAAIGEGLDTNCVVLPPSYLGKMTEAQANTEL >CP031654|0:89325|77551_78526_-|AXP29332.1|DBSCAN-SWA MNILIIGRKFEAISDVKTYTEMWAYNLACAFSEAGVTLQYHRPYSPGVESPEDYVEAVLTAALSCSAKAILAPGLRYFTTVPREIGVQLRRRFTGWVAQVYDGSMLDSAPVDITFTVRDDTWRYLDNPGRLERHNRFNKHVGWAANQELFHLETKTDDVLRIFVDHAAFDVSGFDHSLSILMNLQRLTVPYEAKTLTDDGLVTIEPGNISVTPYRRTPVPATEFAAELRKSDIFIVTHPESLGLTVLEAAMCGALILTPPDCLPPDRLELVNHMVIKSRIDWDEVIARVDRVKNAEKVQCHTWSAIAEKMLETFVTQKPSRGNG >CP031654|0:89325|85860_86766_-|AXP29339.1|DBSCAN-SWA MFKHWKNITIYKLSREADLTDLEDKKKMILFTPCGSQDMAKFGFVSPFGDNSEVIAMHGNGFILVEAKRETKILPPPVIQRAIQEKIEKLEQEQARKLKKTEKDSLKDEVLHSLLPRAFSKFSVIQAIYDGSTKRIYINASARQAEDMLALMRKSLGSLPVVPLSVENPIELTLTDWVRDGSAPQGFQMGDAAELKAVLEDGGIARVKKQDLGSDEISTHLEAGKLVTKLALDWQNRIKFTLDHNFSLTSVKFADELLEQNSDIDSEDVAQRLDADFFLLTSEISCLVDALVNALGGEAKQ >CP031654|0:89325|34493_37481_+|AXP29285.1|DBSCAN-SWA MSLSDQVVMATSIETLIELLKNLPNYGRVSYVVTAKGDEVKTAFDIVDASALLVSNTLDGKINPDYPQELQPRDRTRASNLLQVNQISKDLRPAQLTDSGLSSHGAPIIGEDNAVESGNGRTMGIIKAYQDGNADRYREYLIDHATEFGIRPEKVESMAAPVLVRRRLTKVDRVQFAKDSNISDLQEMAASEKAFVDADSITPAMMALFNPSESGDLLSRSNDAFIRGFMTQVGATQAAGLVTEDGRPTRQLVDRIQNAIFAKAYKDARLVRMVAEEPDPDMRNVLTALNAAANDFVQMQALSGEAHKQAVTTIVDGIETADSLDKKALAALKDAVDLVRQSKESGQHITDVIAQGDMFSETAPEVKALALFIVANNRSAKRMATAFKLMAQRINDELQHQGQALGDMFGGGDVSLQDILRQVSQELENEGMQGISGGLFESVSGGSYNGVAPYTSLLLHRASGIKDIIHLIRLLSRTDPQDEQLVQVLAHFVRMPVADVKKWCRLFGISNSLLRGLLNHASSLGRDGFDEIAQAIKNGDMPPAIDWFSIRPTRVKAFLSAAHTASPLAEMVQRLSLIFTDHTALGDLTLDEMKEASIQWADQQNEVNSDFLPAFRKAVSKADDARGILKAFKALQSRVNKHVGDIDGVTAEGRDILKEHGITPEFIDEIRTDMQREVVSSLQIVARALADANPKSAAIVNQVIGDIEASEGMGALKLFLSRAFNPNGNILPGIIGEAKKYVSEEELEHLDQLLKRFSYNPQTRWQMNQRSMGSVHEKVLSAMNSAIANSSVSEEKALEWADSFITEEVEEARAGQNGGINLRKELADIYRLTGGKISTLSKVVHHQGRAYANLNGVVAVNLNDENARALWHELGHHLEYSNPGLLEKARSFLKANVEGGKLSFVNIGGRGKPEWCFRSRLSNIYMAKVYPPVSVSNSGKIRQKSPTISKTSATEVFSMALQLYHDKEAAAASLMNGDGLLELLLGVAKELNNAD >CP031654|0:89325|82822_85864_-|AXP29338.1|DBSCAN-SWA MKELCYGSVCSGIEAASIAWEPLGMRPVWFAEIESFPSAVLALRWPHVANLGDMTKLAKKVLAGEIESPDVLVGGTPCFTAGHMVLCKNGYKPIEDVCPGDYVVSHLGRLQQVKRVGSKIANTGLLNAVGQPLDIRTTNDHPFLAVRWKAQNTRKNGTYFKRELLSEPEWRAACDMPGYQWCALTNFNIASPDICSRFLSEEQAMYLAGAYVGDGYIRRWRGKSKKAVVFGINCQKLRKFHCRIPENIFSVASEIRGSIKVTLNDTCYANWLNEHFGELSHAKRIPAWVMSHPLRHVFLQGYLDTDGTPSGKAGFRINSVSPALAWGVAGLSQTCGYVSSVSFIEVEPKKVIEDRVVNQRNYYQVTICPQKLSRKSRLAHGMLLRTVKEFKSVGLDTVYNIEVEGDHSYILNGAVVHNCQAFSIAGLRGGLDDERGALTLKYVELANAIDDKRAESFLKPAVIVWENVPGVLSSADNAFGCFLAGLAGEDAPFEPGDRPESGKSNAFWRWDGKTGCHAPKWPQCGCIYGPQRKVAWRILDAQYFGVAQRRRRVFVVASARTDLDPATVLFEFEGVRRNIAPSRGEGKETTRYTSNIAIRSCDDTNIVAMAHGQGGAEIKTDNSAPTLTCNHEAPIVLLGDGRIRRLTPVECERLQGFPDGHTLIPTEKRKKVNSDELAYLHNHYPDLSEEEAAMLAADGPRYKAIGNSMAIPVMRWIGERITKAACRQNEGRETKERKVKPAAEFERSIFKWAGGKFGVLEQIFRYLPEGKRLIEPFVGGGAVFMNAGYQENLLNDVNADLINFYKTLQREAHSLITLAHRFFLDYNTQEGFLAVRNAFNKQVYDDLHRAAAFLFLNRHCFNGLTRYNQAGEFNVGYGKYKTPYFPLQEMEAFLGAEGRSEFVCGDFAAVIEGAGEGDVIFCDPPYEPLPNTEGFTNYSGHDFKFEEQKRLVSLLTDAHRRGAKVLITNSGAPNIRELYQDSGFRVEPLFARRSVSCKGDTRGVAHDVIAILL >CP031654|0:89325|76872_77505_+|AXP29331.1|plate|DBSCAN-SWA MGSLTGKYRAVVVSVDDPKGLMRTQIRVVGMMDGLPDASLPWAEAILSNANTFSPFLPGDKVWVEFPYNGDSRWPLIIGYAQDASGGAPNVPPEASGQGEGYVSPEVEGAPAQPSTSAKKDFISSRNGLMEVRTAGGAWAVTHLKSGTTIGFNEAGELYAISQGPAFISSAGNLDIKSGADVALKAGGSMAIEASGNLSIKAAQVSVDKA >CP031654|0:89325|80825_81260_-|AXP29337.1|DBSCAN-SWA MFGKLFGKKVASAKVELKKVENRDLMEAIIGGCLLVSAADGEIEKEETAKLDQLVRSNPRLSHFGNEITATITRFTEQLEAGFRVGRMNILREIEDIKNDPKEAEEVFVNMLTIAEADGEIEPAEHKVLEEVGRRLGLRVEDYL >CP031654|0:89325|88332_88755_+|AXP29343.1|DBSCAN-SWA MSFFSTLKTALSLKEKLAATGVLVLICALVGAGFAWERHQLKQAMEKIGSLDQAIKERDKSIMDLNQTIETMNKAEQHFHSQEVKNESEQAKYADRQMERKAEVQKQLVAAGNVRQRIPADTQRLLRESISEFNADADKG >CP031654|0:89325|54303_55335_-|AXP29301.1|DBSCAN-SWA MSNLLTVHQNLPALPVDATSDEVRKNLMDMFRDRQAFSEHTWKMLLSVCRSWAAWCKLNNRKWFPAEPEDVRDYLLYLQARGLAVKTIQQHLGQLNMLHRRSGLPRPSDSNAVSLVMRRIRKENVDAGERAKQALAFERTDFDQVRSLIENSDRCQDIRNLAFLGIAYNTLLRIAEIARIRVKDISRTDGGRMLIHIGRTKTLVSTAGVEKALSLGVTKLVERWISVSGVADDPNNYLFCRVRKNGVASPSSTSQLSTRALEGIFEATHRLIYGAKDDSGQRYLAWSGHSARVGAARDMARAGVSIPEIMQAGGWTNVNIVMNYIRNLDSETGAMVRLLEDGD >CP031654|0:89325|56479_57331_+|AXP29306.1|DBSCAN-SWA MINYVYGEQLYQEFVSFRDLFLKKAVARAQHVDAASDGRPVRPVVVLPFKETDSIQAEIDKWTLMARELEQYPDLNIPKTILYPVPNILRGVRKVTTYQTEAVNSVNMTAGRIIHLIDKDIRIQKSAGINEHSAKYIENLEATKELMKQYPEDEKFRMRVHGFSETMLRVHYISSSPNYNDGKSVSYHVPLCGVFICDETLRDGIIINGEFEKAKFSLYDSIEPIICDRWPQAKIYRLADIENVKKQIAITREEKKVKSAASVTRSRKTKKGQPVNSNPESAQ >CP031654|0:89325|8075_8657_+|AXP29258.1|DBSCAN-SWA MAPIPYGVYSQADGVSPFLKVTLTNSQYQVTGYISQGAAMNMAQNWEAPFTGMSMGSVAGAFSGFAQVGTETTSVARWNSLMVWEGGTPPTFTLPVTFIALFDPFTEVSGAIAALSAMISPELKDASIGGRIPERVTLNIGRRINIIDVAIQDISFDLDAPRDSNGHFLKNTVNLQLTGSSIYNSSDIVRAFQ >CP031654|0:89325|69756_70434_-|AXP29320.1|DBSCAN-SWA MMAPTIYHRIDGTKYRNVWVVCDLHGCYTRLMSELHRVDFDPAQDLLISVGDLIDRGTENVECLELLQMPWFRAVMGNHELLMLDALSPDGNVNNWLMNGGQWFFMLDADQEILARALVELVRRLPYIIELNTGQETIVIAHADYPDNEYQFGKEVPLFNVVWARERISDSMDDIGGEISGADRFIFGHTPVKSPKTFWNQHYIDTGAVFCGNLTLMKVKGDGAA >CP031654|0:89325|52303_52945_-|AXP29298.1|DBSCAN-SWA MDKKICVVSMSVGKPASMTAAWINNELIMAERTSYPERRRDMELQLLRELQEKEEKGFIVLVEEENSFITGRVGQRVRLRDPFMNGRPVLIEAMQIYKELERQKAIKLPRKESGKYILHQSIFDSEHDKKGDEFFNINWSEITTEHVLSLLCCFATEYNNVASADYIRAMAGEVEARQEPSLLSPLINIIRGTQCLADKRVPQGLLTGKENYL >CP031654|0:89325|75076_75862_+|AXP29329.1|plate|DBSCAN-SWA MILNNQGWLLAIFKKKGLTPTGKLEFATIDGIDSALAQALNEAFDSQVVSFNDRINQSFREFLKRTPRDRITLGTFSDVKEWLSSFEADRAGRKDTASAGPVNKLAMPLVNLSRSPAFSIYEGELCRDNYDEGHVTNENDEIEALVSTIPFSLEYSLWIASDEKESLGMVTTALAFWLRMYASLGQASFTHTANVGGYEIPVTCYIEGQKSIAFQDLTTGTTDNRLFAVGLNLTVVAELPILAYMQQTTGTITVKAKILEE >CP031654|0:89325|80661_80826_-|AXP29336.1|DBSCAN-SWA MASKARIAIAIGFLLLSVLVDFTSTILSVLSDGALVAVAVTLVWPILKTASKDQ >CP031654|0:89325|55342_55564_-|AXP29302.1|DBSCAN-SWA MGYSAAKVSTHLELEKNRGYWRAKGFDRDSCQLSLSRGEEKIERTRGRWRFYDENHKQVKAEPILYTLLKTII >CP031654|0:89325|22167_23601_+|AXP29274.1|DBSCAN-SWA MSKTTPTKDSIRAEFEELVEKDSFWSKFVGSQFVSMLTLFITQIVYRCFQYADAALAEGFISTATRRSSILAAAETNSYVGTKPTPSSGMIEITATSEDAPAVIPKNMPLISDDQYPYMTMDVCRLVDGTGTVEVAQLEIQEVTYTVTAAKEFLEVVLSKALTAVCYKLEVFVTTDGKTTQWSSSTMFRLAGSKSQVYVEFYKPSEQLGVRFGDGLIGQIPPEGSTITLKVWCTNGDITLVAGQNLTPVDSAANLANLISVKTTTPITAGTDAETTEITRNRAQYYLAYDDQVVWGGDYTYFLVRNIPGLSWVKAWGEGQQEKLDGAYNVQNINKIFISGWHPNKSQSELEEMILTAFKKVPNELNKKFSYKEVRKLPFKITITGRISASLTIENVTDELKSALETKFGRDSNFFDPNGVGKYILIKKKDVWAFIETLGYFRDFYLEFVEWNESNGFYDFVYLDTENSTFNISYEEE >CP031654|0:89325|78824_80189_-|AXP29334.1|DBSCAN-SWA MSASPLESMPNSLSAEQAVLGGLMLDNCRWDEVADRIVADDFYTSAHREIFSEMERLLSHGKPIDLITLAEALEQNGKLERAGGFAYLAEMSKNTPSAANICAYADIVRERAVVREMISVANEIAEAGYAQDGRGSNELLDMAERRVFEIAEKRQKSGSGPKDIASILDATVSRIEELFQRPHDGVTGLDTGFTDLNKKTAGLQPSDLIIVAARPSMGKTTFAMNLVENAAVRNYKPVLVFSLEMPSHQLMMRSLASLARVDQTRIRTGQLNDEDWARVSGAMGILLDKQNIFIDDSSALTPTELRSRARRVYKENGGLSMIMIDYLQLMRVPELQDNRTLEIAEISRSLKALAKELQVPVVALSQLNRSLEQRADKRPVNSDLRESGAIEQDADLIMFLYRDEVYHPDSEMKGIAEVIIGKQRNGPIGTVRLAFNGQYSRFDNYAGADWQEDY |
94 | Escherichia_phage(61.8%) | portal,terminase,head,holin,plate,lysis,tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP031653_1 | 708069-708208 | Orphan |
NA
Consensus repeat of CP031653_1
|
1 spacers
spacers of CP031653_1
>1.1|708118|42|CP031653|CRISPRCasFinder ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT |
CRISPR arrays and Neighbor proteins around CP031653_1
The CRISPR arrays of CP031653_1 >merge|CP031653|1|708069-708208|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGTTTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA >CP031653|1|1|708069-708208|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT TTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA
>CP031653.1|AXP25283.1|706976_708017_+|permease MTGQSSSQAATPIQWWKPALFFLVVIAGLWYVKWEPYYGKAFTAAETHSIGKSILAQADANPWQAALDYAMIYFLAVWKAAVLGVILGSLIQVLIPRDWLLRTLGQSRFRGTLLGTLFSLPGMMCTCCAAPVAAGMRRQQVSMGGALAFWMGNPVLNPATLVFMGFVLSWGFAAIRLVAGLVMVLLIATLVQKWVRETPQTQAPVEIDIPEAQGGFFSRWGRALWTLFWSTIPVYILAVLVLGAARVWLFPHADGTVDNSLMWVVAMAVAGCLFVIPTAAEIPIVQTMMLAGMGTAPALALLMTLPAVSLPSLIMLRKAFPAKALWLTGAMVAVSGVIVGGLALLF >CP031653.1|AXP25282.1|706268_706904_+|NAD-dependent-epimerase/dehydratase-family-protein MSQVLITGATGLVGGHLLRMLINEPKVNAIAAPTRRPLGDMPGVFNPHDPQLTDALAQVTDPIDIVFCCLGTTRREAGSKEAFIHADYTLVVDTALTGRRLGAQHMLVVSAMGANAHSPFFYNRVKGEMEEALIAQNWPKLTIARPSMLLGDRSKQRMNETLFAPLFRLLPGNWKSIDARDVARVMLAESMRPEHEGVTILSSSELRKRAE >CP031653.1|AXP25281.1|705622_706141_-|glutamine-amidotransferase MSKKIAVLITDEFEDSEFTSPADEFRKAGHEVITIEKQAGKTVKGKKGEASVTIDKSIDEVTPAEFDALLLPGGHSPDYLRGDNRFVTFTRDFVNSGKPVFAICHGPQLLISADVIRGRKLTAVKPIIIDVKNAGAEFYDQEVVVDKDQLVTSRTPDDLPAFNREALRLLGA >CP031653.1|AXP25280.1|705199_705643_+|hypothetical-protein METLIAISRWLAKQHVVTWCVQQEGELWCANAFYLFDAQKVAFYILTEEKTRHAQMSGPQAAVAGTVNGQPKTVALIRGVQFKGEIRRLEGEESDLARKAYNRRFPVARMLSAPVWEIRLDEIKFTDNTLGFGKKMIWLRDSGTEQA >CP031653.1|AXP25279.1|704846_705149_-|GIY-YIG-nuclease-family-protein MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAFSAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD >CP031653.1|AXP25278.1|704356_704860_+|N-acetyltransferase MLIRVEIPIDAPGIDALLRRSFESDAEAKLVHDLREDGFLTLGLVATDDEGQVIGYVAFSPVDVQGEDLQWVGMAPLAVDEKYRGQGLARQLVYEGLDSLNEFGYAAVVTLGDPALYSRFGFELAAHHDLRCRWPGTESAFQVHRLADDALNGVTGLVEYHEHFNRF >CP031653.1|AXP25277.1|703838_704363_+|SCP2-domain-containing-protein MLDKLRSRIVHLGPSLLSVPVKLTPFALKRQVLEQVLSWQFRQALDDGELEFLEGRWLSIHVRDIDLQWFTSVVNGKLVVSQNAQADVSFSADASDLLMIAARKQDPDTLFFQRRLVIEGDTELGLYVKNLMDAIELEQMPKALRMMLLQLADFVEAGMKNAPETKQTSVGEPC >CP031653.1|AXP25276.1|702634_703630_-|collagenase MELLCPAGNLPALKAAIENGADAVYIGLKDDTNARHFAGLNFTEKKLQEAVSFVHQHRRKLHIAINTFAHPDGYARWQRAVDMAAQLGADALILADLAMLEYAAERYPHIERHVSVQASATNEEAINFYHRHFDVARVVLPRVLSIHQVKQLARVTPVPLEVFAFGSLCIMSEGRCYLSSYLTGESPNTVGACSPARFVRWQQTPQGLESRLNEVLIDRYQDGENAGYPTLCKGRYLVDGERYHALEEPTSLNTLELLPELMAANIASVKIEGRQRSPAYVSQVAKVWRQAIDRCKADPQNFVPQSAWMETLGSMSEGTQTTLGAYHRKWQ >CP031653.1|AXP29083.1|701747_702626_-|U32-family-peptidase MKYSLGPVLWYWPKETLEEFYQQAATSSADVIYLGEAVCSKRRATKVGDWLEMAKSLAGSGKQIVLSTLALVQASSELGELKRYVENGEFLIEASDLGVVNMCAERKLPFVAGHALNCYNAVTLKILLKQGMMRWCMPVELSRDWLVNLLNQCDELGIRNQFEVEVLSYGHLPLAYSARCFTARSEDRPKDECETCCIKYPNGRNVLSQENQQVFVLNGIQTMSGYVYNLGNELASMQGLVDVVRLSPQGTDTFAMLDAFRANENGAAPLPLTANSDCNGYWRRLAGLELQA >CP031653.1|AXP25275.1|700534_701542_-|LLM-class-flavin-dependent-oxidoreductase MTDKTIAFSLLDLAPIPEGSSAREAFSHSLDLARLAEKRGYHRYWLAEHHNMTGIASAATSVLIGYLAANTTTLHLGSGGVMLPNHSPLVIAEQFGTLNTLYPGRIDLGLGRAPGSDQRTMMALRRHMSGDIDNFPRDVAELVDWFDARDPNPNVRPVPGYGEKIPVWLLGSSLYSAQLAAQLGLPFAFASHFAPDMLFQALHLYRSNFKPSARLEKPYAMVCINIIAADSNRDAEFLFTSMQQAFVKLRRGETGQLPPPIQNMDQFWSPSEQYGVQQALSMSLVGDKAKVRHGLQSILRETDADEIMVNGQIFDHQARLHSFELAMDVKEELLG >CP031653.1|AXP25284.1|708221_708797_-|osmotically-inducible-protein-OsmY MKALSPIAVLISALLLQGCVAAAVVGTAAVGTKAATDPRSVGTQVDDGTLEVRVNSALSKDEQIKKEARINVTAYQGKVLLVGQSPNAELSARAKQIAMGVDGANEVYNEIRQGQPIGLGEASNDTWITTKVRSQLLTSDLVKSSNVKVTTENGEVFLMGLVTEREAKAAADIASRVSGVKRVTTAFTFIK >CP031653.1|AXP25285.1|708806_709397_-|DnaA-initiator-associating-protein-DiaA MQERIKACFTESIQTQIAAAEALPDAISRAAMTLVQSLLNGNKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIANDRLHDEVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVALTGYDGGELAGLLGPQDVEIRIPSHRSARIQEMHMLTVNCLCDLIDNTLFPHQDD >CP031653.1|AXP25286.1|709416_709812_-|YraN-family-protein MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS >CP031653.1|AXP25287.1|709769_711806_-|penicillin-binding-protein-activator MVPSTFSRLKAARCLPVVLAALIFAGCGTHTPDQSTAYMQGTAQADSAFYLQQMQQSSDDTRINWQLLAIRALVKEGKTGQAVELFNQLPQELNDSQRREKTLLAVEIKLAQKDFAGAQNLLAKITPADLEQNQQARYWQAKIDASQGRPSIDLLRALIAQEPLLGAKEKQQNIDATWQALSSMTQEQANTLVINADENILQGWLDLQRVWFDNRNDPDMMKAGIADWQKRYPNNPGAKMLPTQLVNVKAFKPASTNKIALLLPLNGQAAVFGRTIQQGFEAAKNIGTQPVAAQVAAAPAADVAEQPQPQTVDGVASPAQASVSDLTGEQPAAQPVPVSAPATSTAAVSAPANPSAELKIYDTSSQPLSQILSQVQQDGASIVVGPLLKNNVEELLKSNTPLNVLALNQPENIENRVNICYFALSPEDEARDAARHIRDQGKQAPLVLIPRSSLGDRVANAFAQEWQKLGGGTVLQQKFGSTSELRAGVNGGSGIALTGSPITPRATTDSGMTTNNPTLQTTPTDDQFTNNGGRVDAVYIVATPGEIAFIKPMIAMRNGSQSGATLYASSRSAQGTAGPDFRLEMEGLQYSEIPMLAGGNLPLMQQALSAVNNDYSLARMYAMGVDAWSLANHFSQMRQVQGFEINGNTGSLTANPDCVINRKLSWLQYQQGQVVPAS >CP031653.1|AXP25288.1|711870_712731_+|rRNA-(cytidine-2'-O-)-methyltransferase MKQHQSADNSQGQLYIVPTPIGNLADITQRALEVLQAVDLIAAEDTRHTGLLLQHFGINARLFALHDHNEQQKAETLLAKLQEGQNIALVSDAGTPLINDPGYHLVRTCREAGIRVVPLPGPCAAITALSAAGLPSDRFCYEGFLPAKSKGRRDALKAIEAEPRTLIFYESTHRLLDSLEDIVAVLGESRYVVLARELTKTWETIHGAPVGELLAWVKEDENRRKGEMVLIVEGHKAQEEDLPADALRTLALLQAELPLKKAAALAAEIHGVKKNALYKYALEQQG >CP031653.1|AXP25289.1|712773_713865_-|fimbrial-protein MKRAPLITGLLLISTSCAYASSGGCGADSTSGATNYSSVVDDVTVNQTDNVTGREFTSATLSSTNWQYACSCSAGKAVKLVYMVSPVLTTTGHQAGYYKLNDSLDIKTTLKANDIPGLVTDQTVSVNTRFTQIKSNTVYSAATQTGVCQGDTSRYGPVNIGANTTFTLYVTKPFLGSMTIPKTDIAVIKGAWVDGMGSPSTGDFHDLVKLSIQGNLTAPQSCKINQGDVIKVNFGFINGQKFTTRNAMPDGFTPVDFDITYDCGDTSKIKNSLQMRIDGTTGVVDQYNLVARRRSSDNAPDVGIRIENLGGGVANIPFQNGILPVDPSGHGTVNMRAWPVNLVGGELETGKFQGTATITVIVR >CP031653.1|AXP25290.1|713875_716248_-|fimbrial-biogenesis-outer-membrane-usher-protein MLETTKSGMQTTDLSRFSKKYAQLPGTYQVDIWLNKKKVSQKKITFTANAEQLLQPQFTVEQLRELGIKVDEIPALAEKDDDSVINSLEQIIPGTAAEFDFNHQRLNLSIPQIALYRDARGYVSPSRWDDGIPTLFTNYSFTGSDNRYRQGNRSQRQYLNMQNGANFGPWRLRNYSTWTRNDQTSSWNTISSYLQRDIKALKSQLLLGESATSGSIFSSYTFTGVQLASDDNMLPNSQRGFAPTVRGIANSSAIVTIRQNGYVIYQSNVPAGAFEINDLYPSSNSGDLEVTIEESDGTQRRFIQPYSSLPMMQRPGHLKYSATAGRYRADANSDSKEPEFAEATAIYGLNNTFTLYSGLLGSEDYYALGIGIGGTLGALGALSMDINRADTQFDNQHSFHGYQWRTQYIKDIPETNTNIAVSYYRYTNDGYFSFDEANTRNWDYNSRQKSEIQFNISQTIFDGVSLYASGSQQDYWGNNEKNRNISVGVSGQQWGIGYSLNYQYSRYTDQNNDRALSLNLSIPLERWLPRSRVSYQMTSQKDRPTQHEMRLDGSLLDDGRLSYSLEQSLDDDNNHNSSVNASYRSPYGTFSAGYSYGNDSSQYNYGVTGGVVIHPHGVTLSQYLGNAFALIDANGASGVRIQNYPGIATDPFGYAVVPYLTTYQENRLSVDTTQLPDNVDLEQTTQFVVPNRGAMVAARFNANIGYRVLVTVSDRNGKPLPFGALASNDETGQQSIVDEGGILYLSGISSKSQSWTVRWGNQADQQCQFAFSTPDSEPTTSVLQGTAQCH >CP031653.1|AXP25291.1|717159_718020_-|tagatose-1,6-bisphosphate-aldolase MSIISTKYLLQDAQANGYAVPAFNIHNAETIQAILEVCSEMRSPVILAGTPGTFKHIALEEIYALCSAYSTTYNMPLALHLDHHESLDDIRRKVHAGVRSAMIDGSHFPFAENVKLVKSVVDFCHSQDCSVEAELGRLGGVEDDMSVDAESAFLTDPQEAKRFVELTGVDSLAVAIGTAHGLYSKTPKIDFQRLAEIREVVDVPLVLHGASDVPDEFVRRTIELGVTKVNVATELKIAFAGAVKAWFAENPQGNDPRYYMRVGMDAMKEVVRNKINVCGSANRISA >CP031653.1|AXP25292.1|718032_719187_-|AgaS-family-sugar-isomerase MPKNYTPAAAATGTWTEEEIRHQPRAWIRSLTNIDALRSALNNFLEPLLRKENLRIILTGAGTSAFIGDIIAPWLASHTGKNFSAVPTTDLVTNPMDYLNPAHPLLLISFGRSGNSPESVAAVELANQFVPECYHLPITCNEAGALYQNAINSDNAFALLMPAETHDRGFAMTSSITTMMASCLAVFAPETINSQTFRDVADRCQAILTSLGDFSEGVFGYAPWKRIVYLGSGGLQGAARESALKVLELTAGKLAAFYDSPTGFRHGPKSLVDDETLVVVFVSSHPYTRQYDLDLLAELRRDNQAMRVIAIAAESSDIVAASPHIILPPSRHFIDVEQAFCFLMYAQTFALMQSLHMGNTPDTPSASGTVNRVVQGVIIHPWQA >CP031653.1|AXP25293.1|719537_720671_-|N-acetylglucosamine-6-phosphate-deacetylase MTHVLRARRLLTEEGWLDDHQLCIADGVIAAIEPIPVGVTERDAELLCPAYIDTHVHGGAGVDVMDDAPDVLDKLAMHKAREGVGSWLPTTVTAPLNTIHAALKRIAQRCQRGGPGAQVLGSYLEGPYFTPQNKGAHPPELFRELEIAELDQLIAVSQHTLRVVALAPEKEGALQAIRHLKQQNVRVMLGHSAATWQQTRAAFDAGADGLVHCYNGMTGLHHREPGMVGAGLTDKRAWLELIADGHHVHPAAMSLCCCCAKERIVLITDAMQAAGMPDGRYTLCGEKVQMHGGVVRTASGGLAGSTLSVDAAVRNMVELTGVTPAEAIHMASLHPARMLGVDGVLGSLKPGKRASIVALDSGLHVQQIWIQSQLASF |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP031653_2 | 733203-733454 | Orphan |
NA
Consensus repeat of CP031653_2
|
2 spacers
spacers of CP031653_2
>2.1|733257|64|CP031653|PILER-CR AGAACCCGGCTTATCGGTCAGTTTCACCTGATTTACGTAAAAACCCGCTTCGGCGGGTTTTTGC >2.2|733375|59|CP031653|PILER-CR GCGGCTTATCGGTCAGTTTCACCTGGTTTACGTAAAAAACCGCTTCGGCGGGTTTTTGC |
CRISPR arrays and Neighbor proteins around CP031653_2
The CRISPR arrays of CP031653_2 >merge|CP031653|2|733203-733454|PILER-CR AGATGAATGACTGTCCACGACAGAACCCGGCTTATCGGTCAGTTTCACCTGATTTACGTAAAAACCCGCTTCGGCGGGTTTTTGCTTTTGGAGGAGCGGAAAGATGAATGACTGTCCACGACGCTATACCCAAAAGAAAGCGGCTTATCGGTCAGTTTCACCTGGTTTACGTAAAAAACCGCTTCGGCGGGTTTTTGCTTTTGGAGGGGCAGAAAGATGAATGACTGTCCACGACGCTATACCCAAAAGAAA >CP031653|2|1|733203-733454|PILER-CR AGATGAATGACTGTCCACGACAGAACCCGGCTTATCGGTCAGTTTCACCTGATT TACGTAAAAACCCGCTTCGGCGGGTTTTTGCTTTTGGAGGAGCGGAAAGATGAATGACTGTCCA CGACGCTATACCCAAAAGAAAGCGGCTTATCGGTCAGTTTCACCTGGTTTACGT AAAAAACCGCTTCGGCGGGTTTTTGCTTTTGGAGGGGCAGAAAGATGAATGACTGTCCA CGACGCTATACCCAAAAGAAA
>CP031653.1|AXP25306.1|731697_732843_+|glycerate-2-kinase MKIVIAPDSYKESLSASEVAQAIEKGFREIFPDAQYVSIPVADGGEGTVEAMIAATQGSERHAWVTGPLGEKVNASWGISGDGKTAFIEMAAASGLELVPAEKRDPLVTTSRGTGELILQALESGATNIIIGIGGSATNDGGAGMVQALGAKLCDANGNEIGFGGGSLNTLNDIDISGLDPRLKDCVIRVACDVTNPLVGDNGASRIFGPQKGASEAMIVELDNNLSHYADVIKKALHVDVKDVPGAGAAGGMGAALMAFLGAELKSGIEIVTTALNLEEHIHDCTLVITGEGRIDSQSIHGKVPIGVANVAKKYHKPVIGIAGSLTDDVGVVHQHGIDAVFSVLTSIGTLDEAFRGAYDNICRASRNIAATLAIGMRNAG >CP031653.1|AXP29084.1|730710_731601_+|2-hydroxy-3-oxopropionate-reductase MTMKVGFIGLGIMGKPMSKNLLKAGYSLVVADRNPEAIADVIAAGAETASTAKAIAEQCDVIITMLPNSPHVKEVALGENGIIEGAKPGTVLIDMSSIAPLASREISEALKAKGIDMLDAPVSGGEPKAIDGTLSVMVGGDKAIFDKYYDLMKAMAGSVVHTGEIGAGNVTKLANQVIVALNIAAMSEALTLATKAGVNPDLVYQAIRGGLAGSTVLDAKAPMVMDRNFKPGFRIDLHIKDLANALDTSHGVGAQLPLTAAVMEMMQALRADGLGTADHSALACYYEKLAKVEVTR >CP031653.1|AXP25305.1|729910_730681_+|5-keto-4-deoxy-D-glucarate-aldolase MNNDVFPNKFKAALAAKQVQIGCWSALSNPISTEVLGLAGFDWLVLDGEHAPNDISTFIPQLMALKGSASAPVVRVPTNEPVIIKRLLDIGFYNFLIPFVETKEEAEQAVASTRYPPEGIRGVSVSHRANMFGTVADYFAQSNKNITILVQIESQQGVDNIDAIAATEGVDGIFVGPSDLAAALGHLGNASHPDVQKAIQHIFNRASAHGKPSGILAPIEADARRYLEWGATFVAVGSDLGVFRSATQKLADTFKK >CP031653.1|AXP25304.1|728560_729895_+|MFS-transporter MILDTVDVKKKGVHTRYLILLIIFIVTAVNYADRATLSIAGTEVAKELQLSAVSMGYIFSAFGWAYLLMQIPGGWLLDKFGSKKVYTYSLFFWSLFTFLQGFVDMFPLAWAGISMFFMRFMLGFSEAPSFPANARIVAAWFPTKERGTASAIFNSAQYFSLALFSPLLGWLTFAWGWEHVFTVMGVIGFVLTALWIKLIHNPTDHPRMSAEELKFISENGAVVDMDHKKPGSAAASGPKLHYIKQLLSNRMMLGVFFGQYFINTITWFFLTWFPIYLVQEKGMSILKVGLVASIPALCGFAGGVLGGVFSDYLIKRGLSLTLARKLPIVLGMLLASTIILCNYTNNTTLVVMLMALAFFGKGFGALGWSVISDTAPKEIVGLCGGVFNVFGNVASIVTPLVIGYLVSELHSFNAALIFVGCSALMAMVCYLFVVGDIKRMELQK >CP031653.1|AXP25303.1|726614_728186_-|D-galactarate-dehydratase MANIEIRQETPTAFYIKVHDTDNVAIIVNDNGLKAGTRFPDGLELIEHIPQGHKVALLDIPANGEIIRYGEVIGYAVRAIPRGSWIDESMVVLPEAPPLHTLPLATKVPEPLPPLEGYTFEGYRNADGSVGTKNLLGITTSVHCVAGVVDYVVKIIERDLLPKYPNVDGVVGLNHLYGCGVAINAPAAVVPIRTIHNISLNPNFGGEVMVIGLGCEKLQPERLLTGTDDVQAIPVESASIVSLQDEKHVGFQSMVEDILQVAERHLQKLNQRQRETCPASELVVGMQCGGSDAFSGVTANPAVGYASDLLVRCGATVMFSEVTEVRDAIHLLTPRAVNEEVGKRLLEEMEWYDNYLNIGKTDRSANPSPGNKKGGLANVVEKALGSIAKSGKSAIVEVLSPGQRPTKRGLIYAATPASDFVCGTQQVASGITVQVFTTGRGTPYGLMAVPVIKMATRTELANRWFDLMDINAGTIATGEETIEEVGWKLFHFILDVASGKKKTFSDQWGLHNQLAVFNPAPVT >CP031653.1|AXP25302.1|726130_726466_-|antitoxin-PrlF MPANARSHAVLTTESKVTIRGQTTIPAPVREALKLKPGQDSIHYEILPGGQVFMCRLGDEQEDHTMNAFLRFLDADIQNNPQKTRPFNIQQGKKLVAGMDVNIDDEIGDDE >CP031653.1|AXP25301.1|725666_726131_-|type-II-toxin-antitoxin-system-YhaV-family-toxin MDFPQRVNGWALYAHPCFQETYDALVAEVEALKGKDPENYQRKAATKLLAVVHKVIEEHITVNPSSPAFRHGKSLGSGKNKDWSRVKFGAGRYRLFFRYSEKEKVIILGWMNDENTLRTYGKKTDAYTVFSKMLKRGHPPADWESLTQETEENH >CP031653.1|AXP25300.1|724802_725612_+|DeoR-family-transcriptional-regulator MSNTDASGEKRVTGTSERREQIIQRLRQQGSVQVNDLSALYGVSTVTIRNDLAFLEKQGIAVRAYGGALICDSTTPSVEPSVEDKSALNTAMKRSVAKAAVELIQPGHRVILDSGTTTFEIARLMRKHTDVIAMTNGMNVANALLEAEGVELLMTGGHLRRQSQSFYGDQAEQSLQNYHFDMLFLGVDAIDLERGVSTHNEDEARLNRRMCEVAERIIVVTDSSKFNRSSLHKIIDTQRIDMIIVDEGIPADSLEGLRKAGVEVILVGE >CP031653.1|AXP25299.1|724567_724870_-|hypothetical-protein MICSRRSLVPVTRFSPEASVLLIVSPFVKLSFHFVLPINAFLLSKCKPTLPIDASYSRFSDFHYVSFVNQIRKLLSFVLFLSHHDAVSTETKRKINIAVI >CP031653.1|AXP25298.1|723273_724554_-|tagatose-bisphosphate-aldolase-subunit-KbaZ MKHLTEMVRQHKAGKTNAIYAVCSAHPLVLEAAIRYASANQTPLLIEATSNQVDQFGGYTGMTPADFRGFVCQLADSLNFPQDALILGGDHLGPNRWQNLPAAQAMANADDLIKSYVAAGFKKIHLDCSMSCQDDPIPLTDDIVAERAARLAKVAEETCLEHFGEADLEYVIGTEVPVPGGAHETLSELAVTTPDAARATLEAHRHAFEKQGLNAIWPRIIALVVQPGVEFDHTNVIDYQPAKASALSQMVENYETLIFEAHSTDYQTPQSLRQLVIDHFAILKVGPALTFALREALFSLAAIEEELVPAKACSGLRQVLEDVMLDRPEYWQSHYHGDGNARRLARGYSYSDRVRYYWPDSQIDDAFAHLVRNLADSPIPLPLISQYLPLQYVKVRSGELQPTPRELIINHIQDILAQYHTACEGQ >CP031653.1|AXP25307.1|733752_734940_-|YhaC-family-protein MFPVSSIGNDISSDLVRRKMNDLPESPIVNNLEALAPGIEKLKQTSIQMVTLLNALQPGGKCIITGDFQKELAYLQNVILYNDSSLRMDFFGYNALIIQRSDNTCELTINEPLKNQEISTGNINVNFPLKDIYNEIRRLNVVFSCGTGGIVDLSSLDLRNIDLELYDFTDKHMANAILNPFKLDDTDFTNANMFQVNFVSSKQNTTISWDYLLKITPVLTSISDMYSEEKIKLVESCLNELGDITEEQLKIMRFAIIESIPRATLTDQLENELTKEIYKNSSKINNYLNRIKLPEMKGFSSEKIDYYIDIIIKDYESVKENAYLIDPKINYNTDLNIEDSSSEEFLSDNTLEKDENSPDNCFEVVKYNTYEAYNSENLYFTREEYTYDYDLLNAI >CP031653.1|AXP25308.1|734961_735501_-|hypothetical-protein MKGFPIAHIFHPSIPPMHAVVNNHNRNIDYWTVKRKFAEIVSTNDVNKIYSISNELRRVLSAITALNFYQGDVPSVMIRIQPENMSPFIIDISTGEHDDYIIQTLDVGTFAPFGEQCTCSAVNKKELECIKETISKYCAKFTRKEAILTPPAHFNKTSITSDCWQILFFSPDHFNNDFY >CP031653.1|AXP25309.1|735756_736101_-|DNA-binding-transcriptional-activator-TdcR MTGITIFYGDNIIRYVVNIKKGLRPYFKQLPDNYQAKFELNLMSKFSNFIINKPFSAINTAARHIFSRYLLENKHLFYQYFKISNTGIDHLEQLINVNFFSSDRTSFCECNRFP >CP031653.1|AXP25310.1|736289_737228_+|transcriptional-regulator MSTILLPKTQHLVVFQEVIRSGSIGSAAKELGLTQPAVSKIINDIEDYFGVELVVRKNTGVTLTPAGQLLLSRSESITREMKNMVNEISGMSSEAVVEVSFGFPSLIGFTFMSGMINKFKEVFPKAQVSMYEAQLSSFLPAIRDGRLDFAIGTLSAEMKLQDLHVEPLFESEFVLVASKSRTCTGTTTLESLKNEQWVLPQTNMGYYSELLTTLQRNGISIENIVKTDSVVTIYNLVLNADFLTVIPCDMTSPFGSNQFITIPVEETLPVAQYAAVWSKNYRIKKAASVLVELAKEYSSYNGCRRRQLIEVG >CP031653.1|AXP25311.1|737326_738316_+|serine/threonine-dehydratase MHITYDLPVAIDDIIEAKQRLAGRIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSLTDAEKRKGVVACSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVEMEGRIFIPPYDDPKVIAGQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAVAIKSINPTIRVIGVQSENVHGMAASFHSGEITTHRTTGTLADGCDVSRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVVTEGAGALACAALLSGKLDQYIQNRKTVSIISGGNIDLSRVSQITGFVDA >CP031653.1|AXP25312.1|738337_739669_+|threonine/serine-transporter-TdcC MSTSDSIVSSQTKQSSWRKSDTTWTLGLFGTAIGAGVLFFPIRAGFGGLIPILLMLVLAYPIAFYCHRALARLCLSGSNPSGNITETVEEHFGKTGGVVITFLYFFAICPLLWIYGVTITNTFMTFWENQLGFAPLNRGFVALFLLLLMAFVIWFGKDLMVKVMSYLVWPFIASLVLISLSLIPYWNSAVIDQVDLGSLSLTGHDGILITVWLGISIMVFSFNFSPIVSSFVVSKREEYEKDFGRDFTERKCSQIISRASMLMVAVVMFFAFSCLFTLSPANMAEAKAQNIPVLSYLANHFASMTGTKTTFAITLEYAASIIALVAIFKSFFGHYLGTLEGLNGLILKFGYKGDKTKVSLGKLNTISMIFIMGSTWVVAYANPNILDLIEAMGAPIIASLLCLLPMYAIRKAPSLAKYRGRLDNVFVTVIGLLTILNIVYKLF >CP031653.1|AXP25313.1|739694_740903_+|propionate-kinase MNEFPVVLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVNGGEPAPLAHHSYEGALKAIAFELEKRNLNDSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLHNYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTSHRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERAQLAIKTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVEIDTEMNNRSNSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGKVNAPAEFA >CP031653.1|AXP25314.1|740936_743231_+|PFL-like-enzyme-TdcE MKVDIDTSDKLYADAWLGFKGTDWKNEINVRDFIQHNYTPYEGDESFLAEATPATTELWEKVMEGIRIENATHAPVDFDTNIATTITAHDAGYINQPLEKIVGLQTDAPLKRALHPFGGINMIKSSFHAYGREMDSEFEYLFTDLRKTHNQGVFDVYSPDMLRCRKSGVLTGLPDGYGRGRIIGDYRRVALYGISYLVRERELQFADLQSRLEKGEDLEATIRLREELAEHRHALLQIQEMAAKYGFDISRPAQNAQEAVQWLYFAYLAAVKSQNGGAMSLGRTASFLDIYIERDFKAGVLNEQQAQELIDHFIMKIRMVRFLRTPEFDSLFSGDPIWATEVIGGMGLDGRTLVTKNSFRYLHTLHTMGPAPEPNLTILWSEELPIAFKKYAAQVSIVTSSLQYENDDLMRTDFNSDDYAIACCVSPMVIGKQMQFFGARANLAKTLLYAINGGVDEKLKIQVGPKTAPLMDDVLDYDKVMDSLDHFMDWLAVQYISALNIIHYMHDKYSYEASLMALHDRDVYRTMACGIAGLSVATDSLSAIKYARVKPIRDENGLAVDFEIDGEYPQYGNNDERVDSIACDLVERFMKKIKALPTYRNAVPTQSILTITSNVVYGQKTGNTPDGRRAGTPFAPGANPMHGRDRKGAVASLTSVAKLPFTYAKDGISYTFSIVPAALGKEDPVRKTNLVGLLDGYFHHEADVEGGQHLNVNVMNREMLLDAIEHPEKYPNLTIRVSGYAVRFNALTREQQQDVISRTFTQAL >CP031653.1|AXP25315.1|743244_743634_+|enamine/imine-deaminase MKKIIETQRAPGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITDLNDFATINEVYKQFFDEHQATYPTRSYVQVARLPKDVKLEIEAIAVRSA >CP031653.1|AXP25316.1|743705_745070_+|L-serine-ammonia-lyase MISAFDIFKIGIGPSSSHTVGPMNAGKSFIDRLESSGLLTATSHIVVDLYGSLSLTGKGHATDVAIIMGLAGNSPQDVVIDEIPAFIELVTRSGRLPVASGAHIVDFPVAKNIIFHPEMLPRHENGMRITAWKGQEALLSKTYYSVGGGFIVEEEHFGLSHDVETSVPYDFHSAGELLKMCDYNGLSISGLMMHNELALRSKAEIDAGFARIWQVMHDGIERGMNTEGVLPGPLNVPRRAVALRRQLVSSDNISNDPMNVIDWINMYALAVSEENAAGGRVVTAPTNGACGIIPAVLAYYDKFRRPVNERSIARYFLAAGAIGALYKMNASISGAEVGCQGEIGVACSMAAAGLTELLGGSPAQVCNAAEIAMEHNLGLTCDPVAGQVQIPCIERNAINAVKAVNAARMAMRRTSAPRVSLDKVIETMYETGKDMNDKYRETSRGGLAIKVVCG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP031653_3 | 748035-748152 | Orphan |
NA
Consensus repeat of CP031653_3
|
1 spacers
spacers of CP031653_3
>3.1|748075|38|CP031653|CRISPRCasFinder GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA |
CRISPR arrays and Neighbor proteins around CP031653_3
The CRISPR arrays of CP031653_3 >merge|CP031653|3|748035-748152|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGGGTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGATGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG >CP031653|3|2|748035-748152|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGG GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA TGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG
>CP031653.1|AXP25318.1|746703_748014_+|serine-dehydratase-subunit-alpha-family-protein MFDSTLNPLWQRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPGTGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFSRAKVWNGEKWACVTIVGGHTNIVHIETHNGVVFTQQACVAEGEQESPLTVLSRTTLAEILKFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVIRTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAIYIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEIMASKAR >CP031653.1|AXP25317.1|745344_746676_+|transporter MEIASNKGVIADASTPAGRAGMSESEWREAIKFDSTDTGWVIMSIGMAIGAGIVFLPVQVGLMGLWVFLLSSVIGYPAMYLFQRLFINTLAESPECKDYPSVISGYLGKNWGILLGALYFVMLVIWMFVYSTAITNDSASYLHTFGVTEGLLSDSPFYGLVLICILVAISSRGEKLLFKISTGMVLTKLLVVAALGVSMVGMWHLYNVGSLPPLGLLVKNAIITLPFTLTSILFIQTLSPMVISYRSREKSIEVARHKALRAMNIAFGILFVTVFFYAVSFTLAMGHDEAVKAYEQNISALAIAAQFISGDGAAWVKVVSVILNIFAVMTAFFGVYLGFREATQGIVMNILRRKMPAEKINENLVQRGIMIFAILLAWSAIVLNAPVLSFTSICSPIFGMVGCLIPAWLVYKVPALHKYKGMSLYLIIVTGLLLCVSPFLAFS >CP031653.1|AXP25316.1|743705_745070_+|L-serine-ammonia-lyase MISAFDIFKIGIGPSSSHTVGPMNAGKSFIDRLESSGLLTATSHIVVDLYGSLSLTGKGHATDVAIIMGLAGNSPQDVVIDEIPAFIELVTRSGRLPVASGAHIVDFPVAKNIIFHPEMLPRHENGMRITAWKGQEALLSKTYYSVGGGFIVEEEHFGLSHDVETSVPYDFHSAGELLKMCDYNGLSISGLMMHNELALRSKAEIDAGFARIWQVMHDGIERGMNTEGVLPGPLNVPRRAVALRRQLVSSDNISNDPMNVIDWINMYALAVSEENAAGGRVVTAPTNGACGIIPAVLAYYDKFRRPVNERSIARYFLAAGAIGALYKMNASISGAEVGCQGEIGVACSMAAAGLTELLGGSPAQVCNAAEIAMEHNLGLTCDPVAGQVQIPCIERNAINAVKAVNAARMAMRRTSAPRVSLDKVIETMYETGKDMNDKYRETSRGGLAIKVVCG >CP031653.1|AXP25315.1|743244_743634_+|enamine/imine-deaminase MKKIIETQRAPGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITDLNDFATINEVYKQFFDEHQATYPTRSYVQVARLPKDVKLEIEAIAVRSA >CP031653.1|AXP25314.1|740936_743231_+|PFL-like-enzyme-TdcE MKVDIDTSDKLYADAWLGFKGTDWKNEINVRDFIQHNYTPYEGDESFLAEATPATTELWEKVMEGIRIENATHAPVDFDTNIATTITAHDAGYINQPLEKIVGLQTDAPLKRALHPFGGINMIKSSFHAYGREMDSEFEYLFTDLRKTHNQGVFDVYSPDMLRCRKSGVLTGLPDGYGRGRIIGDYRRVALYGISYLVRERELQFADLQSRLEKGEDLEATIRLREELAEHRHALLQIQEMAAKYGFDISRPAQNAQEAVQWLYFAYLAAVKSQNGGAMSLGRTASFLDIYIERDFKAGVLNEQQAQELIDHFIMKIRMVRFLRTPEFDSLFSGDPIWATEVIGGMGLDGRTLVTKNSFRYLHTLHTMGPAPEPNLTILWSEELPIAFKKYAAQVSIVTSSLQYENDDLMRTDFNSDDYAIACCVSPMVIGKQMQFFGARANLAKTLLYAINGGVDEKLKIQVGPKTAPLMDDVLDYDKVMDSLDHFMDWLAVQYISALNIIHYMHDKYSYEASLMALHDRDVYRTMACGIAGLSVATDSLSAIKYARVKPIRDENGLAVDFEIDGEYPQYGNNDERVDSIACDLVERFMKKIKALPTYRNAVPTQSILTITSNVVYGQKTGNTPDGRRAGTPFAPGANPMHGRDRKGAVASLTSVAKLPFTYAKDGISYTFSIVPAALGKEDPVRKTNLVGLLDGYFHHEADVEGGQHLNVNVMNREMLLDAIEHPEKYPNLTIRVSGYAVRFNALTREQQQDVISRTFTQAL >CP031653.1|AXP25313.1|739694_740903_+|propionate-kinase MNEFPVVLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVNGGEPAPLAHHSYEGALKAIAFELEKRNLNDSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLHNYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTSHRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERAQLAIKTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVEIDTEMNNRSNSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGKVNAPAEFA >CP031653.1|AXP25312.1|738337_739669_+|threonine/serine-transporter-TdcC MSTSDSIVSSQTKQSSWRKSDTTWTLGLFGTAIGAGVLFFPIRAGFGGLIPILLMLVLAYPIAFYCHRALARLCLSGSNPSGNITETVEEHFGKTGGVVITFLYFFAICPLLWIYGVTITNTFMTFWENQLGFAPLNRGFVALFLLLLMAFVIWFGKDLMVKVMSYLVWPFIASLVLISLSLIPYWNSAVIDQVDLGSLSLTGHDGILITVWLGISIMVFSFNFSPIVSSFVVSKREEYEKDFGRDFTERKCSQIISRASMLMVAVVMFFAFSCLFTLSPANMAEAKAQNIPVLSYLANHFASMTGTKTTFAITLEYAASIIALVAIFKSFFGHYLGTLEGLNGLILKFGYKGDKTKVSLGKLNTISMIFIMGSTWVVAYANPNILDLIEAMGAPIIASLLCLLPMYAIRKAPSLAKYRGRLDNVFVTVIGLLTILNIVYKLF >CP031653.1|AXP25311.1|737326_738316_+|serine/threonine-dehydratase MHITYDLPVAIDDIIEAKQRLAGRIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSLTDAEKRKGVVACSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVEMEGRIFIPPYDDPKVIAGQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAVAIKSINPTIRVIGVQSENVHGMAASFHSGEITTHRTTGTLADGCDVSRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVVTEGAGALACAALLSGKLDQYIQNRKTVSIISGGNIDLSRVSQITGFVDA >CP031653.1|AXP25310.1|736289_737228_+|transcriptional-regulator MSTILLPKTQHLVVFQEVIRSGSIGSAAKELGLTQPAVSKIINDIEDYFGVELVVRKNTGVTLTPAGQLLLSRSESITREMKNMVNEISGMSSEAVVEVSFGFPSLIGFTFMSGMINKFKEVFPKAQVSMYEAQLSSFLPAIRDGRLDFAIGTLSAEMKLQDLHVEPLFESEFVLVASKSRTCTGTTTLESLKNEQWVLPQTNMGYYSELLTTLQRNGISIENIVKTDSVVTIYNLVLNADFLTVIPCDMTSPFGSNQFITIPVEETLPVAQYAAVWSKNYRIKKAASVLVELAKEYSSYNGCRRRQLIEVG >CP031653.1|AXP25309.1|735756_736101_-|DNA-binding-transcriptional-activator-TdcR MTGITIFYGDNIIRYVVNIKKGLRPYFKQLPDNYQAKFELNLMSKFSNFIINKPFSAINTAARHIFSRYLLENKHLFYQYFKISNTGIDHLEQLINVNFFSSDRTSFCECNRFP >CP031653.1|AXP25319.1|748225_748390_-|hypothetical-protein MSKKSAKKRQPVKPVVAKEPARTAKNFGYEEMLSELEAIVADAETRLAEDEATA >CP031653.1|AXP25320.1|748412_749114_-|pirin-like-protein-YhaK MITTRTARQCGQADYGWLQARYTFSFGHYFDPKLLGYASLRVLNQEVLAPGAAFQPRTYPKVDILNVILDGEAEYRDSEGNHVQASAGEALLLSTQPGVSYSEHNLSKDKPLTRMQLWLDACPQRENPLIQKLALNMGKQQLIASPEGTMGSLQLRQQVWLHHIVLDKGESANFQLHGPRAYLQSIHGKFHALTHHEEKAALTCGDGAFIRDEANITLVADSPLRALLIDLPV >CP031653.1|AXP25321.1|749218_750115_+|LysR-family-transcriptional-regulator MAKERALTLEALRVMDAIDRRGSFAAAADELGRVPSALSYTMQKLEEELDVVLFDRSGHRTKFTNVGRMLLERGRVLLEAADKLTTDAEALARGWETHLTIVTEALVPTPAFFPLIDKLAAKANTQLAIITEVLAGAWERLEQGRADIVIAPDMHFRSSSEINSRKLYTLMNVYVAAPDHPIHQEPEPLSEVTRVKYRGIAVADTARERPVLTVQLLDKQPRLTVSTIEDKRQALLAGLGVATMPYPMVEKDIAEGRLRVVSPESTSEIDIIMAWRRDSMGEAKSWCLREIPKLFSGK >CP031653.1|AXP25322.1|750165_750522_-|DUF805-domain-containing-protein MQWYLAVLKNYVGFSGRARRKEYWMFTLINAIVGAIINVIQLILGLEFPFLSLIYLAATIIPVIALCVRRLHDTDRSGAWALLYLVPIIGWLVLFVFACLEGNSGSNRYGNDPKFGSN >CP031653.1|AXP25323.1|750763_751129_-|DUF805-domain-containing-protein MDWYLKVLKNYVGFRGRARRKEYWMFILVNIIFTFVLGLLDKMLGWQRAGGEGILTTIYGILVFLPWWAVQFRRLHDTDRSAWWALLFLIPFIGWLIIIVFNCQAGTPGENRFGPDPKLEP >CP031653.1|AXP25324.1|751421_752408_-|glutathione-S-transferase-family-protein MGQLIDGVWHDTWYDTKSTGGKFQRSASAFRNWLTADGAPGPTGTGGFIAEKDRYHLYVSLACPWAHRTLIMRKLKGLEPFISVSVVNPLMLENGWTFDDSFPGATGDTLYQHEFLYQLYLHADPHYSGRVTVPVLWDKKNHTIVSNESAEIIRMFNTAFDALGAKAGDYYPPALQTKIDELNGWIYDTVNNGVYKAGFATSQQAYDEAVAKVFESLARLEQILGQHRYLTGNQLTEADIRLWTTLVRFDPVYVTHFKCDKHRISNYLNLYGFLRDIYQMPGIAETVNFDHIRNHYFRSHKTINPTGIISIGPWQDLDEPHGRDVRFG >CP031653.1|AXP25325.1|752477_752960_-|DoxX-family-protein MILSIDSNDANTAPLHKKTISSLSGAVESMMKKLEDVGVLVARILMPILFITAGWGKITGYAGTQQYMEAMGVPGFMLPLVILLEFGGGLAILFGFLTRTTALFTAGFTLLTAFLFHSNFAEGVNSLMFMKNLTISGGFLLLAITGPGAYSIDRLLNKKW >CP031653.1|AXP25326.1|753055_753355_-|hypothetical-protein MSSKVERERRKAQLLSQIQQQRLDLSASRREWLEATGAYDRRWNMLLSLRSWALVGSSVMAIWTIRHPNMLVRWARRGFGVWSAWRLVKTTLKQQQLRG >CP031653.1|AXP25327.1|753344_753749_-|hypothetical-protein MADTHHAQGPGKSVLGIGQRIVSIMVEMVETRLRLAVVELEEEKANLFQLLLMLGLTMLFAAFGLMSLMVLIIWAVDPQYRLNAMIATTVVLLLLALIGGIWTLRKSRKSTLLRHTRHELANDRQLLEEESREQ >CP031653.1|AXP25328.1|753751_754057_-|DUF883-domain-containing-protein MSKEHTTEHLRAELKSLSDTLEEVLSSSGEKSKEELSKIRSKAEQALKQSRYRLGETGDAIAKQTRVAAARADEYVRENPWTGVGIGAAIGVVLGVLLSRR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP031653_4 | 1121484-1121756 | Unclear |
I-E
Consensus repeat of CP031653_4
|
4 spacers
spacers of CP031653_4
>4.1|1121510|35|CP031653|PILER-CR CCGTCCACGCTGTAACGGCCATCATTAAGTTTAGT >4.2|1121571|35|CP031653|PILER-CR CCGGCTGATGGTCTGGGAGTGTCCATCGGGCAACT >4.3|1121632|35|CP031653|PILER-CR CCGGAAGTAGGCCTGACAGTGATTGAACGCATACT >4.4|1121693|35|CP031653|PILER-CR CCGAGTTGGGGCGGCGCAATAACGAGACGATACGC >4.5|1121509|36|CP031653|CRISPRCasFinder ACCGTCCACGCTGTAACGGCCATCATTAAGTTTAGT >4.6|1121570|36|CP031653|CRISPRCasFinder ACCGGCTGATGGTCTGGGAGTGTCCATCGGGCAACT >4.7|1121631|36|CP031653|CRISPRCasFinder ACCGGAAGTAGGCCTGACAGTGATTGAACGCATACT >4.8|1121692|36|CP031653|CRISPRCasFinder ACCGAGTTGGGGCGGCGCAATAACGAGACGATACGC >4.9|1121513|32|CP031653|CRT TCCACGCTGTAACGGCCATCATTAAGTTTAGT >4.10|1121574|32|CP031653|CRT GCTGATGGTCTGGGAGTGTCCATCGGGCAACT >4.11|1121635|32|CP031653|CRT GAAGTAGGCCTGACAGTGATTGAACGCATACT >4.12|1121696|32|CP031653|CRT AGTTGGGGCGGCGCAATAACGAGACGATACGC |
CRISPR arrays and Neighbor proteins around CP031653_4
The CRISPR arrays of CP031653_4 >merge|CP031653|4|1121484-1121756|PILER-CR,CRISPRCasFinder,CRT GAGTTCCCCGCGCCAGCGGGGATAAACCGTCCACGCTGTAACGGCCATCATTAAGTTTAGTGAGTTCCCCGCGCCAGCGGGGATAAACCGGCTGATGGTCTGGGAGTGTCCATCGGGCAACTGAGTTCCCCGCGCCAGCGGGGATAAACCGGAAGTAGGCCTGACAGTGATTGAACGCATACTGAGTTCCCCGCGCCAGCGGGGATAAACCGAGTTGGGGCGGCGCAATAACGAGACGATACGCGAGTTCCCCGCGCCAGCGGGGATAACTAT >CP031653|4|2|1121484-1121753|PILER-CR GAGTTCCCCGCGCCAGCGGGGATAAA CCGTCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAA CCGGCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAA CCGGAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAA CCGAGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAC >CP031653|4|3|1121484-1121752|CRISPRCasFinder GAGTTCCCCGCGCCAGCGGGGATAA ACCGTCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAA ACCGGCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAA ACCGGAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAA ACCGAGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAA >CP031653|4|1|1121484-1121756|CRT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAACTAT
>CP031653.1|AXP25661.1|1120472_1121144_+|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAVISRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >CP031653.1|AXP25660.1|1120193_1120334_-|hypothetical-protein MSEENKENGFNHVKTFTKIIFIFSVLVFNDNESKITDAAVNLFIQI >CP031653.1|AXP25659.1|1119307_1120180_-|YgcG-family-protein MRYFILMFTFVCSFVAAQPTIVPQLQQQVTDLTSSLNSQEKKELTHKLESIFNNTQVQIAVLIVPTTKDETIEQYATRVFDNWRLGDAKRNDGILIIVAWSDRTVRIKVGYGLEEKVTDALAGDIIRSNMIPAFKQQKLAQGLELAINALNNQLTSQHQYPTNPSESESASSSDHYYFAIFWVFAVMFFPFWFFHQCSNFCRACKSGVCISAIYLLDLFLFSDKIFSIAVFSFFFTFTIFMVFTCLCVLQKRASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASGRW >CP031653.1|AXP25658.1|1117949_1119248_+|enolase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >CP031653.1|AXP25657.1|1116224_1117862_+|CTP-synthetase MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK >CP031653.1|AXP25656.1|1115205_1115997_+|nucleoside-triphosphate-pyrophosphohydrolase MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADSSAENSSEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >CP031653.1|AXP25655.1|1114799_1115135_+|mRNA-interferase-MazF MVSRYVPDMGDLIWVDFDPTKGSEQAGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG >CP031653.1|AXP25654.1|1114551_1114800_+|MazF-MazE-toxin-antitoxin-system-antitoxin-MazE MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKEVW >CP031653.1|AXP25653.1|1112239_1114474_+|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS >CP031653.1|AXP25652.1|1110890_1112192_+|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKTLFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHDVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILAPQLEALLPKVRACLGSLQAMRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK >CP031653.1|AXP25662.1|1121801_1123280_-|sugar-kinase MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFAGHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIPRHMLFDVQMPGTVLGHITPQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDAKAQDLSPEDLLNKKASCVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNYFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDKMVRVKDIFMPVESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNADSIQSWSNA >CP031653.1|AXP25663.1|1123306_1124584_-|MFS-transporter MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQVAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSIIGLLALTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGMVIVFTFLLFQKIRTADSAPAMASSK >CP031653.1|AXP25664.1|1124902_1125688_+|SDR-family-oxidoreductase MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVDITAEGAPQKIIAASCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASPASNYVNGHLLVVDGGYLVR >CP031653.1|AXP25665.1|1125757_1127212_+|FAD-binding-oxidoreductase MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYILEDMKTGFNILREVMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKATGEGIAEIVARYPQCQRVDSKLIETWFNNLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >CP031653.1|AXP25666.1|1127305_1128643_+|MFS-transporter MNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLASIAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >CP031653.1|AXP25667.1|1128620_1129400_+|electron-transfer-flavoprotein-subunit-beta/FixA-family-protein MNILLAFKAEPDAGMLAEKEWQAAAQGKSGPDISLLRSLLGADEQAAAALLLAQRKNGTPMSLTALSMGDERALHWLRYLMALGFEEAVLLETAADLRFAPEFVARHIAEWQHQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDALFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAEMPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWRDYLRQRMQP >CP031653.1|AXP25668.1|1129396_1130257_+|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MNIAIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSVRKSHWGNALTATLQTEKRPLCLSLARQAGAAKNATLPSGMQQLIIVPGALPDWLVSTEDLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKVVLEALVTNIHADCQ >CP031653.1|AXP25669.1|1130404_1130980_-|glycerol-3-phosphate-responsive-antiterminator MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >CP031653.1|AXP25670.1|1130996_1131257_-|ferredoxin-family-protein MSVARNLWRVADAPHIVPADSVERQTAERLISACPAGLFSLTPEGDLRIDYRSCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >CP031653.1|AXP25671.1|1131247_1132519_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP031653_5 | 1143550-1144066 | Unclear |
I-E
Consensus repeat of CP031653_5
|
8 spacers
spacers of CP031653_5
>5.1|1143578|33|CP031653|PILER-CR GTTGCCCGCGCAATTCCGGGAGCATCCGCAATT >5.2|1143639|33|CP031653|PILER-CR GACGGACAAAATATATATTGATTTGCGAATTAT >5.3|1143700|33|CP031653|PILER-CR GCCCGTCACCGACGCGCAGTGGCGCTACCGTGA >5.4|1143761|33|CP031653|PILER-CR GGGATCTAACGCGCTGTAAAAATTCCGTGCTTT >5.5|1143822|33|CP031653|PILER-CR ATGCGGATTACCGGCAAAACATGGGAGCAAACC >5.6|1143883|33|CP031653|PILER-CR GCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA >5.7|1143944|33|CP031653|PILER-CR GGTTTACCGCCCCGCAGAGGCGCTGGCAGATCC >5.8|1144005|33|CP031653|PILER-CR GGGATGACCTGTCGCTAAAACTCGCCGCGTACA >5.9|1143579|32|CP031653|CRISPRCasFinder,CRT TTGCCCGCGCAATTCCGGGAGCATCCGCAATT >5.10|1143640|32|CP031653|CRISPRCasFinder,CRT ACGGACAAAATATATATTGATTTGCGAATTAT >5.11|1143701|32|CP031653|CRISPRCasFinder,CRT CCCGTCACCGACGCGCAGTGGCGCTACCGTGA >5.12|1143762|32|CP031653|CRISPRCasFinder,CRT GGATCTAACGCGCTGTAAAAATTCCGTGCTTT >5.13|1143823|32|CP031653|CRISPRCasFinder,CRT TGCGGATTACCGGCAAAACATGGGAGCAAACC >5.14|1143884|32|CP031653|CRISPRCasFinder,CRT CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA >5.15|1143945|32|CP031653|CRISPRCasFinder,CRT GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC >5.16|1144006|32|CP031653|CRISPRCasFinder,CRT GGATGACCTGTCGCTAAAACTCGCCGCGTACA |
cas2,cas1,cas6e,cas5 |
CRISPR arrays and Neighbor proteins around CP031653_5
The CRISPR arrays of CP031653_5 >merge|CP031653|5|1143550-1144066|PILER-CR,CRISPRCasFinder,CRT GTGTTCCCCGCGCCAGCGGGGATAAACCGTTGCCCGCGCAATTCCGGGAGCATCCGCAATTGTGTTCCCCGCGCCAGCGGGGATAAACCGACGGACAAAATATATATTGATTTGCGAATTATGTGTTCCCCGCGCCAGCGGGGATAAACCGCCCGTCACCGACGCGCAGTGGCGCTACCGTGAGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATCTAACGCGCTGTAAAAATTCCGTGCTTTGTGTTCCCCGCGCCAGCGGGGATAAACCATGCGGATTACCGGCAAAACATGGGAGCAAACCGTGTTCCCCGCGCCAGCGGGGATAAACCGCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTAGTGTTCCCCGCGCCAGCGGGGATAAACCGGTTTACCGCCCCGCAGAGGCGCTGGCAGATCCGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATGACCTGTCGCTAAAACTCGCCGCGTACAGTGTTCCCCGCGCCAGCGGGGATAAACCG >CP031653|5|3|1143550-1144065|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACC GTTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACC GACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACC GCCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACC GGGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACC ATGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACC GCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACC GGTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACC GGGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACC >CP031653|5|4|1143550-1144066|CRISPRCasFinder GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACCG >CP031653|5|2|1143550-1144066|CRT GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACCG
>CP031653.1|AXP25680.1|1143159_1143453_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAIWLLEVRAGVYVGDTSKRIREMIWQQITQLAGCGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVDNQ >CP031653.1|AXP25679.1|1142239_1143163_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLSPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLAATVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALTEDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRQTYALLAKQYGVKWNGRKYDPKDWEKGDVVNRCISAATSCLYGISEAAVLAAGYAPAIGFIHSGKPLSFVYDIADIIKFDSVVPKAFEIAARQPAEPDKEVRLACRDIFRSTKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPETLGDSGHRGRGG >CP031653.1|AXP25678.1|1141592_1142243_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQERPAESDTFTIECRSFAPELRTGQQLCFNLRANPTICKSGKRHDLLMEAKRQVRGQAEGSDVWLHQQQAALDWLAAQGERSGFTLLDTSVDAYRQQQLRRENSRQLIQFSSVDYTGMLTVTDPGLFLQRLSQGYGKSRAFGCGLMLIKPGAEA >CP031653.1|AXP25677.1|1140864_1141611_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLIFQLHGPMASWGVDAPGEVRHTHELPSRSALLGLLAAGVGIRRDDTERLNAFNRHYSLVVCASRNPRWARDYHTIQMPKEVRKARYFSRREELSDPDLLSAIISRRDYYTDAWWMVAVATTADAPYSLEQLQDGLRHPVFPLYLGRKSHPLALPLAPLLLEGNACDALCNAYQQYQDHFHKLKVSLPKLQDECWWEGEHDGLVASKILRRRDVPLNRQQWLFGERTINQGPWLSKEEPCTSQE >CP031653.1|AXP25676.1|1137861_1138014_+|type-I-toxin-antitoxin-system-hok-family-toxin MLTKYALVAIIVLCCTVLGFTLMVGDSLCELSIRERGMEFKAVLAYESKK >CP031653.1|AXP25675.1|1136862_1137597_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIHPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMLEEETRFFGLKRECGLHEG >CP031653.1|AXP25674.1|1135076_1136789_+|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit MSEKHPGPLVVEGKLTDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKIAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD >CP031653.1|AXP25673.1|1133277_1135077_+|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTQVPPSALLPLNPEQLVRLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAEMPGITIISASQTGNARRVAEALRDDLLAAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGAVNEIHTSPYSKDAPLVASLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVEGKTLPLNEALQWHFELTVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPIVDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQAEVENEVHVTVGVVRYDVEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAPGKNWLFFGNPHFTEDFLYQVEWQRYVKDGVLTRIDLAWSRDQKEKVYVQDKLREQGAELWRWINDGAHIYVCGDANRMAKDVEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY >CP031653.1|AXP25672.1|1132596_1132962_-|6-carboxytetrahydropterin-synthase-QueD MMSTTLFKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIIDFAELKAAFKPTYERLDHHYLNDIPGLENPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCIYRGE >CP031653.1|AXP25671.1|1131247_1132519_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL >CP031653.1|AXP25681.1|1144147_1145185_-|aminopeptidase MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEIFDKAGIAVLSVEATNWNLGNKDGYQQRAKTAAFPAGNSWHDVRLDNQQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >CP031653.1|AXP25682.1|1145436_1146345_+|sulfate-adenylyltransferase-subunit-2 MDQIRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >CP031653.1|AXP25683.1|1146346_1147774_+|sulfate-adenylyltransferase MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEKTFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMAWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFAGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >CP031653.1|AXP25684.1|1147773_1148379_+|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVQQRELHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >CP031653.1|AXP25685.1|1148428_1148752_+|DUF3561-family-protein MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >CP031653.1|AXP25686.1|1148945_1149257_+|cell-division-protein-FtsB MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNDDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR >CP031653.1|AXP25687.1|1149275_1149986_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSRFAQLPLANHPQITVVDGGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >CP031653.1|AXP25688.1|1149985_1150465_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYERGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >CP031653.1|AXP25689.1|1150461_1151511_+|tRNA-pseudouridine(13)-synthase-TruD MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >CP031653.1|AXP25690.1|1151491_1152253_+|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP031653_6 | 1668860-1668977 | Orphan |
NA
Consensus repeat of CP031653_6
|
1 spacers
spacers of CP031653_6
>6.1|1668891|56|CP031653|CRISPRCasFinder TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA |
CRISPR arrays and Neighbor proteins around CP031653_6
The CRISPR arrays of CP031653_6 >merge|CP031653|6|1668860-1668977|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGCTGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAACCGAGCCGTAGGCCGGATAAGGCGTTTACGC >CP031653|6|5|1668860-1668977|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGC TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA CCGAGCCGTAGGCCGGATAAGGCGTTTACGC
>CP031653.1|AXP29120.1|1667632_1668763_-|ribonucleoside-diphosphate-reductase-1-subunit-beta MAYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEALHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAVGLDLPFQTRSNPIPWINTWLVSDNVQVAPQEVEVSSYLVGQIDSEVDTDDLSNFQL >CP031653.1|AXP26163.1|1667378_1667633_-|ferredoxin MARVTLRITGTQLLCQDEHPSLLAALESHNVAVEYQCREGYCGSCRTRLVAGQVDWIAEPLAFIQPGEILPCCCRAKGDIEIEM >CP031653.1|AXP26162.1|1666674_1667325_+|lipopolysaccharide-kinase-InaA MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMTHHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRALLVTEDMAGFISIADWYAQHAVSPYSDEVRQAMLKAVALAFKKMHSINRQHGCCYVRHIYVKTEGKAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEPIPKADWEQVKAYYYAM >CP031653.1|AXP26161.1|1666253_1666460_-|hypothetical-protein MNFIRQGLGIALQPELTLKSIAGELCSVPLEPTFYRQISLLAKEKPVEGSPLFLLQTCTEQLVVSGKI >CP031653.1|AXP26160.1|1664936_1665788_-|hypothetical-protein MPNTSIHLSRCNILQNNKLQPEEVYKESQQTAKLEIFCDEFLKISQSRYGLSTSADSIANLLTFFTKASDAIDRIKTQKIDVHSYGFVPIRHFVEHVITYKNEFAADYEPYTLTFTRGENNEGVLSIESKEGSISQRTINLNEYETAINIINEHVTKENIHNTVQSLTEKDISKINSSDKHHKISSEESIKSQLYSDQKKYADLLLHSEKNTEWYKYASSEERYDKFKNSSKEIKNTYKQIVLAQKKLNQMKYINKLGNDSNLLIVFYVQIMPDDFVMQLHRF >CP031653.1|AXP26159.1|1663940_1664255_-|hypothetical-protein MTNKLGGELIDIADKKLAPLINDSFSYTRDFFAYSKQENNIFTFDNSKFVDPKEKEGLMIQHSNGQLVITGKYCPEGVQTAFTQEQYDKLIRYINIFFTFPKCE >CP031653.1|AXP26158.1|1662645_1663722_+|glycerophosphodiester-phosphodiesterase MKLKLKNLSMAIMMSTIVMGSSAMAADSNEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDHLVVLHDHYLDRVTDVADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTFEEEIEFVQGLNHSTGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQCFDADELKRIKNELEPKMGMDLNLVQLIAYTDWNETQQKQPDGSWVNYSYDWMFKPGAMKQVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKLPEYTTDVNQLYDVLYNKAGVNGLFTDFPDKAVKFLNKE >CP031653.1|AXP26157.1|1661282_1662641_+|glycerol-3-phosphate-transporter MLSIFKPAPHKARLPAAEIDPTYRRLRWQIFLGIFFGYAAYYLVRKNFALAMPYLVEQGFSRGDLGFALSGISIAYGFSKFIMGSVSDRSNPRVFLPAGLILAAAVMLFMGFVPWATSSIAVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLFLLGMAWFNDWHAALYMPAFCAILVALFAFAMMRDTPQSCGLPPIEEYKNDYPDDYNEKAEQELTAKQIFMQYVLPNKLLWYIAIANVFVYLLRYGILDWSPTYLKEVKHFALDKSSWAYFLYEYAGIPGTLLCGWMSDKVFRGNRGATGVFFMTLVTIATIVYWMNPAGNPTVDMICMIVIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGSVAASAIVGYTVDFFGWDGGFMVMIGGSILAVILLIVVMIGEKRRHEQLLQKRNGG >CP031653.1|AXP26156.1|1659381_1661010_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-A MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRNHGLLHSGARYAVTDAESARECISENQILKRIARHCVEPTNGLFITLPEDDLSFQATFIRACEEAGISAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDPFRLTAANMLDAKEHGAVILTAHEVTGLIREGATVCGVRVRNHLTGETQALHAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRINQHVINRCRKPSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAPVMAKTRILRAYSGVRPLVASDDDPSGRNVSRGIVLLDHAERDGLDGFITITGGKLMTYRLMAEWATDAVCRKLGNTRPCTTADLALPGSQDPAEVTLRKVISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVENLNVNSLLDLRRRTRVGMGTCQGELCACRAAGLLQRFNVTTSAQSIEQLSTFLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL >CP031653.1|AXP26155.1|1658132_1659392_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-B MRFDTVIMGGGLAGLLCGLQLQKHGLRCAIVTRGQSALHFSSGSLDLLSHLPDGQPVADIHSGLESLRQQAPAHPYSLLGPQRVLDLACQAQALIAESGAQLQGSVELAHQRITPLGTLRSTWLSSPEVPVWPLPAKKICVVGISGLMDFQAHLAAASLRELDLSVETAEIELPELDVLRNNATEFRAVNIARFLDNEENWPLLLDALIPVANTCEMILMPACFGLADDKLWRWLNEKLPCSLMLLPTLPPSVLGIRLQNQLQRQFVRQGGVWMPGDEVKKVTCKNGVVNEIWTRNHADIPLRPRFAVLASGSFFSGGLVAERNGIREPILGLDVLQTATRGEWYKGDFFAPQPWQQFGVTTDETLRPSQAGQTIENLFAIGSVLGGFDPIAQGCGGGVCAVSALHAAQQIAQRAGGQQ >CP031653.1|AXP26164.1|1668996_1671282_-|ribonucleoside-diphosphate-reductase-1-subunit-alpha MNQNLLVTKRDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADLISRDAPDYQYLAARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFKQMDTFIDHDRDMTFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAACLFSNYPRETRLQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINATSSAIVKYVSQRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMWHLEVESLLVLKNNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFFADQEEFERLYTKYEKDDSIRKQRVKAVELFSLMMQERASTGRIYIQNVDHCNTHSPFDPAIAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELEELAILAVRALDALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQYYLLKASNELAKEQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTHGLRNSTLSALMPSETSSQISNATNGIEPPRGYVSIKASKDGILRQVVPDYEHLHDAYELLWEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLTAYKFGVKTLYYQNTRDGAEDAQDDLVPSIQDDGCESGACKI >CP031653.1|AXP26165.1|1671977_1675730_+|AIDA-I-family-autotransporter-YfaL MRIIFLRKEYLSLLPSMIASLFSANGVAAVTDSCQGYDVKASCQASRQSLSGITQDWSIADGQWLVFSDMTNNASGGAVFLQQGAEFSLLPENETGMTLFANNTVTGEYNNGGAIFAKENSTLNLTDVIFSGNVAGGYGGAIYSSGTNDTGAVDLRVTNAMFRNNIANDGKGGAIYTINNDVYLSDVIFDNNQAYTSTSYSDGDGGAIDVTDNNSDSKHPSGYTIVNNTAFTNNTAEGYGGAIYTNSVTAPYLIDISVDDSYSQNGGVLVDENNSAAGYGDGPSSAAGGFMYLGLSEVTFDIADGKTLVIGNTENDGAVDSIAGTGLITKTGSGDLVLNADNNDFTGEMQIENGEVTLGRCNSLMNVGDTHCQDDPQDCYGLTIGSIDQYQNQAELNVGSTQQTFVHALTGFQNGTLNIDAGGNVTVNQGSFAGIIEGAGQLTIAQNGSYVLAGAQSMALTGDIVVDDGAVLSLEGDAADLTALQDDPQSIVLNGGVLDLSDFSTWQSGTSYNDGLEVSGSSGTVIGSQDVVDLAGGDNLHIGGDGKDGVYVVVDASDGQVSLANNNSYLGTTQIASGTLMVSDNSQLGDTHYNRQVIFTDKQQESVMEITSDVDTRSDAAGHGRDIEMRADGEVAVDAGVDTQWGALMADSSGQHQDEGSTLTKTGAGTLELTASGTTQSAVRVEEGTLKGDVADILPYASSLWVGDGATFVTGADQDIQSIDAISSGTIDISDGTVLRLTGQDTSVALNASLFNGDGTLVNATDGVTLTGELNTNLETDSLTYLSNVTVNGNLTNTSGAVSLQNGVAGDTLTVNGDYTGGGTLLLDSELNGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKMVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAKVGGYLNNLRAANQAFMMERRDHAGGDGQTLNLRVIGGDYHYTAAGQLAQHEDTSTVQLSGDLFSGRWGTDGEWMLGIVGGYSDNQGDSRSNMTGTRADNQNHGYAVGLTSSWFQHGNQKQGAWLDSWLQYAWFSNDVSEQEDGTDHYHSSGIIASLEAGYQWLPGRGVVIEPQAQVIYQGVQQDDFTAANRARVSQSQGDDIQTRLGLHSEWRTAVHVIPTLDLNYYHDPHSTEIEEDGSTISDDAVKQRGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW >CP031653.1|AXP26166.1|1675857_1676580_-|bifunctional-2-polyprenyl-6-hydroxyphenol-methylase/3-demethylubiquinol-3-O-methyltransferase-UbiG MNAEKSPENHNVDHEEIAKFEAVASRWWDLEGEFKPLHRINPLRLGYIAERAGGLFGKKVLDVGCGGGILAESMAREGATVTGLDMGFEPLQVAKLHALESGIQVDYVQETVEKHAAKHAGQYDVVTCMEMLEHVPDPQSVVRACAQLVKPGGDVFFSTLNRNGKSWLMAVVGAEYILRMVPKGTHDVKKFIKPAELLGWVDQTSLKERHITGLHYNPITNSFKLGPGVDVNYMLHTQNK >CP031653.1|AXP26167.1|1676726_1679354_+|DNA-topoisomerase-(ATP-hydrolyzing)-subunit-A MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDLAVYNTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAPTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITDAGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE >CP031653.1|AXP26168.1|1679502_1681191_+|DUF2138-domain-containing-protein MSGEKKAKGWRFYGLVGFGAIALLSAGVWALQYAGSGPEKTLSPLVVHNNLQIDLNEPDLFLDSDSLSQLPKDLLTIPFLHDVLSEDFVFYYQNHADRLGIEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETVPVYQLRYNGNNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIAGDLLSGKKRWQASFGLEERTAEKTPVRQRIVVSARWLGFGYQRLMPSFAGVRFEMGNDGWHSFVALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQENDKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGAHESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYFFRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYINPQGIAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLAQMEPGAAWQWLPITWQPL >CP031653.1|AXP26169.1|1681187_1681811_+|DUF1175-domain-containing-protein MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYRLNFLAR >CP031653.1|AXP26170.1|1681744_1686349_+|alpha-2-macroglobulin-family-protein MDTQRFQSQFHWHLSFKFSGAIAACLSLSLVGTGLANADDSLPSSNYAPPAGGTFFLLADSSFSSSEEAKVRLEAPGRDYRRYQMEEYGGVDVRLYRIPDPMAFLRQQKNLHRIVVQPQYLGDGLNNTLTWLWDNWYGKSRRVMQRTFSSQSRQNVTQALPELQLGNAIIKPSRYVQNNQFSPLKKYPLVEQFRYPLWQAKPFEPQQGVKLEGASSNFISPQPGNIYIPLGQQEPGLYLVEAMVGGYRATTVVFVSDTVALSKVSGNELLVWTAGKKQGEAKPGSEILWTDGLGVMTRGVTDDSGTLQLQHISPERSYILGKDAEGGVFVSENFFYESEIYNTRLYIFTDRPLYRAGDRVDVKVMGREFHDPLHSSPIVSAPAKLSVLDANGSLLQTVDVTLDARNGGQGSFRLPENAVAGGYELRLAYRNQVYSSSFRVANYIKPHFEIGLALAKKEFKTGEAVSGKLQLLYPDGEPVKNARVQLSLRAQQLSMVGNDLRYAGRFPVSLEGSETVSDASGHVALNLPAADKPSRYLLTVSASDGAAYRVTTTKEILIERGLAHYSLSTAAQYSNSGESVVFRYAALESSKQVPVTYEWLRLEDRTSHSGELPSGGKSFTVNFAKPGNYNLTLRDKDGLILAGLSHAVSGKGSTAHTGTVDIVADKTLYQPGETAKMLITFPEPIDEALLTLERDRVEQQSLLSHPANWLTLQRLNDTQYEARVPVSNSFAPNITFSVLYTRNGQYSFQNAGIKVAVPQLDIRVKTDKTHYQPGELVNVELTSSLKGKPVSAQLTVGVVDEMIYALQPEIAPNIGKFFYPLGRNNVRTSSSLSFISYDQALSSEPVAPGATNRSERRVKMLERPRREEVDTAAWMPSLTTDKQGKAYFTFLMPDSLTRWRITARGMNGDGLVGQGRAYLRSEKNLYMKWSMPTVYRVGDKPAAGLFIFSQQDNEPVALVTKFAGAEMRQTLTLHKGANYISLTQNIQQSGLLSAELQQNGQVQDSISTKLSFVDNSWPVEQQKNVMLGGGDNALMLPEQASNIRLQSSETPQEIFRNNLDALVDEPWGGVINTGSRLIPLSLAWRSLADHQSAAANDIRQMIQVNRLRLMQLAGPGARFTWWGEDGNGDAFLTAWAWYADWQASQAIGVTQQPEYWQHMLDSYAEQADNMPLLHRALVLAWAQEMNLPCKTLLKGLDEAIARRGTKTEDFSEEDTRDINDSLILDTPESPLADAVANVLTMTLLKKAQLKSTVMPQVQQYAWDKAANSNQPLAHTVVLLNSGGDATQAAAILSGLTAEQSTIERALAMNWLAKYMATMPPVVLPAPAGAWAKHKLTGGGEYWRWVGQGVPDILSFGDELSPQNVQVRWREPAKTAQQSNIPVTVERQLYRLITGEEEMSFTLQPVTSNEIDSDALYLDEITLTSEQDAVLRYGQVEVPLPPGADVERTTWGISVNKPNAAKQQGQLLEIARNEMGELAYMVPVKELTGTVTFRHLLRFSQKGQFVLPPARYMRSYAPAQQSVAAGSEWTRMQVK >CP031653.1|AXP26171.1|1686349_1687999_+|DUF2300-domain-containing-protein MNWRRIVWLLALVTLPTLAEEPPLQLALRGAQHDQLYKLSSSGVTNVSTLPDTLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESITRDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLTTLKPETSVTVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWFADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQVASGQCVEVELFARYPLKKITAEKSTTAVKPGVLNGRYRVTFTNGNHITFVSHGETTLLSEKGKLKLQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLTIPDSSATQRVSASPATTGARTMAAWTQDLIYAGDPVHYHGSRATEGTLSWRQATAQAGQGERYDQILAFAYPDNSLSRWGAPRSTCQLLPKAKAWLAKKMPQWRRILQAETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRLDLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD >CP031653.1|AXP26172.1|1688003_1688780_+|DUF2135-domain-containing-protein MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPVEGEDASFSQSINYPASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDGSFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWDTDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPVHGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGELTLVKSFDW >CP031653.1|AXP26173.1|1688853_1690038_-|acetyl-CoA-C-acetyltransferase MKNCVIVSAVRTAIGSFNGSLASTSAIDLGATVIKAAIERAKIDSQHVDEVIMGNVLQAGLGQNPARQALLKSGLAETVCGFTVNKVCGSGLKSVALAAQAIQAGQAQSIVAGGMENMSLAPYLLDAKARSGYRLGDGQVYDVILRDGLMCATHGYHMGITAENVAKEYGITREMQDELALHSQRKAAAAIESGAFTAEIVPVNVVTRKKTFVFSQDEFPKANSTAEALGALRPAFDKAGTVTAGNASGINDGAAALVIMEESAALAAGLTPLARIKSYASGGVPPALMGMGPVPATQKALQLAGLQLADIDLIEANEAFAAQFLAVGKTLGFDPEKVNVNGGAIALGHPIGASGARILVTLLHAMQARDKTLGLATLCIGGGQGIAMVIERLN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP031653_7 | 2301840-2301963 | Orphan |
NA
Consensus repeat of CP031653_7
|
1 spacers
spacers of CP031653_7
>7.1|2301883|38|CP031653|CRISPRCasFinder CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA |
CRISPR arrays and Neighbor proteins around CP031653_7
The CRISPR arrays of CP031653_7 >merge|CP031653|7|2301840-2301963|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA >CP031653|7|6|2301840-2301963|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>CP031653.1|AXP26713.1|2301399_2301705_-|monooxygenase MATLLQLHFAFNGPFGDAMAEQLKPLAESINQEPGFLWKVWTESEKNHEAGGIYLFTDEKSALAYLEKHTARLKNLGVEEVVAKVFDVNEPLSQINQAKLA >CP031653.1|AXP26712.1|2299669_2301274_-|FAD-NAD(P)-binding-protein MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENSKMMLANIASIEIPPINCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRILLGEYFRDQFLRLVDQARQQKFAVAVYESCQVTDLQITNAGVMLATNQDLPSETFDLVVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTSLSGLDAAMAVAIQHGSFIEDDKQHVVFNRDNASEKLNITLMSRTGILPEADFYCPIPYEPLHIVTDQALNAEIQKGEEGLLDRVFRLIVEEIKFADPDWSQRIALESLNVDSFAQAWFAERKQRDPFDWAEKNLQEVERNKREKHTVPWRYVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLALREAGIIHILALGEDYEMEINESRTVLKTEDNSYSFDVFIDARGQRPLKVKDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEDIRGRVAFGALPWLMHDQPFVQGLTACAEIGEAMARAVVKPASRARRRLSFD >CP031653.1|AXP26711.1|2298845_2299658_+|hypothetical-protein MIITRADLREWRIGAVMYRWFLRHFPRGGSYADIHHALIEEGYTDWAESLVEYAWKKWLADENFAHQEVSSMQKLATDPGEIPFCSQFARSDDHARIGCCEDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSRISSAGDSSRIANTGMRVRVCTLGERCHVASNGDLAQIASFGANARIANSGDNVHIIASGENSTVVSTGVVDSIILGPGGSAALAYHDGERVRFAVAIEGENNIRAGVRYRLNEQHQFVEC >CP031653.1|AXP26710.1|2298056_2298842_+|thiosulfate-reductase-cytochrome-B-subunit MNPSQHAEQFQSQLANYVPQFTPEFWPVWLIIAGVLLVGMWLVLGLHALLRARGVKKSVTDYGEKIYLYCKAVRLWHWSNALLFVLLLASGLINHFALVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGGNGHHYRIRRQGWLERAAKQTRFYLFGIMQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGLLCLYPQAVGDVFPGVRYWLLQAHFALAFISLFFIFGHLYLCTTGRTPHETFKSMVDGYHRH >CP031653.1|AXP26709.1|2297391_2298060_+|4Fe-4S-dicluster-domain-containing-protein MSFTRRKFVLGMGTVIFFTGSASSLLANTRQEKEVRYAMIHDESRCNGCNICARACRKTNHVPAQGSRLSIAHIPVTDNDNETQYHFFRQSCQHCEDAPCIDVCPTGASWRDEQGIVRVEKSQCIGCSYCIGACPYQVRYLNPVTKVADKCDFCAESRLAKGFPPICVSACPEHALIFGREDSPEIQAWLQQNKYYQYQLPGAGKPHLYRRFGQHLIKKENV >CP031653.1|AXP26708.1|2296680_2297328_+|YdhW-family-putative-oxidoreductase-system-protein MGEMNHRDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHSAAGQLVARAVFLSPPYSVAEEELSVLLESIKQNGDYADIACMTGSQDDYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAPYYFQEAQIEAAIAAMDVAPEYADIRQVESSTAVLYLFSERFMTYGKAYGLCEWFEVEQFQNP >CP031653.1|AXP26707.1|2294574_2296677_+|aldehyde-ferredoxin-oxidoreductase MANGWTGNILRVNLTTGNITLEDSSKFKSFVGGMGFGYKIMYDEVPPGTKPFDEANKLVFATGPLTGSGAPCSSRVNITSLSTFTKGNLVVDAHMGGFFAAQMKFAGYDVIIIEGKAKSPVWLKIKDDKVSLEKADFLWGKGTRATTEEICRLTSPETCVAAIGQAGENLVPLSGMLNSRNHSGGAGTGAIMGSKNLKAIAVEGTKGVNIADRQEMKRLNDYMMTELIGANNNHVVPSTPQSWAEYSDPKSRWTARKELFWGAAEGGPIETGEIPPGNQNTVGFRTYKSVFDLGPAAEKYTVKMSGCHSCPIRCMTQMNIPRVKEFGVPSTGGNTCVANFVHTTIFPNGPKDFEDKDDGRVIGNLVGLNLFDDYGLWCNYGQLHRDFTYCYSKGVFKRVLPAEEYAEIHWDQLEAGDVNFIKDFYYRLAHRVGELSHLADGSYAIAERWNLGEEYWGYAKNKLWSPFGYPVHHANEASAQVGSIVNCMFNRDCMTHTHINFIGSGLPLKLQREVAKELFGSEDAYDETKNYTPINDAKIKYAKWSLLRVCLHNAVTLCNWVWPMTVSPLKSRNYRGDLALEAKFFKAITGEEMTQEKLDLAAERIFTLHRAYTVKLMQTKDMRNEHDLICSWVFDKDPQIPVFTEGTDKMDRDDMHASLTMFYKEMGWDPQLGCPTRETLQRLGLEDIAADLAAHNLLPV >CP031653.1|AXP26706.1|2293927_2294554_+|ferredoxin-like-protein MNPVDRPLLDIGLTRLEFLRISGKGLAGLTIAPALLSLLGCKQEDIDSGTVGLINTPKGVLVTQRARCTGCHRCEISCTNFNDGSVGTFFSRIKIHRNYFFGDNGVGSGGGLYGDLNYTADTCRQCKEPQCMNVCPIGAITWQQKEGCITVDHKRCIGCSACTTACPWMMATVNTESKKSSKCVLCGECANACPTGALKIIEWKDITV >CP031653.1|AXP26705.1|2293262_2293472_+|fumarate-hydratase-FumD MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRIQRSELEKQAMETVINALVK >CP031653.1|AXP26704.1|2291294_2292707_-|pyruvate-kinase-PykF MKKTKIVCTIGPKTESEEMLAKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKTAAILLDTKGPEIRTMKLEGGNDVSLKAGQTFTFTTDKSVIGNSEMVAVTYEGFTTDLSVGNTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNLPGVSIALPALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVIEIREHLKAHGGENIHIISKIENQEGLNNFDEILEASDGIMVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATQMLDSMIKNPRPTRAEAGDVANAILDGTDAVMLSGESAKGKYPLEAVSIMATICERTDRVMNSRLEFNNDNRKLRITEAVCRGAVETAEKLDAPLIVVATQGGKSARAVRKYFPDATILALTTNEKTAHQLVLSKGVVPQLVKEITSTDDFYRLGKELALQSGLAHKGDVVVMVSGALVPSGTTNTASVHVL >CP031653.1|AXP26714.1|2302277_2303534_+|hypothetical-protein MGSDAKNLMSDGNVQIVKTGEVIGATQLTEGELIVEAGGRAENTVVTGAGWLKVATGGIAKCTQYGNNGTLSVSDGAIATDIVQSEGGAISLSTLATVNGRHPEGEFSVDQGYACGLLLENGGNLRVLEGHRAEKIILDQEGGLLVNGTTSAVVVDEGGELLVYPGGEASNCEINQGGVFMLAGKASDTLLAGGTMNNLGGEDSDTIVENGSIYRLGTDGLQLYSSGKTQNLSVNVGGRAEVHAGTLENAVIQGGTVILLSPTSADENFVVEEDRAPVELTGSVALLDGASMIIGYGADLQQSTITVQQGGVLILDGSTVKGDGVTFIVGNINLNGGKLWLITGAATHVQLKVKRLRGEGAICLQTSAKEISPDFINVKGEVTGDIHVEITDASRQTLCNALKLQPDEDGIGATLQPA >CP031653.1|AXP26715.1|2303574_2304948_-|multidrug-resistance-protein-MdtK MQKYISEARLLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGRRERIAHQVRQGFWLAGFVSVLIMLVLWNAGYIIRSMENIDPALADKAVGYLRALLWGAPGYLFFQVARNQCEGLAKTKPGMVMGFIGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFLAMVSYIKRARSMRDIRNEKGTAKPDPAVMKRLIQLGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLMVGVCMATLTAIFTVSLREQIALLYNDNPEVVTLAAHLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFYITFTAYWVLGLPSGYILALTDLVVEPMGPAGFWIGFIIGLTSAAIMMMLRMRFLQRLPSVIILQRASR >CP031653.1|AXP26716.1|2305162_2305804_+|riboflavin-synthase MFTGIVQGTVKLVSIDEKPNFRTHVVELPDHMLDGLETGASVAHNGCCLTVTEINGNHVSFDLMKETLRITNLGDLKVGDWVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQIWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARENAMNQPGTEA >CP031653.1|AXP26717.1|2305843_2306992_-|cyclopropane-fatty-acyl-phospholipid-synthase MSSSCIEEVSVPDDNWYRIANELLSRAGIAINGSAPADIRVKNPDFFKRVLQEGSLGLGESYMDGWWECDRLDMFFSKVLRAGLENQLPHHFKDTLRIASARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPFMQYSCAYWKDADNLESAQQAKLKMICEKLQLKPGMRVLDIGCGWGGLAHYMASNYDVSVVGVTISAEQQKMAQERCEGLDVTILLQDYRDLNDQFDRIVSVGMFEHVGPKNYDTYFAVVDRNLKPEGIFLLHTIGSKKTDLNVDPWINKYIFPNGCLPSVRQIAQSSEPHFVMEDWHNFGADYDTTLMAWYERFLAAWPEIADNYSERFKRMFTYYLNACAGAFRARDIQLWQVVFSRGVENGLRVAR >CP031653.1|AXP26718.1|2307282_2308494_-|Bcr/CflA-family-multidrug-efflux-MFS-transporter MQPGKRFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPASAVSASLSLFLAGFAAAQLLWGPLSDRYGRKPVLLIGLTIFALGSLGMLWVENAATLLVLRFVQAVGVCAAAVIWQALVTDYYPSQKVNRIFATIMPLVGLSPALAPLLGSWLLVHFSWQAIFATLFAITVVLILPIFWLKPTTKARNNSQDGLTFTDLLRSKTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVIGLSYVPQTIAFLIGGYGCRAALQKWQGKQLLPWLLVLFAVSVIATWAAGFISHVSLVEILIPFCVMAIANGAIYPIVVAQALRPFPHATGRAAALQNTLQLGLCFLASLVVSWLISISTPLLTTTSVMLSTVVLVALGYMMQRCEEVGCQNHGNAEVAHSESH >CP031653.1|AXP26719.1|2308606_2309539_+|LysR-family-transcriptional-regulator MWSEYSLEVVDAVARNGSFSAAAQELHRVPSAVSYTVRQLEEWLAVPLFERRHRDVELTAAGAWFLKEGRSVVKKMQITRQQCQQMANGWRGQLAIAVDNIVRPERTRQMIVDFYRHFDDVELLVFQEVFNGVWDALSDGRVELAIGATRAIPVGGRYAFRDMGMLSWSCVVASHHPLALMDGPFSDDTLRNWPSLVREDTSRTLPKRITWLLDNQKRVVVPDWESSATCISAGLCIGMVPTHFAKPWLNEGKWVALELENPFPDSACCLTWQQNDMSPALTWLLEYLGDSETLNKEWLREPEETPATGD >CP031653.1|AXP26720.1|2309535_2310561_-|PurR-family-transcriptional-regulator MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVIDNAFEGGYMAGRYLIERGHREIGVIPGPLERNTGAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRPTAVFCGGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREEPQSIEVHPRLIERRSVADGPFRDYRR >CP031653.1|AXP26721.1|2310548_2310773_-|hypothetical-protein MISVFTTSPFRQDRPKFHAYTICVLAIDPFLTLRVVFPAYRNTFVVRKVCKGKRLPCDFAGAEVRVWSEMEWQQ >CP031653.1|AXP26722.1|2310859_2310949_+|YnhF-family-membrane-protein MSTDLKFSLVTTIIVLGLIVAVGLTAALH >CP031653.1|AXP26723.1|2311114_2312284_+|MFS-transporter MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRSALIFLMAIFTLGNVLSAIAPDYMTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTLANIGGVPAATWLGETIGWRMSFLATAGLGVISMVSLFFSLPKGGAGARPEVKKELAVLMRPQVLSALLTTVLGAGAMFTLYTYISPVLQSITHATPVFVTAMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEFGAAISMVVWGAATFAVVPPLQMRVMRVASEAPGLSSSVNIGAFNLGNALGAAAGGAVISAGLGYSFVPVMGAIVAGLALLLVFMSARKQPETVCVANS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP031653_9 | 3312050-3312194 | Orphan |
NA
Consensus repeat of CP031653_9
|
1 spacers
spacers of CP031653_9
>9.1|3312102|41|CP031653|CRISPRCasFinder TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC |
CRISPR arrays and Neighbor proteins around CP031653_9
The CRISPR arrays of CP031653_9 >merge|CP031653|9|3312050-3312194|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGCTGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTCGTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC >CP031653|9|8|3312050-3312194|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC
>CP031653.1|AXP27644.1|3310700_3311984_+|acyl-CoA-thioesterase MNTFSVSRLALALAFGVTLTACSSTPPDQRPSDQTAPGTSSRPILSAKEAQNFDAQHYFASLTPGAAAWNPSPITLPAQPDFVVGPAGTQGVTHTTIQAAVDAAIIKRTNKRQYIAVMPGEYQGTVYVPAAPGGITLYGTGEKPIDVKIGLSLDGGMSPADWRHDVNPRGKYMPGKPAWYMYDSCQSKRSDSIGVLCSAVFWSQNNGLQLQNLTIENTLGDSVDAGNHPAVALRTDGDQVQINNVNILGRQNTFFVTNSGVQNRLETNRQPRTLVTNSYIEGDVDIVSGRGAVVFDNTEFRVVNSRTQQEAYVFAPATLSNIYYGFLAVNSRFNAFGDGVAQLGRSLDVDANTNGQVVIRDSAINEGFNTAKPWADAVISNRPFAGNTGSVDDNDEIQRNLNDTNYNRMWEYNNRGVGSKVVAEAKK >CP031653.1|AXP27643.1|3309495_3310566_+|integrase MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >CP031653.1|AXP27642.1|3309299_3309518_+|excisionase MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >CP031653.1|AXP27641.1|3309092_3309260_+|hypothetical-protein MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >CP031653.1|AXP27640.1|3308974_3309160_-|hypothetical-protein MFSASITLLNGSPFHSPVTRKCIYHLHKTKPAVASSDKRNPRQCEDAVHCCYTLFCSQRKR >CP031653.1|AXP27639.1|3308668_3308920_-|hypothetical-protein MSAERAKIHAKNLRNFVYYCAVSNELFSAQKMDGKFVRVRKNFMGAKHEKRFVSLFDLHDSFRADLYFPFLLVAGRYLQGHVS >CP031653.1|AXP27638.1|3308247_3308778_-|hypothetical-protein MKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >CP031653.1|AXP27637.1|3307815_3308037_+|TraR/DksA-family-transcriptional-regulator MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >CP031653.1|AXP27636.1|3307435_3307717_+|cell-division-protein-ZapA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >CP031653.1|AXP27635.1|3307233_3307425_+|DUF1382-family-protein MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV >CP031653.1|AXP27645.1|3312217_3314479_-|hydratase MIKLSEKGVFLASNNEIIAEEHFTGEIKKEEAQKGTIAWSILSSHNTSGNMDKLKIKFDSLASHDITFVGIVQTAKASGMERFPLPYVLTNCHNSLCAVGGTINGDDHVFGLSAAQRYGGIFVPPHIAVIHQYMREMMAGGGKMILGSDSHTRYGALGTMAVGEGGGELVKQLLNDTWDIDYPGVVAVHLTGKPAPYVGPQDVALAIIGAVFKNGYVKNKVMEFVGPGVSALSTDFRNSVDVMTTETTCLSSVWQTDEEVHNWLALHGRGQDYCQLNPQPMAYYDGCISVDLSAIKPMIALPFHPSNVYKIDTLNQNLTDILREIEIESERVAHGKAKLSLLDKVENGRLKVQQGIIAGCSGGNYENVIAAANALRGQSCGNDTFSLAVYPSSQPVFMDLAQKGVVADLIGAGAIIRTAFCGPCFGAGDTPINNGLSIRHTTRNFPNREGSKPANGQMSAVALMDARSIAATAANGGYLTSASELDCWDNVPEYAFDVTPYKNRVYQGFVKGATQQPLIYGPNIKDWPELGALTDNIVLKVCSKILDEVTTTDELIPSGETSSYRSNPIGLAEFTLSRRDPGYVGRSKATAELENQRLAGNVSELTEVFARIKQIAGQEHIDPLQTEIGSMVYAVKPGDGSAREQAASCQRVIGGLANIAEEYATKRYRSNVINWGMLPLQMAEVPTFEVGDYIYIPGIKAALDNPCTTFKGYVIHEDAPVTEITLYMGSLTAEEREIIKAGSLINFNKNRQM >CP031653.1|AXP27646.1|3314661_3316095_-|anion-permease MNKKSLWKLILILAIPCIIGFMPAPAGLSELAWVLFGIYLAAIVGLVIKPFPEPVVLLIAVAASMVVVGNLSDGAFKTTAVLSGYSSGTTWLVFSAFTLSAAFVTTGLGKRIAYLLIGKIGNTTLGLGYVTVFLDLVLAPATPSNTARAGGIVLPIINSVAVALGSEPEKSPRRVGHYLMMSIYMVTKTTSYMFFTAMAGNILALKMINDILHLQISWGGWALAAGLPGIIMLLVTPLVIYTMYPPEIKKVDNKTIAKAGLAELGPMKIREKMLLGVFVLALLGWIFSKSLGVDESTVAIVVMATMLLLGIVTWEDVVKNKGGWNTLIWYGGIIGLSSLLSKVKFFEWLAEVFKNNLAFDGHGNVAFFVIIFLSIIVRYFFASGSAYIVAMLPVFAMLANVSGAPLMLTALALLFSNSYGGMVTHYGGAAGPVIFGVGYNDIKSWWLVGAVLTILTFLVHITLGVWWWNMLIGWNML >CP031653.1|AXP27647.1|3316170_3317223_-|4-oxalomesaconate-tautomerase MKKIPCVMMRGGTSRGAFLLAEHLPEDQTQRDKILMAIMGSGNDLEIDGIGGGNPLTSKVAIISRSSDLRADVDYLFAQVIVHEQRVDTTPNCGNMLSGVGAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQTPNGVVEYEGSARIDGVPGTAAPVALTFLNAAGTKTGKVFPTDNQIDYFDDVPVTCIDMAMPVVIIPAEYLGKTGYELPAELDADKALLARIESIRLQAGKAMGLGDVSNMVIPKPVLISPAQKGGAINVRYFMPHSCHRALAITGAIAISSSCALEGTVTRQIVPSVGYGNINIEHPSGALDVHLSNEGQDATTLRASVIRTTRKIFSGEVYLP >CP031653.1|AXP27648.1|3317406_3318360_+|LysR-family-transcriptional-regulator MKHELSSMKAFVILAESSSFNNAAKLLNITQPALTRRIKKMEEDLHIQLFERTTRKVTLTKAGKRLLPEARELIKKFDETLFNIRDMNAYHRGMVTLACIPTAVFYFLPLAIGKFNELYPNIKVRILEQGTNNCMESVLCNESDFGINMNNVTNSSIDFTPLVNEPFVLACRRDHPLAKKQLVEWQELVGYKMIGVRSSSGNRLLIEQQLADKPWKLDWFYEVRHLSTSLGLVEAGLGISALPGLAMPHAPYSSIIGIPLVEPVIRRTLGIIRRKDAVLSPAAERFFALLINLWTDDKDNLWTNIVERQRHALQEIG >CP031653.1|AXP27649.1|3318400_3319396_-|6-phosphogluconolactonase MKQTVYIASPESQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSPDKRYLYVGVRPEFRVLAYSIAPDDGALTFAAESALPGSPTHISTDHQGQFVFVGSYNAGNVSVTRLEDGLPVGVVDVVEGLDGCHSANISPDNRTLWVPALKQDRICLFTVSDDGHLVAQDPAEVTTVEGAGPRHMVFHPNEQYAYCVNELNSSVDVWELKDPHGNIECVQTLDMMPENFSDTRWAADIHITPDGRHLYACDRTASLITVFSVSEDGSVLSKEGFQPTETQPRGFNVDHSGKYLIAAGQKSHHISVYEIVGEQGLLHEKGRYAVGQGPMWVVVNAH >CP031653.1|AXP27650.1|3319550_3320369_+|pyridoxal-phosphatase MTTRVIALDLDGTLLTPKKTLLPSSIEALARAREAGYQLIIVTGRHHVAIHPFYQALALDTPAICCNGTYLYDYHAKTVLEADPMPVNKALQLIEMLNEHHIHGLMYVDDAMVYEHPTGHVIRTSNWAQTLPPEQRPTFTQVASLAETAQQVNAVWKFALTHDDLPQLQHFGKHVEHELGLECEWSWHDQVDIARGGNSKGKRLTKWVEAQGWSMENVVAFGDNFNDISMLEAAGTGVAMGNADDAVKARANIVIGDNTTDSIAQFIYSHLI >CP031653.1|AXP27651.1|3320369_3321428_-|molybdenum-import-ATP-binding-protein-ModC MLELNFSQTLGNHCLTINETLPANGITAIFGVSGAGKTSLINAISGLTRPQKGRIVLNGRVLNDAEKGICLTPEKRRVGYVFQDARLFPHYKVRGNLRYGMSKSMVDQFDKLVALLGIEPLLDRLPGSLSGGEKQRVAIGRALLTAPELLLLDEPLASLDIPRKRELLPYLQRLTREINIPMLYVSHSLDEILHLADRVMVLENGQVKAFGALEEVWGSSVMNPWLPKEQQSSILKVTVLEHHPHYAMTALALGDQHLWVNKLDEPLQAALRIRIQASDVSLVLQPPQQTSIRNVLRAKVVNSYDDNGQVEVELEVGGKTLWARISPWARDELAIKPGLWLYAQIKSVSITA >CP031653.1|AXP27652.1|3321430_3322120_-|molybdenum-ABC-transporter-permease MILTDPEWQAVLLSLKVSSLAVLFSLPFGIFFAWLLVRCTFPGKALLDSVLHLPLVLPPVVVGYLLLVSMGRRGFIGERLYDWFGITFAFSWRGAVLAAAVMSFPLMVRAIRLALEGVDVKLEQAARTLGAGRWRVFFTITLPLTLPGIIVGTVLAFARSLGEFGATITFVSNIPGETRTIPSAMYTLIQTPGGESGAARLCIISIALAMISLLISEWLARISRERAGR >CP031653.1|AXP27653.1|3322119_3322893_-|molybdate-ABC-transporter-substrate-binding-protein MARKWLNLFAGAALSFAVAGNALADEGKITVFAAASLTNAMQDIATQYKKEKGVDVVSSFASSSTLARQIEAGAPADLFISADQKWMDYAVDKKAIDTATRQTLLGNSLVVVAPKASEQKDFTIDSKTNWTSLLNGGRLAVGDPEHVPAGIYAKEALQKLGAWDTLSPKLAPAEDVRGALALVERNEAPLGIVYGSDAVASKGVKVVAIFPEDSHKKVEYPVAVVEGHNNATVKAFYDYLKGPQAAEIFKRYGFTTK >CP031653.1|AXP27654.1|3322914_3323124_-|hypothetical-protein MVKYSTSFLVLVKKTSPDKIIDSLNARLVGHFLFLNFLLFLPIFFLIYLTKVSNNCWENSELVVILSPT |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP031653_10 | 3844572-3844725 | Orphan |
NA
Consensus repeat of CP031653_10
|
1 spacers
spacers of CP031653_10
>10.1|3844625|48|CP031653|CRISPRCasFinder TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA |
CRISPR arrays and Neighbor proteins around CP031653_10
The CRISPR arrays of CP031653_10 >merge|CP031653|10|3844572-3844725|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCGTCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAACGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG >CP031653|10|9|3844572-3844725|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCG TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA CGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG
>CP031653.1|AXP28144.1|3842697_3844437_+|flagellar-type-III-secretion-system-protein-FlhA MLSRSDLLTLLTINFIVVTKGAERISEVSARFTLDAMPGKQMAIDADLNAGLINQAQAQTRRKDVASEADFYGAMDGASKFVRGDAIAGMMILAINLIGGVCIGIFKYNLSADAAFQQYVLMTIGDGLVAQIPSLLLSTAAAIIVTRISDNGDITHDVRHQLLASPSVLYTATGIMFVLAVVPGMPHLPFLLFSALLGFTGWRMSKRPQAAEAEEKSLETLTRTITETSEQQVSWETIPLIEPISLSLGYKLVALVDKAQGNPLTQRIRGVRQVISDGNGVLLPEIRIRENFRLKPSQYAIFINGIKADEADIPADKLMALPSSETYGEIDGVLGNDPAYGMPVTWIQPAQKAKALNMGYQVIDSASVIATHVNKIVRSYIPDLFSYDDITQLHNRLSSMAPRLAEDLSAALNYSQLLKVYRALLTEGVSLRDIVTIATVLVASSAVTKDHILLAADVRLALRRSITHPFVRKQELTVYTLNNELENLLTNVVNQAQQGGKVMLDSVPVDPNMLNQFQSTMPQVKEQMKAAGKDPVLLVPPQLRPLLARYARLFAPGLHVLSYNEVPDELELKIMGALM >CP031653.1|AXP29213.1|3841967_3842756_-|putative-lateral-flagellar-export/assembly-protein-LafU MVTTIKLIVNSVSKSERESIIAALHGQSIFSGGGLSPLNKISPSHPPKPATVAVPEETEKKARDVNEKTALLKKKSATELGELATSINTIARDAHMEANLEMEIVPQGLRVLIKDDQNRNMFECGSAQIMPFFKTLLVELAPVFDSLDNKIIITGHTDAMAYKNNIYNNWNLSGDRALSARRVLEEAGMPEDKVMQVSAMADQMLLDAKNPQSAGNRRIEIMVLTKSASDTLYQYFGQHGDKVVQPLVQKLDKQQVLSQRMR >CP031653.1|AXP28143.1|3840841_3841897_-|DNA-polymerase-IV MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARKFGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPLSLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELHLTASAGVAPVKFLAKIASDMNKPNGQFVITPAEVPAFLQTLPLAKIPGVGKVSAAKLEAMGLRTCGDVQKCDLVILLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMAEDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQEHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLGL >CP031653.1|AXP28142.1|3840392_3840845_-|GNAT-family-N-acetyltransferase MNNIQIRNYQPGDFQQLCAIFIRAVMMTASQHYSPQQIAAWAQIDESRWKEKLAKSQVRVAVINAQPVGFISRIERHIDMLFVDPEYTRRGVASALLKPLIKSESELTVDASITAKPFFERYGFQIVKQQHVECRGAWFTNFYMRYKPQH >CP031653.1|AXP28141.1|3840072_3840300_+|hypothetical-protein MYHSIYTRHTSSCMYCDCVRSPQSLTLVSSWRFTRLSPSCNSNYLEYINIKNQEMADESRARKKELILERSAKSH >CP031653.1|AXP28140.1|3839819_3840086_-|hypothetical-protein MEWYMGKYIRPLSDAVFTIASDDLWIESLAIQQLHTTANLPNMQRVVGMPDLHPGRGYPIGAAFFSVGRFYPARRRGNGAGNRNGPLL >CP031653.1|AXP28139.1|3838005_3839463_+|cytosol-nonspecific-dipeptidase MSELSQLSPQPLWDIFAKICSIPHPSYHEEQLAEYIVGWAKEKGFHVERDQVGNILIRKPATAGMENRKPVVLQAHLDMVPQKNNDTVHDFTKDPIQPYIDGEWVKARGTTLGADNGIGMASALAVLADENVVHGPLEVLLTMTEEAGMDGAFGLQSNWLQADILINTDSEEEGEIYMGCAGGIDFTSNLHLDREAVPAGFETFKLTLKGLKGGHSGGEIHVGLGNANKLLVRFLAGHAEELDLRLIDFNGGTLRNAIPREAFATIAVAADKVDALKSLVNTYQDILKNELAEKEKNLALLLDSVANDKAALIAKSRDTFIRLLNATPNGVIRNSDVAKGVVETSLNVGVVTMTDNNVEIHCLIRSLIDSGKDYVVSMLDSLGKLAGAKTEAKGAYPGWQPDANSPVMHLVRETYQRLFNKTPNIQIIHAGLECGLFKKPYPEMDMVSIGPTITGPHSPDEQVHIKSVGHYWTLLTELLKEIPAK >CP031653.1|AXP28138.1|3837286_3837745_-|xanthine-phosphoribosyltransferase MSEKYIVTWDMLQIHARKLASRLMPSEQWKGIIAVSRGGLVPGALLARELGIRHVDTVCISSYDHDNQRELKVLKRAEGDGEGFIVIDDLVDTGGTAVAIREMYPKAHFVTIFAKPAGRPLVDNYVVDIPQDTWIEQPWDMGVVFVPPISGR >CP031653.1|AXP28137.1|3835950_3837195_-|esterase MTQANLSETLFKPRFKHPETSTLVRRFNHGAQPPVQSALDGKTIPHWYRMINRLMWIWRGIDPREILDVQARIVMSDAERTDDDLYDTVIGYRGGNWIYEWATQAMVWQQKACAEEDPQLSGRHWLHAATLYNIAAYPHLKGDDLAEQAQALSNRAYEEAAQRLPGTMRQMEFTVPGGAPITGFLHMPKGDGPFPTVLMCGGLDAMQTDYYSLYERYFAPRGIAMLTIDMPSVGFSSKWKLTQDSSLLHQHVLKALPNVPWVDHTRVAAFGFRFGANVAVRLAYLESPRLKAVACLGPVVHTLLSDFKCQQQVPEMYLDVLASRLGMHDASDDALRVELNRYSLKVQGLLGRRCPTPMLSGYWKNDPFSPEEDSRLITSSSADGKLLEIPFNPVYRNFDKGLQEITGWIEKRLC >CP031653.1|AXP28136.1|3835491_3835893_-|sigma-factor-binding-protein-Crl MTLPSGHPKSRLIKKFTALGPYIREGKCEDNRFFFDCLAVCVNVKPAPEVREFWGWWMELEAQESRFTYSYQFGLFDKAGDWKSVPVKDTEVVERLEHTLREFHEKLRELLTTLNLKLEPADDFRDEPVKLTA >CP031653.1|AXP28145.1|3844754_3845252_-|transposase MSEYRRYYIKGGTWFFTVNLRNRRSQLLTTQYQMLRHAIIKVKRDRPFEINAWVVLPEHMHCIWTLPEGDDDFSSRWREIKKQFTHACGLKNIWQPRFWEHAIRNTKDYRHHVDYIYINPVKHGWVKQVSDWPFSTFHRDVARGLYPIDWAGDVTDINAGERIIL >CP031653.1|AXP28146.1|3845427_3846186_-|peptidoglycan-endopeptidase MSFMSSFLLGRFLHPGVFSLCVLLPLFASATTSHISFSYAARQRMQNRARLLKQYQTHLKKQASYIVEGNAESRRALRQHNREQIKQHPEWFPAPLKASDRRWQALAENNHFLSSDHLHNITEVAIHRLEQQLGKPYVWGGTRPDQGFDCSGLVFYAYNKILEAKLPRTANEMYHYHRATIVANNDLRRGDLLFFHIHSREIADHMGVYLGDGQFIESPRTGENIRVSRLAEPFWQDHFLGARRILTEETIL >CP031653.1|AXP28147.1|3846477_3847218_+|transpeptidase MRKIALILAMLLIPCVSFAGLLGSSSSTTPVSKEYKQQLMGSPVYIQIFKEERTLDLYVKMGEQYQLLDSYKICKYSGGLGPKQRQGDFKSPEGFYSVQRNQLKPDSRYYKAINIGFPNAYDRAHGYEGKYLMIHGDCVSIGCYAMTNQGIDEIFQFVTGALVFGQPSVQVSIYPFRMTDANMKRHKYSNFKDFWEQLKPGYDYFEQTRKPPTVSVVNGRYVVSKPLSHEVVQPQLASNYTLPEAK >CP031653.1|AXP28148.1|3847188_3847956_-|class-II-glutamine-amidotransferase MCELLGMSANVPTDICFSFTGLVQRGGGTGPHKDGWGITFYEGKGCRTFKDPQPSFNSPIAKLVQDYPIKSCSVVAHIRQANRGEVALENTHPFTRELWGRNWTYAHNGQLTGYKSLETGNFRPVGETDSEKAFCWLLHKLTQRYPRTPGNMAAVFKYIASLADELRQKGVFNMLLSDGRYVMAYCSTNLHWITRRAPFGVATLLDQDVEIDFSSQTTPNDVVTVIATQPLTGNETWQKIMPGEWRLFCLGERVV >CP031653.1|AXP28149.1|3848161_3848740_-|phosphoheptose-isomerase MYQDLIRNELNEAAETLANFLKDDANIHAIQRAAVLLADSFKAGGKVLSCGNGGSHCDAMHFAEELTGRYRENRPGYPAIAISDVSHISCVGNDFGFNDIFSRYVEAVGREGDVLLGISTSGNSANVIKAIAAAREKGMKVITLTGKDGGKMAGTADIEIRVPHFGYADRIQEIHIKVIHILIQLIEKEMVK >CP031653.1|AXP28150.1|3848979_3851424_+|acyl-CoA-dehydrogenase MMILSILATVVLLGALFYHRVSLFISSLILLAWTAALGVAGLWSAWVLVPLAIILVPFNFAPMRKSMISAPVFRGFRKVMPPMSRTEKEAIDAGTTWWEGDLFQGKPDWKKLHNYPQPRLTAEEQAFLDGPVEEACRMANDFQITHELADLPPELWAYLKEHRFFAMIIKKEYGGLEFSAYAQSRVLQKLSGVSGILAITVGVPNSLGPGELLQHYGTDEQKNHYLPRLARGQEIPCFALTSPEAGSDAGAIPDTGIVCMGEWQGQQVLGMRLTWNKRYITLAPIATVLGLAFKLSDPEKLLGGAEDLGITCALIPTTTPGVEIGRRHFPLNVPFQNGPTRGKDVFVPIDYIIGGPKMAGQGWRMLVECLSVGRGITLPSNSTGGVKSVALATGAYAHIRRQFKISIGKMEGIEEPLARIAGNAYVMDAAASLITYGIMLGEKPAVLSAIVKYHCTHRGQQSIIDAMDITGGKGIMLGQSNFLARAYQGAPIAITVEGANILTRSMMIFGQGAIRCHPYVLEEMEAAKNNDVNAFDKLLFKHIGHVGSNKVRSFWLGLTRGLTSSTPTGDATKRYYQHLNRLSANLALLSDVSMAVLGGSLKRRERISARLGDILSQLYLASAVLKRYDDEGRNEADLPLVHWGVQDALYQAEQAMDDLLQNFPNRVVAGLLNVVIFPTGRHYLAPSDKLDHKVAKILQVPNATRSRIGRGQYLTPSEHNPVGLLEEALVDVIAADPIHQRICKELGKNLPFTRLDELAHNALAKGLIDKDEAAILVKAEESRLCSINVDDFDPEELATKPVKLPEKVRKVEAA >CP031653.1|AXP28151.1|3851466_3851940_-|inhibitor-of-vertebrate-lysozyme MGRISSGGMMFKAITTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQMVQGHKLPAWVMKGGTYTPAQTVTLGDETYQVMSACKPHDCGSQRIAVMWSEKSNQMTGLFSTIDEKTSQEKLTWLNVNDALSIDGKTVLFAALTGSLENHPDGFNFK >CP031653.1|AXP28152.1|3852093_3852864_+|amidohydrolase MPGLKITLLQQPLVWMDGPANLRHFDRQLEGITGRDVIVLPEMFTSGFAMEAAASSLAQNDVVNWMTAKAQQCNALIAGSVALQTESGSVNRFLLVEPGGTVHFYDKRHLFRMADEHLHYKAGNARVIVEWRGWRILPLVCYDLRFPVWSRNLNDYDLAIYVANWPAPRSLHWQALLTARAIENQAYVAGCNRVGSDGNGCHYRGDSRVINPQGEIIATADAHQATRIDAELSMVALREYREKFPAWQDADEFRLR >CP031653.1|AXP28153.1|3854302_3854752_-|hypothetical-protein MMKYLMVLLSLFSGSVLGMGRVNELCGIDSVKTIEIINLPSYVTTLVPLSKEGLNEIYRYKVVVNEISDLYAGKIIDLLQMKYFRKEKYNNIRWGVSIISKGNNKCEIYFDAFGECGSVNGINVCFEKNEMIGWIKKEIPLLSQKIGGL >CP031653.1|AXP29214.1|3854763_3857823_-|RHS-repeat-protein MTSPLNSEGRYTEGEGGLKRVVKKEHADGSITRSEYDEAGRLKAQTDAAGRRTEYSLHMASGAVTAVTGPDGRTVRYGYNSQRQVTSVTYPDGLRSSREYDEKGRLTAETSRSGETTRYSYDDPASELPTGIQDATGSTKQMAWSRYGQLLAFTDCSGYTTRYEYDRYGQQIAVHREEGISTYSSYNPRGQLVSQKDAQGREIRYEYSAAGDLTATISPDGKRSTIEYDKRGRPVSVTEGGLTRSMGYDAAGRITVLTNENGSQSTFRYDPVDRLTEQRGFDGRTQRYHYDLTGKLTQSEDEGLITLWHYDASDRITHRTVNGDPAEQWQYDEHGWLTTLSHTCEGHRVSVHYGYDDKGRLTGERQTVENPETGEMLWEHETGHAYSEQGLATRQEPDGLPPVEWLTYGSGYLAGMKLGGTPLVEYTRDRLHRETARSFGGAGSTAGYEQATAYTLTGQLQSRHLNLPQLDCDYTWNDNGQLVRISGPQECREYRYSGTGRLTGVHTTAANLDIDIPYATDPAGNRLPDPELHPDSTLTAWPDNRIAEDAHYVYRYDEYGRLAEKTDRIPEGVIRMHDERTHHYHYDSQHRLVFYTRIQHGEPQVESRYLYDPLGRRTGKRVWRRERDLTGWMSLSRKPEETWYGWDGDRLTTVQTQQTRIQTVYQPGSFTPLLRIETENGEQAKARHRSLAEVLQEDTGVTLPAELAVMLGRLERELRQGSVSEESQQWLAQCGLTAEQMAAQLEAEYIPERKLHLYHCDHRGLPLALISPEGETAWQGEYDEWGNLLGEESAQHLQQSLRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLRGEWNLYKYPLNPVRFIDSLGLKFHVNGDPSDFNQAVEYLKQDSQMKETIDFLSSSEETINIEYIEGTNVRFNSNNMAIYWNSRASLFCSTELNSKSQSPALGLGHEFAHAQYYLLDKENFMALLSRTDKKYENKEEARVITIIESRAAKTLGECTRGAHSGLPFYRVDGPLQTMKITGTPE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP031653_11 | 4034753-4034868 | Orphan |
NA
Consensus repeat of CP031653_11
|
1 spacers
spacers of CP031653_11
>11.1|4034784|54|CP031653|CRISPRCasFinder TGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTC |
CRISPR arrays and Neighbor proteins around CP031653_11
The CRISPR arrays of CP031653_11 >merge|CP031653|11|4034753-4034868|CRISPRCasFinder AACGCCTGATGCGACGCTGACGCGTCTTATCTGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTCAACGCCTGATGCGACGCTGGCGCGTCTTATC >CP031653|11|10|4034753-4034868|CRISPRCasFinder AACGCCTGATGCGACGCTGACGCGTCTTATC TGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTC AACGCCTGATGCGACGCTGGCGCGTCTTATC
>CP031653.1|AXP28311.1|4033230_4034733_+|L-arabinose-isomerase MTIFDNYEVWFVIGSQHLYGPETLRQVTQHAEHVVNALNTEAKLPCKLVLKPLGTTPDEITAICRDANYDDRCAGLVVWLHTFSPAKMWINGLTMLNKPLLQFHTQFNAALPWDSIDMDFMNLNQTAHGGREFGFIGARMRQQHAVVTGHWQDKQAHERIGSWMRQAVSKQDTRHLKVCRFGDNMREVAVTDGDKVAAQIKFGFSVNTWAVGDLVQVVNSISDGDVNALVDEYESCYTMTPATQIHGEKRQNVLEAARIELGMKRFLEQGGFHAFTTTFEDLHGLKQLPGLAVQRLMQQGYGFAGEGDWKTAALLRIMKVMSTGLQGGTSFMEDYTYHFEKGNDLVLGSHMLEVCPSIAVEEKPILDVQHLGIGGKDDPARLIFNTQTGPAIVASLIDLGDRYRLLVNCIDTVKTPHSLPKLPVANALWKAQPDLPTASEAWILAGGAHHTVFSHALNLNDMRQFAEMHDIEITVIDNDTRLPAFKDALRWNEVYYGFRR >CP031653.1|AXP28310.1|4031519_4033220_+|ribulokinase MAIAIGLDFGSDSVRALAVDCASGEEIATSVEWYPRWQKGQFCDAPNNQFRHHPRDYIESMEAALKTVLAELSVEQRAAVVGIGVDTTGSTPAPIDADGNVLALRPEFAENPNAMFVLWKDHTAVEEAEEITRLCHAPGNVDYSRYIGGIYSSEWFWAKILHVTRQDSAVAQSAASWIELCDWVPALLSGTTGPQDIRRGRCSAGHKSLWHESWGGLPPASFFDELDPILNRHLPSPLFTDTWTADIPVGTLCPEWAQRLGLPESVVISGGAFDCHMGAVGAGAQPNALVKVIGTSTCDILIADKQSVGERAVKGICGQVDGSVVPGFIGLEAGQSAFGDIYAWFGRVLGWPLEQLAAQHPELKAQINASQKQLLPALTEAWAKNPSLDHLPVVLDWFNGRRTPNANQRLKGVITDLNLATDAPLLFGGLIAATAFGARAIMECFTDQGIAVNNVMALGGIARKNQVIMQACCDVLNRPLQIVASDQCCALGAAIFAAVAAKVHADIPSAQQKMASAVEKTLQPCSEQAQRFEQLYRRYQQWAMSAEQHYLPTSAPAQAAQAVPTL >CP031653.1|AXP29218.1|4030302_4031181_-|arabinose-operon-regulatory-protein MAEAQNDPLLPGYSFNAHLVAGLTPIEANGYLDFFIDRPLGMKGYILNLTIRGQGVVKNQGREFVCRPGDILLFPPGEIHHYGRHPEAREWYHQWVYFRPRAYWHEWLNWPSIFANTGFFRPDEAHQPHFSDLFGQIINAGQGEGRYSELLAINLLEQLLLRRMEAINESLHPPMDNRVREACQYISDHLADSNFDIASVAQHVCLSPSRLSHLFRQQLGISVLSWREDQRISQAKLLLSTTRMPIATVGRNVGFDDQLYFSRVFKKCTGASPSEFRAGCEEKVNDVAVKLS >CP031653.1|AXP28309.1|4029452_4030217_-|DedA-family-protein MQALLEHFITQSTVYSLMAVVLVAFLESLALVGLILPGTVLMAGLGALIGSGELSFWHAWLAGIVGCLLGDWISFWLGWRFKKPLHRWSFLKKNKALLDKTEHALHQHSMFTILVGRFVGPTRPLVPMVAGMLDLPVAKFITPNIIGCLLWPPFYFLPGILAGAAIDIPAGMQSGEFKWLLLATAVFLWVGGWLCWRLWRSGKATDRLSHYLSRGRLLWLTPLISAIGVVALVVLIRHPLMPVYIDILRKVVGG >CP031653.1|AXP28308.1|4028640_4029339_+|thiamine-ABC-transporter-ATP-binding-protein-ThiQ MLKLTDITWLYHHLPMRFSLTVERGEQVAILGPSGAGKSTLLNLIAGFLTPASGSLTIDGVDHTTTPPSRRPVSMLFQENNLFSHLTVAQNIGLGLNPGLKLNAAQQEKMHAIARQMGIDNLMARLPGELSGGQRQRVALARCLVREQPILLLDEPFSALDPALRQEMLTLVSTSCQQQKMTLLMVSHSVEDAARIATRSVVVADGRIAWQGKTNELLSGKASASALLGITG >CP031653.1|AXP28307.1|4027046_4028657_+|thiamine/thiamine-pyrophosphate-ABC-transporter-permease-ThiP MATRRQPLIPGWLIPGVSAATLVVAVALAAFLALWWNAPQGNWVAVWQDSYLWHVVRFSFWQAFLSALLSVVPAIFLARALYRRRFPGRLALLRLCAMTLILPVLVAVFGILSVYGRQGWLASLCQSLGLEWTFSPYGLQGILLAHVFFNLPMASRLLLQALENIPGEQRQLAAQLGMRGWHFFRFVEWPWLRRQIPPVAALIFMLCFASFATVLSLGGGPQATTIELAIYQALSYDYDPARAAMLALIQMVCCLGLVLLSQRLSKAIAPGTTLLQGWRDPDDRLHSRICDTVLIVLALLLLLPPLLAVIVDGVNRQLPEVLAQPVLWQALWTSLRIALAAGVLCVVLTMMLLWSSRELRARQKMLAGQALEMSGMLILAMPGIVLATGFFLLLNNTIGLPQSADGIVIFTNALMAIPYALKVLENPMRDITARYSMLCQSLGIEGWSRLKVVELRALKRPLAQALAFACVLSIGDFGVVALFGNDDFRTLPFYLYQQIGSYRSQDGAVTALILLLLCFLLFTVIEKLPGRNVKTD >CP031653.1|AXP28306.1|4026087_4027071_+|thiamine-ABC-transporter-substrate-binding-subunit MLKKCLPLLLLCTAPVFAKPVLIVYTYDSFAADWGPGPKIKKAFEADCNCELKLVALEDGVSLLNRLRMEGKNSKADVVLGLDNNLLDAASKTGLFAKSGVAADAVNVPGGWNNDTFVPFDYGYFAFVYDKNKLKNPPQSLKELVESDQNWRVIYQDPRTSTPGLGLLLWMQKVYGDDAPQAWQKLAKKTVTVTKGWSEAYGLFLKGESDLVLSYTTSPAYHILEEKKDNYAAANFSEGHYLQVEVAARTAASKQPELAQKFLQFMVSPAFQNAIPTGNWMYPVANVTLPAGFEQLTKPATTLEFTPAEVAAQRQAWISEWQRAVSR >CP031653.1|AXP28305.1|4024268_4025924_+|transcriptional-regulator MPSARLQQQFIRLWQCCEGKSQDTTLNELAALLSCSRRHMRTLLNTMQDRGWLTWEAEVGRGKRSRLTFLYTGLALQQQRAEDLLEQDRIDQLVQLVGDKATVRQMLVSHLGRSFRQGRHILRVLYYRPLRNLLPGSALRRSETHIARQIFSSLTRINEENGELEADIAHHWQQISPLHWRFFLRPGVHFHHGRELEMDDVIASLKRINTLPLYSHIADIVSPTPWTLDIHLTQPDRWLPLLLGQVPAMILPREWETLSNFASHPIGTGPYAVIRNTTNQLKIQAFDDFFGYRALIDEVNVWVLPEIADEPAGGLMLKGPQGEEKEIESRLEEGCYYLLFDSRTHRGANQQVRDWVSYVLSPTNLVYFAEEQYQQLWFPAYGLLPRWHHARTIKSEKPAGLESLTLTFYQDHSEHRVIAGIMQQILASHQVTLEIKEISYDQWHEGEIESDIWLNSANFTLPLDFSLFAHLCEVPLLQHCIPIDWQADAARWRNGEMNLANWCQQLVASKAMVPLIHHWLIIQGQRSMRGLRMNTLGWFDFKSAWFAPPDP >CP031653.1|AXP28304.1|4024048_4024180_-|glucose-uptake-inhibitor-SgrT MRQFYQHYFTATAKLCWLRWLSVPQRLTMLEGLMQWDDRNSES >CP031653.1|AXP28303.1|4022768_4023947_-|MFS-transporter MIWIMTMARRMNGVYAAFMLVAFMMGVAGALQAPTLSLFLSREVGAQPFWIGLFYTVNAIAGIGVSLWLAKRSDSQGDRRKLIIFCCLMAIGNALLFAFNRHYLTLITCGVLLASLANTAMPQLFALAREYADNSAREVVMFSSVMRAQLSLAWVIGPPLAFMLALNYGFTVMFSIAAGIFTLSLVLIAFMLPSVARVELPSENALSMQGGWQDSNVRMLFVASTLMWTCNTMYIIDMPLWISSELGLPDKLAGFLMGTAAGLEIPAMILAGYYVKRYGKRRMMVIAVAAGVLFYTGLIFFHSRMALMTLQLFNAVFIGIVAGIGMLWFQDLMPGRAGAATTLFTNSISTGVILAGVIQGAIAQSWGHFAVYWVIAVISVVALFLTAKVKDV >CP031653.1|AXP28312.1|4034932_4035628_+|L-ribulose-5-phosphate-4-epimerase MLEDLKRLVLEANLALPKHNLVTLTWGNVSAVDRERGVFVIKPSGVDYSVMTADDMVVVSIATGEVVEGTKKPSSDTPTHRLLYQAFPSIGGIVHTHSRHATIWAQAGQSIPATGTTHADYFYGTIPCTRKMTDAEINGEYEWETGNVIVETFEKQGIDAAQMPGVLVHSHGPFAWGKNAEDAVHNAIVLEEVAYMGIFCRQLAPQLPDMQQTLLDKHYLRKHGAKAYYGQ >CP031653.1|AXP28313.1|4035702_4038054_+|DNA-polymerase-II MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQVPRAQHILQGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGGVTVYEADVRPPERYLMERFITSPVWVEGDIRNGAIVNARLKPHPDYRPPLKWVSIDIETTRHGELYCIGLEGCGQRIVYMLGPENGDASALDFELEYVASRPQLLEKLNAWFANYDPDVIIGWNVVQFDLRMLQKHAERYRIPLRLGRDNSELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQELLGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMPFLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHASPGGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHSTEGFLDAWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALKIIMNAFYGVLGTTACRFFDPRLASSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTFVWLKGAHSEEEAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCRFLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQELYLRIFRNEPYQEYIRETIDKLMAGELDARLVYRKRLRRPLSEYQRNVPPHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYEHYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF >CP031653.1|AXP28314.1|4038218_4041125_+|RNA-polymerase-associated-protein-RapA MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPVTRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREVFLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQRTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAAERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQLVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIEQLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYRPVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSARQELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKVSGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTSHRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWFAEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRIGQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDLINYLASPDQTEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSNGGEKAQALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPDFPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSSTISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGNNLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSARALIDAARNEADEKLSAELSRLEALRAVNPNIRDDELTAIESNRQQVMESLDQAGWRLDALRLIVVTHQ >CP031653.1|AXP28315.1|4041136_4041796_+|bifunctional-tRNA-pseudouridine(32)-synthase/ribosomal-large-subunit-pseudouridine-synthase-RluA MGMENYNPPQEPWLVILYQDDHIMVVNKPSGLLSVPGRLEEHKDSVMTRIQRDYPQAESVHRLDMATSGVIVVALTKAAERELKRQFREREPKKQYVARVWGHPSPAEGLVDLPLICDWPNRPKQKVCYETGKPAQTEYEVVEYAADNTARVVLKPITGRSHQLRVHMLALGHPILGDRFYASPEARAMAPRLLLHAEMLTITHPAYGNSMTFKAPADF >CP031653.1|AXP28316.1|4041912_4042728_-|molecular-chaperone-DjlA MQYWGKIIGVAVALLMGGGFWGVVLGLLIGHMFDKARSRKMAWFANQRERQALFFATTFEVMGHLTKSKGRVTEADIHIASQLMDRMNLHGASRTAAQNAFRVGKSDNYPLREKMRQFRSVCFGRFDLIRMFLEIQIQAAFADGSLHPNERAVLYVIAEELGISRAQFDQFLRMMQGGAQFGGGYQQQTGGGNWQQAQRGPTLEDACNVLGVKPTDDATTIKRAYRKLMSEHHPDKLVAKGLPPEMMEMAKQKAQEIQQAYELIKQQKGFK >CP031653.1|AXP28317.1|4042982_4045337_+|LPS-assembly-protein-LptD MKKRIPTLLATMIATALYSQQGLAADLASQCMLGVPSYDRPLVQGDTNDLPVTINADHAKGDYPDDAVFTGSVDIMQGNSRLQADEVQLHQKEAPGQPEPVRTVDALGNVHYDDNQVILKGPKGWANLNTKDTNVWEGDYQMVGRQGRGKADLMKQRGENRYTILDNGSFTSCLPGSDTWSVVGSEIIHDREEQVAEIWNARFKVGPVPIFYSPYLQLPVGDKRRSGFLIPNAKYTTTNYFEFYLPYYWNIAPNMDATITPHYMHRRGNIMWENEFRYLSQAGAGLMELDYLPSDKVYEDEHPNDDSSRRWLFYWNHSGVMDQVWRFNVDYTKVSDPSYFNDFDNKYGSSTDGYATQKFSVGYAVQNFNATVSTKQFQVFSEQNTSSYSAEPQLDVNYYQNDVGPFDTRIYGQAVHFVNTRDDMPEATRVHLEPTINLPLSNNWGSINTEAKLLATHYQQTNLDWYNSRNTTKLDESVNRVMPQFKVDGKMVFERDMEMLAPGYTQTLEPRAQYLYVPYRDQSDIYNYDSSLLQSDYSGLFRDRTYGGLDRIASANQVTTGVTSRIYDDAAVERFNISVGQIYYFTESRTGDDNITWENDDKTGSLVWAGDTYWRISERWGLRGGIQYDTRLDNVATSNSSIEYRRDEDRLVQLNYRYASPEYIQATLPKYYSTAEQYKNGISQVGAVASWPIADRWSIVGAYYYDTNANKQADSMLGVQYSSCCYAIRVGYERKLNGWDNDKQHAVYDNAIGFNIELRGLSSNYGLGTQEMLRSNILPYQNSL >CP031653.1|AXP28318.1|4045389_4046676_+|chaperone-SurA MKNWKTLLLGIAMIANTSFAAPQVVDKVAAVVNNGVVLESDVDGLMQSVKLNAAQARQQLPDDATLRHQIMERLIMDQIILQMGQKMGVKISDEQLDQAIANIAKQNNMTLDQMRSRLAYDGLNYNTYRNQIRKEMIISEVRNNEVRRRITILPQEVESLAQQVGNQNDASTELNLSHILIPLPENPTSDQVNEAESQARAIVDQARNGADFGKLAIAHSADQQALNGGQMGWGRIQELPGIFAQALSTAKKGDIVGPIRSGVGFHILKVNDLRGESKNISVTEVHARHILLKPSPIMTDEQARVKLEQIAADIKSGKTTFAAAAKEFSQDPGSANQGGDLGWATADIFDPAFRDALTRLNKGQMSAPVHSSFGWHLIELLDTRNVDKTDAAQKDRAYRMLMNRKFSEEAASWMQEQRASAYVKILSN >CP031653.1|AXP28319.1|4046675_4047665_+|4-hydroxythreonine-4-phosphate-dehydrogenase-PdxA MVKTQRVVITPGEPAGIGPDLVVQLAQREWPVELVVCADATLLTDRAAMLGLPLTLRTYSPNSPAQPQTAGTLTLLPVALRESVTAGQLAVENGHYVVETLARACDGCLNGEFAALITGPVHKGVINDAGIPFTGHTEFFEERSQAKKVVMMLATEELRVALATTHLPLRDIADAITPALLHEVIAILHHDLRTKFGIAEPRILVCGLNPHAGEGGHMGTEEIDTIIPLLDELRAQGMKLNGPLPADTLFQPKYLDNADAVLAMYHDQGLPVLKYQGFGRGVNITLGLPFIRTSVDHGTALELAGRGEADVGSFITALNLAIKMIVNTQ >CP031653.1|AXP28320.1|4047661_4048483_+|ribosomal-RNA-small-subunit-methyltransferase-A MNNRVHQGHLARKRFGQNFLNDQFVIDSIVSAINPQKGQAMVEIGPGLAALTEPVGERLDQLTVIELDRDLAARLQTHPFLGPKLTIYQQDAMTFNFGELAEKMGQPLRVFGNLPYNISTPLMFHLFSYTDAIADMHFMLQKEVVNRLVAGPNSKAYGRLSVMAQYYCNVIPVLEVPPSAFTPPPKVDSAVVRLVPHATMPHPVKDVRVLSRITTEAFNQRRKTIRNSLGNLFSVEVLTGMGIDPAMRAENISVAQYCQMANYLAENAPLQES >CP031653.1|AXP28321.1|4048485_4048863_+|Co2+/Mg2+-efflux-protein-ApaG MINSPRVCIQVQSVYIEAQSSPDNERYVFAYTVTIRNLGRAPVQLLGRYWLITNGNGRETEVQGEGVVGVQPLIAPGEEYQYTSGAIIETPLGTMQGHYEMIDENGVPFSIDIPVFRLAVPTLIH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP031653_8 | 8.1|3024209|40|CP031653|CRISPRCasFinder | 3024209-3024248 | 40 | NZ_CP041417 | Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence | 47951-47990 | 0 | 1.0 |
CP031653_7 | 7.1|2301883|38|CP031653|CRISPRCasFinder | 2301883-2301920 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 113727-113764 | 2 | 0.947 |
CP031653_10 | 10.1|3844625|48|CP031653|CRISPRCasFinder | 3844625-3844672 | 48 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 4089-4136 | 3 | 0.938 |
CP031653_10 | 10.1|3844625|48|CP031653|CRISPRCasFinder | 3844625-3844672 | 48 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 4088-4135 | 3 | 0.938 |
CP031653_10 | 10.1|3844625|48|CP031653|CRISPRCasFinder | 3844625-3844672 | 48 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 4088-4135 | 3 | 0.938 |
CP031653_10 | 10.1|3844625|48|CP031653|CRISPRCasFinder | 3844625-3844672 | 48 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 4088-4135 | 3 | 0.938 |
CP031653_1 | 1.1|708118|42|CP031653|CRISPRCasFinder | 708118-708159 | 42 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 30214-30255 | 7 | 0.833 |
CP031653_1 | 1.1|708118|42|CP031653|CRISPRCasFinder | 708118-708159 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24899-24940 | 8 | 0.81 |
CP031653_5 | 5.1|1143578|33|CP031653|PILER-CR | 1143578-1143610 | 33 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62681-62713 | 8 | 0.758 |
CP031653_5 | 5.6|1143883|33|CP031653|PILER-CR | 1143883-1143915 | 33 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530640-530672 | 8 | 0.758 |
CP031653_5 | 5.9|1143579|32|CP031653|CRISPRCasFinder,CRT | 1143579-1143610 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62713 | 8 | 0.75 |
CP031653_5 | 5.9|1143579|32|CP031653|CRISPRCasFinder,CRT | 1143579-1143610 | 32 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222137 | 8 | 0.75 |
CP031653_5 | 5.9|1143579|32|CP031653|CRISPRCasFinder,CRT | 1143579-1143610 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467671-2467702 | 8 | 0.75 |
CP031653_5 | 5.9|1143579|32|CP031653|CRISPRCasFinder,CRT | 1143579-1143610 | 32 | NC_008759 | Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence | 12670-12701 | 8 | 0.75 |
CP031653_5 | 5.11|1143701|32|CP031653|CRISPRCasFinder,CRT | 1143701-1143732 | 32 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17977-18008 | 8 | 0.75 |
CP031653_5 | 5.11|1143701|32|CP031653|CRISPRCasFinder,CRT | 1143701-1143732 | 32 | NZ_CP017753 | Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence | 97497-97528 | 8 | 0.75 |
CP031653_5 | 5.14|1143884|32|CP031653|CRISPRCasFinder,CRT | 1143884-1143915 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148991-149022 | 8 | 0.75 |
CP031653_5 | 5.14|1143884|32|CP031653|CRISPRCasFinder,CRT | 1143884-1143915 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530640-530671 | 8 | 0.75 |
CP031653_5 | 5.15|1143945|32|CP031653|CRISPRCasFinder,CRT | 1143945-1143976 | 32 | NZ_CP006991 | Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence | 532343-532374 | 8 | 0.75 |
CP031653_1 | 1.1|708118|42|CP031653|CRISPRCasFinder | 708118-708159 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24786-24827 | 9 | 0.786 |
CP031653_5 | 5.1|1143578|33|CP031653|PILER-CR | 1143578-1143610 | 33 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86181-86213 | 9 | 0.727 |
CP031653_5 | 5.3|1143700|33|CP031653|PILER-CR | 1143700-1143732 | 33 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17976-18008 | 9 | 0.727 |
CP031653_5 | 5.11|1143701|32|CP031653|CRISPRCasFinder,CRT | 1143701-1143732 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 405875-405906 | 9 | 0.719 |
CP031653_5 | 5.14|1143884|32|CP031653|CRISPRCasFinder,CRT | 1143884-1143915 | 32 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14984 | 9 | 0.719 |
CP031653_5 | 5.14|1143884|32|CP031653|CRISPRCasFinder,CRT | 1143884-1143915 | 32 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15014 | 9 | 0.719 |
CP031653_5 | 5.14|1143884|32|CP031653|CRISPRCasFinder,CRT | 1143884-1143915 | 32 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3485 | 9 | 0.719 |
CP031653_5 | 5.15|1143945|32|CP031653|CRISPRCasFinder,CRT | 1143945-1143976 | 32 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35771 | 9 | 0.719 |
CP031653_4 | 4.9|1121513|32|CP031653|CRT | 1121513-1121544 | 32 | NZ_CP030933 | Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence | 51062-51093 | 10 | 0.688 |
CP031653_5 | 5.2|1143639|33|CP031653|PILER-CR | 1143639-1143671 | 33 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78535-78567 | 10 | 0.697 |
CP031653_5 | 5.6|1143883|33|CP031653|PILER-CR | 1143883-1143915 | 33 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14952-14984 | 10 | 0.697 |
CP031653_5 | 5.6|1143883|33|CP031653|PILER-CR | 1143883-1143915 | 33 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14982-15014 | 10 | 0.697 |
CP031653_5 | 5.7|1143944|33|CP031653|PILER-CR | 1143944-1143976 | 33 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35739-35771 | 10 | 0.697 |
CP031653_5 | 5.9|1143579|32|CP031653|CRISPRCasFinder,CRT | 1143579-1143610 | 32 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86181-86212 | 10 | 0.688 |
CP031653_5 | 5.10|1143640|32|CP031653|CRISPRCasFinder,CRT | 1143640-1143671 | 32 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244717 | 10 | 0.688 |
CP031653_5 | 5.10|1143640|32|CP031653|CRISPRCasFinder,CRT | 1143640-1143671 | 32 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78567 | 10 | 0.688 |
CP031653_5 | 5.11|1143701|32|CP031653|CRISPRCasFinder,CRT | 1143701-1143732 | 32 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2248362-2248393 | 10 | 0.688 |
1. spacer 8.1|3024209|40|CP031653|CRISPRCasFinder matches to NZ_CP041417 (Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence) position: , mismatch: 0, identity: 1.0
gcgctgcgggtcattcttgaaattacccccgctgtgctgt CRISPR spacer gcgctgcgggtcattcttgaaattacccccgctgtgctgt Protospacer ****************************************
2. spacer 7.1|2301883|38|CP031653|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 2, identity: 0.947
cggacgcaggatggtgcgttcaattggactcgaaccaa CRISPR spacer cagacgcagaatggtgcgttcaattggactcgaaccaa Protospacer *.*******.****************************
3. spacer 10.1|3844625|48|CP031653|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
4. spacer 10.1|3844625|48|CP031653|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
5. spacer 10.1|3844625|48|CP031653|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
6. spacer 10.1|3844625|48|CP031653|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
7. spacer 1.1|708118|42|CP031653|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 7, identity: 0.833
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer acaaatgccggatgcggcgtaaacgccttatctggcctacgc Protospacer ***. *.****************.*********.******.
8. spacer 1.1|708118|42|CP031653|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 8, identity: 0.81
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer attgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer *. * ******************.*******.*******.
9. spacer 5.1|1143578|33|CP031653|PILER-CR matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.758
gttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer gtccctatcgcaatgccggcagcatccgcaatc Protospacer **. *. ****** **** ************.
10. spacer 5.6|1143883|33|CP031653|PILER-CR matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.758
gccgaacggctggcgaagcaggtggctggcgta CRISPR spacer gccgaacaggtggcgaagcaggtgatgggccag Protospacer *******.* **************.. *** .
11. spacer 5.9|1143579|32|CP031653|CRISPRCasFinder,CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer tccctatcgcaatgccggcagcatccgcaatc Protospacer *. *. ****** **** ************.
12. spacer 5.9|1143579|32|CP031653|CRISPRCasFinder,CRT matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
13. spacer 5.9|1143579|32|CP031653|CRISPRCasFinder,CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
14. spacer 5.9|1143579|32|CP031653|CRISPRCasFinder,CRT matches to NC_008759 (Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcg-----caattccgggagcatccgcaatt CRISPR spacer -----cgtgaaactcatttccgggagcatccgcattt Protospacer **.* ** ***************** **
15. spacer 5.11|1143701|32|CP031653|CRISPRCasFinder,CRT matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer agcgtcaccgacgcgcagggccgctaccaact Protospacer **************** * *******.
16. spacer 5.11|1143701|32|CP031653|CRISPRCasFinder,CRT matches to NZ_CP017753 (Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcaccgacgcgcagtcgcgcttcttcaa Protospacer ***************** ***** *. ..*
17. spacer 5.14|1143884|32|CP031653|CRISPRCasFinder,CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer gggtacggctggcgaaggaggcggctgcggaa Protospacer * ************* ***.***** * *
18. spacer 5.14|1143884|32|CP031653|CRISPRCasFinder,CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ccgaacaggtggcgaagcaggtgatgggccag Protospacer ******.* **************.. *** .
19. spacer 5.15|1143945|32|CP031653|CRISPRCasFinder,CRT matches to NZ_CP006991 (Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence) position: , mismatch: 8, identity: 0.75
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer catcatcctcccgcagatgcgctggccgatcc Protospacer *.*.* .******** ******** *****
20. spacer 1.1|708118|42|CP031653|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 9, identity: 0.786
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer gttgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer .. * ******************.*******.*******.
21. spacer 5.1|1143578|33|CP031653|PILER-CR matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 9, identity: 0.727
-gttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer cgcta-ccgcgcaattcgaggagcatccgctggg Protospacer *.*. *********** .*********** .
22. spacer 5.3|1143700|33|CP031653|PILER-CR matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.727
gcccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer cagcgtcaccgacgcgcagggccgctaccaact Protospacer **************** * *******.
23. spacer 5.11|1143701|32|CP031653|CRISPRCasFinder,CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcactgacgcgcagtcgcgcttcttcaa Protospacer ******.********** ***** *. ..*
24. spacer 5.14|1143884|32|CP031653|CRISPRCasFinder,CRT matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
25. spacer 5.14|1143884|32|CP031653|CRISPRCasFinder,CRT matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
26. spacer 5.14|1143884|32|CP031653|CRISPRCasFinder,CRT matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ttgcgcagctggcgcagcaggtggctgccgag Protospacer ..* .*.******* ************ ** .
27. spacer 5.15|1143945|32|CP031653|CRISPRCasFinder,CRT matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.719
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcgac Protospacer ******.*** ************* *
28. spacer 4.9|1121513|32|CP031653|CRT matches to NZ_CP030933 (Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence) position: , mismatch: 10, identity: 0.688
tccacgctgtaacggccatcattaagtttagt CRISPR spacer ccgctgctgtgacgcccatcattaagttactc Protospacer .* .*****.*** ************* .
29. spacer 5.2|1143639|33|CP031653|PILER-CR matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 10, identity: 0.697
gacggacaaaatatatattgatttgcgaattat CRISPR spacer gacggaaaaattatatattgattttacttctgg Protospacer ****** *** ************* .*.
30. spacer 5.6|1143883|33|CP031653|PILER-CR matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 10, identity: 0.697
gccgaacggctggcgaagcaggtggctggcgta CRISPR spacer cagcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
31. spacer 5.6|1143883|33|CP031653|PILER-CR matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 10, identity: 0.697
gccgaacggctggcgaagcaggtggctggcgta CRISPR spacer cagcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
32. spacer 5.7|1143944|33|CP031653|PILER-CR matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 10, identity: 0.697
ggtttaccgccccgcagaggcgctggcagatcc CRISPR spacer ccgagaccgcctcgccgaggcgctggcagcgac Protospacer ******.*** ************* *
33. spacer 5.9|1143579|32|CP031653|CRISPRCasFinder,CRT matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 10, identity: 0.688
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer gctaccgcgcaattcgaggagcatccgctggg Protospacer . *********** .*********** .
34. spacer 5.10|1143640|32|CP031653|CRISPRCasFinder,CRT matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer tgaggcaaaatatagattgatttccgaaaata Protospacer .*.********* ******** ****
35. spacer 5.10|1143640|32|CP031653|CRISPRCasFinder,CRT matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer acggaaaaattatatattgattttacttctgg Protospacer ***** *** ************* .*.
36. spacer 5.11|1143701|32|CP031653|CRISPRCasFinder,CRT matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 10, identity: 0.688
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacatcaccgacgcccagtggcgcgacgtccc Protospacer *.********** ********* ** .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
160225 : 198185
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP031653|160225:198185|DBSCAN-SWA TATGCAAAAGTTTGATACCAGGACCTTCCAGGGCTTGATCCTGACCTTACAGGATTACTGGGCTCGCCAGGGCTGCACCATTGTTCAACCATTGGACATGGAAGTCGGCGCGGGAACCTCTCACCCAATGACCTGTCTGCGCGCGCTGGGGCCAGAACCGATGGCGGCTGCTTATGTTCAGCCTTCTCGTCGCCCGACCGATGGTCGCTACGGCGAAAACCCCAACCGTTTACAGCACTACTATCAGTTCCAGGTGGTCATTAAGCCATCGCCGGACAATATTCAGGAGCTGTACCTCGGTTCTCTGAAAGAGCTGGGCATGGACCCGACTATTCACGACATCCGTTTCGTGGAAGATAACTGGGAAAACCCGACTCTGGGTGCCTGGGGACTGGGCTGGGAAGTGTGGCTAAACGGCATGGAAGTGACGCAGTTCACTTACTTCCAGCAGGTTGGTGGTCTGGAGTGTAAACCGGTTACCGGCGAGATCACCTACGGTCTGGAACGTCTGGCCATGTACATTCAGGGCGTAGACAGCGTTTACGACCTGGTCTGGAGCGACGGCCCGCTGGGTAAAACCACCTACGGCGACGTGTTCCATCAGAACGAAGTGGAGCAGTCCACTTACAACTTCGAATACGCGGATGTGGACTTCCTGTTCACCTGCTTTGAGCAGTACGAGAAAGAAGCGCAGCAGCTGCTGGCGCTGGAAAATCCGCTGCCGCTGCCAGCCTACGAGCGTATTCTGAAAGCCGCCCACAGCTTCAACCTGCTGGATGCGCGTAAAGCCATCTCCGTCACCGAGCGTCAGCGCTACATTCTGCGCATTCGCACCCTGACCAAAGCAGTGGCAGAAGCATACTACGCTTCCCGTGAAGCCCTCGGCTTCCCGATGTGCAACAAAGATAAGTAAGAGGCGGCTATGTCTGAGAAAACTTTTCTGGTGGAAATCGGCACTGAAGAGCTGCCACCAAAAGCACTGCGCAGCCTGGCTGAGTCCTTTGCTGCGAATTTTACTGCGGAGCTGGATAACGCTGGCCTCGCACACGGCACCGTTCAATGGTTTGCTGCTCCGCGTCGTCTGGCGCTGAAAGTAGCTAACCTGGCGGAAGCGCAACCGGATCGTGAAATCGAAAAACGCGGCCCGGCGATTGCCCAGGCGTTCGACGCTGAAGGCAAACCGAGCAAAGCGGCAGAAGGTTGGGCGCGTGGTTGCGGTATTACCGTTGACCAGGCTGAGCGTCTGACTACCGATAAAGGCGAATGGCTGCTGTATCGCGCCCATGTGAAGGGCGAAAGCACCGAAGCACTGCTGCCGAATATGGTTGCGACTTCTCTGGCGAAACTGCCGATCCCGAAACTGATGCGTTGGGGCGCAAGCGACGTGCACTTCGTGCGTCCGGTGCACACCGTGACCCTGCTGCTGGGCGACAAAGTCATTCCGGCAACCATTCTGGGCATTCAGTCCGATCGCGTGATTCGCGGCCACCGCTTTATGGGCGAACCGGAATTCACCATCGACAACGCCGATCAGTATCCGGAAATTCTGCGTGAGCGCGGGAAAGTCATCGCCGATTACGAAGAACGTAAAGCGAAGATTAAAGCCGATGCCGAAGAAGCAGCGCGTAAGATTGGCGGTAACGCTGACTTAAGCGAAAGCCTGCTGGAAGAAGTGGCTTCGCTGGTGGAGTGGCCGGTCGTTCTGACCGCAAAATTCGAAGAGAAATTCCTCGCGGTGCCGGCTGAAGCGCTGGTTTACACCATGAAAGGTGACCAGAAATACTTCCCGGTGTATGCGAACGACGGCAAACTGCTGCCGAACTTTATCTTCGTTGCCAACATCGAATCGAAAGATCCGCAGCAGATTATCTCCGGTAACGAGAAAGTCGTTCGTCCGCGTCTGGCGGATGCCGAGTTCTTCTTCAACACCGACCGTAAAAAACGTCTGGAAGATAACCTGCCGCGCCTGCAAACCGTGCTGTTCCAGCAACAGCTGGGTACGCTGCGTGACAAAACTGACCGCATCCAGGCGCTGGCTGGCTGGATTGCTGAACAGATTGGCGCTGACGTTAACCACGCAACCCGTGCGGGCCTGCTGTCTAAGTGTGACCTGATGACCAACATGGTCTTCGAGTTCACCGACACCCAGGGCGTTATGGGGATGCACTATGCGCGTCACGATGGCGAAGCGGAAGATGTCGCGGTGGCGCTGAATGAGCAGTATCAGCCGCGCTTTGCCGGTGATGACCTGCCGTCTAACCCGGTAGCCTGTGCGCTGGCGATTGCTGACAAGATGGATACTCTGGCGGGTATCTTCGGTATCGGCCAGCATCCGAAAGGCGACAAAGACCCGTTTGCGCTGCGTCGTGCCGCACTTGGCGTGCTGCGAATTATCGTTGAGAAGAACCTCAACCTTGATCTGCAAACGCTGACCGAAGAAGCAGTGCGTCTGTATGGCGATAAGCTGACTAACGCCAACGTGGTTGATGATGTTATCGACTTTATGCTCGGTCGCTTCCGCGCCTGGTATCAGGACGAAGGTTACACCGTTGACACCATCCAGGCGGTACTGGCGCGTCGTCCGACTCGTCCGGCTGATTTCGATGCCCGCATGAAAGCGGTATCGCACTTCCGTACCCTTGAAGCAGCTGCTGCACTGGCGGCGGCGAACAAGCGTGTCTCCAACATTTTGGCGAAATCTGACGAAGTGCTGAGCGACCGCGTGAATGCCTCTACCCTGAAAGAGCCGGAAGAAATTAAACTGGCGATGCAGGTTGTGGTGCTACGTGACAAGCTGGAGCCGTACTTTGCGGAAGGTCGTTACCAGGATGCGCTGGTCGAACTGGCAGAGCTGCGTGAGCCGGTTGATGCTTTCTTCGATAAAGTGATGGTCATGGTTGATGACAAAGAATTGCGTCTCAACCGTCTGACCATGCTGGAGAAACTGCGCGAACTGTTCCTGCGCGTTGCGGATATTTCGCTGTTGCAGTAATAACGCCGTTATTAAAAAGCCTGCCATCTGGCAGGCTTTTTTTATTATCGCTAAATAATACAGCAACCTTTAATAATCTTCTGCTGAATAAAGATTATCTCATATATTAATTTTATGAGGTTTTTTTAGGATTATATCAAGGAGAAGAAACAAACTTATTAAGCTAGAATAGCCACGGGTGCTTGAGACTGTTTGTCTCAGGTATTCACCGAAAGGCAGACAGAGAAAAGCCCCACCTGACTATAAATCAAAGTGTACTGCACCCATTTTGTTGGACGATGAAATGGAATAGCCCCTAATATGTCAAAGCCAAAATACCCTTTTGAAAAGCGCCTTGAAGTCGTGAATCACTACTTCACAACTGATGATGGTTACAGGATCATCTCGGCACGTTTTGGTGTCCCCCGAACCCAGGTCAGGACATGGGTTGCCCTCTATGAAAAACATGGAGAAAAAGGTTTAATTCCCAAACCTAAAGGCGTTAGTGCTGATCCAGAGTTGCGTATTAAGGTCGTGAAAGCTGTGATCGAGCAGCACATGTCCCTTAATCAGGCTGCTGCTCACTTTATGCTTGCTGGTAGTGGTTCTGTAGCCAGGTGGCTGAAGGTCTATGAAGAGCGCGGAGAAGCTGGTTTACGCGCGCTCAAGATTGGCACCAAAAGAAACATTGCAATATCAGTTGATCCAGAAAAAGCGGCATCAGCATTGGAGCTGTCAAAAGACCGACGCATTGAGGATCTTGAAAGGCAAGTTCGATTTCTTGAAACGCGGCTTATGTATCTAAAAAAGCTGAAAGCCTTAGCTCATCCCACGAAAAAGTGAAAGTACTCAACGAGCTAAGGCAGTTTTATCCTCTTGATGAGCTTCTCAGGGCTGCGGAGATACCGCGCAGTACGTTTTATTATCATCTAAAGGCTCTCAGCAAGCCTGACAAGTATGCGGACGTTAAAAAGCGTATTAGTGAGATTTATCACGAGAATAGAGGCCGATACGGATACCGTAGGGTAACGCTGTCTCTTCATCGAGAAGGGAAACAGATTAACCATAAAGCTGTTCAGCGCCTGATGGGAACCCTCTCACTTAAAGCAGCGATTAAGGTCAAGCGATACCGCTCTTACAGAGGAGAGGTAGGGCAAACCGCCCCTAATGTTCTCCAAAGAGATTTCAAGGCTACGCGGCCAAACGAGAAGTGGGTTACCGATGTTACTGAATTTGCAGTCAATGGGCGCAAGCTGTATTTGTCTCCAGTAATAGATCTCTTCAACAACGAAGTTATTTCTTACAGCCTTTCGGAAAGACCAGTGATGAACATGGTTGAGAATATGCTCGATCAGGCATTCAAAAAGCTTAATCCTCACGAGCATCCTGTTCTGCACTCTGACCAGGGATGGCAGTATCGTATGAGAAGATATCAAAATATCCTTAAAGAACATGGTATTAAACAAAGCATGTCCAGAAAAGGCAATTGTCTGGATAATGCTGTGGTGGAGTGTTTCTTTGGAACCTTAAAGTCGGAGTGTTTTTATCTTGATGAGTTCAGTAATATAAGCGAACTGAAGGATGCTGTTACGGAATATATTGAATACTACAACAGCAGAAGAATTAGCCTGAAATTAAAAGGTCTGACTCCAATTGAATATCGGAATCAGACCTATATGCCTCGTGTTTAACTGTCCAACTTTTTGGGGTCAGTACACCACGAGGCATCCCTATGTCTAGTCCACATCAGGATAGCCTCTTACCGCGCTTTGCGCAAGGAGAAGAAGGCCATGAAACTACCACGAAGTTCCCTTGTCTGGTGTGTGTTGATCGTGTGTCTCACACTGTTGATATTCACTTATCTGACACGAAAATCGCTGTGCGAGATTCGTTACAGAGACGGACACAGGGAGGTGGCGGCTTTCATGGCTTACGAATCCGGTAAGTAGCAACCTGGAGGCGGGCGCAGGCCCGCCTTTTCAGGACTGATGCTGGTCTGACTACTGAAGCGCCTTTATAAAGGGGCTGCTGGTTCGCCGGTAGCCCCTTTCTCCTTGCTGATGTTGTACGGGCATGAACAACCTGACTTCAGGAAGGTCGTTTCCCTTCAGGAACGGGGATGAACGCGCGCCTGCGGCGCCCGTTCTTTTCCCCCGCCTTCTCTGGTTATGGCCTGTCAGAAATCACATCACCTTCCCTGTGATTATTCTCTTTTTCTTCCTGCTCTGATTCTGACTACTGCAGTGTTGTCCTGTCTGCCGGGCGTACCTTCGGCTCCCGGCTGCCTGGCGGCATCCGGTCGTCAGGCCCGACTCCGACAGGCAGCTGCTGGCGCACCTGCCAGTCTCCGCCGAACCCGCTGGCGCGGTTTCGGGCTGTTGCTACGGTCCGATGCGTGAAAACCCCTGGCTTGCAGCGACCGTTCCGGTCGCGCCGCCTCTGTCGCGCCGTGACCTGCGCTCCCGGCATCCCGGCAAAAGGCGCCGGGATGTAAGGCGCAACGCCGTTTCGCAAACCGCGCTTCACGTCGTTGCGGCGGGTCCCTGCCCGTTGTTGTGTCCGGTGCCATTCACCAGTTTAAGCCCCTGTCGCCGGGGGCATAGACAGCCACAACCGTCTCCGCCAGTCCAGGGGTGAAATGCACATCCCGCAAAACTTTTTGCCCAGAAGCAGGCAAAAACTTTTGCGGTCTGCCCCCGGACAGTCGTTTCCGGCCGTGGCAGTGATGCCCCATGCGGCGAAGGTTCTCAAACAGGTGATGGTGAATACCGGAGGGAACAAAAGGCAGGTCCCCGCAAAACGTCAAAACCGCGGCTCCCGAAACTCAACAGATGATGGGGGCTGAAAACCAGAGAGAGGAGAAAGCAGGATGAGTACCCGAAATATCCACGTTAACACCGCGTCGTACACGCTTCTTGTTGCCGGGAAGAAAAAAAACACCGGGGAAGAGTGGGACGTCCTGGAATTCAGCAGCCTGACTGAGCTGAAAAAATATCGCAAAAGCCACCCGGAAAAGATGGCCTTCAGTTATAGCTACGCGCTCAGTCAGGGTGTGGATAAGCAGTTCCGCCATATCAACATTGCGGAAGCCGATCACTTCAAACAGTTCCTGCGTCAGATTAAGCGTGCCGGCCTCGATATCCGGGCGATCTGCTGACCGGATGTGATGAATATCCCTCCTGATTGTTGCAATCTTGTTAATGGAGATCGGGGGGATGATATTAAGCGGAAAAATTTTTTATTAAATTTATTCTTTCATAAAGGAGTAGCTGTTATGCGATTAGCTTCCCGTTTTGGTCGGTATAATTCCATCCGCCGTGAACGTCCTTTAACGGATGATGAATTAATGCAGTTCGTGCCTTCGGTATTTTCCGGTGATAAACATGAGTCCCGGAGTGAACGTTATACGTATATTCCAACAATCAATATCATCAATAAGTTACGTGATGAAGGTTTCCAGCCATTCTTTGCCTGTCAGAGTCGGATTCGTGATTTGGGACGTCGCGAATACAGTAAACATATGTTACGTCTTCGCAGGGAAGGGCATATTAACGGACAGGAAGTTCCTGAAATTATCCTGCTTAATTCACATGATGGTTCATCCAGTTATCAGATGATCCCCGGAATTTTTCGTTTTGTCTGCACAAATGGCCTGGTGTGCGGAAATAATTTTGGCGAAATCCGCGTTCCACATAAAGGTGATATTGTCGGGCAGGTTATCGAGGGAGCGTATGAAGTGCTCGGTGTCTTTGATAAGGTCACTGATAATATGGAGGCGATGAAAGAAATTCATCTTAACAGTGACGAGCAACATTTATTTGGCAGAGCTGCACTGATGGTCAGGTATGAAGATGAAAATAAAACGCCAGTGACACCTGAACAGATAATTACTCCCCGTCGTCGGGAAGACAAACAGAACGATCTCTGGACAACCTGGCAGCGGGTTCAGGAGAATATGATAAAAGGTGGATTATCGGGGCGAAGTGCCTCCGGGAAAAATACCAGGACAAGAGCCATTACAGGTATTGATGGTGATATAAGAATCAACAAGGCGTTATGGGTGATTGCCGAACAGTTCAGAAAGTGGAAGTCATGATAATATAAGTAAAGCCCCGAAAAAATTTTCGGGGCTTTACTTATATTATCATTTTTCGGCATTGCTGTGTCGTCTGATATACCTCAATTTCATCAACCGGCTTTCCGGTTGTCTGTAAATATTGTGCCCGCGAAAAATGGTCTGGCAGGAGGCTGCACAGTTACCGGGTATATATAAGATGGGTTCCCGCTTCGCGGGGCTTCGGCCAGTCAGTATTGTCACTCATATGATATTTTTGTGTGTGGCCTTCCATGCCGCTTTTGCGGCATAACCGGTATCTGACAATGTCTGTAAGATTAATTGTTCTGCACGCTGTTAATTTCCGGAAGTGATTTCTTCGTGGCCGGAAGCCGGATACCTTTACTGCTTTTTATCCCAGTATAAATCCGGTAAACATCTGATGCGTAGGCAAGACGTCTCTGGTTCTGGCGTTCGGTTTTCCTGAATCCGGCATTGTATGCACCAACGGCCTCCCAGGTGACGCCCCATTTTTTAAATGCTATTGCCAGATAATAAGCACCGGTATAAATGTTCATGCAGGGATCTGTTGTCAGATGTTCTGGCTTAATTCCGTAGCGGGCCAGTTCGTTAAAATGCTGGGAATCTACCTGCATCAGTCCGCTGCCATATCCCGTTACCGGATTAATACCGATGGCATTAACCCGGTAACGGGATTCTTGCCATGATATTGCTCTCAGTAAATCAGGATCTATTTTGTAATCCCGGCCTGCAAGATCAAAGCAATCAGTGGCCTGGCAGATCTCATTTATAAACATCAGGCAGATGGTTAACATCCATTTTTTCATTTTTCCACCTCTGGTGACTTTATCCGTAAATAATTTAACCCACTCCACAAAAAGGCTCAACAGGTTGGTGGTTCTCACCACCAAAAGCACCACACCCCACGCAAAAACAAGTTTTTGCTGATTTTTCTTTATAAATAGAGAGTTATGAAAAATTAGTTTCTCTTACTCTCTTTATGATATTTAAAAAAGCGGTGTCGGCGCGGCTACAACAACGCGCCGACACCGCTTTGTAGGGGTGGTACTGACTATTTTTATAAAAAACATTATTTTATATTAGGGGTGCTGCTAGCGGCGCGGTGTGTTTTTTTATAGGATACCGCTAGGGGCGCTGCTAGCGGTGCGTCCCTGTTTGCATTATGAATTTTAGTGTTTCGAAATTAACTTTATTTTATGTTCAAAAAAGGTAATCTCTAATGGCTAAGGTGAACCTGTATATCAGCAATGATGCCTATGAAAAAATAAATGCGATTATTGAGAAGCGTCGACAGGAAGGGGCAAGGGAAAAAGATGTCAGTTTTTCAGCAACAGCTTCAATGCTTCTTGAACTGGGGCTTCGTGTACATGAGGCTCAGATGGAGCGTAAAGAGTCTGCATTTAATCAGACTGAGTTTAATAAATTGCTTCTTGAATGCGTTGTAAAAACACAATCATCAGTAGCGAAAATTTTGGGTATTGAGTCTCTCAGTCCTCATGTCTCCGGAAATCCAAAGTTTGAATATGCCAATATGGTTGAAGATATCAGGGAGAAGGTATCATCTGAGATGGAACGATTTTTTCCAAAAAATGATGATGAATAAAAGAAATTTGACTTCGTTCAAATATCAGAGTTTTTATGATTTAAAAAGGTGACAGTACGAAAGATAATTAGTATATTAATTACGTGGTTAATGCCACGTTAAAATTTGAATTTGAAAATCGCCGATGCAGGGAGTTCTCTCCTCCCTGCATCGACTGTCCATAGAATCCTTTGTGAGGAGGTTCCTATGTATCCGATGGATCGTATTCAACAAAAACATGCTCGTCAAATAGATCTGCTGGAAAATCTGACGGCAGTTATTCAGGATTATCCAAATCCAGCCTGTATCAGGGACGAAACTGGAAAATTTATTTTTTGCAATACGCTGTTTCATGAGTCATTTCTTACACAAGATCAAAGTGCTGAAAAATGGCTTCTGTCGCAGAGAGATTTTTGTGAATTGATCTCTGTCACAGAGATGGAAGCATACAGGAATGAGCATACGCATCTTAATCTTGTAGAGGATGTTTTTATTCAGAACAGATTCTGGACAATATCTGTCCAGTCATTTCTTAATGGACACAGAAATATTATTCTGTGGCAATTTTATGATGCTGCTCATGTTCGTCATAAAGACAGTTATAATCAAAAAACGATTGTCAGTGATGATATCAGAAATATAATCAGAAGAATGAGTGATGATTCTTCTGTATCATCATATGTAAATGATGTCTTTTACTTATATAGCACCGGAATCAGTCATAATGCTATAGCAAGAATATTAAATATATCCATCTCCACATCAAAGAAACACGCATCTCTGATATGCGACTACTTCTCTGTTTCTAATAAAGATGAGTTAATTATCTTACTCTACAATAAAAAGTTTATTTATTATTTATACGAGAAGGCTATGTGTATCATAAATACGCGTTAATAAGGTGTTGATAAAATATAGACTTTCCGTCTATTTACCTTTTCTGATTATTCTGTAAACATAAGTGGTAACCAGAAGATAAACAGCGGGAGGTGTTATTGAAAAGATTTGGTACACGTTCTGCAACAGGTAAGATGGTAAAACTAAAATTACCTGTAGATGTGGAAAGTCTATTAATTGAGGCAAGTAACAGAAGCGGAAGAAGTCGATCGTTTGAGGCAGTAATAAGACTTAAAGATCATCTTCACCGCTATCCAAAGTTTAACAGGGCAGGGAATATCTATGGTAAGTCGCTGGTTAAGTATCTGACAATGCGTCTGGATGATGAAACTAACCAGCTACTTATTGCAGCCAAAAATCGTAGTGGATGGTGTAAAACAGATGAGGCTGCAGACAGAGTTATTGATCATTTGATCAAGTTTCCTGATTTTTATAACTCGGAGATATTCAGGGAGGCAGATAAAGAGGAAGATATAACATTTAATACACTCTAGTTTTATTCATTTATCCGAAATTGAGGTAACTTATGAATGCTGTTTTAAGTGTTCAGGGTGCTTCTGCGCCCGTCAAAAAGAAGTCGTTTTTTTCTAAATTCACTCGTCTGAATATGCTTCGCCTGGCTCGCGCAGTGATCCCGGCTGCTGTTCTGATGATGTTCTTCCCGCAGCTGGCGATGGCCGCCGGCAGCAGTGGTCAGGACCTGATGGCAAGCGGTAACACCACGGTTAAGGCGACCTTCGGTAAGGACTCCAGTGTTGTTAAATGGGTTGTTCTGGCTGAAGTTCTGGTCGGTGCTGTCATGTACATGATGACCAAAAACGTCAAGTTCCTGGCCGGTTTTGCCATCATCTCTGTATTTATTGCTGTGGGTATGGCCGTCGTTGGCCTCTGACAGGAAATAAAACGATGTCGGGAGACGAGAATAAACTTAAGAAATATCGTTTCCCGGAAACACTGACCAACCAGAGCCGCTGGTTTGGCCTGCCACTGGATGAACTGATCCCCGCAGCAATCTGTATTGGCTGGGGTATCACAACATCGAAATATCTGTTCGGTATTGGTGCAGCGGTTCTGGTTTATTTCGGGATTAAAAAACTGAAAAAAGGGCGGGGCAGTTCCTGGTTACGTGACCTGATTTACTGGTATATGCCAACAGCCCTGCTGCGCGGTATTTTTCATAATGTTCCCGATTCGTGTTTCCGGCAGTGGATTAAATAGAACTGATACCAGGATTGTTATATGGAACACGGTGCCCGTTTAAGTACCAGTCGTGTAATGGCCATCGCCTTTATATTTATGTCAGTGCTTATTGTTCTCAGCCTCTCTGTTAACGTCATTCAGGGGGTGAATAACTACCGTCTTCAGAATGAGCAACGCACTGCCGTGACGCCAATGGCATTTAATGCCCCCTTTGCCGTGTCACAGAACAGTGCCGACGCCTCTTATTTACAGCAGATGGCGCTGTCATTTATTGCCCTCCGTCTGAATGTTTCATCAGAAACCGTCGATGCCTCACATCAGGCGCTTCTGCAATATATCCGCCCGGGCGCACAGAACCAGATGAAAGTTATTCTGGCTGAAGAAGCGAAGCGTATTAAAAACGATAACGTGAACTCAGCCTTTTTCCAGACCAGTGTTCGTGTCTGGCCTCAGTATGGCCGTGTGGAAATTCGAGGTGTGCTTAAAACCTGGATTGGTGATTCAAAACCTTTCACTGATATCAAACATTACATCCTTATTCTGAAGCGGGAAAACGGGGTGACCTGGCTGGATAATTTCGGGGAAACAGACGATGAGAAAAAATAATACGGCAATAATATTCGGCAGCCTGTTTTTTTCCTGCAGCGTGATGGCCGCAAACGGTACGCTGGCCCCCACCGTGGTGCCAATGGTGAACGGTGGTCAGGCCAGTATTGCCATCAGCAATACCAGCCCGAATCTGTTTACCGTTCCCGGTGACCGGATTATCGCCGTGAACAGTCTGGATGGTGCCCTGACCAATAATGAGCAGACCGCCTCCGGCGGTGTGGTGGTTGCCACCGTCAACAAAAAGCCCTTTACGTTCATTCTGGAAACAGAACGTGGTCTGAATCTTTCCATTCAGGCCGTTCCCCGTGAAGGCGCGGGGCGTACCATTCAGCTGGTCAGTGACCTGCGCGGAACCGGAGAAGAAGCCGGTGCGTGGGAAACGTCCACGCCTTACGAATCCCTGCTTGTGACCATCAGCCAGGCCGTCCGTGGCGGAAAATTACCCGCAGGCTGGTATCAGGTCCCAGTGACAAAGGAAACCCTGCAGGCCCCGGCGGGGCTGTCTTCAGTGGCAGATGCCGTATGGACGGGGAATCACCTGAAGATGGTCCGCTTTGCCGTGGAAAATAAAACGCTGTCTGCCCTGAATATCCGGGAAAGTGACTTCTGGCAGCCGGGAACCCGTGCCGTGATGTTCAGCCAGCCTGCCAGCCAGTTACTGGCAGGTGCGCGCATGGATGTGTATGTCATCCGTGACGGGGAGGGCAACTGATGGCCAGTATCAATACCATTGTGAAACGCAAGCAGTACCTGTGGCTGGGGATTGTGGTTGTCGGTACAGCCTCCGCGATTGGTGGGGCACTGTATCTGTCTGATGTGGACATGTCCGGTAACGGTGAAACCGTGGCTGAACAGGAGCCTGTGCCGGATACCGTTATCGTCGACCAGGGCGAAAAACTCTCCCTGAAAGAGACGTTAACCCTGCTGGACGGTGCCGCACGTCATAACGTACAGGTCCTGATAACCGACAGCGGGCAGCGAACCGGTACAGGCAGTGCGCTGATGGCCATGAAGTATGCCGGGGTGAACACGTATCGCTGGCAGGGCGGAGAACAGCGACCGGCAACCATCATCAGTGAACCGGACCGGAATGTCCGCTATGACCGGCTGGCCGGAGATTTTGCGGCCAGCGTGAAAGCCGGAGAAGAGAGCGTGGCACAGGTCAGCGGGGTACGGGAACAGGCCATACTGACACAGGCCATTCGCAGTGAGCTGAAAACACAGGGCGTGCTCGGACACCCGGAGGTGACCATGACCGCCCTGTCACCGGTCTGGCTGGACAGCCGGAGCCGTTATCTGCGGGATATGTACCGACCGGGGATGGTGATGGAGCAGTGGAACCCGGAGACACGCAGTCATGACCGCTATGTTATCGACCGGGTGACGGCGCAGAGTCACAGCCTGACCCTGCGGGATGCGCAGGGCGAAACGCAGGTGGTGCGTATTTCCTCCCTGGACAGCAGCTGGTCGCTGTTCCGGCCGGAAAAAATGCCGGTGGCAGACGGCGAGCGACTGAGGGTGACAGGGAAAATTCCCGGACTCCGCGTCTCCGGCGGTGACCGCCTGCAGGTGGCATCCGTCAGTGAAGATGCGATGACGGTTGTTGTGCCGGGGCGGGCTGAACCGGCCACCCTGCCTGTGGCTGATTCACCGTTCACGGCACTGAAGCTGGAGAACGGCTGGGTGGAAACGCCCGGGCATTCCGTCAGTGACAGTGCGACGGTGTTTGCCTCCGTCACACAGATGGCAATGGACAACGCCACCCTGAACGGTCTGGCCCGCAGCGGTCGTGATGTCCGGCTGTATTCCTCACTGGATGAAACCCGTACTGCGGAAAAACTTGCCCGCCATCCCTCCTTTACGGTGGTTTCTGAGCAGATAAAGGCGCGGGCCGGTGAGACATCGCTGGAAACCGCTATCAGTCTGCAGAAAACCGGGCTGCACACGCCGGCACAGCAGGCCATTCATCTGGCCCTTCCTGTGCTGGAAAGTAAAAACCTGGCCTTCAGCATGGTGGACCTGCTGACAGAGGCGAAGTCGTTTGCTGCAGAAGGAACCGGTTTTACTGAACTGGGAGGGGAAATCAATGCGCAGATAAAACGCGGTGACTTACTGTATGTGGATGTGGCAAAAGGCTATGGCACAGGCCTGCTGGTTTCCCGTGCGTCGTATGAGGCAGAAAAAAGTATTCTTCGCCATATTCTCGAAGGTAAGGAGGCGGTCACGCCGCTGATGGAGAGAGTACCCGGCGAACTCATGGAGACGTTAACGTCGGGACAGCGTGCCGCCACCCGTATGATACTGGAAACGTCCGACCGTTTCACGGTGGTACAGGGTTATGCCGGGGTGGGTAAGACCACACAGTTCCGGGCGGTGATGTCAGCCGTGAACATGCTGCCGGAGAGTGAGCGTCCCCGTGTCGTTGGGCTGGGGCCCACGCACCGTGCGGTCGGGGAGATGCGCAGCGCCGGCGTGGATGCACAGACACTGGCGTCCTTTCTGCATGACACGCAGCTGCAGCAGCGCAGCGGAGAAACGCCGGATTTCAGCAACACGCTGTTCCTGCTCGATGAGAGCTCAATGGTGGGCAATACCGACATGGCACGGGCATACGCCCTGATTGCGGCCGGTGGCGGTCGTGCTGTGGCCAGCGGTGACACGGACCAGCTGCAGGCCATCGCGCCCGGTCAGCCTTTCCGTCTCCAGCAGACGCGCAGTGCTGCCGATGTGGCCATCATGAAGGAGATTGTGCGTCAGACGCCGGAACTGCGGGAGGCGGTATACAGCCTGATTAACCGGGATGTGGAAAAGGCACTGTCCGGGCTTGAGAGTGTGAAACCGTCTCAGGTGCCACGTCTGGAGGGCGCATGGGCACCGGAGCACTCCGTGACGGAGTTCAGTCACAGCCAGGAAGCGAAACTGGCAGAAGCGCAGCAGAAGGCGATGCTGAAAGGCGAGGCTTTCCCGGATATCCCCATGACACTGTATGAAGCCATTGTCCGCGACTATACCGGCAGGACACCGGAAGCACGGGAGCAGACGCTGATTGTCACGCACCTGAATGAGGACCGGCGCGTACTGAACAGCATGATTCATGATGCACGGGAAAAGGCCGGTGAGCTGGGGAAAGAGCAGGTCATGGTGCCTGTCCTGAACACAGCGAATATACGTGACGGGGAGCTGCGTCGTCTCTCCACCTGGGAGAATAACCCGGATGCCCTTGCCCTGGTGGATAGTGTGTATCACCGGATTGCCGGTATCAGCAAGGATGACGGGCTGATAACCCTGGAGGATGCGGAAGGTAACACGCGGCTGATTTCGCCCCGGGAGGCGGTGGCTGAAGGTGTCACACTGTACACCCCGGACAAAATCCGGGTGGGGACCGGCGACCGGATGCGCTTCACGAAGAGTGACCGGGAGCGCGGTTATGTGGCCAACAGCGTCTGGACGGTGACAGCGGTTTCCGGTGACAGTGTCACGCTGTCAGACGGACAGCAGACCCGGGTGATTCGCCCCGGCCAGGAGCGGGCAGAGCAACATATTGACCTGGCTTATGCCATCACCGCCCACGGTGCGCAGGGGGCGAGTGAAACCTTTGCCATCGCGCTTGAAGGCACGGAAGGTAACCGGAAACTGATGGCCGGCTTTGAGTCAGCCTACGTGGCCCTGTCGCGTATGAAGCAGCATGTGCAGGTGTACACCGATAACCGTCAGGGCTGGACGGATGCCATTAACAATGCCGTACAGAAAGGAACTGCCCACGATGTGCTGGAGCCGAAACCGGACCGGGAGGTCATGAATGCGCAGCGGCTGTTCAGTACGGCACGGGAACTGCGGGACGTGGCGGCAGGCCGTGCCGTTCTCCGTCAGGCGGGGCTGGCCGGGGGAGACAGTCCTGCACGGTTTATTGCTCCGGGACGTAAATATCCGCAACCGTATGTGGCACTGCCGGCTTTTGACCGTAACGGCAGGTCCGCCGGTATCTGGCTGAACCCACTGACCACGGATGACGGAAACGGGCTGCGGGGATTCAGCGGTGAAGGCCGTCCGTGGAATCCCGGTGCCATCACCGGTGGCCGCGTGTGGGGGGATATTCCGGATAACAGCGTCCAGCCGGGAGCCGGAAATGGTGAACCGGTCACGGCAGAGGTACTGGCACAGCGGCAGGCTGAAGAGGCCATCCGCCGTGAAACGGAACGCCGCGCAGATGAAATTGTCAGGAAAATGGCAGAGAACAAACCTGACTTGCCGGACGGCAAAACAGAGCTGGCTGTCAGGGATATTGCCGGGCAGGAACGTGACCGGACTGCCACTTCTGAACGGGAAACCGCACTGCCGGAGAGTGTGCTGCGTGAATCACAACGGGAACGGGAAGCGGTCCGGGAGGTTGCCCGGGAAAATCTGCTGCAGAGGCTTCTGCAGCAGATGGAGCGGGATATGGTTCGTGACCTGCAGAAAGAGAAAACCCTGGGCGGAGACTGATACAGGAAGACAAACACTGATGACAACGGATAACACGAACACGATACGTAACGATTCACTGGCTGCACGGACCGATACCTGGTTGCAGTCATTTCTGGTCTGGTCACCCGGACAGCGGGATATCATCAAAACGGTGGCACTGGTGCTGATGGTACTGGACCACATCAATCTGATATTCCAGCTGAAGCAGGAATGGATGTTTCTGGTCGGGCGTGGGGCCTTTCCGCTTTTTGCCCTGGTGTGGGGGCTGAATCTGTCCCGTCATGCGCATATCCGGCAACCGGCCATTAACCGGCTGTGGGGATGGGGAATTATTGCCCAGTTCGCATATTATCTGGCAGGCTTTCCCTGGTATGAGGGGAATATCCTGTTTGCCTTTGCAGTGGTGGCACAGGTGCTGACGTGGTGTGAAACGCGCAGTGGGTGGCGTACAGCAGCGGCCATTCTCCTGATGGCCCTGTGGGGGCCTTTGTCCGGCACCAGTTACGGCATTGCCGGGCTGCTGATGCTGGCAGTCAGCCACCGGCTGTACCGGGCGGAAGACAGAGCGGAACGTCTGGCGCTGGTCGCCTGCCTGCTGGCTGTTATTCCGGCGCTTAACCTTGCCACCAGTGATGCAGCGGCGGTGGCTGGTCTTGTGATGACGGTGCTGACCGTTGGTCTGGTGTCGTGTGCAGGAAAATCATTACCCCGTTTCTGGCCCGGAGACTTTTTCCCGACGTTCTATGCCTGTCATCTGGCTGTGCTGGGCGTTCTGGCGCTGTGACGGGTGTGGTATCTTTGGCCGCAAGAGGATGATTCGTCAGAGGCAGAACACAGCATGACAGAGCAGAAGCGACCGGTACTGACACTGAAGCGGAAGACAGAGGGAACAGCGCCTGTCCGCAGCCGGAAAACCATCATCAATGTCACCACGCCACCAAAATGGAAGGTGAAAAAGCAGAAACTGGCCGAGAAAGCGGCCCGGGAAGCAGAGCTGGCGGCAAAAAAAGCGCAGGCCAGACAGGCGCTGTCCATTTATCTGAACCTGCCCTCACTGGATGAGGCCGTGAATACCCTGAAGCCCTGGTGGCCGGGATTATTTGACGGTGACACGCCCCGGCTTCTGGCCTGCGGTATCCGGGACGTGTTACTGGAAGACGTGGCGCAGCGGAATATCCCGCTCTCGCATAAAAAACTGCGCAGGGCGCTGAAGGCCATCACCCGTTCAGAAAGCTATCTGTGTGCCATGAAAGCCGGTGCCTGCCGGTATGACACGGAAGGGTATGTGACGGAGCATATTTCTCAGGAGGAGGAAACGTATGCGGCAGAGCGTCTGGATAAAATCCGCCGCCAGAACCGGATAAAGGCAGAACTTCAGGCCGTGCTTGATGAGAAATAAAAAAAAGCCTCCCTCAGAGAGAGGAGGCAGGGAAATAAGGCTTAAAAGGAAATAACTTCCGTACCCGAAAGGAAGCCAATTCAGTAGATTATTTAACACCATAATACTGACGGGTTAAAGAGGAGTGATAAAGATGAATGGATTCAGAAACAGTTCACGGAACGGTCAGGTCTGGTGCTGGCAGCGTGCCGGAAGCCGGGCCGTTATTCTGGAGGTCAGTGGACGCTGGATGGAAGCGGCAGAAGGATGGCGACGGGCTGCCTGTGTTGCGCCCCGGACAGACTGGCAGCAGTTTGCCCGAAAAAGGGCTGAACACTGCCACCGGCGCTGCCGGGGAAGGGTGTAACGAAAAAAGCAGCCTTCATTACTCCAGGGATTCAGAAGGCTGAATAATGCATGAGATTTGCTTTTTTTAATATCCGGTCGGGCAGTATGACCGGACAGTTCTGAGTAGCACAGAGTTTTTTCCGTATTAAGTCGTGATTTTATATTTTGTGATTAATTTCACAAAATAAGGTGTTATTCAGTGTGTGCTGCAATATTCAGGATGCCTGAAATACAGGTGCTATTTATCTGATATGGAGAATAACATGAGGAAATATATTCCACTGGTATTATTTATCTTTTCATGGGTAATGACTCCAACTTATTGATAGTGTTTTATGTTCAGATAATGCCCGATGACTTTGTCATGCAGCTCCACCGATTTTGAGAACGACAGCGACTTCCGTCCCAGTGTAACCGGCTCATTTAAACCGTCTGGTCTGTTTCCTCCGGCTCTACAAAAATAATGTCCATCATTTTTAATGGACACTATCGTATGAAACACCGGACCTGGATCACTGAAGCTTTACGTCTTCACTTTGAAGAACATTTACCCCGGGTTGTGGCCGGGCGTCGCCTGGGTGTACCAAAATCAACAGTTTGTAGTATGTTCGTGCGCTTTCGGAGAGCTGGCCTTTCGTGGCCTTTGCCCGCAGGCATGTCGGAGCAGGAACTTGATGCCTGCCTTTACGGACAATTTTCCACGGTACCAGTCGTACGTCCTGAAAGCACCGTTATATCCGAAGCCCCCGTGGTAAAAAAACGTCCCCGGCGGCCCAACTTCCCTTATGAGTTTAAAATCGCCTTAGTGGAGCAGTCACTGCAGCCCGGAGCCTGTGTGGCGCAGATCGCCCGGGAAAACGGAATCAACGATAACCTGCTCTTCAACTGGCGCCATCAATACCGGAAAGGTGGCCTGCTGCCTTCCGGAAAAAATATGCCGGCACTGCTTCCCGTGACGTTAACGCCGGAGCCGGATAATAAAATCCCGGCCCCCGCACAGGAACCAGAGCAGATAAATACACCGTCCGACAGTCTGTGTTGTGAGCTGGTTCTGCCGGCCGGAACTCTCAGGCTTAAAGGTAAACTGACGCCGGCGTTATTACAGACACTTATCCGCGAAATAAAAGGGAGCAGCCACTGATGATATCTCTCCCTGCAGGTTCGCGTATCTGGCTGGTTGCAGGTATCACCGATATGCGAAATGGCTTTAACGGCCTGGCATCAAAAGTTCAGAACGTCCTGAAGGATGACCCGTTCTCCGGACACCTGTTCATCTTCCGCGGACGCCGGGGTGACCAGATAAAAGTGTTGTGGGCTGACAGTGACGGACTGTGCCTCTTCACCAAACGCCTGGAGCGGGGCCGCTTCGTCTGGCCAGTCACCCGTGACGGCAAGGTGCACCTTACTCCGGCTCAGTTATCCATGCTTCTTGAAGGTATCAACTGGAAGCACCCGAAACGAACGGAACGCGCTGGAATCCGCATATAACCCGTTGTAAAGTGAGGATATGGACACCTCACTTGCTCATGAGAACGCCCGCCTGCGGGCACTGTTGCAGACGCAACAGGACACCATCCGCCAGATGGCTGAATACAACCGCCTGCTCTCACAGCGGGTGGCGGCTTATGCTTCCGAAATCAACCGGCTGAAGGCGCTGGTTGCGAAACTGCAACGTATGCAGTTCGGTAAAAGCTCAGAAAAACTTCGTGCAAAAACCGAACGGCAGATACAGGAAGCACAGGAGCGAATCAGCGCACTTCAGGAAGAAATGGCGGAAACGCTGGGTGAGCAATATGACCCGGTACTGCCATCCGCCCTGCGCCAGTCTTCAGCCCGTAAACCGTTACCGGCCTCACTTCCCCGTGAAACCCGGGTTATCCGGCCGGAAGAGGAATGCTGTCCTGCCTGTGGTGGTGAACTCAGTTCTCTGGGATGTGATGTGTCAGAGCAACTGGAGCTTATCAGCAGCGCCTTTAAGGTTATCGAAACACAACGTCCGAAACAGGCCTGTTGCCGGTGCGACCATATCGTGCAGGCACCAGTACCTTCAAAACCCATTGCACGCAGTTATGCCGGAGCGGGGCTTCTGGCCCATGTTGTCACCGGGAAATATGCAGACCATCTGCCGTTATACCGCCAGTCAGAAATATACCGTCGTCAGGGAGTGGAGCTGAGCCGTGCCACACTGGGGCGCTGGACAGGTGCTGTTGCTGAACTGCTGGAGCCGCTGTATGACGTCCTGCGCCAGTATGTGCTGATGCCCGGTAAAGTCCATGCTGATGATATCCCCGTCCCGGTCCAGGAGCCGGGCAGCGGTAAAACCCGGACAGCCCGGCTGTGGGTCTACGTCCGTGATGACCGTAACGCCGGTTCACAGATGCCCCCGGCGGTCTGGTTCGCGTACAGTCCGGACCGGAAAGGTATCCATCCACAAAATTACCTGGCCGGTTACAGCGGTGTGCTTCAGGCCGATGCTTACGGTGGTTACCGGGCGTTATACGAATCCGGCAGAATAACGGAAGCCGCGTGTATGGCTCATGCCCGGAGAAAAATCCACGATGTGCATGCAAGAGCGCCCACCTACATCACCACGGAAGCCCTGCAGCGTATCGGTGAACTGTATGCCATCGAGGCAGAGGTCCGGGGCTGTTCAGCAGAACAGCGTCTGGCGGCAAGAAAAGCCAGAGCCGCGCCACTGATGCAGTCACTGTATGACTGGATACAGCAACAGATGAAAACACTGTCGCGTCACTCAGATACGGCAAAAGCGTTCGCATACCTGCTGAAACAGTGGGATGCACTGAACGTGTACTGCAGTAATGGCTGGGTGGAAATCGACAACAACATCGCAGAGAACGCCTTACGGGGAGTGGCCGTAGGCCGGAAAAACTGGATGTTCGCGGGTTCCGACAGCGGTGGTGAACATGCGGCGGTGTTGTACTCGCTGATCGGCACATGCCGTCTGAACAATGTGGAGCCAGAAAAGTGGCTGCGTTACGTCATTGAACATATCCAGGACTGGCCGGCAAACCGGGTACGCGATCTGTTGCCCTGGAAAGTTGATCTGAGCTCTCAGTAAATATCAATACGGTTCTGATGAGTCGCTTACGTCCCAGTCGTCCCAGCCGTGCCAGGTGCTGCCACAGATTCAGGTTATGCCGCTCAATTCGCTGCGTATATCGCTTGCTGATTACGTGCAGCTTTCCCTTCAGGCGGGATTCATACAGCGGCCAGCCATCCGTCATCCATATCACCACGTCAAAGGGTGACAGCAGGCTCATAAGACGCCCCAGCGTCGCCATAGTGCGTTCACCGAATACGTGCGCAACAACCGTCTTCCGGAGACTGTCATACGCGTAAAACAGCCAGCGCTGGCGCGATTTAGCCCCGACATAGCCCCACTGTTCGTCCATTTCCGCGCAGACGATGACGTCACTGCCCGGCTGTATGCGCGAGGTTACCGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAATCGTGTTGAGGCCAACGCCCATAATGCGGGCTGTTGCCCGGCATCCAACGCCATTCATGGCCATATCAATGATTTTCTGGTGCGTACCGGGTTGAGAAGCGGTGTAAGTGAACTGCAGTTGCCATGTTTTACGGCAGTGAGAGCAGAGATAGCGCTGATGTCCGGCGGTGCTTTTGCCGTTACGCACCACCCCGTCAGTAGCTGAACAGGAGGGACAGCTGATAGAAACAGAAGCCACTGGAGCACCTCAAAAACACCATCATACACTAAATCAGTAAGTTGGCAGCATCACCGGGTGGCCTGATTTAGGTGGTGGATCAGCAGTATCACAGTTTGCAATGATGGCCAGAAAATTTGGTAAACAACTTATTGTCGCTGTAAGTGAAAAGGATGCCATCAAATTAAAAGACAATTTTGATATTATTGGGATGTTGTCATATGGTAAAGAAAATTTTATTGCCATGAGTCATACGAAAAGCGAGTTAACTGGAGGGAAACGTCGGTACCGGATTAAAAATTGATATAAATAAAAAGCATACTTATTGGCTGGAGTTTCGGTAGTAAAAAGCGGGAAATGAAGGTAATAGTCAGCAACAGGGAATGTGGTATTATCGCGGCGGGTGTCTGAGCCTTTCTGGTTCAGGCAAGACGCAGGTACCAGAAATGCGAAGACCCCACTCGTTAATCCATTAACTCGTGAGGTCTGCATGAAGTACCTTAACACTACTGATTGTAGCCTCTTCCTTGCAGGGAGGTCAAAGTTTATGACGAAATATGCCCTTATCGGGTTGCTCGCCGTGTGCGCCACGGTGTTGTGTTTTTCACTGATATTCAGGGAACGGTTATGTGAACTGAATATTCACAGGGGAAATACAGTGGTGCAGGTAACTCTGGCCTACGAAGCACGGAAGTAAGCTGCCGGGCGGGGACGGAAGTCCCCGCTTTCCGGAAGTGTGAGGTATTTCAGGGGCAGACACCCGACATGCCAGAAACAGCCGGTCCCGCCCGGGGCCGGCATCCTGGTTAAGGCATTTCCTGCTTTTCAGTCATTTCATTATCAAAATCACATTAAACGGTTGTAATCAGACATGATTTGTGCGCCAACACCGATCATCGTCACAACTTTCAAGTCGCTGATTTCAAAAAACTGTAGTATCCTCTGCGAAACGATCTCTGTTTGATTATTGAGGAGGCGAGATGTCGCAGACAGAAAATGCAGTGACTTCCTCATCGGGAAAAAAACGACCTTACAGAAGGGGTAATCCTGTTCCTGCGAGAGAACGACAAAAAGCGTCTCTTGCAAGAAGGAGTGCCACTCATAAAGCGTTTCATGCCGTCATTCAGCTTCGACTGAAAGAAAAGTTAAGTGAGCTTGCTGATGAAGACGGGATCACTCAGGCACAGATGCTTGAGTGGCTGATAGAGTCAGAGGTTAAGCGCAGAAAATCTTTGTGAGTATTTGCGTTTCTTGCTGTTTCAGTGATGAGATGTTAGATTGCTGATCGTTTTAAGGAATTTTGTGGCTGGCCACGCCGTAAGGTGGCAGGGAACTGGTTCTGATGTGGATTTACAGGAGCCAGAAAAGCAAAAACCCCGATAATCTTCACCAGGTTTGGCGACTAAGAGAAGATTACCGGGGCCCACTTAAACCGTATAGCCAACAATTCAGCTATGCGGGGAGTATAGTTATATGCCCGGAAAAGTTCAAGACTTCTTTCTGTGCTCACTCCTTCTGCGCATTGTAAGCGTAGGATGGTGTGACTGATCTTCAACAAACGTATTACCGCCAGGTAAAGAACCCGAATCCGGTGTTTACACCCCGTGAAGGTGCAGGAACGCTGAAGTTCTGCGAAAAACTGATGGAAAAGGCGGTGGGCTTCACCTCCCGTTTTGATTGCGCTATTCATGTGGCGCATGCCCGTTCGAAGGGACTGCGTCGGCGCATGCCACCGGTGCTGCGTCGACGGGCTATTGATGCGCTGCTGCAGGGGCTGTGTTTCCACTATGACCCGCTGGCCAACCGTGTCCAGTGCTCCATCACCACGCTGGCCATTGAGTGCGGGCTGGCGACGGAGTCTGCTGCCGGAACACTCTCCATCACCCGTGCCACCCGTGCCCTGACGTTCCTGTCAGAGCTGGGGCTGATTAGCTACCAGACGGAATATGACCCGCTTATTGGCTGCAACATTCCGACCGATATCTCGTTATAAGGGCCATCTTCGTACCCTTCTTTCGACGCAATAGCTTTCAGTCTTGCGCTGGCACACGCTGAACTACCCGCATTACCATTACTACAGGCAGCAATAACCTCCTTATCACTGAATATCTTTTTCAAGCAATGCATCATATTTCTGCTGTGCCTTTTCACGCTCTGCCGGATCTTTACTGTTTTTAATCGTCTGTTTTGCTATCTCAAGTTCTGTCTTTTCAGACACGCTCAGGTAGTTATTCTCAACAGCATTCTTCCCGCTCTGTGTGCTCCCACTGCAGCTGATGCGGTGCTGTTTCCGGTTAGACCGCCGGTCAGCCCTGCTGACACGGTTGCCAGCGTACTTATCGTCTGCTTCTGCTCTTCACTGAGGTCAGACAGCTTCACTCCCGGATACAGCATTCCGATCGCTCTGGCTGCCAGCTCTCCTGTTGCTGCGCCTGCTGCACCAGCAGCAACATTATTACTCTGCATTGCTGCTACTGCACCGCCCAGAATGGCGTGAGCAATGGCCTTTACTGCCGGGTCTTTTTCTGTTGATTTCAGCAGGTATGCCAGTTCCGGAGCCGAAGCTCCCGCCAGTACTGCACCAACGTCACCACCTGCCAGCTATGGCTGGAGATGGTGGATGTTACCGATAATCAAGAAGCCTTAGATATAAGGCAATGTTTTATCTTTCTTTCATACGAAAACCAGTGCTGCCATACTAGCCCTTTTCCTGTGACATCTAAGCGAGCTCATGTTTATCGGGAAGGGTGTTAAATCATTGTTAATCATACACAGGTAATTATGCAGCGGTTATGCATAACTTCATTTCGAGCGTGTGATAACGCCAACACAACCACCAGAATTATAAAATAAAATCACAAAACCTGAGAACATATCACATTTTATCACATTAATAATTTTTCTTTATGATTTCATAACTATTTACAGCCTTTTATTAAGCCAAAAAAAGTTAAATTGCAATTATTTCATAATTATTCACGCTACAGTTTTTTTACTTTTAAAATCATATAAATATATAATCACCTTAAAACAGCAAACTCCACAAAATAATAACAAATAGAGATATTTCCACCCAAATAATGACTATCTATTCATGAGCTGACATTATGAAATTAATCATTATTTAATGCAAGTAACACAAATTAAAACAAATACCTTAATATCCATTATTGTGTGAATCTCATCACAATAATGTTTATTTTACCGCCTAGTATGAACCACGAAATCAATTGAAGTTAGCCGCCATGAAATTAAATACACAACATGGCGCATACCTTTTTATTCTAAAAAAATTAAAGGAATAATTATGGAAAAACATTACGTCGGTTCTGAAATTGGTCAATTGCGTAGTGTTATGCTGCACCGCCCAAATTTAAGTCTGAAACGTTTGACACCATCGAATTGTCAGGAACTGCTTTTCGATGATGTACTCTCGGTTGAACGGGCAGGTGAAGAGCATGACATCTTCGCAAATACGCTGCGCGATCAGGGGGTGGAAGTCCTGCTGTTAACAGACCTCCTGACACAAACCCTTGATATTAAAGAAGCGAAAACTTGGTTACTGGAGACGCAAATTTCTGACTACCGCCTCGGACCTACCTTTGCGGGCGATGTGCGCAGCTGGCTGGCGGACATGCCGCACCGTGAACTGGCGCGAAGATTAAGCGGCGGATTAACTTACGGTGAAATTCCGGCTGCCATTAACAATATGGTGGTGGATACCCACACGTCTAATGACTTTATTATGAAGCCGCTACCGAATCATTTATTTACCCGCGATACCTCCTGCTGGATTTATAACGGTGTTTCTATTAACCCGATGGCCAAACCAGCCCGTCAACGTGAAACCAATAACCTCCGGGCAATATATCGCTGGCACCCGGCATTTGCCGACGGCGATTTTATTAAGTATTTCGGCGACGAAAATATTTATTACGACCACGCCACTTTGGAAGGTGGCGACGTATTAGTGATTGGTCGTGGGGCGGTATTGATCGGCATGTCTGAACGCACAACACCGCAGGGCGTGGAGTTCCTCGCCAACAGCCTGTTCAAACATCGTCAGGCCGAGCGAGTGATCGCCGTTGAGCTGCCAAAACACCGCTCCTGTATGCACCTTGACACCGTCATGACCCACATCGACGTTGACACTTTCTCCGTTTACCCGGAAGTGGTGCGCAAAGACGCCCAGTGCTGGACGCTCACTTCGAACGGACGCGATGGCCTACAACGGACCCAGGAAACCGACCTGTTGCACGCCATCGAGAAAGCACTCGGTATTGACCAGGTACGCTTGATCACCACCGGCGGCGACGCCTTTGAAGCCGAACGTGAGCAGTGGAACGACGCCAATAACGTTCTGACCATCCGCCCCGGTGTGGTGATTGGTTACGAGCGCAACGTCTGGACTAACGAGAAATACGACAAAGCCGGCATCACCGTGCTGCCCATCCCGGGGGACGAATTGGGACGAGGCCGCGGCGGCGCACGCTGCATGAGCTGTCCGCTTGAACGCGACGGAATTTAAAGGAGCCATCATGGAACGAAAACCCACTTTGGTTGTGGCGTTGGGCGGCAACGCATTATTGAAGCGCGGCGAACCACTGGAAGCAGAAATCCAGCGCCAGAACATTGAGTTGGCCGCCCGTACCATCGCCGGGCTCACGGTGAATTGGCGCGTGGTGTTGGTTCACGGCAACGGTCCACAGATCGGGCTGCTGGCGCTGCAGAACAGCGCCTACGACAAAGTGACCCCTTATCCACTGGACGTTCTTGGCGCCGAAAGCCAGGGGATGATCGGCTACATGCTCCAGCAGGCGCTGAAAAACAGCCTGCCACAGCGTGAGGTGAGCGTCCTGCTTACTCAGGTGGAAGTGGACGCTACTGACCCGGCGTTCAGCAACCCGACCAAATATATCGGACCGGTGTACAACGAAGACCAGGCAAAAACACTGGCAGCAGAAAAAGGTTGGGGGTTTAAGGCCGACGGCAGCTACTTCCGTCGCGTGGTGCCATCTCCACAGCCGAAACGCATTGTCGAGAGCGATGCTATTACGGCACTGATCCAGCGCGACCATCTTGTTATCTGCAACGGCGGCGGTGGTGTACCAGTTGTGGAAAAGGCTAACGGCTATCGCGGAATTGAGGCGGTGATCGACAAAGACCTCTCTGCTGCCCTGCTGGCATACCAGATAGGGGCCGACGCACTACTGATTCTCACTGATGCCGACGCGGTTTACCTCGATTGGGGCAAACCGACCCAGCGTCCGCTAGCGCAGGTGACGCCAGAACTGCTCAGAGGCATGCAGTTTGACACCGGATCGATGGGGCCGAAAGTGGCCGCCTGCTGCAAGTTTGTTGAAGCTTGCAACGGTATTGCCGGGATCGGCGCTCTGGTCGACGGGGCTGAGATTTTGGCGGGCAATAAAGGCACATTGATTCGTAACTGAATCCCCCTTCACCTAACCCTCTCCTCAAAGGGGAGATGGCAGAGTGAGGGCATCAGACAGTTAAAATTTAAAAAGGATTTCCTAATGACCATCAATTTGAAAAAACGCAACTTCCTTAAACTGCTGGACTACACCCCGGCAGAGATCCAGTACCTGATCGATCTCGCGATCAAACTGAAAGCGGCCAAAAAAGCCGGACGAGAAAAACAGACCTTGGTTGGCAAAAACATTGCCCTGATTTTTGAAAAAACCTCCACCCGTACCCGCTGTGCTTTCGAAGTGGCTGCGTTCGACCAAGGGGCGCAGGTGACCTACCTCGGCCCAGGCGGATCGCAAATCGGCCATAAAGAGTCAATGAAAGACACCGCCCGTGTGCTGGGCCGTATGTATGACGGCATCGAATACCGTGGTTACGGTCAGGCCATCGTTGAGGAGTTGGGCAAATACGCGGGCGTACCGGTGTGGAACGGTCTGACCGACGAATTTCACCCAACCCAAATCCTCGCAGATTTGATGACCATGCTGGAACATTCCCCGGGCAAAAAACTGTCGGAACTGAGCTTTGCCTACCTTGGCGACGCACGCAACAACATGGGTAACTCCCTGATGGTGGGGGCTGCCAAAATGGGGATGGATATCCGTCTTGTAGCCCCAAAATCCTTCTGGCCGGATGTGGTGCTTGTTGAACAGTGCCGTTCCATCGCGGAAGAGACGGGCGCACGTATCACCCTGACCGATGACGTGGAAGAAGGCGTGTGGGGAACGGATTTCCTCTACACCGATGTTTGGGTCTCAATGGGTGAACCGAAAGAGGCGTGGACCGAACGCGTCAGCCTGATGAAGCCTTATCAAATCAACGCTGACGTGATGAACGCCACCGGCAACCCGAACGTCAAGTTCATGCACTGCCTGCCAGCCTTCCACAATGAGCACACCAAAGTGGGCCGAGAAATTGAGATGGCATACGGCCTGAAGGGACTGGAGGTGACGGAAGAGGTCTTCGAATCCCCTAACTCTATCGTCTTTGATGAAGCAGAAAACCGCATGCATACCATTAAAGCGGTCATGGTGGCGACACTCGGCGACTAATCACCACCCGGTGCGTCGTAGGGGGCACCGGGTCTCAGGAGAACATCATGGGTAAGTTCAAATTTCCCTCCGCATATACCATTCTCTTTTTTCTGATTGCCATCGTTGCCGTCCTGACGTGGATCATTCCAGCCGGGCAGTATCACATGGCAATGAACGAAGCTCTCGGCAAGGAGGTTCCTGTTGCCGGCACCTATGCACACGTAGAGGCCAATCCGCAGGGACTGATTTCAGTGCTGATGGCCCCAATTGCCGGGTTGTATGATCCAGACTCCGGTCAGGCTAGGGCGATAGACGTTGCGCTGTTTATTCTGATCATCGGAGGATTCCTCGGGATCGTCACCAAAACCGGGGCCATTGACGCCGGAATCGAGCGCGTCACCACCCGACTACGTGGTCGCGAAGAGTGGATGATCCCGATCCTTATGGCGCTGTTTGCTGCTGGCGGTACAATTTACGGCATGGCCGAAGAATCCCTGCCGTTTTATACCCTGCTGGTGCCGGTGATGTTGGCAGCACGCTTCGACCCTGTGGTAGCCGCCTCCACCGTGCTGCTCGGCGCCGGGATCGGCACGCTCGGCTCCACCATTAACCCTTTCGCGACGGTGATCGCCGCCAATGCAGCCGGGATCCCCTTCACCAACGGTATCACCTTGCGTGTGGTGGTGCTTGTCATCGGCTGGATAATCTGCGTGACATGGGTGATGCGCTATGCCCGGAAAGTTCGCAAGGAGCCGTCTCTCTCCATTATTGCGGATAAACAAGAGGAGAACCTCGCCCACTTCCTCGGCAATAAAAGCGAACAGGCTCTGGAGTTCACCCCGGTACGCAAAATTATTCTGGTGATTTTCGCCCTTACCTTCGCGGTCATGATCTACGGCGTGGCGGTGCTGGGTTGGTGGATGGCGGAGATCTCAACGGTATTTCTGGCCAGCGCAATTATCATCGGTCTGATCGCCCGCATGAGCGAAGAGGAACTGACCTCTACCTTTATCAACGGCGCGCGAGATTTGCTGGGCGTTGCACTGATTATCGGTATCGCGCGCGGTATCGTAGTGATCATGGATAAAGGTATGATTACCCATACTATTTTGCACTACGCCGAGGGAATGGTTACTGGATTATCGACAGTAGCATTCATCAACGTGATGTATTGGCTGGAAGTGGTGCTGTCGTTTCTTGTGCCTTCTTCGTCCGGTCTGGCCGTTCTGACGATGCCGATCATGGCACCTCTTGCCGACTTCGCTAACGTCAACCGCGACCTGGTAGTTACGGCTTACCAGTCTGCGTCCGGTATCGTTAACCTGATCACTCCCACCTCTGCCGTTGTGATGGGCGGACTGGCTATCGCCCACGTGCCTTACGTGCGATATCTGAAATGGGTTGCGCCGCTGCTTGGGATATTAACAGTGGTAATTATGGTGGCATTAAGTCTGGGGGCATTGTTGTAATTTGCCGGATGGCGCTGCCCCTATCCGGTCTACGGAATGGTGTGGTGTCACCGGTTATTTCGAGAACTGAATATAGGATTATGATGGATTACGAAGATTTCTCTCCCAAAGAGCAACTACAGCTAACGGTCTGCCAACGTCTGATTGCAGAAAAGAGCTATTTTTCCCAAGAGGAGCTTCGCCGCGACTTACAGGAGCGTGGTTTTGAGACAATCAGCCAGTCCACCGTTTCTCGTCTGCTCAATTTGTTAGGTGTCATAAAAATTCGAAATGCCAAAGGGCTAAAAGTCTATTCGCTGAATCCACAGTTGCGTCCGGCTCCTGATGCCGCGCGCACTGTGTCCGAAATGGTGGTGAGCGTTGAGCACAATAGAGAATTTATCCTTATCCATACCGTTGCTGGATATGGCCGCGCAGTGGCACGTGTCCTTGATTATCACCAGTTACCAGAAATTTTAGGCGTGGTGGCCGGAAGCAGTATCGTCTGGGTCGCTCCTCGGATGGTGAAGCGTGTCGCTCTGGTGCATAAGCAAATTAATTATTTACTAAGAACGTATTAATATTCACAAATGCCCGCTTGTATTGACTAATACGTAAGCGTCAACGGAGCACCGTATTGACGCTTATTTATTGGTGAGAACTACGTTCCATGGCAGGAGTTCGTCAACACGGTTGGAGGGCCATTCCGGCAGTACGCTCAGAATATGGCGCAGATACGCTTCCGGATCGATACCGTTCAGACGGCAGGTGCCGATCAGCCCGTACAGCAGTGCACCACACTCGCCGCCGTGATCGCTACCGAAGAACACGTAATTTTTCTTTCCGAGACAGAATGCACGAAGCGCTCTTTCCGCTGTGTTATTGTCCGCCTCCGCCAGACCGTCATCACTGTAATAACAGAGGGCGTCCCACTGATTCAGTACATAGCTGAACGCTTCGCCCAGCCTGGATTTTTTCGACAGCGTGCCTGAGCCCGCAGATATTCTATTTCCCGTTCATCTTCTTCGATCTTTTCTTCGGCACGTGCCAGTGCAGAGCGCAGGAAGGCCTCCGTCTCTTCAACCAGACTCAGTTGCTGGTCTTTCTGACGGAGCTGGCTTTCCAGTTCTGCAATGCGAATGAGGTATTTCTGACTCATGGCCGTTTTTATAATGCGGCCAGGCGTTTTTTACAACATTGTCAGGGCGTTAAGGCGGGATGTTTTTGGCTGACGCCAGTCCAGCTTATCGAGGAGCATTGCCAGTTGCGAGCGGGTAATGGATACCTTGCCGTCACGTACCACAGGCCAGATAAACTGGCCTTCCTCCAGGCGTTTGGTGAACAGGCACAGACCATCAGCATCAGCCCAAAGAATTTTAACGGTGTCACCCCGTCGGCCACGGAAGATAAACAGGTGACCGGAGAAGGGATTATCATTCAGCACATGTTGTACCTGTTCTCCCAGTCCGTTGAAGGATTTACGCATATCGGTAACGCCGGCAACGAGCCAGATACGGGTACCTGATGGGAGTGAGATCATCTTCCCCTCCCGGTCAGTTCACGAATCAATACAGTGAGCAGCTCTGGTGAAGGATTTTCCAGCGTCATGTTACCGTGACGGAACTCCACCTTGCAGGAGCTGGCACTGACAGTAGTCTGAGTGGATAAGGACGGAGTAAGAGCAGCCATCGGTTCTTTCGGCTCATCAGGCGTTATCTCTACAGGTAATAATTCAACGCCTGCGTCAGAAGTGGTTGTCACCGGAATACGCCGTGATATACGCCCTTCGTTTTGCCAGAGTCTGAGCCATTTGAAAATAACATTATCATTGACGCCATTTTCTCGTGCAATCTGGGCCACACAAGCACCGGGTTGTGATGCCAGTTCAACCATACGAATTTTGAATTCATTCGAAGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCCTGCGGAGCAAGGCCGTCGCGAACGAGTGGCGGAGGGTGTGCGGTGTGGCGGGCTTCGTGATGCCTGCTTGTTCTACGGCACGTTTGAAGGCGCGCTGAAAGGTCTGGTCATACATGTGATGGCGACGCACGACACCGCTCCGTGGATCGGTCGAATGCGTGTGCTGCGCAAAAACCCAGAACCACGGCCAGGAATGCCCGGCGCGCGGATACTTCCGCTCAAGGGCGTCGGGAAGCGCAACGCCGCTGCGGCCCTCGGCCTGGTCCTTCAGCCACCATGCCCGTGCACGCGACAGCTGCTCGCGCAGGCTGGGTGCCAAGCTCTCGGGTAACATCAAGGCCCGATCCTTGGAGCCCTTGCCCTCCCGCACGATGATCGTGCCGTGATCGAAATCCAGATCCTTGACCCGCAGTTGCAAACCCTCACTGATCCGCATGCCCGTTCCATACAGAAGCTGGGCGAACAAACGATGCTCGCCTTCCAGAAAACCGAGGATGCGAACCACTTCATCCGGGGTCAGCACCACCGGCAAGCGCCGCGACGGCCGAGGTCTTCCGATCTCCTGAAGCCAGGGCAGATCCGTGCACAGCACCTTGCCGTAGAAGAACAGCAAGGCCGCCAATGCCTGACGATGCGTGGAGACCGAAACCTTGCGCTCGTTCGCCAGCCAGGACAGAAATGCCTCGACTTCGCTGCTGCCCAAGGTTGCCGGGTGACGCACACCGTGGAAACGGATGAAGGCACGAACCCAGTGGACATAAGCCTGTTCGGTTGGTAAGCTGTAATGCAAGTAGCGTATGCGCTCACGCAACTGGTCCAGAACCTTGACCGAACGCAGCGGTGGTAACGGCGCAGTGGCGGTTTTCATGGCTTGTTATGACTGTTTTTTTGTACAGTCTATGCCTCGGGCATCCAAGCAGCAAGCGCGTTACGCCGTGGGTCGATGTTTGATGTTATGGAGCAGCAACGATGTTACGCAGCAGGGCAGTCGCCCTAAAACAAAGTTAGCCATATGAACTCGGAATCAGTACGCATTTATCTCGTTGCTGCGATGGGAGCCAATCGGGTTATTGGCAATGGTCCTAATATCCCCTGGAAAATTCCGGGTGAGCAGAAGATTTTTCGCAGACTCACTGAGGGAAAAGTCGTTGTCATGGGGCGAAAGACCTTTGAGTCTATCGGCAAGCCTCTACCGAACCGTCACACATTGGTAATCTCACGCCAAGCTAACTACCGCGCCACTGGCTGCGTAGTTGTTTCAACGCTGTCGCACGCTATCGCTTTGGCATCCGAACTCGGCAATGAACTCTACGTCGCGGGCGGAGCTGAGATATACACTCTGGCACTACCTCACGCCCACGGCGTGTTTCTATCTGAGGTACATCAAACCTTCGAGGGTGACGCCTTCTTCCCAATGCTCAACGAAACAGAATTCGAGCTTGTCTCAACCGAAACCATTCAAGCTGTAATTCCGTACACCCACTCCGTTTATGCGCGTCGAAACGGCTAACCATTCCGTCAACGGGACGCCAAAATGCTGCGCATTTTGGTTCCCTCCGCTGCGCTCCGGCTCTCGTTACGTCCAACGTTAGCACCACTTAAACCCAGCTTTATTTAGCTCATGTTTATTCAAACGGCATTTAGCTTTTCAGGCGTTATTCAGTGCCTGTTTTGCCTTTTTTCCGGGCTTCGCCTGCATGGGCTGCGCAGGTTTTCAGTCTTTTTGGCCTCTAGCCCTTGCGTAGCAAGCGCAAGCAGCTATCGTTTTTGCAGTGCTGTGCCGCCTCGGTGGCGCAGCGTTTTTTCACGGTTAGCGCCCGTCGCCAAATTCAAGTTATCCGTTTTGGCTTCTGGTTCTAACATTTCGGTCAAGCCGACCCGCATTCTGCGGTCGGCTTACCTCGCCCGTTAGACATCATGAGGGAAGCGGTGACCATCGAAATTTCGAACCAACTATCAGAGGTGCTAAGCGTCATTGAGCGCCATCTGGAATCAACGTTGCTGGCCGTGCATTTGTACGGCTCCGCAGTGGATGGCGGCCTGAAGCCATACAGCGATATTGATTTGTTGGTTACTGTGGCCGTAAAGCTTGATGAAACGACGCGGCGAGCATTGCTCAATGATCTTATGGAGGCTTCGGCTTTCCCTGGCGAGAGCGAGACGCTCCGCGCTATAGAAGTCACCCTTGTCGTGCATGACGACATCATCCCGTGGCGTTATCCGGCTAAGCGCGAGCTGCAATTTGGAGAATGGCAGCGCAATGACATTCTTGCGGGTATCTTCGAGCCAGCCATGATCGACATTGATCTAGCTATCCTGCTTACAAAAGCAAGAGAACATAGCGTTGCCTTGGTAGGTCCGGCAGCGGAGGAATTCTTTGACCCGGTTCCTGAACAGGATCTATTCGAGGCGCTGAGGGAAACCTTGAAGCTATGGAACTCGCAGCCCGACTGGGCCGGCGATGAGCGAAATGTAGTGCTTACGTTGTCCCGCATTTGGTACAGCGCAATAACCGGCAAAATCGCGCCGAAGGATGTCGCTGCCGACTGGGCAATAAAACGCCTACCTGCCCAGTATCAGCCCGTCTTACTTGAAGCTAAGCAAGCTTATCTGGGACAAAAAGAAGATCACTTGGCCTCACGCGCAGATCACTTGGAAGAATTTATTCGCTTTGTGAAAGGCGAGATCATCAAGTCAGTTGGTAAATGATGTCTAACAATTCGTTCAAGCCGACCGCGCTACGCGCGGCGGCTTAACTCCGGCGTTAGATGCACTAAGCACATAATTGCTCACAGCCAAACTATCAGGTCAAGTCTGCTTTTATTATTTTTAAGCGTGCATAATAAGCCCTACACAAATTGGGAGATATATCATGAAAGGCTGGCTTTTTCTTGTTATCGCAATAGTTGGCGAAGTAATCGCAACATCCGCATTAAAATCTAGCGAGGGCTTTACTAAGCTTGCCCCTTCCGCCGTTGTCATAATCGGTTATGGCATCGCATTTTATTTTCTTTCTCTGGTTCTGAAATCCATCCCTGTCGGTGTTGCTTATGCAGTCTGGTCGGGACTCGGCGTCGTCATAATTACAGCCATTGCCTGGTTGCTTCATGGGCAAAAGCTTGATGCGTGGGGCTTTGTAGGTATGGGGCTCATAATTGCTGCCTTTTTGCTCGCCCGATCCCCATCGTGGAAGTCGCTGCGGAGGCCGACGCCATGGTGACGGTGTTCGGCATTCTGAATCTCACCGAGGACTCCTTCTTCGATGAGAGCCGGCGGCTAGACCCCGCCGGCGCTGTCACCGCGGCGATCGAAATGCTGCGAGTCGGATCAGACGTCGTGGATGTCGGACCGGCCGCCAGCCATCCGGACGCGAGGCCTGTATCGCCGGCCGATGAGATCAGACGTATTGCGCCGCTCTTAGACGCCCTGTCCGATCAGATGCACCGTGTTTCAATCGACAGCTTCCAACCGGAAACCCAGCGCTATGCGCTCAAGCGCGGCGTGGGCTACCTGAACGATATCCAAGGATTTCCTGACCCTGCGCTCTATCCCGATATTGCTGAGGCGGACTGCAGGCTGGTGGTTATGCACTCAGCGCAGCGGGATGGCATCGCCACCCGCACCGGTCACCTTCGACCCGAAGACGCGCTCGACGAGATTGTGCGGTTCTTCGAGGCGCGGGTTTCCGCCTTGCGACGGAGCGGGGTCGCTGCCGACCGGCTCATCCTCGATCCGGGGATGGGATTTTTCTTGAGCCCCGCACCGGAAACATCGCTGCACGTGCTGTCGAACCTTCAAAAGCTGAAGTCGGCGTTGGGGCTTCCGCTATTGGTCTCGGTGTCGCGGAAATCCTTCTTGGGCGCCACCGTTGGCCTTCCTGTAAAGGATCTGGGTCCAGCGAGCCTTGCGGCGGAACTTCACGCGATCGGCAATGGCGCTGACTACGTCCGCACCCACGCGCCTGGAGATCTGCGAAGCGCAATCACCTTCTCGGAAACCCTCGCGAAATTTCGCAGTCGCGACGCCAGAGACCGAGGGTTAGATCATGCCTAGCATTCACCTTCCGGCCGCCCGCTAAATATCTCCTTTTGGGTTGTTAATAAAACATCCAATAAGTTGACTGTGCGTGAAAAAGAAAGTTTTGTGTGATGGCGTTGAAGATCGCACCGTTAAGCTCTTATGTGGGATGGTGCAGAGCTCGACGACTACCGATAAAACGCAACCGCCGCAAACAGACAAGAAAAAGCCCCAACTGATAACAGTTGGGGCTTCAGTATTGTGATTGGTGGAGCAATAGCACCCTGAACCCAAAACCTTCTCGCTCAACCGGTAGTGGCTGATAACAACTCGTGAGGGCTATTGCGGGTTAAGCATTTAGCGATGTCTAGGGCCAGACTGGACGTCTGAACGCAAGCCGCTGATACTGTACATAACCACAGTATCAGCGGAGGATACCCATGTCGCTGGCAAGGAACGCCACGGCGAGTCAATCGCCCACTCAAACAAACGGTTACGAACGCCACCAACCCGACCAGACGCTGCTCTACCAGCTGGTTGAGCAGCACTACCCAGCCTTCAAAGCCTCACTCGAAGCCCAAGGTCAACACCTGCCTCGCTACATCCAACAAGAATTCAACGACCTCCTCCAATGTGGCCGTCTGGAGTATGGTTTCATGCGGGTTCGCTGCGAGGATTGTCATCACGAGCGTCTGGTCGCCTTCAGCTGTAAACGACGCGGCTTTTGCCCTAGCTGCGGTGCCCGCCGGATGGCCGAGAGTGCGGCGCTGCTGATAGACGAAGTCTTCCCCAAGGAGCCCATTCGCCAGTGGGTGCTCAGCTTTCCTTTCCAGCTACGCTTTTTGCTGGCTCGCCATCCCCAGCTGATGGGCCAGGTCTTGAGTATCGTCTATCGTACACTCTCAACTCATCTGATCAAAAAAGCCGGTTACACCAAAGCCTCTGCACAAACTGGCTCAGTGACTCTTATCCAACGCTTTGGCTCCGCGCTAAATCTCAATGTCCACTACCACATGCTGTTTCTCGATGGTGTCTATGCCGAAGATGACTATGGCAAGCAACGCTTCCATCGTGTCAAGGCACCCACTTACGATGAGCTGAATACGCTCGCTCACACCCTCAGCCATCGCATCGCTCGCTGCATGGAAAAGCGTGGGATTTTGGAGCGTGATGCCGAGAATACGTGGTTGACACTGGAAGAGGGCGAAGACGATACGCTGACTCAATTACATGGTGCTTCGGTTACGTATCGCATTGCCGTCGGCCCCCAGCAAGGGCGCAAAGTCTTCACCCTGCAAACCTTGCCAGGGCGTGAGGATAAAGCCGACTCAAGCAGTCGAGTAGCCAACCATGCTGGTTTCTCGCTACACGCCGGTGTGATGGCCGAAGCGCATCAGCGGGATAAGCTTGAGCGCTTGTGTCGCTACATTAGTCGGCCAGCGGTTTCAGAAAAACGTCTGGCATTAACCGCCAATGGGCAGGTGCGTTACGAGCTCAAAACTCCGTACCGCAATGGCACCACCCATGTGATCTTCGAGCCGCTGGACTTCATCGCCAAACTCGCTGCGTTGGTACCTAAGCCGCGAGTCAACCTCACACGCTTCCACGGCGTCTTTGCACCGAACAGCAAACACCGAGTTCAAGTAACACCCGCCAAGCGGGGCAAGAAGCCCGACAAATCGGAAGGTCTCGATACTAACTGGCGTGACAAGAGTCCTGCAGAGCGCCACCGCGCCATGACCTGGATGCAACGCCTCAAGCGAGTCTTCAATATTGATATTGAAGTCTGCGAACACTGCGGCGGTCACGTCAAAGTGATTGCCAGCATCGAAGATCCGAAGGTCATTGAGCAGATTCTCAAGCATCTGAAACAGAAAACAGCCAAGGCGAATGCCGCCAAGCAGCGTGAGCTGCCACCAGAACGAGCGCCGCCACTGACTCCCAGCCTGTTCGATCCATCACAGAGTCGTCTCTTTGACTGA
Protein sequences of DBSCAN-SWA_1 >CP031653|160225:198185|191428_192133_+|AXP24785.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP031653|160225:198185|169080_169770_+|AXP24763.1|DBSCAN-SWA MYPMDRIQQKHARQIDLLENLTAVIQDYPNPACIRDETGKFIFCNTLFHESFLTQDQSAEKWLLSQRDFCELISVTEMEAYRNEHTHLNLVEDVFIQNRFWTISVQSFLNGHRNIILWQFYDAAHVRHKDSYNQKTIVSDDIRNIIRRMSDDSSVSSYVNDVFYLYSTGISHNAIARILNISISTSKKHASLICDYFSVSNKDELIILLYNKKFIYYLYEKAMCIINTR >CP031653|160225:198185|189551_190031_+|AXP24781.1|DBSCAN-SWA MMDYEDFSPKEQLQLTVCQRLIAEKSYFSQEELRRDLQERGFETISQSTVSRLLNLLGVIKIRNAKGLKVYSLNPQLRPAPDAARTVSEMVVSVEHNREFILIHTVAGYGRAVARVLDYHQLPEILGVVAGSSIVWVAPRMVKRVALVHKQINYLLRTY >CP031653|160225:198185|166064_166352_+|AXP24759.1|DBSCAN-SWA MSTRNIHVNTASYTLLVAGKKKNTGEEWDVLEFSSLTELKKYRKSHPEKMAFSYSYALSQGVDKQFRHINIAEADHFKQFLRQIKRAGLDIRAIC >CP031653|160225:198185|171009_171576_+|AXP24766.1|DBSCAN-SWA MEHGARLSTSRVMAIAFIFMSVLIVLSLSVNVIQGVNNYRLQNEQRTAVTPMAFNAPFAVSQNSADASYLQQMALSFIALRLNVSSETVDASHQALLQYIRPGAQNQMKVILAEEAKRIKNDNVNSAFFQTSVRVWPQYGRVEIRGVLKTWIGDSKPFTDIKHYILILKRENGVTWLDNFGETDDEKK >CP031653|160225:198185|195058_195406_+|AXP24790.1|DBSCAN-SWA MKGWLFLVIAIVGEVIATSALKSSEGFTKLAPSAVVIIGYGIAFYFLSLVLKSIPVGVAYAVWSGLGVVIITAIAWLLHGQKLDAWGFVGMGLIIAAFLLARSPSWKSLRRPTPW >CP031653|160225:198185|160225_161137_+|AXP24754.1|tRNA|DBSCAN-SWA MQKFDTRTFQGLILTLQDYWARQGCTIVQPLDMEVGAGTSHPMTCLRALGPEPMAAAYVQPSRRPTDGRYGENPNRLQHYYQFQVVIKPSPDNIQELYLGSLKELGMDPTIHDIRFVEDNWENPTLGAWGLGWEVWLNGMEVTQFTYFQQVGGLECKPVTGEITYGLERLAMYIQGVDSVYDLVWSDGPLGKTTYGDVFHQNEVEQSTYNFEYADVDFLFTCFEQYEKEAQQLLALENPLPLPAYERILKAAHSFNLLDARKAISVTERQRYILRIRTLTKAVAEAYYASREALGFPMCNKDK >CP031653|160225:198185|195399_196239_+|AXP24791.1|DBSCAN-SWA MVTVFGILNLTEDSFFDESRRLDPAGAVTAAIEMLRVGSDVVDVGPAASHPDARPVSPADEIRRIAPLLDALSDQMHRVSIDSFQPETQRYALKRGVGYLNDIQGFPDPALYPDIAEADCRLVVMHSAQRDGIATRTGHLRPEDALDEIVRFFEARVSALRRSGVAADRLILDPGMGFFLSPAPETSLHVLSNLQKLKSALGLPLLVSVSRKSFLGATVGLPVKDLGPASLAAELHAIGNGADYVRTHAPGDLRSAITFSETLAKFRSRDARDRGLDHA >CP031653|160225:198185|187015_188020_+|AXP24779.1|DBSCAN-SWA MTINLKKRNFLKLLDYTPAEIQYLIDLAIKLKAAKKAGREKQTLVGKNIALIFEKTSTRTRCAFEVAAFDQGAQVTYLGPGGSQIGHKESMKDTARVLGRMYDGIEYRGYGQAIVEELGKYAGVPVWNGLTDEFHPTQILADLMTMLEHSPGKKLSELSFAYLGDARNNMGNSLMVGAAKMGMDIRLVAPKSFWPDVVLVEQCRSIAEETGARITLTDDVEEGVWGTDFLYTDVWVSMGEPKEAWTERVSLMKPYQINADVMNATGNPNVKFMHCLPAFHNEHTKVGREIEMAYGLKGLEVTEEVFESPNSIVFDEAENRMHTIKAVMVATLGD >CP031653|160225:198185|178909_179257_+|AXP24772.1|DBSCAN-SWA MISLPAGSRIWLVAGITDMRNGFNGLASKVQNVLKDDPFSGHLFIFRGRRGDQIKVLWADSDGLCLFTKRLERGRFVWPVTRDGKVHLTPAQLSMLLEGINWKHPKRTERAGIRI >CP031653|160225:198185|181551_181785_+|AXP24774.1|DBSCAN-SWA MTGWPDLGGGSAVSQFAMMARKFGKQLIVAVSEKDAIKLKDNFDIIGMLSYGKENFIAMSHTKSELTGGKRRYRIKN >CP031653|160225:198185|172401_176061_+|AXP29059.1|DBSCAN-SWA MSGNGETVAEQEPVPDTVIVDQGEKLSLKETLTLLDGAARHNVQVLITDSGQRTGTGSALMAMKYAGVNTYRWQGGEQRPATIISEPDRNVRYDRLAGDFAASVKAGEESVAQVSGVREQAILTQAIRSELKTQGVLGHPEVTMTALSPVWLDSRSRYLRDMYRPGMVMEQWNPETRSHDRYVIDRVTAQSHSLTLRDAQGETQVVRISSLDSSWSLFRPEKMPVADGERLRVTGKIPGLRVSGGDRLQVASVSEDAMTVVVPGRAEPATLPVADSPFTALKLENGWVETPGHSVSDSATVFASVTQMAMDNATLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQIKARAGETSLETAISLQKTGLHTPAQQAIHLALPVLESKNLAFSMVDLLTEAKSFAAEGTGFTELGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAEKSILRHILEGKEAVTPLMERVPGELMETLTSGQRAATRMILETSDRFTVVQGYAGVGKTTQFRAVMSAVNMLPESERPRVVGLGPTHRAVGEMRSAGVDAQTLASFLHDTQLQQRSGETPDFSNTLFLLDESSMVGNTDMARAYALIAAGGGRAVASGDTDQLQAIAPGQPFRLQQTRSAADVAIMKEIVRQTPELREAVYSLINRDVEKALSGLESVKPSQVPRLEGAWAPEHSVTEFSHSQEAKLAEAQQKAMLKGEAFPDIPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVLNSMIHDAREKAGELGKEQVMVPVLNTANIRDGELRRLSTWENNPDALALVDSVYHRIAGISKDDGLITLEDAEGNTRLISPREAVAEGVTLYTPDKIRVGTGDRMRFTKSDRERGYVANSVWTVTAVSGDSVTLSDGQQTRVIRPGQERAEQHIDLAYAITAHGAQGASETFAIALEGTEGNRKLMAGFESAYVALSRMKQHVQVYTDNRQGWTDAINNAVQKGTAHDVLEPKPDREVMNAQRLFSTARELRDVAAGRAVLRQAGLAGGDSPARFIAPGRKYPQPYVALPAFDRNGRSAGIWLNPLTTDDGNGLRGFSGEGRPWNPGAITGGRVWGDIPDNSVQPGAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTELAVRDIAGQERDRTATSERETALPESVLRESQREREAVREVARENLLQRLLQQMERDMVRDLQKEKTLGGD >CP031653|160225:198185|164985_165144_+|AXP24757.1|DBSCAN-SWA MKLPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGHREVAAFMAYESGK >CP031653|160225:198185|166361_167291_+|AXP24760.1|DBSCAN-SWA MNIPPDCCNLVNGDRGDDIKRKNFLLNLFFHKGVAVMRLASRFGRYNSIRRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGFQPFFACQSRIRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVTPEQIITPRRREDKQNDLWTTWQRVQENMIKGGLSGRSASGKNTRTRAITGIDGDIRINKALWVIAEQFRKWKS >CP031653|160225:198185|190986_191370_-|AXP24784.1|transposase|DBSCAN-SWA MPSNEFKIRMVELASQPGACVAQIARENGVNDNVIFKWLRLWQNEGRISRRIPVTTTSDAGVELLPVEITPDEPKEPMAALTPSLSTQTTVSASSCKVEFRHGNMTLENPSPELLTVLIRELTGRGR >CP031653|160225:198185|186019_186931_+|AXP24778.1|DBSCAN-SWA MERKPTLVVALGGNALLKRGEPLEAEIQRQNIELAARTIAGLTVNWRVVLVHGNGPQIGLLALQNSAYDKVTPYPLDVLGAESQGMIGYMLQQALKNSLPQREVSVLLTQVEVDATDPAFSNPTKYIGPVYNEDQAKTLAAEKGWGFKADGSYFRRVVPSPQPKRIVESDAITALIQRDHLVICNGGGGVPVVEKANGYRGIEAVIDKDLSAALLAYQIGADALLILTDADAVYLDWGKPTQRPLAQVTPELLRGMQFDTGSMGPKVAACCKFVEACNGIAGIGALVDGAEILAGNKGTLIRN >CP031653|160225:198185|190639_190990_-|AXP24783.1|DBSCAN-SWA MISLPSGTRIWLVAGVTDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRRGDTVKILWADADGLCLFTKRLEEGQFIWPVVRDGKVSITRSQLAMLLDKLDWRQPKTSRLNALTML >CP031653|160225:198185|165363_165594_+|AXP24758.1|DBSCAN-SWA MACQKSHHLPCDYSLFLPALILTTAVLSCLPGVPSAPGCLAASGRQARLRQAAAGAPASLRRTRWRGFGLLLRSDA >CP031653|160225:198185|184788_186009_+|AXP24777.1|DBSCAN-SWA MEKHYVGSEIGQLRSVMLHRPNLSLKRLTPSNCQELLFDDVLSVERAGEEHDIFANTLRDQGVEVLLLTDLLTQTLDIKEAKTWLLETQISDYRLGPTFAGDVRSWLADMPHRELARRLSGGLTYGEIPAAINNMVVDTHTSNDFIMKPLPNHLFTRDTSCWIYNGVSINPMAKPARQRETNNLRAIYRWHPAFADGDFIKYFGDENIYYDHATLEGGDVLVIGRGAVLIGMSERTTPQGVEFLANSLFKHRQAERVIAVELPKHRSCMHLDTVMTHIDVDTFSVYPEVVRKDAQCWTLTSNGRDGLQRTQETDLLHAIEKALGIDQVRLITTGGDAFEAEREQWNDANNVLTIRPGVVIGYERNVWTNEKYDKAGITVLPIPGDELGRGRGGARCMSCPLERDGI >CP031653|160225:198185|177575_177788_+|AXP24769.1|DBSCAN-SWA MNGFRNSSRNGQVWCWQRAGSRAVILEVSGRWMEAAEGWRRAACVAPRTDWQQFARKRAEHCHRRCRGRV >CP031653|160225:198185|171619_172291_+|AXP29060.1|DBSCAN-SWA MAANGTLAPTVVPMVNGGQASIAISNTSPNLFTVPGDRIIAVNSLDGALTNNEQTASGGVVVATVNKKPFTFILETERGLNLSIQAVPREGAGRTIQLVSDLRGTGEEAGAWETSTPYESLLVTISQAVRGGKLPAGWYQVPVTKETLQAPAGLSSVADAVWTGNHLKMVRFAVENKTLSALNIRESDFWQPGTRAVMFSQPASQLLAGARMDVYVIRDGEGN >CP031653|160225:198185|170296_170662_+|AXP24764.1|DBSCAN-SWA MNAVLSVQGASAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVGL >CP031653|160225:198185|169868_170264_+|AXP29058.1|DBSCAN-SWA MKRFGTRSATGKMVKLKLPVDVESLLIEASNRSGRSRSFEAVIRLKDHLHRYPKFNRAGNIYGKSLVKYLTMRLDDETNQLLIAAKNRSGWCKTDEAADRVIDHLIKFPDFYNSEIFREADKEEDITFNTL >CP031653|160225:198185|193807_194098_+|AXP24788.1|DBSCAN-SWA MFIQTAFSFSGVIQCLFCLFSGLRLHGLRRFSVFLASSPCVASASSYRFCSAVPPRWRSVFSRLAPVAKFKLSVLASGSNISVKPTRILRSAYLAR >CP031653|160225:198185|182028_182178_+|AXP29061.1|DBSCAN-SWA MTKYALIGLLAVCATVLCFSLIFRERLCELNIHRGNTVVQVTLAYEARK >CP031653|160225:198185|182954_183029_+|AXP29062.1|DBSCAN-SWA MPGKVQDFFLCSLLLRIVSVGWCD >CP031653|160225:198185|192166_193054_-|AXP24786.1|integrase|DBSCAN-SWA MKTATAPLPPLRSVKVLDQLRERIRYLHYSLPTEQAYVHWVRAFIRFHGVRHPATLGSSEVEAFLSWLANERKVSVSTHRQALAALLFFYGKVLCTDLPWLQEIGRPRPSRRLPVVLTPDEVVRILGFLEGEHRLFAQLLYGTGMRISEGLQLRVKDLDFDHGTIIVREGKGSKDRALMLPESLAPSLREQLSRARAWWLKDQAEGRSGVALPDALERKYPRAGHSWPWFWVFAQHTHSTDPRSGVVRRHHMYDQTFQRAFKRAVEQAGITKPATPHTLRHSFATALLRRALLQS >CP031653|160225:198185|168510_168894_+|AXP24762.1|DBSCAN-SWA MAKVNLYISNDAYEKINAIIEKRRQEGAREKDVSFSATASMLLELGLRVHEAQMERKESAFNQTEFNKLLLECVVKTQSSVAKILGIESLSPHVSGNPKFEYANMVEDIREKVSSEMERFFPKNDDE >CP031653|160225:198185|178088_178271_-|AXP24770.1|DBSCAN-SWA MFHTIVSIKNDGHYFCRAGGNRPDGLNEPVTLGRKSLSFSKSVELHDKVIGHYLNIKHYQ >CP031653|160225:198185|167587_168190_-|AXP24761.1|DBSCAN-SWA MVLLVVRTTNLLSLFVEWVKLFTDKVTRGGKMKKWMLTICLMFINEICQATDCFDLAGRDYKIDPDLLRAISWQESRYRVNAIGINPVTGYGSGLMQVDSQHFNELARYGIKPEHLTTDPCMNIYTGAYYLAIAFKKWGVTWEAVGAYNAGFRKTERQNQRRLAYASDVYRIYTGIKSSKGIRLPATKKSLPEINSVQNN >CP031653|160225:198185|196643_198185_+|AXP24792.1|transposase|DBSCAN-SWA MSLARNATASQSPTQTNGYERHQPDQTLLYQLVEQHYPAFKASLEAQGQHLPRYIQQEFNDLLQCGRLEYGFMRVRCEDCHHERLVAFSCKRRGFCPSCGARRMAESAALLIDEVFPKEPIRQWVLSFPFQLRFLLARHPQLMGQVLSIVYRTLSTHLIKKAGYTKASAQTGSVTLIQRFGSALNLNVHYHMLFLDGVYAEDDYGKQRFHRVKAPTYDELNTLAHTLSHRIARCMEKRGILERDAENTWLTLEEGEDDTLTQLHGASVTYRIAVGPQQGRKVFTLQTLPGREDKADSSSRVANHAGFSLHAGVMAEAHQRDKLERLCRYISRPAVSEKRLALTANGQVRYELKTPYRNGTTHVIFEPLDFIAKLAALVPKPRVNLTRFHGVFAPNSKHRVQVTPAKRGKKPDKSEGLDTNWRDKSPAERHRAMTWMQRLKRVFNIDIEVCEHCGGHVKVIASIEDPKVIEQILKHLKQKTAKANAAKQRELPPERAPPLTPSLFDPSQSRLFD >CP031653|160225:198185|170676_170988_+|AXP24765.1|DBSCAN-SWA MSGDENKLKKYRFPETLTNQSRWFGLPLDELIPAAICIGWGITTSKYLFGIGAAVLVYFGIKKLKKGRGSSWLRDLIYWYMPTALLRGIFHNVPDSCFRQWIK >CP031653|160225:198185|176881_177442_+|AXP24768.1|DBSCAN-SWA MTEQKRPVLTLKRKTEGTAPVRSRKTIINVTTPPKWKVKKQKLAEKAAREAELAAKKAQARQALSIYLNLPSLDEAVNTLKPWWPGLFDGDTPRLLACGIRDVLLEDVAQRNIPLSHKKLRRALKAITRSESYLCAMKAGACRYDTEGYVTEHISQEEETYAAERLDKIRRQNRIKAELQAVLDEK >CP031653|160225:198185|181730_181973_-|AXP24775.1|DBSCAN-SWA MQTSRVNGLTSGVFAFLVPASCLNQKGSDTRRDNTTFPVADYYLHFPLFTTETPANKYAFYLYQFLIRYRRFPPVNSLFV >CP031653|160225:198185|163516_164885_+|AXP24756.1|transposase|DBSCAN-SWA MSKPKYPFEKRLEVVNHYFTTDDGYRIISARFGVPRTQVRTWVALYEKHGEKGLIPKPKGVSADPELRIKVVKAVIEQHMSLNQAAAHFMLAGSGSVARWLKVYEERGEAGLRALKIGTKRNIAISVDPEKAASALELSKDRRIEDLERQVRFLETRLMYPKKAESLSSSHEKVKVLNELRQFYPLDELLRAAEIPRSTFYYHLKALSKPDKYADVKKRISEIYHENRGRYGYRRVTLSLHREGKQINHKAVQRLMGTLSLKAAIKVKRYRSYRGEVGQTAPNVLQRDFKATRPNEKWVTDVTEFAVNGRKLYLSPVIDLFNNEVISYSLSERPVMNMVENMLDQAFKKLNPHEHPVLHSDQGWQYRMRRYQNILKEHGIKQSMSRKGNCLDNAVVECFFGTLKSECFYLDEFSNISELKDAVTEYIEYYNSRRISLKLKGLTPIEYRNQTYMPRV >CP031653|160225:198185|190387_190609_-|AXP24782.1|DBSCAN-SWA MSQKYLIRIAELESQLRQKDQQLSLVEETEAFLRSALARAEEKIEEDEREIEYLRAQARCRKNPGWAKRSAMY >CP031653|160225:198185|179276_180848_+|AXP24773.1|transposase|DBSCAN-SWA MDTSLAHENARLRALLQTQQDTIRQMAEYNRLLSQRVAAYASEINRLKALVAKLQRMQFGKSSEKLRAKTERQIQEAQERISALQEEMAETLGEQYDPVLPSALRQSSARKPLPASLPRETRVIRPEEECCPACGGELSSLGCDVSEQLELISSAFKVIETQRPKQACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVTGKYADHLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDVLRQYVLMPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSQMPPAVWFAYSPDRKGIHPQNYLAGYSGVLQADAYGGYRALYESGRITEAACMAHARRKIHDVHARAPTYITTEALQRIGELYAIEAEVRGCSAEQRLAARKARAAPLMQSLYDWIQQQMKTLSRHSDTAKAFAYLLKQWDALNVYCSNGWVEIDNNIAENALRGVAVGRKNWMFAGSDSGGEHAAVLYSLIGTCRLNNVEPEKWLRYVIEHIQDWPANRVRDLLPWKVDLSSQ >CP031653|160225:198185|194103_194895_+|AXP24789.1|DBSCAN-SWA MREAVTIEISNQLSEVLSVIERHLESTLLAVHLYGSAVDGGLKPYSDIDLLVTVAVKLDETTRRALLNDLMEASAFPGESETLRAIEVTLVVHDDIIPWRYPAKRELQFGEWQRNDILAGIFEPAMIDIDLAILLTKAREHSVALVGPAAEEFFDPVPEQDLFEALRETLKLWNSQPDWAGDERNVVLTLSRIWYSAITGKIAPKDVAADWAIKRLPAQYQPVLLEAKQAYLGQKEDHLASRADHLEEFIRFVKGEIIKSVGK >CP031653|160225:198185|182754_182871_-|AXP29063.1|DBSCAN-SWA MKIIGVFAFLAPVNPHQNQFPATLRRGQPQNSLKRSAI >CP031653|160225:198185|193198_193696_+|AXP24787.1|DBSCAN-SWA MNSESVRIYLVAAMGANRVIGNGPNIPWKIPGEQKIFRRLTEGKVVVMGRKTFESIGKPLPNRHTLVISRQANYRATGCVVVSTLSHAIALASELGNELYVAGGAEIYTLALPHAHGVFLSEVHQTFEGDAFFPMLNETEFELVSTETIQAVIPYTHSVYARRNG >CP031653|160225:198185|161146_163216_+|AXP24755.1|tRNA|DBSCAN-SWA MSEKTFLVEIGTEELPPKALRSLAESFAANFTAELDNAGLAHGTVQWFAAPRRLALKVANLAEAQPDREIEKRGPAIAQAFDAEGKPSKAAEGWARGCGITVDQAERLTTDKGEWLLYRAHVKGESTEALLPNMVATSLAKLPIPKLMRWGASDVHFVRPVHTVTLLLGDKVIPATILGIQSDRVIRGHRFMGEPEFTIDNADQYPEILRERGKVIADYEERKAKIKADAEEAARKIGGNADLSESLLEEVASLVEWPVVLTAKFEEKFLAVPAEALVYTMKGDQKYFPVYANDGKLLPNFIFVANIESKDPQQIISGNEKVVRPRLADAEFFFNTDRKKRLEDNLPRLQTVLFQQQLGTLRDKTDRIQALAGWIAEQIGADVNHATRAGLLSKCDLMTNMVFEFTDTQGVMGMHYARHDGEAEDVAVALNEQYQPRFAGDDLPSNPVACALAIADKMDTLAGIFGIGQHPKGDKDPFALRRAALGVLRIIVEKNLNLDLQTLTEEAVRLYGDKLTNANVVDDVIDFMLGRFRAWYQDEGYTVDTIQAVLARRPTRPADFDARMKAVSHFRTLEAAAALAAANKRVSNILAKSDEVLSDRVNASTLKEPEEIKLAMQVVVLRDKLEPYFAEGRYQDALVELAELREPVDAFFDKVMVMVDDKELRLNRLTMLEKLRELFLRVADISLLQ >CP031653|160225:198185|176080_176827_+|AXP24767.1|DBSCAN-SWA MTTDNTNTIRNDSLAARTDTWLQSFLVWSPGQRDIIKTVALVLMVLDHINLIFQLKQEWMFLVGRGAFPLFALVWGLNLSRHAHIRQPAINRLWGWGIIAQFAYYLAGFPWYEGNILFAFAVVAQVLTWCETRSGWRTAAAILLMALWGPLSGTSYGIAGLLMLAVSHRLYRAEDRAERLALVACLLAVIPALNLATSDAAAVAGLVMTVLTVGLVSCAGKSLPRFWPGDFFPTFYACHLAVLGVLAL >CP031653|160225:198185|182461_182719_+|AXP24776.1|DBSCAN-SWA MSQTENAVTSSSGKKRPYRRGNPVPARERQKASLARRSATHKAFHAVIQLRLKEKLSELADEDGITQAQMLEWLIESEVKRRKSL >CP031653|160225:198185|178232_178910_+|AXP24771.1|DBSCAN-SWA MSIIFNGHYRMKHRTWITEALRLHFEEHLPRVVAGRRLGVPKSTVCSMFVRFRRAGLSWPLPAGMSEQELDACLYGQFSTVPVVRPESTVISEAPVVKKRPRRPNFPYEFKIALVEQSLQPGACVAQIARENGINDNLLFNWRHQYRKGGLLPSGKNMPALLPVTLTPEPDNKIPAPAQEPEQINTPSDSLCCELVLPAGTLRLKGKLTPALLQTLIREIKGSSH >CP031653|160225:198185|188067_189471_+|AXP24780.1|DBSCAN-SWA MGKFKFPSAYTILFFLIAIVAVLTWIIPAGQYHMAMNEALGKEVPVAGTYAHVEANPQGLISVLMAPIAGLYDPDSGQARAIDVALFILIIGGFLGIVTKTGAIDAGIERVTTRLRGREEWMIPILMALFAAGGTIYGMAEESLPFYTLLVPVMLAARFDPVVAASTVLLGAGIGTLGSTINPFATVIAANAAGIPFTNGITLRVVVLVIGWIICVTWVMRYARKVRKEPSLSIIADKQEENLAHFLGNKSEQALEFTPVRKIILVIFALTFAVMIYGVAVLGWWMAEISTVFLASAIIIGLIARMSEEELTSTFINGARDLLGVALIIGIARGIVVIMDKGMITHTILHYAEGMVTGLSTVAFINVMYWLEVVLSFLVPSSSGLAVLTMPIMAPLADFANVNRDLVVTAYQSASGIVNLITPTSAVVMGGLAIAHVPYVRYLKWVAPLLGILTVVIMVALSLGALL |
45 | Stx2-converting_phage(33.33%) | transposase,integrase,tRNA | attL 191366:191425|attR 201860:202659 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1151491 : 1164674
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP031653|1151491:1164674|DBSCAN-SWA TATGCGCATATTGCTGAGTAATGATGACGGGGTACATGCACCCGGTATACAAACGCTGGCGAAAGCCTTGCGTGAGTTTGCTGACGTTCAGGTGGTCGCCCCCGATCGTAACCGCAGCGGCGCTTCAAATTCTCTGACACTGGAATCCTCCCTGCGCACGTTTACCTTTGAAAATGGTGATATTGCTGTGCAAATGGGAACCCCGACCGATTGCGTCTATCTTGGCGTGAATGCTCTGATGCGTCCGCGCCCGGACATTGTTGTGTCCGGAATTAACGCCGGGCCGAATCTGGGGGATGATGTTATTTATTCCGGTACGGTAGCCGCCGCGATGGAAGGCCGTCATTTAGGTTTTCCGGCGCTTGCCGTCTCGCTTGACGGGCATAAACATTACGACACTGCCGCGGCGGTAATCTGTTCAATTTTGCGCGCACTGTGTAAAGAGCCGCTGCGCACCGGGCGTATTCTTAATATTAACGTTCCGGATTTACCCTTGGATCAAATCAAAGGTATTCGCGTGACGCGCTGCGGTACACGACATCCGGCAGATCAGGTGATCCCGCAGCAAGATCCGCGCGGCAATACGCTGTACTGGATTGGCCCGCCGGGCGGTAAATGTGATGCTGGTCCGGGGACCGATTTTGCTGCGGTAGATGAGGGCTATGTCTCCATCACGCCGCTGCATGTGGATTTAACTGCGCATAGCGCGCAAGATGTGGTTTCAGACTGGTTAAACAGCGTGGGAGTTGGCACGCAATGGTAAGCAGACGCGTACAAGCACTTCTGGATCAATTACGTGCGCAAGGTATTCAGGATGAGCAGGTGCTGAATGCACTTGCCGCCGTGCCGCGTGAAAAATTCGTTGATGAAGCGTTTGAACAAAAAGCCTGGGACAATATCGCTTTGCCGATAGGTCAGGGGCAGACAATTTCGCAGCCATATATGGTGGCGCGAATGACCGAATTACTCGAGCTGACGCCGCAGTCGCGGGTGCTGGAAATTGGCACCGGTTCGGGATATCAAACGGCAATCCTGGCGCATCTTGTCCAGCATGTTTGCTCGGTTGAACGGATTAAAGGCTTGCAGTGGCAGGCACGTCGCCGCCTGAAAAATCTTGATTTACATAATGTTTCAACCCGTCATGGCGATGGATGGCAAGGTTGGCAGGCACGTGCGCCGTTTGACGCTATCATTGTTACGGCGGCACCGCCGGAAATTCCAACTGCGCTAATGACGCAGCTGGACGAAGGCGGGATTCTCGTCTTACCCGTAGGGGAGGAGCACCAGTATTTGAAACGGGTGCGTCGTCGGGGAGGCGAATTTATTATCGATACCGTGGAGGCCGTGCGCTTTGTCCCTTTAGTGAAGGGTGAGCTGGCTTAAAACGTGAGGAAATACCTGGATTTTTCCTGGTTATTTTGCCGCAGGTCAGCGTATCGTGAACATCTTTTCCAGTGTTCAGTAGGGTGCCTTGCACGGTAATTATGTCACTGGTTATTAACCAATTTTTCCTGGGGGATAAATGAGCGCGGGAAGCCCAAAATTCACCGTTCGCCGCATTGCGGCTTTGTCACTGGTTTCGCTATGGCTGGCAGGCTGTTCTGACACTTCAAATCCACCGGCACCGGTCAGCTCCGTTAATGGCAATGCGCCTGCAAATACTAATTCTGGTATGTTGATTACGCCGCCGCCGAAAATGGGGACGACGTCTACAGCGCAGCAACCGCAAATTCAGCCGGTGCAGCAGCCACAAATTCAGGCTACTCAACAACCGCAAATCCAGCCAGTGCAGCCAGTAGCTCAGCAGCCGGTACAGATGGAAAACGGACGCATCGTCTATAACCGTCAGTATGGGAACATTCCGAAAGGCAGTTATAGCGGCAGTACCTATACCGTGAAAAAAGGCGACACACTTTTCTATATCGCCTGGATTACTGGCAACGATTTCCGTGACCTTGCTCAGCGCAACAATATTCAGGCACCATACGCGCTGAACGTTGGTCAGACCTTGCAGGTGGGTAATGCTTCCGGTACGCCAATCACTGGCGGAAATGCCATTACCCAGGCCGACGCAGCAGAGCAAGGAGTTGTGATCAAGCCTGCACAAAATTCCACCGTTGCTGTTGCGTCGCAACCGACAATTACGTATTCTGAGTCTTCGGGTGAACAGAGTGCTAACAAAATGTTGCCGAACAACAAGCCAACTGCGACCACGGTCACAGCGCCTGTAACGGTACCAACAGCAAGCACAACCGAGCCGACTGTCAGCAGTACATCAACCAGTACGCCTATCTCCACCTGGCGCTGGCCGACTGAGGGCAAAGTGATCGAAACCTTTGGCGCTTCTGAGGGGGGCAACAAGGGGATTGATATCGCAGGCAGCAAAGGACAGGCAATTATCGCGACCGCAGATGGCCGCGTTGTTTATGCTGGTAACGCGCTGCGCGGCTACGGTAATCTGATTATCATCAAACATAATGATGATTACCTGAGTGCCTACGCCCATAACGACACAATGCTGGTCCGGGAACAACAAGAAGTTAAGGCGGGGCAAAAAATAGCGACCATGGGTAGCACCGGAACCAGTTCAACACGCTTGCATTTTGAAATTCGTTACAAGGGGAAATCCGTAAACCCGCTGCGTTATTTGCCGCAGCGATAAATCGACGGAACCAGGCTTTTGCTTGAATGTTCCGTCAAGGGATCACGGGTAGGAGCCACCTTATGAGTCAGAATACGCTGAAAGTTCATGATTTAAATGAAGATGCGGAATTTGATGAGAACGGAGTTGAGGTTTTTGACGAAAAGGCCTTAGTAGAAGAGGAACCCAGTGATAACGATTTGGCCGAAGAGGAACTGTTATCGCAGGGAGCCACACAGCGTGTGTTGGACGCGACTCAGCTTTACCTTGGTGAGATTGGTTATTCACCACTGTTAACGGCCGAAGAAGAAGTTTATTTTGCGCGTCGCGCACTGCGTGGAGATGTCGCCTCTCGCCGCCGGATGATCGAGAGTAACTTGCGTCTGGTGGTAAAAATTGCCCGCCGTTATGGCAATCGTGGTCTGGCGTTGCTGGACCTTATCGAAGAGGGCAACCTGGGGCTGATCCGCGCGGTAGAGAAGTTTGACCCGGAACGTGGTTTCCGCTTCTCAACATACGCAACCTGGTGGATTCGCCAGACGATTGAACGGGCGATTATGAACCAAACCCGTACTATTCGTTTGCCGATTCACATCGTAAAGGAGCTGAACGTTTACCTGCGAACCGCACGTGAGTTGTCCCATAAGCTGGACCATGAACCAAGTGCGGAAGAGATCGCAGAGCAACTGGATAAGCCAGTTGATGACGTCAGCCGTATGCTTCGTCTTAACGAGCGCATTACCTCGGTAGACACCCCGCTGGGTGGTGATTCCGAAAAAGCGTTGCTGGACATCCTGGCCGATGAAAAAGAGAACGGTCCGGAAGATACCACGCAAGATGACGATATGAAGCAGAGCATCGTCAAATGGCTGTTCGAGCTGAACGCCAAACAGCGTGAAGTGCTGGCACGTCGATTCGGTTTGCTGGGGTACGAAGCGGCAACACTGGAAGATGTAGGTCGTGAAATTGGCCTCACCCGTGAACGTGTTCGCCAGATTCAGGTTGAAGGCCTGCGCCGTTTGCGTGAAATCCTGCAAACGCAGGGGCTGAATATCGAAGCGCTGTTCCGCGAGTAAGTAAGCATCTGTCAGAAAGGCCAGTCTCAAGCGAGGCTGGCCTTTTCTGTGCACAATAAAAGGTCCGATGCCCATCGGACCTTTTTATTAAGGTCAAATTACCGCCCATACGCACCAGGTAATTAAGAATCCGGTAAAACCGAGAATGGTCGTTAACACTGTCCAGGTTTTCAGACCGTCTGCTACCGACAACCCCAGATATTTGGTCACAATCCAGAACCCTGAGTCATTAATATGTGACGCGCCAAGCCCACCAAAGCAGGCTGCCAGCGTCACCAATACGCACTGAATCGGATTCAATCCCATCACCGCTTCTGAGAGTAACCCGCCGGTTGTCAGGATTGCTACGGTTGCTGACCCCTGCGATGCACGCAGAGCCAGTGAAATAATAAATGCGGCTGGTAACAGAGGCAGGTCAATCATTTGTAGCATATTGGCAAGGGCTTTGCCGACGCCCGATTCCACCAGCACTTTGCCAAATACCCCTCCAGCACCAGTAACCAAAATCACTACCGCCGCAGTGGGAAGGGCTGAGCCCATAATGTCGCTGGTGTGTTGTAAGCTCCAGCCGCGACGTAAAGCCAATAACCAGAATGCCAGCACCAGCGCAATCATTAGAGCTACCATTGGTGAGCCGATCAGCTGTAGCGTACCAAGCAGGGGATGCGAAGGCGGCATCAGTGTTGCGGAAACCGTACCCGCCATGATAATCGCGATAGGAATAACAATTAGCGAGGTGACCAGCGCGACGCCCGGTGGATTTATTTTATCGCTTAATTTTGTCGCGCCTTCCTCACTGGCCGGAGCCAGTTGCATCTGTTCCAGTACTTCCACCGACATCGCATATTGGTGCTTATTGATTATTTTCGCTGCAAAGTAGCCAACAATCCCTACGGGAATAGAAATCGCAATACCGATGATGGTTAGCCAGCCGATGTCTGCGTGGAGTAACCCCGCTGCGGCGACAGGGCCTGGATGCGGCGGTACCGCCACATGTACAGTTAGCATGATCCCAGCGACAGGCAGGCCAAATTTGAGTGGCGATATTTTGGCAACCTTGGCAAAACCGTAAATGATTGGCGCAAGAATAATAAAGCCGACATCAAAGAAGACGGGAATACCGAGGAAGAACGCTGCCAGAGTCAGCGCAGCGATAGTTCGTTTGTCACCTAACTTGCGACTGAAATAATTAGCCAGTGACTCTGCACCACCAGAGTGTTCGATCATACGCCCCAGCATAGCGCCCAGACCAATAATAATAGTGACGGAACCAAGCACACCGCCCATCCCGGCGATCATCACTTTACCCACTTCGCCCGCCGGTATACCCGCCGCAAGTGCGACTAACAGGCTGACGAGGAGCAGAGCAACGAATGGTTGTACCTTTGCCTTGATGACCAGCAGCAACAGCATGATTACGCCAGTTAACGCAATGCATAACAATGTAATTGTGGACATGGGAAACCCTGTCTGAAAGTTATAGTTAACCTACCCCATCCGTAGATGGGGGGATGTATGGGTACGTTGTAATTAGGGATTTAACGAATTAGCGCCAGGCGTCAAACCAGCCAAGCCCTTCTTCGGTGAGGCCACGAGGTTTATATTCACAACCGATCCAGCCCTGATATCCCACCTCATCGAACAGGCGGAACAGCCACGGATAGTTGATTTCTCCATCGTCCGGTTCATGTCGATCAGGTAGTCCGGCAATTTGTACGTGCGCATATTTCCCGGCGTAGTCGCGGATTAAATGCGTCAGGTTGCCATCTACTTTTTGCGCATGAAAAGTATCTAGTTGAATAAACACGTTATCTCGCGCAACCTCTTCAACAATAGCCAGTGCCTGATACTGGCTGGAGAAGAGATAATGAGGCTTAACGTCGGGGCTGAGTGCTTCAACTAATATTCGCTTGCCGTGTAGCGCAAAGCGGTCGGCAGCGTAGCGGAGATTATCGATAAATACTGCCCGGTACCGTTCAGCGTCTTCGCCAGCGGGCACGACGCCTGCCATCACATGGACTTGTTCACAATTGAGCGCCAATGCATATTCCAGTGCCAGGTCGATGTCTGCGTGTGCTTCGTGCTCACGTCCGGGAAGGGCGGATAATCCCCATTCCCCCGCATTAATATCTCCGGGAGCGGTATTGAACAGCGCCAGTGTCAGATGGTTTTGCTCCAGTTGCTTTTGGATTTGCAGGGTGGAGTAGTCATAGGGAAACAGAAATTCCACAGCATCGAACCCGGCTTTTCGCGCTGCGGCGAAGCGTTCAATAAAAGGCACTTCGGTGAACATCATGGATAAATTAGCTGCAAAACGAGGCATTGCATTAACTCCTTAATTCCGCAATTTCACCTGCGGTCAGATAACGGATCGGGCGGTCACCGAGAATAAAAATCAGCTTTGCCGTTTCCTCCAGCTCTTCCATATTGTTGGCGGCTTCTTGCAGGCTTTCACCGCAAACCACTGGGCCATGATTTGCCAGTAAAAAAGCCTGATTGTCTGCTGCCAGTTCCGCCAGATCCTGTGCGATGCGTTTATCGCCCGGTCGGTAATAAGGCACCAGCGGGACATTTCCCATCCGCATCACCACGTATGGTGTGAACGGACGAATAACGTTGCTGCTGTCCAGCCCTTGCAGGCAGGAAAGCGCCGTCGACCATGTGCTGTGCAAATGCACCACCGCTTTACAGCGCGGATTGTTGCGATACAGCGCCAGATGAAAGAGCACCTCTTTCGAGGGTTTGTCACCACTTAACCATTCGCCATCCGCGGCGACTTTGGAAAGCCGCTGCGGATCGAGATTGCCCAGGCATGAACCTGTCGGTGTCGCCAGTAAATTCCCGTCAGGTAAAAGCAGCGACAGATTGCCAGCCGAACCGGTTGCATAGCCGCGCTGAAAGAATGAACTGGCAATCCGCGTCATCTCCTCTCGCAAAGACTGCTCTACTTTTGCGAAATCGCTCATGATAAAAACTCTCTTTGGGCTCGTGAAAAAAAGGCTTCATCACCGAAGTTGCCAGATTTAAGGGCGAGTGAGACAGGCTTATCCAGTGCGTTTACCCACGGCACGCCGGGGGAAATGGTTGGGCCAATATGAAACCCTTTTATGCCCAGGCTCTGTGTGACTACGCCGGAGGTCTCACCGCCTGCGACAATAAAGCGTGTCACGCCTTCCGCTGCTAACCGCGCCGCTAGTTGAGAAAACAGAGTTTCTACTGCCTGACTGGCTTTTTGTGCACCGTATTGCTGTTGAATTGCTGCCAATGCGTCAGTGCTGGCGGTGGCAAAAACCAGTGGAGCAAGTACACTTTCCTGGCCCAGAACCCACTCTGCCAGTTCGTGTGCATAAGCGGCCAGAGTTTCGGTTGAGAGGCAGCGTGCCACATCAACTTCACGGGCTGGTGCAATTTGACGGTAATGTGCCACCTGGCGGTTGGTCATTTGAGAGCATGAACCGGAGAGCACTACGCCGCGCCCAGCGAGCGGATGCCCTGCTTCGCGAGCCTGGTTACCGTTTTCTTGCGCCCACTGCCGGGCCAGGCCAATCGCCAGACCAGAACCGCCCGTTACCAGTGGGGCATCGCGCAAGGCTTCTCCCTGAATTTCCAGATGGTGTTCGGTCAGCGCATCAAGCACCGCGTAGCGGTAGCCCTCTTGCTGTAAGCGAGCCAGCTCTTGACGAACGGCATCCACACCTTGTTCGAAAACATGTGCCGAAACGACGCCGCAGCGCCCTGTGGATTGCGCTTCAACCAGACGGGGAAGATAGCTGTCGGTCATGGGATTTACCGGGTGATGGCGCATCCCGGATTCGGCCAGCAGTTGATTCATTACGAACAAATACCCCTGATAAACCGTACGTCCGTTGACCGGCAGGGCCGGAGAGAAGACCGTAAACGGCGTGTCGAGAGCATCCATTAATGCATCGGTAACCGGGCCGATATTACCTTTCGCCGTACTGTCGAAAGTAGAGCAGTATTTGAAATAGATCTGTTTGCAACCTTGCTGTTGCAACCAGCTCAGAGCCGCCAGCGATTGCTGTGTGGCTTCAACCACCGGACAGGAGCGCGTTTTCAGGCTGATCACCAGTGCGTCGATTGCTTCCGGCATTTTACCTGTTGGAACACCGTTAATTTGTACCGTTGGTAGACCGTTTTCCACCAGAAAACTGGCGATATCCGTCGCGCCGGTAAAATCATCGGCGATAACGCCAATCTTGATCATGATTTCGCTCCCGGTAGAGTGATGCCAGAGAAAATCTTGATAACTGCGCTATCGTCTTCTTTCCCGTAACCTGCGTTACTGGCGCTGGTGAACATATTCAATGCTGTTGAGGCCAATGGCAGCGGGAAGTGCAGGGCTTTGGCTGTATCGGCAACCAGACCAAGATCCTTAACAAAAATATCGACGGCTGAATGCGGGGTGTAATCGCCATCCACCACATGACGCATCCGGTTTTCGAACATCCAGGAATTTCCGGCGGCATTGGTCACGACGTCATACATCACATCCAGCGGGATCCCCGCACGGGCTGCAAGTGCCATCGCTTCGGCTCCGGCAGCAATATGTACGCCCGCTAACAACTGGTGAATAATTTTTACGGTCGAACCTAGTCCCGGTTCTGCACCTATGCGATAAACTTTTCCGGCAACGGCTTCCAGCACGGGTGCCAGTCGTTCAAAGGCAATATCGCTACCGGAGGCCATGACAGTCATTTCACCGTTAGCGGCTTTTACTGCACCACCAGAAACTGGCGCATCCAGCATTTCCAGATCGAATCCAGCCAGAGCGGTAGCAATTTCTTGCGCATCAGCACTAGCGATAGTGGAAGAAACCATTACTGCCGTACCGGGTTTCAGATGTTGTGCAACGCCTGTTTCACCAAACAGCACCTGTTTAACCTGGGCCGCATTGACCACCAGCACCAGCAGAGCGTCCAGTTTTTCGGCAAACGTCGCGGCGTTATCAGAAACCCCGCAAGCACCTGCCTCTTTCAACGTAGCGCAGGCATTGCTGTTCAGGTCTGCGCCCCAGGTAGAAAGACCTGCGCGGACACATGACAGTGCTGCTCCCATTCCCATTGACCCTAAGCCAACGATACCGACATGAAACTCAGATCCCGTTTTCATCTGCTCTCCTTGTTAATTTAAGTGATATTTTGTTTGATATTGTGAATATAAGCGCTGGAAGATAACGATATGGTGAGCTGATTCACATAAATTAACATTGTGTGTTATTTTATGTGAACTAAGCGTTAGTTGCGCCGCGCACGTTTCGCAGGCAAATAGCGTAGAATGTCAGCAGGACAAAGGAAGGAGCAAAAGTTGATACCCGTAGAGCGTCGCCAAATCATCCTTGAGATGGTAGCTGAAAAAGGCATTGTCAGTATTGCTGAACTAACGGACAGAATGAATGTGTCACATATGACCATTCGTCGGGATTTACAAAAACTGGAGCAGCAGGGAGCCGTTGTGCTGGTGTCCGGAGGCGTCCAGTCTCCGGGACGCGTGGCGCATGAACCTTCTCATCAGGTAAAAACTGCGCTGGCAATGACGCAAAAAGCGGCTATTGGCAAGCTGGCGGCAAGTCTTGTTCAGCCGGGAAGCTGTATCTATCTGGATGCGGGAACGACCACGTTAGCGATAGCACAGCATCTGATTCACATGGAGTCACTGACTGTGGTCACAAACGATTTCGTTATTGCGAACTACTTGCTCGACAACAGTAATTGCACAATTATTCACACTGGCGGTGCAGTGTGTCGGGAAAACCGTTCCTGTGTCGGGGAAGCCGCTGCGACCATGCTGCGCAGCCTGATGATTGATCAGGCTTTTATTTCTGCATCGTCATGGAGTGTGCGGGGGATTTCTACGCCAGCGGAAGATAAAGTCACGGTGAAACGGGCGATTGCCAGTGCCAGCCGCCAGCGAGTTTTGGTCTGTGATGCGACGAAATATGGTCAGGTGGCGACATGGCTGGCGTTACCATTAAGCGAGTTTGAGCAGATTATCACAGACGACGGTTTGCCGGAGAGTGCCAGTCGCGCGCTGGCGAAGCAGGATCTCTCTTTGCTGGTAGCGAAAAATGAATAATGGCCTGCAATAACATTTGGTTACTCATGCTTCACAGAAGAAGCATGAGACTACTTTATTTTATAAAATGACAGCCGCCCGCTTTTCGGCGTGCCGGTATCAATATAAATCTGGTTAGCGAACGTCTGAATGTTATCAAACATCATATGTCCAAATATAAAATAATCAGCGCCGTTTATTTGTTGTAACTCGCCATTAAGCGATTTCTGCACACGATCAACAGGCCAGAGTAATTCGCTCTCCGCTATTTCTTTACCAAAGAGATATTCACTCCCCGGATAATCTGCATGTGCGATGACATATTTTATGTTGTCGTTAGTGATTTCAATAATATGTGGAAGGTGATGGAATTTCAGCAACAGATCTATTGCCTCTTGTTGCTCTGAATCATTTAAATCGAAAAACCAGTCACCACCGCTGGCAAGCCACATATTGCCATCGCCAGTTTCGAATGCATCCAACGCCATCGCTTCGTGGTTGCCTTTAACCGACGTAAACCAGGGTTGGTTTAGCAGGCGCAGCACGTTAAGACTCTTCGGCCCACGATCAATATTATCGCCTGTGGAAATAAGTAAGTCGGTTTCAGGGTAAAAAGAGAGTTGATGTAAACGGGATTGTAATAACTGATATTCACCATGAATATCACCAACGACCCATATATGGCGATAGTGATGGGCATTGATTTTTTGATAGCGTGTAGATGGCATGGTTTTACCCTGTAAAATAAGCTTTCCTATTATACAGGGTATTTTTATTTGATTCGTCAGTTGTCGTTAATATTCCCGATAGCAAAAGACTATCGGGAATTGTTATTACACCAGACTCTTCAAGCGATAAATCCACTCCAGCGCCTGACGCGGAGTCAGTGAATCCGGGTCGAGATTTTCCAGTGCTTCGACCGCAGGCGAAGTTTCTTCTGGCACTGACAGCAAAGACATCTGCGTACCATCCACTTGCGTCGCGGCGGCGTTCGGCGAAATGCTTTCCAGCTCACGCAGTTTTTGCCGTGCGCGCTTAATCACCTCTTTTGGCACGCCCGCCAGAGCTGCAACCGCCAGGCCGTAGCTTTTGCTCGCCGCGCCATCCTGCACGCTATGCATAAAGGCAATGGTGTCGCCGTGCTCCAGCGCATCGAGATGCACGTTGGCGACGCCTTCCATTTTCTCCGGTAACTGGGTCAGCTCGAAATAGTGGGTAGCAAATAACGTCAATGCCTTAATCTTATTCGCCAGATTTTCCGCGCACGCCCACGCCAGCGACAGACCATCGTAGGTGGACGTTCCACGCCCGATCTCATCCATTAACACCAGACTGTATTCGGTGGCGTTATGTAAAATATTGGCGGTTTCAGTCATCTCCACCATAAAGGTTGAGCGCCCGGAAGCCAGGTCATCTGCCGCGCCTACGCGGGTAAAGATGCGATCGATAGGTCCAATCTCGACTTTTTGTGCCGGTACATAGCTGCCGATGTAGGCCATCAGCGCAATCAGTGCGGTCTGGCGCATATAGGTACTTTTACCGCCCATGTTCGGGCCGGTAATAATCAACATACGGCGCTGCGGCGACAGATTCAGCGGGTTAGCGATAAATGGCTCATTCAGTACTTGTTCAACTACCGGATGGCGACCTTCGGTAATGCGAATGCCCGGTTTATCAATGAAGGTCGGGCAGGTGTAGTTCAGGGTATAGGCCCGTTCCGCCAGGTTAACCAGCACGTCGAGTTCCGCCAGCGCGCTCGCGCTCTGTTGCAACGCTTCCAGATGCGGCAACAGCAGGTCGAACAGCTCTTCATAAAGCTGTTTTTCCAGTGCCAGTGCTTTGCCTTTTGAGGTGAGAACTTTATCTTCGTACTCTTTTAGCTCTGGAATGATGTAGCGCTCGGCGTTTTTCAGCGTCTGGCGACGCATGTAGTTGATGGGTGCCAGATGGCTTTGCCCACGGCTGATTTGAATGTAGTAGCCGTGCACCGCATTAAAGCCAACTTTCAGCGTGTCCAGGCCGGTACGTTCACGCTCGCGGACTTCCAGACGCTCCAGATAATCGGTCGCGCCGTCAGCCAGCGCGCGCCACTCATCCAGCTCTTCGTTATAGCCCGATGCGATAACACCACCGTCGCGTACCAGCACCGGCGGTGTGTCGATGATTGCTCGCTCCAGCAGATCGCGCAGCTCGGCAAACTCGCCCATCTTCTCACGTAGCGCCTGTACCGGTGCACTATCGACAGTTTCTAACTGCGCACGCAGCTCCGGCAGTTGCTGGAAAGCGTGGCGCATACGGGCCAGATCGCGTGGGCGAGCAGTTCGTAAAGCCAGACGTGCCAGAATACGTTCCAGGTCGCCGACCTGACGCAGTACCGGCTGCAACTCGGCGGTGAAATCCTGCAATGCGCCAATAGTTTGCTGGCGCTCAAGCAACACGCGGGTATCGCGCACTGGCATATGCAGCCAGCGTTTCAGCATACGACTACCCATCGGCGTGACGGTGCAGTCGAGCACAGAAGCCAGCGTATTTTCCGCACCGCCCGCCAGGTTCTGAGTAATTTCCAGGTTACGACGTGTCGCGGCATCCATAATGATGCTGTCCTGCTCACGTTCCATGGTGATAGAACGAATATGCGGCAGGGTCGTGCGTTGGGTATCTTTCGCATACTGCAACAGACAACCGGCAGCACAAAGTCCGCGCGGCGCGTTCTCGACGCCAAAACCGACCAGATCGCGGGTGCCAAATTGCAGATTCAACTGCTGGCGCGCGGTGTCGATTTCAAACTCCCACAGCGGGCGACGGCGCAGGCCGCGACGGCCTTCAATCAGCGACATCTCGGCGAAATCTTCTGCATACAGCAGTTCCGCCGGATTAGTGCGTTGCAGCTCTGCCGCCATCGTTTCGCGGTCGGCCGGTTCGCTCAGACGAAAACGCCCGGAGCTGATATCCAGCGTCGCGTAGCCGAAACCTTTGCTGTCCTGCCAGATAGCCGCCAGCAGGTTGTCCTGACGCTCCTGTAACAGGGCTTCATCGCTGATGGTGCCTGGCGTAACGATACGCACAACTTTGCGCTCAACCGGACCTTTGCTGGTCGCCGGATCGCCAATTTGTTCGCAGATGGCAACGGACTCTCCCTGATTCACCAGTTTGGCGAGATAGTTTTCCACCGCATGGTAGGGAATCCCCGCCATCGGGATCGGCTCTCCCGCCGAAGCACCGCGTTTGGTCAGTGAAATATCCAGCAGTTGCGACGCGCGTTTTGCGTCGTCATAAAACAGTTCATAAAAATCACCCATCCGGTAAAACAGCAGGATCTCGGGATGCTGGGCTTTCAGCCTGAGATACTGCTGCATCATGGGCGTATGGGCGTCGAAATTTTCTATTGCACTCAT
Protein sequences of DBSCAN-SWA_2 >CP031653|1151491:1164674|1158169_1159432_-|AXP25697.1|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQAREAGHPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSTETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPGVPWVNALDKPVSLALKSGNFGDEAFFSRAQREFLS >CP031653|1151491:1164674|1154214_1155207_+|AXP25693.1|DBSCAN-SWA MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFARRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMNQTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILADEKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRERVRQIQVEGLRRLREILQTQGLNIEALFRE >CP031653|1151491:1164674|1157534_1158173_-|AXP25696.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS >CP031653|1151491:1164674|1162112_1164674_-|AXP25701.1|DBSCAN-SWA MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV >CP031653|1151491:1164674|1151491_1152253_+|AXP25690.1|DBSCAN-SWA MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW >CP031653|1151491:1164674|1155300_1156665_-|AXP25694.1|DBSCAN-SWA MSTITLLCIALTGVIMLLLLVIKAKVQPFVALLLVSLLVALAAGIPAGEVGKVMIAGMGGVLGSVTIIIGLGAMLGRMIEHSGGAESLANYFSRKLGDKRTIAALTLAAFFLGIPVFFDVGFIILAPIIYGFAKVAKISPLKFGLPVAGIMLTVHVAVPPHPGPVAAAGLLHADIGWLTIIGIAISIPVGIVGYFAAKIINKHQYAMSVEVLEQMQLAPASEEGATKLSDKINPPGVALVTSLIVIPIAIIMAGTVSATLMPPSHPLLGTLQLIGSPMVALMIALVLAFWLLALRRGWSLQHTSDIMGSALPTAAVVILVTGAGGVFGKVLVESGVGKALANMLQMIDLPLLPAAFIISLALRASQGSATVAILTTGGLLSEAVMGLNPIQCVLVTLAACFGGLGASHINDSGFWIVTKYLGLSVADGLKTWTVLTTILGFTGFLITWCVWAVI >CP031653|1151491:1164674|1152246_1152873_+|AXP25691.1|DBSCAN-SWA MVSRRVQALLDQLRAQGIQDEQVLNALAAVPREKFVDEAFEQKAWDNIALPIGQGQTISQPYMVARMTELLELTPQSRVLEIGTGSGYQTAILAHLVQHVCSVERIKGLQWQARRRLKNLDLHNVSTRHGDGWQGWQARAPFDAIIVTAAPPEIPTALMTQLDEGGILVLPVGEEHQYLKRVRRRGGEFIIDTVEAVRFVPLVKGELA >CP031653|1151491:1164674|1160532_1161300_+|AXP25699.1|DBSCAN-SWA MIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGSCIYLDAGTTTLAIAQHLIHMESLTVVTNDFVIANYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSVRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFEQIITDDGLPESASRALAKQDLSLLVAKNE >CP031653|1151491:1164674|1156753_1157530_-|AXP25695.1|DBSCAN-SWA MPRFAANLSMMFTEVPFIERFAAARKAGFDAVEFLFPYDYSTLQIQKQLEQNHLTLALFNTAPGDINAGEWGLSALPGREHEAHADIDLALEYALALNCEQVHVMAGVVPAGEDAERYRAVFIDNLRYAADRFALHGKRILVEALSPDVKPHYLFSSQYQALAIVEEVARDNVFIQLDTFHAQKVDGNLTHLIRDYAGKYAHVQIAGLPDRHEPDDGEINYPWLFRLFDEVGYQGWIGCEYKPRGLTEEGLGWFDAWR >CP031653|1151491:1164674|1161350_1162007_-|AXP25700.1|DBSCAN-SWA MPSTRYQKINAHHYRHIWVVGDIHGEYQLLQSRLHQLSFYPETDLLISTGDNIDRGPKSLNVLRLLNQPWFTSVKGNHEAMALDAFETGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDNIKYVIAHADYPGSEYLFGKEIAESELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGTPKSGRLSFYKIK >CP031653|1151491:1164674|1153012_1154152_+|AXP25692.1|DBSCAN-SWA MSAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMGTTSTAQQPQIQPVQQPQIQATQQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSGSTYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVGQTLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPTATTVTAPVTVPTASTTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRVVYAGNALRGYGNLIIIKHNDDYLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLHFEIRYKGKSVNPLRYLPQR >CP031653|1151491:1164674|1159428_1160337_-|AXP25698.1|DBSCAN-SWA MKTGSEFHVGIVGLGSMGMGAALSCVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNAAQVKQVLFGETGVAQHLKPGTAVMVSSTIASADAQEIATALAGFDLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGAEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS |
12 | Escherichia_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1253604 : 1275611
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP031653|1253604:1275611|DBSCAN-SWA TTTATGTCTGGATGTGGTCGTCAACACATCCTCTTTTAAATACAGGCGCTTTTACCTTAGTTGGATCATCTGTTTTAAAGATTTTCAATCGTTCACTGAAGAACATTACCTCAGATCCTTTGTACCGATCTTGAGCACTGTAGTTTATTCCATATACTAATGATGATGTCTTATACATAGCTTGGATCTCAGGAGTGTTATCGTAAGAAACAATCCAAGGTGTTTTAATATTATTCTGTACGACTTTTGCAACCCTAACATGATCATCATGATCATAATGGTTGATATATAAACCTTTACCTTTTATGTAATAAGGTGGATCCAAATATGTTAATGAGTTTTGTGGCAATTGAATAACTATTTTTTTAATGAAATCAATTGCATCCATGTTATATAAATCTATTCTGTGACGATTTTCTGATATTTTATGGATTCGAGAAATCAAATCACTCTTATTATAGCGAGCGTCAAGCTTCCATTTCCCTTCTTGATTTTTACCGCCAATTACTCCACCTTTTAATATTCCAGAACGATTTGTTCTATTGAGAAAAAAGGTTGAGAAACCAATTGTTAAAAGATCATGGTCTTTAGGATTATTAATTATATCTTTCTGCCTAAACCATTCATCCATTGTAACTTCTGTTCTTTCAATTAAAGAACATAATTGATCTGCATGATTTAATACACTGTGCCAAAAAGCATAAACAGAAATATTAATATCATTAAGGATGATTTTTTCAGCTGCATTTAGATGTAACAGTTTCAATGCTAAACCAGCACCCCCAGCATATGGTTCGGCATAATGTATAGGCGAAAGATTATTTTCTTCAATAATCCGAAGCATGAAATTTGCAAGCTTGCCTTTACCGCCAGGATAGCGAAGTGGGGTGTTAAAACGCATATGATACCTCTAAATGTCAGATCTAAAGTATATCAAAAAAAGGGCTTTATGGCCATCACATAAAACCCTTTTTAATCAATTAGTTATCTGACAGATATGATGAGATGGTAGCAGAATCGACACCATGTCCACTTATCATAATAACTTTAAGACTCTTTATTAGTTCATTCTTGAAGCTATCCGATTGAACAGGATTTTTCTCGACCCAATATCGATATGGGTTATCTTTAACTCTTCCTTTGACTTGGGCCTGAAATTGAGTCGTATGTGCAAAATCTTTAAACAGTTTTCTAACTCTCCCTCCATGGTTTTTGTTAACTTTTTTATAATTATCTATAAGGATTTTTAAATCAATCGGAGCGTTACCAATCTTTAGCGTTGCAATTATGTCTTTAGCTGTTTTCATAAAAACAGCTTTTGTAAATTTATTTTTATTTTCCCAATATGCATCATCAGGTGGTAAATTATATAAAAACTCAAAAATCATTTGGTCTGGAGGTAAGGTGCTTGGAAGCAAACATAAGTTTTTTTCTTTTTTGGCTTTTTTTGCATTAGCACTGTTGTCGTGAACAACATCACCGTCTAAGACGATCAAGCTTTTAGCTGTAAATTCTGGTATTTTTCTTGCCATTAAGTCAAGCATGGCAGAGCAGCTAATATTTATATTCCCTAATGGATTTAATATTTTATTTATTTTTCGATCTGTTATCAATTGTTTGAAAAAGTCGAAACCTTCTTTGTCTTCAAAATATACATTTGCCTTTGGAAGACAAATGTCATCATTAATTTTTACAGTTTCAACATGTAAATCAGCGTGGATATCTGTCCATGAAAGATTATTCTTTGTTTTAATATCTCCATATGTGTCTGTAAGATATATTGTTTTAAAGCTATCACTATCTTGTTTTGAACGATTATAGATATCCTCTATTATTAATGGAGAGTGCGAGGTCATGATTATTTGCAGATCGTATTGTTTTGCTGCTTTAGTTAGAATATTAATTAGCTCTAACTGTGCCGCAGGGAAAAGACCTGCATCGGCTTCATCAATTAACAATATTCCGCCATGATAATCAGAATAAGTTTCCTTCAGTCTTTTAAATGAAAAAATTGCCTGGATTAATTGTCCAACATTATCTTCACCAACAGATACTGATTGGTGATCATAATTGTCACCATGGACAACCATTGAATCGATAGTACCTTTTGTTGCTGTTACTGAGCTTCCATTGTTTTTTAGTAAGAGTTGGTTACTCATCATTCTGATTTCATCAGAATTTTCATTGATGTACTGAACATCCCGTGTTGAGTAATCAGTCCTTAATGTAATAGGAAGAAGTCGTGCTAAGCTTAAAAAAATAACAGGATGAGTAACGTTTCTACTTTGGTTTTTTTCCGGTATTGAATCATTTCCCCTGACTACAGGCCTAGATTTATCTCTGTCACTGTAGCTGTATAAACCTAATGTTAGTTTTTCAAGATGTTTGTTCGATGCACCATCATAGACACTAATTTTGACGTCCATTGAACCAGGAACATCAAATTGTTCTGAAAGCCTAAAATGTTCGCTGAAAGCTGACTTGAAGCTGCCATTTGTTAATGTTTTATATTGTGTTAAATCAGTCTCAGGGTTTTTGGTAAAGTCCTTTGTAAAGCTAAATATTTGGGCGATAATGCCAAGAATTGTGGATTTAGATGTGCCATTTTTACCACAAATAACAGTTAGGCGAGAACCAAACTCGATGTTTATATTTTTCAGTCCGCGAAACTTCACAACATTTATATTTTTTAGTTTGGTAATTTGATTTGCCATGTTTCTATCTTGCTCCGTAAAATGGCCACAAATTGTGGCCAATAGTTTCATGCTTGGATTATATTGTTATCTTCTTCAAGCTTGTTTGTCCATCATATTCCTTCAGATATGAGCCATAGTGCCTGAAAAGCATCTCCGGCCCTTTATGCCCCATTTGAGCTGCAAGCCAAAACAGGTTTGCTCCCCGGCTGATATGGCGGGTGGCGAATGTATGCCTGGTTTGATATGGATTTCTGTAACGAATACCTGCTTTTCGCAATGTTGGCACCCATGCTTTTTTCCTGATTGCATCAGCACTTGCCCAGGGCTTATTGGTCTTTGGATCTTCAAAGACAGTAGCATCCTTCATGAATGTAAATGGCTTCTGATTTATCAGCGCCAACATTGCATCTTCTGTCAGTTCAACTTTACGAGTACCGGCTTTTGTCTTTGTTCCTTTGATAACACCGACAACACTTGCGCTCTGGACATGGGCAGTTTTTCCAACAAAGTCGATATCACGCCATCGAAGGGCACATAATTCAGAACTACGCAGGCCTGTATGTATAGCGAACCGGAACAGATTCTCCCATTGTTTGTTTCCGGCTGCTGTTAGTAATGCATCAACTTCTGCTGGTGATAGCGGATCAACCACGTAGCTGCTTTCTGCTTCTGACTTATCACTTTGGTAGCGCGAAGCAGTTACCAACGATACGGGGTTAATTTGAAGTACCCCATCGGTTACGGCTTCATCAAGTGCTGACCGCAGGAAAGATAACTGGTTGCGAATTGTTTTTAAGGTCGTTTTCTGGCTTTGAATCCACGCTTTCAGGATTGCTGGTGTTAATTCACTTGCAGGGCAAATGTGGAGTGAGGCTAACGCACTACGGCATTTTTTATAACCACCAATCGTAGAGGGTGAAAGTTTTCTCGTTTCGCAGATTTCAAGGTATTCGTCCAGGTACATTTTTACCGTTTTGCCTGCAGCAGCATTACCAAAAATTTTCAAACGAGCAGAACGGGGAAAATATTCCGCATAAATAAATGTTCCCCTTTCGATCTTATTATGAATTTCGCCGAGTGTGCGCTCGGCGTATTTAATGTTCTTTGGTGTTACTTCCAGATTGGAAAGAGGCTCACGACATTTAACTCCTTTGTAGGTGAAAGTTATATTGATCGTTTCGCCCTGGCGGTGTTTCCTGATTGTTACGCCGCGCGGTAGTTTGAGCAGTTTTGTCTGGCCCATTTTGCAACCTCACTAAGATCAATCCACCTCTCCTTAACGCCTTCAACCTTTAAAACCTGAACACCTTCACGCCAAACACCGCGCTGTACACGTTTGTTTATTGCTTCAGGAGTTTCGCCAGTTTCTTTGCAATAAGTTGAGATTGGAACGCAATCGAGGTTCAGCATATGTTTCTCCACTTAGCCCGCTGCACACGGGCAGTAATATCAAATTCCAGTCCTGATAATTAATTTTGTTCCCTGGTTGCTACCTGTTTTATTGGCCTGATGCTGTCCAGTAGCAGACGGCGACGCATGTTTGGTGCACCCCAACGGTAACCAGTCTTTTTGTCGTAGGATTCACAACGTCCGGCAACCCAGGACGTTTCAGTGGAATGTAATTTCATCCGCTTTTCACCGTCTCGGGTGATAACAATTCCTGTATGAGTTTTTATCACGCTCATTTCTTAGTCTCTGGTGCTTTCGGCATTACTGCCCAGTGAGTGATATTGACGTTTTCAAGGTCCCCGACCTGAAATGTCCACTGCCATTCTCCGGTTTCTTTTTGTTCCCAGGTGTACCAGAGAGAACGCCAGCCAATCAGCCAGCCTTCTCCGTTAGCATCAAATAACAGAACACTTTCATTTGCTGGTGGCAGTTCAGCTGACACTGGTATTATTTTGTTTTCCAGTGCCGCACATTTAGCTTCAAGCGCGTCGAATTTACGTACCAGGTACTCAGCATTTGTTTCGTTCACTTTCAGATCTCGCGGTACACATTTCCCGCGAAGAAACCCTTCCATTTCGAAAACATTCATGCGCATTTGCGTAACTCCGATAACTCGTTAAAACGTTCCATAAACATCCCGTAGGCGTGGCCCGGTGCCAGTGGAATCACGTTGAACATCTCTGTTGCCGGGATGCCTTCCAGTACAGGCCAGAAAGAGCCATCATCAAGCCCGAGATCGCGGCGTTCGGTTGCCAGCATGATGAGATCGGCATATTTCACGGGCGTACTCATAACTGGGGGTAACCCGTATTTCTCACGGATTACGGCGTCTATTTTTTCTTCCATCTGTTTATAGTCAGGAAGAAGGCGTTTCAGTGGAGCGGGGATGTCCTGGCAATACGCTTCTGTTGCATCATGCATTAACGCTTCAAAAGCAAATTCCTGCGGTACCAGTTGGCTGCAAAGAACCGCATGTTGGGCGACGCTGTAGAAGTGTGAAAGATGTCCTGCAAAGCGACAGATATTTGAAAGGGAAACCGCGATATCGTTAATAACGATGTCGGCTTTATTTATCCTGTCATAATAAAAATGCTTCCCGGAAAAAGTTTTAATAAATGACATTTTGTTCTCCACGTTATTTGCGCTGCACCGCACTGAATTCTGGTAAAAGGAAGCCCTCACCATCCGGCGATTATTGAGTTAATTACGTTTCCATAAATGCCCCCGCAGGGGCATTTTCAGTAATGAAATCAGGCGGTGAAAGTACCAATAAAGGTTTCTACTTTGCTGTCTTTGAATTTCTCAACAAGCAGATCGCGAAATTCATTAGCCATTTCTTCCTGCACCGCCTCCAGCTGAATAATGCGCAGAACAAGTACAGGACGATCGCCAGTGATAATGCTGAGGCGTAATTTAAACGAACGTTCTTTCAGACCTTCAAACGGAACGCATTTAAATTCAAATGCCACTGGCATAATGTCTTTGGTCTTCGCTTCGACAGACTCCATCAGGGAGCGTTTGCCGCTGAAGTCATTATCTTCAAAATCAGCGGTCTGGTTTGCTTCAATCGTGATTTTACGGACAGCCGCAGCCGCTTTTGTTGCCTGGATAGCGTCACCATTAGCATCAAAACCCACAAGGTAGTCGGCCCAGTCTTCAATCCATTCTGCCAGTGACTTCTGGGAGTTACGCTCGCTGTTAACAGACAACAGTGCAGAGAACGGTGCTGTCTTTTTCAGTTTGAGAGTGGCGGTGTTATCTGCGTGACCTGGTTCATCAATAGTACCCAGGTTAAGCACACTGACGGCACGCATATTATCAGCATCGATAAAGCAGCGGGTACCTTCATCTGCAAGATCTTTAGAATAACGGGTAAAGTCATCGATGCTGGCAGTGGAAAGCGCACCACGGAAACGGAAGCGATTTAAATTAAATTTTTCCAGATCATGAATGCGGAAATTCTCAGGCAATGCCACAGCATCGGCACCAATCTTACTGATAATTTCATTAACACCCTGAGCAGAAATAAGGGCATGGATTTGATTAATTGCGGTTGCGTCTAAGTTCTGAGACATAATAAGTCCTCACTATATTAAGATATTCAGTGATGAGATAAATAATCAGTTAATTAAGAACGATATTAATGACCTGCTGCGCGTAGTTTTCCGTCAGGTTCACCGGCAAGAGTCAGTAATTGTCCCTGGTCTTCCTGCAGAATAGTCAGGCGACCACCGCGATTGACATACATCGGTGTTTCGGTGGTGTCTTCTTCGGAAATTTTCCCGCGGTTAGTCGGGCGAACATATGAGAGTTTGTGTTTGATTTTCACTCGGTTCTCATCAAATGGTTCGATTTCCAGGTTGAGCGAGACCTTACCTTTGGTTTTCGTGTTCATCACACCGGAAGCGACTTCACTGAGAACTGCGCCGATTTTGGTTTCAAATACGCCGCCGTCCAGCTCCCCGATAAATGCCTGCACATCAGTACTGCGTTCGCTAGCCATTTTGCTGCTCCTCATCATATCGACCCTGCAAGGTCGGTTAGTTTCTCCACAAAACAGAGAAGAACACCTGCGGTGGCAGCCGCCCGGATGGATTGGGTTATGAGCCCGTCGTCCGGTGATGCTCTTCTCTGTTTTGTAAAAAGAGCGGTACCAGCCGGAAGCAAGTGTACAAACTGGTACCGCCAAAGCAGTGGCTGTTGTGGTGACCGGTGCTGATCTCCGGCTTGCGGTTATTTCAGACTCTCACGGGCGTTTAATTGCCCCGCCGAACAGCTCTTTTCCGCAATAGCTGCAATGTCTTTCGCGCATCAGCCTGCGCATTCACCACAACGCTGAGAGCACTTAGCCAGTTACGGCACCACACTTTGTCGCGGTTCCATAAATGCCCTCATCGTTGCACCCTGGTCTCTTCCCAGGCGTCAAACCGAATCGCCACGCTGGTTAGGCGTCTTATCAGCATCATCATTGACTTGCACATTCCGGCTACCTGGTTTGTTTGCCCGAGCAAGGAGTGGATTGTCCCCTTTAACGTCACCAGACCGCTAACGACGCATGTGCCATACGCCGTGTTACAACCAAATTTTGTTAGTACCTTGTTTGTTGGTCTGGAAAGAAAGATAAAATGAAGTTGCGCATTATGCAAGTGTTTTTATTGCGAGATATGCAATTTTGTGGGTAATGAAAAGCCACCTTCGGGTGGCTAATTGATGAGGAGGTAAGGGTTAATTGTGTCGCTTAAGGGTTTGTGACTGGCTGATTAAGACCTTTCCAAAGACCATAAACCGGTGTTCATTTTCGCTGGTAATTCCCCATTCACGGTAAATCTGGTTATCAGAAATCACCAGTAGTTTGTCAGGTATCATTTGCAGTCGTTTGACATAAATTTTATCATCAAAACCAAATACATAGATACCATCTCCATCAAACTGATTGATACTGACATCAACGAAGATGAGATCTCCTGGCTCAATGGTTGGACACATACTGTCCCCACGAACGTTGATAACTTTGATGTGATTGGCTGGTCGTCCGCCGAACATTGATACAGCATTATCAGTTCTGTATTCGATGGCATGAATCACATCAATGACATCACCGCCCTGGATAAGGCCATTTCCCGCACTGGCACTGACATCCAGCATTTCAATACGGAATACATCCTTCACCTGCGCAACATCCTCACTAATACTGTTTTTACATACAGTATTACTTTTGAAGTCTGAGGTAAAGAGATCAGCAATATCAACACCTAAGCTCCTGGCAATATTACTCAGGGCTTGTTCAGTGAATTGTTTCTGCTTACCTGTTTCCAGGCGTGAGATATTCGCCGCATCCACTCCTATTGCTTCAGCGAGATCGGCGATTTTCATGTTCTTCGCCTGGCGAAGTTGTCTGACTCGATTTCCTATGTTCATGCGTTTATTACATTTCTTTATTGCGCGTTAAGCAAATCAACTTGCGCAAAATATTTGCGTGAAATAATATGCTTATCACGCAATATGTGGAGGTTATATGCAATCACCATTACGGAATGTGCGTAAAGCGCACGGATTTACTTTGCAGCATGTTGCTGCGGGCGTTCAGGTCAATCCAGCGACGCTGAGTCGTATTGAAAGACTGGAACAAATTCCATCTATCGACCTTGCAGAACGTCTGGCCAATTTTTTTAAGGGTGAAATCAGCGAAATGCAGATTCTTTATCCGGCACGTTTTCAATCTAGCCAAAACCAGAATGGGTTTAAACCACAGGAACAGGAGGTAAGCCGTGGGTAAGCATCACTGGAAAGTGGAAAAACAACCTGACTGGTACGTGAAAGCTGTCAGAAAAACTATCGCGGCATTGCCAGGTGGTTACGCTGAAGCTGCTGATTGGCTGGATGTAACAGAGAACGCATTATTTAATCGCCTTCGTGCCGATGGTGATCAGTTTTTTCCGTTGGGATGGGCAATGGTTTTACAGCGCGCGGCTGGCACTCACTACATTGCGGATGCTGTCGCACAGTCTGCTGGTGGGGTGTTCGTATCGCTTCCTGAAATTGAGGAAGTAGAGAACGCCGATATAAACCAGCGCCTGCTGGAAGTCATCGAACAGATCGGGAGTTACTCAAAGCAGATTCGTTCGGCAATCGAAGATGGGGTCGTGGAGCCACACGAGCAGACAGCAATTAATGATGAGTTGTATCTGTCAATTTCGAAGCTCCAGGAGCATGCAGCACTGGTCTACAAAATCTTTTGCGCTCCAGAAAAGAGTGACGCCCGCGAGTGTGCAGCTCCGGGCGTCGTGGCGTTTTGTGTCTGTGGAGAAACTAACGCATGAACAGTTTAACGGCAAATAACCGTTTGTCGCAACAACCGGTGGTCAGCGTCGCTGAACACCGGTTGTTATGGCATGAATGCAGATTACCAAATCACCTGGCTGTAAGTAACCACAGAGAACTTTACCTGACTGTGGGGGGCGAGTTGTGCAGGAACTTAACCGCTGGTTTCGTGACGGAAGAGGACTTTATGTTCATGTTATTCGTTGGGAGCCAGAAACACAGCGCGTTATCTATCTTCGCAAAGACTACCCGCATGAGTGCTTTAGTCCTTTGTGGAAATTCAGGCGTGATTTTGTTGAGTGTGAAGGACCACCAGCACATTGATTCTGCCATTCCGGGACGTTACACTGTTCAGGCACCTTATAAAACGGGTGCCGGGCGTGGAAACCCGGAATTCACCAAAGCGCACAACCGCGCTCTTGCGGTTTTTTTGTGTCATGAGCAGCATTACGCCCAAATTATGGTGGGGCGTGCAGGGCCAACTTCGGTTGGGCCGGGTTCTTTGGTGACCGGTATTTCCACCCCTGTACGTCTCACCACCAATAAGGTCGTGGAAAGCCTTGGTGGTGAGTTATTAAAAATCACCAAAGAGGCTGCCATCATGGCTACGATCCCAACCCTCACTCAACCTGAAATTGCCATCGTTGATGGTCAGGCTGTTACTTCATCCTTGGCTGTTGCCAACTTCTTCTCCAAACGTCATGACGATGTACTGAAAAAGATCCGCACGCTTGAATGCTCAGCATCATTCACTGCCCGCAATTTTTCGGTGAGTGATTACACTGATTGCACAGGTCGCAAACTTCCTTGCTACCAAATCACACGCGACGGTTTTGCGTTTCTTGCTATGGGTTTCACGGGTAAACGTGCTGCCCAGTTCAAGGAGGCATACATCAATGCCTTTAACCAGATGGAGAAACAGCTTTCAAAGCCCTCTGTACCGAGCGACGTTGCACATAACGCCAGCGTTCTCTGTTCCTACATTTCATCAATTCATCAGGTCTGGCTGCAGCAGCTTTATCCTATGTTGGCAAAAGCCGAATCTCCGCTGGCTGTTAGCTTGTATGACTATATTAATGATGCTTCGGCACTGGCCTGCCTCATAAATTTGTCGCTGAACCCTTCAGAGGTATGGGGGCGCAAATGATCCGGAATATTTTCAAACGGTTTACCAATCAGACTTTCCGTTGTCCTCGCCCCGGTCAGTGGTACACCACGCCTGCAGGGCATGTTCTACGTGTCAGCCTGGTTGACCGTGAATGTCAGAAGGTGATTTGTGAACCGCTGGGCCGTAATTATCGCGTCAGTATGCCGCTTATAGCCTTTCGCTCCGGAAAAAACATGAAGCATCTCGGAGGTGCAGCATGAGTATGGAGCTGATGGTTAAAGCGATGAAAATTCGAGTGGGAAATCCATTGCGAAAACTGGTTCTGATCAAGCTGGCTGATAATGCCAGCGATCAGGGGGAGTGCTGGCCCAGCTACCAGCATATTGCTGACCAGTGCGAGATTAGCAAACGTTCTGTGATGAATCATATTGCGGCCCTTTGTGAGTCCGGGCTGGTAAAAAAAGTCACCCGGAAAGGTGAAAAAGGTAACTCAAGTAATATCTATCTCCTTCATCTGGATGGTGCAGGAGATTCACTAGGGGGTAGTGCAAATAATTCACTATCTGGTGCAGCAAATTCACCAGGTAGTGCAGGAGTTGCACCAGGGGGTAGTGCAGGAGATTCACCCAGAACCAGTCACTCTTTTGAACCAGTCAAAGAACCAGTCAATGAACCAATAGCTGTTGGTGCATCAGTTGATGAGTCCGTGCGAGTTCGTTCAAACCGACCGGAATACTCTCCGGAGTTTGAGCAGGCATGGCTGGCATACCCCAAACGTGCTGGTGGCAATTCAAAATCTGCAGCCTTCAAAGCCTGGAAAGCCCGTTTGAATGAGGGGGTAAAACCCGAAACCATGCTGGAAGGTGTGAAACGCTACGCGGGCTGGGTATCTGCGATGGGTAACAGCGGCACACAATTTGTGAAACAGGCTGTCACGTTCTTTGGTCCGGATCGTCATTTCGAAGAATCCTGGGAAGTTCCTGCGGTATCTGCAGCCAGACGCGAGGACCCGTACTTCAAAGCCAGTTACGACAACGTGGACTACAGCCAGATCCCGGCAGGATTCAGGGGGTAATCATGAGTCTTTTGAATGAAGTTCAGAAATTCATTGAAGCCCATCCGGGGTGTACTTCCGGAGACATTGCGGATGCTTTTGCAGGTTACTCACGGCAGCGCGTTCTGCAGTCAGCAAGCAAGTTACGTCAGAGTGGGCGTGTGGCTCACCGTTGTGAAGGAGATACACGCAGACATTTCCCACGCCTGACTGAGAGAGCGCAGGAGCCGGAACCACAACCAGTTCGTGAAACCAGACCTGTGCGCAAGTTCTATGTCGGCACTAACGACCCGCGGGAGATTTTGAGCCTGACCCGCCAGGCGGAAGAACTGGAGTCCAGGGGCTTATACCGTCGTGCTGCAACGGTGTGGATGGCGGCATTCCGTGAAAGCCACTCCCAGCAAGAGCGAAACAATTTTCTGGCGCGTCGTGAGCGGTGCTTACGGAAAAGCAGCAAGCGCGCTGTATCGGGTGAAGAGTGGTATCTGTCAGGGAATTACGTGGGGGCTTAATGACGACGTTAACTCAATGCCAGCAGCAGGTGCTGGATATGCTGATTTCTTACCAGCAAGAACGTGGCTTTCCGCCAACCAATCAGGAGGTGGCAACCATGCTGGGATACCGTTCAGTGAATGCAGCGGTGGAACATCTTCGCGCACTGGAGAAAAAAGGCGTCATCACGATAAAGCGTGGCGTGGCCCGGGGTATCACTCTTCATACCGCGGTGAAGGACGACGACAGCGAGGCGGTCGGGATTATCCGCTCACTGCTTGCCGGTGAGGAAAACGCCAGGCTGCGTGCAGCCCACTGGTTACATGAGAGGGGCCTGAAAGTATGAAGCTGATCCTACCTTTCCCGCCCAGCGTGAACACGTACTGGCGACACCCCAACAAAGGGGCGTTTGCAGGTAAGAGCCTGATAAGCGCGGCGGGGCGAAAATTTCAGAGCGCGGCGTGCGCAGCAATAGTTGAACAGTTACGTCGTCTGCCGAAACCAACGTCGGCACCTGCTTCAGTGGAGATCGTGTTGTTTCCTCCGGATAACAGGATCCGCGATCTGGACAACTATAACAAGGCGCTGTTTGACGTTCTGACCCACGCGGGTGTGTGGGAAGACGACAGCCAGGTGAAAAGAATGCTGGTGGAGTGGGGACCGGTTATCCCGGAAGGGAAGGTCGAGATCACTATCAGTAAGTACGAGAAAACGGCGGGTGCAGCCGCCTGAGCAAGAGGAGAAACGAAGTATGAATAATCTGATGGTCATTGATGGTATTGAAGTTCGTCGTGATGCTTATGGGCGTTACAGCCTGAACGATCTGCACAGGGCAGCCGGGGGAGAACAAAAAAACCGCCCGAAATACTGGCTCTCCAATAAGCAAACCTGTGAATTGATTGAACAACTTTTCACCGAGGGTGGAATTCCGCCTCTGGAACAAAATCAACCAGTTAGCGTCATTAATGGCGGAAATAACCAGGGGACGTATGTCTGCAAAGAACTGGTGTATGCCTATGCAATGTGGATCAGCCCGTCATTCCATCTGAAGGTGATCCGTACTTTCGATATGGTAACCAGCACACCGGAAAAATTATCCGGGCAGGCTGCTGACAAGATGCAGGCTGGAGTGATTCTGCTGGACTTTATGCGCAGGGAGTTAAACCTGTCTAACTCTTCAGTGCTTGGGGCCTGTCAGAAACTCCAGGAGGCTGTTGGCTTACCGAATCTGGCACCGCGCTATGCCATTGATGCTCCTGCTGATGCACACGATGGCTCAAGTCGCCCGACACTGTCACTGAGTGCACTGCTGAAACAGTATGGTATCCGCCTGACGGCTAATCAGGCATATCATCAGATGGTGAAGCTGGGGATCGTCGAGCAGCGCGAACGATACAGCCGTACCGGGATTAACAACATCAAAAAATTCTGGTCGCTGACGGCGAAAGGCTGCATGTTCGGCAAGAACATCACCAGTCCAGCAAATCCGCGCGAGACGCAGCCGCATTTCTTCGAATCCCGATTCCCTGAGCTGTTAAAGCTGCTCGATACCGTTCATTGAGGTGACCGTGAGAGCACTACTGACCCCTGAAATTGCCCCGCGTATGGGGATCGTATTGTTCAGGCCAGGTTCAGAGCTGATGCCCCTGTTTATGCAGGGGCGTGTCCTGCTGGAGCCTGAGCCGGAACGTTATTCATCTTTCGCCAGTGGTGCCGTTCCGGCGGCATCACAACCGCTGGCGGATGATCCTGCCGTTCGGGCCGTGTTCCGCAATGAGGCAGTGATCCGTCGTGCTGGTGGCGTGGAATGTCTTGAAAGCTGGTTACTTCGTGAAAAAGGCTGCCAGTGGCCTCATTCCGACTGGCACAGCGAGAACATGACCACAATGCGACACGCTCCGGGCGCAATCCGTCTGTGCTGGCACTGCGATAACCAGCTGCGCGATCAGTTCACGGAACGGCTGGAATCAATGGCAACGGATAACTGTGCCCGCTGGGTGTTGTCTGTAGTCCGTCGGGATCTCGGTTTTGATGATAACCATGCCGTGACAATGCCGGAACTGTGCTGGTGGCTGATTCGTAATGACCTGGCGGATGCCTTACCTGAAAGCGCAGCCCGTAAGGCGCTGAGATTACCGAAACCTGTTGTGCCGTCTGTCACCCGGGAAAGTGACCTTGTGCCTTCGGTTCCTGCCACCAGCATCATACAGGATAAAGCGAAAAAGGTGCTGGCGCTGAAAGTGGATCCGGAGTCGCCGGAGTCTTTTATGTTACGCCCAAAACGCCGCCGCTGGGTTAATGAAAAGTACACGCGCTGGGTTAAGACACAGCCGTGTGCATGTTGTGGAAAGCCTGCTGATGATCCCCACCACCTGATAGGCCACGGTCAGGGTGGAATGGGTACAAAAGCGCATGACCTCTTTGTGTTGCCTTTGTGCAGAAAGCATCACGACGAGCTGCATGCGGATACCGTGGCATTTGAAGAGAAGTATGGCTCTCAGCTGGAGCTGATATTTCGTTTTATCGATCGTGCGCTGGCAATTGGCGTATTGGCGTAAGTGGAGAACGAGCATGAACCTTGAAGCCTTACCAAAATATTACTCCCCAAAATCTCCAAAATTGAGCGATGACGCACCGGCGACAGGCTCAGGTGGTTTAACGATTACGGATGTGATGGCTGCGCAGGGGATGGTGCAGTCGAAAGCACCGCTTGGGTTTGCCTTATTCCTGGCAAAAGTTGGTGTTCAGGATCCTCAATTTGCGATTGAAGGTCTGCTCAATTACGCGATGGCACTGGATAACCCGACATTGAACAAATTGAGTGAAGAAACCCGGTTACAGATCATCCCTTACCTTGTGAATTTTGCCTTTGCTGATTATTCCAGGTCTGCGGCAAGTAAGGCTCGCTGTGAGCATTGTGCTGGTACTGGATTTCATAATGTATTGCGCGAAGTGGTGAAACACTCCAGAAGCGGGGAATCTGTTATCAAGGAAGAGTGGGTGAAGGAACTATGTCAGCATTGTCATGGTAAGGGAGAAGTCAGCACAGCGTGCAGAGGGTGTAAGGGTAAAGGTATTGTCCTGGATGAAAAAAGGACCCGGCTTCATGGCACACCTGTTTATAAGATTTGTGGGCGTTGCAATGGAAACCGGTTTAGCCGTTTACCAACCACACTGGCGCGGCATCATGTCCAGAAGCTGGTACCAGACCTGACGGATTATCAGTGGTACAAAGGATATGCAGATGTCATTGATAAACTGGTTACAAAGTGCTGGCAGGAAGAAGCATATGCAGAGATACAATTGAGAAAGGTGACAAGATAAATGGTTTTCGCCGAAGATGACGACATGATGCTTGCATTTTTCAAAAAATATGGATAAGATTTTCCCAACGATGGGCTTTGTATGTCTACCGTTGATAAGATTTAAGAACCCGCCACTGAGCGGGTTTTTTGTACCTGTAAACTTGGTGCAGTACAGTAAACACGCTGGTGGTCGTGAATACTGACTTTTTATCTTGCTGGCTTTTTAGACAAGAGTTATTGGTATGTCATGTTAACCAGAAGGGAAAAAGACATGCTAAAACAGCAAGATATGACAGAAACCGCCGCCGCAGTCCTTCATTTCTTACCTGCTGACAAGTGGGTAACGCCACGCATGATGACGAGAACTACCGGAGTAAGTGAAGCCCGGTGCCAGTTAATACTGACTCAGTTAGTTCTGGCGGGTCTGGCGAAGGATAACGGCGGGTACGGGAATAAATTCAGACGCTGCCAGTAATGGCGGTTTCCTGCTGTGAAAATGGGCGGCTGGTGGGTGTTGGTAGCACCTGCCAGCCATTCGCTCATGCTTACTGGTCACAAGCGAACCACGGCCCACTGCTTTAGCGCAAAAGCAGAGTGAGCCTACCAGAGTTACGCTTACTGATCCATGAAAAATACTGTAAAAATAAACAGTGTTGATTTAATCAACGCTGATTGCCTGCATTTTATTCAGTCCCTGCCTGATGATTCCATTGACCTGATTGTTACCGATCCGCCGTACTTCAAGGTGAAACCCAACGGCTGGGACAATCAGTGGAAAGGGGACGAAGATTACCTTAAGTGGCTGGACCACTGTCTGGCCCAGTTCTGGCGGGTGTTAAAACCTGCCGGAAGCCTTTACCTGTTCTGTGGGCATCGCCTGGCATCTGATATTGAGATCATGATGCGTGAACGTTTCAACGTGCTTAACCATATCATCTGGGCGAAGCCGTCCGGACGTTGGAATGGGTGTAATAAAGAAAGTCTGCGCGCATATTTTCCTGCCACAGAGCGCGTTCTGTTTGCTGAACATTACCAGGGGCCATATCGCGGCAAAAGTGACGGCTATGCAGCAAAAGAAAGGGAACTCAAACAGCACATAATGGCACCGCTGATATCGTATTTCAGGGATGCTCGTGCCGAACTGGGTATAACGGCAAAACAAATTGCCGAAGCCACAGGTAAGAAAAATATGGTTTCCCACTGGTTTGGTGCCAGTCAGTGGCAGTTGCCGAATGAGGCTGACTATCGGAAGTTACAGGCACTGTTTTTCCGTATAGCGGCAGAGAAGTTTCAGGAACAACAACTGGAACAACCACACCACCAGCTGGTGGCATCTTATGATTCACTGAATCGTAAATATTCTGAATTGCTGGATGAGTTTAAATCTCTCCGGCGCTATTTCTCCGTATCAGTCTCCGTGCCTTATACCGATGTCTGGATGCATAAACCCGTTCAGTTCTACCCGGGGAAACATCCGTGTGAGAAACCGGCGGATATGCTCAGGCAAATAATCAATGCCAGTAGTCGACCAGGTGATCTGGTTGCTGATTTTTTTATGGGATCCGGTTCCACAATAAAAGCAGCAATGGCGCTGGGGCGTCGGGCCTTAGGTGTTGAGCTTGAGTCAGAGCGGTTTAACCAGACAGTGAAAGAGATAAACGAGCTGGTGGGGAAATAATCTGGTGGCCACGTCAGGTGGCCTTTTTATTTCCATTACACAGCACCCGCATCTGCGAGGTGGGGTTATGAAATCCATGGATAAGTTAACAACGGGTGTCGCCTATGGCACCTCAGCAGGTAGTGCCGGGTACTGGTTTTTACAGTTGCTCGATAAAGTCACGCCCTCACAGTGGGCGGCAATAGGTGTGCTGGGTAGTCTGGTATTTGGCTTGCTGACGTATCTGACAAACCTTTATTTCAAGATTAAAGAAGACAAGCGTAAGGCTGCACGGGGAGAGTAATTCAATGACTCAAAACTATGAACTGATTGTGAAAGGGATCCGCAATTTTGAGAATAAAGTTACGGTAACTTTAGCGTTACGGGACAAAAAACGCTTTGACGGTGAAATTTTTGACCTGGACATCTCGCTGGACCGTGTTGAAGGTGCCGCGCTGGAGGTTTATGAGGCAGCAGCCAGAAGGAGCATCAGACAGGTCTTCCTGGATGTTGCTGCCGGGTTATGTGAAGGGGATGAGCAGTCGCCGGAAAAGCGCCCCGTAATTTTAGATGCGCAGAATGTTTGGATAACCTACAAAGGAAAGCTACCAGGAAGAATTACTGGTTCTCTGAAGACTCCTCCGGAATCACAACCTTAAGTCACTGACCGGAACAGATAAACCTGTCCGTGGGCAGAAACCGATAAATCCTGATAAATATCCATGAACGCAAAAATCAGATACGGCCTGTCGGCTGCCGTTCTGGCACTGATTGCCGTCGGTTCGCCCGCGCCTGATATTCTCGACCAGTTTCTGGATGAAAAAGAAGGTAACCACACAACGGCATACCGCGATGGGTCCGGCATCTGGACCATCTGTCGGGGTGCCACGATGGTGGATGGAAAACCCGTTTTCCCCGGTATGAAACTGTCGAAGGAAAAATGCGACCAGGTCAACGCCATTGAGCGTGATAAGGCGCTGGCATGGGTACAGGCCAGCGCAGCAGCGAAGTTTGCTTCCGCAGCGAAGACATCCGAAACGAACGCGAAAGCGTCGGAAACCCGTGCAGAATCCTCAAAAACGGCAGCCGCATCGTCCGCCAGTTCGGCAGCGTCATCGGCATCATCTGCGTCTGCTTCAAAAGATGAGGCGACCAGACAGGCGTCAGCAGCGAAGGGCAGCGCCACGACGGCATCCACGAAAGCAACAGAGGCGGCAGGCAGTGCGACGGCGGCAGCACAGAGCAAAAGTACGGCGGAATCCGCGGCAACGCGCGCTGAGACAGCGGCAAAACGTGCAGAGGATATTGCATCCGCCGTGGCGCTTGAGGATGCGAGCACGACGAAAAAGGGGGTAGTACAGCTCAGCAGTGCGACCAACAGCACGTCTGAAACGCTGGCGGCAACGCCAAAGTCAGTAAAATCAGCCTATGACAATGCAGAGAAACGTCTGCAGAAAGACCAGAACGGCGCTGATATACCCGATAAGGGACGCTTCCTGAACAACATTAACGCGGTCAGTAAAACAGACTTTGCTGATAAGCGTGGTATGCGTTATGTGCGGGTTAACGCTCCTGCAGGTGCAACATCTGGAAAATATTACCCTGTTGTTGTTATGCGTTCTGCTGGCTCAGTAAGCGAACTGGCATCAAGGGTCATTATCACCACGGCAACGCGAACCGCAGGCGATCCGATGAATAACTGCGAGTTTAACGGATTTGTTATGCCTGGTGGCTGGACTGACAGGGGGCGTTATGCTTATGGAATGTTCTGGCAATATCAAAACAATGAACGAGCCATCCACTCAATAATGATGAGTAATAAGGGCGATGATTTGCGCTCTGTGTTCTATGTTGATGGCGCTGCTTTCCCTGTTTTTGCGTTTATCGAAGATGGCCTGTCAATATCCGCACCTGGTGCTGATCTCGTTGTTAATGATACGACCTATAAGTTTGGGGCAACAAATCCAGCGACTGAATGTATCGCGGCGGACGTTATCCTTGATTTTAAGAGTGGGCGTGGTTTTTATGAGTCTCATTCGTTAATCGTTAACGATAACTTGTCATGCAAAAAACTTTTTGCCACAGACGAAATTGTAGCGCGTGGTGGTAATCAGATTCGAATGATAGGTGGGGAGTATGGTGCATTATGGCGTAATGATGGCGCTAAAACTTACCTGCTGCTTACCAATCAAGGTGATGTTTATGGTGGCTGGAATACATTAAGACCGTTTGCTATTGATAACGCAACCGGCGAACTGGTTATTGGAACCAAACTGTCCGCAAGTCTGAACGGTAATGCATTAACAGCAACAAAGCTGCAAACGCCAAGACTGGTTTCTGGTGTTGAGTTTGATGGTTCCAAAGATATTACTTTAACCGCCGCGCATGTGGCTGCTTTTGCCAGAAGGGCAACGGATACATATGCCGATGCGGATGGTGGCGTTCCCTGGAATGCCGAATCAGGCGCTTACAATGTCACCCGCTCTGGCGACAGCTATATTCTGGTTAACTTCTATACCGGAGTCGGAAGTTGCCGGACCCTGCAGATGAAGGCGCATTACAGAAATGGTGGTCTGTTCTACCGTTCTTCAAGAGACGGTTATGGTTTTGAGGAAGGCTGGGCAGAAGTTTATACCTCGAAAAATCTTCCACCAGAAAGCTACCCAGTCGGCGCACCAATCCCGTGGCCATCAGATACCGTTCCGTCTGGTTATGCCCTGATGCAGGGGCAGACTTTTGACAAATCTGCCTACCCGAAACTTGCAGCCGCTTATCCGTCAGGCGTGATCCCTGATATGCGTGGCTGGACGATTAAGGGCAAACCCGCCAGTGGTCGTGCCGTATTGTCTCAGGAACAGGACGGCATTAAATCGCACACCCACAGCGCCAGCGCATCCAGTACGGATTTGGGTACGAAAACCACATCGTCGTTTGATTACGGCACTAAATCCACGAATAACACCGGGGCGCATACCCATAGTTTAAGTGGCAGCACGAATGCAGCTGGTAATCACAGCCATAGAGATGGCCGTCGATTTAACCCCAGTGTTTTTAAAGATACTTATCAATATGGTTATACAAGCTCAGGTCAAAATACCTGGGGGGTACAAGGCTCAGTAGGTATGTCTACGGGGTGGTTAGCTAATACCAGTACAGATGGTAATCATAGCCACTCACTGTCCGGCACAGCAGCATCTGCAGGTGCACACGCGCATACTGTCGGTATTGGTGCTCATACGCACTCTGTTGCGATTGGCTCACATGGACACACCATCACCGTTAACGCTGCTGGTAACGCGGAAAACACCGTCAAAAACATCGCATTTAACTACATTGTGAGGCTTGCATAATGGCATTCAGAATGAGTGAACAACCACGGACCATAAAAATTTATAATCTGCTGGCCGGAACTAATGAATTTATTGGTGAAGGTGACGCATATATTCCACCTCATACAGGTCTGCCTGCAAACAGTACCGATATTGCACCGCCAGATATTCCGGCAGGATTCGTAGCCGTTTTCAACAGTAATGAGGCATCGTGGCATCTCGTTGAAGATCATCGGGGTAAAACGGTTTATGACGTGGCTTCCGGCGACGCGTTATTTATTTCTGAACTCGGACCGCTACCGGAAAATGTTACCTGGTTGTCGCCGGGAGGGGAATATCAGAAGTGGAACGGCACAGCCTGGGTGAAGGATACGGAAGCAGAAAAACTGTTCCGGATCCGGGAGGCAGAAGAAACAAAAAACAGCCTGATGCAGGTAGCCAGTGAGCATATTGCGCCACTTCAGGATGCTGTAGATCTGGAAATCGCAACGGAGGAAGAAATCTCGTTGCTGGAAGCATGGAAGAAGTATCGGGTGTTGCTGAACCGTGTTGATACATCAACTGCACCTGATATTGAGTGGCCGATAATTCCTGTAACATCATAGTCATTATTTATTTGTTTTGCTGACATGTTGTATACGACAACGTGTCAGCAGTTTGACTGGTATTATAACGAATATATTGTAATGTTTTCTGGTGTTATTAATTGTTGTTTTATTACCATATATCTATGGGTTTTGTTTTCTGCTCATCTAAACAAAATGCACAGATGTAAGCCCCCTTACTTCGGTTGTACTGAACCGTTTCTAAAGCAGAGTCAAGATGTTTACTATCAAAGTTCCAGCTTTTTTCCATAACAGAAACACATTTTCTGCAATAGAAACTTTCTTTTGAGTTAGTATCTCCTGAATCATTTTTAGTGTAGCCAAAGAATTTTTTTGCTGATTCCTTTGAACATGCGGAAATGGCTGCAGTGAACTCATAATGAGTTTGAAGCATAGCCCATACAATACCTGAAAATTCTTCATCCTCGTCTTCATCTAATCCTATATTAATGCCGCCAGAGGAATGTATTGCTTTATATTTATACCTATGCCATATCCAGTGTGCTGATTCAGTATTTAGATAGTTGTGAGCACGTACAACTGATGTGAAAATTAGCTCTGGCATTATATCTGTTTTTTTATCAACCGTATGCATAAAACGATTTCTTAGAATGCGCATACTATTAAACCAAGTAATAAAGTCATCAGGCAGTGATTTAGAGGCGAAAGTATTATATACTTTAATTAAATCTTGGGCGTCTAATGTGTGGAAGTCTGAAAATGAAATGTTACCTTCACTATCAGGTTTAGGTGTATTGCGAGTTGCATTCGTGATAAGTAGATAAGGTGATATTTCGACGATAAGTCCTTTTAGTCTGAATTCAATAGATTGAAGTATCAAATTAAAAGATGTAATCAATTTTGGTCTAGCAAATAACCAATAATGCTCTAAATTTTTAATATCTGATTCTGTATAGAATTCAGACGTTTTACAATAACTATTCCAAAGGTTTATATCTCTGTAATCATAAGCTAAATCGGCAATTTTTTCCCAAGCCTCATTTACCATGTGGTCAGCCATTGTATAAAAATCAGTCTTTGTTGGAATGTCAGTTATCATTATCAATCTACCTTTGGGTCTTTATTGTTGTCTATCAAGATATAATATTCAGGCTGGAGTGGCTAATGTTGATCTACTTATTTTCTCCCGGTAAGTCAGTTTATCGCGATATAAGCCGCATTTGATTTTGGCGACTATGGTCGTGTTAGGTATGTATCTGGCGTGTTACTGCGCCAGATTAAATATAAACCAATAATATTAAAGGGTGTTACTCTCTTGACAGATTTAGGGAACAAATCACTCGTCAGCTGACTCCCAAGTATCTTTCAGAGTTTCCTGAACAAAAGTTTTAGCTGAATCTTTATCGGCGGTGCGCCTAACAGAAAAACCATCGTTGCTGGTGGCTTTTACGATCATCTCCACATCGTCATAATGCTCGCCAATGTATTGGGTTAATTCTTCCTTTAATGCGTCTACAGCGCCGTTTGGCAT
Protein sequences of DBSCAN-SWA_3 >CP031653|1253604:1275611|1263933_1264752_+|AXP25795.1|DBSCAN-SWA MSMELMVKAMKIRVGNPLRKLVLIKLADNASDQGECWPSYQHIADQCEISKRSVMNHIAALCESGLVKKVTRKGEKGNSSNIYLLHLDGAGDSLGGSANNSLSGAANSPGSAGVAPGGSAGDSPRTSHSFEPVKEPVNEPIAVGASVDESVRVRSNRPEYSPEFEQAWLAYPKRAGGNSKSAAFKAWKARLNEGVKPETMLEGVKRYAGWVSAMGNSGTQFVKQAVTFFGPDRHFEESWEVPAVSAARREDPYFKASYDNVDYSQIPAGFRG >CP031653|1253604:1275611|1275416_1275611_-|AXP25809.1|DBSCAN-SWA MPNGAVDALKEELTQYIGEHYDDVEMIVKATSNDGFSVRRTADKDSAKTFVQETLKDTWESADE >CP031653|1253604:1275611|1257502_1257709_-|AXP25783.1|DBSCAN-SWA MLNLDCVPISTYCKETGETPEAINKRVQRGVWREGVQVLKVEGVKERWIDLSEVAKWARQNCSNYRAA >CP031653|1253604:1275611|1269153_1270206_+|AXP25803.1|DBSCAN-SWA MKNTVKINSVDLINADCLHFIQSLPDDSIDLIVTDPPYFKVKPNGWDNQWKGDEDYLKWLDHCLAQFWRVLKPAGSLYLFCGHRLASDIEIMMRERFNVLNHIIWAKPSGRWNGCNKESLRAYFPATERVLFAEHYQGPYRGKSDGYAAKERELKQHIMAPLISYFRDARAELGITAKQIAEATGKKNMVSHWFGASQWQLPNEADYRKLQALFFRIAAEKFQEQQLEQPHHQLVASYDSLNRKYSELLDEFKSLRRYFSVSVSVPYTDVWMHKPVQFYPGKHPCEKPADMLRQIINASSRPGDLVADFFMGSGSTIKAAMALGRRALGVELESERFNQTVKEINELVGK >CP031653|1253604:1275611|1257768_1257984_-|AXP25784.1|DBSCAN-SWA MSVIKTHTGIVITRDGEKRMKLHSTETSWVAGRCESYDKKTGYRWGAPNMRRRLLLDSIRPIKQVATREQN >CP031653|1253604:1275611|1260639_1260894_-|AXP25789.1|DBSCAN-SWA MHNAQLHFIFLSRPTNKVLTKFGCNTAYGTCVVSGLVTLKGTIHSLLGQTNQVAGMCKSMMMLIRRLTSVAIRFDAWEETRVQR >CP031653|1253604:1275611|1264754_1265243_+|AXP25796.1|DBSCAN-SWA MSLLNEVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTRRHFPRLTERAQEPEPQPVRETRPVRKFYVGTNDPREILSLTRQAEELESRGLYRRAATVWMAAFRESHSQQERNNFLARRERCLRKSSKRAVSGEEWYLSGNYVGA >CP031653|1253604:1275611|1270493_1270844_+|AXP25805.1|DBSCAN-SWA MTQNYELIVKGIRNFENKVTVTLALRDKKRFDGEIFDLDISLDRVEGAALEVYEAAARRSIRQVFLDVAAGLCEGDEQSPEKRPVILDAQNVWITYKGKLPGRITGSLKTPPESQP >CP031653|1253604:1275611|1270907_1273535_+|AXP25806.1|tail|DBSCAN-SWA MNAKIRYGLSAAVLALIAVGSPAPDILDQFLDEKEGNHTTAYRDGSGIWTICRGATMVDGKPVFPGMKLSKEKCDQVNAIERDKALAWVQASAAAKFASAAKTSETNAKASETRAESSKTAAASSASSAASSASSASASKDEATRQASAAKGSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGVVQLSSATNSTSETLAATPKSVKSAYDNAEKRLQKDQNGADIPDKGRFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRLVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEGWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSLSGSTNAAGNHSHRDGRRFNPSVFKDTYQYGYTSSGQNTWGVQGSVGMSTGWLANTSTDGNHSHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA >CP031653|1253604:1275611|1270273_1270489_+|AXP25804.1|lysis|DBSCAN-SWA MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >CP031653|1253604:1275611|1261763_1262024_+|AXP25791.1|DBSCAN-SWA MQSPLRNVRKAHGFTLQHVAAGVQVNPATLSRIERLEQIPSIDLAERLANFFKGEISEMQILYPARFQSSQNQNGFKPQEQEVSRG >CP031653|1253604:1275611|1268799_1269003_+|AXP25802.1|DBSCAN-SWA MLKQQDMTETAAAVLHFLPADKWVTPRMMTRTTGVSEARCQLILTQLVLAGLAKDNGGYGNKFRRCQ >CP031653|1253604:1275611|1262564_1263716_+|AXP25793.1|DBSCAN-SWA MNSLTANNRLSQQPVVSVAEHRLLWHECRLPNHLAVSNHRELYLTVGGELCRNLTAGFVTEEDFMFMLFVGSQKHSALSIFAKTTRMSALVLCGNSGVILLSVKDHQHIDSAIPGRYTVQAPYKTGAGRGNPEFTKAHNRALAVFLCHEQHYAQIMVGRAGPTSVGPGSLVTGISTPVRLTTNKVVESLGGELLKITKEAAIMATIPTLTQPEIAIVDGQAVTSSLAVANFFSKRHDDVLKKIRTLECSASFTARNFSVSDYTDCTGRKLPCYQITRDGFAFLAMGFTGKRAAQFKEAYINAFNQMEKQLSKPSVPSDVAHNASVLCSYISSIHQVWLQQLYPMLAKAESPLAVSLYDYINDASALACLINLSLNPSEVWGRK >CP031653|1253604:1275611|1258998_1259823_-|AXP25787.1|DBSCAN-SWA MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNSERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAVRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERSFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >CP031653|1253604:1275611|1265565_1265955_+|AXP25798.1|DBSCAN-SWA MKLILPFPPSVNTYWRHPNKGAFAGKSLISAAGRKFQSAACAAIVEQLRRLPKPTSAPASVEIVLFPPDNRIRDLDNYNKALFDVLTHAGVWEDDSQVKRMLVEWGPVIPEGKVEITISKYEKTAGAAA >CP031653|1253604:1275611|1265209_1265569_+|AXP25797.1|DBSCAN-SWA MVSVRELRGGLMTTLTQCQQQVLDMLISYQQERGFPPTNQEVATMLGYRSVNAAVEHLRALEKKGVITIKRGVARGITLHTAVKDDDSEAVGIIRSLLAGEENARLRAAHWLHERGLKV >CP031653|1253604:1275611|1265974_1266784_+|AXP25799.1|DBSCAN-SWA MNNLMVIDGIEVRRDAYGRYSLNDLHRAAGGEQKNRPKYWLSNKQTCELIEQLFTEGGIPPLEQNQPVSVINGGNNQGTYVCKELVYAYAMWISPSFHLKVIRTFDMVTSTPEKLSGQAADKMQAGVILLDFMRRELNLSNSSVLGACQKLQEAVGLPNLAPRYAIDAPADAHDGSSRPTLSLSALLKQYGIRLTANQAYHQMVKLGIVEQRERYSRTGINNIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESRFPELLKLLDTVH >CP031653|1253604:1275611|1261639_1261792_+|AXP29105.1|DBSCAN-SWA MSDSISYVHAFITFLYCALSKSTCAKYLREIICLSRNMWRLYAITITECA >CP031653|1253604:1275611|1258333_1258870_-|AXP25786.1|DBSCAN-SWA MSFIKTFSGKHFYYDRINKADIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKQMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFNVIPLAPGHAYGMFMERFNELSELRKCA >CP031653|1253604:1275611|1263712_1263937_+|AXP25794.1|DBSCAN-SWA MIRNIFKRFTNQTFRCPRPGQWYTTPAGHVLRVSLVDRECQKVICEPLGRNYRVSMPLIAFRSGKNMKHLGGAA >CP031653|1253604:1275611|1256375_1257542_-|AXP25782.1|DBSCAN-SWA MGQTKLLKLPRGVTIRKHRQGETINITFTYKGVKCREPLSNLEVTPKNIKYAERTLGEIHNKIERGTFIYAEYFPRSARLKIFGNAAAGKTVKMYLDEYLEICETRKLSPSTIGGYKKCRSALASLHICPASELTPAILKAWIQSQKTTLKTIRNQLSFLRSALDEAVTDGVLQINPVSLVTASRYQSDKSEAESSYVVDPLSPAEVDALLTAAGNKQWENLFRFAIHTGLRSSELCALRWRDIDFVGKTAHVQSASVVGVIKGTKTKAGTRKVELTEDAMLALINQKPFTFMKDATVFEDPKTNKPWASADAIRKKAWVPTLRKAGIRYRNPYQTRHTFATRHISRGANLFWLAAQMGHKGPEMLFRHYGSYLKEYDGQTSLKKITI >CP031653|1253604:1275611|1253604_1254504_-|AXP25781.1|DBSCAN-SWA MRFNTPLRYPGGKGKLANFMLRIIEENNLSPIHYAEPYAGGAGLALKLLHLNAAEKIILNDINISVYAFWHSVLNHADQLCSLIERTEVTMDEWFRQKDIINNPKDHDLLTIGFSTFFLNRTNRSGILKGGVIGGKNQEGKWKLDARYNKSDLISRIHKISENRHRIDLYNMDAIDFIKKIVIQLPQNSLTYLDPPYYIKGKGLYINHYDHDDHVRVAKVVQNNIKTPWIVSYDNTPEIQAMYKTSSLVYGINYSAQDRYKGSEVMFFSERLKIFKTDDPTKVKAPVFKRGCVDDHIQT >CP031653|1253604:1275611|1267794_1268547_+|AXP25801.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATGSGGLTITDVMAAQGMVQSKAPLGFALFLAKVGVQDPQFAIEGLLNYAMALDNPTLNKLSEETRLQIIPYLVNFAFADYSRSAASKARCEHCAGTGFHNVLREVVKHSRSGESVIKEEWVKELCQHCHGKGEVSTACRGCKGKGIVLDEKRTRLHGTPVYKICGRCNGNRFSRLPTTLARHHVQKLVPDLTDYQWYKGYADVIDKLVTKCWQEEAYAEIQLRKVTR >CP031653|1253604:1275611|1262016_1262568_+|AXP25792.1|DBSCAN-SWA MGKHHWKVEKQPDWYVKAVRKTIAALPGGYAEAADWLDVTENALFNRLRADGDQFFPLGWAMVLQRAAGTHYIADAVAQSAGGVFVSLPEIEEVENADINQRLLEVIEQIGSYSKQIRSAIEDGVVEPHEQTAINDELYLSISKLQEHAALVYKIFCAPEKSDARECAAPGVVAFCVCGETNA >CP031653|1253604:1275611|1260973_1261666_-|AXP25790.1|DBSCAN-SWA MNIGNRVRQLRQAKNMKIADLAEAIGVDAANISRLETGKQKQFTEQALSNIARSLGVDIADLFTSDFKSNTVCKNSISEDVAQVKDVFRIEMLDVSASAGNGLIQGGDVIDVIHAIEYRTDNAVSMFGGRPANHIKVINVRGDSMCPTIEPGDLIFVDVSINQFDGDGIYVFGFDDKIYVKRLQMIPDKLLVISDNQIYREWGITSENEHRFMVFGKVLISQSQTLKRHN >CP031653|1253604:1275611|1274231_1275179_-|AXP25808.1|DBSCAN-SWA MITDIPTKTDFYTMADHMVNEAWEKIADLAYDYRDINLWNSYCKTSEFYTESDIKNLEHYWLFARPKLITSFNLILQSIEFRLKGLIVEISPYLLITNATRNTPKPDSEGNISFSDFHTLDAQDLIKVYNTFASKSLPDDFITWFNSMRILRNRFMHTVDKKTDIMPELIFTSVVRAHNYLNTESAHWIWHRYKYKAIHSSGGINIGLDEDEDEEFSGIVWAMLQTHYEFTAAISACSKESAKKFFGYTKNDSGDTNSKESFYCRKCVSVMEKSWNFDSKHLDSALETVQYNRSKGAYICAFCLDEQKTKPIDIW >CP031653|1253604:1275611|1273534_1274119_+|AXP25807.1|tail|DBSCAN-SWA MAFRMSEQPRTIKIYNLLAGTNEFIGEGDAYIPPHTGLPANSTDIAPPDIPAGFVAVFNSNEASWHLVEDHRGKTVYDVASGDALFISELGPLPENVTWLSPGGEYQKWNGTAWVKDTEAEKLFRIREAEETKNSLMQVASEHIAPLQDAVDLEIATEEEISLLEAWKKYRVLLNRVDTSTAPDIEWPIIPVTS >CP031653|1253604:1275611|1257980_1258343_-|AXP25785.1|DBSCAN-SWA MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPANESVLLFDANGEGWLIGWRSLWYTWEQKETGEWQWTFQVGDLENVNITHWAVMPKAPETKK >CP031653|1253604:1275611|1254583_1256317_-|AXP29104.1|DBSCAN-SWA MANQITKLKNINVVKFRGLKNINIEFGSRLTVICGKNGTSKSTILGIIAQIFSFTKDFTKNPETDLTQYKTLTNGSFKSAFSEHFRLSEQFDVPGSMDVKISVYDGASNKHLEKLTLGLYSYSDRDKSRPVVRGNDSIPEKNQSRNVTHPVIFLSLARLLPITLRTDYSTRDVQYINENSDEIRMMSNQLLLKNNGSSVTATKGTIDSMVVHGDNYDHQSVSVGEDNVGQLIQAIFSFKRLKETYSDYHGGILLIDEADAGLFPAAQLELINILTKAAKQYDLQIIMTSHSPLIIEDIYNRSKQDSDSFKTIYLTDTYGDIKTKNNLSWTDIHADLHVETVKINDDICLPKANVYFEDKEGFDFFKQLITDRKINKILNPLGNINISCSAMLDLMARKIPEFTAKSLIVLDGDVVHDNSANAKKAKKEKNLCLLPSTLPPDQMIFEFLYNLPPDDAYWENKNKFTKAVFMKTAKDIIATLKIGNAPIDLKILIDNYKKVNKNHGGRVRKLFKDFAHTTQFQAQVKGRVKDNPYRYWVEKNPVQSDSFKNELIKSLKVIMISGHGVDSATISSYLSDN >CP031653|1253604:1275611|1266791_1267781_+|AXP25800.1|DBSCAN-SWA MRALLTPEIAPRMGIVLFRPGSELMPLFMQGRVLLEPEPERYSSFASGAVPAASQPLADDPAVRAVFRNEAVIRRAGGVECLESWLLREKGCQWPHSDWHSENMTTMRHAPGAIRLCWHCDNQLRDQFTERLESMATDNCARWVLSVVRRDLGFDDNHAVTMPELCWWLIRNDLADALPESAARKALRLPKPVVPSVTRESDLVPSVPATSIIQDKAKKVLALKVDPESPESFMLRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHADTVAFEEKYGSQLELIFRFIDRALAIGVLA >CP031653|1253604:1275611|1259888_1260251_-|AXP25788.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGH |
31 | Enterobacteria_phage(46.43%) | tail,lysis | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1796944 : 1806386
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP031653|1796944:1806386|DBSCAN-SWA AATGATTGAATTTAGCCATGTCAGCAAACTGTTTGGCGCACAAAAAGCCGTTAACGATCTCAATCTCAATTTTCAGGAAGGGAGTTTTTCGGTGCTGATTGGCACATCTGGCTCCGGCAAATCCACCACCCTGAAAATGATTAACCGCCTAGTGGAACATGACAGCGGCGTGATCCGCTTTGCCGGAGAAGAAATTCGCTCGCTGCCAGTACTGGAGTTGCGCCGCCGGATGGGCTATGCCATTCAATCTATTGGCCTGTTCCCCCACTGGAGCGTGGCGCAAAACATCGCTACCGTGCCGCAATTGCAAAAATGGTCGCGGGCGCGGATTGACGATCGTATCGACGAATTAATGGCGCTACTGGGGCTGGAGTCAAATTTGCGTGAGCGTTATCCGCATCAGCTTTCCGGTGGTCAGCAGCAACGTGTGGGAGTGGCGCGCGCACTGGCTGCCGATCCGCAAGTCTTACTGATGGATGAACCTTTTGGCGCACTGGACCCGGTAACGCGCGGCGCGTTGCAACAAGAGATGACGCGCATTCACCGTTTGCTGGGGCGCACTATTGTGCTGGTCACTCATGATATTGATGAGGCGCTAAGGCTGGCAGAACATCTGGTATTGATGGATCACGGTGAAGTGGTGCAGCAGGGGAATCCGCTGACGATGCTGGCTCGTCCGGCGAATGATTTTGTCCGCCAGTTTTTTGGACGTAGTGAACTGGGTGTGCGCCTGCTTTCGTTACGTAGTGTGGCGGATTACGTGCGTCGCGAAGAACGGGCAGAAGGTGAGGCACTGGCAGAAGAGATGACGCTACGCGATGCGCTCTCCCTGTTTGTCGCGCGGGGATGCGAGGTGCTGCCGGTGGTGAACACGCAGGGCCAGCCTTGCGGCACGCTGCATTTTCAGGATCTGCTGGAGGAGGCGTAAGCGTATGAAGATGTTGCGCGATCCGCTGTTCTGGCTCATTGCTTTGTTTGTGGCACTGATTTTCTGGCTGCCTTACAGCCAGCCGCTGTTTGCTGCCTTGTTCCCACAACTGCCACGACCCGTTTATCAGCAAGAAAGTTTTGCAGCTCTGGCACTGGCTCATTTCTGGCTGGTGGGAATTTCGAGTTTGTTTGCGGTGATCATTGGCACTGGTGCCGGAATTGCTGTCACTCGCCCGTGGGGCGCGGAATTTCGCCCACTGGTGGAAACTATTGCCGCCGTTGGACAGACTTTTCCGCCTGTCGCAGTGCTGGCGATTGCCGTTCCGGTGATCGGCTTTGGTCTGCAACCAGCGATTATCGCCTTGATCCTTTACGGTGTGCTGCCCGTCCTGCAGGCGACACTTGCCGGGCTGGGAGCGATTGATGCCAGCGTGACAGAAGTTGCGAAAGGTATGGGAATGAGTCGTGGTCAGCGACTGCGTAAGGTCGAGCTACCGCTGGCGGCTCCGGTGATTCTGGCGGGCGTGCGAACTTCGGTGATTATCAACATTGGTACGGCGACGATCGCCTCAACGGTAGGGGCCAGCACGCTGGGTACGCCCATCATCATCGGGCTTAGCGGATTTAATACCGCGTATGTGATCCAGGGGGCGTTACTGGTGGCACTGGCGGCGATCATCGCAGACCGCCTGTTTGAAAGGCTGGTGCAGGCGCTTAGCCAGCACGCAAAATAAAGGTATAACCTGCGAGCATGACGCCACCAATTCCGCCTAACGCCATAAACAGGAACAGGGCGATGACCCCAATTTTAGCTATGCGCATATTGCACTCCTTATGTTAACGAAAGGATTGTACAGTAAAGCGCATTTGTTAACGAATCATTAAATGCCGAGTGGGAAAATATCATGGCCTTGTTCTTGCCAACTGGTGAGTTGCTGCTGTTGGGCGGAGGTTCGATTTTCACCGCACCACACCAGCAATGTACGGCCTTCGAATAGTTCAGGGCGTAGTTGATTGAGCGAGTGGGCGAGGACATCAATGCGCCATCCTTGTTGACTGGCAATCCAGCCCTCCAGCCACAGACGGGTGGTATCCTGAATATTCCAGCCAACCACCAGCGCATCTTTACCCTGTTTTTTACGTGCCGAAGCCAGACAAATGGCGATGTAGTTGATCAGTACGCCGTCGAGGATCGCCAGCAGCGCCTGGAGAGTCGGTTGTTGGCATTGAAGTCGTCGGCGCAGAGGAATAAACAGATGTGTGGTGAGTGTCTGGGCGGGGTAATCCTGACCGCGCTCTTTGATCCACGTTCGCAGGCTATGCAGATTGCCGCTTTGCAGGTAGGTCAGTAATGTTTCTTGCTGATCGCGCCAGCCGTTCTGCACATCAACATTTTCATTACTGAGCAGCATTTTAACTTTGCTGACCTGCACGCCGTTGTCGATCCAGCGTTTGATCTCGCGGATACGGTCAATATCGGCATCGTTGAACAGTCGATGACCGCCGTCTGTCCGTTGCGGTTTCAGCAACCCGTAACGCCTCTGCCACGCGCGTAACGTGACAGGGTTAATATCACAAAGCAACGCCACTTCACCAATTGTGTAAAGCGCCATCGTCTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCAGTTTTGCGAACCAGGTAGTTTTGCCCGTTTTTTGTGCATCTATAGGGTGATTTTATTTTTGCCAGGCGATTTTGAGTGATCGTACTCACGAATTCTCATTTTTCTGCAAGAGTTCAAAGAAAGTTAAACGCAGGCAATGTATGTTACGCGTTTTAAAGGGAAGTGTGGTTTGCGGGTATGTACGATTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTTTTTTTAGTCATTGCGTGGTTAATGAGTAAAACGCCATTATTCATACCGTTAATGCAGGTCACGGTTCGTCTGCCGCATAAATTTCTCTGCTACATCGTCTTTTCCATCTTCTGCATCATGGGCACCTGGTTTGGGTTGCACATTGACGATTCTATTGCCAATACCCGTGCGATAGGCGCGGTAATGGGCGGCTTACTCGGCGGTCCGGTCGTCGGTGGGCTGGTTGGTCTGACCGGCGGCTTACATCGATATTCGATGGGGGGCATGACCGCGCTAAGTTGCATGATCTCAACCATCGTTGAAGGATTGCTCGGCGGCCTGGTACACAGCATCCTGATCCGTCGCGGGCGCACTGATAAAGTCTTTAACCCCATTACCGCCGGTGCCGTCACGTTCGTCGCTGAAATGGTGCAAATGCTGATCATCCTTGCGATCGCCCGACCTTATGAAGATGCGGTGCGTCTGGTGAGTAATATTGCTGCACCAATGATGGTCACCAATACCGTCGGCGCGGCGCTGTTTATGCGTATATTGCTCGATAAACGCGCGATGTTTGAAAAATACACTTCTGCTTTTTCTGCCACTGCGCTGAAAGTGGCAGCCTCGACGGAAGGCATTTTGCGACAGGGGTTTAACGAAGTGAACAGCATGAAAGTGGCTCAGGTGCTGTATCAGGAGCTGGATATTGGTGCAGTCGCGATTACCGATCGAGAGAAATTGCTGGCCTTTACCGGAATTGGTGACGACCACCATTTACCCGGCAAACCGATTTCTTCGACTTACACCTTAAAAGCGATTGAAACCGGTGAAGTGGTCTACGCTGATGGCAACGAAGTACCTTATCGTTGCTCTTTGCATCCGCAATGCAAACTGGGGTCGACGCTGGTAATTCCGTTGCGTGGCGAAAATCAGCGCGTGATGGGCACCATCAAATTGTATGAAGCCAAAAACCGTTTATTCAGTTCAATAAACCGCACGCTGGGCGAGGGGATTGCGCAACTGCTTTCGGCGCAGATTCTTGCCGGGCAATATGAGCGGCAAAAAGCGATGCTCACCCAGTCAGAAATCAAACTGCTTCACGCCCAGGTGAATCCTCATTTTTTGTTTAATGCGCTTAACACCATTAAAGCGGTGATCCGCCGCGACAGCGAACAGGCCAGCCAGCTGGTGCAGTATCTTTCCACTTTTTTCCGCAAAAACTTAAAGCGGCCTTCGGAGTTTGTTACTCTCGCCGACGAAATTGAACATGTGAACGCTTATCTGCAAATTGAAAAGGCGCGCTTCCAGTCGCGGTTGCAGGTCAACATTGCTATTCCGCAAGAATTATCCCAGCAGCAATTGCCCGCGTTTACCCTGCAACCGATAGTGGAAAACGCCATTAAACATGGGACATCACAACTGCTGGATACAGGGCGAGTGGCAATCAGCGCCCGACGTGAGGGGCAACATTTGATGCTGGAGATCGAAGACAATGCCGGTTTGTATCAACCGGTAACCAATGCCAGTGGGCTGGGGATGAATCTGGTGGATAAGCGTTTACGTGAACGGTTTGGCGATGACTATGGAATAAGCGTCGCCTGTGAGCCTGATAGTTACACCCGAATAACGTTACGACTACCATGGAGGGACGAGGCATGATTAAAGTCTTAATTGTCGATGATGAACCGTTAGCACGGGAGAACCTGCGCGTATTTTTGCAGGAGCAGAGCGATATTGAAATCGTTGGAGAGTGTTCAAACGCCGTGGAAGGGATCGGCGCGGTGCATAAACTGCGCCCGGATGTGCTGTTTCTCGATATCCAGATGCCGCGCATCAGTGGTCTGGAAATGGTGGGGATGCTCGACCCGGAACATCGTCCGTATATTGTTTTTCTCACCGCGTTTGACGAATACGCTATTAAAGCCTTTGAAGAACATGCCTTTGATTATCTGCTGAAGCCAATTGATGAAGCGCGACTGGAGAAAACGCTGGCGCGTTTGCGTCAGGAACGCAGCAAGCAGGATGTTTCGCTGTTACCGGAAAATCAACAGGCGCTGAAATTTATCCCTTGTACGGGGCATAGTCGGATTTATTTGCTGCAAATGAAAGATGTGGCATTTGTCAGCAGTCGGATGAGCGGTGTCTACGTTACCAGCCACGAAGGGAAAGAGGGCTTTACCGAATTGACATTACGTACCCTGGAAAGTCGTACACCACTACTGCGCTGCCATCGTCAGTATCTGGTTAACCTCGCGCATTTACAGGAGATTCGTCTGGAAGATAACGGCCAGGCCGAGTTGATTTTGCGTAATGGCTTAACCGTGCCGGTCAGCCGCCGTTATCTGAAAAGCTTAAAAGAGGCGATTGGCCTGTAAAAGACTGCTAAAATGGCTTTTTGCCTCATCAACACCTGAAGGCCTCATGCTAAGTAACGATATTCTGCGCAGCGTGCGCTACATTTTGAAAGCCAATAATAATGACCTGGTGCGTATTCTGGCGCTGGGTAATGTCGAAGCCACCGCGGAACAGATCGCCGTCTGGCTACGTAAAGAAGACGAAGAGGGTTTTCAGCGTTGTCCGGACATTGTTTTGTCGTCATTCCTCAATGGCCTGATTTATGAAAAACGCGGCAAGGATGAGTCTGCTCCGGCACTGGAGCCGGAACGTCGCATTAATAACAACATCGTGCTGAAAAAATTACGCATCGCGTTTTCGCTGAAAACCGATGACATTCTGGCTATCCTCACCGAACAGCAGTTCCGCGTTTCGATGCCGGAAATTACAGCGATGATGCGCGCACCGGATCATAAAAACTTCCGCGAATGCGGCGATCAATTTTTACGTTATTTTCTGCGTGGACTGGCAGCGCGCCAGCATGTGAAGAAAAGCTAAGACGGGTATGGCGGCCATGCGAAACATGGCCGCCGACAGATTATTTCACTTCTTTAAAACCAGCGGCTTTCATCACCAGTTCCATTTGCGCCATAGTGATACCTTTTTTGGCATCTTCAGCAGAAACGTTGATTCCTGAAATACCCTGCAGGGCTTTAAAATCCACTTTTTCCATATCGATAGTCACGTTTTCCTGCGCGTAGGTATCTGTATAGGTTAATTTTTCTTCAACACCCGCGATGTTTTTGTATTTGGCGCTTAACGGCTCAAGTGTCTTGGCAGCGTCTTCTTTGGTGGTTGCACCAATAGAAGCAAATTGAATTTTGGTTTCAGAAGATTGCTTAAGCACCTTGTCACCTTTGTAGACATAGGTAATGGCAATTTCAGTGCCGTTCAGATTGGCGCTGAATTTCTTCGATTCTTCTTTGTCACCGCAGCCAGCAAGAGAGAAAACCAGAACAGATGCAACAACGAGGGAAAACAGCTTATTGAAAGCCTTCATGTAAAACTCCATTTTATTTAATCAAGAAACTGGTGACTCTCACCAGGGGCTATATAGAATATGCCTAATACCGTGACGTGAGCAGTCCGGAACTGGAGTAGAACTCTTAGTAAAAAGCACTATTTCATCCTTGTTGCTGAAGCATGGGGAATAATTGTTCGCAAAGTAAAACACCGTTATTCATTGCTTCTACCCGTGCCTCGCTTTCTGTATTACGAAATTGTCCCAACACATGTGCCAGCCGATAAAAACCGACCGCGGAGAGGTCATTCGCCAGCAACTCTGCCTGACTAATAGCGCTCTGTTCCTGATAGCGCCAGCCGTTATGGAGCAGTTGAATAAGTAACGCCTGGCAGCGCATCAGCAACTGATGAGCGGTAGACGGCACAGGCAAAACGCTGGCAGAAGGTAGCGGTGCCACAGGCGCAGTTTCTGCGTCCAGCGCCCAGGCGCGGGTTTTTGTCATCATCACCCGTGGTTCCAGTGTCAATTGCCCATCAACAAAACTGACAAAGCCAGAAACCAGACTCACGGGGTCGTCTGTTTGTTGCAAAAGCGCCGCCATGCGTTCAACGGCAAAAGGAGAACAGGCAGATGCCGGAAGGGATAACGTCAGGAGATTATCTTCACCTTCGCCGCTGATTACCTGCGCATCCAGCGTCTGGCGGCTGCTATCCCAACCGAGCGAAATACACTCAGCGACCGGCAGAATAAATAAGTTATCGACCTGATTAAGAGGCCGTATGCAGGCGGGGGGACGCTGGCGTAAATATTCCCGCAAAGCCACAATGCCCGGCTGGCGTAACGGCGCGCTCAACATTTGCCAGGCATCAGGCGACAGCGGCACAACGCTGCTTAAGCGGTTGCGGGTAGCTAACAGCAGCTCGCCATCGGCACTGCGTTTTGCTGCTTGTGAAACAATTTGCCCACCCGCCAGTGCGCCAGCCTGAAAACTAAACAGCCGACGCGTAGCTGCCGGTGAGTTTTCCTGTTCACTTCGCGGCCAACTGCGCGAAAGGTGCAAAATACTGCCGGTGTCGGGATCGGTAAACCAGATGCGTAAACCATAATGCTCAATATCCTGCCAGCAACGCATACCTAAAGACACCAGCCGCAGATGATCAAGCTTTGCTTCTCCGGCAATGCCAGAGCCAACGACCGTGCGCCACGGCACAGGAGGAACTTCACCAACACTGTCGCGCCGGGCCATCTCTTGTGCGCAATTTAATCGACTGTTTAATGCCGCAAGCTGACGTAAGCATTCTCCGGCATGATAGTGGCTGGCGCGGACGTGGAAGGCATCAACGCTTGCCCGTAGCTGTCGTAGTGATTCACTCACCCATCGCCAGTTGCAGCGTTCCGCCGCCTGCTGCGCGCGACTGAAAGCGGCCTCGTAGTGAATAAGCGGCTGGCTGATGCCGCCCAGCCATAATGCCTGGCTTAATTGCTGAACATATTGACGACACGCGTTACCCTCGTCGTTGGCAAACGGATCGTCAGATGATGTGACGTGTTCGCTGCGCATCTGCCAGATTAAATGAGTAAATTCCGCTTGCTGAGTTTTGGCCTCGACGAAGGCCTGCACAGCCAGTACGACATGTTCGCAAAGTGTGCCTTCAATACAATCACAACGGGCGAAACGAATACTGCTGCGGGAATAAAAACGCACATCGCTCATCGGTAAGCGGGCAGAGGGAATTTCGCCCGGCGTACAGAACAACTCAATGGTGATGCCTTTACCGACCAACGCCTGTGCGCGTTTGCGGGTGGCATCGGGCAGGGTAGCCAGTTCTTCCAGCCAGATTGCCGGATCCCACGCTTCTTCTTTTTCCGTAGGCTGGGCGGTAGTACAAAGTCGTTGATAACTTAACACAAGCATCACGCGATGACGGCACATACCGCTGGCCCCGCAGCTGCACTGAGCCTCTTTCAGTGCCTGGCCGTTCGCCAGCTGGGTACGGACACCGTCACTGAAGGTGGCGATTAAAGCGCTGTTCTCATGGCTGATTTCCGGGACGTTGCCATTTTCCAGTTCCTTAAGACTGCGCTTAACAAAACCGGCATTGCTTAACGCCGTCAGGGCCTGCGGTGTCAGTTCTAATAATTCCGGACGTAGTGAATTCATGACTGAAGATTCTCCGCAAGCCATGATGCCAGCTCGCCCGGTGTCATGGCGGCTATTTGTGCGCCGACATTTACCAGCGCCTGGGCCGTATCGTGGTCATAGCAAGGTGTTGCTGTGCTATCGAGCGCTGCCAGTCCCAGCACTTTGATGCCGCTCTGGACACACTTTTTCACCTGATGCGTCAGTAATGATGATGAACCCCCTTCGTAAAAATCGCTCACGAGGATAATGACGCTTTTCGCTGGTTGTTCAATAAGTTGCCGACCATACTCCACGGCACTGGCGATATTGGTCCCGCCGCCCAACTGTACTTTCATTAATAACTCTACCGGATCGGCAACGTCTGCCGTGAGATCAACGACGCTTGTGTCAAACGCCACCAGATGGGTACGAATGCCGGGTAACTGCCACAAACAGGCCGCCATCACCGCAGAGTGGATCACCGAGTCCACCATCGATCCGCTTTGATCAACCAGTAAGACCAGTTGCCATTGTTCGCTTTGGCGTTTAATGCGGCTGTTAAAGCGGGGGGATTCGATATACAACTTGCCGTGTTGCGGGTGCCAGTGTTGCAGGTTGGCGCGCAGAGTACTTTTGAAATCAAAGTTTCGCGCCAGTGGAATTAATGAGCGGCGACGGCGATCGCGGACACCAGAAAAAGCCTGACGAACTTCCTTTGCCAGTCGCGCCATAATTTCTTCAACAACCTGGCGCACTATCTGGCGGGCGGCAGCCAGTACTTCGGGATTCATCAGATGTTTGGTGTGCAAAACGGCGCGTAGCAGGCTTTCCGAAGGCTGCATACGTTCCAGCACGTCGAGATTTGTCACCACATCTTCAATGCCGTAGCGCAGTACGGCATCGCTTTCCAGCCGCTCAATCACCTGTTGCGGAAACAGCGTGTGGATACTGTTGATCCACTCAGGAGTGGTGAGATTTGAGCCACCTAATCCACCAGAGCGTTCACCACGCTGGAGCCGTTCAGGATCGCGCCCATACAGCCACTCCAGCGCGTGATCTATCTGCCGGGCGTTGTCATCCAGCCCACAAAGCGTCGTTTCTGCCGCTTCGCCAAGAATTAATCGCCAGCGTTGTAGCTCACGGGTGGTCAGAAGATCGTTCAGTTCAGACAT
Protein sequences of DBSCAN-SWA_4 >CP031653|1796944:1806386|1801389_1802109_+|AXP26278.1|DBSCAN-SWA MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >CP031653|1796944:1806386|1799707_1801393_+|AXP26277.1|DBSCAN-SWA MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILAIARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDEA >CP031653|1796944:1806386|1797875_1798607_+|AXP26274.1|DBSCAN-SWA MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWGAEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRLRKVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQHAK >CP031653|1796944:1806386|1803252_1805253_-|AXP26281.1|DBSCAN-SWA MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENSALIATFSDGVRTQLANGQALKEAQCSCGASGMCRHRVMLVLSYQRLCTTAQPTEKEEAWDPAIWLEELATLPDATRKRAQALVGKGITIELFCTPGEIPSARLPMSDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVEAKTQQAEFTHLIWQMRSEHVTSSDDPFANDEGNACRQYVQQLSQALWLGGISQPLIHYEAAFSRAQQAAERCNWRWVSESLRQLRASVDAFHVRASHYHAGECLRQLAALNSRLNCAQEMARRDSVGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDIEHYGLRIWFTDPDTGSILHLSRSWPRSEQENSPAATRRLFSFQAGALAGGQIVSQAAKRSADGELLLATRNRLSSVVPLSPDAWQMLSAPLRQPGIVALREYLRQRPPACIRPLNQVDNLFILPVAECISLGWDSSRQTLDAQVISGEGEDNLLTLSLPASACSPFAVERMAALLQQTDDPVSLVSGFVSFVDGQLTLEPRVMMTKTRAWALDAETAPVAPLPSASVLPVPSTAHQLLMRCQALLIQLLHNGWRYQEQSAISQAELLANDLSAVGFYRLAHVLGQFRNTESEARVEAMNNGVLLCEQLFPMLQQQG >CP031653|1796944:1806386|1802155_1802626_+|AXP26279.1|DBSCAN-SWA MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQRCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLKTDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAARQHVKKS >CP031653|1796944:1806386|1798587_1798695_-|AXP26275.1|DBSCAN-SWA MRIAKIGVIALFLFMALGGIGGVMLAGYTFILRAG >CP031653|1796944:1806386|1805249_1806386_-|AXP26282.1|DBSCAN-SWA MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGGSNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSLIPLARNFDFKSTLRANLQHWHPQHGKLYIESPRFNSRIKRQSEQWQLVLLVDQSGSMVDSVIHSAVMAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDHDTAQALVNVGAQIAAMTPGELASWLAENLQS >CP031653|1796944:1806386|1796944_1797871_+|AXP26273.1|DBSCAN-SWA MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGVIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMALLGLESNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLARPANDFVRQFFGRSELGVRLLSLRSVADYVRREERAEGEALAEEMTLRDALSLFVARGCEVLPVVNTQGQPCGTLHFQDLLEEA >CP031653|1796944:1806386|1798754_1799486_-|AXP26276.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNGWRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKKQGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFEGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFPLGI >CP031653|1796944:1806386|1802666_1803128_-|AXP26280.1|DBSCAN-SWA MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKLTYTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK |
10 | Enterobacteria_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2387623 : 2416226
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP031653|2387623:2416226|DBSCAN-SWA GTTATATTTTTTCGATCTCGACCAGATTAGTGTGCTGCGGGTTTCCCTTCGCCAGTGGTGAAGGGCGCAGAGTGGTTAGCGTATTCACACAGCCGCCATGGTCGATTTTATCGCCAGACATATTGGCCTCGTGCCAGGCTCCCTGGCCCATAGCGCTAACCCCAGGGAGAATGCGTGGTGTCACTTTGGCTGGTAGCCGAACTTCGCCACGATGGTTAAACACCCGCACCATATCGCCGTTGGCAATCCCACGTTTCTGCGCATCTATAGGGTTGATCCACACCTCCTGACGGCAGGCAGCCTTCAGGAGATCAATATTGCCGTAGGTCGAGTGAGTACGGGATTTGTAATGGAAACCAAACAGTTGAAGTGGGAAGGTTCTACGTTCAGGGGAGTCCCAGCCTTCAAAGGTAGAGGCATAAACTGGCAGTGGGCTTATCACTTCATCTTTTTCCAGTTCCCAGGTACGGGCAATTTCCGCCAGCTTGCTGGAATAAATTTCAATCTTACCGGAAGGTGTTTTAAGTGGATTTGCCTCGGGGTCGTCACGAAATGCTTTGTAGGCGACAAAATGACCATTGGGATCTTTACGCTTATAGATACCCATTTTTTTCAGTTCGTCGTAAGACGGTAACGCCGGATCTTTGGCAAGCATTTTGGCGTACAGATGTTGTAACCATTGTTCCTGCGTGCGACCTTCAGTGAACTTTTGATAGACGTCAGGTCCAAGACGTTTCGCGACTTCACTCAGGATCCAGTAAATCGGTTTACGTTCGAATTTTTCGCTGGTGACTGGCTGGAGGAAAATGAGATATCCCATGTTACCGGCGTAGTCGTTAGGAATAATATCTTCCTGCTCAACGGTCATCAGGTCTGGCAGCAGAATGTCGGCATATTTTGCCGATGAGGTCATAAAGTTTTCGATGACCACAATCATTTCGCATTTCGATTCGTCTTGCAGAATTTCATGCGTTTTGTTGATGTCAGAATGCTGATTAACGAGGGTATTTCCCGCGTAGTTCCAGATGAACTTAATGGGCACATCCAGTTTATCTTTGCCGCGGACGCCGTCGCGGATTGCCGTCATTTGCGGACCATGATCGATAGCATCTGTCCAGCTGAAGCAGGAGATTGACGTTTTGACCGGATTATCCAGCACCGGCAGGCGTTCTATGGTAATGGTATAGGTCGATTCACGCGCGCCACTATTTCCGCCGCTGATGCCGACATTGCCCGTCAAAATAGGTAACATAGCAATAGCGCGTGCAGTCAGTTCGCCGTTTGCCTGGCGTTGCGGCCCCCAGCCCTGGCAGATATAAGCGGGTTTTGCTGTGCCAATTTCACGCGCCAGTTTGATGATACGGTCTACCGGGATACCGGTAATTTGCGAAGCCCACTGCGGCGTTTTCGCTGTGTTATCATCACCTTCACCAAGAATATAGGCTTTATAGTGACCATTTTTGGGTGCATCTGCGGGTAAGGTTTTTTCGTCATAGCCGACGCAGTATTTATCGAGAAAAGGTTGATCAACGAGATTTTCGTTAATCAATACCCAGGCAATACCCGCAACCAGCGCGGCATCGGTGCCCGGGCGAATAGGGAGCCATTCATCTTCACGACCAGCAGCCGTATCGGTATATCGCGGATCGATAACAATCATTTTGGCGTTCGATTTCTCGCGCGCTTTTTCAAGAAGATAAGTGATGCCACCGCCGCTCATGCGGGTTTCTGCCGGGTTGTTACCAAACATCACGACCAGCTTGCTGTTTTCAATATCCGTGGTGCTGTTGCCATCATTACTGCCGTAGGTGTAGGGCATGGCACAGGAAATTTGCGCGGTGCTGTAGGAGCCATACTGATTGAGTGAACCGCCGTAGCAGTTCATCAGGCGTTTGACCGCCGAGGCTGATGGCGAAGAGCGGGTCATATTGCCGCCAACAATCCCCGAAGAGTACTGAATATATACAGCCTCATTGCCATATTGTTCGACGGTTTTTTTCAGGCTACTGGCGATAGTATCCAGGGCTTCATCCCAGCTAATCCGTTCGAATTTGCCTTCACCGCGTGTACCCACGCGTTTCATTGGGTAATTCAAGCGATCGGGATGATTAATACGCCGGCGGATGGAGCGACCGCGCAAACAGGCGCGTACCTGATGGTTGCCGTACTCATCGCTGCCGGTATTGTCAGTTTCCACCCAGGTCACTTCATTATCTTTAACATGTAGACGAAGTGCACAGCGGCTACCACAGTTGACGGAACAGGCACCCCAGATCACTTTTTCGCTGGCCTGTTGTACCGTTGCCGCTGCACTGCGCAGGGTGAACGGTAAAGAAAAACCGCCTGCAGCCAGCGCCAGAGAACCTATCGCGGTTGATTTAACGAGTGTTCTGCGGCTGATGCCCACCATTCGGTCATTTTTGGACATAACTCACTCCCTGTTCTTTATCGTTATATAAATGTTTATATATTGAATATTTAGCGCGCTAACAATAGAGGGAGTCTACCCATTTTGGGTTAAGAATTATTAATCCATATCAATAGAAGGGTATGAGTAATAAGGTGGGATTATGTTGTATGTTCAAATCGCCGGATGTGTCGTATCCGGCGTTCAGTCGATAATGTATTACTGCGGTTCGGCAGGCGCGCCATCCTGGGTAGACTGCGCGGGAGCAGAGACGTTACCGCTGGTGGTGCGGGTATAGAGAATTTTATGCGTATCATTAGCGCAATGGCCGACGACCTGGGAATCAGGCTGATCAACCTGGTCATTGGGTACAATACTTAACGTGAAGCTGCTTTCGGGTACGCCATTATTGATAATGCGCTGTGATATATCGCTCTGTATGCGCTCACAGGATCCCGGCGCGGCGAGTACCACGGGTGAGGCGAGGGCGAGCAGAAGCGCGGCACAGCAGGTTGAGAGTTTCATCATAAGCTCCTTACGCGAAGATAACTTCTTTAAGCATAGCATTTAACGTGTAAAGTACTGTATTTGCTACTATGATTGAGAATCATCTCTACTCTCTGGTGACTGTTGTGAAATACAAATTACTACCATGCTTACTCGCGATATTCCTCACAGGATGTGACCGCACAGAGGTAACACTTTCATTTACCCCTGAGATGGCCAGTTTCTCTAATGAATTCGATTTTGATCCGCTGCGTGGTCCGGTAAAAGATTTCACTCAGACATTAATGGATGAGCAAGGTGAAGTGACGAAACGTGTTTCTGGGACTTTGTCGGAAGAAGGCTGTTTTGATTCACTCGAATTACTGGATCTGGAAAATAATACCGTGGTCGCTCTGGTACTGGACGCCAATTATTACCGTGATGCCGAGACGCTGGAGAAGAGAGTACGTTTACAGGGAAAATGCCAGCTAGCAGAATTACCTTCTGCCGGGGTGAGTTGGGAAACCGATGATAATGGCTTCGTGATTAAAGCCAGCAGCAAACAAATGCAGATGGAATATCGCTATGATGATCAGGGTTATCCGCTGGGTAAAACCACGAAAAGTAACGACAAAACATTATCTGTCAGCGCCACGCCATCAACGGATCCGATCAAAAAATTAGATTACACAGCGGTTACTTTACTGAATAATCAACGGGTTGGTAATGTAAAACAGAGCTGTGAATATGACAGTCACGCTAATCCGGTGGACTGTCAGCTAATCATTGTTGATGAAGGAGTAAAACCCGCCGTCGAACGGGTTTACACCATCAAAAATACGATCGATTATTATTAATGCTATTGTGCGGTCGGCTTCAGGAGAGTCTGACCCGGTGTTTTGTGCTCTGCCAGATACTGATGCTGGAATATACACATGCGAATGGCATTACGATATTGACCATTAATAAAGAACTCGTGCATCAATTCACCTTCAACCGAAAAGCCAAGCTTGCGGTAAATGTGAATCGCTTTTTCATTCTCTTTATCAACGATCAGATACAGCTTATAGAGATTGAGAACGGTAAAGCCATAGTCCATTGCTAATTTGGCGGCACGGGTTGCCAGACCTTTCCCCTGATACTCCGGGGAGATAATTATCTGAAATTCTGCGCGGCGATGAACATGGTTAATTTCCACCAGCTCCACCAGACCGGCTTTTTCGCCGTCACATTCCACCACAAAGCGCCGTTCGCTCTGATCGTGAATATGCTTATCATACAGATCAGAGAGTTCAACAAAGGCTTCGTAGGGTTCCTCAAACCAGTAACGCATCACACTGGCGTTGTTGTCGAGTTGATGTACATAGCGTAAATCTTCACGCTCCAGCGGGCGTAGCTTAACACTGTGGGCGCTTGGCATAACGTGTCCTTACATTCCTTAAATCAATAACAGGTTAGGGGGTAATAACGCGGCCAGTTCGACGGTCCAGGCAGCGCAAAGTATTGGGCTCCCAGTAGGCATTGATGTTGGCGCTTTGCTCACATTTATCGCGGTTATCAAAAGCGGCGTCGGCTTTATCCCACTCTTTTTCAGTGCGTTTATTCACTTTCTGGCGCAGATTGCGCGTGTCATTCCATTGCTCTTTTTCCATAGCGGCGTGCTGGCGGCTTTGTGCACTGTCGCCAGACTCAATCACCAGTTTGTTAGTTTCGGCATGAACAGTTGTGCTCAATGCCAGTGCGCAAGGCAGCAGAATAGCGAGCAGGCCGATTCGTTTGCTGAGAGTGATTTTCATAATTCATTCCCTGTATGAATGATTAAAGGTGATTCTACACCATCCACTGCGGACGCAAAACGTACCAGGAGGGTGTTTATATTGATGATATTATGTCGCCCTATAACTATACATGATGTCAATAAGAGACAAAGATGATTAAAACAACGTTACTATTTTTTGCTACTGCGCTGTGTGAAATTATTGGATGCTTTCTGCCCTGGTTGTGGTTAAAACGAAACGCCAGTATCTGGCTGTTGCTTCCGGCGGGGATTTCACTGGCGCTGTTTGTCTGGTTGTTAACGTTGCATCCAGCGGCGAGTGGGCGTGTTTACGCGGCTTATGGTGGCGTTTATGTCTGCACGGCGTTGATGTGGCTGCGCGTTGTGGATGGCGTGAAACTGACTCTTTATGACTGGACGGGTGCGTTGATTGCGCTTTGCGGCATGTTGATCATTGTTGCGGGCTGGGGGCGCACGTAGGAACATAAATCCATTTTATCAATAAGATAAGAGGAAGTGTCAGCTGACAAAAGGTATTCTATTTCATCTTTTGTCAACCATTCACAGCGCAAATATACGCCTTTTTTTGTGATCACTCCGGCTTTTTTCGATCTTTATACTTGTATGGTAGTAGCTCAGTTGCGTAGATTTCATGCATCACGACAAGCGATGCAAGGAATCGAACATGAAGATCGTAAAGGCTGAAGTTTTTGTTACCTGTCCGGGGCGTAATTTCGTCACATTAAAAATCACCACTGAGGACGGTATTACGGGCCTTGGGGATGCCACCCTCAATGGACGTGAGCTTTCCGTGGCCTCTTATTTGCAGGATCACCTTTGTCCGCAGCTTATTGGTCGCGATGCGCACCGTATCGAAGATATCTGGCAGTTTTTCTATAAAGGTGCTTACTGGCGTCGCGGTCCGGTTACGATGTCGGCCATTTCAGCGGTTGATATGGCGCTGTGGGATATTAAAGCCAAAGCTGCCAACATGCCGCTTTACCAGTTACTCGGCGGCGCGTCTCGTGAAGGGGTGATGGTTTATTGCCATACCACCGGTCACAGTATTGATGAAGCTCTGGATGATTATGCCCGTCATCAGGAGCTGGGATTCAAAGCCATCCGCGTGCAGTGCGGAATCCCTGGTATGAAAACCACCTACGGCATGTCGAAAGGTAAAGGTCTGGCTTATGAACCCGCAACCAAAGGACAGTGGCCGGAAGAGCAGCTGTGGTCGACGGAGAAATACCTCGATTTCATGCCGAAATTGTTTGACGCGGTACGTAACAAGTTTGGTTTTAATGAACATTTGCTGCATGACATGCACCATCGCTTAACGCCTATTGAAGCGGCGCGCTTTGGTAAAAGCATTGAAGATTATCGCATGTTCTGGATGGAAGACCCGACGCCTGCAGAAAACCAGGAATGCTTCCGTCTCATTCGCCAACATACCGTCACACCCATCGCAGTGGGTGAAGTCTTCAACAGCATCTGGGACTGCAAACAACTGATTGAAGAGCAACTCATCGATTATATCCGCACCACGCTGACCCATGCAGGCGGAATTACCGGTATGCGCCGGATTGCCGATTTTGCTTCGCTGTATCAGGTACGTACTGGCTCACACGGTCCTTCCGATTTGTCACCAGTCTGCATGGCTGCGGCGCTGCACTTTGATCTGTGGGTCCCCAATTTCGGTGTCCAGGAATACATGGGTTATTCCGAACAAATGCTCGAAGTCTTCCCGCACAACTGGACTTTCGATAACGGCTATATGCATCCGGGAGACAAACCGGGTCTTGGCATCGAATTCGATGAAAAGCTGGCGGCGAAATATCCCTATGAACCTGCTTATCTACCAGTCGCACGTCTGGAAGATGGCACGCTGTGGAACTGGTAAGGAGTAAGATAATGAAAAGCATATTAATTGAAAAACCGAATCAACTGGCGATTGTCGAACGTGAAATACCCACCCCGTCAGCGGGTGAAGTACGAGTAAAAGTGAAACTTGCCGGAATTTGTGGTTCAGATAGCCATATTTATCGTGGGCATAATCCTTTTGCGAAATATCCGCGCGTCATTGGACATGAATTCTTTGGCGTCATTGATGCGGTGGGTGACGGCGTGGAAAGCGCCAGAGTCGGTGAACGTGTTGCTGTCGATCCGGTGGTCAGCTGTGGGCATTGCTATCCGTGCTCTATAGGTAAACCGAACGTTTGTACGACACTGGCTGTTTTAGGTGTGCACGCTGACGGTGGTTTCAGTGAATATGCCGTGGTTCCGGCAAAAAATGCGTGGAAAATTCCTGAAGCAGTGGCCGATCAATATGCGGTAATGATCGAACCTTTTACCATTGCGGCTAACGTAACCGGACATGGTCAACCGACTGAAAATGATACCGTTCTGGTTTATGGTGCCGGTCCAATCGGCCTGACGATCGTTCAGGTATTAAAAGGCGTCTATAACGTTAAAAATGTGATTGTTGCCGATCGCATTGATGAACGACTGGAAAAAGCGAAAGAGAGCGGGGCAGACTGGGCGATCAATAACAGCCAGACACCGCTTGGCGAGATTTTCGCTGAAAAAGGCATCAAGCCGACATTAATTATCGATGCGGCTTGTCATCCTTCTATCCTGAAAGAGGCCGTAACGCTGGCTTCTCCAGCGGCACGTATTGTATTGATGGGCTTCTCCAGTGAACCGTCTGAAGTGATTCAGCAAGGAATTACCGGAAAAGAACTCTCTATTTTCTCTTCACGCTTAAATGCAAATAAATTTCCGGTTGTTATCGACTGGTTAAGTAAAGGGTTAATTAAACCAGAAAAATTAATTACCCATACGTTTGATTTCCAGCATGTTGCTGATGCCATTAGTTTATTTGAACAGGATCAAAAACATTGCTGCAAAGTCTTACTCACTTTTTCTGAATAATACCAATAACGGCGAGTAAGTAGTACGCATCTTACCTCTTTTTTAGAGATAACCATTATGACAATAGAAAAACATGAAAGAAGCACTAAGGATTTGGTGAAAGCAGCAGTATCGGGATGGCTGGGCACTGCGCTTGAATTTATGGATTTCAAGAGTCATGCGTGTTAACTATTTGATAAATATTAAATTAATTTTTCATTGCTTCGTTATGGGGCATGGTTGGGGCAAACTCGCTTAACTGTGTATTTAACAAAGCTACCTGTGCATTATTGTTTTCAGACATCCATTTTCCGTATACCTGAAATACCATTTGCGCATCTGCATGGCCCATCTGGTTTGCTATAAATGCCGGGTTAGCACCAGCTGTCAGCGACCAGCAGGCATAAGTATGTCTCGACTGATATGATTTTCGATGGCGGAGTCCGGCACGTTTTATCGCTGCGTCCCACATCTGCCTTATTGAGTCAACGGTAAAATGGTCACCATAATTTTTTACTCTCGCTGACACTTCAGGTTGAAAAACAAAGGTGCATTTTTGTTTTTCTGTTCTGCCATACTCTCTGAGGTGAACATCAATGATATGCTCTTTGCTCAGTCTCGTTAATGTCATCTGACTCCGGAGAGCGTCGATTGCTGGCTTAATAAGATGAATGACCCGATTGGTTCCCGCCTGTGTTTTTGGTACCGTGAAACGGTCTTTTGCTAAATTTCTCCTGATCATCATTGTTCCATTTTTCAGATCTATGTCCTCCCATCCAAGTGCACACAGCTCACCAGGGCGAACGCCAGTATAAACAGAAACACACCATAAATTTTTTGCTTGCTGATTTCTGCACGCATCGATAAGACGGATAAATTCTTCCCGCGAAAGAGGATCCGGAATGGTTCTTGATTCCTTTAATGGCGAGATCCCCTTAAACGGATTATCTGCCAGGTAACCGTTATCAACACCAAACTGGAACACGGCGTTAAGATTTGTCATGTAATTATTTACAGTTACAGCCGATCTCCCTGGTTGTGTAACAATATAGTTACTTTTGGGGATCTGGTATCCAGTCAGTAACTCTTTACGAACCTCCAGTAATTTTTCTTTATTAATCGATGAGGCAAGATTTTTTTCACCGATTATGCTCAGGATATTTTTGATGACGGCACGGTATGTGTTGAGTGATGTTTTGGCGACTTCAGTTTCTTTCAGTGCCAGAAATTTTTCAGCCAGTTCTTTTATGGTTAAATCTTGTCGGGCCTCACCAAATTTTTCCAGATTGCGTGAGGAGGGAAACTGTTTTGCATAGTCGAAAACACCAGTTTTTATTGCGTAACAAACAGAGGAGCGTAGTTCACCTGCAACGCGCCTGTTTTTTGCTGTGTCAGGAACCCCCAGGTTTTCCCTGACTCTTACGCCTTTATAAACAAACCAGATACGTAATTTCCCTCCATGGTTTTCCACGCCTGTCGGATATTTCATTTCAACTTCTCTCATTAGTTAGTGTGGCTTTTAGTCAAGTAAGATGACGTCTTGGTCTCGCTGATGCCTGGCGCTCAATCCAGCGATCAATTTCTTCCAGGTTGTAAAAGCATGGACTGTTATCCCATGGCATACCGTCATGAGCGACATGCTTATATTCCCTTCCTTCCATAAACGATTTTTCCCGGGCCTTTTTTAACGTACCTTTTTTTATTCCTTTCAGCGCAATTAACTGCTCTTCGGATACCCATTTGCCGGGAGAGACAATCATGATTACTTCGCTCATCGATTTCTTTATCTCTTACATTAGACGAGCGCCGGTTGCAGAATACCAGTCACAACCGGCGACAGTTGAACATTAAGAATCAGCCTGACTCGGGATCAGTTTTTGCCAGATAACTGAAACGTATTTTGCCTGGTAACGGGCGTCATCAAGTGCATTATGGCGCTCACCTTCGAATTGAATAGCCGTTCTGGCATCGAAGTCTATGACTTTCCCCAGCTCAACGATTGTGCGTACATCGCGATCGTTGTAGTAACGCCACGGGCAGGGGATCCCCTGCCGTTCGTATGAACGGCGCAAAATCGTGTTGTCGAAGTTGGCTCCATTCCCCCAAACCTGAACAAAAAATTCACCGGAGTTTTCGTCGATAAATTCCCGCAATTGTAACAGTGCATCATCTAACGGGATTTCATCGGTCATAATGGCAGATTGCGCTTCGCGTGATTGCTTAAGCCACCATTTAATGGTGTCCCGATCAATGACTCCGCCAGCAGTTTCCAGATCGATAGTCTTACTAAATTCCGGTCCCATATCTCCGGTTTGCGGATCGAAAAATATTGCACCTATTGAGGTGATCGGGGCATCAGGATTTTTTCCCATGGTTTCAAGGTCGATCATTAGATGGTCACACGTCCTGCTGGTGGATGTGATTTCGTGATGACCGTTCACCTTAATTGGGTGATCTGCCGTCTCGCCAGTTTCATTATCGCTATTGTGATGCTGATTGCCGCCAGTGTTCTCCTTGTGTGGATGTTCAGCGCCTTCCATTTCCTCCGGATCATCTTCCTGAACTTCAACCTGATACTCTTCATCGAATGTTTCTTGGTATGTTGCGTCGCCCATCACCGCGCCACAATCAGGGCAGTTGCCGCCGCCGGTCTGACCGCAGGCGGTGCAGACTTTTTCCACTTCCTGTTGCGCTACTGGTTCAGGCTGTTTCGTTTCTGGCTCGTTTTGTAACGCATTTGGACTGTTTTGTTCCGCTTTTTGGTAGTTCCGTTCCGATTCATGCTGGTTCTGGTTCACAGAATCGCGGGTCTGGATCCCCTTAACCCATTTCGGATCATTCGGGTCACTAATCCCTTCAACAAATTCACCACGTGATGCAGCAAGCAACTTATCAGCGTCAGGCTGGCTGATATTGGCTGCCTGCATAATTTTGTTTACTTCGTCAGCGGTAACTTTTATCGGCTCTGGTTGTTCTGAATCTTCAGCGGTATCTACATTTTGCGGTAAGCCCGTGTATGTGCCATTTTTTCGGGCAAAATATTCTTCTTTTGTGATTTCAGTGGCGCCAGCAGCCAGTGCCTTATCCAGACCAGAAAGTTTGTTTGCGCGACCGTATTTTTCTCCGTCCTTATCTGCGAAGAGGAAATAGAACGGCCCCTCACGCTCTACAGATGGTTCAGCTTCCGGCGCGGTTTCATTTTTTGGGATATCAGATACCTCAGTTTCCACTGCATCAGTTTGTGTTTCTGATGACTGGAGAACATCAACAGTGCCCAGGTCTGTTTCTTCATTCTCAAACACGCCCTTTGTCGTCAGGTATTCGCAGATATATTTGTTCAGTGCTACGGGATCTTTGTGAATGTCGATCGGACGCTCACGGACAAGGCCAAAAATAGTTTGGCGGTCGTAGCGAAGGGCATCAGGCTGTTTGCGCATTGATGCCGAGATACGCTTCCAGTCTTCGCGGTCGTTGTCGATAACTTCTTTTTTTGCCCAGCGATGGATGCTGCCGTCAATGTTTCCGGCATCCACATCACCAGGCCAGAGAGCGTAGGCCAGTTCGTCATCCAGTGTTTTCCATGTCTGCTTGTATTCGCGATGAATGGCAGCAATGACCGGGCTGATTTTTCCTGTTGAATTTTCAGTGTACTGTTGATTGGCTCTGGCGCGTGCGAGATCAACAACAGACGTGTATTTTCCGGTTTCCTTGCGTTCACCTTCGCGACGTTTTTTCCAGATGCGCATCTCTGCCTGAATTTCGGGCCATTTAGCACCAGGAATACATTTATGCTTAACCCACCCAATGGCGTGCAACTTAAGCTCCGGATACATGGCGTTAACTTCTGGCATTTTCATCAACGCTTCAACGATATGTCCGTCGAATGTTGCCATGTCTTCCTGCAACAATTCCTGTGCGCTAATAACCATATCAACGGTGATGTTTTCACATGTGTCGAACTTAACCATGACAGCGTTCTGTACTTCAGGGGCCAGCTTGTCAAAAGTGACGTTCATCGGATCGGATTCAGTCTCAACCGGGACAAAAGAAGCAGACTCCTCATCCCAGCGGTTTTCCTGCATATATTCAGCATCCCATGAATCGAGGGCAGGGCGGGGTATGCCAGGTTTATCCTCGCAGACAATAAATTTATAAGCGCAGTCCTGAGCAGCCGGATAATGTTCCAGGAATTGCCAGTGAAATTTTGCTCGAGCACGGCGTTCGTCGCCAGCTTCAATGGCTGTGGCTACAGCCACAGCGCCTTCTTCCCTTGTTGCCAGTTCGTCAGGAATAGCGGCGCAAATAAAGACTTTACTCATTTTGTTTTAACCTCATGACAGATTTAAGGATGAACAAATCCCTGCCATTGCTGGCATATAAGAATGAAACCGGATATTTATTACGGAACTGTTTTAAAGACCTGCCGGGATTTCGATATTATCCTGGTTAATAACTTTATCGACCGGGTAACAGTTACCGGGAATTTTCTGTTCGGTTGCTGCAGTCATACACTCCTGCATTGTCCTGTGAACACTGACTGCAATATCAACTGGCACTCCGGAAACAAGAAAAACTGTCAGAACAAGCGCAAATGCTGAATTCATTGTGCACATCCTTTTGGCATCAGACGTAAACGAGCCAGCATTGAAACAATGCATATTTTATTTAATAGCTCCCGTTCTTGTTTTCTCTTGTTAATGGCATCTTCAGTAAATACAGGGTTACTGATAGTGACACCAATTTCAAAACAACCTTCAGACGTATTAACGTTTGGTAATAACGTTTTCATTATCGCGTCCTCAACAATGAATTTTGTGATGCAGTGCCTGGTGCCTCCAGGTGACGTTAACCAGTTAACAATTAACGTCGGATATCCGGATTAGTGATTTCAGGTTGTATCGTGAGATCAGTGATGGAAAAAGTATTACGTACATGATCGCCGGGTTAAATAAAGAATATGGCGATGTGGTGGAATCCGGACTGCTTTTTGCAGATCCTGCCGTTGTAGATCGTGAAACTGACGAACTTATAGAAAAAGCAATTGCTTTCAAGCTTGCGTATCGACAGCAATACCAACAAAAAGCTGGATGGAATTATGAGTCTTCTTTTTGCTGAACGCCCACTGGTTATAAACACGCAGCTGGCAATGAAAATTGGCTTAAACGAAGCCATTGTTTTGCAACAACTGCACTACTGGTTGAGAGATACCAACTCCGGCATGGAATGTGATGGTGTTCGCTGGATTTATAACACAACGGAACAATGGCTGGAACAGTTCCCATTCTGGTCAGAGTCAACGTTAAAGCGCGCGTTTGCAAGTCTGAAAACGCTGGGGCTTTTGCGTTGTGAAAAGCTCAATAAATCAAAGCGCGATATGACCAATTTCTACACGATCAACTACGGGAGCGAGCTTTTAGATGGTGGCAAATTGAGCGAATCCATCGGTTTAAAATGCGCCGCTCCATCAGGTCAAAATGACACGATGGAAGAGGTCAAAATGAAACGCTCCATTGGTTCAAAACGACTCAATGTCATCGGGTCAAAATGGCCTGATGATCTTACAGAGAATACAACAGAGATTACTACAGAGAATAAAAAGACTTCTCGTCCGGAAGCTTCGCAACCGGACCCGCAGACGGTTGAACAGGATTTTTTAACCCGACACCCTGACGCGGTTGTGTTCAGTGCGAAAAAACGCCAGTGGGGCAGCCAGGAAGATTTGGCGTGTGCGCAGTGGATCTGGGGGCGAATCGTGAGTCTTTACGAGCAGGCCGCCAGCGATGATGGCGAGATTTCGCGACCGAAAGAACCCAACTGGACCGCATGGGCCAACGACGTGCGCACAATGCGGATGCTGGATGGCAGAACTCACAGACAAATTTGTGAAATGTTTGGTCGGGTGCAGCGGGATCCATTCTGGGTAAAAAATATCATGAGTCCGTCAAAGCTTCGCGAAAAATGGGATGAACTGGTTATCCGCCTGGGGCGTTCGTCTGTACAGCGTTGTGTGAATCATATTTCTGAGCCGGATACCGAAATTCCGCCGGGCTTCAGGGGGTAAGTGTTAATTTCTGGTCATGAGGTAATTTTCAGGAGGGCTTGTGGCAAAAGTTTTTACACAAGAAGAGCGGGAAAAAATTAAAGGGCAGGTTGTTGAACTCGTACGCCGGAGTGGGCGTGAGACGTTACGGCAACTGGAAGTCAAGACAGGTGCGACAAGATATCTGATGAGCGTTCTCGCAAGAGAGCTGGTTGCCAGCGGCGATGTATACAACTCTGGTTACGGGTTATTCCCGTCTGAACAGGCGCGTAAGGACTGGCAAAATGCCCGTAAAAAGCTCTCAAGGGCAAAGCTGAAGGAACCATCTGCGGTTGATCCGGACCTTATCTGGTCATTACCTGACGGAGAAATACGTCGTTACGACAGGCGTCATAATATGATTTGTACTGAGTGTCGTAAAAGCGAAGTTATGCAGCGCATATTGTCGTTTTATCAGGGGGATGTCCGGTATTTATTGAAGTGACGAGATTAAAGTGCATTAGTTCAGATGCAAATTGACATTTTGTGGCACAGGGTAGAGCTAGCGTGGTTGTCCGCTTTGTGCCAAGAGCGGACTTTGCAAAATGGGGGTTATTTCAATCAAAACGTAACGTCACAACCAGCCGACGCTCTCTCGCCATTTATAATTAGTAACTTTATCATTTTCGCTTATTTTTTTAGATATAGAGCGCGGCTCTCTTCCTAGATACTCAGATATTTCTATCGGGGACAAATCAAAATCAACAAGCATGACTCTAAGTTTTTCCATTTCCTTTAAAGTCCAAGGCTTGCCATAATTTTCATAAAGAGATACCTTATGCTCCCTGATAGTTCGCTTTCTCTGAGTTTCACTTTCCCTCTGAAAAATCTCGCTTTTAAATTGTGAACGGAACTCTTTACAGAAGTTGTTATCAGTTTTTTTGTTATATAGATTGCTAAAAATAAGCTGGGATGCCGGGTCTAAATCAGGTGTGTTAATAATGAAAACTTTGACCTTTTCCATATAGGGATATTCAATTTTACCCAATATGCTTAATTGCTTTATATTAAAAAAACCTCTCAAATCAAAAGATTTAATGAGCTTTGATTGGATTACACTTTCAATCCTGCTTGCATTTAAATAACAGTACTTTGCCATTGGAAGGGCCCATACTACCAGTTGAGGAATATATTTTTCTAATGCAATGCTCTGCCAATCAGTATCAAAGGTTTGGCTTTTTGCTAGCATGTTTTTCGGCAAATCAGAATACATTGTGGTAGATGCCCATATCTTATAATCATTCGCCAAGGCTTGATAGTATTTTGTGTGGTTATGAATTTTATAAGCTGACATGAAGCGATAAACGTCTTCGTCGTGACCAGCGTCGTAAATTGTTCGATTTCCTCTAAGATAACCGTCGTAATGTTCGGTTATCCTTCTACCTACATTACAACTTACCCCAACGTAAACCACACGACTGAAAAGTCCTTTATGGACAATAAGATAAACTCCGCTACAGCCAGATTTCCTGGCCTCTGATAGAGAACCTAAAAATCTCCATTCCATAATTAAATCCATAATTATTGCTACTGTTTTTGTTTATCATTATTTTCGTGAAACTTCAACAATTTTATCCAAAAGCTAAGGGCAAAGACTTATATAATTATACTTGTCATCGTTAGCGATTATATAGAGTAGTGGCGCTGACCTGCTCCCTGGTGATTCACACAGAATGCTGTTAGTAATGTCCGTTCCTCGCTCTCAGCGGACCTTCAGCTCAGTGATATCGTCCGCTCTGTGCAAAGAGCGGACGTTGGTATGCAAGAGCCCTCCAAAAGTTGATGGTTGGTTTGCAGGGGGGCTTAAAGAAACTGCACTTATCAAGTTGAAGTTCTGTATTCAGCGAAATCGTAGCACTCTGACGATAAGTAACTCCGGTACTCCGCTCTTCGATGAAGAACCAGAGTAATCCCCCCGAAAAACCAGCGCATCAAAATTGGATCTTCAGCGGTAGCTTATCGGCTATCGGAAGTACAGGTGTCGATTCGTGGTGAATTGCTTTGATAATAAACGATTAATACGGAAAAACGCATTAATCATTTATTAGCTTTTAGTAAACCACAATTTATTCCGTTTTACATATCATAGTAGTCGATTGGAGAATATAGTTTCTGGGAATGTACTCTTCAAAGTGTTCGTCCTTTTTAAATACATGAACTACATTTGGGAATAATTGATAGTCAACAGGGTGTATAGCGTTTGGATTATGGTACATGTACATGGCTGTACACCATGGTTCTTGATAGTTAGGGTCACTTACATCGGCTGAAAATGGATGTGGGGCTGCATCCTGATCAGTTTTAACACCACTGACGTACACTTTGAATCCACTCGCCTCTACACCTGCAAGAATTCCCATCCGGTTAAACTTAGGTATGGTTGCTTGAGTAGTGAGTAAAACGGCAGAAACATAATTATTTTGTTCTGAGCCAAAAAAGTTCGACTTGATACTTCTATTTTCATCTGTATGTCTTTCAATAGAAATGCCTGACTCAATATCAATCCCGTACAAATAGCTATGCAAGGCTTCGCTTGAGAAGGCCATGGACATTCTTTTTGAATAATCCTGCATTGCTATGACAAATGGTTTGTTCTTTGTATGGTTGAGTTCCCAGTAATGAACTTTCTCTGGCTCAGGGCAATGCCGGACTTTTTTTAATAAACTTCTTGCAAACTTAAAAGGCATGACATTTAGAACATGTTTTCTTAATTCATCCATCTGTTCATCGTTAATGACTTTTCTTTCAAGAGGGGCTTCTGCTTCAGCAATGCTTACAGCCTCTACAGCAATTTCCACTCCAAATTTAGATAGCAGAAAATCTGGTTGATTGTATTCTCTATTCATTTCAAAGTCGAGTTCATAAAATACAGCGTTCAAATATAATTCAAATAACCTTGAATTAAATGCATCACTTTGAAAATCCCTTATAAATATTCCATCAGGATCTTTGAACCAGTATGCAAGTTCCTCAAGAACAATATATGCAGGGAAATGAAGAGGGTCTTCGAGGAGCATTTTTATATAAACATTCCTTTTTTTCGCTGGGACCTTACTCAAGAATAATGAAAAAGGTTTGGTTGATTCATCGCCTTGCATGAATGTACCATTTTGGTGCTGCGCCAGCATCTTTGGTATGTCATCGTTCAAATTATTAAGCAAGACATCCATTGAATCAAATGAAGCCAAGACGTTTATTGCTCTGAATTTTTTATCTAAATCCCGACCTAAGACTATTGCGTTAAAATCTTTATCAATATTGCATATGATTATTGTGGATAACAATGTTATCCCATTCCCCTCATATTTAAACCAGCGTATCTCCTCAGAAAATGTCTTAAGGTAAGGTGAGCGACCGTAAAAATAAATATCAAATTGTTCTTTGCTGATCTCACTGAAGTGTAATCCTGCGTTCATACCAATTCCTTTTCAATGAATAATTGGCCTTTAGGAGTGATTCCCTTTGTCTTTAATTCAGTTCTAACTAGTTCTTTAATCCAATAGCCTAAGCTCATCATGCAGTTGGATCATAAGACAACGCCCTATAGTGCTCGTGATACTATAGGGCATCTGACCACACTGTTAACTGGAGTAACGACTATGGCAGGAATACAGCATAACCAAACTCACCCCAAACTTACATAGCGCTTTCTGGCCGTGAGCATAACAAGGTCCACTCCTCGCTCATAAGGGACAACCATACTCAAATCTCCCACATTGCAGGAGATTTGAGTATGAACACGTCACCGTGGAACAAAGACCGTATCATAGGCCAAAAAAGACCACTTCAGATATCTCATATCTGGGGTATCCGAATCCGACTTGAACTGGAAGGTAAAACTCGCGATTTAGCTCTGTTCAACATGGCCCTGGATAGTAAGCTTCGAGGCTGTGATCTGGTCAAACTCAAAGTATCTGATGTTGCATATGGTGGCTCTGTTTCAAGCAGAGCAACGGTGTTGCAACAGAAAACCGGTAGCCCTGTTCAATTTGAGATAACCAAAGGGACAAGAGAAGCTGTTGCTGCATTGATACAGCTTAGCAATTTGCACAGTAAAGACTTCTTGTTTCGGTCTAGGGTCGGAACTAACCAGCACATTTCAACCCGGCAATACAACCGAATCTTTCATGGGGGGGTAGAAAAGCTTGGTCTCGAAGATTCGCTTTACAGCACACATTCCATGAGAAGAACAAAACCTTACCTGATCTACAAGAAAACCAAGAATCTCCGGGTGATCCAACTTCTGTTGGGTCATAAGAAACTGGAAAGCACAGTCCGTTATCTGGGCATTGAAGTCGATGATGCGTTAGAGATTTCTGAATCGATTGAAGTCTAAGGTTGTCAGGGCTGCAACAGCAGCCCTGTGCCATAAGCGGAAGTATTTAACAACTATCAGTGTTGTTCAACAGATAAAGGGGCACTTGATTTTTTCTGTTCTCAGGAAATGATAAAAGCGCGTCGGTTCAAGCCTGCTTAACGGGAGTTTGTTAATCCTGTTGCCGTGACGTTTTGACACCATTATGATGGGGAGACACTTAATGTATGAAGGTTCCGCCACTTATACCTGTCCAACAACTGCCTCGGATGTTTCTTTGTATGAATAAGTGGTAATGAGTAGTGAATCGCTAACAGTCACCCGAACAATCGGTGCCTGCAATTAATTCTATATTCTAAACGAGGGGGAGATTATTACACATGAAATTTAAGGACAAGAACCTTAAGGCTCTCGCGGAATGTATCATAGGAGATAATAAGGCATTTCTGTATCGTTCAAGCAGTCACATCACTGAATTTTTCCAGGACTGCGGCATGGATGTTACTCATGACGGATCCACTCGGTGGAAATGGACGGCCCAGAGGCTTGAAGAACTTCTTTATGAGCCACAGTCAAAGCCACATACTTTGCCGGAAAGGTTTGTTCATGTGCTCAGAACTTTAATGTTAAAAGAAGATGCAATGGATGACGATCCAGGAAGATTAAAGGCGCTTGAAGAACTGAACAAGCCTTTGATGCGGGAAGGCTATGAGGCATTCTATGGTGACGATCGCCTTTTGTATATACGCCATACCGATACCAAAACGGTTTCAGTCAGTAATAACCCTCATCGGCCCTTAACGCCTCACGAAGTAGAATGCAGAAGGTTACTGACCGCGTTTCTTGATACCTGCTCAGAAGATGAGTTAATAGAAGATATTCTCCTTCCTTTATTCCGGCAACTTGGTTTTCACCGGATAACAGCAGTGGGACATAAAGATAAAGCGCTGGAATACGGGAAAGACATCTGGATGAAGTTCACACTGCCAACTCAGCATGTTCTTTATTTCGGCATTCAGGCAAAAAAAGGTAAGTTGGATGCGTCCGGTGCCAGCAAATCTACGAATTCAAACGTGGCAGAAATCTTCAACCAGGTACTGATGATGCTTGGCCATGAAATATTTGACCCAGAAACAAATAGAAAGGTGCTGGTAGATCATGCCTTTATCGTTGCTGGCGGAGAAATTACTAAACAGGCGAGGAACTGGCTGGGCGGGAAACTTGATGCCAGCAAAAGAAGCCAGATAATATTTATGGACCGGGAAGACATTCTTAATTTATATACTGTAAGTAATGTACCTCTGCCAACAGGTGCTCTCATCTCTGATGATGCCGTTAAGAACGATGATATTCCTTTCTAATCAGAAGTACGTCTTTTTCTGAAAGAATACGTGATAGGTAGCCACACCACACCTTTAGTGACCCCTTAATCTGGTAATATAACAGCCCGTATGAATGTCCGCGGCATCGCGGGCTGAAATTTATTAAAAATACTTATTCATCAAGCTGGAGTAGTTTGCCGAGTAACTGTAAACGCCCAACTTAACCGGACCATTCACTTTTAGATTGCTACCAGCAAACCAACTTCCGTTTCTCGCTCAAAGCGGACTAGAAGGTTAGCTTGCGTCGGACTTGGCGTATTTAAAGAAGTGCTGGTGGTAACTGGTTGTTGTGTTCCATTTCTACAAAACAAAATCACAGAAACTATACCCAATAGTTATATTGAATCAATGATGAGACAGCCTCATATTTATCAGAACTGGTGTACGTCCAATACAGGAGGTTGTCGTGCTGGTTCTCAAATATGCGCTAGCTATTGCGGCTGTAATGGCAATTTATTGTCTTGCTATTGTTCTTACGGATCGCCTTTCTGATTGATTTTATATTGGCGAGGTGACGGGAGTTAAGTAGAATTGCTGCGGGTGCTTGAGGCTATCTGCCTCAGGCATGAACACCAAAGGCAGATAGAGAAAAGCCCCAGTTAACATTACGCGTCCTGCAAGACGTTTAACATTAATCTGAGGCTCAATCTATGAACGGCAAATCTAGGTTAGCCTCTTACGTGCCGAAAGGCAAGGAGAAGCAGGCTATGAAGCAGCAAAAGGCGATGTTAATCGCCCTGATCGTCATCTGTTTAACCGTCATAATGACGGCACTGGTAACGAGGAAAGACCTCTGCGAGGTACGAATCCGAACCGGCCAGACGGAGGTCGCTGTCTTCACAGCTTACGAACCTGAGGAGTAAGAGACCAGGCGGGGGAGAATCCCTCGCCACCTCTGATGTGTCAGGCATCCTCAACGCACCCGCACTTAACCCGCTTCGGCGGGTTTTTGTTTTTATTTTCAACGCGTTTGAAGTTCTGGACGGTGCCGGAATAGAATCAAAAATACTTAAGTAGCGCGCAGGGATAAGAGGGATGGTCCCTTAAAGGGGAGAGCTAATTATCCGGAAGGATTCTGATGATGAACATCGAAGAACTGCGTAAAATTTTTTGTGAAGATGGCCTCTATGCTGTGTGCGTTGAAAATGGAAATCTTGTTAGTCATTACCGCATTATGTGTTTGCGAAAGAATGGGGCTGCGTTAATTAATTTTGTGGATGGTCGAGTGACAGACGGATTTATCTTGCGCGAAGGTGAGTTTGTCACTTCATTACAGGCACTGAAAGAGATTGGAATAAAAGCAGGCTTTTCAGCTTTTGCAGAAGAATAAACTCATCTACAATCTTGCGCGGGGCTGAACTCCCGCTGAGTAACACCGTGCCACCGGAGAAAACCGATGGCACGCAACGTAAAATATTACAATTCTGATAATTCGCCCGTTCTTGCCTGCACGCACGAGCGGTATTCTCACGCATTCAAGTCTGAATGGTTCCAGCACCCTCCATGCACTGAAGAGCAGGCTGAATGGATAATTCAGTGTTACCGCAGGCGCGGATACGAGGTTAAGAAAGCTCTTAGTCTCGACTACCGTCACTGGATAATCTCAGTCAGATTGCCTTACTCCGAACGCCCACCGCGTCCGTCCCGTACATTCCAGCAACGCATCTGGAGGTAACGTGCGGGTATTACTTCGACCTGTTCTGGTACCGGAACTCGGGCTGGTGGTCGTTAAGCCGGGCCGTGAATCCATGCCGGTATTCCACAATACCCGGGTACTGGTGGAGCCGGAACCGAAAAGCATGCGTAATCTGCCGTCCGGGGTCGTTCCTGCCGTTCGCCAGCCGCTGGTGGAAGACAAAACATTGCTGCCGTTTTTCAGTAACGCACGGGTAATTCGTGCTGCTGGTGGTGCTGGTGCATTGTCTGACTGGCTGTTGCGCCATATTAAATCCTGCCAGTGGCCACACGGCGATTATCATCACAGCGAAACCGTCATTCACCGTTATGGTACCGGCGCAATGGTGTTGTGCTGGCACTGCGACAACCAGCTGCGTGACCAGACATCCGAATCACTCGAGCAACTTGCTCATCAAAACCTGTCAGCATGGATGATTGACGTCATCGGTCACGCAATAAGCGGTACGCAGGAGCGTGAATTATCTTTGGCTGAATTATCCTGGTGGGCGGTCCGCAATCAGGTGGCGGACGCGCTACCGGAAGCGGTATTACGTGGTTCGCTGGGGTTGCGTGCGGAAAAAATCCGCTCAATGTACCGTGAAAGCGACATCGTACCGGGAGAGCAGACCGCCAACAGCATACTGAAACAGCGCACAAAAAATCTTGCGCCGCTGCCTCACGCCCACCAGCAACAGAACCCACCACAGGAAAAGACGGTGGTCAGCATTGCCGTTGATCCTGAGTCTCCGGAATCTTTCATGAAACGACCTAAACGTCGCCGCTGGGTTAACGAGAAATACACACGCTGGGTGAAGACACAGCCGTGTGCGTGTTGTGGTAAGCCAGCCGACGATCCCCATCACCTGATTGGTCATGGTCAGGGCGGAATGGGGACAAAATCTCACGATATTTTCACGCTACCGCTGTGTCGGGAGCATCACAACGAGCTTCATGCGGATCCTCTGGCGTTCGAAGAAAAGCATGGTTCTCAGGTTGATTTAATTTTTCGTTTTCTTGATCACGCCTTTGCAACTGGCGTGCTTGGGTAAAAGAGGTGACTGATGCTCATAGATTTGGTTTTACCTTACCCGCCGACGGTGAACACTTACTGGCGACGCCGTGGCAGCACATATTTTATCTCGGAGGAGGGAAAGCGTTATCGCCGGGCTGTGGCGCTTATTGTTCGCCAGCAGCGGCTGAAATTAAGCCTGTCCGGAAGGCTGGCGATAAAGGTGATTGCAGAGCCACCGGATAAGCGTCGTCGCGACCTGGACAATATCCTGAAAGCACCGCTGGATGCGCTGACGCATGCGGGAGTGTTAATGGACGATGAGCAGTTTGATGAAATCAATATCGTTCGTGGTCAGCCAGTATCTGGTGGACGTCTGGGGGTGAAGATTTACCCCATAATGCATGAAGAGCAGGTCAAAAAATGAAACTGGAAGATTTACCGAAATACTACTCCCCAAAATCCCCTGGCCTGACCGATGCATCGGCCTCAACGTCAAAAGATGCGCTGAGTATCACTGATGTGATGGCCGCGCAGGGCATGACACAGAATCGGGCTGAGATGGGTTTTTCTGCGTTCCTGGGGAAAATGGGCATCAGTATGAATGACAGGGCGCGGGCAACAGAATTACTGGCAGATTATGCACTCAGTCGGTGCGATCGTGTGGCGGCGTTGAGAAAGCTTCCGGCAGAAATAAAACCGGTAGTGATGCGCATTATGGCTTCGTACGCTTTTGAGGATTATGCCCGCAGCGCAGCGAGTAAAAAGCAGTGCCCTTGTTGCTATGGGGAAAAATTTATTGAAAGCGTAGTTTTTACAAACAAGGTCCAGTATCCGGATGGTAAGCCGCCGGTATGGGCAAAGTGTACGAAAGGTGTGTATCCGTCTTACTGGGAAGAATGGAAAAAAGTCAGGGAGGTGGTAAAAGTTGCCTGTCCGGAGTGTGGCGGAAAGGGTGAGGTTTCCACCGCCTGTAAGGATTGCCGTGGGCGTGGTGTCGCCATTCATCGTGAAGAGTCGGTAAAACGTGGTATGCCTGTTATCAGAGACTGCCAGCGTTGTGGTGGTCGTGGCTATGAAAGACTACCATCAACGGAGGCATTTAATGCTATATGCGAGGTGACAAACCAGATAACACGCGCGTCATGGGAAAAAACAGTTAAGAAATTCTATGATGCGCTGGTGACCCGGTTTGATATTGAAGAAGCATGGGCTGAGCGGCAGTTAAAAAAGGTAACTAGGTAACAAGGTTGATTTTTCCGGAATCTGTGGTAAATTCGTCATAACGATGGGCGTTTTATGCCTGACGTTAGAAGAGTTTCTACAACCCGCCGCCGAGCGGGTTTTTTATTGCGGAATTAATTATGGACCGTTATTATTCTGCTCCCGGCCCTTTAGCTCAGTGGTGAGAGCGAGCGACTCATAATCGACAGGTCGCTGGGTCAAATCCAGCAAGGGCCACCAACCGTCACCAGTTCATCAGGAAAGAGCGTCAACCCTTTAAGTTGAGTGTGCGAGGTTCGAGTCCCCGGTGGCGGTCCAGTGCCGACTTCGCTCAGTAGGTAGAGCAACTGACTTGTAATCAGTAGGTCACCAGTTCGATTCCGGTAGTCGGCACCATATGCGGGCATCGCATAATGGCTATTACCTCAGCCTTCCAAGCTGATGATGCGGGTTCGATTCCCGCTGCCCGCTCCAGTTAGAGTCTTTCAGTCTGCGATGATGGGAAATCCCGGAGTGACTGAAAGACGTTTAAGTTATGAATGATCGCTTTTTTTTGCAAAATTGCTGTGCAGAAATACTAACCTTCGGGCAGGCGATCATTCATAAGCACTCTGCTTTTATTCCGATTAACTGTGGGTGGTTTGTTGGATAGAGTGCTTTCCTTACTGTATATATTGTTTCGCCCGCTTTTGCGGGCTTTTCTTTTCAAATCCCTTTCATTTCTCAGTGTAAAACTACGCCATCCGTTATTTGCGGAGGTGAGGCTATGAAATCCATGGACAAAATTTCAACGGGCATTGCCTATGGCACCTCCGCAGGCAGTGCTGGCTACTGGTTTTTACAGCTGCTCGATAAAGTCACGCCCTCACAGTGGGCAGCAATAGGTGTGCTGGGTATCCGCACCGGCGGGCGCGTGCTGGCGGTAAACAGCCAGACCCGGACGCTGACGCTCGACCGTGAAATCACGCTGCCATCTTCCGGCACCACGCTGATAAGCCTGGTTGACGGGCAGGGGAGTCCGGTCAGCGTGGAGGTTCAGTCCGTCACCGACGGCGTGAAGGTGAAAGTGAGCCGTGTTCCTGACGGCGTTGCTGAATACAGCGTATGGGGGCTGAAGCTGCCGACGCTGCGCCAGCGCCTGTTCCGCTGCGTGAGTATCCGTGAGAACGACGACGGCACGTATGCCATCACCGCCGTGCAGCATGTACCGGAAAAAGAGGCCATCGTGGATAACGGGGCGCACTTTGACGGCGACCAGAGCGGCACGGTAAATGGTGTCACGCCGCCAGCGGTGCAGCACCTGACCGCAGAAGTCACCGCAGACAGCGGGGAATACCAGGTGCTGGCCCGCTGGGACACGCCGAAGGTGGTGAAGGGCGTGAGCTTCCTGCTTCGCCTGACCGTGGCAGCGGATGACGGCCGTGAGCGGCTGGTCAGCACGGCCCGGACGACGGAAACCACTTACCGCTTCACACAACTGGCTCTGGGGAACTACAGGCTGACAGTCCGGGCAGTAAATGCGCGGGGGCAGCAGGGCGATCCGGCGTCGGTATCGTTCCGGATTGCCGCACCGGCAGCGCCGTCGCGGATTGAGCTGACGCCGGGCTATTTTCAGATAACCGCCACGCCGCATCTTGCGGTTTATGACCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAAAACGGATTGCGGATATCAGGCAGGTTGAAACCAGCGCGCGTTATCTTGGTACGGCACTGTACTGGATAGCCGCCAGTATCAATATCAAACCGGGCCATGATTATTATTTTTACGTTCGCAGTGTGAACACCGTTGGCAAATCGGCATTCGTGGAGGCTGTCGGTCAGCCGAGTGATGATGCATCAGGCTATCTGGATTTTTTCAAAGGCGAGATAGGGAAAACCCATCTGGCTCAGGAGCTGTGGACGCAGATTGATAACGGTCAGCTTGCGCCTGACCTGACTGAAATCAGGACGTCCATAACGGATGTCAGCAATGAAATAACACAGACCGTCAATAAGAAACTGGAAGACCAGAGTGCAGCGATCCAGCAGATACAGAAGGTTCAGGTTGATACAAATAATAACCTGAACAGCATGTGGGCAGTGAAGCTGCAGCAGATGCAGGACGGACGCCTTTATATTGCGGGTATCGGTGCCGGTATTGAGAACACCCCTGACGGCATGCAGAGTCAGGTGCTGCTGGCAGCAGACAGGATTGCGATGATTAATCCTGCGAATGGCAACACAAAGCCGATGTTTGTTGGTCAGGGCGATCAGATATTCATGAATGAAGTGTTCCTGAAACGCCTGACGGCCCCCACCATTACCAGCGGCGGTAATCCTCCGGCATTTTCCCTGACATCAGACGGGAGACTGACGGCGAAAAATGCGGATATCAGTGGCAGTGTGAATGCGAACTCAGGAACGCTCAACAATGTCACGATTAACCAGAACTGTACGATTAAGGGCATGCTGGAGGCGACCCAGGTCAGAGGAGATTTCGTTAAAGCTGTATCAAAAGCCTTCCCGAAAAAAGTCGGTACGTGGGGTAACACGGAAACACCAAACGGTACGGTTACAGTAACCATCAGCGATGATCATAACTTTGACCGCCAGATTATTATTCCGCCCATTATTTTTAACGGTATAGCGTATGACGATCCGGGGAGCGGAAATAACCCAGGAGGCACGCGATACACGGGTTATGGTTTTGAAGTTCGCAAAAACGGCGTATTAATCGCATCCAGAGAAACTAAAGGGGCCATTCCCGGTAGTTACAGTGCAGTTATTGATATGCCGAGTGGCAGGGGAAGCGTCACTCTGGAGTTTAAGATTTTCCAGAAAGGCAATCAGGGGGCAGGCAATATCACCGACTGTACGGTGATTGTGACCAAAAAAGCGGCTTCCGGCATCAGTATTCGTTGAAATATTTATAACCCCAATAAAGGGCGTCAGGAATGACGCCTTTTTTATTGCAGAAAAGCGAGAGGTAATTATGCGTAAAGTTTGTGCAGCAATTTTGTCCGCAGCCATTTGTCTGGCCGTATCCGGTGCGCCTGCATGGGCGTCTGAACATCAGTCCACGCTGAGCGCGGGGTATCTTCATGCCCGGACCAACGTTCCCGGCAGTGATGATCTGAACGGGATTAACGTGAAATATCGTTATGAGTTTACGGATACGCTGGGGCTGGTGACGTCATTCAGCTATGCAGGAGACAAGAATCGCCAGCTGACCCGTTACAGCGATACCCGCTGGCATGAAGATTCCGTGCGTAACCGCTGGTTCAGCGTGATGGCGGGGCCGTCTGTGCGCGTGAATGAATGGTTCAGCGCGTATGCGATGGTACTGGTGGAAGAACTATCGAGCAAGCACGTGCGAACTTGCGGGTAATGTATGAGCAAAAAGCTGGCCTTGCTAATACTGACCTAAACACCCTTACCGGTGAATATTCTGGTTTCTATCAACAACCAACGAGCGCTTACGCAACAGAAGAGTTAAATTACCCAATCGGTCTGGCGGGCGCTTTAATAGTGCTCCAAACGAGAGCCAACACTGCTTCTTCCTGCGTTCAGGTGTACCACCCTTATAATAATCCGGGAATTACTTATAGACGAATATATGAAGGAGGTAGCGGTACCTGGTCTGAATGGAAGAGAGATGTATCAACAGAAAGGGTTGAAGAGGGAAAAGAAACAACTTACGTATATTCTACGTATTCTTCAGGCGCACCACGCTTACAGGTTTCCAAATCTGGTTTGTGGGGTTGTCATAATGGCACTGGCTGGTTGCCATTAGCTGTTGGGCAAGGAGGTACAGGTGCGACAACAGTAGAAGATGCGCGAAACAACTTAAGTCTTGGCGAAAGTAGCGCAGTTAAATTTAAAAACCTTACTTTAACCGAAGCGCTCGACACGACATTAGGACTGCTTACAAAAACAGGACGAGACTGGAACACGCAGCATACTGATAACATTAATAAATTTATACCAATTGCAGGCAGTACAAACGGCCCGGCAGGCTCTATGGTTCTTGGCGGCATTCATGTTCAATTTAGTAAAAATTATGCTGTGCAGTTCGGAGGCCGCAATTCCGGTTTTTGGGGAAGAACAATTGAAAATGGAACGACACAGGAATGGAAGAAATTACTAACAGTAGACGATCTCAATTCATCTACCGATCTTGCTGTCAGGTCATTAACCACATCTAACCCGGTAAAATCTGGCGGAGGGCGAATTGATGTCCTTGGAAGCACGTCAGACTATAGCAAAATGGATTGCTTTGTACGTGGGTTTGATAGCACCGGTAATTCTCTCGTGTGGGCGTTGGGTTCATCAGTCGGCGTAAGTAAGATGCTATCGCTAAAAAATTTCTTTAGCGGAGCTGAGATACTGTTAAATGGTAATGACGGCGCGGTTCAACTCAAAACAGGTGCTGTTAACGGGGCTACAGCGCAGACGCTCACTATCAACAAGAATGAGGTTAACTCAACCGTTGATTTAACCCTTACAAAGCAATCAGGGACTGGTAATCGTTTTGTTTTACAGAACTCAGGTAATGCAGAACTACCGTTTTCTGTCAGGGTGTGGGGTTCCAGTACTCGACAAAACGTTTTTGAGGTTGGAACGTCTGCTGCGTATCTGTTTTATGCGCAAAAAACGTCAGCAGGCCAGTTGTTTGATGTAAATGGCGCTATTAATTGCACAACGCTGAATCAGTCATCAGACCGCGACCTTAAAGACGATATTCTCGTTATCAGCGACGCGACGAAAGCAATCCGTAAAATGAACGGATACACCTACACGCTCAAGGAAAACGGGATGCCTTATGCTGGCGTTATTGCACAGGAAGTAATGGAGGCGATACCAGAAGCTGTGGGATCGTTTACTCATTATGGTGAAGAGTTGCAAGGTCCGACCGTTGACGGCAACGAGCTACGCGAAGAAACTCGCTATCTTAATGTTGACTACTCCGCCGTGACGGGTTTACTTGTTCAGGTCGCCCGTGAAACAGATGATCGCGTTACCGCGCTGGAAGAGGAAAACACAACGCTACGTCAAAATCTGGCAACAGCAGGCACCCGGATCAGCACTCTGGAAAATCAGGTAAGCGAACTGGTTGCACTTGTCCGGCAGTTAACAGGAAGCGAACATTGATATCCTTCAAGCCCTGAAGGAGGCTGTTCCTGGTACGTTCAGACTGTTGTTGAGCTGGAAATCGCAACGGAGGAAGAAACTTCGTTGCTGGAAGTCTGGAAGAAGTATCGGGTGTTGCTGAACCGTGTTAATACAACAACTGCACCGGATATTGAATGGCCAGTAGCACCTATAGGGTAA
Protein sequences of DBSCAN-SWA_5 >CP031653|2387623:2416226|2399802_2399991_-|AXP26807.1|DBSCAN-SWA MKTLLPNVNTSEGCFEIGVTISNPVFTEDAINKRKQERELLNKICIVSMLARLRLMPKGCAQ >CP031653|2387623:2416226|2390248_2390554_-|AXP26795.1|DBSCAN-SWA MKLSTCCAALLLALASPVVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTRTTSGNVSAPAQSTQDGAPAEPQ >CP031653|2387623:2416226|2392977_2394192_+|AXP26800.1|DBSCAN-SWA MKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQCGIPGMKTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFNEHLLHDMHHRLTPIEAARFGKSIEDYRMFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYEPAYLPVARLEDGTLWNW >CP031653|2387623:2416226|2392808_2392997_+|AXP26799.1|DBSCAN-SWA MSADKRYSISSFVNHSQRKYTPFFVITPAFFDLYTCMVVAQLRRFHASRQAMQGIEHEDRKG >CP031653|2387623:2416226|2391374_2391935_-|AXP26796.1|DBSCAN-SWA MPSAHSVKLRPLEREDLRYVHQLDNNASVMRYWFEEPYEAFVELSDLYDKHIHDQSERRFVVECDGEKAGLVELVEINHVHRRAEFQIIISPEYQGKGLATRAAKLAMDYGFTVLNLYKLYLIVDKENEKAIHIYRKLGFSVEGELMHEFFINGQYRNAIRMCIFQHQYLAEHKTPGQTLLKPTAQ >CP031653|2387623:2416226|2405014_2405617_+|AXP26813.1|integrase|DBSCAN-SWA MNTSPWNKDRIIGQKRPLQISHIWGIRIRLELEGKTRDLALFNMALDSKLRGCDLVKLKVSDVAYGGSVSSRATVLQQKTGSPVQFEITKGTREAVAALIQLSNLHSKDFLFRSRVGTNQHISTRQYNRIFHGGVEKLGLEDSLYSTHSMRRTKPYLIYKKTKNLRVIQLLLGHKKLESTVRYLGIEVDDALEISESIEV >CP031653|2387623:2416226|2395410_2396706_-|AXP26803.1|DBSCAN-SWA MREVEMKYPTGVENHGGKLRIWFVYKGVRVRENLGVPDTAKNRRVAGELRSSVCYAIKTGVFDYAKQFPSSRNLEKFGEARQDLTIKELAEKFLALKETEVAKTSLNTYRAVIKNILSIIGEKNLASSINKEKLLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNAVFQFGVDNGYLADNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAKNLWCVSVYTGVRPGELCALGWEDIDLKNGTMMIRRNLAKDRFTVPKTQAGTNRVIHLIKPAIDALRSQMTLTRLSKEHIIDVHLREYGRTEKQKCTFVFQPEVSARVKNYGDHFTVDSIRQMWDAAIKRAGLRHRKSYQSRHTYACWSLTAGANPAFIANQMGHADAQMVFQVYGKWMSENNNAQVALLNTQLSEFAPTMPHNEAMKN >CP031653|2387623:2416226|2390661_2391372_+|AXP29151.1|DBSCAN-SWA MKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY >CP031653|2387623:2416226|2401855_2402800_-|AXP26811.1|DBSCAN-SWA MDLIMEWRFLGSLSEARKSGCSGVYLIVHKGLFSRVVYVGVSCNVGRRITEHYDGYLRGNRTIYDAGHDEDVYRFMSAYKIHNHTKYYQALANDYKIWASTTMYSDLPKNMLAKSQTFDTDWQSIALEKYIPQLVVWALPMAKYCYLNASRIESVIQSKLIKSFDLRGFFNIKQLSILGKIEYPYMEKVKVFIINTPDLDPASQLIFSNLYNKKTDNNFCKEFRSQFKSEIFQRESETQRKRTIREHKVSLYENYGKPWTLKEMEKLRVMLVDFDLSPIEISEYLGREPRSISKKISENDKVTNYKWRESVGWL >CP031653|2387623:2416226|2403347_2404697_-|AXP26812.1|DBSCAN-SWA MNAGLHFSEISKEQFDIYFYGRSPYLKTFSEEIRWFKYEGNGITLLSTIIICNIDKDFNAIVLGRDLDKKFRAINVLASFDSMDVLLNNLNDDIPKMLAQHQNGTFMQGDESTKPFSLFLSKVPAKKRNVYIKMLLEDPLHFPAYIVLEELAYWFKDPDGIFIRDFQSDAFNSRLFELYLNAVFYELDFEMNREYNQPDFLLSKFGVEIAVEAVSIAEAEAPLERKVINDEQMDELRKHVLNVMPFKFARSLLKKVRHCPEPEKVHYWELNHTKNKPFVIAMQDYSKRMSMAFSSEALHSYLYGIDIESGISIERHTDENRSIKSNFFGSEQNNYVSAVLLTTQATIPKFNRMGILAGVEASGFKVYVSGVKTDQDAAPHPFSADVSDPNYQEPWCTAMYMYHNPNAIHPVDYQLFPNVVHVFKKDEHFEEYIPRNYILQSTTMICKTE >CP031653|2387623:2416226|2411654_2413817_+|AXP26822.1|DBSCAN-SWA MKSMDKISTGIAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGIRTGGRVLAVNSQTRTLTLDREITLPSSGTTLISLVDGQGSPVSVEVQSVTDGVKVKVSRVPDGVAEYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFLLRLTVAADDGRERLVSTARTTETTYRFTQLALGNYRLTVRAVNARGQQGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSEKRIADIRQVETSARYLGTALYWIAASINIKPGHDYYFYVRSVNTVGKSAFVEAVGQPSDDASGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLTEIRTSITDVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQDGRLYIAGIGAGIENTPDGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQIFMNEVFLKRLTAPTITSGGNPPAFSLTSDGRLTAKNADISGSVNANSGTLNNVTINQNCTIKGMLEATQVRGDFVKAVSKAFPKKVGTWGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYDDPGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGRGSVTLEFKIFQKGNQGAGNITDCTVIVTKKAASGISIR >CP031653|2387623:2416226|2414648_2416046_+|AXP26824.1|DBSCAN-SWA MWGCHNGTGWLPLAVGQGGTGATTVEDARNNLSLGESSAVKFKNLTLTEALDTTLGLLTKTGRDWNTQHTDNINKFIPIAGSTNGPAGSMVLGGIHVQFSKNYAVQFGGRNSGFWGRTIENGTTQEWKKLLTVDDLNSSTDLAVRSLTTSNPVKSGGGRIDVLGSTSDYSKMDCFVRGFDSTGNSLVWALGSSVGVSKMLSLKNFFSGAEILLNGNDGAVQLKTGAVNGATAQTLTINKNEVNSTVDLTLTKQSGTGNRFVLQNSGNAELPFSVRVWGSSTRQNVFEVGTSAAYLFYAQKTSAGQLFDVNGAINCTTLNQSSDRDLKDDILVISDATKAIRKMNGYTYTLKENGMPYAGVIAQEVMEAIPEAVGSFTHYGEELQGPTVDGNELREETRYLNVDYSAVTGLLVQVARETDDRVTALEEENTTLRQNLATAGTRISTLENQVSELVALVRQLTGSEH >CP031653|2387623:2416226|2396725_2396977_-|AXP26804.1|DBSCAN-SWA MSEVIMIVSPGKWVSEEQLIALKGIKKGTLKKAREKSFMEGREYKHVAHDGMPWDNSPCFYNLEEIDRWIERQASARPRRHLT >CP031653|2387623:2416226|2400074_2400317_+|AXP26808.1|DBSCAN-SWA MRISDFRLYREISDGKSITYMIAGLNKEYGDVVESGLLFADPAVVDRETDELIEKAIAFKLAYRQQYQQKAGWNYESSFC >CP031653|2387623:2416226|2410087_2410909_+|AXP26821.1|DBSCAN-SWA MKLEDLPKYYSPKSPGLTDASASTSKDALSITDVMAAQGMTQNRAEMGFSAFLGKMGISMNDRARATELLADYALSRCDRVAALRKLPAEIKPVVMRIMASYAFEDYARSAASKKQCPCCYGEKFIESVVFTNKVQYPDGKPPVWAKCTKGVYPSYWEEWKKVREVVKVACPECGGKGEVSTACKDCRGRGVAIHREESVKRGMPVIRDCQRCGGRGYERLPSTEAFNAICEVTNQITRASWEKTVKKFYDALVTRFDIEEAWAERQLKKVTR >CP031653|2387623:2416226|2413887_2414283_+|AXP26823.1|DBSCAN-SWA MRKVCAAILSAAICLAVSGAPAWASEHQSTLSAGYLHARTNVPGSDDLNGINVKYRYEFTDTLGLVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMVLVEELSSKHVRTCG >CP031653|2387623:2416226|2397049_2399521_-|AXP26805.1|DBSCAN-SWA MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHREYKQTWKTLDDELAYALWPGDVDAGNIDGSIHRWAKKEVIDNDREDWKRISASMRKQPDALRYDRQTIFGLVRERPIDIHKDPVALNKYICEYLTTKGVFENEETDLGTVDVLQSSETQTDAVETEVSDIPKNETAPEAEPSVEREGPFYFLFADKDGEKYGRANKLSGLDKALAAGATEITKEEYFARKNGTYTGLPQNVDTAEDSEQPEPIKVTADEVNKIMQAANISQPDADKLLAASRGEFVEGISDPNDPKWVKGIQTRDSVNQNQHESERNYQKAEQNSPNALQNEPETKQPEPVAQQEVEKVCTACGQTGGGNCPDCGAVMGDATYQETFDEEYQVEVQEDDPEEMEGAEHPHKENTGGNQHHNSDNETGETADHPIKVNGHHEITSTSRTCDHLMIDLETMGKNPDAPITSIGAIFFDPQTGDMGPEFSKTIDLETAGGVIDRDTIKWWLKQSREAQSAIMTDEIPLDDALLQLREFIDENSGEFFVQVWGNGANFDNTILRRSYERQGIPCPWRYYNDRDVRTIVELGKVIDFDARTAIQFEGERHNALDDARYQAKYVSVIWQKLIPSQADS >CP031653|2387623:2416226|2408654_2409704_+|AXP26819.1|DBSCAN-SWA MRVLLRPVLVPELGLVVVKPGRESMPVFHNTRVLVEPEPKSMRNLPSGVVPAVRQPLVEDKTLLPFFSNARVIRAAGGAGALSDWLLRHIKSCQWPHGDYHHSETVIHRYGTGAMVLCWHCDNQLRDQTSESLEQLAHQNLSAWMIDVIGHAISGTQERELSLAELSWWAVRNQVADALPEAVLRGSLGLRAEKIRSMYRESDIVPGEQTANSILKQRTKNLAPLPHAHQQQNPPQEKTVVSIAVDPESPESFMKRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKSHDIFTLPLCREHHNELHADPLAFEEKHGSQVDLIFRFLDHAFATGVLG >CP031653|2387623:2416226|2407326_2407470_+|AXP26815.1|DBSCAN-SWA MMRQPHIYQNWCTSNTGGCRAGSQICASYCGCNGNLLSCYCSYGSPF >CP031653|2387623:2416226|2391969_2392311_-|AXP26797.1|DBSCAN-SWA MKITLSKRIGLLAILLPCALALSTTVHAETNKLVIESGDSAQSRQHAAMEKEQWNDTRNLRQKVNKRTEKEWDKADAAFDNRDKCEQSANINAYWEPNTLRCLDRRTGRVITP >CP031653|2387623:2416226|2394203_2395223_+|AXP26801.1|DBSCAN-SWA MKSILIEKPNQLAIVEREIPTPSAGEVRVKVKLAGICGSDSHIYRGHNPFAKYPRVIGHEFFGVIDAVGDGVESARVGERVAVDPVVSCGHCYPCSIGKPNVCTTLAVLGVHADGGFSEYAVVPAKNAWKIPEAVADQYAVMIEPFTIAANVTGHGQPTENDTVLVYGAGPIGLTIVQVLKGVYNVKNVIVADRIDERLEKAKESGADWAINNSQTPLGEIFAEKGIKPTLIIDAACHPSILKEAVTLASPAARIVLMGFSSEPSEVIQQGITGKELSIFSSRLNANKFPVVIDWLSKGLIKPEKLITHTFDFQHVADAISLFEQDQKHCCKVLLTFSE >CP031653|2387623:2416226|2387623_2390050_-|AXP26794.1|DBSCAN-SWA MSKNDRMVGISRRTLVKSTAIGSLALAAGGFSLPFTLRSAAATVQQASEKVIWGACSVNCGSRCALRLHVKDNEVTWVETDNTGSDEYGNHQVRACLRGRSIRRRINHPDRLNYPMKRVGTRGEGKFERISWDEALDTIASSLKKTVEQYGNEAVYIQYSSGIVGGNMTRSSPSASAVKRLMNCYGGSLNQYGSYSTAQISCAMPYTYGSNDGNSTTDIENSKLVVMFGNNPAETRMSGGGITYLLEKAREKSNAKMIVIDPRYTDTAAGREDEWLPIRPGTDAALVAGIAWVLINENLVDQPFLDKYCVGYDEKTLPADAPKNGHYKAYILGEGDDNTAKTPQWASQITGIPVDRIIKLAREIGTAKPAYICQGWGPQRQANGELTARAIAMLPILTGNVGISGGNSGARESTYTITIERLPVLDNPVKTSISCFSWTDAIDHGPQMTAIRDGVRGKDKLDVPIKFIWNYAGNTLVNQHSDINKTHEILQDESKCEMIVVIENFMTSSAKYADILLPDLMTVEQEDIIPNDYAGNMGYLIFLQPVTSEKFERKPIYWILSEVAKRLGPDVYQKFTEGRTQEQWLQHLYAKMLAKDPALPSYDELKKMGIYKRKDPNGHFVAYKAFRDDPEANPLKTPSGKIEIYSSKLAEIARTWELEKDEVISPLPVYASTFEGWDSPERRTFPLQLFGFHYKSRTHSTYGNIDLLKAACRQEVWINPIDAQKRGIANGDMVRVFNHRGEVRLPAKVTPRILPGVSAMGQGAWHEANMSGDKIDHGGCVNTLTTLRPSPLAKGNPQHTNLVEIEKI >CP031653|2387623:2416226|2401303_2401726_+|AXP26810.1|DBSCAN-SWA MAKVFTQEEREKIKGQVVELVRRSGRETLRQLEVKTGATRYLMSVLARELVASGDVYNSGYGLFPSEQARKDWQNARKKLSRAKLKEPSAVDPDLIWSLPDGEIRRYDRRHNMICTECRKSEVMQRILSFYQGDVRYLLK >CP031653|2387623:2416226|2395280_2395391_+|AXP26802.1|DBSCAN-SWA MTIEKHERSTKDLVKAAVSGWLGTALEFMDFKSHAC >CP031653|2387623:2416226|2407628_2407841_+|AXP26816.1|DBSCAN-SWA MNGKSRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIMTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >CP031653|2387623:2416226|2416100_2416226_+|AXP26825.1|tail|DBSCAN-SWA MEIATEEETSLLEVWKKYRVLLNRVNTTTAPDIEWPVAPIG >CP031653|2387623:2416226|2400243_2401263_+|AXP26809.1|DBSCAN-SWA MLSSLRIDSNTNKKLDGIMSLLFAERPLVINTQLAMKIGLNEAIVLQQLHYWLRDTNSGMECDGVRWIYNTTEQWLEQFPFWSESTLKRAFASLKTLGLLRCEKLNKSKRDMTNFYTINYGSELLDGGKLSESIGLKCAAPSGQNDTMEEVKMKRSIGSKRLNVIGSKWPDDLTENTTEITTENKKTSRPEASQPDPQTVEQDFLTRHPDAVVFSAKKRQWGSQEDLACAQWIWGRIVSLYEQAASDDGEISRPKEPNWTAWANDVRTMRMLDGRTHRQICEMFGRVQRDPFWVKNIMSPSKLREKWDELVIRLGRSSVQRCVNHISEPDTEIPPGFRG >CP031653|2387623:2416226|2392445_2392772_+|AXP26798.1|DBSCAN-SWA MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLTLHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLTLYDWTGALIALCGMLIIVAGWGRT >CP031653|2387623:2416226|2409716_2410091_+|AXP26820.1|DBSCAN-SWA MLIDLVLPYPPTVNTYWRRRGSTYFISEEGKRYRRAVALIVRQQRLKLSLSGRLAIKVIAEPPDKRRRDLDNILKAPLDALTHAGVLMDDEQFDEINIVRGQPVSGGRLGVKIYPIMHEEQVKK >CP031653|2387623:2416226|2408374_2408653_+|AXP26818.1|DBSCAN-SWA MARNVKYYNSDNSPVLACTHERYSHAFKSEWFQHPPCTEEQAEWIIQCYRRRGYEVKKALSLDYRHWIISVRLPYSERPPRPSRTFQQRIWR >CP031653|2387623:2416226|2405976_2406957_+|AXP26814.1|DBSCAN-SWA MKFKDKNLKALAECIIGDNKAFLYRSSSHITEFFQDCGMDVTHDGSTRWKWTAQRLEELLYEPQSKPHTLPERFVHVLRTLMLKEDAMDDDPGRLKALEELNKPLMREGYEAFYGDDRLLYIRHTDTKTVSVSNNPHRPLTPHEVECRRLLTAFLDTCSEDELIEDILLPLFRQLGFHRITAVGHKDKALEYGKDIWMKFTLPTQHVLYFGIQAKKGKLDASGASKSTNSNVAEIFNQVLMMLGHEIFDPETNRKVLVDHAFIVAGGEITKQARNWLGGKLDASKRSQIIFMDREDILNLYTVSNVPLPTGALISDDAVKNDDIPF >CP031653|2387623:2416226|2408056_2408308_+|AXP26817.1|DBSCAN-SWA MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDGRVTDGFILREGEFVTSLQALKEIGIKAGFSAFAEE >CP031653|2387623:2416226|2399614_2399806_-|AXP26806.1|DBSCAN-SWA MNSAFALVLTVFLVSGVPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVINQDNIEIPAGL |
33 | Enterobacteria_phage(30.0%) | tail,integrase | attL 2388688:2388702|attR 2412466:2412480 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2817473 : 2828251
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >CP031653|2817473:2828251|DBSCAN-SWA GTCATTTTGCTAATAACACTTTTCTGGCTTCATCGACTTTATTTGGAGCATCGAACCAATATTCGGCGAGCAAAGGGCAGATCTCTGTATCAACGACTTGTTCATACCAAGCCTGGGCATCGTTGATTTTTTGGCCGATAGCTGGGGTGACATAGCTGTGACCGATACAAAACTGTGGTCCTAAGGTAACATCTTTTGCCAGCATATCGTTAAGTACCGTCAGTCGAGATTTGATGAATGCTAACATGTCAAAATCAATTGCGTAATTATAATTTACCCAGTTTCTCCACGCGTCATTAAAAGCTGGCTTTAGATCGATAAACGCGAAACGACGGCGCAGCGCTAGGTCAAGTAGTGCGAGTGAGCGGTCTGCAATATTCATCGTACCGATGATATATAGATTCTCAGGAATGTATATTTTTTCATCATCATTTTTAGGATAGGAAAGAGATAATGCCTCAGTCGGCGTGCGTTTATCTGCTTCCATCAACGTGAGTGTTTCACCGAAGATTTGCGCCGGATTGCCACGATTAATCTCTTCAATTATCACCACATATTTCGAGGTAGGATTATTGACTGCAGTTTTGATTGCATTTACAAAAGGTCCATCAATTAGCGTCAATTGCCCTTCTTTACCTGGACGCCAGCCGCGAATAAAGTCTTCGTAAGAGAGGTTCGGGTGAAATTGCACCGCGCTAATACGCTCAGGTGCTTTTTCTCCCATCAAGCAGTACGCCAGACGTCGCGCTAACCAGGTTTTTCCAGTTCCGGGTGGTCCTTGTAATATCAGGTTTTTCTTGTCGATCAGGCGCTGAAGTGTGAGTTGGATCTTAGCCTCTTCCAAGAAACAGCCATCCTGCACCAGATGACTGATGTCATAAGGAACGTGAGTGAGTTTTGGCAACGGCGCACTCTCTTCGACAGTTTCCTCAGTTATCGTCTGGGATTCATTTTTCTCAAAATTCAGGTATTCATAATTTCCGGACTCAAGGGACTTTAACTCATCATCATTGCATAGTTTCTGTAGATAAAAGTAGATTGAGCTTGTAACTGTTTTGCTCTCAGGATGCTCTACCTTGAATACGTCCAAGTAGTTTTCTTCCAACTCCGAAGCAGTGAAATAAGGGCCATTTTTTTTCAGGCATAACGCCTTAATTTTATTCAGTAAAGAGGCTTTCCACGTTCTATTTGCCACCTCATCTTTAGACTGGCTAAGATCTGTATTCCACGCTGATAAAGAAAGTTCTGGGAAAGAATGAACCGGATAGTTTGGTTGGGTAAAGACCTCGTTCAGCGCCCGCATAAGACTCAGGTAACTTTGTCCACTACAGCGACCTTTTGCCCCGTTTTTAATGATCTTAATGTTTAACACTGTCTGAATGTAATACTGCGACTGGCTATCTAAAGTAGGGTAGAACCAAGGACGGGTCCAGTACAACCCCATGGTGAGATTCCAACCGACATTCATTACTGTAGAAGCAATGTCATATGCGGCAGTGAAGTCTGCAGAGTTAGTATTCTGGTTATCCGCAAAGGTCATTGCCTGCGAGAACATTTCCCACAAGCATTCAATGTCATTAGGGTCGCGTGACTTTTCATAACCAAAGAACCAAGATTTTTGGTTATTCAACAGCGGGATTCCGGCAAAGGAGTCAGGAATCGGTTCGTTCACGCCCAACAAATTCGCTAGCTTGGCAGCAATAATTTTGCGATTGCTGTCGGTCAAGTTACGATTGAACAAGCCCATAGTAGTAAACGGACAAATGTCTTTTAAGGGAAAGATCTCTCCCATAATAGATTTGTCCTGCAGATGGGACATTCCTTCCACACCTGAGGCAATTAGATGAATACCTTTGACTAATTCATCTCTACGATTTCGCCAAGTCAGTAACGCGTTGGCAAAAGCCTCATAAAAACTAGCCCAAGCAAATTTGCCATCATGTTCTGCTGTATCCACGGGAACTCCATTATACTTGTTGAGCAATGATAATTTATCTGTGTGAGTTTTAACATATTACTATCAGTTATAGAAAAATTTAACTACCCGATACAGAGAGCGGCATGCTGAATTTGACCTGACTTGCTTCCAACTAATTAAAATCAACTTATTTATCAATTGGTTATTTTGGCGCATAGCGGTCATCAAGGGAATATCGCGTTGTCATAAGGTGTCGAGGCTCGGAGGTTCAAATCCTCTCATGCAAAAAATAAATAAAATTAATGACGGTTGGAAATTATTCAATACATACACTCTCGAAAGTGCATCAGCCAACCGCAGCACGTCTTGCATACGGCGTGTCTGCAGTTTTATATAATCCTGGCTGGAAACCTCTTATACAAAGTAGATACACCAATATCATAGATGATCGCCACCTTCTGACGCGGGACTTTTGATGCAATTAATCGCCCGGCCTGCGCCCATTGTTCTGATGTAAGTTTAGGACGACGTCCACCAATTCCTCCCTGTGCGCGAGCATCTTCCAGTCCAACTTTTGTCAGTCATCAATCAGTTCACGCTTCATTTCAACCAGGGCTCCATCACATGGAAAAATAAAAAAAACCATCGATGTTGACGTGTCAATGTTACCTGTCAGGCTACGGAAATCCCCCCCTTACCCGCAGCTACTCGGTAAGTATAATAAGATGTTTCCTGCTCCTTCCTAACCTGTCCAGTTTCCAGACAATTAAAGTATCACCTTATATAAGAGACTGCAGAGCGCACTTTAACCTAGGTCGTTCTGAAATAGTTGCAGATTCGATTAATCAACGATTATACTCCCCGGCACTCCAGAGGATCTGGTAAACATAAAGTCAGAGCTTGGGTATACTGGCACACAAATGGCAGATCTTGCAGGTGCAGCCAGTCATAGCCAGTGGCGAAAATACACGAGTGGTTCTGAGCCCCGCGCCATGTCATCACATATCTTGTTTTTTTATTGCTACCAATCTGACTTTGAGTACTAATGAGCTAGATAGAATTGTTGGTAATGACTCCAACTTACTGATAGTGTTTTATGTTCAGATAATGCCCGATGACCTTGTCATGCAGCTCCACCGATTTTGAGAACGACAGTGACTTCCGTCCCAGCCTTGCCAGATGTTGTCTCAGATTCAGGTTATGTCGCTCAATGCGCTGAGTGTAACGCTTGCTGATAACGTGCAGCTTTCCCTTCAGGCGGGATTCATACAGCGGCCAGCCATCCGTCATCCATACCACGACCTCAAAGGCCGACAGCAGGCTCAGAAGACACTCCAGTGTGGCCAGAGTGCGTTCACCGAAGACGTGCGCCACAACCGTCCTCCGTATCCTGTCATACGCGTAAAACAGCCAGCGCTGACGTGATTTAGCACCGACGTAGCCCCACTGTTCGTCCATTTCAGCGCAGACAATCACATCACTGCCCGGTTGTATGCGCGAGGTTACCGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAACCGTGTTGAGGCCAACGCCCATAATGCGTGCACTGGCGCGACATCCGACGCCATTCATGGCCATATCAATGATTTTCTGGTGCGTACCGGGCTGAGAGGCGGTGTAAGTGAACTGTAGTTGCCATGTTTTACGGCAATGAGAGCAGAGATAGCGCTGATGCCCGGCAGTGCTTTTGCCGTTACGCACCACGCCTTCAGTAGCGGAGCAGGAAGGACATCTGATGGAAATGGAAGCCACGCAAGCACCTTAAAATCACCATCATACACTAAATCAGTAAGTTGGCAGCATTACCTTGTTAGTTTGTCTTTTGTGTTTACCTGATTCGGGTAAACGCCTTTACCTGATTTAGGTAAACTTTTCTTACCTGATTCAGGTAAATTTACCTCTTTCAGGTAAACTTTATTTTTCTTACCCGATTCGGGTAATGTTGACCATTCACTGACCACATTATTGATGCCGATATTCCGCCCGCTCTGAATAAAAATCCCACGCTTTACCAGAACACTTTTTGCAGCAGAACACTTGTGCGGCAATATCCCGGTTAATTCGGAAAGTTGCTCGTTGCTAACCCAATCCAGTTTTTTATTAAAGCCATATGTTTTGCGCATGACAGCCAGAAAGACCAGAAGCTGGTGCTGTGTTAATCCGGCCAGCATCACAGCTTCCAGCAACTCATTTGCAATGCGCGTATAACCATCATCGAGATCTGCCACGCGCCGCTCCTTTTGTGCCACATCCGGCACAGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCCGCAGACCGACTTTCGCCATTTTTGAACCTGTCATATTGCCCCCAGCATGGTGGTGACCATCGCCATCAATGGACCAGCCAGATCCGGGTCCACTCGAAACATCGACACAATGCCTTCACTCATCTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACCGCCTGCTTTGCCTCACAGAGTTCCTTTTCCATTTCAGCCAGCCGAGCCATGAAGCTATCCTGCTCAACCAGGTGGCCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATAGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAGGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTAGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACCGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATTGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTGAAACAGAAAGCCTCTGAGCAGAAGGTGGCTGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGAGATGATGCGTGGAACAAATTACGACTCGGCGTCATCACGGCTTCAGAAGTTCACAATGTGATAGCAAAACCCCGCTCCGGTAAAAAGTGGCCTGACATGAAAATGTCCTACTTTCACACCCTGCTGGCTGAGATTTGCACCGGTGTGGCTCCGGAAGTTAACGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAATGACGCCAGAGCCCTGTTTGAGTTTACTTCCGGCGTGAATGTTACTGAATCCCCGATCATCTATCGCGACGAAAGTATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAGCTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAGTTCCGGCTCGGTGGTTTCGAGGCCATAAAGTCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACACGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGTATGAAGCGTGAAGGACTGCATTATGTCGTGGTTGAGCGGGATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGCGCGATCACTTTCGTCTACTCCATTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCACTGAAAGCACAGCGGCTGGCTGAGAAGATAAATAATAAACAGGAGGATATATGAGTCAGGTTGGTAATCATTCATTCGAATTTCCGGCATCGCAAGGTGTACAGGGTGGTACTGTTACACTCTTCCTTACCATACCAGGAAGATCGCTGGCTCGTTTCCTCGCTTCAGATAATTACGGCCATACACTGGAACGCTCTCAGCGAGAAATTAATCCAAATCGAGTACGAAAATTTTTAAATTATCTCACTAACGCAGACTCAAGAAATGAGTCTTTTATCATTCCCCCTCTCGTAGGTAACTGTGATTCGAATATAGAATTTGTACCGTTTGGCAACACAAATGTTGGTATAGCCAGAATTCCCCTCGACGCCGAAATAAAACTTTTTGATGGCCAACATCGTGCAGCTGGCATTGAGATATTTTGCCGAAGTTCCCCATCAACGCTCATGGTTCCCATGATGCTTACAATGAATCTGCCGCTAAAAACCCGGCAGCAGTTCTTTTCGGACATAAATAACAACGTTTCTAAGCCATCAGCGACCATCAATATGGCGTATAACGGCCGGGATGATATTGCTCAGGGAATGATATCCTTCCTGACCCAACATACTGTATTTGCCGATATAACCGATTTTGAACACAACGTAGTGCCATTAAAAAGTAATATGTGGGTGAGTTTCAAGGCACTCACTGATGCAACGTCAAAGTTTGCTAGGAACGGCAATCAACAACTTGAAATGGGATATATAGAATCTGTCTGGGAGGCATGGATTACACTAACTCAGATTGACTCAATCCGACATGGTGTACACCACGCTACGTACAAGCGCGATTATATTCAGTTCCATGGAGTAATGATTAACGCTTTCGGTTTTGCGGTTCAACAGATGATGGTTAATCATTCCATCGCAGAAATAACTTCTATGATCGAAAAACTCTGTGCAACTACCAGCTCTGCAGAAAGAGAGGATTTTTTTCTGATGGATAACTGGGCGGGGATCTGCACGAAAGCCAGCCAGGAAAAACTATCGGTTATTGCCAATGTGGCAGCGCAGAAAGCAGCAGCAAACAGACTGATACAAGCTTTTACCAAAGGAAGTCTGGAAACAACTTAATGAATCAACATTGTCTCATATCAGCATGCTGTACGGCGTCTTTAAGGAACGGTGAGCATGAAAAACAAAATCATCATGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATCATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGCTAGATGAACGAAGACGCCTGGCCGTTCAGGGTTGTCGGACTTGTGCAAGTTGCCAGGAGGAGATCGAACTTAAGAACAAACAATGGGGACTGTGATGGCCTCAAAGCAGCAAATTTCAACATCGTCCAACTGAGGTGTAAAAATGTTCAGAATCATTTTTCCTAACACCTGGTACGTCGACCACCACGGCACTCCCTGCAAAATCCTGCGTTCTACCCACAACAAAGTTCACTACATCCGAAAAGGCAGAACATGTATCGCCAGCATGTTCCGCTTTAATCATGACTTTGAACCTGTGAATAAAGCTGATGCAGATCGGATAGCAGAAGAGATCGAAACGGCAGAACACATTAAGAAGTTACGTGCCATACGCAGGAAATAGAAAAATTGATAAATTCAATACTGCATTTCTCAGCATTAAATTTATCTCTATGACCAGTCAAGAGATGTACCTGCCATGAGCTTAATATCATGTCAGATATATCGGTCACAAACTCCCTCAGCAGCTAAGAGGAGGACAAATGTCTCGACTAATCACTTTACAGGACTGGGCTAAAGAAGAATTTGGGGACTTAGCACCAAGTGAGCGAGTTCTGAAAAAATACGCGCAAGGGAAAATGATGGCCCCACCCGCTATAAAAGTTGGTCGCTACTGGATGATTGACCGAAATTCCCGTTTTGTAGGAACGCTGGCAGAACCGCAACTCCCAATAAACGCAAACCCAAAACTCCAACGGATAATCGCTGATGGCTGCTAGACCCCGATCTCACAAAATCTCTATACCCAATTTATATTGCAAATTAGATAAGCGAACCGGAAAGGTATATTGGCAATACAAACATCCACTATCCGGTCGTTTTCATAGCTTAGGAACTGATGAGAATGAAGCAAAACAAGTTGCTACTGAAGCAAATACCATTATTGCTGAACAACGTACCAGACAAATATTAAGCGTCAATGAGCGTCTGGAAAGAATGAAAGGCAGGCGCTCAGACATTACGGTGACAGAATGGCTTGATAAATATATTTCTATCCAGGAGGACAGGCTGCAACATAATGAACTAAGACCCAACTCCTATCGGCAAAAAGGCAAACCCATTCGTCTTTTCCGTGAGCATTGTGGAATGCAACACCTCAAGGATATTACCGCACTTGATATTGCCGAAATAATTGATGCTGTAAAGGCTGAAGGTCATAACAGGATGGCGCAAGTCGTGAGAATGGTGTTGATCGACGTCTTCAAAGAAGCACAACACGCAGGACATGTTCCGCCAGGATTTAACCCAGCGCAGGCAACAAAACAACCGCGAAATCGAGTAAACCGCCAAAGATTGTCACTGCCCGAATGGCAGGCAATATTTGAAAGCGTAAGCAGACGGCAGCCCTATTTAAAATGCGGCATGCTACTTGCTCTTGTTACTGGACAACGTTTAGGCGATATCTGCAATTTGAAATTCTCTGATATATGGGACGACATGTTGCACATTACTCAGGAAAAAACCGGTTCAAAACTTGCTATTCCGCTTAACCTGAAATGCGATGCTCTGAATATTACCCTTCGTGAAGTTATATCTCAGTGCAGGGATGCTGTTGTTAGTAAATATCTGGTCCATTACCGTCACACTACCTCTCAAGCAAACAGAGGAGACCAGGTTTCTGCAAATACTCTGACAACGGCTTTTAAAAAGGCCAGGGAAAAATGTGGCATAAAATGGGAGCAAGGAACTGCGCCCACATTTCATGAGCAGCGATCTCTGTCAGAACGGTTATATCGGGAACAGGGTCTGGATACGCAAAAGTTGTTAGGCCATAAATCCAGAAAAATGACCGACCGATACAATGATGATCGTGGTAAAGACTGGGTTATCGTAGATATCAAAACAGCATAGAAAATAGCCAGTTTTGGGGAAGGGTTTTGGGGAAAGTTTTGGGGAAGATTTTACATCATCATAAAACAACGGGCGTATAACACGCCCGTTTCAATATTTAACACATGTAGAGATTACATGTTCTTGATGATCGCATCACCAAACTCTGAACATTTCAACAGTTTAGCGCCTTCCATCAGACGTTCGAAGTCATAGGTTACGGTCTTCGCATTGATTGCGCCTTCCATACCTTTAACAATCAGGTCTGCGGCTTCGGTCCAACCCATGTGGCGTAACATCATCTCAGCAGAGAGAATAATAGAGCCTGGGTTTACTTTGTCCTGACCGGCATATTTCGGCGCAGTACCGTGGGTGGCTTCAAACAGGGCGCATTCGTCACCGATGTTTGCACCTGGGGCGATACCGATACCGCCAACCTGCGCTGCCAGGGCGTCAGAAATGTAGTCACCGTTCAGGTTCATACAGGCGATAACATCATATTCAGCCGGACGCAGCAGGATCTGTTGCAGGAATGCATCAGCAATCACGTCTTTAATGACGATCTCTTTGCCGGTGTTCGGGTTTTTAACTTTCAGCCACGGGCCGCCGTCGATCAGTTCACCGCCAAACTCTTCACGCGCCAGCTGGTAGCCCCAGTCTTTAAACGCTCCTTCGGTGAACTTCATGATGTTGCCTTTGTGCACCAGAGTCACAGAGTCACGATCGTTAGCAATTGCGTATTCGATCGCTGCACGAACCAGACGTTTGGTGCCTTCTTCCGAACACGGCTTAATACCGATACCACAATGTTCCGGGAAGCGAATTTTCTTCACCCCCATCTCTTCACGCAGGAATTTAATCACTTTCTCGGCGTCGGCAGAGTCTGCTTTCCATTCGATACCCGCATAAATGTCTTCCGAGTTTTCACGGAAGATAACCATATCGGTCAGTTCAGGGTGTTTAACCGGGCTTGGAGTGCCCTGATAGTAACGTACCGGACGCAGGCAGATGTAGAGATCCAGTTCCTGGCGCAGGGCAACGTTCAGAGAGCGAATACCGCCACCAACAGGAGTGGTCAGCGGACCTTTAATGGCAACGCGATATTCACGAATCAGATCAAGGGTTTCAGCAGGCAGCCAGACATCCTGACCATAAACCTGTGTGGATTTTTCACCGGTGTAAATTTCCATCCAGGAGATTTTACGCTCGCCTTTATAGGCTTTCTCGACTGCAGCGTCGACCACTTTCAGCATGGCTGGGGTTACATCTACACCGATTCCATCACCTTCAATGTAAGGGATAATCGGATTTTCAGGAACGTTGAGTTTGCCGTTTTGCAGGGTGATCTTCTTGCCTTGTGCCGGAACAACTACTTTACTTTCCAT
Protein sequences of DBSCAN-SWA_6 >CP031653|2817473:2828251|2823500_2823659_+|AXP29171.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSITKRGWVFPGLSVIRNPLKAQRLAEKINNKQEDI >CP031653|2817473:2828251|2823655_2824720_+|AXP27184.1|DBSCAN-SWA MSQVGNHSFEFPASQGVQGGTVTLFLTIPGRSLARFLASDNYGHTLERSQREINPNRVRKFLNYLTNADSRNESFIIPPLVGNCDSNIEFVPFGNTNVGIARIPLDAEIKLFDGQHRAAGIEIFCRSSPSTLMVPMMLTMNLPLKTRQQFFSDINNNVSKPSATINMAYNGRDDIAQGMISFLTQHTVFADITDFEHNVVPLKSNMWVSFKALTDATSKFARNGNQQLEMGYIESVWEAWITLTQIDSIRHGVHHATYKRDYIQFHGVMINAFGFAVQQMMVNHSIAEITSMIEKLCATTSSAEREDFFLMDNWAGICTKASQEKLSVIANVAAQKAAANRLIQAFTKGSLETT >CP031653|2817473:2828251|2822515_2822827_+|AXP27182.1|DBSCAN-SWA MPLLCCEATYIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKASEQKVAA >CP031653|2817473:2828251|2824873_2825092_+|AXP27185.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPLDERRRLAVQGCRTCASCQEEIELKNKQWGL >CP031653|2817473:2828251|2821793_2822333_-|AXP27181.1|DBSCAN-SWA MQLLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGHLVEQDSFMARLAEMEKELCEAKQAVILNAPRHQKLKEMSEGIVSMFRVDPDLAGPLMAMVTTMLGAI >CP031653|2817473:2828251|2827000_2828251_-|AXP27189.1|DBSCAN-SWA MESKVVVPAQGKKITLQNGKLNVPENPIIPYIEGDGIGVDVTPAMLKVVDAAVEKAYKGERKISWMEIYTGEKSTQVYGQDVWLPAETLDLIREYRVAIKGPLTTPVGGGIRSLNVALRQELDLYICLRPVRYYQGTPSPVKHPELTDMVIFRENSEDIYAGIEWKADSADAEKVIKFLREEMGVKKIRFPEHCGIGIKPCSEEGTKRLVRAAIEYAIANDRDSVTLVHKGNIMKFTEGAFKDWGYQLAREEFGGELIDGGPWLKVKNPNTGKEIVIKDVIADAFLQQILLRPAEYDVIACMNLNGDYISDALAAQVGGIGIAPGANIGDECALFEATHGTAPKYAGQDKVNPGSIILSAEMMLRHMGWTEAADLIVKGMEGAINAKTVTYDFERLMEGAKLLKCSEFGDAIIKNM >CP031653|2817473:2828251|2817473_2819429_-|AXP27180.1|DBSCAN-SWA MDTAEHDGKFAWASFYEAFANALLTWRNRRDELVKGIHLIASGVEGMSHLQDKSIMGEIFPLKDICPFTTMGLFNRNLTDSNRKIIAAKLANLLGVNEPIPDSFAGIPLLNNQKSWFFGYEKSRDPNDIECLWEMFSQAMTFADNQNTNSADFTAAYDIASTVMNVGWNLTMGLYWTRPWFYPTLDSQSQYYIQTVLNIKIIKNGAKGRCSGQSYLSLMRALNEVFTQPNYPVHSFPELSLSAWNTDLSQSKDEVANRTWKASLLNKIKALCLKKNGPYFTASELEENYLDVFKVEHPESKTVTSSIYFYLQKLCNDDELKSLESGNYEYLNFEKNESQTITEETVEESAPLPKLTHVPYDISHLVQDGCFLEEAKIQLTLQRLIDKKNLILQGPPGTGKTWLARRLAYCLMGEKAPERISAVQFHPNLSYEDFIRGWRPGKEGQLTLIDGPFVNAIKTAVNNPTSKYVVIIEEINRGNPAQIFGETLTLMEADKRTPTEALSLSYPKNDDEKIYIPENLYIIGTMNIADRSLALLDLALRRRFAFIDLKPAFNDAWRNWVNYNYAIDFDMLAFIKSRLTVLNDMLAKDVTLGPQFCIGHSYVTPAIGQKINDAQAWYEQVVDTEICPLLAEYWFDAPNKVDEARKVLLAK >CP031653|2817473:2828251|2822823_2823504_+|AXP27183.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWNKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEICTGVAPEVNAKALAWGKQYENDARALFEFTSGVNVTESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP031653|2817473:2828251|2825744_2826887_+|AXP27188.1|integrase|DBSCAN-SWA MAARPRSHKISIPNLYCKLDKRTGKVYWQYKHPLSGRFHSLGTDENEAKQVATEANTIIAEQRTRQILSVNERLERMKGRRSDITVTEWLDKYISIQEDRLQHNELRPNSYRQKGKPIRLFREHCGMQHLKDITALDIAEIIDAVKAEGHNRMAQVVRMVLIDVFKEAQHAGHVPPGFNPAQATKQPRNRVNRQRLSLPEWQAIFESVSRRQPYLKCGMLLALVTGQRLGDICNLKFSDIWDDMLHITQEKTGSKLAIPLNLKCDALNITLREVISQCRDAVVSKYLVHYRHTTSQANRGDQVSANTLTTAFKKAREKCGIKWEQGTAPTFHEQRSLSERLYREQGLDTQKLLGHKSRKMTDRYNDDRGKDWVIVDIKTA >CP031653|2817473:2828251|2825139_2825379_+|AXP27186.1|DBSCAN-SWA MFRIIFPNTWYVDHHGTPCKILRSTHNKVHYIRKGRTCIASMFRFNHDFEPVNKADADRIAEEIETAEHIKKLRAIRRK >CP031653|2817473:2828251|2825518_2825755_+|AXP27187.1|DBSCAN-SWA MSRLITLQDWAKEEFGDLAPSERVLKKYAQGKMMAPPAIKVGRYWMIDRNSRFVGTLAEPQLPINANPKLQRIIADGC |
11 | Enterobacteria_phage(40.0%) | integrase | attL 2815446:2815469|attR 2826954:2826977 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
3283362 : 3310566
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >CP031653|3283362:3310566|DBSCAN-SWA TATGACAACGGACGATCTTGCCTTTGACCAACGCCATATCTGGCACCCATACACATCCATGACCTCCCCTCTGCCGGTTTATCCGGTGGTGAGCGCCGAAGGTTGCGAGCTGATTTTGTCTGACGGCAAACGCCTGGTTGACGGTATGTCGTCCTGGTGGGCGGCGATCCACGGCTACAATCACCCGCAGCTTAATGCGGCGATGAAGTCGCAAATTGATGCCATGTCGCATGTGATGTTTGGCGGTATCACCCATGCGCCAGCCATTGAGCTGTGCCGCAAACTGGTGGCGATGACGCCGCAACCGCTGGAGTGCGTTTTTCTCGCGGACTCCGGTTCCGTAGCGGTGGAAGTGGCGATGAAAATGGCGTTGCAGTACTGGCAAGCCAAAGGCGAAGCGCGCCAGCGTTTTCTGACCTTCCGCAATGGTTATCATGGCGATACCTTTGGCGCGATGTCGGTGTGCGATCCGGATAACTCAATGCACAGTCTGTGGAAAGGCTACCTGCCAGAAAACCTGTTTGCTCCCGCCCCGCAAAGCCGCATGGATGGCGAATGGGATGAGCGCGATATGGTGGGCTTTGCCCGCCTGATGGCGGCGCATCGTCATGAAATCGCGGCGGTGATCATTGAGCCGATTGTCCAGGGCGCAGGCGGGATGCGCATGTACCATCCGGAATGGTTAAAACGAATCCGCAAAATGTGCGATCGCGAAGGTATCTTGCTGATTGCCGACGAGATCGCCACCGGATTTGGTCGTACCGGCAAACTGTTTGCCTGTGAATATGCAGAAATCGCGCCGGACATTTTGTGCCTCGGTAAAGCCTTAACCGGCGGCACAATGACCCTTTCCGCCACACTTACCACGCGCGAGGTTGCAGAAACCATCAGTAACGGCGAAGCCGGCTGCTTTATGCATGGGCCAACTTTTATGGGCAATCCGCTGGCCTGCGCGGCAGCAAACGCCAGCCTGGCGATTATCGAATCCGGCGAATGGCAGCAGCAGGTGGCGGCTATTGAAGTGCAGCTGCGCGAGCAACTGGCACCAGCCCGTGATGCCGAAATGGTTGCCGATGTGCGCGTACTGGGGGCAATCGGTGTGGTCGAAACCACTCGTCCGGTGAATATGGCGGCGCTGCAAAAATTCTTTGTCGAACAGGGTGTCTGGATCCGGCCTTTTGGCAAACTGATTTACCTGATGCCGCCCTATATTATTCTCCCGCAACAGTTGCAGCGTCTGACCGCAGCGGTTAACCGCGCGGTACAGGATGAAACATTTTTTTGCCAATAACGGGAAGTCCGCGTGAGGGTTTCTGGCTACACTTTCTGCAAACAAGAAAGGAGGGTTCATGAAACTCATCAGTAACGATCTGCGCGATGGCGATAAATTGCCGCATCGTCATGTCTTTAACGGCATGGGTTACGATGGCGATAATATTTCACCGCATCTGGCGTGGGATGATGTTCCTGCGGGAACGAAAAGTTTTGTTGTCACCTGCTACGACCCGGATGCGCCAACCGGCTCCGGCTGGTGGCACTGGGTAGTTGTTAACTTACCCGCTGATACCCGCGTATTACCGCAAGGGTTTGGCTCTGGTCTGGTAGCAATGCCAGACGGCGTTTTGCAGACGCGTACCGACTTTGGTAAAACCGGGTACGATGGCGCAGCACCGCCGAAAGGCGAAACTCATCGCTACATTTTTACCGTTCACGCGCTGGATATAGAACGTATTGATGTCGATGAAGGTGCCAGCGGCGCGATGGTCGGGTTTAACGTTCATTTCCACTCTCTGGCAAGCGCCTCGATTACTGCGATGTTTAGTTAATCACTCTGCCAGATGGCGCAATGCCATCTGGTATCACTTAAAGGTATTAAAAACAACTTTTTGTCTTTTTACCTTCCCGTTTCGCTCAAGTTAGTATAAAAAAGCTGAATGCGAAACATTAAAAAACATTAATATCAATGTGTTACAATATCATTGGTCTAAAAAATAGACTACATGATGCTACAAAACACAACATATCCAGTCACTATGAATCAACTACTTAGATAGTATTAGTGACCTGAGACAGAGCATTAGCGCAAGGTGATTTTTGTCTTCTTGCGCTAATTTTTTGTTATCAAACATGTCGCACTCCAGAGAAGCACAAAGCCTTGCAATCCAGTGCAAAGATTTGTGTGCCTCAGTTTTGTCTAAGTGTTCTACTGAAAACATAGTAAAATCGGCAACAGCTGGAAATCATTCAATACTCGCACTATCGAAAGTTCACCAGCCAACCGCAGCACGTTCTTGCATACGACGTGCTGCGGTTTTCTTTATGATTTATGCACAATGGACAATTTGAAATTATTGATGATTGTATGGTGCATCGTTTTCTGAACCTACACTGATTTTTTGGTATAGCCTTGCCTAGTCAGTCTTACCGGATCAAACTCTTCGCTATTGCAATACTAACCAAAATCATCAATTTGACAGCGATTAACCAGAATAATAGTATACTATCACCAGTAAGAAATTATCGTTATTTGTAGCGATACATATTATATATATATCATTCTCAGGTGCGTACATGATTATCAACCAGGTACCTATAAAAATAAAAATCTTTATCTTTTTATTTTCATGCATCTCTATTATATTTTTGTTACTGCATGCAAATAATGGAATATACATAACACAAACAACACAAATAAGTTATAGTGTTTTCATTATTGGGCTTTTTTTCATAAACCTGATGATTTTTATTTTTCTATTGCTTTACTATGTTTCTAATCAGAGACAAAGTTATCTCTTAATTCTTTCATTCGCGTTTTTGAGCAACACGTATTATTTATTAGAAGTGGCTATTATTTCTTTATCTCCGTTAGGTAACGATTTATCTACAATCTATCAGAAATCAAATGATATCGCAATATATTATCTATTCCGTCAGTTCAGCTTTATATCTATAATCTTTCTGGCTGTTTATTCCACCAATGTTAAAAATAAAAGTGTTTTAGAAGATAAAAGAAACATAATAATTGTTGTTTTGTCAATATTAATTCTTTTTATTACTCCGTTTGTAGCAAAAAATCTAAGCAGTGACAATATAAAATATAGTCTTAATATTATACAATACTCGCTGAATCGTCATTTGCCGACGTGGAATATCGTGTACACCAAAATAATATCAGTATTTTGGCTTGTATTACTTATCAGCTCATGCATCAGCATACGTAATTACTCAAAAATATGGTTGTGTATAATACTTATTAGTATAGTGTCAGTATGCAATAATCTAATTTTATTGTATTTTATTGATAAATCCCATCCTGCATGGTATATGACAAAATTTCTTGAATTGATATCAATGATTTATATCATTTCAACACTCATGTATTATGTTTTCAGGAAATTAAATCATGCTAATCATATGGCAATTCATGATCCACTAACGAATACATACAATAGAAGATACTTTATTGACTCATTGAAGAATATATCAAAACACCATGATTTCTCAGTAATAATGTTAGATATTGACAGTTTCAAAAGCATCAATGACAAATGGGGGCATCATATGGGTGATCAAGTCATAGTAATGGTTACCAGAATAATAAAAAAATCCATCAGGAAAGAGGATGTATTAGGGCGCTTAGGCGGTGAGGAGTTCGGTATTATCATTAAAGGTAATACTCAAAAGCTCTTGCTATCAATTGCAGAGCGAATCAGAAAAAACATTGAAGAGCAATGCTCGGAAAAATTATTATCGCATGGACCTGAGAAAATAACTGTCAGTATTGGTTGCTTTACTTCAAAAGAGAATAATCTCAGCCCATCTGAAATGTTAGTCAATGCCGATAAAGCGTTATATCAAGCCAAAAGAACCGGAAAAAACAAGGTGATAATTCACTCAAAATAAACACCTTTTTAAAATACAGCCCCAATAAACTGCAGAATATTATCCCATATAATATCCTGCAGTTCGTAATGCACTATTCGATAATGGGTACTGTTGGCCATTCAATATCCGGTGCAGTTGTTGTATTAACACGGTTCAGCAACACCCGATACTTCTTCCAGGCTTCCAGCAACGAGTTTTCTTCCTCCGTTGCGATCTCCAGATCTACAGCATCCTGAAGTGGCGCAATATGCTCACTGAATTCCTGGATGTAGAACTATGTGGTGACGGTCTTCCAGCCATTCGGCTCCTGCTGTATCGAAGCATACCAGGCTATTTCAATATCGCTATGCTGCGGCAGCATTTAACCCCTTGTAATTCATCACCATAATTGATTTAATTCACAAACAAAACTATAACATGGTGAAATTAATGAAAAAAAACACAGATGATGGGGCTAAAATTTACACACCACTTACCCTAAAGCTTTATGACTGGTGGGTTTTGGGAGTATCAAATCGGCTTGCATGGGGATGTCCTACAAAGGAACACCTTCTTCCACACTTTCTGGAACATTTAGGTAACAACCATCTGGATATTGGTGTTGGAACTGGGTTTTACCTTACTCACGTACCTGAGAGTAGTCTGATATCTTTAATGGATTTGAACGAAGCTAGCCTGAACGCGGCATCTACAAGGGCTGGGGAATCAAAAATTAAACATAAAATTAGCCATGATGTTTTTGAACCTTATCCCGCGGCGTTACATGGTCAATTTGATTCCATTTCCATGTTTTACCTTCTTCACTGCCTGCCTGGAAATATATCTACAAAAAGCTGTGTAATACGCAATGCGGCGCAGGCCTTAACTGACGATGGAACTCTATACGGAGCCACAATTCTTGGCGATGGAGTTGTGCACAATAGCTTCGGTCAAAAACTGATGCGCATTTACAATCAGAAAGGCATCTTTTCAAACACAAAAGATTCCGAAGAAGGCTTAACACATATACTCTCAGAGCATTTCGAGAATGTTAAAACCAAGGTTCAAGGTACTGTAGTAATGTTTTCCGCTTCAGGGAAAAAATAGCATCCAACCGCAGCTCGCTCTTGCTTAAGACGTGCTGCGGCATAATCCCAATGATTACTCCCTGACAGGGTTCGTAGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTCTTCCAGACTTCCAGCAACAAGGTTTCTTCCTCCGTTGCGATTTCCAGCTCAACAACAGTCTGAACGTACCAGGAACAGCCTCCTTCAGGGCTTGAAGGATATCAATGTTCGCTTCCTGTTAACTGCCCGACAAGTGCAACCAGTTCGCTTACCTGATTTTCCAGAGTGGTGATCCGGGTGCCTGCTGTTGCCAGATTTTCACGTAACGTTGTATTTTCCTCTTCCAGCGCGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAACCCCGTCACGGCGGAGTAGTCAACATTAAGATAGCGAGTTTCTTCGCGTAGCTCGTTGCCGTCAACGGTCGGACCTTGCAACTCTTCACCATAATGAGTAAACGATCCCACAGCTTCAGGTATCGCCTCCATTACTTCCTGTGCAATAACGCCAGCATAAGGCATTCCGTTTTCCTTGAGCGTGTAGGTGTACCCGTTCATTTTACGGATTGCTTTCGTCGCGTCGCTGATAACGAGAATATCGTCTTTAAGGTCGCGGTCTGATGACTGATTCAGCGTTGTGCAATTAATAGCGCCATTTACATCAAACAACTGGCCTGCTGACGTTTTTTGCGCATAAAACAGATACGCAGCAGACGTTCCAACCTCAAAAACGTTTTGTCGAGTACTGGAACCCCACACCCTGACAGAAAACGGTAGTTCTGCATTACCTGAGTTCTGTAAAACAAAACGATTGCCAGTCCCTGATTGTTTTGTAAGGGTTAAATCAACTGTTGAGTTAACCTCATCCTTGTTGATAGTGAGCGCCTGCGCTTTATCAGTTCATCCAGCGCGGCTGCTTTGTTCATGGCTTTGATGATATCCCGTTTCAGGAAATCAACATGTCGGTTTTCCAGTTCCGGAAAACGCCGCTGCACCGACAGGGGGATCCCGTCGAGAATACTGGCAATTTCACCTGCGATCCGCGACAGCACGAAAGTACAGAATGCGGTTTCCACCACTTCAGCGGAGTCTCTGGCATTTTTCAGCTCCTGTGCGTCGGCCTGCGCACGCGTAAGTCGATGGCGTTCGTACTCAATAGTCCCTGGCTGGAGATCTGTCTCGCTGGCCTGCCGCAGTTCTTCAACTTCCCGGCGCAGCTTTTCGTTCTCAATTTCAGCATCCCTTTCGGCATACCATTTTATGACGGCGGCAGAATCATAAAGCACCTCATTACCCTTCCCACCACCCCGCAGAACGGGCATTCCCTGCTCCTGCCAGTTCTGAATGGTACGGATACTCGCGCCGAAAATGTCAGCCAGCTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAAAGCCCGTTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGGGGGTGTTTTAAATAAAAACATTAAGTTACGACGAAGAAGAACGGAAATGCCTTAAACCGGAAAATTTTCATAAATAGCGAAAACCCGAGAGGTCGCCGCCCCGTAACCTGTCGGATCGCCGGAAAGGACCCGCAAAATGATAATAATTATCATCTGCATGTCACAACGTGCATCTACGCCATCAAACCACGTCAAATAATCAATTATGACGCAGGTATCATATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACAATACAAATCAGCGACACTAAATACGGGACAACCTCATGTCAACGAAGAACAGAACCCGCAGAACAACAATCCGCAACATCCGCTTTCCTAACCAAATGATTGAACAAATTAACATCGCTCTTGATCAAAAAGGGTCCGGGAATTTCTCAGCCTGGGTCATTGAAGCCTGCCGCCGGAGACTGTGCTCAGAAAAAAGAGTTTCTCCTGAAGCAAACAAAGAAAAGAGTGACATTACTGAATTGCTCAGAAAACAGATCAGACCAGATTGAAGCAATTTAGATAATCGTGCAGACTACGCCCCTCATATCACATGGAAGGTACTACAATGGCTCAGGTTGCCATTTTTAAACAAATATTCGATAAAGTGCGAAATAATTTAAACTATCACTGGTTTTATTCTGAACTAAAACGTCACAATGTCTCACATTACATTTACTATTTAGCTACAGAGAATATTCATCTTGTTCTTGAAAACGATAATACGGTTTTAATAAAAGGACAGGGTAAGGTTGTAAATGTAAGATTTTCAAAAAATAAATGCCTTATAGAAGCCACCTTAAAAGGATTCAAATCAGGAGAGTTATCATTTTACGAATACAGGAAAAATCTTGCTACAGCAGGGGTTTTCAGATGGATTACAAATATCCACGAAAACAAAAGGTATTACTATACCTTTGATAATTCATTACTCTTTACTGAGAACATTCAGAACACTACACAAATATTTCCGCACTAAATCATAACGTCCGGTTTCTTCCGTGCCAGAACCGGACTCGCTGGCATGATGAAATATGTGTACCCGGTAACCCCGGTGTGCATCGTTTTTGATTATTCCCGCACACTCGCGCAGAAGGAGTTCCCCGTCGGGCTACGGTCTCTGTTAATACGGGAATACGGCGACGATACAGCGCATGATGTGTCAGGCTTGAATACCTTTATCCTTTAAAAGGGATATCAGTTAAGTTATCCCGTGTAGGGTATAAACCATTATCAAAGCCACTCTGTAGGAAGTGGCTTTTGTAATGGCAATAAAAAGCCCCGCGAATGCGAGGCTAAATCCTGGTATTTGTAATGACTGGTTCTTATCTCAACGCAGCCCCTTACCGCGCGCAAAATGCTCAATATCAAGCATCAGCAATGAGATGTTTAATCTGGATTCACTCCAGAAGTGAGCACCACCCTGTCTACAGAGCCAGATGTGAAGGATGATGAGTAAAATTATCGCTATCATCGAAGGCATTGCGTCCTGATGTATTCCTGAAGCGTTCTCAGTGCTGTTTGGTCGCGGATAATTCCGTCCCGGATACCGAGAACGTTTCGTCCAGCAACTGGAGAGAGTTCGACGGTGGCATCATTGCCCATGCCGGAGGCGCTGGAGGTTTCGGCTGAGGATGGCACAGGGCATTTTCCTTTGACGAGCACCCGACCACCATTATCAAGCTTGCGCCGAAGAGCATCATTTTCAGCTTTCGCATCAGCTAACTCCTTCGTGTATTTATCATCGAGTGCATCAGCATCACGCTGGCGCTGCTGCATGTCAGTAATGGTGGCGTTCGCCTTCTCCAGTTCACTGGCCTTGTTATCGCGCTGTTCTTTGTAGGCGATTGCATTATCACGGTAATGATTAACAGCCCATGACAGGCAGACGATGATGCAGATAACCAGAGCGGAGATAATCGCGGTTACCCTGCTCATTGTTGCCCCCACAAACAGACCTCACGCTCAATCTCACGACGAGTCATCAGGCCTTTCCATTGCTTACCGCCAGCGTATGCCCAGCGACGTAGCTGGTCACATGCGCCCTTGATATCGCCCTGGTTTATTTTGCGAAGAAGAGCGGATGTTCTGAAATTGCCTGCGCCCACGTTATAGACGAACGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCTACCGTGGCAAGGTCTTTATTCAGGAGGGCTTTGCATTCTGCTTCGGTATACGTTTTACCGGGAATGATGTCTTTTCCTGTATGCCCGTGACATACAGTCCATACGCCAACGATATCTTTGTATGGTATGTAGCTGACACCTTCCAGACCATCGTTACCACTTGGGCCAGTGATTAACACTGATGCTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACGGCTTTTCGTAATGATGGAGGCATTATTCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAAGCTACCCAGCACACCTATTGCTGCCCACTGTGAGGGCGTGACTTTATCGAGCAGCTGTAAAAACCAGTAGCCAGCACTGCCTGCGGAGGTGCCGTAGGCAATGCCCGTTGTTAACTTATCCATGGATTTCATAGCCTCACCTCCGCAAATAACGGATGGTGTACACGGTTCGGAACGAAGAGGAAAGGTATAGAAGTTACATTAGCGTAAGGCTTGAACATCTATTCAAAAAGAAAAACGCCGCGATTATTCTGGCGTAGCTGAAAGCATCATACAATTATCAAATACGAAAATTACAAAATCATTAAAACGCATCACGTTACATCATGTCTTTTTCTAAAAAAAATCTTGATGAATATTGATGGGGAGGAACACCAAAATACCTTCTGAAAACACTTACAAAATATGACGTGTTTTCATAACCGCATATCTCAGCAACTTTTCCAACAGAATATAAATTGTAGCTTAATAACCTTTCCGCCATCACCATTCGCTCTTCAAGAATTAACTTACTAAATGATAAGCCTTCGTGCTTTAATTTTCTTTTTAACAGACTTTCACTCAGATACAGTCTTGAAGATATATCACAAAGTCTCCATGCTGCAGATATATCCGTGTGAATAATAGCCTTAACTTTACTTCCTAAACTATTAAGACATCCAAATAAAAAACTTTGCACTATTTTCTCTGAAGATAAGATAGCAAGACATGCAAGTGATATTTGATTTCTAACAAAATCCACAGTTCTGCCATCACAATTCAAGCATGCAATCAAGTTCTTTAACAATGAAAAATCTTCACATTCCACCATCAAGTATGCCGGATAAAACCTTCTTACAGAAAAAGGTGAGAGTGTGTTGCTTTTAAAGAAATCATTAACTGTTTTTTCTTCAACATCTACGATCATTACATGATCTATATTTGATGAAAAAAATCTTTTAAATTGTAATCAATGAGAACAGCACTTCCTTTTTTAAACAAAATATCTTCTTTACCAATTCGGACATCAAACGAATTCAACACCAAAATGATAGAACATATGTATGGCATATTATCCACCTGATATCATTGGGGTTACACCAGGTAAGTGTAGGTGGAAAATCAATATTCGCCAGTTCAACAATAAGGAAAATTTCATTACATCACAAGTATAAAATTATGTATTTAACTCACAAAGACAAATTATTAAACCAATCTGTTATATTATATATAGCTGCGTGGAATCATAATATCATATATTTTGACTGGCATGTTTACCAACTTTAAGTTGCATCTCAATTGTTTCTTCAGCGTAAACAGAGTTTTTATACAAACTGACACTCTGGGTATCATAGTGTAGTTTTTACGATTGTAAATATCCTGCATGCAGGAACTCATCCTTTTGGATGATATCGCATACAATTAATTTACCATCAGTCTTAGAGCCAGTTCGTCCGGATAGGGATCGAAGTAATTTTGTGTAAGCAAGTAATCATTAGGATACTCACCCAGATAATGCTTCAGCAGAGTCAACGGCGCAAGAAGAGGTAATGTGCCAGAACGATAGTTAAGTATAACCTCGCTCAACTCTTTACGCTGGCGTGTACTTAAGTAGTTACTAAAATACCCCTGTATATGCATCAGCACATTCGTGTGATTTTTACGTGATGCAGGTTTTCTGAGAATCGCCATCAGCTTATCACGATACACCTCAAAGTATGATTCAAGGTCCGCCCACTCGTGTATTGCAGCCACAAATGGTCCCATATCTTTATAGCCTGCCTGACTATGCGCCAACAACTGAAGCTTATAACGACTATGAAAAGCTAATAACTCTCTTCTTGATAATTTCTCCTTGTAAAGGTGATTGAGCTCATGCAAAGCAAAAACTCTTTCAACAAAATTCTCACGAAGCACTGGATCATGTAATCGCCCATCCTCTTCAACCGGTAGCCAGGAAAACTTTTCCATCAAAGTGCTCGTAAATAGTCCCACTCCATCTTTACGACCTCGATTACCATTTTCATCATAGACACGCACGCGCTCCATGCCACAGCTGGGAGATTTAGCACAAACCACAAACCCCGATACATCCTTTAATTTGTCCATATAAGAACGACTAAACTCTGTCATTCTCTCTGTCACATCCTCATTCTGGTCGTGGCTGAAACACATCCGTATATTTCCTTGCATCGAGCGCACAAGACGTAGAGCAGGACGCGGAACTGGCAGCCCTATAGCCATTTCCGGACATACTGGTCTGAATGTTACCCATTCCACTAATTTGTCCATTAAAAAGTCAGCTCTTTTGTGACCACCATCAAAACGAACAGCAGAACCGGCCAAACAACCGCTGATTCCAATCACAGGTTTTTTTATCATATTCTCCCCCTTGACTAATTCATTAACACATAAACTGTGTAGTGCACGGAATAAATTGCCTTTCTGGCGTCATCACTGACAATTTTTCTGTTATGGACTATTCCTAATATAGTATGAAAGTTCTTTAAGTGATCGGTCGTAATCATCTATCTTTCATACTTACTCTCAACTATCAAAAGTACAGGATTTATTATGAAGTTATGGCCTGTGTTGACTGGCATTGCACTCTCTTTCACTCTTATAGCATGTAAGGCCCCGACACCACCTAAAGGTGTGCAGCCGATTACAAATTTTGACGCCAACCGCTACCTCGGAAAATGGTATGAAATAGCTCGCCTCGAGAACCGGTTCGAACGTGGTCTGGAACAGGTCAGCGCTACTTATGGAAAACGGAACGACGGAGGGATTCGTGTACTTAACCGTGGATACGATCCAACGAAAAATAAATGGAGCGAGAGCGAAGGTAAAGCATACTTTACTGGAGATACTAAAACTGCAGCATTGAAGGTTTCGTTTTTTGGCCCCTTCTATGGTGGCTATAATGTAATCAAACTGGATGATGAGTATAAGTATGCTCTTGTCAGTGGTCCGAACAGAGAATACCTATGGATTCTGGCAAGGACCCCAACTATTCCAGATAAAGTAAAAGCAGACTATGTGCGAACCGCTCAAAAGTTGGGATTCAATGTCAATGAATTATTATGGGTTAAACAATAAAATCCCTACCCGAAATAATACTTATTAGAAAAAAACCAGCCTTTGGGGAGGCTGGCTAAATCAGGAAACAAGCTGTTATATGATAATAACTACGTTGTGATTCCAACATTTAAAATGTTAGACTAATGACAATCAGACAGCAACTTTTCCTTTAATTATTTCGAACAATCAGCATCCATCTCCAATCGGAGATCCAACACCATCAGCATACCCTCCACTACGCCCTCAGCTTTCTGAAGCATCCTGCCAACCCAACAATCAGATCGCCCATGCTTACGTGCAAGCGCCATAAAAGTCATGCCGCCGACATAATAGTCCACCAATAAATCATGCAAATCGCTGTTGTTCTTTTTCAGACGGGCCATGCACCCGCAAATGATCATCGCATCATCGTCACAACATTGCGGGCGAGATTTTACTTTTGAAGGAATTAATCCCTTAAAACCGGCGGCAATGGACGACCAAGTCACATCTTCATGATTATTAGCCGCCCACGCTCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTACTCCACAAAAATCAGACCAGAACGCCAATCACAAGCAAAAATCAACAAAACAGTATTAGTTGATTGTTATCTCTGACTTCATACTCCTGCTCCTGTCAGGGTTTTGGCGTAATTCTTCAGTATTCGGTAATCGGTCAAAACAGAACCGGGGAAACGATATAAGCGCAGATGCCCCCAGCGGTGGCGAAGAAGTTCTGCCATATAAAACTCAAACATCATTCATTCCCCATTTCGGTGATGGTCAGTTCCAGCCTCCCACCTTTGGTAACAGGCATCTTCACAACGCGGTAATCAACGACCTGAGCATCATCCAGCCAGAAACCTGCTTTAGTGAGTGCGTCAAAAGCGGCTTTTTGCAGATTATCCAGGTCACGGCGACGGCGATCCGGCATGTGGCACTCAATGCGGATTTTCACAGGCATAGCCAGGCCGATATCCAGCATTGCGTTTTTAATGATTCGGGCGACGTTATCGCGGTATGCCTGCCCCTCTGCGCTGACGTGCGTGCGCCCGCGATTATGGCGGTAATAGCGATTATTGCTCGGAGGCCAGGGTAATGTGATGTTGTAGGTATTCACGCCTTGATTACCCCCTCTTTCATCCAGATAACCTGCGTTCTCGCCATACCTTCCAGCGCGCATTCTTTTGCATATGCGGCATCGACAAAATGTGTGCGGCGGTCGATTTCGTCGTGGCAGGCAGAACATGCAATGGTGGCAATCAGGTCTGGCGGTTTGGTACCGGTGCCGCACAATCCAGTCAGCCGGATATGTGCCAGTACAGACGTTTCAGGGTTGCCATTACATACGCCAGGGGTTCTTACCTGGCATTCCCGACCACGCGCTGCTTTTCTCAAATCAGCCATGATTCCTCCTTGCTGCCAGTCGCAACCATTTTTTATCAACCAGGCTGGCGGTATATCCGAGCAGTGTTGGTATTTCGGATGGCTTCAGCTCAGGTTTACGCTTACGACGATTTGGTACTCTGTAGATGTGTCCGTTCATGACACGAATAAGCGGTGTAGCCATTACGCCTCCTGCTTGTCGCGCAGCAGCTGGAACTCGCAGCTCTGCGGAATAGTCAGGTGGCAGCCAATATTCATCGCCCAGGCTTCAACCTTACACAGGAAGACATACATCTCTCCGGCATCAAGATCGGAGGTATGGCGTAACGACTGGATAGTGGTGATTTCACCGGTTACGACATCAACCAGTTCTTTGGTTTCATAACCGAGGTATGTGTGTTTGAGAGCTTCTTTTACCCATGCTGCGGTAGCGAACGATTTCCCCCTGCTGATAAGGTATTCACTGATTTCGCTGTACCACATGTGGCTGAGTGCATTCTGGGAAAGACTGCGTTTCTCACGCCACGGTTTAAGCACCATGCGAAAGCATTTTCCGTCCTCCAGATAAGGCTGGATCTGCTGGCCGATAGCGGTGAAGTTGCCGCGATGTAATTTGATGCCATCTTGTGGTAGGTTCACGCTTCACCTCCGCAGAGGTCAAACGCAGGATGCAAAAAATCGCAGGTGCATTTCTGCATCTGTGAAGGGAGAAGAGAGTTTGGATTGTATGTGCGCATAAACGTCCCCGTTTAGCGCAGAAGTCACCGGAGTTGTTCAAGCTCCGATGACTTTATTATTACGAATTGATTTTACAAAATCAAAAGGTATGTTAGTGACGCGGGTCTGTTATTATGCGAGAAGGGTTTCCGTATAAAACAAGGACCTTACTTCCTTGAGTAAATAACGGATCTTTGCCTTGAACAATGGTCATTAAATTCCCATTCTCAGTTTCGACAACATATTCCATGCCTGTTTGTTTTGTTGCTGAAGATTCGATTGCTGCCCCGGCAATACCACCAATGACTGCACCACCAACGGCACCAACGATATTAGAACGAACTCCCCCACCAAGCGCAGAACCAGCGGTTGCCCCCACGGCAGCCCCAGCAGTCCCGCCTAACGCGGAAGTCCCACTGATATCAACCCCCCTGGCACTAATAACTGTACCAGCGATAGTTCGATTAACCATGCCCACAGAGCCAACAGAATAACTATTTGGCGATATATTTTGTGCGCATCCAACCAACACTAAGAGTGGAGCAATTACGAATAATCGCTTCATTTAGCTACCCTAACAGGAAACATTGGACGAGAAAGATCAACACTTTCTAATGCTTGCAAGAACTGCGTTATGTTGTTTTGCACCGCGCGATTAACAGATTCGCGTGCTCGAACAATACCGTAGAATGCGTAACTGGCTGGAACAGTACCGGTAGACTCAATATCCTGCGTATATATAATATCACCATTCGCACGGTTGATTATTTCATACCTTGCAATTGCTTTAGTTGTCATTGAAACACCAAAAGCAGGAACGTCAAGAGCCAACACTTTAACATTTAAGCTAACCGTATTTGGTGAACTATCACGAAAAATAGTCATTCGGTCGAGTGCTTCCTGCAAAGATTCACGCCAAATTGGAGTTATAGCCTCCATACCAGCAGTGATATCCCCTTTCTGCTCATCTGGACGAGCAAGTGATACCGTTAATGACTTAATTTCAGCATCTATTTTTTTCTGGCTAACTCCCACGTTAGGTGTTGAAAAATTCAATGGTGGCACACTAGCGCAACCTGTTAAAGAACCAATAATCATGGCTAATAATATTATCTTCTTCATAAATTTACCTTATTGTTATAACCAAAGGAATTATAAAGTAAAAAAGTTCACTATCACTAGCCATTAACGACATCAATTTCAGAGAAACATGGTACTCATTTCCACAAATTTGACACAAGTCATTTTCATCTACATATTCCATCATACTTGATGCATATGTTATTGAAGCCTCTATCCTATCCGTTCATAATAGCAATAGTTACCCGGGTGATAGTACCTCTATGATTACTCGTCTTTCTGATTGATTGGATTAAATATGCGCGCCAAAATTTATCAACTTTCGTTATGGATATTTATTTCGTTTCTAGCGATCTATGCCTTTATTATCTATAAAGGTTCTTATATTGGAGTAGCATTGCATCAAATTGCTTGGATCATCATTATTGCCTCTGGCTTGATTGCTAGGCTAACTAAACCAAAGCAAAAACCAATTTCGTCCAATAATTAGACATGTATTAAAAAATGATATTTTTATGTACATAGTCTATTGAAAATTGCCGCGATAAAATGCCAACACCCGCTTCATCGCGGCACTCTGGCGACACTCCTTGAAAATCAGATTCGTGCTCACCTTTCCTTCCCGTTCTTCCCTGGTAGCGAACCGGTAATACACCGTTCGCCAGACCTTACCATCAATAACTAAGATTCCTGTCCGCGCCATTTTAGCCGCAGCCTGGTTTATGCTGGTTACTGTTGCGCCTGTTACCGCAGCAACGTCCTGCGCACAGAAGCTCTTATGCGTCCCCAGGTAATGAATAATTGCTTCTTTTCCCGTCATACACTGGCTCCTTTCAGTCCGAACTTAGCTTTGATTTCTGCAATCTTCGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGCCCATGACAGGAAGTTGTTTTACTGGTTCAGGGATCGCCTCACCACGGTTAATTCTCGCAGTCATATGGACAAGCTCATCTGCGGCCTTACGGCGTAATTCCGCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGTAACCAGCCAGTAGTGCGCGTTTGATTTCCACGGATAAGACTCCGCATCCGGATACAGGCCTCGCTTCCGGCAATACTCGTAAACCATATCAACCAGCTCGCTGACGTTTGGCAGTCCGGCGATAACGGATGCTTCTTCCCGGCACCATGCAACAAACTGCCCGGGTGATGGCAGGAATGGTCGATTCTGCCGACGGGCTACGCGCATTCCAGCGTTAACCTGTTCCATTGTGGTGATCCCGTTTTCCCGAAAAGCCAGCACCCACTGGCGGCGGATTTCGTTCAGTTCGTTCTGGTCCCGGTTAGCCAGGCTCGCTGGGAAAGTTGCCAGTAACTGGCTGAATACACCGTTGATTATCTGCGCTACCTGCTGTACCTGCGGCTTTTCGTCGTACTGTTCCGGCATGTTATTGGCGATCCGGCACATCTGCTCACGGTCAAAGTTAACCATCTGTGCGGCGATGTTTTTCATAAATCCACCCCGTAAATCCAGTCAGTGTTCGTCAGGTCGAGTTTTGGTTTGCCGGCTGTCACGCCAGCCTGTTGCTTGTTTCGGTTGATTTCGAGCTGGGTCCACTTGTCGCGGAGTTTGGCCGGACTCAGCACGTTACCGGACCAGAAGTTGTCCTGGCAGGCCCAGCGGAACAGTACACACATGTCGCGGTGGTTACGTCCATCACGTTCACGCATCAGACGGATATCGTTAGCCCACCCTGCAAAATTCGGTTTTCTGGCTGATGGCGCGATGGTCTTCACCATGTCAAACATCCACTCTGCGGCGGTCAGGTCTTCTGCTGTCCCCCACTTGCTGCCGTTCTGAATCGCAGCATCCGCTTTCACCACAGGAAGGTCGTTTTCTGGCAGGTCAGAGGATTCGCCAGAATTCTCGGACGAATAAGGTTTTATATTGTCTTTTGTTAGTTTGTCTTTTGTGTTTACCTGATTCGGGTAAGTGCCTTTACCTGATTTGGGTAAACTTTTCTTACCTGATTCAGGTAAATTTACCTCTTTCAGGTAAACTTTATTTTTCTTACCTGATTCGGGTAATGTTGACCATTCACTGACCACATTATTAATGCCTATATTCCGCCCGCTCTGAATAAAAATCCCACGCTTTACCAGAACACTTTTTGCAGCAGAACACTTGTGCGGCAATATCCCGGTCAACTCGGAAAGTTGCTCGTTGCTCACCCAATCCAGTTTTTTATTAAAGCCATATGTTTTGCGCATGACAGCCAGGAAGACCAGAAGCTGGTGCTGTGTTAATCCGGCCAGCATTACAGCTTCCAGCAACTCATTTGCAATGCGCGTATAACCATCATCGAGATCTGCCACGCGCGGCTCCTTTTGTGCCGCATCCGGCACTGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCAGCAGACCGGCTTTCGCCATTTCTGAACCTGTCATATCGCCCCCAGCATGGTAGTAACCATCGCCATCAATGGACCAGCCAGATCTGGGTCCACACGAAACATCGACACAATACCTTCACTAATTTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACAGCCTGTTTTGCCTCACTGAGTTCCTTTTCCATTTCAGCCAACCTAGCCATGAAGCTATCCTGCTCAACCAGGTAACCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATGGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAAGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTGGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATAGACGGCCTGCTGTGAAACTTCGCAAGCAGCGCCCAGTTTCTTTTGTGAACCAACGATATTGATCGCTGTTTTGATAGCTGGGTTCATAACAACCTCCGTGGTTAATTTGAATCAAGATTAAAACTATGGTTGTTTTTAGTCAACAACCATTTTCGTTTGATGGAATAAAACCTTGGTTGTACATTTGGACTATGAAAACAACACTCTCAGAAAGACTTAAAGAAGCCAGATTAGCGCGAGGCCTTACACAAAAGGCGCTTGGGGATTTGGTCGGGGTTAGCCAAGCTGCTATTCAGAAAATCGAAACAGGGAAAGCTAATCAAACAACTAAAATCGTGGAGATCGCGAACGCTTTGGGTGTGCGCGCAGAATGGTTATCTTCTGGCGTTGGAAATATGTCAGACAGTACAGTGCAACCAATACAATCAACTGTCAGCCATTCCAAATACTTCAAGATTGACGTTCTTGATATAGAAGTCAGTGCTGGGCCGGGAGTCATCAACCGTGAGTTTGTAGAAGTTCTACGCTCGGTTGAGTACTCGTTTGACGATGCTCGTCACATGTTCGATGGTAGGAAGGCGGAAAATATCCGCATCATTAACGTGCGTGGTGACAGCATGTCAGGAACGATCGAACCAGGTGATCTGCTGTTCGTTGATATCACAGTTAAATCTTTCGACGGTGATGGTATCTATGCGTTTCTGTACGACGACACAGCCCATGTAAAGCGCCTGCAAATGATGAAGGATAAGCTGCTGGTCATCTCTGATAACAAAAGCTACTCACCGTGGGACCCGATCGAGAAAGACGAGATGAACCGGGTGTTCATCTTCGGTAAGGTTATTGGGAGCATGCCGCAGACATATAGGAAGCATGGTTAAAGTGAGGCTAAAAAACAGTTACAGCAATAGGCCTGTTGTTTTTCTTTAAACACGCAGTGTTAAACCGCTCTTTGAGATGCGGAGTAATGAGATGGAAGACTTGAATCACATAAGGGTTAGTGATGGAGTGCGTAGCGAGCAGCAATAGTGCAATACCTAATGTCGTTGAAGTAATACGTCGCATCAATGAAGGTTCCACTCAGCCATTTCTTTGCAAATGTGATGATGGGCAGTTGTATGTTTTGAAGTCAAAACCATCAATGCCCCCGAAAAATCTCTTAGCTGAGTTCATTTCGGCGTGTTTGGCTAATGATATCGGCCTTCCTTTACCTGACTTTAAAATCGTATTTGTGCCAGAGGAACTTATAGAGTACTCACCTGATCTGCAGCAACAAATTTGTACAGGATATGCCTTTGCTTCATTGTTCATTGACGGTGCAATAGCGTTAACGTTTACGCAGTCAAGAAACGAAACGATCATCCCAGTCGAACAGCAAAAATTAATCTATGTTTTTGATAAATGGATATTAAATGCAGACAGAACGCTTACTGACAAAGGTGGAAACGTTAACATCCTTTATGACATCAGTAACGATAAGTATTATCTGATTGACCATAATCTCTCATTTGATCAGAATGCTGGACCTGAAGATTTTTCTGTGCACGTGTACGGCCCTGGTAACCGCAAATGGCAATATGATTTAGTGGATCGCGTAGAGTACCGCCAGAGGGTCGTTAACAGTTTACACAAGCTTCCTGCTATCCTTGACGAAATTCCAGAAGAGTGGATAGTAGATGAGGAGTTTTTACCTTTTGTCTGCACTACGCTAGACAAAGGTGATTGTGATGAATTTTGGAGCGCAATAGAATGACAACTCCATGCCTATATAGCATCGTTCGCTATGCGCCTTATGCGGAGACTGAAGAATTCGCAAACATAGGCGTACTTCTGTGCGCGCCAAAAGAAAATTACTTTGATTTCCAGCTCACAAAGCGAAATGACTCTCGTGTAAAGAATTTTTTCCATGATGATTGTATTTTCCCTGTAGCAAAAGACTCAATACAAAGAGAACTACAGTTCGCAAAAATGCATGCGACCCAGATTGTTGGACATCAACAACTTGCACAATTCTTCAGATATTTTACAAACAAAAAAGAATCAATTTTTCAGTTCAGTTCTACGAGAGTGATTCTCAGCGAAAACCCAAAAGAAGAGCTGGCCCGCATTTACAATAAATATGTAAACCACTCTGACTACACAAAAGAGCGCCGTGAAGATGTTCTAGCCAGAGAGCTAAAACGAAGTATCGATAGAATAGATGGATTGAAGAACGTCTTCAAACAAGCAACCATTGATGGGTATTTCGCAAAGTTCTCAATGCCATTGGTCGCCAAGAAGCATGACAGGATCCAATGTGCCATCAAACCTCTGGCATTCACTCAAGCTGAACCAGGAAAAATGATGGAGCATAGTGATACTTGGGTGATGAGAATAACTCGAGCAGCAGAAGAAAACCTGCTTTCACTTGATGACATTTTATTCACAATTGAAACTCCTGAATCACCAAACTCAGGCCAAAGCAAAGTTATTGACATCATAAAGAGAACTATGGATGCTAAGAAAATAAATCATATACCTGCATCCAACCACAAAGAAACTATTGATTTTGCAAAAAAAATACTTCCCCAAGTTTAAAATTTATTTTTGTATGTGATATTCCTTATTAATAACCCGGCCACCGTGCCGGGTTTTCTTTTGCCTCCCCTCATCACACAAACCGCTCAAAAAACCACCATAACCTCGCTTCAGTTATCGCTATGCGATTCAAGTCACAAAATAAATCCATCCTAAATACAACCAGTTATATCTAAAACAACCAATAAAACAACTTTTGTTGTTGACGGTAAAACAACTATAGTTTTAAATAGGTTCATCGCAACAACACAACGATACGGCAACCACCTGATTCACCGTTGCGATGACCGCTTAGATCCGCAGTTTGAATTTCAGCAGGCTTCGGGGAGTGCGAGGGGTGAAACGGACGCGTGAACGTCGGTGTGACCAGCTGAAATCAACTCAACATTTCATACCTTAGTCGCTTCAACGAGGCGGCTTAGTTATGACAACCGGCGGCCATCCACCGCCTGAATACGCGCAGAAGTCTCTATATGTTCAGCAGCCCAGCTTACGGGCAGGAGTTTTTATGGTTCATCAACATTACGGAACGCAGACCGTTAATCGCGGCGCGGTCATGCCAGGAATGCTGGTCAAACACAAAGATGGTACCTGGACTGCATCAGCTAATTTACGCGGACGGCTTTATCTGCATCGCGGCATCGAGCGCACTTATACCCGTGATTTGCTCGTGGAAGTTTTTCTCGACGGACGCGGTAACGGCCTGAATCACTAATCCCCTTTCCTGTTTTCCTAATCAGCCTGGCATTTCGCGGGCGATATTTTCACAGCCATTTTCAGGAGGTCAGCCATGAACGCTTATTACATTCAGGATCGTCTTGAGGCTCAGAGCTGGGCGCGTCACTACCAGCAGATCGCCCGTGAAGAGAAAGAGGCAGAACTGGCAGACGACATGGAAAAAGGCCTGCCCCAGCACCTGTTTGAATCGCTATGCATCGATCATTTGCAACGCCACGGGGCCAGCAAAAAAGCCATTACCCGTGCGTTTGATGACGATGTTGAGTTTCAGGAGCGCATGGCAGAACACATCCGGTACATGGTTGAAACCATTGCTCACCACCAGGTTGATATTGATTCAGAGGTATAAAACGGATGAGTACAGCACTCGCAACGCTGGCTGGGAAGCTGGCTGAACGTGTCGGCATGGATTCTGTCGACCCACAGGAACTGATCACCACTCTTCGCCAGACGGCATTTAAAGGTGATGCCAGCGATGCGCAGTTCATCGCATTGCTGATCGTCGCCAACCAGTACGGCCTTAATCCGTGGACGAAAGAAATTTACGCCTTCCCTGATAAGCAGAACGGCATTGTTCCGGTGGTGGGCGTTGATGGCTGGTCCCGCATCATCAATGAAAACCAGCAGTTTGATGGCATGGACTTTGAGCAGGACAATGAATCCTGTACATGCCGGATTTACCGCAAGGACCGTAATCATCCGATCTGCGTTACCGAATGGATGGATGAATGCCGCCGCGAACCATTCAAAACCCGCGAAGGCAGAGAAATCACGGGGCCGTGGCAGTCGCATCCCAAACGGATGTTACGGCATAAAGCCATGATTCAGTGTGCCCGTCTGGCCTTCGGATTTGCTGGTATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACTGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATCGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTTAAACAGAAAGCCACTGAGCAGAAGGTGGCAGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGGGATGATGCATGGCACAAATTACGGCTCGGCGTCATCACCGCTTCAGAAGTTCACAACGTGATAGCAAAGCCCCGCTCAGGAAAGAAGTGGCCTGACATGAAAATGTCCTACTTCCACACCCTGCTGGCTGAGGTTTGCACCGGTGTGGCTCCGGAAGTTAATGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAACGACGCCAGAACCCTGTTTGAATTCACTTCCAGCGTGAATATTACTGAATCCCCGATCATCTATCGCGACGAAAATATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAACTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAATTCCGGCTCGGTGGTTTCGAGGCAATAAAATCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACGCGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGCATGAAGCGTGAAGGCCTGCATTATGTCGTGGTTGAGCGGGATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGTGCGATCACTTTCGTCTACTCCGTTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCCCTGAAAGCACAGCGGCTGGCTGAGGAGATAAATAATAAACGGGGAGCTGTATGCACAAAGCATCTCCCGTTGAGTTAAGAACGAGTATCGAGATGGCACATAGCCTCGCTCAAATTGGAGTCAGGTTTGTGCCAATACCAGTAGAAACAGACGAAGAATTTCATACGTTAGCCGCATTCCTTTCACAAAAGCTGGAAATGATGGTGGCGAAAGCAGAAGCAGATGAGAGAGACCAGGTATGACAACCACTGAATGCATTTTTCTGGCAGCGGGCTTCATATTCTGTGTGCTTATGCTTGCCGACATGGGGCTTGTTCAATGACACCTCAGCAAGAAAACGCCCTTCGCAGCATTGCCCGTCAGGCTAATTCTGAAATCAAAAAAGCCAGACAGCAGTTTCCGGATAAAAACGTCGATGACATTTGCCGTAGCGTACTAAAGAAGCACCGCGAAACGGTAACGCTGATGGGATTCACACCGACTCATTTAAGCCTGGCGATCGGCATGTTGAACGGCGTCTTTAAGGAACGGTGAACATGAAAAGCAAAATCATCAGGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATAATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGATAGATGAACGAAGACGACTGGCCGTTCAGGGTTGTCGGACTTGTGCCAGTTGCCAGCAAGATCTGGAGCTTATCAGTAAACAGAGAGGTTCGAAGTGAGCGAAATTAACTAGAAGCCAAAGATAAAATCATCGCTGAGCAGGAGAAAATCGCTAACGGAGAAAAGACAGTAAGTCAGTATATGAAAACCGCATGATATCATCAGATAAAAATCGGTCGTAAAGCGAAATATTAATACCAGAACAAACGAGTCGAGGTAAATTATATTACCTCGATAAATTAACTAAAACTTGCCCGCTATATACTATATCATTCAGTATCATCACGCGCGGTCTGTGCATATGTCACTACCGCACCTAATATATTAATTTTCTTTTCAACATAGATAATATTATCGTACTCATAATTGCCATACGGATAGCAAATGCGAATATTCTCATGTAGATCGGGGTCATCCACCTCAGCTCCAGAACAACTTTTTGAACTACCGGAAGTATACCGATACGGTGCAACATAAGACGATGTCTCTCCAGGCAAAAAATAAGTTAGTGTCGTAAGGGGTATAATCAGAAAAAATCCAGCAAATATGCACATCCCTGCATAAACCTTAAGGTATGCTGACAGACTCTTCCAGCCGCTTTGTTTTACTATCCCCTTCTTAACCCAAAACAGAGATAACAGAAAAGCTATTCCCATGCTAAACAGAATGTAATAGTGGGATATACTCTGATTAAGAAACGTGACCCTGTAGATATCTGCCCGCCACCAGAAGAAAAGGAAAATAAAGATCAGCCCTGAAACTGTCATGCAAATCAAATAAGGATACGAATCTTTTTTCATGTTTAGCGCCCATAAAATTTTTCCTGACCCGGACAAATTTACCATCCATTTTTTGCGCAGAAAATAGCTCATTACTTACTGCACAATAATACACAAAATTGCGTAAATTTTTTGCATGGATTTTAGCTCTTTCAGCCGACATTTAAGGGGTAAATAGCATTTCCTAAAAGCAACTGCACCAACCCAACAGAATGGGCTACCGCTTACGTTGAGAGCAAAAAAGTGTATAGCAGCAATGAACAGCATCCTCGCACTGACGAGGATTTCTTTTATCTGAACTCGCTACGGCGGGTTTTGTTTTATGGAGATGATAAATGCACTTCCGAGTCACAGGTGAATGGAATGGAGAACCATTCAACAGAGTTATCGAAGCCGAGAACATCAGCGACTGCTATGACCACTGGATGCTGTGGGCGCAGATAGCACATGCAGACGTAACCAATATTCGAATTGAAGAACTGAAAGAACACCAAGCCGCCTGATGGCGGTTTTTTCTTGCGTGTAATTGCGGAGACTTTGCGATGTACTTGACACTTCAGGAGTGGAACGCTCGCCAGCGACGCCCAAGAAGCCTTGAAACAGTTCGTCGATGGGTGCGCGAATGCAGGATATTCCCTCCTCCGGTTAAGGATGGAAGAGAATATCTGTTCCACGAATCAGCGGTAAAGGTTGACTTAAATCGACCAGTAACAGGTAGCCTTTTGAAGAGGATCAGAAATGGGAAGAAGGCGAAGTCATGAGCGCCGGGATTTACCCCCTAACCTTTATATAAGAAACAATGGATATTACTGCTACAGGGACCCAAGGACGGGTAAAGAGTTTGGATTAGGCCGAGACAGGCGAATCGCAATCACTGAAGCTATACAGGCCAACATTGAGTTATTTTCAGGACACAAACACAAGCCTCTGACAGCGAGAATCAACAGTGATAATTCCGTTACGTTACATTCATGGCTTGATCGCTACGAAAAAATCCTGGCCAGCAGAGGAATCAAGCAGAAGACACTCATAAATTACATGAGCAAAATTAAAGCAATAAGGAGGGGTCTGCCTGATGCTCCACTTGAAGACATCACCACAAAAGAAATTGCGGCAATGCTCAATGGATACATAGACGAGGGCAAGGCGGCGTCAGCCAAGTTAATCAGATCAACACTGAGCGATGCATTCCGAGAGGCAATAGCTGAAGGCCATATAACAACAAACCCTGTCGCTGCCACTCGCGCAGCAAAATCAGAGGTAAGGAGATCAAGACTTACGGCTGACGAATACCTGAAAATTTATCAAGCAGCAGAATCATCACCATGTTGGCTCAGACTTGCAATGGAACTGGCTGTTGTTACCGGGCAACGAGTTGGTGATTTATGCGAAATGAAGTGGTCTGATATCGTAGATGGATATCTTTATGTCGAGCAAAGCAAAACAGGCGTAAAAATTGCCATCCCAACAGTATTGCATGTTGATGCTCTCGGAATATCAATGAAGGAAACACTTGATAAATGCAAAGAGATTCTTGGCGGAGAAACCATAATTGCATCTACTCGTCGCGAACCGCTTTCATCCGGCACAGTATCAAGGTATTTTATGCGCGCACGAAAAGCATCAGGTCTTTCCTTCGAAGGGGATCCGCCTACCTTTCACGAGTTGCGCAGTTTGTCTGCAAGACTCTATGAGAAGCAGATAAGCGATAAGTTTGCTCAACATCTTCTCGGGCATAAGTCGGACACCATGGCATCACAGTATCGTGATGACAGAGGCAGGGAGTGGGACAAAATTGAAATCAAATAA
Protein sequences of DBSCAN-SWA_7 >CP031653|3283362:3310566|3284710_3285187_+|AXP27595.1|DBSCAN-SWA MKLISNDLRDGDKLPHRHVFNGMGYDGDNISPHLAWDDVPAGTKSFVVTCYDPDAPTGSGWWHWVVVNLPADTRVLPQGFGSGLVAMPDGVLQTRTDFGKTGYDGAAPPKGETHRYIFTVHALDIERIDVDEGASGAMVGFNVHFHSLASASITAMFS >CP031653|3283362:3310566|3305317_3305614_+|AXP27632.1|DBSCAN-SWA MNAYYIQDRLEAQSWARHYQQIAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKAITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV >CP031653|3283362:3310566|3308974_3309160_-|AXP27640.1|DBSCAN-SWA MFSASITLLNGSPFHSPVTRKCIYHLHKTKPAVASSDKRNPRQCEDAVHCCYTLFCSQRKR >CP031653|3283362:3310566|3302075_3302831_+|AXP27628.1|DBSCAN-SWA MVVFSQQPFSFDGIKPWLYIWTMKTTLSERLKEARLARGLTQKALGDLVGVSQAAIQKIETGKANQTTKIVEIANALGVRAEWLSSGVGNMSDSTVQPIQSTVSHSKYFKIDVLDIEVSAGPGVINREFVEVLRSVEYSFDDARHMFDGRKAENIRIINVRGDSMSGTIEPGDLLFVDITVKSFDGDGIYAFLYDDTAHVKRLQMMKDKLLVISDNKSYSPWDPIEKDEMNRVFIFGKVIGSMPQTYRKHG >CP031653|3283362:3310566|3303699_3304527_+|AXP27630.1|DBSCAN-SWA MTTPCLYSIVRYAPYAETEEFANIGVLLCAPKENYFDFQLTKRNDSRVKNFFHDDCIFPVAKDSIQRELQFAKMHATQIVGHQQLAQFFRYFTNKKESIFQFSSTRVILSENPKEELARIYNKYVNHSDYTKERREDVLARELKRSIDRIDGLKNVFKQATIDGYFAKFSMPLVAKKHDRIQCAIKPLAFTQAEPGKMMEHSDTWVMRITRAAEENLLSLDDILFTIETPESPNSGQSKVIDIIKRTMDAKKINHIPASNHKETIDFAKKILPQV >CP031653|3283362:3310566|3290917_3291082_+|AXP27604.1|DBSCAN-SWA MMKYVYPVTPVCIVFDYSRTLAQKEFPVGLRSLLIREYGDDTAHDVSGLNTFIL >CP031653|3283362:3310566|3298965_3299157_+|AXP27622.1|DBSCAN-SWA MRAKIYQLSLWIFISFLAIYAFIIYKGSYIGVALHQIAWIIIIASGLIARLTKPKQKPISSNN >CP031653|3283362:3310566|3298148_3298709_-|AXP27621.1|DBSCAN-SWA MKKIILLAMIIGSLTGCASVPPLNFSTPNVGVSQKKIDAEIKSLTVSLARPDEQKGDITAGMEAITPIWRESLQEALDRMTIFRDSSPNTVSLNVKVLALDVPAFGVSMTTKAIARYEIINRANGDIIYTQDIESTGTVPASYAFYGIVRARESVNRAVQNNITQFLQALESVDLSRPMFPVRVAK >CP031653|3283362:3310566|3288276_3288414_-|AXP27599.1|capsid|DBSCAN-SWA MAYEPCQGVIIGIMPQHVLSKSELRLDAIFSLKRKTLLQYLEPWF >CP031653|3283362:3310566|3305619_3306405_+|AXP27633.1|DBSCAN-SWA MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKATEQKVAA >CP031653|3283362:3310566|3296104_3296245_-|AXP27614.1|DBSCAN-SWA MMFEFYMAELLRHRWGHLRLYRFPGSVLTDYRILKNYAKTLTGAGV >CP031653|3283362:3310566|3297699_3298152_-|AXP27620.1|DBSCAN-SWA MKRLFVIAPLLVLVGCAQNISPNSYSVGSVGMVNRTIAGTVISARGVDISGTSALGGTAGAAVGATAGSALGGGVRSNIVGAVGGAVIGGIAGAAIESSATKQTGMEYVVETENGNLMTIVQGKDPLFTQGSKVLVLYGNPSRIITDPRH >CP031653|3283362:3310566|3305035_3305242_+|AXP27631.1|DBSCAN-SWA MVHQHYGTQTVNRGAVMPGMLVKHKDGTWTASANLRGRLYLHRGIERTYTRDLLVEVFLDGRGNGLNH >CP031653|3283362:3310566|3309495_3310566_+|AXP27643.1|integrase|DBSCAN-SWA MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >CP031653|3283362:3310566|3309092_3309260_+|AXP27641.1|DBSCAN-SWA MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >CP031653|3283362:3310566|3306401_3307082_+|AXP27634.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSSVNITESPIIYRDENMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP031653|3283362:3310566|3290171_3290405_+|AXP27602.1|DBSCAN-SWA MSTKNRTRRTTIRNIRFPNQMIEQINIALDQKGSGNFSAWVIEACRRRLCSEKRVSPEANKEKSDITELLRKQIRPD >CP031653|3283362:3310566|3301806_3302037_-|AXP27627.1|DBSCAN-SWA MNPAIKTAINIVGSQKKLGAACEVSQQAVYKWLHNKAKVSPEHVGSIVTATGGVVKAYQIRPDLPKLFPHTEKNAA >CP031653|3283362:3310566|3289222_3289783_-|AXP27600.1|terminase|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIKRRRSLSTRMRLTQQLI >CP031653|3283362:3310566|3287705_3288332_+|AXP29197.1|DBSCAN-SWA MYTPLTLKLYDWWVLGVSNRLAWGCPTKEHLLPHFLEHLGNNHLDIGVGTGFYLTHVPESSLISLMDLNEASLNAASTRAGESKIKHKISHDVFEPYPAALHGQFDSISMFYLLHCLPGNISTKSCVIRNAAQALTDDGTLYGATILGDGVVHNSFGQKLMRIYNQKGIFSNTKDSEEGLTHILSEHFENVKTKVQGTVVMFSASGKK >CP031653|3283362:3310566|3307435_3307717_+|AXP27636.1|DBSCAN-SWA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >CP031653|3283362:3310566|3299193_3299487_-|AXP27623.1|DBSCAN-SWA MTGKEAIIHYLGTHKSFCAQDVAAVTGATVTSINQAAAKMARTGILVIDGKVWRTVYYRFATREEREGKVSTNLIFKECRQSAAMKRVLAFYRGNFQ >CP031653|3283362:3310566|3301197_3301737_-|AXP27626.1|DBSCAN-SWA MQPLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGYLVEQDSFMARLAEMEKELSEAKQAVILNAPRHQKLKEISEGIVSMFRVDPDLAGPLMAMVTTMLGAI >CP031653|3283362:3310566|3297505_3297607_-|AXP27619.1|DBSCAN-SWA MRTYNPNSLLPSQMQKCTCDFLHPAFDLCGGEA >CP031653|3283362:3310566|3292324_3292540_-|AXP27608.1|lysis|DBSCAN-SWA MKSMDKLTTGIAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >CP031653|3283362:3310566|3294961_3295486_+|AXP27612.1|DBSCAN-SWA MKLWPVLTGIALSFTLIACKAPTPPKGVQPITNFDANRYLGKWYEIARLENRFERGLEQVSATYGKRNDGGIRVLNRGYDPTKNKWSESEGKAYFTGDTKTAALKVSFFGPFYGGYNVIKLDDEYKYALVSGPNREYLWILARTPTIPDKVKADYVRTAQKLGFNVNELLWVKQ >CP031653|3283362:3310566|3285101_3285281_+|AXP27596.1|DBSCAN-SWA MKVPAARWSGLTFISTLWQAPRLLRCLVNHSARWRNAIWYHLKVLKTTFCLFTFPFRSS >CP031653|3283362:3310566|3295641_3296019_-|AXP27613.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLIPSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVGGMTFMALARKHGRSDCWVGRMLQKAEGVVEGMLMVLDLRLEMDADCSK >CP031653|3283362:3310566|3291363_3291831_-|AXP27606.1|lysis|DBSCAN-SWA MSRVTAIISALVICIIVCLSWAVNHYRDNAIAYKEQRDNKASELEKANATITDMQQRQRDADALDDKYTKELADAKAENDALRRKLDNGGRVLVKGKCPVPSSAETSSASGMGNDATVELSPVAGRNVLGIRDGIIRDQTALRTLQEYIRTQCLR >CP031653|3283362:3310566|3308668_3308920_-|AXP27639.1|DBSCAN-SWA MSAERAKIHAKNLRNFVYYCAVSNELFSAQKMDGKFVRVRKNFMGAKHEKRFVSLFDLHDSFRADLYFPFLLVAGRYLQGHVS >CP031653|3283362:3310566|3302953_3303703_+|AXP27629.1|DBSCAN-SWA MECVASSNSAIPNVVEVIRRINEGSTQPFLCKCDDGQLYVLKSKPSMPPKNLLAEFISACLANDIGLPLPDFKIVFVPEELIEYSPDLQQQICTGYAFASLFIDGAIALTFTQSRNETIIPVEQQKLIYVFDKWILNADRTLTDKGGNVNILYDISNDKYYLIDHNLSFDQNAGPEDFSVHVYGPGNRKWQYDLVDRVEYRQRVVNSLHKLPAILDEIPEEWIVDEEFLPFVCTTLDKGDCDEFWSAIE >CP031653|3283362:3310566|3299483_3300185_-|AXP27624.1|DBSCAN-SWA MKNIAAQMVNFDREQMCRIANNMPEQYDEKPQVQQVAQIINGVFSQLLATFPASLANRDQNELNEIRRQWVLAFRENGITTMEQVNAGMRVARRQNRPFLPSPGQFVAWCREEASVIAGLPNVSELVDMVYEYCRKRGLYPDAESYPWKSNAHYWLVTNLYQNMRANALTDAELRRKAADELVHMTARINRGEAIPEPVKQLPVMGGRPLNRAQALAKIAEIKAKFGLKGASV >CP031653|3283362:3310566|3283362_3284652_+|AXP27594.1|DBSCAN-SWA MTTDDLAFDQRHIWHPYTSMTSPLPVYPVVSAEGCELILSDGKRLVDGMSSWWAAIHGYNHPQLNAAMKSQIDAMSHVMFGGITHAPAIELCRKLVAMTPQPLECVFLADSGSVAVEVAMKMALQYWQAKGEARQRFLTFRNGYHGDTFGAMSVCDPDNSMHSLWKGYLPENLFAPAPQSRMDGEWDERDMVGFARLMAAHRHEIAAVIIEPIVQGAGGMRMYHPEWLKRIRKMCDREGILLIADEIATGFGRTGKLFACEYAEIAPDILCLGKALTGGTMTLSATLTTREVAETISNGEAGCFMHGPTFMGNPLACAAANASLAIIESGEWQQQVAAIEVQLREQLAPARDAEMVADVRVLGAIGVVETTRPVNMAALQKFFVEQGVWIRPFGKLIYLMPPYIILPQQLQRLTAAVNRAVQDETFFCQ >CP031653|3283362:3310566|3289922_3290066_-|AXP27601.1|DBSCAN-SWA MTWFDGVDARCDMQMIIIIILRVLSGDPTGYGAATSRVFAIYENFPV >CP031653|3283362:3310566|3293809_3294769_-|AXP27611.1|DBSCAN-SWA MIKKPVIGISGCLAGSAVRFDGGHKRADFLMDKLVEWVTFRPVCPEMAIGLPVPRPALRLVRSMQGNIRMCFSHDQNEDVTERMTEFSRSYMDKLKDVSGFVVCAKSPSCGMERVRVYDENGNRGRKDGVGLFTSTLMEKFSWLPVEEDGRLHDPVLRENFVERVFALHELNHLYKEKLSRRELLAFHSRYKLQLLAHSQAGYKDMGPFVAAIHEWADLESYFEVYRDKLMAILRKPASRKNHTNVLMHIQGYFSNYLSTRQRKELSEVILNYRSGTLPLLAPLTLLKHYLGEYPNDYLLTQNYFDPYPDELALRLMVN >CP031653|3283362:3310566|3307233_3307425_+|AXP27635.1|DBSCAN-SWA MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV >CP031653|3283362:3310566|3296883_3297054_-|AXP27617.1|DBSCAN-SWA MATPLIRVMNGHIYRVPNRRKRKPELKPSEIPTLLGYTASLVDKKWLRLAARRNHG >CP031653|3283362:3310566|3291827_3292325_-|AXP27607.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIIPGKTYTEAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSALLRKINQGDIKGACDQLRRWAYAGGKQWKGLMTRREIEREVCLWGQQ >CP031653|3283362:3310566|3308247_3308778_-|AXP27638.1|DBSCAN-SWA MKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >CP031653|3283362:3310566|3309299_3309518_+|AXP27642.1|DBSCAN-SWA MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >CP031653|3283362:3310566|3291223_3291376_-|AXP27605.1|DBSCAN-SWA MPSMIAIILLIILHIWLCRQGGAHFWSESRLNISLLMLDIEHFARGKGLR >CP031653|3283362:3310566|3300181_3301201_-|AXP27625.1|DBSCAN-SWA MTGSEMAKAGLLEQNRLSGANRNTLIAGGIMANTAEIFNFPVPDAAQKEPRVADLDDGYTRIANELLEAVMLAGLTQHQLLVFLAVMRKTYGFNKKLDWVSNEQLSELTGILPHKCSAAKSVLVKRGIFIQSGRNIGINNVVSEWSTLPESGKKNKVYLKEVNLPESGKKSLPKSGKGTYPNQVNTKDKLTKDNIKPYSSENSGESSDLPENDLPVVKADAAIQNGSKWGTAEDLTAAEWMFDMVKTIAPSARKPNFAGWANDIRLMRERDGRNHRDMCVLFRWACQDNFWSGNVLSPAKLRDKWTQLEINRNKQQAGVTAGKPKLDLTNTDWIYGVDL >CP031653|3283362:3310566|3296241_3296604_-|AXP27615.1|DBSCAN-SWA MNTYNITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE >CP031653|3283362:3310566|3297053_3297509_-|AXP27618.1|DBSCAN-SWA MNLPQDGIKLHRGNFTAIGQQIQPYLEDGKCFRMVLKPWREKRSLSQNALSHMWYSEISEYLISRGKSFATAAWVKEALKHTYLGYETKELVDVVTGEITTIQSLRHTSDLDAGEMYVFLCKVEAWAMNIGCHLTIPQSCEFQLLRDKQEA >CP031653|3283362:3310566|3307078_3307261_+|AXP29198.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >CP031653|3283362:3310566|3307815_3308037_+|AXP27637.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >CP031653|3283362:3310566|3293323_3293458_-|AXP27610.1|DBSCAN-SWA MPYICSIILVLNSFDVRIGKEDILFKKGSAVLIDYNLKDFFHQI >CP031653|3283362:3310566|3292727_3293315_-|AXP27609.1|DBSCAN-SWA MIVDVEEKTVNDFFKSNTLSPFSVRRFYPAYLMVECEDFSLLKNLIACLNCDGRTVDFVRNQISLACLAILSSEKIVQSFLFGCLNSLGSKVKAIIHTDISAAWRLCDISSRLYLSESLLKRKLKHEGLSFSKLILEERMVMAERLLSYNLYSVGKVAEICGYENTSYFVSVFRRYFGVPPHQYSSRFFLEKDMM >CP031653|3283362:3310566|3287337_3287514_-|AXP27598.1|tail|DBSCAN-SWA MQEFSEHIAPLQDAVDLEIATEEENSLLEAWKKYRVLLNRVNTTTAPDIEWPTVPIIE >CP031653|3283362:3310566|3290461_3290872_+|AXP27603.1|DBSCAN-SWA MAQVAIFKQIFDKVRNNLNYHWFYSELKRHNVSHYIYYLATENIHLVLENDNTVLIKGQGKVVNVRFSKNKCLIEATLKGFKSGELSFYEYRKNLATAGVFRWITNIHENKRYYYTFDNSLLFTENIQNTTQIFPH >CP031653|3283362:3310566|3285932_3287264_+|AXP27597.1|DBSCAN-SWA MIINQVPIKIKIFIFLFSCISIIFLLLHANNGIYITQTTQISYSVFIIGLFFINLMIFIFLLLYYVSNQRQSYLLILSFAFLSNTYYLLEVAIISLSPLGNDLSTIYQKSNDIAIYYLFRQFSFISIIFLAVYSTNVKNKSVLEDKRNIIIVVLSILILFITPFVAKNLSSDNIKYSLNIIQYSLNRHLPTWNIVYTKIISVFWLVLLISSCISIRNYSKIWLCIILISIVSVCNNLILLYFIDKSHPAWYMTKFLELISMIYIISTLMYYVFRKLNHANHMAIHDPLTNTYNRRYFIDSLKNISKHHDFSVIMLDIDSFKSINDKWGHHMGDQVIVMVTRIIKKSIRKEDVLGRLGGEEFGIIIKGNTQKLLLSIAERIRKNIEEQCSEKLLSHGPEKITVSIGCFTSKENNLSPSEMLVNADKALYQAKRTGKNKVIIHSK >CP031653|3283362:3310566|3296600_3296891_-|AXP27616.1|DBSCAN-SWA MADLRKAARGRECQVRTPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDRRTHFVDAAYAKECALEGMARTQVIWMKEGVIKA |
52 | Enterobacteria_phage(47.06%) | terminase,lysis,capsid,tail,integrase | attL 3285278:3285292|attR 3310640:3310654 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
3809734 : 3835453
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >CP031653|3809734:3835453|DBSCAN-SWA GTTACTGAATACGCTCACCAAAGTAAAACTCAGGCTGATATTCACGTATCAGCCTTTTTTCTTCTTCCTCCAGCTCACGCTTTTTGCGCTTACATGCCTGTAGCTCCCTCCCCTTCTCGCTGGCACTTATTTGATATTGCTCTTTACGGCGGGAAAAATCCTGTAATGCACCCCACGGGATACCATAAGCCCCCGTTTTTCTGATACCTGGTATCACATTCCTGAATACCCAGTTACTGAAACGATGGGCGAACGTGCCAGGAGTAACAGCTTTGCGACTTCTGGCGATCAGCTTGTAAAAACCAGATTCAGAAACGACGCTATGATTTGGATTTCCGCGAATACCGTAGCTTAAAGCGACGGTATTCTTCTCATCCACATCCAGAGCTTTTAGAGCATCGCGTGAGTTGCTTATTTCCAGAGCTTCACAAACATCCTTTGCAACAAACCACGGATCGCCGTTCAGATACACCACACGAACATCCACGCCATCAAAGCGCAGAACGACAAGATCACGAATATCGCAGAATTTTTTTACTGAACGAGCGTACCCCTTGCCCGTCACGGCAATATTTTTATTCATCGTTTTTTACCTCACATACAAAAAACCCCGCATTGCGTGCGGGGTATGAAAGATATTATTAGTGGGGATTGGCCTGTTCTCGTTGTTTATCTAACCATGCCTCTACATCTCTACGGTGCCAGGTATGCCGCCGCCCAATTCTGAACGGTTGAGGAAAACCATTATTTTCATCTTTCCAGAAATTGATGAATGCGCTCATTGCCCCATAGCGCAAAATTTTCATTACGTCTTTAGTAAATAAAATATCTTCATTGGTATTCATTTGCTGAACTCCTCAACCATTTACTATTCTCAAACCTTTCTTACCACCGGATCTATTAATTACCCCTTTTCTCGCCTCATCAAAAAAATCTCCTACCCATTGCATCATAATTTTGCGTTGTTCTAAATAAGTAGTTCTATTATAAATATCTCTTATTTTATCACCACTTTTATGTGCTAATGCAGCCTCAATAACATCAGGGTTAAAACCCTCCTCATTTAAAAGCGTACTCCACATCGAACGAAAACCGTGTAACGTTACAATCCCTTTGAACTTGCTGGCAGCAATTGGGGTATTGATAGTATTCTTTCCCATAGGAGCATCTTTTGTTCTGGAGGAAAAAAATATATAACGCCCTCTTTTTATTTTCTGCATTGTTCTTAGGATTCTAATAGCCTGTGATGAGAGAGGCACAACATGTTCACGATGGCATTTCATTTTATGCGCGGGGATAATCCACAAGCCAGAATCAAAATCAATCTCGGACCACTCTGCTTTAATCGCCTCACCTGGCCTGACCATTGTCAATATCTGGAATAAAAGTGCATTGTGAGCTATTTGATACTTATGAGGCACACTATCCCACCAACTCAGAAATTCAGGCAATCTTTCAACAGGTAGTGCAGCTAATGATTTATTTTTCTTTCCTGTGAATGCAGTTTTTATCTTAAGTAATGGATTCGTTTTCAATGCTCCACAATTTACAGCGTAATTCATAATTTCATTTAATCTTGATATTAATTTTTTTTGCAACGCATTCTTATCGGATACGGCATCCAGAGCATTAATAGCTACTGGCGCTGTAATTTTCTCTATGCTGTACTTACCAAAGAAAGGAACAAGATATTTGTATACTTCATATTCGATATTATACAGCGTAGGTTTCCGCAATTCCGATCCCTTTTTAAAAGCGATCCATGCATTAGCAACAGCTTCAAATGTTTGTAGATTTTTTAGTGACATCTCAATTTTACGATTTTTCTTCTCTGTCACTGGATCAACTCCACGTGCAATCATTCGCCGAAGTTCATCACGTATTTCCCGTGCTTCCGCGAGTGAGAATTCAGGAAAACGCCCTATCGTGTATGTCTGCCGTTTCTTCGTTATCGGATGGCTATAACGGAAACGCCACACTTTCCCACCAGCTTTACTCACATTCAGCAATAAACCGAACCCATCATAAACGGCATAGTCCTTTTCACGTGGTTTCATTCCCTTAACTTCAGTCACGGTTAATGGCTTTACCGGCATCTATCGCCCTCATTTTTTAGTCCGTCATGTAGTCCTTTCAAGTCGATAACAAGCGATAAACTAACTCATTATCAAGTAAAGAGAGGAAACACAAAAAATCACAACTCATTGAAAAGACTACAAAACGACACCAGAACATAAAAACAGGTAAGAAATGTACCCTACATCCAGAATGACGCAATACGTGAGCGTCGGGGATCACCATAACGACTGCCATCCGCATTGATGGATTCACCATCCCGCAACCAGACCCCACGTCCGTTCAGCTCACGTTTTTGTTCAGGCATAATCCGTCCTGAACAGGAAGGACACTGAATATAAGCCGCCTCACTTGCCAGCACGGGATCGGCAATATCACGGAAACCAGCAACCACATCGCCGCAGGGCTGAAAATACTCACCACAGTGTGGACAGGGCCAGTACCAGCGACGGCGATCGCCACGGTTATAGAGCGACAGTATCCCCGTGGTTGGTGGAGCCTCATGCGGTGAAGTCCGTCGCCATTTCACATCCTTCACATCCCTGCCGGGGGAACTCTCCACCAGCGTCATACCACTGGACATAAATGTGGTGGTACGTTTTGAGGCAAGAGAGAAGGCATCCCCCTCGCCATCAATATCTTCCGGAAAACGGTCATAATCCGTCAGCGCCACGCATTTATAATCTGATGAGGACATGATATTGACTGACGGCCAGCCGATTTTCAGGTAGTTGCCAGCAAGGAATGTTCTGTCATAAACGTTGTTGTCATTTTTGTTCGGACTCAGGCGACTGACCACTTCCGGGCTGACGCGAAACGTTCTGGCAAGTCGTTTTTTGGAGTGTTCGCGGGCTTTTTCCTCCGTCATCTGAATGATCAGCATATCAGCAGGATCGCAAATCACATTGTAAATCACCCAGCCGTCAATCAGGCCGATAGTCTTGCCAGTTCGTGCCGGGCCAACAAATATCACTGCGTCGTATTCACGCGAGGCCAGGCAGTTCATCGGCTCAATAACATACGGTGCCACCAGCGGATCCCACGGGACTGAATTCCCGGCCCCCATGGGCACCCGCATATACTGAGCAACGGCATCAGCAACCAGCATTCGTCTCGGTGCGCGAAGGATATAACCTGAATCGGTTCGTGCTGCCTTTGCGGTTTCCTGATTCAGCATTACTCCTCCTGCTGTAATTCCTCCTCATCATCCGCACCTGCTTCGGTCACCCGCAGGGCTATCTGATCGCGCAGATCATCAATAATGGACTGAACACGGCTCACAGCGGCAGGCTGCAGGCCGCAGTCACGTTCCAGAATATCCGGTAATGTCTCCAGCACCTGCACGACCGCTTTTGCCCAGATGGCAAACTCCCGTCTGACATCACTGGCCGGAATGAGTTGTGCCGTTTCCTGTTCGAACTTAAGACGCTCACGTTCAGACTGATACCAGGCTTTGCGTTCATGTGGATCCATTTCGCCCTCAGCAACCGGCGGTGGTAACCCCATAAATTCAGTCAGAATATCGGTCAACCGATATAGCTTGAGTTTGTCATGTCCACCAGCGGGACGAATGTTTTTCAGTCTTGCCACGACAGTCTGGCGGTGCAGACCAGATAAAGCCGCCAGTTGATTAATATTCAGCACCAAGTTTTTCAACTCATGATCCATATTTCCTCCGGAGAGCTTTAAACATGCATCGTGCGAACAACTTTAAGAAAACGCGTTCGATGTCGAACAAAAACCGCTCAATTCGACATACAAAAAACAAATAACCATTAATAATCAATAAGATGCAAAGATGATGGTGGCCGATAAAAATGCAAAAACTAGCCTTTTTCCGCGACGCTCCCGCCCCGTGGTAGGCAACCCCGCCGGGAGGACCCATAAGAAAACACAGGTCCGACCGCCGTTTTGAATTCTCTTCTAAACGACTCTAGTTTGATTTGAGTATGCAATACGCGTAAAAAAGCCCCGCATAGGCGAGGCTGGAATCAGACAGGGGGGATTAAGACTGGATATTCCAAATCATTGCGTTACCGTTATGATTTTCAATGATTTTTGCCAGTGGTGCTCCACCAAAAGAATGCATGTAGCTGTGATGTATAGTCAAAACCAAATCTTGAAGTACTTGATCACTTTCAAGTTCGGTAACATTCAACCCAATTTTCTGCGCCTTATCAAAATGAATATGACGCGAGTGGGTATAGGTTGTGTGATGGTTGTTTAACTCTGAACAAACATGTGTTGCTTTTGATTCTGCCTCAGGATCATTATCAAACATGCCTGTCATAAGCCAATGCTTAACAATCTCATTTGCCCATTTGATTGCTTTCTCACACTCGCCGATGATCGTCGGGTTTAGCTTTTGAAGAATGAACTGCCACATCTGAACAGCTGCGGGATTTTGAAAAATTTCCGTCTGCGCACGATTCCATTCTTCAATGATGGCATGGGTGGAGAAACCGTTGAACTGAGGATCAATTGGGCCAATGTTGGACTGTTTACCCATGATGATTTCATTGGCACAACATGCAAGCATAGTTCCGCAGGACATTGAAATCATAGGAACAATTGCTCTGATGTTAGTTCCAAACTTCGACCTTAAGTAATGCCCGATTGATTCCAAAGCGGCAATATCACCACCTGGAGTATGAAGTATCAAATCCAACCCTTTTGATACATCTAAACCATTGATGGCTGTCATCAACCCGTTCTTATCATCATCAGTCATCTGAGTAAGATGGCGTACTTCTGCACCACCATGCTGTAACCACCCTGAGTAATATGTGATTACATTTCTTCCAGTATGATTCGAAAGTTGAGATAAGTATTTACGGCGAACCTCATCCATAGGACTTTTATGGGCGAGAGCCGTTATCTCGCCCAGTACGTCGCTCCAATTAGGCATAAATCAGTACGTGTAAAGTTGATGACTTGTAGTTTGGTTTTGCTTCGATTGTGTAGCGGTACCTTGTGAGTACCAAACACCTGTGTTTCCACTCGTACTAGTTTGCTGTGCCATTACACGAGAAGCGAAGTTGTTAACAGTTTCACCAGTCAACATTTCTGCCGGTTTGATATTGTAAACATCGTAGAACTCAGCTGGGGTCATAATGATAGCTCCTTTTATAGCGCTACATATAGTGCCTGTGCTAAGGGGGACATACTATATGTGCCATTGTTGTCATATTTAAATGTGCAAAAAGCCTCAAAAGAGGCTTGAAAAAGATGCAAAAATACTATCTATGACGTGACTATAGCTACTTACCCACAGTTTATACAAGCCAAAAATGCTAAATTCAGATAATCCTATTCGCTGAAAAACTACACAAAAGCTACGTTCATCTTGTCATTAGATAACGTCAGTAACTTTCACTGGATCTAAGCCATTGGTGAAAAAGCTAATTGCATGATGAAGCACTGTAATGTAGCAGTTGAGGTTGTTAAATGAGAATCACCCTAACAATGTCAACGGACATTATTCATTTTCGTAACTTCTCAATCTCTCGTATACCCGCCAGGTTATTGTTGCCCTTCTCAATAACAGCCAGCAAAGGTTCAATCCAGAGGACGGCCTGGCAATACGTCATTGAGCTGGCGGCAGCGGTACTATCATCGGTTGCGTTAGTGTTCCCGGTATCGGGGTGCATTGCGCTGGCACGTAAACGGTACGCGTATTTGAGCAGCCCACCAGCGACATCAGCAGGAACAGGCAGATCACAGGTCTTTTCACGTCGGAGAATCTCCCGGTATTCGATGACAGTTTTCTCAGTACCGGCATCGATCAGCGAGTTAAGGCGGTTGGCGTTCTCTGCTATCTGGTTAAACCGATTGAAGTTGAAAGCCTGTGTAGTTATCACCGTCGCCTGCAGCGCGTTATCATTACGTAATACCCGATTCTCACTCTGTTCTGTTTCAAGAGCTGAGCGGCTACGAACCAGTAATACGCTAAGCACTGCAATAATGATTACGACAGCCACCAGCAGAACCGCAACAATCGTAATTTTTCTGGGTTTCATCAGAATACCCCCGGAACTGATACCGGAATGCCTGGGTTAAGCGGCCCGAGCCCATCACCAAGAACCTGAGGTTTTTCTGCCCACAGGCAAACTTCACGCTCAATCTCACGACGAGTCATCAGGCCTTTCCATTGCTTACCGCCAGCGTATATCCAGCGACGTAGCTGGTCACATGCGCCTTTGATATCGCCCTGGTTTATTTTGCGAAGAAGCGTCGATGTTCTGAAATTGCCAGCACCCACGTTGTAGACGAACGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCAACCGTGGCAAGGTCTTTATTCAGGAGGGCTTTGCATTCTGCTTCGGTATACGTTTTACCGAGCATGATGTCTTTTCCGGTGTGTCCGTGACATACAGTCCATACGCCAACGATATCTTTGTATGGTATGTAGCTGACACCTTCCAGGCCATCGTCACCACTTGGGCCAGTGATTAACACTGATGCTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACTGCTTTTCGTAATGATGGAGGCATTATTCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAGGCTACCCAGCACACCTATTGCTGCCCACTGTGAGGGCGTGACTTTATCGAGCAGCTGTAAAAACCAGTAACCGGCACTACCTGCTGAGGTGCCATAGGCGACACCCGTTGTTAACTTATCCATGGATTTCATAACCCCACCTCGCAGATGCGGGTGCTGTGTAATGGAAATAAAAAGGCCACCTGACGTGGCCACCAGATTATTTCCCCACCAGCTCGTTTATCTCTTTCACTGTCTGATTAAACCGCTCTGACTCAAGCTCAACACCTAAGGCCCGACGCCCCAGCGCCATTGCTGCTTTTATTGTGGAACCGGATCCCATAAAAAAATCAGCAACCAGATCACCTGGTCGACTACTGGCATTGATTATTTGCCTGAGCATATCCGCCGGTTTCTCACACGGATGTTTCCCCGGGTAGAACTGAACGGGTTTATGCATCCAGACATCGGTATAAGGCACGGAGACTGATACGGAGAAATAGCGCCGGAGAGATTTAAACTCATCCAGCAATTCAGAATATTTACGATTCAGTGAATCATAAGATGCCACCAGCTGGTGGTGTGGTTGTTCCAGTTGTTGTTCCTGAAACTTCTCTGCCGCTATACGGAAAAACAGTGCCTGTAACTTCCGATAGTCAGCCTCATTCGGCAACTGCCACTGACTGGCACCAAACCAGTGGGAAACCATATTTTTCTTACCTGTGGCTTCGGCAATTTGTTTTGCCGTTATACCCAGTTCGGCACGAGCATCCCTGAAATACGATATCAGCGGTGCCATTATGTGCTGTTTGAGTTCCCTTTCTTTTGCTGCATAGCCGTCACTTTTGCCGCGATATGGCCCCTGGTAATGTTCAGCAAACAGAACGCGCTCTGTGGCAGGAAAATATGCGCGCAGACTTTCTTTATTACACCCATTCCAACGTCCGGACGGCTTCGCCCAGATGATATGGTTAAGCACGTTGAAACGTTCACGCATCATGATCTCAATATCAGATGCCAGGCGATGCCCACAGAACAGGTAAAGGCTTCCGGCAGGTTTTAACACCCGCCAGAACTGGGCCAGACAGTGGTCCAGCCACTTCAGGTAATCTTCGTCCCCTTTCCACTGATTGTCCCAGCCGTTGGGCTTCACTTTGAAGTACGGCGGATCGGTAACAATCAGATCAATGGAGTCATCAGGCAGGGACTGAATAAAATGCAGGCAATCAGCGTTGATTAAATCAACACTGTTTATTTTTACAGTATTTTTCATGGATCAGTAAGCGTAACTCTGGTAGGCTCACTCTGCTTTTGCGCTAAAGCAGTGGGCCATGGTTCGCTTGTGACCAGTAAGCATGAGCGAATGGCTGGCAGGTGCTACCAACACCCACCAGCCGCCCATTTTCACAAATTAAAAGCCCTTCATTGCTGAAGGCGTCTGTAACAGCCGAACTGGTAATCTGCCAGCCCCGCCATAACCAGCTGGGTCAGTATTAACTGACAGCGTTCGCGTGAAAGGTATGTGTTTTGTGCTATCTCCCCGACTGTTGCCGGTTTGCCGTTTAATTCATTAAAAACAACTTTCGCCGTTTCTGTCATATCTTGCTGTTTTAGCATGTCTTTTTACCTTCATGGTTAACATGACATACCAATAACTCTTGTCTAAAAAGCCAGCAAGATAAAAAGTCAGTATTCACGACCACCAGCGTGTTTACCGTACTGCACCAGGTTTACAGGTATAAAAAAACCCGCTCGACGGCGGGTTTAAGCTGTGTGACGAAGTAATCACTCTTAACACAGTAACGCAATTTTTGCGGACCGCGATAATGTTTTTTACACCAAAAAAAGGTATTTTGTGGAAAAAAATCAAGACATAGCGAGCAAAGAGTGACTAAAACAACTTCTTTTCAGATCTTCTACGATGCAGAGGATAATGAATTAGCACAGCATAAAATTGATGCCAAAACATTAAGCATTTCCATAGGTTCGATGGCAGATTTAATTTCAGCAGCTGATAAAAGACTTAATGACGGCCAACAAACCGTTAAGTTAATGGTTACTAATCCAGCTGAAGCGGGATCACTCGGCGTATCCTATACGATGATGGAGCTTGTTCCTCATGCCGTCGACGTAGCAAAAGTGATTGGCCTAACAGGGATAGCCGGGGCTACTATTGGAGCTCCAGCATTATCACTAATCCGCCAACTGGGCAGCAAGAAAGTAATTTCGGTAACAAAACGGGCAGGAACAGAAGAATCTGTTCTTGAGCTTGAAGGGGAAGAAATTGTTTGCCATGACTCAGTGGCTAAGTTAGTAACAGATCCAGAAGTCCGTGATGCCCTTGTGAATGTAGTTCGTGCACCATTAGACGGCAAACAAGGAGCAGTATTTAAGGTGCTGAATGATGAAGGCGAAGAAGTCGTTCGTCTTGAAGGAAGTGAAACCGAAGAGATCAAACCGCTGCCTAGAGGCACACTACTTGAAAAAGAAGAGTCTGTAGAAGAAGTCAATGTTAGATTCGTACAGATCAACTTCGAGGGTACAAAGGGTTGGAGAATCGATTATTTAGGCGAAGAGCATGCTGTTACCTTTGAAGATCAGTTGTTTATACACCAAGTTCAAAACGGAATCATCAGTTTTACAAAAGAAGATTTGTTTGTTGTTGAGCTAAAAACTATAAAAACTTTCACTGCGCGAAATGCCACAACCAAGTATGCTATAACCAAAGTAAAACGAAAACGCCCTGCTGAGGCTTGACAAAAGTGACACTAAACATGCAGATAGCACAACTGATCTTCTGGATAGGGGTGATAATGATCATCCCTGCCTTTAGTCGTTTTTGCTATTCAGCGTCAGCCTTGCTTTGGCGTCGACTATTCCCTACAAAAGTCTTCGAATTCCGATATCACGATGAAGATTCCGGAGTGACCAAAACGCTAGTAATAAAAGTTCCAAGTAAAAAGGGAAAAATGCTAACAACCCTCATTGATGAGGCAATTGCGGAGAATTCAAAACGAAAATGAATTCTCAAACTAAAGGTCTAAGCACAGGAAAAGCTACCCTTTCAACTGGTGGTTGGGGAGCAATTTTAAGTGTTCTTGTTAGCGCGATTTTAACCGACCCTAATAGCGTGTGGCGAACTGTAGCCTACGCATTAGTACCGGGTGTTGCGGCAACTTTAACTTACGTCATGAATTGGTTCATTTCGAGGCATGGTTTCGAATCTCCAGAAGACGCAGCCAAAAGAGCAAAATGCAAGCGTGATTTGGCTGAGATTGAGAAGCAGTTACGATCTGACCACTTAAGTCCTGAAATAAAATCAACATTGATGCAGGCTAAAGCAAGAACAATCGAAATTTTGGTATCAATCGGAAGGGAATCTATACTTGAAACATCTGCGCGTAACATTACACCATCAGAAACTGCCGATCCGCAAAGCTGAGCGGCAGTTGCTATTCTAGTAGTCTATTGGTCCATTTCCAAACAAAGATCAAGCATTGAGAGACAGCCTTCAATAAACCCCTCAGCCATCTGTATCTCAATGCGTATTAGTTTCTCATCCTTTTTACGAGCTTTGGCGAGCTTTCTTTTAGAGATACCGTATAGGTAATGGGCAACAAGAAGCGAATGTTCGTCTGGCCTTTTTTGCTTTAGACGAGCAAGACAACCTTCAATAATTAATGCATCACTATCTGAACAAGCCTGACGTGTTTTGCTTGTATAGGGAAGAAGTCCCTTAAACCCAGCAGCTATAGGCGAATAGTCTACTCCTGAACTGTCACTCGCCGCCCATGCCCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTTCTCCACAAAATCAGGCCAGCACGCCAGTTGCCAGCGCACGATCGATAAAACGAAATATCAGCTCCAGCTGGGAGCCATACTTATCTTCAAATGCCACGGTATCCGCATGCAGCTCGTCGTGATGCTTTCTGCACAAAGGCAACACAAAGAGGTCATGCGCTTTTGTACCCATTCCCCCCTGACCGTGGCCTATCAGGTGGTGGGGATCATCAGCAGGCTTTCCACAACATGCACACGGCTGTGTCTTAACCCAGCGCGTGTACTTTTCATTGACCCAGCGGCGACGTTTTGGGCGTAACATAAAAGACTCCGGCGACTCAGGATCCACTTTCAGCGCCAGCACATTTTTCGCTTTATCCTGGATGATGCTGGTGGCAGGAACCGAAGGCACAAGGTCACTTTCCCGGGTGACAGACGACACAACAGGCTTCGGTAATCTCAGTGCCTTACGGGCTGCACTTTCCGGTAAGGCATCCGCCAGGTCATTACGAACCAGCCACCAGCACAGTTCCGGCATTGTCACAACGTGACTATCATCAAAACCGAGATCCCGACGCACAACAGACAACACCCAGCGGGCACAGTTATCCGTTGCCATTGATTCCAGCCGTTCCGTGAACTGATCGCGCAGCTGGTTATCGCAGTGCCAGCACAAACGGATTGCGCCCGGAGCGTGTCGCATTGTTGTCATGTTCTCGCTGTGCCAGTCGGTATGAGGCCACTGGCAGCCCTTTTCACGAAGTAACCAGCTTTCAAGACATTCCACGCCACCAGCACGACGGATCACTGCCTCATTGCGGAACACGGCCCGAACGGCAGGATCATCCGCCAGCGGTTGTGATGCCGCCGGAACGGCGCCACTGGCGAAAGATGAATAACTTTCCGGCTCAGGCTCCAGCAGGACACGCCCCTGCATAAACAGGGGCATCAGCTCTGAACCAGGCCTGAACAATACGATCCCCATACGCGGGGCAATTTCAGGGGTCAGTAGTGCTCTCACGGTCACCTCAATGAACGGTATCGAGCAGCTTTAACAGCTCAGTGAATCGGGATTCGAAGAAATGCGGCTGCGTCTCGCGCGGATTTGCAGGACTGGTGATGTTCTTGCCGAACATGCAGCCTTTCGCCGTCAGCGACCAGAATTTTTTGATGTTGTTAATCGCTGTACGGCTGTATCGTTCGCGCTGTTCGACGATCCCCAGCTTCGCCATCTGGTGATATGCCTGATTAGCCGTCAGGCGGATACCATACTGCTTCAGCAGTGCACTCAGTGACAGCGTGGGGCGGCTTGAGCCATCAGGCGCGTCAGCAGGAGCATCAATGGCATAACGCGGTGCCAGATTCGGTAAGCCAACAGCCTCCTGGAGTTTCTGACAGGCCCCAAGCACAGATGAGTTAGACAGATTTAACTCCCGGCGCATAAAGTCCAGCAGAATCACGCCAGCCTGCATCTTGTCAGCAGCCTGTCCGGATAACTTTTCCGGCGCGCTGGTTACCATATCGAAAGTACGGATCACCTTCAGATGGAATGACGGGCTGATCCACATTGCATAGGCATACACCAGTTCCTTACAGACATACGTTCCCCGTTCATTTCCCCCATGAATCACACTCACCGGGTCAACACCCAAATTCTGGGTGTTGATTAATTCATGAACAAGATCAATAGTTTGTTGGCTGGAAAGAAACTTTCCTGGCTCCTTGGTTCTGGCATTTGCACCAGATGCTACTGCTGCGCGATGCAGATCGTTCAGGCTGTAACGCTCATAAGCATCACGACGAACTTCAATACCATCAATGACCATCAGATTATTCATACTTCGTTTCTCCTCTTAATCAGGCAGCTGCACCCGCCGTTTTCTCGTACTTACTGATAGTGATCTCGACCTTCCCTTCCGGGATAACCGGTCCCCACTCCACCAGCATTCTTTTCACCTGACTGTCGTCTTCCCACACCCCCGCGTGGGTCAGGGCGTCAAACAGCGCCTTGTTATAGTTGTCCAGATCGCGGATCCTGTTATCCGGAGGAAACAACACGATCTCCACTGAAGCAGGTGCCGACGTTGGTTTCGGCAGACGACGTAACTGCTCAACTATTGCTGCGCACGCCGCGCTCTGGAATTTTCGCCCCGCCGCGCTTATCAGGCTCTTACCAGCAAACGCCCCTTTGTTGGGGTGTCGCCAGTACGTGTTCACGCTGGGCGGAAAAGGCAGGATCAGCTTCATACTTTCAGGCCCCTCTCATGTAACCAGTGGGTTGCACGCAGCCTTGCGTTTTCCTCACCGGCAAGCAGTGAGCGGATAATCCCGACCGCCTCGCTGTCGTCGTCCTTCACCGCGGTATGAAGCGTGATGCCCCGGGCCACGCCACGCTTTATCGTGATGACGCCTTTTTTCTCCAGTGCGCGAAGATGCTCCACCGCTGCATTCACTGAACGGTATCCCAGCATGGTTGCCACCTCCTGATTGGTTGGCGGGAAGCCACGTTCTTTCTGATAAGAAATCAGCATATCCAGCACCTGCTGCTGGCATTGAGTTAACGTCGTCATGCCGCCATCTCCCTGACCAGTTTTTCTGCCTGCTGGCGAACCTGCGCCAGAAAGGCCTCACCACATGCCTCAAGTTCATAGCGCCCGATGTAGCTGATTGCCGGTCCCTTCCAGGTCTTGTCGAAAACAGCAATAGCACCAGCGAAGAAAGCGCCTGTCGGCACCTGCTTCTCATCCTTCGGGATAAACCAGGCAGGCAGTTCAAAACCAATACGCCCGCGAATAAAAGCAATATGATCTGCATCTTCCGGCCACCACACTTCGCTGGTGGCAGCTTTGATCAGGAAAACATAGCGCCCGCCTTTATCACGCATGGCACTGGCATGCTTCATGATGTAACGCATGCCGGTGATGTATTGCCCCTCATGCTGACTGGCGCGGCTGTATGGGGGATTACCAAAGGCAGCACCTTTAAGCTCCGCAAGGCGTTCTGACCAGTCATGCGCCAGCGCGTTGTCTTCCGCCGTGTAATACGCAGCACATTTGGCGTTATCACCGTCAGTAAACAGATCCAGAACAAACGGGCCAAACAGAGTGTTAATTCCCCAGAAAATGTTGTCCGGCGTGCGCCACTGATCGCCCACTTCCTTCAGTTCATGGGCTGGTTTGTTCCGCAGTTCCGCCAGCGCCTGGCAATATTTATTACTCATTAAGCCCCCACGTAATTCCCTGAGAGATACCACTCTTCACCTGATGCAGCCCGCTTACTGCTTTTCCGTAAACACCGTTCACGACGCGCCAGAAAATTGTTTCGTTCTGGCTGGGAGTGGCTTTCACGGAATGCCGCCATCCACACCGTTGCAGCACGACGGTATAAGCCCCTGGACTCCAGTTCTTCCGCCTGGCGGGTCAGGCACAAAATCACCCGCGAGTCGTTAGTGCCGACATAGAAATTGCGCACAGGTCTGGTTTCACGAACTGGTTGTGGTTCCGGATCCTGCGCTCTCTCAGTCAGGCGCGGGAAATGTCTGTGTGTATCTCCTTCACAACGGTGAGCCACACGCCCACTCTGACGTAACTTGCTTGCTGACTGCAGAACGCGCTGCCGTGAGTAACCTGCAAAAGCATCCGCAATGTCTCCGGAAGTACAGCCCGGATGGGCTTCAATGAATTTCTGAACGTCATTCAAAAGACTCATGATCACCCCCTGAATCCTGCCGGGATCTGGCTGTAGTCCACGTTGTCGTAACTGGCTTTGAAGTACGGGTCTTCACGTTTTTCTGTGTGCGTGCTGACGGACGGCGATAAGCGCAGGGAAAGCTCATCCCATTTTTCCCGCAACTTCGACGGGCTGAGCACGTTACGGCACCAGAACGGATCGCGGCTGACGCGGCTGTACATCTCGCAGATTTGTTTGTGAGTACGACCATCCTGCACACACATCAGGCGAATTTCGTTTGCCCATGCTGTCCAGTTCGGTTCTTTGGGACGAACCACCTCGCCGTCACATTCGGCGGCCTGCTCGTACAGGGCAATGATTTTTTTCCAGAGCCACTGTGCGCAGGTCAAATCATCCTGCGTTCCCCACTGGCGCTTTTTAGGGCTGAATACAACCGCATCAGGATGGCGAGTTAAAAAATCCTGTTCATCCGTCTGCGTGTCCGGTTGCGAAGCGTCCGGACGAGAAGGTTTTTTATCTGACGGATCATGTTTTGATTTTACTGACGGATCCCCGCCAGATTCTGACGGGTGAAAACCCGCTTTTTTGCCAGATTTCGACGCATCAAATTTTGACGGGTCAGATTTTGATGCGTCAGATTTTGACGGGTCAGAATCTGACAGTTGAGAAAATGCCGCTGCCTGAAGCTTCGCAACGTTAAGCTGATAAACATTCGACGCATTGCGGTTACCCTGGCGACGCGCCTTACGCGTTAACCAGCCTTCTGCTTCCAGCCGTGCGATAGCCGTCCTGACGGTACTCATCCCCGCGCCAATCTGACGGGCAATAGTTTCAATTGATGGCCAGCACACACCTTCGTCATTACTGAAATCAGCCAGGCGGGCCATAATTGCCACGCTGGATAATTTCATGCCTGATGCAGCGCAACCATCCCATACATAGCCGGTTAATTTAGTGCTCATGACCGACCTCTATTTCCCTGAATTTACGACGAAACTGTTCGAGCGGACTGAAGCATTCATGCTCATAACCTTCGCGGAGGTAGATAACCCGTTGTGTTTCCGGCTCCCAACGAATGACTCTGACGGGCACTCCGTAGTGATCTTTGAACCAGCGGTTAACTTGTCGCAAAGGACCGTCTCCTTCTGCCGGTTGAAATCACCCACAGCCCACTCTGCAAAGCTGTGGGTTACAATTTCCCTGTCACCTGGTACATTCACTGCATAGCAATATTCCACCTTCGCTTTTCCACCCGGTACAGGAAGCGCAATCAGTTGCGAGCGACGGTAGTGTGTTGTTAAACTGTTCATGCGTTAGTTTCTCCACAACCAGAAGCAATCGACGCCACGACGCCCGGAGCTGCACACTCGCGGGCGTCATTACTTTCTGAAATGCAAAAAATTTTGTAGACAAGTGCTGCATGCTCCTGCAGCTTCGAAATTGAGAGATACAGCTCGTCGTTAATTGCTGTCTTCTCATGCGGTTCCACTACACCGTCTTCGATTGCTGAACGAATCTGTTTTGAATAACTGCCGATCTGTTCAATGACTTCCAGTAAACGCTGGTTAATATCGGCATTGTCCACATCCTCGACGTCAGGAAGAGACACAAAGACGCCATTTGCAGACTGCGCCACAGCGTCGGCAATGAAGTGAGTGCCACCAGCACGTTGTAAAATCATTGCCCATCCCAGCGGGAAAATCTGATCGCCATCGGCACGAAGGCGGTTAAATAATGCGTTCTCTGTTACATCCAGCCACTCAGCAGCTTCAGCGTAACCCCCCGGCAACGCCGCGATAGTTTTTCTGACAGCTTTCACGTACCACTCAGGCTGTTTTTCCACTTTCCAGTGATGATTACCCACGGCTTACCTCCTGTTCCTGTGGTTTAAACCCATTCTGGTTTTGGCTAGATTGAAAACGTGCCGGATAAAGAATCTGCATTTCGCTGATTTCACCCTTAAAAAAATTGGCCAGACGTTCTGCAAGATCGATAGATGGAATTTGTTCCAGTCTTTCAATACGACTCAGCGTCGCTGGATTGACCTGAACGCCAGCAGCAACATGCTGCAAAGTAAATCCGTGCGCCTTACGCACATTCCGTAATGGTGATTGCATATGACCTCCACATATTGCGTGATGAGCATATTATTTCACGCAAATATTTTGCGCAAGTTGATTTGCTTAACGCGCAATAAAGAAATGTAATAAACGCATGAACATAGGAAACCGAGTCAGACAACTTCGCCAGGCGAAGAACATGAAAATCGCCGATCTCGCTGAAGCAATAGGAGTGGATGCGGCGAATATCTCACGCCTGGAAACAGGTAAGCAGAAACAATTCACTGAACAAGCCCTGAGTAATATTGCCAGGAGCTTAGGTGTTGATATTGCTGATCTCTTTACCTCAGACGTCAAAAGTAATACTGTATGTAAAAACAGTATTAGTGAGGATGTTGCGCAGGTGAAGGATGTATTCCGTATTGAAATGCTGGATGTCAGTGCCAGTGCGGGAAATGGCCTTATCCAGGGCGGTGATGTCATTGATGTGATTCATGCCATTGAATACAGAACTGATAATGCTGTATCGATGTTTGGCGGACGGCCAGCCAATCACATTAAAGTTATCAACGTTCGTGGGGACAGTATGTGTCCAACCATTGAGCCAGGAGATCTCATCTTCGTTGATGTCAGTATCAATCAGTTTGATGGAGATGGTATCTATGTATTTGGTTTTGATGATAAAATTTATGTCAAACGACTGCAAATGATACCTGACAAACTACTGGTGATTTCTGATAACCAGATTTACCGTGAATGGGGAATTACCAGCGAAAATGAACACCGGTTTATGGTCTTTGGAAAGGTCTTAATCAGCCAGTCACAAACCCTTAAGCGACACAATTAACCCTTACCTCCTCATCAATTAGCCACCCAAAGGTGGCTTTTCATTACCCTTTAAATTGCATATCTCGCAACAAAAACACTTGCATAATGCGCAACTTCATTTTATCTTTCTTTCCAGACAAACAAACAAGGTACTAACAAAATTTGGTTGTAACACGGCGTATGGCACATGCGTCGTTAGCGGTCTGGGGACGTTAAAGGGGACAATCCACTCCTTGCTCGGGCAAACAAACCAGGTAGCCGGAATGTGCAAGTCAATGATGATGCTGATAAGACGCCTAACCAGCGTGGCGATTCGGTTTGACGCCTGGGAAGAGACCAGGGTGCAACGATGAGGGCATTTATGGAGCCGCGACAAAGTGTGGTGCCGTAACTGGCTAAGTGCTCTCAGCGTTGTGGTAATCCGCGAAATGGCGCGGCGGTAAGTATGGCGGGGTTACTCTTTCCCCGTTGAGGACACCGGATTGTCAGGTTGACCATACGCCTGAGTGACAACCCCACCACAACAGCCACTGCTTTGGCGGTACCAGTTTGTACCCTTGCTTCCGGCTGGTACCGCTCTTTTTACAAAACAGAGAAGAGCATCACCGGACGACGGGCTCATAACCCAATCCATCCGGGCGGCAGTCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACCAACCGACCTTGCAGGGTCGATATGATGGGGAGCAGCAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATCGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCAGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCGCTCAACCTGGAAATCGAACCGTTTGATGAGAACCGTGTGAAAATCAAACACAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAAGACCAGGGACAATTACTGACTCTTGCCGGTGAACCTGACGGAAAACTCCGCGCAGCAGGTCATTAATATCGTTCTTAATTAACTGATTATTTATCTCATCACTGAATATCTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACGCAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGCATCGATGACTTTACCCGTTATTCTAAAGATCTTGCAGATGAAGGCACCCGCTGCTTTATCGATGCTGATAATATGCGAGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACTCTCAAACTGAAAAAGACAGCACCGTTCTCTGCCCTGTTGTCTGTTAACGGCGAGCGTAACTCCCAGAAGTCACTGGCAGAATGGATTGAAGACTGGGCCGACTACCTTGTGGGCTTTGATGCTAATGGTGACGCCATTCAGGCAACAAAAGCGGCTGCGGCGGTCCGTAAAATCACGATTGAAGCAAACCAGACCGCTGATTTTGAAGACAATGACTTCAGCGGCAAACGCTCTCTGATGGAGTCTGTCGAAGCGAAAACCAAAGACATTATGCCAGTAGCATTTGAGTTTAAATGCGTTCCATTTGAAGGCCTGAAAGAACGTCCGTTTAAATTACGCCTCAGCATTATCACTGGTGATCGCCCTGTACTGGTTCTGCGCATTATTCAGCTGGAAGCAGTACAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAAAAATTCAAAGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCATTTATGGAAACGTAATTAACTCAATAATCGCCTGATGGCGAGGGTTTTCTTTAACCAAAATTCAGCGCGGTGCAGCGCATATACGTGGAGAACAAAATGTCATTTATTAAAACTTTTTCCGGGAAGCATTTTTATTATGACAGGATAAATAAAGACGACATCGATATTAACGATATCGCGGTTTCCCTTTCAAATATCTGTCGCTTTGCCGGTCATCTTTCGCACTTCTACAGCGTCGCCCAACATGCGGTTCTTTGCAGCCAGTTGGTGCCGCAGGAATTTGCTTTTGAAGCGTTAATGCATGATGCAACAGAAGCGTATTGCCAGGACATTCCCGCACCACTGAAACGCCTTCTTCCTGACTATAAACAAATGGAAGAAAAAATAGACGCCGTAATCCGTGAGAAATACGGGTTACCCCCAGTTATGAGTACGCCCGTGAAATATGCCGATCTCATCATGCTGGCAACCGAACGCCGCGATCTCGGGCTTGATGATGGCTCTTTCTGGCCTGTACTGGAAGGTATCCCGGCAACAGAGATGTTCAACGTGATTCCACTGGCACCGGGCCATGCCTACGGGATGTTTATGGAACGCTTTAACGAGTTATCGGAGTTACGCAAATGCGCATGAATGTTTTCGAAATGGAAGGGTTTCTTCGTGGGAGATGTGTACCGCGAGATCTGAAAGTAAATGAAACAGATGCTGAATACCTGGTGCGTAAATTCGATGCGCTTGAAGCTAAATGTGCAGCACAGGAAAACAAAGTAATACCAGTGTCAACTGAACTGCCACCAGCAAATGAAAGTGTTTTGTTATTCGATGCTAACGGAGAAGGCTGGCTAATTGGCTGGCGTTCTCTCTGGTACACCTGGGGACAAAAAGAAACCGGAGAATGGCAGTGGACATTTCAGGTCGGGGACCTTGAAAACGTCAATATCACTCACTGGGCAGTAATGCCAAAAGCACCGGAGGCTGGAGCATAATGACCACTTTTACCGACAAAGAACTGATTAAAGAAATTAAAGAGCGTATCAGCAGCCTTGACGTGCGAGACGATATTGAGCGCCGTGCTTATGAAATCGCACTCCTATCTCTGGAAGTAGAACCAGATGAACGCGAAGCTTATGAATTATTCATGGAAAAGCGTTTCGGTGACTTAGTAGATCGTCGGAGAGCAAAAAACGGCGATAACGAATACATGGCATGGGATATGACTCTCGGTTGGATCGTCTGGCAGCAACGAGCTGGTATCCATTTTTCAACAATGTCACAGCAAGAGGTGAAATAATGGAGCCATACAGCCTCACACTCGATGAGGCCTGTCATTTTCTCAAGATATCCAGACCGACTGCCATTAACTGGATACGCACAGGGCGTCTTCAGGCAACACGCAAAGATCCCACTAAGAATAAATCTCCTTACCTCACAACACGACAAGCCTGCATTGCGGCTCTTCAGTCTCCGCTGCATACTGTCCAGGTGAGCGCGGGTGATGGCATAACAGAGGAAAGAAAATGTCACTCTTCCGCAGAGGTGAAATATGGTACGCCAGTTTCACATTGCCGAACGGTAAAAGATTTAAACAGTCTCTTGGAACAAAGGACAAAAGGCAGGCGACAGAACTCCATGACAAGCTAAAGGCTGAAGCATGGCGGGTCAGCAAACTTGGTGAAATACCTGATATAACGTTCGAGGAAGCGTGTGTCAGGTGGCTTGAAGAGAAAGCACATAAAAAATCACTGGACGATGACAAAAGCCGGATCGGATTCTGGCTTCAACATTTCGCAGGAATGCAACTAAGAGACATTACTGAATCAAAAATTTATTCAGCAATGCAGAAAATGACGAACCGGCGTCATGAGGAAAACTGGAAACTCAGGGCAGAAGCATGCAGAAAAAAAGGGAAACCTGTTCCAGAATACACGCCAAAACCAGCGTCCGTTGCAACGAAGGCTATGCATCTTTCATTTATAAAGGCCCTACTAAGAGCCGCAGAGCGTGAATGGAAAATGCTGGATAAGGCACCAATTATTAAAGTGCCCCAACCAAAGAATAAACGGATCCGCTGGCTGGAGCCCCATGAAGCACAAAGACTGATTGATGAATGTCCGGAGCCATTAAAGTCTGTTGTTGAATTTGCACTGGCAACAGGCCTAAGACGCTCGAACATCATCAACCTTGAATGGCAACAAATAGATATGCAGCGCCGGGTGGCATGGATAAACCCGGAAGAGAGTAAATCAAACCGCGCAATTGGCGTTGCGCTGAATGATACTGCATGTCGCGTACTGAAAAAACAAATCGGGAATCATCACCGTTGGGTATTTGTGTACAAGGAAAGCTGTACCAAACCAGACGGAACGAAAGCGCCAACAGTAAGGAAGATGCGGTATGACGCAAACACAGCCTGGAAAGCGGCGCTGAGACGGGCTGGTATTGATGATTTCAGATTTCACGACTTGAGACACACCTGGGCAAGTTGGCTGGTTCAAGCCGGAGTCCCGTTGTCAGTGTTACAGGAAATGGGAGGCTGGGAGTCTATCGAAATGGTTCGTCGATATGCTCACCTTGCACCTAATCACCTTACCGAACACGCACGGCAAATAGACTCGATCCTGAACCCATCGGTCCCAAATTTGTCCCAGTCAAAAAATAAGGAAGGTACTAATGATGTGTAACTTATTGATTTAAATGGTGCCGATAATAGGAGTCGAACCTACGACCTTCGCATTACGAATGCGCTGCTCTACCAACTGAGCTATATCGGCCCTGAAAGGACATGTTCACGAACGTGAATCACGGTGGACAAGGTTAAAACTAACCGGGCGATGCGTCAATGGCCTTGTGAATCAAATGGCTACTTTTGCATCACCCGGTTTTATTTACGCACGAATGGTGTAATCACCAATACCGATCCACTTGTAAGTGGTCAGTGCTTCCAGCCCCATTGGGCCACGCGCGTGGAGTTTTTGTGTGCTTACCGCCACTTCCGCACCTAGTCCAAACTGGCCGCCGTCGGTAAAACGCGTAGAGGCGTTAACGTAAACAGCGGACGAATCCACTTCGTTAACAAAACGCTGGGCGTTGCGCATATCGCGGGTCAGGATCGCATCGGAGTGTTGTGTGCCGTGTTCACGAATATGGGCGATGGCATCGTCAAGATCACTGACGATTTTGACGTTCAAATCTAATGACAGAAACTCATCGTCATACTCTTCCGCTTTAACAGCCACCACCTTCGCGGGGCCTGTCTGCAACTGCGCCAGCGCAGCTGCATCTGCGTGTAATGCCACGCCGCTTTCCTCCATTTGTTTGCTTAATGCGGGCAGGAAGCTATCGGCGATGTTTTTATTCACCAGCAACGTTTCTACCGTATTACATGTGCTCGGACGCTGAGTTTTCGCGTTGACGATCACTTTTAATGCTTCAGCAATCTCTACACTTTCATCAACATAAATATGGCATACGCCTATACCACCTGTGATCACCGGGATCGTCGACTGTTCGCGGCACAGTTTATGCAAACCAGCGCCACCACGCGGGATCAGCATGTCGATGTATTTATCCATACGCAGCATTTCACTGACCAGCGCACGGTCAGGATTATCAATCGCCTGCACGGCACCCACCGGTAAGCCACAGGATTTCAGGGCGTCCTGAATCACCGCCACCGTTGCCGCGTTAGTGCGACAGGTTTCTTTACCGCCACGCAGAATCACTGCGTTACCGGTTTTCAGGCACAGCGAAGCGACATCAACCGTCACGTTCGGGCGCGCTTCATAAATCACGCCAATAACCCCCAGCGGTACGCGACGACGCTCAAGACGCAGGCCGCTGTCCAGTACGCTGCCATCGATTACCTGCCCCACCGGATCGGCGAGGTTACACACCTGGCGCACATCATCGGCAATGCCTTTCAGCCGTGCGGGCGTCAGTGCCAGACGGTCAAGCATCGCTTCGCCAAGGCCATTGGCACGCGCGTCAGCAACATCCTGGGCGTTAGCGTTGAGGATGATTTCGCTTTGTGCTTCCAGTTCATCGGCGATTTTTTCCAGCACGCGATTTTTTTCGCGGCTGGAGAGTTGCGCTAATTTATACGAGGCTTGCTTCGCGGCAATGCCCATTTGTTCCAGCATCAGCCTGCTCCTTAACGGGTAATCATGTCATCACGGTGAACGGCAACCGGGCCGTATTCATATCCCAGTATTGCATCAATTTCTTGCGAGTGGTGCCCGGCAATACGGCGTAATGCATCGCTGTTGTAACGACTGACGCCGTGGGCGATATCGCGACCTTCGAGGTTGCAAATGCGGATGACTTCACCACGCGAGAAATTGCCAGTCACGCTTTTAATGCCTTTCGGCAACAGGGAGCTGCCGCGTTCAAGAATGGCGGCAGTTGCCCCTTCATCTACCGTGATTTCACCCGCCGGCGGCGCACCGAAAATCCAGCGTTTACGGTTTTCAAGCGGAGTCGCCTGGGCATGGAACAGCGTACCGACGGAAATGCCTTCCATCACATCACCAATAACGCCCGGCTTGCTGCCCGCGGCAATAATGGTGTCGATACCCGCACGGCAAGCCACGTCAGCGGCCTGCAATTTGGTACTCATGCCGCCAGTTCCGAGGCCTGAAACGCTGTCACCGGCAATCGCGCGCAGTGCGTCATCAATGCCGTAAACATCTTTAATCAGTTCTGCCTGCGGATTGCTGCGCGGATCAGCGGTATACAAACCTTTTTGATCGGTCAGCAGCAACAGTTTATCGGCACCCGCCAGAATCGCCGCCAGCGCAGAAAGGTTATCGTTATCGCCGACCTTAATCTCTGCCGTAGCGACAGCATCGTTCTCATTGATTACCGGAACGATATTGTTATCGAGCAACGCACGCAGGGTGTCGCGGGCGTTCAGGAAGCGTTCACGGTCTTCCATATCAGCACGGGTCAGCAGCATCTGCCCGACGTGAATGCCATAAATCGAAAACAGCTGTTCCCACAGTTGAATCAGTCGACTCTGCCCTACCGCCGCCAGCAGTTGTTTCGAGGCGATAGTCGCTGGCAGTTCCGGGTACCCCAGGTGCTCACGTCCGGCGGCGATCGCGCCCGACGTCACAATAACAATCCGATGCCCGGCGGCATGTAACTGCGCGCACTGGCGAACAAGTTCAACGATATGGGCACGGTTCAGACGGCGCGATCCGCCTGTTAGCACACTGGTGCCGAGTTTTACCACCAGCGTCTGGCTGTCACTCATGATTCTCTGCCATTCAATTTTAGGAAAAATGATATCAAACGAACGTTTTAGCAGGACTGTCGTCGGTTGCCAACCATCTGCAAGCAAAGCATGGCGTTTTGTTGCGCGGGATCAGCAAGCCTAGCGGCAGTTGTTTACGCTTTTATTACAGATTTAATAAATTACCACATTTTAAGAATATTATTAATCTGTAATATATCTTTAACAATCTCAGGTTAAAAACTTTCCTGTTTTCAACGGGGCTCTCCCGCTGAATATTCGCGCGTTAATTAAAATCAGGAATGAAAATGAAAAAGAGCACTCTGGCATTAGTGGTGATGGGCATTGTGGCATCTGCATCCGTACAGGCCGCAGAAATATATAACAAAGACGGTAATAAACTGGATGTCTATGGCAAAGTTAAAGCCATGCATTATATGAGTGATAACGACAGTAAAGATGGCGACCAGAGTTATATCCGTTTTGGTTTTAAAGGCGAAACACAAATTAACGATCAACTGACTGGCTATGGCCGTTGGGAAGCGGAGTTTGCCGGAAATAAAGCGGAGAGTGATACTGCACAGCAAAAAACGCGTCTCGCTTTTGCCGGATTGAAGTATAAAGATTTGGGTTCTTTCGACTATGGCCGTAACCTGGGCGCGTTGTATGACGTGGAAGCCTGGACCGATATGTTCCCGGAATTTGGTGGCGACTCCTCGGCGCAGACCGACAACTTTATGACCAAACGCGCCAGCGGTCTGGCGACGTATCGGAACACCGACTTCTTCGGCGTTATCGATGGCCTGAACTTAACCCTGCAATATCAAGGGAAAAACGAAAACCGCGACGTTAAAAAGCAAAACGGCGATGGCTTCGGCACGTCATTGACATATGACTTTGGCGGCAGCGATTTCGCCATTAGTGGTGCCTATACCAACTCAGATCGCACCAACGAGCAGAACCTGCAAAGCCGTGGCACTGGCAAGCGTGCAGAAGCATGGGCAACAGGTCTGAAATACGATGCCAATAATATTTATCTGGCAACTTTTTATTCTGAAACACGCAAAATGACGCCAATAACTGGCGGCTTTGCCAATAAGACACAGAACTTTGAAGCGGTCGCTCAATACCAGTTTGACTTTGGTCTGCGTCCATCGCTGGGTTATGTCTTATCGAAAGGGAAAGATATTGAAGGTATCGGTGATGAAGATCTGGTCAATTATATCGATGTCGGGGCTACATATTATTTCAACAAAAATATGTCAGCGTTTGTTGATTATAAAATCAACCAACTGGATAGCGATAACAAATTGAATATTAATAATGATGATATTGTCGCGGTTGGCATGACCTATCAGTTTTAA
Protein sequences of DBSCAN-SWA_8 >CP031653|3809734:3835453|3821301_3822111_-|AXP28115.1|DBSCAN-SWA MNNLMVIDGIEVRRDAYERYSLNDLHRAAVASGANARTKEPGKFLSSQQTIDLVHELINTQNLGVDPVSVIHGGNERGTYVCKELVYAYAMWISPSFHLKVIRTFDMVTSAPEKLSGQAADKMQAGVILLDFMRRELNLSNSSVLGACQKLQEAVGLPNLAPRYAIDAPADAPDGSSRPTLSLSALLKQYGIRLTANQAYHQMAKLGIVEQRERYSRTAINNIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESRFTELLKLLDTVH >CP031653|3809734:3835453|3829479_3829842_+|AXP28130.1|DBSCAN-SWA MRMNVFEMEGFLRGRCVPRDLKVNETDAEYLVRKFDALEAKCAAQENKVIPVSTELPPANESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA >CP031653|3809734:3835453|3830598_3831537_+|AXP29212.1|integrase|DBSCAN-SWA MDDDKSRIGFWLQHFAGMQLRDITESKIYSAMQKMTNRRHEENWKLRAEACRKKGKPVPEYTPKPASVATKAMHLSFIKALLRAAEREWKMLDKAPIIKVPQPKNKRIRWLEPHEAQRLIDECPEPLKSVVEFALATGLRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVALNDTACRVLKKQIGNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWKAALRRAGIDDFRFHDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYAHLAPNHLTEHARQIDSILNPSVPNLSQSKNKEGTNDV >CP031653|3809734:3835453|3825819_3826080_-|AXP28123.1|DBSCAN-SWA MQSPLRNVRKAHGFTLQHVAAGVQVNPATLSRIERLEQIPSIDLAERLANFFKGEISEMQILYPARFQSSQNQNGFKPQEQEVSRG >CP031653|3809734:3835453|3828952_3829489_+|AXP28129.1|DBSCAN-SWA MSFIKTFSGKHFYYDRINKDDIDINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKQMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFNVIPLAPGHAYGMFMERFNELSELRKCA >CP031653|3809734:3835453|3822839_3823493_-|AXP28118.1|DBSCAN-SWA MSNKYCQALAELRNKPAHELKEVGDQWRTPDNIFWGINTLFGPFVLDLFTDGDNAKCAAYYTAEDNALAHDWSERLAELKGAAFGNPPYSRASQHEGQYITGMRYIMKHASAMRDKGGRYVFLIKAATSEVWWPEDADHIAFIRGRIGFELPAWFIPKDEKQVPTGAFFAGAIAVFDKTWKGPAISYIGRYELEACGEAFLAQVRQQAEKLVREMAA >CP031653|3809734:3835453|3834397_3835453_+|AXP28135.1|DBSCAN-SWA MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNDSKDGDQSYIRFGFKGETQINDQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNENRDVKKQNGDGFGTSLTYDFGGSDFAISGAYTNSDRTNEQNLQSRGTGKRAEAWATGLKYDANNIYLATFYSETRKMTPITGGFANKTQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGIGDEDLVNYIDVGATYYFNKNMSAFVDYKINQLDSDNKLNINNDDIVAVGMTYQF >CP031653|3809734:3835453|3810374_3810578_-|AXP28100.1|DBSCAN-SWA MNTNEDILFTKDVMKILRYGAMSAFINFWKDENNGFPQPFRIGRRHTWHRRDVEAWLDKQREQANPH >CP031653|3809734:3835453|3824914_3825094_-|AXP28121.1|DBSCAN-SWA MRQVNRWFKDHYGVPVRVIRWEPETQRVIYLREGYEHECFSPLEQFRRKFREIEVGHEH >CP031653|3809734:3835453|3819230_3819479_+|AXP28112.1|DBSCAN-SWA MQIAQLIFWIGVIMIIPAFSRFCYSASALLWRRLFPTKVFEFRYHDEDSGVTKTLVIKVPSKKGKMLTTLIDEAIAENSKRK >CP031653|3809734:3835453|3822130_3822520_-|AXP28116.1|DBSCAN-SWA MKLILPFPPSVNTYWRHPNKGAFAGKSLISAAGRKFQSAACAAIVEQLRRLPKPTSAPASVEIVLFPPDNRIRDLDNYNKALFDALTHAGVWEDDSQVKRMLVEWGPVIPEGKVEITISKYEKTAGAAA >CP031653|3809734:3835453|3816393_3816609_-|AXP28108.1|lysis|DBSCAN-SWA MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >CP031653|3809734:3835453|3823492_3823987_-|AXP28119.1|DBSCAN-SWA MIMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFPRLTERAQDPEPQPVRETRPVRNFYVGTNDSRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA >CP031653|3809734:3835453|3817878_3818073_-|AXP28110.1|DBSCAN-SWA MLKQQDMTETAKVVFNELNGKPATVGEIAQNTYLSRERCQLILTQLVMAGLADYQFGCYRRLQQ >CP031653|3809734:3835453|3809734_3810316_-|AXP28099.1|DBSCAN-SWA MNKNIAVTGKGYARSVKKFCDIRDLVVLRFDGVDVRVVYLNGDPWFVAKDVCEALEISNSRDALKALDVDEKNTVALSYGIRGNPNHSVVSESGFYKLIARSRKAVTPGTFAHRFSNWVFRNVIPGIRKTGAYGIPWGALQDFSRRKEQYQISASEKGRELQACKRKKRELEEEEKRLIREYQPEFYFGERIQ >CP031653|3809734:3835453|3825269_3825827_-|AXP28122.1|DBSCAN-SWA MGNHHWKVEKQPEWYVKAVRKTIAALPGGYAEAAEWLDVTENALFNRLRADGDQIFPLGWAMILQRAGGTHFIADAVAQSANGVFVSLPDVEDVDNADINQRLLEVIEQIGSYSKQIRSAIEDGVVEPHEKTAINDELYLSISKLQEHAALVYKIFCISESNDARECAAPGVVASIASGCGETNA >CP031653|3809734:3835453|3819921_3820290_-|AXP28113.1|DBSCAN-SWA MMRDIQMVLERWGAWAASDSSGVDYSPIAAGFKGLLPYTSKTRQACSDSDALIIEGCLARLKQKRPDEHSLLVAHYLYGISKRKLAKARKKDEKLIRIEIQMAEGFIEGCLSMLDLCLEMDQ >CP031653|3809734:3835453|3820304_3821294_-|AXP28114.1|DBSCAN-SWA MRALLTPEIAPRMGIVLFRPGSELMPLFMQGRVLLEPEPESYSSFASGAVPAASQPLADDPAVRAVFRNEAVIRRAGGVECLESWLLREKGCQWPHTDWHSENMTTMRHAPGAIRLCWHCDNQLRDQFTERLESMATDNCARWVLSVVRRDLGFDDSHVVTMPELCWWLVRNDLADALPESAARKALRLPKPVVSSVTRESDLVPSVPATSIIQDKAKNVLALKVDPESPESFMLRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHADTVAFEDKYGSQLELIFRFIDRALATGVLA >CP031653|3809734:3835453|3831741_3832995_-|AXP28133.1|DBSCAN-SWA MLEQMGIAAKQASYKLAQLSSREKNRVLEKIADELEAQSEIILNANAQDVADARANGLGEAMLDRLALTPARLKGIADDVRQVCNLADPVGQVIDGSVLDSGLRLERRRVPLGVIGVIYEARPNVTVDVASLCLKTGNAVILRGGKETCRTNAATVAVIQDALKSCGLPVGAVQAIDNPDRALVSEMLRMDKYIDMLIPRGGAGLHKLCREQSTIPVITGGIGVCHIYVDESVEIAEALKVIVNAKTQRPSTCNTVETLLVNKNIADSFLPALSKQMEESGVALHADAAALAQLQTGPAKVVAVKAEEYDDEFLSLDLNVKIVSDLDDAIAHIREHGTQHSDAILTRDMRNAQRFVNEVDSSAVYVNASTRFTDGGQFGLGAEVAVSTQKLHARGPMGLEALTTYKWIGIGDYTIRA >CP031653|3809734:3835453|3822516_3822843_-|AXP28117.1|DBSCAN-SWA MTTLTQCQQQVLDMLISYQKERGFPPTNQEVATMLGYRSVNAAVEHLRALEKKGVITIKRGVARGITLHTAVKDDDSEAVGIIRSLLAGEENARLRATHWLHERGLKV >CP031653|3809734:3835453|3818343_3819213_+|AXP28111.1|DBSCAN-SWA MTKTTSFQIFYDAEDNELAQHKIDAKTLSISIGSMADLISAADKRLNDGQQTVKLMVTNPAEAGSLGVSYTMMELVPHAVDVAKVIGLTGIAGATIGAPALSLIRQLGSKKVISVTKRAGTEESVLELEGEEIVCHDSVAKLVTDPEVRDALVNVVRAPLDGKQGAVFKVLNDEGEEVVRLEGSETEEIKPLPRGTLLEKEESVEEVNVRFVQINFEGTKGWRIDYLGEEHAVTFEDQLFIHQVQNGIISFTKEDLFVVELKTIKTFTARNATTKYAITKVKRKRPAEA >CP031653|3809734:3835453|3829841_3830147_+|AXP28131.1|DBSCAN-SWA MTTFTDKELIKEIKERISSLDVRDDIERRAYEIALLSLEVEPDEREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTLGWIVWQQRAGIHFSTMSQQEVK >CP031653|3809734:3835453|3826051_3826204_-|AXP28124.1|DBSCAN-SWA MSDSVSYVHAFITFLYCALSKSTCAKYLREIICSSRNMWRSYAITITECA >CP031653|3809734:3835453|3810590_3811829_-|AXP28101.1|DBSCAN-SWA MPVKPLTVTEVKGMKPREKDYAVYDGFGLLLNVSKAGGKVWRFRYSHPITKKRQTYTIGRFPEFSLAEAREIRDELRRMIARGVDPVTEKKNRKIEMSLKNLQTFEAVANAWIAFKKGSELRKPTLYNIEYEVYKYLVPFFGKYSIEKITAPVAINALDAVSDKNALQKKLISRLNEIMNYAVNCGALKTNPLLKIKTAFTGKKNKSLAALPVERLPEFLSWWDSVPHKYQIAHNALLFQILTMVRPGEAIKAEWSEIDFDSGLWIIPAHKMKCHREHVVPLSSQAIRILRTMQKIKRGRYIFFSSRTKDAPMGKNTINTPIAASKFKGIVTLHGFRSMWSTLLNEEGFNPDVIEAALAHKSGDKIRDIYNRTTYLEQRKIMMQWVGDFFDEARKGVINRSGGKKGLRIVNG >CP031653|3809734:3835453|3815291_3815828_-|AXP28106.1|DBSCAN-SWA MKPRKITIVAVLLVAVVIIIAVLSVLLVRSRSALETEQSENRVLRNDNALQATVITTQAFNFNRFNQIAENANRLNSLIDAGTEKTVIEYREILRREKTCDLPVPADVAGGLLKYAYRLRASAMHPDTGNTNATDDSTAAASSMTYCQAVLWIEPLLAVIEKGNNNLAGIREIEKLRK >CP031653|3809734:3835453|3826925_3827204_+|AXP28126.1|DBSCAN-SWA MHISQQKHLHNAQLHFIFLSRQTNKVLTKFGCNTAYGTCVVSGLGTLKGTIHSLLGQTNQVAGMCKSMMMLIRRLTSVAIRFDAWEETRVQR >CP031653|3809734:3835453|3814720_3814921_-|AXP28105.1|DBSCAN-SWA MTPAEFYDVYNIKPAEMLTGETVNNFASRVMAQQTSTSGNTGVWYSQGTATQSKQNQTTSHQLYTY >CP031653|3809734:3835453|3813814_3814717_-|AXP28104.1|DBSCAN-SWA MPNWSDVLGEITALAHKSPMDEVRRKYLSQLSNHTGRNVITYYSGWLQHGGAEVRHLTQMTDDDKNGLMTAINGLDVSKGLDLILHTPGGDIAALESIGHYLRSKFGTNIRAIVPMISMSCGTMLACCANEIIMGKQSNIGPIDPQFNGFSTHAIIEEWNRAQTEIFQNPAAVQMWQFILQKLNPTIIGECEKAIKWANEIVKHWLMTGMFDNDPEAESKATHVCSELNNHHTTYTHSRHIHFDKAQKIGLNVTELESDQVLQDLVLTIHHSYMHSFGGAPLAKIIENHNGNAMIWNIQS >CP031653|3809734:3835453|3812985_3813477_-|AXP28102.1|DBSCAN-SWA MDHELKNLVLNINQLAALSGLHRQTVVARLKNIRPAGGHDKLKLYRLTDILTEFMGLPPPVAEGEMDPHERKAWYQSERERLKFEQETAQLIPASDVRREFAIWAKAVVQVLETLPDILERDCGLQPAAVSRVQSIIDDLRDQIALRVTEAGADDEEELQQEE >CP031653|3809734:3835453|3827305_3827500_-|AXP28127.1|DBSCAN-SWA MTAARMDWVMSPSSGDALLCFVKRAVPAGSKGTNWYRQSSGCCGGVVTQAYGQPDNPVSSTGKE >CP031653|3809734:3835453|3813466_3813694_-|AXP28103.1|DBSCAN-SWA MGPPGGVAYHGAGASRKKASFCIFIGHHHLCILLIINGYLFFVCRIERFLFDIERVFLKLFARCMFKALRRKYGS >CP031653|3809734:3835453|3830146_3830497_+|AXP28132.1|DBSCAN-SWA MEPYSLTLDEACHFLKISRPTAINWIRTGRLQATRKDPTKNKSPYLTTRQACIAALQSPLHTVQVSAGDGITEERKCHSSAEVKYGTPVSHCRTVKDLNSLLEQRTKGRRQNSMTS >CP031653|3809734:3835453|3816676_3817729_-|AXP28109.1|DBSCAN-SWA MKNTVKINSVDLINADCLHFIQSLPDDSIDLIVTDPPYFKVKPNGWDNQWKGDEDYLKWLDHCLAQFWRVLKPAGSLYLFCGHRLASDIEIMMRERFNVLNHIIWAKPSGRWNGCNKESLRAYFPATERVLFAEHYQGPYRGKSDGYAAKERELKQHIMAPLISYFRDARAELGITAKQIAEATGKKNMVSHWFGASQWQLPNEADYRKLQALFFRIAAEKFQEQQLEQPHHQLVASYDSLNRKYSELLDEFKSLRRYFSVSVSVPYTDVWMHKPVQFYPGKHPCEKPADMLRQIINASSRPGDLVADFFMGSGSTIKAAMALGRRALGVELESERFNQTVKEINELVGK >CP031653|3809734:3835453|3827572_3827935_+|AXP29211.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGH >CP031653|3809734:3835453|3823983_3824925_-|AXP28120.1|DBSCAN-SWA MSTKLTGYVWDGCAASGMKLSSVAIMARLADFSNDEGVCWPSIETIARQIGAGMSTVRTAIARLEAEGWLTRKARRQGNRNASNVYQLNVAKLQAAAFSQLSDSDPSKSDASKSDPSKFDASKSGKKAGFHPSESGGDPSVKSKHDPSDKKPSRPDASQPDTQTDEQDFLTRHPDAVVFSPKKRQWGTQDDLTCAQWLWKKIIALYEQAAECDGEVVRPKEPNWTAWANEIRLMCVQDGRTHKQICEMYSRVSRDPFWCRNVLSPSKLREKWDELSLRLSPSVSTHTEKREDPYFKASYDNVDYSQIPAGFRG >CP031653|3809734:3835453|3828000_3828825_+|AXP28128.1|DBSCAN-SWA MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAVRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >CP031653|3809734:3835453|3826177_3826870_+|AXP28125.1|DBSCAN-SWA MNIGNRVRQLRQAKNMKIADLAEAIGVDAANISRLETGKQKQFTEQALSNIARSLGVDIADLFTSDVKSNTVCKNSISEDVAQVKDVFRIEMLDVSASAGNGLIQGGDVIDVIHAIEYRTDNAVSMFGGRPANHIKVINVRGDSMCPTIEPGDLIFVDVSINQFDGDGIYVFGFDDKIYVKRLQMIPDKLLVISDNQIYREWGITSENEHRFMVFGKVLISQSQTLKRHN >CP031653|3809734:3835453|3833006_3834110_-|AXP28134.1|DBSCAN-SWA MSDSQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQLHAAGHRIVIVTSGAIAAGREHLGYPELPATIASKQLLAAVGQSRLIQLWEQLFSIYGIHVGQMLLTRADMEDRERFLNARDTLRALLDNNIVPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLTDQKGLYTADPRSNPQAELIKDVYGIDDALRAIAGDSVSGLGTGGMSTKLQAADVACRAGIDTIIAAGSKPGVIGDVMEGISVGTLFHAQATPLENRKRWIFGAPPAGEITVDEGATAAILERGSSLLPKGIKSVTGNFSRGEVIRICNLEGRDIAHGVSRYNSDALRRIAGHHSQEIDAILGYEYGPVAVHRDDMITR >CP031653|3809734:3835453|3815827_3816394_-|AXP28107.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGDDGLEGVSYIPYKDIVGVWTVCHGHTGKDIMLGKTYTEAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSTLLRKINQGDIKGACDQLRRWIYAGGKQWKGLMTRREIEREVCLWAEKPQVLGDGLGPLNPGIPVSVPGVF |
39 | Shigella_phage(37.5%) | integrase,lysis | attL 3829862:3829876|attR 3833542:3833556 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_9 |
4221061 : 4227620
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >CP031653|4221061:4227620|DBSCAN-SWA GATGAAAATTGCGCTGGTTATTTTCATCACCCTTGCCCTGGCGGGCTGTGCGCTGTTATCACTCCATATGGGAGTGATCCCCGTGCCGTGGCGCGCGCTGCTGACCGACTGGCAGGCCGGACGCGAGCATTATTATGTATTGATGGAGTACCGACTGCCGCGCTTGCTGCTGGCACTGTTTGTCGGTGCAGCCCTCGCCGTGGCGGGCGTGCTGATACAGGGGATTGTGCGCAACCCTCTGGCATCACCGGATATTCTCGGCGTTAACCATGCCGCCAGCCTGGCCTCTGTGGGGGCTCTACTTCTTATGCCGTCACTGCCCGTGATGGTGCTGCCGCTGCTGGCCTTTGCGGGCGGCATGGCGGGGTTGATATTACTGAAGATGCTGGCAAAGACCCACCAGCCGATGAAGCTGGCGCTCACCGGCGTGGCGCTTTCTGCATGCTGGGCCAGCCTGACGGATTATCTGATGCTCTCGCGCCCACAGGATGTGAACAACGCCCTGCTGTGGCTGACCGGCAGCTTATGGGGCCGTGACTGGAGCTTTGTGAAGATTGCCATCCCGCTGATGATTTTATTTCTGCCGCTGAGCCTGAGTTTTTGCCGCGATCTCGACCTCCTTGCACTCGGCGATGCGCGCGCCACCACGCTCGGTGTGTCGGTGCCCCATACCCGATTCTGGGCTTTGTTACTAGCTGTCGCCATGACATCTACCGGCGTGGCCGCCTGCGGCCCGATTAGCTTTATTGGTCTCGTGGTGCCGCATATGATGCGTAGCATCACCGGTGGACGTCACCGCAGACTGCTGCCTGTTTCAGCCCTGACAGGTGCGTTGCTGTTGGTGGTTGCCGATCTGCTGGCGAGAATTATTCATCCCCCACTGGAGCTCCCGGTTGGCGTGCTGACCGCCATTATCGGTGCGCCGTGGTTTGTCTGGTTGCTTGTGAGAATGCGATAAATGACTTTACGAACTGAAAATCTGACGGTCAGTTACGGGACAGACAAGGTACTTAACGACGTTTCACTCTCACTGCCAACGGGGAAGATCACCGCCCTGATCGGTCCTAACGGTTGCGGGAAATCGACGCTGTTAAACTGTTTTTCGCGGCTTTTAATGCCGCAGTCTGGCACCGTATTTCTCGGCGATAATCCCATAAATATGCTCTCATCGCGCCAGTTGGCCCGCAGGCTTTCGCTGCTGCCTCAGCACCATTTAACGCCAGAGGGGATCACAGTCCAGGAGCTGGTTTCGTATGGTCGTAATCCCTGGCTGTCACTCTGGGGGCGTCTCTCCGCTGAAGACAATGCACGAGTTAATGTCGCCATGAACCAGACCCGGATCAATCATCTTGCCGTTCGTCGGTTAACCGAGCTTTCCGGCGGTCAGCGCCAGCGCGCATTTCTGGCGATGGTCCTGGCCCAGAATACGCCCGTTGTATTACTTGATGAGCCAACCACCTATCTTGATATCAATCACCAGGTGGACCTGATGCGGTTGATGGGCGAACTCCGGACTCAGGGGAAAACGGTGGTCGCTGTGCTGCACGACCTTAATCAGGCTAGCCGGTACTGCGATCAACTGGTGGTAATGGCAAACGGACATGTTATGGCGCAAGGCACACCAGAAGAGGTGATGACCCCAGGATTGCTGAGAACAGTATTCAGCGTGGAAGCGGAAATACACCCCGAGCCGGTATCTGGCAGGCCGATGTGCCTAATGAGGTAGATTGCACAGGCCGTAAGAACCAAACCACGACTGAATGAAACTGGACTGGCGCCAGCAAGCCTGTTCAGACTGGGGCTGAACTTTTCCGGACTCTGAAAGATTACCAATACTCATCGTCCATCCGCTTGCTTTAGGCTGACAGGTTCATAATCAACGCAAACCAGAGCTGTACAGGCTTGGGCGCGGCTTTCAAACCAGTCGTGATCACGGCAATCAATTTTGAACTCTGCTTAACGGACATTTCTGTATAACCCTTACGGCAACGAAAAACGCGAAGTTAAAATTTTAGAAACCCAAAAACGTGACATGACTAAGTTTAGATTTCAGGGGGGGAGATCAAAAAATTTCGCTCTGTGCCAGAGCGGACATTCACGGAGCTGGTTCATTACCAATGAGGTTGGGCTTTTGAGGATAAATCAATGATCAGACGCCAACGTAAATCAAAAGCACCCCTGGAACGGAATTGCTAATCCAGTTTCTGACCATCGATTTTTCTAAAAAGTGTGCGTTTGCTACTACTTAGGTAGGTGCAGCTTTCTTAATCACCGGCAGCCACGTTATACAGGCCAGTTGATGGATCGATTGTTATCAATGATATCTTTATGAGTCGGTGTCTCACCCAGCTTACCGAAGCTGGCATAAAGTGAAGGCAGACGGGCCCGTCCTTCTCCCTTTTTCGCCAGAGGGAAAGCGCGAAGCATGGTGGCAGCCTCCCAGTCACAACAATAGGATGGTGTGCACGGCTGCTGACGCCATGATTCAGCGATAGAGCCGGAAAATACGGGGTCAAAGCCGGTATCATTAACCAGAGTCATCGTGACCTGTACTGCGGCTGGATGATCTCCTGCTACTGCAACTGCTAGGCGACCGCGGCTCCCTTCAGGTAGTCTGTTGTGGTCAACTAAAACTGGCCTCCGCGTTAGAGTTTTTCCAGTATCGGTTTTCTGATTCGTTTGGTGGTAACCCACCATTATATTCGTGCGGTCTTAGTGCGCTGTAATATCCAACGATATAGTCCGTTATTGCGTGAGCTGCATCGCTGAAGCTTACATAGCCCGTCGCTGGCACCCATTCGTTCTTCAGACTCCTGAAGAAGCGCTCCATTGGGCTGTTATCCCAGCAGTTTCCACGCCGACTCATACTCTGCCTGATCCGGTATCGCCACAGTAACTGCCGGAACTGCCTGCTCGTATAATGACTGCCTTGATCGCCTGGAACATCACCCCGACGGGCTTACCACGGGTTTCCCATGCCATTTCCAGTGCTTTCATGGTAAGCCTGCTGTCCGGCGAGAACGACATGGCCCAGCCCACTGGTTTTCTTGCGAACAGGTCGAGAACAACGGCGAGGTACGCCCAGCGCTTACCCGTCCAGATATAGGTCACATCACCGCACCACACCTGATTTGGTTCCGTTACGGCGAACTGTCGCTCAAGATGATTCGGGATAGCAACGTGCTCATGACCGCCACGCTTATACCGGTGAGTCGGCTGCTGGCAACTGACCAGCCCCAGCTCTTTCATGAGTCTGCCAGCAAGCCAGTGCCCCATCTGGTAGCCTCTCCGGGTTGCCATTGTGGCGATGCTTCTTGCTCCGGCAGAGCCGTGGCTGATGCCATGCAGTTCAAGTACCTGGCTGCGTAATACAGCCCGTCTGCCGTCTGGTTTTTCAGGACGGTTTTTCCAGTATCTGTAGCTGCTGCGATGAACCCCGAACACATGGCAGAGTGTGACCACAGGATAATGCGCTCTGAGTTTCCTGATTATCGAGAACTGTTCAGGGAGTCTGACATCAAGAGCGCGGTTGTAGATTCAATTGGTCAACGCAACAGTTATGTGAAAACATGGGGTTGCGGAGGTTTTTTGAATGAGACGAACATTTACAGCAGAGGAAAAAGCCTCTGTTTTTGAACTATGGAAGAACGGAACAGGCTTCAGTGAAATAGCGAATATCCTGGGTTCAAAACCCGGAACGATCTTCACTATGTTAAGGGATACTGGCGGCATAAAACCCCATGAGCATAAGCGGGCTGTAGCTCACCTGACACTGTCTGAGCGCGAGGAGATACGAGCTGGTTTGTCAGCCAAAATGAGCATTCGTGCGATAGCTACTGCGCTGAATCGCAGTCCTTCGACGATCTCACGTGAAGTTCAGCGTAATCGGGGCAGACGCTATTACAAAGCTGTTGATGCTAATAACCGAGCCAACAGAATGGCGAAAAGGCCAAAACCGTGCTTACTGGATCAAAATTTACCATTGCGAAAGCTTGTTCTGGAAAAGCTGGAGATGAAATGGTCTCCAGAGCAAATATCAGGATGGTTAAGGCGAACAAAACCACGTCAAAAAACGCTGCGAATATCACCTGAGACAATTTATAAAACGCTGTACTTTCGTAGCCGTGAAGCGCTACACCACCTGAATATACAGCATCTGCGACGGTCGCATAGCCTTCGCCATGGCAGGCGTCATACCCGCAAAGGCGAAAGAGGTACGATTAACATAGTGAACGGAACACCAATTCACGAACGTTCCCGAAATATCGATAACAGACGCTCTCTGGGGCATTGGGAGGGCGATTTAGTCTCAGGTACAAAAAACTCTCATATAGCCACACTTGTAGACCGAAAATCACGTTATACGATCATCCTTAGACTCAGGGGCAAAGATTCTGTCTCAGTAAATCAGGCTCTTACCGACAAATTCCTGAGTTTACCGTCAGAACTCAGAAAATCACTGACATGGGACAGAGGAATGGAACTGGCCAGACATCTAGAATTTACTGTCAGCACCGGCGTTAAAGTTTACTTCTGCGATCCTCAGAGTCCTTGGCAGCGGGGAACAAATGAGAACACAAATGGGCTAATTCGGCAGTACTTTCCTAAAAAGACATGTCTTGCCCAATATACTCAACATGAACTAGATCTGGTTGCTGCTCAGCTAAACAACAGACCGAGAAAGACACTGAAGTTCAAAACACCGAAAGAGATAATTGAAAGGGGTGTTGCATTGACAGATTGAATCTACAGTAGCCTTTTTTAATATTTCATTCTCCATTTCAATGCGTTGTAGCTTTTTCCTCAGCTTACGTATTTCGATTTGTTCTGGTGTTATCGGAGAGGCTTTTGGTGTTTTGCCCTGACGCTCATCACGCAGTTGTTTGACCCATCTTGTCATTGTGGAAAGGCCAACATCCATAGCTTTGGCGGCATCTGCCACCGTGTATGTCTGGTCAACAACCAGTTGAGCGGATTCGCGTTTAAACTCTGCGCTAAAATTTCTTTTTTTCATTGGAGCACCTGTGTTGTTCTGAGGTGAGCATATCACCTCTGTTCAGGTGGCCAAATTCAGTAAACCACTTCACCACTTCAGAGCAGACGAGACAAAAAGAATGGTTGACGCATCAGCTCAACCCCATATAGTTGTAACACTTGAGCCAAACCCTTGGGCCGCTTTTTACTTTGATATTAACATTGCTAATACAGGGAACGCACCTGCCTATAATGTTGAGGTTGTGTTTGATCCTCCACTAGTAAATGCGGAGCATAGAGAAAAAAGTGAGATTCCGTTTAGTAAGGTAAGCGTGTTAAAAAATGGGCAATCACTTACCAGCAATCTCTGTAAGTATGAACAAATCAAAGATCAAATTTATAATATTAATATAAGCTGGGCAAGCAAACCTAAATCAAACGATAGAGAAACAAATGAATATGTGTATGACATGGCGACATTTGAAGGAATAAGTTATCTAGGAGCGAGAAGCCCATTGACGCAAATTGCAGAACAAATTAAAGGTATAAGAGAGGATTGGAAACCTATTGCACAAGGAGCTAAAAAAGTAAAAGCAGACGTATATACTTCAAGCGATAGAAACGAAGAACGCACGTATCTGCAAGAGCAACACGATTTGGCAATAAAAAGGAGAGATGAGAAAAGAGAAAAAAGATTAGAGTCTGGTGAATAATTTTAAAGGGAGTGGGTAACTAACCCACTCGTAACTATAAACCTGTAATTAATCACTTATTTTTGACAACAGATAATTACTGAACGCACTGCAAGTGACTAACATAAATTTAGCTTCAGCTAAGGTAGGATTTACATCTTCTTCTGTTAGCGCATGACGTATTCCACCCTGATCACTTGTATAACCATAGAGCTGACTAAAAGCGCCTTTCATTGCAGAGTGTATATATCCTTTTTCCTCTATAGCTTTAAGACAAGCCCCCAAGGTTCCTTTATCATTGCCCGTGATTTTCCTGCATAAAGATTCAATTGCAGAGATAGACTCTTTAATCGAGTTTCTGTAGTCTGGCTGCTCTCTATCCGTCATTAGTTGTAACGCCCTTTCGAAATGGCTACGCGATGAATCAGTGCCATTATCAACTGCGTTCTGAACACTTTCAATTTCGTTATCATTTGAAATAGGAGTAATACAACCATTTATTATGGTATAACCAACGCCATGCTTTTTAAAGATGGAATTGAGATGCTTCGATAGATTAATATATGAATTAGTTCTCTCAATGATGAACTCAATTAAATCATATACCAAATACCATGCTTCCCCATATATATAATCTCGGATAGCAGTCAGCAACGTCTTATCACTTTTGTATCCACTTTCATAACGAGGAATATTATCCGCAGGTTGATTTAGATAATATATCCACACAGACTGCGCACATTTTGTTGCTGTAGCAGTTTGACGATTGTTAGTCCAAAGGAAAAGATATAAGCAATTCCACAATGCCATGCGTGTATCAGAATTAAGATCATTCAGCTGGACATGCTCTCTAACGTCAACATGACCATACCTCACAGAAAATGGCTTTATCAT
Protein sequences of DBSCAN-SWA_9 >CP031653|4221061:4227620|4225723_4226074_-|AXP28484.1|transposase|DBSCAN-SWA MKKRNFSAEFKRESAQLVVDQTYTVADAAKAMDVGLSTMTRWVKQLRDERQGKTPKASPITPEQIEIRKLRKKLQRIEMENEILKKATVDSICQCNTPFNYLFRCFELQCLSRSVV >CP031653|4221061:4227620|4226174_4226747_+|AXP28485.1|DBSCAN-SWA MVDASAQPHIVVTLEPNPWAAFYFDINIANTGNAPAYNVEVVFDPPLVNAEHREKSEIPFSKVSVLKNGQSLTSNLCKYEQIKDQIYNINISWASKPKSNDRETNEYVYDMATFEGISYLGARSPLTQIAEQIKGIREDWKPIAQGAKKVKADVYTSSDRNEERTYLQEQHDLAIKRRDEKREKRLESGE >CP031653|4221061:4227620|4221061_4222018_+|AXP28480.1|DBSCAN-SWA MKIALVIFITLALAGCALLSLHMGVIPVPWRALLTDWQAGREHYYVLMEYRLPRLLLALFVGAALAVAGVLIQGIVRNPLASPDILGVNHAASLASVGALLLMPSLPVMVLPLLAFAGGMAGLILLKMLAKTHQPMKLALTGVALSACWASLTDYLMLSRPQDVNNALLWLTGSLWGRDWSFVKIAIPLMILFLPLSLSFCRDLDLLALGDARATTLGVSVPHTRFWALLLAVAMTSTGVAACGPISFIGLVVPHMMRSITGGRHRRLLPVSALTGALLLVVADLLARIIHPPLELPVGVLTAIIGAPWFVWLLVRMR >CP031653|4221061:4227620|4224652_4225804_+|AXP28483.1|transposase|DBSCAN-SWA MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPHEHKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRGRRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISGWLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRRHTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIATLVDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSELRKSLTWDRGMELARHLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQYFPKKTCLAQYTQHELDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD >CP031653|4221061:4227620|4226795_4227620_-|AXP28486.1|DBSCAN-SWA MIKPFSVRYGHVDVREHVQLNDLNSDTRMALWNCLYLFLWTNNRQTATATKCAQSVWIYYLNQPADNIPRYESGYKSDKTLLTAIRDYIYGEAWYLVYDLIEFIIERTNSYINLSKHLNSIFKKHGVGYTIINGCITPISNDNEIESVQNAVDNGTDSSRSHFERALQLMTDREQPDYRNSIKESISAIESLCRKITGNDKGTLGACLKAIEEKGYIHSAMKGAFSQLYGYTSDQGGIRHALTEEDVNPTLAEAKFMLVTCSAFSNYLLSKISD >CP031653|4221061:4227620|4222018_4222786_+|AXP28481.1|DBSCAN-SWA MTLRTENLTVSYGTDKVLNDVSLSLPTGKITALIGPNGCGKSTLLNCFSRLLMPQSGTVFLGDNPINMLSSRQLARRLSLLPQHHLTPEGITVQELVSYGRNPWLSLWGRLSAEDNARVNVAMNQTRINHLAVRRLTELSGGQRQRAFLAMVLAQNTPVVLLDEPTTYLDINHQVDLMRLMGELRTQGKTVVAVLHDLNQASRYCDQLVVMANGHVMAQGTPEEVMTPGLLRTVFSVEAEIHPEPVSGRPMCLMR >CP031653|4221061:4227620|4223343_4223757_-|AXP28482.1|DBSCAN-SWA MVGYHQTNQKTDTGKTLTRRPVLVDHNRLPEGSRGRLAVAVAGDHPAAVQVTMTLVNDTGFDPVFSGSIAESWRQQPCTPSYCCDWEAATMLRAFPLAKKGEGRARLPSLYASFGKLGETPTHKDIIDNNRSINWPV |
7 | uncultured_Caudovirales_phage(16.67%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|