Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP032820 | Butyricimonas sp. H184 plasmid unnamed, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP032819 | Butyricimonas sp. H184 chromosome, complete genome | 4 crisprs | DEDDh,cas3,RT,csa3,DinG,PD-DExK | 0 | 1 | 4 | 3 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP032819_1 | 496491-496605 | Orphan |
NA
Consensus repeat of CP032819_1
|
1 spacers
spacers of CP032819_1
>1.1|496522|53|CP032819|CRISPRCasFinder GCTTTTGAATGCGAAAGTAGCTCAGTTGGTAGAGCACGACCTTCCCAAGGTCG |
CRISPR arrays and Neighbor proteins around CP032819_1
The CRISPR arrays of CP032819_1 >merge|CP032819|1|496491-496605|CRISPRCasFinder GGGTCGCGGGTTCGAGTCCCGTTTTTCGCTCGCTTTTGAATGCGAAAGTAGCTCAGTTGGTAGAGCACGACCTTCCCAAGGTCGGGGTCGCGGGTTCGAGCCCCGTTTTTCGCTC >CP032819|1|1|496491-496605|CRISPRCasFinder GGGTCGCGGGTTCGAGTCCCGTTTTTCGCTC GCTTTTGAATGCGAAAGTAGCTCAGTTGGTAGAGCACGACCTTCCCAAGGTCG GGGTCGCGGGTTCGAGCCCCGTTTTTCGCTC
>CP032819.1|AZS28452.1|495682_496195_-|gamma-carbonic-anhydrase-family-protein MAIIRELNGHTPKFGKNCFLAENAAVIGDVEMGDDCSIWYGAVLRGDVHYIKIGNKVNIQDNATIHATYQKSPTNIGNNVSIAHNAVVHGCTIHDNVLIGMGAIVLDNAVIESNSIVAAGSVVTKGTVVESGWVYAGTPAKKLKQMGPELLRDEVERIVNSYSMYASWYK >CP032819.1|AZS28451.1|495002_495653_-|ribulose-phosphate-3-epimerase MSVIVSPSLLSADFLHLSKDIEMVNRSQADWFHLDIMDGVFVPNISYGLPVVSQIKKMATKPLDVHLMIVQPERYVEAFHKAGADILTVHYEACTHLHRTIQQIKAQGMKAGVSLNPHTPVSLLEDIIKDIDVVLLMSVNPGFGGQSFIEQTINKVDKLKKLIIESNSHTLIEIDGGVNFETGKRLVNAGADALVAGSFVFNATDPEANIKGLKEL >CP032819.1|AZS31843.1|494044_494920_-|ATP-binding-cassette-domain-containing-protein MEAIIKLENAVIRQQKNTILKDVSLEIKEGEFVYLIGKVGSGKTSLLKTLYAELPLAYGKGEVAGYQLSKIKKSQIAYLRRKCGIVFQDFKLLTDRNVYANLEFVLKATGWRDKKAIKERINEVLDKVELPDKRDKFPHQLSGGEQQRIVLARALLNAPPIILADEPTGNIDPETSYRLIELLKDICDSGKTVVIATHQYDLIEHFPGRVFRCENGALQEDFSFAEKLAARNKVMNEGLESELINIDNSETIENQENIDLSEIQVEPHLKIIEEKKEKDNDDSDVIGFELL >CP032819.1|AZS28450.1|491029_493774_-|TonB-dependent-receptor MKISKLIMIMCFSVMGCALTQGQIYTPTTDNKIHFFGVIVDSITKEPLPGAVIYQITEEKFRKLTQFTVADKNGKFQFETPKYFSTRVEISCVGYKVRKFVLPKDHQTMNMGKVWLAPDIQKLDEVTVSARAKMYKQFGDTTRIYARSVKTLKGDAIIEILRQIPGIKVNQDGSVSMDGKPIEYTLLNDKLIFGDDKITALYTIDADQAMTIDIYDEEIESATSFKKQKSTVMNIRTKKDFNRYFTVEALLEGGQNHIKHENGNDPKIYNLNGKLGTFQEGFQAQLNTMKSNFMGQQDRYNQVLKNEMPRKQEYIRGKLVREFGDFNNYLRDRYSITGSYSKDKNQYIIRKENTYFPTEDFESQLANNENKQTSANTIYNVAVDGNYTSTPNIKAHFSGLFMSKDSDQSRVNQSETFRNGTLLTTTHNRINSEINHAYSSSYLNFAYKLNKKNFMHINLGINHEKPKRTESHDLEIMNSELETHTTLGVKNNSPHTDLSSSLHVTHLFDTSIVSGHSISLQIETLHKKAKINNTAVDKQTGLIDDIYSENYEMNTQTLSSGLEFIHKIPGRSITSRVTFNHIAINRDERIPEKYVIRKTFNTWSPFINISWEGEKRGNISLTFSISQQIPHVKLFSNILNISNPMFPQTGNPDLDPVKEYNFSMKHRLSSVKKQISLNTEVRLKYYTDNVVYKRQYFKNNTILPEYDSYEFPAGSTLISPINGNDYWDLKISSSIQKQINYLGRLETIVSYTFRDPQSEVAGRLVRQQEHKGTFNIKLATNFSRHIRLNLSNSTSYLWNQNSEKYKERGIQNKATVHLLLNFLQYANFESGYIMDYYNPSVSNTKINSHLLNAMLGYRIFKKRKGLISLNAFNILDKNTNFKTSVSNQFISNSWDRLFERYYTISFEYKFNSQK >CP032819.1|AZS28449.1|490533_491004_-|hypothetical-protein MKKQMKLLFTILFSCTITSVCAQLQQGQLMNIKFSTTKPIQKGDSVTIVVEWDPQYISTIIFENVYTPTKEEVNKRKFTLVVKPQKTTTYQITRIYTEWKPNTLPHTIKVLDENGVEIKEDKTEQKEVTTNKEIPTPNKHSNNSSKNKKKNKKKMP >CP032819.1|AZS28448.1|490087_490537_-|hypothetical-protein MKKYIIPLLIVLLCSIIVGGYYQLKKIGITNMSLNKDKPSIQKGDSVNITTQIVSEKKIMEIKYSTREPIPKGDSFTFHVTWDTLYISNIIFKDVYSPTKEEMKNGMYTLVVHPEKTTTYQIQKDYTTGESRILPHEVKVLDEHGIEIK >CP032819.1|AZS28447.1|489542_489998_+|SPOR-domain-containing-protein MRIIVSVILLAFITISCGTNKRVYAPPFEEEETESVVVKENVRTTAQNPTKKTENTKPVVSREENVTMTHGDVLKRYNVIVGSFSNVDNAVRLQAKLNGMGYHSIIMKNSAGMSRVSIAGFDEEGAAREELLKVREQYPEFADAWLLISKQ >CP032819.1|AZS28446.1|488351_488612_+|hypothetical-protein MTYQDALDYLRIAENAYNVQAYSESAEIVEKLAYFAIDRENGLSPQQRVEITEAVKQAIGRFTFCPDEYIWEKTCGLIDLFRWQIK >CP032819.1|AZS28445.1|484328_487703_+|hypothetical-protein MNIDFIMLVARNEMKLLRRNWLFIFFVFLTILGLILLQSVLQHVFPVYHFYALSCSIPFINTYLFNLFQSFFLVFIAGNYIYRDCKLDSLDAILVRPYSNLDYLIGKWIAIISLFITVYIINLIILGIFHIGNGSYPFTFFPYLFYLLTLALPVLLFVTGLTVWLNVVFKTPFLSMFILFGYILVDVFYLSDIQFGCFDFLAMTIPNVFSDIVGHVGVSTYLLQRFAYILFGISFLFFAVSRLQRLAGSIKDVRRCILLGIILFGIGVGCGWSYYWHYYKINQKRKQYIALYEEYKDNERIRISEQKIVYKQEGEWISVLDSITVYNPNKKKIKDVILYLNPSLTVNNVTCMGEKVHYRKNEQVLLLDYPIGCGEYRNFVIHYSGKIDESVCYLDVDDSEYNNTKWSNSILRYGKRTAMVEEAYTFLTPECLWYPVCAPLVNPVQPLASEISFSNYSLTVVHDTMYTVISQGKPSRSREGSYFKNTNPLPGLTLCMGKYNCRSLLIDSTLFEIYFFNEGNNFLAILDGSQKGVVEGIRGVKEKFEYKYGIKYPFSKMTLVEIPVSLCSYSRIGKEGSEFVQPELVFQPENWCKNSQYVSMKNYTREMEKMRPMQSVEVSEKEKISSWSESYFNSLAMEFPKMDLLSFLSNHQLFLTPVKNMSSVAFWFTNFTGCLYSDKYPYIDYFIRQMLMNNRVQILQNSIEVGSTKDDSVIDYLSSHSLQNVLSGSRLSSFEGSILKLKSQYFAKYIYCHIDQDEFKAFLVDFYSRNLFREVPFEQFVEEVQQKFHFDFLTFVEEFYRMSGVPFFFIRNLDQKIMTEGQFCETLVSFDVWNPSECNGVVTLYSEGDDYTPDLQEVKSIPVYSGTCIHVSVPMKKRKWNILLHTNFSQNNPDSYFKLFSQELKTTSDSFMEIVSIDTICFTSDKDEILVDNEDDMFCTTMSESRETWLSKLLHQNTPKYGMMTDFLQGKFSGWLPSVFNDAIGEPVRSFCGKVVKNGTGKATWKALLPEDGDYELYVYQLGLNHRFQYQVELFSYCYSVEQKGYKANIRVDILPSGTRNVFLQDNEGRDEDSTFDMQIFESRWIFLGKYKLSKGEISISLLDRGAFPGQLIFADAVKWVKRR >CP032819.1|AZS28444.1|483844_484294_+|hypothetical-protein MNCIILNTCAMCKKYLYMMSILICLLFVNFSAFCVDLVVVVCNNQGKPIVGTVVMIHESGTQDYLRVNTDKFGEACFAVAAFSHCDIFVPGLSMALLPVTSIMIPSNCNEIRVTVSPTVWSDSYTSLNYLVELLQLCAILPENKLIMIA >CP032819.1|AZS28453.1|496951_498097_+|IS256-family-transposase MEFTHEQISEIISEITNGESGFHGLVKRGLESLMLTERSLHNETLSDVSNGFRGRRVCHGGKVFELRVPRSRNSNFYPMLLGVLKDQEEEAQKLVSSLYCSGLTTEQVGKIYEQFYGKHYSKSQVSRLLNTAREDVNAWLGRKLEKRYPILYIDATYVLTRRDESVSNEAYYTVLGVKEDRTREVLTVVNFPVESATNWKDVFEDLKARGVQEVNLLVCDGLPGIENVLADTFPLADLQLCTVHLKRNIAGKVKPRDKKQVLEELKQICAPDQWEISPEKAFLKFKEFIARWQKSYPVLKRYCHDRYRFYFTYFKYEREIRGMIYTTNWIERLNRDYKRVINMRGAMPNPRAVILLMGTVAQNADIYKYPIYNFLESRLFY >CP032819.1|AZS28454.1|498650_499544_+|IS1595-family-transposase MSNNANLFYFFKQFPDEDSCRKYLEKRRWGNTPTCPHCGNAQKIYRYKNGKTFKCAHCKKQFSVKTGTIYENSNIPLQKWFLTFYLISLSKKGISSIELSKTIGVTQKSAWYLLHKVRYMLEHRNSDNQLRGTVEVDETYVGGKKKGKRGRGSENKTPVFGAVQRGGRLSITPVPNAKRKTLEPIIHKRVKKGTHINSDEWWAYTKLCSDYAHDVVNHRRKEYVRGKVHTNTIEGAWSHLKRSIMGTYHRPSREHLSKYCAEFEYNYNTRKNSPEIKFKKIIDKNLIRVSYRTITGT >CP032819.1|AZS28455.1|499558_499909_+|hypothetical-protein METVSVKLVVFKHPEPARWDTYCSYCPELQYFFGRGETVNEVVEQVHQTLLEELQNRNAYKNLHNLGWNITDTSIKVPIFTDEEAVSLTEQSYEVTISNYQIVKIDVEVPPARKLW >CP032819.1|AZS28456.1|499910_500204_+|hypothetical-protein MYPIATDIFEHFLSESGFTCTNITNGFDIHKKGELYIKIPQKKNLTKNQVKNLLTKAGLQIEDFNLYIEHTKATNLFNFLVDESLKGPSLKNKNTES >CP032819.1|AZS28457.1|500200_500707_-|hypothetical-protein MEKLLLALCLLFVSCGNNKKTESTTQETSQPVTETKAAKEPKTLIAGDELTFPSKNGGEVKFLFKEINTIKLSDGTYNLVLKVQIKNNLTQQLLISDVSWKLTDTDMIEVEESGVYDSEFEMFKPAMFFFTTVDAGFGKVEEVGYKVNKGTYYLSIWGYTSAKIEIRD >CP032819.1|AZS28458.1|501070_501709_+|hypothetical-protein MISIAFLLSSYSIAINDNKKNSELGVALTSREIHAIKDLKEMVLGFTGKSRMTAIPFSFAEILNIDSTNITLLPYYDFDTAIFYEDPSPEKILESLTVSDDLYFAISKDGCLAYSLLAKKKEDSWVLSRFSENWGEVIKWFPERLQDADSRDFKIFRVGGIEYFLYYKNGKTIYSTNIGQEISEEILCEIILESIHQVDENKKYLDSIGVHF >CP032819.1|AZS28459.1|501773_502076_-|Dabb-family-protein MLKHIVMWKLKEFAEGKTKAENALIMKESLERLVGIVPEIKSLQVGINEKTSDMAYDAVLISTFEDTDALARYKVHPEHVKVSNYCKKIRDSRVFVDFHE >CP032819.1|AZS28460.1|502136_505643_+|AAA-family-ATPase MNASEIGGGLKNECKNKRVMVEKARKSKISKKQTVTDINFKFKSIRTSCSEVVDFTGNKRYRTVFRLNEIETLFVDTEVINKQFDEKPWDAEIVAALIYMQGDRPMLVAQKGGKVTIPETESVFQFSTTFSAEDFTDFKFQEGVYRVQVMIDGQAGVSDEIHLLEPHDLFRDYFQLLDIGFDKCVEEAPDVQRPHSYQAFSAKGLEDIRFYLVAENFLSSEWTYEFLINVFDSNGILKASRVAKGNYYIPNREGKRLLCFALDLGAGLNNFWEEGKYRVDVLCFDQPVIHLEFSIGEQDLPYNFTDEIAEVNGQVHEVAGVPSTPQDKEEALSRLYQLVGLRKVKEEITRMYELAEFVKMRQENGFNDKLPVLNMILMGNPGTGKHLVTEITGEMFYRLGVLVNGKVHRYKREDFTRPGMDAEEQLIREAIAKSMGGILLIEDADELYPTNDPNDPAMRIFTVLLGILEQEQPSLLVVMAGDVMTLQAIVEGVPGMKKCFPYQFVFDDYSAEELMEITHQMLEKRQFKFTPEAEDKFSDMLKDCCSVRESGFSNGRFIEERLDDASTRMAKRLMSNHKGTRTKEDMMLIQVEDIDAPEQPDPSKSVDALNEMIGSKELKNSLISYINYIYFIRERQKHGYADVIPPLHMLFTGNRGTGKTTVARMLGEIFESAGILESSMVTVRSRGELIGDGSIPPQQIAMYVFEQARGGILFLEDAHTLFQDNVGAAALSVIFGQLSPTDNGDTIVILSGDPEAMDKALAGNPKVKTLFPYHFHFSDYTPEELLEIAVQKVAEKNYTLHPKAKEAFKALVSQVCNEHDKFFGNALFVEKMVDKAIHNLSARTMKIRKERELTRKEITTLMAVDIPTATSELPNSYRDTFDEKEIASALKDLDHMVGQTKLKKQIHDFVDLARHYNQQGIKLNTRVSLQWCFTGNSGMGKGTVARIIARIYKAMGIIDKSEVTSFKVERLLGLTEEDVIQSIGTALLQSKGGLFFFDEDSSSLNEVAGFRDRVRAILVNQLATQPGAYNVIYAKQDPPRQIINDEVEKVSDMINILVFEDYTADQLMEILKRDLATDNTRMTRTAQQHMEQFISYIVANKKRSHASARLIKLVAEMMMRNRVQRLAKSKKANDMETKHSVTKQDVEMFTPAMLDSMISERNTIGYKQ >CP032819.1|AZS28461.1|505639_506953_+|MATE-family-efflux-transporter MNRRILHLAIPSIVSNITVPLLGLVDVTIVGHLGATAYIGAIAVGGLLFNILYWNFGFLRMGTSGLTSQAYGRKDKDAEIRILVQAVSVGLFSALAMLILQYPIERLAFRLLDTSAEVEQYTVTYFRICIWGAPAVLAQYGFTGWFIGMQNSRYPMYIAIVMNVINIVCSSCFVFLFGMKVEGVALGTVVAQYSGVMMALGLWFYNYKELWGRMTFKGSLQLIAMRRFFAVNRDIFLRTLCLIGVTTFFTSTGARQGDVILAVNTLLMQLFTLFSYIMDGFAYAGEALSGRYVGACNLIQLKRAVRALFGWGVGLSLVFTLLYGVGGENFLGLLTNDKIVIETAGHYFYWVLAIPLAGFAAFLWDGILIGATATRFMLWSMLVASGSFFVIYYCFSGATNNHMLWLAFLVYLALRGGMQTLWSRRVFTLEYLQRLRS >CP032819.1|AZS28462.1|507039_510729_+|phosphoribosylformylglycinamidine-synthase MAIVFYQKDATVYAVHYKESENPLDVSKLEWLFSGAKKVQSNSLENFFVGPRREMITPWSTNAVEITQNMGISGILRIEEFTRVEDEKAAFDPMLQAFYKGLNQDTFTIDKKPDPIVYIEDIRAYNQQEGLALNEDEISYLEGLSKKLNRRLTDSEVFGFSQVNSEHCRHKIFGGTFIIDGEEQEESLFSMIKKTSRTNPNRIVSAYKDNCAFIEGPAAVQFAPATPDKPDFYTQKDINTVISLKAETHNFPTTVEPFNGAATGSGGEIRDRIAGGQAALPLAGTAVYMTSYPRLEAGRSWESAIDPRKWLYQTPEDILIKASNGASDFGNKFGQPLIVGSVFTFEHEEHEKTYGYDKVIMQAGGIGFANKEQAMKKTPEAGEQVVLLGGDNYRIGMGGGAVSSVDTGVYANAIELNAVQRSNPEMQRRVCNAIRSMAESDVNPIVSIHDHGAGGHLNCLSELVEETGGLIHLEKLPVGDPTLSDKEIVGNESQERMGLVMKAKDVETLQRVADRERAPMYVIGETTGDHRFVFEDARTGEKPIDLEMTDMFGNPPKTVMEDKTQREHFADIEYSEDKVEEYIEQVLQLEAVACKDWLTNKVDRSVTGRVARQQCAGPLQLPLNDLGAMALDYRGERGIATSIGHAPVAALANPACGSRLAIAEALTNLVWAPIEQGIKGVSLSANWMWPCKNAGEDARLYKAVQAASDFAIELGINIPTGKDSLSMTQKYGKDKVYAPGTVIISTVGEVTDVKKIVSPVLKPDQDSEIVYIDFSFDKLQLGGSSLAQVLNRVGKETPDVKDSVYFVDAFMAIQRLVEEGYVLAGHDISAGGMITTLLEMCFADNRLGLNIDFSYLAEKDIVKILFAENPGVLVQIKDCKKVAAILDEAGVAYNFLGRLGKAGKLNIKKDSKTFNLDIPSLRDLWFKTSYLLDRRQSGNELALERYKNYKNHDLKYKFTPSFSGKLSQYGLDVNRVKPSGIKAAVIREKGCQCERETAWAMHLAGFDVKDVHMTDLVSGRETLEDVNFIVFVGGFSNSDVLGSAKGWAGAFKYNEKARVALENFYKRENTLSLGICNGCQLMVELDLVYPEQGEKVKMLHNTSHKFESAFLSVDVVESNSVLLKSLAGSRLGIWVAHGEGRFQLPYGQEEYNIPLRYSHDTYPANPNGSSFATAAICSKDGRHLAMMPHLERSLHPWNWGYYAEDRNGDEVSPWIEAFVNARLWIEEHK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP032819_2 | 783336-783424 | Orphan |
NA
Consensus repeat of CP032819_2
|
1 spacers
spacers of CP032819_2
>2.1|783359|43|CP032819|CRISPRCasFinder TAACACAATTTGATAAAATCAAGGGGATATTTTTCTCATTTTA |
RT |
CRISPR arrays and Neighbor proteins around CP032819_2
The CRISPR arrays of CP032819_2 >merge|CP032819|2|783336-783424|CRISPRCasFinder TGACAAAATCAAGGGATTAAAAATAACACAATTTGATAAAATCAAGGGGATATTTTTCTCATTTTATGATAAAATCAAGGGATTAAAAA >CP032819|2|2|783336-783424|CRISPRCasFinder TGACAAAATCAAGGGATTAAAAA TAACACAATTTGATAAAATCAAGGGGATATTTTTCTCATTTTA TGATAAAATCAAGGGATTAAAAA
>CP032819.1|AZS28674.1|781942_783289_-|DUF4143-domain-containing-protein MGRYLPRSIDNILLAWKKESKHKPLLLRGARQVGKSSAVKHLALEFDNYVEVNFEKQPQLKELFVGELDVKQIVAKLAVFFGVTIQPGKTLLFFDEIQMCKEAISSLRFFYEDYQELHVIAAGSLLEFVLNEIPTYGVGRIRSLFVYPMTFDEFLMATGNEKLIIEKKAASASRPLMEVFHQKLVELFRIYLMVGGMPESVVTWVDEGDYLQCQQVQDEIIVSYEDDFAKYKKRVDTTLLRLTVRGVVHQVGEKLMYSRISRDYRSGQVKEALELLRLAGLIIPVVHTAANGLPLGAECNDSFVKYLYFDSGLLLRILNMDLGDISKITEQLLVGGATDLVNKGFLTEMVAGLELLRYQSPTQRHDLYFWMRTEKNSMAEVDYLITRNLKILPIEIKAGIKGGMKSLFNFMKDKNVEVGIRSSLENFGEIISDGKRVEICPLYALSNL >CP032819.1|AZS28673.1|780641_781562_+|DMT-family-transporter MKKDELKGHISMTTANMMWGLMSPISKMVLITSIITPFIIVEIRIIGAAILFWIASIFTKREHVSPPDLLKLFFAAMLGVLFNQGLFTIGLGMTSPVDASIITTSTPILTMIIAAIYLKEPITSKKVSGIFLGASGALLLIVSNQNIGGTTHNSNIWGDTVCLVAELSFALYLVLFKQLISRYSPITLMKWMFTYAAICITPFSFQNMTQLEWLSLEPNTWYGLSFILLGSTFVSYLLSPIGQRHLRPTVVSMYCYVQPIIASCVAVYWGMDSFNLLKIIAVICVFSGVFLVTRSKSRADLEAEVN >CP032819.1|AZS28672.1|779728_780550_-|AraC-family-transcriptional-regulator MKRENMFQAFELIVREYNTFPLPVHQHTFFELAYIVSGTGTFQVSLHDTPYSAGSLFLILPNTSHVFKIESYSHFVYLRFTEHYLEQYFTSDERDLIRSVQHTGSVLCDPKDRENVRDLVNVIVREYTQVRVYSNELLNYWLRSIIVLIVRNLIVSESLGVFRVEEEKIMQIIQYIQAHIREPRLLQLASLGQRFHLSETYIGRYFKRHTNENLKDYILRCRLASVEDLLLHTSMRINEIAAIMNYTDESHLIREFKKYKNMSPSDFRSGRKN >CP032819.1|AZS28671.1|777697_779635_-|AraC-family-transcriptional-regulator MNVNKCPLKYVISLCLTVWAAGALFAQPVVSSSDSAYGRALYKRIMRLNPDSLLRAHADPAVYQRTITAESGSDSIYSREDKRWFYIDRNEFHRVIAAGPSYDTYEYIYATYKGYFYHLSREKLLHEADLAEQAADKYPGDALRNEADYLRVWYNNLGRDSTVWIAEQQAAIKAFQRKGATWWALRVKGHLFQLTAGFGTSYPVAFATAEELIHDIDGLDCRQYPFNGNMYGDIGLLYYKFRDYEAAVPLLKKATRETPGFYFDDSNLKARNSLAIYYRQSGDRKRSDDYFISILQSPDTIYARAVYDAVALANLAHNLTRAGLYCEAIGLYELSLPIMISDNDYSFASGIAVGMAECYMGLDNWEWAKQWADTATAYIREHIHFINPHRARQIYPILAKYHQQKGDAATAGKYIDSLSMAHADYRDEFDAMLLMRTRQELLAEKNRSQHERLQQQQTAITRLVVFAVIAVLVSMLIYYLYHKKKAAYRELVEKNKRWADSDRLEQIIDPVSPNHTITEEKGEDGNRQQGPTEKDVELAARIHKIMVEGQIYRDNSLSLDTLSQKLEVTRETVSRAINRTTGKNFTRFLNEYRVKEAVRILSTGRNHTVNFDELAEQVGFNSRITFHRAFKQLTGLPPAEFKRNS >CP032819.1|AZS31867.1|776325_777435_-|DUF3575-domain-containing-protein MMRKTLLAGLFLLFLSLPGYGSETLYSGSPTGGDTIITFRFVPGEDMFYVPYGGNNTELDRLYALVDEYRAEITSGRMPVDVDGYCASGETPEASFRIAVTRSNRVKSELIVHKRLAEEHFITKNHVSAYTAPDGKTYRDMVVVTLRIPAKEQPKQPKLVKEEPRCEEPPVEERQPESVVEQQPEPVVEIAVPAKFYCFAVRTNLLYDAFLLPTLGVEWRVNRHIGIKVDGSHSWLGNEKGKVQKIWLVSPEVRWYLLNNKRFYVGVGANIGEYNIYKGMLGSLFSDDTGYQGKLWGAGLTVGYQLCLSHRFSFDFNLGLGYTRMEYDRFTVSNKVRVFKDRDKTKNFWSPTQAGISLIWTIGSNNIVR >CP032819.1|AZS28670.1|775479_776313_-|hypothetical-protein MTVTKLMMAAGAIIFLTACDVKDPIHETDHPDKAKITVTADWSGIGQGIAKPSGYTVAFGDRTFTAMADKYTLPDLIEPGNYTLYFYHEADNITINGTASTADYAAGMPGWFFTGRLDAVIDGGKHHELTVAMKQQVRQLTLVIEATGGTVDKIATITGTLSGVAGSLDMSNDTHGTPSDVALTFAKGTDGKWTATVHLLGITGAKQKLSGTIKFTDGTPDDMILDSDLTAALATFNDNKNEPLTLGGQAEETPSPAGFTTIINAWNKITGGSVIAN >CP032819.1|AZS28669.1|773611_775444_-|hypothetical-protein MNKKKLLLLLAFPLFLLAGCGHDDIEGDGPDIPVGQNGDIKFKIGFVQPDGVAYATEDNDAPRTRVATDNQFRSTWEDGDEIGVFAVTHGQPLAASDNFIHNVKLTYSSADDSWSGAVYWPTASSGITSLDFYAYYPYDDNGGTPATLDPIAITFTVKTDQSGKTTVGSAEKSNYNLSDLLTAKADNTGSGYGKGETVSLQFSHALAMVQVSIDDVKYKALNTNEDITVELRNVRTAAGLNLFSGAVAASGDAGGVTMRRIEQPSDADYESSFTFCALIPAQVVARDARIFLISNGDLLLNGSPLTAQLDMTARTVESFTQQIPYTALPFIPAGSFIMGSPGNEPNRGTNETQHKVTLTKGFYITKYAITNSQYVEFLNAKGIQGEMLKHPNIPGAGGNMIGGEYSGEPLIYEGNTNQWGVKWDTNKWIPQPGYEDHPVVWVTWYGAKAYAEWVGGSLPTEAQWEYACRGDKGSLPFGVGDGYKLDNTLANFDWKYSWSWDGSSPTADITDTGTCPGVTQAVGSYSPNSYGLYDMHGNVLEWCLDNSDWVPADYGEASVTDPTGPSAGNRRVLRGGYYGFSAQYCRSAYRSNYEPDDANFNFGFRVVFVP >CP032819.1|AZS28668.1|772948_773443_-|helicase MQIQIFNIPLTDTGEALSEMNRFMTGRKVLEVEQRFFQNEKGGGWSFCVRYLPDVTQSGATGSRAKVDYKQVLTESQFAVFSRLREIRKALANEDGVPAYAVFTDEELAGICRLAELTEKSLATLPGIGEKKIIRYGKSMLKRYHTDNNNHTAQADEIQIGGQG >CP032819.1|AZS28667.1|771953_772952_-|RNA-directed-DNA-polymerase MKREGYLMERIADPDNLRLAFWKARKGKNYNDEVKHYRFRLDANLAELRRELLAGDVPVGDYRYFHIYDPKERMICAAAFRERVLHHAVMIVCHPVFERFQTDDSYASRIGKGQYAALEKAKTHTRHYRWLCKLDVRKFFDSIDHETLYGLLSRRFKDPALLGLFRRIIDSYRTAPGCGLPIGNLTSQHFANYYMAYADHHVREALRAPAYVRYMDDMVLWHDDKTELLRITAGLTSFIGERLWLTLKPPCINNTDKGVPFLGYVLFPGRVRLNRNSKKRFRCKMNSYDIKLDTGEWSQARYALHVQPLVAFTRFADAREFRQISAKKREVG >CP032819.1|AZS28666.1|771816_772065_-|hypothetical-protein MVSGAVCAPCAAAGGFHPLCRCAGIPADIGKEKGGWLKTGNNRVLRGGYYGNSAQYCRSAYRNNNKPDNANNNFGFRVVFVP >CP032819.1|AZS28675.1|783546_784746_+|MFS-transporter MNTSQVIITPGNITKDRKISICVFLSGFSCFAQLYYFQPLLPDLAQEFGLSASHSSLAISFSTLGMVIGLFTAMFVADTIPRKKLISAALLSSAVFSVICSYSPSFFLLVALSTLKGFLLSGATSVSLAYISEEVQPQNKGKITGLYIAGNALGGMGGRVISSYLSSEFSWRVASVSIGVLCALFAISFLIFSPRSVNFKPKRESFKSLIVSNLHLIVSVKLIPFYLIGSLMLGIFVSLYNYLGFYLIKEPFNFPPYLIHYIYFMYLFGVFGSIATAKLTALYNHFKILKTIIALSVAGLLLLYINNFWLVTLGLAIFTFNFFVVHVICNRIVSDYNLQKRSVTISIYLLFYYMGSSIWGSATGVILDHFGWQWFIAGLILLTFILYAIAYKGSKLMGN >CP032819.1|AZS28676.1|785298_785907_-|DNA-binding-response-regulator MQNNNKSYRIVIIEPSMIISTGLKKLIEMRNEFEVVAVIADCFHSLERINHLNPDVIIINPSVVELKKRQHLEELFDGVKDTAFVALVYQYIDPEVLKQYHTTIDIADDGDKIAQKLLHSIDALSAPADLLDKNELSEREKEILISLAKGKINKEIADLHHISVHTVITHRKNIIRKTGIKSVSGLTVYAILNNLIDINEVE >CP032819.1|AZS28677.1|785893_786655_-|hemerythrin-domain-containing-protein MNKLGLESIISNICGKNRDMKSHLFSADMKLADVIHADYSLLLLLHRFGINLGFGDKTVQECCEANHVSCTLFLMICNVYSNEQYLPTEDEIEGIGANVDQLIAYLKNSHSYYLDNRMLAIQEQLKEISEGCEQQHQQILYLFFNEYKNEVIRHFEYEEITVFPYISNMMKGVRPGDYDIGVFRGNHSNIDDKLNDLKNIIMKYLPGDTLSDMRIRVLFGIFALEEDLSKHSLIEDKILVPLVMKLEQRYAKQ >CP032819.1|AZS28678.1|786812_788741_-|dihydrofolate-reductase MKKQAYLLEQFDDIKILRYDVPAFEGLSLREKLFVYYLSQAALAGRDILWDQNNKYNLRVREALEKILRAYRGNRETEEFQAFLVYAKKVFFANGIHHHYSMEKFIPSFTPEYFKTLLREVGAEPLYGEVERVIFDPEYMAKRVVLDEGKDLVKASANHYYEGVTQEEVEKFYGEKKKENALLSWGLNSTLVRDEKGQLHEQIWFAGGKYGKEIRHVADYLKKASEYACNEHQKEVIEWLIRYYETGDLSWFDRYSIEWVKEVAAPIDFINGFIEVYGDPLAYKASWESVVQIMDEEACERTRKLADNAMWFEQNAPIDERFKKSEVVGITARVVQAAMLGGDCHPATPIGINLPNAEWIRERYGSKSVTLDNITYAYNQASLNSGVLDEFAYSDEEKELVRTYGYVGGNVHTDLHECLGHGSGKMLPGVGQEALKNYYSTIEEARADLFALYYVMDPKLVELGVIPSLDVAKSEYCNYIRNGLMVQLTRVKLGNNLEESHMRNRQLIASWVYEKGKETNVIERVNREGKTYFTIRDYEALRMLFGRLLAEVQRVKSEGDFEGARDLIENYGVRVDPVLHQEVLERYQKLNVAPYAGFVNPKYRLVEKEGSVVDVEIEYPTDFLTQMLEYGNNNEPDSPLFE >CP032819.1|AZS28679.1|788737_789598_-|hypothetical-protein MYYLKCKHCNHLNEVKSEYLVVCGLCNRKLEDNYKDWKVKHPEKSFEDFQREVCLTEEQVKRGDIVKPGGKKGNKKGWIAGCIGGILVVTLMVVFVMKGMKNDSDELYGDGNTPVSEQQWYTGTYAGGVSLATPKAMKSAGNVINKLPEEERPLVKEMQTFSYHANDLEFTYLYAEYTPEAGQVDLEKAVNAGLNQMKDQPGVFDLKNDIAYVEEGDLSGVVMQGTYVKRGTEYEFKITLYAAGLKLYQFISSNRKGDDTAREVARRIYGSLKISGYGTTAEGGTE >CP032819.1|AZS28680.1|789601_790276_-|TrmH-family-RNA-methyltransferase MHTVKEQVEYLSEFVTDYKNDLFRRLIAERTSYVTMVLEDFYQQHNCSAVLRSCDCFGIQNVHVIENTNTFKDNSEISMGAADWLTVHRHRKRENNTEETIDMLKSQGYRIVATTPHERDTFIDDIDLHKGKMAFFMGTELTGLSDDVLSRADEFVKVPMYGFTESYNVSVCAALLMYSVVQRLRQTDINWHLSEEERYQVLFAWYKRSIKASAQILERFNHNE >CP032819.1|AZS28681.1|790449_790719_-|HU-family-DNA-binding-protein MNKAQLIDAIAEKAGLTKADSKKALEAFVATIGESLKKDEKVALVGFGSFSVSERSARSGRNPQTGKTIQIPAKKVVRFKAGAELEGQL >CP032819.1|AZS28682.1|791154_792552_+|hypothetical-protein MKKSFTFLKVSYIAMLLSVMSISEGFAQGLRTSYFMDNVPSRLKMNPALQPARGYFNIPAIGAIGLSASSNKLSIDDFIDIIQNKNGFLNSDQFINRLDNKNTMNIDLTLDLISFGIYSGKGFWTVNLGARSIISASIPKSMFELARYANTNPELDENVNLNYDIKDMELHANIFTEIGLGYSRPVTKDLTVGGRVKLLVGLGNLDAKIDQIYAHVNTDDLASYQSWKVKTKGRLEASMKGLEFSYNSDSGYIDDIDFDSPGISGYGLGLDLGASYNLFGCINLSAAIIDLGFINWTKSSTTIATTNSEHDYGDFANSGSDEYDPDNTEAGDVFDFDLIQFQTEENAKSRSTSLRSTLNLGAEYTLLNNKLGFGLLSSTQFTQPEAYSELTISANYRPKNWFGVSLSYSFLHSDFETLGIGLKLGSIFIGSDYLMTKSLGDANRANVYLGVSVPLGKKRQDYRRK >CP032819.1|AZS28683.1|792674_795860_-|Dna2/Cas4-domain-containing-protein MLRVYRASAGAGKTFALTMEYFKIVFNASSEYKNVLAVTFTNKATEEMKSRIVRELNKLAEGQKSDYREELKRALKLTDEQVKERAGVLRTLILHDYGRLSVTTIDRFFQRVIKAFTRELGIFPGYNVELDSDYVLQRAVDQVMQRMNKDKELRTWITELMSDSVEDAKSWSVKNKIAELGKELFGESYMLFDKEVREKFNDRGFLKGYKMFLQGVVSRFEDRMEAFGQSACRYIEESGLHAEDFKGGKRSFINYFYKLRDKGLDKIPDSARKAIDNPDEWVTKKQDSSLRALIENVYPELNTSLRESVEYYDKNLPDYVSAVLLSKNLYQLGILNDLYGAVRDYCDEKGLMLLSDTTRLLNMLIAGNDTSFLFEKTGNFYKHIMIDEFQDTSTMQWGNFLPLVVNTLSEGGRALIVGDVKQSIYRWRNGDWRLLAEGVERDLATFGTDNVILEHNWRSCKEVVEFNNRFFVLAARQLKELYDSECGEENVYSRSITEAYHHPEQLVSRSGNGYVEIRFGGEKQEEGSDVEIMNAVVDVINDTIHRGGELKNCVILVRNGKEGAFVADYLIEYNKRENIPFPITFISNDSLYVSSSPYVELIINVLRYMVEPYDAVNRTVLLYNYRTFVRGENTSQEDLLFKAGSTDESFLDTIDLPFLREPEKLTCSSLFEITESIIEVFGLRARTEELPYLIAFQDILFDYEKNNTNSIPVFLEWWEKEKDKKVLSTSEEVDAVRILTIHKSKGLEFKTVIIPFCAWDLDDTRHGRRIWCQNREKGFRELEYAPLNYSSKLLDSHFREDYLHEHVKAYVDNLNLLYVAFTRAERELYVLPYAPKITKEGKPSDIGAFLFQVLETGEIPEWNSENSSLVIGEKQLITGKEKVEDDNTLSLAEYPIYELNERVSVRYKYEDYTDPEKTETSAVDEGKILHEIFRRIETVEDVDRAVNNMYASGLLTSVERADYREQVVAYLNNTPHSEWFDGKYKVINERDILFCHSGKARPDRVMIDGKKAIVVDYKFGQKEEKSYIRQVGFYCKTLRQMGYTDVSGYIWYVPLGKVVSV >CP032819.1|AZS28684.1|795964_797122_-|iron-containing-alcohol-dehydrogenase MRNFDYYNPVKILFGKGKIAEIAKWIPKGMKVMMTYGGGSIKKNGVYEQVMDALQGFDIVEFGGIEPNPKYETLMRAVELAKQENVGFLLAVGGGSVIDGTKFIAAATCFEGSEPWDIVAKGARVNAALPLGTVLTLPATGSEMNNGGVITRQATLEKLAFSSRHVFPKFSVLDPMVTYSLPMKQVANGVVDTFVHVMEQYYTFPEQAEIQDRFSESILKTLIELGPKLMHAEKPNYEDRANFMWAATMGLNGLIACGVSQDWSTHHIGHELTAFNGLDHGITLAIVLPGVMNATKIKRREKLLQYASRVWNIDTDQPDAADQAIARTEQFFNELGVKTRLSDYGVGIDTIDRIEKRFRERGNFALGGIDDVNVDNVRAILTSRL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP032819_3 | 1581503-1581584 | Orphan |
NA
Consensus repeat of CP032819_3
|
1 spacers
spacers of CP032819_3
>3.1|1581530|28|CP032819|CRISPRCasFinder TCGTTTCTCCCGGGGAGGAGGCGCAGCG |
CRISPR arrays and Neighbor proteins around CP032819_3
The CRISPR arrays of CP032819_3 >merge|CP032819|3|1581503-1581584|CRISPRCasFinder CTGTTACATTGCGGCAATTTCTCGCGTTCGTTTCTCCCGGGGAGGAGGCGCAGCGCTGTTACATTGCGGCAGTTTCTCGCGT >CP032819|3|3|1581503-1581584|CRISPRCasFinder CTGTTACATTGCGGCAATTTCTCGCGT TCGTTTCTCCCGGGGAGGAGGCGCAGCG CTGTTACATTGCGGCAGTTTCTCGCGT
>CP032819.1|AZS29372.1|1579955_1581377_+|NADH-quinone-oxidoreductase-subunit-N MVYTNFLHMQQEISLVAVIVLLLIYDIFGTPKSLKYIQPVACLLVGIHILFNICPTPDVTAFGGMYHNTAMGSVMKTILGIGTLLVFMQANNWLSGERAIIRRGEFYMITLATLLGMYFMISAGNFLMFFIGLETASIPMATLAAFDKYKQQSAEAGAKYILTAVFASGLSLYGISLIYGTASTLYFEDIPAGLNGSPLQLMAFVFFFVGLGFKLSLVPFHQWTPDVYEGAPTSVTAYLSVISKGAAAFVLMTILYKVFAPLVNEWQTILYWVIVASITLANLFAIRQQNLKRFLAYSSISQAGYIMLGVISGTALGMTGLVYYVLVYMLSNLAAFGVIAVVEHRSGKITIDDYNGLYATNPRLSFVMMLALFSLAGIPPFAGFFSKFFIFAAAAQQGYYVLVFIALLNTIISLYYYLRVVKAMFINKSEQPIAPFASDNYSRVSLVICSAGILLVGLLSIVYESIGTYSFGM >CP032819.1|AZS29371.1|1579658_1579934_+|hypothetical-protein MKQLPPPTPPYTGGEPNARNRRNATAPHLHPFPKGRAMDTQIHRDPILPLPKGRAMDFQIHRNAILPLCKGELEGVAPAPAPCGRNLKFKI >CP032819.1|AZS29370.1|1578118_1579615_+|NADH-quinone-oxidoreductase-subunit-M MNFLSLFVLIPILTMCGLFLSRNLQQIRVAVATGASLLLALAVGLVFAYLGERAAGNTAEMLFTASTIWYAPLNIHYSVGVDGISVAMLLLSAIIVFTGTFASWKMDFLQKEYFLWFNLLTIGVFGFFISVDLFTMFMFYEVALIPMYLLIGVWGSGKKEYAAMKLTLMLMGGSALLLVGILGIYFHSSATGALTMNLQEIARAHAMPENLQHWFFPLVFIGFGVLGALFPFHTWSPDGHASAPTAVSMLHAGVLMKLGGYGCFRVAIYLMPEAAKELSWIFIILTGISVVYGAYSAIVQTDLKYINAYSSVSHCGLVLFAILMLNQTAMTGAVMQMLSHGLMTALFFALIGMIYGRTHTRDIREMGGLMKIMPYLGVCYVIAGLASLGLPGLSGFVAEMTVFVGAFQETDTFHRVFTIIACTSIVITAVYILRVVGKLLYGSVVNEHHYALSDANWYERFSTTILIIAIAGIGLAPLWLSDMIRTSLDSVLAALGRM >CP032819.1|AZS29369.1|1576144_1578049_+|NADH-quinone-oxidoreductase-subunit-L MEYTTLILTLPLLTFLALGLLGTKLKPATAGCIGTLSLAICALLAYMTAWQYFSMPRVEGVRLPVTPFNFEWLRFTRHLHIDLGILLDPISVMMLVVITTVSLMVHVYSLGYMHGEKGFQRYYAFLSLFSFSMLGLVVATNIFQMYIFWELVGVSSYLLIGFYYTKPEAVAASKKAFIVTRFADLGFLIGILLLSFYTKTFSFQLLTSGDTSLFAGAAGTTFMGCSVMSWAMALIFMGGAGKSAMFPLHIWLPDAMEGPTPVSALIHAATMVVAGVYLVARLFPVYYFEAPEVLVLIAVVGAVTSLYAAVVACVQTDIKRVLAFSTISQIGFMMVALGVSGMEGHEGLGYMASMFHLFTHAMFKALLFLGAGAIIHAVHSNEMSHMGGLRKYLPVTHATFLVACLAISGIPPFSGFFSKDEILTAAFMFSPVLGVVMSFIAALTAFYMFRLYYNIFWGKESTHEHTPHEAPKSMTLPLVFLAAVTLIAGFIPFGKFVSSNGLAYTIHLDWTIATASIIIAAASIALATWFYKGSNPVPDRLATAFSGLHRAAYRRFYMDELYLFITKRVIFNHVSRPIAWFDRHVIDGSLNGLANATRRFSYSIRGLQSGQVQQHAYVILLGSLLILVAILLIM >CP032819.1|AZS29368.1|1575902_1576136_+|hypothetical-protein MKKQLPPPTPPYTGGEPNARKHRDATAPRLLPLPKGRAMATQIHRDAILPLCKGELEGVVAAPAPCGRNLKSRNHKS >CP032819.1|AZS31924.1|1575570_1575900_+|NADH-quinone-oxidoreductase-subunit-NuoK MEYYLVISTIMLFAGIYGFITRRNLLAVLISIELILNSVDINFAVFNRYLFPEQLEGFFFTLFAIGVSACETAVAIAIIINIYRNIRNIQVKNLNELREEEDKQLKIEN >CP032819.1|AZS29367.1|1574818_1575331_+|NADH-quinone-oxidoreductase-subunit-J MSTAELQETVFFILATVIVVFSILTVTTNRILRSATYLLFVLFATAGIYFQLEYSFLGAVQLTIYAGGIIILYVFSILLTTPDKSKVSRLKNSKMFAGLITALAGTAICVYITLMHSFGPSRFVGGEINQKVIGTALMGTEKYQYLLPFEVISVLLLTCIIGGIMIARKR >CP032819.1|AZS29366.1|1574133_1574586_+|4Fe-4S-dicluster-domain-containing-protein MKSIIKYITTFFKGLLSLLTGMKVTLREFFTKKTTEQYPENRATLKMFDRFRGELVMPHDENNHHKCIACGICEMNCPNGTIKVTSELVTDEAGKKKKVLVKYRYDLGRCMFCQLCVKMCPQQAIEFRPTFEHAVYTRSKLVKYLHEEGM >CP032819.1|AZS29365.1|1573046_1574123_+|NADH-quinone-oxidoreductase-subunit-NuoH MFDFSTITQWVDELLRSFLPHAAATLVEFILVGLCLLVGYAVIALVLIYVERKVCAFFQCRIGPNRVGPYGIIQSVADMIKMLTKEIIDINHVDRFLFNLAPFVVIIASVLAFGCIPFAKGLHVIDFNVGIFFLIAVSSIGIVGILLAGWASNNKYSLIGAMRSGAQMISYELSIGLSILTVVVFTGSMQLSTIVDSQVNGWLLFKGHLPACIAFIVYIIAGTAETNRGPFDLPEADSELTAGYHTEYSGMHFGFFYLAEYLNMFVVASVASTLFLGGWMPLHVPGWENFNQIMDYIPSIFWFFGKTAVVIFIIMWFKWTFPRLRIDQLLRLEWRYLLPINLINLFLMVIIVVFKLHF >CP032819.1|AZS29364.1|1571149_1572775_+|NADH-quinone-oxidoreductase-subunit-D MLANKIKNIIPDAEIDESGILTVTVPPAKYRELALRLRHDHDLSFDFMICMTGMDWGDSLGVINLLQSTAHEHKLFLKTFTTDRENPELPTVSDIWATANLNEREVYDFFGIRFINHPDMRRLFLRNDWVGYPLRKDYNADPEINPVRLESEETLDAVPTFESDTQDGEVSEKENILFEEDEYVVNIGPQHPATHGVLRFRVSLEGEIVKKVDVNCGYIHRGIEKLCESLTYPQTLALTDRLDYLAAHQNRHALCMCIEEAMGLEIPERVKYIRTIMDELNRLSSHLLFWSCLCMDMGALTAFFYGFRDREKVLDIMEETTGGRLIQNYNVIGGVMADIHPNLIHRIKEFIKYLPPMLKEYHDVFTGNVIAQQRLKGVGILKKEDAISYGATGPTGRACGWSCDVRKHSPYGVYDKVDFKEILYTEGDSFARYMVRMDEILESLHILEQLVDNIPAGDYAVKTKPLIKLPEGHYFKSVEASRGEFGVYLESRGDKYPYRVKFRSPCLPLVSIVDLITKGGKIADLIAIGGTLDYVVPDIDR >CP032819.1|AZS29373.1|1581668_1582238_-|hypothetical-protein MKNSRFGCFVCLVGLLGGVSCEARKERVKVQGEMNQEGGRDKVYEAVFVLDRFKKMISVTCVSEEVIYPRYYGGTYINDAGKPVILTTDTTKRELIRKLAGCDEVVVLPCAISYRYLKDVDDKIKSCLGDRPGLIEKVGINTFGILTDKNCVDVSLVCCTPEKVELFKSLVVDDPCITFSENVSRVVLQ >CP032819.1|AZS29374.1|1582305_1583040_-|hypothetical-protein MSKKKKKKKKKMTSPFRQPSTTRSGGITTYYRKGKECQCASKRPAGSTEARDRRHANRSARQQLLTNVFKFVDRLVKTLLRKLAEANSWVPFAKDTSMSAPNLCHKVNASACGEHGVENFRAFMFSVGWLAVPLYTRVCRDGWTFTLEWRRHGILDEARDGDTLIMGYFYDCLPDAPRVLYLPAVTRGDGTATFTLPDPAGPGGEPLSPETVVHIYPYFKCPDVEEYTRSIYLRVAPGAEEVLG >CP032819.1|AZS29375.1|1583436_1584228_+|hypothetical-protein MARIKSTNFYQKSGKVANEVFSNLGNGEVGCRAKNSKQAPLNEEQKINASGFSALNTEVRYLNPVLRVTFPRDSNGTKWANRFTSVNKKREGVLTTTKINPDTPVNPKKKANEEYTSEIDWTRVLFAAGPLLAPSVNANRTNMSRKQILAEKTSFQSRVTPLSDADTPAVTAEEGEMYNITLTQVANVYEGAYCWTNDRVYTVIYSPDDHFALVKQLRDRGDGGITSIAIPSFVKPENIHVYSLAMTADGKITSNSVHSRLGE >CP032819.1|AZS29376.1|1584390_1585047_-|hypothetical-protein MKLILGILILLFSSLNDGTFSVKEKLQRTPYAELPFECIRRAPWGKPGYAITKEEYEHLPVFTYANLGLTERNDYDEEVTIFRKFASDFDNYCIVALEIGVSDPGKQLLVTYDNNGKLIDFLEAGIYDGGTENRLCLKQWRIDTNRKIMVTWIKALTGKEWNITKDFSSIQGQRIDCHYQIDKNGKFYKVKEIIYQAKNYPMSYLRSDKNLWEGDESF >CP032819.1|AZS29377.1|1585289_1585838_-|hypothetical-protein MKLLLLLFLIPVLKVSELNQPLYSSISNDTIMGKQASYCYMKDTRITTIIRNVNNVDTSEHVYFDNGEVVSWARFVARPIKFTQEELHSVFRKNLTDSEWDCIKGKVGFFLQIWVVADKKGNPVELEFTVRNTDPVFLKMTPDRLFQIEQELKQLLKTEIAEDEHDIKNVKHIVMVSYQDLK >CP032819.1|AZS31925.1|1586104_1587103_-|IS110-family-transposase MENFYFIGVDVSKKKLDFCVLLNGTIVLEEVVNNNIPSIRRFLETFLVDFKTSIDNLLVCAEHTGQYTFPLSIACQSLSCGLWLENPAQIKYSSGFSRGKNDKVDAHRIARYASRFSDNARYYQRPREEIERLKQLRNEQDLYKSDLAKYKGQLSDQREFMLDRVYQEKAQRLERLIEELEKLIKDIDLLVETIIRSCRVLWRQRELLLSVEGVGPSVALCLIIETEAFTRFDDPRKFACHAGVAPFRYTSGSSQHSRNKVSQRANKQIKTLLHMAALSITGRKSGELKAYYERKVKEGKNKMTVINAIRAKLIARMFAVIKKDQFYTPVYS >CP032819.1|AZS29378.1|1587845_1590071_-|patatin MKHLPLPSPATSRIPTLRRLLCLGLLLLLVPLAAEAQRKKVGVVLSGGGAKGVAHIGVLQVLEKAGIPVDVIVGTSMGSIVGGLYAIGYSADQLDSLVRVQDWMFLLSDKVNRYDLPFGEKEQDGKYLLSLPFDRGKKRPSGFISGQNVYNLFSDLTVGYHDSLDFLQLPIPFACVAADLFQREEIVMTSGNLVQAMRASMAIPGVFTPVRTGNRVLVDGGILNNFPTDVARRLGAEIVIGVDVQADLMKEDKLASVSGVIPQIINLLCMNKHEENVQLADLVIRPDVKGYSAASFSARSIDSLLARGKVAALQQWPAIMQVKELIGKPERSGREMPQAPRPDKIPIRNIIIRGLSPREEEWVRRKMRIEENSEITLHDIHREIATLYGTKAFAAVNYRLLGTVPRDLELTLQSNPMSTLNVGFRFDTEDMAALLFNTTLNNRGLRGSQLALTGRLSKNPYVKLEYSLENTFLRGFNLAYLFRYNDVNYYRKGRKTNNVTYRYHMGELGLSSLYLRNFKFKVGLRYEYFDYNSLLFSNEDEVMSVRPEGFFSYYGVAELDTYDRRFYPTRGISLQAAYSLYTDNLATYDGGSPFSALSIRFRPVLSLSDRLKMLPSAYGRVLIGHDPAYSYLNQLGGTEFGRYNAQQMPFAGINRVEVFENAVIVGRLQLRQRMGHKHYLSLTGNYALQDDNFFDLFGRKGIWGGGIGYSRDSMLGPIDLIFSFSDWSEKVEFYFNLGFYF >CP032819.1|AZS29379.1|1590319_1590979_-|hypothetical-protein MKWSIIRVGIVALFMGMTTIVSGMTEDDRAQRRAERHQRKLQQYRSGWNRLIPKYQNIQYAGSMGFLSLGVGWDYGREKQWETEIMFGLVPRFSSNKAKVTFTLKQNYAPWELPLGGKWWLEPLVTGLYMNTILSDDFWVKEPEKYPNNYYKFSTRVRFHVFAGQRIAIDLGPHSPFRRVTAFYELSTCDLYIISSATNKYVKTWDILSLSFGVKLHIL >CP032819.1|AZS29380.1|1590959_1591754_-|metallophosphoesterase MKKVILYGILLLLYRCDLIEYHPYDVRLHGQTGINAKNIARIEESCKGKDTLRFVLMGDSQRWYDETEAFVKALNNRDDVDFVIHGGDISDFGLTKEFLWVRDIMEKLKVPYVALLGNHDILGNGMDVFLKVYGDENFSFRAGNTKFICLNTNALEFDYSHPVPDFTFMYNELQDTVGCPRTIPVMHVQPFNVEFNNNVARGFHLLLQEFPGMEFCLHAHSHSLLHEELFNDGIPYIGCAAMKDKNYLLFTLTPGGYAYEVVYY >CP032819.1|AZS29381.1|1591895_1592372_-|Cys-tRNA(Pro)-deacylase MKAKINKTNAARLLDKAKIPYELIPYEVDENDLAATHVAAQLGENIEQVFKTLVLHGDRNGYFVCVIPGDHELDLKVAAKESGNKKADLIPMKELLPVTGYIRGGCSPIGMKKEFPTFIHASCLKFEYIYISAGIRGLQIRLNPKDLISYTRATLLPA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP032819_4 | 4554941-4555043 | Orphan |
NA
Consensus repeat of CP032819_4
|
1 spacers
spacers of CP032819_4
>4.1|4554966|53|CP032819|CRISPRCasFinder GGTTGTGCGTTTGGTTGTGCATTTTTGTAATAAAAAAAACGAAATGTTTTTAA |
CRISPR arrays and Neighbor proteins around CP032819_4
The CRISPR arrays of CP032819_4 >merge|CP032819|4|4554941-4555043|CRISPRCasFinder TTTTTTAGGGGTTGGTTGTGCATTTGGTTGTGCGTTTGGTTGTGCATTTTTGTAATAAAAAAAACGAAATGTTTTTAATGGTTGTGCATTTGGTTGTGCATTT >CP032819|4|4|4554941-4555043|CRISPRCasFinder TTTTTTAGGGGTTGGTTGTGCATTT GGTTGTGCGTTTGGTTGTGCATTTTTGTAATAAAAAAAACGAAATGTTTTTAA TGGTTGTGCATTTGGTTGTGCATTT
>CP032819.1|AZS31526.1|4551364_4552231_+|hypothetical-protein MKNKTWKFTLICSLIVMGFTACDDDDEYYYVGGGSSWLSYGNLEKIDNGSRSKFAIRRDDGNRLIVTEGMPIRFDGAKEDLRVYAHYSIVGSERDESGLEGNMNYYVRLYGLDDVLTKVPVKQSFIHENEGVRQDSIGNDPINVQEAWFGGRYLNVEFRIPVKDGSKEKHFINLVQDDVVAHHDTVYVTLRHNAYGEKPGTGNDRGNFSWGRGRVSFDLTSIVPEGQTSVPVKLIWTEYGKNASETVRREDSGTYTLKNARKTGKDRGLNQDKSKMVSTEGVGECVVE >CP032819.1|AZS31525.1|4550169_4551150_-|N-acetylmuramoyl-L-alanine-amidase MPMKHALTIFSFLLLTIFMFPATAQETAKANKGEGVLQFLKRFNRTKPFHMERFFELNRNKLDKNNGLQLDVTYTLPPLNNEGYEPLFGEKLAKYTIDSDELNGACFYLVSGHGGPDPGAIGELRGHPLHEDEYAYDITLRLARNLMSKGAKVHIIIQDAQDGIRDEKFLDVSDRETCMGQAIPLNQVKRLQQRCDKINELYKKDKEHYRRALFIHLDSRSESKQIDVFFYHYDGSVKGKHLANTLQNVFNRKYDKHQPLRGFSGTVSPRDLYVIKQTLPVAVFIELGNIQNSRDQQRFLLDDNRQALANWMCEGLIEDYKNYSKK >CP032819.1|AZS31524.1|4549603_4549888_+|hypothetical-protein MKYGTGIRDKYEGGYIEPCSLGLSSATIVRLNDWLSEYWKEHDNGYIDSAIIDMLDQKGLEIAMRIKKELPEMNVEYFSDARMVPVLKENIGRI >CP032819.1|AZS31523.1|4549241_4549607_+|hypothetical-protein MELFMMREELSLQEKLEYRNILNGIGEILLRLNFLGQYQVISDLLNLLDKNEDMIFIKELNGVNMWGGAGAVWEVGIQEKKDEITFINKLIELIDFMETTNVLGRGIKSIKRILSSIKQVT >CP032819.1|AZS31522.1|4548550_4548850_+|hypothetical-protein MNGYYRINKQRCLQKLESWSKFMIQGQRWKLYDDIHVDEVSKVFQARDRWFNGGLFLLDCLSKKLDQAYDCFLAIPLLETGCKTDLKDLNIDYIKKKFA >CP032819.1|AZS31521.1|4548085_4548544_+|hypothetical-protein MILESEGRFVEIIILHRNNLESIDIEDANWLDAEIKINVPGFKGYYNANLRTDDFERFYKDLNKLKTDRFFQIEFTTMEEGIYLKGTQGLLGTIKWEGIARSYWGNSVLTFEIETGFASIDALIEQTLEILNEYPVQESEKLAELKKNEIEF >CP032819.1|AZS31520.1|4547497_4547947_+|DUF4304-domain-containing-protein MESKEFKKVFEKVAKANNFEKAFGGWFKESTECIVVLCLQKSNFGDYYELNIKIFIQGMSGNKYTRSKDLVKKNVGDVFTRQPSDYSNVLDFDISMDDGKRIEKLESLFREFIVPFTDMALSRLGLRELAKEGKIFLLPAVKEGLTILS >CP032819.1|AZS31519.1|4546612_4547101_+|hypothetical-protein MKKYYFFICIALLFCSACKNKKNSENDNESKDIFIPILKNKLDSFFVYADTTYGRHGDHGLLYTISFHEKNGKQIVSLGVDFFYHLRRIKGYTFVNDRLVVYNGNYSDQKQYLLDTSKLTIFADTILGYRSDIVLDMDYEVIKQDYLICDKDSLSLIFSGFY >CP032819.1|AZS31518.1|4546181_4546448_+|hypothetical-protein MAEDKKHNCHVCGLYSEDLPWGKDGQSPTYIICDCCGVEFGNEDYTVESTKKYREEWIKKGAPWFISKAKPLNWSLERQLMNIPNKFK >CP032819.1|AZS31517.1|4545614_4546043_+|hypothetical-protein MKVRCVSNRGMDLRPYEYEQIGEDILGRFGASAYTRYGIEIGREYLVMGIIVYQTYQAYLIDDDGLILSCPCQLFEVLDERLIFNWEFRTIGKDEDIYPFVQAILGYTELCTDKKSYENLIIEKDEYARQIYFKRKIEFENE >CP032819.1|AZS31527.1|4555268_4555652_-|XRE-family-transcriptional-regulator MTINERMNHIIKELYGGNKRAFANAIGVSATVIENVVGTRQGKPSYDVLEKVCANANISAEWLLMERGEMLYNSTSQTQYTPPDQNTHYFIEKIAEQAEEIGKLKEQVKLLGTQQSAAPDAICADVV >CP032819.1|AZS31528.1|4555812_4556022_+|hypothetical-protein MQAIKVQIYFSEWEKVSDFISEINIDEEMAAYAIDNRTMVIATVGECSMAYAKAQLKTWFSDPTIETIK >CP032819.1|AZS31529.1|4556029_4556215_+|hypothetical-protein MKKRIVVEYGKISQISKDFKVTRQAVYKALNYLSNSSKATLIRKVAIERGGVEIGDQKETA >CP032819.1|AZS31530.1|4556505_4558584_+|kinase MEYFNKIVCVTVQELTSSENGEPVISLWTLYSLIRRGKAQRVNRGGGLDNYALIDYLSLPERYRIRFEQKYGDPVELIKEKCMKNRLKIDEAARIFFEDYRYDKAGELVSLTEPKKAEYTINASVLNELISILNDREGYRKALGGSTKKVWETIIGTADRLRDSYGHTLPENAARLKDKINQYKKEGYSCLISKKMGNGNTLKITEEAGNMIIALKRSSVPVYTDAQIFVEFNRIADEKGWKQLRSIQSLRQFLNRPDIEPLWYDAVHGELKAHQRYSRKNKTELPSMRDSLWYGDGTKINLYYKDYDKDGKLVVRTTQVYEVIDAYSEVFLGYHISDSEDYEAQYNAYRMAIQVSGHKPYELVHDNQGGHKKLQNSHFFDKIVGHVHRTTAPYSGQSKTIESVFGRFQAEVLHKDWRFTGQNITTKKDTSRPNLERIEANKDKLYTLAELKAAYAAARKEWNESKHFATGVNRIEMYQNSVNPDTPTVGVLDMIEMFWVMTDKPSTYTDNGLKITIKKREFTYEVYEVPGVPDHEFLRKNRGQKFYTMYDPYDHTSVRLYKKDKAGELRFVRTAEPYIVIHRNIQEQTEGEMSFIRRNIEANTEDRIERQVDARIIEQAHGVSMEQQGLKRPKLKGAKSETEREIERRVRRYSQDPEQLSAGKVTKLISNITFDQLNGDIRLNEKKVAGKL >CP032819.1|AZS32132.1|4558607_4559477_+|ATP-binding-protein MTQQEKDSIREALRVYAAKYSSQKKAAASLNGVSAGTLSAVVNGKYESISDDMFRNIISQITPAAAATGWQLVETNSFQEIWYALSDAQEFKKVRWIVGGAGCGKTTTATMYAQKNHEVFVILCDEDMRKGDFVREIARKLGFKTCGMRIREILDLAIESIIQMENPLLVFDEGDKLNDNVFHYFINLYNRLEGKCGITFLSTDYIQHRIDCGLNHNRKGYNEIYSRIGRKFFELEPTSHNDVFAICQANGLMDKKLIANVIDVTEKSEFDLRCVKDAIHREKKVAAAK >CP032819.1|AZS31531.1|4559534_4559738_+|hypothetical-protein MSKIEEVFRGLGRTEKAKFISQNIDYANADAIAEYVSAYLFDVLKDVGNDEYVATYLKEKGYKVTKE >CP032819.1|AZS31532.1|4559758_4560202_+|hypothetical-protein MKQIVLPLASRFPVGHFKRGQLTGFPEKVIKGTKIHTFREDPGKWAYNVELINSHNAELSIRRWIGRPYHTPQLEVKRLKKIGIQQVQMTWDSDIEQPTVFIDGKRILNVEQLAANDGMTLDDFVSWFFKTSNTFEGVIIHFTDFRY >CP032819.1|AZS31533.1|4560205_4560826_+|hypothetical-protein MARALSVTEAVSMKKETLKLTGAWADAFGEPERIGVWFIWGNSGNGKSSFVMQLCKELAKFGRVAYDSLEEGASLTMQNTLRRFNMAEVNRRFQLLDCEPMSELGERMDKHKSPDFYVIDSFQYTQMSYKEYIKFKEAHRNKLLIFISHADGRNPDGRSAKKVMYDAALKIYVEGFRAFSKGRFFGSVGHFTIWDEGAVRYWGDNA >CP032819.1|AZS31534.1|4560837_4561065_+|hypothetical-protein MSKNNQVITISPPMFIGEGNQKESISSKGHRCSYCHGNGFFWGEEQRERVKVDCPVCKGSGKLDAVITIEWKPAK >CP032819.1|AZS31535.1|4561073_4561784_+|DUF2786-domain-containing-protein MEKEVPENILAKIRKLLRLKESAIKIGSEGEAHAAAEAVNRLLTSYNLSLMDVTPEEQKNMISVTESEKITYQDTYGNIWKRDLLRIICEYNFCRILLHGGTTYMVVVGTRENAEVVLSLYNYLRSVFRRLSVERCTEYVATRRGYYRTKKFKRNYIKSYLLGCCTGLRKQFESIRKTAEETGLMLCHNHLIDDYFQSIGTTTHKSKNRNKVNTSAYCSGYDDGSKINLNKQINGK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP032819_3 | 3.1|1581530|28|CP032819|CRISPRCasFinder | 1581530-1581557 | 28 | MN234235 | Mycobacterium phage BogosyJay, complete genome | 33380-33407 | 8 | 0.714 |
CP032819_3 | 3.1|1581530|28|CP032819|CRISPRCasFinder | 1581530-1581557 | 28 | MN234200 | Mycobacterium phage Maminiaina, complete genome | 33362-33389 | 8 | 0.714 |
1. spacer 3.1|1581530|28|CP032819|CRISPRCasFinder matches to MN234235 (Mycobacterium phage BogosyJay, complete genome) position: , mismatch: 8, identity: 0.714
tcgtttctcccggggaggaggcgcagcg CRISPR spacer agcaggctcccgcggaggaggcgcagca Protospacer ****** **************.
2. spacer 3.1|1581530|28|CP032819|CRISPRCasFinder matches to MN234200 (Mycobacterium phage Maminiaina, complete genome) position: , mismatch: 8, identity: 0.714
tcgtttctcccggggaggaggcgcagcg CRISPR spacer agcaggctcccgcggaggaggcgcagca Protospacer ****** **************.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
934442 : 944756
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP032819|934442:944756|DBSCAN-SWA CTCATATCTTGAAATATTTACGGGTTTTCTCTTTCTTCTGTGTAGCGATTGCCGCCCCTTCCGTCTGGCGCAGACGGGAGGTATAGACCCCCAGGATGTCAAGCGTGGCCGTCACATCCGCAGCCGCATCGTGGGCATCGTCCAGTTCCACCCCCAATTTTGAGGCTACCAGTTCCAGTTTATACGAAGTAACCTCCGGATCAGCGGCAAAAGCCAACCGTCCCATGACAAGCGTGTCGATGTAGTGCGGCTGGAAGTTGCCGTAATAGTCCTTGGTACCGGAAAAGGTCTTTTCAAACTCGGCGGCCAGTCCGGCATAGTTCATCAGCTGTTGCAGGAATCCGATATCGAAAGCGATGTTCTGTCCGACCAGCACGGGCTTGCACTGATAGCCCTTCGAGAGGGTGCTGCGTTTGGCGAAGGCGATGACTTCTCCGGCGACTTTCTTCATGTCCACACCCTGTGTGCGCAGCATCTCCATGGTAATGGCGGAATAGTCCAATGCCGTCTGTTCGTACTTCATCGGGACATATTCCGGTTCTTTCGCCTGTTCATGGCGGGTACGCAACACCTTGCGTCGGGGAAGCCCCGCGTCCTGTTTCCCGTAGGGGGCGATATACGCCTGATAGCGGTCGAATACCTGCCATGTGTCGAAACGGACGGCTTGCAGGGCGATCTGTGTACAGGCGCATTCACGGCAGTCCAGCCCTCCGGTCTCGAAGTCCAGCCCGATGCCTACATATATCTTCTGTTCTGTCTTGGGTGCCATATCTTTTCAGGATTGAATGAATAACAATGAGTTCCTATAGGTCTGTAATGTATTGCATCCGTTGTAGTCGCTGTAACGGATGACGGCAGTGAGGATGACCACCCGGTCTTTGAGCGACTGGATCACGGTGTGGTGTTCCATGTAATAGTCGTTCCAGCATACACATTCCGCCAGACGGTTGTTCTGGGAGAGTGTCAGCTTGGCGAAACGCTTCCGGCTTCCCGTCTCCCTGTCCTTGTAGGTATGTTCGGTTACATCCACCACCGTGGCGCAGACAGTCGCACGCCTGCCGTCGTTCTCGTCCCGTGCCACCTCGTCCAGGGTCAGGTAAGAGGCTTTTCCCTTGACCTGCCTGCGGGCTTCCGAATTGTTGAAGATGCGGCGATAATCGATGCTGCCTATGCCCGACACGGCAATCTGCTGCTGCGACCAGAAGAAATGCCGCCCGCGCATGTCCTGCGGGAAGTCTTTTTCAGAAAGGGAGAATCCCAGTTCCCGTGCTGCGCGTTCGAGCAGCGCGCAGCGTTCGGTAACCGCCCCGACTTTTTCGATACGGTCGAAGCATCCGGCAAGGATCATGTGCTTGACATGCCGGGCGTTCACGGGGACTTTCACGGCCTCTTCCGCGTTGTCGGGGTCGTCCCAGTAACTGTATTTCTTGAGTTTGTAACGGAATATGCGGTGGATGAAGTTCTCGATACCGGTATATGCCCCGCCCCGGTCGCGTTCCGTGACGATGTATTCCACTGTCTTGACACCGACCTGTTTGATACGGGTAAGCGACCAGAAGATTTCATCGGTGGCATAGTCGGTGAAGAACTCCGTCCCGGAGCGGTTGATGTCCGGCGGCACGATCTTGGCCGACGAGCAGCGTTCCATCTCCGCCATCAACGAAGGAATCTCCTTGTCGTCCGCCCATTGCAATGCCACCGTGTAGAATGCCGAGGGGTAATTGGCCTTGAGCCATGCCCCGCAATAGGCTGTCAGGGCGTATGCCGCGGCATGGGAACGGTTGAACGAATATTTCCCGGCTACCTCTATCTTGTGCCAGATTTCCTCCGCCTCATAGTCCGGGCATCCGTTTCCGACGGCTCCGGCAATGAAATCCGCCTTCAGCGTGGCCATCAGGTCGGCTTTCTTCTTTCCTATGGCCTTGCGCAGGTAGTCGGTCTTCCCGAGGTCGAATCCCCCGAGCGTATGGGCAACGGACATGAACTGCTCCTGGTAGACCATGATTCCGAAAGTGTTCTTCGTCGCTTCATAACAGCCGTAGTTATAGACCGGGGCCACCTCGCCACGCCTGAAACGGACATAATCATCGGTGGCGCCGATGTCGAGTGTGGCGGGACGGTACAGGGCATTGATGGCGATCAGGTCCTCGATACACTCCGGCCGTACATCCTGGATAAACCGGGTAATGCCCGGCGAAGAGAACTGGAACACATTCTGCGTGTTACCGTCCGAGAGCAGCCGGTAGGTCTTCCCGTCTTCCAGCATATCCTGCGTGATACGCCCGATGGTGAGTTCCTGCCTGAAATTCCGGTTGACCAACGCGATGACGGCACTCAGTTTGGCAAGTTCCTTTGTCGCCAGCACATCCTCCTTCAACAATCCGATCTCATCGACCGAATAGCCGTCGAACTCCGATACCAGTGCCCCGTCCATCTTGCGGACCGGCAGGAAATCGAAGCATTCGGCGGGCCTGCCGTCCCGCGTGTCCGGTGTGACAACGATGGCCGAGGCGTGTATGGAGGCCGCTTTGGGCTGTCCGAGCAATCCTTGCACGTCCTCGATGACCAGCGGATAGGTCTGGATAAAGTCCCGAAGTTTTCTGTTGGAGGCAGCCTGTCTGAACAGTCCAGTCCAGTCCGTGCCGTCATCTATCATGGCGGTAATATAGTTCACGATGGAGTGAGGCACGCGGTGTACGCGGGCCACGTCCTTCAAGGCGGCTTTCAGTTTCATCGTGGTAAAGGTGCCGGCGGAAAAGACACGCTGGCGGCCGTCCGCGTTGTACCGTTCTTCCAGATAATCCTTGATCTCCTGCCGCCGGTCGGAGGCATAGTCGACATCTATATCCGCCTTACGGGAGGGCGGAATGTCCGCCCTCCACAAGTCCTTTCCCCACGAACGAATCCATTACGGTGAATGCTTCATTCGCTTTTTTAATTGTTACGTTTGTCACTTTCATTTTTATACTTTTTATAGTGGACGATATCGCATATCGTCTGTTTGCAAACCTTGAATTGTTCAGCCAGTTCATTTTGCATGACTCCCGACCGGTGTAACCGCCTGATATGATCCGCTTGTTCATTTGTCAGTTTGGCGTTCACACTGTTCTCTCCATAATCATCCCTGAGTTTATGCACCATAGAATGCATGACGTTTTCTTGCCTGGTACACATTTCCAGGTTGTCACTTCGATTATTGTGCTTATTGCCATCCTTATGATTAACCTCCTTTGCCGCATTCCAATCATCAAGAAAATGTTCTGCGACAAGTCTATGGATTAATTTTTTACATATTTTTCCGTCATAGGCCAAAGAAACATTGTAATACGGGGATGTACCCCCGACCCACGGTTTGATGATTTGTCCGTTCCTCCAATATTGGGATGTGGCATTACGGACATATCTTCCGACACTTCTAATTCTACCGCAGGAGCTTACTTGATAAATTCCCTCGTAATGATGAATATCTTTCCAAATTTCTTCCTCGTCCATTATCGTATATATTTCTTGTATCGTATTATCGCGCTTACCGTCTGGCGGCTGACGCCGTACTGTTTTGCCAGGCTGTTCTGGGAAGCCTGGCCGGAGGAATACCTCACCCGTATCTCTTCCGCCTGCCCGTTGGTCAGTTTGGCGTTCACGCTTTTCTCGCCGTAATCCCGTTTGAGGCCACCCGCTATGGCATGTTCCATATTCCGCTGGTGCGTACACATCTCCAGGTTATCCGCACGGTTGTTGTCGCGGTTCCCGTCGATATGGTTCACCTCCAGTCCCGGATTCCAGTCGGGCAGGAAGTGCTGTGCCACCAGACAGTGCACGGAGAACTTCTTCCCGACACCGCCCTTGTAGAGCCGGACACAGTCATAGAGCGAAGTCGTCCCGCACCAATGGCACATAATGCGTTCGGGTTGTATCCAGGTAATGCCGTTATGCGAAACCTGCCGTTCCAGACTCTTGACACGCCCGAGGTTGCTGATCTTGTACTTTCCTTCATAGTTCTTGATATCCACCCAGATTTCCCGGGTGTCAGCCATGGGTTATACGGTTTAGTGCTGTCTGAAAAATTCCCCGGTCTATCTCCACGCCGATGAACCGGCGTCCCGTATTCCGGCAGGCCACGGCTGTACTGCCGCTGCCCATCGCAAAGTCGAGGACGATGTCTCCTTCCTTGGTGTAGGTGCGTATCAGGTATTCCAGTAGGGCGACAGGTTTCTGTGTCGCATGCAGGCAGGACTGTTGCTTGTCGGTCTTGAATTTCAGCACGCTGCGCGGATAGCGTTCCGTGGATATGTAGTCCCGGAAATTGTCGTGCTTCCGGTATATCTCGCCGGCATCGCATTTCCGCTGGTGTTCGGCCATCACGACCTTGCGCTTGTGCCCGTCCGTTTTGATCGGATGATACTTCGGCAACTTGTCGTAAAAGACCAGTATGTCCTCGTGCGCCTTCATCGGCATACGCCTGGCGTTTAGGAAACCTGTCGCCTGCGTCTTCTCCCATACCCAGGCATACCGCAGCCTGCGAAGATTGGAGCTGCCCAGCACGCTCGTAAAAGGATGCTGGCAGAACAGCAGGATGGGTGTGTCAGGTCTGGATATGCCCTGTACGGCATTCCACATCTTGGGGATGTCTATCACGGCGTCCCAGCGGCAATGTGTCGTACCGTATGGAGGGTCTGACAATACCATGTCGGCAGTGATGCCTTCTTCAGCCAGCAGGGGCAGCACATCGAGTGCATCGCCCCGGTAAAGGTCGCAACCGTCATAGGGACGGTGATGTTCGTAGATCGGATTCATGGGTTTCGAGTTCCTTTAAGTTCCACAAGCAATCTCGGCGGTCAAAAAGGATTTCGTCGCCACACATCAATTCATCTGCGTATATCGTCCGCTCTTCTCCAGCCCGGATTACCCGCAGGCGGGCATCCGGGCAAAGACGGTAGCTCTTTCCTTCGGACTCGATTTCCACGTAACGCTCGCCTTTGCCGAGTGTGATGTCCGGGGCAAGCACCGTCAGTTCGTCTTTCCAGCTGAGCCCGCAGCGTTCCGGCACGAGGAAGCGCGAGAATATAAGGTCATATTTCAGCGGGTCGATGGAGGTTATGCCAAGCAGGTAGGAAACCAGCGATCCCCCGGCGGAACCGCGACCTATACCGGTTGCGATGCCTCTCCGGTGTGCCTCCCGCACCATATCCCACTGCACCAGGAAATAGTCCACATTGTCGGTCGATTCAATGATATAGACTTCCTCGTCCAGCCGTTCCCGATAGCGTTCCCGTTCCGGCTCCGGCACTTCCCGGTCCAGCCAGTCGTCAAGCAGTCTGAGGAACATCGTACGGCGGTCGCCATACCGTTCCCGCTCTTGTGGCCGCATACGGTATTCCGGCATGAACATGCGCCCGGTCTCGAATGAGGCGTCCGCACGTCCGGCAATCTCCACCGTGGGACGGCACATGCGCCTGAACAGGGAGTCGAAGTCCCATTGCCCGGAGAACAGAGGACGGAGCGTGTCATACAATTCATCCGCTGTCTTGAAATACTGGTCATCGCTCTGTTCATGGGCTGCTCCCGTGGCAATCTTGTTCACCACGATTCTGTACCCCGCATCGTCCTTGTCCATGTAGTAGCAGTCCGGAATCAGGACCGGTTCTACTGTAAATGAATCCGTGTCGGCATCATAGCAGTTGCCGAAATAATATTTCAGGGCTTCCAGTTGCTCCCTGTCGATGCGGTCCGCCTTGTATTCGTTGGCGTCGACCTGGTAATAGACCGCTTCGGCTCCTTTTCGGATACGCTCCACCTGTTTGGGATGTCCGGCCATCCAGTAGACTGAACGGGTGGCAAAGACCGGCACACAACCTGCGGCATACATCAGCAGCTGTTCATAGCGGAGGGTGTTGTCTTCCGAATCGACCATGACGGCCCGTTGGATGCGCAGCAGATTATGCAACCCCTCGTTATCCAGGGCATAAATCTTCAGCCCGACACGTTCTTCTTCGTGCATCATTGTCAGCGAGTAGCCGAATATATGTTTCAGCCCGGTATTGGCGCACTCCTTCTGAAGGTTCAGCGTGGCGGCCATTGTATTGCGGTCGCAGATGCCGACAGCCGTATGCCCGAGCCATTTCGCCTTGCGGCACAAATCCTCCGGCGAGCCGCAGGCGTTCAGCAGTTCGTAGGAGGTATGGATGCCAAGGTTCACGAAAGGCACATCGTGTACCGGAGGCTTGGGCCGGCCGATATATTTCAGCAGGTTGAAACGGAATTCTTCCCGCAGGTCATAGTAGTACCAGTTCCTGCCGAAAGGGAAAGCCACATGGAAGATTCCTTCTTCCATCAGCACATCCGGACTTTCCATCAGGTTGAACACGAACCTGTCCCCATCACTGCGGAAGATGGATTCCACTCCCGACAGGTCGGCCGTGAAAAGCCGCCCGAAGCCGGGAATATCCACCACTTCCGTATCGACCGGGATATAGGTTATCCGTTGTGCATCGAGCCATGAGAAAAGTTCCTGTATCATTTTTCCTGTACTTTTTTGAGTTTGTATTCAAGGACGGACAGCAGGCGGCAGGCGAAAATCCCGTAGATTTCCGATTCCGTGAGTTCGTCCCAGTCCTTGGCGGCATCGGCAATGTCCGCGATGAAAACCTCGAAATAGGGTTTCAGCCGTTCCGCCGTCCGCTTTACCGACTCGACGGCATCGCCGTCGTAACCGATGACTACGGTTCTCACGCCTTTCGATTGCAGCTTGTAGATCTGGACATCGGAGATTTTCTTTCCGAAAGTGGCCACGGCTGCGACATGGGAATTGTCATAGAGTTCGAGTCTGCGGGTCAGGGCGATGACATCGAAAATACCTTCCGCGAGGATGACCGTATCGGTTTCATCCATACGGATGGCATCGTAGTTATAAAGGAGTTTGGAAAAGTCGTTTTCCGTGGAGTTCCGGTAACGCAGGATCCTGTACCCGCCGCTATGTCTGGTCTTGCGGTTGTGGGTGTCTATATCCTCTTTCGGCCAGGTATGGCGGGAGACATAGCCGACAACCGTACTGTCATCGATGACCGGAAAGACGACATAGTCCGCGTAGCGCGGATTGAGCCTGCCGGTTATCCCGATCGGGAAATACTCGTAATCGTCGAAACAGAAGCCGCGCCGCTGCAAATACGGATGCCGGAAAGTGCGCTTGTAAAAGTCCGGCAATTCGACCGGCACCAGTTCGTCGTCTATCTCTTCCGCCTCGTCCTCCTCCAGCAGGTGCAGGTTCAACGGTGCGGCAATGTCGGCTGTCTGCGAAACCATCAGGTCCATGCGGCCGATAGCCGCCAGAAGTTGCTCCAATGTACGGGTGGATGCCCCGCAGGAGAAGCAGTGGGCCATGAAAGGCCGGCGGTGGGCAGTCTCCTTGCCGATGTAGATACCGAACTTACCGCCCGACTTTCCGCAGAACGGGCAGCGTGGAACGATAAGGTTCTTACCGCTCCCGTCCCGTTTGGCACCCGTCTCGCGGGTTATCTCCGAAACCAGATATTGATATTCCTGTACAGACAGTTCCATATAAAAGGTATAGCTGCAGTACAAGGACTAAAGTTTAAAAACGCCTCATAAATTTCATTGTTTTGATACGCTTTCTCAAGGTGGAAATGGGTATCGCACACTGCCGCGACAAGAGAGGGAGCCTCTATGCCTTAAAAAGCAGGACGAAAACCTGCCCATGTGTCGTGGCAGGCGCTGGTGCGGATAACAAGGAGGTCAATGGGAACAGGCCTTCTCCCGGACAGAGGTTCATGGCAGGATACCGCGAACTTATCCGGCGAAAAAGGATATAATAGAGGAAATTCCCATGCATTATCGAAAAAATCTGTACAATAAGAACCTGTCGGCAGACTCTGCACGAAAGGGCGGAAAAAAGCCCGGCAAGATTTCACCTCTGCCCCAAGGACATACAGACTGTACCGGAATGCGAAAAGACCTCGGGGAGATTCCATGAGTGGCTCTTTCCATGATTTTTTAGGGAATTTATTTGACCGGTCAGGATAAAATTGCTATATTTCAATTACGAAGCGACAGTGTTTAATAAAATTTATCAACAAGTAGTATATGAAGACAAACGAGGACTGGTCCGAAATAGTGGATACGCTCCGCCCGTACCTTAATGGGAATCCGGCAAAGGAAAAGCTTCTGAAAGATGTTGAGAACTGTCTGCGGTTTCTCGGATGGAAAAAGACCAACGGGACGATGAGGTCACGCTCCACATCAGACGGGGAGTCCGGGAAACCGGCAATAACCTTGTTGAAACGGGAAGGGGACTCCGGGTGGCAGGCGTTTCCCGTCATGACGGAATCCCCGGATGGCATGACATACGCCACGGGGCTGTATATAAGGGAAAACATCCGGCTTTATTACAGACCCGATAGCGGGACAGGCATACCGGTCTGCGTACTGACGGCTGAATTAAGGGAGGATGATACTAACGGGCCGCTTCTCTGCGCCCTGCTGTCCTATAAGGAGTTCGATTTGCAAAGTGTGGAGAATTTCTGTCTGAGGCGCTACAACCTGATACGTACGGGTGGCAGTTTCCGCCAGTGGGTGGAGGATTTTCTTTCCGGAAGTACCGGAACGAAGAATATCACCGGCCTGCTCAGGGAAAAATTCATGTCCGAAGGGTTAGAGGACTCCCTTATCAGGGAGGAACTGGAGAAACTCGGACTTAGGGTACGCTATGAAAAAGGGCATGCCGTAATCACTCTCCGTTACTCGGGATGAAGCGGTTTCCGGTGGCAGTCCCGTACCGCTATTACCTGCCTGTTACAGAGCTCTGTACAGCATGCCTGGCGGGATGACCTGCCGGAAGAACATGGAGCATAGCCTTCTCTATATGGGGGAGATTGGCCTGCCTTTTGCAGTTCATTTTCACAGGAATAAAACTATCCGCTATTGAGTTTTCATTTTGAACGACTTTTTATTCCCTGTTCAGATTCAGCGTCCTCTGCCCGTCATAGAACACCTCGTTATCATAGTCCGTCGCTATTTTAATGGTATCGCCCTTTTTGAAAAAGCGGCTCTTGGCCACATGCAGGCGCATCACGTTCTCCTTGCGCTCGGCCGATGACTGGTTGAGCGAAATAAGATGCGTGCATGGGCGTGCCAGACCTTTGGCCTCCGAACAGTTGTACTCGGTCAGCACGTTCCGCTCGTCATTCAGCCACTCCCGGTCTTCGATGGTAGACTGGTAAGTCACCACCATCCATACCTTTTCGTCCGCCGCCAGGTCCTTCAGGTCATTGGCCACCGCGATACGCTTCGCCCGTTCATGGTCGGCTCCCCATGAACGGCGGTTGGCATCCGTCAGCAAGTCCATCGAATCCACAATCACGATGTCCGGATTATATCCTTTGAGCTTCCTGTATTCCGAAATGCCGTTCTTGATGTCGAGTGTCGAGACCTGGGCATTGAAACGCGGGTAACTGCGCACCGTGATGCTTCCGGCATACGAGGCGACCAGTTTTTCCAGATGTCTCATCTCCGTATCCGAAATCTTTCCCCGCTCGTAATAGTAGGTGTTCCTGGAAATCAGTCCGCCGGAATAGGCGTTCAACGCCTCTTCCTCCGAACCTTCCAGCTGGAAATGCAGCACATGCAGCCCGTCATCGATGTCCGCCCTGACACCGATCCATTTGGCGATGTGCGACTTTCCCACGCCTGTGCTGGCCAGGAAACAGGTCAGCTGACCTCGCAGGTTACGTCCGGCATTGAGCGCATCCAGATACGGGATGTAGAAGCGGGACACACGAGGGGCGGCCGAGCGTTCCTCTTCCTCCTCACGCCGGCGGTTCCTCTCGAAACGCTCCTTGAAAGTCGCGGCCACATCGACGAACGAGGTGCTCTTCAGCATGAACCCTGCCAGCCACTCGGCATATTCCCGAAGAGTCTTCTCCGCCTTGTCCTGTTTGTTCTCGTTATACAGTTTCCCGACCTCCGCATAGACCGACTGTAACCGGACCCCCTTGATGTAGGACTCCAGCATGTCGGTCATCACCTCGGCGCTTTGTCCCTCGTCATACTCCCGGAAAGTGTCTATCAGCTCGATGGCGTCGTAATCCTCATGGAAGGTCTGTGCCAACACGGCATACGACGGCGGTGTCTTGTATGTCCTGAAATGTGCGGCTATCGCCTCCTGTACCCGCTGGAACGAACGGTCCGGCAGGTATTCCTTACGCATATGCCGGGCAAGGACGGCACACAACGGCTCCTGCCTGAGCGCCGTGGCATAGAGTTCATACAGGAACTCGGCACTGAGCGGATTGACGGCACTCAT
Protein sequences of DBSCAN-SWA_1 >CP032819|934442:944756|934442_935210_-|AZS28788.1|DBSCAN-SWA MAPKTEQKIYVGIGLDFETGGLDCRECACTQIALQAVRFDTWQVFDRYQAYIAPYGKQDAGLPRRKVLRTRHEQAKEPEYVPMKYEQTALDYSAITMEMLRTQGVDMKKVAGEVIAFAKRSTLSKGYQCKPVLVGQNIAFDIGFLQQLMNYAGLAAEFEKTFSGTKDYYGNFQPHYIDTLVMGRLAFAADPEVTSYKLELVASKLGVELDDAHDAAADVTATLDILGVYTSRLRQTEGAAIATQKKEKTRKYFKI >CP032819|934442:944756|937375_937933_-|AZS28790.1|DBSCAN-SWA MDEEEIWKDIHHYEGIYQVSSCGRIRSVGRYVRNATSQYWRNGQIIKPWVGGTSPYYNVSLAYDGKICKKLIHRLVAEHFLDDWNAAKEVNHKDGNKHNNRSDNLEMCTRQENVMHSMVHKLRDDYGENSVNAKLTNEQADHIRRLHRSGVMQNELAEQFKVCKQTICDIVHYKKYKNESDKRNN >CP032819|934442:944756|943400_944756_-|AZS28797.1|DBSCAN-SWA MSAVNPLSAEFLYELYATALRQEPLCAVLARHMRKEYLPDRSFQRVQEAIAAHFRTYKTPPSYAVLAQTFHEDYDAIELIDTFREYDEGQSAEVMTDMLESYIKGVRLQSVYAEVGKLYNENKQDKAEKTLREYAEWLAGFMLKSTSFVDVAATFKERFERNRRREEEEERSAAPRVSRFYIPYLDALNAGRNLRGQLTCFLASTGVGKSHIAKWIGVRADIDDGLHVLHFQLEGSEEEALNAYSGGLISRNTYYYERGKISDTEMRHLEKLVASYAGSITVRSYPRFNAQVSTLDIKNGISEYRKLKGYNPDIVIVDSMDLLTDANRRSWGADHERAKRIAVANDLKDLAADEKVWMVVTYQSTIEDREWLNDERNVLTEYNCSEAKGLARPCTHLISLNQSSAERKENVMRLHVAKSRFFKKGDTIKIATDYDNEVFYDGQRTLNLNRE >CP032819|934442:944756|940954_941995_-|AZS28794.1|DBSCAN-SWA MELSVQEYQYLVSEITRETGAKRDGSGKNLIVPRCPFCGKSGGKFGIYIGKETAHRRPFMAHCFSCGASTRTLEQLLAAIGRMDLMVSQTADIAAPLNLHLLEEDEAEEIDDELVPVELPDFYKRTFRHPYLQRRGFCFDDYEYFPIGITGRLNPRYADYVVFPVIDDSTVVGYVSRHTWPKEDIDTHNRKTRHSGGYRILRYRNSTENDFSKLLYNYDAIRMDETDTVILAEGIFDVIALTRRLELYDNSHVAAVATFGKKISDVQIYKLQSKGVRTVVIGYDGDAVESVKRTAERLKPYFEVFIADIADAAKDWDELTESEIYGIFACRLLSVLEYKLKKVQEK >CP032819|934442:944756|937932_938475_-|AZS28791.1|DBSCAN-SWA MADTREIWVDIKNYEGKYKISNLGRVKSLERQVSHNGITWIQPERIMCHWCGTTSLYDCVRLYKGGVGKKFSVHCLVAQHFLPDWNPGLEVNHIDGNRDNNRADNLEMCTHQRNMEHAIAGGLKRDYGEKSVNAKLTNGQAEEIRVRYSSGQASQNSLAKQYGVSRQTVSAIIRYKKYIR >CP032819|934442:944756|935216_937325_-|AZS28789.1|DBSCAN-SWA MWRADIPPSRKADIDVDYASDRRQEIKDYLEERYNADGRQRVFSAGTFTTMKLKAALKDVARVHRVPHSIVNYITAMIDDGTDWTGLFRQAASNRKLRDFIQTYPLVIEDVQGLLGQPKAASIHASAIVVTPDTRDGRPAECFDFLPVRKMDGALVSEFDGYSVDEIGLLKEDVLATKELAKLSAVIALVNRNFRQELTIGRITQDMLEDGKTYRLLSDGNTQNVFQFSSPGITRFIQDVRPECIEDLIAINALYRPATLDIGATDDYVRFRRGEVAPVYNYGCYEATKNTFGIMVYQEQFMSVAHTLGGFDLGKTDYLRKAIGKKKADLMATLKADFIAGAVGNGCPDYEAEEIWHKIEVAGKYSFNRSHAAAYALTAYCGAWLKANYPSAFYTVALQWADDKEIPSLMAEMERCSSAKIVPPDINRSGTEFFTDYATDEIFWSLTRIKQVGVKTVEYIVTERDRGGAYTGIENFIHRIFRYKLKKYSYWDDPDNAEEAVKVPVNARHVKHMILAGCFDRIEKVGAVTERCALLERAARELGFSLSEKDFPQDMRGRHFFWSQQQIAVSGIGSIDYRRIFNNSEARRQVKGKASYLTLDEVARDENDGRRATVCATVVDVTEHTYKDRETGSRKRFAKLTLSQNNRLAECVCWNDYYMEHHTVIQSLKDRVVILTAVIRYSDYNGCNTLQTYRNSLLFIQS >CP032819|934442:944756|938467_939235_-|AZS28792.1|DBSCAN-SWA MNPIYEHHRPYDGCDLYRGDALDVLPLLAEEGITADMVLSDPPYGTTHCRWDAVIDIPKMWNAVQGISRPDTPILLFCQHPFTSVLGSSNLRRLRYAWVWEKTQATGFLNARRMPMKAHEDILVFYDKLPKYHPIKTDGHKRKVVMAEHQRKCDAGEIYRKHDNFRDYISTERYPRSVLKFKTDKQQSCLHATQKPVALLEYLIRTYTKEGDIVLDFAMGSGSTAVACRNTGRRFIGVEIDRGIFQTALNRITHG >CP032819|934442:944756|942126_942426_-|AZS28795.1|DBSCAN-SWA MESPRGLFAFRYSLYVLGAEVKSCRAFFRPFVQSLPTGSYCTDFFDNAWEFPLLYPFSPDKFAVSCHEPLSGRRPVPIDLLVIRTSACHDTWAGFRPAF >CP032819|934442:944756|939200_940958_-|AZS28793.1|DBSCAN-SWA MIQELFSWLDAQRITYIPVDTEVVDIPGFGRLFTADLSGVESIFRSDGDRFVFNLMESPDVLMEEGIFHVAFPFGRNWYYYDLREEFRFNLLKYIGRPKPPVHDVPFVNLGIHTSYELLNACGSPEDLCRKAKWLGHTAVGICDRNTMAATLNLQKECANTGLKHIFGYSLTMMHEEERVGLKIYALDNEGLHNLLRIQRAVMVDSEDNTLRYEQLLMYAAGCVPVFATRSVYWMAGHPKQVERIRKGAEAVYYQVDANEYKADRIDREQLEALKYYFGNCYDADTDSFTVEPVLIPDCYYMDKDDAGYRIVVNKIATGAAHEQSDDQYFKTADELYDTLRPLFSGQWDFDSLFRRMCRPTVEIAGRADASFETGRMFMPEYRMRPQERERYGDRRTMFLRLLDDWLDREVPEPERERYRERLDEEVYIIESTDNVDYFLVQWDMVREAHRRGIATGIGRGSAGGSLVSYLLGITSIDPLKYDLIFSRFLVPERCGLSWKDELTVLAPDITLGKGERYVEIESEGKSYRLCPDARLRVIRAGEERTIYADELMCGDEILFDRRDCLWNLKELETHESDLRTSPSL >CP032819|934442:944756|942538_943204_+|AZS28796.1|DBSCAN-SWA MKTNEDWSEIVDTLRPYLNGNPAKEKLLKDVENCLRFLGWKKTNGTMRSRSTSDGESGKPAITLLKREGDSGWQAFPVMTESPDGMTYATGLYIRENIRLYYRPDSGTGIPVCVLTAELREDDTNGPLLCALLSYKEFDLQSVENFCLRRYNLIRTGGSFRQWVEDFLSGSTGTKNITGLLREKFMSEGLEDSLIREELEKLGLRVRYEKGHAVITLRYSG |
10 | unidentified_phage(62.5%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1332936 : 1338418
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP032819|1332936:1338418|DBSCAN-SWA TTTATTTGTTATCATCTTCAATGCCATATTTTTTCTTTTGCCGAATCACATAATTACTGATTTTTTCATCATATAGTGGATACTGGTAATCCAACTGGCATGCCACTTCTGACGAAACCTCCCGGAACAATGCCATACATTGTTCAAAGGATTCCCACATATGGGGATAGGAATCCATATTATAAGTGGACATAAGTCGTTCCCACAATTCCTCTGGGACATATTGCTTCATAAATTTATAGTTTTTTCCCAAACTGAAATGAAATCCCTGTTTGATACCAATCAGCCAGGAAATCATGCGAAGCAATTCTTTTCGTACTATATCATTCATATGGTCAATAGCAAAAAGAATTTCTTTGCGGCATAAGCCCTTTACTACATATGTTGTAGTATTCCAAAATTCATTACAGCAATCGTCAAACATTCTTTGAGTAGGCTTCTGCAAGTGGTAGTCTATATCCGTTGGTATTGGCGGCTTTACGATACGGTTGTCTTTATCCAACAGTAACTTTACCAGTTTATCCCATGTAAAATACTCGTCTATCAGTTCCAGCGGCAGCAAAGTTAAATCTATCTTAACATCATCAGTAAACAGCATTAAATATGAAAATCCCTTTTCTACAGCTGGAAATAATTCCATATCTTCCGGCTTTTGCAGAATCAACCTTTCACCAAATATATCTAGCCATTTATCATCACTTGTGAAGCTGTCCATATCCGTAACAAAAAAAGTAATATCAAAATCCTGAAAATCATCAGGCGGAATATTTGTATTTGTTCTAGATCCTTCCAAAGTAACCATGCGAATGCGTTTGTCTGTTTTTGCAAAATTCAAAACAATATCATAAACTTCCTTTTCTGATCTCATTTTTATCTCCTCCAATTATTGAAATAGGTGTGAAGCCATTTTAGGGCTTTCCCGCCTTGATTACCCTTTAGCAAAGACGGGCGGGGCTGTCAACGGCGGCGCATGAAATGCGCCGTTCATCTTGACCGTTGACTGGCTCGGCTGGCTTTGCTATTTTTATCGAATTGCATATTGGTAAATTGCATTGATATGTACTTGGGTTGTATTCACGATTGGCACATTAACATCTTCGGGCTTTATGGCAAGTGGCAGCTCCGTACAGGCGAGAATGAGTGCATCTACATTTTCTTTTTCAATGTATTTATTTGCCAGTGCAATCATTTTCTCTTTATCACTAGGAATTACTATGCCATTCTCTAAATTGGGGTATATCAGTTTTCCTAATAGAGATATTATTTTTTCGCTATCGCCTTTACAAGTAACATCATCGGGCGGCGAAGTTCATCTTCCATACCCGGAATATCCATCATCTCTTTAGGCGGTTCTGCTTCCTCTACAACTTTAAGCTCAAACCCATTGTTCAGCAATCCCATTATAATCTGTGTAAGTGTGTGATGTTGTTTTACTACATCACATCCTAAAAAATGTGTATTTCGTTCTCCTGTTATAAAGTAATTATCAATCGCCCAATATTGCGGTTTCCCATCATCTGTATAAATCCAATCCTGCCCAACTCCTGCGGTAAAAACAGGGTGTTCGATATTAAAAAGAAAAATTCCACCTGGTTTTAATGTCCTATACACTTTTTGAAATATTTCCACTATGTCCTCTATATAGTGCAGAGCTAAATTAGATATAACACAATCCCATTCATTTTCTGGATAGTCATATTCCTCTAATCCGCTAATACGATATTCAATTTGATTTCCTGAATTGCGCTTTTGGGCTTCCTCAATCATTTTCTTACTTAAATCGATCCCTAATATTTTTGTAGCTCCTTGTTCTTCTGCGAACTTGCAGTGCCAGCCATATCCGCACCCCAAGTCAAGAACAGATTTTCCTTCTAACGAAGGAAACAAAGGTTTTAATTGATGCCATTCACCAGCAGCTTTTAACCCCTCTTTATCAATGACAATAACATTCCCTTGTGTTTCTTCAAGAATTGCTGATCGCAAACTCGTTTTCCCTGACCCTGGTTGCCCACCAAGTAAAAAAGCGGTTGGCGATTCAACCGCTTTTTTTCCTTGAACCAATTCTTCTAAATTATCATTTAAGCGATTTTCAAATTGTTTGTCAGTAAAATTGACTATATTTGCCATATTAAGCCACTTTCTCTTTATTCAAAACTTGTTTTGAATAGCGATTCATTGACCGCCAATACTCATGAACGGCTTGTAATTCTTCTAAAGAATAATCAAAAAGATTTACACGTTTTGCAAGCTTTAATTGGTTTAATAAATGGACAGTTGCGGATGTACTTCAGAAAAGATTAGATGTCTAAAAAGCTTGTAGTTAAAGCTTTTTAGACATCTAAATCTAGGTACTAAAACAATTCATCCAGTAAAATATAATATTTTATTTTCTCCCAATCAGGCTTGATCCCCAGTAAGTCAAAAAATAGCTCGACATACTGTTCTTCCCCGATATCCTCCCTGATCGACCGGACGCAGAAGGCAATGTCATACCACTTGTCCGCCCTGCCGCTTCTCCCAAGATCAATAAAGCCACTTACTTTGCCATCTTTCACAAAGATGTTGCTGTCTCCCAGGTCGCCGTGGGAAAAGACAAGTTCCTCTTCGGGCTTTTCCGTCTTTAAAAAATCATACAGCTCGCGCGGATCTTTAAATGGAGTGTCTTCTTCCCAGTTTTCGCAATCCACATCGGCCAGATCGTTATTCAGTAAGTAATCCAATTCGGCTAAGCGGCTGTCTAAGCTATTCGTATAGGGACAATCCGATATGTCGATGGAGTGAAAGAGCCTGATGCACTCCGCATACAGCTCGATAATCTTTTCAGGGCTTTGTTCATCTTCATACTCTTCCGAGCAAAGGACGCCATCGGCCTCACTCATGAGCAGATTGCTCCAGCCATCATGCCGTTCAAAGTGCAGGACCTTTGGAACAGGCAGCTTTCCTTCCAGCCATAGCATCATGTCCTTTTCCCGTTCCACATCATAGGTGGTCCCTTTATACCGGCTGTCCGTCATTTTTAAATATAGGTTTTCATTTTCTCCCACCAGCTTATATACCTTAGCAGGAGACATTCCTTCCGTATCTTTTACGCAGCGGTATTTTTCGATCAGTTTTTTCAATTCCGGTGATATTCTCATTTTAGCCATTTATTATTTCCTTCCTCTTTTCTACAGTATTTAAAGATACCCCAAGAAGCTAATTATAACAAGACGAACTCCAATTCACTGTTCCTTGCATTCTAAAACCTTAAATACCAGAAAACAGCTTTTTCAAAGTTGTTTTCAAAGTTGGCGTATAACATAGTATCGACGGAGCCGATTTTGAAACCACAATTATGATACCTTCTCGGTATATTTTTCAGGTTTATGTTCAAAAGTTTCAAGCAGTTTTATAGCTTTCTTTTCGTTGATACAATAAGCATTATTACATGCAAATAACACTTGATTTAAACATGAAACTATACGAAAAACATGACCCGCAATATAATATTTATCGTCTGTTCCCGAATTTGCTTTTACAAACATTAAAGAGAACCCTGCTTCAAACATAAAAAAGTTACATTCAGGATAATTTGAACAACCATAAAACGATTTTTTTAATACAATATTGTTGCCACACTTAGGACATTTTCCTACAAGGGGTCCCGAGCGCTTAGTGGGAATTTGTACCCCTTATCGATACAAATTCCCCGTAGGCGCTAGGGACCTCTTTAGCTCCTTGGAAGCTGTCAGTAGTATACCTAATAATTTATCTACATTCCCTTTAGTAACGTGTAACTTTCCAAATTTACAAAAGCGACTCATAGAATTATTTCCTCCCGTTAAATAATAGATAACTATTAAAAATAGACAATACTTGCTCATAAGTAACGGTACTTAAATTGTTTACTTTGGCGTGTTTCATTGCTTGATGAAACTGATTTTTAGTAAACAGTTGACGATATTCTCGATTGACCCATTTTGAAACAAAGTACGTATATAGCTTCCAATATTTATCTGGAACATCTGTGGTATGGCGGGTAAGTTTTATTAAGACACTGTTTACTTTTGGTTTAGGATGAAAGCATTCCGCTGGCAGCTTAAGCAATTGCTGAATCGAGACTTGAGTGTGCAAGAGCAACCCTAGTGTTCGGTGAATATCCAAGGTACGCTTGTAGAATCCTTCTGCAACAATCAGATAGATGTCAGACGCATGGCTTTCAAAAACCACTTTTTTAATAATTTGTGTGCTTAAATGGTAAGGAATACTCCCAACAATTTTATACCTCTGTTTGTTAGGGAATTGAAACTGTAGAATATCTTGGTGAATTAAAGTGACACGAGTATTCAGTTTTAATTTTTCTGACGATAAGTTGAATAGATGACTGTCTAATTCAATAGACGTTACCTGTTTACTTATTTTAGCCAGTTTCGTCGTTAAATGCCCTTTACCTGTTCCAATTTCGTAAACGGTATCGGTTTCTTTTAAATTCAATTGTTTTATTATTTGGTTGAGTACTTTTTCACTCGTTAAAAAGTTTTGAGAATATTTTATATTAAAGTTTTGAGAATATTTTATATTTTTGTTCATGTAATCACTCCTTCTTAATTACAAATTTTTAGCATCTAATTTAACTTCAATAGACTAATAACAATGTTTGGTCTTGCTTCACTTTGTGCTTGTCGGAAGCCTAAATAGTAACTGCGTTTTGCTTCGTGTCCGTCTGGCAACCGCTCAAATTGTTCATCAAGTTCTGATACACCTTCTTCGTTATAGCGTGGATCAAATTCTTCTAAAAAGTCTGCTTCCTCTACAACTTTAAGCTCAAACCCATTGTTCAGCAATCCCATTATAATCTGTGTAAGTGTGTGATGTTGTTTTACTACATCACATCCTAAAAAATGTGTATTTCGTTCTCCTGTTATAAAGTAATTATCAATCGCCCAATATTGCGGTTTCCCATCATCTGTATAAATCCAATCCTGCCCAACTCCTGCGGTAAAAACAGGGTGTTCGATATTAAAAAGAAAAATTCCACCTGGTTTTAATGTCCTATACACTTTTTGAAATATTTCCACTATGTCCTCTATATAGTGCAGAGCTAAATTAGATATAACACAATCCCATTCATTTTCTGGATAGTCATATTCCTCTAATCCGCTAATACGATATTCAATTTGATTTCCTGAATTGCGCTTTTGGGCTTCCTCAATCATTTTCTTACTTAAATCGATCCCTAATATTTTTGTAGCTCCTTGTTCTTCTGCGAACTTGCAGTGCCAGCCATATCCGCACCCCAAGTCAAGAACAGATTTTCCTTCTAACGAAGGAAACAAAGGTTTTAATTGATGCCATTCACCAGCAGCTTTTAACCCCTCTTTACTGCGGGACATCTTTGCATATTCTTCAAAAAACTTCTCATTATCGTATTCATTATTCATAACCTTAACATCTCCTATTTTATCCTTCGGCACCGCCGCCTCCAGGCGGCTCCCCAACTTCAT
Protein sequences of DBSCAN-SWA_2 >CP032819|1332936:1338418|1337539_1338418_-|AZS29152.1|DBSCAN-SWA MKLGSRLEAAVPKDKIGDVKVMNNEYDNEKFFEEYAKMSRSKEGLKAAGEWHQLKPLFPSLEGKSVLDLGCGYGWHCKFAEEQGATKILGIDLSKKMIEEAQKRNSGNQIEYRISGLEEYDYPENEWDCVISNLALHYIEDIVEIFQKVYRTLKPGGIFLFNIEHPVFTAGVGQDWIYTDDGKPQYWAIDNYFITGERNTHFLGCDVVKQHHTLTQIIMGLLNNGFELKVVEEADFLEEFDPRYNEEGVSELDEQFERLPDGHEAKRSYYLGFRQAQSEARPNIVISLLKLN >CP032819|1332936:1338418|1336742_1337504_-|AZS29151.1|DBSCAN-SWA MNKNIKYSQNFNIKYSQNFLTSEKVLNQIIKQLNLKETDTVYEIGTGKGHLTTKLAKISKQVTSIELDSHLFNLSSEKLKLNTRVTLIHQDILQFQFPNKQRYKIVGSIPYHLSTQIIKKVVFESHASDIYLIVAEGFYKRTLDIHRTLGLLLHTQVSIQQLLKLPAECFHPKPKVNSVLIKLTRHTTDVPDKYWKLYTYFVSKWVNREYRQLFTKNQFHQAMKHAKVNNLSTVTYEQVLSIFNSYLLFNGRK >CP032819|1332936:1338418|1332936_1333803_-|AZS29145.1|DBSCAN-SWA MRSEKEVYDIVLNFAKTDKRIRMVTLEGSRTNTNIPPDDFQDFDITFFVTDMDSFTSDDKWLDIFGERLILQKPEDMELFPAVEKGFSYLMLFTDDVKIDLTLLPLELIDEYFTWDKLVKLLLDKDNRIVKPPIPTDIDYHLQKPTQRMFDDCCNEFWNTTTYVVKGLCRKEILFAIDHMNDIVRKELLRMISWLIGIKQGFHFSLGKNYKFMKQYVPEELWERLMSTYNMDSYPHMWESFEQCMALFREVSSEVACQLDYQYPLYDEKISNYVIRQKKKYGIEDDNK >CP032819|1332936:1338418|1334195_1335062_-|AZS29147.1|DBSCAN-SWA MANIVNFTDKQFENRLNDNLEELVQGKKAVESPTAFLLGGQPGSGKTSLRSAILEETQGNVIVIDKEGLKAAGEWHQLKPLFPSLEGKSVLDLGCGYGWHCKFAEEQGATKILGIDLSKKMIEEAQKRNSGNQIEYRISGLEEYDYPENEWDCVISNLALHYIEDIVEIFQKVYRTLKPGGIFLFNIEHPVFTAGVGQDWIYTDDGKPQYWAIDNYFITGERNTHFLGCDVVKQHHTLTQIIMGLLNNGFELKVVEEAEPPKEMMDIPGMEDELRRPMMLLVKAIAKK >CP032819|1332936:1338418|1336606_1336738_-|AZS29150.1|DBSCAN-SWA MSRFCKFGKLHVTKGNVDKLLGILLTASKELKRSLAPTGNLYR >CP032819|1332936:1338418|1333959_1334199_-|AZS29146.1|DBSCAN-SWA MISLLGKLIYPNLENGIVIPSDKEKMIALANKYIEKENVDALILACTELPLAIKPEDVNVPIVNTTQVHINAIYQYAIR >CP032819|1332936:1338418|1336173_1336317_-|AZS29149.1|DBSCAN-SWA MKLLNINLKNIPRRYHNCGFKIGSVDTMLYANFENNFEKAVFWYLRF >CP032819|1332936:1338418|1335286_1336081_-|AZS29148.1|DBSCAN-SWA MAKMRISPELKKLIEKYRCVKDTEGMSPAKVYKLVGENENLYLKMTDSRYKGTTYDVEREKDMMLWLEGKLPVPKVLHFERHDGWSNLLMSEADGVLCSEEYEDEQSPEKIIELYAECIRLFHSIDISDCPYTNSLDSRLAELDYLLNNDLADVDCENWEEDTPFKDPRELYDFLKTEKPEEELVFSHGDLGDSNIFVKDGKVSGFIDLGRSGRADKWYDIAFCVRSIREDIGEEQYVELFFDLLGIKPDWEKIKYYILLDELF |
8 | Streptococcus_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1686833 : 1695416
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP032819|1686833:1695416|DBSCAN-SWA ATCAGTACATCTGATTCAACTGCTCTTTAATTTTGTCACTCGTGATATAGTCATCGAAGTCCATTCGCTTGTCTATCATTCCATTAGGAGTCAATTCGATGATCCGGTTACAAACCGTTTGGATGATTTCGTGGTCGTGAGAATTCATCAGAACGATCCCTTTGAAGGCCACCAACGTGTTATTAAAAGCTTGGATAGATTCCAAATCCAGGTGATTTGCCGGAGAATCCAACAACAAGACATTCGCGTTTGTCAACATCATCTTGGCGATCATGCAACGCATCTTCTCTCCACCGGATAACACTTTCACCCGCTTGTAGATATCCTCCCCGGAAAATAACATTTTCCCGAGGAATCCCCTTAAAAATGTTTCGCTTGTATCTGCCGAATACTGTCCTAACCACTCGACCAAAGTCATATCCGTCTGGAAATAAGCCGAATTATCCAACGGAAGGTAGGCACTCGTGATCGTTACCCCCCAGGTAAAGCTACCGGTGTCCGGTTTCTCGTTCCCCGTGAGGATCTCCAACAGAGCGGTCATCGCCCTCGGATCCCGTGACAGGAAAGCGATCTTATCTCCCTTTTCCACGGTAAATTCCACGTCGCGGAATAATGTCTGTCCCTCGATGGATTTACTCAAATTACGAACATCCAGGATCCGGTCGCCCACTTCCCGTTCCGGGGTAAAGATTATTCCGGGGTATTTCCGGGTAGAAGGTAAAATCTCCTCGATATTCAATTTCTCCAGCATTTTCTTGCGGCTCGTGGTTTGCTTTGATTTTGCCACGTTGGCACTAAAGCGGGAAATAAACTCTTGTAATTCTTTCTTTTTTTCCTCGGCTTTCTTGTTCTTAGCTTGAGCCTGGCGCAGGGCTAACTGGCTCGACTCGTACCAGAAACTGTAGTTACCGGCAAACAACTGGATTTTTCCGAAATCAATATCCACCGAGTGCGTACACACGGCATCCAAGAAATGGCGGTCATGCGATACCACTAAAACCGTGTTCTCGTAATTGGCCAGATAATTTTCCAACCACATCACCGTTTCCAAGTCCAAATCATTCGTAGGCTCATCCAACAGCAAGTTATCCGGTTTCCCGAACAGGGCCTGAGCCAACAAAACCCGTACTTTTTGTTTACCACTTATATCTTTCATCAAGGAATAATGCAAATCCTCCTTGATTCCCAATCCGCTCAACAAGTTCGCCGCGTCACTCTCCGCGTTCCATCCGTCCATTCCGGCAAATTGTTCTTCCAGTTCGGCGGCCCGAATCCCATCTTCATCCGAGAAATCTTCCTTCATGTAGATCGCATTTTTCTCCTGCATCACTTTCCACAAAGAGGCGTGTCCTTGCAACACGGTGTCCAAAACGGTACACTCGTCGAAAGCAAAGTGGTCCTGTTTCAATACCGACAACCGTTCTCCCGGTCCGAACATCACGGTTCCCCGCGTAGGCTCCAAATCGCCGCTCAATATTCTCAAGAACGTAGATTTTCCGGCACCGTTGGCACCGATTACTCCATAACAATTTCCCGGTGTAAATTTCAAATTCACATCCCGGAACAAAGTTCTCTTCCCAAACTGAATCTCTAAATCCGAAACTGTAATCATTATTTATCTTTTTCTAAAAACGCCCGCAAAGATACACCTTAAAAAGTTGCCATGCAACTTTTTAATGCTTCTCTCCAATCCCGGATTTCCAAGTCGAAGGTCTCTTTAATTTTTGTCTTATCTAAAATTGAATAAGCAGGTCTTTTCGCTTTGGAAGGATATTCCCCGGTTGTCACCGGGTTTACTTTACACTTTGCACCTGTAATCGCCACGATTTCTTTGGCAAAATCATACCACGTACAGGCCCCTTCGTTCGTGAAGTGATACACTCCCTCCAACTCCGGGAGATCATCCCGTTCCATGATACGAATAATCGCCTCGGCCAGATCCCCGGCATAAGTCGGCGTGCCCCACTGATCGTTCACCACGTTCAATTCGTCTCGTTCCGAAGCCAAGCGGGATATTGTTTTCACGAAATTATGTCCATACCCGGAGTACAACCAAGCCGTACGGATAATAAGATACAGACATCCAGATTTCTTGATGGCCTGTTCTCCTTCCAATTTCGTTCTCCCGTACACGCTTTGCGGGTTTGTAGGATCTTTTTCCGTGTAAGGATGATCGGAGGTTCCGTCAAACACGTAGTCTGTCGAGATATGAAGCAACAGACAATCCAATTTCACTGCTGCATTAGCCAGGTTTGCCACGGCATCACGATTCAGTTTCATCGCTTTTTCCTCGTCTTCCTCGGCCTGATCGACAGCCGTGTACGCGGCACAATTGATGATCGTGTCGATATCATGCGTCTCGATAAAGGAGCATATCGCCTTAAAATTCGTTATATCCAATTCTTCGATATCCGTGTAAAACACGTCATCCAGCAAAGAAAAACTGCGATTTCTCAGTTCGGTACCCAACTGTCCATTCGCACCCGTGATCAATATATTTGCCATTTGTAATTTTTGATTTTAGATTTATGATTTATGATTTTTCCCCTTAAAGCCGACACACGTTGGCTTTCACGGGCTTGAAGACGTTATGTTCGCTTATCGTATTTCTCGAATTCGGAATTCATGCTGACAAACTCGGGCACAATGTCCTTCATCATCTTCACCACATCCGGGATATTTACCCTGGCAGCCAGTTCATTCAATTTATTCAATTCATTTACGACATCAAAATAATTATACTCCCGGACTTTCGCTACCCGAATCTTGCTATGAGCGGTGGGTAGGGTGTTCTCCTTGTTACTCAACAGTTCTTCGTATAATTTCTCGCCCGGTCGCAATCCCGTGTAAACGATGGAGATGTCCTTGTCCGGATCAAATCCCGATAATTCAATCATCCGTCTTGCCATATCGGCAATTTTCACGGGACGTCCCATTTCGAAAATATAAATCTCACCGCCCATACCCATCGTGCCGGCTTCCATCACCAAACGACAGGCTTCCGGGATCGTCATGAAATAACGAATAATATCGGGGTGTGTCACCGTTACAGGTCCCCCATTGCGAATCTGTTCGCGGAATCTCGGGATCACCGAACCGTTAGATCCCAACACGTTCCCGAAGCGAGTCGTGATAAAGCGGGTTGTCCCCGCTTGCAAGCCATCCCGAATGGCCACGCTTAGGCTTTGTACATAGATTTCGGCCAAACGCTTGGAAGCTCCCATGATATTTGTCGGGTTTACCGCCTTGTCCGTGGATACCATCACAAAACGCTCGACACCATATTTCACGGCATGGTCAGCCACATTCATGGTTCCGAACACGTTCACCCGCACGGCTTCACAAGGATTCGCTTCCATCAAGGGCACGTGTTTGTAAGCCGCCGCATGGAATACCACCTGAGGACGATAATTACGGAATACAAAATCCATGCGATCAAGCGATCTTACGTCCCCGATGACAGGGACAAAGGGTGTTGTCGGGTATTTATCCGTGAATTCCAACTGAATGTTGTGCATCGGGGTTTCTGCTGAATCCAGGAAGATCAATTGCTTGACTTTAAAACGCGTCAACTGCCGACACAATTCACTCCCGATAGAGCCGGCTGCTCCCGTTACCATGACCACTTTCCCCTCCAATCCTTCTGCAATTTCTTTCATGCTGATCTTGATTTCGTCACGCCCAAGCAGGTCTTCCAGACGGATTTCCTTCATCTGCGGGCGCGGCAACTGCCCGTCACTCATTTCCTCCACGGAAGGCAGTACCAACATTTCCAAACCCAATTTCTGGCAATAGCGCACCAACCGTTTTTGTTCTTCCTGGGCCGTCCGATAATTCGGAAACACGACACCATGAATCCGCAAATGCTTTACCAAGCGGTTAAATCCCTCTTCTCCCGTGATAAAGTACACGGATTCACCGGATAAGCGGTAATGTTTATATCTTTTCCCTATCGTCAAATAGCCCGTCACTTTGTATGAAGGCAGGAAACTGTTAGAGACGGCCGTACTGAGCGAGGCGGAACAATCGTCCATCCCGAAGATCAATACCCGCTTTTGTTCTCCATTCAAGCGCACCTTCAAGAAATCATACAAACTTACCAACACGACCCGCACGGATACCAACACGACCGTCGTTGTCATCACATCCAACAAAGCTCCCAAGATCAATCTTTCGTTGGATACTTGCGGGTAGAAGACGTGAATCAATGGGTAAAGAATGATCACTTTCAAGGTCATCGCCATTCCCATACGCCATGTCTCCCGCAACGTGGTGAAACGCATCACGACGCGATGGACGTGTAATATCGCAAACGCTAACGCGCTGATGGTTGCTGCCAGTAATGCCAAATATATAGCCAATGACCAAACAACAGGTTCTTCTCTGACAAACCGGATAAGCCAGTAAGCAAAAACCGTTACCAAAGTTGACAGCACGACGTCAATCCCCATCACGATCCAGCGACTCAGATATTTCGCGTGTACAATCTTGGTAGCTAATCCATTCGGTCCAAATAAATAATTTTTCATTCAATTCTTTTTTACAACCATATCGCATCCTGCAACCTCGGATGTTTTTTATCTTTCTCCGATAATATGATATCTTCTACCGGCAATTTCCAATCAATTCCAAGTTCCGGATCATCGTAACGAATCGCCCCTTCGTGGTCAGGAGCGTAGTACTCGTCGCATTTGTACTGAAATACCGCCTCTTCACTCAAAACGGCAAATCCATGCGCGAATCCCTTGGGCAGGAAGAATTGCAATTTATTTTCCTCGCTCAATTCCACGGCTACATGTTTTCCGAAAGTGGGCGAACCTTTACGAATATCCACGGCAACATCCAACACACGGCCTTTTACGACACGCACCAGTTTTGCCTGCGTGTATGGAGGCAATTGGTAGTGTAATCCTCGTAACACGCCATATTTAGACTTAGATTCGTTATCTTGCACGAACACGGTTTTACTTACCTTTTCTTCAAAACGTTTTAACGAAAAACTTTCGAAGAAATACCCCCGATCATCACCAAACACTTGCGGTTCAAGTATAACAAGACCTTCTATTTCTGTATCAATAACCTTCATTATCATTGAAAATTGAAAATTGAAAATTGAAAATGCAACACCAGGTACAATTTCATATTTCAATTTACAGATTATACATTGATTCGTAATATTTCATGTATTCTCCACTCGTCACGTTATCCAACCAATCCTGATTTTCCAAATACCATTTTACTGTTTTTTCAATCCCCTCCTCGAATTGCAGGGAAGGTTCCCAACCTAGTTCTTCGTGCAATTTGGTGGAGTCGATCGCGTAACGCAAATCGTGTCCGGCCCGGTCCGTTACATACGTGATCAACTTCTCGGAAGTTCCTTCCGGACGTCCTAATAAACGGTCGGTAACCCGGATCATCACTTTGATCAGATCAATGTTTTTCCATTCGTTGAACCCTCCGATATTGTAAGTATCTCCCACTTTACCTTTATGGAATATCAAATCAATGGCCCGTGCATGATCTTCCACGAACAACCAATCTCTCACGTTCTCTCCTGTCCCATACACGGGAAGCGGTTTGTTGTGACGAATATTGTTGATAAATAGCGGGATTAACTTCTCCGGGAACTGGAAGGGCCCGTAGTTGTTGGAACAATTCGAGATTTTCACCGGTAACCCGTAAGTGTCATGGTACGCCCTCACGAAATGATCGGAGGAAGCCTTCGAGGCCGAGTAGGGAGAGTGAGGATCGTATTTTGTGGTTTCCAAAAAGAATCCTCCATCCGGTTGCAAAGAACCGTACACTTCGTCCGTTGAAACATGATAAAACAGTTTTCCGTCAAAATTACCCTCCCAAGCCAGCTTGGCAGCTTGCAACAAGCTTAACGTTCCCATCACGTTTGTCTGGGCAAAAGAGAACGGATCTTTGATCGATCGATCCACGTGGCTCTCTGCTGCCAAATGGATTACCCCGTCTATTTCATATTCGGTAAAAATCTCTCGTATTTTCTCGAAGTCACAAATATCCGCTTTCACGAACTTGTAATTCGACATTTTCTCGACATCCTTCAAATTGGCCAGATTGCCCGCGTAGGTCAACTTATCCAAATTGATCACACGATATTCCGGATATTTGTTTACCATTAGCCGAACCAGATGTGAACCGATAAAGCCGGCCCCGCCAGTGATTAGTAAATTTTTCATTCTTCAAAATTTAGTTAAACCCATAATACTTCTCTACAGTCGTGTACAAGTAAACTTGATTTACCAACAAACGGGCATCCTGCCAAATCTTAAAATCCTCAAAACACTTAACCGTTCCCATAATTTAGAAATAACTTATACCCTGTAAAACAACTTGTAACTTGCAACCTGCAATTTGTTGGAATTATTGCAGTAATTTCAATAAATATTGCCCATACTGGTTTTTAATCATCGGTTGTGCCACAGTCCTCAAACGCTCTGCATCAATCCATCCTTTCATGTAGGCTATTTCTTCCAAACAAGCCACCTTTAGGCCTTGGCGTTTTTCGATCACTTCCACGAAATTGGAGGCTTCAGCCAATGAATCGTGAGTTCCCGTATCCAACCAAGCAAATCCGCGTCCCAATAACTGAACTTTTAAATCGCCAGCCTCCAGAAAGGTTTGATTCACAGAAGTAATTTCCAACTCCCCGCGAGCGGACGGTTTTATCTCTTTAGCAATACGCACCACCTCGTTCGGGTAGAAGTACAACCCGACCACGGCATAATTCGATTTCGGTTTTTTCGGCTTTTCCTCGATACTCAGTACATTCCCGTCCTGATCAAATTCGGCCACCCCGTAACGTTCCGGGTCATTCACCCAATAACCGAAAACGGTTGCCTTATTCTCTCCTTCCGCGGCCTTCACGGCATCTAACAACATGCCTGTAAAATGTTGTCCATAAAAAATATTGTCTCCCAACACCAAGCAAACCGAATCCTGCCCGATAAATTCCTCCCCGATAATAAATGCCTGTGCCAACCCGTCCGGGCTGGGTTGTTCCGCGTAGGTGAAACGAACCCCGTAATCGGAACCGTCACCGAGTAAACGTTCGAAACCCGGTAAGTCCTGCGGGGTTGAAATAATTAAAATATCCTTGATTCCCGCCAACATCAATACAGAAATGGGGTAATATACCATCGGCTTATCATATATCGGCAATAATTGCTTCGATACGCCTTTCGTAATTGGATATAACCTGGTCCCGGATCCACCGGCTAATACAATTCCTTTCATATCATTAAATTCTGACGCAAAGTTATCATTTTATTTATAGGCTCCAGTATTTCATCTGGTGAATTTTCCCCCTGTAAATTCCTTTTATATCTCCTAATACAGCATGTTCGTATGTCAATTCCCGGAAATAGTCCTCGTCCAAATCCATGTACTCTTTATGCGCAACCGCCACGATGACGGCATCGTAGTTTGCCCGGGGTTTTTCGATCAACCGGAAACCGTATTCGTGCATCACTTCCTCCGAATCCGCGTTCGGATCCGTGACATCTACATCCACGCCAAAATCTTTCAACTCTCTCACGATGTCCACGACTTTCGAATTCCGGATATCCGATACATTCTCTTTAAACGTAACACCCATAACCAACACGCGAGCCCCCAGGATATTCTTTCCCATCCCGATCATCTTTTTCACGATTTTCTTGGCGATATATCCTCCCATCGAGTCATTTACAAACCGCCCGGCATTAATCATTTGCGGATGATATTTCAATTCTTTTGCCTTATGTACCAAGTAATAAGGATCGACCCCGATACAATGCCCTCCTACCAAACCGGGATAAAATTTCAAGAAATTCCATTTCGTTCCCGCGGCCTCCAACACGTCATACGTGTTGATACCGATACGGCTGAAGATAATGGACAATTCGTTCATCAGAGCGATATTCACGTCCCGCTGCGTGTTTTCGATAATCTTCCCTGCTTCAGCCACTTTGATGTTCGGAGCCCGATGTACTCCCGGTTTCACCACTAGCTCGTAAACTTTAGCCACGGTTTCCAAGGTTTCGGCATCACCTCCTGAAACAATTTTTACCGTGTTGGCCAACGTGTGAACTTTGTCGCCCGGGTTGATTCGTTCGGGAGAATAACCGATCTTAAAATCAATTCCAACTTTCAGCCCGGACACCTCTTCCAGCACCGGGACGCAATCTTCTTCCGTACAGCCCGGGTAAACCGTGGATTCGTAAACCACGTAATCACCTTTCTTTAAAGCCATCCCTACCGTGCGGGAAGCCGCCAGCAAAGGCTTTAAATCCGGTTGATTAAATTTATCGATCGGGGTCGGCACCGCCACGATAAAAAACGACGCTTCCCGTAATTCAGCCACAGAAGAGGTAAAATAGATATCACAATCCTCAAATGCTTCCGTCCCCAACTCGTTACACGGGTCTATCTTCTTTCTCATCTTTGCCAAGCGTTCCTCGTTTATATCGAACCCGATCACAGAAATTTTTTTGGCAAACTCCAAAGCAATCGGCAACCCGACATATCCTAACCCGATTAATGCTAACTTTGCCTTTTTATCTAATAACTTATTATACAT
Protein sequences of DBSCAN-SWA_3 >CP032819|1686833:1695416|1694126_1695416_-|AZS29457.1|DBSCAN-SWA MYNKLLDKKAKLALIGLGYVGLPIALEFAKKISVIGFDINEERLAKMRKKIDPCNELGTEAFEDCDIYFTSSVAELREASFFIVAVPTPIDKFNQPDLKPLLAASRTVGMALKKGDYVVYESTVYPGCTEEDCVPVLEEVSGLKVGIDFKIGYSPERINPGDKVHTLANTVKIVSGGDAETLETVAKVYELVVKPGVHRAPNIKVAEAGKIIENTQRDVNIALMNELSIIFSRIGINTYDVLEAAGTKWNFLKFYPGLVGGHCIGVDPYYLVHKAKELKYHPQMINAGRFVNDSMGGYIAKKIVKKMIGMGKNILGARVLVMGVTFKENVSDIRNSKVVDIVRELKDFGVDVDVTDPNADSEEVMHEYGFRLIEKPRANYDAVIVAVAHKEYMDLDEDYFRELTYEHAVLGDIKGIYRGKIHQMKYWSL >CP032819|1686833:1695416|1688482_1689337_-|AZS29453.1|DBSCAN-SWA MANILITGANGQLGTELRNRSFSLLDDVFYTDIEELDITNFKAICSFIETHDIDTIINCAAYTAVDQAEEDEEKAMKLNRDAVANLANAAVKLDCLLLHISTDYVFDGTSDHPYTEKDPTNPQSVYGRTKLEGEQAIKKSGCLYLIIRTAWLYSGYGHNFVKTISRLASERDELNVVNDQWGTPTYAGDLAEAIIRIMERDDLPELEGVYHFTNEGACTWYDFAKEIVAITGAKCKVNPVTTGEYPSKAKRPAYSILDKTKIKETFDLEIRDWREALKSCMATF >CP032819|1686833:1695416|1686833_1688444_-|AZS29452.1|DBSCAN-SWA MITVSDLEIQFGKRTLFRDVNLKFTPGNCYGVIGANGAGKSTFLRILSGDLEPTRGTVMFGPGERLSVLKQDHFAFDECTVLDTVLQGHASLWKVMQEKNAIYMKEDFSDEDGIRAAELEEQFAGMDGWNAESDAANLLSGLGIKEDLHYSLMKDISGKQKVRVLLAQALFGKPDNLLLDEPTNDLDLETVMWLENYLANYENTVLVVSHDRHFLDAVCTHSVDIDFGKIQLFAGNYSFWYESSQLALRQAQAKNKKAEEKKKELQEFISRFSANVAKSKQTTSRKKMLEKLNIEEILPSTRKYPGIIFTPEREVGDRILDVRNLSKSIEGQTLFRDVEFTVEKGDKIAFLSRDPRAMTALLEILTGNEKPDTGSFTWGVTITSAYLPLDNSAYFQTDMTLVEWLGQYSADTSETFLRGFLGKMLFSGEDIYKRVKVLSGGEKMRCMIAKMMLTNANVLLLDSPANHLDLESIQAFNNTLVAFKGIVLMNSHDHEIIQTVCNRIIELTPNGMIDKRMDFDDYITSDKIKEQLNQMY >CP032819|1686833:1695416|1689420_1691361_-|AZS29454.1|DBSCAN-SWA MKNYLFGPNGLATKIVHAKYLSRWIVMGIDVVLSTLVTVFAYWLIRFVREEPVVWSLAIYLALLAATISALAFAILHVHRVVMRFTTLRETWRMGMAMTLKVIILYPLIHVFYPQVSNERLILGALLDVMTTTVVLVSVRVVLVSLYDFLKVRLNGEQKRVLIFGMDDCSASLSTAVSNSFLPSYKVTGYLTIGKRYKHYRLSGESVYFITGEEGFNRLVKHLRIHGVVFPNYRTAQEEQKRLVRYCQKLGLEMLVLPSVEEMSDGQLPRPQMKEIRLEDLLGRDEIKISMKEIAEGLEGKVVMVTGAAGSIGSELCRQLTRFKVKQLIFLDSAETPMHNIQLEFTDKYPTTPFVPVIGDVRSLDRMDFVFRNYRPQVVFHAAAYKHVPLMEANPCEAVRVNVFGTMNVADHAVKYGVERFVMVSTDKAVNPTNIMGASKRLAEIYVQSLSVAIRDGLQAGTTRFITTRFGNVLGSNGSVIPRFREQIRNGGPVTVTHPDIIRYFMTIPEACRLVMEAGTMGMGGEIYIFEMGRPVKIADMARRMIELSGFDPDKDISIVYTGLRPGEKLYEELLSNKENTLPTAHSKIRVAKVREYNYFDVVNELNKLNELAARVNIPDVVKMMKDIVPEFVSMNSEFEKYDKRT >CP032819|1686833:1695416|1693219_1694092_-|AZS29456.1|DBSCAN-SWA MKGIVLAGGSGTRLYPITKGVSKQLLPIYDKPMVYYPISVLMLAGIKDILIISTPQDLPGFERLLGDGSDYGVRFTYAEQPSPDGLAQAFIIGEEFIGQDSVCLVLGDNIFYGQHFTGMLLDAVKAAEGENKATVFGYWVNDPERYGVAEFDQDGNVLSIEEKPKKPKSNYAVVGLYFYPNEVVRIAKEIKPSARGELEITSVNQTFLEAGDLKVQLLGRGFAWLDTGTHDSLAEASNFVEVIEKRQGLKVACLEEIAYMKGWIDAERLRTVAQPMIKNQYGQYLLKLLQ >CP032819|1686833:1695416|1691982_1693035_-|AZS29455.1|DBSCAN-SWA MKNLLITGGAGFIGSHLVRLMVNKYPEYRVINLDKLTYAGNLANLKDVEKMSNYKFVKADICDFEKIREIFTEYEIDGVIHLAAESHVDRSIKDPFSFAQTNVMGTLSLLQAAKLAWEGNFDGKLFYHVSTDEVYGSLQPDGGFFLETTKYDPHSPYSASKASSDHFVRAYHDTYGLPVKISNCSNNYGPFQFPEKLIPLFINNIRHNKPLPVYGTGENVRDWLFVEDHARAIDLIFHKGKVGDTYNIGGFNEWKNIDLIKVMIRVTDRLLGRPEGTSEKLITYVTDRAGHDLRYAIDSTKLHEELGWEPSLQFEEGIEKTVKWYLENQDWLDNVTSGEYMKYYESMYNL >CP032819|1686833:1695416|1691372_1691918_-|AZS31934.1|DBSCAN-SWA MKVIDTEIEGLVILEPQVFGDDRGYFFESFSLKRFEEKVSKTVFVQDNESKSKYGVLRGLHYQLPPYTQAKLVRVVKGRVLDVAVDIRKGSPTFGKHVAVELSEENKLQFFLPKGFAHGFAVLSEEAVFQYKCDEYYAPDHEGAIRYDDPELGIDWKLPVEDIILSEKDKKHPRLQDAIWL |
7 | Escherichia_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
3755868 : 3764869
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP032819|3755868:3764869|DBSCAN-SWA CATGAGTGCCGTCAATCCGCTCAGTGCCGAGTTCCTGTATGAACTCTATGCCACGGCGCTCAGACAGGAACCGTTGTGTGCTGTCCTAGCCCGGCATATGCGTAAGGAATACCTGCCGGACCGTTCGTTCCAGCAGGTACAGGAGGCGATAGCCGTACACTTCAAGACATACAAGACACCGCCGTCGTATGCCGTGCTGGCACAGACCTTCCATGAGGATTACGACGCCATCGAGCTGATAGACACTTTCCGGGAGTATGACGAGGGACAGAGTGCCGAGGTGATGACCGACATGCTGGAGTCCTACATCAAGGGGGTCCGGTTGCAGTCGGTCTATGCGGAGGTCGGGAAACTGTATAACGAGAACAAACAGGACAAGGCGGAGAAGACTCTTCGGGAATATGCCGAGTGGCTGGCAGGGTTCACGCTGAAGAGTTCCTCGTTCGTCGATGTGGCCGCGACTTTCAAGGAGCGTTTCGAGAGGAACCGCCGGCGCGAGGAGGAAGAGGAACGCTCGGCCGCCCCTCGTGTGTCCCGCTTCTACATCCCGTATCTGGATGCGCTCAATGCCGGACGTAACCTGCGAGGTCAGCTGACCTGTTTCCTGGCCAGCACAGGCGTGGGAAAGTCGCACATCGCCAAATGGATCGGTGTCAGGGCGGACATCGATGACGGGCTGCATGTGCTGCATTTCCAGCTGGAAGGTTCGGAAGAAGAGGCGTTGAACGCCTATTCCGGCGGACTGATTTCCAGGAACGCCTACTATTACGAGCGGGGAAAGATTTCGGATACGGAGATGAGACATCTGGAAAAACTGGTCGCCTCGTATGCCGGAAGCATCACGGTGCGCAGTTACCCGCGTTTCAATGCCCAGGTCTCGACACTCGACATCAAGAACGGCATTTCGGAATACAGGAAGCTCAAAGGATATAATCCGGACATCGTGATTGTGGATTCGATGGACTTGCTGACGGATGCCAACCGCCGTTCATGGGGAGCCGACCATGAACGGGCGAAGCGTATCGCGGTGGCTAATGACCTGAAGGACCTGGCGGCGGACGAAAAGGTATGGATGGTGGTGACTTACCAGTCTACCATCGAAGACCGGGAGTGGCTGAATGACGAGCGGAACGTGCTGACCGAGTACAACTGTTCGGAGGCCAAAGGTCTGGCCCGCCCGTGCACGCACCTTGTTTCGCTCAACCAGTCATCGGCTGAAAGAAAGGAGAACGTTATGCGCCTGCACGTGGCCAAAAGCCGGTTCTTCAAGAAAGGAGACACTATCAGGATTGCCACCGACTACGACAATGAGGTGTTCTATGACGGGCAGAGGACACTGAATCTGAACAGGGAATAAAAAGCCATCGATCCGGATGCGGCGTGTCGGTGGCTTTTTGACTTGTAACGACTGGAATGTTGAGAATATATGGCAGCTTACATGCGTAAAACCAATTCCGGACAGATACGATTTTTATTGTATGCCCTTTGTATTCCGCCATATCGTCCGTTCCCCATGTTATCAACAGCAGGTTTTCACATTTCAATTCCTCTGCCGCCTCCAAGACTTTCAGTTTCCGCTCACGGGTCTTGGCGTTCGAGATATCATACGACCTTTGTTCGAGGAAAGTTCGAATGACCTCGCAAATACAAATCCGTTATTTGTAGTAATTGGAAAACGATACTTTTCATTCAGTATGTTTATTGTACTGCACAAAAAATACGCTTTTTTTATGTAGAGAGTGATCATTTTAGAAAAGTACATCTCTACACCTGATAATCAGCCAATGCTGATTATCAATCTCCTTATTTATATATTGTTTTATCGATTAGACAAAAGCAGTCATTTTTTGTCCAGCCGGGAAGACAATTTCTCTTTTCCTTTGGGTGGAAAGACCTTCCATGATTCTGCCGGTCTATTTTTATTGGGCTAATCCGGATTCTTTTTCCAAACATTTCAAACCAACCTGTAGCGGTCACGACTATACCTTAATATGGAACTGTCGGTACAGGAATATCAATATCTGGTTTCGGAGATAACCCGCGAGACGGGTGCCAAACGGGACGGGAGCGGTAAGAACCTTATCGTTCCGCGCTGCCCGTTCTGCGGAAAGTCGGGCGGTAAGTTCGGTATCTACATCGGCAAGGCGACTGCCCACCGCCGGCCTTTCATGGCCCACTGCTTCTCCTGCGGGGCATCCACCCGTACATTGGAGCAACTTCTGGCGGCTATCGGTCGCATGGACCTGATGGTTTTGCAGACAGCTGACATTGCCGCACCGTTGAACCTGCACCTGCTGGAGAAGGACGAGGCGGAAGAAATAGACGACGAACTGGTGCCGGTCGAATTGCCGGACTTTTACAAGCGCACTTTCCGGCATCCGTATTTGCAGCGGCGCGGCTTCTGTTTCGACGATTACGAGTATTTCCCGGTCGGGATAACCGGCAGGCTCAATCCGCGCTACGCGGACTATGTCGTCTTTCCGGTCATCGATGACTGTACGGTTGTCGGCTATGTCTCCCGCCATACCTGGCCGAAAGAGGATATAGACGCCCACAACCGCAAGACCAGACATAGCGGCGGGTACAAGATCCTGCGTTACCGGAACTCCACGGAAAACGACTTTTCCAGACTCCTTTATAACTACGATGCCGTCCGTATGGATGAAACCGATACGGTCATCCTCGCGGAAGGTATTTTCGATGTCATCGCCCTGACCCGCAGACTCGAACTCTATGACAATTCCCATGTCGCAGCCGTGGCCACTTTCGGAAAGAAAATATCCGATGTACAGATCTACAAGCTGCAATCGAAAGGCGTCAGAACCGTAGTCATCGGTTACGATGGTGATGCCGTCGAGTCGGTAAAGCGGACGGCTGAACGGCTGAAACCCTATTTCGAGGTTTTCATCGCGGACATTGCTGATGCCGCCAAAGACTGGGACGAACTCACGGAAGCGGAAATCTACGGGATTTTCGCCTGCCGCCTGCTGTCCGTCCTTGAATACAAACTCAAAAAAGTACAGGAAAGATGATACAGGAACTTTTCTCATGGCTCGATGCACAACGGATAACCTATATCCCGGTCGATACGGAAGTGGTGGATATTCCCGGCTTCGGGCGGCTTTTCACGGCCGACCTGTCGGGAGTGGAATCCATCTTCCGCGGCGACGGGGACAAGCTCGTGTTCAACCTGATGGAAAGTCCGGATGTGCTGATGGAAGAAGGAATCTTCCATGTGGCCTTCCCGTTCGGCAGGAACTGGTACTACTATGACCTGCGGGAAGAATTCCGTTTCAACCTGCTGAAATATATCGGCCGGCCCAAGCCTCCGGTACATGATGTACCTTTCGTGAACCTCGGCATCCATACCTCCTACGAGCTGCTGAACGCCTGCTGCTCGCCGGAAGATTTGTGCCGCAAGGCGAAATGGCTCGGGCATACGGCTGTCGGCATCTGCGACCGCAATACAATGGCCGCCACGCTGAACCTTCAGAAGGAGTGCGCCAATACCGGGCTGAAACATATATTCGGCTACTCGCTGACAATGATGCACGAAGAAGAACGTGTCGGGCTGAAGATTTATGCCTTGGATAACGAGGGGCTGCATAATCTGCTGCGCATCCAACGGGCCGTCATGGTCGATTCGGAAGACAACACCCTCCGCTATGAACAGCTGCTGATGTATGCCGCAGGTTGTGTGGTGGTTTTCGCCATCCGTTCCGTCTACTGGATGGCCGGACACCCCAAACAGGTGAAGCGTATCCGGAAAGGTGCCGAAGCGGTCTATTACCAGGTCGACGCCAACGAATACAAGGCGGACCGCATCGACAGGGAGCAACTAGAAGCCCTGAAATATTATTTCGGCAACTGTTATGATGCCGACACGGATTCATTTACAGTAGAACCAGTCCTGATTCCGGACTGCTACTACATGGACAAGGACGATGCAGGGTACAGAATCGTGGTGAACAAGATTGCCACGGGAGCCGCCCATGAACAGAGCGATGACCAGTATTTCAAGACAGCGGATGAATTGTATGACACGCTCCGTCCTCTGTTCTCCGGGCAATGGGACTTCGATTCCCTGTTTAGGCGCATGTGCCGTCCCACGGTGGAGATTGCCGGACGTGCGGACGCCTCATTCGAGACCGGGCGAATGTTCATGCCGGAATACCGTATGCGACCGGAAGAGCGGGAACGGTATGGTGACCGCCGTACGATGTTCCTCCGACTGCTTGACGACGGGCTGGACCGGAAAGTGCCGGAACCGGAGCGGGAACGCTATCGGGAACGGCTGGACGAGGAAGTCTATATCATTGAATCGACCGACAATGTGGACTATTTCCTGGTGCAGTGGGATATGGTGCGGGAGGCGCACCGGAGAGGCATCGCAACTGGTATAGGTCGCGGTTCCGCCGGGGGCTCGCTGGTTTCCTACTTGCTTGGCATAACCTCCATCGACCCGCTGAAATACGACCTTATCTTCTCGCGCTTCCTCGTGCCGGAACGCTGCGGGCTCAGCTGGAAAGACGAACTGACGGTGCTTGCCCCGGACATCACACTCGGCAAGGGCGAACGCTATGTGGAGATGGAATCTGAAGGCAAGACTTACCGTCTGTGCACGGATGCCCGGATGCGGGTAATCCGTAACGGAGAAGAGCGGACGATATACGCAGATGAATTGATGTGTGGCGACGAAATCCTTTTTGACCGCCGAGATTGCTTGTGGAACCTAAAGGAACTCGAAACCCATGAATCCGACCTACGAACACCACCGTCCCTATGACGGTTGCGACCTTTACCGGGGCGATGCACTCGATGTGCTGCCCCTGCTGGCTGAAGAAGGCATCACTGCCGACATGGTATTGTCAGACCCTCCATACGGTACGACACATTGTCGATGGGACGCCGTGATAGACATCCCCGGGATGTGGAATGCTGTACAGGGCATATCCAGACCTGACACTCCCGTACTGCTGTTCTGCCAGCATCCTTTTACGAGCCTGCTGGGCAGCTCCAATCTCCGCAGGCTGCGGTATGCCTGGGTATGGGAGAAGACGCAGGCGACAGGCTTCCTAAACGCCAGGCGTATGCCGATGAAGGCGCACGAGGACATTCTGGTCTTTTACGACAGGTTGCCGAAGTATCATCCGATCAAGACGGACGGGCACAAGCGCAAGGTCGTGATGGCCGAACATCAGCGGAAATGCGATGCCGGCGAGATATACCGGAAGCACGATAATTTCCGGGATTACATATCCACGGAACGCTATCCGCGCAGTGTGCTGAAATTCAAGACGGACAAGCAGCTCTCCTGCCTGCACGCGACACAGAAACCTGTCGCCCTGCTGGAATACCTGATACGCACCTACACCGACGAAGGAGACATCGTCCTCGACTTTGCGATGGGCAGCGGCAGTACAGCCGTGGCCTGCCGGAATACGGGACGCCGGTTCATCGGCGTGGAGATAGACCTGGGAATTTTTCAGACAGCACTAAACCGTATAACCCATGGCTGACACCCGGGAAATCTGGGTGGATATCAAGAACTATGAAGGAAAGTACAAGATCAGCAACCTCGGGCGTGTCAAGAGTCTGGAACGGCAAGTTTCGCATGACGGTATCACCTGGACACAACCCGAACGTATCATGTGTCATTGGTGTGGGACGACTTCGCTCTATGACTGTGTCCGACTCTACAAGGGCGGTGTCGGAAAGAAGTTCTCCGTGCACCGTCTGGTGGCACAGCACTTCCTGCCCGACTGGAATCCGGGACTGGAGGTGAACCATATCGACGGGAACCGCGACAACAACCGTGCGGATAACCTGGAAATGTGTACGCACCAGCGGAATATGGAACATGCCATAGCGGGTGGCCTCAAACGGGATTACGGCGAGAAAAGCGTGAACGCCAAACTGACCAACGGGCAGGCGGAAGAGATACGGGTGAGGTATTCCTCCGGCCAGGCTTCCCAGAACAGCCTGGCAAAACAGTACGGTGTCAGCCGCCAGACGGTAAGCGCGATAATACGATACAAGAAATATATCAGATGAAAGTAACACATATAAGAATCAGGAAAGCAGACGGACCGCTGACGGTCATGGATGCCTTTGTGGACAAAGGACTGACAGAAGGCGGCCACGCTTCGCTGCCGGACATAGATGTCGACTATGCCTCCGACCGGCGGCAGGAGATCAAGGATTACCTGGAAGAACGGTACAACGCGGACGGCCGCCAGCGTGTCTTTTCCGCCGGCACCTTTACCACGATGAAACTGAAAGCCGCCTTGAAGGACGTGGCACGCGTACACCGCGTGCCTCACTCCATCGTGAACTATATTACCGCCATGATAGATGACGGCACGGACTGGACAGGACTGTTCAGACAGGCTGCCTTCAACAGAAAACTTCGGGACTTTATCCAGACCTATCCGCTGGTCATCGAGGACGTGCAAGGATTGCTCGGACAGCCCAAAGCGGCCTCCATACACGCCTCGGCCATCGTTGTCACACCGGACACGCGGGACGGCAGGCCCGCCGAATGCTTCGATTTCCTGCCGGTCCGTAAGATGGACGGGGCACTGGTATCGGAGTTCGACGGCTATTCGGTCGATGAGATCGGATTGTTGAAGGAGGATGTGCTGGCGACAAAGGAACTTGCCAAACTGAGTGCCGTCATCGCGTTGGTCAACCGGAATTTCGGGCAGGAACTTACTATCGGGCGTATTACGCAGGATATGCTGGAAGACGGGAAGACCTACCGACTGCTCTCGGACGGCAACACGCAGAATGTGTTCCAGTTCTCTTCGCCGGGCATTACCCGGTTTATCCAGGATGTACAGCCGGAGTGTATCGAGGACCTGATCGCCATCAATGCCCTGTACCGTCCCGCCACACTCGACATCGGTGCCACCGATGATTATGTCCGTTTCAGGCGTGGCGAAGTGGCCCCGGTCTATAACTACGGCTGTTATGAAGCAACGAAGAACACTTTCGGAATCATGGTCTACCAGGAGCAGTTCATGTCCGTTGCTCATACGCTCGGGGGATTCGACCTCGGGAAGACCGACTATCTGCGCAAGGCCATAGGAAAGAAGAAAGCCGACCTGATGGCCACGCTGAAGGCGGATTTCATTGCCGGAGCCGTCGGGAACGGATGCCCGGACTATGAGGCGGAGGAAATCTGGCACAAGATAGAGGTAGCCGGGAAATATTCGTTCAACCGTTCCCATGCCGCGGCATACGCCCTTACAGCCTATTGCGGGGCATGGCTCAAGGCCAATTACCCCTCGGCATTCTACACGGTGGCATTGCAATGGGCGGACGACAAGGAGATTCCTTCGTTGATGGCGGAGATGGAACGCTGCTCGTCGGCCAAGATCGTGCCGCCGGACATCAACCGCTCCGGGACGGAGTTCTTCACCGACTATGCCACCGATGAAATCTTCTGGTCGCTTACCCGTATCAAACAGGTCGGTGTCAAGACAGTGGAATACATCGTCACGGAACGCGACCGGGGCGGGGCATATACCGGTATCGAGAACTTCATCCACCGCATATTCCGTTACAAACTCAAGAAATACAGTTACTGGGACGACCCCGACAACGCGGAAGAGGCCGTGAAAGTCCCCGTGAACGCCCGGCATGTCAAGCACATGATCCTTGCCGGATGCTTCGACCGTATCGAAAAAGTCGGGGCGGTTACCGAACGCTGCGCGCTGCTCGAACGCGCAGCACGGGAACTGGGATTCTCCCTTTCTGAAAAAGACTTCCCGCAGGACATGCGCGGGCGGCATTTCTTCTGGTCGCAGCAGCAGATTGCCGTGTCGGGCATAGGCAGCATCGATTATCGCCGCATCTTCAACAATTCGGAAGCCCGCAGGCAGGTCAAGGGAAAAGCCTCTTACCTGACCCTGGACGAGGTGGCACGGGACGAGAACGACGGCAGGCGTGCGACTGTCTGCGCCACGGTGGTGGATGTAACCGAACATACCTACAAGGACAGGGAGACGGGAAGCCGGAAGCGTTTCGCCAAGCTGACACTCTCCCAGAACAACCGTCTGGCGGAATGTGTATGCTGGAACGACTATTACATGGAACACCACACCGTGATCCAGTCGCTCAAAGACCGGGTGGTCATCCTCACTGCCGTCATCCGTTACAGCGACTACAACGGATGCAATACATTACAGACCTATAGGAACTCATTGTTATTCATTCAATCCTGAAAAGATATGGCACCCAAGACAGAACAGAAGATATATGTAGGCATCGGGCTGGACTTCGAGACCGGAGGGCTGGACTGCCGTGAATGCGCCTGTACACAGATCGCCCTGCAAGCCGTCCGTTTCGACACATGGCAGGTATTCGACCGCTATCAGGCGTATATCGCCCCCTACGGGAAACAGGACGCGGGGCTTCCCCGACGCAAGGTGTTGCGTACCCGCCATGAACAGGCGAAAGAACCGGAATATGTCCCGATGAAGTACGAACAGACGGCATTGGACTATTCCGCCATTACCATGGAGATGCTGCGCACACAGGGTGTGGACATGAAGAAAGTCGCCGGAGAAGTCATCGCCTTCGCCAAACGCAGCACCCTCTCGAAGGGCTATCAGTGCAAGCCCGTGCTGGTCGGACAGAACATCGCTTTCGATATCGGATTCCTGCAACAGCTGATGAACTATGCCGGACTGGCCGCCGAGTTTGAAAAGACCTTTTCCGGTACCAAGGACTATTACGGCAACTTCCAGCCGCACTACATCGACACGCTTGTCATGGGACGGTTGGCTTTTGTCGCTGATCCGGAGGTTACTTCGTATAAACTGGAACTGGTAGCCTCAAAATTGGGGGTGGAACTGGACGATGCCCACGATGCGGCTGCGGATGTGACGGCCACGCTTGACATCCTGGGGGTCTATACCTCCCGTCTGCGCCAGACGGAAGGAGCGGCAATCGCTACACAGAAGAAAGAGAAAACCCGTAAATATTTCAAGATATGA
Protein sequences of DBSCAN-SWA_4 >CP032819|3755868:3764869|3757858_3758899_+|AZS30971.1|DBSCAN-SWA MELSVQEYQYLVSEITRETGAKRDGSGKNLIVPRCPFCGKSGGKFGIYIGKATAHRRPFMAHCFSCGASTRTLEQLLAAIGRMDLMVLQTADIAAPLNLHLLEKDEAEEIDDELVPVELPDFYKRTFRHPYLQRRGFCFDDYEYFPVGITGRLNPRYADYVVFPVIDDCTVVGYVSRHTWPKEDIDAHNRKTRHSGGYKILRYRNSTENDFSRLLYNYDAVRMDETDTVILAEGIFDVIALTRRLELYDNSHVAAVATFGKKISDVQIYKLQSKGVRTVVIGYDGDAVESVKRTAERLKPYFEVFIADIADAAKDWDELTEAEIYGIFACRLLSVLEYKLKKVQER >CP032819|3755868:3764869|3764101_3764869_+|AZS30976.1|DBSCAN-SWA MAPKTEQKIYVGIGLDFETGGLDCRECACTQIALQAVRFDTWQVFDRYQAYIAPYGKQDAGLPRRKVLRTRHEQAKEPEYVPMKYEQTALDYSAITMEMLRTQGVDMKKVAGEVIAFAKRSTLSKGYQCKPVLVGQNIAFDIGFLQQLMNYAGLAAEFEKTFSGTKDYYGNFQPHYIDTLVMGRLAFVADPEVTSYKLELVASKLGVELDDAHDAAADVTATLDILGVYTSRLRQTEGAAIATQKKEKTRKYFKI >CP032819|3755868:3764869|3755868_3757224_+|AZS30969.1|DBSCAN-SWA MSAVNPLSAEFLYELYATALRQEPLCAVLARHMRKEYLPDRSFQQVQEAIAVHFKTYKTPPSYAVLAQTFHEDYDAIELIDTFREYDEGQSAEVMTDMLESYIKGVRLQSVYAEVGKLYNENKQDKAEKTLREYAEWLAGFTLKSSSFVDVAATFKERFERNRRREEEEERSAAPRVSRFYIPYLDALNAGRNLRGQLTCFLASTGVGKSHIAKWIGVRADIDDGLHVLHFQLEGSEEEALNAYSGGLISRNAYYYERGKISDTEMRHLEKLVASYAGSITVRSYPRFNAQVSTLDIKNGISEYRKLKGYNPDIVIVDSMDLLTDANRRSWGADHERAKRIAVANDLKDLAADEKVWMVVTYQSTIEDREWLNDERNVLTEYNCSEAKGLARPCTHLVSLNQSSAERKENVMRLHVAKSRFFKKGDTIRIATDYDNEVFYDGQRTLNLNRE >CP032819|3755868:3764869|3758895_3760653_+|AZS30972.1|DBSCAN-SWA MIQELFSWLDAQRITYIPVDTEVVDIPGFGRLFTADLSGVESIFRGDGDKLVFNLMESPDVLMEEGIFHVAFPFGRNWYYYDLREEFRFNLLKYIGRPKPPVHDVPFVNLGIHTSYELLNACCSPEDLCRKAKWLGHTAVGICDRNTMAATLNLQKECANTGLKHIFGYSLTMMHEEERVGLKIYALDNEGLHNLLRIQRAVMVDSEDNTLRYEQLLMYAAGCVVVFAIRSVYWMAGHPKQVKRIRKGAEAVYYQVDANEYKADRIDREQLEALKYYFGNCYDADTDSFTVEPVLIPDCYYMDKDDAGYRIVVNKIATGAAHEQSDDQYFKTADELYDTLRPLFSGQWDFDSLFRRMCRPTVEIAGRADASFETGRMFMPEYRMRPEERERYGDRRTMFLRLLDDGLDRKVPEPERERYRERLDEEVYIIESTDNVDYFLVQWDMVREAHRRGIATGIGRGSAGGSLVSYLLGITSIDPLKYDLIFSRFLVPERCGLSWKDELTVLAPDITLGKGERYVEMESEGKTYRLCTDARMRVIRNGEERTIYADELMCGDEILFDRRDCLWNLKELETHESDLRTPPSL >CP032819|3755868:3764869|3761917_3764095_+|AZS30975.1|DBSCAN-SWA MKVTHIRIRKADGPLTVMDAFVDKGLTEGGHASLPDIDVDYASDRRQEIKDYLEERYNADGRQRVFSAGTFTTMKLKAALKDVARVHRVPHSIVNYITAMIDDGTDWTGLFRQAAFNRKLRDFIQTYPLVIEDVQGLLGQPKAASIHASAIVVTPDTRDGRPAECFDFLPVRKMDGALVSEFDGYSVDEIGLLKEDVLATKELAKLSAVIALVNRNFGQELTIGRITQDMLEDGKTYRLLSDGNTQNVFQFSSPGITRFIQDVQPECIEDLIAINALYRPATLDIGATDDYVRFRRGEVAPVYNYGCYEATKNTFGIMVYQEQFMSVAHTLGGFDLGKTDYLRKAIGKKKADLMATLKADFIAGAVGNGCPDYEAEEIWHKIEVAGKYSFNRSHAAAYALTAYCGAWLKANYPSAFYTVALQWADDKEIPSLMAEMERCSSAKIVPPDINRSGTEFFTDYATDEIFWSLTRIKQVGVKTVEYIVTERDRGGAYTGIENFIHRIFRYKLKKYSYWDDPDNAEEAVKVPVNARHVKHMILAGCFDRIEKVGAVTERCALLERAARELGFSLSEKDFPQDMRGRHFFWSQQQIAVSGIGSIDYRRIFNNSEARRQVKGKASYLTLDEVARDENDGRRATVCATVVDVTEHTYKDRETGSRKRFAKLTLSQNNRLAECVCWNDYYMEHHTVIQSLKDRVVILTAVIRYSDYNGCNTLQTYRNSLLFIQS >CP032819|3755868:3764869|3760618_3761386_+|AZS30973.1|DBSCAN-SWA MNPTYEHHRPYDGCDLYRGDALDVLPLLAEEGITADMVLSDPPYGTTHCRWDAVIDIPGMWNAVQGISRPDTPVLLFCQHPFTSLLGSSNLRRLRYAWVWEKTQATGFLNARRMPMKAHEDILVFYDRLPKYHPIKTDGHKRKVVMAEHQRKCDAGEIYRKHDNFRDYISTERYPRSVLKFKTDKQLSCLHATQKPVALLEYLIRTYTDEGDIVLDFAMGSGSTAVACRNTGRRFIGVEIDLGIFQTALNRITHG >CP032819|3755868:3764869|3761378_3761921_+|AZS30974.1|DBSCAN-SWA MADTREIWVDIKNYEGKYKISNLGRVKSLERQVSHDGITWTQPERIMCHWCGTTSLYDCVRLYKGGVGKKFSVHRLVAQHFLPDWNPGLEVNHIDGNRDNNRADNLEMCTHQRNMEHAIAGGLKRDYGEKSVNAKLTNGQAEEIRVRYSSGQASQNSLAKQYGVSRQTVSAIIRYKKYIR >CP032819|3755868:3764869|3757240_3757603_+|AZS30970.1|DBSCAN-SWA MRRVGGFLTCNDWNVENIWQLTCVKPIPDRYDFYCMPFVFRHIVRSPCYQQQVFTFQFLCRLQDFQFPLTGLGVRDIIRPLFEESSNDLANTNPLFVVIGKRYFSFSMFIVLHKKYAFFM |
8 | unidentified_phage(71.43%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP032819.1|AZS29041.1|1226065_1226488_+|PcfK-like-protein |
1226065_1226488_+
Protein sequences of CP032819.1|AZS29041.1|1226065_1226488_+|PcfK-like-protein>CP032819.1|AZS29041.1|1226065_1226488_+|PcfK-like-protein MKGTDHFKRTIQMYLEQRAAEDALFAKNYRNPAKNIDDCVTYILNYVQRSGCNGFTDGEIFGQAVHYYDENEIEVGKPIQCQVAVNHVVELTAEEKAEARQNAVRRYQEEELRKLQNRSKPRTATKATAQEVQQPNLFNF |
140 aa aa |
40
gnl|BL_ORD_ID|40 information
|
NA | NA | No | NA | ||||||||
CP032819.1|AZS29109.1|1296839_1297262_+|PcfK-like-protein |
1296839_1297262_+
Protein sequences of CP032819.1|AZS29109.1|1296839_1297262_+|PcfK-like-protein>CP032819.1|AZS29109.1|1296839_1297262_+|PcfK-like-protein MNGTDHFKRTIQAYLDSRAAEDKLFAASYSKPNKNMDDCITYLLHWAKSQCNGGNGIGVTAGEVLSQAVHYFDEDDIDIGKPIPCQVMVCGVELTDEEKAEARQRAIRQYQDEELRKMQNRNKAKANQKTNAQQVELSLF |
140 aa aa |
40
gnl|BL_ORD_ID|40 information
|
NA | NA | No | NA | ||||||||
CP032819.1|AZS29180.1|1366583_1367039_+|PcfK-like-protein |
1366583_1367039_+
Protein sequences of CP032819.1|AZS29180.1|1366583_1367039_+|PcfK-like-protein>CP032819.1|AZS29180.1|1366583_1367039_+|PcfK-like-protein MLNTGWLAAPQYQKIMAQGTDYFKLTIQNYLDARAREDELFAPRYANPKKNIDDCCTFIINQVRQSGCNGFADEEIYSMALHYYDEEDIDIGKPVSCKVVVNHTVELTEEEKAEARRNAIRKAESEAYAKLAKAKSKPKKIEDNKLMPSLF |
151 aa aa |
40
gnl|BL_ORD_ID|40 information
|
NA | NA | No | NA |