Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_013721 | Gardnerella vaginalis 409-05, complete sequence | 3 crisprs | DEDDh,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,WYL | 4 | 1 | 3 | 1 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_013721_1 | 172791-172864 | Orphan |
NA
Consensus repeat of NC_013721_1
|
1 spacers
spacers of NC_013721_1
>1.1|172814|28|NC_013721|CRISPRCasFinder CGAATGTTTTTTGTGTGCTATGCCTTCA |
CRISPR arrays and Neighbor proteins around NC_013721_1
The CRISPR arrays of NC_013721_1 >merge|NC_013721|1|172791-172864|CRISPRCasFinder TTTTGCCAAAATGTCGGTATAGACGAATGTTTTTTGTGTGCTATGCCTTCATTTTGCCAAAATGTAGGTATAGA >NC_013721|1|1|172791-172864|CRISPRCasFinder TTTTGCCAAAATGTCGGTATAGA CGAATGTTTTTTGTGTGCTATGCCTTCA TTTTGCCAAAATGTAGGTATAGA
>NC_013721.1|WP_012913758.1|171993_172713_+|toxin-Fic METVFDPYLIPHTRVLKNKSNINNQIELDKYENDAVLTRCSILYENLPHAEGTVKQLQWIHHYLFQDVYDWAGQIRTIDMTKGGGEPFHPLEYMGVGIRYCEQTLKNDNLLQGLSIDEFISKLSVNYNNFNVLHPFREGNGRTQRVFWDIVARDAGYHFDWGLITQRVNDEASIQAKDANDTKLLEDMFHIITKPLNVELLAQQQFAHLVEEEYEYAPNVASILQDKDYAAYKTRYGID >NC_013721.1|WP_012913757.1|171789_171975_+|antitoxin-VbhA-family-protein MIDKETQRRRQENLGLAIASGKLEGNSPSKEFLYDANQYANGLISSNEFVARMRSRYGVMD >NC_013721.1|WP_033888553.1|171302_171539_-|hypothetical-protein MQSCILKSKDSQNLSKKKEPSHTNANRHSLSKVKKFYSFESESANLLANTRHDSYDIEGIRVIDIAQWLLETNTSFRL >NC_013721.1|WP_033888552.1|170735_171050_+|F0F1-ATP-synthase-subunit-epsilon MSEKQVKSLHVSVVAARHPVWEGDAKFVVIPSVNGAMGVLPGHEAVLALIDHGFVKVDDLKGVRHVFKVTDGFYSVDSDNITIAVEHSCNVDKDGNVLPNAKVI >NC_013721.1|WP_012913756.1|169245_170733_+|F0F1-ATP-synthase-subunit-beta MAENQTTAPPEAEQEVDPTAGRVTRVQGSVIDVEFPVGYLPDIYNALKVDINTVGNTEGDTVHEITLEVEQHLGDSTVRAVALKPTDGLVRGALVRDTGGPISVPVGDVTKGHVFDVTGNILNAKPGEHIEITERWPIHRNPPAFDQLESKTQMFETGIKVIDLLTPYVQGGKIGLFGGAGVGKTVLIQEMIQRVAQNHGGVSVFAGVGERTREGNDLIGEMADAGVLEKTALVFGQMDEPPGTRLRVPLTALTMAEYFRDVQNQDVLLFIDNIFRFTQAGSEVSTLLGRMPSAVGYQPNLADEMGALQERITSTRGHSITSLQAIYVPADDYTDPAPATTFTHLDATTELSRDIAAKGIYPAVDPLTSTSRILDPRYVGQAHYDCANRVKAILQRNKELQDIIALIGIDELGEEDKTTVSRARKIEQFLGQNFYVAEKFTGRPGSYVPADETIEAFTRICDGVYDEIPEQAFSGIGGIDDLERRWHDMQQEFGA >NC_013721.1|WP_004574170.1|168234_169239_+|F0F1-ATP-synthase-subunit-gamma MSSQLALKSRIISTQSLGKIFKAQEMIASSHIAKARDIALNAKPYSDAIFDAVQALVAHHEANIKHPIFGKEHAGNRVAVLALTSDRGMAGAFTSSIIRETEALLAKLDAEGKHAELYVYGRRGATYYKYRNRDVAGTWEGSTDHPDVTIAEKISDTLMDAYMTPEAEGGVSELYIVFTEFINMVRQKVRTLRMLPVELVPLKDGEEFSMHRPDYEAASGPSAALSSTSAVSAREEAVSRSNPEAVEYCFEPSPDEVLDAVLPKYIQSRIHECLLTSAASETASRQNAMHTATDNARNLVDDLTRKLNASRQASITQELTEIIGSADALNNEED >NC_013721.1|WP_012913755.1|166569_168231_+|F0F1-ATP-synthase-subunit-alpha MAELSIDPALIRKALDGFVDSYKPTDMPTQEVGYVVTAGDGIAHVAGLSGCMANELLTFENDTLGLAFNLDAHEIGVVILGDFTGIEEGQEVYRTGEVLSVPVGDAYLGRTVDPLGNPIDGLGAIECSERRILEAQAPDVIHRHPVDEPLSTGLKAIDAMTPIGRGQRQLIIGDRQTGKTAIAIDTIINQRANWETGDPKKQVRCIYVAIGQKGSTIASVRQSLEEAGAMKYTTIVASPASDSAGFKYIAPYTGSAIGQHWMYNGKHVLIVFDDLSKQAEAYRSISLLLRRPPGREAYPGDVFYLHSRLLERCAKLSDDLGGGSMTGLPIIETKANDVSAYIPTNVISITDGQIFLQSDLFNAGQRPAVDVGISVSRVGGAAQAKALKKVSGMLKISLARYKSLESFAMFASDLDAASKAQLTRGARLTELLKQKQFSPRAMEQEVVSVWAGTHGKLDDIPVKDVLRFESELLDYLDKGTDILQVIRDTEDFTKETEAKLDAAIDDFRRTFKTSAGKPLIVKDSLPPAENPAPVEKEQLVAKPKADSREPEGK >NC_013721.1|WP_004574172.1|165971_166553_+|ATP-synthase-F1-subunit-delta MPEKASRANDSLSRSSLAQAISDAHDEAQRISDELSSMMDMIDKYPELSDAITDPNRSSEDKSRLIDELIGGKAHPVVLRIMHYLVGTWHGGAESGKTGSFGGAWSGFEREAGETLVTVTTAQPLTDKQIKRLIEIYSNKLGHRAYINPIVDPNVLGGMRIQIGDQITDRTMLMQLKQLQRSARNGAWTQAVK >NC_013721.1|WP_012913754.1|165329_165872_+|F0F1-ATP-synthase-subunit-B MAYAAETNKLALFLPEPYDVIWSLVILVVLAAFFYKFVMPKFQAILDERAEKIEGGMAKAANVQREADELKSQIENELSQAQTDAAKTREEARAEASKIIGEARQRAEKDAAKIISEAQHSIEAQHKHAMSSLQGEVSVLAAALAGKILASKLDDDTVSSKIIDHVIDEVGDTKNSDQSK >NC_013721.1|WP_004105752.1|165035_165269_+|ATP-synthase-F0-subunit-C MNSIIALAGLTGNLSILGFAVATLSPAIGMAMVVSKAMESTARQPEVGNRIQLFMFIGLAFIEVLGLLGFVAYIMGS >NC_013721.1|WP_012913759.1|173007_174258_+|Fic-family-protein MDYKTIRKIVAMSKSNAKPCDIAEEEYCLRIDNPVTYRSGIVFDAHEIFAMCVTDLVTYMNYIANTEKAVEKAWNELPRFAQREYLMQLIAKEVQNTNIIEGVHSTRKELSKALDAANEQNNHTRFSEFTKLFLELADSNECDTSIPQTLQDIRKIYDSIMQGELEANNIPDGEIFRSGKVSIRDGANNVVHDGDATEAEIQDHLTQMLSLMNSSEVPTLIKACMCHYAFESVHPFYDGNGRTGRFLLALQLHEHLSMPTILSLSSIIYAEKSGYYAAFSKAQELFNCNDLTMFCCTMLEYIVKAQGEVFENISVQLKHIIYCMGKLRELFSEEEISKVQREVLSILLQNKLFSYNPTPMTRKQLSEYVGNAIAEQVGERKIVSALNGLIELGMVDSIGERPIRYELSKKANEMFA >NC_013721.1|WP_012913760.1|174288_175338_+|thiamine-biosynthesis-lipoprotein,-ApbE-family MFGNVMAIERALGTGIIISSSVPISQRVQNRIRDFIEEYESVLSRFRADSLVSRMACAEHGGEFEFPTWAQPLFAIYSEFYDATHGAFDACIGADLLALGYNNSVQFVPESAASAGKNDNSSSYSCSNYRRALPVKWADISRDDGGATLHTNKPVQLDFGAAGKGYFVDLVMQIIKEEFSDDSTANNYFPSDFDFLVNAGGDMRACFSKKNSQIKVALENPFDTTQAVGVASIASGALCASSAARRRWKVKDTNCLAADAFESNVVATHLINALDGVPSQKLSASWTYVPAKTCAFPTAYADALATALFISQESDLQKIAQTTGAEFAVMQPNHALRKTCAFPARFFAE >NC_013721.1|WP_041160458.1|175666_177076_-|HAMP-domain-containing-histidine-kinase MKSIPKLIQRFISIFLLSSVLIVLMNIIAFVVLIGNYAPDKEISPYSIAKETGEALQLSASGDYALSKNMSSKLTNSGAWAILIDNNTLKVVWKTENVPADIPNNYTLSDIANLSVGYIDGYPTYTGKNKDGVVVVGFPHNSFWKHTRPSWNYSLILNFPQIVLSVLFINILLILGIYLIANIKLLKSINPITKGIQRLSSGESVHIPETGALSEISSNINSASDILQIQKEQLRKRETARANWIAGVSHDIRTPLSMVMGYAGQLENSLHILEEDRKKATMIIKQSVRMKNLINDLNLASKLEYNMQPIMKKEENVVAVVRQVVVDFMNMGMQDEFSIKWETDAELTVCNINVDKDLLKRAISNLIQNSISHNENGCTIYVSVAANNNNCIICVEDNGIGVSDEKIEKLNHAPHYMICDTSTAEQRHGLGLLIVQQIIGAHNGTVDISRSKYGGFKVELTVPSKKFEL >NC_013721.1|WP_012913762.1|177082_177784_-|response-regulator-transcription-factor MNLNDYLLNKHLLLVDDEQGLLDMVVSILNEYGFYNITTAKSIKDAIEATQKLRPELAILDVMLPDGNGFELMKQIKQYSDCPILFLTACGEDEDKFKGFGLGADDYIVKPFLPKELTFRIMAILRRSYKSENPIVKLKNSQIDFSSAQVIKNNEHIQLTAKEYDLLSALYRNAGCIVTIDALCEATWGANPFGYENSLMAHIRRIREKIELNPSQPVSLITVKGLGYKLIVE >NC_013721.1|WP_012913763.1|177923_178688_+|ABC-transporter-ATP-binding-protein MNNIVTTEHLTKKYKSFIAVNDVSLHIRKGSIYGFLGPNGAGKSTTMKMLLGLTAPTKGVFTIDGKQFPADRIPILKEIGSFIESPSFYANLTGRENLDIIRRILELPKSAVDDALELVGLSEFGDRLAKKYSLGMKQRLGLAGALLGRPPILILDEPTNGLDPSGIHEIRNLIKSLPDLYDCTILISSHMLSEIELIADDIGILNHGHLLFEGSLDELRHHALQSGFASDNLEDMFLSMIDEDNKIRKQSARL >NC_013721.1|WP_012913764.1|178684_179410_+|ABC-transporter-permease-subunit MKTLVIELKKCKRTGFIALMIAIGVMGAAYAFVNFIVRKNTLLNLPLAPMDILLTQLYGMIMVLNMFGIIVAACIIYNMEFKGNAVKKMHMLPMSVPTMYLCKFMLLTVLLLIAICIQNLALAKIGVTDLPQGTFNVSTLISFAAYSFITSMPVLSFMIFVSSRFENMWVTLGIGVAGFLSGMALASSDIVLFMASPFIVMLKPAVAMSAQPNTTVVIVSIVETILFLCSGLWAAKKLHYE >NC_013721.1|WP_012913765.1|179423_180188_+|ABC-transporter-permease MRFFELLKIEFTKVKRSKIVPLIFIAPLIVVGSGVASLHRYFTPEYTHAWSAMFIQSALVYSYYLLPLSMIVICVMIAGRETSNNGILKMLALPVSRYAISAAKFCVLLFYLLMEMVVFFAVFVIAGLFATSSTGVTETLPIMYLLKWCLDLFLTMIPSVIVMWAITVLFEKSMLSVGLNILLVIPGVLIANTPLWIVYPYCYSGYMVSRSLHAVTSTGLGNGFNLIPFLPCAIAISVFVFLIAITQFGKKEMR >NC_013721.1|WP_080514457.1|180447_180897_+|hypothetical-protein MMVDASPQLRSKKKLIEGFIAQLNENELDKLADSGGNITVYDSNGNKVSIVDCWSEYVEQKYNHDLAQLVQSEGLNDALTRKFMEKSFSVGEVSELGTDINDLMPRMSRFGAAAKARAEKKSRIVESMRNMFNEFVGLIGFSQYDSQKD >NC_013721.1|WP_012913767.1|181138_182857_+|FTR1-family-iron-permease MRACVKVRRIIRDSAALILAVCMLVGCIVSSFVTTIPAFATDADRSYTSWTEVSKAIDNELLQGQSEYNSGNNSGAATRFEAAYNSVYVASNMITVVRDAIGQNKVQAQTDQFQQLQTLVYQQNQGSQISAISTALAADVAQTASSLDANLKVDKPNVYAQKLRAQIKAERKKLDAAKKKNLGRNGRTWGQVAREMNVILDKSVATYKFAKGNKTQVASAVDLINEAYYQYYEKLGFEKNVMNAISGSRVSTVEYQFKECRQTMNNGGTIEQAKKFVTDLKSMLIEDAAKLDKGASSSANPFMQFITSSFGQAFVILLREGLEALLVVAAIIAYLVKSGHKNMVKYIYLGIAAGIVASLAVAALFGLLFNGSGPQQEITEGVVALFAMLMLLYTSNWMISRSSVQAWNKYISEQTTAAVSKGSLVSLALLSFLAVFREGAETVIFYQAIFAVSNGADSMIWGGFISAAAVLVVIFLLIRFASVHIPIRPFFTGTSALMSVLVVIFAGGGVHALIEGDALEGMYIQGLPTNDWLGFYPYVETIVAQIVAAIVVISLLCVSIARSRVKRAVENK >NC_013721.1|WP_012913768.1|182952_183597_+|amino-acid-ABC-transporter-substrate-binding-protein MKNNKMMAIAALALAGLLALAGCGSNGNSAKTDAKSQATEQKADDKADDKGGDKGFEEVPVGPHQDQNIGPLTIGAVYFQPVDMVPAGMGLKASEASFHLEADIHANQKGTKLGYGKGEFIPDLTVNYEIVDKASGESVGKGTFMQMNASDGPHYGANVKLDKAGNYKLVLSIESPEKKGWMLHVDPATGVTGRFWTEPIKATFDDWKYTPRQW |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_013721_2 | 870159-870613 | TypeI-E |
I-C,I-E,II-B
Consensus repeat of NC_013721_2
|
7 spacers
spacers of NC_013721_2
>2.1|870187|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT TAAAAGTATGATGCATTGAAGTAACATCTAGTG >2.2|870248|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT CAAACGTGTCCATGCTTACGCCGCAAGTTTTAA >2.3|870309|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT GGTGACAGGATTGTTTTTACTGACGGTCAAGGC >2.4|870370|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT CGGCTCAGCCGTTAAAACTAGATCACACTTGTA >2.5|870431|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT CGACCTTCAGCAGTTGCCGCAGTTCCACGCAAA >2.6|870492|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT TACTTTTGGCTGGCTTGTCAACAGGAACATGGG >2.7|870553|33|NC_013721|CRISPRCasFinder,CRT CAAGCTCACTAATAATGGCGAAACGCAGGGTAT |
DEDDh,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around NC_013721_2
The CRISPR arrays of NC_013721_2 >merge|NC_013721|2|870159-870613|PILER-CR,CRISPRCasFinder,CRT GTCTTTCCCGCATACGCGGGGGTGATCCTAAAAGTATGATGCATTGAAGTAACATCTAGTGGTCTTTCCCGCATACGCGGGGGTGATCCCAAACGTGTCCATGCTTACGCCGCAAGTTTTAAGTCTTTCCCGCATACGCGGGGGTGATCCGGTGACAGGATTGTTTTTACTGACGGTCAAGGCGTCTTTCCCGCATACGCGGGGGTGATCCCGGCTCAGCCGTTAAAACTAGATCACACTTGTAGTCTTTCCCGCATACGCGGGGGTGATCCCGACCTTCAGCAGTTGCCGCAGTTCCACGCAAAGTCTTTCCCGCATACGCGGGGGTGATCCTACTTTTGGCTGGCTTGTCAACAGGAACATGGGGTCTTTCCCGCATACGCGGGGGTGATCCCAAGCTCACTAATAATGGCGAAACGCAGGGTATGTCTTTCCCGCATACGCGGGGCTCTGCG >NC_013721|2|1|870159-870552|PILER-CR GTCTTTCCCGCATACGCGGGGGTGATCC TAAAAGTATGATGCATTGAAGTAACATCTAGTG GTCTTTCCCGCATACGCGGGGGTGATCC CAAACGTGTCCATGCTTACGCCGCAAGTTTTAA GTCTTTCCCGCATACGCGGGGGTGATCC GGTGACAGGATTGTTTTTACTGACGGTCAAGGC GTCTTTCCCGCATACGCGGGGGTGATCC CGGCTCAGCCGTTAAAACTAGATCACACTTGTA GTCTTTCCCGCATACGCGGGGGTGATCC CGACCTTCAGCAGTTGCCGCAGTTCCACGCAAA GTCTTTCCCGCATACGCGGGGGTGATCC TACTTTTGGCTGGCTTGTCAACAGGAACATGGG GTCTTTCCCGCATACGCGGGGGTGATCC >NC_013721|2|2|870159-870613|CRISPRCasFinder GTCTTTCCCGCATACGCGGGGGTGATCC TAAAAGTATGATGCATTGAAGTAACATCTAGTG GTCTTTCCCGCATACGCGGGGGTGATCC CAAACGTGTCCATGCTTACGCCGCAAGTTTTAA GTCTTTCCCGCATACGCGGGGGTGATCC GGTGACAGGATTGTTTTTACTGACGGTCAAGGC GTCTTTCCCGCATACGCGGGGGTGATCC CGGCTCAGCCGTTAAAACTAGATCACACTTGTA GTCTTTCCCGCATACGCGGGGGTGATCC CGACCTTCAGCAGTTGCCGCAGTTCCACGCAAA GTCTTTCCCGCATACGCGGGGGTGATCC TACTTTTGGCTGGCTTGTCAACAGGAACATGGG GTCTTTCCCGCATACGCGGGGGTGATCC CAAGCTCACTAATAATGGCGAAACGCAGGGTAT GTCTTTCCCGCATACGCGGGGCTCTGCG >NC_013721|2|1|870159-870613|CRT GTCTTTCCCGCATACGCGGGGGTGATCC TAAAAGTATGATGCATTGAAGTAACATCTAGTG GTCTTTCCCGCATACGCGGGGGTGATCC CAAACGTGTCCATGCTTACGCCGCAAGTTTTAA GTCTTTCCCGCATACGCGGGGGTGATCC GGTGACAGGATTGTTTTTACTGACGGTCAAGGC GTCTTTCCCGCATACGCGGGGGTGATCC CGGCTCAGCCGTTAAAACTAGATCACACTTGTA GTCTTTCCCGCATACGCGGGGGTGATCC CGACCTTCAGCAGTTGCCGCAGTTCCACGCAAA GTCTTTCCCGCATACGCGGGGGTGATCC TACTTTTGGCTGGCTTGTCAACAGGAACATGGG GTCTTTCCCGCATACGCGGGGGTGATCC CAAGCTCACTAATAATGGCGAAACGCAGGGTAT GTCTTTCCCGCATACGCGGGGCTCTGCG
>NC_013721.1|WP_012914177.1|868976_870056_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MPLTVITMTNCPLSLRGDLTKWMQEIASGVYVGNFNSRVREELWKRIEDSVGNGAVTMSFSSRNEIGYDFKTIHSHREVVYSDGLPLVRIPTVDTLENDIKHGFSDAAHFHNAKKFSNIRRKNINTPSSTIENNLANENISYLSYDNASQTAKFATDNEEITCTDYWKDFIDKIDKNFVVIDCETTGLNSKVDKIIEIGAVKFIDGKIEEFQKLININCDSVSNFNKNNFIPNEIIELTGITSNMLESYGVRLDLALKEFINFVERLPVLGYNVQFDMSFLNSALSSLDERFDHSLSINKIFDISRFVKKEKKFLKNYQLKTVLHEYNIAESVPHRALLDARLTAHLVFNLTELLKFLA >NC_013721.1|WP_012914176.1|868033_868975_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MKKKFGAKKAEIPEFPRISDRVSFIYVEHAKINRLDSAVTVFDANGTIRVPAAMIGVLLLGPGTEITHRAMELLGDVGASIVWVGEHGVRNYAHGRALSRSSRLLEKQSKLVTNSRSRLNVARKMYQMRFPNENVSSYTLQQLRGREGARVRHLYREMSNKYNVQWNGRDYKVNDFESGTVVNKALSVGNVCLYGLVHSIISALGLAPGLGFVHTGHDLSLVYDIADLYKAELTIPASFEIAARCESDDDIEQLMRLKMRDCFANCNIMSRIVNDIQNLLEIPIDDQITVDVIHLWDDKELLVASGVNYSEVN >NC_013721.1|WP_012914175.1|867350_867998_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MSYLSRVEIDYKKPSSLRDLKSVGAFHNWVEQSFPDEWENHERSRKLWRVDVLHGKHYLLIVSDSKPDLQRLEMYGVSGTASSKTYDKFLGSLMNGMRMQFRVTLNPVVSISDNAETHTARGRVVPHVTYDQQMNFLLNRAQKLGFSLNENEFAIVERGYSLFTKSEKPIRLSKAVYQGILTISDADIMRKTLLEGIGKKKAYGFGMMTVIPLDN >NC_013721.1|WP_012914174.1|866474_867350_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MKSLLLKFSGPLQSWGTDSHFETRHTDYYPSKSAVIGMIAAAFGYRRSTDCDENIAKLNDLDFAVRIDQQGNLLRDYHIAAKYKANGDFEKNYVTNRYYLEDAIFLVAIGSNNEQLIYDISNALRSPYFQSSLGRRSLPPTADFILGVEDCGVIQALLTHEWLANKWSKKRFNTSKLSEASESSNESCNEVFISLYADSVLVDKAASELVKDDNTTYKIQESVRRLRKDYVNSFSNKERKFGFRYESKMEIPIEKISHNNLDSEDISSDSKHLKSVGEFIAEHDAFSGLES >NC_013721.1|WP_012914173.1|865327_866416_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MNSRLFLDIQAIQSVPPCNINRDDAGSPKTAQYGGVTRARVSSQCWKHSMREYFKEHSGDSNVGMRSKNIVKYVADKIITLKPELSEQEALDLANKTLNNAGFKTKTDKGKIIPVVNVLFFLGENQANSLAQAAINNVTDKKQLEEILKDNPPIDIALFGRMLADNPSLNEDASSQVAHAISTHAVRAEFDYYTAVDDLSVDDNAGAGMLGTIEYNSSTLYRYANVAIHEFSHQLSDNKESTINALKLFIEAFANAMPTGKVNTFANQTLPQMLVVTLREDRPVNLVSAFEDPVKAKDGYVSKSIEKLSQEYEKVQKFVHKPLASFYVTMDSSNKEIKLGVEEQSMQQLLDDFSSKVSELLQ >NC_013721.1|WP_012914172.1|864625_865240_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MAENQNKSVEVSNAVRKIINQFIEIADTSSYRAMLAQLRHTIGKPLSQSVEIWPWILNNVPESFMDKYGEVSYQLRAVINTLQIYALYEQGVAEKSSYKDLNKSEDSVNEKSYNNMGTALRTLRSDEGVRGAMDRRFSVMITASDYESFYYYLRQLVRLLKSRTKENQQTIDYSKLARDLYLLQFEDSEKVRLSWAQEYYRFTK >NC_013721.1|WP_012914171.1|862933_864604_+|type-I-E-CRISPR-associated-protein-Cse1/CasA MSRFNLLDEPWISVIVDEKGHNKLVSITDVFKHASEYKALAGDMKTQDFALLRILLAVLHTVFSRYDIQGNSREFDSNEDNEYYFNKETMNIWREVWNSKEFPDAVFKYLEQWHDRFYLFDDKYPFLQVLKQDIDSKKLGGKSPSEISGKNINRLISESNNKVAVFSPKDNVDNNKSSLNEAQLARWIITLQSYAGLADKTFFGTGKYKASKGWLFDLGGIYVEGENLFETLMLNCVLVGEMQSPEKRQKPCWEYSGAENIENSFYETFIDNISQLYTRWSRAIYINPDISIENPISVSIVKLPDINHKDAFIEPMTVWQYNKERENKDKYTPRKHKVEESMWRSFGLLTLQESDDGILKNHKPGIMEWLNKISKDIEGSSISLQAVSMKDDGNATSWVPTDEICDTLHIDEVVVTDNSDNGWVGRINNEVEYTRSAIGFIYRQFLLDICEIRNRNKDDATKYADKCISHVYFLVDQPFRQWLANIKPKDSMNERCAQWRNTLHNILINEAKGMLENATLRDFTGRPVVQSEKESTKNIVTAYSIFTSRLKKLSKK >NC_013721.1|WP_012914170.1|860090_862937_+|CRISPR-associated-helicase/endonuclease-Cas3 MSCNHVVNSALWGKKREANGVMQWLPLAQHLEDTRNVIGQLWEHWLSGGQRRLIESSLSKRVDAKKLSQFLGCVHDIGKATPVFQFRKSPSNSKDLDIALKNKLATVGFTNIDYFIDTTEGSHNSHHTITGQFILSNAGVPEGICAIVGAHHGKPLNNDSVCRSNKSKYPDHYYQSETESENSKLWKKLQNDILDWALERNDFSNVNDLPEISEPAQVLLCGLVIMADWIASNEHYFPLISIEQDLIENQEERYRKGWENWLQHGSKDVWESLNCCSNVSQTYKYRFGFFPNNIQIALHDVISQSKEPGIFILESAMGSGKTEASLIAAEQLANLTGRSGVFFGLPTQATSNGMFRRVEDWLKNVNSDFQGEIGLRLVHGKAELNADYAHLQHGMQNMNDGCESTSNSNDVNNNGVILNDWFTGRKTAMLDDFVVGTVDQFLLASLKQKHLMLRHLGLSKKVVIIDEVHAYDAYMNKYLEESLIWMAAYGVPVVLLSATLPAKRRKELIKAYMCGLFGFNWRECDKSNVDFETNNYPLITYSDKNCVKQKFIENDASDNKSVSVRKITDDNLHESLVGELKSLLNNGGIAGIIVNTVKRAQEIYNACVDEFSDDEVIVIHSQFIATDRVRKEQQICNMIGKNAHRPARAIIIGTQVLEQSLDIDFDVLFTDLAPIDLLLQRAGRLHRHTIERSETFAEPILYVLGTSDRYEFDKGSESIYSKYLLMRTQYYLPNVINMSHDISRLVQIVYGDNPLELQEDLKDVYAVAKREHDSVRNSNESAAKTYRIENPESEIGEKSIVGLLTNSITNESDEFACAQVRNSGESIEVIAVKRVGSGYGTLHDCKDISQNIDDVEVAMKLAQETVGLPWMFTLNSDRVDETIAELERIRKQNQFKNWDNQPWLRGSLVLLFDENNICELSKYRVVYSEKSGIVCIKSSEEDRKELRR >NC_013721.1|WP_076611763.1|857159_858341_+|IS256-like-element-ISGva1-family-transposase MPRLSSKRCPICSHSMKRNGRTRKGAIRWRCTACGMSFTRKREDITHAAQFNKFIHWIMGNQTQREACPSGSDRTWRYRIEWCWNVKPELPETGEIYDYVQIDGTYLPYGWCLLTAASKGKVIALQWCNRENTEAYKSLLNNLLAPTVVVTDGNAGALSAIKQVWPETRVQRCLVHIKRNIRVLTTSKPRLQSHKALWGLAKKLVKIRTLKESDLWVNLLQKSYNQWKDWLNQKTFRKNVSEENIPSWVRSNQQWWYTHQKARKAYNLLAKQVKNGTLFTFLDPLLLESTSVPIPSTTNALEGGINSQIKKLVSIHKGLSEDHMRRAIEWWCYFNSENPSDPKQFITAECFKPKNKKIIIDNNPIGPAHLDTGFDLCKSDYHPDISIRKGTMR >NC_013721.1|WP_041160497.1|853930_855328_-|hypothetical-protein MEDKSLDLTTPVLGFLKTENNDSSETVVMLHDNGLKFELSVLFKELSGQIFRWFLGNNVIYVNSDAKYKTINRLPVPNILQVLSPSKTYTLVGCRYVKSSTNVMRQIGIGTITCDYIIVGSNEKQFEKITNLRTSCSNLYDWFTSGGINVNKLSDDGESIKIDIKKSSSKELCKINPMQNESNKSFISLSLAINQELRKLSKNGSIVSFEEEAFIETHSEDELNWEKHISIHNWILNLISISKLERCEFSKMEVGVENNNTCNGTTVRWFPVLHHLNAGTQNIYKKHNKQFLFDFDDINICGIEKWFELLQRYGRAMGIISHIAKEQNNLPVESVNISLGVALEDIGWEIIKSKGQKNRMNKNGGLACFMCALCAIKDEFGKYFPIDIKDFDWLKEMNDIYKKNKHDDANENMEKTPKDYEKMYRINVCFINIIRLWIGKQLGVELPTMLKRLNLNIFKYNETSI >NC_013721.1|WP_012914179.1|871920_873162_+|allantoate-deiminase MIDSSISAETIESAISWLAGFTEKPGEQGVTRPLYTKSWKQALFALRQRFEKLGMTVEFDQVGNLIATVEGCEKPESIIACGSHIDTVSRGGRFDGQLGIVAAYLAIKNLLTRYGKPKKSLRIICLAEEEGSRFPYVFWGSKNFFGLAQKEDVENICDSEGVPFETAMREAGFNYQTTQPKFANLEAWIELHIEQGPTLYSNHEDLGIITSIAGQHRWDIHLKGVQNHAGTTMMSYRHDAVDCMSHIIAKNLDKAKQAGDPLVLTFGRISVVPNQVNIVPGEVTFSMNCRHTDKAFLDKFLQELDEDIHETAKSYGITAQIDKWMSDDPTPLNDDIINLLKDQANKQGYAYRVMHSGAGQDTQIFAPFVKSGMIFVPSKDGISHAPEEYTDPQDAVHGVKLLRDALHSLAYED >NC_013721.1|WP_012914180.1|873471_874956_+|putative-allantoin-permease MQISKLLQDEPVASKNLECYRERGYTNEDLLPKPQEKRTMSALNFCTLWMQSVHNIPNYAAVGAFLLMGYSPIYVMIGIMLGAILTAIVMVANGVVGSKYGIPFSMHLRATYGKAGAQLPGFLRGVVAAICWYGIQTYIGAKALEILVGKICPEFLTFGNEIPFVGSHIPIIISFAIFWLLNLAIGLGGGKTLNKLTVVLSPLIYVVFGCMSVWAISCAGGLSAIMQYHATVQAPTNYGLGLFMMMVVNSVLGVWAAPAVSIADFTQNAKSTMAQLKGQIASFLLSYALFAFTSVVILVGGSLFYHETITDVLVIINKWDNLPAIAIATGLLLLTTVCTNATGNIVPAAYQLTALAPRFVNYKRGVVIAALVSTVLMPWKFMANITGFLNLIGMLLGPIAGVLLADYYVVKRTHIDLDQIYYDTNSSSKSAYSGTNWISYTSTILGLLFALSGQIIPLLSPLTQISWIVGCAVGFVCQILLSKVFQPTMAKERK >NC_013721.1|WP_012914181.1|874957_876334_+|allantoinase-AllB MNYDVVITNGLVVFDNEATKADIAIKDGKFAAFGSNFTADKTIDATDLIVVPGKVDSHVHINAPGGGVRDDWEGFVTATSAGAKGGITTMLQMPCNQIPATTDGKSFEAVLKEADGKLKVDVGQTGGLEPQNLNGGICEQDKQGVVNYKAFLGTTGDKDLQVDLYNCDDYSLYTGMQQIAKTGKILIMHCENAPITDQLGKKAHQMGARKLSEFVATRPVFTEVEAIERACLFAKETGCRIHIVHVSSPEGAQAVADARKNGVDVSCETCNHYLYFDTSELDEIGNIAKCTPPIRDKENQNGLWEKVFDGSINFIVSDHSPAPLSLKQTDDAFTAWGGIGSVQNDVDVFFDEAVQKRGMSLTQFVALNSSNSAKRFDLKGKGSIRLGYDADLAFIKPNAPYVLKAEDLEYRNKFSPYVGRTIGCQVVRTLLRGEEIYSQENGVCKDFKGKFILHPAKD >NC_013721.1|WP_004106666.1|876394_876592_+|sulfur-carrier-protein-ThiS MLKLNHDTVDYVENENLLELLKREGYDSTFVAIEVDGELVKRKDFETFIVKDNAKIEVFSIMGGG >NC_013721.1|WP_004108962.1|876592_877216_+|sulfur-carrier-protein-ThiS-adenylyltransferase-ThiF MESDDIRAKVLKRQLKEDNDLFAQQSVSILGCGGLGSNIALMLARAGVKKLYIYDFDSIEYSNLNRQNYTINEIGQHKVYATKARLNETLPYVEVEAFVQKVTPESLDEIAERSDLFIEAFDNRESKAMVLDYFMNHPNKYVITASGLSGLGDIRNVKIKHLSNVCLVGDFKSSPEEGLYLPYVSIIASLEALEALKWIKNGGNYGE >NC_013721.1|WP_004108959.1|877205_877973_+|thiazole-synthase MENEDYLELGGKKFTSRYILGSGKYSEELIDAAVNSAKAEMITVAIRYSEKAAGILKHIPEGVTILPNTSGCVTVEEAVNTAHIAREMGCGNFIKLEIIPDRKYLLPNNEATLQATKILADEGFTVLPYMYPDLYFAKAMRDAGAAAIMPLGSLIGSNKGLETKKFIQLIIDEIDLPVIVDAGIGVPSQAAEAMEMGAAAVMANTGVATANDIVLMAKAFKLGIEAGRAAYLAGPGRVLEKGATPSSPKRDDESK >NC_013721.1|WP_041160578.1|878393_881837_+|tetratricopeptide-repeat-protein MTNNMQTTENSESQEDSMLQLQESLNSLWSEGNPIDISSRAKKYVIKDFPQTNTNNQLGWVQLNKKKEYTSIGLFKEDGRKFLQLAVADSEGRIFLGSKLSYEDIQDMRPQLSFVEKNNSSLKDLHRFYGKQYKKTKITLKCAFLEAFDETYTKHTDYDMDNIVESYLNGDATVEDFLASLESSKSKSALKPEHRSKTFAGKINHFSKTFSKIRKENVFCVSNGDLVLQLSEDNGLIKIGSWISHGFGTKKTYTLEELRTKLPENLRCNLTYRNAITIAFFAALFIPDMHKSLTGFGYSNVLRRLEHEAPLGAIRRSVEDIKIAKKENLRVSGLEEYFEKLMRSTGALETERSLEAVHGAERLHMQKSQQTEANYLLWDDALEPNAALKVLKIEGAINRLYSIYDCLQINSIMGGKTCENTIKEEQASQIDMAMIKDVAFAASGSIGMQFFGNPLMDDKIRPILHMWRRAKQVKNNDKYFNKIDNDSEWSYRQRLSNLIRSVYLPFRFDAEFRSNLEDGNVAINLTTAGAALMPSVAYANEDKSWKNLNDDDKEKLATNYNLQVGMIMSTLAFGASGKVKNVSIRLDSIGLEEMVTAQNNAMNNLVNRTLNALNVMNSDAANNDSSKPKGDPKDGDIHGDPSKLQEMQQTNISNANKTNNNVISMHASASSQELNSETDTNADSNENQLSENSMDKSKEENSFAAFTTAPSIRALTTVTFSRDRFIKLIHQNGLNNPIKTYEKFNAKLNIDSHGKLNTIEPDFDVHSSRFSPHGSQEEPEFSDKIFTADQESILGTQYAQGLSIQREDLLQQAVADFHHIASELMPTTAQKAHEAMSIIESIDDPELNAQADSITRALIDETDLPNLSFTTAKRIRDIRTKAHEQFMNGDLSEALKEYENSVAQFDTMFTSSEAVPRYFNSYAERVIYNHLFATKEERTRLIPDGLFYAHMELADLLSQLNQHDEALRHLNIMVSYAPTYALSHLRLADILAQKEDWSSVIAACINALNVSLDRDDAAFAYYKLAYAEWMQNNFLIAASSYRMAQYLAPGKIEPLEMELDELLSRMRSQCKLIPSNIHEAQMALLSEDVPAWPDIEAEEIIDKAAKLTVNDGMFVIARTLCVANIRMTSNEDLSSTVQTQFLRSLNA >NC_013721.1|WP_012914184.1|881970_884859_+|NAD-dependent-DNA-ligase-LigA MSENSIREESTESEQLSLFYDFDEQTISHEAPKNETNSISTVKTETVTQADSKTDNKTDMHEANEWNSTIDTEKWIANLKPTDTDALALGNLQPATLTIEQAAKLWAKLAAWAQSDQIAYYVEDSPTSSDAAYDARMRALSALEAAFPTLDTPQSPTHRVGGTFSNDFTSVKHPSRMMSLDDVFSIEELHDWYNSVIRDLQWDESQPLPMTCEVKIDGLALNLIYRNGTLEQGLTRGDGVTGEDITLNVRTIEAIPTQLHSDNPDDIPEFVEIRGEVFMKWEDFRKLNDEQENAGRVAFANPRNAAAGSLRQKDPRITATRRLSFFAHGLGELRWKENTSHNNSETKFNQSDAYKLYQQWGIPVSPYTRKVTNFSEIEEMIDYYGKHRANILHALDGIVVKVDDRALQHQLGATSRAPRWAIAYKYPPEEVNTYLKDIIVQVGRTGRVTPVAVLEPVTVAGSTISRTTLHNAYEVEHKGVLIGDTVVVRKAGDVIPELVGPVLKAREGRENELRKFVMPEYCPSCGTKLAPAKEGDKDIRCPNVENCPAQLTERIIHLASRQAFDIENLGDNAALALTNPEDCRPTTAEVYCPDMDKIIIQRGATQQPYIPSPDLTLPEPQTPVLRNEAGLFSITADDLQNVMVWKEIPLVEEWKEVSKDGSFKKRTRKIGGSGLWHQVRAFWTRTIEAKLSTNYEAEGTSEKTTSLNEQWDPQYPKFQVPIDAKVVLWKNKRITRNAKTSDNAKETINVPWYTRPSETTRSMLEEIAQKGKNAALWRVLVALSIRRLGPPTARLIAANFGSLDNISKASIEELTQIDGVGPEIAQAVYNWFQQAKDPANWQFEVLKSWQEAGVVGKVEASSFAQTLVGKTIVVTGSLQGFTRDSAKEAIVSRGGKASGSVSKNTYCVILGENAGSKATKAQELGIPMLNEQQFNTLLKTGNLEEILQIANNTVPNLIAEES >NC_013721.1|WP_012914185.1|884862_885990_+|Mrp/NBP35-family-ATP-binding-protein MNTSELEKQVYDLLGSVIDPELGRSVTELNMVTGVHVIKKAETIDATKFAYDVEINLELTVPNCPLAEVITGRVQEAISKYPQAILIPHVNATAMSKTKLEKLVADLKAERKENPFNKAGTRTRIFAIASGKGGVGKSSITANLAATFAALGYDTAAIDADIYGFSLPRMFGVNSQPTNLNGMLMPVVAWGVKLISIGMFAGTDRAILWRGPRLQRSLEQFLSDVWWGNPDVLLLDLAPGTGDMALAVAQSLPNVELVVVTTPQPSASDVAVRSGLMALQIPVKVRGVVENMSWFENNGERLELFGSGGGKRVSEQLCNALGTNVPLLAQLPLDPALRETGEAGRPAVLTENGKLADSNLANTFIHLAKSLIKNS >NC_013721.1|WP_012914186.1|886083_887454_-|DUF349-domain-containing-protein MTEQSKPEETTKTTTAPKPVAPSPASFAHKTRVVAQAPTISYSEEDIKAAKTFGRVDENGTVYVTENGVEREVGQYSTGTPEEALTFYIHRYLDLKIKLDLFAKRLEASNVKAKEIDETLNTLKAELENPAAVGDIAALRARHAELAAKGNEKKEAISKARKEALEKALKERTNIVERAESLVAQMNESTNWRELNDKLRALFDEWQEHQRTTIHLNKSDADALWQRFSKARSAFSQARNSWVKEREHVRNTAKQLKEAIIAEAESLKDSKDWRETSMKFNALMDRWKKAGNAGRQHDDALWAKFREAADAFYHARQNDREKVNAGEHENLAKKEALLVKAEALLPVENEEAAKKARKALSQIQEEWDAIGFVPREDMRRIESRLNDVDKKIKAVEDAAWREADPEANARKSSFALQLEAQLEELNAAIAKETDNKRKAKLEAEKATKEQWLSAVK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_013721_3 | 1135304-1135391 | Orphan |
NA
Consensus repeat of NC_013721_3
|
1 spacers
spacers of NC_013721_3
>3.1|1135327|42|NC_013721|CRISPRCasFinder CGAAGTAGAAACCGCGGAAAACCAACGTTCAATAGAATTTTT |
CRISPR arrays and Neighbor proteins around NC_013721_3
The CRISPR arrays of NC_013721_3 >merge|NC_013721|3|1135304-1135391|CRISPRCasFinder CGCACGATTAACTCACGATTAATCGAAGTAGAAACCGCGGAAAACCAACGTTCAATAGAATTTTTCGCACGATTAACTCACGATTAGA >NC_013721|3|3|1135304-1135391|CRISPRCasFinder CGCACGATTAACTCACGATTAAT CGAAGTAGAAACCGCGGAAAACCAACGTTCAATAGAATTTTT CGCACGATTAACTCACGATTAGA
>NC_013721.1|WP_012914338.1|1134194_1134887_+|hypothetical-protein MTVFQMPFYSAYENKRITPDFEPIPGWKPFANYGSSPAQIIAHCVRNKDFSVLHAGDYFDETVNGTTYRWTIAEFNHYGRGEALLVPDKLMPDNMKFSDVDNLYIDSTLDWKFKDFYRVMPDSLKPYVLEMSLPCTNVNGKVDYFDEHVFPPSEIEAFGTAKYSNESSRTYTQWACFTDNSSRIRSNRWYWLRSTHTNSNIAIPAVNTDGSAAAARIRAAGGALPCFCIG >NC_013721.1|WP_012914337.1|1132214_1134185_+|hypothetical-protein MSDFQNATWSVIQLDYANAFIPDVRLNGGDENGRIIRVQLLDNGVPVDDSTVEVFLCWNRQPGVLIGDRVKMEAKDSDDGRIWQVAVPVAACRMPGTVTLGFEVKRDKTIVCSRSFTAIVEQPVFDAGSPEGKSYRQELEDTAQQAKDATGKANALTDKVSHLIEQNETVSRNAQNAADAANNATSIAQQAAQQAKDAASEASQAAQNTQNVISHATEVAQQCDASKQTADQAAKRADDAVSGLKQTVQNAAADAASKVQQAVERANSATQAVDAVREKTEAANKQTETDLAALREEVVKAQRAGFTASSSAQKCDEAAQAYRNVSGEVAQAKQTSEQAVEAANQALHTAQESAAAVAQAQSVLDQVKDASETAKRVVSAVEELKQTNNAALEATRTANAQASAAADAAGKANNATSTANSAAQAANDAAGKVTQTLQESETRFKAVEQAANDAKSVAGTANSTAEAARSTAEQAQSKANDAAGSAQRAQNTANSAIEATDNNKNRIDSMESDVSSLKNSCSAAQSKANDAAQTASKAQSVADSANSAAQAAASKADSAQQAVNNIRTPIVKPQSLTGYTTPSSWDWTLTDLKELPHGGHILIYPQADSIKEYMRQQPEFEIEEQNGVRTGKITVITHQPNTNGSALKLVFVWFAD >NC_013721.1|WP_012914336.1|1132075_1132222_+|hypothetical-protein MNPEVKQTITNLQLMIANLSLDNARLQARVSILETQVKEVKAEETDHE >NC_013721.1|WP_041160521.1|1131885_1132086_+|hypothetical-protein MVDIKIVPSEPAPVLATRTIKTETHYAIASTDESPWQKEMNLKLDVIMRALGVGLNDKEKGKSDES >NC_013721.1|WP_049762332.1|1129848_1131873_+|hypothetical-protein MSGFDNWNPNYVPYTPPAQGGGRKANGSGKNFDAAKRRSARKALIAQNKAERAARKAERQRRAAERKAAAIVRRQQAAIRRAERKRISEQKRLERLQNAQIRKQRVEEQREARQQKHQLVLEHKIEREKAHRLEQERRAACKALSELRKKTMGGGDTQSRVSIVDVNNDLPYVEDVAATRLWGVPDGLGGMLPLDGEVIFDDISDANLLFKKGKECLDKLSKPQLTYEADVVSLGKIGFSSDAVGVGDTVQIIDTTFTPPIRVEGRVLKIEEDLLDSVDTTRITLGNIHESYTQKRRAQEQKLDELIAQSGEWNAVADGNSLYVRDLIGRLNEVLNARGGYTYFTPDEGIFVYDKGEDAHPTNAIQIGGGFWRIADSLKPNGDWDWKNVCDGHGLFADRIYTGILSDAQGRNFWNLDTGEFSLQASVKIGGKTVQEIVDDSADGVISKMNESLTQEAIFNKLTNGGQTQGIYLSGGQVYLNASYLKTGSLNAGLIKAGRIQDYANSNWWDLTTSTIHISRGEIGGMTIANNKIHNRTLSLDEYGMHFMSGNNKYGRIGITDYTDFQHKVLFFGSRSDDCLIGFGIQSNGVGIYEAALTYSPTSFGTLEQGLNFSADCSFAHHKVSKAGLASDCVYEDGANMDSRTFYLPTSMDKDGTAREWVPVTFGFRNGFVI >NC_013721.1|WP_041160520.1|1129138_1129852_+|hypothetical-protein MEFFCSKCDGTLVGVLNGVTMAKRCRSVDGSDTLDITCSNCPVTKGDRIVFTDGQGIGAEYLVQSVQAIRGEEQPTITIQCANSIAELNSVYVEDLRGTAATAEGRFKDLLAGTRWDVKYVENGATETAQGNYAFYHTNVLKALQATCKTFNLELYTSVELDGNHVVSRSINAVEHRGGKTPVKRFEYSRDLKSIKRTINSANVITRLYVWGKSVSQVEQKTENSKTGFEDNEGVEL >NC_013721.1|WP_012914334.1|1128385_1129135_+|hypothetical-protein MSFEGAVRINGSPIEELGFSVCYPGMRVSAPEPIVKFQRVPGSSIAVDTTLRDEDGNAPLKERTVTLTMCTIGHVEDIARMQANLAALTGSLMTVQCGSSPTWRGYATFKNWQPVFAFGNTAKYKCDLVLTAEPFAYGVPTTVSVSGDVHIAVDGDRPCFPHFKLKAASDEVIINCSSSSKLLTFEGLSDGATLEIESTPQARVARMNGTIVVPTLQSDFFPLIPGIVDLSVTGASGVMSFTPLFVYGV >NC_013721.1|WP_012914333.1|1124860_1128382_+|phage-tail-tape-measure-protein MAQNENITIRMTADIADYSAKLQTAAHLTGKFDSFVKSAGTTGEKTGRIMRALAIGAGAVAVAIGVDATRRFAEFDQAMAAVRANVTENVGELKKLETAALDAGRSGMFSATKAANAINELGKAGVSVKDIIGGGLKGALDLAAAGEMNAADAAELTASALNQFGLAGSKASHVADLLAAGANMAQGGVQDMGEALKNVGVNAHILGMSVEETVGALTLFASKGLVGSDAGTKFNAMLQNLVAPSIRAQGTIKKLGLQIYDSQGRFVGLASVAQQLHDKLGHLTQAQRNAALGRIFSNAALTTANTLYEQGAKGVEKYTKMINQQGFASKVANTQMDNLKGQLTRLGNAWDTMLIKIGSGRKGIISGMIGAVTGLVNAFAALPPEVQQAVIAITALVGIGAGLYDMYRKSSRYGGVVAKSFDFIGSKAKRLFTLLKSTKLGAGLESAFRSLSSTATDTFSSIGYKIAGASSGLSVFKAGVLGLTAVTTGVLVAALAAAAIGFVMWQAKADAAKKQTDALKDTVRNSGDVYRKLADELKNGNDGVSWFTKGTQNFTDALKTCGVSMDTFVSAVKGSKTAVTSFNKALDKTWKNSTPDRFGGITKQSVNILKEAFDQAKKSVKDAQNAMKEEQAQQKANALASSQHADALMKGAEAAAKNSGEILKVAKVEDILIAKFGASKNAINAQAEAINNNVEAMQKYYGFAMDADQALTNLDKTIRESSKSVAEAGKHWMDNTDAADKNMSALTSLAKQTFETAEAMAKNGESVENVTKAFDKGSAAFVDLAQKTGMSKDRAIALAKAWGISHDALMKLIGAAKQSNVEAKVTAKDNFSEVFKKQNLSVKNLKGGKFEITGNNKKALDAIAKVSKAKLDPKKLTLTLDKKQLQTALDAVKKMKPVEVKAKVKADTKQAKKAIADVSKAKVPDKTVKLKGDKRNLDSTMSKAKAAKVPDKTVKLKGDKRNLDSTMSKAKAAKVPDKTVKLKGNKTDFQNKFNSAQSARLRDKTVYFRANANEVWSAINAINNASVHVNARVHRANGGVVYGAGTGTSDSIPAMLSNGEYVMTAAAVQRIGVNMLDRLNYGNSIAGSEKPASNTGGDMLVNAVNGLRNDMRALNDRLVASGCVANSNVAEAMRSVFDDGVKLKLDANGREVMAGLLATPMSRELSHMIDLGR >NC_013721.1|WP_049762331.1|1124623_1124860_+|hypothetical-protein MLALDEYENTVLCPLCGMPSKFCHDYLKVNDTFERAKIETCFVSAMREQAMEKYLKDDRPGTTRSQTTKLVPFGVEEE >NC_013721.1|WP_012914332.1|1123994_1124495_+|hypothetical-protein MPIKYIQPQLTVDIVTDLVSLQKTLMLTQELVLMQQDETNAIRDGRSADEVLEEIHKTKELADRSVITITLQGLNHSKWSEYVIKNTENKEDEDTPTINVKQAALEAFPAMIVKAQYKLSKRSVSNNDVKELLPKLADSQITDIVTTIQNLNEPTTAFPKELTQLI >NC_013721.1|WP_012914339.1|1135476_1136472_+|glycosyl-hydrolase-family-25 MALNGIDVSWYQRGINIAAVPADFVIVKATEGAWYTNPCFHAQADATLNSGKLLGIYHYISGGNAQAEMQYFVNAVKPYIGRAILALDFESGSNSAYGDTAYLQQCAQTVYNLTGVRPLLYGSQCDYGRLARVSKATNCGLWIAQYANSAHTGYQSEPWNEGSYSCAIRQYSSAGALPNYGGNLDLNKFYGDRTAWNKYAQSDHATPPPAPKPAPKPDVSPIEHDGDISSFTMHIPWGTNQDQRMGITRCGNVVTVNGCGCVVCGGGSWIHAREQVPEGFRPVSLATIRLSGNGTGSIMVKPDGSICWDGDGKNCFTHINATWITKDNQPK >NC_013721.1|WP_012914340.1|1136491_1136968_+|phage-holin-family-protein MNDVEIFFLELAGAVVLLDFISGFAKAVYTHSVASSKMRDGLFHKFAYVLVIAVCILFDYAQAKVDLGTHVPLVLIACAYIVITDTVSFFENVTAFNVQIANMRVVKVILSVLDCVKTYVDGQADNAINNGKHAAAIETDTELDGRGKEMNDTPTENK >NC_013721.1|WP_012914342.1|1137465_1138542_-|hypothetical-protein MKNFFKGISFTQVLAGSLAAVTSFLLASKIGIAGSVIGVAIGSIVSAVASQLYQNVIHASSRKLSEVNPNTSTEQSNFSNVSDSSVRSNSLENVDSKRIGDDIHMQYSSNSLQCNTDPSDDKNMAVQDARFGRTVSSQVRNPELENTRILELSKLREQHEEDMKLSEGTIPEPVPLSKVVSENYGYSSESYQNRNDHRTTIVVAVISALIAVIVTAGAVLLFTQGKGTDNIDKPKEISHEQQQNQPQPKQRLNTNTNQDHKKSHDDSNSKDANEDENDEHGNENAAGSTGGHSDNDLGSDTNNDSGTNSGDAASGNSGSGADSSDAGAGSAGSSGSTGSSNNVTSPGNGDQSGSANGN >NC_013721.1|WP_004108292.1|1138732_1139740_-|DMT-family-transporter MKVKYETAKSIAEPLSDKSGMPQGKSVHTAQIMLLICAAIWGGSYVSSKYALEVFPVQWLMGVRMVGACTIMAVIFFGTLRKTFTKKLIVPSLLTGITYYATLILQTEGLRSIDPGRSAFLTAAYCVITPFSAWLIVKKKPTLLGLVAAVMCVAGVGFVALKPGMLVLTLSYGDFLTLLCAAIFAFNLTFLAYYSRKYNPVTLTFGQFVVSGVLFVAGALICEPLPNFNAADHFSIFANMFYLTVVVTVVAQIMQNYSLVNLSTANASVIMCTESLFTLLFSTVLFHEHVSNMAFVGFALIFAAIITASLGEHYCDKKANCELSYKDRDFEVAKS >NC_013721.1|WP_012914343.1|1140085_1140490_+|flavin-nucleotide-binding-protein MATLSADMKEFIANNLAWIATVSKDGELDLGPKMSMFVLDDNHLAYHERTAGQHYKNLQDGSQLVVAVANLAKKKGYRFRGTVTLHTDDAIYEEQVRVAEEKGTKKPAAVPVMEITEIQDLTSGATAGKTIAKD >NC_013721.1|WP_012914344.1|1140804_1141254_-|DUF3021-domain-containing-protein MEKQVATLGKTMVKNIVKGIGIGCTIFTVMSFISSLLAHSEVGNRIASYAVATFVIGIGYGVFAIFWSNERMSNFAKFVFALVPPIAIQFIVSVIVGWISFKDEPAVICGWIAFTVILPIPIAAIIYYFEKKKAKEMNARLKALRKESK >NC_013721.1|WP_004106477.1|1141365_1141875_-|LytTR-family-transcriptional-regulator MKVTIAIVPPEEEQSVQLSIHNIDDELKHIIAQLQSIDSNDANVRSAENNTEINTSDKLSNNNPFIYGYINDSIIVMHEPDVIMICVENSRVIIHGDKGECVSYKRLMDFDNPQFASFVRISKSAIVNLNRIVRVDPGFGGSMSVKMDDGSVEWISRRCLGGFKKRLGM >NC_013721.1|WP_012914345.1|1142013_1142832_-|histidine-phosphatase-family-protein MVDFVENVASNVNNSINSAHEGIARSVMLIRHGQTPYNAQFRLQGMIDIALDESGMDQVTRSGKALRVLFGASDLEDPKDPNSTEHVRITPEEADASFVAIVSPLIRAQQTAHAFADKIGLHVHVENGVRERFFGEWEGLNRQQISEQWPEDFKAWVEGKSGELRHGAEAKQEVGERAAAAVEDWARKTGADRDLLVFSHGSCLSQTVHKLLGLDKVDPSYKTIGGMGNGRWARLIPSLNAEGSIHWRLDAYDQGPAADPTSQRLIRIWGAA >NC_013721.1|WP_012914346.1|1142856_1143279_-|ribosome-silencing-factor MTALESSISAIRIAAAAADRLKATNEQAFDVSDLLGITDAMLVVSASNERQVLGVAEEIEKDLYLKDNKRKALSREGLELAQWILLDFGDFVVHVMHEEARAFYRLERLWNDCPAIDLQLPEHASETEDSSKTEDSSETE >NC_013721.1|WP_012914347.1|1143315_1144725_-|bifunctional-UDP-N-acetylglucosamine-diphosphorylase/glucosamine-1-phosphate-N-acetyltransferase-GlmU MEKHDVTAAIILAAGDGTRMRSETPKVLHPFAGKTFLNRVMDAMNGVNPKKLAVVVHAQAERVAAAAISYNENVHIVNQDETPGTGRAVQCAIKDLNEIAKNETGEALKGAVLIAASDMPLLDTQTLQSLVDFHNNNNNVATVLTAVLDDATGYGRIVREANGDVLRIVEHKDANAGELAIHEVNTSVYVFDATVLCDAIADLNSQNAQGEFYLTDALEHARKFGHVGAMSAPDPLTVEGVNNRVQLAALAKAHNKRVCEKWMLEGVTILDPDTTWIEDSVTLAQDVTVLPGCFLQGCTTVASGAVVGPYTTLIDAQIDEDAVVERSRVQESHICRAANIGPWTYLRAGNVLGEESKAGAFVEMKKAHIGNGTKVPHLSYIGDADLGEHTNIGGGTITANYDGVHKNHTTIGSGAHVGAGNLFVAPVTVGDDVTTGAGSVVRHDVPADSMVYSENTQHVVENWKPAWER |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NC_013721_2 | 2.2|870248|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT | 870248-870280 | 33 | NC_013721.1 | 1126533-1126565 | 0 | 1.0 |
NC_013721_2 | 2.3|870309|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT | 870309-870341 | 33 | NC_013721.1 | 1129274-1129306 | 0 | 1.0 |
NC_013721_2 | 2.5|870431|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT | 870431-870463 | 33 | NC_013721.1 | 1129423-1129455 | 0 | 1.0 |
NC_013721_2 | 2.4|870370|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT | 870370-870402 | 33 | NC_013721.1 | 1128755-1128787 | 1 | 0.97 |
1. spacer 2.2|870248|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT matches to position: 1126533-1126565, mismatch: 0, identity: 1.0
caaacgtgtccatgcttacgccgcaagttttaa CRISPR spacer caaacgtgtccatgcttacgccgcaagttttaa Protospacer *********************************
2. spacer 2.3|870309|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT matches to position: 1129274-1129306, mismatch: 0, identity: 1.0
ggtgacaggattgtttttactgacggtcaaggc CRISPR spacer ggtgacaggattgtttttactgacggtcaaggc Protospacer *********************************
3. spacer 2.5|870431|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT matches to position: 1129423-1129455, mismatch: 0, identity: 1.0
cgaccttcagcagttgccgcagttccacgcaaa CRISPR spacer cgaccttcagcagttgccgcagttccacgcaaa Protospacer *********************************
4. spacer 2.4|870370|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT matches to position: 1128755-1128787, mismatch: 1, identity: 0.97
cggctcagccgttaaaactagatcacacttgta CRISPR spacer cggctcagccgttaaaactagatcacacttata Protospacer ******************************.**
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_013721_2 | 2.3|870309|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT | 870309-870341 | 33 | MN148435 | Shigella phage SGF2, complete genome | 34778-34810 | 7 | 0.788 |
NC_013721_2 | 2.3|870309|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT | 870309-870341 | 33 | NC_010696 | Erwinia tasmaniensis Et1/99 plasmid pET35, complete sequence | 18653-18685 | 10 | 0.697 |
1. spacer 2.3|870309|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT matches to MN148435 (Shigella phage SGF2, complete genome) position: , mismatch: 7, identity: 0.788
ggtgacaggattgtttttactgacggtcaaggc--- CRISPR spacer gatgacattattgtttttactgacgg---aggtttt Protospacer *.***** ***************** ***.
2. spacer 2.3|870309|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT matches to NC_010696 (Erwinia tasmaniensis Et1/99 plasmid pET35, complete sequence) position: , mismatch: 10, identity: 0.697
ggtgacaggattgtttttactgacggtcaaggc CRISPR spacer taacacaggattgtttttgctaacggtctgcgt Protospacer . **************.**.****** . *.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
193640 : 201222
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_013721|193640:201222|DBSCAN-SWA AATGGTAAATATAAGTAAATGGGATGCGAGTGAGTATCTTGAGGACGATGATGATGTTATTGCGTATCTCAATGCTGCAGCAGAACTTAACGATCCTCGCTTGTTGCAAGCTGCTATAGGCGATATTGCAAAAGCTCGAGGCATGAAAGAGATTGCTCAAAAAGCTGAGGTTGGAAGGGAAAGCCTTTACAAAAGCCTAAGACGAGATGGCAATCCTAGCTTTCAAACTATTGCGAAAGTTGTTCGAGCGTTAGGTGGACGCATTGCAATAGAACCAGCAAATGCGTGACATACCAAGTTATCGTGGCAGTTTTGCTAATTTTGCAATTTTTGCCACGATAACTCGTATAGCATAGCGACTGTGCAATTAAAGTTATGCGATACAGTCGCGTAATCAGTTAAAAAATATTGCCTGTTCATCAACGATATTGATGAACAGGCAATATTTTCAAGACAACAAAATTTGGAAACTAATCTAAGTTATATTGCTAAATCGTTTACTATTTTGCTACGTCAAGAACTACTAAACGTTGTCCGCCATAAGTTCTGGCATCGAGACAAGTTTGAATAAAGTAAGCAGATTTAGGGAATTGCTCAACGTACTCTTTGAATACTTTTAGCAACTTATCCGAGCTTCCGCCACCCTCAAAGTCTTTATAATCCGCCACTTTGTAATCTTTGCCGTTTAATTTAACTGGCATGCCTTTTTGGAGTTTCTTTAATAAGATGCCTGAACTGTTCAAACGGTCTACAACCATATAATGGCCAGTGTTAACCTGGAACCAGTCGTCTTGAAGAGTCCAGTGATAGAAAGCATCTTTAACGTAAGCCAACTGATCAGGATAAGCGTCCCAAATAACGCGTTTAGTTTCACCTAGGGCTTCCAAAGTCCCTTCTTGTCCCTTTACAAAAGCTTCGGCGATTTTCTTTTTTGAGGCTTTTTTGATTGGTTTTGAATGTTTTTTGGTAGAATGACGGCTTATTTTTGTTGGTACTGGTTTGTTTGCTTCGGAGTTACCGCAGGCTGCTAGCCCCACTAGCGAGCTGAAGAGCACTACAACACAAAGTAATTTTTTATAAGTCATTTTCCCCATAAATGGATTTCTTTCTTTCTTCTTGTAGATGCGTCTAACGGATGTTAGTCATCTTTATAAATTTGAACTCCAACTTTAGTGCCTGCTCAGTTCAGTTCAAAAATCTTGGTTTGAGCATGAGTAGGAAGTGATAATCGCTGCCGATTTTTGCATAAACGAATCAACAAGACCAGTTCCTTTTCACATAGTGTACCATAGAGGCTGAGTGACCTGTGTTGCTTTCTGACTTTTTAGCTTTTTATATAGACTCTATTTAGCTGCAATAATTAACCTAACTAAATTAAAATTGGTTAACTAATTAAATTAATTAGTTAACCAATCAGTTAGTTATTCAGTTATTCAGCAAATCAGTTAGTAAATCAACTATTTGATTAGAAATCCCAATCATCGTCGTCGGTTTCAACTGACTTGCCCATAATGTAGCTGGAACCAGAGCCGGAGAAGAAGTCGTGATTTTCGTCGGCTGCAGGAGAAAGAGCAGCGAGAATTTCTGGGCTTACGTGAGTTTCTTCTTCGCTGTACTTAGCAGGGTAGCCAAGGTTCATTAAAGCTTTGTTAGCGTTGTAACGCACGAAGTCAAAAACGTCATCGTGGAAGTCAAATCCTTCATAAATCTGACCAGAATACTCAACTTCCAGATCATACAAAGTGTCGAGCAAGTTCATAGTGAACTTTTCAAGATTCTTCTTATCGTTCTCGCTTCTAAGCTCCAAACCGCGCTGGAACTTGTAGCCCGAATAATAGCCGTGAATTGCCTTATCGCGCAAAATCAAGCGAATCATATCGGCAGTGTTCATCATCTTGCCGCGAGAAGCAAAGTAAAGCGGAAGATAGAATCCCGCGTAAAGAAGCAACGAAGAAAGCATAGTTGCAGCAATCTTGCGCTTAAGCGGATCTTCGCTCTCATACTCGCAAAGCACAGTGGTGACGCGCTGCTGCAAAACGTCGTTGCCAACAGCCCAACGATAAGCTTCGTCAATCTCTTCGCTAGAGCAAAGAGTAGAGAAAATAGAGGAGTAGGAGCGAGCGTGAATCGACTGCATAAAAGCAATATTCGTGTAAATAGCCTGCTCGTGCTCCGTGCGAGCGTGCTCAATCTGGCAAAGCTCGCCAATAGTTGCCTGGCTGGTATCAAGCAAAGTCAAGCCTGTAAAAACGCGAGTAGTGGTCTTGCGCTCAAGATCGGTGAGGGAACGCCAAGAAGGAATATCGTTGCTCAGCGGAACCTTTTCTGGAAGCCAGAAGTTAGCAATAAGGCGATTCCAAACGTCCAAATCCTTGTCGTCAACAATATTATTCCAATTAACTGGGCGTACGCGCGATCCGTGATGGTAGCGGTACTGAGATGGTGTGATAGAAGCTGCCATCATAGCGTCATCTGCGATTACGCGATTTGTGCCGAAAGGCTTATCGAGGCTACTTCTAAGCTCGGTTGGGCTGAGTTCTGTCATTATTACTCCTGATTCTGTGTTTACGTTTTATAAAATTTTTAAGTTTTTGATTTGGCGTGCGAAAAATTCACAGCGTTACGACTTACAGCGTGCAGCTTACGCAACCTTCGATTTCCGTACCTTCCAAAGCCTTCTGGCGAATACGAATGTAGTAAAGAGTCTTAATGCCCTTACGCCATGCGTAAATCTGAGCCTTGTTCAAATCGCGAGTAGTCGTAGTGTCTGGGAAGAACAAAGTAAGCGACAATCCTTGATCCACATGCTGAGTTGCTTCGGCATAAGTGTCAATAATTGCCTTCCAGCCAATCTCATAGGCGTCCTTAAAGTACTGCATATTGTCGTTCGTCATGTATGCAGCAGGGTAGTAGACGCGTCCAACCTTGCCTTCTTTGCGAATCTCAATGCGAGAAGCAATCGGGTGGATTGAACTTGTGGAATGGTTGATGTACGAGATTGAGCCAGTTGGAGGAACAGCTTGCAAATACTCGTTGTAAATGCCATCGCTCAAAATCTCGTCGCGCAACTGCTCCCAATCGCTGACGGTTGGAATCGCAACGTTAAAACGGGCAAACAAATCGCGAACTTTTTGAGTTTTAGGCTCAAGCGAGCGGCTGCCATCTGTGTACTTATCGAAGTAGTTGCCTTTTCCTGCAGGCTTAGCGTAAGCGCTCGTCTTAAAGTTTGCAAAAGCGTGGCCGCGCTCTACAGCCAAAGCGTGAGAAGCCTTATAAGCGTGGTAAGCAACCGTCATAAAGTACATGTCTGTAAAGTCAAGAGCTTCCTCGCTTCCGTACTGCATGTGTTCACGCGCAAGGAAACCGTGCAAATTCATTTGACCCAAGCCAATAGCGTGACCTTCCGCGTTACCGCGACGAATCGAAGGCACAGAATCAATAGAAGTATGCTCCGAAACAGAAGTGAGCGCACGCACAGCCAAATCAACGCTTGTGCCCAAGTCGCCATCCATTGCCTTAGCGATATTAAGAGAGCCAAGATTGCAAGAAATATCGCGACCAACGTGGCTATAAGACAAGTCGGCGTTATAAGTGCTTGCTTCCTGAGTTTGCAAAATCTCAGAGCACAAGTTGCTCATAGTAATGCGACCCTCAATAGGATTCGTGCGATTCACAGTATCCTCAAACAAAATGTAAGGATAGCCAGACTCGAACTGAATTTCTGCAATCGTCATAAAGAATTCGCGAGCGTCGATGTACTTCTTGTGAATACGAGGATTTGCAACCATCTCATCGTAATGCTCGGTAACAGAAATATCTGCAAAAGGCTTGCCATATTCGCGCTCAACATCGTAAGGACTAAACAAAGCCATCTTTTCCTTACGCTTAGCAAGCTCAAAAGTAATATCTGGAATCACAACACCCAAAGAAAGCGACTTAATACGAATCTTCTCGTCTGCGTTCTCGCGCTTAGCATCGAGGAAACGCATAATATCTGGGTGATGCGCGCTAATATACACTGCACCTGCACCCTGTCGAGCGCCAAGCTGATTAGCGTAAGAGAATGAATCTTCCAAAAGCTTCATAACAGGCACAACGCCACTCGACTGATTCTCGATTCGCTTAATAGGAGCGCCAAGCTCGCGCAAATTCGTAAGAAGCAGAGCCACGCCGCCGCCACGCTTAGAAAGCTGCAAAGCAGAATTAATTCCGCGCGCAATAGACTCCATGTTATCTTCCAAACGCACGAGGAAGCAGCTCACAGCTTCGCCGCGCTGAGCTTTACCAAGATTCAAGAACGTTGGGGTGGCTGGCTGGAAGCGGCCTGCAAGCATTTCGTCTAAGTAGTTGGTCGCTGCTTTTTCGTCACCATCTGCAAGTTCCAATGCAACAGCAGCTGCGCGCTGAGCGAAGTTTTCCAAATACTGCTTGCCGTCGAAAGTCTTCAAAGCGTAAGAAGTGTAGAACTTAAAAGCACCCAAGAATGTGCCAAAAGTGTGGTGTGCGGCTTCCGCATGCTCGTAGAATTTATCGAGAAATTCATCGCTGTATTTGTTGAATACGTTACCGTTGTAGTAATAGTTGTCAATCAAATAGTTCAAGCGCTCGCGAGTTGAAGCAAACTTCATGCTATGTTCTTGCACGTAACCCGAAACGTAAAGCTTTTCTGCTTCTTGATCTTTATCAAACTGAATGCCTTTTTCTGGGTCAAAAAGACCGAGCATAGCGTTAAGAACGTGGTAGTCGCGGCTGTTGCGGATGCGCTCTGCTGTTGTACTGGCAGTTGCGCTTGCAGAATCAGTGTGATCAAGATTAAGCGAAGTAGATTCGCTCATCGGATCCGTGCTATTCATGGGGTTTGTTCCTTTCTTCATGCGTTATTTTTGCTTTTTTAAACGCTTTTTGTTTTGTTTCTTTTACTTATTTCGTTTGTTTTGTTTATTTTGTTTCTTCTGTTTCTTTGAGTTGTTTTAAAAATCTCGGTATGCCCTGCGCTACTTTTTCTATATCTTCTGCAGTTCCCGTAAGTTCAAACGAGTACATATTTGGGATTCCACACTTTCCTGCAATTTGATCACCTGCAGCGCAATATGCTTCTCCAAAATTCGTATTTCCGGAAGAAATTACTCCGCGAATAAACGAACGATTTTCGCGATTATTTAAGAATTTGCGAACTTGCATCGGTATTGCTTTCGCAATATTTCCACCGCCGTAAGTTGGAACAATCAAAACATACGGTTCGCGCACGTTTAATTCCGGTTCTTTAGGACGAAGCGGTATGCGATATACGTTCACGTTATTGCTTGGAAAATCGCAATTTTGCACAAATCTTAACGTATTTTCGGAAACTGAGGAAAAGTAGACTACAGCTCCGATTGATTCACCTTGTGCGCTAGATTCTGGTTCGCTTTGTTGTGTTATATTTTCTTGCTCACACATTTCTTATGCTTCTTTTGCTGCCTCTGCGGAAGTTCCAAGTTTATCCGCAATTTGCGCAATTAAATCTGGACGGTACCCACTCCAGGATGCGTCGGGAGCAATGACTACTGGAGCTTGACGGTATCCTGCATTTTTTAACTGTTCGAGTGATTCGGGGCTTTTTGACAAGTCTACAGTTTCGAAGTTCACGCCAAGTTTTGTGAACTGTCGCTTAGTTGCATCACACTGTGGGCAATGAGGTTTAGTAAACACTGTGATAGTCATTTCAATCTCCTCTGGTGATGGGGAATTTTGCAATTCCGTGTTGTATCGACAAAAGCCAAGTAGTTAGCTTACTCTGAATAAGCACTATTTATAGTGGTGTGTCGACTGTGTAAACACTATATATAGTTATTATTAAAATAGCTAAAAATAGCGTGTTTGTAAGCGTGTCGTGTTTTATGAAGTAAATATGAGCTTAGTTTTAACGATTAAAAATGTAATTTATTTCACAAAATTTATATTTAAAACTACTTAACTTGCACATTGTTAAGCCTTGTGCATATCCGCCTATAAAATTATTAAGTGCGTTTCTTGAAGGGGCGTAACGCAAAATAAAAAATTGTGTATTTTGTAGATAATGCAGATTGCTTGGGTGCGCAAAGCATGCGTGCAAATAACAAAACAAGTACAAGATAAGAGGTGTCATTATGACTCAAGCTCAGGCAAATATTGGTGTTGTTGGTTTGGCTGCCATGGGTTCGAATTTAGCGCGTAATCTTGCGCATCATGGAAATACAGTAGCTGTTTACAATCGTCATTATTCGCGCACTGAAACTTTAATGAATGAGCATGGCAGTGAAGGTGCTTTTGTGCCTTCAAAAACTGTTGAAGAATTTGTGGCTTCTCTCAAGCGTCCGCGCACCGCAATCATTATGGTAAAAGCTGGTGCTCCTACGGATGCTGTTATTGAAGAACTTGCAAATGCTATGGAGCCTGGAGATATTATCGTCGACGGCGGAAACTCTTATTTTGAAGATACGATTCGCAGAGAGCGTGACATTCGCGCGCGTGGTCTTCATTACGTTGGTTGCGGTGTTTCTGGTGGCGAAGAAGGTGCTTTGCGTGGTCCTTCGATGATGCCTGGTGGCACTGAGGAATCTTGGAAGACTCTTGGTCCTATTTTGAAGTCTATTGCTGCTGTTGCTGAAGGTGAGCCTTGCGTAACGCATATTGGTACCGATGGTGCTGGTCATTTTGTGAAGATGGTTCACAATGGCATCGAGTACTCGGATATGCAGCTTATTGCCGAAAGCTACGATTTAATGCTTCGCGGTTTGGGCATGAAGTCCGACGAAATCGCGGATGTGTTTAGCGAGTGGAATAAGGGCGAGCTTGATTCTTATTTGATTGAGATTACTGCGGATGTGCTTCGCCAGAAGGATGCTAAGACTGGTAAGCCTTTGGTTGAGATGATGGTGGATCATGCTGGCATGAAGGGCACTGGTACTTGGACTGTACAATCTGCGCTTTCGCTTGGCATACCAGTTACGGGTATCGCTGAAGCCGTGTTTGCTCGCGGACTTTCCAGTCAGGTTGCTTTGCGAGAAGCTGCCGAAAGTCAAGGTTTGACTGGTCCGGATGTGCATTTTGATTTGAACGATGCTGAGCGTAAGGCGTTTATTGAGGATATTCGCCAAGCTTTGTACGCTTCTAAGATTGTTTCTTATGCTCAGGGCTTCGATGAGATTGCGGCTGCTGCAATTGAGCATGATTGGAAGATTGACCAGGCTGCAGTTGCTCGTATTTGGCGAGGCGGCTGCATTATTCGCGCTCAGTTCTTGAACCGCGTTTCCGAGGCTTTTGAGTCCGGTGAAGCAAGCGTTTCTTTGCTTTTTGCACCGTACTTTAAGGCTGCAATTGAAAAGTCTCAAGCTGCTTGGCGTAGGGTTATTGCGCGTGCGGTGGAGCATGGTGTGCCTGTGCCAGCATTCTCTAGCTTGCTTGCATACTACGATGGTTTGCGTTCTAAGCGTTTGCCGGCTTCGCTTATTCAGGCTCAGCGTGATTTCTTCGGTGCTCATACTTACGGTCGTATTGACGAGCCAGGCGTATTCCACACTTTGTGGGCAGATCCTAATCGCCCAGAAGTAAAGCAAGATTAG
Protein sequences of DBSCAN-SWA_1 >NC_013721|193640:201222|198577_199078_-|WP_004109846.1|DBSCAN-SWA MCEQENITQQSEPESSAQGESIGAVVYFSSVSENTLRFVQNCDFPSNNVNVYRIPLRPKEPELNVREPYVLIVPTYGGGNIAKAIPMQVRKFLNNRENRSFIRGVISSGNTNFGEAYCAAGDQIAGKCGIPNMYSFELTGTAEDIEKVAQGIPRFLKQLKETEETK >NC_013721|193640:201222|193640_193928_+|WP_012913778.1|DBSCAN-SWA MVNISKWDASEYLEDDDDVIAYLNAAAELNDPRLLQAAIGDIAKARGMKEIAQKAEVGRESLYKSLRRDGNPSFQTIAKVVRALGGRIAIEPANA >NC_013721|193640:201222|199081_199342_-|WP_012913782.1|DBSCAN-SWA MTITVFTKPHCPQCDATKRQFTKLGVNFETVDLSKSPESLEQLKNAGYRQAPVVIAPDASWSGYRPDLIAQIADKLGTSAEAAKEA >NC_013721|193640:201222|195104_196184_-|WP_012913780.1|DBSCAN-SWA MTELSPTELRSSLDKPFGTNRVIADDAMMAASITPSQYRYHHGSRVRPVNWNNIVDDKDLDVWNRLIANFWLPEKVPLSNDIPSWRSLTDLERKTTTRVFTGLTLLDTSQATIGELCQIEHARTEHEQAIYTNIAFMQSIHARSYSSIFSTLCSSEEIDEAYRWAVGNDVLQQRVTTVLCEYESEDPLKRKIAATMLSSLLLYAGFYLPLYFASRGKMMNTADMIRLILRDKAIHGYYSGYKFQRGLELRSENDKKNLEKFTMNLLDTLYDLEVEYSGQIYEGFDFHDDVFDFVRYNANKALMNLGYPAKYSEEETHVSPEILAALSPAADENHDFFSGSGSSYIMGKSVETDDDDWDF >NC_013721|193640:201222|199767_201222_+|WP_004109841.1|DBSCAN-SWA MTQAQANIGVVGLAAMGSNLARNLAHHGNTVAVYNRHYSRTETLMNEHGSEGAFVPSKTVEEFVASLKRPRTAIIMVKAGAPTDAVIEELANAMEPGDIIVDGGNSYFEDTIRRERDIRARGLHYVGCGVSGGEEGALRGPSMMPGGTEESWKTLGPILKSIAAVAEGEPCVTHIGTDGAGHFVKMVHNGIEYSDMQLIAESYDLMLRGLGMKSDEIADVFSEWNKGELDSYLIEITADVLRQKDAKTGKPLVEMMVDHAGMKGTGTWTVQSALSLGIPVTGIAEAVFARGLSSQVALREAAESQGLTGPDVHFDLNDAERKAFIEDIRQALYASKIVSYAQGFDEIAAAAIEHDWKIDQAAVARIWRGGCIIRAQFLNRVSEAFESGEASVSLLFAPYFKAAIEKSQAAWRRVIARAVEHGVPVPAFSSLLAYYDGLRSKRLPASLIQAQRDFFGAHTYGRIDEPGVFHTLWADPNRPEVKQD >NC_013721|193640:201222|194139_194733_-|WP_041160459.1|DBSCAN-SWA MGKMTYKKLLCVVVLFSSLVGLAACGNSEANKPVPTKISRHSTKKHSKPIKKASKKKIAEAFVKGQEGTLEALGETKRVIWDAYPDQLAYVKDAFYHWTLQDDWFQVNTGHYMVVDRLNSSGILLKKLQKGMPVKLNGKDYKVADYKDFEGGGSSDKLLKVFKEYVEQFPKSAYFIQTCLDARTYGGQRLVVLDVAK >NC_013721|193640:201222|196266_198474_-|WP_041160546.1|DBSCAN-SWA MSESTSLNLDHTDSASATASTTAERIRNSRDYHVLNAMLGLFDPEKGIQFDKDQEAEKLYVSGYVQEHSMKFASTRERLNYLIDNYYYNGNVFNKYSDEFLDKFYEHAEAAHHTFGTFLGAFKFYTSYALKTFDGKQYLENFAQRAAAVALELADGDEKAATNYLDEMLAGRFQPATPTFLNLGKAQRGEAVSCFLVRLEDNMESIARGINSALQLSKRGGGVALLLTNLRELGAPIKRIENQSSGVVPVMKLLEDSFSYANQLGARQGAGAVYISAHHPDIMRFLDAKRENADEKIRIKSLSLGVVIPDITFELAKRKEKMALFSPYDVEREYGKPFADISVTEHYDEMVANPRIHKKYIDAREFFMTIAEIQFESGYPYILFEDTVNRTNPIEGRITMSNLCSEILQTQEASTYNADLSYSHVGRDISCNLGSLNIAKAMDGDLGTSVDLAVRALTSVSEHTSIDSVPSIRRGNAEGHAIGLGQMNLHGFLAREHMQYGSEEALDFTDMYFMTVAYHAYKASHALAVERGHAFANFKTSAYAKPAGKGNYFDKYTDGSRSLEPKTQKVRDLFARFNVAIPTVSDWEQLRDEILSDGIYNEYLQAVPPTGSISYINHSTSSIHPIASRIEIRKEGKVGRVYYPAAYMTNDNMQYFKDAYEIGWKAIIDTYAEATQHVDQGLSLTLFFPDTTTTRDLNKAQIYAWRKGIKTLYYIRIRQKALEGTEIEGCVSCTL |
7 | Burkholderia_virus(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
987303 : 996636
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_013721|987303:996636|DBSCAN-SWA AATGGAAAAATACAACAATTGGAAACGAAAATTTTATGCAATATGGGCAGGGCAAGCAGTATCATTAATCACTAGTGCCATCCTGCAAATGGCGATTATTTTTTACCTTACAGAAAAAACAGGATCTGCGATGGTCTTGTCTATGGCTTCATTAGTAGGTTTTTTACCCTATGCGATTTTGGGACCTGCCATTGGTGTGCTAGTGGATCGTCATGATAGGAAGAAGATAATGATTGGTGCCGATTTAATTATCGCAGCAGCTGGTGCAGTGCTTGCTATTGTTGCATTCTGTATGGAGCTACCTGTCTGGATGATTATGATAGTATTGTTTATCCGTAGCATTGGAACAGCTTTTCATACCCCAGCACTCAATGCGGTTACACCACTTTTAGTACCAGAAGAACAGCTAACGAAATGCGCAGGCTATAGTCAGTCTTTGCAGTCTATAAGCTATATTGTTAGTCCGGCAGTTGCAGCACTCTTATACTCCGTTTGGGATTTAAATGCTATTATTGCCATCGACGTATTGGGTGCTGTGATTGCATCTATTACGGTAGCAATTGTACGTATACCTAAGCTGGGTAATCAAGTGCAAAGTTTAGAACCAAATTTCATAAGGGAGATGAAAGAAGGAGTTGTGGTTCTGAGACAAAACAAAGGATTGTTTGCCTTATTACTCTTAGGAACACTATATACTTTTGTTTATATGCCAATCAATGCACTATTTCCTTTAATAAGCATGGAACACTTTAATGGAACGCCTGTGCATATTTCTATTACGGAAATTTCCTTTGCATTTGGGATGCTAGCAGGAGGCTTATTATTAGGAAGATTAGGGGGCTTCGAAAAGCATGTATTACTAATAACAAGTTCATTTTTTATAATGGGGACCAGTTTAGCCGTTTCGGGAATACTTCCTCCAAATGGATTTGTAATATTCGTAGTTTGCTGTGCAATAATGGGGCTTTCGGTGCCATTTTATAGCGGTGTGCAAACAGCTCTTTTTCAGGAGAAAATTAAGCCTGAATATTTAGGACGTGTATTTTCTTTGATCGGAAGTATCATGTCACTTGCTATGCCAATTGGGTTAATTCTTTCTGGATTCTTTGCTGATAAAATCGGTGTAAATCATTGGTTTTTACTATCAGGTATTTTAATTATTGGCATTGCTATAGTTTGCCAAATGATAACTGAGGTTAGAAAATTAGATTTAAAATAAACAATATTGGAGGAATATTTATGTATCTTATTTTCATGTAACTCTTCCTGCTAAAATCGCAGGGTTTTCCCTGCATACAAGCAAATGAAAGCATGCGATTATAGACAGGAGGAAATGTTATGGAATTAATATTAAAAGCAAAAGACATTCGTGTGGAATTCAAAGGACGCGATGTTTTAGATATAAATGAATTAGAAGTATATGATTATGACCGTATTGGTTTAGTAGGAGCAAATGGTGCTGGAAAAAGCACTTTACTCAGGGTACTTTTAGGAGAATTAACTCCCCCAGGATGTAAAATGAATCGTCTGGGTGAACTTGCCTATATTCCCCAGTTGGACGAAGTAACTCTGCAGGAGGAAAAAGATTTTGCACTTGTAGGCAAGCTAGGTGTTGAGCAATTAAATATACAGACTATGAGCGGTGGTGAAGAAACAAGGCTTAAAATAGCACAGGCCTTATCGGCACAGGTTCATGGTATTTTAGCGGATGAACCTACGAGCCATTTAGACCGTGAAGGAATTGATTTTCTAATAGGACAGCTAAAATATTTTACAGGTGCACTGTTAGTTATTAGCCATGACCGCTATTTTCTTGATGAAATAGTAGATAAAATATGGGAACTGAAAGATGGCAAAATCACTGAGTATTGGGGAAACTATTCTGATTATCTTCGTCAGAAAGAGGAAGAACGTAAGAGCCAAGCTGCAGAATACGAACAATTTATTGCGGAACGTGCCCGATTGGAAAGGGCTGCGGAGGAAAAGCGAAAACAGGCTCGTAAAATAGAACAGAAGGCAAAAGGTTCTTCAAAGAAAAAAAGTACTGAAGACGGAGGGCGTTTAGCTCATCAAAAATCAATAGGAAGTAAGGAAAAAAAGATGTATAATGCTGCTAAAACCCTAGAGCACAGGATTGCGGCCTTAGGAAAAGTAGAAGCTCCGGAAGGCATTCGCAGAATTCGTTTCAGGCAAAGTAAAGCATTGGAGCTCCATAATCCATACCCTATAGTCGGTGCAGAAATTAATAAAGTATTTGGGGATAAGGCTCTGTTTGAAAATGCATCTTTTCAAATTCCGTTAGGAGCAAAAGTGGCGTTAACTGGTGGTAATGGAATCGGAAAAACAACTTTAATCCAAATGATCTTAAACCATGAAGAAGGAATTTCTATTTCGCCTAAGGCAAAAATAGGTTACTTTGCACAGAATGGTTACAAGTACAACAGTAATCAGAATGTTATGGAGTTTATGCAGAAGGATTGTGACTACAATATATCAGAAATTCGTTCAGTGCTAGCATCTATGGGGTTCAAACAGAACGATATTGGAAAAAGTTTATCTGTTTTAAGCGGTGGAGAAATTATAAAATTGTTGCTTGCTAAAATGCTCATGGGTAGATATAACATCCTAATAATGGATGAACCCAGTAACTTCCTTGACATACCAAGTTTAGAGGCTTTGGAAATACTAATGAAGGAGTACACCGGAACTATCGTGTTTATCACCCACGATAAACGATTACTCGAAAATGTAGCAGATGTAGTTTATGAAATTAGAGATAAGAAAATAAATCTGAAACATTAAATTTAAGGTAGTCGCTGGTCAGTATAGTCTGTTCTGGTTGGCGACTCCATTGTTAAAGAGTATAAAGACTTTAGATTTTATGAATATTAAAAATAGGAACAGTCAATTGAACTGCTCCTATTTTTCTGCTAAATATATTGTAGTTTTCTTATATGTATAATGATAGATTAGCGGATTCTCATCTACGGTACTTACTTCAAATATGAAGAAGTGATCGCGGTTACCTGCATATTTGTTTTCTAACGCTTTTAGTGCGAGTAGGTTATCTCCATGTATGAGCATGTTTTCTGTATTAGGATCATTTTTGCAGTTTGATTTTTCTTTATCCTCAACTAGAATCCTAGGCTCTATTTTGATTTCTTTATCCTTACCAAGCCACGTAAGTTCCAGTTTTTGATTATTAGCCATTACTTTTCATCCTCCGCTATAAATTCCAAAATATCGTTGGGCGTGCAGTGTAAAGCTGTGCAGATACCTTCCACTTTTTCCAGTGATATATATTGCCCAGTGCGTAGTTTCGTAATAATGTTGGCGGAAATATTGCTCATTCTCATTAAGTCCTGATTGGAAATATCTTCTTCAATCATGCGATGTAATAGTTTTTTATAACTAACAGCCATAATTCCCTCCTTATTTTAATAGTTAAATATTATCACGATTCTGTGAGTATATCAACGATATAAAGAGCTAGCCACTCCGGAATATAGCAGATTAAAAAAGTCTCCTTTGCTATTCATTCAAGGCAAATGCCTAATGAATGCTTAGGAGATTTTTATTGTCAAAAATTTTTTCTAAAACCGTCAGATTACACCCTCTTTCATGGCTACCAAGTAGAAGACCAAAAAAACTTTTTAAGACAGGTTAAAAAGTCGGCTTTTTCTTTTGCCTGTGATGTAGAAGGGATAAGCCTTCTAGAAAAGGAGGATCAAATATGGATCTAGAGAAACTGCCAAAAGCACTAAAGAACAAGAATATCTTTTGTTCATGGAAGTACGAGCAGCAAGAAAACGGTAGGCTCACTAAGCCGCCTTATAACCCCATAACGGGAAAGAAAGCAGCTACAGACAACTTAAGCGATTTCACCTCGCTTGATGATGCCATGAAAGTAAAAGACCAATACGACGGAATTGGTATAAAGCTTGTTGATAACCTCTTAGGCATCGATTTAGACCACTGCATCGATAATGGCATCATTAAAGACTGGGCAAAAGAGATTATTGCCCATTTCAAGCACGCTTATATCGAATACAGCCCAAGTAAGCATGGCTTTCATATCTTGCTTCTGTTTAACGGCGCATACGATAAAAAGTCTTATTACATCAAGCATGGGGATATTGAGGTTTACAGCGCTCTAGCTACGAATCGCTATCTAACAGTTACGGGCGACGTGTATCAGGAAGGCGAGCTTTTAGAAGACGATGAAGCCATCATGTGGCTTTTAGACACGTACATGAAGCGAAACATAAACACTCATGAAAAAGAAAATAACGCGTGTACAAGGTCATATTTAAGCGATACATCTGTTATTGAAAAAGCTGGCAACGCCATAAATGGAGCTAAATTTACAGCACTTTGGCATGGTGACATTTCGGCTTACCCTTCTCATAGCGAAGCAGATGCCGCGCTTGTGTCTATTCTTGCCTTTTACTGTGGCGGTAACAAGGAGCAAATAGACCGACTTTTTCGTAAGTCTGCTTTATTCCGTCCTAAGTGGGATGAACACCGTGGCGCGGATACTTATGGCAACACAACCATTCAGAAAGCTGTTTCTAAATGTCATACCTTCTATGAGCCTTTCCATAATGCAAGTGCTGAGGAAGACTTTGATGATCTCTTATCTAAATTACAGGAACTAAAGCCTGATGAAAATATACGATATCGCAACGGTGATTTAGGTAATGGCAGATTGTTTGCTGATATTTTCCAAAACATTCTGCACTATGTACCGGAGCGCAAAATGTGGTTCATCTTCGATGGTGTACGTTGGACTTGCGATATTGGAACGCTTAAAACGATGGAACTATGCAAGGATTTAGCTCTTTCTCTTATACGTTATGCTGGTGTCATTAAAGACGAGAGAACCCGTTCAATGCTTGTTGAGTATTGGAACAAATGGAGCAGCCGTCGTAACCGTGAGATTTATATCAAGGAAGCTCAAAGCGTTTACCCCATTTCGATGGAAGCGTTCGATAAGAACATTTATCTATTTAACTGCCAAAACGGAACTTTAGACCTTCAGCACGGTGTATTTCGCAAGCACTTAGCAACAGATCTTATTACCAAAGTATCACCTGTGTTTTATGACCCGAAGGCAAGAAGTGAACGTTTTCGCCAGTTTGTCGATGAAATTATGAGCGGAGATTGTGAGAAAGCCTTGTATCTTCAAAAGTCATTAGGCTATGCGTTAAGCGGTGATACACGCTATGAGTGCATGTTCTTTCTTTTTGGAGAAAGCACAAGAAACGGTAAAGGCACACTCATGGAAAGCGTTCTGTCTGTGATGGGCGATTACGGAAAAGCGGTACGCGCGGAAACCATTGCTCTAAAGAAGAACCCTAATAGCTCACAGCCGACCGAGGATGTAGCACGCCTTGCCGGTGTTCGCTTTGCCAATATATCTGAGCCGAGCCGAGGACTATTCCTTAATTCAGCCCAAGTTAAATACATGACGGGAAGTGATACCTTAAACGCTCGATTCCTGCATGAGAACAGTTTTGACTTTAAACCACAGTTCAAACTGTACGTGAACACCAATTATCTGCCCGTCATCAGCGATATGACAGTGTTTAGTAGCGACCGCATGCAGATTATTCCCTTTAACAGGCATTTTGAAGCGTGGGAACAGGATAAAACCTTAAAGGCGGAGTTTTCTAAAAAAGAGGTGCAAAGTGCCATTTTAAACTGGCTTCTTGAAGGCTTTACCCACCTCCGAGATGAAGGCTTTAAGCCGCCTAATAGCGTTCTTGATGCCATTTTTAGTTATGCCCACGACAGTGACAAGATGGCACAGTTTGCTGAAGATGTACTCGTTAAAGACCAGAGCTCTGAGATTCGAACGGCAGTTGTATACGACCACTACAAGAAGTGGTGCATTGATAACGGCTGTTTTTCTGAGAACAGCAGAAACTTCAATCAGGAATTAAGAAAATTTGCCGAGGTTGTAAGAAAGCGTCCCAAGACAGGCGGAGAAAAAACCACGCTTCTTAAAGGCTACAGGCTTAAAAGTGAGTTTAGCGATTTTTTAAGTGGCAACTAGTGGCAGTTAATAACCTATATCCCCATATAACCCTATAAGGCAATAACGAAATTTAACTGCCACATCATGCCACGTTTAAGGAGGTGGTGCCTATGATTGCGTAATTCGTATCGCAAACCTGTGTTGAACATGAAGAAGAAATACTATCATGCACAAATCAAAATACTGAAATGATTTCAGCTTTTTGACAACATTGAAGAAAGAAAACCTGCCTAGGTACTGTGCTGTTAGAAATTTATTCGCATACTACTTTCTGAACCTAACCCCCTAGGGGGGGTCAAATCTCTAAAGAGTACCCCTTCGGGAACGGGCGCAGGGTCTTGCGTGCAAAAACAGCGAATTCAAAAGGGTAAATGGGCAAAATCAAGCAAATTACTGCAGTAATTGCAGAAAACAGTAATGAAAGGAACTAACAATTATGATTAACAGAGATATTGAAAAAGAAATGAAACTAGCAAGTCAGGACTACCACAACACGTTTTGGAACGTACTGCGCGACGCAAATGGTAATGCGAACAATATTGCAGGAAAACTGCGCCACATTCATACAGGTGCCTATGATTTGCCGACTTCTTCTCTAGCAAAACTCGAAGCAAAACTTGCAAACAAAAGTAATCTTAGAAAACTGTGTACAGTGGTAAAAGCACACTCGCAATCAAGCAAATTGTTCATCTATGAAGGTACACCGTTCACTAAATGGATGAAAGATTATGATGCGAATATCATTACGCATATGCCTATTAAAGATGGCTTTAAGCGTATTGATGTAACGGATTATACCCTAGTTAACCTTATTCGTTTGGGTACGGATTTTACAGCTGATCTTGACTTTGATATCGAAGAGTTCATCACAAGCGAACTGACAAAGCAATTTACATTGAGTGAGTCGGAAGCTTTTATAAATGGCGATGGGATAACTAAGCCGACAGGAATTTTATCAGATACATCAGGTGCTGAGGTTGCTGTGCAAGCAAAAGAGATTTCCGCAGATACCATCAAACATTTATTCTTCTCGCTAGATAGTAAGTATCGAGAAAATGCCACGTGGCTTATGAATGATGAAACTGCTCTTTATCTTCAAACCTTAAAGGACACTACCGGAGCATACATCCTACCTGATTTTGATGGCAACTTAATGGGCAGACCTGTGGTCATTGACAATGCCATGCCTTCTTCGCAAAAGGGAGCAAAAGTTATCGCCTTTGGTGATTTTTCCAACTATTGGATTATCGAACGCCATAACCCAACCATCAAGGTTCTAAACGAAATATTTTCTCTTATAGACCAGATAGGTTACCTCTCTTTTGAGTATTTGGATGGGAAACTGGTACATCCTGAAGCAATTAAAGTATTGCAACTTTCTTAAACAAGGAGGTCAAAATGCATGGATGAACAAATTAAGGATACACCGGTAAAAACCGAAGTGTTAAGAATCGGAAATACTGAATTTACGGTTAACTCTCTTGAAAGCAAGAGTGCTCAGAAAGGTGTCAAAGACCTAATCAAACACCTTATTTTAAGCAATTTAAACCATGTAATAGAAGCTAAATAACTTGATAAATTCTGAGTTATACGGGAATATATACTGACCGCTTGAAGGCTGTCGGAAAGGAGGAAAAACAGATGAATAAACTAAATAGACAGCCTTCTTATACTCTAGATAATGTAGGCAAGATAACGGCACTTTACTGCCGTTTATCCAAAGACGATGAATTACAAGGTGACAGCAATTCCATTATCAATCAAAAGAGCATGCTGAAAAAGTATGCCGAGGATAATCATTTCAACAATCTTATGTTCTTTGTAGACGATGGCTACTCGGGTACTAATTTCCAGCGACCTGATTGGCTTAAATTAACTGCTCTCATTGACGAAGGAAAGATTGGAACCATTATCGTCAAGGACATGTCACGCTTGGGTAGAGATTATCTGCAAGTTGGTATGTATACAGAGATGGTATTCCCTAACGCTGATATTCGCTTTATTGCAATAAATAACGGCGTAGATAGCAATAATCAGTCAGACAACGACATGACACCTTTCATCAACATCTTCAATGAGTTTTATGCTAAAGACACAAGCAGAAAAATAAAAGCTGTGTTTAAAGCAAAAGGACAAAGTGGGAAGCCGCTCTGCATCAATCCTCCTTACGGATATCTTAAGGATCCTGAAGATAAAAATCATTGGATTGTAGATGAAGAAGCTGCCAGTGTAGTGCGTGAAATCTTTAGACTCTGCGTTAATGGGTATGGGCCTAGCCAGATTGCCAATGAGCTAATCAAAAGGAATATTCCAACACCTTCTGAGCATTTTTTCTCTCTTGGTATTAAAATTCCTTCCGCGAAATCGGAGATTAAGGGAGTCTGGAATCAAAAAACTATCTCTAATATGCTTGAAAAGCAAGAATATCTTGGGCATACGGTCAATTTCAAAACAAGAAAGAAATCGTACAAGTGCAAGAAAACATTATTGAATCCCAAAGAAGACTGGCTAATTTTCAAAAACACACACGAAGCAATTATTGACCAGGAAACTTTTGATATTGTTCAACGTATAAGAAATGGTAGACGAGTGCGTACCAATCTTGGAGAGATGCCGGTGTTATCGGGAATGCTATTTTGTGCGGATTGTGGTAACAAGCTTTATCAGGTTCGAGGAAAAGGTTGGAGCCATGATAAAGAATACTTTGTCTGTGCTACATATCGTAAGCAAAAAGGCAAATGCTCATCACATCAAATTAGAAATATTCAGATCGAAGCAATACTGCTTCACGAGCTACGAATGATTACTTCATTTGCCAAACAGCACGAAGAAGAGTTCGTGGGACTTGTGATGAAGAAAAGCGAGAAAGAACTAACCCAAAAGTTAAAGTCCTCTAATAGGGAACTCGAGCAAGCAAAAGCAAGAATTAGCAAACTTGACACAATTGTTCAGCATCTCTATGAAGACAACTTAGACGGTAAAATTTCTGATGAACGATTTAAGAGCATGTCTGAGTCCTACGACAAGGAACAAGCTGAACTGAAAAGCAAGATTGAATCCCTTGAAGCCTTTATTTCTAAAGCGCAAGAGGAATGTCTCAACGTGGATTCTTTCTTGAAATTAGTACGACAATACACAGATATACAAGAACTTAATGCAGAGATCATTCGAACCTTTGTAGATAAGATTTATGTTGAAAAATCTGAAAAGGTAGCAGGTACAAGAACCAAGAAACAAACCATATGGATACAGTGGAATTACATAGGTGCAGTTGATATTCCACTCCATAAGTAA
Protein sequences of DBSCAN-SWA_2 >NC_013721|987303:996636|994983_996636_+|WP_012914238.1|DBSCAN-SWA MNKLNRQPSYTLDNVGKITALYCRLSKDDELQGDSNSIINQKSMLKKYAEDNHFNNLMFFVDDGYSGTNFQRPDWLKLTALIDEGKIGTIIVKDMSRLGRDYLQVGMYTEMVFPNADIRFIAINNGVDSNNQSDNDMTPFINIFNEFYAKDTSRKIKAVFKAKGQSGKPLCINPPYGYLKDPEDKNHWIVDEEAASVVREIFRLCVNGYGPSQIANELIKRNIPTPSEHFFSLGIKIPSAKSEIKGVWNQKTISNMLEKQEYLGHTVNFKTRKKSYKCKKTLLNPKEDWLIFKNTHEAIIDQETFDIVQRIRNGRRVRTNLGEMPVLSGMLFCADCGNKLYQVRGKGWSHDKEYFVCATYRKQKGKCSSHQIRNIQIEAILLHELRMITSFAKQHEEEFVGLVMKKSEKELTQKLKSSNRELEQAKARISKLDTIVQHLYEDNLDGKISDERFKSMSESYDKEQAELKSKIESLEAFISKAQEECLNVDSFLKLVRQYTDIQELNAEIIRTFVDKIYVEKSEKVAGTRTKKQTIWIQWNYIGAVDIPLHK >NC_013721|987303:996636|987303_988521_+|WP_000417519.1|DBSCAN-SWA MEKYNNWKRKFYAIWAGQAVSLITSAILQMAIIFYLTEKTGSAMVLSMASLVGFLPYAILGPAIGVLVDRHDRKKIMIGADLIIAAAGAVLAIVAFCMELPVWMIMIVLFIRSIGTAFHTPALNAVTPLLVPEEQLTKCAGYSQSLQSISYIVSPAVAALLYSVWDLNAIIAIDVLGAVIASITVAIVRIPKLGNQVQSLEPNFIREMKEGVVVLRQNKGLFALLLLGTLYTFVYMPINALFPLISMEHFNGTPVHISITEISFAFGMLAGGLLLGRLGGFEKHVLLITSSFFIMGTSLAVSGILPPNGFVIFVVCCAIMGLSVPFYSGVQTALFQEKIKPEYLGRVFSLIGSIMSLAMPIGLILSGFFADKIGVNHWFLLSGILIIGIAIVCQMITEVRKLDLK >NC_013721|987303:996636|988640_990104_+|WP_000420313.1|DBSCAN-SWA MELILKAKDIRVEFKGRDVLDINELEVYDYDRIGLVGANGAGKSTLLRVLLGELTPPGCKMNRLGELAYIPQLDEVTLQEEKDFALVGKLGVEQLNIQTMSGGEETRLKIAQALSAQVHGILADEPTSHLDREGIDFLIGQLKYFTGALLVISHDRYFLDEIVDKIWELKDGKITEYWGNYSDYLRQKEEERKSQAAEYEQFIAERARLERAAEEKRKQARKIEQKAKGSSKKKSTEDGGRLAHQKSIGSKEKKMYNAAKTLEHRIAALGKVEAPEGIRRIRFRQSKALELHNPYPIVGAEINKVFGDKALFENASFQIPLGAKVALTGGNGIGKTTLIQMILNHEEGISISPKAKIGYFAQNGYKYNSNQNVMEFMQKDCDYNISEIRSVLASMGFKQNDIGKSLSVLSGGEIIKLLLAKMLMGRYNILIMDEPSNFLDIPSLEALEILMKEYTGTIVFITHDKRLLENVADVVYEIRDKKINLKH >NC_013721|987303:996636|994744_994912_+|WP_004121940.1|DBSCAN-SWA MDEQIKDTPVKTEVLRIGNTEFTVNSLESKSAQKGVKDLIKHLILSNLNHVIEAK >NC_013721|987303:996636|990221_990512_-|WP_004121959.1|DBSCAN-SWA MANNQKLELTWLGKDKEIKIEPRILVEDKEKSNCKNDPNTENMLIHGDNLLALKALENKYAGNRDHFFIFEVSTVDENPLIYHYTYKKTTIYLAEK >NC_013721|987303:996636|991035_993360_+|WP_004121944.1|DBSCAN-SWA MDLEKLPKALKNKNIFCSWKYEQQENGRLTKPPYNPITGKKAATDNLSDFTSLDDAMKVKDQYDGIGIKLVDNLLGIDLDHCIDNGIIKDWAKEIIAHFKHAYIEYSPSKHGFHILLLFNGAYDKKSYYIKHGDIEVYSALATNRYLTVTGDVYQEGELLEDDEAIMWLLDTYMKRNINTHEKENNACTRSYLSDTSVIEKAGNAINGAKFTALWHGDISAYPSHSEADAALVSILAFYCGGNKEQIDRLFRKSALFRPKWDEHRGADTYGNTTIQKAVSKCHTFYEPFHNASAEEDFDDLLSKLQELKPDENIRYRNGDLGNGRLFADIFQNILHYVPERKMWFIFDGVRWTCDIGTLKTMELCKDLALSLIRYAGVIKDERTRSMLVEYWNKWSSRRNREIYIKEAQSVYPISMEAFDKNIYLFNCQNGTLDLQHGVFRKHLATDLITKVSPVFYDPKARSERFRQFVDEIMSGDCEKALYLQKSLGYALSGDTRYECMFFLFGESTRNGKGTLMESVLSVMGDYGKAVRAETIALKKNPNSSQPTEDVARLAGVRFANISEPSRGLFLNSAQVKYMTGSDTLNARFLHENSFDFKPQFKLYVNTNYLPVISDMTVFSSDRMQIIPFNRHFEAWEQDKTLKAEFSKKEVQSAILNWLLEGFTHLRDEGFKPPNSVLDAIFSYAHDSDKMAQFAEDVLVKDQSSEIRTAVVYDHYKKWCIDNGCFSENSRNFNQELRKFAEVVRKRPKTGGEKTTLLKGYRLKSEFSDFLSGN >NC_013721|987303:996636|993778_994726_+|WP_004121942.1|capsid|DBSCAN-SWA MINRDIEKEMKLASQDYHNTFWNVLRDANGNANNIAGKLRHIHTGAYDLPTSSLAKLEAKLANKSNLRKLCTVVKAHSQSSKLFIYEGTPFTKWMKDYDANIITHMPIKDGFKRIDVTDYTLVNLIRLGTDFTADLDFDIEEFITSELTKQFTLSESEAFINGDGITKPTGILSDTSGAEVAVQAKEISADTIKHLFFSLDSKYRENATWLMNDETALYLQTLKDTTGAYILPDFDGNLMGRPVVIDNAMPSSQKGAKVIAFGDFSNYWIIERHNPTIKVLNEIFSLIDQIGYLSFEYLDGKLVHPEAIKVLQLS >NC_013721|987303:996636|990511_990724_-|WP_004121956.1|DBSCAN-SWA MAVSYKKLLHRMIEEDISNQDLMRMSNISANIITKLRTGQYISLEKVEGICTALHCTPNDILEFIAEDEK |
8 | Streptococcus_phage(42.86%) | capsid | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1115132 : 1128382
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_013721|1115132:1128382|DBSCAN-SWA TATGAGTTCTGGACGTGGTGGAGCTAGAGTTCGCAGTGGTCCCGCGCCTGACCCACATTCGCGCAATAGTTTGCGAAAAGGAGTTAAAGCTCTAGTACTTTCCGCTGATGGTTTTGCTGGCGAAATCCCTAAATTCCCCTTGCCGCAGTTTCGTGTTGTTGATTCTGAAGGAGTTAGGGATACTAATGGAGAAAAGCGTTTTGCTAGTCGCGAGAAGACTGTTTGGAGACATTTGTGGAGTTTGCCACAAGCTGAAGCTTGGAGTATGCCTGAGTATTCTTACATGTTTTTTGAGATTGCTTTATATGCGCGTCAGCTTGTGATTTGTGAGCAGGCTGGCGCGAGTGCTGCTGACCGTGGCTTGTTGCCGCGTTTTGCTGATCGCATTGGTCTTAGTGAAGCTGCTATGGCTGGTTTTGGTTGGAGTATTCGCCGTGATTCCGTTGTTGATTCTGCTCGCGATTTGAGTGTGATTGATGGTGTTCGCGTGTTACCTCGTCGTATGCGTGGTGGTGATGATGCTTAGTTCTGATAATTGGCATGTTGATTTTCCTACTCTTGCTGATGTTATTGATGCGTGGATTCAAGCGCATTGCAAACAACCTGATGGCTTTAATCGCGGTAAGCCATTTGTGCTTGCTGACTGGCAGTTTTGGTTGGCAGCGAACCGTTGGCGTATTCGCGAGGATGCTAAATTCGTGCCGCCTGAAGAGGTCACGACTGATAATCCTATGGTTTTGAATCAAGCGTTTACGTATCGCCAAACATTGTGTGTTGCTCCTCAGAAGACTGGTAAAGGTCCATGCGCAGCGGCATTTGTTGCTGCTGAAGCTGTTGGTCCTACTGTTTTTAACGGTTGGGCGAAAAAGGGAGATGTTTACCGTTGCTCTGATTGGGGTTGTTCTTGTGGTTTTGAGTATGCGTACTTGCCGGGCGAACCTATGGGTAGACCTCAACCATCACCTCTTATTCAGTTGACTGCTAACAGTGAAGACCAGGTACTTAACATGTATCGTCCTTTGAAAGCTATGGTTTTGCTTGGCTCTTTGAAGGAACGTTTGCGTGTGCGTGAGGGTTTTATTCGTGTGCTTAATGGTGATAGTGATGTGAGTGACGCGGCTGACCTTGACCGTATTGATATTGTTACGTCTTCTGCTCGTAGTCGTTTGGGTAATCCTATTACTGACGCTGAACAGGATGAAGCTGGCTTATATACGTCTTCTAATGGCATGGTGCAGGTTGCTCAGACTCAGCGGCGCGGCGCGGCTGGTATGGGTGGGCGCACTCACGCGTGGACGAACGCTTATGACCCTACTGAGAATAGCTACGCTCAGCAGATTGTCGAGAGTGGTGACCCTGACGTTTTTGTGTTTTATCGTAATCCGGATTTGGCTCCTGAGTTGAGACGTGAAGATGGTTCTTTGATGTCTTTTTTGAAGCGTAGTGAACGCCGCAAAATCCTTGAATACGTATATAAGGGTTCGCCTTGGGTTGATTTGGACAGTATTGAAGCTGAAGCAGCTAGCTTGCTTAAGACTGACCCATCGCAGGCGGAACGCTTTTTTGGCAATCGTCTTGTTCAAGGTGCTGGAGCTTGGATTGAGGAAGCACAGTGGGCTGAAGCTTATGGGGGGTATCAAGATTGAGTAAAAAACATGAGTTATGGCTTGAGAACCCCCCAGCAGGCACTAGCGTGTGTGGTGGTTTTGATGGTTCTGAGAATGATGATTTTACGTGTTTTAAGCTCGAAACGTTGAGTGGTTTTATTTTCACACCTCGCTATGGGTATGATCAGCGTCCTACGATTTGGAATCCTAAAGAGTGGGGTGGGCGTATTCCTCGTGGTGAAGTTATTGCTGCTATGGATGAGCTTGCTCACAAGTATAAGTTTGTGCGTATTTATTGTGACCCGGGGTTTAAGGATGAGATGAGTTGGGAATCTCAAATTGATGCTTGGGCGCGCGCGTATGGTGAGCGTGTTTTTGTTCCGTGGGTTATGAACGGTAGTAATCGTATCACGGCTGTTTATAAGGCGTTGCGGCGGTTTGAGGAAGATTTAAGCACACACCGTATTACGCATGATGGTTGTCCTATTACTAACGCTCATATTGTGAACGCGCGGCGTATCCCTAAGACAGCTGAAAGATACGGTCTTGGTAAACCTCAGCAGGATAGAAAGATTGATGCTGCTGTTGCAACGATTCTTGCACATGAAGCAGCGTGTGACGCGCGCATGGATGGTTGGGGTGAAGAAAAGCCGAATCTTATTTATTCGCCGTCTAATATGAGGAGACTTAGATGACACTTAATATTGATGATGTAAATGTGCTTCTTCGTAGGTTGTTGATGCAGGTTATTGGTCGTCAAGCAGATATTACAAAGCATGTTGAGTATTTTCGCGGACGCCGCGGTAAACTGCCGTTTACTAGCCGTGAGTTTAAGAAGTATATGGAGAATCGTTTTAACTCGTTCTCTGATAACTGGTGCTCTACCGTGGCGCAAGCGCCGGTTGAGCGTATTCATTTTCAAGGGTTTATTACGCCTGATAGTAGTGTTGCTCCTGATTCTTTGCATCGCGTGTGGCAGGATTCTGATGCTGACCGTGGTCTTAGTGAAGCGGCACTTATGATGATGGTTGCGCGCCGCTCATATGGGTTGGTTACTCAAATGGCTGACGGTCATGCGCGTATTACTTTTGAGAATCCTGATTCTTGCGCTATGGAGTTTGACCCACGTACAGGCGCGCCTACTGTAGGCTTAACTCTTGGTGGCGAAGGCTCTGACACTGGCGTACTGTATTTTCCTGACTGTTATGTGCTTGTTCGTAAAAATGCTGATATGGCGTTTGGTTCTACTGGCGTGGAGAGTTGGGCGGTAGACGAATCTACTTTTCAGCCGAATCCGCTTGGTGTTGTTCCTATGGTTGAGTTTCGCAATCAGTGTATGCTTGACCGTTCTCCAATTTCTGATATTGAGCAAGTTGAAGCTATGCAGGATACTGTGAATGTTATTTGGGCTTATCTTCTTAATGCTCTTGATTATGCTTCTATGCCGGCTCGTGTGATTCTTGGCGGCGAACGTTTGCAAGAGGGTGTTTATGATACTAACGGCACTCTCGTTGGTAGTCGTCCGGCTGACCTTGAAAAGCAGATGATGGATAGGATCTATCAGATTACTGGCGATGGTGTGAAAATTTCAGAATGGCAGCCAGCTAATTTAGACGCGTTTTTGCCGGTTATCAAGAAGGCTGTTGAACATATTGCGGCTGAGACGCGTACACCGTCTCACTATTTGCTTACGTCTACTGAAGTGCCAGCGACTGGTTATGAGGTTGCCGAAGCAGGTTTAGTCAATAAGATTCTTGATCGTATTTCTTATTTGCGTGGCGGCGTGAAGCAGTTATGTTGGCTCGCAATGCTTACTGAAGGTGATAAAAATGCTGCTTTGCTTGTGCGTAATAGTACTGTGAAGTTTGCTAATCCTCAGTATCGCAGTGAAGCGCAAATGATGGACGGACTTATTAAAATGCGTCAAGCTGGGTTCCCGTTTGAATATGTTGCTGAATATGCTGGGTTGAGTCCTCGTGATGTTGAACGCGTACTTATGATGCGCGAGAAAGAGATGAGTGACCCGACACTTGAGAAGATTGCAGGTCAGTTGAAGCATGACGCGTAATCTTCAAGCTGTGGACACGTTCCATAAGCGCGTGGCGGCTCGTGAGGCTGTTGCTGTTCGCGCTGCACGTCACGCGTGGAGTCGTGTGGATAAGAACGATATTCGAGGTTCCTGGGCACGCGTTAAACAGCCTTTAGTGGACGTGTTGAGTGACGTGCAGGAGCAGGCTGTTGACGCTGGCTTGGATGCGAATGTTGATGTAATGGCTGAGATTGGTCAATATTCGGCGCCTGAAGCGCTTGTAGATTCGCATGCTTTTTCAAAGTTTAGCGCCGGCGGCGTGCCGTTGGACGAGTGGGTTGACCGTCCAGCAATTCGCACTCTTGAACTTATTAAAGAGGGTGTTGGTGTTGATGAAGCGTTAGGACGCGTCGGAGGTTCGTTTAGTTCTAGTGTTGCAACAAACCTTGCTGATGTGATTAGACAAGCGCAACAAGCGGATATTGCCACTCGCAGGCATATGGGTTTTATTCGTTGTTGCAATGCGGACGCGTGCAAATGGTGCATCGTATTAAGCGGCAAAATATACCGTTATAACACTGGCTTTGAGCGTCATATGAATTGTCATTGCTACCATTTGCCCGTCAACTTGGACGATGTTGGATCTGTTATGGATATTGCTCCATCTCCGATTGAACTGTTTAACAGAATGAGCGAGTCTGAGCAGAATAAAACTTTTGGAATAGTTGGTGCAAGAAGCATTCGTGCTGGTGCCGATATTGGACAGATTGTGAACAGCAAGCTTGATAGCGCACGCATTACCAAGTCTAGCCGTTCTTATTTCGCATTACAGCTTAAAGATAAGGGGTTTACTCTCCCCTCTGCTAAAGACAAGTCGCGCAAACCATTTAGAAGGCTAACTGTTGATGAATGTTGGGCATCTGGGAATCGTAAAGAAGCCATACAGAGACTTAAGGATAACGGTTACATTCTTCCTCGCGATTTTAGGTATGAGCACAGGAAAAACATTAGCGGATTGTGGGAGTTAAACCCAGCTGTTAGACGTGCTGAGCAGATTTATCAAGAAGCTATTAAAAGCGAAGATTTAGCGCGCATTGCGCAGGCTGAAAAAGGTTTGCGTGAAGCGTATGGAAAGATTTCCGATTCCATGTCAAACGGCATGGGTTATAGGTGACGAAAGGTCAAGGAAAGGAGAAAAACCATGCAAGATGGTGAAAACAATAATGAGCAAGCACAGGATAAACAAGAGCCTGAGCAGATTCAACAGGATGCTAGTCAGTTGACTACTGAGGAAGCTATTGAAAAGTATAAGGCACAGCAGCATGTGAATCGTGACCTTGAGCACAAGTACAAGGATGCAATGAAGAAGCTTGACGGGTACAAGGGCATGGAAGAGCAATTAGCAAAGCTGCAAGGTCGCGAAGCTGAGTATCAGAAAGCTGAAGAACTCGCAAAAATTCAGCAACAGGCTATTGCGAGCGCTAATCAGCGTGTTCTTAAGAGTGAGATTCGCGCCGCCGCTGCTAGCGTGTTAGAGAATCCGGCAGATGCAACAATTTTCCTTGATCTATCAAAATTCACCGTTTCCGATGATGGAGAAACCAATTCTGAGGAAATTAGTAACGCTTTGGGCGCGCTTGTTAAAGAGCGCCCATATTTGGCGAAACGCCAACCAAACACTGGTGTCGTGAGCACGCCTCCTAGCGGCACGCGTGCTCAGACTGTTCAGCAGCTTACACGTGAGCAGCTTAAGGGTATGACACCAAGCGAGATTGCAAAGGCTGATGCGGAAGGCAGGCTGCGCAACATTCTAGAAGGCAAATAATTACTTAAGGAGAAAATATGACGGGTCTTAACAATTTTATCCCAGAAATATGGAGTGCTAATATTCTTGTCACTCTTGAGAATTCTCTCGTGTTTGCAAATCTGGCAAATCGTGAGCATGAAGGCGAGATTAAAGCGTACGGCGATACCGTGCATATTACAGGTATTGGCGATATCCAGATTCAGGATTATACAAAGTATGGCAAGCTAACAATTCAGCCTGTTACTGATATTGATGCTGGTGTACTTAAGATTGACCAGTCTAAGGCGTTTGCTTTTGAAGTAGACGACTTGGATACTGTTCAGTCACGCAAAGATTTGCGAGGTAAGTTCCAAGAGCGCGCCGCCTATAATCTTGCTGCTGAGGTTGATAAGTATGTTGGCGGACTTATGGTTACGGCGGCGGCTGGTAAGGCTTTGAAGAAGACTTACACGAAGCCAGAAGACGTGTATGAAAGCATTGTTTCTCTTGGCGTGAGACTGAGTAAGCAGAATATCCCAACTACTGGTAGGTTCCTTGTGGTTGACCCAGATGTTTATGGAATGCTGCTTTTGGACGACCGTTTTGTTAAGAACACTGCTGTTGAGTCTGCAACATTGCACAATGGTTTCGTTGGTAACGTGAACGGTTTCACCGTGTATCAAACTAATTGTATGCCAGGCAATACTGATACGAAGCATACTATGCTTGCAGGTTCTACTATTGCTACTACTTTCGCACAACAGATTTCCAAAATGGAATCGACACGTAGAGAAGAAAGCTTTAGTGATCTTATTAAAGGCTTGCTTGTGTACGGCGCTAAGGTGATTCGTCCTGAAGCTCTTGCAACATGCGAACTCACCACTACTGGCTCTGTTTCTAGGACTGCCTGATTATGAGTTTGGCGACGATTGACGATTGCCGCCGTTTCAAAGTGCAGGTCGACGGGCGCGAAGCGGAAGCTGAAGCACTGTTAGAAGTTGCCAGCAGCAGCATTACAGCGGCTGCTGGCTCTCCTATTATGCGCGGAACTCACACGGTTGTTTTACCGGGCGTTGACCGCAAGCGCCTACCATTACCATTCCGTCCAGTCACTCACGTTGAGTCAGTTCTCATAGACGGCACACTTGACACTAATTGGAAACTTATAGGAGATTCCTTATATCGCGCCAACGATTGGGCGTCTCCTAATGAGCCAACTAGTGTACAAGTAACGTTTACAGCCGGCTTCGTGAACATTCCAGAGGATATTAAACGCCTTTGTTGTTCCATGGTTGCGGCTGGTTTAGCTCAAACCGAGAACGGCGGATTGCAAACACACACTGGCGTAGCGTATGAGCGCATTGACGACTATCAGATTGGTTACACGCAAGGTGAAAACGCTCTAGTAGACGCAATGCAACTGCCCGAAGCAACATGCAAAATGTTGCGCTCGCGGTTTGGTTCTCCCGGTTTGGCGGTGAGGATTATATGAGTGCTCTTAGCGTTGTACGTCGTGCTCAACAGGCGGCTGAGTGTCTTATGGTTGATACTGTGCTGGTGAAACGGGTAACGGGTTATGTGTTAGATGAGCAGACTGCTTTGCAGAAGCCATCGTATATGAAAGTGTATGAGGGCAAGTGTAAGTTGCAAGCTTACGATTCTAATGGTGCGAATAGTGCTAATAGTGCGAACGGCGTGAGGTTGAGCGATAAGGTGAATTATGGTTCTCCTATGCTTTCTTCTACGCAGGGTATTCACTTTCCTATGAGTGTGTCCGGGCTTGCTCCGGGTGACATTGTGGAAGTCATACATAGTGTTAATAAGGCTCTTGAGAATCGCGTGTTTAAGCTGGCTTTGAATACTAGTGCTAAAACGTTTACTTCGGCGCAACGCTGGGTATTGAACGCGAATCTTGAGACTGTAGAGGCGGTGGAGAATAATGTTTCGTCTTGAATCTTCTGAGCTTTCACGGTTCGCAGGTGCTTTGCATACGGCTTTTCGTGTGAAAGATGAGCAGGTTGAAAAAATTGTTGCTCATGCAGCACTCAATGTGAAGAAAGCTGTTAAAGCTGATTTAGCAAAATCCAACTACTGGTATTTTCGCGAAACGCCTATCACGTATGAGATTGAGAAAAAGTTTCATGAAGTTACAGCTACTGTAGCACCATTAAAAGGTAGTCCGGGTAGTGTTATTAACTTCGCTTTTTTTGGCTCTGAGCGTGGTGGTGGCACTCACAAATTCTATGAGTATGCTCAGCCTGAGTTTGACACGATGATTGAAGAAATGAGAAAGGTTGGTGTTGAAATATGACAACTTTTCTTGAAGCTCGAAAAGAGGTTTTAAGCATCATAAAACTACCCGACGGGTGGAAGCGTTACGAGGACGGGCAAGCACCGTTGTCAAAAACCACTCTCCCACCGTGGGTAATTTTTACCGTCAAACCGCAAAACAGATTCCACTGTGAAGCTGGCGAAACGCGATTGCGATACGCGCTCATAGAAGCACGCATTGTAAACGCTTCGCAGTTGAGCGTTGATTTGCTCGCTGAAAAACTCATAGACATGGTTGAAGCTGCAAAACTACAAAACATTGAAGGCATGAGCATCTACAGGGATTCTGGCTCGTACCCGGGAGATATGAAGAATCTCAGTCAGAATATGAATTATGTTGTGCGAGTCGTCGAATGGCGGTTCGCATTTAATATTTAGGAGACATTATGACAGTTGAAATTGCTAAAGTTCCAACACATCTTGCTGCTGGCTTACACCGTACTATTTGGGTGCCAGCTTCTAATGGTATTGCTGATATTCATAAGCCTACTGTGGCTGAATTGGAGAAAGCTGGCAATATTGATTTGAGTATTTACTTGGGTTCTCATGATGCTTTTAACCTTGACCATTCGCAGGAGACTTTTAACGATGAGCGCGAAGCTTATGCGGTAGCAGGCAAGATTAACGGCATGGAAAAGTATGAGAATGGTAAGCTGCACGTTATTGATAATACGAACACTAAAGATTCTGAGAAGTACAACGAAGCTATTAAAACGCTCACTAAGGGTGCTCGTGGTTTCTTTGTGCGCAGGCGTGGTAAGAAAGCGTCTGAGGAGTTCGTGGCGGGTGATGTTGTTAGCGTGTTCCCGGCAACTATTGGTTTGAAAACTGCGTTTAAGGATAATCGACAGATGAGTTTGATTAACTTTGCCGCTGACCCATCTTCTTCTGATGAGGAAAGTGTTGTTGTGGATTCTGCTAGTGCTGTGCCTGCATCTTCTACTGTAGTGTCGAACTCAGCACAGTAGTAATAATTCTTTCTTGCCGGTTTTTGTGGTGGATTGGCAAGAAGTATAGGGGAAGCGCGTGCGAGTTTTTGTTCCTTTCCTCGCACGCGCCTTTACTTTCTAACACCACAACATACCCCCCCCTATTAATTGGAGTTTTATTATGCCTATTAAATACATTCAACCACAATTAACTGTTGATATCGTAACTGACCTTGTATCATTGCAAAAAACGCTCATGCTTACTCAAGAGCTTGTACTTATGCAGCAGGACGAAACAAACGCTATTCGCGATGGACGCTCAGCTGACGAAGTACTGGAAGAAATACATAAGACAAAAGAACTCGCAGACCGTAGCGTTATTACTATCACTTTACAGGGCTTAAACCACTCTAAGTGGAGTGAATACGTTATTAAAAACACAGAAAACAAAGAGGACGAAGATACTCCAACAATCAACGTTAAACAGGCTGCGCTTGAAGCGTTCCCGGCAATGATTGTTAAAGCGCAATACAAACTGAGTAAACGCAGTGTGAGTAACAATGATGTGAAAGAGTTATTGCCAAAGCTTGCTGACTCGCAGATTACAGACATTGTTACAACTATTCAAAATCTTAACGAACCAACAACCGCATTCCCAAAAGAATTAACGCAGCTTATTTAGAATATTTCCCTAGTCTCGTAGCGCAATTAAAAACTGCACGTAGGCTAGGGATTTCTTATAAGCGATTCTGCGGATGGCAGCCAACTGATGACGATCCTATCGAGTGGGATGAAACAGAACGCGCATGGATGCTCGCACTGGATGAGTATGAGAACACAGTTCTTTGCCCACTGTGCGGAATGCCGTCTAAGTTCTGCCATGATTATTTGAAAGTAAATGACACGTTTGAGCGCGCAAAAATCGAAACATGTTTCGTAAGTGCTATGCGTGAGCAGGCGATGGAAAAGTATTTGAAAGATGACCGTCCGGGCACTACGCGCTCGCAAACTACTAAGCTAGTGCCGTTTGGAGTTGAGGAGGAGTAAATGGCGCAAAACGAGAATATTACTATTCGCATGACCGCGGATATTGCTGACTACTCGGCAAAACTCCAAACGGCAGCACACTTGACTGGAAAGTTTGACAGTTTTGTAAAGAGTGCTGGTACTACTGGGGAAAAGACCGGTCGCATAATGAGGGCACTTGCTATTGGTGCTGGTGCAGTTGCTGTTGCTATTGGTGTTGATGCTACGCGTCGCTTTGCCGAGTTTGACCAAGCTATGGCGGCTGTTAGGGCTAACGTCACTGAGAATGTTGGAGAGTTAAAGAAGCTTGAGACAGCAGCGTTGGATGCTGGGCGTAGTGGCATGTTTTCTGCAACTAAAGCTGCTAATGCTATCAATGAGCTTGGTAAAGCCGGTGTGAGCGTTAAGGATATTATTGGCGGCGGTTTGAAGGGTGCTCTTGATTTGGCAGCTGCTGGTGAGATGAACGCTGCTGATGCTGCTGAATTGACGGCTTCTGCATTGAATCAGTTCGGGCTTGCTGGTTCTAAAGCGTCTCATGTTGCAGATTTGCTTGCAGCTGGCGCAAACATGGCACAGGGTGGTGTGCAAGATATGGGCGAGGCATTGAAGAACGTTGGTGTTAACGCTCACATTCTCGGAATGAGCGTTGAGGAAACTGTTGGCGCTCTAACACTGTTTGCATCTAAAGGTTTGGTTGGTTCTGACGCTGGTACAAAGTTTAATGCTATGCTTCAGAATCTTGTTGCACCGTCTATTCGAGCGCAGGGTACTATTAAGAAGCTTGGATTGCAGATTTATGATTCTCAAGGCAGATTTGTTGGCTTAGCGAGTGTTGCTCAACAGTTGCATGATAAGCTTGGGCATTTAACTCAGGCGCAACGAAATGCTGCTCTTGGACGTATTTTCTCTAACGCAGCATTGACAACCGCTAACACTTTGTATGAGCAGGGCGCAAAGGGTGTTGAAAAGTATACGAAGATGATTAACCAACAGGGGTTCGCGTCAAAAGTTGCTAACACTCAGATGGATAATCTCAAAGGTCAGCTCACTCGCCTTGGCAATGCGTGGGATACCATGCTTATTAAAATCGGCAGCGGCAGGAAAGGTATTATAAGCGGCATGATTGGGGCTGTTACCGGGCTTGTTAACGCATTCGCAGCCTTACCTCCTGAAGTTCAGCAAGCCGTCATTGCAATAACCGCGCTAGTCGGTATTGGTGCTGGACTGTATGACATGTATCGTAAAAGTTCTAGATATGGTGGCGTTGTTGCTAAGAGTTTTGACTTTATAGGCAGTAAAGCAAAGCGGCTATTTACGCTGTTGAAGAGCACAAAACTTGGTGCAGGACTTGAATCAGCCTTTAGGAGCTTAAGCAGTACTGCAACTGATACTTTTTCAAGTATTGGGTACAAGATTGCTGGGGCTAGTTCTGGGTTGAGTGTTTTTAAGGCTGGCGTTCTCGGATTAACTGCTGTCACAACAGGAGTTCTTGTTGCTGCATTAGCGGCAGCAGCGATTGGTTTTGTGATGTGGCAAGCTAAAGCTGATGCTGCCAAAAAGCAGACTGATGCACTAAAAGATACTGTGCGTAATAGTGGTGACGTATATCGTAAGCTTGCTGATGAATTAAAAAACGGCAATGATGGAGTCAGCTGGTTTACAAAAGGCACTCAGAATTTTACTGACGCTCTTAAAACTTGCGGCGTAAGCATGGACACGTTTGTTAGCGCAGTCAAAGGCAGCAAAACCGCTGTCACGTCTTTCAATAAAGCCTTAGATAAGACTTGGAAAAACTCTACTCCAGACAGGTTCGGCGGAATCACAAAACAATCTGTCAACATTCTTAAGGAAGCGTTTGATCAAGCAAAAAAGAGCGTTAAAGACGCACAAAACGCGATGAAAGAGGAACAAGCGCAGCAGAAAGCAAATGCGCTAGCTTCTTCGCAGCACGCGGACGCGCTCATGAAGGGCGCGGAAGCCGCAGCCAAAAACAGTGGTGAAATATTAAAAGTCGCTAAAGTAGAAGACATTCTTATAGCAAAATTCGGTGCTAGTAAAAACGCTATCAACGCTCAAGCAGAAGCCATCAACAATAATGTTGAAGCAATGCAAAAATACTACGGTTTCGCAATGGACGCTGACCAAGCACTCACAAACTTGGATAAAACAATACGCGAAAGTAGTAAAAGCGTTGCAGAAGCCGGTAAACATTGGATGGATAACACTGATGCTGCCGATAAAAACATGAGCGCGCTCACAAGCCTTGCTAAGCAAACATTCGAAACCGCTGAAGCGATGGCTAAAAACGGTGAAAGCGTTGAAAATGTAACCAAAGCATTCGACAAAGGCAGTGCCGCGTTCGTTGACCTTGCTCAAAAAACAGGCATGAGCAAGGATAGAGCAATTGCGCTCGCAAAAGCTTGGGGTATTAGTCATGATGCGCTCATGAAGCTGATTGGGGCTGCGAAACAGTCTAATGTTGAAGCAAAAGTTACCGCAAAAGACAACTTTAGTGAAGTGTTTAAAAAACAGAATCTATCTGTAAAAAACCTTAAAGGCGGTAAGTTTGAGATTACTGGCAACAACAAGAAGGCTCTTGATGCTATTGCTAAAGTATCAAAAGCAAAGCTTGACCCTAAAAAGCTGACTCTTACTCTTGATAAGAAACAGCTTCAGACCGCTCTTGATGCGGTTAAAAAAATGAAGCCGGTTGAAGTTAAGGCAAAAGTTAAAGCTGACACTAAGCAGGCTAAAAAAGCTATTGCGGATGTTAGTAAGGCTAAAGTGCCGGACAAGACTGTCAAACTCAAAGGTGATAAGCGTAATCTTGACTCGACTATGTCTAAAGCTAAAGCAGCTAAAGTGCCGGACAAGACTGTCAAACTCAAAGGTGATAAGCGTAATCTTGACTCGACTATGTCTAAAGCTAAAGCAGCTAAAGTGCCGGACAAGACTGTCAAACTCAAAGGTAATAAAACTGATTTCCAAAACAAGTTTAACAGCGCACAAAGCGCACGGTTAAGAGATAAAACTGTGTATTTTAGAGCGAACGCCAATGAGGTTTGGAGTGCTATAAATGCGATTAATAACGCTAGCGTACACGTCAATGCTCGAGTTCATCGCGCGAATGGTGGTGTCGTGTATGGTGCTGGTACTGGTACATCAGATAGTATTCCAGCGATGCTTTCTAACGGTGAGTACGTTATGACAGCCGCCGCGGTTCAGCGCATTGGCGTGAACATGCTTGACCGTTTAAACTACGGGAACTCTATTGCAGGATCTGAGAAACCAGCATCTAATACTGGTGGGGATATGCTCGTGAACGCTGTGAATGGGTTACGAAACGATATGCGCGCGTTAAACGACAGGCTGGTCGCGTCAGGGTGTGTGGCTAATTCTAACGTTGCTGAAGCAATGCGCAGCGTTTTTGACGATGGAGTAAAACTCAAGCTTGATGCTAACGGTCGTGAAGTAATGGCTGGGCTGCTTGCTACGCCTATGAGCCGAGAACTTTCTCACATGATAGATTTAGGAAGGTGA
Protein sequences of DBSCAN-SWA_3 >NC_013721|1115132:1128382|1124623_1124860_+|WP_049762331.1|DBSCAN-SWA MLALDEYENTVLCPLCGMPSKFCHDYLKVNDTFERAKIETCFVSAMREQAMEKYLKDDRPGTTRSQTTKLVPFGVEEE >NC_013721|1115132:1128382|1119968_1120592_+|WP_012914325.1|DBSCAN-SWA MQDGENNNEQAQDKQEPEQIQQDASQLTTEEAIEKYKAQQHVNRDLEHKYKDAMKKLDGYKGMEEQLAKLQGREAEYQKAEELAKIQQQAIASANQRVLKSEIRAAAASVLENPADATIFLDLSKFTVSDDGETNSEEISNALGALVKERPYLAKRQPNTGVVSTPPSGTRAQTVQQLTREQLKGMTPSEIAKADAEGRLRNILEGK >NC_013721|1115132:1128382|1123994_1124495_+|WP_012914332.1|DBSCAN-SWA MPIKYIQPQLTVDIVTDLVSLQKTLMLTQELVLMQQDETNAIRDGRSADEVLEEIHKTKELADRSVITITLQGLNHSKWSEYVIKNTENKEDEDTPTINVKQAALEAFPAMIVKAQYKLSKRSVSNNDVKELLPKLADSQITDIVTTIQNLNEPTTAFPKELTQLI >NC_013721|1115132:1128382|1115132_1115657_+|WP_012914320.1|DBSCAN-SWA MSSGRGGARVRSGPAPDPHSRNSLRKGVKALVLSADGFAGEIPKFPLPQFRVVDSEGVRDTNGEKRFASREKTVWRHLWSLPQAEAWSMPEYSYMFFEIALYARQLVICEQAGASAADRGLLPRFADRIGLSEAAMAGFGWSIRRDSVVDSARDLSVIDGVRVLPRRMRGGDDA >NC_013721|1115132:1128382|1120609_1121464_+|WP_012914326.1|DBSCAN-SWA MTGLNNFIPEIWSANILVTLENSLVFANLANREHEGEIKAYGDTVHITGIGDIQIQDYTKYGKLTIQPVTDIDAGVLKIDQSKAFAFEVDDLDTVQSRKDLRGKFQERAAYNLAAEVDKYVGGLMVTAAAGKALKKTYTKPEDVYESIVSLGVRLSKQNIPTTGRFLVVDPDVYGMLLLDDRFVKNTAVESATLHNGFVGNVNGFTVYQTNCMPGNTDTKHTMLAGSTIATTFAQQISKMESTRREESFSDLIKGLLVYGAKVIRPEALATCELTTTGSVSRTA >NC_013721|1115132:1128382|1122492_1122864_+|WP_012914329.1|DBSCAN-SWA MFRLESSELSRFAGALHTAFRVKDEQVEKIVAHAALNVKKAVKADLAKSNYWYFRETPITYEIEKKFHEVTATVAPLKGSPGSVINFAFFGSERGGGTHKFYEYAQPEFDTMIEEMRKVGVEI >NC_013721|1115132:1128382|1123270_1123852_+|WP_012914331.1|DBSCAN-SWA MTVEIAKVPTHLAAGLHRTIWVPASNGIADIHKPTVAELEKAGNIDLSIYLGSHDAFNLDHSQETFNDEREAYAVAGKINGMEKYENGKLHVIDNTNTKDSEKYNEAIKTLTKGARGFFVRRRGKKASEEFVAGDVVSVFPATIGLKTAFKDNRQMSLINFAADPSSSDEESVVVDSASAVPASSTVVSNSAQ >NC_013721|1115132:1128382|1118795_1119941_+|WP_012914324.1|DBSCAN-SWA MTRNLQAVDTFHKRVAAREAVAVRAARHAWSRVDKNDIRGSWARVKQPLVDVLSDVQEQAVDAGLDANVDVMAEIGQYSAPEALVDSHAFSKFSAGGVPLDEWVDRPAIRTLELIKEGVGVDEALGRVGGSFSSSVATNLADVIRQAQQADIATRRHMGFIRCCNADACKWCIVLSGKIYRYNTGFERHMNCHCYHLPVNLDDVGSVMDIAPSPIELFNRMSESEQNKTFGIVGARSIRAGADIGQIVNSKLDSARITKSSRSYFALQLKDKGFTLPSAKDKSRKPFRRLTVDECWASGNRKEAIQRLKDNGYILPRDFRYEHRKNISGLWELNPAVRRAEQIYQEAIKSEDLARIAQAEKGLREAYGKISDSMSNGMGYR >NC_013721|1115132:1128382|1116773_1117433_+|WP_041160517.1|terminase|DBSCAN-SWA MSKKHELWLENPPAGTSVCGGFDGSENDDFTCFKLETLSGFIFTPRYGYDQRPTIWNPKEWGGRIPRGEVIAAMDELAHKYKFVRIYCDPGFKDEMSWESQIDAWARAYGERVFVPWVMNGSNRITAVYKALRRFEEDLSTHRITHDGCPITNAHIVNARRIPKTAERYGLGKPQQDRKIDAAVATILAHEAACDARMDGWGEEKPNLIYSPSNMRRLR >NC_013721|1115132:1128382|1117429_1118806_+|WP_012914323.1|portal|DBSCAN-SWA MTLNIDDVNVLLRRLLMQVIGRQADITKHVEYFRGRRGKLPFTSREFKKYMENRFNSFSDNWCSTVAQAPVERIHFQGFITPDSSVAPDSLHRVWQDSDADRGLSEAALMMMVARRSYGLVTQMADGHARITFENPDSCAMEFDPRTGAPTVGLTLGGEGSDTGVLYFPDCYVLVRKNADMAFGSTGVESWAVDESTFQPNPLGVVPMVEFRNQCMLDRSPISDIEQVEAMQDTVNVIWAYLLNALDYASMPARVILGGERLQEGVYDTNGTLVGSRPADLEKQMMDRIYQITGDGVKISEWQPANLDAFLPVIKKAVEHIAAETRTPSHYLLTSTEVPATGYEVAEAGLVNKILDRISYLRGGVKQLCWLAMLTEGDKNAALLVRNSTVKFANPQYRSEAQMMDGLIKMRQAGFPFEYVAEYAGLSPRDVERVLMMREKEMSDPTLEKIAGQLKHDA >NC_013721|1115132:1128382|1124860_1128382_+|WP_012914333.1|tail|DBSCAN-SWA MAQNENITIRMTADIADYSAKLQTAAHLTGKFDSFVKSAGTTGEKTGRIMRALAIGAGAVAVAIGVDATRRFAEFDQAMAAVRANVTENVGELKKLETAALDAGRSGMFSATKAANAINELGKAGVSVKDIIGGGLKGALDLAAAGEMNAADAAELTASALNQFGLAGSKASHVADLLAAGANMAQGGVQDMGEALKNVGVNAHILGMSVEETVGALTLFASKGLVGSDAGTKFNAMLQNLVAPSIRAQGTIKKLGLQIYDSQGRFVGLASVAQQLHDKLGHLTQAQRNAALGRIFSNAALTTANTLYEQGAKGVEKYTKMINQQGFASKVANTQMDNLKGQLTRLGNAWDTMLIKIGSGRKGIISGMIGAVTGLVNAFAALPPEVQQAVIAITALVGIGAGLYDMYRKSSRYGGVVAKSFDFIGSKAKRLFTLLKSTKLGAGLESAFRSLSSTATDTFSSIGYKIAGASSGLSVFKAGVLGLTAVTTGVLVAALAAAAIGFVMWQAKADAAKKQTDALKDTVRNSGDVYRKLADELKNGNDGVSWFTKGTQNFTDALKTCGVSMDTFVSAVKGSKTAVTSFNKALDKTWKNSTPDRFGGITKQSVNILKEAFDQAKKSVKDAQNAMKEEQAQQKANALASSQHADALMKGAEAAAKNSGEILKVAKVEDILIAKFGASKNAINAQAEAINNNVEAMQKYYGFAMDADQALTNLDKTIRESSKSVAEAGKHWMDNTDAADKNMSALTSLAKQTFETAEAMAKNGESVENVTKAFDKGSAAFVDLAQKTGMSKDRAIALAKAWGISHDALMKLIGAAKQSNVEAKVTAKDNFSEVFKKQNLSVKNLKGGKFEITGNNKKALDAIAKVSKAKLDPKKLTLTLDKKQLQTALDAVKKMKPVEVKAKVKADTKQAKKAIADVSKAKVPDKTVKLKGDKRNLDSTMSKAKAAKVPDKTVKLKGDKRNLDSTMSKAKAAKVPDKTVKLKGNKTDFQNKFNSAQSARLRDKTVYFRANANEVWSAINAINNASVHVNARVHRANGGVVYGAGTGTSDSIPAMLSNGEYVMTAAAVQRIGVNMLDRLNYGNSIAGSEKPASNTGGDMLVNAVNGLRNDMRALNDRLVASGCVANSNVAEAMRSVFDDGVKLKLDANGREVMAGLLATPMSRELSHMIDLGR >NC_013721|1115132:1128382|1115649_1116777_+|WP_012914321.1|DBSCAN-SWA MLSSDNWHVDFPTLADVIDAWIQAHCKQPDGFNRGKPFVLADWQFWLAANRWRIREDAKFVPPEEVTTDNPMVLNQAFTYRQTLCVAPQKTGKGPCAAAFVAAEAVGPTVFNGWAKKGDVYRCSDWGCSCGFEYAYLPGEPMGRPQPSPLIQLTANSEDQVLNMYRPLKAMVLLGSLKERLRVREGFIRVLNGDSDVSDAADLDRIDIVTSSARSRLGNPITDAEQDEAGLYTSSNGMVQVAQTQRRGAAGMGGRTHAWTNAYDPTENSYAQQIVESGDPDVFVFYRNPDLAPELRREDGSLMSFLKRSERRKILEYVYKGSPWVDLDSIEAEAASLLKTDPSQAERFFGNRLVQGAGAWIEEAQWAEAYGGYQD >NC_013721|1115132:1128382|1122041_1122506_+|WP_012914328.1|DBSCAN-SWA MSALSVVRRAQQAAECLMVDTVLVKRVTGYVLDEQTALQKPSYMKVYEGKCKLQAYDSNGANSANSANGVRLSDKVNYGSPMLSSTQGIHFPMSVSGLAPGDIVEVIHSVNKALENRVFKLALNTSAKTFTSAQRWVLNANLETVEAVENNVSS >NC_013721|1115132:1128382|1122860_1123262_+|WP_012914330.1|DBSCAN-SWA MTTFLEARKEVLSIIKLPDGWKRYEDGQAPLSKTTLPPWVIFTVKPQNRFHCEAGETRLRYALIEARIVNASQLSVDLLAEKLIDMVEAAKLQNIEGMSIYRDSGSYPGDMKNLSQNMNYVVRVVEWRFAFNI >NC_013721|1115132:1128382|1121466_1122045_+|WP_012914327.1|DBSCAN-SWA MSLATIDDCRRFKVQVDGREAEAEALLEVASSSITAAAGSPIMRGTHTVVLPGVDRKRLPLPFRPVTHVESVLIDGTLDTNWKLIGDSLYRANDWASPNEPTSVQVTFTAGFVNIPEDIKRLCCSMVAAGLAQTENGGLQTHTGVAYERIDDYQIGYTQGENALVDAMQLPEATCKMLRSRFGSPGLAVRII |
15 | Streptomyces_phage(40.0%) | tail,terminase,portal | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_013721.1|WP_049762331.1|1124623_1124860_+|hypothetical-protein |
1124623_1124860_+
Protein sequences of NC_013721.1|WP_049762331.1|1124623_1124860_+|hypothetical-protein>NC_013721.1|WP_049762331.1|1124623_1124860_+|hypothetical-protein MLALDEYENTVLCPLCGMPSKFCHDYLKVNDTFERAKIETCFVSAMREQAMEKYLKDDRPGTTRSQTTKLVPFGVEEE |
78 aa aa | NA | NA | NA | 1115132-1128382 |
yes
Self-targetings in the prophage
1. spacer 2.2|870248|33|NC_013721|PILER-CR,CRISPRCasFinder,CRT matches to NC_013721 position: 1126565-1126533, mismatch: 0 caaacgtgtccatgcttacgccgcaagttttaa CRISPR spacer caaacgtgtccatgcttacgccgcaagttttaa Protospacer ********************************* |