Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP018683 | Vibrio harveyi strain QT520 plasmid p2, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP018682 | Vibrio harveyi strain QT520 plasmid p1, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP018684 | Vibrio harveyi strain QT520 plasmid p3, complete sequence | 1 crisprs | NA | 0 | 1 | 0 | 0 |
NZ_CP018680 | Vibrio harveyi strain QT520 chromosome 1, complete sequence | 2 crisprs | cas3,csx1,DinG,DEDDh,csa3 | 1 | 1 | 5 | 0 |
NZ_CP018681 | Vibrio harveyi strain QT520 chromosome 2, complete sequence | 1 crisprs | csa3,cas3,DEDDh | 0 | 0 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP018684_1 | 2244-2327 | Orphan |
NA
Consensus repeat of NZ_CP018684_1
|
1 spacers
spacers of NZ_CP018684_1
>1.1|2267|38|NZ_CP018684|CRISPRCasFinder ATGTAATACATTGACGTTTGCCCAATTGTTCGATATTT |
CRISPR arrays and Neighbor proteins around NZ_CP018684_1
The CRISPR arrays of NZ_CP018684_1 >merge|NZ_CP018684|1|2244-2327|CRISPRCasFinder TATTTATTATATGTGTATTACACATGTAATACATTGACGTTTGCCCAATTGTTCGATATTTTATGTATTACACATGTAATACAT >NZ_CP018684|1|1|2244-2327|CRISPRCasFinder TATTTATTATATGTGTATTACAC ATGTAATACATTGACGTTTGCCCAATTGTTCGATATTT TATGTATTACACATGTAATACAT
>NZ_CP018684.2|WP_074052118.1|885_1713_+|hypothetical-protein MKKLFSIISILFISYSFGEVAKADEYISCNKCSSADISSAADSWGLKNISELDSKKNIRKKVHIVDLVNLKSSSYLVSKKNSNGTFYIEKNRISPSEELKEKLDDVIQSKEEMNLMSSDLVIPKEVIKDPWEFVGCSYCVVDLQNFLNNSLQGQISQFNSAVQTLAYTFGLINGPYQQTFTLKFAGGGSTTFEVKQVTNTYDFVIIKILQVVDESGNEIPLNRANANFKSLRIPSHDRWQIINNYLWRYRLSIPPTDGGVVTVTECPLAPQHNCW >NZ_CP018684.2|WP_017191050.1|2340_3108_+|ParA-family-protein MKTKVISAANQKGGVGKTTTLVNLGAELARKRKVLVVDLDPQGNCTKTLTGERHFQFEETIAAMFDKPKVVSIADLIRPALVDGEPIQNLDVVPADFQLSRIIETSLTKINRERILDKQLARLGERYDFILLDTPPNLSLTTLNAIQASDLILIPVDSGAFSLDGISPLLEAVSEIKDDDANYLILRNEVDVRNTVINEFIEEELEVAKEKVLSVAIRRSEHVSQANAVSAPVRFYKAGSLVNNDYRKLAAMIVG >NZ_CP018684.2|WP_017191051.1|3117_3414_+|hypothetical-protein MAKLKARGAITIASSADKRDDKSQAITTVKKKVTTSYQVSPLTLRMSLKDKKSIAEWVNDLQDLSERNVSAAKLFRALALYRDNIDDEELIKIINKMN >NZ_CP018684.2|WP_017191052.1|3623_3824_-|hypothetical-protein MALIVGPKEKVGRDGGSFQEVELYAGTETVKYRYKKYCTIRDNEYAPPTSTSGRKWEQIFRTPDSE >NZ_CP018684.2|WP_017191054.1|5162_5432_-|Txe/YoeB-family-addiction-module-toxin MSSSQRLLSWTNDAWGDYLYWQTQDKKTLKRINKLINDTKRSPFEGIGKPEQLKENLSGFWSRRIDDTNRLVYAVDDQAITIISCRYHY >NZ_CP018684.2|WP_017191055.1|5424_5679_-|type-II-toxin-antitoxin-system-prevent-host-death-family-antitoxin MRIVSFTEARNGLKAVLDGVVNDADTTVITRRDSEDAVVMSLDYYNSLMETVHLLRSPQNVEHLNRSIAQYRAGKTTAQELIDE >NZ_CP018684.2|WP_074052119.1|7587_8316_-|hypothetical-protein MKKYLSLLMVITLSIVLSGCNGSSDSSSELAESYDGVYKDKNGESLFYNSNEDAIYLYRPPQQYKDGYISSSNRSIVVDNSLIGPYIDTNHFVKSELGDYYHYQNSTVQLHFSKGNVSALVKDEGDRTLVDTTYTKLPTLADFDLMYQSYADWERMTLIFSNDDRMFAQLDFMLTCQLNADVKKVSNFYRVSNGTITCNDPNDPRIDSNMHGVIYKVAEDSRAIVIVQGKRWTYRTTFQTVY >NZ_CP018684.2|WP_017191058.1|8774_9134_-|hypothetical-protein MDFLVGVDLQDSFVLGWHYNDQTLEIELEFSIWPESKYYEAPKIGEHTCYRLGSLLFEGVLSINGLLNQNDIQPIIDLDGSKDYGSIDYFEKNQSYFKVAGGFGNIEFESSGVHFKLRT >NZ_CP018684.2|WP_017420534.1|9361_9634_+|DUF1778-domain-containing-protein MATTLPRITARVDVDTQDLLTKAAAIAGMSSINSFVLSAAIEKAKQVIEREQALKLSQADAMLLMEALDRPATQNSKLKAAADRYESKTQ >NZ_CP018684.2|WP_017191060.1|9630_10128_+|GNAT-family-N-acetyltransferase MMNTVLLDKAKHDRNRFNCGIEALNNYLKVMASQQAKKDNTRTFVLEDDNDNSHVIGFYTLTMTPIDLKALPDKLQKKHQSSTSGGLIARLAVDDRYKGKGFGEWLLIDALRKLLAASDSVAFPVVIVDAKDGAKHFYERYGFQEFEEAENKLFITIADVRTSLG >NZ_CP018684.2|WP_017819283.1|10273_10702_-|hypothetical-protein MRFNYRSKKDNEDALWQLEVAKLYIDYAKYCSTLFSSFLAGQVTLLGTVFSELKSREFAVYAILLMVFAVISAYSIAETELRRLRGQEIDRDVPRYVILRKRFPNHTSVQTGKSFLTGVLVVSSISCYLYFLYLGNVVKLPL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP018684_1 | 1.1|2267|38|NZ_CP018684|CRISPRCasFinder | 2267-2304 | 38 | NZ_CP018684 | Vibrio harveyi isolate QT520 plasmid p3, complete sequence | 2267-2304 | 0 | 1.0 |
1. spacer 1.1|2267|38|NZ_CP018684|CRISPRCasFinder matches to NZ_CP018684 (Vibrio harveyi isolate QT520 plasmid p3, complete sequence) position: , mismatch: 0, identity: 1.0
atgtaatacattgacgtttgcccaattgttcgatattt CRISPR spacer atgtaatacattgacgtttgcccaattgttcgatattt Protospacer **************************************
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP018680_1 | 2142803-2143041 | Orphan |
NA
Consensus repeat of NZ_CP018680_1
|
2 spacers
spacers of NZ_CP018680_1
>1.1|2142845|43|NZ_CP018680|PILER-CR AGTAGGTCACCAGTTCGATTCCGGTAGTCGGCACCATTCTTTT >1.2|2142930|79|NZ_CP018680|PILER-CR TCGTGTCGGTGGTTCGATTCCGCCTCGAGGCACCATATTTGGTGCATTTGCTTAAAGCAAGACACAAAACAGAATAGTT |
CRISPR arrays and Neighbor proteins around NZ_CP018680_1
The CRISPR arrays of NZ_CP018680_1 >merge|NZ_CP018680|1|2142803-2143041|PILER-CR CTTAGCTCAGTAGGTAGAGCAACTGACTTGTAATCAGTAGGTCACCAGTTCGATTCCGGTAGTCGGCACCATTCTTTTGCCTCGATAGCTCAGTCGGTAGAGCAGAGGATTGAAAATCCTCGTGTCGGTGGTTCGATTCCGCCTCGAGGCACCATATTTGGTGCATTTGCTTAAAGCAAGACACAAAACAGAATAGTTCCTCCTTAGCTCAGTTGGTAGAGCGACGGACTGTTAATCCG >NZ_CP018680|1|1|2142803-2143041|PILER-CR CTTAGCTCAGTAGGTAGAGCAACTGACTTGTAATCAGTAGGT CACCAGTTCGATTCCGGTAGTCGGCACCATTCTTTTGCCTCGA TAGCTCAGTCGGTAGAGCAGAGGATTGAAAATCCTCGTGTCG GTGGTTCGATTCCGCCTCGAGGCACCATATTTGGTGCATTTGCTTAAAGCAAGACACAAAACAGAATAGTTCCTCCTTA GCTCAGTTGGTAGAGCGACGGACTGTTAATCCG
>NZ_CP018680.2|WP_009697576.1|2141081_2142221_+|membrane-bound-lytic-murein-transglycosylase-MltC MKKIAFIATALLMAGCSREFVEKIYDVDYEPTNRFANNLAELPGQFEKDTDALDALINSFSGNIKKRWGQREVKFAGKSNYVKYIDNYLSRAEVDFQKGVITIETVSPTEPQKHLKNAIITTLLTPDDPANVDLFSSKEIKLEGQPFLYKQVVDQDKKPIQWSWRANRFADYLIANQVKTKDVDFKKAYYVEIPMVEDHFSQRSYQYADIVRRASKRYDIPEDLIYAIIKTESSFNPYAVSWANAYGLMQVVPKTAGRDVFKLVKNRSGQPSPEYLFNPENNIDTGTAYFYILKNRYLKDVQHPTSLEYSMISAYNGGTGGVLNTFHRTDRKRAMRDLNSLQPNQVYWALTKKHPNAEARRYLEKVTNFKKEFNSGHTL >NZ_CP018680.2|WP_005425792.1|2140726_2140999_+|oxidative-damage-protection-protein MSRTVFCARLKKEGEGLDFQLYPGELGKRIFDNISKEAWAQWQHKQTMLINEKKLNMMDPEHRKQLETEMVNFLFEGKDVHIEGYTPPSE >NZ_CP018680.2|WP_050922738.1|2139630_2140707_+|A/G-specific-adenine-glycosylase MTPFASAILEWYDAYGRKDLPWQQNKTAYSVWLSEIMLQQTQVTTVIPYYQRFLERFPTVVDLANAEQDEVLHLWTGLGYYARARNLHKAAKEVAHKYNGEFPLDLEQMNALPGIGRSTAAAVLSSVYKQPHAILDGNVKRTLSRCFAVDGWPGQKKVENQLWEIAETHTPQTDVDKYNQAMMDMGAMMCTRSKPKCTLCPVNEICIAKKQGNVLDYPGKKPKKDKPVKQTRFVMLHHKNGDSHEVWLEQRPQTGIWGGLFCFPQTEHIDVESDIELLLEQRGIQANDIKHQETLITFRHTFSHYHLDITPILIDLSKQPNVVMEGNKGLWYNLSKPEEVGLAAPVKLLLEALPHELR >NZ_CP018680.2|WP_005449665.1|2138707_2139427_-|tRNA-(guanosine(46)-N7)-methyltransferase-TrmB MSEVTTNEYNEDGKMIRKIRSFVRREGRLTKGQENAMNECWPTMGIDYKAELLDWKEVFGNDNPVVLEIGFGMGASLVEMAKNAPEKNFIGIEVHSPGVGACLADAREAGITNLRVMCHDAVEVFEHMIPNGSLATLQLFFPDPWHKKRHHKRRIVQLEFAEMVRQKLILNEGIFHMATDWENYAEHMIEIMNQAPGFENIAEEGDFIPRPEERPLTKFEARGHRLGHGVWDIKFKRTA >NZ_CP018680.2|WP_005425798.1|2137667_2138696_-|DNA-methylase MYHYAILANPGHNRIYFDSAMTIACSELKAILASEGVEVTEIGNKDVGLPAALVFSCETELSEAQAGKIAASSIYYALFEVREDGLLRPIQAPAFNTFPESMSQILRYTGKTNEQFTRLMVNLGVSAANTGSAQLTLMDPMCGKGTTLFEGLIHGLNVVGVEINQKWVQEIQTFIVKFMKNGRFKHKVSKEKRTSGGKKVADGFVVEAAADKDDYNKGNLQTMKLYSADTRIADQVVKKNSVDVMVSDLPYGVQHGSKNAKDSKLNRSPLELLKEALPAWKVVLKKQGSVVLSFNEFTLKWKDVAALFEEQGWKVLSEEPYIGYLHRVDQSINRNIIVAIKP >NZ_CP018680.2|WP_074050831.1|2136644_2137565_-|glutaminase-B MKPTAQILTDILAEVRPLIGQGKVADYIPALAKVPNTKLGIAVYTNEGEVIKAGDAEESFSIQSISKALSLTLAMCLYKQEEIWCRVGKEPSGQAFNSMIQLEMEQGIPRNPFINAGAIVVADLLQSRLSAPRQRLLEFARQLSGDTHIVYDKVVAASEMMHGDRNAAIAYLMRSFGNFENEVIPVLQNYFHACALKMSCVDLAKTFSYLANKGTSVQTAKPVVSPTQTKQLNALLATCGLYDGAGEFAYRVGMPGKSGVGGGIIAVVPGEMTIAVWSPELDASGNSLAGTKALELLAERIGRSIF >NZ_CP018680.2|WP_045480178.1|2135301_2136474_+|radical-SAM-family-heme-chaperone-HemW MLTPPALSLYVHIPWCVQKCPYCDFNSHALKAEIPEEEYINALLEDLDTDIEKYRLNEAPRPLHSIFIGGGTPSLISAEGIERLLKGIEARIPFKPEIEITMEANPGTIEAERFVGYRKAGVTRISIGVQSFEQQKLERLGRIHGQDEAVNAAKLAHQIGLNSFNLDLMHGLPDQSIEQALADLDKAIELAPPHLSWYQLTIEPNTMFYYKPPTLPDDDDLWDIFEQGHEKLAAAGYVQYEISGYSKPGYQCQHNLNYWRFGDYLGIGCGSHGKLSFADGRIIRTTKVKHPRGYLMAYQNMVKPYLDSEQLVADEDRPFEFFMNRFRLMEACPKQDYIDTTGLPLSSIQETIDWALEMGYLSETETHWQITEKGKLFLNDLLEAFMAEEE >NZ_CP018680.2|WP_005436525.1|2134680_2135283_+|XTP/dITP-diphosphatase MKKIVLATGNQGKVREMADLLSDFGFEVLAQSEFNVSEVAETGTTFIENAIIKARHAAQETGLPAIADDSGLEVDFLKGAPGIYSARYAGEKASDQENLEKLLKAMEGVPEAERTARFHCVLVLMRHENDPTPIVCHGKWEGRILTEAHGENGFGYDPIFFVPEDNCASAELEPARKKQLSHRGKALKSLFAQLSEQVSQ >NZ_CP018680.2|WP_005425804.1|2134143_2134575_+|DUF4426-domain-containing-protein MSKWITALLISLLSLPSMAGQFKTMKDIEVHYIAFNSTFLTPKIARSYDIKRNGYNAVLNISVLDSASLGKPAVEAKISGQAKNLIGQTQKLTFREVKEGDAIYYLAELGITNEETFTFDIDVKAGNKGSGKLKFTQKFYVEE >NZ_CP018680.2|WP_005425805.1|2133792_2134083_+|YggU-family-protein MSEAVWHDGEDVVLRLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLTKFLAKQFKVAKGLVHIEKGELGRHKQIRIESPVQIPTEIKAII >NZ_CP018680.2|WP_005449658.1|2143367_2144993_+|methyl-accepting-chemotaxis-protein MKLKTQAYLLSAIILIALLALTATGLWTLRVASNLDNKARVTELFKSAYSILTEVEKMAVEGKMPEDEAKALATRLLRNNIYKDNEYVYVADENMIFVAAPLDPQLHGTSFHDFRDGDGNSVGQLILDVLGRKTGQIVEYTWTQKLPDGTIEEKHSIAEKTPHWNWVVGTGIGFNEVNARFWSTAQWQLALCLAIAGAILGGLIISIRKMLNLLGGEPNDVREAVQAVAQGDIQTSFEYTAPKDSIYGAVQEMSQSLAKMVTNLDESMHALRSELSAVESRSNSIADLTMSQQQSTAMIATAMTEMASSANHVASSASDTAQNTDEADKQSQHTQSLIHSTVDNIQGLATQLNTASKAVADLDQDVNSIVKVLDVIGDIAEQTNLLALNAAIEAARAGEQGRGFAVVADEVRNLAGRTQDSTKEIQQMISNLQEGSRNAIHTMEICAETSESTVTESMNASEALQQIVTALESITAMSQQIATAAAEQTQVSDDIAHRINLIEESGSQLNTVVTESQSSTQTLASLADELEGWVNKFSVKH >NZ_CP018680.2|WP_009697574.1|2145893_2147330_-|NAD-dependent-succinate-semialdehyde-dehydrogenase MMQQIQSETLRVLFESLSSQDGIAVINPATEQELIRLKPSSLDELDVQIETCKAAQVEWEKLSAKVRSASLKKWFQLLVEHTEDIANIITLEQGKPLSESRGEVAYGASFVEWFAEEAKRAYGEVIPAPAVDRRLSTIKQPVGVCAAITPWNFPIAMITRKAAPALAAGCGMIIKPSELTPLTALAVVELAHQAGIPKALLSTVVSEQAAEFGLVLSTDPRIKKISFTGSTRVGKILMKQASDTVKAASMELGGNAPFIVFDDADLEAAANGLIASKFRNAGQTCVCTNRLYVHQNVKEAFLTKLLDKVTSLTVGNGLEKNTTLGPVITMASKHRLEAVIDQAVQEGAAMLNLPQKRSGRFMEPVILDNVEQGMAIVQQELFGPVLPVISFDDDEQVLSMANDTEYGLACYFYTDSLKRIIKFSEGLEYGMVGVNEGIISTEVAPFGGIKESGIGREGAKQGMDEYLETKYICLGGLS >NZ_CP018680.2|WP_009698789.1|2153934_2155185_-|HD-GYP-domain-containing-protein MAAIKLTVDRIQPGLHIRLPLKWNDHPFLLNSFKIKDQEQVEMIRHLGVKFVYFNPEQSDASPLPANQPQVEVTQDNTLDLEAQKLWKEKQKRIEKLSAYRRRVIQCEKEFERSLARMRSVMTKIRNRPVDAVGEAQQLIDDIVDKLMCDDNVTLHLMNGKNEFEDIYFHSLNVAVIAMMIGRAKGYSAEQLKALSFAALFHDMGKIKIPTAILRKQVPLTEPESNYLKLHTKYGLDMANQIEDFPDSAKTVIAQHHELRDGSGYPEGLKGDEIDELAQVIIVANAFDNLCHTSIAAEQKIPYTALSHLYKNCKHLYKEENLNILIKFMGVFPPGTVVQLSNNMVGLVISVNASHLLFPNVLVYDPAVPRTQAPIIDLASKDIKIVNAIHPSKLPEKIKEYLNPRSRISYFFDSDE >NZ_CP018680.2|WP_009698790.1|2155330_2156329_-|LacI-family-DNA-binding-transcriptional-regulator MARIKDVAELAGVNRSTVSRIINGEGKFREETRKKVEAAMAELNYRPSAIARSLATSSSNMVGLLVTYYTGGFFGEMMEQVQTELDMHKKFLITAQGHHSAEGEREAIQRFNDLRCDGYVLHSRYLSDDDLRELAKLPTPFVLLDRYVEGIEERCVTFNHHHASRIAVEHLIAGGHQKIVCIAGPSQRQNSLLRKRGYIDAVKTAGIDIDESWCEEGNYGRQSGYDAMAALYRRHPDLTAVFSCSEEMTVGAMQFLHEHRISVPKQISLVSFDSVDLCESLYPTVSAVHFPISDMARVAVQTLIGLIKEQPLMTQPEFEAQLKLRKADRIIT >NZ_CP018680.2|WP_017187877.1|2156431_2157187_-|chitin-disaccharide-deacetylase MKVIFNADDFGLTPGVNKGIIKAHQQGVVQSTTMMVGMDAERHAIELAKQNPNLNVGVHLRFTTGIPLTGHPNLTRGQTQFVSYDELWKIQDFEEEVVYQEAVAQVEAFLKLGLPLSHLDSHHHAHTHPQLLPVIRKVAQKYKVPLRGSGLCHIENSVKYYFTAEFYDQGVSLEGIMKHLLSLKAQYDVVEVMCHPAEADQSLLSKSSYAQQREVELQVLTSPVFKQELAQHGMAITDYSALVSSSGVASV >NZ_CP018680.2|WP_074050832.1|2157204_2158527_-|6-phospho-beta-glucosidase MPRNAIKLAIIGGGSSYTPELVEGVIKRLEFLPVKQMHFVDIESGAEKLEIIKGLAQRMIDKAGADIEIKAGFDRREAIKDADFVMTQFRVGGLTARANDERIPIKYDVIGQETTGPGGFAKALRTIPVILDICKDIEELSPHAWMLNFTNPAGLVSEAVSKYSNVKSIGLCNVPVSMEMMIAEMMDCEPKELQLEFAGLNHLVWVLKVWLKGEDVTQTVLEKVGDGANFSMKNIWEEPWDPAFLKALGAIPCPYHRYFYQTDAMLAEEKQSAGEQGTRAEQVMETESALFKLYQDPNLDHKPKELEERGGAYYSDASLNLVDAIYNNRNSIHVVNVLNNGAINGLPDSAVIECSAVIGSWGAKPLAVGELSSNIKGLLHQVKAYEQLAIEAAVEGSYEKALMALTNNPLVPDIGRAKSILDEILEVNAAYLPQFKLTTL >NZ_CP018680.2|WP_005448800.1|2158537_2158849_-|PTS-lactose/cellobiose-transporter-subunit-IIA MEQELVVMEIICNAGEARSLSYEALRLSREKDFTAAEEKLSQAKECINKAHLIQTQLIEEDQGEGKVPMTLVMVHAQDHLMTTILAQEMAVEIVALNKQLAAR >NZ_CP018680.2|WP_029790579.1|2158896_2160231_-|PTS-sugar-transporter-subunit-IIC MKLYDAIIGIVEKHIAPIAAKVGNQPHVRAMRDGFIVAMPFIIVGSFILIFAFPPFAEDTTFALGRIWLDFATTHFDTIMMPFNMSMGIMTIFVSLGVAYSLAKAYKMDGITSAVLSLMCFLLVAAPAKDGALAMKHMGGTGIFTAVMCAFFAVELYRFMKKHNITIRMPEQVPPAIARSFEVLLPVLAVFLTLYPLSIFVQTQYDMLIPDAVMAMFKPLISASNTLPAIIGALLVCQLLWFAGIHGAAIVVGLLSPIFLTNISANIDAFVAGQPIPNVFTQPFWDFYIFIGGSGATLALVMLMSFSRSAHLKSIGRMSAVPGFFQINEPVIFGSPVVMNPILFIPFVFAPIVNATIAYFAVQLGFVGMGVATTPWTTPALIGASWGSGWTFSPVLLVIGLLILDLFIYLPFFKMFEKQVMEQELPTSKESKDAEQPSGEGVTA >NZ_CP018680.2|WP_005448803.1|2160319_2160625_-|PTS-sugar-transporter-subunit-IIB MKKILLCCSAGMSTSMLVKKMEQAAEKQGLECKIDALSVNAFDEAIKEYDVCLLGPQVRFQLEELRKTAQEHGKNIDAISPQAYGMMKGEEVLQQALELIN >NZ_CP018680.2|WP_005448805.1|2161029_2161479_+|type-III-secretion-system-chaperone MSTLIEQLFQELVTDLGIDTLQPNEDGHCTLMVDETLLLDIELDPKRERLILTSLVGPLSSQQGIKQLSSLMQFNKALYKELNMSLSLETHTASIVLTYSVDAGLCTVFDLESALSCFISQTEKCRHLLESTQEDSPPKNSMLNTAICV |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP018680_2 | 3468495-3468656 | Orphan |
NA
Consensus repeat of NZ_CP018680_2
|
1 spacers
spacers of NZ_CP018680_2
>2.1|3468547|58|NZ_CP018680|CRISPRCasFinder ATATTGTTTAGAGCATTTCAAGATGGTGCGGAAGGAGAGACTTGAACTCTCACACCTC |
CRISPR arrays and Neighbor proteins around NZ_CP018680_2
The CRISPR arrays of NZ_CP018680_2 >merge|NZ_CP018680|2|3468495-3468656|CRISPRCasFinder GCGGCGCCAGAACCTAAATCTGGTGCGTCTACCAATTCCGCCATTTCCGCATATATTGTTTAGAGCATTTCAAGATGGTGCGGAAGGAGAGACTTGAACTCTCACACCTCGCGGCGCCAGAACCTAAATCTGGTGCGTCTACCAATTCCGCCACTTCCGCAT >NZ_CP018680|2|1|3468495-3468656|CRISPRCasFinder GCGGCGCCAGAACCTAAATCTGGTGCGTCTACCAATTCCGCCATTTCCGCAT ATATTGTTTAGAGCATTTCAAGATGGTGCGGAAGGAGAGACTTGAACTCTCACACCTC GCGGCGCCAGAACCTAAATCTGGTGCGTCTACCAATTCCGCCACTTCCGCAT
>NZ_CP018680.2|WP_074051126.1|3465356_3466532_+|2-octaprenyl-3-methyl-6-methoxy-1,4-benzoquinol-hydroxylase MNKYDIAVIGGGMVGAAVAVGFAKQGRSVVMVEGAEPKAFEASQALDIRVSAISHQSVKLLDSLGAWSVIESMRVCPYRRLETWEHPECRTRFHSDELSLEQLGYIVENRLIQLGLWQIFAQYDNLTVMCPERLKDIEFAEVNRVMLESGEQFEASWVIGADGANSKVRQLAGIGVTAWDYRQHCMLINVKTELPQQDITWQQFTPSGPRSFLPLCSLAEDGKEVGQGSLVWYDSPKRIKQLCAMTKPQLKEEILRHFPAELGDIEVLQFGSFPLTRRHAQSYSNKNCVLVGDSAHTINPLAGQGVNLGFKDVDVLLSVTNQQEQLNDALLAKYERARRPDNLLMQTGMDVFYKGFSNDLGPLKFARNAALKLAENSGPIKAQVLKYALGM >NZ_CP018680.2|WP_005438595.1|3463673_3465098_-|tRNA-(N6-isopentenyl-adenosine(37)-C2)-methylthiotransferase-MiaB MSKKLLIKTWGCQMNEYDSSKMADLLNAANGYELTEEPEEADVLLLNTCSIREKAQEKVFHQLGRWKTLKDKKPGVVIGVGGCVATQEGDHIRQRAPYVDVIFGPQTLHRLPEMIKQSQSDEAPVMDISFPEIEKFDRLPEPRAEGATAFVSIMEGCSKYCTYCVVPYTRGEEVSRPMDDVLFEIAQLADQGVREVNLLGQNVNAYRGPMHDGEICSFAELLRLVASIDGIDRIRFTTSHPLEFTDDIIAVYEDTPELVSFLHLPVQSGSDRVLTMMKRPHTGIEYKSIIRKLRKARPDIQISSDFIVGFPGETDKDFQDTMKLIKDVDFDMSFSFIFSPRPGTPAADYPCDLSEQVKKERLYELQQTVNAQAMRYSRLMLGTEQRVLVEGPSKKNLMELRARTENNRVVNFEGSADLIGQFVDVKITDVFANSLRGELVRTEKDMDLRSVISPTQMMAKTRREDELGVATFTP >NZ_CP018680.2|WP_005447495.1|3462502_3463600_-|PhoH-family-protein MSNKIVTLEINLEPSDNRRLASLCGPFDDNIKHLERRLGVEINHRSDLFTIVGKPHTAAAALDILKTLYVDTAPVRGETPDIEPEQIHLAIKESGVLEQNTESHFEHGKEVFVKTKKGVIKPRTPNQGQYLMNMVTHDITFGIGPAGTGKTYLAVAAAVDALERQEIRRILLTRPAVEAGEKLGFLPGDLSQKVDPYLRPLYDALFEMLGFERVEKLIERNVIEVAPLAYMRGRTLNDAFIILDESQNTTVEQMKMFLTRIGFNSRAVITGDVTQIDLPRGAKSGLRHAIEVLSEVDDISFNFFQADDVVRHPVVARIVNAYEKWEAQDQKERKEFEKRRREERDAKLLEAAKAELSAQVAATKE >NZ_CP018680.2|WP_005425552.1|3462035_3462500_-|rRNA-maturation-RNase-YbeY MAIELDLQLAVENEEGLPSEQDFQLWLDKTIPLFQPQAELTIRIVDEQESHELNHEYRGKDKPTNVLSFPFEVPPGMEMDLLGDLIICRQVVEKEAVEQNKPLLAHWAHMVVHGSLHLLGYDHIEDDEAEEMESLETEIMQGMGYEDPYIAEKE >NZ_CP018680.2|WP_005425551.1|3461029_3461929_-|CNNM-family-magnesium/cobalt-transport-protein-CorC MNEDNSPSSNEGKKEKAEGPSRKSFFERLGQLFQGEPKDRQELVDVIRDSEINDLIDHDTRDMLEGVMEISEMRVRDIMIPRSQMVTVERTDDLDTLIALITDAQHSRYPVISEDKDHVEGILLAKDLLKYLGSGSNPFDIEEVIRPAVVVPESKRVDRLLKEFREERYHMAIVVDEFGGVSGLVTIEDILEEIVGDIEDEFDDDEETDIRKLSKHTFAVRALTTIEEFNETFGTNFSDEEVDTVGGMVMTAFGHLPSRGEVVDIEAYSFKVTAADNRRVIQLQVTIPDEEKLVEATQE >NZ_CP018680.2|WP_017818929.1|3459471_3460992_-|apolipoprotein-N-acyltransferase MMNVLFHRLKRPLVAAFVGASTTLAFAPYQLWPLAILSPAILLILLADQSPRRALWIGYSWGLGQFATGVSWVYVSIAGFGGMPLIANLFLMGLLIAYLSIYTGLFAWLTNKFFPEFTLSKALLAAPALWLVTDWLRGWVMTGFPWLWLGYSQIDAPFASFAPIGGVELLTLFIMISAGALAYAWIHKQWLMIIIPAVLLSAGFGIRQYDWVTPRIEDTTKVALIQGNVDQNLKWLPSQRWPTIMKYADLTRENWDADIIVWPEAAIPAFEIEVPSFLRNIDTAAKMNNSAIITGVVNQGKDGQFYNSILSLGVNPYGDYSFDMDKRYHKHHLLPFGEFVPFEDVLRPLAPFFNLPMSSFSRGDFIQPNIVANGKHMDPALCYEIIFSEQVRQNVTDDTDFILTLSNDAWFGHSIGPLQHMEIARMRALELGKPLIRSTNNGVTAVTDHKGKVIEQIPQFETAVLRAELVPTDGQTPYRIVGTWPLYIWAGLSLMLAWWLSRKKKS >NZ_CP018680.2|WP_005447499.1|3458949_3459420_+|zinc-ribbon-containing-protein MPKRKAGYEEMLEDVIETLKHSPDEVNKVFETSGKVVDAANDMTKDELSLISAYVKADLKEFSDSYEEGKSGPFYLTIADSVWQGLLEITDRTKVEWVELFDDLEHQGLYEAGEVIGLGTLVCDECGHKTTYNHPTVIIPCIKCNHTGFSRQSLKP >NZ_CP018680.2|WP_009697741.1|3456220_3458794_-|leucine--tRNA-ligase MQEQYNPQDLEQKVQKHWDDNKTFVVSEDPNKEKFYCLSMFPYPSGRLHMGHVRNYTIGDVVSRFQRLQGKNVMQPIGWDAFGLPAENAAVKNNTAPAPWTYENIEYMKNQLKLLGFGYDWNREFATCTPEYYRWEQEFFTKLYEKGLVYKKTSSVNWCPNDQTVLANEQVEDGCCWRCDTPVEQKEIPQWFIKITEYAQELLDDLDKLEGWPEMVKTMQRNWIGRSEGVELKFEVKGQQDLEVYTTRPDTLMGVTYVGIAAGHPLATIAAENNPELAAFIEECKNTKVAEAELATMEKKGMATGLTAIHPLNGREVPVYVANFVLMDYGTGAVMAVPAHDQRDFEFATKYGLDIVPVIKPVDGSELDISEAAYTEKGVLFDSGEFDGLEFQAAFDAIAAKLEAEGKGTKTVNFRLRDWGVSRQRYWGAPIPMVTTEDGEVHPVPADQLPVILPEDVVMDGVTSPIKADKEWAKTTFNGEPALRETDTFDTFMESSWYYARYCSPQADDILDPEKANYWLPVDQYIGGIEHACMHLLYSRFFHKLLRDAGYVTSDEPFKQLLCQGMVLADAFYFENEKGGKEWVAPTDVKVERDGKGRITSATDNEGRNVEHSGMIKMSKSKNNGIDPQEMVDKYGADTVRLFMMFASPADMTLEWQESGVEGANRFLKRVWKLVNEHTSKGAAEAVDAAALSGDQKALRRDVHKTIAKVTDDIARRQTFNTAIAAIMELMNKLAKAPQESAQDRAILDEALKAVVAMLYPITPHISYELWAALGEADIDNAVWPTFDEKALVEDEKTIVVQVNGKLRAKLTVAADATKEQVEELGLNDENVTKFTDGLTIRKVIYVPGKLLNIVAN >NZ_CP018680.2|WP_009697740.1|3455491_3456049_-|luciferase MRFFSLIKLPVVLVLAGLLSACGFHLRGEYSVPEELHTMSFTSYDEYSPLTRYIRAQLELNKVDLVQPSSSTPNLHLIEATIDERTLSLYQNSRAAEKELTYVVKYRVTIPGFGAKDFKTTVNRNYLDNPLTALAKSVERDVIEDEMRQQAASQMMRQLGRVRAEYEQGQPTPDVMKTNSTNTNS >NZ_CP018680.2|WP_074051125.1|3454466_3455486_-|DNA-polymerase-III-subunit-delta MRIFADKLADHLAKHIKQVYLIFGNEPLLIQESRQAIQTMAHQQGFEERHRFAVDASLDWNQVYDCFQALSLFSSRQLIELELPENGVTTTVSKELQTLCEMLHDDIMLVIVGSKLTKAQENAKWFKALSAKGDWVSCQTPDLQRLPMFVQARCRTLGLKADQQSLQMLAQWHEGNLFALSQSLEKLALLYPDGELTVVRLEEALSRHNHFTIFHWIDALLAGKANRAQRILRQLEAEGTETVILIRSVQKEFNQLLNMHRDLTQMGMSQVFEKHRVWQNKRPFYNAALTRLSASRICALLSLLTHAEIKAKTQYDESVWPTIHQLSLETCSPDIKLAI >NZ_CP018680.2|WP_005449255.1|3469474_3470566_-|redox-regulated-ATPase-YchF MGFKCGIVGLPNVGKSTLFNALTKAGIEAANFPFCTIEPNTGVVPVPDLRLDALAKIVNPQKVLPTTMEFVDIAGLVAGASRGEGLGNKFLANIRETDAIGHVVRCFENDNIVHVAGKVSPIEDIEVINLELAMADLDSCERAIQRNAKKAKGGDKDAKFELTVLEKLLPVLTEGGMARTVDLAKEELAAIGYLNFLTLKPTMYIANVNEDGFEDNAYLDAVREFAAKENNVVVPVCAAIESEMAELEDDEREEFLADLGIEEPGLNRVIRAGYELLNLQTYFTAGVKEVRAWTIPVGATAPKAAAKIHTDFEKGFIRAEVVGYDDFIQFNGESGAKDAGKWRLEGKEYIVKDGDVVHFRFNV >NZ_CP018680.2|WP_005438598.1|3470576_3471167_-|aminoacyl-tRNA-hydrolase MTQPIKLLVGLANPGPEYAKTRHNAGAWVVEELARVHNVTLKNEPKFFGLTGRIMVNGQDLRLLIPTTFMNLSGKAIAALAKFYQIKPEEIMVAHDELDLPPGVAKFKKGGGHGGHNGLRDTISKLGNNKDFYRLRIGIGHPGHKDKVAGFVLGKAPAKEQELLDAAADESVRCLDILIKDGLSKAQNRLHTFKAE >NZ_CP018680.2|WP_005449254.1|3471318_3472263_-|ribose-phosphate-pyrophosphokinase MPDMKLFAGNATPELAQRIADRLYISLGDASVSRFSDGEVAVQINENVRGSDVFIIQSTCAPTNDNLMELVVMIDAMRRASAGRITAVIPYFGYARQDRRVRSARVPITAKVVADFLSNVGVDRVLTIDLHAEQIQGFFDVPVDNIFGTPVLLEDMQSRGLENPVVVSPDLGGVVRARATAKALGDIDIAIVDKRRPRANVSEVMNLIGDVEGRDCVIVDDMIDTGGTLCKAAEALKERGAKRVFAYATHAVFSGNAADNIKNSVLDQVIVTDSISLSKEMAATGKVTTLSLSRMLAEAIRRISNEESISAMFN >NZ_CP018680.2|WP_009696818.1|3472290_3473163_-|4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol-kinase MIDLSTRWPSPAKLNLFLYINGRTENGYHELQTLFQFVDHGDELTIQANDSGEITISPEIEGVPLQDNLIWKAATALQRYAKCSYGAHIDLHKILPMGGGIGGGSSNAATTLVALNYLWQTHLSDDELAEIGLALGADVPVFVRGFAAFAEGVGEKLSPAHPDEKWYLVVRPNVSIATADIFGHPDLTRNTPKRDLETLLNTPSVNDCEKIVRMLYPEVDKQLSWLLQYAPSRLTGTGSCVFAEFSSKSEAETILAQLSDKVSAFVAQGRNISPLKETLAEYQSASHRPI >NZ_CP018680.2|WP_005449252.1|3473159_3473798_-|lipoprotein-localization-protein-LolB MKLINKPSNMTLRSFLILLMSSIVLAGCSSVPESVTSVEWQAHEQRLETIKNFQATGKLGYIGPDQRQSLNFFWKHSSSLSQLRLTTLLGQTALKLTITPQGATVETYDDQVLSARDANQLIYRLTGLMMPVDHLPDWLLGLPTDADSFQLSPTNTLQALDKQIGLNDWNIAYQRYGDIEWHNQTLPLPNKLKLSTSDVKINLVITKWNITQ >NZ_CP018680.2|WP_005449251.1|3473932_3475189_+|glutamyl-tRNA-reductase MSLLAIGINHNTASVDLREKVAFGPDKLGPALEQLRDHEAVNGSVIVSTCNRTEVYCDVKQGARNKLIDWLAQFHQVSQEDLMPSLYVHEEQAAIKHLMRVSCGLDSLVLGEPQILGQVKQAFSDSRDHQAVDASIDKLFQKTFSVAKRVRTETDIGGNAVSVAYAACTLAKHIFESLSDSTVLLVGAGETIELVAKHLASNGCTKMIVANRTRERALGLAEQFGAEVISLNEIPDHLPRADIVISSTASPLPIIGKGMVETALKKRRHQPMLLVDIAVPRDVEAQVGELSDAYLYSVDDLQSIIDSNIEQRKVEAIQAEAIVSEESAAFMTWLRSLQAVDSIRDYRKSANEIREDLLSKSLQSLATGADPEKVLRELSNKLTNKLIHAPTRALQSAAEQGEPAKLTVIRQTLGLDDL >NZ_CP018680.2|WP_005449249.1|3475218_3476307_+|peptide-chain-release-factor-1 MKASILTKLETLVERYEEVQHLLGDPDVIGNQDKFRALSKEYSQLEEVTKCFQAYQQAQDDLVAAEDMANEDDEEMREMAQEEIKEAKETIERLTDELQILLLPKDPNDDRNCFLEIRAGAGGDEAGIFAGDLFRMYSKYAEKRGWRIEVMSSNEAEHGGYKEMIAKVSGDGAYGVLKFESGGHRVQRVPATESQGRVHTSACTVAVMAEIPEADLPEIKAADLKIDTFRASGAGGQHVNTTDSAIRITHLPTGTVVECQDERSQHKNKAKAMAVLAARIVQAEEERRAAEVSDTRRNLLGSGDRSDRIRTYNYPQGRVSDHRINLTIYRLNEVMEGDLQSLIDPVVQEHQADQLAALAENN >NZ_CP018680.2|WP_074051127.1|3476310_3477168_+|peptide-chain-release-factor-N(5)-glutamine-methyltransferase MSQVQSIEQALKSATAILTEGGKESPSLDAAVLLCHVLGKPRTYLLTWPEKALDPEQQAQFDALLARRITGEPVAYIIGEREFWSLPLKVSPSTLIPRPDTERLVEVALDKTYEQTGPILDLGTGTGAIALALASELPKRQVMGVDLKHEAKDLAEYNASQLNIKNVTFDQGSWFEPIASGTKFALIVSNPPYIDEKDPHLAEGDVRFEPKSALVADENGLADIRHISDLARQYLEEDGWLAFEHGYDQGEAVREIMTHFGFEQVVTEKDYGGNDRVTLGCYKPS >NZ_CP018680.2|WP_005449241.1|3477208_3477592_+|SirB2-family-protein MYVALKHIHLVTIALSATLLSIRFVLLMMDSPKRNNRFLKVFPHIVDTALLLSGIGLIMVTGFIPFTDAAPWLTNKITCVLAYIALGFFALKMAKNKLLRIFAFFGALGWLVMAANIAVSKNPNLFG >NZ_CP018680.2|WP_005449240.1|3477597_3478407_+|SirB1-family-protein MLEFFDEDFDNMALVEGALELNHSINPDTNVHWAKQELERLYKEAEAILVHETDEEQRFDSFLRLFFYEWGFKGDDQEYFISDNSFIDKVLERKKGIPVSLGAVLLYLGNRLGFPMKGVTFPTQFLIKVDWMHKTPDYINPFNGEYVGEKILQAWLIGQEGPLATLKPEHFEEADNPTVIGRWLALIKSAMLREERYTLALRCTNLALTFVPDDPYEIRDRGFIFQQLECHQVAVSDFQYFIDQCPDDPASELLKSQVNAMSEKHVTLH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | NZ_CP018680.2 | 2142490-2142532 | 0 | 1.0 |
1. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to position: 2142490-2142532, mismatch: 0, identity: 1.0
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcaccattctttt Protospacer *******************************************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | NZ_LN868946 | Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 4, complete sequence | 103819-103861 | 5 | 0.884 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | NC_021742 | Serratia liquefaciens ATCC 27592 plasmid unnamed, complete sequence | 35424-35466 | 5 | 0.884 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | NZ_CP041417 | Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence | 296497-296539 | 6 | 0.86 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | KU052038 | Escherichia phage SerU-LTIIb, partial genome | 1162-1204 | 6 | 0.86 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | KU052038 | Escherichia phage SerU-LTIIb, partial genome | 3068-3110 | 6 | 0.86 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | LN997803 | Escherichia coli phage phi467 | 15122-15164 | 6 | 0.86 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | NZ_CP023526 | Cedecea neteri strain FDAARGOS_392 plasmid unnamed, complete sequence | 1925-1967 | 6 | 0.86 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | NZ_CP022660 | Salmonella enterica subsp. enterica strain RM11060 plasmid pRM11060-2, complete sequence | 52074-52116 | 6 | 0.86 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | NZ_CP045061 | Salmonella enterica subsp. enterica serovar Muenchen strain LG26 plasmid pLG26p2, complete sequence | 52080-52122 | 6 | 0.86 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | NZ_CP045054 | Salmonella enterica subsp. enterica serovar Muenchen strain LG24 plasmid pLG24p2, complete sequence | 52085-52127 | 6 | 0.86 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | NZ_CP045058 | Salmonella enterica subsp. enterica serovar Muenchen strain LG25 plasmid pLG25p2, complete sequence | 52084-52126 | 6 | 0.86 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | MH791411 | UNVERIFIED: Escherichia phage Ecwhy_1, complete genome | 13641-13683 | 6 | 0.86 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | MH494197 | Escherichia phage CMSTMSU, complete genome | 196768-196810 | 6 | 0.86 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | NZ_CP054057 | Scandinavium goeteborgense strain CCUG 66741 plasmid pSg66741_1, complete sequence | 85813-85855 | 7 | 0.837 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | KY653118 | Morganella phage IME1369_01, complete genome | 694-736 | 7 | 0.837 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | NC_049942 | Escherichia phage JLK-2012, complete sequence | 23522-23564 | 10 | 0.767 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | LC494302 | Escherichia phage SP27 DNA, complete genome | 76609-76651 | 11 | 0.744 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | LT603033 | Escherichia phage vB_Eco_slurp01 genome assembly, chromosome: I | 17115-17157 | 11 | 0.744 |
NZ_CP018680_1 | 1.1|2142845|43|NZ_CP018680|PILER-CR | 2142845-2142887 | 43 | NC_027364 | Escherichia phage PBECO 4, complete genome | 207828-207870 | 11 | 0.744 |
1. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to NZ_LN868946 (Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 4, complete sequence) position: , mismatch: 5, identity: 0.884
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcaccatcaagtc Protospacer *************************************. *.
2. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to NC_021742 (Serratia liquefaciens ATCC 27592 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.884
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagccggcaccaatcaagt Protospacer ***************************.******** ** *
3. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to NZ_CP041417 (Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence) position: , mismatch: 6, identity: 0.86
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcaccatatgcgg Protospacer ************************************* . .
4. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to KU052038 (Escherichia phage SerU-LTIIb, partial genome) position: , mismatch: 6, identity: 0.86
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcaccatatgcgg Protospacer ************************************* . .
5. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to KU052038 (Escherichia phage SerU-LTIIb, partial genome) position: , mismatch: 6, identity: 0.86
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcaccatatgcgg Protospacer ************************************* . .
6. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to LN997803 (Escherichia coli phage phi467) position: , mismatch: 6, identity: 0.86
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcaccatatgcgg Protospacer ************************************* . .
7. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to NZ_CP023526 (Cedecea neteri strain FDAARGOS_392 plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.86
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcaccagaatgcg Protospacer ************************************ * .
8. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to NZ_CP022660 (Salmonella enterica subsp. enterica strain RM11060 plasmid pRM11060-2, complete sequence) position: , mismatch: 6, identity: 0.86
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcaccacttacga Protospacer ************************************.*. .
9. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to NZ_CP045061 (Salmonella enterica subsp. enterica serovar Muenchen strain LG26 plasmid pLG26p2, complete sequence) position: , mismatch: 6, identity: 0.86
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcaccacttacga Protospacer ************************************.*. .
10. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to NZ_CP045054 (Salmonella enterica subsp. enterica serovar Muenchen strain LG24 plasmid pLG24p2, complete sequence) position: , mismatch: 6, identity: 0.86
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcaccacttacga Protospacer ************************************.*. .
11. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to NZ_CP045058 (Salmonella enterica subsp. enterica serovar Muenchen strain LG25 plasmid pLG25p2, complete sequence) position: , mismatch: 6, identity: 0.86
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcaccacttacga Protospacer ************************************.*. .
12. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to MH791411 (UNVERIFIED: Escherichia phage Ecwhy_1, complete genome) position: , mismatch: 6, identity: 0.86
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcacaaatcagcc Protospacer ********************************** * ** ..
13. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to MH494197 (Escherichia phage CMSTMSU, complete genome) position: , mismatch: 6, identity: 0.86
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcacaaatcagcc Protospacer ********************************** * ** ..
14. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to NZ_CP054057 (Scandinavium goeteborgense strain CCUG 66741 plasmid pSg66741_1, complete sequence) position: , mismatch: 7, identity: 0.837
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgattccggtagtcggcaccaaatgcgg Protospacer ************************************ . .
15. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to KY653118 (Morganella phage IME1369_01, complete genome) position: , mismatch: 7, identity: 0.837
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer agtaggtcaccagttcgactccggtagccggcaccatattaaa Protospacer ******************.********.********* .*
16. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to NC_049942 (Escherichia phage JLK-2012, complete sequence) position: , mismatch: 10, identity: 0.767
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer atcaggtcgccagttcgattccggtagccggcaccatatgcgg Protospacer * .*****.******************.********* . .
17. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to LC494302 (Escherichia phage SP27 DNA, complete genome) position: , mismatch: 11, identity: 0.744
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer gaaaggtcaccagttcgattccggtagtcggcacaaaataaag Protospacer .. ******************************* * .
18. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to LT603033 (Escherichia phage vB_Eco_slurp01 genome assembly, chromosome: I) position: , mismatch: 11, identity: 0.744
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer gaaaggtcaccagttcgattccggtagtcggcacaaaataaag Protospacer .. ******************************* * .
19. spacer 1.1|2142845|43|NZ_CP018680|PILER-CR matches to NC_027364 (Escherichia phage PBECO 4, complete genome) position: , mismatch: 11, identity: 0.744
agtaggtcaccagttcgattccggtagtcggcaccattctttt CRISPR spacer gaaaggtcaccagttcgattccggtagtcggcacaaaataaag Protospacer .. ******************************* * .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
434895 : 444919
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP018680|434895:444919|DBSCAN-SWA TATGTTCAAAGAGAACGCAAAGAAAGTCGAAACCATTATCAAAACCAGTGAAGAGCCTCAGTCTTCACGTCTAAATGGCTTCCAACGCTTAAAAGAGTGCTGCTTTATCGTTGGCGTTCTTAGCTCGGTATTGTTGGCCGTTGCATTATTTACCTTTAGCCCAGCTGATCCGTCTTGGTCCCAAACTGCTTGGGGGGGAGAGATCAATAACGCTGGTGGCTTGTTCGGCGCTTGGTTAGCCGACACGCTATTCTTTACCTTTGGTTCCCTTGCGTACCCACTGCCATTCCTGTTGACAGCAGGAGCATGGGTGATTTGCCGTAAGCGTTCTGAAGACGATCCTATCGACCTTATGTTGTGGGGTACTCGTCTACTTGGGTTAGTTATCCTCATTCTCACCAGTTGTGGTCTTGCTGATATTAACTTTGACGACATTTGGTATTTCTCATCTGGTGGTGTCGTTGGTGACGTACTAACCAGTCTATCTTTACCTACACTCAATGTGTTGGGTACAACGCTTGTGCTTCTTTTCCTATGGGGAGCCGGCTTTACTCTGTTTACCGGTATTTCTTGGCTGAAGATTGTCGAATGGTTAGGTGACCGTGCGCTTGGCTTAGTTGCTGCTGTTACTAATAAAGCTCGTGGTACTGAGCAAGAAACGCTAGAACCTCAATTAGACGAGTTCGCTGAGGACCGAGTCGTGAGTAAGCGCGGTCAAGATGATCTTGAAGATGAACCTCTGCCGCACCTTACGGCTTATGATGTAGAAGAGCCGAAAGAAGAGGCGCCTGCACATGAATACCCGATCTACATGCCTCAATCTGCAGCGGAACAACCGAAACAGCAGTCTGCGACACAACCTATAGCGCAGCCAACGCCTCAACCAGCGGTTGTGAATGCAGCGCCACTACAATCTCAAGTTCAGCATCAGGCTCAAGAGCAAGCTCAATCTCAATCAGTTGAGCCTGTTCGAGCAGTGAACACAGACAACGTGGATCCGACGGTTGAGCGTACTAAGCAACTGAATGTCACCATTGAGGAGCTAGAAGCGGCAGCGCAACAAGCTGACGACTGGGCTGATGAAACGCCACAAGTTCAAGTATCACAAGCCCAAGAGCCAGTGGTTGCACCTATTGTGGCTCAGCCTGAAGCGCCACAGTCCGTTGCTCAAGAACCAGCTCCACAGCAAGTTGATCATCAACAAGCGCTACAAGAACCGTTTGTGGCACCTGAACCAACTGTCTTTGAACAGGTCGAGAATGAGCCAAGCTACGATGAGTATGCTCAGTTTGAAGCAGATCAGGCTCAACAGCCCCAAGTTGTTCAAGAACAAGCACTGGTTCACGCGCCAATCGAACAGCCAACTACAGAGCAACCTCTACATGAAGAGCCTGTAATTAACGCTGCAGCTTTGGATAACATTGCAGAACAAACAGAGCCAAGCCAGCACATCGAGCCAACGATTTCGAACTTTGATGTCTTGGATGAAGAAGATGATTACCAAGCGCAGCAACCTGTACAGCAAGCTCAGCCTACACATGTTGAGCCGCAACAACCTGCTTCTCAGCCTGTACAACCGCAGGCTGTACAAGAACAACCTATGCAGCACCAAGCTGCGCCTCAGCCGGTAGCACCAGCGCCTCAAGATGTCGAGGTAGAAGAGGTTCAAGAAGGTGACCAAGACGTAGCGGCGTTCCAAAACTTGGTTTCGAGCGCTCAAGCGAAAGTGGCTGCTCAGCAAAACCCATTCTTGGTACAACAAGAGCAAAACTTGCCAGTACCAGCAGAGCCATTGCCAACGCTAGAATTGCTTTACCACCCAGAGAAACGCGACAACTTTATCGACCGTGACGCTTTGGAGGAGGTGGCTCGTTTGGTTGAAACCAAACTTGCAGATTACAAGATTAAAGCTGATGTAGTCGGTATTTACCCAGGTCCTGTTATTACTCGATTCGAGCTTGATTTGGCTCCGGGCGTAAAAGTGAGCCGTATTTCAGGTTTGTCTATGGACTTGGCTCGTGCATTGTCAGCAATGGCAGTACGTGTTGTCGAAGTTATTCCAGGCAAACCGTATGTGGGTCTAGAGTTGCCTAATATGAGCCGCCAAACCGTGTTCCTGTCTGATGTAATCAACAGTCCTCAATTTGAACAAGCAACCTCGCCGACAACAGTCGTTCTTGGTCAAGATATTGCTGGTGAAGCGGTTATTGCTGATATTGCGAAGATGCCTCACGTGTTAGTTGCGGGTACGACTGGTTCTGGTAAGTCGGTGGGTGTGAACGTCATGATCTTGAGTATGCTGTATAAAGCATCGCCTGAAGATCTACGTTTCATCATGATCGACCCGAAAATGTTGGAACTTTCAATCTACGAAGGTATCCCTCATCTTCTTTCTGAAGTTGTAACGGACATGAAAGACGCGTCTAACGCGCTACGTTGGTGTGTTGGCGAGATGGAACGCCGTTATAAACTGATGTCGGCATTGGGCGTCCGTAATGTCAAAGGCTTCAACGAGAAGTTGAAGATGGCAGCAGAAGCGGGTCACCCAATTCATGACCCGTTCTGGCAAGAAGGCGACAGCATGGACACTGAGCCACCGTTGCTAGAGAAACTGCCTTACATCGTTGTGGTTGTCGACGAATTTGCCGACCTAATGATGGTGGTAGGTAAGAAGGTTGAAGAACTGATTGCTCGTTTGGCACAGAAAGCGCGTGCGGCTGGTATTCACTTGATTCTTGCGACGCAGCGCCCATCGGTTGACGTTATTACCGGTCTGATTAAAGCGAACATTCCAACGCGTGTGGCATTTACAGTATCTACTAAGACTGACTCGCGCACGATTCTTGACCAAGGCGGCGCAGAATCACTACTCGGTATGGGTGACATGCTTTACCTACCTCCGGGTTCAAGCCATACAATCCGTGTTCACGGTGCGTTTGCGTCGGATGATGATGTTCACGCTGTTGTGAACAACTGGAAAGCGCGTGGTAAGCCTAATTACATCGACGAAATCATCAGTGGTGACCAAGGCCCAGAAAGTCTGCTTCCGGGTGAACAGATGGAATCGGATGAAGAAGTGGACCCATTGTTCGATCAAGTCGTAGAGCATGTGGTTCAATCTCGTCGTGGGTCGGTTTCTGGCGTACAACGTCGATTCAAGATTGGTTACAACCGAGCGGCTCGAATTGTTGAGCAGCTTGAAGCGCAAGGCATCGTAAGTGCTCCGGGTCATAACGGTAACCGTGAAGTACTGGCTCCTGCGCCACCAAAAGATTAATGAACTTGTGTTACTCACCTCAGTCCAACTGAGGTGAGTGGTTCTAACAATAGGTATCTGAATGAACAAACTATTTTCAGCGAAACTTTTCTCTGCGCTTGTTTTAAGCTTTTCTCTCTTCTCTACTGCACATGCAGCGTCTCCTAAAGATGAATTAAACAAACGCCTGTCTATGAATGATGGCTTCAGTGCGGATTTCTCACAGCAAGTAATCAGCCCAGAAGGTGAAACAGTCATGGAAGGGGAAGGTACGGTTGAAATTGCTCGTCCAAGCCTGTTCCGCTGGAGCACGACATTCCCTGACGAAAACTTGCTGGTCTCTGATGGTAAAACACTGTGGTACTACAGCCCATTCATTGAACAAGTCAGCATCTACTGGCAAGAGCAAGCAACAGAGCAAACGCCATTTGTTTTGCTAACGCGTAATCGCGCAAGTGATTGGGATAACTACAAGATTTCTCAAAAAGGTGACGAGTTCACCCTCATTCCTACCGCAGTAGACTCCACACAAGGTCAATTCCAGATTAATATTGATGCAAAAGGTGTGGTGAAAGGCTTTAACGTAGTTGAGCAAGATGGTCAGAAGGGGCTGTTCAGCTTTAGCAACGTGAAGCTTGGCAAACCAAAAGCAGACAGATTCACCTTCACAATCCCAGATGGCGTGGAAGTGGATGATCAAAGGAATTAATTGGTAGTGAGCAATTACACATTAGATTTCTCGGGGGACGAAGATTTTCGTCCCCTTGCTGCTCGTATGAGACCGGAAACGATCGAACAATACATCGGACAGCAGCACATTTTAGGTCCTGGTAAGCCTTTACGTCGCGCGTTAGAGGCGGGTCATATTCACTCGATGATCTTGTGGGGACCACCAGGCACTGGCAAAACGACATTAGCGGAAGTGGCTGCAAACTATGCCAACGCAGAAGTAGAGCGTGTGTCTGCGGTGACGTCTGGTGTGAAAGAAATCCGTGCTGCAATCGAGAAAGCGCGTGAGAATAAAATGGCGGGTCGCCGTACCATCTTGTTTGTGGATGAGGTGCACCGCTTCAATAAATCTCAACAGGATGCGTTTTTACCTCATATTGAAGATGGAACCGTTACTTTTATCGGCGCGACCACAGAAAATCCATCGTTTGAGCTAAACAACGCATTGCTGTCACGTGCACGTGTGTACAAGCTGACTTCTTTAGATAAAGAAGAAATTTCATTAGCGTTGAACCAAGCCATTAGCGATAAAGAGCGTGGGTTGGGGAATACACCAGCACATTTCGCCGATAATGTATTAGATCGCTTAGCTGAACTGGTAAACGGTGATGCTCGCATGTCCCTCAATTACCTTGAGCTGCTCTATGATATGGCTGAGGATAATGCGCAGGGCGAAAAAGAGATCACTCTGAAGTTACTTGCGGAAGTAGCAGGAGAAAAGGTCTCTCGCTTTGATAACAAAGGCGATATTTGGTACGACTTAATCTCTGCGGTCCACAAATCCATTCGTGGCTCAAACCCTGATGCTGCGCTCTATTGGGCTGCGCGTATGATGGCTGCGGGTTGTGACCCTTTGTACATCGCACGCCGCCTGCTGGCTATTGCCTCTGAAGACGTAGGCAACGCAGATCCTCGCGCAATGCAAGTAGCCTTATCTGCTTGGGACTGTTTTACCCGTGTTGGTCCGGCAGAAGGTGAACGTGCGATTGCACAGGCGATCGTTTACTTAGCGTGCGCACCAAAAAGTAATGCCGTCTATGTGGCGTGGAAGCAAGCGCTCACAGATGCTCATAACTTGCCTGAATATGAAGTGCCGCATCACCTTCGAAATGCGCCAACGAGTTTGATGAAAGACATGGGTTACGGTGCAGAATATCGTTATGCTCACGATGAGCCAGGAGCCTATGCCGCAGGTGAACAATATTTGCCGCCTGAGATGGGTGATCGTAAGTACTATCAGCCGACAAATCGTGGTCTAGAGACCAAAATCGGCGAAAAGTTAGATTACTTGGCTACTTTAGACGCAAATAGCCCACAAAAGCGCTATGAAAAGTAGGCGTTTTTGGATACATTTATTGCGTAAATTTTTTTAACGCGTACGCCTTGGAAAGTGCGCGTGATTTCACAACAAAAAGCATAGGATTAACAATGCTGGATTCTAAATTACTTCGTACAGAGCTGGATGAAACAGCTGCTAAACTGGCGCGTCGTGGCTTTAAGCTAGACGTAGAGACTATTCGTAAACTTGAAGAACAACGTAAGTCCATTCAAGTAGAAGTTGAAAATTTACAATCCACGCGTAACTCCATCTCCAAACAAATCGGCCAAAAAATGGCGGCTGGTGACAAAGAAGGCGCAGAAGAGATCAAGAAACAAATCGGTACACTAGGTAGCGATCTTGATGCGAAGAAAGTTGAACTTGAGCAAGTAATGGCTCAGCTAGACGAATTCACGCTTTCTGTACCAAATATCCCAGCAGATGAAGTGCCAGATGGCAAAGATGAGAACGATAACGTAGAAATCTCTCGTTGGGGTGAGCCAAAATCTTACGACTTCGAACTGAAAGATCACGTTGACCTAGGCGAAATGGGTGACGGTCTAGACTTCGCAAGTGCAGTTAAGATCACTGGCGCACGTTTCATCGTGATGAAAGGTCAATTTGCTCGTCTACACCGTGCAATCGCTCAGTTCATGCTGGATCTTCACACAGAAGAACACGGCTACACTGAAATGTACGTACCTTACCTAGTAAACTCTGACAGCCTATTCGGTACCGGTCAGCTTCCTAAGTTTGGTAAAGATCTTTTCCACACTGAGCCGCTAGTAGAAAAAGTAAACGACGAAGAGCCACGTAAACTGTCTCTAATTCCTACTGCAGAAGTGCCAGTAACGAACCTAGTTCGTGACACGATTTCTGACGAAGCGGATCTACCACTTAAGATGACGGCTCACACACCATGTTTCCGTTCTGAAGCAGGTTCTTACGGTCGTGATACTCGTGGTCTGATTCGTATGCACCAGTTCGACAAAGTTGAGCTAGTACAAATCACTAAACCAGAAGATTCAATGAACGCGCTAGAAGAGCTAACCGGTCATGCTGAGAAAGTTCTACAGCTTCTTGAACTTCCTTACCGTAAAGTGGTTCTGTGTACAGGTGACATGGGCTTCGGCGCTCGTAAAACTTACGACCTAGAAGTGTGGGTTCCTGCTCAAGAAACTTACCGTGAAATCTCTTCATGTTCTAACATGTGGGACTTCCAAGCTCGTCGTATGCAAGCGCGTTTCCGTCGTAAAGGCGAGAAGAAACCTGAGCTTGTACACACGCTAAATGGTTCTGGTCTAGCGGTTGGTCGTACCATGGTTGCTATCCTAGAAAACAACCAAGAAGCAGACGGTCGCATTGCAATCCCTACAGTACTTCAGAAGTACATGGCGGGCGCAACGCACATCGGTTAATTCATTGACCATTAGAACACCAAAACCCAGCCTTAGTGCTGGGTTTTTTGTTATCTGCGTCGAGTGAAAATACGTTGTTATGTAAATGTAGGATGAGGTATTGCATAGTTAGCCCAAATTATTAAAACCAGCCCCTAATGACATTTTTCTAAGCTTCATTGAGCGTGACAAGCTCCTCCAAAAGATAAAGAGGAGGTACTTTCATGGCTTCAATAAGAATAAGCGGCTCTGTTGGCTTGGGTGGTAAAAATGTGGATGGTGACATTCGTACGGTACAAAGGTCTATTAACCAATTGCTTGGTTCTTTAAAGGGAGTAAAGGAACTTAAAGTGGATGGGAAACTAGGCTCTAGACCGGAAAACTCCAAAACTGTCGGCGCAATTAAAGCTTTTCAGAAAAACCTTGTTGGGATGGCTCGTCCAGATGGAAGGATAGATGTGAATGGAAGAAGCCATCGTAAGCTGAATGATTATTTAAAACGAACACCTGAGACTGCGGTGGCATATACGTTACCTTTGGTTGGCTCAAGAGATGCGTTGACGGACTTGGATTATCGCAAAGTAGCTGAAACTCTTGGATGCGAAGTCGCTGCTATTAAAGCTGTAGCGGAAGTTGAGAGCCGAGGAGAAGCCTACTTTAGCAATGGTAAGCCAAAGATTTTGTTTGAGGCTCATATATTTTCGAGACTCACTTCAAGAGCCTACGATAACAGTCATCCAAGTATTTCAAGTCGGCATTGGAATCGAAGTCTTTATGTCGGTGGTATCTCAGAATACGTACGTTTAAATAAAGCGATTGAACTGAATTCAAATGCAGCAATCCGTTCCGCATCTTGGGGACGTTTCCAAATTATGGGCTTTAATTTCAAGTTAGCTGGTCATGTTACCGCTGAATCCTTTGTTAAAGCTGTATTCGAGTCGGAGAAGAAACAACTTGAGGCTTTTGTTACGTTTATCCAAAAGTCTGGCTTAGGTGAGCATATAAGAGATAAAAATTGGGCTGCCTTTGCTCGTGGTTACAATGGTTCTGAGTATCAGAAAAATCAATACGACGTCAAATTAGAAAAGGCTTACAAAAAATATGCTTCGATTAAAAATGCTGCGTAACATTTCTTTGCTTGGACTTATTTTTAGTTCATCTGCTTGTGCTTCCAGCCACACTAATACTGCGCTATTTAAGTGTGATGCTTCACATCCTAACCGTCTAGAAATATCAATAGAAAACATAAACAGTCAGGTTCTACTGTCGGAGCTTAGCTTAGGGGGGAGCAGTATTGAAAGATCATTAGTTATAAAGGACTTTAAATTAGACCAGTATCATCGAGCGTTGGTAGATGAGAAGTCACTGGAATTTAGTATTGGTGAGCGCGTCATATTGGTTAGTGAGTACTTTAGCGAGGAGTTTGATGAGGTAGAGAAGATACTCAGTGTTACTTTGAGAGAGCCTGAGCAAACGCAGCATTTTGAATGCGAGGAAGGGAGTATGAGTAACCTTGCATTGCTTTTTCATGAGAGCGCCGAATAATCGATTACAGAGTAATTATTACTAAAACGTAATAGTGTTTTAGAATTTACCCTAATTATCTTGTTATTTGTAATCGATATAATCATCCACCTTACCACGGTAAGTAGATTTTCTCATTCAAGTAACGAGGAGGGCTTATGGCCGTAGGGTGGGCAAATGACGATAGTGTCAGCCAACAGATTCAACATACCATCGACGATGAAATTTCCCGTGTTCGAGGCAATATTAAGCAGGGAGAAAGCGCCCACTATTGTGATGAGTGCGGAGATGAAATTCCAGAAGCGCGTCGAACAGCAATGAAAGGCGTTCGTCTTTGTATCGAATGCCAGTCAACGATCGAATTGGTGTCGCAACGCCAAGCACTGTTCAATCGACGAGCGAGTAAAGACAGTCAACTTCGATAGCTTTCCTTGAGTCAATGACAAGAAAACGGTTTGAGTAGCAATTCAAACCGTTCTTATTTGTCTCTTTGTCACAGTGTCCTGCGTTAACCGAGAACGAATTAAGCGAAGCAATCTTCGCGTTGCAAAGCAGCATCAATCGCATTAACCAACTTTTCAATTTCGTCTGGTTTGCTGATAAACGGTGGCATCATATAAATCAGTTTGCCAAATGGACGAATCCAAACACCGTGCTCAACAAAGAGAGCTTGGATGGTTTCCATATTTACTGGGCTATACGTTTCCACAACGCCAATAGCACCTAGCCAACGGGTGCTTTTCACCAAGTCGTATGTTTCAAGTTTTGGCAGCAACTCGGAAAACAGCGTTTCAATTTGTTGGGTTTGTTGCTGCCAATTACCTTGCTCAATCAATTCCAAACTTGCGGTGGCAACAGCACAGGCGAGCGGGTTACCCATAAAGGTTGGACCGTGCATAAAACACCCAGCATCACCGCCACAAACGGTATCTGCGACGTGTTTGCTAGCTAGGGTTGCTGAGAGGGTCATGTAGCCTCCAGTGAGCGCTTTACCTACACACAGAATGTCTGGTTGAATATCTGCATGTTCGCAGGCAAACAATTTGCCAGTGCGTCCAAACCCGGTTGCGATCTCATCTGCAATCAATAACAAGCCGTATTTGTCGCAAAGCTTACGCACACCTCTGAGAAATTCTGGATGGTAGATGCGCATTCCACCTGCGCCTTGAACAATAGGTTCTAGGATCACCGCAGCAAGTTCTTGATGGTGCGTCTCTATCTTTTGCTCGAAGTCAGTAAGGTCTTCTGATTTCCATTCATCCCAATAACCACAAGTTGGTGACTCCGCAAAGATATGTTCAGGCAAAAAGCCTTTGTAGAGAGAATGCATTGAGTTGTCAGGATCGGTAACCGACATTGCCGCAAAAGTATCGCCGTGATAACCGTGTCGTAAAGTGAGAAACTTTGGTCGGCGTTCGCCCCTGGCATGCCAATATTGCAATGCCATCTTCAAGCTCACTTCTACGGCAACGGAGCCAGAATCAGCAAGAAAGACATGTTCAAGGTTACTTGGTGCGAGGGACAGCAGTTTCTTACACAAATTGATTGCTGGTTGATGGGTAATCCCGCCAAACATCACATGAGAGACCTGCTCGATTTGCTGGTGAGCTGCTTGATTTAAGTGCGGATGATTATAACCGTGGATTGTCGACCACCAAGAAGACATCCCGTCTATAAGCTCAGTGCCATCTTCCAATTTAATACGCACGCCATTTGCTGATGCAACTGGGTAGCAGGTCAGAGGTGTCAACGTCGAAGTATAAGGATGCCAGATATGCTGGCGGTCGAAGGCGAGATCCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP018680|434895:444919|434895_438171_+|WP_074050312.1|DBSCAN-SWA MFKENAKKVETIIKTSEEPQSSRLNGFQRLKECCFIVGVLSSVLLAVALFTFSPADPSWSQTAWGGEINNAGGLFGAWLADTLFFTFGSLAYPLPFLLTAGAWVICRKRSEDDPIDLMLWGTRLLGLVILILTSCGLADINFDDIWYFSSGGVVGDVLTSLSLPTLNVLGTTLVLLFLWGAGFTLFTGISWLKIVEWLGDRALGLVAAVTNKARGTEQETLEPQLDEFAEDRVVSKRGQDDLEDEPLPHLTAYDVEEPKEEAPAHEYPIYMPQSAAEQPKQQSATQPIAQPTPQPAVVNAAPLQSQVQHQAQEQAQSQSVEPVRAVNTDNVDPTVERTKQLNVTIEELEAAAQQADDWADETPQVQVSQAQEPVVAPIVAQPEAPQSVAQEPAPQQVDHQQALQEPFVAPEPTVFEQVENEPSYDEYAQFEADQAQQPQVVQEQALVHAPIEQPTTEQPLHEEPVINAAALDNIAEQTEPSQHIEPTISNFDVLDEEDDYQAQQPVQQAQPTHVEPQQPASQPVQPQAVQEQPMQHQAAPQPVAPAPQDVEVEEVQEGDQDVAAFQNLVSSAQAKVAAQQNPFLVQQEQNLPVPAEPLPTLELLYHPEKRDNFIDRDALEEVARLVETKLADYKIKADVVGIYPGPVITRFELDLAPGVKVSRISGLSMDLARALSAMAVRVVEVIPGKPYVGLELPNMSRQTVFLSDVINSPQFEQATSPTTVVLGQDIAGEAVIADIAKMPHVLVAGTTGSGKSVGVNVMILSMLYKASPEDLRFIMIDPKMLELSIYEGIPHLLSEVVTDMKDASNALRWCVGEMERRYKLMSALGVRNVKGFNEKLKMAAEAGHPIHDPFWQEGDSMDTEPPLLEKLPYIVVVVDEFADLMMVVGKKVEELIARLAQKARAAGIHLILATQRPSVDVITGLIKANIPTRVAFTVSTKTDSRTILDQGGAESLLGMGDMLYLPPGSSHTIRVHGAFASDDDVHAVVNNWKARGKPNYIDEIISGDQGPESLLPGEQMESDEEVDPLFDQVVEHVVQSRRGSVSGVQRRFKIGYNRAARIVEQLEAQGIVSAPGHNGNREVLAPAPPKD >NZ_CP018680|434895:444919|440307_441615_+|WP_005450806.1|tRNA|DBSCAN-SWA MLDSKLLRTELDETAAKLARRGFKLDVETIRKLEEQRKSIQVEVENLQSTRNSISKQIGQKMAAGDKEGAEEIKKQIGTLGSDLDAKKVELEQVMAQLDEFTLSVPNIPADEVPDGKDENDNVEISRWGEPKSYDFELKDHVDLGEMGDGLDFASAVKITGARFIVMKGQFARLHRAIAQFMLDLHTEEHGYTEMYVPYLVNSDSLFGTGQLPKFGKDLFHTEPLVEKVNDEEPRKLSLIPTAEVPVTNLVRDTISDEADLPLKMTAHTPCFRSEAGSYGRDTRGLIRMHQFDKVELVQITKPEDSMNALEELTGHAEKVLQLLELPYRKVVLCTGDMGFGARKTYDLEVWVPAQETYREISSCSNMWDFQARRMQARFRRKGEKKPELVHTLNGSGLAVGRTMVAILENNQEADGRIAIPTVLQKYMAGATHIG >NZ_CP018680|434895:444919|438232_438859_+|WP_005450808.1|DBSCAN-SWA MNKLFSAKLFSALVLSFSLFSTAHAASPKDELNKRLSMNDGFSADFSQQVISPEGETVMEGEGTVEIARPSLFRWSTTFPDENLLVSDGKTLWYYSPFIEQVSIYWQEQATEQTPFVLLTRNRASDWDNYKISQKGDEFTLIPTAVDSTQGQFQINIDAKGVVKGFNVVEQDGQKGLFSFSNVKLGKPKADRFTFTIPDGVEVDDQRN >NZ_CP018680|434895:444919|442710_443139_+|WP_017188206.1|DBSCAN-SWA MLRNISLLGLIFSSSACASSHTNTALFKCDASHPNRLEISIENINSQVLLSELSLGGSSIERSLVIKDFKLDQYHRALVDEKSLEFSIGERVILVSEYFSEEFDEVEKILSVTLREPEQTQHFECEEGSMSNLALLFHESAE >NZ_CP018680|434895:444919|441818_442721_+|WP_049536243.1|DBSCAN-SWA MASIRISGSVGLGGKNVDGDIRTVQRSINQLLGSLKGVKELKVDGKLGSRPENSKTVGAIKAFQKNLVGMARPDGRIDVNGRSHRKLNDYLKRTPETAVAYTLPLVGSRDALTDLDYRKVAETLGCEVAAIKAVAEVESRGEAYFSNGKPKILFEAHIFSRLTSRAYDNSHPSISSRHWNRSLYVGGISEYVRLNKAIELNSNAAIRSASWGRFQIMGFNFKLAGHVTAESFVKAVFESEKKQLEAFVTFIQKSGLGEHIRDKNWAAFARGYNGSEYQKNQYDVKLEKAYKKYASIKNAA >NZ_CP018680|434895:444919|443641_444919_-|WP_009698192.1|DBSCAN-SWA MDLAFDRQHIWHPYTSTLTPLTCYPVASANGVRIKLEDGTELIDGMSSWWSTIHGYNHPHLNQAAHQQIEQVSHVMFGGITHQPAINLCKKLLSLAPSNLEHVFLADSGSVAVEVSLKMALQYWHARGERRPKFLTLRHGYHGDTFAAMSVTDPDNSMHSLYKGFLPEHIFAESPTCGYWDEWKSEDLTDFEQKIETHHQELAAVILEPIVQGAGGMRIYHPEFLRGVRKLCDKYGLLLIADEIATGFGRTGKLFACEHADIQPDILCVGKALTGGYMTLSATLASKHVADTVCGGDAGCFMHGPTFMGNPLACAVATASLELIEQGNWQQQTQQIETLFSELLPKLETYDLVKSTRWLGAIGVVETYSPVNMETIQALFVEHGVWIRPFGKLIYMMPPFISKPDEIEKLVNAIDAALQREDCFA >NZ_CP018680|434895:444919|443276_443543_+|WP_005450802.1|DBSCAN-SWA MAVGWANDDSVSQQIQHTIDDEISRVRGNIKQGESAHYCDECGDEIPEARRTAMKGVRLCIECQSTIELVSQRQALFNRRASKDSQLR >NZ_CP018680|434895:444919|438865_440215_+|WP_009699193.1|DBSCAN-SWA MSNYTLDFSGDEDFRPLAARMRPETIEQYIGQQHILGPGKPLRRALEAGHIHSMILWGPPGTGKTTLAEVAANYANAEVERVSAVTSGVKEIRAAIEKARENKMAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTVTFIGATTENPSFELNNALLSRARVYKLTSLDKEEISLALNQAISDKERGLGNTPAHFADNVLDRLAELVNGDARMSLNYLELLYDMAEDNAQGEKEITLKLLAEVAGEKVSRFDNKGDIWYDLISAVHKSIRGSNPDAALYWAARMMAAGCDPLYIARRLLAIASEDVGNADPRAMQVALSAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVYVAWKQALTDAHNLPEYEVPHHLRNAPTSLMKDMGYGAEYRYAHDEPGAYAAGEQYLPPEMGDRKYYQPTNRGLETKIGEKLDYLATLDANSPQKRYEK |
8 | Mycobacterium_phage(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
878159 : 889273
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP018680|878159:889273|DBSCAN-SWA TATGAAAGATAAATATTTAGAGATATACCAGCAGGAAGCGTTACAAGAGGCACTTGTCGAACTCAAAGAGGCAAGGCAACGTGAGAAGTTATTGGCTGATGAAAACAAAGCCATTCTGTCAGCAATCTCTGCAATGAGTGAAGCTCGAAACCGTCACGAGATTTTCTCCGGTCTAAACAGTGTTCTAAAGAAGTACATTGATTTTGATGACTTCATCGTCATTACTCGCGACTGTAATAGACAAAACTTCAAGACGTTATTATCAACAAATAGTGTTTTTGATAGAGCAGATTGGTTACCGGGTAACGCGTCGGAAAGGGCTCTTAATGGCGAATGTATTTTGTTGTTCGAGCCGATGCGATTAAAGGAGTTTGAGAGCTTAAACTCCTTTATTAAAAGCCACGTGAATTCAATGCTACTTACGGGCATTCGATCTGAAGTAACCCAGACTATTATCATCCTTGTTGGAGCTCAAAAAGGTCACTTCAGTATTGAAAACAAAGAAACGTTAAGGCGTTTTAGACCCTTAATTGAAAGGGCTGTTATCGATATTGAAACCAAAGAAAAGTTACAACGTATCGTTGAGGTGAGGACGTCTCAATTAGCACGGGCTAGAGAAGAAGCAGAAAGAGCTAACCAATCGAAGTCTGAGTTTTTGGCCATGATGAGTCACGAAATTCGAACGCCTTTGAACTCAGTTCTTGGGATGTTAGACATCTTGAGACAATCAACTTTAACCGATGCCCAGTCAGACGTATTGAACCAAATGGAATGCTCTGCAGATTTGCTATTGGCGATTATCAGCGACATTTTGGATTTATCAAAGATAGAGTCAGGAAGCTTTCATTTAAACGAACAGTGGACAGACTTAAGTGACGCGGTAACACTAATTGTTTCTCAACAAAAACAAGTGGCTACAAGCAAAAGTTTGAGCTTTGACCTACACTGCAATCTAGATAGAAATAAACAGTACTGGATTGATTCTACTCGTATATCACAGGTGCTTTTTAACTTGATTGGAAACGCGATTAAGTTTACAGACTTTGGAGATGTGCATGTTTCTGTGCTAGAAAGCAATGGTGAAATCATCATATCTGTCTCAGATACTGGGATAGGAATCCCTCAAGCTAAAATTGGTCATTTGTTTACAGCTTTTCATCAAGGCGACAGCTCAATTACTCGTCGATTCGGTGGTACCGGGCTTGGACTTGCCATCACCAAACACCTTGTTGAGATGATGCGAGGTTCAATTTCTGTAAAAAGTGAAGAGAGAGTCGGCTCTCATTTTACAGTCAAAATCCCGGTTCTTACCAGGACTAACCAAGAAAGACCGGTAAAGATAGAATCAAATCAGCCATCAAAGTCGACCAACATTTTAGTCGTGGAAGATACTGAATCTAACCAGTTAGTGATAAAGCTTATTCTGAATAAACTCGGGCACAATGTATTTATTGCCAGCCACGGCGCGGAAGCCTTAACGTTTTTGGAGCAACAAACACAAGACATCGATATCATTTTAATGGATGTGTCGATGCCCGTTATGGATGGCATTACAGCGACAAGACTTATCAGAAAGAAAGGAATCAAAACACCAATCATTGCATTGACTGCACACGCGCTTGAGAGTGATAAAGCGACATGCATGAAAGCTGGCATGGACGGATTTGTATCAAAGCCAGTTCGCAGACAGGATATCTACGAAGCGATCCATTCATTAGTGGAGATCGCGTAGCGAATTAATATAGATCTTAACAATCGTAAGATTATTAAGATCTATATTAAGTATTTTATGGTAAATCGCGGTTATGTTTAATGACACTCATAAAAAGCCTCATATTTTAAAATCTGAGAGGGATAGGAGCCCAAACGAAGAAAGAATTGGCGTACGTTGATATTCTATTTAACATAATATACATAATGCGCACTGAATTGAGGATTCCACTGGGCTTGAAAATGCTACGGCAAATCCTGCACAAGCCATTGAAATACCTGTTAATCCAAGCCCATTAAACTTTTTTGCAATATTCGCCCAAGCTTGTTGCGCTTCGTACGTTTTCGCTTTGTCAGCTGCTAGATAAACAAGGACAGATTCTGTGTCTGCCCCGATTTCTTTTGCCAGAAAAAGAGCTTCATTTTCAGTGAGATATCTTTGCCCATTTCTAATGGCACTAATCTTTTGGCGGCTTAAACCTAAATCATGAGCCACTTGCTTGTCTTGGATATAGTTCTTCGCCTTTTTGTAGGCATCTAACAGTTCATTTGTGTACATAGGAATCCTCCAGTCCTGATTATTGTATACCCGCAGTCCACAAAAAGCGGTCTTTACAGTCCTGATTTTTGTGTCTTATAGTCCACACAAATCAGGACTCGCACGCTTGAGTCCTCAAGTTTAGACCACCTTGGTTCGGGCGTTTGCCCTTGACGCTTTCGTCTGTCCTTGGTGGTCGCTCTCAACGGTCAAGGTGTTGCTATGAAAAAACTGTCTACTGAAAATGCGATCATTATCGATACGGAAACTACTGGCTTAGGCTTTGATGCAGAAATTGTCGAGTTCACTGCTATCTGTGCTGACTCTGGTAAAGTTATCGTAAACGAGCTGGTTAAACCGACTTGTTCCATTCCTGCTGAAGCGACGGCAATTCATGGCATCACCGATGAAGATGTTAAGGACGCGCCCGACTTTCATCTTGTTTTTTCAAACCGTTTTCTTCCGCTTCTTAACGGTCGTCCAATCATCATCTACAACTCAGATTTTGATACGCGCTTAATCATCCAATCGTTAGACAAGCACTGTAACTCTGCTTACGTCCAATCCGTTGAAGATTTGTTTTTCAAATTCTGTGTTCCTCATTGTGCGATGCTTTGGTACGCCGAGTTCTTCGGTGCTTGGAATGACCAGCACGAAAATTACAAGTGGCAATCTCTGACTAACGCTTGTGCGCAACAACATGTCGATGTGTCTGACTTAACCGCGCACCGAGCGCTGGCTGATTGTGAAATGACTCGTCGATTGATCCACGCGGTTAACTCACAGCTTGAAAAACAAACTAATCAAAACTGTGACGGCGTCACTAATGTTTTCTCGGAGTGATAACCATGGAACGTGATAACTCATTTACATGCTTAGTTCTTCTCGCGGGTTTAATCGCTGCGGTGTTCTTGTTTTATAAGAACAATTACATGGACGCTACGGCTGAGTCTTACGCCAAAGTCCAAACGTGGGTTGATGAACATCCATCAGCAAAACCGCTGTTAAACGAGCTGTTGAGCGATGGCAAGTTAACGCCTAACGAAGTCGCTGACATCCGTATTTTCATCAAAGAAGAGCCAAAACGTCTTCTTATTTCTCAGGCAACTGAGGCGCGTTAACCATGAACGAAGCTCAAATCATCTATTACGACTTGCTTCCTGATTACACGGTTTCCGTGTTGGTCAAAGGCTGTGATGAATGGGATTTGCTTAAGTCGATGTCTCATCTTGAGTCTTGGGCTTCGTCTCAATTTCTATCTTACGAGTTGGTTTCTATCACCAACACAACTTACCAAGAGCGCGTTGATTTAGGGGTGTTCGATGACTACTGCAACTAACATCCTCAAGAAGTTCGATGAGCAATCAGTTCATATTGACTACCTATGTTTTACATTTGCCGTCAAGGACTTACGCCATTGTCATAACGCGCTTCAACGTCTGCATAAGCATGAGGAATACCGAGGATTAGCGCCTAAAACACTGTTACAGCGTAACTGTAAGGCACCTAAGTTCCCTGCTCCACCTCAGTTTAATTCGACCATTGCCAAGACAGCAGATGAGATTAATGCGTACAACGACGCCTTTGAAATCTGTTACCGAAACTACTTAGAAGAATGTCTACGCATCTTCACCAATCAAGTGTTGGGCTTGTCGCTGTCTGCGCCTCGTGGGCTTGGTTTTCAGTTCTACACCGAATCCATGAAACTCACCTCAGCGAACGGTGAAGACTTCTGTGGTTTCGTTGGTATTGGTGGCAACAATGACACGGTGCATTTTCAGATTAACGGTAAAGGCTGCAAGCATGTCTTTGCCCGTCGCGCGCCTTGGTCACTGCATGACTGGCTGACTAACGTATTGGGTGTGCAAACATTGGCGCGTGTTGACCTCGCTTATGACGATTACGACGGGATTTTTGATTGTGAATACGCGCGCACGGCTTGGAATGACAATGCATTTCGAACGGCTGAACGCGGTCGCAGTCCAGTGCTGCATGTTGACCACACCATTGCTGGCTATCGCGGTGGTCGCCCTGATTACACCAAAGAGCAATATTCGATTGGCTCTCGTACCTCTCGCATTTACTGGCGTGTTTATAACAAGGCACTTGAGCAGAAACTCGCGAACACGGGACTGGTTTGGTATCGCTCCGAGGTCGAACTTAAAAAATGGAACATCGACGTTCTGCTTAACCCAGCTGGTGCATTCGCTGCACTGAATGACTTCTCGGCGTCGATTTCTACCGCAAAGAAATTCAATACAAAACCTGTCCCGACCAAACGCGCGGCGTTAGACCTGTTGGCTTCGGCTCACTGGATGCGTCGTCAGTACGGGAAAATCCTCAACTCTTTAATCGAATTCCATGAGGGTGACATTGAAACCGTGGTCGGTTCCCTTGTCCGTGATGGAACGAAATTCACCTTCCCCGATACCTACGGCAAGTTGGTGACTCACATATTGGAGACTTAACAAATGGCTAAATCTGTTTTTGTTCTAGGCATGGACATCACTTGGAACTCAGCGCGTGGTGACAGTGCGCAACTGAACGTGTCACGTCCATTACGTGAAATTAACTCGGAGAAATTCAAGCGTCGCACCATTGGCGAATCGGGTGATGTTAACCCGCAATGGGATCAACCTTTGATGATTGATCATACTTACGCCCTGCTCCTTGAACGCACTGGCGCGCTTGTTCCTCGCCGTGAGTACGAGCTGCGTTTGGAAATCAACCCAGACGACCCATTGGCCGGTGCGATTGTGACGGAGTTGATCCCCGTCGACCAAGAAATCAAGAAGCACTTTGAAACTTCAATGAAAGCGCAAGGTTAATCCATGCCTATTTGTGTCGATGTTACTCGTCGTGGTTGGCTTCAAGCCACTGGAGAATCTACTCAGCACTGTAGTTCTTACGTGATGATGTCGGTCACTGACTTCAATCAATATCAAGAGCCTGTCGCCTTCAATAGCGACCTGTTCTTATATGTAAGTGGCGTGCTTTTGGTCAACATGATTGTTGGTCATTGGTCAGGCCGTGTTGTTCGCCTTATGAGTAAAAGGTAATTCTTATGAAAAAACTAGAACTTGTTGTAAATAACGTAAAAAGCGCCGTGGTAAACAAGAAAGTCGCGGTGGGTGCTGCGCTTATGGTCGCGTCGGTTTCGCCTGCATTTGCTGACGGCATCACCGATGCCATCACGGCTGCCACAACCAGTGGTCAAGCTAACGTATCGTTGACGGTTGCAGGTCTTATCGGTATGGCGGCACTGGGCTTTGGTGTAACGATGATTGTTGGCTTCCTTCGTAAGTAACGGTTCGTCTCTATGCCTCCGATATCTGGTAATTTACTTGGGGATGTTCTCGCTATCGTTCTAGGTGTTGCTTTTGCGGGGGCATTTCTCCATGGCTTTGTGAGTGGCATCAATACTCACTAATCAGTGAATCAGGGGGCTTCGGCTCCCTTTTTTCTTGGTTGTTCTATGAAAAAACTACTTCACATCCTGCCGTTACTTTTGCTGTCCTTTTACGCAAATGCTAACCAAGTCTTTTATGCCAAGGTAACCAGCTTCGTTAACAGCGCTGAAGAGCCGCTTGCTCGCCAATGTGCGAACATCGGTGTGGGCTCTTATATCGGTTATTCCTCGGGTAATAAAACCATTCCTCTTCGGAAGATTTTGCCTAACGGCATCAACTGTATTTCTATCCAAATCGGTCCTTCTGCTTCGGTTATTTATACTCGTAACTCGGTTCAACATCGGATTTGGTTTAACTGGGAGTTTCAAGATACCTGCCCTGAAAATCAGGAATACAATCCGACTACCCGATTGTGTGAATCCCAATGTGAGTTTGGTGAAAACCCTGACGGCTCTTGTATGGATGCTTGTCAGTTCAAGCAATCTGTTGATGAACAAAAGCTGCTTCATTGGTCAGCGTATGTTTATGGGCCTGAAGTAACTGGCGCTTGTTATGGTGATTTCGGTGCTACTCGGTGTGAGCTGCGTCGTCCTCCGGTTTCAGTGACGGTATGTACTGATGCGGATTCTGGTGAGTTTACGCAAAACACGACTTGCACGTCACAGTTTGTCTTTACTGGAAAACAATGCGATGGCGGTACACTATTTTGGGGAAAGAATGGTCCAGATGAGCCATTTGATCCTGATAACCCCGACGACCCAGAGCATAAACCTGATGACCCTACGGGTGACATCGAAGACCCAAGCGTCTTACCCGATGACTCCACCAATACGGTGACGCCTCCTGATGTGAATGACAAGCCAGATGTGGAAGACCCGGACACCGATGGTTCAACAGACACGGCAGTTCTTTCCGCGATTAAAGGGCTTAATGCTGACGTGAACAGTGGCATTCATGACTTAAACGTAGATGTGAATGAATCCCACGCCAAGATTAACAATGCGGTCATTGACCTAAAAGCCTCGGTGGTCGGCAACACGCAAGCCATTCAAAAGCAGCAAATCAACGACAACAAGATTTACAACAACACCAAAGCGCTCATCCAACAAGCGAATGGGGACATCACCACCGCCGTCAATAAGAACACCAACGCAACCGTCAAAGGACTCAAAGAACTTGATGCTTCCGTTGGCGACTTGAACGGCAATTTAGATGACATTAAAGGACTGCTTACGGGCGGTAACTTTGATTCACCCAATGGTGAGGACGTTGCCGAGGTTATCTTCTCTTCAGATGACTTCGTATCGATAAACGAAACCATTCACGACAAGCGTCAATCGATTCAAGATTACGTCGACCAAGTGAAAGGGCTTGTCTCTATTAGCACCAATTTTAATAACGGCTCACTCAGTGATAAGTCATTCACCGTTAAAGGCACCACGGTTGAATCGGGTTTACAACGTTTCGATTCGGTCTCTGGCTACGTTCGCCCAGTGGTGCTGTTTATCTGCGCTTTGATAGCACTTTGGATCTTGTTCGGTCCTAGGAGTAAATAACATGGATTACATCTACGCAGCTTTGGAGTTCATTGCCAATGTTGGTCAAACGCTGCTCGACTTCATTCAAAACATCCCTGACCTGATCATTAATTGTGTCGAGTATGGTGCGCTGTGGTGCATCTCTATTTGGCTGGATATCAAAATCGCATCAATTCAGATTTCATTGAGGATCGCGCAGACGTTACTGGCAGACTATGGCGTCTATACCTTGATTGAGAGTAACTTTAATTCCCTTCCCTCTGACGTGCGTTACATCCTAACTCAGTACGGGGTGACGTCTGGCTTGCGGGTTATCTTTGACTCGTTCGCCGCTTCGCTTGTCATGCGCTTCTTTAACTGGTGATAACATGGCAACTTCATTTCGATACGGTCATGGCGGCTCTTACAAATCAGCATGCGCAGTATGGTTTGACCTACTTCCGGCACTTCGTGAAGGTCGAGTCTGTATTACCAACATTCATGGTATGCAGCCCTTAGAAGTGATCGAGAAAAGACTCGGTGAGAAGTTTCCTGATAGTGCGCGCCTTATTCGCATTAGCTCTCGAAACCCTGACGGCTTCGAGCTTTGGAAGTACTTTTTCTGCTGGGCACCTATTGGCTCTTTTATCCTCATCGATGAGTGTCAGCAAATCTATTCCACCAATGCTGGCTTCAAGATGGCGAACATACACAAGCGCCCTTTTACTGACTTTGAGCCTCACTTACCGCAAGGATTTTCAGAAATCTTCCACTCTCGCTGGCTTACTGTGGATACATCCAGTTTAGACCGTGGTGAAGTCGATGACTGCCAACGTACCCGCTTTGATGAGCAAGGACGCATCATCTATCCCGAGAACTTCAACAACGCGTTTATGGAGCATCGCCATTACAACTGGGACATTGTGTTACTGACGCCCGACTTCGCTCAAATCCCTAAAGAGTTAAAAGGCGTCGCCGAGCTGGCAAAGCAACACAAAGGGAAAGACGGGATCTTCTTTTCTAATCGTAAGCCAAGAATATTGGAGCATGACCCGCTTCGTACTGTCACGGTGCCAAGTAAAGATGATGTGGTTTACAACCTGAAAGTCCCTCTCGATGTGCATTTGCTTTATGCCTCTACGGTGACTGGACAAATCACTAAATCGGGGCTTGGTAAGAACATCTTTCTTAATCCTAAGTTCTTAGCCGCTGTGGCTATCTTCATTCTTTCAATGGGATATTTAACGTATGCGCTTATTGGTATTTTTTCTGGTTCTGAGGAGACATCTTCGCAAGGAACGCCAGCTCATCAAACTTCGCAGCAAGGTGCTGTTTCATCTTCAAACCGTCAAACGCGCCCTAATCAAAACCATGCGGTTCATTCTGTCGTGGGTTCTGGTGGTTCTGATTGTTCGGGCGTTGGTTGCGATGGTCGGTCTTATTATGATGTAGGCTCGGTTCCTGCTTGGTTCCCACTATCGAATTCTGAAAGTATCTATGTCTCAGCGGTTGAGCGCTGGTACAAGAAAAAGGTCGTTTACGTGAATGTTCATTTCGAAATCACCACGCCTAGAGGTGTGTCTTATCTTGATGATGTGTTCCTAAAGAAAGTCGGCATTCAGATGGAATACCTCGACGATTGCTTGGTGAAGCTATCGGACGGCGAATCTAACTTTTTTGTGACGTGTTCACCCTATGAGCAGATTGCACAAAACCAGAAATCAGACATTGAGCTCAAGCCTGTTGGTGGGCTGTTTGGAGGCGATGAAAGCTAATGAACGAATACGTAACCCATGGGCAACTGCTCGAAATCATAGAACTGTTTGACCATCTCTCCATGCTTAACGCGATCATAGTCATCATCGTTTACGACCTCTTTCGTAGTGGTGTTCGAATGCTGTCTGACTATCTGAATAAGGAAAACGGACAATGAAGATGACATCAGAACGGTTTAGTCGTGCGGTCTATAACTCCCCGCTTGGCGCGTTCGTTTTAGTTGGGCCACCAACATTTGAGCAGCTTCAGGAACGTAGGCTATTTGTTTTACGGTTTGAGATAGCGATGAGCAAAGCCATTTATGGCTGAGTTCGTTTGCTATCAAAACACGAACCGGTGCCAGGATTATCGATGCCATTTTGCCATCGTAATCGAACCAGGGAACTTGCTGCAGAGAGGTAAAATGATGGCAATTACAATTCGAGATACTCAGGAACACACAGAAATGCTCTCACAGCTCAAAGAGCAAACGGGCACATCAACTATGAGTAAAGCGCTGCTCAAAGGTGGCTATGATGCCTTGAAGTACAAAGAACTATACCTTGCAGAACGTAGGAAGAATGAACAGCTTAGAGGCGAACTGTATAGGCATACAGAAGCAATCAATGACTATGTTGGTGCGCTCGATAGCTTACGAGAGTTAACGCGATAAGGATGGGCATCGCCCCGACCGAAGGGAGTCACCGAGATATAAGGAGTTGCGAAGCGACGACGAAGCACCGAGCCACCCACCACTGTCGATCTTGGGCACTTGCTTAGACTGGCGAGTGTCCCTATCTGCCCAAGCCTAAATAGGTAATACCACCCCTCGCCCTGCTAGAATCTGCCTTGCAGAGACTCACCACATCAAAGGCGCTTCAACCTACTGGAACAAACCGAGCTATCAACATCAAAGCTTTGCGAGTGCCGAGCAAGGCTTTTCCTGTATCAGATGAGACCTACCGACCAGAGCGCGTAGCAAGTGAGGACGGGCTAGGACGATTGCGCTACGTGCGCGGGAGGTCAAACCCCCCGAATCTGTATTACGGGGGTAAATTCCACCTACACTCATGATATTAGGGTTTTAAAGAAAAACTAATTGTCTATAATTTATTTGACTTAATATGGAACTAAAGGTGTAGTGTGAACATAAACTTTTACAAGTTAGACGACGATTTACAGGATGAATGGGATATTACTTTAAACGAACCTCTTCTAGTAATGGAGAAAGCAGAAAGTAAAGATATCCCATCTGTAGGCGAGTTTATGGTGATCCCCGCAGGTACTTTTTTATATAAGGTAACTAAAATAATAAGGAAGGTTGATGACGCCACGAAAGGTCATTTTACAACTTCGGAGATAGATGTAGTGTTAACGCCCACAGTTATTGACTAAGCCACGAAATCACATACTGAATATCAAGAGCGTCATGCAAAAAATTACTATACAGGCTACTGAGAGTATAAGGTTGTTAGTATGGATATGTAGTTCAATAATGCGACTTTGGCTAACAAGTAAATTTGACCTAGCTTTATCTTCATCATTCCAGCCGTAGCTATCACTTGCTGACCTATTTTCCAAGTAGTCACCCAACAACTTGGACATCTTTTCAAGTTTGGTTAATGGATTGCCAAACAGGACATTTAAGGGAGTGATTTTCGCTTCTAAAGAATATTCCACTAGACGTTTAGAGTATTCGAGTACGCCTAATCTCTCAATAGTATCCATATTGTATCCCTCGCTGGTACAGCCCTCTTGTATATTATAGACACTTTAATGCATGACATTGCTTATAAAAAGGCTCAAGTGAGCCTTTGTGTTAGACCAATTTCTTTAAAGCCCTCGCATACTTTAGGATCTGATGAGCCACTTCGATATCATTAGAAGCACCTAGCTCTAACAGCGCTACACCAATTAAAACTTGTTGAGCTGTTACTAGTTGTCCGGTCGGAAGCTCTAAGCGGTCATGCCTCATCACGAAGTTTTCCCAATCATCGCAGATGCTCAATTCCCTACCCTTGTTCATGCGCATCAGCCTTCTACATTCTGGCGGAATTGCATTTCCCTTATCCCACTGTTTGATCGTCCTCACACTTTTCAAACAAAGTTCAGCAGCTTCTTCAACGGTTAAACCACATTCAAACTCACGAAAAATATAGTTTTTTGTCATTTCGTGATACTTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP018680|878159:889273|888907_889273_-|WP_045489002.1|DBSCAN-SWA MKYHEMTKNYIFREFECGLTVEEAAELCLKSVRTIKQWDKGNAIPPECRRLMRMNKGRELSICDDWENFVMRHDRLELPTGQLVTAQQVLIGVALLELGASNDIEVAHQILKYARALKKLV >NZ_CP018680|878159:889273|880628_881249_+|WP_074050433.1|DBSCAN-SWA MKKLSTENAIIIDTETTGLGFDAEIVEFTAICADSGKVIVNELVKPTCSIPAEATAIHGITDEDVKDAPDFHLVFSNRFLPLLNGRPIIIYNSDFDTRLIIQSLDKHCNSAYVQSVEDLFFKFCVPHCAMLWYAEFFGAWNDQHENYKWQSLTNACAQQHVDVSDLTAHRALADCEMTRRLIHAVNSQLEKQTNQNCDGVTNVFSE >NZ_CP018680|878159:889273|888231_888483_+|WP_045488997.1|DBSCAN-SWA MNINFYKLDDDLQDEWDITLNEPLLVMEKAESKDIPSVGEFMVIPAGTFLYKVTKIIRKVDDATKGHFTTSEIDVVLTPTVID >NZ_CP018680|878159:889273|881728_882874_+|WP_074050435.1|DBSCAN-SWA MTTATNILKKFDEQSVHIDYLCFTFAVKDLRHCHNALQRLHKHEEYRGLAPKTLLQRNCKAPKFPAPPQFNSTIAKTADEINAYNDAFEICYRNYLEECLRIFTNQVLGLSLSAPRGLGFQFYTESMKLTSANGEDFCGFVGIGGNNDTVHFQINGKGCKHVFARRAPWSLHDWLTNVLGVQTLARVDLAYDDYDGIFDCEYARTAWNDNAFRTAERGRSPVLHVDHTIAGYRGGRPDYTKEQYSIGSRTSRIYWRVYNKALEQKLANTGLVWYRSEVELKKWNIDVLLNPAGAFAALNDFSASISTAKKFNTKPVPTKRAALDLLASAHWMRRQYGKILNSLIEFHEGDIETVVGSLVRDGTKFTFPDTYGKLVTHILET >NZ_CP018680|878159:889273|885722_887108_+|WP_074050437.1|DBSCAN-SWA MATSFRYGHGGSYKSACAVWFDLLPALREGRVCITNIHGMQPLEVIEKRLGEKFPDSARLIRISSRNPDGFELWKYFFCWAPIGSFILIDECQQIYSTNAGFKMANIHKRPFTDFEPHLPQGFSEIFHSRWLTVDTSSLDRGEVDDCQRTRFDEQGRIIYPENFNNAFMEHRHYNWDIVLLTPDFAQIPKELKGVAELAKQHKGKDGIFFSNRKPRILEHDPLRTVTVPSKDDVVYNLKVPLDVHLLYASTVTGQITKSGLGKNIFLNPKFLAAVAIFILSMGYLTYALIGIFSGSEETSSQGTPAHQTSQQGAVSSSNRQTRPNQNHAVHSVVGSGGSDCSGVGCDGRSYYDVGSVPAWFPLSNSESIYVSAVERWYKKKVVYVNVHFEITTPRGVSYLDDVFLKKVGIQMEYLDDCLVKLSDGESNFFVTCSPYEQIAQNQKSDIELKPVGGLFGGDES >NZ_CP018680|878159:889273|884274_885372_+|WP_186293484.1|DBSCAN-SWA MDACQFKQSVDEQKLLHWSAYVYGPEVTGACYGDFGATRCELRRPPVSVTVCTDADSGEFTQNTTCTSQFVFTGKQCDGGTLFWGKNGPDEPFDPDNPDDPEHKPDDPTGDIEDPSVLPDDSTNTVTPPDVNDKPDVEDPDTDGSTDTAVLSAIKGLNADVNSGIHDLNVDVNESHAKINNAVIDLKASVVGNTQAIQKQQINDNKIYNNTKALIQQANGDITTAVNKNTNATVKGLKELDASVGDLNGNLDDIKGLLTGGNFDSPNGEDVAEVIFSSDDFVSINETIHDKRQSIQDYVDQVKGLVSISTNFNNGSLSDKSFTVKGTTVESGLQRFDSVSGYVRPVVLFICALIALWILFGPRSK >NZ_CP018680|878159:889273|882877_883234_+|WP_017190168.1|DBSCAN-SWA MAKSVFVLGMDITWNSARGDSAQLNVSRPLREINSEKFKRRTIGESGDVNPQWDQPLMIDHTYALLLERTGALVPRREYELRLEINPDDPLAGAIVTELIPVDQEIKKHFETSMKAQG >NZ_CP018680|878159:889273|887107_887266_+|WP_005445749.1|DBSCAN-SWA MNEYVTHGQLLEIIELFDHLSMLNAIIVIIVYDLFRSGVRMLSDYLNKENGQ >NZ_CP018680|878159:889273|881529_881745_+|WP_009697828.1|DBSCAN-SWA MNEAQIIYYDLLPDYTVSVLVKGCDEWDLLKSMSHLESWASSQFLSYELVSITNTTYQERVDLGVFDDYCN >NZ_CP018680|878159:889273|883237_883465_+|WP_045487871.1|DBSCAN-SWA MPICVDVTRRGWLQATGESTQHCSSYVMMSVTDFNQYQEPVAFNSDLFLYVSGVLLVNMIVGHWSGRVVRLMSKR >NZ_CP018680|878159:889273|878159_879890_+|WP_074050432.1|DBSCAN-SWA MKDKYLEIYQQEALQEALVELKEARQREKLLADENKAILSAISAMSEARNRHEIFSGLNSVLKKYIDFDDFIVITRDCNRQNFKTLLSTNSVFDRADWLPGNASERALNGECILLFEPMRLKEFESLNSFIKSHVNSMLLTGIRSEVTQTIIILVGAQKGHFSIENKETLRRFRPLIERAVIDIETKEKLQRIVEVRTSQLARAREEAERANQSKSEFLAMMSHEIRTPLNSVLGMLDILRQSTLTDAQSDVLNQMECSADLLLAIISDILDLSKIESGSFHLNEQWTDLSDAVTLIVSQQKQVATSKSLSFDLHCNLDRNKQYWIDSTRISQVLFNLIGNAIKFTDFGDVHVSVLESNGEIIISVSDTGIGIPQAKIGHLFTAFHQGDSSITRRFGGTGLGLAITKHLVEMMRGSISVKSEERVGSHFTVKIPVLTRTNQERPVKIESNQPSKSTNILVVEDTESNQLVIKLILNKLGHNVFIASHGAEALTFLEQQTQDIDIILMDVSMPVMDGITATRLIRKKGIKTPIIALTAHALESDKATCMKAGMDGFVSKPVRRQDIYEAIHSLVEIA >NZ_CP018680|878159:889273|887515_887761_+|WP_033008583.1|DBSCAN-SWA MAITIRDTQEHTEMLSQLKEQTGTSTMSKALLKGGYDALKYKELYLAERRKNEQLRGELYRHTEAINDYVGALDSLRELTR >NZ_CP018680|878159:889273|880058_880427_-|WP_045489333.1|DBSCAN-SWA MYTNELLDAYKKAKNYIQDKQVAHDLGLSRQKISAIRNGQRYLTENEALFLAKEIGADTESVLVYLAADKAKTYEAQQAWANIAKKFNGLGLTGISMACAGFAVAFSSPVESSIQCALCILC >NZ_CP018680|878159:889273|883470_883713_+|WP_005445757.1|DBSCAN-SWA MKKLELVVNNVKSAVVNKKVAVGAALMVASVSPAFADGITDAITAATTSGQANVSLTVAGLIGMAALGFGVTMIVGFLRK >NZ_CP018680|878159:889273|881254_881527_+|WP_074050434.1|DBSCAN-SWA MERDNSFTCLVLLAGLIAAVFLFYKNNYMDATAESYAKVQTWVDEHPSAKPLLNELLSDGKLTPNEVADIRIFIKEEPKRLLISQATEAR >NZ_CP018680|878159:889273|885373_885718_+|WP_005445753.1|DBSCAN-SWA MDYIYAALEFIANVGQTLLDFIQNIPDLIINCVEYGALWCISIWLDIKIASIQISLRIAQTLLADYGVYTLIESNFNSLPSDVRYILTQYGVTSGLRVIFDSFAASLVMRFFNW >NZ_CP018680|878159:889273|888492_888816_-|WP_045488999.1|DBSCAN-SWA MDTIERLGVLEYSKRLVEYSLEAKITPLNVLFGNPLTKLEKMSKLLGDYLENRSASDSYGWNDEDKARSNLLVSQSRIIELHIHTNNLILSVACIVIFCMTLLIFSM |
17 | Vibrio_phage(91.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
2064667 : 2082377
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP018680|2064667:2082377|DBSCAN-SWA CTTATAGGCGTTCTTCTAGCCAAGGCTGTACAGACTGAATCGCCGCAGGTAGCGCTGCAACATCAGTACCACCTGCTTGTGCCATATCTGGGCGACCACCGCCTTTACCGCCAACTTGCTCAGCAACCATCTTAACTAGGTCACCCGCTTTCACTTTGCCGATAAGGTCTTTGGTTACACCTGCAATCAGACCGATCTTGTCATCGTTCACGTTTGCAAGAAGGATAACGCCGCTACCGACTTGGTTTTTGATGTCATCAACCATAGTACGTAGGTTCTTGTTATCTGCGCCTTCTAGAGCAGCAATCAGTACTTTAGTACCGTTGATGTCCTGAACTTTGCCCATGATGTTTGCGCTTTCTGCCGCAGCCATCTTGTCTTTCAGTTTCTGGATTTCTTTCTCTAGAGATTTTGCTTTCTGTGCTGATTCAGCCAGTTTCTCTTCGTACTTAGCCGCTTGAGCTTCGATTGCATCTAGTGCACCTTCACCCGTTACCGCTTCGATACGACGGATACCAGCTGCGATACCACCTTCAGATGTGATCTTGAATAGACCGATGTCACCTGTGTTTGAAGCGTGGATACCACCACAAAGTTCAGTAGAGAAGTCGCCCATAGACAGTACGCGTACTTCGTCGTCGTACTTCTCGCCGAATAGTGCCATTGCGCCTTTCTTCTTAGCAGACTCAATGTCCATCACGTTGGTTTCAATCGTGTGGTTACGACGAATTTGCGCGTTAACCAGACGCTCAACTTCTTTCAATTCTGCTGCAGTTACGGCTTCTAGGTGTGAGAAGTCGAAACGTAGGCTGTCAGCTTTCACTAGAGAGCCTTTCTGCGTTACGTGCTCACCAAGAACTTGGCGAAGTGCTGCGTGTAGCAAGTGCGTTGCTGAGTGGTTTAGAGAAATAGCTGCGCGACGCTCTGCGTCAACGATAGTTGCTACTTCGTCACCTTTCGCTAGAACGCCTTCTGCCATTACACCGTGGTGTGCGATTGCGTTACCTAGTTTTTGAGTGTCTTCAACACGGAATACACCAGACTCAGTGCGGATTTCACCAGCGTCACCACACTGACCGCCTGACTCTGCGTAGAATGGTGTTTCACCAAGCACGATGATTGCTTTGTCGCCTGCAGATAGAGATTCTACTTCGTTGCCTTCAACGAACATCGCCGCTACAGAGCTAGAGCCTTTAGTGCCTGCGTAACCACAGAATTCTGTTTGCGTATCAACTTTGATTGCTGCGTTGTAGTCAGTACCAAAGTTACCAGCTTCACGCGCACGTTGACGCTGCTCTTCCATCGCTTTCTCGAAACCTTCTTCGTCGATTGCAAACTCACGTTCACGTGCAACGTCGTTAGTTAAGTCAGCTGGGAAGCCGTAAGTATCGTAAAGTTTAAATACGGTTTCGCCGTCTAGAACTTTACCGTCTAGGTTATCTAGCGCTTCGTTTAGAATCGCCATACCGCGCTCAAGTGTACGACCGAAGTTTTCTTCTTCGATACGCAGTACTTTTTCAACCACAGCTTGTTGACGCTTCAGCTCTTCACCAGCCGTACCCATGATTTCCGCTAGTACACCAACCAGTTTGTGGAAGAATGCACCTTGTGCGCCTAGCTTGTTACCGTGACGAACTGCACGACGGATGATACGACGTAGAACGTAACCACGACCTTCGTTTGAAGGCATAACGCCATCAACGATCAGGAATGAACATGAACGGATGTGGTCAGCGATAACGCGTAGCGATTGGTTTGATAGATCTTCGTAGCCAATCACTTCAGCAGCAGCTTTGATAAGAGCTTGGAATACATCGATTTCGTAGTTTGAGTGAACGCCTTGCATGATTGCAGAGATACGCTCGATACCCATACCAGTATCTACCGATGGTTTTGGTAGCGGTTCCATAGTGCCGTCTGCGTGACGGTTGAACTGCATGAATACGTTGTTCCAGATCTCGATGAAACGGTCACCATCTTCTTCAGGTGTGCCAGGACGGCCACCCCAGATGTGCTCACCGTGATCGTAGAAGATTTCAGTACATGGACCACAAGGACCTGTGTCACCCATTTGCCAGAAGTTATCTGACTCGTATGGTTTACCACCTTTCTTGTCGCCGATGCGAATAATGCGGTCTGCTGGAACGCCTACTTTTTTGTTCCAGATATCGAATGCTTCGTCGTCTGTCTCGTAAACTGTTACAAGTAGACGGTCTGCTGGCAGCTTAAGGGTTTCTGTTAGGAATTCCCATGCGAAAGAAATTGCGTCTTCCTTGAAGTAGTCACCAAAGCTGAAGTTACCTAGCATTTCAAAGAAAGTGTGGTGACGAGCTGTAAAGCCAACGTTTTCCAGGTCGTTGTGTTTACCACCAGCACGTACACAACGTTGAGCCGTAGTCACTCGAGTGTAGGCTCGTTTTTCTAAGCCTAAGAAACAATCTTTAAATTGGTTCATACCCGCGTTTGTGAACAGCAGGGTTGGGTCGTTATGTGGAACTAACGATGAACTTTCTACGATTTGGTGTCCTTTGCTCTCAAAGAACTTGAGGAACGCGTTACGAACCTCATCAGTGCTCATGTACATGCAGCTCTTCCTGAAAATAGTCGAGTTAGAATTTTGCCGTATTGTAGATCATGCAATCAGCTACGACTAGTTTTCTTGCGGAAAAGAGGCATTAGAAGAAAAAATTTGAAGCAAAACAAAAAACAGTGCGAACGACTGACCGTTAATCTTCGTCCTCAAAGCTCAGCGCGTAGCTAATTTGGTCAAAACTGTAGCCACGATATTGTAGAAATCGCACTTGTTTAGCGTACTCTTTCTGGTCTTTGGCTTTTATTCCTTTGAATTTCTTCTCTGCAGCCATTCTTGCCAGTTCAAACCAGTCTTGGGGCTCTTCTGCCATCGCCATATCGATGATGGATTCTGCAACACGTTTTTGGTTCAGTTCTTGACGAATGCGGCGCTCACCGTGACCTTTGTAAACATGCTGGCGCACTTGGCTTTTGGCGTAACGCAGGTCATCTAAATAGTTGTGGTCGAGACAAAAATTAATCGCAACTTCAATGTCTGCCTCTTCATAACCTTTTAGAGCCAGCTTTTGGTATAACTCGTATTGCCCATGGTCTCGACGGCTCAATAATTGGATAGCCGCTTCTTTGCTAGATAGAGCCGGCGCTTGACGCTTTTGGTACATGGTGATCCTAAAACTGTTTTTGCTGTACAGGGTTTATCTCAATCGTAGTAAATAACCGCTCACTCTAACCTACTAACACGCTAGTTCTGATCGCTTGCCTACCCTAGTTGATATAAGCCCAGCCTTTGTTTTTTCGATTTTTATTTCAGCAGATACAACAAAGCCCCGCATAGCAGGGCCTTGTTCAATATGACTCAGTAAGAAGATTAAAACTCTTCTTCTTGCTCTGATTTTTCACCTGATTCTGGTGCTTCAGGTAGTGCTGGTGAAAGTAGCATTTCACGCAGTTTAGTATCGATAGTCTTACCGATTTCTGGGTTTTCGCGTAGGTAGTTACAAGCGTTTGCTTTACCTTGACCAATCTTGTCGCCTTGGTAGCTGTACCATGCACCTGCTTTTTCAACTAGCTTGTGCTTCACGCCTAGGTCAATCAGCTCACCTTCACGGTTGAAACCTTGGCCGTACATGATTTGCGTGTTTGCTTCTTTAAACGGTGCAGCAATCTTGTTCTTAACAACTTTGATGCGCGTTTCGTTACCTACTACTTCGTCACCTTCTTTGATTGCGCCAGTACGACGGATATCTAGACGAACAGAAGCGTAGAACTTAAGTGCGTTACCACCAGTCGTTGTTTCTGGGTTACCGAACATCACACCAATCTTCATACGGATTTGGTTGATGAAGATACACATACAGTTAGATTGCTTTAGGTTACCCGTTAGCTTACGCATCGCTTGAGAAAGCATACGTGCTTGAAGGCCCATGTGGCTGTCGCCCATTTCGCCTTCAATTTCTGCTTTTGGTGTTAGTGCTGCAACTGAGTCGACAACCATAACGTCGATAGCACCAGAACGTGCAAGTGCGTCACAGATCTCTAGCGCTTGCTCACCTGTATCAGGCTGAGAAACCAATAGTGCATCGATATCAACGCCAAGTTTCTTCGCGTATACAGGGTCTAGCGCGTGCTCTGCATCGATAAATGCACAAGTTTTACCTTCACGTTGCGCAGCAGCGATCAGCTCAAGAGTTAGCGTTGTTTTACCTGAAGATTCTGGACCGTAGATTTCTACGATACGACCCATTGGTAGACCACCAGCACCCAAAGCGATATCAAGAGAAAGAGAACCCGTTGAGATGGTTTCTACGTCCATTGCGCGGTTATCACCAAGGCGCATGATAGAGCCTTTACCGAATTGCTTTTCAATTTGACCTAGCGCAGCGGCGAGCGCTTTCTGTTTGTTCTCGTCCATTACTTTCTCCGATTTATTCATCTTGTTTTGGGCGACGAATAGTAATTCATTTTTCTTAAGCAGAAGAACTGCTCAGCAATGAAAGCCATTATACTGTTGATTCATACAGTGTCTATACCTGTATGAAAAAAATTTGACTGACCTAAAGTCATGATTTGCTCTAGCCCTTATCAAATCTGGGCTAAGGCAATTTCACAAATTCTAAATCAGATGTTGATACAATTGAGTCAGTGCGTGTTCGACCGCTTTTTTGCGAACTTGAGCACGGTCACCAGTGAAATGCATGGTATCCACTTGCTGCCAGTTTTGGTTATCAGCGAAGGCAAAACAGACCGTACCGACAGGTTTCTCTTCACTGCCTCCTCCTGGTCCAGCAATACCACTGATGGACACGCCGATAGTAGCATTCGAATTACGAATGGCACCTTGTACCATTTCAATCACGACAGGTTCGCTCACCGCACCATGTGCCTCCAATGTTGAAGCTTGTACACCAAGCATCTCCATTTTGGCTTCATTACTGTAAGTGACAAACGCCCGGTCAAGCCACGCAGAACTGCCCGCAATATCAGTAATAGCAGTTGCAACACCGCCACCCGTGCATGATTCTGCCGTCGTTAAAACATGTCCATGCTTAGCCAATAATGCACCTAAATCGGCACTCAATTTTGTTGCTCGTTCCATAGCCAAAAATGCCCTTTGTCTCTTTTTACTTGATAGGGATTCACGTATCCTAAGCCGCTACGGAAATAAACAAAAGTACTTACCTGTGAAAGCTGAACAAAAACACACTCCGATGATGCAGCAATATCTCAAGCTAAAGGCTGAGAACCCAGAAATCTTGCTGTTTTACCGCATGGGCGACTTCTACGAACTCTTCTACGACGATGCTAAACGCGCATCTCAATTGCTTGATATTTCGTTGACTAAACGTGGTGCATCGGCAGGTGAACCTATCCCGATGGCTGGCGTTCCATTCCACGCCGTTGAAGGTTATCTAGCCAAACTTGTTCAGCTTGGTGAGTCTGTCGCGATCTGTGAACAGATTGGCGATCCTGCAACAAGCAAAGGTCCAGTTGAACGTAAAGTAGTACGCATTGTTACACCAGGTACAGTAACCGACGAAGCACTGCTGTCAGAGCGTATCGACAACCTGATTGCGTCTATTTACCATCACAATGGTAAGTTCGGTTACGCGACATTGGATGTGACTTCGGGTCGCTTCCAATTAGTTGAGCCTGAAACCGAAGAAGCAATGGCTGCTGAACTGCAACGTACTGCACCTCGTGAATTACTCTTCCCTGAAGATTTTGAACCCGTACATCTGATGTCGAACCGTAACGGTAACCGTCGTCGTCCTGTCTGGGAATTTGAGCTAGATACCGCTAAGCAGCAGTTGAACCAACAATTTGGTACGCGCGACTTGGTAGGCTTTGGGGTTGAGCACGCTTCACTGGGTCTGTGTGCTGCGGGTTGTTTGATCCAATACGTAAAAGATACTCAGCGTACTGCCCTACCACACATTCGTTCATTGACGTTTGATCGCCAAGATCACTCAGTCATTCTTGATGCGGCAACACGTCGCAACTTAGAGTTGACGCAAAACCTATCTGGTGGGACCGATAACACGCTAGCCGAAGTACTCGATCACTGTGCGACCCCAATGGGCAGCCGTATGCTGAAGCGCTGGTTACATCAACCAATGCGCTGTGTCGATACTCTTAACAACCGTCTTGATGCAATTGGTGAGATCAAAGATCAAAGCTTGTTCACCGACATTCAACCAATCTTCAAACAAATTGGTGACATCGAGCGTATCCTAGCCCGTTTGGCGCTGCGCTCAGCACGTCCGCGCGATATGGCTCGTCTACGCCATGCGATGCAACAGTTGCCAGAGTTGGAAGCAGTCACTTCTTCTCTAGCTCACCCATACCTTAAAAAGTTGGCTCAATTTGCTGCGCCAATGGATGAGGTATGCGACTTACTTGAGCGCGCAATCAAAGAAAACCCACCAGTGGTTATTCGCGAGGGAGGCGTGATTGCAGAAGGCTACAATGCGGAACTTGATGAATGGCGAAAGCTTGCAGACGGCGCAACCGAGTATCTTGAAAAACTCGAAGCAGACGAGCGTGAACGTCATGGTATCGACACCCTAAAAGTTGGCTACAACGCTGTACACGGTTTCTTCATTCAGGTTAGCCGTGGTCAAAGCCACTTGGTTCCACCACACTATGTACGTCGCCAAACACTGAAAAATGCAGAACGCTATATCATTCCAGAGCTGAAAGAGCACGAAGATAAAGTTCTGAATTCAAAATCAAAAGCGCTAGCAGTTGAGAAAAAACTGTGGGAAGAGCTATTTGATTTGTTGATGCCAAACCTTGAGAAGATGCAAAACCTTGCTTCGGCGATCTCTCAGTTGGACGTACTACAAAACCTAGCCGAGCGTGCAGATTCACTTGATTACTGTCGCCCAACATTGGACAAAGAAGCGGGCATTAGCATTCAAGCGGGTCGTCATCCTGTCGTGGAGCAAGTAACAAGCGAACCATTTATAGCTAACCCAATTGAACTGAGCTCAGATCGTAAGATGTTGATCATCACCGGTCCAAACATGGGTGGTAAATCAACCTACATGCGCCAGACTGCATTGATTGCTTTGATGGCTCATATTGGTTCATACGTTCCTGCAGAATCAGCTCATATTGGCTCTCTAGACCGCATCTTTACTCGTATCGGTGCATCTGATGATTTGGCATCTGGCCGTTCAACCTTCATGGTAGAAATGACTGAAACCGCCAACATTCTTCACAACGCCACTAAGAACAGCTTGGTGCTGATGGATGAAATTGGTCGCGGTACCAGTACTTACGATGGTCTGTCTTTGGCATGGGCAAGTGCAGAATGGCTTGCAACTCAGATTGGGGCAATGACTCTGTTCGCAACGCACTACTTCGAACTGACTGAGCTGCCAAACTTGCTGCCAAATCTCGCTAACGTTCACCTTGATGCGGTAGAACACGGTGACAGCATTGCCTTTATGCATGCCGTTCAAGAAGGCGCGGCAAGCAAGTCTTATGGTCTAGCGGTTGCTGGTTTAGCGGGCGTTCCAAAACCAGTCATCAAGAATGCACGTCACAAACTGTCGCAGCTTGAACAATTAGGTCAAGGTAATGACTCACCTCGTCCGAGTACCGTTGATGTGGCAAACCAACTCAGCCTGATCCCTGAACCTAGTGATGTTGAACAAGCATTGTCGAATATCGATCCTGATGATTTAACGCCGCGTCAGGCACTGGAAGAGCTATACCGCTTGAAGAAGATGCTCTAGTTCTTGGGCTAACTGGTACAGTAAAACGCAAAAAGGCTATGGTCACCCATAGCCTTTTGTTTATTCTGATCTAGCCTCTATCGCTGCCTACGTTTAGTCGTCTTCTACGTTGAACAAGTTTTCCATGTTCAGACCTTGTTTAATCAAGATCTCGCGTAGTCGACGTAAGCCTTCCACTTGGATCTGGCGAACACGTTCGCGTGTTAAACCAATTTCACGTCCCACTTCTTCAAGCGTTGATGGCTCATAACCAAGCAGACCAAAACGACGTGCCAGCACTTCTTTCTGTTTTGGATTCAGCTCTTCTAGCCAATGGATCAATGAAGATTTGATGTCATCGTCTTGAGTGGAAACTTCAGGATCCGAGTTGTTTGCATCAGGAATAATATCCAGTAACGCTTTCTCGCCGTCACCACCGATTGGTGTATCGACAGAGCTAATACGCTCGTTAAGGCGAAGCATTTTGCTTACGTCTTCTACTGGGATATCTAACTGTGCAGCAATCTCTTCTGCTGTTGGTTCATGGTCGAGCTTCTGAGAAAGTTCACGCGCTGTACGCAGATAAATGTTCAGCTCTTTAACCACATGAATTGGCAGACGGATTGTACGAGTTTGGTTCATCAACGCACGTTCGATCGTTTGGCGAATCCACCACGTCGCGTAAGTTGAGAAACGGAAGCCTCGTTCTGGATCAAACTTCTCGACCGCACGGATCAAACCTAGGTTACCTTCTTCAATAAGATCAAGAAGCGCAAGACCACGATTGCTGTATCGGCGAGAGATCTTCACTACAAGACGCAAGTTACTTTCGATCATGCGTTTGCGAGCGGCTTCATCGCCACGCAGTGCACGTCGTGCATAAAGTACTTCTTCTTCTGCAGTAAGTAGAGGTGAGAAACCAATTTCACCTAGGTAAAGCTGAGTCGCGTCTAGGCTCTTGCTGCTCGCATCAAACTCTTCGCGTGCTGCTGTTTTGGTTCCGTTAGATGGTTTTTCAAGTCCGTTAGAAATGCCTTCCGATTGTGCATTATCATATTCGAACTCTTCAACTTTGGTTACTGTGTTGCTGATACTCATAACGCCTCCCCCTGGCGAGTTAGCAAGACATGACAACTCAATATGTCGCTAAATGATGTCGCAAGTTGTTATTCAAAGTTCTACGGTAAATAGCGCTTTGGATTCACTGATTTACCTTGATAGCGAATTTCAAAGTGAAGTTTGACGGATTTAGAACCAGAACTACCCATGGTCGCTATTTTTTGTCCGCTCTTAACACTTTGTCCTTCTGAGACCAGCAATCTGTCATTGTGTGCGTATGCGCTTAAATAATTGTCGTTATGCTTCACTATAATTAGGTTGCCATAACCTCGTAGCGCATTACCTGAGTACACGACTGTGCCGGCTGCGGTTGAAACGATTGGCTGACCACGCTGTCCTGCGATGTCTATGCCTTTATTTCCTTGTTCTCCCGCTGAGAAGTTCTTGATTACTCTCCCTTTTGTTGGCCATAACCACTTCGATACTTTATCGTTTTTCGCGGTAGTTACTGGCGGTTTCGGTTTAACATTCTGATTACCTTTTGACCCAACATACTCCTTTGGTTTGGATTGTTCAACCTTCTTTGGAGGCTGTTTTTGTGCAACTTGAGTCTTAGTCGGTTTAGGTTTTTGAGTAGAGCTTTTACTTGAATTTGACGGCTTTGACGCTGTTGATGTTTTAGTCACTGGAACAGGTTTCGCCGCAACTACAGGAGCAACAACGGGTTCCACTTTATGACCATATTTAGGCGCAACATATTTCGGAGCCCAAAGTTTTAACTTCTGCCCCGGGAAAATCGTGTAAGGCGCAGATAATTCATTGTAACGAACAAGATCATTTACATCCTTATCTGTGACGTAGGCAATAAAATAAAGGGTATCACCTTTCTTTACTTCGTAATAACTCCCTCTATAACTACCACGCTCAACCGTCGAGTAATCTTTATTTAGGCTTGAAACAGGTGCAGGTGAATGAGCCGCGCAACCAACCAGTCCCGCAGCCAATAAACAGCTAATACCAAAACGACGTAATAACTTCTTCACCTAGTCTTTAACCCTTAAGCTAAATCACCTGCAACTAATGGCACAAATCTTACCATTTCTACCACGGTCGACAGATATTGTTCACCCTGACGTTCAATTTTTAATAGTTGTTGTTCATCTTCACCAACAGGAATGATCATCTTACCGCCCTCTTTCAGCTGGGATAGAAGTGCTTGGGGAACACTTTCCGCTGCCGCAGTGACAATAATCGCATCGAATGGACCTTTCGCTTCCCATCCAAGCCAACCATCTCCATGTTTGGTAGAGACGTTATAAATATCCAGTTGTTTCAGACGACGCTTAGCATCCCATTGCAAAGACTTAATGCGTTCAACGGAATAGACATGATCGACAATTTGCGCAAGCACTGCAGTTTGGTAACCCGAGCCCGTGCCGATTTCTAAAACATTACTGTCCGATTTTAGATCAAGCAGCTCTGTCATTTTTGCAACGATATAAGGCTGAGAAATAGTTTGTCCCTGACCAATCGGCAGCGCATTGTTGTCGTACGCTTGGTGCATCATTGCTTGTGAGACAAAGCTCTCACGCGGTAATCGATAAATCGCATCCAACACACGTTGATCACTGATGCCACTGGAGATAAGAAACGCTATCAACCTGTCAGCATGTGGGTTACTCATTAACTCTCTTCCTTTAACCAACTGTCCATTGCGCGTAGCGACTCATGAGCGGTGAGATCGACTTGCAGCGGCGTGATTGAAACAAAACCATGTTCAATGGCATAAAAGTCCGTACCGATGCCAGCATCTTGCTCTTTACCCGGAGGACCAAGCCAATAAATATCATGTCCGCGTGGATCTTTCTGCTTAATCATATTCTCAGCGTGATGACGCGCACCCAAGCGAGTCACTTCTATCTCACCAAGTTCTTCAAACGGAAGATCAGGGACGTTTACATTCAATAGACGGTTGGTTGGAATTGGGCGAATCAGATGCTGCTCAACCAACTGGCGAGCAATTTTAGCAGCAGATTCAAAATGTTGTTTACCGACTAGAGAGAAAGCAATCGCCTGCACGCCAAGGAAGTGTCCTTCCATAGCAGCAGCCACCGTGCCGGAGTACAAAACGTCATCACCTAGATTTGCGCCATGATTGATGCCTGACAGCACAAGGTCTGGCAAATCATCTTTCATCAACTCATTCAGCGCAAAGTGTACACAGTCTGTTGGCGTACCTTGCACAGAGAATGTCTTTGGCGCAATCTCGCTGACACGCAGCGGTTGCTCCAGCGTCAATGAGTTAGATGCACCAGAGCGGTTTCTGTCTGGTGCGACAATCGTTACTTCGGCAATGCTTCTCAGCTCATCAGCAAGCGCATGAATTCCTTGAGCATGAACACCATCATCATTGCTGAGCAGTATTCTTAATGGCTTCGCGTTGTGTGAATCCATTTCCATTAATATTCTCTTTCTACCTTTTCTTCGATGACCAGCTCACGCACGATTGAGGTCGCAAATGAACCCGCATCCAGCGAGAATGTCATCGTGATGTTATTACCATCGACCGTCCAAGCTAGGTCTTGTGGCTTGAGTGCGATATCGCGGCGGTCGTGACGCATACGGTTACCGCGAATCAGTGCCATCAAATCTGGTTCTTCATCTAAGAACGGTTGTTCTAAGGTTAGCGCATCCGCTTGAGTTGGTAGTGCGTTATCGCCCGCCATTGCCGCAGTGATCGCCACTTCGCCTTGAGCAAATTGTGCTTGTAACTCGGCGATGTTGCTTGTATCAACTAACTGAGTGCCAGAAGCTGTCTGCGCAACATCACCGTCGATGAACTTGTCAAATACACCATTCTCTAGGCGAGCTGAAACAATGCGGTTGAAGATCCACGAACGAGCCGCTGACAGATACATGCTGCGTTTGTTTTGATTGCGGGTACGAACGTTCTCGCGTCCCCATCGACGTGCTTCTTCAAGGTTGTTGCCATCGTTACCAAAACGCTGACTACCAAAGTAGTTTGGCACACCCACTTGTTTCACTTTTTCAAGACGTTGTTCAACATCCGCTATATCTGTCACTTCAGACAAAGTAACCGCGAACTCGTTCCCTACTAAGTCACCAGGGCGCAGTTTTTTATTGTGACGATCCGTTGCTAGGATCTCGATGCTCGGGTATTGCGCCAAAAATGCTGAAAAATCTGGCGTTTCACCTTTTGGTAAATGCACACTTAGCCACTGTTCAGTCACAGCATGACGGTCTTTAAGACCCGCCCAACTAACGTCTTTCGACTTAACACCACAGGCTTTTGCTAGTTCGTTAGCAACGAAACTGGTGTTCTCACCTGTTTTACGGATACGAACCATTAAATGCTCGCCTTCACCCGTAAAGGCAAAACCTAAATCTTCGCGAACCTGAAAGTGCTCTGGCTGCGCTTTGATTTTTGCAGATGCAACTGGCTTACCTGTTAGGTAAGCCAATGAAGATAAAATGTCTGACATGATGTATTTCCGCTGATATCACAGCTTAGAGGAGCAAATTGCTAATTATTGCTTAAACAGAAGTACAACTGCTTCCGTTGCAATGCCTTCTTTGCGTCCAGTAAAACCAAGACGTTCTGTCGTCGTTGCTTTTACGTTAATGTTGCTAATGTCGGTTTCAAGGTCTTCGGCAATAGCAGCACACATTGCGTCAATGTGCGGTGCCATCTTCGGCGCTTGTGCCATGATAGTGACATCGGCATTACCAAGACGGTAGCCTTGCTCTTTGACTCGGCGATAAACGTCTTTCAGTAGTTCACGACTGTCTGCACCTTTCCATTTGTCGTCCGTATCAGGGAAGTGACGACCAATATCGCCTGCAGCAATAGCGCCAAGCAGTGCATCCGTTAACGCATGCAGCGCTACATCACCATCAGAGTGTGCAATTAAACCTTGTTCGTATGGGATCGCTACGCCCCCAATAATTACCGGACCTTCACCACCGAATTTATGTACGTCGAAACCATGACCAATTCGAATCATTTTCTATCCTTTCTTTATTCGCCGCGCTCACGGCTTAGATAAAACTCGGCTAACGCAAGATCTTCTGGCTGAGTAATTTTAATATTATTCGCATTCCCCTGAACCAAAGCCGGCTTCTCGCCCAACCATTCCAAAGCCGAAGCTTCATCAGTAATGGTTGCACCTTGTTCTAATGCGTCAGTCAGTGCTTGAGTAAGCTGCTGAGTTTTAAACATTTGTGGAGTTAATGCATGCCACAATGCTTCTCTATCTACCGTATGATCGATATCTTGTGCTGCGTTTGCTCGCTTCATTGTGTCTCTCACTGGAGATGCGAGGATGCCACCAGGTGGGTGCGCGCTACAAACGTCGATCAAATGGTCAATATCGGATGTTAATACACACGGTCTTGCAGCGTCGTGGACTAATACCCACTCGCTTTCCAGATGTGTACTTACATAGTTTAGCCCTGATAACACAGAGTCTGCCCTTTCTTTACCACCAGCCACGCGAATCACATCAGGGTGCTGAGCAATGGAGAGTTCAGGGTAGTATGGGTCGCCGTCGGTAATAGCCACGACAACCTTAGAAATTTTTGGGTGAGAAAGCAGTTTCTCTACGGTATGTTCCAGTACCGCTTTGCCGTCAATCAACAGGTATTGTTTTGGGCGATCTGCCTTCATGCGCCTGCCAACACCTGCGGCTGGCACAATAGCAATATGGGTTGGAGTGTTAGTATCCAACATATTAATGCTCTTCCTCATCGACAATTCGGTAGAAGGTTTCGCCATCTTTAACCAAACCAAGTTCATGACGTGCACGTTCTTCGATTGCATCAAGACCTTGTCGTAAGTCATCAATCTCAGCGAACATCTCGTTATTACGAGCTTGCAGCTTGCTGTTTACTTGCTGCTGCACTTCAATTTCATCTTCTACTGTGTAGTAGTCAGACACCCCGTTTTTGCCAAACCAGAGGGTATATTGAAGCCAGCCAAACAGTAACGTAAGGGCTATAACAAAGATTCGCATATCGTTGCTAACATTCTGTACCGGAAGGGATTAGATAAGGTGACATATATAGCATAAATGCGTGGTTGGCTCTAGGGTAGAAAAAGAGACTCACACGCTCCTTACAGGAAAAAAGCAGCAAAATGAAACAAAGAGATTCCCTACTCTCTCCTTCGTCGTTCTAGGGAATGACAAATCAACGTAAGCGAAAGACATCCAAAAGCAAAGCCTGTTCTCATGTATCCAAGTGAAGTGCTCTCGAAGAGCCAAGTCCGCTCCTACCTTTCGAAGAGAAGCGCTCCCGAAGGGCGAAGCCCGTTCGTGCCTCTCGAAGCATAGCGTTCTCATATGTTAGATACAAAAACGCCCCGCAAGTGCGAGGTGTTTTTAAAAGCTTTAAGTTAAGCCGTTAACTCAGTTAAGAATTAAGCCTGGCCTTTAACTTCTTTGAGACCGCTTCCTTGAAACATGGAGGAGCGCGCTCACTTAGAGCTTCCGTCATATAAAAAAACGCCCCGCAAGTGCGAGGCGTTTTCAAAAGCTTTAAGTTAAGCTGTTAATTCAGTTAAGAATTAAGCTTGACCTTTAACTTCTTTGAGACCGCTTCCTTGAAACATGGAGGAGCGCGCTCACTTAGAGCTTCCATCATATAAAAAAACGCCCCGCAAGTGCGAGGCGTTTTCAAAAGCTTTAGGTTAAGCTGTTAATTCAGTTAAGAATTAAGCTTGACCTTTAACTTCTTTGAGGCCGCTTCCTTGAAACATGGAGGAGCGCGCTCACTTAGAGCTTCCGTCATATAAAAAAACGCCCCGCAAGTGCGAGGCGTTTTCAAAAGCTTTAGGTTAAGCTGTTAATTCAGTTAAGAATTAAGCTTGACCTTTAACTTCTTTAAGACCGTTGAAAGGAGCGCGCTCACCTAGAGCTTCCTCGATACGGATTAGTTGGTTGTACTTAGCAACACGGTCAGAACGGCTCATAGAACCAGTTTTGATTTGACCTGCAGCTGTACCTACCGCTAGGTCAGCGATAGTTGCATCTTCAGTTTCGCCAGAACGGTGAGAGATTACTGCTGTGTAACCTGCGTCTTTAGCCATCTTGATTGCAGCTAGAGTCTCAGTTAGAGAACCGATTTGGTTGAACTTGATAAGGATAGAGTTAGCTACGCCTTTCTCGATACCTTCAGCAAGGATCTTAGTGTTAGTAACGAATAGGTCGTCACCTACTAGTTGAAGCTTGTCACCTAGTAGTTCAGTTTGGTGCTTGAAGCCAGCCCAATCAGACTCGTCTAGACCGTCTTCGATAGAAACGATTGGGAATTGGTTAGCTAGCTCAGCTAGGTAGTGGTTGAACTCTTCAGAAGTGAAAGTTTTACCTTCGCCTTTCATGTTGTAGATGCCAGCTTCTTTGTCGAAGAACTCAGATGCTGCACAGTCCATAGCTAGAGTAACGTCTTTACCTAGTTCGTAACCAGCAGCTGCAACAGCTTCTGCGATAACTTCTAGAGCTTCAGCGTTAGACTTAAGGTTAGGAGCGAAACCACCTTCGTCACCAACTGCAGTGCTGTAGCCTTTAGACTTAAGAACTTTAGCTAGGTTGTGGAATACTTCAGCACCGATACGTAGACCTTCTTTAAGAGTCTTAGCACCAACTGGTTGGATCATGAACTCTTGGATGTCAACGTTGTTGTCTGCGTGCTCACCACCGTTGATGATGTTCATCATTGGTAGAGGCATAGAGAACTGACCAGCTGTGCCGTTTAGCTCAGCGATGTGCTCGTATAGAGGCATGCCTTTAGCCGCTGCAGCAGCTTTAGCGTTTGCTAGAGAAACAGCTAGGATAGCGTTCGCACCGAACTTAGATTTGTTTTCAGTGCCGTCTAGTTCGATCATTACTGCGTCGATTGCAGCTTGGTCTTTAGCGTCTTTGCCAACTAGAGCTTCAGCGATTGCGCCGTTTACAGCTTCAACAGCTTTAAGAACACCTTTACCTAGGAAACGTGCTTTGTCACCGTCACGTAGCTCAAGAGCTTCGCGAGAACCAGTAGATGCGCCAGATGGAGCTGCCGCCATACCTACGAAACCGCCTTCTAGGTGTACTTCAGCTTCTACAGTTGGGTTACCACGTGAGTCGATGATTTCACGACCTAGAACTTTAACGATCTTAGACATTAATGTTTCCTCTCGTTCAAATATAAATGTCAATTTTAAAGGGCAGCAGCACAACCTTCGCCGCCGCCCGTATCCTTTTACTTCTCGAATTCGCCGCGAGCGTTTTGACCAGCAGCTTTAACGAAACCTGCGAATAGTGGGTGACCATCGCGAGGAGTCGAGGTGAATTCTGGGTGGAATTGAGCCGCTACAAACCATGGGTGAGCTGGGTTCTCAATCATTTCCACTAGTTTCTTGTCCGCTGATAGACCAGATACTTTCAGACCTGCTTTTTCGATTTGCGGACGAAGAACGTTGTTTACTTCGTAACGGTGACGGTGACGTTCGTGAACTGTCGCGCTACCGTATAGTTCACGAGCTTTCGTCCCTTTCTCTAGGTGACATAGCTGTGAACCAAGACGCATCGTACCACCTAGGTCAGACGTTTCAGTACGTTCTTCAACTTTACCTTCGCCGTCTACCCATTCAGTGATCAAACCTACCACAGGGTACTTAGTGTCTTTGTTAAATTCTGTTGAATGTGCACCTTCCATACCCGCTACGTTGCGCGCGTATTCGATCAGTGCTACTTGCATGCCTAGACAGATACCTAGGTAAGGTACTTTGTTTTCACGAGCGTATTGTGCTGCGCGAATCTTACCTTCGATACCACGGTCACCGAAGCCGCCAGGTACTAGGATTGCATCTAGACCTTCAAGTACTTCTACGCCTTTCGTCTCAACATCTTGAGAGTCTACGTATTTAATTGTGACGTTTAGACGGTTTTTCAAGCCTGCGTGTTTAAGTGCTTCGTTTACTGACTTGTAAGCATCTGGTAGTTCGATGTATTTACCGACCATACCGATAGTTACTTCGCCAGTTGGGTTCGCTTCTTCATAGATAACCTGTTCCCATTCAGACAGGTCAGCTTCTGGCGCGTTGATACCAAAGCGAGCACAAACTAGGTCATCTAGACCTTGAGACTTAACAAGTTGAGGGATCTTGTAGATTGAATCTACGTCCTTCATTGAGATTACCGCTTTCTCAGGAACGTTACAGAATAGCGCAATTTTCTTACGCTCGTTCGCTGGGATCATACGATCAGAGCGGCAAACAAGAATATCTGGTTGGATACCGATAGATAGCAGCTCTTTAACAGAGTGCTGAGTTGGCTTAGTTTTCACTTCACCTGCAGCTGCTAGGTAAGGAACTAGCGTTAGGTGCATGAACATTGCACGTTCACGACCTAGTTCTACAGCAAGCTGACGGATAGCTTCCATAAATGGTAGTGATTCGATATCACCTACCGTACCACCGACTTCAACGATAGCAACATCATGGCCTTCAGAACCTGCGATTACGCGGTCTTTGATAGCGTTAGTGATGTGAGGGATAACCTGAATGGTTGCACCTAGGTAATCGCCGCGACGTTCTTTACGTAGTACGTCTGCATAAACACGACCAGCAGTGAAGTTGTTACGCTTAGTCATCTTGGTGCGGATGAAACGCTCGTAGTGACCAAGGTCAAGGTCAGTCTCTGCGCCATCTTCCGTAACGAACACTTCACCGTGTTGAGTTGGGCTCATTGTGCCTGGATCAACGTTGATGTAAGGGTCAAGCTTCATCATAGTCACTTTAAGACCACGAGCTTCTAGAATAGCTGCAAGAGATGCTGCTGCAATACCTTTACCTAGAGAGGATACAACCCCGCCAGTAACAAAAATGTAATTTGTCGTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP018680|2064667:2082377|2073598_2074522_-|WP_009698767.1|DBSCAN-SWA MKKLLRRFGISCLLAAGLVGCAAHSPAPVSSLNKDYSTVERGSYRGSYYEVKKGDTLYFIAYVTDKDVNDLVRYNELSAPYTIFPGQKLKLWAPKYVAPKYGHKVEPVVAPVVAAKPVPVTKTSTASKPSNSSKSSTQKPKPTKTQVAQKQPPKKVEQSKPKEYVGSKGNQNVKPKPPVTTAKNDKVSKWLWPTKGRVIKNFSAGEQGNKGIDIAGQRGQPIVSTAAGTVVYSGNALRGYGNLIIVKHNDNYLSAYAHNDRLLVSEGQSVKSGQKIATMGSSGSKSVKLHFEIRYQGKSVNPKRYLP >NZ_CP018680|2064667:2082377|2069879_2072441_+|WP_009698766.1|DBSCAN-SWA MKAEQKHTPMMQQYLKLKAENPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGVPFHAVEGYLAKLVQLGESVAICEQIGDPATSKGPVERKVVRIVTPGTVTDEALLSERIDNLIASIYHHNGKFGYATLDVTSGRFQLVEPETEEAMAAELQRTAPRELLFPEDFEPVHLMSNRNGNRRRPVWEFELDTAKQQLNQQFGTRDLVGFGVEHASLGLCAAGCLIQYVKDTQRTALPHIRSLTFDRQDHSVILDAATRRNLELTQNLSGGTDNTLAEVLDHCATPMGSRMLKRWLHQPMRCVDTLNNRLDAIGEIKDQSLFTDIQPIFKQIGDIERILARLALRSARPRDMARLRHAMQQLPELEAVTSSLAHPYLKKLAQFAAPMDEVCDLLERAIKENPPVVIREGGVIAEGYNAELDEWRKLADGATEYLEKLEADERERHGIDTLKVGYNAVHGFFIQVSRGQSHLVPPHYVRRQTLKNAERYIIPELKEHEDKVLNSKSKALAVEKKLWEELFDLLMPNLEKMQNLASAISQLDVLQNLAERADSLDYCRPTLDKEAGISIQAGRHPVVEQVTSEPFIANPIELSSDRKMLIITGPNMGGKSTYMRQTALIALMAHIGSYVPAESAHIGSLDRIFTRIGASDDLASGRSTFMVEMTETANILHNATKNSLVLMDEIGRGTSTYDGLSLAWASAEWLATQIGAMTLFATHYFELTELPNLLPNLANVHLDAVEHGDSIAFMHAVQEGAASKSYGLAVAGLAGVPKPVIKNARHKLSQLEQLGQGNDSPRPSTVDVANQLSLIPEPSDVEQALSNIDPDDLTPRQALEELYRLKKML >NZ_CP018680|2064667:2082377|2069311_2069794_-|WP_009703950.1|DBSCAN-SWA MERATKLSADLGALLAKHGHVLTTAESCTGGGVATAITDIAGSSAWLDRAFVTYSNEAKMEMLGVQASTLEAHGAVSEPVVIEMVQGAIRNSNATIGVSISGIAGPGGGSEEKPVGTVCFAFADNQNWQQVDTMHFTGDRAQVRKKAVEHALTQLYQHLI >NZ_CP018680|2064667:2082377|2080736_2082377_-|WP_005449808.1|DBSCAN-SWA MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLKVTMMKLDPYINVDPGTMSPTQHGEVFVTEDGAETDLDLGHYERFIRTKMTKRNNFTAGRVYADVLRKERRGDYLGATIQVIPHITNAIKDRVIAGSEGHDVAIVEVGGTVGDIESLPFMEAIRQLAVELGRERAMFMHLTLVPYLAAAGEVKTKPTQHSVKELLSIGIQPDILVCRSDRMIPANERKKIALFCNVPEKAVISMKDVDSIYKIPQLVKSQGLDDLVCARFGINAPEADLSEWEQVIYEEANPTGEVTIGMVGKYIELPDAYKSVNEALKHAGLKNRLNVTIKYVDSQDVETKGVEVLEGLDAILVPGGFGDRGIEGKIRAAQYARENKVPYLGICLGMQVALIEYARNVAGMEGAHSTEFNKDTKYPVVGLITEWVDGEGKVEERTETSDLGGTMRLGSQLCHLEKGTKARELYGSATVHERHRHRYEVNNVLRPQIEKAGLKVSGLSADKKLVEMIENPAHPWFVAAQFHPEFTSTPRDGHPLFAGFVKAAGQNARGEFEK >NZ_CP018680|2064667:2082377|2075162_2075939_-|WP_081384165.1|DBSCAN-SWA MEMDSHNAKPLRILLSNDDGVHAQGIHALADELRSIAEVTIVAPDRNRSGASNSLTLEQPLRVSEIAPKTFSVQGTPTDCVHFALNELMKDDLPDLVLSGINHGANLGDDVLYSGTVAAAMEGHFLGVQAIAFSLVGKQHFESAAKIARQLVEQHLIRPIPTNRLLNVNVPDLPFEELGEIEVTRLGARHHAENMIKQKDPRGHDIYWLGPPGKEQDAGIGTDFYAIEHGFVSITPLQVDLTAHESLRAMDSWLKEES >NZ_CP018680|2064667:2082377|2072534_2073518_-|WP_005452115.1|DBSCAN-SWA MSISNTVTKVEEFEYDNAQSEGISNGLEKPSNGTKTAAREEFDASSKSLDATQLYLGEIGFSPLLTAEEEVLYARRALRGDEAARKRMIESNLRLVVKISRRYSNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERALMNQTRTIRLPIHVVKELNIYLRTARELSQKLDHEPTAEEIAAQLDIPVEDVSKMLRLNERISSVDTPIGGDGEKALLDIIPDANNSDPEVSTQDDDIKSSLIHWLEELNPKQKEVLARRFGLLGYEPSTLEEVGREIGLTRERVRQIQVEGLRRLREILIKQGLNMENLFNVEDD >NZ_CP018680|2064667:2082377|2077518_2078229_-|WP_074050820.1|DBSCAN-SWA MLDTNTPTHIAIVPAAGVGRRMKADRPKQYLLIDGKAVLEHTVEKLLSHPKISKVVVAITDGDPYYPELSIAQHPDVIRVAGGKERADSVLSGLNYVSTHLESEWVLVHDAARPCVLTSDIDHLIDVCSAHPPGGILASPVRDTMKRANAAQDIDHTVDREALWHALTPQMFKTQQLTQALTDALEQGATITDEASALEWLGEKPALVQGNANNIKITQPEDLALAEFYLSRERGE >NZ_CP018680|2064667:2082377|2068066_2069110_-|WP_005436413.1|DBSCAN-SWA MDENKQKALAAALGQIEKQFGKGSIMRLGDNRAMDVETISTGSLSLDIALGAGGLPMGRIVEIYGPESSGKTTLTLELIAAAQREGKTCAFIDAEHALDPVYAKKLGVDIDALLVSQPDTGEQALEICDALARSGAIDVMVVDSVAALTPKAEIEGEMGDSHMGLQARMLSQAMRKLTGNLKQSNCMCIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRTGAIKEGDEVVGNETRIKVVKNKIAAPFKEANTQIMYGQGFNREGELIDLGVKHKLVEKAGAWYSYQGDKIGQGKANACNYLRENPEIGKTIDTKLREMLLSPALPEAPESGEKSEQEEEF >NZ_CP018680|2064667:2082377|2079357_2080659_-|WP_005425372.1|DBSCAN-SWA MSKIVKVLGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKARFLGKGVLKAVEAVNGAIAEALVGKDAKDQAAIDAVMIELDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTAGQFSMPLPMMNIINGGEHADNNVDIQEFMIQPVGAKTLKEGLRIGAEVFHNLAKVLKSKGYSTAVGDEGGFAPNLKSNAEALEVIAEAVAAAGYELGKDVTLAMDCAASEFFDKEAGIYNMKGEGKTFTSEEFNHYLAELANQFPIVSIEDGLDESDWAGFKHQTELLGDKLQLVGDDLFVTNTKILAEGIEKGVANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGERAPFNGLKEVKGQA >NZ_CP018680|2064667:2082377|2077027_2077504_-|WP_005425365.1|DBSCAN-SWA MIRIGHGFDVHKFGGEGPVIIGGVAIPYEQGLIAHSDGDVALHALTDALLGAIAAGDIGRHFPDTDDKWKGADSRELLKDVYRRVKEQGYRLGNADVTIMAQAPKMAPHIDAMCAAIAEDLETDISNINVKATTTERLGFTGRKEGIATEAVVLLFKQ >NZ_CP018680|2064667:2082377|2064667_2067250_-|WP_074050818.1|tRNA|DBSCAN-SWA MYMSTDEVRNAFLKFFESKGHQIVESSSLVPHNDPTLLFTNAGMNQFKDCFLGLEKRAYTRVTTAQRCVRAGGKHNDLENVGFTARHHTFFEMLGNFSFGDYFKEDAISFAWEFLTETLKLPADRLLVTVYETDDEAFDIWNKKVGVPADRIIRIGDKKGGKPYESDNFWQMGDTGPCGPCTEIFYDHGEHIWGGRPGTPEEDGDRFIEIWNNVFMQFNRHADGTMEPLPKPSVDTGMGIERISAIMQGVHSNYEIDVFQALIKAAAEVIGYEDLSNQSLRVIADHIRSCSFLIVDGVMPSNEGRGYVLRRIIRRAVRHGNKLGAQGAFFHKLVGVLAEIMGTAGEELKRQQAVVEKVLRIEEENFGRTLERGMAILNEALDNLDGKVLDGETVFKLYDTYGFPADLTNDVAREREFAIDEEGFEKAMEEQRQRAREAGNFGTDYNAAIKVDTQTEFCGYAGTKGSSSVAAMFVEGNEVESLSAGDKAIIVLGETPFYAESGGQCGDAGEIRTESGVFRVEDTQKLGNAIAHHGVMAEGVLAKGDEVATIVDAERRAAISLNHSATHLLHAALRQVLGEHVTQKGSLVKADSLRFDFSHLEAVTAAELKEVERLVNAQIRRNHTIETNVMDIESAKKKGAMALFGEKYDDEVRVLSMGDFSTELCGGIHASNTGDIGLFKITSEGGIAAGIRRIEAVTGEGALDAIEAQAAKYEEKLAESAQKAKSLEKEIQKLKDKMAAAESANIMGKVQDINGTKVLIAALEGADNKNLRTMVDDIKNQVGSGVILLANVNDDKIGLIAGVTKDLIGKVKAGDLVKMVAEQVGGKGGGRPDMAQAGGTDVAALPAAIQSVQPWLEERL >NZ_CP018680|2064667:2082377|2075938_2076982_-|WP_074050819.1|tRNA|DBSCAN-SWA MSDILSSLAYLTGKPVASAKIKAQPEHFQVREDLGFAFTGEGEHLMVRIRKTGENTSFVANELAKACGVKSKDVSWAGLKDRHAVTEQWLSVHLPKGETPDFSAFLAQYPSIEILATDRHNKKLRPGDLVGNEFAVTLSEVTDIADVEQRLEKVKQVGVPNYFGSQRFGNDGNNLEEARRWGRENVRTRNQNKRSMYLSAARSWIFNRIVSARLENGVFDKFIDGDVAQTASGTQLVDTSNIAELQAQFAQGEVAITAAMAGDNALPTQADALTLEQPFLDEEPDLMALIRGNRMRHDRRDIALKPQDLAWTVDGNNITMTFSLDAGSFATSIVRELVIEEKVEREY >NZ_CP018680|2064667:2082377|2067392_2067860_-|WP_009703750.1|DBSCAN-SWA MYQKRQAPALSSKEAAIQLLSRRDHGQYELYQKLALKGYEEADIEVAINFCLDHNYLDDLRYAKSQVRQHVYKGHGERRIRQELNQKRVAESIIDMAMAEEPQDWFELARMAAEKKFKGIKAKDQKEYAKQVRFLQYRGYSFDQISYALSFEDED >NZ_CP018680|2064667:2082377|2078230_2078512_-|WP_005452106.1|DBSCAN-SWA MRIFVIALTLLFGWLQYTLWFGKNGVSDYYTVEDEIEVQQQVNSKLQARNNEMFAEIDDLRQGLDAIEERARHELGLVKDGETFYRIVDEEEH >NZ_CP018680|2064667:2082377|2074536_2075163_-|WP_005452113.1|DBSCAN-SWA MSNPHADRLIAFLISSGISDQRVLDAIYRLPRESFVSQAMMHQAYDNNALPIGQGQTISQPYIVAKMTELLDLKSDSNVLEIGTGSGYQTAVLAQIVDHVYSVERIKSLQWDAKRRLKQLDIYNVSTKHGDGWLGWEAKGPFDAIIVTAAAESVPQALLSQLKEGGKMIIPVGEDEQQLLKIERQGEQYLSTVVEMVRFVPLVAGDLA |
15 | uncultured_Mediterranean_phage(18.18%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
3303841 : 3310908
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP018680|3303841:3310908|DBSCAN-SWA AATGAAACTGCCGATTTATCTTGATTATTCCGCTACATGTCCAGTAGATCCTCGTGTTGCTGAAAAAATGGTTCAGTACATGACAATGGATGGCACATTTGGCAACCCAGCTTCGCGTTCGCACCGTTACGGCTGGCAGGCAGAAGAAGCTGTAGATACTGCTCGTGAGCAAATTGCTGACCTTCTAAATGCAGATCCTCGCGAAATCGTATTCACATCTGGTGCGACAGAGTCTGACAACCTTGCTATCAAAGGCGCAGCACATTTTTACTCTAAGAAGGGTAAACACGTAATCACTTGTAAAACAGAGCATAAAGCGGTTCTTGACCCATGTCGTCAACTAGAACGCGAAGGCTTTGAAGTGACTTACCTAGAGCCAGAATCAAACGGTCTGATCGATCTAAACAAACTTCAAGCAGCGATGCGAGAAGACACTGTTTTGGTATCAATCATGCACGTTAACAACGAGATTGGTGTAATCCAAGACATCGCTGCAATCGGCGAACTATGTCGTGAACGTAAGATTGTGTTCCACGTGGATGCGGCACAGTCTGCGGGCAAACTGCCAATCGATGTTCAAGAATTGAAAGTTGACCTAATTTCACTGTCTGCGCACAAAGTATACGGTCCTAAAGGCATTGGCGCACTTTACGTACGTCGTAAGCCTCGTATCCGTCTGGAAGCGCAAATGCACGGTGGCGGTCATGAGCGTGGTTTCCGTTCGGGTACACTTCCAACTCACCAAATCGTGGGTATGGGTGAAGCGTTCCGTGTAGCGAAGGAAGACATGCAAAAAGATTACGATCACGCGCTAGCACTTCGTAACCGTCTTCTAGATGGCGTTAAAGATCTAGAAGCAGTAACGGTGAACGGTGATCTTGAACAGCGTGTTCCACACAACCTAAACGTGAGTTTTGCATTCGTAGAAGGTGAATCTCTACTGATGTCACTGAAAGACCTAGCAGTATCTTCTGGTAGTGCTTGTACATCAGCGAGCCTAGAGCCATCTTACGTTCTTCGTGCACTTGGTCTAGACGACGAATTGGCGCACAGCTCAGTACGTTTCTCATTTGGTCGTTTCACGACTGAAGCAGAAATTGATTACGCAATTGAACAAATCCGCGTTGCAGTGACTAAGCTACGCGACATGTCTCCTCTATGGGATATGTACAAGGAAGGGGTTGATCTGAGCACTGTTGAATGGGCTCACCATTAATCTCACGGACTGAATAGAGGATTCGAGGTAATTTATCATGGCATACAGCGAAAAAGTAATTGATCACTACGAGAACCCACGTAACGTTGGTTCGTTCGATAAAGAAGACCCATCAGTAGGCAGCGGCATGGTTGGCGCACCAGCTTGTGGTGACGTAATGAAGCTTCAAATCAAAGTGACGCCAGAAGGCATCATTGAAGACGCGAAATTCAAAACATACGGTTGTGGTAGTGCAATCGCTTCAAGCTCACTAGTAACTGAGTGGGTAAAAGGCAAGTCTATCGACGAAGCGGCATCTATCAAAAACTCTGAGATTGCAGAAGAGCTAGAGCTTCCACCAGTGAAAGTTCACTGCTCAATCCTTGCGGAAGACGCAATCAAAGCAGCGGTTGCTGACTACAAGAAAAAACACCAGGAATAATTATTCCGTATATAATGGGAGCCCTTGAAATGCTCCCATTCCAAAACGTAATTTAGACAGACTAAGGTGCAGTATGGCCATCACAATGACAGAATCAGCGGCAAGTCGCGTAAAAGCATTCCTAGATAACCGAGGTAAAGGTATCGGTTTGCGTCTTGGAGTGAAGACGACAGGCTGTTCCGGCATGGCTTACGTTCTTGAGTTTGTTGATGATCTAAACGAAGAAGATGAAGTTTTTGAACTGTCGGATGTAAAGATCATCATTGATAAGAAAAGCCTAGTATACCTAGACGGTACTGAGCTGGATTACGTCAAGGAAGGGTTGAATGAAGGTTTTGAGTTCAACAACCCTAACGCGAAAAGTGAATGTGGTTGTGGTGAAAGCTTCAACGTCTAATTGACTGTCCTTCTGGGCAAGCGAGTCCAGAAGGCTTTACTAGGACCCGATCTTAATGAATCACTTTGAATTATTTGGGCTACCAAGTCAGTTTCAGCTGGATGGTAGCCTTCTTTCTTCTCAGTTCCGAGAACTACAAAAACGCTTCCACCCAGACAACTTTGCGACTGCCTCAGAGCGTGATCGCTTGATGGCTGTTCAAAAAGCAGCGCAAATCAACGATGCGTATCAAATACTTAAGCATCCAATCTCTCGCGCTGAATACATATTGGCAGAAAACGGCACAGAAATTCGTGGTGAGCAACAAACCATGCAAGACCCGATGTTCCTAATGGAACAAATGGAACTGCGTGAAGAACTGGAAGACATTGCTGACAGCTCTGATCCTGAATCAGCGCTTTTTGACTTCGATTCTAAGGTCAGCAAAATGTACAAACAGCATTTGGCGAGTGTAGAACAAGAACTTGACCAAGGTCTTTGGGCGGAAGCTGCAGACCGAGTTCGTAAACTCAAATTCATTGCCAAGCTAAAGAACGAAATTGAACTGGTTGAAGACAAGCTCCTCGGCTAGTTAGCGCGACAAGGACCCATCATGGCATTACTTCAAATCGCAGAACCGGGCCAAAGCTCGGCACCTCATGAGCACAAGTTAGCAGCGGGTATCGACTTAGGTACTACCAACTCTTTGGTTGCGTCTGTTCGCAGCGGTGACGCAAACACACTGACAGACGATCAAGGTCGTAGCATTTTGCCTTCTGTGGTGAACTACGCACAGGACTCGGCTTTAGTTGGTTACGAGGCAAAAGCAAAAGCAGAGCAAGAACCAGAAAACACCATTATTTCGGTAAAACGTCTGATTGGTCGTTCACTAAAAGACATTCAAGCTCGCTACCCATCTTTGCCTTACCAATTTAAAGAAAGCGACAACGGTCTGCCTATTTTGCAAACTGCACAAGGTGACAAGAACCCGATTGAAGTGTCTGCCGATATTCTTAAAGCACTAGGAAAACGTGCAGAAGATACACTCGGCGGCGAGTTAGCTGGCGTGGTTATTACTGTTCCTGCTTACTTTGATGATGCTCAACGTGCTGGTACAAAAGATGCGGCGAAATTGGCGGGGCTTCATGTGCTACGTCTGCTTAACGAGCCGACAGCTGCAGCGATCGCTTACGGCCTAGACTCTGGTCAAGAGGGCGTGATTGCGGTTTACGATCTAGGTGGCGGTACGTTCGATATCTCCATCCTACGTTTGTCAAAAGGCGTATTTGAAGTATTAGCAACAGGTGGTGATTCTGCGCTAGGCGGTGATGATTTTGACCACCTATTAGCGGATTACCTAATGGAGCAAGCTGGTCTAGAAGCGCCGCTTTCTGCTGAGAAAAACCGTGCACTACTGAACATTGCAACAGCAACTAAGATTGCTTTCTCTGAGCAAGACAGTGTTGATGTTGATGTATTTGGTTGGAAAGGTTCTGTGACTCGCGAGCAGTTCGAAGATCTGATTCGTCCGCTAGTGAAGAAAACCCTGATGTCTTGCCGTCGTGCATTGAAAGATGCGGATGTGGATGCGGAAGACGTACTTGAAGCGGTTATGGTTGGCGGTTCAACTCGCACATTGCTTGTTCGTGAAATGGTAGGCGAATTCTTCGGTCGTACACCACTGACAAGCATTAACCCAGATGAAGTTGTTGCAATCGGTGCGGGTATTCAAGCAGATATCCTAGCGGGTAACAAGCCTGACTCTGAGATGCTGCTTCTGGACGTTATTCCTTTATCTCTTGGTATTGAAACCATGGGCGGCTTGGTTGAGAAAATCATTCCACGTAACACCACTATTCCTGTGGCTCGTGCGCAAGAGTTTACGACGTTCAAAGATGGTCAAACGGCAATGAGCGTGCACACCGTACAAGGTGAACGAGAAATGGTGGATGACTGTCGTTCACTGGCTCGTTTCTCATTAAAAGGCATTCCACCGATGGCGGCTGGTGCTGCACACATTCGCGTGACGTACCAAGTGGACGCTGATGGTCTGTTGTCTGTCACTGCGATGGAAAAGAGCACGGGTGTTCAGGCTGAAATCCAAGTGAAACCTTCTTACGGTTTAAGCGATGACGAAGTCGCAAACATGCTGCGTGACTCGATGACTTACGCGAAAGAAGACATGCAAGCTCGTGCACTGGCTGAGCAGCGCGTAGAAGCGGATCGTGTGATTGAAGGTTTGATTGCTGCGATGCAAGCTGATGGTGATGAACTACTATCAGACCAAGAGAAACAAGACCTATTAAAAGTGATTGAAGCGCTGATTGAACTGCGCAACGGCGAAGATGCGGACGCTATCGAGCAAGGCATCAAAGACACCGACAAAGCAAGCCAAGATTTTGCGTCGCGCCGTATGGATAAATCGATTCGAGCTGCACTGTCAGGTCAGTCAGTTAATGATATATAAGAGATAAGTAGTTATGCCTAAGATTATTGTATTACCACACGAAGATTTGTGCCCAGAGGGTGCAGTGTTGGAAGCAAACACTGGTGACACCGTTCTAGACGTAGCGCTAAAAAATGGCATTGGTATCGAACACGCATGTGAAAAGTCATGTGCATGTACTACCTGCCACGTGATCATTCGTGAAGGTTTTGATTCGCTAGAAGAGAGTGAAGAGCTAGAAGATGACATGCTAGATAAAGCATGGGGCCTTGAGCCTGAATCTCGCCTTGGTTGCCAAGCAAAAGTAGCGGATGAAGATTTGGTTGTTGAAATTCCAAAATACACGCTAAACCACGCGTCGGAAGATCACTAATACTATCTCCCCAACATAGGGGCGAAGCAAAAGGAAGATGAGCAATGAAATGGACAGATTCTCGCGATATCGCGATTGAGCTTTGTGACAAGTTTCCCGACTTAGATCCTAAAACCGTACGTTTTACCGATCTTCACCAATGGATCATGGAGTTGGATGAGTTTGACGATGAGCCAAACCATTGTAATGAAAAGATTCTAGAAGCGGTTATCTTGTGCTGGATGGACGAAGCAGATTAATGCTCAATTCGGCGGTAAAACGCAGCAGAGTTAACAATTCTCACTAAAAAAAGCGGACCTTTAGGTCCGTTTTTTTATCAAGTTGACATTTTTACCCTACATAATGCTAACATCCGTTAGTTATTAAAAAAGAAGAGGCGTAGGCAACGCCGTTATAAGACAAGGAGAAACCATGTCTACACAGATGTCTGTATTTCTTAGCCAAGAACCAGCGGCACCACATTGGGGCGACAAAGCACTGCTTTCTTTCGCTGAGAATGGCGCAACCATTCACCTAGGTGAAGGTCACGATCTTGGTGCGATTCAACGTGCAGCGCGTCAACTCGATGGCCAAGGAATTCGTTCAGTTTTACTTGCTGGTGATAACTGGGATTTGGAGAGCGTTTGGGCATTCCACCAAGGTTACCGCAACCCGAAAAAACACGGCACACTAGAATGGGGTGAGCTTTCTGAGCAAGATCAAGCAGAGCTTCACGCTCGCATCACTTCTACGGATTTTACTCGCGATATCATCAACAAAACTGCAGAAGAAGTGGCGCCACGTCAATTGGCAACCATGGCAGCTGAATTCATCAAATCTATCGCGCCACAAGGTACAGTAACGGCTCGTATCGTTAAAGATAAAGATCTGCTTTCAGAAGGTTGGGAAGGTATTTATGCCGTAGGTCGTGGCTCTGAGCGCACATCGGCAATGCTGCAACTTGATTACAACCCAACAGGCGATGAAAACGCACCAGTCTTTGCATGTCTAGTAGGTAAAGGCATCACCTTTGATTCAGGTGGTTACAGCTTGAAACCATCAAACTTCATGACAGCTATGAAAGCGGATATGGGCGGTTCAGGTACGATAACTGGCGGTCTTGGTCTTGCTATTCTTCGTGGTCTTAATAAGCGTGTAAAGCTTATCCTTTGTTGTGCGGAGAACATGGTCTCTGGTCGCGCACTTAAACTTGGCGACATCATCACTTACAAAAATGGTAAGACCGTTGAGATCATGAATACCGACGCAGAAGGTCGTTTGGTTCTTGCTGATGGTCTTATTTACGCAAGTGAGCAAAACCCTGAACTGATCATTGACTGTGCGACCCTAACGGGCGCAGCGAAGAATGCACTAGGGAACGATTACCATGCATTAATGAGCTTTGATGATGAGCTTTCTCATCAAGCGCTAACGGCGGCAAACCAAGAGAAAGAAGGTCTGTGGCCACTGCCTCTTGCTGATTTTCACCGTGGCATGCTGCCGTCAAACTTTGCTGACCTATCTAACATCAGTTCTGGCGATTACTCACCTGGCGCAAGTACAGCTGCAGCATTCTTGTCTTACTTTGTCGAGGATTACAAGAAAGGTTGGTTGCACTTTGACTGTGCAGGCACGTACCGTAAGTCAGCGTCTGATAAGTGGGCTGCAGGCGCAACAGGCATGGGTGTTCGCACACTTGCTCGTTTCTTGAACGAACAAGCGGCAAAATAAGACTTTTACCAGTGCAGACACCCCGTCTGTACTGGTATAACTAAGAGAGACGCAAGTCACTCATATAAAACTATTATTACAAGTACAAAAAGGAAACTTTATGGCTCTAGAAAGAACGTTTTCAATTGTTAAGCCCGACGCTGTAGAACGCAACCTGATTGGTGAAATCTACCACCGTATCGAAAAGGCGGGTCTACGTATCGTTGCTGCAAAAATGGTTCATCTGACAGAAGAACAGGCGAGCGGTTTTTACGCAGAACATGAAGGCAAACCTTTTTTCCCTGCACTGAAAGAGTTCATGACATCTGGTCCTATCATGGTTCAGGTTCTTGAAGGTGAAGATGCAATCGCACGCTACCGTGAGCTAATGGGCAAAACAAACCCAGAAGAAGCAGCATGTGGCACTATCCGTGCCGATTACGCACTAAGCATGCGCCATAACTCTGTTCATGGTTCGGATAGCCCTGAGTCAGCAGCTCGTGAAATCGAGTACTTCTTCCCTGAGTCAGAGATTTGCCCTCGCTAA
Protein sequences of DBSCAN-SWA_4 >NZ_CP018680|3303841:3310908|3310482_3310908_+|WP_005440311.1|DBSCAN-SWA MALERTFSIVKPDAVERNLIGEIYHRIEKAGLRIVAAKMVHLTEEQASGFYAEHEGKPFFPALKEFMTSGPIMVQVLEGEDAIARYRELMGKTNPEEAACGTIRADYALSMRHNSVHGSDSPESAAREIEYFFPESEICPR >NZ_CP018680|3303841:3310908|3305929_3306445_+|WP_005447715.1|DBSCAN-SWA MNHFELFGLPSQFQLDGSLLSSQFRELQKRFHPDNFATASERDRLMAVQKAAQINDAYQILKHPISRAEYILAENGTEIRGEQQTMQDPMFLMEQMELREELEDIADSSDPESALFDFDSKVSKMYKQHLASVEQELDQGLWAEAADRVRKLKFIAKLKNEIELVEDKLLG >NZ_CP018680|3303841:3310908|3306466_3308320_+|WP_009702953.1|DBSCAN-SWA MALLQIAEPGQSSAPHEHKLAAGIDLGTTNSLVASVRSGDANTLTDDQGRSILPSVVNYAQDSALVGYEAKAKAEQEPENTIISVKRLIGRSLKDIQARYPSLPYQFKESDNGLPILQTAQGDKNPIEVSADILKALGKRAEDTLGGELAGVVITVPAYFDDAQRAGTKDAAKLAGLHVLRLLNEPTAAAIAYGLDSGQEGVIAVYDLGGGTFDISILRLSKGVFEVLATGGDSALGGDDFDHLLADYLMEQAGLEAPLSAEKNRALLNIATATKIAFSEQDSVDVDVFGWKGSVTREQFEDLIRPLVKKTLMSCRRALKDADVDAEDVLEAVMVGGSTRTLLVREMVGEFFGRTPLTSINPDEVVAIGAGIQADILAGNKPDSEMLLLDVIPLSLGIETMGGLVEKIIPRNTTIPVARAQEFTTFKDGQTAMSVHTVQGEREMVDDCRSLARFSLKGIPPMAAGAAHIRVTYQVDADGLLSVTAMEKSTGVQAEIQVKPSYGLSDDEVANMLRDSMTYAKEDMQARALAEQRVEADRVIEGLIAAMQADGDELLSDQEKQDLLKVIEALIELRNGEDADAIEQGIKDTDKASQDFASRRMDKSIRAALSGQSVNDI >NZ_CP018680|3303841:3310908|3305093_3305477_+|WP_005424883.1|DBSCAN-SWA MAYSEKVIDHYENPRNVGSFDKEDPSVGSGMVGAPACGDVMKLQIKVTPEGIIEDAKFKTYGCGSAIASSSLVTEWVKGKSIDEAASIKNSEIAEELELPPVKVHCSILAEDAIKAAVADYKKKHQE >NZ_CP018680|3303841:3310908|3305550_3305874_+|WP_005424893.1|DBSCAN-SWA MAITMTESAASRVKAFLDNRGKGIGLRLGVKTTGCSGMAYVLEFVDDLNEEDEVFELSDVKIIIDKKSLVYLDGTELDYVKEGLNEGFEFNNPNAKSECGCGESFNV >NZ_CP018680|3303841:3310908|3303841_3305056_+|WP_005447719.1|DBSCAN-SWA MKLPIYLDYSATCPVDPRVAEKMVQYMTMDGTFGNPASRSHRYGWQAEEAVDTAREQIADLLNADPREIVFTSGATESDNLAIKGAAHFYSKKGKHVITCKTEHKAVLDPCRQLEREGFEVTYLEPESNGLIDLNKLQAAMREDTVLVSIMHVNNEIGVIQDIAAIGELCRERKIVFHVDAAQSAGKLPIDVQELKVDLISLSAHKVYGPKGIGALYVRRKPRIRLEAQMHGGGHERGFRSGTLPTHQIVGMGEAFRVAKEDMQKDYDHALALRNRLLDGVKDLEAVTVNGDLEQRVPHNLNVSFAFVEGESLLMSLKDLAVSSGSACTSASLEPSYVLRALGLDDELAHSSVRFSFGRFTTEAEIDYAIEQIRVAVTKLRDMSPLWDMYKEGVDLSTVEWAHH >NZ_CP018680|3303841:3310908|3308716_3308911_+|WP_005424887.1|DBSCAN-SWA MKWTDSRDIAIELCDKFPDLDPKTVRFTDLHQWIMELDEFDDEPNHCNEKILEAVILCWMDEAD >NZ_CP018680|3303841:3310908|3308333_3308672_+|WP_005424889.1|DBSCAN-SWA MPKIIVLPHEDLCPEGAVLEANTGDTVLDVALKNGIGIEHACEKSCACTTCHVIIREGFDSLEESEELEDDMLDKAWGLEPESRLGCQAKVADEDLVVEIPKYTLNHASEDH >NZ_CP018680|3303841:3310908|3309083_3310382_+|WP_050940961.1|DBSCAN-SWA MSTQMSVFLSQEPAAPHWGDKALLSFAENGATIHLGEGHDLGAIQRAARQLDGQGIRSVLLAGDNWDLESVWAFHQGYRNPKKHGTLEWGELSEQDQAELHARITSTDFTRDIINKTAEEVAPRQLATMAAEFIKSIAPQGTVTARIVKDKDLLSEGWEGIYAVGRGSERTSAMLQLDYNPTGDENAPVFACLVGKGITFDSGGYSLKPSNFMTAMKADMGGSGTITGGLGLAILRGLNKRVKLILCCAENMVSGRALKLGDIITYKNGKTVEIMNTDAEGRLVLADGLIYASEQNPELIIDCATLTGAAKNALGNDYHALMSFDDELSHQALTAANQEKEGLWPLPLADFHRGMLPSNFADLSNISSGDYSPGASTAAAFLSYFVEDYKKGWLHFDCAGTYRKSASDKWAAGATGMGVRTLARFLNEQAAK |
9 | Faustovirus(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
3400137 : 3406821
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP018680|3400137:3406821|DBSCAN-SWA CATGACAACGAATCAACAGAATGCAGTTGTTTCACAACCACAGACCGTCGTTGTGAAACTGGGTACAAGTGTGCTGACAGGCGGCACTTTGGCGTTAGACCGTGCTCATATGGTAGAGCTGGCTCGTCAATGTGCTGAGCTTAAAAAACAAGGCCATTCAGTAGTAATGGTTTCGTCTGGTGCGATTGCAGCAGGCCGTGAGCATCTTGGATACCCCGCACTGCCCAACGCGATGGCGAGCAAACAGTTGCTTGCAGCCGTGGGTCAGAGCCGTCTGATTCAAACATGGGAGTCTTTGTTTGGTATTTACGGTATCAAAATTGGTCAGATGCTACTGACTCGCGCTGATTTGGACGATCGTGAACGTTTCCTTAACGCACGTGACACCATTAATGCGTTGGTTGCGAATGACATTATCCCTATCGTTAATGAAAACGATGCTGTCGCAACAAATGAAATCAAAGTAGGTGATAACGATAACTTGTCTGCTTTGGTGGGGATTTTGTGTGGTGCAGACAAATTGCTGCTCCTGACTGATCAAAAAGGTTTGTTTACCGCAGACCCACGTAAAGATCCGAATGCTGAGCTGATCAAAGAAGTGAAAACTATCGATGATACTTTGCGTAAAATTGCAGGTGGTAGTGGCACAACGCTAGGTACGGGCGGCATGGCAACCAAACTTCAAGCGGCAGACATAGCTCGCCGTGCAGGTATTGAAGTTATTATCGCCGCTGGTAGCGCACCAAATGTTATCTTTGACTCATTGAGTTCAGAGCCACAAGGCACACGTTTCCTACCGTGTTCTGAAGCACTAGAAAATCGTAAGCGTTGGATCTTAGCCGGTCCTGCGGCGTCTGGTGACATCATTATTGATGACGGTGCCGTTAACGCGGTTGTAGGTAAGGGCAGTAGTTTGCTTGCGAAAGGCGTTATTAAAGTAAGTGGTGATTTTGCTCGTGGCGAAGTCGCCCGCGTGACAAACTCCCACGGTAAACTGGTTGCTCGTGGTATCAGTGCCTACTCAAGTGAAGACCTAGCGAAAATTGCAGGTAAGCACAGTAAAGATATCATCTCTATTTTGGGTCATGACTACGGCTCAGAAGTGATTCATCGCGATGATCTTGTCGTCATTCAAGAATAGAAATTGAGGGAAGCAAAGTGGATTTAACGAATATGGGTAAAGCTGCCAAAGATGCAGCTTTCGAATTAGCGACAGCGTCAACCGCACAAAAGAACCAAGCACTAGCCATTATCGCCGATGAGCTAGAAGCAAATGCAGCAACCATTTTGGCTGCAAACGCAAAAGATATTGAGTTAGGTCGCGAAGCGGGTCTAACGGATGCATTACTTGACCGCCTTTTACTGAATGAAGAGCGCTTAACCGGTATTGCTAACGATGTTCGTAATGTGATCAGCCTTAACGATCCTGTGGGCAGTGAAATTGACAGCAAAGTGCTAGAAAACGGCATGTCTCTGTCTCGTCGTCGCGTGCCACTTGGTGTAGTTGGTGTGATTTACGAAGCGCGTCCGAATGTAACTATCGATATTGCTGCTTTGTGTCTAAAAACAGGTAATGCAAGCATCCTACGTGGTGGTAAAGAGACATTTTTCTCTAATATGGAGCTGGTGAAGGTTATCCAATCGGCGCTTGCAAAAGCAAACTTGCCAGCGGCTTCTGTTCAGTACATCGAAAAACCGGATCGTGAGTTGGTTTCTCAATTACTTAAGCTAGACGACTACGTCGATATGATCATTCCACGTGGTGGCGCTGGTCTACACAAAATGTGTAAAGAGAACAGCACCATTCCAGTGATCATTGGTGGTTTTGGTATTAGCCACATTTTTGTTGATGAATCAGCAGAGCTAAAGAAATCACTCAATGTTGTTGAAAACTCAAAAGTTCAACGCCCGTCCGCATGTAACTCGCTAGATACTTTGCTGGTTCACGAGAATATCGCTGCGCAGTTCCTGCCAATGATCGTAGAGCGCATGAACGAAAACGTAACTTTTGTCGCAGAGCCAAAAGCAAAAGCACTGATGGCACAAGCGAAACAAATTCGCGATGCGGGTGAAGGTGATTTTGATACGGAATGGCTAAGTTATACGCTAGGTGTAAAAGTGGTTGCTGATGTGAAAGAAGCGATTGATCACATGCGCGTTCACAACGCGAGCCACTCAGATGCGATCATGACAAACAGCCTACAAAACTCAGAATTGTTCATTAACTCTGTCGGTTCTGCTGCGGTTTACGTAAATGCCGCAACACGTTTTACTGACGGTGCTCAGTTTGGTTTAGGTGCTGAAGTTGCGGTATCAACTCAGAAACTGCATGCTCGTGGTCCTATGGGTTTGGAAGAACTGACCAGCTACAAATGGGTTGGTAAAGCGAACTACTTAGCACGCAGTTAACAGAAACCGTTTGATATTGATTGAAAAAGGGGCAGCGATGCCCCTTTTTGTTTTCTCTAACATAGCAATTCCGCTACACTAGTCTTGCTTTTGGGACGAACCCAATTGCAAGCTATTTGGAGGTGATATGCATTGTCCTTTTTGTTCTGAGAACGATACAAAAGTAATTGATTCACGCTTGGTTGCTGATGGGCATCAAGTGCGTCGTCGTCGTCAATGCCTTGCGTGCAGTGAACGTTTTACAACGTTTGAGACCGCAGAGTTGGTGATGCCGAAAGTGATTAAATCTAACGGTAACCGTGAACCATTTGATGAAGATAAAATGGTTGGTGGTATTCAACGTGCCTTAGAAAAACGTCCAGTCAGTGCTGACTCTATTGAGCTAGCCATCAGCATGATTAAGTCCCAACTGCGTGCAACTGGTGAACGTGAAGTACCAAGTGAAATGATTGGTAATTTGGTTATGGATCAATTGAAAGAGCTGGATAAAGTCGCTTATATCCGCTTTGCATCGGTTTATCGTAGCTTTGAAGACATCCGTGAATTCGGTGAAGAAATTGCTCGCCTAGAAGACTGATTGAATCATGGCTCAACAATCTCTCTCTTCAGAATTTACCTCTCAAGATTTCGAAATGATGTCGCGTGCGCTCAAGTTAGCGAAGCGCGGCATTTATACAACGGCACCTAACCCGAATGTAGGCTGCGTTATTGTCCGTGACGGTGAGATTGTTGGCGAAGGTCATCACCATCGTGCGGGGGAACCTCATGCTGAAGTTCACGCGATGCGTATGGCTGGCAATAAAGCTGAAGGCGCAACGGCATATGTGACACTGGAACCTTGCTCTCATTATGGTCGTACACCACCTTGTGCAGAGGGGCTAATCAAAGCCAAAGTCGCACGTGTTGTTTGTGCAATGGAAGATCCAAACCCGAAAGTTGCAGGACGCGGTTTTCAAATGCTGCGCGATGCGGGGGTAGACGTTCAAGTTGGCTTATTAGAAAGCGATGCGATCGAGCTGAACAAAGGGTTTATCAAGTTCATGCAAACTGGCATGCCTTATGTTCAGTTGAAAATGGCAGCCAGCCTCGATGGGCAAAGTGCACTCAATAATGGTCAGAGTCAATGGATCACCTCCCCAAAGGCTCGCCAAGATGTACAGCGTTATCGAGCGTTGAGTGGCGCTATTCTCTCGACCAGTAAGACAGTCATTGATGACAATGCATCGCTGAATGTGCGTTGGGATGATCTGCCATGCAGTGTGCAAGCTCAATATCCTCAAGATGAAGTTCGTCAGCCTCCTCGTGTGATTTTCGATCGCCAGTCTCAACTTAGCGATGATCTAAAACTATTCAATACTGACGGTGAGCGTATTATCGTTAGTCACGATGGCGATATTGCGCCTGAGCTGACGGAAAATGGTCAAATTGACTTAACGGCGACATTAAAAGCTGTGGCGAGCGAATATCATATTAATCACTTGTGGGTTGAAGCTGGCGCTACACTCGCGAGTTCATTAATCAAAGCTAACCTTGTAGATGAACTGATTGTTTATCTTGCTCCTAAATTAATGGGCAGCGATGGGCGAGGTTTGATTGGTGCTCTTGGGCTAACCGATATGGCACAAGTTATCGATTTAACTATTACCGACGTTCGAATGGTAGGGGTGGATATTCGCATTACCGCAACCGTCAAACGCAACCAAAGCTAGAAAGAAGCATTATGTTTACAGGAATTGTAGAAGCAGTTGGCAAACTAACTGCAATCACGCCAAAAGGCGAAGACATTACCGTTACGGTCGAAGTTGGCAAGTTGGATATGTCTGACGTTAAGTTGGGGGACAGTATCGCAACGAATGGTGTGTGCTTGACGGTCGTCGACTTCGGTAGCAACTATTACAGTGCTGACTTATCACTGGAAACGCTGAATAAAACAGGTTTTGCCGCTTACCAAGTTGGCGATAAAGTGAACCTGGAAAAGGCCATGTTACCAACGACACGTTTTGGTGGGCACATTGTGTCTGGTCACGTCGATGGCGTGGGCGTGATTGTTGAGCGCAACCAAGTTGGTCGAGCGATTGAATTCTGGGTAGAAATGCCTACTGAAATCAGTAAGTACGTCGCAGAAAAAGGCTCGATCACAGTGGATGGCATTAGTCTGACCGTCAATGACCTACGTAAAAATGGTTTTAAATTGACGATTGTTCCTCACACCAGTGAAGAGACCACGATTGATCAGTTCCAGGTGGGTCGCAAAGTGAACCTAGAAGTGGATGTACTGGCTCGCTACATGGAACGCTTGCTACAAGGTCAAAAAGAAGAATCGCAAGAGTCACGTATTACCATGGACTTCTTGCAGCAAAATGGTTTTGCGTAGGCGAAACCGACAATAATCACAGAACGTTATCAGGTTAATTGACTTAATAAGGGTTGGCTGCAAACTGCGCTAAAGCCAGTAGACATGGCTAACCGAAACGTTAAACGTATTGCTGCTGCAATAATAAATTTATGAATAACGGGAAACAGAATATGCCTATCAGCACGCCTCAAGAAATTATTGATGACATTCGCGCGGGCAAAATGGTCATCCTAATGGATGATGAAGACCGCGAGAATGAAGGTGATTTGATCATGGCTGCCGAGCACATTACGCCGGAAGCAATCAACTTTATGGCTACGCATGGTCGTGGCTTGATCTGTCTAACCATGACAAAAGCACGTTGTGAAAGCCTTGGTTTACCGCCAATGGTGCAAGACAACAATGCGCAATACACCACGAACTTTACCGTTTCTATTGAAGCGGCTGAAGGTGTTACGACAGGTATCTCTGCCGCAGACCGTGCTCGCACTGTACAAGCAGCCGTTGCGCCAGATGCGAAAGCAGCTGATTTAGTTCAACCTGGTCATATCTTCCCTCTAGCAGCACAAGATGGTGGCGTTCTAACTCGCGCTGGTCATACAGAAGCAGGCTGCGATTTGGCTCGCCTTGCAGGTCTAGAGCCAGCATCGGTTATCGTTGAGATCCTAAACGACGACGGCACGATGGCACGTCGTCCTGATCTTGAAGTATTTGCTGAAAAGCACGGCTTAAAGCTAGGTACTATTGCGGACTTGATTGAATACCGCAATAACACCGAAACAACGATTGAACGTGTTGCTGAGTGTGCATTGCCAACAGAGTTCGGTGAGTTCACGCTTGTGACTTACAAAGACACGATTGATAACCAAGTGCACTACGCAATGTGTAAGGGCGACCTTGCTTCTGAAGCGCCATTGGTCCGCGTTCATTTACAAGATGTATTTACCGATGTCCTTCGAAGTGACCGCAACGCAGAGCGTAGCTGGACACTTGAGAAAGCAATGAAGCGCATTGGCGAAGAAGGTGGTGTTCTGGTTGTACTTGGCAACGAAGAGTCAACCGATCTGCTTATTCACCGAGTGAAAATGTTTGAAGCACAAGACAAAGGCGAAGCACCAACGCTTGCTAAGAAGCAAGGTACGTCACGCCGAGTGGGTGTCGGCTCTCAAATCCTTGCTGATCTGGGCATTCACGATATGCGCCTGCTTTCTTCAACCAACAAGAAGTACCACGCACTCGGCGGTTTTGGTCTGAATGTGGTTGAGTACGTCTGCGAATAATAACTGACCAGCGAGTTCCTAGTCTTACACGCACTTATCGTCGCTAAGGCGAGGAACTCGTGATTCGTCCCCAATTTCCATTAGATATTGCTCACAAAATTGTGATAGAATCCGGCGATTCTCACTTGATGAACAATTTTCACTTATAAATAGAGTTAAAGGAAAGCTTATGAAAGTGATCGAGGGTGGCTTCCCAGCGCCAAATGCAAAAATTGCTATCGTTATTTCTCGTTTCAACAGTTTTATTAACGAAAGCCTACTATCTGGTGCGATCGATACACTAAAGCGTCACGGACAAGTAAGCGAAGACAACATCACTGTTGTTCGTTGCCCAGGCGCCGTAGAACTACCTCTAGTAGCTCAACGTGTTGCTAAAACTGGTAAGTACGATGCAATCGTATCTCTAGGTACAGTAATTCGTGGCGGTACACCGCACTTTGACTATGTTTGTAGTGAATGTAACAAGGGTCTTGCACAAGTTTCTCTGGAGTACAGTCTTCCAGTAGCATTTGGTGTATTGACTGTTGATACTATCGATCAAGCGATTGAACGCGCAGGAACCAAGGCTGGTAATAAAGGTGCAGAGGCTGCACTAAGCGCACTTGAAATGATTAATGTACTATCTGAAATTGATTCCTAA
Protein sequences of DBSCAN-SWA_5 >NZ_CP018680|3400137:3406821|3402672_3403122_+|WP_005424846.1|DBSCAN-SWA MHCPFCSENDTKVIDSRLVADGHQVRRRRQCLACSERFTTFETAELVMPKVIKSNGNREPFDEDKMVGGIQRALEKRPVSADSIELAISMIKSQLRATGEREVPSEMIGNLVMDQLKELDKVAYIRFASVYRSFEDIREFGEEIARLED >NZ_CP018680|3400137:3406821|3405071_3406181_+|WP_074051116.1|DBSCAN-SWA MPISTPQEIIDDIRAGKMVILMDDEDRENEGDLIMAAEHITPEAINFMATHGRGLICLTMTKARCESLGLPPMVQDNNAQYTTNFTVSIEAAEGVTTGISAADRARTVQAAVAPDAKAADLVQPGHIFPLAAQDGGVLTRAGHTEAGCDLARLAGLEPASVIVEILNDDGTMARRPDLEVFAEKHGLKLGTIADLIEYRNNTETTIERVAECALPTEFGEFTLVTYKDTIDNQVHYAMCKGDLASEAPLVRVHLQDVFTDVLRSDRNAERSWTLEKAMKRIGEEGGVLVVLGNEESTDLLIHRVKMFEAQDKGEAPTLAKKQGTSRRVGVGSQILADLGIHDMRLLSSTNKKYHALGGFGLNVVEYVCE >NZ_CP018680|3400137:3406821|3404265_3404919_+|WP_009697720.1|DBSCAN-SWA MFTGIVEAVGKLTAITPKGEDITVTVEVGKLDMSDVKLGDSIATNGVCLTVVDFGSNYYSADLSLETLNKTGFAAYQVGDKVNLEKAMLPTTRFGGHIVSGHVDGVGVIVERNQVGRAIEFWVEMPTEISKYVAEKGSITVDGISLTVNDLRKNGFKLTIVPHTSEETTIDQFQVGRKVNLEVDVLARYMERLLQGQKEESQESRITMDFLQQNGFA >NZ_CP018680|3400137:3406821|3400137_3401277_+|WP_005447599.1|DBSCAN-SWA MTTNQQNAVVSQPQTVVVKLGTSVLTGGTLALDRAHMVELARQCAELKKQGHSVVMVSSGAIAAGREHLGYPALPNAMASKQLLAAVGQSRLIQTWESLFGIYGIKIGQMLLTRADLDDRERFLNARDTINALVANDIIPIVNENDAVATNEIKVGDNDNLSALVGILCGADKLLLLTDQKGLFTADPRKDPNAELIKEVKTIDDTLRKIAGGSGTTLGTGGMATKLQAADIARRAGIEVIIAAGSAPNVIFDSLSSEPQGTRFLPCSEALENRKRWILAGPAASGDIIIDDGAVNAVVGKGSSLLAKGVIKVSGDFARGEVARVTNSHGKLVARGISAYSSEDLAKIAGKHSKDIISILGHDYGSEVIHRDDLVVIQE >NZ_CP018680|3400137:3406821|3401294_3402545_+|WP_074051115.1|DBSCAN-SWA MDLTNMGKAAKDAAFELATASTAQKNQALAIIADELEANAATILAANAKDIELGREAGLTDALLDRLLLNEERLTGIANDVRNVISLNDPVGSEIDSKVLENGMSLSRRRVPLGVVGVIYEARPNVTIDIAALCLKTGNASILRGGKETFFSNMELVKVIQSALAKANLPAASVQYIEKPDRELVSQLLKLDDYVDMIIPRGGAGLHKMCKENSTIPVIIGGFGISHIFVDESAELKKSLNVVENSKVQRPSACNSLDTLLVHENIAAQFLPMIVERMNENVTFVAEPKAKALMAQAKQIRDAGEGDFDTEWLSYTLGVKVVADVKEAIDHMRVHNASHSDAIMTNSLQNSELFINSVGSAAVYVNAATRFTDGAQFGLGAEVAVSTQKLHARGPMGLEELTSYKWVGKANYLARS >NZ_CP018680|3400137:3406821|3406350_3406821_+|WP_005440184.1|DBSCAN-SWA MKVIEGGFPAPNAKIAIVISRFNSFINESLLSGAIDTLKRHGQVSEDNITVVRCPGAVELPLVAQRVAKTGKYDAIVSLGTVIRGGTPHFDYVCSECNKGLAQVSLEYSLPVAFGVLTVDTIDQAIERAGTKAGNKGAEAALSALEMINVLSEIDS >NZ_CP018680|3400137:3406821|3403129_3404254_+|WP_009697719.1|DBSCAN-SWA MAQQSLSSEFTSQDFEMMSRALKLAKRGIYTTAPNPNVGCVIVRDGEIVGEGHHHRAGEPHAEVHAMRMAGNKAEGATAYVTLEPCSHYGRTPPCAEGLIKAKVARVVCAMEDPNPKVAGRGFQMLRDAGVDVQVGLLESDAIELNKGFIKFMQTGMPYVQLKMAASLDGQSALNNGQSQWITSPKARQDVQRYRALSGAILSTSKTVIDDNASLNVRWDDLPCSVQAQYPQDEVRQPPRVIFDRQSQLSDDLKLFNTDGERIIVSHDGDIAPELTENGQIDLTATLKAVASEYHINHLWVEAGATLASSLIKANLVDELIVYLAPKLMGSDGRGLIGALGLTDMAQVIDLTITDVRMVGVDIRITATVKRNQS |
7 | Staphylococcus_phage(66.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP018681_1 | 1012427-1012500 | Orphan |
NA
Consensus repeat of NZ_CP018681_1
|
1 spacers
spacers of NZ_CP018681_1
>1.1|1012452|24|NZ_CP018681|CRISPRCasFinder TTGGATATTGAGTTACTTGTTTAC |
CRISPR arrays and Neighbor proteins around NZ_CP018681_1
The CRISPR arrays of NZ_CP018681_1 >merge|NZ_CP018681|1|1012427-1012500|CRISPRCasFinder TTAGTTACTTAGCTACTTAACCTACTTGGATATTGAGTTACTTGTTTACTTAGTTACTTAGTTACTTAGCTACT >NZ_CP018681|1|1|1012427-1012500|CRISPRCasFinder TTAGTTACTTAGCTACTTAACCTAC TTGGATATTGAGTTACTTGTTTAC TTAGTTACTTAGTTACTTAGCTACT
>NZ_CP018681.2|WP_005449401.1|1011330_1012218_-|LysR-family-transcriptional-regulator MNKDLDLNLLKILVLLERHRQLKPVAKALGKSEASISKYLTRLRTQLEDELFIRHAHHFEPTDYLKRKLPEITDALGQLESCLVRREFDPLSYEKSISICLPQSAQQSFGDLLLIDLMELFPNAYINVESSTDSTIDDILVDKVDMQLHYFNEEYPKTIHQQFIGYAPAVIVVPEELGINDLETACKLDFILLELAGWKDREQVTKRALEQSGININRVATIGNITSLLKVIRSKAAATILLEYQKPIEGYTFIPVPESFYPQGRPKVVIQMKQSHRYNSMHQLLTDAIAKYVIS >NZ_CP018681.2|WP_005449400.1|1010725_1011334_+|response-regulator-transcription-factor MNNVLIIDDQPLYSEALASLVQNAFQTVNVVESTDSAEVMELIRNQPVDLVILDVVLGDRDGMRLAKNILATGYQGRILFISSRDYSSLSKAAFEMGAHGFLNKNESKETIMDAIVSVTRGYSMFKATHNQSSGNVALSNREAMVFHYLAQGYSNKKISEQLSLSAKTISTYKTRILKKYHAESVVELLNTLPQDECGSLCC >NZ_CP018681.2|WP_050938390.1|1007615_1008338_-|nitroreductase-family-protein MSQSSFESIVHSRRSVRKYDTNAEFNHDDVAKALALTTLSPNSSNMQMWEFHRVISAEKRAKLADFCMGQNAAKTANELVVFVATTHNWEERAKRNAETVRQAFEGRDDAAAKRALKYYEKLIPMVYNNDAFGIRGVVRKLYSWWLGRNKPMVREVAKSDVRVCLHKSVSLAAMTFMFAMREKGYDTCPMEGFDSKRVKQLLNLRDSDQITMVISCGIRTEEGVYGERHRVDNEAVIKTH >NZ_CP018681.2|WP_005449396.1|1006362_1007343_-|chemotaxis-protein-CheV MSNVTSTILTESGTNELEIIEFHLEKQMPDGSTKTCYYGINVAKVREVIRVPETTDYPNAQPHMIGVFSSREVLTPLVDLAGWLGVPTRRDLERKFVIVTDFNKMTNGFLIDSISRIHRISWNDVESPSQFLEAGEQDCVVAVVRKDGNLIMILDFEKIIADINPELSMEKYDVTGDKSVDLNQRMVGKRNAKTVMVVDDSAFIRSLIQDTLASAGYNVITCKDGGEAYEKLMSLIEVAKQEQLPVRELLDAVVTDVEMPRMDGMHLVKRLRETEAYNEMPIVMFSSLMSEDNRAKALSLGANDTITKPEIGRMVNMMDKYVFNFA >NZ_CP018681.2|WP_017819534.1|1005782_1006316_+|type-II-secretion-system-protein MKRKSGFTLIELVVVIVILGILAVTAAPRFLNIQDDAREARLEGMKGAIASALAIGYGKMATAGLESVPYVSNKKTAYPATNLPFDGCKTSDSMQCTFLYGYPDADDHTIGLLIQGINESSNSDWKTFHHEAGSFTREMVIAPRDDPRASLTQCGILYGPPMSKNKSYKLEILPCPK >NZ_CP018681.2|WP_005449394.1|1004704_1005427_-|sulfite-exporter-TauE/SafE-family-protein MEFDSTNLLAMGLIFLGSYVQTAIGFGLAIVAAPLLILWAPEYVPAPICLVALFISLMNAMKHRSNVELGGLKMALIGRIPGSLAGGALLFYVSTEALTLWIGFLVLFAVAVSVLPFRIEPTPGRMTFAGFFSGLFGTSSAIGGPPMALLLQHQEANQLRGNLSAFFVFSSIISLVVQIPVGFFNLHHLIITLPLLPAAAFGYWLAIKTTQSLPKEKVRFGALALCFISGITAVIQGLAL >NZ_CP018681.2|WP_005449393.1|1003716_1004622_-|EamA-family-transporter-RarD MNNNRYGNFMAALSFVIWGLLPVYYRFLPNASMDELLALRIIGSVPVGFLLVLVITHHMPKWAAIWKDKKSLWYTFVASSLMCISWVAFTWALTNNRVIDASLGFFIGPLVSVALGVFVLGDKLSKGQFIAILLATIGVVYQVIQYGQLPLVALTMGLFFSLYGLFKKKINFDWSTTLFIEAVVLTPIALAYLMFKQWTIGELSSTVDMTTFMLYLGSAPVTILPLIFYSIAIRITNLSTVGLMQYIEPSLQFVLAVVFFGELFDSVKAVTFAFIWVGLLFTICEGIAKLSKRKKLANHSL >NZ_CP018681.2|WP_005449392.1|1003131_1003518_-|MAPEG-family-protein MVTALYASILALLLIWLAFQVIKQRRSNKIAYADGGVEALQIARSAQSNASEYVPITLILMALLEYNGASLLWIHLAGIVFVIGRVIHAKGILGEDLKGRITGMKLTFFTMIGLVALNLIYLPYGSLW >NZ_CP018681.2|WP_050938205.1|1002604_1003138_-|hypothetical-protein MVKKTTYLPPAWLLVFVGLVLNIAAIILTSVVLDKLGKEISLLAEQKADNLYSIQLAWNSVETLERKREVLLLHLHLAQSEPVSEEMNQVLRGHLSAWTGQEVSPIQIEQLPNLMSEINQAQNNYRNRIDSYYLANLETTEVMTGLEEKMAWYKSIGLFLQVFGLALILARDLARKP >NZ_CP018681.2|WP_074051560.1|1000139_1002296_+|c-type-cytochrome MSTFWNWWAVACTIIFFALMVSVIVKYWRSNHKADQDHTIASFDGIDEKDAPPPKLLFISYAVAFLLSAGYLVLYPGLGEWKGLVNYDQGNDKLSSPSTTLDQQFSQVTNTSLAVLAQNTDIVRTGRMLYQTHCAACHRDNAQGQKHFPNLIDKEWMYGGDDDAIIHSIAKGRNGAMPGWVEVMRKDEVNKVAYYLASLNQRHSDVPEVKVELGKRLFVEYCASCHGDGSIANQQTGVPDLSDSVWLHGGSIEEIQHTINYGLNNLMPAFENQLSENEILAIGAYIHKAGEEEEQKLSALNAESVERGEYLAHAGDCVACHSAEGGEPFAGGLPFVTPFGTIYSTNITPHASEGIGEYNYDDFRDALVKGKGKHGYLYPAMPYTSYQYLTEQDMTDLWEYMQSIPAVARRNDENHMMFPSNIRLGLLGWNIVFMDTDPIDYNLPSELEGVVEDVEKWQKGKYWVAGLGHCSECHTPRNIAQALIPERIFQGNLIDGWNAPDITSTELYVGGWDEKTLTDFLHTGHSDKGTAFAGMADVVKNSLSLMTREDIESMSYYLLSGDVNNFIAADAVPLEPKGFDEAAYQDPIYATYKQTCGACHGNDGKGRAPIAPTLLNNGIIMHSDPFNTIAVTVRGLQPTYIDPERNFMPMASFEDILSDARLAELITFVRRHLGDQQQEVTAADVREVRETLEAAGYAGGLHTTPDMYDRRDNTINIR >NZ_CP018681.2|WP_005449405.1|1013872_1014472_-|response-regulator-transcription-factor MMNKLNYLIFDDHPLVCIAIKSLVESLGTANDVLTVTNSKDALKILKEQQIGLLILDVNLADCDGYDFYKRIKSHGFAGKVIFFSAETSHMYSQMAFRSGADGYVCKSENHEILKDAIEAVSKGYSFFKFKQSVEEHRNVPALSSRESAVMKLLLQGKTNRDIADVLSISDKTVSTYKRRVLDKYNVKNIVELSKVLGT >NZ_CP018681.2|WP_074051561.1|1014476_1015631_-|EAL-domain-containing-protein MNKDLKAMVIDDHPLQITLLKQMLSRHGVDVSTFDNVDSAIQHVKTSDVDIIFCDLQMPNKDGVDMMMMLNQIGYQGKVVLVSAMELMIVATVRAMCESFSFEVLGKLLKPYDEEQVVEMLNKSGVQPAKFTSFQQQVCVQDQEFLFALAEGRVKNYYQPLADANTGEIIGYEALARWFHPIYGVLAPYNFLSIVKRCHLSAELFDAMFSNALYDMKNRGLRLHVSLNVDHDNLEDPEFATRFIERCREHGISPDQFTIEITERDTFETNAALYKNLLKFRMSGVTVSIDDFGTGSSSLEKLAQLPFNELKIDRSFIQGLVNDPKKKNIVLAICALAKSLNISVVAEGVEDEPTLNAMRQYTVDVCQGYYIDKPMPLEAITILK >NZ_CP018681.2|WP_074051562.1|1015623_1019280_-|transporter-substrate-binding-domain-containing-protein MFSILRKTVILFVAFIGTTSPSYSFASDESSETVLVGVIALRSEDPEIGNKIGTYYGINLDYLTNIAKVLNLELELRSYNTIPELFADIENGTIDGAVGFSKSPDRESRFIFSDAFFSSTIAVWYREDSYYQRDPRELQWVCVIGTVYCEYLEDMGATKVRTVESRLKAFEEVRGGRANALISSYVAINQYLDENDIVNGAVDIPSWLNEEESRFIASKSNQALVDRINKILSWERNGKNIRSVASTNPYHVNDKLLVEYRRDLENKQIITYSSSEEAFPFLFRNPHTDELDGFLPDFIDLIQSRTGLRFEYQKPSSSLNSGLTAFNADLVPVAYVENPPLSDWLVTKPFMHNNFVAIEALDPKEHPAHDARSGILMSLKKQGLVNLDSWKEERFTRYEDLRQLLTDLKQGKIEVAYIPDDIVHSMIAQDNIDGLLISEKDTLTLSIAFAVADHNTKLKKILDSIIDTIDAKEIEKLNRTYRNFNLVYGYDEEHIFTMILIVVVVFVILLAVAYFVLAHLKLKVNLAELNATNEEKEKQWLMEIIQEINSIVFIHGEDNQIQMSNCSLYKNHRCKGCTIQSRKSAKPLVDNVIELRQVIAGKRIADITASNGCKLGIQHVYRERKAILSPSSKKKFVLTVLQDITEQKEREFALIDAQEKAQTAVRSRENFLATMSHELRTPLSAAHGLLDLLDRQTSSEGSKELITQAMRSLNHLNLLVDEVLDYSKLEAGQLKVTPVKTHAVNTLCDVIRSFEPKASSKGLEYRVTFKPFLNPWLKVDAVRLVQIATNLLSNAVKFTNEGEISVSIAVHEGRLILKIADTGIGMTETQLEGILQPFVQADDTITRKYGGTGLGLSIVDRLVDCMGGELCINSQFGLGTTMVVKLPITFCEPEPQAALSYTFSPMLPANVRDWCLAWGMASTDVKPNLVGQFSPQGKYQGLNLDCGSKGSCELTPSEAKYPDTLLALLRQDCTLKPTAETEDTNSNLDWKHGTVLVAEDNPINQSVITMQLRELGIEPVIVSNGLEAWQYVNRDHQVALVLTDFHMPEMDGFELVKKLKAAPELASIPVIGVTAEDSRLANERAKHIGIDDILYKPYDLEKLRSKLLPFLGQEKMAQWPEWIEKFRKQDAKEIANVFKSSMTHDVANLKAATSKQDKKRIIHGIKGALGAIGVTILAELCIEAEKVSDGEFEVHVTDLIERIEQEINLAEHWIRAHE >NZ_CP018681.2|WP_074051563.1|1019614_1023550_-|zinc-dependent-metalloprotease MFRKTIIVASIAAGLSGCGAEDRAYDTVERSAKEVTVQSLDTESLWMYMPSTGEAPRYALTQRGFFQGDPKLVKLRFDETNGIYVEALDRDKVDSLEPSRWDSEINRVPVLKIPGEFRQYRCAENGYGECTNKEEINRDENVDWSTATHFIPDYENIESLSEDTLSTWYTASNVAESADPRVISYEYNPEEGVINVEIERTYTANPDDQYKFGGLENLSFKTRFFYSLVKLDKLASPNYKPVYYQGQDSAYYGFFNDSKAVKTHTGESDVQGSRFAYINRFNPNLESIDYYLSDTYFDEGNEVYRQLTIDTMKEVNESLEGTGVPPIRIVNTKEKAGIKSGDLRYNVFNLIDEPVDNGLLGYGPSATNPLTGEIIHAHVNQYLGVIRTTTRRYWDDLAMRYNRQEIAKLAPKDDTATDSGDADSGTAPVVAERTDIDVFNDMITADRGPDQFVPQVTEDEMAFVGEGTFVNEVPKADLDHKFDVDNKELALQTFYKRQDMLKRFSEQNVYSEDAMWLSTNAKGLISGIDYADGGYFADAKQNTMKKWSELSLEQQKLASEAISEHMFKSTLIHELGHNLGLRHNFMGSVDKAHFYTADELANSSLGHQAKPAAYSSIMDYGASIFDELLVFGKYDKAALKFAYAREIETNEFIGESVRDKTGAQVRKTYSLAEYDRKMSEDYNTYPTGVIAHLRANVGTDDIPELASYRFCTDEHTTTSLTCDRFDEGTSLTEITEFRIQRYYDSYETANRRNGRDFFSQYHQYEYFWQRMYEFQKIRDIVENVGEIDFVFARYIGANTTNNKGQVFEEIAENNCLDWRGNPKPLDSMSAGLRPICDTYNAANLAADFFLDVLTAPDKVCELEELSGIDGVPNRYRFAKLSDLWLTYQTGMSLNRDVPTSCFDEELVQILKKQANEIVMRSETRDGHIWSSLKANNPYQTASNSVDLLGVWPDKLLAAQMLVRRDSPYIATENSSLALIDLSDKITKLYTHLSDLAGRPEARQAIFVDGNGDYVETVLRYQPSLTQTIEAVPSYLWPMKRYFVMGGQEAFEYWTEGSPQDSVVPYFGTLLANLMKYNRANEYGLGDSVRGLSDSIAINLANSATADPNGINFTWKGMNYSLNSRNSLANAVASRALYQDEQKARVEKLNGLPYRVRNALSVFKNTRDRNEARIIAIGDKDALVALRDTTGFARFTMFDKMFEEYEEDGNKCLRFKVDGESDEDHMKRKCNSLTRLQTAQNSAIGKFEGDELEKMFDVAQIFGDQIAENNTSSLNASHKEVYDYDPEELRLWSSNEYVTYRRAFEQFPLYSD >NZ_CP018681.2|WP_074051564.1|1023618_1024458_-|hypothetical-protein MNKTKVLPILFLATPSFVLANNLNDEWDESDKKSAWSGYVSADYSRNGYEDSAYLANRSASATGVLRYAVSDKSRLQLVVSGYHQQDGDVYGTRGQFWNDTSLSWAYNGLLNPTEGSSVSGEIRAIFPTSKSSKRNDLQFGTRLKLRWSAAFDEWLDGLTLSNTVLLRKNFHEYKTAGGYQLIEYRLSNQFSVDYAFADDFYFNIFLMPRQSWNYQGNSLDPTILHGEEIGYQMTESISMSVGMTNSVGYYNPDKGQNPLNDLIDLKKMTYYAVVNYQF >NZ_CP018681.2|WP_074051565.1|1024492_1025323_-|ABC-transporter-ATP-binding-protein MAELLRIEGLKQYYLTGKGVFKKGYVIKAVDDVSFSLEQGQTLGLVGESGCGKSTLGRTILKLYEPTEGKIFFEGKEITHLNSKQMRPLRREMQIVFQDPLESLNQRHTVGGILEEPYKIHGIGTPAERKRWVLELLDKIGLPHTAVNRYPHEFSGGQRQRIGIARAIALKPKLLICDESVSALDVSVQAQILNLLLKIQKEMNLSIIFISHDLSVVKHISDHVAVMKKGKVVEMGTAKEVYSSPKNEYTKELLSAIPITHPSQRIKNTSQVANNY >NZ_CP018681.2|WP_050558520.1|1025322_1026195_-|ABC-transporter-ATP-binding-protein MKQQTVLKVRDLEVEFSTDEGTVKILKGVSFDVRAGRTLGLVGESGSGKSVTSMSIMGLLPKPYGQITKGEIWYRDTDLTKMVERDLYQMRGNRISIIFQDPMTALNPVQTVGNQLVEVLDLHHSHMSKAEKVDYSIGLLEKVKIPMPELRFQEYPHNLSGGMRQRVMIAMALACKPDVLICDEPTTALDVTVQASILKLMKELQEETGMAMIFITHDLGVVAEICDDVAVMYDGRIVENADVFELFDYPKHPYTKRLLSLIPDSDSESKQAIQVVPIDLSLFPEYQGAS >NZ_CP018681.2|WP_050925466.1|1026208_1028032_-|ABC-transporter-substrate-binding-protein MYRFALISLLFSGLVQAGELPSDLQWQSNWQDPVFASDEAKRGGTLRSYMLSFPQTLRSVGPDANSGIRFYIMDGTPKLAQRHPNTGKWIPQLADEWAFSDDYKTAYFKLNPKVKWSDGEMVTADDYLFMLTYYRSKDIIDPWYNDFFRNTIASVEKFDDYTISITVSEPKSNDELMVLLNTPNHGLQPRPQHYFANINDQNGDGMDDNFVRKYNFKAEPTTSPYYISKINKGRSITFKHVGDNWWGYGNKYYQNRFNVDKLRIKVIRDSDIARKHFEKGKLDTFTMVLPQLWHEKTDSAPFAKGYIQKFWGFNQTSQGAGGLWMNTSMPLLSDINVRKGITFATDFDGMIKNVLRGDYSRKPHAMGYGHGDYDLPDAKAPAFEPELAIGYFEAAGFTNIGPDGIRVNKNGERLSFKITYGYNIWTPRIAYLKEQAKLAGLELNLNLVDGSAAFKYILEKKHQLAFLNMGGGELPAYKEYFHSSNANRVQTNNHTNYSSLELDREIEAFNSEFDADKKYQLSREIQKLISNAFVIVPGYMVPYTRDAHWRWVKYPNNPMTKKTGAMFNILGTSNFWIDTELKQQTQLAYKEGETFEPVTVIDERYRL >NZ_CP018681.2|WP_074051566.1|1028048_1029107_-|ABC-transporter-permease-subunit MSQLNLFTLNPLTRKQIKRFKEIKRGYWSLMLLSTLLILSLCAEVLINSKALVVRYQGEFFFPIFSDVYSGSNFGLDYAAETNYRDLRQNFEYENNGDFVLLPLVPWNAFEQDFSGSFPPNAPDFEAQHYLGTDVIGRDILARLVYGFRTAMGFALLTMSIAYVIGVTVGCAMGFFGGKFDLVVQRLIETWSMVPFLYVIMILVSIAQPTFMLFVLINVAFSWMGITWYMRTMTYKESAREYVLAAKALGASTARIIFHHILPNTMVMIVTLAPFTIAANITALTALDYLGLGLIPPTPSWGELLQQGKSNLDSPWIVVSVVTAIVSVLVMVTFIGEAIRTAFDPKKYTLYR >NZ_CP018681.2|WP_029789536.1|1029109_1030135_-|ABC-transporter-permease-subunit MVAYLLKRFALVVPTFLGITILIFAITRFVPGGPVERMLANMQPQGDGASVSAVVGQNSALSEEQLADLNKFYGLDKPVTEAYVDWLYRLVQFDLGESTRYYEPVTEMIFERLPVSAMYGGITFFISYFISIPLGYYKALKHGSVFDSVSSIMIFVGYALPGYVVGVLLITLFSYHLEWFPMGGFVDDDFDDFTTLSEQVTDILWHAVLPLICYLIGDFATLTMTMKNNLMENLSSDYIRTAIAKGLPFRQAIRKHALRNSLIPIASHFGNSLLFFMTGSFLIEVIFDINGIGLLGYESIVERDYPVVMGIVAINALVLLFGNILSDVCVALVDPRVKFGA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|