Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP045339 | Vibrio sp. THAF190c plasmid pTHAF190c_a, complete sequence | 0 crisprs | cas3,DEDDh,csa3 | 0 | 0 | 0 | 0 |
NZ_CP045341 | Vibrio sp. THAF190c plasmid pTHAF190c_c, complete sequence | 1 crisprs | DinG,cas6f,cas7f,cas4 | 0 | 1 | 0 | 0 |
NZ_CP045338 | Vibrio sp. THAF190c chromosome, complete genome | 0 crisprs | DEDDh,DinG,csa3,WYL,csx1,cas3 | 0 | 0 | 5 | 0 |
NZ_CP045340 | Vibrio sp. THAF190c plasmid pTHAF190c_b, complete sequence | 0 crisprs | csa3,PD-DExK | 0 | 0 | 0 | 0 |
NZ_CP045343 | Vibrio sp. THAF190c plasmid pTHAF190c_e, complete sequence | 1 crisprs | NA | 0 | 1 | 0 | 0 |
NZ_CP045342 | Vibrio sp. THAF190c plasmid pTHAF190c_d, complete sequence | 1 crisprs | NA | 0 | 1 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP045341_1 | 66154-66240 | Unclear |
I-F
Consensus repeat of NZ_CP045341_1
|
1 spacers
spacers of NZ_CP045341_1
>1.1|66181|33|NZ_CP045341|CRISPRCasFinder ATGATGATCGGAAATGCAGTACCGCCGGAAATT |
cas6f,cas7f |
CRISPR arrays and Neighbor proteins around NZ_CP045341_1
The CRISPR arrays of NZ_CP045341_1 >merge|NZ_CP045341|1|66154-66240|CRISPRCasFinder TTCTTAGCTGCCTATTCGGCAGGACACATGATGATCGGAAATGCAGTACCGCCGGAAATTTTCTTAGCTGCCTATTCGGCAGGACAC >NZ_CP045341|1|1|66154-66240|CRISPRCasFinder TTCTTAGCTGCCTATTCGGCAGGACAC ATGATGATCGGAAATGCAGTACCGCCGGAAATT TTCTTAGCTGCCTATTCGGCAGGACAC
>NZ_CP045341.1|WP_152471223.1|65242_65707_-|MerR-family-DNA-binding-protein MCIKPSEELVMLTTEIARVAGVTAETVRFYTRKGLLYAERDPNNGYKIYQQSAVNRLKFISHARAIGFSLSQIQEIIDYSEQGQTPCPAVRQMLTDKIVETKQKIEEYQRHLAMMEATYTEWDEKPDMEPNGNALCCLIEDWSEKHHQVLPEEN >NZ_CP045341.1|WP_152471222.1|62520_65241_-|heavy-metal-translocating-P-type-ATPase MNTTQYSLALSGINCGRCVAKIVDELRERDPDVSFTINETKHRAELITTLMPESAIEVIAKLGYVASIPTQQEYRTSVSNVTCQHCVGKITKAIVELDATAQVVVDLDKQTLVVNSSLNGDEIESVLVEQGYSGSTDGTQPQTTDVTKSNKVKTTQAVDVSIELSLIGVTCASCVNTIEGALRSVSGVDSVDVNFANRTATVVSSQPASVLIEAIQGAGYDAEEIVDQALASDLKEEREAEEYRFKVKQSVIGLGLGVPLMVYGLFGGPMNVNTQSEQLLWLAVGLVTFFILVRAGKHFFTGAWKTFLSRNANMDTLIALGTGTAWLYSMVVVVGPQWLPESARHLYFEATAMIIGLINLGQALELKARGRTSQAIKRLLDLRVKTAMVIRDGKEMSIPIEQVIARDLIRVRAGEKVPVDGVITEGETTIDESMLTGEPIPVVKQVGSSVSAGTINDHGSFVFEAQKVGSDTVLAQIISMVSNAQNSKPPISHLADKVSSVFVPTVMILSILTALAWYNFGPQPSLVYMIVAATSVLIIACPCALGLATPISTMIGVGKAAEFGGLIRNGEALQRASELDVVVLDKTGTITQGKPEVTNFVLLSEDSELLPLVNALEKGSNHPLAKALIAFTQSEVSDEANLLEVSDFESLTGLGVQAQYASQRVLLGNRKLMNQFKVNVESVTEHALQWEQEANTVVYFAIGDQLKALFGISDPIRDDARSAINRFHQQGIHVVMLTGDNHNTAKAVASLANVDEYHAQLMPEDKLHWIKQLQAKGHVVGMVGDGINDAPALAQSDVGFAIGSGTDVAIESADITLMRSSLHGISDVIGISTATMKNIKQNLWGAFIYNSLGIPVAAGLLFPFTGWLLSPIIAGAAMSLSSVTVVTNANRLRLYTPPSAPTTNEE >NZ_CP045341.1|WP_152471221.1|62052_62340_-|hypothetical-protein MKSKKPDCHKNGHHKGHGWMMILCALLMVGIPLFILTSSQEVFSWAYLSSAIVPLIICLAMHGLMMKMIMPSEKKDSETNDQSMAKIENNPEFKA >NZ_CP045341.1|WP_152471220.1|61807_62032_-|hypothetical-protein MKKLYVSVILSSLIAAPVFATSTSEQHLKSYTNQDVQTHMKQSKMSSQQESNSMMDGKSMENMLNDCRKMMDKV >NZ_CP045341.1|WP_152471219.1|61007_61388_-|DUF3316-domain-containing-protein MNTLNKILLSVTCLLVSSVSIAQPLHSSGNYVSRVDHKTVSIDAVSSKQRAYMLGAKKLSEFEGMSGRELSTTLIPVTTNGSRRNSTHLKDGGYVTVQERLNSQGQLEYVAKIHINVHYQERDSNN >NZ_CP045341.1|WP_152471218.1|60587_60923_-|hypothetical-protein MIDKVSVIFQEHRLEEIESILMDEGVRKFTIYPVQGRGTQANLIDTHVLSSYYKLDVFIGQNYTGRLTEILLNTVSVSQEGNGVTSVERNVLVYDNNTKQQISLSSYNHKP >NZ_CP045341.1|WP_152471217.1|59868_60552_-|heavy-metal-response-regulator-transcription-factor MKLLVVEDEEKAGAYLKKGLIESGYSVDLSKDGVDALYLATSGDYDLIILDIMLPKLNGWQILHTLRSSDISIPVIILSAKDEVEDRVKGLELGANDYLVKPFAFVELLARVKNVLRNQASTSREVSTLDIADLSFDLLKRKVTRQSDTIQLTAKEFSLLEYFFSKKGQVVTRTQIASAVWDMNFDSDTNVIDAAVKRLRSKIDKPYKQKLIHTVRGMGYKLDDAVE >NZ_CP045341.1|WP_152471216.1|58552_59872_-|heavy-metal-sensor-histidine-kinase MKKLLRQSIARKLVFMFVTASFVVLVVFALTIQHAIKNHFNEQDYKHLETKLLPLMGTIERDLDIFHSWDISAWILNNKQVVQTNNGTIDFPPELIEGSSYEWSVGNNHYQAFKFDYSEVEGGAIVLAIDVTHHKLFFDKLNTILFWTLVLTVSLSGAYALIIVRNGLLPLKTLKEYIEQVNTSNLSIRIPSESLPKELSSLIVSQNEMLERLQTGYLRLSEFSSDIAHELRTPLNNIMTQTQVALGTERSIEEYEDILVSNIEELERFNKTISDTLYLAKSENKLLHKTESELELKEVITPLLEYYEVLAEEKNVRFTLDGSSTLFGDKDMLQRAFGNVLSNALRHCFHNTDISVVISESETSNKVTISNTGEPIPLSSLPYIFERFYRSDKSRVHNGSVGAGLGLPIAKAIMKAHSGDILVNAKDERTEFVFIFNNS >NZ_CP045341.1|WP_152471215.1|58056_58446_-|hypothetical-protein MIISILAMLMSSYVSSMSMMDMSPISTSSSLQPKLSHHDSMGMTETTASHHAMIGCQPMEVAESESSTHCGSSSVFGGDCCKSVCSSVFFPLSSSRLINISEPDLALRHPIKIGVTVTRIQSLLRPPSV >NZ_CP045341.1|WP_152471214.1|56573_57965_-|TolC-family-protein MNIESNGSRVLRTLSLGVAIALYAMPALSHTKSNVTPQTIATSTHQLNALIEVALENDTSRSQYAAQSQAVREMGVASATLMDPKLKVGFGGLPVDSFKFDEDPMTNISVGLMQQFERGATLDLNQRKANQQADGIAYQVQVREREVANSITQLWLELGYLQHAETLIESNQRLMVEMEQYIQTNYSIGKSEAQDLLNTQLQVNKLDEKLQSNQQMQNRVVAQLSEWLGSEWLHRRAIDSNQVGYQLDWSRLNTLLDQQTSDTQYYSLLNQNPMVQMADANISANRTQVEVAEQSYTPQFGVEVMYAYRQADNMKGEPTSDLLSAYLTMDIPLFTDSRQDKNVAAAQYQVGAAQYQKDTLLAQMNSQVVALTTDRENLAERIERYQSRFIPQSKARIEAVERGYQNNTAPFGDVIIASTDDLALQMELARLITDMNQINSKLSMLLGGFAYQISAPNTQYKEQ >NZ_CP045341.1|WP_152471224.1|66355_66973_-|type-I-F-CRISPR-associated-endoribonuclease-Cas6/Csy4 MTKRYYFLIRYMPAQADHELLAGRCISQVHLFMVNNRQAMNRIGVSFPDWNESTVGQTIAFVAEDKEMMVGLSFQPYFSVMVNEGLFEISSVCEVPANALEVRFTRNQTIGKSFLGSKKRRIKRSMARVELSGAGTSLPATNEERVIESFHRIPVSSGSSAQDYILFVQKEFVDERVAANFNSYGLATNQERKGTVPELRFLPFF >NZ_CP045341.1|WP_152471225.1|66982_68014_-|type-I-F-CRISPR-associated-protein-Csy3 MELCTQLNYVRSLSAGKAYFYYLSKSGEMCPLDIDRTRLRAPKGGYSEAYKGGKFVKKNVAPQDLAYSNPQFIEECYVKPGVDDIYCAFSLRIRANSLTPDTCSDDEARSKLSLLAKTYQELNGYQELALRYAKNILLGTWLWRNRECRKLSIEVATSDSQTLIVENATRLSWYGHWDKASAECLEKLTAYLMRALSDPTEYFYMDVKAKIGVGWGDEVYPSQEFLDDREDGVPTKQLATVELLNGKETVAFHGQKIGAALQSIDDWWHEDADKPLRVNEYGADREYVIARRHVSYGNDFYQLVRNTENWIETMIASQTIPNDVHFIMSVLIKGGLFNCSKAK >NZ_CP045341.1|WP_152471226.1|70110_71322_-|TniQ-family-protein MKTDIQFYLDESLESFLLRLSQEQGYERFSHFAEDIWFDTMDQHKAIAGAFPLELNRVNIYHAQTTSQMRVRVLIHLENQLKLNNFGVLRLALSHSKAQFSPQYKAVHRFGADYPYAFLRKCFTPICPLCIDEAPYIRQQWQFISHQACEHHGCKLVHHCPECKSRLEYQSTESISQCECGYELRNSPVEDAQEVEVLVAQWLSGSDSKLLGLLTGKMTLSERYGFLLWYVNRYGDIDDPSFESFIEYCCAWPAALWQDLDVLKEKAELVRVKDWKKVFFNEAFGALLKDCRQLPSRQLSHNIVLTQVLAYFTQLMAAVPSSVKGNIGDVLLSPLEASTLLSCTTDEVYRLYEFGEIKAVIRPRMHTKIASHESAFTLRSVIETKLTRMSSESDGLNVYLPEW >NZ_CP045341.1|WP_152471227.1|71572_74053_+|PAS-domain-containing-protein MVKLSSVSPKVLLFLFILLTPLTFGGISLYVLSNHSQSTAKQLQKGNSSFEIESAKIFMEKYLEVAEDRIVLASQHQGVIDAASQSNIERLSYHLERFKGQNQSMQFAVRDRTGKLLFEDTFRHPAAKQEQAIFSAIANQNVHERILYLLHDDTNHICIKLFMPLYQSGEFTGVIYGETAVDYDDFFGSFVTNRERWYELSQTEPPVASLVETNRKLEHNHHSFSSHASKYNEKDWLLTQTPLARGNLVLTQGISRSFLKEQLSTLKRELLNGLLIVLITSSILVFLLGQKLFVNPHKELEGSKKLLEESNKLLAEREHDSHLLATVVKAARDAVVITDKEGRIEWVNNAFEVMTGHQLDSVKGLTPGSFLQGEDTCNETASRIGSALKKGKQVKAEILNYTIDKVPYWVDIDIVPLRNEDGEIAHFIAIERDVTDFKKLELELEEQASKAHQANQAKSLFLATMSHEIRTPMNGLLGLLQMLVDELEQEQQREVLRLALGSGEHLIAILNDILDLAKIENDSLEIDQHAFKMSEIISPVLNTYQTLCSEKGLKFNLTDHCDPSLTYYGDSVRIRQVLLNLAGNAMKFTESGSIDIDIKQLSHSKMEFAVQDSGIGIPNDRLQSIFNEFEQADISTTRNYGGTGLGLAICHKLVTLMNGQISVESELSKGSRFSFVIDLPTIAHSDEKNTESQVVDLSQFKALVADDNKMNRIIAKGFLDKLNVECVTCGDGFEALSHLESGNFNLVIIDNHMPKISGVETVRRIREAVKGNIVVMGWTADIMQTSTHAFLEAGADEVLTKPLIKKDLIEALSRHINQVKAIDKAS >NZ_CP045341.1|WP_152471228.1|74213_75593_-|DEAD/DEAH-box-helicase-family-protein MLRTWQAECAAQVIDKFTSNKRSHFFCQATPGAGKTVMAAEVAKRLFQEGMIDLVLCFSPSLSVAEGMRKTFSWKLECSFNGGLGSLGGSYTYQSIRFLDENFWNTVSKYRVLVVFDEIHHCSFDDEGRSNSWGLEIVSKIQGFAHYTLALSGTPWRSDRLPIVMAEYSGPDGKVVCDYQYGLQQAVEDKVCHRPKIVLIDNEHLSVKAGSDNQHFASILDCLKQSDVSYQSVIHNEDAMNYILNNGCQKLAQIRQESPNAGGLVVAASIKHAKHIQKRLIEHFNQSACIVTYHHDNPLNEIESFRHSNVQWIVSVGMISEGTDIPRLQVCCHMSSVKTELYFRQVLGRILRVNDSPNQEAWLYTFAEESLIGFAERIEQDVPEVCMLVESNLHFVTYEHSELKSDHQSKELRGLVSTKVDNTVAWGDRLDFSLDPLSSLNRELCFGSFKKRVLSAFLN >NZ_CP045341.1|WP_152471229.1|75586_76132_-|hypothetical-protein MKKVTRINPHMFNLLIEKGMDNFSVIEARDALLNGTSTFTSSDEARKYVYKQLLSLEEKGWLSAIGARRGKRYHQTNEFKALTIEPRAARNKKANVDSITVQTPEFSLNALEQEKKQHEGELAITLGEIEEYQSLLTRFPNNKHDMEPLFNAARERSAKLLGKINALTNWMQVAQSKASQC >NZ_CP045341.1|WP_152471230.1|76233_76548_+|helix-turn-helix-domain-containing-protein MQKDNPIPARLKAARKKAKITQKDLGVKIGMEPSSASGRMNHYEKGRHVPDIGTLERMAEELNVPLNYFFCRNELSAELACAIDKMSDEEKKALLENLNGESNL >NZ_CP045341.1|WP_172974337.1|76544_77639_-|TniB-family-NTP-binding-protein MMISKILRGINLNLTSSQFEQLRSFETCFIEYPAITEIYSIFDQLRFNHSLGGEPESFLLTGEAGSGKTALINNYLSRFPSSSTWSKQPVLSTRVPSRINEQSTLTQFLVDLNGKSGGRGTRRLNEIALGEAVVTQLKRKSVELIIVNEIQELVEFSTAEERQAIANTFKYISEEAKVSFVLVGMPYADVIATEPQWNSRLSWRRKIEYFKILKANSYSSGTASYAFDLEQKKHFAKFVAGLSARMSFDEPPVLTKNEVLYPLFVMCKGECRALKHFLKDALLMSFNDNVDTIDKAILSRTFSFKFPYLDNPFDAPVEELRLHQIDSGSAYNLNAIAAEDKILAPRFTDAIPLSMLLSKSGLKA >NZ_CP045341.1|WP_152471232.1|77607_79473_-|DDE-type-integrase/transposase/recombinase MSSDSNNIFGFFDEFEASEGESQPLPIELIQEPIEISSTIDSQSPKVQKEVIRRIKIIDFVEKRLKGGWTEKNLNPILSLIEPELQLTPPSWRTLATWKKYYFEAGKDPCALIPKHTFKGNRQKEMDSQSLIDEAIQNVYLTRERLSVAEAYRYYKSRVIQINRGIVEGKIKPISERSFYNRIHGLPPYEVAIARFGKRYADREFRSVGQQVAATKPMEYVEIDHTPVPVILIDDELDIPLGRPYLTMLYDRFSKCIVGLSVNFRDPSFDSVRKALLNTLIDKSWLKNSYPSIKNDWPCHGKIDYLVVDNGAEFWTDSLEDSLRPFVTDIQYSQTAKPWRKSGIEKLFDQMNKGLVNSLPGKTFTNPTQLEDYNPKKDAVVRVSVFLELLHKWIVDYYHMAPDSRERDIPYHKWKQSEWTPSYYSGAEKEQLRVELGLLRHRTIGVSGIRLHNLRYQSSELIEYRKYWASNNGKKLYVKTKTDPSDISSIHVYLESEKKYIKVPAVDNTGYTSGLSLFEHERIQRVRRLNIRQLADDESLADTFLYMKNRIQEETDRFRNAKSNKASLPKTGNTSKLAKFNDVGSEGPSSINTTPDKQDTTSVFDAPEQLFDDDFEDIEGY >NZ_CP045341.1|WP_152471233.1|79475_80099_-|TnsA-endonuclease-N-terminal-domain-containing-protein MFDQTKKSSHVHNICKFMSLKNDAVVRTLSILEFDFCFHLEYNADIKSFTSQPFGFHYQFNNRKCRYTPDFLATDHDDHSTFFEVKHSSQILKPDFRDRFEEKQRVAFSEFNRRLVLVTEKQIRMGPTLDNFKLLHRYSGLRTVTEFQKLVLAFIQRKQMVKLQEVSLYFGLSEQDTLISTLPWISSGQVKTDLNNIGFGLETYVWC |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP045341_1 | 1.1|66181|33|NZ_CP045341|CRISPRCasFinder | 66181-66213 | 33 | NZ_CP045341 | Vibrio sp. THAF190c plasmid pTHAF190c_c, complete sequence | 66181-66213 | 0 | 1.0 |
NZ_CP045341_1 | 1.1|66181|33|NZ_CP045341|CRISPRCasFinder | 66181-66213 | 33 | NZ_CP045122 | Rubrobacter sp. SCSIO 52915 plasmid unnamed1, complete sequence | 239164-239196 | 10 | 0.697 |
1. spacer 1.1|66181|33|NZ_CP045341|CRISPRCasFinder matches to NZ_CP045341 (Vibrio sp. THAF190c plasmid pTHAF190c_c, complete sequence) position: , mismatch: 0, identity: 1.0
atgatgatcggaaatgcagtaccgccggaaatt CRISPR spacer atgatgatcggaaatgcagtaccgccggaaatt Protospacer *********************************
2. spacer 1.1|66181|33|NZ_CP045341|CRISPRCasFinder matches to NZ_CP045122 (Rubrobacter sp. SCSIO 52915 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.697
atgatgatcggaaatgcagtaccgccggaaatt CRISPR spacer cggatgatcggaaatgccgtcccgccccttctc Protospacer *************** ** ***** *.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
642391 : 649619
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP045338|642391:649619|DBSCAN-SWA AATGAAACTGCCTATTTACTTTGACTATTCAGCTACTTGCCCAGTTGATTCTCGTGTTGCTGAAAAAATGGTTCAGTGTATGACGATGGACGGTAACTTCGGTAACCCAGCATCTCGTTCACACCGTTACGGCTGGCAAGCAGAAGAGTCAGTAGATAATGCTCGTGAGCAAATTGCCGATCTCCTGAATGCAGATCCACGTGAAATCGTATTCACGTCTGGTGCAACAGAATCTGACAACCTTGCTATTAAAGGTGCTGCGCACTTTTACGAGAAGAAAGGTAAGCACGTAATCACGTGCAAAACAGAACACAAAGCGGTTCTTGATCCATGTCGCCAACTTGAGCGTGAAGGCTACGAGGTAACTTACCTAGAGCCAGAATCAAACGGCATCATCGATCTAGACAAGCTACAAGCTGCAATGCGTGAAGATACAGTGCTTGTTTCTATCATGCACGTAAACAACGAAATCGGCGTTATCCAAGATATCAATGCAATCGGCGAACTATGTCGTTCACGCAAGATCATCTTCCACGTTGATGCGGCTCAGTCTGCGGGTAAAATTCCACTAGACGTACAAGAGACTAAAGTTGACCTAATCTCACTTTCAGCTCACAAAATGTACGGCCCTAAAGGTATCGGTGCACTTTACGTTCGTCGTAAGCCTCGTATTCGTCTTGAAGCTCAAATGCACGGTGGTGGTCACGAGCGTGGTTTCCGCTCTGGTACACTAGCGACACACCAAATCGTTGGTATGGGTGAAGCGTGTGCTATCGCTAAACAAGATATGCAGAAAGATTACGACCACGCACTAGCGTTACGTGAGCGTCTACTGAAAGGTGTTCAAGACCTTGAAGCAGTAACGGTAAACGGTGACCTAGACCAGCGTGTTCCGCACAACCTAAACGTGAGCTTTGCTTTCGTTGAAGGTGAGTCTCTGCTTATGTCTCTAAAAGACCTAGCTGTATCATCTGGTTCTGCATGTACATCTGCAAGCCTAGAGCCATCTTACGTTCTACGTGCTCTTGGTCTGAACGATGAACTGGCACACAGCTCAGTACGTTTCTCATTCGGCCGTTTCACAACGGAAGAAGAAGTTGACTACGCAATCGCACAGATTCGTGTAGCAGTAAACAAATTACGCGACATGTCTCCTCTATGGGATATGTATAAAGAAGGGATTGATTTGGACACTGTTGAGTGGGCACATCACTAATCTCACGGATATAGAGGATACGAGGTAACTATCATGGCATATAGCGAAAAAGTAATTGATCACTACGAAAACCCACGTAACGTAGGTTCGTTTGATAAAGAAGATCCAAGTGTAGGTAGCGGCATGGTTGGCGCACCAGCATGTGGTGATGTAATGAAACTGCAAATCAAGGTAACACCAGAAGGTATTATCGAAGATGCAAAATTCAAGACATACGGTTGCGGTAGTGCAATCGCATCTAGCTCACTAGTTACTGAGTGGGTTAAAGGCAAGAGCATCGATGAAGCAGCAGCAATCAAAAACTCTGAAATTGCAGAAGAGCTAGAGCTTCCACCAGTGAAAGTTCACTGTTCGATTCTTGCTGAAGATGCAATTAAAGCAGCCGTTGCGGATTACAAAAAGAAACGTTAATCGTACCCCGTAGCGAATCGTTGCAAAAAAATAGTAATATTAGGGAGTGCGTTTTGCACTCCCACTGAGTTCTCAAATTATTAAAACAAGGTTGTAGTATGGCCATCACCATGACAGAGACGGCGGCAAGCCGAGTTCAAGCATTCCTAGATAACCGAGGTAAAGGTATCGGGTTACGTTTGGGCGTAAAAACCACTGGCTGTTCAGGTATGGCATATGTACTTGAGTTCGTTGACGAACTTAATGAAGAAGATCAAGTATTTGAGCACTCAGGTGTTAAGGTCATCATTGATCCTAAGAGCTTGGTTTACCTAGATGGCACTGAGCTTGATTATGTAAAAGAAGGCCTAAACGAAGGTTTTGAATTCAACAACCCAAATGCGAAAGGCGAGTGTGGTTGTGGTGAGAGCTTTAATGTATAAGTGCTTCAATGGCTAAGCATCTATTGTAAGCGCATCGATTGTAAGCAACGTACAATTAGGCTCTGCATCAGGGCCTAAAATTAGGACCACCTTTACATGAATCATTTCGAATTATTTGGGCTACCACTTCAGTTTCAACTGGATGGTAGCCTTCTTGCTTCTCAGTTTAGAGATCTGCAGCGTCAATTTCACCCAGACAACTTTGCTACTGCCTCCGAGCGTGATCGCTTGCTAGCGGTGCAAAAAGCTTCTCAAATCAATGACGCTTATCAGGTACTTAAGAACCCAATTAGTCGCGCAGAGCACCTATTAGTGCAGAATGGCGTTGATATTCGTGGTGAACAGCAAACCATGCAAGATCCCATGTTCCTTATGGAGCAGATGGAACTGCGTGAAGAATTAGAGCATATCGCAGCAAGCCCAGATGGTGCTGACGAGTTGTTCGACTTCGATACTAAAGTCGGCAAAATGTACAAACAACACCTTGCGCAGATCGAGCAACAGCTGGATCAAGCCGCATGGGAAGAAGCCGCCGACAGTGTTCGTAAACTGAAATTCATCGCAAAACTCAAGAAAGAAATAGAACAAGTCGAAGATCACCTGCTCGACTAGCGACTAGATAAAAGGAATCCTCATGGCATTACTTCAAATTGCAGAACCAGGACAAAGCTCAGCACCGCATGAGCATAAGCTGGCTGTGGGTATCGACCTAGGTACCACCAACTCTCTTGTTGCCTCGGTTCGCAGTGGTGAAGCGACGACTTTAGCGGATGCTCAAGGCCGCTCAATTTTGCCTTCTGTGGTTCACTATCAAGCTGATTCTCACACTACTGGCCAAGAGGCTCGTGATTGTGCACAGACTGATCCACAGAACACCATCATTTCAGTGAAACGTCTTATTGGTCGTTCATTGGCTGATATTCAGCAGCGTTACCCATCACTGCCTTATCAATTTGAAGAGAGCGATAATGGCTTACCAGTTATTCGTACTGAACAAGGCACTAAGAACCCAATTCAAGTATCGGCTGATATCTTGCGTACTTTGGGTCAACGTGCCGAGTCTACGCTAGGCGGTGAACTTGCAGGTGCGGTAATTACTGTTCCAGCATACTTCGATGATGCGCAGCGTGCTGGTACAAAAGATGCGGCGCAACTGGCAGGTCTTCATGTTCTTCGCCTTTTGAATGAGCCGACGGCTGCTGCGATTGCTTATGGCCTAGATTCAGGTCAAGAAGGCGTGATTGCGGTATACGATTTAGGTGGTGGTACGTTTGATATCTCTATCCTGCGTCTATCAAAAGGCGTATTTGAAGTATTAGCAACAGGTGGTGACTCTGCGCTGGGTGGCGACGATTTTGACCATTTAGTCGCTGAGCACTTCCAACAGCAAGCCGGTTTGATTGAGCTGACTGCTGAACAAAGTCGTGTTCTGCTAGATGCTGCGACGGATGCAAAAATCCGCCTTTCTGTAGAAGAGCAAGTTGACGTCGAAGTTCTTGGCTGGAATGGAACCCTGACTCGTGATGAGTTTAACGACATTATCAAGCCACTGGTTAAGAAAACTCTGATGTCTTGTCGTCGTGCATTGAAAGATGCTGATGTAGACGCGGATGAAGTGCTTGAAGTTGTTATGGTAGGTGGTTCAACTCGTACTTTGTTTGTTCGTGAAATGGTCGGTGACTTCTTTGGTCGCACACCGCTGACGAGCATTAACCCTGATGAAGTGGTTGCGATTGGTGCTTCAATTCAAGCGGATATCCTAGCGGGCAACAAGCCAGATGCTGAAATGCTGCTATTGGACGTAATCCCGCTTTCACTGGGCATCGAAACCATGGGTGGTCTAGTAGAAAAGATCATCCCTCGTAACACCACTATTCCTGTTGCTCGTGCGCAAGAGTTTACGACCTTTAAAGATGGTCAAACGGCAATGACGGTTCATACTGTTCAAGGTGAACGTGAGATGGTTGATGACTGTCGTTCATTAGCTCGTTTTGCTCTTAAAGGTATTCCGCCTATGGCTGCGGGTGCAGCGCATATTCGCGTAACTTACCAAGTGGATGCAGACGGTCTTTTGTCTGTGACAGCAATGGAAAAGAGCACGGGCGTTCAAGCGGAAATCCAAGTTAAGCCATCATACGGTCTAAGCGATGATGAAGTGGCTAACATGCTCAAAGACTCAATGACATACGCAAAAGAAGACATGCAAGCTCGTGCACTAGCTGAGCAGCGTGTAGAAGCGGATCGTGTGATTGAAGGTCTAATTGCAGCAATGCAGGCTGATGGCGACGACCTGCTTAACGAACAAGAGAAACAACAGCTACTACAAGCGATTGAGTCTTTGATCGAAAAGCGTAACGGTGATGACGTTGATGCCATTGAGCTAGAGATTAAGAATACAGACAAAGCGAGCCAAGACTTTGCTTCTCGTCGTATGGATAAATCAATTCGTGCTGCACTGTCAGGTCAGTCAGTCGATAATATTTAATAGTTAGAGAATAGATAAACATGCCAAAGATTATTGTTTTACCTCACGAAGATCTATGTCCAGAAGGCGCAGTGCTAGAAGCAGAAACTGGCGAGACGGTACTTGATGTTGCACTAAAGGCAGGCATTGGTATTGAGCATGCATGTGAAAAGTCGTGTGCATGTACAACGTGTCACGTTGTGATTCGTGAAGGTTTCGATTCACTAGAAGAGAGTGACGATCTTGAAGATGACATGCTAGATAAAGCATGGGGTCTAGAGATGGAATCTCGTCTAGGTTGTCAGGCAAAAGTTGCGGACGAAGATCTCGTTGTTGAGATTCCAAAATACACGCTAAACCTAGCTTCTGAAGATCACTAAAACTCAAGCCAAATGGCGAAGTTTTTAGTAAAACTGGCTTAGAATAATAGTGTATTAGAGGTGCATGAGTGCCTTTAACTAATAAGGAAATGATATGAGCTTAAAGTGGATTGATTCGCGAGATATCGCGATTGAACTATGCGACTTATACCCTGATACGGATCCAAAAACGGTACGTTTTACCGATCTGCACCAATGGATATTAGATCTTGAAGATTTCGATGATGAACCAAATCACTCGAATGAGAAGATCCTTGAGGCCATCATTCTATGCTGGATGGATGAGATGGACTAACGTCCCACGCTCATTTTTAACTTGGTGATAGAGATAAGCTCTCTGAAAACACAGCCAAGTTAACAATTCTTACTAAAAAAAGCGGACCTTATGGTCCGTTTTTTTATCAAAATGACATTTTACTCCTACATAATGCTAACATCAGTTCGCTAATAAAAAATAATCTGAATCACTAAGGTAACGTGGTTCAAAGACAAGGAGAAATCATGTCTACACAGATGTCAGTATTTTTAAGTCAAGAGCCTGCCCAGCCTCAGTGGGGAGACAAGGCTATCCTTTCGTTTTCAGAAGCAGGTGCGACAGTTCACCTAGCAGGAGGCCACGATCTAGGCGCAGTTCAACGTGCTGGTCGTAAACTTGACGGACAAGGTATTTCGTTCATTTCACTGCAAGGCGAAGGTTGGGATCTGGAGAGCGTTTGGGCTTTCCACCAAGGTTATCGCGGTCCTAAAAAGCAAACCACGCTAGAGTGGGGCGAGCTACCAGAAGCGGACCAAGCTGAGCTAGAAGCTCGTATTCGCGCGACAGACTGGACTCGTGACATCATCAACAAAACGGCAGAAGAAGTTGCACCGCGTCAGCTGGCAACGATGGCTGCTGAATACATCAAATCTATTGCGCCAGAAGGCACTGTAAAAGCGAAGATCGTTAAAGACAAAGACTTACTTACTGACGGTTGGGAAGGCATCTACGCTGTCGGTCGCGGTTCTGAGCGCACGTCAGCAATGCTACAACTAGACTTCAACCCAACGGGTGATGAAAACGCGCCAGTATTTGCGTGTCTAGTTGGTAAGGGTATTACTTTCGATTCGGGCGGTTACAGCATCAAACCAGGCCAATTCATGACGGCAATGAAAGCAGACATGGGTGGTGCAGCTACAATTACCGGTGGCCTAGGTCTAGCAATCGAGCGTGGCTTGAATAAGCGTATCAAGCTTATCCTATGTTGTGCAGAGAACATGATCTCGGGTCGTGCTCTTAAGCTTGGCGATATCATTACCTACAAAAATGGCACCACTGTAGAGATCATGAATACTGACGCGGAAGGTCGTTTGGTACTCGCTGATGGTCTGATGTACGCAAGTGAGCAAAATCCAGAGCTTATCATCGACTGTGCAACGCTGACTGGTGCGGCGAAGAATGCTCTAGGCAACGATTACCACGCTTTAATGAGCTTTGATGATGAGCTTGCTCACCAAGCGCTAACTGCAGCAAATCAAGAGAAAGAAGGTCTATGGCCACTTCCATTGGCTGACTTCCACCGCGGCATGCTGCCATCAAACTTTGCTGATCTATCTAACATCAGCACTGGTGATTACACTCCGGGTGCAAGTACTGCAGCGGCATTCTTGTCTTACTTTGTTGATGACTACAAGAAAGGCTGGCTACACATGGACTGCGCGGGTACATACCGTAAATCACCAAGTGATAAGTGGGCAGCAGGCGCGACAGGCATGGGTGTTCGCACTCTGGCTCGTCTACTTATCGAGCAAGCAAAATAATTATAAAATCAGGCAGCTATCGGCTGCCTGATTGTTACCTAAACTCAATCAATGAGGGCAGGTGTCGTTTTAAATAACAACAAAAATGAAGTGAAGGAACCCCTATGGCTCTAGAAAGAACGTTCTCAATTGTTAAGCCAGATGCTGTTAAACGTAACTTGATCGGCGAAATCTACCACCGCATTGAAAAAGCGGGCCTACAAATCATTGCAGCTAAAATGGTGCATTTAACAGAAGAGCAAGCGAGCGGTTTTTACGCGGAACACGAAGGCAAAGAATTCTTCCCAGCTCTTAAAGAGTTTATGACGTCTGGTCCTATCATGGTTCAGGTACTAGAAGGCGAAGATGCAATTTGTCGTTACCGCGAACTAATGGGTAAAACAAACCCAGAAGAAGCGGCGTGCGGTACTATCCGTGCAGACTACGCAATCAGCATGCGTTACAACTCAGTACACGGCAGCGACAGCCCAGAGTCGGCAGCGCGCGAGATTGAGTTCTTCTTCCCTGAATCTGAAATTTGCCCACGTCCTGCTGAATAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP045338|642391:649619|642391_643606_+|WP_032551302.1|DBSCAN-SWA MKLPIYFDYSATCPVDSRVAEKMVQCMTMDGNFGNPASRSHRYGWQAEESVDNAREQIADLLNADPREIVFTSGATESDNLAIKGAAHFYEKKGKHVITCKTEHKAVLDPCRQLEREGYEVTYLEPESNGIIDLDKLQAAMREDTVLVSIMHVNNEIGVIQDINAIGELCRSRKIIFHVDAAQSAGKIPLDVQETKVDLISLSAHKMYGPKGIGALYVRRKPRIRLEAQMHGGGHERGFRSGTLATHQIVGMGEACAIAKQDMQKDYDHALALRERLLKGVQDLEAVTVNGDLDQRVPHNLNVSFAFVEGESLLMSLKDLAVSSGSACTSASLEPSYVLRALGLNDELAHSSVRFSFGRFTTEEEVDYAIAQIRVAVNKLRDMSPLWDMYKEGIDLDTVEWAHH >NZ_CP045338|642391:649619|647784_649080_+|WP_152468205.1|DBSCAN-SWA MSTQMSVFLSQEPAQPQWGDKAILSFSEAGATVHLAGGHDLGAVQRAGRKLDGQGISFISLQGEGWDLESVWAFHQGYRGPKKQTTLEWGELPEADQAELEARIRATDWTRDIINKTAEEVAPRQLATMAAEYIKSIAPEGTVKAKIVKDKDLLTDGWEGIYAVGRGSERTSAMLQLDFNPTGDENAPVFACLVGKGITFDSGGYSIKPGQFMTAMKADMGGAATITGGLGLAIERGLNKRIKLILCCAENMISGRALKLGDIITYKNGTTVEIMNTDAEGRLVLADGLMYASEQNPELIIDCATLTGAAKNALGNDYHALMSFDDELAHQALTAANQEKEGLWPLPLADFHRGMLPSNFADLSNISTGDYTPGASTAAAFLSYFVDDYKKGWLHMDCAGTYRKSPSDKWAAGATGMGVRTLARLLIEQAK >NZ_CP045338|642391:649619|646944_647283_+|WP_032551309.1|DBSCAN-SWA MPKIIVLPHEDLCPEGAVLEAETGETVLDVALKAGIGIEHACEKSCACTTCHVVIREGFDSLEESDDLEDDMLDKAWGLEMESRLGCQAKVADEDLVVEIPKYTLNLASEDH >NZ_CP045338|642391:649619|644535_645051_+|WP_032551306.1|DBSCAN-SWA MNHFELFGLPLQFQLDGSLLASQFRDLQRQFHPDNFATASERDRLLAVQKASQINDAYQVLKNPISRAEHLLVQNGVDIRGEQQTMQDPMFLMEQMELREELEHIAASPDGADELFDFDTKVGKMYKQHLAQIEQQLDQAAWEEAADSVRKLKFIAKLKKEIEQVEDHLLD >NZ_CP045338|642391:649619|643639_644017_+|WP_004740340.1|DBSCAN-SWA MAYSEKVIDHYENPRNVGSFDKEDPSVGSGMVGAPACGDVMKLQIKVTPEGIIEDAKFKTYGCGSAIASSSLVTEWVKGKSIDEAAAIKNSEIAEELELPPVKVHCSILAEDAIKAAVADYKKKR >NZ_CP045338|642391:649619|649184_649619_+|WP_152468206.1|DBSCAN-SWA MALERTFSIVKPDAVKRNLIGEIYHRIEKAGLQIIAAKMVHLTEEQASGFYAEHEGKEFFPALKEFMTSGPIMVQVLEGEDAICRYRELMGKTNPEEAACGTIRADYAISMRYNSVHGSDSPESAAREIEFFFPESEICPRPAE >NZ_CP045338|642391:649619|645073_646924_+|WP_152468204.1|DBSCAN-SWA MALLQIAEPGQSSAPHEHKLAVGIDLGTTNSLVASVRSGEATTLADAQGRSILPSVVHYQADSHTTGQEARDCAQTDPQNTIISVKRLIGRSLADIQQRYPSLPYQFEESDNGLPVIRTEQGTKNPIQVSADILRTLGQRAESTLGGELAGAVITVPAYFDDAQRAGTKDAAQLAGLHVLRLLNEPTAAAIAYGLDSGQEGVIAVYDLGGGTFDISILRLSKGVFEVLATGGDSALGGDDFDHLVAEHFQQQAGLIELTAEQSRVLLDAATDAKIRLSVEEQVDVEVLGWNGTLTRDEFNDIIKPLVKKTLMSCRRALKDADVDADEVLEVVMVGGSTRTLFVREMVGDFFGRTPLTSINPDEVVAIGASIQADILAGNKPDAEMLLLDVIPLSLGIETMGGLVEKIIPRNTTIPVARAQEFTTFKDGQTAMTVHTVQGEREMVDDCRSLARFALKGIPPMAAGAAHIRVTYQVDADGLLSVTAMEKSTGVQAEIQVKPSYGLSDDEVANMLKDSMTYAKEDMQARALAEQRVEADRVIEGLIAAMQADGDDLLNEQEKQQLLQAIESLIEKRNGDDVDAIELEIKNTDKASQDFASRRMDKSIRAALSGQSVDNI >NZ_CP045338|642391:649619|647377_647578_+|WP_032551310.1|DBSCAN-SWA MSLKWIDSRDIAIELCDLYPDTDPKTVRFTDLHQWILDLEDFDDEPNHSNEKILEAIILCWMDEMD >NZ_CP045338|642391:649619|644115_644439_+|WP_032551305.1|DBSCAN-SWA MAITMTETAASRVQAFLDNRGKGIGLRLGVKTTGCSGMAYVLEFVDELNEEDQVFEHSGVKVIIDPKSLVYLDGTELDYVKEGLNEGFEFNNPNAKGECGCGESFNV |
9 | Faustovirus(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1339942 : 1349724
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP045338|1339942:1349724|DBSCAN-SWA CTTATTCTTCATCAATGATCCACCCATAACCTTTCTCCTCTAACTCGTGAGCTAACGTATGGTCGCCAGATTTTACGATTTTTCCCTTGTACAACACATGGACGAAATCGGGCTTGATGTAGTTAAGGATGCGTTGATAGTGCGTAACCAGAATAAAGGCACGTTGCTTATTCTGTAATGCGTTCACGCCTTCTGATACCGCCTTGAGCGCATCAATATCTAGACCTGAATCGGTCTCGTCGAGTATGCAAAGATCGGGAGAAAGTACTGCCATTTGCAAGATATCATTGCGCTTTTTCTCACCACCTGAGAATCCCTCATTAACGGAACGTTGCAAGAGGTGGGGTGGCATCTTCAATAATTGAGCTTTTTCTTCGAGTAGGTCTTCAAAGTCAAATCGATCGATAGGTTCTAAACCACGATATTCGCGAATCTCGTTGAGGGCAGTATTGAGAAACAGCTTGTTGCTCACACCAGGAATTTCGACGGGGTATTGGAAAGCCAAAAATACGCCTTCACCCGCTCTCTCTTCTGGAGCGAGCTCAGTGAGATCTTGGCCTTTGAAAATAATCTCGCCGCTGTCGACTTCGTAATCGTCTTTTCCTGCTAGTGTCGCCGAGAGTGTACTTTTTCCTGAACCATTTGGGCCCATGATGGCATGAACTTCACCGGGTTTAACGGTTAGGTTAATGCCCTTTAGAATAGCGTTTTCTTCAACGCTAACGTGCAAATCTTTAATTTCTAACATGTTTATCCCACACTGTGCTCGAGGCTGATTGATAAAAGCTTCTGGGCTTCAACAGCGAATTCGAGTGGTAATTCTGAAAATACGTCTTTACAAAAGCCATTCACAATCATTGAAATAGCGTTGTCTTCACTGATGCCTCGTTGGACACAGTAAAAGAGTTGGTCTTCGCCGATTCGAGATGTTGTGGCTTCATGTTCAACATGAGCGGATGGATTTTTGACTTCGATATATGGGAAGGTGTGCGCGCCGCATTGATCGCCAATTAACATCGAATCACACTGAGTAAAGTTCCTCGCACCTTCAGCGTTAGGAGTCACTTTAACCAGCCCTCGGTAGCTGTTTTGGCTTTTGCCAGCAGAGATCCCTTTTGAGATGATCGTAGAGCGCGTATGTTTCCCGATATGGACCATCTTGGTTCCGGTATCCGCCTGTTGAACTCCATTGGTTAAAGCAACGGAGAAAAACTCGCCGACTGAGTAATCGCCCTTTAAAATGACGCTCGGGTATTTCCAAGTGATTGCAGAGCCTGTTTCAGATTGAGTCCAAGACATCTTGCTGTGGTCACCTTCACAAACGGCTCGTTTGGTGACAAAGTTAAGAATCCCGCCTTTGGAATCTTCTTCGCCGGAGAACCAGTTTTGGACGGTGGAGTATTTCACTTCCGCGTTTTTATGAATGATCACCTCTACCACGGCTGCGTGAAGTTGGTAACTGTCTCGCATGGGAGCGGAGCAGCCTTCAATGTAACTCACGTAGGAATCTTCATCGGCGACAAGAATGGTGCGTTCAAATTGGCCTGTTTTGGCATGATTAATTCTAAAGTAGGTTGAGAGCTCACGAGGGCACCTGACACCTTTGGGGATGTATACAAAGGTCCCGTCAGAAGCGACCGCGGCGTTTAAGGCTGCAAAGAAATTGTCTTTAGGAGGGACAACAGTGCCCAAATATTGTTGAACAAGGCTTGGGTATTCTTGAATAGCTTCACTGAATGAACAGAAGATAATCCCCAGCTCTTGCAGCTCTTTTCGATAGGTGGTGGTAACAGAAACCGAATCAAAAATAGCGTCTACAGCAACTTCAGACCCTTCTCTTACTGGGACACCAAGTGCTTCAAACGCAGTTTCTACTTCTTTGGTTAAGAACTCATTTGACCCATCGTCACTGTTGTCGTTACTAGCATCATCGCAACTTGAGCAGCTCGGCGCTGAATAGTAACTGTAATCTTGATAATTGAGCTCAGAGTATTCGGCTTTGAGCCAATGTGGCTCCTCCATTTCTAACCATTGTTGGAATGCGTTCAAGCGAAACTCCAACATCCATTCAGGTTCGTTGCGCTTTGCGGAAATCGCTCTGACAACCTGTTCATTGATCCCTTTTGATAAAGTATCTGAAGACAGCTCTGTGTAGAAACCTTCACGGTAGTGAGTACTTTGCAGTATCTCTTCAACTTCAGGCTGCGTTTCAACTGCACACATGCTTAAACTCCAAAACTCTCACCACATCCACAAAGGTTTTTTACTTTGGGATTGTTGTATACAAAATTGCTATTTAGCCCTTGACGTACAAAGTCAATCTCAGTGCCATCGACCATAGGCATTGCTTGGAGAGCGACATAAAAAACAACGTTATGAGACTCATATTTAATGTCGCCTTCTGATGGTTGTTCGATAAGGGTTACAACGTAGGCAAACCCCGTACAGCCAGAGGATTTCACTGTCAGGTGGAAGTACTGTTTCTCACCCGCTAATTGAGTGATTCTGTTGGCGGCATTTTCACTCAAAGTGACGCCTTGCCACTGCATGTCATGTATTGATATTTCGTTAGAACCAGAACTCGGAATACTCACCTTACTGCCTCATAGTTGATTACTGCGATCAAATGATATTGATTATCATTTTCATATCAACCATGCATTTTTTATGGTTTTTGGTGTGAAGATCAATATATTGACAAAATTAAATCTACAAATACTATATGTGCCACGCTATATTTGATAATGGTTATCATTAAGATTTAAGTGAGGTTGTACTATATGGATATCGTCGTCTACTCGAAACCACAATGTGTTCAATGCACTGCGACAGTGAAGGCATTACAGAGCAAAGGTATCGAGCATACCGTTGTTGATCTAACGCTGGATGAAGTTGCGATGGAGAGGGTTCAATCACTGGGTTATCGACAGGCTCCAGTGGTGGTGGCCAATGGTCGTCACTGGTCGGGTTTTCGACCTGACATGATCAGCGCTTTGGGACAATGATCGTTTACTACTCGAGTGCCTCAGGCAATACCCAGCGATTTGTAGAGCAGCTGGGAATGTCAGCAATTCGGCTTCCTATTGAAGATGGTGAAGGGCTACCCGAGATTGAACAGCCATTCGTGCTCATATGCCCAACATATGCTGACGGAGAAGGGCGGGGAGCCGTGCCTAAGTCTGTTATACGCATATTGAATAATCCGAAAAATCGCGCATTGATTAAAGGCGTGATCGCGTCGGGAAATCGAAACTTTGGGGCGTTGTTTGCTCGTGGTGGTGCGATTGTCGCTGAAAAGTGCAATGTACCACTCTTGTACCGTTTCGAACTATCAGGAACCCAGAACGATGTTGAAATCGTTCGTGACGGGTTAAAACAATTTTGGAAACATCATGGATAACCACTTCGAGAATGAGTCTCTTGACTACCATGCTCTAAATGCGATGTTGAATCTGCTCGATGATCAAGGGGCGATTCAATTTGAGGCAGATAAGCAAGCTGCTCGCCAATTTTTCTTACAACATGTGAATCAAAATACGGTCTTTTTCCATAACTTGGATGAGAAAATAGACTATCTCCTGAGAGAAGGTTACTACGAGTCGGAAGTTATCGAACAGTATGACCGCAGTTTTCTTCATCAGATTTGGCAGCAAGCTTATCAAATCAAATTTCGCTTCCGTACCTTCTTAGGTGCCTACAAGTTTTTCACTTCCTACGCACTGAAAACACGAGATGGTAAAAGATACCTTGAGCGGTTTGAAGACCGAGTTGTGATGACGGCACTCTTATTGGCACGAGGTGATTGTGACTTTGCTCTTGCACTGATGGAGGAGATCATCCATCAGCGTTTTCAACCGGCTACGCCAACCTTTGTCAACGCTGGTAAGAAAAGTCGTGGTGAATTGATCTCATGCTTTTTGCTGAGAATTGAAGACAACATGGAGAGTATTGGCCGCGCAATAAACTCTTCGTTGCAGTTGTCCCGTCGCGGTGGCGGTGTCGCGCTTCTTCTTAGCAATTTAAGAGAACAGGGAGCGCCAATTAAGGGAATTCTTAACCAAAGTTCGGGTGTTGTACCGGTGATGAAGCTACTTGAAGACAGCTTCTCTTACGCAAACCAATTAGGTGCAAGGCAGGGAGCAGGTGCTGTTTACCTAAATGTTCATCATCCCGACATCATGCAGTTTCTCGATACAAAAAGAGAAAATGCCGATGAGAAAATTCGAATTAAGACATTAAGCCTTGGAGTGGTTATTCCTGATATTACGCTGGAGTTAGCCAAGCAAAACAAGGATATGTACTTGTTTTCCCCTCATGATGTAGAGCGTGTTTACGGTGTGCCTTTCTCTGAGATCAGCGTTACCGAGAAGTATCATGAAATGGTGGACGACCCAAGAATTCGGAAAGGGAAGATCAACGCGCGAGGTCTTTTCCAAACATTGGCAGAGATCCAATTTGAGTCTGGCTACCCATATATCATGTTTGAAGATACGGTGAATCAAGCCAACCCGATTGAGGGTCGCATCAACATGTCCAACTTATGCTCGGAGATTCTTCAAGTGAACGAGGCGTCGAGCTTCGATGATGACTTGAACTACAACCATACTGGGCGAGATATCTCATGCAACCTAGGTTCGATGAATATCGCTAAAGCCTTGGATGGTGGGCAGCTATCGCAAACCGTAGAGACGGCAATTCGAGCCTTGAATGCTGTGTCAGAGATGAGTGCCATCAATAGCGTGCCGTCGATTCGAAAAGGGAATGATGAGGGGCATGCGATTGGCTTAGGTCAAATGAACCTGCACGGATTCTTAGCACGTGAGCGAATTCATTACGACTCCCCTGAGGCGATTGATTTCACTGGCAGCTATTTTGCCTGCATCTTGTACCACTGCCTCGTTGCGTCCAACAAACTCGCTCAAGAGAAAAAGAGCGTGTTTAAGGGGTTTGAAAACTCCACTTACAGCGACGGCTCATTTTTCAGTAAGTATCTAGAACAAGATTTCTCTCCAAAGTCAGACGTCGTCAAAGCGCTATTTGAACGCCTAGATATACGTGTACCGACAGTTTCCGAGTGGGAAGAGCTGAAACAAACGGTGATGAGTTCTGGCCTTTACAACCGAAATCTACAGGCGATTCCACCTACAGGCTCTATTAGTTACATCAACGACTCTACGGCGTCTATTCACCCTGTGACCGCCAAAGTTGAGATTCGAAAAGAGGGTAAGATTGGTCGCGTTTACTTCCCTGCGCCTTATTTGAACAATGAAAATTTAGAGTACTTCCGTGATGCGTATGCCATCGGTCCTAAAGCGATTATTGATGTGTATGCCGCAGCGACAGAGCACGTCGATCAAGGCTTGTCACTAACCCTGTTCTTTGATGATGAAGTGACGACACGAGACATTAATAAAGCTCAGATCTACGCATGGAAAAAGAAAATTAAGACGCTTTACTACATCCGTATGCGACAGAAAGCGCTAGAAGGCACTCAAGTCGAAGGCTGTGTGTCTTGCTCACTGTAAATAAGGGATAAACTCAATGAAACAGATGGAATCAGGTCCAGTACGAGCAATTAACTGGAACCGTATGATAGATGACAAAGACCTAGAGATTTGGAATCGTCTAACGGTGAATTTTTGGCTTCCAGAAAAGGTGCCATTGTCGAATGATATTCAGACATGGAAGCAGTTGACCGAGGATGAGCAAACATTAACGATTCGAGTATTTACCGGGCTGACCTTGCTCGACACAATTCAAAACTCTGTGGGAGCTCCTGCCTTGATGGAGGATGCTCGAACGCCACACGAAGAAGCAGTGTTGACCAATATCGCTTTCATGGAAGCAGTACATGCTCGTAGTTACTCTTCGGTGTTTTCAACCTTATGTACCACGCCACAAATTGATGAAGCATTTCGATGGGCGGAAGAAAACCCGTTACTCCAGAAGAAGGCTCAGTTGATTCTAGAAGATTACTTATCAACGGGCGACCCGCTTAAGAAAAAGGTAGCGAGCGTATTTCTTGAAAGCTTCATGTTCTACTCAGGTTTCTACTTGCCGATGCATTGGTCGAGCCGAGCGAAACTGACTAATACGGCGGACTTAATCCGTCTTATTATCCGAGACGAAGCCATCCATGGTTATTACATTGGATACAAATTTCAGCTCGCTTTTCAGGAGCTGGATGAAGTTGAGCAGGCGAGAGTAAAGGATGAAGCGTACTCGCTTATGTTCTCTTTGTATGAAATTGAAACGCAATACACAGAGTCGCTTTATGACCAAGTTGGTTTAACAGAAGATGTTAAACACTTCTTACACTACAACGCCAATAAAGCATTGATGAACTTAGGCTTTGAAGCACTCTTCCCCGATGAATTATGTCAGGTCAATCCGGCAATTATGGCAGCACTTTCGCCAAATGCGGATGAAAACCATGACTTCTTCTCAGGTTCTGGCTCCTCCTATGTGATAGGTAAAGCCGTGGCGACTGAAGATGACGATTGGGACTTCTAAAGAAAAATCCGAGTATTGATACTCGGATTTTTTATTGGCGATTCGATTAATTTATTAGCTGATTAGCCGATGAATTAAATGTGCTTTTTGGCTACTTCAACCGCTTTCTCTAGATCTGGGATGGTTGTCAGTTCAGGAACGCGTTGTAGGTATACGATGGTGTTGTTCTTATCGACAACAAATGTGGTTCTTGTAAGTAACCCTAGCTCGCTGATTTGTGTGCCCGTCTTAAAACCAAATGCGTGGTCTTTTGCATCCGATAGTAGCAACAAGTTATCAGACAGGTTCTTCTCTTTCTTGAAGCGATCTAATGCAAAAGTCGTGTCAGCACTTAACCCAACAAAATCAATGTCGGCAAGCAGCTGCTTGTTGCTCTCTACGAATTTTGAAAGGTCTTGAATTTGCTGGTCACAAACTGGGGTATCGACAGAAGCCAGCACGCTAAAGATACGAACGTTTTTGTTCTCGGCAGACGTGTCATACGCTTGGTTTGTGTTGCTCATTAACGCTACTGATGGAAGTTTGTCACCTACAGACAAAGACTCGCCCACTAGGTTAAAGGTGGTGTCTTGGAACATAGTCAATGTCTGATTGCTACCGGTTGGTGCATTGTCGCTGGTCGTTGAGAACTGAGCTGCTTCAACTGTTGATGATATAGCCAGTGCACTCATTAAAAACAGAGCGTTGTGTTTAGTCAGTTTGAGCTTGTTCTTTGATGACTTAAATAGGTCGCTTAGGTTAGCAATGGCTGTGCGGTTGAATTTCACAATGGTTTTCCTTTAGTTGATAAACAGTAGAAGAACACCAGTGTAGTGAGCGCGAAAATTTCGATATAAACAAAAATATAATTGATTCCAAATAATTCATATCTCGCAAATTCAAACAGCGATTAGGCTTGTTTCTTTTTGTGGGAGAAATACGTAACTCGTTGTTCTAATTATTAATATCTTCATTTTTTATTATCGGTTTCAAACGGTTGATTATCAACACTCTTCTTCGAAATAGCTCCTTGATTACCTTTTCTCGAACATCATATTTGGTGGCGATGTTCCGTGACCACAGACTTTTTATATTGGTGTCGTTTTTGTTAGGACTTGGGAAATGCGCCATTGTACTTTACCCATCAACAGAGATAGTCATCAGCGAATACTGACGTGGCTGAGTCTGAATAGGACACCTTACCTTGTATACCTAAAGTGATAATTAGAAAAACAAAAGTGTAGAAAGACTTTAGATATGAGTGAGAAAGAGTGGTTTGAGACCTTATCAACATAATATGTATATCGTAAGGGTGTGCGAAAAGTGTCGATTGGAGAGTGTGCTTAGAAGAGTAAGACGAACGGCATTACAGCGATTGTGTTACCCCAATTCGTGGCGCTGTCGTTGCAGCCGTTGCTTTAAGAAGCATCTTTATACTTTCTCGTGAAAATGGAACGACGATTAGGGAGCGAAAACGCCATCGCAGGGTGGCCAAAGGAGAGGTAACAAAAAGCCCGAGTACATTGTGCTCGGGCTTTGTCGTTTTATCTCGTTAGTAGAGACTCTGTGTTAGATAGCTTGTCGAAGTTGGAAGTAACGACCTTCACCCGACAGCAGCTCACTATGAGAGCCTTGTTCTACAATCTCACCTTGCTCAATCAAGACGATAGAATCCATCTTCTCTAGTCCAATCAAACGGTGAGTAATGAAGATAACTGTCTTGCCTTCGAAGTGCTGCTCAAACAGCGCCATGATGCTTTTCTCTGTCTGTTTATCTAGGCCTTCGGTAGGCTCATCCAACAGTAGGATAGGGGCATCGTGGAGAATGGCACGTGCAATGCCGATACGACGCTTCTCACCACCAGAAAGCTGACGGCCTCCGTCACCTAACCAAGCATCGAGTGCGTTGTCTTCAAGCAGTTTCTCAAGGCCAACGCTCGTCAGAATGCCTTGTAGCTCTTCATCGCTAGCTTGCGGTTTAGCAATCAGTAGGTTGTCACGCAGGCTACCGTTGAGGATATCAACACGCTGGCTTACCACACTGATGGATTCACGCAGCTGGCTCTCATTCCATTGGGTTAGGTCGATACCCGCAATAGAAATCACGCCACGTTTTGGATCCCAATAGCGAGTCAGCAACTGAATCAAAGTCGACTTACCTGAACCTGTTTGGCCGACAATCGCCACTTTATTGTTCGCCGGAATCGATAGGTCAACCGAGTTCAATACCGTGCGCTCAGAATCTGGATACTGGAATGAAACCGCGTTGAATTGAATATCTAGCGGTTGGTCAATATCAAGCTTGCTATCGCTGAACTGAACTTCTGGTTCAGACAGAATCACTTCATTCAAGCGACGCGCCGAAGTTAGTGTTTGACCTAAGTGTTGGAACGCGCCAGCGATAGGCATTAGCAATTCAAAGCTTGCCATTGTAGCGAAAGCCATCAACGCGATCAGTGGGTCTGGTGCGTTGCCGCCAACACCGTCTGCAGCAAGCCACAACATCAGCACTAGAGTTAAACCGTTGAACAACATCAATGCCGCTGACGCCATACCTGTTAGGTTGGCATTAACGAACTGGTTCGCCATTAGCTTTTGTTGCGTGTTTAGAATCGCATTGCGGTAGCGTTCTTCAGCACCAAACAGGGTTAGTTCGCTGTAGCCTTCAATCCAATCCAGTGTGGTAACACGTAGCTCTGCTTTGTTCTGAGTTAGCTCGCCACCGTTGCGTTGACCCAGCTTGTAGAACAATACCGGCCAGATGATCAACATGATCAGTAGGATAGAGCCTAAGATTAATCCTAGAGAGCTATCAAACCACATCAAAAATGCGGTTAGGAAGAAGATACCCAACACACCCACAGTCACAGGGCTTACCAGTCGTAGGTAGACATGGTCCATCGCATCGACGTCGGCAACCAGGCGGTTCAGCAGGTCAGCATCACGCAGGTTTGAAATACGACCTGGGATAAGTGGTGCCAGCTTTTTGAAGAAGAAGATACGTAGGTCAGTTAACAGCTTGAAGGTCGCATTGTGGCTCACTACACGCTCACCCCAACGGCCAGCGGTACGGCTCATTGCAAGACCGCGCACGCCACCACCTGGCAACATGTAGTTAAAAGTTTCACGAGCGATCGTCAGGCCAGCAACCGCAGAAGCAGAAATGAACCAACCAGACAGGGTCAACAAACCGATAGAGGCGGCTAAGGTTAAGAAGGCTAGTAGCATGCCCAAAGACAGCCCAAACCAATGTTTCTTATACAGTTTCAGGTAAGGCAGTAAATCACGCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP045338|1339942:1349724|1345482_1346454_+|WP_152468561.1|DBSCAN-SWA MKQMESGPVRAINWNRMIDDKDLEIWNRLTVNFWLPEKVPLSNDIQTWKQLTEDEQTLTIRVFTGLTLLDTIQNSVGAPALMEDARTPHEEAVLTNIAFMEAVHARSYSSVFSTLCTTPQIDEAFRWAEENPLLQKKAQLILEDYLSTGDPLKKKVASVFLESFMFYSGFYLPMHWSSRAKLTNTADLIRLIIRDEAIHGYYIGYKFQLAFQELDEVEQARVKDEAYSLMFSLYEIETQYTESLYDQVGLTEDVKHFLHYNANKALMNLGFEALFPDELCQVNPAIMAALSPNADENHDFFSGSGSSYVIGKAVATEDDDWDF >NZ_CP045338|1339942:1349724|1346528_1347125_-|WP_152469539.1|DBSCAN-SWA MSALAISSTVEAAQFSTTSDNAPTGSNQTLTMFQDTTFNLVGESLSVGDKLPSVALMSNTNQAYDTSAENKNVRIFSVLASVDTPVCDQQIQDLSKFVESNKQLLADIDFVGLSADTTFALDRFKKEKNLSDNLLLLSDAKDHAFGFKTGTQISELGLLTRTTFVVDKNNTIVYLQRVPELTTIPDLEKAVEVAKKHI >NZ_CP045338|1339942:1349724|1339942_1340689_-|WP_152468556.1|DBSCAN-SWA MLEIKDLHVSVEENAILKGINLTVKPGEVHAIMGPNGSGKSTLSATLAGKDDYEVDSGEIIFKGQDLTELAPEERAGEGVFLAFQYPVEIPGVSNKLFLNTALNEIREYRGLEPIDRFDFEDLLEEKAQLLKMPPHLLQRSVNEGFSGGEKKRNDILQMAVLSPDLCILDETDSGLDIDALKAVSEGVNALQNKQRAFILVTHYQRILNYIKPDFVHVLYKGKIVKSGDHTLAHELEEKGYGWIIDEE >NZ_CP045338|1339942:1349724|1348002_1349724_-|WP_152468562.1|DBSCAN-SWA MRDLLPYLKLYKKHWFGLSLGMLLAFLTLAASIGLLTLSGWFISASAVAGLTIARETFNYMLPGGGVRGLAMSRTAGRWGERVVSHNATFKLLTDLRIFFFKKLAPLIPGRISNLRDADLLNRLVADVDAMDHVYLRLVSPVTVGVLGIFFLTAFLMWFDSSLGLILGSILLIMLIIWPVLFYKLGQRNGGELTQNKAELRVTTLDWIEGYSELTLFGAEERYRNAILNTQQKLMANQFVNANLTGMASAALMLFNGLTLVLMLWLAADGVGGNAPDPLIALMAFATMASFELLMPIAGAFQHLGQTLTSARRLNEVILSEPEVQFSDSKLDIDQPLDIQFNAVSFQYPDSERTVLNSVDLSIPANNKVAIVGQTGSGKSTLIQLLTRYWDPKRGVISIAGIDLTQWNESQLRESISVVSQRVDILNGSLRDNLLIAKPQASDEELQGILTSVGLEKLLEDNALDAWLGDGGRQLSGGEKRRIGIARAILHDAPILLLDEPTEGLDKQTEKSIMALFEQHFEGKTVIFITHRLIGLEKMDSIVLIEQGEIVEQGSHSELLSGEGRYFQLRQAI >NZ_CP045338|1339942:1349724|1340691_1342164_-|WP_152468557.1|DBSCAN-SWA MCAVETQPEVEEILQSTHYREGFYTELSSDTLSKGINEQVVRAISAKRNEPEWMLEFRLNAFQQWLEMEEPHWLKAEYSELNYQDYSYYSAPSCSSCDDASNDNSDDGSNEFLTKEVETAFEALGVPVREGSEVAVDAIFDSVSVTTTYRKELQELGIIFCSFSEAIQEYPSLVQQYLGTVVPPKDNFFAALNAAVASDGTFVYIPKGVRCPRELSTYFRINHAKTGQFERTILVADEDSYVSYIEGCSAPMRDSYQLHAAVVEVIIHKNAEVKYSTVQNWFSGEEDSKGGILNFVTKRAVCEGDHSKMSWTQSETGSAITWKYPSVILKGDYSVGEFFSVALTNGVQQADTGTKMVHIGKHTRSTIISKGISAGKSQNSYRGLVKVTPNAEGARNFTQCDSMLIGDQCGAHTFPYIEVKNPSAHVEHEATTSRIGEDQLFYCVQRGISEDNAISMIVNGFCKDVFSELPLEFAVEAQKLLSISLEHSVG >NZ_CP045338|1339942:1349724|1342721_1342946_+|WP_150894961.1|DBSCAN-SWA MDIVVYSKPQCVQCTATVKALQSKGIEHTVVDLTLDEVAMERVQSLGYRQAPVVVANGRHWSGFRPDMISALGQ >NZ_CP045338|1339942:1349724|1343333_1345466_+|WP_152468560.1|DBSCAN-SWA MDNHFENESLDYHALNAMLNLLDDQGAIQFEADKQAARQFFLQHVNQNTVFFHNLDEKIDYLLREGYYESEVIEQYDRSFLHQIWQQAYQIKFRFRTFLGAYKFFTSYALKTRDGKRYLERFEDRVVMTALLLARGDCDFALALMEEIIHQRFQPATPTFVNAGKKSRGELISCFLLRIEDNMESIGRAINSSLQLSRRGGGVALLLSNLREQGAPIKGILNQSSGVVPVMKLLEDSFSYANQLGARQGAGAVYLNVHHPDIMQFLDTKRENADEKIRIKTLSLGVVIPDITLELAKQNKDMYLFSPHDVERVYGVPFSEISVTEKYHEMVDDPRIRKGKINARGLFQTLAEIQFESGYPYIMFEDTVNQANPIEGRINMSNLCSEILQVNEASSFDDDLNYNHTGRDISCNLGSMNIAKALDGGQLSQTVETAIRALNAVSEMSAINSVPSIRKGNDEGHAIGLGQMNLHGFLARERIHYDSPEAIDFTGSYFACILYHCLVASNKLAQEKKSVFKGFENSTYSDGSFFSKYLEQDFSPKSDVVKALFERLDIRVPTVSEWEELKQTVMSSGLYNRNLQAIPPTGSISYINDSTASIHPVTAKVEIRKEGKIGRVYFPAPYLNNENLEYFRDAYAIGPKAIIDVYAAATEHVDQGLSLTLFFDDEVTTRDINKAQIYAWKKKIKTLYYIRMRQKALEGTQVEGCVSCSL >NZ_CP045338|1339942:1349724|1342166_1342535_-|WP_152468558.1|DBSCAN-SWA MSIPSSGSNEISIHDMQWQGVTLSENAANRITQLAGEKQYFHLTVKSSGCTGFAYVVTLIEQPSEGDIKYESHNVVFYVALQAMPMVDGTEIDFVRQGLNSNFVYNNPKVKNLCGCGESFGV >NZ_CP045338|1339942:1349724|1342942_1343341_+|WP_152468559.1|DBSCAN-SWA MIVYYSSASGNTQRFVEQLGMSAIRLPIEDGEGLPEIEQPFVLICPTYADGEGRGAVPKSVIRILNNPKNRALIKGVIASGNRNFGALFARGGAIVAEKCNVPLLYRFELSGTQNDVEIVRDGLKQFWKHHG |
9 | Mycobacterium_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1570373 : 1582488
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP045338|1570373:1582488|DBSCAN-SWA ATTAAGAACCCAACCTTGTAAATGGAATTAACTGCAGAATCATGCGAATCGCAATCGCAGAAATGATAAGGCTTAGGCACTGAGGCAAACCCACAGCACCCATGACCCAAGCTACGTTGGGTGGGATAGACTCTAAATACTGGCTGACGTCCACAGGCTCAAATGCAGCCACCGCCCAAGACAATCCCTTGTTCACGACAGACATGAATTCTTCAACAATCCAGAAAACCATATCTTTCAGCATGTCGACAAGGCTTATAACGAGTCGATAGAGAAACTCCAATAGCTGATTGAAAAGCGAGATAATCCAATCCATAAGTTATCCTCCAAAGATAATTTGACGACAATAGAACGCTGTCGATGCTAAGAAGATGAAGCGGAGAAAACCAAATATCGAATCTAAATCGATGTAGTCATCAAATGAAAAGTAGCCATAAAAAGGAACGGGCAGACCGAAGTCAGGGCGCTTAGCACCACTTAAATCAATCTCACCAAAGCTAGAAACGAAAGGGTCAACCACCGATGTTTTCATCTCTTGAAGTTGAGCGGTAACAATGCTGTCTAAACCACCGTCATAACTGGACTGATAGAACCCCTGACAACTGGAGGTTTCAATACAAGAGCCACCAGTTCCGGCCGCTGAAGTATCAACTTCTTTCAAAAGCTCACCTATGTCGTTAACGCCCTCGGTTAGTCCGTCTACTTTCCCCGATAACCCATCAATGCTTGTTTGAACACCTGATAAGTCGACGGTTGATGTGCCTCCGGAGATGGCTCCTATCTGGTCACTCAATTGGTCGATTCTCGCGCCAAGCTCCTGATTGACGGCATAAGCAGCATCATCAGCAACACTGGATAACATTTGAGTGTGGAAATCTTGTGTTTGCTTAATGTCGGTGGTGCTTGTAAGAACCTTATTGAGTTTGCTTTTTATCTCTCCATTAGCTTTAAATAGCTTTTTGAAATTTTGAGGATTATTCCAAGATTTAGCAGCATTTTTTAGTTGCCTTTGTTGGTCTGCCGTCAAACCTCCACCGCCAGAAACCTCAAGGTTGTTTACAGCGTTCAAAACTTGATTAGAAATATCTTGTGCACCACTTGCAGCATCTGCGGCATTTTCAGCATTACGCTTAGCCTCATAAACGTAACCACCAAGAGCAGAGATAGAACCTTGATTAGCAAGAACACCATCGTTAATGGCATCCAGTTCGTTATGTAATTCAGAGAATTTAACGTCAACATCGCTAAATTCATTTTTAATCAGTCGACTGTTATCAACACCGCCATCGATGTTGTTAATAGCGCGTTCTAAATCTTGTAGCTCTTTTCGAGTCCGCTCCTCTTGAAACGTCATTGTCTGGCGTAGGTTCTCAAGTGTCCTTGTAAACTTCTTACTCGTATTACCCTCAGCCATAAGATTGATAATATTTGTCAAATCTCGATTAGCGTTAGAAAGACTGTAGATAGCGGTATTAGAGCTTTGGCAATCAGGGTCTTGAGCATCACACTTTATAAGCTCAGGTCGGCTTGCTTTGTACAAATCAAAAAAGGGATCAAGTTTGTCAGGTATCCCGTTGTTATCCGAATCAAGGTCCACGTTATCATTAAGAGTGGGATTAGGGTCAACACCATTTACAAGTCCGTCACCGTCCCAGTCTTCATCATTGTCCGGTATACCGTTGTTATTCAAATCGTCAGGGTTTGCCGTTTCACTACCACAGTTAAAAATATCTGTATCGGCATAGTTCCAACTTGCCGTGCAGTATGAATCAGGTTGGGGCGGATTATCATCGCAAGTAAGACTCCCTACAGGAAGTAAGGAGCGACTGGGTGGTGTATCAGAGCAGAGCTTCACCTTGAGCTGAGAATACGGGCCGCCGTAAGTGACATCAGCGGATAGAGCAGGCGCAGAACAAAGCAAAGCTAATAGAATTAGAGATGTGCTTTTCATAAAACAACCTATAAAAAAAGGAGCCGAAGCCCCTTTGAAATTAACCTTGGAAATTCTGAGCAGCTAGAAAACCAGACAAGCCACCCAAAAGAGCAAAGACCATAAGAATCAGGTCATGTATGGCGACCAACATACTCAGAAACCTCCATCAGTTAAGCGGACTTAACAGCACGTTTGGCAAGACCGATTGCCTTATACGCCATAGTGATACCAACGATTGCTACACCCGCAGCACCCACTTTAGTTGCAACATCAGCAAAATCTACAGCAGCCCAAATAGCTTCCATAATTCAGTTCCTTATAAAATTTTAATTAACTTAACCGCCATTTTGACGGCATATGACATAGCGAAGCCGCTAACGAAAACCAGTGAGAAAGCCCCACCGAAAGCCGCCGTAGCTTCGATTGCAGTAACCTGAGTGAACCCCATCAAATATTCATAATCACTAGCTGTAATCACAATGACTCCAGAGCAGGACTCAACTGACGTTTCAACTAAAGCTAGAAATCCATTGTCATTTGGTAAGGCACACTTAGGCATAACAATTCCGTTTTAATCTACTTCTTAAGTGACGCATCGAAATGCTTTTGAATGTCCGCATCAGCAGGGACGAGCTTAGTCACCAAGATGTCGAGAGGATCATCAGGGTTGGCACCGAAGTTCAGTTCATATTCACGATTGGCAACAAATGCGCGTGTACGGATGAGTTCGCGCGCGTAGTTCACATCGATTTTCAACGCTTGTTTGTTGTAGGGAATATCCGTCGAGAACCCGATGCCTGTTTGTTGAAACTTCTCGTTGTCGACTTCTTCCACAGCACGAAGCACGGCCAGTTCCGCGAATTCCATACCGGATTTAGGGAAACGCTTGATAGAGATACCAGTTATTGTAGGCATATTGTTGACTCCAAAATTTCGATTTTCTTTTTCGTGTATTCGTCAGGAATCCCTAACGAGGTTTCAAAGTTGGCGCGTCTGTGGTGAGTGGGAATGAGCATCCCGAACGCTTCACCCAAATCACCCTCAGTCATCGCAACGATTTCAGCCGACCCTTACCACATTGACGACGAATCCAAGCGATACGAGCGAAGAACTCTAGACCTTCTTTCTTCTTGTTCAGCTGAAGTTTCATTGGTTCCGCTGGGTCGATACTGGCCGCGAAGTCGCAGATGCCCGCGAACGCAGAAGTAGGCGAGGCGAGGAGTGACAAATCACACTTCTTCAGCTCCACTTCGTTGCGGTACCAAATCACTTCAGGGTCAGTAATCTTTTGCTCAAACTTCTTGTTATAGATTCGCCAGTAGACCGTTGACGTACGAGAGCCAACGAGAACCGCTTCTTCTGATAACTCACCAGATTGAGAAACGCGCTTATGAGGAACCATTGTCGGACCACGACCACGAGAAGCAGTGCGAAATGCTCCCTCATAAAAACATTTCTCAGCGTACTTACAGTCAAAGATTCCGGTGTAGTCATCGACACACAAATCAAGGCGGGCGAGGCGAGTGATGCCCAAGATAGATAACCACCAGTGCAGTTTTTGGTGTGAGATGAACTCGAACAACTTCGAACAACCCGTACCGTTGATTTGCACAAACACCGTGTCATTGTTGCCACCGATACCAACTAGACCACATTCCACTTCACCCGTTGAATCGAGAATCACCATAGAGTCGTTGTAACCATGAAGGCCACGGCCACGCATCGGAGACAAACGGAAGTTGAACACCTTGGCCATGAACTCCTCGAACCTATGAGCCAGAATCTTGCTGCATTTGTTACGATGTAACTCCATGGATTTCTCTATGGCTTCTGGTGAGTTAAAGCGGCCGTTAACGCTGACTTTCTTAAACTCTGGGAACTGCAAGTTGATAAAGTCTTGATCGTTTGAGTTGTCCAAAGAACGCAAAGAACTGTAGGAAAACGAAAACGCTAAGTGGTCAACTTGTACCGGACGAATTTCGTCGTGATACTTGTGCGGTTTCTTAGATGGCATGGAAAACTCCCTTAACCAATAACTCTTGATAGTTCTCGTCAGTAATCTCGACCACTTGGAACGAGGTCATTCCGTAGTGCGTCATCAAGAACTGGTAAAAGTCGTAAGGTGTTTTGAAGAACTCATGACCCCAAGTGAAGTAAGCATTGATGCCATGGCTCGGTTCGTTATCGAAGTAGATTGAATCCATCACGAACACTCCAAGGCCAAGAGCTCAGGAGTAGGAGACCGAACCATCAGATTCTTGGTACTCGATGTTTTAATAAGGCAGCTAAGGTCTTGCACCAGTGAGCGAACAAACTCGAAGTGTTCTTGAGCACGATGCTTATCTTTCAGCATGGCTAAGATTTCAGTGACAGCGTCACCAGAGAATTGAACGCCATTATCAAGGCGTTGGCCTAAACGAAATATCTTGAACTGGCGCAGATAGCTGCGAATCGACTTAGCCAAGTTCGGTTGAGTTCGTGAGCTTGAGGTTATCGATAAGACACCATCAACCAACGTGTATTTGAAAGAGCCAACATGCTTGGCCGTCGAAAGAGACTGAGATTTAGCGACTTGGCGATTGGCTTCTTTGATAGATGCTTGGTTATTTTCGAACTCGGCCTGACGTTGGCGCTGACGAGAAGCAAAGTCTTGTTTGGCTTTAACTAGCTTGAGTTCATTGAAGTAGTTGATGCATTCAAGAGCGAACTCAGTTAACTCGCCAGATGAGTCATACATAGATTCGCCAAAGTTCTTACGAAAGAACACACCAGTTAGGCGTGGGCTAACTTTGAATTCATTTTTAATGAGTATGACAACGTTACAGTCCAACATAATTCACCTTAGGAAAAGAAGCTTTTGCCTGCATGAGTACCGATTTATGGGGATTCTAGTTGCCATTAATGGGGATTGTAAATAGCGCAAATTGGGGATTAAAAAGCTAGAATGAGATCAAACGGAGATGAGCTATGTATGCAAATAAACTATTAGATGCTTATAAAAAGTCGCAAAACTACGTACAAGATAAGCAGATAGCACATGATTTAAATCTTTCACCACAGAAGCTAAGCAAAATACGCAAAGGTATTCGCTATGTATCTGATGAGGAAGCAATTTTCTTAGCTGAAAATGCAGGAATTGACCCAGAGCTAGCTTTGCTAGGTTGTCACGCTGACCGCAATGAAAACCCAGAGGTAAAACAACTTTGGGAGCAGATTGCAAAAAAGTTTGATAGGCATGGTTTGTCAGGCATTTCAATGGCTTGCGGTGGATTAGCATTATGGATGATGCCTACCCAAGAAGCACTAGCCAACTGCGTATTATGGGTTTGAATTATGTTGAGGGTTTGCTGAAAGCCTGAATGAAATCAAACGTGGGCGAGGGATATGCTCAATTAGAAAGGCCAGCTTGCTGGCCTTTTTTAATGCTCATCGTGAGAATCGTTTAATGATTGAAGCCCGTAATATAATTTGCATAATGCGTGTGTTATTGCTGTGTAACAATGCTTTTAGGTAAGTTGGTGTATTCGTTGGAGCATTCATATTCTGTATGCAACCATAGAGCTACAATCATTGCTTGATCTAATATGTGCGGGTAGATGTCAAGCTTTCTTGCCCATTCTTGAGCTTGAAGCTTAGCTCTTCTGATACTTATATTACGACTGATTTTTTTATTTGCTCCATGTTGCTCGAATACCATGTCAAGCCATAGGGTGGCAGTATTAACTTCGTGATAAAATGTATCTAATTCTTGAGTTATCTTTTCTATTAGAGACTCGTCTATTAGTTCTGGTCTTTTATTTTTGATTTCGTTGAGGACTTGAATTGTAGCTTTCTGGTCAATCATCATATCTATATCTCCCTCTCTTTTAATATATGCTAACCTATCTAAGTGTCTAATATCAAAGGTTTAATACTACACAGCACTGTTACGTTATGACTCACTTTTCGCTGCTATAAGCTTGCTTGTAGAGGGAAAATGAGTGGCCAAGCGCCCACGTTCTTTAAGAGTCAGTTTCAGTCAATCGCGCGGCAGTTTTTGGTTTATCTCGACCTATGACGAATATTGACGAGCGAGACTAAGCGAGGAAAATTGAGGAGTAGGTCGAAGCTAGATTTTTCGAATGGAAGGGGCCCCGTTTCGTCGGGGAAGGAGCCATTCGTAAATGGCTCCTTCTGGCGAAGGTACACTATCTTTAATAGTGTACTCTTGTTTCATTTTGAAACACTACTTAGACAAAATTTGCTTCCAAGATCTTACACAGCTTTCTTCTTCTTGCTTGTTATTGAATAAGATTGAGCCATTTTCGTATAGCGCTAACATGTCTCGAGCTTGATTTTTATCACGCTCTTCAAATGAAAATATAGGGCGATTAGATTGCTTGGTTAACTCTTCTATAAACCCTTTTTTATCATCGACATATTCATACATCGTCATGATAACTTCAGTTTCAGATACCCCCTTCATGCAAGGCCCAACAAGCTTCATGAACTCAAAAGGATCTAACTGCGTCACTTGATCTTTTTTGCCTTGGCAAATCTCGCGTTCAGTTAGGTTTAAAGCTTTTGCCAATTTCGCCACTTGAGTCGCTTTGGGTTCGGTTATTCCTCTTTCCCATTTACTGACTGTCTGCACAGTTACGTCCATCTGTTCAGCTAAATCTTCCTGCTTTATTACTAGTTCAATTCTACGTTCTTTTAGTACTTCACCAATACTCATGTTTACGTTGTCCATATCAAAACCTTACATTAGTTTAAGTTTACACCCACTTTAGTGTTGCTTATTAGTAACACATACTTTAGTGTATATGTACATACACAAAAGTACTTATTAAACTTCTTGACAGGGTTTTAGGTGTGATAGACAAACTCAAGATATCAATACCATTTAAAGCAGAATTCACTATGACGACTTACCAAGCTAAGTCGGGTGAAGCTGTCTCTTATGTTGATATTAAAGAGTGTTCTCGACGCGGAATCGGCTTGGAGGCGAAGACGATTTTCTTTACCGGAGAGTCTGGCTCGGATAAGTACGAAGTTGCTGACTTAAGGCATGCGTTTGAATCTCTTCCAACGCACTTTACTGGTATGGCATTCAAGATTTATCAAGGAACGCGCTTGCGTTCTCCTTGTATAGAGTTAAAAGCTTCTCCCGCAAAAATTTTGCAAGGACATAATGTTTACGGGCCAACATCTATTGAAGTTGGAGCGGTTGAAATGTTGATGGCATTTTATAACAACTATCCAGACGTCTACGAAATGCTTGACGTTCCATTATCGACACTAGATACCATTGATGCGACGTATTCAGCTCGAGTTAAAACTGAACTCCAAGCTCGCCAAGTTATCCAGCAACTCAAAAATGTATCGAATAAGCAGATGCGTACAGCCGTACGTAATGAGCATGAAACCACAGTTTACTTCAACAAGAACTCTCGCCACTGTGATCGCAAAGCTTACTTAAAAGGGCCTGAGTTCAATCGTCAGCTGAGAGATTTAAGAGCACTTCAGGAGAAAGGTGATCATTCATATGATCGTGTTATCGATGTCATGTCTTCACCTCAATTAATAAATTATGCACGGCACTTAGTACGCTTTGAGGCAGGAGCTCACCGTCGCTACTTAGACGCTCTAGAAATACCCAAAAACCTATTTGAAGCAATTAGTTATCAAAAAGAATACGAGTCGGAAGGTCGAAACTTAATAGCAGATATCTGGACGAAAGCATTTACGCCATTACTAGACGCATTAGAGGGACAACGCATGAACATCTTTAACGACGAAGAAGTCCATAACAACTTGAAAAAAACTTATTATCGCATCACTCCAAAAGGAAACGTCTCTTACGCTAAAGCTGACAAGTTATTCCGTTTTTATCGCTCGTTAGTATCAGATGGCTATTGTGCGGTATATCAATCATTTACATCTAGAGCAACATTCTCTCGTCAGCTAAATGACTTATTAGCAATCGGCTTTTCAAAAGCACAACTTCAAAACCTTCAAGGTAATGAGAAGGACAATGTAGTTCCTCTATTACAAGTCATTGAAATCGACTTCTCACAACAACACCCAACTGACTATAAAGAGCCCAAAGTAGGGCACATATCACGAATGTATGGTTACGGTGAAGACAACGTAATCAGACTAACAGCTTAACCAAACTCATAAGGAAACATCATGCTAAAAATCGAGATCTTCCCAGAAAACGTACAAGTTGAAACACGAACAATCCCAGCAAAAGAAGGTAAACCGCCTCGCAAGATTTATGAGCAAATAGCTTATGCACACTTAGGTGGCAAGTTCCCTGTAGAGATGAAATTACAATTAGAAGAAGGCCAACCAGCTTATGTATCAGGACTTTATACAATACATTCCTCTAGCTTTGTTATTAATCAGTTCGGCTCTCTTGAGTTGAAACGCTTCGGGCTACTCATTGACCCAATAGTCGAAAAATAAACTGCGAACAAGGAAGTTAAACGTCATGGCTACAGTAGGAATATATAACCCAAACAATAGTGTTCATCTTGGCTTAGCAGACTTTGAGTTACTTCAATTATCACGGCCAACATATAAAGCTCGCTTTATGCGTCGAGGTCATGAAGATGTTCAACTCTTAGTCTTTTTAGATGACCATCGAGAAATCCCATTTGTTAGAGTTCCACTCGAAATCTTACTAAAGGCAGCTAAGAGTTATGAGTCAAACAACTAGTTATTTTACTCATCCATTTTTTTGTATTCCACGAGATGAACGTAAAACAAATGGCTCAAAGGGAGCTCGTATCGCAGGATACGAGCCATACTCATCAGAAGAAGATGCTTACTTAATCGAGCATTATCAAAAAATACCTTTGACCCATATTGCTAAAAAACTCGGTCGCTCGGCTAGCTCTGTCTACACACGTTCTAAGAAGTTAATTAGAGATGATCTGATAGTTAATAATCAACAGTGGAAAAAATCGCACTATAGTGAAAAAGAAGACGCATTCATTATCGAATGCCAAGATAAAATGTCCTTCGAACAAGTTGGTCAGGTTCTTGGCCGCAGCAGAGACTCAGTAAAAGTTAGAGCCGGTAAGTTAGGCGTCTCATACAGGAAGGTTGCTGAAACATCTCCGGTAGTAAAGCTATATAATGATGATGTTGAGTTAATACGTCAGTTATCAGAGTTAGGCTTAACTTATCGAGAAATCTCAGAAAAGTTCGAAGTTGATGAGAGTCATGTGAGGAAAGTCTGTCTCTTTAAAGAGCGTTTATACTTGGATAAGAAGGATTATTTTAATCATATAAATCGTCAAATAGATGCCATAGATGGCCAGGATTAACCTGAAAGAAATCACAATCCAACTGCGCATTATGGGTTTGAATTATGTTGAGGGTTTGCTGAAAGCCTGAATGAAATCAAACGTGGGCGAGGGATATGCTCAATTAGAAAGGCCAGCTTGCTGGCCTTTTTTAATGCTCATCGTGAGAATCGTTTAATGATTGAAGCCCGTAATATAATTTGCATAATGCGTGTGTTATTGCTGTGTAACAATGCTTTTAGGTAAGTTGGTGTATTCGTTGGAGCATTCATATTCTGTATGCAACCATAGAGCTACAATCATTGCTTGATCTAATATGTGCGGGTAGATGTCAAGCTTTCTTGCCCATTCTTGAGCTTGAAGCTTAGCTCTTCTGATACTTATATTACGACTGATTTTTTTATTTGCTCCATGTTGCTCGAATACCATGTCAAGCCATAGGGTGGCAGTATTAACTTCGTGATAAAATGTATCTAATTCTTGAGTTATCTTTTCTATTAGAGACTCGTCTATTAGTTCTGGTCTTTTATTTTTGATTTCGTTGAGGACTTGAATTGTAGCTTTCTGGTCAATCATCATATCTATATCTCCCTCTCTTTTAATATATGCTAACCTATCTAAGTGTCTAATATCAAAGGTTTAATACTACACAGCACTGTTACGTTATGACTCACTTTTCGCTGCTATAAGCTTGCTTGTAGAGGGAAAATGAGTGGCCAAGCGCCCACGTTCTTTAAGAGTCAGTTTCAGTCAATCGCGCGGCAGTTTTTGGTTTATCTCGACCTATGACGAATATTGACGAGCGAGACTAAGCGAGGAAAATTGAGGAGTAGGTCGAAGCTAGATTTTTCGAATGGAAGGGGCCCCGTTTCGTCGGGGAAGGAGCCATTCGTAAATGGCTCCTTCTGGCGAAGGTACACTATCTTTAATAGTGTACTCTTGTTTCATTTTGAAACACTACTTAGACAAAATTTGCTTCCAAGATCTTACACAGCTTTCTTCTTCTTGCTTGTTATTGAATAAGATTGAGCCATTTTCGTATAGCGCTAACATGTCTCGAGCTTGATTTTTATCACGCTCTTCAAATGAAAATATAGGGCGATTAGATTGCTTGGTTAACTCTTCTATAAACCCTTTTTTATCATCGACATATTCATACATCGTCATGATAACTTCAGTTTCAGATACCCCCTTCATGCAAGGCCCAACAAGCTTCATGAACTCAAAAGGATCTAACTGCGTCACTTGATCTTTTTTGCCTTGGCAAATCTCGCGTTCAGTTAGGTTTAAAGCTTTTGCCAATTTCGCCACTTGAGTCGCTTTGGGTTCGGTTATTCCTCTTTCCCATTTACTGACTGTCTGCACAGTTACGTCCATCTGTTCAGCTAAATCTTCCTGCTTTATTACTAGTTCAATTCTACGTTCTTTTAGTACTTCACCAATACTCATGTTTACGTTGTCCATATCAAAACCTTACATTAGTTTAAGTTTACACCCACTTTAGTGTTGCTTATTAGTAACACATACTTTAGTGTATATGTACATACACAAAAGTACTTATTAAACTTCTTGACAGGGTTTTAGGTGTGATAGACAAACTCAAGATATCAATACCATTTAAAGCAGAATTCACTATGACGACTTACCAAGCTAAGTCGGGTGAAGCTGTCTCTTATGTTGATATTAAAGAGTGTTCTCGACGCGGAATCGGCTTGGAGGCGAAGACGATTTTCTTTACCGGAGAGTCTGGCTCGGATAAGTACGAAGTTGCTGACTTAAGGCATGCGTTTGAATCTCTTCCAACGCACTTTACTGGTATGGCATTCAAGATTTATCAAGGAACGCGCTTGCGTTCTCCTTGTATAGAGTTAAAAGCTTCTCCCGCAAAAATTTTGCAAGGACATAATGTTTACGGGCCAACATCTATTGAAGTTGGAGCGGTTGAAATGTTGATGGCATTTTATAACAACTATCCAGACGTCTACGAAATGCTTGACGTTCCATTATCGACACTAGATACCATTGATGCGACGTATTCAGCTCGAGTTAAAACTGAACTCCAAGCTCGCCAAGTTATCCAGCAACTCAAAAATGTATCGAATAAGCAGATGCGTACAGCCGTACGTAATGAGCATGAAACCACAGTTTACTTCAACAAGAACTCTCGCCACTGTGATCGCAAAGCTTACTTAAAAGGGCCTGAGTTCAATCGTCAGCTGAGAGATTTAAGAGCACTTCAGGAGAAAGGTGATCATTCATATGATCGTGTTATCGATGTCATGTCTTCACCTCAATTAATAAATTATGCACGGCACTTAGTACGCTTTGAGGCAGGAGCTCACCGTCGCTACTTAGACGCTCTAGAAATACCCAAAAACCTATTTGAAGCAATTAGTTATCAAAAAGAATACGAGTCGGAAGGTCGAAACTTAATAGCAGATATCTGGACGAAAGCATTTACGCCATTACTAGACGCATTAGAGGGACAACGCATGAACATCTTTAACGACGAAGAAGTCCATAACAACTTGAAAAAAACTTATTATCGCATCACTCCAAAAGGAAACGTCTCTTACGCTAAAGCTGACAAGTTATTCCGTTTTTATCGCTCGTTAGTATCAGATGGCTATTGTGCGGTATATCAATCATTTACATCTAGAGCAACATTCTCTCGTCAGCTAAATGACTTATTAGCAATCGGCTTTTCAAAAGCACAACTTCAAAACCTTCAAGGTAATGAGAAGGACAATGTAGTTCCTCTATTACAAGTCATTGAAATCGACTTCTCACAACAACACCCAACTGACTATAAAGAGCCCAAAGTAGGGCACATATCACGAATGTATGGTTACGGTGAAGACAACGTAATCAGACTAACAGCTTAA
Protein sequences of DBSCAN-SWA_3 >NZ_CP045338|1570373:1582488|1575789_1576152_-|WP_152468668.1|DBSCAN-SWA MMIDQKATIQVLNEIKNKRPELIDESLIEKITQELDTFYHEVNTATLWLDMVFEQHGANKKISRNISIRRAKLQAQEWARKLDIYPHILDQAMIVALWLHTEYECSNEYTNLPKSIVTQQ >NZ_CP045338|1570373:1582488|1572621_1572864_-|WP_152468665.1|DBSCAN-SWA MPKCALPNDNGFLALVETSVESCSGVIVITASDYEYLMGFTQVTAIEATAAFGGAFSLVFVSGFAMSYAVKMAVKLIKIL >NZ_CP045338|1570373:1582488|1574508_1575138_-|WP_152468666.1|DBSCAN-SWA MLDCNVVILIKNEFKVSPRLTGVFFRKNFGESMYDSSGELTEFALECINYFNELKLVKAKQDFASRQRQRQAEFENNQASIKEANRQVAKSQSLSTAKHVGSFKYTLVDGVLSITSSSRTQPNLAKSIRSYLRQFKIFRLGQRLDNGVQFSGDAVTEILAMLKDKHRAQEHFEFVRSLVQDLSCLIKTSSTKNLMVRSPTPELLALECS >NZ_CP045338|1570373:1582488|1570691_1572323_-|WP_152468664.1|DBSCAN-SWA MKSTSLILLALLCSAPALSADVTYGGPYSQLKVKLCSDTPPSRSLLPVGSLTCDDNPPQPDSYCTASWNYADTDIFNCGSETANPDDLNNNGIPDNDEDWDGDGLVNGVDPNPTLNDNVDLDSDNNGIPDKLDPFFDLYKASRPELIKCDAQDPDCQSSNTAIYSLSNANRDLTNIINLMAEGNTSKKFTRTLENLRQTMTFQEERTRKELQDLERAINNIDGGVDNSRLIKNEFSDVDVKFSELHNELDAINDGVLANQGSISALGGYVYEAKRNAENAADAASGAQDISNQVLNAVNNLEVSGGGGLTADQQRQLKNAAKSWNNPQNFKKLFKANGEIKSKLNKVLTSTTDIKQTQDFHTQMLSSVADDAAYAVNQELGARIDQLSDQIGAISGGTSTVDLSGVQTSIDGLSGKVDGLTEGVNDIGELLKEVDTSAAGTGGSCIETSSCQGFYQSSYDGGLDSIVTAQLQEMKTSVVDPFVSSFGEIDLSGAKRPDFGLPVPFYGYFSFDDYIDLDSIFGFLRFIFLASTAFYCRQIIFGG >NZ_CP045338|1570373:1582488|1578479_1578758_+|WP_152468671.1|DBSCAN-SWA MLKIEIFPENVQVETRTIPAKEGKPPRKIYEQIAYAHLGGKFPVEMKLQLEEGQPAYVSGLYTIHSSSFVINQFGSLELKRFGLLIDPIVEK >NZ_CP045338|1570373:1582488|1579819_1580182_-|WP_152468668.1|DBSCAN-SWA MMIDQKATIQVLNEIKNKRPELIDESLIEKITQELDTFYHEVNTATLWLDMVFEQHGANKKISRNISIRRAKLQAQEWARKLDIYPHILDQAMIVALWLHTEYECSNEYTNLPKSIVTQQ >NZ_CP045338|1570373:1582488|1570373_1570688_-|WP_032549088.1|DBSCAN-SWA MDWIISLFNQLLEFLYRLVISLVDMLKDMVFWIVEEFMSVVNKGLSWAVAAFEPVDVSQYLESIPPNVAWVMGAVGLPQCLSLIISAIAIRMILQLIPFTRLGS >NZ_CP045338|1570373:1582488|1580560_1581067_-|WP_152468669.1|DBSCAN-SWA MDNVNMSIGEVLKERRIELVIKQEDLAEQMDVTVQTVSKWERGITEPKATQVAKLAKALNLTEREICQGKKDQVTQLDPFEFMKLVGPCMKGVSETEVIMTMYEYVDDKKGFIEELTKQSNRPIFSFEERDKNQARDMLALYENGSILFNNKQEEESCVRSWKQILSK >NZ_CP045338|1570373:1582488|1572881_1573220_-|WP_032549084.1|DBSCAN-SWA MPTITGISIKRFPKSGMEFAELAVLRAVEEVDNEKFQQTGIGFSTDIPYNKQALKIDVNYARELIRTRAFVANREYELNFGANPDDPLDILVTKLVPADADIQKHFDASLKK >NZ_CP045338|1570373:1582488|1579288_1579624_+|WP_152469541.1|DBSCAN-SWA MIECQDKMSFEQVGQVLGRSRDSVKVRAGKLGVSYRKVAETSPVVKLYNDDVELIRQLSELGLTYREISEKFEVDESHVRKVCLFKERLYLDKKDYFNHINRQIDAIDGQD >NZ_CP045338|1570373:1582488|1577159_1578458_+|WP_152468670.1|DBSCAN-SWA MIDKLKISIPFKAEFTMTTYQAKSGEAVSYVDIKECSRRGIGLEAKTIFFTGESGSDKYEVADLRHAFESLPTHFTGMAFKIYQGTRLRSPCIELKASPAKILQGHNVYGPTSIEVGAVEMLMAFYNNYPDVYEMLDVPLSTLDTIDATYSARVKTELQARQVIQQLKNVSNKQMRTAVRNEHETTVYFNKNSRHCDRKAYLKGPEFNRQLRDLRALQEKGDHSYDRVIDVMSSPQLINYARHLVRFEAGAHRRYLDALEIPKNLFEAISYQKEYESEGRNLIADIWTKAFTPLLDALEGQRMNIFNDEEVHNNLKKTYYRITPKGNVSYAKADKLFRFYRSLVSDGYCAVYQSFTSRATFSRQLNDLLAIGFSKAQLQNLQGNEKDNVVPLLQVIEIDFSQQHPTDYKEPKVGHISRMYGYGEDNVIRLTA >NZ_CP045338|1570373:1582488|1574308_1574509_-|WP_032549081.1|DBSCAN-SWA MDSIYFDNEPSHGINAYFTWGHEFFKTPYDFYQFLMTHYGMTSFQVVEITDENYQELLVKGVFHAI >NZ_CP045338|1570373:1582488|1576530_1577037_-|WP_152468669.1|DBSCAN-SWA MDNVNMSIGEVLKERRIELVIKQEDLAEQMDVTVQTVSKWERGITEPKATQVAKLAKALNLTEREICQGKKDQVTQLDPFEFMKLVGPCMKGVSETEVIMTMYEYVDDKKGFIEELTKQSNRPIFSFEERDKNQARDMLALYENGSILFNNKQEEESCVRSWKQILSK >NZ_CP045338|1570373:1582488|1575272_1575635_+|WP_152468667.1|DBSCAN-SWA MYANKLLDAYKKSQNYVQDKQIAHDLNLSPQKLSKIRKGIRYVSDEEAIFLAENAGIDPELALLGCHADRNENPEVKQLWEQIAKKFDRHGLSGISMACGGLALWMMPTQEALANCVLWV >NZ_CP045338|1570373:1582488|1581189_1582488_+|WP_152468670.1|DBSCAN-SWA MIDKLKISIPFKAEFTMTTYQAKSGEAVSYVDIKECSRRGIGLEAKTIFFTGESGSDKYEVADLRHAFESLPTHFTGMAFKIYQGTRLRSPCIELKASPAKILQGHNVYGPTSIEVGAVEMLMAFYNNYPDVYEMLDVPLSTLDTIDATYSARVKTELQARQVIQQLKNVSNKQMRTAVRNEHETTVYFNKNSRHCDRKAYLKGPEFNRQLRDLRALQEKGDHSYDRVIDVMSSPQLINYARHLVRFEAGAHRRYLDALEIPKNLFEAISYQKEYESEGRNLIADIWTKAFTPLLDALEGQRMNIFNDEEVHNNLKKTYYRITPKGNVSYAKADKLFRFYRSLVSDGYCAVYQSFTSRATFSRQLNDLLAIGFSKAQLQNLQGNEKDNVVPLLQVIEIDFSQQHPTDYKEPKVGHISRMYGYGEDNVIRLTA >NZ_CP045338|1570373:1582488|1578783_1579011_+|WP_152468672.1|DBSCAN-SWA MATVGIYNPNNSVHLGLADFELLQLSRPTYKARFMRRGHEDVQLLVFLDDHREIPFVRVPLEILLKAAKSYESNN |
16 | Vibrio_phage(70.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2568325 : 2575217
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP045338|2568325:2575217|DBSCAN-SWA ATTAGGAATCGATTTCAGAAAGAACGTTGATCATCTCAAGTGCGCTTAGTGCAGCTTCTGCACCTTTATTACCAGCCTTGGTTCCTGCGCGTTCAATAGCTTGATCGATCGTATCAACAGTTAGCACACCGAATGCTACTGGAAGAGAGTATTCCAGAGACACTTGCGCCAAACCTTTATTACATTCACTACAAACATAGTCAAAATGAGGCGTACCGCCACGAATTACTGTACCTAGAGATACAATCGCGTCGAACTTACCTGTTTTTGCAACGCGCTGCGCTACAAGTGGAAGTTCAACTGCACCTGGACAACGAACAACAGTGATGTTGTCTTCGCTAACTTGTCCGTGACGTTTTAAAGTATCGATTGCACCAGAAAGTAAACTTTCGTTAATAAAACTGTTGAAACGAGCAATAACGATAGCAATTTTTGCATTTGGCGCTGGGAAGCCACCCTCGATCACTTTCATAAGCCTTCCTTTAACTATTTTCATCAAGTGAGAATCGCCGGATTCTAGCACAAAACTGTGAGCAATATCTAATGCTAATTTTATTGAACTGATACCGGAGAAACAAAGTGGTAACGACAGCGTGATTCCTTTGTCGATAAGGCTCAAAACGCAGAGAGCGGACTATGCAATCTTGTGCTCTCTGCTTGCTACCTATCGTTTAATCGAGACTGTATTACCTAAAAAGTTTGCTGATTGTTCTGTTTTGAATCAGCGTTATTCGCACACGTACTCAACAACGTTCAAACCAAAACCACCCAGTGCGTGGTAGCGTTTGGTACTTGAAGAGAGTAGGCGCATGTCTTGAACACCAAGGTCTTGCAGGATCTGAGAGCCTACACCAACACGACGTGACGTGCCTTGCTTCTTAGCCATAGTTGGCTGCTCGTTCTTATCTTGAGCTTCGAAAGTCTTCACTTTGTGGATGAGAGAATCAGACGACTCTTCATTACCAAGAATCACTAGAACACCACCTTCTTCGCCGATACGCTTCATCGCTTTATCTAGCGACCAGCTGCGCTCAGTCCCACGATCAGAATGCAGCAGATCGGTGAATGTGTCATGCAGGTGAACACGAACCAATGGCGCTTCACCCGTTAGGTCACCCTTTTGCATCGCGTAGTGGATTTGGTTATCAATCGTATCGCGGTATGTCACGAGTTCGAAATCACCAAACTCAGTCGGTAGATGACACTGCGCGACGCGCTCAATCGTTGTCTCGGTGTTGTTGCGGTATTCGATAAGGTCTGCAATCGTACCTAGTTTGATGTCGTGCTTTTCAGCAAACACTTCAAGGTCTGGACGACGTGCCATAGTGCCATCGTCATTCAGGATTTCAACAATCACAGATGCTGGTTCACAGCCTGCCAAGCGAGCCAAGTCACAACCCGCTTCAGTGTGACCTGCACGAGTTAATACACCGCCCTCTTGAGCCGTCAATGGGAAGATATGGCCCGGTTGAACTAGATCTGCCGCTTTCGCCTCTTTCGCCACAGCCGCTTGAACAGTCACAGCACGGTCTGATGCAGAAATACCCGTCGTTACGCCTTGTGCAGCTTCAATAGAAACGGTGAAGTTAGTAGTGTATTGCGCGTTGTTGTCTTGAACCATTGGCGCGAGACCTAAACGGCTCGAGCGCTCTTTGGTTAGCGTCAGACAGATCAAGCCACGACCGAACATCGCCATAAAATTGATTGCTTCTGGCGTTACATGCTCTGCCGCCATGATCAGATCACCTTCATTCTCGCGATCTTCATCGTCCATCAGGATAACCATCTTTCCTAGGCGGATGTCTTCAATAATTTCTTGAGGAGTACTAATTGGCATTATTTTATATCCTTTGAAATGGTTCTGCTTCCAAGAACCGACACTGTTTAAACCTGATTGGAATTGCTCAACTCCACTGCCCTTCGCTGTCGCGCACAAGGAGTCGTTGATTAAGCAAAACCGTTTTGTTGTAAGAATTCCATTGTTAGACGAGACTCTGGCTTAGACTCTTCTTGCTGGCCTTGCAGTAGTCGCTCCATGTAACGAGCAAGTACATCTACTTCAAGGTTCACCTTACGACCGACATTAAATTGATCGATCGTCGTCTCTGATGAGGTGTGAGGAACAATAGTCAGCTTAAACGCGTTCTTACGTAAATCATTCACTGTAAGGCTGATACCATCGACAGTGATAGAGCCTTTTTGAGCGACGTACTTTGAGATCTCCGCAGGCATCTCTACCCAGAATTCAATCGCACGACCAACTTGGTTGCGCTCGACAATCTCACCAACACCGTCAACGTGACCAGAAACAATGTGACCGCCAAAACGCGTGGTCGGCAGCATTGCTTTCTCAAGGTTTACTTTATCGCCCGCTTGATAGTCGATAAAGCCGGTCTTTTTCAGAGTTTCCAGTGATAAGTCAGCGCTATAGCTGTTGTCATCGAATTCAACCACAGTCAAACACACACCATTGGTGGCGATGCTGTCGCCGAGCTGTACATCAGACATATCCAGCTTACCGACATTCACGGTTACCGTAATATCTTCACCGCGAGGTGTGATAGCACTTAGGGTACCTACGGCTTCTACAATTCCTGTAAACATTCTAAAACTCTTGTTCTTACTGCTTTTACATCAAGCAGTTAAATTACTATAGATATAACTATCACGGATTAAGATTTCCATACTGGCTGAGCGACAATGCGAATGTCAGGCCCTACCATACGTACATCTTTGATCGACAAATCGATCACGTCATCCATTGAGCTTAAGCCCAACGCACCCATCAAACCGCGGCCATCGCTGCCCATTAGTTTAGGTGCTAAGTATAAGATTAGCTCGTCAACCAGTTGATTCTCAATAAAGCTTTTTGCCAATGTTGCGCCTGCTTCCACCCAAATATGGTCAATGTGATGACTAGGAAGATCTTGAATCAGCTGGGTAATATCAAGCTGCCCCTCTCTGTCGAGAGCAACCATAATGTCAGCTTTAGACTCAGCAACCCGTAGCACTGGCGCTTCAGATTGATACAACTTGAGACCTTGATTAGCCGCTTGAAGCAATTTTGCCTGTCGATCGAGAATCACACGCGTAGGTTGACGCAGCGAAGATTGCGGATAAAGAGATTGAACCGCTTGCGGTAACTCGTCCCAACGTACATTAAGTGACGCATTATCTTCAATCACGGTTTGACTGGTCGATAATACCGCTCCCGCTTTAGCACGGTAATGTTGCACATCACGACGAGCATGCGGCGAAGTAATCCATTGGCTCTGACCATTACTTAAAGCGGTTTGACCGTCTAAGCTTGCTGCCATCTTCAATTGAACAAACGGCATACCTGTTTGCATGCGTTTGATAAAGGCTGGGTTTAACGCCAATGCATCTTGCTCTAGCAGCCCGACTTGCACCTCAATACCCGCATCACGCAGCATTTTGATGCCGCGGCCTGCAACTTTCGGGTTTGGGTCCTCCATTGCACAAATCACTTTTGCAACTTGAGCCTTAATCAACCCTTCAGCACATGGCGGTGTACGACCGTAATGAGAACACGGCTCTAGCGTGACATAAGCAGTCGCGCCTTTGGCTTTGTCGCCGGCCATTCTCATGGCATGAACTTCTGCGTGTGCTTCACCCGCTTTGGCATGAAAACCTTCACCAACAATGTTGTCGCCGTGCGCGATAACGCAGCCCACATTTGGGTTTGGTGCAGTGGTATAAATGCCGCGCTTAGCCAGTTGCATGGCACGCGACATCATTTGAAAATCTAACGGAGAAAATTTTGACATGAGTGGGAGTTAGTCCTCTAGCTTGGCAATCTCTTCGCCAAACTCTCGGATGTCTTCAAAGCTACGATAAACAGAGGCAAAGCGAATGTAAGCCACTTTGTCGAGCTCTTTCAGTTGGTCCATGACAAGGTTACCAATCATCTCTGTTGGTACTTCACGCTCGCCAGTTGCTCGCAGTTGAGACTTGATAGTACTGATCGCAAGTTCAATGGCGTCTGCGCTCACTGGGCGCTTCTCTAATGCACGCTGTACCCCACCGACCATTTTGTCTTCGTTAAAGGGTTCACGGTTGCCATTCGACTTGATCACTTTTGGCATAACAAGCTCAGCCGATTCGAATGTCGTAAAACGCTCACTACATGCCAGACATTGACGGCGACGACGAACTTGATGCCCGTCTGCGACTAGCCTTGAGTCGATTACTTTAGTGTCGTTCTCTGAACAAAAAGGACAATGCATATCACCTCCAAATAATTGATTGTCAGTGTAACGGAATTGCCAAACGTAGGGAAAGAAAAAGGGCCAATTAAGGCCCTTTTATGTGGTTATTCGTTGATTATGACTAACCGAGCTTCACATTCATCCTAGCTACGAACTAGGTAGTTCGCTTTACCAACCCACTTGTAGCTGGTTAGCTCTTCCAATCCCATAGGACCGCGAGCATGAAGCTTCTGTGTTGATACTGCCACTTCTGCGCCCAGACCAAACTGCGCACCATCCGTAAAGCGAGTTGACGCGTTCACATAAACCGCCGCAGAACCGACTGAGTTAATGAAACGCTCAGAGCTCTCTAGGCTGTTTGTCATGATTGCATCTGAGTGGCTCGCATTATGTACGCGCATGTGGTCAATCGCCTGCGCTACGTCAGCAACCACCTTGACACCTAAGGTGTAGCTTAGCCACTCAGTATCGAAATCATCTTCACCCGCATCACGCACGTCAGCGAAATCAGCAAGCAGCGCTTTAGCACTTGCATCAGCCACTAATGTCACTTTACCCGCTAGGCGTGTTTTAAGCTTACCAAGGAACTCCTCAGCGACCGCTTCATGAACAAGCAAGGTATCCAGAGAGTTACATGCTGATGGGCGCTGAACCTTTGAGTTTTCAACCACATTCACTGATTTTTCTAGGTCAGCCGACTCATCAACGAAGATGTGACTAATACCAAAGCCACCGATGATTACTGGGATAGTGCTGTTCTCTTTACACATTTTGTGTAGACCTGCACCACCGCGTGGGATGATCATATCAACGTACTCGTCTAGCTTAAGCAGTTGAGACACAAGTTCGCGATCTGGCTTCTCAATGTATTGTACCGATGCCGCTGGTAGCTCTGCTTTTTCAAGTGCAGATTGGATAACTTTAACCAGCTCCATGTTTGAGAAAAACGTCTCTTTACCACCACGTAGGATGCTTGCGTTACCTGTTTTCAGGCACAGTGCCGCAATGTCGATGGTTACGTTCGGGCGTGCTTCATAAATAACACCAACAACACCAAGCGGTACGCGACGACGAGATAGTGACATACCGTTTTCTAGTACTTTGCTGTCAATTTCGCTGCCTACAGGGTCATTCAGGCTGATAACGTTGCGCACATCGTTAGCAATACCCGTTAGACGCTCTTCGTTAAGAAGTAGGCGGTCAAGCAGCGCTTCTGTTAAACCAGCTTCACGACCTAATTCGATATCTTTTGCGTTCGCTTCAAGAATGGTCGCCGCATTTGCTTCTAGTTCATCTGCAATGATAGAGAGCGCTTTGTTCTTCTGCGCTGTTGATGCTGTTGCTAGGTGAAAGGCAGCCTCTTTTGCTGCGATACCCATGTTAGTTAAATCCACGTTTAACTCTCCCTTAATTCTGTCTATCTGAAATTCTGGGTACGCTAGTTAATGAAATAGCCACCTTCTATCGGCAGCTATTCATTGGTCTCTCATTTTCTGAGTAAGCGTACTAAGTTTAGCTCGTCTGTTACTCTTGGATCACTACCATGTCATCACGGTGAATAACCTCTGAGCCGTAGTCATAACCCAGAATTTCTCCAATATCTTTACTGTGTTTGCCCGCAATCTTCGCCAAGTCTTGGCTTGAGTAGCTTGCGATACCACGCGCAATCACTTTACCTTTGTTGTCTGTTACTTGAGTCACTTCACCACGAGAGAACTCACCGCGAACTTGAATCACACCTTTAGCCAACAAGCTGCTGCCCTTGGTGTTCACAGCATTCACTGCACCGTCATCAACCACGATATCACCCGCCGATGCAGGGCCAGCCAGGATCCAACGCTTACGGTTTTCAAGCGCCTCTTCCAATGGCAAGAAACGTGTGCCTTGTGGCTCTGCGCTGAGTGAGTCAAATACCACATTCTCTGCGCTGCCTGCTGCGATGATAACTTCAATACCTGCACGACGCGCAATATCCGCCGCTTGCAGCTTAGTGGCCATGCCGCCAGTACCCAAGGTGGTACCACTGCCGCCAGCAATCTTACGCAAGGTATCATCGATGGTCTTAACCTCTTTGATCAACTCAGCATTTGGGTCTTTGCGAGGATCAGCGGTAAATAGACCTTTTTGGTCAGTCAGGAGTAGAAGCTTATCAGCACCACACAAAATACCGACTAAAGCCGACAAGTTATCGTTGTCGCCTACTTTAATCTCGTTGGTTGCTACCGCATCGTTCTCATTAACGATTGGGATAATGTCGTTATCCACCAATGCATTGATGGTGTCACGTGCATTTAGGAAGCGCTCACGGTCATCGAGGTCGGCACGCGTTAACAGCATTTGGCCAATCTTGATGCCGTAGATAGCAAACAAAGACTCCCAAACCTGAATCAGTTGACTCTGTCCTACTGCAGCAAGCAACTGTTTGCTCGCCATTGAGTTGGGCAGTGCGGGGTAACCAAGGTGCTCACGTCCAGCTGCGATAGCGCCAGACGAAACCATAACCACAGAGTGGCCTTGCTTTTTCAGTTCTGCACACTGACGTACAAGCTCTACCATGTGAGCTCGATCTAACGCTAATGTGCCACCAGTTAAGACACTGGTACCCAGTTTAACAACCACAGTTTTGCGCTGTGTTGCTGTCCCGCTTTGATGATTAGTTGTCAT
Protein sequences of DBSCAN-SWA_4 >NZ_CP045338|2568325:2575217|2570271_2570928_-|WP_032551635.1|DBSCAN-SWA MFTGIVEAVGTLSAITPRGEDITVTVNVGKLDMSDVQLGDSIATNGVCLTVVEFDDNSYSADLSLETLKKTGFIDYQAGDKVNLEKAMLPTTRFGGHIVSGHVDGVGEIVERNQVGRAIEFWVEMPAEISKYVAQKGSITVDGISLTVNDLRKNAFKLTIVPHTSSETTIDQFNVGRKVNLEVDVLARYMERLLQGQQEESKPESRLTMEFLQQNGFA >NZ_CP045338|2568325:2575217|2568325_2568796_-|WP_004741508.1|DBSCAN-SWA MKVIEGGFPAPNAKIAIVIARFNSFINESLLSGAIDTLKRHGQVSEDNITVVRCPGAVELPLVAQRVAKTGKFDAIVSLGTVIRGGTPHFDYVCSECNKGLAQVSLEYSLPVAFGVLTVDTIDQAIERAGTKAGNKGAEAALSALEMINVLSEIDS >NZ_CP045338|2568325:2575217|2569051_2570161_-|WP_032551636.1|DBSCAN-SWA MPISTPQEIIEDIRLGKMVILMDDEDRENEGDLIMAAEHVTPEAINFMAMFGRGLICLTLTKERSSRLGLAPMVQDNNAQYTTNFTVSIEAAQGVTTGISASDRAVTVQAAVAKEAKAADLVQPGHIFPLTAQEGGVLTRAGHTEAGCDLARLAGCEPASVIVEILNDDGTMARRPDLEVFAEKHDIKLGTIADLIEYRNNTETTIERVAQCHLPTEFGDFELVTYRDTIDNQIHYAMQKGDLTGEAPLVRVHLHDTFTDLLHSDRGTERSWSLDKAMKRIGEEGGVLVILGNEESSDSLIHKVKTFEAQDKNEQPTMAKKQGTSRRVGVGSQILQDLGVQDMRLLSSSTKRYHALGGFGLNVVEYVCE >NZ_CP045338|2568325:2575217|2572696_2573947_-|WP_152469191.1|DBSCAN-SWA MDLTNMGIAAKEAAFHLATASTAQKNKALSIIADELEANAATILEANAKDIELGREAGLTEALLDRLLLNEERLTGIANDVRNVISLNDPVGSEIDSKVLENGMSLSRRRVPLGVVGVIYEARPNVTIDIAALCLKTGNASILRGGKETFFSNMELVKVIQSALEKAELPAASVQYIEKPDRELVSQLLKLDEYVDMIIPRGGAGLHKMCKENSTIPVIIGGFGISHIFVDESADLEKSVNVVENSKVQRPSACNSLDTLLVHEAVAEEFLGKLKTRLAGKVTLVADASAKALLADFADVRDAGEDDFDTEWLSYTLGVKVVADVAQAIDHMRVHNASHSDAIMTNSLESSERFINSVGSAAVYVNASTRFTDGAQFGLGAEVAVSTQKLHARGPMGLEELTSYKWVGKANYLVRS >NZ_CP045338|2568325:2575217|2570996_2572082_-|WP_152469565.1|DBSCAN-SWA MMSRAMQLAKRGIYTTAPNPNVGCVIAHGDNIVGEGFHAKAGEAHAEVHAMRMAGDKAKGATAYVTLEPCSHYGRTPPCAEGLIKAQVAKVICAMEDPNPKVAGRGIKMLRDAGIEVQVGLLEQDALALNPAFIKRMQTGMPFVQLKMAASLDGQTALSNGQSQWITSPHARRDVQHYRAKAGAVLSTSQTVIEDNASLNVRWDELPQAVQSLYPQSSLRQPTRVILDRQAKLLQAANQGLKLYQSEAPVLRVAESKADIMVALDREGQLDITQLIQDLPSHHIDHIWVEAGATLAKSFIENQLVDELILYLAPKLMGSDGRGLMGALGLSSMDDVIDLSIKDVRMVGPDIRIVAQPVWKS >NZ_CP045338|2568325:2575217|2572121_2572571_-|WP_032551633.1|DBSCAN-SWA MHCPFCSENDTKVIDSRLVADGHQVRRRRQCLACSERFTTFESAELVMPKVIKSNGNREPFNEDKMVGGVQRALEKRPVSADAIELAISTIKSQLRATGEREVPTEMIGNLVMDQLKELDKVAYIRFASVYRSFEDIREFGEEIAKLED >NZ_CP045338|2568325:2575217|2574077_2575217_-|WP_032551631.1|DBSCAN-SWA MTTNHQSGTATQRKTVVVKLGTSVLTGGTLALDRAHMVELVRQCAELKKQGHSVVMVSSGAIAAGREHLGYPALPNSMASKQLLAAVGQSQLIQVWESLFAIYGIKIGQMLLTRADLDDRERFLNARDTINALVDNDIIPIVNENDAVATNEIKVGDNDNLSALVGILCGADKLLLLTDQKGLFTADPRKDPNAELIKEVKTIDDTLRKIAGGSGTTLGTGGMATKLQAADIARRAGIEVIIAAGSAENVVFDSLSAEPQGTRFLPLEEALENRKRWILAGPASAGDIVVDDGAVNAVNTKGSSLLAKGVIQVRGEFSRGEVTQVTDNKGKVIARGIASYSSQDLAKIAGKHSKDIGEILGYDYGSEVIHRDDMVVIQE |
7 | Staphylococcus_phage(66.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2786070 : 2798139
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP045338|2786070:2798139|DBSCAN-SWA ATTACAACTCTTCTTGTGGCATTTCGCCAGTCTCTGGCTCTTCAGGCTGAGCTGGAGTCAGTAGCATTTCACGAAGTTTAGCATCGATAGTTTTCGCTGCTTCTGGGTTCTCACGAAGGAACTTACCAGCGTTCGCTTTACCTTGACCAATCTTATCGCCGTTGTAGCTGTACCAAGCACCTGCCTTTTCAATTAGCTTGTGCTTAACACCTAGGTCAATCAGCTCACCTTCACGGTTGAAGCCTTGGCCGTATAAAATTTGAGTTTCTGCTTGCTTAAATGGTGCAGCAATCTTGTTCTTAACAACCTTGATACGAGTTTCGTTACCAACAATCTCGTCGCCCTCTTTGATAGAGCCAGTACGACGAATATCAAGACGAACAGATGCGTAGAACTTAAGTGCGTTACCACCAGTCGTCGTCTCTGGGTTACCAAACATAACACCGATCTTCATACGAATTTGGTTGATGAAGATACACATACAGTTAGATTGCTTAAGGTTACCCGTTAGCTTACGCATAGCTTGAGAAAGCATACGTGCTTGAAGCCCCATGTGGCTGTCGCCCATTTCGCCTTCGATTTCCGCTTTCGGTGTCAATGCTGCTACTGAGTCGACAACAAGAACGTCAATCGCACCTGAACGCGCTAGCGCATCACAGATTTCTAGTGCTTGCTCACCTGTATCTGGTTGAGAAACAAGAAGTGCATCGATATCAACACCCAGTTTTTGAGCGTAGATAGGGTCTAGTGCGTGCTCTGCATCGACGAATGCACACGTTTTGCCTTCACGTTGAGCTGCTGCAATAAGCTCTAGTGTTAGGGTTGTTTTACCTGATGATTCTGGACCGTAGATCTCAACGATACGACCCATTGGTAGACCACCAGCACCTAGTGCGATATCTAGAGATAGCGAGCCAGTAGAAATCGTTTCTACATCCATTGTGCGGTTGTCACCAAGACGCATGATAGAGCCTTTACCAAACTGCTTTTCAATCTGACCAAGGGCTGCGGCTAACGCTTTTTGTTTATTCTCGTCCATTACTATCTCCACATAATCGGTTGTCTATAACAAGCGATTTAGAATCTGTTTCTAACCCACATAGGGGTGACTAATTGGTGTTTATTATACTGTTGATTCATACAGTGTCCATACCTGTATGAAAATTATTTGTACAATTGTTACTTCTCTTCCAACAGGTAATCATAAATAACCTCAAGCGCACAGGCGGTCGCTTGTTGTCGTACTTGTGATCGATCACCCGCAAATACATGCGTTTCTACTTTTTGCCAACCACTGTTTGAAGCCCAAGCAAAACAGACCGTGCCGACGGGCTTCTCTTCACTGCCTCCGCCAGGGCCTGCAATACCACTGATCGATACGGCGATCGTGCCATTAGAGTGCGCCAACGCGCCTTGCGCCATTTCAACCACCACAGCCTCACTGACTGCACCATGCGCTTCCAGCGTTGCATCTTGCACGTCAATCATCTCTTGTTTGGCTTCATTGCTGTAAGTGATAAAAGCACGGTCAAACCATGCTGAACTACCGGCTATATCTGTGATTGCACTTGCAACGCCACCACCAGTACACGACTCTGCCGTGACCAAAACGTGCTGATTTTTTAATAAAAGCTGGCCGAGTTTTTCGCTGAGTTGTTGAATCGATGACATGGTGACAAACTCCATTGTTTCTGCATGGTTTCTACATGAGTGAATTTGAATCTTTATCGCTTGCGCCGCTTTTCGCAGGTATCGCTTTCACGTATCCTAAGCCGCAATAGAAATAAACGAAAGAAGTTAACAGTGAAAGCCGATCAAAAACATACTCCCATGATGCAGCAGTACCTAAAACTCAAAGCAGAGAATCCAGAGATCCTGCTTTTCTATCGCATGGGAGATTTCTACGAGCTTTTCTATGATGATGCCAAAAGAGCCTCTCAACTCCTTGATATTTCCCTAACCAAACGTGGCTCATCTGCTGGCGAACCTATCCCAATGGCAGGCGTGCCGTTTCATGCTGTAGAAGGCTACCTCGCTAAACTGGTTCAACTCGGTGAATCTGTGGCGATCTGCGAACAAATCGGTAACCCTGCGACATCGAAAGGTCCAGTAGAGCGTGCCGTTGTTCGTATCGTGACACCGGGTACCGTCACTGACGAAGCCCTTCTTTCTGAGCGCGTTGATAATCTGATTGCTGCGATTTATCACCACAACGGCAAGTTTGGTTACGCAACTTTAGATATCACTTCAGGTCGATTCCAACTTTCGGAGCCAGAGACCGAAGAGTCAATGGCCGCTGAATTGCAACGCACTTCTCCAAGAGAACTGCTCTTTCCAGAAGACTTCGAACCTGTAGAGTTAATGGCGAACCGTAACGGTAACCGCCGTCGCCCAGTATGGGAGTTTGAATTAGATACGGCCAAGCAGCAACTCAACAAACAGTTCGGTACTCGCGACCTTGTCGGCTTTGGTGTGGAACATGCCAAGCTTGGTCTATGTGCTGCGGGTTGTTTGATCCAATATGTAAAAGATACCCAACGTACCGCTCTACCGCATATTCGCTCACTGACTTATGATCGCCAAGATCACTCTGTGATCTTAGATGCAGCGACTCGTCGCAACCTAGAGATCACTCATAACCTTGCAGGTGGCACCGACAACACGCTGGCAGAAGTGCTCGATCACACAGCAACGCCAATGGGTAGCCGTATGCTCAAGCGTTGGTTACATCAGCCAATGCGCAATATCTCAGCCCTCGACCAACGCTTAGATGCAATTGGTGAGATGAAAGATTTATCTCTATTCACAGAGCTGCAGCCAACACTGAAACAGATTGGTGATATTGAGCGTATCCTTGCTCGATTAGCTCTTCGCTCTGCACGTCCGCGTGATATGGCCCGCCTGCGTCAAGCGATGGACTACCTTCCAGAACTTGCAGATACATTAGAACAAGTCTCTCATCCATACCTGACTCAGCTTGCACAATATGCAGCACCTTTTGAAGAAGTGTCTGAGCTGCTTACTCGTGCGATCAAAGAGAACCCACCAGTTGTGATTCGCGATGGCGGAGTAATTGCTGCAGGGTACAGCGCAGAGCTAGATGAGTGGCGCGACCTTGCAGACGGTGCAACAGAGTACTTAGACAAACTCGAAGCAGAAGAGCGTGAACGCCACGGGATTGATACGCTAAAGGTTGGCTACAACAACGTGCACGGCTTCTTTATTCAGGTGAGCCGGGGTCAAAGCCACTTAGTTCCACCTCATTACGTACGCCGCCAAACGTTGAAGAACGCTGAGCGCTACATTATCCCTGAGCTAAAAGAGCACGAAGACAAGGTTCTGAACTCAAAATCTAAAGCACTAGCGATTGAAAAGCAGCTTTGGGAAGAACTATTCGACCTACTATTGCCGCACCTAGAGCAACTGCAGAACATCGCCTCTTCTGTATCGCAATTAGACGTTCTTCAAAACCTAGCTGAACGTGCTGATACTTTAGATTACTGCCGCCCTACTCTAAACAAGGAGCCTGGAGTTCACATTCAAGCGGGTCGTCACCCTGTGGTTGAACAGGTGATGGACGAACCTTTCATTGCCAACCCGATTGACTTGAATGATCAACGTAAGATGCTGATCATCACAGGTCCAAATATGGGCGGTAAATCGACCTATATGCGTCAAACTGCGTTGATCGCACTAATGGCTCATATTGGTTGTTACGTTCCTGCAGAGAGTGCAACCATTGGCTCTATCGATCGTATCTTCACTCGAATTGGTGCATCTGATGACCTTGCATCTGGCCGTTCAACCTTCATGGTAGAGATGACAGAAACAGCGAACATTCTCCATAACGCAACACCGAGCAGCTTGGTTCTGATGGATGAGATAGGTCGTGGTACTAGCACCTATGATGGCCTATCACTGGCTTGGGCAAGCGCGGAGTGGTTAGCAAATCAAATCAATGCGATGACACTGTTTGCGACTCACTACTTCGAACTGACTGAACTACCAAACCAGATCCCAACCTTGGCGAATGTTCACTTGGATGCCGTTGAGCATGGCGACAGCATTGCCTTCATGCACGCAGTTCAAGAAGGTGCTGCAAGCAAATCTTACGGTCTTGCAGTAGCTGGGCTTGCGGGAGTGCCAAAAGCCGTGATTAAGAATGCGCGAGCTAAGCTCACCCAATTGGAAGCGCTAAGTGCTGAGACACCAACAGCAAAACCAAGCGGTGTCGATATTGCGAATCAACTTAGTCTTATCCCAGAGCCAAGTGAAGTAGAGCAAGCGCTAGCAAACGTAGATCCAGATGATCTAACACCACGCCAAGCACTGGAAGAGTTGTACCGCCTAAAGAAACTTCTCTAGTCAAAAGAAAGTCTCTTTAACCTCGATAAAAACAAAAACGCTGACATTGAGTCAGCGTTTTTTATTGGGCATTTCTAGAACTAGACTAATCGTTTTCGACGTTGAATAGATTTTCCATATTCAAGCCTTGCTTGATCAGGATTTCTCTTAAACGACGTAGACCTTCTACTTGAATCTGGCGCACACGCTCACGACCGACTTCTTCAAGAGTCGATGGCTCGTAACCAAGAAGCCCAAAACGACGTGCAAGCACTTCTTTCTGCTTAGGATTCAGTTCATCTAACCAGTAGATCAGTGATGTTTTGATATCGCTGTCTTGAGTCGACACTTCTGGGTCTGAGTTGTTTACGTCTGGAATGATGTCCAGCAGCGCTTTCTCGCCGTCACCACCGATTGGTGTATCGACTGAGCTTACACGTTCATTAAGGCGTAGCATTTTGCTTACGTCGCTCACCGGCTTATCTAGCTTGGTTGCGATCTCTTCCGCTGTTGGCTCATGGTCAAGCTTTTGTGATAGCTCACGAGCGGTACGCAGGTAGATGTTTAGCTCTTTCACAACATGAATTGGCAGACGAATAGTACGCGTTTGGTTCATCAACGCACGTTCGATCGTTTGACGAATCCACCATGTTGCGTAAGTAGAAAAGCGGAAGCCTCGTTCTGGATCGAACTTCTCAACAGCACGGATCAAACCTAGGTTACCCTCTTCGATAAGATCGAGAAGAGCTAAGCCACGGTTGCTGTAACGACGTGAAATTTTAACCACAAGACGTAGGTTACTTTCGATCATGCGCTTGCGAGCAGCTTCGTCTCCGCGTAAGGCGCGACGAGCATAAAGCACTTCTTCTTCTGCGGTTAAGAGTGGTGAGAAGCCAATTTCACCCAAATACAACTGAGTCGCATCTAGGCTTTTCGAAGTCACTTCGACTTCTTCTTTTGCTTCTGTTTTTTTAGTGACGGTTTGTTCGCCTTGCTCAACATTTGCCAACTCCGTGGTCTCTTGGTCAAGTTCGAACTCTTCTTTAATAACTGCATTGCTGATACTCATAACGCCTCCCCCTGGCGATATTAGCAAGACATTACAACTTCAAATGTCGCTAAATGACTATGTCACTTTATACAAGTACCATTGCGGCACTCATATTAAGGTAAATAACGTTTTGGATTCACGGATTTACCTTGGTAACGAATCTCAAAGTGCAGCCTAACGCTTTTGGCTCCAGAACTTCCCATTGTGGCAATTTTCTGTCCTGGTTTAACACTTTGCCCTTCAGATACTAATAGTCGATCGTTATGCGCGTATGCACTTAAGTAGTTGTCATTATGCTTTACAATCACGAGATTGCCGTAGCCTCTGAGTGCATTACCCGAATAAACAACCGTTCCCCCTGCAGTAGATACGATAGGCTGACCACGCTGTCCTGCTATGTCGATGCCTTTATTTCCTTGTTCACCTACAGAGAAATTCTTGATTACTCTCCCTTTGGTTGGCCATAACCATTTCGATACTTTATCGTTAGTTGGCTTGCTGGCGGTTGTAACATTCTGTTTACCTTTAGAACCAACATACTCCTTTGGCTTCGATTGTTCAACCTTCTTTGGTGGATCTTTTTTAACCGTTTTCGCCGTTGACTGCACAGGTGCCGGTTTAGAGTTTTTACTCTTCACAGGTTCAGCTTTGGTGGTCGCTGGCTTAGAGGATTTCGAGCTGGAGGAGCTCGGCGTGCTCACAACGGGAACTATCGGTGCTGGTTTAGGTGCAAGGGTAACGCTGGCAACGGCCGCCGACTTGCCATAAGCAGGAGCCTTGTATGACGGACGCCATAACTTAAGCTTCTGCCCTGGATGGATAGTATAAGGCGCAGAGAGATCGTTGTAGTGAATCAGGTCATTGACATCTTTGTTGGTCAAATAAGCGATAAAGTAGAGGGTATCGCCTTTCTGAACTTCATAATAACTGCCACGATAGCTGCCACGCTCAATGGTGGAGTAATCTTTGTTCAGGTTGGATACAGGTGCAGGGGAATTCGCTGCGCACCCTGCTAGTGCGCAACTCAACAATATCGCTATCCCTTTTGAAAACTGTGAATACATTCAGTTGTTATCCATCTACACCAATAGTTTTGTTACGCCAATTCACCCGGAACCAAAGGAACGAAACGTACCATCTCAATCGTTTCTGAAAGGTACTCGTCACCTTGACGGGTAATTTTTAACAGTTGTTGCTCGTTGTCCCCAACCGGGATCAGTAAACGTCCACCATCACTCAATTGCTCTAAAAGCGCTTGAGGGATTGATTCAGCAGCTGCCGTTACGATGATGGCATCAAATGGGCCTTTCGCAGCCCACCCTTGCCAGCCATCGCCATGTTTCGTGGAAATGTTATAAAAATCAAGCTGCTTGAGCCTTCTTTTCGCATCCCACTGTAATGACTTGATACGCTCAACAGAATAAACATGATCGACTAACTGAGCCAATACCGCTGTTTGGTACCCCGAGCCAGTTCCAATCTCTAAGACGCGGCTAGTCTGTTGAAGTTCCAACAGTTCCGTCATTTTCGCCACTATATAAGGTTGAGATATCGTCTGACCTTGTCCAATAGGCAATGCATTGTTGTCATAAGCTTGGTGGTACATCGCCTGCGATAGAAAGCGCTCGCGAGGCAAACGGAATATCGCATCTAGAACCTTTTGGTCTCGAATGCCACTCTCAATCAAGAAACTGACTAAGCGGTGTGCGTGAGGGTTAGTCATTGGCGCTCTCCTAGCCACTGCGACATGGCACCTAAAGATTCATGAGCTGTTAGATCAACCTGCAGCGGCGTCACTGAAACGAAGCCATGCTCTATCGCATAGAAATCTGTTCCCTGTCCTGCGTCCTGCTCTTTACCTGGAGGACCTAACCAGTAGATATCATGACCCCTTGGATCTTTCTGCTTAATCATGTCTTCAGAGTGATGGCGAGCACCAAGGCGCGTCACCTGTGTTCCCTTAAGATCACCAACAGCAACATCTGGAACATTGACGTTCAACAAACGATTGGTTGGGATCGGTCGAGCTAAGTGCTGTTCAACGATTTGGCGTGCAATAAGAGCGGCGGTTTTAAAATGTTGTTTGCCTACCAAAGAAAACGCGACTGACTGTACACCTAAAAAGTGCCCTTCCATCGCTGCGGCAACGGTGCCTGAATAGAGCACATCATCGCCTAGATTTGCTCCGTGATTAATGCCAGCCAGCACTAAATCAGGCATATCATCCTTAAGCAGCTCATTCAATGCAAAGTGCACACAATCCGTCGGTGTACCTTGCACAGAGTAAGTGTTAGCTGCGATTTCTTGAACGCGCAGAGGCTGCTCAAGCGTTAATGAATTCGACGCACCCGAGCGATTACGGTCAGGTGCAACGATAATAACGTCAGCAAGGTCGCGTAACTCATCAGCTAGGGCATGAATACCCTGAGCATGAACGCCATCATCATTGCTGAGTAAAATCTTCATTCTGTATCCTTTCTTATATCCGTCCAATTTACGCCTTTACTGCGCGTATTGTCGCTCTACAGCCACTTCTTCCACTAGCTCACGAACAATCGCTGTCGCAAAGCAACCTGCATCCAATGAGAACTTAAGCGTGATGTTGTCGCCATCCACTTCCCATGAGAGCTGTTGTGGTTTAAGAGAAGCTTCACGACGATCATGACGCATACGGTTTCCGCGAATGAGCGCCATTAAATCTGGCTCGTCATCAAGGTGCTTCTGCTCAAGAGCGAGTGCATTCTCTTGTGTCGGCAATGCATTGTCGCCAGCCATCGCTACAGAAACCTGCGCTTCACCTTTCTCAATCGCCAAGTTCAACGCTTCAATGTTTTCACCCGTTGCCAACAGTTGCTCACCAGCGCTGAAAACGACGTCGCCAAGAATTGCTGAGTTAAACACGTTCTGTTCAATACGCTCTGACAAAATGCGATTGAAAATCCAAGAGCGAGCTGCAGATAGATATAAACTACGCTTGTTCTGGTTACGTGTACGCACATTATCACGCCCCCAGCGACGAGCTTCAGACAAATTATTACCTTCATTGCCAAAACGCTGAGCACCAAAATAGTTCGGTACACCTAGCTGAGCCACTTTCTCAAGGCGATTAACCACGTCAGCGGTGTCAGTCACTTCAGACAGAGTCAGTTCAAATTGGTTGCCAACTAGGTCGCCCGGGCGCAGCTTTTTGTTGTGGCGTGCCGTCGCTAAGATCTCAATACTTGGGTACTGAGCTAAGAAAGCTGTAAAGTCTGGCGTCCCCCCCTTTGGTAGGTGAACACTCAACCATTGCTCAGTCACTGCATGACGGTCTTTCAGACCCGCCCAGCTCACGTCTTTCGATTTAACGCCACATGCTTTTGCCAGTTCATTGGCAACAAAGCTGGTGTTCTCACCGGTTTTGCGGATACGCACCATTAGGTGCTCACCTTCTCCAGTGAATTCGAAACCTAAGTCTTCGATAACCACAAAATGCTCGGCCTTCGCTTTCAGTTTAGCTTGTGCCGTCGGCTTGCCATTCAAATAGGCAAATGAAGATAGAATATCTGACATGCTGTTCTTTTCGTTTTGGTGCTTACGCACTTTTAGTAATTAGGACGACTGCTTCGCTTGCAATGCCCTCTTTGCGTCCGGTAAAACCAAGACGCTCAGTTGTTGTCGCTTTTACATTCACGTTGCCAATATCTGTTTCAAGATCCTCAGCAATCGCTTGGCACATTGCATCAATGTGCGGGGCCATTTTTGGCGCTTGTGCCATGATGGTCACGTCCACATTACCGATTTGGTAACCTTGTTCTTTAACGCGGCGGTAAACATCTTTAAGCAGATCACGACTATCTGCTCCCTTCCACTCATCATCGGTGTCTGGGAAGTGACGACCAATATCGCCTGCGGCAATCGAGCCTAATAGCGCATCACATAATGCGTGCAAAGCCACATCACCATCTGAATGCGCAACCAGTCCTTGCTCATAAGGAACGCTCACGCCACCAATAATCACTGGGCCTTCACCACCAAATTTATGAACATCAAAACCGTGGCCAATACGAATCATCATTATTCCTTATTCTGACTTAAATAGAACTCAGCCAATGCTAAATCTTCTGGCTGAGTAACCTTAAAATTATCTGAGCGCCCAGCAACCAGAGCTGGTGATGCGCCTTTCCACTCAAACGCAGACGCCTCATCGGTAATCGACACGCCTTGTTCTAGAGCCTCTGTTAGAGCCTGCTGCAGCGGTAAAGTTCGAAACATTTGCGGCGTCAATGCGTGCCATAAGTTTGCACGCTCTACCGTGTGATCAATCTGCCCATCAACACCACGTTTCATCGTATCACGAACGGGAGCAGCCAAAATGCCACCTACCGGATGAGACAGTGCTACTTCAATCAGCTTATCGATGTCGTGCTGTTGAATACATGGCCTAGCAGCATCGTGCACCATGACCCACTCGCCCAAACCGTTCTCTTGAATATAGTTCAGAGCAGACAGAACCGAGTCTGCTCGTTCATTACCACCCTTAACACGCACCACATCTGGATGTTGGCTCAAGCTCAGCTCAGGGAAGTATGGGTCACCTTCACTTACGGCTACCACGACCTTGCTCACTTGAGGATGAGAAAGTAGCTTGTTGATGGTGTGCTCCAAGATGGTGATCGCGTTAATCGTCAGATATTGTTTTGGGCGATCGGCCTTCATTCTACTGCCTACGCCAGCCGCAGGTACAACTGCGACAATACCTTGTTTATCAGTCGACATTAATGCGATTCCTCACCAATTATGCGATAAAACGTTTCACCTTCTTTCACCATTCCGAGCTCATGTCGAGCACGCTCTTCGATGGCGTCTAACCCTTGTCGAAGGTCGTCGATCTCAGCAAACATTTCTGCATTTCGAGTATGCAGCTTCTCATTCACTTGTCTTTGAACGTCGATTTCGTTGTTCACTGTGTAGTAATCAGAAATGCCATTTTTACCGAGCCACAATGTGTGTTGCAGCCAGCCAAGTAAGATCAGCAGGACAAGAGCAAAGATTCGCATAACACCTTTCGCCGGTTAGAAAAGAAGGAACTAGAATAAGGCACATATATACCATAATGCGCTTATTGGTGCGAGATATGTGGTGTAAGGGAATCGATAAGTAGCAGAATTATCGGACAGAATCTTTAATGGTAACAAAGATACTCCCGATTCATTCACCTCCCACTCTCGTCATTCCCAAGAAGGAGGGACGACTGAGTTGGGAATCTATCGAAGTTAACATTAAGAGATTCCCGACTCCTTCCTTCGTCAGTCTAGGGAATGACGAATTGTAGGCAAAAAAATGCCCCGCATTAGCGAGGCATTTAAAGCTTAGATTAACAATTCAATGTATGAATTATTGACCTTTAACTTCTTTAAGACCGTTGTAAGGAGCTTTAGAACCTAGAGCTTCTTCGATACGGATTAGTTGGTTGTACTTAGCAACACGGTCAGAACGGCTCATAGAACCAGTCTTGATTTGACCTGCAGCAGTACCTACCGCTAGGTCAGCGATAGTTGCGTCTTCAGTTTCGCCAGAACGGTGAGAGATTACAGCTGTGTAACCTGCGTCTTTAGCCATCTTGATTGCAGCTAGAGTCTCAGTTAGAGAACCGATTTGGTTGAACTTGATAAGGATAGAGTTAGCTACGCCTTTCTCGATACCTTCAGCAAGGATCTTAGTGTTAGTAACGAATAGGTCGTCACCTACTAGTTGAAGCTTGTCACCTAGTAGTTCAGTTTGGTGCTTGAAGCCATCCCAGTCAGACTCGTCTAGACCGTCTTCGATAGAAACGATTGGGAATTGGTTAGCTAGCTCAGCTAGGTAGTGGTTGAACTCTTCAGAAGAGAAAGTCTTACCTTCGCCTTTCATGTTGTAGATGCCAGCTTCTTTGTCGAAGAACTCAGATGCAGCACAGTCCATAGCTAGAGTAACGTCTTTACCTAGTTCGTAACCAGCAGCTGCAACAGCTTCTGCGATAACTTCTAGAGCTTCAGCGTTAGACTTAAGGTTAGGAGCGAAACCACCTTCGTCACCAACTGCAGTGCTGTAGCCTTTAGACTTAAGAACTTTAGCTAGGTTGTGGAATACTTCAGCACCGATACGTAGACCTTCTTTAAGAGTCTTAGCGCCAACTGGTTGGATCATGAACTCTTGGATGTCAACGTTGTTGTCTGCGTGCTCACCACCGTTGATGATGTTCATCATTGGTAGAGGCATAGAGAACTGACCAGCAGTACCGTTTAGCTCAGCGATGTGCTCGTATAGAGGCATGCCTTTAGCCGCTGCAGCAGCTTTAGCGTTCGCTAGAGAAACAGCTAGGATTGCGTTAGCACCGAACTTAGATTTGTTCTCAGTACCGTCTAGCTCGATCATTACTGCGTCGATTGCAGCTTGGTCTTTAGCGTCTTTACCAGTTAGAGCTGTAGCGATTTCGCCGTTTACAGCTTCAACAGCTTTAAGAACACCTTTACCTAGGAAACGTGCTTTGTCGCCGTCACGTAGCTCAAGAGCTTCGCGAGAACCAGTTGATGCGCCAGATGGAGCAGCAGCCATACCTACGAAACCGCCTTCTAGGTGTACTTCAGCTTCTACAGTTGGGTTACCACGTGAATCGATGATTTCACGACCTAGAACTTTAACGATCTTAGACAT
Protein sequences of DBSCAN-SWA_5 >NZ_CP045338|2786070:2798139|2787872_2790434_+|WP_152469288.1|DBSCAN-SWA MKADQKHTPMMQQYLKLKAENPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGSSAGEPIPMAGVPFHAVEGYLAKLVQLGESVAICEQIGNPATSKGPVERAVVRIVTPGTVTDEALLSERVDNLIAAIYHHNGKFGYATLDITSGRFQLSEPETEESMAAELQRTSPRELLFPEDFEPVELMANRNGNRRRPVWEFELDTAKQQLNKQFGTRDLVGFGVEHAKLGLCAAGCLIQYVKDTQRTALPHIRSLTYDRQDHSVILDAATRRNLEITHNLAGGTDNTLAEVLDHTATPMGSRMLKRWLHQPMRNISALDQRLDAIGEMKDLSLFTELQPTLKQIGDIERILARLALRSARPRDMARLRQAMDYLPELADTLEQVSHPYLTQLAQYAAPFEEVSELLTRAIKENPPVVIRDGGVIAAGYSAELDEWRDLADGATEYLDKLEAEERERHGIDTLKVGYNNVHGFFIQVSRGQSHLVPPHYVRRQTLKNAERYIIPELKEHEDKVLNSKSKALAIEKQLWEELFDLLLPHLEQLQNIASSVSQLDVLQNLAERADTLDYCRPTLNKEPGVHIQAGRHPVVEQVMDEPFIANPIDLNDQRKMLIITGPNMGGKSTYMRQTALIALMAHIGCYVPAESATIGSIDRIFTRIGASDDLASGRSTFMVEMTETANILHNATPSSLVLMDEIGRGTSTYDGLSLAWASAEWLANQINAMTLFATHYFELTELPNQIPTLANVHLDAVEHGDSIAFMHAVQEGAASKSYGLAVAGLAGVPKAVIKNARAKLTQLEALSAETPTAKPSGVDIANQLSLIPEPSEVEQALANVDPDDLTPRQALEELYRLKKLL >NZ_CP045338|2786070:2798139|2793183_2793930_-|WP_152469291.1|DBSCAN-SWA MKILLSNDDGVHAQGIHALADELRDLADVIIVAPDRNRSGASNSLTLEQPLRVQEIAANTYSVQGTPTDCVHFALNELLKDDMPDLVLAGINHGANLGDDVLYSGTVAAAMEGHFLGVQSVAFSLVGKQHFKTAALIARQIVEQHLARPIPTNRLLNVNVPDVAVGDLKGTQVTRLGARHHSEDMIKQKDPRGHDIYWLGPPGKEQDAGQGTDFYAIEHGFVSVTPLQVDLTAHESLGAMSQWLGERQ >NZ_CP045338|2786070:2798139|2793966_2795016_-|WP_152469292.1|tRNA|DBSCAN-SWA MSDILSSFAYLNGKPTAQAKLKAKAEHFVVIEDLGFEFTGEGEHLMVRIRKTGENTSFVANELAKACGVKSKDVSWAGLKDRHAVTEQWLSVHLPKGGTPDFTAFLAQYPSIEILATARHNKKLRPGDLVGNQFELTLSEVTDTADVVNRLEKVAQLGVPNYFGAQRFGNEGNNLSEARRWGRDNVRTRNQNKRSLYLSAARSWIFNRILSERIEQNVFNSAILGDVVFSAGEQLLATGENIEALNLAIEKGEAQVSVAMAGDNALPTQENALALEQKHLDDEPDLMALIRGNRMRHDRREASLKPQQLSWEVDGDNITLKFSLDAGCFATAIVRELVEEVAVERQYAQ >NZ_CP045338|2786070:2798139|2796840_2798139_-|WP_032549795.1|DBSCAN-SWA MSKIVKVLGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKARFLGKGVLKAVEAVNGEIATALTGKDAKDQAAIDAVMIELDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTAGQFSMPLPMMNIINGGEHADNNVDIQEFMIQPVGAKTLKEGLRIGAEVFHNLAKVLKSKGYSTAVGDEGGFAPNLKSNAEALEVIAEAVAAAGYELGKDVTLAMDCAASEFFDKEAGIYNMKGEGKTFSSEEFNHYLAELANQFPIVSIEDGLDESDWDGFKHQTELLGDKLQLVGDDLFVTNTKILAEGIEKGVANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGSKAPYNGLKEVKGQ >NZ_CP045338|2786070:2798139|2795520_2796222_-|WP_152469293.1|DBSCAN-SWA MSTDKQGIVAVVPAAGVGSRMKADRPKQYLTINAITILEHTINKLLSHPQVSKVVVAVSEGDPYFPELSLSQHPDVVRVKGGNERADSVLSALNYIQENGLGEWVMVHDAARPCIQQHDIDKLIEVALSHPVGGILAAPVRDTMKRGVDGQIDHTVERANLWHALTPQMFRTLPLQQALTEALEQGVSITDEASAFEWKGASPALVAGRSDNFKVTQPEDLALAEFYLSQNKE >NZ_CP045338|2786070:2798139|2792560_2793187_-|WP_150869276.1|DBSCAN-SWA MTNPHAHRLVSFLIESGIRDQKVLDAIFRLPRERFLSQAMYHQAYDNNALPIGQGQTISQPYIVAKMTELLELQQTSRVLEIGTGSGYQTAVLAQLVDHVYSVERIKSLQWDAKRRLKQLDFYNISTKHGDGWQGWAAKGPFDAIIVTAAAESIPQALLEQLSDGGRLLIPVGDNEQQLLKITRQGDEYLSETIEMVRFVPLVPGELA >NZ_CP045338|2786070:2798139|2790519_2791482_-|WP_152469289.1|DBSCAN-SWA MSISNAVIKEEFELDQETTELANVEQGEQTVTKKTEAKEEVEVTSKSLDATQLYLGEIGFSPLLTAEEEVLYARRALRGDEAARKRMIESNLRLVVKISRRYSNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERALMNQTRTIRLPIHVVKELNIYLRTARELSQKLDHEPTAEEIATKLDKPVSDVSKMLRLNERVSSVDTPIGGDGEKALLDIIPDVNNSDPEVSTQDSDIKTSLIYWLDELNPKQKEVLARRFGLLGYEPSTLEEVGRERVRQIQVEGLRRLREILIKQGLNMENLFNVEND >NZ_CP045338|2786070:2798139|2795038_2795518_-|WP_152469570.1|DBSCAN-SWA MIRIGHGFDVHKFGGEGPVIIGGVSVPYEQGLVAHSDGDVALHALCDALLGSIAAGDIGRHFPDTDDEWKGADSRDLLKDVYRRVKEQGYQIGNVDVTIMAQAPKMAPHIDAMCQAIAEDLETDIGNVNVKATTTERLGFTGRKEGIASEAVVLITKSA >NZ_CP045338|2786070:2798139|2791577_2792528_-|WP_152469290.1|DBSCAN-SWA MYSQFSKGIAILLSCALAGCAANSPAPVSNLNKDYSTIERGSYRGSYYEVQKGDTLYFIAYLTNKDVNDLIHYNDLSAPYTIHPGQKLKLWRPSYKAPAYGKSAAVASVTLAPKPAPIVPVVSTPSSSSSKSSKPATTKAEPVKSKNSKPAPVQSTAKTVKKDPPKKVEQSKPKEYVGSKGKQNVTTASKPTNDKVSKWLWPTKGRVIKNFSVGEQGNKGIDIAGQRGQPIVSTAGGTVVYSGNALRGYGNLVIVKHNDNYLSAYAHNDRLLVSEGQSVKPGQKIATMGSSGAKSVRLHFEIRYQGKSVNPKRYLP >NZ_CP045338|2786070:2798139|2787248_2787740_-|WP_152469287.1|DBSCAN-SWA MSSIQQLSEKLGQLLLKNQHVLVTAESCTGGGVASAITDIAGSSAWFDRAFITYSNEAKQEMIDVQDATLEAHGAVSEAVVVEMAQGALAHSNGTIAVSISGIAGPGGGSEEKPVGTVCFAWASNSGWQKVETHVFAGDRSQVRQQATACALEVIYDYLLEEK >NZ_CP045338|2786070:2798139|2786070_2787108_-|WP_032549782.1|DBSCAN-SWA MDENKQKALAAALGQIEKQFGKGSIMRLGDNRTMDVETISTGSLSLDIALGAGGLPMGRIVEIYGPESSGKTTLTLELIAAAQREGKTCAFVDAEHALDPIYAQKLGVDIDALLVSQPDTGEQALEICDALARSGAIDVLVVDSVAALTPKAEIEGEMGDSHMGLQARMLSQAMRKLTGNLKQSNCMCIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRTGSIKEGDEIVGNETRIKVVKNKIAAPFKQAETQILYGQGFNREGELIDLGVKHKLIEKAGAWYSYNGDKIGQGKANAGKFLRENPEAAKTIDAKLREMLLTPAQPEEPETGEMPQEEL >NZ_CP045338|2786070:2798139|2796221_2796503_-|WP_032549794.1|DBSCAN-SWA MRIFALVLLILLGWLQHTLWLGKNGISDYYTVNNEIDVQRQVNEKLHTRNAEMFAEIDDLRQGLDAIEERARHELGMVKEGETFYRIIGEESH |
12 | uncultured_Mediterranean_phage(25.0%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP045342_1 | 10339-10451 | Orphan |
NA
Consensus repeat of NZ_CP045342_1
|
1 spacers
spacers of NZ_CP045342_1
>1.1|10371|49|NZ_CP045342|CRISPRCasFinder GGGTTCTGGCTCAGGGTCTGGTTCAGGCATCACAGGTGGGTCGATCGGG |
CRISPR arrays and Neighbor proteins around NZ_CP045342_1
The CRISPR arrays of NZ_CP045342_1 >merge|NZ_CP045342|1|10339-10451|CRISPRCasFinder TCTGGTTCTACAGGAGGCTGAGGTTCTATTGGGGGTTCTGGCTCAGGGTCTGGTTCAGGCATCACAGGTGGGTCGATCGGGTCTGGTTCTACAGGAGGCTGAGGCTCTATTGG >NZ_CP045342|1|1|10339-10451|CRISPRCasFinder TCTGGTTCTACAGGAGGCTGAGGTTCTATTGG GGGTTCTGGCTCAGGGTCTGGTTCAGGCATCACAGGTGGGTCGATCGGG TCTGGTTCTACAGGAGGCTGAGGCTCTATTGG
>NZ_CP045342.1|WP_152471341.1|5632_5893_+|hypothetical-protein MAQAKNKLEKLATWNYSRRFGNMNAWLNDDEGGVRKVPKDVIQSIINDASSGKFCTVKAVAAKNRVTDATAAKYLRMHNVELKKPL >NZ_CP045342.1|WP_152471340.1|4503_5646_+|galactokinase MSNLFDKVQSAFEQIFGEVNTHLIQAPGRVNLIGEHTDYNDGYVLPCAIDYQAAVAAKKSGDNMVQVYSVDYNQTDAFYLDKEITHHPEYMWANYVRGVVKCLQSRGFELGGVNLTVSGNVPQGAGLSSSAALEVVIGQTFKNLFNLEISQAEIALNGQQAENEFVGCNCGIMDQLISAEGVKDHALLIDCRSLETYPIGMPKGMSVMIINSNKKRGLVDSEYNTRREQCEEAAHFFGVKALRDVSLSQLENRRDGLDDVVVKRARHVITENARTLAASKALTEGDMGLVGKLMEESHASMRDDFEITVPEIDYLVGLVKSVIGEQGGVRMTGGGFGGCIVALCPPPLVDDVVTAVTERYQAHTGLKEDIYVCSAENGAG >NZ_CP045342.1|WP_152471339.1|3449_4499_+|UDP-glucose--hexose-1-phosphate-uridylyltransferase MSEKFNPVDHPHRRYNPLTGQWILVSPHRAKRPWSGQDEKPSNEQLPAYDENCFLCAGNTRISGDVNPDYKGTYVFKNDFAALMENSPAAPKSNNPLFKVEGVTGLSRVICFSPDHSKTLPELPVKKIRGVVDTWTEQIEELGKDYVWVQAFENKGETMGCSQPHPHGQIWANSFLPNEIERKEKNLKEYYQENGSNLLVDYVKSELEDGSRIVVETEHWVAVVPYWAAWPFETMLMPKTHIRRMSELTDEQRDDLAVAIKKLTSRYDNLFQCSFPYSMGWHYAPFFEEGTDIEHWQLHALFYPPLLRSASVRKFMVGYEMLAESQRDLTAEQAAQRLRDLSDIHYKEQ >NZ_CP045342.1|WP_152471338.1|2437_3448_+|UDP-glucose-4-epimerase-GalE MKVLVTGGMGYIGSHTCVQMINAGIEPIILDNLCNSKIAVLNRIEELTGKLPTFYRGDIRESEILDLIFEEQDIDAVIHFAGLKAVGESVAKPVHYYQNNVAGTLCLVESMKKAGVKNLIFSSSATVYGDPAVVPITEGSPTGGTTNPYGTSKHMVERCLMDLVDAESDWSITLLRYFNPVGAHESGRIGEDPQGIPNNLMPFIAQVAVGRRDHLSVFGDNYDTPDGTGVRDYIHVVDLADGHLAALNKLKDKGGLHIFNLGTGNGTSVLQMVEAFGQACGAPIKYETCPRRPGDIAACWASTQKAERELGWKATRTIAEMTADTWNWQSNNPEGY >NZ_CP045342.1|WP_152471337.1|1803_2205_-|hypothetical-protein MKKIMPYWAIALMTLNLLVTVVNAAKFTTVIGVFSAYLAISPILLAFKWIHPEWLAIDTEGKMSKHMRAMVLWPITLYEYHSHKRVVKKITVLVEALEQMNGDEIVKTQEWAELIQCRKKLSVFTAKAKKYSK >NZ_CP045342.1|WP_152471336.1|365_785_-|hypothetical-protein MSNYGIVELSNRPGSFRARVPMLKKGTYTSKDFSNEKYLHPMTSAQAWRDELGKSTWGEEIWMYIKKTKSVRCLWGKKGGVSVCHKTIEGEGEIWIVMWREKGKPKIMPFLLKDDIEAKKNAEEYAKKKKVELSFLNRI >NZ_CP045342.1|WP_152471412.1|15_303_+|hypothetical-protein MLCNECPGSYPGIADIQDISYDVEASIQGVNQKFSLQSGVTSTINLVRDSNGYFGGNGVAKVKFDLEYDRSDSTTTVAIDLNYEDTLTLVFEPVI >NZ_CP045342.1|WP_152471343.1|11208_11706_-|hypothetical-protein MRKAAALLCTLASFSSVASSNYQAVAVSDDWSVGLWTNPDTKERTCVVAPTTMNKRTIPAIVFIPKAPGGARANLIASGSFEGIGATYTVDNSSSVTVGLKNRKVPYGHLVTDHEYQLMLERFKDGLTVTYEVFSSNKFVEGGKRTMSLDGFRTAYELAQRCQWY >NZ_CP045342.1|WP_152471344.1|12566_13109_-|prepilin-type-N-terminal-cleavage/methylation-domain-containing-protein MKKNGFTLIELVVVIVILGILAVSAAPRFLNLQVDARNATLHAMKGTINGAMGIAYGKLAIQGLENESRVSSADYPDLFSKCGTDEGDISNLCVFRYGYPAADWGTLPALVTNLSAVEGSDWFIHDQNTVSVSGGFMSSIKISPKGFRDTTKCHILYENWLVLVDTAMYSPPKLTVVPCD >NZ_CP045342.1|WP_152471345.1|13526_13874_-|hypothetical-protein MVLFALLSTGVFADSIRTSDGISCSFDAGDTPFEIIAYTEKDLADYDGENSYYNDNTESRIGMRLKYKFGGPKRLDCDKLYQMELRDKEARVRQLEEQIRAMEAVNSVDWDIFSK >NZ_CP045342.1|WP_152471346.1|13882_14266_-|hypothetical-protein MVKQILIVTLLTLSSLCVAQSQSPSSYGGKSNPTEAVTLTGVKVVPISITNTDDFHQAFDITVNGKVVMETVSLGRGVEQTINVPVRLKAMGIPEKFKICSISKPLASQSMYRIKLCTVAKLLWVKK >NZ_CP045342.1|WP_152471347.1|14306_14738_-|hypothetical-protein MKKLILALAVIAASNSVVAATSDNMTFSKTITSACTATVDTPTGALILDGTSSAPAESEQAKVTFSTTANTITYQLSSVSDTGSTTLPGGTTYKLYNDTNVEVSSSQVGVAKGATIGFFVRSDNIVTDAGDYQATAVLEVTCS >NZ_CP045342.1|WP_152471348.1|15331_16423_+|diguanylate-cyclase MKAFVYIALASSVIGLAVLDFNVSAVSRIDSMAEFSQAIRNYTSKLNLSAAQDFPYNVPSFAAVHVMKELNDLSVNMKYREVSNNPSNRQNLANDHEKKIIMGAKKSVAQYGFSSDIDHIYLIRYDPIYVKESCLSCHGKVENASAQQHRHYGKKTGYGYKEGDLYGVKVISIDITWFLIRYLAFLVIFVIPVIMWFKGFDKKLTEALNVDSLSGAYNKNYYLKSKDRLIDNGYVLLFDVDDFKNINDTYGHTCGDNVIWQVCELIKSKCRKTDMLFRFGGEEFVLFLKGINSKERVISIVGDIMTEISAHEFVHEGIKFNVTVSIGICKKAPSISTEDAISKADLAMYDVKKHGKNNFKFAD >NZ_CP045342.1|WP_152471349.1|16472_17624_-|diguanylate-cyclase MNQDVNLSSKSIIGEIRYKVLHYGYLWVIVTQALLCMTVGYYIVADPRSVSPEYFSTLAISATLLLVSCGVFFLNKPARYYDVLLSTYALLISISAILLTHITASAFLSPDDGKKILLILFLIMLMSWNANRVVLATGTLPIVCFYIYLTSIRPNVIMLDILISAMKFPILSMAFYSMSNKFFELFESKFIENFNQIHRLERVSYVDELTRIKNRKGFNSALRIAIDSARRSKSTLAIVILDIDFFKQYNDTKGHPAGDLCLRNVAAILRSQCKRKIDSVCRIGGEEFALIMPATSAHQAASVVDNIRTALAKTAIEHPKSSISKHVTVSIGIAEFGDKDDFESLYQRADKAMYKAKVAGRNQYIVCLEECAYCHSNSGVKTV >NZ_CP045342.1|WP_152471350.1|17857_18118_-|hypothetical-protein MDTQNSDLSQCILDCYVIGYYHDAMFGRMPIVAFIDEGNEKEGLMAPDHITQRSIMACYKAGTYDCLPLKIKNMIQKYEQKCGMEQ >NZ_CP045342.1|WP_172974344.1|18350_18890_-|outer-membrane-beta-barrel-protein MNKTLFAVAMTSIVSSASVFAADDVKHEPAKLYVSVGKIDQTADEYIPEPAGFELTFGGDINDYLGVSLSFAKADETYNTVLAETTTFGINVDLGKKFDVNEDFQIKPYGLVGIKETEFTLSNSYAEASDSSAYMGFGLGVRATFKQALVIGFEYVSNSASDDYFPDYDQTKWFVGYQF >NZ_CP045342.1|WP_152471352.1|19073_20507_-|hypothetical-protein MFNKKMLATSLTAIMLTACGGGSGSGGSDGTSVSIQGKAIDGYIQGATVFLDLDFDKVLDSNEPHIVTVDEGDFELVVGPDIKECVKYVPVVVDVPIGAIDLDKPEEPIQEAYQMVIPPEFALTTDSDLKNVTPLTSVIWSEVESELRTVTQQPLSCQTLIKGEALRKDIAQRLRDQEVQMHTRYNITAERLYGDYIDTNDTEAHTLAKRLIPGLQKTYKETKAIRDENPSAIAFVEFFLGEYSVEEGYEDKWYRRQFIQPSTGNYTSNVWEMSEDLETRIKLFSQDQLVTVQRNGINVETAHGLIWDSHSSNYQCASAEALESLDKPYSYGINNTVYLSVEDWNTCKAADYEQGSVEQTVLLKTFSGQQGELTNYSHHTYNSATSSGLEYLIGDESLTKAELIAAIKTKGISHDFYDESDYGTSYWFRTRNQWGSDPTQVVTTHDKDGNWQQSHYFANGTHKKLCGQSEETLVPCE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP045342_1 | 1.1|10371|49|NZ_CP045342|CRISPRCasFinder | 10371-10419 | 49 | NZ_CP045342 | Vibrio sp. THAF190c plasmid pTHAF190c_d, complete sequence | 10371-10419 | 0 | 1.0 |
1. spacer 1.1|10371|49|NZ_CP045342|CRISPRCasFinder matches to NZ_CP045342 (Vibrio sp. THAF190c plasmid pTHAF190c_d, complete sequence) position: , mismatch: 0, identity: 1.0
gggttctggctcagggtctggttcaggcatcacaggtgggtcgatcggg CRISPR spacer gggttctggctcagggtctggttcaggcatcacaggtgggtcgatcggg Protospacer *************************************************
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP045343_1 | 3105-3197 | Orphan |
NA
Consensus repeat of NZ_CP045343_1
|
1 spacers
spacers of NZ_CP045343_1
>1.1|3128|47|NZ_CP045343|CRISPRCasFinder CCTACATTCAAATCTCAATACCCTTCCTCTCAACCATTGTCCGATTT |
CRISPR arrays and Neighbor proteins around NZ_CP045343_1
The CRISPR arrays of NZ_CP045343_1 >merge|NZ_CP045343|1|3105-3197|CRISPRCasFinder GGTCTAAATTGAAATCAAATACACCTACATTCAAATCTCAATACCCTTCCTCTCAACCATTGTCCGATTTGGTCTAAATTGAAATCAAATACA >NZ_CP045343|1|1|3105-3197|CRISPRCasFinder GGTCTAAATTGAAATCAAATACA CCTACATTCAAATCTCAATACCCTTCCTCTCAACCATTGTCCGATTT GGTCTAAATTGAAATCAAATACA
>NZ_CP045343.1|WP_102353116.1|2640_3000_-|hypothetical-protein MARSRVLQAAIKGDLEKDFAEFQKRTQMNDADAVRDLLTFALRIKLNDSEDTRPSNRELMEEMYRRIRQIQGTGNLTHSQSFDGAAFYSKKDEAGEMRNLVNADVNNKVDEFLSGEKKG >NZ_CP045343.1|WP_152471417.1|3566_4010_-|transglycosylase-SLT-domain-containing-protein MRCLLCLLSLFPCFSKAFCFDEAGRYYNVDPRLLKSIATVESSLNPNAYNENKNQSGQVVSRDFGLMQINSHWFDKLSDFNVNETNIYEPCFNVHLGAWVLSSNFASHGYNWNSVGAYNAGFHKRTENARKIYIKKVQSVYYQLNLK >NZ_CP045343.1|WP_152471418.1|4103_4607_-|DnaJ-domain-containing-protein MNINDALSILKLNKENETYSRDEVKKVFRKLINQYHPDKEGGDVHFSQLLNQAWESLRKFDQVTFTGEADGQSENLSEILKTAIDQIKNLDGLNIEICGTWIWVTGETKRHKEKLKESKFFYSKNKTAWYFKTGTAKRRGSSKSLDDIRESHGSQVVKRSYKKMLAA >NZ_CP045343.1|WP_152471419.1|4643_5105_-|hypothetical-protein MIQPKFYKRELPFIIPVNQEYKTLSGLLSDYQSDVLSGSLEKEIQSIKASGFLSNTKAIVNKNRRWTYQQILSIKSEIINASPVEFGNIELIQDTNFHGVCLTFKYKSLTPSILFQSTKDGFIATAINYNISSLPVKTPSQIKISIRNKKNIH >NZ_CP045343.1|WP_152471420.1|5188_5557_-|hypothetical-protein MPVYKLENFDMSEWLEQANWLAADHQSLPAIPLLIQLGLSLDQACELRHDAFCTMGESLHDGLTDYLYKVKGRVSATPSKFTGDALLNHETEALFDRLERIDYWGSEHELDGWWLCNRWDFN >NZ_CP045343.1|WP_152471421.1|6107_6494_-|hypothetical-protein MFPSHDKQTVSNDLTATSVSTLDPEGSPKNDNVLIRDILANGWNCMPKELEPIWTDYDGIEVMFCIEHQNQGGDITLEPFLESSLEEGTPVKCWSVFGHCIDGGIECLHDCKTQHEANTLAKYCERMI >NZ_CP045343.1|WP_152471422.1|7185_7446_-|hypothetical-protein MKPTIDLALFLSQSLKIDRAFLNDCMSMIANKETLVSLHMYADELAVYINIDAEQIVSQLQDFSCAYIGNPLKHICIKDYQPEPSL >NZ_CP045343.1|WP_152471423.1|7495_7771_-|hypothetical-protein MSINNQQFQKVVHGCGHKWLALNEPNYLKYLDCIEYVYANDGYAFVRMTDESSAINLHEELVNRYRRYNQIEVKCRRYDMRLFEVSFELAP >NZ_CP045343.1|WP_152471424.1|7821_8886_-|hypothetical-protein MNYSQKLLAEHLQKHPEHNQYVSAIDLSKAERAFHATSFSPERRGVLVVVDYAQRCEGFKVQLNQAITTAKANGIVKVCEQHLEEIHKHFQKEIAQLFINYIHAESVCMSSAITGPANFPVERNKKRLASAMNKYDAIAEREKQLYKRAVKMLLPDGDGTVIKSDSASAVQQLEAKLEQLVKEKEMWIAINKMVRKVYPKGQLKAGATQNDIDELIKDLQTSFGLYEDAAINLLKPCRITSKVVAFPSYSLTNRSQEIKRTEQRIKEQKKLEEGRESNQLNDSLDNGIEYELTEDNKIAIHFTTKPSEEVRSILKSNAFKCSRHRNYAWVRKHTLNAQNAFVREVLPVLKDIAA >NZ_CP045343.1|WP_152471425.1|9020_10574_-|DGQHR-domain-containing-protein MLEKMLLNEPFSYRFPATRGLQHKPYFQINVPLSILVKMLNLDDKGSTLERSQRSVNENRAKGFGEYLIRNVKENSFYIFPSIIGVIDTPVNSKPATFFDVYEVLNVKRSNDVNMLSQGVLVVSMDSTIKLFDGQHRSSGSAYAIRKIATDPELKDLDLSNIHVPFMAYTDLTLVERQIGFSDTNNNLSKPPAAISIAYDHRDAMSQFAVELSTELMCFKDMVDFERNAITGANLNYFSLKTIRDATQSFLGLNKKQAKEGLTAEQKEIAKEFWMTFSRHTGWSALNFGLNDARDHRETHLNTYAVFLKALALAAKNILTNFISFDKVDLSKLDDLDVSRWSDDFKDRAFDQVSGKMKPDSTGVMLTANKLQLAVGCPLDIEQAQLEKQYFGEFVQPKVVQQPEPQAEEQEVEEVVFGQMGQDFTTEDATVIATIVGKKWSDTVTEEQIEDAAAKIYKVVGEFAEEDGQASIEFLKDIKQCLVPDSNDNIESDWRALNRINSLRSQLKKFHDECVAA >NZ_CP045343.1|WP_152471426.1|10896_11415_-|hypothetical-protein MSCSYVGDLFLAHLKSSLSSIAYSGKLHDVDSLRTFLASLDPNPYCGSPSVGVLNLIRLFHVANHRAYRLRYPHKEVEAFQSYKRRDPSVTPTNHRPKADYKQIVATVKAIDCLIDDCNEDGEGVVPCDSRKCWAHFLELHRIMKNIMIQNCPEYQSAQWRVSDFIEWEKMY |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP045343_1 | 1.1|3128|47|NZ_CP045343|CRISPRCasFinder | 3128-3174 | 47 | NZ_CP045343 | Vibrio sp. THAF190c plasmid pTHAF190c_e, complete sequence | 3128-3174 | 0 | 1.0 |
1. spacer 1.1|3128|47|NZ_CP045343|CRISPRCasFinder matches to NZ_CP045343 (Vibrio sp. THAF190c plasmid pTHAF190c_e, complete sequence) position: , mismatch: 0, identity: 1.0
cctacattcaaatctcaatacccttcctctcaaccattgtccgattt CRISPR spacer cctacattcaaatctcaatacccttcctctcaaccattgtccgattt Protospacer ***********************************************
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|