Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP040640 | Agrobacterium sp. T29 chromosome circular, complete sequence | 1 crisprs | WYL,DEDDh,csa3,cas3 | 0 | 1 | 3 | 0 |
NZ_CP040641 | Agrobacterium sp. T29 chromosome linear, complete sequence | 1 crisprs | csa3,DEDDh | 0 | 0 | 1 | 0 |
NZ_CP040642 | Agrobacterium sp. T29 plasmid unnamed2, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP040640_1 | 1383426-1383505 | Orphan |
NA
Consensus repeat of NZ_CP040640_1
|
1 spacers
spacers of NZ_CP040640_1
>1.1|1383449|34|NZ_CP040640|CRISPRCasFinder CGGAGCGGCTTCTTCGCCTTCAGCGGCGTCGTCA |
CRISPR arrays and Neighbor proteins around NZ_CP040640_1
The CRISPR arrays of NZ_CP040640_1 >merge|NZ_CP040640|1|1383426-1383505|CRISPRCasFinder TTCTTGGCCGGAGCCTTCTTCTTCGGAGCGGCTTCTTCGCCTTCAGCGGCGTCGTCAGCCTTGGCGGCAGCCTTTTTCTT >NZ_CP040640|1|1|1383426-1383505|CRISPRCasFinder TTCTTGGCCGGAGCCTTCTTCTT CGGAGCGGCTTCTTCGCCTTCAGCGGCGTCGTCA GCCTTGGCGGCAGCCTTTTTCTT
>NZ_CP040640.1|WP_142779287.1|1382656_1383184_-|AAA-family-ATPase MGVRNYLIEGVSGTGKTSVATELQRRGYHVIHGDRELAYKGDPETGEPVDLSPFQGDGDMVYRHRRHIWDVEKVQALVTDRRHANTFFCGGSRNFQRFIELFDQVFVLDVDVATLRRRLTERPEDEFGGKPAEREFVLQLHATKEDLPADATVIDSSRSLDMVVDDILVRCVDSA >NZ_CP040640.1|WP_142779286.1|1381742_1382393_-|hypothetical-protein MSPRMGFTLLLSLFTATGTAAHADEAGLIWKPVKNSDRSYTARIGAKLPVDTPIRAGLEMGMSASKTGQVVDTPVRVWGNVTLLAEQLPGVSLARDVGVIFNALTGSSSVSVTSQQKRIVTPELDIEANRNFTVRYDGTAQQWNGLDVSQSLRLSRSETGTAFVLTGASRNSFNEFSSGVAVEQKLGDHLTVRGTLDQGYADHFRPGVSARYSIRW >NZ_CP040640.1|WP_142779285.1|1380092_1381496_+|Si-specific-NAD(P)(+)-transhydrogenase MHQFDLIVVGSGPAGRRAAIQAAKLEKRVLVIEKGSRVGGVSVHTGTIPSKTLRETALNLTGWRERGFYGRAYRVKQEIDAEDLRRRLLITLDHEVEVLEHQFARNRVQHIRGTASFIDANTMKVVKSDGEIMTVTGTSILLTIGTRPYRPPHIPFDGEAVLDSDEILEIKELPRSMVVVGAGVIGIEYATIFSALDTQVTVVEPRETMLEFIDKEIVEDFTYQLRDRNMKLIFGQKAEKVERDESGKCLVSLGNGRVLKAETVLFAAGRVGATDTLNLSACGLEADSRGRLKVDPETFQTSVPNIYAAGDIIGFPSLASTSMEQGRIAARHAVGAPAGEPPQFFPYGIYAVPEISTCGLTEEEVIERGIPYECGIAHFRETSRGHIMGLDSGLLKMIFSLKTRRLLGVHIVGEGATELVHIGQAVLNLKGTVEYFVENTFNYPTLAEAYKIAGLDAWNRMGEIRKD >NZ_CP040640.1|WP_142779284.1|1378805_1379339_+|DUF1003-domain-containing-protein MSDISDYIVSHFKRSSREIGEVERRILELSHQKKLVSSDTNAEFSAGASFGDRLADNIAKVGGSWGFILGFCFFLIFWAVINTIILTTGAFDPYPFIFLNLLLSMLAAIQAPIIMMSQNRQAARDRFEAAKDYEVNLKAELEVLSLHEKIDVKVLAELAALRQDLAALHRHVTRRED >NZ_CP040640.1|WP_168208027.1|1376748_1378689_+|LTA-synthase-family-protein MGLRDSAPKTTASEKAGFVFSPAWTRSLSKLSGIAYPLANLTVASVVLVVALEWIARGSLTDVGAFLTSSARPGMTTIAAVLALLVALDALLGRRYLSLIALAPLCALTGLISAQKQTYLSDPLYPSDLLFGRQILELLPTMLKAQPMTAALVALGICATIAALTGLWLLARRHSPGLSWRERAAGLALTLPLLAGLASLMDYSHYSWVRDRLNIIPMMWDQRENYRHNGFLMAFAFNIPMANVSAPQGYGENTIADLTSEPAAFAANKGDYPDVIMLMSESLWDPTRLENVKLSADPMPTIRAKQSGNVFSPEFGGMTANVEFEALTGFSNAFLPYGSIPYQQYIRRPVPSLASFFRGEGYSAIAMHPFQEWFWNRKQVYRNFGFEEFRSEETLPAMEKRGNFASDDALMDEIMATAEKAQNPLFLFAVTLQGHGPYEATRYAENTIGIEGDLSASASQALATYSQGVAEADEALLKLMRWAKKRDRETIIVLFGDHLPPLGQTFVESGYMPGMVASRRAPLEVMKKEHETPLVVWSSKKGVRKNIGTISPALLPYHVLKTAGFSDPFYTGTLGDVQQAFSVIDRHMLVTTDGKALPDWSIAPNAVPDVVRDYRLLQFDMMFGQQYGRERFFPGFNWLHEGAPSV >NZ_CP040640.1|WP_142779283.1|1374846_1376526_-|NAD+-synthase MSDRHDIQNHLRIAVGQFNPTVGDVAGNLAKAREARADAATQGADLLLLTELFISGYPPEDLVLKPAFLKACLKAVEELAAETADGGPGVVIGFPRQGETGRHNSVALLDGGKIIALRDKIDLPNYGEFDEKRVFSEGSISGPYNFRGVRIGIPICEEIWNDMGVCETLAESGAEILLVPNGSPYYRGKLDVRHQVALRQVIESGLPLVFANQLGGQDELVFDGASFGFNADKTLAFQMSQFEATLAVTDWKRTADGWHCDSGPFSKIPEGEEADYRACMLGFRDYVNKNGFKSVVLGLSGGIDSAICAALAVDALGEERVRCIMLPYRYTSEESLKDAADCAKALGCRYDIVPIVEPVEGFLSALSDLFEGTEEGITEENLQSRTRGTILMAVSNKFGSMVVTTGNKSEMSVGYATLYGDMNGGFNPIKDLYKMQVYAISSWRNAHVPPGALGPSGEVIPANIISKAPSAELRPNQTDQDSLPPYPVLDDILECLVEKEMSVEEILARGHDVATVHRVEHLLYLAEYKRRQSAPGVKITKKNFGRDRRYPITNRFRDR >NZ_CP040640.1|WP_142779282.1|1374215_1374593_+|VOC-family-protein MNTIADHGIRFGRIAAMLPVKNIEKAHDFYVGVLGFEKTFENGTPVGFMILKQGNAELHLTLQPSHKAAPFNVAHMMVSNVDALHALCKSQGLRIIKGLQDKDYGLRAFVFEDPDGNRIDVGQVI >NZ_CP040640.1|WP_142779281.1|1373196_1374027_-|hypothetical-protein MRSSIEIFNIRTRQMRAVWQTPDLFEAPNWSPDGKYLLLNSEGLLYRLSLAGDISPEKVDTGFATLCNNDHGISPDGSLYAISDKVEFGKSAIYLLPSAGGAPRLMTKNLPSYWHGWSPDGKGFAYCGIRDQVFDIYSMDITSGVETRLTHGEGRNDGPDYSPDGEWIYFNSSRTGRMQIWRVRVDGSAVERITDSPYGDWFPHPSPRGDKVVFVSYDGDVFDHPRDLDVRVRLMDMDGGNAETLFELFGGQGTMNSPNWSPDGDEFAYVRYFPVE >NZ_CP040640.1|WP_059760139.1|1372157_1372874_-|5,6-dimethylbenzimidazole-synthase MPADSSVSNPPGASSFDHALSPARPFSCEEREAIYRAIETRRDVRDQFLPDPLPDDLVERLLKAAHSAPSVGFMQPWNFTLVTDGAIRQAAFVAFSRANEEAAAMFTGEQQALYRSLKLEGIRKAPLSICVTCDPTRGGKVVLGRTHNPRTDVYSTVCAIQNLWLAARAEGIGVGWVSIFHDSDIRTILDIPDHIEIVAWLCLGRVDALYNEPELAVKGWRQRVPLEELVFRNRWGGV >NZ_CP040640.1|WP_168208026.1|1370836_1371952_+|DUF2865-domain-containing-protein MTRRSRIIGLLLPLIFLAPAAAFADQVCDTLYAQLREPPRVIGNTSEVRRYANALARQNIVIRKIRNDLRGYGCSSGSVIVYGNPNAGICAEIGDALAEAESERDAIIRDRDDAMAAARDNDGDIRRQRILAALDANGCNTMPQTETQLPPPPDVTRYPDAFRQNGPQNDDEPGQAGLSPYPNAAAEGGLRTLCVRTCDGSFFPIASNASPLDFRAQAEQCEKMCPGTETELYFHSMTDQETADMVSAETGKPYRDLPTAFAYRNATAKAPGCACNMAAYHKEMQKQEEAARPQPEKPYSGITTIPSPQGDKAEKPTEQQQAAKPPEQPVPERDYDPNDSRVRVIGPKFLPDQTGRIDLKNPALKGIQPQQ >NZ_CP040640.1|WP_004433052.1|1385862_1386009_+|DUF1127-domain-containing-protein MNIARSLTNWRKYRQTVTELGRMTDRELSDLGIGRQDIRRVAKTAVGF >NZ_CP040640.1|WP_003495735.1|1386489_1386633_+|DUF1127-domain-containing-protein MNPIRIAKNWISYRRTINELGSLSNQALSDIGLTRYDIRNVAARSFR >NZ_CP040640.1|WP_168208028.1|1387239_1388682_+|methylenetetrahydrofolate--tRNA-(uracil(54)--C(5))-methyltransferase-(FADH(2)-oxidizing)-TrmFO MDASMQDKTTSPIHVVGGGLAGSEAAWQIAQSGVPVILHEMRGVRGTDAHKGDTLAELVCSNSFRSDDATANAVGVIHAEMRLAGSLIMACADRHQVPAGGALAVDRDGFSEAVTKELESHPLVTIIREEVNGLPPKEWGNSIIATGPLTSPDLAAAIQAETGEDALAFFDAIAPIVHRDSINMDICWYQSRYDKVGPGGTGKDYINCPLNEEQYNAFIDALIAGDTVGFKEWEGTPYFDGCLPIEIMAERGRETLRHGPMKPMGLTNAHNPTVKAYAIVQLRQDNALGTLYNMVGFQTKLKYGVQADVFRMIPGLENAEFARLGGLHRNTYIDSPILLDRSLKLKSRPDLRFAGQITGCEGYVESASVGLLAGRFAAAEQKGEAPSLPPATTALGSLLNHITGGHLSSDDEPGKRSFQPMNINFGLFPELAPGSIVKPEGVKRFRGKDKTIMKRQLIAARALRDCAAWLDESQAETEAV >NZ_CP040640.1|WP_142779290.1|1388688_1389291_-|pilus-assembly-protein-TadG MKKTTWFLRLAGFGRCRSGAAAVEMGLLAPLLVLMLAVIIEVGRGWLSYDRFMTIVDNSARWAARFPEFEERVRTGVPSFVVLSGSGILQTGKLDLTLRSVKLVDKVARLQFPAHNFLGSAEDVPWEKTVIANGFVAQEAIIVVSGRYSYRPLISVLADITLKFEYVAAVNPFFSQRYPYQSGKSDFAKWNLKRSPFKAN >NZ_CP040640.1|WP_142780173.1|1389287_1389854_-|pilus-assembly-protein-TadE MKSLLSPRQDKSRGGPSRCRVTHAKCYRLLGDRKAATAVETALLLPLFFALIFGTLEIGLLMLYYLYLSFASNAGIEYLRKAASDGKPATEIALRKAISSRFIGGTDETTLKIALLPIPDDDIAEAKVPIPIVNDFRPPADTAGQYILAIGYNWNFLMPTTRFLVPDTGGIHQLRNISLAITAVRVTE >NZ_CP040640.1|WP_142779291.1|1389855_1391070_-|pilus-assembly-protein MTSRRIKKAAWKWSVFSSLLRDRAGTFAIMTALLLPVFIILLGLLFEGGRALAYYNQSKRVMAMACERATKPTRTYTLLDTVRRDNVTAAFDAMIQSTRQKVLSRDVQVKWTETKINAEFSYGLIFSEMFNLEKLKYRLAYSCEGIPPYPEDDAVIIDNMFESNALGVERVLKNGVTKETPGGCWGVYPYSEIGWDGGTGPGVELQDWSSPCCRRNHNWEGYPAGMQSKKLNEAPTANDKACTLKEVDKAKTTDKIEIDKKAGTELSLPTRYVMELDSDWGPPKPGKKKNIEANSSIYKDVELHPGIYKIMVWYNGRRAVEDVEKTNGIKISLQQLLPDLKPQQRVWELTQDKNSIAWTPRDYSFRVKAYSIYRVTIEATGLSDSFGGIITGFQLIYVDRMEEG >NZ_CP040640.1|WP_003525152.1|1391499_1392627_+|S-(hydroxymethyl)glutathione-dehydrogenase/class-III-alcohol-dehydrogenase MDVRAAVAIQAGKPLEVMTVQLEGPRAGEVLVEVKATGICHTDDFTLSGADPEGLFPAILGHEGAGVVVDVGPGVTSVKKGDHVIPLYTPECRECYSCTSRKTNLCTSIRATQGQGVMPDGTSRFSIGKDKIHHYMGCSTFSNYTVLPEIALAKINPDAPFDKVCYIGCGVTTGIGAVINTAKVEIGSTAIVFGLGGIGLNVLQGLRLAGADMIIGVDINPDRKAWGEKFGMTHFVNPKEVGDDIVPYLVNMTKRNGDLIGGADYTFDCTGNTKVMRQALEASHRGWGKSVIIGVAGAGQEISTRPFQLVTGRNWMGTAFGGARGRTDVPKIVDWYMEGKIQIDPMITHTMPLEDINKGFDLMHKGESIRGVVVY >NZ_CP040640.1|WP_168208029.1|1392914_1393655_+|hypothetical-protein MSLVSEHHERRELGAIGQLLMKGDDAAGVLLTIKSRLPETGRIVVNLSSWYVEPSCRWFAPRMLQMASSNEDEIFTDLTPSPEACKLNERLGFATVTDCTLFYPLPFAALRPASARLRPPGEIKPEILSGEMRDMLEDHARLGCIVAVMEAENRHYPLVFLKTTTKRLPSARLIHCEDRQVAQRHISAIARHLLGHGRLALTMAATGAERKAGGLAAHKSAPIQVKGAWNPRFINEAYSELVLLPP >NZ_CP040640.1|WP_020810249.1|1393760_1394024_+|acyl-carrier-protein MLAAKKSEIDVADTIYSYLSNRFPAYAPFSADTLLLEGGVIDSLGFLELMIFLGEGFGIILDDEHFTPENLGTPADLIAFVLRERRR >NZ_CP040640.1|WP_142779293.1|1394020_1395553_+|AMP-binding-protein MTPHFLLHHLLTARAASDDQALVHKDRSLNYREFSAAAARCAAALQEAGAQRGDRVVIYLPRGIEECWSIFGVSMASGVFVPVNALLKAQQIRHIVKDCGAKIVISDAAMMDELKAALEDLPDVTVLLAEEIEARADTPARPSAAIGEDLAAILYTSGSTGSPKGVMLSHRNLLAGARIVRTYLDITGSDRILSLLPFSFDYGLNQLLTAVEQGAATIISTFRLGDEIVRDLRDHAITGLAGVPTIWAILTKAAPSLTKTPLPHLRYITNSGGRVPQETVKALREKLPDTKIYLMYGLTEAFRSTFLPPEEIDRRPTSIGKAIPECEIFIVTAEGQRAKPGEPGILVHRGPTVSLGYWNRPEDTAKVLRPHPFIPAALGGETVCYSGDLAVEDEDGFFSFVARNDAMIKSSGYRISPTEVEESLMSTGLFQQVAVIGLPDPFAGEKVHAVATAANQNIDVTAALKKAAEMLAPFMIPRAIELVERLPVTANGKVDYRALVRERTDNGANG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP040640_1 | 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder | 1383449-1383482 | 34 | HQ331142 | Salmonella phage S16, complete genome | 92743-92776 | 8 | 0.765 |
NZ_CP040640_1 | 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder | 1383449-1383482 | 34 | NC_012586 | Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence | 1761529-1761562 | 9 | 0.735 |
NZ_CP040640_1 | 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder | 1383449-1383482 | 34 | NZ_CP043499 | Rhizobium grahamii strain BG7 plasmid unnamed, complete sequence | 1224348-1224381 | 9 | 0.735 |
NZ_CP040640_1 | 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder | 1383449-1383482 | 34 | NC_008826 | Methylibium petroleiphilum PM1 plasmid RPME01, complete sequence | 110083-110116 | 9 | 0.735 |
NZ_CP040640_1 | 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder | 1383449-1383482 | 34 | JX181825 | Salmonella phage STML-198, complete genome | 134228-134261 | 9 | 0.735 |
NZ_CP040640_1 | 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder | 1383449-1383482 | 34 | KJ000058 | Salmonella phage STP4-a, complete genome | 93822-93855 | 9 | 0.735 |
NZ_CP040640_1 | 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder | 1383449-1383482 | 34 | NC_042044 | Salmonella phage Melville, complete genome | 94122-94155 | 9 | 0.735 |
NZ_CP040640_1 | 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder | 1383449-1383482 | 34 | NZ_CP013110 | Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence | 1248601-1248634 | 10 | 0.706 |
NZ_CP040640_1 | 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder | 1383449-1383482 | 34 | NZ_CP024310 | Sinorhizobium fredii strain NXT3 plasmid pSfreNXT3c, complete sequence | 850917-850950 | 10 | 0.706 |
NZ_CP040640_1 | 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder | 1383449-1383482 | 34 | NZ_CP013054 | Sinorhizobium americanum CCGM7 plasmid C, complete sequence | 1160346-1160379 | 10 | 0.706 |
NZ_CP040640_1 | 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder | 1383449-1383482 | 34 | NZ_CP023064 | Sinorhizobium sp. CCBAU 05631 plasmid pSS05631b, complete sequence | 670407-670440 | 10 | 0.706 |
1. spacer 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder matches to HQ331142 (Salmonella phage S16, complete genome) position: , mismatch: 8, identity: 0.765
cggagcggcttcttcgccttcagcggcgtcgtca CRISPR spacer ctcatcttcgtcatcgccttcatcggcgtcgtca Protospacer * * * * ** ********* ***********
2. spacer 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder matches to NC_012586 (Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence) position: , mismatch: 9, identity: 0.735
cggagcggcttcttcgccttcagcggcgtcgtca CRISPR spacer ctttactacttcttcgccttcggcggcttcgtcg Protospacer * .* .*************.***** *****.
3. spacer 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder matches to NZ_CP043499 (Rhizobium grahamii strain BG7 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.735
cggagcggcttcttcgccttcagcggcgtcgtca CRISPR spacer ctctactacttcttcgccttcggcggcttcgtcg Protospacer * .* .*************.***** *****.
4. spacer 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder matches to NC_008826 (Methylibium petroleiphilum PM1 plasmid RPME01, complete sequence) position: , mismatch: 9, identity: 0.735
cggagcggct--tcttcgccttcagcggcgtcgtca CRISPR spacer --caccagtcgatcttcgccttccgcggcgccgtca Protospacer * *.*.. *********** ******.*****
5. spacer 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder matches to JX181825 (Salmonella phage STML-198, complete genome) position: , mismatch: 9, identity: 0.735
cggagcggcttcttcgccttcagcggcgtcgtca CRISPR spacer ttcatcttcgtcatcgccttcatcggcgtcgtca Protospacer . * * * ** ********* ***********
6. spacer 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder matches to KJ000058 (Salmonella phage STP4-a, complete genome) position: , mismatch: 9, identity: 0.735
cggagcggcttcttcgccttcagcggcgtcgtca CRISPR spacer ttcatcttcgtcatcgccttcatcggcgtcgtca Protospacer . * * * ** ********* ***********
7. spacer 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder matches to NC_042044 (Salmonella phage Melville, complete genome) position: , mismatch: 9, identity: 0.735
cggagcggcttcttcgccttcagcggcgtcgtca CRISPR spacer ttcatcttcgtcatcgccttcatcggcgtcgtca Protospacer . * * * ** ********* ***********
8. spacer 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder matches to NZ_CP013110 (Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence) position: , mismatch: 10, identity: 0.706
cggagcggcttcttcgccttcagcggcgtcgtca CRISPR spacer ctctactatttcttcgccttcggcggcttcgtcg Protospacer * .* ..************.***** *****.
9. spacer 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder matches to NZ_CP024310 (Sinorhizobium fredii strain NXT3 plasmid pSfreNXT3c, complete sequence) position: , mismatch: 10, identity: 0.706
cggagcggcttcttcgccttcagcggcgtcgtca CRISPR spacer ctctactatttcttcgccttcggcggcttcgtcg Protospacer * .* ..************.***** *****.
10. spacer 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder matches to NZ_CP013054 (Sinorhizobium americanum CCGM7 plasmid C, complete sequence) position: , mismatch: 10, identity: 0.706
cggagcggcttcttcgccttcagcggcgtcgtca CRISPR spacer ctctactatttcttcgccttcggcggcttcgtcg Protospacer * .* ..************.***** *****.
11. spacer 1.1|1383449|34|NZ_CP040640|CRISPRCasFinder matches to NZ_CP023064 (Sinorhizobium sp. CCBAU 05631 plasmid pSS05631b, complete sequence) position: , mismatch: 10, identity: 0.706
cggagcggcttcttcgccttcagcggcgtcgtca CRISPR spacer ctctactatttcttcgccttcggcggcttcgtcg Protospacer * .* ..************.***** *****.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
638318 : 645650
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP040640|638318:645650|DBSCAN-SWA CATGCGATTTCCCTTTTCCCTGCCGTGGCTGCGCCCGGCGGATGGCAGAGCCGTGCCTGAAAGCCGGAAAATGGCCGATGGCTTTATGGCGGTTGCCGTGCAGGGCGGGCAGGCCTTCTGGTCGGGCCGGTCCTATGCCGCACTTGCCCGCGAGGGCTTCATGAAGAACCCGGTGGCGCACCGGGCCGCCCGCATGGTGGCGGAGGCGTCCGCCTCGGTCAGCTGGCTGCTTTATGATGGCGAAGAGGAACTCGCCGATCACCCGCTCCTGGCGCTGCTCTCAAAACCGGGCGCCCATATGGGTGGGCCGGATTTCTTCGAGGCGCTTTATGGCCACCTCATGCTCGCCGGAAATGCCTATATCGAGCCGCTCACGGTGGGCGGGCGGCTGCGCGAGCTGCATCTGCTGCGGCCAGACAGGGTCAGCATCGTCGAAGGGCCGGATGGCTGGCCGGTGGCCTATGATTACCGTGCCGAGGGCCGCGCCTCCCGGCGCATCGCCGCCGAGCGCGACGGGCTGGGGCTGCTGCATCTGAAACTTTTCCATCCGCTGGATGACCGGGCGGGTTTTGCGCCGCTCGCCTCCGCAGGGGCCGCGCTCGATCTGCACAATGCCGCAAGCCAGTGGAACAAGCGCCTGCTCGACAATTCCGCCCGGCCATCCGGCGCGCTGGTCTACCAGCCGAAGGAGGGCGGTAATCTCTCCACCGAACAATATGAGCGGCTGAAGCGTGAGCTGGAGGAAGGCTATCAGGGGGCGATGAATGCCGGCCGCCCGCTGCTGCTGGAAGGCGGGCTGGACTGGAAGGCCATGGGTCTTTCGCCGCGCGACATGGATTTTCTCGAGGCCCGCAATGGGGCGGCGCGCGATATCGCGCTTTCGCTCGGCGTGCCGCCGATGCTGATCGGCATTCCCGGCGACAATACCTATGCCAATTACCAGGAGGCGAACCGCGCCTTTTATCGTCTTACCGTCCTGCCGCTCGTTTATCGCACGGCGGCGAGGCTCTGCGGCTGGCTTTCTCCGGTCTTCGGCTCTGGGCTGAGGCTCGAACCCGATCTCGACCGGATTGCCGGGCTTGCCGGCGAGCGGGATGCGCTCTGGGCACGCATCGGTGCGGCCTCGTTCCTGAGCGACGAGGAAAAGCGCGAAGCCGTCGGTTATTGATCGCCGCCCCGGCGGGATCGAAACCCCGGTGTTCCGTTCAATCATTTTCTGAAGAAGCGGACTCTCTTTGTGAGCTCTTCCGCCCAAGCGATTCCGAAGATTCGCAAAATCACCCCGCGCAGGCGCGCAGGAAGCGCCCGCGTGCGACGCGGCGCGTCCGCCCGATAACCACATCCTAGAACGGAATGATTACCCATGTCTGAATTCGCCAATGAGGCCGGCATCTGGGCCGCCCGCATCACCGGCGCCGTGGCGGGCGCGGGTGTATCGCTCGTTTATCTCCTGCCGAAAAGCAAACGCGAGGCCGCGAGCCGTTTCATCACCGGCGTTTCCTGCGGCATGATCTTCGGCGGGCCTATCGGCCTGTGGATCGTGCAGCAGCTCGATATCGCCGGCGCGCTTTCGGGCCGGGAAATCATGGTGGCGGGTTCCGCCGCCGCCAGCATGGGCGCGTGGTGGGGGCTGGGCGTGCTGGTGCGCATCGCCGACCGTTACGGTGCGCGCCCGCGCGCCTGACACTTCCTTCACATCGCAGGAGTTTTCCATGCACGTTTATCGCGGGCCGCGTCCCGCCACGCGCAAATTCGCCAATCTGGAACTGCGCGGCATTGCCAGCGACGGCACCTTTTCCGGTTATGCCAGCGTTTTCGGCGAGGTCGATCTCGGTCAGGATGTCATCGAGCGCGGTGCTTTCCGCCGCTCCATCGAGGAGCGGGGAGCGGCCGGTATCCGCATGCTCTACCAGCACGATCCGGCCGAGCCCATCGGCGCCTGGCGCACCATCCGCGAGGACGAGCGCGGGCTTTATGTCGAGGGCGTTCTCGCCCCCGGCGTCGCTCGCTCCCGCGAGGTGCATTCGCTGATGAAGACCGGCGCGCTGGACGGGCTGTCCATCGGCTTTCGCACCGTACGTTCCGGCAAGGCGGCCAGCCAAGGCGTCCGGCGCATTCTGGAAGCCGATCTGTGGGAAATCTCGGTCGTGACCTTCCCGATGCTGCCTTCGGCGCGCGTTTCCGACGTCAAGCACGCCCGCTTCTTCCGCGACGGCGAGACCGAGCTGGTGCGCACCATGCGCCGCGCCGCCCGCGCGCTGTTCGACACGACACCCAAACGCTGACTTCCGGACAATCACCAACAAGGAAACGACATGACAGACCAGATGACGAAACCGGCCCCGATGACCGTCGCGCCGCAGGTGAAGGCCGTGCCCGATACGGTGACTGCCGCCTTCGACGAGTTCATGGAGGCCTTCGAGGCCTTTCGCGAGACCAACGACCAGCGGCTTGCCGATATCGAGCGCAAGATGGGCGCGGATGTCGTGACCCGCGACAAGCTCGACCGTATCGACAGGGCGCTCGACGACAACCGCCGGATCATGGACGATCTGGCGCTCAAGAAAGCGCGCCCCGCGCTTGGCCGCAAGGATGCGCTTTCCCACGATGCCGAAGAGCACAAGGCTGCTTTCGAGGCCTATATCCGTCGCGGCGAGGAGGGCGCGCTGCGCGATCTGGAGGCCAAGGCCTTTGCCGGCTCGACCGGTGCCGATGGCGGTTTTCTGCTGCCGAACGAGACGGATGGCGAGATCGGCCGGCGCATGACGGCGATTTCGCCGATCCGCGCGCTGGCGACCGTGCGGCAGGTTTCCGCCGCCGTGCTGAAGAAACCCTTCTCGCCCGGCGGCATGACGACCGGCTGGGTTTCTGAGACGGCGGCGCGTCCGCAAACGGCAACGCCGCAGCTCGCCGAGCTTTCCTTCCCGACCATGGAGCTTTACGCCATGCCGGCCGCGACTCAAGGGTTGCTGGATGATGCGGCAGTCGATATCGAAGCCTGGATCGCTTCCGAAGTGGATATCGCTTTCGCCGAACAGGAGGCCGCCGCCTTCATCGCCGGCGATGGCGTCAACAAACCCAGGGGTTTCCTCTCCTATACCGCCGTCGCCAATGATGGCTGGAGCTGGGGCAATATCGGTTATGTCGCGACCGGCGTTTCGGCCGGATTCGCCTCCGCCGGGCCGATGGATGTGCTGCTCGATGCTGTTTATGCGCTGAAGGCCGGCCACCGCCAGAACGGAACCTTCCTGATGAACCGCAAGACGCAAGGGGCGTTGCGCCGCTTCAAGGATACCAGCGGCGCCTATCTGTGGCACCCGCCCGCCGCCGCCGGCCAGCCGGCCTCGCTGATGGGCTTTCCGGTGACGGAGGCGGAGGACATGCCGAATGTGGCGGCCAACAGCTTCGCCATCGCCTTTGGCGATTTCCGCGCCGGCTACCTCGTCGTCGACCGTACCGGCGTGCGCATCCTGCGCGATCCCTATTCGGCCAAACCCTATGTGCTGTTCTACACCACCAAACGCGTGGGCGGCGGCGTGCAGAATTTCGAGGCGATCAAGCTGGTGAAATTCGGGGTGAATTGACCGCCCTTTCTTCTCCCCGCCGGGGAGAAGATGCCCCGAAGGGGCAGATGAGGGGGCAAGCTCTCCGAACACATCTACCCTTGCCCCCTCATCCCGCTGCCGCGGACTTCTCCCCCTCGGGGAGAAGAAACAAGCGGTGCCCGCTCGCTAATGCGATTACCTTCCTTCCGGAGACCCCATGACCTATGCCCTCATTCATCCGCCGCAGGCGGAGCCGCTGACACTTGCCGAGGTGAAGGCGCATCTGCGTCTCGACAGCGGCGATGAGGACGCGCTTCTTGCCGCGCTCATCCGCACCGCCCGCGAGCATCTGGAACGCACGACCGGGCTTTGCCTCATCCGCCAGACCTGGCGGCTTTATCTCGACCGGTGGCCTGAGAACGGCGTGATTCTGATTGGCAGGACACCGGTGCAAGCCATCGAAACGATTCTGGTTTTTGACGGTGACGGGCGTGCGGCAGACATCACCGCCGGGGAAAAATTGCTCGATGGCGCGGCGCGTCCCGCAAGGCTGTGGCTGCGCGATCCTCCCGCCCCCGGACGGGCGATGAACGGCATCGAGATCGATTTCATTGCCGGCTACGGTGAAGCGGGAACGGATGTGCCCGACACGCTGAAACGCGCCATGCTGATGCATGTGGCGCAGATGTTCGCCTTTCGCGGTGCTGTCGCCCCGGAAAACCAGCCCGCCGCCATTCCCGCCGGTTACGAGCGGCTGGTGGCGCCTTTCTGCCGTCTGGGGCTTTGAGAGATGAACCTTGTTTTTCTCGATCCCGGTAAGTTGACGGCGCGGCTGGAACTGGACGTGCGGACCGAAACGCCGGACGGGCAGGGCGGTGCTGCGGAAAGCTGGAATTTCCTGCGGTCGCTCTGGGCTGCGATCGAACCCGTTTCGGAGGCATCCCATGAACGTGCCTCTGCCGAGGGCGTGACGATCACCCACCGCGTCTGGCTGGCTTATCGCGGCGACATTGCCGCCGGCATGCGCTTTCGCAAGGGCCGCCGTATTCTGGCGATCCGGGCGGTGATGGACCCGGATGAGACGCGTCGCTTCATCGTCTGCCGCTGCGAGGAGGAGAGCCGATGAGCGCCGCAAATCTGCTTCTGCAGGCAATTTTCGCAAGGCTCGGCAGCGATGCGGCGCTGATGGCGCTTATCCCCGGCGGCATTGTCGACCGGCTTCTGCCGCGCCCGGTTCTGCCGTGCATAGTTATCGATGATCTCGAAAGCCGGGACTATTCGACGGCGACGGAAAGAGCCGAGGAGCATTTTCTGTCGCTGCAGATATGGAGCGACGCCAATGGCCGCAGGGGCGCAGACGAAATCGTCGACCGGGTGAAAATCCTGCTCGATGATGCCGCGCTGCCGATTGCCGGCGTTTCTCTCGTCAATCTGCATCTTCTTTCCAGCCGTTCGCGGCGCGAGGCGAAGACACGGAATTTCATCGCGGAAATGCGCTTCAGGGCGGTGACGGAGTAGCGCCGTTTCAGGAGCCCTGCCTGCGCACGGTTTTCCACAGAACGATCAGCAGCAGAAACGAAATGCCGATCAGCACGGAGGCGATGGTGAGCATGGCGGATACGCCGCCGCGATCGAGCGCCACGGTGAAGACGACCGGCGCCACGGCAATGGCGAGGTTCTGCGGCAGGGAGATGCGCGCGGCCTGAAGCCCATATTCTTCCGGCGAAAACACAGCCAGCGGCAACACGGCCCGGCTGACGGTGAGCACGCCGGCGCCAAAGCCGAAGAAAACGATGAAACCGATGAAGGCGGGCATGGCCGGAGCAAAAACGAGCAACAGTAGCAGGGAAAAGAGCAGCAGACAGAAGCCGATCATGGCGGTGAGGAAGGGATTCCCGCGCTTCCCGAGCAGGAAATCCAGCCCGCGCGCCGTAATCGCAAGGACGCTGCGTGCCGATGCCAGTTGCACCGCAAGCGACTGCGTTGCGCCGGCCTGAACCAGAAGCAGCGGCAGGAGCGGTGAGAGGCCGAAGGTCGTAAACGCGCTGATCGTCGTCATGGCGGCCAGCAGCACGAAGGCGCGCCGCGTATCGACCGGTGAGGGCGACATCGCCGTCTCTTCTTTCGGCTTTGCCTCTTTTCGGGCCGGACGGCCGGGAAGCACGAAAAGATAGAGCGGCAGCAGGATGAAGATTTGCAGACAGGCATAGGCGGCAAGCGTGCCGCGCCAGCCGAGATGCTGATCGGCAATGGTAGTGACCGGCAGGAAAACCGCCGCCGAAAGCCCGGTGAACAGCATCAGCAGCGTCAAAAGCCGGCCGCTTTCCGCGCCCACGCGTTCCACCACGGCCGCATGGGCCGCCGTCGTCAGGCCGCAGGTGGCGGCAAAACCCATCACGGCCCAGCCGAAGAGATAGCTCACCGCCCCGCCCGCGAGGGCGAGCACGGCAAAGCCGGCCGCGAAGAATACCGAACCGGCCGCCAGCACCGGCGCCGCGCCGTGGCGCATCAGCATTCTTCCAAGCAGCGGGCCGCACAGGGCGCTGATGGTCATCATGATCGTGAGGCCGGCGAAAACCACGCCATTGGTGATGGCCAGCTCCTGGCCGATCCGTGGCCCCAGAACGGCCAGCATATCGAAACCGCTACCCCAGCTCACGATCTGCCCGACCGCAAGCACGCCGATAAGGCGCGCGCGAGACGTCAGGGGGGCAGCGTCGGACATGGTGAGATCGCGGGGTGCGAGGTGCAGGAATGCTTTTGGTAGCAGGCCCTTCGCGACCCCGCAACAGCAATCGAGACGAAAGGAAACAACATGGTGGCGCAGAAGGGCAAGGACCTGCTGCTGAAGATCGACAATGCCGGTTCCTATGCAACCGTTGCGGGGCTGAGGACAAAACGGCTGGCCTTCAATGCGCAGGCCGTCGATGTGACGGATGCCGAAAGTGCGGGGCGCTGGCGGGAGCTTCTTGCCGGCGCCGGCGTGCAGCGGGCATCGCTCACGGCCTCCGGCATCTTCAAGGATCAGGCGAGCGATGCGCTGGTGCGCGGCGCATTTTTTGCGGGCAGCATTCCCGGCTGGCAGATCGTCATTCCCGGTTTCGGCATCGTTACCGGGCCGTTCCAGATCGTGGCGCTCGAATATTCCGGCCGTCACGATGGCGAGGTGCAGTTCGAGATCGCGCTGGAATCGGCCGGTCTTCTCACATTCGGAGCGCTGTGATGGCTGAGCGTTTGCGTTACGGGCGGGCGAACCGCCATCGCGGCGAGATCGAAGCGCTGATCGACGGCGAAAGGCGCGTTCTCTGCCTGACGCTCGGGGCCTTGGCCGAACTCGAAACCGCCTTTGCAGCCGACGACCTCACCGCGCTCGCCGAACGTTTCGCGAGCGGCCGCATGAAGGCGGTCGACATGATAAGGGTGATCGGCGCGGGCCTGCGAGGCGCGGGCAATGTCTTTTCCGACGAGGATGTGGCCGCCGCCACGGTGGAAGGCGGCATTGCCGGCCATGCCGCCATCGTTGCCGATCTTCTGACCGCCACCTTCGGCGGCCTGAAAGGCGAGACGCCGCCGGACCCTTGAGCGCCGCAGCAGGCGAAGCGACGCCGCGCCCGTTCCCCTGGGAGGCGGCGATCCATGCCGGCTTCTGCCTGCTGCGGCTCTCCTCCGAAACCTTCTGGCGGCTGACCCCGCGGGAATTCTTCGCGATGACGGGCGGCAACGCCCTTCTCCGCGGCCCCGACCGTCAGGCGATGGAAGCGATGATGCGGCGGTTTCCGGATGGGTGA
Protein sequences of DBSCAN-SWA_1 >NZ_CP040640|638318:645650|642079_642649_+|WP_142778915.1|DBSCAN-SWA MTYALIHPPQAEPLTLAEVKAHLRLDSGDEDALLAALIRTAREHLERTTGLCLIRQTWRLYLDRWPENGVILIGRTPVQAIETILVFDGDGRAADITAGEKLLDGAARPARLWLRDPPAPGRAMNGIEIDFIAGYGEAGTDVPDTLKRAMLMHVAQMFAFRGAVAPENQPAAIPAGYERLVAPFCRLGL >NZ_CP040640|638318:645650|645440_645650_+|WP_125144629.1|tail|DBSCAN-SWA MSAAAGEATPRPFPWEAAIHAGFCLLRLSSETFWRLTPREFFAMTGGNALLRGPDRQAMEAMMRRFPDG >NZ_CP040640|638318:645650|643387_644587_-|WP_168207993.1|DBSCAN-SWA MSDAAPLTSRARLIGVLAVGQIVSWGSGFDMLAVLGPRIGQELAITNGVVFAGLTIMMTISALCGPLLGRMLMRHGAAPVLAAGSVFFAAGFAVLALAGGAVSYLFGWAVMGFAATCGLTTAAHAAVVERVGAESGRLLTLLMLFTGLSAAVFLPVTTIADQHLGWRGTLAAYACLQIFILLPLYLFVLPGRPARKEAKPKEETAMSPSPVDTRRAFVLLAAMTTISAFTTFGLSPLLPLLLVQAGATQSLAVQLASARSVLAITARGLDFLLGKRGNPFLTAMIGFCLLLFSLLLLLVFAPAMPAFIGFIVFFGFGAGVLTVSRAVLPLAVFSPEEYGLQAARISLPQNLAIAVAPVVFTVALDRGGVSAMLTIASVLIGISFLLLIVLWKTVRRQGS >NZ_CP040640|638318:645650|639680_640001_+|WP_020808622.1|DBSCAN-SWA MSEFANEAGIWAARITGAVAGAGVSLVYLLPKSKREAASRFITGVSCGMIFGGPIGLWIVQQLDIAGALSGREIMVAGSAAASMGAWWGLGVLVRIADRYGARPRA >NZ_CP040640|638318:645650|645084_645444_+|WP_059753802.1|DBSCAN-SWA MAERLRYGRANRHRGEIEALIDGERRVLCLTLGALAELETAFAADDLTALAERFASGRMKAVDMIRVIGAGLRGAGNVFSDEDVAAATVEGGIAGHAAIVADLLTATFGGLKGETPPDP >NZ_CP040640|638318:645650|638318_639485_+|WP_142778913.1|portal|DBSCAN-SWA MRFPFSLPWLRPADGRAVPESRKMADGFMAVAVQGGQAFWSGRSYAALAREGFMKNPVAHRAARMVAEASASVSWLLYDGEEELADHPLLALLSKPGAHMGGPDFFEALYGHLMLAGNAYIEPLTVGGRLRELHLLRPDRVSIVEGPDGWPVAYDYRAEGRASRRIAAERDGLGLLHLKLFHPLDDRAGFAPLASAGAALDLHNAASQWNKRLLDNSARPSGALVYQPKEGGNLSTEQYERLKRELEEGYQGAMNAGRPLLLEGGLDWKAMGLSPRDMDFLEARNGAARDIALSLGVPPMLIGIPGDNTYANYQEANRAFYRLTVLPLVYRTAARLCGWLSPVFGSGLRLEPDLDRIAGLAGERDALWARIGAASFLSDEEKREAVGY >NZ_CP040640|638318:645650|642984_643380_+|WP_142778916.1|DBSCAN-SWA MSAANLLLQAIFARLGSDAALMALIPGGIVDRLLPRPVLPCIVIDDLESRDYSTATERAEEHFLSLQIWSDANGRRGADEIVDRVKILLDDAALPIAGVSLVNLHLLSSRSRREAKTRNFIAEMRFRAVTE >NZ_CP040640|638318:645650|640632_641901_+|WP_137003130.1|capsid|DBSCAN-SWA MTDQMTKPAPMTVAPQVKAVPDTVTAAFDEFMEAFEAFRETNDQRLADIERKMGADVVTRDKLDRIDRALDDNRRIMDDLALKKARPALGRKDALSHDAEEHKAAFEAYIRRGEEGALRDLEAKAFAGSTGADGGFLLPNETDGEIGRRMTAISPIRALATVRQVSAAVLKKPFSPGGMTTGWVSETAARPQTATPQLAELSFPTMELYAMPAATQGLLDDAAVDIEAWIASEVDIAFAEQEAAAFIAGDGVNKPRGFLSYTAVANDGWSWGNIGYVATGVSAGFASAGPMDVLLDAVYALKAGHRQNGTFLMNRKTQGALRRFKDTSGAYLWHPPAAAGQPASLMGFPVTEAEDMPNVAANSFAIAFGDFRAGYLVVDRTGVRILRDPYSAKPYVLFYTTKRVGGGVQNFEAIKLVKFGVN >NZ_CP040640|638318:645650|642652_642988_+|WP_020808625.1|head|DBSCAN-SWA MNLVFLDPGKLTARLELDVRTETPDGQGGAAESWNFLRSLWAAIEPVSEASHERASAEGVTITHRVWLAYRGDIAAGMRFRKGRRILAIRAVMDPDETRRFIVCRCEEESR >NZ_CP040640|638318:645650|640029_640602_+|WP_142778914.1|head,protease|DBSCAN-SWA MHVYRGPRPATRKFANLELRGIASDGTFSGYASVFGEVDLGQDVIERGAFRRSIEERGAAGIRMLYQHDPAEPIGAWRTIREDERGLYVEGVLAPGVARSREVHSLMKTGALDGLSIGFRTVRSGKAASQGVRRILEADLWEISVVTFPMLPSARVSDVKHARFFRDGETELVRTMRRAARALFDTTPKR >NZ_CP040640|638318:645650|644677_645085_+|WP_003521219.1|tail|DBSCAN-SWA MVAQKGKDLLLKIDNAGSYATVAGLRTKRLAFNAQAVDVTDAESAGRWRELLAGAGVQRASLTASGIFKDQASDALVRGAFFAGSIPGWQIVIPGFGIVTGPFQIVALEYSGRHDGEVQFEIALESAGLLTFGAL |
11 | Geobacillus_phage(33.33%) | protease,head,tail,portal,capsid | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1394020 : 1402298
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP040640|1394020:1402298|DBSCAN-SWA GATGACACCCCATTTCCTGCTGCACCACCTGCTTACCGCCCGCGCGGCGAGCGACGATCAGGCCCTCGTCCACAAGGACCGATCTCTGAATTATCGCGAATTTTCCGCAGCGGCGGCCCGTTGCGCGGCGGCATTGCAGGAAGCCGGAGCGCAACGCGGTGACCGCGTCGTCATCTATCTGCCGCGCGGCATCGAGGAATGCTGGTCGATCTTTGGCGTGAGCATGGCGTCCGGCGTCTTCGTGCCGGTGAACGCATTGCTGAAGGCGCAGCAGATTCGCCATATCGTTAAGGATTGCGGCGCAAAAATCGTCATCAGCGACGCCGCGATGATGGATGAGCTGAAAGCGGCACTGGAAGACCTTCCCGACGTCACGGTCCTGCTGGCGGAAGAGATAGAGGCCCGCGCCGACACACCCGCCCGGCCTTCAGCGGCAATAGGGGAAGACCTGGCCGCCATTCTTTATACCTCCGGCTCCACCGGATCGCCCAAGGGCGTGATGCTGTCGCACCGCAATCTTCTGGCCGGCGCGCGCATCGTGCGCACCTATCTGGACATCACCGGTAGCGACCGCATTTTGTCCCTCCTGCCCTTCAGCTTCGATTATGGCCTGAACCAGTTGCTGACGGCGGTGGAACAGGGCGCGGCCACGATCATCTCCACCTTCCGGCTGGGCGATGAGATCGTCCGCGATCTGCGCGACCATGCCATCACGGGTCTCGCCGGCGTGCCGACGATCTGGGCCATCCTGACGAAAGCGGCCCCGTCGCTCACAAAGACGCCTCTGCCGCATCTGCGTTACATCACCAATTCCGGCGGGCGCGTGCCGCAGGAAACCGTAAAAGCGCTGCGCGAGAAGCTGCCGGATACGAAAATCTACCTGATGTATGGCCTCACGGAAGCCTTCCGCTCCACCTTCCTGCCGCCTGAAGAAATCGACCGCCGCCCGACCTCCATCGGCAAGGCCATTCCCGAATGCGAAATCTTCATCGTCACCGCCGAGGGACAGAGGGCAAAGCCGGGTGAGCCCGGTATTCTCGTGCATCGCGGCCCGACCGTCTCGCTGGGTTACTGGAACCGGCCGGAGGATACGGCAAAGGTGCTGCGCCCTCACCCCTTCATTCCGGCGGCGCTGGGCGGCGAGACCGTGTGTTATTCCGGCGATCTGGCGGTGGAGGACGAGGATGGTTTCTTCAGCTTCGTCGCCCGCAACGATGCGATGATCAAATCGTCGGGCTATCGCATCAGCCCGACCGAGGTTGAGGAAAGCCTGATGTCGACAGGCCTTTTCCAGCAGGTCGCCGTCATCGGCCTGCCGGACCCCTTTGCGGGCGAAAAGGTCCATGCCGTGGCGACTGCCGCCAACCAGAATATAGATGTGACGGCGGCTCTGAAGAAGGCGGCCGAAATGCTAGCCCCCTTCATGATCCCGCGCGCCATCGAACTGGTGGAACGGCTGCCGGTCACGGCCAATGGCAAGGTGGATTACCGCGCGCTGGTGCGCGAACGGACAGACAATGGCGCCAACGGATAAACCCCAGAGCCACGGCCCGGCTTTTGCAGCCGCGTTATTCGACACAAGCGAGAACGATCTGATGATCGGCGGACTTCCCGTGCGCGATATTGTCGCCGGGACCGGAACGCCTTGCTTCCTCTATGACGCTGCCGCCATGCGCCGCGCCTATCGCGATCTCGAAACCGCGCTCGGCGGTTTTGCCGATATCTATTATTCGGTGAAGGCCAATCCCCTGCCCGCCATCGTCTCGCTTTTCCGGCAGGAAGGTGCGGGTGCGGAAATCGCTTCCGTCGGTGAATACCGCGCCGCCATCAAGGCCGGTGTCGCAGCGGAAAACATCATCTTTGCCGGCCCCGGCAAAGGCATGGCCGAGCTGCGCGAAGTGATCGAGGGCGGCATCGGCGAAATCCACATCGAAAGCGCCGAGGAAATCGCCCGCATCGAGGCAATCGGCAAGCCGGTCAAAGCCTCGATCCGCATCAATCCGGTGCCGGATGCACAGGCCGGCGCGATGCGCATGGGCGGCAAGGCCACAGCCTTCGGTTTCGATGAGGAGGAGCTGGAAAATGTCCTGACGCTCTTCAAGGACGCCCGATATATCGATCTCGTCGGCGTCCATATTTACGGCGGCACGCAGATCCTCGATGCCGACATGCTTGTCTCCCAATGGAGACACGGCATATCCCTTGCCGCGCGCACGGCCGAGATTCTCGGCAGGCCGCTCGAAACCATCGATCTCGGCGGCGGTCTCGGCATTCCCTATTTCGCAGGCGAAACGCCGCTCGATCTTGAAAAAGTCAGCGCCGCCATTGCCGATCTCAAGGCGCTCCTGAAGGCGCATCCGCTGATAACCGACGCCCATGTCATCGTCGAACCCGGCCGTTTCCTCGCCGGTCCCGGCGGCGTCTATGTGGCGGAGGTGAACTCGGTAAAGACCTCGCGCGGCACCACCTTCGTGGTGATGGATGGCGGCATGCACCATCACCTGGCCGCCTCCGGCAATCTCGGCCAGATCGTCAAGCGCAACTACCCCATCGTCGCACCGGCCATGATGCGGGCGGAGCATGAGGAAACGGCAACCATCGTCGGCCCGCTCTGTACGCCGCTCGATACGCTCGCCCGCAATGCGGCGCTGCCGAAGCTCAAGGCCGGCGATCTCCTCGCCATCCTGCAATCGGGCGCCTATGGCGCCAGCGCCAGCCCAGCGGGTTTTCTGAGCCATGCGGCGGCGAAGGAAGTGCTGGTGGAGAATGGGGTGTTTGAGGTGATCGGGCGCTGAGGACGTTTGGTAGCGAAAACCGTAGCCGGCTTTCCTAAGGGCACACATTTGGCTGCTTACCCCGGAGCTTCCTCCAATCTTCCTCATTCCTGTGCCTGTCACAGGAATGAGGGAGGCGACTAACCCCGAAGCAACCTGTCCCCCACCGCGAATTTCGATTTCAGCAGACACTCATCATATTCCGCCTCGGCAACGGAATCGAACACGATACCGCCGCCGACATTGAAGACGGCACGGCCGTCGTCAAACAGGCTGAGCGTGCGGATCGCCACGGAGAACCGCATCTCACCACTTGGAGACATGAAGCCGATTGCGCCGCAATAGGCGTCGCGCGGCGTAGCCTCCAGCTCGCGCAGGATTTTCATCGCCCACATTTTCGGCGCGCCGGTAACGGAGCCGCAGGGAAACAGGGCGGCGAAGATATCTTCCACCGTCACGTCAGGCAGAAGTCTCGCCCGCACATGGCTGACCATCTGGTGCACGGTCGGATAGGTCTCGATATCGAACAGCCGCGGCACATCAAGGCTGCCGACTTCGGTGATGCGGGAAATATCGTTGCGCAGCAGATCGACGATCATGCGGTTTTCGGCGAGCGTCTTTTCGTCCGCCAGCATCGCCGCGATGATGGCCCGGTCTTCTTCCGCATCGGCCCCGCGCGGCGTCGTTCCCTTCATCGGATGGGTTTCGATGAAACCCTCCCCATCCACCGAGAAGAACAGTTCCGGCGAGCGCGACAGGATGACCGGGCCGCCGAGATCGACCAGCGCGCCATATTTCACCGGCTGCCGTTCGATCAGCGACCAGAAGGCCGTCAGCGGATCGCCGCTCCAGCGGGCATGGACCGGCATGGTCAGATTGCCCTGGTAACAGTCGCCGCGACGCAGATGGTCGTGCAGACGCTCGAAACGCTGGCAATAGTCTTGAAGGGTCCATGCGGACACGGGATCGGACAGAAACGCATCCGCATCCGGCAGATTTGCCGGCCGCGCGAACGGTCCCTCATCCGGCTGCGGGCCGGAAAAGACGCCGAAATTGAGAAACGGCACGTTGCGAGGCTCGTCCGCGAAAGGCGCAAGCTTCGGCTCGAACAGGAAACCCGCTTCGTAGGACATATAGCCGGCGAGGTATTTCCCGGCCCGGCGCAGCTCTTCCATGCGCTTCAGCGCAGCAAAAAACGACGCCGGTTCATCCGCGACGATGATCTCTTCCGGCTCGGTGAAGGCTGTCACCGTGCCGGTCGTATCATCCCGGAAAAGAACGTAAGGCGCGTGTGCCAAGGAAAGCGTCCGAAACAGGAGAAAAAGTCGGGAAGAGTCAGATGGGGTCTATATCGCCCCGCGCCCACATTTCGATGGTTTCCGCATAAAAATCGGCGAAACGGCCTTCCTCGATAGACTTGCGGATACCCTGCATCAGCTCCTGATAATAGGCAAGATTGTGCCAGGAGAGCAGCATGCCGCCCAGCGCCTCATTGGCGCGGGTGAGATGGTGCAGATAGGCGCGGGAATAATCGCGCGAGGCCGGGCAGTTGGACTGCTCGTCAAGCGGGCGCATATCCTCAGCATGGCGGGCATTGCGGATATTGACCTTGCCGCGACGGGTAAAGGCAAGCCCGTGACGGCCGGAACGGGTCGGCATCACGCAGTCGAACATGTCGATGCCGCGCGCCACCGATTTCAGGATGTCATCGGGCGTGCCGACGCCCATCAGGTAACGCGGCTTTTCGGTGGGCAGGACCGGCAGGGTGATATCGAGCATGCCAAGCATCACATCCTGCGGCTCGCCGACGGCAAGGCCGCCGACCGCATAACCCTTGAGATCGAGCTGCTTCAGCCCTTCGGCGGAGCGGATGCGCAGATCCGGCTGGTCACCGCCCTGCACGATGCCGAACATGGCCTTGCCGGGCTGCTCGCCAAAGGCGACGCGGCAGCGCTCGGCCCAGCGCAGCGACATTTCCATGGCGCGCTCGATTTCCTTGCGCTCGGCCGGCAACGCGATGCATTCATCGAGCTGCATCTGGATATCCGAATCTAGCATGCCCTGAATTTCGATGGAGCGTTCCGGCGACATGTGGTGCAGAGAACCGTCCACATGGCTCTTGAAGGTCACACCCTTCTCGTCCAGCTTGCGCAGGCCGGAAAGCGACATCACCTGAAAACCGCCGCTATCGGTGAGGATCGGGTGCGGCCAGCGGATCAGCTCATGCAGGCCGCCAAGGCGGGCAACACGCTCCGGACCGGGCCGCAGCATCAGGTGATAGGTATTGCCGAGAATGATATCGGCCCCAAGCTCCCGCACCTGATCGAGATACATGGCCTTGACGGTGCCGACCGTGCCGACCGGCATGAAGGCGGGCGTGCGGATGACGCCGCGCGGCATGGCGACTTCGCCGAGGCGTGCGCCGCCGCTCGTGGCTTTCAGGGTGAAGGTGAATTTGTCGTGCATCAGTTTTTCCGGAACAACAGGCTGGAATCGCCATAGGAATAGAAGCGGTATCCGGTTTCGATGGCGTGCTTATAAGCGTCACGCATCGTTTCGAGACCGCAGAAAGCCGAAACCAGCATGAACAGCGTCGATTTCGGCAGGTGGAAATTCGTCATCAGAATATCGACCGCGCGGAAGCGATATCCCGGCGTGATGAAGATGCCGGTGGCATCGGACCATGGGTGGATAACGCCATTCTCATCGGCCGCACTTTCGATCAGGCGCAGCGAGGTGGTGCCGACGCAAACGATGCGCCCGCCGCGCGCCTTGACGGCATTCAGCCGGTCGGCGGTTTCCCGCGACACATGGCCGATCTCGAAATGCATCTTGTGATCGTCAGTATCGTCGGACTTAACCGGAAGGAAGGTGCCTGCCCCAACGTGAAGCGTCACGAAATGCCGCTCGATACCCACCTTATCAAGAGCTTCGAACAGATCGGGCGTGAAATGCAGCCCGGCGGTGGGGGCGGCAACGGCACCCTTTTCGCGGGCGTAGATGGTCTGGTAATCGGTCTGGTCCTGCGCATCTTCCGGGCGTTTCGCGGCGATATAGGGCGGCAGGGGAATATGGCCGACGGAGGCAATCGCCTCGTCCAGCACCGGGCCGGAGACGTCGAACAGCAGCGTGATCTCGCCCTCCTCGCCCTTCTCCTCGACGGTGGCTTCCAGATGGGCAAGGCCGCAGGCATTGTCGCGCTCATAACCGAAACGGATGCGGTCGCCCTGCTTGATGCGCTTGCCGGGACGCGCGAAGGCCTTCCAGCGGGACTGATCGGCGCGCATGTGCAGCGTGGCCGAAACCGCCGTCTCCGGCGCGCCTTCGCGCAGGCGTACACCTTCGAGCTGGGCGGGAATGACGCGGGTATCGTTGAAGACAAGCGCATCGCCGGGCTTCAGGAAGGACGGCAGGTCGAAGACGCGATGGTCTTCCATGCGGTTCTCATTCGGATCGACCACCAACAGGCGCGCGCTATCGCGCGGGTTCGCCGGGCGCAGGGCGATATTCTCCTCGGGCAGATCGAAATCGAACAAGTCTACACGCATTGCAAGGGTCTTCCAGAAATCCAATTTGACACGGCACGGTGTTTTGCGGCCCGACGCATCCGCACCCCTCATTCCTGTGCTTGTCACAGGAATCCAGTCGGCGCGCGTCTGCGCGGCGGAAGGAGTCCTTTCAGCCCAGGGACTTGGGCTGGCTGGATTCCTGTGACAAGCACAGGAATGAGGGAAGAAGAGAACGACGCGACGGATGGCAAGCGTCAAAACCGCCGCGTCTATCCTTAAAATCACATACCGTCAATAAACGTCAAAACCCGCCTCGCATAGACTGCGAGACGGGTCGAACTGCAATTTCCGAAAATCAGGCCGCAGAAGCGAGCTTGATGGAAACGATCGAATCCGGGTCCTTCACCGGCTCGCCGCGCTTGACCTTGTCGATGGCTTCCATGCCTTCGATGACCTGGCCCCAGACGGTGTACTGCTTGTTGAGCCACGGCGAATCCGTGAAGCAGATGAAGAACTGCGAGTTGGCGGAGTTCGGGTTCTGCGAACGGGCCATAGAGCAGGTGCCGCGCACGTGCGGAATGGCGGAGAATTCGGCCTTCAGGTCCGGCTTGTCGGAGCCGCCCATGCCGGCGCGGGCCGGGTTGAAGCTTTCCGCACCCTTCTTGCCGAACTTCACGTCGCCCGTCTGGGCCATGAAGTCTTCGATGACGCGGTGGAAAACGACGCCATCGTAAGCGCCTTCCGATACGAGTTCCTTGATGCGGGCGACGTGGCCCGGAGCAACTTCCGGCAGCAGCTGGATCACGACCTTGCCGGTCGTCGTTTCCATGATGATGGTGTTTTCCGGATCCTTGATCTCGGCCATGACTATTCTCCTCTGTTCGGGCTCCATTGCCCTTACCTTACTTTTTGCCGACGGTGACCTTGATCATCCGGTCGGGGTTGGAAACTTCGCCGTTGCCGCCGGCGCCGCGCTTGATCTTGTCGACGGCTTCCATGCCGGACACGACCTTGCCGACCACGGTATATTGGCCGTTCAGGAACGGACCGTCGGCAAACATGATGAAGAACTGCGAATTGGCGGAATTCGGATCCTGCGAGCGGGCCATGCCGACCACGCCGCGCGTGAACGGAACCTTGGAGAATTCGGCCGGAATATCCGGCAGGTCGGAACCGCCCGTACCCGCGCGCTGGGCGCTGAAGCCCTTCTCCATGTTGCCATATTGCACGTCGCCGGTCTGCGCCATGAAGCCGTCGATGACGCGGTGGAAGGCGACATTGTCATAAGCGCCCTTCTTGGCCAGCGCCTCGATCTGCGCCACGTGCTTGGGCGCGACATCCTGCATCAGCTCGATGACAACGGGACCGTCCTTCAGCTGCACGGTCAAGAGTTCGGCGGCGGAAGCGAAGGTGCTGGCGGCGAGTGCGCCTGCAAACATGGCGCCGGCAAAGGCAAATCGAACGAGTTTCATCGGATTGGCTCCAGAATGTGGGGCTGAAGTCAGCGCTTCAGCTTGGCGTTGAGGGCTTCAAGCACGGCTTTCGGCACGAAGGCGTCGACATTGCCGCCCATGGCGGCGATCTGCCGGACCAATGTGGCTGTAATGGGTCGCGACGAGGTGCCGGCGGGCAGGAATACGGTCTGGATATCGGGCGCCATCTGGCGGTTCATCCCGGCCATCTGCATTTCATAATCGAGATCGGTGCCGTCGCGCAGGCCGCGCAGCAGAAGGCGCGCACCGTGCTCTCGGGCGGCATCGACGACCAGATTATCGAAAGAGACGACTTCCATGCGCGCCGCTTCGCCCGGCAGATGTTCGGCAAGCGCCCGCTTTATCAGCCCTGCCCGCTCCTCGAAGCTGAACAACGGCGCTTTTCCGGGATGAATGCCGACCGCAACGATGACTTTCGACGCCACGTTCAGCGCCTGAATAAGCACATCCAGATGTCCGTTGGTCATCGGATCGAAGGATCCTGGATAAAAGGCAATCGTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP040640|1394020:1402298|1401204_1401774_-|WP_142779296.1|DBSCAN-SWA MKLVRFAFAGAMFAGALAASTFASAAELLTVQLKDGPVVIELMQDVAPKHVAQIEALAKKGAYDNVAFHRVIDGFMAQTGDVQYGNMEKGFSAQRAGTGGSDLPDIPAEFSKVPFTRGVVGMARSQDPNSANSQFFIMFADGPFLNGQYTVVGKVVSGMEAVDKIKRGAGGNGEVSNPDRMIKVTVGKK >NZ_CP040640|1394020:1402298|1399258_1400341_-|WP_142779295.1|tRNA|DBSCAN-SWA MRVDLFDFDLPEENIALRPANPRDSARLLVVDPNENRMEDHRVFDLPSFLKPGDALVFNDTRVIPAQLEGVRLREGAPETAVSATLHMRADQSRWKAFARPGKRIKQGDRIRFGYERDNACGLAHLEATVEEKGEEGEITLLFDVSGPVLDEAIASVGHIPLPPYIAAKRPEDAQDQTDYQTIYAREKGAVAAPTAGLHFTPDLFEALDKVGIERHFVTLHVGAGTFLPVKSDDTDDHKMHFEIGHVSRETADRLNAVKARGGRIVCVGTTSLRLIESAADENGVIHPWSDATGIFITPGYRFRAVDILMTNFHLPKSTLFMLVSAFCGLETMRDAYKHAIETGYRFYSYGDSSLLFRKN >NZ_CP040640|1394020:1402298|1396933_1398091_-|WP_142779294.1|DBSCAN-SWA MAHAPYVLFRDDTTGTVTAFTEPEEIIVADEPASFFAALKRMEELRRAGKYLAGYMSYEAGFLFEPKLAPFADEPRNVPFLNFGVFSGPQPDEGPFARPANLPDADAFLSDPVSAWTLQDYCQRFERLHDHLRRGDCYQGNLTMPVHARWSGDPLTAFWSLIERQPVKYGALVDLGGPVILSRSPELFFSVDGEGFIETHPMKGTTPRGADAEEDRAIIAAMLADEKTLAENRMIVDLLRNDISRITEVGSLDVPRLFDIETYPTVHQMVSHVRARLLPDVTVEDIFAALFPCGSVTGAPKMWAMKILRELEATPRDAYCGAIGFMSPSGEMRFSVAIRTLSLFDDGRAVFNVGGGIVFDSVAEAEYDECLLKSKFAVGDRLLRG >NZ_CP040640|1394020:1402298|1398128_1399259_-|WP_080856425.1|tRNA|DBSCAN-SWA MHDKFTFTLKATSGGARLGEVAMPRGVIRTPAFMPVGTVGTVKAMYLDQVRELGADIILGNTYHLMLRPGPERVARLGGLHELIRWPHPILTDSGGFQVMSLSGLRKLDEKGVTFKSHVDGSLHHMSPERSIEIQGMLDSDIQMQLDECIALPAERKEIERAMEMSLRWAERCRVAFGEQPGKAMFGIVQGGDQPDLRIRSAEGLKQLDLKGYAVGGLAVGEPQDVMLGMLDITLPVLPTEKPRYLMGVGTPDDILKSVARGIDMFDCVMPTRSGRHGLAFTRRGKVNIRNARHAEDMRPLDEQSNCPASRDYSRAYLHHLTRANEALGGMLLSWHNLAYYQELMQGIRKSIEEGRFADFYAETIEMWARGDIDPI >NZ_CP040640|1394020:1402298|1400657_1401167_-|WP_020810245.1|DBSCAN-SWA MAEIKDPENTIIMETTTGKVVIQLLPEVAPGHVARIKELVSEGAYDGVVFHRVIEDFMAQTGDVKFGKKGAESFNPARAGMGGSDKPDLKAEFSAIPHVRGTCSMARSQNPNSANSQFFICFTDSPWLNKQYTVWGQVIEGMEAIDKVKRGEPVKDPDSIVSIKLASAA >NZ_CP040640|1394020:1402298|1395536_1396814_+|WP_168208030.1|DBSCAN-SWA MAPTDKPQSHGPAFAAALFDTSENDLMIGGLPVRDIVAGTGTPCFLYDAAAMRRAYRDLETALGGFADIYYSVKANPLPAIVSLFRQEGAGAEIASVGEYRAAIKAGVAAENIIFAGPGKGMAELREVIEGGIGEIHIESAEEIARIEAIGKPVKASIRINPVPDAQAGAMRMGGKATAFGFDEEELENVLTLFKDARYIDLVGVHIYGGTQILDADMLVSQWRHGISLAARTAEILGRPLETIDLGGGLGIPYFAGETPLDLEKVSAAIADLKALLKAHPLITDAHVIVEPGRFLAGPGGVYVAEVNSVKTSRGTTFVVMDGGMHHHLAASGNLGQIVKRNYPIVAPAMMRAEHEETATIVGPLCTPLDTLARNAALPKLKAGDLLAILQSGAYGASASPAGFLSHAAAKEVLVENGVFEVIGR >NZ_CP040640|1394020:1402298|1394020_1395553_+|WP_142779293.1|DBSCAN-SWA MTPHFLLHHLLTARAASDDQALVHKDRSLNYREFSAAAARCAAALQEAGAQRGDRVVIYLPRGIEECWSIFGVSMASGVFVPVNALLKAQQIRHIVKDCGAKIVISDAAMMDELKAALEDLPDVTVLLAEEIEARADTPARPSAAIGEDLAAILYTSGSTGSPKGVMLSHRNLLAGARIVRTYLDITGSDRILSLLPFSFDYGLNQLLTAVEQGAATIISTFRLGDEIVRDLRDHAITGLAGVPTIWAILTKAAPSLTKTPLPHLRYITNSGGRVPQETVKALREKLPDTKIYLMYGLTEAFRSTFLPPEEIDRRPTSIGKAIPECEIFIVTAEGQRAKPGEPGILVHRGPTVSLGYWNRPEDTAKVLRPHPFIPAALGGETVCYSGDLAVEDEDGFFSFVARNDAMIKSSGYRISPTEVEESLMSTGLFQQVAVIGLPDPFAGEKVHAVATAANQNIDVTAALKKAAEMLAPFMIPRAIELVERLPVTANGKVDYRALVRERTDNGANG >NZ_CP040640|1394020:1402298|1401803_1402298_-|WP_020810243.1|DBSCAN-SWA MTIAFYPGSFDPMTNGHLDVLIQALNVASKVIVAVGIHPGKAPLFSFEERAGLIKRALAEHLPGEAARMEVVSFDNLVVDAAREHGARLLLRGLRDGTDLDYEMQMAGMNRQMAPDIQTVFLPAGTSSRPITATLVRQIAAMGGNVDAFVPKAVLEALNAKLKR |
8 | uncultured_Mediterranean_phage(66.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1418701 : 1427257
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP040640|1418701:1427257|DBSCAN-SWA CCTATTCAAGGAAACTGGAGGGGTTCACCGGCGTCGCATCCTTGCGAACTTCGAAGTGAACCTGCGGACGCTTGGCGCTGCCGGTCATGCCCGATGTGGCGATGGTCTGGCCGCGCTGAACCTTCTGGCCGCGCTGCACGTCGAGATTGGCCGCATTGCCGTAAACGGTGACCTTGCCGTCGTCATGGCGGACGAGAACCGTGTTGCCGAGCTGCTTCAGGCCGTTGCCCGCATAGATGACCACGCCGTTTTCGGCAGCCTTGATCGGCGTGCCCTCGGGAACGGAGATGTTGATGCCATCGTTGCGGCTGCCTTCGACATTGTCGCCGAAATTGTTGATGACCGCGCCGCGAACCGGCCAGCGATATTTGCCGATGCCCGTGGATTCCGGTGCGACGGAAGCCATGTCGGCCTTTTTCTCGATGTCGCTGACACTGGCGGTGGCCGGTGTGGCAGGCGCAGTCGGAGCCGCGGCAGCAGCCGGTGCCTTGTAGGGCTCAGGCTTGGCGGATGCGGTTTCGACCGGCTTGGCCGCTTCCTTGGCAGGAACGGAAGCCGTCTTGATGGCATCCGTCGAAGCGCCCGGCATATTGAGCGTCTGGCCGACGCGGATGCTTTCGTTGCTGAGACCGTTTGCCGCCTTGAGGGCAGCGACCGACACGCCGTTTTCGCGGGCGATCTTCGCCAGGCTGTCGCCGGCCTGAACCTTGTAGCCACCGGTCGGCGGCAGCGGCTTGCCGCCGGGAGGCGTCAGCTTGCCGGTCTCGGAGGAAACCTTGTCACGGGCGGCCGCCTGCGACGGCAGCACGGCCACGTTCCCTTCAGGACTACGCAGCGGTGTCGGCTGGTCGCCATTGCGGTTGAGAGCGATATCGCCGGCCGCGTCCTTGGCGGCATTGCGGACCTGACCGAACTTCGGAATGAGGATCGACTGGCCGGCCTGCGCCGAGGATGCGGTCTTCAGACCGTTGACGCGCAGCAGTTCCTTTTCCGGAACGCCGTAGCGGTTGGAGAGGGTCGCGATGCTTTCGCCGGGACGCAGGGTCACGGAGGAAGCGCCATCGGTATGCCAGCCGTTCTTGTCGGAACGAACGGTTGCGGTCGTCAGATTGTCGGTGACGGGAGCGGCAGCCTGCCGGGCGGGCGCAGCAAGAGACGGCGCGGCCTTGGGCTGAGCGGACGGGAAAGGCTGGGCCATGGCTTCGTTGCGGCTCGACGGATCGCGGCTTGCCACCGCCGTCGGCGCCGACAGCTCGGAGCGCTGCACCGGAGAGGCCGAAGCCGAGCGGGCCGTGGAAGGCTGAACGCCGCCATAACCACCCGATTGCGGGTAGGAATTGCCCGCCATGTTCTGGTTGGCATAACCGCCCTGCGGTGCGGCAGAGGCGTAACCGCCATTCATGTCGCCCTGCGGAACCGGGGCCTGCCCGTAAGCACCGCCGCCCTGACGGGCAGGAATTGACGCCGTGGTCATCTGATCCGGTCCACTGGAAAAGAGGCCGCTGAACCGCGTTGCATCCGAGCTACAGCCCGTTGCGACGCTTGCCAGCAATGCCGCTGCGAACATTTTTGCGACTGATTTTCCGATTTTAGACGAATGTCTCATACGCATGACTCGATCTACACTGACGCCAACCCGCCGCATCGAGACCGATCCGGCACAAATTTCACTTCTTGCAAGGCCCATTTCCCGAGGTGGGGAAAACCATGCGTCGATGTTTATGATTAAAGCGCGTTAGTGTTACCACGCCGTTAAAACGTTAAGCCGTTCAGCGCTTTTTTGACACTTCGATAACCATGACGAAAACCGGATGCGTCATGTTGATACAATAATGATATTGAAAATCAGTTAGTTGCGACAATTCAGAACATTAGGAGACAAAAATAGAATGTCGCCAAAATTTTCATTTTGTCACAAATGACGCGCAATATGAGTGCTGAGCGGCAGGTAAGGCGCCTCGAACAGCTCTTCCTTCTCGAAGCGGCTGCCGGTCTTGGAGAAGCGCGTCATCACGCAACGCTCGTCTTCAAGGATGATCGGCGCGATCATCATGCCGCCCGAAACGATCTGCTCGGCAAAGAAGCGGGGCATGGTGGTGAAGGCGGCGGTGGAGACGATGCGGTCGAAGGTGCCCTCGCCCACCAAGCCGTTGCTGCCATCCGCCTGACGGATGACGACGTTGCGAATGGAGAGATCGTCCAGACAGTTCTGCGCCTGCTGCACCAGCGTCTTGTAACGCTCCAGCGAAAAGACGCGCTCCACCCGGCGCGCAATGATCGCGGTCATGAAACCGCTGCCCGTGCCGATTTCCAGCACGCGCTGGCCGGGTTTCAGCTGCAGCCGGGCAAGGATCTTCACCGCCATGTCGGCGCCTTCCATGAAGGCGCCGCAGTCGATCGGGATCGTGCGGCTGGAATAGGCGTCCGCCGCAAATTGCGGCGGCACGAATTTGGAGCGTGGCGTCTGCTCGACCGCCGTCAGAAGGTCAAGATCGGAAATCCCCTCGCCCCTTAAGCGGAGAACGAGAGCGGCAAAACCTTCCTTTTCGACCATTGCGGATTTCAATCGGCGACTCCGAATCCAAGTGCTTGCGCAACCCGATCCTTCACCGTGTAGTCCGTCAGGTCAAGCTTAAGCGGTGTGACGGAAATCTTGCCATGTTTCAAAGCGTGGATATCGGTGCCTTCGCGGAAGGTGCCGAGACGTTCGCCGAAGCGCAGCCAGTAATAGGGAAAACCGCGGCCATCCTGACGTTCTTCCACCGTCAGGCCGAAATCGAGCTTGCCCTGCCCGGTCACGGAAACGCCTTGCACATCCTTCGGAGCGCAATTGGGGAAATTCAGATTGAGGAAGGTGCCATCGGGCAGTTCCACATTCATCAGCTTGCGCAGAAGGTCCGGCGCATAGGTCTCGGCGACTTCCCACGGCACGACGCGGCCATCGGCGTGGCTGAAGGCCTGGCTGAGCGCAAAGGAGCGCACGCCCTGCAACGTGCCTTCGATGGCGCCGGCAATGGTGCCGGAATAGGTCACGTCATCGGCCATGTTGGCGCCGGCATTGACACCGGAGAGCACGAGATCCGGCTTTTCCGGCAGCACTTCGCGAATGCCCATGATGACGCAATCGGTCGGCGTGCCGCGCAGCGCGAAATGCTTGTCGGAAACTTTTCGCAGACGCAGCGGTTCGGACAGGGTCAGCGAGTGCGCGAGACCGCTCTGGTCGGTCTCCGGTGCGACGATCCAGACATCGTCGGACAGCGTGCGGGCGATACGCTCCAGCACGGCCAGACCCTCGGCGTGGATACCGTCGTCATTCGTCAGCAAAATCCGCATGTTTTCCTCCGCTGCTGTATCCCACTATGAGGACACAATGATTTCTCTCACAGAAGCACCCCCACGACCGTCATTCCGGCCTTGAGCCGGAATCCAGTCGACGCGCGTCTGCGCGGCGGGAAGAGTCTTTTTCAGCCCAAGGACTTGGGCTGGCTGGATTCCGGCTCAAGGCCGGAATGACGGCCGCGCCCTTAGCAGACGAACAATCAGGCCGCTTTCTCAATCCGCTTCAGACCGCCCATATAGGGCAGCAGCACGTCTGGAATTGTGACGGAACCGTCCTCGTTCAGATAATTTTCGAGAACCGCAATCAGGCAGCGGCCGACAGCCGTGCCGGAGCCGTTCAGCGTGTGCACGAACTTCGTCGCCTTGTCGTCCTTGCCGCGATAACGCGCATTCATGCGCCGCGCCTGGAAATCGCCGCAGACCGAGCAGGACGAAATTTCGCGATAGGTGTTCTGGCCGGGCAGCCACACTTCCAGATCGTAGGTCTTGCGGGCGCTGAAGCCCATATCGCCGGTGCAGAGCGTCATGGTGCGGAAATGCAGGCCAAGGCGCTTCAGCACTTCCTCGGCGCAGGCCGTCATGCGCTCGTGTTCGGCAACGGCGCTTTCCGCATCGGTAATGGAGACGAGTTCGCATTTCCAGAACTGGTGCTGGCGCAGCATGCCGCGCGTGTCGCGGCCGGCCGAGCCCGCCTCCGAACGGAAGGACGGCGTCAGCGCCGTGAAGCGCAGCGGCAGCTTCTCCTGGTCGAGAATTTCGCCGGAGACGAGATTGGTCAGCGTCACTTCCGCTGTCGGGATCAGCCAGCGGCCATCCGTGGTCCTGAACAGGTCCTCGGTAAACTTCGGCAATTGGCCGGTGCCGAACATCGCCTCGTCGCGCACCATCAGCGGCGAAGACACTTCGGTATAACCGTGCTCGCCGGTGTGCAGGTCGATCATGAACTGGCCGAGCGCCCGTTCCAAACGCGCGAGCTGGCTGGTCAGGACGGTAAAACGCGCACCGGAAAGCTTGGCGGCACGCTCGAAATCCATATAGCCGAGCGCTTCGCCGATATCGAAGTGTTCCCTGGCCTCATGGTTCCAGCCGGGCTTCTGGCCGACCACGCGGGCCACCACATTGTCATGCTCGTCATTGCCGTCGGGCACATCGTCGAAGGGCATGTTGGGCAGGCGCGACAGGGCATCGTTGAGCTCGGCGGTGACCTGACGGTCTTCCTCCTCCGCGCGCGGCATGTTGTCCTTGAGATCGGCGACTTCGGCTTTAAGCTTTTCTGCAAGCTCCATGTTCTTCTGCGCCATGGCGGCGCCGATTTCCTTGGAGGCGGCGTTGCGGCGCGATTGCATGTCCTGCAGGGACTGGATGACGGAACGGCGCTTTTCATCCAGCGCGATCAGGCCGCTGGCGGCGGGCTCCGCGCCACGGCGCGCAAGCGCCGCATCGAAAGCTTCGGGGTTTTCGCGTATCCATTTAATGTCGTGCATCGTCGTTCCAGACCGTTGTTGCATCGTCGCGTTTTACAGCAAAACGCCGGGCATGAGCCCGGCGCGAGGAAATGTCGTGAGCGGCGAGACCTTCAAGAATGTCAGGTCTCCTCCAGCTCCGTGCTTTTGGATTCCGCCGCACGTTTCCTCTCCACGAGTCGAGCCATGTAGATGGAAATCTCGTAGAGAATGATCGCAGGCAGCGCAAGACCGATCTGGGACATGGGATCCGGCGGCGTCAGCACGGCGGCGACCACGAAGGCGAGGACGATCGCGAACTTGCGCTTTTCCCGCAGCCAGTCGCTGGTCAAAAGCCCGACGCGCGCCAGAAGCGTGGTGACGACAGGCAGCTGGAACACCAGACCGAAGGAGAGAACCAGCGTCATGATGAGGCTGAGATATTCCGACACCTTCGGCATCAGCGAAATCGCCACCTCGCCATCCTCGGGCAATTGCTGCATGGCGAGGAAGAACCACATGACCATGGGCGTGAAGAAGAAATAGACGAGCGCCGCACCGATGAGGAAAAGGATCGGCGACGCGATGAGGAACGGCAGGAAGGCCGCCCGTTCGTTCTTGTAGAGACCGGGCGCCACGAATTTATAGAGCTGCGAGGCGACCACCGGAAACGAGATCACCATCGCGCCGAACATGGCGACCTTGATCTGCGTGAAGAAGAATTCCTGCGGCGCGGTATAGATCAGCGACGATTTCGTGACATCGAGGCCCGCCCAGAGAACCGCCCATTTGTAAGGAATGACAAGCAGGTTGAACAGGTGCTTGGCAACGGCGAAACAGGCGATGAAGGCGACGAAGAACGCGCCGAGCGCCCAGATCAGCCGCGTGCGCAACTCCATGAGATGCTCGATGAGCGGCTGCGGCTTGTCCTCGATATCCCCGCTCATGCTTCATCCTTTTTGGGCTTTGCGGCCTTGGTTTTTGCCGTTTTCGCCGGCTTGGTCTCGGCCACGACCGCCTTGTCTGCGGCAGACTTCCTGACGGCAGCCTTTTTGGCGACGGCCTTCACGGTGGGATTGGAAACAGCGGGCTTCGAAACGGTGGGCTCACTGGTTTTCGCGGCGGTTTTCGTCGCCGAAGCGGTTTCGGGCGCTGCCGCCGCAGCCTTGCTGCGGGCGGCGCGTTTGGGCTTGGCGGTAACCGCTTCGGCCTCCACGGTCGCAATCGATTTCGCCCGCGCGCGCTTCGGCTTCTCCTCCGCAGCGACGGCTGCGGCAACCGGCGCGGCGGATGCGGCGACAACAGGCGGCGCATCCGGCAGCTTCATCTCCGGCTCGGGCACGCTCACCAGCGGCGCAACCGGCTCGCTTGTGACGGGCGCAGGGGTGGAAGACAGGCCGTCCGGCGGGGTGGTCGCCTTCTGCAGATCGGACTTGATCTCATTGCCAAGCTGGCGAAGCGGGTTCATCGCATCGCGCAGCGAATTGGTCGGGTTGAGATTGCGGACATCGGAGATGGTCTGGCGCACATCGTCCATGTCGGCCTCTTTGAGGGCCTCGTCGAACTGGGTGCGGAAATCCCCCGCCATCTTGCGAAGGCCGGCCATCGTCTTGCCGAAAGCGCGGATCATGGGCGGCAAGTCCTTCGGCCCGACGACCACGATCAACACGACCGCAATCACCAGAAGCTCGCTCCAGCCGATATCTAACATCAATATGCTCCCGAGACCCTAGCGCATCGGCCCGAAAAATCGGAATCGATTTTCGACCGGAACGATGCGCAACATACAAGTGGTAGAGCGTCCTTGTGCGTCCGAAAGAACGCACGGCGCTCTAGCGCCAGTCCCACACCAAGCGGCGACAGGCCGCTTACTTGATCTCGTCAGCCTTGTGGTCGACGGTCTTCGTGTTGGCATCGGCCGGCGGCGGCGTCTGGTCTTCGTCAGCCATGCCCTTCTTGAAGCTCTTGATGCCCTTTGCGACATCGCCCATCAATTCCGGGATCTTGCCGCGTCCGAAGAGGACGAGAACGATCACCAGCACGATGAGCCAATGCCACACACTAAAAGAACCCATAACTGAAACTCCTGAATTTCGCTTTCAGACGATGTAAGACGTTTGAAAGGCTTTTTCAAACGACAAATCGCCGCTTCGACAGCGAGTGCGCATAAAGTTTTCGTGGCCCCGCCAACAGGATGGCGATATCAGTCGTCGCCATCGCCGCCGCCCGGCGCAAGCAGGCCCAGTTCCTCGAGATCGAGCTGCGTGATCGGGTCTTCGTCCTCCCGCAATTCGTCGCTCATCATCGGCAGAGGAACGCCGAAATTGGAGGGAATGCGGCCCGAAAGCAGCCCTGCCCCCTTAAGCTCCTCGAGGCCCGGCAGATCGCGCAATTCCTCAAGGCCGAAATGGTCGAGAAACTCCACCGTGGTGCCGATCGTCACCGGCCTGCCGGGTGTGCGCCTGCGGCCACGGAACCTGACCCAGCCCGCCTCCATCAGCACGTCAAGCGTGCCGCGCGAGGTCTGCACGCCGCGGATTTCCTCGATTTCGGCGCGTGTCACCGGCTGATGATATGCGATAATAGCAAGAACTTCCAGCGCCGCGCGCGACAGCTTCTTCGGCTCCTTTTCCTCCGCGCGGATGACGAAGGAGAGATCGCCGGCGGTGCGGAAGGCCCATTGCCCGCCCACCTGCACGAGATTGACGCCCCGCTCCGCATAAAGAACCTTGAGATGCTGCAGAATGGTGACGACATCCATGCCGCGCGGCAGGCGCTCGGCGATGAACCCCACGGAAACCGGCTCGGACGAGGCAAAGACCAGGGCCTCGGCGATCCGCTCGGCCTCCTTCAGCTGGCGCTCGGAAAATACCGTGGGTTCGGGCTCTGCCCCATCGGCCGCGATGGTGGCCTCGGCGGTCTCATCGTCATTCATCATTGTCAGTCCTTTCGGCATTCACCGCACGGTCGTCTCGTCCACCCTTACGCATATAAATGGGCTGGAAAGCGCCCTCCTGCCGGATTTGCAGCGAGCCCTCGCGCACCAGCTCCAGCGAAGCGGCAAAGGCGCTGGCAATCGCCGTCACCCGCATGGCGGGATCGGGCACATATTGCAGCAGATACTGATCGAGCACGGTCCATTCCCCGACATCGCCGAGGAGATGGGTCAGAAGCTCACGCGCCTCGACAAGCGACCAGACCTGCCGTTTTTCAATGGTGACCTGGGTAATCGCCTGCCGCTGCCGCAGGTTGGCATAGGCGCTCAGGAGATCGTAGAGGTTCGCCTCATAGGCCGAGCGGTTGATATGGGGAATATGCTCCGGCGCGCCACGGGCAAAAACATCGCGGCCGAGCTGGGCGCGGTTGACGAGGCGTTCCGCCGCCTCGCGCATCGCCTCCAGGCGTTTCAGCCGGAAGGCGAGCGTCGCGGCCATTTCCTCGCCCGAGGGACCGTCATCCTTCGATTGCTGAGGAATAAGGAGTTTGGATTTGAGGAAGGCGAGCCACGCCGCCATGACGAGATAATCGGCCGCAAGCTCGATGCGCACGCGGCGCGCGCTTTCCACGAATTGCAGATATTGTTCGGCGAGCGCCAGCACCGAAATGCGCGACAGATCGACCTTCTGCGTGCGGGCAAGATGCAGCAGAAGATCGAGCGGGCCTTCGAAACCCGCGACATCGATGACCAGCCCGGCCTCGCCCGTCAGCCGCTCCGGCGTCACATCCTGCCAGAGCTTGTCCATCGGTGTCGAATTGCGAGACTTGTCTGCGGCCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP040640|1418701:1427257|1423624_1424428_-|WP_003522727.1|DBSCAN-SWA MSGDIEDKPQPLIEHLMELRTRLIWALGAFFVAFIACFAVAKHLFNLLVIPYKWAVLWAGLDVTKSSLIYTAPQEFFFTQIKVAMFGAMVISFPVVASQLYKFVAPGLYKNERAAFLPFLIASPILFLIGAALVYFFFTPMVMWFFLAMQQLPEDGEVAISLMPKVSEYLSLIMTLVLSFGLVFQLPVVTTLLARVGLLTSDWLREKRKFAIVLAFVVAAVLTPPDPMSQIGLALPAIILYEISIYMARLVERKRAAESKSTELEET >NZ_CP040640|1418701:1427257|1418701_1420267_-|WP_168208031.1|DBSCAN-SWA MFAAALLASVATGCSSDATRFSGLFSSGPDQMTTASIPARQGGGAYGQAPVPQGDMNGGYASAAPQGGYANQNMAGNSYPQSGGYGGVQPSTARSASASPVQRSELSAPTAVASRDPSSRNEAMAQPFPSAQPKAAPSLAAPARQAAAPVTDNLTTATVRSDKNGWHTDGASSVTLRPGESIATLSNRYGVPEKELLRVNGLKTASSAQAGQSILIPKFGQVRNAAKDAAGDIALNRNGDQPTPLRSPEGNVAVLPSQAAARDKVSSETGKLTPPGGKPLPPTGGYKVQAGDSLAKIARENGVSVAALKAANGLSNESIRVGQTLNMPGASTDAIKTASVPAKEAAKPVETASAKPEPYKAPAAAAAPTAPATPATASVSDIEKKADMASVAPESTGIGKYRWPVRGAVINNFGDNVEGSRNDGINISVPEGTPIKAAENGVVIYAGNGLKQLGNTVLVRHDDGKVTVYGNAANLDVQRGQKVQRGQTIATSGMTGSAKRPQVHFEVRKDATPVNPSSFLE >NZ_CP040640|1418701:1427257|1425349_1425556_-|WP_003509722.1|DBSCAN-SWA MGSFSVWHWLIVLVIVLVLFGRGKIPELMGDVAKGIKSFKKGMADEDQTPPPADANTKTVDHKADEIK >NZ_CP040640|1418701:1427257|1422239_1423523_-|WP_059760200.1|tRNA|DBSCAN-SWA MHDIKWIRENPEAFDAALARRGAEPAASGLIALDEKRRSVIQSLQDMQSRRNAASKEIGAAMAQKNMELAEKLKAEVADLKDNMPRAEEEDRQVTAELNDALSRLPNMPFDDVPDGNDEHDNVVARVVGQKPGWNHEAREHFDIGEALGYMDFERAAKLSGARFTVLTSQLARLERALGQFMIDLHTGEHGYTEVSSPLMVRDEAMFGTGQLPKFTEDLFRTTDGRWLIPTAEVTLTNLVSGEILDQEKLPLRFTALTPSFRSEAGSAGRDTRGMLRQHQFWKCELVSITDAESAVAEHERMTACAEEVLKRLGLHFRTMTLCTGDMGFSARKTYDLEVWLPGQNTYREISSCSVCGDFQARRMNARYRGKDDKATKFVHTLNGSGTAVGRCLIAVLENYLNEDGSVTIPDVLLPYMGGLKRIEKAA >NZ_CP040640|1418701:1427257|1420612_1421266_-|WP_003522731.1|DBSCAN-SWA MKSAMVEKEGFAALVLRLRGEGISDLDLLTAVEQTPRSKFVPPQFAADAYSSRTIPIDCGAFMEGADMAVKILARLQLKPGQRVLEIGTGSGFMTAIIARRVERVFSLERYKTLVQQAQNCLDDLSIRNVVIRQADGSNGLVGEGTFDRIVSTAAFTTMPRFFAEQIVSGGMMIAPIILEDERCVMTRFSKTGSRFEKEELFEAPYLPLSTHIARHL >NZ_CP040640|1418701:1427257|1424424_1425192_-|WP_142779309.1|DBSCAN-SWA MLDIGWSELLVIAVVLIVVVGPKDLPPMIRAFGKTMAGLRKMAGDFRTQFDEALKEADMDDVRQTISDVRNLNPTNSLRDAMNPLRQLGNEIKSDLQKATTPPDGLSSTPAPVTSEPVAPLVSVPEPEMKLPDAPPVVAASAAPVAAAVAAEEKPKRARAKSIATVEAEAVTAKPKRAARSKAAAAAPETASATKTAAKTSEPTVSKPAVSNPTVKAVAKKAAVRKSAADKAVVAETKPAKTAKTKAAKPKKDEA >NZ_CP040640|1418701:1427257|1425684_1426419_-|WP_142779310.1|DBSCAN-SWA MMNDDETAEATIAADGAEPEPTVFSERQLKEAERIAEALVFASSEPVSVGFIAERLPRGMDVVTILQHLKVLYAERGVNLVQVGGQWAFRTAGDLSFVIRAEEKEPKKLSRAALEVLAIIAYHQPVTRAEIEEIRGVQTSRGTLDVLMEAGWVRFRGRRRTPGRPVTIGTTVEFLDHFGLEELRDLPGLEELKGAGLLSGRIPSNFGVPLPMMSDELREDEDPITQLDLEELGLLAPGGGDGDD >NZ_CP040640|1418701:1427257|1426408_1427257_-|WP_003522723.1|DBSCAN-SWA MAADKSRNSTPMDKLWQDVTPERLTGEAGLVIDVAGFEGPLDLLLHLARTQKVDLSRISVLALAEQYLQFVESARRVRIELAADYLVMAAWLAFLKSKLLIPQQSKDDGPSGEEMAATLAFRLKRLEAMREAAERLVNRAQLGRDVFARGAPEHIPHINRSAYEANLYDLLSAYANLRQRQAITQVTIEKRQVWSLVEARELLTHLLGDVGEWTVLDQYLLQYVPDPAMRVTAIASAFAASLELVREGSLQIRQEGAFQPIYMRKGGRDDRAVNAERTDNDE >NZ_CP040640|1418701:1427257|1421262_1422033_-|WP_003522730.1|DBSCAN-SWA MRILLTNDDGIHAEGLAVLERIARTLSDDVWIVAPETDQSGLAHSLTLSEPLRLRKVSDKHFALRGTPTDCVIMGIREVLPEKPDLVLSGVNAGANMADDVTYSGTIAGAIEGTLQGVRSFALSQAFSHADGRVVPWEVAETYAPDLLRKLMNVELPDGTFLNLNFPNCAPKDVQGVSVTGQGKLDFGLTVEERQDGRGFPYYWLRFGERLGTFREGTDIHALKHGKISVTPLKLDLTDYTVKDRVAQALGFGVAD |
9 | uncultured_Mediterranean_phage(75.0%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP040641_1 | 1459265-1459571 | Orphan |
NA
Consensus repeat of NZ_CP040641_1
|
2 spacers
spacers of NZ_CP040641_1
>1.1|1459326|62|NZ_CP040641|PILER-CR TTCACTGCCACAGCGACGTACAGGCCGTTGCTGTAGGTGATATCGCCCCAAGCGTTCGCCTC >1.2|1459449|62|NZ_CP040641|PILER-CR CTGCCCGCCACGCCAACGAAGAGGCCGCCCCCGTAGGCGACATCACCCCAGAAAGCTGCCTG |
CRISPR arrays and Neighbor proteins around NZ_CP040641_1
The CRISPR arrays of NZ_CP040641_1 >merge|NZ_CP040641|1|1459265-1459571|PILER-CR GGGCGCGGCGCGGGCGGTCCAGGTGATGCCGTCCGGCGAGGTCATGACCCGATTGGTGCCGTTCACTGCCACAGCGACGTACAGGCCGTTGCTGTAGGTGATATCGCCCCAAGCGTTCGCCTCGGGCGCGGTACGAGCCGTCCAGGTGATACCGTCCGGCGAAGTCATGATGCGGTAGGTGCCGCTGCCCGCCACGCCAACGAAGAGGCCGCCCCCGTAGGCGACATCACCCCAGAAAGCTGCCTGCGGCGCGGCGCGTGCGGTCCAGGTGATGCCGTCCGGCGAAGTCATGATGCGATGAGTGCCT >NZ_CP040641|1|1|1459265-1459571|PILER-CR GGGCGCGGCGCGGGCGGTCCAGGTGATGCCGTCCGGCGAGGTCATGACCCGATTGGTGCCG TTCACTGCCACAGCGACGTACAGGCCGTTGCTGTAGGTGATATCGCCCCAAGCGTTCGCCTC GGGCGCGGTACGAGCCGTCCAGGTGATACCGTCCGGCGAAGTCATGATGCGGTAGGTGCCG CTGCCCGCCACGCCAACGAAGAGGCCGCCCCCGTAGGCGACATCACCCCAGAAAGCTGCCTG CGGCGCGGCGCGTGCGGTCCAGGTGATGCCGTCCGGCGAAGTCATGATGCGATGAGTGCCT
>NZ_CP040641.1|WP_142781086.1|1457873_1459085_-|DUF418-domain-containing-protein MNDRIANMDAIRGFALFGILVVNILAFSSVWYGSGFPAPGNRSVLDEVLAFLVSALFELKFYLLFSFLFGYSVTLQMQSAEKAGATFLPRMMRRQAGLFLIGILHAVFLFHGDILSTYAILGFTLLALRHLRGQTKLRLALLLVLATALFWLVLAWLQGAAVPPPFDPAALNADAAASIAAWRGGPLTVVGEHLAALEDFLPLLLLLQAPCAFAMFLVGFVAGRKRLFLHRDVYGPLLNQSLAWGLLIGLPGGLIYATAAQYAPGTAVETAGIALSILTSPFLSLAILAGLLKLLDSGRVERLRDCFASLGRMALSNYLLQSLTCAFIFHGYGLGLVDRLAISQVLGLGVLVFIMQMLLSCWWMNRFHYGPLEWLLRAATVWHYPGWRKKVTKQGEPARDNRR >NZ_CP040641.1|WP_142781085.1|1457427_1457856_+|VOC-family-protein MEPDQWPALVPELTCTDLAASRRFYCDVIGFSVRFERPEDAFVYLELGNAHLMLEQVHADSWVAEGLDPPFGRGMNLQIEVAALAPIIDRIRATGLGFYREPAEAWYRDGDVEYGQTELLVQDPDGYLLRLVEVLGERSSAA >NZ_CP040641.1|WP_142781084.1|1456939_1457356_+|nicotinamide-mononucleotide-transporter MLVDLQVFTRDVAQKSDSGLMWRGFGFLASLQMSGKIVSGGTSPARHRLSQQPKKRISLFEYYGLDWLLLASGLTTKYLMIHQNRWAFATSILGCLAGLAVALMASQHGIALYNLILIGMSCTGFVHWGRLTRSRVSA >NZ_CP040641.1|WP_142781083.1|1456285_1456840_-|hypothetical-protein MLHRLIPPSIRGDLVLRGYEELPDLPDNLKVGGYLDLSRCRNLKGLPGNLRIKSYLDLTDCTGLMFLPDDLRVGESIILTGCTGITSLPDGLSAGGSLKLTGCMGLTALPRKLRIGGNLDLEGCTGLASLPTGLAVKGSLLLRRCAGLTNLPHKLTVGGDLDLTGTGITTLPDNLRVGGYIFRD >NZ_CP040641.1|WP_142781082.1|1455464_1456253_+|hypothetical-protein MQLKSSIFTNPAQRAGLEKYYDAQKSELLASLKAREADPSFQATFTSELTMPDGKTLSGKGWRITSEMAEKAMVSFDKWLEIMADTYESQETLFDMAQQRMTMLEAENPDTSSHVRTAFSAGGELLAYINEDGSLVTSNIGPRHGETMTHTALELKLQAILQQADAMRLSGENRIDYLNREVRNALSYERGDVAMTSYDSGASPTKREFGKAWHTTFDVDQVYADALADARASYDSTKVLHDQWQENLRKMQSFLLGLQETA >NZ_CP040641.1|WP_142781081.1|1454517_1455201_-|hypothetical-protein MLNFTVPPTVNGNLDLTSCRDLDLLPRGLTVRGSLNLTGCTDLAALPEGLNVLGSLYLAGCTGLTALPDDTRVGNSAYLNGCTGLRSLPRGLTVRGRLDLSGCEGLNLLPDGLMVGGNLYLRGCRNLLAMPDDIIVKGRLDLAGCTGLTALPDNLTVGGRLDLTGCKGLIRLPEGLSVGGSLHLTGCSSLTQLPRTLKVAGNIHISGCTKIEVLPDDLEVGGSIIHK >NZ_CP040641.1|WP_142781080.1|1454048_1454459_+|GtrA-family-protein MKFTRDALRRFLTYALVGGGTFGLDRLLMAGCLRLGMAYPVAVYIGFFLGVSLNYLISRRYVFRGTSRSMEMGYFNMLTVAAMGAFATSSLSVLIVRGFDVDMLLARLPVAAMVGVGNYLFNLYANFDVAGRHHAR >NZ_CP040641.1|WP_142781079.1|1453107_1453785_-|hypothetical-protein MPHSPFPPSVVGNLDLSHCTDIVLPEGFKISGSLYLLGCIDLTVLPDNLDIGDSLYLVGCSSLTMLPRGLKVGGDLYLIDCTGLTTLPDDLKVTRSLHLGGCTALKTLPDDLTVGGWLDLTGCKALRALPKRLKVGGWLRLNDCTSLAALPTDMRIGGSLYLTGCTGLIAASPGLLAVGGHHSLVTRFKAIDAVLLMTLRKFKTVDLMIIAALAGSGYLALFRFL >NZ_CP040641.1|WP_142781078.1|1452675_1453008_+|lysozyme-inhibitor MNKLILIFAATLPVLSSCAEVSGSSPSVAPIPESETTTYQCNDGRVVSADFENDAERVVLRSKGAIFARLNAKPAASGIWYEGQGYTLRGKGAQANLTGPDGRTFDCVSN >NZ_CP040641.1|WP_142781077.1|1452015_1452528_+|hypothetical-protein MLIQKLIASFVALVVFTSSAFAACDCVPSGTGIPASALLPSVDYGQQIKVAVVQPKAMVKTAAKTTKTVSPPAKDEMVDCLTGTSANSIEIVWAMEDTGACPGKILYKDAVRLAAKAAAGAVEFRTPFYIPSKCDSGWAKVPTKVEDGKIKLYPWRKTCVSGYFVTYVVK >NZ_CP040641.1|WP_142781088.1|1460815_1462555_-|hypothetical-protein MAEHIHGSLDSLRLYLTQLKTKAEDLDRECARLRAIVSQEIKRVEDLIQKQVQTTTGRRNRTLGIWIVAFAAAWWAPIAPTLAQTGTFVDVMTRDPATTNVQNRLCYTTNGRVDIGCPADAPYLDVATGRLGIGTTNPTQALDVSGSIYVGGTLGTGAGGSMPVISQNQIPLLHTFTPTGTNGGNLFLGRGSGNFTMAYISSANDASYNTGVGMDTLQGITTGKYNTAIGWTALASTSVASYNTAVGIAAMAKNVTAGSNVAVGAYSLYNANAGVDNVVIGGSSLLAMRSSFRNTTVGAYTMSNSGSGNDNTVVGHASMQYKNGSFNTTVGKNAGGVNGGTITGTVALGFEAGKSLATNTSNTLVGYRAGASLTTGSNNIIIGASTDAPVAAGSNQLNIGNAVWGDIGSGSGHANKLGVNVSSPSSSLHVSGTLRLTGGSETCDTNRLGAIRYTSGSFDVCRSIANGWEPLATTGKDSAVDRITSSSMAGVTANATGYISLTTGGITGTAYFSPNSVLVNKGISATGGVSATQGYFAATLEVSGAIKISGDGTEGCGSTSDKGKMRINPATGRLQICVD >NZ_CP040641.1|WP_168208211.1|1462573_1464064_-|methyl-accepting-chemotaxis-protein MLNSIRAKLTLLALISILSILAIGGVGFYGFKQLEDAILKANGDTIPTLVTSGRMSFDLARLHTLDAQYMGEPHKEDRERLLDQRVGIVTQIEKTQKEYEQLSSLPDEPKVYAEFKGAFATYREQKRALSALVEEGKVEEATTLFDGTMKTAYNEAVMSMQKIVRMNAEAAKVRAENANRTENFLSALMGINVVVAMVIVAILLIAILRSVLRGLAELERCLKALSQLDLRVVASAGTKDEIGRILEMYNSTLGKLKTVIAETKEASSTVSAASSELSSTMDVLTNATGEQSAALAEIASAVEETSSSAMSVKERTEHSVTATNDVASEFDTATESLRELQSAAAGIEEARGVIQAISEQINLLALNAAIEAARAGDAGRGFAVVADEVRKLASSTGVSTQQITERIAKLKHSVDKIAGSLSRSVSLVDGVKDNGRAMLGSVTEQTAAIEQISRSMQEFQDQMDDMVRSIQESKTASTGLSETAVGLSGTAGRFNT >NZ_CP040641.1|WP_142781090.1|1464108_1464750_-|hypothetical-protein MKIILLALAALSLITAAFAQERTAAGTVENQMSWSALNTKIATANSKADAVNSRVEQVVVCGRKGMLYAPGQAGADGQGCVVSKLDSSYVNMLNDINSNLTNINSCAANGSVYNRSAHSCLPVKMPDPATLNIGTYNQTLCTRGGTHTVVSSCPGGQRLLGCGGGPGDQDESHEYWVLMPDFAANRCIGYVGNPRCYDDGWSRTIVSAVCYRP >NZ_CP040641.1|WP_142781092.1|1464779_1465418_-|hypothetical-protein MKTPLLLILLLLALLSSVHAQERTAAGTMETQMSWSALSSKIGTVDAKVFGINSLINQDIACGRKGMLYAPGPGADGQGCMKPFVDDTALNQLNAKMNSALACASQGRMFNGSSCVTAAVALPAAPRLQCRVASHVGPGPHYASRAQCNSDEIMTGGGGQSETEGTNLCSGLGSSFIHATVPSGNGWAVDGYRPGGGDACTIAYAICCKIVN >NZ_CP040641.1|WP_142781093.1|1465433_1468184_-|hypothetical-protein MKVSTRVSNTRKGHYARILMAAALLAAFSPAAWAQYNTLGRDFTVRTGTTSATSVERIRVTQAGLMGIGHTNPSYTLDISGTARATRFIGDGSGLTNLPGQNIISGTTTMVQGWPDAIVCTLQNGSNGTDTRVFHLSFAPFYTGQYFYRLNEQTVVVNGPTSGGVSTQIGFTASGSYASFDTTYTSYTSAGTCANKTISQLYTEGKAFNFIGNTGMGDAGGLGYAMTSGTLSVTANTSGIVSLTTAGTTWGYLGSNGSYLPKLNTDNISATTINGVPVSSLGSGASPTNVPAFRAHRNGTDQSLPTSTYTNITWTTEEFDTYSNFSTSTGRFTPTVAGYYNIHLSIGCLNLASTNACVARILKNGAAVTHSNVRSPQFDVTAHSSVIVYLNGAGDYVTAQALSEASSASLTGNGANTYFEAALIASGNGLVSGTGASALSAMSDVTLASPATGEILTYNGTKWVNSTPTTNPTISGTTTMMEGWPDAIFCSDNSYGSDYLFHTGIRGTNHVYAPTWETQNGNFIITYNPSGGYVSGSAVIAGTCANKSISQLYTEGKAYNFIGNSGANGNSDRIASGTTSIVAETGGLIRINTSGVNTAYFDTVGRLVVPGISTTGIISGTGGYFSGNLGISGRLDVSNPANTTLQVMATNSGVRGGMSSDIPSGSAFYMGSYSNSPLALGINNSERMRIGTDGRVGIGTTSARAALDSPNGGIFNHISIGVCAYGPCPGPENQEYPYETIQLDPGNNLRISFGEYQPFFFGNNGHALKPGGGSWNAFSDKRLKDLDGTYPRGLKEIAALEPVRFHYKKNNQMGQPSDREFVGLIAQDVQPHFPEAVSKEKDGYYRLDTTPISFAVINAIKELKAENEQHRKANVQLQAENNRLQASNDNLARELRTFRSEYEAFKAKITSVVVIE >NZ_CP040641.1|WP_142781094.1|1468574_1469468_+|helix-turn-helix-domain-containing-protein MAGVKRDKLALVFDVNKNSSEVEHSRASSWRGFSVEFIDLSGLKGYEFRGGNPKHHYLAYHDLIRADGEWQVGGEPASNRKDIREMITYIPKQLDFKGWVTLEQRQNSIVALTFDPHLIGRELEILFPVQMQSPHVYFKNQNIQSTMLKIGSLLKRASSYPSMYMETLGLSAVLELAMVLTNETFTQKRGGLSRSQELLVAEYIKVNLTKDMSLDELASLVQMSRFHFSRSFKETFGESPIRYINKERVTFAKSVLLTSRTPIGEISETLGFGSIQNFIKTFREITGVTPLEFRRTS >NZ_CP040641.1|WP_142781096.1|1469574_1470027_+|hypothetical-protein MYSIVFPEVVLPHGVSLWKACTSPRDYIANPKAVAAQTFKSPQMDVPYHDSEVWSEVASLFAVQYPRTAAVDISFETHLDCPWIAFPWAICKGLASRACLVWVDDRGKRRLVRHIRLYPNGDHLEEDHRWLEHAVRKAIPSSLIVYVTSE >NZ_CP040641.1|WP_142781608.1|1470032_1470773_+|AAA-family-ATPase MPAPFLRRLSYAPPGEEKGFPFNVPLFTREFEIAFERPITIFCGENGSGKSTLLETIAKGCGFNPGGGNAHVYASRDDLNDLVESCRFAWLPKTSKGFFFRAESFFNYATYIDDLARQFGSRQSYRPYGGKSLHAQSHGESFLSLFAHRIGGKGVYIFDEPEAALSPMRQLAFLALLREILRSGDSQIIMATHSPILLGYPDSQLLQIADGAIEPTTLRETEHYIVMRRFLEEPDRYIGDIFSDDL >NZ_CP040641.1|WP_142781097.1|1471222_1471516_-|hypothetical-protein MPNVSTTAGDKQAVENQPLPKRKEWKGLYPKVTVRLNGPLGDVVDELQEATHAASPSDVVKRALVIYHTLVKQKLAGNEPYIEQKEGDTTKRIPIFL >NZ_CP040641.1|WP_142781099.1|1471771_1471999_+|hypothetical-protein MDPEFEKLVDPKVQAMVERYVKPEPWYSKLLWGVLGSLVASALIAIATFVYSEGSACHVATPQANARIAETDKKI |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
742169 : 762379
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP040641|742169:762379|DBSCAN-SWA GATGGTAAATCCTTATTCCGATGAAGAACGTTTGCAGGCAATGGTAGCGGCCAGCTATCGCGCCAGCCGTTCGCATTTCAACCACCTTCCACTCCGCCACATCATCAACCCGCCTGCCGAAATGTTCGATGCGAAGCTCGCACGGCAGATGGCGATCTACGTGCTTCACGTCGATTTCGATGTTCCGCGCCGCCGCTTGGTCGTGCTCCTGGGCGTTGCCCGCTGGACTGTGATGCAGGCTGTCCGCGTCGTTGAGGCACGGCGCTTCGAACCGCTCTTTGACAAGGCTTACGAGCGTATCGCCGCCCGCGCCAAAGACACCTTTATGGAAATGCTCTACGAGGCCTCGGCCGGACAGGAAGCCTCCCATGGCTGAGTTCATCCGCGCCACCCTATCAAGCATCTTTGTCGGCGAGCGGCTCCGCCCGATCGATATGGATTATGCCGAGGCTATTGCCGCTTCGATGTCTGAACACGGGCAGATCAGCCCGATCATGATCCGCAAGACGCCTGCAAAAAAGGGAACTCCTTTCACGCTGGTCGCGGGCGGCTACCGCATCACCGCTGCAACGCTGCTTGGCTGGAAGGAGATTGACGCCATCGTCGTCAAGGCTGATGCCGTTGAGGCGCAGCTGCTCGAAATCTCCGAGAACCTCTATCGCAATGAGTTGAACCCGCTTGATCGCGCCATCTTCGTCATGAAGTACCGCGAACTTTGGGAAGAGAAGCATGGCGAGATCAAGCCCGGCCGTCCTTCGGAGAAAAACCGTAACGATTACGGAATTATCTTCTCCGGCGGGCGGGAGCTTTCCGAACGCGTAAAAGAGCGCTTCGGCTTCGGTCAAAGCACCTATGAGAAGGTGACCAGCATCGGGAAAAATCTCGATCCGGTGCTGAGACAGGCCGTGCGTGGCACCTCGGCGGAAAACGATCAGTCGCAGCTTTTGACGCTGGCGAAACTTCCCCGTGAAGATCAGGTGAAGGTCGCGGCAGCGCTGAAGCACGAGCCGGACGTGAAGAAGGTTCTTGCCTTCACCAAGCCCCCCGCTCTGGTCACCCCGCCGCCCACTCCTTCTCAATCAATCATCCTCACCAAACTGATCGCCGCCTGGGACGAGGCGAGCGAGGAAACCCGCGATAGCTTCCTTGAGCACATCGGCATGTCTGACGCGCCGGATGCCCTCATGGCTGCGATCCGCGAGGAGGCTGCATGAGCACGAAACGCGATCCCAACCAGATGGACTTTTTCAAGGAGACTGTTTTCCCGGTGCGCTCTGCGTCGGAACGTCTCGATATCGACCGCTTCCGTTCGACCCTGAAACGCGAAATGGCCCGTGCCATCCGTGAATGCCAGTATGACCGCGACACGATCGCGGCACGCATGGCTTACTATCTCGGCCTGGACAAAGTCTCGAAGTCCGCTCTCGACAGCTACACCGCCGAAAGCAAGACCGCCCACGACATCAGCATGCCGCGCTTCAAGGCGTTCGTTCGCGCCACCAACGCATTCTGGCTTTGGGATGTCGTTGTTTCCGATGACGGCCTGTTGCTGCTCGAAGGTGATGAAGCCCGCCTCGCGGAAATGTCCCGCATTCGCCAGGAGCAGAGGAAGCTCGCCAAGGAACTGAAAGTTCTTCAGGCGACGCCAGTGCACATTCGTCGGGTGCGCAAATGAAGAAAGAATGGTTCACATCCGCCGAATTGGCCCAGGCAGCTTTGCCGGGTATTCCGAGCACGCGCCAAGGGCTTGAGTTGTTTATCGCTCGCTCTGGCGTGCGTTCCACCGTGAAAGCCCGCCCGAAGGCCGGACAAGGCGGCGGGTTTGAATACCATTATTCCTTCCTGCCTTCGGTTGCGCAGGCGAAGCTCGCATTTCTCAATGCGGAGCCGACCGATCCGCGCCCGACGAAACTTTCGAAGATGCTTTGGGACCGTTTTGAGGCCCTTTCAGACGCCCACAAGGCTATCTGCAAGACCCGCTTCGCCGTCCTGACGGAAGTCGAGGAACTGCGGACCTCGGGCATCAGCATGAAACATGCCGTCGCCCACGTCACCCGCAGGGCCGATATCGTGCCCGCGACCTATTACGAGTGGCGCAAGATGGTTGAAGGCCATTCCCGCCAGGACTGGCTTGCCGCCCTCGCCCCATCTTTCTCGGGAAGCGCCAGCGGCGAAGCCGCCGAAGTCACTCCCTGCCACCCCGAGGCGTGGAAAATCCTGAAATCCGACTTCCTGCGGCCGGAACGTCCATCCTTTAGCGCCTGCTACCGTCGCATGATGATGGTTGCCCGTGACCAGAACCTGTCGCCGATCCCTTCGGAGCGTTCCTTGCGCCGCCGCCTGGATGCGGAAGTGCCGAAGGCCGCGCAGATCATCGCTCGCGAAGGCAAAGATAAAGCGAAACAGCTTTTCCCTCCTCAGAAGCGCACCGTGGCACATCTGCACGCGATGAAGATCGTCAACACCGACGGTCACCAGCTCGACCTGTTTGTGCGGGCACCGTGGTCGGAGACGCCAGTGCGCGTGATCCTGATCGGTATTCAGGATGTCTATTCGCGCAAGGTTCTTTCGTGGACGCTCTCCGAAGCCGAAACGTGGGAGGCCGTCCGCACCTGCATCGGTTCGATGATCGAGAACCATGACGGCATCCTCCCCGAGCACATCTATATGGATAACGGCCGTGCCTTCGCGGGCAAGATGATCTCGGGCGGCGCAAAGACCCGTCACCGCTTCAAGGTCAATGAGGACGATGTTGCCGGCCTTTTGAAGACGCTCGACATCGAGCCGCATTTCGTGAAGCCGCGTTCCGGTCAGTCCAAACCGATCGAACGCGCCTGGCGCGATCTCGCCGAGGAAATTTCCAAGCATCCGTCTATGTCCGGTTGCTATACGGGCAACCGGCCGGACGCGAAGCCGGAAAACTACGGCAATAGCGCCGTGCCGCTCGAAACGCTCCAGCGCCACGTTGCGCAGTGCGTTGACGAGCATAATCACCGGCTGAACCGTACAACGGAAACCGCCCACGGCCGCAGTTTTGCGCAGACGTTTGACGCGTCGATCGCCGAGCCGTCCACGATCGTCCGTTATGCCAGCATGGCCCAGCGTTCGCTCTGGATGCTCTCGGCAGTCGCCATAACGGCCCGCAAGCCGGACGGCGCGATCCACATGCACGGTAACCGCTACTGGCACCCCGTGCTTAACGAGTGGATCAGCAAGAAGCTGACGGTTCGGTTCGATCCAGCCGACCTGCACAAGCCGGTCAAGGTCTACGACCCGGAAGGCCGCTTCCTTTGTGACGCCAACTGCCTGGCTAAAACCGGCTTTGCCGATACGGGCGCTGCCCGTCGCCAGGAGAAGGCACGCAAAACGCACGTCAAGAACCTTCAGGCGGTGGCCAAGAGCAATGCGGCGCTCTCCCCGATGCAGCTTGGCGAGATCATGGAAAAGGGCCGCAAGGCCGAGGCTGCGAAGCGTCCGCAGACGCCGGTTCGTCCGGTCGTCACGCGCCTTGTTACCGGCAACCTTGCACATGCCCCGGTCGAGGCGGTCAGCGTCGATCATTTCGAAGACAGTTTTGCGCGTGGCCTCGCCCGAGTGGCGGGCGGGGAAAGCGCAATCATCCAATTCCCCACGGGGAATACCGAGGCAGGCGGCAAGCCTGCCCGCAAGAGAAGAGCCGAAAAGTACTGAGTACGGTTCCAGTCCAACAGGGCGAAAAAAAATGAGCGACCCAAAGGCCGCCCCACAATTGAACAAAGGAACCTTTATATGAAAAAGACGACCAGCACAAATACCGTCTGGGAACAGTCTCAACCGACGATCGAGTTTACCGCCAAGCATCCGGCTTCCGACGTTGCCGAATGGCGCAAGCTGACGGCGCGCACCGTTGATGTCGCCGTCACTTACGGTTGGACGAAAGCCGAAGTGTCCCGCCGCTCGGGCGTTCCGGACGGCACGTTTTCCCCTTGGTTTAGCGGAAAGTACCTCGGCGTCCTGGCGAACGTAAATCAGCAGTTGGCCAACTGGCTCGATGCTATCGATGCCAGCCAGAACATGGCCGCTATCATGCCCGTGTCACCGCCTTTCCAGCGCACGACGGTCGGCATGGACGTTTACAACGCTCTTCTGTTTGCCCAGGTAACGTCCGGTTTCGTCCGCGTCACCCTGCCCGCAGGCTCTGGCAAAACCGCCGCTGTGGAGCACTTTGCCGCCACTCGGCCACATGTCTTTAAGGCGACGCTTAGCCCGAGCACCAAGACGGTTCACGGCATGCTCGTCGAGTTGTGTGCCGCTTTGGAGGTGCACGAGCATAATCCCGCAAAATTCGCCCGCGCCCTCGGTGCGAAGCTGAAGCGCGTTGGCGAAGGCTCGCTGCTCATCATTGATGAGGCACAGAACGCGGTTCCGGACGCGATCAACCAGCTGCGTCACTTCGTGGACAATGATCATTGCGGCGTCGCTCTGATCGGCAATGAGGACACCGCGACCGCCTTTGTTAAGGATCAGGGTCGTTCAGTTGCCAGCCGTGCGCAGGTGCTTTCCCGCTTCGATCGGCAGGTCCGCACGGTTCGTAATCCGATTGCGGACGCGGAAATGCTGATCAAGGCATGGGGCGTAGAGGAAGGCAGCGATTGCGCAACCTTTCTAAAAGGTCTCTCCCAGAAGCCCGGGGCGCTTCGCCAGATCGACCGCACCATGAAGGCCGCTTCCATGCTCGCCATCGGCGACGGCGAGGAAGGCGTTCGCCTGGAGCACCTTCAGGCCGCCTGGAAGAACCGCGACATGGGAGACAGCCTATGACGCCGGAAAACCCTTCCCTCAAATCTCACCTGGATTATCTCGCGAGCATCTTCGGGGACGCCAAAAAGATGAAGCCGGACGAAAAACTGGAACTGGATGCGCAATCCGCCCAGACCGTCCTGAAAACCCTTCGCGCCCTGTCACAGCAGGCCGGCCATCTTGAGCTGGAGCTTTCCATTCTGCGCGACAGCGAGGCCGGGAAGCTGCTCGCCAAGACCGCCGAGCAGCTCGCAACCGGCGAACTCACCGGCCTTCTGAAAAAGGCCGAAGGCAACATCATCCGCCCGAACTTTGGAGGAAAAAAGAATGACGGCGAAGCCTGACTGCGTTTCCGATTATCTGCTCAAGTTGGCGCGCGATTTGAACGGCGTCGTCAATGAGCGTGGCACGATCAACCTTGACCGGGTGACCTCTGTGAACGTCATCCTTCACATCGGCCGCATCGCCGAACTCGCCCGCAAACTCGAAAACGCCTGGTCACAGGCAGAGTGGAACCGGCGCGCTTCACAAGACCGCCTGTCGCTGCTGACCAGCATGAACCGCGTTACGGCTGAGGTTCTCGGCCTGATGCGGCCCGACACCGAGGACGGCGGTAACGTCGTCCAGTTCCGCCCCAAGCCCTCCAATTCCCCTGCACCTTCCGCGCCGTCTGGTGGCGACGCGGCCTGATCCCCCTTTCACATTAATTTATGAGGTTCATAACCATGGAAGCTGTAATTCTGGAAGAAAAGGCCGCAGCCGGCATCACTGTCGTCAACGGTAAAGATTATGTCACCAACGCGGATGGCGGCCTGACGCCGCTTGCCCTGGTGAAGGCAGAAGACTTCCTTGAAGATCAGATGGTTCGCAAGATCATCGGTTTCGCCAAGCCGCTTTCGGCTGAGCTGGCCCGCTTCAAAAAGCACACCCGCGCCGATATCGCCGAGTTCGACCGGAACCTTGAGGCCAAATACGGCCTCGTCAAACGCGGTCGCGCCGGGGCAGGAAACCAGAAATACCGCACGATCGACGGGCTAATGTCCGTCGAGACCCGCGTGAACAAACTGATCGAGTTCGGGCCGCAGTTGCAGGTCGCGAAAGGGCTGATCGACGAATGCCTGAATGAGTGGACCGAAGACGGCCGTGCTGAAATCCGTGGCCTCGTCACTCGTGCCTTCAATGTCGATCAGGCAGGCAAGATCAGCAAGGATGCCGTCTTCGAACTGTTCAAGATCGAAAGCGATGATCCGCGCTGGATGAAGGCGATGGATGCCATCAACGCGGCCGTGCGGGTAATCGGCTCCAAAGAGTACCTGCATTTCAGCTTTCGGGAATCGCACGATGCGGAGTGGGTCCGCATTTCCCTCAACATTGCTGACGCGTGAGGACAGCACCATGGCACGAATTTATGTCGCGTCCTCATGGCGCAATCCCCATCAGCCGGCAATCGTCGACCTTCTCCGGACAAACGGCCATGAGGTCTATGACTTCCGCAACCCTCCACACAACACGGGTTTCTCCTGGTCTCAGATCGGCCTCGCTGTACCTTGCTCTGCCGAGGACTATCGAAACGCCCTTTTGACGCACCCGCGTGCGGCGCAGGGCTTCATGTCCGATTTCGCGGCTATGCGTTGGGCTGACACCTGCCTTCTCGTTCTGCCCTGCGGTCGCAGCGCTCACCTTGAGCTTGGCTGGATGGCCGGCGCTGGTAAGCGCACCCTCATTCTCACACAGGACGGCGAAGAGCCGGAGCTTATGGCGCTGCTCGCTGATACGATCTGCATCAACGCCGGGGAAGTCCTAGCCGAACTCCGGAAAGGCGGTGCGGCATGAGCGAGCTTCACCTGTACGACAATCTTTTTGATGCGGAAGCCAGAGTTGCCGCCATCTACATGATCGAAGCGCTTGGTGACCCTGACAGCCCGCCAGACTGGCTGCAGGATTTCGTTGACAATACCGATTTGCCGGATGTCCGCGCTATTCTTGAAGCCCATTCAGAGCTTGCCGACGCGATCGATATGAACAAGTTCGAGGACTGCAAGGAGCAGGCGAGCGCGCTCATGGAACGCTGGTCGCTAGTCGGCCGAAAAGGGCTGATTGTCAAAGCTGAGATATGCGTTCGGCGATATCGGGCTGGCACCACAATCTTCTCCAGCGGATGGGGCCACCTCAGATGGCGATGGTTTATCGTTGATGAGTTGGACTTGGTTGTTCCAAAGCTTCTGGGGATCGCGAGCGGGCAGCATCATGTATCAATGCTCGAAGGCCAAGCGGCATGACCAGCATCTACTTCTCCGACGCGACCCTGAAATCCTTCTCCGCCGCCACCAAGGGTGGAAAGTCCACGATCAAGATCGAGATCGAGACGGCCGATCGCTACCAGATGGCCAGCATACTCAATCAGCTCGATGAGATCGAGGCTGAACAGAAGGCAGCGAAAACGCCTCGCAAAGCCCCCTCCGGAAAGACGGACGCGCCCTTGTTGGCGCTTCCGGCTCCCGTGAAACAGATCAGCTATCATGGTGATGATCATGAATGACCCTATCGCCAAGGCGAAGGCCGAAGAGGCCCGCCAGTCGCAAATCCTCGCCGATGCGATCCACAAGGCAATTATCGAGACCGGCGAGCAGTTCGAAGTGCCAATCCTGAATGCCGTTGGTGGTGCGCTTGCCACCAACATTGCAGAGGTTCTGGCCTCAATATCCGACCGCCGTCACCGCAAGATGTTCCGTGACCAGCTCGATCGCGCCATTTCTCTCGCGCTCGCTCAGGCCGCCACGCGTCCCATGGCGCCTGTCGAAACCGTCATTGTCGGAGGTGTCCGCCAGTGACGAGTAAGCGACAAATCCCTGCGGCCTTCACCAAGGGCTACGTGCTTTGTTCTCCATCGGGAAAGCTTCAGCCGAACACATGGAGCGAGACCGCCGCCAGGGCGGTCGCATCCAAATACCGCAAACGCGACACCTGGGAGAAGGCGCAACGCCGGGGATGGTCCGTCCAGTTCGTCTATGTGCGGTTCTTCATCCCGGTTTTCAAAGCCACCTTCACCACAACCGAAATCAGCGAGGCCTACGATGCCGAGAACATTTGAACCCGATCAGTTGCTGACGGCGCTGATCGACGCATTTCTGAAGGACGGCCATTTCGTTCACGCGAGAAACGGCAAGATGTTCGTCCTGGTCGTCACCGAGGAAGGCGGCGAAGACAAATCTTCCGAGTTCTGCCTTTCCGACATTGCAGAGCACGCGGCACGGAGGATGTCGAAATGAGCAAGACCATCGCCGCCATCAAGATCGAGCAGAAGAAACTCGGGCTGGACGATTTCACCTATCGCGCCAAGCTCCACATTTTGACCGGCAAGACATCCACGACGGAAATGACCGAGGCGGAGCGGCAAAAGGTTCTCGTCAGCTTGCGAGGCAGTGCCGCGAGACCCGCGCCGGTTCGCCAGGACGGCCGGGACGGCAAGCGCAAGCTTTCCGGCAAGTATCTGCCGAAGATGCGGGCGCTGTGGATCGCCTGTTACAATCTCGGCGTGATCGACGATCGCCGTGACAGCGCACTGGAAGCCTTCGCAATGGGCAGGCAATTGCCGAACATTTCGGATATGCGCTTCGTTCACAAGCCAGAGGACGCCGTCAGCGTCGTCGAGGCGATGAAGGGAATGCTGGCGCGAGCCGGTGTTGTCTGGGCCGATCGCCTGCCGTGCGAGCCTTACGAGAAAAGCCCCGGCTACAAGATCGCCCGCGCACAATGGGCGATCCTGCATCCAGCCGAGCCGAACGCGTTCTGGCAGGCCGTGACCCACATCGTCACCGAAAGCATCAGCTACAGAAATTTGAGCGATGCTGAGTGGATCACGGTGATGAACCATTTCGGGCCGCAGGTTCGTCGGCTGAAGAAGGCTCAAATGTGATGACCGGGAACATCGCTCCCCTGAATGGCATGCCGCTCTTTGGCTGGCCGGACCAGCGGGAGATCGATGTTCTTCAGAACCAGCGTGACCAACTGGCGGAAAGGATCGCCAAGCTGCCGCGCTTTTCCCACCGACGTATCGAGCTGGAAGCACGCCTGCGGGCGCTGACCGAAGAGCAGCTCATCATTTCGAACAGGATTACCCGTGGCCGAAGACCTCACCCTTGATCTGTTATCGACGCTCGGCGAAGACGGTTTCTTCTCCCTGGTCGAAGCGCATGCCGGCGTCAGGCTTTACGTTCCGTCTGATCCGGAGCGCAGCGAACTTTCTTCGACGATCGGCGTTGATGCTGCATACCGTCTAGCCAAAGCATATCCGGGCGGATATATAAGGGTGCCTCTGGCACGCGAGTTTCGCGCCCGTCGTTATGTCGATGCCGAAATGAGCAACCGCGATATTGCCAAAAGGCTCGGTTTGACCGAAAGCGGTGTTGAAAGACTTTTGAAGCGCGCCAGAAAACGGGAGCCGCTCAAGTCCAGGCGAAAGACAGACCCCCGCCAGATGGAAATGTTTTAAAGGCCCGCCCGCCACGGCGGGCCTGATTTGTTTCAGGCCCAAATTCTAATTTGCCCCCATATCGGCCCGCGCGATCTCCGCCAGTTGCCATCATGACGGGGCTTTCATGACCACCCAAACTTTTGACGAATGGCTGATCGCCCGCCTTCGCGTTGCTGGCGCTTATGGCGGGACCATGGACGGCGTTCATGGCCGCGAAGTGATCGCGGCGCTTGAGCGCTTCCAGGGCGCTTATGATCTGCCGATCACCGGCCGCGCAGATCAAGTCACCGTTGATGCGCTCCGCAAGGTGCAGAGCAAAAACCCGAACAGCAATCTCGTTACCTACGAGAAGGTGCCTGTGCCGGCCGAACCCGTATGGATGCGCGAGGCGCGCCGCTACATGGGACTGAAGGAAATCGCCGGCCCCAAATCCAATGCAACCATCATGGGCTGGGCCAAGAAGCTCGGCGGCTGGATTGCCAGCTTTTATACCGACGATGACATTCCCTGGTGCGGACTGTTCGTCGGCAACCTCATCGCCACCACTCTTCCCAAGGAGGCTCTTCCCGCCAACCCTCTCGGGGCGTTGAACTGGAAGAAGTTCGGCGTCGAAAGCCGGATTGCGCGCGGCGCTATCCTCGTTTTCGAGCGCAAGGGCGGCGGACATGTCGGCTTTTATGTTGGTGAGGACCGGACGCACTATCACGTCCTCGGGGGCAACCAGGACAATTCCGTTTCGATCACGCGGGTGGACAAAAGCCGCCTCGTGAGTGGTGGCGTCCGCTGGCCGAAGTCCGCTGACGCGCCGATCGCTGGCAAGGTCGAACTGTCGAGCACCGGCGCACCGGTCTCGAAGAGCGAGGCATGACGCCGATGAAGCCAACCTACAGTACATCCAAGCGCTATCTCTGGGGGTCGTTCTGGGCCTCGTGGGGAGCGATTTATCTTCTCATCGCCGGGGCGCTCGCTGGCTCTTCAGAAGCTACTGGCATGGCGACGATTGCACTGCCGTCGCTCCTGACGCTGATCGCAACCGTGCTCGGGGTCCATCGCCACTACGGCAGCAAGGATTTCGAAGCGGCCGCCCAGAGTGAAGCCGTTCTGCCTTCGCCACCACCTTACCTGTCGCGCGACCAGCCTGCTGCCCTGTCGGAGACGGTACGATGATTTCTGCCTGGCTAACGAAGGCGGCAAGGCCTGTCATCATCGTGCTGCTCCTGATCGGAGCTGCCCTTTTCCTCGGCTGGCTCACCATCGCCACCGTCAACGGCATGGTGGAACGGGCCGTCAGCAGAACGAAAGCCGAGAGCGATGCCCGTTGGACGGCCGCGATCGAGGCCGCCAACACCAAAGCCGCCAGCGCCGAGGCGGCTCAAGCCCGTTATGCGCTCGACCTGGAGCGCAGCACTTCCGCCAAGATCGATGCGCTGCGAGCGCGAAATGAAGAACTGGAGACACAGAATGCGGCTTTGCCGAATGGCGATGACTGCGGCCTTGGCCGTGATCGCGTCCGCCTGCTCCCCCACTGAAAGCAAGGCTCCGCTCGTGCTTCGCACCGTCAAGCCCGCCGTGCCGCCCGCGTCCCGTGTGCCCTGCGCAGTTGGCGATCTGCCGGATCGTGACCTTTCCCAGCGGGAGGTCGCCACCCGCTGGAGTGCCGACCGAACTGAAATCCTGTCCTGCGACGCGCGCCGAGCTGCTGCCGTCGCGGCGATCGACAACATGCCGGAGACCTCACCATGAATTTCGGTGGAAACGCCGCTCTCGATTTGGCGGCCGAGCGGGCCGAGCAAGAGCGTGAGGCCGGGATTGCCGCCGCCTCGCGCTCCCTGCGCACACCCGGCACGATCGAGTGCGAAGACTGCCCGAACGACATTCCCCGCGAGCGCCGCATCGCATTGCCATCCGCCACACGGTGCATCGTCTGTCAGACGAAGTTCGAGAAGGCCCGCCGATGACGCCGACCGAAATCATTCCCTGGCTGACGCTGGTTCTCTCCAGCCTGGCTGTCCTTGGTCACCTCAAGGGCTTTTTCTCCAGCGGCGAAAAGAAGCTTGAGGCTGACATCAAAAGCGGTCGCGAGGAAATCGCGCTCTTGGGCAAAGACGTCGAGGCCCATGAAACCAAGCTGACCAGCCATGACCGCCGCATCCAGGCGATCGAAGGGGAAATGCGTCACCTGCCGGACCGGGAAAGCCAGCACCGTCTGGAGCTTGCCCTTGAAAAGGTGAATGGCCGCCTCGACACGCTGAACGAAACCCTCAAGCCCATCAAAGCCACGAACGAAAGAATGAACCAGCTACTGGTGGAAACGGCAGGCAAACAATGAGCATCGGCATCGACTATATGAAAATCATGCGCGAAGAAGCGCGGCTCATCATCTTGAGGGCCTTGGCGGAACAGGTAAATGAGAGCCTGAGCAGCTCCATGCTTGAGCCGGTCCTTGCCAATTTTGGCATCAACCAGGAACGGCCATGGGTGCATCAGCAGATCGAGTATCTGGAGACCATGGGCGCTGTTGTCGTCGTCAGTGCCGGCACTGTCAAGATTGCTTCCTTGACCGATCTCGGCCGTCGCCATGTCGATCGCCAGTCCGCCATCGAGGGCGTGAAGCGTCCGTCACGCGTGGGTGCCTGACATGGCGAAAGCACGCGGCCGTCTTTCAGCAATTGATCTCCTGCCCGAGGAATGCAGTGACGCGATCTCCTGGGCATCGCAGGAACTTGCCGACCGTGATCGAAGCCAGCTCGACATCTATGCCGAGTGGAAAACGAAGCTGATCGCGCTTCAGGGCGAAATCGGTCTCGATTTCGACATCCCGTCATTTTCGGCCTTCAATCGCTTCGCCATCCGACTGTCACAGATGACGCGCCGGCTTGAACAGACCCGCGAAATCGCCGCTACCATTTCCGAGCGCATGGACGCGGCCGGTTCCGACGATCTGACCCTGATCGCGGCCGAGGCGATCAAGACGTTGATTTTCGAGCTGCTGCAATCGGCTGGTGATGCGGGCATTTCGCCGAAAGGTGCCATGGAGCTGGCGAACGCGCTGCGCGCTGCCTCGGCCGCCCAGGTCACGTCTTCGAACCGCCGTCTGAAGCTCGAAGCTGAAGAGAAGGCCCGACGCATTGAGGCCGACATGAAGGCGAAGGCGGAAAAGGCGCTCGACGTTCTTTCGAACGAACCCGGCATTTCGAAAGAGGCTATCGCCCGCGCCCGTCGCGAGTTCCTCGGCGTGCGGCCGAAGGCGAAATCCGTTCCCGAGGTCTCTCCGGATGTGGAGAATAAGGATGGTGACCAATGACCGTCACTCTTCCGAAAGCCCCCTGCACCAGGTGCGATGGCACAGGGCGGGCCATCAAGCGGATCAACCGCCGACGTGACGGCACGATTTCCAGCACCGTCTATGACCTGAAGAACGATTGCCGGTCCTGCAAAGGGACCGGTCTTGCATGCATGGAGGAACAGCGTGGCTGAAGCCTTGCCCGGTTTGCCGCAAGGCAAGTGGACCGACCCGCCAGTGCTTCCGATCGATCCGGCCAAGTTGCCGGACGAATTGCCTCGTGGTGCCGATATTCCGGCTGATCTCGATCCATTGGCCGAAGGCGTCCTTATGGCGCATCAGGCCGAATGGATTGCCGACGACAGTCTGCTGAAGGGCTGCGCCAAGGGCCGCCGAACCGGCATCACCTTCGCCGAGGCGCTGGATGCAACGCTGATCGCCGCCGCCCAGCGATCGGCAGGCGGGCAGAACTATTTCTACATTCCGGACACCAAGCCGAAGGGCCGGGAGTTCATCGGCTATGCGGCGCATTTCGCAAAGACCGTCGCCAAGGAACTGCTGACGATCGAGGATGGCATTTTCTTCGATCAGCGTGACGACGGTACCACGAACGCGATCTCCAGCTATATCATCCGTTTCAAGTCCGGTTTCCGCATCGAGGCGCTTTCCTCCCGGCCGGAAAATATCCGTGGTCTTCAGGGCACGGTCTGCATCGACGAAGCAGCCTTCCATCGTGATGTGCGCGCCGTCATTGATTCCGTCGCGGCACTTCTGATCTGGGGCGGCAAGGTCCGTGTGATTTCCTCGCACAACGGCGTCAGCAACCCGTTCAATGAACTGATCAAGGAAGCCGAGGCCGGAAAGAACGGCTTCAACTTCCACACCTTTACCTTCGGCGATGCCGTAAAGAATGGTCTGTTCAAACGCGTCTGCCTGATCAAGGGGGAGGAATGGTCACAGGAGAAGGAAGACGCCTGGGAAGCGAAAATCCGCTCGGCCTATGGCACGCGCACTTCCAAGATGAAACAGGAGCTGGACGCGATCCCGGCCGAATCGGAAGGGGCGGCATTGACCCGCGTCCTGATCGAGCGCTGCATGTCCGCCGATCTACCAGCCGTCGTGCGGTGGGACCGGCCGGACGAGTTCAAGAACCTCGACGATTTTGAGCGGGCCGAACAGGCTGACGAGTTCTGTGAAGGCCTCTTGAAGCCCCTTTTAGACAAGCTCGACAAGGACCGGGAACATTGCTTCGGGGAAGACTTTGCCCGTTCTGGCGACAAGACGGCAATCGTGGTCTTCGAGATCGGGGCCGATCTCATCCGCCGCGCCCGCCTGATTGTCGAGTTGAAGAACATCCCGTTCGACCAGCAGCGCGACATCCTCTTCTACATTGGCGATGCCTTGCCGCGTCTCATCGGTGGCGCGCTCGATGCGCGAGGCAACGGCCAGTACCTTGCCGAAAAGGCCCGCCAGCGCTGGGGGGAATGCATCCACGAAGTGATGACATCTGCCAAGTGGTACGCGGCCAACATGCCCGGTTACATCGAAGCCTTTGTCGATAAGAGCCTGCTTTTGCCGAACGACGCCGATGTTCTCGCCGATCACCAAGCACTCGCCTACGTCAACGGGATCATCAAGGTTCCGGATGAACATTCGACGAAGGGCGCTGACGGTTACGATCGCCACGGCGATACCGCGCCGGCTGGCGCGCTGGCGTGGTTCGCTTCCAACCAGGAGGCGATCGCCTACGAGTACGAGACCAGCCGCAAGGCTATCAATCCGATGCAGGGCCACAATGGCGGCCCGCCGATGCATGACGACGATCGCCGTGGCGGAACCGTCAATATCTACCTGAGAGGATCGCTCTAATGGCGAAAAAGAAGAAGCAGAAGATTTCTCGTCACCTCGCCAGCTCCATGAAGGATCAGGACGGCAAGGTCGTCAGCGTTGCCGAGCTGACGGAAGAAGTGGCCGGTGCACAGGTGGGCGGTGTCCGCCAGTGGATTTCCGGTCATCCGGCCGATGGCATGACGCCGATCAAGCTGGCATCCATTCTGCGCGCCGCCGACCAGGGCGAGGTGGAAGCCTATTTCGAGCTGGCCGAAGACATCGAAGAACGCGATAGCCATTATCTCGCGCAGCTCGCCACGCGCCGTCGCTCAGTTTCGCAATTGCCGATTACTGTCACGCCCGCGTCGGATGGCGCCGAACACAAGAAGCATGCCGAGTTTCTGCGCGAGTGGATCAAGACTGGCGTGTTGCGCTCAAGCCTCTTCGATATGCTCGATGCGATCGGCAAGGCAATTTCCGTCCTGGAGATAGATTGGCACCACAAGAACGGCAACGTGCTTCCGCGCGCGATGATCTGGCGGACGCAGCGCTGGTTCACGTTTGACCGGGCGGACGGCGAAACCTTGCTGCTACGCGAGGGCGCTGCCGGTGAGCCCCTCATTCCACACAAGTTCATTGTCCATCGCTCCAAAGCTAAATCAGGTCTGACCATCCGGTCCGGTATTGCCCGCGTCGCGGTCTGGCTCTGGATGTTCAAGAGCTTCACCGTCAAAGACTGGGCGGTCTTCACACAGAACTATGGCCAGCCAATCCGCATCGGCAAATATGGCCGTGGCGCAACCGAACAGGAAAAGGATGTCTTGTGGCGGGCGGTCTCCGGCATCGCCGGCGACTGCGCGGCCATCATTCCCCGCGAAATGCTGATCGAGTTCCATGAGGTGGGTTCGAAGAGCAGCTCGACCGACATGTTCGAAAGGCGGGCGGACTGGTACAATCGCGAGACTTCCAAACTGATCCTCGGCCAGACGACAACGACGGACGCTGTTTCCGGTGGTCATGCAGTGGCCAAAGAACATCGCCTTGTCCAGGAAGATATTGAACGTTCCGACGCGCTTGACGCATCCGACACGCTTAATGCGCAGCTCGTGCCGAACATCATCGCTTTCAACTTCGGCCCCCAGGACGAATATCCAACCATCCACATCGGCCGTCCGGACGAAGTGCCGCTGAAGGACTTTGCCGAGGCCTTCGACAAGCTCGCGAAACACGGCTTGACGGCTGAGGCCAGCTTCCTTCGTGACCGCCTCGGCATTCCCACACCTGCGACGGATGCCGAGCTGGTCGGCGGGCGTGTCGAGACGGTCGTTCCGCCCGAGGACAGGCCCCAGCGAAAACCGCTGACCGCAAAGCAAAGCCTCGATCGTCTATTCGCGTTGGCGCATTCCCGCGATGAACCGGACCTGCTCGAAAAGTTGACCGATCGGCTGGAGAAGGATGCGGCAGCCGCCATGGATGGCATGATCGACGAGGTCCGCGAGATCCTGTCCACCGCGACGGATCTTCGGGACGCGGCCCGCAAACTCGCGGATCTCGATCTGTCGGCCGAGGATCTCGCGGAAGCCATGGCGCGTGGCATGACGATGGCGCACCTGATCGGGCAGGCCGCGCTCATTGATGACCTCAAGAGGCAGTCATGACAGGAGGCCCACTGGCGCGCTTCAGGGGGCTTTTTGCGGCATCCGTAGCTTCTGGCCCGAAAATCGCTTCCACGGCCTTTAAAAACGCCTCAACTTTTAAGCCGTTGGCCGTTGCTCTGATGGCAACGACGGTGGCCGCGATCAATCTGCCGTTTGATGAGGCAATCGACTTCCTTCGCCAGAAAACGGCCGTCCCAACGGAAAGCTATCGCGATGTCTGGGACGCGGCCCATTCGAAGATGTTCATGGTGGCGGGCGCAAATAAAAAAGCGCTGGTCGAGGATTTCCAGGCAGCGATCATCAAGGCGGCCGAACAGGGCCTGACGCTTGAAGATTTCCGTGCCGACTTCGATGCGATCGTGGCGCGCCACGGCTGGCAATACAACGGTTCACGCGGTTGGCGCTCCCGCCTGATCTTCGAAACCAATCTCAGCACGGCTTACGCGGCCGGCCGCTACGCCCAGATGACGGCCCCGGAGACGCTCGAAGCGTTTCCCTACTGGATGTACAATCATTCCGGCGCGCTGCATCCGCGCCTGGAGCACAAGGCATGGGATGGCGATTGCTACGAAGCGACCGATCCTGTCTGGGCGAAAATGTATCCCCCGAACGGCTACCGGTGCGGTTGCTTCGTCACGCCGGTGTCTCGTCCTGGTCTCCGCCGCCTCGGCAAATCCGGGCCGGACACGCCGCCGAACCTTGACCAACTCGGCACGGACCAGCCGCGGGGGATCGATCCGTCTTTCGCCTACAATCCCGGCGCTGCCTGGCTGACGCAAACCGCACCTGGTCCGAAGGCGGTCAGCGCCGACCAGGTGCAGGTCGCGGCCTTCGTTCAATCGGCGCTGAAGGGCAAATGGCCTGACGGTGCCTGGACGCCGGTTGCGACGGCGAACAAGGCGACGGGTGCCGCCCTCGATGTCGCCGCAGGCACGGAAATCCGCCTGACGGCTGACACCATCCGCAGCCAGGCGCAAATAGCTGAAGCTCTGACGCCCGACACCATCGGCGTCATTCCCGGCCAGCTCGTCCAGTCCGGAAAGCTGCTTCGCGACAATCGCGGCCGGCCGGCCTTCGTCGGCGAACATGATGGTGTCTTTTATCGCGCCGATGTCGATATCGTCATGAAGGCCGATCGGAAGACCGTTTATATGACCTCGCTCCAGCGTGTCAGCCGTTCTGATCTGACGGACGCCTTCGGCCCCGGCTGGCAGTGAGGTTCTTGGCATGAGCGGCGCATCGATCTCGATCACGGCACAGGTTCTCGATTCGGAAGTCCGACGCGGCTTTCGCCAGCTCGAAGGCCTGATGACGAACACGACGCCCGTTATGCGCGCGATCGGCGTTGGCCTTGTCGGTTCGACCCATATGCGTTTCGTCACCCAGACCGATCCGGACGGTCAAGCGTGGCAGGCGTTGAACACGGGATATGCCGAGGACAAGCGCAATTCCCGCATCCTGACCGAAAGCGGCCGGCTGCGGAACAGCATCAATGCCAGGGCAAGCAATGACGAGGTACTTGTTGGCACCGATGTCATTTACGCTGCCCCTCACCAGTTCGGCGCGACGATCGTGCCGGTTCGGGCAACGCATCTCCGGTTCCGGATCGGCGGGAACCTTATCAAAGCCGACAGTGTTACCTTGCCCGCACGTCCCTTCCTCGGTATATCGTCGGATGACGAAGCGATGATCGCCGAAACCGTCTTCGGTTTTGTGGACCGCTATTCCCCCCGCTGAAATTCTCTTTCCTTCAGGCCCGCCCGCAGGCGCAGGCATGAATTGAATTTTGCCCGCATGGCATATCCGGACCATGCGAAACCTGATCTCGACCACTGTTGTTGCACTTCATTCGGCCTCGGCTGCGCCGGTCTCGACCCATATTGTCGCGTTGCAGTCGGCCGCCACGGTTCCCGAATGGCTGCACGTTCTGCCGACCGGCCGTTTCTCCGGCGTCGATGGTCGCGGCCCTTATGTCTTGGACAATGCCGACGCCCTGATCTCGGCATTCAATGCCGAGGGCAAGAAATTGCCGGTCGATGAGAACCATTCGACCGACCTTGCGGCAAAGCAGGGTTTTTCCGCGCCTGCGCGTGGCTGGCTTACCGCGCTGGAGCGCCGTGACGATGGCGTCTGGGCGAAGGTCGAGTGGACGCCCGATGGCCTGACGATGATGCAGGGCAAAGCCTACGGTTACATTTCCCCGGTTTTCACCCACAGCGCCAAGGCTCCCTTTGCGGTTCACAAGCTGCTGCGCGTGGCGCTGACCAACGATCCCAACCTCAACCTGAAATCGCTTCACTCTCAGAACTTGGAGACCACCATGGATTTGGAAGTTCTCCGGAAGGCGCTTGGCCTGCCGGAAACCGCAGATGAGGCCGCGATCCTTGCGGCGCTCACGGCGGCTCACTCTGCCCAGACCGCGCATGCGGCCTTGATGTCGAAGCTGGCGGAAGTCGCCGGCGTCGAAGTGACTGTCGGCGCTGACGCGTTGGTGACCGCCCTTCAGGCGAAAACGAAGCCTGCCTCGGGCATCGAGCAGGAAAACGCCACCCTGAAAAACGAGATCAAGGCGCTCAATACCCGTGTCGAAACGCTGATCACCGATTCCGCAAAGGAAAAGGCCACCACCGTCGTCGATGCCGCCATCGCCAAGATGCAGATCGTGCCGTCCCTGCGTGAGCACTTCATCGCTCGCCACATGAAAAACCCGACCGAAGTCGAAGCGGAGCTGAAGGTCATGCCGTCGCTCAACGCCGGCGGCCTCGGCGGCCGTCAGCCGCCCGCCGAGGGTGAAACCGCAACCAGTGATGAGCTTACCGTCGCTGCCCTGATGGGCGTCGATCAGGAAGCCTTCAAGAAAGAGCACAAGGCGCTCTTCGGAAAGGACATGTGACATGGCAGCTACCGCTGACATTCGCCGCAAGACCCGCAACGGCGATGCCTATGGTTATCCCGTTCTCGCCGGCGTCCGCATCTTCGGCGGCACCTATATCGGCGTGACCGCCGCGCTCGCGGCCGTGCCGGCCGGTCATGCCTCCTGCGTCGCCCTGATCGGTTTCGCGGAAGAAAACGTCGATAACCGCGACGGCGCGACCGGTGACCGCCTCATCAACGCGAAGAAGGATGTCACTGCCATCACACTGGCCGGTGCCACCGCCGCCGATATCGGCAAGACCGTTTACGCCTCGGCAGATGACACCTTCACGCTCGCTGCTGGCGCATTGCTGCCGGCCGGCATCCTCCATGCCATTGACGCCGACGGCGTCTGGCTCAAGCCCCTCTAAGGAACCACGATGGATATCACGATCTCCAATTTGCGGGGCATCTACACCTCGCTTTCGACCATCTTTAACCAGGCGATGGCCGCCACGCCTGCGTTTTACGAGACCATCGCCATGACGGTCACGTCTACGACCTTCGCCAACCAGTATCCGCGCCTGGACGACTTGCCCGGTTTCCGTGAATGGATCGGCGATCGTGTTGCCCATGATGTCGGTGCGTCTCTCTATCAGATCGTTAACCGCGACTTCGAGAAGACGATCAAGATCAAGCGCAAACAGATCGAAGACGATCAGGTCGGCATATTCACACCCATGGCCGCACAGTTCGGGCAGGACGGCAAATCGTTCCCCGATACTCTCGTTTGGCCTCTGTTCAAGAAGGGCGAGACCGTCACTTGCTATGATGGACAGTATTACTTCGATACCGACCATCCCGGCTACAATGAGCAGGGTAATCAGATTTCGGTGTCGAACTATACCGCAGGTGCGCAGCCGGCCTGGTACCTGATCGACGATACGCAGGTCATCAAGCCGATGATCTGGCAGGACCGTAAGAAGATCAAACTGACCCAGATGTTCGACGAAAAAGACCCGAACGTCTTCTGGCGCGCCGAATACATCTGGGGCGCCGACACACGCGGCAACGCTGGTTTCGGCATGTGGCAGTTCGCCTACAAGTCGAAGGCCGAGCTGACGCAGGAAAACTATGATGCCGCCCGTACCGCCATGCAGTCCATCCGTAGAAAAGATGGACAGATACAGGCGATCCGACCGGTCAAGCTGCTCGTGCCGCCCGTACTCGAAGCGACGGCCCGCAAGATCGTTGAAGCTGCCCTGATCAACGGTGGTGACACCAACGTCTGGGCAAAGACCGCAAGCGTCGTCGTCATCCCGCATCTCGCGTAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP040641|742169:762379|754890_756573_+|WP_142780701.1|DBSCAN-SWA MAEALPGLPQGKWTDPPVLPIDPAKLPDELPRGADIPADLDPLAEGVLMAHQAEWIADDSLLKGCAKGRRTGITFAEALDATLIAAAQRSAGGQNYFYIPDTKPKGREFIGYAAHFAKTVAKELLTIEDGIFFDQRDDGTTNAISSYIIRFKSGFRIEALSSRPENIRGLQGTVCIDEAAFHRDVRAVIDSVAALLIWGGKVRVISSHNGVSNPFNELIKEAEAGKNGFNFHTFTFGDAVKNGLFKRVCLIKGEEWSQEKEDAWEAKIRSAYGTRTSKMKQELDAIPAESEGAALTRVLIERCMSADLPAVVRWDRPDEFKNLDDFERAEQADEFCEGLLKPLLDKLDKDREHCFGEDFARSGDKTAIVVFEIGADLIRRARLIVELKNIPFDQQRDILFYIGDALPRLIGGALDARGNGQYLAEKARQRWGECIHEVMTSAKWYAANMPGYIEAFVDKSLLLPNDADVLADHQALAYVNGIIKVPDEHSTKGADGYDRHGDTAPAGALAWFASNQEAIAYEYETSRKAINPMQGHNGGPPMHDDDRRGGTVNIYLRGSL >NZ_CP040641|742169:762379|761485_762379_+|WP_080866970.1|DBSCAN-SWA MDITISNLRGIYTSLSTIFNQAMAATPAFYETIAMTVTSTTFANQYPRLDDLPGFREWIGDRVAHDVGASLYQIVNRDFEKTIKIKRKQIEDDQVGIFTPMAAQFGQDGKSFPDTLVWPLFKKGETVTCYDGQYYFDTDHPGYNEQGNQISVSNYTAGAQPAWYLIDDTQVIKPMIWQDRKKIKLTQMFDEKDPNVFWRAEYIWGADTRGNAGFGMWQFAYKSKAELTQENYDAARTAMQSIRRKDGQIQAIRPVKLLVPPVLEATARKIVEAALINGGDTNVWAKTASVVVIPHLA >NZ_CP040641|742169:762379|749774_750038_+|WP_142780692.1|DBSCAN-SWA MTSKRQIPAAFTKGYVLCSPSGKLQPNTWSETAARAVASKYRKRDTWEKAQRRGWSVQFVYVRFFIPVFKATFTTTEISEAYDAENI >NZ_CP040641|742169:762379|745936_746968_+|WP_065656426.1|DBSCAN-SWA MKKTTSTNTVWEQSQPTIEFTAKHPASDVAEWRKLTARTVDVAVTYGWTKAEVSRRSGVPDGTFSPWFSGKYLGVLANVNQQLANWLDAIDASQNMAAIMPVSPPFQRTTVGMDVYNALLFAQVTSGFVRVTLPAGSGKTAAVEHFAATRPHVFKATLSPSTKTVHGMLVELCAALEVHEHNPAKFARALGAKLKRVGEGSLLIIDEAQNAVPDAINQLRHFVDNDHCGVALIGNEDTATAFVKDQGRSVASRAQVLSRFDRQVRTVRNPIADAEMLIKAWGVEEGSDCATFLKGLSQKPGALRQIDRTMKAASMLAIGDGEEGVRLEHLQAAWKNRDMGDSL >NZ_CP040641|742169:762379|761086_761476_+|WP_080866969.1|DBSCAN-SWA MAATADIRRKTRNGDAYGYPVLAGVRIFGGTYIGVTAALAAVPAGHASCVALIGFAEENVDNRDGATGDRLINAKKDVTAITLAGATAADIGKTVYASADDTFTLAAGALLPAGILHAIDADGVWLKPL >NZ_CP040641|742169:762379|754059_754725_+|WP_142780700.1|DBSCAN-SWA MAKARGRLSAIDLLPEECSDAISWASQELADRDRSQLDIYAEWKTKLIALQGEIGLDFDIPSFSAFNRFAIRLSQMTRRLEQTREIAATISERMDAAGSDDLTLIAAEAIKTLIFELLQSAGDAGISPKGAMELANALRAASAAQVTSSNRRLKLEAEEKARRIEADMKAKAEKALDVLSNEPGISKEAIARARREFLGVRPKAKSVPEVSPDVENKDGDQ >NZ_CP040641|742169:762379|749472_749778_+|WP_142781575.1|DBSCAN-SWA MIMNDPIAKAKAEEARQSQILADAIHKAIIETGEQFEVPILNAVGGALATNIAEVLASISDRRHRKMFRDQLDRAISLALAQAATRPMAPVETVIVGGVRQ >NZ_CP040641|742169:762379|742536_743382_+|WP_142780684.1|DBSCAN-SWA MAEFIRATLSSIFVGERLRPIDMDYAEAIAASMSEHGQISPIMIRKTPAKKGTPFTLVAGGYRITAATLLGWKEIDAIVVKADAVEAQLLEISENLYRNELNPLDRAIFVMKYRELWEEKHGEIKPGRPSEKNRNDYGIIFSGGRELSERVKERFGFGQSTYEKVTSIGKNLDPVLRQAVRGTSAENDQSQLLTLAKLPREDQVKVAAALKHEPDVKKVLAFTKPPALVTPPPTPSQSIILTKLIAAWDEASEETRDSFLEHIGMSDAPDALMAAIREEAA >NZ_CP040641|742169:762379|751067_751439_+|WP_035261583.1|DBSCAN-SWA MAEDLTLDLLSTLGEDGFFSLVEAHAGVRLYVPSDPERSELSSTIGVDAAYRLAKAYPGGYIRVPLAREFRARRYVDAEMSNRDIAKRLGLTESGVERLLKRARKREPLKSRRKTDPRQMEMF >NZ_CP040641|742169:762379|743839_745858_+|WP_142780685.1|integrase,transposase|DBSCAN-SWA MKKEWFTSAELAQAALPGIPSTRQGLELFIARSGVRSTVKARPKAGQGGGFEYHYSFLPSVAQAKLAFLNAEPTDPRPTKLSKMLWDRFEALSDAHKAICKTRFAVLTEVEELRTSGISMKHAVAHVTRRADIVPATYYEWRKMVEGHSRQDWLAALAPSFSGSASGEAAEVTPCHPEAWKILKSDFLRPERPSFSACYRRMMMVARDQNLSPIPSERSLRRRLDAEVPKAAQIIAREGKDKAKQLFPPQKRTVAHLHAMKIVNTDGHQLDLFVRAPWSETPVRVILIGIQDVYSRKVLSWTLSEAETWEAVRTCIGSMIENHDGILPEHIYMDNGRAFAGKMISGGAKTRHRFKVNEDDVAGLLKTLDIEPHFVKPRSGQSKPIERAWRDLAEEISKHPSMSGCYTGNRPDAKPENYGNSAVPLETLQRHVAQCVDEHNHRLNRTTETAHGRSFAQTFDASIAEPSTIVRYASMAQRSLWMLSAVAITARKPDGAIHMHGNRYWHPVLNEWISKKLTVRFDPADLHKPVKVYDPEGRFLCDANCLAKTGFADTGAARRQEKARKTHVKNLQAVAKSNAALSPMQLGEIMEKGRKAEAAKRPQTPVRPVVTRLVTGNLAHAPVEAVSVDHFEDSFARGLARVAGGESAIIQFPTGNTEAGGKPARKRRAEKY >NZ_CP040641|742169:762379|742169_742544_+|WP_142780683.1|DBSCAN-SWA MVNPYSDEERLQAMVAASYRASRSHFNHLPLRHIINPPAEMFDAKLARQMAIYVLHVDFDVPRRRLVVLLGVARWTVMQAVRVVEARRFEPLFDKAYERIAARAKDTFMEMLYEASAGQEASHG >NZ_CP040641|742169:762379|746964_747291_+|WP_142780686.1|DBSCAN-SWA MTPENPSLKSHLDYLASIFGDAKKMKPDEKLELDAQSAQTVLKTLRALSQQAGHLELELSILRDSEAGKLLAKTAEQLATGELTGLLKKAEGNIIRPNFGGKKNDGEA >NZ_CP040641|742169:762379|753746_754058_+|WP_142780699.1|DBSCAN-SWA MSIGIDYMKIMREEARLIILRALAEQVNESLSSSMLEPVLANFGINQERPWVHQQIEYLETMGAVVVVSAGTVKIASLTDLGRRHVDRQSAIEGVKRPSRVGA >NZ_CP040641|742169:762379|750862_751090_+|WP_142780695.1|DBSCAN-SWA MTGNIAPLNGMPLFGWPDQREIDVLQNQRDQLAERIAKLPRFSHRRIELEARLRALTEEQLIISNRITRGRRPHP >NZ_CP040641|742169:762379|748776_749226_+|WP_142780690.1|DBSCAN-SWA MSELHLYDNLFDAEARVAAIYMIEALGDPDSPPDWLQDFVDNTDLPDVRAILEAHSELADAIDMNKFEDCKEQASALMERWSLVGRKGLIVKAEICVRRYRAGTTIFSSGWGHLRWRWFIVDELDLVVPKLLGIASGQHHVSMLEGQAA >NZ_CP040641|742169:762379|752584_752950_+|WP_142780697.1|DBSCAN-SWA MISAWLTKAARPVIIVLLLIGAALFLGWLTIATVNGMVERAVSRTKAESDARWTAAIEAANTKAASAEAAQARYALDLERSTSAKIDALRARNEELETQNAALPNGDDCGLGRDRVRLLPH >NZ_CP040641|742169:762379|753375_753750_+|WP_080866962.1|DBSCAN-SWA MTPTEIIPWLTLVLSSLAVLGHLKGFFSSGEKKLEADIKSGREEIALLGKDVEAHETKLTSHDRRIQAIEGEMRHLPDRESQHRLELALEKVNGRLDTLNETLKPIKATNERMNQLLVETAGKQ >NZ_CP040641|742169:762379|753160_753379_+|WP_142780698.1|DBSCAN-SWA MNFGGNAALDLAAERAEQEREAGIAAASRSLRTPGTIECEDCPNDIPRERRIALPSATRCIVCQTKFEKARR >NZ_CP040641|742169:762379|750212_750863_+|WP_142780694.1|DBSCAN-SWA MSKTIAAIKIEQKKLGLDDFTYRAKLHILTGKTSTTEMTEAERQKVLVSLRGSAARPAPVRQDGRDGKRKLSGKYLPKMRALWIACYNLGVIDDRRDSALEAFAMGRQLPNISDMRFVHKPEDAVSVVEAMKGMLARAGVVWADRLPCEPYEKSPGYKIARAQWAILHPAEPNAFWQAVTHIVTESISYRNLSDAEWITVMNHFGPQVRRLKKAQM >NZ_CP040641|742169:762379|759419_759929_+|WP_142780704.1|DBSCAN-SWA MSGASISITAQVLDSEVRRGFRQLEGLMTNTTPVMRAIGVGLVGSTHMRFVTQTDPDGQAWQALNTGYAEDKRNSRILTESGRLRNSINARASNDEVLVGTDVIYAAPHQFGATIVPVRATHLRFRIGGNLIKADSVTLPARPFLGISSDDEAMIAETVFGFVDRYSPR >NZ_CP040641|742169:762379|747274_747637_+|WP_142780687.1|DBSCAN-SWA MTAKPDCVSDYLLKLARDLNGVVNERGTINLDRVTSVNVILHIGRIAELARKLENAWSQAEWNRRASQDRLSLLTSMNRVTAEVLGLMRPDTEDGGNVVQFRPKPSNSPAPSAPSGGDAA >NZ_CP040641|742169:762379|743378_743843_+|WP_080819450.1|DBSCAN-SWA MSTKRDPNQMDFFKETVFPVRSASERLDIDRFRSTLKREMARAIRECQYDRDTIAARMAYYLGLDKVSKSALDSYTAESKTAHDISMPRFKAFVRATNAFWLWDVVVSDDGLLLLEGDEARLAEMSRIRQEQRKLAKELKVLQATPVHIRRVRK >NZ_CP040641|742169:762379|747672_748332_+|WP_142780688.1|DBSCAN-SWA MEAVILEEKAAAGITVVNGKDYVTNADGGLTPLALVKAEDFLEDQMVRKIIGFAKPLSAELARFKKHTRADIAEFDRNLEAKYGLVKRGRAGAGNQKYRTIDGLMSVETRVNKLIEFGPQLQVAKGLIDECLNEWTEDGRAEIRGLVTRAFNVDQAGKISKDAVFELFKIESDDPRWMKAMDAINAAVRVIGSKEYLHFSFRESHDAEWVRISLNIADA >NZ_CP040641|742169:762379|751545_752289_+|WP_142780696.1|DBSCAN-SWA MTTQTFDEWLIARLRVAGAYGGTMDGVHGREVIAALERFQGAYDLPITGRADQVTVDALRKVQSKNPNSNLVTYEKVPVPAEPVWMREARRYMGLKEIAGPKSNATIMGWAKKLGGWIASFYTDDDIPWCGLFVGNLIATTLPKEALPANPLGALNWKKFGVESRIARGAILVFERKGGGHVGFYVGEDRTHYHVLGGNQDNSVSITRVDKSRLVSGGVRWPKSADAPIAGKVELSSTGAPVSKSEA >NZ_CP040641|742169:762379|752411_752588_+|WP_168208184.1|DBSCAN-SWA MATIALPSLLTLIATVLGVHRHYGSKDFEAAAQSEAVLPSPPPYLSRDQPAALSETVR >NZ_CP040641|742169:762379|749222_749486_+|WP_142780691.1|DBSCAN-SWA MTSIYFSDATLKSFSAATKGGKSTIKIEIETADRYQMASILNQLDEIEAEQKAAKTPRKAPSGKTDAPLLALPAPVKQISYHGDDHE >NZ_CP040641|742169:762379|758311_759409_+|WP_142780703.1|DBSCAN-SWA MATTVAAINLPFDEAIDFLRQKTAVPTESYRDVWDAAHSKMFMVAGANKKALVEDFQAAIIKAAEQGLTLEDFRADFDAIVARHGWQYNGSRGWRSRLIFETNLSTAYAAGRYAQMTAPETLEAFPYWMYNHSGALHPRLEHKAWDGDCYEATDPVWAKMYPPNGYRCGCFVTPVSRPGLRRLGKSGPDTPPNLDQLGTDQPRGIDPSFAYNPGAAWLTQTAPGPKAVSADQVQVAAFVQSALKGKWPDGAWTPVATANKATGAALDVAAGTEIRLTADTIRSQAQIAEALTPDTIGVIPGQLVQSGKLLRDNRGRPAFVGEHDGVFYRADVDIVMKADRKTVYMTSLQRVSRSDLTDAFGPGWQ >NZ_CP040641|742169:762379|756572_758192_+|WP_142780702.1|DBSCAN-SWA MAKKKKQKISRHLASSMKDQDGKVVSVAELTEEVAGAQVGGVRQWISGHPADGMTPIKLASILRAADQGEVEAYFELAEDIEERDSHYLAQLATRRRSVSQLPITVTPASDGAEHKKHAEFLREWIKTGVLRSSLFDMLDAIGKAISVLEIDWHHKNGNVLPRAMIWRTQRWFTFDRADGETLLLREGAAGEPLIPHKFIVHRSKAKSGLTIRSGIARVAVWLWMFKSFTVKDWAVFTQNYGQPIRIGKYGRGATEQEKDVLWRAVSGIAGDCAAIIPREMLIEFHEVGSKSSSTDMFERRADWYNRETSKLILGQTTTTDAVSGGHAVAKEHRLVQEDIERSDALDASDTLNAQLVPNIIAFNFGPQDEYPTIHIGRPDEVPLKDFAEAFDKLAKHGLTAEASFLRDRLGIPTPATDAELVGGRVETVVPPEDRPQRKPLTAKQSLDRLFALAHSRDEPDLLEKLTDRLEKDAAAAMDGMIDEVREILSTATDLRDAARKLADLDLSAEDLAEAMARGMTMAHLIGQAALIDDLKRQS >NZ_CP040641|742169:762379|748342_748780_+|WP_142780689.1|DBSCAN-SWA MARIYVASSWRNPHQPAIVDLLRTNGHEVYDFRNPPHNTGFSWSQIGLAVPCSAEDYRNALLTHPRAAQGFMSDFAAMRWADTCLLVLPCGRSAHLELGWMAGAGKRTLILTQDGEEPELMALLADTICINAGEVLAELRKGGAA >NZ_CP040641|742169:762379|760002_761085_+|WP_142780705.1|DBSCAN-SWA MRNLISTTVVALHSASAAPVSTHIVALQSAATVPEWLHVLPTGRFSGVDGRGPYVLDNADALISAFNAEGKKLPVDENHSTDLAAKQGFSAPARGWLTALERRDDGVWAKVEWTPDGLTMMQGKAYGYISPVFTHSAKAPFAVHKLLRVALTNDPNLNLKSLHSQNLETTMDLEVLRKALGLPETADEAAILAALTAAHSAQTAHAALMSKLAEVAGVEVTVGADALVTALQAKTKPASGIEQENATLKNEIKALNTRVETLITDSAKEKATTVVDAAIAKMQIVPSLREHFIARHMKNPTEVEAELKVMPSLNAGGLGGRQPPAEGETATSDELTVAALMGVDQEAFKKEHKALFGKDM >NZ_CP040641|742169:762379|750021_750216_+|WP_142780693.1|DBSCAN-SWA MPRTFEPDQLLTALIDAFLKDGHFVHARNGKMFVLVVTEEGGEDKSSEFCLSDIAEHAARRMSK |
31 | Ochrobactrum_phage(50.0%) | transposase,integrase | attL 734853:734869|attR 765396:765412 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|