Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP028350 | Pantoea vagans strain PV989 plasmid pPV989-508, complete sequence | 1 crisprs | csa3 | 0 | 3 | 0 | 0 |
CP028349 | Pantoea vagans strain PV989 chromosome, complete genome | 4 crisprs | cas3,DEDDh,DinG,csa3,WYL | 1 | 3 | 6 | 0 |
CP028351 | Pantoea vagans strain PV989 plasmid pPV989-167, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP028352 | Pantoea vagans strain PV989 plasmid pPV989-94, complete sequence | 0 crisprs | NA | 0 | 0 | 10 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP028350_1 | 51138-51396 | Orphan |
NA
Consensus repeat of CP028350_1
|
3 spacers
spacers of CP028350_1
>1.1|51157|51|CP028350|PILER-CR TCCTGATTCAGGCAGGACAGCGTCTCTTTCTGCTGATTCCCGTTGAGCACT >1.2|51227|80|CP028350|PILER-CR ACTGCATTCACGTTCAACACGGCCGGCGTGCGTACTGTATTGAGGCGCTTTAACTTATCAGTATTGCCAGCGTGACACTG >1.3|51326|52|CP028350|PILER-CR TTTGCCGACTGGCATCGTCAGAGCTCCGGCGTACTGCGGATTCCCCTGTCGG |
CRISPR arrays and Neighbor proteins around CP028350_1
The CRISPR arrays of CP028350_1 >merge|CP028350|1|51138-51396|PILER-CR TTGCTGATTATAGTGAACGTCCTGATTCAGGCAGGACAGCGTCTCTTTCTGCTGATTCCCGTTGAGCACTTTACTGATTACAGTGAATGACTGCATTCACGTTCAACACGGCCGGCGTGCGTACTGTATTGAGGCGCTTTAACTTATCAGTATTGCCAGCGTGACACTGTTACTGATTACAGTGAATGTTTGCCGACTGGCATCGTCAGAGCTCCGGCGTACTGCGGATTCCCCTGTCGGTTACTGATTACAGTGAACG >CP028350|1|1|51138-51396|PILER-CR TTGCTGATTATAGTGAACG TCCTGATTCAGGCAGGACAGCGTCTCTTTCTGCTGATTCCCGTTGAGCACT TTACTGATTACAGTGAATG ACTGCATTCACGTTCAACACGGCCGGCGTGCGTACTGTATTGAGGCGCTTTAACTTATCAGTATTGCCAGCGTGACACTG TTACTGATTACAGTGAATG TTTGCCGACTGGCATCGTCAGAGCTCCGGCGTACTGCGGATTCCCCTGTCGG TTACTGATTACAGTGAACG
>CP028350.1|AVV39396.1|49941_50871_+|RepB-family-plasmid-replication-initiator-protein MADKDNENKALLEPFLSVTKNSGEVIQLHPNKNNTVQPVALMRLGLFVPTLKSTARGKSGAMASTDATKELKNLTLVKAEGYEKITITGARLDMDNDFKTWAGIIQSFSRYPTKGDTVTLPFIDFVKMCGIPSANSSAALRKRLDASLRRIATNTLSFEGNGKAYHTHLVQSAYYDREKDIVRIQADPKLFELYNFDHKVLLQLRAISRLKRKESAQALYTFLESLPTNPAPISLARLRMRLNLGSKTTTQNHVVRRAMEQLKEIGYLDYSEVKRGRSVFFIIHSRTPKLDGIGNLESIDSLDEIEFED >CP028350.1|AVV39395.1|48106_49477_-|succinate-semialdehyde-dehydrogenase MTEHAVSRNPATGEILARYPLQNSTELEKTLAESASAFTAWQRSSMSDRVRVLRQLGEQIRLREQDLSRMITLEMGKPITQARAEVLKCANLCDWYAEHGPAMLADQPTQIADAWQRFRPVGVILAVMPWNFPLWQVLRGAVSMLLAGNTYLLKHAPNVMGSATLMAELFTATDLPAGGFNLINVDNDGVSVAIKDDRIAAVAVTGSVRAGAAIAAQAGAALKKTVLELGGSDPFIVLADADLDEAVKSAVIGRFQNTGQVCMAAKRFIVEAPIAEEFEKRFTAAVQALKMGDPLDENTFLGPMARADLRDELDGQVKATLSEGARLVLGGEKVAGQGNYYAPTILADVTSTMTAFKQELFGPVAAIAVARDPQHALEIANESEFGLSATVWSGNEETADRLALQLEVGAVFINGNGASDPRVTIGGVKKSGYGRELSHFGVHEFCNVQTVWKNRR >CP028350.1|AVV39394.1|46924_48103_+|FAD-dependent-oxidoreductase MPQRIAVIGAGVLGLAVAQSLSRRGAQVTVFDKSLPGSGTSQISYAWVNANGKEPGHYHELNAQAINEHKRWQASHRAWPRWLLETGSLEWAADESSLRQLTQRAAKLATLDYPVEKRCRAALLGALPGLRLDPRIQHAWFFPSECLLYPSLFIASLLADLRASGGQLVCNNEVTTLTESGRDVHLTLASGDEWHGDQLVLATGRWAPELISQCGLELAMTDANRADPVACSFLAQTQPLPIPLNCNLITPELNVRPDGGGRLMLQALDLDQHADPSRPASPDGLTGKEMLRRLRRLFNNAEGARIERIETGQRSRPADGLPALGYISDSARVYLMVSHSGMTLAPLLGRLVAEEMLSGTPSPMLSRFTPHRLLRPLSEVVKTAPAYLPAAQ >CP028350.1|AVV39393.1|45601_46831_+|hypothetical-protein MTLSSLISGDPRLPVAVLSGFLGAGKTTLLNHILNNREGRRIAVIVNDMSEINIDAALVRNGDAQLSRTDEKLVEMSNGCICCTLREDLLLEVKRLAQAGRFDHLVIESTGISEPLPVAETFTFEDETGESLSAYARLDSMITVVDGFNFLRDYRSVDDLQSRGESLGEQDERSVVDLLVDQIEFCDLLVLNKTDLLSPAELHQLQGMLRALNPHARMVNSEFGQVPLDFLLNTGRFDFDRAAQAPGWLQTLRGEHQPESEEYGIRSVVYRARRPFHPQRFWEVVNHQLDGVIRSKGYFWLATRPEFAAMWSQAGAVARQGYAGRWWVSVPRDNWPQDADSLNFIAEQWQEGTGDARQELVFIGIDMDEQHIIAALDHALLTPYEMAAGPEHWVTLDDPVPAWFDEMTA >CP028350.1|AVV39392.1|45140_45479_-|hypothetical-protein MQQLTLVPRLMPVVRRGEKTSTIRWQEGDIVTGPLRLVNQQDEADTVIVWVTRIDTLRLNEVAAKLGKEHEWPDAVLLEGMREHYPAIRLSSEVQLITHLTPAETRQKLAER >CP028350.1|AVV39391.1|44591_45140_+|XRE-family-transcriptional-regulator MNTPISIIANALVRERQRSGLSLAEVARRAGIAKSTLSQLEAGNGNPGIETLWSLCVTLNIPFSRLLEPDARRLQVIRRGEGLTVTADLADYQAVLLASCPPGVRRDIYLLEVQPGSERISQPHIPNTIEHIIIAKGRALVGPVDSAVELDVGDYITYPGDELHIFRALEADTLALLVIEHS >CP028350.1|AVV39390.1|43811_44480_-|branched-chain-amino-acid-ABC-transporter-permease MLFPQMQSLDKGVIKAIFLVCLADGIVGLSYGSLASADGFPLWVPLALSTLVLAGASEFLFIGIVAGGGSPFTAAAAGLLVNARHLPFGIAVKDLVGKGPQGWFGCHIMNDESVVFGISQPQLAQRRSAYWLCGIGIGLIWPVSVMVGAAIGQFIPDVSVIGLDAVFPAILIALIFPALRQRRTRIPATVGALLSLLATPLVPAGMPVLFSLLGLLTWRSRK >CP028350.1|AVV39389.1|43491_43815_-|AzlD-domain-containing-protein MTHQGLIIAGIAMLAAGTYLMRFAGVRLGNRLPISERTQQMLSDAATVLLLAVAATTTLFEGQHFAGVARVAGVGFALLLAWRRAPLIVIILGAAAMTALLRFLGVA >CP028350.1|AVV39388.1|42645_43374_-|thiamine-biosynthesis-protein-ThiJ MNRPLQIGLLLFPDVTQLDLTGPWEVFARMPGVECHLIWKDRQPVRSDRGLSILPTAIFDDCPLLDVICVPGGPGQIALMSDEETLDFLRRQAEQAQWVTSVCTGSLVLGAAGLLKGYRATSHWSSIDQLALLGAEPVNQRVVRDRNRISGAGVTSGIDFALTLVAEIAGDAVARAVQLQMEYDPEPPFSCGSPQTAPPEEVAQARAKIAEFIATRRAATEHAARRLQADQLTDAGDAQQQQ >CP028350.1|AVV39387.1|41730_42696_+|acyltransferase MNKTGLAMLKTLACITAVSFFTLYQSYDSYHYDVNLTLNVLSFISTIATPLYFLLSGYLDAGELHTPAWQLGKIRRILLIFIFWFSFFWFAGMHHKGYLIQPWFVIALIVIYVSHPLIDWLIQRPGVMAAGILLLLVLAFTYDLLASLYPDQRAFSLLPQYRIWTWVLYYLTGRLMAAPRVMQLLTSPRVIKVSLLLLPAVYLFTWLYERYYFIARFLATHNDVVLTGSQVYLLVVLIVISINAIRLPEKAQWLTTFLTTLGKTMTGVYILHYLIFGVLAAAIRIQTLGDKLLVIALTLILSMALSLLLLRIPGVRKLISL >CP028350.1|AVV39789.1|52425_53625_+|ParA-family-protein MANDDKQVMKVAQRSERMLLSLTDQIQAQKQELHENTYYQVYAKAALAKLPKLTRASVDYAVNEMEESGYQFDKRAAGSSVKYAMSIQNIIDIYHHRGVPKYRDRHSEAYTIFVGNLKGGVSKTVSTVSLAHALRAHPHLLFEDLRILVIDLDPQSSATMFLNHERSVGLVEATAAQAMLQNVTRDELLNEFIVPSIIPGVDVLPASIDDAFIASSWDQLCAEHLPQQNVHAVLYDNVIAKLKKDYDFIFIDSGPHLDAFLKNAIAASDLLMTPVPPAQVDFHSTLKYLTRLPELIAIIEDSGATCRLQGNIGFMSKLSNKADHKLCHSLAKEIFGGDMLDAALPRLDGFERCGESFDTVISANPSTYVGSSEALKNARSAAEDFAKAVFDRIEFIRLN >CP028350.1|AVV39397.1|53632_54604_+|ParB/RepB/Spo0J-family-partition-protein MSAKRITIGRTFSQTPLENDTPDSQHNQTFVLATGKRALFRFERIAASDVENKTFVTMETNGRDQAGLTPDSLRDIIRTIKLQQFFPAIGVMRDDRIEILDGSRRRAAALYCKTGLDVLVTDAAISADEARRLAQDIQTAREHNLREVGMRLLALKEGGLSQKEIAENQGLSQAKVTRALQAASVPSDLISLFPNHAELTYPDYKALLQAADKLSESGQSVEALINSISREIDVVCAREGLAEDELKNHILRLIRQGSQTLMKEPEKDKTQATALWSFADKDRFARKKVRGRMFSYEFNRQSKELQEELDKVIAETLKKYLNR >CP028350.1|AVV39398.1|55313_55598_-|hypothetical-protein MTIAVLNAGYASTDSDEPASRLSVLLNLFSSAFTVNSFTVSHVPFLSSRFSFSAVEHILSSALLNTVNLTEVADFRSLALFLSNPAHAVRINRK >CP028350.1|AVV39399.1|55697_56708_-|transketolase MSSQPQKKRLTTSAMIASIAAEGQPTVSAPFGQALVSLAEQRSDVVGMTADLSKYTDLHLFADAFPQRFYQMGMAEQLLMSAAAGMAREGFMPFVTTYAVFASRRAYDFICMAIAEENLNVKIACALPGLTTGYGPSHQATDDLAIFRGMPNLTIIDPCDALEISQVIPAIAEHQGPVYLRLLRGKVPLVLDKYDYRYQHGKAQRLRTGKDAVIISSGLMTMRALEAADVLEKQGIEIGVVHSPVIKPLDEETILREAGEALHHNQLVITAENHSITGGLGEAIAGLLMRNHVTPRFKQIALPDAFLAAGALPTLHDRYGISADAMVRQITAWLEA >CP028350.1|AVV39400.1|56709_57552_-|transketolase MTDPRHSSHTLSLPQRARNIRRHALLMGQIQGQGYIGQALGAADLLAVSYFHAMNYRPDEPEWEERDRFYLSIGHYAIALYAALLEAGVIPEEERETYGTDESRLPMSGMAAYTPGMEITGGSLGHGLGIAVGACLGLKQKGSQARIYNLLSDGELNEGSTWEAAMSASHWQLDNLIAIIDVNNQQADGHSSEILAFEPIVERWQAFGWYTQRVDGNDIAALQHAFDAARSWPEPQPRVIICDTRMGKGVPFLESREKTHFIRVDADEWDTALTALEQGV >CP028350.1|AVV39401.1|57564_58863_-|MFS-transporter MTTLNLSAASVARDITYRKIAWRLMPWLMLCYLCAYLDRVNVGFAKLQMMDDLSLSETVYGLGAGIFFLGYFFFEVPSNLILHRVGARRWIARIMITWGVISALFAFVETAWQFYLLRFLLGVAEAGLAPGLLLYLTYWFPSHRRARMTVLWFVAIPLSGMIGGPLSGWIMATFAGFHGWAGWQWMFMLEAIPTLVMGFVVLLVLKDRVEQAEWLDDDEKRRVRADLDEDNQQKACHGTVKAFVADRRLWLLALIYFCVVMGQYALTFWLPTLVRNSGVSEPLHIGLLTSLPYLCAIIFMLLAGRSGDRHRERRWHLIVPMLIGLSGLTLATLLSHNVSLSLFSLCIAAAGILSASSLFWMLPTNLLGGVSAAAGIAAVNCIANLAGFFSPWLIGSITTATGSPASGMYFIAAVLLGGALSVLRIRAADVNR >CP028350.1|AVV39402.1|58917_59667_-|short-chain-dehydrogenase MLLENKVVIITGAASQRGIGRATAACFAQHGARVVVLDLDLQAAQASALSCGDGHLGLATDVTDPLSVHRAIDQVLEHYGRIDVLVNNAGITQPVKTLEIGIEDYNRILDVNLRGTLLMSQAVIPTMQRQKSGSIVCLSSVSAQRGGGIFGGPHYSAAKAGVLGLTKAMAREFGPDQIRINALTPGLIQTDITGGLLQDERRHAILDGIPLGRLGTASDVANAALFLASDLSGYLTGITLDVNGGMLIH >CP028350.1|AVV39403.1|59864_60767_+|LysR-family-transcriptional-regulator MEKQSEHLPSIKSLRVFEQVAHFGNVARAAEELSITPSAASHQLAKLERELGCILFNRSAKGVTLTLSGEHYLREIGPILLCLAQATARISNEKARSNLRIHCAPSFGLLWLLPRIHKFRDSYPEFQLTLSCSYENLSFSRDNIDIAIRHGFPDWNAFEIKTIRHEKMSVLASPDYLKQHPVSQAADLIHHALILSESPLIQWPQWFAAQQLPQPDKEWLFRFDRSYMSLEAAMLGHGLIFESELLAEDYLRSGKLVRVLTEQHSLPVSAHHLVFPRGFAQYPRVAHFLSWIQEELRTLI >CP028350.1|AVV39404.1|60966_62418_+|sugar-porter-family-MFS-transporter MSTNTPDAGEAGDHAPADTVRQRIFLVVLVATMGALAFGYDTGIISGALPYMTSPPDQGGLGLNSFTEGLVASSLVFGAAIGSFLSGFFSDRFGRRITLRTLAVIFVLGSLGTALAPSVNVMVAMRFLLGIAVGGGSSTVPVFIAEIAGPRLRAPLVSRNELMIVTGQLIAYVASTLLSYLLHDEHLWRYMLAIAMVPGFLLFIGTFFVPASPHWLVAEGRLKEAKKILKYLRETPREVRHEMAQMKKQARAAERGPDAKTLIREKWVIRLMIIGVGLGFVAQFTGVNGFMYYTPIILKQTGLGTSASIAATIGNGVVSVLATFVGIWAISRFPRRTMLITGLCLVITAQILLGSVMTFMSSGLMQSYLALGCILLFLFCMQMCISPVYWLMMSELFPMQLRGVLTGGAVSLQWIFNAVVAFGFPPIMAYAGSTTFFIFAAINVGSLIFVMTMVPETRGKSLEEIESHMKEKFGEKPKEAEAC >CP028350.1|AVV39405.1|62500_62722_-|hypothetical-protein MVHPGQHAEQMKSEYSAEPAPIFVSGHASLKRRFQPSAAYEGYAGFWRFQPEIARHGRKPSRKIELITVNETA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP028350_1 | 1.1|51157|51|CP028350|PILER-CR | 51157-51207 | 51 | NZ_CP028350 | Pantoea vagans strain PV989 plasmid pPV989-508, complete sequence | 51157-51207 | 0 | 1.0 |
CP028350_1 | 1.3|51326|52|CP028350|PILER-CR | 51326-51377 | 52 | NZ_CP028350 | Pantoea vagans strain PV989 plasmid pPV989-508, complete sequence | 51326-51377 | 0 | 1.0 |
CP028350_1 | 1.3|51326|52|CP028350|PILER-CR | 51326-51377 | 52 | NZ_CP038854 | Pantoea vagans strain LMG 24199 plasmid unnamed1, complete sequence | 525005-525056 | 0 | 1.0 |
CP028350_1 | 1.1|51157|51|CP028350|PILER-CR | 51157-51207 | 51 | NC_014258 | Pantoea vagans C9-1 plasmid pPag3, complete sequence | 528067-528117 | 1 | 0.98 |
CP028350_1 | 1.1|51157|51|CP028350|PILER-CR | 51157-51207 | 51 | NZ_CP038854 | Pantoea vagans strain LMG 24199 plasmid unnamed1, complete sequence | 524836-524886 | 3 | 0.941 |
CP028350_1 | 1.2|51227|80|CP028350|PILER-CR | 51227-51306 | 80 | NZ_CP028350 | Pantoea vagans strain PV989 plasmid pPV989-508, complete sequence | 51227-51306 | 20 | 0.75 |
CP028350_1 | 1.2|51227|80|CP028350|PILER-CR | 51227-51306 | 80 | NZ_CP038854 | Pantoea vagans strain LMG 24199 plasmid unnamed1, complete sequence | 524906-524985 | 22 | 0.725 |
1. spacer 1.1|51157|51|CP028350|PILER-CR matches to NZ_CP028350 (Pantoea vagans strain PV989 plasmid pPV989-508, complete sequence) position: , mismatch: 0, identity: 1.0
tcctgattcaggcaggacagcgtctctttctgctgattcccgttgagcact CRISPR spacer tcctgattcaggcaggacagcgtctctttctgctgattcccgttgagcact Protospacer ***************************************************
2. spacer 1.3|51326|52|CP028350|PILER-CR matches to NZ_CP028350 (Pantoea vagans strain PV989 plasmid pPV989-508, complete sequence) position: , mismatch: 0, identity: 1.0
tttgccgactggcatcgtcagagctccggcgtactgcggattcccctgtcgg CRISPR spacer tttgccgactggcatcgtcagagctccggcgtactgcggattcccctgtcgg Protospacer ****************************************************
3. spacer 1.3|51326|52|CP028350|PILER-CR matches to NZ_CP038854 (Pantoea vagans strain LMG 24199 plasmid unnamed1, complete sequence) position: , mismatch: 0, identity: 1.0
tttgccgactggcatcgtcagagctccggcgtactgcggattcccctgtcgg CRISPR spacer tttgccgactggcatcgtcagagctccggcgtactgcggattcccctgtcgg Protospacer ****************************************************
4. spacer 1.1|51157|51|CP028350|PILER-CR matches to NC_014258 (Pantoea vagans C9-1 plasmid pPag3, complete sequence) position: , mismatch: 1, identity: 0.98
tcctgattcaggcaggacagcgtctctttctgctgattcccgttgagcact CRISPR spacer tcctgattcaggcaggacagcgtctctttctgctgattaccgttgagcact Protospacer ************************************** ************
5. spacer 1.1|51157|51|CP028350|PILER-CR matches to NZ_CP038854 (Pantoea vagans strain LMG 24199 plasmid unnamed1, complete sequence) position: , mismatch: 3, identity: 0.941
tcctgattcaggcaggacagcgtctctttctgctgattcccgttgagcact CRISPR spacer ttctgattcaggcaggacagcgcctttttctgctgattcccgttgagcact Protospacer *.********************.**.*************************
6. spacer 1.2|51227|80|CP028350|PILER-CR matches to NZ_CP028350 (Pantoea vagans strain PV989 plasmid pPV989-508, complete sequence) position: , mismatch: 20, identity: 0.75
actgcattcacgttcaacacggccggcgtgcgtactgtattgaggcgctttaacttatca CRISPR spacer actgcattcacgttcaacacggccggcgtgcgtactgtattgaggcgctttaacttatca Protospacer ************************************************************
7. spacer 1.2|51227|80|CP028350|PILER-CR matches to NZ_CP038854 (Pantoea vagans strain LMG 24199 plasmid unnamed1, complete sequence) position: , mismatch: 22, identity: 0.725
actgcattcacgttcaacacggccggcgtgcgtactgtattgaggcgctttaacttatca CRISPR spacer actgccttcacgttcaacacggccggcgtgcgtcctgtattgaggcgctttaacttatca Protospacer ***** *************************** **************************
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP028349_1 | 194884-195031 | Orphan |
NA
Consensus repeat of CP028349_1
|
1 spacers
spacers of CP028349_1
>1.1|194938|40|CP028349|CRISPRCasFinder CCCATGCACACCAAAGTGCACAGCCACCGGAGCGTACTCC |
CRISPR arrays and Neighbor proteins around CP028349_1
The CRISPR arrays of CP028349_1 >merge|CP028349|1|194884-195031|CRISPRCasFinder TAGTACGTGAGGATGGCGAGCACTGCCCAACCGGAGCCTGGCAAGTAAGCCAGGCCCATGCACACCAAAGTGCACAGCCACCGGAGCGTACTCCTAGTACGTGAGGATGGCGAGCACTGCCCAACCGGAACCTGGCAAGTAAGCCAGG >CP028349|1|1|194884-195031|CRISPRCasFinder TAGTACGTGAGGATGGCGAGCACTGCCCAACCGGAGCCTGGCAAGTAAGCCAGG CCCATGCACACCAAAGTGCACAGCCACCGGAGCGTACTCC TAGTACGTGAGGATGGCGAGCACTGCCCAACCGGAACCTGGCAAGTAAGCCAGG
>CP028349.1|AVV35875.1|193225_194542_-|sn-glycerol-3-phosphate-ABC-transporter-substrate-binding-protein-UgpB MSVVTFRRSLMSVLLGLTLSSHAMAATEIPFWHSMEGELGKEVDSLAQRFNETHPDYKIVPTYKGNYEQSLAAGIAAVRSGKAPAVLQVYEVGTATMMASKAIVPVHQVFKDAGIPFDEKQFVPTVAGYYSDSKGQLISQPFNSSTPVLYYNKDAFKKAGLNPDQPPKTWQELATDAAALRKAGMSCGYASGWQGWIQIENFSAWHALPVATENNGFDGLDAVLEFNKPVQVRHIDLLEAMNKKGDFTYFGRKDESTAKFYNGDCGITTASSGSLADIRHYAKFNFGVGMMPYDDTVPNAPQNALIGGASLWVMKGKDAATYKGVAEFMQFLAKPEIAAEWHQKTGYLPITTAAYELTKQQGFYDKNPGADIATRQMMNKPPLPFTKGMRLGNMPQIRTVIDEELESVWTGKQSPQSALDNAVKRGNELLRRFQQQMK >CP028349.1|AVV35874.1|192284_193172_-|sn-glycerol-3-phosphate-ABC-transporter-permease-UgpA MSSSRPVFRTSLLPYLLVLPQLLITAIFFLWPAGEALWYSLQSLDPFGISSSFVGLENFRRLFADPYYLDSFWTTIKFSGMVTVFGMVFSLLLAALVDYVVRLRRLYQTLLLLPYAVAPVVAAVLWMFLFNPGLGLFSYLLNHIGYNWNYAQNSGQAMFLIVLASIWQQMSYNFLFFFAALQSIPKSLVEAAAIDGAGPVRRFFNLSLPLITPVSFFLLVVNLIYAFFDTFPVIDAATGGGPVQATTTLIYKIYREGFTGLDLSSSAAQSVVLMLLVIGLTVLQFRFVERKVQYQ >CP028349.1|AVV35873.1|191442_192288_-|sn-glycerol-3-phosphate-ABC-transporter-permease-UgpE MIENRRGLDLFSHIVLVLGVLTILFPLYVAFVAATLDNEAVYQVPMTLVPGTHLWENISRIWTHGVNGSGPAFGLMLLNSMIMALGITFGKITVSMLSAFALVWFRFPLRTLFFWLIFITLMLPVEVRIFPTVQVIADLNLLDSYSGLTLPLMASATATFLFRQFFMSLPDELVEAARIDGAGAMRFFIDIVLPLSKTNLAALFVITFIYGWNQYLWPLLIVNDASLGTAVAGIKSMISSSGSPTQWNEVMAAMLLTLIPPLAVVLVMQRAFVRGLVESEK >CP028349.1|AVV35872.1|190367_191441_-|sn-glycerol-3-phosphate-import-ATP-binding-protein-UgpC MAGVTLQAVTKSYDGKNQIIQPLNVTINDGEFMVMVGPSGCGKSTLLRMVAGLERVTSGDIYIDNRRVTQEEPKDRGIAMVFQNYALYPHMSVEENMAYGLKIRGMGKEQIRQRVLDAARSLELDHLLMRRPRELSGGQRQRVAMGRAIVREPAVFLFDEPLSNLDARLRVQMRLELQQLHRRLQTTSLYVTHDQVEAMTLAQRVMVMNKGVVEQIGTPVDVYERPASQFVASFIGAPAMNLLKGQISSDGSRFNLDASHALPLSDSKPKWANQPVVLGIRPEHIRQSSREQGGVPLRVDTLEMLGADNLAHGRIGDTPLVVRLAHSERPQPGSTLWLHLPSDALHFFDSTHGKRLE >CP028349.1|AVV35871.1|189627_190371_-|glycerophosphodiester-phosphodiesterase MSTQHWPYPRIVAHRGGGKLAPENTLAAIDVGAKYGHQMIEFDAKLSMDAQIFLLHDDTLDRTSNGWGVAGQLPWEKLSQLDAGSWFGNAFTGEKLARLDEVAARCRQHQMMANIEIKPTTGSDAETGRAVAQAAAILWQDQAAPLLSSFSFEALEAAMQVEPQLPRGLLSHSWDPEWQEKTSALACVSIHLNHKVLTAERVTELKAGGLKILVYTVNSPDRARELLKWGVDAICTDSIDIIGPDFH >CP028349.1|AVV35870.1|189343_189631_+|DUF2756-domain-containing-protein MKKWMIVLAALLPFASQANTLNSTNDPNKPGYNPSQQRMQSQMQSQQQQQQLKLRQDQQRQTQDMQRKMQEQRNSAQQRVITTQPGQQNQNPNQN >CP028349.1|AVV35869.1|187383_189135_-|gamma-glutamyltransferase MKMQQTVRQLSWSLVMSFTVVAGANAAPTQVPPVSYGVDSDTFHPVKAQHGMVASVDALATQVGVEILRQGGNAVDAAVAVGFALAVTHPQAGNLGGGGFMLLRTASGRATAIDFREMAPGHASRDMFLDKQGNADSKLSLTSHLASGTPGTVAGLALAAQKYGTLPLSTLLAPAIRLARDGIPVNDALADDLKTYGKEVLITHPNSKAIFYKPDGTPWQKGDRLVQKNLAHSLQLIARQGPDAFYKGEIADEIAGEMAQHGGLISKADLAAYRAVERQPISGTYRGYEVFSMPPPSSGGIHIVQILNILENFDLAKMGFGSADAMQVMAEAEKYAYADRSEYLGDPDFVKVPWQALTSKAYAKTLAQQIDVAKARPSSEIKPGKLEPYESNQTTHFSVVDKQGNAVAVTYTLNTYFGSGIVAGKSGILMNNEMDDFSAKPGTPNVYGLVGGEANAVQPAKRPLSSMSPTIVAKGGKTWLVTGSPGGSRIITTVLQMVVNSIDFGMNVAEATNAPRFHHQWLPDQLRVEKGFSPDTLRLLEAKGQHVKVLPSMGSTQSIMIGPDGMLYGASDPRSIDDLSAGY >CP028349.1|AVV35868.1|186572_187067_+|GNAT-family-N-acetyltransferase MSEIVVRHVMPEDAAALHRIYSQPDTQASTLHLPYSSLQMWQSRLATPQPHSHLLVACIDEEVVGQCALDAVARPRRRHVASLGMGVDERYRQRGVGTALMREMVSLCDNWLQVSRMELTVFVDNGPAIALYQRFGFEIEGTAKGFAIRHGELIDAHYMARVKA >CP028349.1|AVV35867.1|185529_186225_-|pirin-family-protein MIYVRKAEERGHANHGWLDSWHTFSFASYHDANFMGFSALRVINEDVIDGGQGFGTHPHKDMEILTYVLSGTVEHQDSMGNKEQIPAGEFQIMSAGTGVRHSEYNASESEPLHLYQIWIIPERTGIEPRYDQRRFPDVQGRQLVLSPDAREGSLKVYQDMTLSRWVLAAGEQDNVAIDAGRRIWIQVVKGDVTVNGNAVTTSDALAIWDESALTIEASSAAEVLLFDLPPV >CP028349.1|AVV35866.1|184431_185427_-|HTH-type-transcriptional-regulator-GntR MKKKRPVLQDVADRVGITKMTVSRYLRNPEQVSVALRDKIAVALDELGYIPNRAPDMLSNATSRAIGVLLPSLTNQVFADVLRGIEAVTDEANYQTLVAHFGYNPQKEELQLRSLLGWNIDGVILTERTHTPGTLRMLEVAGIPVIEVMDSVTPCLDMAVGFDNVEAARQMTQAILSKGHRHTVYLGARLDERTLQKQRGYEIAMREAGLTPHSVMMEDASSFSAGGDLLREAQRLYPETDSLFCTNDDLAVGAMFECQRQGLQVPTQMAIAGFHGHDISQVVTPQLATVLTPRDRMGREAAAMLLARIAGESSDDPLRDIGFEISEGGSI >CP028349.1|AVV35877.1|195041_195755_-|high-affinity-branched-chain-amino-acid-ABC-transporter-ATP-binding-protein-LivF MANAMLTIENVSAHYGKIQALHNVSLYINQGEIVTLIGANGAGKTTLLGTLCGEPRATQGTITFDGKAITDWQTARIMREAIAIVPEGRRVFSRMTVEENLAMGGFFASRPEYLTRIKRVYELFPRLEERKIQRAGTMSGGEQQMLAIGRALMSQPRLLLLDEPSLGLAPIIIQQIFDTIEQLRSEGMTIFLVEQNANQALKLADRGYVLENGHVVLEDSGAALLSNEAVRSAYLGA >CP028349.1|AVV35878.1|195756_196524_-|ABC-transporter-ATP-binding-protein MIQPLLAVEGLMMRFGGLLAVNNVALELRPQEIVSLIGPNGAGKTTVFNCLTGFYKPTGGTIKLREQHLEGLPGQKIARMGIVRTFQHVRLFREMTVIENLLVAQHQHLKSGVFSGLLKTPAFRRSESEALDRAATWLERVGLLDLANRQAGNLAYGQQRRLEIARCMVTRPEILMLDEPAAGLNPKETHELDALIAELRGEHKVSVLLIEHDMKLVMGISDRIYVVNQGTPLANGTPEEIRNNPDVIRAYLGEA >CP028349.1|AVV35879.1|196520_197792_-|high-affinity-branched-chain-amino-acid-ABC-transporter-permease-LivM MKTSFINALISSLMLLVLATFFMGLRLNLDGTNLVVQNAGSVRWDWIAAGCAVVFLFQLLRPIWQSGLKKISGPALVLPGLDGSTPKQKLVMAVLIIAAVAWPFLVSRGTVDIATMTLIYVMLGLGLNVVVGLSGLLVLGYGGFYAIGAYTFALLNHYYGLGFWQCLPLSGMVAALFGLLLGFPVLRLRGDYLAIVTLGFGEIVRILLLNNTALTGGPNGISQIPKPTLFGLEFGRTPREGGWDTFHNFFGLKYDPSDRIIFLYLVALLLVVLTLFVINRLLRMPLGRAWEALREDEIACRSLGLSPTRIKLTAFTISAAFAGFAGSLFAARQGFVSPESFTFVESAFVLAIVVLGGMGSQFAVILAAILLVVSRELMRDLNEYSMLVLGGLMVLMMIWRPQGLLPMKRPHLKLRAAKKGEQA >CP028349.1|AVV35880.1|197788_198715_-|high-affinity-branched-chain-amino-acid-ABC-transporter-permease-LivH MSEQFLYFIQQMFNGVTLGSTYALIAIGYTMVYGIIGMINFAHGEVYMIGSYVSFIVIAALMMMGIDTTWLMIAAGFVMAVIISSAYGWSIERVAYRPVRSSKRLIALISAIGMSIFLQNYVSLTQGSRDLALPSLITGQWTVGESNGFAATISTMQIVIWVVTFLAMLALTTFIRYSRMGRACRACAEDLKMASLLGINTDRVISLTFVIGAAMAAVAGVLLGQFYGSINPFIGFMAGMKAFTAAVLGGIGSIPGAMIGGLVLGIAEALTSAYLSTEYKDVVSFALLIVVLLVMPTGILGRPEVEKV >CP028349.1|AVV35881.1|198849_199962_-|branched-chain-amino-acid-ABC-transporter-substrate-binding-protein MKMKGRALLAGCVALAMSHAALAEDIKVAIVGAMSGPVAQYGDMQFAGATQAIEDINAKGGVNGNKLVAVKYDDACDPKQAVAVANKVINDGIRYVIGHLCSSSTQPASDIYEDEGVLMITPAATAPDLTSRGYKLIMRTTGLDSDQGPTAAKYVMTELKPQRIAVVHDKQQYGEGLARSVQESLKKQGANIVMFEGITAGDKDFSTLVARFKKENVDFVYFGGYYPEMGQIVRQARAAGLKTQFMGPEGVGNASLSNIAGAASEGMLVTLPKRYDQVETNKPIVDALKAKKLDPTGPFVWTTYAALQSLATGMERSKSAEPDAIVKNLKEGAAVPTVMGNLNWDEKGDLKGFEFGVFKWHADGTSTAVK >CP028349.1|AVV35882.1|200357_200753_+|aspartate-1-decarboxylase-autocleavage-activator-PanM MKLTIQRLTALTPQDRIDLGKVWPDLEMEKLEQGLSESHRLYAARFNDRLLAGLQLEISGIHGKVHRLAVRDVTRRRGVGQYLLEETIRQNGSIADWWIADDGSDDQQVRAAFMQACGFRAQSDGWIRAAD >CP028349.1|AVV35883.1|200813_201671_-|RNA-polymerase-sigma-factor-RpoH MTKEMQTLAIAPLGNLESYVRAANTWPMLSAEEEKALAERLHYQGDLEAAKTLILSHLRFVVHIARNYSGYGLPQADLVQEGNIGLMKAVRRFNPEVGVRLVSFAVHWIKAEIHEYVLRNWRIVKVATTKAQRKLFFNLRKTKQRLGWFNQDEVEMVARELGVSSKDVREMESRMAAQDMTFDMSADDESSEGRSMAPVLYLQDKTSDFADGIEEDNWDAHAADKLSYAMEGLDERSQHIIRARWLDEDNKTTLQELADQYGVSAERVRQLEKNAMKKLRMAIEA >CP028349.1|AVV35884.1|201910_202900_-|cell-division-protein-FtsX MVNKRIKRPAAPKAKQPSKSKSKALKGGWQEQWRYALRGTLSDMWRQPLATLLTVMVIAISLTLPSVCYMVWKNVSQAATQWYPAPQLTVYLSKTLDDTAAENVVAQLKQVEGVDNVNYLTREEALNEFRNWSGFGGAMDMLEQNPLPAVAIITPKLNFQNSDTMQSLRDRVSKVQGVDEVRMDDSWFARLAALTGLVGQIASMIGVLMIVAVFLVIGNSVRLSIFARRDTINVQKLIGATDGFILRPFLYGGALLGFSGAVLSLLLSEVLVLRLQSVVASVATVFGTTFSLEGFSWDEALLLLLIAAIIGWVAAWLATVQHLRRFTPQ >CP028349.1|AVV35885.1|202889_203558_-|cell-division-ATP-binding-protein-FtsE MIRFEEVSKAYLGGRQALQGVDFHLRPGEMAFLTGHSGAGKSTLLKLICGIERPSAGKIWFSGHDISRLRNSEVPFLRRQIGMIFQDHHLLMDRSVYDNVAIPLIISGASGEDIRRRVSAALDKVGLLDKAKSYPIQLSGGEQQRVGIARAVVNKPAVLLADEPTGNLDDALSEDILRLFEEFNRVGVTVLMATHDSGLIARRNYRMMTLNQGRLHGGHDGQ >CP028349.1|AVV39181.1|203563_205258_-|signal-recognition-particle-docking-protein-FtsY MAKEKKRGFFSWLGFGKEEEKQPAAEEQTPEIAQPEQQQTEQEAPALSQAEAQAEETVAITETVAGQQREAEVTAAAPHESAPAVLDEVEPIEADPEALLEVAAPEHDATPVIDADDVVTPIADAEPADAPLSEEELTALALGDAEIVDIPHSETEAPLSEEELTALALGDAEIVEPPHSEAPLNNEVLTPQVLDDAAVADTQDSEADAPLTEEELTALALGDADVTDQEPTESLSDTVSDLPLAAAPLIVQEQERPSKEGFFSRLKRSLVKTRQNLGSGFISLFRGKKIDDDLFDELEEQLLIADVGVDTTRRIITNLTQQANRKQLRDAEALYGLLKAEMASILAKVDAPLDVSGKTPFVILMVGVNGVGKTTTIGKLARQYQAEGKSVMLAAGDTFRAAAVEQLQVWGQRNNIPVIAQHTGADSASVIFDAIQAAKSRGVDVLIADTAGRLQNKSHLMEELKKITRVMKKLDDSAPHEVMLTLDASTGQNAVSQARLFNEAVGLTGIALTKLDGTAKGGVIFSVADQFGIPIRYIGVGEGIEDLRPFKAEDFIEALFARED |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP028349_3 | 2935887-2935999 | Orphan |
NA
Consensus repeat of CP028349_3
|
1 spacers
spacers of CP028349_3
>3.1|2935930|27|CP028349|CRISPRCasFinder GCAGATGCATGCGCTAAAGTTGGGGGT |
CRISPR arrays and Neighbor proteins around CP028349_3
The CRISPR arrays of CP028349_3 >merge|CP028349|3|2935887-2935999|CRISPRCasFinder GTTCGCTGCGCGAACGGGCTCTGAGAGGGGTTTCGTTCGCCTGGCAGATGCATGCGCTAAAGTTGGGGGTGTTCGCTGCGCGAACGGGCGCTGAGAGGGGTTTCGTTCGCCTG >CP028349|3|2|2935887-2935999|CRISPRCasFinder GTTCGCTGCGCGAACGGGCTCTGAGAGGGGTTTCGTTCGCCTG GCAGATGCATGCGCTAAAGTTGGGGGT GTTCGCTGCGCGAACGGGCGCTGAGAGGGGTTTCGTTCGCCTG
>CP028349.1|AVV38230.1|2931876_2935824_+|trifunctional-transcriptional-regulator/proline-dehydrogenase/L-glutamate-gamma-semialdehyde-dehydrogenase MATTTMGVKLDDATRERIKLAAQRIDRTPHWLIKQAIFNYLGQLDSGDAVPEIPLSAQVAAESEDTQPEETHQPFLDFAEQILPQSVTRSAVTAAWRRPETDAVPMMLEQARLPAALAEKTHQLAYQLADKLRHQKGATGRAGMVQSLLQEFSLSSHEGVALMCLAEALLRIPDKPTRDALIRDKISNGNWQSHLGRSPSLFVNAATWGLLFTGRLVSTHNEANLSSSLNRIIGKGGEPLIRKGVDMAMRLMGEQFVTGETIAEALANARKLEEKGFRYSYDMLGEAALTAGDAKAYLLSYQQAIHAIGKASNGRGIYEGPGISIKLSALHPRYSRAQYERVMEELYPILKSLTLLARSYDIGINIDAEEADRLELSLDMLEKLCFEPELEGWNGIGFVIQAYMKRCPFVIDELIDLAQRSRRRLMIRLVKGAYWDSEIKRAQMEGLEGYPVYTRKVYTDISYLACARKLLSVPSLIYPQFATHNAHTLAAIYQLAGNNYYPGQYEFQCLHGMGEPLYEQVVGKVADGKLNRPCRIYAPVGTHETLLAYLVRRLLENGANTSFVNRIADTSLPIDELVADPVTAVEKLGISEGAIGLPHPKIPLPRELYGENRVNSAGLDMANEHRLASLSSALLSSAAQPCLAEPMIDGEAGAGELRDILNPAAPNDRVGQVREATEQEVSVALDAAVNSGPIWFATPPQERAAILERAAQIMEGQMQQLLGILVREAGKTYNNAIAEVREAVDFLYYYAGMVRDDFDNETHRPLGPVVCISPWNFPLAIFTGQIAAALAAGNSVLAKPAEQTPLIAAQAVQILLDAGVPPGVLQLLPGQGETVGAQLTGDNRVRGVMFTGSTAVATLLQRNLAGRLDQHGRPVPLIAETGGMNAMIVDSSALTEQVVIDIVASAFDSAGQRCSALRLLCIQEDVADHTLKMLRGAMAECRMGNPERFSTDIGPVIDAEAKANIDRHIQAMRNKGFTVYQAVQDNPQDSKEWTSGTFVKPTLIELNQVSDLDKEIFGPVLHVVRFTRNNLPKLVEQINASGYGLTLGVHTRIDETIAQVTANARVGNLYVNRNMVGAVVGVQPFGGEGLSGTGPKAGGPLYLYRLLASRPDGALRLTFDRQDAERPADSTLRQSLLAPHQALSNWAKDKPELAQLCQHYGELAQAGVVRLLPGPTGERNTFSLLPRDLVLCLADNEQDALIQLAAVTSVGSKALWQDDELHRALRASLPDAVKARITLARDPLAAEFDAVIYHGDADQLRTLCEQIAARDGAIVSVQGFARGETNLLLERLLIERSLSVNTAAAGGNASLMTIG >CP028349.1|AVV38229.1|2929940_2931425_-|sodium/proline-symporter-PutP MSVSTPMLVTFVVYILGMVLIGFAAYRSTKNFDDYILGGRSLGSVVTALSAGASDMSGWLLMGLPGAIFISGISESWIAIGLTIGAWLNWKIVAGRLRVQTEHHNNALTLPDFFTSRFEDHSKILRVISAIVILVFFTIYCASGVVAGARLFESTFGMDYQTALWAGAAATILYTLVGGFLAVSWTDTVQASLMIFALILTPVFVIIAVGGWSDSLTVIEAKSLENLDMLKGLNFVAVISLLGWGLGYFGQPHILARFMAADSHRSIRTARRIGMAWMVLCLAGAVAVGFFGIAYFQNNPTLATGVNDNAERVFIELARILFNPWIAGILLSAILAAVMSTLSCQLLVCSSALTEDLYKGFLRKNASQKELVWVGRLMVLLVALIAIALASNPENRVLGLVSYAWAGFGAAFGPVVLFGVCWKRMNRNGALAGMIIGALTVLVWKQYGWLGLYEIIPGFIFASIAIVVFSLMGREPSAEAQQRFAAAEAEFQTK >CP028349.1|AVV38228.1|2928923_2929748_-|FTR1-family-protein MFVPFLIMLREGLEAALIVSLIASYLKRTQRTQWFPAMWAGVFIAAALCLGLGLFINATTGEFPQKEQELFEGIVAVIAVVILTSMVFWMRKVARNIRVELEQAVDQALQRSGRGGLALVLMVFLAVAREGLESVFFLLAAFTQDVGYAPPIGAVLGLATAVVLGMLLYWGGIRLNMAHFFRWTSVFILFVAAGLAAGAIRAFHEAGLWNRFQDVAFDLSNTLSTHSLFGTLLEGILGYQETPSVSEVTIYFVYLIPALILFFMPARPAASATV >CP028349.1|AVV38227.1|2927755_2928868_-|iron-uptake-system-protein-EfeO MTMRFRRKALLLPLLAISASATAAVPQVTVSVNDRQCEPMSLTVKAGKTQFLIKNNSQKALEWEILKGVLVVEERENIAPGFSQKLTANLEAGEYEMTCGLLSNPKGKLIVKGEGAATNSTSARMQLEGPITEYKAYVTEQVNQLVSSTQAFTDAVKAGDVEKAKALFAPTRQYYERIEPIAELFSDLDGSIDAREDDYEKKSADPKFTGFHRLEKALFADNSTKDMADYADKLNKDVKDLQVRISELAFPPAKVVGGAAGLIEEVASSKISGEEDRYSRTDLWDFQANIDGAQKIVELLRPLVSKANPQLLAKVDANFKKVDAILSKYRTQTGFESYEKLTSADRNALKGPITALAEDLSLLRGTLGLD >CP028349.1|AVV38226.1|2926475_2927753_-|deferrochelatase/peroxidase-EfeB MSRKEDDAQPARRRLLKGLGLLGGAAAIGGGCPFHTAAADSFSPGTVTPQARQQTQPFYGQHQAGITTPQQASMMLVAFDLLSNDKAELKRLFQLLTQRIAFLTAGGPAPAVTNPQLPPMDSGILGATIAPDNLTITVSVGHSLFDERFDLAPHKPKKLQPMTRFPNDSLDASQCHGDLLLQLCANTQDTVIHALRDIIKHTPDLLGVRWRREGFISDHAARSQGQETPINLLGFKDGTANPDTRNSALMNQLLWVTADQEEPVWARNGSYQAVRLIRFHVEMWDRTPLGEQQTIFGREKLSGAPLGMKHERDVPDYARDPDGDVIALDAHIRLANPRTPETASSLMLRRGYSYSAGISASGQLEMGLLFVCYQHDLERGFLTVQQRLNGEALEEYIRPFGGGYFFALPGVPDASHYLAQPLLES >CP028349.1|AVV38225.1|2925116_2925920_-|phosphate-starvation-inducible-protein-PhoH MKGATMGRQKAVIKARREARRVLRNDSRSHRQREEESVTSLVHMSGLDAIGMARDTRDRLPIEARNEAQAHYLNAIGTKQLIFATGEAGCGKTWISAAKAAEALINKDIDRIIVTRPVLQADEDLGFLPGDISEKFAPYFRPVYDILVKRLGASFMQYCLRPEIAKVEIAPFAYMRGRTFENAVVILDEAQNVTAAQMKMFLTRLGENVTVIVNGDITQCDLPAGVPSGLADALARFEEDEMVGIVRFGKEDCVRSALCQRTLHAYS >CP028349.1|AVV38224.1|2924022_2924556_+|peptide-methionine-(S)-S-oxide-reductase MAIEYAVIAGGCFWCTEAVFKDVIGVESVESGYTGGARPNPTYEQVCSGATGHAEAIRIGFDPEQVTYGDLLDISFVTHDPTQLNRQGNDIGTQYRSAIFPANAEQEAEARAAIERAQTDHDVPVVTTIEPLKEWYPAEAYHQDYWEGAGQRNGYCMAVIPPKLQKLRKSFANRVKS >CP028349.1|AVV39307.1|2923408_2923963_+|flavin-reductase-family-protein MHKYAFPVSKARKYLEPGPVLLLSSQYQDQHDIMTLGWHTVLEFSPSLVGCMIAGMNHSHELIRNSGQCVLNIPSASLINEVVAVGNSHGDSIDKFEAFGLTPEPAQVVGAPMIAECFASFECQLYDDAMVARYNLFIFEIVKAHVAEQPEYPPTLHYTGEGRFSVMSDKMLDKSGDFKPEMLI >CP028349.1|AVV38223.1|2922027_2923086_+|two-component-system-sensor-histidine-kinase-BasS MIKRFDHRSMRFRLILTIGLILLVFQVISVVWLWHESKEQIQFLVEAQLQKRNMDSHVKREVHEAVASLAVPSLVMITLTLLLCYQAVKWITRPLYQLQRELESRSEENLDPVACHSQVHEIDAVTQAINQLVARLNSSLERERLFTADVAHELRTPLAGLRLHLELIERNSDVKVQPLVQRLDQMTHSVSQLLNLARAGQSFTSGTYQNVGLIEDVILPMEAELSIMLEAHQQTLVLDLPQEQFVRGDATLLKMLLRNLVENAHRYSPDNSTISVQLLSVPQPMMVVEDEGPGIDESKSGELSKAFIRMDSRYGGIGLGLSIVSRIVQLHRFQFFLENRRDRSGCRAIIKF >CP028349.1|AVV38222.1|2921350_2922031_+|two-component-system-response-regulator-BasR MKILIVEDDALLLQGLMLALEGEGYVCDGVTRVRDAEAHFASGLYSLVVLDLGLPDEDGLHFLIRLRRQKKMTPVLILTARDTISERIAGLDAGADDYLIKPFSLDELLARIRALIRRHVNQGDSHVRVGALALDMTHRQIMLNDVLLDLTPKEFAILSRLMLKAGNPVHREILYQDIYNWETEPSTNTLEVHIHNLRDKIGKSAIRTVRGFGYALVTQGGVREAI >CP028349.1|AVV38231.1|2936922_2937999_-|acyltransferase MSQTKSDQILWVDTLKGTCILLVVLYHTVLPGFEGTMKYLTAGWIPAEIWIQFNTVLSPLRMPAFFFVSGLLATNGILNRPWKQVFTSRITNLFYLYILWGFIQWWSIIGISTEITGQRISQNLNAAYAGSLFEFLKLTFMAMSTSWYLYGLGLYFLCAKIFRQYKLALVAVAILLNYLAVEKVIPFWGPQSLAQYFLFFLLGAFWSQTMLRLSEWRRENLMPWALLAAVAGIHVIFGLDKSLFLCVLAVLFSIAACRWLNQHFSMRYLNWVGRNTLQIYVIHRIFIEFFGMSVILFAQRHHLFEQAWFSFLWACFYPVAIVGLCSLCSVAIWSLTNRGVGQSLFVFPTLMKRVPGGG >CP028349.1|AVV38232.1|2938462_2939215_-|L-cystine-ABC-transporter-ATP-binding-protein-YecC MSAIEVRKLVKSFNGQKVLHDIDLNVAAGEVVAIIGPSGSGKTTLLRSINLLEVPDSGTIRVGEITVDAALAQSKQKEQVRRLRQQVGFVFQNFNLFPHRSVMENIIEGPVIVKGEAKADAVARARTLLEKVGLHGKEESYPRRLSGGQQQRVAIARALAMRPEVILFDEPTSALDPELVGEVLSTIRALAEEKRTMVIVTHEMSFARDVADRAIFMDQGRIVEQGEAKALFSHPQQPRTRQFLDKFLSQ >CP028349.1|AVV38233.1|2939211_2939880_-|cystine-ABC-transporter-permease MQESLQLVLDSAPFLLKGALFTLQLSIGGMFFGLLLGFILALMRLSRFWPVRWLARIYVSIFRGTPLIAQLFMIYYGLPQFGIELDPIPSAMIGLSLNTAAYASESLRGAIASIERGQWEAAASIGMTRWQTLRRVILPQAARTALPPLGNSFISLVKDTSLAATIQVPELFRQAQLITSRTLEVFTMYLAASLIYWVMATVLSALQNRLEQHVNRQDSESK >CP028349.1|AVV38234.1|2939879_2940680_-|cystine-ABC-transporter-substrate-binding-protein MSFSRAGRQMVMGVMAVALIAGVNVKTFAAENLLNKIKERGTLLVGLEGTYPPFSFQDEKGKLTGFEVEFAEQLAQHMGVKASLKPTKWDGMLASLDAKRIDVVINQVTISDERKKKYDFSTPYTVSGIQALTMKANAGTITKPADLAGKKVGVGLGTNYEQWLRENVKGVDIRTYDDDPTKYQDLRSGRLNAILVDRLAALDLVKKTGDTMAVAGDAFSRQESGVAVRKGNDDLLKAIDQAIADMQKDGSLSKLSQKWFGADVTK >CP028349.1|AVV38235.1|2940793_2941780_-|D-cysteine-desulfhydrase MSLHLLHQFPRLELLGAPTPLEHLPRLSDYLGRDIFIKRDDFTPVAMGGNKLRKLEFLAADALREGADVLLTAGAIQSNHVRQTAAVAARLGLKCVALLENPIGTHAENYLSNGNRLLLDLMNAEVIMVDALHNPTEQLAEEATRLEAQGFRPYIVPVGGSNALGALGYVECAQEIAHQSEGVVDFAAVVVASGSAGTHAGLAVGLEHLLPETELVGVTVSRQVEAQLPLVERLRQSLAETLEVQATAPLTLWDDYFAPRYGEPNDEGMAAVKLLAQLEGILLDPVYTGKAMAGLLDGISQNRFRREGPLLFIHTGGAPALFAYHPSV >CP028349.1|AVV38236.1|2941983_2942487_-|flagella-biosynthesis-regulatory-protein-FliZ MGAKTKARPLSRYLKDYKHSQSNCSHCGKVLDRMALVFRGQIINKEAIARMDQMIDEQLWLKLQPELTALCRFCSDIFCNTHPNYFDIMAFKQYLFEQTEMSPSTIREYVVRLRRLDEMLKAKNFPAEKLKGNSWHQCLESDLPDAGNNNYRIALRKYDQFLGWQQA >CP028349.1|AVV38237.1|2942537_2943260_-|RNA-polymerase-sigma-factor-FliA MNDFYTAEGVMDKHSLWQRYVPLVRHEALRLQVRLPASVELDDLLQAGGIGLLNAVERYDALQGTAFTTYAVQRIRGAMLDELRSRDWAPRSVRRNAREVAGAMHRVEQSLGRSASEQEVAQQLNVSMEEYRQILLDTNNSQLFSYDEYREEHGDSAELVTEGHEEANPLHQLLEGSLRERVIEAIEALPDREKMVLTLYYQEELNLKEIGAVLDVGESRVSQLHSQAIKRLRARLAGAR >CP028349.1|AVV38238.1|2943402_2944371_-|HNH-endonuclease MRFNYDLLPGEQLTFDEIARRYALQYPDDKELTARALLSPSTGLRTLKIRAVFAREESNSPLEDLLFISHDNERKNYLRRFEHYLKQNLSFLYFRRSDNEKRDNLWKVMGSSQVFAMIDPQSATAQNLLNSRGYKLVLMHQDDEDIYWQLFNPQADPCFPQSQRIQFIVVLSTPQHKVPAAVMPGTEVGRRVKARVLQRASQAEFSAGVKARYGACVMTGTELTERHNWPWVEACHIDTQEGDDGVLADNSIDNGLFLRSDLQRLFINRLISIDAESGTIQVHPGEEARQHIAPWYQELEGRVCSLWAAVPPATRQRLRARR >CP028349.1|AVV38239.1|2944642_2944909_-|hypothetical-protein MNIFIINLASSTGRRATTAHAYVINLEAATRLLKLLYPVWMVADKGGLFEEYGAIKPLAVHPVPVLLHNLAQQTSIQHFNNITQPHQR >CP028349.1|AVV38240.1|2945315_2946602_-|flagellin-FliC MAQVINTNSLSLITQNNINKNQSALSTSMERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQAARNANDGISAAQTTEGALSEINNNLQRVRELTVQSQNGTNSDSDLTSIQDEIKSRLDEIDRVSGQTQFNGVNVLAQNGTMKIQVGANDGETISIDLKKIDSSTLGLSGFGVDKNTLKTSDAITQVGASGSLKDVDLSSVATALKVDASTLSLKNVQTSAGAATSTYVVSSGSDNYAVSVDDASGKVTLNTTDISYSDSDNGVTAGSMTGKYIKVGADSTGAAVGYVTVQGKDYKTAAGALTNTNDTTGSQNVASAIGDIASSTNTNVFTGSATADPLALLDKAIASVDTFRSSLGAVQNRLNSAVTNLNNTTTNLSSAQSRIQDADYATEVSNMSKAQIVQQAGNSVLSKANQVPQQVLSLLQG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP028349_4 | 3673332-3673449 | Orphan |
NA
Consensus repeat of CP028349_4
|
1 spacers
spacers of CP028349_4
>4.1|3673372|38|CP028349|CRISPRCasFinder ACTAAAAACGCCGGGAGCGTTTTTGAACAACGCAACGT |
CRISPR arrays and Neighbor proteins around CP028349_4
The CRISPR arrays of CP028349_4 >merge|CP028349|4|3673332-3673449|CRISPRCasFinder GTTGGCCCGGTAACGGGCGCACCTCACGGATGAGGTGCGTACTAAAAACGCCGGGAGCGTTTTTGAACAACGCAACGTGTTGGCCCGGTAACGGGCGCACCTCACGGATGAGGTGCGT >CP028349|4|3|3673332-3673449|CRISPRCasFinder GTTGGCCCGGTAACGGGCGCACCTCACGGATGAGGTGCGT ACTAAAAACGCCGGGAGCGTTTTTGAACAACGCAACGT GTTGGCCCGGTAACGGGCGCACCTCACGGATGAGGTGCGT
>CP028349.1|AVV39342.1|3672229_3673189_+|class-1b-ribonucleoside-diphosphate-reductase-subunit-beta MKLKQIQAINWNRIQDDKDLEVWNRLTSNFWLPEKVPLSNDLPGWQTLDKQQQQLTIRVFTGLTLLDTIQNVIGAPGLMEDAITPHEEAVLSNISFMEAVHARSYSSIFSTLCNTQDVDAAYQWSEENAPLQNKARIILEHYRADDPLKKKIASVFLESFLFYSGFWLPMYWSSRGKLTNTADLIRLIIRDEAVHGYYIGYKYQQALASADDLRREELQQFAYDLLLALYENEVAYTEALYGDVGWVEEVKTFLHYNANKALMNLGYEALFPAELTDVNPAILSALSPNADENHDFFSGSGSSYVMGKAVETEDEDWNF >CP028349.1|AVV38832.1|3670082_3672221_+|ribonucleotide-diphosphate-reductase-subunit-alpha MATTDTLNGVTLDYHALNAMLNLYDADGRIQFDKDREATRQFYLQHVQPNTVSFASNAERLRYLVAESYYEADVLNQYDFDFLLALHEVAENWGFEFKTFLGAWKYYTSYTLKTFDGKRYLETFEQRACMVALTLAQGDETLARALLEEILSGRFQPATPTFLNCGKQQRGELVSCFLLRIEDNMESIGRAVNSALQLSKRGGGVAFLLSNLRESGAPIKRIENQSSGVIPVMKMLEDAFSYANQLGARQGAGAVWLNAHHPDIFRFLDTKRENADEKIRIKTLSLGVVIPDITFQLAKENKPMALFSPYDVERIYGQAFGDISISEKYDEMLADDRIRKTYIQPRDFFQTLAEIQFESGYPYIMYEDSVNRSNPIAGRINMSNLCSEILQVNSASEYNDDLSYRRVGKDISCNLGSLNIAHAMDSRDLARTVDVAVRALTSVSMMSEIGSVPSIAEGNRRSHAIGLGQMNLHGYLAREGIAYGSPEALDFTNLYFYCITYYALRTSNQLAQEHQQTFDGFADSDYASGVYFDKYLQRAWLPRTERVAQLFADAGLTLPTQADWQALRDAVMQHGLFNQNLQAIPPTGSISYINHATSSIHPIVSPIEIRKEGKTGRVYYPAPFMTNENLHLYQDAYQIGPEAIIDTYAEATQHVDQGLSLTLFFRDDVTTRDINKAQIYAWKKGIKTLYYIRLRQMALQGTEVQGCVSCSL >CP028349.1|AVV38831.1|3669699_3670113_+|class-Ib-ribonucleoside-diphosphate-reductase-assembly-flavoprotein-NrdI MFPLVYFSSQSENTHRFVTRLSLPARRIPLDAREALHIDQPYILVVPSYGGGSSRGAVPRQVIQFLNDEANRRGIRGVIAAGNRNFGAGYCLAGSIIAKKCQVPCLYRFELMGTPDDIAKVKAGVTQFWQQQIPSTA >CP028349.1|AVV38830.1|3669452_3669695_+|glutaredoxin-like-protein-NrdH MRIIIYTKDNCVQCTATKNAMDRQGLAYQLINLDSQPDAIDNLKTLGYRQVPVVMADNDHWSGFRPDKIATLRQLAAVGG >CP028349.1|AVV38829.1|3668837_3669155_+|DUF883-domain-containing-protein MFNRTVKNDDIDINQDVNELADSLEALLKSYGSDAKDEVDSARSQAEKLLKQTRSKLNGGGNRVSQVARDAGAQVDTYVHDKPWHGVGVGAAFGIVVGILLASRR >CP028349.1|AVV38828.1|3668347_3668683_-|hypothetical-protein MYLRPDEVARVLEKAGFEVDEITPRAYGYRRGENYVYVNREARMGRMALVIHPTLKEKSQPFAEPASEMKTCDHYTQFPLYLAGDSQEHYGIPHGFSSRMALERYIQSVFG >CP028349.1|AVV38827.1|3666801_3668211_-|amino-acid-permease MSEQPTPSQEKFKRTMKVRHLVMLSLGGVIGTGLFFNTGYIISTTGAAGTLLAYLIGALVVWLVMVCLGELSVAMPETGAFHVYAARYLSPATGYTVAWLYWLTWTVALGSSLTAAGFCMQYWFPQIPVWIWCLVFCCAIYLLNVISTRFFAEGEFWFSLIKVVTIVAFIVLGGAAMFGFIPMKDGTPAPGLSNLTAHGWLPNGVLPILMTMVAVNFAFSGTELIGIAAGETQEPEKVLPLAIRTTVARLIIFFIGTVFVLAALIPMDQAGIVKSPFVLVFEKIGIPYAADIINFVILTAILSAANSGLYASGRMLWSLANENTLPRAFASVNKRGVPLLAITVSMIGGLLALVSSVVAPDTVFVALSAISGFAVVAVWLSICASHFVFRRRHLQQGKSLAELGYRAPLYPLTPVLGFLLCLLACLGLAFDPTQRIALWCGIPFVLFCYAAYHLTHKSSPKPEEAKDVA >CP028349.1|AVV38826.1|3665876_3666812_-|homocysteine-S-methyltransferase MSHNPVAQALTESPLLILDGALATELEARGCHLADALWSAKVLMENPELIYQVHYDYFVAGARCAITASYQATPQGFATRGLDEAQSLALIAQSVALARRARHDYLAVRPDAKTLLVAGSVGPYGAFLADGSEYRGDYALPEAEMMAFHRPRVQALLAAGADLLACETLPSFAEAQALVKLLAEFPESRAWFTFTLRDAGHISDGTPLSEVVSWLNQQPQVVAIGINCVALESVTPALHQLQRLTDKPLVVYPNSGEQYDADSKTWHSAPSGCTLHDKLDEWQQAGAKLIGGCCRTSPNDIAAIARACQPQ >CP028349.1|AVV38825.1|3665371_3665791_-|hypothetical-protein MKQRILALAAALLLSGCATIVGTETQMVRIDSLPQGARFTVQDERGYAVAQGFTPQTVELAKSTGRYFGKKHYLLMLESPGYVPVTVPIEARANLWYLLGNIPLGGFPGWLLVDPFYGGMYDLKPEHPRPFMNPVGARG >CP028349.1|AVV38824.1|3664825_3665371_+|spermidine-N1-acetyltransferase MTVKLRPLEREDLHFVHQLDNNASVMRYWFEEPYEAFVELSDLYNKHIHDQTERRFVVEHSGQKAGLVELVEINHVHRRAEFQIIIDPAHQGKGLATQAAKLAMDYGFSVLNLYKLYLIVDQENEKAIHIYTKLGFEIEGVLKHEFFINGEYRNTIRMCIFQHQYLERHKTPGGMVKPTAQ >CP028349.1|AVV39343.1|3673664_3674852_+|MFS-transporter MNATRQGLTPALVMLMSVATGLCVASNYYVQPLLNTIAQQFDLSVSLAGFIVTTAQLGYACGLLLLVPLGDRFERRSLIVTMILLAATGMVIIALSHTFLFLLLGTVMTGLFSVAAQILVPLAATLAEPERRGKIVGTVMSGLLLGILLARTVAGGLAQLGGWRTVYWTASLLMVIMALALWRSLPRVKQSVPMSYPQLLASIFRLYAGNRVIRTRAWTGCLSFANFSLLWTSMAFLLSGAPYHFSEGKIGLLGLVGAAGALAARQAGSLADKGKAKLTTRLGLLLMLLSWAAIAWGAEQLVPLIVGILLLDLAVQAVHITNQSVIYAQMPEARNRLNAGYMTSYFIGGAAGSLLSATAYHLAGWYGVCIAGALLTLVNLLIWAGGSRFEPEKIN >CP028349.1|AVV38833.1|3675155_3675686_+|transcriptional-repressor-MprA MESSFTPIEQMLNIRANRHKDFPLQEIILTRLCMHMQGKLLDNRNKMLKAQGINETLFMALITLDAQENHSIQPSELSSALGSSRTNATRIADELEKRGWIERRESDHDRRCLHLHLTDKGMEFLRQLLPPQHQSLQYLWSSLSVDEKSQLEGITRKLLNRLDKMDEDQLIASLSR >CP028349.1|AVV39344.1|3675836_3677009_+|multidrug-export-protein-EmrA MSATAEAQSPQPSASKKKKRKSVLIVLALIFVLIGIAWGVYWFLVLRHFQETDDAYVAGNQVQVMAQVSGSVNKVWFEDTDFVKKGDVLVSLDKTDAEQAFEKAQTALATSVRQTHQLMINGKQYQASITLQQTALAQAEADLKRREPLGAANLIGREELQHARDAVATAKAQLDVAIQQYNANQAMILNTSLENQPAVQQSAAELRDAWLALQRTEIRSPMDGFVSRRSVQVGSQISTSTPLLAVVPATNLWVDANFKETQLAGVRIGQPATVVADIYGDEVVYQGKVVGLDMGTGSAFSLLPAQNATGNWIKVVQRLPVRIELNQDDIARHPLRIGLSTLVKIDTTSKEGSALATSARQQAAYSSNALAIDLAPVNQMITDIVRANAG >CP028349.1|AVV38834.1|3677025_3678561_+|multidrug-efflux-MFS-transporter-subunit-EmrB MAQKPLEGMPLVLMTIALSLATFMQVLDSTIANVAIPTIAGNLGASNSQGTWVITSFGVANAISIPLTGWLAKRIGEVKLFTWSTILFAIASWACGMSDSLEMLIFFRVIQGIVAGPLIPLSQSLLLNNYPPAKRSVALSLWAMTVIVAPICGPILGGWISDNYHWGWIFFINVPIGVAVVILTLQTLRGRETKTEIRPIDMVGLVLLVVGIGCLQVMLDRGKELDWFNSSEIIVLGVVAVIALSVLLVWELTDDHPIIDLSLFKSRNFTIGCLSISLAYMLYFGSIVLLPQLLQEVYGYTATWAGLASAPVGILPVILSPIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAYTFEPGMDFGASAWPQFIQGFAVACFFMPLTTITLSGLPPERMAAASSLSNFMRTFAGSVGTSITTTMWTNRESMHHSYLTESITPYNVNSQQMYSQLESMGMTQQQASAYIAQQITNQGLIISANEIFWASAGVFLILLVLIWFARPPFGAGGGGGGAH >CP028349.1|AVV38835.1|3678726_3679911_-|tRNA/rRNA-methyltransferase MNDEYKGKSGKVKVMYVRGEDDKSARAPNPRTGKGGHSAARQDDNRRSSGPRTERTQEGGRSAGRRGDDTGRSSSPRRDEGGRGAPRRSDDGAGRRNDRGDDSRGAYGSSDSPWRTVSRAPSADRASSPESDKPDHGGISGKSFVDPEQIRRQRNEETRVYGENACQALFQSRPEAIVRAWFVQEVTPRFREALRWLAANRKAYHVVEDAEITKASGTEHHGGVCFLIKKRMGMAVSEWLSQANTKDCVLALEDVSNPHNLGAIMRSCAHFGVKGLLVNDASLLESGAAVRTAEGGAEHVQAISGATFSEGLEAFRKAGYTIVTTSSHQGTPLAQAQLPEKMVLVLGQEREGLSDTAFKQGDMSLSIGGTGNVESLNVSVATGVLLAEWWRQNA >CP028349.1|AVV38836.1|3680070_3680490_+|thioredoxin-TrxC MNTVCASCQATNRVPDERLADVAKCGRCGNELFDGEVVNATSANFDKYLQDDLPVVIDFWAPWCGPCVNFAPVFKDVASERSGKVRFIKVNTEAEPALSARFNIRSIPTIMLFKNGERVDMLNGAMPKAPFDEWLDESL >CP028349.1|AVV38837.1|3680571_3681249_+|DTW-domain-containing-protein MSDNAVLRLRTQRLARATRPFLARGNRVIRCQGCLLPEANCLCSQIVPQSARSRFCLVMFDTEPMKPSNTGRLIADILPDTQAFGWSRTEPDPLLLAAVRNSDYQPLVVFPASYADPGREVLTTPPTSGKPPLFIMLDGTWTEARKMFRKSPWLDALPVMSLDVSTPSRYTLREAHGEGQHCTAEVAAALLAQAGDRRAAEALSHHFDRFRTAYLAGKPHHAGQA >CP028349.1|AVV38838.1|3681388_3684046_+|protein-acetyltransferase MSQRGLEALLRPKSIAVIGASVTPGRAGYFMMRNLLAGGFSGPVLPVTPKYKAVSGVLAWPTIDSLPFAPDLAVICTHSKRNLELLQQLGEKGCKACIILSAPASQSAELKACASQWQIRLLGPNSLGLLAPWQGLNASFSPVPIEKGRIAFISQSAAVSNTILDWAQQRNLGFSWFIALGDSLDTDVDDLLDFLARDGKTSAILLYLEHLSDARRFVSASRSASRNKPILVIKSGRSREAQALLGTHSGLDAAWDAAIQRAGLLRVQDTHELFSAVESLSHMRPLRGDRLMIISNGAAPAALALDELYARNGKLAQLSDETRRQLEAILPAGAGRGNPLDLKDDATAERYAACVEILLDSHELDALMIIHAPSAVAPATETAAHLIDTVARHPRGKLVTLLTNWSGEFSSQAARRAFTQAGIPTWRTPEGTVTAFMHQVEYRRNQKQLRETPALPATLNQDSAHAHQLLSKALARGITSLDTHEVQPVLQAYGLTTLPTWIASDSAAAVAIADQIGYPVALKLRSPDIAHKSEVQGVMLYLRNASEVEHAAEAIFDRVKQTLPQARIEGLLVQSMASRAGAQELRVVVEQDALFGPIIMLGEGGVEWQADKQAAVALPPLNMTLARYLVIQAIKSGKIRSRSALNPLDIPGLSQLLVQVSDLVVDCPEIQRLDIHPLLAAGNDFTLLDVTLTLAPFHGDNETRLAIRPYPQHLEEQVELKDGQFCLFRPILPEDEPLLRDFISQVTKEDLYYRYFSEINEFTHDDLANMTQIDYDREMAIVAVRQHQGRPEIIGVTRAISDADNIDAEFSVLVRSDLKGLGMGRRLLEKMIRYTRHHGLQQLNGITMPHNRGMITLARKLNFHVDIQLDDGIVSLKLSLQNEIS >CP028349.1|AVV38839.1|3684163_3685519_+|CDP-diacylglycerol--serine-O-phosphatidyltransferase MLSKFKRNKHQQHLAELPKLSQSVADMSTLYSPAEFRQTLLEKIAAAQTRICIVALYLENDDAGRAVMDALYHAKQARPELDISILVDWHRAQRGRIGAARGVTNADWYCEMATRYADIAIPVYGIPVNTREALGVLHLKGFIIDDTLLYSGASINDVYLHQHDKYRYDRYQVIRNSQLTDTMYNWITLNLKPAEAVNRLDFAERPSSPEIKNETRQFRQDLRGFNYQFEGNAGNDELAVTPLVGLGKHSLLNKTIFHLMPCAEQKLIICTPYFNLPALLVRNIINLLRQGKQVEIIVGDKTANDFYIAPDQPFKIIGALPYLYEINLRRFLSRLHYYVTTGQLVVRLWKDGENSYHLKGMWVDDEWQLITGNNLNPRAWRLDLENAVLIHDPHQQLAAQREKELTLIRQHTTVVTHFRELESIADYPAKVRKLIRRLRRIRIDRLISRIL >CP028349.1|AVV38840.1|3685850_3687140_-|MFS-transporter MTTTNNKIQPDDTRKRIWAIVGASSGNLVEWFDFYVYSFFSLYFAHIFFPQGDTTTQLLQTAGVFAAGFLMRPIGGWLFGYIGDKHGRKNSMLVSVCMMCFGSLVIACLPGYATIGVAAPVILLLARMFQGLSVGGEYGTSATYMSEVALEGRKGFYASFQYVTLIGGQLAAVLTVVLLQFLLTDAELRNWGWRIPFFLGALLAVVALWLRRSLEETSDKTSREHRDAGSVIGLLRNHTRPFLMVLGFTAGGSLSFYTFTTYMQKYLVNTSGMDPKTASGLMTGALLVFMLIQPIIGALSDKIGRRTSMMIFGAGAAICTVPVLTLLQNVQSPGVAFLLIMLALLITSFYTAISGILKAEMFPPQVRALGVGLSYAVANAIFGGSAEYVALLMKKQGIETTFFWYVSAMGAVAFLVSLLLHKRGKGIKL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP028349_5 | 3841467-3841587 | Orphan |
NA
Consensus repeat of CP028349_5
|
1 spacers
spacers of CP028349_5
>5.1|3841510|35|CP028349|CRISPRCasFinder CTAAAAACGCCGGGAGCGTTTTTGAACAGCGCAAC |
CRISPR arrays and Neighbor proteins around CP028349_5
The CRISPR arrays of CP028349_5 >merge|CP028349|5|3841467-3841587|CRISPRCasFinder GCGTTGGCACGTTCACGGGCGCACCTCAGGGATGAGGTGCGTACTAAAAACGCCGGGAGCGTTTTTGAACAGCGCAACGCGTTGGCCCGTTCACGGGCGCACCTCAGGGATGAGGTGCGTA >CP028349|5|4|3841467-3841587|CRISPRCasFinder GCGTTGGCACGTTCACGGGCGCACCTCAGGGATGAGGTGCGTA CTAAAAACGCCGGGAGCGTTTTTGAACAGCGCAAC GCGTTGGCCCGTTCACGGGCGCACCTCAGGGATGAGGTGCGTA
>CP028349.1|AVV39351.1|3840717_3841311_+|5-formyltetrahydrofolate-cyclo-ligase MSSPSMLQRQEIRQQVRHLRRAMTDEQQAQAAEQLAELALNYAPLSAARHIALFLSVDGELNTRPLIARLWHLKKAVYLPVLHPFSPGNLLFQRYSPDTPLIPNKLRIPEPLLDIRQLITLDQLDLMLVPLVAFDQHGQRLGMGGGFYDRTLQNWRQHGFLPVGLAHDCQQVDSLPVAEWDVPLPAVMTPSKLWQWE >CP028349.1|AVV38965.1|3840071_3840401_+|cell-division-protein-ZapA MSAQPVDLQIFGRSLRVNCPPEQQDALNSAAEDLNQRLQDLKVRTRVTNTEQLVFIAALNVCHELAQEKVKTRDYAANMEQRIRMLQQTIEQALVEQGRISEREGAKFE >CP028349.1|AVV38964.1|3839322_3839901_-|YecA-family-protein MSLQNATPDYNALAAVLSQQGVGMTPAEMHGLLSGILCGGNQDTSWKTLVHDLANEGMAFSHTLAVPLAALHEHTATTLEDDGFLFQLMLPADDDITVFDRADALAGWVNHFLLGLGVTQPKLDKVTGETGEAIDDLRTIAQLGYEEDEDQEELEQSLEEVIEYVRVAALLCHDTFTRPQPTAPEVQKPTLH >CP028349.1|AVV38963.1|3837953_3839276_-|Xaa-Pro-aminopeptidase MISLERYQQRRQALLAKMAPGSAALIFAAPEVTRSNDTEYPFRQSSDFSYFTGFNEPQALLVLIKSDENHNHSVLFNRVRDPAAEVWSGRRLGQEAAPEKLGVDRALPWNDIGEQLHLLLNGLDVIYHAQGEYAHADTLVFSALDKLRRGFRQNLSAPATVTDWRPWVHEMRLFKDAEEIALLRRAGEISALAHTRAMRICQPGMFEYQLEGEIHHEFTRHGARYPSYNTIVGAGENGCILHYTENESEMRDGDLVLIDAGCEFYGYAGDITRTFPVNGKFSPAQRAIYDIVLASLNRSLEMFRPGVSIREVNDEVVRIMITGLVELGILEGDVDTLIAEESHRQFYMHGLGHWLGLDVHDVGHYGTPSRDRILEPGMVLTVEPGLYIGPDADVPAQYRGIGIRIEDDIVITEEGIENLTDSVVKEADEIEALMAAAKQA >CP028349.1|AVV38962.1|3836778_3837957_-|2-octaprenyl-6-methoxyphenyl-hydroxylase MTILIAGGGMTGATLALAISHLTQGTLPVTLIESSEPGSRAHPGFDGRAIALSAGTCQQLADINLWHRIASCATPITDIHVSDRGHAGFVSLAAADYAIPALGQVVELFDVGQRLFAELKKAPGVTLRCPARVTHAQRSEQQVEVTLDSGEQLSGKLLVAADGTRSALAASCGIQWQRDDYQQLAAIANVTTALPHQGRAFERFTEHGPLALLPMSGNRLSLVWCHPLSQRERIEQWSEAEFLSQLQRAFGWRLGKFTHTGQREYYPLALHRAISPVTHRVAVVGNAAQTLHPIAGQGFNLGLRDVMSLAETLAAAHRQQQDPGSYAVLNHYQQRRQPDRAATIGITDGLVRVFANRYGPMVAGRNLGLLAMDHLPWLRNQLAERTLGWVKR >CP028349.1|AVV38961.1|3835562_3836765_-|FAD-dependent-2-octaprenylphenol-hydroxylase MQTFDVVIAGGGMVGLAVACGLQGSGLRIAVLEKSPEPPLALSASPSIRVSAINAASERLLQKLDVWSTILSLRCRAYHGMEVWDKDSFGTIAFDDEQQGLSHLGHIIENPVIHNALWQRASACSDVTLMAPSQLQQVAFGDNEAFITLQDGSMLSARLMIAADGANSWLRNKADIPLTFWDYDHHALVANVRTEKPHDAVARQVFHGDGILAFLPMQDPHLSSIVWSLSPQEASRLETMPEALFNQQLSVAFDMRLGLCQLESERKSFPLVARYARNFAAHRLALVGDAAHTIHPLAGQGVNLGFMDAAELIGEIRRLHQQGKDIGQHLYLRRYERSRKHSAAMMLAGMQGFREMFAGNNPAKKLLRDVGLKLADRLPGVKPMMLKQAMGLNDLPAWLR >CP028349.1|AVV38960.1|3834032_3835130_-|glycine-cleavage-system-aminomethyltransferase-GcvT MTQQTPLFEQHQACGARMVDFHGWMMPLHYGSQMDEHHVVRSDAGMFDVSHMTIVDLTGPRTREFLRYLLANDVAKLTQPGKALYTGMLNASGGVIDDLIVYFMSETFFRLVVNSATREKDLAWITQHAEGYGITLTERDDLALIAVQGPQAQQKAQTLFSEEQRLAVAGMKPFFGVQSGDLFIATTGYTGEAGYEIAMPAEEAANFWQRLLAAGVKPAGLGARDTLRLEAGMNLYGQEMDEGVSPLAANMGWTVCWEPADRDFIGREALELQRERGTEKLVGLILTEKGVLRNGQPVRFTDDQGQLQEGIITSGSFSPTLGYSIALARVPASIGSTAIVEIRNRQMPVQVTRPVFVRAGKPVAQ >CP028349.1|AVV38959.1|3833622_3834009_-|glycine-cleavage-system-protein-GcvH MSNVPNTLKYRDSHEWVRKEADGSYTVGITEHAQELLGDMVFVDLPEVGATFAAGEECAVAESVKAASDIYAPISGEIVAVNEALTDSPEQVNSEPYDGGWLFKIKASDASEIDMLLDADAYKASIDE >CP028349.1|AVV38958.1|3830668_3833542_-|glycine-dehydrogenase-(aminomethyl-transferring) MTQTLSQLEHNGAFIERHIGPSPEQQAQMLDAIGARSLDALISTIVPADIQLPGPPAVGEAATEQQALAELKAIASQNLRYKSWIGMGYSAVITPPVILRNMLENPGWYTAYTPYQPEVSQGRLEALLNFQQLTLDLTGMDIASASLLDEATAAAEAMAMAKRVSKLKNANKFFIADDIHPQTLDVVRTRAETFGFELIIDSADKASDHDDLFGVLLQQVGTTGEAHDYSALIAGLKARKVVVSVAADFMSLVLLEAPGKQGADIVFGSAQRFGVPMGYGGPHAAFFASRDEHKRSMPGRIIGVSRDAAGNTALRMAMQTREQHIRREKANSNICTSQVLLANIAGLYAVFHGPAGLKRIASRIHRFTNILAAGLQQGGLKLRHQHWFDTLTVEVADKAAVLNRALSFGVNLRSDIHNSVGITLDETTSRDDILALFAILLGDEHGQDLEKLDSEVASESHAIPAGLQRHSEILTHPVFNRHHSETEMMRYMHSLEKKDLALNQAMIPLGSCTMKLNAAAEMIPITWPEFAELHPFCPAEQATGYLQMIGQLSQWLVQLTGYDALCMQPNSGAQGEYAGLLAIRRYHESRGEGDRHLCLIPSSAHGTNPASAQMAGMDVVVVACDKQGNIDLGDLREKAAQAGDKLSCIMVTYPSTHGVYEETIREVCQIVHQYGGQVYLDGANMNAQVGITTPGYIGADVSHLNLHKTFCIPHGGGGPGMGPIGVKAHLAPFVPGHSVVQIDGVLTQQGAVSAAPFGSASILPISWMYIRMMGAEGLKQASSVAILNANYIASRLQSAYPVLYTGREGRVAHECILDIRPLKEQTGISELDIAKRLIDYGFHAPTMSFPVAGTLMVEPTESESKIELDRFIDAMLAIRMEIDRVTAGEWPLDDNPLVNAPHTQLEIVSEWSHPYSRELAVFPAGSHNKYWPTVKRLDDVFGDRNLFCSCVPMSDYQ >CP028349.1|AVV38957.1|3829858_3830602_-|NAD(P)-dependent-oxidoreductase MKLALVTGGSRGIGRATSLLLAARGYRVAVNYRQREAEAQQLVAQIQQQGGEAFAVQADISDEAQVMAMFAQLDQQSVPLALLVNNAGILFQQCRTEQLDAARLHKVFATNVIGTFLCCREAVKRMGTHHGGQGGAIVNVSSAASRTGAPGEYVDYAASKGAMDTLTKGLSLEVAQQGIRVNGVRPGFIYTEMHADGGEPGRVDRLASIIPMGRGGEAEEVAEAIVWLASDAASYVTGSIIDAAGGR >CP028349.1|AVV38966.1|3841740_3842979_-|phosphoglycerate-dehydrogenase MAKVSLEKDKIKFLLVEGVHQSALENLRAAGYTNIEFHKGALDSDALKASIRDAHFIGIRSRSQLTEEIFAAAEKLVAVGCFCIGTNQVDLQAAASRGIPVFNAPFSNTRSVAELVIGEMLLMLRGIPEANAKAHRGIWNKIAKGSFEARGKKLGIIGYGHIGMQLGVLAESLGMHVFFYDIENKLPLGNATQVRHLADLLNMSDVVSLHVPETASTQDMIGAEQLAQMKPGALLINASRGTVVDIPALCDALASKHVGGAAIDVFPTEPATNSEPFISPLSEFDNVILTPHIGGSTEEAQENIGIEVAGKLAKYSDNGSTLSAVNFPEVSLPIHGISASRLLHIHENRPGVLTAINQIFAEQGINIAAQYLQTSPFMGYVVIDIDAEPEVAENALQLMKAIPGTIRARLLY >CP028349.1|AVV38967.1|3843258_3843918_-|ribose-5-phosphate-isomerase-RpiA MTQDELKKAVGWAALDYVQPGTVVGVGTGSTAAHFIDALATVRHQIEGAVSSSDASTQKLKSLGIQVFDLNEIDLLSVYVDGADEINPQMQMIKGGGAALTREKIVAAVAETFICIADASKEVDVLGRFPLPVEVIPMARSYVARQLVKMGGLPEYRQNVVTDNGNIILDVHNLRILDPIELEKTINALPGVVTVGLFAARGADIALIGGPDGVKTIKK >CP028349.1|AVV38968.1|3844100_3845006_+|transcriptional-regulator-ArgP MKRPDYRTLQALDSVIRERGFERAAQKLCITQSAVSQRIKQLENLFGQPLLVRTVPPRPTEQGQKLLALLHQVELLEEEWLGDENSGTTPLLLSLAVNADSLATWLLPALKDVLADSPVRLNLQVEDETRTQERLRRGEVVGAVSIQPQPLPSCLVDQLGALDYLFVASRDFADRFFPNGVTRSALLKAPAVAFDHLDDMHQAFLQQNFDLSPGSVPCHIVNSSEAFVQMARQGSTCCMIPHLQIERELASGELIDLTPGLCQRRMLYWHRFAPESRLMRRVTDALIAHGHRVLRQDDVAA >CP028349.1|AVV38969.1|3844989_3845247_+|hypothetical-protein MTWQPENKNGAKAPFLLLTNRDNQESYNNRKVKREGNRVRRGKRPAAWDKNAGSVFEQRNALARSRAHLRDEVRNCAGRAARDWP >CP028349.1|AVV38970.1|3845350_3846073_-|oxidative-stress-defense-protein MKLKALALAAIMSAGALPGVHADELPNGPHVVTSGKATVDARPDIATLSIVVNVSSKDAADAKKQADSRVAQYFDFLQKNGIEKKDIDAANLSTQPEYDYTKEGKSVLKGYRAVRQVQVTLRQLDKLNDLLDGALKSGLNEVRSVELGVANPESYKEQARKAAIKNATQQASQLAEGFNAKLGSIYSIRYQVANYQPMPMNRMYKAAAAADTSAQETYDQQSINFDDQVDVVFELKPNTP >CP028349.1|AVV38971.1|3846217_3846841_-|arginine-exporter-ArgO MISLYFQGLALGAALILPLGPQNAFVMNQGVKRQYHLMTAALCSLSDILLICGGIFGGSALLSQSPLLLMVITWAGVAFLLWYGWGALRSAFRGEADLTDGEPLKQSRGRIIATLLAVTWLNPHVYLDTFVVLGSLGGQLPTTTARQWFALGTISASILWFFGLALLAAWLSPRLRTAKAQRIINLLVGAVMWFIALQLARQGIAGF >CP028349.1|AVV38972.1|3846974_3847850_-|mechanosensitive-ion-channel-protein-MscS MEDLHVVDGLNNAGNWLIRNQALIISYAVNIVAAIAIIIIGMVIARIISNAVNRLLRARHIDATVADFLSALVRYGVIAFTIIAALGRIGVQTASVIAVLGAAGLAVGLALQGSLSNLAAGVLLVTFRPFRTGEFVDLGGVMGTVLHVQIFSTTLRTGDGKIVVVPNGKIISGNIVNFSREPVRRNEFIIGVAYEADVDEVIALLQQVVEADSRVLKEKGIQIGLNELAASSMNFVVRCWSNSGDLQDVYWDLLKNFKRALDGKGIGIPYPQMDVHLHQKSAPEAVETQVQ >CP028349.1|AVV38973.1|3848040_3849120_-|class-II-fructose-bisphosphate-aldolase MSKIFDFVKPGVVTGDDVQKIFKVAKENKFALPAVNCVGTDSINAALEAAAKVKAPIIIQFSNGGAAFIAGKGFKTDKPQGAAIFGAIAGAHHVHLMAEQYGVPVILHTDHCAKKLLPWIDGLLDAGEEYFQKNGKPLFSSHMIDLSEESLEENIEISSKYLARMAKLDMTLEIELGCTGGEEDGVDNSHMDASALYTQPEDVNYAYEKLHAISPRFTIAASFGNVHGVYKPGNVKLTPTILRDSQKYVCEKHGLPHNALDFVFHGGSGSSAAEIEESISYGVIKMNIDTDTQWATWDGILQYYKKNEGYLQSQLGNPEGDDKPNKKYYDPRVWLRSAQASMVVRLEQAFKELNAVDVL >CP028349.1|AVV38974.1|3849225_3850389_-|phosphoglycerate-kinase MSVIKMTDLDLAGKRVLIRADLNVPVKEGKVTSDARIRASLPTIEAALKQGAKVMVTSHLGRPTEGEYNEEFSLLPVVNYLKEKLSGTNVTLAKDYLDGVEVGAGELVVLENVRFNKGEKKDDEALSKKYAALCDVFVMDAFGTAHRAQASTHGVGKFAPIACAGPLLSAELEALGKVMSNPERPLVAVVGGSKVSTKFDVLQSLVKIADTVIVGGGIANTFVAIDNNVGKSLYEPDFVEAAKGLRDQHGIPVPTDSRVGTEFSETAPATVKKVSEVADNEEIMDFGDETAQAMAAILKDAKTILWNGPVGVFEFPNFRKGTEIVANAIADSDAFSVAGGGDTLAAIDLFGIEDKISYISTGGGAFLEFVEGKKLPAVAMLEERAKQ >CP028349.1|AVV38975.1|3850440_3851460_-|erythrose-4-phosphate-dehydrogenase MTVRVAINGFGRIGRNVLRALYETGRRAEITVVAINELAEATGMAHLLKYDTSHGRFAFDVRQERDVMWVGGDTLRILHRADIADLPWRELDVDVVLDCTGVYGSRADGEAHLQAGAKKVLFSHPGGNDLDATVVYGVNQQALTASDRIVSNASCTTNCIIPVIKLLDDAFGIESGTVTTIHSAMHDQQVIDAYHSDLRRTRAASQSIIPVDTRLAAGITRIFPKFNDRFEAIAVRVPTINVTAIDLSVSVHNAVTACDVNTLLRSAAEGAFRGIVDYTELPLVSVDFNHDPHSAIVDGTQTRVSGQHLIKTLVWCDNEWGFANRMLDTTLAMAASGFR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CP028349_2 | 2.2|580481|21|CP028349|PILER-CR | 580481-580501 | 21 | CP028349.1 | 580540-580560 | 1 | 0.952 |
1. spacer 2.2|580481|21|CP028349|PILER-CR matches to position: 580540-580560, mismatch: 1, identity: 0.952
tggatcctcacaatctgctaa CRISPR spacer tggaacctcacaatctgctaa Protospacer **** ****************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP028349_5 | 5.1|3841510|35|CP028349|CRISPRCasFinder | 3841510-3841544 | 35 | NZ_CP028350 | Pantoea vagans strain PV989 plasmid pPV989-508, complete sequence | 450661-450695 | 3 | 0.914 |
CP028349_5 | 5.1|3841510|35|CP028349|CRISPRCasFinder | 3841510-3841544 | 35 | NZ_CP016890 | Pantoea agglomerans strain C410P1 plasmid unnamed1, complete sequence | 160197-160231 | 3 | 0.914 |
CP028349_5 | 5.1|3841510|35|CP028349|CRISPRCasFinder | 3841510-3841544 | 35 | NZ_CP034470 | Pantoea agglomerans strain CFSAN047153 plasmid pCFSAN047153_1, complete sequence | 267240-267274 | 3 | 0.914 |
CP028349_5 | 5.1|3841510|35|CP028349|CRISPRCasFinder | 3841510-3841544 | 35 | NZ_CP034475 | Pantoea agglomerans strain CFSAN047154 plasmid pCFSAN047154_1, complete sequence | 539503-539537 | 3 | 0.914 |
CP028349_5 | 5.1|3841510|35|CP028349|CRISPRCasFinder | 3841510-3841544 | 35 | NC_014258 | Pantoea vagans C9-1 plasmid pPag3, complete sequence | 116058-116092 | 3 | 0.914 |
CP028349_4 | 4.1|3673372|38|CP028349|CRISPRCasFinder | 3673372-3673409 | 38 | NZ_CP016890 | Pantoea agglomerans strain C410P1 plasmid unnamed1, complete sequence | 160195-160232 | 4 | 0.895 |
CP028349_4 | 4.1|3673372|38|CP028349|CRISPRCasFinder | 3673372-3673409 | 38 | NZ_CP034470 | Pantoea agglomerans strain CFSAN047153 plasmid pCFSAN047153_1, complete sequence | 267238-267275 | 4 | 0.895 |
CP028349_4 | 4.1|3673372|38|CP028349|CRISPRCasFinder | 3673372-3673409 | 38 | NZ_CP034475 | Pantoea agglomerans strain CFSAN047154 plasmid pCFSAN047154_1, complete sequence | 539502-539539 | 4 | 0.895 |
CP028349_4 | 4.1|3673372|38|CP028349|CRISPRCasFinder | 3673372-3673409 | 38 | NZ_CP028350 | Pantoea vagans strain PV989 plasmid pPV989-508, complete sequence | 450660-450697 | 4 | 0.895 |
CP028349_4 | 4.1|3673372|38|CP028349|CRISPRCasFinder | 3673372-3673409 | 38 | NC_014258 | Pantoea vagans C9-1 plasmid pPag3, complete sequence | 116056-116093 | 4 | 0.895 |
CP028349_5 | 5.1|3841510|35|CP028349|CRISPRCasFinder | 3841510-3841544 | 35 | NZ_CP016890 | Pantoea agglomerans strain C410P1 plasmid unnamed1, complete sequence | 15149-15183 | 4 | 0.886 |
CP028349_3 | 3.1|2935930|27|CP028349|CRISPRCasFinder | 2935930-2935956 | 27 | NC_048742 | Streptomyces phage Gilson, complete genome | 68293-68319 | 5 | 0.815 |
CP028349_4 | 4.1|3673372|38|CP028349|CRISPRCasFinder | 3673372-3673409 | 38 | NZ_CP016890 | Pantoea agglomerans strain C410P1 plasmid unnamed1, complete sequence | 15147-15184 | 5 | 0.868 |
CP028349_4 | 4.1|3673372|38|CP028349|CRISPRCasFinder | 3673372-3673409 | 38 | NZ_CP045721 | Pantoea eucalypti strain LMG 24197 plasmid unnamed1, complete sequence | 290748-290785 | 6 | 0.842 |
CP028349_4 | 4.1|3673372|38|CP028349|CRISPRCasFinder | 3673372-3673409 | 38 | NZ_CP022517 | Pantoea vagans strain FBS135 plasmid pPant1, complete sequence | 116369-116406 | 6 | 0.842 |
1. spacer 5.1|3841510|35|CP028349|CRISPRCasFinder matches to NZ_CP028350 (Pantoea vagans strain PV989 plasmid pPV989-508, complete sequence) position: , mismatch: 3, identity: 0.914
ctaaaaacgccgggagcgtttttgaacagcgcaac CRISPR spacer ataaaaacgccgggagcgtttttgaacaacgcaaa Protospacer ***************************.*****
2. spacer 5.1|3841510|35|CP028349|CRISPRCasFinder matches to NZ_CP016890 (Pantoea agglomerans strain C410P1 plasmid unnamed1, complete sequence) position: , mismatch: 3, identity: 0.914
ctaaaaacgccgggagcgtttttgaacagcgcaac CRISPR spacer acaaaaacgccgggagcgtttttgaacaacgcaac Protospacer .**************************.******
3. spacer 5.1|3841510|35|CP028349|CRISPRCasFinder matches to NZ_CP034470 (Pantoea agglomerans strain CFSAN047153 plasmid pCFSAN047153_1, complete sequence) position: , mismatch: 3, identity: 0.914
ctaaaaacgccgggagcgtttttgaacagcgcaac CRISPR spacer acaaaaacgccgggagcgtttttgaacaacgcaac Protospacer .**************************.******
4. spacer 5.1|3841510|35|CP028349|CRISPRCasFinder matches to NZ_CP034475 (Pantoea agglomerans strain CFSAN047154 plasmid pCFSAN047154_1, complete sequence) position: , mismatch: 3, identity: 0.914
ctaaaaacgccgggagcgtttttgaacagcgcaac CRISPR spacer acaaaaacgccgggagcgtttttgaacaacgcaac Protospacer .**************************.******
5. spacer 5.1|3841510|35|CP028349|CRISPRCasFinder matches to NC_014258 (Pantoea vagans C9-1 plasmid pPag3, complete sequence) position: , mismatch: 3, identity: 0.914
ctaaaaacgccgggagcgtttttgaacagcgcaac CRISPR spacer ataaaaacgccgggagcgtttttgaacaacgcaaa Protospacer ***************************.*****
6. spacer 4.1|3673372|38|CP028349|CRISPRCasFinder matches to NZ_CP016890 (Pantoea agglomerans strain C410P1 plasmid unnamed1, complete sequence) position: , mismatch: 4, identity: 0.895
actaaaaacgccgggagcgtttttgaacaacgcaacgt CRISPR spacer gacaaaaacgccgggagcgtttttgaacaacgcaacgc Protospacer . .**********************************.
7. spacer 4.1|3673372|38|CP028349|CRISPRCasFinder matches to NZ_CP034470 (Pantoea agglomerans strain CFSAN047153 plasmid pCFSAN047153_1, complete sequence) position: , mismatch: 4, identity: 0.895
actaaaaacgccgggagcgtttttgaacaacgcaacgt CRISPR spacer gacaaaaacgccgggagcgtttttgaacaacgcaacgc Protospacer . .**********************************.
8. spacer 4.1|3673372|38|CP028349|CRISPRCasFinder matches to NZ_CP034475 (Pantoea agglomerans strain CFSAN047154 plasmid pCFSAN047154_1, complete sequence) position: , mismatch: 4, identity: 0.895
actaaaaacgccgggagcgtttttgaacaacgcaacgt CRISPR spacer gacaaaaacgccgggagcgtttttgaacaacgcaacgc Protospacer . .**********************************.
9. spacer 4.1|3673372|38|CP028349|CRISPRCasFinder matches to NZ_CP028350 (Pantoea vagans strain PV989 plasmid pPV989-508, complete sequence) position: , mismatch: 4, identity: 0.895
actaaaaacgccgggagcgtttttgaacaacgcaacgt CRISPR spacer gataaaaacgccgggagcgtttttgaacaacgcaaagc Protospacer . ********************************* *.
10. spacer 4.1|3673372|38|CP028349|CRISPRCasFinder matches to NC_014258 (Pantoea vagans C9-1 plasmid pPag3, complete sequence) position: , mismatch: 4, identity: 0.895
actaaaaacgccgggagcgtttttgaacaacgcaacgt CRISPR spacer gataaaaacgccgggagcgtttttgaacaacgcaaagc Protospacer . ********************************* *.
11. spacer 5.1|3841510|35|CP028349|CRISPRCasFinder matches to NZ_CP016890 (Pantoea agglomerans strain C410P1 plasmid unnamed1, complete sequence) position: , mismatch: 4, identity: 0.886
ctaaaaacgccgggagcgtttttgaacagcgcaac CRISPR spacer acaaaaacgccgggagcatttttgaacaacgcaac Protospacer .***************.**********.******
12. spacer 3.1|2935930|27|CP028349|CRISPRCasFinder matches to NC_048742 (Streptomyces phage Gilson, complete genome) position: , mismatch: 5, identity: 0.815
gcagatgcatgcgctaaagttgggggt CRISPR spacer gcagatgcgtgcgctaaagttcaaggg Protospacer ********.************ ..**
13. spacer 4.1|3673372|38|CP028349|CRISPRCasFinder matches to NZ_CP016890 (Pantoea agglomerans strain C410P1 plasmid unnamed1, complete sequence) position: , mismatch: 5, identity: 0.868
actaaaaacgccgggagcgtttttgaacaacgcaacgt CRISPR spacer gacaaaaacgccgggagcatttttgaacaacgcaacgc Protospacer . .***************.******************.
14. spacer 4.1|3673372|38|CP028349|CRISPRCasFinder matches to NZ_CP045721 (Pantoea eucalypti strain LMG 24197 plasmid unnamed1, complete sequence) position: , mismatch: 6, identity: 0.842
actaaaaacgccgggagcgtttttgaacaacgcaacgt CRISPR spacer gagaataacgccgggagcgtttctgaacaacgcaacgc Protospacer . ** ****************.**************.
15. spacer 4.1|3673372|38|CP028349|CRISPRCasFinder matches to NZ_CP022517 (Pantoea vagans strain FBS135 plasmid pPant1, complete sequence) position: , mismatch: 6, identity: 0.842
actaaaaacgccgggagcgtttttgaacaacgcaacgt CRISPR spacer gagaataacgccgggagcgtttctgaacaacgcaacgc Protospacer . ** ****************.**************.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1371972 : 1381643
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP028349|1371972:1381643|DBSCAN-SWA ATCAGAACTGGTAAGTCACACCCATTGCCACAATGTCGTCGTTGTTGATCGACAGCGCGTTGTCATCATCCAGCTGGTTGATTTTGTAATCCACGAAGGCGGACATGTTTTTGTTGAAGTAGTAGGTTGCACCCACATCGATATATTTCACCAGATCTGCATCACCGATACCGTTTTCAATATCTTTGCCTTTGGTCTGCACATAGCCAATAGAAGGACGCAGGCCGAAATCAAACTGATACTGCGCCACCAGTTCGATGTTCTGCGCTTTGTTGGCTGCGCCCGCGACGGCAGCGCCGGTATTACGGTTGGTGCCGCTGATTGGCGTCATGTTGCGGGTATCGGTGTAAATCGCCGCCAGGTAAACGCTGTTGGCATCATACTTCACCCCTGCGCCCCAGGAATCTGCGCGGTCACCGCCGCCATAGGCGCGCGCCTGCTGCTGGTTGGTGCGGTCAGATGAGGTGATGGCACTCATCAGGCTCACGCCGCTATCACCCAGCGCATAGCCCAGTGAAGCACCGTAGCCGTCGCCGTTCTGGCGGTTAATGTCAGTACGGCTGTTGGCGCTGGTGGCGTCGTTCTTGCCCTGATACTGCAGCGCAAATTTCAGGCCATCGACCAGACCAAAGAAGTTGTTGTTGCGGTAAGTGGCGACGCCGGTGGCGCGCTGGGTCATAAAGTTATCGGAACGGGCCGTGCCGTCCCCACCGAATTCCGGGAACACGTCGGTCCAGGCTTCAACGTCATACAGCACGCCATAGTTACGGCCATAATCCACCGAACCATACTGCGAGAATTTCAGGCCAGCAAAACCCAGGCGGGTCTTGTTGCCGTTATTGGCGTCGCTGCCTTCAGAGTTATTAACGTTGTACTGATATTCCCACTGACCGTAACCGGTCATCAGGTCATTAATCTGCGTCTGGCCTTTAAAGCCGATACGGGTATAAGACTTGTCACCGTCTGCGCTGGCATTATCGCTGATGTAGTGCTCAGCTTTAACTTTGCCGTATAAATCCAGTTTGTTGCCGTCTTTGTTATAGATTTCGGCAGCGTGCGCGGCAGAAGCCAGCGTCAGCGTGGAGACGATCATGGCCAGTGTGCTTTTATTCATTTGCTATCAATCCCAATGTGAACGCCGTAAAAAAACGAAAGTTTTTGTAAAACACCGACTTTTTACCCGTCCGTGATGAACTTTTTATGACACTTTTCCGGATTTCTTGTTAATGCTGAACTTTTGGTCTACCGCCTTGTCCGCGTAAATTCGTGCTTTTCCCCCGCTGGCTGTTGGCTTCCGTATCGACTCCTGTTAAAACGTCCTTTCGTAAAAATTTTTCACTACGGCAGGAAATATGAGTGGCAGTCAGACCTTAGTGGTCAAGTTGGGTACCAGCGTTCTTACCGGCGGCTCTCGCCGGTTAAATCGGGCGCACATGGTGGAACTGGTTCGTCAGTGTGCGCAGCAGCATGCCGCAGGCCACCGCATTGTGATCGTCACTTCTGGCGCGATGGCCGCCGGACGCGAACACCTGGGCTACCCCGAACTGCCACCCACCATCGCCTCAAAGCAGCTGCTGGCTGCGGTGGGGCAGAGCCGTCTGATTCAGCTGTGGGAACAGTTATTCTCCATCTATGGCATCCATATTGGTCAGATGCTGCTGACCCGTGCCGACATGGAAGATCGCGAACGTTTCCTGAATGCCCGCGATACGCTGCGCGCGTTGCTGGATAACCACATTGTGCCGGTGATCAACGAGAACGATGCGGTTGCTACCGCTGAAATCAAAGTGGGCGACAACGACAATCTTTCGGCGCTGGCGGCGATGCTCGGCGGTGCCGACAAGCTGCTGCTGCTGACCGATCAGCAGGGTCTGTTTACCGCCGATCCGCGCAACAATCCTGACGCTGAGCTGATCAGCGACGTCCACGTGATCGATGATGCGTTACGCGCGCTCGCGGGTGACAGCGTATCCGGTCTCGGCACCGGCGGTATGTCGACCAAGCTTCAGGCGGCGGATGTTGCCTGCCGCGCCGGAATAGACGTGATTATCGCCTCGGGCAGTCGCAGCGGTGTGATCAGCGATGTGATTGACGGTAAGCCGGTTGGTACGCGATTCCATGCACAGCAATCGCCGCTGGAGAACCGTAAACGCTGGATTTTCGGTGCGCCACCGGCGGGCGAACTCACCGTCGATGACGGGGCACTGGCGGCCATCATTGAACGCGGCAGTTCGCTGCTGCCCAAGGGCATTCGCGCTATCCAGGGGAATTTCTCGCGCGGTGAAGTGATCCGCATCCGCAGCCTGCAGGGCCGGGATATTGCTCACGGTGTTTCTCGCTACAACAGTGACGCGATGCGCATGATTGCCGGCCATCACAGCCAGCAGATCAGCGAAATTCTGGGATATGAATATGGCCCGGTGGCCGTTCACCGCGACGACATGATCGTCAGTTAAGGAGCGGACAATGCTTGAAGAGATGGGAAAAGCGGCGCGTGCCGCAGCGTATACCGTGGCCGATCTGTCGACGGCTGAGAAGAATCAGGTTCTGATGACCATTGCTGACCGCCTTGAGGCGGAGAGCGCAGACATTCTGGCTGCTAACGAGCTGGATTTAGCCGATGCGCGTCAGAACGGCATGAGTGCCGCGCTGTTAGACCGGCTGACACTCAACCCGCAACGCCTCGCCAGCATCGCCAGCGATGTGCGTCAGGTCTGTCAGCTGGCCGATCCGGTGGGACAGTTAATCGACGGCGGTCAGTTCGACAGCGGACTGCGTATTGAACGCCGTCGCGTGCCGCTGGGCGTTGTTGCAGTCATTTATGAGGCGCGTCCTAACGTCACGGTTGATGTCGCCAGCCTGTGTCTGAAAACCGGCAACGCCGCCATTCTGCGTGGCGGCAAAGAGACCTGGCGCACCAATGCGGCCACCGTCCGGGTGATCCAGAACGCGCTGAAACAGCACGGTCTGCCGACGGCTATCGTGCAGGCGATTGAAAATCCGGATCGCGAGCTGGTGAATCAGCTGCTGAAGCTGGATCGCTACGTCGATATGCTCATCCCGCGTGGCGGAGCCGGGCTGCACAAGCTGTGCCGGGAAAACTCCACCATCCCGGTGATTACCGGCGGCATCGGCGTCTGTCACATCTATATTGATGAGACGATGGAGATTGAACCGGCGCTTGATCTGATCGTCAACGCCAAAAAACAGCGTCCCAGCGCCTGTAACTCGCTGGAAACGCTGCTGGTGGATCAGGCGATGGCCGAACGTTTCCTGCCCGCTTTCAGCGCGCGTATGGCGCAGGAAGGCATCGCGCTGCATGCTGACAGCAACGTGCTGGCGCAGTTGCAGCAGGGCCCGGCTTCGGTCATCCAGGTTAATCCTCAGCAGTACAATGATGAGTGGCTGTCGCTGGATCTTAACGTCAAATTAGTGGCGGATATGGATGAGGCGATCGATCACATCCGTACGCACGGCACGCAGCACTCGGATGCCATCCTGACCCGTGACACCCGCAACGCCGCGCGTTTTGTTCGTCAGGTCGACTCCTCAGCGGTCTATGTTAACGCCAGCACCCGCTTCACCGATGGCGGGCAGTTTGGCCTGGGTGCGGAAGTGGCGGTCAGTACCCAGAAGCTGCATGCACGCGGCCCAATGGGATTAGAGGCGCTGACCACCTATAAATGGATTGCCTGGGGCGACGACACGCTCCGGTCATAGCTACGATAATTTTTAACAAAACGGGAGTCTTTCAGAGACTCCCGTTTTTTTTAGCCCGGCTTTCCGGTCCAGTGAAAGATAGAGTTTTCTGCCCATCTGATCAATTTTCATTCCGATCCTGTCACCGTTGCCCGTTATGCTGGCTGAATTTCACGCAATTTCTTACGGAACATTCCTGAAAGGCGGGGCTGTATGCGGCCTGAGGCAATATTTTTTGTTCAGCTGCAGGCCGAAGCGGATAATGATAATTAGAAATGCTTGATAAAGCGGTTAGGTAGTAAACGAGCGGTAATATAAAATGAACTGGTAGCAATAATGAAAATAACATCCAGTATCTCTTCTATGGCTGATTATTAACTAATGGTTAAAAAAGAAAGTACAAGACAAGCTTTCACCACCAGGCTGATAGCGGCCTGTGAGAGTGCCGGCATTGTCGGACATGGACGAAACAAACAGGTTGCCAGAGCCCTGCAGCAGCAGGGCTGTAAGATTTCAACACCAGGAGTGTGGAAGTGGTTTAACGCGCAGTCAGTGCCTGATGGCGGCAATTTACTGGCGCTCAGCCAGCTGCTGGGAGTCAGGGTAGAGTGGCTGCAATACGGTATGGAACAACCCGCAGACCCGGTAACGCTGAGTACGTTTACCGGCCATAAGAATGTGTTTCGGGTGGATAGTCTTGATATCGGGCAGCGAAGCGCTGCGGGTCTGCCCGCGCGTGATGAGTTTGTGGAGACCATCCAGGCGATCGAATATGGTCTGGATGAGGCGCGGGTGCTATTCAACGGGCGGCCGGCGGAGAACATCCGGTTAATTGCCATCAATAGCGACTCTATGGCTGACACCTTTGCACCGCGCGATCAGCTGTTTGTGGATATCAGCGTGCGGATGTTCGACGGTGATGGCATCTATATTTTTACCCTCGATGAGCAACTCTATCTCAAGCGTCTGCAGCTGCAGCATAAGAAAATTGCTGTAATTTCAGATAATAAACGCTATGAAACCTGGTATCTCAATCACGACGACGTTACCAGCCTGAAGGTCCTGGCAAAAGTGATCATGAGTCAGGCGCGTCACTATCAGATCCTCGGCTAAGGCCGCTTTTGCCGCCTTTCTCAGACAGGCTGCAAAAAAATAATCGGCATAAATTCATAAATTTAACCAGCAGTAAATAAAAAATTAACCAACAGTGTTTGATAAAGCATAAGAACCTGTTAATAATTTGGATATGGATGAACGAAGCAGTTCTTTCACGGATGAGCCGCTCTTAAACAATTGAGGCGCTGCACAAGCGCGAATCATACCACCCAAAAGTCAGTGAACTTTGGGATGGGGTGGATGCGCAAAATGCAGCGTCGTCAGATGCCGTTTATTTCCCCTCGCTTACTTTCCACCTGGCTGCTAACGCAGCCGGGCGGCGAGCGTGCGAGCGCGATACACGTCATTCGGGCCCGTTCGGATCACCGAAAGCTCGCGCCTGCCACCCCATCGCCAAAGACTCATTCACAGGAGGAGTTATGGCGATAATTCACTATGGCAAAAGCGTTTTTGTGGGCAACGCACGTACTCGTCGCCATCGGCGGCGCCAACCTCTCAGCAGCCTGACTCAGAGCATCGACCAGGCCCTGAACTTCCCAACTGAACCCCCCGTCCTGAGACGTGCAGAGCAAATCTGCCAGCGTCAGGTCGCCCCGCGCGTTGATCGCGCAATCACTGCACCCCGCGTCACCAGTCAGGAAACACCCAGTTTCGACAATTGCTGTCTTCCGCATGTTCACCTCTACGCCGTGAGCTGAAGCGCGAAGAAAGCGTGCATTTCCGTTCCGGCTAACTCTTCGTTATCTCCAGCTCTCACTGGACGCAATTTGCTGAATTACTGTTCAGTTTTGCTGCAGGGTCAGTCAGGCCTCTTGCCGCGCCTGAACGGGTAATGATCAGCAGGCGTTATGTATTTCTTTTCACAATGGTGAATTGCTTTAACCCGTCATTAAGGAGTCGTTATGAGCCAGATAATTGCAGTACTGAATTTTGAAGAGGGATATGTCGACACGCCTTATCTCGACACGCTGGGCTTTCCCACGGTCGCGGGCGGTATCCGCATCGGGCCTAAGGGGGCATCGCTGAGTAATTACACTTTCCGCGTGCCACGGCGCGTCGGGGATGTCTGGAAGCAGTGCATTCTTGAAAACAAAGTTCAGGAGATGCAGGGCAGGGATCTGCTGCGTAACGCGCTGGCAAAGTGCAATGACGCCCGCACGGATGTGTTGCTGAGCATGACCTATCAGCTGGGCGTCGAAGGCGTTATGCAGTTCAAAAACATGCTCACCGCCATCGCTGCTGAACAATTCAATGAGGCAGCTGACGCGATGATGAACAGCCTCTGGGCCCGCCAGACACCGGGCCGCGCCCGGCGTCATGCGGAGATGATGCGCAGCGGAACCTATGCGGTTTACAGGGGGCTGCTATGAGATGGCTCATGTTTCTCGTCGCCGTACTGACGGCGATCATCATCCTGCTGCTGTTGCAGCGCTTCACCACCCTGCAATTTGTCTCTCATGCTCGCCTGCTCTTCAAAACCTGGTCCGTCTGGCTCGCGTCGCTGGGATCCATGCTCAGCGCCTGGGCACAGTCATTCCCTTCTGCCGCCGTTGATGCCTGGAACGTGCTGCCGGAGGACGTGAAATCGATTCTGCCACACAACTACCTGGGGTTTGTGGGGGCGTTCATGGTGGCGATGGGCGTGATTGCACAATTTGTTCGTCAGAAAAATCTGCTTAACGCGAAAGAGCAAACGGAAGGAGCGAAACCATGACGCTGATTACCACTTTACTGGGCGGCAGCTGGCACTGGCTTGCTGCACTGGCCGGCATCATTGCGGCACTGGGCGCCAGTTATTTCGGCGGCAGGAAAATCGGCAAAGTGCAGCAAAAAGCGCGATCTGACGTCGCAAACGCGCAACAGGAGGCGTCCCGCGTCAGCGCAGTGGCGCAACAACAACAACATAACCGCGAGGAGGCTAACCGTGTGGCGACTGACAACCATAGCCTTGATGACGCTGCTGCTCGCGACAAGCTGCAGCAGTCGAAATACCACAAACCCTGAGCCGGTGGCGATCACCGACTCTGCCTGCGTACTTTTTTCCCCTCTTTATACCTACGGCGATGACGCGCAAAAAATGGACGTCAGGACAGTGCGCGCCATCAACACCCACAACGACATGTGGGACACCCTTTGCGGCCATCCGGCTGCGGCTAAGTAGTGCGCTGATGGTCAGCGCGATATCTCAATGATCCTGAGGAGGATTCTATGATCGAAGTTAACTCTTTTGCTGAACTGCGCACCACTGTTCCACCAAAATCAGGTGAAGTCGCGAGCCTGAAACGCTATTACGACAAAGACTCCAGCTTCCGTGGCGGCGCAGACTTTGTCGGTTTCCTCACCACCACGCCGCTGAAAGATGATGGCGGTACAGTGGCGGTGGGTAACGGTTTCTACTGGAAGCGCACCATCAACGATCCCGCCGAAGTGAATATCCTGCACTTTGGCGCGAAAGGGGATGGCGTCACCGATGACACTGATGCCTTTAAGCGCATGCTGGCATGGACGCAGAGCTACAACACCTATGCGAAAGCAATTCCGGTACGTTTTCCGGGTGGTCGCTTCCTGATTTCACCTATCGATATCAGCGATACAGAGTTGCCGTTCTTTGGTCTGGCAGGTGACGATATCGAACTCGGCTCGTCACCGCGCACCACTATTATTTCTGACAAAAGTGCTACTCCGGTGTTTAAGGTTAATGCGCGTAAAATCGCCATCAAAGGCATTTGCTGGCACGGACAGGCTAATGCCGGGACTGTCGACACCACACTCAAAGTCACCGTCACGCCTGAACAGTGCAGCAACGTTCAGCCGTTCTTCGAAAACACGATTGTCGGCGGCCAGATCGTCAATATCTTCTGCTTCAAGGCGCAAAGTACCGGGGGAACCGTCTTTAAGCTGCAGGATACGCTCGACAGTAAGTTCGATCAGATCTACACCTCCACGACCTATTCACGCGTGTTTGATGTCGGCTGGTCAAACACACCGAAGGGGAACTGGGACCACTCAACCGCCATTGAGCTCTGTAATGCCAACTTCCAGTCGGGCTACGGCGACGCCACGCTCTACATGCCGCGCGTTACGCAGGGCGTCATGCGCAACGTCTGGATTGAGCACACGACTAACCCGGGGGATCTGTCGGACGGCGGCTGGAACATTGAGACGCTCAACATTGAAGATTGCGCCACGCCGCTCAATCTCAACAATGCCCGCGTGGTAATGCGTCATATTAACCTGCAGGCGGGGGCGAAAATTACTAATGATCTGGCCGGGAGTAAGTGGTTATCGACCTTTGAATATGGCTATCGTCGGGATGAGAACTACGGCACGTTCATGACCGGTTCGCTGCGTGCTGGCTATTACAGCGGCTATAAAGTGGTGAATAACACTGCGACTGACAACTGGTATCGGCTGGGGCAGCTCTATTTCCCGGCGCCTAACCAGCAATGGGTGATGGAACTGATCAGTAAAGCGGATGCCACCACGCCGTCCGGCACCGCCGGTTCACCGGTCAATATGCCTGCCACTGGTAAAACCCTGATCAACCTGCAGCGACTGGAGACGGTCTGGGCCGATGCTTACCACATGGGACAGCCTTCCGTGCTGGACATTCGTTATGGCCGGGTAGGGACCACCTATGCCGTCATCTGGGTCAAGCTGAAAGCCAACAGCGGCGAGACCATGTTCAACCTGAAGACCACCGGTCCGACCCGTTTCGACACCGGTTCCTGCTCGCTGTTCCAGTCGGATATGTCGGTGGTGACGGACATCACCAATATCAGCAACCTGAAACCTGCGGCACGCTTCGGCATGCACAATGGACTGGCAGGTATTGGTGCTAACGAGAAGGGCGTGGTGACGCTCGCGACCGCTGCGGGTACACCGACCAATAAAACCGCGCCAACCGGTTTTGTGCTGATCAATATCAACGGCGTTGACCGCAAGGTTCCTTACTACGATTGATGATCGGCATGTTTTGGGCGAGGCTCCGGCCTCGCCCTTTTTTAACCGGTAACAAAAGTCAGGTGAAAATATCATTGTCATATGGGGAAATATGAATGCTTTTCAAACAAGTGCACTTGACGTAATAACGGCTCTTTTTTACCGTAGCACCCGTACTTCTCACCCTTGCCGGTGTAGCTCAGTTGGCAGAGCAGCGCATTCGTAATGCGAAGGTCGCAGGTTCGACTCCTGTTTCCGGCACCACTCCGCTCTTTTTCAGCCTCACGCCGGAATAAATTTCTGCAGGCCATTCTTAAGCCTTCATTCTGATTAAAGAACCCACACTGAGAATTTACCGTCGCCAGCCGCTATCTGCTGGTGCGGCAGGGTTTGTCTGCCATAAACAATGCGCGCTCTGGCCAGGTTGTCAGTCCGGAACGCTGAACCGTTCACTCATCGCCGCTAAGTTCAGGCCACGCTATTTGCGTTTAATTTGAATAACGGCTTGCGTCGCCTGGTGAAGTGTGGCCTAATGAATCGTCGGTTTGATTAACAGACACTATCAAAGTAAGTAACCAAAGCAGTTCTCATCTCTAAGTTTGTCGCGAGACATCCCTTCTTTTTGAGTCTTCTCTTACGTTCTGATTAAAATCTGTTCTCAGGCCATTATGCCGGAAGGCTCGATATATTAAGGATTTAAGCATGTCTAACAAAATTCGCGGTACCGTTAAATGGTTTAACGCAGAGAAAGGCTTCGGTTTCATCTCTCCTGCAGACGGCAGCAAAGATGTATTCGTACATTTCTCTGCAATCCAGGGCACCGATTTCCGTTCATTAGACGAAGGTCAGCAAGTTGAATTCACAGTAGAAAATGGCGCTAAAGGCCCAGCTGCTGCGAACGTTGTTGGTCTCTAA
Protein sequences of DBSCAN-SWA_1 >CP028349|1371972:1381643|1378496_1378790_+|AVV36838.1|DBSCAN-SWA MTLITTLLGGSWHWLAALAGIIAALGASYFGGRKIGKVQQKARSDVANAQQEASRVSAVAQQQQHNREEANRVATDNHSLDDAAARDKLQQSKYHKP >CP028349|1371972:1381643|1376051_1376783_+|AVV36834.1|DBSCAN-SWA MVKKESTRQAFTTRLIAACESAGIVGHGRNKQVARALQQQGCKISTPGVWKWFNAQSVPDGGNLLALSQLLGVRVEWLQYGMEQPADPVTLSTFTGHKNVFRVDSLDIGQRSAAGLPARDEFVETIQAIEYGLDEARVLFNGRPAENIRLIAINSDSMADTFAPRDQLFVDISVRMFDGDGIYIFTLDEQLYLKRLQLQHKKIAVISDNKRYETWYLNHDDVTSLKVLAKVIMSQARHYQILG >CP028349|1371972:1381643|1378152_1378500_+|AVV36837.1|DBSCAN-SWA MRWLMFLVAVLTAIIILLLLQRFTTLQFVSHARLLFKTWSVWLASLGSMLSAWAQSFPSAAVDAWNVLPEDVKSILPHNYLGFVGAFMVAMGVIAQFVRQKNLLNAKEQTEGAKP >CP028349|1371972:1381643|1377688_1378156_+|AVV36836.1|DBSCAN-SWA MSQIIAVLNFEEGYVDTPYLDTLGFPTVAGGIRIGPKGASLSNYTFRVPRRVGDVWKQCILENKVQEMQGRDLLRNALAKCNDARTDVLLSMTYQLGVEGVMQFKNMLTAIAAEQFNEAADAMMNSLWARQTPGRARRHAEMMRSGTYAVYRGLL >CP028349|1371972:1381643|1373323_1374427_+|AVV36832.1|DBSCAN-SWA MSGSQTLVVKLGTSVLTGGSRRLNRAHMVELVRQCAQQHAAGHRIVIVTSGAMAAGREHLGYPELPPTIASKQLLAAVGQSRLIQLWEQLFSIYGIHIGQMLLTRADMEDRERFLNARDTLRALLDNHIVPVINENDAVATAEIKVGDNDNLSALAAMLGGADKLLLLTDQQGLFTADPRNNPDAELISDVHVIDDALRALAGDSVSGLGTGGMSTKLQAADVACRAGIDVIIASGSRSGVISDVIDGKPVGTRFHAQQSPLENRKRWIFGAPPAGELTVDDGALAAIIERGSSLLPKGIRAIQGNFSRGEVIRIRSLQGRDIAHGVSRYNSDAMRMIAGHHSQQISEILGYEYGPVAVHRDDMIVS >CP028349|1371972:1381643|1381430_1381643_+|AVV36840.1|DBSCAN-SWA MSNKIRGTVKWFNAEKGFGFISPADGSKDVFVHFSAIQGTDFRSLDEGQQVEFTVENGAKGPAAANVVGL >CP028349|1371972:1381643|1371972_1373085_-|AVV36831.1|DBSCAN-SWA MNKSTLAMIVSTLTLASAAHAAEIYNKDGNKLDLYGKVKAEHYISDNASADGDKSYTRIGFKGQTQINDLMTGYGQWEYQYNVNNSEGSDANNGNKTRLGFAGLKFSQYGSVDYGRNYGVLYDVEAWTDVFPEFGGDGTARSDNFMTQRATGVATYRNNNFFGLVDGLKFALQYQGKNDATSANSRTDINRQNGDGYGASLGYALGDSGVSLMSAITSSDRTNQQQARAYGGGDRADSWGAGVKYDANSVYLAAIYTDTRNMTPISGTNRNTGAAVAGAANKAQNIELVAQYQFDFGLRPSIGYVQTKGKDIENGIGDADLVKYIDVGATYYFNKNMSAFVDYKINQLDDDNALSINNDDIVAMGVTYQF >CP028349|1371972:1381643|1374437_1375691_+|AVV36833.1|DBSCAN-SWA MLEEMGKAARAAAYTVADLSTAEKNQVLMTIADRLEAESADILAANELDLADARQNGMSAALLDRLTLNPQRLASIASDVRQVCQLADPVGQLIDGGQFDSGLRIERRRVPLGVVAVIYEARPNVTVDVASLCLKTGNAAILRGGKETWRTNAATVRVIQNALKQHGLPTAIVQAIENPDRELVNQLLKLDRYVDMLIPRGGAGLHKLCRENSTIPVITGGIGVCHIYIDETMEIEPALDLIVNAKKQRPSACNSLETLLVDQAMAERFLPAFSARMAQEGIALHADSNVLAQLQQGPASVIQVNPQQYNDEWLSLDLNVKLVADMDEAIDHIRTHGTQHSDAILTRDTRNAARFVRQVDSSAVYVNASTRFTDGGQFGLGAEVAVSTQKLHARGPMGLEALTTYKWIAWGDDTLRS >CP028349|1371972:1381643|1378994_1380749_+|AVV36839.1|DBSCAN-SWA MIEVNSFAELRTTVPPKSGEVASLKRYYDKDSSFRGGADFVGFLTTTPLKDDGGTVAVGNGFYWKRTINDPAEVNILHFGAKGDGVTDDTDAFKRMLAWTQSYNTYAKAIPVRFPGGRFLISPIDISDTELPFFGLAGDDIELGSSPRTTIISDKSATPVFKVNARKIAIKGICWHGQANAGTVDTTLKVTVTPEQCSNVQPFFENTIVGGQIVNIFCFKAQSTGGTVFKLQDTLDSKFDQIYTSTTYSRVFDVGWSNTPKGNWDHSTAIELCNANFQSGYGDATLYMPRVTQGVMRNVWIEHTTNPGDLSDGGWNIETLNIEDCATPLNLNNARVVMRHINLQAGAKITNDLAGSKWLSTFEYGYRRDENYGTFMTGSLRAGYYSGYKVVNNTATDNWYRLGQLYFPAPNQQWVMELISKADATTPSGTAGSPVNMPATGKTLINLQRLETVWADAYHMGQPSVLDIRYGRVGTTYAVIWVKLKANSGETMFNLKTTGPTRFDTGSCSLFQSDMSVVTDITNISNLKPAARFGMHNGLAGIGANEKGVVTLATAAGTPTNKTAPTGFVLININGVDRKVPYYD >CP028349|1371972:1381643|1377205_1377484_+|AVV36835.1|DBSCAN-SWA MAIIHYGKSVFVGNARTRRHRRRQPLSSLTQSIDQALNFPTEPPVLRRAEQICQRQVAPRVDRAITAPRVTSQETPSFDNCCLPHVHLYAVS |
10 | Streptococcus_phage(25.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1443393 : 1451119
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP028349|1443393:1451119|DBSCAN-SWA TGTGAAATTTGAACTTGATACCACAGACGGGCGCGCACGCCGTGGCCGTCTGATTTTTGATCGCGGCGTGGTGGAAACCCCGGCGTTTATGCCCGTTGGCACCTACGGCACCGTGAAAGGCATGACGCCGGAAGAAGTGCAGGAAACCGGCGCGCAGATCATTCTGGGCAATACTTTCCACCTCTGGCTGCGTCCGGGCCAGGAGATTATGAAACTGCATGGCGATCTGCATGACTTTATGCAGTGGAAAGGCCCGATCCTGACCGACTCCGGCGGCTTCCAGGTCTTCAGCCTGGGCGACATCCGTAAAATCACCGAAGCGGGCGTTCACTTCCGTAACCCGATCAATGGCGACCCGATCTTCCTGGATCCCGAAAAATCAATGGAGATTCAGTACGACCTGGGTTCCGACATCGTAATGATTTTCGATGAATGTACGCCGTACCCGGCTGACTGGGACTACGCCAAACGCTCTATGGAGATGTCGCTGCGCTGGGCGCAACGCAGCCGTGATCGTTTCGACAGCCTCGGGAATAAAAATGCCTTATTTGGCATTATTCAGGGCTCGGTTTACGAAGATTTACGAGATGTCTCGGTGAAAGGTCTGGTAGAGATTGGCTTTGATGGGTACGCTGTGGGCGGCCTGGCGGTGGGTGAGCCTAAGCAGGACATGCACCGTATTCTTGAGCACGTCTGTCCGCAGCTTCCGCAGGATAAACCGCGTTATCTGATGGGTGTCGGCAAGCCAGAAGATCTGGTTGAAGGCGTCCGTCGCGGCGTTGATATGTTTGATTGCGTGATGCCAACCCGTAATGCGCGAAATGGTCACCTGTTTGTCACCGAAGGTGTGGTGAAGATTCGCAACGCCCGCTACAAAGATGACACTGCGCCGCTGGATGCGGAGTGTGATTGTTACACCTGTCGCAATTATAGCCGTGCCTACTTGTATCATCTCGACCGTTGTAACGAAATACTGGGCGCGCGTCTGAATACTATCCACAATTTGCGCTACTACCAGCGTCTGATGGCAGGTTTACGCCAGGCCATCGAAGAGGGTAAATTAGAGCGCTTTGTAACTGAGTTTTACCAACGGACGGGCAAAGAAGTTCCGCCATTAACGTCTGATAATTCATCAATGAGGGAAATTTAATGAGCTTATTCATTTCTGACGCCGTGGCCGCAGCAGGCGCTCCGTCTCAGGGAAGTCCGTATTCTCTGGTGATCATGCTGGTGGTGTTTGGTCTGATTTTCTATTTCATGATCCTGCGTCCGCAGCAGAAACGTGCGAAAGAGCACAAGAAGCTGATGGATTCCATCTCTAAGGGTGATGAAGTGCTGACCAGCGGTGGCCTGGTAGGCCGCGTAACGAAAGTCTCTGACACGGGCTACGTAGCTATCGCGCTGAATGACACCAATGAAGTCGTCATTAAACGTGATTTCGTCGCCGCCGTGCTGCCGAAAGGCACTATCAAGGCGCTGTAATTCTTTCTTCCCTAAGGGAACTGCCGTGTTAAATCGTTATCCTTTGTGGAAGTACGTAATGCTGGTCGTCGTGATTCTCGTCGGCCTGCTCTATGCGCTTCCCAACCTGTATGGTGAGGATCCGGCCGTTCAAATCACTGGTGCGCGCGGAAGCGCCGCCAGTGAGCAGACGTTGGATCAAATTCAGTCCGCATTAAAACAAGACAATATCCAGAGCAAATCTGTTGCGCTGGAAAATGGTGCGATTACCGCGCGTTTCGCTAATACGGATGTGCAGTTACGCGCCCGTGAAGCGATCATGAAAGCGCTGGGTGAGAACTACGTTGTCGCGCTGAACCTTGCGCCTGCCACGCCGCGCTGGCTGACGATGCTGTCAGCAGAGCCGATGAAACTCGGTCTCGATCTGCGTGGTGGTGTGCACTTCCTGATGGAAGTGGACATGGACACCGCGCTCAGCAAGCTGCAGGAGCAGAATGCTGACACGCTGCGTAGCGACCTGCGCACCAAAAATATCCCCTACACCAACGTTAATAAAATCGCGAACTACGGCGTGGAAATTCGTTTCCGTGACGCTGCCAGCCGCGACGCGGCGATCTCCTGGCTGAGCTCGCGTCATCAGGATCTGGTGATCAACAGCAGCGGCAGCGATGCACTGCGTGCCACCATGAGCGATGCCCGTCTCAGCGAAGCGCGTGAATATGCGGTTCAGCAGAACATTACGATTCTGCGTAACCGTGTAAACCAGCTGGGCGTAGCTGAACCGCTGGTTCAGCGTCAGGGTTCCGATCGTATCGTGGTTGAGCTGCCGGGTATTCAGGATACGGCGCGCGCCAAAGAGATTCTGGGTGCCACCGCGACGCTGGAATTCCGTCTGGTGAACACCAGCGTGGATCCGACTGCGGCGGCCAGTGGCCGTGTACCGGGTGACTCTGAAGTTAAAGACATGCGTGATGGCCAGCCGGTCGTGCTTTACAAGCGGGTGATCCTGACCGGTGACCATATCACCGACTCAACCTCCAGCATGGATGAGTACAACCAGCCACAGGTGAACATTTCACTGGATGGCGCAGGCGGTAACATCATGTCCAACTTCACCAAGGACAATATCGGCAAGCCGATGGCGACCCTGTTTGTGGAGTACAAGGACAGCGGTAAGAAAGATGCCAATGGCCGTTCCATTCTGGTGAAGCAGGAAGAGGTGATTAACGTCGCCAATATCCAGTCTCGCCTGGGGAACAGCTTCCGCATCACCGGTATCAACAACCCGAACGAAGCGCGTCAGCTGTCGCTGCTGCTGCGTGCCGGTGCGTTGATTGCGCCGATCCAGATTGTGGAAGAGCGTACTATCGGGCCGACCATGGGTCAGCAGAACATTACTCAGGGTCTGGAAGCCTGCCTGTGGGGTCTGCTTGCCTCGATCCTGTTTATGGTGGTGTTCTATAAGAAGTTTGGTCTGATTGCGACGAGTGCACTGCTGGTGAACCTGGTGCTGATTGTCGGCATCATGTCCCTGCTGCCGGGCGCGACCCTGACCATGCCGGGTATTGCCGGTATCGTGTTAACGCTGGCGGTGGCAGTCGATGCCAACGTACTGATTAACGAACGTATTAAAGAAGAGCTGCGAAACGGGCGCTCGGTTCAGCAGGCAATTCATGAGGGCTACAAAGGCGCCTTCTCCAGTATTGTCGATGCGAACGTAACCACCCTGATCAAAGTTATCATTCTTTACGCGGTCGGCACCGGTTCGATCAAAGGCTTTGCGATTACCACCGCAATTGGTATCGCGACCTCAATGTTCACCGCGATTATCGGTACCCGTGCCATTGTTAACCTGGTTTACGGTGGCAAACGCATCAACAAGCTGTCTATCTGAGGAGTGCGTTGTGGCACAGGAATATAACATTGAGCAGTTGAACCACGGGCGTAAAGTCGTCGACTTTATGCGCTGGGATAAGCTGGCCTTCATCATTTCGGGACTGCTGATTGTGGCCGCGATTGCGATCGTGGGCGTGCGTGGTTTTAACTGGGGCCTCGATTTCACCGGTGGTACGGTGATTGAGATCGCGCTGGAGAAACCGGCCGACCTCGACACGCTGCGTAGCGAACTGGTGAAAGCGGGCTTTGACGAGCCGCTGGTGCAGAACTTTGGCAGCAGCCGTGACGTGATGGTGCGTATGGCACCGGTTACCGGCCCGGCAGGTACCGAGTTAGGCAATAAAGTGGTGTCGGTGATCAACCAGACCACGCAGCAAAACGCGACCGTTAAGCGCATTGAGTTCGTGGGGCCGAGTGTGGGCAGCGACCTGGCACAGGCGGGTGCGATGGCGCTGCTGTCGGCACTGATTGCGATTCTGATCTATATCGGCTTTCGCTTTGAGTGGCGTCTGGCGCTGGGCACTGTGCTGGCGCTGGCGCATGACGTGATCATCACCTGCGGCCTGCTGGCGCTGTTCCGCATTGAAATCGACCTGACGATTGTTGCCTCGCTGATGTCGGTGATTGGCTACTCGCTTAACGATAAAATTGTGGTCTCTGACCGTATTCGTGAAAACTTCCGCAAGATCCGTCGCGGCAGCTCTTACGATATTACCAACGTGTCACTGACCCAGACGTTAAGCCGTACCCTGATTACCTCGCTGACGACGCTGGCGATGATCCTCATCCTGTTCATATTTGGTGGTGCGCTGCTGAAAGGCTTCTCACTGACCATGCTGATTGGTGTGACGATTGGTACGATTTCATCCATTTATGTCTCTTCAGCGCTGGCGCTGAAGCTGGGCATGAAGCGTGAACATATGATGGTGCAGAAAGTCGAGAAAGAGGGCGCCGATCAGCCTTCTATCCTGCCTTAATCGCGCATCAGAAATAAAAACGGGCAGCCTTTTGGCTGCCCGTTTTTTTTGCACCACGGCTAAGGATTAGCGGGTGCCGCAGGGGAGGTCGGGGGCGGTGCGTCAGCGGTCACGTTATGCACGCGGACATAACCCAGTTGCTCCGGTGTCAGGCCGCTGAGTCGAAGCGGAATGCTCGCCGAATGCTGCGGCAGCAGCGAGGCAGGCAGCGCGACCGTCTGCGTCAGACTGTCACTGCTCAGCGGTTTACCGGTCGCCGGGTCGAGTTCACCCCAGATCACCGTCGCGTGCAGGGCGGGCAAAGGCCGATCGTCCATTGAGCGTACGGTCAGGGTTGCCCGCGAACCGCTGGCTTCCGCAGTGACAGGGGACAGCGTCAGTCGCAACGTGCCGAGCTGAGTCTGCAGCGCCACTGGCGTATTCGCCTGTGGCACCAGCCAGGCACCCTGCTGCAACTGGCTGTTAAGCTGTCCCTGAATCTCCAGCGCGCTGGCCTGCGTGGTCAGATGCTGCATCTGCTGATTGAGCTGACTCACTTCCTGATGCAGCTCTCTGACCTGCGGGTTACCTGCACTGCTACTGCAGCCGCTCAGCGCCACCAGTGAGAACGCCACCGGAACCCAAACCATCCTGCTCATAAGTTATCCTTGTCCGTTATTTTGGAATACGTAAAACCTGGCCAGGATAGATTTTATCAGGATGGGTGAGCATTGGCTTATTCGCTTCGAAAATCTTGTTATATTCGTTAGCGTTACCGTAAACCTGTTTAGAAATAGCGCTCAGCGTGTCGCCTTTCTTCACGGTGTAAAGCTCTGACTCGGCCGCGCTGTCTGTCACGGTGACCTTGTCTTCCACTTTTGTGATGCCTGCCACGTTACCTGCCGCGATCAGAATCTTCTCTTTCAGCTCCTGAGAAAGGCCATCGCCGGTCACGGTGACGGTATCGCCATTGACCTTTACGTCCACCTTGTCGCTGTCTGGCAAGCCGAGCTTGTTGATGTGATCCTGCAGCTTTTTATTCTGATCGTCACCGCCGTTTACGGCGTCCCAGAGTTTTTCACCGGCTTCTTTCACAAAGTTAAACAGACCCATATAACCTCCAATGTGTTAGTGGAAAATTTTAATTGCAGCTCTTAAAGCGTAGTCCCTTTTGCCGCAGACGAAATTTAGCATCGTTTATCCTGCGATGAGAAGGCCGCGAAAGACACGGAAGAGATTTTGTTGATTCGCAGTTCGCGCTACACTGTATGGCCCATCACTCTCGCGTCAGGAAACGTCATGCATTGCCCATTCTGCTCCGCTGTGGACACCAAAGTGATTGATTCTCGTCTGGTTAGTGAAGGCTCCTCGGTGCGACGCCGCCGCCAGTGTCTGATGTGTCATGAACGCTTCACCACCTTTGAGGTGGCGGAACTGGTGATGCCCCGCGTGGTGAAAAGCAATGATGTGCGCGAGCCTTTCAATGAAGACAAAATGGCCAGCGGGATGATGAAAGCGCTTGAGAAGCGTCCGGTCAGCGCCGACGCGGTAGAAAGCGCCGTAAACCATATTAAAACGCAGCTGCGCGCCACCGGCGAACGTGAGATCCCCAGTAAGCTGATTGGCAATCTGGTGATGGATGAGCTGAAAAAGCTCGATAAAGTCGCCTATATTCGCTTCGCCTCGGTTTACCGCAGCTTTGAAGATATTCGCGATTTTGGCGAAGAGATCGCCCGGTTACAGGATTAGAGTATGGACGAACGCTATATGGCGCGTGCGCTGGAACTGGCGCGACGCGGCCGTTTTACGACCATGCCGAACCCGAATGTCGGCTGTGTGATTGTGCGCGATGGTGAAGTCGTAGGTGAGGGCTGGCATCAGCGTGCTGGTGAACCCCATGCTGAAGTCCACGCTCTGCGCATGGCTGGCGAGAAAGCGCGTGGCGCAACGGCCTATGTCACGCTGGAGCCGTGCAGCCATCATGGCCGCACGCCGCCCTGCTGTGATGCGCTGATCGCCGCAGGTGTGATCCGTGTCGTCGCCGCCATGCAGGACCCGAATCCGCAGGTCGCGGGGCGCGGACTGCACCGTCTGCATCAGGCGGGCATCGACGTCAGCCATGGTCTGATGATGCCGGAAGCCGAAGCACTCAATCGCGGCTTCCTCAAGCGCATGCGCACCGGCTTTCCCTGGATTCAGCTCAAGCTGGGCGCGTCACTGGATGGCCGCACGGCCATGGCCAGTGGCGAGAGCCAGTGGATCACCTCTGAGGCTGCCCGCCGTGATGTGCAGCGTCTGCGGGCGCAAAGCGCCGCCATTCTCAGCAGTAGCGCTACGGTGCTGGCGGACGATCCCTCTCTGACAGTGCGCTGGTCTGAACTCAATACCGACAGCCAGGCTCTGGTTGATGAGCAGCAACTGCGTCAGCCGGTACGTGTGATTATCGACAGCCAGAATCGCGTGACGCCGCAGCATCGGCTTATCTCACAGCCGGGCGAAACCTGGCTGATGCGCCATCAGCCGGATCAGCAGCTCTGGCCCGCTGACGTCACGCAAATCGCCGTGCCGCTGCGCGAACAGCAGCTGGATCTGGTGGCGATGATGATGCTGCTGGGACAGCGACAGATTAACAGCGTCTGGGTTGAAGCGGGTGCGACGCTCGCCGGTGCCCTGTTACAGGCCGGGCTGATGGATGAACTGATCGTTTATCTGGCACCTAAGCTGTTAGGTCATGAAGGACGCGGCCTGTGTCAGCTGCCGGGGCTCAGCCAGCTGGCTGACGCGCCTGCGTTCCGTTTCAGCGATGTCCGGCAGGTCGGTGACGATTTACGTCTGACCCTGACACCGCAATAGCATGCCCGGAAAAGCGAGAGCACGCCGCCTAAGAGTATGATAGAATCCGCCCCCCTGCGCGGGGCCAACCAAACCCCTGAAAGGATAAGTATGAAAGTTATCGAAGCTGCTGTTGCAACGCCTGAGGCCAATGTTGCCATCGTCATCGCGCGTTTTAACAACTTCATTAATGACAGCCTGCTGGATGGCGCAGTTGATGCCCTGAAACGTATCGGCCAGGTCAAAGATGAAAATATCACCGTTGTCTGGGTGCCGGGTGCTTACGAACTGCCGCTGGCAGCACGTGCGCTGGCGAACTCCGGCAAACATGATGCGATTATCGCACTCGGCACCGTTATTCGTGGTGGCACTGCGCACTTCGAATATGTGGCGGGTGAAGCCAGCTCCGGTATCGCCAGCGTTGCCATGAACAGCGACATTCCTGTCGCGTTCGGCGTGCTGACCACTGAAAGCATCGAGCAGGCCATTGAGCGTGCCGGCACCAAAGCGGGTAACAAAGGTGCTGAAGCGGCGCTGACTGCGCTCGAAATGATCAACGTATTGAAAGCCATTAAAGCCTGA
Protein sequences of DBSCAN-SWA_2 >CP028349|1443393:1451119|1449004_1449454_+|AVV36898.1|DBSCAN-SWA MHCPFCSAVDTKVIDSRLVSEGSSVRRRRQCLMCHERFTTFEVAELVMPRVVKSNDVREPFNEDKMASGMMKALEKRPVSADAVESAVNHIKTQLRATGEREIPSKLIGNLVMDELKKLDKVAYIRFASVYRSFEDIRDFGEEIARLQD >CP028349|1443393:1451119|1444899_1446747_+|AVV36894.1|DBSCAN-SWA MLNRYPLWKYVMLVVVILVGLLYALPNLYGEDPAVQITGARGSAASEQTLDQIQSALKQDNIQSKSVALENGAITARFANTDVQLRAREAIMKALGENYVVALNLAPATPRWLTMLSAEPMKLGLDLRGGVHFLMEVDMDTALSKLQEQNADTLRSDLRTKNIPYTNVNKIANYGVEIRFRDAASRDAAISWLSSRHQDLVINSSGSDALRATMSDARLSEAREYAVQQNITILRNRVNQLGVAEPLVQRQGSDRIVVELPGIQDTARAKEILGATATLEFRLVNTSVDPTAAASGRVPGDSEVKDMRDGQPVVLYKRVILTGDHITDSTSSMDEYNQPQVNISLDGAGGNIMSNFTKDNIGKPMATLFVEYKDSGKKDANGRSILVKQEEVINVANIQSRLGNSFRITGINNPNEARQLSLLLRAGALIAPIQIVEERTIGPTMGQQNITQGLEACLWGLLASILFMVVFYKKFGLIATSALLVNLVLIVGIMSLLPGATLTMPGIAGIVLTLAVAVDANVLINERIKEELRNGRSVQQAIHEGYKGAFSSIVDANVTTLIKVIILYAVGTGSIKGFAITTAIGIATSMFTAIIGTRAIVNLVYGGKRINKLSI >CP028349|1443393:1451119|1448380_1448818_-|AVV36897.1|DBSCAN-SWA MGLFNFVKEAGEKLWDAVNGGDDQNKKLQDHINKLGLPDSDKVDVKVNGDTVTVTGDGLSQELKEKILIAAGNVAGITKVEDKVTVTDSAAESELYTVKKGDTLSAISKQVYGNANEYNKIFEANKPMLTHPDKIYPGQVLRIPK >CP028349|1443393:1451119|1450648_1451119_+|AVV36900.1|DBSCAN-SWA MKVIEAAVATPEANVAIVIARFNNFINDSLLDGAVDALKRIGQVKDENITVVWVPGAYELPLAARALANSGKHDAIIALGTVIRGGTAHFEYVAGEASSGIASVAMNSDIPVAFGVLTTESIEQAIERAGTKAGNKGAEAALTALEMINVLKAIKA >CP028349|1443393:1451119|1446757_1447726_+|AVV36895.1|DBSCAN-SWA MAQEYNIEQLNHGRKVVDFMRWDKLAFIISGLLIVAAIAIVGVRGFNWGLDFTGGTVIEIALEKPADLDTLRSELVKAGFDEPLVQNFGSSRDVMVRMAPVTGPAGTELGNKVVSVINQTTQQNATVKRIEFVGPSVGSDLAQAGAMALLSALIAILIYIGFRFEWRLALGTVLALAHDVIITCGLLALFRIEIDLTIVASLMSVIGYSLNDKIVVSDRIRENFRKIRRGSSYDITNVSLTQTLSRTLITSLTTLAMILILFIFGGALLKGFSLTMLIGVTIGTISSIYVSSALALKLGMKREHMMVQKVEKEGADQPSILP >CP028349|1443393:1451119|1443393_1444542_+|AVV36892.1|tRNA|DBSCAN-SWA MKFELDTTDGRARRGRLIFDRGVVETPAFMPVGTYGTVKGMTPEEVQETGAQIILGNTFHLWLRPGQEIMKLHGDLHDFMQWKGPILTDSGGFQVFSLGDIRKITEAGVHFRNPINGDPIFLDPEKSMEIQYDLGSDIVMIFDECTPYPADWDYAKRSMEMSLRWAQRSRDRFDSLGNKNALFGIIQGSVYEDLRDVSVKGLVEIGFDGYAVGGLAVGEPKQDMHRILEHVCPQLPQDKPRYLMGVGKPEDLVEGVRRGVDMFDCVMPTRNARNGHLFVTEGVVKIRNARYKDDTAPLDAECDCYTCRNYSRAYLYHLDRCNEILGARLNTIHNLRYYQRLMAGLRQAIEEGKLERFVTEFYQRTGKEVPPLTSDNSSMREI >CP028349|1443393:1451119|1447785_1448364_-|AVV36896.1|DBSCAN-SWA MSRMVWVPVAFSLVALSGCSSSAGNPQVRELHQEVSQLNQQMQHLTTQASALEIQGQLNSQLQQGAWLVPQANTPVALQTQLGTLRLTLSPVTAEASGSRATLTVRSMDDRPLPALHATVIWGELDPATGKPLSSDSLTQTVALPASLLPQHSASIPLRLSGLTPEQLGYVRVHNVTADAPPPTSPAAPANP >CP028349|1443393:1451119|1444541_1444874_+|AVV36893.1|DBSCAN-SWA MSLFISDAVAAAGAPSQGSPYSLVIMLVVFGLIFYFMILRPQQKRAKEHKKLMDSISKGDEVLTSGGLVGRVTKVSDTGYVAIALNDTNEVVIKRDFVAAVLPKGTIKAL >CP028349|1443393:1451119|1449457_1450558_+|AVV36899.1|DBSCAN-SWA MDERYMARALELARRGRFTTMPNPNVGCVIVRDGEVVGEGWHQRAGEPHAEVHALRMAGEKARGATAYVTLEPCSHHGRTPPCCDALIAAGVIRVVAAMQDPNPQVAGRGLHRLHQAGIDVSHGLMMPEAEALNRGFLKRMRTGFPWIQLKLGASLDGRTAMASGESQWITSEAARRDVQRLRAQSAAILSSSATVLADDPSLTVRWSELNTDSQALVDEQQLRQPVRVIIDSQNRVTPQHRLISQPGETWLMRHQPDQQLWPADVTQIAVPLREQQLDLVAMMMLLGQRQINSVWVEAGATLAGALLQAGLMDELIVYLAPKLLGHEGRGLCQLPGLSQLADAPAFRFSDVRQVGDDLRLTLTPQ |
9 | uncultured_Mediterranean_phage(50.0%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1923801 : 1964980
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP028349|1923801:1964980|DBSCAN-SWA ATTATTTCCCCTTCGCCTGTCCGCCGACTACAGGAACCACCACTATTTTTCTGTCATAGGCAGCAGTCTGCTCAACGTTTTTATGCCCCGTTATAGCTCTTTTCTCGTAAAGGTCACCCTCCAGATCGGAAACTCCCTTAGCCTTCAGATCATGGAATGTGAAATCAAACAGTAGCTCAGGGAACTTGAGTTTCGCCGCTTCGCGTGCAGCACTCCAGCGACTATTGAACCCGTCGCGGGTGTAGCCAGAACCATTGGGCTGATGGATTATGAAAATACTGCTCATGCCAGGCTTAAGGGGTAGTTCAGCTGCCATCTTTATGGCTGCAGTAAATCTTGGTGACCATGCTTTAATCTGAGCAACACTGGTTTTACTCTGTTTAATCAGTACGCCCTCATCCACAATTTGGCTTTTTTTCATTGCGAGAATATCACCCTGACGCGAGCAGGTGAGATAGGCCAACTCCATTGCGATCTTCACAACTTCAGGCGCGCATGAATAGAGAGCCTGGTACTCCGCATCGGTCACATACCGATCCCTGGATACCTCCTTGAACTTCTTAACCCCCTTGGTTGGATTACCTTTGACCATGCCACGTTCATAGCCCCAGCGGTACATGCGGGACATAAACGCTTTTTCACGGTTGGCCTGCACCCGGCTTTTTAATCCACGCTTGTCCATATACTTTCTGACGTGCTCAGGCCTAATGTCATCAGAGGGCATGGCACCAAAAACAGCTAAAACATTCTTTGAATATTTCAGGTAATCCCGCTGCGTTTCGCGTGCAAGCTCGAAAAAATCAGCAGATTTGAAAAAGCGGTCAGCCAGTGACGCCAGCAACCGGTCATCAGGTATCTCATTAATAAGTGCCTCATATGCTGCCCAAACTGAGGACTGAGCCGCATCCAGCGCACAAAGGCGAATTGCGCCCCCATCCTTGGGATGGAACTCATAGGCTGACCTGCCGCGATAAACGCGTGGTGGCATCCAGGCGTCAGCAGCATTTTTGCGAAGGCGAGCCATTAATCTAATGCTCCAAAGTTGGGTTCATGTTTACTGGATTCTGGGGCTTTCCGTGATAACAATGGGTCATTGAAATGCTGCCATGTTGTTCTCGGGCGACCATCACGGCGTACCACGAAAAATATACCGGCCTGTTTAAGGCACTGGCATTGTTTAGACGGAATTTTATAGCCCGTTATTTTTTCGATATCGGCGTCAGAAATAATTACGTTATTAATATCAGACAGCGCCATCACTCCACCTCCTTCAAAAAGATAATCCAGTGGGTTTTGTCGCCTTTGCCAGTGCGCTGCCAGATGGTTGGCTTGTGATCGGTTAGGGCGATTACCTGGCTTACTGGTATCTGTGTCTCATTCCATTTAAAAATCAGCGTGCCGTGTGGCCGCAAAACACGAAATGCTTCACTGAAACCCGCGCGGATATCGTCGCGCCATGTCTGCTTATCCAGCGCACCGTACTTTTTCCTCATCCAGCCGTTTTCTCCAGCGCGGTCCAGGTGGGGTGGATCAAATACCACCTGTGCGAAGCTGCAGTCAGGAAAGGGCAAAGAACGGAAGTCAGCGATGATGTCGGGATCAATATGCAAAGCACGGTTATCACACAGCACATGTGACTCTTTGCGGATGTCTGCGAATATCGCGCGGCTGTCTTTCTTGTCGAGCCAGAACATCCGAGAGCCACAGCACATATCGAGGATCGGCTGTTCCATCACTCCACCTCCGGTGCTGTTGACCGAAGCCAAATACATACTGCGCCGTCTTCAGTGTCATGGATTGAGCCAACGAACCAGCCTCCACCTTCAGGGCTTTCAGGTTGCCAGGCAGAGATATCGCAACCATCAACGGTGGGATCGAGCATGTCTTCATCACGGTAATTAACTTTCCACTTCAATCCATGATCAGACATCCACTGATCAAACTCTTCGGTAGAAATGAATTCACGACCATCACAGAACGCCAGATAATCTGGGTGAGACCAATAGCCATATTGGTCGCGTTGCACTTCTAATGGCTTAATGCTCATTGAATCGTCTCCTTAACCCATGCATTCCAGATAAAGCCCGCTGGCAATCAGACGGGCGCGGCGTGTAGCTGCTTCACGATGGCGCTTGATAGCCTCTTCAGAACGGTCGTTGCTGTAGTTGATAACCATTGTCTTACATGGCGGGGGAGCAACACGGCGCGGATTTCTGACCAAGGTGTAAGTGCGGTCAATGGAGCCACCACCGAGAAAGACCTGATTGGATGCCTCAACCTGAAGTGTTTCACCACCGCGCCGCATGATGTGAAGGACCAAACGGTTGAACTCACTCAGCGTCATACCGAGACGTTCTGCCAGCTCCCGGCCCGTTGCAGGGCCTTTTGATAACTGCCAGGCTAACTTTTCACTGAATCCAGCATTTGCCCCGTTGCTGCGCCGGAATTGGGCGACCTTTTTCATGACACCACCTTCAGCGTTACCGTGTGCGTGCGGAGCACATCCATTTCCATTTGGGAAATGATGTTGATCGCATGTGAAATGCCCGGTTGCTGGTGGTTACCCAGAGTGGTTACGGCGCGGCGTGCTTCCCCAAGGGCTTCACCACGCAGTGTGCGAATCCACTGATCACAAGCTGGAGTGGCCAGCGCTGCGTTCAGGTCATCAATCAGGGTCATATCTGCCCCGGCAGCCTGAAGAACGCTAATGGTGTCAGGCAGCACGCTGTTGATACGCAGCACCTCTCCAGCCATGAGGTTGGCGCGAACGGTTGCAACCTCAAGACGTGAAGCCAGCTCAGTCACCATCTTTGCCATATCAATCAGAGGCGTGTCCGCACCGATGTTCTTTGCGAACTGATGGCCGGCAGCGACAACTTCTTTATTCGATTTGAAGTGATGCATGTCATCGCCCTCAGTGAATGGTGATGGTGCTGTTAAGGCGCTCAGCTTCGTTCTGCGCCTTAATAGGATTAGTGATTACTGAACCGTCAGGCATGACCCAGCCGTTGAGGATATGGCTGTAGGGCAGGGTGATAGGGTGATAATGCCTACGGTGATATGGTCGTTTGGCTTTTCCATGAAACTCTCCACACACGATTTTTAGTTGCATGAATCCCTTGCCAGTGACGGCAATAAAAACTATTGGGATTCGTTTAAGTTGGCTGGTGAGCTACTGCAATAACCCACCGCCCGATTACTCCACACATTTGAAAGGTTGTTGCGGTGCCGGGTGCCTCCCGGTGCTCTGGTCAGACTGACAGACACCAGAGCGGAGACTCTTAGACTGTATGCAATCTTTGTCAGTCTTCCGCGCGCGCTGGCCGCATTCACCACAACGAAAAGGACACTTACTCCACGTCTCTAAAGCGTTCGAAAACACCCGCTTTGCAAATGTCCTTGTCGTTGTGAAAAAGGGCGGTTAAACAAAACATTCTGAGTAACCGCCAACACAGCAATTCCGTACTCTTAAAACGCTGGTCCAAGAACCACGTTTTCAACATCACACTGCACACTCACCACACCTGCATCACCACAGCAGATCACATCAGCATCCGGGAAGAGGCGGAGAAAAGTAATCAGGTCCCGGACCGTTGTATTCGCCATGTTCTTAATCATTTTCATCCTGCAATCTCCGAGCCTTAATTTGCTCCGGTTTATTTATCCACGTCCGGCCCGTGTTTCCCTGCTATTCCCCAACAACAAGGAATCGGATAATCTGGATATACCCCAACAACAATGAAGGATGGCTTATGCATATGGCTAATGACGAAGGTGATAAATATTTCGAAAGGCACATGCCTGGTCGCAAGATTCCACCTCCGCCGCCAAAACCAAAAGATGAAGACGAAAAATGAGGTAGTTTATGAACCGAGATGATTTGGAATTTTCTGTAACTTATTCATATTACTTGGAAAAGATGAACTATCGTCTTCTCACAAGAATTGACAAGCTGATTACCTTAACTCTCATCGTTCTCGGTTTTTCCGTATTTGCTAAATTCAGCAATATGTTTGTTTTCGGTGCTGTCGTTGCAGTGCTATCGGTTCTTCAACTTGTTTACCAATTTGCTCAGGAAGCTGGTGCTTCTAAAGAGCAAATGCGGCATTACCGCAGCCTCATGGTAAATATGAAAAATATTGGAGATGAAGAGTTACGTCAGCGCTTCGCAAAAATTCAAGACTCAGACAGCATGCCTTGGCAATCACTTGAAGACGCCGCATACAATCGAACCTTGATAGCGTTAGGGCATACCGCCTCGCTTACAAAGCTTTCAGCAAAAGATTCAGTGCTATCTTGGTTTGCAGGTGATCTTCCAAAGAGTTAGGAAAATCTAATTGGAACGACCAAAGCCAGGCGGTCATGTACCCCCAAGACCGCGTCCACCAAAGCCTACCGTTTAGGTATATCCAGATTTTTAAAGAGCGAAGCGTCCAATGGGCGCTTTTTTTACGAATCATCCCGATCTTCTTACGCCTCGGGCGGCTACTTCGTGGTCGTCCTGCCTGTTCGCTGTTGATGGGTTAAATATACACATAATGTGATTTGGTGGTCAATCACAAAACGTGTATATAGATTGAAGAACACATTATGTGCATGATTTTTCGTGTGATAAATTATTTTTAGATGAAACTTTGAGCGGCTTGTGATGCTCTTAAATGGCTGGTTATCGCCTTAATAGGGGGGTGTTGTGAGCAAAAAATCAGACGAGCTCTATGACGAGATGTGCAGGGTTGTTGGTGACGTGGTGTTCACACTTCACGATTATGGGATTGAGTCGAAGCAGATAGTGATAGCGGATGCACTAAGAACGGCGCTGGCTTCGAATAACCCTGAGCGGTCAAAGTTACTGGCTAAAGCCATGGAAGCTGCGACGAAAGTGCTGGATCGATAGGCACAAAAAAACCCGCGTTACACGGGCATAAATTTGTTCAATTGGGGCAAACCATCTGTCAGCATAAGCTGACGATAGTGATAATACTTACGATAAATTAATTAGGAAATAGATACTCCACAATTTTTAAGCAGCGCAAAATTTGCGCGAAAAAAGAGGCCCGCCGGACGCGGGCTAACAGTATCAAAGCATTCAAAAGAGAAGTTTTGAGCATCACTTGAGCTCATTCAGATTATCGTCAGATACTGCGAAATCTTTATTGAAATGTCTCTTCAGGCCATTGAGCTTTCACCACCTTACCTATGATCCGGCAACTGTGATCACAGTCCAGAATTCGATAGGCAGGATTAAGCGGCACCAGATAGCTTACACCTGCGTCTTTCTCATATTTTTTGAAAGTCACTTCAGAGTCAGCGTTAGCAGATGCCACACAGAAGTCACCAGTCTCAACCGGTTCTGCCGGGTCAATGAGGATCAGCATACCTTCCGGGAAACTTGGACGAACGCCCTGCGGCGCAGTCATTGAGTGGCCTTTGACTTCAAGCCAGAACGCTTTATCGCTGGCCTTCTTCGTAGTAGGTACCCATGCCTTAGCGTCTCTGGAAGTAAAGCTACCCACTTCAGAAAAATCACCAGCCTGGACATAAGTGAACAGGGGGTATTCATACTGCTTGAAAACTGCATCGGCATCCTCACCAAAAAGTATTTTGGCAGGAGATGTACCTAAAGCAGACCCTAATAGAATTGCATCGTCAGAACTGACCTTCCTCGTTCCAGATTCATAATTACCCAAACGTGACGGAGCTGCCCAGCCACATAATCTGGCAAGCTGCGCCTGGCTCAATCCCTTGCTTTCTCTTAGGGCTTTGATCCTTTCCCCTATCAACTCATGCATTGTTTTCATCCCCTCAAATTTAACACGCAGCGTGATTACTGTATCTACACGTTTTGTCATTGATTCTTAATCACGAATTGTGTGTAATTGCTTTGTGATTAACTTTTGGAGAAGCCAATGAACACTATCGCTGAGCAAAGAAAGAAGCTCGGTATTTCCCAGTCTGTTTTGGCTGATGTTATTGGCTGGGGCCAATCGCGAGTAGCCAACTATGAGCTAAGCATCCGTAAGCCAGGCCTTGATGAGTGCCGAATGATTGTTTTAGGACTCAACAAGCTCGGAGCCAATTGCTCTCTGGATGATGTTTTTCCGCCCGGCAGAGTAAAGAAGTAACAGATTTTAAACGCTGCACTTTATTTAGTAACCACAGGAAAGAGGGCTTAACCGTGGATCAGAAGCACTGGCAAGTAGAAAAGCAACCGGCATGGCTGGTGGCAGCAATCAAGAAGACGATTTCGAGTCTTCCGGGTGGTTATGCGGAAGCTGCTGAATGGTTGGGTGTAACCGAAGATGCGCTATTTAACCGCCTGCGCACTAACGGCGATCAGATTTTCCCAATGGGCTGGGCGATGGTTCTGCAGCAGGCAAGTGGTACTAAGCACATCGCTAATGCGGTGTCCCGTCAGTCAAACAGCGTCAACGTTCCGCTGGTGGACATTGAGGATGTGGATAACGCGGACATCAATCAGCGCCTGATGGAGTCTGTCGAGTGGATTGGTAAGCATTCGGCCTACATTCGCAAGGCAACCGCCGACGGAATGATTGATGCGGCTGAACGTGAGCAGATCGAAGAGAACAGCTATCAGGTAATGGCGAAGTGGCAGGAGCATCTGACGCTGCTGTATCGCGTTTTCTGCGCCCCAGAAAAGGTGAACGCCGCTGGATTGCAGTCCGCGGCGCTCGATGCGACTAAATCAACGTGTGTGGAGAACTAATCGCGTGATCAATTTAACCAGAAAATCAGGATTACCGCAATTCCGTTGCCTTCCCTCAGCTGGTGGCCGCCTCAGCAGTGAGCCGCTGCGGTATGTGCTTAATGTACCAGGCGTAAGCGAAGAAGTTAACCACAGCTTTGTGAGCTGGGCTGTGGGCGATGCTAACCAGCGAATGAAGGCGACGAAATGCGAGAGCTTGACCGAATCTTCCGCGACAAGCGAGGCATCCCTGTGCGGGTCATTCGCTGGGAGCCAGAGAACGACAGGGTTATCTACCTGCGTGACAACTATGAACATGGCGAGTGCTTCAGCTCTCTCGAACGGTTTAAACAGTACTTCAGGGAGGTCACTGTAATTCATGAGCCTACTTCTGAAAGTAAAACCGCTGGTGATCAGCCCGGAACTTGCGCTGCGCATCGGCCTGAATGAAGCGATTGTCCTGCAACAGATTTGCTATTGGCTGGAAGACACCACAGCAGGCGTTGAATATGAGGGCAAACGCTGGGTTTATAACAGCATCAATGCGTGGAATGAGCAGTTCCCATGGTGGACCACGAAGACCATACAAAGAACGGTTTCGTCGCTGAAAAAGATGGGGCTGATTTTTGTTGAGCAACTAAAAAAGAAACAGCACGACCAGACCAATTATTACGCAATTAACTACGCAAACCCTTTGCTGACCGATAAGGACAATTTGTCCCTATCGAGAGAGACAATTTGTCCTAATCGAAAAGGTCAATCTGTCCCTATGGATAAGGTCAACTTGTCCCACTCCATCGGGTCAAATTGTCCCAATGTTACAGAGATTACAACAGAGAATACTACAGAGATTACAACACCCCCTTCTTGTCAGGTTGCGCCGCAACCTGACGATGAGTGGTCACTGGTTAATCGTTCTCGGGAAGTCTTGCGCCACCTGAACAAAATTACCGGCGCTAAACACACAGAGGCGCAGTCTTCGATGGGTCACATCAAATCCCGACTGAAAGACAAATTCACGGTGGAAGAGCTTTGCCTGGTTGTGGATTACAAGCACGTCCACTGGGAAGGCACCGAGGAATACCAGTACATGCGGCCAAAGACTCTGTTTGTCCCCGGAAATCTGCCTGGCTATCTCCAGTCAGCCACCAAATGGGACAAACACGGTCGCCCGCCGCGCTCTGAATGGAATGCCATGAAGCGCAACATGCAGCGGGATATCACAGTCATTCCGCAGCCTGACAGCTCAGTGCCTCACGGCTTTCGCGGTTAACGGGGGATAAATCATGATCAACCACGAATCAAAAATTCTTGAACTGATTACCCGCAATGGCCCGCTGAAGGTGCGCGAACTCTGCAAGCTAACCGGCCTGCATGAGACATCGGTGAAGCGTTTTATCAAACCGCTGTTCACCAAAGGGCTGCTAAAGCGTGCCAGCGACTGGAGTTACTCAATCAACACCGACCCGTTGCCGGTTGAGAGCGAGAAATACAGCCACAAGGCTAAGCAGGCCGCCGAACTGGAAAGCAAAGGGTTCTGGCTGCGTGCAGCACAGGTATGGCGCGAGGCGATGCTGGTGGCGAAGTTTGACGCATCACGCAACGTAGCCAAAGAGAACTGCGACCGCTGCGCTGTGAAGGGCTCACTCAACTGCGGCAGCTATGGCGGACTTGATACCGGCCGCATCATTTCAGCCAGTGTGAACAGGGATTTGTTATGAGAGCGCACCTGAAGAACCACTACCAACGCAATGAGATTTTCTACCAGGCCATCCGCACAGCAGTGGTGATGATTGCCGCCCTGATTTTTGTCCTGACATGGGAGCTGACCACAGCATGAGTACTTTAGCGCGCATTTACGACGACAAGAAAAACAGCGATACCGACATCACCACCCGCAAAACCTACCTGCTGGGCGTTGATGAGTTGTATGTTGAAACTAATTACAACATCCGTGATATTGACCAGACCCATGTCGAGGAATTCCGCGACGCCTTTATCGCTGGTGAGCACGTGCCTCCGCTGGCTGTTAAGGTCACTGAGAAGGGCATTAAGATCATCGACGGCCATCACCGCTACTACGGTGCGAAGCTGGCACAGGAAGCAGGCTACACGCTGCGCCTTGAGTGTAAGGACTTCGTGGGTAGTGAAGCTGACAGCGTGGCGTTCATGGTTACCAGCAGTCAGGGCCGCGCCCTGTTGCCGCTGGAACGTGCAGCAGCCTATCAGCGTCTGGTTAATCAGGGCTTAGAGCCAGCTGAGATTGCCGCTAAGGTGAAACGTTCGATCACCGACGTTGAACAGCACCTGCAGCTGCTGACCGTTGGCGAACCGCTGATTGAGATGGTGAAGTCTGGCGAAGTGGCCGCGACGACTGCGGTAGCCCTGCAGCGTGAACATGGTGTGAAAGCTTCTTCCGTTGCGCAAGAGCAGATGCAGAAGGCGAAAGCAGCAGGCAAGAAGAAGCTGACCAAAACTGATGCTATGCCGAAATTCAGTGCCGCTCAGGCACGCAAGCTGGCAGAACTGATTGCTAAACACTGTCAGACCGAGCAGGGCGAGGAGGGTTCTCGCGTTTCGCTGACGTTTGAAACTGACCTGCAGGCGGCTGAGCTGATGGATATCATCCTGATCGCCAAAGAGCATTATGGCGTCACGAAATCAGTTAGTGAGCAACCGGAACCGGCTAAGACCGAGAACGGCGATGGTGATGACCTGCCGCTGCTGAAAAACGAAATTCTGGAACAGAGCGGCGTTGAGGTCTGGGCGTGTGTGGTTGCGGCTTTCAAAATGAAAGCTGAGTACACCTACAGCGAATCCAAGTGGGCGCATACCTGGGCGGCAGACTCTGTAGAGAACCCTACCTGTGTGACAGTGCCTGCAGAGACTATTGCCAGCGCAGTCCGTCTCATCAAGCAGCACCAGGACGCTCTTGAACTGAAGCTGTGGGTTTCCGAGCAGTACGATGATCCAGAGCTGGCAATCGAGCAATTACAGCGCTTCTCAGCAGTGCTGATTGACGTTCGCCACGACAGGCCATGCACGGTTCAGGAATTTATCGCGCTAGTGGAGCAAACTAACCGTGATTACTGGTTAAACATTCGCATGCTGCGTCAGGCAGTAAGTGAAACGCAGTAATCTTAAAAAGATACTACGTAATTACCTCCTAAATAAATGGCAGGATTTACCACATCAAAAATATTGATTGATTGATAAGCCCCGTGATATGAAGGGGCTTTTAAAAATCAAGGAGGAAAGTTATGAGTCGATTGTCGAAATCGGTCACTTTTATATCTGAACTTTTAGCAGCAGCAGCTATCGTTGCGTCAGCTACAACATTCGTACTTGATAAATTAGTATGGAGTAAGCAGGTTGACATTACAGCATCTTTAGTAAATAGCGGGCCAAAGAGTTTATCAATATTAATGTCCAATAACGGTCAGGTTGATGTGGCTATAAGAAAAGTTACTATAGATGTTTTAGGGAATAGTATTAAAAATTTGGTGAAGCTAGATGCTGGTGGAGAAATACTAACTAAAAATTCATCAAAACTTATCAATTCAACCTCGTCCTCTTTAAGTAGTTCCGTAATTGTAGGCCCAGATGATAATGTCAAAATAATGGGTGCCTCGGATATGTTAGACTGTGTAGTAAACATCAACTACATCGCTGCTGGAGATGACGTTTCGAGAAGTATACCTATTAAGGCAAAGTGTTACCCATACTGCGTAGTCGATCCTGAGGATGTAGAAACTTTAATTAAGAAGATAGGATCGCATCATATAGAGTGGTGATTTACGTGTCGAAAAATAAGTGCAGTAAATTTTGAGTAATGCTCATAAAAAGCAATATAATGGAGATACACCTACAGCGAATCCAAATGGGCGCATACCTGGGCCGCAGACTCTGTAGAGCATCCAGAGCACATCGTCGTGCCACAGGATACCATTGAGAAAGCACTGAAGCTTATTCATCAGCGCCAGGATGAACTGGTCATCAAGCGCCGGCTGACTGAGCAACTCGGTGATGCTGAGATGGTCTCAGAACATCAGCTTTGGTTTATCGGAACGCTGTCTGATTTACGTCTGAGTTAGCCATGTACGGTTGAGGAGTTTGTCGCACTTATAGGAAAAACGGACCGGAACTGTTGTTCAAATTACCGATTATTGCGACAGGCATTTCACGAGACTACTGGTCAGTAGTGATTTCCAGGATCTTAAAGTAGTGTGTAATTGTTTCTCAGTTCGATCCTGAACAGTTAAATGAAAGCGCCTGTGGGCGCTTTACAATCGAGTATCATAACAATTTTTACGTTTAATGTTAGCAATGATGAGGTTTCTACTTGGTTCTACGTTCGAAAACGCATCTAAATCTGTATTTATCAATATCTATAGTGATATCTTTATGGTTATTTCATTAATAAGGGTTAAGGCTTTCTATGTCAGGATATGCTGATTTCATACCAACTCTAAAACGCTTATCAAAGTTTGATAAGCTATCTGAGCGCGAAATTCTAGAGCAGTTCAGAAAAAATCATAATGTGGAGCCATCAATTCAAAAAAAAGATCTTTGTTATGCTTCATTTGCAGTAAGTGTTGATGGTCTTAAGTTTTTGCTAACCGAAAGACCGATATCTTATTTGAATCTGCATTGGGGAAGGTGTTCTAATATGCAGGTCAATGCAAATCCACTTGGTATAGCTTTGCCACTATACATTGGAGAGGGCACTCTTTCCTCTGCAATCCATGAAATTGGTATAAGTAATAAACCTTATGAAGAGTCAAATAAATGGCTTTTTGAAAATTTCTCATTAGAGATTGCAATTGCTTATTTCAACAAGTATTTCATAAAAAGCGAATCACTTGGAAGCTATAAAACCATTATTTTTGAAGCTATTGAGGCTTTTTATTTAGGCTATGACCACATATCTATTATGTCGCTCTTTCCTGTTTTTGAAGGTGGGTTAAGAAACTTGCTTGTGAAATTTTGTAATGGTGATGATACAAACACAAGCGCCGAGAAATTTGAAAAGGAAATTAGAAAGCTTATTATAAGATGGGGGGAAGGTCGGATTCCAGGCTTCGATTGGCATCCAGGTAAAGGGTACGATGTTGAAACTGAGGTGGATTTTTTTACGCATTTAAACCCACAATGTGATGTGATGAACTCAACCAGATCCTTTTTTAAAAACGTCATATATAAATCAACTGGAGGAGTTAATCAGGGTGGTTTTAATAGACATCTGGCGCTGCATTTGCTCAATAACGATTACAATGAATCATCTAATTTCATCAGAATATTTTTAGCGTTAACACACATAACGTTTGCCGAAAGCCTTATGAATGAAAATGTTCCTTTTTTTTGGCAGGGTATAGATGAAAATGATCGGCGTATAGCTTCATTTATTTCTAGAAGTGCTGATGTTATTTTCGGTATGCGCAGAAAAGAATTAAATAGGCTTGGGGTGGATCTATATTAGCCAAGGATTGGTTTTAAATAATGAGATTAATCCGTCTTAATGCTTATTTCTTTACGTAACGATTTATAGTGTGCGCTCATGAAAAAAGCATACTTCCTTAATGAAATAATTGTCAGTTATTTTAGGTGGTAAAATTATTGGCAAAGCCGTTATGATGGAAATGCTAGTTAGTGTATGCAGGCACTACAGGCAAAGGTTGATCCCGTTCATTTGCAGATGATGGGGCGGGACCATAATAAAACAGTGTGTGGAGAAGTAAGCATGAATCAGCTTTTAGTGATTGATGGGGTTTCCGTTCGTCAGGACAACTCCGGCCGTTATTGCCTTAACGATCTTCATCGTGCAGCAGGCGGCGAACGCCGACATGAGCCTTCCTTGTGGCGCAACCTTCAACACACCAATGAACTCGTTCAGCTTTTGAGCGATACAGGAATTCCTGTATCGGTAATTAAGGGCGGAGTGAATCAGGGCACGTTTGTGTGCAAAGAACTGGTCTACTCATATGCGATGTGGATCAGCGCTGAATTCAGTTTAAAAGTAATCCGCACGTACGATTCTCTGGCATCAAAGCCCCCCGTCGTTTCAATGCCAGAAGAAGTGCAGGCCAGCATCATCCTGCTCGAATCAGCGTCCCGGATGCTGAACTTCTCGAACTCATCAAAGCTTGGCGCATATCAGAAGATTCAGCAGCATTATGGTATCCCTAACATGATGCCAGCATATGCCATCGACGCACCCGTTGATGCAAAAGATGGTTCAAGCCGTCCCACGCTTTCATTAAGTGCGCTTCTAAAGGCCAACAGTATTCGAATGAATGCGAGCCAGGCCTATCGTCAGCTTGAGAAACTTGGGATCGTTGAACACAAAAGCAGAGCCAGCCGGTCAGGCACCGACGGTGTGAAGCTATTTTGGTCTCTGACGACTAAAGGCTGCATGTACGGCAAAAACATCACCAGCCCGGCTAATCCGCGCGAAACCCAGCCTCATTTTTTCGAATCAAAATTTGCCGAGCTTCTCCGCCTGCTCGACACCGTGCATTGAGGTGACTGTGAGAGCATTGCTTACGCCAGAGATAGCCCCGCGCACCGGGATTGTGTTGCTGAAGCCAGGGCCAGACCTGTTGAAGCTGTTTAAGGGCAGGGTGGTAATCAGCACACCGACAATGGATATGGCAGACCTGCCATCAGGGCGGCTGAATGACGGCACACAGCCGTTACTTGATGAGCCATCACTGATTCCCTTCTTCAGTCACGAACGCGTGATAAAGGCCGCTGGTGGGCCAAATGCGCTGGCATCCTTCGTCCAGTCTTTCAACTGCTGCCAGTGGGAGCGGGAGAAATTGGGCGTATGGCATCACCATGAATTTACTGTGTCAGAAACTGAAAACGGCCTGGTGTCTCTTTGCTACAGCCACGATAATGAGTTCAGGGAAAACGGCGTACCCGGTAGCCTGGAGAATATCGCCAAAGGCAACACCGCGCTGTGGATCATCAGGGCGGCATGCAGCCAGATGGCGCTCAGCAGCGACCACCAGTTGACCCTGCCAGAACTATGCTGGTGGGCAACTCTGAATGATGTGATTGACCTGATACCAGAGGCACCGGCCCGGCGCGTTCTGCGCATGCCGAAAGAAAGTATCCAGAGCGGCGAGCTGAAAGAGGCCCGCATTGTTCCGGTGCGACCGGCGCGAGAGGTTATTCAGGATGCAGCGCAGATCGTCAAAAAGATAATCAGCCTCCATGCTGACCCGGAATCACCAGAATCATTCATGAAGCGCCCCAAGCGTAAGCGCTGGGAGAATGAGAAGTACACGCGATGGGTTAAGTCACAGAATTGTGCATGCTGTGGCGTACAGGCTGATGATCCGCATCACATCATCGGACACGGGCAGGGAGGAATGGCAACGAAGGCGCATGATTTATTCGTGATGCCGCTATGCAGAGCGCATCACGATGAACTGCACCGGGATATGAAAGCGTTTGAAGCTAAATACGGCAGTCAGGTTGATCTGCTGTTCAGGTTCCTCGATTTCGCGATTGCAGTCGGTGTTATCGGGACAGACAAAAAATAAAGTGTGTGGAGAGGGTTAAAAATGCGTGATATTCAATTGGTACTGGAGCGGTGGGGCGCATGGGCAGCGTGTGAGGGTAGCCAGGTCGGCTGGTCACCAACGAGCCCAATGTTCAGGAGTCTGCTTCCTCAGGAGGGTAAATCTTCCCGAAACTCATGCAGCGACAGCGATGGGATCATCATCGACACAGCGGTTGGCATGCTGAAAAAGACCGACCGCCATGACGAGCTCGAGCTGGTGATGCTGCACTACATGTTTGACGTGTCTAAATCCACCATTTCCCGCTGGAAGAAATGCTCTGAAGGTAAAGTCAGGCAGCAACTGATGATTGCTGAGACCTTCATTGATGCCTGCATAATCATGACCGGTGTACAGCTTGAGATGGATGATTGGACTCGTAAAACTATTTTAAGAAAATCTGCATGATCAGCTTTTCGTTACGATTTTTAGGCGCTAATGTGCTAAGAGTCGTAACAACGCAGCGTTACTTATTTTCGAAACCTCGCTCCGGCGGGGTTTTTTACTGGGCAGCGCTTTTAACGCTGCCTTTGAAATCATTTCTTGAGTTGCAAGTACCAAGCACGACTAGTAGCTTGAGTTACGAGCACGTGGTAATGCCGCTTTACCTGCGCAGCCATTCCATGCACTTCATTCCTGACTTCCTCAACCGTAAGGTCTTTGCTGGTATTATATTCAGCATCGGGAAGTATAAAAACTGTGCCGTCGTCACTAGTGATTTCGCGAGAATAGCCTTTCGCTTCCATCAGATCGTGAAGGCTTTGATAGTCTTCACCATTTGCACCATGAAGCACCACCCTTACCGTAAAATCAGACATTTTAAAACTCCATTCATAAGGTGATTTAATAGATTACCAGATAAATCTGAATTTTGAGCGCTGGTGACTGACAAGATTTCAGCTTCATTGACATCAAAAAATTATTGAACTGGTTTGGGCGGGCTTTTTGCTTTCTGACTCTTGAACACATACATACGGGATAATCTCGTATGTAGATAGCCCCAGATTATCCCTGTTGCCGACGGGCAAGGCTTTTACCGCTATAGCGTCAGGGTTCCATTCAAAGAGGTCGCCATCGCGCGGCCTTTTCTCGTTTTTGCGCACATCAATCAGTCTCCACACACACTTTTGACGCCGTGGTGCTGCGCATCTTTCTTATGACTACTGACAGCACCTGCCAATTAACGGAGGTGAGGATGAAACGCATGCCGGACAAAGACGTTGGGTTCTGGGCAAGCCTGATTGCCTGGCTTTACGCCCACAAAAACGAAACCGGCTATGCGGGTCTTGCCGGAGTCATGGCGATTCTGAGAGCCACTTACGTGGGTAAAGACGCATGGTCACGCCGCCTGCTTGATGCTGCGATGTGCAGCGTCTTCGCCTTCTTCCTGCAGCCAAGCCTGCAGGTGATTGGTTCAGTGTTCAATTGGCACTTCAGTGAAGACATCACGCGGGTTGCTGCGGTCTTTCTTGGCTTCCTCGGTGTGGACTACGTGTCTACGAAGATACGCCGCCAGATAGATAAGCGATTGGGGGACAGTAATGCTGACAGCCAGTAGTTTTCAGCTCGCGACCGGCGTGAGTAATGCGCTGCGTGATGCATGGTTTCCTCATGTAGCGGCAAGCGTCTCTGCGTTCCAGATAAGTACGCCATTGCGTCAGGCTCACTTTCTGGCGCAGACAGGGCATGAGTCAGCTGGGTTCCTGAAGGTGGAAGAGGGGCTGAACTACAGCGAGAACGCGCTTACGGCAATGTTTGGTAAGCGCATCACCGCTGAGCAGGCCCGCGCCTATGGGCGTAATGCAATGCATGCGGCTAACCAAAAGATGATCGCCAGCATCATTTACGCAAACCGTAATGGCAATGGTGATGTTTCTTCGGGGGATGGGTACCGCTATCGCGGGCGTGGCCTGATTCAGATTACCGGCAAGTCCAACTATGCAGCACTGGTGAAACAGCTGGCCGCTGATGTAGCGTCAAACCCTGATTTATTGCTGGGCTATCGCTTTGCTGCGATGTCTGCGGCGGCATGGTGGAAGAATAACGGCCTAAACGAGCTTGCTGACTCTGATGATGTTACCCGCATCACCAGAATCATTAACGGTGGCTTAAATGGTCTGGACGACCGGAAATCCCGCTTATCAAAATCTAAGGGGATTCTATGTTCAACGTAATCGGCTTTATCCGAAACAATTCAGGCCTGGTCATCATCGGTCTTATCTGCGTGGCGCTATGGGGACTGAACGCAAGTAACTCACAGCTGAAAGCAACGAACGACAGACTTGAGAAGCTGGCAAACAGCAAAGACGAGCAGATTAACGACCTGCGCTCCAAGAACGATGGTCTGGCATCAAGTGTCACCGAGTTGGTAACAGCCGTTAAGCAGCAAAACGAAGTGATGAGTCAGGTCACAGAGCAGCGTGCCGTAACAGCCCAGCAGAACCGGAAACTACAGAATGAAATTAAGCGTTACCTTGCGGCGGACAAGTGTGCTGTTGCTCCTGTTCCCCCTGATGCTGCTGACAGGCTGCGCGACGCAGCAAAAGCCGCTGGTGGAGTACCGGACAGTAAAGCAACCTCAGCTAAACCTTCCGGCAGAACTGACCAGCCAGATTGATGTGCCAGCGCCGTCACAGGATATGACGTTCGGTGATAGCGTAAGCCTCAACGCTGAGTTATATGGCGCTCTGGGACAATGCAACATCGATCGCGCAGCTATCAGGGAAATTGAGATTAAGCAGCAAGCAAGTTTTTAATGGGCTTTTTTTACTTTCCTGAGATACGTTTATTAAAATCATCTATGATAACCTTCTTTGAAGCTGGAGATGATCATGGATGCTGTCGGACAAATTGTAATAGGTGTCTTCATCGCGGTTGCCACTGCATATGCTGCATTGATGCGGTTTTATGATGAAAAATGGTGGGAGAAAAAACTTCATTATTTCCTTTCTCAAACCGATGCGGCTTACTTACTACATAGATCTATTAAGTATTGGAGTGATAAGGGGCAGTATCTATCTGATGCTAGTTTGCATCCTGAATTTATAATGTTAAACGGGCAAGAGGAGGCTGAACTACAAAAGCAGTTCATTTGCTCATTAGCCGAATTAGATAAATTTTCATATATGGGTTCTCTTTTAACTTCAGAGAAATCAACTAAAATTATTAAAAGATTTCAAGTCTCTTATAGAGGTGTTGCAAAGCGCCTCATTGCGAATGATAAAGATAAAGACGCGTTAGCATCTGGGTTGGACTTCTCTTTAACTTTGCTCAATGAGTTAGTGGAAGAAGCTAAAGAGGAACTAAAAATTAAAAATGGAAAGAGGCTTGCCTTTTTTGAGAGATTTAAAGGGTAAGTGATTACAATGGAAATTTTTTATAATTAATTGTTTATTTTTAAGTATCTTTGCAGTCTAGAATAAAATTATTATTCTTTAACCGGTGTTTCTTGAGTTATTGTGTGGACTCCATTTTCTAAAAGGAAATCACATGTTTAATGAAAATTTTCTTCCACCTGACCCCGATAACCCAGGCTGGGTAATTGCGTGGGGTGTTGTTAAGAGTGAAAAATGGGACCTGGCAGGCGTTTATGGCAATCAAGAAGATGCAAACGCTAAGGCAGAAGAGATGGGGGACGATTATCAAGTGCACTTCGGCTCACATAAGTTAGGTACCAAAGATTTCATCTGGAATGATTAACTATCAACAACATTCAAACCGCCTAATGGCGGTTTTTTTGTGGAGTGAGAATGGCAGAGCCACGCATCTATAACAGCCGCTGGGACAAGGCCAGGCTCTCCTTCCTGAAGTCTCATCCTCTCTGCGTCATGTGCCACCGGCAGGGCAGAGCAGTGGCGGCTGCTGTTGTTGACCACATCAAGCCACACAGGCTTAAGGAGGCCATTAACAGCGGCAAGCAGGACGAGATAGCGAAGGCTCAGAAGCTCTTCTGGGACAAGACCAACTGGCAACCCCTCTGTAAGCAGCATCACGACTCGACAAAGCAGCGTGAAGAGAAGCGCGGTCACGTGATTGGATGCGATGAGAACGGGCTGCCGCTCGATCTACAGTCACACTGGTACAAAAGTATAAAACCCTAAGTCAGAACGAAGAGTTTGCACACAATCATTAATTTCCAATATGTTATGAATTCATTTTAACTGTTTGGATATGAATTTATGTTGCAGCGTGGAGAAGCCTCAGCAGTAATGACTAAGCTCGTTAATGACTCCTCGGGGTATTTGGCGGCTTATGCAGCGTTAGCTTCAGTGGCTTACTTGATCCATAAGAAGGGAAATTTTTTAGTCTCAAACGTTTTTACAATATTGGGGATGCTCTGTGGTGCCATTCTCTTTTTTTACTGGATAGGACATGTGGTATCTCTTTGTGAGGGCCTACGCGATAAGCATCAGAGATCTAAGTCAGCGATGTTTCATACATCAGTAATATTGTTTACTTTTTTATTAGGTTATCCAGGAATTATAGCGGTAACCATTTGGACTGTATTTTATTCAATCAAATAAGCAGGTGTGGGGTGGGTTAAGAGTTCAGAGGTAATTGGCCTCCTGACCGCCCGCCCCCCTTTTTATGCACAACCGCGAAATGAAAAGTTTTTTTCTGGGAGGTTTTTATGGCCGGAAGACGACCAAAACCGACCCATCTTAAGGTCGTTACCGGCAATCCGGGCAAGCGAAAACTAAACGATAAAGAGCCTGCACCTGCGAGAGAAATCCCCAGCCCGCCGTCACACCTCACTGATTGGGGAAAGGTTGCGTGGGGGAAGCTGACTGTCCTGCTCGACGGGATGGGCGTGCTGACCGTTGCCGATGTTCTGGCGCTGGAAAGGCTCTGTGATATTTACGCCGACATTCTTCAGCTACGGATCACGATTGCCGAAGAGGGCAGAACTTACACAGTCCAGACCGATGGCGGATTTCTGATTAAAGCCAACCCGGCTGTTTCAATGCTGGCTGATGCAGACCGGCGATTTAAAAGCTACCTGGTAGAGTTCGGCCTGACACCGGCTGCCCGGTCAAAGGTGAACGTGAATGGTGGAGAAAAAGAAGAAGACCCGCTCAACCAGTTCTTCGGTTGATCCGGCGACGCAGTATGCAATGGATGTTACCAGCGGTACGGTAATTGCCGGACCAGACATTCGCGCCGCTTGCGCTCGCCACATAAGGGATTTGGAAGAGGGTCCGAAGCGTGGCTTGTTCTGGGATGTTGAAGCTGTAACGCGTGTCGTTAATTTCTTCGCGCAGGTTCTGAAGCTCAACGGCGGTGAGCATGAGGGCAAACCCTTCATACTGCTGCCGTGGCAGTGTTTCATTGTTGGCTCACTGTTCGGCTGGAAGGCGGAAGACGGCACTCGCCGCTTTCGCATGAGCTATATCGAGTCCGGTAAGGGCTCGGGTAAATCGCCACTGGCGGGCGGTGTTGGTCTTTACCTGCTGATGGCAGACAAAGAGCCTCGCGCCGAAGTGTACGCGGCGGCCACGAAAAAAGACCAGGCGATGATCCTGTTCCGCGATGCGGTGACGATGGTCGATCAGTCGCCCGCGCTGGCGCAGCGCATCACCAAATCCGGCACTGGACTGAACGTATGGAACCTTGCGTTTCTGCAGACGGGCTCTTTCTTCAAGCCGATCAGCTCCGATGATGGTCAGTCAGGACCGCGTCCACATGGCGCACTGATTGACGAAGTGCATGAGCACAAAACAAATGCCGTTGTTGAGATGATGCGTGCCGGTACAAAAGGCCGCCGTCAGGCGCTGATGTTCCTGATCACCAACAGCGGCCACGATAAAACCAGCGTCTGTTATGAGTATCACGAGTACGGGCGCAAGGTTGCTGCCGGTGATCTGGTCGATGACAGCTTCTTCAGCTTCATCTGTTCACTTGATGAGGGCGATGACCCGTTTAAGGATGAGTCCTGCTGGGGAAAAGCTAACCCGTCTCTGGGTCAGACCTTCACGGATAAATATCTGCGGGAGCAGGTAACGCAGGCGCGCGGCATGCCATCAAAAGAGAGCATCGTCCGCCGCCTGAACTTTTGTCAGTGGGTGGAAGCGTCCGATCCGTGGATTGACAGCGACACCTGGATGAACTGCGAACAGGATTTTGATCCGGAGGATTTAGCGGGTGAAGAGTGTTATGGCGGTCTGGACCTGTCCGGTTCCCGTGACCTGACGGCACTGGCGCTTTACTTTCCGAAATCCAAAAAGCTTTTAGTTGAGTTCTGGACGCCGAAGGACTCATTGCTGGAGCGCGCTAAGACTGACCATGTTCCCTATGATGCCTGGCTGCGTAACGGCTTTATTCACGCGCCACCGGGTAAGGCGGTCAATTACGGTTTTGTGGCGGTGCGTATCGGTGAGCTGGCGGCCAGATACGATATTAAGTGCATCGCGTTTGACCAGTACCGCATCAAGTATCTGGAGCCCGAACTCGAAAGCGAGTCTGTGAGCGTTGACCTTGTTCCGCACGGTCAGGGCTTTTATAAAGCGCAGGAGTCCGGGCTGTGGATGCCGCGATCAATTGAGCTGTTTGAAGAGCACCTTAATAACCGGGTGCTTGTTATCCGGCCTAATCCCTGCCTGCGCTGGAATGCCGCCTCTGCAGTGCTTGAAGCTGACCAGAAGGATAACCGCATATTTGCCAAAAAGAAAAGCACCGGCCGTATCGATGGCGTGGTGGCTTCGGCTATGGCAATCGGTGCAGCAGAGGATGCGGTGCTGGTGGATAGCGGCGATCCTGATGACTTTTTTGATGACCCGATCATGGTAGGTATCTGATGAAGGAAAAAAAACAGCCGGGTCGCATCAAGAGCGCGATTGTGAACTGGCTCGGTGAGTCGATTGGGCTTAATGACGCTGCGTTCTGGCAGGAGTGGTACGGCACAAGCAGCAGCGGTAAGGTTGTGACAGCCGAGAAAGCGCTGGCGCTGGCCTCTGTCTGGGCCTGCGTGCGCCTGCTGAGCGAGTCAGTTTCAACCCTGCCGATGAAGGTTTACGAACGTGCTGCTGACGGCTCGCGCAAGCTGGCGCTCAATCATCCGGCCTATCAGCTGCTGTGCCGCCGCCCGAACAGCGAAATGACGCCATCGCGCTTCATGCTGATGGTCGTTGCCAGTATCTGTCTGCGGGGTAATGCCTACGTTGAGAAAAAGATGATCGGCACCAAGCTGGTCTCACTGGTTCCGCTGCTTCCCCAGAGCATGAAGGTGGAGCGGCTGGACAGCGGCGAACTGCAGTACACATACACAGAGAAGGGCGTACCGCGCATCATCCCGGTTAAAAATATGATGCACATCCGGGGTTTTGGACTGGATGGTGTATGCGGAATGATGCCGATGCGCACCGGGCGAGACGTGTTTGGCGCAGCGATGGCGGTCGAAGAATCAGCCGCAAAAATTTTTGAAAACGGTATTCAGACGTCAGGTTTCTTTCTGTCAAAGAACCTGCTGACCAAAGAGCAGCGCCAGAAAAACCGCGAAAACCTCAACCGGTTCGTTGGTTCAAAAAACGCGGGCAAGGTGATGGTCCTTGAGGGTGACATGTCCTATCAGGGCATCACCCTTAACCCTGAAGATGCTCAGATGCTGGAGTCACGATCATTCAGTATTGAGGAAATCTGCCGCTGGTTCCGCGTGCCGCCGTTTATGGTGGGTCACGTTGATAAGCAGAGTAGCTGGGCGTCGAGCGTTGAAGGCATGAACCTGCTGTTCCTGACGAATACGCTGCGCCCGATGCTGGTGAACATTGAGCAGGAGATTTCACGCTGCCTGCTGAACGGTGATGAAGACCTGTTTGCTGAGTTCTCCGTTGAAGGCTTGCTTCGTGCCGACAGCGCCGGACGCTCCGCTTATTACACCACTGCGCTGCAGAATGGCTGGATGTCCCGTAATGACGTGCGCCGTCTGGAGAATCTGCCGCCGATTGAGGGTGGTGATATCTACACCGTGCAGCTGAATCTGACACCGCTTGAAGACTTACGCAAAAACAGCACCGCCGCAAGGGCCACGCTGTTGCGTGAAGTTCACAACGCCGTTTTCCCGGACATTCCTTTCGAACAATCACCGCTTAAACAGGCGGCTTAGGAGCACCCCCAATGACAGTAAAAAGTCTTCCGGCAGCGCCGGAGGGGCGGCCTTTTGCGCGCGAAAATCGCGATCTGCCGTCTTCTGCAATGGAGCGCTGGAACGGCGGCATCAAGGCCGCAAAGAGTGATGACAACAGCATTTCCGTGTTCGACGTCATTGGCGCTGACTGGTACGGCGACGGCGTTACCGCCAGCCGCATCGCGGCCGCGCTCCGCTCAATCGGCGGTGCCGACGTGACCGTGAACATCAATTCGCCGGGCGGCGACATGTTTGAAGGCCTGGCGATTTACAACCTGCTGCGTGAATACGAAGGGAAAGTCACCGTCAAGGTGCTGGGCCTCGCTGCTTCTGCTGCGTCGATTATCGCGATGGCCGGTGATGAGGTGCAGATCGGTCGCGGTGCCTTCCTGATGATCCATAACTGCTGGGTGTACGCGATGGGCAACCGTCACGACCTGCAGCAGATTGCGGCGGACATGATGCCTTTTGATAAGGCGATGAACGATATCTATGTCGCACGGACCGGTCTGGATGCTTCCACCATCGACGCGATGATGGATGCAGAAACCTACATTGGCGGCAGCGATGCGGTTGAAAAAGGTTTTGCAGATCGCCTGCTGGCGGCAGATGAGATTGCTGACGGCGACGACAGTCCTGCAGCGGCTCTGCGCAAGCTGGACGCGATGCTGGCAAAAACCGATGCGCCGCGCTCCGAGCGTCGAAAACTTCTTAAAGCACTAACCGGCAGCAAGCCAGGCGCTGCTGCCACCCCTGAAGGTATGCCGGGCGCTACCGACGAAATCAACCCCGAAAATATTGCTCAACTTAAAAACGCGCTGGCCGCGTTCGGCTAATAAGGATTCATCATGTCAGATGTAAATGAGTTACTGAAAAAGGTATCTGCAAAGCTGGAGGAAGTGTCCGGCACTTTCAGTCAGAAGGCTGAAGATGCGCTCAAAGAAGCGAAAAACTCCGGCCAGCTGTCTGCTCAGACCAAAGAAGCAGTAGATAGAATCGCCACTGAATTTAACGCGCTGACTGAGGCAAACAAGTCACTTAAGGCATCGCTGGGTGATCTGGAGCAGCACGTTGCGCAGATGCCGCTGGCGAATGCGAAAAATGTTATCGAAACCGTGGGCGGTCAGGTTGTTTCTTCCGAAGCGCTGAAAGCTTTTTCGGCCAGCATCGAAGGTAATAAGCGCCTGAGCATTCCGGTTAAGGCGGCACTGCTGTCGGTCAACGTGCCGGGCCAGATCGTTGCACCTGACCGCCTGCCAGGTATCGATCAGCAGCCAAAACAGCGCCTGTTTATCCGCGACCTGATTGCGCCGGGCCGTACTGAGTCCAGTACCATCTACTGGGTTCAGCAGACCGGATTTACCAATAATGCGGCGACCGTCGCTGAGAACACTAAGAAGCCGTACAGCGATATCACCTTTGCGGAAAAAATCACGCCGGTCCGCACTATCGCGCACCTGTTCAAAGCCGCCAAGCAGATTCTGGATGATATGCCGCAGCTGCAGTCGACGATTGACGCCGAACTGCGCTACGGCCTGAAGTACGTTGAAGAGCAGGAAATTCTGTTCGGTGACGGCACCGGCACGCACCTTAACGGTATCGTTCCGCAGGCATCGAAATACGCAGCTTCATTCAGCGTGGCAAATCAGAACGGCATTGATGATCTGCGACTGGCTATGCTGCAGGCGCAGCTGGCACGCTTCCCGGCGTCCGGCCATGTTCTGCACTTCATCGACTGGGCGAAGATCGAGCTGACTAAAGATTCGCTGGGCCGTTACATTCTGGCGAACCCGGCAGCGCTGACTGGTCCTACACTGTGGGGGCTGCCGGTCGTCGCGACTGAAGCCGCTGCATTCCAGGGTAAATTCCTGACCGGCGCATTCAATGCCGGTGCACAGATTTTTGACCGCGAAGATGCCAACGTGGTTATCTCGACCGAAAACGCCGACGACTTTGAGAAAAATATGATCTCAATCCGTTGTGAAGAGCGTCTGGCGCTGGCCGTTAAGCGTCCTGAAGCGTTCGTTTACGGTTCCTTTACCGCACCTGCTGCAGCTGCGTAACAGCAACGGCGGCCTCCGGGCCGCTTTTCCGGGAATTACATATGAAACTGCTTTTGATTAAACCGAATTATTTCGGTGGCACTGTCGTTTCTGAAGACAACACCATCGAGACCGACGAGCAGCATGGTCGCGAGTTGATTAAAAAAGGCTATGCAGAACTGGTTGAAGACGATACTGCTGCTTTGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGCAAAAGGTAAAAACAAAAAAGGCTGAACACCATGCTGCTGACGCTCGAAGAAATTAAACAGCAGTGCCGACTGGAGAGCGACTTCACGGAAGAAGATCGACTGCTTGAGCTTTTTGCGCTGGCTGCTGAGGCAAAGGCGGTGACCTACCTCAACCGCAATCTTTATAAAACGGTGGCAGATATTGCACCGCTTGATACGGACGGCATGGTCATTACCGAAGATATCCGGCTTGCCCTGCTGATGCTGGTCAGTCACTGGTATGAGCATCGCAGTTCAGTGTCAGAGCTGGAGATGACGGAGACGCCGCAGGCTTTTCAGTTCCTGCTCTATTCGCGGCGTCTGCCGGTGTCGGGGTATTAGCATGCAGCAACGCTCATCAAACACCAGTGCCGTTTACACGCTGCCCGATCCCGGCGAGCTGAATAAGCGCATTCACTTGCGCCAGCGCATCGACCAGGCAGCAGCCGATTACGGTACAGAGCCGGTCTATCAGAATGAAAAGGACGTGTGGGCTAAGGTCCGGCAGGTGGGTGCCACCACTTACCATGAATCTGTTCAGGCTGATGACACCATAACCCACTACATGACGATCCGTTACCGCCGGGGCATCACTTCTGATTTTGAGGTGGTTTACGGTGGCTACGTGTATCGCGTCAAGCGCCTGCGCGACCTCAACTCTGCCGGTCGTTACCTGCTGATGGAGTGCGAGGAGCTTAGGGCTGTGGAAATCGACGGAGAGATGTATGGCTAAGCCGCTTCTGCACGTTGATTTTCAACAGCCCAAAGACCTCGTTTTTAACCGGGCAAAAATGCGCCGCGCCTTCATTCAGATTGGTCAGGTTCACATGCGTGATGCCCGGCGTCTGGTTGTGCATCGTGGACGGTCTGCACCGGGGGAGTATCCGGGATTCAGGACCGGCAAACTGGCGCGGTCCATCGGCTATTACGTTCCCCGCGCATCAAAAAGCCGTCCGGGCCTGATGGTGCGCATCGCGCCAAACCAGAAGCGGGGCGAGGGTAACCGCCTTATCGAGGGTGACTTTTACCCGGCGTTTCTGTTCTACGGCGTGAAGCGTGGATCTAAGCGCAAAAAGAGCCATCACAAAGGGAAATCCGGCGGCAATGGCTGGCGCGTGGCCCCGCGTAAAAACTATATGACCGAAGTGCTGGAGGCGCGCAAAACGTGGACGCGCTATGTGCTGACCCGTGCGCTGCGTACCTCCCTGCGTCCTGAAAGGAAAAAGAAATGAAGCTATCACTGGTAATTGCCGCACTCCGGGCGCGGTGTCCGATATTCGCGGGCAACGTAGCCGGAGCGGCTGAATTTAAGTCTATCCCTGAAACCGGGAAGATGAAGCTGCCGGCGGCGTATGTGATCCCGACCGAAGACATTACTGCTGAGCAGAAGTCCATGACTGACTACTGGCAGAACGTGACCGAAGGCTTTGCGGTGGTCGTGGTTCTGGACAACACGCGCGATGAGCGCGGTCAGGCAGCCGGTTATGATGCCGTGCATGATGTGCGGCAGCAAATCTGGAAGGCGCTTCTGGGCTGGGAGCCTGACGACGATGCAGGCCCGGTGGCGTATTCCGGTGGTCAGCTTCTGGACATGGATCGGGGCCGACTCTACTACCAGTTTGAATTCATGCTAACGCGGGAAATCACTGAAGAGGACACGCGCCAGCAGGATGACCTCAATACCCTGGATGAGCTGAAATCGGTCGATATCAACGTTGACTACATCGAGCCAGGTAATGGCCCTGACGGCATCATTGAGCACCACATCAAAATCAACCTCAGCGAGTAAATCATGCAACTCAGACCCAAGCGCGGGCGGTCAGTTCCTGACCCTGTCCGGGGCGATCTGCTGCCTTCAGAAGGCCGGAACGTCGAAGAGAGCAGCTACTGGCACCGCCGCATTGCGGATGGTGATGTCGAAGAAGTCAGCGCGGAAGAAGAAAAGCCCGCCGCTGACGCTAAGAAAAAGGGCGGTGAATAATGTCAGTATCGTTCCCCACTATTCCGTCAGACCTCCGCGTCCCGCTGTTCTGGGCGGAGATGGACAACAGCGAAGCGAATACCACGCAGAGCAGCGGCCCGTCACTGCTGATTGGCCTCGCCTCAACTGACAGCTCCATCGTTAAAAACAAACTCACCATCATGCCGTCTGCCGCACTGGCGGGTAAGGTTGCAGGCCGTGGCAGCCAGCTGGCCCGCATGGTGGCACGGTATCGCGCTGTCGATCCGTTTGGTGAGATGTGGGTTATCGCAGTGACCGAGCCAGAAGGCGAGACTGCCAAAGGCACCGTGACGCTTACCGGCAATGCGCAGGCGTCAGGTTCTCTCAGCCTGTATATCGGTGCGGTGCGCGTTCAGGCCGCCGTGGTGACTGGCGACGCTCCGGCAGCCGTGGCCGCGACATTGGCCGCAGCCGTTAACGCCAATGCTGACCTGCCGGTGACCGCCGCTGCAGCGGCTGGCGTTGTCACGCTTACTGCCCGTCACAAAGGCCTAACCAGCAACAGCATTCCGCTGGCGCTGAACTACTACGGCACCGTAGGAGGCGAAGCCACGCCTGACGGCGTTAACGTTGCGATTGCCGCTATGGCTGGCGGTACGGGCTCACCGTCACTTGCTGCGACCCTGGCCGCGATGGGTGACGAGCCGTTTGACTTCATCGGCACACCGTTCAGTGATTCCGCCTCTCTGGCGACACTGGCGCTGGAGATGAACGATTCTTCCGGTCGCTGGGGCTATGCACGCCAGCTTTACGGCCACGTTTACACGGCGAAAATCGGCACGCTGTCAGACCTGGTTGCATTTGGCGACACCATGAACAACCAGCACATTACCGTAGCCGGTTATGAGCCTGCCGTTCAGACCGCTGCTGATGAGCTGGTCGCACTGCGCACCGCACGCAACGCCGTGTTTATCCGCACTGACCCGGCCCGCCCTACGCAGACCGGTGAGCTGACCGGCGCATTACCGGCACCGGCAGGCAGCCGCTTTACCCTGACCGAGCAGCAGTCGCTGCTGAAGCACGGTATTGCCACGGCCTACGCTGAGAGCGGCGTGCTGCGCATTCAGCGCGACATTACTACCTATCAGAAAAACGCCTATGGCGTGGCGGACAACAGCTACCTCGACAGCGAAACGCTGCATACCAGCGCCTACGTTATCCGTCAGCTGAAGAGCATCATTACCAGCAAGTACCCGCGTCATAAGCTGGCGAATGACGGGACGCGCTTCGGTCCGGGTCAGGCCATCGTGACGCCTGCTGTGCTGAAGGGTGAGATGTGCGCCAGCTATCGCACGATGGAGCGGTCGGGGATCGTGGAGAACTTCGATCTCTTCAAACAGCATCTTGTGGTAGAGCGCAACGTCAGTGACCCGACCCGCGTGGATGTCCTGTTCCCGCCGGATTACGTTAACCAGCTGCGCGTCTTTGCGCTGCTTAATCAGTTCCGTCTGCAATACAGCGAGGAGACCGCGTAATGGCAAAGATTGCGGGTACAGCATACGTCAAGGTGGACGGCCAGCAGCTGTCGCTGACCGGCGGCATTGAGGTGCCGATGAACACCAAAGTGCGAGATGACGTTATCGGCCTGGCCGGTGACGTGGATTATAAAGAGACGCACCGCGCGCCCTATGTCAAAGGCACCTTTAAGGTGCCAAAGGCGTTTCCGGTCACCAAACTGATGGACTCAGACCAGATGACCATCACCGCCGAACTGGCTAACGGCATGGTTTACGTGCTGTCTGAAGCTTTCCAGTTCGGTGAAGCAAACCACAATGCGGAAGAGGGTACGGTTGACCTCGAATTCCACGGCTCAGAAGGATTCTATCAGTGAGTGAACTTCAGCTTTCAAAACCTATTACGGCACATGGTGAGACTATTCATGTGCTGGAGCTGCGCGATCCAACGGGCAAGGATGTCCGTGAGCTGGGCTATCCCTACCAGATGAATCAGGATGAGTCAGTAAAGCTGCTGGCTCACGTTGTGGCTAAATACATCAGCCAGCTGGGCGGCATTCCACCCAGCTCAGTTGATGACATGTCGCCATCAGACCTAAATGCTGCTGGCTGGGTAGTTGCCGGTTTTTTCCTTCAGGCCTGACAGCTAAAGATCTGCTTAACCTGTATTTTGACTGCGCCAGTTACTGGCGCATAAATCCTCTGGAAGTCCTGAGTGAGGACTTAAAAAGCCTGCAATTGCTTATTGACCAGGCGAACCGGATAGAACGGGAGCGAAAAGCCAATGGCTGAATTTGAACTGAAGGCGCTCATTACTGGCGTTGACAGACTTTCGCCTGCACTTGGTCGCATGCAAAAGAACCTGCGCCGGTTCCGTAAAGATGCAGAGGAATCGGGCAGGGGCGGCATGGCTATGGCAGGCGGTCTTGCTGCCGGGCTAACAGGTTCACTGGTTGCTTACGCTAAGCAGGAAGATGCTGCGACGGGCCTTAAGGTCGCCATGATGGACGCAAACGGCGCTGTTGGTTCTGACTTCGAAAAGATCAATAAGCTGGCTATAGGCCTCGGAAATAAACTGCCTGGTACTACTGCTGACTTTCAGAACATGATGCAGATGCTTGTCAGGCAAGGTATCCCGGCTCAGAACATCCTGAGTGGTGTTGGTGAGGCATCGGCTTATCTGGCGGTTCAGCTCAAGAAAACGCCTGAAGCGGCTGCAGAGTTTGCTGCAAAAATGCAGGATGCTACCGGCACTGCTTCAGAAGATATGATGGGATTATTCGACACCATCCAGAAGGCTGTTTATCTGGGTGTTGATGATACCAACATGTTGTCCTTCTTTTCTAAAACCAGCTCCATTCTGAAAATGGTCAGTAAAGATGGACTGACTGCGGCCCGCGCCCTGGCACCAATTTCCGTGATGATGGATCAGATGGGGATGGAAGGTGAGGCTTCCGGTAACGCCCTCCGAAAAGTATTCCAGGCGGGCTTTGACGGTAAAAAGATGAAGGCTGCCAATAAGCTGCTAAGCAAAAAAGGCATTACGCTGGATTTCACAGATGGTAAGGGTGAATTCGGCGGGCTTGATAATCTTTTCAAACAACTGAATAAGCTTCAGTCATTAACAACCAAACAAAAAACCACCATTATTAAGCAGATTTTCGGTGATGACGCTGAAACACTACAGGTGCTAAATGCGCTGATTGATAAAGGTAAAACCGGCTATGACCAGATTCAGGAGAAAATGGGAAAGCAGGCGGACCTTAATAAGCGTGTTAATGCTCAGCTAAGCACACTTTCGAATATCTGGGAGTCATTGACAGGTACAGCAGTCAATGGACTTGCGGCTATCGGCGGGGCTTTTGCTGGTGATGCTAAAAGGCTGGTGGGCTGGCTGGGTGACATGTCAGAGCGATTTACTGAGTTTGCTGACAAAAACCCCAAGGTTATTCGTGGTGCATTTGGCATCGCGGCCGGTTTTGTCGGAATGAAGCTGGGGCTGCTGGGAATCAACTTTGCGCTTGGGATTCTGGGGCAAGGCCTGAAACTTTCTCCGATGGGTATATTCCTTAGACTTGCAGCGCTGGGGATTGGGTTATTGATTTCTGACTGGGATAAGTTCGGTCCTGTTGTCGAGAAAGTCTGGACTAAAATCGATGGCCTGACAGAGGCGCTGGGTGGCATGAACAGTATCATCACCGGAATTGGCGGGGTGATGGCTGGTGCATTTACGCTTCAGGTTATTGGGTCGCTCACCACAGCCACAGCTAAGGCAAGCGGGCTTCTCGTTGTTCTTACCAAAATAGGGAAGTTGAGCGCACTGACGGTTTCAATAGCGGTTGCGCTCTACATGTTCAAAAAGCTGGAAGAGATTTCCGATGCGACTACTCAGAAAGATGGCACGGAATCATTCTGGGAGTCACTTAAGAAGAGATGGAAGGCTGGCGGCTGGTATAACAATGAGCAGCAGATGAAGGGAGGTGATGGAGCGTTAATCCCTCAAAGTATGAACGTGCCGTTAAAACGCGATGATCCATCATCGCAAAGGGGAGAGTTAAAGGTTTCCTTTGAAAATGCTCCTCCAGGAATGCGCGTTGAGCCAGTAGGTAGTGCTCTGCCATGGTTTGACCTTGATGTGGGTTATAACCGTTTCAGTAATAAAAGCTAAAGTTTTTCCCTTGATTGCCGAACATGATTGGGTCATTAACTCAAATTTATAAGGGAATATTATGAAGAATCATCTAATTGCTAGCTTGGTTATTGGAGGCTCTTTAATCATATCTTCTGCAATTATCTCAGGAGCGATTCCATTGAAAAATGAAAACGTACTGCAGGTTGTAGACGGAAGTGTGAAATTGGGTAACGTCTTCAGTGAAGATGATTTGGTTTCAGCTAAGTTGGTTTTTAGTGATGATAATCAAGATCAGGTGTTATTTGAGAACGTTGGTCCAGAAGAGACTGGCACTCGTCTAGAGGAAAAAATTAAAGAGCTTGCAAAGCAAATTAATTACAACGAAAAGGATGAGTCAAAAAAGATAGCAGCAGAGAAGCTGTCAATAAAGATTCCTTTTACATTAGTGTTAACTTCATCGGTGAAGTATCGCTCCGAGTACCAACCATCCTTTACTTTAAATTTGACGAAAAAAAATGTTGATCTGCCAGCCAACTCAAACATGCTGGAAACCATCAAACCTGCCATTGATAATTTTGTTAAGAGTCAGAAAGCACAATTTGATGCGTCGCATTTCATAAAGTAAGCAATTCATTTCAAACCCGCTCCGGCGGGTTTTTTTATGCCCGGAGTAAGCCATGAGCTGGAAAGACAATCTGCAGGATGCCTCACTGCGTGGTATCGCGTTTAAAGTGGACAGCGACGAGGCAACCTTTGGCCGCCGCGTGCAGGTGCATGAGTACCCCAACCGCGATAAGCCGTGGGCGGAGGATTTAGGCAGGGCAACACGTCGCTTCAGCGTGCAGGCCTATCTGATTGGCGATGACTTCTTTGAGCAGCGCAACCGGCTGATTGAGGCTATCGAAAAGCCAGGTTCCTGCACGCTGGTTCACCCCTACTACGGTGAAATGACCGTGGTTGTTGACGATACTGTGCGTATCAGTCATTCGCAGAGCGAAGGGCGCATGTGCCGCATCAGCTTCAGCTTCGTTGAGTCTGGCGAGCTGTCTTTCCCGACCGCAGGGCTTGCAACCGGCCAGAAACTATCCTCTTCCGTTTCGTTTCTGGATGACGCCATTTCTTCAGCATTCGGTGCTTTTGGTATGGATGGGATGCCTGACTTCCTGCAGGACGGAGTGCTGGATGAGGCGACAGGCATGTTCAGTACCGTGACCAGCGCGTTTCAGTACGTTGACTCTGGCATCAGCGCCGCATCGCGCCTGATGCAGGGTGATTTATCAGTGCTGCTCAGCCCTCCGTCCAGCGGTGCGAGCTTTGTTAACCGGCTGCAGACCATGTGGCGTGCCGGAACGCGGCTGACGGGCAACGCTTCTGACCTGATTTCGATGATTAAAGGGCTGACCGGCATCACGGTTGATTCGGGTCTGGCACCGCGCGGCGTCTGGAAAACAGACAGCAAAACCGCACAGGCGCAGACCACACAGCGCAATTATGTGGCTCAGGCGGTACGCACCACGGCAATCAGTGAAGCAGCCGCAACGGTCACCAGTCTGCCGCAGCCAGCAAATCGCACTGTCACGCGCCAGCAGGACCCGCAGCAGCCGGTTGTAGTATCGCATCCGGCCGTCAGCAACATACGGCCGGATTCAGGCAATGCCGCTTCAAACACAGACACCATAGCGACCGCTACCGCTTCCACGTCTTCAGGTGTTACCACCTCACTTGATAATGGCACCGTTATTTCATGGGATGACCTGGCGCAGGTGCGAGACAGTCTTAATGAAGCCATTGACCTTGAGATGGAGCGCGTCTCTGATGACGGGCTCTATCAGGCGCTGGTCACTGTGCGCACCGATGTTAACCGCGACATCTCAGCCCGTCTGGAGCAGGTTGAGCGCATGACGGAGCGCACACCTTCGCAGGTGACCCCTGCGCTGGTGCTTGCTGCTGATTGGTACGACTCAGCAGCCCGCGCAGGCGATATCACGGCACGCAACGGCATTCGCCATCCCGGCTTCGTGCCGGTTCAGTCACTGAGGGTGCCGGTACGATGAACAACACAGTTATTTTACGGGTGAACGGTCAGGAGTGGGGCGGCTGGACTTCGGTCAGGATCGCCGCAGGTATTGAGCGTATCGCCCGCGACTTTACCGTCGAGATTACCCGCAGCTGGCCCGGCGATACCGACCAGGCGGTGCGCAGTACCCGCATTAAAAACGGCGATCTGGTTGAGGTCCTGATAGGCACCGATAAGGTGCTGACCGGCTACGTCGAGGCAACGCCGGTCCGGTACGACGCACGCAGCATCAGCACCGGGATTTCAGGGCGCAGTAAAACCGCTGACCTCATCGACTGTTCTGCCACGCCTTCACAGTATGCCGGTCGTTCGCTGGCTCAGGTGGCCGCTGAGCTGGCAAAGCCGTTCAGCATAAAGCTAGTGGATGCGGGAGGTGCATCTGGTGCGCTTCAGGGCATTCAGGCCGACCAGGGCGAAACGGTCATGGACGTGCTGAATAAAATGCTCGGGCTGCAGCAGGCGCTGGCATATGACAACGCGCAGGGAAATCTGGTAATCGGTGGCATTGGCAGTCAGCAGGCGCACACCGCACTGGTGCTGGGTGAAAACATTCTTTCCTGCGACACGGAAAAGAGCATCCGGGACCGGTTCAGCGACTATCAGGTGTCCGGTCAGCGCAAGGGTAACGACGACGACTTTGGCGAGGCCACAACTACGGCTATCCGCTCAAAGACCATCGATGGCGGCCTGAAACGTTACCGTCCGATGATTATTCGCCAGACCGGCAACGCTACCACGGCTACCTGCAGCGCACGCGCAGAGTTTGAGATGCGCCAGCGTGCTGCGCGTACCGATGAGGTCACTTACACCGTGCAGGGCTGGCGACAGGGTGACGGCTCGCTCTGGCTGCCTAACCTGCAGGTTATCGTCTTCGATCCCATTCTCGGCTTTAACAACCGCCAGATGGTCATCGCTGAGGTGACCTATCAGCAGGATGAAAACGGCACCGTCACCGAAATCCGTGTCGGGCCGCCGGATGCTTATCTTCCTGAGCCGGCGAAACCCGGCAAGCGGAAGAAAAAGACAGAAGAGGATGATTTCTAATGGCTAACCCGATTTCAGGTATGGGCCGTGCGCTCTCAAACCTGCTTGCGCGTGCAGTCGTGCGCGGGCTGAACACGGCCACAAAGTGCCAGATGCTGCAGGTTGAAATGGCGGGAGGCGAGGGCAAAAGCGACATCGAGCATATGGAGCCATACGGGTTTACCGCCGCACCGCTCACTGGTTCTGAGGCCGTGGCCGCCTACTTTGATGGTGACAGGTCACACGGTGTTGTGCTGGTTGTATCTGACCGTCGTTTCCGCATCAAAGGGTTGAAGACTGGCGAAGTGGCGGTATATGACGATCAGGGTCAGTCGGTGACGCTGACCCGCGCAGGAATTGTCGTCAATGGCGCAGGCAAGCCGATCACCTTTACCAACGCGCCAAAGGCCCGGTTTGAAATGGACATCGAATCTACCGGTGAGATTAAAGATAAGTGCGACTCTTCCGGCCTGACCATGTCAGCCATGCGGATAGCCTATAACGGTCACACGCATAAAGAGAACGGCTCCGGTGGCGGCACAACTGACGCGCCAACGCAGAAAATGGTGGTCTCATGATTATTGTCATTAACGGCGTACAGCGTGACGTGACATGGCCGCCCGATCCGCTTACGCGCGCGGTGATTATCTCTCTGTTCTCCTGGCGAAAGGCTGAGCCTGACGATAGCCCGGAACAGGATAACGGCTGGTGGGGCGACAGCTTCCCGACCGTCCAGAATGACCGCATCGGCTCCCGCCTCTACCTTCTCAGCCGCACGACACTTACCAATAAAACACCGCTCAAAGCGCGTGAATATATCAACCAGGCGCTTCAGTGGCTGGTGGATGATGGTGTGGCAGTGAGGGTAGACGTTAAGGCTGAGCGAACCGGAATTACAACGCTAAGCGCATCGGTGGTTATCAACCAGAAAGACGGCACCCGCACAGCATTTTCCTTTGACGATTTATGGAGTGAACTTAATGGCTGACAGTGGATTTACCCGCCCGACACTCCCTCAGTTAATCACCACCGTCCGCAACGATATTCTTACCCGACTGGCTGCAGATTCGACGCTGGCGGCTCTGCGACGCACCGATGCAGAAGTGTATGGCCGGGTGCAGGCAGCAGCGGTGCACACCGTGTATGGCTATATCGATTACCTGGCCCGCAACCTTCTGCCGGACCTCGCAGATGAGGACTGGCTGACCCGGCACGCCAACATGAAGCGATGCCCGCGCAAGGCAGCGACGGCCGCCGCCGGTTTCGCCCGGTGGGACGTCACAACCGCCGGTATCGCCATTCCTGCCGGGGTGACCATCCAGCGCGACGACCTGACATCCTTCACCACTACGGCGGCGGCTACTTCGGCGGGAGGTGTGCTGCGCGTGCCGGTTACCTGTGATACAGCAGGGAAAAACGGTAACACCGATGACGGGCTTGCCATGCGTCTGGTCAGCCCCATTACCGGCCTGACGTCTGCAGGGGTAGCGGACAGCATTCAGGGTGGCGCTGATATTGAAGATTTAGAGGTATGGCGTGCACGCGTCATTGAACGCTGGTACTGGACACCGCAGGGCGGCGCAGACGGTGATTATGAGGTCTGGGCTAAAGAAGTGGCGGGCATTACCCGCGCATGGACCTACCGGCACTGGAGCGGGCGCGGAACGGTGGGGGTGATGGTGGCAAGCAGTGATCTGATAAACCCGATCCCCGACGCGGCCACGGTGGCAGCTGTCAAAGCCAATATCGAACCGCTGGCCCCGGTGGCCGGTGCGGATATCTATGTATTTGCTCCAACGCCTCATACGGTTAATTTCCAGATTCGACTGAACCCGGACACTGCAGCTGTACGCTACGCCGTCGAGGCGGAATTGCGGTCAATGATGCTGCGCGATGGTGTACCTGAAGGCGTGCTGAAGCCGTCCCGCATCAGCGAGGCTATCAGTATCGCAACGGGTGAATACAGTCATACACTGGTCAGCCCGGCAGCTGATATCACTATTGCTAAAGGCGAGTTGGGAGTAGTGGGGGCAATCTCATGGACTTAACAGCACAGTACCGGCAGATGCTGGGTGCGTTACTGCCGCGCGGTCCTGCGTGGGATGGTGATGACCTGCTGCTGACGGGATTTGCTCCCTTACTGGCAGCAGTGCATGGGCGCGGTGATGCTTTGATGCTGGAAACCGACCCGCGCTCAGTTACTGAGCTGATTGACCGTTATGAAAATATCAGCGGGTTACCTGACAGCTGCGCACCGCCGGGCGTGCAGACCCTGCAGCAGCGGCGTCAGCGACTGGATGCAAAGCTCAATCTGGCGGGCGGAATCAATGAGGCGTTTTACCTGGCTCAGCTGGAGGCTCTGGGTTACACCGGCGTCACCATCACCCGCTACAACAAGAGCCAGTTTAACTGCCTGTCCGACTGCACCGACTCACTCTACAGCGACGACTGGCGTTACTACTGGCAGGTAAACATGCCAGCCGCCACGCAGATCACTGAAATGACGGCTATCAGCAACAGCACCGACAGTCTGCGCATGTGGGGCGATACCATTGCCGAATGCGTACTGACGAAGCTGGCACCGTCTCACACTTACGTTATTTTCAAATACCCGGGGTAACTATGCATCGTATTGACACGTCCACCGCGCAGAAGGATAAGTTCGGCGCAGGTAAAAATGGCTTTACCGGCGGCAATCCGCAGACAGGCGAGTTGCCTACTGCGCTCGATCAAAACTTCTTCGACTCGTTGCAGGAAGAGATTTGCGGAGTGATTGAAGGCGGGGGGATTGCATTAAACAAAGCGGATCGCGGGCAGATGCTGAAAGCGATGAAAGCCATGTTTCCCATAATGGGCCTTTTTGCCAACAGCTTATCTTCACCTGGATACCAAAAGCTTCCTGGCGGCGTACTCATACAATGGGGAACGGCCGCTATCCCATCAGCAGGCAGTGTCACGGTGACGTATCCTATAACTTTTTCAGCTGGATTTTCTAACGCATTTGTTTCACCGCTTGATAGTAGTTCTACTAATAATTATCGGGTAGGTATCGATACTTCAACTACAAGTACGATTACTTTAAGATCAACAAATACGAACAGTGTTACTGGTGTAAACTGGATGGTAATAGGTAAAGTTTAATATGATGAATAAAGTTTATTACGATGCTCAAAGTGCGGGTTTTTATTTGGAGGGCATAAATTCCCAAATCCCTTCTACAGCCATTGAGATATCTACCGAAGACTATAAAGTTTTACTTTCAGAGCAAGAAAAGGGAATGGAAATCACTCCTAATGAAAATGGCTATCCAACCCTTACTGCCCCCCCGGCCTTAACTCAACAACAGGAAGTGTTAATTGCATACAGTAATAGATTATCACTTATGGGTAAAGCATCTGATGTAATTGCCCCACTCCAGGATGCTGTTGATCTTGGTGATTCAACACCATATGAGGACGCTCTCCTCAAAAATTGGAAGCAATACAGGGTGGCACTTAATCGGCTTGACCTCTCTACCGCTCCTGATATTCAATGGCCTAAGATTCCTGGGTGACTGCTTGCGCGCCTTTCCCCTTAACTTTTGAAGTCATCTTATGGGCTATCTTTCTGCCATAACCTCGCAGCGGGGATTCAATTAATTTAAAATTCAATTCCACTAACAAAACAGTCAGAACCAGTGCTGAAACTGTCATAACAACGCAGAGAAGCACTGATTTTGACGGGCCCTGACCAATGGACACGAAGTAACGGATAGTTGATTCCTGAATGAAATAAATTGCTGGCATATGAATCACATAGATTGCATACGATCGCGACCCAACCCAAAGTAATACCTTTCTTAGTGCTAGAGGGCAATAGATATACCCCTTGTTGAAGGAGGCCACATAAATAAGAATGGCACTTACTAATGCCAGCATCCCAACCATAAACCGGCTATCGTGAAGAACTGGTATGCTAAACATTAAAAAGATCAGAGCGATAGCAGTTAAACTAAATGAAGATACTCTGTTCTTCAAAGAGATAGGTTCAAACCTGAATATCCTACTATCCATGTAAAAAATAGAAATCACAACTCCCCAAGAAATAGAATCAAGCCTGAAATTCATCAGATAAGTATCTTGCCTATGTATAAGGAATTGTATGAATATGAAGGCAAGCAACAATAGTACCCTGTACGCTTTCTTTGTGAAGATGATGAAAAATGGCAATATAAAATAAAACTGTTCTTCAAGGTTAAGTGACCAGAACGGCCCCATGCTTGTGGGCAATGAATTAACAAGCATATAGTGGCTGAGAAGGTTGTAGGTGTAGGAAAGTATGCTTATGGCCTGATAAAAATTCCATTTTATGTCGCCGAATGCTCCAGATATGTTATAGCTGAAAGTCATGCATATAATTATTGCTATCCATAGGAATGAGGTGGGAAAAAGACGAAATGCCCTTTTGATAAAGAACGCCTTAACCACACCTGATAATCCTGTCCCACGTTGCTTGCACTGGTCAATCAGTGGAATCAAGGAGCAGCTAACTACAAATCCGGATATACACAAAAATAAATCCACACCAGACCAGAATTGCAAATAGGTATTCACCTTTGCAAAAAACGAATGGTCACTCCAGAAGTACAGAGAAGGGAAGTGCTGCACAAGAACCATTATGATTGCGAAAGCGCGCAGAATTTCAATATCGCTGTTTGGTCTCAGCTTTTCCATTTGGCCTCAGATGTGAATGTTTAGCGGCAAGATAGTATCAACCAATACCTCTTGTTGTCAGCCATAAAAAGCCCGGCGACTGGATGATGACTTACCCGCTCCTGTCTGTGCAGGATACGGAGTGGGTAATTTGAGATTATTAACCCCGATAACATGATGCTGATTGAGACGCTGGAGGGTTTTGCGCTCGTAGATGGTATACTCACTGCTAAAACAGGCGATGCTGTCGCTTGTGTGCTCGGCGACTACCCGCAAGTTGAAAAATTATTCAGTTCTGGGATGACCGCTTTTGATAGTGAGATGATCAACGGAGAATGGATGGAAGGGGTTATCGTGCTGGGGAGCGTTACTGCAGTGGTTGTATCCGTTTGCAAACCTCTGCGGTCCACGATCTAGCTTTAGCTCACATGTAGCACAAAAAATACTGCAAATCTCATCAAAACTACCATCACGACAGTTTATGATTTACAGTATTTCTCTGTAAAACTACACTTCGATACACATCAACCCGACTGGTTGAATATTCAAAGTAAAATTAATTTAATGCAAATAATTAAATTGTAGATTTTTTCCAATTTTAATAAATGGTAACTCGACCAGTCTGTGCAATATGTATGCAAATGCAATTGCTATAAGCGTTGATGTAACAAGTTTAGTCAGGTTTGCAGCTTCGCTCCAATAATAAGGAGGGTGTTCAAGAAGAAGATACATAATGAAAAAGTGTGTAATATATATCGAGAATGATATATTACCAAAAAATTGCAAAACCCTAGAGTCTGATATCTTGAAGTTAGATTCATACATGATGATGGCAAAAAACAATGGAAGCGCTATAAGATAGAACTCAGTCAATCCATAGCCCTTGACTACATCTGAAAAGTAAAGCCATCCAGATAGCGCAAGTAATAACATAATCATGGGGCAGGAGATGGCATTCCTAACCTTCATCCATCCTCTGATAAACGCTTCGCAAAGCCACATGCCCAGCAGGAACTCCAAGTGAAGGGTTGTGCTAAAGAACCTCAGGGCGTGGTCGACATATGAGCTATCTCCGAGAGCAGCTACATAGCTGGAGTTGAGCGACACTGAACCGGTGAAGGTAAGCTGAAGCAGAAAAACCTGAGCTAACAAAATGGTGGAGGCTATTAGCACCCTATGCTTATGATTGATAATCATTGCCATGCCGAAAACGAAATAAAACCAGGTTTCGTATGAGAGCGTCCATGCAGGGCCTATAAAATTATAATCGAATGCTGGAGCTGGCCTGTTGTAGTCCTGAAGAATAAACATGCCTGACTTAATCATTGTCCATATTGGGTGGACGTTGTATCGGACCAAGAATATTGAAGAGATGACCAGTATAAAGAGAAATAGCGGATAGATTCGAAAAAATCTCTTTAGAAAGAAACCAGCTACGTTACCTTTTTCTGCTGTAACATAGGTTATGATGAATCCGCTAATGAGAAAAAAAAGGTCAACGCCGAAACCGCCGTTCAGGAATACCCTGTTATAAGGCTCAACCTTACCGATATACATAAAAGAAAAGTGAAACATAACGACCAGTAGCGCCGCAAATCCTCTCGTATAGTGAATGCTTTTTAACATGGGTAGTCCAAAATTTTAGATTCGCGCAAGATAGTATCAATCAACTCCTCATGCTGTCAGCCATAAAAAAGCCCGGCGACCGGGCAATGACTCAGCCGCTCCTGTCTGAGCAGGTTACGGGGTGGGTAATTTGAGGTTAGTTAACCGCAACTGAAGTCGCCAGCTTAAAAATCCAGCACTCTCATCAGCTTTACAATTTTGCTCCGCGTTTCGGCTTGATCAAATCCACTAATCGATATTACTGTTTATCCATACAGTATTCATCGGAGGAGGATTTGTTATGGCGAGAGAGAGCGATATAAACGGTGCCTTTATGGCTGCGATTAAAAAAGACAGCATGGGGCGGCAGATAGTCACGACAGCGGCATTCCAGAAGAATCTGGACGACGCGAACCACGTCTGGACGCTGCAGGAGTGCAACCGTTGGATACGGTACTACCAGAACTTCTTCTTCGAACTGGTTACAGAGGAAAGTGAGAATAAGACTTGGGCGCTGCGCAGCATGGGTTATGTGAGGTAATTATGGGCTTTCCATCACCTGCATCGGACTACATTGAAAAGCGCATCGACCTTAATGACGTGCTCATGCCTCACCGCAATAATATGATCCTGATTGAAACTCCGGACGGATTCGTACTGGCAGATAAATCAGTGAAGCCAAAGCCGGGTGACAAGGTGGCATTTCAGACAGATGAGTTTCCGCAACTTGGGCGACTGTTCCGCACCGGAATCATTACTTCAGACGGTGAGACAATCGACGGGGAAGGGCTTGAAGGTATTATCGTGCTGGGGAAAGTGACGGCTGAGGTTGTGTCAGTTTATGAGCCTTGCAGACCTATACTTTAA
Protein sequences of DBSCAN-SWA_3 >CP028349|1923801:1964980|1958391_1958805_+|AVV37360.1|DBSCAN-SWA MIIVINGVQRDVTWPPDPLTRAVIISLFSWRKAEPDDSPEQDNGWWGDSFPTVQNDRIGSRLYLLSRTTLTNKTPLKAREYINQALQWLVDDGVAVRVDVKAERTGITTLSASVVINQKDGTRTAFSFDDLWSELNG >CP028349|1923801:1964980|1938607_1938889_-|AVV37334.1|DBSCAN-SWA MSDFTVRVVLHGANGEDYQSLHDLMEAKGYSREITSDDGTVFILPDAEYNTSKDLTVEEVRNEVHGMAAQVKRHYHVLVTQATSRAWYLQLKK >CP028349|1923801:1964980|1939266_1939629_+|AVV37336.1|holin|DBSCAN-SWA MKRMPDKDVGFWASLIAWLYAHKNETGYAGLAGVMAILRATYVGKDAWSRRLLDAAMCSVFAFFLQPSLQVIGSVFNWHFSEDITRVAAVFLGFLGVDYVSTKIRRQIDKRLGDSNADSQ >CP028349|1923801:1964980|1962682_1962925_+|AVV37366.1|DBSCAN-SWA MMLIETLEGFALVDGILTAKTGDAVACVLGDYPQVEKLFSSGMTAFDSEMINGEWMEGVIVLGSVTAVVVSVCKPLRSTI >CP028349|1923801:1964980|1930882_1931776_+|AVV37327.1|DBSCAN-SWA MSLLLKVKPLVISPELALRIGLNEAIVLQQICYWLEDTTAGVEYEGKRWVYNSINAWNEQFPWWTTKTIQRTVSSLKKMGLIFVEQLKKKQHDQTNYYAINYANPLLTDKDNLSLSRETICPNRKGQSVPMDKVNLSHSIGSNCPNVTEITTENTTEITTPPSCQVAPQPDDEWSLVNRSREVLRHLNKITGAKHTEAQSSMGHIKSRLKDKFTVEELCLVVDYKHVHWEGTEEYQYMRPKTLFVPGNLPGYLQSATKWDKHGRPPRSEWNAMKRNMQRDITVIPQPDSSVPHGFRG >CP028349|1923801:1964980|1958797_1959865_+|AVV37361.1|plate|DBSCAN-SWA MADSGFTRPTLPQLITTVRNDILTRLAADSTLAALRRTDAEVYGRVQAAAVHTVYGYIDYLARNLLPDLADEDWLTRHANMKRCPRKAATAAAGFARWDVTTAGIAIPAGVTIQRDDLTSFTTTAAATSAGGVLRVPVTCDTAGKNGNTDDGLAMRLVSPITGLTSAGVADSIQGGADIEDLEVWRARVIERWYWTPQGGADGDYEVWAKEVAGITRAWTYRHWSGRGTVGVMVASSDLINPIPDAATVAAVKANIEPLAPVAGADIYVFAPTPHTVNFQIRLNPDTAAVRYAVEAELRSMMLRDGVPEGVLKPSRISEAISIATGEYSHTLVSPAADITIAKGELGVVGAISWT >CP028349|1923801:1964980|1940902_1941427_+|AVV37339.1|DBSCAN-SWA MDAVGQIVIGVFIAVATAYAALMRFYDEKWWEKKLHYFLSQTDAAYLLHRSIKYWSDKGQYLSDASLHPEFIMLNGQEEAELQKQFICSLAELDKFSYMGSLLTSEKSTKIIKRFQVSYRGVAKRLIANDKDKDALASGLDFSLTLLNELVEEAKEELKIKNGKRLAFFERFKG >CP028349|1923801:1964980|1928950_1929589_-|AVV39254.1|DBSCAN-SWA MHELIGERIKALRESKGLSQAQLARLCGWAAPSRLGNYESGTRKVSSDDAILLGSALGTSPAKILFGEDADAVFKQYEYPLFTYVQAGDFSEVGSFTSRDAKAWVPTTKKASDKAFWLEVKGHSMTAPQGVRPSFPEGMLILIDPAEPVETGDFCVASANADSEVTFKKYEKDAGVSYLVPLNPAYRILDCDHSCRIIGKVVKAQWPEETFQ >CP028349|1923801:1964980|1964656_1964980_+|AVV37369.1|DBSCAN-SWA MGFPSPASDYIEKRIDLNDVLMPHRNNMILIETPDGFVLADKSVKPKPGDKVAFQTDEFPQLGRLFRTGIITSDGETIDGEGLEGIIVLGKVTAEVVSVYEPCRPIL >CP028349|1923801:1964980|1933763_1934297_+|AVV37330.1|DBSCAN-SWA MSRLSKSVTFISELLAAAAIVASATTFVLDKLVWSKQVDITASLVNSGPKSLSILMSNNGQVDVAIRKVTIDVLGNSIKNLVKLDAGGEILTKNSSKLINSTSSSLSSSVIVGPDDNVKIMGASDMLDCVVNINYIAAGDDVSRSIPIKAKCYPYCVVDPEDVETLIKKIGSHHIEW >CP028349|1923801:1964980|1946180_1947026_+|AVV37345.1|DBSCAN-SWA MTVKSLPAAPEGRPFARENRDLPSSAMERWNGGIKAAKSDDNSISVFDVIGADWYGDGVTASRIAAALRSIGGADVTVNINSPGGDMFEGLAIYNLLREYEGKVTVKVLGLAASAASIIAMAGDEVQIGRGAFLMIHNCWVYAMGNRHDLQQIAADMMPFDKAMNDIYVARTGLDASTIDAMMDAETYIGGSDAVEKGFADRLLAADEIADGDDSPAAALRKLDAMLAKTDAPRSERRKLLKALTGSKPGAAATPEGMPGATDEINPENIAQLKNALAAFG >CP028349|1923801:1964980|1964414_1964654_+|AVV37368.1|DBSCAN-SWA MARESDINGAFMAAIKKDSMGRQIVTTAAFQKNLDDANHVWTLQECNRWIRYYQNFFFELVTEESENKTWALRSMGYVR >CP028349|1923801:1964980|1934941_1935982_+|AVV37331.1|DBSCAN-SWA MSGYADFIPTLKRLSKFDKLSEREILEQFRKNHNVEPSIQKKDLCYASFAVSVDGLKFLLTERPISYLNLHWGRCSNMQVNANPLGIALPLYIGEGTLSSAIHEIGISNKPYEESNKWLFENFSLEIAIAYFNKYFIKSESLGSYKTIIFEAIEAFYLGYDHISIMSLFPVFEGGLRNLLVKFCNGDDTNTSAEKFEKEIRKLIIRWGEGRIPGFDWHPGKGYDVETEVDFFTHLNPQCDVMNSTRSFFKNVIYKSTGGVNQGGFNRHLALHLLNNDYNESSNFIRIFLALTHITFAESLMNENVPFFWQGIDENDRRIASFISRSADVIFGMRRKELNRLGVDLY >CP028349|1923801:1964980|1952694_1952877_+|AVV39256.1|DBSCAN-SWA MGSCRFFPSGLTAKDLLNLYFDCASYWRINPLEVLSEDLKSLQLLIDQANRIERERKANG >CP028349|1923801:1964980|1925859_1926264_-|AVV37320.1|DBSCAN-SWA MKKVAQFRRSNGANAGFSEKLAWQLSKGPATGRELAERLGMTLSEFNRLVLHIMRRGGETLQVEASNQVFLGGGSIDRTYTLVRNPRRVAPPPCKTMVINYSNDRSEEAIKRHREAATRRARLIASGLYLECMG >CP028349|1923801:1964980|1955394_1956771_+|AVV37357.1|DBSCAN-SWA MSWKDNLQDASLRGIAFKVDSDEATFGRRVQVHEYPNRDKPWAEDLGRATRRFSVQAYLIGDDFFEQRNRLIEAIEKPGSCTLVHPYYGEMTVVVDDTVRISHSQSEGRMCRISFSFVESGELSFPTAGLATGQKLSSSVSFLDDAISSAFGAFGMDGMPDFLQDGVLDEATGMFSTVTSAFQYVDSGISAASRLMQGDLSVLLSPPSSGASFVNRLQTMWRAGTRLTGNASDLISMIKGLTGITVDSGLAPRGVWKTDSKTAQAQTTQRNYVAQAVRTTAISEAAATVTSLPQPANRTVTRQQDPQQPVVVSHPAVSNIRPDSGNAASNTDTIATATASTSSGVTTSLDNGTVISWDDLAQVRDSLNEAIDLEMERVSDDGLYQALVTVRTDVNRDISARLEQVERMTERTPSQVTPALVLAADWYDSAARAGDITARNGIRHPGFVPVQSLRVPVR >CP028349|1923801:1964980|1956767_1957838_+|AVV37358.1|plate|DBSCAN-SWA MNNTVILRVNGQEWGGWTSVRIAAGIERIARDFTVEITRSWPGDTDQAVRSTRIKNGDLVEVLIGTDKVLTGYVEATPVRYDARSISTGISGRSKTADLIDCSATPSQYAGRSLAQVAAELAKPFSIKLVDAGGASGALQGIQADQGETVMDVLNKMLGLQQALAYDNAQGNLVIGGIGSQQAHTALVLGENILSCDTEKSIRDRFSDYQVSGQRKGNDDDFGEATTTAIRSKTIDGGLKRYRPMIIRQTGNATTATCSARAEFEMRQRAARTDEVTYTVQGWRQGDGSLWLPNLQVIVFDPILGFNNRQMVIAEVTYQQDENGTVTEIRVGPPDAYLPEPAKPGKRKKKTEEDDF >CP028349|1923801:1964980|1949857_1950418_+|AVV37350.1|DBSCAN-SWA MKLSLVIAALRARCPIFAGNVAGAAEFKSIPETGKMKLPAAYVIPTEDITAEQKSMTDYWQNVTEGFAVVVVLDNTRDERGQAAGYDAVHDVRQQIWKALLGWEPDDDAGPVAYSGGQLLDMDRGRLYYQFEFMLTREITEEDTRQQDDLNTLDELKSVDINVDYIEPGNGPDGIIEHHIKINLSE >CP028349|1923801:1964980|1950609_1952106_+|AVV37352.1|tail|DBSCAN-SWA MSVSFPTIPSDLRVPLFWAEMDNSEANTTQSSGPSLLIGLASTDSSIVKNKLTIMPSAALAGKVAGRGSQLARMVARYRAVDPFGEMWVIAVTEPEGETAKGTVTLTGNAQASGSLSLYIGAVRVQAAVVTGDAPAAVAATLAAAVNANADLPVTAAAAAGVVTLTARHKGLTSNSIPLALNYYGTVGGEATPDGVNVAIAAMAGGTGSPSLAATLAAMGDEPFDFIGTPFSDSASLATLALEMNDSSGRWGYARQLYGHVYTAKIGTLSDLVAFGDTMNNQHITVAGYEPAVQTAADELVALRTARNAVFIRTDPARPTQTGELTGALPAPAGSRFTLTEQQSLLKHGIATAYAESGVLRIQRDITTYQKNAYGVADNSYLDSETLHTSAYVIRQLKSIITSKYPRHKLANDGTRFGPGQAIVTPAVLKGEMCASYRTMERSGIVENFDLFKQHLVVERNVSDPTRVDVLFPPDYVNQLRVFALLNQFRLQYSEETA >CP028349|1923801:1964980|1944864_1946169_+|AVV37344.1|portal|DBSCAN-SWA MKEKKQPGRIKSAIVNWLGESIGLNDAAFWQEWYGTSSSGKVVTAEKALALASVWACVRLLSESVSTLPMKVYERAADGSRKLALNHPAYQLLCRRPNSEMTPSRFMLMVVASICLRGNAYVEKKMIGTKLVSLVPLLPQSMKVERLDSGELQYTYTEKGVPRIIPVKNMMHIRGFGLDGVCGMMPMRTGRDVFGAAMAVEESAAKIFENGIQTSGFFLSKNLLTKEQRQKNRENLNRFVGSKNAGKVMVLEGDMSYQGITLNPEDAQMLESRSFSIEEICRWFRVPPFMVGHVDKQSSWASSVEGMNLLFLTNTLRPMLVNIEQEISRCLLNGDEDLFAEFSVEGLLRADSAGRSAYYTTALQNGWMSRNDVRRLENLPPIEGGDIYTVQLNLTPLEDLRKNSTAARATLLREVHNAVFPDIPFEQSPLKQAA >CP028349|1923801:1964980|1937030_1938053_+|AVV37332.1|DBSCAN-SWA MRALLTPEIAPRTGIVLLKPGPDLLKLFKGRVVISTPTMDMADLPSGRLNDGTQPLLDEPSLIPFFSHERVIKAAGGPNALASFVQSFNCCQWEREKLGVWHHHEFTVSETENGLVSLCYSHDNEFRENGVPGSLENIAKGNTALWIIRAACSQMALSSDHQLTLPELCWWATLNDVIDLIPEAPARRVLRMPKESIQSGELKEARIVPVRPAREVIQDAAQIVKKIISLHADPESPESFMKRPKRKRWENEKYTRWVKSQNCACCGVQADDPHHIIGHGQGGMATKAHDLFVMPLCRAHHDELHRDMKAFEAKYGSQVDLLFRFLDFAIAVGVIGTDKK >CP028349|1923801:1964980|1959855_1960437_+|AVV37362.1|DBSCAN-SWA MDLTAQYRQMLGALLPRGPAWDGDDLLLTGFAPLLAAVHGRGDALMLETDPRSVTELIDRYENISGLPDSCAPPGVQTLQQRRQRLDAKLNLAGGINEAFYLAQLEALGYTGVTITRYNKSQFNCLSDCTDSLYSDDWRYYWQVNMPAATQITEMTAISNSTDSLRMWGDTIAECVLTKLAPSHTYVIFKYPG >CP028349|1923801:1964980|1926260_1926704_-|AVV37321.1|DBSCAN-SWA MHHFKSNKEVVAAGHQFAKNIGADTPLIDMAKMVTELASRLEVATVRANLMAGEVLRINSVLPDTISVLQAAGADMTLIDDLNAALATPACDQWIRTLRGEALGEARRAVTTLGNHQQPGISHAINIISQMEMDVLRTHTVTLKVVS >CP028349|1923801:1964980|1928490_1928694_+|AVV37323.1|DBSCAN-SWA MSKKSDELYDEMCRVVGDVVFTLHDYGIESKQIVIADALRTALASNNPERSKLLAKAMEAATKVLDR >CP028349|1923801:1964980|1957837_1958395_+|AVV37359.1|plate|DBSCAN-SWA MANPISGMGRALSNLLARAVVRGLNTATKCQMLQVEMAGGEGKSDIEHMEPYGFTAAPLTGSEAVAAYFDGDRSHGVVLVVSDRRFRIKGLKTGEVAVYDDQGQSVTLTRAGIVVNGAGKPITFTNAPKARFEMDIESTGEIKDKCDSSGLTMSAMRIAYNGHTHKENGSGGGTTDAPTQKMVVS >CP028349|1923801:1964980|1923801_1924824_-|AVV37317.1|integrase|DBSCAN-SWA MARLRKNAADAWMPPRVYRGRSAYEFHPKDGGAIRLCALDAAQSSVWAAYEALINEIPDDRLLASLADRFFKSADFFELARETQRDYLKYSKNVLAVFGAMPSDDIRPEHVRKYMDKRGLKSRVQANREKAFMSRMYRWGYERGMVKGNPTKGVKKFKEVSRDRYVTDAEYQALYSCAPEVVKIAMELAYLTCSRQGDILAMKKSQIVDEGVLIKQSKTSVAQIKAWSPRFTAAIKMAAELPLKPGMSSIFIIHQPNGSGYTRDGFNSRWSAAREAAKLKFPELLFDFTFHDLKAKGVSDLEGDLYEKRAITGHKNVEQTAAYDRKIVVVPVVGGQAKGK >CP028349|1923801:1964980|1931789_1932224_+|AVV37328.1|DBSCAN-SWA MINHESKILELITRNGPLKVRELCKLTGLHETSVKRFIKPLFTKGLLKRASDWSYSINTDPLPVESEKYSHKAKQAAELESKGFWLRAAQVWREAMLVAKFDASRNVAKENCDRCAVKGSLNCGSYGGLDTGRIISASVNRDLL >CP028349|1923801:1964980|1929975_1930524_+|AVV37325.1|DBSCAN-SWA MDQKHWQVEKQPAWLVAAIKKTISSLPGGYAEAAEWLGVTEDALFNRLRTNGDQIFPMGWAMVLQQASGTKHIANAVSRQSNSVNVPLVDIEDVDNADINQRLMESVEWIGKHSAYIRKATADGMIDAAEREQIEENSYQVMAKWQEHLTLLYRVFCAPEKVNAAGLQSAALDATKSTCVEN >CP028349|1923801:1964980|1941560_1941770_+|AVV37340.1|DBSCAN-SWA MFNENFLPPDPDNPGWVIAWGVVKSEKWDLAGVYGNQEDANAKAEEMGDDYQVHFGSHKLGTKDFIWND >CP028349|1923801:1964980|1943119_1944865_+|AVV37343.1|terminase|DBSCAN-SWA MVEKKKKTRSTSSSVDPATQYAMDVTSGTVIAGPDIRAACARHIRDLEEGPKRGLFWDVEAVTRVVNFFAQVLKLNGGEHEGKPFILLPWQCFIVGSLFGWKAEDGTRRFRMSYIESGKGSGKSPLAGGVGLYLLMADKEPRAEVYAAATKKDQAMILFRDAVTMVDQSPALAQRITKSGTGLNVWNLAFLQTGSFFKPISSDDGQSGPRPHGALIDEVHEHKTNAVVEMMRAGTKGRRQALMFLITNSGHDKTSVCYEYHEYGRKVAAGDLVDDSFFSFICSLDEGDDPFKDESCWGKANPSLGQTFTDKYLREQVTQARGMPSKESIVRRLNFCQWVEASDPWIDSDTWMNCEQDFDPEDLAGEECYGGLDLSGSRDLTALALYFPKSKKLLVEFWTPKDSLLERAKTDHVPYDAWLRNGFIHAPPGKAVNYGFVAVRIGELAARYDIKCIAFDQYRIKYLEPELESESVSVDLVPHGQGFYKAQESGLWMPRSIELFEEHLNNRVLVIRPNPCLRWNAASAVLEADQKDNRIFAKKKSTGRIDGVVASAMAIGAAEDAVLVDSGDPDDFFDDPIMVGI >CP028349|1923801:1964980|1942701_1943166_+|AVV37342.1|terminase|DBSCAN-SWA MAGRRPKPTHLKVVTGNPGKRKLNDKEPAPAREIPSPPSHLTDWGKVAWGKLTVLLDGMGVLTVADVLALERLCDIYADILQLRITIAEEGRTYTVQTDGGFLIKANPAVSMLADADRRFKSYLVEFGLTPAARSKVNVNGGEKEEDPLNQFFG >CP028349|1923801:1964980|1952105_1952462_+|AVV37353.1|tail|DBSCAN-SWA MAKIAGTAYVKVDGQQLSLTGGIEVPMNTKVRDDVIGLAGDVDYKETHRAPYVKGTFKVPKAFPVTKLMDSDQMTITAELANGMVYVLSEAFQFGEANHNAEEGTVDLEFHGSEGFYQ >CP028349|1923801:1964980|1948641_1948971_+|AVV37347.1|head,tail|DBSCAN-SWA MLLTLEEIKQQCRLESDFTEEDRLLELFALAAEAKAVTYLNRNLYKTVADIAPLDTDGMVITEDIRLALLMLVSHWYEHRSSVSELEMTETPQAFQFLLYSRRLPVSGY >CP028349|1923801:1964980|1950421_1950610_+|AVV37351.1|DBSCAN-SWA MQLRPKRGRSVPDPVRGDLLPSEGRNVEESSYWHRRIADGDVEEVSAEEEKPAADAKKKGGE >CP028349|1923801:1964980|1960439_1960958_+|AVV37363.1|DBSCAN-SWA MHRIDTSTAQKDKFGAGKNGFTGGNPQTGELPTALDQNFFDSLQEEICGVIEGGGIALNKADRGQMLKAMKAMFPIMGLFANSLSSPGYQKLPGGVLIQWGTAAIPSAGSVTVTYPITFSAGFSNAFVSPLDSSSTNNYRVGIDTSTTSTITLRSTNTNSVTGVNWMVIGKV >CP028349|1923801:1964980|1940232_1940688_+|AVV37338.1|DBSCAN-SWA MFNVIGFIRNNSGLVIIGLICVALWGLNASNSQLKATNDRLEKLANSKDEQINDLRSKNDGLASSVTELVTAVKQQNEVMSQVTEQRAVTAQQNRKLQNEIKRYLAADKCAVAPVPPDAADRLRDAAKAAGGVPDSKATSAKPSGRTDQPD >CP028349|1923801:1964980|1947038_1948253_+|AVV37346.1|capsid|DBSCAN-SWA MSDVNELLKKVSAKLEEVSGTFSQKAEDALKEAKNSGQLSAQTKEAVDRIATEFNALTEANKSLKASLGDLEQHVAQMPLANAKNVIETVGGQVVSSEALKAFSASIEGNKRLSIPVKAALLSVNVPGQIVAPDRLPGIDQQPKQRLFIRDLIAPGRTESSTIYWVQQTGFTNNAATVAENTKKPYSDITFAEKITPVRTIAHLFKAAKQILDDMPQLQSTIDAELRYGLKYVEEQEILFGDGTGTHLNGIVPQASKYAASFSVANQNGIDDLRLAMLQAQLARFPASGHVLHFIDWAKIELTKDSLGRYILANPAALTGPTLWGLPVVATEAAAFQGKFLTGAFNAGAQIFDREDANVVISTENADDFEKNMISIRCEERLALAVKRPEAFVYGSFTAPAAAA >CP028349|1923801:1964980|1925532_1925847_-|AVV37319.1|DBSCAN-SWA MSIKPLEVQRDQYGYWSHPDYLAFCDGREFISTEEFDQWMSDHGLKWKVNYRDEDMLDPTVDGCDISAWQPESPEGGGWFVGSIHDTEDGAVCIWLRSTAPEVE >CP028349|1923801:1964980|1954814_1955342_+|AVV37356.1|DBSCAN-SWA MKNHLIASLVIGGSLIISSAIISGAIPLKNENVLQVVDGSVKLGNVFSEDDLVSAKLVFSDDNQDQVLFENVGPEETGTRLEEKIKELAKQINYNEKDESKKIAAEKLSIKIPFTLVLTSSVKYRSEYQPSFTLNLTKKNVDLPANSNMLETIKPAIDNFVKSQKAQFDASHFIK >CP028349|1923801:1964980|1939612_1940245_+|AVV37337.1|DBSCAN-SWA MLTASSFQLATGVSNALRDAWFPHVAASVSAFQISTPLRQAHFLAQTGHESAGFLKVEEGLNYSENALTAMFGKRITAEQARAYGRNAMHAANQKMIASIIYANRNGNGDVSSGDGYRYRGRGLIQITGKSNYAALVKQLAADVASNPDLLLGYRFAAMSAAAWWKNNGLNELADSDDVTRITRIINGGLNGLDDRKSRLSKSKGILCST >CP028349|1923801:1964980|1930710_1930953_+|AVV37326.1|DBSCAN-SWA MRELDRIFRDKRGIPVRVIRWEPENDRVIYLRDNYEHGECFSSLERFKQYFREVTVIHEPTSESKTAGDQPGTCAAHRPE >CP028349|1923801:1964980|1960959_1961370_+|AVV37364.1|DBSCAN-SWA MMNKVYYDAQSAGFYLEGINSQIPSTAIEISTEDYKVLLSEQEKGMEITPNENGYPTLTAPPALTQQQEVLIAYSNRLSLMGKASDVIAPLQDAVDLGDSTPYEDALLKNWKQYRVALNRLDLSTAPDIQWPKIPG >CP028349|1923801:1964980|1927665_1928127_+|AVV37322.1|DBSCAN-SWA MNRDDLEFSVTYSYYLEKMNYRLLTRIDKLITLTLIVLGFSVFAKFSNMFVFGAVVAVLSVLQLVYQFAQEAGASKEQMRHYRSLMVNMKNIGDEELRQRFAKIQDSDSMPWQSLEDAAYNRTLIALGHTASLTKLSAKDSVLSWFAGDLPKS >CP028349|1923801:1964980|1938982_1939192_-|AVV37335.1|DBSCAN-SWA MWRLIDVRKNEKRPRDGDLFEWNPDAIAVKALPVGNRDNLGLSTYEIIPYVCVQESESKKPAQTSSIIF >CP028349|1923801:1964980|1936243_1937023_+|AVV39255.1|DBSCAN-SWA MNQLLVIDGVSVRQDNSGRYCLNDLHRAAGGERRHEPSLWRNLQHTNELVQLLSDTGIPVSVIKGGVNQGTFVCKELVYSYAMWISAEFSLKVIRTYDSLASKPPVVSMPEEVQASIILLESASRMLNFSNSSKLGAYQKIQQHYGIPNMMPAYAIDAPVDAKDGSSRPTLSLSALLKANSIRMNASQAYRQLEKLGIVEHKSRASRSGTDGVKLFWSLTTKGCMYGKNITSPANPRETQPHFFESKFAELLRLLDTVH >CP028349|1923801:1964980|1963069_1964134_-|AVV37367.1|DBSCAN-SWA MLKSIHYTRGFAALLVVMFHFSFMYIGKVEPYNRVFLNGGFGVDLFFLISGFIITYVTAEKGNVAGFFLKRFFRIYPLFLFILVISSIFLVRYNVHPIWTMIKSGMFILQDYNRPAPAFDYNFIGPAWTLSYETWFYFVFGMAMIINHKHRVLIASTILLAQVFLLQLTFTGSVSLNSSYVAALGDSSYVDHALRFFSTTLHLEFLLGMWLCEAFIRGWMKVRNAISCPMIMLLLALSGWLYFSDVVKGYGLTEFYLIALPLFFAIIMYESNFKISDSRVLQFFGNISFSIYITHFFIMYLLLEHPPYYWSEAANLTKLVTSTLIAIAFAYILHRLVELPFIKIGKNLQFNYLH >CP028349|1923801:1964980|1932339_1933641_+|AVV37329.1|DBSCAN-SWA MSTLARIYDDKKNSDTDITTRKTYLLGVDELYVETNYNIRDIDQTHVEEFRDAFIAGEHVPPLAVKVTEKGIKIIDGHHRYYGAKLAQEAGYTLRLECKDFVGSEADSVAFMVTSSQGRALLPLERAAAYQRLVNQGLEPAEIAAKVKRSITDVEQHLQLLTVGEPLIEMVKSGEVAATTAVALQREHGVKASSVAQEQMQKAKAAGKKKLTKTDAMPKFSAAQARKLAELIAKHCQTEQGEEGSRVSLTFETDLQAAELMDIILIAKEHYGVTKSVSEQPEPAKTENGDGDDLPLLKNEILEQSGVEVWACVVAAFKMKAEYTYSESKWAHTWAADSVENPTCVTVPAETIASAVRLIKQHQDALELKLWVSEQYDDPELAIEQLQRFSAVLIDVRHDRPCTVQEFIALVEQTNRDYWLNIRMLRQAVSETQ >CP028349|1923801:1964980|1929706_1929922_+|AVV37324.1|DBSCAN-SWA MNTIAEQRKKLGISQSVLADVIGWGQSRVANYELSIRKPGLDECRMIVLGLNKLGANCSLDDVFPPGRVKK >CP028349|1923801:1964980|1924823_1925051_-|AVV39253.1|DBSCAN-SWA MSDINNVIISDADIEKITGYKIPSKQCQCLKQAGIFFVVRRDGRPRTTWQHFNDPLLSRKAPESSKHEPNFGALD >CP028349|1923801:1964980|1961353_1962529_-|AVV37365.1|DBSCAN-SWA MEKLRPNSDIEILRAFAIIMVLVQHFPSLYFWSDHSFFAKVNTYLQFWSGVDLFLCISGFVVSCSLIPLIDQCKQRGTGLSGVVKAFFIKRAFRLFPTSFLWIAIIICMTFSYNISGAFGDIKWNFYQAISILSYTYNLLSHYMLVNSLPTSMGPFWSLNLEEQFYFILPFFIIFTKKAYRVLLLLAFIFIQFLIHRQDTYLMNFRLDSISWGVVISIFYMDSRIFRFEPISLKNRVSSFSLTAIALIFLMFSIPVLHDSRFMVGMLALVSAILIYVASFNKGYIYCPLALRKVLLWVGSRSYAIYVIHMPAIYFIQESTIRYFVSIGQGPSKSVLLCVVMTVSALVLTVLLVELNFKLIESPLRGYGRKIAHKMTSKVKGKGAQAVTQES >CP028349|1923801:1964980|1952869_1954753_+|AVV37355.1|tail|DBSCAN-SWA MAEFELKALITGVDRLSPALGRMQKNLRRFRKDAEESGRGGMAMAGGLAAGLTGSLVAYAKQEDAATGLKVAMMDANGAVGSDFEKINKLAIGLGNKLPGTTADFQNMMQMLVRQGIPAQNILSGVGEASAYLAVQLKKTPEAAAEFAAKMQDATGTASEDMMGLFDTIQKAVYLGVDDTNMLSFFSKTSSILKMVSKDGLTAARALAPISVMMDQMGMEGEASGNALRKVFQAGFDGKKMKAANKLLSKKGITLDFTDGKGEFGGLDNLFKQLNKLQSLTTKQKTTIIKQIFGDDAETLQVLNALIDKGKTGYDQIQEKMGKQADLNKRVNAQLSTLSNIWESLTGTAVNGLAAIGGAFAGDAKRLVGWLGDMSERFTEFADKNPKVIRGAFGIAAGFVGMKLGLLGINFALGILGQGLKLSPMGIFLRLAALGIGLLISDWDKFGPVVEKVWTKIDGLTEALGGMNSIITGIGGVMAGAFTLQVIGSLTTATAKASGLLVVLTKIGKLSALTVSIAVALYMFKKLEEISDATTQKDGTESFWESLKKRWKAGGWYNNEQQMKGGDGALIPQSMNVPLKRDDPSSQRGELKVSFENAPPGMRVEPVGSALPWFDLDVGYNRFSNKS >CP028349|1923801:1964980|1941820_1942171_+|AVV37341.1|DBSCAN-SWA MAEPRIYNSRWDKARLSFLKSHPLCVMCHRQGRAVAAAVVDHIKPHRLKEAINSGKQDEIAKAQKLFWDKTNWQPLCKQHHDSTKQREEKRGHVIGCDENGLPLDLQSHWYKSIKP >CP028349|1923801:1964980|1952458_1952728_+|AVV37354.1|tail|DBSCAN-SWA MSELQLSKPITAHGETIHVLELRDPTGKDVRELGYPYQMNQDESVKLLAHVVAKYISQLGGIPPSSVDDMSPSDLNAAGWVVAGFFLQA >CP028349|1923801:1964980|1948972_1949362_+|AVV37348.1|head,tail|DBSCAN-SWA MQQRSSNTSAVYTLPDPGELNKRIHLRQRIDQAAADYGTEPVYQNEKDVWAKVRQVGATTYHESVQADDTITHYMTIRYRRGITSDFEVVYGGYVYRVKRLRDLNSAGRYLLMECEELRAVEIDGEMYG >CP028349|1923801:1964980|1938074_1938479_+|AVV37333.1|DBSCAN-SWA MRDIQLVLERWGAWAACEGSQVGWSPTSPMFRSLLPQEGKSSRNSCSDSDGIIIDTAVGMLKKTDRHDELELVMLHYMFDVSKSTISRWKKCSEGKVRQQLMIAETFIDACIIMTGVQLEMDDWTRKTILRKSA >CP028349|1923801:1964980|1949354_1949861_+|AVV37349.1|DBSCAN-SWA MAKPLLHVDFQQPKDLVFNRAKMRRAFIQIGQVHMRDARRLVVHRGRSAPGEYPGFRTGKLARSIGYYVPRASKSRPGLMVRIAPNQKRGEGNRLIEGDFYPAFLFYGVKRGSKRKKSHHKGKSGGNGWRVAPRKNYMTEVLEARKTWTRYVLTRALRTSLRPERKKK >CP028349|1923801:1964980|1925056_1925533_-|AVV37318.1|DBSCAN-SWA MEQPILDMCCGSRMFWLDKKDSRAIFADIRKESHVLCDNRALHIDPDIIADFRSLPFPDCSFAQVVFDPPHLDRAGENGWMRKKYGALDKQTWRDDIRAGFSEAFRVLRPHGTLIFKWNETQIPVSQVIALTDHKPTIWQRTGKGDKTHWIIFLKEVE |
57 | Enterobacteria_phage(25.58%) | capsid,integrase,portal,terminase,tail,holin,head,plate | attL 1923641:1923700|attR 1965103:1965162 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2303135 : 2316986
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP028349|2303135:2316986|DBSCAN-SWA AATGCCTGTAATTACGCTTCCTGATGGCAGCCAGCGCGTGTTTGACCGCCCTGTCAGTGTCATGGATATTGCGCTGGACATCGGTCCAGGTCTGGCGAAAGCCTGTATCGCTGGCCGCGTTAACGGTGAACTGGTTGATGCTGCTGATCCCATCACCGACGATGCCGCTGTTGCGATCATTACCGCAAAAGATGAAGCGGGTCTGGAAATTATCCGTCACTCCTGTGCGCACCTGTTAGGGCACGCAATTAAACAGCTGTGGCCGGATACCAAAATGGCGATCGGCCCGGTCATTGATAACGGCTTCTATTATGATGTCGACATCGATCGTACCCTGACCCAGGAAGATATCGACCTGCTGGAAAAACGCATGCATCAGTTGGCTGAAACCAACTACGATGTGGTGAAGAAAAAGGTCAGCTGGCAGGAAGCACGCGACGTCTTTGCGGCGCGCGGCGAAATCTACAAGACCACTATTCTTGATGAAAACATCAGCCATGACGATAAGCCTGGCCTGTATCATCACGAAGAATATGTCGACATGTGCCGCGGTCCGCACGTGCCGAACATGCGGTTCTGCCATCATTTTAAACTGCAGAAGATCTCCGGTGCCTACTGGCGCGGCGACAGCAGCAACAAAATGCTGCAGCGTATTTATGGCACCGCATGGGCAGACAAAAAGCAGCTGGCAGCCTATCTGCAGCGTCTGGAAGAGGCGGCGAAGCGTGACCATCGTAAGATTGGTAAGCAGCTGGATCTCTACCACATGCAGGAAGAAGCACCCGGTATGGTCTTCTGGCATAATGACGGCTGGACCATCTTCCGTGAGCTGGAAGTGTTTGTCCGCAGCAAATTAAAAGAGTACCAGTACCAGGAAGTGAAAGGTCCGTTCATGATGGACCGCGTCCTGTGGGAAAAAACCGGGCACTGGGAAAACTATAAAGAAGCAATGTTCACCACCTCATCTGAGAACCGTGAATACTGCATCAAACCGATGAACTGTCCTGGACACGTTCAGATCTTTAATCAGGGTCTGAAATCCTACCGTGACCTGCCGCTGCGTATGGCAGAGTTTGGTAGCTGTCACCGTAATGAGCCATCAGGTGCGCTGCACGGTCTGATGCGCGTTCGCGGCTTTACCCAGGATGATGCACACGTCTTCTGTACAGAAGAGCAGGTGCGTGACGAAGTGAACAGCTGTATCCGCATGGTTTATGACATGTACAGCACCTTCGGCTTTGAAAAGATTGTGGTGAAGTTATCCACCCGTCCTGAGAAGCGCATCGGTACAGATGAGATGTGGGATCGTGCCGAGGAAGACCTGGCCGCTGCGCTGAACGAAAACAACATTGAGTTTGAGTTCCAGCCAGGTGAAGGCGCGTTCTACGGCCCTAAAATTGAGTTCACTTTGTACGATTGTCTTGATCGCGCCTGGCAGTGCGGAACAGTTCAGCTAGACTTCTCCCTGCCAAAACGCCTTGAAGCAACCTACGTGGGTGAAAACAACGACCGTCAGACGCCAGTGATGATTCACCGTGCGATTCTGGGATCGATGGAGCGCTTCATCGGTATTCTTACCGAAGAATTCGCCGGTTTCTTCCCAACCTGGCTGGCACCGCTGCAAGTTGTGGTAATGAATATTACCGACGGACAAGCGGAATATGTTGAGTCTTTGACGCGTAAGCTGCAGAATGCTGGCATTCGTGTGAAGGCAGACTTGAGAAATGAGAAGATAGGCTTTAAAATCCGCGAGCACACATTACGTCGCGTCCCTTATATGTTGGTCTGCGGTGATAAAGAGGTGGAATCTGGCAAAGTTGCCGTTCGCACCCGCCGTGGTAAAGACCTGGGATCGATGGATGTCGATCTGTTTGTTGAGAAACTTCAACAAGAGATTCGCAGCCGAAATCTTCACCAATTGGAGGAATAAAGTATTAAAGGCGGAAAACGAGTACTACCGACGCGTCCGAACAAAATTAACAGTGAAATCCGCGCAACTGAAGTGCGCTTAACGGGCATGGATGGCGAGCCGATTGGCATTGTCACATTACGCGAAGCTCTGGAAAAAGCCGAGGAAGCGGGCGGCGACTTAGTCGAAATCAGCCCGAATGCCGAACCGCCGGTCTGCCGTATTATGGATTACGGCAAGTTCCTCTATGAAAAAAGCAAATCTTCTAAGGAACAGAAGAAGAAGCAAAAAGTTATTCAGGTTAAGGAAATCAAATTCCGTCCTGGAACCGATGATGGCGACTATCAGGTCAAACTACGCAACCTGATTCGCTTTCTGGAAGATGGCGATAAAGCCAAAATCACGCTCCGCTTCCGCGGTCGTGAGATGGCGCACCAGCAGATCGGTATGGAAGTGCTTAACCGCGTCCGTAAAGACCTGTGTGAAGATCTGGATTTGGCCATTGTCGAATCCTTCCCTTCGAAGATCGAAGGCCGCCAGATGATCATGGTGCTCGCTCCTAAGAAGAAACAGTAGGCTTTCAAGTAGCAATGACCGCGCAGCGTTCGCGCTGTGCGGTTATCTGTTCGCCTGTCTGGGTCATGTTATTAACAATGCGAAGTGGATATTTTTAAAATGCCAAAGATTAAAACTGTACGTGGCGCGGCTAAGCGCTTCAAGAAGACCGCTTCTGGCGGCTTCAAGCGTAAACACGCAAACCTGCGTCATATTCTGACTAAAAAATCTACTAAGCGTAAACGTCACCTGCGCCCTAAAGGCATGGTGTCAAAAGGCGATCTGGGTCTGGTTATTGCCTGCCTGCCGTACGCATAAGTAAATTTTTTTCATTAATTCAGAATATTAAACAGGAGAGCTAAATGGCTCGTGTAAAACGTGGTGTAGTTGCTCGCGCACGTCACAAAAAAATCTTAAAACAAGCTAAAGGCTATTACGGTGCACGTTCACGTGTTTACCGTGTTGCTTTCCAGGCTGTTATCAAAGCTGGTCAGTATGCTTACCGTGACCGTCGTCAGCGTAAGCGTCAGTTCCGTCAGCTGTGGATCGCGCGTATCAACGCAGCGGCGCGTCAGAACGGCATGTCTTACAGCCGTTTCATCAATGGTCTGAAAAAAGCAGCCATTGAGATTGACCGTAAGATCCTGGCCGACATCGCAGTATTCGACAAAGTGGCATTCTCTGCACTGGTCGAAAAAGCGAAATCAGCCCTGGCGTAAGTCAGATGGAAAAGGGAGCTTGCTCCCTTTTTTATTGCCTGTAACTTACGCAAAAGATTGACATTTTCGTCGCTGAGCTTTTCAATAGACCCCCTGTATCGCATGACAAGGTAACCGCAAGCATGAATGCTGCTATTTTCCGTTTCTTTTTTTACTTTAGCGCCTGACATTCGGGGGCTTTTGCGCATAAGAGAAGAAACGAAAAATCGCGCTAAAAGCCTCCCTCGTGGAGGCTTTTTTGTTTCTGGCGCTAGCATATTGGACCGGCAGGTCCCAACGAGAACTGACCAGCCTGACTGGAAAAAGAGGAAACTATGTCCCAACTCGCAGAACTGGTGGCCAGCGCCACGGCGGCCATCGACGGGGCGACGGATATCGCCGCCCTTGACGCGGTGCGCGTCGAATACCTGGGTAAGAAAGGACATCTCACACTGCAGATGACGACGCTACGCGAGCTGCCAGCAGAAGAGCGTCCTGCAGCGGGTGCGGTAATCAATGAAGCGAAGCAGCAGGTCACCGATCGACTGAATGCCCGTAAAGATGCACTGGAAACCGCGGTGCTGAATGCCCGTCTGGCTGAAGAGACCATTGATGTCTCTCTGCCGGGTCGTCGTATCGAGAATGGCGGTCTGCATCCGGTCACCCGCACCATGGATCGCATTGAGACCTTCTTTGGCGAGTTAGGCTTTGGCGTGGTAACAGGGCCCGAAATTGAAGATGATTATCATAACTTCGATGCGCTGAATATTCCTGCGCACCACCCGGCGCGTGCCGATCACGATACCTTCTGGTTCGACGCCACGCGTCTGCTGCGTACGCAGACCTCTGGCGTGCAGATCCGCACCATGCAGGCTCAGCAGCCGCCTATCCGTATTATTGCGCCAGGCCGCGTCTACCGTAACGATTATGACCAGACGCACACGCCGATGTTCCATCAGATGGAAGGGCTGATCGTGGATAAAAACATCAGCTTTACCAACCTGAAAGGCACGCTGCACGATTTCCTGAACAACTTCTTCGAAGAAGATTTGCAGATTCGCTTCCGTCCTTCTTATTTCCCGTTCACTGAGCCATCGGCAGAAGTGGATGTTATGGGTAAAAACGGTAAGTGGCTGGAAGTGCTGGGTTGCGGCATGGTGCACCCTAACGTGCTGCGCAACGTCGGCATCGATCCGGAAGTTTATTCCGGTTTCGCCTTTGGTATGGGCATGGAGCGTCTGACCATGTTGCGCTATGGCGTGACCGATCTGCGTGCTTTCTTCGAAAATGATTTACGTTTCCTCAAACAATTTAAGTAAGGGCAGGATATCCAATGAAATTCAGTGAACTCTGGTTACGCGAATGGGTAAATCCAGCCCTGGACAGCGCTGCGCTGTCTGAACAAATCACCATGGCCGGTCTGGAAGTGGACGGCGTTGATGCCGTCGCCGGTGTGTTCCACGGCGTTGTGGTTGGTGAAGTCGTGGAGTGCGGTCAGCACCCCAATGCCGACAAGCTGCGTGTCACCAAAATCAACGTCGGTGGCGATCGCCTGCTGGATATCGTCTGTGGCGCGCCGAACTGCCGTCAGGGACTGAAAGTTGCTGTCGCGACCGTCGGCGCTGTGCTGCCAGGTGATTTCAAAATCAAAGCGGCTAAACTGCGTGGCGAGCCCTCTGAAGGGATGCTCTGCTCGTTCTCCGAGCTGGGTATTTCCGATGATCACAACGGTATCATCGAACTGCCTTCTGACGCGCCAGTTGGCACCGACATCCGCGCGTACCTGCAGCTGGATGACAACACCATTGAAATCAGCGTCACGCCTAACCGCGCCGACTGTCTGGGTCTGATCGGTATCGCCCGTGATGTCGCGGTGCTCAACGGTTTACCGCTGAACGTGCCGGAGATGCAGCCGGTCGCGGCGACGCTTAACGACACGTTCCCGATTCAGGTTGACGCGCCAGAAGCCTGTCCGCGCTATCTGGGCCGTGTCGTGAAAGGAATTAACGTTAAGGCAGCTACGCCACTGTGGATGAAAGAGAAGCTGCGTCGCTGCGGCATTCGCTCTATCGATCCTGTCGTCGATATCACCAACTACGTTCTGCTGGAGCTGGGTCAGCCAATGCACGCCTTCGACCTTGACCGTATTGATGGCGGTATCGTGGTGCGCATGGCGAAAGAGGGCGAAAAGCTGACGCTGCTGGATGGCACTGAAGCCACCCTGCAGAGTGATACTTTAGTGATCGCCGATCATCAGAAAGCGCTGGCGATGGCCGGAATCTTTGGTGGCGAACATTCGGGTGTGAACGGCGAAACGCAGAACATCCTGTTCGAGTGTGCGTACTTTGATCCGCTCTCTATCACTGGCCGCGCCCGTCGTCACGGTCTGCATACCGACGCGTCTCATCGCTACGAGCGTGGCGTTGACCCGGCGCTGCAACATACCGCGCTGGAGCGTGCTACCCAACTGCTGCTGTCCATCTGCGGCGGTGAAGCAGGCCCGATCATCGATCAGACCCATCAGGCAGCCCTGCCTGTACCCGCCACGATCACGCTGCGTCGCGAAAAGCTGGACCGTCTGATTGGTCACGTGATTGCGGATGAGCAGGTTACCGACATTCTGACGCGTCTCGGCTGTGACGTGACGGCAGGTGAAGGGCAGTGGCAGGCCGTCGCGCCGAGCTGGCGCTTCGATATGGCCATTGAAGAAGATCTGGTCGAAGAAGTGGCCCGTATTTATGGCTACAACAATATTCCTGATGTACCTGTTCAGGCCGGTCTGGTCATGACCCGTCACCGTGAAGCGAATCTGTCGCTGAAACGCGCGAAAAACCTGCTGGTGGATAAAGGTTATCAGGAAGCGATTACCTATAGCTTCGTCGATCCGAAAATCCAGCAGCTGCTGCATCCGGGTGAAGAGGCGCTGTTACTGCCAAGCCCGATCTCCAGTGATATGTCGACCATGCGTCTGTCACTCTGGACCGGTCTGCTCAGTGCGGTGGTCTATAACCAGAACCGTCAGCAGAGCCGTGTACGTCTGTTCGAGAGCGGCTTACGCTTTGTGCCAGATACGCAGGCGGATTTAGGCATCCGTCAGGATCTTATGCTGGCTGGCGTCATCAGCGGCAACCGTGTTGAAGAGCACTGGGATCTGGCGCGTCAGACAGTTGACTTCTATGATTTGAAAGGCGATTTAGAGTCGCTGCTCGATTTAACCGGCAAACTGGATGAGATTAGCTTCCGGGCTGAGGCGAATCCGGCGTTGCATCCAGGACAGAGCGCGGCGATTTATTTGCATGATGAACATATCGGATTTATCGGCGTTGTACATCCTGAGCTGGAACGTAAGCTTGATCTCAATGGCCGCACCTTAGTCTTTGAACTGCTTTGGAATAAGGTCGCAGACCGCGTCCTGCCTGACGCGCGCGAGATTTCTCGCTTCCCGGCGAACCGCCGTGATATTGCCGTTGTAGTGGCTGAAAACGTCCCTGCAGCAGATATCATTGCGGAGTGTAAGAAAGTTGGCGTAAATCAGGTAGTTGGCGTAAACTTGTTTGACGTGTACCGCGGTAAGGGCGTTTCTGAGGGCTACAAGAGCCTCGCAATCAGCCTGATTTTGCAGGATACCAGCCGGACACTCGAAGAAGAGGAGATTGCCGCGACCGTTGGCAAATGCGTTGCGGCATTAAAAGAGCGATTCCAGGCAACCTTGAGGGATTGAACCTATGGCGCTTACAAAAGCTGAGATGTCAGAGTACCTGTTTGAGAAACTCGGGCTGAGCAAACGCGATGCCAAAGAGTTAGTAGAGCTGTTTTTCGAAGAGGTACGTCGCGCTTTGGAAAATGGAGAACAGGTAAAACTGTCTGGGTTTGGTAATTTTGACCTGCGTGACAAGAACCAGCGTCCTGGCAGAAACCCGAAAACGGGTGAAGATATTCCGATTACTGCGCGTCGCGTAGTAACGTTCCGTCCAGGACAGAAGTTGAAAAGCCGCGTAGAGAACGCTACACCCAAAGAGACTGACTGAATTTCAGCGTTTCAAAAAGGCCGCGAAAGCGGCCTTTTTTCATAATGACAGCATGCAGCATTGCCGCTCATCCTATCCTTTAGAGTCCAGGCCATCCCGGCGCAGACCGCCTCATCAGCCTTTCTTTTTTCACCAATCCCTCTACAATCAAATCACTGTTTTCTCAGCTAATCCCGCTTATGTCTCTGCTCGATACGTTCGCGCGCCAGGCCGATCAACGCGGCCAGAAATGGCTGATTGCGCTCGCCCTTATGCTCTCTGCACTGCTCTGTCTCAGCCTCTGTGCCGGGGACAGCTGGATCGCGCCCTCTGCGTGGTTCAGCGATAGCGGACAGCTCTTTGTCTGGCAGTTGCGGCTGCCCCGCAGTCTGGCGGTCGTCCTGGTGGGGGCGTCGCTGGCAGTATGCGGCGTGGTGATGCAGGCGCTGTTTACCAATCCGCTGGCGGAGCCAGGCCTGCTCGGCGTTTCGAACGGCGCAGGTATCGGTCTGGTGCTCGGCGTGCTGCTGGGCAGCGGCTCGCTGTGGAGTCTGGGGCTGGCGGCGATGGCCGGTGCGCTGATAATTACGCTAATCCTGCTGCATTTTGCCCATCGTCAGCTCTCGGTCACCCGGCTGTTGCTGACCGGCGTGGCGCTGGGGATTATCTGCAGTGCCATCATGACCTGGGCGGTCTACTTCAGCACCAGTCTCGATCTGCGTCAGCTGATGTACTGGATGATGGGCGGATTCAGCGGCATTGACTGGCGCTATGGCTGGATGATGCTGGCACTGCTGCCGGTGATGCTCGCGCTGGGGGCGACCGGGCCGATTCTCAATCTGCTGGCCCTGGGCGAGACGTCCGCCCGTCAGCTCGGGCTGTCACTTTTCTACTGGCGTAACCTGCTGGTACTGGCGATGGGCTGGCTGGTGGGCATCAGCGTCGCGATGGCCGGCGCCATTGGCTTTGTCGGGCTTGTTATCCCGCATCTGCTGCGTCTGAGCGGCATCAGCGATCACCGCTATCTGCTGCCAGCATCGGCGCTGGCGGGCGCGGCGGTGCTTCTGGCCGCCGATATCGTGGCGCGGCTGGCACTGACTTCAGCGGAGTTGCCGATAGGCGTCGTGACGGCGACACTGGGCGCGCCACTGTTTATTATTTTACTGGTGAAATCTTCGCGCTAGCTTCGGCTGGCTACTCTGCATCCACAACCGCATCCATTACAGGAAAAAGCATGAATATTTATGAGACTGAACTGGTTACGCTCGACGGGGAAAAAACCACGCTGTCACAGTGGCAGGGCAAAGTCCTGCTGGTCGTCAACGTTGCCTCGAAATGTGGCCTGACGCCGCAATACGAAGAGCTGGAGAATCTGCAGAAAGCCTGGCAGGATCAGGGCTTCAGCGTCCTCGGTTTCCCCTGCAATCAGTTTCTGGAGCAGGAACCGGGCAGCAGTGAGGAGATCAAAACTTTCTGCAGCACCACCTATGGCGTAACGTTCCCGCTGTTTGCCAAAACCGAGGTTAATGGCCCGGCGCGTCATCCGCTCTATGCGCAGCTGATTGCGGCCCGGCCTGACGCGGTGCGTCCGGAAGGCAGCGGTTTTTACGAACGTATGGAGAGTAAAGGGCGCGCGCCGAAAGAGCAGGGCGACATCCTGTGGAACTTTGAAAAGTTCCTGATTGGCCGTGACGGTAGTGTGATTCAGCGTTTTGCACCTGACATGACCCCGGAAGATCCCATCATTCTGGAAACGATTAAACAGGCGCTGGCAAAATAGATGCTGTTGCGCCTGCAACAGGCTGCCGTCCCCGGACGGCTGCTGCCACTGACCGGCGAGCTTGCCGCCGGTCAGTTGCTGCACATTGCAGGCCCGAACGGAGCCGGTAAAAGCACCTTACTGAGTGTCATTGCGGGGCTGCAACCCGCCTCCGGCAGGGTGTTACTGGATGAACAGCCGCTGAGTAACTGGAACGGCGCGGCACTGTCCAGGGTGCGCGGCTGGCTGCCACAGCAGCAGGCCCCACTGAGTCAGATGCCGGTCTGGCACTATCTGCGTCTGCACCTGAAACCGGCGGGGCCACAGGCCGATACCCGTCTCAGCATGCTGCTGCAGCGGCTTCAGCTGCAGGATAAGTTATCGCGTTCACTGACCCGGCTCTCCGGCGGGGAATGGCAGCGCGTCCGGCTGGCAGCCGTCTGTGCCCAGATCGATCCTCACATCAACCCCTGCGGGCGGCTGCTGATTCTTGATGAGCCGTTGACGGCGCTGGATATCGGCCAGCAAAAAGCGGTGGACGATCTGATTGCGGCGCTGTGCGCGTCGGGTGTTAGTGTTGTCGCCAGCAGTCACGATCTTAACCACAGCCTGCAGCAGGCAGACAGGGTCTGGCTGATGGATAGCGGGCAGGTGATCGCGCAGGGCGAACCTTCAGAGGTTCTGACGCCGCAGCGCCTGACACTGCTCTACCAGATTGCTTTCCGGCAGATTGAACTTGAAGGACGCACGCTGCTGACCGTTCTGCCCTGACACTTGCTACCTGGCGGGCTTTACCGCTACATTATGCCGGTTGCAACTTCTGATAAGGACATTTGGCGATGCGCCTGGGTTTGCTGATTTTAGTTCTGCTGCTGACGGGATGCAGTCATCACGCCCCGCCGATTAATGGCCGTCTTTCCGATTCAATCACCGTGATAGCGGAACTGAACGATCAGCTCGGTCACTGGCGCGGTACGCCTTATCGTTATGGCGGCATGAGTCGTAACGGCGTTGACTGCTCCGGCTTTGTCTATCTGACCTTCCGCGATAAATTTGCATTGCAACTGCCGCGCACCACCTCAGCCCAAAGCGATATTGGCACCCGCATCAGCAAAGATGAACTGCTGCCAGGCGATCTGGTCTTTTTCAAAACCGGACGCGGTGAAAATGGCCTGCATGTGGGCATCTATGATACCGATAACGCTTTTATCCACGCCTCGACCAGTCAGGGCGTCATTCGCTCCTCACTCGACAATGTCTACTGGCGAAAAGTGTTCTGGCAGGCTCGCCGTATTTAACAGCACCTTCGCGAAATCAGACTTAAGTCCAATTTTAAATAGAGTTAAAACAACGAGTAAAAGTTTTATTTTAAATGAACGTTTAGGACGAGTATTTTAATAAACAGAATAATGGCTGATTTAATTCCACTAATAACCGGTGTTTTATCACTACTATTTGGCTTAAGCGTGATCGGTTGTTGCCAAACGGAAATGGAAGGGTAAAGAAGTTCGTCTCTTCAATGATTTGGCACTATTGCCTGCGGCTGACCATTCCTATACGATATCCCTATCGCCGCATATTCTGGCATTAAAATGAAAATTCATCTCTCGGCCGACTATCAGTCGGAAATCTGGTTTTATCCTGTCTGTGATGTTGATGGACGGTTGACCGCTGTCGAGCTGGTAACACAGTTTGTGCATGAAAGTGCGCCCATCACCCTGCCTCAGGATCTGCTGCTTCCCCAGCTTGATGAGTCGCAGCAGTTACGTCTGCTGCAAAGCCAGATTGGATTGCTGGAGCAGAATCGCGGATTATTTGAAGCGCATTCCGTGTCAGCTTTTATCCGTATTGATGAAGGCATGGCCCGGACGCTGCTTGCCAGTGAATTAGTCATGCGCAAAATAAAACAGCTGCCTTTCATTCTATTGAATATCACTGAAACGTTTCCGCAACTCAAGCTGGGCAAAAATAATCCGCTGCTGGCGAGCCTGCATGCTGAGTTTAATTTAGCGCTTTCACATTTTGGCGCGGGCAAAACACCCTCTAATGCGGTCTACGATAATTTATTCAGCTGCATCTGTCTCGACAAAGAATTTATTCACTCGCTGGCAAAACGCGCTTCATTTGTGCCATTTATTCAGAGCATTATTGATAATTTCCGCGCCCATTGTGACCGGCTGATTATTTGCGGCATTGATGACGAAGTGCTGTTCGATAAAGTCAGCCAACTGAATGGTGCCGATTTTCAGGGTGCGCTCTTCCCGGTGGTAAAATCTGACGCACTGCGTTCACTGCTCTGTGCCGACGAGCGCATGCCATCCTCTCAGCATCCCGGCTGACAGAACCTGTTAATCCCTGCTATCAGCGTTACACTAAAGGACTAACCTACGTATCGCTGAGTATCGCATGGGAGAAACAATGTCCGCAGGCAAGGCCTCTTTTGACAACACCTGGTTCCGTGAACTGACCGGATGTTATACCGCGCTGAACCCCACGCCGCTGGCAGGCGGACGCCTGCTGTATCACAACGCACCGCTTGCAACCTCAATGGGGCTGGATAGCGCGCTCTTTGAAGGTCACGGTCACGATGTCTGGCACGGTGCGGCACTGCTGCCCGGCATGCAGCCGCTGGCGCAGGTCTACAGTGGCCATCAGTTTGGTGTCTGGGCGGGTCAGCTCGGTGACGGGCGCGGTATCCTGCTGGGTGAACAGCGTCTGGATGATGGCAGCAAACTTGACTGGCATCTCAAGGGGGCAGGCCTGACGCCTTATTCCCGCATGGGCGATGGTCGTGCCGTGATCCGTTCCAGCGTCCGTGAGTTTCTCGCCTCGGAAGCGCTGCACCATCTGGGTATCCCCACCACGCGCGCACTCACTCTGTCGATAGGCGATGAGCCGGTCTACCGCGAAACCACTGAGCGTGGCGCGATGCTGATGCGCATTTCACCGAGCCACCTGCGTTTCGGTCATTTCGAGCATTTCTTTTACAGTCAGCAGCAGGAGAAAGTGCAGCAACTGGCTGACTACGCCATTCGTCATCACTGGCCGCATCTGGAAGAGGAAGCGGACCGCTATCAACAGTGGTTTACCGACATTGTGTTACGCACCGCGCGCCTGATTGCGTTGTGGCAGAGCGTGGGTTTTGCGCATGGCGTCATGAATACTGACAATATGTCGATCCTTGGCCTGACCATCGACTATGGCCCGTTTGGTTTTCTGGATGATTATCAGCCTGACTTTATCTGTAATCACAGCGACTATCAGGGGCGCTACAGCTTTGAAAATCAGCCGATGATTGGCATGTGGAATCTCAATCGTCTGGCACATGCGCTGTCGGGTCTGCTGACCACTGAGCAGTTGCGGACGGCGCTGAGCGCCTATGAACCTGAACTGATGCGGGTCTGGGGTGAGAGGATGCGGGCAAAACTCGGCCTGCTGACACAACAGAGCAATGACAATCAGATTCTGACCGATCTACTGGCGCTGATGACGCAGGAGCACACTGATTACACCCTGACGTTCCGGCTGCTGAGTGAAACCCAACAGGCTGAAAGCCGTTCAGCGCTGCGCGACGAGTTCATCGATCGTGAGGCGTTTGATGGCTGGTATCAGCGCTACCGCAACCGTCTCGTGGATGAGCAGGTTAGGGATGAAGAACGACAGGCTGTGATGAAGGTGGCTAATCCGGCTGTCATTTTGCGCAACTATCTGGCGCAGCAGGCGATTGAAGAGGCGGAGCGGGGAGAGCAGGGGGCGCTGGCCCGTCTGCATCAGGCGTTACAGCAGCCCTTCAGCGATGAGACAGCCGCTGAGTATCGTCAGCGGCCGCCAGACTGGGGCAAAACCTTAGAGGTGAGCTGCTCCAGCTGATTCAGACTCAGAAGCGGTGAGAAACCGCCTCGGCCAGCAGGGCGAGAACCTGTTCACTCTCCTGCCAGCCGAGACAGGGATCGGTTATCGACTGGCCATAAACCAGTGGTTCACCGCTTACCACTTTCTGATTGCCTTCTTCCAGGAAGCTTTCAATCATCACACCGGCAACGGCCCGCGATCCTGATTTAATCTGCTCTGCTACCTCTGCCGCCACATCCATCTGACGACGATGCTGCTTCAGACAGTTGCCGTGACTGAAATCGATCACCAGCTGCTCTGGCAGATTAAAGGCCGCCAGACTCTCCGCGGCGGCGGCGACATCGCTGGCATGATAGTTAGGCTGTTTACCGCCTCGCAGGATCACATGACCATGCGGGTTGCCACTGGTCTGGTAGATAGTCATCTGGCCCAGTTTATCCGGCGACAGGAACATGTGGCTGGCGCGTGCGGCGCGAATCGCATCCACCGCGATCTGCACGTTTCCATCGGTGCCGTTTTTAAAGCCAACCGGGCAGGAGAGGGCAGAGGCCATTTCGCGGTGGATCTGGCTTTCGGTGGTACGTGCGCCAATCGCGCCCCAGCTGATGAGGTCAGCAATAAACTGGCCGATCACCATATCCAGAAATTCTGTGGCGGTGGGCAGGCCCAGCGCGTTGATATCAATCAGCAGCTTGCGGGCGATGCCGATACCCCGGTTCACATCGTAACTGCCATCCAGGTCCGGATCGGAGATTAACCCTTTCCAGCCTACGACGGTACGAGGCTTCTCGAAGTAGGCACGCATCACAATCTCCAGCCGATCTTCATAGCGGATGCGTAACGCGTTCAGACGCGCAGCATACTCCAGGGCTGCTTTCGGGTCGTGCAGCGAACAGGGCCCGATCACTACCAGCAGGCGCGGATCGTCACCCGTCAATATACGGGCGATACGTTTACGGGCAACGGTGACGTTGTCGGCAATCGCCGCGGAGACAGGATGTTCAGCCGCCAGACTGGCGGGGGTAATCAGACTACCAATGCGTGCGGTACGCAGTTCGTCGGTTTTTTTCAT
Protein sequences of DBSCAN-SWA_4 >CP028349|2303135:2316986|2313654_2314401_+|AVV37672.1|DBSCAN-SWA MKIHLSADYQSEIWFYPVCDVDGRLTAVELVTQFVHESAPITLPQDLLLPQLDESQQLRLLQSQIGLLEQNRGLFEAHSVSAFIRIDEGMARTLLASELVMRKIKQLPFILLNITETFPQLKLGKNNPLLASLHAEFNLALSHFGAGKTPSNAVYDNLFSCICLDKEFIHSLAKRASFVPFIQSIIDNFRAHCDRLIICGIDDEVLFDKVSQLNGADFQGALFPVVKSDALRSLLCADERMPSSQHPG >CP028349|2303135:2316986|2311537_2312083_+|AVV37669.1|DBSCAN-SWA MNIYETELVTLDGEKTTLSQWQGKVLLVVNVASKCGLTPQYEELENLQKAWQDQGFSVLGFPCNQFLEQEPGSSEEIKTFCSTTYGVTFPLFAKTEVNGPARHPLYAQLIAARPDAVRPEGSGFYERMESKGRAPKEQGDILWNFEKFLIGRDGSVIQRFAPDMTPEDPIILETIKQALAK >CP028349|2303135:2316986|2314480_2315932_+|AVV37673.1|DBSCAN-SWA MSAGKASFDNTWFRELTGCYTALNPTPLAGGRLLYHNAPLATSMGLDSALFEGHGHDVWHGAALLPGMQPLAQVYSGHQFGVWAGQLGDGRGILLGEQRLDDGSKLDWHLKGAGLTPYSRMGDGRAVIRSSVREFLASEALHHLGIPTTRALTLSIGDEPVYRETTERGAMLMRISPSHLRFGHFEHFFYSQQQEKVQQLADYAIRHHWPHLEEEADRYQQWFTDIVLRTARLIALWQSVGFAHGVMNTDNMSILGLTIDYGPFGFLDDYQPDFICNHSDYQGRYSFENQPMIGMWNLNRLAHALSGLLTTEQLRTALSAYEPELMRVWGERMRAKLGLLTQQSNDNQILTDLLALMTQEHTDYTLTFRLLSETQQAESRSALRDEFIDREAFDGWYQRYRNRLVDEQVRDEERQAVMKVANPAVILRNYLAQQAIEEAERGEQGALARLHQALQQPFSDETAAEYRQRPPDWGKTLEVSCSS >CP028349|2303135:2316986|2305067_2305619_+|AVV37662.1|DBSCAN-SWA MKGGKRVLPTRPNKINSEIRATEVRLTGMDGEPIGIVTLREALEKAEEAGGDLVEISPNAEPPVCRIMDYGKFLYEKSKSSKEQKKKQKVIQVKEIKFRPGTDDGDYQVKLRNLIRFLEDGDKAKITLRFRGREMAHQQIGMEVLNRVRKDLCEDLDLAIVESFPSKIEGRQMIMVLAPKKKQ >CP028349|2303135:2316986|2310021_2310324_+|AVV37667.1|DBSCAN-SWA MALTKAEMSEYLFEKLGLSKRDAKELVELFFEEVRRALENGEQVKLSGFGNFDLRDKNQRPGRNPKTGEDIPITARRVVTFRPGQKLKSRVENATPKETD >CP028349|2303135:2316986|2303135_2305064_+|AVV37661.1|tRNA|DBSCAN-SWA MPVITLPDGSQRVFDRPVSVMDIALDIGPGLAKACIAGRVNGELVDAADPITDDAAVAIITAKDEAGLEIIRHSCAHLLGHAIKQLWPDTKMAIGPVIDNGFYYDVDIDRTLTQEDIDLLEKRMHQLAETNYDVVKKKVSWQEARDVFAARGEIYKTTILDENISHDDKPGLYHHEEYVDMCRGPHVPNMRFCHHFKLQKISGAYWRGDSSNKMLQRIYGTAWADKKQLAAYLQRLEEAAKRDHRKIGKQLDLYHMQEEAPGMVFWHNDGWTIFRELEVFVRSKLKEYQYQEVKGPFMMDRVLWEKTGHWENYKEAMFTTSSENREYCIKPMNCPGHVQIFNQGLKSYRDLPLRMAEFGSCHRNEPSGALHGLMRVRGFTQDDAHVFCTEEQVRDEVNSCIRMVYDMYSTFGFEKIVVKLSTRPEKRIGTDEMWDRAEEDLAAALNENNIEFEFQPGEGAFYGPKIEFTLYDCLDRAWQCGTVQLDFSLPKRLEATYVGENNDRQTPVMIHRAILGSMERFIGILTEEFAGFFPTWLAPLQVVVMNITDGQAEYVESLTRKLQNAGIRVKADLRNEKIGFKIREHTLRRVPYMLVCGDKEVESGKVAVRTRRGKDLGSMDVDLFVEKLQQEIRSRNLHQLEE >CP028349|2303135:2316986|2312901_2313360_+|AVV37671.1|DBSCAN-SWA MRLGLLILVLLLTGCSHHAPPINGRLSDSITVIAELNDQLGHWRGTPYRYGGMSRNGVDCSGFVYLTFRDKFALQLPRTTSAQSDIGTRISKDELLPGDLVFFKTGRGENGLHVGIYDTDNAFIHASTSQGVIRSSLDNVYWRKVFWQARRI >CP028349|2303135:2316986|2305718_2305916_+|AVV37663.1|DBSCAN-SWA MPKIKTVRGAAKRFKKTASGGFKRKHANLRHILTKKSTKRKRHLRPKGMVSKGDLGLVIACLPYA >CP028349|2303135:2316986|2310503_2311487_+|AVV37668.1|DBSCAN-SWA MSLLDTFARQADQRGQKWLIALALMLSALLCLSLCAGDSWIAPSAWFSDSGQLFVWQLRLPRSLAVVLVGASLAVCGVVMQALFTNPLAEPGLLGVSNGAGIGLVLGVLLGSGSLWSLGLAAMAGALIITLILLHFAHRQLSVTRLLLTGVALGIICSAIMTWAVYFSTSLDLRQLMYWMMGGFSGIDWRYGWMMLALLPVMLALGATGPILNLLALGETSARQLGLSLFYWRNLLVLAMGWLVGISVAMAGAIGFVGLVIPHLLRLSGISDHRYLLPASALAGAAVLLAADIVARLALTSAELPIGVVTATLGAPLFIILLVKSSR >CP028349|2303135:2316986|2306631_2307615_+|AVV37665.1|tRNA|DBSCAN-SWA MSQLAELVASATAAIDGATDIAALDAVRVEYLGKKGHLTLQMTTLRELPAEERPAAGAVINEAKQQVTDRLNARKDALETAVLNARLAEETIDVSLPGRRIENGGLHPVTRTMDRIETFFGELGFGVVTGPEIEDDYHNFDALNIPAHHPARADHDTFWFDATRLLRTQTSGVQIRTMQAQQPPIRIIAPGRVYRNDYDQTHTPMFHQMEGLIVDKNISFTNLKGTLHDFLNNFFEEDLQIRFRPSYFPFTEPSAEVDVMGKNGKWLEVLGCGMVHPNVLRNVGIDPEVYSGFAFGMGMERLTMLRYGVTDLRAFFENDLRFLKQFK >CP028349|2303135:2316986|2312083_2312833_+|AVV37670.1|DBSCAN-SWA MLLRLQQAAVPGRLLPLTGELAAGQLLHIAGPNGAGKSTLLSVIAGLQPASGRVLLDEQPLSNWNGAALSRVRGWLPQQQAPLSQMPVWHYLRLHLKPAGPQADTRLSMLLQRLQLQDKLSRSLTRLSGGEWQRVRLAAVCAQIDPHINPCGRLLILDEPLTALDIGQQKAVDDLIAALCASGVSVVASSHDLNHSLQQADRVWLMDSGQVIAQGEPSEVLTPQRLTLLYQIAFRQIELEGRTLLTVLP >CP028349|2303135:2316986|2315939_2316986_-|AVV37674.1|DBSCAN-SWA MKKTDELRTARIGSLITPASLAAEHPVSAAIADNVTVARKRIARILTGDDPRLLVVIGPCSLHDPKAALEYAARLNALRIRYEDRLEIVMRAYFEKPRTVVGWKGLISDPDLDGSYDVNRGIGIARKLLIDINALGLPTATEFLDMVIGQFIADLISWGAIGARTTESQIHREMASALSCPVGFKNGTDGNVQIAVDAIRAARASHMFLSPDKLGQMTIYQTSGNPHGHVILRGGKQPNYHASDVAAAAESLAAFNLPEQLVIDFSHGNCLKQHRRQMDVAAEVAEQIKSGSRAVAGVMIESFLEEGNQKVVSGEPLVYGQSITDPCLGWQESEQVLALLAEAVSHRF >CP028349|2303135:2316986|2307629_2310017_+|AVV37666.1|tRNA|DBSCAN-SWA MKFSELWLREWVNPALDSAALSEQITMAGLEVDGVDAVAGVFHGVVVGEVVECGQHPNADKLRVTKINVGGDRLLDIVCGAPNCRQGLKVAVATVGAVLPGDFKIKAAKLRGEPSEGMLCSFSELGISDDHNGIIELPSDAPVGTDIRAYLQLDDNTIEISVTPNRADCLGLIGIARDVAVLNGLPLNVPEMQPVAATLNDTFPIQVDAPEACPRYLGRVVKGINVKAATPLWMKEKLRRCGIRSIDPVVDITNYVLLELGQPMHAFDLDRIDGGIVVRMAKEGEKLTLLDGTEATLQSDTLVIADHQKALAMAGIFGGEHSGVNGETQNILFECAYFDPLSITGRARRHGLHTDASHRYERGVDPALQHTALERATQLLLSICGGEAGPIIDQTHQAALPVPATITLRREKLDRLIGHVIADEQVTDILTRLGCDVTAGEGQWQAVAPSWRFDMAIEEDLVEEVARIYGYNNIPDVPVQAGLVMTRHREANLSLKRAKNLLVDKGYQEAITYSFVDPKIQQLLHPGEEALLLPSPISSDMSTMRLSLWTGLLSAVVYNQNRQQSRVRLFESGLRFVPDTQADLGIRQDLMLAGVISGNRVEEHWDLARQTVDFYDLKGDLESLLDLTGKLDEISFRAEANPALHPGQSAAIYLHDEHIGFIGVVHPELERKLDLNGRTLVFELLWNKVADRVLPDAREISRFPANRRDIAVVVAENVPAADIIAECKKVGVNQVVGVNLFDVYRGKGVSEGYKSLAISLILQDTSRTLEEEEIAATVGKCVAALKERFQATLRD >CP028349|2303135:2316986|2305960_2306317_+|AVV37664.1|DBSCAN-SWA MARVKRGVVARARHKKILKQAKGYYGARSRVYRVAFQAVIKAGQYAYRDRRQRKRQFRQLWIARINAAARQNGMSYSRFINGLKKAAIEIDRKILADIAVFDKVAFSALVEKAKSALA |
14 | Tupanvirus(11.11%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2447444 : 2487166
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP028349|2447444:2487166|DBSCAN-SWA TTTAGCGGAAGGCGTAACCCAGCGTCACACCGACGCTGTAGTTGCGGCCTGGCGCAGATTCGAAGTAGCGACCATTGCTCTCATTCACAATTACTGAACCCACATAGTGCCGGTCAAAAAGGTTATCGACCCGGCCAAACAGATCCAGCGTCCAGTTATCCACCAGCCACTTATAGCCACTATTCACCGCCGCCACGGTATAGGACGGGGTATTCACGTTGTTCTGATCTTCAGCGGCGATCTGGCTCAGATAACGCACTTCACTGCCTGCGTAAAAGCCCTGTTCCGGCAGATAGCCGAGTCCGGCGTACACCATGTTCCGCGCGATGCCCGGAATGCGGTTGCCGTCGCAGCTGTCGCTGCCACAGGCGTTGCTGCGATAGCGCGCATCCAGCAGCGTATAGGCCATTTTCAGCCGCCAGTCCCAGGCAAACTGCTGGTCCAGGCTTAACTCCAGACCGCGGCGACGGGTCTGACCGGCATTTTTATAGGTGGTGCGTCCGCCACTGCTGGCATCCGCCACAATCTCATCGCGGGTATCGGTCTGGAACAGCGCCGCGCTGATCAGGCCGTTGCCGATGCGTTTTTTGCTGCCGATCTCCAGCGTGTCGCTGGTGGCCGGTTTCAGCCCCAGATTCAGGCCGGTTGCGCCATCTGAACGGTATGAAAGCTCATTAATAGTTGGAGTTTCAAAGCCGCGTCCGGCCGAGATCCACGCGTTCCAGCTGGGATCAAAGGCGTACTTCAGCGCCGCCGCGGGCAGCCACTTGTGATAGCGCGCCTCACCGCTGTCGTCGCCGTTACCAGGCCGGATATAGAAGTCATTAGAGTCAAAGTTGACGGTGCTGAAGCGCACGCCTGCATCAAGCGACAGTTTATCGGTGAGCTGCCAGTTGGTCTGCACATAAGGATCGAGCGTCCACATCAGGTTGCGCTCGTTGCGGCGCAGATTGCCCTGTTCGCCCAGCTGTGTGACGCCGTTGCTGACCGTGAAGTTTTCATAGCCTTTGCGGCGCTCGGTCATGGTTTCGTAGTCGAGTCCACCGGTCACCGCCACCGGCATCGACAGCAGCGTGTCGCGGTGGGTCCAGCGGGTATCGACGCCCTGATAGTGGCGGGTCAGGGCGATCACGCCACCGGGATGAGACGGATTGCGCTGCACGGTGGCGGGAATCGACTGGAATTGGGTGGTTTCGCGCATCCCCGCGTAGAGCATCACGCTGAGGTCGTCGTTTTCACTCATCTGGCGCTGATAACGCAGGCCGCCCTGGGTCTGATCCACTGTTTTACGGGTGTTGTAGAGCGTAACGTTGCTGACCACCTGGCGCGGATTGTCACGCCATTGCGCCTCGGTTAAGCCGCCGGGATCCTGAGCGTCGATGTGCACGCTGTTAAACAGCAGCGTCAGGGTGCTGACATCATCGATGCGAACGCCAAGCCGGGCGTTGCCGAGGTTCTTCTGCGCAGAACTGTGATCGCGATAGCCATGGGTGGTAAAGCGGGAGGCGGAGACGGTGTAATTCACATCCCCGGCGTGCGTGCCGTCACCGGTCGCGCCACTGGCTTTGACGCTGTTGCGCCAGCTGCCGTAACTGCCGTACCAGCTTCCTGCTTCCAGCGTGGTTGGCTGCTGACCCTGCTGGGTGGTCACATTGATGACGCCGCCCGAGGCGTTGCCGTAGAGCGCAGAAAAGGGGCCACGCAGCACCTCGACATGATCAATCGAGCCGATATCAATGTTGGAGGTCTGCGCCTGGCCGTCCGGCATGGTGGCCGGAATCCCGTCCACATAGATGCGCAGACCGCGCACGCCATAGGTGGAGCGGGAGCCAAAGCCGCGCATCGACAGCTGCAGGTCCTGTGCGTAGTTCTGTCTCTTATGTTTAGAGTTCTGGCGGTCTCGAATCTAATGTTCATCTAAAAACACTAGCGCTCATTAATCACCAATTATGTACATATGTCTTTGATTTTAAATATGATATTTTCTCATTTAGTTGCACTTGCCATCACTTCTGTACACCTATAAACATCTGTGTACAATGCCATTTAAGTACAAATTGAGTCATGATATTTGATGGTGAGTACATAATGGCTATTAGTGATACGAAACTGCGCGGGTTACATGGCAAACCTTACAGCGGTCCAGCGGAAATAACCGATGCTGACGGGTTAGGGATAAGGATAACCCCTAAAGGGATAGTCAGCTTTCAATACCGGTACAGAATAAATGGCAGTCAACACAGACTTGGGATAGGCCGTTACCCCGGGGTATCTCTACGTGATGCTCGTATAAAAGTCGGTGAGTATAAATCTCTCATTGCTGAAGGGATCGATCCTAAGCATCAACTTACTGTTAAAAAGAATAAGCCAACCGTACATGAGTGTATCAAGTATTGGTATGACAATTACGTATTGCAGTCACTGAGAAAAAGCACCGCTGAAGTGTATGAGCGCATAGTGCTAAACGAGATGGAAAAGTATTTTGCGGATATACCAATTGAGCACATCCCGGTCAGTGCCTGGGTAGACTTTTTCACGGAGCAAGAACAGGCGAACCCATTAAAAGCCCGTAAACTATTGGTGCATTTGCGCGGGGCAATAGCCTGGTGCTCACGAAGACAGTTTATTGAGGACTCATCGCTTCTGAGATTGAACCCCAAAGAGTTTGGCAGGAACCCTAAAACCGGCGACACAGTTCTGACCTATCGGCAGTTAGCAAAGATATGGGTTGAGAACGAAAAATCCACCGCAACGCTTTCGAGCAAAATGCTCATCAAGTCGCTCATTTTGTATGGATCACGCAACAGCGAATTGAGAGAGTCACGAAAGGAGGATTTTGATTTTGAAGAGGGTATCTGGACATTGCCCTCAGAACGTAGCAAAACAAATAAAATCATCAGGAGGCCGATCTTCAAACAAATCGAGCCACTCCTCAAACAGTCCATCGATAATGGCAATGGCATTCTGTTTCACGGTGCTTTCGAAAGAAACATTCCGCTGAGCATCGGCTCATCAACCAGGTATGTCCGATTGCTTCGTGATCAGCTTAACTTTGGTGATTTTACCGCTCATGACTTCCGAAGAACGATGGCCACTCGATTAGCAGAGGAAGGGATTGCGCCCCATGTCATTGAGAAAATGCTGGGGCATGATCTTGGAGGCGTGCTTGCAGTGTATAACAAGCACGATTGGTTAGCGGAACAGAAGGTTGCTTATGAACTCTACGCAGACAAGATTTTCGAGCAGATCAAGCTGATCTCTGATTAACGCCGCCGTTTAAAATCCATTGCTCCACCTCAGCCAGTAAGTATTTTTTAGGGCGATTTCTTACTGGCTTGGGGAAGGAATGATTCAGCACATAAGTCCTCATTGTTGTTCGTGAGGTAACACCAATTTTTTGCATCGCTTCGCTTTCAAAGATCATTTCAATATTAGCCATTTTTTTCTCCACACATTCCTGCTGCATCAGGTTTGTTTAGCCGTGACAGGTCACGGCGTATTGATATTCAATTTCAGCTTATGCCAGCCTCTGGTGGCCCAGCATGCTGCTTGACCCTGGCAAGGGCATGACTGCACCGGCAGCTGCTCTTTGCATTTCCCGCACTGCTGGTGGGACAGCGCTTCAATCTGCTTAGCCAGCTCAGCGGAGTCTTTCCGAATTAACAGCGCGATATACTCTTTCAGCTCATACGGGTCACGACCGGGGCGGCGTGCGGCGCAGTTCTGCGCCAGCATCTCCAGTTCCTGACTATCCAGCGCCAGCTCCAGTTTTTTATCACCGGCAGCGGCCTGTCTGGCACGCTGCGCTGCTTTGCGTTCGGCGGGGGATTTAGGCATCAACCCGCCTCCTTCAAAAAGATTATCCAGTGTGTTTTGTCGCCTTTGCCGGTTCGCTGCCAGATGGTTGGCTTCTGTTCGGTTAGTGCGATTACCTGGCTCACCGGTATCTGTGTCTCATTCCATTTAAAAATCAGCGTGCCGTGTGGCCGCAAAACACGAAATGCCTCACTGAAACCGACGCGGATATCGTCGCGCCATGTCTGCTTATCCAGCGCACCGTACTTTTTCCGCATCCAGCCGTTCTCTCCAGCGCGGTCCAGGTGAGGTGGATCAAACACCACCTGTGCGAAGCTGCAGTCAGGAAAGGGCAAAGAACGGAAGTCTGCGATGATGTCGGGGTTAACATGCAAAGCACGGTTATCACACAGCACATGTGACTCTTTACGAATGTCTGCGAATATCGCGCGTCTGTCTTTCTTGTCGAGCCAGAACATGCGGGAACCACAGCACATATCGAGGATCGGTTGCTCCATCACTCACCATCCTTACCGGCGCGGAGTTGGCTTACAAACTTGACAGCTTTGCTTTTTGAAACCTCAATAGCTCTGTTAAAGCCGGTTTCGTAGTCTCGACCCTCACCATCGCCATATGGGCTGCCCAATTCTTTCGAAAACATCTCCACGCCCTCAGCCCGCACAGAGTTGAGGTAGGAGTCGGTGGCTGGGGTTGCTTTCACCATCCATACAATGTTATCTGTTTCGGAAGGCTCCTCATGGCTGCACACAGGGCAAACCGAGAAGCCCGCTGCATGGATGGCTACATTTGCTTTGATGGATGCATTCTCAGCCGCCAGCTCATCACGCTGCTGCTCCAGCGCCTGGAATGCTTCGGCAATGGCGAGGATGTTGGCAGGGCTGCAAAGCTCCAGATAATCGCAAAGCCATGATGCTTCGTTTGAGTTGCTCCCGGTGGAGATTACAGCTTCGCCGCGACTCTCGCAGATTATCTCGTGGGCTTGTTTGCCGTCTTTATCCTGGCAAAGCGCGCTACTCCAATCGCTGCCGTGCCAGTCCGCAACAAACTTCGCTTGCTCAACTAATTCATTCAGCTTTTCCATCACTCACCATCCTTATCCCATGCATTCCAGATATAGCCCGCTGGCAATCAGACGAGCACGGCGTGCAGCTGCTTCACGATGGCGCTTAATAGCCTCTTCAGAGCGGTCGTTGCTGTAGTTGATAACCATTGGCTTACATGGCGGGGGAGCAACACGGCGCGGATTTCTGATTAGGGTGTAAGTGCGGTCAATTGATCCGCCACCGAGACAGACCTGATTGGATGCCTCAACCTGAAGTGTTTCACCACCGCGGCGCATGATGTGCAGGACCAAACGGTTGAACTCACTGAGGGTCATACCGAGTTGCTGCGCCAGCTCCCGACCCGTTGCCGGGCCTTTGGATAACTGCCAGGCCAGCTTTTCACTAAAACCAGCATTTGGGCCATTGCTGCGGCGAAATTGGGCGACCTTTTTCATGACACCACCTTCAGCGTTCCCGTACGTGAGCGGAGTAAATCCATTTCCATTTGGGAAAGGATGTTGATCGCTTGTAAAGTGCCTGGCAGCTGTTGATTACCCATAGTTGCTACAGCCTGACGTGCCTCACCGATTGCTTCACCGCGCAGTGTTCGGATCCACTGGTCGCAGGCTGGCGTAGCAAGTGCTGCATTCAGGTCATCAATCAGCGTCAGGTCTGCGCCTGCAGCCTGTAAGGCTGAAATGGTGTCAGGTAGCACGCTGTTGATGCGCAGAACCTCCGAAGCCATTAGGCTGGCGCGGACGGTGGCAACATCAAGACGCGTAGCCAAATCACTAACCATCTTTGCCATTTCCAGCAGAGAGGTTTCTTTACCGATGATCCTGGCGAACTGGTGGCCAGCAGCGACGACTTCTTTATTCGATTTAGAAGAAAGCATGTTGCTGGCCCTCAGTGGATGGTGATGTTGATGGTTTTATTAAGCCGCTCAGCTTCACGCTGCGCCTTAATGGGATTACTGATTACCGAGCCATCAGGCATAATCCAGCCGTTCAGGATATGGCTGTAGGGCAGGGTAATGATGCCAACGGTGATATGGTCATCAGGCTTTTGCATCGCTGTACTCCCTGCGTGCTTTATCAAACTCGCTACCAACGATCTCTACTGCGCCAAAGTCCGGTTTAGGCGTTGCGCCTGTCTGAACGTAGATGGCGTCACACTGGCGGAAGTAAGTTATTCCGCATAAAAAAAGCATCCCCCAGTCCAAGCCTATTGCCTTGAAAAACTCGTCATTGCTGACCTTGGTTTTCGGGTAATGCTCACCCCACAGCTTCTTCACTGCTGCATGCTCTTCTTTCATGCCTTTAGGAGGGCGTGCTTTAGGCCAGGATGCATACCCGGTGTTACCGGTTGGCACTGTCCATAACTCTTTAGCCAGGTAAGGTGCCGCATCAAAATTCACACCGTAGAAGGTTGAGCGCGTCAGATCGCTTTTGAATACGGGTTTGCCACCCAGCATTGAAGTCAGCTCCGCCGCTTCTTTACGCATCTGCGCTTCATCAGCACGCGTCTTTTCCCATGCTGCTAACGCCTCAGCATTCGTAAACTTCCAGTAGCCCATAACTGTCTCCACACACGATTTTTGGTTGCATGAATCCCTTGCCAGTGATGGCAATAAAAAACTTTTGGGATTCGTTTAAGTTGGCTGGTGGGTTACTGCAATAACCCACAGCCCGATTACTCCACACACTTGAAAGGTTGCTGCGGTGCCGGGTGCCTCCCGGTGCTCTGGTCAGACTGACAAACACCAGAGCGGAAACTCTTAGACTGTGTGCAATCTTTGTCAGTCTTCCGCGCGCGCTGGCCGCATTCACCACAACGGAAAGAGCCATCTCCATGCTATGCAAGGGTGGAACCTGCTCCCACCAAATGACTCTTACCTGTTGTGTGCTGACCTCCCAGCCAGCTTGGTTCGGGTCACACGCAATCACGTGCTTTACGAACTGGCAGACTTTTACGGTGCTGCCCCCGTTGTTATGGCGACCGGTGCTGATCTCCGGCATTTGCAGATAGAGTCAATAAGGTGTGTGGAACTATCAACTCACCCGCGCATCAGCCTGCGCATTCACCACAACGAAAAGTACACTTACTCCACGTCTCTAAAGCGTTCGAAAATACCCGCTTTGCAAATGTCCTTGTCGTTGTGAAAAAGGGCGGTTAAACAAACCTTCATGAGTAACCGCCAACACAGCAATTCAGTACTCTTAAAACGCTGGTCCGCGAACCACGTTTTCAACATCACACTGCACACTCACTACACCGGCATCACCACAGCAGATCACATCTGCATCCGGGAAGAGACGGAGAAAGGTAATCAGGTCCCGGACTGTTGTGTTCGACATGTTCTTAATCATCTTCATTACCGCCTCTTCAGAGAGAACTAAATCCCAACTCGTAAGAGGCGATTAACACTGGCAACCTCATTGTTGGATTTAATGTAGGATAACCAACAATAGGATGTCAAGTATGTTTGTAGGAAAGCCTACATTTAGGGCGAAAAAAAACCGGATGGGTATCCGGCTTAGTTTTTTGAGGAGGGAATTAGAATTCCATGATCACTTGTTTTACCACTCCGACGATCCGGCAGTTCCCATCACATTCAATGGTTTTGTAATTTGGATTTAGAGGAACAAGGTAGCGGTGAGGGAAATCCTCAACAAACTTTTTCAGGGTAGCTTCTGACCCGCCATCAATATGAGCAACGACAATTTTGCCGTTAATTGCGGCGACATCATGAATTTCAGGCTCGACGATGACAATCGAATCTTCCGGGATGGTTGGATTCCCATGTGGATTCGTCATTGAGTCGCCACGAACCCTTAGTGCAAACGCTTTATCAGAGACCGATGCAGTGGTGTAGAGCCACATGATGGCCTCATCGCGGGTTAACCCTGGGTCTGTAGCTGTCCAGGTGCCAGCTTGTACCCATGAAATTAAAGGCACCTCTTTTGCACTGACTTGAACAGGTCTGAGTAATGGACTTGATATGGATTCTCCTTTCCCACTAACCAACCAGGTTGGATCGCATTTAAGCGCGTCCGCGAGTGCCTGAAGGTTAGCCCCGTTTGGCTGGTAATCATCCTTCTCCCAGCCTGTAACAGTCACGCGATTAACACCGGCCTTTTCTGCCAGTGCTTGTTGCGTGAGATTAAGTTCTTTCCGCTTAAGGCGGATGCGTTCACCCATATTCATCATGTAGGCGATCCTACCAAAATCGTAGGTAAGAATCTTGACATTCATATGTTGGATATCCTACATTGTGTGTCACGCCAACCCATTAAAGGAATTGCTCTCATGAGGAAGCAGGACGTAATCAAATTCTTTGGAGGGGTATGTAAAACCGCAGCAGTTTTAGGAATTAAACATCCGTCAGTTTCTGAGTGGCCGGAAGTAATTCCTGAGGGTCGTGCATACCAGATCGAAAAAATCACCAAAGGTAAGCTGCGCTTCGATGCGTCGTTGTACCAAAAAGATACAGGCCAACCGGCCTAACTGGAACTACCAAAGGAGATAGAAATGGTAGACACAATCAACACAGCAATTCGACTGATGTGCAAAGCACACAAAGCGGGTCGTTTAGGTATGGCCGATGACTTAGGCATGACCATCGATCAGTTTCACAACCACATGTACCGCAAGTGTGGCAGTCGTTTCTTCACCCTGGATGAGCTCATGAAAATGGAAGATTTATCCGGTACTGCATGCCTGGCAGACTTTTTCGCGACACGTCACGGAAAGCTGCTGGTGGATGTATCTGCAGTGAAAGAGGTGGATAAGGTCGATCTGTATGACATCGAGATGAAAGCGAGCGCAGCAGCTGGTGAGTTAGCAATTGCCAAAATTGCCGCTGCTTCTGACGGTGTGATCGACAGCAAAGAGCGTAAAACCCTGTCCGCATTGTTCCACAAAAAAATGCGTCACCAGATTCATGGATTTCTTGGTTTCATGGCGCTGTATGGCGTCGGCGTTGCTGAGCACTCGGTAGATATGTTCGTGGCGAACGGCAGGAAGATTGATGCGTCAGGTGTGCAGATCGAAGTGCAGGATATTTGAAATGAAAAACTATTTTAGCCCTAAAAAGGTGACACCCGCAGGATTGCAGTCCCCGGGTGTCTGTCGCGATTTTATCAACGTGTGTGGAGAATCAATCGCATGTCCATTGTAAGCCAAAAATTATCGGTTGAGCAATTCCGTTGCCGTTTTATTGCTGGCGTTCCTGTCTATGAGCAAATCATACCGACAGCTGGTGGCCCTAACAACTACCAGACAACTACTCGTCTGGTAGTTGAGTCCGCGTGGAAGACTTTCTACCGCTGTCCGGCGCAATCAGGTGTGAATTGATGGAAAACGAGATCATTAAACCCTGGGTGGAACGCTACAAAGACCCGCGCGGCGTGGTTGTGGAGACGGTTGGTGTAGACGTGGTTAATCATCGCGTGATTTACATGCGCCCCAACTATCCGCACCCATGCATGCAGCCCCGCGTTCTGTTCAGTCAGAAGTTCAGGAAGGTGGCGTCATGAGTTTATTGCTGAAAGTTAAGCCTCTGGTCATTAGTCCGGCCCTTGCGCAGCGTATTGGGCTGAATGAGGCCATTGTGCTGCAACAGATTTGCTACTGGCTCGAAGATACAGCATCTGGTGTCGAATATGACGGCAAACGCTGGGTTTATAACACTATCGAAGAGTGGACTAATCAGTTCCCGTTCTGGTCGTCTGACACGGTTAAACGCGCTCTGACCTCTCTCAAAAAGCACGATTTAATTTTTGTCGAGCAACTGAAAAAATCTCAGCATGATCGGACTAATTATTACGCAATTAACCACGCAAACCCTTTATTGACCGATGAGGGCAAATTGCACTCATCGAAGGATGCAACTTGCACCAATCGAGTAGAGCAACCTGCACCAATCGATAAGGGCAACATGCCCTCATCCATCGGGGCAAATTGCCCTCGTCTTACAGAGAATACAACAGAGATTACTACAGAGATTACAACAACCCCTTCTTGTCAGGTTGCTGCGCAACCAGACGATGAGTGGTCCCTGGTTAATCGTTCTCGGGAAGTCTTACGCCATCTGAACAAAGTTACTGGCGCTAAGCACACAGAGGCGCAGTCGTCGATGGGTCACATCAAATCTCGGCTGAAAGATGGATTTACGGTGGAAGAGCTTTGCCTGGTGGTGGATTACAAACACGCCCACTGGGAAGGCACCGAGGAATACCAGTACATGCGCCCTAAGACGCTGTTCATCCCCGGAAACCTGCCTGGCTATCTCCAGATAGCAACCAAGTGGGATATGCATGGTCGCCCGCCGCGCTCTGAATGGAATGCTCTGAAGCGCAACATGCAGCGGGATATCACTGTCATTCCGCAGCCTGACAGCTCAGTGCCTGACGGCTTTCGCGGGGCATAAGGGGAGAAAATCATGACCAACCACGAATCAAAAATTCTTGAACTGATTACCCGGAATGGTCCGCTGAAGGTTCGCGAACTCTGCAAGCTCACTGGTCTGCATGAGACTTCAGTGAAGCGCTTTATCAAACCGTTGTTCACCAAAGGGCTGCTTAAGCGTGCCAGTGACTGGAGTTACTCGATCAACACTGACCCGTTACCGGTAGAAAGCGAGAAATACAGCCACATGGCGAAGCAGGCCAGCGAACTGGAGGCGAAAGGATTCTGGTTGCGTGCCGCGCAGGTCTGGCGTGAAGCGATGCTGGCGGCCCGGTTCGATGCATCCCGCAACGAAGCCAAAGAGAATTGCGACCGCTGCGCCGTGAAAGGCTCACTCAACTGTGGCAGCTACGGTGGACTCGATACCGGTCGCATCATTTCAGCCAGTGTTAACAGGGATTTGTTATGAAAGCGCACCTGAAGAACCACTACCAACGCAATGAGATTTTCTACCGGGCCATCCCCACAGCAGTAGTGATGATTGCCGCCCTGATTTTTGTCCTGACATGGGAGCTGACCACAGCATGAGTACTTTAGCGCGCATTTACGACGATAAGAAAAACAGCGACACCGATATCACCACCCGCAAAACCTACCTGCTGGGCGTTGATGAGCTCTATGTTGAAACCAATTACAACATCCGTGATATCGATCAGACCCATGTCGAGGAGTTCCGCGATGCCTTTATCGCTGGTGAGCATGTGCCTCCGCTGGCTGTTAAGGTCACCGAGAAGGGCATTAAGATCATCGATGGCCATCACCGGTATTACGGCGCGAAGCTGGCTCAGGAAGCGGGCTATACGCTGCGCCTTGAGTGCAAAGACTTCGTGGGGAGTGAGGCGGACAGCGTGGCGTTCATGGTCACCAGCAGCCAGGGACGCGCCCTGTTGCCGCTGGAACGTGCAGCCGCCTATCAGCGCCTCGTTAATCAGGGCTTAGAGCCAGCGGAAATTGCCGCCAAGGTGAAACGTTCGATCACCGACGTTGAACAGCACCTGCAGCTACTGACAGTTGGCGAACCTCTGATTGAGATGGTGAAGTCTGGCGAAGTGGCCGCAACCACAGCAGTAGCCCTGCAGCGTGAACATGGCGTTAAAGCCTCTTCCGTTGCGCAGGAGCAGATGCAGAAGGCAAAAGCGGCTGGCAAAAAGAAACTGACCCGCTCAGCAGCCATCGTATCACCCGTAAAACTGCGTGAAAAAATTCGTGCAGAGCACGCGGCATGGTCACAGGAAACTTTCGGCGACGTTGGGCCGGTCGGACCCCTGAAGCACCTGGCAAAAGAAGCGATGGAAGCAGCCGAAGCGCCCGATGACCTGTCTGAATGGGCTGACCTTCAATTTCTGCTGTGGGATGCGATGCGCCGTGCCGGTATCACTGAAGAAGAGCTTAATGCCGCGATGGAATTGAAGCTTAGCGTCAACAAGGCCCGTAACTGGCCCGAACCTAAAGACGGTGAGCCGCGTGAGCACCTGAAGGCTGATAGCCAGGAAGCGGTCCAGCCTGAAAAAGATTATGGTGATGAGCTGCCATTGCTGAAGCACGAAATCCTTGAGCAAAGCGGTGTTGAAGCGTGGGCCTGCGTTATTGCCGCGTTCAAAATGAAAGCTGAATACACCTACAGCGAATCCAAGTGGGCGCATACATGGGCGGCAGACTCCGTTGAGAATCCTACCTGTGTGACAGTTCCGGCAGAGACGATTGCCAGTGCAGTGCGCCTCATCAAGCAGCACCAGGACGATCTTGAACTGAAGCTGTGGTTGTCAGAGCAGCACGATGATTCAGAGGTGGCAACAGAGCAACTGATACGCTTCTCAGCAGTTTTGTCTGAAGTTCGCCAGGACAATCCATGCACAGTTCAGGAGTTTATCGCGCTGGTGGAGCAGACCAACCGTGATTGCTGGTCAAACATCCGCATGCTGCGTCAGGCGGTTCGTGAAGTGGCCGGGCAGATGACAATTCCGGGTATTGGCGAGGTGGCATTATGAGCTGGAGGAGGCTCAGTGGCGGGAGAAATCAGGTAATACTCACTGAATATTCTCTTGATGTGAAAGAGGGTGACTCCCGCGCTGTTTATCTTGTTCGGCATAACAGCAAAATCTGGAACACCACGCTTGAGCAGAACATCACTGTCGAGCGAGATAGCTATGGTGGCTTCAAACCGACCATTGCAATGCAAGATTTTCCACGAGGGCTCAGCGAACGTGAGTCGATGCTCAAACTGGCTGATTGGTTGCATCGTTTGGGCGTCTCAATTGAAGATCACTGGAGCAAGCCATGATTTCACATGAAGCGTTCATTGGTGCCATTTTCTTTGCTGCGTTTTATTTTTTCTTTGCCGGTATTGTGGCTGAGCACAGCAGCTCAAACCGTCACAAAGATATCAGCAGCCCAATCCCTTCCATCGCGAGGGGACTAGCCTGGCCATTGGCTTTAATTAAATATGCACTGCTGGTGATTTGGGGGTGGTTGTGAAATTAATCCTCCCGTTCCCGCCAAGCGTTAACACGTACTGGCGTAACACCAGAAAGGGAGTATTGATCAGCGCCTCCGGGCGCTGTTTCCGCTCCAATGCGCTTGCCGCCGTCATGGAGCAACTTAAACGCCGACCTCAGCCGATTACAGTGAACGTAGAGGTAAGCGTTCTGCTGTTCCCGCCAGACAAGCGCCAGCGTGACCTTGATAACTACCTCAAAGCCCTGTTTGACAGCCTCACGCATGCCGGTATATGGGGCGATGACAGGCAGATTAAGCGATTCACTGTAGAGTGGGGTCCGGTAACCAAAGGTGGTAAGTCTGAGGTGGTAATCAGTGAGTTTCAGCCGGTGGCGGCATAGGTCCGCAACTGGTTACATGACCAGTAAAATTGGATATAGTGCGTGGTGTACTAGCGAATTGCAGTCGCCGTATCAAGGTTGGTCCCGTTCATTTGCAGATGACGGGGCGGGGCCAGTTAAAAATAGTGCGTAAATCGTGTGTGGAGAGGTCAAAATGCTGAATCAATCAGCGGGTGCTATTGCGCCTGTAGTCAATGCTATTCAATCCCCAATCATGACCAGCCGTGAGATTGCCGAACTGACCGGCAAAGAACACAAAAATGTCACTGTAGATATTCGCCGTATGCTGGATGACCTGGGAGAAGATGCGCTGAAATTCCAGCGTATCTATCTCGACACCATGAACCGACAGCGAACTGAGTATCACCTCGACCGTGAGCACACCGAATGCCTCATCACCGGTTACAGTGCCATCCTTCGCATGAAAGTGATTAAGCGGCTGCATGAGTTAGAGGAAAGCCAGCCAGTTAAAATCCCGCGAACCTTTGCTGAGGCACTCCGCCTGGCCGCCGAAATGGAAGAGGAGAAGGATCGCCTGCAGCTGCAGCTTACTGAAGCCGCACCAAAGGTTGCGTTTGTGGATCGCTATGTCACGGCCACCAGTTCAATGACATTCCGCCAGGTGGCAAAACTTCTTGAGGCTAAAGAGCCAGAGCTTCGCCTGTTTCTGATTGAGAGTCGTGTTATGTACCGTCTTAATGGCGTCCTGACTCCCTACAGCCAGCACATTGAAGCCGGTCGGTTTGAAGTGAGAACCGGAACCACTACCGAATCAAATTATATGTTCAGTCAGTCCCGCTTCACTGCTAAGGGCGTTCAGTGGATTGGCGGGCTAGGGACGGCGTATAAAGCTGCTGGTGGTGCTGAGTGAGAGCGCTGCTTACACCCGAAATCGCGCCGCGCACAGGGATTGTGCTGCTCAAGCCGGGGCCGGAGCTTTTGAGGCTTTTCAAAGGTCGTGTTGTGATCAGCACACCGACAATGGATATGGCAGACCTGCCATCAGGGCGGCTGAATGACGGTACACAGCCGTTACTTGATGAGCCCTCACTGATTCCTTTCTTTAGTCACGAACGCGTGATAAAGGCCGCTGGTGGACCGAATGCGCTGGCATCCTTCGTCCAGTCCTTCGGGTGCTGCCAGTGGGAGCAGTTGGGAGTGTGGCATCACCATGAATTCACAGTGTCAGAAATCGAAAACGGCCTGGTGTCTCTTTGCTATAGCCACGATAATGAGTTCAGGGAAAACGGCGTACCCGGCAGCCTGGAGAATATCGCCAAAGGTAACACTGCACTCTGGATAATCAGGGCGGCATGCAGCCATATGGCGCTAAACGGTGACCACCAGCTGACGCTGCCTGAACTGTGCTGGTGGGCAACCCTGAATGATGTGATTGACCTGATACCAGAGGCACCGGCCCGGCGCGTTCTGCGTATGCCGAAAGAGACCATCCAGAGCGGCGAGCTTAAAGAAGCCCGCATTGTTCCGGCGCGACTGGCTCGCGAGGTGATTCAGGATGCTGCTCAGCTGGTCAAAAAGTTAATAGACCTGCGCACCGACCCGGAATCACCAGAATCATTCATGAAGCGCCCCAAGCGTAAGCGCTGGGAAAGTGAAAAGTACACACGATGGGTAAAGTCGCAGACTTGCGCATGTTGCGGCATACAGGCTGACGATCCTCATCACATCATCGGACACGGACAGGGGGGAATGGGAACGAAGGCGCATGATTTATTTGTGATACCGCTATGCAGAGCGCATCACGATGAACTGCACCGGGATATGAAAGCGTTTGAAGCAAAATATGGCAGTCAGATAGAGCTGCTATTCAGGTTCCTTGATTTCGCGATTGCAGTCGGCGTGATCGGGACAGACAAAAAATAAAGTGTGTGGAGAGGATCAAATATGCGTGACATGTCACAGGTATTAGAGCGCTGGGCGGGATGGGCTAAATCAGACAGCAGTGGCGTCGATTATTCTGCAATCGCAGCTGGGTTTAAAGGGCTGCTGCCACAGGACTCAAAGTTAACGCTTACCTGCAGCGATGGAGACGGACTGATTATTGAAGGGTGCCTGTCCCGGCTTAAAGCTAAGCGCCCGGATGAGCATGCGATCATTGTGCTGCATTACTTTTTCAATATCTCAAAGCGCACCCTGGCGAAACAGGCTAAACGCGATGAGAAGATAGTCAGAATTGAAATTCAGATGGCTGAAGGCTTTATTGAGGGGTGCCTGGCAATGCTGGATGTGCGGCTTGATATGGACGACGAACTGACGCCGAAAAAAATATTGAAAAAACCTCTCACGCGGTCCGCATTTTCCTTAGTAATCTGATAAGGTCGATTACCAAGCAGTGCAGCTTATCTGCTAAAAGTCAGTTCCAAATGTGGATGTCAAAGCGCCTCGGGCCTTACCAGCCTGGAGGCGTTTTTTATTTTAAATACCCCCTAAAGGGGATAGCGATATATCTATCCCTTGCAGGGGATAAGAAATTCACCCTGTTGTCGACGGGCAAGGCACTTACCGCATTTGCGTCAGGGTTATTTTTCAAAAATATCGGCCTCCGGTAAACAAAAATGTTGACCAAGTAAGCATAAATGTTTACTATAGCTTCATGTTCAACAGACAGGAGGAGTAGTGAAGCAAAGCGAGTTCAGGCGGTGGCTTGAATCTCAGGGAGTCGAAGTTTCAAACGGTACTAACCATCTGAAGCTGAGATACAACGGGAAGCGAAGTGTAATGCCGAGGCATCCCGGTGCTGAGTTAAAAGAACCACTGCGAAAGGCCATAATGAAGCAGTTAGGCCTGAAATAATTAACCAGCCCTCCGGGGCTGGTTCTCGCAGAGTTTCACTAAGACGATATGCGATACCCGATTAATCTTGAGCCGTGCGACGGCGGATATGTGGTTTCGTTCCCGGATATACCGGAGGCGCTTACTCAGGGCGATACGCGTGAGGAGGCGTTAGAGATGGGGCTGGATGCGCTGGTTACTTCATTTGATTTCTACTTTGAAGATAACCAGCCTGTTCCGGCACCTGGACCGGTGACAGGGGATTTTGTAGAGGTTCCGGCGAGTGTGTCGGCGAAGGTGCTATTGCTAAATGCTTTCCTTGCTTCCGGCTTAACTCAGGTTGAGCTGGCTTCACGCATGGGAGTTAAAAAGCAGGAGGTAACGCGCATCTTCGATCTGCATCACTCGACCAAAATAGATACTGTTCAGAAGGCGCTCTCAGCGCTGGGCAAGCGGCTTGAATTAGTCGCTGCCTGACAGGCATTAAATAATAAATTCAAAGGCTCACTTCGGTGGGCCTTTTTCGTTTTTGCGCACGCCAATCAGTCTCCACACACACTTTTGACGCCGTGGCGTTGCGCAATCCTCTCAATGAAAGTAAGCCGCCATCATCCCGGTGGCGGGAATCAGAGCATGCCTCCAGAAAAAGACCCGGGCTTTTGGGCCACAGTGCTGCTGTGGCTGTATGCCCATAAAACAGAATGGGGATATGCCGGGGTAGCAGGCATGTTTTCACTACTAAGAAGTGCATATGCAAAAAGCTCCTGGAGTAAGCGGGTGCTGGACGCCGTCTCCTGCAGTGCGCTGGCTTTCTTTGCAGCGCCCACGCTTCAGGTCGTCGGCGCTCTCTTCAACTGGAACATTCCTGACGCCGCTGCACAGGTTTTCGCGGTTTACATAGGGTATGTCGGCAATGACTACATCAGCGCCAGGCTGCGCGGGTGGATAGACAGAAAAGCAGGGGATACAAATGAAAGTCAGCAATAACGGCATCAACCTCATCAAGCGCTTTGAAGGTCTGGAGCTTAAGGCCTACAAAGACAGCGTTGGCATTCTGACTATCGGTTACGGCCATACTCACGCAGTTAAAGCAGGTGACGCGATCACCAGCGAACGTGCTGATGCTTATCTTCGTGAAGATTTGCAGGTGGCAGAGCTGACCATCAACACGAACGTGAAGGTTAAGCTCACTCAGGGTCAATTCGACGCGCTGGTGTCATTCGTGTTTAACCTCGGGTCTGGCAACTTTGTTAAATCGACACTGATCAGGAAACTCAACGCAGGCGACTATGCTGGCGCAGCTGATGAGTTCGGCAAATGGGTTAACGCCGGTGGTAAGAAGTTGCCCGGACTCGTTAAACGCCGTGCCGCTGAAAGAGAGGTATTTCTGACATGAACCCGTTAAGCCTCATCAAAACTTTTTCACCCGTTATCGTCATCGGTCTTATTTGCCTGGCTCTCTGGATGCTCAATGCTCGCAGCTCCCAGCTTGAGGCAACTAACCAGCGGCTGGAGAAATTAGCCAACAGCAAAGACGATCAGATTAACGACCTGCGCTCCAAAAACGATGGCCTGGCATCAAGTGTCACTGAGCTTGTAACAGCAGTTAAACAGCAAAACGAAGTGATGAGTCAGGTCACAGAGCAGCGTGCCGTAACAGCCCAGCAGAACCGGAAACTACAGAATGAAATTAAGCGTTACCTTGCGGCGGACAAGTGTGCTGTTGCTCCTGTTCCTCCAGATGCTGCTGACCGGCTGCGTGACGCAGCAAAAGCCGCTGGTGGAGTACCGGACAGTAAAACAGCCAAAGTTAAACCTTCCGGCTGAACTGACCAGTCAGATTGACGTGCCAGCGCCATCACAGGATATGACGTTCGGTGACAGCGTAAGCCTCAACGCTGAGTTATATGGCGCTCTGGGGCAGTGCAACATTGATCGCGCCGCCATCCGTAAAATTGAGTCAACCAGATAGGTAAAAACATGAGCGAAGCAAAACCACAGGACGGAAGCAAAGTGCAGGGTTATCGCACACTGACCGACAAAGATATTCTTGAAATGAATCGTCTTAAAGAGATTAGCCGCCAGTTCATCGCCCAGTTGGAATACTTGAAAGGTGCTAAGGACTACGATCCTCGATGGATTGCCCAGGCAAAAACAGCAATGCAGCATGCCTGCATGTTCGCTTGTCGCGCAGTTGCTCAGCCGGATGATAATTGCTGAGGCATCAAATGGCACATTAAAGAATGTGCCCGATGTATGCACTCAAATACCTAAACTAAGCAAATATGCTTTCAAGTCTTCTTTTTTCAGCCCGGTAATTTCTGCGAATCCTTGGCTCAATTCGTCGATTGTCTGTCCTACAGGCGACTGGCCGGATGAGGCCCGTGCGAAGTAAACACCGTGTTCATAGCCAATATCGTAAATGATTCCGCTAATCTCGACTCTTATCATTACTTGACCTCCTAATTAGTGAATAAACAGGTTAGCTAGCAAATTTAGTAAAGCAATGAAAATCCGCCGACAAGGGATAACGGTTAGCCACGCTGTGAAGCGTTGCGACACTGGTACAAAAAAACCGGCCTTTCGACCGGTTTACATTTTGATTAGCGCTTGCTGTCTGGCGTGCGCTTGACCTGCTCCCAAGTTGAACCTGACTTACTGGTTGGTGGAGCTGTGTGATTGTCAGGAATAGTAGTGTAGTTATCGGTTTTACCACCACGAGGGCCAATCTGTTGGTATACACCGCCATCTTTACCACTGTTCTGACCAGGCTTTAAAGACATAAAACCCCCTTAAGGGAATTGCCACAAGATTGAGGCCATAAGATTTTATTAATGAAACCTTATATTTCAAATTGAAATAAATTCATACATTGAGTAATCAATATAAATCAGGTGATTGGATGAAAGTGTGCATTGATGACGTTGAATACGCGCCCATAACTGAACGCGTATCGAATATCGGCATCGCCATCAGCACCCATAATCGTCATGACGTTTTATCCCGCGCTCTTGAGCATCAGGTGAAGTTTCTGCCAGCCGGTGCGCTGGTGGTCGTTATTGATGACGGTTCAGACAAACTAGTGACAGCGCCGGAAGGTGTCCGGGTTATTCGGCATGACACCTCACGCGGCATCGTTGCAGCTAAGAACGCCAGCCTTGAGGCGCTGATTGATGCCGGTTGTGAGCATCTCTTCCTGTGGGATGATGATGCCTGGCCGGTTGCTGGTGGGTGGGAGCAGCCTTACATCACATCGCCTGAGCCACATCTGGCTTATCAGTTTCAGGACTTCGCTACCGGACAGAAGCTGAATGATATCGCGGTACTCTACCGGGATGATCATCACGTCGCCTACACCGGCCAGCGTGGTGTGATGCTTTATTACCACCGCAGCGCGATTGAGAAGGTTGGCGGTTTCGATACGGTCTACCAGCGCGGCATGTATGAGCATTCTGATTTAGCGCTGCGCATCCACAATGCCGGGTTAACGAGCTGGGCGTTTGCTGATGTGACTGGCTCAGGCAAGCTGATTTATTCGCTTGATGAGCATCAGGCCGTAGAACGTTCAGTACCAAAGCCTGACCGTGAAGCGCAGGTGAAGCGCAACGTCACGATTCACAATGAACGCCGTAACGGCGGTTACACCGGTTATGCAGAGTACCGCCAGCAGCGCAACGTGGTCATCACCACACTGCTGACCAGCCAGCCCGACCCACAGCGCGACACCAGAATGACGGCATCGCCTGACATGCTGAATAAATGGGCGGTATCGGTCAAAGGTGGTGATGCGGTCGTGCTGGCCGATGAGCTGACAACCTCACCTGTAGGCGCGTCGCTGGTAGCTGTCGCTGATGTGGAGATGAATGTCTACTTCCGGCGCTGGCTGCATATCTGGCAGCACCTCCGAGATCATCCTGAATATCACTTCGTCTGGTGTACTGATGGTACTGACGTTGAGATGTTGCGCGAACCGTGGCAAGGGATGGAGCAGGGCAAAATTTACGTTGGTTCCGAACCTAAAACCTATGCAGCCGCATGGGCTAAGCAGCAGCATCCTGAAGGTGTCTATCAGGCGTTTCTCGCTGAGCACCAGAATAATGTGATGCTGAATGCCGGTCTGCTTGGTGGCGCTCGTGCTGACGTGATGTCAATAGCTCATGGTATCGTCCGCCTCTATTACCACATCGAATCACTGCGCTTCTGGAATCAGGAGGTCAAAGCTGCGGCTGTTGGCGACATGATTGCCTTTGGCATTGTGGCCCATCGCTACAGTGACAGGCTGGTTACTGGTCCCCGCGTGCACACAGTGTTCAAGTCAGAAGGCATCGGTAAGGAGTTCGCCTGGTGGAAGCACAAATAAGTTTTGTGGTAGTAGGGCATTACTCACGCAGGCATCAGGCAGAACGCCTGGCACAGATTCTTAACGCTCACCTTCTTATTGATGAGGATCATCATGGCGCGAACTGGAACCATCGCCGCGCTATTGAGTGGGCCAGTCAGCAGGACTGCCGGGTAGTGATACTGGAAGACGACGCACTGCCGGTACATGGCTTCGCGCAAAAGGTGGCTGAGTGGCTGTCGCGCTTCCCTGAAGATCTGCTGAGCTTCTATCTCGGTACCGGCAGACCGCCGCAGTATCAGCCCGAGATAGCGACAAAGCTTATTGATACAGACCGGCAACAGGCAGATTACATCACACTCAACCGGCTGATTCATGGCGTCTGCTACAGCGTCCCTCAGCCAAAACTTAATCAGGTTATCAGTCGCTGGAATCATGGCTCGCCTGCTGATTACGCGGTGGGTGACGCATGCGGCGGTGCAGTGGTGTACCCATGTTACTCACTGGTTGATCACGCAGACGCGGCGACAGTTGAGCGCCATCCCGACAACACACCACGAACTGAGCGCCGCAGGGCGTGGAGACTGGATGCCGCAACGAATCCCGAGAGCGTGCCGTAAGCATGGCTGCGCTAAGACAACCACAGACCGCTCTGGCTATTGCGCTGATCACCTTAATGAAGGGTGGCAGCAGCATCATAACGGACTGAGCAGACACCAGCGCGGTTATGGCAGTCAATGGGATGTCAGACGCGCACGAGTTCTTGAGCGTGATCGGCACCTCTGTCAGGAGTGCCTGCGAAAGGGCAGGCCCACAGCGGCAAAGACGGTTGACCACATCACCCCGAAAGCACATGGGGGTACCGATGACGACAGCAATCTTCAGGCTTTATGCTGGCCGTGCCACAAGGCAAAAACAGCCAGAGACAGAATCAAACGATAATGATTATCGTTTCATTCTGTCAGGGTGCACCGTTTTGGTGCGTACACATCACAAATGAGAATGACTATCATTTGAGTGATTTTTGAAGGGGGAGGGCGGGTCGAAAGTTCCCCCCTTTCGCCTTTCAGGACCGCCGCCTAACCCTTTTTCACACCGCCGCAGGTTAGAAAACTTTTTTTGGGGTCCCCCAACCAGTTATTAATAGGAGTTTTCGATTATGCCAGGACCACCGAAAACCCCGACACATCTGGCTTTGGTGAAGGGGAACCCATCAAAACGGGCAGTAAACAAAAAAGAGCCAAAACCGCCTTCTGGGGTACCCCCAATTCCGAAGCATTTGGACAAAATGGGGAAGTACTGGTTCAAGCGAATCGGCGAAGAGCTTGATGCAGTCGGAGTGATGACCACTCTCGACGGTAAAGCTCTTGAGTTGCTGATCGAGGCTTACACCGAGTACCGGCAACACTGCGATGTTCTTACTGAAGAGGGCTACACCTACAAAACGGTGTCCGCTACGGGTGAGAATATTGTTAAAGCACATCCGGCAGCAGTGATGAAGTCCGATGCGTGGAAGCGCATCCGGGCGATGCTCTCTGAATTTGGCATGACCCCGGCCAGCCGCTCCAAGGTTGGCGCATCCGGGCCAGCCGAAGCCGATCCTCTGGAAGAGTTTCTTAAAAAGCGCAAATGATGAATGGCAACTGTTCAGGCTGGTATTCAGTACGCAGAAAGCGTGCTGGCTGGCGAGATCGTTGCTGGCGAACTGGTGCGTCTGGCGTGCCAGAGATTCCTCAATGATTTAGAACACGGGCCGGAACGCGGTATCTACTTCAGTGAGGACCGCGCCCAGCACATTCTCGACTTCTATAACTTCGTTCCGCACGTAAAAGGTGCCCTGGCAGGTAAGCCTATTGAGCTTATGCCCTGGCACATCTTCATCCTGATAAACCTTTTTGGTTTCACCATTCCGCTGATTGATGAAATGAGCGGCAAACAGGTTATGGATGATGATGGCGATCCGGTCATGGTTCGCCGGTTCCGTACCGCTTATAACGAAGTGGCACGTAAAAACGCCAAATCAACTGTTTCATCGGGTATCGGGCTGTATATGACCGGGGCTGATGGTGAGGGGGGCGCAGAGGTTTACTCAGCGGCCACAACCCGCGATCAGGCCCGTATCGTATTTGATGATGCCAAGAACATGATCAAGAAAGCGCCCCGCACCTTAGGTCGTCTATTTGGTCACGTTAAGCTGAACATCCACCAGGAGCGTTCAGCATCAAAATTTGAGCCCCTGTCCAGTGACGCTAACAACCTCGACGGCCTGAATATCCATTGCGGCATAGTAGATGAGCTGCATGCCCACCGTACACGCGATGTCTGGGATGTGCTTGAAACCGCCACGGGTGCGCGACTTCAGTCTCTGTTGTTCGCCATCACTACAGCAGGGTCCAATAAAGAAGGCATCTGCTTTGAACAGCGCGACTACGCCATCAAGGTGTTACGCGGCGTGGTGGAAGATGACACTTACTTTGCTGTCGTTTACACCCTGGATGAAGAAGACGATCCGTTTGATGAGGCTAACTGGCCTAAGGCTAACCCCGGTCTCGGCGTCTGCAAGCGCTGGGACGATATGCGCCGCCTCGCTAAGAAGGCAAAAGAGCAGGTTGCTGCAAGGCCTAACTTTTTCACGAAGCATCTGAACATCTGGGTTACAGCCGAAAGCGCCTGGATGGATATGGATCGCTGGTCGAAGATGTTGCCAACTGCCGAGGAGGCTAAAAGAAAAGGCTGGCCTCTCTGGGTTGGTGTTGACCTGGCTAACAAAATCGACATCTGTTCCGCTGTAAAAACATGGCGTGATCCGACAGGCGAAACGCACATGGAACCCCGGTTCTGGTTGCCTGAGGGGCGTATCGAAACAGCGCCTAATCATATTGCAGAGCTCTATCGTAAATGGGCTGATGCAGGACACCTTGAGCTTACGGACGGAGATGTAATCGATCATGGTGTCATTAAGGCTGAAATAGTGGAGTGGGTTAAGGGCGAGAATATAAAAGAAATCGCATTCGACCCATGGAGCGCATTGCAGTTCAGCCTGGCGCTGGCTGAAGAGGGCCTGCCGCTGGTTGAGGTGCCTCAGACGGTTAAAAACCTGTCTGAATCCATGAAGTCTGTTCAGGCGGAGATATATGGCAACAAGTTCCATCATGACGGCAACCCCGTTATGACCTGGATGATGTCGAATATCACCGTCAAGCCTGACAAAAACGACAATATCTTTCCGAACAAATCCACGCCGGAAAACAAAATTGACGGGCCGGTTGCGTTATTCACAGCCAAAAGCCGACTTCTGGTTAATGGCGGTGGTGATGTGCAGGACCTGAGCGGCTTCTTTGAAAACCCGATAATGATAGGTTTCTGATGAAAAAAAATAAGCAGCCCGGCAAAGTTAAAAGCGCTTTGCTCAACTGGCTCGGGGTCCCCATTAGCCTGACGACCGGCACGTTCTGGGAGGAATGGTGGGGGAAAAGCAGCAGCGGCAAGACGGTTTCCGCAGATAAGGCAATGCGATTATCGGCCGTCTGGGCATGTACCCGCCTGCTGAGCGAGTCAGTTTCAACGCTTCCACTCAAGATTTACCAGCGCCAACCTGATGGATCGCGTGTGCTGGCGCTGGATAATCCGGTTTATCAGGTGCTCTGCCGCCGTCCGAATCTTGAAATGACGCCTTCGCGCTTCATGCTGTCGGTAGTGGCGTCAGTCTGCCTTCGCGGTAATGCATTCATCGAAAAAAAGATGATTGGTAAAAAGCTGGTGGCGCTGGTTCCGCTTCTTCCGCAGAACATGGTTGTTAAGCGCTTGGATAATGGCAGTCTTCAGTACACTTACACGGAAGTTAAGTCGAAACGCGAAATCCCGGTTCATAACATCATGCACATCCGGGGATTTGGTCTGGACGGTGTATGCGGAATGATGCCGATGATGACCGGTCGTGATGTTATCGGCGCGGCAATGTCGGTTGAAGAGTCAGCAGCAAAAATTTTTGAAAACGGCCTGCAGAGCTCAGGCTTTCTTTCCTCCGATGTCGCCATGGACGACAAGCAGCGTGAAAGGCTGCGGGGCTATCTTGAGCGCTTTATCGGATCGAAAAACGCCGGAAAGGTAATGGTCCTTGAGGCAGGGATGAAGTATCAGGGCGTCACCATTAACCCTGAGGCTGCTCAGATGCTGGAATCGCGGTCATTCAGTATTGAGGAAATCTGCCGATGGTTCCGCGTGCCACCCTTTATGGTTGGTCATACCACTAAACAAAGCAGCTGGGCATCAAGCGTTGAGGGAATGAATCTGCTGTTTCTGACTAACACCCTGCGTCCGCTGCTGGTTAATATTGAGCAGGAAATATCACGCTGTCTGCTTGATGGCAGCGACGATGTATTTGCTGAGTTTTCAGTTGAAGGGCTGCTTCGTGCAGACACCGCAGGCCGATCAGCTTACTACACCACAGCCCTTCAGAACGGGTGGATGTCGCGTAACGATGTGCGCAGGCTGGAAAATCTGCCGCCGATTGCGGGTGGTGACATCTATACAGTTCAGCTGAATCTCACACCACTGGATCAGCTCCGTGAGAACAACGCTGGCGCACAGGCCAGTAACATGATGAAGCTCCACGCTTTCCTTTTCCCGGACATTCCACCAGAACACTCACCGCTTAAAAAAGCGGCTTAGGAGACCCCTGATGACCATTAAAACGCTTCCGGTTGCACCGGAGGGGCGTCCTTTTGCACGTCAGAATACTGAGCTTCCCTCTGCTGCATTTGAGCGCTGGGACGGCGGAATTCGTGCGGCGGGCCAGTCTGGTGACAACACGATCTCCATTCTGGACACCATCGGTGAGGACTGGTACGGCGAAGGCGTAACAGCCAGCCGGATTTCCGGCGCACTGAGAAGCATTGGTGGCGGTGATGTGACGGTGAATATCAACTCGCCTGGCGGTGACATGTGGGAAGGGCTGGCAATTTATAACCTTCTTGTTGCCTACGAAGGCAAAGTGACCGTCAAGATTCTGGGCATTGCCGCCTCAGCAGCGTCAATTATAGCGATGGCCGGTGATGATATTCAGATGGGGCGCGGGGCGTTCCTGATGATCCACAACTGCTGGACCATTGCCGCAGGTAACCGCAACGACTTCCGCGACTATGCGGATTCGCTGGAGCCGTTTGATAAGGCAATGGCCGATATCTATGCCGCACGCTCAGGGCTAAAGCTAAGCGAGGTGCAGACCCTGATGGATAACGAATCATTTATCTCCGGCAGCGAGGCTGTAGAAAAAGGCTTTGCTGACTCTCTGCTCTCCGCAGATGAAATCACCAGCGACGATGAAAGCCCCGCCGCCGCACTCAGAAAAATTGATGCTTTCCTGGCTAAAGGCGGTATGCCCCGTTCCGAGCGCCGGAAGCACCTCAAGGCTTTAGGTGGCAAGCCGGGCGCTGCCACCGAAAAGAACGACAAGCCGGGCGCTGTCGATGAAATAAACCCTGAAGCACTTAACTCCCTCAAAAACGCGCTGGCTTCGCTCGGCGAATAAGGAAAACGCATGTCTGATGTAAATGATCTGTTGACGAAAGTCTCCAACAAGCTGGAAAAAGTGTCTGCTGAGTTCAGCGAGAAAGCTGAAAAGGCGCTGAATGAAGCGAAAAATTCCGGCCAGCTTTCAACCGAAACCAAAGCGGCAGTAGATAAAATCGCGACTGAGCATAATGCGCTCAATGAGGCGATGAAGACCCTCAAAACCTCACTGGGTGATCTGGAGCAGCACGTTGCCGCTCAGATGCCGTTGAATGCTGCGCAGGAAGTGATCCAGTCTGTGGGCCAGCAGTTTGTATCTGCCGAGGTGATGAAAGATATCCGCTCAAGCCTCGAAGGTAATAAGCGTATTTCGGTGCCTGTGAAAGCCGCGCTTACCACTGTTGATGTGCCGGGTCAGATTGTGGCACCTCAGCGCCTTCCCGGCATTGACACTGCGCCTAAGCAGCGGCTGTTTATCCGTGACCTGATCGCACCGGGCCGCACGCAGTCCAACACAATTTACTACGTTCAGCAGACGGGCTTCACCAATAAGGCTTCAGTGGTGCCGGAAAATACCACCAAGCCATACAGCGATATTCAGTTTGCTGAGAAGACCACTGCCGTGCGCACGATCGCTCACATGTTTAAGGCTTCAAAGCAGATTCTGGATGACTTTGCTCAGCTGCAGTCGACCGTTGATGCCGAAATGCGTTACGGCCTGTCTTACGTCGAAGAGCAGGAAATCCTGTTCGGTAACGGTGAAGGCGCACATCTGGCGGGCATCATTCCTCAGGCCAAGCCATTCAGCGCGGCGTTTGCTGTTCAGAATGAAACGGGGATCGACATTCTTCGTCTGGCCATGCTGCAGGCGCAGCTTGCCCGCTTCCCGGCGTCAGGTCATGTGCTGCATTTCACAGATTGGGCAAAGATCGAGCTGAGCAAAGATACGCTGGGACGTTACATCCTGGCTAATCCTTCACAGCTGACCACGCCTACCCTGTGGGGTCTGCCGGTCGTGGCAACCGAAGCCGCTCAGTTCCTGGGTAAATTCCTGACGGGTGCGTTTAATTCCGGTGCACAGATTTTTGACCGCGAAGAAGCAAACGTTGTGGTTTCCAGCGAAAACTCCGACGACTTTGAGAAAAACATGATCTCAATCCGTTGTGAAGAGCGTCTGGCGCTGGCTGTATACCGTCCTGAAGCGTTTGTGTATGGCTCTCTGACTGGTTCAGGCAGCTGATCATTAAAGCGGCTTTCGGGCCGCTTTTAAGGGATTCTCATCATGATCATCGCAATTGAAACAGTCAGGGAGCACTGCCGCATTGATGCTGACGATAGCAGTGAAGATTCGCTCTTGATGATCTACATCGGAGCGGCAAAGCGGCACATTGAGAAATGGACTCGTCGAAATCTTTATGAAACCAATGCTGATGCCGGGTTTGATACCGATGAGGACCGTCTTCTGCTTGATGATGATATTCGCCTGGTCATATTGCTATTGGTCGGCCACTGGTATGCAAATCGCGAAGCGGTCAGCGACAAAAATACCAGCGAAATGCCTCTCGCGGTAGATGCGCTCCTTCAGCCTTACAGGATTTACGGGCTATGACCGGCCTTGCGGCTGGCGAGCTTGATAAGCGCATTAAGGTCCAGCGCACCGAATCTGAGCGCGGTCCGCTTGGAGAGGTGTTGCCGGGGCAGATTGTCATCAGCTCTCCCTTTATCTGGGCGAAAGCTGAAAACATTTCAAACCGCAAAATACGCAGCATGGATCAGCAACAGATTGTTGAAACCTGGCAATTTACCATCCGGCCACGCAGCGATGTTCAGACGGACTGGAAAATAAGCTGGGGTAATGAGGTTTACACAATCAGGGCTGTTGACCGCAGCAGCCGAGATCGTGCTGTTATTACAGCTGAAAGGGATGTGCGTCATGATTGAATCAGGCATCTATAAATCCCTTCAGTCCCTTTCTGAACTGGAAGTCTATCCACTTCTGATCCCGGATACTGAGCAGCATGGGATCACTTACCAGCGTATTTCTGACCCCGAGATTGAAGGCGGTCTGGTCAGAACATCGCTGGTGGCCGGCCGGTTCCAGATTTCCTTTGTGAAAGTCTCTGACTACTCAGGCCTGCTGGCGCTTGATGCTCAACTCTGGCAGATGTGGAAGGGCATCAGGCATGGGGATATCGGCGGCTATCCGGTTCAGTACGTTGAGCGCGGTACGCTGCAGCAGGATAAATCCACGCTGCCAAACAACGCAGTGCAGTACCGCCTGACCAGAGACTTCATCATCTACTTCAGTGAGGTATGAGCGTGCTGAGAATGGAAGTTACCGGCCTTGATGAGCTTGAGCGTCAGCTTATTGCGCTGGGTGAAAAGGCCGGAACAAAGGTTCTGCGCGAGGCGGGCCGTGCCGCGCTGCAGGTGGTTGAGCAGGATATGAAAGAGCATGCAGGCTACGACGAGTCGGCGAAAGGCCCGCATATGCGTGACTCAATCAAAATACGCTCGACAACGCGCACCAGAGGTAATGCCGTTGTGGTGCTTCGCGTCGGTCCCAGCAAACAGCATTATATCAAGGCGCTTGCTCAGGAGTTCGGCACGGTCAAACAGGTTCCCGATCCCTTCATTCGGCCTGCGCTGGATTACAACAAATCCCGCGTTCTACGAATCCTCGCGGTAGAAATACGGGACCGCATTCAAAACAACGTGTAGCAGCCGCTACCAACTTCATAGAGAGAAAAAGTCATGGCTGATAAAACTTCGCCAGAATACGCGATGCTGCCTGCAGGAACCGTAGTGAAATGGGGGCCATCCGGTGCCGCTGTCTCAGCAATGAAGCCGCTGATCAACTGTAAGGCGCTCGGTGCTACCGGGCAGACCGGCAGCTTTGTGGACTGCACCACGCTGATTGATAAGAGCAAGCAGTTCATTTCAGACCTGCCGGAAGGTCCGGAGAAATCCCTGGGCTTTATCGACGATCCGTCTAACACCGATTTTGCCGCTTTCCTGAATGCCGCGCAGAACCGCCAGACTGTCCAGTTTTACGTAGAGCTGCCGAACGGCCGCACCGCCAACATGGTGCTGGCGCTGTCCGGCTGGCAGATGAATGAAATCACCGCGCCAGCAAGCGAAGTGATCCAGATTACCGTTCAGGGCAAGCAGAACAACATTGAGTGGGGCGTTGTCGCAGGCTCCTGATTTACACAGTCACGCCGCCTGGTGGCGGCTTTTACTACCTAACGGGATCAAATAATGTCCGAGAAAAAATTCAGTGCGGCCATGTTAAAGTCAGTTCTGCTGCAGCCAAAGTCTACAGCCATCAAAACAGAGTTGCTGGGCGCTCAGGTTTACATCCGCCGCCGCACTGCCGGTGAGCTGATCCGCTACGAAGAGGAGCTGGACGCAGCACAGGCAACCGGCAATGTCCGCGCAATCTCTGAGATGAGCGTGCAGCTGGTGCTCGACAGTCTGGTTAATCCTGACGGCTCAGCCATCAAGCCTGAACTGCTTCCTACCGCAGCGGAACTGCTGGAGGCTCACGATAACCCGGCGCTGATGGCGGCGATTGAGCGCGTGAAAACGCATGCCATCGGTAAACTGGAAGTTGCCGAAAAAAACTGACCAGCTCGCCATGGCTGCAGCTGATTTTGTGGCTGGCGGACAGGTGGGGCGAACCTGACCCGTCCGTTATAGCCGCATTACCGTGCGACACGCTAAACCACTGGCGAGCTTACTTTCTTCAGCAGGGCATCCTGACCCGCTCCGAACCGCAATCCGCGCAATCCCCACACGACACCAGGCCGAACACGGCCACGCATAGCGTGGATCAGCAGTGTGACGCCGTAATGAGGGCGTTAATGTAATGGCTGACGTAGCATCGCTGGCGGTAGGGTTACACCTCAACGCTGCAAATTTTAAAAGCCAGCTCGTCAGTGCGTATGGCGATGCCGGTAAGCAGTCGCGCCAGTTCAACAGACAGGCGCAGGACGACGCAAAAAAAACGGAAGAGGCTTACGGTCGCGTTAATGCTGCCGTTCGCGGACTGGCCGGGCGGATCGCCGGGCTGGCTGGTGTAGGGCTGTCACTCGGTACCATTATCCAGACGTCCCGCCAGTACTCTCAGGCGCTGTCTGACCTGTCATCCATTACCGGCGCGACCGGCAACAAGCTGCGCGATCTGGATGCGGCAGCGCAGCAGATGGGGCGCACTACCGAGTACAGCGCCAGCCAGGCTGTTGAGGCGCTGAAGCTGATGGCATCAGCCAAGCCCGAACTGCTTGATACGGCTGACGGGCTGCAAAAGGCGACCAACAGCGCACTGCTGCTGGCACAGGCGGGCGGCAGTACACTGCCCGACGCCACAAGAACGCTGGCCCTGTCACTTAATCAGTTCGGTGCCGGTGCTGAACAGGCCGACCGTTATATTAACGTCCTGGCAGCTGGTGCAAAATTTGGCGCTTCCGAGATTAACGATACCGCCGCCGCGATTAAAAATGGAGGTGTGGCCGCTGCGCAGGCCGGTATAGGGTTTGAGACGCTGAATGCTGCCATTCAGGTGCTGGCGTCGCGTGAAATTAAAGGGGGTGAGGCTGGCACGGCGCTGCGCAACATCATCCTCAGTCTTGAAAAAGGCACAGATAAAACGCTCAAGCCATCTGTTGTGGGGCTCAGCAAGGCGCTGGAAAATCTGGCCGGGAAAAACCTTTCAACCGCGCAGGCCGTAAAACTGTTTGGCGTTGAGAACATCAACGCCGCGTCTATTCTGACGGGCAATCGCGGCAAAATTGATGAGCTGACCAAATCCCTCACCGGCACGCAGACGGCGCATGAGCAGGCAGCGATCAGGGTTAACAATCTGAACGGCGACCTGATGGGGCTGACCAGCGCCTTTGAAGGGCTGATTATCAAAGTAGGCCAGTCTGGCAGTGGTCCTCTGCGCTCAGGCATTCAGGTCATTTCCGAAGGCATCAATAAGCTTTCAGATAATTTTAATGCCATTGCCTCCGTCGCGCTTTATACCCTTATTCCCGTTCTCTCCACCAAGCTCACAGCAGGGCTTAGGGAAAACATTTCAGGCTGGGCGGCGAACGAAGCCGCAGTGCGGAAAAATGCGCTGCAGCAGGCTGAGACGGCTAAGCAGACCATTGCCGCCGCGCAGGCAACGCGTCAGCAGGCGCAGGAAGAAGCCCGCTATCTGGGGACGCGCACAGCGGCAAACGCCGCAGCGGGTATCAATGTCGGCTATCAGAAAGAGCAGGTTGCGCTGAGCCGTACTATCCGTGAATCAAGAATTGCTGAGACAGCGGCAACAGAACGGCTGGCGACGGCCAATGCACAGCTATCTGTCACGGCCCGCGCTGCCTCTGTTGCATCCGGCCTGGCGCGTGGAGCGCTGTCGCTCATTGGTGGCCCGGTTGGTGCTGCGATGCTTGCCGGTTCGGCGCTGCTTTATTTCCATGAGCAGGCAAAACAGGCTCGCCAGTCTGCTATTGACCTCAAGGGCGCGGTCATCGAAACCACCGCCGCGCTGATGCAGATGTCAGACAAACAGCTTTCCGTAAAGCAGCTTGACCTGCAGGACCAGTATGAAAATCAGGTCACGCAGAGAAACCAGCTGATTAAGGAAATTCAGGACGCTGATAGCCGCATCGACAGCCTGAAGGGGTTTGATCCCTTCGGTCAGCTTGCCGGTGTTGAGAAAGGGAAGACCCGTGCAGAAGCCGACCTTGAATCTGTAAACGGTGGACTGAAAACGCTTAAAGACAACATGGAGAACGTCGATAAAGCGCGGTTCCTGGTAAAAACGGGCATTGCTGATTCGGCTAAGAATCTTAAGAGCGACGTTCAGGCTGCAACAGCTGCAGCCGCTGAAGCCGGTAAAGTTGCATCGCCCTGGGGCGGAGAGGACCCGGCTAAGGCTGATAAGAAAGGCGCTCAGGCGCTGAAGCAGTTTACTGCGTTACGTAACGAGATTGAGCAGGCGAACGCCTCAAGCCTGGAGAAAATCAATCTTCAGGAAAAGGTCTCGCAGGAAAAAATTCTGAAGGATGCCAAAGCTGCTGGTGTGAGTCAGTCGGAGGTGCAACGCGTACTGACCCTGAATACGACTAATTATCAGCGCCAGCGTCAGGAACTGGCCGAGCAGTACTCACCGGCTAAAGCCATTATCCGTCAGGAGTCGGAAGCCAGCCGCAACCTGAAAGAACTGTATGACGCCCGTCTGGTCACTGAGCAGGAGTACCAGTCAGCCCGAATTACGCTGGCAAACGATTCTGCTCAGAAGATGATTCAGACGCAGGCCAGCCAGGCGGCTGCGCCAAAGCTCAACATAGCCGGAGAAGTTGATCCGGTTGCGCAGCTTCAGAATCAGCTGGTGCAGCAGCAGAGCCTTTATACCGCTTACTATGAAAATAGCAGGCTGAATAAGGATCAGTACGAAGCGCTGATGCAGAAGTCATCACGGGATTCGGCGGATGCTCAGTATCAGGCTGCGCTTAATCTGTATGCCGGGCAGAGCACGCTGAATAAAGGGATCGTGAGCCTGGCGGAAACGGCGGCGGAGAGAACGACTAACTCCCTGACCGGTTTGCTTACCGGCACGCAGTCTTTCCGGGAAAGCCTTTCAAACCTGTTCGCCTCGCTGGCGCAAAGCGTCATCAAAAGTCTGGTTGAAATGACTGCTCAGGCATTGCTGACTAAAACAGTGCTGTCATCCTTCATGAGTTTTGGGGGGGCAGCGGTCGGCGCAGTCGGTACCGGTGCAGCCGCATCGGCTGGCAGTACCGGTGCCATGGGTATGAGTACCAGCTTTCAAGCATACGACGGCGGTGGATTTACCGGGACCGGTGGCAAATATGATCCGGCTGGTGTGGTTCATAAAGGTGAGTTCGTCTTTACCAAAGAGGCCACCGAGCGGATTGGCGTTGAAAACCTGTACGGGATGATGCGCGGATACGCCAGCGGCGGGCTGGTCGACACCCCCACTGAGCGACCAGCTGCGCTGCCCGGCAGTGGTCGTTCGGGTGGCAATACCATTATTCAGGTAGATGCGCCGGTCACGATTATGCAGGAAGGCGGGGCCGGTGACGCATCCGCTACTGGAACCTCTGCCGTAGCCTCACAGCTCAAATCTATCGTTCAGCAGACCATAACGGACAGGCTGAGGAAGGAAATTTCACCGGGTGGCATACTCTATAGCGGTCGGAGTTGATTATGGCGACAGATACATTTACCTGGGCAACGCGCATTCAGGCGAGTGAGCAGCTCAGCGTTTCCACCATTCAGGCTCAATACGGCGATGGTTACAAACAGGTTGCCGGGAAAGGGATTAACGATGCCGCTGAAAGCTGGTCGCTGAGCTGTAACGGTCAGGTGGACGTTATGGCTTCCGTGCGTTCGTTCCTGAAAACACACGTCGCCACTTCTTTCTGGTGGACAAATCCATGGGGTGAGAAAAAGCTTTATCGCGTTAAAGGGGATTCGATTAATCCAAAGTTTATCAATGGCGGGTTTGTTGAAATCAGTTTTACCTTTGAACAGGCTTTCGCACCGTGACATGTCACGGTAACAACAGGGCGCTCAGCGCCCTTTTTTATTGGGTGAAAAATGAGTTTTAACCAGGACATTCAGGCGCTGGAGCCGGGGAGTCTGGTCCAGCTGATAGAGATTGACGGCACAGCTTTCGGGCTGGATACCGTACTGCGCTTCCATGCGTACAACCTGCCGACCGAAGGCTGGCAGTCGTTTGCAGCGGAAAACCTGCCGTCAATCATCTGGCAGGGCAATGAGTACGATCCGCACCCCTATGAGCTGACCGGCATGGAAATGAGCAGCACCGGTTCACAGCCGACGCCAAAGCTTTCTGTCGGCAACGTGGGCAACTATGTGACCGCGCTCTGCCTGCAGTTTGACGACATGGTGAAGGCGAAGGTGCGTATCCACACCACGCTGGCAAAGTATCTTGACGCGGCAAACTGGACGGCGGGCAACCCCAGCGCAAACCCGCAGGAGGAACGCGTTCAGCTGTTTTATGTGAATGCGAAAACCTCCGAAACTCGTGCTCAGGTGGATTTTGAACTCTGCTCTCCGTTTGATATCCAGAGCCTGCAGCTGCCATCGCGCCAGATAACGCCGGTCTGCACCTGGTGCATGCGTGGCTGGTATCGCACCGGCACAGGGTGCGATTACGCAGGCAGCCGGTACTTTACCAAGGACGGCACAGCAACCAGCGACCCGTCAAAAGATGTCTGCGGTGGGCGTATGGCTGACTGCAAAGCACGCTTTGGCGATGACCAGCCACTGCCGTTCGGCGGCTTCCCGGCTGCAAACCTGCAGGGTAAATAACGATGCGCAAAAAGATTCTTGAGGCGATACGCGAGCACGTGGCCGCCGAATACCCTAAAGAGGCATGCGGACTGGTCATCCAGTCAGGCCGGAGCCAGAACTATATCCCCTGCCGGAATATCGCTGACGCGCCGACAGAGCACTTCACGCTGTCGCCGGAGGATAAGCGGGAAGCGGAAGCGCAGGGCGACATCCTGATGGTTATCCACTCACACCCGGACGTGCCGCAGCTCATCCCGTCAGAACATGACAGGGTGCAGTGCGACTTTTCCGGCGTGGAGTGGGGGATCATGTCGTGGCCGGATGGCGACTTCTGCACTATCAGCCCGCGTACCGACCGCGACTATACAGGCCGCCCCTGGCTGATTGGCGGTAATGACTGCTGGACACTCATCATGGACTACTACCAGCGTGAGCACGGCATCACCCTGAAAAACTGGTCTGTTGATTATGAGTGGTGGGTGGGCGGCAAAGAAAATCTGTATGACGACAACTGGCAGGCTGAGGGGTTTGTGGAGGTTGAGCCTGCGGAGATGCGTGAAGGCGACATGATCATGATGCGCATCAGCGCCCCGGTAACGAACCACGCGGCTATCTACCTCGGCAACAACATTATTCTGCACCATAACGCCGGGAGCCTGTCGACGCGGGTACCCTATGGCGAGTACTGGCGAAACCGTACCGTGCGCGTCGTGCGCAGAAAGGAGCTGATTGATGCTTAAAACCATGCGACTTAGAGGCCGGATGGCAAAAATGTTTGGTCAGGTGCATCAGTTTCACGTTGCTGATTTGCGCGAGCTGCTGCGTGCTATGTGCTCACAGGTACCCGGATTTAAAAAATACGTTTCGAATGCGCATCTCAATGGCGTGCGCTTCGCCTTCTTCAGCGGCAAAGACAATATCGGCCTGCAGGAATTCGACATGTCCTCCGCTGCGACCGAGTTTCAGATGGAGCCGGTTCTGGAAGGTTCGAAGCGTGGCGGTACTTTGCAGATCATCATCGGAGCTGTTGCGATTGTGGCCGCGTTCTTCACTGCCGGTGCGTCACTGGCTGCATACGGTGCGGCTATCGGTACCACGACCGCAGTTGGACTGGCTACTACAGCACTGACCAGTATCGGTATCAGCATGCTGCTGGGCGGTGTCGTACAGATGCTGACGCCGCAACCCAAGCTCAACGTGGGTGCGTCATCCAGCACGGACAATAAGCCGAACTATGCGTTCGGTGCGCCAGTCAACACCGTTGCCATGGGCTACCCCGTTCCTGTCCTTTACGGTATGCGCGAAATTGGCGGTGCGATCATCAGCGCAGGTAGCTTTACCAGCGATCAGCAGTAGTCAAACACAGTTAATTCAACAGGCCACCTTCGGGTGGCTTTTTTTATGGGTGAAATATGCGACTTCTTGAAGGTGCCGTGATTCAGGGCAGTAAGGGTGGTGGTGGTGGCAGCGCTCATACTCCGGTTGAACAGCCAGACGATCTGCTGTCTATCGCAAAATTAAAAATGCTGCTGGCTATCTCAGAGGGTGAGATTCAGGGTGATTTAACCGCGCAGCAGATTTATCTGAACGACACCCAGCTGGCGAACGAAGACGGCACCTACAACTTCACCGGCGTAGTGTGGGACTGGCGTAAGGGTACGCAGGACCAGACCTATATTCAGGGCATGCCTGAAGTTGATAATGAGCTTTCTGTTGGTGTAGCTGTAACGCAGGCCATCGCCTGGACCCGCCAGTTTACCAATCTGACGCTAGATGCCATTCGTATTAAGCTGAGTCTGCCGGTGCAGTATCAGTATAAAGACAACGGCGACATGGTTGGCACCGTGACGCAGTATGCAATTGATCTCTCTACTGACGGCGGTTCATGGGTCACGGTGGTTGACGGAAGGTTTGACGGTAAAACCACGTCTGAATATCAGCGCGATCATCGCATCGACTTACCTAAAGCCACCTCCGGCTGGTCAATCAGGGTGCGTCGCATTACCGCTGATTCGTCATCCTCAAAGCTGATAAACGCCTTCAAAGTTTTTTCATTCGCAGAGGTTATCGACAGCAAGCTTCGTTACCCCAATACCGCGCTGCTGTATATCGAGGTTGATGCAAGCCAGTTCAGCGGTCAGGCACCGAAAGTAACGTGCAAACCAAAAGGGCGGCTGGTTCGCGTGCCGACAACCTATGACCCTGTTTCACGCACTTATGCCGGAACGTGGCAAGGTGATTTCAAATATGCCTATACCGATAACCCGGCGTGGATTTTCTATGATCTGGTGCTGGATAAAATCTTTGGTATGGGGACGCGGGTCGATGCCACCATGATCGACAAGTGGGAACTTTACAGCATCGCACAGTACTGCGACCAGATGGTGCCTAACGGCGCTGGCGGCATGGAGCCGCGCTTTACCTGTAACGTATTCATCCAGAGTCAGCAGGATGCGTATACGGTCCTGAAGGACATAGCGGCGATATTCCGTGGCATTACGTTCTGGGGTAACAGCCAGATTTTTGTGAATGCTGACGTGCCGCAGGTTGATTCAGATGGCAACGTTGACGTTGATTTCGTTTACCACGCCGCGAACGTCATTGACGGGCTGTTCACTTATGCCGGTGGCAGCTATAAGAACCGCTATTCATCCTGCCAGGTGAGCTGGTCCGATCCGATTAACCACTATTCTGACACGGTTGAAGGCGTTTACGACTCAGACCTGGTACAGCGCTATGATGTGCGCGAGATGAGCCTCACCGCTATCGGCTGCACGTCTCAGAGCGAGGCGCATCGGCGCGGGCGCTGGGCGATTCTTTCCAATGCCAAAGATGGTACGGTTTCATTCGGAGTCGGCCTGGATGGTTATATTCCTATCCCCGCTGAAATCATCGGTGTGGCTGATCCGTTCAGGTCGGGCAAACAGAACGGCGGCCGCCTCAGTTCTGTCAATGGCCTGCGCATTACCCTTGACCGTCCTGTTGATTACGCAGTTGGTGATCGGCTGGTGGTGAACCTGCCGGACGGTACCGCGCAGACACGGACGATTGGCAGCATCAGCGCCGATAAGAAAACGGTCAGCGTAAATACATCTTTCCGCATGACGCCGGTAGCGGGTGCTGTATGGGCTATAGACAGTAATAATCTGGCAATTCAGTATTTCCGCGTCACATCAGTAGCCGGTAATGATGACGGCACTTTTACCATTACCGGCGTGCAGCACGATCCGAATAAATACCGCTACATTGACGATGGTGTGCGCATTGAGCCAGCGCCAATCACCGTCACGCCTATCAGCGTGCTGAAGGCACCGGCCAACATCGTAATCAGCGAAGTCAGCTTTATCGAACAGGGGCTCTCTGTTGCCTCAATGCAGGTGACATGGGACAGGGTTCAGGGAGCAATCAGCTATGTGGCTCAGTGGCGCAAGGACAAGGGTGACTGGGTAAACGTCAGCCAGACCAGCGCACAGGGCTTCAGCATTCAGGGTATCTATACCGGCGTTTATGATGTCCGGGTGCGTGCAGTCAATGCTGTAGAGGTTTCATCTCCGTGGGGTTATGCAGATTCTACTGCCCTGAACGGTAAAACAGGTAAGCCAGGCACGCCGGTTAACCTCCGTGCTACTGATAACGTGGTCTGGGCAATTGATGTAACGTGGGCGTTTCCTGATGGTTCAGGTGATACCTCTTACACAGAGATTCAGGTGGCCACAACGGCAGACGGGCAGAATCCTCAGTTCCTGGCTTATGTTCCTTATCCTGGTGTTAGCTATCAGCACGGTCCGATGCCCGCTGGCGTTCGCCGCTGGTACCGCGCCCGGCTGGTGGACCGCATCGGCAATACCGGAGACTGGACGAAGTTTGTGGAAGGTGCCAGTAGCGTTGATGCGACCGCGTTGCTGGGCGACATTACCGAGCAGGTCCTGAAAACGGATGCCGGTAAGCAGCTCATTGCCAAAGTCGATACCAATATCGATGCCATGCTGCAGAATGCGCTCAACCTGGACGCGACCGTAGATCACCAAATGGCAGAGTCCGGAAAAAACCGAGCTGACATTCTGACGGTCAAACAGACCATTGCCACAAATGAACAGGCTTATGCCCAGAAGTTTGAACAGATACAGGCGAGCGTAGACCAGAACACCGCCGTTGTTCAGCAGACGTCAACAGCCCTGGCTGACACCAATGGTAAACTATCAGCACAGTACTCAGTGAAAGTGGCCGTGGACAGTAACGGTCGCCAGTACGCAGCTGGAATGGGAATTGGCGTTGAGAATAGTCCTTCAGGGATGCAGACACAGGTGCTGTTTCTGGCTGACCGCTTTGCCGTTATGTCGCAGGTTGGCGCAACGCCTCAAACCTTCTTTGCTATCCAGAACGGCCAGACCATCATCAGCCAAGCGTTTATCGGTGAAGGCACCATAACCAGCGCCATGATTGCTGCTTATATCCAGTCAACGAATTATGTTGCAGGTGCGGCTGGCTGGAGATTGGGTAAAGATGGAACATTTGAGAGGAATGCTGCAAATGGATCCGGAAGGGTTGTAGACACAGGCACGCTAAGACAGGTCTATGATTCAAATGGCACTTTACGCATCAGAGACGGCCTCTGGTAAGGAGTATCAATGCCCGGTGGACTTCAGTGCTGGGATGCAAATGGAAAGCTGATAGTAGATATTGGTGACTACAACACAAGGTACCTCGGAAGAACAGCTGTAACGATGGCGGCTAATACCAATGTCGTTACAGGTGCATTTGGCGGACTGACGACTTCTGGATCATTTGTAGTTGTGGTATCAGCCTCAAGCTCTGTTTATTACACACCATCAAATTTCGCGGCGCGTGCATTGAATGGCAGTTTCAATATATTCAAACTGTCAAGTTACACCGCAGCCGTCACCCTAACTTTAGACATGTATGCCTTCATATGAGTGGATATCAGGTATGGAACTCGGCAGGTGCCCTCGTAATAGATTCTGATTATAAGGGAACTTATTACCGGGATACGGTTAATTACGCAGGCATTACTGACATTGGTTACTACAATATTTCTTGTCAACTTGGGAATTCAACTGATATGGGGCACGTAAGTGCTAGTGTCCCACTTGATGATAACCTTCGATGGTTTAAACCCAACAATAACGCCAAAATGTTTTTCACAGGTCCTGACTGGATGACTGCCAATGCTGGTTCAATGGCTCGAAGTCGAAGTGATATGCCTGTAGAAAGTGGTTATAGAGACGTATTTAACTCATCAGGGCAATTAGTGTGGTCGGCAGTTATGGCTGCAAAAATACCTCGAATAATAGGTTTTTTTGATGTTCCGGCTAATTTCGATTTGGATAATTCTGTTTATTCACAAACAATAGGTAACAACTCATGGATTCTTGTTAGCTCAGTGCCTGGCGGAAATATTTCAGATGATGGATCGGCAACCGGTTTTTCGGGGCCATTCTTCAGATTTCAAAATGGGATGCTGCAGTGTCAATGGGTAAATCAACTTCAGCAGTCATGGGCTAGCACGCTGAAACCATATGGCATGCGTATTCCATACGGGATATTTTCAAACCTTAGTTAA
Protein sequences of DBSCAN-SWA_5 >CP028349|2447444:2487166|2486515_2487166_+|AVV37843.1|DBSCAN-SWA MSGYQVWNSAGALVIDSDYKGTYYRDTVNYAGITDIGYYNISCQLGNSTDMGHVSASVPLDDNLRWFKPNNNAKMFFTGPDWMTANAGSMARSRSDMPVESGYRDVFNSSGQLVWSAVMAAKIPRIIGFFDVPANFDLDNSVYSQTIGNNSWILVSSVPGGNISDDGSATGFSGPFFRFQNGMLQCQWVNQLQQSWASTLKPYGMRIPYGIFSNLS >CP028349|2447444:2487166|2456469_2456658_+|AVV37802.1|DBSCAN-SWA MSIVSQKLSVEQFRCRFIAGVPVYEQIIPTAGGPNNYQTTTRLVVESAWKTFYRCPAQSGVN >CP028349|2447444:2487166|2480472_2480814_+|AVV37837.1|tail|DBSCAN-SWA MATDTFTWATRIQASEQLSVSTIQAQYGDGYKQVAGKGINDAAESWSLSCNGQVDVMASVRSFLKTHVATSFWWTNPWGEKKLYRVKGDSINPKFINGGFVEISFTFEQAFAP >CP028349|2447444:2487166|2477122_2480470_+|AVV37836.1|tail|DBSCAN-SWA MADVASLAVGLHLNAANFKSQLVSAYGDAGKQSRQFNRQAQDDAKKTEEAYGRVNAAVRGLAGRIAGLAGVGLSLGTIIQTSRQYSQALSDLSSITGATGNKLRDLDAAAQQMGRTTEYSASQAVEALKLMASAKPELLDTADGLQKATNSALLLAQAGGSTLPDATRTLALSLNQFGAGAEQADRYINVLAAGAKFGASEINDTAAAIKNGGVAAAQAGIGFETLNAAIQVLASREIKGGEAGTALRNIILSLEKGTDKTLKPSVVGLSKALENLAGKNLSTAQAVKLFGVENINAASILTGNRGKIDELTKSLTGTQTAHEQAAIRVNNLNGDLMGLTSAFEGLIIKVGQSGSGPLRSGIQVISEGINKLSDNFNAIASVALYTLIPVLSTKLTAGLRENISGWAANEAAVRKNALQQAETAKQTIAAAQATRQQAQEEARYLGTRTAANAAAGINVGYQKEQVALSRTIRESRIAETAATERLATANAQLSVTARAASVASGLARGALSLIGGPVGAAMLAGSALLYFHEQAKQARQSAIDLKGAVIETTAALMQMSDKQLSVKQLDLQDQYENQVTQRNQLIKEIQDADSRIDSLKGFDPFGQLAGVEKGKTRAEADLESVNGGLKTLKDNMENVDKARFLVKTGIADSAKNLKSDVQAATAAAAEAGKVASPWGGEDPAKADKKGAQALKQFTALRNEIEQANASSLEKINLQEKVSQEKILKDAKAAGVSQSEVQRVLTLNTTNYQRQRQELAEQYSPAKAIIRQESEASRNLKELYDARLVTEQEYQSARITLANDSAQKMIQTQASQAAAPKLNIAGEVDPVAQLQNQLVQQQSLYTAYYENSRLNKDQYEALMQKSSRDSADAQYQAALNLYAGQSTLNKGIVSLAETAAERTTNSLTGLLTGTQSFRESLSNLFASLAQSVIKSLVEMTAQALLTKTVLSSFMSFGGAAVGAVGTGAAASAGSTGAMGMSTSFQAYDGGGFTGTGGKYDPAGVVHKGEFVFTKEATERIGVENLYGMMRGYASGGLVDTPTERPAALPGSGRSGGNTIIQVDAPVTIMQEGGAGDASATGTSAVASQLKSIVQQTITDRLRKEISPGGILYSGRS >CP028349|2447444:2487166|2458295_2459759_+|AVV37806.1|DBSCAN-SWA MSTLARIYDDKKNSDTDITTRKTYLLGVDELYVETNYNIRDIDQTHVEEFRDAFIAGEHVPPLAVKVTEKGIKIIDGHHRYYGAKLAQEAGYTLRLECKDFVGSEADSVAFMVTSSQGRALLPLERAAAYQRLVNQGLEPAEIAAKVKRSITDVEQHLQLLTVGEPLIEMVKSGEVAATTAVALQREHGVKASSVAQEQMQKAKAAGKKKLTRSAAIVSPVKLREKIRAEHAAWSQETFGDVGPVGPLKHLAKEAMEAAEAPDDLSEWADLQFLLWDAMRRAGITEEELNAAMELKLSVNKARNWPEPKDGEPREHLKADSQEAVQPEKDYGDELPLLKHEILEQSGVEAWACVIAAFKMKAEYTYSESKWAHTWAADSVENPTCVTVPAETIASAVRLIKQHQDDLELKLWLSEQHDDSEVATEQLIRFSAVLSEVRQDNPCTVQEFIALVEQTNRDCWSNIRMLRQAVREVAGQMTIPGIGEVAL >CP028349|2447444:2487166|2449531_2450710_+|AVV37792.1|integrase|DBSCAN-SWA MAISDTKLRGLHGKPYSGPAEITDADGLGIRITPKGIVSFQYRYRINGSQHRLGIGRYPGVSLRDARIKVGEYKSLIAEGIDPKHQLTVKKNKPTVHECIKYWYDNYVLQSLRKSTAEVYERIVLNEMEKYFADIPIEHIPVSAWVDFFTEQEQANPLKARKLLVHLRGAIAWCSRRQFIEDSSLLRLNPKEFGRNPKTGDTVLTYRQLAKIWVENEKSTATLSSKMLIKSLILYGSRNSELRESRKEDFDFEEGIWTLPSERSKTNKIIRRPIFKQIEPLLKQSIDNGNGILFHGAFERNIPLSIGSSTRYVRLLRDQLNFGDFTAHDFRRTMATRLAEEGIAPHVIEKMLGHDLGGVLAVYNKHDWLAEQKVAYELYADKIFEQIKLISD >CP028349|2447444:2487166|2473279_2474491_+|AVV37829.1|capsid|DBSCAN-SWA MSDVNDLLTKVSNKLEKVSAEFSEKAEKALNEAKNSGQLSTETKAAVDKIATEHNALNEAMKTLKTSLGDLEQHVAAQMPLNAAQEVIQSVGQQFVSAEVMKDIRSSLEGNKRISVPVKAALTTVDVPGQIVAPQRLPGIDTAPKQRLFIRDLIAPGRTQSNTIYYVQQTGFTNKASVVPENTTKPYSDIQFAEKTTAVRTIAHMFKASKQILDDFAQLQSTVDAEMRYGLSYVEEQEILFGNGEGAHLAGIIPQAKPFSAAFAVQNETGIDILRLAMLQAQLARFPASGHVLHFTDWAKIELSKDTLGRYILANPSQLTTPTLWGLPVVATEAAQFLGKFLTGAFNSGAQIFDREEANVVVSSENSDDFEKNMISIRCEERLALAVYRPEAFVYGSLTGSGS >CP028349|2447444:2487166|2461475_2462492_+|AVV37810.1|DBSCAN-SWA MRALLTPEIAPRTGIVLLKPGPELLRLFKGRVVISTPTMDMADLPSGRLNDGTQPLLDEPSLIPFFSHERVIKAAGGPNALASFVQSFGCCQWEQLGVWHHHEFTVSEIENGLVSLCYSHDNEFRENGVPGSLENIAKGNTALWIIRAACSHMALNGDHQLTLPELCWWATLNDVIDLIPEAPARRVLRMPKETIQSGELKEARIVPARLAREVIQDAAQLVKKLIDLRTDPESPESFMKRPKRKRWESEKYTRWVKSQTCACCGIQADDPHHIIGHGQGGMGTKAHDLFVIPLCRAHHDELHRDMKAFEAKYGSQIELLFRFLDFAIAVGVIGTDKK >CP028349|2447444:2487166|2476901_2477123_+|AVV37835.1|DBSCAN-SWA MILWLADRWGEPDPSVIAALPCDTLNHWRAYFLQQGILTRSEPQSAQSPHDTRPNTATHSVDQQCDAVMRALM >CP028349|2447444:2487166|2454858_2455512_-|AVV39281.1|DBSCAN-SWA MMNMGERIRLKRKELNLTQQALAEKAGVNRVTVTGWEKDDYQPNGANLQALADALKCDPTWLVSGKGESISSPLLRPVQVSAKEVPLISWVQAGTWTATDPGLTRDEAIMWLYTTASVSDKAFALRVRGDSMTNPHGNPTIPEDSIVIVEPEIHDVAAINGKIVVAHIDGGSEATLKKFVEDFPHRYLVPLNPNYKTIECDGNCRIVGVVKQVIMEF >CP028349|2447444:2487166|2472421_2473270_+|AVV37828.1|DBSCAN-SWA MTIKTLPVAPEGRPFARQNTELPSAAFERWDGGIRAAGQSGDNTISILDTIGEDWYGEGVTASRISGALRSIGGGDVTVNINSPGGDMWEGLAIYNLLVAYEGKVTVKILGIAASAASIIAMAGDDIQMGRGAFLMIHNCWTIAAGNRNDFRDYADSLEPFDKAMADIYAARSGLKLSEVQTLMDNESFISGSEAVEKGFADSLLSADEITSDDESPAAALRKIDAFLAKGGMPRSERRKHLKALGGKPGAATEKNDKPGAVDEINPEALNSLKNALASLGE >CP028349|2447444:2487166|2450690_2450882_-|AVV39280.1|DBSCAN-SWA MANIEMIFESEAMQKIGVTSRTTMRTYVLNHSFPKPVRNRPKKYLLAEVEQWILNGGVNQRSA >CP028349|2447444:2487166|2468328_2468682_+|AVV37824.1|DBSCAN-SWA MPQRIPRACRKHGCAKTTTDRSGYCADHLNEGWQQHHNGLSRHQRGYGSQWDVRRARVLERDRHLCQECLRKGRPTAAKTVDHITPKAHGGTDDDSNLQALCWPCHKAKTARDRIKR >CP028349|2447444:2487166|2471106_2472411_+|AVV37827.1|portal|DBSCAN-SWA MKKNKQPGKVKSALLNWLGVPISLTTGTFWEEWWGKSSSGKTVSADKAMRLSAVWACTRLLSESVSTLPLKIYQRQPDGSRVLALDNPVYQVLCRRPNLEMTPSRFMLSVVASVCLRGNAFIEKKMIGKKLVALVPLLPQNMVVKRLDNGSLQYTYTEVKSKREIPVHNIMHIRGFGLDGVCGMMPMMTGRDVIGAAMSVEESAAKIFENGLQSSGFLSSDVAMDDKQRERLRGYLERFIGSKNAGKVMVLEAGMKYQGVTINPEAAQMLESRSFSIEEICRWFRVPPFMVGHTTKQSSWASSVEGMNLLFLTNTLRPLLVNIEQEISRCLLDGSDDVFAEFSVEGLLRADTAGRSAYYTTALQNGWMSRNDVRRLENLPPIAGGDIYTVQLNLTPLDQLRENNAGAQASNMMKLHAFLFPDIPPEHSPLKKAA >CP028349|2447444:2487166|2465663_2465852_-|AVV37820.1|DBSCAN-SWA MIRVEISGIIYDIGYEHGVYFARASSGQSPVGQTIDELSQGFAEITGLKKEDLKAYLLSLGI >CP028349|2447444:2487166|2460242_2460608_+|AVV39282.1|DBSCAN-SWA MKLILPFPPSVNTYWRNTRKGVLISASGRCFRSNALAAVMEQLKRRPQPITVNVEVSVLLFPPDKRQRDLDNYLKALFDSLTHAGIWGDDRQIKRFTVEWGPVTKGGKSEVVISEFQPVAA >CP028349|2447444:2487166|2464793_2465228_+|AVV37817.1|DBSCAN-SWA MNPLSLIKTFSPVIVIGLICLALWMLNARSSQLEATNQRLEKLANSKDDQINDLRSKNDGLASSVTELVTAVKQQNEVMSQVTEQRAVTAQQNRKLQNEIKRYLAADKCAVAPVPPDAADRLRDAAKAAGGVPDSKTAKVKPSG >CP028349|2447444:2487166|2460762_2461479_+|AVV37809.1|DBSCAN-SWA MLNQSAGAIAPVVNAIQSPIMTSREIAELTGKEHKNVTVDIRRMLDDLGEDALKFQRIYLDTMNRQRTEYHLDREHTECLITGYSAILRMKVIKRLHELEESQPVKIPRTFAEALRLAAEMEEEKDRLQLQLTEAAPKVAFVDRYVTATSSMTFRQVAKLLEAKEPELRLFLIESRVMYRLNGVLTPYSQHIEAGRFEVRTGTTTESNYMFSQSRFTAKGVQWIGGLGTAYKAAGGAE >CP028349|2447444:2487166|2482997_2486204_+|AVV37841.1|DBSCAN-SWA MRLLEGAVIQGSKGGGGGSAHTPVEQPDDLLSIAKLKMLLAISEGEIQGDLTAQQIYLNDTQLANEDGTYNFTGVVWDWRKGTQDQTYIQGMPEVDNELSVGVAVTQAIAWTRQFTNLTLDAIRIKLSLPVQYQYKDNGDMVGTVTQYAIDLSTDGGSWVTVVDGRFDGKTTSEYQRDHRIDLPKATSGWSIRVRRITADSSSSKLINAFKVFSFAEVIDSKLRYPNTALLYIEVDASQFSGQAPKVTCKPKGRLVRVPTTYDPVSRTYAGTWQGDFKYAYTDNPAWIFYDLVLDKIFGMGTRVDATMIDKWELYSIAQYCDQMVPNGAGGMEPRFTCNVFIQSQQDAYTVLKDIAAIFRGITFWGNSQIFVNADVPQVDSDGNVDVDFVYHAANVIDGLFTYAGGSYKNRYSSCQVSWSDPINHYSDTVEGVYDSDLVQRYDVREMSLTAIGCTSQSEAHRRGRWAILSNAKDGTVSFGVGLDGYIPIPAEIIGVADPFRSGKQNGGRLSSVNGLRITLDRPVDYAVGDRLVVNLPDGTAQTRTIGSISADKKTVSVNTSFRMTPVAGAVWAIDSNNLAIQYFRVTSVAGNDDGTFTITGVQHDPNKYRYIDDGVRIEPAPITVTPISVLKAPANIVISEVSFIEQGLSVASMQVTWDRVQGAISYVAQWRKDKGDWVNVSQTSAQGFSIQGIYTGVYDVRVRAVNAVEVSSPWGYADSTALNGKTGKPGTPVNLRATDNVVWAIDVTWAFPDGSGDTSYTEIQVATTADGQNPQFLAYVPYPGVSYQHGPMPAGVRRWYRARLVDRIGNTGDWTKFVEGASSVDATALLGDITEQVLKTDAGKQLIAKVDTNIDAMLQNALNLDATVDHQMAESGKNRADILTVKQTIATNEQAYAQKFEQIQASVDQNTAVVQQTSTALADTNGKLSAQYSVKVAVDSNGRQYAAGMGIGVENSPSGMQTQVLFLADRFAVMSQVGATPQTFFAIQNGQTIISQAFIGEGTITSAMIAAYIQSTNYVAGAAGWRLGKDGTFERNAANGSGRVVDTGTLRQVYDSNGTLRIRDGLW >CP028349|2447444:2487166|2455833_2456370_+|AVV37801.1|DBSCAN-SWA MVDTINTAIRLMCKAHKAGRLGMADDLGMTIDQFHNHMYRKCGSRFFTLDELMKMEDLSGTACLADFFATRHGKLLVDVSAVKEVDKVDLYDIEMKASAAAGELAIAKIAAASDGVIDSKERKTLSALFHKKMRHQIHGFLGFMALYGVGVAEHSVDMFVANGRKIDASGVQIEVQDI >CP028349|2447444:2487166|2469376_2471107_+|AVV37826.1|terminase|DBSCAN-SWA MATVQAGIQYAESVLAGEIVAGELVRLACQRFLNDLEHGPERGIYFSEDRAQHILDFYNFVPHVKGALAGKPIELMPWHIFILINLFGFTIPLIDEMSGKQVMDDDGDPVMVRRFRTAYNEVARKNAKSTVSSGIGLYMTGADGEGGAEVYSAATTRDQARIVFDDAKNMIKKAPRTLGRLFGHVKLNIHQERSASKFEPLSSDANNLDGLNIHCGIVDELHAHRTRDVWDVLETATGARLQSLLFAITTAGSNKEGICFEQRDYAIKVLRGVVEDDTYFAVVYTLDEEDDPFDEANWPKANPGLGVCKRWDDMRRLAKKAKEQVAARPNFFTKHLNIWVTAESAWMDMDRWSKMLPTAEEAKRKGWPLWVGVDLANKIDICSAVKTWRDPTGETHMEPRFWLPEGRIETAPNHIAELYRKWADAGHLELTDGDVIDHGVIKAEIVEWVKGENIKEIAFDPWSALQFSLALAEEGLPLVEVPQTVKNLSESMKSVQAEIYGNKFHHDGNPVMTWMMSNITVKPDKNDNIFPNKSTPENKIDGPVALFTAKSRLLVNGGGDVQDLSGFFENPIMIGF >CP028349|2447444:2487166|2476005_2476458_+|AVV37833.1|tail|DBSCAN-SWA MADKTSPEYAMLPAGTVVKWGPSGAAVSAMKPLINCKALGATGQTGSFVDCTTLIDKSKQFISDLPEGPEKSLGFIDDPSNTDFAAFLNAAQNRQTVQFYVELPNGRTANMVLALSGWQMNEITAPASEVIQITVQGKQNNIEWGVVAGS >CP028349|2447444:2487166|2475570_2475972_+|AVV39283.1|DBSCAN-SWA MLRMEVTGLDELERQLIALGEKAGTKVLREAGRAALQVVEQDMKEHAGYDESAKGPHMRDSIKIRSTTRTRGNAVVVLRVGPSKQHYIKALAQEFGTVKQVPDPFIRPALDYNKSRVLRILAVEIRDRIQNNV >CP028349|2447444:2487166|2474533_2474860_+|AVV37830.1|DBSCAN-SWA MIIAIETVREHCRIDADDSSEDSLLMIYIGAAKRHIEKWTRRNLYETNADAGFDTDEDRLLLDDDIRLVILLLVGHWYANREAVSDKNTSEMPLAVDALLQPYRIYGL >CP028349|2447444:2487166|2451755_2452340_-|AVV37795.1|DBSCAN-SWA MEKLNELVEQAKFVADWHGSDWSSALCQDKDGKQAHEIICESRGEAVISTGSNSNEASWLCDYLELCSPANILAIAEAFQALEQQRDELAAENASIKANVAIHAAGFSVCPVCSHEEPSETDNIVWMVKATPATDSYLNSVRAEGVEMFSKELGSPYGDGEGRDYETGFNRAIEVSKSKAVKFVSQLRAGKDGE >CP028349|2447444:2487166|2482317_2482941_+|AVV37840.1|tail|DBSCAN-SWA MLKTMRLRGRMAKMFGQVHQFHVADLRELLRAMCSQVPGFKKYVSNAHLNGVRFAFFSGKDNIGLQEFDMSSAATEFQMEPVLEGSKRGGTLQIIIGAVAIVAAFFTAGASLAAYGAAIGTTTAVGLATTALTSIGISMLLGGVVQMLTPQPKLNVGASSSTDNKPNYAFGAPVNTVAMGYPVPVLYGMREIGGAIISAGSFTSDQQ >CP028349|2447444:2487166|2468899_2469373_+|AVV37825.1|terminase|DBSCAN-SWA MPGPPKTPTHLALVKGNPSKRAVNKKEPKPPSGVPPIPKHLDKMGKYWFKRIGEELDAVGVMTTLDGKALELLIEAYTEYRQHCDVLTEEGYTYKTVSATGENIVKAHPAAVMKSDAWKRIRAMLSEFGMTPASRSKVGASGPAEADPLEEFLKKRK >CP028349|2447444:2487166|2466303_2467761_+|AVV37822.1|DBSCAN-SWA MKVCIDDVEYAPITERVSNIGIAISTHNRHDVLSRALEHQVKFLPAGALVVVIDDGSDKLVTAPEGVRVIRHDTSRGIVAAKNASLEALIDAGCEHLFLWDDDAWPVAGGWEQPYITSPEPHLAYQFQDFATGQKLNDIAVLYRDDHHVAYTGQRGVMLYYHRSAIEKVGGFDTVYQRGMYEHSDLALRIHNAGLTSWAFADVTGSGKLIYSLDEHQAVERSVPKPDREAQVKRNVTIHNERRNGGYTGYAEYRQQRNVVITTLLTSQPDPQRDTRMTASPDMLNKWAVSVKGGDAVVLADELTTSPVGASLVAVADVEMNVYFRRWLHIWQHLRDHPEYHFVWCTDGTDVEMLREPWQGMEQGKIYVGSEPKTYAAAWAKQQHPEGVYQAFLAEHQNNVMLNAGLLGGARADVMSIAHGIVRLYYHIESLRFWNQEVKAAAVGDMIAFGIVAHRYSDRLVTGPRVHTVFKSEGIGKEFAWWKHK >CP028349|2447444:2487166|2466004_2466184_-|AVV37821.1|DBSCAN-SWA MSLKPGQNSGKDGGVYQQIGPRGGKTDNYTTIPDNHTAPPTSKSGSTWEQVKRTPDSKR >CP028349|2447444:2487166|2463245_2463422_+|AVV37813.1|DBSCAN-SWA MKQSEFRRWLESQGVEVSNGTNHLKLRYNGKRSVMPRHPGAELKEPLRKAIMKQLGLK >CP028349|2447444:2487166|2467745_2468360_+|AVV37823.1|DBSCAN-SWA MEAQISFVVVGHYSRRHQAERLAQILNAHLLIDEDHHGANWNHRRAIEWASQQDCRVVILEDDALPVHGFAQKVAEWLSRFPEDLLSFYLGTGRPPQYQPEIATKLIDTDRQQADYITLNRLIHGVCYSVPQPKLNQVISRWNHGSPADYAVGDACGGAVVYPCYSLVDHADAATVERHPDNTPRTERRRAWRLDAATNPESVP >CP028349|2447444:2487166|2460048_2460246_+|AVV37808.1|DBSCAN-SWA MISHEAFIGAIFFAAFYFFFAGIVAEHSSSNRHKDISSPIPSIARGLAWPLALIKYALLVIWGWL >CP028349|2447444:2487166|2452753_2453197_-|AVV37797.1|DBSCAN-SWA MLSSKSNKEVVAAGHQFARIIGKETSLLEMAKMVSDLATRLDVATVRASLMASEVLRINSVLPDTISALQAAGADLTLIDDLNAALATPACDQWIRTLRGEAIGEARQAVATMGNQQLPGTLQAINILSQMEMDLLRSRTGTLKVVS >CP028349|2447444:2487166|2453208_2453373_-|AVV37798.1|DBSCAN-SWA MQKPDDHITVGIITLPYSHILNGWIMPDGSVISNPIKAQREAERLNKTINITIH >CP028349|2447444:2487166|2463470_2463878_+|AVV37814.1|DBSCAN-SWA MRYPINLEPCDGGYVVSFPDIPEALTQGDTREEALEMGLDALVTSFDFYFEDNQPVPAPGPVTGDFVEVPASVSAKVLLLNAFLASGLTQVELASRMGVKKQEVTRIFDLHHSTKIDTVQKALSALGKRLELVAA >CP028349|2447444:2487166|2465118_2465373_+|AVV37818.1|DBSCAN-SWA MLLLLFLQMLLTGCVTQQKPLVEYRTVKQPKLNLPAELTSQIDVPAPSQDMTFGDSVSLNAELYGALGQCNIDRAAIRKIESTR >CP028349|2447444:2487166|2465381_2465621_+|AVV37819.1|DBSCAN-SWA MSEAKPQDGSKVQGYRTLTDKDILEMNRLKEISRQFIAQLEYLKGAKDYDPRWIAQAKTAMQHACMFACRAVAQPDDNC >CP028349|2447444:2487166|2455611_2455809_+|AVV37800.1|DBSCAN-SWA MRKQDVIKFFGGVCKTAAVLGIKHPSVSEWPEVIPEGRAYQIEKITKGKLRFDASLYQKDTGQPA >CP028349|2447444:2487166|2456836_2457733_+|AVV37804.1|DBSCAN-SWA MSLLLKVKPLVISPALAQRIGLNEAIVLQQICYWLEDTASGVEYDGKRWVYNTIEEWTNQFPFWSSDTVKRALTSLKKHDLIFVEQLKKSQHDRTNYYAINHANPLLTDEGKLHSSKDATCTNRVEQPAPIDKGNMPSSIGANCPRLTENTTEITTEITTTPSCQVAAQPDDEWSLVNRSREVLRHLNKVTGAKHTEAQSSMGHIKSRLKDGFTVEELCLVVDYKHAHWEGTEEYQYMRPKTLFIPGNLPGYLQIATKWDMHGRPPRSEWNALKRNMQRDITVIPQPDSSVPDGFRGA >CP028349|2447444:2487166|2476512_2476881_+|AVV37834.1|tail|DBSCAN-SWA MSEKKFSAAMLKSVLLQPKSTAIKTELLGAQVYIRRRTAGELIRYEEELDAAQATGNVRAISEMSVQLVLDSLVNPDGSAIKPELLPTAAELLEAHDNPALMAAIERVKTHAIGKLEVAEKN >CP028349|2447444:2487166|2451279_2451756_-|AVV37794.1|DBSCAN-SWA MEQPILDMCCGSRMFWLDKKDRRAIFADIRKESHVLCDNRALHVNPDIIADFRSLPFPDCSFAQVVFDPPHLDRAGENGWMRKKYGALDKQTWRDDIRVGFSEAFRVLRPHGTLIFKWNETQIPVSQVIALTEQKPTIWQRTGKGDKTHWIIFLKEAG >CP028349|2447444:2487166|2486213_2486519_+|AVV37842.1|DBSCAN-SWA MPGGLQCWDANGKLIVDIGDYNTRYLGRTAVTMAANTNVVTGAFGGLTTSGSFVVVVSASSSVYYTPSNFAARALNGSFNIFKLSSYTAAVTLTLDMYAFI >CP028349|2447444:2487166|2464371_2464797_+|AVV37816.1|DBSCAN-SWA MKVSNNGINLIKRFEGLELKAYKDSVGILTIGYGHTHAVKAGDAITSERADAYLREDLQVAELTINTNVKVKLTQGQFDALVSFVFNLGSGNFVKSTLIRKLNAGDYAGAADEFGKWVNAGGKKLPGLVKRRAAEREVFLT >CP028349|2447444:2487166|2447444_2449286_-|AVV37791.1|DBSCAN-SWA MRGFGSRSTYGVRGLRIYVDGIPATMPDGQAQTSNIDIGSIDHVEVLRGPFSALYGNASGGVINVTTQQGQQPTTLEAGSWYGSYGSWRNSVKASGATGDGTHAGDVNYTVSASRFTTHGYRDHSSAQKNLGNARLGVRIDDVSTLTLLFNSVHIDAQDPGGLTEAQWRDNPRQVVSNVTLYNTRKTVDQTQGGLRYQRQMSENDDLSVMLYAGMRETTQFQSIPATVQRNPSHPGGVIALTRHYQGVDTRWTHRDTLLSMPVAVTGGLDYETMTERRKGYENFTVSNGVTQLGEQGNLRRNERNLMWTLDPYVQTNWQLTDKLSLDAGVRFSTVNFDSNDFYIRPGNGDDSGEARYHKWLPAAALKYAFDPSWNAWISAGRGFETPTINELSYRSDGATGLNLGLKPATSDTLEIGSKKRIGNGLISAALFQTDTRDEIVADASSGGRTTYKNAGQTRRRGLELSLDQQFAWDWRLKMAYTLLDARYRSNACGSDSCDGNRIPGIARNMVYAGLGYLPEQGFYAGSEVRYLSQIAAEDQNNVNTPSYTVAAVNSGYKWLVDNWTLDLFGRVDNLFDRHYVGSVIVNESNGRYFESAPGRNYSVGVTLGYAFR >CP028349|2447444:2487166|2459755_2460052_+|AVV37807.1|DBSCAN-SWA MSWRRLSGGRNQVILTEYSLDVKEGDSRAVYLVRHNSKIWNTTLEQNITVERDSYGGFKPTIAMQDFPRGLSERESMLKLADWLHRLGVSIEDHWSKP >CP028349|2447444:2487166|2456657_2456840_+|AVV37803.1|DBSCAN-SWA MENEIIKPWVERYKDPRGVVVETVGVDVVNHRVIYMRPNYPHPCMQPRVLFSQKFRKVAS >CP028349|2447444:2487166|2481605_2482325_+|AVV37839.1|DBSCAN-SWA MRKKILEAIREHVAAEYPKEACGLVIQSGRSQNYIPCRNIADAPTEHFTLSPEDKREAEAQGDILMVIHSHPDVPQLIPSEHDRVQCDFSGVEWGIMSWPDGDFCTISPRTDRDYTGRPWLIGGNDCWTLIMDYYQREHGITLKNWSVDYEWWVGGKENLYDDNWQAEGFVEVEPAEMREGDMIMMRISAPVTNHAAIYLGNNIILHHNAGSLSTRVPYGEYWRNRTVRVVRRKELIDA >CP028349|2447444:2487166|2450932_2451280_-|AVV37793.1|DBSCAN-SWA MPKSPAERKAAQRARQAAAGDKKLELALDSQELEMLAQNCAARRPGRDPYELKEYIALLIRKDSAELAKQIEALSHQQCGKCKEQLPVQSCPCQGQAACWATRGWHKLKLNINTP >CP028349|2447444:2487166|2464034_2464388_+|AVV37815.1|holin|DBSCAN-SWA MPPEKDPGFWATVLLWLYAHKTEWGYAGVAGMFSLLRSAYAKSSWSKRVLDAVSCSALAFFAAPTLQVVGALFNWNIPDAAAQVFAVYIGYVGNDYISARLRGWIDRKAGDTNESQQ >CP028349|2447444:2487166|2475184_2475568_+|AVV37832.1|DBSCAN-SWA MIESGIYKSLQSLSELEVYPLLIPDTEQHGITYQRISDPEIEGGLVRTSLVAGRFQISFVKVSDYSGLLALDAQLWQMWKGIRHGDIGGYPVQYVERGTLQQDKSTLPNNAVQYRLTRDFIIYFSEV >CP028349|2447444:2487166|2453359_2453878_-|AVV37799.1|DBSCAN-SWA MGYWKFTNAEALAAWEKTRADEAQMRKEAAELTSMLGGKPVFKSDLTRSTFYGVNFDAAPYLAKELWTVPTGNTGYASWPKARPPKGMKEEHAAVKKLWGEHYPKTKVSNDEFFKAIGLDWGMLFLCGITYFRQCDAIYVQTGATPKPDFGAVEIVGSEFDKARREYSDAKA >CP028349|2447444:2487166|2474856_2475192_+|AVV37831.1|head,tail|DBSCAN-SWA MTGLAAGELDKRIKVQRTESERGPLGEVLPGQIVISSPFIWAKAENISNRKIRSMDQQQIVETWQFTIRPRSDVQTDWKISWGNEVYTIRAVDRSSRDRAVITAERDVRHD >CP028349|2447444:2487166|2480865_2481603_+|AVV37838.1|tail|DBSCAN-SWA MSFNQDIQALEPGSLVQLIEIDGTAFGLDTVLRFHAYNLPTEGWQSFAAENLPSIIWQGNEYDPHPYELTGMEMSSTGSQPTPKLSVGNVGNYVTALCLQFDDMVKAKVRIHTTLAKYLDAANWTAGNPSANPQEERVQLFYVNAKTSETRAQVDFELCSPFDIQSLQLPSRQITPVCTWCMRGWYRTGTGCDYAGSRYFTKDGTATSDPSKDVCGGRMADCKARFGDDQPLPFGGFPAANLQGK >CP028349|2447444:2487166|2452352_2452757_-|AVV37796.1|DBSCAN-SWA MKKVAQFRRSNGPNAGFSEKLAWQLSKGPATGRELAQQLGMTLSEFNRLVLHIMRRGGETLQVEASNQVCLGGGSIDRTYTLIRNPRRVAPPPCKPMVINYSNDRSEEAIKRHREAAARRARLIASGLYLECMG >CP028349|2447444:2487166|2462513_2462942_+|AVV37811.1|DBSCAN-SWA MRDMSQVLERWAGWAKSDSSGVDYSAIAAGFKGLLPQDSKLTLTCSDGDGLIIEGCLSRLKAKRPDEHAIIVLHYFFNISKRTLAKQAKRDEKIVRIEIQMAEGFIEGCLAMLDVRLDMDDELTPKKILKKPLTRSAFSLVI >CP028349|2447444:2487166|2457745_2458180_+|AVV37805.1|DBSCAN-SWA MTNHESKILELITRNGPLKVRELCKLTGLHETSVKRFIKPLFTKGLLKRASDWSYSINTDPLPVESEKYSHMAKQASELEAKGFWLRAAQVWREAMLAARFDASRNEAKENCDRCAVKGSLNCGSYGGLDTGRIISASVNRDLL >CP028349|2447444:2487166|2462992_2463178_+|AVV37812.1|DBSCAN-SWA MWMSKRLGPYQPGGVFYFKYPLKGIAIYLSLAGDKKFTLLSTGKALTAFASGLFFKNIGLR |
57 | Klebsiella_phage(21.74%) | capsid,integrase,portal,terminase,tail,holin,head | attL 2446646:2446661|attR 2490628:2490643 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
3143884 : 3164318
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >CP028349|3143884:3164318|DBSCAN-SWA CGTGCTAACTCAACAACAACAGGCTCTTCTCGACTGGGACAAAACAGAGGGCATGATGCCGGTCATCGTTCAGCACAACGTCTCAGGCGAAGTGCTGATGCATGGCTACATGAACCCGGAAGCGCTGGAAAAAACGCTGGCAGAGGGCAATGTCACCTTTTTCTCGCGCACTAAAAACCGTCTGTGGACCAAAGGCGAAACGTCCGGCAACTTTCTGCAGGTAGTGAGCATTACGCCTGATTGCGACAATGACACGCTGCTGGTGCTGGCGAATCCGATTGGCCCGACCTGCCATCTGGGCACCACCAGCTGCTTCTCGCCGGCCGTGCCTGAGTGGACCTTCCTTTATCAGCTGGAAGAGCTGCTGGCGTCGCGTAAAACGGCCGACCCTGATAGCTCTTATACGGCACGACTCTACGCCAGCGGCACCAAGCGCATCGCACAGAAAGTGGGCGAAGAAGGCGTGGAGACTGCGCTTGCGGCTACCGTCAACGATCGGGATGAGCTGACCAATGAAGCGTCAGACCTGGTCTATCACCTTATGGTGCTGTTGCAGGATCAGGATCTCGACTTCAGCGCCGTGATTAATAACCTGCGCGCCCGCCACAAGTAATTGTGCAATAACCTGCTCAGAATTTCGTTTTTATGAAGATTCTGAGCAGAATCCTTCTCAATCTTCATCTCAATCCCGCACAGTAACGGCGACTCAAACCTCAGGGAGAGAACCGTGACCACCCATGAAAAATTAATCGCCCTGCTCGACCAGCACCAGGCGCGCTATCGCCTGATGGAGCATGAGGCGACCGGTAAATGTGAAGCCGTCGCGGCAATACGCGGCACCGAAATCGGTCAGGGCGCAAAAGCGCTGGTCTGCCATGTAAAAGGCAACGGCGTTAAACAGCATGTGCTGGCGGTGTTACCGGCCGACAGGCAGGCCGATCTGGGCAAAGTCGCCGCGGCGGTCGGTGGACGACGCGCCTCACTCGCCAGCCCGGCGGAGACCGATCTGCTGACCGGCTGCGTATTCGGAGCGATCCCACCGTTCAGCTTTCATCCCGACCTCCGACTGATTGTGGACCCCGCGCTGTTTGAACGTTATCCGGAGATCGCATTTAACGCCGGTCTGCTGGAGCGCTCGGTGATCCTGAACACCGAAGACTACCGGGCGCTGTGCAAGGCTGACGTGATAGAGATAACGCAGTAATTGCCATCCCGGCAGCCCGGCGATACTCTGCATAATCACTCATCACGACAGAATAAGAGCAAAATCATGAACATTTTCACTCAACACCTGCGCCAGACGCTGGCTGTGGCGGTTATGGCAGGCTGCGCGTTTAGCGCTCAGGCAAAAATTGAACAGGTGCGTTTTGCAGTCGATCCGACCTATCCGCCGTTTGAATCAAAAACGCCACAGGGCAAGCTGGTCGGCTTTGATATCGATCTGGGCAATGCGCTCTGCACGCAGATGCAGGCGAAATGTGTCTGGGTTGAAAGCCAGTTTGACGGCATGATCCCGGCGCTGAAAGCACGTAAGTTTGATGCCATTCTGTCTGACATGGGCATTACTGAAGAGCGCCTGAAGCAGATCGACTTTACCGTGCCACTGTATGACACCCACACCCAGCTGATTGCGCGTAAAGGCGCCGGCATCCTGCCCACCGTCGAGTCTCTGAAAGGGAAAACGGTCGGCGTGGAACAGGGAACGGTTCAGGAGCGTTATGCGCTGGCGAAATGGCAGCCCTACGGTGTGACCGTGGTGCCTTATGGCGATCAGGCACAGGTTGAAAGCGATCTGGTTTCAGGCCGACTGGATGCCGTCTTTACCGACGCGGCACAGGCGGCGATTGGCTTCCTGAAGCATCCGCAGGGCAAAGATTTTGAACTGGCTGGCCCGATTATTCAGGATCCGATTATCGGCCCCGGCACGGCAATTGGACTTCGTAAAGGCGATACCGAACTGAAAACGGCGCTGGATAACGCCTTTGCCGAAATTAAAAAGAATGGCACCTTCGATCAGATTCAGAAGCGCTACTTCTCCACGGATATCTCTATCCAGCAGTAACCTCGCGGTCGCCCTTTTTCGTCCACACCGCGATCCGGCACTATACTCTAACTTTTTCGGGACGGCGCGCTGTCGTCCTTTGCCGATTAAAGGACATTAACACGGTATGCATCAACAACATCATCCACTGTTAAGCGCCTCTTTAGGCACCCAGCGTGAAATTATCAGTTTCCATTTTGCTGAGGACAATGAACGTCAGGTTTATATCCAGGCTGCGCTGCATGGCGACGAACTGCCCGGTATGGCGGTGGCCTGGTATTTGAAGCAGCGGCTGCAGGCGCTGGAGTCCGCCGGACAGCTTAAAGCTGCCTTTACCCTGGTGCCGGTGGCTAATCCGCTGGCACTGGGACAGCACTGGCACGGCACCCATCTCGGCCGCTTTCATACCCTCTCCGGTCAGGACTTCAACCGCCGTTTTCCGTCACTGGGCAATACTCTGGCGGCCGGGCTAGCTGAGAGTCTGACCCAAAGTGAATCACAGAACAGGAACCTTATTCGCGATGCGATCGATCGTCATTATCGCGATACCGTGCCAAAGACCGAGCTGGATGCGCAGCGCTATACGCTGATGCGGATGGCCAGTCAGGCCGATCTGATGATTGACCTGCACTGTGACTGGGAAGCCGTGCCGCATCTCTATACCACGCCGCATGCATGGCCGGATATTGAACCGCTGGCACGCTGGTTAGGCAGTGAAGTGCAGTTGCTGGCCCAGATCTCTGGCGGTGAGCCGTTTGATGAAGCCTGTTGCGAACCCTGGCTGACGCTGGCAGAGCGGTTCGGCAAAGATTACCCGATGCCTCGTGGCTTATTGCCGGTGACGCTGGAGTTGCGTGGCATGCGTGATGTGTCGCCTGAACAGGCGGAGAAAGATGCAGATGCGATTATTTCTGCGTTGCAGGAAGGCGGTTATATCGGCTACAGCGAGCCTTCGATTTCCGTGGAAGTGGCCGCGCCCGTCATTGGCGGCGATGAGCTGGTCCCGGCGAGCGCGGGCGGTGACGATATCGTGGCCATCAATGACGGTCCGATTGGTTTAAACGTGCCTGACGTGGCTGAGGCACCAGGAGAAGCAGGTGTAATTAAAGAGCGTCACAGCGAGGGATCGGCTGCGTCCTCCCCTGCCCTGAAAAACCCCGCTACGCCGTTATCAGGCTGTGAATACATTCATTCACCGGTGTCAGGGCTGATCCTGCATCGTAAACCGCTCGGTGCGCAGATTCGCCCGGGAGAAGTGGTGGCTGAGATTATTGATCCGATTACTGACCACATCACCCCACTGGTGGCGGAGTATGGCGGTATTCTCTATGCAAGGCACTGGGTGAGGTTTGCGACGGCAGGCATGCTGGTTGTGCGGCTGGCAGGAGAGCGGGAGATTCGGAGCGGGGATTTGCTGGTAGGGTAAAAAAAGGGCTTCCCGAGGGAAGCCCTTTTTAGTGATTAATCTAACCACTCGGTGTGGAACACACCGTCTTTATCAATACGCTTATACGTATGCGCACCGAAGTAGTCACGCTGCGCCTGAATCAGGTTGGCAGGCAGCACTTCTGAACGGTAGCTGTCGTAATAAGCGATCGCCGCAGAGAAGGTTGGCGTTGGGATACCGTTCTGAATTGCGTAAGAAACGACATCACGCAGCGCCTGCTGATACTGATCCGCGATATCTTTGAAGTACGGAGCCAGCAACAGGTTAGCGATGCCTGCATCGGCTTCATAAGCATCGGTGATCTTCTGCAGGAACTGGGCACGGATGATGCAGCCAGCGCGGAAGATCTTAGCGATTTCACCGTAGTGCAGATCCCAGTTGTTCTCTTCTGACGCTGCACGCAGCTGTGAGAAGCCCTGCGCATAAGAGACGATTTTACCCAGGTACAGCGCGCGACGGACTTTCTCGATGAACTCTGCTTTGTCACCAGTGAAAGCTTTAGCCTGTGGACCGCTCAGTACTTTAGAGGCCGCAACACGCTGCGTTTTCAGTGAGGACAGGTAACGAGCGAAGACAGACTCAGTGATCAGTGACAGCGGTTCACCCAGATCCAGCGAGCTCTGGCTGGTCCATTTACCGGTGCCTTTGTTAGCCGCTTCATCCAGAATCACATCAACCAGGTATTTACCGTCTTCGTCTTTCTTGGTGAAGATATCTTTGGTGATGTCGATCAGGTAGCTGCTCAGCTCGCCGTTGTTCCAGTCGGTGAAGGTTTCAGCCAGCTCTTCGTTGTTCAGGCCCAGCGCGCCTTTCAGCAGAGCATAGGCTTCTGCGATCAGCTGCATGTCGCCATATTCGATGCCGTTGTGAACCATCTTCACATAGTGGCCCGCGCCGTCCGGACCGATATAGGCCACACAAGCTTCGCCATCTTCCGCACGAGCAGCAATCTTATCCAGAATCGGTGCAACCAGCTCGTAAGCTTCTTTCTGACCGCCAGGCATGATAGATGGGCCTTTCAGCGCGCCTTCTTCACCACCGGAAACGCCGGTACCGATGAAGTTGAAGCCCTGGTCAGACAGTTCTTTGTTACGACGGATGGTATCTTTGTAGAAGGTGTTACCGCCATCGATCAGGATGTCGCCTTTATCCAGGTGTGGTGTCAGTGAGGCGATAGTCTTGTCAGTCGCTTCACCCGCCTGAACCATCAGCAGGATACGACGAGGTTTTTCCAGCGATTCAACAAACTCTTCAACGGTGTAAAAAGGAGCAAGCTTCTTGCCCTGGTTCTCGGCGATGACTTCGTCAGTCTTTTCGCGTGAACGGTTGAAGATTGAAACAGTGTAACCGCGGCTCTCAATGTTAAGAGCCAGGTTGCGGCCCATCACTGCCATACCTACAACGCCGATCTGTTGCTTGGACATTACATACTCCTGTCTGAAGGTCACTTCACAAACGCAGCGGTTTGTGAACAAAAGTGGCGAACATGTTAACCCAGGTAAGCATTAACCCGTTAGAGAATGATGCTATCAGATAGCACTGACAACCATAGGCATATACATGCTCATTGATAAACTAATACAAATAAGCATTAAGTTTTACAATATGACCTTTATTATTCCAGTCGATGATATCAACGCCGCGTAGCAACTTATCATCAAGCTTAAGTTCAAATTCAATGACTGATCTGTTGTGCTCAGCCAGCACAGAAAGCGCTTTAAATTCAAGCAACTTAGTATCGTTGAAGAGTTGTACCAACATGCTACGGATGTTTTCCTGACCAACCTTATGATTTCCAGGATCGGTCAGCGTCGCATTACGGTCGAACATTTCGACCACTTCATGTAAAGAGCGTGAATTAAAAGCAGCGATATATTTTTCTGTCAATTTCTTCACAGGCGGCAAAGCATTAGCATGTGATGATTCAAAGGCTTTTACAGCATGTGCATCATAGAAATTGTGATACACCGAATCACCAAGAGGACGCATTACTATCTTTTTACCCTTCAATACCAATTCATTTAATGAAGAAGAGAGATAAAAATTATTATCGATAGAGTTATCTTTACGAATAAGCTCTTTAGCGGCTTCAATGAAATCAGAGCCTCGCTTGAAGTAGTAAAGCCCCGCAACAGCATTACGACTAATAGCATTTTTTTCTGCAGTCTGAACGACCAGACCATCTTCATTAGTCTTAACGAAAGACCATTTTGGATGGACAGATTCAAAGGTGAGGACACCAGCATCAGCCTCCAGACTTCTGAAATACTCAAGTACCTGCTGCAGATCATCATTAATGTAATGGTCGGCAGAAGAAATAATCAATTCATCATCGAGATTAATTTCGTCAATAGCCATCATGCTGCTGCAAACAGCGCCCTTGGTCATTCCCTGCAATGGAATGATAACCGCCTGAGGCGAGATCAGTTCAATAATGTTACCCAGCGCCAGCTTACTAAGCTTATCCGCAGGAACAACAAAAAGACTTTTATACTCTTCCGTCAACGTAGAGAAGATATCCTGAGAATACTCCAGCAGGGTACGGTTGTTAATTTCCGTCAGGATTTTTGGATAGATGAAGTCAGCACTGGTTTCATAAAGACTGGCTCCACTCATTGGAATAATAATATTTAACATGCCATTAAACCTCGCTCTCTACCAAAGAGATAAACCGTTTAATATTCGCATAATTAACATCATGCACGGTTTCAACTTCTAATACATGTGCCCCACTGGCTCTGGCCGCTTTGATTCCATTTTCGTTATCTTCAACAATGACACAGTCAGCAGGTTTCAGACCCAGCATTGAAATAGCTTTATTGTACATTTCCGGATCGGGTTTGCCTTTAACAACATCCTGGTTAGAAACGTAAAACTCCAGATATTGTGCAAGATGTGCACGTTCCATCATAACTTCGATCGAATGCTTGATAGAGTTTGAAGCGACGGCCATTTTATAACCATCTTTTTTCAACATGGACAGGGCATATTCATGATGGAACATGGGCTTACATTTGTTATTGACAATTTCCATGGTGTACTGCTGCTTCATTTCATTAATGAAACTATGGAGATTGGTTGATAAGCCTTTATCTTTACTTAACATGATAAGTTTATCTTTAGTAGGCAAACCATCATATGTGGTTAAATGCTCCTGTCGGGAGATGGTCAGTCCGAACAGATCAAGAGCAGTATTCAATGCCTCATAATGCCAATCTTTAGCATCGATAAGAACGCCGTCCATATCGAAAATAACAGCTTTAATATTAGTAGCCATTAAAAAACTCCATACATTCTTCAGGGTAATCGGTACAAATGATTAAATCAGATGAGGAAAGTAGTTTATCATTGAATTTTTTATATTTCTCCCACATTTCAAGGTGGTTCCTTTTATGTAACTCAGGAGAAACAATACAAACTTTTTTTCCATCGGAAAGGATTTTATGCACTAACTCTTCTTTAACAAGATCTTTTTCAAAGCCATCTAACCATACACCTTCAGAAATATCATAAAGTGCATTAAGGCTTTCGAATTCACTATACCTGACAAAAGCTTTTAAACCGTGCTTATAGTAAGCAAGAGAATCAGGCAACGCCATATCGAAAAAAAAGTAATTAGTAATGAGCATTTTCTCAAGAGTGTCTTTGACTTTCTCCTGAAGACCATCGGCTTTGACGTTCAACGCCAATGGGGGGTTAGTGCTGGTGTGCGTCTTATACAATTCAAGCATATGCTTGAATGTCATACATGATGCATCAGGTATATCATGACTGATTACGATTTCACCATTGAAGTCACGTAAGTCAGTTTCTGTGCCAAGATCAAGCTGAAACGACCGAATGAAAGCTGCTTCAGTGTTTTTTTCTTGTAAAGACTTCCAGTAGCCACGATGAGAAATAACTATCATTTTTAACCTTCTTTACTTAATTAACTGATTGTAAATTTCAACAGCCTGAGTAGTCTCACCCTGATAAATGAGTTGTCCTTTTTCCATAACGAGGCATGCGTCACAGATATTTTCAACCTGATTAACATTGTGACTAACAAAGAAAATAGTTTTCCCACTCTCTTTGACTTCATTTATTTTATTCAGACATTTTGCCTGGAAATTCGCATCACCAACAGCTAAAGCTTCATCGATGACCAGGATTTCAGAATCCAGATGTACGCCAATAGAAAAGGCCAGTCTGACGTACATACCTGATGAATAATATTTCACCGGCTCATAAATGAAATCTCTGACTTCAGCAAATTCAATGATACTTTCAAGTTTTGTTTCTATTTCCTTACGGCTCATACCTAATATTGAACCGCTAAGAAAAATATTCTCGTAACCTGAGAGCTCATGATGAAATCCTGCACCGACTTCCAGCAGACAGGAAATAGAACCATCGACAGTTATTTTTCCGGAGCTGGGTGCAATGATCCGGGTCAAAAGTTTAAGCAGGGTAGACTTGCCAGCTCCATTTCGCCCCAGAACACCAACAGCCTGTCCAGGCGTAATAGAAAAGGAGACATCGTTTACAGCGCGGTAAATTTTTCCGGGTTGAGAACGCTTTTTGTTAAACGAAAAAAGCTCTTTAAGAGTACTATGCTGGTAGTTTCCAACATTGAATTCACGTGTAACAGATTCAAAGGTAATCATTCGTAATCCGCAATTAATCTATCATTTTTGATAAAGTAGTACACACCAAAGATGACTAAAACGATGGCAGTGATAAAGCTATACAATAATGTAGTAACATTAATGCTATTACCACCCAGCAAAGCCCAGCGGGATAGAGCAACAATACCCGTCATCGGGTTGAAGGAATAATATTGATGGAATTCCTTCGGTACTATTGTAAGTGTATAAACAACCGGAGAAGCAAAAAACAGAGCCTGCGTCGCTAAGGGAAGCACGTGGCGTAAATCACGGAAACGTAATTTTAAAATTGCCATGAAACACCCTACGCCACACCCGAGCATAAGTGTAATAACTATACAAGGAATAAGGAAAAGGAGGCGTGAAAAATCAAAATGTACACCATAGAAAATAGCGAGCGGAATAAAAACAATAAAAGCTATGACAAAATCAATAAAAGCCACAAAAACACTCGCTATGCATAGCGAAATTCTGGGGAAATAGACTTTTGTCAGAAGATGCATATTATTTATTATCGCATCACTCGCAGCCATCATGCTGGTAGCAAACAGGTTCCAGAAAAGGATACCGGAGATTAAAACAGCAGCATAAGGTGCATTATATTCTGGCGTTGGGACTTTAACCAGCAATCCAAACACGAACAGGTACAAAAGTATATTTACGCCTGGATTTATAATTGCCCACAAAATCCCTAAGCTAGTCTGTTTATAGCGAACATTAAAATCTCTTCGGATAAACTCCTTAATCAAATTCCTGGAGTCTAATACTTCCGCTAAAGTCGTCCTGCTTTGCGGAGTAGATACAAAACCTTTCATTAATTTCCTCTCTTTACAAATAAGCGATAAGCAATTTTACGCACATAAGGCAACTTAAGCAGTTGAGTCCTGAAAAAGAAATAGACCTTCATCAGCGTTATTTGCAGCAATCTTAAATGCCAGATTAATGGACGAGAACAATTCTGTTTTCTGATATATCTGGAATACAAATCAACCCAGTCGAAGGGTTGATAGAGACTGGACCAGCCTTTATTGAAGTAGACCTCTTTGTGTTTACGCCAGACAAATCCCAGCCTTGCTGAATCTAATGGCAGGAAATTATTAACGTAGATTTTCTCAGAGCGTTCCACCTCAGCCGCATTTCTTGAGAAATTTCCAGAAAACTCAAACCCTTTAATTGAACGTTCTAAATTTTTCACCCAGACATACTGTTCTGGTACAAAACGCATATAGCATGCATTCATCATGCTTTTGCAGTGCTGAAAAATGCTTTCATCAGCAAGTGGAATGTCGAAGAAACTTAAAACATCGTCTGTCAAACCAGCCACCATAATATCACCGGGGTGATAAAGAAAAGGAAGCTGGCTATAGGGATTACGAGCGAATAAGTTACAGTTGATTAGATGGCGACTAAAAACTGTATACTCATTCTGGCGTCGAATATCCTCTACAGACAAGAATTTATTTCCCAGGAAATAGTCACTCAGGCATGTAAGAATATTGTCATTATATAAGTAAGAATCTGTCCTCATTTTTATCGAATACTTTTTTGAAGCGCATCTCAAACCATTAAGGCTGGATACCAGCATTCGATTGACATTACTGACAACGTTCACGCCATCAACATTTTTAGATATCGATCCTACATCCTCATTGAAAAGAAATTTGATGCCAGGATAGGAAATCCTGAGGTTATCTTCAATGACATCATCTGCCTTCCATGTTGACACAATAATTTCTGCTTTCGGGAAATTATCTAATGTCAGGTTGAGATTAAAAATAAAGTCATTATCTAATTCGCGTGAGGAATTTAAAATCCCTCCCTGAATGATGATGGAAACTTCACTCTGTGAGATCAGATTATCAGGATGCATTGTCCACGTTACTTAGATTTCATGTCTTGATTCATTGTGAGGTTAATTCATACCATTCAATGTATTTTTTTAAACCATTCTCAAGACTTGTTTGAGGTGACCAGTTAAACTCTCGTTTAGCACGCGTAATATCCAGAGATATAGAATTAACAACTGATATATCGCTTTTCACTTTAACAGTATCGACAACAATACCATGCCGGTTGAAGCTCTCGATAATATCTAAAATAGAATATTTCGAATCCGAACCGACATTGAAGATTCTAACATCGCCTTCATAAGAAGCAGACTTTATAAAGCAATCATGAAGATCATCAATATAAATGTAATCGCGTGAAGCTGAACCATCTCCATAGATCTTTACAGGAGTATTCTTTTTATAGGAATCCATGATTGCAGGAATAAGCCCCTGCCCACCTTTGAATACCTGCCCGGGTCCATAGGGATTTGCAATCCGTAGCACCCGATAGTTCAACCCATAGTTACAGTGATAAACCTGAAGATATGATTCCTGAGCAAGTTTATTTGCTGCATAAATAGTTTTGGGCGAAAGAGCATGATCCTCACGACATTTCTCGCCACCAGTATCTCCATAAATTGTCCCACCGGATGAAGAGAAAATAAAAGTAACCTTACTGGCTACTAATTTTATATCTTCCAGGAAGCTGATAAATGGCAGGAGATTAGAGTTAACACTCTCAAGAGCACAACTCTGCGACTCTGCCGGGGAGAACCTGTTTGCAAGATAATAGACAACAGGATTATCATCCTGAGCAATTTTGCTAACAAAATCATCGTGCTGACATAGCGTTGCATCAATAAAGTCAGGCGTAAAGCCGTTAACTTCAGTGTTGCGGCTATAAAGAAACACCTGTTTCTTAAAACCCATTCTGCTCAAATAATTTAACAGAGACTGGCCAATGTAACCACGCCAGCCCACAATATATAGATTACTTGATAGCATAATAAAATTTTCTACCTATCGTAACGAGTGTATGGTGAAAAATAGGTGGAATAAACTTTAAAGAAATATCAAATATTTTACGCCCAACTCTTAACAGAAAATTGCGATAGCTAAAGAAGAGTGGATTGAAATGATACATTCCTGTTTTGAAATAAATGTTTGAGAGCCTGATAAATATGCTCTCTTTAGTTTTGGAGGGAACAGCATTAAGATCGCTTGCTTTAAAGTTATAAGCATCTTCTTCTGGGCCATCCCTGAAAGGAGCCATCTTAAAAGTATACGGCAGGACAGGAATCCATCCTTTCAGAAACTTCTTCCACTTCAAAGATACATCCCAGCTATTAATAATAAAAACGTGTGCTTTAAGCTTTTCAAGACTATCTTCAGCGGACTCTTTGATCACGCCCCCTCTCTTTTTTACAAAGTTTGTAAAAAGGCGAGATTCCGGCCAGTAATTATAGAAATCCTTCACATGGTTGTTTTTAACCAGTTCTCTGATAAGGAGAATATTGTCCATATGCGTTTCTCTGAAGAACTCAGCAGGGTAGTAATCTCTTAATACCTTGTAGTCACCAGCAAGCAACATATCGGATGGATGGTACCAGCGGTCTGCACGTGTATAGCAGTTAGTCGTGATCAACTTATCCTTAACATCAGATGAGAAGGTCTTCACCACTCTCATCAAGGACGTAAAGCTAACTGTCTGGTCCATCCTGACTTTGATGACAAACGCGCTCTCGTCTATAACGTCAAGTGCCGCATTAACCAGGTTAATCTGACGATTTATATTAAAAAAGCCGGGGTTAAAAACATCATCACAGCCTAATACCATAATGTTTTCACGAGTATTAAGATTTAGTTTTTTTAGAAGAACCAGAACTTTTTCTGCATCTTTTTTATATGTCGATATGATAACTTTCACATCGCCAATTTGTTTGTGAATTGCATTTAAGTTTTTTTCTAAAAAATCCTCTTCAAGAGGACCATGGATAATCACAGAAATCTTATTAACCATTATATTTATATCTCAACAGGTCAAGTTCATACATTAAGTCAATGCCAATACCCTTTTCAACCCGCGATGCGTCCTCTCTGAAGAAGACGACATCTTTGCAGGAGAAAAGTATTTCTGCAATCAATCTGGACGCTTTGACATGCTTACCGATAGAGACATCAGTGACATTGATGGTAATTTGTGAAAAGGCTTTTCTGGATTTATTGATATCCAGTACAGTCCCATAAGGATGATGATATGAGAACGCAGAAATAACATTTTTCTCGGGGAAATAATTTTTTTCCAGAATCACATCATCGAAATTATCAGAAGGCAAAATAATCATATATTCTGCTGCATTCACTAGTTGATAAGCCTCGTTGCTATCAATATGAATCTTTTTATCATCTAATAACAGGTATTCGAGTCGCGCATCATCTCTCATAGTAACGAAGAGGCATTTTTCGCGCAGAGGCCGGACAGGAGTCGATTTGATTTCAATATTTTCGTACTGGCTCACTCTGTTGATAATAGCACTCTCAAAGCTTACCAGTGTTTTTAGTGATTCACGCTGAACTTTTCTGATCTCTTCTTTTTCGTCACGACTCATACTGCTAAAACCTGTCAGCATCGCCTTGATAGAATCGACAGAGTTTTTCTCGAAGCAGAGTTTTTCAGCAGCGGGTCCGCTCATATCTGCGATGCCGCTAGTTTTACTTCCTGCCACAATCAGACCATAGTTGATGGCTTCTAACGCAGTGTTCGGGAAGTTTTCATAAGGCGAGGGAATGACGAACAGATCATAATTTTTATATTGCTTTTGCAATTCCTTATAAGGAATAAAATCATAAAAACTAAAGTTCTTTTTCAGTTCCAGTGGAATAAGCCCATAGCATGTATCCATATAATCACGGCGATCATAAAAATCATCTCCGCTGTTACCGATTAATGTCACATGCAGGTCATAACCCTCAACCATTAATTCGCAGGCCGCAAGTATCAGCAATTCCTGTTTTTTACGGAGCTCAAACCGACCCAGAGAAAGAATCTCAATAGTGCTATTATTAGCCCTGACTGGCGGGATAGCCTGTTCAGCTAGCGGATAATAAGATGGGCAGGTATGAACTTTGGTATCGTGTAAGACACTGAGATAATTGGCAAGGAAATGAGATGTTGAGAAATTAGCATCACTCAGAATTGATTGTGCTCTCTCATACTGATACATCTCAATCATCCACGGTGGTGCACACTCCATAAAATTAAGCGCTGTACCCCACTCCCAGACCTCACGAATGCCGGTATGATTATTCGTGATAAAACGACATGAATAATCCAGTCCCTCAAGCGCGCGCTTTGCAAGTGTGTAGCTTGCAAGACCGATGAAGTCGGTGACTTCAACAAAGTCAGGTTTTATCTGCGAAAGCACTTCGTCGACTGCTTTGGAATTTTTGAAAATGCTGAAGTTATCTTTTATTTTCCAGCATTTGATCGCTGGATCATCCAGATCGCTGCCATGCAAAAAAATCGTATGGACTTCGTATCCTAACTTACTGAATCCTTTTGCTGTATGTTCTATATAGGTAGCAATACCACCTTGTCCTTTGTAATAGCCACACTCATGACCAATAAGGCATATTTTTTTCTTGGCATTTGTCATACTTAAAAGTCCAGCGCTTTCTTCGAAACATCTAATGATAAAAATAATTTCAAATCAGAAGGAATACCTAAACCATACATTCCGTCTGCTTCAGCACCAATGTTATAGACACCAATGTTTGCATCTTCTTCACGATACATCCAGGTGTATACGGGTGCCACATAAAATTCACCGTTGGTACGCTCATTCAGTTCAATCATTTTATTGGCGTTACGACAAAAATCAGCGCCATGTTTGAAATTATAGATACCTACCGTTGCTTCTTCGGATACCACTTCCTTTTCAACAACACGAATAACCTTACCATCACTATTCAACTCAGCGTAAGACCATTTAGGATCGTTAGCTGACATAGTCATAATCAAACCTGACAGATTTTCATCATCCATATATTTCAGATAATCGTTAATATCGATATCAATCCACTGGTCAGAATTGGCAATCATTAACGGCTCATCATTATCAATAAACTCTTTCGCACATAATACAGTGCAGGCAGCCCCATCAGTAATGCCATCAATACCAACAATTTCACACCCAGGCGCCCATTCGTTGAGCTTTGTTGATAAATCATATTTCTCAATATGATCCTGCTGGCAAATAAAAATAAACTTGTGAGGCTGATTAGGTTTAAGATTATTAATAACAACTTCGATCATGCGTTTGTCGTTGAGGGGGATCAAAGGCTTAGGATCAGTATAACCTTCTTTAGCGAAACGACTGCCTCGACCTGCCATAGGAACAATGATGTTTAACATTTCATTCTGCCTTAAGATATTTATCATTTAATGCGCCGGGAACTTTTACCACTGTTGTCACAGTGTCTTCGAGTGCTTCGAAATTTGTGTACTCACCGGGAGAGATTTTAACGACCTCTCCTGCACTGATGAGATTACCGTTCATTAAAGCTTTACCACTGACAATCACAGTGATCTCTGTAGCGATTTTGTGGCAATGACTGGCTTCTTTATCACCGGCTTTGAAGTGCTGTACAGCGACTTCTACATCAGTTGTTTTATAAAGAGTAGGCTCAAAACCGCCAACAAACCAACCACGCACAAACTGGTCTACCTTTTTAATTTCCATAAGATTAACCCTAAAAATACCTATCCATAAAGCGGTGATTTCTTAACGAGGGATAAGTCGAGTACACAGGAAAATATGATGAAAAAGAGCACGCTACCGCCCTTAGCTATAGCTACTAAATGCTCATCAAAACTTCATCAATGCAGATCAAAAAACTTAACCTTCTGATATGTTTACTTTTTCCAATTTTTCCCGGTGAAAAAATGACAATCCAGGCACTACATGTCTCTGGAAAAAACCCCTTAATTCTAACACGGTAGCACCGTTCTTCGAAATATGGTGGGGTGATTGTGACGCGATCTTCTGAATAAATGTTGATGTATGGTATTGAACTCTTAATAAAATAAACAATGTTATTATTAACATTACACTAATACCGTTATAATAAATAAGCATGCTAACTATTCCTGAATCGTCCATATTGAGATGTGCTTAACTGGTCAAGGCTGGATTAACTCGTCATAATGATGAGTTATTGACATCAGTTTGGTACGGCATGAGATAATACAGCCGTGCAACAAAATGCTAATCTTCGGGAAAAGTATGAAAATTTTAGTTACTGGCGGTGCAGGTTTTATCGGGTCAGCTGTAGTGAGACTTCTCGTTGCTGATCAGGATAATGAAGTATTGGTTGTCGATGCTTTAACTTACGCGGGTAATCTTGATTCACTCAAAACGATTGAAAATTCCCCCGGATTTTCTTTCGTTGAAGCTGATATCTGCGATTATGATGCAATGAAAGCAGTGATAAGCTCTTTCAAGCCTAATGCAATCATGCACTTAGCTGCAGAAAGTCATGTTGATCGCTCGATTGATGGACCAGGCGCATTCATCCAGACCAATATCATTGGCACCTATAACCTGCTCGAAGCTTCACGCCTGTATTACAACCAGTTGAGTGATACCGATAAAGCGGGCTTCCGTTTTCATCACATTTCAACGGATGAAGTGTATGGTGATTTACACAACACTTCTGATTTGTTCACTGAGACCACGCCCTACTCGCCTAGCAGCCCCTATTCTGCATCCAAAGCGAGCAGTGACCATCTGGTGAGAGCATGGCATCGCACCTATAACCTTCCGGTTCTGGTGACAAACTGCTCTAACAATTATGGTCCTTATCATTTTCCGGAAAAATTGATCCCGCTGACTATTCTCAATGCTCTGCATGGTAAACCACTGCCGGTATACGGTAATGGCAGCCAGATTCGTGACTGGTTATATGTAGAAGATCACGCCAGAGCACTTCAGGAAGTCGTTAAAAAAGGCGTCATTGGTCAGACTTATAATATTGGCGGCCACAATGAGCAGACAAACCTGACGGTTGTCCAGACTATCTGTGACATTCTTGATGAGCTTTCCCCATCAAACCTTCCGGATGTCGGCAGCTACCGTGAATTGATCACGTTCGTTAAGGATCGTCCGGGCCATGATCTCCGCTATGCTATTGATGCCTCTAAGATTGAGCGTGAATTAGGCTGGGTTCCACAGGAGTCTTTTGCAACGGGCTTGCGCAAGACTATTCAGTGGTATCTCGAAAACGAATGGTGGTGGAAACGCGTTCAGGACGGGTCTTATCAGGGTCAGCGGTTGGGTTTAGAAAATAATTCAGGTGAGGATTTATAATGAAAGGTATCGTTCTTGCCGGGGGTTCTGGCACTCGTTTATACCCAATCACTAAAGGTGTATCTAAGCAGCTGCTGCCTGTATACGATAAGCCTATGATTTACTATCCGGTTTCAGTGCTCATGCTGGCCGGTATCAGGGATATACTGATTATTACTACACCTGATGATCACGCTTCCTTTGTCAGGCTGATGGGTGACGGTAGCCAGTTCGGTATTAACCTCAGTTATGCTATTCAGGCGAGTCCCGATGGCCTTGCTCAGGCTTTCATTATCGGTGAAAAATTCATCAATAACGAAAGTTGTGCATTAGTGCTGGGTGATAATATTTTCTTTGGTCAGGGATTTGCCCCGGTTCTGGCACGCACGTCTGCGAATACTAACGGCGCTACCGTGTTTGGTTATCAGGTTAAAGATCCTGAGCGTTTTGGTGTCGTCGAGTTTGATGAGAACTTTAAAGCCTTATCACTGGAAGAAAAACCTGCCCAACCAAAATCCAACTGGGCGATTACAGGTTTATACTTCTACGATAACAACGTTACGGATTATGCGCGTACCTTAAAACCTTCGCACCGTGGTGAGCTGGAAATCACTGACCTTAACCGTCTATATTTAGAGCAAAATACGCTGAATGTGGAACTTCTGGGTCGTGGTTTTGCCTGGCTTGATACGGGTACACATGACAGCCTGATCGAAGCTTCTCAGTTCATTCATACAATTGAAAAACGGCAGGGTTTCAAAGTTGCCTGTCTTGAAGAAATCTCCTTCAAAAACGGCTGGCTTTCGAGCCAGGATGTTTATAATGAAGGTCAGAAATTATCTAAGACAGAATACGGTCAGTATCTTATGGGATTGGTAAAATAATCTTCAGGCGGGGCTTCCCGCCATAATCCCTTATAAGTGGTGATGAGTATGGTAAAAAAAGGCAAACTCAATATCGAAATTGAGTACCTGAGATTTTTTGCAATAATTGCTGTTCTTCTTGAGCATTTGCCCACCCTGTATATCTGGAGCAAGCATCAATTACTTCAAAATATCAACAAGTACGTTGAGTTCTGGCCTGGTGTTGATCTTTTCTTTGCCATTTCTGGCTATCTGATTGCGACTAATCTTCTCACTCATCTGGACAGTAAACAATCAGCTACTCAAATATTCAGACACGCGATTATTCCTTTTTTTGTTAAAAGAATATACAGGTTACTGCCAGCATCATGGTTCTGGATGTTTGTTGTCCTGTTTTTTTCAGCCTTCTATAATGTTACTCACGCATTTGGAGAATTCAGGCATAATATTATTTATTGCATCACCATTATTACATTTTCATTCAATGTTTTTTCCGGTTACCTTCTTGAGCAAGGTTACCTGCCAACTTATGGCCCTTACTGGTCGTTATCGCTGGAAGAGCAGTTCTATTTTGTAGCACCTTTCATTCTTCTTTTCATGAGAACGGGCTGGAAGGTAATCACATTCTCACTCATCGCTGTTTTCCTGCTCTATATTGTTCAGCAAGGTGACTGGAGTCCTAACTTCCGCTATCACGGCCTTATTTTTGGTGTTCTCTTAGCCATGCTGAAGCTGTCGCTGGGTAATAAAATCAAACCCATTATTTTAGGGCAATGGTATATACGCTATCTGGTGAATTTTTTCCTGCTTTTTTCGCTTTTCGCCATTCCAGCTGCATTCAAAGGTTCTGTTTACCTATTCGGCTCTTTAGCACTGACTTCTGCATTGCTGGTCTATATAGCTTCCTGGGACAGAGGCTATTTTATGGCACCCGAAAGTGAAGGGATTGTGTCTCGTACAATGAGCTGGATTGCGTCCCGATCCTACAGTATCTATCTGGCACATATGCCTGTGATCTATTTAGTACAGGAATCAACAGTACGTATCATGGTTCATTACGGTTTTGCCGCTGGGGATTCCTACCTGCGATCTCTGTGGATGACGATAATTGCTGTAGGACTGATTTTTATTTCATCTGAATTGAGCTATCGCTACCTGGAAGTGCCATTACGAAATAAAGGCCGGAAAGTAGCCCGCAAGTATGAAGTCGCCGCCTGATGATTTTAGGTATTGCCACTCGTCCCAGCAACATCCATGAATTTATGGCATAAAAAAGGCGAACCTGAGTTCGCCTTTTTTTCAACTACCGAAACCCATCCGGGTTCTTGCTCTGCCAGTTCCACGTATCACGCATCATCTCATCGATGCCGCGCGTCACGCGCCAGTCGAGCTCACTGTTCGCCAGTGACGCATCGGCCCAGAACGCAGGCAGGTCGCCGTCACGGCGTAGCTTGATTTCAAACGGAATCGCCTTGCCGGAAGCCTTCTCGAACGCTTTGATCATCTCCAGCACGGAGAAGCCTTTGCCGCCGCCCAGGTTAAAGGCGGTGTAGCCTTCCACTTTGCTCAGATGGTCCAGCGCTTTCAGGTGGCCTTCCGCCAAGTCAACCACATGGATGTAGTCACGCAGACAGGTACCGTCTGGCGTATCGTAATCGCCGCCAAACACGCCCAGCTTCTCCAGACGACCAATCGCGACCTGCGCGATGTAGGGCAGCAGGTTGTTCGGGATGCCGGTCGGGTCTTCACCGATTTCGCCAGACTCGTGTGCGCCAACCGGGTTGAAGTAACGCAGGGCAATCGCCTTAAACTTCGGTTCTGCTTTGGCGAAGTCGCGCATGACGAATTCGGTCATCAGCTTTGAGGTGCCGTAAGGGCTGGTCGTGCCACCGATTGGGGTAGTCTCAACGTAAGGAACTGGCGCATCAGCGCCATAAACGGTCGCTGATGAGCTGAAGATAAAGTTCCACACGCCGGCATTACGCATCTCTTCCAGCAGCACCACGGTGCCCGCGACGTTGTTCTCGTAGTACTCCAGCGGCATGCGGGTCGACTCACCCACCGCTTTAAGCGCTGCAAAGTGGATCACCGCTGAAATGCTGTTAGCAGCAAACAGGTCACGCAGACAGGCACGGTCACGGATATCGCCTTCAACGAAGGTCGCTTTTTTACCCGCCAGTTTCTCAACGCGGTTAATCGCTTCGCGTGAGGCGTTGCAGAGGTTATCCAGAACCACAACATCATCACCGCGCTGCAACAGCGCCAGTACCGTATGGGAGCCGATATAGCCTGCTCCGCCCGTTACTAAAATTGCCATGTGAGCTCCTTTCAATCTGCCGCGCAGGACGCGGCGATTTAATTACTTTTGTTTTGCGAGAATCGTCTGGATAGCTTCGCGGAAATCGCGACCCTGCGCGTTGTTGCGCAGTCCGTAAGAGACGAACGCCTGCATGTAACCGAGTTTACGACCGCAGTCGAAACTGTGACCGGTCAGCAGCGAAACATCAACGGTTTTATGCTTGCTCAGGCTGGCGATCGCATCTGTCAGCTGGATACGGCCCCACGCGCCTGGTTCAGTGCGCTCCAGTTCTGCCCAGATATCAGCAGAAAGGACATAGCGGCCTACCGCAGCCAGGTCAGAGCTCAGGCCGGCTGTATCTTCAGGCTTCTCAACGAAATCGGTGATGGTGCTGATGTCGCCTGGGTGATCGATCGGCTCTTCCGTGGTGATCACAGAGTACTCTGAGAGATCGGACGCCGGCATATGCTGCGCCAGAACCTGACTGTGGCCGGTCTCTTCAAAACGGGCAACCATCGCCGCGAGGTTATAACGCATGTGGTCAGCGGTAGAGTCATCCAGCAGGACGTCCGGCAGCACCACCACGAACGGGTTATCACCGATCATCGGGCGGGCACAGAGAATGGAGTGACCCAGGCCCAGCGGCTGCGCCTGGCGCACGTTCATGATGGTGACGCCTGGCGGGCAGATAGACTGCACTTCGCTCAGCAGCTGACGCTTCACGCGCGCTTCAAGCAGCGCTTCCAGTTCATAGGTGGTGTCGAAGTGGTTCTCAACGGCATTTTTCGATGCATGCGTGACCAGCACAATCTCTTTAATACCTGCGGCAACACACTCGTCAATGATGTACTGGATCATCGGTTTGTCGACGACAGGCAGCATCTCTTTCGGAATGGCTTTAGTCGCAGGGAGCATATGCATACCGAGGCCCGCAACCGGGATTACTGCTTTAAGCTTGGTCAT
Protein sequences of DBSCAN-SWA_6 >CP028349|3143884:3164318|3151291_3152017_-|AVV38421.1|DBSCAN-SWA MITFESVTREFNVGNYQHSTLKELFSFNKKRSQPGKIYRAVNDVSFSITPGQAVGVLGRNGAGKSTLLKLLTRIIAPSSGKITVDGSISCLLEVGAGFHHELSGYENIFLSGSILGMSRKEIETKLESIIEFAEVRDFIYEPVKYYSSGMYVRLAFSIGVHLDSEILVIDEALAVGDANFQAKCLNKINEVKESGKTIFFVSHNVNQVENICDACLVMEKGQLIYQGETTQAVEIYNQLIK >CP028349|3143884:3164318|3160213_3161077_+|AVV38430.1|DBSCAN-SWA MKGIVLAGGSGTRLYPITKGVSKQLLPVYDKPMIYYPVSVLMLAGIRDILIITTPDDHASFVRLMGDGSQFGINLSYAIQASPDGLAQAFIIGEKFINNESCALVLGDNIFFGQGFAPVLARTSANTNGATVFGYQVKDPERFGVVEFDENFKALSLEEKPAQPKSNWAITGLYFYDNNVTDYARTLKPSHRGELEITDLNRLYLEQNTLNVELLGRGFAWLDTGTHDSLIEASQFIHTIEKRQGFKVACLEEISFKNGWLSSQDVYNEGQKLSKTEYGQYLMGLVK >CP028349|3143884:3164318|3154837_3155866_-|AVV38425.1|DBSCAN-SWA MVNKISVIIHGPLEEDFLEKNLNAIHKQIGDVKVIISTYKKDAEKVLVLLKKLNLNTRENIMVLGCDDVFNPGFFNINRQINLVNAALDVIDESAFVIKVRMDQTVSFTSLMRVVKTFSSDVKDKLITTNCYTRADRWYHPSDMLLAGDYKVLRDYYPAEFFRETHMDNILLIRELVKNNHVKDFYNYWPESRLFTNFVKKRGGVIKESAEDSLEKLKAHVFIINSWDVSLKWKKFLKGWIPVLPYTFKMAPFRDGPEEDAYNFKASDLNAVPSKTKESIFIRLSNIYFKTGMYHFNPLFFSYRNFLLRVGRKIFDISLKFIPPIFHHTLVTIGRKFYYAIK >CP028349|3143884:3164318|3155858_3157502_-|AVV38426.1|DBSCAN-SWA MTNAKKKICLIGHECGYYKGQGGIATYIEHTAKGFSKLGYEVHTIFLHGSDLDDPAIKCWKIKDNFSIFKNSKAVDEVLSQIKPDFVEVTDFIGLASYTLAKRALEGLDYSCRFITNNHTGIREVWEWGTALNFMECAPPWMIEMYQYERAQSILSDANFSTSHFLANYLSVLHDTKVHTCPSYYPLAEQAIPPVRANNSTIEILSLGRFELRKKQELLILAACELMVEGYDLHVTLIGNSGDDFYDRRDYMDTCYGLIPLELKKNFSFYDFIPYKELQKQYKNYDLFVIPSPYENFPNTALEAINYGLIVAGSKTSGIADMSGPAAEKLCFEKNSVDSIKAMLTGFSSMSRDEKEEIRKVQRESLKTLVSFESAIINRVSQYENIEIKSTPVRPLREKCLFVTMRDDARLEYLLLDDKKIHIDSNEAYQLVNAAEYMIILPSDNFDDVILEKNYFPEKNVISAFSYHHPYGTVLDINKSRKAFSQITINVTDVSIGKHVKASRLIAEILFSCKDVVFFREDASRVEKGIGIDLMYELDLLRYKYNG >CP028349|3143884:3164318|3147385_3148795_-|AVV38417.1|DBSCAN-SWA MSKQQIGVVGMAVMGRNLALNIESRGYTVSIFNRSREKTDEVIAENQGKKLAPFYTVEEFVESLEKPRRILLMVQAGEATDKTIASLTPHLDKGDILIDGGNTFYKDTIRRNKELSDQGFNFIGTGVSGGEEGALKGPSIMPGGQKEAYELVAPILDKIAARAEDGEACVAYIGPDGAGHYVKMVHNGIEYGDMQLIAEAYALLKGALGLNNEELAETFTDWNNGELSSYLIDITKDIFTKKDEDGKYLVDVILDEAANKGTGKWTSQSSLDLGEPLSLITESVFARYLSSLKTQRVAASKVLSGPQAKAFTGDKAEFIEKVRRALYLGKIVSYAQGFSQLRAASEENNWDLHYGEIAKIFRAGCIIRAQFLQKITDAYEADAGIANLLLAPYFKDIADQYQQALRDVVSYAIQNGIPTPTFSAAIAYYDSYRSEVLPANLIQAQRDYFGAHTYKRIDKDGVFHTEWLD >CP028349|3143884:3164318|3161119_3162274_+|AVV38431.1|DBSCAN-SWA MSMVKKGKLNIEIEYLRFFAIIAVLLEHLPTLYIWSKHQLLQNINKYVEFWPGVDLFFAISGYLIATNLLTHLDSKQSATQIFRHAIIPFFVKRIYRLLPASWFWMFVVLFFSAFYNVTHAFGEFRHNIIYCITIITFSFNVFSGYLLEQGYLPTYGPYWSLSLEEQFYFVAPFILLFMRTGWKVITFSLIAVFLLYIVQQGDWSPNFRYHGLIFGVLLAMLKLSLGNKIKPIILGQWYIRYLVNFFLLFSLFAIPAAFKGSVYLFGSLALTSALLVYIASWDRGYFMAPESEGIVSRTMSWIASRSYSIYLAHMPVIYLVQESTVRIMVHYGFAAGDSYLRSLWMTIIAVGLIFISSELSYRYLEVPLRNKGRKVARKYEVAA >CP028349|3143884:3164318|3163415_3164318_-|AVV38433.1|DBSCAN-SWA MTKLKAVIPVAGLGMHMLPATKAIPKEMLPVVDKPMIQYIIDECVAAGIKEIVLVTHASKNAVENHFDTTYELEALLEARVKRQLLSEVQSICPPGVTIMNVRQAQPLGLGHSILCARPMIGDNPFVVVLPDVLLDDSTADHMRYNLAAMVARFEETGHSQVLAQHMPASDLSEYSVITTEEPIDHPGDISTITDFVEKPEDTAGLSSDLAAVGRYVLSADIWAELERTEPGAWGRIQLTDAIASLSKHKTVDVSLLTGHSFDCGRKLGYMQAFVSYGLRNNAQGRDFREAIQTILAKQK >CP028349|3143884:3164318|3148946_3150008_-|AVV38418.1|DBSCAN-SWA MLNIIIPMSGASLYETSADFIYPKILTEINNRTLLEYSQDIFSTLTEEYKSLFVVPADKLSKLALGNIIELISPQAVIIPLQGMTKGAVCSSMMAIDEINLDDELIISSADHYINDDLQQVLEYFRSLEADAGVLTFESVHPKWSFVKTNEDGLVVQTAEKNAISRNAVAGLYYFKRGSDFIEAAKELIRKDNSIDNNFYLSSSLNELVLKGKKIVMRPLGDSVYHNFYDAHAVKAFESSHANALPPVKKLTEKYIAAFNSRSLHEVVEMFDRNATLTDPGNHKVGQENIRSMLVQLFNDTKLLEFKALSVLAEHNRSVIEFELKLDDKLLRGVDIIDWNNKGHIVKLNAYLY >CP028349|3143884:3164318|3150637_3151279_-|AVV38420.1|DBSCAN-SWA MIVISHRGYWKSLQEKNTEAAFIRSFQLDLGTETDLRDFNGEIVISHDIPDASCMTFKHMLELYKTHTSTNPPLALNVKADGLQEKVKDTLEKMLITNYFFFDMALPDSLAYYKHGLKAFVRYSEFESLNALYDISEGVWLDGFEKDLVKEELVHKILSDGKKVCIVSPELHKRNHLEMWEKYKKFNDKLLSSSDLIICTDYPEECMEFFNGY >CP028349|3143884:3164318|3146051_3147350_+|AVV38416.1|DBSCAN-SWA MHQQHHPLLSASLGTQREIISFHFAEDNERQVYIQAALHGDELPGMAVAWYLKQRLQALESAGQLKAAFTLVPVANPLALGQHWHGTHLGRFHTLSGQDFNRRFPSLGNTLAAGLAESLTQSESQNRNLIRDAIDRHYRDTVPKTELDAQRYTLMRMASQADLMIDLHCDWEAVPHLYTTPHAWPDIEPLARWLGSEVQLLAQISGGEPFDEACCEPWLTLAERFGKDYPMPRGLLPVTLELRGMRDVSPEQAEKDADAIISALQEGGYIGYSEPSISVEVAAPVIGGDELVPASAGGDDIVAINDGPIGLNVPDVAEAPGEAGVIKERHSEGSAASSPALKNPATPLSGCEYIHSPVSGLILHRKPLGAQIRPGEVVAEIIDPITDHITPLVAEYGGILYARHWVRFATAGMLVVRLAGEREIRSGDLLVG >CP028349|3143884:3164318|3143884_3144496_+|AVV38413.1|DBSCAN-SWA MLTQQQQALLDWDKTEGMMPVIVQHNVSGEVLMHGYMNPEALEKTLAEGNVTFFSRTKNRLWTKGETSGNFLQVVSITPDCDNDTLLVLANPIGPTCHLGTTSCFSPAVPEWTFLYQLEELLASRKTADPDSSYTARLYASGTKRIAQKVGEEGVETALAATVNDRDELTNEASDLVYHLMVLLQDQDLDFSAVINNLRARHK >CP028349|3143884:3164318|3153918_3154851_-|AVV38424.1|DBSCAN-SWA MLSSNLYIVGWRGYIGQSLLNYLSRMGFKKQVFLYSRNTEVNGFTPDFIDATLCQHDDFVSKIAQDDNPVVYYLANRFSPAESQSCALESVNSNLLPFISFLEDIKLVASKVTFIFSSSGGTIYGDTGGEKCREDHALSPKTIYAANKLAQESYLQVYHCNYGLNYRVLRIANPYGPGQVFKGGQGLIPAIMDSYKKNTPVKIYGDGSASRDYIYIDDLHDCFIKSASYEGDVRIFNVGSDSKYSILDIIESFNRHGIVVDTVKVKSDISVVNSISLDITRAKREFNWSPQTSLENGLKKYIEWYELTSQ >CP028349|3143884:3164318|3158261_3158588_-|AVV38428.1|DBSCAN-SWA MEIKKVDQFVRGWFVGGFEPTLYKTTDVEVAVQHFKAGDKEASHCHKIATEITVIVSGKALMNGNLISAGEVVKISPGEYTNFEALEDTVTTVVKVPGALNDKYLKAE >CP028349|3143884:3164318|3152831_3153887_-|AVV38423.1|DBSCAN-SWA MHPDNLISQSEVSIIIQGGILNSSRELDNDFIFNLNLTLDNFPKAEIIVSTWKADDVIEDNLRISYPGIKFLFNEDVGSISKNVDGVNVVSNVNRMLVSSLNGLRCASKKYSIKMRTDSYLYNDNILTCLSDYFLGNKFLSVEDIRRQNEYTVFSRHLINCNLFARNPYSQLPFLYHPGDIMVAGLTDDVLSFFDIPLADESIFQHCKSMMNACYMRFVPEQYVWVKNLERSIKGFEFSGNFSRNAAEVERSEKIYVNNFLPLDSARLGFVWRKHKEVYFNKGWSSLYQPFDWVDLYSRYIRKQNCSRPLIWHLRLLQITLMKVYFFFRTQLLKLPYVRKIAYRLFVKRGN >CP028349|3143884:3164318|3144610_3145087_+|AVV38414.1|tRNA|DBSCAN-SWA MTTHEKLIALLDQHQARYRLMEHEATGKCEAVAAIRGTEIGQGAKALVCHVKGNGVKQHVLAVLPADRQADLGKVAAAVGGRRASLASPAETDLLTGCVFGAIPPFSFHPDLRLIVDPALFERYPEIAFNAGLLERSVILNTEDYRALCKADVIEITQ >CP028349|3143884:3164318|3152013_3152832_-|AVV38422.1|DBSCAN-SWA MKGFVSTPQSRTTLAEVLDSRNLIKEFIRRDFNVRYKQTSLGILWAIINPGVNILLYLFVFGLLVKVPTPEYNAPYAAVLISGILFWNLFATSMMAASDAIINNMHLLTKVYFPRISLCIASVFVAFIDFVIAFIVFIPLAIFYGVHFDFSRLLFLIPCIVITLMLGCGVGCFMAILKLRFRDLRHVLPLATQALFFASPVVYTLTIVPKEFHQYYSFNPMTGIVALSRWALLGGNSINVTTLLYSFITAIVLVIFGVYYFIKNDRLIADYE >CP028349|3143884:3164318|3145153_3145945_+|AVV38415.1|DBSCAN-SWA MNIFTQHLRQTLAVAVMAGCAFSAQAKIEQVRFAVDPTYPPFESKTPQGKLVGFDIDLGNALCTQMQAKCVWVESQFDGMIPALKARKFDAILSDMGITEERLKQIDFTVPLYDTHTQLIARKGAGILPTVESLKGKTVGVEQGTVQERYALAKWQPYGVTVVPYGDQAQVESDLVSGRLDAVFTDAAQAAIGFLKHPQGKDFELAGPIIQDPIIGPGTAIGLRKGDTELKTALDNAFAEIKKNGTFDQIQKRYFSTDISIQQ >CP028349|3143884:3164318|3162359_3163373_-|AVV38432.1|DBSCAN-SWA MAILVTGGAGYIGSHTVLALLQRGDDVVVLDNLCNASREAINRVEKLAGKKATFVEGDIRDRACLRDLFAANSISAVIHFAALKAVGESTRMPLEYYENNVAGTVVLLEEMRNAGVWNFIFSSSATVYGADAPVPYVETTPIGGTTSPYGTSKLMTEFVMRDFAKAEPKFKAIALRYFNPVGAHESGEIGEDPTGIPNNLLPYIAQVAIGRLEKLGVFGGDYDTPDGTCLRDYIHVVDLAEGHLKALDHLSKVEGYTAFNLGGGKGFSVLEMIKAFEKASGKAIPFEIKLRRDGDLPAFWADASLANSELDWRVTRGIDEMMRDTWNWQSKNPDGFR >CP028349|3143884:3164318|3157504_3158260_-|AVV38427.1|DBSCAN-SWA MLNIIVPMAGRGSRFAKEGYTDPKPLIPLNDKRMIEVVINNLKPNQPHKFIFICQQDHIEKYDLSTKLNEWAPGCEIVGIDGITDGAACTVLCAKEFIDNDEPLMIANSDQWIDIDINDYLKYMDDENLSGLIMTMSANDPKWSYAELNSDGKVIRVVEKEVVSEEATVGIYNFKHGADFCRNANKMIELNERTNGEFYVAPVYTWMYREEDANIGVYNIGAEADGMYGLGIPSDLKLFLSLDVSKKALDF >CP028349|3143884:3164318|3150012_3150648_-|AVV38419.1|DBSCAN-SWA MATNIKAVIFDMDGVLIDAKDWHYEALNTALDLFGLTISRQEHLTTYDGLPTKDKLIMLSKDKGLSTNLHSFINEMKQQYTMEIVNNKCKPMFHHEYALSMLKKDGYKMAVASNSIKHSIEVMMERAHLAQYLEFYVSNQDVVKGKPDPEMYNKAISMLGLKPADCVIVEDNENGIKAARASGAHVLEVETVHDVNYANIKRFISLVESEV >CP028349|3143884:3164318|3159131_3160214_+|AVV38429.1|DBSCAN-SWA MKILVTGGAGFIGSAVVRLLVADQDNEVLVVDALTYAGNLDSLKTIENSPGFSFVEADICDYDAMKAVISSFKPNAIMHLAAESHVDRSIDGPGAFIQTNIIGTYNLLEASRLYYNQLSDTDKAGFRFHHISTDEVYGDLHNTSDLFTETTPYSPSSPYSASKASSDHLVRAWHRTYNLPVLVTNCSNNYGPYHFPEKLIPLTILNALHGKPLPVYGNGSQIRDWLYVEDHARALQEVVKKGVIGQTYNIGGHNEQTNLTVVQTICDILDELSPSNLPDVGSYRELITFVKDRPGHDLRYAIDASKIERELGWVPQESFATGLRKTIQWYLENEWWWKRVQDGSYQGQRLGLENNSGEDL |
21 | Synechococcus_phage(35.71%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
0 : 5332
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP028352|0:5332|DBSCAN-SWA
Protein sequences of DBSCAN-SWA_1 >CP028352|0:5332|1567_2017_+|AVV39949.1|DBSCAN-SWA MKPAIYAEGLTVLWPALFSPPVKIPLYGEYCPAGFPSPAQDYVEKELDLNELCIRRRASTFFVRASGNSMQDLGLYDGDVMVVDRAEEASHGDIVIAEVNGEFTVKRLQLRPRLALLPMNPAYAVIYPEELQLLGVVTWFFSSTRARGR >CP028352|0:5332|3423_3756_-|AVV39951.1|DBSCAN-SWA MTFTELDNALADTGRRSLASRLVFALADGLDKNQIGLNLDEFEKQSGYIRTNIRSVAAALKESGIIEIQYYDDSVPENETIVMGSVSRGRWSKQHYCLTDTVKALLKRQA >CP028352|0:5332|2026_3301_+|AVV39950.1|DBSCAN-SWA MFGLADVNSFYASCEALFRPDLRGKPVIVLSNNDGCVIARSAGAKRLGIKMGAPWFQIKHQDFSERIYAFSSNYALYHSLSQRVMTALEEMTPRVEQYSIDEMFLDLTGIDGCEDFEHFGRRLRTHVLATTGLTVGVGMGPTKTLAKSAQWASKEWPQFRGVLALTPGNIQRTETLLSKQPVEEVWGVGRRIGKRLNLMGISNALQLAHAHPALIRKNFSVVLERTVRELNGESCIPLEELPPAKQQICCSRSFGERITSKILMQQALCQYATRAAEKLRGERQFCRRISVFIRTSPHAENEIFYGNSAGEKLSLPTQDTRDIIEAAMRSLDRIWLEDRRYMKAGIMLDDFTPNGVSQLNLFDDVQPRANSAQLMKVLDGINQSGLGNVWFAGQGVSTEWKMKRDMLSPAWTTRWDDIPVARVL >CP028352|0:5332|1130_1385_+|AVV39948.1|DBSCAN-SWA MTGWQLKLWRRSLLWKRERAAAELGVSLRTYKDYENADTVKRSIAMATVTLSLINVLPSLQSQEISSESMVHLINQMISTVDEK >CP028352|0:5332|426_762_+|AVV39947.1|DBSCAN-SWA MNRDIVMLLSVRGHPCGAVTTTKTGVKRSRQGRTRKPHAPPERTKCPAHGGERVSRTAKLPLNAKRPGYREGTALYPCRFPAAGREAAKGAERSDILIVNLRDAAGRGGIG >CP028352|0:5332|4207_5332_-|AVV39953.1|DBSCAN-SWA MAASKFQKKLNALKTSGPSVSAMPEVAASVAARDTPAPVGTPDASKIRAEIARPGLEPEGQIVRVRADEIYEVEQVRPEDDFDEEVISGMVESFSEFGNLTPPRCFPKDRKGYRVWFGATRIRSMKKRGDEFIDIYVGRPPKDEKQRIMGQLIENLQQSGLKPLATALAFEQLKTDFNMTGEEIARSLGKPTAFVSKHIRIGSAPEKIKALLKNKSISDVDLAYTLIQIDNIDSGASDRLIEKHNAGSTLTRVQVKKELDRLKGRDVSKAPTKVSHAKSQSGDDVLQPDNKTEISHAKALITESGSTSSSNKKHQYQAADVPLPLIPVVKFNGTEVTVLLHKMPDEFGFIWLNTAEGELYVRASDVEFLGLRSE >CP028352|0:5332|3752_4208_-|AVV39952.1|DBSCAN-SWA MPAYSLTPFQQLSAQFRGSIVHRWAKDEDQSAAEAARTLTEHARHLNIISRELAVPGAALSEWNKRKQPPLWAALAAFDLILRRGWRPVSNEEWAGFASLILKLSPGVNLDRLSESLPADIDLNIASGWIAAAIEEDQHYRVRKKMAVKPE |
7 | Klebsiella_phage(25.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
10039 : 10792
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP028352|10039:10792|DBSCAN-SWA TTTAAGGCAGCATGCGCAGCAGTTGCAGAGACGCTTCGGTGTCCAGGCTGAAGCGAACCTGATGACTGGCAGCCACGTCCAGGGCAAACACCTTCGTATAAACTTCGGTTGATTCAAACTTCTCATGCCCCAGCAGTGACTGCAGCACCTTCGGCGGCACGTGGTGATAGAGCATGTGCATCGCGAAACTGTGACGGAACGTATGGGGAGAGATTTCAATGCTGAACCGCACGCCGGCACGGGCAGCTGCATTGATGGCGCGGTTTATCCAGTTCCTCACCGTGCGATCGGAGACGGTCCAGACCGGCAGAGCGCGGCGCTCGCCGGTATCACTGTCGGTCTCAAATTCTTCCCTGACGCTGGCGAACAGCCGGCGCATCTCATCAACGTAAGCCGGATCGGAAAGCGGCACAACACGGTTGGGACTGGTTCCTCTTCTTGGACGTCCGCTGCCGGCGCGCCGCTGCTTAGCCGTGCGGATCACCACGTGCGGGATGGCGTCGTTCAGGCGAAAGTCACGCCGCCGCAGCGCCAGGACTTCATTGAGACGTCCGCCGGTGTTCCATAAAGTATTGACCAGCGCATGCTGGGTCCAGTCCGGTATATGGTGAAGCAGGCCGGTGATCTCAGGTGCCAGTAAGTAACGGGGCAGCTCGGTGTATGCGGCGGCCATGGACCGCAGCGCGAGCGCACGACCAAAATCGGGCTGTGCAGCTGGCAGCGCGACCTGAGTATTGACGGATGGTAGCGACAT
Protein sequences of DBSCAN-SWA_2 >CP028352|10039:10792|10039_10792_-|AVV39957.1|DBSCAN-SWA MSLPSVNTQVALPAAQPDFGRALALRSMAAAYTELPRYLLAPEITGLLHHIPDWTQHALVNTLWNTGGRLNEVLALRRRDFRLNDAIPHVVIRTAKQRRAGSGRPRRGTSPNRVVPLSDPAYVDEMRRLFASVREEFETDSDTGERRALPVWTVSDRTVRNWINRAINAAARAGVRFSIEISPHTFRHSFAMHMLYHHVPPKVLQSLLGHEKFESTEVYTKVFALDVAASHQVRFSLDTEASLQLLRMLP |
1 | Macacine_betaherpesvirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
18580 : 21683
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP028352|18580:21683|DBSCAN-SWA CATGCCAAATAAAATCCTCACAGATACCTATCGTAAAGACATTGATGGCCTCAGAGCATTAGCTGTCGTTCTTGTCATTCTTTTCCATGCCGGAATTAGTCAGATAGCATCCGGATTTATAGGGGTCGATATTTTCTTTGCGATTTCTGGTTTTTTGATAACAGGAATTATTGTAAAACAAAGAGAGAAGGGTATCTTCGGCCTGGCGAACTTCTTCGTGCACCGTCTCTGGCGCATACAGCCTGCTTTTATCACGATGTCCCTGGCTACGCTTATAGCAACGTTCTATTTGTATCTGCCGGATGATTTTCTGGCATACCTGCACAGTAGTTACTACAACGCTTTACTGCTTTCAAATCAGTTCTTTGCACGTCAGAGTGCTGCGTATGCTTCTCCTGCAGCCGATCTGTTCCCTTTACTCCATACCTGGTCTCTGGCAATTGAGTGGCAGTGGTATCTTTTTCTCCCACTGGGCATTTTACTGTCAGGTGCCATCTTACGGCTTAAGTCGATTCAGAAGGTGATCAGTAACCCGGCGAATGCACTTGCTGCAGTGTGGTTTATCCTGACGCCGCTGGCAGCTGCTCTGGCGCTGATTATTGCCAGCCGAGAGGCCGATACAGCCTATTACTCACTGCTTACCCGTATATTCGAATTCATGGTGGGGGGAGCTGCATTTTTTCTGACTCGCTATATTCGCTCGGTTCCTTCAGCTGTCTCTAACTGTACAGGCATTTTATCGCTTGCCGTGTTGCTTTATATCGCGATTCACCCTGACGTTATCGGGTTCTATCCAAATCTCTATGCATTGCTGGTCGTCACTGCTTCAGCCTTACTCATGTTTACCGGAACGTACGGAAACAGCATCGCATCAAAGTTACTTAGCCTGACACCCATCGCATTCACGGGCAGGATATCTTATTCGTTATATCTCTGGCACTGGCCCGTTCTGGCCATAACTCGTTACCTGGGTTACGACCTGACTGGCTTCACGCTTCTTTTGTGCCTGACTCTGACGGTCCTGCTGTCGCTGATTAGCTATTATCTTGTGGAACAGCCATGCAGACGGTTGCGCTGGCCTCTGAAATATACGATTCCCCTGCTGATCATCGTCCCGATCATTATCTTCAACGCTGTGTTCAAATTTGCAGAGAACCGTGACGGCATGCCAGGACGATTCGGTGTTGAATATGACCGCATGGCTCACAATATCAGTGCTGGTTTAGCTCAGGCCGGACACCGTCCGGACTGTCTGGATGGATCTCAGAATGCGGACAAATGCATGTTCGGAGATCTTAAAGGTCAGAAAACAGCCCTGATGATCGGAGATTCTCACTCAAACCATTTCTGGGGGTTCTTCGATGTGCTCGCTAAGGACGCTCAAATCCAGATGACTGCGTTGAGTACGGCGTTGTGTTTAGCGCTGCCGGATACATACCTTTATGACTGGTGGTCATTCCGGAACCAGACCTTTGATAAATGCCATGAAAATACGGCTAAGTATTATGACCTGATTGCTAAAAATCATTATGACTACGTTATTCTGGGCCAGGTCTGGGAATGGTACGTAAGTGGTCCTCATGTCATAAACCATCCAGGTGACGAGCGCTCTGATGCACTGACTAAAGAACGTTTTAACGTCGCCATACGGCAGGCATTGAAAGCGATAATCGCATCAGGAGCACGGCCGGTGATAATAAGAACCGTAGCTCCTATGCCCGTTAATTATCAGTCATGTATCCGTCACCATGTCATTTTCAGAGAGCCTTATCGTCAGGATGAATGCGATAACCAAAACCCGAAAAGTCCTGAAAAAGAATGGACGCTGCCGCTGTTCGACCAGCTGCAGAAAGAATTTCCAACCCTGATTGTGATTGATCCTAAAAAAGTCCAATGCGAGAACAGCTATTGCATAACTGCACTGGACGGGACCCCGCTCTATCGTGACGTTGGCCACCTGAATGATTTTGCATCCTATAAATTCGGGCAGGAATATTTGCGGAAATTTGGAAATCCGCTGAATTAAAAAAAAGGGCCTCTCAGAGGCCCTTTTTATTATCTCATCTGAGGAGATGTGATCACTTAATAGATGGGATATCTTTCAGCCGCGTAGTGTAACGCGGTGACAACATTTCACGCTTCATCTGCCAGGCACTGTCACGTTCACCCTGACCGGCGAACCATACCTTTCCCTTACCTGAGCGGTTAATGGTGTCCAGTGCAGCCATCAGCGCATCAGCATTTGCACGCGGCTGCTGCTCGCTGAACATGTCGAATTGCGCGACCCCAGACTGATAAAAATCACCCAGCATGACGCCCGCTTTAGCGTACCGGTAGCCTTCAAGCCAGATAGTGCTGAGTCCGCGGAGCGCTGACTCAATAATGTCGCGCGTATCATTCGTCGGATAATCGCAGATGCGCGAGGCGGTATTCGAATACTGTGGCTCATCGCCATGCCGGCCCGTTGCGATGGATACACTGATATGACGGCAGCGCGAATTCTGCTCCCTAAGTTTTTCCGCCGCACGTGTGGCATACAGCACGATTGCCTGCTGCATATCTTCCAGTTTTGTGACTCTTTCGCCAAATGATCGCGAGTTTAGGATGTGCTGCTTTGGCGGCGGGGCATCTTCCAGTGCAATACAGGACTCACCATTAAGCTCGCGAGTAGTACGCTCAACGATAACGTCAAAATTTTTCCGTATCATGCTGATGTTGCTGTCAGCCAGCTGCAGAGCCGTGGTGATCCCAAGCTGGTACATGCGCTTAGTAATGCGCGGCCCAATACCCCAGATATCGCTGACGTCGGTCAGGTCCAGTAGTTTTCGCTGACGGGTTCGGCTGGACAGGTCTACCACACCCATTGTCTGGGTCCATTTCTTCGCGGCATGGTTAGCCAATTTGGCCAGCGTTTTCGTTGGGCCGAATCCTACGCCGATAATTAACCCGGTTTCCTTCCGGATGCGCTCTCGCATCTGCTGTCCGAAAGTTTCTAGAGGGATCAGGCTATCGATCCCTGTGACGTCAAGAAACGACTCATCAATGGAGTAAACCTCCTGACCTGCGGCCATTTCGCCCAGTATGGCCATCATGCGCGCTGACAT
Protein sequences of DBSCAN-SWA_3 >CP028352|18580:21683|20657_21683_-|AVV39963.1|DBSCAN-SWA MSARMMAILGEMAAGQEVYSIDESFLDVTGIDSLIPLETFGQQMRERIRKETGLIIGVGFGPTKTLAKLANHAAKKWTQTMGVVDLSSRTRQRKLLDLTDVSDIWGIGPRITKRMYQLGITTALQLADSNISMIRKNFDVIVERTTRELNGESCIALEDAPPPKQHILNSRSFGERVTKLEDMQQAIVLYATRAAEKLREQNSRCRHISVSIATGRHGDEPQYSNTASRICDYPTNDTRDIIESALRGLSTIWLEGYRYAKAGVMLGDFYQSGVAQFDMFSEQQPRANADALMAALDTINRSGKGKVWFAGQGERDSAWQMKREMLSPRYTTRLKDIPSIK >CP028352|18580:21683|18580_20605_+|AVV39962.1|DBSCAN-SWA MPNKILTDTYRKDIDGLRALAVVLVILFHAGISQIASGFIGVDIFFAISGFLITGIIVKQREKGIFGLANFFVHRLWRIQPAFITMSLATLIATFYLYLPDDFLAYLHSSYYNALLLSNQFFARQSAAYASPAADLFPLLHTWSLAIEWQWYLFLPLGILLSGAILRLKSIQKVISNPANALAAVWFILTPLAAALALIIASREADTAYYSLLTRIFEFMVGGAAFFLTRYIRSVPSAVSNCTGILSLAVLLYIAIHPDVIGFYPNLYALLVVTASALLMFTGTYGNSIASKLLSLTPIAFTGRISYSLYLWHWPVLAITRYLGYDLTGFTLLLCLTLTVLLSLISYYLVEQPCRRLRWPLKYTIPLLIIVPIIIFNAVFKFAENRDGMPGRFGVEYDRMAHNISAGLAQAGHRPDCLDGSQNADKCMFGDLKGQKTALMIGDSHSNHFWGFFDVLAKDAQIQMTALSTALCLALPDTYLYDWWSFRNQTFDKCHENTAKYYDLIAKNHYDYVILGQVWEWYVSGPHVINHPGDERSDALTKERFNVAIRQALKAIIASGARPVIIRTVAPMPVNYQSCIRHHVIFREPYRQDECDNQNPKSPEKEWTLPLFDQLQKEFPTLIVIDPKKVQCENSYCITALDGTPLYRDVGHLNDFASYKFGQEYLRKFGNPLN |
2 | Pseudomonas_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
26387 : 28805
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP028352|26387:28805|DBSCAN-SWA CATGCGCTTTACACTGGATAACGTAACCGCCCGTGAAATTCTTGACTCGCGCGGCAACCCCACCGTTGAAGTTGAAGCCCGGACTACGGACGGGATGATGGCGCGCGCCTCGGTGCCTTCCGGGGCCAGTACCGGCTCGCGCGAGGCCGCAGAGCGCCGCGACGGTGACGCACACCGTTTTGGCGGTAAAGGCGTGCTGGATGCCGTCAGGGCGGTTAATACGGAAATCTGCCAGGCCGTGCGCGGCAGCGACGTGCGCGAACAGCGTCAGCTGGACAACATCATGCTGGCGCTGGACGGCACGGAGAACAAGAGTCGCCTGGGTGCCAATGCCATTCTGGGCGTGTCGCTGGCGGTGGCCCGGCTGGCGGCACAGGCCTCCCACGAGCCGCTGTGGCGCTACCTGGGTGGCCTGCAGGCGAACCTGCTGCCCGTGCCGTGCATGAACATTATCAACGGCGGCGTGCACGCCCGCGGTCAGGGCGCGGACTTTCAGGAGTTCATGATAGCCCCGCACGGCGCACCCACGCTGCGCGAGGCCGTACGTCAGGGCAGCGAGGTCTACCAGGCGCTGCGCCAGATCCTGCTGGACAACAACCTGTCGGCGGCAGTGGGCGACGAAGGCGGTTTTGCGCCCGCCGTGTCGTCAAACCGCGACCCGCTGGCGTTTATCGTACAGGCCATTGAGAAAGCCGGATACCGGCCCGGAGAAGACATCAGCATCTGCATGGACCCTGCCTCAAGTGAGTTTTTCGCAGACGGCAAATATCATCTGCGCACCGAGGGCAGCGCGCTGAGCGCGGCGGAAATGACGGCATACTACGGCGAGCTGATGGACCAGTTTCCCGTCATCCTGCTGGAAGACGGCCTGGCCGAGGACGACTGGGCGGGCTGGAAGCATTTGCACCGGCAGCTTGCGGGCAGGGCCGAACTGGTTGGTGACGACCTGTTTGTCACCAACGTGAAGTATATCCAGCGCGGCATTGACGAAAACCTCGCCAGCGCCGCACTGATTAAACTCAATCAGATAGGCTCGCTCAGCGAAACGCTGGACGCCGTGGCGCTCTGCCAGCGGCACGGCTGGGGCGCGTTCATGTCCCACCGCAGCGGGGAAACCACGGACACGTTCCTTGCCGACCTGACGGTGGCCCTGCGCGCGGGACACCTGAAAACCGGCGCGCCGTGCCGCGGCGAACGCGTGGAAAAGTACAACCAGCTGATGCGCATTGAGCAGGCGCTGGGTGACGATGCGCACTACGCGGGGCTTCATGCGTTTGTGCGCCGCACGTAGCACATCCGCCGGACGGTCCCCGTCCGGCACAGAAGGAATGCCTGTGGCCTCTGCAACGCTGATCCTTTTGCGTCACGGCGAAAGCCAGTGGAACCGGGAAAACCGTTTTACCGGATGGACAGACGTGCCGCTGACGGTCAGGGGGCGGCAGGAGGCCGACCGGGCCGGTGACGCGTTGCGGGCAGCGGGCCTGATGCCCGACCGTGTCTTCACCTCGGTGATGACACGCTGCGTTCACACGGTCTGGCGGGTGCTGGACCGTCTTGACCGCAGCTGGACGCCCGTGGAAAAGAGCTGGCGACTGAATGAGCGCCACTACGGGGCGCTGCAGGGGCTGAACAAGGACGACGCTGCCCGGATAATGGGGGAAGAGACTGTCCGGCTCTGGCGCAAAAGCTTCAGCGGCATTCCGCCCGCGGACGCGGAGGCTCCGGCGCAGCTGCGTCGGGATCTGCGCTACCGCCGCGTTGCGCTGGCCGACCTGCCTGCAACGGAAAGCCTGGAAATGACGCTGCGCCGCGTCATGCCGTACTGGCAGCATGCTGTGGTACCGCAGTTACGCTGCGGAAGTTCCGTTCTGGTTGTGGCGCACGGCAACACGCTGCGCGCCCTGACCGCCTTCCTGGACGGCATGCCTTACGACAGCGTGGCACAGCTGCACATCCCGACCGGCGTGCCGGTCGTATATCAGATGGATGCGGCCGCCCAGGTCGTGTCCCGTGATGTTCTGAATATTAAAAAAAAGGAGTGAACGTGAATCAGCTTTTTGCCGTCATCACCGGCGGTGCTTTAGGGTGTGTTATCCGCTGGCAGCTGGGCGCGCGCCTGAATGCGCTTTTTCCGGACCTGCCGCCGGGGACGCTGCTGGTCAACCTGCTGGGTGGATTTATCATCGGGGCGGCCCTGGCGTATTTTCTGCGCCATCCCGGCCTCGATCCGGCCTGGCGGCTGCTCATTACCACCGGGCTGTGTGGAGGAATGACGACCTTCTCCACATTTTCAGCTGAGGTTTTTGCCCTGCTGCAGTCAGGCAGCTATGCCTGGGCAGCCGCCTCGGTGCTGATACACGTGCTGGGTTCGCTGGCCATGACGGCGGCCGGATTTTATATCATGACGCTGTGTGGTTAA
Protein sequences of DBSCAN-SWA_4 >CP028352|26387:28805|26387_27677_+|AVV39969.1|DBSCAN-SWA MRFTLDNVTAREILDSRGNPTVEVEARTTDGMMARASVPSGASTGSREAAERRDGDAHRFGGKGVLDAVRAVNTEICQAVRGSDVREQRQLDNIMLALDGTENKSRLGANAILGVSLAVARLAAQASHEPLWRYLGGLQANLLPVPCMNIINGGVHARGQGADFQEFMIAPHGAPTLREAVRQGSEVYQALRQILLDNNLSAAVGDEGGFAPAVSSNRDPLAFIVQAIEKAGYRPGEDISICMDPASSEFFADGKYHLRTEGSALSAAEMTAYYGELMDQFPVILLEDGLAEDDWAGWKHLHRQLAGRAELVGDDLFVTNVKYIQRGIDENLASAALIKLNQIGSLSETLDAVALCQRHGWGAFMSHRSGETTDTFLADLTVALRAGHLKTGAPCRGERVEKYNQLMRIEQALGDDAHYAGLHAFVRRT >CP028352|26387:28805|28424_28805_+|AVV39971.1|DBSCAN-SWA MNVNQLFAVITGGALGCVIRWQLGARLNALFPDLPPGTLLVNLLGGFIIGAALAYFLRHPGLDPAWRLLITTGLCGGMTTFSTFSAEVFALLQSGSYAWAAASVLIHVLGSLAMTAAGFYIMTLCG >CP028352|26387:28805|27720_28428_+|AVV39970.1|DBSCAN-SWA MASATLILLRHGESQWNRENRFTGWTDVPLTVRGRQEADRAGDALRAAGLMPDRVFTSVMTRCVHTVWRVLDRLDRSWTPVEKSWRLNERHYGALQGLNKDDAARIMGEETVRLWRKSFSGIPPADAEAPAQLRRDLRYRRVALADLPATESLEMTLRRVMPYWQHAVVPQLRCGSSVLVVAHGNTLRALTAFLDGMPYDSVAQLHIPTGVPVVYQMDAAAQVVSRDVLNIKKKE |
3 | Streptococcus_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
32906 : 34103
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP028352|32906:34103|DBSCAN-SWA CCTATTCTCCTGCCGGTCCGTTGAATCCCACTCCAGCTACATCGGTGAGGGCATAGGCAACTACTCCCCATACGGGAAGCATCTGGCTCACATCAAGCAGCGTTACAGTCTCGTCAGCATCCAGCGCCTGCAGGGCAGGAACCGGATTAAGCAGCAGGCGCCTTAATGTCAGCTCCCCGTCAAGTTCAGCCACGATAAGCTGACCGTGTGCGGGTATCAGCGCGCGATCGATGGCCAGTACGGACCCTTTAACAATACCGGCGTCGGGAAAATTACTGTCGCTGCGCATCAGATAGGTCGAGTAAGGGGAAAGATGCACAAGATCACCCAGATTAAGGCGCGTTTCAGTGTAGTTCTGGGCCGGACTCTGAAAGGCCATGCAAATACTCCATTCTTTACAGCTTTTCAGCCTTGTAAGGTGAACCAGATAACGACCAGGTGTCCGTCGCGCGGGCGCATTTAATACTCCTGATATCAGTATACGTCGCTATCCTTACATCAACAGCAGGAGGTAACTATGTGTGGAAGATTTGCGCAGTACAGCAGCCGCGATGATTATTTTGATGCACTCAGCCTGACACCGGACGAAATCACATTCGATCCGGAACCTCTCGGGCGCTTTAACGTAGCACCAGGCACTAAAGTACTCCTTCTGAACGAACAGGAAGATTCGCTGCGCCTCGATCCTGTGTACTGGGGCTACGGACCGGAATGGTGGGATAAGCAGCCACTTATCAACGCGCGCGGTGAGACGGCTGCGAGCGGCCGCATGTTTAAGCCTCTCTGGAATCATGGCCGCGCTATCGTGCCTGCTGACGGCTGGTTCGAATGGCAGAAAGAAAGTGGTGAGAAGCAGCCATTCTTCATTTTTCACAAAAAGAAGGAGCCTCTTTATTTTGCCGCAATCGGCAGGCAGCCGTACGGGCAGGATCACGGCAAAGAGGGCTTCGTGATTGTCACCTCAGCCAGCAATCAGGGCATGGTGGATATACATGACAGGCGGCCGCTGGTGATTACGCCTGATGCCGTTCGTGAGTGGCTCAGCAATGAAACGTCTCCAGCGCGTGCAGAAGAAATTGCTCATGACGCAGCCGTTCCGGAAAAAGCATTTACCTGGCATCCCGTCAGCAAAAAAGTAGGCAATATACATAACCAGGGAAAAGAACTGATTGAGGCTGTAGTGGACAGTGATGTTTAG
Protein sequences of DBSCAN-SWA_5 >CP028352|32906:34103|32906_33284_-|AVV39975.1|DBSCAN-SWA MAFQSPAQNYTETRLNLGDLVHLSPYSTYLMRSDSNFPDAGIVKGSVLAIDRALIPAHGQLIVAELDGELTLRRLLLNPVPALQALDADETVTLLDVSQMLPVWGVVAYALTDVAGVGFNGPAGE >CP028352|32906:34103|33422_34103_+|AVV39976.1|DBSCAN-SWA MCGRFAQYSSRDDYFDALSLTPDEITFDPEPLGRFNVAPGTKVLLLNEQEDSLRLDPVYWGYGPEWWDKQPLINARGETAASGRMFKPLWNHGRAIVPADGWFEWQKESGEKQPFFIFHKKKEPLYFAAIGRQPYGQDHGKEGFVIVTSASNQGMVDIHDRRPLVITPDAVREWLSNETSPARAEEIAHDAAVPEKAFTWHPVSKKVGNIHNQGKELIEAVVDSDV |
2 | Morganella_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
37537 : 38071
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >CP028352|37537:38071|DBSCAN-SWA ATCAGTATGAAGAGTGCCAGTCTGTTCCCCCTTCCCAGCGTGACTGCCAGTGCGCCAGATAAGTCTGCGCCAGTGCCGGAACTTCCCGCACGACCAGCGCATTCTCGGAATTTGATTCAGCTGCGCTGCGGGAATAGTTAAATGAGCCAAGCTCCACGTTCTGTCCGTCGGTGATGATCACTTTGTCGTGCATAATCTTGTACTGACCATTAGTGCGCAGAGGAATGCCGGCGTTAACTACCACGTTCATTGCCGCCTGGCTGGCTTTGCTGCGGTTGCCTTTTTCGTCAACAACAACCCTGACGTCCACGCCGCGGCGCTTTGCCGCAACGAGAGATTTCACCACTTCAGGGGAGGTAAAGCTGTAGCCCATCAGGCGGATGCTTTCACGGGCATCGTCCAATGTACGCAGGACCAACTGCTGCGCACTGCCTTCGGGCGAAAATCCGGCGTCAACGGAGGGTGCAGCAACGGCCGCGCCGGTGGAAAAGGTCAGCCATAATGCGGTGGCCACGATTCGCGGGTTCAATTTCAT
Protein sequences of DBSCAN-SWA_6 >CP028352|37537:38071|37537_38071_-|AVV39980.1|DBSCAN-SWA MKLNPRIVATALWLTFSTGAAVAAPSVDAGFSPEGSAQQLVLRTLDDARESIRLMGYSFTSPEVVKSLVAAKRRGVDVRVVVDEKGNRSKASQAAMNVVVNAGIPLRTNGQYKIMHDKVIITDGQNVELGSFNYSRSAAESNSENALVVREVPALAQTYLAHWQSRWEGGTDWHSSY |
1 | Wolbachia_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
59894 : 64676
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >CP028352|59894:64676|DBSCAN-SWA ATTACTGGGCCGGCCGCGGCGGCGTAGCCGGTACTGCCTGTGGTGCTGCTGCCGGCTGCTCCTTTACCCAGCCGGACTCTGCCTTATAGCGTTCGTCATAGCCGAGGCGGGCGTGGAGCGCGGCGGGCACATCCGCTGGCGAATCCAGCTTTGCGATGCGCTGCTGCGTGTCGCCGGTGAACCTTACTACCACGTGCTCACGCGGAACCGGCTGACCGTTTGAACGGTTAATACCATTGCCGCTGCCGATAGGCTGTGCCCCTTCCGGCGTTAGCAGATTCACCACCATCGGCGGAGCCTCCGTGCTACCTGGAAGACTGACCAGCACGTGCTGATAGGTGCCGTTCTGCCGCACCACCCCCTGTAAGTAATCACCGTCTCCCTGCACCAGAAGCAGATCCGGGCGGCCGTCTTCACCCCATGAGGACATGACTGGTACACCAGCACGGTCGTTATGGGCAGGACGTGGTGCCGACAGTGGATCACCGGGCTGACTGCTCCCCTGTCCGTCCGGCCGGAACCAGTACAGCCCATCTGCGGGCTTTGGTGGCGCATCAATATTGATGTCGAGACGACTTACCAGCGTATGCTGGATCTGTGTGTACCGATTTACATCTACGGCTGCCAGCGGCACTAAATCATCAGCACCGGTCTGAACGCGTGAGCCGCCAGCTGGCGACAGAGACCACCGATTTCGGTGAGTGTTCTCGTTTTCATAATGGGTGACGACACCCTCTTCATTTTTCCTGGGTACCTTAATGGTGACTGGTTGCTTACCCTGCCACTGGAGCGTGACCATACGGCCGGTCTTCAGATTTGTCTCGCGCAGCAGGCCAGCAAGCTCTTTGCCCCAGAACGTTTGCGGGCCCTCTTTTGTGCGCAGGGTGATAAAGGTACTTTCGCTGGCATCAGGTTCAAAGCGGAACGGAGCCTGTCCGAAGGCAGTAACTTTACCGGTGACCGGCTCGCGGGCTTTTTCCGCAGGGGTGTGAACCGCCGGTGAGATTTTCGGCGCAGCAGGCTCAGAGGCGGGTACAGAATTCTGCTGCTGTGAGCCTGTCATGGGCGCGGAGTTTTTCACAGGCACATCGGCATGCGCCGCCGGACCATGCTGAGTGATCGGCGGTGCCTCATAAGTGCGTGGTACCTGTTGGTCGCCGCGTACCGCATCTGCCGGAGCGGCAGGACTCACGGGCTGACCCGCTGCACGTCGTGCTGCATCCAGTTCTGCCTGCTGACTGGCATTCTTCATGCTCACGTCCAGCTTGTTGGCCACGATGACGTCGATCGCAAATGCTTTGAACTCGTCGCTGCCAGTGAGCTCTATGCGACCGTGGTATTGGTGCGCAGCTGTAAGAAGCGCGGCCAGCACTTTTTCCTCATGCGAACTGGTGCCTTCGGCCATCACCAGGCGATTGCCGTGGTCGATAAACGCGGGCTCGCCATCCAGCTTGTACAGGACGCTTTTGCCGTCCGGGTGTTTCTCGTGAGTGACGCGGCTCAGCAGCTTATCGGCATCGATACGTGCCGACTTCGACCCGGAAGAGGCGGGCGCATCTTCATCCTGAGCCGCCACGCGAGGCGCGCCGACCAGGATGGCATTTTCTGCGCCAAAACCGGAGGTTAGTGGGTTGCCCGGCGAGGCGTCAGGCTGCGGAGCATCCTCTGCAGCCGCCTGCGCCACTTCTTCATGTGCAAGTTCTGGCGTGCGGGCGGCGCGTTCAGGTCTGGCATCAGCCTCTGCCGTTTGAGAAGAGACGTCCGATACAGCAGAAGCAACAGGGGCTGAAACCGATGGAGCTTGTGTCGCCTGCTCAGATGCTATGGCGGATACCACTGGCATTTCGGATGCCGTTGCAGCTATCGCTGCATTAGGCTGCGGGTCCGTCACATTTGCTTTTATCATCGATGTCACCGGCTCAGTCCCGATGGCCATGGATGGTGTGACGACGGCGGACGGGGCCGGCACTGCCTCAGGCTCCGGAGATAGCGCGTCAGAGGATGTGACCGTTACTGAAAGCGGTGCCGGTGAGGCCGCGTCGGATGGGGCTGGTAGTGCGCCCAGGTTCGCAAAGGCCCCCTCTGCTGCAGATAGTGCAGCCGGAGTGACTTTGCCGATCTCTGACAGCCGCACTACCTCATTGCGCAGCACCACCAGGTTGTCCTCAAACTGTGCGCGTGAGACATCACCGCGAACGAGAAGGCTGAGCTGCGCCTCAATGCGTGCTGGGACGGTAGCAACATCGAGCGAGGCGGCCCCGGGATTGCGGGTTAAATAGTCTCCACGGGGAACCGGTGCGTCCGGCGAGGGCATACCCGCGAGTCCGTCCTGCGTTACGCGATCAAACGCACCCTGTACTTCTTCAAGGTGGCTGCTGGCATAGACGCCCTGCGCCTTTGCATCATCGAGCTGTGCCTGAAGCCGCGCCACGTTTTCCGTCAGAGCCGGCGCCGCCGTTTCACCGCGTGCCAGCGACACAAACTCATCACGCAGGGCCATCTGGATACGAACAAGCGCTGTAGCGTTCGGGTCAGCAGCAGACGCTGGCGGCAGCTGCGCCGCTATAACCGCCGCGGTTGCCGAATACTGAGCGGCACGCTCAGGGCTCTCCAGTGAAGCCATGCGCTCAAAGACCGCGCGTGTTTCCTCCTGACGGCCCGGCTTAAAGCGTTGCGGGTCGGCGCGGGTAAACTCGCCGTAAAGCATCCGCACCTCACGATCAAATTGCGCACGTGTCACGTCCCCTCGCGCCAGTAGCGCATACTGCTGAGCTACCCTGTCTGGCACGGTGTTAACGTCAAGCGTGCCCCCGGTCGGGTGAAGTGAGCTCCAGGTGTCGCGCTGGCCAAGAAGGCCACGCAGGCGGCTTAAACCGCCCGTTTCCATGACGTTAGCACCTGCCATTCCACCCTGCGTCACGCGGTCAAAGATCTGCTGCATTTCCTTTACGTGTTCACGGGTGTACACGCCGGCTATCAACGCGCTCTCCACGCGTGAACGCAGCGTGCTGATGTCCTGTAGCAGCGTCGCCGGTGAGATCTGACCGCTGGCCAGCGCGGCCAGCCTGTCACGTAGCTCGGCGCCAGTGCCGTAGAGACCAGTGAATCCGTTATAAGCGGCGGCGGACGGTGGCGGCATCACCGGTGATGCAGGTGATGTCAAAACCTCCGCCTGCGTAACAGCCGGGGCTAACGCTGAAGGTGCAGCCACCGTTGCAGTGACAGGTTGAGCTTCGGCAGTCGATGCCATGCCAGTCGCAGGCGCCGGGATGTCAGAGTCGCTGACGTTGACGCCAGGCGCCTCTGCAGCCGCAGACGTCTCTGCCGGAGTGGCCGGTTCGCTCACATTGGAAGCAGGTATGGTGCTGTCCGTTGCTGGCGCATCTGTCTGCTGCGCTTTCAGAATGCGGTACACGCTGGTCTGCCCGATGCCAAGCTGTTCAGCAATCTGTGCGGGCTTCAGGCCTTGATCGCGCAGCTTAAGAATCTCATCCGTTTTTTCCTTTGCCGCGCCGGGGCGGCGGGCGGGCATGACTTCTGGTGCGTCCGGGGACACAGCGGGATGATTATCCGGCGCAGGTTGTGCGTCATCAGGTGTGGCCATTGGGGCATCCTTATAATTGAGCTGCTCAAGAGCCTGAGCAATATCGGGAAGCAGCGTTTCGCGCAGGCGCTCAGCGCCACGGAAAAGGTGCAAATCGTTGAAGTCTGTAAAGCCGCGTTCGATTTCGGCGGCCGTCAGGTCTGGCAGCAGAACCACGCCTCCGGTGGCGGCTGCAGCTCGGTTTGCCGACAGTACGCCTTTGTTTTTTTCTTTTGCATGATCAACATCGGCCATAAACAGGTGAGGGCTGTCGGGATACCGAACTTTAAGCACGTTGGCCACTGCCTCCATGTTCCCGGCATCGATAGTCATCACAACTGGCCGGCCGGTAACCATATAAAGGCTGCGCGCTGTGGCGTAACCTTCGGCGTACAGAATGGGTTCACCGTTGCGCAGTTCGCCGCCCTCCACGCGAAAATGTCCGGCTTTGGGTGCGTCCTTAAACAGATATTTTTCGCCGTCAGGCGGAATGTACTGCAGCGTCCGGAAAGTACCGTCAGCATCATGGAATGGCACAACCAGTGCCCCATTGCGTGTCTGCCTGACGTCGTCAGCAACCGTAATGCCTTTGCGTACCAGATAACTGTGCGCCGGGTCGGCTGCAGGTAGCCTGTCATACAGCGATTGCGCTTTTGCGGTCTGTTGCGCATATAGCGCAGCCTGCTCGCGAGCAAAATCATCTTGTGACTGGCGCATCACCGCACGGATGTGCACGCGCGCCAGAGGATCGGCTTCGCCCGTGCTGCCACTGGCCTTCCAGCGAGTAATATCCTTATCGTTATCAGCGCGCTGGTAATTGATGTACCAACCGCCAGGTTTGATGCCATCAAGGAAGCCCCGGTAAACGCCATCGCGATTGCCTTTTTTTCCGTCTACAACCGGCACCCGGTGGCGCTGTCCATCCATCACCGGCAGGCCATCCAGCACCAGACCAGCGCTGGTCAGCGCATCGAAAAATTCTGAATGAGGATCACCGCCGCCGCTGCTCCGGATGCTACGGTCTGGCAGCCATTTCTGAAGCCGGTCCAGGTCGGCACCCGGGCGGGCATACCAGAGCTGCGCCTCCTTATTCCAGGCAATAGCGCTGCGCCCGTCCGGGTGCGTACCTGCATCGGCGCGCGCGGCGTCGCGCTGATCCGGTAAAACGGCAAGCCATACCGCATAGTCCGCTGTTTTCAT
Protein sequences of DBSCAN-SWA_7 >CP028352|59894:64676|59894_64676_-|AVV40003.1|DBSCAN-SWA MKTADYAVWLAVLPDQRDAARADAGTHPDGRSAIAWNKEAQLWYARPGADLDRLQKWLPDRSIRSSGGGDPHSEFFDALTSAGLVLDGLPVMDGQRHRVPVVDGKKGNRDGVYRGFLDGIKPGGWYINYQRADNDKDITRWKASGSTGEADPLARVHIRAVMRQSQDDFAREQAALYAQQTAKAQSLYDRLPAADPAHSYLVRKGITVADDVRQTRNGALVVPFHDADGTFRTLQYIPPDGEKYLFKDAPKAGHFRVEGGELRNGEPILYAEGYATARSLYMVTGRPVVMTIDAGNMEAVANVLKVRYPDSPHLFMADVDHAKEKNKGVLSANRAAAATGGVVLLPDLTAAEIERGFTDFNDLHLFRGAERLRETLLPDIAQALEQLNYKDAPMATPDDAQPAPDNHPAVSPDAPEVMPARRPGAAKEKTDEILKLRDQGLKPAQIAEQLGIGQTSVYRILKAQQTDAPATDSTIPASNVSEPATPAETSAAAEAPGVNVSDSDIPAPATGMASTAEAQPVTATVAAPSALAPAVTQAEVLTSPASPVMPPPSAAAYNGFTGLYGTGAELRDRLAALASGQISPATLLQDISTLRSRVESALIAGVYTREHVKEMQQIFDRVTQGGMAGANVMETGGLSRLRGLLGQRDTWSSLHPTGGTLDVNTVPDRVAQQYALLARGDVTRAQFDREVRMLYGEFTRADPQRFKPGRQEETRAVFERMASLESPERAAQYSATAAVIAAQLPPASAADPNATALVRIQMALRDEFVSLARGETAAPALTENVARLQAQLDDAKAQGVYASSHLEEVQGAFDRVTQDGLAGMPSPDAPVPRGDYLTRNPGAASLDVATVPARIEAQLSLLVRGDVSRAQFEDNLVVLRNEVVRLSEIGKVTPAALSAAEGAFANLGALPAPSDAASPAPLSVTVTSSDALSPEPEAVPAPSAVVTPSMAIGTEPVTSMIKANVTDPQPNAAIAATASEMPVVSAIASEQATQAPSVSAPVASAVSDVSSQTAEADARPERAARTPELAHEEVAQAAAEDAPQPDASPGNPLTSGFGAENAILVGAPRVAAQDEDAPASSGSKSARIDADKLLSRVTHEKHPDGKSVLYKLDGEPAFIDHGNRLVMAEGTSSHEEKVLAALLTAAHQYHGRIELTGSDEFKAFAIDVIVANKLDVSMKNASQQAELDAARRAAGQPVSPAAPADAVRGDQQVPRTYEAPPITQHGPAAHADVPVKNSAPMTGSQQQNSVPASEPAAPKISPAVHTPAEKAREPVTGKVTAFGQAPFRFEPDASESTFITLRTKEGPQTFWGKELAGLLRETNLKTGRMVTLQWQGKQPVTIKVPRKNEEGVVTHYENENTHRNRWSLSPAGGSRVQTGADDLVPLAAVDVNRYTQIQHTLVSRLDINIDAPPKPADGLYWFRPDGQGSSQPGDPLSAPRPAHNDRAGVPVMSSWGEDGRPDLLLVQGDGDYLQGVVRQNGTYQHVLVSLPGSTEAPPMVVNLLTPEGAQPIGSGNGINRSNGQPVPREHVVVRFTGDTQQRIAKLDSPADVPAALHARLGYDERYKAESGWVKEQPAAAPQAVPATPPRPAQ |
1 | Idiomarinaceae_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
69659 : 70124
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >CP028352|69659:70124|DBSCAN-SWA GTCAGCGGCGGGCCATGTAGTCGGCATAAACCTTTCGCGCATAGATCATGCGGCGGGCAGTATTGTCGTCGGCAAAGCCAGCGTTATAGCTGCCGAGACACTCCCAGGTCACACCACAGACCTGAAGGTGCCTGGCCAGAATCCAGGCGCCAACCATAACGTTCAGGCAGGGTTTGGTTAGCAGATCCTGTTCGCTCTGCAGCACGCCCATGGCACGCAGCTGCGGTACGTGCGTGGAATTAATCTGCATCAGGCCAAAATCCCGACTGGTAACCAGCCCTTTTTTATTGCGGTTATAGCCGATAGCTACAGGATTGAAGCCGCTTTCCACCTTACTGATAGAGCGCAGTAGTTGTGGATCTACGTGATAACGAGCGCCCGCTTCGTTATAGCAGAATGCCTGCGCGGAGCGGGAAGCGCACAGCATCAATCCGCTGGCCAGCAGCAGGGTGAAAAGTTTCTTCAT
Protein sequences of DBSCAN-SWA_8 >CP028352|69659:70124|69659_70124_-|AVV40010.1|DBSCAN-SWA MKKLFTLLLASGLMLCASRSAQAFCYNEAGARYHVDPQLLRSISKVESGFNPVAIGYNRNKKGLVTSRDFGLMQINSTHVPQLRAMGVLQSEQDLLTKPCLNVMVGAWILARHLQVCGVTWECLGSYNAGFADDNTARRMIYARKVYADYMARR |
1 | Ralstonia_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_9 |
79460 : 79844
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >CP028352|79460:79844|DBSCAN-SWA GTTAGACCTGCGGGCTGTTAAGGAAGGCTGAGGCTGTATATGAACGCGCAGACGTGGCTTGTGCCCTCTGCTGACGTTCAAACCAGAACTGCACAGGCTCTTTAACCTTACAGACCAGGCCAAAGCGCAGAGCCGTCCGAAAATTAATACTGCTACGCTGCGCCCGTTCAACCTGCTCTGCAAGGATGGCTCTGAAAGCCTCAGGTGTTGCATTGCCTTTGGCAACACGGCAGTCCCTGCAAACTGCCGCAAGCCCGCCGCTGATATATTTCTCGCCAATAAGTTCTGCATGCCAGCCCCGATCAGGTAACTGCTCACCGCAATAAGCGCAGCAGCCACCAAACTGGTTTTTCAGAGCTTCACGTTTGTTTTTTGATAGATGCAC
Protein sequences of DBSCAN-SWA_9 >CP028352|79460:79844|79460_79844_-|AVV40020.1|DBSCAN-SWA MHLSKNKREALKNQFGGCCAYCGEQLPDRGWHAELIGEKYISGGLAAVCRDCRVAKGNATPEAFRAILAEQVERAQRSSINFRTALRFGLVCKVKEPVQFWFERQQRAQATSARSYTASAFLNSPQV |
1 | Salmonella_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_10 |
86523 : 90141
Sequences of DBSCAN-SWA_10
Nucleotide sequences of DBSCAN-SWA_10 >CP028352|86523:90141|DBSCAN-SWA TTTACTTGCCCGGCCGTTCTTCTGCAGAAGCGGCCAGCACACCCGGATGCAGATGCACCCCGGAAACGTCCGTATTTTCACGCGCTTTGTGCAGATCGCAGGCGGCCTGCATGGATAACCAGAGCCGCGCTGAAATACCCAGCCCCGCCTCCAGTCTCACCGCCATTTCCGGGCTGACCGACGCCTTACCGCTGGCGACCTTGCTGAGCGCCGAAGCTGATACGCCCAGCTCTTTCGCCAGTACGCGCAGGCCAATATTGTTATCTTCAATGTACTCGGTGATCAGGCCGCCCGGATGCGGGGGATTAAACATGGCCATCAGTGATAGTCCTCCAAGTTGAGAATGTAAGCATCACCGTCTGTAAATTCGAACGTAATGCGCCAGTTGGCTCTTACCGTAATTGAATAAAGATTTTTCCTGTCCCCTTTCAGGGGATGGAAGCGAAAGCCCGGATAGTTTTTAAATTCATCCGTCGATACCGCTTCATTAATCACCAGCAGGCGAAGCCTGATGCGTTCAGCATCTTTCGACTGTACACCGCTCACATCACCTTTCTCAAAAAGTTTCTGTAAACCCTTATGCTTCCAGCTCTTTATCACGATAGTGCTCCGTGTTTCGTCTTGAGAAACATTATAGCATGCGTTTCGCATGGAGAAACAGTTTTATCTTTATTTTTCACGCTTGGTTGCTGTAATTTGTTCGGTATAAGCCATCGATTCATCTCGCGCTGCATGGTGTCGATATGCGGCACGCGCTGTTGAATCTCCCAGGCTACGTCACAAGCTGCATCTGAAGAGTACAACTGAAGGGTTTCAAGTCCGGCCAGCGACGGAAACCGGCACGACCAGACCCTGATCATATTTTTCAGATCGTGTTCCGGCACGAAGCCGCACCAGGCAGTAAACCAGCTGCGTAGAGTATCAGGCGTTCGGAAAGCCAGTTCTATTTCTGCCTGGCAGATGCGATTTTCCAGCTGGCGCGCCCGGTAGTCCTCCGCCTGCCTTTCACGCCGCACTATACGGCTCACGCTTTTTTGTGCTGCATGTTCAGCACGTTCTGAGCGACGGAAAGCGGCTTCAGGAAACAATATCGTCGCAAGTTCTCCCGGCAGCGGCAGCGGACACACATCACCTGCCGTTCTAAGAAACGTATCCAGGGCGCTTTCCCATTCCGGCAGCGGCGCACGCTTCTCGCGCGGCCGCAGATAAATTCCTCTGATACGGTTCAGCGCCCGGACGCTGATGCGACTCGCGCCACTCAGCTGCTTCATAAAAGCGGCCGCGTAGGGATGGCGGGCAAGTCGTGTCCCCCGCTGCTGGGCGTGCTCTACGATTGCGATTGCCTGAAGTGCGGCCTGTCGTTCTGGCAACAGCGGCATCAGTGCCAGTGAGGATGGGATCATGCGGCGTGTCCTCCCATCTTCATCGCGGCAACAAGCTCCCCAGCGCCGGTAAAGCCGGCCCGCGTGTACTCAGCAATTACCGACAGTTTTTCCTGTAGTGCCATCGGATCAAGCTGGTTGCTGACGTACAGGCGCAGACGGCTGCGGGTGCGACTGTCTGCCACTACCGCACACCATGCGGGCCAGCAGGCAGACGGGCTTACCAGCTCAACGGCGATCTCTTCCATTTCTGGCTGCGACAGTCGCAGATGTATGGGATAAGCCCCGCCCCATTCACCGCGCATCTCGCAATCCATGCGCCAGCTGTCTGGCAGCTTCAGCGCACTGTGTACCAGGCCGCACATGCGAAGGCGCGCAGCTTCTCCGTGTTCGCTCAGTTCTTCAAGATCGCAGCCTGAAAACGTCTGCAATGCATTTAGGGTAAATTGTTCGTCTCGGATCATTATGTTTTGCTCTGTTTCGCGGTTCGTGGCTCCGGCCCGGTTATCGAGCCGGAGAGGTTTCCGGCAGGGGTCAACCCTGCCAGCCTGGCTGGATGGTTATTCCGGCTTGTACGCCATCAGGCGAACCACACGAACCCCGGTTCCCGCAAACATGCCCTCAATCGGCGCAGACCAGTTGCAGTCCCAGCCTGGCAGCAGGTCTTTGCGGTCACTGCCTGCGGGCAGAATGGCCGCCAAGCGCCCGCCGTCCTTCACCAGTGCCGCCGCTGCGTTAAGGTGCGCTACTGCCCGACCTTCGCTGTATGGCGGATTCATCACTACTACGTCAAAACGTTGCGGCGTCATCTGCGACCATTGCAGGAAATCGGCTTCGATCACGTTGTGTCCTTTTGCCTCCAGTACGCGGCAGTGCAGCGGCGAAATCTCTACGCAGGTGGTTTGCATTTTCGGCATCAGGTCGGCAAGTCCGCCCATACCCGCGCTCGGCTCCAGGCAGGTTTCCCCCTCGCGGATATCCGCCTCGTTAAGCAGCTGCGCGGCGAGTTCGCCCGTGGTCGGGTAAAACTGGTGTGTTTTCTGGTCCGGCAGCGCGCCGGAAAGCTGGACATCTTCAAGCGCCGTTGTCGGGTCGTAATCAAACTCGAACCAGGTAAAAGCATTGATATTCATTTTTACGCCGCCCAGGCTCATAAGCACTTTTTCTGCTTCTGCGCGTGCGGCCTTGTTCTCATCCGAATAACCTTTTAAACGGCGGTTAAAGGGGTTCGTAGTGACAGGCGGGCGCGGTTCTTCCCAGCGGTTGCGCTTTAAGGGTTCGGTGCGCTCTGTCTCCAGATCACTGAGTACGCTCAGGACAGAAAAAGGCAGCAGATTCGAATGCAGATCAAAACTTTTTACCTTTGTGCGCGGCTTCTGACGGTGCTCCGCAGGGATAGCCGCCGGATGCAAGAAGGCCAGAATGTCATTAAGGCGCCAAGCCATGTCCGGATGAATTTCAAGGTGAATAGTGCCTTTCATAAAGGCTTTTACGCGCAGTGCTCCGCCGTCGATAGCAACCCACTCACCCGAACGGGCGCGCGCGATGTTCAGAATTTTATTAGTCGTGTTCAGGCCGGTAGCATCACGGCCCATGAACTTCGCAATCACTTTGCGCAGGTCATCTACGTAGCCGGTGCGCTCGTAATTGCACATTCCCCACTGATCGAACATGCATTCGATAATCATGCGTTTGCTGAAGCCTTCCGGGCGGTTGGTCACGTGGCTGCGTGACAGGCTGCGGAAAATACCGTCAACGCGCTCGGAGAAAAATTTCTGACGTGAGAAAAGCAGCTCGGAAAGCGTCGCGCGCACGGCATTTTCCTCAAAATCCGGTGCTTTCATCTCATGGATCTGATTGTTCCACTCGTTGCGGCGATCGTTGGGCATATACTCGTAAACGTCAGTCAGGTTTAGCGCCTGTTGCCAGAAATCAGCGTTAAGGGAAGCTATCGCGCCAGGCAAGTCAAACACCACGCGCACAGGCGGCATGTGCAGCCGGGAATTACGTACATTGCCACGCATGAAATACTCAACCGCATCGGCATTCTCACCTTCGCGTAAAGTGGCCGAAATATTTTTTAATTTTGCGTACATCAAGCCATAGCGGCCTAACAGGCCGTCAATAACGTCTGTTTCAACGGGTGCAAAAAAGGTATCTGAAATGACTTCCCCGGCGTTGATTGGGCCGCGGTCGGTTGTCGCATCAGGGGTGACGTTTACGAGTGTGTTTTGCAT
Protein sequences of DBSCAN-SWA_10 >CP028352|86523:90141|86523_86841_-|AVV40029.1|DBSCAN-SWA MAMFNPPHPGGLITEYIEDNNIGLRVLAKELGVSASALSKVASGKASVSPEMAVRLEAGLGISARLWLSMQAACDLHKARENTDVSGVHLHPGVLAASAEERPGK >CP028352|86523:90141|87118_87925_-|AVV40031.1|DBSCAN-SWA MIPSSLALMPLLPERQAALQAIAIVEHAQQRGTRLARHPYAAAFMKQLSGASRISVRALNRIRGIYLRPREKRAPLPEWESALDTFLRTAGDVCPLPLPGELATILFPEAAFRRSERAEHAAQKSVSRIVRRERQAEDYRARQLENRICQAEIELAFRTPDTLRSWFTAWCGFVPEHDLKNMIRVWSCRFPSLAGLETLQLYSSDAACDVAWEIQQRVPHIDTMQREMNRWLIPNKLQQPSVKNKDKTVSPCETHAIMFLKTKHGALS >CP028352|86523:90141|87921_88368_-|AVV40032.1|DBSCAN-SWA MIRDEQFTLNALQTFSGCDLEELSEHGEAARLRMCGLVHSALKLPDSWRMDCEMRGEWGGAYPIHLRLSQPEMEEIAVELVSPSACWPAWCAVVADSRTRSRLRLYVSNQLDPMALQEKLSVIAEYTRAGFTGAGELVAAMKMGGHAA >CP028352|86523:90141|86840_87122_-|AVV40030.1|DBSCAN-SWA MIKSWKHKGLQKLFEKGDVSGVQSKDAERIRLRLLVINEAVSTDEFKNYPGFRFHPLKGDRKNLYSITVRANWRITFEFTDGDAYILNLEDYH >CP028352|86523:90141|88464_90141_-|AVV40033.1|DBSCAN-SWA MQNTLVNVTPDATTDRGPINAGEVISDTFFAPVETDVIDGLLGRYGLMYAKLKNISATLREGENADAVEYFMRGNVRNSRLHMPPVRVVFDLPGAIASLNADFWQQALNLTDVYEYMPNDRRNEWNNQIHEMKAPDFEENAVRATLSELLFSRQKFFSERVDGIFRSLSRSHVTNRPEGFSKRMIIECMFDQWGMCNYERTGYVDDLRKVIAKFMGRDATGLNTTNKILNIARARSGEWVAIDGGALRVKAFMKGTIHLEIHPDMAWRLNDILAFLHPAAIPAEHRQKPRTKVKSFDLHSNLLPFSVLSVLSDLETERTEPLKRNRWEEPRPPVTTNPFNRRLKGYSDENKAARAEAEKVLMSLGGVKMNINAFTWFEFDYDPTTALEDVQLSGALPDQKTHQFYPTTGELAAQLLNEADIREGETCLEPSAGMGGLADLMPKMQTTCVEISPLHCRVLEAKGHNVIEADFLQWSQMTPQRFDVVVMNPPYSEGRAVAHLNAAAALVKDGGRLAAILPAGSDRKDLLPGWDCNWSAPIEGMFAGTGVRVVRLMAYKPE |
5 | Escherichia_phage(66.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|