Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP047095 | Bacillus marisflavi strain 151-25 chromosome, complete genome | 0 crisprs | DinG,csa3,WYL,cas3,DEDDh | 0 | 0 | 2 | 0 |
NZ_CP047096 | Bacillus marisflavi strain 151-25 plasmid p25, complete sequence | 1 crisprs | csa3 | 0 | 1 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP047096_1 | 97564-97669 | Orphan |
NA
Consensus repeat of NZ_CP047096_1
|
1 spacers
spacers of NZ_CP047096_1
>1.1|97593|48|NZ_CP047096|CRISPRCasFinder TAAATTATTGTAACATTTATAAAAACCTTATAAAATCAACGTTTTTAT |
CRISPR arrays and Neighbor proteins around NZ_CP047096_1
The CRISPR arrays of NZ_CP047096_1 >merge|NZ_CP047096|1|97564-97669|CRISPRCasFinder ATGCATCATCTAATTGATGCATTAGATTGTAAATTATTGTAACATTTATAAAAACCTTATAAAATCAACGTTTTTATATGCATCATCTAATTGATGCATTATATTG >NZ_CP047096|1|1|97564-97669|CRISPRCasFinder ATGCATCATCTAATTGATGCATTAGATTG TAAATTATTGTAACATTTATAAAAACCTTATAAAATCAACGTTTTTAT ATGCATCATCTAATTGATGCATTATATTG
>NZ_CP047096.1|WP_159130493.1|95645_96929_-|hypothetical-protein MEYRLLSENRKGIIIKEVEMDLLKMIFNQRIVTSEQIGYWLGLKSKNEFANLRRRLTKFTAYRLLVRNKYPLVDGIFFYYYRIGSKGISILKELGLITPEESSYSTYKRIGQPAHMDHLLATQQVVIQVVKDMEQAKVESHYPYDSLFLRNKDDNKPFLVPDWVLSCQKKNVYIELDTGTERMSDIEHKMSRYYEWASMNPDQVHNVFFISLDESFRTTKFYGDRRRRIGNLRKGILDTEEGKKVPQNLNVYSVALKRASKIIQKVLRGEIPLQIDSKKHRLRTGLDVLIDMNNCFPYQFTEVGEEAYYKPSAPSSLRLDLIYDVFREGVYVETVGMMYLEEGLVSTLRILDYANVQVEERLDLKKPIHRIIGIYETSLESQHDVLGIEYSNLLIADHETWDKDYESQPTFYKQTSSANIVEVTYDH >NZ_CP047096.1|WP_159130492.1|93034_95656_-|TraM-recognition-domain-containing-protein MTIDLIKKKLPLILATVSLSLTVFVFIKATINIFFIFKSFIDNRMYYYSLFGNPFDPSLVVMVILMTLFILAWLFSTRVKFFQSFPWRLVQFSMLLGAVTSLLLWNVTAAVYKYVVPYFLNKADMIDLSSDKLQEVMLGNTQALLLLVMALPYIAVILLTLYISGLFNQYREEIIESFKTYEVTSRWLQKVTKSERDTGLPDVVLGPDSETKEEVMIPGKDRTLNTMIIGSIGTGKTAAVATPMINQDLHHFVKYINDFQKMVEEGTLEKLKGDYLNGLVVIEPSNDLCQSVYRLCLAHGIPEEAIYYVDPTNDETPAINPLQGPVDKVAESLTMVVEGLGENDNFFFAQSQRAHFKQYIYLLKLHSKNNEDPTMADLYEMYNDVQLVHLMHLKLKKRIEEEAPLAKTDRERDFWSVVQDVDRWFDATVIPVKDFKTQQMMRIESGKYKGMIEYYDKKGEFIEGLKNILTDISSNVLLRKVLFRKSTFDFDAHLKSGGILLVNTEKGDLMELSNVLGKMVLLSVQNAVFRRQPKVEPYHSIYADEFADYIYKPFKSFPAQSRKYKAIIHVIAQTIAQLADDFGDKYMHTVLTTFRNKIVYSDVSQMDAETFSFLFHEKNNYVESTTEQSVSPLQESPVTRMGMSYQQEKDSVMSPADILSLKAFEAAAKIVVDNESKPARRIKANFVPEIEFTEAIVKVNPEAAAFWIKERNRIIIDEDVKEVEEITPIETVEEQEERLEEEAAQEEILLSAEDVEHSIKHLHDKQPRDTVRFDAPPPARLRNRSEGVSLEGDTEAEMQAKFEDEEKEDEVSTEQNRSSIERKELFDATELIPKKLEIKSKNDQMEEGKSYSKSAPSDKEKALFNELVESMEE >NZ_CP047096.1|WP_159130491.1|91512_92298_-|hypothetical-protein MEIMIKEACDKGFWLCLEPSDWISLIAVFIAMIAAVASWGTIISQERINKKNKEAIIVPGIKHIDTSISHILSDWDDESDSIIGKKFSKTTLPIWNHGNSPVFNIQYCYYLENFGDIKANENKMGLFSNYEINLDTSVEDSKSITVSYNNINERQGVDTRYIKPFVRTLDVIKPNERANILLPDYFIILLNDFFFNSFLTDNRTPVLQLKIFYDDVYLNTWEKQFRIIPGPYNFKGEELNTSFEYEIILDKKKRIREPRKI >NZ_CP047096.1|WP_159130490.1|90331_91228_-|30S-ribosomal-protein-S1 MMATVTEKWSDHDLEQLAKYHREGTTVQGIVQSVGFLTIPVEEEGKMVSQETEVAIFRLEGNVKAYCPAKEFSNRSFKTLNGFVGTKQNLIITRLNLQQQMALVSVKKADEKKSEQFWNTLKYLDKKEELADQIFTGVVYGVNEKSETVHVRVEGTDCFMVKYDWDWNRQSNVLADVERGETIQVVVKKFDEKEGLVRVSRKDTMEDPFKKLEQMREMEVVVGRVTDVHHLHGIFVQLEEGVTLKASKPRYLDEPIVGDIVAVKVREIDAKNRKGKVVIVNYPQGRKKRKDVGSFLFD >NZ_CP047096.1|WP_159130489.1|89333_90320_-|hypothetical-protein MAIDVTGVEDLEQEEVVNESAYFPSQEGDVPFWDHPLMYGHTKLPKNYYPYTTKAEVLRAFKRKQITEEDFTILKVLGDAVAANEDQLRRYLSSKMSRSAVSKRLEGLRNHGFVERWHCRLENDDEEKVRPPAPFTLGIAGYKLMKHFYSDCAFMNPDSWDKDNGKSIQRYVAMNELRCLLVESKVVRGWQWNATIGNQRKYRSPFGVAYIDTPNTPINFLIERPQMSQNFIGYLRDTLAQWKNIYENTGALEISRAPKNTAIVILYASTISMAEHIHMKLSLNEYPFQVWVCVEELLDKGGLAKSFMRPLTDGQLERIKLPFLEPKA >NZ_CP047096.1|WP_159130488.1|87476_88976_-|hypothetical-protein MKKVLLAIGDERLSGIFRRALTSHSNDFEVLDSECFRSQYLEELIDTNRPDILLLHESLLFAEYPEEERDDTWLRIIQRYRVKFEDTLRIVFVCERSKRDPFLSHLVIRSVYDIFNTSSIDKELMVEQLKERPLFTNIKQFIHGIEFDDIEVVEDELDDPVVKKPKSTEKDEGESKLKKEKNKVQKQVVQKVINKNIIKRDINIQLNSQVEKLVGIPIEKKIIMIASPFKRSGSTFIAHMLARQLANMGISVSFVESPYSYAYTYDRFIGHEKISNFRSKFYQFSSKDEFKLPGPNEWDLEGINMVSKHPSNEPIYSQDEIPFDMYVKVLLSLQSTVTIVDAGTDWNLEVHKNVHDIATKTYFVVEPDFSLMQYIEESKEEHILDYRALIEEKKTELIANRFEDSLKTNEVVEQVFQKNLKACVPVFPVTEVFDAQYKGLFLNDVPSVQPLVEEALRPIIEDILPEEFIKKQKGKISRFMGMFNKRLTINKNQKEMNPQ >NZ_CP047096.1|WP_159130487.1|86769_87480_-|hypothetical-protein MKPGVKIVLGISLSVLTMSFVVMYDLYIKERIDSEEVVVVKAGQEIKKNERITRSMIAVERRSKQSIISDAIPASEFGTLLTKTAGQTIVGNSMISKKMIDYDLLIPDPDSGEAIRPITDEMIYSQPGSLRRKDRIDIYLVKKKEESGNVNVASSKDGKKNSNLVQDPILTGIRVVYVKDDSNKEVINGTPDKEKDDRLNASGKISALEVILNEEDFQKLMDKVVNEDYLLYITYN >NZ_CP047096.1|WP_159130486.1|84006_85563_-|Flp-pilus-assembly-complex-ATPase-component MVKTLEAAIEEIELVPSNSNLDEDTLGLETAFNPADWIGQVSEEKGLGKQATITTFERKVSFKKICKIVKDELYENNENEDNEGKIAQMLERQHKAVIGDRHEMSRFTNQITEVLRKQNITSKDYPDFYGSLAEAVFHEVWGVGILHKWEKYPDSEACVIRGTELWIDINGKFVKQDETYENEEAVERVKRAFLIRMKDAVLNEQKPEIEIEREDGSRITMIQKPRSRDNYVMFRRFVVQDLSLLEQSKRETIPERDIPIYQALSKAMVNIVFAGRVRSAKTTFMKTMIRERKPEYIGAVMEKHFELGLSKHFPDRLFFEVQAKEGDLHKAMPRLLRMEHDYIIVGEIRSLETEAYLQSSERGERGNLTTYHLTNVDNVVEQIARHILDEFPTRNIDNEIARVAQNIDIVITLKSDRDRRRKRVIGVTEIVWDEKRRMHFTQDLIRFSPLTNKYYYSSKISKELFMSMMEESEEDAKRLLTLLSKREQESGMSEYLNLKDNSYDSVLEGVGEGETIYG >NZ_CP047096.1|WP_159130485.1|83033_84014_-|hypothetical-protein MDSLVTWFWASGTHLILYVLAVSFALYTTKEVIALPLKEKISEWQYRSRLRIIKTENEFKTKRIEHPFFRHIYLLIKATSSTKSNGDVAAFYVVTSLLSGFTFVVAFLKFQDWIGALLIAACIGAIPYLALQMKLRIVRNSVSDEILTIIQTITQQYSANSYDIYYALVESYKEIENRELRRVFTRLISELQVSRNEDQLREVIEIFVYSSNSNWAKRLGSILLKAYLTNENVLNALLVLSRQVEQTQEMLEEEKSQSMDSVANGFITVPIFVGSIGLGYYTSGAQDWFNLQFDNNYALSLFVASLIGVIFSVFISLILKKPKNDL >NZ_CP047096.1|WP_159130538.1|82111_82975_-|hypothetical-protein MYGMAIVFSLGGGALVYAGLSNSTERLQTRLRMRSTLNKGRALVQESASKSAAEDWLKKAGNPLGLTSLTYHIIFIVGAVILLGNYVFIPFIMTGEFSIIAFAVIFIGFLFFLPSMPYSLFVFFMKRLVDFQQAKKNAEVFMLYDLLINELQMMKVSRINSYNLIKDLLPYFSTIQPFLAKTLTEWSSNIGPDAALENLGKKLGTREGRSLVSVLKSLDRVDRVTAITSLKGMQEMFTRAQIENNRRKRKVTTDLLGIPVKVTHFLIIINFIVVVVIMVSNVLSSSH >NZ_CP047096.1|WP_159130494.1|97791_98124_+|hypothetical-protein MDATNEKIGSIDDLQYNKNVLCFLRSKLDDFAIDCFKMVARQHNHNGLIKSKMIEDYSSKRYIYEKAFNFLEAQGFIEIKELGNMKPYFITTRGRQLVDLILNEKNQHEN >NZ_CP047096.1|WP_159130495.1|98135_99473_+|cell-division-protein-FtsZ MLNLEKLGATRAILSAHMNSKEVQREIDRYPSQAVIGLGQGGGRIAAEMSRFGFPTYLVNSSKSDMDEHAPLIPEERRVITKSQDFPELEGTDKNAQLGFQIAKENQQAYKELAVSDEIQNSDFVWVCVSLGGGTGNGALKVALAYLSKIRSNRSLPGNKIPLGVICSLPSSEERGSAFRENALAGIQVLQSMINQNQIGSVLVIDNEKMKDYYAESPLVTYAGTEVDAKSYSNMVVSSIVAEVASLPLLQGRSVFDKTEFLSTISTAGWLSISKHKGLSDDEDLEKIIDFLFKENEVLANYELSNAIAGAVAVMYPDTKKVSPRIADDVYKYTSELLQSKVNLSISSNSKIEDIQLYGLTVYPQPSNRIKQLREELEHWKQKEKEQEEAKLQAASALELDEFNNFFSSSETTTKRKKFTIDDLNSEDEEKPASQASLDDLDNLF >NZ_CP047096.1|WP_159130496.1|99829_100927_+|hypothetical-protein MDREDGIIINNDDQSQEDKSVTIHNELAEKFKDDPLSHLLIEAGKSDLLLPDAIIDQLKRKNLYSISQAADIVGKKDYNIRNNIQRNGLGDYVGITQTGKLYRLDYIGIYKLYLIFTVQEELRLNPADIASVVGVMAEKVRSFQPAKPNNFYSNNEIATPGPGPYQSNQDAENQILRMIMFNQLVDQRKEKQYELVEAKRLVSEWEIEMNHLNQMIEMQESLRAMAKNVNSKEEVNDWITHINKSLKNAFDSQQDEKGFWGRLFGSKKEDNSFKEVELSHKESAVMLEIEQDLDKLKKQKEEILGRKELLLQNRSDKENELKAFDEFVESQRTLLIDSTNNPIIRGMLENNNPAALLSHLNEKDQ >NZ_CP047096.1|WP_159130497.1|102024_103695_+|hypothetical-protein MVSVKSDVTQCQLNKALNPAFSKENVHRVLVSAAFMNQLFQNDQFYKNSPTFQHGSSSVKLYHVTRSDILTAISLFISCNTVGEINEITYNRLFEQRIKPLYNKFLSYQEFVLSIHKFSELKLLKISQNPITKRYNLKINHFLQEGSDSSAPVPERYISLHPFIFEERFLRQSIDYWKMYLRYIVQCNMTSSPRYYYFNQDKENYISHLQSSDLRTFLNKKENHQVKQVIQKLTSVEILPGEGPLFTSTGNGPLITKRFKRFHEVGLKINPSYLSHRSITTEARFPLAIEERYQRESKFIQHYLEALGVGELYHVELKGKFNGSLAKNLVYRLKSYSKGMVKIALDELAAEFKMFKRIPSNLDAFISTALRFKKQTEFKRILRDENLYTLLIRGWKATEREDRIYEFLNVLSPLKVNEFKAFCRHGYKSLKATYQRSTISEGDYRCSVELDNIPGIDLLRRTAYKLEVDPLEYNKQEESLAKLIPLATDNTEVGSLIHTVFKRLNSLGKHQGSHLDLGKVKLESILLDEWLKTGSKHFQKRISQVYYHLKSLRITL >NZ_CP047096.1|WP_159130498.1|104161_105028_+|site-specific-integrase MTFKFNFTYDFEQYLIHEKKFSSTTLNQYTYTVEMFFNHLKHKYKKSFIDPIEVQPKDIRGFLEERITAGTSISTVNRYVSNIKTFFDYLWAKGKIVIDPAVKIERFKVDRRGVKHVEYSNVIKMFPTIMNSHKYPVILKAIYVLAMHGFRAKEFHILKNEVFEEDQDIFIRTPKRSIVLKGEEASIFRAHFYNSLFDSSEYVFTTKKNSTGGFAPIEYESLYMYIGYIRNDFNLAVTFNLENIRISYAYHLYKQKNYTVDDLAEELGIQRLSAAGLLETTLERYEAE >NZ_CP047096.1|WP_159130499.1|105129_105735_+|hypothetical-protein MMKSHESRHIALHNLTVGGLIGEGEISFEDPFRIEVNLENNKLAFIHNYISENELDSLVQVNLKRNQLTIYTQQKEFTHWYKNEQKVYVSSDVTYKSILIALILFGDRKMESLSIHTSVASKYLPTMAYSLEKILKVPIYAATKELKLFNTNNLFLNALMHLSLIECTELANFLTISEKRELRNILKDVEECEGGLMYGTS >NZ_CP047096.1|WP_159130500.1|105721_106393_+|hypothetical-protein MGLRNVLLFTILSSSLAVAGCNNRIVNEANAVGEKNSEMAAKAEQEKEQKVESQREIYKEMEKPIDEVIEKIDKDKKKVVDPKVIEKATYTNPEEFSKKLGQVLYEFSTGELSVDDYYNFLTRNASDDFLSQYLPNESAGTLFLENVQSLLIDKLPELKKGYTISELTYDRFEKEAYFYRKVTTVKDDKPIYYISTIVKEGDSWKFQDDSPSPPFEQENDLED >NZ_CP047096.1|WP_159130501.1|106396_107254_+|hypothetical-protein MINATNDLTAYAKEKLTLHFENNEDFSREFMALTNFYSKKFNHVIDSEEMLTKFYNDFFEIAKSQIFQGYYLMLQIMYDEKNEIEDDFLAQDLGTLKDEIPSLLRQGFGVALESVKKTENAHKLSMWLVTNFENVYDLINQVFFDLLCTGSLYALSDEGHRRGLSSSGQEDTPLLMGSPLAKTFINPQIYMVPTAKGEDYEMWDLRWWSSFKSDEKAGGATILNIESEESKQFILSVDLHQTVHYSEREDLVATLSALLSVRNNVPQQKIFINVSVVDDYILVQI >NZ_CP047096.1|WP_159130502.1|107449_108079_-|DUF4352-domain-containing-protein MREKRSDIFGFLSLGVGFIGLLNCLVPIVGLITIVFSIIFGIISLIKKERTKWKAITGISISFTALIIATVAIIISFIPDPTLSQVDKPDWAPEEVYDYGYPVKHDDVEISVNKVFFSDGKMIRPGITIKNIGEDTVHYSPTDFTLVQDQGNEKGKRINPVTNSELMLKSGELEPGEEISGEIEFVFPVNHSFLALVYKDEVYIDVTID >NZ_CP047096.1|WP_159130539.1|108397_109261_-|thiamine-biosynthesis-protein-ThiF MDLLAKEYATQKRKKLFPLTVLVGTGGTGSALVQQIAQMYQEFDDKGFLLLADPDTIEEKNIKNQLFTPGTVGKKKAEVLAKRYSSAYGVNIHSYTKDYVESVNVLQKLYNPEYINIPNFSSIDMMVLPILIGCVDNNWTRKIFHDFFNKVPTLLYLDAGNESTKIPVDFPTRPKSEWTSQELIDYNESGWTGQVVAGLKINGKTILDPAAVRYPDILEDDPTDKQPSKLSCEELSASDPQRLVTNRMAALSLSSYVAELFDCGTISNSLTVFHSRKGYMRSEMIPE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP047096_1 | 1.1|97593|48|NZ_CP047096|CRISPRCasFinder | 97593-97640 | 48 | NZ_CP047096 | Bacillus marisflavi strain 151-25 plasmid p25, complete sequence | 97593-97640 | 0 | 1.0 |
1. spacer 1.1|97593|48|NZ_CP047096|CRISPRCasFinder matches to NZ_CP047096 (Bacillus marisflavi strain 151-25 plasmid p25, complete sequence) position: , mismatch: 0, identity: 1.0
taaattattgtaacatttataaaaaccttataaaatcaacgtttttat CRISPR spacer taaattattgtaacatttataaaaaccttataaaatcaacgtttttat Protospacer ************************************************
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
373483 : 381797
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP047095|373483:381797|DBSCAN-SWA GATGATAGAACGTTATACGCGCCCGGAGATGGGAGCCATTTGGACGGAAGAAAATCGCTTCCAATCTTGGTTGGAAGTAGAAATCCTGGCATGCGAAGCCTGGGCTGAGATCGGGGATATCCCGAAGGAAGACGTAGCCAAGATCCGTGAAAATGCAGGATTCAATGTAGACCGCATCAAAGAGATCGAAGAAGAGACGCGCCATGATGTGGTAGCCTTCACACGTGCGGTATCCGAGACGCTCGGAGAAGAACGGAAATGGGTCCATTACGGACTGACGTCGACGGACGTAGTGGATACGGCCCTTTCGTATCAGCTGAAGCAGGCCAACGCCATCATCCTGAAGGATCTGGAACGTTTTGTAGAGATTCTGAAGAACAAAGCCATCGAACATAAGCATACGGTCATGATGGGGCGTACGCACGGGGTGCACGCTGAGCCGACCACGTTCGGATTGAAGCTTGCCCTCTGGTATGAAGAGATGAAGCGTAATGTCGATCGCTTCAAGGACGCTGCGGCCGGTGTTGAATTCGGTAAGATTTCCGGTGCTGTCGGAACATATGCCAACATTGATCCATTCGTTGAAGCGTATGTATGCGAAAAGCTCGGAATCACACCGGCACCTGTTTCCACCCAGACGCTTCAGCGTGACCGTCACGCACATTACATGGGGACACTTGCACTGATTGCAACATCCGTCGAGAAGTTTGCAGTCGAAGTGCGCGGACTCCAAAAAAGTGAGACGCGTGAAGTCGAAGAGTTCTTCGCCAAGGGGCAAAAAGGCTCTTCTGCCATGCCTCATAAACGCAATCCGATCGGTTCCGAGAATATGACCGGGATGGCTCGTGTGATCAGGGGTTATATGCTCACGGCCTATGAAAATGTACCTCTCTGGCATGAGCGCGACATCTCACACTCTTCTGCTGAGCGCGTGATCCTTCCGGATGCTACGATCGCCCTCAACTACATGTTGAACCGCTTTGCCAACATCGTGAAGAACTTGACGGTGTTCCCGGAGAACATGAAGCGCAATATGGACCGTACACTCGGCCTCATCTATTCACAGCGCGTCCTCCTTGCCCTCATCGATAAGGGGATGGCTCGTGAGGAAGCCTACGATACCGTGCAGCCGAAGGCCATGGAAGCATGGGAGAAACAGGTGCACTTCCGCACTCTCGTAGAAGAAGAGGAAGCCATCACATCAAAATTGTCGCCTGCTGAGATCGATGACTGTTTTGATTACAATTACCATTTGAAGCATGTAGATACGATTTTTGAGCGCTGCGGCCTCGTAGACTGACGCTCTGACAATCGCATAGGGACGGGTTTATATTCAGACCCCTCCCTTTTTTTATGGAAATATAGACTGAAGATTGGCAATGTTTCGAATATGGAGGGACTCAACATGGAAAAAGGCTCGCTGTTATACGAAGGAAAAGCCAAACGGATTTACGCAACGGATGACCCTGATGTCGTATGGATCCATTATAAAAATTCCGCCACGGCCTTCAATGGGGAAAAGAAGGCGGATATCGCTGGGAAGGGCGTCCTGAATAACAAGATTTCCAGTCTGTTATTTTCAAAGCTTGCCGAACGCGGCATCCAGAGTCATTTCATCAAACAGCTGTCGGATGAAGAGCAGCTTGTGAAAAAGGTGGATATCATTCCACTCGAAGTCGTGGTCCGCAATGTGATCGCAGGCAGCCTGTCGAAGCGTCTGGGCAGGGAGGAAGGCGAGGAGATGCCTTCACCGATCATCGAGTTTTACTACAAGGATGATGCGCTGGGGGATCCACTCATCAATGATGACCACATCGATTACCTTGGGGTTGCGACCAGTGAAGAGCGCCGGGAGATCAGGGATATGGCCCTTCAGGTCAACCGCGTCCTGCAAACGATCTTCAATGAGGCTGGTGTGATCCTCGTCGATTTCAAGCTTGAGTTCGGAAAGGACCGGGAAGGACAGATACTCCTCGGGGATGAAGTATCGCCGGATACATGCCGATTATGGGATGCCGAGACGCGCCAGAAGCTCGACAAAGATGTGTTCAGAAGAGGAATCGGTAGTTTGACAGAAGTATATGCCATCATACTAGAGCGATTGGGAGGAACTGCACATGTATAAAGTCAAAGTATATATCACCCTGAGAGAAAGCGTATTGGACCCACAAGGAAGCGTCGTGAAGAATGCCCTGCATTCCATGGAGTTCACGGAAGTGGAAGAGGTTCGTGTCGGGAAATATATGGAGCTGAGCCTCCCGTCATCTGTAAAAAACGTTGAGGCGGCCGTTGAAGAGATGTGCCACCGCCTGCTTGCCAATCCGGTCATCGAAGACTACCGATATGAAATCGAGGAGAGTGTCGCTCAATGAAGTTCGCGGTCATCGTATTTCCAGGATCCAACTGTGATGTAGACATGTATCATGCGATCAAGGATGAAATCGGCGAGGAAGTCGAGTATGTATGGCATGAGGAAGCGAGCCTTGAAGGCTTCGATGCCATCCTGCTGCCCGGCGGATTCTCGTATGGCGACTATCTCCGTTCAGGTGCCATCGCCCGTTTCTCCAACGTCATGAAGGAAGTGATCAAGGCTGCGGGAGAAGGAAAGCCCGTACTTGGTGTATGCAATGGATTCCAGGTACTGCTTGAATCCGGACTTCTGCCAGGTGCCATGAGACGGAACGATCATCTGAAGTTCATCTGCAAGACCGTACCGTTGAAGGTCGAAACCAATGAAACGATGTTCACGAACAGCTATGAAAAGGGTGACGAGATCCAGATCCCGATTGCCCATGGAGAAGGCAATTATTTCTGTGACGAAGAGACGCTCAAGGAACTGAAGGCCAACAATCAGATCGTGTTCACCTACGGCGGGGATAATCCGAACGGAAGCGTAGCGGATATTGCCGGGATCACGAATCACGCAGGCAACGTCCTTGGTATGATGCCTCACCCTGAGAGGGCCATGGAACCTTTACTAGGTAGTGAAGACGGGAAGAAATTATTTCAATCGATTGTGAAGCACTGGAGGGAACAACATGTCGTTAATGCTTGAACCGACTTCAAAGACAGTTAAAGAGGAAAAGCTTTACCGGGATATGGGGTTGACCGATGAAGAATTCACGATGGTGGAAGAGATCCTCGGAAGGACACCGAACTATACGGAAACCGGCCTTTTCTCGGTCATGTGGTCCGAGCACTGCAGCTACAAGAACTCTAAGCCGGTCCTCCGCAAGTTCCCGGTCACGGGCGAACGCGTCCTGCAGGGTCCTGGGGAAGGCGCGGGAATCGTCGATATCGGCGACGACCAGGCCGTGGTCTTCAAGATCGAGAGCCATAACCACCCATCAGCCATCGAACCCTACCAGGGGGCGGCGACGGGTGTCGGAGGCATCATCCGTGATGTATTCTCCATGGGGGCTCGCCCTATCGCACTCCTCAACTCTCTTCGCTTTGGAGAGTTGGATTCTCCTCGTGTTCAGTATCTATTTAAAGAAGTAGTGGCAGGGATCGCCGGCTACGGGAACTGCATCGGCATCCCGACGGTCGGGGGTGAAGTGCAATTCGACGCTTCCTATGAGGGGAATCCCCTCGTCAATGCCATGTGTGTCGGATTGATCGACCATAAAGACATCAAGAAGGGCCAGGCCCACGGAGTCGGCAACACCGTCATGTACGTCGGCGCGAAGACAGGACGCGACGGGATCCACGGGGCGACATTTGCGTCGGAGGAGCTCAACGAAGGATCGGACGAGAAGCGTCCAGCCGTTCAGGTAGGCGACCCGTTCATGGAGAAATTGCTCCTTGAAGCGTGCCTTGAGCTCATCCATAACGATGCACTTGTCGGGATCCAGGATATGGGGGCTGCAGGACTGACAAGCTCCTCGGCTGAAATGGCTTCAAAAGCCGGATCCGGTATTGAGATGAACCTTGACCTGATCCCGCAGCGTGAAACGGGGATGACGGCGTATGAAATGATGCTGTCTGAGTCCCAGGAACGCATGCTCATCGTCGTCAAGAAAGGACGCGAGCAGGAAATCGTCGATCTCTTCTCCAAATATGATCTTGAAGCGGTATCCGTCGGGCATGTAACGGATGACCAGCGTCTCCGCCTCACCCATAAAGGCGAAGTCGTAGCCGATGTGCCGGTCGATGCACTGGCTGAAGAAGCACCGGTCTACCATAAACCATCCAAAGAAGCGGCGTACTACAAGGAATTCCAGGCGATGGAGACCCCGGTGCCGACTGTGGACGATTATAAAGAAACCCTTCTCTCCCTCCTTCAGCAGCCGACGATTTCAAGCAAGGAATGGGTCTATGATCAATATGACCATCAGGTGAGGACGAGTACGGTTGTCAGCCCGGGCTCCGATGCGGCTGTCGTCCGCGTGCGCGGGACAAGGAAGGCCCTTGCCATGACAACGGACTGTAATTCTCGCTATCTGTATCTCGATCCGGAAGTAGGCGGGAAGATTGCTGTTGCAGAAGCAGCCCGTAACATCGTCTGCTCAGGAGCGATCCCCCTTGCCATCACAGACTGCCTGAACTTCGGTAATCCCGAGAAGCCGGAGATTTTCTGGCAGATCGAGAAATCGGTCGATGGGATGAGTGAAGCCTGCCGGGAGCTGAGCACACCGGTCATCGGAGGGAATGTATCCCTTTATAATGAATCAATGGGAACAGCGATCTACCCGACACCGGTTGTCGGTATGGTCGGCTTGATCGAAGACATCGACCACATCACCACCCAAAGCTTCAAAGCGGAAGGGGACCTCATCTATCTGCTGGGTGAAACGCTCGAAGAGTTCGGAGGCAGCGAGCTTCAGAAGATGACGGAAGGGAAGATCTTCGGCCATGCACCGGCGATCGATCTGAAAGTGGAGCTTCGTCGTCAGATGCAGCTTCTCGAAGCCATCCAGGAAGGCCTTGTCGCCTCTGCCCATGATCTTGCAGAAGGCGGTCTGTCTGTCGCCCTTGCTGAATCGGCTTTCGGAACGGAAGGCCTTGGAGCGGAAGTGACGCTGGCAGGATCAAACCCTGTAGCGGATCTGTTCAGTGAAACTCAATCCCGCTTCCTCATCACCGTCAAGCCGGAACATCAGGAAGCATTTGAACGGCTAGTATCTGAAACGACCCTCATCGGTCGCGTAACGGATGATGCGGCCATATCCATCAATTCCGAGGGTGAAGGCGGCCTCATCAATGCATCGGTCCAAGAGCTGGAAGCAGCTTGGAAAGGAGCCATCCCATGCTTACTGAAATCAAAGGCTTAAACGAAGAGTGCGGTGTGTTCGGCATCTGGGGTCATCCCAATGCCGCACAGATCACTTATTATGGACTCCACAGCCTGCAGCACAGGGGACAGGAGGGGACGGGAATCGTCGTATCTGACGGGGAACGCCTCCGCTGCCTGAAGGGAGAAGGCCTCGTCACAGAGGTTTTCCATGAAGGGACAATCGAGAAGCTGGAAGGAACGGGAGCCATCGGCCATGTCCGCTATGCTACGGCTGGAGGCGGCGGGTATGAAAACGTACAGCCCCTCCTCTTCAACTCCCAGACCGGCGGCCTTGCCCTCGCCCATAACGGGAACCTGGTGAATGCCAACGGACTCAAGCATCAGCTTGAAGGACAGGGGAGCATCTTCCAGACGACCTCGGATACAGAGGTCCTGGCCCATCTGATCAAACGAAGCGGATTTTCCAGCCTGAAGAACCGCGTCCGCAATGCCCTTTCCATGGTCAAGGGAGCGTATGCCTTCGTCATCATGACCGAAGATGAGCTCATGGTCGCCCTCGATCCCCATGGACTGCGCCCTCTTTCACTCGGGAAGATCGGGGATGCGTACTGTGTGGCATCTGAAACATGTGCGTTTGATATCGTCGGGGCGGAATTCATCCGGGACGTCGAGCCTGGTGAACTTCTTGTCATCAATGATGAAGGTGTGACATCCGAGAAATTCAGTTTCTCCAGCGGCAACGCCATGTGCACGATGGAATATGTGTATTTCTCCAGACCGGACAGCAATATCCAAGGTGTGAACGTCCACTCCGCACGGAAGCGGATGGGGATGGAGCTTGCGAAGGAAGCACCGATCGAAGCGGACGTCGTGACGGGGGTGCCGGACTCCAGTATCTCGAGTGCCATCGGATATGCAGAGGCTTCGGGCATCCCGTATGAGCTCGGGCTCATCAAGAACCGTTACGTCGGCAGGACGTTCATCCAGCCGTCCCAATCGCTGCGTGAGCAAGGCGTGAAGATGAAACTCTCCCCGGTGCGCGGCGTCGTGGAAGGGAAGCGGGTCGTGATGGTCGATGACTCCATCGTACGCGGAACGACAAGCAGGCGGATCGTCAGCATGCTGAAAGAAGCGGGGGCGAAGGAAGTTCACGTGTGCATCAGCTCGCCGCCGATCAAGAATCCGTGCTTCTACGGCATCGATACGTCCACACATGAAGAGTTGATCGCAGCGAACAACTCCGTGGAAGAGATGAGGCAGATCATCGGAGCTGATTCCCTCACGTTTTTGTCAACGGAAGGGGTCATGAAGGCGATCGACCGGAACGACAATTCCGACAATCGCGGGCAGTGTCTGGCATGCTTCACAGGGAAGTATCCGACTGAGATCTATCCGGATACCCTTCATCCACACGAAAAAGAGCTAGTGAAATAGGAGGAAGATTGATGGCTAATGCATATCGGCAAGCAGGAGTTGACATAGAAGCAGGATACGAATCCGTCGATCGGATCAAGAAGCACGTCAAGCGGACGGCCCGGGAAGGCGTCCTTGGTCAGTTGGGGAGCTTCGGCGGCATGTTCGACCTCTCGGCATTGAACTTGAAGGAGCCGGTCCTCGTCTCCGGGACGGACGGAGTCGGGACGAAGCTGAAGCTCGCCTTCCAGGCGGATCGCCATGACACCATCGGCATCGACTGCGTCGCCATGTGTGTGAATGACATCGTGGTCCAGGGAGCGGAACCCCTATATTTCCTTGATTATATTGCCACCGGCAAAGCCGTCCCTGAAAAGATCGAAGCGATCGTCAAAGGGATTGCCGACGGCTGCGAGCAGGCAGGCTGTGCCTTGATCGGCGGAGAAACCGCGGAGATGCCGGGGATGTATGCAGATGATGAGTATGATATTGCCGGATTTTCCGTCGGCGCCTGCGAGAAGAAGGCGATTGTGACGGGAGAAGAGATCCGCGAAGGTGACGTCCTCATCGGCCTCGCGTCCAGCGGTATCCACAGTAACGGCTATTCCCTCGTGAGGAAGGTCTTCTTCGATGATCATGATTTCAGGCTTGAAGATGCGCTTCCCGGATGGGACGCCCCTCTTGGTGAAGTCCTCCTGACACCGACGAAGATTTACGTGAAGCCGGTATTGGAGACGCTGAAGGCATTCTCCATCAAGGGAATGAGCCATGTGACGGGCGGTGGATTCATCGAGAACATCCCGCGTATGCTCCCGGATGGACTGAAAGCCCATATCACCGAAGGGGCGTGGGACATCCCTCCGATCTTCGGAGCCCTTGAAGAATACGGCCGCATCCCGAGGGAAGAGATGTACAACATCTTCAATATGGGGATCGGATTCGTTATGGCCGTCGAGAAGGGAGTGGCTGAAGAGGTCCTTGCCTTCTTGAATGGCATCGGCGGGGAAGCGGCCATCATCGGGCATGTCGCCGAAGGGAACGGCGTGACGATTTCTCCTGCCTCAGGAAGCGGACGATCATGAAGAAGATTGCCGTGTTTGCATCGGGGAGCGGAAGCAACTTTCAATCGATCGTGGATGAAATCGACAGCGGTACGCTTGAAGCCGATGTCCGCCTGCTCGTATGCGATCGCCCCGGTGCCAGGGCGACGGAGCGTGCGGAAGCGGCAGGGATCCCAGTCTTCTCCTTCCGGGCGAAAGAGTATGAAAGCAAGGAAGCCTTCGAACGTGAGATCATCCGCGAACTGGAAGCGGCAGGGGTGGAGTTCATCGTCCTTGCCGGTTACATGAGGCTGATCGGGCCGACACTGCTGGAGGCGTTCGGCGGAAGGATCGTCAATATCCATCCTTCCATCCTGCCGGCATTTCCGGGCAAGGATGCCATCGGACAGGCGTTCGATGGCGGGGTGAAGGTGACCGGGGTCACGATTCATTATGTGGACGCCGGAATGGATACGGGTGAGATCATCGCCCAGGAAGCCGTGACCGTGGAGGAAGATGAAACAAGAGAGTCCCTTCAGCGCAAGATCCAGGCAGTCGAGCACCGTTTGTACCCGGCAACATTACGCGACCTGTTCAGAAAAAAAGTGATGGGTCGATCGCTTTAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP047095|373483:381797|378730_380152_+|WP_048007551.1|DBSCAN-SWA MLTEIKGLNEECGVFGIWGHPNAAQITYYGLHSLQHRGQEGTGIVVSDGERLRCLKGEGLVTEVFHEGTIEKLEGTGAIGHVRYATAGGGGYENVQPLLFNSQTGGLALAHNGNLVNANGLKHQLEGQGSIFQTTSDTEVLAHLIKRSGFSSLKNRVRNALSMVKGAYAFVIMTEDELMVALDPHGLRPLSLGKIGDAYCVASETCAFDIVGAEFIRDVEPGELLVINDEGVTSEKFSFSSGNAMCTMEYVYFSRPDSNIQGVNVHSARKRMGMELAKEAPIEADVVTGVPDSSISSAIGYAEASGIPYELGLIKNRYVGRTFIQPSQSLREQGVKMKLSPVRGVVEGKRVVMVDDSIVRGTTSRRIVSMLKEAGAKEVHVCISSPPIKNPCFYGIDTSTHEELIAANNSVEEMRQIIGADSLTFLSTEGVMKAIDRNDNSDNRGQCLACFTGKYPTEIYPDTLHPHEKELVK >NZ_CP047095|373483:381797|375599_375854_+|WP_048007507.1|DBSCAN-SWA MYKVKVYITLRESVLDPQGSVVKNALHSMEFTEVEEVRVGKYMELSLPSSVKNVEAAVEEMCHRLLANPVIEDYRYEIEESVAQ >NZ_CP047095|373483:381797|376520_378755_+|WP_048012137.1|DBSCAN-SWA MSLMLEPTSKTVKEEKLYRDMGLTDEEFTMVEEILGRTPNYTETGLFSVMWSEHCSYKNSKPVLRKFPVTGERVLQGPGEGAGIVDIGDDQAVVFKIESHNHPSAIEPYQGAATGVGGIIRDVFSMGARPIALLNSLRFGELDSPRVQYLFKEVVAGIAGYGNCIGIPTVGGEVQFDASYEGNPLVNAMCVGLIDHKDIKKGQAHGVGNTVMYVGAKTGRDGIHGATFASEELNEGSDEKRPAVQVGDPFMEKLLLEACLELIHNDALVGIQDMGAAGLTSSSAEMASKAGSGIEMNLDLIPQRETGMTAYEMMLSESQERMLIVVKKGREQEIVDLFSKYDLEAVSVGHVTDDQRLRLTHKGEVVADVPVDALAEEAPVYHKPSKEAAYYKEFQAMETPVPTVDDYKETLLSLLQQPTISSKEWVYDQYDHQVRTSTVVSPGSDAAVVRVRGTRKALAMTTDCNSRYLYLDPEVGGKIAVAEAARNIVCSGAIPLAITDCLNFGNPEKPEIFWQIEKSVDGMSEACRELSTPVIGGNVSLYNESMGTAIYPTPVVGMVGLIEDIDHITTQSFKAEGDLIYLLGETLEEFGGSELQKMTEGKIFGHAPAIDLKVELRRQMQLLEAIQEGLVASAHDLAEGGLSVALAESAFGTEGLGAEVTLAGSNPVADLFSETQSRFLITVKPEHQEAFERLVSETTLIGRVTDDAAISINSEGEGGLINASVQELEAAWKGAIPCLLKSKA >NZ_CP047095|373483:381797|373483_374782_+|WP_048007505.1|DBSCAN-SWA MIERYTRPEMGAIWTEENRFQSWLEVEILACEAWAEIGDIPKEDVAKIRENAGFNVDRIKEIEEETRHDVVAFTRAVSETLGEERKWVHYGLTSTDVVDTALSYQLKQANAIILKDLERFVEILKNKAIEHKHTVMMGRTHGVHAEPTTFGLKLALWYEEMKRNVDRFKDAAAGVEFGKISGAVGTYANIDPFVEAYVCEKLGITPAPVSTQTLQRDRHAHYMGTLALIATSVEKFAVEVRGLQKSETREVEEFFAKGQKGSSAMPHKRNPIGSENMTGMARVIRGYMLTAYENVPLWHERDISHSSAERVILPDATIALNYMLNRFANIVKNLTVFPENMKRNMDRTLGLIYSQRVLLALIDKGMAREEAYDTVQPKAMEAWEKQVHFRTLVEEEEAITSKLSPAEIDDCFDYNYHLKHVDTIFERCGLVD >NZ_CP047095|373483:381797|381209_381797_+|WP_063190568.1|DBSCAN-SWA MKKIAVFASGSGSNFQSIVDEIDSGTLEADVRLLVCDRPGARATERAEAAGIPVFSFRAKEYESKEAFEREIIRELEAAGVEFIVLAGYMRLIGPTLLEAFGGRIVNIHPSILPAFPGKDAIGQAFDGGVKVTGVTIHYVDAGMDTGEIIAQEAVTVEEDETRESLQRKIQAVEHRLYPATLRDLFRKKVMGRSL >NZ_CP047095|373483:381797|375850_376537_+|WP_048007508.1|DBSCAN-SWA MKFAVIVFPGSNCDVDMYHAIKDEIGEEVEYVWHEEASLEGFDAILLPGGFSYGDYLRSGAIARFSNVMKEVIKAAGEGKPVLGVCNGFQVLLESGLLPGAMRRNDHLKFICKTVPLKVETNETMFTNSYEKGDEIQIPIAHGEGNYFCDEETLKELKANNQIVFTYGGDNPNGSVADIAGITNHAGNVLGMMPHPERAMEPLLGSEDGKKLFQSIVKHWREQHVVNA >NZ_CP047095|373483:381797|380163_381213_+|WP_063190569.1|DBSCAN-SWA MANAYRQAGVDIEAGYESVDRIKKHVKRTAREGVLGQLGSFGGMFDLSALNLKEPVLVSGTDGVGTKLKLAFQADRHDTIGIDCVAMCVNDIVVQGAEPLYFLDYIATGKAVPEKIEAIVKGIADGCEQAGCALIGGETAEMPGMYADDEYDIAGFSVGACEKKAIVTGEEIREGDVLIGLASSGIHSNGYSLVRKVFFDDHDFRLEDALPGWDAPLGEVLLTPTKIYVKPVLETLKAFSIKGMSHVTGGGFIENIPRMLPDGLKAHITEGAWDIPPIFGALEEYGRIPREEMYNIFNMGIGFVMAVEKGVAEEVLAFLNGIGGEAAIIGHVAEGNGVTISPASGSGRS >NZ_CP047095|373483:381797|374887_375607_+|WP_048014866.1|DBSCAN-SWA MEKGSLLYEGKAKRIYATDDPDVVWIHYKNSATAFNGEKKADIAGKGVLNNKISSLLFSKLAERGIQSHFIKQLSDEEQLVKKVDIIPLEVVVRNVIAGSLSKRLGREEGEEMPSPIIEFYYKDDALGDPLINDDHIDYLGVATSEERREIRDMALQVNRVLQTIFNEAGVILVDFKLEFGKDREGQILLGDEVSPDTCRLWDAETRQKLDKDVFRRGIGSLTEVYAIILERLGGTAHV |
8 | Synechococcus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
2718485 : 2738047
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP047095|2718485:2738047|DBSCAN-SWA ATCATAGAACCATCGCAGCTAGGATACCTACGATGGTCAATATTGACGGGACCATGAGCCCAATCGCCCAGGTTGTCGTCTTTTTTATTCCCTCGATGTCTTTGGCATTTTCCTTCGCAAGTGAGAGCGCTTCCGCCGCCTTACCGTCCGCACTTTCAGCAGTCTTTTTGACCTCAGATAAATATCCAAGTTGCGTTTTTATCACCCCGATATCTACGCGAAGCTCAGTTAGAATGTCGTAAATCTCGCGAGAGTTGTGTTCTTTCTGCAGTTCTGGCATATCGTCGCCTCCTTTCTAGTGCATAAGAAAAGAGCCACCTAGTTTGGTGACTCTTTTTTTAGATTTGAAGAGTAAATTATCAGTCAAAACCTCGGTTCATTTCCTTGACTAATTAAGTAGTCGTTTAACCAATCGACTGAATATATTTTTCCAATACTAATATCATAATCAAGTACTAAGTTCATAATCTCAATAACTTTTTCTTTTGTAATAACTTCATCTTCCAAGTTCCAGTAATGATGCACCAGTTCTTCTAAAAATACCATAAGAATCAATTCATCGTCATAATTCTTCTCTCTCCATTTACCCACACAATAGACAGCTAAGTTAAACCGGAATCCGATTGAATCATCGACGTCAAGTTCCTCAAATTCAAAATTACCCCTTTCAGTTAACAGAGCACATAAACGAGGGAGTTTTTCTTTTTCAAGGAATTCTTGTTCTAATTTAATTGAAATCCCGTGTAACGCAACACTAAATAAGAACGCTTCGTGTTTAGTAATAGGAAAAGTTGCTCTGATTTCAATTTTATTAATTAAATATTCTGGAATAGTTGGACCAGGACAAGTAATCTGATTTTTAAATACACCCAATTTCATCACCTCCAAACTGTTATTCGACATTAAAAAACATTTACCTTTAAGAATTGGATAAATTTATCCGCTTTAATTCCGTAGAGATGTCGTAAATCATGTCATTGAGTTTGCGCCAAGAGGATTCGCTTATGATAAGTCATCTCCTTCCGTAGGATAAGAGAAAGAGCCGCCAGCGATGGCAACTCCATGTTAAGCAATTTGATCCGATTATTCTTTTATCCAATTTTCTAATTCCCAAGTAACCCCCGTTTTCTTTTTGAAATTACTAAGAGCTATAGCAATCCCAACACTCGTTAGAGATGAATTTGCTAATTTTGTGGTATTCCATAATTCAATTAATTTGGCATACCTTAAATGTAAATCATCAATAATTGGATCATTTGGATAAGAGTATGACAGTATTTTATTTATATCAGCTGATCCAATACTAATAGATATACATCCACTATACTGCAAATGTTGAAAAGCAATTTCGCCACTAGTTATACCCCGCAAAAATTGATCTAGGAAAACAGGAAAGTCTTTCGCTGAAAATAGCGCACCAATATTAACATGTTTAACTATAAATAAAACAGTCAATATATTAATTTGTTTCATGGTTAGTTTTGGTATAATTTCTAAAGCTTCATTACACACTAATGTGGTGAATTTTTGATCTTTTTGTTTTGTTCTGCTTACCAATAAATCAATCAATAAATCAGCAATTTCATCATCATTACGGCGAGCGTGGTTTTTCTGCGCCTCATATATTACGAATCTTACATCAGGATCTTTTGTGTTTGTTAATGATTCAGGAGACTCTGCTTTTAATTCATCTAAATATTTATTTATTAATTTTTCGGCTCTTTCATTAACAACTGCTTCAATTTTTTCGCCCAAGTCATAAAAGTTCGACTTAAAAACACCCATGGCGATGTCCTTTACGTCAGTATAAGACATACCGTGCTGGTGAACTGTTACTTCATTCCCTTGTACATTGTTTGAATGGTCGCCACCCGTGATTTCTTGTTTTCTATTTATCATGTTTAGTCTCTCCAAATGTCACAGTAACGTTTTTACCACCCTGTACATTATTAGAATGATTTCCAGCATCAATTTTTTGACCAGGGGACCTCTTCGACTTAAAAACAAACCCCAATAAAGAAGTTATAGCAACTATTCCAATCCCACTAAATAACCATTCTTTATTTTCCGAAATCCAAACCACTAAATCTCCCCCTTGCCAATCTTTGTAAATCAATGGTAAGGGCAAATTTACGTTTTGTCCATCTATTTAGATGAAGTTTCATATCTTATTGAAGGATATTTTTCTGTCAGGTCAACGTTGTAATCACACTTTTCGCAACGGTGTTCGATTTGCGGTGGGTTTGTACACCAACTGTTTTTTCCTGTAGGGAGCATTTCACCCTTATTACAATGATCACAAATGTACTTCACTCCATAAGGACGTACTTCAAACCTTTCCTCCAAAATCATCACCTTCTTTTCAAAATGAATAGCAGAATTTGATTTTAACTAATTTTTCATATTTATTTAGGTTAAGAAGCTAAAGCTTCCAACGACGTGCTCGACCTTAGCAGGGACCCGTTATTGGAAACTTAGAATTCAAACATGGCTGTGTTGTACTGTTTAATCTGCGACATAGGTATTTTCATCAGCTTGCGAAAGTCCATCGTAACACCACTTGTGCTCCGTCATTTCGACCCGACCTCCCCTTGCGTTTGTTGCTGCTCTAGCTGCGCCAGTTGTGCCGTCTTGACCGCCAACTCGATCTCGAGCTTAGCGATACGTTGCGACAAGATATGAATCACGTCGTTTCCGTCCACGTTTGTATTCATTACGCCTCACCTCCCTCGATTAATAACCTCAAGTCCGCTATTTCCTGCGCCTGTGCCTTATTTTCCGCGCTCAGCTCCTGCACACTCAACGCCAGGTAGTTCACATATTTATAAACGTTAATCGCCGACAAGTCCTGCGTAGCAATTTCCGGGCTCAGCTCGCTCAGTACGCCGATTTGCCAGTTCGAGTAGTTTCCGACGTTTACATCTTCGTTCAGAATGTAGCGCTTGATTTCGAGCTTGTTGATGACGTCTAGCCCGCGCTCCGTGACCGGCTCGATGTTTTGCTTCGTCAGGACGCTCGATCCGTTCCGGTACTCGCTCGCCTTGATTGGGCGATAACTTATAGTGGACCCGTCGTTATAGCCTAGGTGATTCGTAACGCGGACCTCGCCGTCAGTTGGTCCGCAACCAAGGTAGAAATTATCGCCAGTGGTTTTTATTCCGCCTGCTTGGAGCACGAAGTCGTTGGCGCCGGACCCGCCCAGCGTACCTCCACCTCGACCGTTCCAATAAACGCTATACGTACCGCCACGTGTCGTGACGTCATTTGCGCCAACTTTATCAGCGTCAAGTTTGCCCGTACCCATGTCGCCATTTCCATTTGTGACAAAAATCGTAGGGTCGCCTGCGGTCGTTTTTTGAAAGCGAATGCCTGCTGCGTAATTCGTATTCGGCGATCCGTAAGTGAGGACGCCATCAGTTTCGCTAGAGCTTCCGTTGTCCTTAACCCATAGGCGAAACTCGTTGATGCCGGGACGTGTGTTTTTCATCGGACGGAAGTATACGCTGGCTTCATCCGACTCAATGTTTACCGTCTGGCTGGCGTCGAGGATGACCCTATTCTCATCGGACCGCAACGCAACCACGCCGTAGACCGAGTGGACTGTCAGCCCGCTTGCCGTCGAGTACGTTGTGTCACGAAAAGCCAATGTTCCGGACGCATCTTCGTTCCCGACCGCATCGGCATACGTTGAGATTCCGAAATCGCTAAAGTAAAGCGAGCGATTAAGCGTGTCATTTCGTGCCCGCAAATACCCGTTTTCGAATTTCAGCCGGATTTGATTCGTTTCGGTTTTACCGCGCCAAGACCGCGTGTGTCTGCCTCGCGATTCAATATACGGCCCCTCGATTAGCGTGTATTCTTCCAGGTCGCCGCCGAATATGCGAATACGGTTCGTGTTGATTTGGCCGGCGGTGAGGAGGTTCGTATTGATTCCGTCGGCTGTTATCGCGTTCGGAAACGTGATGCCCCCGTCAGTCGTGACGCCTAGCCCCGCCGACCTGAACGCAACGAATCGCAACGGGTCATTCGGATCACGCGCCACAATACCCATTCCTGGCGGATACTCAAGCTCCGTCAGCGAGTTGTTCAGCGCCTCAGTTGCCCGCCGTGTTGCTTCGTCTAGCACGTCGTATTTCAGCTTGCCGGTGTCACTATCGAAGAACTCGTCGAACAACCTTTTCGTGAAATCAGTAATCGCCGATCCGTAACTTTTTCGCAAGGTTGATAGTGTGACTCTTGGCGCCTTGTTCGTGCCGGGATAGTCTTCGATTTCCATAACGCGAAGGTCGAGGTCGATGCCCAACGGTTCGTAGATTGTCGGAATCCTGTCGCCCAGACCCGGCTTTTCGCTGAGGTAGCCTGCGTCTTTCAGAACGCTGAACTCGAGCTCGATTGATACCTCAGGGGCATCGATGAGTTGGCGCTTGAGTGAGGCGAGCAACGTCGGCTGATGAATAATCGTTTCGTTCGTGTAAGGCTTGGCGTGGCGAATGCCGAAGATGGCTGCGTTTGGCGACGTGTACTCCGCCGTGACCACCGGCACGTCGTTGACGACCTTTCCTTCGCCGCGAATGTACGTTGACAAATTACTCGTATCGACCGTGTGCTTGAACGTTTTGACGTTGTGGTTGTAGCGAAATTGTAAGTCGGCGTCGTTGCCGATCTGATTGCGAATCGTGACGTGCTTGTCGTCGCGCAGTTCAAACTCGAAGCCAAACCGCTGAATCGCCTTGTTGAATAGCGCGATACAGTTGTCGTCGCCGAAGTTTTCAAATTCGAGGTTGTTGAACGCGTTGATGACGCTATACGTCCAGCCCGTGCCGTTGAAGATAAAGTCGAGTATTTGCGCGATTGTCTTAACGCCTGTGGTTAGCGTGTCGTAGCGATAAACGTCGATGAGGTCAAAGAATTCGTGCGTGGCGAATACCGACTTAACGGGCGTTTGAGCGTGCACGCGCTGCTCGACCTGCTTGATCCGGTAACGGTGCTTCGTTGATTGATCCGTGATGAAGCACTCCTCTTCGACGAGGTTGAATGCGTGAGCGTTGACGTCGGTTTTCGGCAATAAAAAAGACAGCGAACGATTTCCGTTCACCGCCTGTTTGCGTGAGAGTCCTAGTTGATTGGCGAGAAAGTGAGACTCGCCGGATAACGTTGTGAGTTCGAGCAAGTTACGTCCTCCTTTCGTGTTAATTTATCGCAAAAGAAAAAGTCGCCGGTGAGGGCGACTTATTTATTTATGACTTTATTAAGCTCTCTACGTAACAAAGCTGCCTCAAGCATAAGACACTCGTTAGACCTAACCCCATAAAGGCTATCTTTCTCTCTAACCAAAATATCATTACCGTTTTCATCTTTTTCATAAATGTCGTCCCACTCGTCATAACAAAGTAGCCCGTAATCAAAGGCGTCCAACCCATGCCTTGAAAATGCTTCTTCTATTTCTTGTGCTATTACTCCGATATGAACCCGAGCCTCTTCCCCTTTTTCTTTAACGGCGTCAATATACCGGAACTGTTGGTAGTTCACTTCACTCCAGGCATCTAGCACAGCATCATTAATCGGTAGTATGGAAGTCTTATGGCGTCTGTCCGACGTATTAATTACCCCTCCCGAAGCAAACACGTTAGTAAATCGATTGGTTGATTGTCCAAGTGAAGCTGTGTTATCTACGTACGGCCTCACGGCAACAGGTGCATAAAACGTTGTGTAACTCCCATTAATCAGCATTGTCGTCCCTGTTCCTATCCGAAATCTAACTTGCTGACCATCAGCAGCTCTTGCGTCAAACTCTCCATTTTTGGATACAATTGAGCTATTGATATGGAATTGTCCCCCGACCTCAAACATCCCTTTAACGTACGTATCTTTTGAGATATAGTTGCGATCTGAACCCGCAGAAGATAAATTCACATATTCGTTAGATCCTAGAAAAGTAGTTCGAAGTGTAGCATTGAACATGTCTAGAACAAGATTGGTCGTTTTAGCTGTCGTAGAGATCGGACCATAGAAATCATTGTCGTCATAATTAATGGGAGCACCTGTTGTTGAAAGTCCCCCGGACCCACCTTCACTATGACAGCCGATGAATGTTATCCTGTTCGCACGATCAAGACGAATCCGAAATGGCTCCCAAGTTGAGAATCTACTTGAGATAAAGCGCATGCCTTGAATAGCGCCGCTTGCATTACCCGCCATACCATCTATGGACAACGATCCCCCGCCATTATCCGTAAGGTAATTACCGATCGAATCGTCTCTACGATGCTTGGTATGGTGATTCGTTCCGTAGAAGGAGGAGGCGATTACTGTAACATCAGACGCCCCAAAGTTCCCGCGTCTATCAGTAACGGTCGAGCCGCTGGTTGCATCGTAGTACGGGTCGCCATACCCTTCCGCCCCACTCTTCGGCTTAGCTCCTTGAATCTTCAGACCCCATTTTCCGCCCATTGTGAAAACGGAATCTAACGTCAAACCGTCCGCGCCGTTTCGTACAGTCCCGCCGTCATAAGGTACTCCGTTCATATCCGTAAACTCACTTGTGAAGCTACTCCGGGTTACGTCCATCCAAATCGATGCTTCTCGCCAGTAGCCGAGAACGTGGACGTTCGAGTATTTGTTATGAACGCGACAGCCGAGAAACACTCCTACGTCCCATTTAGCGCCGTAATTAGTCGGAGAATTGTCGTTCTTATCGAAGTCTAAGAAGATGCTGAAATTCTGAAGCTGAACGTATTCCGCTTGAATGTTCAGTGCAACAGAAAGTTCCGTGTCAGACGTCTCTAAGGCGTTCGAGAGGTGTTTGATACGTGTACGTAGCCGTTTTGTTCCTGCGCCCTTCACGAGGAATCCGCTAGTTTGCTTGTAGTCAATCAACGCCTTATCAGCGTAACCATTTCCCGGGCCGACTCCCCTAATCGTGATCGTTTTCTTAACCTCAATCTCGCCGGTGATAACCGACACACCTGGCAGATGTACCACGCCGCCGAACTGTTCAGCGAAAGCTATCGCGTTTCTAATCGCCTCAGTATCGTCAGTAACACCGTCGCACAGGGCACCAAACCATCTAACGTTGACGACCATATCTTGGATGTCGTTATCTAGCGCGTCAATACGTTCTTTAATGACCGGATATGATTCGCCTCTAGCATTCGTCCTCGCTTGAATCGTTTCGGCGTCTGATGTACCGCTGTCTACGATTAATTGATTGAGCTGATTCTGTAAAGATTCGGCCCGTTCATTGCTTTCCGTTGCTTTTTGTGCAGCCTCTTCCGACAGTACCTTCGCTCTGCTTGCGTCTGTTTTCGCTCCGGCCGATTCAGCCCCTAGCGTGCGGAAGTTATCGTTTAAATCATTTCGAAATGTGCGATCAAACGGAACGCCGATTTGTTTATAGTTTGCCATAATGTTGCGCTCCTCTCTATAAATAATAAAAACGGAAGTCAAACTCAACGCTATAACTTCCGCTTGCTCCTGTTAGTCGTATGTCGTTCCATCCCGGCGCCAACCGAATCAGCTTGCGGTTAGTGTCGCGAAACACGTTCACCACGTCGTTCTTCAGCGACCTTGTGCCGGCGATAGCAAGCCGGTCGGCCGAGCCGGTCGTGCCTGTGTAGCGCCATACGTCCGAGGTTGTGACGTTTCTTATTTCAAGGTTCGTCGAAGCCCCCGATACGTAATCGCCAACGGCATGTTGCGTGGGTCAACCGTCGCGTCGCCCGGATTGTAGATGCGAAAGCTTGCCGTCGTGTGCCGGTACTGCATGTCGTTCGCCTCGATTAAGCCTTGGCCGACTTGCCAAACGCCCAACTGGTCGAAGTCCTTGGCGGTGAGAGACGTGGCGATGGATTCGCTAAATGGCGAATGACTGACGAACTCCAACGCAAACTCTCCGTATGTGCCGAGTTTCTCCGGCGACCATTCGGCTGCAACTTCGACCTTCCAACGCTTAGCTGGTTCAGCGTCAATGACGATATAGAATTGATCCAACGTCATCAGCTTACGGAACAGCTCGTTGCGAAGAAGGGCTACGTCCTGTCCGTCGACTGCAAACCACGCACACTGCGCCCTAATCAACCGAGGTCCCCACGTCTTACCTAGCCGTATCTGACCGTGCGACCCTTCGCCCTTTTCCGTTTCAATGATCGGCTGAGGCGAAGAAATCACGAGACTGCGAACAAGCACGCCAAAATAGCCCGACATATCGAACTCTTCGCCGGACTTCGTTACGATTTTAAAATTGTCGAGTCCTTTCATCGCTTCACCCCCTGCGTTCTCATTTCCTGCGATAATTGGGTGCTGAATAGACGGTCGTTGTCTCTGAATGTTGCCGCCGCTATTTCCCTGCCGTCAACTTCGACGCGAACCGGTCGGCTTCCAAGCGCCTCGATCGAGCGAATCAACGGACCTAGGTCGTTTCCACTGCCGGACTGCTGCGGAGCTCGAGCAATCCCTGCGTCCAACATGCGGAAAAGATTCGACTGTTGCGCCTCGGTCAGTACCATTTCGTTTCGAAGTAATCGCACATCAATTTCGTTGTGCAAAGGCGATTGGGCAAACTGCTCCGCAATACCTCCGACGTGTAGCTTCGGCAGTGGTCCGCCTTGGTGACGTTTAACCGATGCTTTGGCGCCCGTGCCGCTTCCGCCGTGTTCGGTAATGTAGACGGCTTTTGTGATCGACTTTCCGAGGCGTCCGTTGAGGAGGTCCGCTTGCCCGATAAGGTCGAGAATCTTCGAGCGTACCCCGTTTAGTGATCCGATTTGGCTGTCGATTGCGTCCCTTGCCTCTCGGTATTCCGCAGTGTTCATTTCGCCCGACCGTTTGAGTTCGTCGAGTTTTCCTTTTTGCTTTTCGAGTTTCTTAATGCCGTCATCGAGCGCCTTCATTTCTTCGCCTTTTTTAGCGTTGATTCCGGCTTGCTTCAACTCTAGCGCCACCATGCTTTGCTTAACCTCGTCGAGCTTTCCGATTTGCTCCTGAATCTTATCGACTTCCTTCGCCTTCTCTAATACAACTTCCGCCTGCTTGGCGCGCTGTTTTGCGATACCTTGCAGTTTGTTGTCGTGAAGCGCAATGGTTTCGTTCAAGCGATCAATTTCAAGGGTATCGTTATTCTGCTTCGCAATAGCAAGCTCGTCATTTAAGCCAGCGATAATGCGGATCTCATCTTGCTCCATTTTGTCGAGTTCAATCATTCGCTCTTTTGCCGTGTTGATATCCTTTTGCGCCTGCTCTTCATCGCGAAGCATTTCCGCCATGTTGGCCTCGAGTTTAGTCCGCTGTGCTTCGAGTTCAAGTCGGATCATTTCGAGCTGCTCGTCGTTGTATTGTTTCGCTGCGTCCGTATTCTCAAGTAAGATGTTACCTTGCTCGGAAAGCACTGTGTTCGATTCCGGAACCGCTTCGACGATTTCATCATTCAGCCCAATGAGACGATTAAGCTCATCGTTTGAGAGTCCGGACTTCTCTCGCAACCTTTCCTGTTCATCTGTCAACTTTGCGATGGTGTCCGGGTTAACTGTTTTTGCGAGCTCCGAATTAATGTCGACAAAGCGTGCGAACTCATCAGTCGTAAGCTGCGATTTACTTCTGAGCTTTTCAAATTCCTCAATGTTGGCGCCGAGTTCATCACGTTGTTTCATCATTGCGTTTACATTCTCGAGGGTGACATCGTTATAGTCACTTTGATACACGCTAGCACTGGCGAACAATCCACCTAGCACAGACAATGCCGTAATAAGCCAGCCGGCCGGCCCCATTGACACCATGAGCGTACGCATAGTAATAACTAGCTTTCCGATGGACGCAAGCACAAGCCCGATCGCTGCCGCCGTTCCTCCGAAAGCTAGACCGGCTTTGACCGTTGCCATGTCGACTTCGCCGAAGGCAAGTGTGATTTTAGACGCCTCACGAACTATGCTCGTGAACATCGGCAGGAACTCGTTTCCTAGCTTGATTCCAACCTCTTCGAGCGCTGATTGAAACTCCTTGAATGCTCCGAGTAAGTTATCCTGCATCCGTTTCGCCATCTTGTCGGCTGTGCCCGCGCTGTTTTCAAGTTCCTTCGTATAGTCAGCGAGTTGATCCTCACCAATCCCAAGCAACGTCGTAAATCCGGCGGCTGCCTCACGTCCAACTAACTGAGCCGCCGTGGCCGTCTTTTGTGCGTCAGACATGCCATCCATCTTTTCGCTTAGTTCTCCGACGATCCGTGACAACGGCTTGATTTTACCCTCATTGTCCGTGACGCTAAAGCCGAGCTCATCCATCGCCTTCTTCGTCTGACCAACCGGATTTGCAAGGGATAGGAAAGCCGCACGTAGGGAAGTACCGGCCATCGAGCCTTGAATACCTGCGTCGGACATTTTGGCGACTGCCGTGGCGGTTTCCTCGATTGAAAAGCCGAGTGACGCGGCAACCGGGGCCACCATCTTCATAGCCTCGCCTAACTGCGGTAAGTCCGTATTGGCCGTCGTCATCGTCTTAACGAGCACATCGACCGCATGACCTGACTCGGTTGCTTCGATTCCAAAACCGGTCATGATGTTCGAAACGATATCGGCCGACCTAGCGAGGTCAATTCCGCCTGCTGCTGCTGCATTTAGTACGGCCGGTAAGGCATCAATTTGTTCCTCGACGCTGAAGCCCGCCATTGCGAGGAACGTAAGTCCTTCGGCGGCCTGGGACGCGCTGAATTGTGTTGTTGCGCCCATCTCACGTGCTGCACTCTCCAACTTCGCCATATCCTCCGCCGAAGCATTAGAAATGGCCCCGAGCTTTGACATCGACTGCTCAAACGTTGCAGCAACGGCCACACTCGATCCAATCGCTGCCACTACTGCCCCGCCCATGACCAACGACGCTTTTTGAATCGCGCCGAAGTCCTGCTTCATCTTCTTGGCGCTGTCGCCCGTCTTGTTCATTTCCGCCTTTGACTCCTGGAGTTTGCGTTTGAACTCTTCGTTCGACAGCACCATCTTGGCGCGTATTTCTCCGACTTCTGCCATTCGTTTTCCTCCTTTCCTAGTTCATCATTGCACGCAACTGTTCGAACTTCTCACGACTGAAGCCGGCCGTCTCTTCCTGGACGCCTAGTTGCTTCGTCATTCCATGAATCAGCGTTTTATAGTCCGCATCCTCCATCGCACGATTGTTCGTGGCAATGCGTAGCTGTAGTTCTTGCATTCGTCTGGCTGTGTCGTACTTACCTTTGCGCCCAACCAACTCCATTAAATCGACCATATAATAGCCGGTTTCAATTTCGACCTGCGTCTTGCCGAGCAAAATAGCCGCGTCAATAATGAAGTCATCGACCGTCGCCTTTTCGCCCTCTTCCGTGCCTACTCCGCTGGACTTATCGGCAGAAGGCTCTTGACGTTTTTTGCAACGGAGTCGAGGCGGTTCTTTTTAACGGTGCGGGAGAGGTATTCGAAAATCTCATCCACACCCACTTCGGCGTTAATATAATCAACGTCCAAGTCGGACAGGACCGCTACAATCTCAATAATTTCGTCCGCCGCATATCCAAGGCCCGTCAACACAGATGTGTAAAAATCTTCTTGCGGTGCAGAGATAACCTGCACAATCATACCCGGCAGCCGATCTACTGTTGCGAATAGGTCGCGCCACCGATTAGGCGTCAGTTTATTTATTTTCACGCGCTTGTCGCCGAGCCACATTTCGTCCTCGTTCAGCGCCTTTAGTTTCTGAAATTGTTTCAACATGGGCGTTACTCCCTTCGTCTACTTAAATAAAAAGAGCGCCCGCTAGGACGCCCTTACTCTTATGCTTCTGCTGTCTCGTCACCCATTACGAATAGGCCATCTTCATTTTGATAAGCAATGAATGTGATATTCTGTACGCGCTCGTTGTCGCTATTGTAAGTTAGTTCTCCGTCGGAACTTGCGCCGGCGAGAGGAATAGTGATCCAGTCGTTTGGAGTCGCATCTGGATCAGTTGGCTTGATCACGAGTTTCTTTGCGTTAGCACGCATGTCGTAACCAGCCTCAGACTTAACAACGAGTTTCTTTTTCGACGGGTCAGTGCCATCAATTGTGAACTCGCTGTTAGGAGTTACTTTCGAAAGCTTTTCGATATCATGAAGTGCAAAAGGCACAACCGCCTGAGCTGTCCGACCTTTCATGATTGACTTGACGATCGTGTCGCCGTACTGGTCAATCGTGACGTCCTGCTTTGACGTCGTGTACGAAAAGACAATTCCGCCTTTAGTGAATTCGAATTTAGTCATGTCGGCACCTTCGCCATATTCGACGATTGCTGGGCCGATCGGTACGTTAATTCCTGCCATTTAAACGTCCTCCTTTATATTTGCAAAATAAAACGCCGCCGGGCGCTAGGGCCTGACGACGCAATCAAAATTCAGTGAATAGATTGGGCGGTTATTTTCATCGTCGCCCAGGTAAATCGGCACGCTGTTACTCGCACGTATCTTGACGACCGAACTGTCGCCCACCTTTACGTTAGTGAGGTTTGTCAGCGCCTCCTGTAACTCATACGCACGATCTTCCGCCTTCGCGGGATGAACGTCGCGCACGAATATTTGAAACGACGGCTGCTTCTTGGTGGTCCACTCGTCTGTCGGGAAACCGCCCGTCAGGCGAATGCGAGTACATGAGTCAGGTGCCGAAGCCGGAAAGTTGTTCGGATAGTAGACGCCAGGGACTCGACTTTTTACGAAGTTAATCAGATCCAGTATTCGCATCACTATCACAATCCTTTCGCTAAGGCTTTCGCCCACCAGTTTACGTATTTCTTCGCCTCGCCTTTTAAGGGGCGCTCGAGGTACTTGTTGCCGACGGAGTAACCCGCCATTCCTGTCGCCTTGCTCGACGCAGGGCCTAGATTGTAGTCCATTTCATGCGTCCAGTAAGCGTAGTTGAATCGCCCGTTGCCGTCTTTGCTTACTGCGCTGAAAGTTACTTCGCCTTCGATAGCACCGCGTTTAACAACCGTCTTAGACTTAGCCGAGCCTCCAAGAGCTCCGTATTCAATTGGCGCAATATTGGAGGCAATCTTGGCGAGGTCCATAACGGAATCCTCCATGATCGCTTTTCCAATGCCGGTGACGTTGTGGTCGGCTTTATCCAACTTTTCGAAAAAATCCGAGGCGTCAAATTCGAAATCACTCACACAAACACCTCTGTTAAGATCGGCTTGCCGGCAATGTTGCGCTGCACGTTGATTTCCTTCGGCTTACGACGCATGGTTTCGCCTAGCTCGTTCGTGAAGGAGATTTCATCCTCGTACTTAATGGCGACCAGTCCGTCGAGCAAAATGCGGGCATCTGCGACAATGACTTCGCCATCACGAGTAAGACCGCCGACTTTCGACTTGACGGACATTGTGCCTTCGTCGATACGGCAGCGATGTTCTACGGTAGTAGGGTCGGTCGGCCAGCCCCATTCGTCCGTGCCTCCGCCTGACTTTACCGTGATTGTTTGTCGCATTGGAATGAACGACATTAGCGCACCGACCTTCCGACACGTCGCGTGCCAAGTTTCATTCCGCCGTTAGCCTGGCCGATTAGATCAAGCGACCGCTGCGGAATAAAAGCGACTAGCTCGCGAGACTGCGTGTCTTTGAAATTAAAAGACGCCACTCCCGTTATCGAAAATGACGCCACTCCCTGGTTGTTAAGACGGTTCGTATCGTTATATGCTGACGCCAGCACAGCTGCAAATTCATAGACCGCATCTTCCGGAATCACATACTTTGGGTACTTCGACGAAAACGTCTGACTCACCACGTTTAAGATACGTTGCTTTTTCGCAGGATCAGCGTCTTGCCAATCCTCGATGTCTATTACATTCTGTTCTATATAGCTGTCAGCGTTAGAAATTGAAAACACCATGACGTTCACCTCCGGATTTATTTCGCGGAGGATTTACGCGCCTTAGGTTTCGGCTTCTTCTCCGCTTTCTTGGCGGATTCTTCCGCCTTTGCTGCTTTTTCAACTTCTTGCTGTTTCAACTTTTGGAGCGTCATGTGAATGAGCATCCCCATATTAAGCCCCTGCCGATACGCTAATCTTGATGACCTTAGATTCATCGTAAAGGTGAGCTGCGAAGTGTTGGTCGCCAGTAACTACTGTTTCTTTAGTTACAATGTTCCGGTCCTTTTCAACGGAAGCGTCTCGTTTCATGTAAAGTTTGAGCGCGCCTCTCTTCACAAGGTAAGCCGTACCTTCTTCGAGCTTGTTAGAACGTACTACTTGTGCGCCTAAGAGCTCACCGAAAGTGCCTTTTATTAGGATTTGATCACCAAGCTCGGAAGCCCTAGTCCAATCCTGACCCGCTGCTTTACGTAGTTTGAGAGCGTCCTTTGAATTCACAAAGAGAACCATTGCCCCTACTTCATCGTTCTCATCGTTAAACAAAGCGATGGCGTTATCGATTGTGTCGATGTCGAATGCGCTACCTTGGTAACTTAAAGGTGCCGATGTAACCGCAGCAAGAACCTCATTATCAATACCTGCAGCAATAGATTGAAGAATTTGCTCAACAGTTGCCCCGATTGGATTTCCTAAGCCTGAGAGCGCGGATTCGTCGGAGATTGTACCTCCTTTTGCTACTTTCTTAACTGTAGCTTGGTCAGTGGAAGTGGTTAATTGAGTTAGGTCAATAGGTTCCCCTTCCGCAACAACAGTTGCATCGCCAATGTAATTAAACTTAGGAACCGTAACTGTGTTACCAGGTTGACCTTGTAGCGTGTAGTCTACATCAGCAAGAGGAAGGAACTTAATTGCATTGGGTAGTTCAGCCGAAATCATGTCCTGCATTACCTGTGGATTGATCAAGTTTTCTAGTTTTGTTAGTGCCATTTAAAACACTCCTTATTTTTTATTAAAAACAGATAGTTTAAATAACTACCTGCTTAATTTCATGTATAAATCGGGGTTCTCACTATGGAGCCTTACTTTTTCTGCGTAATCCATCTTGTTAAATGATTCTTTAGAGATTCCTTGATACTCGTTTCCGTTAGTCTGTTCGCCAACCGGTTTCGGTGCTGCCTTTCTAAACAGTCCCTTACTTTCGGCTTTGTCCACCCATTCAAGCTTTCCTTCGGGCGACAAGTTATTCGGAATCAAATCGTGAAACTCTTCGGGAATTGTTGCGATACGTGATTCGAGTAACTTTCCGACCACCTCTTCTAACTCTACGTTACGTGCCTGTGCCGACTCAAACATCGATTTAAACTCTCCGAACTCTTTCGAAGTAGCTTCATACAACGACTTGAATTCGCCAGCCTCTTCTTGCGCCTTACGCTCATCGTCTTGACGTTGCTTTGTCAGCGACTCAAGTTGGGCTTGCGTTTCCTTATAGCGTTGATTCACTTCGTCAAAGCGCGACTTCGGAATCATGTGCTCGGCTCGTTCTTGTTCTGCGGATTTGGTTTCCGTGTTTTCCACCGCGTCCGGTGCTTCTTCCGCAAAGAATTGAAGATTTAAAGGTAGTAAGTTTTTCATTTCGTTTCCCCCTCGTTTTTATCGTGATCGACACGGATTGGAGTTGCGTCTTTTAATGACTTACGCCAGGTCGGGACGTCGGATCGGCGATACGTGATGCCGACAACGTGGATGGAATATTTCTCGGCGAGGTAGGTCACCAATATACGGATAAGGTCCAGGAGCGTCCGCTACTAATTTCACGACTTTACCTTCCCAATTACGGCAGGCGTCCGTTGCGCCATGCGAACTAATAACTCCGTAAAGCGCCCCACGATCTATCGCGTCGTTTCTGGCTGCCTCTCGGTAGGTAGTGTGTAGCTTCGTTCGCGTAACCATGTCCGCATAATCAGTTGGCTTCCATCGCCGGTTGCTTGCGTCTATGATGCCGGTGTTTAATGAGTCGCCCAACTGCCGACGGGCCTCCGCAAGAAAGTCACGCCTAATAGACCGGACGCCGTTTACGCCTTGTGCCATGTTTTCACGCATGACATCCGCCATTGCTTGACGGATAGCGTTGCGGGTTCTCCGGTCGATGTTCTGCGTCACCTGCAAGAGGTCGGCCTGTGTATCGGCAATAGCGAGCTCGATAAGTGACTTGTTCAATTCGTTGAACTTCACTAACTTCATTGCTTCCGAGAAAGTTTCAGCGACTCCCAGCGAAACGATTGAACGGGCCACGCTTTCTTGTGCCGCCTTTGGAAATGTTGTGCTGGCCCACGCTGTGATGTTTGCGTCCAAGTCTTTCAGTATCTCCCGAATCTGCGCTTGAACGGCCAATGCGTTAGCTCTCCGGAAGTCTGAAAGGTCCATGCGCATTAGTTCCGTATCAATATCGTCCATCGCTCGTCTGAAGTACCGTACTAGAGTCGCAACGTCTTTTTCATATACCGGCTCTGGAATTCGGTTCATTACTCATCGCCCTCAATATCGTTAAATACCGAAGGACTAACGAACTGATCCGCCTGTTCCTCGGCCTTGATTCGCTCAAGTTCCAGCTTCGCCTGTTCCTCACTAAGTCCAAGCGTTTCCATAACGGCGCTGAGTTGAGACTGAATTGGCTTGCTGCCGGTTCTTAGTTGAGCAATTTGCGCCTTCTCCATGTCGTCCTCTGGAAGGCCGTCTTTGAACTTGATGATCGGCACGGTAACTTCATACGGTAACTTTGTACCGGCTTTTTCGTGCTCAAGTAATTGTGCGATGAACAAAACGCGTTTCAGACCTTTGTCGTAGTACTGACGCTTTCTATTAATCTTCGCAAGTAACGTGTTCATACGATATTTAATAGCCAACGCAGACGAACCTGACGTCCCACTGTCCGTTTTACCGAGCGCAACCGGAGGTAGTTCCGATACGATGAACAATTGCTCCATCAGCAACTCCAACTCTTTAAAAGCTGCGTCAAGTTGGCCGTTCCATGTGATATATTGCGGCAATACATCGGATTTATCTGCTGCCTCAAATATCTTGTCTCTGCCGGCATGGAATATTGGATCGCCGTTCTCATCTTCTCCGAGTGAACCTTGCGGCACGACCATCGCAGGATCGGAATGTTTGTCGAGAATTTCAGCGATGCGACTAAGACGGTTATCAATCTCGGCGAAAATGGACTCGTTCTCGCTCAAGTCGTCAATCCCCTGCCAACAGTCATCCGTTGAGTAGTTCGGAATATGGACAACTAACGGAAATGGCACGCCTGTCTCAACTACCGTTTCTTTGTCCGTGATGGGCAGCTCTTTCTCGACTTTCCATTCAACTACCGCATTTTCATAATGCGAATGGCCGAACGGTGATAAGCGATACTTTGAATATCGGATTTCTCCGGGATAGTGCGATTCAACATTCAGCACCCAATCCTGATCGCCTGTTCCTTCAATAACCGATGGCACCGCAATATGGTAAGCAAAGACTGAGGTATCATCGCCGGGCAACGTTTCCGGAAAAACATACTCGGCTTTCTGTGCTTCAACAAAAACGCGATACGGGTCCACTGATTCCGAAAGCTTTCCTCCGAAACGTTGGCCATATCGCACCTTGTAAAATGAGTCGCCTCGATAAGCATTTGAAAGTGCCGACTCGTAATTCGTGATATTCAATTCGTTTTCCTCAACGATGCGTTCAAGCGCTTTTTGCTCCTCGGACTGGTCGTCTTTGCCTGCGCTGAAGATCGGCGTTTCACCGAACAAAAAGTCGGCCGATTTCTTAGCGATTGTTCCCGCCAAGTTAGATGCGATAAAGACAGCCAGGCGCCGGTCGCTCGGAAGCTTGTAGCTGTTAAATACTGCGTAATGATTGCCTAAGAATAACTGCTTATTCTTGCGATAACGTTGAATGCGGTCCTTGTGACCTTCGGGCGGAAAATAGCCGCCGGTTTCTATAAATGCCAATTAAAATACCTCCTGTTATAGCCCTGGCGGCTTATTTACGTAGGACTTTCTGCGATTGGTACCGAGCATCGATACAACGCCGGAGCATGCGTCTGGAAGATCGTCATGCTGGCCGGATGGGAATTGCTCGAGCTGTTCAAGCAAAAGGCTTTGTCCCTTCTTAAAACGCAAATAACCCGTCTCACACAACGGTTCAAGCGCCTCAATACGTTTCTCTTTGTTGGCGCCTTGGTGTGCGATGGGTTTTAGTTTCGTTCCATATATGCGTTCTTTCATCAGCCTGTCGCGCAACTGACGGTGCATGTCGTGGCCGACTCCGATTGTCTCTACTGCGAAAATCTTATGGCCGTATTCTCGAATCTTTTCGACCGCCATTTCCAGCGCCTTATGTGCCGGGCACTTTTTCGCCCATGCGTCGACCACGTACATAACGCCTGTTCTGCGATCACGGCCGAGGGTCACTATAGCGTTGTAATCCGACCGAGATGACTTACCGGCCGCGACGTCCCAAAATGCGTAATACTCTAAGGGTAGCCTGCGCCCGTCACTGTCGATGAAGTCTGACTCGTCAAAGTAACGGAAGGTGTCCGGGTTAAAGATCATCGACTCTTCATCGCGTGGCTCGTTCTGATACTCCGTGTTGAAGGCTTTCGAGCCGTTCGACCATTTCCAGGTCATCAGCTTCCAAATCGGCTGCACTTCCGGCCACAAGACGACAGCGCCTTGATCCATTTCCGCCTTGTTAGCTTCGTAGAATTCCCGCGCCATAGTTTCACGTTGATTCACCGGTACCTGGGCACTCTGATAAATCTCCTGGCACTTGCCCCAAAGGTCCATGCGTTCTGGCAAGTCAATCAACGCTTTATACTTATGCGATTTGAAATCAGCACGGTTGAGTAAGAGGTCGTTCAGCAAGCTCGCCTCATGTACGACAGTTCCCATCACAATAAACGCCGTCTTTTCTCCTTTGGCGTCCCCAAGAGGCATAACTGTTTGCGCAAACCAGTCCCTCAGTTTCTCCCGCTGTTCTTTTGTGGCTGCGTTTGTCTTAATATCTTCGAGGTCATCCCCGATTACCAAATCCGGACGTACGCCATTCCAGTTCCTGCCCCTCAGCGCTTGGCCGGTGGATGCAGCCTGAACCATCGTAAGTGTGCGAGGCAACCCGTCTGCCTGTTGCTCAAACGCCACAAACGTTTCCGAATTGTCCTTGGGGTTCATCTGCTGCTTAGGCGATAGTAACGGCCCATAGTCTGCACGCAGCTTTGCGTTTGTTTTAAGCTGTCCTGATAGCCATTCGAGGTTTGGTCCAGACACCGACGGCGTTTCCGAAATGATAATGATATATTTACGTTTGCGAAAAACTGTCTCGCGAAGTGGGGCCGCCTTGCTCAAGTACGAGCTTTTTCCGTGCGACCTGGGCGCTGCCCGCGCGATCTTGGCGTTCTTATTGACGTTAGATACCTCGTCGATTGAAGCACATATTTCCTTGTGGAAGGTGGCGGCATCGTCAACGCTGTCGAGCTCAAACGCGTCCCAGTTGCCGGGATTGCCGGGATTGTGCGTCTCGCTGAAATACTCTAGCGCAAAGTAAAGCAGATCGGTTTCGCTTCGGTGTATGCGTTCGAGGCGTTCAAGCTCCGTGAGTATGTCGTCCAGTTCCTCGTACTCATCCGACGATAGCATGCCGGCTTCGGCCAGCTTGGCCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP047095|2718485:2738047|2734861_2736337_-|WP_159129821.1|portal|DBSCAN-SWA MAFIETGGYFPPEGHKDRIQRYRKNKQLFLGNHYAVFNSYKLPSDRRLAVFIASNLAGTIAKKSADFLFGETPIFSAGKDDQSEEQKALERIVEENELNITNYESALSNAYRGDSFYKVRYGQRFGGKLSESVDPYRVFVEAQKAEYVFPETLPGDDTSVFAYHIAVPSVIEGTGDQDWVLNVESHYPGEIRYSKYRLSPFGHSHYENAVVEWKVEKELPITDKETVVETGVPFPLVVHIPNYSTDDCWQGIDDLSENESIFAEIDNRLSRIAEILDKHSDPAMVVPQGSLGEDENGDPIFHAGRDKIFEAADKSDVLPQYITWNGQLDAAFKELELLMEQLFIVSELPPVALGKTDSGTSGSSALAIKYRMNTLLAKINRKRQYYDKGLKRVLFIAQLLEHEKAGTKLPYEVTVPIIKFKDGLPEDDMEKAQIAQLRTGSKPIQSQLSAVMETLGLSEEQAKLELERIKAEEQADQFVSPSVFNDIEGDE >NZ_CP047095|2718485:2738047|2726128_2726740_-|WP_159129810.1|DBSCAN-SWA MKGLDNFKIVTKSGEEFDMSGYFGVLVRSLVISSPQPIIETEKGEGSHGQIRLGKTWGPRLIRAQCAWFAVDGQDVALLRNELFRKLMTLDQFYIVIDAEPAKRWKVEVAAEWSPEKLGTYGEFALEFVSHSPFSESIATSLTAKDFDQLGVWQVGQGLIEANDMQYRHTTASFRIYNPGDATVDPRNMPLAITYRGLRRTLK >NZ_CP047095|2718485:2738047|2718847_2719357_-|WP_159129805.1|DBSCAN-SWA MGVFKNQITCPGPTIPEYLINKIEIRATFPITKHEAFLFSVALHGISIKLEQEFLEKEKLPRLCALLTERGNFEFEELDVDDSIGFRFNLAVYCVGKWREKNYDDELILMVFLEELVHHYWNLEDEVITKEKVIEIMNLVLDYDISIGKIYSVDWLNDYLISQGNEPRF >NZ_CP047095|2718485:2738047|2721197_2723654_-|WP_159129808.1|DBSCAN-SWA MLELTTLSGESHFLANQLGLSRKQAVNGNRSLSFLLPKTDVNAHAFNLVEEECFITDQSTKHRYRIKQVEQRVHAQTPVKSVFATHEFFDLIDVYRYDTLTTGVKTIAQILDFIFNGTGWTYSVINAFNNLEFENFGDDNCIALFNKAIQRFGFEFELRDDKHVTIRNQIGNDADLQFRYNHNVKTFKHTVDTSNLSTYIRGEGKVVNDVPVVTAEYTSPNAAIFGIRHAKPYTNETIIHQPTLLASLKRQLIDAPEVSIELEFSVLKDAGYLSEKPGLGDRIPTIYEPLGIDLDLRVMEIEDYPGTNKAPRVTLSTLRKSYGSAITDFTKRLFDEFFDSDTGKLKYDVLDEATRRATEALNNSLTELEYPPGMGIVARDPNDPLRFVAFRSAGLGVTTDGGITFPNAITADGINTNLLTAGQINTNRIRIFGGDLEEYTLIEGPYIESRGRHTRSWRGKTETNQIRLKFENGYLRARNDTLNRSLYFSDFGISTYADAVGNEDASGTLAFRDTTYSTASGLTVHSVYGVVALRSDENRVILDASQTVNIESDEASVYFRPMKNTRPGINEFRLWVKDNGSSSETDGVLTYGSPNTNYAAGIRFQKTTAGDPTIFVTNGNGDMGTGKLDADKVGANDVTTRGGTYSVYWNGRGGGTLGGSGANDFVLQAGGIKTTGDNFYLGCGPTDGEVRVTNHLGYNDGSTISYRPIKASEYRNGSSVLTKQNIEPVTERGLDVINKLEIKRYILNEDVNVGNYSNWQIGVLSELSPEIATQDLSAINVYKYVNYLALSVQELSAENKAQAQEIADLRLLIEGGEA >NZ_CP047095|2718485:2738047|2732510_2733326_-|WP_159129818.1|capsid|DBSCAN-SWA MALTKLENLINPQVMQDMISAELPNAIKFLPLADVDYTLQGQPGNTVTVPKFNYIGDATVVAEGEPIDLTQLTTSTDQATVKKVAKGGTISDESALSGLGNPIGATVEQILQSIAAGIDNEVLAAVTSAPLSYQGSAFDIDTIDNAIALFNDENDEVGAMVLFVNSKDALKLRKAAGQDWTRASELGDQILIKGTFGELLGAQVVRSNKLEEGTAYLVKRGALKLYMKRDASVEKDRNIVTKETVVTGDQHFAAHLYDESKVIKISVSAGA >NZ_CP047095|2718485:2738047|2736352_2738047_-|WP_159129822.1|terminase|DBSCAN-SWA MAKLAEAGMLSSDEYEELDDILTELERLERIHRSETDLLYFALEYFSETHNPGNPGNWDAFELDSVDDAATFHKEICASIDEVSNVNKNAKIARAAPRSHGKSSYLSKAAPLRETVFRKRKYIIIISETPSVSGPNLEWLSGQLKTNAKLRADYGPLLSPKQQMNPKDNSETFVAFEQQADGLPRTLTMVQAASTGQALRGRNWNGVRPDLVIGDDLEDIKTNAATKEQREKLRDWFAQTVMPLGDAKGEKTAFIVMGTVVHEASLLNDLLLNRADFKSHKYKALIDLPERMDLWGKCQEIYQSAQVPVNQRETMAREFYEANKAEMDQGAVVLWPEVQPIWKLMTWKWSNGSKAFNTEYQNEPRDEESMIFNPDTFRYFDESDFIDSDGRRLPLEYYAFWDVAAGKSSRSDYNAIVTLGRDRRTGVMYVVDAWAKKCPAHKALEMAVEKIREYGHKIFAVETIGVGHDMHRQLRDRLMKERIYGTKLKPIAHQGANKEKRIEALEPLCETGYLRFKKGQSLLLEQLEQFPSGQHDDLPDACSGVVSMLGTNRRKSYVNKPPGL >NZ_CP047095|2718485:2738047|2729509_2729671_-|WP_159129812.1|DBSCAN-SWA MQELQLRIATNNRAMEDADYKTLIHGMTKQLGVQEETAGFSREKFEQLRAMMN >NZ_CP047095|2718485:2738047|2730839_2731208_-|WP_159129814.1|DBSCAN-SWA MRILDLINFVKSRVPGVYYPNNFPASAPDSCTRIRLTGGFPTDEWTTKKQPSFQIFVRDVHPAKAEDRAYELQEALTNLTNVKVGDSSVVKIRASNSVPIYLGDDENNRPIYSLNFDCVVRP >NZ_CP047095|2718485:2738047|2730269_2730794_-|WP_159129813.1|DBSCAN-SWA MAGINVPIGPAIVEYGEGADMTKFEFTKGGIVFSYTTSKQDVTIDQYGDTIVKSIMKGRTAQAVVPFALHDIEKLSKVTPNSEFTIDGTDPSKKKLVVKSEAGYDMRANAKKLVIKPTDPDATPNDWITIPLAGASSDGELTYNSDNERVQNITFIAYQNEDGLFVMGDETAEA >NZ_CP047095|2718485:2738047|2723713_2725888_-|WP_159129809.1|DBSCAN-SWA MANYKQIGVPFDRTFRNDLNDNFRTLGAESAGAKTDASRAKVLSEEAAQKATESNERAESLQNQLNQLIVDSGTSDAETIQARTNARGESYPVIKERIDALDNDIQDMVVNVRWFGALCDGVTDDTEAIRNAIAFAEQFGGVVHLPGVSVITGEIEVKKTITIRGVGPGNGYADKALIDYKQTSGFLVKGAGTKRLRTRIKHLSNALETSDTELSVALNIQAEYVQLQNFSIFLDFDKNDNSPTNYGAKWDVGVFLGCRVHNKYSNVHVLGYWREASIWMDVTRSSFTSEFTDMNGVPYDGGTVRNGADGLTLDSVFTMGGKWGLKIQGAKPKSGAEGYGDPYYDATSGSTVTDRRGNFGASDVTVIASSFYGTNHHTKHRRDDSIGNYLTDNGGGSLSIDGMAGNASGAIQGMRFISSRFSTWEPFRIRLDRANRITFIGCHSEGGSGGLSTTGAPINYDDNDFYGPISTTAKTTNLVLDMFNATLRTTFLGSNEYVNLSSAGSDRNYISKDTYVKGMFEVGGQFHINSSIVSKNGEFDARAADGQQVRFRIGTGTTMLINGSYTTFYAPVAVRPYVDNTASLGQSTNRFTNVFASGGVINTSDRRHKTSILPINDAVLDAWSEVNYQQFRYIDAVKEKGEEARVHIGVIAQEIEEAFSRHGLDAFDYGLLCYDEWDDIYEKDENGNDILVREKDSLYGVRSNECLMLEAALLRRELNKVINK >NZ_CP047095|2718485:2738047|2731213_2731636_-|WP_159129815.1|DBSCAN-SWA MSDFEFDASDFFEKLDKADHNVTGIGKAIMEDSVMDLAKIASNIAPIEYGALGGSAKSKTVVKRGAIEGEVTFSAVSKDGNGRFNYAYWTHEMDYNLGPASSKATGMAGYSVGNKYLERPLKGEAKKYVNWWAKALAKGL >NZ_CP047095|2718485:2738047|2726736_2729493_-|WP_159129811.1|tail|DBSCAN-SWA MAEVGEIRAKMVLSNEEFKRKLQESKAEMNKTGDSAKKMKQDFGAIQKASLVMGGAVVAAIGSSVAVAATFEQSMSKLGAISNASAEDMAKLESAAREMGATTQFSASQAAEGLTFLAMAGFSVEEQIDALPAVLNAAAAGGIDLARSADIVSNIMTGFGIEATESGHAVDVLVKTMTTANTDLPQLGEAMKMVAPVAASLGFSIEETATAVAKMSDAGIQGSMAGTSLRAAFLSLANPVGQTKKAMDELGFSVTDNEGKIKPLSRIVGELSEKMDGMSDAQKTATAAQLVGREAAAGFTTLLGIGEDQLADYTKELENSAGTADKMAKRMQDNLLGAFKEFQSALEEVGIKLGNEFLPMFTSIVREASKITLAFGEVDMATVKAGLAFGGTAAAIGLVLASIGKLVITMRTLMVSMGPAGWLITALSVLGGLFASASVYQSDYNDVTLENVNAMMKQRDELGANIEEFEKLRSKSQLTTDEFARFVDINSELAKTVNPDTIAKLTDEQERLREKSGLSNDELNRLIGLNDEIVEAVPESNTVLSEQGNILLENTDAAKQYNDEQLEMIRLELEAQRTKLEANMAEMLRDEEQAQKDINTAKERMIELDKMEQDEIRIIAGLNDELAIAKQNNDTLEIDRLNETIALHDNKLQGIAKQRAKQAEVVLEKAKEVDKIQEQIGKLDEVKQSMVALELKQAGINAKKGEEMKALDDGIKKLEKQKGKLDELKRSGEMNTAEYREARDAIDSQIGSLNGVRSKILDLIGQADLLNGRLGKSITKAVYITEHGGSGTGAKASVKRHQGGPLPKLHVGGIAEQFAQSPLHNEIDVRLLRNEMVLTEAQQSNLFRMLDAGIARAPQQSGSGNDLGPLIRSIEALGSRPVRVEVDGREIAAATFRDNDRLFSTQLSQEMRTQGVKR >NZ_CP047095|2718485:2738047|2720372_2720567_-|WP_159130361.1|DBSCAN-SWA MVWISENKEWLFSGIGIVAITSLLGFVFKSKRSPGQKIDAGNHSNNVQGGKNVTVTFGETKHDK >NZ_CP047095|2718485:2738047|2734031_2734862_-|WP_159129820.1|capsid|DBSCAN-SWA MNRIPEPVYEKDVATLVRYFRRAMDDIDTELMRMDLSDFRRANALAVQAQIREILKDLDANITAWASTTFPKAAQESVARSIVSLGVAETFSEAMKLVKFNELNKSLIELAIADTQADLLQVTQNIDRRTRNAIRQAMADVMRENMAQGVNGVRSIRRDFLAEARRQLGDSLNTGIIDASNRRWKPTDYADMVTRTKLHTTYREAARNDAIDRGALYGVISSHGATDACRNWEGKVVKLVADAPGPYPYIGDLPRREIFHPRCRHHVSPIRRPDLA >NZ_CP047095|2718485:2738047|2731632_2731968_-|WP_159129816.1|DBSCAN-SWA MSFIPMRQTITVKSGGGTDEWGWPTDPTTVEHRCRIDEGTMSVKSKVGGLTRDGEVIVADARILLDGLVAIKYEDEISFTNELGETMRRKPKEINVQRNIAGKPILTEVFV >NZ_CP047095|2718485:2738047|2721054_2721198_-|WP_159129807.1|DBSCAN-SWA MNTNVDGNDVIHILSQRIAKLEIELAVKTAQLAQLEQQQTQGEVGSK >NZ_CP047095|2718485:2738047|2733371_2733971_-|WP_159129819.1|DBSCAN-SWA MKNLLPLNLQFFAEEAPDAVENTETKSAEQERAEHMIPKSRFDEVNQRYKETQAQLESLTKQRQDDERKAQEEAGEFKSLYEATSKEFGEFKSMFESAQARNVELEEVVGKLLESRIATIPEEFHDLIPNNLSPEGKLEWVDKAESKGLFRKAAPKPVGEQTNGNEYQGISKESFNKMDYAEKVRLHSENPDLYMKLSR >NZ_CP047095|2718485:2738047|2718485_2718764_-|WP_159129804.1|holin|DBSCAN-SWA MPELQKEHNSREIYDILTELRVDIGVIKTQLGYLSEVKKTAESADGKAAEALSLAKENAKDIEGIKKTTTWAIGLMVPSILTIVGILAAMVL >NZ_CP047095|2718485:2738047|2731967_2732357_-|WP_159129817.1|DBSCAN-SWA MVFSISNADSYIEQNVIDIEDWQDADPAKKQRILNVVSQTFSSKYPKYVIPEDAVYEFAAVLASAYNDTNRLNNQGVASFSITGVASFNFKDTQSRELVAFIPQRSLDLIGQANGGMKLGTRRVGRSVR >NZ_CP047095|2718485:2738047|2725904_2726132_-|WP_159130362.1|DBSCAN-SWA MRNVTTSDVWRYTGTTGSADRLAIAGTRSLKNDVVNVFRDTNRKLIRLAPGWNDIRLTGASGSYSVEFDFRFYYL >NZ_CP047095|2718485:2738047|2729826_2730165_-|WP_159130363.1|DBSCAN-SWA MWLGDKRVKINKLTPNRWRDLFATVDRLPGMIVQVISAPQEDFYTSVLTGLGYAADEIIEIVAVLSDLDVDYINAEVGVDEIFEYLSRTVKKNRLDSVAKNVKSLLPISPAE >NZ_CP047095|2718485:2738047|2719567_2720383_-|WP_159129806.1|DBSCAN-SWA MINRKQEITGGDHSNNVQGNEVTVHQHGMSYTDVKDIAMGVFKSNFYDLGEKIEAVVNERAEKLINKYLDELKAESPESLTNTKDPDVRFVIYEAQKNHARRNDDEIADLLIDLLVSRTKQKDQKFTTLVCNEALEIIPKLTMKQINILTVLFIVKHVNIGALFSAKDFPVFLDQFLRGITSGEIAFQHLQYSGCISISIGSADINKILSYSYPNDPIIDDLHLRYAKLIELWNTTKLANSSLTSVGIAIALSNFKKKTGVTWELENWIKE |
22 | Paenibacillus_phage(58.33%) | holin,terminase,portal,tail,capsid | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|