Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP016770 | Candidatus Planktophila dulcis isolate MMS-21-155 chromosome, complete genome | 2 crisprs | DinG,WYL,cas4,DEDDh,cas3 | 0 | 0 | 4 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP016770_1 | 301955-302094 | Orphan |
NA
Consensus repeat of NZ_CP016770_1
|
1 spacer
spacers of NZ_CP016770_1
>1.1|302005|40|NZ_CP016770|CRISPRCasFinder CGCACCGCTTAATTAAATATGCAATAATTGGACTCCGCTT |
CRISPR arrays and Neighbor proteins around NZ_CP016770_1
The CRISPR arrays of NZ_CP016770_1 >merge|NZ_CP016770|1|301955-302094|CRISPRCasFinder GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCCCGCACCGCTTAATTAAATATGCAATAATTGGACTCCGCTTGTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC >NZ_CP016770|1|1|301955-302094|CRISPRCasFinder GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC CGCACCGCTTAATTAAATATGCAATAATTGGACTCCGCTT GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC
>NZ_CP016770.1|WP_095696023.1|301415_301883_+|hypothetical-protein MKFDIKKVFPENPSKFEGFRIIRLIAALYMSVMVARSCIHLFAPDGGAQSIAGIDTSVEGGNNIIAIFHQWGAIQLILAILLIVLFFRYPGLTPLILLTLTLDPVLRFVAGQQMSLTTTGTPPGEALNGVSLYLLLVLFLGSLWNKKAKLDLSGL >NZ_CP016770.1|WP_095696022.1|300738_301407_+|hypothetical-protein MKIKSVAISATAFVLLGGVLGVQQYISSQITSKVQREMPNASGISASVPLADVPSNLTSDSIKSADINIKSFALKESGTKTSLNISASSISKAKPTLVGSLEITATIPASTITKSSEFNDAQIVGNTLQVSAGAGGMGTAILIPKYSNSQLYFELQSISLLGNQIPASSLPSDLQNQIKSRSQRSLTPPKGLKVKSVSLSSKGLSVKMFGNNIQLGNLGSGL >NZ_CP016770.1|WP_095696021.1|300306_300639_+|TipAS-antibiotic-recognition-domain-containing-protein MKFGIHSEQYQNNFSKEETQKFTEVFGELTQEFAEKLNTGVPSSDESVQVLVRRHYEFCLQFWTPTKEAYKSLAMTYILPSPYRDAYEEVAEGLGKYHYDAVCIWADKNL >NZ_CP016770.1|WP_095696019.1|299046_299586_+|hypothetical-protein MRKFLIVTIVSTLIIIVTYFLPSGVWAEFGELPAHPLIIHGVVVLLPLLAILLLAGLFWKNLLKKLHLPLIGALALSVVGVLAAKSSGYSLSAVVGLPKSHAQWGNYLVLLAIALVSSFVLFSYFSFYKKSKIASSSLGVLMAFLAVSAIGMTYVVGHSGAESVWKYRYEIGKDQQGLP >NZ_CP016770.1|WP_095696018.1|298599_299013_+|Rieske-2Fe-2S-domain-containing-protein MEPISRRSFIAGVCAVVALGGSEVPAAANTAVKKLPGGRLSVDLKAVPALAKVGGATRIGSVKGVPVAIARTGTSKYIAFNLLCPHQKVTVTQNEKGWVCNAHGSEFESDGDLALGPATTGLARVPMKISKGLATIG >NZ_CP016770.1|WP_095675835.1|297978_298590_+|hypothetical-protein MRKVIVSIATVIGLVLSSNVAFADSAKPGQSMTHMKTAAGVASTLEAAGVILYVQGGATSAVIGDNLSAATSQVVFHIPVTANKAGVQHVGSNIIFFNTANNQYLTLKNPVIDLAKGVVSATVPQAGDAKVDILTITNASTAKPKITNDKKAKQRTTAYAGTTLVLAPGVAATIASVLGLPAGSLPDGLAFGTADVTLYSKLK >NZ_CP016770.1|WP_095675834.1|297424_297967_+|DUF305-domain-containing-protein MLKKISVLLLAVGVILIPSGANASTHAKSLQNLGMNEIMFAQMMIPHHEQAISMSETALKKSRNQAILKLSNQIKSLQGTEKSQLAYWLKATDSSMTMDHDMQMSGMLTTKELASLKRLTGTQFDRTFLQLMIKHHQGAIEMLDLISDSRNAEAKALAKAIKSAQSKEITSMKLLLNKLK >NZ_CP016770.1|WP_095696017.1|296337_296832_+|site-specific-integrase MRPSLAIPHFETFVLLRFSDQVFEKHLKTAKTYRTIPFPVGLQDLITNHVNRFGLGPHGLLLQNRSGKIWRYKDASAMFREVARPLGLDKGEGLHQLRHTFVSVLIQLNLNAKQIQEWLGHESILETMDRYGHLFPNSLNQASQLLDSHVVLALQNKAEARMLA 
>NZ_CP016770.1|WP_150121963.1|295873_296434_-|GIY-YIG-nuclease-family-protein MSLQSSNAFQILDLRNVAKQKFQNAGLLMTGEYLTARIPVSCICQKCGAKTKQTLNGVMNGKTCKYCYHVGIKYGESAYLYLIIHKEFSSIKVGISNHEANLNRLEAHKKNGWELYKSFDFDTANEAEWFETKLLNWLRRDRQLGVHLVRELMPQGGFSETVDGNEISILEIEQKFLELLEIGMTD >NZ_CP016770.1|WP_095696015.1|294315_295320_+|NAD(P)-dependent-alcohol-dehydrogenase MKASFLNKDKNIYVEEIDVPTLDADQVLIRVESVGICGSDVHYYKHGAIGPYVVEKPIILGHELSGVITAVGKDVEKNRIGARVAVEPQRACKVCKQCKAGRYNLCPDIEFYATPPIDGAFCEFVKIQSSFAYDIPANISFDAAALIEPMSVCIWAAQKAGIESGSTVLIAGAGPIGVIMAQVAKAFGAKDVVVTDVIEKRLAFVKGFGATRTINSTTESVGSEKFDVFIDACGVPSAVYAGIKSTGPAGRVLLVGLGSDDMSLPVSHIQNNEILVTGVFRYANTWPIGIDLLASGKVNLDAIVTHHFALNDVEHGLRATASPDAMKVIIHPNN >NZ_CP016770.1|WP_095696024.1|302381_302864_+|SRPBCC-family-protein MTSEKVRSEIFDTGNPKIKSARIIVEASPSTIFAILSNPKSHRDIDGSATVTANVSGPEALVLGSKFGMKMRLGITYWITNTVVEYKKDELIAWRHLGRWRWRYELTTLGNGSTQVTESFDGTYAPAVAQVWLNFRKAYPWTQLAVAKTLVRLKAVAEAS >NZ_CP016770.1|WP_095675844.1|302873_303197_+|hypothetical-protein MKFLISVIDDLSNSGTPAEMVAIDAFNDQLRANGQWIFAWGMQAPETATVIDNRGGANSETGHPLFDSKEHYSGLWLIEAADAATAKKLAFDASNACNRKVELRPLH >NZ_CP016770.1|WP_190286211.1|303335_304322_-|DMT-family-transporter MDELPSFLPMQLMNQLTQVNQSKLISSKYMAVALSKTQRSGLLFAFLGIFAFSLSLPFTKLALKSFDPFFTAFARPVIAAVIAIPLMMIAKVPMLPRNLWKPTAFTAAGAVFGWPILIALALQRTTSAHVSVIAAVMPLVTAIIAVIKHKKHPGLSFWVASSLGTVLLVAFSITRGGGTNADLKTDLLIIGAVIASSYCYVEGAALTSHMPGWQVISWVVVVSLPIALPAAAFVYAQTNADYSFHGDALFGLLAIGLSSMYLGFFAWYRGLRDFGVAHGSQVQQLQAIMTLGWSALLLGETVTLTMALSAIGIVLCVLWALSNVNRVK >NZ_CP016770.1|WP_095675846.1|304239_305247_-|Gfo/Idh/MocA-family-oxidoreductase MTQKLRIAIIGAGRIGYVHAGSVNDTPELELVYVVDPFEENAKKVTAAFGGKVSNDPSAVIASGEIDAVIIGSPTATHIPLLRECIAAGVHALCEKPIDLDVKNVEEFRALANSAKTNITLGFNRRQDPQYKALKAKVASGAIGTVEQVILTSRDPGPAPQGYIAVSGGIFRDMTIHDFDMARNFVPDIVEVTAFGANSFCDYIKEEGDFDNISVIMKGSNNELITVVNSRHAAFGYDQRAEIFGDKGMLQISNLSDTTVKSFTKDGTTAGEPFMDFFLERYADSYRNELKLFIEGIKTGKVLGSTYDDGRAALILADAAHESAHTGKSIKVNLK >NZ_CP016770.1|WP_095675847.1|305283_306306_-|Gfo/Idh/MocA-family-oxidoreductase 
MSALPKPHIFTAAESKPLRWGIFGAGWISEAMVKTAQLNSNQQFVAVASRTPGKAEAFAQKWNIDSFHNSYEELAARDDIDAIYLGTLPSDRLEVALVAINAGKHVLIEKPITMDYDEAQQIYAAAKAKKVLAMEAMWTRFLPQMDIARQLVADGALGDVELVVSNFCQNNLGVTRLFTLGGGNPIIDMGIYPAALSQQFLGNPDEIHAFGKLHPNDIDEETHTFMRFANGSRSNFVLSARTTLPHWAGVSGSKGAITFGTPWFTPSSITFHESTFNGAQSTWVDDLGIPEHFGLIYQVHAFAQYVDQGLLEGPLYTHHDSLSNIKTVLEIGNLIGTRYK >NZ_CP016770.1|WP_095696025.1|306315_307392_-|transaldolase-family-protein MTQSPFLYMKENSPTVLWNDSADPKELKDALNWGIVGATCNPVIALTAIKADAPHWVSRIKEYAKSHPAATEDEIGWAMVKELSTNAAKLLEGEFEKYNGRNGRLSIQTDPRNFRNAKALAEQAVEFSRLAKNMIVKIPVTTEAISAFEEATYQGVSLNATVSFSVAQTVAVAEAIERGLKRREAEGLDISTMGPVCTIMVGRVDDWVKVSAEKLGSKVDPEILEWSGVAVFRNAHKIYQERGYRTRLLSAAFRNHMHWSEILGGDSVISPPYAWQVKINEMGITPNLNSVNEPIEARILDPLLENFPEFRKMYDVDGLAVEDFTNFGGTLRTLRGFLQSVNDLESFVRDVTVPNPDK >NZ_CP016770.1|WP_095675849.1|307396_308311_-|TIM-barrel-protein MTAQIRVGTAPDSWGVWFPSEPHQVPWDRFLDEVVEAGYHWIELGPYGYLPTDPKQLEDELGKRNLKMTAGTVFTGFHKEDESQWQRAWDQALAVANLVSKLGVEHLVVIPDLWRDDKTGQARESRTLSNEQWKRLAAGHNKLGKALLEEFGIHQQFHSHADSHIGTYQEVERYLQETDPKYSNLCLDTGHFAYYLGDNLKLMNAYPERIGYLHLKQVHPDILAETLKNDVPFGDAVAKGVMTEPGFEGVPKFAPIIERALEINPEIFAIIEQDMYGCPVDMPFPIAQRTREHILAATRAARVK >NZ_CP016770.1|WP_095696026.1|308320_310234_-|3D-(3,5/4)-trihydroxycyclohexane-1,2-dione-acylhydrolase-(decyclizing) MATRKMTVSQAVVEFLSHQYTVDGDHRERTIQGVFGIFGHGNVAGIGQALKQLSVENPSLMPYYQARNEQAMVHESSAFARMKRRRATFACTASVGPGATNMLTGAAVATTNHLPVLLLPSDTFANRASDPVLQQLEMPHDATLSVNDAFKPLSRFFDRVQRPEQLFSALMGAMRVLTDPVETGAVTICLPEDVQAEMIDVPEEFLADRDWHIRRPRAEAAQLAEVARVITSSKRPFIVAGGGVIYSDAHDALQTFVEQTKIPVGTSQAGVGSLNWDHPQLLGSVGATGTTAANRAAKEADVVIGIGTRYSDFTTSSRTAFQNPDVRFININIASFDAFKHGSALPVVADARESLKELTALLTTFATTSDYQSKYTQEKSEWDAVVDAAFVDQKRALPSQTEIIHAVQSASDATDTLICAAGSLPGDLHKLWRVRSPLGYHVEYAFSCMGYEIAAGLGAARAGATPIVMVGDGSYLMMHTEIVSAVAEGLKVIIVLIQNHGYASIGHLSESIGSERFGTQYRFKDQAGNNFESGEKLPVDLAANAASLGITVIDIKQTTSAIADLHAAVKKAKQSSTSTLIHINSDPLLYSPDGEGWWDVPIAPISTLKSTQDAYAQYKDEISLQRPLLGNGTKDKK >NZ_CP016770.1|WP_095696027.1|310235_311156_-|5-deoxy-glucuronate-isomerase 
MSSADKWYFRHGELSRDGWDVLLDPQSPPVAGWKYTGLRIGTLTESKSLTLPADTNERIIFPLEGQEFLVEYTHDGNSSSQILHGRTSVFHGPADFIYLPINTSATISGVGRIAVGQTPATKVKAVRYVAKEDVSISLRGAGRETRQVHNLGMPETLDADRMIVCEVIVPASNWSGSPSHKHDVYIPGKESELEEIYYFQSAVTRGAKTPPSSLPFGYFRGTSADSRPYDVNEEVHSGDVALVPYGWHGPAAAGPGYDLYFFNVMAGPDPDRAWNATDHPDQVWIRDSWQSQQSDPRLPYGSTERI >NZ_CP016770.1|WP_095696028.1|311164_312655_-|CoA-acylating-methylmalonate-semialdehyde-dehydrogenase MSTIVNHWINGAEFVSTSGRTSPVYDPALGVETKRVALANQAEIDAAIKAAKDAFPAWRDESLAKRQQIIFTFRELLNSRKGELAEIITSEHGKVLSDALGEITRGQEVVEFATGIPHLLKGFYSENVSTGVDVYSTRQPLGVVGIISPFNFPAMVPMWFFPIAIAAGNTVVIKPSEKDPSASMWVAKLWKEAGLPDGVFNVLNGDKESVDGLLNSPDVESISFVGSTPIAKYIYESASRTGKRVQALGGAKNHMLVLPDADLELVADSAINAGFGSAGERCMAISVVVAVEPVADKLIPKIVERMGKLRTGDGRRGCDMGPLVTREHRDKVASYIDIAEKDGATVVVDGRNPQVDGDANGFWLAPTLVDKVPTTSKVYTEEIFGPVLSIVRVKSYDEGVALINSGAFGNGTAIFTNDGGAARRFQNEIQVGMVGINVPIPVPVAYYSFGGWKQSLFGDTKAHGVEGVHFFTRGKAITSRWLDPSHGGINLGFPQN |
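
The array record above lists a repeat, a spacer, and a second repeat, together with a spacer header such as `>1.1|302005|40|NZ_CP016770|CRISPRCasFinder`. The following is a minimal sketch of how such a record could be sanity-checked, assuming (this is inferred from the values on this page, not from any documented format) that the header fields are spacer ID, spacer start coordinate, spacer length, contig accession, and detection tool, and that array coordinates are 1-based and inclusive. The names `SpacerHeader`, `parse_spacer_header`, and `array_is_consistent` are hypothetical helpers introduced only for illustration.

```python
from typing import List, NamedTuple


class SpacerHeader(NamedTuple):
    """Fields of a spacer header line; the field meanings are assumed."""
    spacer_id: str
    start: int
    length: int
    contig: str
    tool: str


def parse_spacer_header(header: str) -> SpacerHeader:
    """Split a '>id|start|length|contig|tool' header into typed fields."""
    fields = header.lstrip(">").split("|")
    if len(fields) != 5:
        raise ValueError(f"expected 5 '|'-separated fields, got {len(fields)}")
    spacer_id, start, length, contig, tool = fields
    return SpacerHeader(spacer_id, int(start), int(length), contig, tool)


def array_is_consistent(location: str, units: List[str], spacer_header: str) -> bool:
    """Check a repeat/spacer/repeat listing against its stated coordinates.

    `units` is the array split into its parts in order (repeat, spacer,
    repeat, ...). Coordinates are assumed to be 1-based and inclusive.
    """
    start, end = (int(x) for x in location.split("-"))
    header = parse_spacer_header(spacer_header)
    first_repeat, spacer = units[0], units[1]
    return (
        sum(len(u) for u in units) == end - start + 1  # total array length
        and header.start == start + len(first_repeat)  # spacer follows repeat 1
        and header.length == len(spacer)               # stated spacer length
    )


if __name__ == "__main__":
    repeat = "GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC"
    spacer = "CGCACCGCTTAATTAAATATGCAATAATTGGACTCCGCTT"
    print(array_is_consistent(
        "301955-302094",
        [repeat, spacer, repeat],
        ">1.1|302005|40|NZ_CP016770|CRISPRCasFinder",
    ))
```

For NZ_CP016770_1 the three parts are 50, 40, and 50 bp, which add up to the 140 bp spanned by 301955-302094, and the spacer start 302005 equals 301955 + 50, so the check holds under the stated assumptions.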
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP016770_2 | 1006759-1006855 | Unclear |
NA
Consensus repeat of NZ_CP016770_2
|
1 spacer
spacers of NZ_CP016770_2
>2.1|1006784|47|NZ_CP016770|CRISPRCasFinder GAGTCCGAGCGCGTTGCACTGATGGCAGAACTTGAAGGCGAACGTGC |
cas3 |
CRISPR arrays and Neighbor proteins around NZ_CP016770_2
The CRISPR arrays of NZ_CP016770_2 >merge|NZ_CP016770|2|1006759-1006855|CRISPRCasFinder TGCAGATGTTCTTGAAGAGATGGATGAGTCCGAGCGCGTTGCACTGATGGCAGAACTTGAAGGCGAACGTGCTGCAGATATTCTTGAAGAGATGGAT >NZ_CP016770|2|2|1006759-1006855|CRISPRCasFinder TGCAGATGTTCTTGAAGAGATGGAT GAGTCCGAGCGCGTTGCACTGATGGCAGAACTTGAAGGCGAACGTGC TGCAGATATTCTTGAAGAGATGGAT
>NZ_CP016770.1|WP_095696372.1|1005188_1006100_-|DMT-family-transporter MSEAQKATVHNHTELPARPDLIRLIIGIFGIGSSGPLIALSAMPVPTLIFWRNLGGSLMTLPFALRHKLDRTGVKWAVLAGIVLAVHFVGFFLSMRMTSVTAGTAIVATQPIFAAFFVKLTGGHIPTKAWLGMLISFTGVLVVTGIDLQLDRRSFLGDLAALISGALAAAYMLIGSRAQQTLATTSYTTICYFVCAMTALPMALLSGYDIVGFALREWWILLGLIIGAQILGHTMFNITLKRVSPAVVSMIVFFEVPVAAIVSLVFDIGKQPTLSIIPGVILILLGCILVVLRTRPESVMTEQ >NZ_CP016770.1|WP_095696371.1|1004346_1005192_-|PHP-domain-containing-protein MIDLHTHTICSDGTDAPFALVKKALAAGITTLAITDHDSTAGWEEAVSAIQPQIELVLGAEISCLTTDGISVHMLGLLFDGKNSEMQQMLSDSRDTRVPRMRKMVELMSADGINISLDDVYRATPEGATVGRPHLADALVANGVVATRDEAFLDLLNNESKYYVTHAAPTPVDAIEVIRKAGGVAVIAHPFASRRGQIITASTFTDLVAAGLNGIEVHHRDQSADEQSTLTAIAQELNLVITGSSDYHGTGKLNGLAENTTHQAQWEQLESLADARRVVKK >NZ_CP016770.1|WP_095676471.1|1003738_1004350_-|MarC-family-protein MNSLGAVTFATQAFVTLFVIMDPPGATPIFLGLVGDKSPRERVRLAWQAAGVSLFVIASFALFGRFILDYMNVSIEALQAAGGLLLLYVALQLLTGNKNTGTENASDNIGMVPLGTPLLAGPGAIVATMIYVQKADTNAQILGLVIAILAVHLIIGTVLMASTKIVGLIKDSGVTLLASIAGLLLAAIAVQMLANAIKAFAAS >NZ_CP016770.1|WP_095692708.1|1002376_1003738_+|DEAD/DEAH-box-helicase MSLTFADLPLRKETIDALHEHGFTSPFPIQEMVMPIALADGDVIGQAKTGTGKTLAFGIPVIERVIAPNDADWAQLPNQGKPQVLIVVPTRELCVQVTKDVEELSFNRGIRTLAVYGGRAFEPQIEALNNGVEIVVGTPGRLLDLYRQGQLTLKFVSRVVLDEADEMLDLGFLPDVEKIFTSTPARQQTMLFSATMPGDIIALARRFMNQPVHIRTQDNEDEGAVVSRIEQHVIRAHAMDKIEMLARILQADGRGPTIVFCRTKRTAQKTSDDLFERGFRAATIHGDLGQSAREKALNDFKAGKSDVLIATDVAARGIDIDGITHVINYQCPEDEKTYVHRIGRTARAGAAGIAVTFVDWDDLARWKMIDTALVLGLPEPVETYSSSEHLFEMLNIPAGSSGRMTKKSAAAVDKPKTDRPKSDRPRSEKAVEPKKPAADRIKRERTRTKRISE >NZ_CP016770.1|WP_095696370.1|1002124_1002376_+|DUF3107-domain-containing-protein MSSKKSEKAAKVRISIINVGSELSFDCHSTPAEIKSAVTAALTAQTPLSLQDVQGHEIIVPADKIGYVEIGEPAERRVGFGVV >NZ_CP016770.1|WP_095676864.1|1001472_1002108_+|TetR/AcrR-family-transcriptional-regulator 
MTTESATANNSRSDKSRLPRDERRAILLSAALEVFTAAGYHSAAMDEIADRANVSKPVLYQHFPSKLDLYLAVLDLHIDSLVFEIQKAISSTPDNERRVHVTIEAYFNFIENEGEAFRLLFESDMSVEPQVRERLNRMTYDCAAAVSGVISNDTGLPKEAAMMLGVGLIGYVQVTARHWLERDSKLTRQQAMDLVENLMWRGISGFPRTDS >NZ_CP016770.1|WP_095676468.1|1000235_1001417_-|adenylyltransferase/sulfurtransferase-MoeZ MKTPPLVTPGPALTVDEVRRYSRHLIIPDVAMAGQQRLMNAKVLCVGAGGLGSPALMYLAAAGVGTLGIVEFDTVDESNLQRQIIHGQSDIGKSKALSAKEKIAEINPYVNVILHETRLDNSNVMEIFSQYDIIVDGTDNFATRYLVNDACVLLKKPYVWGSIYRFDGQASVFWAEYGPCYRCLYPEPPPPGMVPSCAEGGVLGVLCATIGSIQTTEAIKVLTGVGEPLIGSLMVYDALDMTFRKIKVRKDPNCPLCSENATQTALLPDYEAFCGTLSEAAQEASSGSTITVQDLKAKIDNKDNFYLIDVREPSEYEIVNIPTAHLIPKQGFIDGSVLASLPQDKPIVLHCKSGVRSAECLAILKNAGFADASHVFGGVIAWAKQIDTTLPVY >NZ_CP016770.1|WP_095696369.1|999366_1000236_-|N-acetyl-1-D-myo-inositol-2-amino-2-deoxy-alpha--D-glucopyranoside-deacetylase MLSSYKGYRMLLVHAHPDDETINNGSTMAMYAALGADVTLVTCTRGEEGEVLVKDLAHLAAHETDSLGEHRVGELADAMKALGISDHRFLGEGEKKYRDSGMMGTEPNNRPDVFWQADLEEASSELVKIMDEVKPHVLITYDEIGGYGHPDHIQAHRVAMRASEKSSWNIEKIYWNVMPRSVIQEGIDAMKKLGSDFMGAEKAEDLPFAKDDSFVHAMVDGNAYVEKKMDAMRAHSTQIEVDGPFFALSNNLGLQVWGNEYYTLVKGEKSEPLDSRGHEMDLFAGVTPS >NZ_CP016770.1|WP_095696368.1|999055_999367_-|hypothetical-protein MQFLSSLLFGAMIAVSATLVHQTLPPVGVSVGIFATYLGIWYVGRHYGKRRYKLIALSAWLAVISIAGSFGVGEELLIQGDNQGSALLTIGFVAGVVAVLRNP >NZ_CP016770.1|WP_095696367.1|997470_999054_+|cysteine--tRNA-ligase MASMSLRTQIAQALGKRATIRLRDSDGGLRDIVGVLQSETELINRRGEVVNFNPDEAVAFRVIPVFNRRDVSTGSLSIYDTKSKSLHTIAGTDGVVRIYCCGPTVYRDAHVGNLRTFLLSDLISRTLQMTGLDVSLVQNITDVGHMADDFEEDKMLAESAKTKVDPFQIARTFEDRFHIDLERLNIEPAASYPRASEKMAEMITAIEKLIAMKRAYVGSDGSVYFDATSFPTYGALSGNKLDSLQPGHRYEFTDEGGKRFHADWALWKLAGARTQMIWDSPWGAGFPGWHIECSAMSIELLDAHVDLHLGGIDLRFPHHENERAQSNSLTGNETVDTWVHGEHLLFEGRKMSKSAGNVVLLQDVIDRGLDPLSLRFALLENRYRSQMDLSWASLEAAHSTLKRWRQLLSNAGTSAEMKFDQEVSDALTTDLDTPRAMQRIRTIEKDSTIGALDKRALFLFADQVFGLDLDRGVEQREVSSEIQALLDARITARAEKNWSLSDSLRDQLTNAGLEINDGAEGQSWSWK >NZ_CP016770.1|WP_095676475.1|1007445_1007940_+|DUF1003-domain-containing-protein 
MARNFGLDTPRETRRSLRGNIDPETFGRLSERFARFLGTARFLVYMTAFVLTWVLWNTLAPRDIRFDNYPFIFLTLILSLQASYAAPLILLAQNRQADRDRIALNEDRAQNARSIADTEYLTRELASLRIALGDVATRDYLRNELGDMAKEIVVELRKPESDAK >NZ_CP016770.1|WP_095676476.1|1007910_1009029_-|Mrp/NBP35-family-ATP-binding-protein MTTLESVHAALATVQDPELHRALPELGMVKSVEIKGSIAHLEILLTISGCPMRDRLQKDIESAVTAVEGISAIELTFGVMDEEQRANIKKLLRNGRESFISFAQKDSLTRVIGVASGKGGVGKSSLTANLAVSSAQKGLRVGILDADVYGHSIPRLMGLIGQRPTAIDQMFIPLESFGVKTVSMEMFKPERSDAIAYRGPLLHRVLEQLLSDAYWGDLDLLYIDLPPGTGDLAISLGQLVPSSEIVVVTTPQVAAAEVAERAGRIAHQIHQRVIGVIENMSAYPCAKCGELTSLFGEGGGEETSRRLSQLVGSDVPLLGKIPFSPDLREGGDAGAPVVISAPDSPSAKAIEAIVSQLIVREKSLLGVRLGLA >NZ_CP016770.1|WP_095676477.1|1009025_1009334_-|Sec-independent-protein-secretion-pathway-component MFFDFGAGELVGLAILAMILIGPERLPNLAVDAAKFVKRIREMASKATEELKDNLGPGFEDLKPTDLNPKSFIKKQLSSVLDDDDSTPATSKRTSTIDPDLL >NZ_CP016770.1|WP_095696373.1|1009353_1010472_-|trypsin-like-peptidase-domain-containing-protein MSINNGGPWWDAPSKSGLGKNITLRSAIVLALVVGVIAGAFGASSSGSLFGRSVNLVKSTSAIERPAGSVAEIAQRVLPSVVSIEAKSSNGGSTGSGFVIDSSGYILTNNHVIAASVTSGGDITVRLNDGSSFDAKVVGRDSSYDLAVLKIVGASLKALQFGDSDKVAVGDSVIAIGSPLGLSGTVTLGIISAKDRAVTAGESNSENSFINALQTDAAINPGNSGGPLVDATGSVIGVNSAIASLGSSFSSQTGSIGLGFAIPINQARKTADQLIKNGKATYPVMGISVDMNFSGDGAMIAKNAQAILPGGPAAKAGLKSGDIITAIDGRPITSPEELIVTIRSLNIGDSVVVTYKRGSESKSATLTLTASK >NZ_CP016770.1|WP_095676479.1|1010484_1011111_-|class-I-SAM-dependent-methyltransferase MNNNPHSYAESFIAEDAVKIAARARGLELGTLDASQGTGAYLRHLAHLLDAQSVVEIGTGSGVGSLWILEGMIASGTLTSIDDEMEHTSIAKLAMADADIAQSRYRFITNSVMDVMTKLTDRAYDLVVYRHNPEDLSFAISEAHRILRSGGVFVIDNFFGGSKVHDPAQRDPKTIALREAGKLIKGDTDSWVSSLIPTGDGLLLATKL >NZ_CP016770.1|WP_095696374.1|1011107_1012589_-|leucyl-aminopeptidase-family-protein 
MLHTVAPDLEALISADVLALGFTKKNDENIELVGSARLISSLEKYFGINLIDEIAFFAPSGKAGELFEIPVLHKDSTVDRLYLVAVGDGSLASLRAAGASLGRKVRGKAIELISLVCQSRAEIRAHGASILLGAYTWNLKTGKPAEIATIAIATKDGASVSEAGVIARALYTARDLIHTPANIKNPLWMAQEAKKIAEEKGLSISVLAGKELAQFGGLRAVGNSSPKPGPRFIEITYIPKGKARSAAALPHVVIVGKGITFDTGGISLKRPYEFMTAMKSDMAGAAAALATISALPDLQPQVKVTVLMMCAENSLSGTSQRPSDVITQYGGTTVEIINTDAEGRLVLADGLAYAVENLDPDYLFDIATLTGSATLGLGRQYAALYTRDEKLAKELVSVGESSGERVWHMPLIDDYQDSLESDVADLNHAADKGDYSAGSVTAALYLEHFVGDSRWVHLDIAGTGRSETDSGENAKGGTGFGVRFFIDWILSLS >NZ_CP016770.1|WP_020045748.1|1012604_1012784_-|DUF3117-domain-containing-protein MAAMKPRTGDGPMEVTKEARSLVMRIPLEGGGRLVVELNPQEANNLSAALEAAVALIKK >NZ_CP016770.1|WP_095676481.1|1012896_1013097_-|sigma-70-family-RNA-polymerase-sigma-factor MSSSSNPQTLAELLASLPEEERIILTLHYLRSKSSGEIATLLSVPERAVIVVIESGKTRLKAILGL >NZ_CP016770.1|WP_095676482.1|1013149_1013599_-|SRPBCC-family-protein MSSNTLSISLTIDAPREVVWKKIADWKSQGDWMLQTKVWVTSNQVEGVGTSIAAFTGPLHKFYPRLKSLGLLDLMVVTQWQPPHRCDVDHVGKVLKGSGSFQLSEINGSSTRFDWSETIVAPKAIFLLAAPFLYVGVRISLARFARSFT >NZ_CP016770.1|WP_095676483.1|1013601_1014141_-|TIGR00730-family-Rossman-fold-protein MRIAVFCSSSPTIDSKFIDLAFELGAGIAQSGAELVSGGGHISAMGAISRGARSQGGRTIGIIPQKLVDIEFADHDSDELIVVDSMRTRKAKIEDLSDAFITLPGGLGTLEELFEIWVGRYLKFHDKPVIILDPHGVFQPLHALVEHLENENFVKPGMRDLLHWTTTVEEAVAIAYGKK |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
615120 : 623049
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP016770|615120:623049|DBSCAN-SWA TATGAGCACTCTTGAAATTCGCGGATTGAAAGTATCTGTCGAGACCGAGCAAGGATCGATTGAAATTCTTAAGGGCGTAGACCTCACCATCAAGTCAGGTGAGACACATGCAATCATGGGACCTAACGGTTCAGGTAAGTCAACGCTTGCATACTCAATCGCCGGTCACCCTAAATACACCATCACAGAGGGCACAGTTACTCTCGATGGTGCAGATGTACTTGAGATGACAGTTGACGAGCGCGCAAAGGCAGGACTCTTCCTTGCAATGCAGTATCCAGTTGAAGTTCCAGGCGTTTCAGTTTCAAACTTCCTTCGTACAGCAGCAACAGCTCTTCGCGGAGAAGCCCCAAAGCTTCGCGAGTGGGTAGGGGAAGTAAAGAGTGCAATGGAATCGCTCAAGATGGATGCATCATTTGCTCAACGCAATGTGAATGAAGGATTCTCAGGAGGAGAGAAGAAGCGTCATGAAATCATGCAGCTCGAACTTCTCAAGCCTAAGTTTGCAATCCTTGATGAGACAGATTCAGGTCTTGATGTTGATGCGCTTCGTATCGTCTCTGAGGGTGTTGTTCGCGCAAAGGCTGCAAATAACCACGGAATCCTTCTGATTACTCACTACACACGCATCTTGCGCTATATCAAGCCTGACTTCGTGCATGTATTTGCTAACGGTCGTATCGTCGAAGAAGGCGGACCAGAGCTTGCAGATAAGCTCGAAGCAAATGGTTATGCGGAGTACGTCACCGCTTAATTATGACATTTGATGCTCACGCTATCGCTAAGGATTTTCCGATTTTGGATCGGACAATCCGCGATGGCAAGCGCCTTGTCTATTTAGATTCTGGCGCAACGAGCCAAAAGCCCAATGTTGTCATTAATGCAGAAAGTGATTTCTACCGTTTTCATAATGCGGCGGTGCATCGCGGTGCGCATCAATTGGCTGAGGAAGCAACGGATGCCTACGAAGGTGCTCGCGAAATTGTTGCTCAATTCCTCAACGCATCGGTCGATGAAATCGTCTTCACTAAGAACGCCACGGAATCTCTCAATCTCATCTCCTATGCGATGGGCAACGCTGCTCCTGGCAATCGTTTTCACCTTAAAGCAGGGGATTCAATAGTCGTTACTGAGATGGAGCACCATGCAAATCTCATCCCTTGGCAGCAGTTAGCTGCCCGCACGGGAGCAATTCTGAAATGGTTCACCGTCACAGATGATGGACGTCTAGATCTCTCTCAGATCAATTCAGTCATTGATGAGAAGACAAAAGTTGTGGCTCTCACTCACCAATCAAATGTACTTGGCACCATCAACCCGCTTGAAGCGATTACAAAGCGTGCTCACGAAGTGGGCGCAGTTGTAGTTCTTGATGCATGTCAATCTGTTCCTCACATGAAGACTGATGTGAAGAAACTCGATATTGATTTCTTAGCATTCTCAGGACACAAGGCAGTAGGTCCTACAGGTATTGGAGTTTTCTGGGGACGCGCAGAACTTCTTGCTGAACTTCCACCGTTTCTTACCGGTGGCTCCATGATTGAAAATGTCACTATGGAATCTGCAACATGGGCGCCAGCACCAAAGAAGTTTGAAGCCGGTGTTCCCAATATGGCACAGGCTGTTGGATTAGGTGCAGCGCTGACCTATCTGACTGGAATTGGTATGGATAACATCCATAAGCATGAGATATCACTCACCAAATATTTGCTTGAGGGTCTGTCTGCAATTACTGATCTGCGCATAATTGGTCCAAAGACAACAGAGCTTCGTGGTGGAGTAGTTTCATTTACTCTTGGAGATATTCATCCACACGATTTAGGTCAGTACTTAGATAGCCAAGGAATTGCAGTTCGCACAGGTCATCACTGCGCATGGCCACTGACTCGCAAGCTTGGAGTTCCCGCAACTACACGTGCCTCTGT
CTATCTCTATAACACCACAGATGACCTTGATGCACTCATCGTCGGCGTGCAGGGCGCTCAGAAATATTTTGGTCGCTAATGCAGCTCGATAATCTCTATCAAGAAGTGATTCTGGATCACTATAAGAATCCGCAGAACAAGAAGTTGAATACAACTTTTGATGCGCAGGTTCACCACATCAATCCCAGCTGTGGCGATGAAATCACTCTCAACGTCACACTTGAAGGAAATATGGTCAAGAGCGTTTCTTGGGATGGCGTGGGGTGCTCGATTTCTCAGGCAAGTGTCTCGATTCTTACAGACCTTCTTATCGGCAAGAGCCTTGAGCAAGCTCAGGTGATAGCTGATTCCTTTATGCACTTGATGCAGAGCAAGGGAACCGAGAAGGGTGATGAGTCACTTCTTGAGGATGCAGTCGCCCTTGCGGGAGTTTCTCAGTATCCGGCTCGCATTAAATGTGCGTTGCTGGGTTGGATGGCATTTAAAGATGCAAGCGTTCAAGCGTTGGCAAAGCAAGCCTAAAAAGAAAGTAGGATAGAGACATGGTTGCAAAAATCGAAGATATTAATGAGGCGATGAAGGATGTTGTCGACCCAGAACTTGGTATCAACGTTGTTGACCTAGGACTTATCTACGACATGATGGTCGATGACAACAATATTGCTGTCCTCAATATGACTCTCACATCTGCTGCATGCCCACTACAAGATGTGATTGAAGATCAAACACGTCAAGCTCTTGCACCTTTTACAGACGATGTGAAAATCAATTGGGTGTGGATGCCACCTTGGGGTCCCGATAAAATTTCTGATGATGGTCGCGAACAACTTCGCGCCCTCGGCTTTACAGTCTGATTCCTAGATCTGATCTCTGATGTCTCAACATGCAGTTTGGATGACCTTTCGGTCAATGACTGCAGATCCATCTGTTAAGTCTCAGAAGCTAAAGCCCGGAACAATCAAGAGAATCATTACCTACGGAAAGCCGTATAAATCTCACATCATCATATTTCTCATCACCGTAGTAATCGAAGCGCTCCTCGTCGTTTCAACACCACTGTTGCTTCGCGAACTCATCGATAAGGGCGTCTTGCCTAAAGATACGGGCCTTGTTACAAAGCTTGCTTTCCTTGTGGGACTGCTTGCTGTGGTCGATGCTGCATTTAACATCTTCGGGCGTTGGTACTCAGCAAGAATCGGCGAGGGCTTGATCTATGACCTGCGCTCACAAGTATTTGCTCATATTCAACGCCAATCAATTGCATTCTTTACTCGCACGCAAACAGGTGCGCTGATCTCTCGCATTAACTCAGATGTGATGGGTGCTCAACAAGCCTTTACTGGGACTCTGTCAGGTGTGGTGAGCAATGTGGTCTCACTCGTACTCGTTGTGACAACGATGCTCATTCTCTCTTGGCAGATCACAGTCGTATCTCTTCTTCTTCTACCTGCCTTTCTTCTTCCAACTAAGTGGGTTGGAAAGCGCATTCAAGCCCTGACTCGGGATTCATTTAATCTCAACGCAACAATGTCGAGCACCATGACTGAGCGCTTCAATGTCTCTGGTGCACTACTTGTGAACCTGTATGGAAAGCCAGCAAAAGAGGAGAACTTCTTTCGCACACGTGCGCGCAAGGTTGCAGATATAGGCATTCAGACAGCAATGCTTAACAGAGTCTTCTTTGTCGGAATCACAAGTGTCGCAGCGGTTGCAACAGCCTTTGCTTATGGCATTGGGGGACACCTTGCAATCAACGGTTCCATTACGGTCGGAACTCTGCTCGCTATCACTGCACTGCTCATTCGCCTCTATGGGCCGCTGACTGCGCTCTCTAATGTTCGCATCGATGTAATGACTGCACTTGTCTCATTTGAACGCGTCTTTGAAGTACTAGACCTTCAGCCGATGATTGTTGATAAGGCTGATGCAAAGGTTTTGAGGAAGAAAGACTTGAAGATCGATTTCACCGATGTTGCCTTTAGCTATCCGCGTGCGGATGA
GATTTCATTAGCATCGCTTGAATCGGCGGCAAAGCCGGAGACTGTTGAAAGCGGAGAGGTTTTGCGCGGAATTACTTTTAGCGCACCTGCAGGATCATTGACTGCAATTGTTGGCCCATCGGGCGCTGGTAAGACAACGATGAGCGCGCTACTTCCAAGGCTGTATGACGTTACACGTGGTGCAATCACCATTGATGGCGAAGATATTCGAGAGTTCACTGTGCAATCACTTCGTGACTCCATCGGTGTTGTTATGCAAGATGCACACCTGTTCCATGAAACTATCTCTGAAAACCTTCGCTATGCGAAAGAAGATGCAACTGAGGCCGAGATGATTGAAGCGTGCAAAGCAGCGCAGATTTGGGATCTGATCTCAACGCTTCCAAATGGCTTTGACACCATGGTAGGAGAGCGAGGCCATCGCCTTTCTGGTGGAGAGAAGCAAAGGCTTGCAATAGCCCGACTGCTTCTTAAAGCACCTGCCATTGTGATCTTGGATGAGGCAACTGCGCACCTTGATTCTGAGAATGAATCTCTCGTGCATGAAGCTTTAAAGCATGCTCTTAAGGGGCGAACATCGATTGTGATTGCACATCGATTGAGCACAGTGATGGAAGCTGATCAGATTTTAGTTCTAGAAAAAGGGCTGATCGTAGAGCGCGGAAAGCATGAAGAACTTATTGCTCAGGGTGGTTTGTACTCAGAGCTCTTTGCGAGACAAGACATCACGACGAATTAAGAATCGTCCGTAGATAACAAGGCCTCCAAAGAAGAGCCACTGCAAGGCATATGCCATATGTGGCCCATCGGAGAGTTCAGGAAGTTGCGCTGGAACAGCAGGAGTGAGAGCAGAATCTGATCCACTCAGTAAATCAATATAGAAAGTCTCTGTTGACGAACCAGATTGTGCATTTGCCTTTGAGACAAGTCCGTCAGCTTTGTTGCCAGGAATCGCAAAGAATGATCCACGCGGAAGAGAGGAATCTAAGCGAAGCCTTCCGGTAATGCTCACTTCACCCGTTGGTAAGACAGGTAGTTCGGGTTGAGAAGATGCGTTCGCACCAGCTTTGACCCAACCGCAATCAACCCAAAAGCTCTTTCCTTCAGGCGTAGTAAAGCGCGTGAGAAGTTCAAAACCATACACACCTTCTGAATAGCGATTACGCAAGAGTATTTGTGATGAGGAGTCAAAGATTCCTTCAGTGCGTACTGGTTGCCATTCATGGTCAGCAGGGTTGGATCGCACAGATGTCAGTGAGACAGGGCTCATGCTTGAGTTTGTTTCGATGACTGAATTGCGTGCGTGCCGATCAACACCACGGTGATACTGCCATTGCGCTGCCCAGATGCATCCAATAATCAGAAGTAGGGCGACGAGGGACTTAAGGAGCGAGTAACTCTCTTTCTTTCTTGTGTTACTCAATTCCTCTCGGCCTCTTCTTCAAGATGGCTTCACCTGGTAGAACTGTTTCGCGCCCTGCATTTGCAACTACGACTGCGATATAGGGCAAGGCAACGGCGCCGAGAAGTGCGAACCATCGATATGGACTGGGTAGTAACACAGTCAGGATGAAGCAGGCCGTTCGTATCATCATTGAAATGAAATAGCGACGTTGTCTCGCAGACTGGTCAGTAGATAACCCTTTTGGGGCGCTAGTGATGTCGTAAACGTCATCTTCTTTAGCCATAGAGCAACAGTAGGGTAGTTTTCTCTCATGAGCGAGCCGACTTCAATCGCACTTGTTACCGGTGGAAATCGCGGTATCGGTTTAGCAATCGCAACTGCTCTCAAGGCCGCAGGGCACAGAGTCGTCATTACCTATCGCAGTGGAACGCCACCGACAGGCTTTGATGCTGTGCAGATGGATGTCACTGATTCAGCAAGTGTGGATGCAGCATTTACAAAGATTGAATCTGAGATCGGACAACCTGAGATTATCGTTGCAAACGCAGGCATTACAAAAGATACTCTTGTCCTGCGTATGAGTGATGAAGATTTT
GAAAGTGTGATTGATGCAAACCTCACGGGTGCATTCCGAGTTGCAAAGCGGGCGACAAAGGGTCTGCTTAAGTTAAAGCGAGGACGTCTCATATTTATTGGTTCCGTCGTTGGGGGAGTAGGCGCTGCAGGCCAAGTGAATTACTCAGCCTCTAAATCAGGTCTTGTCGGAATGGCTCGATCCTTTGCCCGTGAACTTGGAAGTCGAGGCATTACAGCCAATGTCATTGCACCGGGATTTGTCGAGACAGATATGACAGCAGAACTTGATGAGAAGCGTCGTGTAGATATTGCAGCACAAGTTCCACTTGGACGTTTCTGCTCTGCCGCAGAGATTGCAGATGTTGTGGCATTTATTGCATCCCCGCAAGCTAGCTATATCACCGGTGCCATCATTCCAGTCGATGGCGGATTAGGAATGGGTCACTAAGTATGGGAATCCTTCAAGGTAAAAACATCCTCGTCACCGGCGTTCTCACCGATGGATCAATCGCATTTCATATTGCCAAGATTGCACAAGAGGAAGGTGCGAACGTAGTTCTTACTGGCTTTGGTCGTGCTCTGAGCTTGACCACTCGTATTGCAGGTCGCCTCCCACAACTTCCTCCGATCATCGAACTCGATGTCACCAATCAAGAGCATCTCGATGGTTTGGCTGAGCAAGTAAGAAAGCATGTCCCACATCTTGATGGTGTTGTGCACTCCATCGGCTTCGCACCTGAGGCAGCACTGGGTGGAAATTTCCTGAATACTGCATGGGAAGATGTCGCAACTGCAGTGCATGTATCTGCATACTCGCTTAAATCGTTGACAATGGCTTGCAGACCAACATTTAAAGATGGGGCATCTGTTGTCGGCCTTGATTTTGATGCACAAGTTGCCTGGCCAAAGTATGACTGGATGGGCGTTGCTAAGGCTGCTCTTGAATCCACATCACGCTATTTGGCTCGCGATCTCGGCGCTGAAAATGTTCGTATCAACTTAGTTGCAGCAGGCCCTATTCGCACCATGGCTGCAAAATCAATTCCTGGCTTTGATGAATTCGAAAAGGTGTGGAATGAACGAAGCCCACTGGAATGGGATGTCACAGATCCTGTTCCTGCAGCAAAAGCTGCCGTTGCACTCTTAAGTGATTGGTTTCCCAAGACTACGGCTGAGATCATTCATGTCGATGGCGGTCTGCACGGTATGGGCGCCTGATTACGCACGGATTTTTCATTCTGAGCCCGCTCAGGTAGCCTTGCCCCAGACCAGTAGCTCAGGCTGCTGCAAGAGGAGCTCACCGCTCCCACCTCTTTCGCTTCGGCGGATTGGTTAGTCGGAAGTAAACCTGAATCAGGTTTTACTCTTTCTATGCGGTGCGCGTTTTTTGTAGCGACTTGGATACTCAAGAACCGAACATAGGAGCAGAAATCACAACAGAGCCGCGCATTAACGACCGTATCCGCACACCCCAGATTCGTCTCATCAATTACACCGGTGAACAAGTTGGAGTTGTAGATATCGAAGCAGCGCTAGCAATGGCAGAAGAAATCGGACTCGATCTCGTTGAGATCGCACCGGAGGCTAATCCGCCAGTGTGCAAGATCATGGACTTCGGCAAGTACAAGTACCAGGAAGCACAGAAAGCTCGCGAAGCACGTCAGAACCAGACCCACATTGTGGTGAAGGAAGTGCGTATGACACCAAAGATCGAAAATCACGACTACGAGACAAAGCGCTCAGCAGTCGAGAAATTCCTTAAAGGTGGCGACAAGGTGAAGATCACGATGAAGTTCCGTGGCCGTGAGCAAACACGTCCAGAGCTTGGGTTCAAACTCTTGCAACGTCTTGCAGAAGATGTGAAAGAGATTGCATTCGTGGAGTTCGCTCCTAAACAAGAAGGTCGACAGATGACGATGGTCCTAGGGCCAACGAAGAAGAAGACCGAAGCGGTAGCAGAACAGAAAGCGGCGCGAGCTGCCAAAGTGAAGGCTGCTGAGGAAGAAGCAGCAGCCACAC
AATAG
Protein sequences of DBSCAN-SWA_1 >NZ_CP016770|615120:623049|615120_615873_+|WP_095676118.1|DBSCAN-SWA MSTLEIRGLKVSVETEQGSIEILKGVDLTIKSGETHAIMGPNGSGKSTLAYSIAGHPKYTITEGTVTLDGADVLEMTVDERAKAGLFLAMQYPVEVPGVSVSNFLRTAATALRGEAPKLREWVGEVKSAMESLKMDASFAQRNVNEGFSGGEKKRHEIMQLELLKPKFAILDETDSGLDVDALRIVSEGVVRAKAANNHGILLITHYTRILRYIKPDFVHVFANGRIVEEGGPELADKLEANGYAEYVTA >NZ_CP016770|615120:623049|622422_623049_+|WP_190286215.1|DBSCAN-SWA MDTQEPNIGAEITTEPRINDRIRTPQIRLINYTGEQVGVVDIEAALAMAEEIGLDLVEIAPEANPPVCKIMDFGKYKYQEAQKAREARQNQTHIVVKEVRMTPKIENHDYETKRSAVEKFLKGGDKVKITMKFRGREQTRPELGFKLLQRLAEDVKEIAFVEFAPKQEGRQMTMVLGPTKKKTEAVAEQKAARAAKVKAAEEEAAATQ >NZ_CP016770|615120:623049|620459_620732_-|WP_095696180.1|DBSCAN-SWA MAKEDDVYDITSAPKGLSTDQSARQRRYFISMMIRTACFILTVLLPSPYRWFALLGAVALPYIAVVVANAGRETVLPGEAILKKRPRGIE >NZ_CP016770|615120:623049|615875_617123_+|WP_095696176.1|DBSCAN-SWA MTFDAHAIAKDFPILDRTIRDGKRLVYLDSGATSQKPNVVINAESDFYRFHNAAVHRGAHQLAEEATDAYEGAREIVAQFLNASVDEIVFTKNATESLNLISYAMGNAAPGNRFHLKAGDSIVVTEMEHHANLIPWQQLAARTGAILKWFTVTDDGRLDLSQINSVIDEKTKVVALTHQSNVLGTINPLEAITKRAHEVGAVVVLDACQSVPHMKTDVKKLDIDFLAFSGHKAVGPTGIGVFWGRAELLAELPPFLTGGSMIENVTMESATWAPAPKKFEAGVPNMAQAVGLGAALTYLTGIGMDNIHKHEISLTKYLLEGLSAITDLRIIGPKTTELRGGVVSFTLGDIHPHDLGQYLDSQGIAVRTGHHCAWPLTRKLGVPATTRASVYLYNTTDDLDALIVGVQGAQKYFGR >NZ_CP016770|615120:623049|617122_617566_+|WP_095696177.1|DBSCAN-SWA MQLDNLYQEVILDHYKNPQNKKLNTTFDAQVHHINPSCGDEITLNVTLEGNMVKSVSWDGVGCSISQASVSILTDLLIGKSLEQAQVIADSFMHLMQSKGTEKGDESLLEDAVALAGVSQYPARIKCALLGWMAFKDASVQALAKQA >NZ_CP016770|615120:623049|617586_617898_+|WP_095531491.1|DBSCAN-SWA MVAKIEDINEAMKDVVDPELGINVVDLGLIYDMMVDDNNIAVLNMTLTSAACPLQDVIEDQTRQALAPFTDDVKINWVWMPPWGPDKISDDGREQLRALGFTV >NZ_CP016770|615120:623049|617917_619783_+|WP_095696178.1|DBSCAN-SWA 
MSQHAVWMTFRSMTADPSVKSQKLKPGTIKRIITYGKPYKSHIIIFLITVVIEALLVVSTPLLLRELIDKGVLPKDTGLVTKLAFLVGLLAVVDAAFNIFGRWYSARIGEGLIYDLRSQVFAHIQRQSIAFFTRTQTGALISRINSDVMGAQQAFTGTLSGVVSNVVSLVLVVTTMLILSWQITVVSLLLLPAFLLPTKWVGKRIQALTRDSFNLNATMSSTMTERFNVSGALLVNLYGKPAKEENFFRTRARKVADIGIQTAMLNRVFFVGITSVAAVATAFAYGIGGHLAINGSITVGTLLAITALLIRLYGPLTALSNVRIDVMTALVSFERVFEVLDLQPMIVDKADAKVLRKKDLKIDFTDVAFSYPRADEISLASLESAAKPETVESGEVLRGITFSAPAGSLTAIVGPSGAGKTTMSALLPRLYDVTRGAITIDGEDIREFTVQSLRDSIGVVMQDAHLFHETISENLRYAKEDATEAEMIEACKAAQIWDLISTLPNGFDTMVGERGHRLSGGEKQRLAIARLLLKAPAIVILDEATAHLDSENESLVHEALKHALKGRTSIVIAHRLSTVMEADQILVLEKGLIVERGKHEELIAQGGLYSELFARQDITTN >NZ_CP016770|615120:623049|620759_621473_+|WP_095676124.1|DBSCAN-SWA MSEPTSIALVTGGNRGIGLAIATALKAAGHRVVITYRSGTPPTGFDAVQMDVTDSASVDAAFTKIESEIGQPEIIVANAGITKDTLVLRMSDEDFESVIDANLTGAFRVAKRATKGLLKLKRGRLIFIGSVVGGVGAAGQVNYSASKSGLVGMARSFARELGSRGITANVIAPGFVETDMTAELDEKRRVDIAAQVPLGRFCSAAEIADVVAFIASPQASYITGAIIPVDGGLGMGH >NZ_CP016770|615120:623049|619744_620467_-|WP_095696179.1|DBSCAN-SWA MSNTRKKESYSLLKSLVALLLIIGCIWAAQWQYHRGVDRHARNSVIETNSSMSPVSLTSVRSNPADHEWQPVRTEGIFDSSSQILLRNRYSEGVYGFELLTRFTTPEGKSFWVDCGWVKAGANASSQPELPVLPTGEVSITGRLRLDSSLPRGSFFAIPGNKADGLVSKANAQSGSSTETFYIDLLSGSDSALTPAVPAQLPELSDGPHMAYALQWLFFGGLVIYGRFLIRRDVLSRKEL >NZ_CP016770|615120:623049|621475_622243_+|WP_095676125.1|DBSCAN-SWA MGILQGKNILVTGVLTDGSIAFHIAKIAQEEGANVVLTGFGRALSLTTRIAGRLPQLPPIIELDVTNQEHLDGLAEQVRKHVPHLDGVVHSIGFAPEAALGGNFLNTAWEDVATAVHVSAYSLKSLTMACRPTFKDGASVVGLDFDAQVAWPKYDWMGVAKAALESTSRYLARDLGAENVRINLVAAGPIRTMAAKSIPGFDEFEKVWNERSPLEWDVTDPVPAAKAAVALLSDWFPKTTAEIIHVDGGLHGMGA |
10 | Cedratvirus(16.67%) | NA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
1055486 : 1067480
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP016770|1055486:1067480|DBSCAN-SWA TTTAAAGAGTTGTGACAGGGGCATAGCGCAGCAATAAGCGCTTATCGCCATATTGGCCGAAGTTGATGGTCGCCTCTGATTTATCGCCTTCTCCAGCAAGTGCCACCACAGTCCCTAATCCGAATGTGTCATGTGACACACGCTGTCCCACTTCAAGAACCATCGCGGTTGATTTCTTGCCCGTAGCTCGTGGTGGTGGACCAGATGGAAGACGAGATTTCTTCACAGCAGAAGATGTTGTAAATGTGCTGCGACCTTCATTCTTCCATTCGATCAATTCAGATGGGATCTCATCAAGGAATCGAGAACCTGGATTGTATTTCGGTGTTCCAAAAGTTAAACGATATTCAGCGCGTGAAATATAGAGACGCTTTTCTGCGCGCGTTAATCCAACATATGCAAGGCGACGTTCTTCTTCAATCTGTTTGGGATCATCGAGTGTTCGCGCATGAGGGAAGATTCCATCTTCCATACCTGTGAGAAAGACTGTGGGAAATTCAAGACCCTTTGCAGTATGCAGAGTCATCAGTGTGACAACTCCCCCGTGATCTTCTCCATCAGGAATCTCATCAGCATCAGCAACAAGGGATACCTTCTCCAAAAACCCAGAGAGTGAAATCTCTTCATCTTCACCAAGTTCTTCAAAGGGGCGCTCTTCATATTCCATGGATGCAGAGACAAGTTCCTTCAAGTTCTCAACGCGCACTTCATCTTGTGGGTCTTTACTTGCTTCAAGCTCTGTCAGCAACCCTGACTGTTCAAGGACTGCTTCGATAATCACAGATGGCTTTGTCTTTGCTTCAACCAGTGTCTGCAGCGCAATCAGCATTGAGGTAAAGGAGGCGATGGATTGCGCTGCCTTATTAGGAACAGATGTCGCCTCTGATACTCGCAGCAGCGCATTCCAGAAAGAGATTCCCTGAGCTTCGGCGAAGATTTCAACTTCCTCTAGCGCGCGATCTCCAATACCTCGCTTAGGTAAGTTGATGATGCGGCGGATTGAAACTTCATCATCGGGATTAACTAGAACGCGAAGGTAGGCGAGTAAATCCTTGACCTCTTTACGTTCATAGAATCGAAGACCACCCACGACTTTATAAGGGATTGCAGCGCGCATAAAGATTTCTTCAAAGATACGAGATTGAGCGTTGGTGCGATAGAAGACCGCTGTATCTCCTGGATTGGAGTGGCCCATATCCTGTAAGGAACGAATTTCACTCTTAATGAACTCTGCTTCGTCATATTCAGATTCAGCAACATAACCAACAAGTGGTGCACCAGATCCTGCATCAGACCAAAGGTTCTTCTCTTTCCGTGATTCATTCTTGGTAATCACAGCATTTGCAGCGTTGAGAATATTCTGTGTTGAACGGTAATTTTGTTCAAGCAAAACTGTTGCAGCATTGGGGTAATCAACCTCAAATTGCAAGATATTGCGGATGGTTGCACCACGGAATCCATAGATAGATTGATCAGCATCTCCCACAACGCAGAGCTCTGCAAGTGGGAAGCCATCTCCTTCAACACCGGTGAGTTCCTTTACCAATTGATATTGCGCATGGTTGGTGTCCTGATACTCATCGACAAGGATGTGGCGAAAGCGGGAGCGAAAGCGCGCTTTAGCTTCAGGAAACTTCTGCAGAACCTGCACAGTCTTCATAATCAAGTCATCAAAGTCCATCGCATTTGCCTGCTGCAAGCGCTTTTCATACATCGTATAAACATCGGCGACGATGGTCTCGAATTGATTGGTGGCATGGCTTAAATACTCATACGGAGTTTGAAGTTCATTCTTTGCATTAGAAATCAGGTTCTGGAATTGACGGGCTGGATATCTCTTGGAATCAATATTGAGAGTCTCCATCACACGTGAGATCAGCTTCTGTGAATCTGCTGAGTCATAGATGCTAAATGTGCTGCTAT
ATCCAAGACGTTCAACCTCTTGGCGCAGCATGCGCACACAGGATGAGTGAAATGTTGAGACCCACATTGATTTAGCAACCGGTCCCACAAGTTCGCTCACACGCTCTTTCATCTCACCAGCTGCCTTATTGGTGAAGGTAATTGCCAAGATTTCATAGGGGCGAACTTCGCGACGAGCCATGATGTATGCGATGCGACGAGTAAGAACTCGAGTCTTTCCAGATCCAGCACCTGCGACTACTAAGAGTGGAGAACCAGCGTGGGCAACGGCCTTTTGTTGCTGGGGATTGAGTCCATCCAGCAATGGGTCGGTCGTGGTCATGGCGGGGAGTCTATTAGGCTTGGCTGTCATGGAGCCTTTAGCAGATTCAGAGATCAAGCGCGTTTTAGTTATCAACGCCCACCCCGATGATTCCGACTTTGGCGCATCAGGAACTATCGCGCAGTGGGTTCAAAAAGGCATTGAAGTCTCCTACGTTCTCTGCACCAATGGCGACCAAGGTGGCGAAGAGTCAGGTTTTACCAAGGAAGAGATGCCTGCTGTGCGCCAACGCGAACAACGCGCTGCATGTAAAGCACTTGGAATTAGCGATGTCACATTCCTTAACTATGTCGATGGCCATCTTGAGCCAACGATTGCGCTGCGTAAAGATATTGTTCGTCAGATTCGTCGCGTGCAACCAGATCGCATCGTCTGCCAATCACCTGAACGCAATTGGGAGCGCATCGGTGCAAGCCATCCCGATCACTTAGCTGCAGGGGAAGCAACTATTCAAGCTGTCTATCCCGATGCGCGTAATCCTTTTGCATTCACAGATCTGCTTGAACAAGAGAATTTGCAACCATGGAGAACGAAAGAGGTATGGATGCAAGGCCATGCCCACCCAGATCACTTCGTCGATATCACAGACACATTCCATTTCAAGATTGCTGCCCTTAAAGAGCATGCATCACAGACTGGCCATATGGAAAATCTCGAAGAGATGCTTCGTGAATGGGGTCAGCGCAATGCAGGTATGGGGCAACTTCCTGAAGGGCGTATTGCTGAAGCCTTTAAGATTGTTAACACCAACTAAATTACTTAGCTGAAGCCTTAACTATTTCAACCATCTGGATGGGAAATCCATCTGCTGCATACATCGCCTGAGTTCCACTCATTTGATACGCACCCACTGCGCCAATTTTCTGCACAAGATCTAAGCCTTTGGTGATCTTGCCCCAGATTGTGTAGTCAGGTCCAAGCGTGGTGTCTTGATAGACAAGAAAGAACTGTGATCCGTTGGTGTTGGGGCCAGAGTTAGCCATAGCCACTGTCCCTGCTGGGTAATTGATGCCGGCAGCCTTTGGAAGGTTTTCATTGGCATAACCTTTCCATCCTGTCGGAGAACCACTTCCACTTGCTGTGGGATCGCCACATTGCAGAACAAAAATTCCTTCCGTTGTTAAACGATGGCAGAGAGACTTGTTGTAGTAACCAGCATTTGCAAGGGATGCAATCGATGTCACTGTGATTGGCGCCTTTGCACCAAGGAGTGAAATCACAATAGGGCCGCAGTTAGTTGTAATCGTTAAAGTCTTTGCAGGGCTCTTTGCAGCAACAGTTGGCTGCTTTACCTTGGCCGGAGCATGTCCCTTAGCGGTGCTCTTTGCGCATCCTGGAACTGAGATGGCACGCTCGGCTGCATGAGCAGGAGCTGTTACCGACGTTGCTGCAAGAAGTATGAGTGCAATGGCTGCGACCTTCGCGTTCTTAAACATTTCTGTGGTTCCTATTCCCATTCAATCGTGCCTGGTGGCTTACTTGTCACATCAAGGACCACGCGGTTGACCTCGCGGACCTCATTGGTAATTCGTGTTGAAATCTGCTCAAGAACTTCGTATGGCACACGTGACCAATCTGCAGTCATCGCATCCTCACTTGAAACTGGGCGCAAAACTATCGGATGACCGTAAGTGCGTCCATCGCCCTGAACTCCAACGCTTCGCACATCAGCCAAAAG
CACAACCGGGCATTGCCAGATATCTCGATCAAGGCCTGCAGCCTTTAATTCATTGCGCGCAATCAGATCGGCATGACGCAAAATTTCTAAGCGCTCAGCTGTTACCTCACCGATAATGCGAATGCCAAGGCCAGGACCTGGAAATGGTTGGCGCCAGACAATCTCTTCAGGCAAGCCAAGTTCAATTCCAACCTGGCGCACTTCATCTTTAAAGAGAGTGCGCAGTGGTTCTACAAGTTTGAATTTAAGGTCATCAGGAAGCCCACCAACATTGTGGTGAGATTTGATGTTGGCAGTACCTGTTCCGCCCCCTGATTCAACAACATCTGGATAGAGCGTGCCCTGCACAAGGAATTCAACATCTCCGCCCGCTGCAATATCGCGAGCTGCTGCTTCAAAGGAGCGAATGAATTCACGGCCGATAATTTTGCGCTTTTCTTCAGGGTCTGTAACTCCAGCAAGTGCGTTCAGGAACTGATCTACGGCATCTACGACGACGAGTTCGACTCCAGTGGAGGCAACAAAGTCACGCTGCACTTGCTCTGATTCACCACTGCGCAGTAGACCGTGATCTACGAAGACACAGGTCAGTTGCTTGCCCACAGCGCGCTGCACGATTGCAGCTGCAACTGCTGAATCAACGCCTCCAGAGAGACCACAGATAACGCGCTTATTACCAATAACTTCCCTAGCCTTTGCAACTTCATCTTCTGCGATGTTATGCGTTGTCCACGTTGGCTTACATCCTGCAATATTGATAAGCCAATTCTTCAAGATTGCTTGGCCATGCTCTGAGTGAAGAACTTCTGGGTGAAATTGCACTCCTGCAAGCTTTCCGGATGCATCTTCAAATGCTGCAATAGGTGTGTCAGACGTTGATGCTGTCACGCTGAAACCAGATGGAACCTCAGAGACTGCATCACCATGTGACATCCACACAGATTGCGCTGCAGGAAGACCTGCAAACATCTTGGAACCAGCTTTTACTGCAAGGGGTGTGCGACCAAACTCTGATTTACCAGTCTGTGCCACAACGCCAGCAAGGGCTGCAGCCATTGTTTGAAATCCATAGCAAATTCCAAAGACTGGAATACCGAGCGTGAAGATTGCAGGATCGACTTTTGGTGCGTGATCTGCATAGACCGATGAAGGCCCACCTGACAAGATAATTGCTTCGGGATTCTTTGCACTTACTTCATCTGCAGTAATGGAAGATGGAACAATTTCAGAGTAAACATTGGCTTCGCGAACACGACGAGCAATGAGCTGTGCATACTGTGCGCCGAAGTCGACGACAAGAACGCCGTGCTTATCGGCCATGCTGCATAATGATTTCTACGCGCTGGAATGACTTCACATCTGAGTATCCGCAGGTGGCCATTGCACGACGTAGTGCACCGAACAAATTCATTGAGCCATCAGAGGTATGAGATGGGCCGTTGAGAACTTCTTCAAGAGTGCCAACAGTTCCGACATTCACGCGCTGACCGCGTGGAACATCTTGGTGATAGGCCTCAGATCCCCAGTGCCAACCAAGTCCTGGAGATTCAACTGCCTTAGCAAGTGCTGAACCCATCATGACCGCATCTGCCCCGCATGCAATTGCCTTTGCGATATCGCCAGATTTACCAACAGAACCATCTGCAATAACGTGAACATATCGTCCGCCTGATTCATCAAGGTATTCACGGCGCGCTGCTGCAACATCGGCAACCGCTGTCGCCATCGGAACCTGAATTCCAAGAACAGTCTGTGTTGTGTGAGCTGCTCCCCCACCAAAGCCAACGAGAACACCAGCAGCTCCTGCACGCATCAAGTGAAGGGCGCCTGTTGTCGTTGCAACGCCACCAACAATGACAGGAACATCGAGCTCATAGATAAACTTCTTTAAGTTAAGAGATTCACTCTCTGCAGCAACATGCTCAGCTGAGACAGTAGTTCCGCGAATAACGAAGATATCAACGCCTGCATCGATCACAGCTTTGTGTAGCTGTGCGGTG
CGCTGTGGTGAAAGCGATGCTGCAACGGTAACGCCCGAGTCGCGGATCTGCTTGATGCGCTCCTTAATCAATTCAGGTCGGATTGGTGCTGCATAAATCTCTTGCATACGTCGCGTTGCATGCTTATCTGGCATTGATGCGATTTCACCCAGTGGAATACGTGGATCGTCATAACGAGTCCACAAGCCCTCAAGGTTGAGAACTCCAAGGCCTCCAAGCTTTCCGATAGCGATCGCTGTCTCAGGTGAGACAACTGAATCCATTGGAGCTGCAATCAGTGGAAGATCAAACTTATAGGCATCGATCTGCCATGAGGTGCTGACCTCTTCAGGAGTGCGAGTACGACGGCTTGGAACGATGGCGATATCGTCGAATGAATAAGCCTGGCGGGCGCGTTTTCCGGGAGCGATTTCGTAATCCATAGTTATTTCTTCTTGCTGTAGTTAGGGGCATCGGCCACATCGAGAACATCATGTGGGTGGCTCTCTTGTAAGCCAGCTGCAGTTATTTGAATCAAGCGACCTTCACGGCGCAGAGTTTCGATATCAGGAGCGCCTGCATATCCCATGCCGCTACGAAGTCCGCCCACAAGTTGGTGAACAACATCGGCAACAGGTCCGCGATAGGCAACCTTTCCTTCAATTCCTTCAGGAACAAGTTTATCTTCTGAGAGAACATCATCTTGCATGTAGCGATCCTTAGAGTATGACTTCTTCTCTCCACGAGATTGCATCGCACCGAGTGAGCCCATTCCGCGATATGCCTTAAACATACGGCCATCAATTTCAACGAGTTCGCCAGGAGATTCTTCACACCCTGCAAGAAGTGAACCGAGCATCACTGAATGTGCGCCGGCAACAATTGCTTTGACGATATCCCCTGAATATTGAAGTCCACCATCTGCAATCAGTGGAATGCCAGCTTTATTACAAGCCTTCGCAGCTTCCATGATGGCGGTGACTTGTGGAACACCGACACCTGCTACAACACGCGTTGTGCAGATAGATCCTGGACCAACACCAACCTTTACTGCATCCGCTCCTGCATTAATCAGGGCCTGTGCACCTGCACGTGTTGCAACATTTCCACCAATGATTTCGATGGTGGAGGAAAACTTCTTAATGCGCTCTATCGCATCGAGTACTGCTCGGTGATGGCCATGCGCTGTATCCACAACGATGACATCAACTCCTGCCTCAATCAGCTTCTGTGCACGGGCAAAACCATCATCGCCCACACCAACTGCTGCCCCTGCAAGGCCAACGTTTTTTACTAACTTCACATGGGTGACTTGTTCATCGACAGGAAGGTTGCGGTGAATAATCCCGATGCCACCAGCCTTTGCCATAGCAATCGCCATCGTGGATTCTGTAACGGTATCCATGGCGGAGGAAATAACAGGAACTGCCAGAGTGATATTGCGTGTGAGGCGAGTTGAGGTATCAACCTCAGATGGCACTACATCAGATGCATCTGGAAGAAGAAGGACATCGTCATATGTCAGGCCAAGGAGCGCGACCTTCTCAGAATCGATCACGGCGGCTCCCTTGAATAGCAACGAGAGCGCAGTCTACCTGCTCAATTACGCGCCTAACGCCTGCGCAATTTCCTCACAGGCATGCTCAAGGCCCTCGGCCCGCACAACTTGATCACATGAAATTGCATCCCAACCAGGCCCACCCACCACGATGCGAGGAGCTGGGCGGATTGCAGGAATTTCATTCCAATATTTGCTTTCGGCATTCTTTGGTAATTGTGCCCATAAGAAAATTGCAGGGGGTGCACATCGCGTCACCATCGCAGATAGGGCCTCAAGCGGTGTGCGAGCACCAAGAACCGATGTCTGGATATTGCGTTCGCAGAGCGCTGCAGCCAGGGCATAGATGGGCAGCGAATGAAGCTCTTCGCCCACTGCGGCCAGTAATACAGGGCGAGGATTGATGGGCTTCTTTAATTCAACTACTCGATTATGCATTGTGCGTTTGAGAATTTCAGAGAAG
AGATGCTCAACCTCAATACCCTTCTGATTGTTCTCCCACTCTTCGCCAATCAAAAAGAGAACCGGCGCAATCACATCAGACCATGCGCCTTCAACCCCGTATGTATCAATTTCATGGGCCAACGTTGTTTCTACAAATGTATGATCAAAACTTTGTAAGGCTTTATAGAGAGCTGCGACAACTTCTTCGCGGACTTCAAAATCATTCACGATCTTCTTCAGTGGCACCGCTGTCTTGCAAGCCTTCGCTTGCTCGGCTGCATCTGCAGGAGTAACCCCAGCAACGATAAGGCGGCGCATCATGGTCAGCTTTGCTAAATCATTGGGGCAGTAACGGCGATGCTCGCCCTCTTCATGATCGGATGGTCCAAGGCCGTAGCGGCGGGCCCACGTGCGCAGCGTGGCTGGTGCTACACCGATGCGACGGGCAACTGCTGCCACCGTCAGTTTCTCTTCAGCGTCTACTGAAGCGCTCTTTGCCATGCCCCTATCGTGGCGTGAAGTGGCCTAACTCACCACTCCAAAATCGGACATTTCGCACATTGAACAACGCGTGACCCGATAGGTACTTGAACAACCTACGAAACGGTTGTATGGTGCAGATATCTAGAACAAAGGAGAACCACATGGCTGAACTATCACGTCTACCGCAACCTATTGCTGAGCAATGGGAGTGGCAGTATGAAGGTGCATGCCGTTCACTCGATTCAGAGATGTTCTTCCATCCAGATGGCGAACGTGGTCCACGTCGACGCAATCGTGAGAATGCTGCAAAGGCTGTCTGCGCTTCATGCCCAGTTATTCAAGCCTGTCGCACACACGCCCTTGCAGTCCAAGAGCCATATGGAATCTGGGGAGGCATGTCAGAAGATGATCGCGCCACTATCCTGATTCAGCGCGGTATTCCTTTGATTTCACACGCTTCATAATTTTTGAGTTAAAAAAGAAATCCCCCGCACATAATGTGCGGGGGATTTCTTTCTTTAATTACTTAGTGACCGTGTCCACCTGGACCATGTGAATGACCATGACCGCTCGATGCTGGCTCTTCATTTGCCGGGCGCTCATAGACAACAGCTTCTGTTGTAATAAACATTGCAGCAATAGATGCAGCATTTGCAAGTGCTGAACGAGTGACCTTCACTGGGTCGATAACGCCATCTTTTGCAAGATCGCCATAGACATCAGTTGCAGCGTTGAAGCCTTCATTTGGCTTCATTGCGCGGACCTTTGCAACTACTACGTAGCCTTCAAGTCCAGCGTTTTCTGCAATCCAGCGAAGTGGTTCATCGCATGCCTTGCGAACAAGTGCAACACCGACAGCCTTATCGCCTGTGAAGCCGAGGTTGTCATTGAGAGCATCAGCTGCATGCACAAGGGCAGCGCCTCCGCCGATAACGATTCCTTCTTCAACAGCTGCGCGAGTTGCAGAGATAGCATCTTCAAGACGGTGCTTCTTCTCCTTCAACTCAACTTCTGTATGTGCTCCGACCTTGATAACGCAGACTCCACCAGAGAGCTTTGCAACTCTCTCCTGAAGCTTTTCGCGATCCCAGTCAGAATCAGATTGTGAAATCTCTGCACGAAGTTCTCCAACGCGACCTGCTACAGCTGCCTTATCGCCAGCTCCATCGACAATGGTGGTTGTCTCCTTGGTGACCACGATGCGACGAGCGCGTCCGAGATCTTCCAGAGTTGCAGCCTCTAACTTCATACCAACTTCTTCAGAGATGACAGTTGCACCAGTCAAGATCGCCATATCTTGCAAGATTGACTTGCGACGATCTCCAAATGCTGGAGCCTTTACTGCAGCTGATGTGAATACTCCGCGCATGCGGTTTACAACGAGAGTTGAAAGCGCTTCGCCTTCAACATCTTCAGCAACGATAAGAAGTGGCTTTCCTGCCTGTGAAACCTTCTCAAGAAGAGGAAGGAGTTCAGCAAGAGCTGAGACCTTGTTGGAGACGAGAAGGATGTAAGCATCTTCAAGGACTGCTTCCA
TGCGATCTTGGTCTGTAACGAAGTATGGAGAGATGTAGCCCTTATCGAACTGCATACCTTCAGTGAACTCGAGCTCCAGTGCTGTAGTTGATGCTTCCTCAACAGTAATAACGCCATCTTTGCCGACCTTATCCATCGCTTCTGCGATGAGGTCTCCGATTGCGCGGTCCTGTGCTGAAATCGTTGCAACATCTGCGATCTGCGCCTTATCTTTTACGACAGTTGCATTCTCGCGAAGACGAGCTGAGATTGCCTCAACAGCTGCTTCGATTCCCTGCTTGAGATCCATTGGCTGTGCTCCGGCAGCAAGGTTACGAAGACCTTCCTTGACCATTGCCTGAGCAAGCACTGTTGCAGTCGTTGTTCCGTCTCCTGCGACATCGTTGGTCTTTGTGGCGACTTCCTTGACGAGCTGTGCGCCCATGTTCTCAACAGGATCTGAGAGTTCAATCTCTTTAGCAATTGTCACACCATCGTTAGTGATGGTGGGTGCGCCAAATGACTTAGCGATGACAACGTTGCGACCCTTAGGTCCTAGCGTTACCTTGACGGTGTCAGCGAGCTGATTAACACCGCGTTCCATTGCGCGACGAGCATGCTCGTCGAATTCCAACATTTTGCCCATGGACTACTTCTCGATTATCGCGAGAATGTCGCGAGCAGAGAGAACGAGGTAATCCTCGTTGTTGTATTTAACTTCTGTTCCGCCGTACTTGCTGTAGAGAACGACATCGCCGACCTTTACATCCATCGGTACGCGAGCGCCATCATCGAAGCGGCCTGGGCCTACTGCAACAACTGTGCCTTCTTGTGGCTTCTCTTTCGCTGTATCTGGGATGACAAGACCTGATGCAGTTGTTGTCTCAGCTTCATTTGCCTTAACAACGATGCGATCTTCGAGCGGCTTAATGGCTACTGCCATGGTGGTACTCCTTTTAGATTCGATATATCCACTTTGTTCGGGGCTCACCTGCTTTAGCAGTGTCGACCCTAGAGTGCTAACGCCAAGCCTATGGGGGCTTATTCCAGCGGGCAAATCGGGCCTACAGGCTGACCTGCTCCACCTTCAGGCTGGAATCAGCGTCAAAGGCGCCCTTGGTGGCGGGCCTACCAGCGCTGACCATCAGGCTGCCCAGGGATGCCACCATCGCCCCGTTATCGGTGCAGAGAGCCGGAGATGGAATACGAAGTGCAATACCAGCCTTCTCACAGCGCTCTGTGGCAACGGCTCGAAGACGAGAGTTGGCGGCAACACCGCCTGCGATCACCAACGAATCAATGCCCGTTGATTTACAGGCAGCCAGTGCTTTGAGCATCAAGACATCAACTATGGCTTCTTGGAAAGATGCTGCAACGTCGGCGCGAATGAAGGATGGAGTTCCTTCAAGATAACGAGCAACCGCAGTCTTTAGCCCTGAGAATGAAAAATCATAGGGGCGCGTTGCCCAGTCATTTGATGTTGTCAGGCCCCGTGGAAAATCGATGGCGCTCGCAGATCCGCTCACTGCTTCTCGATCTATCGCAGGACCACCAGGAAAACCTAAATCCAGAATACGTGCAATTTTATCGAATGCCTCACCTGCAGCATCATCCATCGTTGCGCCAAGTTTGGTTATTGAACCTGTGATGTCATCAACTTGAAGAAGGGATGAGTGGCCACCGCTGACAAGGAGTGCGATAGTGGGATCAGTTGGTTGATCATGAGTTAAGTAATCAACGGATACGTGAGCTGCTAAATGGTTAACGCCATAGAGCGGACGACCAAGTCCTTGCGCCAATCCACTTGCTGATGCAACACCGACAAGGAGTGCGCCGACGAGTCCTGGGCCAGCAGTGACCGCAATCGCATCAATATCTGACAGTGAAATCTTCGCATCTGTGAGTGCGCGCTGGATACTTGGCAACATCGCCTCCAAATGAGCACGCGATGCAATCTCAGGAACTACTCCCCCAAAACGTGCATGTTCATCAACGCTGGATGCAATGACATTGGCAAGAAG
TGTGCGACCACGAACAATGCCGATTGCGGTTTCATCGCATGAAGTTTCAATACCAAGGACTACTGGTTGCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP016770|1055486:1067480|1062990_1063890_-|WP_095676527.1|DBSCAN-SWA MAKSASVDAEEKLTVAAVARRIGVAPATLRTWARRYGLGPSDHEEGEHRRYCPNDLAKLTMMRRLIVAGVTPADAAEQAKACKTAVPLKKIVNDFEVREEVVAALYKALQSFDHTFVETTLAHEIDTYGVEGAWSDVIAPVLFLIGEEWENNQKGIEVEHLFSEILKRTMHNRVVELKKPINPRPVLLAAVGEELHSLPIYALAAALCERNIQTSVLGARTPLEALSAMVTRCAPPAIFLWAQLPKNAESKYWNEIPAIRPAPRIVVGGPGWDAISCDQVVRAEGLEHACEEIAQALGA >NZ_CP016770|1055486:1067480|1059177_1060728_-|WP_095676524.1|DBSCAN-SWA MADKHGVLVVDFGAQYAQLIARRVREANVYSEIVPSSITADEVSAKNPEAIILSGGPSSVYADHAPKVDPAIFTLGIPVFGICYGFQTMAAALAGVVAQTGKSEFGRTPLAVKAGSKMFAGLPAAQSVWMSHGDAVSEVPSGFSVTASTSDTPIAAFEDASGKLAGVQFHPEVLHSEHGQAILKNWLINIAGCKPTWTTHNIAEDEVAKAREVIGNKRVICGLSGGVDSAVAAAIVQRAVGKQLTCVFVDHGLLRSGESEQVQRDFVASTGVELVVVDAVDQFLNALAGVTDPEEKRKIIGREFIRSFEAAARDIAAGGDVEFLVQGTLYPDVVESGGGTGTANIKSHHNVGGLPDDLKFKLVEPLRTLFKDEVRQVGIELGLPEEIVWRQPFPGPGLGIRIIGEVTAERLEILRHADLIARNELKAAGLDRDIWQCPVVLLADVRSVGVQGDGRTYGHPIVLRPVSSEDAMTADWSRVPYEVLEQISTRITNEVREVNRVVLDVTSKPPGTIEWE >NZ_CP016770|1055486:1067480|1055486_1057730_-|WP_095696405.1|DBSCAN-SWA MTTTDPLLDGLNPQQQKAVAHAGSPLLVVAGAGSGKTRVLTRRIAYIMARREVRPYEILAITFTNKAAGEMKERVSELVGPVAKSMWVSTFHSSCVRMLRQEVERLGYSSTFSIYDSADSQKLISRVMETLNIDSKRYPARQFQNLISNAKNELQTPYEYLSHATNQFETIVADVYTMYEKRLQQANAMDFDDLIMKTVQVLQKFPEAKARFRSRFRHILVDEYQDTNHAQYQLVKELTGVEGDGFPLAELCVVGDADQSIYGFRGATIRNILQFEVDYPNAATVLLEQNYRSTQNILNAANAVITKNESRKEKNLWSDAGSGAPLVGYVAESEYDEAEFIKSEIRSLQDMGHSNPGDTAVFYRTNAQSRIFEEIFMRAAIPYKVVGGLRFYERKEVKDLLAYLRVLVNPDDEVSIRRIINLPKRGIGDRALEEVEIFAEAQGISFWNALLRVSEATSVPNKAAQSIASFTSMLIALQTLVEAKTKPSVIIEAVLEQSGLLTELEASKDPQDEVRVENLKELVSASMEYEERPFEELGEDEEISLSGFLEKVSLVADADEIPDGEDHGGVVTLMTLHTAKGLEFPTVFLTGMEDGIFPHARTLDDPKQIEEERRLAYVGLTRAEKRLYISRAEYRLTFGTPKYNPGSRFLDEIPSELIEWKNEGRSTFTTSSAVKKSRLPSGPPPRATGKKSTAMVLEVGQRVSHDTFGLGTVVALAGEGDKSEATINFGQYGDKRLLLRYAPVTTL >NZ_CP016770|1055486:1067480|1064033_1064336_+|WP_095676528.1|DBSCAN-SWA MAELSRLPQPIAEQWEWQYEGACRSLDSEMFFHPDGERGPRRRNRENAAKAVCASCPVIQACRTHALAVQEPYGIWGGMSEDDRATILIQRGIPLISHAS 
>NZ_CP016770|1055486:1067480|1066036_1066330_-|WP_095676530.1|DBSCAN-SWA MAVAIKPLEDRIVVKANEAETTTASGLVIPDTAKEKPQEGTVVAVGPGRFDDGARVPMDVKVGDVVLYSKYGGTEVKYNNEDYLVLSARDILAIIEK >NZ_CP016770|1055486:1067480|1057758_1058484_+|WP_095676523.1|DBSCAN-SWA MEPLADSEIKRVLVINAHPDDSDFGASGTIAQWVQKGIEVSYVLCTNGDQGGEESGFTKEEMPAVRQREQRAACKALGISDVTFLNYVDGHLEPTIALRKDIVRQIRRVQPDRIVCQSPERNWERIGASHPDHLAAGEATIQAVYPDARNPFAFTDLLEQENLQPWRTKEVWMQGHAHPDHFVDITDTFHFKIAALKEHASQTGHMENLEEMLREWGQRNAGMGQLPEGRIAEAFKIVNTN >NZ_CP016770|1055486:1067480|1066451_1067480_-|WP_095676531.1|tRNA|DBSCAN-SWA MQPVVLGIETSCDETAIGIVRGRTLLANVIASSVDEHARFGGVVPEIASRAHLEAMLPSIQRALTDAKISLSDIDAIAVTAGPGLVGALLVGVASASGLAQGLGRPLYGVNHLAAHVSVDYLTHDQPTDPTIALLVSGGHSSLLQVDDITGSITKLGATMDDAAGEAFDKIARILDLGFPGGPAIDREAVSGSASAIDFPRGLTTSNDWATRPYDFSFSGLKTAVARYLEGTPSFIRADVAASFQEAIVDVLMLKALAACKSTGIDSLVIAGGVAANSRLRAVATERCEKAGIALRIPSPALCTDNGAMVASLGSLMVSAGRPATKGAFDADSSLKVEQVSL >NZ_CP016770|1055486:1067480|1064398_1066033_-|WP_095676529.1|DBSCAN-SWA MGKMLEFDEHARRAMERGVNQLADTVKVTLGPKGRNVVIAKSFGAPTITNDGVTIAKEIELSDPVENMGAQLVKEVATKTNDVAGDGTTTATVLAQAMVKEGLRNLAAGAQPMDLKQGIEAAVEAISARLRENATVVKDKAQIADVATISAQDRAIGDLIAEAMDKVGKDGVITVEEASTTALELEFTEGMQFDKGYISPYFVTDQDRMEAVLEDAYILLVSNKVSALAELLPLLEKVSQAGKPLLIVAEDVEGEALSTLVVNRMRGVFTSAAVKAPAFGDRRKSILQDMAILTGATVISEEVGMKLEAATLEDLGRARRIVVTKETTTIVDGAGDKAAVAGRVGELRAEISQSDSDWDREKLQERVAKLSGGVCVIKVGAHTEVELKEKKHRLEDAISATRAAVEEGIVIGGGAALVHAADALNDNLGFTGDKAVGVALVRKACDEPLRWIAENAGLEGYVVVAKVRAMKPNEGFNAATDVYGDLAKDGVIDPVKVTRSALANAASIAAMFITTEAVVYERPANEEPASSGHGHSHGPGGHGH >NZ_CP016770|1055486:1067480|1061832_1062945_-|WP_095676526.1|DBSCAN-SWA MIDSEKVALLGLTYDDVLLLPDASDVVPSEVDTSTRLTRNITLAVPVISSAMDTVTESTMAIAMAKAGGIGIIHRNLPVDEQVTHVKLVKNVGLAGAAVGVGDDGFARAQKLIEAGVDVIVVDTAHGHHRAVLDAIERIKKFSSTIEIIGGNVATRAGAQALINAGADAVKVGVGPGSICTTRVVAGVGVPQVTAIMEAAKACNKAGIPLIADGGLQYSGDIVKAIVAGAHSVMLGSLLAGCEESPGELVEIDGRMFKAYRGMGSLGAMQSRGEKKSYSKDRYMQDDVLSEDKLVPEGIEGKVAYRGPVADVVHQLVGGLRSGMGYAGAPDIETLRREGRLIQITAAGLQESHPHDVLDVADAPNYSKKK 
>NZ_CP016770|1055486:1067480|1060717_1061830_-|WP_095696406.1|DBSCAN-SWA MDYEIAPGKRARQAYSFDDIAIVPSRRTRTPEEVSTSWQIDAYKFDLPLIAAPMDSVVSPETAIAIGKLGGLGVLNLEGLWTRYDDPRIPLGEIASMPDKHATRRMQEIYAAPIRPELIKERIKQIRDSGVTVAASLSPQRTAQLHKAVIDAGVDIFVIRGTTVSAEHVAAESESLNLKKFIYELDVPVIVGGVATTTGALHLMRAGAAGVLVGFGGGAAHTTQTVLGIQVPMATAVADVAAARREYLDESGGRYVHVIADGSVGKSGDIAKAIACGADAVMMGSALAKAVESPGLGWHWGSEAYHQDVPRGQRVNVGTVGTLEEVLNGPSHTSDGSMNLFGALRRAMATCGYSDVKSFQRVEIIMQHGR >NZ_CP016770|1055486:1067480|1058485_1059166_-|WP_095676867.1|DBSCAN-SWA MFKNAKVAAIALILLAATSVTAPAHAAERAISVPGCAKSTAKGHAPAKVKQPTVAAKSPAKTLTITTNCGPIVISLLGAKAPITVTSIASLANAGYYNKSLCHRLTTEGIFVLQCGDPTASGSGSPTGWKGYANENLPKAAGINYPAGTVAMANSGPNTNGSQFFLVYQDTTLGPDYTIWGKITKGLDLVQKIGAVGAYQMSGTQAMYAADGFPIQMVEIVKASAK |
11 | uncultured_virus(22.22%) | tRNA | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', and 'tRNA').
|
DBSCAN-SWA_3 |
1194968 : 1204376
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP016770|1194968:1204376|DBSCAN-SWA ATCATTTCCCAACCTCCTGATAGCCACTTGTCCAATAACCAGCTTGATCTACTTGTTGCACGATGCGCTTCTTCCACAGATCACTAGTCTGAGGGAAATCTTCACGCCAATGGGAGCCACGAGTTTCTTGGCGAATGAGTGCAGCCTTCACAATTGTCTGTGCGAGCTGGAAGAGATTTGTTGTCTCCCATGCTTCAACACATGGCTGCGTGCTCTTTCTATCTTCAATACGGGTCAAATCTGAACTTGTTTTCAGTAATGAATCTGATGATCGCAATACTCCTGCACCACGGCTCATGCTCACCTGAATATCATGGCGCACTTGTGGATCAAGCAGAATTGCTTGCGATGCATTCACCACTGGTTCACTCTTCTCAGGCAACTTCTCTGCAATATCTGCTGCAATACGTGCACTAAATACAAGGCCTTCCAGAAGTGAATTGGATGCAAGTCGATTAGCGCCATGAACACCAGAACATGCAGTTTCACCGCATGCATAGAGGCCATTGACACTTGTTCGGCCATTGAGATCAACGCGCACTCCACCTGATGCATAGTGCGATGCTGGAGCCACGGGAATCAATTCTTTCAGTGGATTAATGCCATGAGAAATACAGGATGCATAAATGGTTGGGAAGCGTTCCTTAAATCCTTCAAGGTGTCGAACATCGAGCCAGACATGGTGAGCACCAGTTTTATTCATCACGCGCATAATAGAAATTGCCACAACATCGCGTGGTGCAAGTTCTGCAAGGGGATGAATATCTTGCATAAAGCGCACGCCTTTATCGTCAACAAGATAGGCACCTTCACCGCGCACAGCTTCACTAATCAATGGCTGTTGACCTTCTGAGTTATCGCCCAGCCATAAGACTGTTGGGTGGAATTGAATGAATTCAACATCGGCCACCTTTGCCCCTGCACGAAGAGCGAGTGCAACTCCATCTCCTGTTGATACAGATGGATTTGTTGTTTGAGCAAATACTTGACCTAGTCCACCTGTTGCAAGGACTACAGCGCGAGCAAGACCTCGGCCAACGCCATCGCGACTACCTGCGCCGATAACGTGAAGAGTTACTCCACACACTTCACCAATATCGTTCTTAAGTGCATCGAGCACAAGTGCATGTTCAACAACTTCAATCTCAGGATCATCATGGACAGCCGCAAGAAGTGCTCGGGAGACTTCAGCTCCAGTTGCATCTCCTCCTGCATGCAAGATGCGATTACGAAGGTGTCCACCTTCACGAGTAAGTGCAATTTCACCGCTATCTTCTTTATCAAAGATTGCTCCACGTTCGATTAACTTACGAACAGCTTCAGGGCCTTCTGTGACAAGTACGCGCACAGCATCAACATCACAGAGCCCAGCACCTGCAACGAGTGTGTCTTTCTCATGAGCTTCGGGTGAATCACCATCACCAAGTGCTGCGGCAATACCGCCTTGCGCCCACTTAGTGGAGCCTTCATCAACTCGTGCTTTTGTCACAAGTAAAACTGATAATCCATATGTGCGACATTGCAGCGCAGTAGTTAATCCCGCAATTCCTGATCCCACAACGATGACATCAGCTGTTGCAGTCCAGCCAGGAGTTGGCGCTAATAACTTCATGCGCGATTTCCCATAGGCATATTGTCAATCAATCGCACACCATTTATCCACCCCGCGACAATCGCACGAACCTTACGAGTGTCTGATTGAGCGTGAGCAAAGGTCTTCTCATCGATGAAATCTGCATAATCAACAGTAAGAGAGCTCTCACTCGCCAAGATCTCACGCAATTCATTTTCAGACTTGGCTTTGGTAAGCGCTCTATAAATGACTTGTGCTGCGTTTCGACCTTCATCGCTAAGTCGCACATTGCGAGATGAGAGCGCAAGTCCATCGACTTCACGAATAGTGGGTGCTGCAACAATCTCAAT
ACCTTCAGCAATTGTCCTAATTAACTGCAGCTGCTGAAAATCTTTCTCTCCAAAGATCGCCACCTTTGGTTTAAGAAGTGAAAAAAATTGGTGGACAACGGTGAGCACTCCATCAAAATGTCCTGGCCGTGCTTTACCTTCATAGATCTCACCCAAGGGCCCTGCAGATTTCTTCACATAACCATCAGGATAAATCTCTGCTTGGGTAGGTAGCCAGAGATGCGTAACTCCAGCTGCATCAGCAATTGCAATATCAATATCTGGTGTGCGGGGATATTGCGCTAGGTCTTCTTTGTTTTCAAATTGCAAAGGATTAATAAAAATACTTGCCACAACATCATCACTATATTTCTTTGCAAGTGAAAAGAGTGAAGCGTGGCCCGCATGCAGTGCGCCCATTGTTGGAACAAATGCGCAGGCAGAAGGAAGCTGGCGAGCATCTGTTACTACTTTCATGGCTCCAGTCCTTTCAGGTACCGGGCAGGCAGTGAGGTAATTCTAAGGCGTTAGGACAGTTGCGCCAAAAGCTCAGAGATTTTGCCGTGTGTAAGAAGAAGGGCATCAGGTTCAATCTCATGCCACGGTTCCAGAACAAAGCGACGTTCAAAGGCGCGTGGATGAGGAAGTTCTAATTCTGTGGCGCTACTGAGCAATGTGCCATATTGAATGAGATCGAGATCGATGGTGCGAGGTCCCCACCTTTCAATACGCTCACGCCCCAGTGATTTCTCGATTCCATGTAGCAGTGAAAGCAGATCGATAGCAGGCAAATCACTTTCAAGAATGCAGACTGCATTGATGTAATCCGGTTGTTCTGGACCACCCACTGGTTTTGTTGTGTAGTAAGAAGAGACCGCAGTAACGATAGTTGCCTCTTTGAGCATCGCAACTGCTAAATCCATCTGCTCTTTGGGATTGCCAATATTGGCACCGAGTGCAACTACTGCCTTCATCGGGTAATAGTGACCGAGATATCGGAAGCTTGTGCAGAAATAGGAGCCTTAGGTTTATGCACCGTCACAGCAATATTTGAAATTTCTGGATGTCCGCTCTTGATTCGATCTGCAATACGTCCTGCAAGTCTTTCAATCAGTTGAACTCGCTCACCTGTGATCTCTTCAACCACGATGTCAGCAAGTGCTCCGTAATCAATTGTGTCCGCTAAATCATCACCCACACTTGCACGAGATAGATCTAAGTGAATTTCTAGATCAACAAAGAAATCTTGACCATTCTTTGCTTCATGATCGAAAACTCCGTGATATCCAAAACCCCAAATACCTGTGAGTGAGATGACATCACTCATGAGCGAGCTGCTTTCACATTTGCCTTCACACCATGAACTCTAACTGCCCAGATGCCCTTGCCGACTAGTGATTGGGTTACTGCAACAGTGGCTTCCTCCCGCTCATCAGGATGCTCTCCGCCTAAGAAACGTTTACGTGAGTGACCAATCAATACTGGATAGCCAAGTGCCACAAATTCATCAATCCGGTTAAGGACTTCCCAGTTGTGTTCAGCGTTCTTTGCAAAGCCAATACCTGGATCCAAGATGATGTTCTCGCGCTTAACGCCTGCTGCAAGTGCTTTATCAAGCTGCAGCGTCACTTCTTCAATGACTTCAGCAACAACATCGCCATAAATAGCCTTCTCATTCATATCTTTTGAATGTCCCCGCCAATGCATCAAGGTGTATTTACATCCCAGCTGAGCAACTGTTGAAAACATGTCAGGGTCTGCAGCCCCTCCACTAACATCATTAACGATGCTAGCTCCAGCTTCAACTGCAAGCTTGGCCGTTGTTGCTCGCATGGTGTCGATGCTAATTACAACGCCCTTTTTTGCAAGCTCGCGAATGACAGGGATAACTCGTGCTTGCTCTTCTTCTTCTGTAATTCGATCGGCACCTGGACGAGTGGATTCACCGCCCACATCAATGATGTCAACACCATCTTCAATCATTTCAAGGCCATGTGTGATCGCTAGTGATTCTTCATAAT
GCAGGCCACCATCGGCAAAAGAATCTGGAGTGACGTTAAGAATGCCCATCACCAACATGAGAATTTATCGATTCGTAATGAGGCTGATTGCTTCAGCGCGAGTAGCTGCATCGCGTAGAACACCGCGAACAGCTGATGTTGTTGTGCGCGCCCTGGATTTGCGGACTCCACGCATAGACATACACAAATGTTCGCAATCGATGATGACAATGACGCCCATAGGTTTGAGAATCTCAACCAAGGCATCTGCAATTTGAGTTGTTAAACGCTCCTGGACTTGAGGACGACGAGCAAAGAGATCAACAAGACGTGCAACTTTGGATAGACCTGTGATCTTGCCGCTCGGGATATAGCCCACATGTGCAACGCCATGAAATGGTGTGAGGTGATGTTCGCAATGAGAAAAGACTTCAATATCGCGAATGATCACAAGCTCTTCATGTCCGATGTCAAAGGTAGTTGTTAATACATCTTCTGGCTTCTGCCATAGTCCTTCAAAGTTCTCTTTAAATGCACGAGCCACTCGTGCTGGAGTCTCTTTCAGACCTTCGCGTTCAGGATCTTCACCAAGTGCTAAGAGAAGTTCGCGCACAGCCTTTTCTGCGCGCTCCTGATCGTAGGGATGCGCAGCACGTCCATCGCCAGGCCCCATGGAAACGGAGGAAATATCAGTCACTCGAGGTTGCCTTCTTCACGCGACGTGATTTGCGAGGAGCATCTGTAGGAGTTGCTCGTTCTGGAATATCAACAGGTGGTTGTGCTGAAGGTAAACGATTCTCTGATCCTGTCCATGCTGGGCGCTTTGGCCATGACTTTACCTTGGCAAAGATTGCAGCAATCTCTTCCTTGTTCAGAGTTTCCTTTTCAAGAAGTTCTAGAACCATTTCATCAAGAATGGTGCGGTTTACTTCAAGGATGTCATACGCCTCTTGATGCGCAGTCTCAATGAGTTTACGAATCTCACGATCGACAATTGCTGCAACATTTTCAGAGTAATCACGTTGATGTCCATAATCACGGCCCATAAATGGCTCTGATGCATCGGTGCCCAATTTGATTGCACCAATTGCTTCAGTCATTCCATATTGAGTCACCATGGCGCGAGCAAGGGCGGTTGCCTTCTCAATATCGTTAGAGGCTCCCGTTGATGGATCGTGGAAGATGAGTTCTTCTGCAGCGCGTCCACCCAGTGAATATGCCAATTGATCAAGGAGTTGGTTGCGAGTTGTTGAGTATTTATCTTCATCAGGAAGAACCATTGTGTAACCCAGTGCGCGACCGCGCGGCATGATGGTGATCTTATGAACAGGATCTGTATGAGGAAGTGCGTAGGCAACAAGTGCATGTCCTGCCTCGTGATACGCAGTGACGCGACGCTCTTCTTCAGACATCAGACGTGATTTACGTTGTGGTCCAGCCATCACGCGATCGATAGCTTCATCAATCTGTGTATTGGTAATTGTCTTTTGACCTTCACGAGCAGTAAGTAGTGCTGCCTCATTCAATACATTCGCAAGGTCTGCACCAGTAAATCCTGGTGTGCGACGGGCGTATGTAACGAGTTCAACATCCTTTGAAAGCGGCTTACCCTTTGCATGCACTTTAAGAATATCTTCACGGCCCTTGAGATCTGGACGCTCAACAGGAATCTGACGATCAAAGCGACCTGGGCGAAGGAGTGCGGGATCTAGAACATCAGGTCGGTTGGTTGCTGCAATCAAAATAACTTGGCCATTTGCTTCAAAGCCATCCATCTCAACAAGGAGTTGGTTCAGTGTTTGTTCGCGTTCATCGTGACCGCCACCCATACCTGCTCCGCGCTGACGACCGACTGCATCAATTTCATCTACGAAGACAATTGCAGGTGAATTGGCTTTAGCTTGGGTAAAGAGATCGCGCACGCGAGCTGCACCGACACCAACAAACATTTCAACAAAGTCAGAACCAGAAATTGAATAGAAAGGAACTTTTGCCTCACCAGCAACTGCGCGAGCAAGAAG
AGTTTTACCTGTTCCTGGAGGACCGTAGAGAAGAACGCCCTTAGGAATTTTTGCACCAAGGAGTGCGTACTTTGCAGGATCAGCTAAGAAATCTTTAATCTCTTCAAGTTCTGCAATCGCTTCATCCGCACCTGCAACATCATCAAAGGTATTTTGTGGAACATCGCTATCTTGTAGCTTTGCGCGAGATTTACCAAATGAGAAGACGCGATTTCCGCCTTGAGCATTGCTCATCATGAGGAAGAACAAGAAGCCAATAATCAGAATTGGGCCAAAGGTGAAGAAGAAAGTAGTCAAGAATGATTGCGATGGAACGCTAACGCTCCAACCCTTTGTAGGAGGGTTTGCAGTCAGAGCATCAATCAAATTCGGCTCTTGGCGCGCAATATAAGAAGCTTCAACTTTTGTTGCGCCCTTAATTGTGTTTCCACTATTGAGAATCAGACGAATCTTCTGTGATTTATCAACAAGGACTGCTGACTCAACTTGAGCCCTAGAAATAGCATCGATTGCTTGAGAAGTTTTGATCTCTGTATAGCGATTGGCAGCATTGGTAATCTGTCCAAAGATTGTGACACCGAAAATTGCAACAATGATCCAGAACAGCGGGCCACGGAAAATCTTCTGTGAACGAGTAAGAGGAGCCTTCTTGCCACCATCTTTATTCGTCGTCTTAAGAGCCTTCTTCGCTGAGTTCTTCTTCAATTTACTAAGTGGGTTCTGGGAAGACATGAAATTCCTTATGAGTACATGTGCTTTGCAAGCACACCGATAAATGAGAGGTTGCGGTACTTCTCGTCGAAATCAAGTCCATAACCCACGACGAATTCCTTAGGAATATCAAAGCCAACGTATTTCACATCGACATCAACCTTTGCAGCTTCTGGCTTGCGAAGGATTGCAAGAATTTCAACAGATGCTGCACCGCGAGAGTAAAGGTTTGATTTAAGCCAAGAAAGTGTTAATCCGGTATCGACGATATCTTCAACAATCAGAACATGGCGACCGGTGATGTCACGATCGAGATCTTTAAGAATGCGAACTACGCCGCTTGATTTAGTTCCTGAACCGTAGGAAGAAACAGCCATCCAATCCATCTCGATATGAGTCTGCATTGCGCGTGTTAAATCAGCCATCGCCATGATTGCGCCTTTGAGCACGCCAACAAGAAGTACATTCTTATCTTTGTAATCTGCATCGACGAGCGCTGCGAGTTCTGCAATCTTTGCTGCAAGTTGATCTTCTGTTGCTATTACCTTTTCAACATCGGTTCCGACTGCTGCTAAATCCACGTGGTCTGCTCCTTGATACCTAGGGCTGGGCTAAGAGCGAAAGTCTGCCCGAAATTCGCTCAACCTTCACACCACCCGGAAGGCTCACCACCCCTTGACCATGCCAAGAAGTGACCAGCGCCTCAACTGCGGCAAGGTGATCGGCTGTAATTGAGCCTGTCGGGGCGCCAGCGGCATAGAGAGCAGCTCTCAGGACTCGAGAACGGATTGCTCGAGCCAGGCTCGCTAAGTGATCGCACTCAAGATTTGCGAGGTCAGATGATGAGATTTCACTCTGTGCAATCTCATCGAGGGCATCAGCATCATCACGCAAAATCGATGCACTGCGAGCAAGTGCTGCTGCAATTCCAGGTCCTAACTTCTCCTCCATCACTGGAAGAACTTCGTTACGAACTCTCACACGCGAGAAATCAGTATTTCCATTATGTGGATCATTCCAAGGTTCAATCTCTAATTCTCTGCATGCCATAACAGTTTCTTCTCGAGTGATCTGTAATAGTGGGCGAAGATAAATTCCATTCTCTTCTGACATTCCAGACAGTGAGCGAGTTCCAGATCCCCGAGCAAGTCCTAATAAAACAGTTTCTGCTTGATCATCTCGAGTGTGACCTAAGAAGACTTTCGTTGCTTTCTCTTGCGCAGCGCACGCACTAAGAGCTTGATAGCGTGCATCACGAGCGCCAGCTTCGAGGCCGGACTCAGTAGTAA
CCACAACTTTCTTAACAATGACTTTGCCATAACCCATCTCCATCAATTGCTTCTCAACTTTTTCTGCCTGTGCACCCGAACCACTTTGTAGTTGATGATCGACAGTGACTGCAATTGCTGTAATCGCACATACTTTCGACTCTGTGAGAATTGCAGATGCAAGTGCAAGTGAATCTGCACCACCAGAGACAGCAACCAGCACAACATCGCCGGCTTCAAGCCGTGCAAGGTGAGGTTTTACAGCGTTGCGAATTGCAACGATGGCATCAGTCATAGGTGAAGGTTAGACCCGACCACCAAATGGCATAAGCCCCTTAACGCTATAAATACTTGAGTAGAGCAAGCCCTTATCCTTGGAGTTGGCCTCCCACATCATTCCTCCACCTGCATAAATGGTGATGTGGTGAATGGTTGAGATTGTGCCTTTGTAGGAATAGAAGAGAAGATCGCCAGGCTGAAGTTCAGTCAGCGCAACATGCTTTGTATAGCCTGAATACAACGCAGAGTTAAGTCGATCCCAATTCGGCCAACCAAGACCTGCTGACTTATATGCAGCATAAACAAGTCCAGAACAGTCAAATGAATTTGGTCCTTCAGAACCCCAGATATATGGCTTACGTGCTTGAACCTGCTTCTTAGCAAAGGCAACCGCAGCTAAACGTTGTGCCTCAGTTGTTCTAATTGTTGTGCGGCCCTTGAAACCAATATTTGGCCAAACTTTCGCTTGGTTGATCGTGGTAGCCGCAGTAGAAGCTTGGCTCTCTTCAAGCAACGCCAACTGACGTTGTTGTTCTAGGGTCACACGAACATTACGTGCAGTAGCGAGTTCTTTCATCAACTTATCTTGAACAGCTCGAAGTTTATTGACTTCCTTTTGTTGAAGAGCTTGCGCTGCATCCGCAATTTTCTTTGTCGCTTCAACTTTAGCAGTGGCAACTACCTGGATTGCCTTTGCTTCATCAGCTTTCTTCTTTGCAGCTTTTGCAACAATCTCTGCAGCCTTATAGCGGTCAAGAGCGGTGGTGTTTTGTGCACCTAACGTGTTGAGAGTGGAGAGTTGATCAATGAGATCTTGTGGTCCATTAGAACTCAGTAAGGGTTGGATATCACTCATACTTCCACCGAGGATGTATGCATTGGCTGCAAGTTTTCCGATAACTCGGTGAGCTTCTGCGACTGCAGCTGCAGTCTCGGCAGCATGTTTAGCAGCCGCAATTGCTTGCGCTGTAGCAACTTCGAGTTCTCTCTTCGCTTTGAGATAAACAGCTTGCGCAGCATTGGCTCTGGCAGTTAATTGTTTCAGAGTGAGATTTGCTGCAGCTAATTTCTTTGCTGCAGCATCTGCTGCCACTTTCTTGGCAGCTTCTGCCTGTTTAGCCGCTTCAATCTCGGCAAGTGTTGGTTTTGGCTTTGCGATGGCAGGGGTGGCCGCAAGAGTCAGGCTTGCAATGATGGCAAGGCACATCAAGGGTTTTCTGCGGCGCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP016770|1194968:1204376|1202201_1203164_-|WP_095696462.1|tRNA|DBSCAN-SWA MTDAIVAIRNAVKPHLARLEAGDVVLVAVSGGADSLALASAILTESKVCAITAIAVTVDHQLQSGSGAQAEKVEKQLMEMGYGKVIVKKVVVTTESGLEAGARDARYQALSACAAQEKATKVFLGHTRDDQAETVLLGLARGSGTRSLSGMSEENGIYLRPLLQITREETVMACRELEIEPWNDPHNGNTDFSRVRVRNEVLPVMEEKLGPGIAAALARSASILRDDADALDEIAQSEISSSDLANLECDHLASLARAIRSRVLRAALYAAGAPTGSITADHLAAVEALVTSWHGQGVVSLPGGVKVERISGRLSLLAQP >NZ_CP016770|1194968:1204376|1194968_1196579_-|WP_095696456.1|DBSCAN-SWA MKLLAPTPGWTATADVIVVGSGIAGLTTALQCRTYGLSVLLVTKARVDEGSTKWAQGGIAAALGDGDSPEAHEKDTLVAGAGLCDVDAVRVLVTEGPEAVRKLIERGAIFDKEDSGEIALTREGGHLRNRILHAGGDATGAEVSRALLAAVHDDPEIEVVEHALVLDALKNDIGEVCGVTLHVIGAGSRDGVGRGLARAVVLATGGLGQVFAQTTNPSVSTGDGVALALRAGAKVADVEFIQFHPTVLWLGDNSEGQQPLISEAVRGEGAYLVDDKGVRFMQDIHPLAELAPRDVVAISIMRVMNKTGAHHVWLDVRHLEGFKERFPTIYASCISHGINPLKELIPVAPASHYASGGVRVDLNGRTSVNGLYACGETACSGVHGANRLASNSLLEGLVFSARIAADIAEKLPEKSEPVVNASQAILLDPQVRHDIQVSMSRGAGVLRSSDSLLKTSSDLTRIEDRKSTQPCVEAWETTNLFQLAQTIVKAALIRQETRGSHWREDFPQTSDLWKKRIVQQVDQAGYWTSGYQEVGK >NZ_CP016770|1194968:1204376|1203173_1204376_-|WP_095696463.1|DBSCAN-SWA MRRRKPLMCLAIIASLTLAATPAIAKPKPTLAEIEAAKQAEAAKKVAADAAAKKLAAANLTLKQLTARANAAQAVYLKAKRELEVATAQAIAAAKHAAETAAAVAEAHRVIGKLAANAYILGGSMSDIQPLLSSNGPQDLIDQLSTLNTLGAQNTTALDRYKAAEIVAKAAKKKADEAKAIQVVATAKVEATKKIADAAQALQQKEVNKLRAVQDKLMKELATARNVRVTLEQQRQLALLEESQASTAATTINQAKVWPNIGFKGRTTIRTTEAQRLAAVAFAKKQVQARKPYIWGSEGPNSFDCSGLVYAAYKSAGLGWPNWDRLNSALYSGYTKHVALTELQPGDLLFYSYKGTISTIHHITIYAGGGMMWEANSKDKGLLYSSIYSVKGLMPFGGRV >NZ_CP016770|1194968:1204376|1198192_1198948_-|WP_095696460.1|DBSCAN-SWA MLVMGILNVTPDSFADGGLHYEESLAITHGLEMIEDGVDIIDVGGESTRPGADRITEEEEQARVIPVIRELAKKGVVISIDTMRATTAKLAVEAGASIVNDVSGGAADPDMFSTVAQLGCKYTLMHWRGHSKDMNEKAIYGDVVAEVIEEVTLQLDKALAAGVKRENIILDPGIGFAKNAEHNWEVLNRIDEFVALGYPVLIGHSRKRFLGGEHPDEREEATVAVTQSLVGKGIWAVRVHGVKANVKAARS >NZ_CP016770|1194968:1204376|1198954_1199542_-|WP_095676658.1|DBSCAN-SWA 
MGPGDGRAAHPYDQERAEKAVRELLLALGEDPEREGLKETPARVARAFKENFEGLWQKPEDVLTTTFDIGHEELVIIRDIEVFSHCEHHLTPFHGVAHVGYIPSGKITGLSKVARLVDLFARRPQVQERLTTQIADALVEILKPMGVIVIIDCEHLCMSMRGVRKSRARTTTSAVRGVLRDAATRAEAISLITNR >NZ_CP016770|1194968:1204376|1197396_1197843_-|WP_095696458.1|DBSCAN-SWA MKAVVALGANIGNPKEQMDLAVAMLKEATIVTAVSSYYTTKPVGGPEQPDYINAVCILESDLPAIDLLSLLHGIEKSLGRERIERWGPRTIDLDLIQYGTLLSSATELELPHPRAFERRFVLEPWHEIEPDALLLTHGKISELLAQLS >NZ_CP016770|1194968:1204376|1199558_1201622_-|WP_095696461.1|protease|DBSCAN-SWA MSSQNPLSKLKKNSAKKALKTTNKDGGKKAPLTRSQKIFRGPLFWIIVAIFGVTIFGQITNAANRYTEIKTSQAIDAISRAQVESAVLVDKSQKIRLILNSGNTIKGATKVEASYIARQEPNLIDALTANPPTKGWSVSVPSQSFLTTFFFTFGPILIIGFLFFLMMSNAQGGNRVFSFGKSRAKLQDSDVPQNTFDDVAGADEAIAELEEIKDFLADPAKYALLGAKIPKGVLLYGPPGTGKTLLARAVAGEAKVPFYSISGSDFVEMFVGVGAARVRDLFTQAKANSPAIVFVDEIDAVGRQRGAGMGGGHDEREQTLNQLLVEMDGFEANGQVILIAATNRPDVLDPALLRPGRFDRQIPVERPDLKGREDILKVHAKGKPLSKDVELVTYARRTPGFTGADLANVLNEAALLTAREGQKTITNTQIDEAIDRVMAGPQRKSRLMSEEERRVTAYHEAGHALVAYALPHTDPVHKITIMPRGRALGYTMVLPDEDKYSTTRNQLLDQLAYSLGGRAAEELIFHDPSTGASNDIEKATALARAMVTQYGMTEAIGAIKLGTDASEPFMGRDYGHQRDYSENVAAIVDREIRKLIETAHQEAYDILEVNRTILDEMVLELLEKETLNKEEIAAIFAKVKSWPKRPAWTGSENRLPSAQPPVDIPERATPTDAPRKSRRVKKATSSD >NZ_CP016770|1194968:1204376|1197839_1198196_-|WP_095696459.1|DBSCAN-SWA MSDVISLTGIWGFGYHGVFDHEAKNGQDFFVDLEIHLDLSRASVGDDLADTIDYGALADIVVEEITGERVQLIERLAGRIADRIKSGHPEISNIAVTVHKPKAPISAQASDISVTITR >NZ_CP016770|1194968:1204376|1201630_1202182_-|WP_095676660.1|DBSCAN-SWA MDLAAVGTDVEKVIATEDQLAAKIAELAALVDADYKDKNVLLVGVLKGAIMAMADLTRAMQTHIEMDWMAVSSYGSGTKSSGVVRILKDLDRDITGRHVLIVEDIVDTGLTLSWLKSNLYSRGAASVEILAILRKPEAAKVDVDVKYVGFDIPKEFVVGYGLDFDEKYRNLSFIGVLAKHMYS >NZ_CP016770|1194968:1204376|1196575_1197346_-|WP_095696457.1|DBSCAN-SWA MKVVTDARQLPSACAFVPTMGALHAGHASLFSLAKKYSDDVVASIFINPLQFENKEDLAQYPRTPDIDIAIADAAGVTHLWLPTQAEIYPDGYVKKSAGPLGEIYEGKARPGHFDGVLTVVHQFFSLLKPKVAIFGEKDFQQLQLIRTIAEGIEIVAAPTIREVDGLALSSRNVRLSDEGRNAAQVIYRALTKAKSENELREILASESSLTVDYADFIDEKTFAHAQSDTRKVRAIVAGWINGVRLIDNMPMGNRA |
10 | Acanthocystis_turfacea_Chlorella_virus(16.67%) | protease,tRNA | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', and 'tRNA').
|
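The protein headers in the prophage regions above appear to follow a pipe-delimited layout, for example `>NZ_CP016770|1194968:1204376|1199558_1201622_-|WP_095696461.1|protease|DBSCAN-SWA`. The sketch below is one possible way to split such a header into its parts and to check an annotation string against the keyword list quoted above. The field meanings (contig, region range, gene range and strand, protein accession, optional keyword, tool) are inferred from the layout rather than documented here, and `ProteinHeader`, `parse_header`, and `match_keywords` are hypothetical names introduced only for illustration. The database's actual matching rule is not stated, so the case-insensitive substring test is an assumption.

```python
from typing import List, NamedTuple, Optional

# Keyword list as quoted in the page text above.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


class ProteinHeader(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    keyword: Optional[str]  # present only when the header carries a keyword field


def parse_header(header: str) -> ProteinHeader:
    """Split a header of the assumed form
    >contig|start:end|gstart_gend_strand|accession[|keyword]|tool
    """
    fields = header.strip().lstrip(">").split("|")
    contig, region, gene, accession = fields[:4]
    # Six fields means an extra keyword column sits before the tool name.
    keyword = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProteinHeader(contig, region_start, region_end,
                         int(gene_start), int(gene_end), strand,
                         accession, keyword)


def match_keywords(annotation: str) -> List[str]:
    """Return the keywords found in an annotation string.

    Assumption: a plain case-insensitive substring test; the real
    pipeline may use a stricter rule.
    """
    text = annotation.lower()
    return [k for k in PHAGE_KEYWORDS if k.lower() in text]
```

A plain substring test is deliberately loose (for instance, 'head' would also match unrelated words containing it), so any hits are better treated as candidates for manual review than as confirmed phage proteins.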
DBSCAN-SWA_4 |
1326937 : 1335996
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP016770|1326937:1335996|DBSCAN-SWA ACTACTTTTGCGGAGAAATCTCATCTTCGCTGTTGCAGGTCACAGATAACAGGGTGGCACCTTCATCGAGGTTGACCAGGCGAACGCCCATTGAGTCACGCCCAGTTTGGCGCACTTCAGCAGCTGGGGTGCGCATCACTGTTCCAGCAGATGTGATTGCCAGAATTTCATCGCTATCAACTAAAACAAGTGCTGAGACAAGTGTCCCGCGAGAATCTTCATCAATCTTTGCAGCCTTGATTCCAATTCCACCGCGACCTTGTAAACGATATTCATCAATTGGAGTCTTCTTGCCATAACCACCATCGGTGGCTGTAAAGACAAATGCACCGACAGATGTTTCGGCATCCACGCGTGACATCGTAAGAAGTTGATCGCCTGCGCGGAACTTCATTCCAATCACACCTGATGTTGATCGACCCATGGAGCGCAATGATTCATCATCTGTTGTAAAGCGCAGTGACATCGCTTTCTTTGAAACCAGTAGCAACTCATCGCCGTTGCGAACAAGGGATGCAGAGACAACTTCATCGCCAGGCTTGAGTGAAATTGCAATCAATCCACCTGTGCGAGGTGAGTCGTATTCAGAGAGCGGAGTCTTCTTAACGAGACCGTTCTTTGTTGCAAGAACTAAGAATGGTGCTGCGTTGTAATCCTTAAATGAAAGTACTTGTGCAATCTGCTCATCAGGCTTGAATGCCATCAAGTTAGCAACGTGCTGACCGCGGGCATCGCGACCTGCATCCGGCAACTCGTGCACCTTGGCGCGATAGACGCGACCTTGGTTAGTAAAGAACAATAGCCAGTCGTGAGTAGAAGCCACGAAGAAGTGATCGACAACGTCATCTTGCTTAAGAGCAGCGCCCTTCACGCCACGACCGCCGCGACGCTGTGATTTATAGAGATCAGCGCTGGTGCGCTTTGAATATCCGCCACGAGTGATGGTGACAACAGCATCGTGATCTGGAATTAGATCTTCTGCAGAGAAATCACCTTCGCTTGCAACAAGCTGTGTGCGGCGCTCATCGCCATACTTTGCGATCAAATCTGAGAGCTCTGTCTTGATAATTTCGCGCTGTTTAGCCTCTGATGCCAAGATTGCATTGAGCTCGATGATGTCAGCCATCAAACCTTCATACTCATCATTGATCTTCTGACGCTCAAGTGCTGCAATACGACGCAGCTGCATATCAAGAATTGCATTTGCTTGAATCTCATCGACATCAAGGAGCTTCATCAAACCTGTGCGGGCTTCCTCCGGTGTCTTTGATGCACGGATAAGTGCAATCACAGCATCGAGTGCATCAAGTGCCTTGAGGTATCCCTTGAGGATATGCGCGCGCTTTTCTTTCTCAACCAAGCGGAACTTGGTGCGACGGACAATAACTTCAACTTGGTGCTCGATGTAGTACTTGATGAATTCATCAAGGCGCAGTGTGCGAGGAACACCATCGACGAGTGCCAACATATTGGCACCGAATGTGTCTTGCAGCTGTGTCTGCTTATAAAGGTTATTAAGAATCACCTTAGGAATTGCAGATGATTGAAGAACAATGACAAGGCGCTGGCCTAAACGCTCATTACCTTCATCGCGCACATCAGCGATGCCCTTGATCTTGCCATCTTTGACGAGCTCGGCAATCTTCAAAGCTAAATTATCTGGGTTAACTTGGTATGGAAGTTCAGTGACAACAAGGCATGTGCGCTTATTGATCTCTTCAACATTGACCACTGCGCGCATCGTGACTGAACCACGACCTGTGCGATAGGCATCTTCGATTCCACCGCGACCAACAATGAGCGCCTTTGTTGGGAAATCTGGACCCTTAACAATGGTGAGCATATGGTTGAGAAGATCAGGAGCTGCCATATCAGGATGTTCGAGCGCGTAGATAACTGCTTCACCGATTTCGCGAAGGTTAT
GGGGTGGAATATTTGTAGCCATACCCACTGCAATACCTGCAGAGCCGTTGACGAGAAGGTTAGGAAAACGTGAAGGCAATACATCCGGCTCTTGTGAGCGACCGTCGTAGTTAGGTGAGAAGTTGACGGTATCTTCATCGATATCGCGCATCATCTCCATGGCGATAGGCGCTAAACGAGCTTCTGTGTAACGCATTGCAGCCGCTGGATCGTTGCCGGGTGAACCGAAGTTTCCGTTTCCATCTACCAATGGATAACGAAGTGACCATGGCTGAGCTAATCGAACAACGGTGTCATAGATCGCGGTGTCACCATGAGGGTGGTAATTACCCATGACGTCACCGACGATGCGAGAAGATTTGTAATAACCCTTATCTGGGCGATAGCCAGCGTCATACATTGCATATAAAACACGGCGGTGCACAGGCTTAAGACCGTCACGAACATCGGGAAGTGCGCGACCGACAATAACTGACATCGCATAGTCGAGGTAGCTTCGCGCCATTTCAACTTGAAGGTCAACTTTTTCGATGCGATCGAAAGTTACTTCCTCGGGTGTTGTCTCATCAATAGCCATTAGATATCCAAGAACCTAACATCCTTAGCGTTGCGCTGAATAAATGCGCGACGTTGTTCAACATCTTCACCCATCAGAACTGAGAACAAATCATCGGCAGCTGCTGCATCGTCAAGAGTTACTTGAATCAGAACTCGGTGCTCAGGATCCATTGTGGTATCCCACAACTCTTTAGCGGGCATCTCGCCAAGTCCCTTAAAGCGCTGAATTCCATCTTCCTTAGGTAGACGCTTTCCGGCATCGAGACCAATCTTGATCATGCCATCGCGCTCTCGATCGCTAAATGCGTATTCGACAGGATCTTTTCCACCCCACTTCAACTTATATAAAGGAGGCTGTGCGAGATAGACAAAACCGTTTTCAATCAGTGGGCGCATAAAGCGGAAGAGAAGTGTGAGAAGTAGCGTGCGGATGTGTTGACCATCAACGTCAGCATCTGCCATCAAAATAATCTTGTGATAGCGAAGCTTTGCAATGTCGAAATCATCGTGGACACCTGTACCGAGTGCTGTGATCAGTGCCTGCACTTCATTGTTCTGCAAGACGCGATCAATGCGCGCCTTCTCAACGTTAAGAATCTTTCCACGGATAGGCAATACAGCTTGATTACGTGAGTCGCGCCCACCCTTAGTAGAACCACCCGCAGAGTCACCTTCGACAATATAGAGCTCACACTTTGAAGGATCAGTCCACTGGCAATCAGCAAGCTTTCCTGGCATTCCGCGACCTTCGAGCAAACCTTTTCGATTACGCGCCAAGTCGCGCGCTTTACGTGCAGCAACGCGAGCAGATGCTGCATCGATTGATTTACGAATAATGTCTTTACCCTCTTGAGGGTTTTGTTCAAACCAAGCAGTCAGATGTTCGTTAACAACCTTTTGTGTAAAGGACTTTGCCTCAGTATTACCCAGCTTTGTCTTTGTCTGTCCTTCAAACTGTGGCTCACCAAGCTTGATAGAGACAATGGCTGTCAGACCTTCACGAACATCATCTCCTGTGAGGCGATCTTCCTTCTTACGAATCAAACCTTGTTCTTCAGCAAACTTATTGACAACAGATGTGAGGGCTGTGCGGAAGCCTTCTTCGTGGGTTCCACCCTCATGTGTGTGAATGGTATTTGCAAAGGTATAGACAGATTCAGAGAAGCCGTTATTCCACTGCATGGAAACTTCAAGTGACAGACGTGCCTTCTTATCTTCAGCAACGAGAGCAATGACTGATTTATGCAGTTCTCCACGAGTTGAATTCAGGTGCTTTACAAAGTCTGTAATTCCGCCTTGATATTGATAGCGCACGGTGAGCTGTTCGCCCTTTTCATCAACGTGGCCAGGTCGCATATCTGTCAGAGATAAAATCAACCCGCGGTTAAGAAATGCCATCTCTCTAAATCGTGCAGAGAGAACTTCGAAAGAAAAATCTGTTG
TTTCAAAGATCTCTTCAGATGGCCAGAATTGAATTGTTGTTCCGGTCTCTGTTGTTGCCTCACCTTTTTTAACGGGAGCAGTTGGAACACCGAGTTTGTACTCTTGTGTCCAGATAAATCCATCTCGCTTTACCTTTGCAGATAGATTTGTTGAGAGCGCATTAACAACAGAGGAACCAACACCGTGTAATCCACCAGAGACTGAGTAACCACCGTCTCCGAACTTTCCACCTGCGTGCAAAACTGTAAGAACAACTTCAAGAGCTGGCTTCTTCTCAATCGGGTGGATATCAACAGGAATTCCTCGACCGTTATCGATGACGCGAAGAGATCCATCTTCCATCAGCGTTACAGCGATCTCTGTGCAATAGCCTGCGAGCGCCTCATCTACTGCGTTATCGACGATTTCATAGACAAGGTGGTGAAGGCCGCGCTCACCTGTGGAGCCGATATACATTCCAGGGCGCTTGCGGACTGCCTCAAGACCCTCAAGGACGGTGATCGAGGAGGCGTCATATCCGCCGGATTCCTTCTTCACATCAGCCAAGGGGGGCTCCTTCAAATGTCCTGTATGGGGCTGTAGAGCGGTAATTCCTCTATAGCAAGCTCAAATCCTATCGCTAAATGGAGAGTCAAATAACCCTGAAAAGCCTTAGGAGTGGGAAATACCGCGCGAAGAGCAGCAGGGTGAGGGTTCCCATTAAAAAGTTGTAGAAATACCGTCTCTCCACAAGGTTTATCCCCATTGTGTATGAAGTTACATCAATGTAATTCAGCGTGTGCAGTCACTGAGTAACATCGCACGATTCTCAGCAAAGACTCAGGTTGTAAGAGGATGGTGTGGCCATGGCAACTAAAGAAGTGAGTGAATCAAAGGGTCAGCGCATCCTCGTCGTCGATGACGAATCAAGCATCAGCGATCTCATCGCAACAAGTTTGAAATTTGTTGGCTTCGATGTGCGCACAGCTGCAACAGGATCACAAGCGTTAACAATTGCTGAAGAGTTTAAACCTCACGCAATGATTCTCGATGTCATGTTGCCAGACCTTGATGGATTCGAAGTCTGTCGCCAGATTCGCAATGAAGGAATTGAAGTGGGCGTTCTCTTTCTCTCTGCCAAAGATGAGATGAAAGATAAAGTCCAAGGCTTAACTATTGGCGGCGATGATTACATGACAAAGCCATTTAGCCTTGAAGAACTTGTCGCACGTCTTCGCGCACTCCTTCGTCGCATCGGTGTTACTGCGCAGAAAATGGATGAAGAGAAGATTCGCTTTGCAGATCTTGAGCTCAATGAAGCAACACACGAAGTTCATCGCGCAGGGCACTTACTTGAAGTATCCCCCACTGAATTTACTCTCCTTCGATACATGCTCATCAATGCAGATCGCGTTGTCAGCAAGGCACAGATTTTGGATCATGTTTGGGACTATGACTTTGGTGGCGATGCTGGAATCGTTGAAACCTATATCTCCTACCTTCGTAAGAAGATTGATATCTACGAACCAGCACTGATTCACACTGTGCGCGGTGTTGGCTATCGACTTAGATTGCCTGCAGCTAAGTAGATTCAATGAATTCGCCATTTCGCACCTGGAGCTTGCGCAGCAGACTTTCGCTAGGAATAGTTCTGCTCACAGCGATGGGATTTATCGCTGCTAGCTTTGCTACACAATCACTTCTTAAGGGTTACTTACTCACACAAGTAGATGACCAATTGGTTGCAATTTCAGAAGCAGCCGTTCCTCGAATTGAGCGCGGTGGAATTGCTATTGACGAAGATGGCGACGATGAACGACCTGGTCGTGGACTAGGGCGCGGGCTTGGTGCTCCTGCACCACTTTCTCAAATACCAACGTCAACATCGATCACAGTTCTTGGCTCTGATGGCGCTGTTCTAGGTGGGTTAGGTGGCAATTTAGGAGGCGCATCAATTACTTCCTATCTAGCAGGTCTACTGCCTGAAGAAGTTGCAGTTCATGGTGATGAAGCTTTTACG
ATCGCAGCTCCTGGTCCTGACTTCAGAGTTATTGCAAGGCCACTGACATCACCTGCTGGAACATTTGTCGCTGCGCAAAATCTTGGTGAGCTAGAGCGAACTCTGGCGCGACTTACTTTCTTATTTGGCCTTATCGGTCTGGCACTTCTTATCCTGATTGCCACAGCATCACGAGCTGTAATCAAAATTGGGCTTCGTCCATTGGAAGATGCGGAAAAGACTGCCGGAGAAATTGCAGCTGGGAATTACTCTGCGCGTATGCCAGAGACTGATCCTGGAACTGAAGTTGGTCGTTTAGTCTCATCGCTGAACTCGATGCTCTCACGTATTGAAAAGTCATTTGCAATCCAGAACGAGAGTGAAAACAAGTTACGTCGCTTCGTTGCAGATGCATCTCATGAGCTGCGCACACCAATCACGGCAATTCGCGGATTCTCTGAACTTCATCGACAAGGCGCTGTCACTGGTGAGAAAGAAACTACTGAACTAATTGGTCGTATTGAAAATGAATCAAAGCGTATGGGTTCACTTGTGGAAGATTTACTTCTTCTAGCTCGCCTGGATCAATCACGCGAAATGGATTCAAAGCCAGTTGATATCAATAAGGTTGTTGAAGATGCCGTTATCTCTGCTCGCGCTGCCGGCCCTGATCACCCAGTTGAATTCATTTCAAGCAACGATGAGATTTTTACCTTGGGTGATGAAGTGCGCATCCATCAAGTGGTTGCAAACTTACTTGCCAATGCTCGCGCTCATACGCCAGCAGGTACTCCAATTACAGTCTCTCTTTCAACAACAGATGCAGGTGTTGAAGTCACTGTTGCAGATAAGGGTCCTGGGTTATCACTGGAGGATCAGAATCGGATCTTTGAACGTTTCTATCGCACAGATGCATCACGTGTTCGAACCGGAAGTGATGGCAGTGGGTTAGGTCTTTCTATCGTTGATGCTGTGATGCGCGCCCATGGTGGCAGCGTCTCTGTGGAATCAACGCCAGGTATGGGCGCTGTATTTACCTTGTTATTCCCCCGCAGGGCCGAGTGACTCCATCGCCTTTAAAGTGGCGATACGTTCTTGAATTGGTGGATGGGTACTAAATAACTTAGAGACTGCTTGTCCATCCAGTGGGGATTCAATCCAAATATGTGCAACTGCATTTGATTTCTGACGCACCACAGTGGAATCAGCAGCTAATACTTCAAGTGCTTTACGCAAGCCTGCAGGATTTCGCGTAAAGGAGACTGCAGTTGCATCAGCTAACGCCTCTCGCTTACGTGAAATCGCAGACTTCAATAACATCGCTGCAATGGGAGCGAGCATAAGAACTACAAGAGAGAATACGAGTGCAAGTGGATTTGAATTGCCACCGCGATCTCTATCTCTACCTCCACCAAACCACATCATGCGGGTAAGGAAATCACTCAAGATTGCAATCGCACCGGCCGTTGTTGCAGCAACTGCTGAAACTAAAGTGTCACGGTTTGCCACGTGAGATAACTCGTGGGCAATAACTCCTTGCAATTCATCGCGATCCATTACTTCAAGTATGCGTGTAGTAAAGGCAATGAGGGCATGTTCAGGATTTCGCCCTGTAGCAAAAGCATTAGGGGCGCTATCTTCAACGATGGCAACCTTCGGCATAGGTAGCCCACTTGCGATGACAACTTCTTCAATCACATTAAAGAGCTCTGGCGCATCCTCGCGTGTAATGAGTTTGGCACCTGTCATGGTTAGAACTAATTTATCTGAGCCGTAATACGAACCCCACACGCCAATGAGTGCAAAGCCCACCGCTATCGGCACCATCGTTGATGCTGTGCCCACTCCGAAATATGTCATTGCAGCATATGCAACAAACCATGTCAGAACACCCATCGATGCCAGAAGGAAGTAAGTCTTTCCTTTATTGGCAGATTGCAGTGCGCGGAAGTTATTTTCAGCCATAGATAAAACTAGAACTTAACGTTAGGAACGTTACGATCTGCTGCATTTTCTACTTC
ATAGAATTCGCGAGCATCAACCTTGGCAAAGCCTGCAAAGAAGTTGGTAGGCACTGTCTTCACAGCTGTATTGAGCGAGCGCACAGTGTCGTTATAGAACTGGCGCGAGAATGAAACTTTATTCTCTGTAGTGGAAAGTTCCTCTTGGAGAGATAAGAAGTTGGAAGATGCTTTCAGATCTGGATATGCCTCAGCAACTGCAAGCAAACCACGTAGTGCGTTAGTAAGTGCGCCATCTGCTGCTGCAACATCTGCAACAGTGGATGCACTCGTTGACTTAGCACGGGCTGCAACGACAGCATCAAAGGTTCCCTGTTCGTGTGCGGCATAACCTTTAACAGTCTCAACGAGGTTAGGGATCAAGTCTGCACGGCGCTTGAGTTGAACCTCAATCTGCGCGAATGCCTCATCGACTGTGATGTTCAGTCGTATGAGGCGGTTGTATTGAGCAACAGCAAAGATGACAAGAAGGACGAGTGCGCCAATGACGATAAATTCCATACTTACTTCAACCCCTTTTGCCTGGTTGGTATTCCCCAGATAGAGATTATCTAGGAGTTATTTGATTTTACTTTGGTGGAAATTCATAAAGGAACGGCTTGGAGTTGGCCCACGCTGTCCTTGGTAACGTGATGCGTAGATTGCTGAGCCGTAAGGATGCTCAGCAGGGGATGAAAGTTTAATCAGGCAGAGCTGACCGATTTTCATGCCTGGGAATAACTTCACCGGCAGGTTGGCAACGTTGGAGAGCTCAAGAGTGATGTGCCCAGAAAAACCTGGATCGATAAAGCCTGCAGTTGAGTGGGTAAGAAGTCCAAGGCGCCCTAGAGATGACTTTCCCTCTAGGCGACCTGCGATGTCATCAGGAAGTGTGATTATTTCGTAGGTGGATGCCAGAACGAATTCACCTGGGTGCAAGATAAATGCTTCGCCATCTTCGGCGATGACTTCGCGGGTGAGCTCAGATTGTTCGATCGATGGATCGATCACTGAGTACTTGTGGTTCTCAAAGACGCGGAAGAACTTATCCAGGCGGACATCAACTGATGATGGCTGGATCATCGCCTCTTCATAGGGCTCAACGGCAACGCGGCCTGATTGAATTTCGGAGCGGATATCGCGGTCACTGAGTAGCAC
Protein sequences of DBSCAN-SWA_4 >NZ_CP016770|1326937:1335996|1334811_1335360_-|WP_095676776.1|DBSCAN-SWA MEFIVIGALVLLVIFAVAQYNRLIRLNITVDEAFAQIEVQLKRRADLIPNLVETVKGYAAHEQGTFDAVVAARAKSTSASTVADVAAADGALTNALRGLLAVAEAYPDLKASSNFLSLQEELSTTENKVSFSRQFYNDTVRSLNTAVKTVPTNFFAGFAKVDAREFYEVENAADRNVPNVKF >NZ_CP016770|1326937:1335996|1332433_1333903_+|WP_095696539.1|DBSCAN-SWA MNSPFRTWSLRSRLSLGIVLLTAMGFIAASFATQSLLKGYLLTQVDDQLVAISEAAVPRIERGGIAIDEDGDDERPGRGLGRGLGAPAPLSQIPTSTSITVLGSDGAVLGGLGGNLGGASITSYLAGLLPEEVAVHGDEAFTIAAPGPDFRVIARPLTSPAGTFVAAQNLGELERTLARLTFLFGLIGLALLILIATASRAVIKIGLRPLEDAEKTAGEIAAGNYSARMPETDPGTEVGRLVSSLNSMLSRIEKSFAIQNESENKLRRFVADASHELRTPITAIRGFSELHRQGAVTGEKETTELIGRIENESKRMGSLVEDLLLLARLDQSREMDSKPVDINKVVEDAVISARAAGPDHPVEFISSNDEIFTLGDEVRIHQVVANLLANARAHTPAGTPITVSLSTTDAGVEVTVADKGPGLSLEDQNRIFERFYRTDASRVRTGSDGSGLGLSIVDAVMRAHGGSVSVESTPGMGAVFTLLFPRRAE >NZ_CP016770|1326937:1335996|1331705_1332428_+|WP_095696538.1|DBSCAN-SWA MATKEVSESKGQRILVVDDESSISDLIATSLKFVGFDVRTAATGSQALTIAEEFKPHAMILDVMLPDLDGFEVCRQIRNEGIEVGVLFLSAKDEMKDKVQGLTIGGDDYMTKPFSLEELVARLRALLRRIGVTAQKMDEEKIRFADLELNEATHEVHRAGHLLEVSPTEFTLLRYMLINADRVVSKAQILDHVWDYDFGGDAGIVETYISYLRKKIDIYEPALIHTVRGVGYRLRLPAAK >NZ_CP016770|1326937:1335996|1333879_1334803_-|WP_095676775.1|DBSCAN-SWA MAENNFRALQSANKGKTYFLLASMGVLTWFVAYAAMTYFGVGTASTMVPIAVGFALIGVWGSYYGSDKLVLTMTGAKLITREDAPELFNVIEEVVIASGLPMPKVAIVEDSAPNAFATGRNPEHALIAFTTRILEVMDRDELQGVIAHELSHVANRDTLVSAVAATTAGAIAILSDFLTRMMWFGGGRDRDRGGNSNPLALVFSLVVLMLAPIAAMLLKSAISRKREALADATAVSFTRNPAGLRKALEVLAADSTVVRQKSNAVAHIWIESPLDGQAVSKLFSTHPPIQERIATLKAMESLGPAGE >NZ_CP016770|1326937:1335996|1335417_1335996_-|WP_095676777.1|DBSCAN-SWA MLLSDRDIRSEIQSGRVAVEPYEEAMIQPSSVDVRLDKFFRVFENHKYSVIDPSIEQSELTREVIAEDGEAFILHPGEFVLASTYEIITLPDDIAGRLEGKSSLGRLGLLTHSTAGFIDPGFSGHITLELSNVANLPVKLFPGMKIGQLCLIKLSSPAEHPYGSAIYASRYQGQRGPTPSRSFMNFHQSKIK >NZ_CP016770|1326937:1335996|1326937_1329436_-|WP_095696537.1|DBSCAN-SWA 
MAIDETTPEEVTFDRIEKVDLQVEMARSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMYDAGYRPDKGYYKSSRIVGDVMGNYHPHGDTAIYDTVVRLAQPWSLRYPLVDGNGNFGSPGNDPAAAMRYTEARLAPIAMEMMRDIDEDTVNFSPNYDGRSQEPDVLPSRFPNLLVNGSAGIAVGMATNIPPHNLREIGEAVIYALEHPDMAAPDLLNHMLTIVKGPDFPTKALIVGRGGIEDAYRTGRGSVTMRAVVNVEEINKRTCLVVTELPYQVNPDNLALKIAELVKDGKIKGIADVRDEGNERLGQRLVIVLQSSAIPKVILNNLYKQTQLQDTFGANMLALVDGVPRTLRLDEFIKYYIEHQVEVIVRRTKFRLVEKEKRAHILKGYLKALDALDAVIALIRASKTPEEARTGLMKLLDVDEIQANAILDMQLRRIAALERQKINDEYEGLMADIIELNAILASEAKQREIIKTELSDLIAKYGDERRTQLVASEGDFSAEDLIPDHDAVVTITRGGYSKRTSADLYKSQRRGGRGVKGAALKQDDVVDHFFVASTHDWLLFFTNQGRVYRAKVHELPDAGRDARGQHVANLMAFKPDEQIAQVLSFKDYNAAPFLVLATKNGLVKKTPLSEYDSPRTGGLIAISLKPGDEVVSASLVRNGDELLLVSKKAMSLRFTTDDESLRSMGRSTSGVIGMKFRAGDQLLTMSRVDAETSVGAFVFTATDGGYGKKTPIDEYRLQGRGGIGIKAAKIDEDSRGTLVSALVLVDSDEILAITSAGTVMRTPAAEVRQTGRDSMGVRLVNLDEGATLLSVTCNSEDEISPQK >NZ_CP016770|1326937:1335996|1329435_1331406_-|WP_190286201.1|DBSCAN-SWA MADVKKESGGYDASSITVLEGLEAVRKRPGMYIGSTGERGLHHLVYEIVDNAVDEALAGYCTEIAVTLMEDGSLRVIDNGRGIPVDIHPIEKKPALEVVLTVLHAGGKFGDGGYSVSGGLHGVGSSVVNALSTNLSAKVKRDGFIWTQEYKLGVPTAPVKKGEATTETGTTIQFWPSEEIFETTDFSFEVLSARFREMAFLNRGLILSLTDMRPGHVDEKGEQLTVRYQYQGGITDFVKHLNSTRGELHKSVIALVAEDKKARLSLEVSMQWNNGFSESVYTFANTIHTHEGGTHEEGFRTALTSVVNKFAEEQGLIRKKEDRLTGDDVREGLTAIVSIKLGEPQFEGQTKTKLGNTEAKSFTQKVVNEHLTAWFEQNPQEGKDIIRKSIDAASARVAARKARDLARNRKGLLEGRGMPGKLADCQWTDPSKCELYIVEGDSAGGSTKGGRDSRNQAVLPIRGKILNVEKARIDRVLQNNEVQALITALGTGVHDDFDIAKLRYHKIILMADADVDGQHIRTLLLTLLFRFMRPLIENGFVYLAQPPLYKLKWGGKDPVEYAFSDRERDGMIKIGLDAGKRLPKEDGIQRFKGLGEMPAKELWDTTMDPEHRVLIQVTLDDAAAADDLFSVLMGEDVEQRRAFIQRNAKDVRFLDI |
7 | Bacillus_virus(33.33%) | NA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
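The keyword note above can be illustrated with a minimal sketch. It assumes that the check is a case-insensitive, whole-word match against the annotation field of a protein header (for example `site-specific-integrase`). The function name `phage_keywords_in` and the matching rule are hypothetical and are not taken from the database's own pipeline.

```python
import re

# Keywords quoted in the note above. The matching rule below is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keywords_in(annotation: str) -> list[str]:
    """Return the phage-related keywords that occur as whole words in an annotation."""
    # Split on anything that is not a letter or digit, so that hyphenated
    # annotations such as 'site-specific-integrase' become separate words.
    words = set(re.split(r"[^A-Za-z0-9]+", annotation.lower()))
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in words]


if __name__ == "__main__":
    for label in ("site-specific-integrase", "hypothetical-protein"):
        print(label, phage_keywords_in(label))
```

Whole-word matching is a conservative choice here: it avoids flagging unrelated words that merely contain a keyword, at the cost of missing compound terms such as "baseplate". A substring match would behave the other way around.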
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|