Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP016772 | Candidatus Planktophila dulcis isolate MMS-IA-53 chromosome, complete genome | 4 crisprs | DinG,WYL,cas4,DEDDh,cas3 | 0 | 0 | 3 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP016772_1 | 216822-216934 | Orphan |
NA
Consensus repeat of NZ_CP016772_1
|
2 spacers
spacers of NZ_CP016772_1
>1.1|216848|19|NZ_CP016772|CRISPRCasFinder TGCGAAGAAGACGACCGCT >1.2|216893|16|NZ_CP016772|CRISPRCasFinder AACCAAGAAGGTTGTG |
CRISPR arrays and Neighbor proteins around NZ_CP016772_1
The CRISPR arrays of NZ_CP016772_1 >merge|NZ_CP016772|1|216822-216934|CRISPRCasFinder AAGAAGTCAACCGCGAAGAAGTCTCCTGCGAAGAAGACGACCGCTAAGAAGTCTGTAGCGAAGAAGAAGACAACCAAGAAGGTTGTGAAGAAAACTGCAGCGAAGAAGACCAC >NZ_CP016772|1|1|216822-216934|CRISPRCasFinder AAGAAGTCAACCGCGAAGAAGTCTCC TGCGAAGAAGACGACCGCT AAGAAGTCTGTAGCGAAGAAGAAGAC AACCAAGAAGGTTGTG AAGAAAACTGCAGCGAAGAAGACCAC
>NZ_CP016772.1|WP_095692157.1|216078_216696_+|RdgB/HAM1-family-non-canonical-purine-NTP-pyrophosphatase MSHKLLLATRNKGKIEEFRRILDAVAPGEIDLVGLDQFPELHDVVEDGATFEENALKKAREMSLAVGIPAIADDSGLCVDALKGDPGIFSARWAGSHGDDAANTAKVLQQLSDIPDEKRSAHFTCVAALYLPDGRSHCEEAHFDGWILRAPIGEHGFGYDPIFRPDGLELSSAQMSAEDKDAISHRGKSLRAIAPHVITLLKTLG >NZ_CP016772.1|WP_095692156.1|215352_216078_+|ribonuclease-PH MARNDGRTVDQLRDIKITRGWLDHAEGSVLVEFGKTRVLCVASFTPGVPRWLKDSGTGWVTSEYAMLPRATHTRSDRESVKGKLGGRTQEISRLVGRSLRGIVDMKELGENTIVIDCDVLQADGGTRTAAITGAYVALADAISWAQKQGHIKANAKPLADSVAAISVGIIDGVPMLDLCYEEDVRAETDMNVVCSGDGRFIEVQGTAEGAPFDRVLLDSLLDLAVAGCATLTELQKQALAK >NZ_CP016772.1|WP_095675767.1|214524_215322_+|glutamate-racemase MSNAPIGIFDSGVGGLTVARAILDQLPNESTLYIGDTARGPYGPRSLAEVRDFSLETLDFLVDQGVKALVIACNTASAAMLRDARERYSVPVIEVIQPAVRRAVAATRTGKVGVIGTRATIDSKAYLDAFAAAPQLKISSIACPLFVEYVERGETSGDAITKVAREYLQPMIDAEVDTLVLGCTHYPLLTGVISYVMGNDVSLVSSAEETAKDLYRVLVENSLLRGPSSTPASHKFLSTGDSKAFEVLARRFLGPEVGSVQHQVL >NZ_CP016772.1|WP_095692155.1|213547_214498_+|cysteine-synthase MARYDSLESSVGNTPLIGLPRLSPAPNVRLWAKMEDRNPTGSIKDRTAISMIEAAERDGLLKPGSTILEPTSGNTGISLAMASKVRGYKLICVMPENTSPERRQLLEMWGAEIISSPAAGGSNEAVRVAKEIAEKNPDYVFLYQYGNPANTEAHYKNTGPEIFTDLPTITHFVAGLGTTGTLMGAGRYLREQNPDIQIIAAEPRYGELVYGLRNIDEGFVPELYDATVLTRRFSVGAEDSVKRVRELLEVEGIFAGISTGAILHAAIAMGNEALRDGRDADIAFIVCDAGWKYLSTGIYGSQIAEATEGLDGTLWA >NZ_CP016772.1|WP_095692154.1|213272_213545_+|MoaD-family-protein MSIEVRIPTILRPYTKDQKSVEAAGATLSAVITDLDANYAGLGERLLENGALRRFINVYVNDEDVRFLGGLDAQLKDGDSITILPAVAGG >NZ_CP016772.1|WP_095692153.1|212810_213227_+|M67-family-metallopeptidase MTLEISQAFVDAILEQSRVEYPDECCGVILGPAGSGKALRHKPMINAAHSPTFYEFDPKDLLALYREADDNDEEIVVIYHSHTETEAYPSRTDIAYAGEPGAHYVLVSTRKEIAPATEFRSFRIVDGVVTEELVTISG >NZ_CP016772.1|WP_095675763.1|212195_212786_+|DUF2017-domain-containing-protein MTATEGFSRHGDHSYVATFADSEKEVLLNLCEQIIELLAERQDHGHEDPLAAMVGITSHDSPPEDEVLHRLLPNAYADEVDASEFRRYTESTLRQKKQAHAISMRIHLKSSDDGTIDLDHDNANAWLGGINDIRLALGVRLKVENNSHEELELLSPDDPLRGVYAVYTWLGWLQETLLSALIDDADEDEESQLGSS >NZ_CP016772.1|WP_095676812.1|211905_212199_+|ATP-dependent-Clp-protease-adapter-ClpS MVKTADKIEEEIRAIFSSDTPWVTVVWDDPVNLQTYVVYVFMELFGYSKARATELMLQVHNEGKAIVSTGSREEMEHDVARLHEYGLWATIQRGDQL >NZ_CP016772.1|WP_095692152.1|210136_211906_+|hypothetical-protein MVKTQRKFIGVVAVATLFLSLISTPISAADNPPRKIMTGWVPYYSMKTALPDVLNNIDLIKEVMPFWYTLKFDGKAKAAVVTDLYAPANPSVPISEPLTAMRNAGLSIIPTITDGTSKLVLAGLLKNPTSRTQVVSAIMNLVRANNYDGIDIDFEGFAFVDGNSTWTSTAPSWVAFIKELSIALRAEKKLLSVSTPYVLNPNEAQKGYFVYAWAAIASSIDKLRIMTYDYSVSKVGPMGPITWAERTVQYAVSVMPASKVFVGVPGYGRDWVTAVTGVCPANVVNSVKPGAKAATFVMRDAVALAATYGTVPRYDEKFGEMTFSYQKVYNGTTATGLATSCTASRTAWYQDARGWALRAALVTKYRIGGITAWTFGMEEPLAMESIRQVAKEIAPDQVAVTAAIDNSTIDYGNPITVTAAFTIKDKSPVVGVPVRIEGKSAGDTNWRTLATVTTGIDGKIEKAVLVGKSTAVRVYSDSTWERTEGASSEFPIVVNRLLVISAPGTAKSSVATVITGNIRPRIAGASVQLEKLVGKEWKPLDVAVLTDAQGNFSLNLSGQTRGVSSLRISVAADSLWSAVLSPIFNIIVR >NZ_CP016772.1|WP_095692151.1|209838_210144_+|hypothetical-protein MESLALIVSLMIGNILFSGPFALLLTLPRIRAISTGIPFLIFRRLAMGTAALTGIFLSVIFLFNDLQLIVKALSLLCIGTHLWAADREYGKFISSRLRRNG >NZ_CP016772.1|WP_095692159.1|217654_219400_-|acyltransferase MAASRGIQYIPAIDGLRAVAVIAVMFYHLGFTWIPGGFLGVDLFFVISGYVITRLLLDSIEQSGGLDLRGFYIARARRLLPALVFMLVSTTIAIGIWAPDAIKRLLIDTPFSLTGTMNWWLVARHQDYFESIGRPPLLQHTWSLAVEAQFYLVWPLILYFILKQFGKKHIPLASLAIAAASGITLLLVSFSLDASNASKVSHVYFGTDTHSIGLFLGAALAVSWIPQNFTKTVSRKAQDFIDGVGFLGFIGILAAFLLIDENQPTLYKIAFPLAGLCGAAIIMSVVHPASRFAPVLQNPIFLWIGERSYAIYLWHWVIFQVTRPSVDLAGKEWALYSLRILIVLALSDISLRYVELPIRRGVIQYWWKGLKYRTKKERSQQTRTFSIITVIVLLLASVVSVRAIGIANDQRQRLEDSLTATPTANTEVVKDGLWVTGDSVILGIRSKLGESHPISIMNARIGRQAPELLSVMLQDKKEAANVPVIFNLGNNNALTREQTVAIFEAVKDQPRIIVVNTAVPRPWREGNNSLIAEVASKYANVIIVDWNAISEGRPEYFAPDGVHLVPTGVDVYVAEILKHLD >NZ_CP016772.1|WP_095675771.1|219405_220251_-|hypothetical-protein MAEKNFRNWVGFREEADRAPVANPVDRIRELESQLADLKSRRDITGLSREEFEILATETAMAMIKSAQAREAKATAAADRVINETNRTAKDTLEGAENKARSILAGAESRGRKYISTAEAEASEIVRDAGREATAVANAKIAEADSAVDAKRRDAAALTTAARREAERVISEAADNVVEYRTWLSDIIAESERLYRSQATALSAAESAIAASREKLDSAFARLTKMQQVVDNSLNEDGTVKKSAPIRVESKRTRAAIAAPKKTSKAPAKKIAPKKKPAKRK >NZ_CP016772.1|WP_095675772.1|220315_220963_-|hemolysin-III-family-protein MSTEPIQSPPKLRGWFHLAATPLVIIASLVLFILSGESLKWAVALYSITAIMLFSVSAIYHRVPWIPRKKKIWRRWDHANINLLIAGSYTPFAVALLDDRDRNVLLAIVWTGALLGVALRVFWVNAPRFLYVANYLLLGWVAIIYTPQLYKEGGLWVILPIIIGGLLYSIGAIFYALKRPGRNAKYFGFHELFHIFVLAAWISQYLAVSFAIYRK >NZ_CP016772.1|WP_095692160.1|220972_222184_-|MFS-transporter MLTQLKDLKAYHGFTGLAISRFISNVGNGVSPIALAYGVLSLPGSTGKDLSIVMAARFVPLLAFMLFGGVLADRFQRNRLVGGSDMIGSFLAAVSAISLIAGFSSTWLLALMGALFGILNAIWWPAMSGVLPEILPKEKLQEGNAVIGLLTNFGYIVGTLGGGILVSTVGAGWGLLVDAISFFIAGVIVWYLPIIGKIKDKSPGIIHDLAVGWKEFISRSWVIAMVVAFALINMAFESMLSVLGPLNFSDPISGPKQWSYNLAGLSVGMLIGGIWVLKVKIGRPLFLAMILVSLSAVWDFALAFDVPMFFSVIAAVISGISLEVFMVTWNTSLQSHVPEESYSRVSSYDTLGSFGIAPLGIVIAGPLAMHFGVNTILIVTGVTTLIAAVASLLVPSVRNLRND >NZ_CP016772.1|WP_095692161.1|222410_223172_+|bacteriorhodopsin MSVTLDSNQWNLVYNIFSFGLISMLACTVYTLVSQSRVLPKYRNALVMSSMVTFIAGYHYFRIFNSFDEASEGMVVNVSGEQGAFNEAYRYVDWLLTVPLLLVEVIAVLALAKEVSKSLIMRLVPASAAMIALGYPGEITSDKNTAILYGVLSTIPFLYILYVLFVELGKSLERQPAGVAETIGRLRLLLIATWGVYPISYILGMNGDPTASSFVGVQVGYTIADVLAKCVFGLTILKIARMKSHAEGMAADH >NZ_CP016772.1|WP_095692949.1|223265_223487_+|hypothetical-protein MGGYLLLAVGLINLRYQTGKSDVLNHSLILIIPGAILLGLTFISAGKKWLNTKAATAMVIACGGLLLIYSFIN >NZ_CP016772.1|WP_095692162.1|223483_224797_-|bifunctional-o-acetylhomoserine/o-acetylserine-sulfhydrylase MTNNWSFETLQIHAGQTADPTTGARALPLYQTTAYQFRDTTHAANLFGLAELGNIYTRIMNPTQDAVEQRLAALEGGVAALLLASGSAATTFAVMNVAEAGDHIVSSPSLYGGTYNLFHYTLPKFGIEVTFVDDPNNPESWKKAVKPNTKAFFGETIANPKNEILDIKAIADVAHSVGVPLIVDNTVATPYLIKPIDFGADVVVHSATKFLSGHGNAVVGAIIDAGKFDYAQHQDRFPGFNKPDPSYHGLVFSQALGVGSAFGANLSYIFKIRLQLLRDIGAAVSPFNAWLLAQGLETLSLRMDRHIENAKAVATWLEAHPDVEKVNYAALKSSPWNALAAKYAPKGPGAVLSFELKGGVEAGKKFVESLKLFSHVANIGDVRSLVIHPATTTHSQLSPAEQLEAGVTPGLVRLSLGLENIQDIKADLEDGFTAARG >NZ_CP016772.1|WP_095692163.1|225033_225360_-|hypothetical-protein MAKQSPAKIKKLRGEAMRAAAARKAARAVSASTHSEVDLGAYAGVDGPWRELGLAAPARRALIDEGYYKLSDLRKVSLDAIKDLHGMGPNAIRILTTAMKKADLSFRK >NZ_CP016772.1|WP_095692164.1|225524_226748_-|ABC-transporter-permease MSTQATLEKETLKGATSNYLSRVKSGDIGSLPAVLGLISLIAVFGAMSEFFLTNRNFANLLTQAAPVMVIAMGLVFVLLLGEIDLSAGYASGVCGAVLVLLVTNEGWSWYTALGASIAVGALLGVLIGTLVSRLGIPSFVVTLAAFLAFQGVLLLLAGEGGTIPIADKTILAVENSNMTPMQGWILWAVSSAAYVLGGLRRINSRRKAGLVVELTQLWAMKTIALLIITGGAVYQLNQERGLSATNSTKGVPIVAPLILVILIAGTFLLSRTAFGRHIYAVGGNAEAARRAGINVKRVRTIAFVLCSALAAVAGMLFASRMNSISPSTGGSSTLLYAVGAAVIGGVSLFGGKGRMRDAILGGFVVAVIDNGMGLLGYGAGIQYLVTGAVLLVSAGVDAVSRRGALTN >NZ_CP016772.1|WP_095692165.1|226747_227497_-|sugar-ABC-transporter-ATP-binding-protein MSTPLLSLKGINKSFGPVHVLKDVNFDVYPGQVTALVGDNGAGKSTLIKCIAGIYTPESGEFLFEGKNVTIDGPRAATALGIEIVYQDLALCDNLDIVHNMFLGREEKKGITLNETSMESLARKTLDGLNVRTVKSIRQTVSSLSGGQRQTVAIARAVLWNSKVVVLDEPTAALGVAQTEQVLNLVRRLADKGLAVVLISHNLIDIFQVADNIAALYLGNMASQVKKSDVTTNQVIELITTGKSEGVTK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP016772_2 | 321039-321178 | Orphan |
NA
Consensus repeat of NZ_CP016772_2
|
1 spacers
spacers of NZ_CP016772_2
>2.1|321089|40|NZ_CP016772|CRISPRCasFinder CGCACCGCTTAATTAAATATGCAATAATTGGACTCCGCTT |
CRISPR arrays and Neighbor proteins around NZ_CP016772_2
The CRISPR arrays of NZ_CP016772_2 >merge|NZ_CP016772|2|321039-321178|CRISPRCasFinder GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCCCGCACCGCTTAATTAAATATGCAATAATTGGACTCCGCTTGTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC >NZ_CP016772|2|2|321039-321178|CRISPRCasFinder GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC CGCACCGCTTAATTAAATATGCAATAATTGGACTCCGCTT GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC
>NZ_CP016772.1|WP_095692237.1|320500_320950_+|hypothetical-protein MKFDIKQVFPENPSKFEGFRIIRLIAALYMSVMVARSCIHLFAPDGGAQSIAGIDTSVEGGDNIIAIFHQWGAIQLILAILLFVLFFRYPGLTPLILLTLTLDPVLRFVAGQQMSLTTTGTPPGEALNGVSLYLLLVLFLGSLWNKKPN >NZ_CP016772.1|WP_095675841.1|319823_320492_+|hypothetical-protein MKIKSVAISATAFVLLGGVLGVQQYISSQITSKVQREMPNASGISASVPLADVPSNLTSDLIKSADINIKSFALKESGTKTSLNISASSISKAKPTLVGSLEITATIPASTITKSSEFNDAQIVGNTLQVSAGAGGMGTAILIPKYSNSQLYFELQSVSILGNQIPASSLPSDLQNQIKSRSQRSLTPPKGLKVKSVSLSSKGLSVKMFGNNIQLGNLGSGL >NZ_CP016772.1|WP_095692236.1|319164_319644_+|hypothetical-protein MKRLKFGFIPVLLLLLASCSNDSQAMVEVNLAPVPYSIQVPAEIAKHLSVENMVTTTPNEFTNQAREAGAIAQVYINYREDDGTTHGFAGVYYFKKADFEKAGNPNEPPVYGSKVLEEKSMVLAIAGPQDSIFDPNSQDGKKSMALYTLVYDPRSFKSS >NZ_CP016772.1|WP_095692235.1|318448_318949_+|hypothetical-protein MRKFLIVTIVSTLIIIVTYFLPSGVWAEFGGLPAHPLIIHGVVVLLPLLAIFLLVGLFWKNLLKKLHLPLIGMLALSVVGVLAAKSSGYSLSAVVGLPRSHAQWGNYLVLLAIALVSSFVLFSYFSFYKKSKIASSSLGVLMAFLAVSAIGMTYVVGHSGAESVWK >NZ_CP016772.1|WP_095692234.1|318001_318415_+|Rieske-2Fe-2S-domain-containing-protein MEPISRRSFIAGVCAVVALGGSEVPAAANTSVKKLPGGRLSVDLKAVPALAKVGGATRIGSVKGVPVAIARTGTSKYIAFNLLCPHQKVTVTQNEKGWVCNAHGSEFESDGDLALGPATTGLARVPMKISKGLATIG >NZ_CP016772.1|WP_095692233.1|317380_317992_+|hypothetical-protein MRKVLVSIVTIIGLVVSSNVAFADSAKPGQSMTHMKTAAGVASTLEAAGVILYVQGGATSAVIGENVSAATSQVVFHIPVTANKAGVQHIGSNIIFFNTANNQYLTLKNPVIDLAKGVVSATVPQAGDAKVDILTITNASTAKPKITNDKKTKQRTTAYTGTTLVLAPGVAATIASVLGLPAGSLPDGLAFGTADVTLYSKLK >NZ_CP016772.1|WP_095692232.1|316826_317369_+|DUF305-domain-containing-protein MLKKIIVSLLAIGIILVPNSANASSHAKSLQNLGMSEIMFAQMMIPHHEQAISMSETALKKSRNQEILKLSRQIKTLQSSETSQLTYWLKATNSSMTMDHDMKMSGMLTVKEFASLKQLTGTQFDRTFLQLMIKHHQGALEMLDLISGSRNAEAKALAKAINSAQSKEISSMKLLLKKLK >NZ_CP016772.1|WP_095692231.1|313525_316213_+|pyruvate,-phosphate-dikinase MTTQFVYSFSEGSKELKDLLGGKGANLAEMMRIGIPVPPGFTITTEACKEFLALGVAPLELEIQITKALRELEDEMQKRLGDKKDPLLVSVRSGAKFSMPGMMETILNVGLNDESVQGLIKQTKNPRFAWDAYRRLIQMYGKTVLGIEGQKFANELDKAKKDQGVVDDHQLSVESLSKLVETFKNIVFQESGKNFPQDPREQMDQSMRAVFNSWNTDRAKLYRRRERIADDLGTAVNIGTMVFGNMGEDSGTGVCFTRDPATGELGAYGDYLQNAQGEDVVSGIRNTLSLEDLGRLHPDVFSELRGIMYNLETHYRDLCDIEFTIERGKLWILQTRVGKRTASAAFRIAMQLVDEHIITMDEALVRVNGGQLAQLMFPQFDMRNAPKAITKGMAASPGAAVGKAVFDSETATTWAENGERVILLRRETNPDDLGGMIAAVGILTSRGGKTSHAAVVARGMGKVAVCGAEELEVNENEKFAKVGSLRINEGDYISIDGSTGEVFNVEIPVEPSTIVRYLTDGIADASEGKEDGTRELIRSVDRLLRHADRRRKLRVRANADTDLDAAVARKFGAEGIGLCRTEHMFLGERRVLIERVILAKNNEHREEALAALLPLQREDFFNIFKEMDGHTTTIRLLDPPLHEFLPNLADLKVKSALARERNQVIEADERLLYEVEKMYESNPMLGLRGVRLGLVSPGLYELQVRAIAEAMADRIASGANPKVEIMIPLVGSHMELKITRLAAEEVIQEVARERKIELPIEIGTMIELPRAALTANRIGLVADFFSFGTNDLTQTTWGFSRDDVEAEFFAKYLELGVFTISPFETIDQSGVGELLIIATERGKSVNPNLHFGVCGEHGGDPESIHFFHKVGLDYVSCSPFRVPIARLEAGRAAVS >NZ_CP016772.1|WP_095692230.1|312719_313478_+|SDR-family-oxidoreductase MSNDLSSFSGQVVLIVGAASGIGRAAAQLITSREGTVVIADLDMAGLASLQKELGIKDKQVKSVNLGDQSSIQALITSVISDHSQIDALINTAGVVGPTNTKVEDVEWAAFERTVTINLFGTVWITQAILPHMKTRKYGRIAHVASIAGKEGNPGMHAYNTSKSGMIGFIKGVGKEVAAEGITINALAPAVIRTPMNADTSEETLKYMLGRIPMGRVGEPEEAAEMLAFMASKACSFTTGFTFDTSGGRATY >NZ_CP016772.1|WP_095692229.1|311101_312730_+|D-aminoacylase MTSNTFTIRGATVIDGSGSVGVKKDVVIVEGNIAEVGKLIKGNERGKIIDASDLTLTPGFIDMHSHSDLGVIADKAHLSKVTQGVTLEVVGQDGLSYVPSNEKVQAELRAQLYGWNGTLNDHDWNFNSVSQYLGEVDKGSAVNVAYLMPHGTIRMLVRGMNEGISSAEDIEKMQEILRTGMQEGAFGMSAGLTYVPAMYSDTHELIELCKVVREFGGYYAPHHRSYGAKIFESIAECIQISKESAVPLHLTHCHLSAPIYHGRANELLKLLDDASGQGIDISLDTYPYLAGSSYLHMMLPSWVQAGGIDQLRIRLREPEVQKKVIDALDHIGSDGNQGGVVNWDNIVIAGVEKAENKKYVGIAISKLALSQNKLASQLYIDLVLSEDFKASMVVFGGNEENVRTIMKDSRHTVGSDGILHGDRPHPRAYGTFARFLGHYSRDEQMFPLEGAVNRMTGRPAMRLGLQDRGFIREGYRADLVLFDNESIADRSTFESPRLPASGFEYVWINGIPTLEKGERTNLVPGKGIRKTALTNLGGKNVK >NZ_CP016772.1|WP_095692238.1|321465_321954_+|SRPBCC-family-protein MTSEKVRSEIFDTGNPKIKSARIIVEASPSTIFAILSNPKRHRDIDGSATVTANVSGPEALVLGSKFGMKMRLGITYWITNTVVEYKKDELIAWRHLGRWRWRYELTTLGNGSTQVTESFDGTYAPAVAQVWLNFRKAYPWTQLAVAKTLVRLKTVAESEGQ >NZ_CP016772.1|WP_095692239.1|322139_322463_+|hypothetical-protein MKFLISVIDDLSNSGTPAEMVAIDAFNDQLRTNGQWIFAWGLQAPETATVIDNRGGADSETGHPLFDSKEHYSGLWLIEAADAATAKKLAFEASKACNRKVELRPLH >NZ_CP016772.1|WP_190283231.1|322601_323552_-|DMT-family-transporter MNQLTPVNQSKLISSKYMAVALSKTQRSGLLFAFLGIFAFSLSLPFTKLALKSFDPFFTAFARPVIAAVIAIPLMMIAKVPMLPRNLWKPTAFTAAGAVFGWPILIALALQRTTSAHVSVIAAVMPLVTAIIAVIKHKKHPGLSFWVASSLGTVLLVAFSITRGGGTNADLKTDLLIIGAVIASSYCYVEGAALTSHMPGWQVISWVVVVSLPIALPAAAFVYAQTNADYSFHGDALFGLLAIGLSSMYLGFFAWYRGLRDFGVAHGSQVQQLQAIMTLGWSALLLGETVTLTMALSAIGIVLCVLWALSNVNRVK >NZ_CP016772.1|WP_095675846.1|323505_324513_-|Gfo/Idh/MocA-family-oxidoreductase MTQKLRIAIIGAGRIGYVHAGSVNDTPELELVYVVDPFEENAKKVTAAFGGKVSNDPSAVIASGEIDAVIIGSPTATHIPLLRECIAAGVHALCEKPIDLDVKNVEEFRALANSAKTNITLGFNRRQDPQYKALKAKVASGAIGTVEQVILTSRDPGPAPQGYIAVSGGIFRDMTIHDFDMARNFVPDIVEVTAFGANSFCDYIKEEGDFDNISVIMKGSNNELITVVNSRHAAFGYDQRAEIFGDKGMLQISNLSDTTVKSFTKDGTTAGEPFMDFFLERYADSYRNELKLFIEGIKTGKVLGSTYDDGRAALILADAAHESAHTGKSIKVNLK >NZ_CP016772.1|WP_095692241.1|324549_325572_-|Gfo/Idh/MocA-family-oxidoreductase MSALPKPHIFTAAESKPLRWGIFGAGWISEAMVKTAQLNSNQQFVAVASRTPGKAEAFAQKWNIDSFHNSYEELAARDDIDAIYLGTLPSDRLEVALVAINAGKHVLIEKPITMDYAEAQQIYAAAKAKKVLAMEAMWTRFLPQMDIARQLVTDGALGDVELVVSNFCQNNLGVTRLFTLGGGNPIIDMGIYPAALSQQFLGNPNEIHAFGKLHPNEIDEETHAFMRFANGSRSNFVLSARTTLPHWAGVSGSKGAITFGTPWFTPSSITFHESTFNGAQSTWVDDLGIPEHFGLIYQVHAFAQYVDQGLLEGPLYTHHDSLSNIKTVLEIGNLIGTRYK >NZ_CP016772.1|WP_095692242.1|325581_326658_-|transaldolase-family-protein MTQSPFLYMKENSPTVLWNDSADPKELKDALTWGIVGATCNPVIALTAIKADAPHWVSRIKEYAKSHPAATEDEIGWAMVKELSTNAAKLLEGEFEKYNGRNGRLSIQTDPRNFRNAKALAAQAVEFAQLAKNMIVKIPVTTEAISAFEEATYQGVSLNATVSFSVAQTVAVAEAIERGLKRREAEGLDISTMGPVCTIMVGRVDDWVKVSAEKIGAKVDPEILEWSGVAVFRNAHKIYQERGYRTRLLSAAFRNHMHWSEILGGDSVISPPYSWQVKINEMGITPNLNSVNEPIEARILDPLLENFPEFRKMYDVDGLAVEDFTNFGGTLRTLRGFLQSVNDLESFVRDVTVPNPDK >NZ_CP016772.1|WP_095692243.1|326662_327577_-|TIM-barrel-protein MTAQIRVGTAPDSWGVWFPSEPHQVPWDRFLDEVVEAGYHWIELGPYGYLPTDPKQLEDELGKRNLKMTAGTVFTGFHKEDESQWQRAWDQALAVANLVSKLGVEHLVVIPDLWRDDKTGQARESRTLSNEQWKRLAAGHNKLGKALLEEFGIHQQFHSHADSHIGTYQEVERYLQETDPKYSNLCLDTGHFAYYLGDNLKMMNAYPERIGYLHLKQVHPDILAETLKNDVPFGDAVAKGVMTEPGFEGVPKFAPIIERALEINPEIFAIIEQDMYGCPVDMPFPIAQRTREHILAATRAARVK >NZ_CP016772.1|WP_095692244.1|327586_329500_-|3D-(3,5/4)-trihydroxycyclohexane-1,2-dione-acylhydrolase-(decyclizing) MATRKMTVSQAVVEFLSHQYTVDGDHRERTIQGVFGIFGHGNVAGIGQALKQLSVENPSLMPYYQARNEQAMVHESSAFARMKRRRATFACTASVGPGATNMLTGAAVATTNHLPVLLLPSDTFANRASDPVLQQLEMPHDATLSVNDAFKPLSRFFDRVQRPEQLFSALMGAMRVLTDPVETGAVTICLPEDVQAEMIDVPEEFLADRDWHIRRPRAEAAQLAEVARVIASSKRPFIVAGGGVIYSDAHDALQTFVEQTKIPVGTSQAGVGSLNWDHPQLLGSVGATGTTAANRAAKEADLVIGIGTRYSDFTTSSRTAFQNPDVRFININIASFDAFKHGSAMPVVADARESLRELTALLATFATTSDYQSKYTKEKSEWDAVVDAAFVDQKRALPSQTEIIHAVQSASDATDTLICAAGSLPGDLHKLWRVRSPLGYHVEYAFSCMGYEIAAGLGAARAGATPIVMVGDGSYLMMHTEIVSAVAEGLKVIIVLIQNHGYASIGHLSESIGSERFGTQYRFKDQAGNNFESGEKLPVDLAANAASLGINVIDIKQTPSAIGDLHAAVMKAKQSSTSTLIHINSDPLLYSPDGEGWWDVPIAPISTLKSTQDAYAQYKDEISLQRPLLGNGTKDKK >NZ_CP016772.1|WP_095675851.1|329501_330422_-|5-deoxy-glucuronate-isomerase MSSADKWYFRHGELSRDGWDVFLDPQSPPVAGWKYTGLRIGTLTESKSLTLPADSNERIIFPLEGQEFLVEYTHDGNTSSQILHGRTSVFHGPADFIYLPINTSATISGVGRIAVGQTPATKVKAVRYVAKEDVSISLRGAGRETRQVHNLGMPETLDADRMIVCEVIVPAGNWSGSPSHKHDVYIPGKESELEEIYYFQSAVTRGAKTPPSSLPFGYFRGTSADSRPYDVNEEVHSGDVALVPYGWHGPAAAGPGYDLYFFNVMAGPDPDRAWNATDHPDQVWIRDSWQSQQSDPRLPYGSTERI >NZ_CP016772.1|WP_095692245.1|330430_331921_-|CoA-acylating-methylmalonate-semialdehyde-dehydrogenase MSTIVNHWINGAEFVSTSGRTSPVYDPALGIETKRVALANQAEIDAAIKAAMDAFPAWRDESLAKRQQIIFTFRELLNSRKGELAEIITSEHGKVLSDALGEITRGQEVVEFATGIPHLLKGFYSENVSNGVDVYSTRQPLGVVGIISPFNFPAMVPMWFFPIAIAAGNTVVIKPSEKDPSASMWVAKLWKEAGLPDGVFNVLNGDKESVDGLLNSPDVESISFVGSTPIAKYIYESASRTGKRVQALGGAKNHMLVLPDADLELVADSAINAGFGSAGERCMAISVVVAVEPVADKLIPKIVERMGKLRTGDGRRGCDMGPLVTREHRDKVASYIDIAEKDGATVVVDGRNPQVDGDANGFWLAPTLVDKVPTTSKVYTEEIFGPVLSIVRVKSYDEGVALINSGAFGNGTAIFTNDGGAARRFQNEIQVGMVGINVPIPVPVAYYSFGGWKQSLFGDTKAHGVEGVHFFTRGKAITSRWLDPSHGGINLGFPQN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP016772_3 | 935901-936000 | Orphan |
NA
Consensus repeat of NZ_CP016772_3
|
1 spacers
spacers of NZ_CP016772_3
>3.1|935929|44|NZ_CP016772|CRISPRCasFinder ACTGCATCAGTAGGTGCTGCTTGAAAGATCGGAACGGGGATTGC |
CRISPR arrays and Neighbor proteins around NZ_CP016772_3
The CRISPR arrays of NZ_CP016772_3 >merge|NZ_CP016772|3|935901-936000|CRISPRCasFinder TGCAGCTTTCTTTGCGCGCTTAGGCGCAACTGCATCAGTAGGTGCTGCTTGAAAGATCGGAACGGGGATTGCTGCAGCTTTCTTTGCGCGTTTAGGCGCA >NZ_CP016772|3|3|935901-936000|CRISPRCasFinder TGCAGCTTTCTTTGCGCGCTTAGGCGCA ACTGCATCAGTAGGTGCTGCTTGAAAGATCGGAACGGGGATTGC TGCAGCTTTCTTTGCGCGTTTAGGCGCA
>NZ_CP016772.1|WP_095676405.1|933529_933844_-|50S-ribosomal-protein-L21 MYAIVKAGGRQEKVTVGETITVDRIDAAVGASVSFPALLVVDGANVTTDLKVLSSIKVTGEVIDEVKGPKIDILRYKNKTGHRRRQGFRAQHTRVKITAISGAK >NZ_CP016772.1|WP_095676404.1|933255_933510_-|50S-ribosomal-protein-L27 MASKKGVSSTRNGRDSNPQYLGIKRFGGQEVNAGEILVRQRGTHFHPGKNVGRGKDDTLFALAAGVVEFGRARDRRVVNVVPAA >NZ_CP016772.1|WP_095692665.1|931645_933160_-|GTPase-ObgE MTTFIDSVTLFAAAGKGGDGCVSVKREKFKPLGGPDGGNGGRGGDIILVVDSSVTTLLDFHHSPHRKATSGHQGYGDRKDGVSGEDLILPVPNGTVIYDEDGEQIADLIGIGTTFLAARGGHGGLGNLALSSSKRRAPGFALLGEPGEERRLTLQLKSVADIALVGFPSAGKSSLIAAISAARPKIADYPFTTLVPNLGVVQAGDTRFTVADVPGLIPGASQGKGLGLQFLRHVERCVALVHVLDCGTLETDRNPIDDLEAIENELALYGGLEDRVRIVALNKVDLPDGKAMADMVEQQLKEKGYEVYKVSAASREGLQELLYSMARLVQRERAEAAKEERTRIILRPVAVDDSGFTVQKNGDGSFSVRGQKVVRWVRQTNFKNAEAIGYLADRLAQLGVEKELFKKGAVAGSEVRIGSGDNEVVFEWEPTIEAGAEQLAGFLHRRGEDSRLEGAWNTVETERDRLSDDEVARQWEYNVAEPTNPEMKLTLSEIQESDTESNDK >NZ_CP016772.1|WP_095692664.1|930533_931649_-|glutamate-5-kinase MNRGAITSAKRVVIKIGSSSLTGSAGSELDPHAVQKVVDLAYSLKKRGAEVVVVSSGAIAAGLSPLGLKVRPKDLATQQAAASVGQGLLIAQYSEKFKAHGVISSQVLLTTEDVVRRSHYANAQQTLTKLLSLGVVPVINENDTVGTQEIRFGDNDRLAALVALLIQADLLVLVSDIDALYDAPPTQAGAKAIRYVANISDIESITLGGAGSSGVGSGGMVTKVEAARIATSAGIPMLLTSLQDSGHAVAGEEFGTFFEAHTSKANSRLLWLAHASTPRGRLILDDGAVTAILERGVSLLPAGVTAVEGDFISGDTVELASGSGKVIARGLVAFDSEEIPQMLGRSTKELAAALGAEYERELVHRDDLVLL >NZ_CP016772.1|WP_095692663.1|929268_930537_-|glutamate-5-semialdehyde-dehydrogenase MNAEAVVAELAQKARKASRSLSTATGAERKAALEAIAKAIESRSAEILAANVLDMASARAEDMHPQMQDRLLLTAERIAGIAGGARQVAALADPLGQTLRKSTLANGLELEQISVPFGVIGMVYEARPNVTVDAAVILLMSGNAALLRGSSSAHHSNEILVNVMKDALATTKISPDVIQLIPSEDRATTKALLTARGKVDLVIPRGSAALIRMVVDEATVPTIETGAGVCHVYVDEFADIEKALPILINSKTHRPSVCNAAETLLVHKAIAPTFLPMALKALSDAGVILHSDATAQKVADTFKIASTLATDANWSTEYGVLEMNVAVVDSVDAAADHIAQYGTNHTEAIVTENKANAARFIALSDCAAVMVNTSTRFTDGEQMGFGAEIGISNQKLHARGPMGLEAMTTTTWIVTGTGQIRS >NZ_CP016772.1|WP_095692662.1|928943_929243_-|Asp-tRNA(Asn)/Glu-tRNA(Gln)-amidotransferase-subunit-GatC MSSLSRDDVAKLAGLARIEMTEAELVELSSQFGLILDAVARVQEMDLSGVKATSHPQPLENIARPDVVHPSLSPHDALSGAPAQEESRFRVPQILGEAE >NZ_CP016772.1|WP_095692661.1|927450_928947_-|Asp-tRNA(Asn)/Glu-tRNA(Gln)-amidotransferase-subunit-GatA MIRNTAAQMADALAKGETTSVELTQAHLDRIAEVDGQVKAFLHVDSQGALAQAKDVDARRAKGEKLSPIAGIPLALKDVLAQKGVPTTAGSKILQGWLPPYDSTVVSKLKDAGVVIMGKTNMDEFAMGSSTENSGYGPTFNPWDLTRTPGGSSGGSAAAVSAFEAPLAIGSDTGGSIRQPAALTGIVGVRPTYGAVSRFGLIAYSSSLDQAGPFGRTVLDTALLHEVMAGHDVKDATSINAPVPAVVAAAKSGDVKGMKIGVIKQLQGEGYQKGVQTRFDESLQVLASLGAEIVEVDCPSFEYALAAYYLIAPSECSSNLARFDAMRYGLRTGDVDGASAEAVMSATRDAGFGREVKRRIILGTYALSSGYYDAYYGSAQKIRTLIIQDYAKAFTKADVLVSPTAPTTAYKIGEKVDDPMAMYLGDVATIPVNLAGICGMSLPAGLADEDNLPVGFQIMAPAMQDQRLYQTGAALEAALLSKWGAPILSKAPELKGAK >NZ_CP016772.1|WP_095692660.1|925945_927451_-|Asp-tRNA(Asn)/Glu-tRNA(Gln)-amidotransferase-subunit-GatB MALPTYDEVIAKWDPVLGLEVHVELNTASKMFCSCATEFGAAPNTQTCPVCLALPGALPVVNEKAIESTILIGLALNCKIAPYSRFARKNYFYPDMPKNFQISQYDEPICFDGYVDVEIDTEEGPKQFRIEVERVHMEEDTGKSLHVGGATGRIHGADYSLLDYNRAGIPLVEIVTKIVPGTGKYAPEVAKAYVAELRDILRGLKVSDVKMEQGSLRCDANVSLKPIGSDVLGTRSETKNVNSLRSVERAIRGEMIRHAELLNDGKKVKQETRHFQEDTGLTRSGRSKEQAEDYRYFPDPDLVPVTPAAAWIEELRATLPERPSLRRKRIKEEWNVPDKEMQAMINADVLDIVEATVLLGADPTKARTWWLGEISRIANDQNIAVADLAITPADVAEIVALVAKGELTDKLARQVVEGVIAGEGKPAEVVEKRGIKVVSDDGALMAAIEKVCAEQADTAEKVRGGHLPAAGALIGAVMKETKGQADAAKVRELLLKHLGQG >NZ_CP016772.1|WP_095692659.1|924262_925936_-|dihydroxy-acid-dehydratase MSKMKPRSGLVTDGLERAPARGMLRAVGMGDEDWVKPQIGIASSWNEITPCNLSLDRLAKASKKGVIDAGGFPMQFGTISVSDGISMGHEGMHFSLVSREVIADSVETVMQAERLDGMVTFAGCDKSLPGMMMAAARIDVASVFVYAGSTLPGQVDGKDVTIIDAFEAVGACARGLITKDRVDQIERAICPGEGACGGMYTANTMASIAEAIGLSLPGSAAPPAVDRRRDAYAEQAGAAVVNLIAKGITTRDILTKKAFENAITILMALGGSTNAVLHLLAIAHEADVDLTLEDFHRIGSKVPLLGDLKPFGKYVMTDVDRVGGIPVVLRILLDAGFLHGDTLTVTGKTMAENLADINPPLADGNVLFPADKPMSTDIGITILGGTLATEGAVCKTAGIGIESFEGPARVFEREQAAMDALENGTIQVGDVVVIRYEGPKGGPGMREMLMITGAIKGAGLGKTTLLLTDGRFSGGSTGLCVGHVAPEAVDGGPIAFIKDGDRVRIDIPNRTLDLLVDPAELAARKVGWKPLPHKYTRGVLHKYSKLVGSASKGAVCD >NZ_CP016772.1|WP_095692658.1|922287_924051_-|acetolactate-synthase-large-subunit MAKHVPTANGTEMTGATALVKSLEAAGVDVMFGLPGGAILPAYDPIYDSTIRHILVRHEQGAGHAATGYAQVTGRAGVCIATSGPGATNLVTPLMDAAMDSVPLLAITGQVPSAAIGTDAFQEADIRGITMPFTKHNYLITNPDEIPGVIAEAFHIATTGRPGPVLVDIAKDALQKMTKYNWPTSIKLAGYNPKTTPDAQAITDAAALIAQSSKPVFYVGGGVIKANAHAELRQLVELLGGPVVTTLMARGAFPDSHPLHMGMPGMHGTVAAVTALQKADLLITLGARFDDRVTGKLSTFAPNAKILHADIDPAEIGKNRHADVAVVGDVRETIAALIPALKAALAKNKPDLTAWLRQMNSLKSTYPLGFDTPDDGSLSPQLVIQRLGQISGTDTIFTAGVGQHQMWASQFISYEHPRTWLNSGGAGTMGYGVPAAMGAKVGAPDTTVWAIDGDGCFQMTNQELVTCALNNIPIKVAIINNESLGMVRQWQTLFYDSRYSNTSLESKRVPNFPMLAESMGCVGLSCERPEDLDKTIEKAMSINDQPVVVDFRVHRDAMVWPMVAAGTSNDEIMIARATAPDWDSQEL >NZ_CP016772.1|WP_095692667.1|936302_937463_-|rod-shape-determining-protein-RodA MSTFLNRSPYRRARRSSVFSGFDPVLTGAVAALLVIGTLLVYAATRDWYASNGLDPQYYLKRHVINIVIGLALAWGTTIIDYRLLRAYTPYIWGLGVFGLLFVLIPGVGSEVNGAKAWIRLPAGLQIQPAEIAKISIIIGIAMLLSERTHNNDAPSHQDVLKALGVAAIPILLILAQPDMGTVLIISASVVTMLAVSGAPTRWVVGLILLALIGGFVAVKAGVISDYQVKRLQSFVDPNADSQGAGYQLRQARITVGSGGLIGTGLLNGPQTNGRFVPEQQTDFIFTVAGEELGFLGSGLIIFLLFLILMRAFAIARRSTDPYGMLVCTGVIAWFAFQIFENIGMTLGLMPMTGVPLPFMSYGGSSMFANLIGFGLLQNVHASHRS >NZ_CP016772.1|WP_095692668.1|937459_939616_-|penicillin-binding-protein-2 MNQRSRLSLLVFQIFIASLMLALFGRLFYLQVAAGPIYRDAALSIQSRDVVTPANRGFIVDSSGVPMALNRVGLAVTVDRTKIDKLPDKGVAVVKDLVTLLGLNFDDVWQRTRLCGELPKGKKAGCWTGSRFQPIPITNTADPQIALRIVERSDRYPGISATPLAIRSYPTTLGLNGGHVLGYVGPLTESDLSGANGRSYFRSESIGKAGLEIVYDEYLRGTPGIKTFIVDRKEAVTTTSKNTKPVAGNHLVTSLDIRLQAASEAALAAAVKRARGSGFRADGGAAVVLDVRNGQVLSLASYPTFDPNAFETGLTVQEAEDLYSEKMGVPALNRALQGLYALGSTFKAVSVIAAKDAGYSLSASYACPSEVQVGTRAFQNFESKAQGTLSMKKAIAVSCDTIWYRIAYDEWLRDGGLRPKSNPNDYFFKAAEKFQMGKKTGVDLPSESSGRLANREWRKAWYSQNKDFYCNYKERSTKSQQTAFLLQLARENCLDGDKIRAGDAVNFSIGQGDTVVTPLKLAQMYAAIANGGTIWKPTVAKAIVKTDGTVLRTFQPEKLGELGEDQATIDFLHDALREVAISGTGAGAFAGFPVATSGKTGTAQVFGRNPNGSAKSDTSWYASFAPAKNPRYAVVMMVSQGGYGAGTSGVGVRQIYEAIFGAQGSTVKPELALFPNGKPPTTLPRISPATKPKPSILNPGKPKVLASPTPTAKAKVKR >NZ_CP016772.1|WP_095676409.1|939612_940134_-|rod-shape-determining-protein-MreD MSLRRFFYSFPIFFTVFLLQEAVVTQMRLPAGGINLLLVVALIWAALSTPEIGALTGFGAGLMMDVSQTSPGPMGHWTLVLIIACYAIAFLGYGDDNIRGNPINIIFLVTIGVIAAQTVFLLLGMMLGQQVGSVSNIAFLLAGSAFWTAIISPLILKVISFFHANIFGMRSQL >NZ_CP016772.1|WP_095676410.1|940136_941081_-|rod-shape-determining-protein-MreC MRYGGDNRGRLLIIVLLVTSLFLITLDLRGVQVIDGLRTGTQTALTPVQKAGSWLVSPFRNFLSDVTHLGRTRNKMEKLTAENEKLRLTLQNRKTADAQLKQLKGVLNLAGTAGYEVVNAKVISQGSTTSFTQTITIDAGTSSGVRANMTVLSGYGLVGVVKYAYRDSALVQLASDPAFKIGARIAGTQQIGILSGQGTRKGVLQLLDNTTQVRKGDALLARGSQNGRPFVPGVPIGEVTSVDNSPGAVTQTADVKFYTNFSTLGVVAVVVSGSSADPRDSLVPPKPRPTPLPTVTIYATPGAVEPTPTPTATK >NZ_CP016772.1|WP_190277152.1|941091_942111_-|rod-shape-determining-protein MSFIGRDMAVDLGTANTLVYVRGRGIVLNEPSVVAINQDTGGILAVGLEAKKMIGRTPGNIVAIRPLKDGVIADFDTTERMLRYFIQKVHRRSYLAKPRIVVCVPSGITGVEQRAVKDAGYAAGARKVYIIEEPMAAAIGAGLPIHEPTGNMVVDIGGGTTEVAVISLGGIVTALSIRIGGDELDQSIISWTKREYSLLLGERTAEEIKMAIGSAYPLQGENDAEIRGRDLATGLPKTIVVTAAEIRKALEEPVNQIINAVKATLDKCPPELASDLMDRGIVLTGGGALLKGLDERLRKETGMPIHIADRPLDAVVEGSGKCIEEFEALEKVLISEPRR >NZ_CP016772.1|WP_095676412.1|942123_942546_-|nucleoside-diphosphate-kinase MSIEKTLVLVKPDGVARGLVGEVIARIEAKGYSIVSLRMLQADRALLEKHYAEHQGKPFFEPLVEFMMSGPIVALVAEGNRVIEGFRSLAGVTDPTVAAPGTIRGDLARDQGTKVVQNIVHGSDSPESAAREIAIFFEGK >NZ_CP016772.1|WP_095692669.1|942558_942894_-|DUF4233-domain-containing-protein MRVLGSAVLVMEFFVMGFAMLLAKDNQEPSSIIAGAVIAILMLLTPGLLKKRTGWILGSILQFLMIGYAVVVPSMAIVGLIFAGLWIAAIVVGRRGEAIRAKLMASRTPNP >NZ_CP016772.1|WP_095692670.1|942893_944252_-|bifunctional-folylpolyglutamate-synthase/dihydrofolate-synthase MTNTSPEDQSRIDVIEQALLARWPETRIEPTLERIAALVDMLGSPQLSYPTIHIGGTNGKTTTSRMIDSLLFEMGLRTGRFTSPHLESYLERIAINGEPIAAKDLIFSFNDISAYLDLMDEKFEHPISFFEAITALAFAAFAEHPIDVGVIEVGMGGQWDATNVVKADVSVIMPIGLDHTEYLGETLTEIAQTKAGIIKEGGYVVLAQQEPECAVELLKQAALVGADVAREGVEYSVLTRSIAVGGQLLAIQGTKEIYTDIFIPLHGKHQASNAAAALVAVEVFFGDQDLDIEAVRAGFANVKSPGRCEVLHRDPTIIVDAAHNPHGASAIADTIQSEFTFDEVIGIFAPMGDKDVRGILLELEQVMDSVIVTANSSSRSMKVSELEKMAAEIFGSDRVFAVPTVTEAIDKAVKDCIRPLSVDTIGILITGSVVTVGEARAIVRKKFAKEEK >NZ_CP016772.1|WP_095692671.1|944252_946832_-|valine--tRNA-ligase MSSEKRELASSFLPGDIEGPLYTKWIEAGYFTADANSSKEPFTIVIPPPNVTGNLHIGHALDQTLQDCLTRMKRMQGFEALWLPGMDHAGIATQNVVEKQLATQGLSRHDLGREDFVKKVWEWKSESGGQILGQMRRLGDSVDWSREAFTMDENLSQAVLTIFKKLFDQGLIYRAERIINWCPRCLTALSDIEVEHQDDEGEFVQVRYGEGEQSIVVATTRAETMMGDGAVAVHPDDPRYKHMVGTEVLLPLVNRMIPIIADELVDPDFGTGAVKVTAAHDPNDFEMAMRHNVPFVVIMNEHGIMDGTGTEFDGMDRFDARVAVVAKLKEMGRIVAEKRPYIHAVGHCSRCDITVEPRLSKQWFVKVAPLAKAAGDAVRDGRVKIEPAELAPRYFEWVDNMHDWCISRQLWWGHRIPVWYGPNDEVIVVGPGESAPAGYTQDPDVLDTWFSSALWPFSTLGWPNNTADVKKFYPTSVLVTGYDILFFWVARMMLFGLFAMDGVPPFHTIVLHGLVRDQFGKKMSKSRGNVVDPLEFIDKYGADALRFTLARGSNPGKDQALAEDWIAGSRNFATKLWNATRFAMMNGATVEGPLPATETLSDIDKWVLSRLSETTTEFTALMESYEFARACDAIYHFAWDDLCDWYLELSKEAFASGNAGASQRVLGHVLDTLFRLLHPVMPFITETLWTTLTGGETLVTAKWPVADSSHINKKSEALVGELQKIITEVRRFRNDQGVKPSQKIPGRFIAPADVTAYASAMAFLLRLELTEFTPSASVEIGSMKVELDLSGTVDVVAERARLEKDLVTAQKDMKTADVKLNNEGFMAKAPESVVAEIRERMAATSADIERITAQLAALK >NZ_CP016772.1|WP_095692672.1|946854_948153_-|ATP-dependent-Clp-protease-ATP-binding-subunit-ClpX MSTRIGEANDLLKCSFCGKTQKQVKKLIAGPGVYICDECIELCNEIIVEELSEASSLGLSELPKPQAIFEFLDQYVIGQDRAKKSLSVAVYNHYKRVQSGDSRNEDGIELAKSNILLLGPTGCGKTLMAQTLARMLNVPFAIADATALTEAGYVGEDVENILLKLLQAADYDVKKAETGIIYIDEIDKVARKSENPSITRDVSGEGVQQALLKILEGTVASVPPQGGRKHPHQEFIQIDTTNVLFIVGGAFSGLEKIIEARSGSTGVGFGAELQSAEEKNRRDIFADVMPEDLLKFGMIPEFIGRLPVLTSVENLDKPALMQILTEPKNALVKQYQKLFDLDDVELEFAPDALDAIAELALNRGTGARGLRAIMESALLGVMYDVPSRADIAKVIIEKACIDSNAAPTLLPRTGDIPKRASRREKPNEEKSA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP016772_4 | 1013663-1013759 | Unclear |
NA
Consensus repeat of NZ_CP016772_4
|
1 spacers
spacers of NZ_CP016772_4
>4.1|1013688|47|NZ_CP016772|CRISPRCasFinder GAGTCCGAGCGCGTTGCACTGATGGCAGAACTTGAAGGCGAACGTGC |
cas3 |
CRISPR arrays and Neighbor proteins around NZ_CP016772_4
The CRISPR arrays of NZ_CP016772_4 >merge|NZ_CP016772|4|1013663-1013759|CRISPRCasFinder TGCAGATGTTCTTGAAGAGATGGATGAGTCCGAGCGCGTTGCACTGATGGCAGAACTTGAAGGCGAACGTGCTGCAGATATTCTTGAAGAGATGGAT >NZ_CP016772|4|4|1013663-1013759|CRISPRCasFinder TGCAGATGTTCTTGAAGAGATGGAT GAGTCCGAGCGCGTTGCACTGATGGCAGAACTTGAAGGCGAACGTGC TGCAGATATTCTTGAAGAGATGGAT
>NZ_CP016772.1|WP_095692710.1|1012092_1013004_-|DMT-family-transporter MSEVQKATVHNHTELPARPDLIRLIIGIFGIGSSGPLIALSAMPVPTLIFWRNLGGSLMTLPFALRHKLDRTGVKWAVLAGIVLAVHFVGFFLSMRMTSVTAGTAIVATQPIFAAFFVKLTGGHIPTKAWLGMLISFTGVVVVTGIDLQLDRRSFLGDLAALISGALAAAYMLIGSRAQQTLATTSYTTICYFVCAMTALPMALLSGYDIVGFALREWWILLGLIIGAQILGHTMFNMTLKRVSPAVVSMIVFFEVPVAAIVSLVFDIGKQPTLSIIPGVILILLGCILVVLRTRPESAMTEQ >NZ_CP016772.1|WP_095692709.1|1011250_1012096_-|PHP-domain-containing-protein MIDLHTHTTCSDGTDAPFALVKKALAAGITTLAITDHDSTAGWEEAVSAIQPQIELVLGAEISCLTTDGISVHMLGLLFDGKNSEMQQMLSDSRDTRVPRMRKMVELMSTDGINISLDDVYRATPEGATVGRPHLADALVANGVVATRDEAFLDLLNNESKYYVTHAAPTPVDAIRVIRKAGGVAVIAHPFASRRGQIITASTFTDLVAAGLNGIEVHHRDQSADEQSTLTAIAQELNLVITGSSDYHGTGKLNGLAENTTHQAQWEQLESLADARRVVKK >NZ_CP016772.1|WP_095676471.1|1010642_1011254_-|MarC-family-protein MNSLGAVTFATQAFVTLFVIMDPPGATPIFLGLVGDKSPRERVRLAWQAAGVSLFVIASFALFGRFILDYMNVSIEALQAAGGLLLLYVALQLLTGNKNTGTENASDNIGMVPLGTPLLAGPGAIVATMIYVQKADTNAQILGLVIAILAVHLIIGTVLMASTKIVGLIKDSGVTLLASIAGLLLAAIAVQMLANAIKAFAAS >NZ_CP016772.1|WP_095692708.1|1009280_1010642_+|DEAD/DEAH-box-helicase MSLTFADLPLRKETIDALHEHGFTSPFPIQEMVMPIALADGDVIGQAKTGTGKTLAFGIPVIERVIAPNDADWAQLPNQGKPQVLIVVPTRELCVQVTKDVEELSFNRGIRTLAVYGGRAFEPQIEALNNGVEIVVGTPGRLLDLYRQGQLTLKFVSRVVLDEADEMLDLGFLPDVEKIFTSTPARQQTMLFSATMPGDIIALARRFMNQPVHIRTQDNEDEGAVVSRIEQHVIRAHAMDKIEMLARILQADGRGPTIVFCRTKRTAQKTSDDLFERGFRAATIHGDLGQSAREKALNDFKAGKSDVLIATDVAARGIDIDGITHVINYQCPEDEKTYVHRIGRTARAGAAGIAVTFVDWDDLARWKMIDTALVLGLPEPVETYSSSEHLFEMLNIPAGSSGRMTKKSAAAVDKPKTDRPKSDRPRSEKAVEPKKPAADRIKRERTRTKRISE >NZ_CP016772.1|WP_095676469.1|1009028_1009280_+|DUF3107-domain-containing-protein MSSKKSEKAAKVRISIINVGSELSFDCPSTPAEIKSAVTAALTAQTPLSLQDVQGHEIIVPADKIGYVEIGEPAERRVGFGVV >NZ_CP016772.1|WP_095692987.1|1008376_1009012_+|TetR/AcrR-family-transcriptional-regulator MSTESATANNSRSDKSRLPRDERRAILLSAALEVFTAAGYHSAAMDEIADRANVSKPVLYQHFPSKLDLYLAVLDLHIDSLVFEIQKAISSTPDNEQRVHVTIEAYFNFIENEGEAFRLLFESDMSVEPQVRERLNRMTYDCAAAVSGVISNDTGLPKEAAMMLGVGLIGYVQVTARHWLERDSKLTRQQAMDLVENLMWRGISGFPRTDS >NZ_CP016772.1|WP_095676468.1|1007139_1008321_-|adenylyltransferase/sulfurtransferase-MoeZ MKTPPLVTPGPALTVDEVRRYSRHLIIPDVAMAGQQRLMNAKVLCVGAGGLGSPALMYLAAAGVGTLGIVEFDTVDESNLQRQIIHGQSDIGKSKALSAKEKIAEINPYVNVILHETRLDNSNVMEIFSQYDIIVDGTDNFATRYLVNDACVLLKKPYVWGSIYRFDGQASVFWAEYGPCYRCLYPEPPPPGMVPSCAEGGVLGVLCATIGSIQTTEAIKVLTGVGEPLIGSLMVYDALDMTFRKIKVRKDPNCPLCSENATQTALLPDYEAFCGTLSEAAQEASSGSTITVQDLKAKIDNKDNFYLIDVREPSEYEIVNIPTAHLIPKQGFIDGSVLASLPQDKPIVLHCKSGVRSAECLAILKNAGFADASHVFGGVIAWAKQIDTTLPVY >NZ_CP016772.1|WP_095692707.1|1006270_1007140_-|N-acetyl-1-D-myo-inositol-2-amino-2-deoxy-alpha--D-glucopyranoside-deacetylase MLSSYKGYRMLLVHAHPDDETINNGSTMAMYAALGADVTLVTCTRGEEGEVLVKDLAHLAAHETDSLGEHRVGELADAMKALGISDHRFLGEGEKKYRDSGMMGTEPNNRPDVFWQADLEEASSELVKIIDEVKPHVLITYDEIGGYGHPDHIQAHRVAMRASEKSSWNIEKIYWNVMPRSVIQEGIDAMKKLGSDFMGAEKAEDLPFAKDDSFVHAMVDGNAYVEKKMDAMRAHSTQIEVDGPFFALSNNLGLQVWGNEYYTLVKGEKSEPLDSRGHEMDLFAGINPS >NZ_CP016772.1|WP_095676466.1|1005959_1006271_-|hypothetical-protein MQFLSSLLFGAMIAISATLVHQTLPPVGVSVGIFATYLGIWYVGRHYGKRRYKLIALSAWLAVISIAGSFGVGEELLIQGDNQGSALLTIGFVAGVVAVLRNP >NZ_CP016772.1|WP_095676465.1|1004374_1005958_+|cysteine--tRNA-ligase MASMSLRTQIAQALGKRATIRLRDSDGGLRDIVGVLQSETELINRRGEVVNFNPDEAVAFRVIPVFNRRDVSTGSLSIYDTKSKSLHTIAGTDGVVRIYCCGPTVYRDAHVGNLRTFLLSDLISRTLQMTGLDVSLVQNITDVGHMADDFEEDKMLAESAKTKVNPFEIARTFEDRFHIDLERLNIQPAASYPRASEKMAEMITAIEKLIAMKRAYVGSDGSVYFDATSFPTYGALSGNKLDSLQPGHRYEFTDEGGKRFHADWALWKLAGARTQMIWDSPWGAGFPGWHIECSAMSIELLDAHVDLHLGGIDLRFPHHENERAQSNSLTGNETVDTWVHGEHLLFEGRKMSKSAGNVVLLQDVIDRGLDPLSLRFALLENRYRSQMDLSWASLEAAHSTLKRWRQLLSNAGTSAEMKFDQEVSDALTTDLDTPRAMQRIRTIEKDSTIGALDKRALFLFADQVFGLDLDRGVEQREVSSEIQALLDARITARAEKNWSLSDSLREQLTNAGLEINDGAEGQSWSWK >NZ_CP016772.1|WP_095676475.1|1014349_1014844_+|DUF1003-domain-containing-protein MARNFGLDTPRETRRSLRGNIDPETFGRLSERFARFLGTARFLVYMTAFVLTWVLWNTLAPRDIRFDNYPFIFLTLILSLQASYAAPLILLAQNRQADRDRIALNEDRAQNARSIADTEYLTRELASLRIALGDVATRDYLRNELGDMAKEIVVELRKPESDAK >NZ_CP016772.1|WP_095676476.1|1014814_1015933_-|Mrp/NBP35-family-ATP-binding-protein MTTLESVHAALATVQDPELHRALPELGMVKSVEIKGSIAHLEILLTISGCPMRDRLQKDIESAVTAVEGISAIELTFGVMDEEQRANIKKLLRNGRESFISFAQKDSLTRVIGVASGKGGVGKSSLTANLAVSSAQKGLRVGILDADVYGHSIPRLMGLIGQRPTAIDQMFIPLESFGVKTVSMEMFKPERSDAIAYRGPLLHRVLEQLLSDAYWGDLDLLYIDLPPGTGDLAISLGQLVPSSEIVVVTTPQVAAAEVAERAGRIAHQIHQRVIGVIENMSAYPCAKCGELTSLFGEGGGEETSRRLSQLVGSDVPLLGKIPFSPDLREGGDAGAPVVISAPDSPSAKAIEAIVSQLIVREKSLLGVRLGLA >NZ_CP016772.1|WP_095692712.1|1015929_1016238_-|Sec-independent-protein-secretion-pathway-component MFFDFGAGELVGLAILAMILIGPERLPNLAVDAAKFVKRIREMASKATEELKENLGPGFEDLKPTDLNPKSFIKKQLSSVLDDDVSTPATSKRTSTIDPDLL >NZ_CP016772.1|WP_095692713.1|1016257_1017376_-|trypsin-like-peptidase-domain-containing-protein MSINNGGPWWDAPSKSGLGKNITLRSAIVLALVVGVIAGAFGASSSGSLFGRSVNLVKSTSAIERPAGSVAEIAQRVLPSVVSIEAKSSNGGSTGSGFVIDSSGYILTNNHVIAASVTSGGDITVRLNDGSSFDAKVVGRDSSYDLAVLKIVGASLKALQFGDSDKVAVGDSVIAIGSPLGLTGTVTLGIISAKDRAVTAGESSSENSFINALQTDAAINPGNSGGPLVDATGSVIGVNSAIASLGSSFSSQTGSIGLGFAIPINQARKTADQLIRNGKATYPVMGISVDMNFSGDGAMIAKNAQAILPGGPAAKAGLKSGDIITAIDGRPITSPEELIVTIRSLNIGDSVVVTYKRGSESKSATLTLTASK >NZ_CP016772.1|WP_095692714.1|1017388_1018015_-|class-I-SAM-dependent-methyltransferase MNNNPHSYAESFIAEDAVKIAARARGLELGTLDASQGTGAYLRHLAHLLDAQSVVEIGTGSGVGSLWILEGMIASGTLTSIDDEMEHTSIAKLAMADADIAQSRFRFITNSVMDVMTKLTDRAYDLVVYRHNPEDLSFAISEAHRILRSGGVFVIDNFFGGSKVHDPAQRDPKTIALREAGKLIKGDTDSWVSSLIPTGDGLLLATKL >NZ_CP016772.1|WP_095692715.1|1018011_1019493_-|leucyl-aminopeptidase-family-protein MLHTVAPDLEALISADVLALGFTKKNDENIELVGSARLISSLEKYFGINLIDEITFFAPSGKAGELFEIPVLHKDSTVDRLYLVAVGDGSLTSLRAAGASLGRKVRGKAIELISLVCQSRAEIRAHGVSILLGAYSWNLKTGKPAEIATIAIATKDGASVSEAGVIARALYTARDLIHTPSNIKNPLWMAQEAKKIAEEKGLSISVLAGKDLSQFGGLRAVGNSSPKPGPRFIEITYIPKGKARSAAALPYVVIVGKGITFDTGGISLKRPYEFMTAMKSDMAGAAAALATISALPDLQPQVKVTVLMMCAENSLSGTSQRPSDVITQYGGTTVEIINTDAEGRLVLADGLAYAVENLDPDYLFDIATLTGSATLGLGRQYAALYTRDEKLAKELVSIGESSGERVWHMPLIDDYQDSLESDVADLNHAADKGDYSAGSVTAALYLEHFVGDSRWVHLDIAGTGRSETDSGENAKGGTGFGVRFFIDWILSLS >NZ_CP016772.1|WP_020045748.1|1019508_1019688_-|DUF3117-domain-containing-protein MAAMKPRTGDGPMEVTKEARSLVMRIPLEGGGRLVVELNPQEANNLSAALEAAVALIKK >NZ_CP016772.1|WP_095676481.1|1019800_1020001_-|sigma-70-family-RNA-polymerase-sigma-factor MSSSSNPQTLAELLASLPEEERIILTLHYLRSKSSGEIATLLSVPERAVIVVIESGKTRLKAILGL >NZ_CP016772.1|WP_095692716.1|1020053_1020503_-|SRPBCC-family-protein MSSNTLSISLTIDAPREVVWKKIADWKSQGEWMLQTKVWVTSNQVEGVGTSIAAFTGPLHKFYPRLKSLGLLDLMVVTQWQPPHRCDVDHVGKVLKGSGSFQLSEINGSSTRFDWSETIVAPKVIFLLAAPFLYVGVRISLARFARSFT >NZ_CP016772.1|WP_095692717.1|1020505_1021045_-|TIGR00730-family-Rossman-fold-protein MRIAVFCSSSPTIDSKFIDLAFELGAGIAQSGAELVSGGGHISAMGAISRGARSQGGRTIGIIPQKLVDIEFADHDSDELIVVDSMRTRKAKIEDLSDAFITLPGGLGTLEELFEIWVGRYLKFHDKPVIILDPHGVFQPLHALVEHLENENFVKPGMRDLLHWTTTVEEAVAIAHGKK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
621036 : 628962
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP016772|621036:628962|DBSCAN-SWA TATGAGCACTCTTGAAATTCGCGGATTAAAAGTATCTGTCGAGACCGAGCAAGGATCGATCGAAATCCTTAAAGGCGTGGACCTCACCATCAAGTCAGGTGAGACACATGCAATCATGGGACCTAACGGTTCAGGTAAGTCAACGCTTGCATACTCAATCGCCGGTCACCCTAAATACACCATCACAGAGGGCACAGTTACTCTCGATGGTGCAGATGTACTTGAGATGACAGTTGATGAGCGCGCAAAGGCAGGACTCTTCCTTGCAATGCAGTATCCAGTTGAAGTTCCAGGCGTTTCAGTCTCAAACTTCCTTCGTACAGCAGCAACAGCTCTTCGCGGAGAAGCTCCAAAGCTTCGCGAGTGGGTAGGGGAAGTAAAGAGTGCAATGGAATCACTCAAGATGGATGCATCATTTGCTCAACGCAATGTGAATGAAGGATTCTCAGGCGGAGAGAAGAAGCGTCATGAAATCATGCAGCTGGAACTCCTTAAGCCTAAGTTTGCAATCCTTGATGAGACAGATTCAGGTCTTGATGTTGATGCGCTTCGTATCGTCTCTGAGGGTGTTGTTCGCGCGAAGGCTGCAAATAACCACGGAATCCTTTTGATTACTCACTACACACGCATCTTGCGCTATATCAAGCCTGACTTCGTTCACGTATTTGCCAACGGTCGTATCGTCGAAGAAGGCGGACCAGAGCTTGCAGATAAGCTCGAAGCAAATGGTTATGCGGAGTACGTCACCGCTTAATTATGACTTTTGATGCTCACGCTATCGCTAAGGATTTTCCGATTCTTGATCGGACAATCCGCGATGGCAAGCGCCTTGTCTATTTAGATTCTGGCGCAACGAGCCAAAAGCCCAATGTTGTCATTAATGCAGAAAGTGATTTCTACCGTTTTCATAATGCGGCGGTGCATCGCGGTGCGCATCAATTGGCTGAGGAAGCAACCGATGCCTACGAAGGTGCTCGCGAAATTGTTGCTCAATTCCTCAACGCATCGGTCGATGAAATCGTCTTCACTAAGAACGCCACGGAATCTCTCAATCTCATCTGCTATGCGATGGGCAACGCTGCTCCTGGCAATCGTTTTCACCTTAAAGCAGGGGATTCAATAGTCGTTACCGAGATGGAGCACCATGCCAATCTCATCCCTTGGCAGCAGTTAGCTGCCCGCACGGGAGCAATTCTGAAATGGTTCAGCGTCACAGATGATGGACGTCTAGATCTCTCTCAGATCAATTCAGTCATTGATGAGAAGACAAAAGTTGTGGCTCTCACTCACCAATCAAATGTACTTGGCACCATCAACCCGCTTGAAGCGATTACAAAGCGTGCTCACGAAGTGGGCGCAGTTGTAGTTCTTGATGCATGTCAATCTGTGCCTCACATGAAGACTGATGTGAAGAAACTCGATATTGATTTCTTAGCATTCTCAGGACACAAGGCAGTAGGTCCTACAGGTATTGGAGTTTTCTGGGGACGCGCAGAACTTCTTGCTGAACTTCCACCGTTTCTTACCGGTGGCTCCATGATTGAAAATGTCACTATGGAATCTGCAACATGGGCGCAAGCACCAAAGAAGTTTGAAGCCGGTGTTCCCAATATGGCACAGGCTGTTGGATTAGGCGCAGCGCTGACCTATCTGACTGGAATTGGTATGGATAACATCCATAAGCATGAGATATCACTTACCAAATATTTGCTTGAGGGTCTGTCTGCTATTACTGATCTGCGCATTATTGGTCCAAAGACAACAGAGCTTCGTGGTGGCGTCGTTTCATTTACTCTTGGAGATATTCATCCTCACGACTTAGGTCAGTACTTAGATAGCCAAGGAATTGCAGTTCGCACAGGTCATCACTGCGCATGGCCACTGACTCGCAAGCTTGGAGTTCCCGCAACTACACGTGCCTCTGTCTATCTCTATAACACCACAGATGACCTTGATGCACTCATCGTTGGCGTGCAGGGCGCTCAGAAATATTTTGGTCGCTAATGCAGCTCGATAATCTCTATCAAGAAGTGATTCTGGATCACTATAAGAATCCGCAGAACAAGAAGTTGAATACAACTTTTGATGCGCAGGTTCACCACATCAATCCCAGCTGTGGCGATGAAATCACTCTCAACGTCACACTTGAAGGAAATATGGTCAAGAGCGTTTCTTGGGATGGCGTGGGGTGCTCGATTTCTCAGGCAAGTGTCTCGATTCTTACAGACCTTCTTATCGGCAAGAGCCTTGAGCAAGCTCAGGTGATAGCTGATTCCTTTATGCACTTGATGCAGAGCAAGGGAACCGAGAAGGGTGATGAGTCACTTCTTGAGGATGCAGTCGCCTTTGCGGGAGTTTCTCAGTATCCGGCTCGCATTAAATGTGCGTTGCTGGGTTGGATGGCATTTAAAGATGCAAGCGTTCAAGCGTTGGCAAAGCAAGCCTAAAAAGAAAGTAGGATAGAGACATGGTTGCAAAAATCGAAGATATTAATGAGGCGATGAAGGATGTTGTCGACCCAGAACTTGGTATCAACGTTGTTGACCTAGGACTTATCTACGACATGATGGTCGATGACAACAATATTGCTGTCCTCAATATGACTCTCACATCTGCTGCATGCCCACTACAAGATGTGATTGAAGATCAAACACGTCAAGCTCTTGCACCTTTTACAGACGATGTGAAAATCAATTGGGTGTGGATGCCACCTTGGGGTCCCGATAAAATTTCTGATGATGGTCGCGAACAACTTCGCGCCCTCGGCTTTACAGTCTGATTCCTAGATCTGATCTCTGATGTCTCAACATGCAGTTTGGATGACCTTTCGGTCAATGACTGCAGATCCATCTGTTAAGTCTCAGAAGCTAAAGCCCGGAACAATCAAGAGAATCATCACCTACGGCAAGCCGTATAAATCTCACATCATCATATTTCTCATCACCGTAGTAATCGAAGCGCTCCTCGTCGTTTCAACACCACTGTTGCTTCGCGAACTCATCGATAAGGGCGTCTTGCCTAAAGATACGGGCCTTGTTACAAAGCTTGCTTTCCTTGTGGGACTGCTTGCTGTGGTCGATGCTGCATTTAACATCTTCGGGCGTTGGTACTCAGCAAGAATCGGCGAAGGCTTGATCTATGACCTGCGCTCACAAGTATTTGCTCATATTCAACGCCAATCAATTGCATTCTTTACTCGCACGCAAACAGGTGCGCTGATCTCTCGCATTAACTCAGATGTGATGGGTGCTCAACAAGCCTTTACTGGGACTCTGTCAGGTGTGGTGAGCAATGTGGTCTCACTCGTACTCGTTGTGACAACGATGCTCATTCTCTCTTGGCAGATCACAGTCGTATCTCTTCTTCTTCTACCTGCCTTTCTTCTTCCAACTAAGTGGGTTGGAAAGCGCATTCAAGCCCTGACTCGGGATTCATTTAATCTCAACGCAACAATGTCGAGCACCATGACTGAGCGCTTCAATGTCTCTGGTGCACTACTTGTGAACCTGTATGGAAAGCCAGCAAAAGAGGAGAACTTCTTTCGCACACGTGCGCGCAAGGTTGCAGATATAGGCATTCAGACAGCAATGCTTAACAGAGTCTTCTTTGTCGGAATCACAAGTGTCGCAGCGGTTGCAACAGCCTTTGCTTATGGCATTGGGGGACACCTTGCAATCAACGGTTCCATTACGGTCGGAACTCTGCTCGCTATCACTGCACTGCTCATTCGCCTCTATGGGCCGCTGACTGCGCTCTCTAATGTTCGCATCGATGTAATGACTGCACTTGTCTCATTTGAACGCGTCTTTGAAGTACTAGACCTTCAGCCGATGATTGTTGATAAGGCTGATGCAAAGGTTTTGAGGAAGAAAGACTTGAAGATCGATTTCACCGATGTTGCCTTTAGCTATCCGCGTGCTGATGAGATTTCATTAGCATCGCTTGAATCGGCGGCAAAGCCGGAGACTGTTGAAAGCGGAGAGGTTTTGCGCGGAATTACTTTTAGCGCACCTGCAGGATCATTGACTGCAATTGTTGGCCCATCGGGCGCTGGTAAGACAACGATGAGCGCGCTACTTCCAAGGCTGTATGACGTTACTCGTGGTGCAATCACCATTGATGACGAAGATATTCGAGAGTTCACTGTGCAGTCACTACGTGACACCATCGGTGTTGTTATGCAAGATGCACACCTGTTCCATGAAACTATCTCTGAAAACCTTCGCTATGCGAAAGAAGATGCAACTGAGGCCGAGATGATTGAAGCGTGCAAAGCAGCGCAGATTTGGGCTCTGATCTCAACGCTTCCAAATGGCTTTGACACCATGGTAGGAGAGCGAGGCCATCGCCTTTCTGGTGGAGAGAAGCAAAGGCTTGCAATAGCCCGACTGCTTCTTAAAGCTCCTGCCATTGTGATCTTGGATGAGGCAACTGCGCACCTTGATTCTGAAAATGAATCCCTCGTGCACGAAGCACTCAAGCATGCTCTTAAGGGGCGAACATCGATTGTGATTGCTCATCGATTGAGCACGGTGATGGAAGCGGATCAGATCTTAGTTCTTGAAAAGGGGCTGATCGTAGAGCGCGGAAAGCATGAAGAACTTATTGCTCAGGGTGGTTTGTACTCAGAGCTCTTTGCTAGACAAGACATCACGACGAATTAAGAATCGTCCGTAGATAACAAGGCCTCCAAAGAAGAGCCACTGCAAGGCATAAGCCATATGTGGTCCATCGGAGAGTTCTGGAAGTTGTGCTGGAACAGCAGGAGTGAGAGCAGAATCTGATCCACTCATTAAATCAATATAGAAAGTCTCTGTTGAAGATCCAGATTGTGCATTTGCCTTTGAGACAAGTCCGTCAGCCTTGTTGCCAGGAATCGCAAAGAATGATCCACGCGGAAGAGAGGAATCTAAGCGAAGCCTTCCGGTAATGCTCACTTGACCCGTTGGTAAAGCAGGTAGTTCGGGTTGAGAAGATGCGTTCGCACCAGCCTTGACCCAACCGCAATCAACCCAGAAGCTCTTTCCTTCAGGTGTAGTAAAGCGCGTGAGAAGTTCGAAACCATACACACCTTCTGAATAGCGATTACGCAAGAGTATTTGTGATGAGGAGTCAAAGATTCCTTCAGTGCGTACTGGTTGCCATTCATGGTCAGCAGGGTTGGATCGCACAGATGTCAGTGAGACAGGGCTCATGCTTGAGTTTGTTTCGATGACTGAATTGCGTGCGTGCCGATCAACACCACGGTGATACTGCCATTGCGCTGCCCAGATGCATCCAATAATCAGAAGTAGGGCGACGAGGGACTTAAGGAGCGAGTAACTCTCTTTCTTTCTTGTGTTACTCAATTCCTCTCGGCCTCTTCTTCAAGATGGCTTCACCTGGTAGAACTGTTTCGCGCCCTGCATTTGCAACTACGACTGCGATATAGGGCAAGGTAACGGCGCCGAGAAGTGCGAACCATCGATATGGACTGGGTAGTAACACAGTCAGGATGAAGCAGGCTGTTCGTATCATCATTGAAATGAAATAGCGACGTTGTCTCGCAGACTGGTCAGTAGATAACCCTTTTGGGGCGCTAGTGATGTCGTAAACGTCATCTTCTTTAGCCATAGAGCAACAGTAGGGTAGTTTTCTCTCATGAGCGAGCCGACTTCAATCGCACTTGTTACCGGTGGAAATCGCGGCATCGGTTTAGCAATCGCGACTGCACTCAAGGCTGCAGGACACCGCGTCGTCATTACTTATCGAAGCGGAACGCCACCGACAGGCTTTGATGCTGTGCAGATGGATGTCACTGATTCAGCAAGTGTGGATGCAGCATTTACAAGGATTGAATCTGAAATTGGACAGCCTGAGATTATCGTTGCAAACGCAGGCATTACGAAAGACACTCTTGTCCTGCGTATGAGCGATGAAGATTTTGAAAGTGTGATTGATGCAAACCTCACAGGTGCATTCCGAGTTGCAAAGCGTGCAACAAAAGGTCTGCTTAAGTTAAAGCGAGGACGTCTCATATTTATTGGTTCCGTCGTTGGGGGAGTAGGCGCTGCAGGCCAAGTGAATTACTCAGCCTCTAAATCAGGTCTTGTCGGAATGGCTCGATCCTTTGCCCGTGAACTTGGAAGTCGAGGCATTACAGCCAATGTCATTGCACCGGGATTTGTCGAGACAGATATGACAGCAGAACTTGATGAGAAGCGTCGCGTAGATATTGCAGCACAAGTTCCACTTGGACGTTTCTGCTCTGCTGCAGAGATTGCAGATGTTGTGGCATTTATTGCATCCCCGCAAGCTAGCTATATCACCGGTGCCATCATTCCAGTCGATGGCGGATTAGGAATGGGTCACTAAGTATGGGAATCCTCCAAGGTAAAAACATCCTCGTCACCGGCGTTCTCACCGATGGATCAATCGCATTTCATATTGCCAAGATTGCACAAGAGGAAGGTGCGAACGTAGTTCTTACTGGCTTTGGTCGTGCTCTGAGCTTGACCACTCGTATTGCAGGTCGCCTCCCACAACTTCCTCCGATCATCGAACTCGATGTCACCAATCAAGAGCATCTCGATGGTTTGGCTGAGCAAGTAAGAAAGCATGTCCCACATCTTGATGGTGTTGTGCACTCCATCGGCTTTGCACCTGAGGCAGCACTGGGTGGAAATTTCCTGAATACTGCATGGGAAGATGTCGCAACTGCAGTGCATGTATCTGCATACTCGCTTAAATCGTTGACAATGGCTTGCAGGCCAACATTTAAAGATGGGGCATCTGTTGTCGGCCTTGATTTTGATGCACAAGTTGCCTGGCCAAAGTATGACTGGATGGGCGTTGCTAAGGCTGCTCTTGAATCCACATCACGCTATTTGGCTCGCGATCTCGGCGCTGAAAATGTTCGTATCAACTTAGTTGCAGCAGGCCCTATTCGCACCATGGCTGCAAAATCAATTCCTGGCTTTGATGAATTCGAAAAGGTGTGGAATGAACGAAGCCCACTGGAATGGGATGTCACAGATCCTGTTCCTGCAGCAAAAGCTGCCGTTGCACTCTTAAGTGATTGGTTTCCCAAGACTACGGCTGAGATCATCCATGTCGATGGTGGTCTGCACGGTATGGGCGCCTGATTACGCACGGATTTTTCATTCTGAGCCCGCTCAGGTAGCCTTGCCCCAGACCAGTAGCTCAGGCTGCTGCAAGAGGAGCTCACCGCTCCCACCTCTTTCGCTTCGGCGGATTGGTTAGTCGGAAGTAAACCTGATTCAGGTTTTACTCTTTCTATGCGGTGCGCGTTTTTTGTAGCGACTTGGATACTCAAGAACCGAACATAGGAGCAGAAATCACAACAGAGCCGCGCATTAACGACCGTATCCGCACACCCCAGATTCGTCTCATCAATTACACCGGTGAACAAGTTGGAGTTGTAGATATCGAAGCAGCGCTAGCAATGGCAGATGAAATCGGACTTGATCTCGTTGAGATCGCACCCGAAGCTAATCCGCCAGTGTGCAAGATCATGGACTTCGGCAAGTACAAGTACCAGGAGGCACAGAAAGCTCGCGAAGCACGTCAGAACCAGACCCACATTGTGGTGAAGGAAGTGCGTATGACACCAAAGATCGAAAATCACGACTACGAGACAAAGCGCTCAGCAGTCGAGAAGTTCCTCAAAGGTGGCGACAAGGTGAAGATCACGATGAAGTTCCGTGGCCGTGAGCAAACACGTCCAGAGCTTGGGTTCAAACTCTTGCAACGTCTTGCAGAAGATGTGAAAGAGATTGCATTCGTGGAGTTCGCTCCTAAGCAAGAAGGTCGACAGATGACGATGGTCCTAGGGCCAACGAAGAAGAAGACCGAAGCGGTAGCAGAACAGAAAGCGGCGCGAGCTGCCAAAGTGAAGGCTGCTGAAGAAGAAGCAGCCGCACAATAG
Protein sequences of DBSCAN-SWA_1 >NZ_CP016772|621036:628962|628338_628962_+|WP_190277142.1|DBSCAN-SWA MDTQEPNIGAEITTEPRINDRIRTPQIRLINYTGEQVGVVDIEAALAMADEIGLDLVEIAPEANPPVCKIMDFGKYKYQEAQKAREARQNQTHIVVKEVRMTPKIENHDYETKRSAVEKFLKGGDKVKITMKFRGREQTRPELGFKLLQRLAEDVKEIAFVEFAPKQEGRQMTMVLGPTKKKTEAVAEQKAARAAKVKAAEEEAAAQ >NZ_CP016772|621036:628962|623833_625699_+|WP_095692438.1|DBSCAN-SWA MSQHAVWMTFRSMTADPSVKSQKLKPGTIKRIITYGKPYKSHIIIFLITVVIEALLVVSTPLLLRELIDKGVLPKDTGLVTKLAFLVGLLAVVDAAFNIFGRWYSARIGEGLIYDLRSQVFAHIQRQSIAFFTRTQTGALISRINSDVMGAQQAFTGTLSGVVSNVVSLVLVVTTMLILSWQITVVSLLLLPAFLLPTKWVGKRIQALTRDSFNLNATMSSTMTERFNVSGALLVNLYGKPAKEENFFRTRARKVADIGIQTAMLNRVFFVGITSVAAVATAFAYGIGGHLAINGSITVGTLLAITALLIRLYGPLTALSNVRIDVMTALVSFERVFEVLDLQPMIVDKADAKVLRKKDLKIDFTDVAFSYPRADEISLASLESAAKPETVESGEVLRGITFSAPAGSLTAIVGPSGAGKTTMSALLPRLYDVTRGAITIDDEDIREFTVQSLRDTIGVVMQDAHLFHETISENLRYAKEDATEAEMIEACKAAQIWALISTLPNGFDTMVGERGHRLSGGEKQRLAIARLLLKAPAIVILDEATAHLDSENESLVHEALKHALKGRTSIVIAHRLSTVMEADQILVLEKGLIVERGKHEELIAQGGLYSELFARQDITTN >NZ_CP016772|621036:628962|626675_627389_+|WP_095692440.1|DBSCAN-SWA MSEPTSIALVTGGNRGIGLAIATALKAAGHRVVITYRSGTPPTGFDAVQMDVTDSASVDAAFTRIESEIGQPEIIVANAGITKDTLVLRMSDEDFESVIDANLTGAFRVAKRATKGLLKLKRGRLIFIGSVVGGVGAAGQVNYSASKSGLVGMARSFARELGSRGITANVIAPGFVETDMTAELDEKRRVDIAAQVPLGRFCSAAEIADVVAFIASPQASYITGAIIPVDGGLGMGH >NZ_CP016772|621036:628962|626375_626648_-|WP_095676123.1|DBSCAN-SWA MAKEDDVYDITSAPKGLSTDQSARQRRYFISMMIRTACFILTVLLPSPYRWFALLGAVTLPYIAVVVANAGRETVLPGEAILKKRPRGIE >NZ_CP016772|621036:628962|623038_623482_+|WP_095692437.1|DBSCAN-SWA MQLDNLYQEVILDHYKNPQNKKLNTTFDAQVHHINPSCGDEITLNVTLEGNMVKSVSWDGVGCSISQASVSILTDLLIGKSLEQAQVIADSFMHLMQSKGTEKGDESLLEDAVAFAGVSQYPARIKCALLGWMAFKDASVQALAKQA >NZ_CP016772|621036:628962|621036_621789_+|WP_095676118.1|DBSCAN-SWA MSTLEIRGLKVSVETEQGSIEILKGVDLTIKSGETHAIMGPNGSGKSTLAYSIAGHPKYTITEGTVTLDGADVLEMTVDERAKAGLFLAMQYPVEVPGVSVSNFLRTAATALRGEAPKLREWVGEVKSAMESLKMDASFAQRNVNEGFSGGEKKRHEIMQLELLKPKFAILDETDSGLDVDALRIVSEGVVRAKAANNHGILLITHYTRILRYIKPDFVHVFANGRIVEEGGPELADKLEANGYAEYVTA >NZ_CP016772|621036:628962|621791_623039_+|WP_095692436.1|DBSCAN-SWA MTFDAHAIAKDFPILDRTIRDGKRLVYLDSGATSQKPNVVINAESDFYRFHNAAVHRGAHQLAEEATDAYEGAREIVAQFLNASVDEIVFTKNATESLNLICYAMGNAAPGNRFHLKAGDSIVVTEMEHHANLIPWQQLAARTGAILKWFSVTDDGRLDLSQINSVIDEKTKVVALTHQSNVLGTINPLEAITKRAHEVGAVVVLDACQSVPHMKTDVKKLDIDFLAFSGHKAVGPTGIGVFWGRAELLAELPPFLTGGSMIENVTMESATWAQAPKKFEAGVPNMAQAVGLGAALTYLTGIGMDNIHKHEISLTKYLLEGLSAITDLRIIGPKTTELRGGVVSFTLGDIHPHDLGQYLDSQGIAVRTGHHCAWPLTRKLGVPATTRASVYLYNTTDDLDALIVGVQGAQKYFGR >NZ_CP016772|621036:628962|623502_623814_+|WP_095531491.1|DBSCAN-SWA MVAKIEDINEAMKDVVDPELGINVVDLGLIYDMMVDDNNIAVLNMTLTSAACPLQDVIEDQTRQALAPFTDDVKINWVWMPPWGPDKISDDGREQLRALGFTV >NZ_CP016772|621036:628962|627391_628159_+|WP_095676125.1|DBSCAN-SWA MGILQGKNILVTGVLTDGSIAFHIAKIAQEEGANVVLTGFGRALSLTTRIAGRLPQLPPIIELDVTNQEHLDGLAEQVRKHVPHLDGVVHSIGFAPEAALGGNFLNTAWEDVATAVHVSAYSLKSLTMACRPTFKDGASVVGLDFDAQVAWPKYDWMGVAKAALESTSRYLARDLGAENVRINLVAAGPIRTMAAKSIPGFDEFEKVWNERSPLEWDVTDPVPAAKAAVALLSDWFPKTTAEIIHVDGGLHGMGA >NZ_CP016772|621036:628962|625660_626383_-|WP_095692439.1|DBSCAN-SWA MSNTRKKESYSLLKSLVALLLIIGCIWAAQWQYHRGVDRHARNSVIETNSSMSPVSLTSVRSNPADHEWQPVRTEGIFDSSSQILLRNRYSEGVYGFELLTRFTTPEGKSFWVDCGWVKAGANASSQPELPALPTGQVSITGRLRLDSSLPRGSFFAIPGNKADGLVSKANAQSGSSTETFYIDLMSGSDSALTPAVPAQLPELSDGPHMAYALQWLFFGGLVIYGRFLIRRDVLSSKEL |
10 | Cedratvirus(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1063617 : 1075611
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP016772|1063617:1075611|DBSCAN-SWA TTTAAAGAGTTGTGACAGGGGCATAGCGCAGCAATAAGCGCTTATCGCCATATTGGCCGAAGTTGATGGTCGCCTCTGATTTATCGCCTTCTCCAGCAAGTGCCACCACAGTCCCTAATCCGAATGTGTCATGTGACACACGCTGTCCCACTTCAAGAACCATCGCCGTTGATTTCTTGCCCGTAGCTCGCGGTGGTGGACCAGATGGAAGACGAGATTTCTTCAAAGCAGAAGATGTTGTAAATGTGCTGCGACCTTCATTCTTCCATTCGATCAATTCAGATGGAATTTCATCAAGGAATCGAGAACCTGGATTGTATTTCGGTGTTCCAAAAGTTAAACGATATTCAGCGCGTGAAATATAGAGACGCTTTTCTGCGCGCGTTAATCCAACATATGCAAGGCGACGTTCTTCTTCAATCTCTCTGGGATCATCGAGTGTGCGTGCATGAGGAAAGATTCCATCTTCCATACCTGTGAGAAAGACTGTGGGAAATTCAAGACCTTTCGCAGTGTGCAGAGTCATCAGTGTGACAACTCCCCCGTGATCTTCTCCATCAGGAATCTCATCAGCATCGGCAACAAGGGATACCTTCTCCAAAAACCCAGAGAGTGAAATCTCTTCATCTTCACCAAGTTCTTCAAAGGGGCGCTCTTCATACTCCATCGATGCAGAGACAAGTTCCTTAAGGTTCTCAACGCGCACTTCATCTTGTGGGTCTTTACTTGCTTCAAGCTCTGTCAGCAACCCTGACTGTTCAAGGACTGCTTCAATGATCACAGATGGTTTTGTCTTTGCTTCAACCAGTGTCTGCAGCGCAATCAGCATTGAGGTAAAGGATTCGATGGATTGCGCTGCCTTATTGGGCACAGATGTCGCCTCTGATACTCGAAGTAGTGCGTTCCAGAAAGAGATGCCTTGAGCTTCGGCGAAGATTTCAACTTCCTCTAGCGCGCGATCTCCAATACCTCGCTTAGGTAAGTTGATGATGCGACGTATGGAAACTTCATCATCGGGATTAGCTAGAACGCGAAGGTAGGCGAGTAAATCCTTCACCTCTTTACGTTCATAGAATCGAAGGCCACCCACGACTTTATATGGGATTGCAGCGCGCATAAAGATTTCTTCAAAGATACGAGATTGAGCGTTGGTGCGATAGAAGACTGCGGTATCTCCTGGATTGGAGTGGCCCATATCCTGTAAGGAACGAATTTCACTCTTAATGAACTCTGCTTCGTCATATTCAGATTCAGCAACATAACCAACAAGTGGTGCACCAGATCCTGCATCAGACCAAAGGTTCTTCTCTTTCCGTGATTCATTCTTGGTAATCACAGCATTTGCAGCGTTGAGAATATTCTGTGTTGAACGGTAATTTTGTTCAAGCAAAACTGTTGCAGCATTGGGGTAATCAACCTCAAATTGCAAGATGTTGCGGATGGTTGCACCACGGAATCCATAGATAGATTGATCAGCATCTCCCACAACGCAGAGCTCTGCAAGTGGGAAGCCATCTCCTTCAACACCGGTGAGTTCCTTTACCAATTGATATTGCGCATGGTTGGTGTCCTGATACTCATCGACAAGGATGTGGCGAAAGCGGGAGCGAAAGCGCGCTTTAGCTTCAGGAAACTTCTGCAGTACCTGCACAGTCTTCATAATCAAATCGTCAAAGTCCATGGCATTTGCCTGCTGCAACCGCTTTTCATACATCGTATAGACATCGGCGACGATGGTCTCGAATTGATTGGTGGCATGGCTTAAATACTCATACGGAGTTTGAAGTTCATTCTTTGCATTAGAAATCAGGTTCTGGAATTGACGGGCTGGATATCTCTTGGAATCAATATTGAGAGTCTCCATCACACGTGAGATTAGCTTCTGTGAATCTGCTGAGTCATAGATGCTAAAGGTGCTGCTATATCCAAGGCGTTCAACTTCTTGGCGCAGCATGCGCACACAGGATGAGTGAAATGTTGAGACCCACATTGATTTAGCAACCGGTCCCACAAGTTCGCTGACACGCTCTTTCATCTCACCAGCTGCCTTATTGGTGAAGGTAATTGCCAAGATTTCATAGGGGCGAACTTCGCGACGAGCCATGATGTATGCGATGCGACGAGTAAGAACTCGAGTCTTTCCAGATCCAGCACCTGCAACAACTAAGAGTGGAGAACCAGCGTGGGCGACGGCCTTTTGTTGCTGGGGATTGAGTCCATCCAGCAATGGGTCGGTCGTGGTCATGGCGGGGAGTCTATTAGGCTTGGCTGTCATGGAGCCTTTAGCAGATTCAGAGATCAAGCGCGTTTTAGTTATCAACGCCCACCCCGATGATTCCGACTTTGGCGCATCAGGAACTATCGCGCAGTGGGTTCAAAAAGGCATTGAGGTCTCATACGTTCTCTGCACAAATGGCGACCAAGGTGGCGAAGAGTCAGGTTTTACCAAGGAAGAGATGCCTGCTGTGCGCCAACGCGAACAACGCGCTGCATGTAAAGCACTTGGAATTAGCGATGTCACATTCCTTAACTATGTCGATGGCCATCTTGAGCCAACGATTGCGCTGCGTAAAGATATTGTTCGTCAGATTCGTCGCGTGCAACCAGATCGCATCGTCTGCCAATCACCTGAACGCAATTGGGAGCGCATCGGTGCAAGCCATCCCGATCACTTAGCTGCAGGGGAAGCAACTATTCAAGCTGTCTATCCCGATGCGCGTAATCCTTTTGCTTTCACAGATCTGCTTGAAAAAGAGAATTTGCAACCATGGAGAACGAAAGAGGTATGGATGCAAGGCCATGCCCATCCAGATCACTTCGTCGATATCACAGACACATTCCATTTCAAGATTGCTGCCCTTAAAGAGCATGCATCACAGACTGGCCATATGGAAAATCTCGAAGAGATGCTTCGTGAATGGGGTCAGCGCAATGCAGGTATGGGGCAACTTCCTGAAGGGCGTATTGCTGAAGCCTTTAAGATTGTTAACACCAACTAAATTACTTAGCTGAAGCCTTAACTATTTCAACCATCTGGATGGGAAATCCATCTGCTGCATACATCGCCTGAGTTCCACTCATTTGATACGCACCCACTGCGCCAATTTTCTGCACAAGATCTAAGCCTTTGGTGATCTTGCCCCAGATTGTGTAGTCAGGTCCAAGCGTGGTGTCTTGATAGACAAGAAAGAACTGTGATCCGTTGGTGTTGGGGCCAGAGTTAGCCATAGCCACTGTCCCTGCTGGGTAATTGATGCCGGCAGCCTTTGGAAGGTTTTCATTGGCATAACCTTTCCATCCTGTTGGAGAACCACTTCCACTTGCTGTGGGATCGCCACATTGCAGAACAAAAATTCCTTCCGTTGTTAAACGATGGCAGAGAGACTTGTTGTAGTAACCAGCATTTGCAAGGGATGCAATCGATGTCACTGTGATTGGCGCCTTTGCACCAAGGAGTGAAATCACAATAGGGCCGCAGTTAGTTGTAATCGTTAAAGTCTTTGCAGGGCTCTTTGCAGCAACAGTTGGCTGCTTTACTTTGGCCGGAGCATGTCCCTTAGCGGTGCTCTTTGCACACCCTGGAACTGAGATGGCACGCTCGGCTGCATGAGCAGGAGCTGTTACCGACGTTGCTGCAAGAAGTATGAGTGCAATGGCTGCGACCTTCGCGTTCTTAAACATTTCTGTGGTTCCTATTCCCATTCAATCGTGCCTGGTGGCTTACTTGTCACATCAAGGACCACGCGGTTGACCTCGCGGACCTCATTGGTAATTCGTGTTGAAATCTTCTCAAGAACTTCGTATGGCACACGTGACCAATCTGCAGTCATCGCATCCTCACTTGAAACTGGGCGCAAAACTATCGGATGACCGTAAGTGCGTCCATCGCCCTGAACTCCAACGCTTCGCACATCAGCCAAAAGCACAACCGGGCATTGCCAGATATCTCGATCAAGGCCTGCAGCCTTTAATTCATTGCGCGCAATCAGATCGGCATGACGCAAAATTTCTAAGCGCTCAGCAGTTACCTCACCGATAATGCGAATGCCAAGGCCAGGACCTGGAAATGGTTGGCGCCAGACAATCTCTTCAGGCAAGCCAAGTTCAATTCCAACCTGGCGCACTTCATCTTTAAAGAGAGTGCGCAGTGGTTCTACAAGTTTGAATTTAAGGTCATCAGGAAGCCCACCAACATTGTGGTGAGATTTGATGTTGGCAGTACCTGTTCCGCCCCCTGATTCAACAACATCTGGATAGAGCGTGCCCTGCACAAGGAATTCAACATCTCCGCCCGCTGCAATATCGCGAGCTGCTGCTTCAAAGGAGCGAATGAATTCACGGCCGATAATTTTGCGCTTTTCTTCAGGGTCTGTAACTCCTGCAAGTGCGTTCAGGAACTGATCAATGGCATCAACGACAACGAGTTCGACTCCAGTGGAGGCAACAAAGTCACGCTGCACTTGCTCTGATTCACCACTGCGCAGTAGACCGTGATCTACGAAGACACAGGTCAGTTGCTTGCCCACAGCGCGCTGCACGATTGCAGCTGCAACTGCTGAATCAACGCCTCCAGAGAGACCACAGATAACGCGCTTATTACCAATAACTTCCCTAGCCTTTGCAACTTCATCTTCTGCGATGTTATGCGTTGTCCACGTTGGCTTACATCCTGCAATATTGATAAGCCAATTCTTCAAGATTGCTTGGCCATGCTCTGAGTGAAGAACTTCTGGGTGAAATTGCACTCCTGCAAGCTTTCCGGATGCATCTTCAAATGCTGCAATAGGTGTGTCAGACGTTGATGCTGTCACGCTGAAACCAGATGGAACCTCAGAGACTGCATCACCATGTGACATCCACACAGATTGCGCTGCAGGAAGACCTGCAAACATCTTGGAACCAGCTTTTACTGCAAGGGGTGTGCGACCAAACTCTGATTTACCAGTCTGTGCCACAACGCCAGCAAGGGCTGCAGCCATTGTTTGAAATCCATAGCAAATTCCAAAGACTGGGATGCCGAGCGTGAAGATTGCAGGATCGACTTTTGGTGCGTGATCGGCATAGACCGATGAAGGTCCACCTGACAAGATAATTGCTTCGGGATTCTTTGCACTTACTTCATTTGCAGTAATGGAAGATGGAACAATTTCAGAGTAAACATTAGCTTCGCGAACACGTCGAGCAATGAGCTGTGCATACTGTGCGCCGAAGTCGACGACAAGAACGCCGTGCTTATCGGCCATGCTGCATAATGATTTCTACGCGCTGGAATGACTTCACATCTGAGTATCCGCAGGTGGCCATTGCACGACGTAGTGCACCGAACAAATTCATTGAGCCATCAGAGGTATGTGATGGGCCATTGAGTACTTCTTCAAGAGTGCCGACAGTTCCGACATTCACGCGCTGACCGCGTGGAACTTCTTGGTGATAGGCCTCAGATCCCCAGTGCCAACCAAGTCCTGGAGATTCAACTGCCTTAGCAAGTGCTGAACCCATCATGACCGCATCTGCCCCGCATGCAATTGCCTTAGCGATATCTCCAGATTTACCAACAGAACCATCTGCAATAACGTGAACGTAACGTCCACCTGATTCATCAAGGTATTCACGGCGCGCTGCTGCCACATCGGCAACCGCTGTTGCCATCGGAACCTGAATTCCAAGAACAGTTTGCGTTGTGTGAGCTGCTCCCCCACCAAAACCAACAAGAACACCAGCTGCTCCTGCACGCATCAAGTGAAGCGCACCTGTAGTCGTTGCAACTCCGCCAACAATCACAGGGACATCAAGCTCATAAATAAATTTCTTTAAGTTAAGAGATTCACTCTCTGCAGCAACATGCTCAGCTGAGACAGTTGTTCCGCGAATAACGAAGATATCAACGCCAGCATCAATCACAGCTTTGTGAAGCTGTGCGGTGCGCTGTGGTGAAAGCGATGCTGCAACGGTAACGCCCGAGTCGCGGATCTGCTTGATGCGCTCCTTAATCAATTCAGGTCGGATTGGTGCTGCATAAATCTCTTGCATACGTCGCGTTGCATGCTTATCTGGCATGGATGCGATTTCACCTAGTGGAATACGCGGATCGTCATAACGAGTCCACAAGCCCTCAAGGTTGAGAACTCCAAGGCCACCAAGCTTTCCGATGGCGATCGCTGTCTCAGGTGAGACAACTGAATCCATAGGAGCTGCAATCAGTGGAAGATCAAACTTATAAGCATCGATCTGCCATGAGGTGCTGACCTCTTCAGGGGTGCGAGTACGGCGGCTTGGGACGATGGCGATATCGTCGAATGAATAAGCCTGGCGGGCGCGTTTTCCGGGAGCGATTTCGTAATCCATAGTTATTTCTTCTTGCTGTAGTTAGGGGCATCGGCCACATCGAGAACATCATGTGGGTGGCTCTCTTGTAAGCCAGCTGCAGTAATTTGAATCAAGCGACCTTCACGGCGCAGAGTTTCGATATCAGGAGCGCCCGCATATCCCATACCGCTACGAAGTCCGCCCACAAGTTGGTGAACAACATCGGCAACAGGTCCGCGATAGGCAACCTTTCCTTCAATTCCTTCAGGAACAAGTTTATCTTCTGAGAGAACATCATCTTGCATGTAGCGATCCTTAGAGTATGACTTCTTCTCTCCACGAGATTGCATCGCACCGAGTGAGCCCATTCCACGATATGCCTTAAACATACGACCATCAATTTCAACGAGTTCGCCAGGAGATTCTTCACACCCTGCAAGAAGTGAACCGAGCATCACTGAATGTGCGCCGGCAACAATTGCTTTGACGATATCCCCTGAATATTGAAGTCCACCATCTGCAATCAGCGGAATGCCAGCTTTATTACAAGCCTTCGCAGCTTCCATGATTGCGGTGACTTGTGGAACACCGACACCTGCAACAACACGAGTTGTGCAGATAGATCCTGGACCAACACCAACCTTTACTGCATCCGCTCCTGCATTAATCAGGGCCTGTGCACCTGCACGTGTTGCAACATTTCCACCAATGATTTCGATGGTGGAGGAAAACTTCTTAATGCGCTCTATCGCATCGAGTACTGCTCGGTGATGGCCATGCGCTGTATCCACAACGATGACATCAACTCCTGCCTCAATCAGCTTCTGTGCACGGGCAAAACCATCATCGCCCACACCAACTGCTGCCCCTGCAAGGCCAACGTTTTTTACTAACTTCACATGGGTGACTTGTTCATCGACAGGAAGGTTGCGGTGGATGATGCCGATGCCGCCAGCCTTTGCCATAGCAATTGCCATTGTGGATTCAGTGACGGTATCCATGGCGGAAGAAATAACAGGAACTGCCAAAGTGATATTGCGTGTGAGGCGAGTTGAGGTATCAACCTCAGATGGCACTACATCAGATGCATCAGGAAGAAGAAGGACATCGTCATATGTCAGGCCAAGGAGGGCGACCTTCTCAGAATCGATCACGGCGGCTCCCTTGAATAGCAACGAGAGCGCAGTCTACCTGCTCAATTACGCGCCTAACGCCTGCGCAATTTCCTCACAGGCATGCTCAAGGCCCTCGGCCCGCACAACTTGATCACATGAAATTGCGTCCCAACCAGGCCCACCCACCACGATGCGAGGAGCTGGGCGGATTGCAGGAATCTCATTCCAATATTTGCTTTCGGCATTCTTTGGTAACTGTGCCCATAAGAAAATTGCAGGGGGTGCACATCGCGTCACCATCGCAGATAGGGCCTCAAGCGGTGTGCGAGCACCAAGAACCGATGTCTGGATATTGCGTTCGCAGAGCGCTGCAGCCAGGGCATAGATGGGCAGCGAATGAAGCTCTTCGCCCACTGCGGCCAGTAATACAGGGCGAGGATTGATGGGCTTCTTTAATTCAACTACTCGATTATGCATTGTGCGTTTAAGAATTTCAGAGAAGAGATGCTCAACCTCAATACCCTTCTGATTGTTCTCCCACTCTTCCCCAATCAAAAAGAGAACTGGCGCAATCACATCAGACCATGCGCCTTCAACCCCGTATGTATCAATTTCATGGGCCAACGTTGTTTCTACAAATGTATGATCAAAACTTTGTAAGGCTTTATAAAGAGCTGCGACAACTTCTTCGCGGACTTCAAAATCCTTCACGATCTTCTTCAGCGGCACCGCTGTCTTGCAAGCCTTTGCTTGCTCGGCGGCATCTGCAGGAGTAACCCCAGCAACGATAAGGCGGCGCATCATGGTCAGCTTTGCTAAATCATTGGGGCAGTAACGGCGATGCTCGCCCTCTTCATGATCGGATGGTCCCAGGCCATAGCGGCGGGCCCACGTGCGCAGCGTGGCTGGAGCTACACCGATGCGACGGGCAACTGCCGCCACCGTCAGTTTCTCTTCAGCGTCTACTGAAGCGCTCTTTGCCATGCCCCTATCGTGGCGTGAAGTGGCCTAACTCACCACTCCAAAATCGGACATTTCGCACATTGAACAACGCGTGACCCGATAGGTACTTGAACAACCTATGAAACGGTTGTATGGTGCAGATATCTAGAACAAAGGAGAACCACATGGCTGAACTATCACGTCTACCGCAACCTATTGCTGAGCAATGGGAGTGGCAGTACGAAGGATCATGCCGTTCACTCGATTCAGAGATGTTCTTCCACCCTGATGGTGAACGTGGACCACGTCGTCGCAATCGTGAGAACGCTGCAAAGGCAGTCTGCGCTTCATGCCCAGTGATCCAAGCCTGCCGCACACACGCCCTTGCAGTCCAAGAGCCATATGGAATTTGGGGAGGCCTGTCAGAAGATGATCGCGCCACTATCTTGATTCAGCGCGGTATTCCTTTGATCTCACACGCTTCATAATTTTTGAATTAAAAGAGAAATCCCCCGCACATAATGTGCGGGGGATTTCTTTCTTTAATTACTTAGTGACCGTGTCCACCTGGACCATGTGAATGACCATGACCGCTAGAAGCTGGCTCTTCATTTGCTGGGCGCTCATAGACAACAGCTTCTGTTGTTATAAACATTGCAGCAATAGATGCAGCATTTGCAAGTGCTGAACGAGTGACCTTCACTGGGTCGATAACGCCATCTTTTGCAAGATCGCCATAGACATCAGTTGCAGCGTTGAAGCCTTCATTTGGCTTCATTGCGCGGACCTTGGCGACTACTACGTAGCCTTCAAGACCAGCGTTTTCTGCAATCCAACGAAGTGGTTCATCGCATGCCTTGCGAACAAGTGCAACACCGACTGCCTTATCGCCTGTGAAGCCGAGGTTGTCATTGAGAGCATCAGCTGCATGCACAAGGGCAGCGCCTCCGCCGATAACGATTCCTTCTTCAACAGCTGCGCGAGTTGCAGAGATAGCATCTTCAAGACGATGCTTCTTCTCCTTCAACTCAACTTCTGTATGTGCTCCGACCTTAATGACGCAGACTCCACCAGAGAGCTTTGCAACTCTCTCCTGTAGCTTTTCGCGATCCCAGTCTGAATCAGATTGTGAAATCTCTGCACGAAGTTCTCCAACGCGACCTGCAACAGCTGCCTTATCGCCAGCTCCATCGACGATGGTTGTTGTCTCCTTTGTGACCACGATGCGACGAGCGCGTCCGAGATCTTCCAGAGTTGCAGCCTCTAACTTCATACCAACTTCTTCAGAGATGACAGTTGCACCAGTCAAGATCGCCATATCTTGCAAGATTGACTTGCGACGATCTCCAAATGCTGGAGCCTTTACTGCAGCTGATGTGAATACTCCGCGCATGCGGTTTACAACGAGAGTTGAGAGCGCTTCGCCTTCAACATCTTCTGCAACGATAAGAAGTGGCTTTCCTGCCTGTGAAACCTTCTCAAGAAGAGGAAGGAGTTCAGCAAGAGCTGAGACCTTGTTGGAGACGAGAAGGATGTAAGCATCTTCAAGGACTGCTTCCATGCGATCTTGGTCTGTAACGAAGTATGGAGAGATGTAGCCCTTATCGAACTGCATACCTTCAGTGAACTCGAGCTCCAGTGCTGTAGTTGATGCTTCCTCAACAGTAATAACGCCATCTTTGCCGACCTTATCCATCGCTTCTGCGATGAGGTCTCCGATTGCGCGGTCCTGTGCTGAAATCGTTGCAACATCTGCGATCTGCGCCTTATCTTTTACGACAGTTGCATTCTCGCGAAGACGAGCTGAGATTGCCTCAACAGCTGCTTCGATTCCCTGCTTGAGATCCATTGGCTGTGCTCCGGCAGCAAGGTTACGAAGACCTTCCTTGACCATTGCCTGAGCAAGCACTGTTGCAGTCGTTGTTCCGTCTCCTGCGACATCGTTGGTCTTTGTGGCGACTTCCTTGACGAGCTGTGCGCCCATGTTCTCAACAGGATCTGAGAGTTCGATTTCCTTAGCAATTGTCACACCATCGTTTGTGATGGTGGGTGCGCCAAATGACTTAGCGATGACAACGTTGCGACCCTTAGGTCCTAGCGTTACCTTGACGGTGTCAGCGAGCTGATTAACACCGCGTTCCATTGCGCGACGAGCATGCTCGTCGAATTCCAACATTTTGCCCATGGACTACTTCTCGATTATCGCGAGAATGTCGCGAGCAGAGAGAACGAGGTAATCCTCGTTGTTGTACTTCACTTCTGTTCCGCCGTACTTGCTGTAGAGAACGACATCGCCGACCTTTACATCCATCGGTACGCGAGCGCCATCATCGAAGCGGCCTGGGCCTACTGCAACAACTGTGCCTTCTTGTGGCTTCTCTTTCGCTGTATCTGGGATGACAAGACCTGATGCAGTTGTTGTCTCAGCTTCATTTGCCTTAACAACGATGCGATCTTCGAGCGGCTTAATGGCTACTGCCATGGTGGTACTCCTTTTAGATTCGATATATCCACTTTGTTCGGGGCTCACCTGCTTTAGCAGTGTCGACCCTAGAGTGCTAACGCCAAGCCTATGGGGGCTTATTCCAGCGGGCAAATCGGGCCTACAGGCTGACCTGCTCCACCTTCAGGCTGGAATCAGCGTCAAAGGCGCCCTTGGTGGCGGGCCTACCAGCGCTGACCATCAGGCTGCCCAGGGATGCCACCATCGCCCCGTTATCGGTGCAGAGAGCCGGAGATGGAATACGAAGTGCAATACCAGCCTTCTCACAGCGCTCTGTGGCAACGGCTCGAAGACGAGAGTTGGCAGCAACACCACCTGCGATCACCAGTGAATCAATTCCCGTTGATTTACAGGCAGCCAGTGCTTTGAGCATCAAGACATCAACTATGGCTTCTTGGAAAGATGCTGCAACGTCGGCGCGAATGAAGGATGGAGTTCCTTCAAGATAACGAGCAACCGCAGTCTTTAGCCCTGAGAATGAAAAATCATAGGGGCGCGTTGCCCAGTCGTTCGTTGTTGTCAGCCCTCGCGGGAAATCGATGGCGCTCGCAGAACCGTTTACTGCTTCTCGATCTATGGCAGGACCGCCAGGAAATCCTAATTCGAGAATGCGTGCAATTTTATCGAATGCCTCACCTGCAGCATCATCCATCGTCGCACCAAGTTTGGTTATTGAACCTGTGATGTCATCAACTTGAAGAAGGGATGAGTGGCCACCGCTTACAAGAAGTGCGATGGTGGGATCTGTTGGTTGATCATGAGTTAAGTAATCAACGGAGACGTGAGCTGCTAAATGATTAACGCCATAGAGCGGACGACCAAGTCCTTGCGCCAATCCACTTGCTGATGCAACACCGACTAGAAGTGCGCCAACGAGTCCTGGGCCAGCTGTCACTGCAATCGCATCAATATCTGCAAGTGAAATCTTCGCATCTTTGAGTGCGCGCTGGATACTAGGCAACATCGCCTCCAAATGAGCACGCGATGCAATCTCAGGAACTACTCCCCCAAAACGTGCATGTTCATCAACACTGGATGCGATCACATTGGCAAGAAGTGTGCGCCCACGAACAATGCCGATTGCTGTCTCATCGCATGATGTTTCAATACCAAGGACTACTGGTTGCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP016772|1063617:1075611|1069963_1071076_-|WP_095676526.1|DBSCAN-SWA MIDSEKVALLGLTYDDVLLLPDASDVVPSEVDTSTRLTRNITLAVPVISSAMDTVTESTMAIAMAKAGGIGIIHRNLPVDEQVTHVKLVKNVGLAGAAVGVGDDGFARAQKLIEAGVDVIVVDTAHGHHRAVLDAIERIKKFSSTIEIIGGNVATRAGAQALINAGADAVKVGVGPGSICTTRVVAGVGVPQVTAIMEAAKACNKAGIPLIADGGLQYSGDIVKAIVAGAHSVMLGSLLAGCEESPGELVEIDGRMFKAYRGMGSLGAMQSRGEKKSYSKDRYMQDDVLSEDKLVPEGIEGKVAYRGPVADVVHQLVGGLRSGMGYAGAPDIETLRREGRLIQITAAGLQESHPHDVLDVADAPNYSKKK >NZ_CP016772|1063617:1075611|1072529_1074164_-|WP_095676529.1|DBSCAN-SWA MGKMLEFDEHARRAMERGVNQLADTVKVTLGPKGRNVVIAKSFGAPTITNDGVTIAKEIELSDPVENMGAQLVKEVATKTNDVAGDGTTTATVLAQAMVKEGLRNLAAGAQPMDLKQGIEAAVEAISARLRENATVVKDKAQIADVATISAQDRAIGDLIAEAMDKVGKDGVITVEEASTTALELEFTEGMQFDKGYISPYFVTDQDRMEAVLEDAYILLVSNKVSALAELLPLLEKVSQAGKPLLIVAEDVEGEALSTLVVNRMRGVFTSAAVKAPAFGDRRKSILQDMAILTGATVISEEVGMKLEAATLEDLGRARRIVVTKETTTIVDGAGDKAAVAGRVGELRAEISQSDSDWDREKLQERVAKLSGGVCVIKVGAHTEVELKEKKHRLEDAISATRAAVEEGIVIGGGAALVHAADALNDNLGFTGDKAVGVALVRKACDEPLRWIAENAGLEGYVVVAKVRAMKPNEGFNAATDVYGDLAKDGVIDPVKVTRSALANAASIAAMFITTEAVVYERPANEEPASSGHGHSHGPGGHGH >NZ_CP016772|1063617:1075611|1066616_1067297_-|WP_095676867.1|DBSCAN-SWA MFKNAKVAAIALILLAATSVTAPAHAAERAISVPGCAKSTAKGHAPAKVKQPTVAAKSPAKTLTITTNCGPIVISLLGAKAPITVTSIASLANAGYYNKSLCHRLTTEGIFVLQCGDPTASGSGSPTGWKGYANENLPKAAGINYPAGTVAMANSGPNTNGSQFFLVYQDTTLGPDYTIWGKITKGLDLVQKIGAVGAYQMSGTQAMYAADGFPIQMVEIVKASAK >NZ_CP016772|1063617:1075611|1074582_1075611_-|WP_095692753.1|tRNA|DBSCAN-SWA MQPVVLGIETSCDETAIGIVRGRTLLANVIASSVDEHARFGGVVPEIASRAHLEAMLPSIQRALKDAKISLADIDAIAVTAGPGLVGALLVGVASASGLAQGLGRPLYGVNHLAAHVSVDYLTHDQPTDPTIALLVSGGHSSLLQVDDITGSITKLGATMDDAAGEAFDKIARILELGFPGGPAIDREAVNGSASAIDFPRGLTTTNDWATRPYDFSFSGLKTAVARYLEGTPSFIRADVAASFQEAIVDVLMLKALAACKSTGIDSLVIAGGVAANSRLRAVATERCEKAGIALRIPSPALCTDNGAMVASLGSLMVSAGRPATKGAFDADSSLKVEQVSL >NZ_CP016772|1063617:1075611|1071121_1072021_-|WP_095692751.1|DBSCAN-SWA MAKSASVDAEEKLTVAAVARRIGVAPATLRTWARRYGLGPSDHEEGEHRRYCPNDLAKLTMMRRLIVAGVTPADAAEQAKACKTAVPLKKIVKDFEVREEVVAALYKALQSFDHTFVETTLAHEIDTYGVEGAWSDVIAPVLFLIGEEWENNQKGIEVEHLFSEILKRTMHNRVVELKKPINPRPVLLAAVGEELHSLPIYALAAALCERNIQTSVLGARTPLEALSAMVTRCAPPAIFLWAQLPKNAESKYWNEIPAIRPAPRIVVGGPGWDAISCDQVVRAEGLEHACEEIAQALGA >NZ_CP016772|1063617:1075611|1072164_1072467_+|WP_095692752.1|DBSCAN-SWA MAELSRLPQPIAEQWEWQYEGSCRSLDSEMFFHPDGERGPRRRNRENAAKAVCASCPVIQACRTHALAVQEPYGIWGGLSEDDRATILIQRGIPLISHAS >NZ_CP016772|1063617:1075611|1068848_1069961_-|WP_095692750.1|DBSCAN-SWA MDYEIAPGKRARQAYSFDDIAIVPSRRTRTPEEVSTSWQIDAYKFDLPLIAAPMDSVVSPETAIAIGKLGGLGVLNLEGLWTRYDDPRIPLGEIASMPDKHATRRMQEIYAAPIRPELIKERIKQIRDSGVTVAASLSPQRTAQLHKAVIDAGVDIFVIRGTTVSAEHVAAESESLNLKKFIYELDVPVIVGGVATTTGALHLMRAGAAGVLVGFGGGAAHTTQTVLGIQVPMATAVADVAAARREYLDESGGRYVHVIADGSVGKSGDIAKAIACGADAVMMGSALAKAVESPGLGWHWGSEAYHQEVPRGQRVNVGTVGTLEEVLNGPSHTSDGSMNLFGALRRAMATCGYSDVKSFQRVEIIMQHGR >NZ_CP016772|1063617:1075611|1063617_1065861_-|WP_095692747.1|DBSCAN-SWA MTTTDPLLDGLNPQQQKAVAHAGSPLLVVAGAGSGKTRVLTRRIAYIMARREVRPYEILAITFTNKAAGEMKERVSELVGPVAKSMWVSTFHSSCVRMLRQEVERLGYSSTFSIYDSADSQKLISRVMETLNIDSKRYPARQFQNLISNAKNELQTPYEYLSHATNQFETIVADVYTMYEKRLQQANAMDFDDLIMKTVQVLQKFPEAKARFRSRFRHILVDEYQDTNHAQYQLVKELTGVEGDGFPLAELCVVGDADQSIYGFRGATIRNILQFEVDYPNAATVLLEQNYRSTQNILNAANAVITKNESRKEKNLWSDAGSGAPLVGYVAESEYDEAEFIKSEIRSLQDMGHSNPGDTAVFYRTNAQSRIFEEIFMRAAIPYKVVGGLRFYERKEVKDLLAYLRVLANPDDEVSIRRIINLPKRGIGDRALEEVEIFAEAQGISFWNALLRVSEATSVPNKAAQSIESFTSMLIALQTLVEAKTKPSVIIEAVLEQSGLLTELEASKDPQDEVRVENLKELVSASMEYEERPFEELGEDEEISLSGFLEKVSLVADADEIPDGEDHGGVVTLMTLHTAKGLEFPTVFLTGMEDGIFPHARTLDDPREIEEERRLAYVGLTRAEKRLYISRAEYRLTFGTPKYNPGSRFLDEIPSELIEWKNEGRSTFTTSSALKKSRLPSGPPPRATGKKSTAMVLEVGQRVSHDTFGLGTVVALAGEGDKSEATINFGQYGDKRLLLRYAPVTTL >NZ_CP016772|1063617:1075611|1074167_1074461_-|WP_095676530.1|DBSCAN-SWA MAVAIKPLEDRIVVKANEAETTTASGLVIPDTAKEKPQEGTVVAVGPGRFDDGARVPMDVKVGDVVLYSKYGGTEVKYNNEDYLVLSARDILAIIEK >NZ_CP016772|1063617:1075611|1065889_1066615_+|WP_095692748.1|DBSCAN-SWA MEPLADSEIKRVLVINAHPDDSDFGASGTIAQWVQKGIEVSYVLCTNGDQGGEESGFTKEEMPAVRQREQRAACKALGISDVTFLNYVDGHLEPTIALRKDIVRQIRRVQPDRIVCQSPERNWERIGASHPDHLAAGEATIQAVYPDARNPFAFTDLLEKENLQPWRTKEVWMQGHAHPDHFVDITDTFHFKIAALKEHASQTGHMENLEEMLREWGQRNAGMGQLPEGRIAEAFKIVNTN >NZ_CP016772|1063617:1075611|1067308_1068859_-|WP_095692749.1|DBSCAN-SWA MADKHGVLVVDFGAQYAQLIARRVREANVYSEIVPSSITANEVSAKNPEAIILSGGPSSVYADHAPKVDPAIFTLGIPVFGICYGFQTMAAALAGVVAQTGKSEFGRTPLAVKAGSKMFAGLPAAQSVWMSHGDAVSEVPSGFSVTASTSDTPIAAFEDASGKLAGVQFHPEVLHSEHGQAILKNWLINIAGCKPTWTTHNIAEDEVAKAREVIGNKRVICGLSGGVDSAVAAAIVQRAVGKQLTCVFVDHGLLRSGESEQVQRDFVASTGVELVVVDAIDQFLNALAGVTDPEEKRKIIGREFIRSFEAAARDIAAGGDVEFLVQGTLYPDVVESGGGTGTANIKSHHNVGGLPDDLKFKLVEPLRTLFKDEVRQVGIELGLPEEIVWRQPFPGPGLGIRIIGEVTAERLEILRHADLIARNELKAAGLDRDIWQCPVVLLADVRSVGVQGDGRTYGHPIVLRPVSSEDAMTADWSRVPYEVLEKISTRITNEVREVNRVVLDVTSKPPGTIEWE |
11 | uncultured_virus(22.22%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1205627 : 1215035
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP016772|1205627:1215035|DBSCAN-SWA ATCACTTTCCAACCTCCTGATAGCCACTTGTCCAATAACCAGCTTGATCTACTTGTTGCACGATGCGCTTCTTCCACAAATCACTTGTCTCAGGGAAATCTTCACGCCAGTGGGAGCCACGAGTTTCTTGGCGAATGAGTGCAGCTTTCACAATTGTCTGTGCAAGCTGGAAGAGGTTAGTTGTCTCCCATGCTTCAACACATGGCTGCGTGCTCTTCCTATCTTCAATACGAGTTAAGTCTGAACTTGTTTTGAGTAATGAATCTGAGGATCGCAATACACCTGCGCCACGGCTCATACTCACCTGAATATCATGGCGCACTTGTGGGTCAAGCAGAATTGCTTGCGATGCATTCACTACTGGTTCACTCTTCTCAGGCAAGTTCTCTGCAATATCTGCGGCAATGCGTGCGCTAAATACAAGGCCTTCCAGAAGTGAATTGGATGCAAGTCGATTAGCGCCATGAACACCAGAACATGCAGTTTCACCGCATGCATAGAGCCCATTGACACTTGTGCGGCCATTGAGATCAACGCGCACTCCACCTGATGCATAGTGCGATGCTGGAGCCACGGGAATCAAATCTTTCAGTGGATCTATGCCATGTGAAATGCATGATGCGTAAATAGTAGGAAAGCGTTCTTTAAATCCTTCAAGGTGGCGAACATCGAGCCAGACATGGTGAGCACCAGTTGTATTCATCACGCGCATAATGGAGATTGCTACAACATCGCGTGGTGCAAGTTCAGCAAGGGGATGAATATCTTGCATGAAGCGCACGCCTTTATCGTCAACAAGATAGGCACCTTCACCGCGCACAGCTTCACTAATCAATGGCTGTTGTCCTTCTGAGTTATCGCCCAGCCACAAGACTGTTGGGTGGAATTGAATGAATTCAACATCGGCCACCTTTGCCCCTGCGCGAAGAGCGAGTGCAACTCCATCTCCAGTTGATACAGATGGATTCGTTGTTTGAGCAAATACTTGACCTAGTCCACCTGTTGCAAGGACTACGGCGCGAGCAAGACCTCGGCCAACGCCATCGCGACTACCTGCGCCGATAACGTGAAGAGTTACTCCACACACTTCACCAATATCATTCTTGAGCGCATCGAGCACAAGTGCATGTTCAACAACTTCAATCTCAGGATCGTTTTGTACTGCAGCAAGAAGTGCTCGGGAGACTTCAGCTCCAGTTGCATCTCCCCCTGCATGCAAGATGCGATTACGAAGGTGTCCGCCTTCACGAGTAAGTGCGATTTCACCAGTATCTTCTTTATCAAAGATTGCGCCACGTTCGATTAACTTACGAACAGCTTCAGGGCCTTCGGTAACAAGTACGCGCACAGCATCAACATCACAGAGCCCAGCACCTGCAACGAGTGTGTCTTTCTCATGAGCTTCGGGTGAATCACCATCACCAAGTGCTGCGGCAATACCGCCTTGCGCCCACTTAGTGGAGCCTTCATCAACTCGTGCTTTTGTCACAAGTAAAACTGATAATCCATATGTGCGACATTGCAGCGCAGTAGTTAATCCTGCAATCCCTGATCCCACAACGATGACATCAGCTGTCGCAGTCCAGCCTGGAGTTGGCGCTAATAACTTCATTCGCGATTTCCCATAGGCATATTGTCAATCAATCGCACACCATTTATCCACCCAGCGACGATTGCACGAACCTCACGAGTATCTGATTGAGCGTGATCAAAGGTCTTCTCATCGATGAAATCTGCATAATCAACAGTAAGAGAGCTCTCACTCGCCAAGATCTCACGCAATTCATTTTCAGACTTGGCTTTGGTAAGCGCTCTATAGATGACTTGTGCTGCGTTTCGACCTTCATCGCTAAGTCGCACATTGCGAGATGAGAGCGCAAGTCCATCGACTTCACGAATAGTGGGTGCTGCAACAATCTCAATACCTTCAGCAATTGTCCTAATTAACTGCAGCTGCTGAAAATCTTTCTCTCCAAAGATCGCCACCTTTGGTTTAAGAAGTGAAAAAAACTGGTGGACAACAGTGAGCACTCCATCAAAATGTCCTGGTCGAGATTTACCTTCATAGATATCACCCAGTGGCCCTGCAGATTTCTTCACATAACCATCAGGATAAATCTCTGCTTGGTTAGGAAGCCAGAGATGTGTAACTCCAGCTGCATCAGCAATTGCAATATCAATATCTGGTGTGCGGGGATATTGCGCTAGGTCTTCTTTGTTTTCAAATTGCAAAGGATTAATAAAAATACTTGCAACAACATCAGAGCTATATTTCTTAGCAAGCGCAAAGAGTGAAGCGTGGCCCGCATGCAGTGCGCCCATTGTTGGAACAAATGCGCAGGCAGAAGGAAGCTGGCGAGCATCTGTTACTACTTTCATGGCTCCAGTCCTTTCAGGTACCGGGCAGGCAGTGAGGTAATTCTAAGGCGTTAGGACAGTTGCGCCAAAAGCTCAGAGATTTTGCCGTGTGTAAGAAGAAGGGCATCGGGTTCAATCTCATGCCATGGTTCCAGAACAAAGCGACGTTCAAAGGCGCGTGGGTGTGGAAGTTCTAACTCTTTGGCGCTACTGAGCAAAGTGCCATATTGAATGAGGTCGAGATCGATTATGCGAGGTCCCCATTTTTCAATGCGTTCACGCCCCAGTGATTTCTCGATTCCATGTAGCAGCGAAAGCAGATCGATAGCAGGCAAATCACTTTCAAGAATGCAGACTGCATTGATGTAATCGGGTTGTTCTGGACCACCCACTGGTTTTGTTGTGTAGTAAGAAGAGACCGCAGTAACGATAGTTGCCTCTTTGAGCATCGCAACTGCTAAATCCATCTGCTCTTTTGGATTGCCGATATTGGCACCGAGTGCAACTACTGCCTTCATCGAGTAATAGTGACCGAGATATCGGAAGCTTGTACAGAAATAGGTGCCTTAGGTTTATGTACCGTCACAGCAATATTTGAAATTTCTGGATGTCCGCTCTTGATTCGATCTGCAATACGCCCTGCAAGTCTTTCAATCAGTTGAACTCGCTCACCTGTGATCTCTTCAACCACGATGTCAGCAAGTGCTCCGTAATCAATTGTGTCCGCTAAATCATCACTCACACTTGCACGTGATAGATCTAAGTGAATTTCTAGATCAACAAAGAAATCTTGACCATTCTTTGCTTCATGATCGAAAACTCCGTGATATCCAAAACCCCAAATACCTGTGAGTGAGATGACATCACTCATGAGCGAGCTGCTTTCACATTTGCCTTCACGCCATGAACTCTAACTGCCCAGATGCCCTTGCCGACTAGTGATTGGGTTACTGCAACAGTGGCTTCCTCCCGCTCATCAGGATGCTCTCCCCCTAAGAAACGTTTACGTGAGTGGCCAATCAATACTGGATAGCCAAGTGCCACAAATTCATCAATCCGGTTAAGGACTTCCCAGTTGTGTTCAGCATTTTTTGCAAAGCCAATACCTGGATCCAAGATGATGTTCTCGCGCTTGACGCCTGCTGCAAGTGCTTTATCAAGCTGCAGCGTCACTTCTTCAATGACTTCAGCAACAACATCGCCATAAATAGCCTTCTCATTCATATCTTTTGAATGTCCACGCCAATGCATCAGCGTGTATTTACATCCCAGCTGAGCAACTGTTGAAAACATGTCGGGGTCTGCGGCCCCTCCGCTGACATCGTTAACGATGCTAGCTCCAGCTTCAACTGCAAGCTTGGCCGTTGTTGCTCGCATTGTGTCGATGCTGATTACAACGCCCTTTTTTGCAAGCTCGCGAATGACAGGGATAACTCGTGCTTGCTCTTCTTCTTCTGAAATTCGATCGGCACCTGGACGAGTGGATTCACCGCCCACATCAATGATGTCAACACCATCTTCAATCATTTCAAGGCCATGTGTGATCGCTAGTGATTCTTCATAATGCAGGCCACCATCGGCAAAAGAATCTGGAGTGACGTTAAGAATGCCCATCACCAACATGAGAATTTATCGATTCGTAATTAGGCTGATTGCTTCAGCGCGAGTAGCTGCATCGCGTAGAACACCGCGAACAGCCGATGTTGTTGTGCGCGCCCTGGATTTACGGACTCCACGCATCGACATACATAAATGTTCGCAATCGATGATGACAATGACGCCCATAGGTTTGAGAATCTCAACCAAGGCATCTGCAATTTGAGTTGTTAAACGCTCCTGCACTTGAGGGCGACGAGCAAAGAGATCAACAAGACGTGCAACTTTGGATAGACCTGTGATCTTGCCGCTCGGGATATATCCCACATGTGCAACGCCATGAAAAGGTGTGAGGTGATGTTCGCAATGAGAAAAGACTTCAATATCTCGAATGATCACAAGCTCTTCATGTCCGATGTCAAAGGTAGTTGTTAATACATCTTCTGGCTTCTGCCATAGTCCTTCAAAGTTCTCTTTAAATGCACGAGCCACTCGTGCTGGAGTCTCTTTCAGACCTTCGCGTTCAGGATCTTCACCGAGCGCTAAGAGAAGTTCGCGCACAGCCTTTTCTGCGCGCTCCTGATCATAGGGATGCGCAGCACGTCCATCGCCAGGCCCCATGGAAACGGAGGAAATATCAGTCACTCGAGGTTGCCTTCTTCACGCGGCGTGATTTGCGAGGAGCATCTGTAGGAATTGCTCGTTCTGGAATATCAACAGGTGGCCGTGCTGAAGGTAAACGATTCTCTGATCCTGTCCATGCTGGGCGCTTTGGCCATGACTTCACCTTGGCAAAGATTGCAGCAATCTCTTCCTTGTTGAGAGTTTCCTTTTCAAGAAGTTCTAGAACCATTTCATCAAGGATGGTGCGGTTTGCTTCAAGGATGTCATACGCCTCTTGATGCGCAGTCTCAATGAGTTTACGAATCTCACGATCGACAATTGCTGCAACGTTTTCAGAGTAATCACGTTGATGTCCATAATCACGGCCCATAAATGGCTCTGATGCATCTGTGCCAAGTTTGATTGCACCAATTGCCTCAGTCATTCCATATTGAGTCACCATGGCACGAGCAAGAGCGGTTGCCTTCTCAATATCGTTAGAGGCTCCCGTTGATGGATCATGGAAGATGAGTTCTTCTGCAGCGCGTCCACCCAGTGAATATGCCAACTGATCAAGGAGCTGGTTGCGAGTTGTTGAGTATTTATCTTCATCAGGAAGAACCATTGTGTAACCAAGTGCGCGACCACGAGGCATGATGGTGATCTTATGAACAGGATCGGTATGAGGAAGTGCATAGGCAACGAGAGCATGTCCTGCCTCGTGATACGCAGTAACACGACGCTCTTCTTCAGACATCAAACGTGATTTACGTTGCGGTCCAGCCATCACGCGATCGATAGCTTCATCAATCTGTGTATTGGTAATTGTCTTTTGACCTTCACGAGCAGTAAGTAGTGCTGCCTCATTCAATACATTTGCAAGGTCTGCACCAGTAAATCCTGGTGTACGACGGGCGTATGTAACGAGTTCGACATCCTTTGATAGCGGTTTACCCTTTGCATGCACTTTAAGAATATCTTCACGGCCCTTGAGATCTGGACGTTCAACAGGAATCTGACGATCAAAGCGACCTGGACGTAGAAGTGCGGGATCTAAAACATCAGGTCGGTTGGTTGCTGCAATCAAAATAACTTGGCCATTTGCTTCAAAGCCATCCATCTCAACAAGGAGTTGGTTCAGTGTTTGTTCGCGTTCATCATGACCGCCACCCATACCTGCTCCGCGCTGCCGACCGACTGCATCAATTTCATCTACGAAGACAATCGCAGGCGAATTGGCTTTAGCTTGAGTAAAGAGATCGCGCACGCGGGCTGCGCCCACTCCAACAAACATTTCAACAAAGTCAGAACCTGAAATTGAATAGAAAGGAACTTTTGCTTCACCAGCAACTGCGCGAGCAAGAAGAGTTTTACCTGTTCCTGGAGGGCCGTAGAGAAGAACGCCTTTAGGAATTTTTGCACCAAGGAGTGCGTACTTCGCAGGATCAGCTAAGAAATCTTTAATCTCTTCAAGTTCTGCAATCGCTTCATCCGCACCTGCAACATCATCAAAGGTATTTTGTGGAACATCGCTATCTTGTAGCTTTGCGCGAGACTTACCAAATGAGAAGACACGATTTCCACCTTGAGCATTGCTCATCATGAGGAAGAACAAGAAGCCAATAATCAGAATTGGGCCAAAGGTGAAGAAGAAAGTGGTTAAGAATGATTGCGATGGAACGCTAACGCTCCAACCCTTTGTAGGAGGGTTTGCAGTCAGAGCATCAATTAAATTCGGCTCTTGGCGCGCAATATAAGAAGCTTCAACTTTTGTTGCGCCCTTAATTGTGTTTCCACTATTGAGAATTAGACGAATCTTCTGTGATTTATCAACAAGGACTGCCGACTCAACTTGAGCCCTAGAAATAGCATCGATTGCTTGAGAAGTTTTGATCTCTGTATAGCGATTGGCAGCATTGGTAATCTGTCCAAAGATTGTGACACCGAAAATTGCAACAATGATCCAGAACAGCGGGCCACGGAAAATCTTCTGTGAACGAGTAAGAGGAGCCTTCTTGCCACCATCTTTATTCGTCGGCTTAAGAGCCTTCTTCGCTGAGTTCTTCTTCAATTTACTAAGTGGGTTCTGGGAAGACATGAAATTCCTTATGAGTACATGTGCTTTGCAAGCACACCGATAAATGAGAGGTTGCGGTACTTCTCGTCGAAATCAAGTCCATAACCCACGACGAATTCCTTAGGAATATCAAAGCCAACGTATTTCACATCAACATCAACCTTTGCAGCTTCTGGCTTACGAAGGATTGCAAGAATTTCAACAGATGCTGCACCGCGAGAGTAAAGGTTTGATTTAAGCCAAGAAAGTGTCAATCCGGTATCGACGATATCTTCAACAATCAGAACATGGCGACCGGTGATGTCACGATCGAGATCTTTAAGAATGCGAACCACGCCACTTGATTTAGTTCCTGAACCGTAGGAAGAAACAGCCATCCAATCCATCTCGATATGAGTCTGCATTGCGCGTGTTAAATCAGCCATCGCCATGATTGCGCCTTTGAGCACGCCAACAAGAAGTACATTCTTATCTTTGTAATCTGCATCGACGAGCGCTGCGAGTTCTGCAATCTTTGCTGCAAGTTGATCTTCTGTTGCTATTACCTTTTCAACATCGGTTCCGACTGCTGCTAAATCCACGTGGTCTGCTCCTTGATACCTAGGGCTGGGCTAAGAGCGAAAGTCTGCCCGAAATTCGCTCAACCTTCACACCACCCGGAAGGCTCACCACCCCTTGACCATGCCAAGAAGTGACCAGCGCCTCAACTGCGGCAAGGTGATCGGCTGTAATTGAGCCTGAAGGGGCGCCAGCGGCATAGAGAGCAGCTCTCAGGACTCGAGAGCGGATTGCTCGAGCCAGGCTCGCTAAGTGATCGCACTCAAGATTTGCGAGGTCAGATGATGAGATTTCACTCTGTGCAATCTCATCGAGGGCATCAGCATCATCACGCAAAATCGATGCACTGCGAGCAAGTGCTGCTGCAATTCCAGGTCCTAACTTCTCCTCCATCGCTGGAAGAACTTCGTTACGAACTCTGACGCGCGAAAATTCAGTATTTCCATTATGCGGATCATTCCAAGGTTCAATATCTAATTCTCTACATGCAGCAACAGTTTCTTCTCGAGTGATCTGTAATAGTGGACGAAGATACATTCCATTTTCTACTGACATTCCAGACAGTGAGCGAGTTCCAGAACCACGTGCCAGTCCTAACAGAACGGTTTCTGCTTGATCATCTCGTGTGTGACCTAAGAAGACTTTCGTTGCTTTCTCTTGCGCAGCGCACGCACTAAGAGCTTGATAGCGTGCATCACGAGCGCCAGCTTCGAGGCCGGACTCAGTAGTAACCACAACTTTCTTAGTAATAACTTTGCCATAACCCATCTCCTTCAATTGCTTCTCAACTTTTTCTGCCTGTGCACCCGAACCACTTTGTAGTTGATGATCGATAGTCACAGCAATTGTTGTAATCGCACATACTTTTGACTCTGTGAGAATTGCAGATGCAAGTGCAAGTGAATCTGCACCACCAGAGACAGCAACCAGCACAACATCGCCGGCTTCAAGCCGAGCAAGGTGAGGTTTTACAGCGTTGCGAATTGCAACGATGGCATCAGTCATAGGTGAAGGTTAGACCCGACCACCAAATGGCATAAGCCCCTTAACGCTATAAATACTTGAGTAGAGCAAGCCCTTATCCTTGGAGTTGGCCTCCCACATCATTCCTCCACCTGCATAAATGGTGATGTGGTGAATTGTTGAGATTGTGCCCTTGTAGGAATAGAAGAGAAGATCGCCAGGCTGAAGTTCAGTCAGCGCAACATGCTTTGTATAGCCTGAATACAACGCAGAGTTAAGTCGATCCCAATTTGGCCAACCAAGACCTGCTGACTTATAAGCAGCATAAACAAGACCTGAACAGTCAAATGAATTTGGTCCTTCAGAACCCCAGATATATGGCTTACGTGCTTGAACCTGCTTCTTAGCAAAGGCAACCGCAGCTAAACGTTGCGCCTCAGTTGTTCTAATTGTTGTGCGACCCTTGAAGCCAATATTCGGCCAAACTTTCGCTTGGTTGATCGTGGTAGCCGCAGTAGAAGCTTGGCTCTCTTCAAGCAACGCCAACTGACGTTGTTGTTCTAGGGTCACACGAACATTACGTGCAGTAGCAAGTTCTTTCATCAACTTATCTTGAACTGCTCGAAGTTTATTGACTTCCTTTTGTTGAAGAGCTTGCGCTGCATCCGCAATTTTCTTTGTCGCTTCAACTTTAGCAGTGGCAACTACTTGGATTGCCTTTGCTTCATCAGCTTTCTTCTTTGCAGCTTTTGCAACAATCTCTGCAGCCTTATAGCGGTCAAGAGCGGTGGTGTTTTGTGCACCTAACGTGTTGAGAGTGGAGAGTTGATCAATGAGATCTTGTGGTCCATTAGAACTCAGTAAGGGTTGGATATCACTCATACTTCCACCGAGGATGTATGCATTGGCTGCAAGTTTTCCGATAACTCGGTGGGCTTCTGCGACTGCAGCTGCTGTCTCAGCAGCATGTTTAGCAGCCGCAATAGCTTGTGCTGTAGCAACTTCGAGTTCTCTCTTCGCTTTGAGATAAACAGCTTGCGCAGCATTGGCTCTGGCAGTTAATTGTTTAAGAGTGAGATTTGCTGCAGCTAATTTCTTTGCTGCAGCATCTGCTGCAGCTTTCTTGGCAGCTTCTGCCTGTTTAGCCGCTTCAATCTCAGCAAGTGTTGGTTTTGGCTTTGCGATGGCAGGGGTGGCCGCAAGAGTCAGGCTTGCAATGATGGCAAGGCACATCAAGGATTTTCTGCGGCGCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP016772|1205627:1215035|1208851_1209607_-|WP_095676657.1|DBSCAN-SWA MLVMGILNVTPDSFADGGLHYEESLAITHGLEMIEDGVDIIDVGGESTRPGADRISEEEEQARVIPVIRELAKKGVVISIDTMRATTAKLAVEAGASIVNDVSGGAADPDMFSTVAQLGCKYTLMHWRGHSKDMNEKAIYGDVVAEVIEEVTLQLDKALAAGVKRENIILDPGIGFAKNAEHNWEVLNRIDEFVALGYPVLIGHSRKRFLGGEHPDEREEATVAVTQSLVGKGIWAVRVHGVKANVKAARS >NZ_CP016772|1205627:1215035|1209613_1210201_-|WP_095676658.1|DBSCAN-SWA MGPGDGRAAHPYDQERAEKAVRELLLALGEDPEREGLKETPARVARAFKENFEGLWQKPEDVLTTTFDIGHEELVIIRDIEVFSHCEHHLTPFHGVAHVGYIPSGKITGLSKVARLVDLFARRPQVQERLTTQIADALVEILKPMGVIVIIDCEHLCMSMRGVRKSRARTTTSAVRGVLRDAATRAEAISLITNR >NZ_CP016772|1205627:1215035|1208498_1208855_-|WP_095692830.1|DBSCAN-SWA MSDVISLTGIWGFGYHGVFDHEAKNGQDFFVDLEIHLDLSRASVSDDLADTIDYGALADIVVEEITGERVQLIERLAGRIADRIKSGHPEISNIAVTVHKPKAPISVQASDISVTITR >NZ_CP016772|1205627:1215035|1208055_1208502_-|WP_095692829.1|DBSCAN-SWA MKAVVALGANIGNPKEQMDLAVAMLKEATIVTAVSSYYTTKPVGGPEQPDYINAVCILESDLPAIDLLSLLHGIEKSLGRERIEKWGPRIIDLDLIQYGTLLSSAKELELPHPRAFERRFVLEPWHEIEPDALLLTHGKISELLAQLS >NZ_CP016772|1205627:1215035|1207234_1208005_-|WP_095692828.1|DBSCAN-SWA MKVVTDARQLPSACAFVPTMGALHAGHASLFALAKKYSSDVVASIFINPLQFENKEDLAQYPRTPDIDIAIADAAGVTHLWLPNQAEIYPDGYVKKSAGPLGDIYEGKSRPGHFDGVLTVVHQFFSLLKPKVAIFGEKDFQQLQLIRTIAEGIEIVAAPTIREVDGLALSSRNVRLSDEGRNAAQVIYRALTKAKSENELREILASESSLTVDYADFIDEKTFDHAQSDTREVRAIVAGWINGVRLIDNMPMGNRE >NZ_CP016772|1205627:1215035|1212289_1212841_-|WP_095676660.1|DBSCAN-SWA MDLAAVGTDVEKVIATEDQLAAKIAELAALVDADYKDKNVLLVGVLKGAIMAMADLTRAMQTHIEMDWMAVSSYGSGTKSSGVVRILKDLDRDITGRHVLIVEDIVDTGLTLSWLKSNLYSRGAASVEILAILRKPEAAKVDVDVKYVGFDIPKEFVVGYGLDFDEKYRNLSFIGVLAKHMYS >NZ_CP016772|1205627:1215035|1212860_1213823_-|WP_095692832.1|tRNA|DBSCAN-SWA MTDAIVAIRNAVKPHLARLEAGDVVLVAVSGGADSLALASAILTESKVCAITTIAVTIDHQLQSGSGAQAEKVEKQLKEMGYGKVITKKVVVTTESGLEAGARDARYQALSACAAQEKATKVFLGHTRDDQAETVLLGLARGSGTRSLSGMSVENGMYLRPLLQITREETVAACRELDIEPWNDPHNGNTEFSRVRVRNEVLPAMEEKLGPGIAAALARSASILRDDADALDEIAQSEISSSDLANLECDHLASLARAIRSRVLRAALYAAGAPSGSITADHLAAVEALVTSWHGQGVVSLPGGVKVERISGRLSLLAQP >NZ_CP016772|1205627:1215035|1205627_1207238_-|WP_095692827.1|DBSCAN-SWA MKLLAPTPGWTATADVIVVGSGIAGLTTALQCRTYGLSVLLVTKARVDEGSTKWAQGGIAAALGDGDSPEAHEKDTLVAGAGLCDVDAVRVLVTEGPEAVRKLIERGAIFDKEDTGEIALTREGGHLRNRILHAGGDATGAEVSRALLAAVQNDPEIEVVEHALVLDALKNDIGEVCGVTLHVIGAGSRDGVGRGLARAVVLATGGLGQVFAQTTNPSVSTGDGVALALRAGAKVADVEFIQFHPTVLWLGDNSEGQQPLISEAVRGEGAYLVDDKGVRFMQDIHPLAELAPRDVVAISIMRVMNTTGAHHVWLDVRHLEGFKERFPTIYASCISHGIDPLKDLIPVAPASHYASGGVRVDLNGRTSVNGLYACGETACSGVHGANRLASNSLLEGLVFSARIAADIAENLPEKSEPVVNASQAILLDPQVRHDIQVSMSRGAGVLRSSDSLLKTSSDLTRIEDRKSTQPCVEAWETTNLFQLAQTIVKAALIRQETRGSHWREDFPETSDLWKKRIVQQVDQAGYWTSGYQEVGK >NZ_CP016772|1205627:1215035|1213832_1215035_-|WP_095676662.1|DBSCAN-SWA MRRRKSLMCLAIIASLTLAATPAIAKPKPTLAEIEAAKQAEAAKKAAADAAAKKLAAANLTLKQLTARANAAQAVYLKAKRELEVATAQAIAAAKHAAETAAAVAEAHRVIGKLAANAYILGGSMSDIQPLLSSNGPQDLIDQLSTLNTLGAQNTTALDRYKAAEIVAKAAKKKADEAKAIQVVATAKVEATKKIADAAQALQQKEVNKLRAVQDKLMKELATARNVRVTLEQQRQLALLEESQASTAATTINQAKVWPNIGFKGRTTIRTTEAQRLAAVAFAKKQVQARKPYIWGSEGPNSFDCSGLVYAAYKSAGLGWPNWDRLNSALYSGYTKHVALTELQPGDLLFYSYKGTISTIHHITIYAGGGMMWEANSKDKGLLYSSIYSVKGLMPFGGRV >NZ_CP016772|1205627:1215035|1210217_1212281_-|WP_095692831.1|protease|DBSCAN-SWA MSSQNPLSKLKKNSAKKALKPTNKDGGKKAPLTRSQKIFRGPLFWIIVAIFGVTIFGQITNAANRYTEIKTSQAIDAISRAQVESAVLVDKSQKIRLILNSGNTIKGATKVEASYIARQEPNLIDALTANPPTKGWSVSVPSQSFLTTFFFTFGPILIIGFLFFLMMSNAQGGNRVFSFGKSRAKLQDSDVPQNTFDDVAGADEAIAELEEIKDFLADPAKYALLGAKIPKGVLLYGPPGTGKTLLARAVAGEAKVPFYSISGSDFVEMFVGVGAARVRDLFTQAKANSPAIVFVDEIDAVGRQRGAGMGGGHDEREQTLNQLLVEMDGFEANGQVILIAATNRPDVLDPALLRPGRFDRQIPVERPDLKGREDILKVHAKGKPLSKDVELVTYARRTPGFTGADLANVLNEAALLTAREGQKTITNTQIDEAIDRVMAGPQRKSRLMSEEERRVTAYHEAGHALVAYALPHTDPVHKITIMPRGRALGYTMVLPDEDKYSTTRNQLLDQLAYSLGGRAAEELIFHDPSTGASNDIEKATALARAMVTQYGMTEAIGAIKLGTDASEPFMGRDYGHQRDYSENVAAIVDREIRKLIETAHQEAYDILEANRTILDEMVLELLEKETLNKEEIAAIFAKVKSWPKRPAWTGSENRLPSARPPVDIPERAIPTDAPRKSRRVKKATSSD |
10 | Acanthocystis_turfacea_Chlorella_virus(16.67%) | tRNA,protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|