Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP054036 | Rhizobium sp. JKLM13E plasmid pPR13E05, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP054033 | Rhizobium sp. JKLM13E plasmid pPR13E02, complete sequence | 0 crisprs | csa3,DEDDh | 0 | 0 | 0 | 0 |
NZ_CP054034 | Rhizobium sp. JKLM13E plasmid pPR13E03, complete sequence | 0 crisprs | NA | 0 | 0 | 24 | 0 |
NZ_CP054032 | Rhizobium sp. JKLM13E plasmid pPR13E01, complete sequence | 0 crisprs | csa3 | 0 | 0 | 0 | 0 |
NZ_CP054031 | Rhizobium sp. JKLM13E chromosome, complete genome | 3 crisprs | csa3,cas3,DEDDh,WYL | 0 | 1 | 7 | 0 |
NZ_CP054035 | Rhizobium sp. JKLM13E plasmid pPR13E04, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP054031_1 | 829135-829220 | Orphan |
NA
Consensus repeat of NZ_CP054031_1
|
1 spacers
spacers of NZ_CP054031_1
>1.1|829158|40|NZ_CP054031|CRISPRCasFinder CGCCAAGGCCGAAGCAACTGACGCCGGTGAAGGCGAAGAA |
CRISPR arrays and Neighbor proteins around NZ_CP054031_1
The CRISPR arrays of NZ_CP054031_1 >merge|NZ_CP054031|1|829135-829220|CRISPRCasFinder GCAGCACCGAAGAAGAAGGCTGCCGCCAAGGCCGAAGCAACTGACGCCGGTGAAGGCGAAGAAGCCGCACCGAAGAAGAAGGCTGC >NZ_CP054031|1|1|829135-829220|CRISPRCasFinder GCAGCACCGAAGAAGAAGGCTGC CGCCAAGGCCGAAGCAACTGACGCCGGTGAAGGCGAAGAA GCCGCACCGAAGAAGAAGGCTGC
>NZ_CP054031.1|WP_138329802.1|826743_827334_+|GDYXXLXY-domain-containing-protein MNSFMAKLQSGKGYLLSAVIVAGLQTLILGTIIQSRASILSDGAEVLLKTAPVDPRDFLRGDYVVLNYDISSVPVQTVSGGIPAEAGERVLWVRLKKQEDGFWTVTESSFHELSPQPETVILRSQPFYSGGLAAGDSMRVEYGIERYYVPEGEGKPIEEARNDGNVAIAARVSPDGSAQIRSLLVDGKPVYDEPLY >NZ_CP054031.1|WP_138329804.1|825643_826747_+|DUF2157-domain-containing-protein MYRGRLERDLSLWVGKGLLGQETAGALLAEYDSRPASFSLGRVLMALAAVLLAAAILLVVASNWEAIPRLVRVGGILALIWVVHIAAARMLARGATAAAGGLLVIGAMSFGGAISLVGQMYHLSGDEQTVMYLWFAIATVSAILFRSAAVTVVAGFLSWASFAVYLENNDTHWIGLDPWMAPVMAVIVIGLVRYTGAERARHLAYLLLIGWLAWLYTLYEEIAVALAFAIGGMAAFVLTALPPRPIASLVRTAGAAPAFYSFLVAVIGFLLLHIEIEDGWGLVVLGVATLAASVLAIVLRGRDNGAVRYLAYATFAAEMLYLASVTVGSILGTSSLFLFSGLVVALVAWMVIRLERRFSANAQGERA >NZ_CP054031.1|WP_003539019.1|824628_824775_-|DUF1127-domain-containing-protein MNFSRSFNNWRKYRQTVTELGRMTNRELHDLGIDRSDIHRVAREASHR >NZ_CP054031.1|WP_003539017.1|824190_824334_-|DUF1127-domain-containing-protein MNPIRIARSWLSYRRTLNELGGLSNQTLSDIGVSRYDIRNIASRSFR >NZ_CP054031.1|WP_138329806.1|822663_824097_-|methylenetetrahydrofolate--tRNA-(uracil(54)--C(5))-methyltransferase-(FADH(2)-oxidizing)-TrmFO MNTISSHSPIHVVGGGLAGSEAAWQIASSGVPVILHEMRGVRGTDAHKTDGLAELVCSNSFRSDDATSNAVGVIHAEMRMAGSLIMAAADRCQVPAGGALAVDRDGFSEAVTKAVHDHPLITVVREEVTGLPPKDWDLAIVATGPLTAPSLASAIQTETGEDSLAFFDAIAPIVYRESIDMDICWYQSRYDKVGPGGTGKDYINCPMDEAQYNAFVDALIAGDTVGFKEWEGTPYFDGCLPIEVMAERGRETLRHGPMKPMGLTNAHNPTVKAYAVVQLRQDNALGTLYNMVGFQTKLKYGAQADIFRMIPGLENAEFARLGGLHRNTYINSPTLLDPSLTLKSRPGLRFAGQITGCEGYVESASVGLMAGRFAAAERKGEAISLPPATTALGSLLGHITGGHLVTDEEPGKRSFQPMNINFGLFPELQPGSIVKPEGVKRFRGKDKTIMKRQLIARRALADCAAWLGQESTLAESA >NZ_CP054031.1|WP_065279733.1|822092_822560_+|hypothetical-protein MGIIVWCVLFAIVIFFAFVATRMAAQNKEDGLHDTGLAIIEFGRAFPNEAIRQLQATENGQAVFVRLHDNKAGFMRNMSRHFSCHVIEPGRVRVVGSETGRGLVIDFLDAPHHNGDFEFASAKEASEVSLWLLGNYIAEPDKDLPPGNISAANKQ >NZ_CP054031.1|WP_138329808.1|821163_822021_+|phytoene/squalene-synthase-family-protein MTEAHIATNQDICLVMLRDSDRDRYLACLLSPEEKRGALAALYAFNAELARIRDLVHEPLPGEVRLQYWHDLLEGSAHGSTAANPVAAALLIAIETHRLPRKTLIDMIEARTFDLYDDPMETRLSLEGYAGETASALIQLASLVLSPEEAARSADAAGHAGVAQAVAGLLLLMPLHRRRGQIYIPLQILSATGLDRDAFLAGEDRPRISAAIEAFAGLGREHLAKARAAGPIAPAVFPAFLPATLAEPVLIRAQKRGALLLDRPLQPPQWRRQLRMGLAAARRKI >NZ_CP054031.1|WP_003539009.1|820774_821161_+|membrane-protein MAKGIEIRAAHFPGRAPIDAYGNGGFRFADMSHRGSILCLPSGIHGWDMDMSKPLSPENFRRVLDEAADIEVLLVGTGTELRRLPEELRLALKSRGISSDPMSTGAAVRTFNIMLAEQRAVAAALIAV >NZ_CP054031.1|WP_138329809.1|818229_820770_+|protein-translocase-subunit-SecDF MLHFSRWKTLLIWLAAFAAIVIAAPNLLTEAQRSSLPDWLRHDRVVLGLDLQGGSHIVLKVERSDIVRDRLEEVVANVRNALRGAGIRYTGLTGNDQTVTVRITDAAQTQAAVDLLKPLTTAGGHSGPDVALQQGDQGQLSLQISDAGITADVASARTRSLDIVGRRIAGLGNDNFLVRPDGADRIVVQVLGSIDAERLKNILNQPAKLSFHLIDESMSGQEALNGRWPTTSQVLYSLDDPPVPYLVDRTAFITGSNMVDIEPVVDQQTQDTSIAYRLDAEGTQRLAQATGQNIGKHLAIVFDDQVMSSPVIDAAITGGEGRISANFSEDGVRDLAIMLRAGALPATLTSVEERSVSPRFGADSIFNGLVAGLVAVVLVAALMIALYRILGIIAVASLFLNLILIVAVLSLIGATLTLPGIAGIVLIVGMAVDSNVLIYERIREEEKTTHSFAEAVGRGFSRAFATIVDANVTIFIAAIILFFLGSESIRGFAVTLAVGILTTVFTAFTLTRSIVAVWLRRRHPRHLPKSVLTHLFEHANIRFMGIRRYVFTASAVISLIAMAAFATVGLHLGIDFTGGSLIEVTAKQGNADIADLSSRLNDLNLGDVSVERTGGPSNARIRIASQGGGENAEQSAATLVRGELQEDYDFRRVEVVGPAISGELTMMATLGVLAALAAILIYIWIRFEWQFAVGAIIATLHDVIIMLGLFVLTGIEFNLTSIAAVLTIVGYSLNDTVVVYDRMRENLKRYKKMPLPILIDASINQTLSRTVLTAATTLIALLALFLFGGEVIRSFTFAMLFGVALGTFSSIYIAAPVLIVFRLRPEAPDGEESNKTDAGVKSGTVV >NZ_CP054031.1|WP_003539006.1|817836_818184_+|preprotein-translocase-subunit-YajC MFITPAFAQSATDTATGFGGSGFEMIILFVPLMVVWYFLLIRPQRAQAKKREETLKAIRRGDQVVTGGGLVGKVTKVIDEKEVEVEIADGVRVRIVRSGISEIRVKGEPVKADAA >NZ_CP054031.1|WP_138329798.1|829434_829857_+|nuclear-transport-factor-2-family-protein MTAKHDMKKLVEEIYAVRDRGDVEATLALIGEHCTFRMVGNTRLAPFSTESSGDSFRQAITQLITVWDLSNIRTAGIYVDEDEHMVFAHREGEVRHIPSGVSFHTEFVDKIHFRDGKPVKIVEFVDTLQVAETTKMIQVA >NZ_CP054031.1|WP_138329796.1|829976_831746_-|radical-SAM-protein MSHVLEVARRRFQLILIKPSHYDDDGYVIRWWRAMIPSNSLAALYGIAAECAERKVLGDDTAIDITVIDETNTRIDVAGLLAQFKRHDNFGMISLVGVQTNQYPRALDIARPFRDAGLPVSIGGFHVSGCLSMLDGKAVGLDACRDMGISMFAGEAEGRLDMVLRDAAAGELKPLYNFMNDLPGIGGTPVPFLPKDNIQRTLGLSTSFDAGRGCPYQCSFCTIINVQGRKSRFRSADDVEKLVRMNWAQGIHKFFITDDNFARNKDWEAIFDRLIELKERDGIPLGLMIQVDTLCHKIPNFIEKSRRAGVTRVFIGLENVNPDNLTAAKKNQNKITEYRKMLLAWKAQGIMTLAGYILGFPADTPESIRRDITIIQEELPLDVIEFFILTPLPGSEDHQVLWKKGVEMDADLNIYDVEHVCTAHPKMSKQEWEDIYHEAWALYYSPDHMKTLLRRAVATGVPLARLVKVLVSFATTVPLENVHPLQSGLLRLKTPSERRPDLPRENPLVFWPRFAWETFSKHASLAGTIIGLTISAFLISRDAKSKTYMDQALTPVADDEEETLHLFTQTAGGAAAVSHVRKVAQLTAH >NZ_CP054031.1|WP_173862933.1|831906_832716_-|endonuclease/exonuclease/phosphatase-family-protein MILRIISLNAWGGRLHQALIEYVTAADPDVLCLQEVLRAPGTHSGWSVYRDGDVELPQRFNLFTEISTAMPGHDGFFCPTSRGELFDGDTAIVAEFGLATFVRKTHSVIAQGLDFVHGRFSADGWGEHPRPRNAHCIRLFSHECASTVTIAHMHGLRDPAGKGDTAAREKQAAALVGLIERVWPGDEGLIVCGDFNVLPDSATFAILARLGLSDLVTGSGLVDTRTSYYLKQGRFADYMLVTPGVKVAKFEVVAAPEVSDHRALLLDIG >NZ_CP054031.1|WP_138329790.1|832874_834281_-|Si-specific-NAD(P)(+)-transhydrogenase MLQYDLVVVGSGPAGRRGAIQASKLGKKVLVIEQGKRVGGVSVHTGTIPSKTLRETALNLSGWRERGFYGRSYRVKEEISADDLRRRLLITLNHEVEVLEHQFARNRVQHIRGKASFIDASTLQVIKDDGETTQVTAASVLLAVGTKPFRPDYMPFDGKTVLDSDELLDIQDLPRSMVVIGAGVIGIEYATIFSALDTAVTVIDPKTTMLDFIDKEIIEDFTYQLRDRNMKLLLGQKADKVERLENGKVELTLDSGRRLTTDMVLFAAGRMGATDALNLQAIGLEADSRGRLKVNPETFQTSVANIYAAGDVVGFPSLASTSMEQGRIAARVAVGAVAKEPPKYFPYGIYAVPEISTCGLTEEEMKERGIPYECGIARFRETSRGHIMGLDTGLLKLIFSLKTRRLLGVHIVGEGATELVHIGQAVLNLKGTVEYFVENTFNYPTLAEAYKIAGLDAWNRMGDIKSEL >NZ_CP054031.1|WP_138329788.1|834500_835688_-|alpha-hydroxy-acid-oxidizing-protein MTIRSGNTDRSTRLCRDMLCLDDFEIKARRHLPKPLFGYVAGATETNASLRHNAEAFQAYAFRPRVLRDVSKRSTETSLFGKTHAAPFGIAPMGISALMAYRGDIVLAQGADQSGIPMIISGSSLIPLEEIAAVSPQAWFQAYLPGEPDRIDALIDRVGAAGLQTLLLTVDTATLPNRENNVRAGFSTPLRPGLRLAWQGISHPRWTTDTFLRTIVRHGIPHFENSYATRGAPIISSNVTRDFGRRDHLNWNHLERIRNRWSGKLVVKGIMHPDDAARAVDTGADGVIVSNHGGRQLDGTASPLQVLPEIASRVGDSVAVMVDGGFRRGTDIMKALALGACFVFVGRPFLYAAAVAGLPGVLKAADILKMELHSNMALLGVTKVGDISADYITRA >NZ_CP054031.1|WP_062944698.1|836697_837534_-|type-I-methionyl-aminopeptidase MVNYIEASTAPPKNTGAIRLYDAQAFEGMRKACQLTARCLDALADIVKPGLLTDEIDRFVFDFGMDHGAYPATLNYRGYTKSTCTSINHVVCHGIPNDKPLRDGDIVNIDVTFVLDGWHGDSSRMYPVGVVKRAAERLLEVTYESLMRGIAAVRPGARTGAIGEAIQTYAEAERCSVVRDFCGHGVGRLFHDSPNILHYGRANEGPELREGMIFTIEPMINLGRPHVKVLADGWTAVTRDRSLSAQYEHTVGVTSDGCEIFTLSPGGLDRPGLPSHNG >NZ_CP054031.1|WP_065279742.1|837782_838442_+|TetR/AcrR-family-transcriptional-regulator MLNEAVNLENSPGTGEGPERRVRDRGATERAILAAAKGLLAEEGFQNFGINAVARRAGCDKQLIYRYYGGLDGLVEAIGADLGTWVKDRIPEDAGGMFLLTYGDLMERLSLLLLDALRNDPLMRRILAWEISENTEQVRRLSEARSKALALWLERMRGSLAPPKGVDAAAVNAVVIAAIQHLVLAAAAGGQCAGLSLKTPKDWEKAATALKRIVRGVYG >NZ_CP054031.1|WP_171598891.1|838569_838743_+|hypothetical-protein MGTKIAKAAFLVTAMMFFAVLAVDIAIPTLVLCIMASSTWLVSLDLPVARKHGGEAA >NZ_CP054031.1|WP_138329787.1|838739_839072_+|hypothetical-protein MKWFLIFWAGPIVFLGGWYWLSYYDMSFGIFMLSRQVHDLTFELYGKALGIPPETIPPLVARAIAVDSLVVFAILAFRKRKSIIGWWRARQALNSSPSDLPSKESLSSAP >NZ_CP054031.1|WP_138329785.1|839008_839503_-|Holliday-junction-resolvase-RuvX MTVLTIEEMAATLAPRQAIAGLDLGTKTIGLSMSDLGRRFATPRTVIRRVKFTIDAQALLDFATVEKVAGFIIGLPMNMDGSAGPRVQATRAFVRNMEQKTALPFVYWDERLSTVAAERTLLEMDVSRAKRAERIDSAAASFILQGALDRLSLLGRSDGDEFSA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP054031_2 | 1264718-1264860 | Orphan |
NA
Consensus repeat of NZ_CP054031_2
|
1 spacers
spacers of NZ_CP054031_2
>2.1|1264765|49|NZ_CP054031|CRISPRCasFinder CCGATGTCGAAGCCCGCTCCTGTCCACTACAGCAGCCGCAAGGCCCCGG |
CRISPR arrays and Neighbor proteins around NZ_CP054031_2
The CRISPR arrays of NZ_CP054031_2 >merge|NZ_CP054031|2|1264718-1264860|CRISPRCasFinder CCTATTCCAGCGATTACACCAACGACAGCGACGGCTTCGCCCCGAGCCCGATGTCGAAGCCCGCTCCTGTCCACTACAGCAGCCGCAAGGCCCCGGCCTATTCCAGCGACTACACCAACGACAGCGACGGCTTCGCCCCGAGC >NZ_CP054031|2|2|1264718-1264860|CRISPRCasFinder CCTATTCCAGCGATTACACCAACGACAGCGACGGCTTCGCCCCGAGC CCGATGTCGAAGCCCGCTCCTGTCCACTACAGCAGCCGCAAGGCCCCGG CCTATTCCAGCGACTACACCAACGACAGCGACGGCTTCGCCCCGAGC
>NZ_CP054031.1|WP_138329363.1|1263807_1264326_+|BON-domain-containing-protein MARIGDKTQFSREKPELSREEDYRDLEERNLDEGWPYADGSGADTGGPDNRPYGETTANFDSDPNKGFRIDGTDEDGNENRLKDSLRADTIDRGESDDLEARVNDNLENIPEVDIDSIEVHADGHVVTLKGSVETIGIARKVELGALSVDGVHHVRNKLQLTGVDAHIPNED >NZ_CP054031.1|WP_138329364.1|1263229_1263724_+|peptide-deformylase MPIRPILRYPHPGLKTVCAPVTVFDSSLTALAEDLLATMRAAPGVGITAAHIGVFSRVTVLELDKADGVRLYVNPHITWFSKEMMNHTEGSVSMPGATDEITRPRAIRFRYQDTEGAVHEDGAEDFLAICIQHEVDQLDGIFWLQRLSRLKRDRLVKKWEKAQG >NZ_CP054031.1|WP_138329365.1|1261571_1262987_-|magnesium-transporter MTTTDGEDRIRRRPDDEEADIYDEDGNVRGDFLALVGAAIADRDTLFLRQNVARLHESEIGDLLESIQPDQRLALVRLLGDDFDMTALTEVDEAIRREIVDQMPNAQIAAAIGELDSDDAVYILEDLDKEDREEILAQLPFTERVRLRRALDYPESSAGRRMQTEFVAVPPFWTVGQTIDYMRDEEDLPYSFSQIFVIDPTFKLLGAVDLDQILRTKRQTKIEQIMRETNHPVPAEMDQEEAAQLFEQYDLLSAAVVDENGRLVGVLTIDDVVDVIHEEADEDIKRLGGVGDEELSDNVLSTVRSRFLWLLINLGTAMLSASVIGLFDGSIEKMIALAVLMPIVASMGGNAGTQTMTVTVRALATRDLDIYNAGRIIRREAGVGILNGVVFATIMGAIAGTWFHDYQLGGVIAAAMMINLIAAALAGILLPLLLDKMGADPAIASSVFVTTVTDCTGFFAFLGIATWWFGI >NZ_CP054031.1|WP_011652104.1|1261114_1261507_-|MerR-family-DNA-binding-transcriptional-regulator MPVNKYYSITELTREFGVSTRTLRFYEDEGLIHPERRGRTRLFRQADRRLIGEILRGRRIGFTIAEIREIIQVYKEPPGELGQLKLLMKRVDEKRDDLRQKRKDIDDTLAELDAIEETCLGRLAEIGVTT >NZ_CP054031.1|WP_138329366.1|1260100_1261090_+|DUF1624-domain-containing-protein MTLPAREADAAAKPPRIGLLDTARGVALIAMASYHFSWDMEFMGYLAPGTAETGWLKIYARAIATTFLFIVGISLVLSSKPEIRWPSFWKRFGMIAAAAAVISIATRIAMPNEWIYFGILHCIAVLTLIGVVFLRLPLAFTLIATLALFAAWITDNFGPPGLLRSSFFDPRYLAWIGLAVAPERSNDYVPLFPWATPFFAGLSIASIALRTRLLHRLAAVGTGSWWPARLGRHSLAFYLIHQPILIAIAYGISLVVAPQAPDPVATYLRQCNASCVMQQGEALCHSFCQCTLGKLQAQALFTPLQEGAIDIQNDERVQTIAAECSAEAE >NZ_CP054031.1|WP_003539916.1|1259336_1260101_+|DUF599-domain-containing-protein MTTADYIALAFFAFVWMGYSWLLHGRTFFGRTSLTHAMTERRREWIYNSLRRDLKMIDTQIMAGLQNGTAFFASTSIFAIGSCFALLGATEKVDAVFADLPFVFHGGHAVFEMKVGGLAALFGYAFFKFGWSYRLFNYCTILFGSIPMMRDTERDIIAAERAAERVIRMNVIAGSNFNEGLRAIFLSIGYLGWFINPYVFMLTTAIVIFVLTRRQFFSQARLAIMDTGPPSNLHLSAIRRDRPSSDGNDLSEGL >NZ_CP054031.1|WP_138329367.1|1258649_1259099_+|ribose-5-phosphate-isomerase-B MPATNRIALSSDHAAIQLRQAIAGHIAAQGWIAVDIGPTTPESTHYPKHGEAAARLVASGDCRFGIILCGTGQGIMMAANKVKGIRCGVCVDTFSARMIRQHNDANMLSIGVRVVGEGLALDIVDAFLTARFEGGRHATRVGMIEALEG >NZ_CP054031.1|WP_138329368.1|1257285_1258485_-|acyltransferase MSRPDYIPSLDGLRGVAALLVVGAHVGLVFPITAPHLVTMGDEAVGLFFALSGFLMAHLYGSRPVTRENVLDFLVSRFARIYPVYLVAVVLVAMLSSMQNLDFVQPIAGGTDFVRHVFLLGSSGVFWSIPPEIQFYLLFPVLWLCLAQPHRYSGMIVGLTVAVVVDGLVELPGPGIVLVSKLPYFLFGALAGIMHSYWNSWIPSALTGISTLFLLAVFFTYRHIFPGFSPEFWSLQSAVAAAVIVGLVARQPPIATRVLAAAPVRFFGKISFSLYLFHVPTMFLARLTFDALMPEPALIVVTLCVAVVGAWFIHETIEVPGRRLLVLIWQDNRWRLVSRETPADQMDRAILDLQEIEKRLLRGATSTMDDQRQISTTPEPRDGADGTVIQDERDEKRRA >NZ_CP054031.1|WP_105006097.1|1256846_1257284_+|VOC-family-protein MALKRMDNVGIVVDDLEETIDFFRDLGLELEGRAMIEGEWAGRVTGLGDQHVEIAMMRTPDGHSRLELSRFLRPPAVADHRNAPVNALGYLRVMFALDDIDETLERLSKRGAQLVGEVVDYQDTYRLCYIRGPGGLLIGLAQELS >NZ_CP054031.1|WP_138329369.1|1255089_1256493_-|L-serine-ammonia-lyase MFLSVFDVFKIGVGPSSSHTMGPMSAANRFLELILSNEWPRPSSGAQVTAIKVSLHGSLAHTGIGHGTGRAVILGLMGEAPDSVDPDRMDGIVDTVERSGRITPEGHPAYQFQPKIDLIFDKKQPLLGHANGMVFSAYDRDGRLLVKRIYYSVGGGFVVTDTELEQMRAKKNAPGGTRVPYPFSTAKQMLEMAERSGLSIAQMKRANEESQRSQEALDQGLDRIWEAMRSCIERGLKVEGIMPGGLNVKRRARRIHDKLEEEWRSNRINPLLANDWLSVYAMAVNEENAAGGRVVTAPTNGAAGVIPATIRYYEHFHEDWDQNGIRDYLLTAAAIGGIIKHNASISGAEVGCQGEVGSAAAMAAAGLAAVMGGTPEQIENAAEIALEHHLGMTCDPIAGLVQVPCIERNALGAVKAVTAASLAIKGDGQHFVPLDACIETMRQTGHDMSEKYKETSTGGLAVNVVEC >NZ_CP054031.1|WP_062944327.1|1265108_1265828_-|lipoyl(octanoyl)-transferase-LipB MAMLRTDLEFSMLPQLGTRPVRWRIADGLVPYEEAVETMEREVAIIADGGDELVWLVEHPPLYTAGTSANARDLVQPNRFPVFATGRGGEYTYHGPGQRVAYVMLDLKRRRQDVRAFVAALEDVVIRTLDMMNVRGERREDRVGVWVRRLEKPLLADGTMAEDKIAALGIRLRKWVTFHGLSLNVDPDLDHFDGIVPCGISAYGVTSLVDLGLPVMMADVDIRLRAAFEAVFGETTGET >NZ_CP054031.1|WP_171598747.1|1266191_1267142_+|EamA-family-transporter MASSSQESAAMSEPHQNSLQGMAIMSGAMLILPIMDAIAKYMATFEAMSPGQVTFYRFFFQIACTLPILFALFGLKALSAQRPWMNLLRGALHGGASLLFFVAVKYMPLADVFAIYFVEPFMLTALSALFLGEKVGWRRWTAIVVGFGGAMIVIQPSYEIFGLKALLPVACAFLFSLYLFLNRAIGEADSPLTMQTMAGIGGTVFMAAALFLGNSSGNADFAVSLPSSSLGLVLLLALGSISGYAHMLVVRAFRLAPLSLLAPFQYFEIISATVLGYALFNDFPSFSKWIGIFIIVASGLFIIWRERLQAQSLRSS >NZ_CP054031.1|WP_064648181.1|1267239_1267944_+|hypothetical-protein MSEFRLAFPACVVAGKHRLTAEDIVLLRKHSFPEGIRTSDDVVAMLALNNSCPEKCADWNAFFVEQLAGFIVHYTYPQGSLDEINVAWIMRMFTTDGVVNSALELELILHVMEISADVPVELRALALDQLRLAITDNIGGYKLSRAIDRRGITRQDIDYAMRIFRSVAEGGTIPVSSVEYGVLQQIEQATLRGANHPHWAGIMAAVELRDYAEPRRSRWLRIVDEEPVAEAAVA >NZ_CP054031.1|WP_138329360.1|1268019_1270041_+|acetyl/propionyl/methylcrotonyl-CoA-carboxylase-subunit-alpha MFKKILIANRGEIACRVIKTARRMGILTVAVYSDADRDALHVEMADEAVHIGPAAASESYLVAEKIIAACKATGAEAVHPGYGFLSERASFCAELEKQGIIFIGPKPKAIMAMGDKIESKKFANAAGVSTVPGHLGIIEDAAHAEVISGGIGYPVMIKASAGGGGKGMRIAWNEAEVRDGFERARSEAKSSFGDDRVFIEKFVVEPRHIEIQVLADAHGNIVYLGERECSIQRRNQKVAEEAPSPFLDEATRKAMGEQSVALARAVDYQSAGTVEFIVDRDRNFYFLEMNTRLQVEHPVTELITGIDLVEQMIRVAAGEPLPFAQEDIRLDGWAVESRLYAEDPYRNFLPSIGRLTRYRPPAEGRTGNVVVRNDTGVFEGAEISMYYDPMIAKLCTWAPTRLEAIEAMGQALDGFVVDGIEHNVPFLSALMKHPRWREGRLSTGFIAEEYPDGFAPMKPDRAEEAVLAGIALSASLIETNRRERFADRLRAAAGALREDWVVKIGDNHVTARLLDGLVTIPFDMDIAIDGAIEGGSQNVVTDWRPGDPVWNGKVGGRDITAQIRPVLNGLRIDWQGLSVTTKVFSPRHAELDRLMPVKLPPDTSKLLLCPMPGLVVAIAVAEGQEVKAGETLAIVEAMKMENVLRADRDLVVSKINAAAGESLAVDAVIMQFA >NZ_CP054031.1|WP_138329359.1|1270176_1270650_+|GNAT-family-N-acetyltransferase MAPTTYTTDIKGLDEFSARELYDLLRMRVDVFVVEQNCPYPELDGKDIDALHIRLLEGAELLASARILKPHEPQDASKIGRVVVSPAHRGKRLGDALMSEAITACEQLYPANAIALSAQAHLRRFYESFGFIGTSQEYLEDGIPHIDMVRQQATQPA >NZ_CP054031.1|WP_003539940.1|1270660_1271128_+|YaiI/YqxD-family-protein MIYVDADACPVKPEVLKVAERHGLEVTFVANSGLRPSRDPMIHNVIVSNAFDAADNWIAERAGAGDVVVTADVPLAVRCVATGAFVSGPTGRVFDETNIGMASAMRDLGAHLRETGESKGYNAAFSPKDRSRFLETFDRLCRRAKSLAAEAGGQP >NZ_CP054031.1|WP_027667501.1|1271296_1271611_+|quaternary-ammonium-compound-efflux-SMR-transporter-SugE MAWFLLFLAGLFECGWAIGLKYTEGFTRPMPTALTVISMVISIVLLGLAVKHLPIGTAYAVWTGIGTVGTVFLGIWLLGDEASVSRLACITLIVAGIAGLKLTA >NZ_CP054031.1|WP_003539944.1|1271653_1272046_-|DoxX-family-protein MSTFDRLSAYQPYGLAALRIITALLFIEHGTMKLFGFPASQMSGSLPPLMLFAALLELVGGILILVGLLTRPVAFLLAGEMAVAYFMAHAPSSFFPAVNQGDAAILFCFVFLYLVFSGPGAFAVDNRKTA >NZ_CP054031.1|WP_062944333.1|1272185_1273718_-|acyl-CoA-carboxylase-subunit-beta MKEILEELERRRDVARLGGGEARIAAQHKRGKLTARERIDLFLDEGSFEEFDMFVEHRSTDFGMDKSRIAGDGVVTGWGTVNGRTVFVFAKDFTVFGGSLSEAHAEKIMKVQDMALKNRAPIVGIYDAGGARIQEGVAALGGYAEVFQRNVLASGVIPQISVIMGPCAGGDVYSPAMTDFIFMVRDTSYMFVTGPDVVKTVTNETVTAEELGGAVVHTVRSSIADGAYENDVETLLQVRRLIDFLPLSNTAPLPEIECYQSVTDVDASLDTLVPASSNKPYDIKELIRKVADEGDFFEIQASFAKNIVCGFGRVEGSTVGFVANQPMVLAGVLDSDASRKAARFVRFCDCFNIPIVTFVDVPGFLPGTAQEYGGLIKHGAKLLFAYAEATVPKLTVITRKAFGGAYDVMASKHLRGDLNYAWPTAQIAVMGAKGAVEIIFRKDIADPEKIAAHTKMYEDRFLSPFVAAERGYVDEVIMPHSTRRRLARGLKMLRNKDLANPWKKHDNIPL >NZ_CP054031.1|WP_138329358.1|1273767_1274226_-|hypothetical-protein MNWLRSCFCRATLVCVALTACGSLSATKPAAPVYDVRSAVVLSGPNMPAELLSGINDRVNAAINATVRDTVLPRVVLTIRVVSVQKGLGFQKDRNVAKISIDAASVEDGSVIAVSAFDVTSIAADPKLADEILAEDAAARIRSVFSLSGRAR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP054031_3 | 4538823-4538905 | Orphan |
NA
Consensus repeat of NZ_CP054031_3
|
1 spacers
spacers of NZ_CP054031_3
>3.1|4538848|33|NZ_CP054031|CRISPRCasFinder TCGCGGTTTTGCGATAACGACATGCGCAAAAAT |
CRISPR arrays and Neighbor proteins around NZ_CP054031_3
The CRISPR arrays of NZ_CP054031_3 >merge|NZ_CP054031|3|4538823-4538905|CRISPRCasFinder CTAAAGCATGTCGCGCAAAAGTGTGTCGCGGTTTTGCGATAACGACATGCGCAAAAATCTAAAGCATGTCGCGCAAAACTGTG >NZ_CP054031|3|3|4538823-4538905|CRISPRCasFinder CTAAAGCATGTCGCGCAAAAGTGTG TCGCGGTTTTGCGATAACGACATGCGCAAAAAT CTAAAGCATGTCGCGCAAAACTGTG
>NZ_CP054031.1|WP_138328942.1|4538428_4538728_+|DUF1127-domain-containing-protein MRMTDRTIELDCGKLTPTFPQRLAAGLAPLTSLFRGFRNRMEINTLHDLSDAQLRDIGLSRADLTSAFLASTFFEDPSEHLTRSARNRWRLSLFRSYEQ >NZ_CP054031.1|WP_171599065.1|4537258_4538227_-|LysR-family-transcriptional-regulator MGDNMSAPLDLDQLQTFIAIVDSGSFTKAADRVYKTQSAVSMQMRRLEERIGKQLFIKDGRGNRLTVEGEKLLNYARRIIRLNNEAIAAFDDNRLEGTLRIGTPDDYADRYMPEIIGRFAKTHPNVELYIVCEPSVDLAERMHKGELDIALVTHNPRERMSDVVRTEPLCWVGSANHPIRDDAPVPLAVGRRDCQWRQLACSALDAVGREHQILFTSWSCTVVAAAVLAGMAVSVMPESALRTGMKVLSQADGFPALPPVQIGIMKRPGVSLSLMNAITAHITACLDNITPTVVDDSLEADVKSTQARLYPRLKAANMVPSW >NZ_CP054031.1|WP_171599490.1|4536574_4537252_+|TetR-family-transcriptional-regulator MPSGANKKNGNDMGAESMDRIGLRRKPKQERSIQRLDLILAAAAKIIAEKGVSAMRMTELAIAAKVPIGSVYQYFPEKAAIVKALFDQHASAIQAKMAAMFADVQSLDQAQDVVCATIDWYYDSYHNDPVYLGVWMGTETDQDLLQLNIEHSGHVAGIFHDAVRQLAPDLCDEEMYARTYLFSHLIGAVIRLAAVSEEALARRMLDEWKRVIRASLFATTLPRAA >NZ_CP054031.1|WP_138328945.1|4533303_4536396_+|autotransporter-domain-containing-protein MGERMCGAAIHGMAMLRLAGLASTAALVLAVQPGWAQVITGNDTEIVDGNDPGGAGPGTQPSPWTINTDLLVGDQNGDDAALIIRNGGAVSNDIGVLGVDPGASGTVTVTGAGSTWTNADDLYIGNEGTGTLTIENGGAVDNINGYIGYFSAAASGTVTVTGSGSTWTNSQDLTIGDSGAGTLIISNGGTVNSASGYISNDSTAIGEVIVTGPNSVWSNSTEVSVGTAGAGSLTISNGGSVTGSLGYVGFSTNGSGVVSVTGTGSSWISSSALFVGEFGSGSVTVENGGMISASGVVIADDSGATGTVELVGSAGSGRGVLETGYIDRGAGDAGLVFDGGILRATGNEANFLGGFNAGDVMLDAGGAFIDTNGFAIGIGTDLQGAGGMTKQGAGTLTLSGASSYGGVTTVEAGTLQAGSAGAFVQNGAYAVNGGILDLGGFDLAMGELSGSGGEIAIGGAELTLDQTSNTTYGGIFSGSGDVTMLGSGTLRLTGNSSGYAGTTTVSDGRLIVNGSLGGILMMTGGTLAGSGHIGTVTAGAGVTIAPGNSIGTLTIGGNLTLNPSSTYEVEVDPAGIASDLISVTGIAFLNGASVAHVGMNGDYQPFSTYTILTAAGGINGTFGAVTSDYAFLSANLSYDPNNVYLEIERNDVRFSDMARSRNQMAAAEAAENLGTGNDIYDAIVTLPDDEPLIQASYDALSGEIHASIKTALITQSLFVREAANERLRSAFGDAGAGVIPIQAFWPGGPELVAADPSDAPVFWSTAFGGASETRTDGNAATLNHQTGGFLAGVDAMFDDVRLGLMAGYSNSRFDPRHRSSSGSSDDYHLGLYAGTQWGGLVFRSGIAHTWHEIETSRSVFIGSFEDRLEASYNAGTLQAFAELGYQFDTAAASFEPFVNLAHVGIRTAGFTEEGGAAALHSSSRMTNTTITTLGLHAEMEIRLGETNATLRGMSGWRHAAGDIVPLSTQAFAGGDAFTVAGVPVAENALVLDAGLDFDLTGSAILGIAYSGQIADNAQQHGVKAALSVKF >NZ_CP054031.1|WP_138328946.1|4532599_4533016_+|MAPEG-family-protein MTGYQIFWPLLAHVALVYGLYALLGLRRAKMLRAGSIAKSDYRENRDEPAESLAVKNCLANQFELPVLFHVCCILLYIADADNIVTVGLAWIFVALRYAHAAIHVTSNDLRYRSPIFAAGYLVLAAMWVWLAAWMVMS >NZ_CP054031.1|WP_138328947.1|4531882_4532437_+|hypothetical-protein MKAGSGIPLWIVALLAALCLAVLAWTTFGFVVPFKHETGQAVLDTYFAGYDESAVFHMQKLLDENETATGLLRAMYFGPELIFPALLTALLFLAFLKLGPVGSWFGQSVHPLIGKAVYLLPFIYGIADYGENISSLMAFGNGAPSTVATQLLPWMTRLKFASLAICFILIARFAIARWLSPRQD >NZ_CP054031.1|WP_128440017.1|4530395_4531880_+|IMP-dehydrogenase MARIIETATGADALTFDDVLLQPGHSEVMPGQTNISTRIARDFELNIPIISSAMDTVTESRLAIAMAQAGGLGVIHRNLTPIEQAEQVRQVKKFESGMVVNPVTIGPDATLADALGLMKSYSISGIPVVEKSGRLVGILTNRDVRFASDQEQKIHELMTKDNLVTVKENVDQQEAKRLLHSHRIEKLLVVDTEGRCVGLITVKDIEKSQLNPNASKDAQGRLRAAAAISVGDDGFERAERLIEAGVDLLVVDTAHGHSQRVLDAVTRVKKLSNSVRIMAGNVATYDGTRALIDAGADAVKVGIGPGSICTTRIVAGVGVPQLAAIMSAVQAANDQDVPVIADGGIKFSGDLAKAIAAGASAVMIGSLLAGTDESPGEVYLYQGRSFKAYRGMGSVGAMARGSADRYFQAEVRDTLKLVPEGIEGQVPYKGPVSGVIHQLAGGLKAAMGYVGGKDLKDFQERATFVRISGAGLRESHAHDVTITRESPNYPGAGL >NZ_CP054031.1|WP_138328948.1|4529124_4530117_+|MBL-fold-metallo-hydrolase MAISSGIKRRTLFAAGLGLLAAPVILREKAEAAATGESNHMDGTLLPEINQFKLGSYKFTVVRDGTSIVEKPYETFGTNQDPETVKALLTANFLPADKFMNGYMPALIDTGSDVILVDTGFGAGGRARGAGQLTEGLKAAGYSADDVTLVALTHLHGDHIGGLMEDGAAAFKNARYVVGQAEYDFWSDKAREGTPAEGGHKAVLANVVPLAEKTTFLKEGDSFAPGITAMLAAGHSPGHMVFHVESEGKRLVLTGDTANHYILSLLRPDWEVRFDMDKTQAAATRRKVFEMIASEKIAFLGYHMPFPAVGFVERQDGGYRFVPKTYQFDI >NZ_CP054031.1|WP_138328949.1|4527448_4528873_-|DHA2-family-efflux-MFS-transporter-permease-subunit MNRIVPLILAVALFMEQMDSTVIATALPAIAADLHVGPITLKLALTSYMVALAVFIPVSGWMADRFGAKKIFRLAISVFVIGSILCAISSNLVEFVLARFLQGMGGAMMTPVGRLVLVRTTKRSDLVSAMALLTIPALVGPLTGPPLGGFITTYFSWHWIFLINVPVGIIGIWLATIFLPEVEATAPPRLDFTGFVLTSLSAAGVVFGLSVVSLPALPPIIGITATLIGVLCGFLYMRHAKRHPAPILNLNLFKNPTFRTSTLGGTLFRICVGAMPFLTPLMLQLGFGLTPFQSGLITFAGAIGAITTKFMAKRVFAAAGFRTTLLGAAMVTTLVTVVTGLFTPSTPHLVIIGVLLLGGFSRSFMFTGVNALAFADIDDAEASQATSMSSVMQQISLALGVALAAAILESSIYFRGEALQVADFHLAFYIIAGLTVIATIPFIRMDRNAGALVSGHRLKTMTAATVEAEQPAVK >NZ_CP054031.1|WP_138328950.1|4526954_4527338_+|VOC-family-protein MLLYVTLGTNDLYRAGHFYDAVLSPLGYRRQRSSEEEIGYAAEGDTRCRFWVVTPFNHRRATSGNGAMVALAAETRADVDAFHAAAIAAGAVDEGEPGLRSYHAHFYAAYVRDLDGNKLSAVCENPQ >NZ_CP054031.1|WP_138328941.1|4538992_4539859_-|DUF937-domain-containing-protein MLPLFDMMMQAQNGAAMEAIARQFNLAQEQATKAMAALMPAFSAGLKRSTSNPYDFVGLMQAVSSGNYARYFEDMNRAFTPEGISDGNNILAQLFGSKEVSRAVAAQAAQMTGIGQDIYKRMLPVLADTLMGGLFKQTTGQMASPVNPFVNTAMNETIQKWLESTGFAPKPKTAPEPSIFDNPFTQAMQLMFSVPKPEATSQPNPFLDNPFAKAFQEMMGGLGQQPAAPTKTKTPEAPKEEAKLNADSYTEMLNAMFDSGLEVQKNYQRNLEAIFETYRPKPSSDKGE >NZ_CP054031.1|WP_094086673.1|4540034_4540364_-|MurR/RpiR-family-transcriptional-regulator MNSEENLKRPTDLTELKAMIVKTGLTLPEQQERAARMALTNPELIAFGTVASVALKCSVSPSTVVRVATALGFASFRDFKTFFRQHLKTSRLAEQSINTARHAPPIPNR >NZ_CP054031.1|WP_094086672.1|4540420_4541440_-|Gfo/Idh/MocA-family-oxidoreductase MPANKPVNLGLIGAGRIGSFHGETVSRRLVNANLVAIADPAPGAAARLGETLDVETAYTGVAELLSHPGLDAVIIATPARFHTNILVQAAEAGKAIFCEKPMALTLEDADRAIAAARAANVPLQVGFNRRWDQAFAEGRAAIDEGKIGAPQLIRSLTRDPGPFGADPDRIPLWTIFYETLIHDFDTLLWLNPGAKPVEVHAMADALIRPDAREKGFLDTAVVTIRFDNGSIAVAEANFSALYGYDIRGEVFGSGGMVTMGDVRRSSMTLFDGNGVSNDTWRRDTEHFVHGYTAQLASFVRAVRNRELPKGPTGTDARNALAIALACIASVSKKQAIPLD >NZ_CP054031.1|WP_138328940.1|4541469_4542264_-|hydroxypyruvate-isomerase-family-protein MSAAKTSPFPLAACAEMLWRDKPIEWRAARLKEMGFGVGLWNWPEHDLAKLEATGATFTIMNGYLTGRLTDDEGAAELLRTARQTAEIGKRLGVQRLNLHGTGLGDGGLPVQPVEVVTGAMWLKARDTLLRIVDMAEEENVTFTLENLNLPVDHPGVPFGRAEDTLALVSSVDHPRLRLNLDLYHAQIGEGNLIELCRKCLPWIGEIQVADVPGRCEPGTGEINWRGIARALNAMGYSGPVGMEAWSAGDPEAALEAFRTAFTL >NZ_CP054031.1|WP_029872562.1|4542341_4543130_-|sugar-ABC-transporter-ATP-binding-protein MSMSQTPLVEVKNLVKHFGSIIALNGVSLTVNANEVLCLLGDNGAGKSTLINTLAGVFKPTSGEFLVEGQPRAFAGPRDALDAGIATVYQDLAMIPLMSVTRNFFMGREPLKGFGPFKRMDMEHANNVTRDEMHRIGIDVRDPEQAVGTLSGGERQCVAIARAVYFGAKVLILDEPTSALGVAQTSMVLKYINQVRGKGLGVIFITHNVRHAYAVGNRFTALNRGRTLGTYSKSEIGIEDLQNLMAGGKELQTLSEELGGTI >NZ_CP054031.1|WP_097620998.1|4543129_4544251_-|ABC-transporter-permease MTLANEQNGAATRSDERLKKVSGLTALLRRPELGAVAGLVLVTIFFFFTANPAMFTLAGVINFMAPAAQLGILAIGAALLMIGGEFDLSIGSMVAFAGLVFGTALVVLHLPLSVAILVALAFAAVIGVTNAQITMRTGLPSFIVTLAFLFILRGLTLVGLKWASGGATQLRGMKEAAAGSPLAPFFSGDAFIGLFSWLAGHGIIETFPNGTPKVPGIPVEILWFFGIAVLATWVLLRTRFGNWIFAAGGDPRAARKSGVPVARVKTTLFVITALCATLVAILTVLDAGSTDARRGFQKEFEAIIAAVIGGCLLTGGYGSAIGAFFGSIVFGMVLIGLTYTSIDQDWYLVFLGGMLLIAVIFNNVIRKRVTGER >NZ_CP054031.1|WP_138328939.1|4544356_4545307_-|sugar-ABC-transporter-substrate-binding-protein MTSILKKLAFGALALGVATTAAVGAKAQEPTIIAVTHGQASDPFWSIVKNGMMQAAKDSNVKVDYRAPETFDMVAMAQLIEAAVNQSPAGIVISNPDPDALGSAIEKAVAAGIPVISMNSGISAAEKLGIKLHVGQDELPAGVKVGAKLKSLGLKHVLCVNQEVGNAALDQRCAGTEKGFEGGTVTVLPTTADPAEIEAKIQAALTSDPSIDVVLGLSAPLVGERAVAVVEKMGAGDKVKVASYDLSAGFLQAVADGKALFAVDQQPYLQGYLPVTFLALNARYGTIPAGNVASGPSFVEKDAAASVIEKSSQGIR >NZ_CP054031.1|WP_138328938.1|4545489_4546485_+|LacI-family-DNA-binding-transcriptional-regulator MRKRATAKEVADAAGVSKWTVIRAFTPGASITEESRRRVLEAAAVLNYTPNLLARSLATNLTHQVAVFVDDFANPQKLPFLETLTERLQAEGLVAVLININNHFDHVHALLNADQRQVDAVILFGTAFRNETLSDRQLGRGMPPMFVLARDSQIDGVPAVVCDAELALRDIVDHLYERGYRRPGFMTGAPALSTALRRRQHFIDFWKGKGVDEIVILSADRYSAEAGAGSVRHYLNDIDPAARVDVLMCENDILALGAMDEIRGNFGLRIPDDIAVVGFDNYELGGASCYGLTTYEQPRIEMVQAVIGMIKGRLEPETVTLPGKLVVRTST >NZ_CP054031.1|WP_138328937.1|4546560_4547472_+|myo-inosose-2-dehydratase MTGHITALPVGVRLAVSPLSWANDVLEDLGADISLETCLRDAAESGYEGIELGRKFPREAGVLRSLLDSHRLALASGWHSGELAERSVDDEMKAVAGHAALLRAMDCKVMVYGEVAMMTPGSPLDAPMSQRLRMPAAEVAGYAGRLTDFAGRLATEYGLTLAYHHHLMMVAETFDEISAIFDKTGSEAGLLLDTGHAVAGGFDYARLIDRFGDRIVHIHLKDIRGPVIDEVRAGDMSFNAGVRGGMFTVPGDGIIDFKRLARFVRDSGYRGWLVVEAEQDPAVAEPRPAVERAFKHVQANFRS >NZ_CP054031.1|WP_138328936.1|4547538_4548423_+|hypothetical-protein MAETKSPADHNEAPPSEDGVKPGADISFSYDRMTVEQFRRRFPRARWSDARKAWFVPGRTASRRIGRWLAEMEAEADAHADAKGRDAFAFDPIDSSYLELGKAGFRIRTPYSKTVVDELREVPFSRWDGDLKIWHVPFRSYEELRRRWQEIEAAARRNEPEERRRRAEERKGTEQDIRSKLRSAERKRHRYPLQSDDLPPIGRPVVTAYGIVVFTEITGELVDPNVVTDFYAGVTEDHVWGYWRVPTLEELVRTWPAKMPPVQDAEWWLPTIEELRPARRTARLREARKRAKAP |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP050082 | Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b4, complete sequence | 104174-104206 | 3 | 0.909 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP049732 | Rhizobium leguminosarum strain A1 plasmid pRL11, complete sequence | 105276-105308 | 3 | 0.909 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP053208 | Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1B, complete sequence | 97279-97311 | 3 | 0.909 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP022565 | Rhizobium leguminosarum bv. viciae strain BIHB 1148 plasmid pSK01, complete sequence | 763189-763221 | 3 | 0.909 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP022565 | Rhizobium leguminosarum bv. viciae strain BIHB 1148 plasmid pSK01, complete sequence | 650622-650654 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP050092 | Rhizobium leguminosarum bv. trifolii strain 22B plasmid pRL22b6, complete sequence | 50138-50170 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP025013 | Rhizobium leguminosarum strain Norway plasmid pRLN1, complete sequence | 34932-34964 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP049731 | Rhizobium leguminosarum strain A1 plasmid pRL12, complete sequence | 322318-322350 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP054025 | Rhizobium sp. JKLM12A2 plasmid pPR12A204, complete sequence | 148064-148096 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NC_012848 | Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132501, complete sequence | 486308-486340 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP024314 | Rhizobium sp. NXC24 plasmid pRspNXC24c, complete sequence | 1988932-1988964 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP054034 | Rhizobium sp. JKLM13E plasmid pPR13E03, complete sequence | 111280-111312 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP022569 | Rhizobium leguminosarum bv. viciae strain BIHB 1148 plasmid pSK05, complete sequence | 219298-219330 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP053206 | Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1D, complete sequence | 322648-322680 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NC_011368 | Rhizobium leguminosarum bv. trifolii WSM2304 plasmid pRLG201, complete sequence | 1146199-1146231 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP048283 | Rhizobium leguminosarum bv. viciae 248 plasmid pRle248c, complete sequence | 205080-205112 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP016287 | Rhizobium leguminosarum strain Vaf10 plasmid unnamed1, complete sequence | 643872-643904 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP030764 | Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed4, complete sequence | 401197-401229 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP050106 | Rhizobium leguminosarum bv. trifolii strain 4B plasmid pRL4b5, complete sequence | 88373-88405 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NC_007764 | Rhizobium etli CFN 42 plasmid p42c, complete sequence | 232167-232199 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP050111 | Rhizobium leguminosarum bv. trifolii strain 3B plasmid pRL3b1, complete sequence | 88557-88589 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP030763 | Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed3, complete sequence | 534114-534146 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP032695 | Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525c, complete sequence | 1586539-1586571 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP049248 | Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed2, complete sequence | 94647-94679 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP050088 | Rhizobium leguminosarum bv. trifolii strain 23B plasmid pRL23b6, complete sequence | 213567-213599 | 4 | 0.879 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP049732 | Rhizobium leguminosarum strain A1 plasmid pRL11, complete sequence | 339724-339756 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP050092 | Rhizobium leguminosarum bv. trifolii strain 22B plasmid pRL22b6, complete sequence | 271201-271233 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP050092 | Rhizobium leguminosarum bv. trifolii strain 22B plasmid pRL22b6, complete sequence | 503340-503372 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP025013 | Rhizobium leguminosarum strain Norway plasmid pRLN1, complete sequence | 287487-287519 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP049731 | Rhizobium leguminosarum strain A1 plasmid pRL12, complete sequence | 34941-34973 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NC_012848 | Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132501, complete sequence | 680358-680390 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP048283 | Rhizobium leguminosarum bv. viciae 248 plasmid pRle248c, complete sequence | 115436-115468 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP050081 | Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b5, complete sequence | 34946-34978 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP016287 | Rhizobium leguminosarum strain Vaf10 plasmid unnamed1, complete sequence | 529444-529476 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_HG938356 | Neorhizobium galegae bv. officinalis bv. officinalis str. HAMBI 1141 plasmid pHAMBI1141a, complete sequence | 521785-521817 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP054022 | Rhizobium sp. JKLM12A2 plasmid pPR12A201, complete sequence | 109151-109183 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP054022 | Rhizobium sp. JKLM12A2 plasmid pPR12A201, complete sequence | 594234-594266 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP025014 | Rhizobium leguminosarum strain Norway plasmid pRLN2, complete sequence | 475796-475828 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NC_020062 | Rhizobium tropici CIAT 899 plasmid pRtrCIAT899c, complete sequence | 486570-486602 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP050094 | Rhizobium leguminosarum bv. trifolii strain 22B plasmid pRL22b4, complete sequence | 24348-24380 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP050098 | Rhizobium leguminosarum bv. trifolii strain 9B plasmid pRL9b4, complete sequence | 837677-837709 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP020900 | Rhizobium phaseoli Brasil 5 strain Bra5 plasmid pRphaBra5d, complete sequence | 937176-937208 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP053440 | Rhizobium leguminosarum bv. trifolii strain CC275e plasmid pRltCC275eF, complete sequence | 908326-908358 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP021028 | Rhizobium sp. TAL182 plasmid pRetTAL182d, complete sequence | 328931-328963 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP018229 | Rhizobium leguminosarum strain Vaf-108 plasmid unnamed1, complete sequence | 1284957-1284989 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013110 | Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence | 2176408-2176440 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NC_012858 | Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132502, complete sequence | 511638-511670 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013503 | Rhizobium esperanzae strain N561 plasmid pRspN561c, complete sequence | 322908-322940 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013509 | Rhizobium sp. N1341 plasmid pRspN1341d, complete sequence | 322908-322940 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013520 | Rhizobium sp. N113 plasmid pRspN113c, complete sequence | 322908-322940 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013493 | Rhizobium sp. N6212 plasmid pRspN6212c, complete sequence | 322908-322940 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP054029 | Rhizobium sp. JKLM19E plasmid pPR19E02, complete sequence | 55712-55744 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP024310 | Sinorhizobium fredii strain NXT3 plasmid pSfreNXT3c, complete sequence | 1698046-1698078 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP035000 | Rhizobium acidisoli strain FH23 plasmid pRapFH23b, complete sequence | 102043-102075 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP050083 | Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b3, complete sequence | 485248-485280 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP021033 | Rhizobium sp. NXC14 plasmid pRspNXC14c, complete sequence | 737331-737363 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP049733 | Rhizobium leguminosarum strain A1 plasmid pRL10, complete sequence | 472926-472958 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013526 | Rhizobium phaseoli strain R744 plasmid pRphaR744d, complete sequence | 648928-648960 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP018230 | Rhizobium leguminosarum strain Vaf-108 plasmid unnamed2, complete sequence | 512002-512034 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP016293 | Rhizobium leguminosarum strain Vaf10 plasmid unnamed5, complete sequence | 80596-80628 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP029452 | Sinorhizobium fredii CCBAU 25509 plasmid pSF25509b, complete sequence | 348772-348804 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013562 | Rhizobium phaseoli strain N841 plasmid pRphaN841e, complete sequence | 675696-675728 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013579 | Rhizobium phaseoli strain N671 plasmid pRphaN671e, complete sequence | 644854-644886 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013531 | Rhizobium phaseoli strain R723 plasmid pRphaR723d, complete sequence | 662036-662068 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013541 | Rhizobium phaseoli strain R630 plasmid pRphaR630d, complete sequence | 665437-665469 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013546 | Rhizobium phaseoli strain R620 plasmid pRphaR620d, complete sequence | 659276-659308 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013584 | Rhizobium phaseoli strain N261 plasmid pRphaN261d, complete sequence | 662036-662068 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013573 | Rhizobium phaseoli strain N771 plasmid pRphaN771e, complete sequence | 644854-644886 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013515 | Rhizobium sp. N1314 plasmid pRspN1314d, complete sequence | 371128-371160 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NC_010997 | Rhizobium etli CIAT 652 plasmid pC, complete sequence | 641116-641148 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013605 | Rhizobium sp. N731 plasmid pRspN731d, complete sequence | 371122-371154 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP053207 | Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1C, complete sequence | 501970-502002 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP029232 | Sinorhizobium fredii CCBAU 45436 plasmid pSF45436b, complete sequence | 179923-179955 | 5 | 0.848 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP022569 | Rhizobium leguminosarum bv. viciae strain BIHB 1148 plasmid pSK05, complete sequence | 20980-21012 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP021028 | Rhizobium sp. TAL182 plasmid pRetTAL182d, complete sequence | 440369-440401 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NC_012858 | Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132502, complete sequence | 26976-27008 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013503 | Rhizobium esperanzae strain N561 plasmid pRspN561c, complete sequence | 435236-435268 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013509 | Rhizobium sp. N1341 plasmid pRspN1341d, complete sequence | 435236-435268 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013520 | Rhizobium sp. N113 plasmid pRspN113c, complete sequence | 435236-435268 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013493 | Rhizobium sp. N6212 plasmid pRspN6212c, complete sequence | 435236-435268 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP049249 | Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed3, complete sequence | 1112792-1112824 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP006989 | Rhizobium sp. IE4771 plasmid pRetIE4771c, complete sequence | 400039-400071 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP020909 | Rhizobium etli strain NXC12 plasmid pRetNXC12c, complete sequence | 430263-430295 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP020952 | Rhizobium sp. CIAT894 plasmid pRheCIAT894e, complete sequence | 537958-537990 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP020952 | Rhizobium sp. CIAT894 plasmid pRheCIAT894e, complete sequence | 610828-610860 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013577 | Rhizobium phaseoli strain N671 plasmid pRphaN671c, complete sequence | 331925-331957 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013549 | Rhizobium phaseoli strain R611 plasmid pRetR611b, complete sequence | 332295-332327 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013534 | Rhizobium phaseoli strain R650 plasmid pRphaR650b, complete sequence | 332295-332327 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013545 | Rhizobium phaseoli strain R620 plasmid pRphaR620c, complete sequence | 332287-332319 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NC_021908 | Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1d, complete sequence | 436307-436339 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013571 | Rhizobium phaseoli strain N771 plasmid pRphaN771c, complete sequence | 331925-331957 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP021126 | Rhizobium sp. Kim5 plasmid pRetKim5b, complete sequence | 411507-411539 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP021373 | Rhizobium sp. ACO-34A plasmid pRACO34Ac, complete sequence | 110885-110917 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | CP007642 | Rhizobium etli bv. phaseoli str. IE4803 plasmid pRetIE4803a, complete sequence | 376655-376687 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP032685 | Rhizobium sp. CCGE531 plasmid pRCCGE531d, complete sequence | 307429-307461 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP032690 | Rhizobium sp. CCGE532 plasmid pRCCGE532d, complete sequence | 307428-307460 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP020951 | Rhizobium sp. CIAT894 plasmid pRheCIAT894d, complete sequence | 97290-97322 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP018233 | Rhizobium leguminosarum strain Vaf-108 plasmid unnamed5, complete sequence | 5235-5267 | 6 | 0.818 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP032695 | Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525c, complete sequence | 2127671-2127703 | 7 | 0.788 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP049248 | Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed2, complete sequence | 115954-115986 | 7 | 0.788 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP018229 | Rhizobium leguminosarum strain Vaf-108 plasmid unnamed1, complete sequence | 1304356-1304388 | 7 | 0.788 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP049249 | Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed3, complete sequence | 709940-709972 | 7 | 0.788 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP035000 | Rhizobium acidisoli strain FH23 plasmid pRapFH23b, complete sequence | 11130-11162 | 7 | 0.788 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013600 | Rhizobium sp. N741 plasmid pRspN741e, complete sequence | 643242-643274 | 7 | 0.788 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013504 | Rhizobium esperanzae strain N561 plasmid pRspN561d, complete sequence | 643243-643275 | 7 | 0.788 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013510 | Rhizobium sp. N1341 plasmid pRspN1341e, complete sequence | 641683-641715 | 7 | 0.788 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013521 | Rhizobium sp. N113 plasmid pRspN113d, complete sequence | 645930-645962 | 7 | 0.788 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013494 | Rhizobium sp. N6212 plasmid pRspN6212d, complete sequence | 632802-632834 | 7 | 0.788 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013594 | Rhizobium sp. N871 plasmid pRspN871d, complete sequence | 643243-643275 | 7 | 0.788 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP054028 | Rhizobium sp. JKLM19E plasmid pPR19E01, complete sequence | 849422-849454 | 7 | 0.788 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP022567 | Rhizobium leguminosarum bv. viciae strain BIHB 1148 plasmid pSK03, complete sequence | 104989-105021 | 8 | 0.758 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP054022 | Rhizobium sp. JKLM12A2 plasmid pPR12A201, complete sequence | 417412-417444 | 9 | 0.727 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP013054 | Sinorhizobium americanum CCGM7 plasmid C, complete sequence | 372852-372884 | 9 | 0.727 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP015737 | Shinella sp. HZN7 plasmid pShin-01, complete sequence | 353742-353774 | 9 | 0.727 |
NZ_CP054031_3 | 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder | 4538848-4538880 | 33 | NZ_CP048281 | Rhizobium leguminosarum bv. viciae 248 plasmid pRle248e, complete sequence | 302066-302098 | 9 | 0.727 |
1. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP050082 (Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b4, complete sequence) position: , mismatch: 3, identity: 0.909
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ccgcggttttgcgataacgacatgcgcaaaagc Protospacer .******************************..
2. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP049732 (Rhizobium leguminosarum strain A1 plasmid pRL11, complete sequence) position: , mismatch: 3, identity: 0.909
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ccgcggttttgcgataacgacatgcgcaaaagc Protospacer .******************************..
3. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP053208 (Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1B, complete sequence) position: , mismatch: 3, identity: 0.909
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ccgcggttttgcgataacgacatgcgcaaaagc Protospacer .******************************..
4. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP022565 (Rhizobium leguminosarum bv. viciae strain BIHB 1148 plasmid pSK01, complete sequence) position: , mismatch: 3, identity: 0.909
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ccgcggttttgcgataacgacatgcgtaaaaac Protospacer .*************************.*****.
5. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP022565 (Rhizobium leguminosarum bv. viciae strain BIHB 1148 plasmid pSK01, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacatgcgtaaaaac Protospacer . ************************.*****.
6. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP050092 (Rhizobium leguminosarum bv. trifolii strain 22B plasmid pRL22b6, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacatgcgtaaaaac Protospacer . ************************.*****.
7. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP025013 (Rhizobium leguminosarum strain Norway plasmid pRLN1, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacatgcgtaaaaac Protospacer . ************************.*****.
8. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP049731 (Rhizobium leguminosarum strain A1 plasmid pRL12, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacatgcggaaaaac Protospacer . ************************ *****.
9. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP054025 (Rhizobium sp. JKLM12A2 plasmid pPR12A204, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer aagcggttttgcgataacgacatgcgtaaaaac Protospacer ************************.*****.
10. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NC_012848 (Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132501, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cggcggttttgcgataacgacatgcgcaataac Protospacer . *************************** **.
11. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP024314 (Rhizobium sp. NXC24 plasmid pRspNXC24c, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgacaacgacatgcgcaaaaac Protospacer . ************.*****************.
12. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP054034 (Rhizobium sp. JKLM13E plasmid pPR13E03, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer aagcggttttgcgataacgacatgcgtaaaaac Protospacer ************************.*****.
13. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP022569 (Rhizobium leguminosarum bv. viciae strain BIHB 1148 plasmid pSK05, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacatgcgtaaaaac Protospacer . ************************.*****.
14. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP053206 (Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1D, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacatgcggaaaaac Protospacer . ************************ *****.
15. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NC_011368 (Rhizobium leguminosarum bv. trifolii WSM2304 plasmid pRLG201, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgaaaacgacatgcgcaaaaac Protospacer . ************ *****************.
16. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP048283 (Rhizobium leguminosarum bv. viciae 248 plasmid pRle248c, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacatgcgtaaaaac Protospacer . ************************.*****.
17. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP016287 (Rhizobium leguminosarum strain Vaf10 plasmid unnamed1, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ccgcggttttgcgataccgacatgcgtaaaaac Protospacer .*************** *********.*****.
18. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP030764 (Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed4, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ccgcggttttgcgacaacgacatgcgtaaaaac Protospacer .*************.***********.*****.
19. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP050106 (Rhizobium leguminosarum bv. trifolii strain 4B plasmid pRL4b5, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cggcggttttgcgataacgacatgcgtgaaaat Protospacer . ************************..*****
20. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NC_007764 (Rhizobium etli CFN 42 plasmid p42c, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer gcgcggttttgcgataacgacacgcgtaaaaac Protospacer *********************.***.*****.
21. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP050111 (Rhizobium leguminosarum bv. trifolii strain 3B plasmid pRL3b1, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cggcggttttgcgataacgacatgcgtgaaaat Protospacer . ************************..*****
22. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP030763 (Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed3, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cggcggttttgcgataacgacatgcgtgaaaat Protospacer . ************************..*****
23. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP032695 (Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525c, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ttgcggttttgcgataacgacatacgcaaaatg Protospacer *.*********************.*******
24. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP049248 (Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed2, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ccgcggttttgcgataacgacatgcgggaaaac Protospacer .************************* .****.
25. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP050088 (Rhizobium leguminosarum bv. trifolii strain 23B plasmid pRL23b6, complete sequence) position: , mismatch: 4, identity: 0.879
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cggcggttttgcgataacgacatgcgtgaaaat Protospacer . ************************..*****
26. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP049732 (Rhizobium leguminosarum strain A1 plasmid pRL11, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgctaacgacatgcgtaaaaac Protospacer . *********** ************.*****.
27. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP050092 (Rhizobium leguminosarum bv. trifolii strain 22B plasmid pRL22b6, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgagaacgacatgcgtaaaaaa Protospacer . ************ ***********.*****
28. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP050092 (Rhizobium leguminosarum bv. trifolii strain 22B plasmid pRL22b6, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgctaacgacatgcgtaaaaac Protospacer . *********** ************.*****.
29. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP025013 (Rhizobium leguminosarum strain Norway plasmid pRLN1, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgacaacgacatgcgtaaaaac Protospacer . ************.***********.*****.
30. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP049731 (Rhizobium leguminosarum strain A1 plasmid pRL12, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacatgcgtaaaaca Protospacer . ************************.****
31. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NC_012848 (Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132501, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgacaacgacatgcgtaaaaac Protospacer . ************.***********.*****.
32. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP048283 (Rhizobium leguminosarum bv. viciae 248 plasmid pRle248c, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgctaacgacatgcgtaaaaaa Protospacer . *********** ************.*****
33. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP050081 (Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b5, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacatgcgtaaaata Protospacer . ************************.****
34. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP016287 (Rhizobium leguminosarum strain Vaf10 plasmid unnamed1, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcagttttgcgataacgacatgcgtaaaaac Protospacer . **.*********************.*****.
35. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_HG938356 (Neorhizobium galegae bv. officinalis bv. officinalis str. HAMBI 1141 plasmid pHAMBI1141a, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacatgcgtaaaaca Protospacer . ************************.****
36. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP054022 (Rhizobium sp. JKLM12A2 plasmid pPR12A201, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacatgcgtaaaaca Protospacer . ************************.****
37. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP054022 (Rhizobium sp. JKLM12A2 plasmid pPR12A201, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcggtaacgacatgcggaaaact Protospacer . ***********.************ **** *
38. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP025014 (Rhizobium leguminosarum strain Norway plasmid pRLN2, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgatgacgacatgcgcaaaacc Protospacer . *************.*************** .
39. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NC_020062 (Rhizobium tropici CIAT 899 plasmid pRtrCIAT899c, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgacaacgacatgcgcaaaatg Protospacer . ************.****************
40. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP050094 (Rhizobium leguminosarum bv. trifolii strain 22B plasmid pRL22b4, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgacaacgacatgcggaaaaac Protospacer . ************.*********** *****.
41. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP050098 (Rhizobium leguminosarum bv. trifolii strain 9B plasmid pRL9b4, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgacaacgacatgcgcgaaaac Protospacer . ************.************.****.
42. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP020900 (Rhizobium phaseoli Brasil 5 strain Bra5 plasmid pRphaBra5d, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaaac Protospacer . *************** ********.*****.
43. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP053440 (Rhizobium leguminosarum bv. trifolii strain CC275e plasmid pRltCC275eF, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgacaacgacatgcgcgaaaac Protospacer . ************.************.****.
44. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP021028 (Rhizobium sp. TAL182 plasmid pRetTAL182d, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer gagcggttttgcgataaagacatgcgtaaaaac Protospacer *************** ********.*****.
45. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP018229 (Rhizobium leguminosarum strain Vaf-108 plasmid unnamed1, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcagttttgcgataacgacatgcgtaaaaac Protospacer . **.*********************.*****.
46. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013110 (Sinorhizobium americanum strain CFNEI 73 plasmid C, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgggataacgacatgcacaaaaac Protospacer . ********* *************.******.
47. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NC_012858 (Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132502, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ccgcggtgttgcgatagcgacatgcgcaaaacc Protospacer .****** ********.************** .
48. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013503 (Rhizobium esperanzae strain N561 plasmid pRspN561c, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer gagcggttttgcgataaagacatgcgtaaaaac Protospacer *************** ********.*****.
49. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013509 (Rhizobium sp. N1341 plasmid pRspN1341d, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer gagcggttttgcgataaagacatgcgtaaaaac Protospacer *************** ********.*****.
50. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013520 (Rhizobium sp. N113 plasmid pRspN113c, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer gagcggttttgcgataaagacatgcgtaaaaac Protospacer *************** ********.*****.
51. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013493 (Rhizobium sp. N6212 plasmid pRspN6212c, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer gagcggttttgcgataaagacatgcgtaaaaac Protospacer *************** ********.*****.
52. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP054029 (Rhizobium sp. JKLM19E plasmid pPR19E02, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacctgcgtaaaaac Protospacer . ******************* ****.*****.
53. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP024310 (Sinorhizobium fredii strain NXT3 plasmid pSfreNXT3c, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgggataacgacatgcacaaaaac Protospacer . ********* *************.******.
54. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP035000 (Rhizobium acidisoli strain FH23 plasmid pRapFH23b, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcggcaacgacatgcgcaaaaac Protospacer . ***********..*****************.
55. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP050083 (Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b3, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cggcggttttgcgacaacgacatgcgtaaaaac Protospacer . ************.***********.*****.
56. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP021033 (Rhizobium sp. NXC14 plasmid pRspNXC14c, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacggcatgcgtaaaaac Protospacer . *****************.******.*****.
57. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP049733 (Rhizobium leguminosarum strain A1 plasmid pRL10, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cggcggttttgcgacaacgacatgcgtaaaaac Protospacer . ************.***********.*****.
58. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013526 (Rhizobium phaseoli strain R744 plasmid pRphaR744d, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaaac Protospacer . *************** ********.*****.
59. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP018230 (Rhizobium leguminosarum strain Vaf-108 plasmid unnamed2, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer atgcggttttgcgataacgacatgtgtaaaaac Protospacer .**********************.*.*****.
60. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP016293 (Rhizobium leguminosarum strain Vaf10 plasmid unnamed5, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgctaacgacatgcgtaaaaac Protospacer . *********** ************.*****.
61. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP029452 (Sinorhizobium fredii CCBAU 25509 plasmid pSF25509b, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgggataacgacatgcacaaaaac Protospacer . ********* *************.******.
62. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013562 (Rhizobium phaseoli strain N841 plasmid pRphaN841e, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaaac Protospacer . *************** ********.*****.
63. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013579 (Rhizobium phaseoli strain N671 plasmid pRphaN671e, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaaac Protospacer . *************** ********.*****.
64. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013531 (Rhizobium phaseoli strain R723 plasmid pRphaR723d, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaaac Protospacer . *************** ********.*****.
65. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013541 (Rhizobium phaseoli strain R630 plasmid pRphaR630d, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaaac Protospacer . *************** ********.*****.
66. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013546 (Rhizobium phaseoli strain R620 plasmid pRphaR620d, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaaac Protospacer . *************** ********.*****.
67. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013584 (Rhizobium phaseoli strain N261 plasmid pRphaN261d, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaaac Protospacer . *************** ********.*****.
68. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013573 (Rhizobium phaseoli strain N771 plasmid pRphaN771e, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaaac Protospacer . *************** ********.*****.
69. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013515 (Rhizobium sp. N1314 plasmid pRspN1314d, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaaac Protospacer . *************** ********.*****.
70. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NC_010997 (Rhizobium etli CIAT 652 plasmid pC, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaaac Protospacer . *************** ********.*****.
71. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013605 (Rhizobium sp. N731 plasmid pRspN731d, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaaac Protospacer . *************** ********.*****.
72. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP053207 (Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1C, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cggcggttttgcgacaacgacatgcgtaaaaac Protospacer . ************.***********.*****.
73. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP029232 (Sinorhizobium fredii CCBAU 45436 plasmid pSF45436b, complete sequence) position: , mismatch: 5, identity: 0.848
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgggataacgacatgcacaaaaac Protospacer . ********* *************.******.
74. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP022569 (Rhizobium leguminosarum bv. viciae strain BIHB 1148 plasmid pSK05, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer caacggttttgcgctaacgacatgcgtaaaagt Protospacer . .********** ************.****.*
75. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP021028 (Rhizobium sp. TAL182 plasmid pRetTAL182d, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaagc Protospacer . *************** ********.****..
76. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NC_012858 (Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132502, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer gagcgattttgcgacaacgacatgcgcaaaaca Protospacer ***.********.****************
77. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013503 (Rhizobium esperanzae strain N561 plasmid pRspN561c, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaagc Protospacer . *************** ********.****..
78. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013509 (Rhizobium sp. N1341 plasmid pRspN1341d, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaagc Protospacer . *************** ********.****..
79. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013520 (Rhizobium sp. N113 plasmid pRspN113c, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaagc Protospacer . *************** ********.****..
80. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013493 (Rhizobium sp. N6212 plasmid pRspN6212c, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaagc Protospacer . *************** ********.****..
81. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP049249 (Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed3, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacatgcgaaaagta Protospacer . ************************ ***.
82. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP006989 (Rhizobium sp. IE4771 plasmid pRetIE4771c, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaagc Protospacer . *************** ********.****..
83. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP020909 (Rhizobium etli strain NXC12 plasmid pRetNXC12c, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaagc Protospacer . *************** ********.****..
84. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP020952 (Rhizobium sp. CIAT894 plasmid pRheCIAT894e, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgagaacgacatgcgtaaaaca Protospacer . ************ ***********.****
85. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP020952 (Rhizobium sp. CIAT894 plasmid pRheCIAT894e, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgagaacgacatgcgtaaaaca Protospacer . ************ ***********.****
86. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013577 (Rhizobium phaseoli strain N671 plasmid pRphaN671c, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ctgcggttttgcgataaagacatgcgtaaaagc Protospacer ..*************** ********.****..
87. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013549 (Rhizobium phaseoli strain R611 plasmid pRetR611b, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ctgcggttttgcgataaagacatgcgtaaaagc Protospacer ..*************** ********.****..
88. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013534 (Rhizobium phaseoli strain R650 plasmid pRphaR650b, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ctgcggttttgcgataaagacatgcgtaaaagc Protospacer ..*************** ********.****..
89. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013545 (Rhizobium phaseoli strain R620 plasmid pRphaR620c, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ctgcggttttgcgataaagacatgcgtaaaagc Protospacer ..*************** ********.****..
90. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NC_021908 (Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1d, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaagc Protospacer . *************** ********.****..
91. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013571 (Rhizobium phaseoli strain N771 plasmid pRphaN771c, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ctgcggttttgcgataaagacatgcgtaaaagc Protospacer ..*************** ********.****..
92. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP021126 (Rhizobium sp. Kim5 plasmid pRetKim5b, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaagc Protospacer . *************** ********.****..
93. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP021373 (Rhizobium sp. ACO-34A plasmid pRACO34Ac, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacggcatgcgaaaaatc Protospacer . *****************.****** **** .
94. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to CP007642 (Rhizobium etli bv. phaseoli str. IE4803 plasmid pRetIE4803a, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataaagacatgcgtaaaagc Protospacer . *************** ********.****..
95. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP032685 (Rhizobium sp. CCGE531 plasmid pRCCGE531d, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgacaacgacatgcgaaaaatc Protospacer . ************.*********** **** .
96. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP032690 (Rhizobium sp. CCGE532 plasmid pRCCGE532d, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgacaacgacatgcgaaaaatc Protospacer . ************.*********** **** .
97. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP020951 (Rhizobium sp. CIAT894 plasmid pRheCIAT894d, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcggcaacgacatgcgcaaaacc Protospacer . ***********..**************** .
98. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP018233 (Rhizobium leguminosarum strain Vaf-108 plasmid unnamed5, complete sequence) position: , mismatch: 6, identity: 0.818
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer caacggttttgcgctaacgacatgcgtaaaaac Protospacer . .********** ************.*****.
99. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP032695 (Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525c, complete sequence) position: , mismatch: 7, identity: 0.788
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgataacgacatgcgaaaccta Protospacer . ************************ **
100. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP049248 (Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed2, complete sequence) position: , mismatch: 7, identity: 0.788
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcggtaacgacatgcggtgaaac Protospacer . ***********.************ .***.
101. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP018229 (Rhizobium leguminosarum strain Vaf-108 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.788
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cccatgttttgcgacaacgacatgcgtaaaaac Protospacer .* *********.***********.*****.
102. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP049249 (Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed3, complete sequence) position: , mismatch: 7, identity: 0.788
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ctgcggttttgcggtaacgacatgcgatcaaac Protospacer ..***********.************ ***.
103. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP035000 (Rhizobium acidisoli strain FH23 plasmid pRapFH23b, complete sequence) position: , mismatch: 7, identity: 0.788
tcgcggt-tttgcgataacgacatgcgcaaaaat CRISPR spacer -tgcggcggttgcgctaacgacatgcgtaaaaac Protospacer .****. ***** ************.*****.
104. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013600 (Rhizobium sp. N741 plasmid pRspN741e, complete sequence) position: , mismatch: 7, identity: 0.788
tcgcggt-tttgcgataacgacatgcgcaaaaat CRISPR spacer -tgcggcggttgcgctaacgacatgcgtaaaaac Protospacer .****. ***** ************.*****.
105. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013504 (Rhizobium esperanzae strain N561 plasmid pRspN561d, complete sequence) position: , mismatch: 7, identity: 0.788
tcgcggt-tttgcgataacgacatgcgcaaaaat CRISPR spacer -tgcggcggttgcgctaacgacatgcgtaaaaac Protospacer .****. ***** ************.*****.
106. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013510 (Rhizobium sp. N1341 plasmid pRspN1341e, complete sequence) position: , mismatch: 7, identity: 0.788
tcgcggt-tttgcgataacgacatgcgcaaaaat CRISPR spacer -tgcggcggttgcgctaacgacatgcgtaaaaac Protospacer .****. ***** ************.*****.
107. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013521 (Rhizobium sp. N113 plasmid pRspN113d, complete sequence) position: , mismatch: 7, identity: 0.788
tcgcggt-tttgcgataacgacatgcgcaaaaat CRISPR spacer -tgcggcggttgcgctaacgacatgcgtaaaaac Protospacer .****. ***** ************.*****.
108. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013494 (Rhizobium sp. N6212 plasmid pRspN6212d, complete sequence) position: , mismatch: 7, identity: 0.788
tcgcggt-tttgcgataacgacatgcgcaaaaat CRISPR spacer -tgcggcggttgcgctaacgacatgcgtaaaaac Protospacer .****. ***** ************.*****.
109. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013594 (Rhizobium sp. N871 plasmid pRspN871d, complete sequence) position: , mismatch: 7, identity: 0.788
tcgcggt-tttgcgataacgacatgcgcaaaaat CRISPR spacer -tgcggcggttgcgctaacgacatgcgtaaaaac Protospacer .****. ***** ************.*****.
110. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP054028 (Rhizobium sp. JKLM19E plasmid pPR19E01, complete sequence) position: , mismatch: 7, identity: 0.788
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgacaacgacatgcgtagattt Protospacer . ************.***********.*.* *
111. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP022567 (Rhizobium leguminosarum bv. viciae strain BIHB 1148 plasmid pSK03, complete sequence) position: , mismatch: 8, identity: 0.758
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgacaacgacatgcgctaggga Protospacer . ************.************ *...
112. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP054022 (Rhizobium sp. JKLM12A2 plasmid pPR12A201, complete sequence) position: , mismatch: 9, identity: 0.727
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgacaacgacgtgcgtagcaca Protospacer . ************.******.****.*. *
113. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP013054 (Sinorhizobium americanum CCGM7 plasmid C, complete sequence) position: , mismatch: 9, identity: 0.727
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer ccgcggtttggggataacgacatgcatccagac Protospacer .******** * *************.. *.*.
114. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP015737 (Shinella sp. HZN7 plasmid pShin-01, complete sequence) position: , mismatch: 9, identity: 0.727
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cggcggtttcgcgatagcgacatgcgtggaacc Protospacer . *******.******.*********...** .
115. spacer 3.1|4538848|33|NZ_CP054031|CRISPRCasFinder matches to NZ_CP048281 (Rhizobium leguminosarum bv. viciae 248 plasmid pRle248e, complete sequence) position: , mismatch: 9, identity: 0.727
tcgcggttttgcgataacgacatgcgcaaaaat CRISPR spacer cagcggttttgcgacaacggcatgcgtagattc Protospacer . ************.****.******.*.* .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
425217 : 434489
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP054031|425217:434489|DBSCAN-SWA CATGACCAATGCTTCCACCGAATCCTTCTTCAATCGCTCGCTTGCGGACGTCGATCCGGACATTTTCGGCGCGATCGGAAAGGAACTCGGTCGCCAACGGCACGAGATCGAACTGATCGCTTCTGAGAACATCGTCTCCCGCGCCGTGCTGGAAGCCCAGGGCTCCATCATGACCAACAAATATGCCGAGGGTTATCCCGGCAAGCGTTATTACGGCGGCTGCCAATTCGTCGATATCGCTGAGGAACTGGCGATCGAGCGCGCCAAGAAGCTGTTCGGCGTCAATTTCGCCAACGTTCAGCCGAATTCCGGCTCGCAGATGAACCAGGCCGTGTTCCTGGCGCTGCTGCAGCCGGGTGATACCTTCATGGGGCTCGACCTGAATTCGGGCGGCCACCTCACGCATGGTTCGCCGGTCAACATGTCCGGCAAATGGTTCAACGTCGTCTCCTACGGCGTGCGCGAAGGCGACAACCTGCTCGACATGGATGAAGTCGCCCGCAAGGCCGAGGAAACCAAGCCGAAGCTCATCATCGCCGGGGGCACTGCTTATTCCCGCATCTGGGACTGGAAGCGCTTCCGCGAGATCGCCGACAGCGTCGGCGCCTACCTGATGGTCGACATGGCCCACATCGCCGGCCTTGTCGCCGGTGGCGTGCATCCGTCGCCGTTCCCGCATTGCCATGTTGCCACGACGACGACCCACAAGTCGTTGCGCGGCCCGCGCGGCGGCGTCATTCTGACGAATGACGAGGATCTGGCGAAGAAGTTCAATTCGGCTGTCTTCCCCGGCCTGCAGGGCGGCCCGCTGATGCACATCATCGCCGCCAAGGCCGTTGCCTTCGGCGAGGCGCTGCAGCCGGAATTCAAGGAATACGCCGCACAGGTCGTCAAGAATGCCAAGGCACTGGCTGAAACGCTTGTCGCCGGCGGCCTCGATGTCGTCTCCGGCGGCACCGACAACCACCTGATGCTGGTCGACCTGCGCAAGAAGAATGCCACCGGCAAGCGCGCCGAGGCAGCGCTTGGCCGCGCCTACGTCACCTGCAACAAGAACGGCATTCCCTTCGATCCGGAAAAGCCCTTCGTCACGTCGGGTGTGCGCCTCGGCGCGCCGGCCGGCACGACCCGCGGCTTCAAGGAAGCGGAATTCCGCGAGATCGGCAATCTGATCGTCGAGGTCCTCGACGGTCTGAAGGTCGCCAATTCCGATGAAGGCAACGCCGCCGTCGAAGCCGCTGTGCGCGGCAAGGTGGTCAGCCTCACCGACCGCTTCCCCATGTACGGCTACATGGGATAAGGAGTAGGGATGCGCTGCCCCTATTGCGGTTCGGAAGATACTCAGGTCAAGGATTCGCGCCCGGCGGAGGACAACACGTCCATCCGCCGGCGGCGCATCTGCCCGGATTGCGGCGGCCGCTTCACCACGTTCGAGCGTGTGCAGCTGCGCGAGCTGATGGTCATCAAGAAGACCGGCCGCAAGGTGCCCTTCGACCGCGACAAGCTGGTGCGCTCCTTCGAAGTGGCGCTGCGCAAACGCCCGGTCGAGCGGGATCGCATCGAGCGCGCCGTCTCGGGCATCGTCCGGCGGCTTGAAAGCTCCGGCGAAACCGAGATCTCCTCCGAGCAGATCGGCCTGCAGGTGCTGGAAGCCATGAAGAGCCTCGATGATGTCGGCTTCGTGCGCTACGCCTCGGTCTATCGGGATTTCTCGCTTGCCGAGGATTTCGAGAAGGTCATTTCCGAAATCAACGCCAAGATCGCCCGCGACCCGCTGGACAGGTGAGCCATGACCATCACGCCGCATGACGAACGTTTCATGGCAGCGGCGATCCGCCTGTCGCGCTGGCATCTCGGCCGCACCGCCACCAACCCCTCCGTCGGCTGCCTGATCGTCAGGGACGGCGTCATCGTCGGCCGTGCGGTGACGGCGCTCAGTGGCCGCCCGCATGCCGAGACGCAGGCGCTCGCTGAAGCAGGCGTGCTCGCCCGTGGCGCCACCGTCTATGTGACGCTCGAACCCTGCTCGCATCACGGCAGGACGCCGCCCTGCGCCGAGGCGCTGATCGCTTATGGCGTTGCCCGCGTCGTGATCAGCGTCACCGATCCCGATCCGCGGGTCTCCGGCCGCGGCATCGCCATTCTGCGCGAAGCCGGCATAGAAGTGGATGCCGGCGTGCTGGAGGCCGAGGGCCGGCATTCGCTTGCCGCCTATCTCACCCGCCAGACGAAGAACCGGCCCTATGTGACTCTCAAACTTGCCGTTTCCGCCGATGGCATGATCGGCCGCGAGGGGGCAGGGCAGGTGGCGATAACAGGCCCGGAGGCCCGCGCCCAGGTACAGGCGCTGCGTGCTGAAACCGATGCCATCCTGGTCGGCATCGGTACCGCGATTGCGGATGATCCGCTTCTGACTGTGCGTTCGCCAGGTCTTGAATCGCAATCGCCGATCCGCATCGTACTTGATCCCTCGCTGGCTTTGCGATTGACCAGCAAGCTCGTCGAGACGGCGCGGCAAGTGCCGATCATCGTGGTGGCGAGCGAGGAAGTTTGGCCTTTGTCGGCAGATGCGGAGGGGTTACCCCCCTCTGCCCTGCCGGACTACCGGGGTCGAGCCACGGGTCTCGACCCGTCCTTCGGACCTCCCACAAGGGGGGAGATCGGCAAGGAGCGCGGTCCGAATTCCCCCGTTTCTGTTAGCAATAAATCGCCTTCGGTAAATGCAGCGCAGGGCGCTGGCTCCGACCCAATCTCCCCCATTCTGGGGGAGATGCCCGGCAGGGCAGAGGGGGGTAACCTACCCGCCGACGCCACCGACATGGACTCCCGCCGCGCAGCCCTCGAAGCCGCCGGCGTCGAGGTCGTCCATTGCAATCCCTATCATCCGGAAGTCCTGTTGCCGGCACTCGCGACGCGCGGTATTTCCTCGCTTCTGGTCGAGGGTGGCGCAAAGACGGCACGGCTTTTTCTCGAAGCTGGCCTCGTCGACCGCATCCAGCTTTATCAGGCGCCTGTTGTGATCGGCGAGGGCGGTATTGAATCCCCGCTTGATGCAACCGACATACCATCGGGTTTTGCCCATACGGGCACGCTGATGTTCGGCGAAGATCGCCTGGACGAATACGAAAGAGGGCTTTGATGTTCACCGGAATAGTCACCGATATCGGTACGATCGAATCCGTTTCGCCTCTGAAGGAGGGCATCAAGCTCCGCGTTGCGACGAGTTACGACCCGGCGACCATCGATATGGGCGCATCCATCGCTCACTCCGGCATCTGCCTCACCGTCACCGGCCTGCCGCAGGAAGGCAGCAATGGTCGCTGGTTCGAGGTCGAGGCCTGGGAAGAGGCGTTGCGGCTGACGACGATCGGTGCCTGGCAGGAGGGCAGCCGCATCAATCTCGAACGGTCGCTGAAGATCGGCGATGAACTCGGCGGCCACATCGTTTCCGGCCATGTCGACGGCAAGGCGGAAATCCTCTCGGTGACAGCAGAGGGCGACGCCACGCGCTACCGCCTGCGCGCGCCCGAACATCTGGCGAAATTCGTCGCCCCCAAGGGTTCGATCGCACTCGACGGCACCTCGCTGACGGTCAATGCCGTCAATGGCACGGATTTCGACGTGCTGCTCATCCGCCACACGCTCGAGGTCACCACCTGGGGCGATCGCAAGACCGGCGATTTCGTCAATTTCGAAGTCGACACCATGGCGCGTTACGCCGCCCGGCTGGCGGAATTCCCGAGCACCTGAATCAAAAACCTGTAACGGTTTCCACGGTCACACCACGACTGGCATGCTGGGCGCGAAACATCCTCAGGCGCTCGTCGATGCATCCAACTGTACGTCTGTGACGCGGCGCGGCCCAATAGCTCACCAATCCGGGGACAGCTTCGTCAGATACTGGCCGTAGGCGCTCTTGCCGAGCTTCTCCGCGAGCTTGGCGAGGTCGTCCTGCGAGATGTAACCCATGCGCCAGGCGACCTCTTCGGGGCAGGCAATCTTGAAGCCCTGGCGTTTCTCCAACGTGCTGACGAAACCTGCTGCATCGAGCAGGCTGTCCGGCGTGCCGGTGTCGAGCCAGGCATAGCCCCGGCCCATCAGCTCCACGAAAAGCTGGCCGCGCTCAAGATAGGTGCGGTTGACATCGGTGATTTCCAATTCACCGCGCGGCGACGGCTTCAGGTTGGCGGCGATATCGACCACCTGCTGGTCGTAGAAATAGAGGCCGGTCACCGCCCAGTTCGATTTTGGCTTCTTCGGCTTCTCCTCGATCGACAGCGCGTTCATCTTTTTGTCAAAGCCGACAACGCCATAGCGTTCCGGATCGGTGACATGATAGGCGAATACTGTCGCGCCTTCCCGCCGATTCGTGCCGGATTTCATGATTTCAGGCAGTCCGTGCCCGTAGAAAATGTTGTCGCCGAGAACGAGTGCTGAGCTGTCGCCGTGCAGGAACTCGGCGCCGATGATGAAGGCCTGGGCGAGGCCGTCTGGGCTCGGCTGCACGGCATAGGTCAGCGAAATTCCCCATTGCGAGCCGTCGCCGAGAAGGCGCTTGAAGGCTTCGACGTCGTGAGGCGTGGTGATGATCAGCAACTCCCTGATGCCGGCGAGCATGAGCGTCGTCAGCGGATAGTAGATCATCGGCTTGTCGTAGACCGGCATCAACTGTTTCGAGACGGCCTGCGTGATCGGATGCAGACGCGTGCCCGTGCCGCCGGCGAGAATAATCCCCTTCATGCCCATCTCCCTAAAGACCCTTATTCAAAAGGTCTTGCATGACAATCTTCATTGACTGCTTCCACTCCGGCAGCCGGATTCCATATGTTCTGGCGAGCTTGTTGCCGTTGAGACGTGAATTGGCCGGACGCTTCGCCGGTGTCGGATAGTCCGCCGTCGGGATGCGCTCGACGCCGACGTTTCTGCCTCCGGATTTAGGAAGCTCTGTGAATATCTCTTCGGCGAAGTCAGCCCAGCTTGCTTCGCCGCTGCCGGTCAGGTGAAAGGTCCCGCGGAGCGATGGCTCAGGGTCCGCTACGATACGGGATGCGATCGCAAGAATCGCGTCGGCGATGTCGAGTGCAGAGGTCGGGCATCCTGTCTGATCGGCGACAACGCGAAGATGATCGCGCGTCTCGGAAAGCCGCAGCATCGTTTTCAGGAAATTGGTGCCAAAGGGCGAATAGACCCAGGCGGTGCGTAAGATGACATGGTTCGGGTTCGCCGCCGCAACGGCCTTCTCGCCCGCGAGCTTCGAGCGCCCATAGACGGAGATCGGCGCCGTCGCATCCTCTTCGGAATAGGCGGAGGCCTTGTCGCCGCTGAAGACGTAGTCGGTCGAAATGTGGATGACCGGGGCCCCGATCCGCGCAGCAGCCTCGGCAACCGCCCCGGCGCCTGCTGCGTTGACGGAAAAAGCAAGCCCGGGTTCGCTCTCGGCCTTGTCGACCGCCGTATAGGCCGCCGCCGAGACGATCACATCCGGCCGCAGAGCCGAGAAGGCGGCTGCGATGCTTGCAGGATCCGCGAGGTCCATTTCCGGACGCCCGACCGCGCTGATTTCGACGCCCATTTCGGCGCCACGTCTGAGCAACGACTGAACGACCTGGCCCTGTTTGCCGGTGACGGCAATGCGCATCGTTTATCGTCCCTTGAAAACGCCGAGGCGTTCGCCGGAGTAGACGTTTTCGCGCAGCGGCCTCCACCACCATTCGTTTTCCAAGTACCAGTGCACTGTCTTCTCGATGCCGGTTTCGAAGGTCTCTTGCGCCTTCCAGCCGAGTTCGCTTTCGAGCTTGGAGGCGTCGATCGCATAGCGTGCGTCGTGTCCGGGGCGGTCCGTGACGTTGGTGATCAGACGTGCATGCGGTCCCTTGTCACTGTAGACGCCGTCGAGAATAACGCAGATGCGATGAACGACATCGATGTTCCTGCGCTCATTGCGGCCGCCCACATTGTATTTTTCGCCGGGGCGCCCTCTGGATGCGATCGTGAAGAGCGCGCGGGCGTGGTCTTCGACATAGAGCCAGTCGCGGACATTCGCGCCATTGCCGTAAACCGGAAGAGGCTTGCCCTCCAGTGCGTTCAGGATCATTAGCGGAATGAGCTTTTCGGGGAAATGGAACGGGCCGTAGTTGTTGGAGCAATTGGAGACGACGACGGGCAGACCGTAGGTTCGGTGCCAGGCGATCACCAGATGGTCGCTGGCGGCCTTGGAGGCAGAGTAGGGCGAGGACGGATCGTAAGGCGTCGTTTCCTCGAAGAGGCCCTCGTCGCCGAGCGAGCCGTAGACCTCGTCCGTCGAGACATGCAGAAAGCGGAAGGCGCTCTTGCGGCGAGCGTCAAGCTCATCCCAATAGTGCCGTGCGGTATCCAGCAGGCTGAATGTGCCGACGATGTTGGTCTGAATGAAATCGGCTGCGCCCGAGATCGAGCGGTCGACATGGCTCTCTGCCGCGAGATGCATGACGATATCAGGGCGGAAGGACGCGAATGCTTCCTGCATCCTGGCGCGGTCGCAGATGTCGGCCTGCAGGAATTGATAGTTCGGCGCCGATTCGACGGATTTCAGCGATGCCAGATTGCCTGCATAGGTCAGCGTGTCGACATTCAGCACCTCGGCGCCGATCTCGCTGACGAGATGGCGCACCAATGCAGATCCGATGAAGCCCGCTCCGCCCGTGACCAGTATGCGCATCATCGATCAATCCCTGGGTTGCTGTGCTGAACAGGAAAAATGGCCTGGCAGATCGGCGAGCCGCGGCAGCGTCTTGTCCTTGTCGGACAATACCATGTCTCTCTCATAGAAGGGCCAGCGGATACCGATCGTCGGATCGTTCCACGCGATGCCTCGATCATGCTGCGGGCTGTAGGGTGCGGTTACCTTGTAGCTGATGACGCTGTTCGGCTCGAGCGTTGCGAATCCGTGTGCGAAACCGGCCGGTACCCAGAGCTGCATGCCATTTTCCGGCGAGAGCTCCTGAGAGAGCCATTTGCCGAAGCTCGGTGATTCCTCGCGGATATCGACGACGACATCCATGATCCGGCCGGCAAGGCAGCGCACCAGCTTGCCCTGTGCAAAGGGTGGTATCTGGAAATGCAGCCCCCGGACGGTGCCGGTCTGCGCCGAAAGCGATTCATTGTCCTGGATGAACGTGACGTCGGCCACATTTTCCCGGAACCAGGCATCCTTGAATACCTCGGAGAAGTAGCCGCGGTGATCGCCGAAGCGTGGCGGCGTGATCGCAACGATGCCCTCGATGGCCGTGGTCTCAATACGCATGACTTACCTTTTTGCGGCTTTTATCAAGCGCCCGCCCTCATGACGCTCATCAGCATGTCCGCCTGCGAGCGGCTCTGCATGGCCGCAAGGCCCTTCGCGGCAATGGCATCGCGCTCGTCGGCATCTGCGATCAGCTTGTAGCAGCGCTCGATCATGTCGTCATAGGGCTCAATTGCCAAGCCGCCGATGAAGGGCTGGAGGTCCGGATCGCGGATGTCGCCTTCCGTCAGCACGCAAACGCGATTGGCAAGCAGATAGGAAATGCGGACGATTTCGAAGACGCCGCTCGCATAATGATGGATGTTTATGACGATCTTGGCGCGGGCAATCACCGCGTCGCGTTCCGCACCATAAACGTTGAAGAGATGCACGACCTTGAGCCCGCCCTGTTCAAGCGTATTCAGGATATGGTGGCGGCGTTCGGGCAACGAGCCATAGAAGAGCACGTCTATATCCTTGACGACGGCATGCTGGATCTGGCTCAGGCAACTGTTATAACCGATTTCGAGCACACCGGCGTGATCGATACCCTTGGCGGCAAGGTTTTCGCGGTTGCGCGGGCTGTAGTCGAGAACCGGAAGCGACTTCAGGATCGAGGTGTAGCGCGAATTGATCCAGACGCTTTCTTCGGACACCTGCTCGAGATTGATGACGACGCTATCCTTCGGCAGATGACGGAGGACTTCCGGGGCCAGAAGATTGCCGCCATAGATGATCGGCGCACGGCCGGCGAATGCGTCAATGTCCCGGGTGATCGGTGCCGAGCCGCCGAGTTCCTCGAAGGCACCCTGTAGGCCAAGCGCGACTTCGTCGAAGGCGTGGCTGTGATTGTAGTTGTCGGGCGTCACGATCCAGATGCAATGGCGATCCTTGTGGGCCTGCCATCCGCCGGCAAAGGGCTGCAATGCGTTGCGAAGCGGCTGCTCGACCTTCTGGGCGGCGGGTATCTCGGTACGTACAGCCTGCTGCGGCGCCGGGGAGGCGCCGAAAGCCTGCGGAGCATAGGCACGCATCAGTTCGCGGCCGCCCGTGAACTGGCCGCGCGGCATCCAGGCCGGCGGGTCATCGGTGCCCCAGATAAGGTGCGGCTGCTTGATCAGTGGTTGACGGCCCTCCTGCCAGGAGGTCAAGTGGGCCTGGTCGCGGCTCTGAAGAAAAGCCTCCTTGGCGGCGAAAAGACCATGCAACATCCGGCCGAGGCGGACGCCGAACTTCCACTCGAAATCGGCAGGCGCAGGCACGAAGGCGGCGATCTTCAATCCGTTGGCAGCAGCCTGCTGGACATAGAGCCCAAGCAGACGCTCGATCACGAAGGGGCGCATCGTCGCGTTGGCGTCGCGTGAATACTGCGCCGCACCGGCATAGGCAATGCCCGCCGGCGTCCCGATGCGCGCTTCAGTTTCCAGCGATTCCAGAATGCGTTCGCAGAAGGCGAAATATCCGGACCAGAACTTCCTGTTGCCGCAGAAATAGTTGCAGAACGCGAAGGCGGTTTTGTCTTGCGGCGGCGCGATCGGATAGCCGAGCTCTTGGATATGCAGGAAGATCGGGTCCATGCCGGGATGACCGCCAAGCAGCGAATGTTCCCAGACGTTGCTGTAGATCGCCGCGTGGCCGATCAGCGGGTTGTAAAGATAGGCGTCCGCACCTTCTGCCCTGGCCTTTTCCGCCTCCGTGACAAAAGACGGAAAACTCGTCACCGACTTCATCTCGAATTTGCTGGAGAGCAGCCCCCAGAACACGTCCTCGCCGAGGCCGATCGCCTCGTGATGCGCATGCAGGATGCGGAAGAGCTCGTATTCGCGGGTCGCAGCCTGCGTGTTGAACGAGATGTCGAGCGGCACGGCAGCCGCGTCGAGCTGGCTGCGCTGCGAAGGATCGAGATAAGGCTGATAGATGATGACGGACCCCGTGCCGGCAGCTTTCTCCGTGAGAAGCGCCAGATCAGGAAGACCGGCCGGCAGAGCGGAAATGAGCTGATTGATCGTCAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP054031|425217:434489|430857_431913_-|WP_138328968.1|DBSCAN-SWA MRILVTGGAGFIGSALVRHLVSEIGAEVLNVDTLTYAGNLASLKSVESAPNYQFLQADICDRARMQEAFASFRPDIVMHLAAESHVDRSISGAADFIQTNIVGTFSLLDTARHYWDELDARRKSAFRFLHVSTDEVYGSLGDEGLFEETTPYDPSSPYSASKAASDHLVIAWHRTYGLPVVVSNCSNNYGPFHFPEKLIPLMILNALEGKPLPVYGNGANVRDWLYVEDHARALFTIASRGRPGEKYNVGGRNERRNIDVVHRICVILDGVYSDKGPHARLITNVTDRPGHDARYAIDASKLESELGWKAQETFETGIEKTVHWYLENEWWWRPLRENVYSGERLGVFKGR >NZ_CP054031|425217:434489|431919_432498_-|WP_138328312.1|DBSCAN-SWA MRIETTAIEGIVAITPPRFGDHRGYFSEVFKDAWFRENVADVTFIQDNESLSAQTGTVRGLHFQIPPFAQGKLVRCLAGRIMDVVVDIREESPSFGKWLSQELSPENGMQLWVPAGFAHGFATLEPNSVISYKVTAPYSPQHDRGIAWNDPTIGIRWPFYERDMVLSDKDKTLPRLADLPGHFSCSAQQPRD >NZ_CP054031|425217:434489|429086_429956_-|WP_138328314.1|DBSCAN-SWA MKGIILAGGTGTRLHPITQAVSKQLMPVYDKPMIYYPLTTLMLAGIRELLIITTPHDVEAFKRLLGDGSQWGISLTYAVQPSPDGLAQAFIIGAEFLHGDSSALVLGDNIFYGHGLPEIMKSGTNRREGATVFAYHVTDPERYGVVGFDKKMNALSIEEKPKKPKSNWAVTGLYFYDQQVVDIAANLKPSPRGELEITDVNRTYLERGQLFVELMGRGYAWLDTGTPDSLLDAAGFVSTLEKRQGFKIACPEEVAWRMGYISQDDLAKLAEKLGKSAYGQYLTKLSPDW >NZ_CP054031|425217:434489|428354_428966_+|WP_138328315.1|DBSCAN-SWA MFTGIVTDIGTIESVSPLKEGIKLRVATSYDPATIDMGASIAHSGICLTVTGLPQEGSNGRWFEVEAWEEALRLTTIGAWQEGSRINLERSLKIGDELGGHIVSGHVDGKAEILSVTAEGDATRYRLRAPEHLAKFVAPKGSIALDGTSLTVNAVNGTDFDVLLIRHTLEVTTWGDRKTGDFVNFEVDTMARYAARLAEFPST >NZ_CP054031|425217:434489|429966_430854_-|WP_138328313.1|DBSCAN-SWA MRIAVTGKQGQVVQSLLRRGAEMGVEISAVGRPEMDLADPASIAAAFSALRPDVIVSAAAYTAVDKAESEPGLAFSVNAAGAGAVAEAAARIGAPVIHISTDYVFSGDKASAYSEEDATAPISVYGRSKLAGEKAVAAANPNHVILRTAWVYSPFGTNFLKTMLRLSETRDHLRVVADQTGCPTSALDIADAILAIASRIVADPEPSLRGTFHLTGSGEASWADFAEEIFTELPKSGGRNVGVERIPTADYPTPAKRPANSRLNGNKLARTYGIRLPEWKQSMKIVMQDLLNKGL >NZ_CP054031|425217:434489|432521_434489_-|WP_138328967.1|DBSCAN-SWA MTINQLISALPAGLPDLALLTEKAAGTGSVIIYQPYLDPSQRSQLDAAAVPLDISFNTQAATREYELFRILHAHHEAIGLGEDVFWGLLSSKFEMKSVTSFPSFVTEAEKARAEGADAYLYNPLIGHAAIYSNVWEHSLLGGHPGMDPIFLHIQELGYPIAPPQDKTAFAFCNYFCGNRKFWSGYFAFCERILESLETEARIGTPAGIAYAGAAQYSRDANATMRPFVIERLLGLYVQQAAANGLKIAAFVPAPADFEWKFGVRLGRMLHGLFAAKEAFLQSRDQAHLTSWQEGRQPLIKQPHLIWGTDDPPAWMPRGQFTGGRELMRAYAPQAFGASPAPQQAVRTEIPAAQKVEQPLRNALQPFAGGWQAHKDRHCIWIVTPDNYNHSHAFDEVALGLQGAFEELGGSAPITRDIDAFAGRAPIIYGGNLLAPEVLRHLPKDSVVINLEQVSEESVWINSRYTSILKSLPVLDYSPRNRENLAAKGIDHAGVLEIGYNSCLSQIQHAVVKDIDVLFYGSLPERRHHILNTLEQGGLKVVHLFNVYGAERDAVIARAKIVINIHHYASGVFEIVRISYLLANRVCVLTEGDIRDPDLQPFIGGLAIEPYDDMIERCYKLIADADERDAIAAKGLAAMQSRSQADMLMSVMRAGA >NZ_CP054031|425217:434489|427005_428355_+|WP_138328316.1|DBSCAN-SWA MTITPHDERFMAAAIRLSRWHLGRTATNPSVGCLIVRDGVIVGRAVTALSGRPHAETQALAEAGVLARGATVYVTLEPCSHHGRTPPCAEALIAYGVARVVISVTDPDPRVSGRGIAILREAGIEVDAGVLEAEGRHSLAAYLTRQTKNRPYVTLKLAVSADGMIGREGAGQVAITGPEARAQVQALRAETDAILVGIGTAIADDPLLTVRSPGLESQSPIRIVLDPSLALRLTSKLVETARQVPIIVVASEEVWPLSADAEGLPPSALPDYRGRATGLDPSFGPPTRGEIGKERGPNSPVSVSNKSPSVNAAQGAGSDPISPILGEMPGRAEGGNLPADATDMDSRRAALEAAGVEVVHCNPYHPEVLLPALATRGISSLLVEGGAKTARLFLEAGLVDRIQLYQAPVVIGEGGIESPLDATDIPSGFAHTGTLMFGEDRLDEYERGL >NZ_CP054031|425217:434489|426525_427002_+|WP_003547190.1|DBSCAN-SWA MRCPYCGSEDTQVKDSRPAEDNTSIRRRRICPDCGGRFTTFERVQLRELMVIKKTGRKVPFDRDKLVRSFEVALRKRPVERDRIERAVSGIVRRLESSGETEISSEQIGLQVLEAMKSLDDVGFVRYASVYRDFSLAEDFEKVISEINAKIARDPLDR >NZ_CP054031|425217:434489|425217_426516_+|WP_029872910.1|DBSCAN-SWA MTNASTESFFNRSLADVDPDIFGAIGKELGRQRHEIELIASENIVSRAVLEAQGSIMTNKYAEGYPGKRYYGGCQFVDIAEELAIERAKKLFGVNFANVQPNSGSQMNQAVFLALLQPGDTFMGLDLNSGGHLTHGSPVNMSGKWFNVVSYGVREGDNLLDMDEVARKAEETKPKLIIAGGTAYSRIWDWKRFREIADSVGAYLMVDMAHIAGLVAGGVHPSPFPHCHVATTTTHKSLRGPRGGVILTNDEDLAKKFNSAVFPGLQGGPLMHIIAAKAVAFGEALQPEFKEYAAQVVKNAKALAETLVAGGLDVVSGGTDNHLMLVDLRKKNATGKRAEAALGRAYVTCNKNGIPFDPEKPFVTSGVRLGAPAGTTRGFKEAEFREIGNLIVEVLDGLKVANSDEGNAAVEAAVRGKVVSLTDRFPMYGYMG |
9 | Enterobacteria_phage(28.57%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
507875 : 518580
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP054031|507875:518580|DBSCAN-SWA ACTACCCGGCTCGTGACAGCGTACCCATGCGTCGCAGCGCGTCGACCTGGTCGTCGACCGTCGGCACGAGTTCGCCGGCATTGCCGCTCGGCGTGGCCGGTGCAGGGGCTGCGGCCTCCGCTTCGCCGAAGGTGACCGTGCAGTTGCGGCCGGCGGCCTTCGCCGCATAGAGCGCCCGGTCGGCGGCGGAGATCAGCTTGTCGATATTGGCTTCGATCGAGGAGGCGAGCGCGATGCCGATGCTGACGGTGACCGGAACGATCGCTTCGCCGGTGCTGACGGGAAGACGGCAGAATTCGGTGCGGATCGCTTCGGCGATCGCCTCGGCTTCCGTCTGGTCCGCGACGGCGGCGAAGGCGGCGAATTCCTCGCCGCCCATGCGGCCGAAGATGTTGTTCGGAATATACTGGCGGGCCATGCGCGCGAAGGCGGTCAGCACGGCATCGCCGGACTGGTGGCCGAAACGGTCGTTGACGCGCTTGAAATGGTCAAGATCGAAAAGGATCACCGCGACCTGGCGCTGCTCGTTATGAGCCCGTTCCTGGATGCCGTCGAAATAGCCGAGCAGGCCGCGGCGGTTGAGCACGCCGGTCAGCGAGTCAGTCAGGGTGAGGGCACGCAGGTGCCGTTCAGAGCGCTCCATGATCAGACGGCAGGTGAACGCAAAGGCGATGGTCAGCAGGAAGGCGCTGGCCATGGCGGAGGCACCGCCGAGATTGGTCGCCTCGATATCGCTTGGCATCGTCAATGCCATCGTCAGCGCCGAGCCGAAGCACAGGCAGCCCTGGAGGATGAACACACCCATCAGCTTGATGCGGGTACGCTCACGGCGCATGTCGCCGGCGGCGACCGCCATGGCAAGTGCCGTCGCACCCGTCGCCGAAGCAAGGTTGTAGAGCACCACCCGGTTGGAATAATCGCTGTTGACCCAGGGCAGGAAGACACCGGCGAGCCAGATTGCCGGCGGCAGCAGTGCCCACCATTCGATCCGCTTGCGGTCGAGTGCCAGAAAACCCGCCACCCAGGCGCTCTGGCCGAGCAGGGCGATGGCATTGCCCGTCTCTACTGAAAGCACGCTGGGGATTTCGCCGCGTAGCGCGACCATCGCAAAGCCGATGCCGGTCAGCAGGAAGCCCAGGCCCCAATAGAGATAGGCCTCGTTCCTGACGTTGTGGCGCCAGGCGACGAACAGCACCAAAGCGAGCGTCATGGCCTCGGAACACCAGATGGCGAGACCTGTAGCGAAATTGAACACGGCAGCAACCCCTTGTCTGTCCGTTCACATCCTTGCCGAGGAAGCTCGCCACTTCCTTAATGTTGCTGCGCCGCGGCATTGTTTCAGGCCGATATCTGCCGGTAGGCTTTCCGTTGGGTAAACGTGTATGAGCACAGTCGAATAGAGTAAGCCTTCCTTCATCCTGTCATGGAGAAAAGCCGGGGGAGACCGTACCAAGGGCTCGCCGTTAACAGGCGTTTGGGGGAAAGAGGCTACCAGGAGCCGCCTTGCCCCGCCTTCTTAACGTTATCCTCATGCGTGTCTTGAAGAGGAGGGCCGGGACACTCTATGTAGGGGTAAGACATTTTTCTTTGATTCCCGGCGCCGGCCGGCGAGACGTTGACAAAGGACGCACATGAGAAACCCAGTCGATACCGCAATGGCCCTCGTTCCGATGGTCGTGGAACAGACCAACCGCGGTGAACGCTCCTACGACATCTATTCCCGCCTGTTGAAGGAACGCATCATCTTCCTGACAGGTGCTGTCGAGGACCACATGGCGACGCTCGTCTGCGCCCAGCTGCTCTTCCTCGAGGCCGAAAACCCGAAAAAGGAAATCGCCCTCTACATCAATTCGCCCGGTGGCGTCGTCACCGCCGGCATGGCGATCTACGATACGATGCAGTTCATCAAGCCCGCGGTTTCGACGCTCTGCATCGGCCAGGCCGCATCCATGGGCTCGCTACTTCTCGCGGCCGGCCACAAGGACATGCGCTTTGCCACGCCGAATTCCCGCATCATGGTGCACCAGCCCTCGGGCGGCTTCCAGGGCCAGGCTTCGGACATCGAGCGCCACGCCCGCGACATCCTCAAGATGAAGCGCCGCCTGAACGAGGTCTACGTCAAGCACACCGGCCGCACCTACGAGGAAGTTGAAAAGACGCTCGATCGTGACCATTTCATGGATGCGGACGAAGCCCAGAGCTGGGGCGTGATCGACAAGGTTCTGACGTCGCGCCTCGAAATGGAAGGCGAACAGGCCTAGTAATTAATAGGGATGGTGTTTTCCTCGCCACAGTTCTATCCCATTTGTGATTGAACAAGGGCTTTCATCCGGCGACGAAGACGCTAATAGTAACTATTAATGCTATGTTGAATTTTTGTGACATAGCATTCGGCATTCGAAGGGGAATTCGTTGCCGCCGGGGCCAGGACATGATTGGTTGGGCCCGGTTCGTAGCCCCTGAAGCCCTTCTCGGCAAAGGCTCCGGAAACGGAGTGCCGCCAGGAGCTGATAGAGTGGACCGCGATTTCGCCTTGAAATGCGGCGTGCTGGAAGGAAAGTGATATGAGCAAGGTCAGCGGCAGCAACGGCGGCGACTCCAAGAACACTCTGTATTGTTCGTTCTGTGGAAAGAGCCAGCACGAAGTCCGGAAACTGATTGCCGGACCGACCGTCTTCATCTGCGATGAATGCGTCGAATTGTGCATGGACATCATCCGCGAGGAGAACAAGTCCTCGATGGTCAAGTCCCGCGACGGCGTTCCGACGCCCCAGGACATCATCAAGGTCCTCGACGAATACGTCATCGGCCAGCGGCAGGCGAAGAAGATCCTGTCGGTTGCCGTTCACAACCACTACAAGCGCCTGGCGCACGCCTCCAAGAACGGCGAAGTCGAGCTGGCGAAGTCGAACATCATGCTCGTCGGCCCGACCGGCTGCGGCAAGACCTATCTTGCCCAGACGCTCGCCCGCATCATCGACGTTCCCTTCACCATGGCCGATGCGACGACGCTGACCGAGGCCGGTTATGTCGGCGAGGATGTCGAAAACATCATCCTGAAGCTGCTGCAGTCGGCCGACTACAATGTCGAGCGTGCGCAACGCGGCATCGTCTACATCGACGAAGTCGACAAGATTTCGCGCAAGTCCGACAACCCGTCGATCACCCGCGACGTCTCGGGCGAGGGCGTGCAGCAGGCGCTGCTGAAGATCATGGAAGGCACGGTCGCTTCCGTTCCGCCGCAGGGCGGCCGCAAGCACCCGCAGCAGGAATTCCTGCAGGTCGACACCACGAACATCCTGTTCATCTGCGGCGGCGCTTTTGCCGGCCTCGACAAGATCATCTCTGCCCGTGGCGAGAAGACCTCGATCGGCTTTGGCGCTAGCGTCAAGTCGCAGGATGACCGCCGCGTCGGCGAGGTCCTGCGCGAACTGGAGCCGGAAGACCTGGTCAAGTTCGGCCTCATCCCCGAGTTCATCGGCCGTCTGCCGGTTCTGGCGACGCTCGAGGACCTCGATGAGGACGCGCTGATCCAGATCCTGTCCGAGCCGAAGAATGCGCTGATCAAGCAGTATCAGCGCCTGTTCGAGATGGAGGATGTGGAACTGAATTTCCACGAGGACGCTCTTCGCGAAATCGCCCGCAAGGCGATCGTGCGCAAGACCGGCGCCCGCGGCCTTCGCTCGATCATGGAGAAGATCCTGCTCGACACGATGTTCGAACTGCCGACGCTGGAAGGCGTTCGCGAGGTCGTTATCTCCGAGGAAGTGGTGCGCGGTTCGGCCCGTCCGCTTTACATCTATGCCGATCGCCAGGAAGAAAAGGCCAACGCTTCGGCGTGAGCTTAGGCTTTCGTGTGATTATTTTGGGGGCTTGCCATAGGCAGGCCCCTTTCTATTTGAAAAACCATGCGAAGCGCTGTTAGTATTTGCCGGTGGGAACCGAAATTGCCTTTGCCGATGTTGCGCAAGGGGTAGTGCTTGGCAATGATGCAATGATCCGTAAGCCTCTTGCTGGCGTTTGCATTTTAGCCGGGTCGGAAGTGGTAAGCGCCGGTCCAGCCACGCGCTTCATAGTGTTGGCGTTCCGTTGTAATAAAGCTGTCACATCCGAGGGACGCGGGCGTTAGCGTGGCTTGAAAAGCGTAGTCAGAACCTCCACTTGTAGCAACAAGTGAGATTGCCGGCCGGTCCCAAGCGGCCGTGCAGAGAGCCCGGCAACGGGACGATGGAAAGGACATAAAATGACGAAGAAAACGTCTGTAGCGAGCAGCACTGCCTACCCTGTTCTGCCCCTGCGCGACATCGTGGTTTTCCCGCATATGATCGTGCCGCTGTTCGTCGGACGGGAAAAGTCGATCCGTGCGCTCGAAGAGGTCATGGGTTCCGACAAGCAGATCATGCTGGTGACGCAGATCAACGCCAGCGACGACGATCCGGATCCTTCCGCGATCCACAATGTAGGCACCGTGGCGAACGTGCTGCAGCTCCTGAAGCTGCCCGACGGCACCGTGAAGGTTCTGGTCGAAGGCCGCGCCCGCGCCGAGATCGACACTTATACCAGCCGCGAGGATTTTTACGAGGCGCTCGGCCATGTGCTCGAGGAGCCGCATGACGATCCGGTCGAACTGGAAGCCCTGTCGCGTTCGGTCGTCTCCGAATTCGAGAGCTACGTGAAGCTCAACAAGAAGATTTCGCCCGAAGTGGTCGGCGCCGCAAGCCAGATCGACGACTATTCCAAGCTCGCCGATACGGTCGCCTCGCATCTGTCGATCAAGATCACCGAGAAGCAGGAGATGCTGGAGACCACCAGCGTCAAACAGCGCCTCGAAAAGGCCCTCGGCTTCATGGAAGGCGAGATCTCGGTCCTGCAGGTCGAAAAGCGCATCCGCTCGCGCGTCAAGCGCCAGATGGAGAAGACCCAGCGCGAATACTACCTCAATGAACAGATGAAGGCGATCCAGAAGGAACTCGGCGACGGCGAGGAAGGCCGCGACGAGATGAGCGAACTGGAAGAGCGCATCTCCAAGACCAAGCTGTCCAAGGAAGCCCGTGAAAAGGCCGATGCGGAACTGAAGAAGCTGCGCCAGATGAGCCCGATGTCGGCGGAAGCCACCGTCGTGCGCAATTATCTGGACTGGCTGCTCGGCATTCCCTGGGGCAAGAAGTCGAAGATCAAGGCCGATCTCAACAATGCCGAGAAGATCCTCGAAGCCGATCACTTCGGTCTCGACAAGGTCAAGGAGCGCATCGTCGAATATCTGGCCGTGCAGGCCCGTGCCACCAAGATCAAAGGCCCGATCCTCTGCCTCGTCGGTCCTCCGGGCGTCGGCAAGACCTCGCTCGCCCAGTCGATCGCCAAGGCGACCGGCCGTGAGTATGTCCGCATGGCGCTTGGCGGCGTTCGCGACGAAGCCGAAATCCGCGGTCACCGCCGCACCTATATTGGCTCGATGCCCGGCAAGGTCATCCAGTCGATGAAGAAGGCGAAGAAGTCCAACCCGCTCTTCCTTCTCGACGAGATCGACAAACTCGGCCAGGACTATCGCGGTGATCCGTCCTCGGCCCTGCTCGAAGTGCTCGATCCGGCCCAGAACATGACCTTTATGGACCACTATCTGGAAGTCGAATACGACCTGTCGGATGTGATGTTCATCACGACGGCAAATACGCTGAATATTCCAGCGCCTCTGATGGATCGCATGGAGATTATCCGTATCGCCGGCTACACCGAGGATGAAAAGCGCGAAATCGCCAAGCGGCACCTGCTGCCGAAGGCCATCAAGGAACATGCGTTGCAGCCGGAAGAATTCTCGGTCAGCGACGACGCCCTGATGTCGATCAGCCAGCAGTATACCCGCGAAGCCGGCGTCCGCAACTTCGAGCGCGAGCTGATGAAACTCGCCCGCAAGGCGGTCACCGAGATCATCAAGGGCAAGACGAAGGCCGTTCACGTGACGGCTGCCAACATCTCCGACTATCTGGGCGTCCCGCGCTTCCGCCATGGCGAAGCAGAGGGCGAGGATCAGGTCGGCGTCGTGACCGGTCTTGCCTGGACGGAAGTCGGTGGCGAACTGCTGACCATCGAAGGCGTGATGATGCCGGGCAAGGGCCGCATGACCGTCACCGGCAATCTGAAGGAAGTCATGAAGGAATCGATCTCGGCAGCGGCCTCCTATGTCCGCTCGCGCGCTGTCGATTTCGGCATCGAGCCGCCGCGCTTCGACAAGAGCGATATCCACGTGCACGTGCCGGAAGGCGCGACGCCGAAGGATGGCCCCTCGGCAGGCGTCGCCATGGCAACCGCGATCGTCTCGATCATGACCGGCATTCCAGTGGACAGGCATGTGGCCATGACCGGCGAAATCACCCTTCGTGGCCGTGTGCTGCCGATCGGTGGTCTCAAGGAAAAGCTGCTCGCAGCGCTTCGCGGCGGCATCAAGAAGGTGTTGATTCCGGAGGAAAACGCCAAGGACCTGGCGGAGATTCCTGACAACGTGAAGAACAACATGGAGATCATCCCGGTATCCCGCATGGGCGAGGTGATCAAGCATGCGCTGATCCGGCGGCCGGAGCCGATCGAATGGGATGGCACGGTGGAAACGCCGGTCATCACATCGGTCGAAGGCCTCGATGAGACAGGCGCAACCATAGCGCATTGAGTGGCCTTTGCCACATTTATGCCCAATTGGCCCAAAACACGGAAAAAGGCTGGCGGAAACGCCAGCCTTTTTTGTGAAAACGTCATGTAGACGCTTGCTTTTCCTTGATTTGCAGGGCTTTTGGGTTTTCGGCGGGCCTGATGAAACCTATTCTGCGCCTAGCTTACTTTTGAGTCGTTTCATACCATTTAGAAAGGGGTGGAAACATGAACAAGAATGAACTCGTGTCCGCAGTAGCCGAAAAGGCAGGACTGACGAAGTCTGACGCGGCTTCCGCTGTTGACGCGGTTTTCGACGTTGTCCAGGCTGAACTCAAGAACAAGGGCGACATTCGCCTCGCGGGTTTCGGCAGCTTCACCGTTTCTCATCGTGCCGCATCGAAGGGCCGTAACCCGTCGACGGGCGCTGAAGTCGACATTCCGGCTCGCAACGTGCCGAAATTCACGCCCGGCAAGGGCCTGAAGGACGCCGTCAACGGCTGATCTGAGATTATCCCACCAGCCGGAAACGAGACCGGTTGTGCAGGATTTAAGAGGGGTTCGGCTTTGCCGGACCCCTTTTGCTTTGGCGGGCCTTGCTGTCGCCCGGGCTTTAATCTACGACGTTTCTGTTCTCAGGGCGGGGTGAAACTCCCCACCGGCGGTGAGGGTTTCGGCCTGAGCCCGCGAGCGCCTTCTCTCGTGTCCAAGCGGGGGAAGGGTCAGCAGATCCGGTGCGATTCCGGAGCCGACGGTTAAAGTCCGGATGGAAGAGAATGAGCGTCGGCCGTGCGAAGGGCGGGGGTTGCCCTGTGTCGCCTGTCGGGGTTCATGCGTCCTGATTTATGTTGGTCCCTTTTATGGATGAAAGACATGAATCAGACTTCCGTTGCCGTCGAGCCATCCCGCGCCGTTGTCGGGGCACTCTGGATGGTCCTTGCCGGCATTGCCTTTTCGCTTCTCAACGTCGTCACCCAATGGCTGACGATGAAGCTTGCCTTTCCCTCGGCGTCGGCGGCCTTCTGGCAATATGGCTTCGCCTTCCTGTTTTCGCTGCCGTTCCTGAAGAGGCTCGGCCTTGCGGCGATGCGCACGCGCTATCCCTGGCGTCACCTTGCGCGCGTCGTGCTTGCCGCGCTCGGCGTCGAGGCTTGGGTCGCCGGCCTTGCCGCGGTGCCGATCTGGCAGGCGATCGCACTGGTGATGACCTCGCCCTTCTTCATCATTCTGGGCGCCCGGCTTTTTCTGGGTGAGCGCGTCGGCCCGGCCCGCTGGGCAGCGACGGCGGCGGGCTTCACCGGAGCGATGATCATTCTGCAGCCATGGTCTGATGGTTTTGGCTGGGCGGCTCTCTTGCCTGTGCTTTCGGCGCTGCTGTGGGGCGCCTCGTCGCTGATCACCAAGAGCCTGACAGGCATCGAACGGCCGGAGACGATCACCGTCTGGCTGCTCGTTCTGCTGACGCCGATAAACGGCGGGCTGGCGCTTGCGGCAGGCTTTGCAGTGCCGACGGGCGCAACCCTTGCTCTGTTCCTGCTGGCGGGGCTGCTGACTGCCGTAGCGCAGTATTTCCTGACGCTTGCCTATGCCGCAGCGGATGCCGCCTACGTCCAGCCGTTCGACGATCTGAAGCTGCCGTTGAACGTGCTGGCCGGTTGGCTGTTCTTCGGCTATGCGCCGGCGGGTTACCTCTGGCTGGGAGCGGCTCTCATCCTGTCTGCTTCGCTGTTCATCATGCGAAACGAGATGCGCCGGGAGCGGAAACTTGCCTGACATGATTTGACGCAAAATGCCGCATCTGTCATGCCTCTGTCATGCGGCTCCCGCAGAAAACTCATGTTTCGAATGTTCGACACATGGGGGATTTCATGACAATTTCATCGCGCAGCGCAGCGCTTGCTGCCGTGCTTCTCGCATCGGTAGCCTTTCCGGCCGCGGCCGAGCCGGTTTTCAACCGCATCGCTTCCTTCCCGGTCGCAGAGAACCTGCCGGCCGACAAGGACAAGCTTTCGGTGAGCTCCGCCGAGATCATCACCGCGACCGACGACGGCAACACGCTGATCTATAGCGACAGCCCGCTCGGCGCGATCGGCTTTATCGACATTACCGATGCCAAGGCCCCCAAGGCAGGCGGCGCGCTGATGATGGACGGCGAGCCGACTTCGGTCACCTCGAGCGCCGGCAAGGCGCTGGTTGCCGTCAACACCTCCGAAAGCAAGGCGAAGCCCTCGGGCCGTCTGGCGATCGTCGACGTGGCGACCAAGAAGATCGAAAACACGTGCGATCTCGGTGGTCAGCCGGATTCGATCGCCCTCAACAAGGACAAGACGCTCGGCACCATCGCAATCGAAAATGAGCGCGACGAAGACGTCAATGACGGCAAGATCCCGCAGATGCCGGCCGGCGATCTCGTCGTCTTCCAGGTCAAGAACGGCACCGTCGATTGCGGCACCATCAAGCACGTGACGCTGACCGGGCTTACCGGCGTCGCCCCGGAGGATCCGGAGCCGGAATTCGTCGCCTTCAACGGCCTGAACGAGATTGCCCTGACGCTGCAGGAAAACAACGAGATCGTCATCATCGACGCCAACACGGCTGAAGTGAAGACGCATTTCTCCGCCGGCAGCGTCGATCTCACCGGTATCGACACCAAGCGTGACGGCGCCCTGAAATTCTCAGGCGAAGCCAAAGGTGTGCTGCGCGAGCCCGACGCCGTGAAGTGGCTGGACGACAACCGCCTCGTCGTTGCCAATGAAGGCGACTACCAGGGCGGCTCGCGCGGCTTCACCATCTTCGATAAGACGGGCAAGCTTCTCTACGAATCGGGCGCCTCCTTCGAACGCGCCGTCACCCATATCGGCCACTATCCGGAAAGCCGCTCGGGATCGAAGGGCGTCGAGCCGGAAGGCCTCGAAGCCGCCAAGTTCGGTGACATCAAGTATTTCTTCCTGCTCGCCGAGCGCGCCTCGGTGGTCGGCGTTTTCAAAGATACCGGCGCCGATCCCGAACTCGTCCAACTGCTGCCATCAGGCATTTCGCCGGAAGGCGCGATCGCCATTCCCGCCCGCAATCTCTTTGCGACCGCCAACGAGGTCGATCTCGGCAAGGACGGCGGCACGCGCTCGCATGTGATGATCTACGAGCGCTCGGAAGGCGAGAAGGCCTATCCGCAGATCGTCTCCGCCGAGAAGGACGGCAATCCGATCGGCTTCGCAGCCCTTTCGGGTCTCGCTGCCGTTCCGGGCAAGCCGGGCATGCTCTACGCCGTCAGCGACAGCGTTCTGGGTTCACAGCCGACGATCTACACGATCGATGCCAGCAAGAAGCCGGCCGTCATCACCGACGCCCTCGTCGTCAAGCGTGACGGCGCTCCGGCGCAGAAGCTCGACATCGAAGGCATTGCGGCTGCCGCTGACGGCTCCTTCTGGCTCGCCTCGGAGGGCTATAGCGAGCGTCTCGTTCCGCACGCCCTCTACAACGTCAATGCCAAGGGTGACATCAAAGCCGAGATCGCCCTGCCGAAGGAGCTCGTCGCCAACGAAATCCGCTACGGCTTCGAAGGCGTGACCATTGTCGGCACCGGCGACGACACCACGCTCTGGATGGCGGTCCAGCGCGAGTGGAGCGACGACGAGAAGGGCTTCGTCAAGCTCGTCTCCTACAATCCGAAGAAGAAGGAATGGGGTGCCGTTCGCTATCCGCTCGACAAGACGGAAAGCGGCTGGGTCGGTCTGTCGGAAATATCGGCTCAGGGCGACAGCGTCTACATCATCGAGCGCGACAACCTTGTCGGCGATGCCGCCAGGCTGAAGAAGCTCTACAAGGTCGCGATATCGGAGCTGAAGCCCGCCAAGCTCGGCGGCGAGCTGCCGGTCGTCAAGAAAACCGAAGCCCATGACTTCCTCGGTCAGCTCAAGACCGCGACCAATGGCTATGTGCTGGACAAGCTCGAAGGCTTCACCTTCGACGCGTCGGGCAAACCCTACGCGGTCACCGACAATGACGGCGTCAGCGACTCCTCCGGCGAGACCCTGTTCTTCCCGGTCGAGCTGAGCGCGACAAACTGA
Protein sequences of DBSCAN-SWA_2 >NZ_CP054031|507875:518580|515386_516286_+|WP_138328262.1|DBSCAN-SWA MNQTSVAVEPSRAVVGALWMVLAGIAFSLLNVVTQWLTMKLAFPSASAAFWQYGFAFLFSLPFLKRLGLAAMRTRYPWRHLARVVLAALGVEAWVAGLAAVPIWQAIALVMTSPFFIILGARLFLGERVGPARWAATAAGFTGAMIILQPWSDGFGWAALLPVLSALLWGASSLITKSLTGIERPETITVWLLVLLTPINGGLALAAGFAVPTGATLALFLLAGLLTAVAQYFLTLAYAAADAAYVQPFDDLKLPLNVLAGWLFFGYAPAGYLWLGAALILSASLFIMRNEMRRERKLA >NZ_CP054031|507875:518580|516369_518580_+|WP_173862921.1|DBSCAN-SWA MGDFMTISSRSAALAAVLLASVAFPAAAEPVFNRIASFPVAENLPADKDKLSVSSAEIITATDDGNTLIYSDSPLGAIGFIDITDAKAPKAGGALMMDGEPTSVTSSAGKALVAVNTSESKAKPSGRLAIVDVATKKIENTCDLGGQPDSIALNKDKTLGTIAIENERDEDVNDGKIPQMPAGDLVVFQVKNGTVDCGTIKHVTLTGLTGVAPEDPEPEFVAFNGLNEIALTLQENNEIVIIDANTAEVKTHFSAGSVDLTGIDTKRDGALKFSGEAKGVLREPDAVKWLDDNRLVVANEGDYQGGSRGFTIFDKTGKLLYESGASFERAVTHIGHYPESRSGSKGVEPEGLEAAKFGDIKYFFLLAERASVVGVFKDTGADPELVQLLPSGISPEGAIAIPARNLFATANEVDLGKDGGTRSHVMIYERSEGEKAYPQIVSAEKDGNPIGFAALSGLAAVPGKPGMLYAVSDSVLGSQPTIYTIDASKKPAVITDALVVKRDGAPAQKLDIEGIAAAADGSFWLASEGYSERLVPHALYNVNAKGDIKAEIALPKELVANEIRYGFEGVTIVGTGDDTTLWMAVQREWSDDEKGFVKLVSYNPKKKEWGAVRYPLDKTESGWVGLSEISAQGDSVYIIERDNLVGDAARLKKLYKVAISELKPAKLGGELPVVKKTEAHDFLGQLKTATNGYVLDKLEGFTFDASGKPYAVTDNDGVSDSSGETLFFPVELSATN >NZ_CP054031|507875:518580|514741_515017_+|WP_003547346.1|DBSCAN-SWA MNKNELVSAVAEKAGLTKSDAASAVDAVFDVVQAELKNKGDIRLAGFGSFTVSHRAASKGRNPSTGAEVDIPARNVPKFTPGKGLKDAVNG >NZ_CP054031|507875:518580|509505_510135_+|WP_003547334.1|DBSCAN-SWA MRNPVDTAMALVPMVVEQTNRGERSYDIYSRLLKERIIFLTGAVEDHMATLVCAQLLFLEAENPKKEIALYINSPGGVVTAGMAIYDTMQFIKPAVSTLCIGQAASMGSLLLAAGHKDMRFATPNSRIMVHQPSGGFQGQASDIERHARDILKMKRRLNEVYVKHTGRTYEEVEKTLDRDHFMDADEAQSWGVIDKVLTSRLEMEGEQA >NZ_CP054031|507875:518580|512117_514535_+|WP_029872855.1|DBSCAN-SWA MTKKTSVASSTAYPVLPLRDIVVFPHMIVPLFVGREKSIRALEEVMGSDKQIMLVTQINASDDDPDPSAIHNVGTVANVLQLLKLPDGTVKVLVEGRARAEIDTYTSREDFYEALGHVLEEPHDDPVELEALSRSVVSEFESYVKLNKKISPEVVGAASQIDDYSKLADTVASHLSIKITEKQEMLETTSVKQRLEKALGFMEGEISVLQVEKRIRSRVKRQMEKTQREYYLNEQMKAIQKELGDGEEGRDEMSELEERISKTKLSKEAREKADAELKKLRQMSPMSAEATVVRNYLDWLLGIPWGKKSKIKADLNNAEKILEADHFGLDKVKERIVEYLAVQARATKIKGPILCLVGPPGVGKTSLAQSIAKATGREYVRMALGGVRDEAEIRGHRRTYIGSMPGKVIQSMKKAKKSNPLFLLDEIDKLGQDYRGDPSSALLEVLDPAQNMTFMDHYLEVEYDLSDVMFITTANTLNIPAPLMDRMEIIRIAGYTEDEKREIAKRHLLPKAIKEHALQPEEFSVSDDALMSISQQYTREAGVRNFERELMKLARKAVTEIIKGKTKAVHVTAANISDYLGVPRFRHGEAEGEDQVGVVTGLAWTEVGGELLTIEGVMMPGKGRMTVTGNLKEVMKESISAAASYVRSRAVDFGIEPPRFDKSDIHVHVPEGATPKDGPSAGVAMATAIVSIMTGIPVDRHVAMTGEITLRGRVLPIGGLKEKLLAALRGGIKKVLIPEENAKDLAEIPDNVKNNMEIIPVSRMGEVIKHALIRRPEPIEWDGTVETPVITSVEGLDETGATIAH >NZ_CP054031|507875:518580|510438_511716_+|WP_003547337.1|protease|DBSCAN-SWA MSKVSGSNGGDSKNTLYCSFCGKSQHEVRKLIAGPTVFICDECVELCMDIIREENKSSMVKSRDGVPTPQDIIKVLDEYVIGQRQAKKILSVAVHNHYKRLAHASKNGEVELAKSNIMLVGPTGCGKTYLAQTLARIIDVPFTMADATTLTEAGYVGEDVENIILKLLQSADYNVERAQRGIVYIDEVDKISRKSDNPSITRDVSGEGVQQALLKIMEGTVASVPPQGGRKHPQQEFLQVDTTNILFICGGAFAGLDKIISARGEKTSIGFGASVKSQDDRRVGEVLRELEPEDLVKFGLIPEFIGRLPVLATLEDLDEDALIQILSEPKNALIKQYQRLFEMEDVELNFHEDALREIARKAIVRKTGARGLRSIMEKILLDTMFELPTLEGVREVVISEEVVRGSARPLYIYADRQEEKANASA >NZ_CP054031|507875:518580|507875_509129_-|WP_138328263.1|DBSCAN-SWA MFNFATGLAIWCSEAMTLALVLFVAWRHNVRNEAYLYWGLGFLLTGIGFAMVALRGEIPSVLSVETGNAIALLGQSAWVAGFLALDRKRIEWWALLPPAIWLAGVFLPWVNSDYSNRVVLYNLASATGATALAMAVAAGDMRRERTRIKLMGVFILQGCLCFGSALTMALTMPSDIEATNLGGASAMASAFLLTIAFAFTCRLIMERSERHLRALTLTDSLTGVLNRRGLLGYFDGIQERAHNEQRQVAVILFDLDHFKRVNDRFGHQSGDAVLTAFARMARQYIPNNIFGRMGGEEFAAFAAVADQTEAEAIAEAIRTEFCRLPVSTGEAIVPVTVSIGIALASSIEANIDKLISAADRALYAAKAAGRNCTVTFGEAEAAAPAPATPSGNAGELVPTVDDQVDALRRMGTLSRAG |
7 | Bacillus_phage(16.67%) | protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
807380 : 820770
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP054031|807380:820770|DBSCAN-SWA CATGACCGAATCAAAAGCGATGATCCTTGGCTGCAGCGGCCTTGCCCTCACATCCGAAGAGAAAGCCTTTTACCGAGGCGAGCGACCCTGGGGCTTCATCCTCTTCGGGCGCAACATCTCCGAAGCACGGCAGATCGCCGATCTCGTCGCCGAACTGCGCGATAGCGTCGGCTGGCATGCGCCGGTGCTGATCGACCAGGAGGGCGGCCGCGTCCAGCGCATCCGCCCACCCATTCTGGCGCGTTATCCTTCCGGCCATGCGCTCGGCGATCTCTATCGCCGCGATCGCGCACTCGGCCTGCGCGCTGCCTGGCTGATGTCGCGGCTGCACGCCTTCGATCTTTCGAGCCTCGGCATCGATGTCGATTGCCTGCCGGTGCTTGATGTGCCGGTCGAGGGCAGCAGTAATGTCATCGGCGACCGCGCCTATGGTGGCGATCCGCAGACCGTCATAGCGATGGGCCGGGCGGCTGCCGAGGGGCTGAAGGCCGGCGGCCTGTTGCCCGTGATGAAGCATATGCCCGGCCACGGTCGCGGCTTTGCCGATTCACATCTGGAGCTGCCCGTCGTCACCGTCCCGCGCGATGAGCTGGAGCTTCATGACTTCCCGCCCTTCATCGCGATGAAGGACGAGTTGATGGCCATGACCTGCCACGTCGTTTTCGCCGCCATCGACCCGGACAATCCGGCGACGACCTCGCGCAAGGTGATCGACGGCATCATCCGCGAGCACATCGGCTTTAACGGTCTGCTGCTCTCCGACGACAGCTCGATGAACGCTCTTTCCGGCACGATCGGTGAACGTGCGGCGAATATCATTGCAGGCGGATGCGATATCGTGCTGCATTGCAATGGCAATATGGACGAGATGCGGGAGGTTGTGGCAAATGTTCCGCCGCTCGCCGGCATATCGCTTGCCCGCGCAAAAGCGGTGGAAGCAGGCTTCGCGGCGCCGGATACCGCCGATGAAGCGGAATTGAGGGCGGAATTCGAAGCGATGTTTGCGACGGTCTGATCGTAAGGGAGCAGCTAGGTGAACACGGTCAAGGGCACGGAACGTCCACAGGCGGCAACGCCGATGGACAAGCTGTGGCAGGATAACGGAGCTGAGCGCGCCAGCCACGAGCCGGCGCTGGTGATCGACGTCGCCGGCTTCGAAGGTCCGCTCGACCTGTTGCTCTATCTTGCCCGCAACCAGAAGGTCGATCTGTCGCGCATTTCGGTGCTGGCGCTCGCCGAGCAATATCTGCTGTTCATCGAAAGCGCCCGGCGTATCCGCATCGAGCTTGCCGCCGATTACCTCGTCATGGCAGCCTGGCTTGCCTATCTCAAGTCGAAGCTGCTCATTCCCCAGCAGGTCAAGGATGACGGCCCTTCCGGCGAGGAACTGGCGGCAACGCTCGCCTTCCGCCTGAAACGCCTCGAAGCCATGCGCCAGGCTGCAGACGGCCTCGTCAACCGCAATCGCCTCGGCCGCGATATCTTCGTGCGCGGCGCGCCCGAGCATATTCCCGACCGCCAGCAATCCGCTTATGCGGCGAGCCTTTACGATCTCCTGACCGCCTATGCGGCGCTGCGCCAGCGCCATGCCGTCACCCAGGTGACGATTGAGCGGCGCACCGTCTGGTCGCTGACCGATGCCCGCGAGCTGCTGACCCAGATGATCGGCGAGGTCGCTGACTGGACGGCGATGGAGCATTATCTGCTGCGGTATCTCGCAGCGCCCGAAGAACGCGTCACGGCAATCGCCAGCGCCTTTGCCGCCTCGCTGGAGCTGGTGCGCGAGGGCAAACTCGAAATCCGCCAGGACGGCGCCTTTCAGCCGATATACATGCGCCGCGGTCCGAAACATGCCACGCTGCAGGTGGTCGAACAGGAGCGGCCGGCTTGATCGATCTGAAGGGCGAAGAGGATTTCGAAGGCAATTTCGAAGACAGGAGCCGCGACCTGCAGGCCGAGATCGAGGCCGAGCGCATCGCCGAGGCGCTGGTCTTCGCCTCCTCGCAGCCGGTCTCCGAAGGCTTCCTCGCCGAGCGCCTGCCTGAGAAGACCGACGTGCATGCGATCATGCTGCGCCTGAAGGAGCAATATGCGCCGCGCGGCGTCAATCTCGTGCAGGTCGAAGGAGCATGGGCCTTCCGCACTGCCGCCGACCTGTCCTTCGTCATCCGCCGTGATGACAATGAGGTGAAGAAGCTTTCGCGCGCCGCACTGGAAGTGCTGGCGATCATCGCCTATCACCAGCCGGTGACGCGTGCCGAAATCGAGGATATCCGCGGCGTCCAGACCTCGCGCGGCACGCTCGACGTGCTGATGGAAGCTGGCTGGGTTCGGTTCCGTGGTCGCCGGCGCACGCCGGGCCGGCCGGTGACATTGGGCACGACACGCGATTTCCTCGATCATTTCGGTCTCGAAGAGCTGCGTGACCTGCCCGGTCTCGAAGAGTTGAAGGGAGCGGGCTTGCTGTCGGGCCGCATTCCGGCAACCTTCAATATTCCCTCGCCGTTGATGAACGACGAACTGACCGAGGACGAAGACCCGATCACCCAGATGGACCTCGAGGAACTGGGGTTGCTGGCGCCACGCGGCACCTCTGAAGATTGAGGGGCTTGCGTCTTGCTTTTGCCATGCATGGCGAAAACAATGGAGCCAGCACTTTTCGGCGGGTCAATGGCTTGTCATAAAGGGCGATACCGTGACATTAAGCGCAGTTCAAAGGGCGTGATGTTGGAAACGGTGTCGGAAATCCTTACATCGTGGGAACATAGAAACAGGGAGTTAGCGTAATGGGTGGTTTTAGCATGTGGCACTGGTTGATCGTTCTGGTCATCGTGCTGTTGTTGTTCGGTCGTGGCAAGATTCCGGAACTGATGGGCGACGTTGCCAAGGGCATCAAGAGCTTCAAGAAGGGCATGACGGACGAAGACGCGCCGGATACGGCAAAGACCGTCGATCACAAGGCCGACGAAACGAAGTAACCAAGTCCGGGAAAAAGCGCAGCCCCCGCGCTTCCCGTCCAGGAGCCTTGCATGTTCGATATTGGCTGGACCGAGCTTTTGGTCATCGCGGTCGTACTGATCGTGGTCGTCGGTCCCAAGGATTTGCCGCCGATGCTGCGCGCTTTCGGCAAGATGACGCAGCGCGCCCGCAAGGTTGCGGGTGATTTCCGTGCGCAGTTTGATGAAGCGTTGCGCGAAGCCGAGCTTGACGATGTCCGCCAGACGATCAGCGACGCCCAGAAGCTCAACCCGGTCAACAGTCTGCGCGAGGCGATGAACCCGCTCCGCCAGATGGGCAACGAGATCAAGGCCGACCTGCAGAAGGCGACCACGGTCACGGAAAACAAGACCGAGGTGCCTCCAGATGCTGTCGCAGCCCCGACGCCGTCGATGAGCTTGCTGGAAACGCCGCCATTGGTAGCGACACCTGCGCCGTCGGAACCGGTCGCCGCAGCGATTGTGCAAGCTGATACGGTTGCGGCCAGGCCGAAGGCTGTGCGCAAGCCGCGCGTCAAGCCTGCCGACCAGGTCGATGCTGCGGCCGCCGTTGCCGTGCCTGTGGAAAAGCCGAAACGCACGACGGCGGTCAGGAAGCCTGCAACGCCGAAGAAGCCTGCGCAGACAAAGAAGGACGAGGCATGAGCGGTGATATCGAAGACAAGCCGCAGCCGTTGATCGAGCACCTGATGGAGCTGCGCACGCGGCTGATGTGGTCGATCGGCGCGTTTTTCGTCGCCTTCATCGCATGCTTCTTTTTTGCCAAGAATCTCTTCAATTATCTGGTCATCCCATACAAGACGGCGGTCCAGTGGGCTAATCTCGACGTCGAGAAGGCCCAACTCATCTACACCGCGCCGCAGGAATTCTTCTTCACGCAGGTCAAGGTGGCGATGTTCGGCGGCCTTGTGGTCGCTTTCCCGATCATCGCCGCCCAGGTCTACAAGTTCGTGGCGCCCGGCCTCTACAAGAACGAGCGTCAGGCTTTCCTGCCGTTCCTGATCGCTTCGCCGGTCCTGTTCCTAATGGGCGGCGCGCTCGTCTATTTCTTCTTCACCCCGATGGTCATGTGGTTCTTCCTGTCGATGCAGCAGGCGCCCGGTCATGACGAGATAGCAATCTCGCTGATGCCGAAGGTCTCGGAATATCTGAGCCTGATCATGACGCTGGTCTTCTCCTTCGGCCTCGTTTTCCAGCTGCCTGTCATCACCACGTTGCTCGCCCGTGTCGGTCTTCTGACGTCGCAATGGCTCGCCGAAAAGCGCAAATTTGCCATCGTCCTCGCCTTCGTCGTCGCCGCCGTGCTGACGCCGCCGGACCCGATGTCCCAGATCGGCCTTGCGATACCGACGATCATTCTCTACGAAATTTCCATCTATGCGGCGCGACTCGTGGAGCGCCAGCGTGCCCGGCAGGCGGTCGAAAAGGAAACCGGATCCGCAGACGTTGCCAAGACGGACAGCGTCTGAGCTGCGATCAGGAAATCCCTTCATGAACGCCGTCGACTGCATAGTCCCTAAATCCGAAACGGTGTAAGGAATTAGGCAGCAATTCAAAGTGTTGCGGTGTCCTTTGCGCGTCTGAAAAGACAGGGCGGCGCTGTAACATCCGGTCGAGGCCCAATCAGGCTTTGGCCGGGCCACCTCTGTTGCAACGACCTGGAACGACGATGCTCGATATCAAATGGATCCGTGAGAATCCCGAAGCGCTCGATGCCGCCCTCGCCAAGCGCGGTGCCGAGCCTCTTGCCCAAAGTCTCGTTGCCCTCGATGAAAAGCGCCGCTCCGCCGTGCAGAAGGCGCAGGACCTGCTGTCCCGCCGCAACCTCGCCTCCAAGGAGATTGGCGCGGCGATGGCCCAGAAGAATAGCGAGCTGGCCGAGAAGCTGAAGGCGGAGGTCTCTGAACTCAAGACCCTGCTGCCGGCGATCGAGGAAGAGGACCGGCAGCTGACGGCCGAACTCAACGACGCGCTCTCGCGCATCCCGAACATCCCCTTCGATGACGTCCCGGTCGGCAAGGACGAGCATGACAATGTCGTCACCCGCACCGTCGGCGAAAAGCCACGCTGGAACCACGCACCGAAGGAGCACTTCGAAATCGGCGAAGCGCTCGGCTATATGGATTTCGAGCGCGCCGCCAAGCTCTCCGGCTCGCGCTTCACGGTTCTGACCGGGCCGCTCGCCAGGCTCGAGCGTGCGCTCGGCCAGTTCATGATCGATCTCCACACCAGCGAGCACGGCTATACCGAAGTCAGCTCGCCGCTGATGGTGCGCGACGAGGCCGTCTATGGTACTGCCCAGCTGCCGAAATTCGCCGAGGATCTCTTCAGGACGACTGATGGACGCTGGTTGATCCCGACGGCCGAAGTGACGCTGACCAATCTGGTGCGCGAGGAAATCCTCGACCAGGATAAGCTGCCGTTGCGCTTTACCGCGCTGACGCCGTCCTTCCGCTCGGAAGCAGGATCCGCCGGCCGCGATACGCGCGGCATGCTGCGCCAGCATCAGTTCTGGAAATGCGAGCTCGTCTCGATCACCGATGCCGAGAGCGCCGTTGCCGAGCATGAGCGCATGACCGCCTGCGCCGAGGAAGTGCTGAAGCGCCTCGGCCTGCATTTCCGCACCATGACGCTTTGCACCGGCGACATGGGCTTCGGCTCGCGCAAGACTTACGATCTCGAAGTCTGGCTGCCGGGACAGAATGCCTTCCGTGAGATCTCTTCCTGCTCGGTCTGCGGCGATTTTCAGGGTCGGCGAATGAATGCGCGCTACCGCGGTAAGGACGACAAGAGCAACAGGTTCGTCCACACGCTGAACGGTTCCGGCACGGCGGTCGGCCGCTGCCTGATCGCCGTCCTTGAAAATTATCTGAACGAGGACGGTTCGGTCACGATTCCGGACGTTTTGCTGCCTTATATGGGCGGATTGACCAAGATCGAACGGGCGGCCTGAGGCGATGCGCATCCTGCTTACGAATGACGACGGCATTCACGCCGAAGGTCTTGCCGCGCTGGAGCGGATCGCGCGCACGCTGTCCGACGATGTCTGGATCGTGGCGCCCGAGACCGACCAGAGCGGCCTGGCCCATTCGCTGAGCCTCTCCGAACCTTTGCGGCTGCGCAAGATTTCCGACAAGCATTTCGCCCTGCGCGGCACGCCGACCGATTGCGTCATCATGGGCATCCGGCAGGTGATGGACATCAAGCCGGATCTCGTCCTGTCAGGCGTCAATTCAGGCTCGAACGTCGCCGACGACGTGACCTATTCCGGCACGATCGCGGGCGCCATCGAGGGCACGATGCAGGGCGTGCGCTCCTTCGCGCTGAGCCAGGCTTATCTCTATGAGGACGGTGCGCGCATCGTGCCCTGGGAGGTCTGCGAGACGCATGCGCCGGCTCTTCTGGAAAAGCTGATGGTCCTGGACCTGCCGGAGGGCACGTTCCTCAATCTAAACTTCCCGAACTGCCGTCCCGACGAGGTCGACGGCGCCGAGGTGACCATGCAGGGCAAGCTCGCCTTCAATCTGCAGGTCGACGCCCGCTCCGACGGCCGGGGCTTTCCTTACTACTGGCTGAAGTTTGGCGAACGCGCCGGCGCCTTCGTCGAAGGCACCGATATTCACGCCCTGAAGCATAATAAGATTTCGGTAACGCCTTTGAAACTGGATCTGACCGATTATTCCGTGACGGACCGCGTGGCGCGGGCCTTGGGATACGGAGCACAAGTTTGACGGCAAGACTGGCGGAGAAGGAGGGCTTTGCGGCGCTCGTCCTCAGATTGCGTGCCGAAGGCATCTCGGACCTCGATCTGCTGACGGCGGTCGAGCAGACGCAGCGTTCGTTGTTCGTGCCGCCGCAATTCGCCGACGACGCCTATTCGAGCCGGACGATCCCGATCGAATGCGGTTCCTTCCTCGAGGGTATCGATTTTGCCGTCCGCATCCTGCATCACCTCAAGCTCAAGCCCGGCCAGCGCGTCCTGGAGATCGGCACGGGAAGTGGCTTTACCGCCGCCGTGATGGGCCGCCTGGCCGAGCGTGTTCTGTCCATCGATCGCTACAAGACGCTGACATCAGCCGCGCAGCGGCGCATGGAATCGCTTGGTCTGCGCAGCGTCGTCATCCGCCAGGCCGACGGCAGCGCCGGCATGCAGGGTGAGGGCACCTTCGACCGCATCCTGGTGACGGCGGCGTTCAACGCGATGCCGCGCTTTTATACCGACCAGCTCGTTTCCGGCGGCTCGATGATCGCGCCGCTGATGATTTCCGAGAACGAGTGTCGGATGGTGCGGCTGACGAAAACCGGCAGCCGTTTCGAACGCGAGGAACTGTTCGAAGCGCCCTATCTGCCGATCGTTCCGCGTCTTGCCTCGCTGCTGTAGACGTGGATGATTTCAGGCCGGACAGCCCTGAGATTTGCCATCAGAAAAGAGCGCGCATGATCTCGTCCGAGAACCGCTAAGACTTTTCGGCATCATGCTCAGCACACTGCGATTTTTCGCCCGTAAGCTATGGTTATCAACTTCTCAAAAAAATCGCACTGATTCCAGCATCTTAACTGCGTGGTAATACTAACGCGTTTTAATAGACTCACAAATGGTTGCGTTCTCAGTGGGTCGAGTCATGCGTTTCAGTCTTTCGCCAAAGTTCGGGAAGTCGGCCGGTAATCTTCTGGTTGTTAGCCTGCTGGCAAGTGCCGCAACGGGCTGCAGTTCCGATGTGACACGGTTTGGCGGCTTGTTTTCCTCCTCCGGGCAGGACCAGATCACCACAAGTTCCATTCCGCGCAGGAATGTGAACGGTTCTCAGGGCGATCCGGTGCCGCGCGCCGATCTTAGCAGCTCGGCCGTTGCCAGTCAGTCGGGCTACGGCGGCGGCAATGACGCGCTGAACCAGCCTTATCCCGCCCGCCAGGGTTACGATCCGACCCGCACGTCGAGCTCGAGCGCACGTCTCGCTTCCGCGCCGGTCTCGGTGCAGCGTTCAGAGTTGGCCGCGCCGACGGCGGCGGCACCATCCCGGCAGCGGGAAAAGGAAGTTGCGCTCGCCCAGCCTTTCCCAGCTGCGCCGCAGGCCGAAAAGCCCCGATTGGTGGCGCCGGCTGCGCCGAAGGTGACGCCTGATACGCTGACGACAGGCACGACGCCGAAGGTTTCAGGCTGGTCCGCGACCAACGCTCCGTCGGTGTCACTGCGTCCGGGTGAAAGCATCGCCACGCTCTCCAGGCGTTTCGGCGTACCGGAAAAGGAAATCCTGCGCGTCAATGCTCTGAAGACGGCATCTGCCGCTCAGCCCGGCCAGGCGATCCTGATCCCGACCTTCAACGGCGGCAATGCCGCCAAGGCGGCATCGCAGGCGGCTGACCTTTCCAAGCCCGGCAAAATGCCGGCGCCGAAGGCGCCTGAGCAGAACGTCGCTGTCGTTCCGGGCGCCAATTCTGCCCGCGACAAGACGCTGGCGAGTGGCGATGTCACCGGCAAACTTCCCGCCGGCGCTGGCAAGGATCCGAAGGCGCCTGCCGGCACCTATGTCGTCAAGCAAGGCGATTCGCTGGCAAAGATTGCCAAGGCGACCGGTAGCAATGTCGACGACCTCAAGGCTGCCAACAATCTTTCGGCCAGCTCGCTCCGCATCGGTCAGGCTCTGAAAATCCCGAACGGCACCGCCGACAATATCAAGACCGCCTCGATCCCGGTCGAGAAGGTCGATCCGAAGCCGCCTCAGCCGGCGGCTGCCCAGCAGACGGCTTCCGTTCAGCCTGCGCCCTATAAGGCGCCGGCCGCCACGCAGACCGTCGACGATGTCGAGAAGAAGTCGGACGTCAGCTCCGCCGCGCCGGAATCGACCGGCATCGGCAAATATCGCTGGCCGGTGCGCGGCCAGGTCATTGCTTCATACGGCGCCAACGTCAACGGCAACCGCAATGACGGCATCGACATTTCGGTACCGCAGGGCACGCCGATCAAGGCTGCCGAAAACGGCGTCGTCATCTATGCCGGCAACGGCCTGAAGGAACTCGGCAACACGGTTCTCGTCCGTCACGACGACGGCACCGTCACCGTCTACGGCAACGCCGATACGCTGAGCGTCGCCCGCGGCCAGAAGATCCAGCGCGGCCAGACCGTCGCCGTCTCCGGCATGAGCGGCGACGTCAAGCAGCCGCAGGTCCATTTCGAGGTGCGCAAGGATGCGTCCCCAGTCAACCCGATGACTTTCCTGGAATAGGTACAGCGTACCAGGAAACGCAAAAGCCCGGCCACCAGTCGGGCTTTTGTAATTTTATGCCCGGTCGATGTGAACCCGCATGCGTCCGGCCAGGTCCTGAATATACTGCCAGGCGACGCGGCCGGAGCGTGCGCCGCGAGTCGTTGCCCATTCCAGCGCTTCGGCATGCATGTTGTCGCGTTCGAGCCCGAGCTTGAAGTGATCGGCATAGCCGTCGATCATGCCGAGATAGTCCTCCTGGCTGCATTTGTGGAAGCCGAGCCAGAGGCCGAAGCGGTCGGAAAGCGAAACCTTCTCCTCGACCGCTTCCGACGGATTGATCGCCGTCGACTGCTCGTTTTCCATCATGTGGCGCGGCAGGAGATGGCGCCGGTTGGAGGTGGCGTAGAAGAGCACATTGTCCGGCCGTCCTTCGATGCCGCCGTCGAGTGCCGCCTTCAGCGACTTGTAGGCGGTATCGTCGTGATCGAAGGAGAGGTCGTCGCAGAAGACGATCACGCGGTACGGCGTGTCCTTCAGGAGGTCGAGCAGATTGGGAAGGCTGGCGATATCTTCACGATGGACTTCGACCAGTTTCAGCGAGGCGCTGCTTTCGCGCCTGACATCCTCGTGCACGGCCTTGACCAGAGAGGATTTGCCCATGCCGCGCGCGCCCCAGAGCAGCACATTGTTGGCGGCATAACCCTCGGCGAAACGCACCGTATTCTCGTGCAGGATGTCGCGCACGTGATCGACGCCGCGGATGAGCGTCAACGCCACCCGGTTCGGCCTCCTGACCGGCTGCAGATGCTGGCGCAATGGCGCCCAGACGAAACAGTCGGCAGCATCCCAGTCGTTGAGGGCGGGTGCCGGTCCGGCGAGACGCTCGACGGCGTCTGCGAGCCGCTTCAGCTCGGCAAGTAGGGCTGTGTTGATTTCCTCGGTCATCGGTGGCCTCCTGATTTCTCACTGCGGCAATCCGCTTTGATCTTTCGATGCCGCTAATGCATGTCGCCCGGAAGCGTGGAGCGGTTCCGGGATAACGACATGCTTGAAAACAAAAGAGCTAAAGCGCGTCGCATCGATCTAGTTCGATGCGACGCGCTTTAGCATGGCGTTTTGGCGGCGGAAAGGTTATGAGGCAGCGGAATCGGTACGGTTTCCGGCTGTTTCTGTTGCATTCATCGAAGCCGTAACTATAGTCCGGCAACCTGAAAAGAGAGGCGGAGTTCCGCCGCCCAAGCCTGAGGAGTTTAGCATGTTCATCACCCCGGCATTCGCCCAGAGCGCGACCGATACCGCAACGGGATTCGGCGGCTCCGGTTTCGAAATGATCATCCTGTTCGTGCCGCTGATGGTCGTCTGGTACTTCCTGCTGATCCGTCCGCAGCGCGCACAGGCAAAAAAGCGTGAGGAAACCCTGAAGGCGATCCGTCGCGGCGACCAGGTCGTCACGGGTGGCGGCCTCGTCGGTAAGGTCACCAAGGTGATCGACGAAAAGGAAGTCGAAGTCGAGATCGCTGACGGCGTGCGCGTGCGCATCGTCCGCAGCGGCATCTCCGAAATTCGCGTCAAGGGTGAACCGGTCAAGGCCGACGCGGCGTAACAAGCGATAACGGCAGGACCGCCGGATCGGAAGATCGTCGAGAGAATGTTGCATTTTTCCCGCTGGAAAACACTTCTGATTTGGCTGGCCGCGTTTGCGGCCATCGTCATCGCTGCGCCCAATCTGCTGACCGAGGCGCAGCGATCCTCCCTGCCGGACTGGTTGCGGCACGACCGCGTAGTGCTTGGTCTCGACCTGCAGGGCGGCTCGCATATCGTTCTGAAGGTCGAGCGCTCCGATATCGTCAGAGACCGCCTGGAAGAGGTGGTCGCCAATGTACGCAACGCGCTGCGCGGCGCCGGCATCCGCTATACCGGACTGACCGGCAATGACCAGACCGTCACCGTCCGCATCACCGATGCGGCCCAGACACAGGCGGCGGTCGATCTCCTGAAGCCTTTGACGACGGCTGGCGGACATTCGGGACCTGATGTCGCCCTGCAGCAGGGTGACCAAGGGCAGCTCTCGCTGCAGATTTCCGACGCCGGCATTACCGCTGACGTCGCTTCCGCCCGGACCCGCTCGCTCGATATCGTTGGCCGCCGCATCGCCGGATTGGGCAATGATAATTTCCTCGTTCGCCCCGATGGCGCCGACCGCATCGTCGTGCAGGTGCTGGGATCAATCGATGCGGAGCGGCTGAAGAATATCCTGAACCAGCCGGCCAAGCTTTCCTTTCATCTGATCGACGAAAGCATGTCGGGGCAGGAGGCGCTGAACGGCCGCTGGCCGACGACATCGCAGGTGCTCTATTCGCTTGACGATCCGCCGGTACCGTATCTCGTCGACCGCACGGCTTTCATCACCGGCAGCAACATGGTCGATATCGAGCCTGTCGTCGATCAACAGACGCAGGATACCTCGATCGCCTACCGTCTCGATGCCGAAGGCACGCAGCGGCTGGCGCAGGCGACAGGGCAGAATATCGGCAAGCACCTTGCCATCGTCTTTGACGACCAGGTGATGTCATCACCGGTCATCGATGCGGCGATCACCGGCGGCGAGGGCCGGATTTCGGCAAACTTCTCCGAGGATGGCGTCCGCGACCTTGCGATCATGCTACGGGCCGGCGCCTTGCCGGCGACGCTCACCAGCGTTGAGGAACGCAGCGTCAGCCCGAGATTCGGCGCCGATTCTATCTTCAACGGCCTCGTCGCCGGCCTCGTCGCCGTCGTATTGGTCGCGGCACTGATGATCGCGCTCTATCGCATCCTCGGCATCATCGCCGTCGCATCGCTCTTCCTCAACCTGATCCTCATCGTCGCGGTGCTCAGCCTTATCGGCGCGACGCTTACCTTGCCAGGGATTGCCGGTATCGTGCTGATCGTCGGCATGGCGGTCGATTCGAACGTTCTGATCTATGAGCGAATCCGCGAAGAGGAAAAAACCACCCATTCCTTTGCGGAGGCCGTCGGTCGCGGTTTCTCGCGCGCATTCGCAACGATCGTCGACGCCAATGTGACGATCTTCATCGCCGCCATCATCCTCTTCTTCCTCGGCAGCGAATCCATCCGCGGTTTCGCGGTAACGCTTGCGGTCGGCATCCTGACGACCGTTTTCACGGCATTCACGCTGACGCGCTCGATCGTTGCCGTCTGGTTGAGAAGGCGCCATCCCCGGCATCTGCCGAAGAGCGTGCTGACGCATCTTTTCGAACACGCCAATATCCGCTTCATGGGCATCCGCCGTTATGTCTTTACGGCGTCGGCGGTCATCTCGCTGATCGCCATGGCCGCCTTTGCCACCGTCGGCCTGCATCTCGGCATTGATTTCACCGGCGGCTCGCTCATCGAGGTGACGGCAAAGCAGGGCAATGCCGACATCGCCGATCTCAGCTCGCGTCTGAATGATCTCAATCTCGGCGATGTCAGTGTCGAGCGCACCGGTGGCCCGTCGAACGCGCGGATCCGCATCGCATCCCAGGGCGGCGGCGAGAATGCCGAGCAGTCGGCGGCGACGCTGGTGCGCGGTGAGCTCCAGGAAGATTATGATTTTCGCCGCGTCGAGGTCGTCGGCCCCGCCATCTCGGGCGAATTGACGATGATGGCGACGCTCGGCGTGCTTGCAGCACTTGCGGCGATCCTCATCTACATCTGGATCCGCTTCGAATGGCAATTCGCGGTCGGCGCCATCATTGCCACGCTGCACGATGTCATCATCATGCTCGGTCTCTTCGTCCTCACCGGCATCGAGTTCAACCTGACGAGCATCGCCGCCGTGCTGACCATCGTCGGTTATTCCCTGAACGATACGGTCGTGGTCTACGACCGAATGCGCGAGAATCTGAAGCGATACAAGAAGATGCCGCTGCCGATCCTGATCGATGCTTCGATCAACCAGACGCTGTCGCGCACCGTTCTGACCGCGGCGACGACGCTGATCGCCTTGCTGGCGCTCTTTCTCTTCGGCGGTGAAGTCATCCGTTCCTTCACCTTTGCGATGCTCTTCGGCGTCGCTCTCGGCACCTTCTCTTCGATCTATATCGCCGCTCCTGTCTTGATCGTCTTCCGGCTGCGGCCGGAGGCTCCCGACGGGGAAGAGAGCAACAAGACGGATGCGGGTGTAAAATCCGGCACGGTGGTTTGA
Protein sequences of DBSCAN-SWA_3 >NZ_CP054031|807380:820770|818229_820770_+|WP_138329809.1|DBSCAN-SWA MLHFSRWKTLLIWLAAFAAIVIAAPNLLTEAQRSSLPDWLRHDRVVLGLDLQGGSHIVLKVERSDIVRDRLEEVVANVRNALRGAGIRYTGLTGNDQTVTVRITDAAQTQAAVDLLKPLTTAGGHSGPDVALQQGDQGQLSLQISDAGITADVASARTRSLDIVGRRIAGLGNDNFLVRPDGADRIVVQVLGSIDAERLKNILNQPAKLSFHLIDESMSGQEALNGRWPTTSQVLYSLDDPPVPYLVDRTAFITGSNMVDIEPVVDQQTQDTSIAYRLDAEGTQRLAQATGQNIGKHLAIVFDDQVMSSPVIDAAITGGEGRISANFSEDGVRDLAIMLRAGALPATLTSVEERSVSPRFGADSIFNGLVAGLVAVVLVAALMIALYRILGIIAVASLFLNLILIVAVLSLIGATLTLPGIAGIVLIVGMAVDSNVLIYERIREEEKTTHSFAEAVGRGFSRAFATIVDANVTIFIAAIILFFLGSESIRGFAVTLAVGILTTVFTAFTLTRSIVAVWLRRRHPRHLPKSVLTHLFEHANIRFMGIRRYVFTASAVISLIAMAAFATVGLHLGIDFTGGSLIEVTAKQGNADIADLSSRLNDLNLGDVSVERTGGPSNARIRIASQGGGENAEQSAATLVRGELQEDYDFRRVEVVGPAISGELTMMATLGVLAALAAILIYIWIRFEWQFAVGAIIATLHDVIIMLGLFVLTGIEFNLTSIAAVLTIVGYSLNDTVVVYDRMRENLKRYKKMPLPILIDASINQTLSRTVLTAATTLIALLALFLFGGEVIRSFTFAMLFGVALGTFSSIYIAAPVLIVFRLRPEAPDGEESNKTDAGVKSGTVV >NZ_CP054031|807380:820770|814102_814756_+|WP_011651660.1|DBSCAN-SWA MTARLAEKEGFAALVLRLRAEGISDLDLLTAVEQTQRSLFVPPQFADDAYSSRTIPIECGSFLEGIDFAVRILHHLKLKPGQRVLEIGTGSGFTAAVMGRLAERVLSIDRYKTLTSAAQRRMESLGLRSVVIRQADGSAGMQGEGTFDRILVTAAFNAMPRFYTDQLVSGGSMIAPLMISENECRMVRLTKTGSRFEREELFEAPYLPIVPRLASLL >NZ_CP054031|807380:820770|810165_810357_+|WP_017960199.1|DBSCAN-SWA MGGFSMWHWLIVLVIVLLLFGRGKIPELMGDVAKGIKSFKKGMTDEDAPDTAKTVDHKADETK >NZ_CP054031|807380:820770|816653_817526_-|WP_138329811.1|DBSCAN-SWA MTEEINTALLAELKRLADAVERLAGPAPALNDWDAADCFVWAPLRQHLQPVRRPNRVALTLIRGVDHVRDILHENTVRFAEGYAANNVLLWGARGMGKSSLVKAVHEDVRRESSASLKLVEVHREDIASLPNLLDLLKDTPYRVIVFCDDLSFDHDDTAYKSLKAALDGGIEGRPDNVLFYATSNRRHLLPRHMMENEQSTAINPSEAVEEKVSLSDRFGLWLGFHKCSQEDYLGMIDGYADHFKLGLERDNMHAEALEWATTRGARSGRVAWQYIQDLAGRMRVHIDRA >NZ_CP054031|807380:820770|817836_818184_+|WP_003539006.1|DBSCAN-SWA MFITPAFAQSATDTATGFGGSGFEMIILFVPLMVVWYFLLIRPQRAQAKKREETLKAIRRGDQVVTGGGLVGKVTKVIDEKEVEVEIADGVRVRIVRSGISEIRVKGEPVKADAA >NZ_CP054031|807380:820770|812044_813328_+|WP_138329815.1|tRNA|DBSCAN-SWA MLDIKWIRENPEALDAALAKRGAEPLAQSLVALDEKRRSAVQKAQDLLSRRNLASKEIGAAMAQKNSELAEKLKAEVSELKTLLPAIEEEDRQLTAELNDALSRIPNIPFDDVPVGKDEHDNVVTRTVGEKPRWNHAPKEHFEIGEALGYMDFERAAKLSGSRFTVLTGPLARLERALGQFMIDLHTSEHGYTEVSSPLMVRDEAVYGTAQLPKFAEDLFRTTDGRWLIPTAEVTLTNLVREEILDQDKLPLRFTALTPSFRSEAGSAGRDTRGMLRQHQFWKCELVSITDAESAVAEHERMTACAEEVLKRLGLHFRTMTLCTGDMGFGSRKTYDLEVWLPGQNAFREISSCSVCGDFQGRRMNARYRGKDDKSNRFVHTLNGSGTAVGRCLIAVLENYLNEDGSVTIPDVLLPYMGGLTKIERAA >NZ_CP054031|807380:820770|813332_814106_+|WP_017991200.1|DBSCAN-SWA MRILLTNDDGIHAEGLAALERIARTLSDDVWIVAPETDQSGLAHSLSLSEPLRLRKISDKHFALRGTPTDCVIMGIRQVMDIKPDLVLSGVNSGSNVADDVTYSGTIAGAIEGTMQGVRSFALSQAYLYEDGARIVPWEVCETHAPALLEKLMVLDLPEGTFLNLNFPNCRPDEVDGAEVTMQGKLAFNLQVDARSDGRGFPYYWLKFGERAGAFVEGTDIHALKHNKISVTPLKLDLTDYSVTDRVARALGYGAQV >NZ_CP054031|807380:820770|811016_811844_+|WP_129420703.1|DBSCAN-SWA MSGDIEDKPQPLIEHLMELRTRLMWSIGAFFVAFIACFFFAKNLFNYLVIPYKTAVQWANLDVEKAQLIYTAPQEFFFTQVKVAMFGGLVVAFPIIAAQVYKFVAPGLYKNERQAFLPFLIASPVLFLMGGALVYFFFTPMVMWFFLSMQQAPGHDEIAISLMPKVSEYLSLIMTLVFSFGLVFQLPVITTLLARVGLLTSQWLAEKRKFAIVLAFVVAAVLTPPDPMSQIGLAIPTIILYEISIYAARLVERQRARQAVEKETGSADVAKTDSV >NZ_CP054031|807380:820770|810408_811020_+|WP_138329817.1|DBSCAN-SWA MFDIGWTELLVIAVVLIVVVGPKDLPPMLRAFGKMTQRARKVAGDFRAQFDEALREAELDDVRQTISDAQKLNPVNSLREAMNPLRQMGNEIKADLQKATTVTENKTEVPPDAVAAPTPSMSLLETPPLVATPAPSEPVAAAIVQADTVAARPKAVRKPRVKPADQVDAAAAVAVPVEKPKRTTAVRKPATPKKPAQTKKDEA >NZ_CP054031|807380:820770|814997_816599_+|WP_138329813.1|DBSCAN-SWA MRFSLSPKFGKSAGNLLVVSLLASAATGCSSDVTRFGGLFSSSGQDQITTSSIPRRNVNGSQGDPVPRADLSSSAVASQSGYGGGNDALNQPYPARQGYDPTRTSSSSARLASAPVSVQRSELAAPTAAAPSRQREKEVALAQPFPAAPQAEKPRLVAPAAPKVTPDTLTTGTTPKVSGWSATNAPSVSLRPGESIATLSRRFGVPEKEILRVNALKTASAAQPGQAILIPTFNGGNAAKAASQAADLSKPGKMPAPKAPEQNVAVVPGANSARDKTLASGDVTGKLPAGAGKDPKAPAGTYVVKQGDSLAKIAKATGSNVDDLKAANNLSASSLRIGQALKIPNGTADNIKTASIPVEKVDPKPPQPAAAQQTASVQPAPYKAPAATQTVDDVEKKSDVSSAAPESTGIGKYRWPVRGQVIASYGANVNGNRNDGIDISVPQGTPIKAAENGVVIYAGNGLKELGNTVLVRHDDGTVTVYGNADTLSVARGQKIQRGQTVAVSGMSGDVKQPQVHFEVRKDASPVNPMTFLE >NZ_CP054031|807380:820770|808412_809270_+|WP_020049690.1|DBSCAN-SWA MNTVKGTERPQAATPMDKLWQDNGAERASHEPALVIDVAGFEGPLDLLLYLARNQKVDLSRISVLALAEQYLLFIESARRIRIELAADYLVMAAWLAYLKSKLLIPQQVKDDGPSGEELAATLAFRLKRLEAMRQAADGLVNRNRLGRDIFVRGAPEHIPDRQQSAYAASLYDLLTAYAALRQRHAVTQVTIERRTVWSLTDARELLTQMIGEVADWTAMEHYLLRYLAAPEERVTAIASAFAASLELVREGKLEIRQDGAFQPIYMRRGPKHATLQVVEQERPA >NZ_CP054031|807380:820770|809266_809983_+|WP_138329819.1|DBSCAN-SWA MIDLKGEEDFEGNFEDRSRDLQAEIEAERIAEALVFASSQPVSEGFLAERLPEKTDVHAIMLRLKEQYAPRGVNLVQVEGAWAFRTAADLSFVIRRDDNEVKKLSRAALEVLAIIAYHQPVTRAEIEDIRGVQTSRGTLDVLMEAGWVRFRGRRRTPGRPVTLGTTRDFLDHFGLEELRDLPGLEELKGAGLLSGRIPATFNIPSPLMNDELTEDEDPITQMDLEELGLLAPRGTSED >NZ_CP054031|807380:820770|807380_808394_+|WP_138329821.1|DBSCAN-SWA MTESKAMILGCSGLALTSEEKAFYRGERPWGFILFGRNISEARQIADLVAELRDSVGWHAPVLIDQEGGRVQRIRPPILARYPSGHALGDLYRRDRALGLRAAWLMSRLHAFDLSSLGIDVDCLPVLDVPVEGSSNVIGDRAYGGDPQTVIAMGRAAAEGLKAGGLLPVMKHMPGHGRGFADSHLELPVVTVPRDELELHDFPPFIAMKDELMAMTCHVVFAAIDPDNPATTSRKVIDGIIREHIGFNGLLLSDDSSMNALSGTIGERAANIIAGGCDIVLHCNGNMDEMREVVANVPPLAGISLARAKAVEAGFAAPDTADEAELRAEFEAMFATV |
13 | uncultured_Mediterranean_phage(90.91%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
848529 : 939654
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP054031|848529:939654|DBSCAN-SWA GTCAGAAAGCCAGGTGCGGCGATTCCATCGCCGCAGCAAGCATCGCGGCGTAATCGGCCTTCGGAACGTCGATCGCTCCGAACGTCTTCAGGTGCTCGGTGGTGAACTGGGTGTCGAGCAGGGTGAAGCCCCTTTCCCGCAGCCGGTCGACCAAGTGAACGAGGCAGATCTTCGAGGCATCTGTCCGGCGCGAAAACATGCTTTCGCCGAAGAAGGCCGAGCCGAGCGAGACGCCGTAGAGGCCGCCGACCAATTCATCGTTCTCCCAGGCTTCGACGGTATGGGCATGGCCCATGTCGAAGAGCGTCGAGTAGAGCGATCTGATCGTCCTGTTGATCCAGGTGCTCGGGCGCCCGGACGTCTCCTCCGCACAGGCGGCGATGACCTGATCGAAAGCGTGATCGAAACGGATCTCGAAGGGTTTCCTGCGCACTGTCTTGGCAAGGCTTTTCGACACATGGAAATGGTCGAACGGCAGCACGCCGCGCAGTTCCGGTTCGACCCAGAAGATCTCCGGGTCGTCGGCGGATTCGGCCATCGGGAAGAGGCCGATGGAATAGGCGCGCAGGAGAATGTCAGGGGTTATGCCTGGTGACTTCCTGCGCGATCCTGCCATTCCTGTCCTATGCCGCCGGTGTCTTGGCGAGGTAGTCTTCCAGCCAATGGATGTCGTAGTCGCCGTTGGCGATATCCTGGTTGGAGACAAGATCCTGGAACAGCGGCAACGTCGTATTGATGCCGTCGACGACGAATTCGTCAAGCGCTCGGCGCAGGCGCATCATGCATTCGACGCGGGTACGGCCGTGCACGATCAGCTTGCCGATGAGGCTGTCGTAATAGGGCGGGATCTTGTAGCCCTGATAGGCACCAGAATCGATGCGCACGCCGAGGCCGCCCGGCGCGTGGAAATGCGTGATCGTGCCGGGCGAAGGCACGAAGGTGCGCGGGTGCTCGGCATTGATACGGCACTCGATGGCGTGACCGGAAAAATGCACTTCGTCCTGCGTCACCGAGAGACCGCCGCCGGAAGCGACACGGATCTGCTCGTGCACGAGGTCGATGCCGGTGATCGCTTCGGTGATCGGATGCTCCACCTGCAGGCGGGTGTTCATTTCGATGAAATAGAACTCGCCGTTTTCGTAAAGGAATTCGATCGTACCGGCGCCGCGATATTTCAGCTTCTTCATGGCGTCGGCGCAGACCTGGCCGATCTTCATGCGCTGCTCAACGTTGAGCGCCGGCGAATTCGCCTCTTCCCAGACCTTCTGATGGCGGCGCTGCAGCGAGCAGTCGCGCTCGCCGAGATGGATGGCATTGCCTTCGCCGTCGCCGAACACCTGGATTTCGATGTGGCGTGGCTTGCCGAGATATTTTTCCATGTAGACGGCATCGTTGCCGAAGGCGGCTGCCGCTTCCGTGCGCGCCGTCGACCAGGCTTCGATCAGGTCGGCCTCGCTCTTGGCGACCTTCATGCCGCGTCCGCCGCCGCCGGCCGTCGCCTTGATCAGCACGGGATAGCCGATTTCAGCGGCCGTCTTCAGCGCATCTTCTTCGGTCTTCACTTCGCCGTCCGAGCCCGGAACGACGGGAATGCCGAGTTCGAGCGCCGTTGTCTTGGCGGTGATCTTGTCGCCCATGATGCGGATATGATCCGCCGTCGGCCCAATGAAAGTGATGCCGTGGGCTTCGAGGATATCGGCGAACTTGGCATTCTCCGACAGAAAGCCGTAGCCCGGATGCACGGCGTCAGCGCCGGTGATCTCGCAGGCGGCGACAATCTGATGGATGTTGAGATAGCTCTCGCGCGAGGAGGGCGGGCCGATGCAGACACTTTCGTCGGCAAGACGCACATGCATGGCATCGGCGTCGGCGGTGGAATGAACCGCGACGCAGGCGATGCCGAGCTCTTTGCAGGCCCGGAGCACGCGAAGGGCGATTTCCCCGCGATTGGCGATGAGGATTTTCGAAATCATGGGCATCGGCCTATTCGATGACGACGAGGGCTTGGCCGTATTCGACGGGATGTCCGTCATCGACGAGGATTTCGGTCACCTTGCCTGACTTCGGTGAGGGAATCTGGTTCATTGTCTTCATCGCTTCGATGATGATCAGCGTCTGGCCTTCCTTGACGGTCGCGCCGACTTCGATGAAGGGACGCGCGCCCGGCGCCGGCGCCATATAGACGGTGCCCACCATCGGTGCATTGACGACATTGGCCGGGTTGCGGCTGGGAGCTGCGGCTGGCGCGGCAGCGGCTGCCGCAGCGGGCGCGGCAAAGGCCGGTGCCGCAATCGGCGCCTGAACATATTGCGGCGTGCCTGCACGCGATACGCGGATACGCAGGTCGTCCTGCTCGACCTCGATCTCCGTCAGATCCGTTTCGTTGAGAATATTGGCGAGATCGCGGATCAGTGCCTGGTCGATACCCGATTTCTTTTCAGCCATGGTGTTGCCCTGTCTTATTCTTATGCCGTGATGTTCTTCAGCGCGTGGAGCGCCAGGATGTAGCTGTGAGGCCCGAAGCCGCAGATCACGCCTTCGCATGCCGGCGCGATCATCGACTTGTGGCGGAATTCTTCCCGTGCATGCACGTTGGATATGTGGAGCTCGACGACGGGAATGGAGATAGCGCGGATCGCATCATGCAGCGCGACCGATGTATGCGTATAGGCGCCTGCATTGATGGCAACGCCGACGGCCTTTTCGTCGGCCTCATGGAACCAGTCGACGAGCGTGCCTTCGTGGTTGCTCTGACGGAAGTCGATATCGAGGCCGAGTTCGCGGCCTGCAGCCTTGCAATCTACCTCGATGTCCCTGAGCGTCTTGCCGCCATAAATGCCGGGCTCTCGTTTGCCCAGCATGTTCAGGTTGGGGCCGTTCAGGACAAAAATCGTTTGCGTCATCGAATGTTCCGTAAAATGTACGACGGCAACCTATAGACTCCGCGAAGGCTGAATGAAAGCCCTTACGGATTGGCCAGCGGCAATCGTGCCACATTTGTCCACATGGTTGAAACACTTCGATTTTGGCGATTTGATGGGCTGCTGGCTGGCGACTGGTTTGCTCAGCAGGCGGTCTTGCCGCAGCTGCGCATGTTCTTGACCTTGGCTTCGAGATCGTCGAGCCCGACGGCGCCGGGCACCAGTTCGCTGCCGATCACATAGGAGGGCGTGCCGCTGATGCCGAGACTGGAGGCGAGCTGGTAGGTCGCCTGGACTATACCGTCATTCGGGCTCTTTGCCATCTCGGCGCGGATCTTGTCCTCACTGACGCCGAGCGAAGCGGCGACCGCGACCGCTGTTTCGTCGGAGGCGCGGCCTTCGGTGCCGAGAAGGGCGACGTGGAAATCGGCATATTTCTCCGGTGCGAGCTTGCGGAAGGCGTCTGCCACCTTGTGGGCGGCGACTGAATCCGGCCCGAGGATCGGGAATTCCTTGAGGACGAAACGGACGTTTTTGTCCTTCTTCAGCATCGCCTGCATGTCGGGAAGCGCATGGCGGCAGTAGCTGCAATTATAATCGAAGAATTCGACGACAGTGACGTCGCCCTTGGGATTGCCGAGCGTGACGTCGTTCTTCGATTCGAAGATGTCGGTGGCGTTCTCTTCGATCGCCATATTGGCCTTAACCGACCGCTGGGCCTCCTGCTTCTTCTGCAGCGCATCCTGAACTTCGAGCATGATCTCGGGGTTCTCGATCAGGTATTGCTTGATGAATTCGCCGAACTCCTTCTTCTGCTGGTCGTCGAGCGCTGCCGCCGGAAGCGGAAGGGCGATCGATGCGGCAAGCGCCAGTGCGGTGAAGGTCTTCGGGAAGAATGCCATGATATCCTCGTCATCGGCGATATGGTCGTCTCTTCATCGAAAAGACCGTCGCCGGGTTAATGGACTTGAATGCGTCGCGTCAAGGTACGGTGCGTTCGGTTCTGGTCAAACCAGAATTTTTCGTCCCCGCGCGGCCCTCGCGGGATTGTGATGAGCCGATTAATGCGGCAATTGTCGGGCCGTGATTGCAGAAGCTAAGCGAGAAACGATCTTGTTTTCCATATCGAAACGCAGCGAAGTCGAGCCTTTTCACGCCATGGACGTCCTGGCGGAGGCGACGAAGCGGCGTGCGGCCGGCCATCCCGTGATCTCGATGGCGGTTGGCCAGCCTTCGTATCCGGCGCCTCTAGCGGCCCTCGAAGCGGCGCGCGCAGCCCTTGCCGAAGGTCGGATCGGTTATACCGATGCGCTGGGGACGGCACGGCTTAAATCGGCGCTCGCCTGGCACTACAAGGATCGCCACGGCCTGGAGATTGATCCCAAGCGCATCGCCATCACCACAGGTTCCTCGGCCGGCTTCAATCTCGCCTTCCTGTCGCTCTTCGATGCCGGCGATGCGGTGGCGATCGCAAGACCCGGTTATCCCGCCTATCGCAATATCCTCGGCGCGCTGGGCCTGAAGGTCCTCGAAGTGCCGGTAACGGCCGAGACCCACTTCACCCTGACGCCGCAAAGCCTCGAAGCGGCGCAGAAGGAAAGCGGCATGAGCCTGAAAGGCGTGCTGCTTGCAAGCCCCGCCAATCCGACTGGCACGGTGACCGGCCGCGAGGGGCTGGAGGCGCTTTCGGATTACTGCGCGGCCCAATCCATCGCTTTCATCTCCGATGAAATCTACCACGGGCTGACCTTCGCCGGCGAGGAGGCGAGTGCGCTGGAGCTGACCGACGAGGCGATCGTCATCAACTCCTTCTCTAAATATTATTGCATGACCGGCTGGCGAATCGGCTGGATGGTGCTTCCCGAGCGCCTGGTTCGGCCGATCGAGCGGGTAGCGCAGAGCCTCTATATTTCCCCGCCCGAGCTTTCCCAGATCGCCGCCACGGCCGCTTTGAGCGCCGGCGCCGAGCTCGATCGTTACAAGGCCAGTTACGCCGCCAACCGCGATTTGCTGATGCAGCGCCTGCCGCAGATCGGGCTTTCGATCGCCTCGCCGATGGACGGGGCTTTCTACGCCTATCTCGACGTCACACGCTTCACCAATGACAGCATGGGCTTTGCCAAACGCATGCTCGCGGAAATCGACGTCGCCGCAACGCCCGGTCTCGACTTCGATCCGCTGGAGGGACATCGAACGCTGCGTCTGTCCTATGCGGGCTCAGAAGCGGAGATCGCCGAGGCGGTGGAGCGAATCGCGGCTTGGCTGAAATAGCCCTCGCGGTTGCTTATTGCTCAGCTGCGCCGCGAAAACAGCGAGGCGAGCGTCGAAGTCGGTTTGTTGTTCAGGCGCGGCACCACCACGAGCTGCTTGATGTCGCCGCGCTTGTAGGCGGCGAGGAAGCGGCGGTAATCCTTGAAGGAGACGCAGCCGTTGGAATCACCGTTCTTGCCGAGCATGTAGGTGTGGGCGAGCAGACCGACACGGTCATAGATGGCGTCCGAACCCTCGATCGGTGTTAGCCGCAGCGCCTCGACGCCGTGGAAAAGGCTCTCGCGCATGGTAAGAACATAGGTGTGCGGCGGCGTCGGTCCGCGCATCTTCTTATTGGCGAAACGCGGTTTGTCGCGCATCTTACCGAGCCCGGAATGGGCCTCGAGCCGCTCGCCGTTCGGCAGATAGACGATGCTGTTTTCGATATCGTAGATCGCCACGCCTGCGCGCAGCTTCGGCGAGAAGACCGGCTTGTCGTAGCGCGGCACTGCATCGTCCTCGTCATCCTCGATCGGCGAGTTCGGCTGGGCGTAGGCAAGCACAGGCTCGGCAGAGCGCTCAGAACCCTTGGAGGCCGGTGTCGGTTTGCCGATGAGTCCGTCAGGACGCGCCATCGGCAGCGGCACCGAACCTTCGTCCGGCGCCAGCACGAGATCGAAGGGTGGCGCGTCTTCCGCGGCGGCGACGACCACCGGCGCTGGTTCCGATGCGGGAATGGAGGCCGTTTTGAGGGGAGATGCGGTTGGCTCGACCGGAGCCGGCGCGATTTGCCGGGATCCGGCATCGGCAAGAGCAAGCAGGGTCGGGAGTTCGGCTGCGGCAACCGCCTCCTTCGAGGGGACGATGATCCGGTCACCGTCGTCTTCTTCCGTCTCTGCCGAGGCGAGCTCGACCGGCGCCGCCGCGATCACGTCGTCCTTGCGGAATTTGGCGCTGTCGGAGATCGCCGCCACGACCGGTTGTTCGGCAGCCATGGCGAGCCGGTTGTTTTTGGCGAGAACCGCAAGCGCCGTTGCAGCGGCGGCGGTCTTTTCGGCATGGGATTGAACGAGGCTCGCCGTCAGCGGGGCCTTCGGCTTCGCATCGAAGGCGGCCAGCCGATCGGCCTTGCCGACATGGATCAGCCTCTCATGGCGCGGTGAAGGCGCCACCTTCGGCGCAGCGCCGAGCTGTGCCAGGCGATCGGACGGAGAATAGGGAGCGGCTATCGAATGCATCGTCGCGAAGGTCGCGATCAGCCATGTGGAGGTCAGAAATCCGGCGCCGACAACACCGTAAAGGAAACTGCGAGATCTGCGCCAACGGAAAGCAGATCCGCCAAGAGGGGTGACGTCATCGAACGTCCCGACCGCACACGCCATACTACACTCGTCTTTCAAACTCGCTACGCAGGCCGGCGGCACTGACTGACTACACTTCGCCGGGGCCGGTACAGGCGCAAATGGGCCGAATTTAGACCACCCACGACGTCCGACTGTCTCGCAAGGATGACGAATTATGGTTTCTAATTTGTTTACGCGGCGTCGCTCTGTCTCTCGACGTCTCAAAAAGAGATGTGTTGGGCCTTCTCAGGCGCGCTCCATCACATAACTTCCCGGCGCTTCCTCGATTGCGTTCAACGCATCGCCGCCAGGTTTGCGGGCCGGCACGCGTCTGCCCTCATGCGTCTCGATCCAGGCTTGCCAATGCGGCCACCATGATCCCGGCGTCTCGGTCGCGTGCTCGAGCCAGGTCTCGTAGTCTCCCTTGGCGGGGCCGCCCGTCCAGAATTGATACTTCTTCCTGTCGGGCGGATTGACGACGCCGGCGATGTGTCCGGAGCCGGTCACCACGAATTCCACCTTGCCGCCGAAGAACCGGCTGCCGAGGAAGACGGACTTGGCCGGTGCGATGTGATCCTCGCGGGTGGCGAGATTATAGATCGGGATCTTGACGTCCTTCAGCGACACGGATTTGCCGTCGAGGATCATCTCGTTCCGCGTCAGCGCATTCTTGAGATAGCAATTGCGCAGGTAGAAGGCATGGTTTGCCGCTGCCATGCGGGTCGAATCGGCGTTCCAGAACAACAGGTCGAAGGGCAGGGGATCCTGGCCCTTGAGGTAGTTGTTGACGAAATAAGGCCAGATCAACTCGGAGGCGCGCAACATGTTGAAGGCCATCGACATCTTCGTGCCGTCCAGATAGCCGGCCGCCTGCATATGCTCTTCGAGCGATTCAAGCTGCTCCTCGTCGACGAAGACCTTGAGGTCGCCGGCATGGGTGAAGTCGACCTGGGTGGTGAAGAGTGTCGCGGTTTTGATGCGCTTGTTCTTCTCCTTGGCATGCAGCGCCAGTGTCGCGGCAAGCAGCGTGCCGCCGACGCAGTAGCCGACGGTGTTGATGTCCTTCTCGCCGGTCGCCTTCTCGATTGTATCAAGCGCGAAATCGATGCCCTCGCGGGCATAGGCCGCCCAGTCCTTGTCGGCGTGGCGCGCATCCGGGTTGACCCAGGAAATGACGAAGACCGTCTGACCCTGGTCGACACACCATTTGATGAAGGATTTTTGCGGATTGAGGTCGAGAATATAGAACTTGTTGATCCAGGGCGGGCAGATCAACAGCGGCCGCTTCAGCACCGTTTCGGTCGAAGCCTCGTACTGGATGATCTGGCAAATGTCGTTCTGGGCGATCACCTTGCCTGGCGTCAGCGCCATGTCGCGGCCGACGGCGAATTTCGTCATGTCGGTCTGGCGAAGGCGAAGGTCGCCATGTCCGGCGGCGATATCCTCGGCCAGCATCTTCATCCCGCGCACCAGATTGGCGCCGCTGGTCGCGATCGTCTCGCGGTAGAGCTGCGGATTGGTGGCGATGAAGTTGGTCGGCGAAAGCGCTGCCGTGATCTGCTTCACGTAGAAGCCGGCCTTGTGTTTGGTGTGCTCGTCGAGGCCCTCGGTCTCCGACACCATCTTTTCCACCCAGTCGGTCGTCACGAAATAGACCTGGCGGAGAAAATCGAAGAACGGATTCTTCTGCCAGTCCTCGTCCGAAAAGCGCTTGTCCTTGCGGGTGTCCGGCTCGGGTGGCGTGGGATCGCCTTGCATGCGCTGCATCGAGCGCATCCAGATGCCGAAGAACGAGGACATCAGCTGCGTCTGTGCCTCGAAGGTGCGGCGCGGATCGGAAATCCAGTATTCGGTGACCTTGGAAAGCGTCTTGACCATGTCGGTCATCGGATCGGCGGCGCTTTCGCTGATCTCGCCGCGTTCGCGCGGCGCAAGCCAGGCAGACGCTGCTTGTCCGAGATTTTCGAGCGCCCGGGCAAAATTCATCGCCATGGCCTCAGGATCCTTCAACAGATAGGGATCGAGATCGGTCGCGTCGAAGCCGGCCTTGCCGCCATTCTCCTGCTTGCTGTCGGTCACCATTTCCTCCGGCGGCAGCACCACTCTTTTTATCCATTCGTTGTACATCGTGATCGAACGAGAAAACAAGTTCCACCCGCGTCGGCTCATGCTATTGCTTTGTGGCTGCGACAAATTTAACAGTCGAAGCAGTCAAAACTGGATTTTTGAGGCATGACGAAGAACGGCATTCGGCGCATCGCAACCTTCAACCGCACCGCACTGATCTGCGCCGCGGCGGCGATGGCCCTTGCCGGATGCAATCTGACGGCGGAACAAAAAGCCGCAGCCGCCGCCAAACGAGCGGCGCCGACCGCCGTCGTCATGCCGGCCACCAAGGGCGAGGCGGCCGAGGGCGGCCTTGCAAAATACCCGGACGGCTATCCGAATTTCGGCGCGCCGCTGACCGCCGCCAATGTCCAGATGAGCGACGAGCAGGCGGCCGAACTGCAGCATCAGCTGACGGCACTTGGCGCCCGGCGCAAGGCCGGCACGATCTCGGAAGCCGAATATCAGGCCAAGGTCGCCGAAATGCGGCGTCTGGCCGCCGAGCACGGCACTGATACGCTCTCGGAAATCTCCAAATAGACCTTGCCTTTCAGGCAATTTGGACGCAAAGCCATTCCGAATGGATCGGAAGATGCAATTTTCCTGATAATGCGCGTTACGCGGCCTTTCCGTCAAAGATTCGCCGCGTCGCTTGGGTCTGATGAATACGGCGTTGCGATTCTGAAGTTCGTGTCGCAGCCGCCGGGAGACGGAACGAACTTCGACTGCGGCCCGAAATGCCGCCTTTCCATGGGGAATGAGATGGAAGAGTTTCACAAAGTCCGGCGTTTGCCGCCTTATGTTTTCGAACAGGTCAACCGTTTGAAGGCAAGCGCGCGAGCGGGCGGCGCCGATATCATCGATCTCGGCATGGGAAACCCCGACCTTCCCACTCCCCAGTCGATCGTCGACAAGCTGTGCGAGGTCGTGCAGGATCCGCGCACGCACCGTTATTCCTCCTCCAAGGGCATTCCGGGCCTGCGCCGCGCCCAGGCCGCCTATTATGCCCGCCGTTTCGGCGTCAAGCTCAACCCGGATACCCAGGTGGTTGCCACCTTGGGCTCCAAGGAAGGCTTCGCCAACATGGCGCAGGCAATCACCGCGCCTGGCGACGTGATCCTCTGCCCGAACCCTACCTATCCGATCCACGCCTTCGGCTTCCTGATGGCGGGCGGCGTGATCCGTTCCATGTCGGTGGAGCCGGATGCGAGCTTCTTCGAGCCGCTCGAGCGTGCCGTCCGGCATTCGATCCCGAAGCCGCTGGCGCTGGTTCTCAATTATCCCTCGAACCCGACGGCCTATGTCGCGACGCTCGATTTCTACAAGGACGTCATCGCCTTTGCGAAAAAGCACGACATCATCGTGCTTTCCGACCTTGCCTATTCGGAAATCTACTTCGACGGCGCCCCGCCGCCGTCGGTTCTCGAAGTGCCGGGCGCGATGGACGTGACCGTCGAGTTCACCTCGATGTCGAAAACCTTCTCCATGCCCGGCTGGCGCATGGGCTTTGCCGTCGGCAACGAGCGGCTGATCGCGGCGCTCACCCGCGTCAAGTCCTATCTCGACTACGGCGCCTTCACGCCGATCCAGGTGGCGGCGACGCATGCGCTGAACGGCGATGGCTCCGACATTGCGGAAGTCCGCAACGTCTATAAGCGCCGTCGCGACGTCATGGTCGAAAGCTTCGGCAAGGCCGGGTTCGACGTGCCGCCGCCGGCTGCGACGATGTTCGCCTGGGCGAAGATCCCGGAAAAGTTCCGTCATCTCGGTTCGCTAGAATTCTCCAAGCTGCTGGTCGAGAAGGCCGACGTCGCCGTTGCCCCGGGCATCGGCTTCGGAGAAATGGGCGACGACTACGTCCGTCTGGCACTTGTCGAGAACGAACACCGCATCCGCCAGGCTGCGCGCAACATCAAGAAGTTCATGTCGACCGCAGACGAGACGATGCACAACGTCATCTCGCTGAACGCACACCGCTAATCCAACCGTTGGGCAGCCGCGTCATGCGGCTGCCGCCACACAGACATTTCAGGATCGATCCATGGCAGATGCCCTCAAAATCGGCATTGCGGGCTTGGGCACCGTTGGCGCCTCGCTTGTCCGCATCATTCAGCAGAAAAGCAACGAGCTTGCCGTTACCTGCGGGCGTCCGATCACCATCACGGCCGTCTCGGCACGTGACAGGGCGAGGGACCGCGGCATCGATCTTTCCACTGTTACCTGGTTCGACCGGCCGGAAGAGCTTGCCGAAAAAGGCGATATCGACGTCTTCGTCGAACTGATGGGCGGCGCCGAAGGGGCTGCCAACGTCTCCGTGCGCGCTGCACTCCAGCGTGGTCTCCATGTGGTGACAGCCAACAAGGCGCTGCTTGCCTATCACGGCGTCGAGCTTGCGACGATCGCCGAGGAGAAGGGGTCGCTGCTGAACTTCGAGGCGGCAGTCGCCGGCGGCATCCCGGTCATCAAGGCGCTGCGTGAATCGCTGACGGGCAATTCCGTCTCGCGCATCTATGGCATCATGAACGGCACCTGCAATTATATCCTGACCAAGATGGAGAAGGAGGGGCTTTCCTTCGCCGAATGCCTGAAGGAAGCGCAGCGGCTGGGTTATGCCGAGGCCGATCCGGCCTTCGATATCGAGGGCAACGACACGGCCCACAAGCTTTCCATCCTGACGACGCTCGCCTTCGGCAATCAGATTGCGGCCGACGACATCTATCTCGAAGGCATCACCAACATTTCGATCGAGGATATCCACGCCGCTGCCGAGCTCGGTTACCGCATCAAGCTCTTGGGCGTTGCCCAGCGCACCGATACCGGCATCGAGCAGCGCGTCCATCCCACAATGGTGCCGGTCGATTCGGTCATCGCCCAGGTCGACGGCGTTACCAATGCAGTGGCGATCGAATCCGACGTGCTCGGCGAACTGCTGATGGTCGGCCCTGGCGCCGGCGGCAGCGCCACGGCCTCGTCGGTGCTTGGCGATATCGCCGATATCGCCAAGAGCCAGCCAGGCGCCCAGCGCGTGCCGGTGCTCGGCCATCCCGCAGCGACGCTGGAGCCCTACCGCAAGGCGCAGATGCAGAGCCACGAGGGTGGCTATTTCATCCGCCTGACCGTGCTCGACCGCACCGGCGTCTTTGCCAGCGTCGCAACCCGCATGGCCGAAAACAACATCTCGCTGGAATCGATCGTCCAGCGCTCCAAGCAGCACCTGGCGCCATCGCACCACCAGACGATCATTCTCGTCACCCATGCGACGACGGAAGACTCGGTGCGCAAGGCGGTCGCCTCGATCAAGTCGGAAGGTTACCTCTTCGGCGAGCCGCAGGTGATTCGCATCGAGCGGCCGAAAGAGGAAGGCTGACCTCTTTTTTGCCACAATCCGGGATAGCTGACCCGCTCTGTGCACGCAGGGCGGGTTTTTTTATGCCTGTTTAACCGATCTCACGAAAACTTATATTAACAGATACCGAAACATTGATATAAATAAGCCGGGAGAGGGAGTCGCCGGCTTTGGAGAATTTCGTGAACAAGGACAGCCATGCCGGGAAGGGGTATGGTCGTTCGGAAGAAGAACCTCCCGATCATCCTGATGTGTTTGAGGAAGTTCGCGAAACGGACCATGATGTGGTTCGTTACCAAATCCCCGGGGCCTCCAGCGAGGCCCCTGATGACGACGAAAGCCCTGAAGGCCGCAACTGGGCCATTCTTCTTACCCTGTTTTCCGTCATTCTCGTGCAGATAACCGGCATCGCCCTGATCGTCTGGGCGGTATTCTGGTAAAAATCCGTGTCTGCAAATCCATCGCTATGTTATCTCCGCTGCTCCTGTAGAAGAGCAGATGAAGTCAGCGGAGTGTGGCATGGCAGATCTCGTCGACCCCGTACAGCGCGCGTTTCTCGGCGTGGAGAAATCCGCGCTCGACAATCGCTGGGTCCCGCGGCTCGATCAGGCAGGGCAGAACCGCGCGCTGGCCATGTCGCAGATCCACGGACTGCCGGATCTGATCGCCCGGGTGCTCGCCGGGCGCGGCGTGACGGTAGACGAGGCGATCGAATTCCTCGACCCGACGATCCGCAGCCTGATGCCCGATCCCCATAGGCTGACCGATTGCGAAAAGGCCGCGACCCGCCTCCTGCGCGCGATCGAGCGCGGCGAGAGCGTCGCTATCTTCGGCGACTACGACGTCGACGGCGCAGCCTCCTCGGCACTGATGTTCCGCTTTCTCAGCCATTTTGGCGTCAGGGCGACCATCTATATCCCCGACCGAGTCTTCGAAGGTTACGGCCCCAATCCGGCGGCGATCAATCAGTTGATCGATAACGGCGCTCAGCTGATCGTCACCGTCGATTGCGGCTCCACCAGCCACGAGGCGCTGGCCGCCGCTGCCGCGCGCAATATCGATGTCGTCGTCATCGATCACCACCAGGTTACACATGAGCTGCCGCCCTGCCATGCGCTGGTCAACCCGAACCGCGAGGACGATCTCTCGGGGCAGGGGCATCTTTGCGCGGCTGGCGTCGTCTTCATGGTGCTGGTCGCGACGCTCCGGCTGCTGCGCGCGGCCGGCAACAAGCGCATCCTGGCGATCGACCTGCTGCAATGGCTGGATATCGTGGCTCTTGCGACGGTCTGCGATGTCGTGCCGCTGAAGGGCCTCAACCGCGCCTATGTGGTGAAGGGGCTGGTCGCTGCCCGCCACCAGAGCAATGCCGGGCTTGCCGCACTGTTCCGCAAGGCGGGGCTCGCCGGCCCAGTGACACCCTATCATTTCGGCTTCCTGATCGGCCCGCGCATCAATGCCGGCGGGCGGATCGGCGACGCGGCGCTCGGAAGCCGGCTGCTAACGCTTGATGACGCCGGAGAGGCTGATGTGATCGCGCAGCGCCTGGACGAGCTCAATCGCGAACGCCAGGCGATGGAGGCGATCATGCTGCAGGAGGCCGAAGCCGAGGCACTTGCCGAATATGGCGACGGCGAGGGCGCCTCCGTCATCGTTACCGCCCATGAGAAATGGCATCCCGGCATCGTCGGGCTGATCGCGGCGCGGCTGAAGGAAAAATTCAAACGGCCGGCTTTCGCGATCGCCTTCGATCCCTCCGGCCGCGGCACCGGCTCGGGCCGCTCCATCAACGGCTTCGACATGGGCAGGATGGTCCGGGCTGCTGTCGACGAGGGGCTGCTCGTCAAGGGCGGCGGCCACGCCATGGCCGCGGGTCTGACGGTCGAGCGCGCCGATCTCGGCAGGCTGAGGACCTTCTTCACGGAGAAGGCCGAGAGGACGGTGGCCAGCCTCGTCGCCAACGAAACGCTGAAGATCGACGGCGCGATCGGCGCAAGCGGCGCCACCCTCGAACTCATCGACCGCCTGGAGGCCGCCGGACCCTATGGCTCTGGCCACGCACAGCCGCTCTTTGCCGTGCCGGCGCATAGGGTGCGAGACGCCCGCCTGGTCGGCGAGAAACATGTGAAGGTGACGCTGGAGGCCATGGACGGATCGCGGCTCGACGGCATCGCCTTTCGCGCCGCCGATACGCCGCTTGGCAATCTTCTCATCAATTCACGCGGTGCCAGCCTGCATGTCGCCGGTTCGCTCGGTGCTGACCACTACCAGGGCGCACGCCGCATCCAGCTCCGCGTCTGCGATGCTGCGCCGGCGAAATAGAGTAGTCGCGCAAGAGCGCTGACGATGGGAAGTGCCAAGGGGAGAAAGTGGTACGCCCTAGGGGAGTCGAACCCCTCTTCCCAGAATGAAAATCTGGTGTCCTAACCGATAGACGAAGGGCGCTTCCTTCGTGGTGGCGCCCTTATAGTCAGCGCCTTCGAAGGCCGCAAGCGCCAAGGCGCGAAAAAATCATCGTTTTCGCCGATTTATTTCGGCTGTGGAAAATCACGGTACCGCTTACTGCCGACGCGCAGCACAGGCTGGTCAACGACGTGTATCGATGCACCGAGAGATATGCCGAAACGGCATTTCTGAAAGGCAATCGACGAAACTAGTGATAGCTGCCCTGAGTGCTATGTTCGATATGCGACGGGAGTGAGAAATGTCGCTTTGGGAGATTTTCAAGCTCACGAATATGAGAACGGCAGTCGGACGAACGCAGGCTCTCTGTGCCGGCGTCTCGTCGGCTCAGGAGGTTTTATGGTGTCGACACGGAACTGGAAAGTGGCGTTGGCGGGCGAGTGCATGGTGTGTCGCCCATTTGCCATGCACGACGAGCCTCAATTTACCGAGGTTTGGGACGTCATCAAGGACGCAGACGTCACTTACGGTCATTTGGAAATGAATTTCGCCGACTACGACGAGCTGAAGTGGCCGGCCCGAGGCCAAGGCATCGGAAGTTTCATGATGGCAGACCCTGAGATTGCCAAGGATCTCAGGTGGGCCGGCTTCGATATCATGTCTACCGCGCACAACCATAGTTTCGATTTCGGAGCCGAAGGTCTGATCGCGACCAAGAAGCACATGAAGGCCGCGGGCATCGTCACAGCCGGCACCGGCGCGGATCTCGAGCTCGCCAGCGAGCCGGGCTACGTCGATAAGAAAAACGGACGTGTCGCACTCGTCTCGACAAGTTCCGGCAATCAGCATTTCATGTGGGCGGGCATGCCGAAGGGTGCGCTACGCGGCCGCCCTGGCGTCAATCCCCAACGCCTCACCTTTGAATTCATGGTGGACGAAGAGACCGCCAGGAATCTCAAGGACTTTGGCGAGAAGTTCAATTTCATGAAGAAGCCGAAGCATGGCCGTGAGGGGTCTTTCGGCGTTCAAATTCCCGGCGCCCAGCAATGGGGCGATCCCGAATCCTTCTTCGTGGGCGACCGCTGCGAAGTCATCAGCCGTTGCCACAAACGCGACCTCGAACGCAACCTTCGTTCCGTCCACGAGGCGCGCTCCATGGCAGACCTGGTGATCGTGGCGCATCATTTCAGCGTCTCGGATGGTCCGCGGGGCGATACGCCGCCTGGCTTCGTCAAGGAGTTCGCTCACGCGGTCATCGACGGCGGCGCCGATATCTACATCGGCCACGGCTGGCATCGGACATTGGGCATCGAGATCTACAATAGCAGACCGATATTCTACGGCATCGGTAACTTCTTTGCTCAATCCGAGTTCATCCAGCGCGTTCCCTATGACAGCTACGACGCCTGGGGACATGACGTCGACCGTCTCTCAATGCTGACGCCCGCCGCCCATCCACTTCATCCCGGCCTCGATACGCCCACCGACACCTGGTGGAGCTCCGCCGTCATCAAGCTCGATATGGAGGGGGAGAGGCTGAAACGCATTCTGCTCCATCCGGTTGAAATGGGGCGCGACACCTCGCCTGATGTCAAACAGACCCGGCGCACCGGCAAGGGCGACCATCACAATACCGAAGGGCGGCCGATGGTTGCCAAGGGAGAGGATGCCGTGCGCATCATCGATCGCTATCGTCGCTTGTCTGAACCGTTCGGAACGAAGATCGAAATCAGAAGTGGCGTCGGCGTTATCGAGCTGTAACGAGAATACGCCTGCTGCGGTGGAGGCCGCGGTAGGCAACACATGCGAATAGAAGAACTGTGCCTGTTAGCGGCCAGACATTCGAAAGGGAGGTGACGAATGCTGCATGTCAACGGCGCTGTCGTTGGTGACGGAGCTGCGTCGACCGCCGAAAGCTCCACGCCCTTGCTGGACGTCCAGGATCTGAACGTCTCCTTTCCAGGTCCGCAAGGCGCTGTCCAGATCGTGCGAGGTCTGAGCTACAGCATCGCGCGCGGCAAAACCTTGGGAATCGTCGGCGAAAGTGGCTGCGGTAAGACGATGACGTCGCTTGCTCTTCTCGGGCTTGTTGCCGGCGGCGGCAGGGTCGACGGCACCATCGATTTCAACGGCCGAGATCTGGCAAAGTTTACGCAGCGCCAATGGCAGCAGGTCCGCGGCCGCGAGATCGCGATGATCTTTCAGGAGCCGATGACTGCCATGAACCCGGTGATGCCGGTCGGACGGCAGATCGCCGAGGTTCTCGTCAAGCATGAGAAGCTCCGCAAGGTCGCAGCGCATGGCCGCGCGGTCGATCTGCTTGCGCAGGTCGGAATCCCGTCGCCGGCTCGGCGCGCGGAGGACTATCCGCATCAGCTCTCTGGCGGCATGCGCCAGCGTGCGATGATCGCGATGGCGCTGGCCTGTCGTCCGCAGTTGCTGATCGCCGACGAGCCGACGACCGCGCTCGACGTCACGATCCAGGCGCAGATACTCGATCTTATGTTGACGCTGCAGGACGACGCCCAGATGTCGATCCAGTTCATCAGCCACAATCTCGCTGTCATATCGGAGATCGCCGACTCCATCATGGTGATGTATGCCGGCACGGCCGTTGAATATGCGCCCGCCGACGAGCTCTTCGCCAATCCGCTTCATCCCTACACCCAGGGACTGATTCAGACGCTTCCGGATCCCGACAAGCGGGTAGAGCGTCTCTACGTCATTCCTGGGCGCGTCACGTCCGATATAACGACGGGTTGCCGTTTTGCCAGGCGCTGTCCCTTTGCAGACGACGGCTGCCGACAGATCGAGCCTGATCTTCTCGAAGTCGCTCCCGGCCATAAGGTCGCCTGCCACAAGGTTAAGCCATGACCGAACTCCTGAATCTCGAGCATATCAACGTCGATATTCCGATCGGTGGCGGTTTTCTAAAACCAGCGGTCTGGCTGCAAGCGGTCAATGACGTATCGCTGAGCGTCTTGAAGGGTGAAACACTCGCGCTTGTCGGCGAAAGCGGCAGCGGCAAGTCGACGCTTGGATATGTCGTGGCCGGATTGCGCAAGCAGACCGCCGGGACCGTGACGTTCAACAGCGCCAGGCCGGGCGGCGAAGGACACACACCGGTGCAAGTCATCTTCCAGGACCCGTTTTCCGCGCTCGATCCGCGGATGGTGGTCGGAGAGATCGTTGCAGAGCCACTGAGGCTGAAGGGCGTCCCGGCCGACAAGCGGCAAGCCAGGGCAGTCGACGTCTTAGGACAGGTCGGCCTGCCGCCGGAAGCGGCGCGTCGCTATCCTCATCAGTTCTCCGGCGGCCAGCGCCAAAGGATTGCGATTGCGCGTGCCCTGATCGCCGAACCGGAAATGATCGTCGCCGACGAGCCGCTTTCCGCCCTTGATGTCTCGATCCAGAGCCAGGTGCTCAACCTGCTCGATGACATCAAGAAGCAGCATGGCATCAGCTATCTGTTCATCAGCCACGATCTCGGCGTCGTGCGGCACCTCGCGGACCGTGTCGCCGTTCTCTATCTCGGGCGGTTGATGGAGGTGGCGGCCTCGGAGGATCTCTTCCAGACACCCAGCCACCCCTATACCCAAGCCCTGCTGGCGGCCGTTCCACGCATCGGCCGGGGACGACGAAAGAAAGAACAGATCCTTCGGGGCGAGATCCCTTCGCCCTTGGCCCCACCCTCCGGCTGCGTGTTTCATACCCGTTGTCCAAAGGCGCAGGACATCTGCAAGAGCCAAAGACCTGCGCTTCTGCCCGCACCTGGCCGGCCCACGCAACTTTCCGCCTGTCATTTCAAGGATTGAGCCGATGTTGAGATATACGATTTCACGTCTCCTCCAGGCAGTCCTCGTCACTCTTGTGATGTCGCTTCTGCTGTTCGTCCTGATCGGGCTCATGCCTGGCGACCCTGTCGAGACCATGCTGGAGGGTAATCCATCGCTGACGCCGGAGACGATGGCGCAGATGCGGGCGCTCTACGGAATGGATCAGCCGCTTTTTGCCCGCTATGGTCACTGGCTTTACCAGGCGCTTCATGGCGATCTCGGCTATTCCAGCGTCTATTTCAAGCCGGTCCTGGACGTCCTCTGGCCAGCCGTGATTCAGACCGCGAAGCTTCTCGTCGCGACCGAAATCATCAGTATCCCGCTTGCGATGCTTCTCGGTGCGCTGGCTGCCCGCAAGCCGGGCGGATGGACGGACAACGTCATCAGTCTTATCGCCTTCGCCAGCATTTCGTTGCCCGGGTTCTGGACCTCGCTCATTCTGATCATCGTATTCTCAGTCAAGCTCGGCTGGCTTCCGGCGAGCGGCACGCCCTTGATGAGCGACGCTCCGGTCGGCGAACAGATCGTCCATATGATCCTGCCGGTGGTCGTCCTGACATTCTTCCATGTCGGCCCGCTGGTGCGATATGTGAGAGCATCGATGATCGAGACGCTCAATTCGGACTTTGTCCGGACAGCGCGGGCCAAGGGTCTTTCGGAAACGGCTGTCGTCATGCGTCACGCGCTCCGCAACGCTCTGATCCCGATGGTGACGGTGCTTGCCCTCGGCTTCGGCTCGCTATTCTCAGGGGCGCTCGTCGTGGAGACGATCTTCGGCATGCTGGGTATGGGCAAAGTCATCTACGACGCGATCAGCAACATCGACTTCAATCTGGCGCTCGTCGGCCTTCTGCTGGCGACCATCGTCACCCTGTTTTCAAGCCTTCTTGCCGACCTCGCCTATGCGTGGCTTGACCCGAGGATTACCCTGAAATGAGCATCGTCACCGCAGATACGGCGCCAGCGGGCGCCAGCTCGAATTCGGTCTGGAGCAAGCGCTGGCAGCGCTTCCGGGACAACAAGCTCGCGATGATCTCGCTGGTTTTCGTGGTCCTTCTCATGCTTTCCTGCGCCATGGCCGGGCCCATCGCCTATTTGACGGGAATAGACGCCAACGCCACCAATCTCCTGCGCCGGTTCAAGCCGCCGTCAGCGCAGCACTGGCTCGGCACCGACGACCTTGGCCGCGACGTACTGCTGCGGCTCCTCTATGGCGGCCAGGTATCCATTGCCGTCGGTCTGCTCGCCACCTTCATTACCGGGATCATCGGCATCGGGATCGGCGTCACTGCGGGCTACATCGGCGGCAGGTTCGACAACGTGCTCATGCGGATGACGGATTGCATCATCGCTCTGCCGCTCATCCCGGTGCTGATCGTCCTGGGCGCCGTCGATCTGACGAAACTCGGCCTCAGCCAGGCTGCGGCGACGTCGCCTGCCGCCGTCTTCTTGAGGATCGTCATCATCATCGCTCTCGTCGACTGGACGACGATCGCCAGAATAGCCAGGGCCGGCACCATCACGTTACGCGATGTCGATTACATCAGGGCAGCAAGTGTCAGCGGAGCCAAGGGTCGATATAATATTTTCGTCCACATTTTGCCGAACATCGCCACACCTCTGATCGTCGCGATGACCTTGTCGGTCGGCCGCATCATCCTGTTCGAATCGACGCTGAGCTTCCTTGGCTTTGGCATCGTTCCGCCAACGCCGACATGGGGGAACATGCTGACCAATGCGCAGCAGCTCATCATCTCGGCCCCGATGCTTGCCGTCTATCCCGGCCTGCTGATCTTCACCACCGTAATGGCGGTCAATTTCGTCGGCGATGGCGTTCGCTCGGCCTTCGATCCTCGCTCGGAGACGAGCAAGGGATCGCACTGAAGAAAACAAAAACAAAGTCTGATGGAGGATAAAATGACATCTGCCGGGAAGAGACTAAAATCGTTCGTCACCGTGGCACTCGCCACGGCTGCGCTGACCTCGCTGTCGAGTGCCAGCGCTATGGCTCAGGCAAAGGATCAACTGGTCGTCGGACAGTTGCAGTTCCTGACGAATTTTCATCCGCTGGTTCAAGTCAACAATACCAAGCGCCTCGTCATCAACTATTCCCTCCGACCGATAACGGCATTCGACGAGAACGTCGTCAATCACTGTATCCTTTGCGAAACGCTGCCGACCATCGACAACGGTCTTGCCAAGATCGTCGATCTTCCCGATGGCAAGAAGGGAATGAAGGTCACGTTCAAGCTCAGGGAGGGGCTTGCCTGGGGAGACGGCGTGCCGGTTACCTCGAAGGATATCGAGTTCACCTGGAAAATGGCGCGCGATCCCAAGATCGGCTTCTCCAACTACAATTCCTGGACGCGGGCATCTGCGCTGGAGATCGTCGACGAGCGAACCGTCGTGCTGACGATCCCGCAGATCATTTCGAGCTATAATTCCTGGGACCAGGTCATTCCGGAGCACCTGGAAGGTCCGGTCTATGCGGCCAATCCAACGCTCGACACCTATGTCAAGCAATCGCTCTACAATCTGCAGCCGACCAATCCCGGCCTGTGGAACGGCTCGTTCCTGCTCAGCGACTATCAGATCGGAACACGCATTGTGTTCACGCCCAATCCGAAGTGGCCGGGAGATAAGCCCCATCTGCAGCGGATAATCCTGAGCTATCGCGACAATTCGTCTTCGTTGCTTCAGAATCTGCTCGCTGGATCGGTCGATGCGGTCCCTGTCAGCCCCGGCGGAATCAGCTTCTCGCAGATGCTCGATCTCAAGAACCAGCAGCCTGACAAGTTCACCTACCATCTTGCCTACGGAACGAACGTCGAGCGCATCGCTGTCAATTTCGACAATCCGATCCTCAAGGATAAGGCGGTGCGCCAGGCCATCCTCTACGCAATCGACCGCCAGGCGATAAGCGACGAACTGTTCGGAGGGCTGCAGCCCGTCGCCAACGGCATCCTGAGCAGCGAGAATGCCTATTACAACAAGGATATGACGCTCTACTCATATGACGCTGAGAAGTCTAAAGAGCTGCTCGAAAGCGCTGGTTGGAAACCGGGCAGTGACGGCATCTGTGTCAATGACAAAGGCGAGCGTCTTTCGCTCGAGCTTGTCTCCACTGCGGGCAATCAGACCAGAGAGCAGATTGCTCAGGTCATCCAGAGCCAGTTGAAGGATGTCTGCATCGAGATCACCAACAACTTCGTGCCCCTGCAGGAATTCAATGGCGAGATGGCGCGCAAGCGGAAGTTCAAAGCTCTGATGATGTCCTCGATCGACTTTTCTCCATCCGTTTCGCCGCGGATAGCGCTCGGATCCGACGCCATTCCGGGTCCAAAAAACAATGGCGTCGGCAACAATTTCTCAGCCTATGCAAATCCGGAGATGGACAAGGCCATATCCGAGCTCGAGGCGGCGCTCGATCCTGAGACGGCAAAGCAGAAATGGGCGGCGGTCCAGAAAATCTTTGCCGACGATCTGCCGATGTTGCCGCTCTATTTCTACCCGCGCGCCTACGTCACCGTCACCGGCCTTACGAACTTCCGGCAGGGAACGCTCGACCCGTTGCAGATCTGGTCGGAGGAATGGCAGCGGCAATGAGCAGGGCCGTTTAAGCCAGACTATCCGAAGATCATTTGCGTGGGATATCTATGAAACCGAACATCGCCATTCTTGACGATTATCTGGGTATTTCTCAGGAGGTCGCCGATTGGGGAGGTTTGAAATCCCGCGCCAATGTCGTCGTCTTCGACCGTCCGCTGGCGCTGCCGGATGAGGCCGCGCGCGAACTCGCTGGATTCGACATAATCTGCACTCTCCGTGAGAGGATGCCGATTCCAGGGGAGTTGATCAATCGGCTGCCGCGCCTGAAGTACATCGTCGTGACCGGCAAGCGCTACGATACGGTGGACATCGCGACAGCCGCGAGCCGCGGCGTCCTCGTCTCGAACACTCCCGTCAGCGGGGCTGGCGCCGGCGGCGTTGCCGAACTCGTCTGGGGTCTGATCTTGTCGGCAACGAGACATATCGCTTCGGAAGATCGCAGCATGCGCCGCGGAGGCTGGCAGACCCAGGCGGGAACGACGGTAGGAGGCAAGGTGCTCGGCATTCTTGGCCTCGGCAGCCTCGGCAGAAGGGTCGCGGAGATCGGCAAGGTATTCGGCATGGAGCTCCAGGCCTGGAGCCAGAACATGACGGCGGAGCAGGCGGAGGCAGCCGGTGCGCGTCTGGTATCGAAGCAAGAGCTCTTTGCCACCAGCGACGTCGTCACGATCCATCTGGCGTTGAGCGACCGAACGCGAGGCATCGTCGGCGCGACCGATCTGCAGGCGATGAAAAAGACCGCCTACATCATCAATACCGCCCGCGGCGCCATCGTTGACGAGCCCGCGCTTATTGCCGCGTTGCGCTCAAGATCGATTGCGGGAGCGGGGCTGGACGTCTACGAAAAAGAGCCGCTGCCGGTCGACCATCCATTCCGCAGCCTGTCGAACGTCGTGATCACCCCGCACCTTGGATATTTCACCAGGGACATGTTGGGTACCTACTACGGAGACGCCGTCCGCCTCATCGAAGCATTTTTGGATGGGCGCCCTGAGCGTGTCGTCAATATGGGCCTGGATACCGGATGCTGACGTTCCGCCGGTCTTGCCGAGAATAGCGCCTCCGCGGTTTCACCCGGAAGAGCATTCCAGGGAGCGCCTTCAGGTCGGCTTGCCTCCCAGGAGATCCATGCTGAGGTCGCGCAATATACGAAAAGCCGCTTCGTGCAGCCTGTTCGACAATCGGCGAGCGGGCTGCACAAGTTGAACGGAGAGGATCAATTCGGGTCCGACGATTTTGCGCAGTTCCAATCCATGCCGCGCGTCCGGACCCTTGGATGCCAGCGTGCGGATCGACGCCACCGTGAAGCCGTAGCCGTCCCTCACAAGATCGAACATCGTATCGAGAGGGTCCAATTCAAAGACGATATTGAGCTTCTGGCCGAGCCGTGCGAGTTCGGAATCGAGCAGGACCCTCGTGCTGTTGGGCCTGCAGGCGATGACCATCGGCACGTCGGCAATTGATTCCAGATGTACCGGCTTGTCATCTTTGAACGCGCCTTCGCGGCCAATGAGATAGAGGTTCTCCTCGACAAGGTCGTGGATTTCCAACATCGGGGACGATGGAGCGTCGTACATGATCGCCATGTCGAGGCGGCCGGACAGGATCCATTCCTGAAGCTGGTTTGATCGTCCGTGGACGAGCATCACCTGAGCATCCGACAATTCCTCTCTCAGCTTGCGTATGAGCGCGGTCGCGATTGCAGTCGAGATCGTAGCAGGTAAACCTATTCCGACCTTCCCGGATTTCCCTGTTCGAGAATTCTCGATGTCCTCGTAGGTGCGCTCGATTTGCCGAAGAATGCCACGCGCATTGTCGAGCAGACGTTCGCCGGCCTCGGTGAGCTCGACGCCTCGTCCATTGCGAAGCAGGAGCCGTTCCTTCAACTCCGTCTCCAGGTTTCGGACCTGTCGGCTCAGTGCGGGCTGGGCGATGTGCAGGAAAGCCGATGCGCGGCTGAAGCTTCCGAGTTCCGCCACGCGGACGAAATAAGCGAGTTGGCGGATGTCCATGTCGTCTTCTCCCTGAGATATGCCGAAATGCTATGACTGATAGTATGGCAGAACATCTGGAGATACCAAGATCCTTGAGGTAGCCTCGCTCAAATTCGAGTTTGTGGGAGGAAAAACAATGGTTCCGTCCGCGGAGGTCGCCGACTGGAGCGGCTTCAAGGTGTGGCAGTGCGTCCTGTGCGCGTATGTCTACGACGAGGCGCTTGGAGATCCCGATGGCGGCGTGGCGCCGGGCACCCGCTGGGAAGACGTGCCCGACGATTGGGTTTGCCCGGAGTGCGGCGCTCGAAAGAGCGAGTTCGACATGGTGGTCGTCGGATGACAGCGATCGGATCGGTGAACCCTTGATGTCTGATATCTTGATCATTGGTGCGTCCCATTCCGGGGTCGCTGCCGCGGCCGCATTGCGCAGCGCCAAATATGACGGCTCCATCATGCTGGTAACGCAGGAGAACGTCCTTCCCTACCATCGTCCGCCGCTCTCGAAAGAAGCGCTGTCGAAGGACGACTATTCTCCAACGCCGCTCCGCCCGGAAACGTTCTACGCCTTGAACCAGATCGACCTCGTTCAGGCTGTGAAGATCGTCGCGCTGAATTTAGCGGAAAGTTATGCACTCTCCGAGGATGGAACCCGCTTTCCCTACGGGCGCTTGATCCTCGCCTGTGGGGCCGAACCACGTCGTTTGCCCACGTCTGTGGATGCCGACGGCATCGCCCACGCCCTACGCACGCATGACGATCTGATCGGATTGAAGGCACGTCTTGCCGGCGCCAAGTCGGTCGCGATTATCGGCGGCGGACTGATCGGCATGGAAATCGCCGCAATGGCTTTGGCAAAGGGGCTCGCCGTCACGGTCGTCGAGGCTGGTTCGAGGCTGATGGAGCGGACGGTCAGCAAGTCGATCGCCAATTACGTGCTCGACCGGCATCTGAACCAGGGGCTTCATGTTCGCTTCGGCGCCACAGTAGTGACGATCGTTCGCGACGGTGGCCGGGATGTCGATCTGGTCACTTTAAGCGACCGCGAGCAGATCGAGGCTGATATCGTCGTGGTGGCGATCGGGGCGGCACCCCATGAAAACTTGGCGCGCGACGCGGGCCTTGAGGTGAACAACGGCATTCTCGTTTCGGAGATCGGGCGTTCGTCGCACCCGGCCGTCTATGCCGTTGGCGATTGCTCCGCTTGGTACGATCCGGTGCTCGGACGCCACGTCCGAAACGAAGCGGTCAATCCCGGGCAGGATCAGGCGAAGATCGTTGCCGCGGCAATCGCCGGGGCGTCACCGCCACCCAAGCGTCTGCCGCGCTATTGGTCGCATCAGGCCGCCATCCAGATTCAGATGTCGGGCGACGTGAACGGCGCCGATATGGAAGCGGTGCTGAACGCACCAGCCAGCGGCGCATTTTCGGTTCTGGGCTTCAAGAGCGATCGTCTCGTTGCCGTCCAGACGATCAACGCCCCGCAGCAGTTCGGCAAGTTGCACGAGATGATCGGGATAGACCGAGACGCGGTTGCCGCGACACTCGATGTCGAGTTTCCGCCACCCCATCACCATTGAAGCGCATCACCATAAAAGCAAAATCGGCGCGACGGCCCAAGGAGGATCAATCGATGCAAACTGCGTCCTACACCGAAAAAGAACCGACCCGTTTTCGCCGCGGCCAATCCTGGAAGATCAACATGTTCGGCAAGAACTCGGACATTGCAGCGCCTGATCCACAGGCCTTCCGGCTCGATTTGAACGCTCATCAAAAGCTCGAATCGCATTTCCACATCGTCGACCAGTTCCAGGTCTTCATCGCCGGAAGCGGCACGATCGGCCGTGACGAAGTGCGTCTCGTTACCGTGCATTACGCCGACCATCACACCGGCTATGGCCCGCTGATCGCCAGCGAACAGGGACTGTCATACCTGACTTTGCGCAGCAAGACCGATGCCGGGCTCGTCTACCTGACGACGCCGAACGTTCGCGAAAAGCTCAAACCCACCAAACGTCGTCATCGCACGTCGGGTGCGGTGGCGCTGTCGATCGAGCCGGTTCTCCGCAATCGCACCGAGTTGACCGTCGACACCATCATCGAGGAGCAGCCGGGTGATGATGGCATGAACTGCAAGGTGTTCCGGCTGGGACCGGCCATGGCGGTGCAGGCACCCGATCCGACAGGCAGCGGCGGCCAGTACCTGATCGTGCTCAACGGCAGTCTCATCCATGAAGGGCAGATTTATCAGCCCTTCTCGCTGATGTTCGTGCGGTTCGACGATCCTGCGCCGACCATCACGGCTGGCGAGGATGGACTGGAGCTGATGATCACGCAGTTCCCGACCGAAGATGAGTGGATGAAATCCATCTAGTCCTGAAGGCATGCGTTCAACTGGAGGAGGAAACCGTGACCTATTTTGACCTTGAGGCGATCCGCAGGGAGATCGCTCCTTCCGGAATGCTGGTCTGCGCCCTCAATCACGGCAATGTCGTGCTGGTTCGGCGCGGGCCGACGGACGAGACGCCGACGGGCGTGTCGGTGGACCTCGCACGGTCGCTTGCAGAAAAACTCTCGCTCCAGATCCGCTTTCGACATTACGACAAAGCCGGCGATGTTTCCGCAAGTGTCGGCACGGAGGAGTGGGACGTGTGTTTCCTCGCAATCGATCCGCAGCGAGCCGAGCGCATTGCCTATAGCGACCCCTATGTCCAGATCGAAGGCGCATTTCTGATACGCCGCGCTGCGGGGCCGTTGGGCCTGAAGGACGTGGACCGGCTGCGACTGAAAATCGGCGCCGTTAGGGGAAGTGCCTACGAACTCTTTCTTTCAAGACATGGCGGAGCCGGCGAACTGATCCGTTTCGACAGTTTTCCGGAAGCGGTGGCGGCTTTGACGGGCGGGGATCTCGATGGTCTGGCGGGCGTGCGGCAGGCCATGTCGATCGTCGCTCGCGACCAGCCCGACTTTGCAGTGATGCAGGAACCGTTCATGGCCATTCCCCAGGCCGTCGGCATCAGTGCGGATCGCCCCGCTGCAGCCGCGTTCACGCGCGCTTTCGTCCAGGAGCAGAAGGCGAGCGGTTTCGTTCGCCTGTCTCTGGTGCTGAGCGGACACGCCGATGTCGTCGTGCCACCCTGATTGGCACCTGCTGTTCCACGCGGCGATAGAGCAATCAACCTGGGAGGAAGGCATGAGAATGAGTGGAAAGTTCAAGGCTGTCGTCGGCAAGAGCAGAACGGGTGCTGTCGTCCTGATGGCAAGCCTGTTGCTGAGCGCGGCAAACAATGCGAACGCGACCGACGGAACCGCGCCAATCACGTTGGCGAATGTTACGGGAGGCCGGGTCGAGGGCGTACCGAGCGACACACCCGGCGTTACCCAATTCCTCGGCATTCCATTCGCCGGAAATGTTAGTGGGGAAAACCGTTGGAAGCCAGCGCCGCCGGTGAAGCCATGGGATGGCATTCTCACGGCTGACAAGTGGGGCGACCAGATGCTCCAGAACCCGAACAGTCTCGGCCGGGCGCCGATCAGCGACAACGGCTTGAACCTTGCTGTCTGGACCCCGGCCCACAGCATCGGCGATCGGCTCCCCATCTATATGCTTATCCACGGCGGCGCCAACCGCCTGGGTTCAAGCGAGATGAAAGACCTCTATGCGGCGCAGTTGGCGGCGAAAGGCGTGGTCGTCGTCTCTGTTCAATATCGCCTTGGAGCGATGGGCTGGCTGTCGCTGCCGGAGATGGACAAGGACAGTGGCAAGGGCCCGAAAGGCAATTTCGGGGTCCTCGATCTCGTCGATGCTCTTCACTGGATCCAAAAGAACGCGGAGGCGTTCGGAGGCGATCCGAAGACCGTGACGATCGGCGGCCAGTCGGCGGGTGGCGAAAACACCGTTGCGCTCCTCCGGACGCCCCTGGCCAAGGGTCTCTTCAAGCGCGCATTCATTCAGTCGAGCTTCACCGGCTTCCTGCCGGGGAAGGTTGTCGACTTCGCCAAGAAGTCTGAGCAAAACCAGGAGGCTGTAAACAAGCTCCTTGGCAAGGAGGTGACGCTTGCCGACCTTCGGGCGATCGACAGCAAGACCTGGCTGGAGAATTGGCAGGGCGGAAAGGAAACGCTCTACGGTGCAATGACAGGCGCCGTCGCAACGAACCAGTTCTATACCATCGACGACTACGTGTTCACCAAGGAATCGGTAAACCTGGTTCAACCGGGCGATTTCGACGGTCTGGATATCATCATCGGTCAGACGGCGGACGAATATACCGGACTACGGCCGAACGACCTGAAAATGACGGAGGGCGAGCAGCATGAGGCGATGCTGGCGGCGATCCGACCGCACCAGGCGGGCAATGTCGACGACGCCGTCTTTCCGCACTACGAAACCAACGACCCGGTGGAAGCCTATCGATTATCGCTGCGGATGTTGAACGACTATATGTTCGAGTATGTGCGCGTCGGGGCGGAATTCGCCAAAGCCCACAGCAGCGCCAACGTCTATCTCTACTATTGGGACCACTGGCCGCCGGGCAAGGATCAGGGCTTTCGTCGCGCTTGGCACGCGGGAGACAACTGGTACTTCAACGGTTCGCTGCGCCCTGGAAACCGTGACCAGCTTCCGTGGACCGACCCGGATTTCGCCATGCGCGACATGGCCATTACCTATCTCGCAAACTTCATCAAGACTGGCGATCCGAACGGTAACAGCGTCCCGCACTGGGGGCAGGTGACCCCTCAGTCAGGCGGCCAGTTCATTCGGTTTCATGAGGGGGAAGCTGCCATGCGGACCTCGACGCTCTACCCAAGCCGTGACGCCTACCTGCGGAAAAAGATTCTAGAGGGTATCGGCATGACGGAGAGGGATATCGCTGAAAAACCCTGATCGCGCGGCCGAGCGGCAGGCCGATCTACAAAACTCGCAGAGCTGATAGAGGCCATCGATCCTGCCTATAAAACGGCTCCAGCCCTAAGAATTGGCGGGTCGTTCCCAGGCTTTCCACGGCGTCATTTCAGCTGTCGCGGCCTCCCACGCGCACGCAAAGCGCGGGGCGTTTGCACGCCGATTCGATGGCGGTCGTTTGCTTGCCGTCGTGTGACCTTACCAAGAGGAGGATATCATGAAAATTGGACGTCGGAGATTTATGACTGCTGCTGCCTGCACTGCGGCTTCAGGCTTGCTGCCGGCAAGAGGTGTATCGGCGGCAGGGGCAGAGCTTATTCACACCGTGGCGGCAGAAAGTCCATGGTTCTGCAATCAGGTGGCGCTGACAGCCAAAGGAGACATGTTCTTCGGCCTGCCGCGCTATCCGGACTACGACCAGACGCCCTGTCTCGCAAAGCGGGGTGCGGACGGAAAGCCAGCCGCCTTTCCGGGAAACGCCTGGAACGAATGGAAGCCCGGTGACGACGGATTTGATAGCTTCGTCTATGTCAACTCGATCCATGTCTTCAAGGAGGGGACTGTCTGGGCGGTGGACCAGGGAGCACTGCGCGCAGACTCCTATCCGCCTGCGCTTTCCGAGCCTCACAAGGGCGCACAGAAGCTTGTTCAATTGGATGCGGACTCCGGCGAGGTGCTGCGGGTTCTGCGCTTCGGCGACGATATTCTGCCGAAGGGCGCGAAATTGAACGATCTGCGCGTGTTCGGCGACCATGTCTACGTCACGGACTCCGGGTTGGGCGCCTTGATCTATCATGACCTCAAGACCGGCGTTACGCTTCGCCGGATGTCCGGTTCTCCTGAGATGCAGGCAAAGGTCGAACCGAATATGCAGCAGGGGAGTCACCAGACCCCGAAGATCGACATGATCGAGGTCAGCGACGACGGGGAGTGGCTTTATGCTGCAGCACCGACCGGCCCATTCATCAGGATCAAGACCGCAGCGTTGCGCGACGCGTCGCTTTCGGATGACGACCTCGCCGAACAGGTCGAGGAATACGCGACGATCGCACGCAGCGGCGGCTGCGCCCTTGATACGAACGGCAACCTCTATCTCTCCGAACTGGACAACAAGCGGGTGACGATCTTGTCACCGACCGGAGAGACTGCGGTGCTGACGTCGGATGACGAGTTCATCAGTCCGGACGGATCGTTCATCAGTGTCGATCGCAAGCTCTATATTCCGGTCACGCAATCCCGCCGAACGAGGCTGTTCGGCAACAAGCAGGACATGGTGAAACGGCCTTGGAAGATCTACGTCGTCGATCTGCCCGAAACATTGGGAGGCATCAAGCTCGGCGCTTCTCTGAATGGACCTTCGCTCCCCTGATGCATATCGGAGCAGCCGAGCCCTGTTCTGGTCGAATAGAACGCCGAGAATGGCTTCTGTGACCAGCAGTGGTACGCCCTAGGGGAGTCGAACCCCTCTTTACAGAATGAGAATCTGTCGTCCTAACCGATAGACGAAGGGCGCAGGCAGCCTCGTTAAAATAGGAGCGCGCGCCGGCTTCTGCAAGCGGAAGTGATGGCTTCTTTATGGAGAGAAAGTGGTACGCCCTAGGGGAGTCGAACCCCTCTTCCCAGAATGAAAATCTGGTGTCCTAACCGATAGACGAAGGGCGCTTCCTTCGTGGTGGCGCCCTTATAGTCAGGCGTTTTGAATCCCGCAAGCGGAATTGCGGTTTTCTTTCAAAAAAATGACAGGAAATCCAAGTGCGTTCGAGAAGCCGATTATTGAAGCTGAATCAATAGGCTGCCGGTTGTTTCTTTAACGAGACGAGGTGCAGCTGCCGTAACGTAATGCGAAAATCGCGCGGCCGTCAGATCGCTCTTGCCGTCGCTATCGGTGCGCGCTTGCCGCCGCAGCCCCAGTTCCAGTCACGGGTTTTGCCGGTGTCGAGGTGAACCGAATCCGTGTGGCAGTAGGTGCCGACGCCGCCGCGGTCCGGCAACGAGCGGATATAGGCGGCGATGTCCCACTTCGTAACACCGTCGATCTGGATGTCGGCGGCCTCGCAGCTCTTGTGCATCGATTCGTCAGCCCCGCCGACGAGGCGGTTATGCTCTTCGTCACGGTAGCCGGAGGTGACGATAACAGGCCGGCCGAAATGACTTTCGACCGTCTTGATCACTTTCAGCAGGTCGGGCTTGAAGCAGCCGACCTCGACCTTGTCGTTCTGGAGATGCAGCCCGTTCGGCGCGATACGCGTCATGCCGGGCAGGGCCGCCTTGTGCAGCAGGCCCGAGAGGCTGCCGAGCAGGTTTTGCTTCGGCGCGCTGTAGAGCGCGTTCATCGTCGAGGTCGGAATGCTGCCCGTCGCCAGCGACGCCGTCTGCACGGGCGCAAGCTGCGTGCTGCTATTCTGCGAGGATTGCTGCGGTAGCACCGGCTCCGCCGGCTGTTGGGGCATGGGCGGCTGGCTGTAGACGCTGCTTCGCGCAGGTGCTGCACCTTCCACCGGCACATAGGCCTGCGGCTGAATGACGGTGGACGTGCTGTTGTTCTGCGGCGCCGGCTGGTGATCGGAAAAGATGCTCATCGCCTGCGCGTTGATCCGGGTCGACTGCAGGACGAGCCCGCCGATATTGGCGGGCGCCGTCGTCGCCGAGCCGGCGGGGGACGGCTGCATTCGGATCTTGTGCCACCACCTGAGGCGCACCCTGGACGCCGGAGACGTTGACGAGCCGGGGATCGGTATAGACCGAACGTGGGGGGGCAGGGCTTGCAGGCGCTGCCGTGGCCAGCTGCGGTGCTGCGGCCGGCACAGCCGGAACCACCACCGCCGAATCGAGCGCCTTCTTGTCCGACGCGCAGCCGGACAGCGCCAGCACCGAGGTGGCGATCAGCAAAGCGCCCGTCGCGGGCATCCGCATCTGCATCTTACGCAAGCGACAGTCTCCAGTTGATGCGGAAAATAAGCGAAAAACGCTTCTCTATCCGCTTCGAGTCGCCAGGAGATTGCCTGCGGCGTCGGATCGCGGCAAGCGCCGAGGCGCCGGCCGGACCCGAAATCGAAAAAGCCGCAAGCCTCGCAAAAGCCTTGCGGCAAAAACCCTTACCAGGCGCCGGTATTGCCCATCGAAGACCAAGGTTCGGCGGCCGGAAGTGGGTTGCCCTTCTGCAGGATTTCGATCGAGATGCCGTCGGGCGAGCGCACGAAGGCCATGTTCCCGTCGCGCGGCGGCCGGTTGATGGTGATGCCGTTGTCCATCAGATTTTGGCAGGTGGCGTAAATATCGTCGACCTCATAAGCGAGGTGGCCGAAATTGCGACCTCCGCTATAATCTTCCGTGTCCCAATTGTAGGTAAGCTCGAGGCAGGGGGCTTTTTCGCTGCGCGCACGGTCGAGATCGTCGCGAGCAGCCAGGAAAACCAGGGTGAAGCGGCCCTTCTCGTTTTCGTGGCGGCGAATTTCCTCCAGTCCGAACAGCGTGGTGTAAAAGGCAAGCGAGGCGTCCAGGTCTTTGACACGAACCATTGTATGTAGATAACGCATTCTATTTACTCCTTTATTGGGGCTGGCTTTTTTCGGCGAACCGAGGTCAGCTTCGGGCAAGTCAACTTTATAAAATGTGCACCTGGGAATAACCGAAAACTAGGACTTGCGCTTATCCGTTACCAAGATGTTAATCTTGATTTCAAGAATCAGTTAACCGTAACAGTGTGACGCGTATTCGAGGGGCTGGCGACAATGGCTGACAGGATTTCATCGAACGAATACTCCGACCTCGACGAGCTATCAGGAGAGGCGGTCGATCTTGTTGAAATCACCGGCGTCGTCAAGTGGTTCGATGTCGCCAAGGGTTTCGGCTTCATCGTGCCCGACAACGGCTTGCAGGATGTTCTCCTGCATGTGACCTGCCTGCGCCGCGACGGTTACCAGACGATCCTCGAGGGAACGCGCATCGTTGCCCTCATCCAGCGCCGCGAGCGCGGCTACCAGGCTTTCAAGATCCTCTCGATGGACCAGTCGACCGCGGTTCATCCCTCGCAGCTGCCGCCGGTGCGCACCCATGTGCAGGTCACCGCGACGAGCGGCCTCGAGCGTGCGCTGGTCAAGTGGTTCAACCGCACCAAGGGCTTCGGCTTCCTGACACGCGGCGAGGGCACCGAAGATATCTTCGTTCACATGGAGACGCTGCGCCGCTTCGGCCTGACGGAGCTCCGCCCCGGCCAGGTGGTTCTCGTTCGCTTCGGCGATGGCGAAAAGGGCCTGATGGCCGCAGAGATCCATCCCGACGTTCCGAGCCCAGCAAGCCGGTCGCACTGAATATGCGCGGCCTAATCCTATTTGATGCTATCAGAAGCGCCATCCTGGCGCTTTTTTTCATGGTCGCGCTGCCGGCCTTGGCTGACGAGGAGATGCGGTTCGACAAGGAGCCGCTGCTCATCCAGACCGCTGCGGGCAAGGTGCTGCATTTCACTGTCGAAATCGCCTCGACCCCGGATCAGCGCGCCCGCGGCCTGATGTTTCGCAAGGTGATGGCCGATGATGCCGGCATGATCTTCGATTTCGACCAGCCGCGGCGCGTCACCATGTGGATGGAAAACACCATCCTGCCGCTCGACATGCTTTTTGCCGACGACACCGGCACGATCCGCCACATCAAGGAAAAGGCAGTGCCCTATTCACGCGACATCATCGATTCGATGGTTGCGGTGAAATACGTCGTCGAACTCAACGCCGGCATCGTCGCCAAACTGGGAATCAAGCCCGGCGACAGGATCGTCAGCGCCACGACGACGAAGAAGGCGAAGTGACGCGCGCGCGACCGCGAGCATGGGTGCGCCTGTCAAGGAATTCGAGAATATCGGGATATTTGAATGGTCGGAGTGGAGAGGTTCGAACTCCCGACCCTCATCGAAAACAATGGGAATTCTGAATTTTGTAACGCCGTAACGACCAGCTTTCTGTTTTCAAATCGGTTATAGAGCAGCGGGTTAAAAGGCGCAATTGCTTTCCAGTCTAGCTCATCCGGGTATAATTTCAAACCGAAGGCCATGGCGTTCGGCAATGTCGTTCCTAATGGCCTTTTGCATTATAGGGATGAGAGTTGAGACAGCATCTCGGTCGATGTCATCGAGACCGGGTGAAGTGGCTCCTGTTGCTAGGGCATGTGCAATTTTCACCCGCAACTGCGGAAGTAGCTTACGCAGGGGCTTGGCACCAATTTTCAAGGCGCCGAATTCCTCATCCATTGAACTGTGGAAGTCGCCTGTTTTGCGAGGGTGCTCCAGCTCAATGATCTTATACAAGGCCAAAAATCGGTAGTGGCTAGGTATTCTTGTGCTGTGGCTCTCACTCAGCAGGTCCATTATGTGATGGAGGTTTCTTTCTGACGTGACTGCATAGCTCGTGGCGAGCAGTTGCGCATCGACGGGCTGCTTCGGAGCAATCATTTCTGCGCGTAGAAAAAAGGTTGGTTTCTCGTCATTGCTAAAACTGGAAGCCGAGAAAATTCCGTGGTCCTGCGGCTTAAAAAAATCGGCTTGATCGTCCCACGTGAGCCGAATAATTGAAAGCATATGGTAGGTGGATCGATTGATCCATGCGGACTCAGCAGGCGTCCAAAGGGGCGCTCCGTTTTCTGCCTTGCCGGGCTCCGGGAGCGTGCAAACAATCGCCGCTATGCTGGTGGGCGTTCCCATAATAGAAACGCGCCCAACTTTTCCGGTGGCGAGTGTCTCAGTGGTATAGGATAGTTCATTTATCTGCTGCTGCTCAGCATCACTCAAGAGCGGCATTATCAGAACGAGATTGTGGCCGTATTGCATAGATCGGAACGCCTTTCTTGCACAAGCCATCTCTTTTGTTCGCGAAATCGCTGATTAGTCGCGGCTAAACTACGCGCCGCCGTTCACGGCGGACGGGATCTTCGATGCAAATTCGAGGATGTCATTCGCGAGTTTGGCGCTACCGGTTGCGAAATCCTTCAGGTCTGAAATTACTAGGGTCTCGTCTGGGGAAAAGTCGAGATTCTTTGCGCTAAAGTATGCGGGCATCAGCCTAGGTTCCATCTCCCTCATCTCTTTGACGCCCGAGACTGGTTGGTTCTTATGCCAATGGCCCACGTGCCAATGGGCAATCTTGTGCCGAACCGCAGCTTTATGCTGAACTCGCTTCATGATGTTCTTGAATCTGGGCAAATCATCAGCGGCAAGAACGAGTTGGCCGACCGCCTTGATGATCCTGCACTTCGATTCGATATGACGAGCTTCGTCCAGCGTGAGATAGCATAGCCGGGGAGGCGCGGGATGCATAAGATTTCCGTAAAGCAGTGAGAGCATGCTTTCGACCATTGAGAATGCCGCGATGCAGTTGCCAATCTCGGTCCTCATCGGAACCAAATCGGCAAGCTCAACAGCTTCAGGTTCAGTCATGATAATCACCAAAAATCAAAGTTGCTCGATCGTCTTTGTCTATGCTTGTGCTTAGTTACAGGGAAAGGTGCGCTTTGCTGCAAATATCAGCAGCGGTGCCAGATAGCTAATATGCTCATTTCTCATCGAAGGCTTCTCGTTGAGCATATCGTCGAGCAAACGTCGCCACTGCTCGCCGGCAATCGTCAACTTCGGCGGAGGGCAGAACAGTGGCTTGCCGTCATCCTTGGCAAAGTCGTTGGCCCACGTCAGGGTATTTCCTATCGTCGTTATCCGATCATTTAGCCATTTTTGATCACTCGATTCCATTGCTTTACCAACAGTGTATTGCTCTTCAGCGAATGCGTCGCTGGCTATAAGCATAACTGTAGTTGTAATCAAAAACGCCTTGAGCAAGACAATCTCCATCAATGCTATTCCATCACCTACGCACTCCATAAGCGCAACAGGGAAACCCGCCGATTTTCACCGGCGGGCCGCAGTCACTTTGCAACGTCGATCGAGCTATTTGCCCTTCTTCTCGTCCTGGGACAGGACGGAAGTCGCAGGCTTTTAGCGTCGGTCGAGGTGGGTTTCTTGCTGCCATTGAGAATTTTTGCAGCAAGTGTTGAAATGCTGGAGCTTGATTGCTTCTTCGCCATCATGCACCGCCCATGGCGTTTTAAAGCAGCCAGTGCCATCGATAGATCAGCGCCCGGAGAGTATGCAAGATGGACGATATTGATTTTGTATTGCTCGGACTATTTGCCGCCGATGATGTGAACAAGCGCACTGAACTGGTGTTAAATCAAAAAAAACTAATCCACTATTGTTCTGCCGATACTGCCCTGAAGATCATAAAAAATCGCGAGATCTGGTTGCGCAACGTGCGAGTCATGAATGACTACATGGAAGTGAATCACGGTTTCAATCTCATCAAAAAGTCGCTTCAGCCGCCGGTCGATACCGCGATCGAAACGGGCATGAACGAGGTTAGGAAGGCGCTCGACGTCATTCACCCGGGGATTGCTGATGAAGCTTTTTCGCGGTTTCAAACGTGGTCGCCCTTTATCCAATATGAAACTTACGTGACCTGCCTTTCCGAGCATCTCGATGATGAGAATGAAGACGGCCGTCTCTCGATGTGGCGAAATTATTCTTCTGGGCAGGCGGGCGTCGGCATCGTGATCAACACCTCTATGTTCGCTCGGACTGATGATGCTCTGGGCGTTTATAGCAGCCCCGTCACCTATTTATCTGACCTGGCTCTCGAGCATTCGCTGATGCAAGTCGCTCAAAGGATAAGGGAGCGAGCGGCTTTCCTGTCGACGGTGCCGCGAGAGACCCTGATCGGGCAATATTTTCTTCTGCTTCGAACCATCTCTCATGGTTCGAAGCATCCTGGCTTTAAGGAAGAGCGCGAGTGGCGGATATTCCATACGTTTGGCATGGATGAGCTGAAGATATTGAAGGTCACCAGCGAAAGTTTGGGCGGGGTTCCTCAACGCATCTTGAAGCTACCTCTTGACGGAACCATTGAGGGCATATCTATCGCCGACCTTGTCGACAAAATCTTGGTTGGGCCTTCACAATATCAGTATGTGATAGGGATGGCGTTGGCGGATGAACTCGACCGAGCAGGCAAGAAGGACGCCTATCAAAGCATTAAGTATTCCCCGATTCCGCTGCGTACCTAGCGCGCTTTCGCATCAACGCCCGCAGCTACGGCGGTTGCTCTCACAGTGCTCCCACAACGCCCGGTCCGGGTGTGGTCGCGGCGATACCAAATTATATAAAAATTTCAGTGGGAAAGACGTGGTCGGAGTGGAGAGATTCGAACTCGTATAGACCATTGAATTTCAACGGTTATTTCCACTCTTTGTAACGCTGTTACCGGCGACTTTCTGTTCTTGTGTAGGCGTATAGTTGATAACGAGATCTTGCGCAATCACACATCGTAGTGGTCAATTGCCGTCGGCACGCAAAGGAGATGAACGGTGCTGGGGAATACATTGAAAATAGCGTTGGCAGGGATCGTGCTGGTCCCCGCTATGGTCTCCGAAGGTTACGCATCGGCGTCATCGTCCTATATGCCCTCCGAGGAGCTTATCGAAAACTGTCGCGCGTTTCGTGCCTATCGTGCCGGGCAGGATGTCCTTTACGGTCAGAAAGCATCTCAGCTGGCGTTCAAGGCGGCGACGTGCTGGGCGTATATCCAGGGCGTCGAAGACGACAGCGCTTTGCGCGAACGCTCTTGCGTGCCGTTCCCGACAGAGGTCGATATCTTAGTTGGATTATTTAACGATTACATGGAGGCGAATCCCGAAGCGCGGAAATATGGGGCCGCATCAAATGTGAACGCTGCTCTGCAGAAGGCCTATTGCCCCAAGTAAATCGCCAGATGCAAAAAGCTCGCCGATCTTCACCGGCGGGCTTCGAGCTTCAGGTCTGCCGTTTCCAACAGGCTAACATCGCATTTATCTTTGTGAAATCAAATAGATAAATTAGCGAAAGCTAACCATAGCAGTCTTCCGCTGCATATTTCCTTATCATGAACTCGTAGGCTTCAACACCCAAACGCGAAAATCGGGGCAGCCCATAATTGCGGGCGATGAGAGACTCGAAAACTCGAAACTCTCATCAAGCTTCTATTTAAGTGATTGGTTCTATTTATTTTTATCTATATGAGAGAGTTGCGATTTGTGACGTTGTGACGGGGCTTTTTTTGTGAAGGAATCGTCACTGTCATTGGCGCTCCCGCTGGCAAGCTTTTCTTCAAATAGGAAACTCACTCTGCTAACGATCCGAGCGAAGGTATATCCTGCTTCGATTATCAAGAAGAAGAAAACGCTTCTGGTGCAGAGGCTAGCCCACCACAGATAATCCGTTGCACTCGGGGTGATTGTCACCAAGTAGGGCAAGGCGTGTTTCGTGAACCAGTCGCCTTGCGCCTCCGGTGCAATAACCACCATGGCGAATGCCAAGCCAGCCGAAGCTGTCATCACAAAATAGGAAACGTTGTGAAAGAGCTCTCTAGAGAGTCCATTTACGCGGCGGGCACGGTTCGTCTGTTCCAGAGATGAGTTTGGCGAACTCTGGTCTTCTTTTCCGCTAGCGATGTAGAATAGCACGCTGAAGCCAAAGCCGAGAAGGATCGACGCGACAGTGTTTATCGAGTTAATGAAATCGGTGAGCTTGTCTGTCAGAAGAAATGCCAAAGCCGGGGACGAGATAGAAAATAGGAAGATCCGTGGCCATATACGAACCCGACCACTCGTTCTGTAGTTATAGAAGGTGGCGTTGTTATTTGCGATTATGGAGCGAATGCTAGCCATTCTCAGTAACTTTTATGACGTGTTCATTTAGAGTAATGAGTATTTCATTACATGTGTCTGCGTTGGCCGGGTGTCCTTCGTCATCTACTTCTACGTCAAGTGTGACCCGAGTGGCGCGAAATCCATTCTCGAACATGTGTATCGTAGCGGTCCTTCCGTTCATGTCCGCGAGAACAGTGCAGTCGATAATGTCGTCATCGTCGATCTCTACAACGTCGCTTTGATTCATCAAGCCAGCCACCGCTTGCTTGATCTCACTTTGCTTTCGACCAAAGAACGGGATGACTTGCGTTTTCACTTGCTGGACCAGAGGGTCGTCTTTGCTCGCGCGCGTGAATGCAATCATAACCTTACCCCCTAAGCTGCTTCGGCCCGCAATCGATGTCGCCTTGTTTGCGATGTTTACCCGAATCTCACGCGGTTCGGCATTTTTGTAATACGAAGCGCCCAATCGAATTGACGTGGACGTGACTTCCTTACTCTGGGGCAGCATATCAACTATTGTTTTTCTCAACGTTTCGTAGCCACCATACTGCCCGAGGTATTGGGCGCCGATATAAATTCGACCAGACTCGGCCAGATACAATACAAAGTGAAATGGCCGGAGGTTTACGCTTTGAGACGATATTTGTCCTTTGTGCGTATTTTCGAACGAATGTCCCCAATACGAAGCTCTGAATGTGCCTGTGATCGTCCGAGAGTTTACAAATTCGAGGTTCTCCAGAGGAGCTTCTTGGCGGAAACGCAACCTGTCGATGTCATCTTCGTTGCGAATATCGAAGGACTTTTGAGCTCGCATTCTTGCAAATAACGCTGCAAAGTCTCCAATCGTGAATGGAACGACGACTACCTTGTCAGCTTTCTTCAAGCGCAGAAGATAGTGAAAACTCGCGCTAACTTCCTCGTTCTTCTTCGCAACCATCCGCCCTATACCCCCAACAGCCTTCATTTTGGCAACTAAACCGATTCGTTCTAGTTGTATTCAACCATCAAATGCAGCTTTTGTTGAGAGCGCGACACCACAAAGTCGCTCGCATACATGGGGGGATATCGCTTACGTTCTTGTTTTGTTCTATTTAGGGCCGCGAGTCAAGGTGCTTCGCATGGAAACAGGGTGATCACGCTGCTCAATCCCGCCGTCCGTCTGTTTGCCTCTGGCGTAGCAGCTTTGCGTGTTCGCGCCCGACGCCGACGCGGTTGCCGACAAGTCCGAAGAACACTGTCATGACAAGAACGGCGAGCCACCATCCGGCAAAAGCCGTGCCGAAGACAGCAATGCCCAAACCGGGCATCATTATCGAGGCCAGCGCCACGCCTGCGAGAATGAAAGCGGTCGTCCATTTCTGTTGCGTGGAAGTCCACGCAAACCGCACGAACTGCCTGTGCATAAATCGCGCCGTTGCGCGGTAGTTGAATTGCATCGATGATTCCTATGCGGGCGGCTTCGATTGTGCCGATATCTGAAAGCATAAACATGAAAAACCCACCGATTCCCAGCCCCAAGAGCCCTTCCGTCGACTGGGAGCGCTACCGCCTCGCTAGGCCTTCTTCCAACTCTGGATCATCGCATCTAGCCTCGGCTGCACTCGTCGCCGGGCTTCTTTCCGCCGGACGCCCGTTAACATTAGTTGGTCGTAATCAGTCAGCTGGTGCCGAAGCTCGTTCTGCATCGTAACTTCAACTGCTTCCGCGATCGGCGCGCTACGCCAGTTCTTCGGGACCGTGCAAATGCGGGTCGCGAAGTAGGCGACCGCGAATTCTGGGCACGCGGGATATTCCCGGCGAATGAACTCTTCGACTTCGGCTTGAACGAACCACCGTTTGCGACGCGCGCGCCTTCTTTCGGCACGCGTCTTCAAGCCGCATCCCGGAGCCGGTTCTTGACCGCGTAGAAGATCGCGCTGGCGAAATAGTGCGGCAGAAAGGCCGCATCGCGCACGTCCCATTCCTCTTGCGTCATCGCCCGGGCTTCATCCATCGCGTCGGCGACAAGAAGGCCGATGATGCCAACCATGAGGTCGTCAGCGTCCTTCGCGGACATTCCTTCGTATCTGCTGGACATGATAGCTCCTGAACGAGAAAGGCCCGCCGTGATTGGCGGGCCGTGTGGGGCACTTATAGGTCCGGGTCGCGTTCCATAGCATTCACGATGATTTCGCGAAGCTCGCTCTTGGTTGCCTTCTGCAATCCAAGCCATGCCAACTGCGCGCGAATGGACGGCTCGTGATAGTCCCTTTCATGGTCGTTATTGGTCAACGAAGCGAGTTCGCGCCTCGTATCGGCCAGCCGGGCGAGGATTTCAGCCCTCTTGGCTTCCAGCGGATCGAGGGCCACACCTGATACCATCAGCTTCAGTGCCGGGGAGAGCGCCATCAGATAGACTCCATCGTGATTTCGTCAGCGGGATCGCGAAGGATCAAATAGCCGTCAGAATGATATTCCTTCGTCCACATCTCCACCTTTTCGAGCAACTCCATGGCTTCCTGAAAATGATCCATCTCACTTTCAGGAGTGGCACGAATGCGCTCGATATCGTCCGCAATTAGGTCGAGAAGCTTGGTGCGTGCCTCATGCGTTGCGAAAGCGGCAGAGGTCATGGTATTTTCGTTGGACATCGCGATTTCCTTCAGTGTTATCGCTGTGTGAGATTGAGGGGCGGCAAAGTTCGGCGACCAAACTTCGCTTTGCCGTTCCCATTCTTGAGGCGCTATTCGCCCTTCGCATCGACGCTAAGCAGGGTGTGCCGCGTCGGCTTCCGAAGCTCTGATCGAATGGCCTTTAGCCGTTCGATCTCGGCAAAGGATTTCTTCAAAGCTTGAACGTCGCCACGTGGCAAGTCGGCACTGAATTCGTGTGCGGCGTCGATGGCAGCTTCAATTTCCTGAAGCTTGTATTCGTTGATGAGGGCCGCGAGTTCCTTCGCGGCGTCGAGGAAGACGCCCATCATGCCCCTCGCTCTTTCGAGCGAGGTGAATGACCTTTCCCACGGCGCTTCCGTTTCGAACTTCTGCACTTCCTCGCCGAGACCGGTGCCGCTGATCTGGACGCCGTAGGCATCAAGGTTTTCCTCGAAGTCTACATGACGGCCTTCCATGAAGGCATGGAACGGGGTCTGGACGTAGGCGGGCGGCGCGGCAGGCTGAAGCAGGTTTTCCCGCTCCCTTGCTTTAAGGCGTTCACGATGCTCACGGGTGCGTTCTGCTGGCGTTTTAGCCATTTGAGCCTCATTTCGTTTCGTTACAATCTGCGTGTAACGAAACGCAAATAGGAAGTCAAGCTGCCAGACGCGGCGTGCCGGTGTCTTGCTCCAAGCAATCAATCGGCCAGATGCCGAGATCGCCGAGCACTTCGATAATCCGCGTGCGGAGATCGTCGGGCGCTGCCGGTTCGAAGTTCCGGGCGATGGCTTCGAGGCTGTCGAGGGCTGCATGGTAACTGTCGCTTCGGAAGTATGCAGATGCTTGCATTTCGGTAGTGGTCCTCTGCTCGTGATTGAGCAGACTCTTTCTGATTCATCGCGTGTGATTTGGCAAGAAGAAAAAGTCTGATAACGTATTGATTATGTTGACTTAATAATAAGTAAGCGCTTACTTAAAAATAAAAACCCCTCACGCCAACGAAACGTGAGGGGCTTCGTGTCAGACGCAAAAGATGGAAGCCGCTGCGTCCGGCATTTCTCGGGTTTTCCTCAACAAGCGGGAGAGAGGCCCGCCTGTTGTTCGATTTATAGCGAGATGGCCGAGCGAGCGCATACGATCGTGAGCGTCAGCAAGGAAAGGTGTGCACCAATGCCTTCAGGATTACATTTCCGAGCACAAAGCGCTGCCCTTTTCCCTCATCTGGAGCCTCCACAAGATATCGTTCCAATATGTCTCGCATCATCGTATCGGTGAAAGCTAGATTTTGGGGTGGGCAAACAAGGTTTACGCCTTTGGCTCCTGCGTATGTATTCGCCCAGAGAATGCCATCGCCGGTAGCCGCGATCACTGCGAGGATGACCTTGTCATCGTGAGAGGCACGGACCAGTTGCGCTATCGTGAAGCCTCCTTCAGCCGAGCTCGCTGGCTCATCTTTGGCAAGGTCGTCAAGGGTGAGTTTCTTTTGGTCGGCGGCTGCGACCGCGCCGGACGAGATCAATGCAAGCGCAACGAAGGCCGCGAGGTGTAAATGCATGGTAGTCCTGAGGTGCGAGTGGTCGCTTGGTATGTTAGCAACCAAAGGTCTTGGCGACTACTCGAACCTTCGGTTCTCGCCTGCGCGCTTCGCTTGCCCGCTCACTACGTTCGCGAACCGTCTTCGTATCTCTAACCACGTCACACATCATAATCACTGCGGGAATAGGACTTATTCCCCCTCGACCTGAACGACATATAAAAGCCGACACGAGGTCGGCGGTCGTTCAAGTCGAGGGGGATTTTCTGCGTCAAACAGGTGGTGAAGAGTTCTGGTCCGTTAGACCGGCTAGGCGGCTTCCATTTGCCTAGCAGCACTCCGATATCCTGCCCTTGCGGGCAAAGCGGTAAGCGCCACGCTCGACTTTCCTTCAATCGACACGTGACTGGACGTTGTGGTTGCTGATCCTGTCAGCGTTTCACATTGCGTCCATCTTTTATTGCATCCATTCCACGCGGTTTCCCGGTGCGGCGATAACCCGCTGAAGCGTCCGATGTTCTCCATCGGGGAGCCGGCCAGCATGATCTTCCGTTCGTCTCTGGCTGACTGATAGCGCCGGTCATGTTCCCGGTATCGCCCTCGCATCCAAGGCCAGTATGGTCCTCTTGGAGCTAACGACACTCCGCAAGGGAGCTACCGAAACAGAGTTGACATATAAGGAATTTAATCCTATTAATGTCCTTGCTGAAATCAGAAGCAGCATTCCTAGCCCTCGGCAAAATAGCACATTGCCGGGGGCTTCTTCGTGTTATAGCAAAATCTGTGAGCAAACGCATAGATCCGCTACAAACTCCAATGACCACAACTATACAGGAAATAATTCCTGAAGTGAATCCCCCCCAAAAATCCGACATGACGAGTCACTGAGAGGCGCTCTGAGCGCCGGACGAGGAAAGACACGTCCGGGCGCACGAAACGCGCCAATGGCTCGCTGTGAGTCAAACAGCGGTAGTGAAACTCTGCACCAGTCCTGCCGTTACGATCCAGATGCCGCCTTCGGTCGTAAAAGTCCAATCCTTGCAAGTCCACTTCCGGGCGCTCGTCTCGCCGAACGGCTGAAAATAAAACGGCCGATTCCCGCCCATGCTTTCGAAGAAGCCGGTGAGTTCCTGCATCTGGTCATAAGTCAGGGCGTTCCACGAGAGCGATACCGCCCGCTTGATATGGTTGATGCCCTTCGGCGACGGCTGGCTGTAACCGTCGCCGAAGTCGGCTTCCCAAAGGTTCACGGTCGGCTTGTGCGCCGTGCCGGGTGACGGTCCAACGGGCGGCTGAAAAGTAGGGATAGCCATAATCAGCGACTCCGGGTGTTGAGGAACGAGCCCGGACGTGCGGCAAGCCTGATCTGCTCTTGCACCAAACCCTTCAACTGCTGTTCCACCTGCTTGCCGACGCGGGTCGCAAGGTCGGCGTTCTGGTCCTGCGTTCCGCCGCTGGCATTCACGGTGACGTTCGTATTGATGTTCATGGGGACGGCAGGCGCGGTATTGCCGTTCGCGGCCACGAGGTCAGGCTTGCGAAGTGCCGGGGCGTTGCCGACATAGCCGCCTTCGGCGTAGCCGGGCACCCGGCCATAGTTCATTGCATTGAGCGCCTGGACGCCGATGCGGCGGGTTGCCTCCTTCGTCATGACGTATTCGTCGCCGTGAACGATGCCCTTCGGCTCATACTTCCCGCCCGGTCCCGTCCACCCGCCACCGGAGAAGCCAAGCAGGGAACCGAGGGCACCGAAGAGTCCGCCCCCGGCGCTCCCCGTGCCGGTGAAGTTCTTCAGGTCGAACACCTTGCCGAGAAGGTTGTTCAAGATAGCATCGCCGATCTTGGCGAGCGCATTCGACAAGGCTTCCGCGCCGCTCTTGCCACTGGCGAGGTCGGTGATCAGTCCCTGCGTAACGTCGCGGGCGGTGCCAAGGGCTTCTTCCGCCGAGCGCCTTACGTTATCCTGCTGCTCCTGAAGCTGCTCGGCTGCTGCGGTTGCCCGCCCATATGCCTGCGCCGCCGCGTCGATCTGGGCGGCCATCTCCGGCGTTACAGTCTTATTCTGCTGCTGGGCAGCGTTCATGAGCTGCTGCTTGGCTTGCGCCGTCGCCAGTGCCGCGCCGTAATCATCAATCAACGGGTTCAATGCAGCCTGTGCGGCCGTCGCCGACTGAACGGCCTGGGTGCGAGACTGAAGGTCCTGAAGTTCACGGGCGAAACTGTCACTGCTACCCCCGCCGCCACCTGCGCGACGGCCTGCGCCCGTGCCACTGCTGGGCGTAGTGCTCGTTGCCGGAACGGGATAATCAGCAAGGGACACGGGCTTGACTGCTTCCGCTGCCGGTGCAGCCGGAAGCCGGGAAACCTTCGCTGCCGCTTCTGCGCCTGCGATCGCGCCGGAATTGGTGGCGAATGCCTTAATCTGTTCGGGCGTCAGTCCACCGCTATTCGTAGCGGGTGCGTCGAATGCGTCGGTAATGCGCCGCTGCACGGCGTCGGTCTGAACTATATTCACACCTGCAACTGACTTAACCGTGGTATCACCGGGCAGTGACCTGACGATCAGTTCCCCAAGATTTGCCGCGCCGGAAACGGTGCCAAGCCAATCCGCCCACGACTGAGCGGACTTAATACTGTTCTGAATCGAGGACGTAACCGCTTCGATTTTGGAAATCAGATTGTCGAAATTGACAGAATTAATCTCTGCCGCGACACCATCAATGATCCCGCCGAAGTCCTGCGCTGCCTTTGCCGAAGTGTTGAAGCGCTTCGCGGCGTCGATAAGCACGTTCTGAAGGCGCACAAAGCTAGACGAAATCGTAGCTTCCGAGTTTGCCACCTTGGCCTGAATGATCGGCAAGCCTGCGAGAAACGCCCTGAAGAAGGCTTCGGACGACACCTTGCCATCCTTCACCAGCTGGGTGAGGGCCGAGACGGAACCACCTGCGTCCTTCATGCCAGCGGCTACGGCTTCAAGGATCGGGCGTCCGGCGTCCAAGAGGCTGTTATATTCTTCGGCCTGAATCTTCCCGCCGCCGAGCGCCTGCGAAAGCTGAAGCAGTGCGCCGGCGCTTTCGCCTGCCGACTGACCGGAAACGCGCATCGCAAGCGACACGCCATCGGTGAACCTAAGCAAATCCTGCTGGTTTGCGCCCAAATCCTTCGCCGCGCCGGACGTGCGGCTGTAGAGCGTCACGAGGGCTTCGAGGGGAGTGGCGTTCTTCTGAGCGGATGCGAAAAGCTGGTCGTAAACGCCCTTCAATTCCTTGCCGGCGAGGCCGGTCGTCTTGAGCTGATTCTGGATCTTGATGGAGCTATCAATGAGATCCTTCGCAGCATTCGCCGAGACGAAACCAGCGAAGACGTTGCCAATAGAACTGACGGTGTTGCGAATGCCGGAAGCTGCTTTCGCATAACTCCGCTCGATGTTCTGCGCGGACTGCCGCGCGCGGGCCTCAAGGCTCTTCGTGCCACGGCGCTGCACGTCGTTGGCCTTCGCCATCGCCTTTTCGAGCTTATCAACGCGGCCTTCGATATCGACCACAAGGCCGGGTAGCGGAGTATTCATCAGCTATTCCTCAAAAGGTGAAAATCCCGGCGCTGTCCGGGTCGTTGTAGTGGGAAGCGTTGTCCTGATAGATCGCTGCAAAGTGCACGGCCATGAGCGTGGCAACCGCGCCGTCGATCTTGTCGCGGCTCTTCGACTTGTCGAACTTCCGGTTCCCGGCTGCATCGCGCACGACGGCGACGTTATCGAAGTTCCAGCGAAGGATAGGGTTGCCCGCGTGGTAGAGCTTGCCATCAATGATGGCACGTTCAACTTCGTCACAGGCCGGTGACATAGACTTAAAGCCCTGCCGATATTCGACCACCGGCAAGTCATCGTCCATAAGGTTCTGCTGCGTCTTCTGTGCGCGCCACGGATCGAACGCGATCTGGCGAACGTCATGGTCTTCGCAAATTTGCCTGATCTGGTCTTCGACAAAAGCGTAGTTGACGGACTTGCCGGGCGTCGGCGTGATAAGACCTGCCTTTACCCACTCGTCATAGTTCACGCCTTCAACGCGCGACTTCTTCGCAAGGGCGTCTTCCGGGCAGAAGAACCACGGCTTGACAATGTAGCCGCCGTCATCGGTCGGCCACGCGGCGACAACGGAAGTCAGGTCCGAAACCTCGGAGAGGTCTACGCCGAGGAAACAGGGCTTGCCCGCGAGGGCCTTGAAGTCGATAGGGATGCCGCAACGGTCGTATGTCGCCATATCGACAAACGGGCTGGAACTCTGATTCTGCCAAACGCCAAGATAAAGCTGCTGCAAAATCTCCCGCTCGATGACCGAATATTCAGCCTTGATCTTGCGTTCTTGCAATGCCTGCAATGACGGATAGCCGTTGGCAAGGCCGGGAAGCAGCTTGTGCCAGAGACTTTCGTCCTTCCAATCCACGTCAGCCGAAGCTTCGAAGATGATCGGCAACACATGCGGATCAACGATCTGGCCGAGCTGGATACGCTTGGCTGCGCTCACTTCCTTATAGGCAAGCGTCTCCTGTCCACGGCCTGCCGTCGTCAGCACCACCATGAGGCTGTTGTTCGTCTTGTCGAGCGCGGAAGACAGGACGCGCCAAAGCTCGCGATGCTTTTCAGTCGTCCACGCGTGAAGCTCATCGGCCACGACGACGGACGGCGTGGAACCGTGCTGGCCGAGACCTTCCGAAGATACGGCTTCATACCGGGTGCGGGTCTTCAGGTTGGAAATCTGGCTCTTGTATTCGCGGACGCGCGCCGTGCCGTTGTAGCGGCGATCCTGCGACACGATCAGCGCAACCTCTTCGAAGAGTTCACGGGCCTGCTTGCGGGCGAACGCTGCCGACTGAATAAGCCCGCCCGGAATCTTTTCGGGTCCGAAGGTGCAAAGCAGGACAATTGCAGCGGCGAGGGCGGTCTTGCGCGAACCGCGGCCCAACTGAATGACCACCTTTTTAATCTTGCGGGTGCCGTCCGGGTTGCGCGGACCGAAGATCGCGCGAATAATCCGCTCCTGCCATTCATCCAACTGGAAGGGGTGGCCGGGAGCCGGGTTCTTCGGATGCTTGTTCCGGCGAAGCCATTGCACGGCACGTTCGCCGTCGCCGAAAGTGTCTTCGATAGGCGAGCCGTCAAAAAGCCATTCCGGGTAGATGACCAGGGCGCTCAAACGAAACCCTCGTCATCGTCGGCATACTCGTTCATGGACGCGCGGGACCGGGCCGAAGGCGTCAAGCCGAGTTCGCCAGCACAACGGACAAGCAGAAGCGAAGACTCCTTCAGAATGCCGTAGGCGGGATTGCGCTTGCCATCGCTGATCAGGCCGTTTGCGGCAATTTCGCGCTCTGCCATCCGCAACGCTCCATAGTGGACGCAATAGGTTTCGAGCGTGGCGAGGTCGGCATCGGTGAGCACCTTGCGCTCAAGTGCGAGAATAGGCGCGACACGACGCCATTCCGCCTTCGCCTCCTTGTTCAGGTAGGAAGGCGTCTTCGGGATGCCGGTGACGGGCGAACTACCGGGAACGATGGTCGAGGGCTTCAGGCCACGCGTCATGCTGCAACCGCCTTGATTTCCAGCCCGCCGCGAATGCCGATTTCGGTCCAGCCTTTCACATTGTAGGTGCGGCCCATGAACCGGATACGGTCGGCGGTCGTCAGGCCGGTGAAGAACCGGGTGCGGAAGCTGATACTATCGGTGTCGCGCTCGCCGTTGTCGGCTTCGGCCTCATCAATGGAGTGCTGCAGGACTTCGGCGCGAAGCGTGGCAATATTCGTCCAGGCGGGAACCGGCGTCCGGTTCTCGTTCAGGGGAGCGGTCAAGCGATCAAGGAAGATCACGCGGTTAAGCTTGCCAGCGCGCATCATACGGACCACCGCATGACGGCTTCCACGGTCATGACGCCGTGGCAATAAGACTGCTTCGGGTCCGGGTCGCGCATCCATGCGACGGACGGCAGGGCGAATTCATCGATAGAGAAGCCTTCCGCTGCCGGGGCCTCCTTAAGGGCATTCATGACCGCGAAGCCGACGGCCTTCGCCGTGTCCGCGCCGTCTTCCAACGCCCAAACATGCAGGTCACACGACACGCGGGCGACATACTGCGACCCGGAAGCATGGCCGAGAAATTCCGTATGCCCGCCGCCCATGAGGATGCAGGGGAAGTCTTCGGGCCGCATCGATCCACCGCGAATGTTGTCAGGGTCAACAAGGGCGATCACTGCCGGGTCAGCGGCGAGCGCGTTGCCGATAGTTGCTTGAAGTGCGAGGCTAGGTTCGATCATCGCGCGTTATACTTCTTCAGTGCGGTTGCAATGGCGCGGCTCGTGCGCCGCTCGATCCGGGGCTTGACGGTGCGAAGGCCGGGACGAAGGAACGGCTGCGGCTCGATATTCACGGTGCCGAACTCGATAAAATGGCCGTGCCTCATCGTCTCGTTACCCACGGTCACGAGAACCTGATTGGGTCCAGCCGTGCGCTTGCCCCCGCCTTCGGCATAGGCGGGCGTCGTCTCGCCGGGGCCGGTCACGGCGATGCTGTCAATGAGCGCGCCGCTATCGCGAGACGCTTCGGCGAAGATCGCCGCTGCCGCTGCAACTTCCTCGCCGGACTTCAGGAGCGCGGGTCGAAGGGCGTCCAAAATCTCGTGCGGGATGGCGTTAAGGCGCTTGGAAAGCTCTGCCGACTGCTGCGAGAGGGTCCTATTCTGCGACACGGCCAGTCACTTCCCTGCGATATTTTCGCAAATAATCGTCAATCCCGAAGGGCAAATCGTAGGCGTTTACACCTACAAGCACGGCCTCGCGGTTCTGGTAGAGGTGCGCGGCCATCTGAAGGATAGCCTGTTCAATGTCTCCCGGAACGTCGGGTTCGAAATCGGACAACGGCAAGCCGAGATAGCCGGAAGCCCATGCTTCTGCGGCTTTTAGATAATGCTGAAGCAATTCGTCGGTGCCGCTGGCGAGATTGCCGAGCGCGTTTTCGTCCAGCAAATCGTCGGTGCCGAGGTGCGCCTTAAACAGCGAAAGGGTGACAATGGTCATGCGGTCGGTGCCTTCATGTCGGAAAAACTTATTTCGGGAATCTCGTGCGCAGTGCTCCCCCCGCCGGTCCCTAAGACGCTAGGAAAGTCGGCGACCACCCCCGGGCTGCCCTTGGTCCGCTTGAGCACTGCCAGTTCTTCCGGTGAGCGTGCGGCCTTGATGCGCGGGATAAGTCGTTCGCGGACCGCATCAGGATCGAAGCCGGCGAGATTGCACACGTCAATGAAGCCCTTTGCAGGCCGCATGATGTAGTCTCGAGCATCACGGATCATCACGACGCGGTTCTTCGCGGGTTCATTAACGAAGCCGATGCCGAACGCCTCTTGAACGGAATGCAAAAGCACCGTGCGCCAGAGGATTTCGTGACCACCTACGGCGGTTGTCGTCATATTCGAGTTCGGCGGGCGTCACGCTGCCGTCCTTCCAATGATCTGAAACTTACCATCAGGGAAGCGGCAAAGCGTGAGCAGCGCCAAGCGCTGTTCCGACTTGAAGTGATCATCGATTTCCTCGACGACAGCGCCTGCACCTGCGTTTACCTGGATCTGGTTCTCGCCACCCTGCCGAAGCCCGCAATTAAAGCGGGCGTTCAATTCTGCCGGAACGGTGATGACGGTGCCCTGCGTGAAGTCGAGGATATATCCCCAATCATCATTGTTGAGCGCATAGGTTGGCGCATTGACGGTGCGGACGAGCGTCGGATCTTTCACCAATTCTACGCTCCGGGTGCGAACGACCACCACCGAAGCGCCGCTAACACCATCGGGCATTACCCAAAGCCCGGTCTTGGCATTGCCGAAGGAAAAGAGCTTTTCGGAAGGGTTGTAGTCCTGGGAGACGATGTAGGCATCTGCGTCGGTTGCGGGCTGCTCCCATCCGAAGAAGAACCTGAGAGCCTTGCGTTCCTGATTCTGGATATAGATCGACTGGTGTTCGTCGGCGACCTTCGTCCAGACGTTTGCAGGGCAGTCGATCTTGAGGGTTTCGGTATCCATTAGTTATTCCTTTCCAAGCGCTGCTTGATTGAGTTGTGGCAAGGCGCGCAAAGGGGCTGCCAATTGGTGCGGTCCATCCGGCGAAGCGGGGCGCGGCGGATGGAAACGATATGGTCAACAAGGGTGGCGGGGTTGCCGCATGCGGGCATCGCGCAACGGGGATGCTCAAGCAGGTAGGCGCGGCTCGCCCTACGCCATTCCGCGTCATAGCCGCGTGAGGATGCCGAACCGCGAAGAGCGTCATGGCGGGCGTTACGCTCGCGGGTAGCCCTGATCTGGCAAGGGCACCGCTCGCCGTGCGGGATGATCCGGGTGCCGCACTGGCAAAGATGGGGCGGGCGACGGCTCACGCGCCACCTGCAATCTTCGCCTTCAGGGCGCGAAGGCCGGCGCGGTCAAATTCGGGGTCGAAGCCAAGTGCTTCGTTCTGCGCCGCCTGTTCGGCGTCGGGCTGCTTCTCACCATCGCCAAACCTGCCGATACAAGCTTCGTTGATTTCGGTCGGCGTGGCGTTCCAAGCGGCGTCAGGCGTCCAGCCGAGAGCGCCGGTCGCCGTGCGGTAGAGCATCCGGTAGATATCCGGCCACGCAGTTCCCTTACCGGTCGAGGGCTTCGCCTTTGGATCAGGTGCGATTTGGAGCATTTCCACGAGATCGCCGAGCGGCTTATGCACGGCCATGAAGAAGGGGGAGAGCGGCCTTCCCGAGAGAGAGGACAGAAAGGCCGCTGCACTCTGCCGGTCGGAACAGGCCGTCAGAATGATTTCAGAAATCACGCTCACGTTGAAGTCGCCCAGTGCCCGAAACAATGCCGGGAAGCCGTGACGTGCCTCAAGGATGGTAGCGGCCCGCAAAGAAGGGCGAAGCTTCACGGTCTTTCCGCCGTGCGCAATCGTCACTTCCTCATATGCGGGCCGCTTAAAGGTCATGGCTTAGGCCGCTACCGGGGCGATGGCCGGATGGCCGAGAACAGCCGTAGCGCCGAGCGCGATAGACGTGCCGCCTGCCTTCGTGACAGCAACGCGGACAAAGCGCTTGCGCTTCGAACCGATATAGCCAAGGCAATAGGCGCTGTTCGCGGCAAGGGTGGCCGGGACGGTGCCGAGCCGGTCAGCCGATGCCACGTCGGCGAAATCGGCGTCCATGGTCGTGTCGCTTTCCTGAAGCTTCACGCCGAAGTCACCGGAACCGGCAGTCGCGCCCGTATTGATGACGAAAAGCGCACTGTCGAAGCCGGCGAGATCGACGGCAGAACCGGCAACCGCTGCGGTGACAACGGCAGGGGCAAGCGCCTGCACGGCCTTGTTATCGTGATAGGTATCCTTCATGGCCGGTTCTCCTTAGTTGGCGCTGATCTTGAGGAACTTGACGGCATTGAAGTCGCCAGCGCCGCCGCCGACGCGCTTGTAGCTGTCGAAGACAATCCAGCCCTTCCGCGTGGTCGAGTCCTGTTCGACGCGGATGCCCTGCCGGTCTACGATCACGTAGCCCTGCCGGAAGTCGCCGAAAGCAATGGGATGAGCGCCAGAGCCAATGTCCGGCATGCCGTCGTCAATCTCGACGCGGTAGCCAAGAAGCGGATGGTCGACGCCTTCGATCAGGTTGCCAGTCGGCGCCCAAAGATAGCGTCCGGTCGCGTCAACGATCTGGCGAAGGCGGACAGCGGTGTTGCCGTTCATGAGGAAAACGGCGTTGCCCTTATAGGGGCGGCGAAGCGTGGCGACGAGCTTGATCAGCGCGTCGGCAAGCTGCTTGTCAGTCGGCGCGGTCGCGCCAACCGGGACATACTGGAACTTGCCCCATTCGCGGCTGAAGTCCTTTTCGGAAGCCGTGGGGTAGGTCAGCAAGCCCTTCGGGGAGTTGTCCGCGCCGTCGCCGGTCAGGAAGGATTCGCCTTCGGTTTCGGCGAAGTCCTGAACAGCGTTGTTAATCAGCCAGGCCGAAAGATCGACAGCGGCGTCTTCAAGCAAGGTGCGGGTGGTGGTCGGCGCGGCATAGAGTTCGGCGGTGCCATACGAATGCTTGATCAGGTCCGGCGTTGCCGTGTCCTGCGGACGGTCGGAACGTTCGGAAACCCACTTCGCGCCGCGCTTGCCGAGCGAATAGAAACGCTCATACTTGTCGTTGGCGATGCCGATGACTTCAGCAAGGCCGCGCATCGGCGAGAGGTCCGTAAGCAGGTTGCGGATGGTGAGATCCGTCGTCGGCAGGACGAAATAGCCGCCGTCAACATTGCTGTCGGACGCGATGGCCTTCACTTCGGCAATCGAGCCGGTGCGCATGAACGATTCCAGCGCCTTCTTCTCGCTGTTGTCGTTTGCGCCGTCCGGGTGGTTGTTGTTCGCGGGGCGAAGGCGGTTCGCCTTGGCTTCGAAGGCATCCATACGGGCCTTCAGCGCCTTGATCTCATCCGGGGATGCAACCGGGTCGGACTTCACTTCCGGTTCATTCTTCAGTTCGTTTTCCATGCTCATTCCTTCGATGATGGACTTAACTTCGGTGGTGCGGGCGTCCGGGTGGACCGGGCGACGGCAAAGCGAGATTTCGGTGACGGTGATGTTCGTAAGGACACGCCCGCCTTCCGGGCGGGCCTTGTGTTCATGGCGCTGGTAGCCGATGGACAGGCCAGACATGCTGCCGGCGATAAGGTGACGGCGAGCTTCGCGCGCGGGGCCGACACCTTCGAGAAACAAGCGGCCCTTTACCTCAAGGCCCTTGTCGGTGACGGCGAGCGAGTTCCAGATGCCCACGGCTCCACCGCCCTGCTCATGCTCCATCAGCATCGGGACTTCAGGAGCGAACTTGAAAGCGGTCGGCTCGATAAGGTCGCCGTAGCTGTCCGGCTTGCCGAACGGCCACGCGATGCCGGTCACTGTGCCCGCGTCGTCAATCGAGACTTCGGCCTTAGTTTCGAGAACTGCCGCGTTGGTCATTCCGCAACCTCGCGAAGACCTTCACCTTCGGGAATTTCGGGCCTGCCGTTCCAACGGGCGTCCAGAATGTCGAGGGCGAGCGGGAAGGTTTCATCGAACGGACGGTTCTTCGCGTATGCATCAACAAGCCGCTGCGCTTCTGCCGGATTGGTGCCGCCGCCGATAAGGGCAAGCCGGATTACATGCTCCATGTCAGAGCCGTGAAACTGCGATACCGTCATGCGAAGAAACAGCGCACCGATACCGACGCCAGTCTTGTGCTGAAGCTCTTCAATCATGGGGTCGGTGAGAGCGAAGGTCTTTTCGCCGTCACCGAAATAGGCGGTATGGGAAATCATGCTGCTTCCTTCGGCGCTTCCGGGGCTTCCGGTGCCGTGTTCGCGGTCGTGGTAATATGCGGATTGGAGAGGCTGTCGCCGTCCGGGTGCGGAGGCATATTGAGCATCCGGCGAACCTCGTTCGCCGTGTAGACGCCCATGCTGCGATACTGGCCGAAGGCGGTCGCCTTCTTGGCGAAGTCGGTCGTCAGAAGGTCATCAAGAACCGGCTCGATATAGAACTGGTCGCGCTCTTCCGGCGTGAGCAATACCCTCGAATAAGCCCACGTCCACGCCGTCAGCCAAGGCTTCAGCGTGACGGTGTAGAACTGGCGGGCCATCTCTTCGGTGTTTGACCACGTTCCCCGCGTAAGCTCGAAAAGCATGGTCGGCGGGATACGGAAGGCGCGGCCAATTTCGCGGATTTGCTCAAGGCGATTTTCGGCAAACTGCGCATCCGCAAGCGTTGTTGCGATCTGCTTGTAGTCCATGTCTTCGTCAAGGATCGCGGTGCCGCCCGAATTCTTTCCGCCGTGGGTGGAAAACCAACTATCGGCGACCTTCTGCTTAGGATCGGGGTCCAGGCGCTTCTTCGCCAAGATGATGCCGGAAGGGCGACCGCCGTTTGCCAGCAGTCCGGCCATGTGGCGTTCGAAGGCGATAGCGAGCGCTATAGCTTCGCGGGCCGTCTTGATGGGTGCCTTGCCGCCGAGCGGCTGAATATGAAGAATGTCGGTAAATTCATAGCGCCGCTGGCCGCGATTTTCAGACACGACATAGTAGGGGGTGCCATCATCTTCGGTGCGGTTCTGCACGGTGCCGGGTTCAAGGCGCTGAAGAGCGATAGGCCGGTCATCGCTTGAGCGAATGACGTGCGCATGACCATTGTCATGAAGCATCGCGTCGACAGAAAGGTTGATGCGGAATTGCGCGGCGCTGGTGAAGGCGTCGGCTTCGTCATGAATAAGCTTGTAAGCTGGATGGTCGCGGGCGGTCTCCCGCGTGTCGCTCTTGTATAACTTGAACGGAGTGTCGCCGGTCTTCTCGGAGATAATGCCGATAGCGCAGTTGACGGCAGGAACTTCCATCGCGTTCGTGGGTCCAACGTGAATATCGGACGCGGTGGAACGTATATCGAGAAGAGTTAATGCTAGCTGTTCATTGAGAGCATTTGCTTTCTGCTCGTTTCCGAAGCCGAATGCACTCTTCAAGCCTGCAAGCATCAATAATCCGTCCTCAATTAACTTGAGACGGATTCTCTCACATTAGGAATCTCTTGTGAATCCTATTGTTCAATAAAAATGAACGAAAATAGATAAATCGTTCACTTTTATTTCTCTCTAGCAGTGTCGAGCATCAAATGATGCTGCGAACCTTGTCCATCTGGGCCGCGATTTCGAGCAGGCGTAGGTCGCTACCACCGTAATCATCTGCGGACGATCCGGACGAGCGCCCGGTGATGTAGAGCGCGGCCTTTTCAGAGACGCCAGCGAATAGCGCGTCTTCGAACAGGTGCCGGAACCCGTGGTTCGGCGGGGGCATGTCCTCGCGATGCGGAAAGACCTTCTCTTTGATCCATTCGCGAATGCGCTGGTCTTCGTTCTTGCCACCGGGGAAGAGCTTGCCGTCAGGCTGCGCTTGCACCCAATCAATGAAGCCTTCTTTTATTAGCGCCCGGTGAACGGGAACCTTCCGGCCCTTGTGCGTCTTCGTGTCCCTGCCGTCGCCGACGCGGATATGAAAAAACCAATGGGCTTCCAATTCAAATATGTCTGCCTTCTCAAGAACGGTGATTTCATTCACCCGCGCGCCGGTGTGCGCAATGATCCAAGGTATCCAGCGGTTGCTTGCACGGGTCGCGGTGCGGGCACTCGTCAGAAGGTGCCGGGCATCTTTCAGGCTATAAGTGCGGTCGGCGCTATCGCCCTTCGTATGGGTCGGCATTTCGAGGAAGTCGAAGGGCGTTCCCTTCGGTGTCATCGGGAACAGCTTGCCCCTGCATTGATCCTGCCCCCATCCAAGGATTGCCCGGATCGTCGCCAGTTTGTCGCCGATGGTTTTTCTTGAAAGCTGACCTTCCGAGAGCATGGCGTTGCGCCACGCTTCACCTTCCTCAAGCGTCACAGTCGCAATCTTTTTGCTGCGACGATGATGTTCGAAGTCGTGAACCGTATTCCGGTATTTGATGAGCGTCGAGGCGGATTTCTGCCGCCCGCCCAATCCCATCGCCGTAAGCCGTTCCTTCTCTTGGATTACCTGCTCGAATGTCAGGGCGTTGAATTCAGCGTCCGTAACTTCGGATGCAATCTCGTCTTGCTCGACGGCAGCGGCAAGCATGGGGTTTTCAGGCTTACCGGTAAAATCACCTTCGTCACGTTCGGCGACACGGGCGAGGGCTTCCAGTTCCGACACACAGAGGGCGCGGGCGAGCGTTCGCCATTCCGAGGAACCTTTGACGACGGTCGTGTTGCCTAGGCGTCTGTAGCGCTCAATCCGTTTTCCGACGAGCGGTTCGAGCGCGTCGTCATTGAGGTTTCCAGCTATGCCAGCACGCAAGAGTTCAACAAGCACGTCATCCACAAGTCCGCTTGCGTGCCGATCGTCCATGTTCCGAAGTTCAGTATCGAAGGCGATCCGCTCGTTATAGTTCCGAAGCGCGATTTGATCGACAGGCAGCGGGTAGCGGCCCGGCGTGATGGCCTGGCCTTTCGATACCTGGGCCTTGCGTTCGGCGACTGCGATCCGCGCCTGTAGTTCCGCTACAGCGGTGTGAAGCTGCGCAAGCGCGGTCCGGCGGTCGGGGCCGAGCGGCGATCTAAGTTCGTTCTTATTTTCGAGGAAGGGCCGAAGGTCTTTCGGAATCACGATGCGGGCGAAATAGCGACCATCGCGATTGAGCAGGTATCTCGGATTCTTTGACATTCCCAACTTCCATTTTGTTACATGTTTTGTAACAGAAAGTTGGCCGAAACCCCAAGAAAATCAGCTTCTTCAGCGTTTTCAGGGGCAAGCGATGGTCGGAGTGGAGAGATTCGAACTCCCGACCCTCTGGTCCCAAACCAGATGCGCTACCAGGCTGCGCTACACTCCGTGCCGTGAATGGCAGGGGAGATACACGGTTCACTTCGCCGTCGCAACCCATAAATCCAGCTCTGGCCGAAGATTCCTCGCCGGACCTGTGGATGCAAGGTAGAAAATCTTGATAGGCCGCCAGCCTGCTTTGCCGGGCGCACGAGTTCTGTCTGCCTGGCGCAATTGTACGTCAGCTCAAGGCTTCCTTAGCCGTGGGCGATCAGGCCGATATATTGGGCGTAGCTGATGACAACGTAGAAGAACGCCGTGATGCTAGCACCTATGACAATGGCATTGAACTCCGCCGAACGTCTCATCATGAGCTGCAAATCCACTGAAATCGATTGAGCCGTAAACTAGTGCCGAGGGAAGGCAGCGGCAATCAGGTTGTGATCACAAGGTGAGACAAGCGGAAATATTGCGAAATCCGGCCGGCAATCCCCGGATTTGAAAGGGAGGCGGTGTCGGCCGTCATGCGAACGGAGTCGGCAGCCGTGTCGCACACCGCCTTTTTTCTTGAAAAAGCGGGGATTCGCGGCTAAGCAAAATTCATCTCTTGGCATTATCCTCAGCGATGATGCCAAGGCCCCGAGAACGGGCCAGCAACGGGATTGAGACCATGAAAGGCGGGCGCTGCCCGCATAGTCGGAGACGCTGCAGTCATGTCTGCCAAGATCTATCGTCCAGCCAAGACCGCCATGCAGTCCGGCAAGGCCAAGACCCATCTCTGGGTGCTTGAATACGATCAGGAGTCGGCGCGCAAGATCGACCCGATCATGGGCTACACTTCCTCGGGCGATATGCGCCAGCAGGTGAAGCTCACCTTCGAAACGCAGGAACTCGCCGAAGCCTACGCCCAGCGCAACGGCATCGAATACCGCGTCATCGCGCCGAAGGATCCGGTCCGCCAGGTCGTCGCCTATCCGGACAATTTCCGCTATACGCGCACGCAGCCCTGGACGCATTGAGATTTCTGGTCCAGATATGCCGCATCCGGTCGCGCTCGTGAGACGCGGCCGCCGGACGGCCCCTTAGCTCAGCTGGATAGAGCACCTGCCTTCTAAGCAGGTTGTCGCAGGTTCGATTCCTGCAGGGGTCGCCACTATTTCAAAGTCCATACCGAAGACATGGTAACAGCTGGTTTCGGCCTCTGCTGCGGACTCATGTCTAGTCCAGGAGCGTGAGACTGACGGAAATGCGCACCAATAGCGCATCGTCGCTATTCTCTCCTAAGCTAGTTCGCGCAACTCGTTCACATTTCGTTCCCGTGATGCTAGTCTCGCCCTCAGTGAAACGTGAGGGGGCATTTATGGATCAAAGGGCCAAGTTCGAGCGACTGAAGGCACTTCACCAGGGGGACAGTGCCTTTGTCATCCCGAACCCGTGGGACGCCGGTTCGGCTCGCCTTCTCGCAAGCCTTGGCTTCGAGGCGCTCGCGACCACCAGCGCGGGTTACGCCTTTTCCAAGGGCAAGTTGGATTCATTCGCGAGCCTTGGACGGGATGAGGTTCTTGAGAACGCTGCCGAGATCGTTGGCGCTGCTGATCTTCCTGTCTCTGCCGACCTTGAAGATGGCTTCGGTGCGTCGCCAGAAACCTGTGCGGAAACCATCCGCTTGGCATGCGAAACCGGACTTGTCGGCGGATCGATCGAAGACGCAACAGGCAATCCTGCTGCCCCGATCTACGAGCTGTCGCAAGCCGTCGAACGCATCCATGCTGCAGCAGAGGCTGCGCGTGGTCTGCCTTTTCTTCTCACGGCGCGGGCGGAGAATTATCTTTGGGAGAGACCCGATCTTGACGACACCATCGAACGGCTGCAAGCATTTTCGGCCGCTGGGGCAGATGTGCTTTACGCACCCGGCTTGCCTGACATCGAGGCAATCAGAACCGTTTGCGCGGCCGTGGACAAGCCAGTCAACGTGGTCATGGGTTTGAAAGGGCGAAAATACTCTGTTGCCGAACTGTCGAGCGTCGGCGTGCGCCGCGTGAGTGTCGGCGGCTCTTTCGCGCGGGCCGCTCTGGGGGCGTTGTTGCGCGCCGCTATGGAGGTGAAAACAGCGGGTACGTTTGGATATGCCGACGACGCGCTCTCTGCTGCAGCGGTTAGCCACCTGATGTCACGGGAGAAACGCCAGGACAGGCTATGAGCTGGGCAGAAACCAAAGTCGCGGCGGATTTGCACTGCTCGAGCGTTTCCGCAAATCCTTCTCTGGAATCTGCTCGAGCGATCTCAGTGAACGACCGAGAGGTAGCGGATGCCGGCTTCATCCGCCGCTCTGATCAGCGCTTCCCGCGCGGCGTCCGGGGGCGTGGTTCCGCGGATGGCGTCGAGGCATGTCCTCACCGCCACGACATACTCCTCGCCGTCATCACCCGGCCATTCGGAAATGAGCATGGTGGCTGCATCGCCGAGGGTCCTGACGACGCTGTATTTTTCCTGGCCGCTCATGAGGATCCTCAGTGCGGGAAACTGATGCGATGTGTACCGATTCATTACATTTGGTCCTTGCATCATAAACATATTAGTAACTCTGCAAGAAGCGTTCCAACTGACGCCGCAGGATGATTATCTTCCATGAATCGGCAAAAGCAGTCGGCAACGCTTCTCCAGGCAGGAAGCCGGGAACTTGTTTTCAACACATTGCCGTCATATTGATTCATCTATCGAAATTGCTTGTCGAGCTTGGTTTGCGCTCTGGCGTCCCGGCTGATCTTTCCTTTCAAAATCGATTTCTGCGCCGTCCGGCAGCCGCCGGAGGGTGATTGTCATGCCGGTTGAGGGGGAATCGTCATGAATGCAGGCAAGTCTTTGATCGACGCAAGCCTCCGCCAAACGCACATCGTCGCTGCCTGATATGCCTCCGGTCTTTGACCGCGGACGTCAGTGCCGGCATTCCGGCTGGGTTTTCCAGCCATTGAGGAGAATTTCATGAGCAAGACCAATGCAGAGGCACTGTTGCGCAAAGCGCAGCGCCAGGTTGCCAATACCAGCGGCGGTGGCCATCTCGGCGGCACCAACTTCGGTCAGGGCAATGACGCCAAGACTGCGTCCGGAAGCGGCAACCGCCCGGCGGGCAAGGGCCGGCCGCCAGCTGGCGGCAAACGCTGACCATAGCGCCGCGCGCCTTTTAAGACGCGCAAAGAACGCAGTGAAGACAGCCCCGCCAAAATGGCGGGGTTTTTGTTTGATAGCCTCTGAAGGGCTCGGCTCTTCTCAGCGATCCAGGAAATGATCGAAGTACACCTGGGGGATGGGGTAGGCGTAGACGCCTCTGTCGACGAAGCTGCGTCTATAGAGGCAATTTCTGAGAGCCGCGTTGTAATGCAGGCTTCGCGGGTCCGACGAGCCTCGTCGCCATGAGCTTGCCTCCGGCAGGCAGTGTTGGAGTGCCTTGTCGTATCGAGCCAGGAGGACCGGGTCGGTATCGACCCGAATGCCGCTATAGGATTGATAGTTTGAAGGAGATTCTGCGGCCGATATACAGGCCAGGCTGAGTGCAACACCAACGACTGCCATCAATCTCATTGGACTTATTTCTCCCCTGATCGATTATCGAAGTATCGAGCGAAACGTTATTGGTTCCAGCGCCTGCGGGCAAGGGGACGCCTAGTTCCGGCGTCCCCTTGCCCGAGACGTTAACCGCATAGCTGCCGCCACAGAAATGGCGGCAGCTATGCGTCATCGATCAATCCAGGAGGTCGGCCGGTACCTTGCCGCCGTTTTCCGAAAGCCGCTTCATCAGCGCCTTGTGCAGCCACATGTTCATTTTTGCGGAATCGTTGGTGTCGCCGGTATAGCCGAGTTCAGCGGCAAGTTCCTTGCGCTCGGTCAGGCTCGCATCCATCCCGACGGCTTTCATCAGGTCTACGATCGAGCGGCGCCAATCGAGTTTCTGACCGCTCTTCTTTACCGCCGCGTCGAGGATCGGGACGATATCCACTGTAGCTGTGCCTGGTGCGGTGGCCGTGGGGACCGGCTTGCTCTGCGGGGCAGGGGCCGCAGGCGCCGGACTGGGGGCTGCCGACGGTGAGGCAGGCGCTTGCGCCGGCTCTGTCTTCGGTGCGCCGGCCGCAACGGGTTCGGCTGCTTTCGCCTCTCCGAAGATTGCATGTTTGATCTTGTCGAAAATGCCCATTCTAAACCTCCAGTTCCATCAGATGAGAACATGCTGAAACATGGACTTCCCGCTTGATATATCCAGGCGGAAAGGGAGTTATTTTTAGTATGTTCAATCGCTGATGGAAAAGCCGATGCCCGCGACAGCTGTGGTCGATCTCAAGGAAAAGCTGCCATCGGATTATTCGCCGACGGTTGAAGATTGCTGGCGCAATCGGAGGGAATACTTCAGCAGCCTTCCGGCTGTCAGCCGACATGCCCGCTTTTGGCACAGTCGGTCGGGCGATTCGGAATTCAATAACCGCGAAACATCATTATACGCCGCCGAACCGACTTGCCGCGCCCTGTCGTGAAACATTCGTCATCTCTTCACGTTGCCTTGCCATGGACGTCCTGTAGTTGCAGGCGTCCGATCGATCGAAGGCACGCGCGTTCCTCGATTTTCGAACTGCCGGGCAGATCGTTCGGCGGAATGAAACGGAGGAAAACATGACCCATAACCACAATAGACTGTCCTTTGCAGTGCTCGCATCCACCTGCGCCCTCCTTGCCATGCCGGCAAGCGATGCGCGTGCGATCGATGTCGGTGTGTCGGTGAATGCAGGCAATGCCGTCAGCGCCGATGTCGGGGCTTCGATTGGCAGTGGGAGCGGCATCAGCGCCGATGCCAATGCTTCCGTCGGCGGCTCGAACGGCGTCAATGCCGATGCGACCGCCAATGCCGGCGGTGGCCGGGGCATCGATGCCGATGTCAATGCCAGTGTCGGCGGCGGTCGCGGCGTCAACGCCGATCTCAATGCACGCACCGGCGGTTCCGATGGAATCGATGCCAACGCCACGGCCTCGGTCGGCAGCGGTAATGGCGTCGATGCCAACCTCGCCATCGGCAGGGTAGATGGGGCAGGCAGCAGCGGCAGACCCGCGGGCGAGCGCAGCCTCAGCGCGTCCCAGATCCGCACCCTGGAGGCCTTCCAGGCGAGGCCTGTCAATGAGCAGCGCAAGATGTTCGTCCGTTGCGCCGATATCTCCGCCTCCGGCGGTGATTCCGGTCTTGCCGGTCTCTGCAGTCTACTGCAGGCGACAGCCTCCCGCTGAACGTGAAAGACCCGCCGAACCGGAAGGTGCGGCGGGTCTTTTCAAGGGACGCTTCGATCTAGTGCAGGATCTGGCTGAGGAACAGCTTGGTGCGTTCATGCTGCGGATTGTCGAAGAACTCGGCCGGTGAATTCTGCTCGACGATCTGGCCCTGGTCCATGAAAATGACCCGGTTGGCCACCTGACGGGCAAAGCCCATTTCATGCGTGACGCAGAGCATGGTCATGCCTTCCTCGGCAAGACCCACCATGGTGTCGAGCACTTCCTTGATCATTTCGGGATCGAGCGCCGAGGTCGGCTCGTCGAACAGCATGATCTTCGGGTTCATGCACAGCGACCGGGCGATCGCCACGCGCTGTTGCTGGCCGCCGGAGAGCTGGCCCGGATATTTGTTGGCCTGCTCCGGGATCTTGACCCGCTTCAGGAAGTGCATGGCCACTTCTTCGGCCTGCTTCTTCGGCATCTTGCGGACCCAAATTGGCGCCAGCGTGCAGTTTTCGAGGATCGTCAGATGCGGGAAGAGGTTGAAGTGCTGGAACACCATGCCGACTTCGCGCCGCACCTCGTCGATCTTTTTCAGATCGTTGGTGAGCTCGGTGCCGTCGACGACGATCTTGCCTTTCTGATGCTCTTCAAGGCGGTTGATGCAGCGGATCATCGTTGACTTGCCCGAACCCGACGGGCCGGCAATGACGATGCGCTCGCCGCGCATGACCTTCAGGTTGATGTCGCGCAGCACGTGGAAATCGCCGTACCACTTGTTCATGTTGACGATCTCGACCGCCACTTCCGTTGCGGAAACGGTGAGCTTTTTCGCTGGAGCTTCAGCCATGACTATTTTCCCTCATTCTTATCGTTGTAGCGTTCTTATCGTTTGTGGCCGGTATCGAGATGGCGCTCCATGAAACCTGAATAGCGCGACATGCCGAAGCAGAACAGCCAGAAGATGAAGCCCGCGAAGATCAGGCCCGTGATCGGCGTGACGGCGCTTGCCCAATTGGCATCGGAGAAGTTCAGCTTAACGATGCCAAGCAGGTCGAACATGCCGATAATGGTGACCAGCGACGTGTCCTTGAACGTTCCGATGAAGGTGTTCACGATGCTCGGGATGACCAGCTTGATGGCCTGCGGCATGATGATCAGCCGGGTCTTCTGCCAATAGCCGAGGCCGAGTGAATCGGCGCCTTCGAACTGTCCCTTCGGGATCGCCTGAAGGCCGCCGCGGATCACTTCAGCCATATAGGCCGACGTGAAGATCGACACGCCGATCAGCGCCCGAAGCAATTTGTCCACGTTCCAGCCTGTCGGAAGGAAGAGCGGCAGCATGACACTTGCCATGAACAGAACGGTGATCAGCGGAACGCCTCGAATGACCTCGATGAAGGTAACGCAGAGCATCCGGATGACGGGCATCTTCGAGCGGCGTCCCAGCGCAAGCAGAATGCCGACGGGTAGGGAGACGGCAATACCGACAAAGGACAGAACGAGCGTCACCATCAACCCGCCCCAGAGCGGAGTCTCCACCACTTCGAGGCCGAAGCCGCCGTGAAGAAGCCAGAAGGCGATGACCGGCAGGACGGCGAACAGAAGAATGGCGTTCAGGCCCTTGCGCGGCGCCGACGGGATCAACATCGGAACCAGCAGCAGAATGAACAGGATCCCGACGATCGCCGGCCTCCAACGCTCACCGAGCGGATAGCGGCCGAAGATAAACTGATCGTACTTGGCGCTGATGAAAGCCCAGCATGCGCCACTCCAGCCGTCAGGCTGGATGCCGCCCTGGATCGTTGTCGCGCAGAATGTGCGGTCCGGGCCGGACCATACGGCCTGGATGAACAGCCAGTTGACGAGATGCGGCACGGCCCATGCGATGAGCGCAAGAGCCAGGATCGTCAGGATCACGTCCTTCGGGGTTGCCAGGAGATTGCGACGTATCCAGGCGACGGCTCCCCTCTCGCCGGGGGGCGGCGGTTCGGCGGCAAGGATGGACGTTCTGACAAAGGGTTTATCGGCGACCGACATCTTATCTCTCCACCAGTGCCATCTTGGCATTGAACCAATTCATGAACAGCGACGTGAGAATGCTCAAGCTGAGATAGACGATACCCCAGATGCAGACGATCTCGATCGCTTGACCGCTCTGATTGAGGATCGTGCCACCGACGGCAACGAGATCCGAGAAACCGATCGCGATGGCGAGTGAGGAGTTCTTGGTCAGGTTCAGGTACTGGCTCGTCAGCGGCGGGATGATGATGCGCAGCGCCTGCGGCACCACGACAAGTCTCGTCACGCTCGACGGATGCAGCCCCAGCGCGCCGGCTGCTTCGGATTGTCCCTTTGGAACGCCGCGAATGCCGCCGCGAACGATCTCGGCGATGAACGAGGCGGTATAGAAGGACAGAGCGAGAAACAGCGACATGAATTCGGGGCCGACGACGGAGCCGCCCGTGAGGTTGAATTTTCCGGCGACCGGAATGTCGAAGGTGAGCGGAAAGCCGGATACAACGAAGACCAGCAGTGGCAAGCCGACGATCAGCGCGATCGCCGTCCACACAGTGTGGAACGGCTGACCGGTTGCGGCTTGGCGTTTGTGGGCCCAGCGCACGATGATGATGGTGGCGACAATCGCAATCAGCAGGGCGATGCCGACCGCTATCATGCCTGTCTCGAAGATCGGCTTCGGAAAGGCTAGTCCTCTGTTGTTGAGGAACATGTTGAACGGCAGGCCTACCGACTCGCGCGGCTGCGGCAAGACGGAGAGAACGCCGAGATACCAGAAAAAGATGACGAGCAACGGCGGAATGTTGCGGAAAACCTCGACATAGACCGTGCAGAGCTTGGCAATCAGCCAATTGTGCGACAGCCGACCGATCCCGATCAGGAAGCCGATGATGGTCGCCGTGAAGATACCGGTCACCGCCACCAGCAAGGTATTCAGAATGCCGACGAGAAGTGCGCGCGCATAAGTCGAGTCACTCGAAAAGCCGATCAGTGACTGGCCGATTTCGAAACCGGCGCGGCCGCGAAGAAAGCCGAAACCCGATGCCGTATTGCTGCGCGCAAGGTTCACGGCCGTGTTGTGGGCCACCCACCACACAAATGCCACCAGAACAACGACTGTTAGAACCTGGAAAAATATGCTCCGGTATTTCGGGTCGTACATTGCCGACCGGAAACTCCAGCCGGTGTCATGCAAAGGTGTCCTATCCACAGCCCCATGCGTCATGCCGCGCCAATCCCCTTGTGCCCGTTTTCGGGCTTTCTTTTTAACTTTTTGGAAGCGGGAAGGCGGCTTGGCCGCCTTTCCGGTATTTCATTGCCAGTGCTCGATTAACGAACCGGCGGTGCGTACTGGATGCCGCCCTTGTTCCAGAGAGCATTCAAGCCGCGTGCGATCTTGAGGGGGCTGCCCTGGCCGATGTTGCGCTCGAAGATTTCGCCATAATTGCCGACGCCCTTGATGACGTTGGCGGCCCAATCATTGGTCAGGCCGAGATCGGTACCGATCTTGGTGTCGGTCTCGCTGCCGAGGAAGCGCTTGATATCAGGATTCGGCGAGTTCTTCATCTCATCGACATTTGCCTGGGTGATACCGAACTCTTCGGCATTGATCAGCGCATAAGCCGTCCAGGAAACGATATCGAACCACTGATCATCACCCTGACGGACGGCCGGGCCAAGCGGCTCCTTGGAGATGATCTCAGGAAGGATGACGTGTTCGTCGGGATTCTTCAGCGTCAGACGCAACGAATAGAGACCGGACTGGTCGGTCGTGTAAACGTCGCAACGACCGGCGTCGTAGGCTGCGTTGACCTCAGGAAGATTTTCGAAGACGACCGGATTGTACTGTAGATTGTTCGTCTTGAAATAATCGGCGAGGTTCAGTTCCGTGGTCGTGCCCGACTGCACGCAGATTGCGGCGCCGGAGAGTTCGAGAGCCGACTTCACGTTCAGGCCCTTGCGCACCATGAAGCCCTGGCCGTCATAATAGGTGACGGGACGGAAATTGAAGCCGAGTGCGGTGTCGCGATTGATCGTCCAGGTCGTATTGCGCGAGAGGACGTCGATTTCGCCGGACTGCAGAGCGGTGAAGCGTTCCTTCGCATTTGTCGGCGTGTACTTGACCTTTGTGGGGTCGCCGAACACGGCCGAAGCGACGGCCTTGCAGAAGTCGACGTCGAAGCCGGCCCAATTGCCGGAAGCGTCAGGTGCGGCAAAGCCTGTAAGGCCGGTGTTGACGCCGCACTGAACGAAACCCTTTGCCTTGACGTCTGAGAGAGTGGTGGCGGAGGCGGCCGAGGCGCCAACTGCCAAAACTGCTGCGCCGATGGCGGCGGACAGGAGCTTATTTTTCATTTTCCCAACCTTTTCCGTTGTCTTATCTTTTCTTGTGGGGAGTGCGTGCCGACAATCCGTGCGCCTCCCCATTGTCTCGACACCCTCCCGGCGCGAGTGGGTCTATCACAGTCGTAAATGTCACTATGGTCAAGTGTCGCGCTCAATAATTGCGGCAAAATTCAGAATTTTGGTAGATCGCGCTGCATTCATGGGCACTTGGCCATGAAATAGGCGTCTGGTTGCGGAAAAATAACGCGAAAATCGACAATTCCATCATTCCAGGCCGTCGCTTGCATCAATCTTCAAGGGTTGACCGGAAACGGCGCGACGTCCAAAAGTCTGGTTTCGCGACCCTTCGCCAAAGGCTATTTCCGACATGAACGACAAAGACAGCTTGCTGCAGAATGCTGGCATCAACACCCGCCTGACCCATATCGGCAACGACCCCTTCGATTATCACGGTTTCATCAATCCGCCGGTCGTGCATGCCTCGACGGTGTTGTTTCCGAATGCGCGGGCGATGGAGACGCGCACGCAGAAATATACCTACGGAACGCGCGGCACCCCGACGACGGATGCGCTCTGCGAGGCGATCGACGCACTCGAAGGCTCGGCCGGCACGATCCTTGTGCCGTCGGGCCTTGCGGCCGTCACCATTCCGTTTCTGGGTTTCGTCGCAGCCGGCGATCATGCGCTCGTGGTCGATTCGGTCTATGGCCCGACGCGCCATTTCTGCGACACGATGCTGAAGCGCCTCGGCGTCGAGGTGGAATATTACCACCCGGAGATCGGCGCCGGCATCGAGGCGCTGTTCCGGCCGAACACCAAGCTCGTTCATACCGAGGCCCCAGGCTCCAATACTTTCGAAATGCAGGATATTCCGGCGATCTCGGCGGTTGCGCACCGCCACGGCGCCGTCGTCATGATGGACAATACCTGGGCGACGCCGGTCTATTTCAGGCCGCTCGATTACGGCGTCGACATATCCATCCACGCATCGACGAAATATCCGTCCGGCCATTCCGATATCCTGCTCGGAACGGTTTCGGCCAATGCCGAGCACTGGGAGCGGCTGAAGGAAGCAAACGGTGTGCTCGGCATCTGCGGCGCACCCGATGATGCCTATCAGATTCTGCGCGGATTGCGCACCATGGGCCTACGCCTCGAGCGCCATTATGAAAGCGCGCTTGATATCGCGAAATGGCTGGAGGGCAGGGAGGATGTCGCCCGCGTGCTGCATCCGGCCCTGCCGAGTTTCCCCTCGCACCATCTCTGGAAGCGCGATTTCAAAGGCGCCAGCGGCATCTTTTCCTTCGTGCTGGCCGCCGACGGCCCCGAGAAATCAAGAGCAAAGGCGCATGCCTTCCTCGACGCGCTCAGGATTTTCGGTCTCGGCTATTCCTGGGGTGGCTTTGAAAGCCTTGCTTTGCATGCCTATCTCAACGATCGCAAGGTCGCCAAGGCGCCGACGGACGGTCCGGTCATCCGCCTGCAGATCGGCATCGAGGACGTGGCCGACCTCAAGGTCGATATCGAACGGGGTTTTGCCGCGGCAAGCGCGGTCTGACCAGAGACGCTTTGTCGCCGATGGCGGCGCCTCACGGCAGATCGGCAGCCCGATAGCCGTAGATCCAGTCGAGATCTGCCGCGAGGCTTAGCGGCGGCTTGAGGCCGAGCACGAGATCGCGGCCGATCCGGATCGGCCCTTTCGCATGATAGGCGAACCGGTTGAAGGCGCCACGCTGGCGAAGCCTCGCAATGCGCGGCGCCCGGTGTCTTTCGAACCGCGCCAACGCTTCCGCCACGGGGCTATTCGAAAGAAACGCCGCCAGTTCGTAGGCGTCTTCGATCGCCATCGCTGCTCCCTGTCCGGCAAAAGGCATCATCGCATGCGCGGCGTCGCCGATCAGAACCGTCTTTCGGCCGTCCTGCCAGGCGCCTGATGTGGTTTCGAACAGCGGCCAGAAGGTCAGCTTCCTATTTCTGTCGAGCAGCGAGACGATCGCTGAGTTCCAGCCGGAAAGGCGCGCCCGCAGCTGCGCCCGCTGCTCGGCCGTCGCTTCGCTTTGCCAGGCCTGCGGCGCGATATTGCCGGCGGTGATCGCCACCATGTTGAAGCTGCCGGTCTCCCTCAGCGGATAGCAGACGAGATGCGCCGAACCGCCGAGAAAAGCCGAAACGCTTGCCCGGTCGAGGAATCCGGGCGCTTCCGTTTCGGCAATCGTAAAACGGTAGGCGATATTGCCGGAAAAGCGCGGCGAGGGGCTGCCCGGAACGAATTGCCGGAGCTTCGACCAGACGCCGTCAGCGCCGATCACAATATCGGGCGTCCGCTCGAAAGGCGGTAGGGTCGAATCTATCCGCACGCCAAGGTGAAGCCGGCAGAGCGGATCGGCCGTGACCGCATCCAAAAGCGCTTTCTGCAATGTGGTGCGGTGGAGGACGCCATAAGGAGCACCCCAGCGTTCCCGCGCGAATTTGCCGGCCGGCACCGCCGCGAGTTGGCGCAGCGAACTGCCCGATATCAGCCGGATCGCGTCCGGCTCGAGCCAGACCTTTGACAGCCCGTCAAGGATACCGAGTTCGGCAAGGATGCGGGAGGCGTTCGGCGAAACCTGCAATCCGGCGCCGATGTCGGTGAGTTCGCCCGCCTGTTCGAAGATCTCCGAGCTGATACCCCGGCGCGAAAGCGCAAGGGCAGCGGTCAGCCCTGATATCCCGGCGCCGATGATGGCGGCATGTTCGATCGGCATTGCCCGTCCGATCCCGGATCTGGACTGACTACGCCGCCTTCACGTGGAAAACGCAGCCGGCCGGATTGGTTTGGCGGGGCTTGAGCGCGGAATTGAAGCGATAGAGCGTCGAGCAGTAGGAACAGACCTTCTCGTTGTCGTCGCCCATGTCGATGAAGATATGCGGATGATCGAAGGGAGCCGAAGCGCCGGTGCACATGAATTCCTTGACGCCGACTTCGATAACGCGGTGACCGCCGTCGTTCTGGAAGTGGGGAATGTTGTGGCCGGCCATGTCGCTCTCCGAATGCTTTGAAATGTGCGCGAACCTTATAAGCCTTCGCCGCAAATGTGTAGAGCCAAACCGCCGCGCCGGGCACAGTTTTTAGCCACAGTTTTTGGCTTCACCATGAAAGGCCGCTCGACTATGGTCGCGCCCAAAAAGGAAGGCCCGATGAACCTGAATACGCCCACATTTTCGAGCTTCACCCATGACGGACTGCAGCTCGCCTTCTTCGATGAAGGCGATCCGGCCGGTGTGCCCGTGTTGTTGATCCACGGCTTTGCCTCGACCGCAAACGTCAACTGGGTGCATCCGGGCTGGCTAAAGACGCTGGGCGATGCCGGCTACCGGGTGATCGCCATGGACAATCGCGGCCACGGCGCAAGCGACAAGCCGCATGATGCCGAAGCCTATCGTCCCTGGGTCATGGCCGGCGATGCGATCGCCTTGCTCGATCATCTCGGCATCCCGGAGGCCAATGTCATGGGCTATTCGATGGGCGCGCGCATTTCCGTTTTTGCCGCGCTTGCCAATCCGCATCGCGTCCGTTCGCTGGTGCTCGGCGGCCTCGGCATCGGCATGACCGACGGCGTCGGCGACTGGGACCCGATCGCCGATGCGCTGCTGGCTCCTTCGCTGGAGGCGGTGACGCATGCGCGCGGCCGTATGTTCCGCGCCTTCGCCGAGCAGACGAAGAGCGACCGGGTCGCTCTTGCCGATTGCATCCGCGGCTCGCGCGATCTGGTCGCCCGCTCCGATATGGCCAAGCTCGATATGCCGACGCTGATCGGCGTCGGCACCAAGGATGATATCGCCGGCTCGCCGCAGGAGTTGGCGGCGCTGATGCAAAATGCCGAAGCACTCGATATTCCGGGCCGCGATCATATGCTCGCCGTCGGCGACAGGGTTTTCAAGCAGGCGGTGCTGGCCTTCTATGCAAGGGTCGCCCATCGCTGACAACATCGTGATGCTGAAAAAACCATGCTAAAAAACCATGGCGACGGCACCCATTTATGTTATTGGCGTTTTCCTCTATATATTGGCATTCCGAAAACGGCATTCCTTCCGGCAACGAGGAGAGCGGCGATGGTCGCAAAGACTGACATCCGTGCTTTTGACACAGGCCATCCGGTGAAGGTGATGGATCCCATCTGGGACAGCCTGCGCGAGGAAGCACGGCTCGCCGCCGAACGGGACCCGGTTCTCGCCGCCTTCCTCTATTCGACCGTGATCAACTACCATTCGCTCGAGGAATGCGTCATCCACCGCATCTGCGAACGTCTCGATCACCCCGACATGCAGGCGAACCTGCTTCGCCAGACCTTCGAGGAAATGCTCCTCGACTGGCCGGACTGGAGCTCCATCCTGCGCGTCGATATCCAGGCGATCTATGACCGCGATCCCGCCTGCCTGCGCTTTATGGAGGCGGTGCTTTATTTCAAGGGCTTCCATGCGCTGCAGACACATCGTCTCGCCCATTGGCTGCTGAACCGCGGCCGGCGTGATTTTGCGCTCTATCTGCAGAGCCGCTCCTCCAGCGTCTTCCAGACCGACATCAACCCGGCCGCCCGTATCGGCAAGGGCATCTTCCTCGATCACGCCACCGGCCTCGTCGTCGGCGAGACGGCCGTCATCGGCGACAACGTCTCGATCCTGCACGGCGTCACACTCGGCGGCACCGGCAAGGAGGGCGCTGACCGCCATCCGAAGATCGGCTCCGGCGTCATGATCGGCGCCGGCGCGAAGATCCTCGGCAATATCGAGATCGGCTACTGCTCACGCGTCGCCGCCGGCTCCGTCGTCCTGAAGGCGGTGCCGCCCAAGAAGACGGTGGCGGGCGTGCCGGCCAAGGTCGTCGGCGAGGCCGGTTGTTCCGAGCCGTCGCGCAACATGGACCAGGTGATCGGCGCCGATATCTGAGCGCCCATTGGAAACAAGGATAGGAAAAGTCGGCGGGATCCGAGCGGGGAGGGCCATGAGCAAACGGCAACCTTGCCTTTACACCGCTGCATTTCCTGTGCAAGAAGCGGCCAATCAAGACCGCTTACGGAGATGACAGAGTGAAGCCAGAAGAAATCAAGAAGCTCGACGCCTATTTCAAGCGCATGTTCAACCCGCAGATGATCGTCAAGGCGCGTCCGCGCAAGGATGATTCTGCGGAAGTCTATCTCGGCGAAGAATTTCTGGGTGTCGTCTATATCGATGACGAGGACGGCGACCGCTCCTACAACTTTTCGATGGCGATCCTCGACGTCGATCTCTGATCGCGCTCGAAAGCTTCAAAAGGACCGAATGGCGCGATCCGTTCGGTCCTTTTTTCGTTTTTTCGATTCGGAACGAGAATTTATTCAAAACTTCCGGTTGCGGCATTCTGATATATTTTAGCAAAATCAATTAAAGACGTACGTGCCCCGGCGCGCCGGCTTTGTTATATTATTGTAATGCAACTACTGGATGTTTTGCAATGCACAAGACTACTTGACTATTTGTGCATTGCAGCTACCCTGCCGCCACCAACAGCCCAACGGAGGATGGCTCATGTTCAACATTGAAGACGCCAACAAGAAGAGCAAGGAAGCCGTCGACACGGCCCTGAAAACTTATTCCGACACGACCAAGGGCTTTCAGGCGATCGCCGCTGAAGCCACTGAATATTCGAAGAAATCCTTTCAGGACGCGGTGACGCATTTCGAAACGCTGGCCGGCGTCAAGAGCTTCGAGGCCGCTTTCGAGCTGCAGACGAACTACGTCAAGGCGTATTTTGAAGGCTTTGTCTCCGAGACGACGAAGCTCAGTGAGATGTATGCCGATCTCGCCAAATCAGCCTATAAGCCCTATGAAGCGCCGATCGCCGCTGCCGTCGTCAAGACCGCCAAGTCGGCGACGCCTGCTGCTGCATGAACTGATTTCAAAGGCGCAACCTGCGCTACATATTGAAAAACGAGGACCGGCTACGCATCTGTAGCCGGTCTTTTTGTTTCCGCTTTCGCTGCAGCGCCCGCATGGGATCGACCAGGGACTTTTTTGGTCTTTCCCGTGCATGCTGGCCATTTCTTGATTGCAGTGTCTGACAGGGGGCTTAAAATCAGCCTATTATGAACTAAGTTAGTGTTTCAGATATTCGGCGGGAAAAGCACCCTCCCATGCTCCGCCGGATTGCTTGAGGAATGAATGACAATGATCGCAAAGCCGATCCGGATGCAGAACGACAGCGAAAGGAACGGGGACAACGCAAATCGAACCTCGGTCATCACACGCACCAAGCCGAAGACCAAGAAGCCCAATCTTTATCGTGTGCTGCTTTTGAATGACGACTACACTCCCATGGAATTCGTCATCCATATTCTGGAGCGGTTTTTTCAGAAGGATCGTGAAAGTGCCACCCGCATCATGCTCCATGTCCATAACCACGGCGTCGGCGAATGCGGAATATTCACATACGAGGTAGCGGAAACGAAGGTCAGCCAGGTGATGGACTTCGCCCGGCAACACCAGCATCCGCTGCAATGCGTCATGGAAAAGAAGTGAGGATCTGAACGTGCCAACATTTTCGCCTAGTTTAGAGAAGGCGCTCCATCAGGCACTGACCTTTGCCAACGAGCGGCACCACGAATATGCGACGCTCGAGCATCTGCTGCTCGCCCTGATCGACGATGCCGATGCGGCCGCGGTCATGGGTGCCTGCAATGTCGATCTCGACGCGTTGCGCAAGACGCTCGTCGAATATGTCGATAACGAACTTTCCAACCTGATCACCGGATATGACGAGGATTCGAAGCCGACCTCCGGCTTCCAGCGCGTCATCCAGCGTGCCGTCATCCACGTGCAATCGTCCGGCCGTGAAGAGGTGACCGGCGCTAACGTGCTGGTCGCGATCTTCGCCGAGCGCGAAAGCCACGCCGCTTATTTCCTGCAGGAGCAGGAGATGACCCGCTACGATGCCGTCAACTATATCTCCCACGGGATCGGGAAGCGGCCGGGCGCTTCGGATGTGCGTCCCCCGCGCGGCGCTGAGGACGAAGCCGAAAGCAGCAAGCCGACGGCGCGCGGCGGCGAGGAAGACGGCGGCCCGAAGAAGCAGCAGGATGCGCTCAAGGCCTATTGCGTCAATCTCAATGAGAAGGCCAAGGGCGGCAAGATCGATCCGCTGATCGGCCGTCACGCCGAGGTCAGCCGCACAATCCAGATCTTGTGCCGCCGTTCGAAGAACAATCCGCTCTATGTCGGTGATCCCGGCGTCGGCAAGACGGCGATCGCCGAAGGCCTTGCCAAGCGCATCGTCGAAGGCAAGGTTCCGGAAGCACTCGCCGATGCGACGATCTTCTCGCTCGACATGGGCACGCTCTTGGCCGGCACGCGCTACCGTGGCGACTTCGAAGAGCGCCTGAAGCAGGTCGTCAAGGAACTGGAAGAATATCCGGGCGCCGTGCTCTTCATCGACGAGATCCACACGGTGATCGGCGCCGGCGCCACTTCGGGCGGCGCAATGGATGCATCGAACCTCCTGAAGCCGGCCCTGTCATCGGGCGCGATTCGCTGCATCGGATCGACCACCTACAAGGAATACCGCCAGTTCTTCGAGAAGGATCGGGCGCTGGTCCGTCGTTTCCAGAAGATCGACGTCAGCGAGCCGTCGATCGAAGATGCGATCGAGATCATGAAGGGCTTGAAGCCCTATTTCGAAGAGTATCACCACCTGCGTTATTCGAACGACGCCATCAAGTCGGCCGTCGAATTGTCGGCCCGCTACATCTCCGACCGCAAACTGCCCGACAAGGCGATCGACGTGATCGACGAAACCGGTGCGGCGCAGATGCTGCTGCCGCCGTCCAAGCGCCGCAAGCTGATCACCGAAAAGGAAATCGAGGCAACGGTCGCGACGATGGCGCGCATTCCGCCGAAGACCGTCTCCAAGGACGATGAAGCCGTGCTTGCCAATCTCGAGAAGGAACTGCGCTCGGTCGTCTACGGCCAGGATATCGCCATCGAAGCGCTTTCGACCTCGATCAAGCTGGCGCGCGCCGGTCTTCGTGAGCCGAACAAGCCGATCGGCGCCTATGTCTTCTCCGGTCCGACCGGCGTCGGCAAGACCGAGGTGGCAAAGCAACTGGCATCGTCGCTCGGCGTCGAACTCCTGCGCTTCGACATGTCGGAATATATGGAGCGGCACACGGTTTCGCGCCTGCTCGGCGCGCCTCCCGGCTATGTCGGCTTCGACCAGGGCGGCCTTCTCACCGATGGCGTCGATCAGCACCCGCATTGCGTGGTTCTGCTCGACGAAATCGAGAAGGCGCATCCCGACATCTACAATATCCTGCTGCAGGTCATGGACCACGGCACGCTGACCGACCACAACGGCAAGAAGATCGACTTCCGCAACGTCATCCTGATCATGACGACCAATGCGGGTGCATCCGAAATGGCCAAGGCGGCGATCGGCTTCGGCTCGTCCAAGCGCACCGGCGAGGACGAGGAGGCGCTCACCCGCCTGTTCACGCCGGAATTCCGCAACCGTCTCGACGCGATCATTCCTTTCGCGGCGTTGCCGACGGCCGTCATCCACAAGGTCGTGCAGAAGTTCATCATGCAGCTGGAGGCCCAGCTTTCCGAAAGGAACGTCACCTTCGACCTGCACGAGGATGCAATCGCCTGGCTGGCGGAAAAGGGTTACGACGAGAAGATGGGCGCCCGCCCGCTTGCTCGCGTCATTCAGGATACGATCAAGAAGCCGCTCGCCAACGAAATCCTCTTCGGCAAGCTGAAGAAGGGCGGCGTCGTCAACGTCACTGTCGGCCCGAAGGAAGACGGCAAGCCCGGCATTGTGCTCGAAGCCATTTCGGAAACGGCGCCGATCAAGCCGAAGCCCGAAGCCGAGGTCGTGCATCCCGAAGGCGATGATGGGGATGACGGCGAGCTGAAGACGAAGGCGGCCCGCAAGACCCGCGCCAAAGCAGTGCCGCAGGCCGAGCCCGAGGTTCGCGACGCCCCGAAGAAGGGAAGCGCGGTTCCGAAGGTTCCACGCAAGAAGTAAGATACCGTCACCGAATTGGAAAAGGCCGCGTCACTGCGGCCTTTTTCGTTTTCGGATCTGCATGCGCCCTTTTCAGAGCCTGGTGTAGTCGATCTCGAGAAAGTCGGCGACCTCGGGCACCCAGCGGTCGCGCACGAAGGCGACATGCAGGGGATGGAGGTTGTACGTGTCGTAGCCAGCCTGATCGTCGAATTCCATCGAAAAGCCGAAGGCAAAAGCATTCTTTGGACTCGTCTGCCGAAGCTGCTCGAAATTCCGCACGCTCGGGATCTTGGCAAGCACCAGCGCGTCGGTGAGGAACGACGTTTCTTCAGCCGAGCCGGCTTCGTGTTTCAGACGGAATGCGACAGTATGGCGGATCATGATCTTGTTCCAAATGATTGTCCGGATTGAATTCTCTGAAATCGACGGGTGTTGACCCTACGCCAGTGCGGCCCTGATCTTTTCGGCATTGGCGGCGAGCACGGCGCCGTCCTCCATCTTGCCGGAATGCGGCTTCAGCGCCGCGCCCTCATGCCGCGGGATGACGTGGAAATGCAGATGGAACACCGTTTGCCCGGCGGCCGGCTCGTTGAACTGGGCGATGAATACGCCATCGGCGTCGAAGACGTCCTTCACCGCATTGGCGATCTTCTGGACGACAGTAATTGCGTGGGTGAGGGTGGCTGGATCGGCATCGAGGAGATTGCGGGACGGCGCCTTGGGAAGGACGAGCACATGGCCCGGCGCTTGCGGCATCACATCCATGAAGGCGATGGTATGCTCGTCCTCGTAAACCCGGTGCGAGGGGATTTCGCCGCGGAGGATCTTGGCGAAGATGTTGTCGTCGTCGTAATCGGCCTGCCTAGTCATCGCTAAATCTCCTCGTTTTTCGCAGGGCGTGCGTCTTTGCAAACGTGCACAGGACGCCGTAGCACTTTGAATTACTGCATAATTTTATTCCTAAATCGATTCCGATTTAGAGAGTTATGCGTTGATCGGGTAACGCGGCCGGTGCGGCGGCGTCAATCCTCCTGCAGGCGCTCGCCCTTGCGGAAAGGACTGTGTTCGGTCAGCAACTCGCTCATCTGTTCGACATCGGCGCGCTCTCGCGCGAGATAGTCGCCGATCGCCCGGCGAAGGCCGGCATGCGCGACATAATGGGCGGAATGCGTTGTCACCGGCAGGTAACCGCGGGCGAGCTTGTGTTCGCCCTGCGCTCCGGCCTCGACCCGTTTCAGCCCTTTCGAAAGGGCGAAGTCGATCGCCTGGTGATAGCAGACCTCGAAATGCAGGAAGGGATGATCCTCGATGCAGCCCCAGTGACGGCCGTAGAGCGTATCGCCGCCGATGAAGTTGATCGCGCCGGCTATATAGCGCCCGTCGCGCTTGGCCATGACGAGCAGGATATCGTCTGCCATGCGTTCGCCGATCAGCGAATAGAACTTACGGGTGAGGTAGGGCCGGCCCCACTTGCGCCCGCCGGTATCCATGTAAAACTTGAAGAACTGGTCCCAGATGCGTTCCGTCAGGTCGCGGCCGGTCAGCCAGTCGATGCTGATACCGTTTTCGAGTGCGGCGCGGCGCTCCTTGCGCAATGCCTTGCGTTTGCGCGAGGCGAGCGTTTCGAGAAATTCCTCGTGATTGGCATAGCCGTCATTGATGAAATGGAACTGCTGGTCGGTGCGGTGCAGATAGCCATCCATCTCGAAGACGCCGATCTCTTCGTCCGGCACGAAGGTGATATGGGCCGAAGAGATGCCGAGCCGGCGCACCACCTCCTTCAGGCCTTCGGCGATCGCGCTCTGGATCGGCAGCCGCTGCAGCCCCTCGGCGACGAGAAGACGCGGGCCTGTCGCCGGCGTGAACGGGATCGAGCACTGAAGTTTGGGGTAATAACGCCCGCCGGCCCGCTCGAAGGCGTCGGCCCAGCCATGGTCGAAGACATATTCGCCCTGGCTGTGGTTCTTGAGGTAGCCGGGCAGGGCGCCGATCAGTTCGCCGCGCCCGGTCTCGAGCAGCAGATGATGACCGAGCCAGCCGCTCTCGGCGTCGGCCGAGCCCGATTCTTCCAGCGACGATAAAAAGGCGTGCGAAACGAAAGGGTTGTAGGCAATCGTAGCGCAGGTCTTCGACGCCCCGGAAAGCCTGGACCAGCTCTCCGGGGAAATCGCGGTGAAGGAGCGTTCTACGCGAATGGATAATTCATCAGTCATGGGACAAATGCGAAACCTGTGGGAGGATAGTCTCGGCTCTGCCTTACAGCGCCGTGCGTCTTTCAGACGCGCAAAGGACGCTGTAACACTTCGAGTTCCTGCATAATTTTATCCTGAAATCGATCCGATTTAAGGAATTATGCAGCAGTCTGCGCATGATCTTGGCCTAAAAGCGAGCGTCAAGCAGCCACACGCGGATCGAAGCCTTCGAAGGTCATCTGGTCTGCGTTTGCGAATGTATGCCGGCGCGCCTCTTCGTCACGCACCGTCCAGGTAATAACGGGAATGCCCTTTTCGCGTTCGCCGGTGATAAAGGCGTTCGGCAGGTCGTCATAATGATAGGAGATGAAATCGAGACCGATCTCCATCGCCTCCGCATGTGTCTTGAACGCCTCCGGCGTGTTGCCGTTAGCCGTCAGCCCCAGCGGGTAGGGTGAGCCAAGCGCCTTCAGATCGCGCAGTAGCCAGTGATCGAAGCTCATCAGCGCTGCCTTGCCCTGATAACCTTCGAGGACCTCGAGCACAGCTTCGGCGAAACCTTCGTCATCGGCCTCGCGCCCCTTGAGCTCCAGCACCAGCGGCACCTTGCCCTCGACGAGGTCGAGGAGTTGGCGCAACGTCGGCACCTTGTCGCTGGTGCCGCCGACGGCGATCAGTCCGAGTTCCCGGGAGGTGCGCTCGCGGATGTCGCCGTGGAGGTTGCACAGGCGCTGCAGATCCTCGTCGTGAAAGACGACCGGTACGCCGTCGGAGGCGTAGTGCAGGTCGCATTCGATCGCAAAGCCCGCTTCGACGGCGCGCGAGAAAGCCGAAAGCGTGTTTTCCCAGACCTGCGTGTTGAGGTCGTGATAGCCGCGATGGGCGACCGGCAGGTCCCTGATCCAGCCAACATTGGTCATTCTGCGATTTCCATGATAGCGTCTATCTCGACGGCGGCATTCAGCGGCAAGGCCGCCATGCCGACGGCGGCACGCGCATGTTTGCCGGCTTCGCCAAGCACGCCGGCAATCAGGTTCGAAGCGCCGTTGATGACGAGATGCTGCTCGATGAAGTCGGGCGCCGAGGCGACGAAGCCGTTCAGCTTGATGACACGCCGGATGCGGCCGAGATCGCCACCAAGTGCCGCCTTCGCCTGGGCAAGGATGTTGATGGCGCAGAGTTCGGCGCCGCGCTGGCCCGTTGCAACGTCGATGGTCTTACCGAGATGGCCCGAGACGGCGACCTTGCCGCCTTCGAGCGGCAGCTGGCCGGAGATGTAGAGAAGATTGCCGCTGATGACATAGGGAACGTAATTTGCAGCAGGTGCTGCGGCTTCGGGCAGGGTTATCCCCATCTCAATCAGGCGCGCTGCAATTTCATCGGACATTTTCGTCTCCGCTTTTGTTGTCAATTCTTCAAAAATTATGCATCAAGACTAGAATCTTTAATATGACAGTTTCGAGCCTCGATTTGGCCGAAAGCGTTCTTATAACATCGCGACCGAGTCCAACAGGAGTTAATGAATGTTCCGATCGAGTCTTGTCGCTCTGCTTCTCGCCAGCGTTTCCGCCAATGCATGGGCGGCTGCGCCCGCGGTGAGCGCTGCGATCGCGACCGGCCTCGTCGCCCATCGCGCGGTCTACGATCTGGAACTGAAGGACGCTTCGGACCGCTCCGGCATCGCCGGCATGTACGGCCGCATGGTCTATGAGTTCGACGGCAGCTATTGCCAGGGCTTCACCACCAACTTCCGCTTCGTGACGCAGATCGACACCGGCGACAGCGTGCGCGTCAGCGACCAGCAGACCAAGACCTTCGAGAACCTGAAGGACGGCAAGTTCACCTTCGACACCAAATCCTTCACCGACGAGCAGCTCGATAAAGAGGTCAACGGTGCGGCGCAGGACCAGCCTGATGGCGTCAAGGTCGATCTCAAGCAGCCGGCGAGCCGCGAGCTGCAGCTTTTAGAAAGCCGTTTTCCCACGGAACATATGCTCGATGTGATCCAGCACGCCAAGGACGGCAAGCGCTTCTTCGAGGCGCGCGTCTTCGACGGTTCCGACGACGGCGACAAGTCGCTGGCGACGACGACGATCGTCGGCAAGCAGGAGACGCCGATCGCCGAGGAGGCCGATGCCGGCAATGCCGGCGCCTTCTCCAAGACTGCCTTCTGGCCGGTGACGATCGCCTATTTCAACGAAAGTGCGAAATCGGATGCTTTGCCGGTCTACCGCATGTCCTTCAAGCTCTATGAGAACGGCATTACCCGCGACCTGACGATGGATTACGGCGATTTCGTCCTGACCGGCAAGCTCGCCAAGCTGGAGCTGCTCGACCGCAAGGCCGAGGTTTGCAAGTAGGCGAAGGGCTTTCGCCTGTCATAAAACTGTATCATAAAATTCACTATGGTTCGGCGATCAGCGATGACTGAGTCGCCGTCTGTCGTCCTGCTTTTCGGGTCGGTGCGATCCTGGCGGCCAAGCCATTGCGATGACGTCTATCTGCTAGACTGCGTAGCCATTACGCATCGCAAGCTTGAAAGCCCGAACCATGGCCGACACCGATCTCGCGACTGTTCAGAATGCCGCCCTGCCGGTTGTTGTCGCCGATCCGGCGGAAATTGCCCGCATTTCCGATAGCATCAATATCACCGACCGTGCCGGCATCTCGGTCTATGGCGACCGCGTCCAGCAGGCGGTCAGCGATTATGCCGACAGGATCCTGCGCGAAGTCCGCAACAAGGACCTCGGCGATGTCGGCCGCCTGCTGACCGATATCATCCTCAAATCGAAAAGCCTCGATCCGGCATCGCTGAAGGATAAGGGTTTCCTCAGCCGCATGTTTCTCTCGGCCAAGGCCCGGCTCGAACGCTTCAAGGCGGAGTTCGAGGACGTGGCCGGCCAGATCGACCGGATCGGTCTGGAGCTCGACCGTCACAAGGATACGCTGAGGCGCGACATTGCGTTGCTCGACGACCTGCATGAGGAAACCAGGCAATCGATCATGCGGCTCGAGGCTTATGTTCAGGCCGGCAAGACTTTCGCCGAGCGTTTTCGCAACGTCGAGCTGCCGAGGCTGAAGGCGCAAGCCGAGGCGGCAGCGACCGGCCCCGGCGGCGGCATGCTGGAGGCGCAGACCTATCAGGACAGCCTGCAGGCGCTCGACCGGCTTGAGAAGCGGGTGTTCTACCTGCAGCAGGCCCGCCAGCTCGGCATCCAGCAATTGCCGCAGATCCGCATCGTTCAGGCGGGCGACGAGACGCTGATCGAGAATCTGCAGGCGACTTCGGCGCTGACGGTGCCGGCCTGGAAGCAGAAGATGGTGATCCTGCTCGGCCTGACGCGGCAGAAATCGGCGCTCGACCTGCAGAAGGCGGTGACCGACGCCACCAACGACATGATCCGCCAGGCATCCGAGATGATGAAGGACCAGGCGATATCAATCGAACAGCAGTCGCAGCGCGGCATCGTCGATATCGACACGCTCGCCAAGGCCAACAGGGATCTGATCGATACGATATCAGGCGTGCTGCAGGTTCAGGAGGAGGGGCGCCGCAAGCGGGCGCTCGCCGAACAGCAGATGGAGCAGATGACGATCGAACTCAAGAAGGCGATGACCCAGGCGTGATCGGCAAGACGACGATGCGAGCACTGCTTTCAGGCCTGGCACTTCTTGCCCTTCTGCCCCTTGCCGGTTGCAATCCCTTCGGCCAGGGGCCGGACTTCTCGATCGTATCGGGATCGGAGAACACCGTTCTGCAGCCGATCGTCGAAGAATTCTGCAAGCAGAAGAACGCCACCTGCACCTTCAAATATGAAGGCACGCTCGATATCGGCCTGGCGCTGCAGAGTGACCAGGGCGTCGCGCAGGATGCGGTCTGGCCGGCCTCCAGCGTCTGGGTCGACATGTTCGACACCAAGCGCCGCGTCAAGAGCCTGACCTCGATCGCCCAGACGCCGGTGGTCTTGGGCGTGCGCAGGTCGAAGGCGCAGCAGCTCGGCTGGATCGGCAAGGACGTCTTCATGAAGGACATTCTCGCTGCCGTCGAGAGCGGATCACTGAAGTTCCTGATGACCTCGGCGACGCAATCCAATTCGGGCGCCAGCGCCTATCTTGCCATGCTGTCGAGCGCGCTCGGCAACAAACCGGTGATCGAACCCGGCGATCTCGACGACAGACACGTCCAGGAGAGCGTCCGATCGCTGCTGTCAGGTGTCGTGCGCTCTTCCGGCTCTTCCGGCTGGCTTGCCGATCTCTATGTCGAATCCGCCGGCAAGGGCACGGTCTATGACGCGATGTGGAATTACGAGGCGGTGCTGAAGGAAACCAACGACAAGCTCGCCGCCCTGTCGCAGGAACCGCTTTACGCGATCTATCCGGCCGATGGTGTGGCTATGGCGGATTCGCCGATCGGTTTCGTCGATCATGGCCGCGGGCCTGAAGTCCAGACCTTTTTCAACGATCTGCTCGCCTATCTCAGCTCGGCCCCCGTGCAGCAGCGCATCGCCGATACCGGCCGGCGCATTCCGCTGACCGGCGTAGCCGCAAAACCGGAGTCGAGCTGGAATTTCGATCCCGCCCGGCTGGTGACGGCAATCCGCATGCCGGAGGCGGGCGTCATCCGCCAGGCGCTCAACCTTTATCAGGCCGCGCTGCGCAAACCGTCCCTGACCGCGCTCTGCCTCGATTTTTCCGGTTCGATGCAAGGGGAGGGCGAGGACCAGCTGCAGAAGGCGATGCGTTTCCTGTTGACGCCCGACGAGGCGAGCAAGGTGCTGGTGCAATGGTCGCCCGCCGATCAGATCATCGTCATTCCCTTCGACGGCAGCGTGCGCAACACTTTCATGGCGAGTGGAAATCCGCTGGAGCAGGAAGGGCTGCTGAACGAGATTTCCCGGCAGAAGGCCAATGGCGGCACGAACATGTATGCCTGCGCCGAACGGGCTCTGCAGCAGATTGCCCGAACCGACAAGCTCTCGACTTATCTGCCTGCCATCGTCATCATGACCGACGGCAAGTCCGATGATCAAAGCCAGGCCTTCACCAGCGAATGGAACGCGATGGAGCCGCATGTGCCGATCTTCGGCATCACATTCGGCGATGCCGACAAGACCCAGCTCGACAGCCTCGCCAAGCTGACCTCGGCGCGCGTGTTCGACGGCGGTTCGGATCTCGCCACCGCCTTCCGAACTGCGCGAGGCTACAATTAGGACAGGCATGCGCAACTGGCTCGGCAATGACGGCAACTGGATCGTGGCGGGACTGGCGGCGGCAATCACCGTGCCGCTCTTGAGCTTTGCCGCCGGCATGCCCTTCTGGATCGCCATCATCATTGCCCTTCTGGTCTTTGCCGGCCTCGTCATTCTGCTCGCACCGCGCCGGCTATTCGAAGGCCTCGATATCAAGAGCATCGGCAGCGGGCGCGTCGCCTTTGCCCGCGACCTGCTGGAGGCCGCCGTTCCCTTCGCGCAGAGGCTGGAGACTGCCGCCGACACGATAAACGATCGTCAGATGGCGGCCGCAGTCCGGCATCTCGCTGAAATTGCCGCCGATGTCTTTCGCAAGGTCGAGGCCAAGCCTGAGAGCGCCAATGCAGTACGGCGGTTCCTCTCCTACTATCTGCCGCGTGCCGCCGAGGTGGCGGAAGGTTTTGCCGTCATCGAGGCCAAGCGCGTTCCCGACCCCAAGCAGCTGGAGGAGGTGCGCGGCGTGCTGGTCAAGCTCGAAGAGGCCTTCGTCCATTACGCCGACAGCCTGGTCGATGAGGAACTCGGCACGCTCGACACCGATCTGCGCCTCATCCAGGCATCGCTCAAAGAGGATATTGGACGCTGATGGCCCTTTCTCGTCGCGCCTTTGGAGTGGGACTGCTCGGAGCAGGCGTCGTCGGCACCGGCGGTTATTTCGCCGTCCGGGATCGGCCGGAATTGCAGGGCCTGCTCGGCAGCCGCACGACGCTGTTCGGCTTCGTCGGCGGCGAGAAGGAGGCCTTCCTGGCCGACCCCGACGTCATCAGGGCGCTCGGCGGATACGGATTGACGGTGAACAGCCGTGTCGCCGGTTCCGTCGAGATGGTCCGCGAACAGGCGCTGCTGTCGCAGCATCCGCAATTCCTCTGGCCGTCCTCCTCGATCATGGTCGATATCGCCAGGCAGAACGGCATATCTATCCGCAACGACCGCGTCGTGCTGAACACGCCGATCGTCGTCTATTCCTGGCAGCCCGTCGTCGATGGACTGATGAAAGCGGGATTGGTGACGGTGACGAACGAAGGCCATCATCAGCTCGACCTCAAGGCTTTGCTTGATGCGATCCTTGCCGGATCCGACTGGTCGAAACTCGGCGTCAATTCGCTCTACGGCCGCGCCCGCATCGTCTCGACCGATCCCAACCGCTCCAATTCCGGCTTCATGTTCGCAGGCCTGGTGCTCAGCTTGTTCAGCGGCAATGTCGCGACTTCAGGCGATCTCGCCACCTTCGGCGGCAAGGTGCAGGCGATTTTCCGCAACATGGGCTTCAAGTCGCCGTCCTCAGGCAAACTTTTCGACCAGTATCTGGCAGGCGGTCTTGGCGGCGAGCCGATGATCGTTGGCTATGAAAACCAGCTGGTCGAATGGATCCTCGCCGATCCCGCACGCTGGGAGCGCATCAAGGCGAGCGCCGGCGCAAAGCCGGTGGTGCTCTATCCGCGCCCGACCGTCTATTCGGCGCATCCGCTGATCGTCGTCGACGAGAATGCCAACCGGCTGATCGAAGCGCTGGTGAGCCCGAAGCTCCAGGAGCTTGCCTGGACGAGACACGGCTTCCGCGGTCCGCTGGGGACCGCCACCGGCAACGCCGACAGCGCCATCGGCACCCTGCTGCCGGCCGAGGTCGATGCCATCCTGCCGATGCCCGACGCTGGCGTCATGCTGTCGCTGCTGCCGACGCTGGCAAGCTGAGCCGTCAGGTTTTTTCGGCATAAGCCTTGCTTTTGCGCCCAATCGAATGTATGGGCACCGGCATTCCACACGTAAGGCATGGGATCGTCCGGGAGAAATCCGGGCTGTTCCGCCCGGTGGCATCCTCGAAGAGGGTGCTGTTCGCCTTGCGGAGGTTCAACCGGAAAAGGAGTAACAAGGCATGGCATTGCCTGATTTTTCTATGCGCCAGCTTCTTGAGGCAGGCGTCCACTTCGGCCACCAGACGCATCGCTGGAACCCGAAGATGAAGCCGTACATCTTCGGCGATCGCAACAACATTCACATCATCGATCTGGCCCAAACCGTTCCGATGCTGTCGCGCGCCCTTCAGGTCGTCAGCGACACCGTTGCCCGCGGCGGCCGCGTTCTCTTCGTCGGCACCAAGCGCCAGGCGTCCGAGATCATTGCTGACAGCGCCAAGCGCTCGGCCCAGTACTACGTCAACTCGCGCTGGCTCGGCGGCATGATGACGAACTGGAAGACGATCTCCAACTCGATCCAGCGTCTGCGCAAGCTCGACGAGATCCTGAACGGCGAAGCCCAGGGCTTCACCAAGAAGGAACGCCTGAACCTCGAGCGCGAACGCGAAAAGCTGGACAAGGCTCTTGGCGGTATCCGCGATATGGGCGGCACGCCTGACCTGATGTTCATCATCGACACCAACAAGGAAAAGATCGCGATCGACGAAGCCAAGCGCCTCGGCATCCCGGTTGTCGCCATCATCGATTCGAACTGCGATCCGGACCTGATCGACTATCCGATCCCGGGCAATGACGACGCATCGCGCGCGATCGCTCTTTACTGCGAGCTGATCTCCCGCGCCGCCATCGACGGCATCGCACGCCAGCAGGGCTCTTCCGGCCGCGATCTCGGCGCATCCTCCGAAGTTCCGGTCGAGCCGGCTCTCGAGGAAGCAGCCGAAGGCTGATGAAGGCGGGCGGATCGAAAAGGTCCGCCAAATGCTTGCAGAGACTGGGAAAGGCCGCTCGCGACTCATCGAAGTTCGGCGGCCTTGCCTGTTTCAAGGGGAGGGCATCAGGCTCTCCGTCCGATAAGTTTCGTTATGACGCGTCTTCATCACGCTGTCATACATACAGGTACATTTCGTGCCTCAATCCGGTGCCGAGCCGCGCTTTGCGGCCCACCGTATGAACCGACAAGAGGAAGCTAATGAGCGAGATTACGGCTGCAATGGTGAAGGAACTGCGCGAAAAGACCGGCGCAGGCATGATGGACTGCAAGAAGGCTCTTGCTGAAACCGGTGGCGACATGGAAGCGGCGATCGACTGGCTGCGCGCCAAGGGCATCGCCAAGGCCGACAAGAAGTCCGGCCGCACCGCTGCCGAAGGCCTCATCGGCGTTTCGAGCCAGGGCACCAAGGCCGTCGTCGTCGAAGTCAATTCGGAAACCGACTTCGTCGCCCGTAACGATGCCTTCCAGGAACTCGTCCGCGGCATCGCCAAGGTCGCCGTATCCACGGACGGCACCGTCGATGCCGTTGCCGCTGCGACCTACCCGGCATCCGGCAAGTCGGTTTCCGACACGATCAAGGATGCGATCGCAACGATCGGCGAGAACATGAACCTGCGCCGTTCGGTCGCTCTCTCGGTCGAGGATGGCGTCGTCGCCACCTATATCCACAATGCTGTTTCCGACGGCCTCGGCAAGCTCGGCGTTCTCGTCGCGCTGAAGTCGACCGGCGACAAGGAAGCCCTGAACGCCATCGGCCGCCAGGTCGCCATGCACATCGCCGCCACCGCGCCGTTGGCGATCCGCCCGGAAGAAGTCGATGCCGCCGTCGCCGAGCGCGAGCGCAACGTCTTCATCGAGCAGTCGCGCGCTTCCGGCAAGCCGGACAATATCATCGAAAAGATGGTCGACGGCCGCATGCGCAAGTTCTTCGAGGAAGTCGCCCTTCTCTCGCAGGCTTTCGTCATCAATCCGGATCTGACGGTCGCCGCCGCCATCAAGGAAGCTGAAAAGGCCGTCGGCGCGCCGATCGAGGTTGCCGGCATGGCCCGTCTGCTGCTCGGCGAAGGCGTCGAAAAGGAAGAAACCGACTTCGCGGCCGAAGTCGCGGCTGCCGTCAAGGGTTGATCTTCCAGCCATAATTGGGAAAACACGAAGGGCATCGCGTGACAACGCGGTGCCCTTCGTGTATCGGGCATTCACGGCAATTTACGAGGAGCCAAGATGTCTTTAGAGCCTGTCTATAAACGTGTTCTACTCAAGGCTTCCGGCGAAGCGCTCATGGGTGGCCAGGGTTTCGGGATCGATGTGACGGTGGCGGACCGCATTGCATCCGACATCGCCGAGGCACGGCATATGGGCGTGGAAGTCGGCGTCGTCGTCGGTGGCGGCAATATCTTCCGCGGTGTCGCGGTGGCGTCCAAGGGCGGCGACCGGGTCACCGGCGACCATATGGGCATGCTCGGCACCATCATCAATGCGCTGGCGCTGGCGACCTCGCTGCGCAAGCTGAACATCGATACGGTGGTGCTTTCGGCCATCTCCATGCCCGAGATCTGCGAGAGCTTTTCGCAGCGCGCAACCCTTTATCATCTGTCGATGGGCCGCGTGGTGATCTTTGCCGGCGGCACCGGCAACCCCTTCTTCACCACCGATTCCGCCGCAGCACTTCGTGCAGCCGAAATGGGTGCGGAAGCGATCTTCAAGGGCACGCAGGTGGACGGCATCTATACCGCCGACCCGAAGAAATATCCCGATGCGACTCGCCTCGACCGGCTGACGCACCAGGAAGTGCTGGACAGGGGGCTTGCGGTGATGGACGTTGCCGCTGTGGCGCTCGCCAGGGAGAATTCCATTCCGATCATCGTCTTCTCGATCCACGAGAAGGGTGGTTTTGCTGAAATCTTGACGGGCGGTGGTCTCAAGACCATCGTCTCCGACAACTGATATAAGCTGCGCCGCGGATTCGGCGCAGCCCTCGCTAGAACATGATGATTTAGGCCCGTTTGGCCTGAAAGCTGAATCATGTTCTACATTAAATAGTAAGAGCATGATGTCGCCCGAAAACCGCTGACACTTTTCGGCATCATGCTCTGGACGGGAGCATCGACATGAGTGAAGGTATCGACATCAAGGAACTGAAGCGCCGCATGGACGGCGCGATTTCCGCATTCAAGAGCGACATCGCATCGCTGCGCACCGGCCGTGCTTCGGCCAACATCCTCGACCCGGTGACGATCGAGGCCTATGGTTCGCGCATGCCGCTGAACCAGGTCGCCAACATCACCGTGCCCGAGCCGCGCATGCTCTCGGTTTCCGTCTGGGACAAATCGATGGTCAGCGCCGTCGAGCGCGGCATTCGCGAATCCAATCTCGGCCTCAACCCGATCATCGACGGCCAGAACCTGCGCATTCCGCTGCCGGAGCTGAACGAGGAGCGCCGCAAGTCGCTCGTCAAGGTGGCGCACGACTATGCCGAAAAGAGCAAAGTGGCGATTCGCCATGTCCGCCGCGACGGCATGGACGGCCTTAAGAAAGCCGAAAAGGATGGCGTAATCGGCCAGGACGAGAGCCGGGTGCAGTCGGAACGTGTACAGAAGATGACGGACGAGACGATTTCCGAAATCGACCGCTTGCTTGGCGAGAAGGAAAAGGAAATCATGCAGGTCTAGTGGATCTGTGCCTTTGCCTGGAAAGACCGGACGGGAAATGTCGGAATCTGTATTTGTGACTGTGCCAGAGCATGTTGCCATCATCATGGATGGTAATGGCCGTTGGGCCAAGCAGCGTGGCCTGCCGCGCACGATGGGCCATCGCAAAGGCGTCGAGGCGGTGCGCGAGACAGTCCGCGCCGCCGGTGCGGCCGGCATAAAATATCTGACCCTCTTTGCCTTCTCCTCGGAGAACTGGCGCCGGCCCGAAGCCGAGGTCTCCGATCTGCTCGGCCTGCTCAAGGCTTTCATCCGGCGCGACCTCGCCGAACTCCATCGCCAGAACGTGCGGATCAAGGTGATCGGCGATCGCCACAGCCTGCGCAGCGACATTCTGGGCCTGTTGCTGGAGGCGGAGGAGACGACCAAGGACAACACGTCACTGACGCTGGTCATCGCCTTCAACTACGGTTCGCGCGACGAGATCGCCCGCGCCGTCGTCAGTCTCGCCCGCGACGTCGAGGCCGGTCGCCTGAGAGCACAGGATATTACACCGGCGCTGATCAACGCCCGTCTCGACACGGCCGGCATTCCCGATCCGGATCTTATTATCCGCACCAGCGGCGAAGAGCGGCTTTCCAATTTCCTTCTCTGGCAGGCAGCCTATTCGGAATTCATCTTCCTTCCGGAATACTGGCCGGATTTCAGCCCCGAGATCTTCCGCTCGGCGCTCGAGACATTCGCCTCTCGTGACCGGCGCTTCGGCGGCCTGTCGTCGCAGGCGGCCGCGGTCGGCACCTGATGCAGAGGGAATTGAAGCTCCGCATCGTTTCAGGAGTGATTCTTGCGGCCATCGTTCTTGCCGCCACCTGGTATGGCGGCCTTGCCTTCCGTATCCTGGCGGTCGTGATCGGCCTGCTGATCTACTATGAATGGTCGAAAATGACCGGCATCGCGCGGGATTGGGTCGCCAATGCCGTCGGCTGGATCGGCGAGGCGGTGATTGCCTTTCTGGTGCTTGTCGGCAATTTCGAATTCGCCGCCGGCATGCTGGCCGGCGTCACCGCCGTCGGCATCGCCCTGATCATTCTGCAGGGCACGAGCCGCTGGTTGCCAGTGGGCCTGTTTTATGCCGGCGCCACCGGCCTGGCGCTTGCCGCGATCAGAGGCGATGACCGGCTTGGCCTTTACGCCATGCTCTTCGTCTTTGCTGTCGTCTGGGCAACCGATATCCTTGCCTATTTCGTCGGCAGGGCGCTCGGCGGGCCAAAGCTTGCCCCTTCGATCTCGCCCGGAAAGACCTGGTCGGGTGCAATCGGCGGCGCCGTTTCGGCGGTCGTAGCCGGCGTCGTTCTCGTCCATTTTCTCCTTCCGGGCGCTGAAATCATCGCCGCTGGCGTGGCGTTCGTTCTCTCGGTTTGCAGCCAGTCGGGCGATCTTTTCGAATCGTTCATCAAGCGGAAATTCGGCGTCAAGGATTCGAGCCGTCTCATTCCGGGCCATGGCGGCGTCATGGACCGTGTCGACGGACTGATTTTCGCCTGTTTTTCGGCGTTCTTGCTTGCTGGGCTTTTTTCCCTGATAAAGGGGGCCGGAATGACGTCGCTTGGCGCGGCATTGTTCGGACTCTGACAACGCGGCGGTCTGACGGAAGGGACTCATGGAAACGATGAGCGGCATATACGCATTTCTGATGGGGAACATCGTTACCTTCATCCTCGTGCTGTCACTGCTCGTCTTCGTGCATGAGATGGGCCATTACCTTGTCGGGCGCTGGAGCGGCATTCGCATCCTTGCCTTTTCCGTTGGCTTCGGTCCGGAGATCTTCGGCTTCACCGACCGCCACGGAACACGCTGGAAGATTTCGGTGATCCCGCTTGGCGGCTACGTCCGGTTCTTCGGCGACGAGGATGCCTCGAGCAAGCCCGACACCGACAAGATCGCCGCCATGTCCGAGGAGGACAGGGCGCGCTCCTTTGCCGGCGCCAAGCTGTGGAAACGCGCTGCAACCGTCGCGGCTGGCCCGATCGCCAATTTTCTGCTGGCGATCGCCATCTTCACCATTCTTTTCTCGGTCTATGGCCGCACGATCGCCGATCCCGTCGTTGCCGAGGTCAAGCCCGACGGCGCTGCCGCTGCCGCCGGCATCCTTCCAGGCGATCTGCTGGTCGCCATCGATGGCGGCAAGGTCGAGACCTTCGACGACGTGCGCCGCTATGTCGGCATCCGCCCGAGCCAGAAGATCGTCGTGACGATCGAGCGCGCCGGCCAGAAACTTGATGTGCCGATGGTGCCGCAGCGCGTAGACACGACCGACCAGTTCGGCAACAAGGTCGAGCTCGGCCAGATCGGCATCGTCACCAGCCGGGAGGCCGGCAACTTCCGCCTGAAGACCTATACGCCGCTGCAGTCGCTGCGCGAGGCTGTGATCGAGACCCGCGATATTGTCACCGGCACCTTCAAATATATCGGCAACATCTTCAGCGGAACGATGCGCGCCGATCAATTGGGCGGGCCGATCCGCGTGGCGCAAGCTTCGGGCCAGATGGCGTCGCTTGGAATAGGCGCAGTGTTGCAGCTTGCGGCGGTGCTTTCTGTTTCGATAGGATTGCTTAACCTGATGCCGGTTCCGGTACTTGATGGCGGCCACCTGATGTTCTATGCGGTGGAAGCGGTGAGGGGGAAACCGCTCGGCTCTTCGGCCCAGGAAATTGCATTTCGCATCGGCCTGGCGATGATACTGACATTGATGGTTTTCACGACCTGGAACGACATTGGCTCGTGGATAGGGTAA
Protein sequences of DBSCAN-SWA_4 >NZ_CP054031|848529:939654|857274_857688_+|WP_138329761.1|DBSCAN-SWA MTKNGIRRIATFNRTALICAAAAMALAGCNLTAEQKAAAAAKRAAPTAVVMPATKGEAAEGGLAKYPDGYPNFGAPLTAANVQMSDEQAAELQHQLTALGARRKAGTISEAEYQAKVAEMRRLAAEHGTDTLSEISK >NZ_CP054031|848529:939654|889089_889347_-|WP_138329704.1|DBSCAN-SWA MALSPALKLMVSGVALDPLEAKRAEILARLADTRRELASLTNNDHERDYHEPSIRAQLAWLGLQKATKSELREIIVNAMERDPDL >NZ_CP054031|848529:939654|908437_908617_+|WP_003539245.1|DBSCAN-SWA MSKTNAEALLRKAQRQVANTSGGGHLGGTNFGQGNDAKTASGSGNRPAGKGRPPAGGKR >NZ_CP054031|848529:939654|921250_921601_+|WP_017988219.1|protease|DBSCAN-SWA MIAKPIRMQNDSERNGDNANRTSVITRTKPKTKKPNLYRVLLLNDDYTPMEFVIHILERFFQKDRESATRIMLHVHNHGVGECGIFTYEVAETKVSQVMDFARQHQHPLQCVMEKK >NZ_CP054031|848529:939654|857910_859128_+|WP_026160221.1|DBSCAN-SWA MEEFHKVRRLPPYVFEQVNRLKASARAGGADIIDLGMGNPDLPTPQSIVDKLCEVVQDPRTHRYSSSKGIPGLRRAQAAYYARRFGVKLNPDTQVVATLGSKEGFANMAQAITAPGDVILCPNPTYPIHAFGFLMAGGVIRSMSVEPDASFFEPLERAVRHSIPKPLALVLNYPSNPTAYVATLDFYKDVIAFAKKHDIIVLSDLAYSEIYFDGAPPPSVLEVPGAMDVTVEFTSMSKTFSMPGWRMGFAVGNERLIAALTRVKSYLDYGAFTPIQVAATHALNGDGSDIAEVRNVYKRRRDVMVESFGKAGFDVPPPAATMFAWAKIPEKFRHLGSLEFSKLLVEKADVAVAPGIGFGEMGDDYVRLALVENEHRIRQAARNIKKFMSTADETMHNVISLNAHR >NZ_CP054031|848529:939654|903790_905041_-|WP_173862936.1|integrase|DBSCAN-SWA MDDRHASGLVDDVLVELLRAGIAGNLNDDALEPLVGKRIERYRRLGNTTVVKGSSEWRTLARALCVSELEALARVAERDEGDFTGKPENPMLAAAVEQDEIASEVTDAEFNALTFEQVIQEKERLTAMGLGGRQKSASTLIKYRNTVHDFEHHRRSKKIATVTLEEGEAWRNAMLSEGQLSRKTIGDKLATIRAILGWGQDQCRGKLFPMTPKGTPFDFLEMPTHTKGDSADRTYSLKDARHLLTSARTATRASNRWIPWIIAHTGARVNEITVLEKADIFELEAHWFFHIRVGDGRDTKTHKGRKVPVHRALIKEGFIDWVQAQPDGKLFPGGKNEDQRIREWIKEKVFPHREDMPPPNHGFRHLFEDALFAGVSEKAALYITGRSSGSSADDYGGSDLRLLEIAAQMDKVRSII >NZ_CP054031|848529:939654|925108_926299_-|WP_138329639.1|DBSCAN-SWA MTDELSIRVERSFTAISPESWSRLSGASKTCATIAYNPFVSHAFLSSLEESGSADAESGWLGHHLLLETGRGELIGALPGYLKNHSQGEYVFDHGWADAFERAGGRYYPKLQCSIPFTPATGPRLLVAEGLQRLPIQSAIAEGLKEVVRRLGISSAHITFVPDEEIGVFEMDGYLHRTDQQFHFINDGYANHEEFLETLASRKRKALRKERRAALENGISIDWLTGRDLTERIWDQFFKFYMDTGGRKWGRPYLTRKFYSLIGERMADDILLVMAKRDGRYIAGAINFIGGDTLYGRHWGCIEDHPFLHFEVCYHQAIDFALSKGLKRVEAGAQGEHKLARGYLPVTTHSAHYVAHAGLRRAIGDYLARERADVEQMSELLTEHSPFRKGERLQED >NZ_CP054031|848529:939654|906168_906474_+|WP_003539138.1|DBSCAN-SWA MSAKIYRPAKTAMQSGKAKTHLWVLEYDQESARKIDPIMGYTSSGDMRQQVKLTFETQELAEAYAQRNGIEYRVIAPKDPVRQVVAYPDNFRYTRTQPWTH >NZ_CP054031|848529:939654|898186_898579_-|WP_138329684.1|DBSCAN-SWA MTTTAVGGHEILWRTVLLHSVQEAFGIGFVNEPAKNRVVMIRDARDYIMRPAKGFIDVCNLAGFDPDAVRERLIPRIKAARSPEELAVLKRTKGSPGVVADFPSVLGTGGGSTAHEIPEISFSDMKAPTA >NZ_CP054031|848529:939654|890739_891147_-|WP_138329696.1|DBSCAN-SWA MHLHLAAFVALALISSGAVAAADQKKLTLDDLAKDEPASSAEGGFTIAQLVRASHDDKVILAVIAATGDGILWANTYAGAKGVNLVCPPQNLAFTDTMMRDILERYLVEAPDEGKGQRFVLGNVILKALVHTFPC >NZ_CP054031|848529:939654|928828_929905_+|WP_138329634.1|DBSCAN-SWA MADTDLATVQNAALPVVVADPAEIARISDSINITDRAGISVYGDRVQQAVSDYADRILREVRNKDLGDVGRLLTDIILKSKSLDPASLKDKGFLSRMFLSAKARLERFKAEFEDVAGQIDRIGLELDRHKDTLRRDIALLDDLHEETRQSIMRLEAYVQAGKTFAERFRNVELPRLKAQAEAAATGPGGGMLEAQTYQDSLQALDRLEKRVFYLQQARQLGIQQLPQIRIVQAGDETLIENLQATSALTVPAWKQKMVILLGLTRQKSALDLQKAVTDATNDMIRQASEMMKDQAISIEQQSQRGIVDIDTLAKANRDLIDTISGVLQVQEEGRRKRALAEQQMEQMTIELKKAMTQA >NZ_CP054031|848529:939654|850511_850973_-|WP_003558968.1|DBSCAN-SWA MAEKKSGIDQALIRDLANILNETDLTEIEVEQDDLRIRVSRAGTPQYVQAPIAAPAFAAPAAAAAAAPAAAPSRNPANVVNAPMVGTVYMAPAPGARPFIEVGATVKEGQTLIIIEAMKTMNQIPSPKSGKVTEILVDDGHPVEYGQALVVIE >NZ_CP054031|848529:939654|924176_924467_-|WP_138329643.1|DBSCAN-SWA MIRHTVAFRLKHEAGSAEETSFLTDALVLAKIPSVRNFEQLRQTSPKNAFAFGFSMEFDDQAGYDTYNLHPLHVAFVRDRWVPEVADFLEIDYTRL >NZ_CP054031|848529:939654|896314_896704_-|WP_138329690.1|terminase|DBSCAN-SWA MTRGLKPSTIVPGSSPVTGIPKTPSYLNKEAKAEWRRVAPILALERKVLTDADLATLETYCVHYGALRMAEREIAANGLISDGKRNPAYGILKESSLLLVRCAGELGLTPSARSRASMNEYADDDEGFV >NZ_CP054031|848529:939654|927194_927665_-|WP_138329635.1|DBSCAN-SWA MSDEIAARLIEMGITLPEAAAPAANYVPYVISGNLLYISGQLPLEGGKVAVSGHLGKTIDVATGQRGAELCAINILAQAKAALGGDLGRIRRVIKLNGFVASAPDFIEQHLVINGASNLIAGVLGEAGKHARAAVGMAALPLNAAVEIDAIMEIAE >NZ_CP054031|848529:939654|861013_862816_+|WP_138329755.1|DBSCAN-SWA MADLVDPVQRAFLGVEKSALDNRWVPRLDQAGQNRALAMSQIHGLPDLIARVLAGRGVTVDEAIEFLDPTIRSLMPDPHRLTDCEKAATRLLRAIERGESVAIFGDYDVDGAASSALMFRFLSHFGVRATIYIPDRVFEGYGPNPAAINQLIDNGAQLIVTVDCGSTSHEALAAAAARNIDVVVIDHHQVTHELPPCHALVNPNREDDLSGQGHLCAAGVVFMVLVATLRLLRAAGNKRILAIDLLQWLDIVALATVCDVVPLKGLNRAYVVKGLVAARHQSNAGLAALFRKAGLAGPVTPYHFGFLIGPRINAGGRIGDAALGSRLLTLDDAGEADVIAQRLDELNRERQAMEAIMLQEAEAEALAEYGDGEGASVIVTAHEKWHPGIVGLIAARLKEKFKRPAFAIAFDPSGRGTGSGRSINGFDMGRMVRAAVDEGLLVKGGGHAMAAGLTVERADLGRLRTFFTEKAERTVASLVANETLKIDGAIGASGATLELIDRLEAAGPYGSGHAQPLFAVPAHRVRDARLVGEKHVKVTLEAMDGSRLDGIAFRAADTPLGNLLINSRGASLHVAGSLGADHYQGARRIQLRVCDAAPAK >NZ_CP054031|848529:939654|872472_872676_+|WP_138329739.1|DBSCAN-SWA MVPSAEVADWSGFKVWQCVLCAYVYDEALGDPDGGVAPGTRWEDVPDDWVCPECGARKSEFDMVVVG >NZ_CP054031|848529:939654|877456_878608_+|WP_138329730.1|DBSCAN-SWA MKIGRRRFMTAAACTAASGLLPARGVSAAGAELIHTVAAESPWFCNQVALTAKGDMFFGLPRYPDYDQTPCLAKRGADGKPAAFPGNAWNEWKPGDDGFDSFVYVNSIHVFKEGTVWAVDQGALRADSYPPALSEPHKGAQKLVQLDADSGEVLRVLRFGDDILPKGAKLNDLRVFGDHVYVTDSGLGALIYHDLKTGVTLRRMSGSPEMQAKVEPNMQQGSHQTPKIDMIEVSDDGEWLYAAAPTGPFIRIKTAALRDASLSDDDLAEQVEEYATIARSGGCALDTNGNLYLSELDNKRVTILSPTGETAVLTSDDEFISPDGSFISVDRKLYIPVTQSRRTRLFGNKQDMVKRPWKIYVVDLPETLGGIKLGASLNGPSLP >NZ_CP054031|848529:939654|938520_939654_+|WP_064654224.1|protease|DBSCAN-SWA METMSGIYAFLMGNIVTFILVLSLLVFVHEMGHYLVGRWSGIRILAFSVGFGPEIFGFTDRHGTRWKISVIPLGGYVRFFGDEDASSKPDTDKIAAMSEEDRARSFAGAKLWKRAATVAAGPIANFLLAIAIFTILFSVYGRTIADPVVAEVKPDGAAAAAGILPGDLLVAIDGGKVETFDDVRRYVGIRPSQKIVVTIERAGQKLDVPMVPQRVDTTDQFGNKVELGQIGIVTSREAGNFRLKTYTPLQSLREAVIETRDIVTGTFKYIGNIFSGTMRADQLGGPIRVAQASGQMASLGIGAVLQLAAVLSVSIGLLNLMPVPVLDGGHLMFYAVEAVRGKPLGSSAQEIAFRIGLAMILTLMVFTTWNDIGSWIG >NZ_CP054031|848529:939654|873966_874707_+|WP_138329736.1|DBSCAN-SWA MQTASYTEKEPTRFRRGQSWKINMFGKNSDIAAPDPQAFRLDLNAHQKLESHFHIVDQFQVFIAGSGTIGRDEVRLVTVHYADHHTGYGPLIASEQGLSYLTLRSKTDAGLVYLTTPNVREKLKPTKRRHRTSGAVALSIEPVLRNRTELTVDTIIEEQPGDDGMNCKVFRLGPAMAVQAPDPTGSGGQYLIVLNGSLIHEGQIYQPFSLMFVRFDDPAPTITAGEDGLELMITQFPTEDEWMKSI >NZ_CP054031|848529:939654|867666_868617_+|WP_138329746.1|DBSCAN-SWA MSIVTADTAPAGASSNSVWSKRWQRFRDNKLAMISLVFVVLLMLSCAMAGPIAYLTGIDANATNLLRRFKPPSAQHWLGTDDLGRDVLLRLLYGGQVSIAVGLLATFITGIIGIGIGVTAGYIGGRFDNVLMRMTDCIIALPLIPVLIVLGAVDLTKLGLSQAAATSPAAVFLRIVIIIALVDWTTIARIARAGTITLRDVDYIRAASVSGAKGRYNIFVHILPNIATPLIVAMTLSVGRIILFESTLSFLGFGIVPPTPTWGNMLTNAQQLIISAPMLAVYPGLLIFTTVMAVNFVGDGVRSAFDPRSETSKGSH >NZ_CP054031|848529:939654|871442_872354_-|WP_138329741.1|DBSCAN-SWA MDIRQLAYFVRVAELGSFSRASAFLHIAQPALSRQVRNLETELKERLLLRNGRGVELTEAGERLLDNARGILRQIERTYEDIENSRTGKSGKVGIGLPATISTAIATALIRKLREELSDAQVMLVHGRSNQLQEWILSGRLDMAIMYDAPSSPMLEIHDLVEENLYLIGREGAFKDDKPVHLESIADVPMVIACRPNSTRVLLDSELARLGQKLNIVFELDPLDTMFDLVRDGYGFTVASIRTLASKGPDARHGLELRKIVGPELILSVQLVQPARRLSNRLHEAAFRILRDLSMDLLGGKPT >NZ_CP054031|848529:939654|906815_907655_+|WP_138329667.1|DBSCAN-SWA MDQRAKFERLKALHQGDSAFVIPNPWDAGSARLLASLGFEALATTSAGYAFSKGKLDSFASLGRDEVLENAAEIVGAADLPVSADLEDGFGASPETCAETIRLACETGLVGGSIEDATGNPAAPIYELSQAVERIHAAAEAARGLPFLLTARAENYLWERPDLDDTIERLQAFSAAGADVLYAPGLPDIEAIRTVCAAVDKPVNVVMGLKGRKYSVAELSSVGVRRVSVGGSFARAALGALLRAAMEVKTAGTFGYADDALSAAAVSHLMSREKRQDRL >NZ_CP054031|848529:939654|884370_885336_+|WP_138329717.1|DBSCAN-SWA MDDIDFVLLGLFAADDVNKRTELVLNQKKLIHYCSADTALKIIKNREIWLRNVRVMNDYMEVNHGFNLIKKSLQPPVDTAIETGMNEVRKALDVIHPGIADEAFSRFQTWSPFIQYETYVTCLSEHLDDENEDGRLSMWRNYSSGQAGVGIVINTSMFARTDDALGVYSSPVTYLSDLALEHSLMQVAQRIRERAAFLSTVPRETLIGQYFLLLRTISHGSKHPGFKEEREWRIFHTFGMDELKILKVTSESLGGVPQRILKLPLDGTIEGISIADLVDKILVGPSQYQYVIGMALADELDRAGKKDAYQSIKYSPIPLRT >NZ_CP054031|848529:939654|852562_853720_+|WP_138329764.1|DBSCAN-SWA MFSISKRSEVEPFHAMDVLAEATKRRAAGHPVISMAVGQPSYPAPLAALEAARAALAEGRIGYTDALGTARLKSALAWHYKDRHGLEIDPKRIAITTGSSAGFNLAFLSLFDAGDAVAIARPGYPAYRNILGALGLKVLEVPVTAETHFTLTPQSLEAAQKESGMSLKGVLLASPANPTGTVTGREGLEALSDYCAAQSIAFISDEIYHGLTFAGEEASALELTDEAIVINSFSKYYCMTGWRIGWMVLPERLVRPIERVAQSLYISPPELSQIAATAALSAGAELDRYKASYAANRDLLMQRLPQIGLSIASPMDGAFYAYLDVTRFTNDSMGFAKRMLAEIDVAATPGLDFDPLEGHRTLRLSYAGSEAEIAEAVERIAAWLK >NZ_CP054031|848529:939654|915434_916625_+|WP_138329654.1|DBSCAN-SWA MNDKDSLLQNAGINTRLTHIGNDPFDYHGFINPPVVHASTVLFPNARAMETRTQKYTYGTRGTPTTDALCEAIDALEGSAGTILVPSGLAAVTIPFLGFVAAGDHALVVDSVYGPTRHFCDTMLKRLGVEVEYYHPEIGAGIEALFRPNTKLVHTEAPGSNTFEMQDIPAISAVAHRHGAVVMMDNTWATPVYFRPLDYGVDISIHASTKYPSGHSDILLGTVSANAEHWERLKEANGVLGICGAPDDAYQILRGLRTMGLRLERHYESALDIAKWLEGREDVARVLHPALPSFPSHHLWKRDFKGASGIFSFVLAADGPEKSRAKAHAFLDALRIFGLGYSWGGFESLALHAYLNDRKVAKAPTDGPVIRLQIGIEDVADLKVDIERGFAAASAV >NZ_CP054031|848529:939654|850993_851431_-|WP_138329766.1|DBSCAN-SWA MTQTIFVLNGPNLNMLGKREPGIYGGKTLRDIEVDCKAAGRELGLDIDFRQSNHEGTLVDWFHEADEKAVGVAINAGAYTHTSVALHDAIRAISIPVVELHISNVHAREEFRHKSMIAPACEGVICGFGPHSYILALHALKNITA >NZ_CP054031|848529:939654|889681_890404_-|WP_173862935.1|DBSCAN-SWA MQPSTASKPSPGTSNRQRPTISARGLSKCSAISASGRLIAWSKTPARRVWQLDFLFAFRYTQIVTKRNEAQMAKTPAERTREHRERLKARERENLLQPAAPPAYVQTPFHAFMEGRHVDFEENLDAYGVQISGTGLGEEVQKFETEAPWERSFTSLERARGMMGVFLDAAKELAALINEYKLQEIEAAIDAAHEFSADLPRGDVQALKKSFAEIERLKAIRSELRKPTRHTLLSVDAKGE >NZ_CP054031|848529:939654|900524_902117_-|WP_138329674.1|capsid|DBSCAN-SWA MTNAAVLETKAEVSIDDAGTVTGIAWPFGKPDSYGDLIEPTAFKFAPEVPMLMEHEQGGGAVGIWNSLAVTDKGLEVKGRLFLEGVGPAREARRHLIAGSMSGLSIGYQRHEHKARPEGGRVLTNITVTEISLCRRPVHPDARTTEVKSIIEGMSMENELKNEPEVKSDPVASPDEIKALKARMDAFEAKANRLRPANNNHPDGANDNSEKKALESFMRTGSIAEVKAIASDSNVDGGYFVLPTTDLTIRNLLTDLSPMRGLAEVIGIANDKYERFYSLGKRGAKWVSERSDRPQDTATPDLIKHSYGTAELYAAPTTTRTLLEDAAVDLSAWLINNAVQDFAETEGESFLTGDGADNSPKGLLTYPTASEKDFSREWGKFQYVPVGATAPTDKQLADALIKLVATLRRPYKGNAVFLMNGNTAVRLRQIVDATGRYLWAPTGNLIEGVDHPLLGYRVEIDDGMPDIGSGAHPIAFGDFRQGYVIVDRQGIRVEQDSTTRKGWIVFDSYKRVGGGAGDFNAVKFLKISAN >NZ_CP054031|848529:939654|872701_873913_+|WP_138329737.1|DBSCAN-SWA MSDILIIGASHSGVAAAAALRSAKYDGSIMLVTQENVLPYHRPPLSKEALSKDDYSPTPLRPETFYALNQIDLVQAVKIVALNLAESYALSEDGTRFPYGRLILACGAEPRRLPTSVDADGIAHALRTHDDLIGLKARLAGAKSVAIIGGGLIGMEIAAMALAKGLAVTVVEAGSRLMERTVSKSIANYVLDRHLNQGLHVRFGATVVTIVRDGGRDVDLVTLSDREQIEADIVVVAIGAAPHENLARDAGLEVNNGILVSEIGRSSHPAVYAVGDCSAWYDPVLGRHVRNEAVNPGQDQAKIVAAAIAGASPPPKRLPRYWSHQAAIQIQMSGDVNGADMEAVLNAPASGAFSVLGFKSDRLVAVQTINAPQQFGKLHEMIGIDRDAVAATLDVEFPPPHHH >NZ_CP054031|848529:939654|902451_903657_-|WP_138329671.1|portal|DBSCAN-SWA MLAGLKSAFGFGNEQKANALNEQLALTLLDIRSTASDIHVGPTNAMEVPAVNCAIGIISEKTGDTPFKLYKSDTRETARDHPAYKLIHDEADAFTSAAQFRINLSVDAMLHDNGHAHVIRSSDDRPIALQRLEPGTVQNRTEDDGTPYYVVSENRGQRRYEFTDILHIQPLGGKAPIKTAREAIALAIAFERHMAGLLANGGRPSGIILAKKRLDPDPKQKVADSWFSTHGGKNSGGTAILDEDMDYKQIATTLADAQFAENRLEQIREIGRAFRIPPTMLFELTRGTWSNTEEMARQFYTVTLKPWLTAWTWAYSRVLLTPEERDQFYIEPVLDDLLTTDFAKKATAFGQYRSMGVYTANEVRRMLNMPPHPDGDSLSNPHITTTANTAPEAPEAPKEAA >NZ_CP054031|848529:939654|885636_886032_+|WP_138329715.1|DBSCAN-SWA MLGNTLKIALAGIVLVPAMVSEGYASASSSYMPSEELIENCRAFRAYRAGQDVLYGQKASQLAFKAATCWAYIQGVEDDSALRERSCVPFPTEVDILVGLFNDYMEANPEARKYGAASNVNAALQKAYCPK >NZ_CP054031|848529:939654|918244_919030_+|WP_138329648.1|DBSCAN-SWA MNLNTPTFSSFTHDGLQLAFFDEGDPAGVPVLLIHGFASTANVNWVHPGWLKTLGDAGYRVIAMDNRGHGASDKPHDAEAYRPWVMAGDAIALLDHLGIPEANVMGYSMGARISVFAALANPHRVRSLVLGGLGIGMTDGVGDWDPIADALLAPSLEAVTHARGRMFRAFAEQTKSDRVALADCIRGSRDLVARSDMAKLDMPTLIGVGTKDDIAGSPQELAALMQNAEALDIPGRDHMLAVGDRVFKQAVLAFYARVAHR >NZ_CP054031|848529:939654|883711_884068_-|WP_138329719.1|DBSCAN-SWA MEIVLLKAFLITTTVMLIASDAFAEEQYTVGKAMESSDQKWLNDRITTIGNTLTWANDFAKDDGKPLFCPPPKLTIAGEQWRRLLDDMLNEKPSMRNEHISYLAPLLIFAAKRTFPCN >NZ_CP054031|848529:939654|866716_867670_+|WP_138329748.1|DBSCAN-SWA MLRYTISRLLQAVLVTLVMSLLLFVLIGLMPGDPVETMLEGNPSLTPETMAQMRALYGMDQPLFARYGHWLYQALHGDLGYSSVYFKPVLDVLWPAVIQTAKLLVATEIISIPLAMLLGALAARKPGGWTDNVISLIAFASISLPGFWTSLILIIVFSVKLGWLPASGTPLMSDAPVGEQIVHMILPVVVLTFFHVGPLVRYVRASMIETLNSDFVRTARAKGLSETAVVMRHALRNALIPMVTVLALGFGSLFSGALVVETIFGMLGMGKVIYDAISNIDFNLALVGLLLATIVTLFSSLLADLAYAWLDPRITLK >NZ_CP054031|848529:939654|886966_887896_-|WP_138329711.1|DBSCAN-SWA MVAKKNEEVSASFHYLLRLKKADKVVVVPFTIGDFAALFARMRAQKSFDIRNEDDIDRLRFRQEAPLENLEFVNSRTITGTFRASYWGHSFENTHKGQISSQSVNLRPFHFVLYLAESGRIYIGAQYLGQYGGYETLRKTIVDMLPQSKEVTSTSIRLGASYYKNAEPREIRVNIANKATSIAGRSSLGGKVMIAFTRASKDDPLVQQVKTQVIPFFGRKQSEIKQAVAGLMNQSDVVEIDDDDIIDCTVLADMNGRTATIHMFENGFRATRVTLDVEVDDEGHPANADTCNEILITLNEHVIKVTENG >NZ_CP054031|848529:939654|899531_900113_-|WP_138329678.1|DBSCAN-SWA MTFKRPAYEEVTIAHGGKTVKLRPSLRAATILEARHGFPALFRALGDFNVSVISEIILTACSDRQSAAAFLSSLSGRPLSPFFMAVHKPLGDLVEMLQIAPDPKAKPSTGKGTAWPDIYRMLYRTATGALGWTPDAAWNATPTEINEACIGRFGDGEKQPDAEQAAQNEALGFDPEFDRAGLRALKAKIAGGA >NZ_CP054031|848529:939654|910114_910720_+|WP_138329659.1|DBSCAN-SWA MTHNHNRLSFAVLASTCALLAMPASDARAIDVGVSVNAGNAVSADVGASIGSGSGISADANASVGGSNGVNADATANAGGGRGIDADVNASVGGGRGVNADLNARTGGSDGIDANATASVGSGNGVDANLAIGRVDGAGSSGRPAGERSLSASQIRTLEAFQARPVNEQRKMFVRCADISASGGDSGLAGLCSLLQATASR >NZ_CP054031|848529:939654|924524_924956_-|WP_138329641.1|DBSCAN-SWA MTRQADYDDDNIFAKILRGEIPSHRVYEDEHTIAFMDVMPQAPGHVLVLPKAPSRNLLDADPATLTHAITVVQKIANAVKDVFDADGVFIAQFNEPAAGQTVFHLHFHVIPRHEGAALKPHSGKMEDGAVLAANAEKIRAALA >NZ_CP054031|848529:939654|853740_855081_-|WP_138329762.1|DBSCAN-SWA MACAVGTFDDVTPLGGSAFRWRRSRSFLYGVVGAGFLTSTWLIATFATMHSIAAPYSPSDRLAQLGAAPKVAPSPRHERLIHVGKADRLAAFDAKPKAPLTASLVQSHAEKTAAAATALAVLAKNNRLAMAAEQPVVAAISDSAKFRKDDVIAAAPVELASAETEEDDGDRIIVPSKEAVAAAELPTLLALADAGSRQIAPAPVEPTASPLKTASIPASEPAPVVVAAAEDAPPFDLVLAPDEGSVPLPMARPDGLIGKPTPASKGSERSAEPVLAYAQPNSPIEDDEDDAVPRYDKPVFSPKLRAGVAIYDIENSIVYLPNGERLEAHSGLGKMRDKPRFANKKMRGPTPPHTYVLTMRESLFHGVEALRLTPIEGSDAIYDRVGLLAHTYMLGKNGDSNGCVSFKDYRRFLAAYKRGDIKQLVVVPRLNNKPTSTLASLFSRRS >NZ_CP054031|848529:939654|917839_918085_-|WP_138329650.1|DBSCAN-SWA MAGHNIPHFQNDGGHRVIEVGVKEFMCTGASAPFDHPHIFIDMGDDNEKVCSYCSTLYRFNSALKPRQTNPAGCVFHVKAA >NZ_CP054031|848529:939654|864757_865771_+|WP_138329752.1|DBSCAN-SWA MLHVNGAVVGDGAASTAESSTPLLDVQDLNVSFPGPQGAVQIVRGLSYSIARGKTLGIVGESGCGKTMTSLALLGLVAGGGRVDGTIDFNGRDLAKFTQRQWQQVRGREIAMIFQEPMTAMNPVMPVGRQIAEVLVKHEKLRKVAAHGRAVDLLAQVGIPSPARRAEDYPHQLSGGMRQRAMIAMALACRPQLLIADEPTTALDVTIQAQILDLMLTLQDDAQMSIQFISHNLAVISEIADSIMVMYAGTAVEYAPADELFANPLHPYTQGLIQTLPDPDKRVERLYVIPGRVTSDITTGCRFARRCPFADDGCRQIEPDLLEVAPGHKVACHKVKP >NZ_CP054031|848529:939654|865767_866712_+|WP_138329750.1|DBSCAN-SWA MTELLNLEHINVDIPIGGGFLKPAVWLQAVNDVSLSVLKGETLALVGESGSGKSTLGYVVAGLRKQTAGTVTFNSARPGGEGHTPVQVIFQDPFSALDPRMVVGEIVAEPLRLKGVPADKRQARAVDVLGQVGLPPEAARRYPHQFSGGQRQRIAIARALIAEPEMIVADEPLSALDVSIQSQVLNLLDDIKKQHGISYLFISHDLGVVRHLADRVAVLYLGRLMEVAASEDLFQTPSHPYTQALLAAVPRIGRGRRKKEQILRGEIPSPLAPPSGCVFHTRCPKAQDICKSQRPALLPAPGRPTQLSACHFKD >NZ_CP054031|848529:939654|888512_888920_-|WP_138329707.1|DBSCAN-SWA MGRARCGLSAALFRQRDLLRGQEPAPGCGLKTRAERRRARRKRWFVQAEVEEFIRREYPACPEFAVAYFATRICTVPKNWRSAPIAEAVEVTMQNELRHQLTDYDQLMLTGVRRKEARRRVQPRLDAMIQSWKKA >NZ_CP054031|848529:939654|900116_900512_-|WP_138329676.1|DBSCAN-SWA MKDTYHDNKAVQALAPAVVTAAVAGSAVDLAGFDSALFVINTGATAGSGDFGVKLQESDTTMDADFADVASADRLGTVPATLAANSAYCLGYIGSKRKRFVRVAVTKAGGTSIALGATAVLGHPAIAPVAA >NZ_CP054031|848529:939654|902113_902455_-|WP_138329673.1|DBSCAN-SWA MISHTAYFGDGEKTFALTDPMIEELQHKTGVGIGALFLRMTVSQFHGSDMEHVIRLALIGGGTNPAEAQRLVDAYAKNRPFDETFPLALDILDARWNGRPEIPEGEGLREVAE >NZ_CP054031|848529:939654|874742_875474_+|WP_138329734.1|DBSCAN-SWA MTYFDLEAIRREIAPSGMLVCALNHGNVVLVRRGPTDETPTGVSVDLARSLAEKLSLQIRFRHYDKAGDVSASVGTEEWDVCFLAIDPQRAERIAYSDPYVQIEGAFLIRRAAGPLGLKDVDRLRLKIGAVRGSAYELFLSRHGGAGELIRFDSFPEAVAALTGGDLDGLAGVRQAMSIVARDQPDFAVMQEPFMAIPQAVGISADRPAAAAFTRAFVQEQKASGFVRLSLVLSGHADVVVPP >NZ_CP054031|848529:939654|881551_882040_+|WP_138329724.1|DBSCAN-SWA MRGLILFDAIRSAILALFFMVALPALADEEMRFDKEPLLIQTAAGKVLHFTVEIASTPDQRARGLMFRKVMADDAGMIFDFDQPRRVTMWMENTILPLDMLFADDTGTIRHIKEKAVPYSRDIIDSMVAVKYVVELNAGIVAKLGIKPGDRIVSATTTKKAK >NZ_CP054031|848529:939654|932112_933219_+|WP_138329630.1|DBSCAN-SWA MALSRRAFGVGLLGAGVVGTGGYFAVRDRPELQGLLGSRTTLFGFVGGEKEAFLADPDVIRALGGYGLTVNSRVAGSVEMVREQALLSQHPQFLWPSSSIMVDIARQNGISIRNDRVVLNTPIVVYSWQPVVDGLMKAGLVTVTNEGHHQLDLKALLDAILAGSDWSKLGVNSLYGRARIVSTDPNRSNSGFMFAGLVLSLFSGNVATSGDLATFGGKVQAIFRNMGFKSPSSGKLFDQYLAGGLGGEPMIVGYENQLVEWILADPARWERIKASAGAKPVVLYPRPTVYSAHPLIVVDENANRLIEALVSPKLQELAWTRHGFRGPLGTATGNADSAIGTLLPAEVDAILPMPDAGVMLSLLPTLAS >NZ_CP054031|848529:939654|909194_909644_-|WP_138329661.1|DBSCAN-SWA MGIFDKIKHAIFGEAKAAEPVAAGAPKTEPAQAPASPSAAPSPAPAAPAPQSKPVPTATAPGTATVDIVPILDAAVKKSGQKLDWRRSIVDLMKAVGMDASLTERKELAAELGYTGDTNDSAKMNMWLHKALMKRLSENGGKVPADLLD >NZ_CP054031|848529:939654|870389_871373_+|WP_138329743.1|DBSCAN-SWA MKPNIAILDDYLGISQEVADWGGLKSRANVVVFDRPLALPDEAARELAGFDIICTLRERMPIPGELINRLPRLKYIVVTGKRYDTVDIATAASRGVLVSNTPVSGAGAGGVAELVWGLILSATRHIASEDRSMRRGGWQTQAGTTVGGKVLGILGLGSLGRRVAEIGKVFGMELQAWSQNMTAEQAEAAGARLVSKQELFATSDVVTIHLALSDRTRGIVGATDLQAMKKTAYIINTARGAIVDEPALIAALRSRSIAGAGLDVYEKEPLPVDHPFRSLSNVVITPHLGYFTRDMLGTYYGDAVRLIEAFLDGRPERVVNMGLDTGC >NZ_CP054031|848529:939654|934410_935337_+|WP_029873155.1|DBSCAN-SWA MSEITAAMVKELREKTGAGMMDCKKALAETGGDMEAAIDWLRAKGIAKADKKSGRTAAEGLIGVSSQGTKAVVVEVNSETDFVARNDAFQELVRGIAKVAVSTDGTVDAVAAATYPASGKSVSDTIKDAIATIGENMNLRRSVALSVEDGVVATYIHNAVSDGLGKLGVLVALKSTGDKEALNAIGRQVAMHIAATAPLAIRPEEVDAAVAERERNVFIEQSRASGKPDNIIEKMVDGRMRKFFEEVALLSQAFVINPDLTVAAAIKEAEKAVGAPIEVAGMARLLLGEGVEKEETDFAAEVAAAVKG >NZ_CP054031|848529:939654|886305_886974_-|WP_138329713.1|DBSCAN-SWA MASIRSIIANNNATFYNYRTSGRVRIWPRIFLFSISSPALAFLLTDKLTDFINSINTVASILLGFGFSVLFYIASGKEDQSSPNSSLEQTNRARRVNGLSRELFHNVSYFVMTASAGLAFAMVVIAPEAQGDWFTKHALPYLVTITPSATDYLWWASLCTRSVFFFLIIEAGYTFARIVSRVSFLFEEKLASGSANDSDDSFTKKAPSQRHKSQLSHIDKNK >NZ_CP054031|848529:939654|863296_864658_+|WP_138329753.1|DBSCAN-SWA MVSTRNWKVALAGECMVCRPFAMHDEPQFTEVWDVIKDADVTYGHLEMNFADYDELKWPARGQGIGSFMMADPEIAKDLRWAGFDIMSTAHNHSFDFGAEGLIATKKHMKAAGIVTAGTGADLELASEPGYVDKKNGRVALVSTSSGNQHFMWAGMPKGALRGRPGVNPQRLTFEFMVDEETARNLKDFGEKFNFMKKPKHGREGSFGVQIPGAQQWGDPESFFVGDRCEVISRCHKRDLERNLRSVHEARSMADLVIVAHHFSVSDGPRGDTPPGFVKEFAHAVIDGGADIYIGHGWHRTLGIEIYNSRPIFYGIGNFFAQSEFIQRVPYDSYDAWGHDVDRLSMLTPAAHPLHPGLDTPTDTWWSSAVIKLDMEGERLKRILLHPVEMGRDTSPDVKQTRRTGKGDHHNTEGRPMVAKGEDAVRIIDRYRRLSEPFGTKIEIRSGVGVIEL >NZ_CP054031|848529:939654|914050_915076_-|WP_003559026.1|DBSCAN-SWA MKNKLLSAAIGAAVLAVGASAASATTLSDVKAKGFVQCGVNTGLTGFAAPDASGNWAGFDVDFCKAVASAVFGDPTKVKYTPTNAKERFTALQSGEIDVLSRNTTWTINRDTALGFNFRPVTYYDGQGFMVRKGLNVKSALELSGAAICVQSGTTTELNLADYFKTNNLQYNPVVFENLPEVNAAYDAGRCDVYTTDQSGLYSLRLTLKNPDEHVILPEIISKEPLGPAVRQGDDQWFDIVSWTAYALINAEEFGITQANVDEMKNSPNPDIKRFLGSETDTKIGTDLGLTNDWAANVIKGVGNYGEIFERNIGQGSPLKIARGLNALWNKGGIQYAPPVR >NZ_CP054031|848529:939654|937661_938492_+|WP_138329626.1|DBSCAN-SWA MQRELKLRIVSGVILAAIVLAATWYGGLAFRILAVVIGLLIYYEWSKMTGIARDWVANAVGWIGEAVIAFLVLVGNFEFAAGMLAGVTAVGIALIILQGTSRWLPVGLFYAGATGLALAAIRGDDRLGLYAMLFVFAVVWATDILAYFVGRALGGPKLAPSISPGKTWSGAIGGAVSAVVAGVVLVHFLLPGAEIIAAGVAFVLSVCSQSGDLFESFIKRKFGVKDSSRLIPGHGGVMDRVDGLIFACFSAFLLAGLFSLIKGAGMTSLGAALFGL >NZ_CP054031|848529:939654|851592_852351_-|WP_025394372.1|DBSCAN-SWA MAFFPKTFTALALAASIALPLPAAALDDQQKKEFGEFIKQYLIENPEIMLEVQDALQKKQEAQRSVKANMAIEENATDIFESKNDVTLGNPKGDVTVVEFFDYNCSYCRHALPDMQAMLKKDKNVRFVLKEFPILGPDSVAAHKVADAFRKLAPEKYADFHVALLGTEGRASDETAVAVAASLGVSEDKIRAEMAKSPNDGIVQATYQLASSLGISGTPSYVIGSELVPGAVGLDDLEAKVKNMRSCGKTAC >NZ_CP054031|848529:939654|892085_892439_-|WP_138329694.1|tail|DBSCAN-SWA MAIPTFQPPVGPSPGTAHKPTVNLWEADFGDGYSQPSPKGINHIKRAVSLSWNALTYDQMQELTGFFESMGGNRPFYFQPFGETSARKWTCKDWTFTTEGGIWIVTAGLVQSFTTAV >NZ_CP054031|848529:939654|908722_909034_-|WP_138329663.1|DBSCAN-SWA MRLMAVVGVALSLACISAAESPSNYQSYSGIRVDTDPVLLARYDKALQHCLPEASSWRRGSSDPRSLHYNAALRNCLYRRSFVDRGVYAYPIPQVYFDHFLDR >NZ_CP054031|848529:939654|880334_880775_-|WP_138329726.1|DBSCAN-SWA MRYLHTMVRVKDLDASLAFYTTLFGLEEIRRHENEKGRFTLVFLAARDDLDRARSEKAPCLELTYNWDTEDYSGGRNFGHLAYEVDDIYATCQNLMDNGITINRPPRDGNMAFVRSPDGISIEILQKGNPLPAAEPWSSMGNTGAW >NZ_CP054031|848529:939654|896700_897012_-|WP_138330069.1|head|DBSCAN-SWA MRAGKLNRVIFLDRLTAPLNENRTPVPAWTNIATLRAEVLQHSIDEAEADNGERDTDSISFRTRFFTGLTTADRIRFMGRTYNVKGWTEIGIRGGLEIKAVAA >NZ_CP054031|848529:939654|936320_936881_+|WP_138329628.1|DBSCAN-SWA MSEGIDIKELKRRMDGAISAFKSDIASLRTGRASANILDPVTIEAYGSRMPLNQVANITVPEPRMLSVSVWDKSMVSAVERGIRESNLGLNPIIDGQNLRIPLPELNEERRKSLVKVAHDYAEKSKVAIRHVRRDGMDGLKKAEKDGVIGQDESRVQSERVQKMTDETISEIDRLLGEKEKEIMQV >NZ_CP054031|848529:939654|888101_888395_-|WP_138329709.1|DBSCAN-SWA MQFNYRATARFMHRQFVRFAWTSTQQKWTTAFILAGVALASIMMPGLGIAVFGTAFAGWWLAVLVMTVFFGLVGNRVGVGREHAKLLRQRQTDGRRD >NZ_CP054031|848529:939654|889346_889589_-|WP_138329702.1|DBSCAN-SWA MSNENTMTSAAFATHEARTKLLDLIADDIERIRATPESEMDHFQEAMELLEKVEMWTKEYHSDGYLILRDPADEITMESI >NZ_CP054031|848529:939654|907738_908029_-|WP_138329665.1|DBSCAN-SWA MFMMQGPNVMNRYTSHQFPALRILMSGQEKYSVVRTLGDAATMLISEWPGDDGEEYVVAVRTCLDAIRGTTPPDAAREALIRAADEAGIRYLSVVH >NZ_CP054031|848529:939654|919159_919993_+|WP_017960263.1|DBSCAN-SWA MVAKTDIRAFDTGHPVKVMDPIWDSLREEARLAAERDPVLAAFLYSTVINYHSLEECVIHRICERLDHPDMQANLLRQTFEEMLLDWPDWSSILRVDIQAIYDRDPACLRFMEAVLYFKGFHALQTHRLAHWLLNRGRRDFALYLQSRSSSVFQTDINPAARIGKGIFLDHATGLVVGETAVIGDNVSILHGVTLGGTGKEGADRHPKIGSGVMIGAGAKILGNIEIGYCSRVAAGSVVLKAVPPKKTVAGVPAKVVGEAGCSEPSRNMDQVIGADI >NZ_CP054031|848529:939654|920611_920974_+|WP_138329646.1|DBSCAN-SWA MFNIEDANKKSKEAVDTALKTYSDTTKGFQAIAAEATEYSKKSFQDAVTHFETLAGVKSFEAAFELQTNYVKAYFEGFVSETTKLSEMYADLAKSAYKPYEAPIAAAVVKTAKSATPAAA >NZ_CP054031|848529:939654|894704_896309_-|WP_138330071.1|terminase|DBSCAN-SWA MVIYPEWLFDGSPIEDTFGDGERAVQWLRRNKHPKNPAPGHPFQLDEWQERIIRAIFGPRNPDGTRKIKKVVIQLGRGSRKTALAAAIVLLCTFGPEKIPGGLIQSAAFARKQARELFEEVALIVSQDRRYNGTARVREYKSQISNLKTRTRYEAVSSEGLGQHGSTPSVVVADELHAWTTEKHRELWRVLSSALDKTNNSLMVVLTTAGRGQETLAYKEVSAAKRIQLGQIVDPHVLPIIFEASADVDWKDESLWHKLLPGLANGYPSLQALQERKIKAEYSVIEREILQQLYLGVWQNQSSSPFVDMATYDRCGIPIDFKALAGKPCFLGVDLSEVSDLTSVVAAWPTDDGGYIVKPWFFCPEDALAKKSRVEGVNYDEWVKAGLITPTPGKSVNYAFVEDQIRQICEDHDVRQIAFDPWRAQKTQQNLMDDDLPVVEYRQGFKSMSPACDEVERAIIDGKLYHAGNPILRWNFDNVAVVRDAAGNRKFDKSKSRDKIDGAVATLMAVHFAAIYQDNASHYNDPDSAGIFTF >NZ_CP054031|848529:939654|910778_911552_-|WP_012757299.1|DBSCAN-SWA MAEAPAKKLTVSATEVAVEIVNMNKWYGDFHVLRDINLKVMRGERIVIAGPSGSGKSTMIRCINRLEEHQKGKIVVDGTELTNDLKKIDEVRREVGMVFQHFNLFPHLTILENCTLAPIWVRKMPKKQAEEVAMHFLKRVKIPEQANKYPGQLSGGQQQRVAIARSLCMNPKIMLFDEPTSALDPEMIKEVLDTMVGLAEEGMTMLCVTHEMGFARQVANRVIFMDQGQIVEQNSPAEFFDNPQHERTKLFLSQILH >NZ_CP054031|848529:939654|868650_870339_+|WP_138329744.1|DBSCAN-SWA MTSAGKRLKSFVTVALATAALTSLSSASAMAQAKDQLVVGQLQFLTNFHPLVQVNNTKRLVINYSLRPITAFDENVVNHCILCETLPTIDNGLAKIVDLPDGKKGMKVTFKLREGLAWGDGVPVTSKDIEFTWKMARDPKIGFSNYNSWTRASALEIVDERTVVLTIPQIISSYNSWDQVIPEHLEGPVYAANPTLDTYVKQSLYNLQPTNPGLWNGSFLLSDYQIGTRIVFTPNPKWPGDKPHLQRIILSYRDNSSSLLQNLLAGSVDAVPVSPGGISFSQMLDLKNQQPDKFTYHLAYGTNVERIAVNFDNPILKDKAVRQAILYAIDRQAISDELFGGLQPVANGILSSENAYYNKDMTLYSYDAEKSKELLESAGWKPGSDGICVNDKGERLSLELVSTAGNQTREQIAQVIQSQLKDVCIEITNNFVPLQEFNGEMARKRKFKALMMSSIDFSPSVSPRIALGSDAIPGPKNNGVGNNFSAYANPEMDKAISELEAALDPETAKQKWAAVQKIFADDLPMLPLYFYPRAYVTVTGLTNFRQGTLDPLQIWSEEWQRQ >NZ_CP054031|848529:939654|931495_932113_+|WP_138329632.1|lysis|DBSCAN-SWA MRNWLGNDGNWIVAGLAAAITVPLLSFAAGMPFWIAIIIALLVFAGLVILLAPRRLFEGLDIKSIGSGRVAFARDLLEAAVPFAQRLETAADTINDRQMAAAVRHLAEIAADVFRKVEAKPESANAVRRFLSYYLPRAAEVAEGFAVIEAKRVPDPKQLEEVRGVLVKLEEAFVHYADSLVDEELGTLDTDLRLIQASLKEDIGR >NZ_CP054031|848529:939654|899184_899535_-|WP_138329680.1|DBSCAN-SWA MSRRPPHLCQCGTRIIPHGERCPCQIRATRERNARHDALRGSASSRGYDAEWRRASRAYLLEHPRCAMPACGNPATLVDHIVSIRRAPLRRMDRTNWQPLCAPCHNSIKQRLERNN >NZ_CP054031|848529:939654|882250_883054_-|WP_138329722.1|DBSCAN-SWA MQYGHNLVLIMPLLSDAEQQQINELSYTTETLATGKVGRVSIMGTPTSIAAIVCTLPEPGKAENGAPLWTPAESAWINRSTYHMLSIIRLTWDDQADFFKPQDHGIFSASSFSNDEKPTFFLRAEMIAPKQPVDAQLLATSYAVTSERNLHHIMDLLSESHSTRIPSHYRFLALYKIIELEHPRKTGDFHSSMDEEFGALKIGAKPLRKLLPQLRVKIAHALATGATSPGLDDIDRDAVSTLIPIMQKAIRNDIAERHGLRFEIIPG >NZ_CP054031|848529:939654|855288_857169_-|WP_138330073.1|DBSCAN-SWA MYNEWIKRVVLPPEEMVTDSKQENGGKAGFDATDLDPYLLKDPEAMAMNFARALENLGQAASAWLAPRERGEISESAADPMTDMVKTLSKVTEYWISDPRRTFEAQTQLMSSFFGIWMRSMQRMQGDPTPPEPDTRKDKRFSDEDWQKNPFFDFLRQVYFVTTDWVEKMVSETEGLDEHTKHKAGFYVKQITAALSPTNFIATNPQLYRETIATSGANLVRGMKMLAEDIAAGHGDLRLRQTDMTKFAVGRDMALTPGKVIAQNDICQIIQYEASTETVLKRPLLICPPWINKFYILDLNPQKSFIKWCVDQGQTVFVISWVNPDARHADKDWAAYAREGIDFALDTIEKATGEKDINTVGYCVGGTLLAATLALHAKEKNKRIKTATLFTTQVDFTHAGDLKVFVDEEQLESLEEHMQAAGYLDGTKMSMAFNMLRASELIWPYFVNNYLKGQDPLPFDLLFWNADSTRMAAANHAFYLRNCYLKNALTRNEMILDGKSVSLKDVKIPIYNLATREDHIAPAKSVFLGSRFFGGKVEFVVTGSGHIAGVVNPPDRKKYQFWTGGPAKGDYETWLEHATETPGSWWPHWQAWIETHEGRRVPARKPGGDALNAIEEAPGSYVMERA >NZ_CP054031|848529:939654|883123_883660_-|WP_138329721.1|DBSCAN-SWA MTEPEAVELADLVPMRTEIGNCIAAFSMVESMLSLLYGNLMHPAPPRLCYLTLDEARHIESKCRIIKAVGQLVLAADDLPRFKNIMKRVQHKAAVRHKIAHWHVGHWHKNQPVSGVKEMREMEPRLMPAYFSAKNLDFSPDETLVISDLKDFATGSAKLANDILEFASKIPSAVNGGA >NZ_CP054031|848529:939654|848529_849144_-|WP_129420733.1|tRNA|DBSCAN-SWA MAGSRRKSPGITPDILLRAYSIGLFPMAESADDPEIFWVEPELRGVLPFDHFHVSKSLAKTVRRKPFEIRFDHAFDQVIAACAEETSGRPSTWINRTIRSLYSTLFDMGHAHTVEAWENDELVGGLYGVSLGSAFFGESMFSRRTDASKICLVHLVDRLRERGFTLLDTQFTTEHLKTFGAIDVPKADYAAMLAAAMESPHLAF >NZ_CP054031|848529:939654|927801_928638_+|WP_020049770.1|DBSCAN-SWA MFRSSLVALLLASVSANAWAAAPAVSAAIATGLVAHRAVYDLELKDASDRSGIAGMYGRMVYEFDGSYCQGFTTNFRFVTQIDTGDSVRVSDQQTKTFENLKDGKFTFDTKSFTDEQLDKEVNGAAQDQPDGVKVDLKQPASRELQLLESRFPTEHMLDVIQHAKDGKRFFEARVFDGSDDGDKSLATTTIVGKQETPIAEEADAGNAGAFSKTAFWPVTIAYFNESAKSDALPVYRMSFKLYENGITRDLTMDYGDFVLTGKLAKLELLDRKAEVCK >NZ_CP054031|848529:939654|859189_860515_+|WP_138329758.1|DBSCAN-SWA MADALKIGIAGLGTVGASLVRIIQQKSNELAVTCGRPITITAVSARDRARDRGIDLSTVTWFDRPEELAEKGDIDVFVELMGGAEGAANVSVRAALQRGLHVVTANKALLAYHGVELATIAEEKGSLLNFEAAVAGGIPVIKALRESLTGNSVSRIYGIMNGTCNYILTKMEKEGLSFAECLKEAQRLGYAEADPAFDIEGNDTAHKLSILTTLAFGNQIAADDIYLEGITNISIEDIHAAAELGYRIKLLGVAQRTDTGIEQRVHPTMVPVDSVIAQVDGVTNAVAIESDVLGELLMVGPGAGGSATASSVLGDIADIAKSQPGAQRVPVLGHPAATLEPYRKAQMQSHEGGYFIRLTVLDRTGVFASVATRMAENNISLESIVQRSKQHLAPSHHQTIILVTHATTEDSVRKAVASIKSEGYLFGEPQVIRIERPKEEG >NZ_CP054031|848529:939654|920133_920337_+|WP_003539277.1|DBSCAN-SWA MKPEEIKKLDAYFKRMFNPQMIVKARPRKDDSAEVYLGEEFLGVVYIDDEDGDRSYNFSMAILDVDL >NZ_CP054031|848529:939654|888829_889036_-|WP_138329705.1|DBSCAN-SWA MSSRYEGMSAKDADDLMVGIIGLLVADAMDEARAMTQEEWDVRDAAFLPHYFASAIFYAVKNRLRDAA >NZ_CP054031|848529:939654|875526_877221_+|WP_138329732.1|DBSCAN-SWA MRMSGKFKAVVGKSRTGAVVLMASLLLSAANNANATDGTAPITLANVTGGRVEGVPSDTPGVTQFLGIPFAGNVSGENRWKPAPPVKPWDGILTADKWGDQMLQNPNSLGRAPISDNGLNLAVWTPAHSIGDRLPIYMLIHGGANRLGSSEMKDLYAAQLAAKGVVVVSVQYRLGAMGWLSLPEMDKDSGKGPKGNFGVLDLVDALHWIQKNAEAFGGDPKTVTIGGQSAGGENTVALLRTPLAKGLFKRAFIQSSFTGFLPGKVVDFAKKSEQNQEAVNKLLGKEVTLADLRAIDSKTWLENWQGGKETLYGAMTGAVATNQFYTIDDYVFTKESVNLVQPGDFDGLDIIIGQTADEYTGLRPNDLKMTEGEQHEAMLAAIRPHQAGNVDDAVFPHYETNDPVEAYRLSLRMLNDYMFEYVRVGAEFAKAHSSANVYLYYWDHWPPGKDQGFRRAWHAGDNWYFNGSLRPGNRDQLPWTDPDFAMRDMAITYLANFIKTGDPNGNSVPHWGQVTPQSGGQFIRFHEGEAAMRTSTLYPSRDAYLRKKILEGIGMTERDIAEKP >NZ_CP054031|848529:939654|921611_924104_+|WP_138329644.1|protease|DBSCAN-SWA MPTFSPSLEKALHQALTFANERHHEYATLEHLLLALIDDADAAAVMGACNVDLDALRKTLVEYVDNELSNLITGYDEDSKPTSGFQRVIQRAVIHVQSSGREEVTGANVLVAIFAERESHAAYFLQEQEMTRYDAVNYISHGIGKRPGASDVRPPRGAEDEAESSKPTARGGEEDGGPKKQQDALKAYCVNLNEKAKGGKIDPLIGRHAEVSRTIQILCRRSKNNPLYVGDPGVGKTAIAEGLAKRIVEGKVPEALADATIFSLDMGTLLAGTRYRGDFEERLKQVVKELEEYPGAVLFIDEIHTVIGAGATSGGAMDASNLLKPALSSGAIRCIGSTTYKEYRQFFEKDRALVRRFQKIDVSEPSIEDAIEIMKGLKPYFEEYHHLRYSNDAIKSAVELSARYISDRKLPDKAIDVIDETGAAQMLLPPSKRRKLITEKEIEATVATMARIPPKTVSKDDEAVLANLEKELRSVVYGQDIAIEALSTSIKLARAGLREPNKPIGAYVFSGPTGVGKTEVAKQLASSLGVELLRFDMSEYMERHTVSRLLGAPPGYVGFDQGGLLTDGVDQHPHCVVLLDEIEKAHPDIYNILLQVMDHGTLTDHNGKKIDFRNVILIMTTNAGASEMAKAAIGFGSSKRTGEDEEALTRLFTPEFRNRLDAIIPFAALPTAVIHKVVQKFIMQLEAQLSERNVTFDLHEDAIAWLAEKGYDEKMGARPLARVIQDTIKKPLANEILFGKLKKGGVVNVTVGPKEDGKPGIVLEAISETAPIKPKPEAEVVHPEGDDGDDGELKTKAARKTRAKAVPQAEPEVRDAPKKGSAVPKVPRKK >NZ_CP054031|848529:939654|897848_898190_-|WP_138329685.1|head,tail|DBSCAN-SWA MTIVTLSLFKAHLGTDDLLDENALGNLASGTDELLQHYLKAAEAWASGYLGLPLSDFEPDVPGDIEQAILQMAAHLYQNREAVLVGVNAYDLPFGIDDYLRKYRREVTGRVAE >NZ_CP054031|848529:939654|911587_912742_-|WP_138329657.1|DBSCAN-SWA MSVADKPFVRTSILAAEPPPPGERGAVAWIRRNLLATPKDVILTILALALIAWAVPHLVNWLFIQAVWSGPDRTFCATTIQGGIQPDGWSGACWAFISAKYDQFIFGRYPLGERWRPAIVGILFILLLVPMLIPSAPRKGLNAILLFAVLPVIAFWLLHGGFGLEVVETPLWGGLMVTLVLSFVGIAVSLPVGILLALGRRSKMPVIRMLCVTFIEVIRGVPLITVLFMASVMLPLFLPTGWNVDKLLRALIGVSIFTSAYMAEVIRGGLQAIPKGQFEGADSLGLGYWQKTRLIIMPQAIKLVIPSIVNTFIGTFKDTSLVTIIGMFDLLGIVKLNFSDANWASAVTPITGLIFAGFIFWLFCFGMSRYSGFMERHLDTGHKR >NZ_CP054031|848529:939654|897427_897862_-|WP_138329687.1|DBSCAN-SWA MSQNRTLSQQSAELSKRLNAIPHEILDALRPALLKSGEEVAAAAAIFAEASRDSGALIDSIAVTGPGETTPAYAEGGGKRTAGPNQVLVTVGNETMRHGHFIEFGTVNIEPQPFLRPGLRTVKPRIERRTSRAIATALKKYNAR >NZ_CP054031|848529:939654|849151_850507_-|WP_173862934.1|DBSCAN-SWA MPMISKILIANRGEIALRVLRACKELGIACVAVHSTADADAMHVRLADESVCIGPPSSRESYLNIHQIVAACEITGADAVHPGYGFLSENAKFADILEAHGITFIGPTADHIRIMGDKITAKTTALELGIPVVPGSDGEVKTEEDALKTAAEIGYPVLIKATAGGGGRGMKVAKSEADLIEAWSTARTEAAAAFGNDAVYMEKYLGKPRHIEIQVFGDGEGNAIHLGERDCSLQRRHQKVWEEANSPALNVEQRMKIGQVCADAMKKLKYRGAGTIEFLYENGEFYFIEMNTRLQVEHPITEAITGIDLVHEQIRVASGGGLSVTQDEVHFSGHAIECRINAEHPRTFVPSPGTITHFHAPGGLGVRIDSGAYQGYKIPPYYDSLIGKLIVHGRTRVECMMRLRRALDEFVVDGINTTLPLFQDLVSNQDIANGDYDIHWLEDYLAKTPAA >NZ_CP054031|848529:939654|926478_927198_-|WP_138329637.1|DBSCAN-SWA MTNVGWIRDLPVAHRGYHDLNTQVWENTLSAFSRAVEAGFAIECDLHYASDGVPVVFHDEDLQRLCNLHGDIRERTSRELGLIAVGGTSDKVPTLRQLLDLVEGKVPLVLELKGREADDEGFAEAVLEVLEGYQGKAALMSFDHWLLRDLKALGSPYPLGLTANGNTPEAFKTHAEAMEIGLDFISYHYDDLPNAFITGEREKGIPVITWTVRDEEARRHTFANADQMTFEGFDPRVAA >NZ_CP054031|848529:939654|897011_897431_-|WP_138329689.1|DBSCAN-SWA MIEPSLALQATIGNALAADPAVIALVDPDNIRGGSMRPEDFPCILMGGGHTEFLGHASGSQYVARVSCDLHVWALEDGADTAKAVGFAVMNALKEAPAAEGFSIDEFALPSVAWMRDPDPKQSYCHGVMTVEAVMRWSV >NZ_CP054031|848529:939654|912743_913946_-|WP_138329656.1|DBSCAN-SWA MTHGAVDRTPLHDTGWSFRSAMYDPKYRSIFFQVLTVVVLVAFVWWVAHNTAVNLARSNTASGFGFLRGRAGFEIGQSLIGFSSDSTYARALLVGILNTLLVAVTGIFTATIIGFLIGIGRLSHNWLIAKLCTVYVEVFRNIPPLLVIFFWYLGVLSVLPQPRESVGLPFNMFLNNRGLAFPKPIFETGMIAVGIALLIAIVATIIIVRWAHKRQAATGQPFHTVWTAIALIVGLPLLVFVVSGFPLTFDIPVAGKFNLTGGSVVGPEFMSLFLALSFYTASFIAEIVRGGIRGVPKGQSEAAGALGLHPSSVTRLVVVPQALRIIIPPLTSQYLNLTKNSSLAIAIGFSDLVAVGGTILNQSGQAIEIVCIWGIVYLSLSILTSLFMNWFNAKMALVER >NZ_CP054031|848529:939654|892441_894694_-|WP_138329692.1|DBSCAN-SWA MNTPLPGLVVDIEGRVDKLEKAMAKANDVQRRGTKSLEARARQSAQNIERSYAKAASGIRNTVSSIGNVFAGFVSANAAKDLIDSSIKIQNQLKTTGLAGKELKGVYDQLFASAQKNATPLEALVTLYSRTSGAAKDLGANQQDLLRFTDGVSLAMRVSGQSAGESAGALLQLSQALGGGKIQAEEYNSLLDAGRPILEAVAAGMKDAGGSVSALTQLVKDGKVSSEAFFRAFLAGLPIIQAKVANSEATISSSFVRLQNVLIDAAKRFNTSAKAAQDFGGIIDGVAAEINSVNFDNLISKIEAVTSSIQNSIKSAQSWADWLGTVSGAANLGELIVRSLPGDTTVKSVAGVNIVQTDAVQRRITDAFDAPATNSGGLTPEQIKAFATNSGAIAGAEAAAKVSRLPAAPAAEAVKPVSLADYPVPATSTTPSSGTGAGRRAGGGGGSSDSFARELQDLQSRTQAVQSATAAQAALNPLIDDYGAALATAQAKQQLMNAAQQQNKTVTPEMAAQIDAAAQAYGRATAAAEQLQEQQDNVRRSAEEALGTARDVTQGLITDLASGKSGAEALSNALAKIGDAILNNLLGKVFDLKNFTGTGSAGGGLFGALGSLLGFSGGGWTGPGGKYEPKGIVHGDEYVMTKEATRRIGVQALNAMNYGRVPGYAEGGYVGNAPALRKPDLVAANGNTAPAVPMNINTNVTVNASGGTQDQNADLATRVGKQVEQQLKGLVQEQIRLAARPGSFLNTRSR >NZ_CP054031|848529:939654|935433_936156_+|WP_003539289.1|DBSCAN-SWA MSLEPVYKRVLLKASGEALMGGQGFGIDVTVADRIASDIAEARHMGVEVGVVVGGGNIFRGVAVASKGGDRVTGDHMGMLGTIINALALATSLRKLNIDTVVLSAISMPEICESFSQRATLYHLSMGRVVIFAGGTGNPFFTTDSAAALRAAEMGAEAIFKGTQVDGIYTADPKKYPDATRLDRLTHQEVLDRGLAVMDVAAVALARENSIPIIVFSIHEKGGFAEILTGGGLKTIVSDN >NZ_CP054031|848529:939654|880970_881549_+|WP_003539135.1|DBSCAN-SWA MADRISSNEYSDLDELSGEAVDLVEITGVVKWFDVAKGFGFIVPDNGLQDVLLHVTCLRRDGYQTILEGTRIVALIQRRERGYQAFKILSMDQSTAVHPSQLPPVRTHVQVTATSGLERALVKWFNRTKGFGFLTRGEGTEDIFVHMETLRRFGLTELRPGQVVLVRFGDGEKGLMAAEIHPDVPSPASRSH >NZ_CP054031|848529:939654|916656_917811_-|WP_138329652.1|DBSCAN-SWA MPIEHAAIIGAGISGLTAALALSRRGISSEIFEQAGELTDIGAGLQVSPNASRILAELGILDGLSKVWLEPDAIRLISGSSLRQLAAVPAGKFARERWGAPYGVLHRTTLQKALLDAVTADPLCRLHLGVRIDSTLPPFERTPDIVIGADGVWSKLRQFVPGSPSPRFSGNIAYRFTIAETEAPGFLDRASVSAFLGGSAHLVCYPLRETGSFNMVAITAGNIAPQAWQSEATAEQRAQLRARLSGWNSAIVSLLDRNRKLTFWPLFETTSGAWQDGRKTVLIGDAAHAMMPFAGQGAAMAIEDAYELAAFLSNSPVAEALARFERHRAPRIARLRQRGAFNRFAYHAKGPIRIGRDLVLGLKPPLSLAADLDWIYGYRAADLP >NZ_CP054031|848529:939654|933400_934168_+|WP_003539287.1|DBSCAN-SWA MALPDFSMRQLLEAGVHFGHQTHRWNPKMKPYIFGDRNNIHIIDLAQTVPMLSRALQVVSDTVARGGRVLFVGTKRQASEIIADSAKRSAQYYVNSRWLGGMMTNWKTISNSIQRLRKLDEILNGEAQGFTKKERLNLEREREKLDKALGGIRDMGGTPDLMFIIDTNKEKIAIDEAKRLGIPVVAIIDSNCDPDLIDYPIPGNDDASRAIALYCELISRAAIDGIARQQGSSGRDLGASSEVPVEPALEEAAEG >NZ_CP054031|848529:939654|860676_860934_+|WP_138329757.1|DBSCAN-SWA MNKDSHAGKGYGRSEEEPPDHPDVFEEVRETDHDVVRYQIPGASSEAPDDDESPEGRNWAILLTLFSVILVQITGIALIVWAVFW >NZ_CP054031|848529:939654|936918_937662_+|WP_003539291.1|DBSCAN-SWA MSESVFVTVPEHVAIIMDGNGRWAKQRGLPRTMGHRKGVEAVRETVRAAGAAGIKYLTLFAFSSENWRRPEAEVSDLLGLLKAFIRRDLAELHRQNVRIKVIGDRHSLRSDILGLLLEAEETTKDNTSLTLVIAFNYGSRDEIARAVVSLARDVEAGRLRAQDITPALINARLDTAGIPDPDLIIRTSGEERLSNFLLWQAAYSEFIFLPEYWPDFSPEIFRSALETFASRDRRFGGLSSQAAAVGT >NZ_CP054031|848529:939654|898597_899185_-|WP_138329682.1|DBSCAN-SWA MDTETLKIDCPANVWTKVADEHQSIYIQNQERKALRFFFGWEQPATDADAYIVSQDYNPSEKLFSFGNAKTGLWVMPDGVSGASVVVVRTRSVELVKDPTLVRTVNAPTYALNNDDWGYILDFTQGTVITVPAELNARFNCGLRQGGENQIQVNAGAGAVVEEIDDHFKSEQRLALLTLCRFPDGKFQIIGRTAA >NZ_CP054031|848529:939654|929919_931488_+|WP_138330065.1|DBSCAN-SWA MRALLSGLALLALLPLAGCNPFGQGPDFSIVSGSENTVLQPIVEEFCKQKNATCTFKYEGTLDIGLALQSDQGVAQDAVWPASSVWVDMFDTKRRVKSLTSIAQTPVVLGVRRSKAQQLGWIGKDVFMKDILAAVESGSLKFLMTSATQSNSGASAYLAMLSSALGNKPVIEPGDLDDRHVQESVRSLLSGVVRSSGSSGWLADLYVESAGKGTVYDAMWNYEAVLKETNDKLAALSQEPLYAIYPADGVAMADSPIGFVDHGRGPEVQTFFNDLLAYLSSAPVQQRIADTGRRIPLTGVAAKPESSWNFDPARLVTAIRMPEAGVIRQALNLYQAALRKPSLTALCLDFSGSMQGEGEDQLQKAMRFLLTPDEASKVLVQWSPADQIIVIPFDGSVRNTFMASGNPLEQEGLLNEISRQKANGGTNMYACAERALQQIARTDKLSTYLPAIVIMTDGKSDDQSQAFTSEWNAMEPHVPIFGITFGDADKTQLDSLAKLTSARVFDGGSDLATAFRTARGYN |
98 | Planktothrix_phage(14.29%) | capsid,head,protease,tRNA,lysis,integrase,terminase,tail,portal | attL 896961:896976|attR 912846:912861 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1100595 : 1110849
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP054031|1100595:1110849|DBSCAN-SWA GTCATTCCGCCGCAACCGCCTTCTTCACCGGCCGCCGTTCCAGCAATTCCTTCAGGAAGTGTCCGGTATAGGACCGCTTCTCCTTGACGATCGCTTCCGGCGTGCCGGTCGCGACGATCTCGCCGCCGCCATCGCCGCCTTCAGGACCGAAATCAAGCACCCAGTCGGCGGTCTTGATGACTTCGAGGTTGTGCTCGATCACCACCACCGAATTGCCCTGGTTGACCAGTTCGTGCAGCATTTCGAGCAGCTTGGCGACGTCGTGAAAATGCAGGCCGGTCGTGGGTTCATCGAGAATGTAGAGCGTGCGCCCCGTCGAGCGTTTCGACAGCTCCTTGGCAAGCTTGACGCGCTGCGCCTCGCCGCCCGAGAGCGTATTCGCCTGCTGGCCGACCTTGATATAGCCGAGACCGACATCCTTCAGCGCTTGCAGCTTGTCGCGCACGGCAGGCACCGCCGCGAAGAAATCGACGCCTTCCTCCACCGTCATGTCGAGCACGTCGGCGATCGACCTCTGTTTGAAGGTGACGTTGAGTGTCTCGCGATTGTAGCGCTTGCCGTGGCAGACGTCGCAGGTGACGTAGACATCGGGCAGGAAGTGCATCTCGATCTTGATGACGCCGTCGCCCTGGCAGGCCTCGCAGCGCCCGCCCTTGACGTTGAAGGAGAAACGGCCCGGCTGGTAGCCGCGGGCCTTGGCCTCCGGCAGGCCGGCGAACCAGTCGCGGATCGGCGTGAAGGCGCCGGTATAGGTTGCCGGGTTCGAGCGCGGCGTACGGCCGATTGGCGATTGGTCGATATCGATCACCTTGTCGATATGCTCGAAGCCGTCGATGCGGTCGTGATCGGCCGGGTTTTCACGCGCGCCCATGACGCGGCGCGCGGCCGATTTATAGAGCGTCTCGATCAGGAAGGTGGATTTGCCGCCGCCGGAAACGCCGGTCACCGCCGTGAACACCCCGAGCGGGATCGACGCAGTGACGTTCTTCAGATTGTTGCCGCGCGCGCCGACCACCTTGATCTCGCGGCCCTTCTTCGACTTGCGGCGCTCGTCGGGAACGGGAACGCCGAGTTCGCCGGAGAGATATTTGCCGGTCAGCGACTTCGGATTGTCCATGATATCCTGCGGCGTACCATGGGCGATCACCTGGCCGCCGTGAATGCCGGCGGCCGGGCCGATATCGACCACATCGTCGGCCGACAGGATCGCATCCTCGTCATGTTCGACGACGATGACGGTGTTGCCGATGTCGCGCAGGTGCTTCAGCGTGTCGAGCAGCCGGGCATTGTCGCGCTGATGCAGGCCGATCGACGGTTCGTCGAGCACATAAAGCACGCCCGTCAGCCCCGAACCGATCTGCGAGGCGAGCCGGATGCGCTGGCTCTCGCCGCCCGAAAGCGTGCCGGAATTGCGCGAAAGGCTGAGATATTCCAGGCCGACATCGTTGAGGAAACGCAGCCGGTCGCGGATTTCCTTGAGGATGCGCACGGCGATCTCGTTCTGCTTGGCATTGAAGCTTGCCGGCAGCGTCTCGAACCAGTCGCGGGCGACGCGGATCGACATACCGGTCACTTCGCCGATATGCAGCGTGTCGATCTTCACCGCTAAGGCCTCGGGCTTCAGGCGGAAGCCGTTGCAGGAAGGGCAGGGGGCAGCCGACATGAAGCGCTCGATGTCTTCGCGCGCCCAGGCGGAATCGGTCTCCTTCCACCGCCGCTCGAGATTGGTGATGATACCCTCGAAATTCTTCTGCGTCGTGTAGGAGCGGGCGCCGTCGGCGTAGTGGAATTCGATCTTGTCGTTGGTGCCGTTGAGGATGACGTCCTTGGCCTCGTCGGAGAGATCGTTCCAGCGGGTGCCGAGCTTGAAGCCGTAATGTTTGCCGAGCGCTTCCAGCGTCTGGTTGTAATAAGGCGAGGTCGACTTGGCCCAGGGCGCGATCGCCCCGTCGCGCAGCGTCCGCTCGGGCTCGGGCACGATCAGATCCGGATCGATCTTCTGCTGAGCGCCGAGGCCGTCGCAGGTCGGGCAGGCGCCGAACGGGTTGTTGAAGGAGAACAGCCTGGGTTCGATCTCCGGAATGGTGAAGCCGGAGACCGGGCAGGCGAACTTCTCCGAAAACAGCACACGTTCATGCGTCTCGTTCAGCGACTTGTTGGCCGAGCCGCCGGCCGACGTCTCTTCCGGCGGCAGCGGCTTGTCGGCGAATTCGGCGACCGCTAGCCCGTCGGCGAGCTTCAGGCAGGTCTCCAGACTGTCGGCCAGGCGCGCCGAGACATCCGAGCGCACGACGATGCGGTCGACCACCACGTCGATGTCGTGCTTGTATTTCTTGTCGAGCACCGGCGCCTCGGCAATCTCGTAGAACTGCCCGTCGACCTTGACGCGCTGGAAGCCCTTCTTCATCAGCTCCGCCAGTTCTTTCTTGTATTCGCCCTTGCGCCCGCGCACGAGCGGCGCAAGGATATAAAGACGGGTGCCCTCGCCGAAATCGAGGATGCGGTCGACCATCTGGCTGACGGTCTGGCTCTCAATCGGCAGGCCGGTGGCCGGCGAATAGGGAACGCCGACGCGGGCAAAGAGCAGGCGCATATAGTCGTAGATCTCGGTGACGGTGCCGACCGTCGAGCGCGGATTGCGCGAGGTGGTCTTCTGCTCGATCGAGATCGCCGGCGACAGCCCGTCGATCTGGTCGACATCTGGCTTCTGCATCATTTCGAGGAATTGCCGGGCATAGGCCGACAGGCTCTCGACATAGCGCCGCTGGCCCTCGGCATAGATCGTGTCGAAGGCGAGCGAGGATTTGCCGGAACCCGAAAGCCCGGTCATGACGATCAGCTTGTTACGCGGCAGATCGAGATCGATGCTCTTGAGATTATGCTCGCGCGCGCCGCGGATGGAGATCGTCTTCAGTTCGCTCATCGTCTCTGCTTTGGTTTTCTGGATGACAGCCCTTATTTAGTGATGCCGACGGGCGAGTCGAGGCTGAATGTCCACTACGGTCAAGGTTTTCGATTCTGTTGACAGGATCATACGGATAAACTAGAACAAATAAAGAACAAAATTCCTGATGAATGCATGCGGCTTTCTGATGGATAAACATGTGGATTAAATGCCGCCCATGCGCATCGGCGGTCCGTGATGGTTTAAGGTGCGCCGATCGATTAAACGCCGCCGGGCAGGGCGGCAGGCAGGGTGAGAAGTATGGCTGGTAGTGTGAATAAGGTAATTCTGATTGGAAACGTCGGTGCGGACCCCGAAATCCGTCGCACTCAGGATGGCCGGCCGATCGCCAATCTTCGTATCGCGACCTCGGAGACCTGGCGCGACCGCAATTCCGGCGAGCGCCGTGAGAAGACCGAATGGCACACCGTCGTCGTCTTCAACGAGGGCCTCTGCAAGGTCGTCGAACAATATGTGAAGAAGGGCGCCAAGCTCTATATCGAAGGCCAGCTGCAGACCCGCAAGTGGCAGGACCAGCAGGGCCAGGATCGCTACAGCACGGAAGTGGTGCTGCAGGGTTTCGGTTCGACGCTGACGATGCTCGACGGCCGTGGTGAAGGCGGCGGCGCGAGCGGCGGCCGTGGCAGTGCCGGTGGCGGCAACGATTATGGCGACGATTACGGCGCCCCGGCTCCCGCTTCATCGCCGAGCCGCGGCGGTGGCAGCGGCGGCGGCAACTTCTCGCGGGATCTCGACGACGACATCCCGTTCTGATCGGTAGCCTTTTCTGGCTTGGTGAGAGACGTGAGAGAGCATGATGCCTTGGATGGCGTCATGCTCTTTTTCTTTGATCTCGCAGCCGATCATTCAGATCGCATCGGCCTCAATCATCTGCCGCCGTTTTCAGTGCGGTGTCGCGCTGATGAGGTTGAACGCCGACTTCGCCCCGTCGACGACGAACTGTGCGGCGAGCGCTGCGAGGATGACGCCGAGGAGCCGCGTCAGGATGGCCCGGCCGGTGACGCCGAGGAAACGGTCCAGCCTTTCGGCGATGAGCAGCATGAGGAAGGTCAGCAGCAGATTGGCGGCAATGACGCCGATCAGCTGCGCTTTCCCGATCGAGGTGGCCAGCGTGCCGGCCAGCAGGATCGTCGCCGAGATCGCACCCGGGCCGGCAATCAGCGGCAAGGCGAGGGGGAAGACGGCGATATTCTCGATGTGATCCTTGGTGATCGCCGCCTCGGTCGCCTTCTCCTTGCGCTCCTGCCGTCTCTCGAACACCATCTCGAAGGCGATCCAGAACAGCAGCAGCCCGCCGGCGATGCGGAAGGCGCCGATCGAAATCCCGAGCACGCCGAGCACGCCGGCGCCAAACAGCGCGAAGACGGCGAGGATGATGAAAGCGATAACCGAGCCGCGTAGCGCCACCTGCTTGCGCTCGGTTCGGCTCATGCCTGTCGTCAACCCCAGAAAGATCGGCGCAAGCCCCGGTGGATCGATGGTGACGAGCAGCGTCGTGAAGGCGTTGATCAATTGGTCGGCGCTTGCCATCGGCTCTCCGGAACCCCATATGGGTGCGTGGTTTTTATGATTGTGAAGCGGGGATTTCGATTTGGCAAATATCCCGCGGCGGCTCGCGGCGGATTTGTCCGGTCCGGCCGCGCAACTTGCATGTTTTACCCTCTTTAAGCCGCAAAAGCCTGTTCAAAACGCGCTGTGAAATTGGCGCGGCAACAGGCTTTCCGCTATAAATCTTCAAGTGATTCTAGAAGAGATCGTGATCTGTTTTGACTGAGCAAACACCCCCCGGCGGCGGGAAGCTCCCGCCAGGCATCGAGCCCATCTCCATCATGGAGGAAATGCAGCGGTCGTATCTCGATTACGCCATGAGCGTCATCGTCAGCCGCGCGCTGCCCGACGTGCGCGACGGCCTGAAGCCCGTGCACCGGCGCATCCTCTACGGCATGTCCGAGCTCGGCATCGACTGGAACAAGAAATACGTCAAATGCGCCCGCGTCACCGGTGACGTGATGGGTAAATTCCATCCGCACGGCAATTCCGCGATCTATGATGCGCTCGCCCGCATGGCGCAACCATGGTCGCTGCGGCTGCCGCTGATCGATGGTCAGGGCAATTTCGGCTCGGTCGATGGTGATCCGCCGGCGGCCGAACGTTACACCGAATGCCGCCTCGAAAAGGCTGCCCACTCGTTGCTCGACGATCTCGACAAGGAAACCGTCGATTTCCGCGACAACTACGACGGCACGCTCTCCGAGCCGGTCGTGGTGCCCGCCAAATTCCCGAACCTGCTTGTTAACGGCGCCGGCGGCATTGCCGTCGGCATGGCGACCAACATCCCGCCGCACAATCTCTCCGAAGTCATCGACGGCTGCATCGCGCTGATCGACGACCCGGCCATCGAGCTGCCGGAACTGATCCAGATCATTCCCGGCCCCGATTTCCCGACGGGTGCGAAGATCCTTGGCCGTGCCGGCATCCGCTCTGCCTATGAGACCGGTCGCGGCTCCGTCATCATGCGCGGCGTCGCTGCCATCGAGCCGATGCGCGGTGACCGCGAGCAGATCATCATTACCGAGATTCCCTACCAGGTGAACAAGGCGACGATGATCGAGAAGATGGCCGAGCTGGTGCGCGACAAGCGCATCGAAGGCATCTCCGACCTGCGCGACGAATCCGACCGCCAGGGGTATCGCGTCGTCGTCGAGCTGAAGCGCGATGCCAACGCCGAGGTCATCCTGAACCAGCTTTATCGCTATACGCCGCTGCAGACCTCCTTCGGCTGCAACATGGTGGCGCTGAACGGCGGCAAGCCTGAACAGCTGACCTTGCTCGACATGCTGCGCGCTTTCGTCTCCTTCCGCGAGGAAGTCGTTAGCCGGAGAACGAAATTCCTGCTGCGCAAGGCGCGTGACCGTGCCCATGTGTTGGTCGGTCTCGCCATTGCCGTCGCCAATATCGACGAAGTCATTCGCGTCATCCGCCAGGCGCCCGATCCGCAGTCGGCCCGCGAAGAACTGATGACCCGCCGCTGGCCGGCGGAAGATGTCGAAAGCCTGATCCGTCTGATCGACGATCCGCGCCATCGCATCAATGACGACCTCACCTACAATCTCTCCGAAGAGCAGGCCCGCGCCATCCTCGAACTGCGCCTTGCCCGCCTGACGGCGCTTGGCCGCGACGAAATCGGCGACGAACTCAACAAGATCGGCGAGGAAATCAAGGATTATCTCGATATTCTTTCCTCGCGCGTCCGCATCCAGACCATCGTCAAGGACGAACTTCTCGCCGTCCGCGACGAGTTCGGCACGCCGCGCCGCACCGAGATCATCGATGGCGGCCTCGAAATGGACGATGAGGATCTGATCGCCCGCGAAGACATGGTCGTCACCGTCTCGCATCTCGGTTATATCAAGCGCGTACCGCTCACCACCTACCGCGCCCAGCGGCGCGGTGGCAAAGGTCGCTCCGGCATGACCACCCGCGACGAAGATTTCGTTAGCCGGCTATTCGTCGTCAATACCCATACGCCGGTGCTGTTCTTCTCCTCGCGTGGCATCGTCTACAAGGAGAAGGTCTGGCGCCTGCCGATCGGCACGCCGACCTCGCGCGGCAAGGCGCTGATCAATATGCTGCCGCTCGCACCGGGCGAGCGCATCACCACCATCCTGCCATTGCCCGAGGACGAGACGAGCTGGGACAATCTCGACGTTATGTTCTCGACGACGCGCGGCACGGTTCGCCGCAACAAGCTGTCTGACTTCGTCCAGGTCAACCGCAACGGCAAGATCGCCATGAAGCTCGAGGAGGAGGGCGACGAAATCCTCTCCGTCGAGACCTGCACGGAAGATGACGACGTGCTTTTGACGACGGCGCTCGGCCAGTGCATCCGCTTCTCCGTCGACGATGTCCGCGTCTTTGCCGGCCGCAACTCGATCGGCGTGCGCGGCATCAATCTCGGCGATAGCGACCGCATCATCTCCATGACGATCGTTGGTCATGTCGATGCCGAGCCGTGGGAGCGTGCTGCCTATCTCAAGCGTTCCGCCACCGAACGCCGGGCGACGGGCGTGGATGAAGAGGATATTGCGCTGGTCGGTGAAGAAGTCACCGAGGAGGGACAGCTCTCCGACGAGCGCTATGACGAGTTGAGAGCACGCGAACAATTCGTGCTCACCGTCTCCCAAAAGGGCTTCGGCAAGCGTTCGTCCTCCTATGACTTCCGTATCTCCGGTCGCGGCGGCAAGGGCATCCGCGCCACCGATACGTCGAAAACAGGCGAGATCGGCGAGTTGGTCGCTGCCTTCCCGGTCGATGACGGCGACCAGATCATGCTGGTCTCCGATGGCGGTCAGCTGATCCGCGTGCCGGTCGGCGGCATCCGTATCGCCAGCCGCGCCACCAAGGGCGTCACCATCTTCTCGACGGCCAAGGACGAGAAGGTCGTCTCGGTCGAGCGCATCAGCGAGCCGGAGGAGGATGAGACGGAGGAGGCCGTCGAGGTCGCCGAAGGCGCTCCTGGTGCCGATGAAGGTGCTGCCGACGAAGGTGGCGGTGACGAGGCGCCGGGTGCCGATGGCGATCCGGCCGGGCCTGCCGAGGAATGATCAAATGAATGGGGCGGCTTCGAGCCGCCCTTTTCTTTTCAGATCGAATGATGGTTGAGGACGACACCCAAAGAAAAAGCCGGACGCTGGGTGCATCCGGCTTTGAATGGCCCTGGCGACGGAAGGGTGCCGCTCAAGGGTGGGAAGAGGAGAGAAAGGAACGTCAGAACGCTCTATGTTTTTCGTTCAGAGCCTTCACCTCTTCGCCTTCGCGCCGGAGCGCTGAAATCGTCGAGGCCACAGACACGATGAACATGACGCTGATGAGGGCGGTGAGCATTGTCAAGATCGTGAACATAGGCATTCCTTTCCTGGGGAGTTGCGTTCTCGCCCCTCAGTACTGGCCCAGCACGGCCGTACGGGCCGAAGCATAGGGACCATAAACCTTGGACGTATCGACCTTCTGATCATAGAGAGCGACAGTTGCGGTCAGCATGCCCGTGACCATTACCATCCAGAGCACGTTAATCATCGTAATCTTGTCGGGCATTTTTAATCCTTCCGCGCCGCTCTATGTATCGGTTGAATGAACGCATCCGTCTGTAAATGGTTCCGCCGGATGTTGATCGGGTGTTTACCAATGCCGATCTGAAGCGACCTTGAATGTCTCGTTCATCTGGCATTCAGATTCGAAGGTCTTTTTTCGCGCAGCCGGCCCTGAGCGCGCACGATCGCCGCATGCTCGGCAAGCTCCGGCCGCACCCAGCTCATCTCGTATCGATCGGCGAAATAGCCATAGGCCATCTCGGCGGCGAGGGAGGAAATGACGGCAAGTGCAACGAGCACGATCAGCATGGACCTGGTCTTTCGGGGGAGCGCCGGATGCCGGAATGGCATCGGCTGCTTTGATGATCGGACTTAAGCAGATCCAGTCTGAAACGGTCTTGAACGGGCCATTCATCCGCCGTTTAACCGCCCGGCGCATAACGTCCTTTACGCGTCTGAAAAGACGCGCGGCGGTCTAGCCGGAGCTGGGCAAATCACTTGCGAGAAGGCGTCATCCGTGACAAAAGGCGTCAAACCAACGGTCGGGCCACATGACGACAGCTTTCTATCCGGGGTCCTTCGACCCGATCACCAATGGGCATGTGGATGTTCTGGTCCAGGCGCTCAACGTCGCCGAAAAGGTGATCGTCGCGATCGGCATCCATCCCGGCAAGGCGCCGCTCTTTTCCTTCGAGGAGAGAGCTGAGCTGATCCGCCGTTCTCTGGCTGAAGCGCTGCCTGGCAAGACCGGCGACATCACCGTCGTCTCCTTCGACAATCTGGTCGTCGATGCCGCCCGCATCCATGGCGCCACGCTGTTGATCCGCGGTCTGCGCGACGGAACCGACCTCGATTACGAAATGCAGATGGCCGGCATGAACCGGACGATGGCGCCCGATATCCAGACCATCTTTTTGCCGGCGGGCACGGCCTCGCGGCCCATTACCGCCACATTGGTCCGCCAGATCGCCGCCATGGGCGGCGATGTCAGCGCTTTCGTGCCTGCCGCCGTCTTGCAAGCCCTCACATCCAAGCGCCCAAACTAGCAAGCGCAAAGACTAGGCGGAACCCGCCGCAATGGAGCTCATATGAAACTCTTTTATCTCGCATTTGCCGGCGTGTTGTATCTCGCCTCCTTCGCGGGGGATGCCTTCGCCCAGTCGGCCGATCATTATCTCACCATCGAGCTGAAGAATGGCCCTGTTGTCATCCAGCTCATGCCTGAGGTGGCGCCGAAGCATGTCGCCCAGATCGAGGCGCTGGCCAAGAAGGGCGAATACGACAACGTCGCGTTCCATCGCGTCATCGATGGCTTCATGGCGCAGACCGGCGACGTCAAATACGGCAACATGGAAAAGGGCTTCGATGCGAGCCTCGCCGGCACCGGCTCCTCGGATATGCCTGACATCCCGGCTGAATTCTCCAAGACCCCGTTCGTGCGCGGTACGGTCGGCATGGCCCGCTCGCAGGATCCGAATTCCGCCAATTCGCAGTTCTTCATCATGTTCGCCGAAGGCTCCTTCCTGAACGGCCAGTACACCGTCGTCGGCAAGGTCGTCTCCGGCATGGAGAACGTCGACAAGATCAAGAAGGGCGAAGGCCAGAACGGCGAAGTCAAAAGTCCCGACCGGATGATCAAGGTCACCCTGGGCAAGAAGTAAGTTACGATAGAAACAAGAGGAGAAGGCAAATGGCCGAGATCAAGGATCCCGAAAACACCGTCATTCTGGAAACCACCAAGGGCAAGGTTGTCATTCAGCTTTTGCCGCAGGTCGCCCCGGAACATGTCGCCCGCATCAAGGAACTCGCTCGCGAGAAGGCCTATGACGGTGTCGTCTTCCACCGCGTCATCCAGGACTTTATGGCCCAGACGGGCGATGTCGAATTCGGCAAGAAGGGCTCGGAAACCTTCAATCCCAGCCGCGCCGGCATGGGCGGCTCCTCGAAGCCGGATCTGAAGGCCGAATTCTCGGCGACCACGCATACGCGCGGCACCTGCTCGATGGCCCGTTCGCAGAACCCGAACTCGGCCAATTCGCAGTTCTTCATCTGCTTCACCGACGCGCCCTGGCTGAACAAGCAGTATTCGGTCTGGGGCCAGGTCATCGAAGGCATGGACAACGTCGACAAGATCAAGCGCGGCGAGCCGGTTTCCGATCCGGACTCGATCGTCTCCATGCGGGTTGCCGCCGACGTCTGA
Protein sequences of DBSCAN-SWA_5 >NZ_CP054031|1100595:1110849|1110339_1110849_+|WP_003539661.1|DBSCAN-SWA MAEIKDPENTVILETTKGKVVIQLLPQVAPEHVARIKELAREKAYDGVVFHRVIQDFMAQTGDVEFGKKGSETFNPSRAGMGGSSKPDLKAEFSATTHTRGTCSMARSQNPNSANSQFFICFTDAPWLNKQYSVWGQVIEGMDNVDKIKRGEPVSDPDSIVSMRVAADV >NZ_CP054031|1100595:1110849|1105325_1108163_+|WP_138329482.1|DBSCAN-SWA MTEQTPPGGGKLPPGIEPISIMEEMQRSYLDYAMSVIVSRALPDVRDGLKPVHRRILYGMSELGIDWNKKYVKCARVTGDVMGKFHPHGNSAIYDALARMAQPWSLRLPLIDGQGNFGSVDGDPPAAERYTECRLEKAAHSLLDDLDKETVDFRDNYDGTLSEPVVVPAKFPNLLVNGAGGIAVGMATNIPPHNLSEVIDGCIALIDDPAIELPELIQIIPGPDFPTGAKILGRAGIRSAYETGRGSVIMRGVAAIEPMRGDREQIIITEIPYQVNKATMIEKMAELVRDKRIEGISDLRDESDRQGYRVVVELKRDANAEVILNQLYRYTPLQTSFGCNMVALNGGKPEQLTLLDMLRAFVSFREEVVSRRTKFLLRKARDRAHVLVGLAIAVANIDEVIRVIRQAPDPQSAREELMTRRWPAEDVESLIRLIDDPRHRINDDLTYNLSEEQARAILELRLARLTALGRDEIGDELNKIGEEIKDYLDILSSRVRIQTIVKDELLAVRDEFGTPRRTEIIDGGLEMDDEDLIAREDMVVTVSHLGYIKRVPLTTYRAQRRGGKGRSGMTTRDEDFVSRLFVVNTHTPVLFFSSRGIVYKEKVWRLPIGTPTSRGKALINMLPLAPGERITTILPLPEDETSWDNLDVMFSTTRGTVRRNKLSDFVQVNRNGKIAMKLEEEGDEILSVETCTEDDDVLLTTALGQCIRFSVDDVRVFAGRNSIGVRGINLGDSDRIISMTIVGHVDAEPWERAAYLKRSATERRATGVDEEDIALVGEEVTEEGQLSDERYDELRAREQFVLTVSQKGFGKRSSSYDFRISGRGGKGIRATDTSKTGEIGELVAAFPVDDGDQIMLVSDGGQLIRVPVGGIRIASRATKGVTIFSTAKDEKVVSVERISEPEEDETEEAVEVAEGAPGADEGAADEGGGDEAPGADGDPAGPAEE >NZ_CP054031|1100595:1110849|1100595_1103517_-|WP_138329484.1|DBSCAN-SWA MSELKTISIRGAREHNLKSIDLDLPRNKLIVMTGLSGSGKSSLAFDTIYAEGQRRYVESLSAYARQFLEMMQKPDVDQIDGLSPAISIEQKTTSRNPRSTVGTVTEIYDYMRLLFARVGVPYSPATGLPIESQTVSQMVDRILDFGEGTRLYILAPLVRGRKGEYKKELAELMKKGFQRVKVDGQFYEIAEAPVLDKKYKHDIDVVVDRIVVRSDVSARLADSLETCLKLADGLAVAEFADKPLPPEETSAGGSANKSLNETHERVLFSEKFACPVSGFTIPEIEPRLFSFNNPFGACPTCDGLGAQQKIDPDLIVPEPERTLRDGAIAPWAKSTSPYYNQTLEALGKHYGFKLGTRWNDLSDEAKDVILNGTNDKIEFHYADGARSYTTQKNFEGIITNLERRWKETDSAWAREDIERFMSAAPCPSCNGFRLKPEALAVKIDTLHIGEVTGMSIRVARDWFETLPASFNAKQNEIAVRILKEIRDRLRFLNDVGLEYLSLSRNSGTLSGGESQRIRLASQIGSGLTGVLYVLDEPSIGLHQRDNARLLDTLKHLRDIGNTVIVVEHDEDAILSADDVVDIGPAAGIHGGQVIAHGTPQDIMDNPKSLTGKYLSGELGVPVPDERRKSKKGREIKVVGARGNNLKNVTASIPLGVFTAVTGVSGGGKSTFLIETLYKSAARRVMGARENPADHDRIDGFEHIDKVIDIDQSPIGRTPRSNPATYTGAFTPIRDWFAGLPEAKARGYQPGRFSFNVKGGRCEACQGDGVIKIEMHFLPDVYVTCDVCHGKRYNRETLNVTFKQRSIADVLDMTVEEGVDFFAAVPAVRDKLQALKDVGLGYIKVGQQANTLSGGEAQRVKLAKELSKRSTGRTLYILDEPTTGLHFHDVAKLLEMLHELVNQGNSVVVIEHNLEVIKTADWVLDFGPEGGDGGGEIVATGTPEAIVKEKRSYTGHFLKELLERRPVKKAVAAE >NZ_CP054031|1100595:1110849|1104441_1105089_-|WP_029875185.1|DBSCAN-SWA MASADQLINAFTTLLVTIDPPGLAPIFLGLTTGMSRTERKQVALRGSVIAFIILAVFALFGAGVLGVLGISIGAFRIAGGLLLFWIAFEMVFERRQERKEKATEAAITKDHIENIAVFPLALPLIAGPGAISATILLAGTLATSIGKAQLIGVIAANLLLTFLMLLIAERLDRFLGVTGRAILTRLLGVILAALAAQFVVDGAKSAFNLISATPH >NZ_CP054031|1100595:1110849|1103799_1104312_+|WP_138329483.1|DBSCAN-SWA MAGSVNKVILIGNVGADPEIRRTQDGRPIANLRIATSETWRDRNSGERREKTEWHTVVVFNEGLCKVVEQYVKKGAKLYIEGQLQTRKWQDQQGQDRYSTEVVLQGFGSTLTMLDGRGEGGGASGGRGSAGGGNDYGDDYGAPAPASSPSRGGGSGGGNFSRDLDDDIPF >NZ_CP054031|1100595:1110849|1108497_1108653_-|WP_173862938.1|DBSCAN-SWA MPDKITMINVLWMVMVTGMLTATVALYDQKVDTSKVYGPYASARTAVLGQY >NZ_CP054031|1100595:1110849|1109737_1110310_+|WP_138329480.1|DBSCAN-SWA MKLFYLAFAGVLYLASFAGDAFAQSADHYLTIELKNGPVVIQLMPEVAPKHVAQIEALAKKGEYDNVAFHRVIDGFMAQTGDVKYGNMEKGFDASLAGTGSSDMPDIPAEFSKTPFVRGTVGMARSQDPNSANSQFFIMFAEGSFLNGQYTVVGKVVSGMENVDKIKKGEGQNGEVKSPDRMIKVTLGKK >NZ_CP054031|1100595:1110849|1109200_1109695_+|WP_138329481.1|DBSCAN-SWA MTTAFYPGSFDPITNGHVDVLVQALNVAEKVIVAIGIHPGKAPLFSFEERAELIRRSLAEALPGKTGDITVVSFDNLVVDAARIHGATLLIRGLRDGTDLDYEMQMAGMNRTMAPDIQTIFLPAGTASRPITATLVRQIAAMGGDVSAFVPAAVLQALTSKRPN |
8 | uncultured_Mediterranean_phage(83.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2147644 : 2157605
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP054031|2147644:2157605|DBSCAN-SWA CTTAGCTGTCGAGGAAGCTTCTGAGCTTCCTGGAGCGGCTCGGATGCTTCAGCTTGCGCAGCGCCTTCGCCTCGATCTGGCGGATACGTTCGCGGGTGACCGAGAACTGCTGGCCGACTTCTTCGAGCGTGTGGTCGGTGTTCATACCGATGCCGAAGCGCATGCGCAGAACACGTTCCTCGCGCGGCGTCAGCGAGGCGAGAACACGGGTCGTCGTCTCACGCAGGTTCGCCTGGATGGCGGCGTCGATCGGCAGCAGCGCGTTCTTGTCCTCGATGAAATCGCCGAGATGCGAATCCTCTTCGTCACCAACTGGGGTTTCGAGCGAGATCGGCTCCTTGGCGATCTTCAGGACCTTGCGGACCTTTTCGAGCGGCATGGCGAGCTTTTCGGCCAGTTCTTCCGGCGTCGGCTCGCGGCCGATCTCGTGCAGCATCTGGCGCGATGTGCGCACGATCTTGTTGATCGTCTCGATCATGTGCACCGGAATGCGGATCGTGCGGGCCTGGTCGGCGATCGAACGGGTGATCGCCTGACGGATCCACCATGTCGCATAGGTCGAGAACTTGTAGCCGCGGCGGTATTCGAACTTGTCGACCGCTTTCATCAGGCCGATATTGCCTTCCTGAATGAGGTCGAGGAACTGCAGGCCACGGTTCGTGTACTTCTTGGCGATGGAGATGACCAGGCGAAGGTTTGCTTCGACCATTTCCTTCTTGGCGATACGCGCTTCGCGCTCACCCTTCTGCACCATATGGACGATGCGGCGGAATTCCGAGATCGAGATACCGGTTTCGGTCGCCAGATTCTGGATTTCCTGGCGAATGTCGCGGATCGTCGTGTTTTCGCCCCTGGCGAATTCCTTCCAGCCGCGAGCGGCCAGATTGCCGATCGACTTCATCCAGTTCGGATCGAGTTCGGCGCCCTGATACTGCTCCAGGAAGGAGTCGCGCTTGACGCCGTAGGATTCGGCCAGACGCAGCAGGCGGCCCTCGTTGGAAACGAGGCGCTTGTTGATGTCGTAGAGCTGCTCGACGAGGGCGTCGATGCGGTTCTGGTTCAAGGACAGAGATTTGACGGCCTTGATGAGCTCATCTTTGAGTTCCTTGTAGCGGCGCTCCTGGGCAGTGGACAGCGTGCCGGAAGCCGACAGCCGCTGCTCGACCTGCTGGTCCTGCAGCTTGCGCAGCTTCTTGTAGGTCTCGGCGATCGTATCGAGCGTCTCCATCACCTGCGGGCGAAGCTCGGCTTCCATCGCCGCCAGCGAAAGGTTGGATTCGTCCTCGTCCTCTTCCTCCTCCTCGGGAGGCAGGCCTTCGCCGCCGACATCGGTGATGTCGTCGTCGCCGGAGCGGGCGCGACGGGTCTTTTCCTTCTCTTCTGCGAGCTTGCGATCGGCTTCGATCTTCTCCGGGCTCTGGAACTGCGGCGCAGCCTTGGCTTCAGGACCGGAATAGGTCGTTTCGAGATCGATGATCTCGCGCAGCAGCGTCGTGCCTTCGTTGAGTTCGTCGCGCCAGATGATCAGCGCCTGAAAGGTCAGCGGGCTCTCGCAGAGACCCGCGATCATCGTCTCGCGGCCAGCCTCGATGCGCTTGGCGATCGCGATTTCGCCCTCGCGGGACAGAAGCTCCACCGAGCCCATCTCGCGCAGGTACATGCGAACCGGATCGTCGGTACGGTCGGTCGGTTCTTTTTTCTTGGCGGTCGCAAGTGCCGTGCCGCTCGAAGGGGCGAGTTCGCCGCCTTCGCTCTCCTCGTCGCCGCCGGCGTCGTCGTCATCGCTGCCGCCGGAAGCACCGGCTTCCTCGGCTTCCTCGTCCTCGATGACGTTGATGCCCATGTCGGACAGCATCGACATCGTGTCTTCGATCTGCTCGGACGTCACTTCTTCGGACGGCAGGACCGCGTTAAGCTCGTCCATGGTCACATAGCCGCGCTTCTTGGCGGCCTTGATCATCTTCTTGACCGCGTCGTCGGAAAGATCGAGAAGAGGGCCGTCGCTTGCGCCGTCGCGTTCGACTTCCGCGTCTTCGTTCTCTTTGACCTTGGTTGCCATTTATATCGTCGCCTTCCTGACGCTATCCAATCTCGCTACGGTGAACGGGTTGTCTCAGGGCCTGACGCGCCGCACGCGCCGCCCTCGAATCACCTTTGCCCACCTAACGGGATGACATTTAAAGCCTGATTAACCACGATTACCGGCACCGGACAGCAAAGCTTTCTTGCTCTTGGATTTCCGGCCATGGTTGCGCGATCCGCCTTGACCCAGTTCACCATTCAACAAGTCGAACGGTGATTCCCACATTTTTTGTTCTCGTCAAGCTTTTCCCGAAAAAAAGCCCACAAAAATACGGAAAAAGCCTTATCCACAGGCTTTGAACCGCGGCAGCGATTGCGAACCCTTATTTAAGATGCGGCGAACAAAAGACAAGGGCAGCAAATATTTTGCCCGCCGCACCAGTCGTTTCCACCATTTTTGCAGTCACGCCGAGGAACTATTCGCCACTGCCTGACGGATATCGGCTAACGCGCCGCGCCGTCATACGCCTTCACCGTCAGCGCCCCGATATCGACCGCGGGAATGCAGCGCAGATTGATCGAAGCCATGGCCGCGCCGTTCGGACCCACGGCCTCGCCGAACGGCGCGATGCCGCAATTGGCGCAGAAGTGATGCCGAATGGCATGCCGGTTGAAAGTATAGGTCGAGACATTGTCCTCCGGCGTTGTCAGCACCAGTTTCTCGCGCGGCACGAAGGCGAGAAGTCCGCCGCGCCTGCGGCAAAGCGAACAATTGCAGTCGAGCGCCTCGGTGAATTCGCCTTCGACCTCGAAAGCCACATTGCCGCAATGGCAGCTTCCTTCATAGAGCATGTCGTCTCCTCACATCCAATCCATCACAACCTTGCCGGAATTGCCCGACCGCATCGCCTCGAAGCCGTCGCGGAAATCGTCGATGCCGATCCGGTGGGTGATGATGGGCGCGAGATCGAGGCCGCCCTGAACGAAGGCGATCATCTTGTACCAGGTCTCGAACATCTCGCGGCCGTAGATGCCCTTGAGATTGAGCATCTTGAAGATCACCTTGTTCCAGTCGATTTCGAAGCCCGCAGGCGCGATGCCGAGGATGGCGATCTTGCCGCCATTGTTCATCTTGTCGATCATATCGCGGAAGGCAGGTGCGGCCCCTGACATTTCCAGCCCGACGTCGAAACCCTCCGTCATGCCGATCGCCTTCATCACGTCGGCGAGGTTTTCCTTCGATGCGTCGACGACGTGATCGATGCCGAGCTTGTGCGCCAGCTCCAGCCGGTGCGGATTGATATCGGTGATGACGACCTTGCGGGCGCCGGATCGCTTGGCGACCAGCGCGCCCATGATGCCGATCGGCCCGGCGCCGGTGACAAGCACGTCCTCGCCGACGAGATCGAAGGAGAGCGCGGTGTGCACGGCATTGCCGAACGGATCGAAGATCGCGGCGATCTCGTCGGAAATATCATCCGGGATCGGCACGACATTGCTTTCGGGAATGCAGACAAACTCGCCGAAGGAGCCCGGACGGTTGACGCCGACACCGAGCGTGTTGCGGCAGAGATGGCCCCTGCCCGCCCGGCAGTTGCGGCATTTGCCGCAGACGATATGCCCCTCGCCGGAAACCCGCTCGCCGACATGATAGCGGGTGACCGCCGAGCCGATCTCGGCAATCTCGCCGGAGAATTCATGGCCAACCACCATCGGCACCGGAATGGTCTTCTGCGCCCACTGGTCCCAGTTCCAGATATGGACGTCGGTGCCGCAGATCGCCGATTTCTTCACGCGGATCAGCACATCGTTCGGCCCGACCTCGGGCACCGGCACATTCTCCATCCAGAGCCCGACTTCCGGTTTCGCTTTGACCAGCGCCTTCATCATGTTCGACATCAACGATATCCCGCCCTTGTTCTGGTGCTCGACTTTGCTATCCCCTCTTCTCCCCAGCGGGGAGAAGGTGCCCGTAGGGCGGATGAGGGGGCGGCACGGCACGCCGTCAATCTCAAATCACACCCAATTCCCGCCCTGCCTCGGCAAAGGCCGCAATTGCCCGCTCCACATCCGCCCGCGAATGCGCCGCCGACATCTGCGTGCGGATGCGAGCCTGGCCTTTCGGCACGACGGGGAAGGAAAAGCCGATCACATAGATCCCTTTCTTCAGCATCAGACCGGCCATATCCTGGGCGAGTTTGGCATCGCCCAGCATGACGGGAATGATCGGGTGGCCTTCGCCGGCAAGCGTGAAGCCGAGCTTGGTCATTTCGGTGCGGAAGAGATCGGCATTGTCCGAAAGCCGCTTGCGCAGGGCATCGCCGTTTTCGATGAGGTCGAACACCTTGAGCGAGGCGGCGGTGATGACGGGCGCCAGCGTGTTCGAAAACAGATAGGGCCTCGAGCGCTGCCGCAGCCATTCGACGACCTCCGCCTTGGCGGAGGTATAGCCGCCCGAGGCGCCGCCGAGCGCCTTGCCGAGCGTGCCGGTAATGATGTCGATCCGGCCCTCGACGCCGCAATGTTCCGGCGACCCACGGCCGTTCTTGCCGACGAAGCCGACCGCGTGGCTGTCATCGACCATGACCATCGCGCCGTATTTCTCGGCAAGATCGCAGACGCCGCCGAGATTGGCGATGATGCCGTCCATCGAGAAGACGCCGTCGGTGGCGACCAACTTGAAGCGGCTGCCTTCGGCCTTCTTCAGCTCCTCTTCGAGCGCTGCCATATCGTTGTTGGCGTAGCGGAAGCGCTTGGCCTTCGAAAGCCTGACGCCGTCGATGATCGAGGCGTGGTTCAGCGCGTCCGAGATGATCGCGTCCTCTTCCGACAGCAGCGTCTCGAACAGGCCGCCATTGGCGTCGAAGCAGGAGGAATAGAGGATCGTGTCTTCCATGCCGAGGAAGGAGGAGATGCGCGCTTCGAGCTGCTTGTGCTCTTCCTGCGTGCCGCAGATGAAGCGCACCGAGGCCATGCCGTAGCCGTATCGGTCGAGCGCCTGCTTGCCGGCCTCGGCCAGCTCCTCGTTGTCGGCAAGGCCGAGATAGTTGTTGGCGCAGAAATTCAGCACCCGCTCGCCGGTGGAAATCGCGATCTCGCCCGCCTGCTTGGAGCTGATGACGCGCTCGGATTTGTAGAGGCCGGCATCCTTCAGCGCTGAGATCTCGTTGCTGAGATGGGAGAGAAATTGCGAGGTCATGGACGGCCTTTCCCAGAGATTTCCGTTTGCCCGCCGGGTTAGCATATTGCCGCCGGCTTGTCTTCTCAGCGCTGTACCGAGATCAGCAGGAACATCGGCCGGTCCAGCTCCTCCGCCCAATCGGGATTGTCGCGCAATTCCTGATCGTTCGGGCTCCACTCCTCGACATAGTGAAGGGTAAAGCCCGCCGCAATCAAGGTGTTGAGCGTCGTGCCGAGCTTGCGGTGCTGCTTGACCACGCCCTTGGTGAGCCAGTCGGTGGTGCGCGGACCTTCGACGGAGTAGCGGTCGAGCGGCCAGATGCGCCGCCCCTCGGCATCGGCTGCCCAGGCGGGATTGGTCGGCGCCATGAAGACCGGATGCTCGATGGTGAAGACGAAATGCGATCCCGGCAAAAGCGCGCGATAGACCGTCGCGGCGAGACCGGCGAAATCCTCGATATAATGCAGCGCCAGCGAGCTGTAGGCGAAATCGAAGGAGGCTTGAGCAAGCGTGAGATGCTCGAGATCGGCGATCTCGTAAGTGATCGCTGCTTCAACTGTATCGGCCCTTGCTCTGGCGATCATCTTTTCCGAGATGTCGAGCGCCAGCACGCTTGCAGCACCCTGGCTCACGGCAAAACGCGAAAACCAGCCGAAACCGCAGCCGAGATCGACAACACGCTTGCCGGCAAGATCGGGCAGCAACGCGCGCACCGCCGGCCACTCCGATGCTCCGTCGAGCCCGTGCAAGGATCGTCTCATGCCGCTATATCCGGCGAAGAATTCCGGTCGGTCGTAGATATTCTGGGCCATCGGCGCTCCTCCCTTAAGAGCGGGAATACCATGCCCCCTCGCCCGATCTCAACGATCAGGCCGCGCAGCTCAGCCGTTGTCCTGCGCCAGGATAATATCCAGAAAGGCATCGCCGTAGCGTTCGAGCTTCGCCTGCCCGACACCTGATATGGCGAGCAATTCCTTGCGGCTGCGAGGCCGCTCCGTGGCGAAGGCGATCAGCGTCGTATCGGGAAAGACGACATAGGGCGGCACGCCGAGCGATTTCGCGATCGCCATGCGCTCGGCCCGCAGCGCCTCGAAGAGACTGCCGTCTGCGCCTGATAGTCCCGATTTGCGCTCGCTGCGCTCCGCCTTCTTCGTACGCCGTTCCGAGGCCGGCCGGTCCTTGCGGAAGAACACCTGCCGCTCGTGCTTGAAGACGGAGCGCGCCTCCGGCTCGAGCTTCAGCGCCCCGAAGGCCTCGTGGTCGACGCGGATCAGCCCCATGGCGAGCAGTTGGCGGAAGACCGACTGCCAGACGCGCGCTGGAATATCCTTGCCGGCCCCGAAGACCGGCATTTCGGCATGGCCGAAGCGTTCGGTCTTTTCGTTGACGTTGCCCAGGAGCACGTCGACGACATGGCCAGCGCCGAAACGCTCGCCGGTGCGGTAGACGGCGGCAAGCGCCTTGATCGCCGCCTCCGTGCCCTCCCAGGTCTCGACCGGCTTCAGGCAGGTGTCGCAATTGCCGCATTGTCCGGCATGCGCCTCGCCGAAATGGGCAAGGATTGCCTGGCGGCGGCAGGAGGCGGTCTCGCAGATCGCCAGCAAAGCGTTGAGCTTGGCGCGTTCGACCCGTTTGATCTCGGCGGCAGCATTGCCCTCGTCGATCATCCGGCCGCGCTGGATGACGTCAGCCATGCCATAGGCCATCCAGACCTCGGACGGCAGACCGTCGCGCCCGGCGCGCCCGGTCTCCTGATAGTAGGCTTCCACAGAACCCGGCAGGTCGAGATGGGCGACATACCGCACGTTCGGTTTGTCGATGCCCATGCCGAAGGCGACGGTGGCGACCAGGCAAAGATTCTCTTCTTTCAGGAAGGCATCCTGATTGGCCTCACGAACCGCCCTGTCCATGCCGGCATGGTAGGCACGCGCGCGAATACCCTGCCCGTTCAGCCATTCGGCCGTATCTTCGACCTTGGCGCGAGACAGGCAATAGACGATGCCGCTGTCGCCCTTGTGGCCGGACAGGAACCGCAATAGCTGCTGGCGCGGCTGGTCGCGCTCGACGATCTCATAGGCAATATTCGGTCGGTCGAAACTGGTGGTAAAGATCTCGGCAGTATCGAGCCCCAGCCGCTCGATCATGTCGTCGCGCGTATGCGGATCGGCCGTCGCCGTCAGTGCCACTCTGGGCACACCGGGATATTGCTCGCCGAGCCGCCCGAGTTCGCGATATTCCGGCCGGAAATCATGGCCCCATTGCGAGACGCAATGGGCCTCGTCAATCGCAAACAGCGCAATCTTCTCGTTGGCGATCAGCTCGCGGAAACCATCGGTCAGAATACGCTCGGGCGTCACGTAGAGAAGATCGAGCTGGCCCGCCGAAAGCGCGCGGCGAACCTCGACGAATTCCTCGCGCGACAGAGACGAGTTGAGTGCTGCGGCGCGGATGCCGAGCTGCTTCATCGCCTCCACCTGATCGCGCATCAGCGCAATCAGCGGCGAAACGACGATACCGACACCGTCGCGGCAGAGCGCCGGGATCTGGAAGCACAGCGACTTGCCGGCGCCTGTGGGAAAGAGCACGACCGCATCGCCGCCGGAAACCACGTGCTCGACCACCTGCTGCTGCTTGCCGCGGAAGGAGGAATAGCCGTAGACCTGCTTGAGGATATCGAGCGGCGAGCGGCCGGAAGAAAATAGCGCGCCGGAAGCGGAAAAGCCCGGTTCGTTTCGTTCGGTCAGCGACATCAGTGGTTCGCCGCCCCTTTGACGCGGCCGGAAAGCACGCCGAAGCCATCGATGATGGCCTCCTGGTTCTCAAGGCGCACCCCTTCGAAATGCACTTCCTGCTGCGCTCGCATCAGCTGAACGATCGCCTCGCCGTCGCCGGCTTCGGTTGCCAGCGCGATCTCGCGCTCGAGCTCGATCTTCTGCCGGCGCAGCGCCTTCGCCCGCTTGTGGGAGGCCAGCGCCTGGCGGTATCCCTCACGCGCATCCTCCATCGCCGCCTGCTCGGTCGCGATCCACAGCCGGGCATTGCGCACCTGCTGGTCGAGGCTCTTGATGAGCGGGCCGAAGCCTTCGAATTCCAGCCTTTCGGTCAGATATTCGCGCGTCAGATGCGGGGCGGCGACGGAAGCCGCTGCTCCCAGCATCGCCGACCAGAGCCGCTGCAGCTCGCGGCTGTCATATTCGATCGCGGCAATCTCGTCATAATCGTCGATCATCAGCGAGGGGTGATTGACGACGGTCAGCGCCAGCACGCATTCGCGCAGCGCCGTATTGTTCTGGTGGCCGCGCACGGGACCCGAGCGGGCGAGCCTTTCTGAAATGACGCTGGGGCTTTTCGGCCCGGCCTTGCCGGCATTGTCGCGGCCCGTCCGGTAATTGCCATTGCCGTTGAAGCCGCCGCGGCGATCGCCATTGTTGCGGTTCTGGAACTGCGGCTGGAAGAAGGCGTTCAGCCGATCGCGAATATCCTGCTGGTAGTGACGGCGGACGTTTTCGTCGGCGATGACGGCGACCAGTTGCTTCAGCCGCGCCTCGAGCTCGGCGCGCGCCTCAGGCGTGTCGAACTTGCCGGTGTTGACCTCCCGGCTCCAAAGCATCTCGGAGAGCGGCTTTGCCTGGCTCATCACCTTGTCGAAAGGCGCGCGCCCGTCATCGCGCACGAGATCGTCGGGATCCTTGCCGTCGGGCAGAAGCGCGAAGCGAACGGAACGGCCGGGCTTCAGATGCGGCAGCGCCAGTTCGGCGGCGCGATTGGCCGCGCGGATGCCGGCACCGTCGCCGTCGAAGCAGAGCACCGGCTGCGGCACCATTTTCCAGAGCAGCTCGAGCTGGTTTTCGGTGAGCGCCGTGCCGAGCGGCGCAACGGCATTCTCGATGCCCGCCTGATGCAGCGCGATCACGTCCATATAGCCTTCGACGGCAATAATGGTGCCGGTCGCATTGTCGTCCTGAGAATCACCGCGGCCAGGTCCCTGGATCGCCCGCCGCGCACGGGCAAAATTGTAAAGAACATTGCCCTTGTGGAAGAGTTCAGTCTCGTTGGAGTTCAGATATTTCGCCGGCGCGTCGGCCGACATGGCGCGTCCGCCGAAAGCGATCACCTTTTCCCGCGACGACAGGATCGGGAACATGATGCGGTCGCGGAAACGATCGTAGGAAACCGGCACGTTTTCATGGACGACCAGACCGCAGGCCTCGATCTGCTCCTTGAGTACGCCCTTGCCAGCGAGGAATTCCTTGAGCGCATTGCGGCTGTCGGGCGCATAGCCGAGACGGAAGGTCTCGATGGTGCGCCCCGTCAATCCGCGGTCGCGCAGATAGGCGCGCGCCCTCGCCCCATTCGCCGTCTGCAGCTGATCCTGAAAAAATTGTGTCGCCATTTCCATGACGTCGATCAGCGAGCCGCGCTCCTTCTCGCGCTTCTCCATCACAGGGTCGGCAAGCGGCATCGGCACGCCGGCCATATCGGCGATCTGCTGTACCGCCTCGGGAAAACTCAGACCTTCAAGTTCTGTCAGGAAACGGAAATGGTCTCCCGTCACCCCACAGCCGAAGCAGTGATAGCGGCCCTTTCGGTCCTCGCAATGGAAGCTCGGCGATTTCTCGCCGTGGAACGGGCAGCAGGCCCAGTAGTCGCCGCGGGACACGTTGGTCTTGCGCTTGTCCCAGCTCACGCGGCGCGCAATCACGTTCGAAATCGGAACGCGGTCGCGAATATCATCGAGAAAGGTGTTGGAAAAGCGCAT
Protein sequences of DBSCAN-SWA_6 >NZ_CP054031|2147644:2157605|2150526_2151564_-|WP_138330444.1|DBSCAN-SWA MSNMMKALVKAKPEVGLWMENVPVPEVGPNDVLIRVKKSAICGTDVHIWNWDQWAQKTIPVPMVVGHEFSGEIAEIGSAVTRYHVGERVSGEGHIVCGKCRNCRAGRGHLCRNTLGVGVNRPGSFGEFVCIPESNVVPIPDDISDEIAAIFDPFGNAVHTALSFDLVGEDVLVTGAGPIGIMGALVAKRSGARKVVITDINPHRLELAHKLGIDHVVDASKENLADVMKAIGMTEGFDVGLEMSGAAPAFRDMIDKMNNGGKIAILGIAPAGFEIDWNKVIFKMLNLKGIYGREMFETWYKMIAFVQGGLDLAPIITHRIGIDDFRDGFEAMRSGNSGKVVMDWM >NZ_CP054031|2147644:2157605|2151676_2152864_-|WP_138330443.1|DBSCAN-SWA MTSQFLSHLSNEISALKDAGLYKSERVISSKQAGEIAISTGERVLNFCANNYLGLADNEELAEAGKQALDRYGYGMASVRFICGTQEEHKQLEARISSFLGMEDTILYSSCFDANGGLFETLLSEEDAIISDALNHASIIDGVRLSKAKRFRYANNDMAALEEELKKAEGSRFKLVATDGVFSMDGIIANLGGVCDLAEKYGAMVMVDDSHAVGFVGKNGRGSPEHCGVEGRIDIITGTLGKALGGASGGYTSAKAEVVEWLRQRSRPYLFSNTLAPVITAASLKVFDLIENGDALRKRLSDNADLFRTEMTKLGFTLAGEGHPIIPVMLGDAKLAQDMAGLMLKKGIYVIGFSFPVVPKGQARIRTQMSAAHSRADVERAIAAFAEAGRELGVI >NZ_CP054031|2147644:2157605|2153727_2155596_-|WP_138330441.1|DBSCAN-SWA MSLTERNEPGFSASGALFSSGRSPLDILKQVYGYSSFRGKQQQVVEHVVSGGDAVVLFPTGAGKSLCFQIPALCRDGVGIVVSPLIALMRDQVEAMKQLGIRAAALNSSLSREEFVEVRRALSAGQLDLLYVTPERILTDGFRELIANEKIALFAIDEAHCVSQWGHDFRPEYRELGRLGEQYPGVPRVALTATADPHTRDDMIERLGLDTAEIFTTSFDRPNIAYEIVERDQPRQQLLRFLSGHKGDSGIVYCLSRAKVEDTAEWLNGQGIRARAYHAGMDRAVREANQDAFLKEENLCLVATVAFGMGIDKPNVRYVAHLDLPGSVEAYYQETGRAGRDGLPSEVWMAYGMADVIQRGRMIDEGNAAAEIKRVERAKLNALLAICETASCRRQAILAHFGEAHAGQCGNCDTCLKPVETWEGTEAAIKALAAVYRTGERFGAGHVVDVLLGNVNEKTERFGHAEMPVFGAGKDIPARVWQSVFRQLLAMGLIRVDHEAFGALKLEPEARSVFKHERQVFFRKDRPASERRTKKAERSERKSGLSGADGSLFEALRAERMAIAKSLGVPPYVVFPDTTLIAFATERPRSRKELLAISGVGQAKLERYGDAFLDIILAQDNG >NZ_CP054031|2147644:2157605|2147644_2149702_-|WP_018074040.1|DBSCAN-SWA MATKVKENEDAEVERDGASDGPLLDLSDDAVKKMIKAAKKRGYVTMDELNAVLPSEEVTSEQIEDTMSMLSDMGINVIEDEEAEEAGASGGSDDDDAGGDEESEGGELAPSSGTALATAKKKEPTDRTDDPVRMYLREMGSVELLSREGEIAIAKRIEAGRETMIAGLCESPLTFQALIIWRDELNEGTTLLREIIDLETTYSGPEAKAAPQFQSPEKIEADRKLAEEKEKTRRARSGDDDITDVGGEGLPPEEEEEDEDESNLSLAAMEAELRPQVMETLDTIAETYKKLRKLQDQQVEQRLSASGTLSTAQERRYKELKDELIKAVKSLSLNQNRIDALVEQLYDINKRLVSNEGRLLRLAESYGVKRDSFLEQYQGAELDPNWMKSIGNLAARGWKEFARGENTTIRDIRQEIQNLATETGISISEFRRIVHMVQKGEREARIAKKEMVEANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKIVRTSRQMLHEIGREPTPEELAEKLAMPLEKVRKVLKIAKEPISLETPVGDEEDSHLGDFIEDKNALLPIDAAIQANLRETTTRVLASLTPREERVLRMRFGIGMNTDHTLEEVGQQFSVTRERIRQIEAKALRKLKHPSRSRKLRSFLDS >NZ_CP054031|2147644:2157605|2155595_2157605_-|WP_138330440.1|DBSCAN-SWA MRFSNTFLDDIRDRVPISNVIARRVSWDKRKTNVSRGDYWACCPFHGEKSPSFHCEDRKGRYHCFGCGVTGDHFRFLTELEGLSFPEAVQQIADMAGVPMPLADPVMEKREKERGSLIDVMEMATQFFQDQLQTANGARARAYLRDRGLTGRTIETFRLGYAPDSRNALKEFLAGKGVLKEQIEACGLVVHENVPVSYDRFRDRIMFPILSSREKVIAFGGRAMSADAPAKYLNSNETELFHKGNVLYNFARARRAIQGPGRGDSQDDNATGTIIAVEGYMDVIALHQAGIENAVAPLGTALTENQLELLWKMVPQPVLCFDGDGAGIRAANRAAELALPHLKPGRSVRFALLPDGKDPDDLVRDDGRAPFDKVMSQAKPLSEMLWSREVNTGKFDTPEARAELEARLKQLVAVIADENVRRHYQQDIRDRLNAFFQPQFQNRNNGDRRGGFNGNGNYRTGRDNAGKAGPKSPSVISERLARSGPVRGHQNNTALRECVLALTVVNHPSLMIDDYDEIAAIEYDSRELQRLWSAMLGAAASVAAPHLTREYLTERLEFEGFGPLIKSLDQQVRNARLWIATEQAAMEDAREGYRQALASHKRAKALRRQKIELEREIALATEAGDGEAIVQLMRAQQEVHFEGVRLENQEAIIDGFGVLSGRVKGAANH >NZ_CP054031|2147644:2157605|2152929_2153658_-|WP_138330442.1|DBSCAN-SWA MAQNIYDRPEFFAGYSGMRRSLHGLDGASEWPAVRALLPDLAGKRVVDLGCGFGWFSRFAVSQGAASVLALDISEKMIARARADTVEAAITYEIADLEHLTLAQASFDFAYSSLALHYIEDFAGLAATVYRALLPGSHFVFTIEHPVFMAPTNPAWAADAEGRRIWPLDRYSVEGPRTTDWLTKGVVKQHRKLGTTLNTLIAAGFTLHYVEEWSPNDQELRDNPDWAEELDRPMFLLISVQR >NZ_CP054031|2147644:2157605|2150169_2150517_-|WP_138330445.1|DBSCAN-SWA MLYEGSCHCGNVAFEVEGEFTEALDCNCSLCRRRGGLLAFVPREKLVLTTPEDNVSTYTFNRHAIRHHFCANCGIAPFGEAVGPNGAAMASINLRCIPAVDIGALTVKAYDGAAR |
7 | Vibrio_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
3064958 : 3075007
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP054031|3064958:3075007|DBSCAN-SWA TATGAAAACCATCGTCGTCTGCTCCGGCGGACTGGACTCCGTTACGCTTGCCCACAAGGTAGCAGCGGAACAACAGCTTATCGGTCTCGTATCCTTCGACTACGGCCAACGGCATCGCAAGGAACTCGACTTCGCCGCCAGATGTGCCTCACGCCTCTCCGTTCCCCATCATATCATCGACATCGCCAGCATCGGCGGCCATCTCAGCGGATCGGCCCTGACCGACAATGTCGAGGTTCCGGACGGCCACTACGCCGAGGAGACAATGAAAGCCACCGTCGTGCCCAACCGCAATGCCATCATGCTGGCAATCGCATTCGGTTTGGCGGCCGCGCAAAAAGCAGATGCCGTTGCCGTCGCCGTGCATGGCGGCGACCACTTCATTTACCCGGACTGCCGGCCGGGCTTCATCGATGCTTTCCAGCGCATGCAGAACGAAGCGCTGGAAGGTTATGCGAGCGTGAAACTGCTTGCCCCATACGTCGATGTCTCCAAAGCAGCGATCGTGGTTGACGGCGAAAAACACGGCACGCCGTTTTCGGAGACATGGTCCTGCTACAAAGGCGGCGAGCGCCACTGCGGGCGCTGCGGAACCTGTGTGGAACGTCGCGAAGCGTTCCATCTTGCCGGCGTCCCCGATCCCACGGAATATGAAGACCAGGATTTCTGGAAAGCGGCCGTGTCTCGATATTCGGCCACGGAGGTGCGTTGATGTACCGCATCACCAAGGAGTTTCATCTCTCCGCCTCCCATCAATTGGATCACCTGCCTACCGACCATCAATGTGCTCGGCTTCACGGGCACAACTACGTCGTCATCGTCGAGCTGGCTGCGGAAAGCCTGAATGATGATGGCTTCGTACGTGACTATCACGACCTCTCACCGCTCAAGCGCTATATCGACGAAACTTTCGATCATCGTCACCTGAACGACGTCTTCGGCCATTCGAAAGTCACCTCCGAATTTCTGGCAAGGCACTTTTACGACTGGTGCAAGCAGCGCTTCCCGGAGACCTCATCTGTTCGCGTCAGCGAGACCCCCAAAACATGGGCGGAGTACAGGCCGTGAGCGTCGGAACCATTCGCATCAGCGAGATCTTTGGCCCGACCATACAGGGTGAAGGTGCCTTGATCGGGCTGCCGACGGTGTTCGTGAGGACAGGTGGCTGTGACTATCGGTGTTCCTGGTGTGACAGTCTTCACGCGGTCGACAGCGCCTTCCGGGATCAATGGATTCCCATGTCCACTGAGGCGGTCTGGCATAAGGTCACGGAACTCTCCGGGGGCAAGCCACTGACGGTTTCCCTTTCCGGAGGCAATCCGGCGATACAGCCCTTGAGGCCACTGATCGAACTCGGCCATTCCCAAGGATACCGCTTTGCACTGGAAACACAGGGAAGCGTGGCCCAGGCCTGGTTTCGCGATCTCGATATCCTGGTCATCAGCCCCAAACCGCCATCAAGCGGAATGCTGACGGACTGGGATCACGTGGATCACTGTCTTCAACTGGCCGCCGGCGGACCGGAGGTCGCATTGAAGATTGTCGTCTTCGACGATACCGACTACGAATTCGCCCAATGCGCAGGTCAGCGCTATCCCCAAATTCCCTTGTTCCTTCAGCCCGGCAATCACACACCGCCACCACCCGATGACCATGACGCACGCATTGATATCGACGGCGTGATGGATCAAATGCACTGGCTTGTCGCAAAAGTAACAGCCGACCAATGGTTTACAGCCCGCGTGCTACCTCAACTGCATGTGCTGCTTTGGGGAAACAAGCGAGGAGTATAAGTTTCTAGCGCTCCGGAAAGGCGAACACATCGACCGGCTCGCCGAGCCCGCGCAACGGGAAGGTGCCGAGGCTGTCCATGTCTTTCGCGCAGCCCGCCATTTCGACAAAGGCCCGCGACAAGAGCACCGGACGCTTGACCTCCTTGGTCAGCGTTTCCAGCCGCGAGGCGACATTGACCGCCGGACCGATGACGGTGAAATCCAGCCGCCGGCGCGAGCCGATATTGCCGTACATGACGTCGCCGACATGCACGCCGACGCCATAACGCAGCGGCTCGCGTCCATTGCCCGCATACTGCTGGTTCAATTGCGCCATCAACCCCTGGCCTTCGCGGATCGCCTGCAGCAGATCGTGACAGGCCGTTTCCTTGCTCAGCGGAAAGATCGCCAGCAATCCGTCGCCCATGAACTTTAGGATCTCTCCGCCATGCCGTTCGATCGGATCCGACATGGCGTCGAAATAATCATTGAGCAGATGGATGACATCGTCGCGCGGCCAGAGATCGGAGATCGCCGTAAAGTCGCGCAGGTCGCAGATCATGATTGCCGCACCGACGGTCGCACCGCTGCCGCGCGTCGTGACGCCCGCCAGGATCTGTTCGCTCGCATGCGGTCCGACATAGGTCTGCAGCAGCGTGCGCGCCATGATGTTCTTCAGCCGAATTTCGCTGACGAGGGTCAGGGCCGGCAGCAGGTCACGCAGGAAATCGACATGCTCGCTCGTAAAACCGCCGGGCCGGCTGGTGGAAAATGTCGCGACATGCCGTTTGCCGAAAGTATGCTCGAGCGGCCAGGCGATATACTCCGTGAGGCCGTCATCGCGCAGTTCCTGATAGAATGAATCCTCGTCGCCGTCGGCTGTGCCTTCCAGATGTCTACGCACCTCCTCCGAACCCTGATGGATCGCGTTGACGGGACTCTTCAGGAACTCCGGCGTGTTTTCGACACCATAGGCGAAAGTGTTGATCTTCGCCTCTGCCAAGCCTTCCTTCCAGAGGATGCGGGCGCCGATCCATTGCGGATGGTTCGTCCTGAAATGCAATGTCGCCCGCGCTACCGGCACCCCTGCCGCCAGCAGCTTCTCGCACATTTCCACCAGGATATTGTCGATAAACCGCTCGCCGCGCGTATCGTTCACCAGCCAGTCGAGAATCCGTCTTCTGCGGATCGGCCAGACGCCCTCGTCCGCCTCGACGGCAGTCCCGGCCTTGTTCAAAAAAGACGACATCAAAAACTCCCGCTACGCCACATCATCGGCGAGGACCGAATTTAGTTTCCGCTGAATGTGGGCGATGAGAGAGCAAATGATAAGAGGCAGACTGCAATCATGCCTCAGAAATCCCAATCCTCGTCTTCGGTCGCCACTGCCTTGCCGATGACATAGGAGGAGCCGGAGCCGGAGAAGAAGTCGTGGTTCTCGTCGGCATTCGGCGAAAGTGCCGACAGGATCGCCGGATTGACCTTGCAGGCCTCACCCGGGAACAACGCCTCATAGCCGAGGTTCATCAGCGCCTTGTTGGCATTGTAATGCAGGAATTTCTTGACGTCCTCGGTCAACCCGACACCGTCATAGAGCGCTTCGGTATATTTCGCCTCGCTATCATAGAGTTCGAGCAGCAGCTCGAAGGCGAAATCCTTGATCTCCTGCCTTCTCTCCTCGCCGAGGCGTTCCAGCCCGCGCTGGAACTTGTAGCCGATATAATAGCCGTGCACCGCCTCGTCGCGGATGATCAGCCGGATCATGTCGGCCGTGTTGGTCAGCTTGGCGCGGCTCGACCAATACATTGGCAGGTAGAAGCCGGAATAGAACAGGAAGCTTTCGAGGAAGACGCTCGCGACCTTCTTCTTCAACGGATCGCCGGAGCGATACTGCTCCATGATCAGCGCCGATTTCCGCTGCAGGAATTCGTTCTCCTCCGACCAGCGATAGGCATCGTCGACATCCGGCGTCGAGCACAGCGTCGAGAAGATCGACGAATAGGAGCGCGCATGCACCGCCTCCATGAAGGAGATGTTGGAAAGCACCGCCTCCTCATGCTGCGTCTCCGCATCCTCCATCAGCTGGATGGAACCGACGCCGTTCTGGATCGTGTCGAGCAGCGTCAATCCGGTAAAGACGCGGATGGTGAGCTGCTGCTCGACCGGCGTCAGCGTCGCCCAGGAAGGAATGTCGTTCGAAAGCGGCACTTTCTCCGGCAGCCAGAAATTGCCGGTGAGCCGGTTCCAGACTTCGAGATCCTTGTCGTCCTCGATGCGGTTCCAGTTGATGGCGCGCACGCGAGAAGCGGGTTTGATTTGTATGTTCATTAGGTGGTCCCTCAATATGTCTGGGCGCGCCCTCATCCGCCTGCCGGCACCTTCTCCCGCTGGGGAGAAGAGACTTGTGGCAGCGGCATCCCCACTAAGTCCCCTCTCCCCAGCGGGTAGAGGGTTAGGGTGAGGGGGCTTCTGGCTCGAGGCCCGAAATCAATCACAGCGTACAGGATACGCAGCCCTGCACCTCGGTGCCGGACAGCGCCATCTGGCGAAGGCGGATGTAGTAGATCGTCTTGATGCCCTTCTTCCAGGCATGGATCTGCGCCTTGTTGATGTCGCGCGTCGTTGCCGTGTCGCGGAAGAACAAGGTCAGCGACAGGCCTTGGTCGACATGCTGGGTCGCCGCCGCATAGGTGTCGATGATCTTCTCCGGCCCGATCTCGTAGGCATCCTGATAATAGTCGAGATTGTCGTTGGTCATGAACGGCGCCGGATAATAGACGCGGCCGATCTTGCCTTCCTTGCGGATCTCGATCTTCGAGACGATCGGATGGATCGAGGAGGTCGAGTGGTTGATGTAGGAGATCGACCCTGTCGGCGGCACCGCCTGCAGGTTCTGGTTATAGAGGCCGGAAGCCATCACCGCCTTCTTCAGCGCCGCCCAATCTTCCTGCGTCGGAATGTGGATGCCCGCCGTCTCGAACAGTGCCTTCACCTTTTCCGTCGCCGGCTCCCAGGGCCGGTCGGTATATTTGTCGAAATATTCGCCGGAGGCATATTTCGAATTCTCGAAGCCCTTGAAGCTCGTGCCGCGTTCGACCGCCAATAGGTTCGAGGCGCCGATTGCGTGATAGGTCACCGTATAAAAATAGATGTTGGTGAAGTCGACGCCTTCCTCGGAGCCGTAGAAGATGCGTTCACGAGCGAGATAGCCGTGCAGGTTCATCTGGCCGAGGCCGATCGCATGGCTCTCGTCATTGCCCTTCTCGATCGAGGGAACCGAGGAAATGTGACTCATGTCGGAAACGGCCGTCAGCGCCCGGATCGAGGTCTCGATCGTCTTGCCGAAATCGGCCGAATCCATTGCCGCCGCGATATTCAGCGAGCCGAGATTGCAGGAAATATCCTTGCCGAGATGTTTGTAGGAAAGGTCGTCATTATATTCGCTCGCCTCGCTCACCTGCAGGATCTCCGAGCAGAGATTGCTCATCGAAATACGCCCGGCGATCGGGTTCGCCCGGTTCACCGTGTCCTCGAACATGATGTAGGGGTAGCCGCTCTCGAACTGGATTTCGGCAAGCACCTGAAAGAATTCGCGCGCCTTGATCTTCTTCTTGGAGATGCGGGCATCATCAGCCATCTCGCGGTACTTTTCCGTCACCGAAATCTCGGTGAAGGGTACGCCATAGACGCGCTCCACGTCATAGGGCGAGAAAAGATACATATCCTCGTTGTTCTTGGCTAGCTCGAAGGTGATATCGGGCACGACGACGCCGAGCGACAGCGTCTTGATGCGGATCTTCTCGTCGGCATTTTCGCGCTTGGTGTCGAGGAAGCGCATGATGTCGGGGTGGTGGGCGTTGAGATAGACCGCACCTGCCCCCTGCCGCGCGCCGAGCTGGTTGGCGTAGGAGAAGCTATCTTCGAGCAGCTTCATGACAGGGATGATGCCTGAGGACTGGTTCTCGATATGCTTGATCGGCGCGCCTGCCTCGCGGATATTCGTCAGCGACAGCGCCACGCCGCCGCCGCGCTTCGACAATTGCAGCGCCGAATTGATCGACCGACCGATCGATTCCATATTGTCCTCGACGCGCAGCAGGAAGCAGGAAACCAGTTCGCCGCGCTGCTTCTTGCCGGCGTTGAGGAAGGTCGGCGTTGCCGGCTGGAAGCGGCCGGAAATGATCTCGTCGACCATGTCGCGGGCAAGGGCCTCGTCGCCGCGCGCCAAGGCCAGAGCCACCATGCAGATGCGGTCCTCATAGCGCTCGAGATAGCGCTTTCCGTCGAAGGTCTTCAGCGTATAGCTGGTGTAATATTTGAAGGCGCCGAGGAAGGTTGGGAAACGAAATTTCTTGGCATAGGCTTGGTCGAACAGATCCCGCACGAAATTGAAGGAATACTGGTCGAGAACCTCCTGCTCGTAATAACCCTCGGTCACGAGGTAATCGAGCTTTTCCCTCAGATTGTGGAAGAACACCGTGTTCTGGTTTACATGCTGCAGGAAATATTGCTTGGCAGCCATTCGATCCTTGTCGAGCTGGATCCGCCCCTCATCGTCATAGAGGTTCAGCATCGCGTTCAGCGCGTGATAGTCGAGGGTTTCGGCTGCCTTCGACGGCCTCTCCCCGGCGGGCTGAGCAATAGTGTGATGCGGTTTTGCGCCCGCATCCCGCGTCAGGTTTGTTCCCGTGTCCAAAATCGTTCCATTCCGTGTTTGACCTTGGCGACGTCGTCTTCGGTGCCCATGAGCTCGAACCGGTAAAGGTACGGCACCTGGCATTTCTGCGAGACCACGTTGCCGGCGAGCCCGTATGTCTCGCCGAAATTGCTATTGCCCGCGGCGATTACGCCGCGGATGTGTCCGCGGTTTTCCGCATCGTTGAGGAAACGGATCACCTGCTTGGGAACGGCGCCCTTGCCGCCGTCGCCGCTATAGGTCGGCACGATCAGCACGAAGGGTTCGCGGATATGGAATGCGTCTGCGCCACCTGGCGGAATGCGCGCCGCGCGCAGTCCGAGCTTGACGACGAACCGATGGGTGTTCTCGGATCGGCTGGAATAATAGACGATCAGGCCCATCGCTTGTCCTCACGTCAAGAGAGCGCGCTGATCATGTCGGGACGGAAGCCCGCCCAATGCTGCTCGCCGGCGATGACGACGGGCGCCTGCATGTAGCCGAGGCTGCGGACGCGATCGAGCGCTTCGGCATCCTGCGAGATATCGACGATGTCATAATCGACACCCAACCGGTCGAGGGCGCGGTAGGTGGCAGTGCACTGGACGCAGGCAGGCTTGCTGTAGACGGTAATGGTCATGATATTCCTCGTGACGACGCGATTGGGCGCAGGACATCGATCGGAAATCTGTCGATATCCGTCGAAAAGTTCTCTGATGATTTGAGCTGTGCGGCTGTTTAGCCGCTGGGCGTACTCCGCCCTCTACCTTGAGCGGGTGGCGGCCTGAACTGAGATACTGACATGACTTCACCCCGAAGTGGCTCAATGCGAAGGTTTCGGGGGCGGAGGTTCGGGCCTGCGAAACGCGATGCGGAGATTCCCCGCAGGCATGCAACGCGACCTTCCGGACACCCCGCCCGTGGACGTTTCGTTCGAGGCAGGTCTCCTGGCTCACGGGTCAAAGCGCCTGCCCAGCCTTCCCGGAGCATCATGCTCCAGTGACCTGAAATCGGACAGTTGCTCGCCGCTTACAGTTGCGGGGGCAGCTCCGGCATTGCCGCACCATGATGGCGAAGCGCACCGAATTCCCGTCTTAGCCGCCGATCCTCACGAATCGACGGAACCTCGAACACTAGATATGGTACCCGAATCGCCGATGACGTCAACAAGTTGTTGCGGCGGCGCAGCAATGGCGACAACGAATCGCCGCGCCTGTGTATGATGTGGATAATGAAGCGTTAACGGAAAAGCGCTGTGGAGAACCCATCATAAAAAGAGCCGGCAGAAGCCGGCTCCCCAATCGACATTCATATTTTTATCCGAAGCAACCGCAGTGCGTTGATCGTCACCAGTACGGTGGCGCCGGTATCGGCGAGGATCGCTGGCCAGAGCCCGGTAATGCCGGCGATCGTCGTCACCAGAAACACCGCCTTCAGCCCGAGTGCGATGGTGATGTTCTGCAGGATGTTGCGCATCGTGCGTTTGGAAAGTTCGATCATTCGCGCCACGTCGCCGACGCGTCCGTGCAGCACGGCGGCATCCGCCGTCTCCAGTGCCACATCGGTGCCGCCACCCATGGCGATGCCGATATCGGCAGCGGCCAGCGCCGGAGCATCGTTGATGCCGTCGCCGACCTTGGCGACGATGAAACCCTGGCGCTTCAATTCGCCGACGACCCGCTGCTTGTCCTCCGGCATCATCTCGCCGCGCCAGTCGATGCCGAGCATGCCGGCAACGGCTGCTGCCGTCCGCTTGTTGTCGCCCGTCAACATCATCGCCTTGACGCCTGCTGATTTGAGGGCAGCGAGCCCGGCCTCGGCATCCTCGCGCGGCTCGTCGCGCATCGCGATCAAGCCGGCCGCAACACCATTGACGAGCAGCACCGACACGCTCTTGCCCTCGTCGTTCAGCGCCGTGATGCGTGCATCCTGTTCCGCGCCGAGCGTCCCGCGCTCGCGGGCGGCAGGCGGCGACAGCAGATCCAGCGCCTCACCGCCGACTTTGCCGCTGACACCCTTGCCCGGCAGCGCTTCCAGCTCGAAGGCCGGCGGCACGGGAACACCATCGGCCTTGGCACGGTTGAGGATCGCCAGCGCCAGCGGGTGGCTGGAGCCTTGCTCCAGCACCGCCGCGCGCGACAGCACCTGCGCCTCGCTCAGCCCGAACGAAATGATGTCAGTCACCTGAGGCTTGCCTTCCGTCAGCGTGCCCGTCTTGTCGAAGGCGACCATCGTCACCTTGCCGAGCGTTTCCAGCACCGCGCCGCCCTTCATCAGCAGCCCGCGCCGCGCACCGGATGAAAGCGAGGCGGCGATTGCCGCCGGCGTCGAGATGACGAGCGCGCAGGGGCAGCCGATCAAAAGGATCGCAAGGCCCTTATAGACCCATTCGCCCCACGGCCCAGCAAACAGCAGCGGCGGAACGACCGCGACCAGCGCTGCGACCACGACCACGCCGGGCGTGTAATAGCGCGAGAACCGATCGATGAAGCGCTCCGTCGGCGCCTTCGATTCCTGCGCCTCCTCCACCAGCTTGACGACGCGGGCGATGGTATTGTCGGCGGCAGCCGCCGTGACGCGAACCCTGAGCACCGCGTCGCCATTGACCGTGCCGGCAAAAACGACGGCATCGACGCCCTTGTGTACCGGCGTGCTCTCACCCGTCACCGGCGCCTCGTCGATCGCGCTCTCGCCTGAGAGGATGATGCCGTCCGCCGAGATCCGGTCGCCAGGACGAACCATGATGATGGCGCCGACCGAAAGGCTTTCCGCCGGCACCTCCCGCGTCTGCCCATTGTCTTCGAGCAGCGCGGTCTTCGGCACCAGCGCCGTCAGCGACTGGATGCTTTCGCGCGCCTTGCCCGCTGCCACCCCTTCCAGCAGCTCGCCAACGAGGAACAGGAACACAACGGTTGCCGCCTCTTCGCCGGCATTGATGATGACGGCGCCGACAGCGGCGATCGTCATCAGCATCTCGATCGAAAACGGTGTGCCTGAGAAAGCGGCCATGATAGCGCGCCGGGCGATCGGCACCAGCCCGATCAGCATGGCGACGATGAAGGCATAGGAGGCAACCGCCGGCACGAGATGGCCGACGGCATAGGCTGCAACGAGTGCCGCACCCGAAAGGATGGTCAGCCGGCCCTTCCTACTCTGCCACCAGGGACCGGCCATCGGCGCATGGTCGTGCCCGTGCAGCCCCTCAATTTCCTTCTCGCCATGATCGTCGTGGTCATGACTTGCATGGTCATCGCTATGGCCTGCATGGTCGTGCCCTTCATGATCATGCACCGCATGATCGTGGTCATGGCCCGCGTGATCACGGTGCTCATGTCCATGGTCGTGACGATGCTGCAAGCTGTGGGCATGCGCAGGCGCAGCGTTTCCGGCAAGCGGCGCGACGGAATAGCCGAGCCCGGTCACCTTCTTCTCGATCGCCTTGAGATCACTGCTGCCATCGTGCCGGACGGTCATCGTGCCTGCCATCACCGAAACGGAAACATCGGCGACACCCGCCAAGCGTCTGACCGCCGTATCGATCTTGGTTGCGCAGGAGGCGCAATCCATGCCGCCAACCCGGTATCGTGTCTCGCTCTCAGCCAT
Protein sequences of DBSCAN-SWA_7 >NZ_CP054031|3064958:3075007|3066021_3066750_+|WP_138334978.1|DBSCAN-SWA MSVGTIRISEIFGPTIQGEGALIGLPTVFVRTGGCDYRCSWCDSLHAVDSAFRDQWIPMSTEAVWHKVTELSGGKPLTVSLSGGNPAIQPLRPLIELGHSQGYRFALETQGSVAQAWFRDLDILVISPKPPSSGMLTDWDHVDHCLQLAAGGPEVALKIVVFDDTDYEFAQCAGQRYPQIPLFLQPGNHTPPPPDDHDARIDIDGVMDQMHWLVAKVTADQWFTARVLPQLHVLLWGNKRGV >NZ_CP054031|3064958:3075007|3071403_3071808_-|WP_138334974.1|DBSCAN-SWA MGLIVYYSSRSENTHRFVVKLGLRAARIPPGGADAFHIREPFVLIVPTYSGDGGKGAVPKQVIRFLNDAENRGHIRGVIAAGNSNFGETYGLAGNVVSQKCQVPYLYRFELMGTEDDVAKVKHGMERFWTREQT >NZ_CP054031|3064958:3075007|3064958_3065669_+|WP_138334980.1|DBSCAN-SWA MKTIVVCSGGLDSVTLAHKVAAEQQLIGLVSFDYGQRHRKELDFAARCASRLSVPHHIIDIASIGGHLSGSALTDNVEVPDGHYAEETMKATVVPNRNAIMLAIAFGLAAAQKADAVAVAVHGGDHFIYPDCRPGFIDAFQRMQNEALEGYASVKLLAPYVDVSKAAIVVDGEKHGTPFSETWSCYKGGERHCGRCGTCVERREAFHLAGVPDPTEYEDQDFWKAAVSRYSATEVR >NZ_CP054031|3064958:3075007|3065668_3066025_+|WP_138334979.1|DBSCAN-SWA MYRITKEFHLSASHQLDHLPTDHQCARLHGHNYVVIVELAAESLNDDGFVRDYHDLSPLKRYIDETFDHRHLNDVFGHSKVTSEFLARHFYDWCKQRFPETSSVRVSETPKTWAEYRP >NZ_CP054031|3064958:3075007|3072712_3075007_-|WP_138334973.1|DBSCAN-SWA MAESETRYRVGGMDCASCATKIDTAVRRLAGVADVSVSVMAGTMTVRHDGSSDLKAIEKKVTGLGYSVAPLAGNAAPAHAHSLQHRHDHGHEHRDHAGHDHDHAVHDHEGHDHAGHSDDHASHDHDDHGEKEIEGLHGHDHAPMAGPWWQSRKGRLTILSGAALVAAYAVGHLVPAVASYAFIVAMLIGLVPIARRAIMAAFSGTPFSIEMLMTIAAVGAVIINAGEEAATVVFLFLVGELLEGVAAGKARESIQSLTALVPKTALLEDNGQTREVPAESLSVGAIIMVRPGDRISADGIILSGESAIDEAPVTGESTPVHKGVDAVVFAGTVNGDAVLRVRVTAAAADNTIARVVKLVEEAQESKAPTERFIDRFSRYYTPGVVVVAALVAVVPPLLFAGPWGEWVYKGLAILLIGCPCALVISTPAAIAASLSSGARRGLLMKGGAVLETLGKVTMVAFDKTGTLTEGKPQVTDIISFGLSEAQVLSRAAVLEQGSSHPLALAILNRAKADGVPVPPAFELEALPGKGVSGKVGGEALDLLSPPAARERGTLGAEQDARITALNDEGKSVSVLLVNGVAAGLIAMRDEPREDAEAGLAALKSAGVKAMMLTGDNKRTAAAVAGMLGIDWRGEMMPEDKQRVVGELKRQGFIVAKVGDGINDAPALAAADIGIAMGGGTDVALETADAAVLHGRVGDVARMIELSKRTMRNILQNITIALGLKAVFLVTTIAGITGLWPAILADTGATVLVTINALRLLRIKI >NZ_CP054031|3064958:3075007|3071822_3072044_-|WP_018243700.1|DBSCAN-SWA MTITVYSKPACVQCTATYRALDRLGVDYDIVDISQDAEALDRVRSLGYMQAPVVIAGEQHWAGFRPDMISALS >NZ_CP054031|3064958:3075007|3068082_3069057_-|WP_138334976.1|DBSCAN-SWA MNIQIKPASRVRAINWNRIEDDKDLEVWNRLTGNFWLPEKVPLSNDIPSWATLTPVEQQLTIRVFTGLTLLDTIQNGVGSIQLMEDAETQHEEAVLSNISFMEAVHARSYSSIFSTLCSTPDVDDAYRWSEENEFLQRKSALIMEQYRSGDPLKKKVASVFLESFLFYSGFYLPMYWSSRAKLTNTADMIRLIIRDEAVHGYYIGYKFQRGLERLGEERRQEIKDFAFELLLELYDSEAKYTEALYDGVGLTEDVKKFLHYNANKALMNLGYEALFPGEACKVNPAILSALSPNADENHDFFSGSGSSYVIGKAVATEDEDWDF >NZ_CP054031|3064958:3075007|3066754_3067978_-|WP_138334977.1|DBSCAN-SWA MSSFLNKAGTAVEADEGVWPIRRRRILDWLVNDTRGERFIDNILVEMCEKLLAAGVPVARATLHFRTNHPQWIGARILWKEGLAEAKINTFAYGVENTPEFLKSPVNAIHQGSEEVRRHLEGTADGDEDSFYQELRDDGLTEYIAWPLEHTFGKRHVATFSTSRPGGFTSEHVDFLRDLLPALTLVSEIRLKNIMARTLLQTYVGPHASEQILAGVTTRGSGATVGAAIMICDLRDFTAISDLWPRDDVIHLLNDYFDAMSDPIERHGGEILKFMGDGLLAIFPLSKETACHDLLQAIREGQGLMAQLNQQYAGNGREPLRYGVGVHVGDVMYGNIGSRRRLDFTVIGPAVNVASRLETLTKEVKRPVLLSRAFVEMAGCAKDMDSLGTFPLRGLGEPVDVFAFPER >NZ_CP054031|3064958:3075007|3069220_3071425_-|WP_138334975.1|DBSCAN-SWA MDTGTNLTRDAGAKPHHTIAQPAGERPSKAAETLDYHALNAMLNLYDDEGRIQLDKDRMAAKQYFLQHVNQNTVFFHNLREKLDYLVTEGYYEQEVLDQYSFNFVRDLFDQAYAKKFRFPTFLGAFKYYTSYTLKTFDGKRYLERYEDRICMVALALARGDEALARDMVDEIISGRFQPATPTFLNAGKKQRGELVSCFLLRVEDNMESIGRSINSALQLSKRGGGVALSLTNIREAGAPIKHIENQSSGIIPVMKLLEDSFSYANQLGARQGAGAVYLNAHHPDIMRFLDTKRENADEKIRIKTLSLGVVVPDITFELAKNNEDMYLFSPYDVERVYGVPFTEISVTEKYREMADDARISKKKIKAREFFQVLAEIQFESGYPYIMFEDTVNRANPIAGRISMSNLCSEILQVSEASEYNDDLSYKHLGKDISCNLGSLNIAAAMDSADFGKTIETSIRALTAVSDMSHISSVPSIEKGNDESHAIGLGQMNLHGYLARERIFYGSEEGVDFTNIYFYTVTYHAIGASNLLAVERGTSFKGFENSKYASGEYFDKYTDRPWEPATEKVKALFETAGIHIPTQEDWAALKKAVMASGLYNQNLQAVPPTGSISYINHSTSSIHPIVSKIEIRKEGKIGRVYYPAPFMTNDNLDYYQDAYEIGPEKIIDTYAAATQHVDQGLSLTLFFRDTATTRDINKAQIHAWKKGIKTIYYIRLRQMALSGTEVQGCVSCTL |
9 | Rhodococcus_phage(12.5%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
0 : 5681
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP054034|0:5681|DBSCAN-SWA
Protein sequences of DBSCAN-SWA_1 >NZ_CP054034|0:5681|2186_2546_+|WP_138334287.1|DBSCAN-SWA MKALALVTTFILCQCVGAVAQQPQQPGSMYQGQMDRLDANTSKTVDRSEYETFMGTAFGSLDKNKDGSLQSDETVQILTVEQFTLTDANHNGRISRSEFMQRVMADFATADHDRDGNLQ >NZ_CP054034|0:5681|245_737_-|WP_138334285.1|DBSCAN-SWA MPKPDHDMMNIRMAMPGEDEILVGHYLKIWDSYGTPPEHYRPDARARILSFIRSGREERRLSSFLAIIDGDIAGSASCQLHQSPFPEVIRPEQRLHGYICSVYVADAFRRRGIALALTNKAVDYLKSIGCTTAVIHASDVGEPVYRAAGFELAKEMRLKFSAE >NZ_CP054034|0:5681|4058_5681_-|WP_138334290.1|DBSCAN-SWA MIRIENISKQLSHRILFIEASAALNRGEKIGLVGPNGAGKTTIFRMINGEEQPDEGQVSCEKGVTIGYFNQDVGEMAGHSAVAEVMNGAGPVSIVAGELRDLEAAMADPEQANNMEEIIERYGEVQARYEELDGYALEGRAREVLAGLSFSQEMMDGDVGALSGGWKMRVALARILLMRPDVMLLDEPSNHLDLESLIWLEEFLKGYEGALLMTSHDREFMNRIVTKIIEIDGGALTTYSGDYEFYEQQRAQNEKQQQAQFERQQAMLAKEIKFIERFKARASHASQVQSRVKKLEKIDRVEPPKRRQSVAFDFQPAPRSGEDVVSLKGVHKKYGSRSIYEGLDFMVRRRERWCIMGINGAGKSTLLKLVTGTAEPDEGSVALGASVKMGYFAQHAMDILDGEHTVFQSLEHRFPQAGQGPLRALAGCFGFSGDDIEKRCRVLSGGEKARLVMAMMLFDPPNLLVLDEPTNHLDLDTKEMLIKALSQYEGTMLFVSHDRHFLAALSNRVLELTPEGIHQYGGGYTEYVARTGHEAPGLRS >NZ_CP054034|0:5681|2628_2793_-|WP_003549551.1|DBSCAN-SWA MLYYALVFLVVALIAGVLGFGGIAGASASIAQVLFFIFLVLFVVSLVMRLMRKV >NZ_CP054034|0:5681|2892_3483_-|WP_138334288.1|DBSCAN-SWA MMTKFRGVRRPLHAIGDGRIGMDWINRYIDQPLLHGAAGLLRSWHLRTRQPPELLEPTWNLVVLSFLLITSGQVMAGQTFTLSVAALVMLALPSARKLLLAVKAGQGGYGAREYKSLRARAIAKREAEWSVRIIVLFASVCLPFIARIDDPVGACFMLGASIWFVLTGPLKAYLDAAEPPEPNEGDRMYSGVFHVG >NZ_CP054034|0:5681|3564_3957_-|WP_138334289.1|DBSCAN-SWA MPMLLDRIEVVTLFVDDIDAANAFYQKVFAPEVVYQDAVSAVLKFSGTMINLLDAAQAPQLVQPVAVSATGSGARVLLTIKVDDVDAVCAELRKLGVMLLNGPIDRPWGRRTAAFADPSGHVWEVAQELR >NZ_CP054034|0:5681|810_1332_-|WP_138334286.1|DBSCAN-SWA MNKILATAFAAVSLTFVGAGAVSAADLGTRTYEEPDLRNGVKIGYLTCDIGGGTGYVLGSSKEADCIFQSTVGNELSDRYTGEMRKLGIDLGFTTRSRLIWAVFAPTAGYHRGSLAGLYVGATAEATLGAGVGANLLVGGTSGSIHLQTVSLTGQLGLNVAAGSASMTLTPAN >NZ_CP054034|0:5681|1917_2166_+|WP_003552191.1|DBSCAN-SWA MRIRRTNIILAGCLAVSACQSGPTSQAVSDRNSEFGCIAGTVGGAIVGGLIGSTIGAGTGQVLAVGAGIGGGGYVGNRLACR |
8 | Tupanvirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
15318 : 23104
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP054034|15318:23104|DBSCAN-SWA GCTACAGCGTGCTTTGAATTGTCTGAGGCAGCCGTATTGGCTCGGCGCCGGTGAAAGCCAGATCCGAGACCTCACCCGTTGACGAATAGCCGGAAACAAGCTCCGCCAATGTCGCACGTGCAGCGATCATATCGTTCCGGTCCAACACCGCGTTGAGCGCGTTGAGCCTTCTCGACAACTCGGGCCAGGACAGGAAATCCTCGCGCGCCTTCATAATCCGAGGATGTTCGGTTGTTTCTGGATTATCCCCGATCAACAGTTCTTCAAAGAGCTTCTCGCCGGGCCGAAGGCCGGTAACGGACAGCTCGATATCGCCTTCGGGATTGTCTTCGTCACGGACGGCCAGCCCGGAAAGCTCGACCATCTTGCGGGCGAGATCTGCGATGCGAACCGGCTCTCCCATGTCGAGCAGGAAGACATCTCCGCCCTCGGCCATCGCGCCGGCCTGTATGACGAGTTGAGAAGCCTCCGAAATGGTCATGAAATAACGAGTTATATCAGGATGTGTCAGCGTCACCGGGCCGCCGTCCTTGATCTGCTGCCGGAAAAGCGGCACGACGGATCCGGAGGAGCCGAGGACGTTTCCGAAACGGACCATGGAGAAATTCGTTCGCATTCTGTCGGTCGCCGATTCCGCCGCAAGCGCCTGCAGAACCATCTCCGCCAGCCTCTTGCTGGCGCCCATCACATTTGTCGGACGCACGGCTTTGTCTGTGCTGATCAGCACGAAATTCGAGACGCCGCATTTATTCGCCGCGCGTGCCGCGACCAGCGTACCCATCACATTGTTCTTGATTCCTTCCACGGCATTGTGTTCGACAAGGGGAACATGCTTGTAAGCGGCTGCATGATAGAGCGTCTGAGGTCGCCAGCTCTGCATGACATGTTCCATGCGATCCTGATCGCGGACGGAACACAGAATCGGAACGATCTGCATATTTTCATGCTTGTATAGTTCGGCCAGCTTCTGCAATTCGGCATGGATATTATAAAGCGCAAACTCGTTCTGATCGATGAGGATGAGGCTGGAAGGCCCGGTGCGCAAAATCTGACGACATAGCTCGCCGCCGATCGAGCCGCCAGCACCGGTGACCATCACCACCTTGTTGCGCATCGACTTGTCGAGTAGCTCCTGGCGCGGCGCGACCGCTTCTCTTCCCAGCAGATCTTCGATCTCCAGCTCTCGGATGTCGGAGACGGCGATGCGTCCCTGAGCCAGGGCAGTGAGATCCGGCAACGTGCGAACATTGACCCTGGCTTTACGGATGTGCTCCAGGATTTCGTTGCGACGCTGCCGCGATGCGGAGGGAAGAGCAAGAAGCACGTTGTGCACGCCGAGAGATTCGGCAAGCACCGGAAGATCCGAGGGGTCGTAGATCGGTAAACCACCCATGACGCCGCCCTTGAGACGCGGATCATCATCCAGATAGCCGACGACATTGAGTTCGGCACTGTTGATCAAGGCACCGGCCAGTTGCCGCCCGGCTGTCCCTGCCCCATAGATCAGCACCTTGGCGAGCATATTCTTGTGAAGGATGCGCTGGTAGGCATCCCCGAGCCAGTAGCGGATACTCAACCTCGACAGCCCGATCGCAATCAACAGAAGGAAGGGCTGGAGGATGCCGACGGTTCTCGGAACACCGGGGACGCTGAGTGCTGTAAATATCGTCATGAAGGCGACGCCGTAGATCGCAATCGCCTTCAGAACAGTAATGAAAGCAGCCATATTGGCATAACGGAAGATCGCCCGATACATGCCCATGACGATGAAGATGGGAAGGGCCATACACAGTGAAACGACGACCGGCAACCATTGCACGCCGGTCAGCACCGTCCATTCGTTCAGGCGGAAACAATAGGCCAGCCAGATCGTCAGAACACAAAAGCTGGAATCCAGCAGCAATGCCAGAGCGCGTTTGGCAACCCGCGGCATCGCCAGCAGAGGGGCGACGAGCGCTTGCATCGGCATTAAGAACCATCCTGAGCGCGGCGTCTCAGTGGGGGTATTTTCGGGCATTGCTTACCAGCGTCTCGCGGATCAATCAGACAACCACCTTCTTGTCATTTTCTGCAACGAGCGCAACTGAAATAATCGGAAAGTAAGGTCCCCCGCCCCGCTTATCTATCAGAAGTAGGTAGCTTAATGCCTTATTCCCTTGCGGCGGATGACCTTCTCGGCCGTCATAAACAGGATGCTTATGTCGAAGCCGAACGAGCGGCGTTCGAGATATTCGACATCGAATTTTACCTTCTCCGGAATCGGCAATTCGTCCCGGCCATTGATCTGCGCCCATCCCGTCAGTCCGGGCAGGAGCTTGTCGACACCATGGGCCGTCCGCAACTCTATCAGATCATATTGATTGTAGAGTGCCGGCCGCGGGCCGACGAAACTCATCTTCCCCTCGAGAATGCACCAGAGCTGCGGCAGTTCGTCGAGGCTGGATTTGCGCAGAAACGTACCGATCGGTGTCAGAAAGCGATCCGGATCTTCGAGCAGATGCGTGGCAACCGTCGGTGTGTCGACGCGCATGCTGCGAAATTTCGGCATCAGGAAGATCTGATTGAAACGGCCGACACGTTTTGACCAATAGAGGATCGGTCCCGGCGAGGTAAGGCGAACACAAAGAGCGACGACAAGGATTGGAACGAGCAGGATCGCCGCCGCAATCAAAGCCAAGAAAAAGTCCACCGCCCGTTTCAAACCCATGCTCCGCAGTTCTTCCGCTCCACCGGCAATCATCCGCCATTTGTGCTATCGCCTTGCCGTATCGTCAAAAAACGTTTCGCGTCAACTGTCAGGACGTGGTGTTTTCTTCATCAACCGACGTGGTCGCCCACCCACTTGCGAGTGCCCCTTCGTCTCGCAGGTTTTCGTCAAACTCCTAACCCTCTGATCTGACAGTGAGAAGCTTCCAATCAGAAGTTATCTTGGCCTTCGTTTCCGTCACGCTTCGTTAACCATAACAATTTGCGCCAGGAAGACCACGAAGCGAATTTCATCCTGTCTTCCGACTTATCGGCGAATTGGTGCGGGCAAGGGCCATCAGCATCGGTCCGATGGCGAATTCCCCTCTCTCGGCGCGCCGTGTCAGATCGCGCAGGTAACCGCCGGCGGAGGTGATGTGCCCTGCCCTTTCGAGGACGCAGGCCATGACGGTGGCCGCATTTTCAGGTCCCATGATGTCGCAGGCCTGCTCATAGGCCGATGGGCTGACGCCCAGCATCGAACGAACGACCACAGCCGCGGCCATCAGCTCGCGCCACGTGCCGACCGAGCCCTGCGGTCCATAGGCTGCGATTTCCGGGCAGGCCTGCAGCACCAGCTCCAGTGGGAAGGATTTGAGCGCGGCGTTGGCAGCGGGGGCAGAACTTGTGGCCGGCACACGCCCGTTCTCCTCTTCATCTTTCTGTATTCGGACTGTCGGTTCGGCCCGACCCGCCGATCGATCGTCCGCCGTTGCGCCCTGCTTCATTTCGGAAGCTGGTTCAAATTCAGTAATGGATTGTGGATTTGAATTCTGTATGTGCCGCTCATAATGTTGGGCATTGCCGCTATGATTTTGAGTTTTAACCTGCATTTCCAACTGATTGGTGATTTCTTCACGCAGCAGCTCCAATTCGTCGAGTGCGGCGGCGACCTGTTCGGCGGAAGGCGACCGTGGCAGCCGCTCGACGACATCGCGAAAATGCAGATGAATCCGGCTCCAATCGCCCGCGGCACCCTCCTCCAATGCCATCTCGATGAGCTTGGCGACATCGCGCCGGCAAAGCGTCAGGCGTTCTCTGAGCCGCTGCAGGTGCAACCGTTCCGCAACCACCTCCGCCGCCAACCGCTCGATCTCCTCAGCGCGCACCAGTAGCGGCGCCAGCGAAAAACCGAATGCCTCGCGGATCTCCCCACCCTGCTCCTTGCGGGCGTAGCGCTTGCCGTTCGGGCTGTCCTTGCGGATCAGCAGCCCGGCCTCGATCAGGCTCGCCAGGTGGCGACGGATCGTCTGCTCCGCCATCCCATGTGCGCGCAGCGAGAGCTGTGCATTGGAAGGGAACACCACGAGCCCGTTTTCCTCAGAGAGCTCCCCCTTGGGATAAAAGCTCAGCAAGGCGTTCAGAATAGCGAGCGCCCGGTCGGTGATGCCGAGCAGCGGCTTGGCCTCGCACAGGGCACGATAGAGTTTCCACTTGTCGATAGATTGGCCGGGCTCGATTTCTCGAGCAGCCGCCTGGCTTGCTAACATGCCAAGCGTCATCGGCCGCCGCCCGAAGGGCGTCGTCACATTTCCGCTTTCCATTTTCTTCGCCTTCTTCAAGGCAAAAGATCGCAGCTCACCAAATTCGGTGCCAAAGACTCTTGACTATGATTCGCGGAAATGTGATTCTCTGGGTGTCTAGATCAGAGAGGGCTTCCGCGACGGCAACGTTCGGGGGCCTTTTCTTTTGCTTATTGATCTCCGTTCTTGGTCAAATCCTGTGTCTTCTCGAACGCCTCGTAGAGGCGATCCAGATTGCTGGCGATGTAGGCTCCAAAGGCAGACGCCTTCTCCGCCTTCTCCGCCTTCAACGCGATGGTGAACTGTTTCCCGTCATCCTTGATCTTCGCCTTGACCGCGCCGTCCTTGCGCTGCCAGGCGTGAGCGGTGGGCTGAAGCGCCGCTATGGATGAGGCCTGTTGTTGCCTCCTGCTGATGAAGGCAACCAGCATATCGAATCGGGCATCCGGCTCCACCGCCTCGAATTCCGCCGATCTGGTGAACTCTATCGCTCGTGCGGCGATGTTTTCGGCCTCGAATTTCACGGACAGATCGTGCCACCGATCACGGCCGGTGGTCTTGGCCGCACCGATCGCGGTCAGGATTTCGGCTGGAATCCGCTTGGTAACGGACAGCATCTTGGAAACCGTCGTCTTGTCGGCGCTGAGCGCGGACATGATCGTTTCCCTGTCGAAGCCCAACGTGTCGAGTTTGTCGGCAAACATCGTCCTCTCGATGAAGGAGAGATCGGCGCGCGCCGAATTCTCCTGGCCCTGCGCAATGACATGGTCGCGATCGTCGAGTTTCTTGACCACGGCGCGGACAGGTCGGCCGAGTTCCTTGGCGGCCCGGGCGCGACGGTGGCCGAAGGCGATCTGATATCGGCCGTCCTTCTCAGGATGCGGCCGGACGAGGATCGGCGAATCCTGACCGCGCAACCGGATCGCCTCGACCAATTCGCGAAACTGTTCGTCGGTATGCGCCAGCCGGTCGGTCACGAAAGAATCTTCGATCAGGGCAGGGTCGAGATCAATCACCGTTTCACCGGTCGCCAGCTTCTCCTCGATCGCTTTGGCCGCATCGGCCTTGGCAGCCAGCGCATCGATACTGCGTGTCACGGCACCGAGAGCGCCGATCCCCTTATAGGTGAGCTGCTGCTTCTCGTCCCTGTGGAGTGGTTCGTCCTCATTGTTTACCGCAGTAAACTTCTTGGAATCGTCCATCAGGCCGGAGAGGAGATTCTTACGCGCCATCAGCCAGCTCCTCGCCCCAGTTTACCGCAGTAAACTTCTCGGTCATTTCCGCCCCCAAGCCTTGTGGATGAGATCGGCGATCTCGGTGTTGACGGAATCCATCGCCTCCATCGCGCGGTCATAGGTCGCGCGGGTAAACTGGTTCCGTTCGACCTCGTAAAGCGTCTGATTGGTCATCGCCGCATCGGAGATCGCGACGCTGCGCAACATCGGATTGGTGAGCACATGGTTCTTGAACATCGAACGCATGAAGGCGACCATTTGCGTCTGCGGCCCGTCCTGCGGATCGTAGCGGGTGACGAGATAACGCAACCAGTCGAGGTTCATGTTGCCGCCGGCACCTTTCAGCGTGTTCAGCACCTCGCCGAGCATCTGCAGGAATTGGCACATCGACATGACGTCGAGCATCTGAGGATGTACGGTGATCAGAACTGCCGTTGCCCCGCAAATGGCGCTCATGGTCAAAAAGCCGAGCTGGGGAGGGCAGTCGATGATGACGACGTCGTAATCGTCGGCGACGGAGGACAGCGCCTCGTCCAGCCGCGCGAAGAAGACGCGTCCATAGTCGCCCGCCTTGCCCTGAGCAAGGACGCGCGGCGTATCGTGCTCGAATTCCATCAGTTCGAGATTGCCCGGCACCAGATGAAGGTTCGGGAAATTCGTCGGACGGATGATCTCTCTCAGCGGACGCCGCTGATCGTCATAACGAATGGCGGCGTAGAGCGTCTCATTTTCGTCGACGTCGAACTCCGGCTGAAAACCGTGGATGGCGGAGAGAGAGGCCTGCGGATCGAGGTCGACGGCAAGGACACGGTGACCCGTCAAGGCAAGATGCTGGGCAAGGTGTGCTGCACTCGTCGTCTTGCCGCTGCCACCTTTGAAGTTGACGACGGCAACGACCTGCAGATGTTCGTTGCCGCGGCGATGACGCATATAATGCGTTCCGGCGCGGGCATTGTGCTCCAAGAAGCTGCGCATCTCTTCCATTTGCTCCGCCGTATAGGATCGACGGCCCGACGGCGTGACCTGGGGAAGGGCGCCTTTGCCTTCCAGCGACAGGTTGCGAAGATAGCCGCTGGTGACACCGAGGAAACGCGCCGCCTCGGCAAGCTGAAATTCCCTCAAACCCTTCAATGCACGCGGCGGAAACATCTCGAGCCGATGCTGCTTCAGCTTATCGGACAGTTCCTGCGCCTGGCCAATAATCAGCTGGTCGACATCAGAAATTGCTTTTTCTATCTGCGGTGCCATATTCATGGAAATCTCGAAAATGCGATTTAAAAGCCCTGGGCGCTGTGAAACCGCATTTAGCGCCGATTCTGGAGGGGTGGCAAGGCAAATAGGGTTAACAAGACCCTACCGCGAGGTTGTCCAATCTTTTTCAAAGGCTTGAACAGGCCGCCCGGTGCCGCTGTCCAGCGTGGCAATCGCGCATTCACTGACCGAATCGCAAGAGCAAGGCTGCACAACCGGCATCATAGCCGATCGGATACTTCGGAAGCGAGATATCGAGCGTGCACGGCTCTTTGAAGTACGTCCGCAATGGGTCCGCAGTGGGCAAGCACCTGGACAGTCAGTAGCTGTTCAGTGCCGCCGTGGCAGAGAACCGTCCTGATATCTGCCTGGGGGAGGGGACACGGCGCTTCTGTCGCGGTCCTGCAGCAGAGCGCCCAGCGCAAGATCCAGGCGATAGGCGGAGAATTCAGCGCCCAGCTCGGCGCCGTCCCTCAGATAGCGATCCGGCAAATGCCAAATCGCAAGCAGACCCTTTCGGGAAATGCCATTGATTGGCCGGTCGGCTCCCCAGGCGCAACGCTGCGATACCCGCCTATTCTCCGAGCGGAAGGGTGATATCCGCTACGGAGGCTCCCAATGTCAGACCTCGACTTTCCGCCTGCGCCATCGCCTTCACCAGCGCGATGACCGTTTCGCGGATATTGCTGTCGGCGATCTTTAGAAAGGCGCGATTGAGCACCAATCCCTCTTTCGTTCGCAGGAATTCAGCGACCGGGTCCATATTCGCGGATAGATCCAGCCCTTGCAGCGTCAACGGCTCGGAATTCTCCTGTTCGAAGAAGAAGCTCGGCGACGTGTGTAGCACCTCGGCGATCCGCTGCAGCCGGCTCGCGCCGATGCGGTTGATGCCCTTCTCGTATTTCTGAACCTGCTGAAAGGTCACGCCGATCTGCTCGGCAAGCCGTTCCTGGCTCATCCCGAGCAACTGCCGGCGCAATCTAATGCGCGCTCCGACATAGCTGTCTATCGCATTCGCTGTCTTGACATTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP054034|15318:23104|22645_23104_-|WP_062941828.1|DBSCAN-SWA MNVKTANAIDSYVGARIRLRRQLLGMSQERLAEQIGVTFQQVQKYEKGINRIGASRLQRIAEVLHTSPSFFFEQENSEPLTLQGLDLSANMDPVAEFLRTKEGLVLNRAFLKIADSNIRETVIALVKAMAQAESRGLTLGASVADITLPLGE >NZ_CP054034|15318:23104|20859_22074_-|WP_062941826.1|DBSCAN-SWA MNMAPQIEKAISDVDQLIIGQAQELSDKLKQHRLEMFPPRALKGLREFQLAEAARFLGVTSGYLRNLSLEGKGALPQVTPSGRRSYTAEQMEEMRSFLEHNARAGTHYMRHRRGNEHLQVVAVVNFKGGSGKTTSAAHLAQHLALTGHRVLAVDLDPQASLSAIHGFQPEFDVDENETLYAAIRYDDQRRPLREIIRPTNFPNLHLVPGNLELMEFEHDTPRVLAQGKAGDYGRVFFARLDEALSSVADDYDVVIIDCPPQLGFLTMSAICGATAVLITVHPQMLDVMSMCQFLQMLGEVLNTLKGAGGNMNLDWLRYLVTRYDPQDGPQTQMVAFMRSMFKNHVLTNPMLRSVAISDAAMTNQTLYEVERNQFTRATYDRAMEAMDSVNTEIADLIHKAWGRK >NZ_CP054034|15318:23104|18313_19606_-|WP_138333939.1|DBSCAN-SWA MESGNVTTPFGRRPMTLGMLASQAAAREIEPGQSIDKWKLYRALCEAKPLLGITDRALAILNALLSFYPKGELSEENGLVVFPSNAQLSLRAHGMAEQTIRRHLASLIEAGLLIRKDSPNGKRYARKEQGGEIREAFGFSLAPLLVRAEEIERLAAEVVAERLHLQRLRERLTLCRRDVAKLIEMALEEGAAGDWSRIHLHFRDVVERLPRSPSAEQVAAALDELELLREEITNQLEMQVKTQNHSGNAQHYERHIQNSNPQSITEFEPASEMKQGATADDRSAGRAEPTVRIQKDEEENGRVPATSSAPAANAALKSFPLELVLQACPEIAAYGPQGSVGTWRELMAAAVVVRSMLGVSPSAYEQACDIMGPENAATVMACVLERAGHITSAGGYLRDLTRRAERGEFAIGPMLMALARTNSPISRKTG >NZ_CP054034|15318:23104|17457_18024_-|WP_138333937.1|DBSCAN-SWA MGLKRAVDFFLALIAAAILLVPILVVALCVRLTSPGPILYWSKRVGRFNQIFLMPKFRSMRVDTPTVATHLLEDPDRFLTPIGTFLRKSSLDELPQLWCILEGKMSFVGPRPALYNQYDLIELRTAHGVDKLLPGLTGWAQINGRDELPIPEKVKFDVEYLERRSFGFDISILFMTAEKVIRRKGIRH >NZ_CP054034|15318:23104|19755_20817_-|WP_138333941.1|DBSCAN-SWA MARKNLLSGLMDDSKKFTAVNNEDEPLHRDEKQQLTYKGIGALGAVTRSIDALAAKADAAKAIEEKLATGETVIDLDPALIEDSFVTDRLAHTDEQFRELVEAIRLRGQDSPILVRPHPEKDGRYQIAFGHRRARAAKELGRPVRAVVKKLDDRDHVIAQGQENSARADLSFIERTMFADKLDTLGFDRETIMSALSADKTTVSKMLSVTKRIPAEILTAIGAAKTTGRDRWHDLSVKFEAENIAARAIEFTRSAEFEAVEPDARFDMLVAFISRRQQQASSIAALQPTAHAWQRKDGAVKAKIKDDGKQFTIALKAEKAEKASAFGAYIASNLDRLYEAFEKTQDLTKNGDQ >NZ_CP054034|15318:23104|15318_17334_-|WP_173863103.1|DBSCAN-SWA MPENTPTETPRSGWFLMPMQALVAPLLAMPRVAKRALALLLDSSFCVLTIWLAYCFRLNEWTVLTGVQWLPVVVSLCMALPIFIVMGMYRAIFRYANMAAFITVLKAIAIYGVAFMTIFTALSVPGVPRTVGILQPFLLLIAIGLSRLSIRYWLGDAYQRILHKNMLAKVLIYGAGTAGRQLAGALINSAELNVVGYLDDDPRLKGGVMGGLPIYDPSDLPVLAESLGVHNVLLALPSASRQRRNEILEHIRKARVNVRTLPDLTALAQGRIAVSDIRELEIEDLLGREAVAPRQELLDKSMRNKVVMVTGAGGSIGGELCRQILRTGPSSLILIDQNEFALYNIHAELQKLAELYKHENMQIVPILCSVRDQDRMEHVMQSWRPQTLYHAAAYKHVPLVEHNAVEGIKNNVMGTLVAARAANKCGVSNFVLISTDKAVRPTNVMGASKRLAEMVLQALAAESATDRMRTNFSMVRFGNVLGSSGSVVPLFRQQIKDGGPVTLTHPDITRYFMTISEASQLVIQAGAMAEGGDVFLLDMGEPVRIADLARKMVELSGLAVRDEDNPEGDIELSVTGLRPGEKLFEELLIGDNPETTEHPRIMKAREDFLSWPELSRRLNALNAVLDRNDMIAARATLAELVSGYSSTGEVSDLAFTGAEPIRLPQTIQSTL |
6 | Ochrobactrum_phage(40.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
26619 : 27708
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP054034|26619:27708|DBSCAN-SWA CATGAATTCCCTTGTTTCCACCACGGCGATCGAGGCACAGTCTCAAGCGGCGGCTTCCGAAGAGGTGGTGAGACTGACCGACGTGAAGCGACGGTTCGGAACCACTGCAGCCCTCGACGGCATTTCGCTGACGGTGAAGAGAGGCGAGATCCTTGGCATCATCGGCCGCAGCGGCGCCGGCAAATCGACGCTGATCCGCTGCCTGAACGGCCTGGAGCGCGCCGATAGCGGCGAGATCCTCATCGAAGGCCGAGACATCACCGGACTTTCAGAACAGGACCTGCAGCCGCTGCGCCGCCGCATCGGCATGATCTTTCAGCATTTCAATCTGCTTTCTGCCAAAACCGTCGAGGAAAACGTCGCGCTGCCCCTGAAGATCGAGGGTCTTGCAAAGGGCGAGCGCCTGAAACGAGCGCATGAGCTGCTCGAACTCGTCGGCCTTGCAGACAAGGCGAAGGCTTATCCCGCATCGCTGTCCGGCGGCCAGAAACAACGCGTCGGCATCGCCCGGGCGCTGGCGGCACGCCCGGCACTGCTGTTGTCCGACGAGGCGACATCGGCGCTCGATCCGGAAACGACCCGGTCGATCCTGGCATTGCTGAAAGACATAAACCGTAAGCTCGGACTGACCATTCTGCTCATCACCCACGAGATGGAAGTGGTCCGCGGCATTGCCGATCGCGTCGCGGTCATCGATGCCGGGCGGATCGTCGAGGAAGGGCAGGTCTGGTCGGTCTTCGCCAATCCGCAGGCTGATATCACCGGGAGCCTGCTCGGCGGCATCCGCCCGCAGCTTCCTGAACATATCGCCGGTCGGCTGTCGGCGACGGCCGGCAGGGAGGTCATTCTCAGCGTCGATCTCGCCGGGCCGCAGGCGCAGGGCGCGCTGTTTGCCGAACTCTCGGCGGCATTGCCGCATTGCTTCCGCCTCGTCCATGGCGGCATCGACCATATCCAGAACCAGCCGGTGGCGCGGTTCTTCATCGCCGTTCCCGCACGCGACCCCGCGCTTGCCGGAAAGGTCGAAAAATTCCTGACGGCCCGGTCCGCCCGGGTGGAGGTGCTTGGTTATGACACCGATCATGCTTGA
Protein sequences of DBSCAN-SWA_3 >NZ_CP054034|26619:27708|26619_27708_+|WP_138333948.1|DBSCAN-SWA MNSLVSTTAIEAQSQAAASEEVVRLTDVKRRFGTTAALDGISLTVKRGEILGIIGRSGAGKSTLIRCLNGLERADSGEILIEGRDITGLSEQDLQPLRRRIGMIFQHFNLLSAKTVEENVALPLKIEGLAKGERLKRAHELLELVGLADKAKAYPASLSGGQKQRVGIARALAARPALLLSDEATSALDPETTRSILALLKDINRKLGLTILLITHEMEVVRGIADRVAVIDAGRIVEEGQVWSVFANPQADITGSLLGGIRPQLPEHIAGRLSATAGREVILSVDLAGPQAQGALFAELSAALPHCFRLVHGGIDHIQNQPVARFFIAVPARDPALAGKVEKFLTARSARVEVLGYDTDHA |
1 | Planktothrix_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
43081 : 43912
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP054034|43081:43912|DBSCAN-SWA ATCATGGTTTTTCTCCTGACGGACTGGAAGCCGGGTCGTCCCGGCGAAGGCCCAGAGAAATGACCGGGCCGTCCCGGTCGGGGCTGAAGATGGAGGGCGCGCCGGCAAGCCTGGGCGGCCAGATGGAAATGTCCTTGGTGATCGTCGCGCCGTAGCGCTCCTTTTCCTCCGGGCGGTCGCGCTTGCGCTCGAAGGCGACGACGCGCGTTGCCAGCGTGAAGGCCTCCCGCATGTCGTGCGTCACCATGACGACGGTCATCTGCGTTTCGTGCCAGAGCCGCTTCATCAATGTGTGGATCTCGGCACGAATGCCGGGATCGAGCGCGCCGAAAGGCTCGTCGAGCAGCAGCACCTTCGGCTTCATGATCAGGGCCTGCGCCAAAGCCAGGCGCTGCTGCATGCCGCCTGACAGTTGCGCCGGATATTTGCCCTCGGCACCGGATAGGCCGACCTCGCCGATCAGTCGCCGCGCTTCGTCGACGGCGTTGCGACGGGCAGCGCCGAAAAGCTTGGCTTTATAGCGAGACGCTGCGAGTTCCCGCCCAAGCAGCACATTGCCGAGCACTGTCAGATGCGGGAACACGGAGTAGCGCTGGAAGACGACACCACGATCCGGCCCTGGCTCCTGTGGCAGAGCTTCGCCGTCGAGCAGGATCGTCCCTCGCGTTGGCCGCTCCTGGCCGAGCAACATGCGCAGGAACGTCGACTTGCCGCAGCCGGACGGGCCGACAAGCGCGACGAAGGCGCGCGAAGCGACGGTCAGCGACACATCTTCGAGTACGATCTGGTCGCCATACTCTTTCCAGACCTTTTCGATCACCAACTCGCTCAT
Protein sequences of DBSCAN-SWA_4 >NZ_CP054034|43081:43912|43081_43912_-|WP_138333972.1|DBSCAN-SWA MSELVIEKVWKEYGDQIVLEDVSLTVASRAFVALVGPSGCGKSTFLRMLLGQERPTRGTILLDGEALPQEPGPDRGVVFQRYSVFPHLTVLGNVLLGRELAASRYKAKLFGAARRNAVDEARRLIGEVGLSGAEGKYPAQLSGGMQQRLALAQALIMKPKVLLLDEPFGALDPGIRAEIHTLMKRLWHETQMTVVMVTHDMREAFTLATRVVAFERKRDRPEEKERYGATITKDISIWPPRLAGAPSIFSPDRDGPVISLGLRRDDPASSPSGEKP |
1 | Bacillus_virus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
50347 : 51430
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP054034|50347:51430|DBSCAN-SWA CATGGCAGAGCTTTCACTCAGCAACATCGTCAAGCGCTTCGGCGGCTTTGAGATCATCCACGGCGCCAATCTGGAGGTGAAGGACGGCGAATTCGTCGTCTTCGTCGGCCCGTCCGGCTGCGGCAAGTCCACGCTGCTCAGGATGATCGCCGGCCTCGAGGACATTACGTCAGGCGAGCTTCAGATCGGCGGCAGGGTCGTCAACGACGTCGAGCCGGCCGATCGCGGCATCGCCATGGTCTTCCAGTCCTACGCGCTTTATCCGCACCTGACCGTCGAGGAAAATTTGAGCTTCGGCCTGCGCATGAACGGCAATCCGAAGGCCGACACCGAGCGGCGCGTGCGCCATGTCGCAGAGATCCTGCAGATCACCGAGCTGATGAAGCGCCGGCCGAAGCAGCTTTCCGGCGGTCAGCGCCAGCGCGTCGCGATCGGCCGCGCCATCGTCCGCGAACCACAGGTCTTCCTGTTCGACGAACCTTTGTCGAACCTCGACGCCGAACTGCGCGTGCAGATGCGCGTCGAAATCTCCAGGCTGCACAAGAAGCTCGGCACGACGATGATCTACGTCACCCACGACCAGACGGAAGCGATGACGCTCGCCGACAGGATCGTCGTGCTGCGCGCCGGCAATATCGAGCAGATCGGCGCGCCGCTCGACCTATACGATGATCCCGCCAATCAGTTCGTCGCCGGCTTCGTCGGCTCGCCGAAGATGAATTTCTTGAACGCCGTGGTGGTTGAAACGCAGCCGGGCAGGGCGGTGATCGCGCTGGAAAGTGACGCCAATACCCGCCTGACGCTGCCGGTCGCCGATCCCATCGAAGCCGGCGCAAAAGTGACGCTCGGCATCCGTCCCGAACATTTCGTCGATGCGGGCACGGGCGATGCCGACCTCACCGTCACCATCGACGTCGCCGAACACCTCGGCAACACCAGTTACATCTACGCCACCATTGGTCCCGAGCAACTGATCATCGAGCGGCCGGAATCGCGCGTCGCCGGCAATCGCGACACGCTGACAGTCGGCCTTCCCGCCAACCGCTCATTCCTTTTCGACGGCGCCGGCAAGCGGCTTCGCTGA
Protein sequences of DBSCAN-SWA_5 >NZ_CP054034|50347:51430|50347_51430_+|WP_138333986.1|DBSCAN-SWA MAELSLSNIVKRFGGFEIIHGANLEVKDGEFVVFVGPSGCGKSTLLRMIAGLEDITSGELQIGGRVVNDVEPADRGIAMVFQSYALYPHLTVEENLSFGLRMNGNPKADTERRVRHVAEILQITELMKRRPKQLSGGQRQRVAIGRAIVREPQVFLFDEPLSNLDAELRVQMRVEISRLHKKLGTTMIYVTHDQTEAMTLADRIVVLRAGNIEQIGAPLDLYDDPANQFVAGFVGSPKMNFLNAVVVETQPGRAVIALESDANTRLTLPVADPIEAGAKVTLGIRPEHFVDAGTGDADLTVTIDVAEHLGNTSYIYATIGPEQLIIERPESRVAGNRDTLTVGLPANRSFLFDGAGKRLR |
1 | Planktothrix_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
58244 : 59990
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP054034|58244:59990|DBSCAN-SWA TTCATTCGGCGGCCTCCCTCATAACGATGTCGGCATCGACGCGCATGGCGCGATGCATGCGGGCGGCAAGCAGCGGATCGTGGGTCGCAACAATAAGGGTGCGGCCCCTGGAGAGCGAAAGCAGGCTCTCGGTAACCTCGGCGGCGGTGGCGGCGTCGAGATGCGCGGTCGGCTCGTCGGCCAGGATGATCCGGAGATGCGGATTGCAGGCCGCCCGCGCGATCGCAAGCCGTAGCGCCTCGCCGCCGGAAAGTCCGATCCCGCCCTCGCCAAGCGGCCGCATGCCATAGGTGGCGGCGACTTTTCCGAGCCTGGCGGCGTCGAGCGCATCCGTCACATCGCCATGCAAGATACCGGGCCTGCCGAGCGCGATATTGCCGGCGATGGTACCCGCGAAAATATGCGGTCTCTGGCCGATCCATGCCATGCTGGCGCGCAAGGCAGCTGCGGTGTCATCCGCAAGTTCGATGCCGCCGATGATGATGCGGCCGCCGGTGCAGGGCGCCAGCCCTGATATGAGCGAAAGAAGCGTCGACTTGCCGGAACCGCTGGCGCCGAGAAACGCCAGATGTTCGCCGGCCGCGATATCGAGGTTGAGACCGTCAAGGATCAACGGCTCGGCGGCGTTGTAGCGGAAATCAACATTCTCAAGTCGGATGGCCGGCGCTTCGACGACGGCAGATACCGCTGGCGCAACGTCGGCTGCTCCTTGGATGGACAGCCCGCCGGCAGCGAGTGAGTCCAGGGCTTTCAGCGCCGCTTCGCCGGCCGCCCGGTCATGCCAGACGGCCGAGAGTTCGCGCAGCGGCTCGAAGAAGGCCGGCGCGAGCAGCAGGATGAACAGTCCCTCGGTCAGGTCGAGCCGTCCCACCCATGTGCCGAAGCGGATCTCGCCGAGCAGGCTGAAGCCGACATAGACAGCAATCATCGCCACGCCGAGTGCGGCGAAAAGCTCGAGCACCGCCGAGGACAGAAGGGCGATCTTCAGCACCGCCATGGTGCGCGCACGCAGCGATTCCGCCTCCGACCGCAGGCGCCGTGCCGTTGCATCCACCGCGTCGAGCGCACGGATGGTCGCCAGTCCGCGCAGCCGATCGAGCAGGAAGCCGTTTAGGCCGCCGGTCGCGACGAGTTGCCTTTCGCTGGCCGCCTGGGCACGCCACCCGATCAGCGCCATGAAGATCGGGATCAGCGGCGCGGCAAACAGCAAAACCAGAGCGGCGATCCAGGAGACCGGAAGGATAAAGGCAAGGATAATGAGCGGCACGAGGCTGGCCTTCATCCGTGCCGGCTGGAAACGGGCGAGATAAGGCACGATCAGCTCGGCCTGTTCGCCAATGACACTTGTCGCTTTGCCGGAGGCAGATCGGTCGCGATCGACCGGCGACGACACGGAAAGTGCCGCGGCGGCGATCTGCCGCCTGCGGCTGAGTTCAGCGCGGGCGGCTTGAAAAGCCAGGCGGCCGCCAGCCGCATCGAGACAGCTTCTTGCAAATCCGAGGGCGAGGATACCGAGGGCGGGCCAGAGAACGTCATGCAATCCGCCGCCATCGGCGATCCGGCCGACCGAGACAGCCAGCAATGCCGCCTGCGGGATCCAGATGGCGGCGGCCATCGCCTGCAGTATTGCCGCTCTGCGCAGCCCACCTTTGACATTATCGGGGCCAGTGACAGGCGAGACAGTATCGCCGTCCGGCACCGGCTTGGCGCCCCGTCCGTCCTTCGCGTCGAAGAAAGCAGTGCCCAT
Protein sequences of DBSCAN-SWA_6 >NZ_CP054034|58244:59990|58244_59990_-|WP_138333995.1|DBSCAN-SWA MGTAFFDAKDGRGAKPVPDGDTVSPVTGPDNVKGGLRRAAILQAMAAAIWIPQAALLAVSVGRIADGGGLHDVLWPALGILALGFARSCLDAAGGRLAFQAARAELSRRRQIAAAALSVSSPVDRDRSASGKATSVIGEQAELIVPYLARFQPARMKASLVPLIILAFILPVSWIAALVLLFAAPLIPIFMALIGWRAQAASERQLVATGGLNGFLLDRLRGLATIRALDAVDATARRLRSEAESLRARTMAVLKIALLSSAVLELFAALGVAMIAVYVGFSLLGEIRFGTWVGRLDLTEGLFILLLAPAFFEPLRELSAVWHDRAAGEAALKALDSLAAGGLSIQGAADVAPAVSAVVEAPAIRLENVDFRYNAAEPLILDGLNLDIAAGEHLAFLGASGSGKSTLLSLISGLAPCTGGRIIIGGIELADDTAAALRASMAWIGQRPHIFAGTIAGNIALGRPGILHGDVTDALDAARLGKVAATYGMRPLGEGGIGLSGGEALRLAIARAACNPHLRIILADEPTAHLDAATAAEVTESLLSLSRGRTLIVATHDPLLAARMHRAMRVDADIVMREAAE |
1 | Organic_Lake_phycodnavirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
68146 : 77325
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP054034|68146:77325|DBSCAN-SWA GATGCGCATCCTCCTCGACAATTTTTCGAAGAGCTTCGGCTCCACCAAAGTCATCGAGAACATGACGCTCGAAGTCGGCAACGGCGAGATGCTGGCGCTGCTCGGCCCTTCCGGCTGCGGCAAGTCGACGACGCTGTTTTCTGTCTGCGGCATCCACCGGCCGAGCGGCGGGCGCATCCTCTTCGGCGACCGCGACGTCACCGACCTGCCGAGCCAGGCCCGCAATGTCGGAGTGGTCTTCCAGTCCTATGCACTCTATCCGCACATGACGGTTACCGAGAACATCGGCTTTCCGCTGAAGGTGAAGGGCATGCCGGCCGCCGAAATCCGCAAGGAAGTCGACCGCATCGCCGCCCTCGTTCAGATCGGCAACCTGATGGGCCGCAGGCCGGCCGAGCTTTCCGGCGGCCAGCAGCAGCGCGTGGCGCTGGCCCGCGCGCTGATCCGCAAGCCCGATGTGCTGCTGCTCGACGAGCCGCTCGCCAATCTCGACGCCAAGCTGCGCCTCGAAATGCGCTCGGAAATCCGCCGCCTGCAGCGCGAGACCGGCATCACCGCCATTCTCGTCACCCACGACCAGGTCGAGGCGATGAGCATGTGCGACCGCATCGCCATCATGAAGGAAGGCGAGATCGTTCAGATCGCCACGCCTGCCGAAATGTACAATGATCCGAAGACCGCCTTCGTCGCCGGCTTTCTCGGCAATCCGCCGATCACCTTCCTGCGCGGCGTCGTGGACAAGGGCGCCTTCATCATCCCGCAAAGCGAGATCCGCGTGCCGCTGCCGGATTCTGTCGGCGCCGCCGAGGGCACGAAGCTGATGCTCGGCGTCCGGCCGGAGCATTTTACGCCGGCAGGCGACATCGCCGTGCCGGGCAAGGTCACCTTTGCCGAAACACAGGGTCGCGAGAACCTCTACGACGTGGCTCTTGCCGGCGGACCGCTGCTGCGCTCGATCCAGCCGGTGCGCAGCGATATCCATGTGGGCGACGACGTCCGCTGGGCGATCGACAGCCGCGGCATATTCGTCTTCGACGAAAATGGCAGGAGGCTCTGATGACCCGCGATTTCTCGGCATTTTTTGAGCGTTACGGCTGGCCATCGGGCAAGGGCCGGCTTCCCTTTTGCATCGGCCATCGCGGCGCCAGCGGCCATGAACGCGAAAATACCATCGCCGCCTTCCGCCGCGCCGCCGAGCTCGGCGCGGAAATGTGGGAACTCGACACGCAGCTGACCAAGGACGGCGTGGTCGTCGTCTCGCATGACGACCATCTCGAGCGCGTCTTCGGCATCGACCGCCGCATTTCCGAGATGACGGCGGCGGAGCTTGCCGGCCTCGACGGCGTCGACGTGCCGAGTTTTTCGGAGGTGGCCGCACTCGGCCGCGAGACCGGGACCGGCCTCTATGTCGAATTGAAGGCGCCTGGGACCGGCATGCGCTGCTGGCAGCACCTTGCCGAGATGAACCAGCGTTTCGCCTGTCTCGGCTCCTTCGACACAGCACAGGTGCGCGAACTTAGCGACGCGGCCTGCGATTTCCCGCTTTCGGTGCTGATCCGGGTCGGCCACGATCCGCACGCGCTCGCCGACGAGGCCGGCGCCGATATCGTGCATCTATGCTGGGAGAGGGCAGGCGAACGTCCGCAGGATCTGGTGACCGACGCGCTGATGCGCCGTGCCTTCGATGCTGGCCGCGAGATCGTGCTCTGGCACGAGGAGCGGCCGGCCATTCTCGATGACATCATGAAGCTTCCGGTTCTCGGCATCTGCACGGATCTGCCCGATCTGATGCGGCCGCCTGCGGCTAAGGAAAAAGCAGTTGGCAGACAGGGATAAGGGACCGCGCAAGGTAACCTCCTTCGACGTGGCGCGTGTGGCCGGCGTCTCGCGCGCCGCCGTGTCGCGCGCCTTCACGCCGGATGCCAGCGTCTCGCCGAAGACGCGCGAGAAGGTCTATCAGGCCGCCAAGGAACTTGGCTACCGGGTGAATTATCTTGCTCGAAGCCTGACCAACAAACGCTCCGATCTCGTCGGCGTCGTCGCCGCCGGCCTCGACAATCCGTTCCGCACGCTGCAGATCGAGCATCTGGCAAGAATGCTGCTCGCCCGCAATTTCCGCCCCATCCTGCTGCCGACCTCGCCGGAGGCGGATACTTCGACGGTCATCGGCCAGCTGCTGCATTACGCCGTCTCAGGCGTCATCGTCACCTCGGATGCCCCGCCGACCGAGATCTGCGAGCAGTGTGCGGCTGAGGGTGTGCCGATCGTGCTGATCAACAAGGGCAATGACATTCCCTTCGTCGACCGCATCATCTCCGATGATCGCATGGCCGGCCATCTCGCCGCCACCCACCTGATCGACAGTGGCGCGCGAAAACCCGCCGTGATGGCCGCCCCTGCCATATCCTATACGGCGCGGCGGCGCAGCGAGGCCTTCATTGCACGCTGCAAGCAGCTCGGCGTCGAGGCGCAGTTTCTGCAGGTCAGAATCAACGACTACCGGAGCGGCTACGATGCTGCAGCCGAACTTGCCGCATCCGGCATAGACGGGCTGTTTTGCGCCAACGACTACATGGCCTGCGGCGTCATCGACCGCGTCATGCAGGGCCGCGGCCGCGATGATACGCCACCGCTCTCCATCATCGGCCATGACGACATTCCTCAGGCGAGCTGGACCGCCTACGATCTCACGACCATCCGCCAACCCTGCGACGTCCAGGCCGAAAAGACGGTCGACCTGCTGATGAGCCGCATCGTCGAGCCGGACCTGACGGCGCGTGTCGAATTCACACCGGTGACACTGATCAGAAGAAGGACTGCCTGATGAATACCGCTTTCGACACCGCCGCCATCCGCGCACGGGCGGAGGCCGCGATCATCGTTGCCCTCGAAGTCGGGCGGGAAACGGCCCGGTTTCGGCACGACTCTGCCCCCGGCTCTCTTGCCGTCGAGAACAAGGGATTGCAGGATTTCGTCACCATCGCCGACAGGAGGGCCGAGCAGGCGATCAGCGACGGCCTGCTCTCGCGCTTTCCGGATGACACTTTCATGGGCGAAGAAAGCGGCGGACGGTCGGGCGAGGGCGGCACCTGGGTCGTCGATCCGATCGACGGCACGACCAACTATATCAGAGGCTTCCGGCACTGGGGTGTCTCGATCGCCTTCGTCGTCGGCGGCAAGGTGGAGATCGGCGTGGTCTACGACGCCGCCGGAGACAAAGTCTTTCACGCCGTCCGCGGCGGCGGCGCCTTCAAGGACGGGCTTCCGGTGCATGCGGCTGCGACCGTCGATCCGGCCAATGCGCTTGTCATTCTCGGCCATTCCCGCAAGACGAGCTTCGACGACCATCTCGCGCTCTCGCGGCGGCTCCACGAACGCGGTATGGATTATCGCCGCATGGGCGCGGCCGCAATCGACCTCGTTCGCGTCGCCGAGGGGGCTGCCGATCTCTATTACGAACGCCATCTCAATGCCTGGGACATGCTCGCCGGCGCGCTGATCGCCGAAGAGGCCGGCGCGGTGGTGGCGATGCCGCCGGTCGACAGGCTGCTTGCGCAGGGCGGGCCTGTCATCGCCCATTCGCCGGGGCTTGCCGGCGAATTCGCATTCATCCTCGACATCGAGGGACTATAGATACACCGTCCCAGAGACGCGATCCGCCTGATTGGCGAAACTGCGAGCGTTACCCAACAACGCGATTGCGGCCGCTCTTTTTCGCCGTGTAGAGATTCTGATCGGCGATCGCCAAAATCTCGTTCGGAGACTTGGCAGACAGGGTACCGGCAACACCGATGCTGATCGTGACGTGCAGCCCGTTGCAGATCTCGGACCAGTCCCACTGTTCAATGGCCGAACGCAGCTTTTCGCAGACATCGGCGGCGCCTCGGGGAGCGCCGGCGAGCAGGACGACGAATTCCTCGCCACCAAAACGAACTGCCTGATCGGTAATTCGCAACTGGCTGCGCAATATGGCTCCAATGGTACTCAGGACCCGGTCGCCGATCATGTGAGAGAAGTTGTCGTTGATCGACTTGAAGTGGTCGACATCCAGCATCGCGATGGAATATGGGCGCTTCTCAGATGTCAGCGCTTTAAAGCGGCTGTCGAGATATTTGCGGTTATAGAGCCCGGTCAGCGTGTCCAGATAGACGGCACTGGCAAGGATATCGGTTTGCTCCTGCAATTTCTGATAGGAGGCCTCCAGCGTTTCTGCGCGCTTCGTCTCCGTTTCGGCAACTTCCTTGAACCGCCTGGTTTCATAGTAAATTTCGGCGAGACGTGCGCGCTGCTGGGTTCTCTCGGCGCTATTGCGCAAATAGGCCTGATGGTATTTCTTATGAGCCGCCAGCGCCGGATCGAACTGTCCCAGCCCTTCATAGGCCTGCGACAGATAGAGATAGATCTGGATACAGGAATCGAAACTACCGCCGTCGTCGATCAGCTCCGTCGCCTCGAGCAGCGGCGACAGGGCTTCCGCGAATTTTCCGGAATTGTTGTAGATCTGGCCCAAGGTGTAGAGATATTGCTGCCTGTCCCTGACATAGCTGTTCTCTTGAAATAGCCGGTAGCGGACCATGCACTGGAGCGCCTTGTCATGTTCGCCCAAGTGGCTGTGGAATTCGGCTTTATTGCCGAGACAGATGCGAGCAGCCCAGCTGTCGCCGGTCTCGGTTGCAATGTCGAGCGCCTGTTCGGTCAGTTCCAGCGCTCTCACCAGCATCTGGCGTCCTTCCCCCGGCTGGCCGAGCGCCTCCGCGATATATCCGCCTTCCGAATGAGCGCCGCCGAGATTGATGAGCCACCAGCATTCGTAGGTCTTATTGCCGATTGATCTGGCGAGTTCGACGGACCGCTCACTCATCAAAATGGCCCGGTCGGGCTGTTTGTTGTACCAGAAGATAATTCCAATGACGTTGGTCGCCAGGCTTTGCGCTTTCAAGTCTGCGGTCTTGTCGGCCAGATCCAACGCGCGGGTGGCTTCTTCGATCGCTTCCTCGGGCAGGCCTAGTTCCAAGAGCAACCATCCGGAATACGCCCGGGCGCAGGATTCCTGTTTGGGCTCCCCGCTCAATTTCCAGAAGTCTACCGCCCGGCGGACATGCGTCAGTCCAAGTTCGGCCTTGCCGATCTGAAAACAGTACCAGGCAATGTCGGTGTCGCATTGCGCGGCGAGCCGATCGTCACCCCGCGCCTTGGCATCGGCAAGAATTTCGCCGGCCAGTGTGAGTGCCTCAACCGACCGGCCCGTACACCCAAGATGCCACGCTTCCTGCAAGCTGGAATTTGCTTGTTGCGCAATTGCTTTAGTATCAACAACTTGCATATCGGACACCAACATTTGTCTGCTGGATATGGATAGCATCAGAAGTTTAAAATGATGCAAACTGCAAAAAAACAGGCGACAAACCCAGAACCAGGGTATCGAAAGCTTCTCATACCTTTCGGCATTATGCTCCAATGTCCGCCGGTTAATCCGCCATGCGGCGGCGCCAGGCGCGTTCTGTCGCCGGTGCCTCGTCGGTGGTGATCTGATCCGGCTTCAGGGCCATCAGCGCCGAAAAGCCTCGCCATTCCTCTTCGCTGAAACCGGCCTCCGGATCCTTCAGGGTGAAGGTCCAGGCATCGACGCGTTTGCCTTCGTCCTGGCAGAGCGCGATCATATCCAGCCCCGCATTGGCCGCATCAAGGATCAGCGGCCAGTGCAGATAGATCGTGTCCGGCTCGGTCGGGCCGCGCAGGTCAGCGCGCAGTTCCGTCTCGACCGCCTTCCAGCCGCTGGCCTTCCTGATGCCGTAGAGCTTGTCAGTCGGATCGATGCCGCGCAGGAGATCGGGAAGTTTCTCCTTGACCGCGACGATGAGGTCGAGACTGTCGCCGCTGACAATCACCGAGGCGGCGATATCCCTAAAATGCGCGGCCAGATGCGCGACGCCCCTGGCGCCGATCGCTTCAAAGTCGTCCTTCATGTCGAATTGCAGCAATGCGGCTGGATGCGTCGATTGCATCATGGCAGCGAGATCCTCGCTGAGGATCAGCGGCCGGTCGCCCTCTTGCATCCTGATATCCCTGAGGTCACTGGCGCTCTTTTCGGCGACCGGTCCATGTCCTGTCGTCTCGCCTTCGAGTTCCTTGTCGTGCAGCACGACGAAACCGCCGTCGGCGCGCACGCGCAGATCGAGCTCCATCGAGGCGCCCGCCGCAAAGCCTTCCGCCATCACCTCGGCGGAAAACAGCGGATCGGCACACTGCTTGCGCAACCGGTGCCATTTCAGGCGGGTACGATGCCCCTCATGCGAAATCTCCAAGCCCTGCGCATCCAGCAATCTCGTCATCTGCATCAAGTCTCCGCGATGCCACTCTCCTGCCTTGACGAGCCGTGATTATCAAGGGCCGAATAGGGCTTTCGTCGGGGTGCCGGGAAGTTTTCGTCAGGCCGGGCTTAAGTCGGGCTGGCGGCCTTGTGGTCGCCGACAGGCCTTCCCGGCTCTGAGAACATCCTATCAGCGGCTATAGGCCGCCGACATCGCCTCCGAAGCGGCTGCGATATCCGCGGCATGGCCGGCCTGCCTGAGGGCCGCACCATACGCCACGACATTCGAGAGCACCGTCTGGAATGCGGCGCGCGGACCAGTGTGGTTCAGCCGCACGAGACGGGCCGGCGCGGTTCCGACGCCTTCGCTCAGCTCGACCCCGAGCTGCCCGGCGGTTGCGATCAGCGCGGCAGGCGCCAGCGCCTCCGGCACCGGCACCGCCGTCACCAGGTTCGAAGCCTTTGCTGCCGGTACCCAGGCCGAGGCTCCGAGTGCGGCAAGCCCTGCCCGTGTCTCCGATGCCGCCAGAGCGTGGCGAGCGACGACATTCTCCAGGCCTTCGGCCTCGATGCGGTCGAGCGCCGTCTCCAGCGCGAAGAATTCCAGCGGCGCCGGTGTTCCCGGCAGGCTACGGCGGCCGCCGTCGATCCAGCTCTTCAGATCGGCCAAAGACAAGATCGAATCGCGCGGGGCGCCGTCTCTGAGGATCAGATCCCAGGCGCCGCCGCTGACGGAAAGTGCGGACACGCCAGCCGGGCCGCCGAGCGCCTTCTGTGGCCCGATCACCGCGATGTCGATGCCGAGATCGTCAACGTTAAGTTCATGGCCACCAACCGAGGCGACTGCGTCGACAACAGTGACGATGCCGCGTGCCTGCGCAAGCGCCAGTATCTCGGGCAGGGGGTTCAGAATGCCGCTCGCCGATTCCGCGTGAACCAAGGCAAGCACATCGATCCGGGGACCGGCGTCGAGCGCTTTTGCCACCGCCTCGATCTCGACCGGCAGGCCTGGCTCGGCGACAATATCGGCGACCGTCGCACCGCCCCGCCGCAGCCATTGCCCGAACCAGCCGCCATAGGGGCTGGTGATGATATTGAGTGCTGTCAGGCCGGGACGCGCAAGACTGACGGCCGCCGCCTCCAGCGCCACCACCGCCTCGGCCTGCACGAGCAGGATGTCGTTGCGGCTGCGCAATATGCCGCCGATCCTGTCGGCAAGGGCAGCGAAGCGCTCTGCGGGGTAGGAGGGGACACCGTGCAGGGGATTCCAACCAGGATCGGACATGGACACTTCCTTCAGCAGGGATCGGTCACGAGACGAGGGACGGCGGCAACGAGCGCGCGGGCGGCATCGGATTGCGGCGCGGCAACGACCGCCGCTGCCGGTCCGGTCTCGACGATCCGTCCGCCATCCATAACGGCGATGCGGTGGGAGACGGCGCGCACCACAGCAAGATCGTGCGAAATGAACAGACAGGCAACGCCCTGTTCTTTCTGCAGATCGACCAGAAGCTCCAAGATCCTGCCACGCACGGTCACATCGAGCGCCGAGACCGCCTCATCGAGCACCAGTAACGAGGGCCGTGTCGCCATCGCCCGCGCAATCGCCACACGCTGCCGCTGGCCGCCGGAGAGTGCTCGCACCGGACGGGCGGCATGATCTGCGGTGAGGCCGACACGCTCCAGCAGCTTGGCGATCTCGCCAGGACGTCGATCTTTCGCGACGACGCCATGGATGCGAAGCGGATCGTCGAGCACTGCACCGACGCTCGCCACCGGATTGAAGGCGGCGAGCGGATCCTGAAACACCATCTGCATACGCGCCCGCCGGCGCCGCAAAGGCGCACCATTCAGCGCGAGCCAATCCTCTCCCTCGAAGCGAATCGAGCCGGCATCGGCAGCAACAAGCCGCAGCAGGATGCGCGATAGCGTCGATTTTCCACTGCCGGAGGCGCCGACGAGACCGAGCGTTTCCCCCGCCGCGATGGTGAGCGACACATTATCGAGAGCGGCAATCCGGCGTCCCGCAGACGAAAAACCTTTCGACAGGTTTTCGATGGAAAGCAGCCTGTCGTTCATGACCGCACCTCCACGATCAGCGGCGGCGTCGCGAGGTCTCGATGGCTGGCGATCAGGGCGGCGGTATAGTCGCTCTTGGGCGCCGAAAGCACCGAACAGACGGGGCCTGCCTCGACCAGCCTCGCATTGCGGAAAACGGCGATGCGATCGACAAAGCCGGAGGCAAGCGCGATGTCGTGGGTGATGAAGAGCAGCGTCATGCCGTCTTCGCGTACGAGCCCGTCGAGCAGGCGGACGATCTCGGCCTGGACGACGACGTCGAGCGCGCTGGTCGCCTCGTCGGCGATCAGCAGCGCAGGCCTCGCGGCAATCGCCGCCGCAATCGCCACGCGTTGGCGCTGGCCGCCGGAAAGCTGATGCGGAAAGGCCCGCATCGCCTTGTCGGGCTGCGGTATTCTCACCCGCTCAAGCAATTCCTCGGCCCTGATATAGGCCTGTTTCCAACTTAGGCCGAGATGGCGTCTGGCGCCCTCGGCGATCTGTTCGCCGATCGTCAAAACGGGATTGAGGCTGGAGCCCGGATCCTGGAAGACGAGGCCGAAATCGCGGCCTGGGCGGGGCGGATGGCCAAGGCCGGGCCAGAGTATCTCGCCGCCGACCTTCGTTCCCTCCGGCAGCAGACCGGCAAGCGCGCGGGCAAGTGTGCTCTTGCCGGAGCCGCTTTCGCCGATGATCGCCAGCCTTTCGCCGGCGACGATATCTAGATCGATATTGTCGAGCGCGGCCGCGCCGTTCCCACTGCGCCCGTAGGTGACCGAAAGCTGCCGCAGGCTACAGAAGACGCCGCTCAT
Protein sequences of DBSCAN-SWA_7 >NZ_CP054034|68146:77325|69201_69981_+|WP_138334012.1|DBSCAN-SWA MTRDFSAFFERYGWPSGKGRLPFCIGHRGASGHERENTIAAFRRAAELGAEMWELDTQLTKDGVVVVSHDDHLERVFGIDRRISEMTAAELAGLDGVDVPSFSEVAALGRETGTGLYVELKAPGTGMRCWQHLAEMNQRFACLGSFDTAQVRELSDAACDFPLSVLIRVGHDPHALADEAGADIVHLCWERAGERPQDLVTDALMRRAFDAGREIVLWHEERPAILDDIMKLPVLGICTDLPDLMRPPAAKEKAVGRQG >NZ_CP054034|68146:77325|75751_76534_-|WP_138334023.1|DBSCAN-SWA MNDRLLSIENLSKGFSSAGRRIAALDNVSLTIAAGETLGLVGASGSGKSTLSRILLRLVAADAGSIRFEGEDWLALNGAPLRRRRARMQMVFQDPLAAFNPVASVGAVLDDPLRIHGVVAKDRRPGEIAKLLERVGLTADHAARPVRALSGGQRQRVAIARAMATRPSLLVLDEAVSALDVTVRGRILELLVDLQKEQGVACLFISHDLAVVRAVSHRIAVMDGGRIVETGPAAAVVAAPQSDAARALVAAVPRLVTDPC >NZ_CP054034|68146:77325|68146_69202_+|WP_138334010.1|DBSCAN-SWA MRILLDNFSKSFGSTKVIENMTLEVGNGEMLALLGPSGCGKSTTLFSVCGIHRPSGGRILFGDRDVTDLPSQARNVGVVFQSYALYPHMTVTENIGFPLKVKGMPAAEIRKEVDRIAALVQIGNLMGRRPAELSGGQQQRVALARALIRKPDVLLLDEPLANLDAKLRLEMRSEIRRLQRETGITAILVTHDQVEAMSMCDRIAIMKEGEIVQIATPAEMYNDPKTAFVAGFLGNPPITFLRGVVDKGAFIIPQSEIRVPLPDSVGAAEGTKLMLGVRPEHFTPAGDIAVPGKVTFAETQGRENLYDVALAGGPLLRSIQPVRSDIHVGDDVRWAIDSRGIFVFDENGRRL >NZ_CP054034|68146:77325|73613_74477_-|WP_171599625.1|DBSCAN-SWA MTRLLDAQGLEISHEGHRTRLKWHRLRKQCADPLFSAEVMAEGFAAGASMELDLRVRADGGFVVLHDKELEGETTGHGPVAEKSASDLRDIRMQEGDRPLILSEDLAAMMQSTHPAALLQFDMKDDFEAIGARGVAHLAAHFRDIAASVIVSGDSLDLIVAVKEKLPDLLRGIDPTDKLYGIRKASGWKAVETELRADLRGPTEPDTIYLHWPLILDAANAGLDMIALCQDEGKRVDAWTFTLKDPEAGFSEEEWRGFSALMALKPDQITTDEAPATERAWRRRMAD >NZ_CP054034|68146:77325|69964_70969_+|WP_138334014.1|DBSCAN-SWA MADRDKGPRKVTSFDVARVAGVSRAAVSRAFTPDASVSPKTREKVYQAAKELGYRVNYLARSLTNKRSDLVGVVAAGLDNPFRTLQIEHLARMLLARNFRPILLPTSPEADTSTVIGQLLHYAVSGVIVTSDAPPTEICEQCAAEGVPIVLINKGNDIPFVDRIISDDRMAGHLAATHLIDSGARKPAVMAAPAISYTARRRSEAFIARCKQLGVEAQFLQVRINDYRSGYDAAAELAASGIDGLFCANDYMACGVIDRVMQGRGRDDTPPLSIIGHDDIPQASWTAYDLTTIRQPCDVQAEKTVDLLMSRIVEPDLTARVEFTPVTLIRRRTA >NZ_CP054034|68146:77325|70968_71778_+|WP_138334015.1|DBSCAN-SWA MNTAFDTAAIRARAEAAIIVALEVGRETARFRHDSAPGSLAVENKGLQDFVTIADRRAEQAISDGLLSRFPDDTFMGEESGGRSGEGGTWVVDPIDGTTNYIRGFRHWGVSIAFVVGGKVEIGVVYDAAGDKVFHAVRGGGAFKDGLPVHAAATVDPANALVILGHSRKTSFDDHLALSRRLHERGMDYRRMGAAAIDLVRVAEGAADLYYERHLNAWDMLAGALIAEEAGAVVAMPPVDRLLAQGGPVIAHSPGLAGEFAFILDIEGL >NZ_CP054034|68146:77325|74645_75740_-|WP_138334021.1|DBSCAN-SWA MSDPGWNPLHGVPSYPAERFAALADRIGGILRSRNDILLVQAEAVVALEAAAVSLARPGLTALNIITSPYGGWFGQWLRRGGATVADIVAEPGLPVEIEAVAKALDAGPRIDVLALVHAESASGILNPLPEILALAQARGIVTVVDAVASVGGHELNVDDLGIDIAVIGPQKALGGPAGVSALSVSGGAWDLILRDGAPRDSILSLADLKSWIDGGRRSLPGTPAPLEFFALETALDRIEAEGLENVVARHALAASETRAGLAALGASAWVPAAKASNLVTAVPVPEALAPAALIATAGQLGVELSEGVGTAPARLVRLNHTGPRAAFQTVLSNVVAYGAALRQAGHAADIAAASEAMSAAYSR >NZ_CP054034|68146:77325|71827_73483_-|WP_138334017.1|DBSCAN-SWA MLVSDMQVVDTKAIAQQANSSLQEAWHLGCTGRSVEALTLAGEILADAKARGDDRLAAQCDTDIAWYCFQIGKAELGLTHVRRAVDFWKLSGEPKQESCARAYSGWLLLELGLPEEAIEEATRALDLADKTADLKAQSLATNVIGIIFWYNKQPDRAILMSERSVELARSIGNKTYECWWLINLGGAHSEGGYIAEALGQPGEGRQMLVRALELTEQALDIATETGDSWAARICLGNKAEFHSHLGEHDKALQCMVRYRLFQENSYVRDRQQYLYTLGQIYNNSGKFAEALSPLLEATELIDDGGSFDSCIQIYLYLSQAYEGLGQFDPALAAHKKYHQAYLRNSAERTQQRARLAEIYYETRRFKEVAETETKRAETLEASYQKLQEQTDILASAVYLDTLTGLYNRKYLDSRFKALTSEKRPYSIAMLDVDHFKSINDNFSHMIGDRVLSTIGAILRSQLRITDQAVRFGGEEFVVLLAGAPRGAADVCEKLRSAIEQWDWSEICNGLHVTISIGVAGTLSAKSPNEILAIADQNLYTAKKSGRNRVVG >NZ_CP054034|68146:77325|76530_77325_-|WP_138334025.1|DBSCAN-SWA MSGVFCSLRQLSVTYGRSGNGAAALDNIDLDIVAGERLAIIGESGSGKSTLARALAGLLPEGTKVGGEILWPGLGHPPRPGRDFGLVFQDPGSSLNPVLTIGEQIAEGARRHLGLSWKQAYIRAEELLERVRIPQPDKAMRAFPHQLSGGQRQRVAIAAAIAARPALLIADEATSALDVVVQAEIVRLLDGLVREDGMTLLFITHDIALASGFVDRIAVFRNARLVEAGPVCSVLSAPKSDYTAALIASHRDLATPPLIVEVRS |
9 | Planktothrix_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
83404 : 84889
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP054034|83404:84889|DBSCAN-SWA GTCATGCGGATATTCCCTGGATGGCGGAGGCAAGAGCGGCCCCGTCTGCCGCGGGCCTCAACACGTGGTGGTCGATGACGAGAATGCGGTCGGCCACCTCATAGGCCTCCTCCGGGTCGGAGGTCGCGATCAGCGTCGCCCGGTCGGTGCGGGCGCGGATTGCCCGGATGATGTCATGACGGGCGCCGACGTCGACGCCCTGGAAAGGTTCGTCGAGCAGCAGCAGCCGGCTTGGTTCTGCCTCCCAGCGGGCGATCACCGCCTTCTGCTGGTTGCCGCCGGACAGCGACCAGACCGAGGCGAGCGGACCCGCGGCCTTGATGCCGAGACGGGTGATTGCCTGCTCGGCCTCGCGGCGCTCACGGCCGCCGACAAGGAAACCGTGCGGATACCATTTGCCAAGATGCGGCAGGCTGATCGTCGCCGACAGCAAATGGCCGGGCCATGCCGGTGGCATCAGCGAGGAACGATGACGGTCTTCGGCCGCCATCGCCACGCCTGATGCAATCGCTTCGGCCGGACCTTTCGGCCGGTAGGGCCGACCGCCGAGGAACATCGCACCGCCTGCAAGCGCAGTTACCCCGAAGATTGCCGATAGCAGCCGGCTCTTGCCGGCGCCAAGCACACCGGTCACCGCCACCACCTCGCCCTCATGCAATGACAGATCGAAGGAGGCAGCCCCAGAGAGCAGTCTGATATCGCGCATCTCGAAAATCACTGGTCCGGTCGCAGGCCGCGCATCCGGCCGAGCCGCATCCAGTCTGCGGCCGATCATCGTCTCGACGGCGCTCGAAAAATCGATCGGCCGCGCGAAGGTGCCGACGACGCGGCCGCCGCGCATGACGAGCGCGCGGTCGGCGATGGCTTCGAGGTCGGCGGTACGATGCGAGATATAGAGGATCGCCAGGCCCCGCTTACGAAGCTTCAGCAGAATATCGAAGAGGCGGCGACTTTCCTCTCCGGAAAGGCTCGCCGTCGGCTCGTCGAGGATGAGGAGGTCGGCGCGGTTGGCAAGCGCGCGAGCGATCGCCACAAGTTGCCGATCGGCGCTGGCGAGGTCACCGAAATCGCGGTCCAGCGGCAGGGTGAAGCCGGCGGCATCGAGCATGGACTGCGCGGCACGGCGGATGCCGGCGCGCGAGACGAAGAAGGGTGTGCTGCGATCGGCAAACCGGTTGAGCAGCAAGGCATCGGCAACGGTCAGACCGGCCGCGCCGACGAGATCCGTCGATTGATGCACCGTGACGACACCCGCCCTTGCCGCTTCGGCCGGGCTGCGCGGCGCAAAGGCGCGGCCATCGAGGCTGACCATGCCGCCATCGGCCGGAAGCACGCCGGAAAGGATCTTGACCAGCGTCGATTTGCCCGCGCCGTTTGCGCCCATCAGAGCCACGATCTCGCCGCGCTCCATAGAAAAATCAGCGCCGGCAAGCGCCCGCGTCGACCCGAAGCTGCGGGTGATATTTTCAATGTGAAGAAGCGGCAT
Protein sequences of DBSCAN-SWA_8 >NZ_CP054034|83404:84889|83404_84889_-|WP_138334035.1|DBSCAN-SWA MPLLHIENITRSFGSTRALAGADFSMERGEIVALMGANGAGKSTLVKILSGVLPADGGMVSLDGRAFAPRSPAEAARAGVVTVHQSTDLVGAAGLTVADALLLNRFADRSTPFFVSRAGIRRAAQSMLDAAGFTLPLDRDFGDLASADRQLVAIARALANRADLLILDEPTASLSGEESRRLFDILLKLRKRGLAILYISHRTADLEAIADRALVMRGGRVVGTFARPIDFSSAVETMIGRRLDAARPDARPATGPVIFEMRDIRLLSGAASFDLSLHEGEVVAVTGVLGAGKSRLLSAIFGVTALAGGAMFLGGRPYRPKGPAEAIASGVAMAAEDRHRSSLMPPAWPGHLLSATISLPHLGKWYPHGFLVGGRERREAEQAITRLGIKAAGPLASVWSLSGGNQQKAVIARWEAEPSRLLLLDEPFQGVDVGARHDIIRAIRARTDRATLIATSDPEEAYEVADRILVIDHHVLRPAADGAALASAIQGISA |
1 | Staphylococcus_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_9 |
97619 : 109171
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >NZ_CP054034|97619:109171|DBSCAN-SWA AATGACCACACAGATCGAGCTCAGAGGCGTCAACAAATATTACGGCGCCTTCCACGCCCTGAAGAACATCGACCTTTCGATCGCCAAGGGCACTTTCGTCGCACTCGTCGGCCCGTCCGGCTGCGGCAAGTCCACGCTGCTGCGCTCGCTTGCCGGCCTTGAAAGTATTTCATCGGGCGATCTCAGGATTGCCGGCGAGCTGATGAACGGCGTGCCGCCGCGCAAGCGCGATGTCGCCATGGTGTTCCAGTCCTATGCGCTCTATCCGCATATGACGGTCGAGGAGAACCTGACCTACAGCCTGCGCATCCGTGGCATCGCCAAGGCGGAGGCCAAGAAGGCCGCCGAGGACGTGGCGGCGACGACAGGTCTTTCCCATCTCCTGAAGCGCTATCCGCGCGAGCTTTCCGGCGGTCAGCGCCAGCGCGTCGCCATGAGCCGCGCCATCATCCGCCACCCCAAGGCCTTCCTGTTCGACGAGCCGCTGTCCAATCTCGATGCTGCGCTGCGCGTCCACATGCGCAAGGAAATCCGGTCGCTGCACGACCGACTGCACGCCACCTTCGTCTACGTCACGCACGACCAGGTCGAAGCAATGACGATGGCCGACCATGTGGTGGTGATGCGCGACGGCGTCATCGAACAGCAGGGCGCGCCGCTGGACCTCTACGACCGGCCGGCGAACCGGTTCGTTGCCGGCTTCATCGGTTCGCCGGCCATGAATTTCATCCCCGCGATCGCCGCGGAAGACGGAAAGAGCCTGATCCTGGATTTCGGCGCGGTGAAACAGACGCTTGCGATATCACGCGCCATCGAACCCGGGCGTAAGCTCATCGCCGGCATCCGGCCCGAACATATCGGCGTCGTCGAGCCCGGGCATGGCAGTTTCGATGTGCCGATCGCCTTCGTCGAATCGACCGGTTCCTCGACCTTCATCGTCGCGGAGACCCAGCCGGAACTGACGATCGTCGAGACGCGGCGCGACAGGGTCAAGGCAGGAGACATGATCGGACTTTCGATCGATCCGACCCAGATCCATCTCTTCGACGCGTCGACGGATCACTTGGTATAGACTCAGTCGAGCACCGATCCGGCAGAGCAGCCGTCGTCTGATCGTAGGAGACGTCTCTCGCGTCCATGATCTTGCCGAGGGAAGCACGCCTCAGGTTCTCAAAGAACGGCAAAACGGACAGTGCGTGCCGCGGTCAGGTCACGCGCGCGCGATGAGTGTGTCGTGAATATTACAATTGCATGACGCTTCTGGAATAAAAGCGAAGAAACCCACGATCCGTTCAATGTCATGGAACTTTGAAAAAGTGGTCATCGCGCTGCAATAATCGCGGTTCAAACAGCCCCGCACAAGTGGGGGCTCCATGCGAAAGAAAAACGTTCTACTCATCGTCGTTGACCAATGGCGAGCCGATTTCGTTCCGCACGTTCTGCGTGCCGACGGGAAAAACGATTTCCTGAAGACACCCAATCTCGACCGGCTCTGTCGCGAGGGCGTGACCTTCAGGAACCATGTGACGACCTGCGTTCCCTGCGGCCCGGCCCGTGCGAGCCTGCTCACCGGCCTCTACCTGATGAACCATCGCGCCGTGCAGAACACGGTGCCGCTCGACCAGCGCCACCTCAACCTCGGCAAGGCGTTGCGCGGCGTCGGCTACGATCCGGCGCTGATCGGCTATACGACGACGGTGCCGGATCCGCGCACCACTTCGCCGAACGATCCGCGCTTCAGGGTGCTCGGCGACATGATGGACGGCTTCCATCCGGTCGGCGCCTTCGAGCCCAATATGGAGGGCTATTTTGGCTGGGTGGCACAGAACGGCTTCGAGCTGCCGGAGCATCGTCCTGATATCTGGCTGCCCGAGGGCGAGGGCGCCGTTGCCGGCGCCACCGACCGTCCGTCACGCATTCCGAAGGAATTTTCGGACTCGACCTTCTTCACCGAGCGGGCGCTGACCTATCTCAAGGGCCGCGACGGCAAGCCCTTCTTCCTGCATCTCGGTTATTACCGGCCGCATCCGCCCTTTGTCGCCTCTGCTCCCTATCATGCCATGTACCGGCCGGAAGACATGCCGGCGCCGATCCGCGCAGCCAACCCGGACACCGAGGCTGCACAACATCCGTTGATGAAATTCTATGTCGACAGCATCCGCCGCGGCTCCTTCTTCCAGGGCGCAGAAGGGTCCGGTGCGACGCTCGACGAAGCGGAACTGCGCCAGATGCGCGCGACCTATTGCGGCCTGATTACTGAGGTCGACGATTGCCTTGGCCGGGTCTTTAACTATCTCGATGAGACCGGACAGTGGGACGATACGCTGATCATCTTCACCAGCGACCACGGCGAGCAGCTCGGCGATCATCACCTGCTCGGCAAGATCGGCTACAACGATCCGAGCTTCCGCATTCCGCTCGTCATCAAGGACGCCGGCGAAAATGCACGCGCCGGTGCGATCGAAAGCGGCTTCACCGAGAGCATCGACGTCATGCCGACCATTCTCGACTGGCTCGGCGGAAAGATCCCGCACGCCTGTGACGGCCTGTCGCTGCTGCCCTTCCTGAGCGAGGGCCGGCCGCAGGATTGGCGTACCGAATTGCACTACGAATACGACTTCCGCGACGTCTATTATTCCGAGCCGCAGAGTTTCCTCGGCCTCGGCATGAATGATTGCAGCCTCTGCGTCATCCAGGACGAGCGATACAAATACGTTCATTTCGCAGCGCTGCCGCCGCTGTTCTTCGATCTGCAGCACGATCCGAACGAATTCACCAATCTCGCCGACGATCCGGCCTATGCCGCGCTCGTGCGGGACTATGCGCAGAAGGCGCTCTCCTGGCGCCTGAAACATGCCGACAGAACCCTGACCCACTATCGCTCCGGCCCCGAAGGCCTGAGCGAACGCAGCCATTGATCCGAACCACCAGAGAACCCAAGGAGATATTCCCATGGTCACCTTCACCCGTCGCGGTGCTCTCGGCCTTGCCACCGGCGTCGCCAGCTCGCTGATCCTGCCGCGTTTTTCCATCGCCCAGGCCGACAACCGTCCTTCGATCACCATCGCCGTCCAGAAGATCTCGAACTCGAACACGCTGGATACGTTGCGCGAACAGTCGAATGTCGGCCAACGCATCTTCAATTCGTCGCTCTGGGAAAGCTTGATCGGCCTCGACTGGCTCGGCAACCTCTCGGCCGTTCCGTCGCTCGCCACCGAATGGCGCCGCATCGACGACAAGACCGTCGAGCTGAAACTTCGCCAGGGCGTCAAATTCCACAATGGCGACGAGATGACGGCCGAAGACGTCGCCTTCTCCTTCAGCAAGGAACGCATGTTCGGCGATACGCAGCCGAGCACCGGCAAGACGATCTTCGTCACCGAAAAGAACCCGCTCGGCCGCGAAAGCAAGGAACTGCCGGTCGAAATTCCGGCCGTCGCCCGCCGTATCTGGCCGGCCCTGCTCGGCATCGAAATCATCGACAAATACACCGTCCGCTTCGTCAACGGTTCGCCTGAGGTGACGATGGAAGGCCGCATCTCCGTTGCCGCCAGCGCCATCGCCAACCGCCGCAGCTGGGATGAGGCCAAGAGCTATCTCGACTGGGCCCGCGCACCGATCACCACCGGTCCCTACCGCGTCGCCGAATTCAAGCCGGATACCTACCTGATCTACGAAGCGCATGACGAGTATTGGGGCGGCCGTCCTCCGGTGAAGCAGATCCGCTTCGTCGAAGTTCCGGAAGTCGCCTCGCGCGTCAACGGCCTGCTGTCCGGCGAATATCACCTCGCCAGCGACATCCCGCCGGACCAGATCGAAGGCATTGAAAAGAACGCGGCCTTCGAAGTCCAGGGCGGCATCATCACCAACCACCGCCTGACTGTGTTCGATAAGACCCATGCCCAGCTCGGAAACCCGCTGGTGCGCCGTGCCTTCACGCATGCCATCGACCGCCAGGCGATCGTCGACTCGCTCTGGGCAGGCCGCACGCGCGTTCCGGCCGGCCTGCAGTGGGACTTCTATGGCGACATGCTGGTCAAGGATTGGACCGTGCCGGAATATAATCCGGATCTCGCCCGCCAACTTCTCAAGGAAGCCAACTACAAGGGCGATCCGATCCCGTATCGCCTGCTGAACAACTACTATACCAACCAGACCCCGACGGCGCAGATCCTCGTCGAAATGTGGGCGCAGGTCGGCCTCAATGTGCAGATCGAGATGAAGGAAAACTGGCAGCAGATCCTCGAAAAGACGCCGACGCGCGCCGTCCGCGACTGGTCGAACTCCGCGAGCTTCCCCGATCCGGTTTCCTCGATCGTCGCCCAGCACGGTCCGAACGGCCAGCAGCAGCAGGTCGGCGAGTGGACCAACGCCGAAATGAACACGCTGTCGACCTTCCTCGAGACCAGCACCGATCGCGCCAAGCGTCATGACGCTTTCGCCCGCATGCTGCAGATCTGCGAGCGCGAAGACCCCGCCTATACCGTTCTCCACCAGAACGCCGTCTTCACCGCCAAGTCGAAGTCGATCAATTGGAAGGCCGCATCGGCCTTCGCCATGGACTTCCGCCAGGGCAACTGGTCCTGATCAAGACCTGACGACATCAGCGAGCCGGGGGAAACTCCGGCTCGCTGCCATTTGGTGCACCGGCTATCCGGAAACCAGGCAAGCCCGACAGGCACGAGCATGATGCCGAATAGTGTGAGCGGTTTTCGGACGACATCATGCTCTCATTCGGATCTGGCGGCCGGCGGGAATGCAAGAGGGAGAGGCGAATGCCATTGGTCGAAATTTCCAATTTGAAGGTCGCTTTCAGCGGTATCGAAGTCCTGCACGGCGTCGATCTGGCGATCGAGAAGGGCGAGGCGGTGGGTCTCGTCGGCGAATCCGGCTGCGGCAAGTCGGTCACCTGGCTGGCGGCACTCGGGCTCCTGCCTGGAAAGGCTTCCGTCTCAGGCAGCGTGCGTCTCGGCAGCGATCAGCTGATCGACGCGTCGCGCACGAAGCTCGAAAGCATACGCGGCGGCCGCATCGCCATGATCTTCCAGGATCCGTCGAGCTCGCTCAACCCGGTTATCCGCGCCGGGCGCCAGATCGCCGAAGCCGTCGAACTGCACCGCGGCCTGACCGGCAGGGCCGCCCGCAACGAGGCCGTCCGCCTGATGGAAATGGTCGGCATTCCTGACGCGGTGCGCCGTTTCGACAATTTTCCGCATGAATTTTCCGGTGGCCAGAACCAGCGGCTGATGATCGCCATGGCGCTCGCCGGCAATCCTGATCTGCTGATCGCCGATGAGCCGACGACGGCGTTGGACGCAACGATCCAGGCGCAGATCCTCGATCTGCTGATTTCGATCCGCGAGGAAACCGGCATGGCGATCGTCTTCATCAGTCATGATCTCGGGGCGGTGTCGCAGATCTGCGAACGCGTCTGCGTCATGTATGCCGGCAACATCGTCGAGAAATGCCCGACGGAATCGTTGTTCCGGTCGCCGCGCCATCCCTATACGCGCGGACTATTCGATGCCATTCCGCGTATCGATGCCGGCCGGGACCGGCTGGTACCGATACCCGGCACCGTGCCGCAGCCCGGCCGGATGCCCGGCGGCTGCGCCTTTGCGCCGCGTTGCGGCCATGCCTCGGAACTCTGTCATAACAAGGTGCCGCCGCTCGTCGCACTTGACGACGACCGTGCAACCGCCTGCTTCCATCCGCTCGGCGACGGCGCCACCGCATCAAAGCCGAACCTCACGCAGGCACAGCAGGGAGTGGCATGGGCATGAGCGAACATCAGCTGATCGAAGCGAACGACCTCGTCAAGACCTATACGATGCGCCGCGGCGTCTTCGGAAAGCCGAGCCATGTCCGGGCGGTCGACGGCGTTTCGCTGTCGGTCGCGCCGAAGACGACGCTCGGCATCGTCGGTGAATCCGGCTCAGGAAAATCGACGATGGGCCGGCTGCTGCTCGGGCTCGAAGCTCCGACCGAAGGCAGCGTCCGATTCGACGGGGAGGCGATGCCGGCACTCAGAACGCCGCGCTGGCGTAGCCTGAGGGCGCGCATGCAGCTCGTCTTCCAGGATCCGTTGGCAGCCCTTGACCGGCGTATCTCGATCGGCGCTCAGATCGGCGAACCGCTTGCCATCCATGCCGTCGGAAGCGGGGAGGGGCGGCGCGAACGTGTCGACGAACTGCTCGTCGCCGTCGGTCTTCGCCGCGATCAGGCGGAGCGTTATCCGCACGAGCTTTCGGGCGGCCAGCGCCAGCGCGTCGTCATAGCGCGCGCCATCGCCACCAATCCGGAGCTTCTGGTCTGCGACGAACCGGTCTCGGCTCTCGACGTCTCCATCCAGGCGCAGGTGATCAACCTGTTGCGCGACCTGCAGGAAAAGCGCGGCATCGCCATGGCCTTCATCAGCCATGACCTGAAGGTCGTACGCAACATCGCCGATCGAGTCGCGGTGATGTATCTCGGCCGGATCGTCGAGGAAGCAGCTTCGGAGGATATTTTCCGCAGCCCGTTGCATCCCTATACGCAGGCACTGGTCTCCAGCGTTCCGGTTCCCGGCACGGCGCTGCGCGACCGTATCATCCTGCAGGGAGAACCGCCGAACCCCGCTGCCCGGCCTGCCGGCTGCGCCTTTCATCCCCGCTGCGGCCATGCGGTCGAGCGCTGTCGCATCGAGACGCCTGAACTCGTCACCGTCGAGGCGGGCCGGAGAGCTGCCTGCCACCTCGTTGCGCCCGCACCCGCATCAACGCTCATGGAGAGCTGAGCCATGATCCGTTTCTTCCTGATAAGAGCGTTCCGCGCACTGATGACCATCGTGCTGGTGGTCACCTTCGCCTTCGTCGTCCTGCGTCTTTCCGGCGACCCGGCCTTGACGATCATGGGTCCGGAAGCGCCGCCGGAGGCAATCCGCGCCTTCCGCACCGCCTGGGGTCTCGATCAGCCGATCTGGGTGCAGTATCTGCGTTATTTCGGGGCGATCGCCCGCGGGGATCTCGGTATCTCGATGCGCGACGGCCAATCGGCGATCCAGCTCGTGCTCGACCGCATTCCGGCAACGCTGGAGCTGACGATCCCGGCCCTCATCCTTAAGCTGGTGATCGGCATTCCGGCCGGCGTCTACGCCGCCCTCCACCGCGACAGCTCGACCGACCGCGCCGTCATGGCGAGCGCGGTGGTCGGCTTCACCATGCCGAGCTTCGTGCTCGGCCTCGTGCTGGTGCTGATCTTCTCCGTCACGCTCGGCCTGCTGCCATCGGGCGGCCAGGACAGCTGGATGCATGCCATTTTGCCGATCATCACCATGAGCATCGGCGGTGCCGGCATTCTTGCCCGCTTTGCCCGAAGCGCGATGATCGAGGTGCTTGGCCAGCCCTATATCCGCACCGCCAGCGCCAAGGGCACTGCCTGGCGAAACGTCGTCTGGCGCCATGCGCTGCCGAATGCGGCGATCCCGATCGTGACGATCGCCGGCTTCATGGTCGGCACGCTGATCGCCGGCGCCGTCGTGGTGGAATCGCTGTTCTCCTGGCCAGGCGTCGGCCGGCTTCTCGTCGTCGCCGTCTCAAACCGCGATCTCGCCGTCGTCCAGTGCATCCTGTTGCTGGTGGCAGCGACCATGGTGTTTTCGAATTTCGTCGTCGATATCCTTTACGGCTATCTCGACCCGCGTCTGCGCAGCAACCAGGCAAGGCATTAAGGAGCGGAACCATGACAAATATCTCTGCTCAACCGGCGACCGTCATCCACGAAGTGCGCGCAACGAAGAAGCGCGGCGTGCCCGTTTTCGTGATCATCGGCTTCGCCTGGATCGCCGTCGTGATCCTGATCGCGCTGACGGCCGACTGGATCCGGCCCTACAATATCACCGCCTTCGACCTGAAGAACCGGCTGTCACTGCCAGGCAATGCCGCGCATTGGCTCGGAACCGACGAACTCGGCCGCGATGTTCTCTCCCGCCTCATCGTGTCGATCCGCATCTCGCTGCTGATCGCCTTCGGCGCAACGCTGATCTCGGCCTTCGTCGGCACGACGCTCGGCTTTCTCGCCGCCTATTTTCGCGGCGTTGTCGAACAGATCGTCGTCATGCTTGCCGATTTTCAGGCCGCCATGCCGTTCCTGATCATGGCGCTCGCCGTACTTGCCTTCTTCGGCAGTTCGCTGCCGCTGCTTATCTGCCTGATGGGCTTCTACGGCTGGGAACGCTATGCGCGCATCGCCCGCGGCCTTGCCATCGCCGCCAGTGGCCAGGGTTATGCCGCCGCCGTCACGCAGCTCGGCGCCAAGCCGGCCCATGTCTATCTCAAGCATATCCTGCCGAACATCGCCTCGACGCTGATCGTCTCGATGACGCTGACTTTCCCAGAGATCATCCTGATGGAAAGCAGCCTTTCCTTCCTCGGCCTCGGCGTGCAGCCGCCGATGAGCAGCCTCGGTAACATGGTCGGCTACGGACGCGAATTTCTGACCCGCGCGCCCTGGATCATGCTGGCGCCGTCCTTCGTCATCATGCTGACGACGCTGTCGATCAGCATCACCGGAGACTGGCTGCGCGACAAGCTCGACCCGACGATCGGCTGACATCCTGCCGGTCGAGCTTCGATCGTCTCGCAAAAAAACAGATCCTGCACATTCATCGCTGTTGACAACGGCGAGCAATCGATATTGTCTTGTAAAGCGCTTTACTAAACCGCTTTACAATAGGTCCATCAATGGCCAGGGGAACGACGCCAAGTCTGAAGGATGTTGCCGCCGCTGCCAGCGTCTCGGTCACCACCGTGTCGCGTTTCGTCAACGGAAGTCTCGATCTTCCCTTTCAGACCAAGAAGCGCATCGAGGATGCGATCAAGACGCTGAACTACCGGCCGAACCCGCATGCGCGCCGGTTGAGCAGGGGCCGCTCGGACACGATCGGCCTCGTCGTGCCCGATATCGCCAATCCCTTCTTCGCAACGCTCGTCGCCGCCGTCGAGCAGGCGGCCGATGAAAAGAAGCTCGCGGTCTCATTGCACGCCACGCTCAACCGGCCGGGACGCGAGATCGAATATCTGCAGCTGATCGAGCGCAATCACGTCGACGGCCTGATTTTCGTCACCAACCATCCCGACGACGGCGCGCTTGCCGCGCTGATCAACGGCAGCGGCAAGGTCATCATCGTCGACGAGGACATTCCCAATTCGAAGGCGCCGAAGCTGTTCTGCGACAATGAGCAGGGCGGTTATCTCGCCGGCCAGCATCTGGCCGAACAAGGCCATCGCCATGTCCTGTTCATCGGCGGCGACGAGCGCATGATCAGTGCCCGCAGGCGTTATGACGGCCTGCTGAAGGCGCTCAGGGAACAGCATGGCGACGAGGCCCGGGCCGATCGTTATGCCGGCGAATATACGATAGAATACGGCCGTGCGGCTGCGCTCGACTATCTCTCCGGAAACCGGAAGGCGACGGCGATCTTCGCCAGCTCCGACGAGATCGCCATCGGGCTGGTCGAGGTCTTCAGAAGCCAAGGCGTCTCGATCCCTGCTGACATCTCGGTCATCGGCTTCGACGATGTCGGTCCGCTTCATCTCTTTGCGCCGCCGCTGACGGCCATCCGTCAGCCGGTGCGCCAGATCGGCCGGCGGTCGCTGGAGCTGCTACTGGAAACCAATTGGCACGAGTGGAAACCATCCGCCTCGGAAGAACTCCTGCCTGTCGAAATCGTGGTGCGGAACTCCGTTGCGCCGCCTGCGAAATAATAATCAGCAACGTCAAAAAACCAAAAGAGGATGAGAACATGACACTGAATTTGACAAGACGAACGATGATCGCCGCCGCCGGCTTCACGGCATTGACCTTCATCGGCCTGTCCAGCGCCGCGGCGCAGGAAAAGAAGACCGTCGCGCTGGTGCAGATCAACCAGCAGGCACTTTTCTTCAACCAGATGAATGAGGGCGCCCAGAAGGCGGCCGATGCCGCCGGCGTCAAGCTGGTGATCTTCAACGCCAACAACGAGACGACGGCGCAGAACAGCGCCATCGAAACCTATGTTCAGGAAAAGGTCTCCGGCCTTGCGGTCGTGGCGATCGACGTCAATGGTATCATGCCGGCGGTCAAGCAGGCGGCCGATGCCGGCATTCCCGTCGTCGCCATCGATGCCATCCTGCCTGATGGCCCGCAGAAGGCACAGATCGGCGTCGACAATGCCGCGGCCGGCGCCGATATGGGCAAATACTTCCTCGACTACGTCAAGGCGAACATGGGCGGCAAAGCCAAGCTCGGCGTTGTCGGCGCGCTGAACTCGTTCATCCAGAACATCCGCCAGGAGGGCTTTGAAAAGACCATGAAAGGCGTCGACGGCATTGAAATGGCCGGCGTCGTCGATGGGCAGAACATCCAGGACAACGCCCTTGCGGCGGCTGAAAACCTGATCACCGCCAATCCTGACCTGACCGCGATCTACGCCACCGGCGAGCCGGCCTTGATGGGGGCGATCGCCGCCGTCCAGAGCCAGGGCAAGCAGGATAAGATCAAGGTGTTCGGCTGGGATCTGACTGCCGAAGCCATTGCCGGCATCGATGCCGGCTTCGTCGTCGCCGTCGTCCAGCAGGATCCGGCCGCAATGGGCGGTGCCGCCGTCGACGCACTCGTCAAGGCCTCGGCCGGCGAAGCCGTCACCAAGACGATCTCGGTGCCGATCACCATCGTCACCAAGGAGAATGTCGAGCCGTACCGGGCCGTCTTCAAATGATCGAAAGCGGACAGCGTGACATCGCTGCCGCCGCTTCCGGAGGGCTCGCACCCTCCGGAACCGGAACTTCCGGCGGCACCCGATCCGACCCGCGCATCTCGCTACGCGGCATCCGCAAGTCGTTCGGCTCGCACCAGGCCCTGCGCGGCGTCGATCTCGACATCTTCCCCGGCGAATGCCTGGGCCTCGTCGGCGATAACGCCGCCGGCAAATCGACGCTGACGAAAATTATCTCGGGAACCTACATTCCCGATGCCGGCACAATCACCATGGAAGGCGAAGAGATTCGTTTCTCCGGCCCCGCGGATGCGCGTGGGCGTAACATCGAAATGGTGTTCCAGGATCTCAGTCTCTGCGATCATATCGATGTCGTCGGCAATCTCTTTCTCGGCCGTGAACTCAGCCGCGGGCCGTTTCTCGACACCAGGACGATGTTGGCCGAAGCGCGCAAGATGCTGGATTCGCTTGAGATCCGCATTCCGAGGCTGACCGGCAAGGTCGCTCAGCTCTCCGGCGGGCAGCGCCAGGCGATCGCCATTGCGCGCGCCGCCTCGTTCAAGCCGAAGGTGCTGATCATGGACGAGCCGACGTCAGCACTTGCAGTCGCGGAGGTCGAAGCGGTGCTGGCCCTGATCAACCGCGTCAAGGCGAACGGCGTTTCGGTGATCCTCATCACCCACCGGCTGCAGGACCTCTTCCGGGTCTGCGACCGCATCGCGGTGATGTACGAGGGCACGATGGTGGCCGAGCGGCAGATCGGCAGCACCAACCTCGAGGATCTCGTCAGGCTGATCGTCGGTGAAGGAGCCAGACAATGA
Protein sequences of DBSCAN-SWA_9 >NZ_CP054034|97619:109171|98991_100536_+|WP_138334056.1|DBSCAN-SWA MRKKNVLLIVVDQWRADFVPHVLRADGKNDFLKTPNLDRLCREGVTFRNHVTTCVPCGPARASLLTGLYLMNHRAVQNTVPLDQRHLNLGKALRGVGYDPALIGYTTTVPDPRTTSPNDPRFRVLGDMMDGFHPVGAFEPNMEGYFGWVAQNGFELPEHRPDIWLPEGEGAVAGATDRPSRIPKEFSDSTFFTERALTYLKGRDGKPFFLHLGYYRPHPPFVASAPYHAMYRPEDMPAPIRAANPDTEAAQHPLMKFYVDSIRRGSFFQGAEGSGATLDEAELRQMRATYCGLITEVDDCLGRVFNYLDETGQWDDTLIIFTSDHGEQLGDHHLLGKIGYNDPSFRIPLVIKDAGENARAGAIESGFTESIDVMPTILDWLGGKIPHACDGLSLLPFLSEGRPQDWRTELHYEYDFRDVYYSEPQSFLGLGMNDCSLCVIQDERYKYVHFAALPPLFFDLQHDPNEFTNLADDPAYAALVRDYAQKALSWRLKHADRTLTHYRSGPEGLSERSH >NZ_CP054034|97619:109171|100570_102205_+|WP_138334058.1|DBSCAN-SWA MVTFTRRGALGLATGVASSLILPRFSIAQADNRPSITIAVQKISNSNTLDTLREQSNVGQRIFNSSLWESLIGLDWLGNLSAVPSLATEWRRIDDKTVELKLRQGVKFHNGDEMTAEDVAFSFSKERMFGDTQPSTGKTIFVTEKNPLGRESKELPVEIPAVARRIWPALLGIEIIDKYTVRFVNGSPEVTMEGRISVAASAIANRRSWDEAKSYLDWARAPITTGPYRVAEFKPDTYLIYEAHDEYWGGRPPVKQIRFVEVPEVASRVNGLLSGEYHLASDIPPDQIEGIEKNAAFEVQGGIITNHRLTVFDKTHAQLGNPLVRRAFTHAIDRQAIVDSLWAGRTRVPAGLQWDFYGDMLVKDWTVPEYNPDLARQLLKEANYKGDPIPYRLLNNYYTNQTPTAQILVEMWAQVGLNVQIEMKENWQQILEKTPTRAVRDWSNSASFPDPVSSIVAQHGPNGQQQQVGEWTNAEMNTLSTFLETSTDRAKRHDAFARMLQICEREDPAYTVLHQNAVFTAKSKSINWKAASAFAMDFRQGNWS >NZ_CP054034|97619:109171|108349_109171_+|WP_138334070.1|DBSCAN-SWA MIESGQRDIAAAASGGLAPSGTGTSGGTRSDPRISLRGIRKSFGSHQALRGVDLDIFPGECLGLVGDNAAGKSTLTKIISGTYIPDAGTITMEGEEIRFSGPADARGRNIEMVFQDLSLCDHIDVVGNLFLGRELSRGPFLDTRTMLAEARKMLDSLEIRIPRLTGKVAQLSGGQRQAIAIARAASFKPKVLIMDEPTSALAVAEVEAVLALINRVKANGVSVILITHRLQDLFRVCDRIAVMYEGTMVAERQIGSTNLEDLVRLIVGEGARQ >NZ_CP054034|97619:109171|102393_103401_+|WP_138334060.1|DBSCAN-SWA MPLVEISNLKVAFSGIEVLHGVDLAIEKGEAVGLVGESGCGKSVTWLAALGLLPGKASVSGSVRLGSDQLIDASRTKLESIRGGRIAMIFQDPSSSLNPVIRAGRQIAEAVELHRGLTGRAARNEAVRLMEMVGIPDAVRRFDNFPHEFSGGQNQRLMIAMALAGNPDLLIADEPTTALDATIQAQILDLLISIREETGMAIVFISHDLGAVSQICERVCVMYAGNIVEKCPTESLFRSPRHPYTRGLFDAIPRIDAGRDRLVPIPGTVPQPGRMPGGCAFAPRCGHASELCHNKVPPLVALDDDRATACFHPLGDGATASKPNLTQAQQGVAWA >NZ_CP054034|97619:109171|107399_108353_+|WP_138334068.1|DBSCAN-SWA MTLNLTRRTMIAAAGFTALTFIGLSSAAAQEKKTVALVQINQQALFFNQMNEGAQKAADAAGVKLVIFNANNETTAQNSAIETYVQEKVSGLAVVAIDVNGIMPAVKQAADAGIPVVAIDAILPDGPQKAQIGVDNAAAGADMGKYFLDYVKANMGGKAKLGVVGALNSFIQNIRQEGFEKTMKGVDGIEMAGVVDGQNIQDNALAAAENLITANPDLTAIYATGEPALMGAIAAVQSQGKQDKIKVFGWDLTAEAIAGIDAGFVVAVVQQDPAAMGGAAVDALVKASAGEAVTKTISVPITIVTKENVEPYRAVFK >NZ_CP054034|97619:109171|97619_98690_+|WP_138334055.1|DBSCAN-SWA MTTQIELRGVNKYYGAFHALKNIDLSIAKGTFVALVGPSGCGKSTLLRSLAGLESISSGDLRIAGELMNGVPPRKRDVAMVFQSYALYPHMTVEENLTYSLRIRGIAKAEAKKAAEDVAATTGLSHLLKRYPRELSGGQRQRVAMSRAIIRHPKAFLFDEPLSNLDAALRVHMRKEIRSLHDRLHATFVYVTHDQVEAMTMADHVVVMRDGVIEQQGAPLDLYDRPANRFVAGFIGSPAMNFIPAIAAEDGKSLILDFGAVKQTLAISRAIEPGRKLIAGIRPEHIGVVEPGHGSFDVPIAFVESTGSSTFIVAETQPELTIVETRRDRVKAGDMIGLSIDPTQIHLFDASTDHLV >NZ_CP054034|97619:109171|106338_107361_+|WP_138334066.1|DBSCAN-SWA MARGTTPSLKDVAAAASVSVTTVSRFVNGSLDLPFQTKKRIEDAIKTLNYRPNPHARRLSRGRSDTIGLVVPDIANPFFATLVAAVEQAADEKKLAVSLHATLNRPGREIEYLQLIERNHVDGLIFVTNHPDDGALAALINGSGKVIIVDEDIPNSKAPKLFCDNEQGGYLAGQHLAEQGHRHVLFIGGDERMISARRRYDGLLKALREQHGDEARADRYAGEYTIEYGRAAALDYLSGNRKATAIFASSDEIAIGLVEVFRSQGVSIPADISVIGFDDVGPLHLFAPPLTAIRQPVRQIGRRSLELLLETNWHEWKPSASEELLPVEIVVRNSVAPPAK >NZ_CP054034|97619:109171|103391_104393_+|WP_138334061.1|DBSCAN-SWA MGMSEHQLIEANDLVKTYTMRRGVFGKPSHVRAVDGVSLSVAPKTTLGIVGESGSGKSTMGRLLLGLEAPTEGSVRFDGEAMPALRTPRWRSLRARMQLVFQDPLAALDRRISIGAQIGEPLAIHAVGSGEGRRERVDELLVAVGLRRDQAERYPHELSGGQRQRVVIARAIATNPELLVCDEPVSALDVSIQAQVINLLRDLQEKRGIAMAFISHDLKVVRNIADRVAVMYLGRIVEEAASEDIFRSPLHPYTQALVSSVPVPGTALRDRIILQGEPPNPAARPAGCAFHPRCGHAVERCRIETPELVTVEAGRRAACHLVAPAPASTLMES >NZ_CP054034|97619:109171|105337_106207_+|WP_138334065.1|DBSCAN-SWA MTNISAQPATVIHEVRATKKRGVPVFVIIGFAWIAVVILIALTADWIRPYNITAFDLKNRLSLPGNAAHWLGTDELGRDVLSRLIVSIRISLLIAFGATLISAFVGTTLGFLAAYFRGVVEQIVVMLADFQAAMPFLIMALAVLAFFGSSLPLLICLMGFYGWERYARIARGLAIAASGQGYAAAVTQLGAKPAHVYLKHILPNIASTLIVSMTLTFPEIILMESSLSFLGLGVQPPMSSLGNMVGYGREFLTRAPWIMLAPSFVIMLTTLSISITGDWLRDKLDPTIG >NZ_CP054034|97619:109171|104396_105326_+|WP_138334063.1|DBSCAN-SWA MIRFFLIRAFRALMTIVLVVTFAFVVLRLSGDPALTIMGPEAPPEAIRAFRTAWGLDQPIWVQYLRYFGAIARGDLGISMRDGQSAIQLVLDRIPATLELTIPALILKLVIGIPAGVYAALHRDSSTDRAVMASAVVGFTMPSFVLGLVLVLIFSVTLGLLPSGGQDSWMHAILPIITMSIGGAGILARFARSAMIEVLGQPYIRTASAKGTAWRNVVWRHALPNAAIPIVTIAGFMVGTLIAGAVVVESLFSWPGVGRLLVVAVSNRDLAVVQCILLLVAATMVFSNFVVDILYGYLDPRLRSNQARH |
10 | Planktothrix_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_10 |
115550 : 116630
Sequences of DBSCAN-SWA_10
Nucleotide sequences of DBSCAN-SWA_10 >NZ_CP054034|115550:116630|DBSCAN-SWA TTTAAATCCGCAGGAGCTCTGCGATCTCCACATCGCCCTTTGCCGGCATCGACTGATGGACGAGCTCGTTGTCGCGGAAGACGAACAAGCCGGCGAAACTCGGCTCGACCCTCATCCCGGCCCGAATCCGATCGTCCGGTGCGACCATCGCCTTCAGGCGGGTGCCGTCGGCAAGATCGACATCGACCATCTTGTGGGTGCCGAAATCGACGGTGCGATGGACTGTCGCGGCATCGACCCGCTCCGAGGGACGGATCGTCAGCGCTTCCGGGCGGGCGGCAAGCGTCACCGGCCCATCCTCGACGGGAACCGGAAGCGGGAACAGCGGATGCTCACAGACGCCGTTTCTAACGCGGCTCTGGACAAAATTCATCGAGCCGATGAACCCGGCGACGAAAGCCGTCTGCGGCTGCCGGTAGATGGTGCTCGGCGGCGCGATCTGTTCGGTGCCTCCGTCGCGCATGACGACGATGCGGTCGGCGAGCGCTAAGGCCTCGTCCTGGCCGTGGGTGACGAAGAGCGTGGTGATGCCAAGGCGCTGCTGGATATCGCGCACTTCCTCCCGCAGCCTTTCGCGCAGATGCTGGTCGAGGCTGGCGAAGGGCTCGTCGAGCAGAAGGATCTTCGGCTCAAGCACGAGCGAGCGGGCAAGCGCGACGCGCTGCTGCTGACCGCCGGAGAGTTGCGTCGTCATCCGCCGCCCGTAATCGGCGAGGCCGACCAGGGCCAAAGCATCCTCGACGCGTTGCCGGATTTCCGCTTTCGGCAGCCGGCGGAGCTTCAGGCCGAAGGCGATATTGTTGAAGACGTCCATGTGCGTCCAGAGCGCATGGCTCTGGAACACCATGCCGGTCGGGCGCCGCTCGGGCGGCAGCACCGTCACATCCTTCGCGTCGATGCGGATCGTTCCGCCGCTTGGGCGCTCGAAGCCGCCGATCATCCTGAGAAGCGTCGACTTGCCGGAACCGGATGGGCCGAGCAGGCAGACGAGCTCGCCGTCTCCGACTTCGAGCGAGAAGTCGCGAACGGCAAAGGCGGTCCCGAACAGCTTCGAAACGCCCTCGATCGAAAGATGTGCCAT
Protein sequences of DBSCAN-SWA_10 >NZ_CP054034|115550:116630|115550_116630_-|WP_138334086.1|DBSCAN-SWA MAHLSIEGVSKLFGTAFAVRDFSLEVGDGELVCLLGPSGSGKSTLLRMIGGFERPSGGTIRIDAKDVTVLPPERRPTGMVFQSHALWTHMDVFNNIAFGLKLRRLPKAEIRQRVEDALALVGLADYGRRMTTQLSGGQQQRVALARSLVLEPKILLLDEPFASLDQHLRERLREEVRDIQQRLGITTLFVTHGQDEALALADRIVVMRDGGTEQIAPPSTIYRQPQTAFVAGFIGSMNFVQSRVRNGVCEHPLFPLPVPVEDGPVTLAARPEALTIRPSERVDAATVHRTVDFGTHKMVDVDLADGTRLKAMVAPDDRIRAGMRVEPSFAGLFVFRDNELVHQSMPAKGDVEIAELLRI |
1 | Mycoplasma_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_11 |
121186 : 121972
Sequences of DBSCAN-SWA_11
Nucleotide sequences of DBSCAN-SWA_11 >NZ_CP054034|121186:121972|DBSCAN-SWA AATGGATTTCGGCCTGAAGGACAAGACGGCCCTGGTGCTTGGCGCCGGCGGCGGACTGGGCAGCGCGATTGCCGCCAAGCTTGCGCGCGAAGGCGCCAGAATTGCCGCGGCGGATATCGATCTCGCCGCCGCTGAGAAGACCGCGGCTGCGGTTGAATCCGAAGGCGGCAAGGCGCTGGCGCTGCAATGGGACCTTTCCGATCTCGGTTCGATCGATGCGCATGTCGCTGCAATCGAGCGCCAGTTCGGACCGGTCGGTATCCTCGTCAACAATACCGGTGGCCCACCGCCGACCACTGTCTCCGGCCAGGATCCGGCGGTTTGGAACCAGTATTTCCAAAGCATGGTGCTCTCCGTCATCGCCGTTACCGATCGTGTGCTGCCTCAGATGCGCGCGCGCAAATGGGGACGCATCGTCACCTCGACCTCATCGGGCGTAGTTGCGCCGATCCCCAATCTCGGTATCTCCAACGCGCTGCGTCTGTCGCTGGTCGGCTGGTCGAAAACACTGGCGCGCGAGGTCGGACGCGACGGCATCACCGTCAACATCGTCCTGCCGGGCCGGATCGCCACCGGCCGCATCACCTTCCTCGACGAGCAGAAGGCCAAACGCGAAAACCGCTCCATCGATGACGTCGTCATTGAGAGCACCGGCAGCATTCCGCTCGGTCGCTATGGCCGGCCGGAAGAATATGGCAATGTCGTCACCTTCCTGGCTAGCGAGCCGGCCTCCTATCTCACCGGATCGGTGATCCGCGTCGATGGCGGCATGATCCAGAGCATCTGA
Protein sequences of DBSCAN-SWA_11 >NZ_CP054034|121186:121972|121186_121972_+|WP_138334097.1|DBSCAN-SWA MDFGLKDKTALVLGAGGGLGSAIAAKLAREGARIAAADIDLAAAEKTAAAVESEGGKALALQWDLSDLGSIDAHVAAIERQFGPVGILVNNTGGPPPTTVSGQDPAVWNQYFQSMVLSVIAVTDRVLPQMRARKWGRIVTSTSSGVVAPIPNLGISNALRLSLVGWSKTLAREVGRDGITVNIVLPGRIATGRITFLDEQKAKRENRSIDDVVIESTGSIPLGRYGRPEEYGNVVTFLASEPASYLTGSVIRVDGGMIQSI |
1 | Trichoplusia_ni_ascovirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_12 |
126056 : 127589
Sequences of DBSCAN-SWA_12
Nucleotide sequences of DBSCAN-SWA_12 >NZ_CP054034|126056:127589|DBSCAN-SWA CTCAAATCGCATGGCCGGCCTCTGGGGTTTCCTGCCATTCGCCGGCCATCATCAGCCCCAGCCTGCGGGCATCGGCGCTTTCGGCCGCGACGGGCGGGGACAGGCGGCCGCCGACGATTGCCTGTATGCGATCGGCAAGCGCGATCACCTCGTCGAGGTCTTCCGAGATCAGCAGCACGGCGGTACCCTGCCGGCGGGCTTCGAGCAGGCGTGCATGGACGGCGGCCACGGCCCCTTCGTCGAGACCACGCGCGGGCTGCGCCGCAATCAGGATGCGCGGCCGCCGATGCAGGTTGCGGCCGAGAATGAGCTTTTGCATATTGCCGCCGGAGAGCAGCCGGGTGCGGATGGCTGGCCCGCCGCCGCGGACATCGAATCCGTCGATAATTTCGCTCGCAAAGGCCATGCCCGCCTTGCGGTTGACGAGACCGAGGCGCGAAAACGCCGGCGATGCGATGCGCTCCAGCACGGTGTTTTCCCAGATCGCCATTTCGCCGATCACGCCCTCCTCGTTGCGGTCCTCGGGAATGCGGCCGATGCCGGCGTCGACGACATCGGCGACGCCGAGATTGCCGACGGCTTCCCCGAACAGCAGCAGGTCGCCGGCGCTGCGCGCCAGCATACCGGAGAGAAGATGCGCCAATGTCGCCTGGCCATTGCCGGAAACGCCGATGATGCCGAGGATCTCTCCTTGATGCAGCCGGAAGCTGATCGACTTCAGCCGATCAATGCCGCCGGTGCGCACCGTCACATCGGCCGCTTCAAGGGCAACGGCGCCGGGTGTCGACGGCTCGCGCACGGGCCGCGTCACACGGCGTCCGACCATCAATTCGGCGAGTTCCGCCTTGCTGGTTTCCGACGCCCTGCGTTCGGCGACCATCTTGCCACCGCGTAAGACCACGATGCGGTCGGCCGCGGCCATGACCTCGTCGAGCTTGTGGGAGATGAAGATCAGCGACAGGCCCTGGCGGGCCATTTCCCTCAGCGTCGTGAACAGCCGTTCGGCCTCGATATTGGTCAGCACCGCCGTCGGCTCGTCGAGGATCAGGATGCGGGCATCGTTATAGAGCGCCTTGAGGATCTCGACCCGCTGCTGCTCGCCGACCGACAGGTCGCCAAGACGGGCATCGGGATCGACCTTGAGGCCGAAACGTTCGGAAATTGTGAGCAGCTTCTTGCGTGCCGCTGACGTTCCCGAGCGCCAGGACCACAGTTTTTCGGTGCCGGTCATGACATTTTCGAGAACGGTCAGATTGGGCGCGAGGGAGAAATGCTGATGCACCATGCCAACGCCGGCGCGGATCGCTGCACGTGGCCTGCCCTGCGGCACCTCCGCACCCTCGATCAGAATGCGACCGGCATCCGGCATGTAATGGCCGAAAAGGATGCTCATCAGCGTGGTCTTGCCGGCGCCGTTCTCGCCAAGCAGGGCCACGACCTCACCCTTGGCGAGCGTCATGGAGATATCGTCATTGGCGAGATTGTCGCCGAAGCGCTTGCTGACGCCTGCTATCTCCAGCACAGGCGCGGTCAT
Protein sequences of DBSCAN-SWA_12 >NZ_CP054034|126056:127589|126056_127589_-|WP_138334105.1|DBSCAN-SWA MTAPVLEIAGVSKRFGDNLANDDISMTLAKGEVVALLGENGAGKTTLMSILFGHYMPDAGRILIEGAEVPQGRPRAAIRAGVGMVHQHFSLAPNLTVLENVMTGTEKLWSWRSGTSAARKKLLTISERFGLKVDPDARLGDLSVGEQQRVEILKALYNDARILILDEPTAVLTNIEAERLFTTLREMARQGLSLIFISHKLDEVMAAADRIVVLRGGKMVAERRASETSKAELAELMVGRRVTRPVREPSTPGAVALEAADVTVRTGGIDRLKSISFRLHQGEILGIIGVSGNGQATLAHLLSGMLARSAGDLLLFGEAVGNLGVADVVDAGIGRIPEDRNEEGVIGEMAIWENTVLERIASPAFSRLGLVNRKAGMAFASEIIDGFDVRGGGPAIRTRLLSGGNMQKLILGRNLHRRPRILIAAQPARGLDEGAVAAVHARLLEARRQGTAVLLISEDLDEVIALADRIQAIVGGRLSPPVAAESADARRLGLMMAGEWQETPEAGHAI |
1 | Staphylococcus_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_13 |
132711 : 140309
Sequences of DBSCAN-SWA_13
Nucleotide sequences of DBSCAN-SWA_13 >NZ_CP054034|132711:140309|DBSCAN-SWA ATCAAGCCTCCATGTTCTCTCTCGGATCGTAGGAGTTTGCTGTCGCCCATTCCTTCATCAAGGCCGTCAGCCGCTCATCCGGGGCGTCGGGCAGCACAATCTTCAGCGAGATGTAGACGTCGCCGTGTCCGCCGCCGCGTTTTGCGACACCTTTGCTCTTGAGACGCAGGACCTTGCCCGTGTTCGAATGAGGCGGCAGCGTCAGATTGACCGGTCCCGACGGCGTCGGCACGCGAACCTTGCCACCAAGCACGGCCTCACTGAGCGAGATCGGCAGTTCCAGGCGGATATCGTCGCCGTCTCGCGTAAAGAAGCGGTGCGGGCGAACACGGATTTCGATCAGGGCATCGCCCACCGGCCCACCGCCGATGCCCGGCTCGCCCTTGCCGCGCAGCCGCAAGGTCTGGCCGTCGCGCGTCCCCGGCGGGATCTGTACGTCGAGCGCCGGACCATTCGGCATCTTGACCTCCGTCCTGGTGCCGTTGACGGCCTCGAGAAAGTCGACCTCCATGGAAAATTGGCGGTCACGGCCACGACCACGGGTCTGGCCGCCGCCGGCACGACGTGAAAAGAAGCTGGCGAAGATATCGTCGGCATCGCTGAAATCGGCAAAGCCGGCGCTGTTGTGGTAGGGATCGCCGGGCCCGCTCTTCGACGCGTAGTCGCGGTAATAATTGCGCTGCGCCCGCTCGGCGCCGGTTATATCGATCTCGCCGCGGTCGAAGCGTCCGCGTTTTTGCTCGTCACTCAATATCTCGTAGGCCGTTGAAATTTGCTTGAACCGTTCCTCAGCCTTTTTGTCGCCGGGGTTAAGGTCGGGGTGAAGTTTCTTGGCGAGCTTGCGGAACGCGCTTTGAATATCCTTTTGCGTCGCATCGCGTTTCACGCCCAGAAGCTCGTAGGGATCCTGGCTCATACATATCTCCGGTCGGGCAGCGAATACTGCTGGTTGGCTTGGCAATGATATAGGTTGTCGGGCCGCCGCCGGGGAGGGGAGGCCGTCCGCTCTGATCGACGTCAATCAGACAAAGCGCCCGCTGTCACGCAGGCTCTGTGGAGCTTAGAACAACGCCGCGTCTTCCGCCGGGGACGCCGAGCCTTCCTGCGCGTGATCGGGGCGCCGATCGATAAGAACCGAGCACCCGGCATGGCGCACGACCTTGTCGACGATCGAGGAGAAGACGTAGTCGGTGATATCGGTCACATGGGATGAAACCAGGATAAGATCGGCCGATTTCTCCCTTGCGCTGGAGACGAGCAGGGAGGCGGCAACGCCGATGCGAACCTCGATCATCGCCGGAATTCCCAGCTCCTTGCAGAGCGATGCTAGTTTCTTTTCGGCGTCGACGATGGCGGTCGTTTCGAATTCCTCGGGAAGCTCCGTCAGGTGATGGCGGGGGATATTCTCGATGACGTGCATGACGACGATGCTTCCGCCTTCATCCACCAGCGCGGCAGCCCGGCGCAGAAGGCGGCTGGCCGTCTGGCGGGGACCCATGCCGATGCCGCAGATGATCGTCCGATACATTTGAAATCCATATCCTCGAATGCGAATGATAGCACCGATCATACCGGTGTCGACCGTCCCGGCATTGACGAATATCAAGAAAAAGTGCCGCAACGGCAGCGCGCACGAAACAGCCAGCTGCTCGGAACGAACCTCCGGCCGAAATCGTCGCAGAAGCGCCTGAAACTCAAGGCTTCAGCGCAGTCAACACCCCGACCCAAGAGGGCACGCCATGCTCAGGAAACCCCGGTCCCGCCTGGCACGCTAAGCGAGCCGCCGCCACCTTCAGATGGTACTCCTTCCAGAGTTGCCCACGGTGACGCTGCCTGATTAACCTGTTAGTGTATTTCGGCTTTTTGGATGGTGATGAGGAGGAGTTCAGTATGGTTCCTGCTCCTAAAGTTCGGTCCGGCCAGACCGTTGGAATTCCATCCGGATTTCCTTGCCTTCTGTTGCTCGCGATGGTTTTGGCCGCGAATGTCGCCTCGGCGGCGATTGTGGAGCGGGTTCCGCGCGTGCTCATCCTTTACCCGCTGGATGAAAGAACCCCCGCACCAGTGATTGTCGGTGAATCGGCGAGAAATCGACTGCTGGAGGTAACCACTGGCAAGATTGACCTATTCTCCGAGTTTCTCGACCTTACCAGATTCAGCGACAAAATTCACATAGATCGCATGGCGCGCTATCTCGCCGCGAAATACGTCGATTATCGCCCGGATGTTGTCATCGCCATCGCAGAGCCGGCGGCCAACTTCATCGTCGCGCACCGCAGTACAATCGCCCCAAATGCTAGGATAGTGTTCTCCGGCTTTGGTAGCGACACTGCCGCCAGAATGAAGCTGCCGAGCGATGTGGTAGGTGCGTTTACCGAGTTCGACATCGCGAAAACTCTCCAAATGGCTCGCAGCCTCCAGCCGAGCGCACGCCATCTTTTCATTGTCGGCGGTTCTTCGGAGTTCGACCGTTCGTGGCTTGCACTAGCGCGCAAGGAGCTGGCGCTTCTCGTCAAGAGCTATGAAACAACTTACTTGACGGATTTGACGATTGATGAGTTTGTCGAGCGCGTCGCCCGCCTCCCGTCCGAAAGCATCGTTTTGTTCCTGACCGTTTCGAAGGATAGCACCGGGCGTGACCTTATCCCGAGGGACGCGCTCGAACGGATAGCGGCCACAGCCAGCGCACCGATTTATGGTCCCTTCTCCACCTATGTCGGTCATGGCATTGTCGGTGGCAACACCGTCACCTTTGAATCGGCGGGACTGGCTGTAGCCGACCTCACTGCGGATGCGATTGGCGGAAAGGTGATCGCCAATGTCGACGTGCCGCAGACCTACGTCGCCGATGCGAGACAACTGAAGCACTGGGACTTGTCGGAGAGCAGGCTCCCTCCGGGAACAGTTCAAAGCTTCAAGGAAAAGACCCTATGGGAGGAGCACCAGAGGCTGATCATTGCCATCCTTGTCGTTCTCGCGCTTCAAGGCTTGGTCATTGGGGGATTGCTGGTCGAGCGCCGGCGCCGCCACGGCGCGGAGCAGGAGGCGCGCCGTCGTCTTCTTGAGGTGGTGCACCTTAACCAGTCGGCCACCGCAGGGGCGCTGTCGGCCTCTATCGCCCATGAGCTCAACCAACCGCTTGGCGCGATCCGAGCCAACGCCGAAGCTGCCGAAGTCATGATCCGCGGCAAAACGCCCGACCTCAAACTGATCCAACAAATACTCTGTGACATCCGCGACGATGATCAGCGTGCCGAGGATATCATCGTGCGGCTAAGAGGACTGCTCAAGAAGCGGAGCGAAATCGACTGGCAAGAATTCGACCTGAAGGACGTGATCAAGAGCTCGATCCAAATTCTCCACGCTGAGGCGGGTCGAAGGAACATAACAATCAGTTTCAGCCAACCGGTAGCGCAACTTCCGGTGCGGGCAGACCAGGTTCATGTTCAGCAAGTTATATTGAACCTCGCGACCAATGCGATGGATGCGATGCTCGACGCGGTGTCGACCGAGAGACGGCTGGTGTTTCAGACAACTCTCACAGAGGGATCCGAAGTCGAGTTATCAATTTCGGATACTGGACGCGGCATCCCAAGCGGACAGCTCGCCAGGGTTTTCGACGCGTTCTACACCACCAAGGCGACTGGCACTGGACTTGGCCTCTCCATCGCGCGCACAATAATCGAAACCTACGGCGGCAAGATATGGGCCGACAATCGCGCTGATGGTGGCGCTGTGTTTCGCTTCGTTCTGCCGCTCGCGCAGCGAGGATGAGGCAGTGGGACCAATCGTTCATATTGTCGATGACGACAAGTCGTTTCGCATCGCGGTCGGACGATTGCTGGAGGCCTCGGGCTTTCGAGTAATTTCTTATGAGTCGGGCGACGATATTTTAACGCGTCTGACAGGCTCGGAACCTGGATGCATTCTGCTCGATCTGCAAATGCCTGGCCTGAGCGGGTCAGAACTGCAGGGCTGTCTCGCACAGAAGGCACCGCTGCTGCCTATCGTGTTTCTCACCGGTCAAGCCGAAATCGAGGACAGCGTTCGAGCGATGAAGGCAGGCGCCGAAGATTTCCTAGAGAAGCATGCATCCAACAAGGCATTGCTAGGGGCTATCGAACGGGCGCTGCTGCAGTATGAAAAGCGGCGTACCCAGCAGGACCATGTTCATTCTCTCCACGCCCTCGTCGCCGCCCTAACGCCGCGTGAATACCAAGTTTTCGACCGGATTGTCCGCGGCAAGCGCAATAAACAGATCGCCTACGATCTCAGCACGTCGGAGAGAACTGTCAAGGCACACCGGCACAGTGTCATGGAAAAACTTGGAGTGGGTTCACTTGCCGAGGCAGTGTCGATTGCCGAGCGATTGGGACTTGTCGATCCGGCGGCATGACCGCCCGGCAATAAAAATCCGGCACTTCCCCATAGGACAATATCCGCGGTCCGTTGTTGAGCCTACGATCGGTAAGATTGGAAGTGGCCATATCGCCGGCTGGTTAGGGAAAGGCACACGGTGGAAAGTTCGCCGCCCACAGTGGCTGTCGTGGAAGACGACCAGAGCATGCGGACAAGCGTCGAGCGCTTGCTCAACGCTCATGGGTTCCCGACCAAAGGATTTAGGTCGGCGGAAAGTTTTCTCAACCGCGACGCCACGATCAAGATAGGCTGTATCGTGCTCGATATCCATCTTGATGGCATGTCAGGCATCGAGCTTCGGCACCGGCTTAAACAATCTGACTCCACGCTCTCAGTCGTTTTCATTACCGCCGTCGATGATGACGCGCTCGAACTCGAGGCGGTTCAAGCAGGTTGCATCGCCTATCTTCACAAGCCGTTTCCAGCGGCCTCGTTGATTGGCGCTGTCACCAAAGCGCTGGCTGATTCCTCAACTGGCTGAAAGCGGTCCATGTGCGCTCCCCAGCCAACAAGGCATGCCTCGCCACTTCCGAAAACCATCTTCCCCTAAGGACAATAGCGCAGCCACCTTTTCTCGGTAAGACTTATCACCGGAAGGAGTTTAGCGGTGACTTGTCGCCATCGCTCGAACCGAACCTTTCTTGGCCTCACGGTCGAAGGCTTTGCGATCAACGACGAAACGACGAAAGCCATGACAAAACTTGCCCCTGCCGAACTGACAGCTGGACCTGTGGCAGCGAACACGTCCGACACAAACATGGTCTTGATCCCCGGGGGCACTTTCCGCATGGGGTCGGACACGCATTACCCAGAGGAGGCTCCGTCCCATCGCGTGACTGTCGACGACTTCTGGATGGACAGGGTGCCGGTCACCAACGGAGAGTTCAAACAGTTCGTCAAGGCTACCGGCTATGTAACGATCGCCGAGCAGGTTCCCGATGCGAGAGATTATCCAGGAGCGCTTCCGCACATGCTCTACGCCGGATCGCTGGTGTTCCACAGGGCCTCAGGGCCAGTCGATTTGAGAAATCCGAACAACTGGTGGGCCTTCGTCAAAGGGGCAAGCTGGCGGCATCCGCTCGGCCTCAGCAGCGGCCTGTTCAAGAAGGAAACCCATCCGGTCGTTCAAATCGCCCATACGGACGCACTTGCCTATGCAGAATGGGCCGGCAAGGAACTGCCGACTGAAGCCGAATGGGAATATGCAGCCTGCGGCGGTCTTGAGAACGCCGAATTCGCTTGGGGCGACGAGTTCACACCTGCCGGTCGCCACATGGCCAACACCTGGCAGGGCGAGTTCCCGCACCAGAACCTGGCCGCCGATGGCTACGAGCGGACCTGCCCTGTCAACGCATTCCCGCCAAACGGCTACGGGCTCCTGAGCATGATCGGCAACACCTGGGAATGGACTTCTGACTGGTACACTTCGGAGCATCCAGCCGATGCGCCCAAAGCCTGCTGCATTCCGGAAAACCCGCGCGGCGGTCGCGAGTTGGACAGCTACGATCCGCGCCAGCCGGAGATCAGGATTCCGCGCAAGGTTCTGAAAGGCGGCTCGCATCTCTGCGCTCCAAACTACTGCCGTCGCTATCGCCCGGCGGCGCGCTACCCCCAGCCAATCGATACTTCAACTAGCCATGTCGGCTTCCGATGCATAGTGAGAGGAGACGTCAGATGACCAGTAGCTCGCAGACCCCCACAGGGACGCAGCCGAAGACAGGGTTGCAGCGCCGTGAACTCTTGTTGAGCGCCAGCGCACTCGTCGCGGCAGGCGCCGTCGCCACGGGCGGATTCGGCCTGCCGGTTCTTCGCGGCGCATTCGGTTCGGGGATGGCCGCAATGGCCGCAGAACCACAGCCGAATATCATCCACATTGTTTCGGACGATCAAGGTTGGAAGGATGTCGGATTTCATGGCTCCGACATCAAAACGCCGAACATCGACCGTTTGGCGGAAACAGGCGCGGAGCTGACGCAGTTCTACGCGCAGCCGATGTGCTCACAGACGCGCGCCGCGATGCTAACGGGGCGTTATCCACTCCGTACCGGGTTTCAAACGGCGGTCATTCCCTCCGGGGGCCTCTATGGGGTACCGGTGGACGAATGGCTTCTGCCTCAGGCACTGAAAGACGCCGGATACGAAACCGCGCTCGTCGGGAAATGGCACATCGGCCACGCCAAGCCGGAATATTGGCCACGTCAGCGCGGCTTCGACTATTTCTATGGCGCGATGGTGGGCGAGATCGACCATTTCAAGCACGAAGCGCATGGCGAGGGTGATTGGTATCGCAACAACGATCGCATTGAAGAAGAAGGCTACGACACGACGCTTTTCGGCAACGAAGCGGTCAAGCGGATTGAAGAACACGATCCCAGCAAACCACTATACCTCTACCTCGCCTTTACCGCGCCGCACACGCCATACCAGGTGCCTGACGAGTATCTCGACAATTATCCGGAGATCGCCGATACGTCGCGGCGCACTTACGCAGCAATGATCACCGCCATGGACGATCAGATCGGCCACGTGCTCGAAGCGCTCGACAAGCGAGGAATGCGTGAAAACACACTGATTGTATTCCACAGCGACAATGGTGGAACGCGTTCAGCGAAGTTCACCGGCGAATCGAAGGTAACCGGCGAACTCCCGCCCAACAACGGCTCCTATCGCGACGGCAAGGGAACGATCTACGAAGGCGGCACCCGCGTTGTCGCGCTCGCGAATTGGGCCGGAAAAATCAAGCCCGGGAAGGTCGATGGGATGATGCATGTCGTCGATATCTATCCGACGCTTGTCGGCCTCGCTGGCGGCACGCTCGACAAGACCAAGCCGCTGGACGGCCTCGATGTGTGGGGGGCGATCAGCGAAGGGAAGGTCTCGCCGCGCACCGAAGTCGTCTACAACGTGCAGCCGTGGGCTGCTGGGGTGCGCCAGGGAGATTGGAAGCTGGTATGGCAGGCGCTCATACCCGGAACCTTGGAATTGTTCGATCTGGCGGCTGACCCGTCGGAAGCCAAAAACCTCGCGGAAGCGAATCCCGCAAAAGTGAAGGAACTTCAGGCCAGGATCGCGGAGCTGGCAGCGCAAGCGGCGCCGCCGCTATTCATGACCGATGCATTCCGCCTCATACTTTCCGAACCGCCGTCGACACCCGAGGGCTTTTTCGAAGTGTCGGACTAG
Protein sequences of DBSCAN-SWA_13 >NZ_CP054034|132711:140309|133770_134238_-|WP_138334118.1|DBSCAN-SWA MYRTIICGIGMGPRQTASRLLRRAAALVDEGGSIVVMHVIENIPRHHLTELPEEFETTAIVDAEKKLASLCKELGIPAMIEVRIGVAASLLVSSAREKSADLILVSSHVTDITDYVFSSIVDKVVRHAGCSVLIDRRPDHAQEGSASPAEDAALF >NZ_CP054034|132711:140309|137226_137610_+|WP_138334121.1|DBSCAN-SWA MESSPPTVAVVEDDQSMRTSVERLLNAHGFPTKGFRSAESFLNRDATIKIGCIVLDIHLDGMSGIELRHRLKQSDSTLSVVFITAVDDDALELEAVQAGCIAYLHKPFPAASLIGAVTKALADSSTG >NZ_CP054034|132711:140309|137886_138807_+|WP_138334301.1|DBSCAN-SWA MVLIPGGTFRMGSDTHYPEEAPSHRVTVDDFWMDRVPVTNGEFKQFVKATGYVTIAEQVPDARDYPGALPHMLYAGSLVFHRASGPVDLRNPNNWWAFVKGASWRHPLGLSSGLFKKETHPVVQIAHTDALAYAEWAGKELPTEAEWEYAACGGLENAEFAWGDEFTPAGRHMANTWQGEFPHQNLAADGYERTCPVNAFPPNGYGLLSMIGNTWEWTSDWYTSEHPADAPKACCIPENPRGGRELDSYDPRQPEIRIPRKVLKGGSHLCAPNYCRRYRPAARYPQPIDTSTSHVGFRCIVRGDVR >NZ_CP054034|132711:140309|134600_136484_+|WP_138334300.1|DBSCAN-SWA MVPAPKVRSGQTVGIPSGFPCLLLLAMVLAANVASAAIVERVPRVLILYPLDERTPAPVIVGESARNRLLEVTTGKIDLFSEFLDLTRFSDKIHIDRMARYLAAKYVDYRPDVVIAIAEPAANFIVAHRSTIAPNARIVFSGFGSDTAARMKLPSDVVGAFTEFDIAKTLQMARSLQPSARHLFIVGGSSEFDRSWLALARKELALLVKSYETTYLTDLTIDEFVERVARLPSESIVLFLTVSKDSTGRDLIPRDALERIAATASAPIYGPFSTYVGHGIVGGNTVTFESAGLAVADLTADAIGGKVIANVDVPQTYVADARQLKHWDLSESRLPPGTVQSFKEKTLWEEHQRLIIAILVVLALQGLVIGGLLVERRRRHGAEQEARRRLLEVVHLNQSATAGALSASIAHELNQPLGAIRANAEAAEVMIRGKTPDLKLIQQILCDIRDDDQRAEDIIVRLRGLLKKRSEIDWQEFDLKDVIKSSIQILHAEAGRRNITISFSQPVAQLPVRADQVHVQQVILNLATNAMDAMLDAVSTERRLVFQTTLTEGSEVELSISDTGRGIPSGQLARVFDAFYTTKATGTGLGLSIARTIIETYGGKIWADNRADGGAVFRFVLPLAQRG >NZ_CP054034|132711:140309|136434_137106_+|WP_171599554.1|DBSCAN-SWA MVALCFASFCRSRSEDEAVGPIVHIVDDDKSFRIAVGRLLEASGFRVISYESGDDILTRLTGSEPGCILLDLQMPGLSGSELQGCLAQKAPLLPIVFLTGQAEIEDSVRAMKAGAEDFLEKHASNKALLGAIERALLQYEKRRTQQDHVHSLHALVAALTPREYQVFDRIVRGKRNKQIAYDLSTSERTVKAHRHSVMEKLGVGSLAEAVSIAERLGLVDPAA >NZ_CP054034|132711:140309|132711_133626_-|WP_138334116.1|DBSCAN-SWA MSQDPYELLGVKRDATQKDIQSAFRKLAKKLHPDLNPGDKKAEERFKQISTAYEILSDEQKRGRFDRGEIDITGAERAQRNYYRDYASKSGPGDPYHNSAGFADFSDADDIFASFFSRRAGGGQTRGRGRDRQFSMEVDFLEAVNGTRTEVKMPNGPALDVQIPPGTRDGQTLRLRGKGEPGIGGGPVGDALIEIRVRPHRFFTRDGDDIRLELPISLSEAVLGGKVRVPTPSGPVNLTLPPHSNTGKVLRLKSKGVAKRGGGHGDVYISLKIVLPDAPDERLTALMKEWATANSYDPRENMEA >NZ_CP054034|132711:140309|138803_140309_+|WP_138334123.1|DBSCAN-SWA MTSSSQTPTGTQPKTGLQRRELLLSASALVAAGAVATGGFGLPVLRGAFGSGMAAMAAEPQPNIIHIVSDDQGWKDVGFHGSDIKTPNIDRLAETGAELTQFYAQPMCSQTRAAMLTGRYPLRTGFQTAVIPSGGLYGVPVDEWLLPQALKDAGYETALVGKWHIGHAKPEYWPRQRGFDYFYGAMVGEIDHFKHEAHGEGDWYRNNDRIEEEGYDTTLFGNEAVKRIEEHDPSKPLYLYLAFTAPHTPYQVPDEYLDNYPEIADTSRRTYAAMITAMDDQIGHVLEALDKRGMRENTLIVFHSDNGGTRSAKFTGESKVTGELPPNNGSYRDGKGTIYEGGTRVVALANWAGKIKPGKVDGMMHVVDIYPTLVGLAGGTLDKTKPLDGLDVWGAISEGKVSPRTEVVYNVQPWAAGVRQGDWKLVWQALIPGTLELFDLAADPSEAKNLAEANPAKVKELQARIAELAAQAAPPLFMTDAFRLILSEPPSTPEGFFEVSD |
7 | Indivirus(25.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_14 |
146064 : 156911
Sequences of DBSCAN-SWA_14
Nucleotide sequences of DBSCAN-SWA_14 >NZ_CP054034|146064:156911|DBSCAN-SWA GTTATGGCATGCGTTTGATTGATTGAATCGGCATCGAATGCGGATCGTCCTTGCCGTCGGCGCCTTGCAGCAGCCGCTTGTCAGGATCGATGTCGGGAATGGCGGCGATCAGCGCCGCCGTATAAGGATGCTTCGGCCGCGCGAAGACTTCTTCGGAGCGACCTTCCTCGACGATCTCGCCGCGATACATCACGACGACGCGTTCGCAGAGATTGCGGACGATTGCGAGGTCATGGGCGATGAAGATCAGCGTGAGGTTCATCTTCGCCGTGAGATCGCGGAAGAGCTCGATGATCTGCGCCTGGATGGTGACGTCGAGGGCCGCCACGCATTCGTCGGCGATGATCAGCTTGGGATCGACGGCGAGGGCTCGGGCGATGCCGGCGCGCTGGCACTGGCCGCCGCTCATGCTGCGCGGTTTGCGGCCGGCGAATTCGCGTTCGAGGCCGACGAGATCGAGCAGTTCGCCGATTCGAACAGGGATATCGGCCTTGGCGACCTTGCCCTGCACCTTCAGCACCTCGGCCAGCATCTGCCCGATCGTCAGTCTGGGATTGAGCGCATTATAGGGGTCCTGGAACACCATCGCCGTCTCGCGCCTGAGCTTTGCCAGGCCAGCGCTTTTCTGCAGCGCCAGATCGACGCCGTCGAAAGTGACGTGACCCGAGGAAAGCGGGGTGAGGCCGAGCACGGCGCGGGCAAGCGTGCTCTTGCCGCTGCCAGATTCGCCGACGACGCCGACCGTCTCGCCCGGCATGATCTGCAGGCTGACGCCGGCAACGGCGCTGACCGTCTTCGCACCGCCCCTGAGCAGGGCGCCGCCCGCTCTGAAGCGCACATGGAGATCATCGATTTCGAGTAACGGCCTGGCCGGTTGGCTGGCTTCGGCAAGCTCGGGCAACGGCGCTGATATCGCGCCCGGCAGGGAAGGGTGGCTGTTGATCAAGTTGATCGTATAGGGATGCTGCGGCCGCGCCAGGATCGTCCGCTTCGGCCCCTCTTCGAGCAGTTTGCCGCCGCGCAGCACGGCAATACGGTCGCAAGTCTGGGCGACGATACCGAGATCGTGGGTGATCAGAATGATCGACAGGCCGCGCCGGTCGCGAATGTCGATCAACAGCCGCAGGATCTGCGCCTGGATGGTCACGTCGAGCGCCGTCGTCGGCTCGTCGGCGATCAGGATCTTCGGATTGCAGGACAGCGCCACGCCGATCATCGCCCGCTGCCGCATACCGCCGGAAAATTCGTGCGGATAACTGTCGTACTGGCGCTTCGGATCGGGGAAGCCGACCTGCGCCAGGATTTCCGTCGCCGCGGTGCGGGCCTCGCGGGCGCCAAGGCCCTGATGGTAGCGGATTCCCTCGGCAATCTGGTCGCCGATCCGCATGACAGGGTCAAGATGGCTGGTCGGGTTCTGGAAGATCATGCCGATCTCGCCACCGCGCACTTTCAGCATCTCGCCATCGTCGATCCGCATGAGATCGCGCCCTTCGAGCAGCACGCTACCGCTTTCGATCTTCAGCAGCGAGGAGGGCAGCAGGCGCACCAGGGAACGGCAGAACAGGCTCTTGCCCGAGCCGCTCTCGCCGACGAGACCGAGGATCTCGCCCTTGCCGAGGTCGAGTGAGACCGCATCGAGCAGCGTGCGTGGGCCGGGCTCGACATGTGCCCTGACGGTGAGATCACGGACGGACAGCACGGAGCCGCTCATTCATGCACCCCCAGCAGTTCGCCGAGCGCATCGCCCAGCATGCTGAAACCGAAGGCGAGGCAGACGATGGAGAGGCCGGGAAACAAAGTGATCCACCATGCCGTGGTGATGAAGCTCTGTCCTTCTGCGACCATAACGCCCCATTCGGCGATCGGCGGCTGGACGCCGAGGCCGAGATAGCTGACGGCAGCGCCGCTGAGCAGCACCAGCGTCGCATCGGACATCGAAAAGACGATCGAGCCGGCGATCGCGTTCGGCAGCAGGTGGCGGAACATGATGCGCGGGCGGCTGAAGCCGAGGCTGACGGCGGCAACGGCGTAGTCGCTGCCTTTCAGCACCAGCATCTGCGCCCGGATCAGCCGTGCATAGGAGACCCAGCCGACCAGCGCCATGGCGATGTAGAAGCTGCCGAGGCCGGGGCCGAGGATCGCGATGATCGACAGCATCAGCACCAGGAAGGGGAAGGCGAGGATGATATCGACAAGACGCATGAACAGCGCATCGACGATCCCGCCGAAGAAGCCGGCGATCGTGCCGACCGTCGTGCCGATCAGGAAGGGGAAGATGACGCCGATCAGCGCCATCTGCAGGTCGAGGCGGGCGCCCCAGATAACGCGGGAGAGGATGTCTCGGCCGAAATTGTCGGTGCCGAAGGGATGCAACAATGACGGCGCCTGCAGGCGCACCTCGGCATTCTGCATGATCGGGTCGTAAGGCGCGACAATGGGCGCGCCGATCGCCAGCAGCACGAAGAACAGCAGCAGGCCGGCACTGAGTACAAGCATCGGCCGCCGGCCGAAGAACCGGCGCCAGCCGGGGGAGGCGGGGGCGATCGCCTCGATGCTCATAGCTTCACCCTCGGATCGACGGCGACAGTGACGATGTCGGCGATGAAGTTGATGAGCACGGTGGCGCAGGCAAAGACCATGGCGACGCCTTGAACCACCATATAGTCGCGTGAGAAGATCGCCCTGACGAGCAGCTGTCCCATGCCGGGCAGCGCAAAGACGCTCTCGACCACCACCGTGCCGCCGATCAGCCAGCCAATATTGACGGCAAGCAGGTTGATGGTGGGCACCAGCGAATTCGGCAGGACATGCCGCCAGAAGACGATCCCCTCCGGCATGCCGCGGGCGCGCGCCGCCGTCGCGACATCCGATTTCAACTGCTCGATCATCGCCGCCCGCAGGCTGCGCGTCAGCACGGTCGAGAGCGACAGCGCGACCGTCAGGCTCGGTAGTACGAGATGCGGCAGCTTTTCGCCGATCGTCGCACCATAACCCGAGACCGGCAGCACGCCGAGTTCGACGCTGAAGAGGATGATCAGCATCAGTCCCAGCCAGAAGGGCGGAAAACCGATGCCGAAGGTCGAGACGATGCGCACCGCATGATCCGGCGCCCGGCCGGCATTGCGCGCGGCGATCGCCGCCATCGGCACCGCGATCAGGACCGACAGCACGACGCTGGAGACGACGAGGGCGAGCGTCGGCTCGATGCGGGTGACGATCAGCTTCAGCACGTCGATCTTGTAGAGGATCGACTTGCCCATCTCGCCATTGGCGAGGTTCTTGAGGAAATAGACATATTGCAGCCACATCGGCTGGTCGAGGCCGTACTGGGCGCGAATGCTGGCGAGAGCCGCCGGCGTCGCGCGCGTGCCGAGGATGTTGCGCGCCGGATCACCGGGGATCAGCCTGACCAGAATGAACGTGATGACGCTGATGCCGAAGATCACGGGCAGGAACTGCAGCGGTCGTGTCAGAACGAATTTATAGCGATGCATGACGGCGATCGCCCCTTTGTCTTCGTCGGGTTCGGTCCGCTTCCTGTTGCATCAGCCTGCCTCGTGCCGGGCGAGAAAATCGAGAAGCGCCGGATAATAATCCTGTGGGTTCTCATAGAACGGCATATGGCTGGCATTGGCAAATACCTTCAGCTCGGCATTCGGCAGCGCAAGCTTCATCCTGAGCGCACAGGCCGGCGTCAACTCGTCATGCTCTCCCGTCGTGATGAACACCGGCAGCGTCAGTCGCGGCAGATCGGGGATGCGGTTCCAGTCCTTGAGATTGCCGATATAGAGAAACTCGTTCGGTCCCTGCATCGTCTCATAGGGCACCATGTTCCAATCGTCGAGCGAGCGGCGCACCGGCGCCGGCCATTCCGGCAGACGGCAGACATGGCGGTAGTTGAGGATGGTGACGGCGGCGAGATATTCCGGATGATTGTAGGTGCCCTGCGCCTCGTGCTTCTGCATCATCGACACGGTTTCGGGACCGAGCGCTGCGCGCAGCCGTTCCAGTTCCGAGATCAGATGCGGCATGTCGGCGACGGTATCTTCGAGGATCAGCGTCTTGAGGTTTTCGGGATAGGTCAGCGCATAGTCGATCGCCAGCCACCCGCCCCAGGAATGGCCGAGCATGTGAACCTTGCCGAGCCCCAGCGCCTTGCGCACCGTCTCCGTCTCCTCGACATAACGACCGATCGTCCAGAGCGAGAGATCGTCCGGCCGGTCGGAGGCGCCGGTGCCGAGTTGATCGAAGGCGACGACGCGATAGCCCTTGTCGATGAGGCAGGAATGCGCTTCGCGCAGGTAATCGCAGGGCAGACCCGGCCCGCCGTTCAGGCAGAAAACCGTCTCGCTGCCGGTTCCGAAACTATAGGCAACGACGCGATAGCCATCGACATCAATCTCGAATCGCTCGTCCGGCCGTATTTCACGCCACATTGATCGCTCCGGCAGAAGATTTCGCTTGCTCAATATTTGGGCACGGCTAAATTTTTACCTTAAACCGCATTCAGTGCCAGCTATAAGAAGTGATAGGGTGGGCCGCAATGCCGAACCCGGGAGCAGCCCATGCTTGACGACCCCATTCTTGACGACATCGGAACGATCCGACGGCAGTTTACAGCGCACGAAACGCTGGACGGCCGAATCGACCAAGCCTTCGAGGCGATGAAGCAGATCGGCTTCGAGGCGCTGATCTACGACTATACACCCGTTCCCTACGATCTCGACGGCGCGATCATGATACCATCGCTGCTGAAGCTGCGGAATATTTCAGACGACATGCACGACTACTGGTTCAATCGCGGCTATTTCCGTATCGATCCGGTGCAGCAGGTGGCGCTGCGCACCTCCGCACCCTTCTTCTGGAACTATGACCCGGACGCCGACACGCTGATCAACCGTTTCATGAACGACGACACCGTGCCGGTGACCCGCTATTTAAGCGAACGCGATATGTCTACCGGTGTCACCGTCCCGGTTCATATGCCACGCGGCGACTATGCGACCGTCACCGGTATCCGCTTCGGCCGGAACAAAGATTTCGAACGGCACGCACTGCGCTATATCGCCGACTTCAACCTGCTTGCCCATGTTTTCCATGAGACCGCCTATTCGCTTTTCGATACGCGGGCGAAGAGCGTCGGCACGATCCGCCTGACCGAGCGGGAACGGGAATGCCTGCGCCATTCGGCGGAAGGCTACTCCGCCAAGGAAATTTCCCGCATCATCGACCGTTCCGTACCGACTGTCGTGATGCATCTGAACGCCGCGACAAAGAAACTCGGCGCCCGCAACCGCACCCAGGCCGTGGTGCGCGCCACCCATTACCGTCTGCTCGAAGACCGGCCGACCCATAATTGGCCACCCTATAACTTGTGATAGCTGCCTTCCCCGTTGCCGCGGCGTTATGCTTTTTTCATACCGCTAAGAGATGGAGACGCCCGTGGGCGAAGTCACGACCAACGAAGCCTATCTGCCCTTTCGCGACTATCGCACCTGGTATCGCGTCACGGGTTCGCTGGAGAGCGGCAAGCTGCCTCTCGTCGTCGCCCATGGCGGACCGGGCTGCACCCATGATTATGTCGATTCCTTCAAGGATATAGCCGCCCTCGACGGTCGTCCGGTCATCCATTACGACCAGCTCGGCAACGGCAACTCCACCCGCCTTCCGGAAAAGGGCCCGGATTTCTGGACGGTCGGCCTGTTTCTCGAGGAGCTGGACGCGCTGCTTTCCCATCTCGGCATTCAGCATCGTTACGCCTTCCTCGGCCAGTCCTGGGGCGGCATGCTTGGTGCCGAACATGCGGTGCGCCGGCCGCAAGGCCTGAAGGCACTTGTTATCGCCAACTCGCCGGCCAACATGCACACCTGGGTTTCGGAGGCGAACCGGCTGAGGCAGGAACTGCCGAAAGAGGTGCAGGACACGCTGCTGAAGCATGAACTGGCGGGAAGCCTCACCGATCCGGAATATATCGCCGCCTCGCGTGTCTTCTATGACCGCCATGTCTGCCGGGTGGTGCCGTGGCCGCCTGAAGTGGCGCGGACCTTCGCGATCATGGACGAGGACAACACGGTCTACCGTAACATGAACGGCCCGACCGAATTTCACGTCATCGGCACCATGAAGGACTGGACGATCGAGAACAGGCTCGACCGCATCGAAGCCCCGACGCTGCTGATCTCGGGAAAATACGACGAGGCGACACCCTTGGTGGTAAGGCCTTATCTCGAACGCGTTCCGGGCTGCGAATGGGTGCTCTTCGAAAATTCCAGCCACATGCCGCATGTCGAGGAAAAGCAGCTTTGCCTGGCGACCGTTTCTGGTTTCCTGTCGCGGCGCGATTGAGAAAACTCGCCCGCGCCGGCTACACCGTTTCTTGCCGCGGCAGTGGCGCTTGCGATAGTCTCCGTCCCATTCAAGAGCGAACGGAACGGACCGATGAACGATCATCCGCAAACAGAAAGCGCGACAATCCTCGCAACGAGCAAGGGCCGTGCCTGGCGCGGTCTGGAGGCCGAGCTTTTGCGCATTCCGCCCGGCCGCACGCATATTTCAGGCACGCCCTATCATCGCCTCGGAATCCATGTCGGCCGACCGGTGCGGGCGCAATGCCGCTGTGACGGGCGGGAGCATCGCCGACTGCAGAAGCATGGCGATATCGACGTGGTGCCAGCCGGCCTCGATGGCGTCTGGGAGGACGACCGCGAATGCATGATCCTGCGGCTGAAGATCAGCAAGGATCTTTTCCATCGGGCTGCCATCGACCTCGGCCGCGATCCCGCGACGGCAATCGTGCCGCAATTTCAACTCCGCGATCCGCGGCTCGAGGCAATCGTCTGGGCCTTGAAGGCGGAACTCGAGGCGGATGTGCCGTCCGACAATCTCTATGCTGATACGCTGGCGACAGCCTTGGCGCTCCGTCTGGTCGAGGCGGGCAGCGATACCTCCCGTCCGGTGGATAACGGCAGAGCGCTCTCGCCCCGTCAGAAGCGGATGCTCACCGACTACATCGAGGATAATCTCGACACGTCGCTTTCCCTTGCAGAACTGGCAGGTCTTGCCGGACTGAGCCTCTCGCACCTGAAGACGCAGTTTCGAAACAGTTTCGGCATGCCGCCGCATCAGTATGTGATGCATCGCCGGATCTCGCGTGCCGAGGCACTGATCCGGAGCTCCAGCCTGCCGCTGAGCCAAATCGCGCTGGAAGCAGGCTTCGCGCATCAAAGCCATATGGCAAACAGCATGAAGCGGCTGCTTGGCGTCAGCCCCGGCACCATCGCCCGCCTGCGTAATTGACGAAAAGTGGCCGATCCTGCAACGGATCGGCTGAATGCGCAGGCAATGCCCCTGGCGGCAGTACATCTTCCTTCGACAGCGAACAGAGGAAGGAATGATCATGTTTGTCATCTTCGGCGCCACCGGCAAGGTCGGCAAGGCAACGATAACAAGGCTTCGGGCCGAGGGCGCCCCCGTCAGGGCGGTGCTCCGTGATCCCGCCAAGGCTGCGCTCCTTGCGGCACTCGGATGCGAGATCGCCATGGCCGAGGTCGGCGATGTCGAAGCCATGGCCGCCGCGATGAAAGATGCGACGGCTGTACAGCTCATCTGCCCGATCGATCCGCGCGCCGCCGATGCCAGCCGCCAGATGCTGCAGTCGATCGAGCGGATGGCGGAGGCGATCGATGCCGCCCGGTCGCCGCGCATCTTGGCGATCTCGGATTACGGCGCGCATCTGACGATCGATACCGGCATTACCAGCTTGTTCCGCGCGATGGAGGAGCGGTTCCGGCGCTGCAGCACCAGGGTGAGCTTCCTGCGTTCGGCCGAACATATGGAAAACTGGGCACGCTACCTGAAGATCGCCGGCGAGACCGGCGTGATGCCGAGCATGCACCAGCCGCTGGCGCGACCCTTCCCGACCGTCTCCGCCGCCGATATCGGTGATATGGCCGCCGACCTTCTGCTGGCTGACCAGGATCATGGCCCTTCGCCGCGGATCGTCCATGGCGAGGGGCCGCGCGAATACACGGCGCTTGAGATCGCTGCGGCCGTGGGGCGGTTATCGACTCGGCGCGTCGTCGCACACGAACTGCCGCGTGCCGATTGGCCGGCAACGTTGATCCGCGCAGGACTTTCGGAAAGTTACGCCGATCTGATCGTCAGGCTCGGCGATGCGCACAACAAAGGTCTGATCGGAGCGGAAGCCGGCGTCGGTGAAATCCGCCGTGGCCGCACGGGGCTTTCCGCTGCCCTTGAACCATTTCGTCCCGACATCCGGCAGGCTGTGTCACGATAGCCTCAATCACGCATGGGGAGGAGTCCTGCCGGGGACCCTCTATCGGCGAACCAGCCATTTCTTGGCAACACGGCAAGCCATCGTATAGGCCGGATCGAAGACATTGCCGACATCATAGCGCACCACCTGGCCGAGCAGATGGCTGGCCGCCGTTCTGTCCAGGCCGTAATCCGCTGCCAGCCAATTCAACATTTCGCTGGTGGCGTGCTGGAGCGCCTGATCGAGGGGCCGGGCGTTGCCGATGGTGAAGATGTCTTCGGCCGTTTCTCCGCGCGGCCAGACCAGTCCGGCCTTCTTTTCCACCGTCAGCCGTACCGTGACCTCGAATGTCGTCTCGACGCCGGTTCCGACGATTTCGCCATCCCCTTGCACGGCATGGCAGTCGCCGAGAAAGAACAAGGCGCCGGGGGCCGCTACGGGCAAGCGGACCGTGGTGCCGGGGCCGAACAACCGATAATCCATGTTGCCGCCATATTCGCCGCTGGTTGCGGTCGATATCGCCTGGCCGAGGCTGGGCGCCACGCCGAAGCAGCCGATCATCGGGGCGAGAGGCAGAACGAAATTTTCGAGGCCGCCGACCGGCTCGGAAAGCCGGACGGTCAAGGCCTCACGGTCGATCGCCCAGATGGCGATATCGCGCGGCGGCAGGTCACGAACCGCCTCTGGATCGACCACATTGGCCGCGACGACGCTACGGGTGAAACCCGTGTCCTTGGTCGGGATCATGCTGATGATCTCGACCTTCAGCGCATCCCCGGGCTCAGCACCTTCCACGAAGATCGGGCCGTTCATCGGATTGGGGCCGGATGCCTGCTTGATGCCGTCCTTGTCATGCCCAGCGGCGTCGAGCGTCTCGGTGACGACGGTATCGCCGTCGGCGATGTGCAGGGCCGGCGAAAGAGAGCCGATGACATTGTGAAAGCTCGTCGGAATGAAGCGGTGAGTGGTCATGAAAGTACAGCTCTCGCTCGTTTCGAATGTCATGAGCCTGCGGCTGAACTGCCCGGCGGAGAGAGTGCGAGACTACTGCATAATTCCGTAAATCGGAATCGATTTACGGATAAAATTATGCAGCAATTCAAAGTGCTACAGCGCCGCTGTAGATCGGAGATTGTCGGCAATGCCAATATCCGCGGCACCACGCAATTCCGGTGTCGGCCTGCGCCGAGATTCCTTTCGACACCGCCCGTGCGTTTCATGCAGAAATCGCGCGGCGGCATCTATTGGGCCATCCTATATCAGGATGAGGGACCAATCCCGATCGCGAGGACCGAAGGGATCATTTATGAACCATAGATCGGAAACGCCTCGCACAAAGCGCAGACTTGTGTGCGCACCTCCGCCTCGGACTCGGCGATGCCCTCCGGGTTCTGCGACAGCGCATCGATAACAGTCAGGATCAGCACGCCGATCTTTTCGAACTCGGCTTCACGGAAGCCGCGCGTGGTTCCGGCCGAACTCCCCAAACGAATACCTGAGGTAACGAATGGCTTTTCCGGATCGTTCGGGATTGCATTCTTGTTGCAGGTCAGGCCCGCGCGTTCCAGCGCAATTTCAGCCACCTTGCCGGTTACGCCTTTCGGGCGCAGATCAACCAGGACCATGTGACTGTCCGTACCACCCGAAACGATCCCGAGGCCACCGTCCTTCAGAACTCTTGCCAGGGCTTGGGCATTGGAGATGACCTGGCCGGCGTAATCGGCAAACTCAGGCCGCAGCGCTTCACCAAATGCCACGGCCTTGGCAGCAATGACATGCATCAGAGGGCCGCCCTGATTGCCGGGGAACACAGCGGAGTTCAGTTTCTTCGCCAGTTCGGCATCATTGGTCAGGATGACACCCCCGCGCGGCCCGCGCAGGGTCTTATGGGTGGTCGAGGTGACGATATGGGCGTGGGGTACCGGGTTTGGGTATTTGCCGCCTGCGATCAGACCGGCGTAGTGGGCCATGTCGACCATCAAATAGGCCCCGACCTCGTCCGCGATCTTACGGAACCCGGCAAAGTCGATCTGGCGAGGATAGGCCGAGGCACCTGCGACGATCAACTTTGGCTTCGTCTCCAGAGCCTTTTCGCGGACCTTTTCCATGTCGATGAGATGTGTGTCGGCATCGACTTCATAGGAGACCACATCGAACCATTTACCCGACATGGTCACCGGCGATCCGTGGGTGAGGTGCCCGCCATGGGCGAGCGACAGACCCATGATCCGATCTCCGGGCTGCAGAAGCGCAAGGAAGACCGCTTGATTGGCCTGCGCGCCGGAATGCGGCTGCACATTCGCGAATTCTGCTCCGAAGAGCTGCTTCAGCCGGTCGATGGCCAACTGCTCGACCTTGTCGACGAATTCGCAGCCGCCGTAGTAGCGTTTCCCTGGGTAGCCCTCGGCATATTTGTTTGTCAGCACCGATCCTTGCGCCGCCAGAACGTCCCGAGAAACGATGTTCTCCGAAGCGATCAGCTCGATCTGGGTCTTCTGCCGCTCAAGTTCTTCGGCGATGGCGCTGGCAACCGCCACGTCGGAAACGGTGGTTTGTGACAAACGGTCAAACAT
Protein sequences of DBSCAN-SWA_14 >NZ_CP054034|146064:156911|151505_152408_+|WP_138334135.1|DBSCAN-SWA MGEVTTNEAYLPFRDYRTWYRVTGSLESGKLPLVVAHGGPGCTHDYVDSFKDIAALDGRPVIHYDQLGNGNSTRLPEKGPDFWTVGLFLEELDALLSHLGIQHRYAFLGQSWGGMLGAEHAVRRPQGLKALVIANSPANMHTWVSEANRLRQELPKEVQDTLLKHELAGSLTDPEYIAASRVFYDRHVCRVVPWPPEVARTFAIMDEDNTVYRNMNGPTEFHVIGTMKDWTIENRLDRIEAPTLLISGKYDEATPLVVRPYLERVPGCEWVLFENSSHMPHVEEKQLCLATVSGFLSRRD >NZ_CP054034|146064:156911|149608_150499_-|WP_138334134.1|DBSCAN-SWA MWREIRPDERFEIDVDGYRVVAYSFGTGSETVFCLNGGPGLPCDYLREAHSCLIDKGYRVVAFDQLGTGASDRPDDLSLWTIGRYVEETETVRKALGLGKVHMLGHSWGGWLAIDYALTYPENLKTLILEDTVADMPHLISELERLRAALGPETVSMMQKHEAQGTYNHPEYLAAVTILNYRHVCRLPEWPAPVRRSLDDWNMVPYETMQGPNEFLYIGNLKDWNRIPDLPRLTLPVFITTGEHDELTPACALRMKLALPNAELKVFANASHMPFYENPQDYYPALLDFLARHEAG >NZ_CP054034|146064:156911|155642_156911_-|WP_138334145.1|DBSCAN-SWA MFDRLSQTTVSDVAVASAIAEELERQKTQIELIASENIVSRDVLAAQGSVLTNKYAEGYPGKRYYGGCEFVDKVEQLAIDRLKQLFGAEFANVQPHSGAQANQAVFLALLQPGDRIMGLSLAHGGHLTHGSPVTMSGKWFDVVSYEVDADTHLIDMEKVREKALETKPKLIVAGASAYPRQIDFAGFRKIADEVGAYLMVDMAHYAGLIAGGKYPNPVPHAHIVTSTTHKTLRGPRGGVILTNDAELAKKLNSAVFPGNQGGPLMHVIAAKAVAFGEALRPEFADYAGQVISNAQALARVLKDGGLGIVSGGTDSHMVLVDLRPKGVTGKVAEIALERAGLTCNKNAIPNDPEKPFVTSGIRLGSSAGTTRGFREAEFEKIGVLILTVIDALSQNPEGIAESEAEVRTQVCALCEAFPIYGS >NZ_CP054034|146064:156911|146064_147774_-|WP_138334130.1|DBSCAN-SWA MSGSVLSVRDLTVRAHVEPGPRTLLDAVSLDLGKGEILGLVGESGSGKSLFCRSLVRLLPSSLLKIESGSVLLEGRDLMRIDDGEMLKVRGGEIGMIFQNPTSHLDPVMRIGDQIAEGIRYHQGLGAREARTAATEILAQVGFPDPKRQYDSYPHEFSGGMRQRAMIGVALSCNPKILIADEPTTALDVTIQAQILRLLIDIRDRRGLSIILITHDLGIVAQTCDRIAVLRGGKLLEEGPKRTILARPQHPYTINLINSHPSLPGAISAPLPELAEASQPARPLLEIDDLHVRFRAGGALLRGGAKTVSAVAGVSLQIMPGETVGVVGESGSGKSTLARAVLGLTPLSSGHVTFDGVDLALQKSAGLAKLRRETAMVFQDPYNALNPRLTIGQMLAEVLKVQGKVAKADIPVRIGELLDLVGLEREFAGRKPRSMSGGQCQRAGIARALAVDPKLIIADECVAALDVTIQAQIIELFRDLTAKMNLTLIFIAHDLAIVRNLCERVVVMYRGEIVEEGRSEEVFARPKHPYTAALIAAIPDIDPDKRLLQGADGKDDPHSMPIQSIKRMP >NZ_CP054034|146064:156911|148618_149557_-|WP_138334132.1|DBSCAN-SWA MHRYKFVLTRPLQFLPVIFGISVITFILVRLIPGDPARNILGTRATPAALASIRAQYGLDQPMWLQYVYFLKNLANGEMGKSILYKIDVLKLIVTRIEPTLALVVSSVVLSVLIAVPMAAIAARNAGRAPDHAVRIVSTFGIGFPPFWLGLMLIILFSVELGVLPVSGYGATIGEKLPHLVLPSLTVALSLSTVLTRSLRAAMIEQLKSDVATAARARGMPEGIVFWRHVLPNSLVPTINLLAVNIGWLIGGTVVVESVFALPGMGQLLVRAIFSRDYMVVQGVAMVFACATVLINFIADIVTVAVDPRVKL >NZ_CP054034|146064:156911|152501_153359_+|WP_138334137.1|DBSCAN-SWA MNDHPQTESATILATSKGRAWRGLEAELLRIPPGRTHISGTPYHRLGIHVGRPVRAQCRCDGREHRRLQKHGDIDVVPAGLDGVWEDDRECMILRLKISKDLFHRAAIDLGRDPATAIVPQFQLRDPRLEAIVWALKAELEADVPSDNLYADTLATALALRLVEAGSDTSRPVDNGRALSPRQKRMLTDYIEDNLDTSLSLAELAGLAGLSLSHLKTQFRNSFGMPPHQYVMHRRISRAEALIRSSSLPLSQIALEAGFAHQSHMANSMKRLLGVSPGTIARLRN >NZ_CP054034|146064:156911|153453_154359_+|WP_138334139.1|DBSCAN-SWA MIMFVIFGATGKVGKATITRLRAEGAPVRAVLRDPAKAALLAALGCEIAMAEVGDVEAMAAAMKDATAVQLICPIDPRAADASRQMLQSIERMAEAIDAARSPRILAISDYGAHLTIDTGITSLFRAMEERFRRCSTRVSFLRSAEHMENWARYLKIAGETGVMPSMHQPLARPFPTVSAADIGDMAADLLLADQDHGPSPRIVHGEGPREYTALEIAAAVGRLSTRRVVAHELPRADWPATLIRAGLSESYADLIVRLGDAHNKGLIGAEAGVGEIRRGRTGLSAALEPFRPDIRQAVSR >NZ_CP054034|146064:156911|150628_151441_+|WP_173863104.1|DBSCAN-SWA MLDDPILDDIGTIRRQFTAHETLDGRIDQAFEAMKQIGFEALIYDYTPVPYDLDGAIMIPSLLKLRNISDDMHDYWFNRGYFRIDPVQQVALRTSAPFFWNYDPDADTLINRFMNDDTVPVTRYLSERDMSTGVTVPVHMPRGDYATVTGIRFGRNKDFERHALRYIADFNLLAHVFHETAYSLFDTRAKSVGTIRLTERERECLRHSAEGYSAKEISRIIDRSVPTVVMHLNAATKKLGARNRTQAVVRATHYRLLEDRPTHNWPPYNL >NZ_CP054034|146064:156911|147770_148622_-|WP_003549355.1|DBSCAN-SWA MSIEAIAPASPGWRRFFGRRPMLVLSAGLLLFFVLLAIGAPIVAPYDPIMQNAEVRLQAPSLLHPFGTDNFGRDILSRVIWGARLDLQMALIGVIFPFLIGTTVGTIAGFFGGIVDALFMRLVDIILAFPFLVLMLSIIAILGPGLGSFYIAMALVGWVSYARLIRAQMLVLKGSDYAVAAVSLGFSRPRIMFRHLLPNAIAGSIVFSMSDATLVLLSGAAVSYLGLGVQPPIAEWGVMVAEGQSFITTAWWITLFPGLSIVCLAFGFSMLGDALGELLGVHE >NZ_CP054034|146064:156911|154398_155310_-|WP_138334141.1|DBSCAN-SWA MTTHRFIPTSFHNVIGSLSPALHIADGDTVVTETLDAAGHDKDGIKQASGPNPMNGPIFVEGAEPGDALKVEIISMIPTKDTGFTRSVVAANVVDPEAVRDLPPRDIAIWAIDREALTVRLSEPVGGLENFVLPLAPMIGCFGVAPSLGQAISTATSGEYGGNMDYRLFGPGTTVRLPVAAPGALFFLGDCHAVQGDGEIVGTGVETTFEVTVRLTVEKKAGLVWPRGETAEDIFTIGNARPLDQALQHATSEMLNWLAADYGLDRTAASHLLGQVVRYDVGNVFDPAYTMACRVAKKWLVRR |
10 | Aeromonas_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_15 |
168992 : 175969
Sequences of DBSCAN-SWA_15
Nucleotide sequences of DBSCAN-SWA_15 >NZ_CP054034|168992:175969|DBSCAN-SWA AATGAACACTGCGATGAATGACTCCAACGAAGTATTGGTCGACTGCCAGAACGTCTGGAAAATCTTCGGCGGCCGGGCGTCGGCGGCCGTCAAGGCCGTCTCCGATCGAGGATTGTCCAAGACCCAAATCCTGCAAGAGTTCGACTGCGTGGTCGGGGTGTCCGGTGCCAGTTTGCAGGTCCGACGTGGCGAAATCTTCTGCATCATGGGGCTTTCGGGCAGCGGCAAGTCGACGCTTATTCGCCTTCTCAACCGCCTTATCGAGCCGAGTCTCGGCAAGATCACGGTGAAAGGCAAGGAGATCGCCAAGCTCAATCCCGCCGAACTCCGTGACATGCGCGCGCGCAATATCGGTATGGTCTTCCAGAGCGTAGCGCTCCTTCCCAACAGAACGGTCATCGAGAATGCCGCATTCGGCCTTGAGGTTCGCGGAGTTCCCAAAGTTGAGCGCGAGAAGACGGCACGGGCCGCACTCGACAAGGTAGGTCTTTCTGACTGGACCTCCCGGTATCCATCCGAGCTGTCAGGCGGCATGCAGCAACGCGTCGGTCTTGCGCGCGCGCTTGCTGCCGATCCTGAAATCATCTTGATGGACGAACCCTTCAGCGCTCTCGACCCGCTCATCCGCCGGCAGCTCCAGGATGAATTCAGACAGCTGACGAAGTCGCTCGGCAAGTCGGCAGTCTTCATCACCCACGATCTCGAAGAAGCTATTCGCATCGGTGACCGGATTGCAATCATGAAGGACGGCGTGATCGTGCAGGTCGGCACGGCCGAGGAAATTGTCACGAAACCTGCGGACGACTACGTTTCCGATTTCGTCGCGGGCATTTCCCGGATCCACCTGGTAAAGGCGCATTCGGTGATGTTTCCCGTCGCCGAGTTCAAAGCCGCGCAGCCGCACTGCGACGTCGAAACGCTTCTCAAGACCTCGCCCGAGGCCGATATCGGCGAACTCATCGATCTCACCATGCAGTCCGACCGTGACGCCGTGGCAGTCGTCGAGAACGGAGCCGTTATCGGTGTCGTCACCACCCGCGGCTTGCTTCGCGGCGTAGCCGGAGCGCCTCTCGGAGAGATGCCAGCTTAAGGGAGAAGGCGAATGGACACCTCGGTCATAACGGATACTTTCGATACCTGGACGGATGATGCTCTCAGCTGGATCAGCGACAACGGCGAGTGGCTGTTCGAGTTCCTGCGGTCGGTCCTCGAAGGCACCTATGCAGGCATTCTCTGGCTGCTTCAACTGCCGCCTTTCTACGTTGTCGCAATCATCGTGGCGCTGCTGAGCTGGCGGCTGATAAATTTGACATCCGGCATTCTCGCTGGCATCGCGATCATGGTCTGCGCGATCATGGGGCTGTGGACCGAAACCATGAGCACCCTGGCGCTCGTGGCAACAGCGACCATCCTCGCCCTTGCCGTGGGCATCCCCATCGGCATCGTCGCGGGCTTCGTCAAATCCTTCGACAGGTTCCTCGAGCCGATGCTCGACCTGATCCAGACCCTGCCGCCCTATATCTATCTGCTTCCCGCAATCGCGCTGCTGGGTTATGGCCCTGCGACGGCGCTTGTAGCCACCGTCATCGTTGCGATGCCGCCGGCGATCCGCCTTACGTCGCTTGGTATCCGCATGACGCCGCGCGAGTTCATCGAGCTTGGACAGGCGAGTGGCCTGACACCGTGGCAGATGTTTACCAAGATCAGACTGCCGTTCGCCATTCCGAGTGTGATGGCCGGCATAAACCAGAGCCTGATGATGGCCTTCGGCATGGTCGTGATCGCCGGCATCGTCGGCTCCGGCGGCTTGGGCGAGACGATCTACGGGGCGGTTCGCACGCTTGATATCGCTACCTCTATAAACGCTGCGATTGCCATCGTCATCCTGACCATGGTTCTCGACAGACTCACGCAAAGCGCTGCGCATCATGCGAAGGCAGGTGTAAGATGAACGCTGATCTTTTCCGGTTCTCGCCGGGCGCCTACCTCGCTCCGGTGGTCGATTGGCTCAACACCAATTTCCACCCCTTCTTTGATGCCGTGACGAAGTTCATCGAGGCTGTGCTCGGCGGAATTGAAGCTGTCCTGCTCTATCCGCCGGCTTACGCGGTGATCGTCGTGGTCGTGCTTCTTGCAGCATTCCTCATCAACCTCAGGGTTTCGATCGTGGCGGCTGTCGCACTCATGTTCTGCTTCTTGACGGGGCTTTGGATTGCGTCGATACAGACGCTGGCTCTGGTTACCGTCGCGGTCATCATTTCAGTGGCAATCGCCTTTCCGCTCGGCGTGATCGCGTCGCGTTACAGAAAATTCGAAGCCGCCATCCGGCCGGTGCTCGATATCATGCAAACCGTGCCGCCATGGGTCTATCTCATCCCCGCCGTCATGATCTTCAGTCTGGGCCGTGTCCCGGCGATCATCGCCACCATCGTCTATGGTGTTCCGCCAATGCTGCGTCTGACGACGCTGGCGTTCAAACAGGTTCCCAAGGATCTTCTGGAACTCGGCCAGGCCACTGGTGCCCCGCCCCGGGCGATCCTGTTCAAGATCGAAATCCCTTCCGCCACACCGACGCTTCTGGTCGGCCTCAACCAGTGCATCCTCCTCTCCCTTGCAATGGTCGTGCTGGCCGGACTTGTCGGAGCAGGGGGCCTCGGTGCCGAAGTCACCCGCGGGCTGACGCGCATGGAAATGGGGTCTGAGGGCAGGACTGGCCATCGTCGCGGTCGCTATCCTTCTCGACCGCCTGTCGAAGGGCGCCTTGCAGGGCCGCCGCAATCTTGGCGTGTAAGGCCTGATCGAAAGGAAAACTCATGCGTCGCAGCTTCTTCTGTATCGACTCCCATACTTGCGGCAATCCCGTGCGATTGGTCGCAGGCGGCGGCCCGATGCTGCCGCATCTCCCGATCGCCGAGCGCCGCGAGCTTTTCGTCCGCGACCATGATTGGGTTCGCCAAGCCTTGATGTTCGAGCCCCGCGGCCATGACATCATGTCGGGCGCGATCATCTATCCCGCCTACCGCGAGGATTGTGATTTCGCGGTGATCTTCATCGAGGTCAGCGGTTGCTTACCAATGTGCGGGGCGGGCACTATCGGACTTGTCACCGCCGCAATCGAGGAGGGGCTGGTGACGCCTCGCATCCCAGGTCGGCTGTCCATCGAGACGCCGGCGGGAAGGGTCGATATCTCCTACGACAAGCCTGGCGAATTTGTCGAATCCGTCCGTATGTTCAACGTCGCGAGCTATCTCCATGCGGCGGATGTCGAAGTCGACGTCGCGGGTCTGGGAAAGCTGGTGGTCGACATTGCCTATGGCGGCAATTTTTATGCGGTCGTCGAGCCGCAGGACGCTTGGCAGGGCCTTGATGGCATGACGGCTGCAGACATCGTCGATCTCAGCCGTAAGCTGCGCGACGCGCTGGCCACGATCTGCGATCCGGTGCACCCCGAAGACGAAAGAATAGGCGGGCTCCATCATGCTCTTTGGTGCGACAATGTGAGTGGGCCGGATGCCGACGGACGCGGCGCGGTCTTCTACGGCGACAAGGCGATCGACCGCTCGCCGGGCGGCACCGGAACATCGGCGCGCATGGCGCAGCTACACGGCAAAGGACGGCTGCGTCCTGGAGACACGTTCCGCCAGCAGAGCCTGATCGGCACCGTGTTTGAAGGCCGCGTCGAGGCAGAGACCATGGTCGGCCCGTTCAAGGGCATCAAGCCAAGCGTCGGCGGCTGGGCTAGAATCATCGGGCACAACACAATATTCGTCGATGACCGCGATCCGCTGGCGCACGGTTTCCAGATAGTTTGATTGGACGCAAGTATCGGGTGGCATAATCGGTGATGCAGGCTGGCCCGGATCTGCCGTACCCAAAGCGCTGATCGGTCGCGCTATCCAAACGTCGCGAATGGCGCCGGACGGGGCGCTTTGAGTTCTTCTTCCGGGCAAGGCCGCTCCTACAATGCCTTGCCCACCTTCCTTTCAATCCGATAGGCAGATCATGACAAATATCGAGCAGATGCAGGAACAGCGACTTGAAGCGCTCCGACTGACGAAGTCGGGATCACCGAAGCTGCCGTCCAATCCGTTTTTCGGCGTCGGTCCGATCACGGATGCGGCCACCGACGAAGTGGATAGCCGCCTGCGGCGCATCGCTTTCGATACATGGATCGAGAAGACCTATCGCAAGTTCGACGACAAGGGCAACGATATCGGCGGCTTCACCACCGCCGAAATTACCCGCAGCATGCACCGCGGCTACCCGGCGGATAAAATTCTGACCGACATGATGCGGGCGATCCACCGCTATTTCCACTTCCCGAAGGCGAACCGGATGGCTGTGGGACTCGGCGGCGGTCACAGTGGCTATACCGTCTGCGTCCAGCACCTGATGAACGCCAACGACGCCTCGCAGCGCGTCTACGTCGATACGCCGCGCCCGGAAAGCGATCCGGCCCGCGCGGCAGGCTTCTTCCGTCAGTCCTGGGCCACACAGCTCATCGAGATGCAGCGCTTCGCCGAAAAGGGCTGCGAAAGCCGCATTCATTTTGCCGCCTCCGAGGGTGTGATTCCGACGGCCGCCGAGCTTTCCGCTCTCGGCGTGTCGATCTTCGTCGGCGTCGGCCACGAGACGACCGGCGCCAATGCCTATACCAGCCGTGAGATCCACGAGCTGCTGAACTGGATCGATGGCGATCCGGCAAACCGCCATGCAGTGTTCGACGCGACGTCGATGCTGGGCGCCATGCCTTGGGAGCCCGAGCTGGTCGATGCCGTAATGGCGAAATGCTGCTTGTTCATGCCGTTCCAGAAGGCGATCGGCGGGGTGTCGGGGTATTTCGTCGCCTCCTTCACACCGCATGCTTTGGCGCTCGTGGAGAAAAACCAGCAGGATCCGTCCTGGGCGATCCCGCGCCAGCTCAAGATCGCGCCGCCGATCGATGCCCGGCAGCCGCTTTCAGCCAAGCGATCCGTGGATGCCGGGCCGTTTTACGATGCAGCGGAAGACCGCATGCTCGGCGGCGTCATCAATACCTACAGCGCGCTTGCCTTTGCCGAAACCACCTTCGGCCTCCTGCAGTCCGAGGCGCGGGTCGGCTCGGTGGTCGAGCTCAACAAGCGGTCTGCGGCCAATCGGGCGGTGATCGACGGCTGGGTCGAGTCGCACCCGCTGTTTTCGCTCACCGTCACCGATGCCGAGCGGCGCGGCGCGGCGGTGACACTGTTAAAGGTGGAGGACGCCGGCATAACCGATCCTGCGATCCATGCCCGCATCATCGCTCGTTCGAAGCAACTGCTCGGCTATGAGGGCTTTACCCATCCGAACGACGCGTTCGAGCCCGGCCTCGACGCGGCGCGCTATGTCAACGCATTCCCGGGAACGCCCGGCGACTACCGCGCCTGGGTCGGCGGCATCCGCGAGCCGACCGATGTCGTCGCGCTGCTGGAGAACCTGCAATATGCCTATCTGCGCGCCAAGATCGTCGTGATCGAGGAGGAACTGGCAAAGCAGGGCGTCACGTTCGCGGCGCCTGTGAAGGCCGGCGCGATGGTCCGCAAGGACGATCCCGAACACGCCTATACGGTGCTGATCGCCGATCTGGTCGGCCTGCGCCTCGGCGCCGATGGGGAGCCCGACGACAGCGAGATCAGGACCTATGTCGAGGAAAAGGGCGGCGTCTTCCATGCCGGCCCGATCGACGACAGGGCGGCGCTGGAAAAGGGCCGCATCCATTTCTTCTACCAGCCAAATCTCAGCACCGAGGCCGAAATCCTGCCGCAGACGGACAAGGGCCAGTATGACGCGCTGATCGCGGCGGCGACCTTCATCCCGAAGGCTTCCGTCTTCCCGTTCGGCGGCGTGCGCATCGGCGCGGGTACCGGCAATATGGGCTCAGCCTCCTGGGGCGGCGGCAACGGCGAGGGCGGCGAGGCGCCTTTGATGAACACGCCCGGCATCAACAGTCGGGCGACCGCCCAGATGGCGGTGAAGGCGATCCTCAAGGTTGTTCCCGACCTGCCCGTCGACAGGCTGCACCAGATGGTCGCCAACGGCGACTTTGATACCGGGCGCCAGCTCAAGGATTTCCCGACCGCCAAGCTGGAGGGCCGGAAAATCGCCATTCTAGGCTACGGCAATATCGGTCGCGAAGTCGCCAAGCTCGCCAAGGCTTTCGGCATGAAGGTGGTGATCTACGCGCGCCAGCATCACCAGCGCTGGATCGAGTCGGAAGGCTTCGATTACGCCGCAAGTCCGGTTGAAGCGGCAACCGCCGCCGACGTGCTCTCCGTCCATATCGGTCTCGGCCGGCTGGATGCTGTGACAGGCGTTTACTCCAACGCAGGGGTGGTCGACACACAGGTTCTCGGCACGATGAACGATGGTGCGGTGCTGGTCAACTACGACCGGGGCGAGGTCGTCGATGCAGCGGCGCTCGACCGGGCGCTGTCATCAGGAAAGATTGCCCATGCGGCAATCGACGCGGATCTCTTCAAGGATGCAGAGACCGGCAAGCTCAGCGGACCGATGGTTCCCTATCTGCCGCTCGAAGAGCGCCATAAGGGCAAGCTGGAACTGCTGCCGCACGCCGCCGCCGATACCGACCATCCTTCTCGGGTGACCGGCGCCAAGCAGGCGGTCGATCAGATCTTCGATGTCATCCGCTTCAAGTCGGTCACCAATCTGAAGGGTGATCTGCCGGACGGGTACGTCTCGGCAGGCGGCCGCACACCGGCCGGAATCGGCAGGGTGACGAAGCGGGTGGTCAGCGAGATCAGCGGCAATTCGGAACTGCTGGACGAATTGCGGCGGACCTCGGAAAAGGTCGCGGCGATCGTCGGCGCGCTCTCCGTTGTCTCCGACGCGAGCCACCGCGACCGGATCGTCGATCGCTATACCGGTCTGCTGGCCGAAAATGCCGGCCGGCAGCGGGCGCTTCTCGAACAGCTTGGTCTCTACGGGCCGGCGGCGGAGTAA
Protein sequences of DBSCAN-SWA_15 >NZ_CP054034|168992:175969|168992_170081_+|WP_138334160.1|DBSCAN-SWA MNTAMNDSNEVLVDCQNVWKIFGGRASAAVKAVSDRGLSKTQILQEFDCVVGVSGASLQVRRGEIFCIMGLSGSGKSTLIRLLNRLIEPSLGKITVKGKEIAKLNPAELRDMRARNIGMVFQSVALLPNRTVIENAAFGLEVRGVPKVEREKTARAALDKVGLSDWTSRYPSELSGGMQQRVGLARALAADPEIILMDEPFSALDPLIRRQLQDEFRQLTKSLGKSAVFITHDLEEAIRIGDRIAIMKDGVIVQVGTAEEIVTKPADDYVSDFVAGISRIHLVKAHSVMFPVAEFKAAQPHCDVETLLKTSPEADIGELIDLTMQSDRDAVAVVENGAVIGVVTTRGLLRGVAGAPLGEMPA >NZ_CP054034|168992:175969|170093_170942_+|WP_138334161.1|DBSCAN-SWA MDTSVITDTFDTWTDDALSWISDNGEWLFEFLRSVLEGTYAGILWLLQLPPFYVVAIIVALLSWRLINLTSGILAGIAIMVCAIMGLWTETMSTLALVATATILALAVGIPIGIVAGFVKSFDRFLEPMLDLIQTLPPYIYLLPAIALLGYGPATALVATVIVAMPPAIRLTSLGIRMTPREFIELGQASGLTPWQMFTKIRLPFAIPSVMAGINQSLMMAFGMVVIAGIVGSGGLGETIYGAVRTLDIATSINAAIAIVILTMVLDRLTQSAAHHAKAGVR >NZ_CP054034|168992:175969|172993_175969_+|WP_138334165.1|DBSCAN-SWA MTNIEQMQEQRLEALRLTKSGSPKLPSNPFFGVGPITDAATDEVDSRLRRIAFDTWIEKTYRKFDDKGNDIGGFTTAEITRSMHRGYPADKILTDMMRAIHRYFHFPKANRMAVGLGGGHSGYTVCVQHLMNANDASQRVYVDTPRPESDPARAAGFFRQSWATQLIEMQRFAEKGCESRIHFAASEGVIPTAAELSALGVSIFVGVGHETTGANAYTSREIHELLNWIDGDPANRHAVFDATSMLGAMPWEPELVDAVMAKCCLFMPFQKAIGGVSGYFVASFTPHALALVEKNQQDPSWAIPRQLKIAPPIDARQPLSAKRSVDAGPFYDAAEDRMLGGVINTYSALAFAETTFGLLQSEARVGSVVELNKRSAANRAVIDGWVESHPLFSLTVTDAERRGAAVTLLKVEDAGITDPAIHARIIARSKQLLGYEGFTHPNDAFEPGLDAARYVNAFPGTPGDYRAWVGGIREPTDVVALLENLQYAYLRAKIVVIEEELAKQGVTFAAPVKAGAMVRKDDPEHAYTVLIADLVGLRLGADGEPDDSEIRTYVEEKGGVFHAGPIDDRAALEKGRIHFFYQPNLSTEAEILPQTDKGQYDALIAAATFIPKASVFPFGGVRIGAGTGNMGSASWGGGNGEGGEAPLMNTPGINSRATAQMAVKAILKVVPDLPVDRLHQMVANGDFDTGRQLKDFPTAKLEGRKIAILGYGNIGREVAKLAKAFGMKVVIYARQHHQRWIESEGFDYAASPVEAATAADVLSVHIGLGRLDAVTGVYSNAGVVDTQVLGTMNDGAVLVNYDRGEVVDAAALDRALSSGKIAHAAIDADLFKDAETGKLSGPMVPYLPLEERHKGKLELLPHAAADTDHPSRVTGAKQAVDQIFDVIRFKSVTNLKGDLPDGYVSAGGRTPAGIGRVTKRVVSEISGNSELLDELRRTSEKVAAIVGALSVVSDASHRDRIVDRYTGLLAENAGRQRALLEQLGLYGPAAE >NZ_CP054034|168992:175969|171804_172803_+|WP_138334163.1|DBSCAN-SWA MRRSFFCIDSHTCGNPVRLVAGGGPMLPHLPIAERRELFVRDHDWVRQALMFEPRGHDIMSGAIIYPAYREDCDFAVIFIEVSGCLPMCGAGTIGLVTAAIEEGLVTPRIPGRLSIETPAGRVDISYDKPGEFVESVRMFNVASYLHAADVEVDVAGLGKLVVDIAYGGNFYAVVEPQDAWQGLDGMTAADIVDLSRKLRDALATICDPVHPEDERIGGLHHALWCDNVSGPDADGRGAVFYGDKAIDRSPGGTGTSARMAQLHGKGRLRPGDTFRQQSLIGTVFEGRVEAETMVGPFKGIKPSVGGWARIIGHNTIFVDDRDPLAHGFQIV |
4 | Bacillus_virus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_16 |
189604 : 193599
Sequences of DBSCAN-SWA_16
Nucleotide sequences of DBSCAN-SWA_16 >NZ_CP054034|189604:193599|DBSCAN-SWA ATCACACAAGCGCGATGTCCCGATCGTGCTGTGCGGCGGCGGGTATGACGGCAGCCTCAGCCAATGAGCTTCGAAGTGCTTCGACAACCCTGACGATGTCGAGATCCGTCATCTGGGCATAAAGCGGCAGGACTATCGTTTGCCGCTGCGCGGCCACGCTTCGCTTCAGACTTGTGGCAGCACGATGAGAACTATGGTGGGAATAGGCACCTTCCAGATGAATGTTCATCACACCGCGTCGGGTCGAGATCCCTTGATCGAGCAGTGCCTGCATGACCGTCCGCTGGTCGACGGCATCCGGCAGTCTCACGCAAAAGCTCTGCCAGTTGCTGCGAGCCCAATGCGGCTCGACCGGTGAGAGCCCAGGGATCGAAGACAGGCGCTCACAATATTGCTCAGCAAGCCGCCTCCGCTGTGCAATCAGCCCCGGCAACCGCCGCAACTGTTCCCTTCCGACGGCAGCCTGAAGGTCCGTCATACGGTAGTTGTAGCCCAATTCGTCATAGTCTTCGAAAATCACCTGCTTCGAGCCATGGCGAACAGCATCGGTAACGCTCATGCCATGCTGACGCCACAACCGGAATTTCCGGTCATATTCGGGATTGGCGGTGGTCAACATGCCACCATCACCTGTCGTGACGACCTTTCGGGGATGGAAGGAGAAACAGGCAATGTCGCCATGCGGCTTGCCGATCTTTTCCCAACGCCCGTCCCACAGGATTTCGCTTCCCGTCGCGCATGCCGCATCTTCGATGACCGGGAGCTGATGGCGCTTGCCGATCTCCACGATGGAGCGGAGATCGCAAGGCATGCCAAGCTGATGCACACATAGGATTGCCTTGGTGCGGGGCGTGATGGCTGCTTCGATCAGGCTGGGGTCGATATTGTAACCATCTGCTTCGATGTCGACGAAGACAGGCACCGCATTGCAGTATCGAACAGCATTCGCGGTGGCGATGAACGAATGGCTGACCGTGATGACTTCATCGCCGGCGCCGACATCGACCGCCATCAAGGCGAGATGCAGTGCGGTCGTGCAGTTTGAAACGGCGCAGGCATGTGCAGCGCCGACAAAGGCCGCGAATTCGCCCTCGAAGGCCGCGACTTCCGGCCCCTGCGTAACCCATCCTGACATAATTACACGGCGCGCGGCCTCAGCCTCCTCTTCTCCGAGGACGGGCTTGGCGACCGGAATCGTTGATAGAGACGAGCTCATGCAGCTTGCCCTCCTGCCGCAGCGGTCTTCATCTGCCACCATGCGACAAGGTCGCGAAGCCCTTGCTCCATCGATATTTCCGCCTTGAAGCCCAGAAGCTTCTCGGCCTTGCTGATATCAGCAAGACGACGGGTCACGCCGTTGACCGTGCGGGCTTCCTTATGCTGCGGTTCGAGTGAGACACCCATGATGCTGCTCAACATCTGCGCAAGTTGCAGGAGGCTTATTTCCTCGCCGCTCGCGACATTGAACACTTCATCCGTGACATCGCTCTTCGCAGCCAGCAGGTTTGCCCGGGCGATGTCACGTGCATCGACGAAGTCCATCGTCTGGCTGCCATCTCCATAGACAAGGGGTGGCATTCCGGTCGCCAGGCGCTCCATCCAGCGGATCAACACTTCGGTATAGGCGCCGTAAACGTCCATTCGCGGCCCGTATACGTTGAAATAGCGCAGCGCCACATAGCGCAGGCCGTACATCTCCGCAAAGCTGCGAAGCAGCCCCTCGTTAAACGTCTTGGCCGCCCCGTAGATCGTCCTGTTATTATAGGGATGATGCGCTTCGGTGGTCGGAAAGCTCTCGGCCAAACCCAGCACAGAGGCAGAAGAGGCAGCGACGACCTTCGACACACCTGCCTTTACGGCAGCTTCGAGAACATTGAACGTGCCTTCGGCCAGTACATCGAAGGCAAGCCGTGGGTCTTCGGCGCATTGCGTGATGCGGATGGCCGCTTGGTGAAAGACGATATCCACGCCCTCGAAGGTTTTCGCCAACAACGCTCTGTCACGAATATCTCCCTCGATGATGTTGACGGACCCGCCAGCTATTGCCGTACTGAGATTGTCCCGGCGTCCACGCACGAAGTTGTCGAGGATGATAATCTCGCGCGGCTTCTCCAATGCGACGAGATCGGCGATATGCGACCCGATCAGGCCGGCCCCGCCGGTAATGAGTACCCGTTGATCTCTCACGATCTTCCTCCATGCGATATATCCAGCACAGGATCGGCTTCCGTGCGCCAGCGTATGAACTTCGCCGGCACACCCGCCACGACTGAAAACGGTTCGACATCGGAAACAACGACTGCGCCTGCTCCGACGATCGCCCCTTTTCCGATGCTCACGCCGGGAAGAATAGTTGCATTCGTTCCAATATCGGCCCATGCGCCGATGCGAACCGGCTTGATTTCGAGATCCGTGCGAATGATCGGCACATCCACCGGCAGTGCGGTGTGGGTCGACCCAAGCACTTTGGCGCCAGGGCCCCAGCCGACACAATCCTCGATAACCAGGTCGCGTGCGTCGAAATAGGCCATCGGGCCGATCCAGACATTGTCGCCGATTACGCAAGTGCCGTCGAAGCGTCCCTGGATATAGGCCTGAGCGCCGATGAACACGCCATTGCCGATTTCAAACGTTTCAGGATGTTTGAAACCGGCGCCGCCCGCAACCTGCAGTCCCGCGCCGCAGCTTCGCGCGATCGCTTGCCAGATGGCTTTCCGCATCAGTGTGTCGACGACACCGTCGCCTGTGGCGAAACGACCGTAAAGGTCTATCAGACCAGCGCGCCCGTATGATTGCCTGAGCTCTTCGACCAACCCAGTTTGGTAGCCAGGATCTGCCGGCCTTTCATGGCGACCATGGACCGCCTGCACCAGTCGAGCCTCGCCGGCGCGGTTAACTGACATAAGCATACTCCAGTGCGGCGGCCACCTGGTCGACGTGTCGTAAAGGCATCTCTGGATAGATCGGCAGCGAGAGGACCTCCCGTGCCGCGGCTTCCGAGACCGGGAAATCCCCGGCCTGGTAACCAAGGTCGGCATGGGCTTTCTGCAGATGAACAGGGATGGGATAATGGAGCCCGGACGGAATGCCTTCCGCGCTGAGCAGGCGTTGAAGCCCATCGCGATCACGGCTTCTTATGGCATAGACATGATAGACGTGACGCCGGTCGGCTGCTTCGACAGGGATTGTCAAATCCGTCGATCCGGCAAGCAGCGAAGAATAGCGGCGCGCATGACTGCGACGCGCCTCGGTCCAGGCTTCAAGGTGCCGGAGCTTGACACGTAGGATCGCGCCTTGAATGGCGTCCATGCGATAGTTAAAGCCCTTCAGCAGATGGTGATAGCGCTGCTCCTGGCCCCAGTCTCGCAGCATGCGCATCGTCTTGGCCTGATCATCGTCATTGGTAACGACCATACCCCCCTCGCCGCAGGCGCCGAGGTTTTTGCCCGGATAGAAGCTGAAACAGCCGGATAGGCCGATACTGCCGGCGCGGTGGTCTTTGTATTGCGCGCCATGGGCCTGGCAGGCGTCTTCGATGACGGGTATCCCGTGGCGGTCGGCAATTGCCTTGATCGCGTCCATATCGGCCATTTGACCGTAGAGATGGACGGGAACAATGGCCTTGGTGCGGGGAGTAATCTTTGCCTCGACCTCCGCCGGATCCATCGTGAGTGTCACGGGCTCAACATCGACGAACACGGGTCGTGCGCCCGTGTAGCAGATCGCCGACACGGTTGCGACGAACGTGAATGGCACCGTAATGACCTCATCGCCGGGACCAACGCCCGCCGCAAGCAGGGCAAGATGCAAGGCACTCGTTCCGGTGTTGACGGCTATCGCATGCTTGACGTTGCAATAGTCGGCGAATTCCTGCTCGAAATGCGCGACCTCGTCGCCCAATACGTACTGCCCCGAGGCGAGGACGCCGAGCACGGCGGCGTCAATCTCACCCTTGATTGATTGATATTGTGCCTTAATGTCCAGGAACGGGATCAT
Protein sequences of DBSCAN-SWA_16 >NZ_CP054034|189604:193599|192495_193599_-|WP_138334183.1|DBSCAN-SWA MIPFLDIKAQYQSIKGEIDAAVLGVLASGQYVLGDEVAHFEQEFADYCNVKHAIAVNTGTSALHLALLAAGVGPGDEVITVPFTFVATVSAICYTGARPVFVDVEPVTLTMDPAEVEAKITPRTKAIVPVHLYGQMADMDAIKAIADRHGIPVIEDACQAHGAQYKDHRAGSIGLSGCFSFYPGKNLGACGEGGMVVTNDDDQAKTMRMLRDWGQEQRYHHLLKGFNYRMDAIQGAILRVKLRHLEAWTEARRSHARRYSSLLAGSTDLTIPVEAADRRHVYHVYAIRSRDRDGLQRLLSAEGIPSGLHYPIPVHLQKAHADLGYQAGDFPVSEAAAREVLSLPIYPEMPLRHVDQVAAALEYAYVS >NZ_CP054034|189604:193599|190815_191790_-|WP_138334179.1|DBSCAN-SWA MRDQRVLITGGAGLIGSHIADLVALEKPREIIILDNFVRGRRDNLSTAIAGGSVNIIEGDIRDRALLAKTFEGVDIVFHQAAIRITQCAEDPRLAFDVLAEGTFNVLEAAVKAGVSKVVAASSASVLGLAESFPTTEAHHPYNNRTIYGAAKTFNEGLLRSFAEMYGLRYVALRYFNVYGPRMDVYGAYTEVLIRWMERLATGMPPLVYGDGSQTMDFVDARDIARANLLAAKSDVTDEVFNVASGEEISLLQLAQMLSSIMGVSLEPQHKEARTVNGVTRRLADISKAEKLLGFKAEISMEQGLRDLVAWWQMKTAAAGGQAA >NZ_CP054034|189604:193599|189604_190819_-|WP_138334178.1|DBSCAN-SWA MSSSLSTIPVAKPVLGEEEAEAARRVIMSGWVTQGPEVAAFEGEFAAFVGAAHACAVSNCTTALHLALMAVDVGAGDEVITVSHSFIATANAVRYCNAVPVFVDIEADGYNIDPSLIEAAITPRTKAILCVHQLGMPCDLRSIVEIGKRHQLPVIEDAACATGSEILWDGRWEKIGKPHGDIACFSFHPRKVVTTGDGGMLTTANPEYDRKFRLWRQHGMSVTDAVRHGSKQVIFEDYDELGYNYRMTDLQAAVGREQLRRLPGLIAQRRRLAEQYCERLSSIPGLSPVEPHWARSNWQSFCVRLPDAVDQRTVMQALLDQGISTRRGVMNIHLEGAYSHHSSHRAATSLKRSVAAQRQTIVLPLYAQMTDLDIVRVVEALRSSLAEAAVIPAAAQHDRDIALV >NZ_CP054034|189604:193599|191786_192506_-|WP_138334181.1|DBSCAN-SWA MSVNRAGEARLVQAVHGRHERPADPGYQTGLVEELRQSYGRAGLIDLYGRFATGDGVVDTLMRKAIWQAIARSCGAGLQVAGGAGFKHPETFEIGNGVFIGAQAYIQGRFDGTCVIGDNVWIGPMAYFDARDLVIEDCVGWGPGAKVLGSTHTALPVDVPIIRTDLEIKPVRIGAWADIGTNATILPGVSIGKGAIVGAGAVVVSDVEPFSVVAGVPAKFIRWRTEADPVLDISHGGRS |
4 | Tupanvirus(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_17 |
198489 : 199533
Sequences of DBSCAN-SWA_17
Nucleotide sequences of DBSCAN-SWA_17 >NZ_CP054034|198489:199533|DBSCAN-SWA CCTATCGAATGGCTTCAGGAGTAGTGAGCTCGACTTTCGCGGACGCCAGATGCGGATAAGCCTCTTGCGAGGAATTGGCGCTGGGCCAGTTCCTTGCGCCGCCGGAAAGCTTCCACTCGAAATAAGCGATTGTTCTCTCAAGTCCCTCACGAAGGTTCACCTTCGGCTGCCAGCCCAGTTGTTGCGTTGCGCGGCTGATATCGGGCTTGCGCTGTGTCGGATCGTCAACTGGCAGATCCTTGAAGACGATGCCGGACTTCGATCCGGTCATTTCGACCACCATTTCTGCCAATTCACGGACCTGGAATTCTCCGGGGTTGCCGAGATTGATGGGGCCTGTAACCCCAGCAGGCGCGCCCATCAGACGGATAAAGCCGTCGATCAGGTCGTCCACATAGCAGAACGATCGTGTTTGCCTGCCGTTGCCAAAGATGGTGATCGGCTCGTTTCGAAGCGCCTGAACGATGAAATTGGAGACGACACGGCCGTCATTGGTCTGCATGCGTGGACCGTAGGTATTGAAGATCCGCGCGACCCGGATTTCCACGCCATATTGACGATGATAGTCGAAGAACAGCGTCTCGGCGCACCGCTTGCCTTCGTCATAGCATGCCCGCGGTCCTATGGGATTGACGCTGCCTCGATACTCCTCGGGTTGAGGGTGGACTGCCGGATCGCCATAAACTTCGCTTGTGGATGCCTGGAAGATCTTCGCCTTGGTGCGTTTTGCCAAGCCGAGCATATTGATGGCCCCGTGCACATTGGTCTTCACGGTCTGCACGGGGTCGTGCTGATAGTGGACCGGAGATGCTGGACAGGCGAGGTTGTAAATCTCGTCGACCTCCACATAGAGCGGGAAGGTAATGTCATGGCGGAGCACCTCGAAGCGAGGATCGTCAAGAAGATGCAGCACATTGTCGCGCGAACCGGTGTAGTAATTGTCCACGCAGAGGACGTCATTGCCCTCTCGCAAAAGCCTTTCGCACAGGAATGATCCCAAAAACCCGGTGCCGCCGGTTACCATAATTCGCTTTTGTCCGTGCAT
Protein sequences of DBSCAN-SWA_17 >NZ_CP054034|198489:199533|198489_199533_-|WP_138334192.1|DBSCAN-SWA MHGQKRIMVTGGTGFLGSFLCERLLREGNDVLCVDNYYTGSRDNVLHLLDDPRFEVLRHDITFPLYVEVDEIYNLACPASPVHYQHDPVQTVKTNVHGAINMLGLAKRTKAKIFQASTSEVYGDPAVHPQPEEYRGSVNPIGPRACYDEGKRCAETLFFDYHRQYGVEIRVARIFNTYGPRMQTNDGRVVSNFIVQALRNEPITIFGNGRQTRSFCYVDDLIDGFIRLMGAPAGVTGPINLGNPGEFQVRELAEMVVEMTGSKSGIVFKDLPVDDPTQRKPDISRATQQLGWQPKVNLREGLERTIAYFEWKLSGGARNWPSANSSQEAYPHLASAKVELTTPEAIR |
1 | Tupanvirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_18 |
215439 : 217606
Sequences of DBSCAN-SWA_18
Nucleotide sequences of DBSCAN-SWA_18 >NZ_CP054034|215439:217606|DBSCAN-SWA TATGGAGACCGCATTGAAAAACGAAAATAAGGTTGCGCTGGTTACGGGTATAACTGGCCAGGACGGTGCGTATTTGGCCGAATTGCTCCTGGAGAAGGGCTATATTGTTCACGGCCTCAAGCGACGCTCATCGTCGTTCAACACCAGTCGCATCGAGCATCTATACGAGGATCCCCATGTCGAAAATCCTCGCTTTATTCTGCATTACGGGGACATGACCGATTCAACGAACCTCATCCGCGTCGTTCAGGAAACGCAGCCGGACGAAATCTATAATCTCGCCGCACAGAGCCACGTTCAAGTCTCTTTCGAGACGCCGGAATATACGGCTAACGCCGATGGAACCGGCACCCTTCGGCTCCTTGAAGCAATTCGCCTCCTGGGACTGACCAAGAAGACCCGCTTCTATCAAGCGTCCACCTCAGAGCTCTACGGCAAAGTTCAGGAAGTTCCGCAGAGCGAAACGACACCTTTCTACCCCCGATCCCCCTACGCGGCGGCCAAGCTCTACGCCTATTGGATTGTGGTCAATTATCGCGAGGCATATGGCATGCACGCCTCGAACGGCATTTTGTTCAATCACGAAAGCCCGATCCGCGGCGAAACATTCGTGACGCGCAAAATCACCCGGGCGGCGGCGGCGATCCATCTTGGACTGCAGGAAAGGCTCTATCTCGGCAACTTGGACGCCAAGCGCGACTGGGGACATGCACGAGAATATGTCCGTGGCATGTGGCTGATGCTGCAACAGGACGAACCGGAGGACTATGTCCTTGCGACCGGCGAAACGCACTCGGTTCGTTCCTTTGTCGACAAAGCCTTTGCCCAGGTGGGCATGCCGATTGATTGGCGTGGGAACGGGGTTGAGGAAAAGGGATACGATAAGACATCCGGCAAGTGTGTGGTGGAGATCGATCCAGCTTACTTCCGTCCGACCGAAGTTGATCTTCTGATTGGCGATCCGACGAAGGCGCACACGAAGCTTGGCTGGAAACATGAGACCAATCTTGATCAGCTGGTTGCGGAGATGGTTCGCGAGGATCTGAAACTCATGGCACGAAATGTTCCGTCGGTCGGTGTAACTAAAGGTTTTGCCTATGCCTGAGGTGATTTACAGCCTTGCCGGAAAGAGGGTCTATGTCGCCGGCCACCGCGGAATGGTGGGCTCTGCGATCGTGCGGCGTCTCGCTTCCGAGGGCTGCGAGATTTTGACGGCCACCCGCGCCGAGGTCGATCTCAGACGGCAGGAGCAGGTGGAGGCCTGGATGAGTAAGCATCGTCCCGATGCTGTCTTCCTGGCTGCTGCGAGGGTCGGCGGTATTCTCGCGAATGCTACCTATCCGGCCGACTTCCTTTACGACAACTTGATTCTCCAGGCGAACGTCATCCACGCCGCCCATAGAACTCAAGTCGAAAAACTGATGTTTCTGGGCTCGTCCTGCATCTATCCGAAATTCGCCGATCAGCCGATCGTTGAGGACTCACTTCTGACCGGATCGCTTGAGCCCACGAATGAATGGTATGCGATCGCCAAAATTGCCGGATTAAAGCTCTGCCAAGCCTATCGCAAACAGCACGGTAGAGATTTCATCTCGGCCATGCCTACCAATCTTTACGGTCCAGGCGACAATTTTGACCTCGGGTCAAGCCATGTCATGCCAGCGCTCATACGCAAGACACATGAGGCCAAGGTCAACCAGCAACAAGAGATATGCGTCTGGGGTACCGGCACGCCGCGGCGCGAATTCCTGCATGTTGACGATTGCGCCGACGCCTGCGTCCATCTCATGAAGACCTATTCCGCCGAAAGTCATGTGAACGTAGGTTGTGGCGAAGACATTACCATCCTCGAATTGGCATACCTCGTCTCCGAGATCGTTGGTTTCGAAGGCAAGATCACGCGCGACCTCACCAAGCCAGATGGCACGCCACGTAAACTCCTGAGCGTCGACAAGCTTCGCACTCTCGGCTGGTCTCCTAAGATAGGTCTGAAGGAGGGTATCGCAGATGCCTATCGCTCCTTCCTTGATGGCCATCATCTCGAACGCAGCGACAGAGCTACGTCCGGCGACTTGATCGGTCAAAGCGACATCAGTTTCGAGAAAGCGAAGGGTTCGGCGCCGCATGCGCCCACGCTCTCGACCGTTGCGCATCATCCCTCGCCATAG
Protein sequences of DBSCAN-SWA_18 >NZ_CP054034|215439:217606|216535_217606_+|WP_062941765.1|DBSCAN-SWA MPEVIYSLAGKRVYVAGHRGMVGSAIVRRLASEGCEILTATRAEVDLRRQEQVEAWMSKHRPDAVFLAAARVGGILANATYPADFLYDNLILQANVIHAAHRTQVEKLMFLGSSCIYPKFADQPIVEDSLLTGSLEPTNEWYAIAKIAGLKLCQAYRKQHGRDFISAMPTNLYGPGDNFDLGSSHVMPALIRKTHEAKVNQQQEICVWGTGTPRREFLHVDDCADACVHLMKTYSAESHVNVGCGEDITILELAYLVSEIVGFEGKITRDLTKPDGTPRKLLSVDKLRTLGWSPKIGLKEGIADAYRSFLDGHHLERSDRATSGDLIGQSDISFEKAKGSAPHAPTLSTVAHHPSP >NZ_CP054034|215439:217606|215439_216543_+|WP_138334206.1|DBSCAN-SWA METALKNENKVALVTGITGQDGAYLAELLLEKGYIVHGLKRRSSSFNTSRIEHLYEDPHVENPRFILHYGDMTDSTNLIRVVQETQPDEIYNLAAQSHVQVSFETPEYTANADGTGTLRLLEAIRLLGLTKKTRFYQASTSELYGKVQEVPQSETTPFYPRSPYAAAKLYAYWIVVNYREAYGMHASNGILFNHESPIRGETFVTRKITRAAAAIHLGLQERLYLGNLDAKRDWGHAREYVRGMWLMLQQDEPEDYVLATGETHSVRSFVDKAFAQVGMPIDWRGNGVEEKGYDKTSGKCVVEIDPAYFRPTEVDLLIGDPTKAHTKLGWKHETNLDQLVAEMVREDLKLMARNVPSVGVTKGFAYA |
2 | Acanthocystis_turfacea_Chlorella_virus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_19 |
221303 : 222224
Sequences of DBSCAN-SWA_19
Nucleotide sequences of DBSCAN-SWA_19 >NZ_CP054034|221303:222224|DBSCAN-SWA GTCATTGGGCTAGCAACCAGCCGGGGCCTGGTTTGTTCTCCCGCATGAAGCTGATGAAAGCACTGAGGGCGGCAGGCATCTGCCGTTTGCTTGGGTAAAACAGGTAAATACCCGAGACCCAGGAGCTCCAACCATCGAGAACCCGGACAAGCTCGCCAGTGGCGAGCGCATCCTTTACCAGCACCTTCGGGACCAGACCGATTCCACCTCCGCCACAAATGGCTGAGGCCAGGACACGAAGGTCGTTTGCCACGAAATGCATACCGGTCGCCGGTTCAGCTTTCTCATCGCCGCTCGTCAGTACCCAAGGGCGAACAATTCCGTCCCAGGAGGACCGCAAGCGGATGCACCAATGTTCGCCCAGCTCGGACGGCGTGGCAGGAGCCACCCGATTTCGGAGATAGTCGGGCGATGCCATCAAAACCTCTTCGAATGAATCGACGAGCCGAACGGCGACCATGTCCTTTTCTATCCGCTCTCCAACTTGGATCCCCGCGTCAATTCGGTTTTCAACGAGATCGACGTGAAGATCGTCAACCAGCATTTCAACTTCGATATCCGGATGGGTAGACAAGAACTGGCCGATCAGTGGCGCTGCAAGCAGTGTCGCAGCAGTTCGCGAACCAATGATCCGAAGGCGTCCGCCGGTCTCACTGCGGAAGCGGTTTAGGCCGTCCAGCGCCACATCAAGGCTTTCGAGCAGCGGCTCCAATGATTTCAGGAACTCGTCTCCCGCATCCGTCAGCGAGACGCTGCGGGTGGTTCGTGTCAGAAGGCGGACGCCGAGCTTCTCTTCAAGGCCGCGTAGAGTTTGACTAAAAGTCGGGACCGAGACGCCCACATTCGCGGCAGCCCGCGTGAAGTTGAGCTGGCGTGCCACTTCTACAAAAGCCACCAGCTCCCGCATTTCAGGTCTATGCAT
Protein sequences of DBSCAN-SWA_19 >NZ_CP054034|221303:222224|221303_222224_-|WP_138334211.1|DBSCAN-SWA MHRPEMRELVAFVEVARQLNFTRAAANVGVSVPTFSQTLRGLEEKLGVRLLTRTTRSVSLTDAGDEFLKSLEPLLESLDVALDGLNRFRSETGGRLRIIGSRTAATLLAAPLIGQFLSTHPDIEVEMLVDDLHVDLVENRIDAGIQVGERIEKDMVAVRLVDSFEEVLMASPDYLRNRVAPATPSELGEHWCIRLRSSWDGIVRPWVLTSGDEKAEPATGMHFVANDLRVLASAICGGGGIGLVPKVLVKDALATGELVRVLDGWSSWVSGIYLFYPSKRQMPAALSAFISFMRENKPGPGWLLAQ |
1 | Burkholderia_virus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_20 |
226185 : 227244
Sequences of DBSCAN-SWA_20
Nucleotide sequences of DBSCAN-SWA_20 >NZ_CP054034|226185:227244|DBSCAN-SWA ACTATCGGGAGAGCGCGACGACGATCGCTTGCGAGCGGCTATGCACGCCGAGTTTGCTGAATATCACCGACAGCTGGTTTCGCACCGTCTTTGCACTCTTGCCAAGCCGTTCTGCAATCACGCGGTTGTCTAGCCCCTCCGCGACGAGGTCGAGAAGGGCGGCCTCGGCGGGTGTCAGACCGGCCTCACGAACTGCGCGCGGCTGCGCGGCTCGATCCTGTCCGAGGAAGGCGGCAAGTTCGGCTTGAAACAGGTTCCACGCAGACTCGTCCGGCAAGAGCACATGATTCTTGCTCTCAAGCGGCAAGAAGCGCGCGCCCGGAATCGCGGCGGCAAGCTTGCAACCCTGGTCAAACGGCACGCGCATGTCGCCGCGGCTGTGTGCGATCAACGTCGGGACGCGCAGCATCGCCGCAAGGTCCAGCACATCGATCCCCTGCATCTGCCAAAGAAGTTGTGCGGCGACGTCGGGCGCTGCCGTTTGGCGTTCGAGATCGCCCCACCAGCGATGCTGTTCAGCTGTGCCGTCGGGGATAAAGAGATTGGTGAAGAACCGGCAAAACGCCGGATTATCGCGTCCCCACCCGATGCGAACGAAATTGACAAGGGTCTCGGCTTCCAGCCGCTCGGCCTCGGTCTGGGCTCTGGCGCGGGCACCCTGACCATAGGCGTTCAAGAGCACGAGATGCGATACTCGCTCGGGATACTTGAGGGCGTATGCGATAGCGAGTGCGCCGCCCTGCGAAAGACCGAGGAGGACGAAGCGCGGCTCCTCGATCGACGCTGCCACCGCAGCAAGGTCTGCGTGCCACGCTTCGACCGAAAGGTCAAAGACATGCCGGTCCGACAGACCGCAGCCGCGCGGATCGTAGCGCACAAACCGGTTATGGGCGGACAGCGCCTGGAGCCACGGCCGCCACACCGGGCTTTCGAGATCGTAATCGACATGGCTCAGCCAGTGCGCCGCCCGCAGGATCACCTGGCCCTGCCCGCAAGACGCCATGGCGATACGGGTGCCGTCCTCGGCTGTAGCGAACCGTATGGTCTGACTGAGTTTCAT
Protein sequences of DBSCAN-SWA_20 >NZ_CP054034|226185:227244|226185_227244_-|WP_138334222.1|DBSCAN-SWA MKLSQTIRFATAEDGTRIAMASCGQGQVILRAAHWLSHVDYDLESPVWRPWLQALSAHNRFVRYDPRGCGLSDRHVFDLSVEAWHADLAAVAASIEEPRFVLLGLSQGGALAIAYALKYPERVSHLVLLNAYGQGARARAQTEAERLEAETLVNFVRIGWGRDNPAFCRFFTNLFIPDGTAEQHRWWGDLERQTAAPDVAAQLLWQMQGIDVLDLAAMLRVPTLIAHSRGDMRVPFDQGCKLAAAIPGARFLPLESKNHVLLPDESAWNLFQAELAAFLGQDRAAQPRAVREAGLTPAEAALLDLVAEGLDNRVIAERLGKSAKTVRNQLSVIFSKLGVHSRSQAIVVALSR |
1 | Mycobacterium_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_21 |
247570 : 252297
Sequences of DBSCAN-SWA_21
Nucleotide sequences of DBSCAN-SWA_21 >NZ_CP054034|247570:252297|DBSCAN-SWA CATGGATATGACGCGCATCCGAGATCGCATGGTGGAGCACCACGTTACGCGTCGTGGCATTCACGATCAAAGCGTCATCGAAGCGATGCGCACGGTGCCGCGCGAAAAATTCGTTCCTCCAGGCTCCGAGGAATTCGCCTACGAGGATGCGCCGCTGTCGATCGGCGAGGGCCAGACAATTTCGCAGCCGTTCATCGTGGCTCTGATGCTCGAAAAGGCCAGTCTGAATGCCGGTGACAAAGTGCTCGAGGTCGGAACGGGCTCCGGCTATGCTTCGGCCCTCATCAGCCGGATAGCAAGGCATGTCTACTCCATCGAGCGACATGAAAAGCTGGCGCTTCAGGCCAGGGAACGGTTCGAGAAGCTCGGATACCGCAACATTGACGTTCGCGTCGGGGATGGCAGCAAGGGCTGGGCGAAAGCCGCTCCGTTCGATGCCATTATTGTTTCCGCGGGCGCACCCGAGGTGCCAACCGCCTTGATAGAACAGCTGAACCTCGGAGGGCGGCTCATCATCCCGGTCGGCGGTAGTGAAGGGCAGCGCCTGAAACGGCTCACGCGCACCAGCGCCGCCGCATTCGAGGAAGAAGACCTCGGCGGCGTGCGCTTCGTTCCCTTGATCGGTGAAGACGCCTGGACTGCTACCCATCCGATGTATACGGCTACGGCCCCCTCATTGCCGAAAACGGTTGCCTCCGAGCCAAGCTCTCCGCAAGACGCACAGGGAGGGGGCATGGGAAAGATAGACAAATTGCCCGCGATGCCGCATCAGCATCATCAAGAAGCAGAAATCCGGCTGACAATTGACGACGAAGGAGACACCTCGAGGGCATGAACTCGCTGCCATTTTTGAGTTGATAGCCGCCTGGGAATGCGCGTCAGAAGGATAGGTCGAGCACCCTGCCCTCGAACAAACGGTCCCGGGAGCCTTCGAGATGCGCCTTCATCAGTCTTTGCGCATCGTCCGGCCGGCCATCGGCGATCGCTTTGTAGATGGCGTTATGCTCTTCGTAGACCTGTTCCAACCCCTGGCGGGGGCCGAGCAAAGAGAGGCCGTGGAGATGCATGCCGACGGCGATATGGGCCTTCAGCGCATCGATCGAGGCCGTGTAATAATGATTGTTGGCGGCCTCCGTCACCGCGCGATGGAAGAGAAAATCGGCGTCGGTGCGGTGAAGTTGATGGCTGGTCGCCTCGCGCAGATCCGCAAGCGCGCTGGCGATCTTCTGAATGGCCTGCTCATTGCGGCGTTTGGCGGCGAAATAGGCGGCCGCCGGCTCGATCGTCAGGCGGAATTCGTAGCAGCGCTGAATGTCGGCGATCGTCTTGACCGGCGAATAGGCAAGCTGCGTCGTCGGCGATCCGTGCGCGCTGACGAAGGTGCCGGCGCCTTGCCGGGCATAGACCATGCCCTGATCGCGCAGGCGCGCCAGCGCATCGCGGACGATCGGCCGCGACACACCCAGCATCGAGGCAAGCTCATGCTCGCCGGGAAGGCGTGAATCGGGCGGATAGTTTCCGCCGCGGATACGCTCGAGCAACTGGTCGAAGACGCGATCGACCAGCTTGACGGCCTTGCCGCGTTCGGGCTTGCCCGGTGGGCCGGCTTCCGCGTTCAAAGATTCGCCATTCACTTCAGCCACGTCATTCTCCCCGGCGGTATCTTACTATGCCGCCCGATTCCAAAGAACCAGTCTTAGCAAACTATCGGGCTTGCCGAAGCCCCCGGACTTCACGGCGCACCGAAAATGCCGCCCGTTGGCTGCCGTGACATCGAACCACGGCACGCCGGCCTCGATTTCACCCTTCGGCGCCAGCACTCGGACCCCAAGCGCCTGGAAAACGGCAAGCGCCGTATCGCCGCCGCCGACCATCAGCATGTCAGGCTTCGTATCGTCGATGACCGATCTCACGCCTGCCGCAAAACGGCGGGCGACCAATGCCACATCGGCTGCCATATCGCCGGTGCAGCGCAACAGCGCCGGCAGCGCCATGCCCTCCCCGCCTTCGACCTGCCCCATCGGCGCGTCCACCACCATGCGCAGAACCCCCGAGGCTTCGAGCCGCTCCATCTGGGTAGCGGTAATCGGATCGCGCGAGCCAAAGGCGAACAGGGTGCGTAGGGTCGCTACAAATTCCGGCACCGGCTGCCGACCTGTTTCGCCGAGCTGCCGGGCAAGAGCAGCACCCAGACCCCGTGCGCCGACCGCAAGTGCCAGCCGCCAGTCCTGATCGGCAACGATCTGGTCGAGATCACTGTCATCCTCCGCATCGGCAACGACAACGCTCTTCGCGCGGCCCTCGAACAGATCGGCGATCGGCAGCGGCTTGTCGACGCCGCGGCCGACGACGCAGCCCCGATAGGTCACGCGCTCCTGATCGGGAACGGCAGGCGCCACGACGATGGTCTCCAGGCCGAGCGCCTCCGCCAGCGCCAGGCTTTCCACTGCTACATTGCCCTTCAGGCGGGAGTCGATCTTCTTCATCACGACTGCGGGCTTCACGCCGCAAAGCGCCTCGGTCGCCAAGCGCACCCTTTCGGCAGCTTCGCGCTCGCCGCGCGCGCGCGAGGCGGTATTGATAACGACAACGTCACAGCCGGTGGCGATCGCATCCTCTGCGGCCTCGACATCGACGGCGACCGCAACCGAAAGGCCGGCTTCGACGAAGGGCGTACCGGTATCGAGCGCGCCCGTCAGATCATCGGCAATGATGGCGGCCTTCAGCGTCATGCGGTCTGTCCGGTGTTGACAGCATGGCAGGCGACGAGATGGCCGGGTTTTGCTTCCTGCCAGGCGGGGATGACCTTGCTGCAGACATCCATGGCAATCGGGCAGCGCGTGTGAAAGCGGCAGCCGGTCGGCGGGTTCAGCGGGCTCGGCACATCACCCTGAAGAATCTGGCGCTGGCGGGTGCGCTCATGTTCCGGATCGGTCTCCGGGACGGCAGATAGCAGTGCCTTGGTATAAGGGTGCAGCGGATTGTCGAAGAGTTCCTCGCGGGTGGCAAGTTCGGCCAGCCGCCCGAGATACATGACGCCGACACGGGTGCTGATATGGCGCACGACTGCGAGATCATGGGCGATAAAGAGATAGGTCAGCCCATATTTTTCCTGCAGATCGAGCAGCAGGTTGATGATCTGCGCCTGGATCGACACGTCGAGCGCCGAAATCGCCTCGTCGCAAACAATGAATTTCGGCTGGCAGGCGAGCGCGCGGGCAATGACGACGCGCTGGCGCTGGCCGCCGGAAAGCTCGTGCGGGTAACGCTGGGCAAACCGTGTTGGCAGGCCGACATCGGTGAGCAGCGTCGCGACCCTATCCTTCAGTTCGGCTTTGGTGCAGAGCTTGTGGAAGAGGATTGGCTCGCCGATCGCCTCGCCCACCGTCATGCGCGGATTGAGTGTCGAATACGGATCCTGGAAGACGATCTGCAGGTCGCGGCGGAAGGGCCGCATCTGTTTTTCGTCGAGTGTCGCGAGATCCTGGCCCTTGTAAACGACCTTGCCGCTGGTCGCCTTGTAGAGGTTGAGGATGGTGAAACCGGTCGTCGACTTGCCGCAGCCGGATTCGCCGACCAGGCTCAGCGTCTCGCCTTCCATGATGTCGAGGTTGACATTGTCGAGTGCATAGACGGTTGCCGAGCGTTCGCCGAAGGCGCCGAGCTTGACGTGGAAATGCTTGACCAGGTTTTCGATCTTGAGCAGCGGCTGAGCACCCATCACGCGGCCTCCTGCATTCTCTGGACGACGAAACAGGCGGCGCGGTGATTGCTGCTGCCGCTCAACGGTTCGAGCTGCGGCACCTTCTCGCGGCAGATGTTCTCGACAAGCGGGCAGCGCGGCGCAAATGGGCAACCCTTCGGCCGGCGGCCGGGCTCCGGCGGCGTGCCGCCGATCGAGCTCAGCCGGCTTGCCGGCGGGTCCGAGAGCTTCGGGATCGATGACAGCAGGCCGCGCGTATAGGGATGGCTCGGGCGGGCATAGAGCTCGTCGACCGGCGCATCTTCGACCACCGTTCCGGCATAGAGCACGGCGACACGATCGACGAGGCCGGCGATCAGCGCCAGATCATGGGTGATCCAGACAACTGACATGCCGAGCTTAGTCCTGAGGTCCTTCACCAGATCGACGATCTGGGCCTGGATGGTGACGTCGAGCGCGGTCGTCGGCTCGTCGGCAATCAAAAGCTTCGGATTGCAGGCAAGGCCGATTGCGATCATCACGCGCTGGCGCATACCGCCGGAAAGCTCGTGCGGATAGGCTTGCAGGCGCTCTTCCGGGCCGGGAATGCCGACAAGCCGCAGAAGTTCGACGGCGCGGGCGCGGGCCGCCGCCTTTTTCATTCCGCGATGATAGACCAGCGGCTCGCAGATCTGGTCGCCGACGCGCATCACCGGGTTGAGCGAGGTCATCGGATCCTGGAAGACGAAGCCGATATCGCCGCCGCGCACCTTGCGCAGCTCTCTATTCGACATCGTCTGCAGGTCGCGGCCGTCGAATGTCGCCGATCCCTTGGTGACCTTGATCTTGTTCGGCAGGAGCCGCATCAGGCTCAGCATGGTCAGGCTCTTGCCACAGCCGGATTCGCCGACGACGCCTAGGGTCTCGCCCCTGTCGACATGGAGATCGATGCCGTCGACGACGATGGCGGGGCCGTTGCGGCCGTCGATTTCGACGGTGAGGCCCCTGACATCGAGCAGGCGGGTGGTGCTTTTCGTGTCCATCAT
Protein sequences of DBSCAN-SWA_21 >NZ_CP054034|247570:252297|249236_250301_-|WP_138334250.1|DBSCAN-SWA MTLKAAIIADDLTGALDTGTPFVEAGLSVAVAVDVEAAEDAIATGCDVVVINTASRARGEREAAERVRLATEALCGVKPAVVMKKIDSRLKGNVAVESLALAEALGLETIVVAPAVPDQERVTYRGCVVGRGVDKPLPIADLFEGRAKSVVVADAEDDSDLDQIVADQDWRLALAVGARGLGAALARQLGETGRQPVPEFVATLRTLFAFGSRDPITATQMERLEASGVLRMVVDAPMGQVEGGEGMALPALLRCTGDMAADVALVARRFAAGVRSVIDDTKPDMLMVGGGDTALAVFQALGVRVLAPKGEIEAGVPWFDVTAANGRHFRCAVKSGGFGKPDSLLRLVLWNRAA >NZ_CP054034|247570:252297|247570_248404_+|WP_138334249.1|DBSCAN-SWA MDMTRIRDRMVEHHVTRRGIHDQSVIEAMRTVPREKFVPPGSEEFAYEDAPLSIGEGQTISQPFIVALMLEKASLNAGDKVLEVGTGSGYASALISRIARHVYSIERHEKLALQARERFEKLGYRNIDVRVGDGSKGWAKAAPFDAIIVSAGAPEVPTALIEQLNLGGRLIIPVGGSEGQRLKRLTRTSAAAFEEEDLGGVRFVPLIGEDAWTATHPMYTATAPSLPKTVASEPSSPQDAQGGGMGKIDKLPAMPHQHHQEAEIRLTIDDEGDTSRA >NZ_CP054034|247570:252297|248447_249212_-|WP_003549481.1|DBSCAN-SWA MAEVNGESLNAEAGPPGKPERGKAVKLVDRVFDQLLERIRGGNYPPDSRLPGEHELASMLGVSRPIVRDALARLRDQGMVYARQGAGTFVSAHGSPTTQLAYSPVKTIADIQRCYEFRLTIEPAAAYFAAKRRNEQAIQKIASALADLREATSHQLHRTDADFLFHRAVTEAANNHYYTASIDALKAHIAVGMHLHGLSLLGPRQGLEQVYEEHNAIYKAIADGRPDDAQRLMKAHLEGSRDRLFEGRVLDLSF >NZ_CP054034|247570:252297|250297_251290_-|WP_026236973.1|DBSCAN-SWA MGAQPLLKIENLVKHFHVKLGAFGERSATVYALDNVNLDIMEGETLSLVGESGCGKSTTGFTILNLYKATSGKVVYKGQDLATLDEKQMRPFRRDLQIVFQDPYSTLNPRMTVGEAIGEPILFHKLCTKAELKDRVATLLTDVGLPTRFAQRYPHELSGGQRQRVVIARALACQPKFIVCDEAISALDVSIQAQIINLLLDLQEKYGLTYLFIAHDLAVVRHISTRVGVMYLGRLAELATREELFDNPLHPYTKALLSAVPETDPEHERTRQRQILQGDVPSPLNPPTGCRFHTRCPIAMDVCSKVIPAWQEAKPGHLVACHAVNTGQTA >NZ_CP054034|247570:252297|251289_252297_-|WP_138334252.1|DBSCAN-SWA MMDTKSTTRLLDVRGLTVEIDGRNGPAIVVDGIDLHVDRGETLGVVGESGCGKSLTMLSLMRLLPNKIKVTKGSATFDGRDLQTMSNRELRKVRGGDIGFVFQDPMTSLNPVMRVGDQICEPLVYHRGMKKAAARARAVELLRLVGIPGPEERLQAYPHELSGGMRQRVMIAIGLACNPKLLIADEPTTALDVTIQAQIVDLVKDLRTKLGMSVVWITHDLALIAGLVDRVAVLYAGTVVEDAPVDELYARPSHPYTRGLLSSIPKLSDPPASRLSSIGGTPPEPGRRPKGCPFAPRCPLVENICREKVPQLEPLSGSSNHRAACFVVQRMQEAA |
5 | uncultured_Mediterranean_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_22 |
264735 : 265524
Sequences of DBSCAN-SWA_22
Nucleotide sequences of DBSCAN-SWA_22 >NZ_CP054034|264735:265524|DBSCAN-SWA GTCATGCAGCCGAACTCCCCTGCTGCAACAATTGGCGCAGATGGGCGGCGGTCTCCTGGACATCGAGACTCTCGCGCCGACAGAGCCCACGCTCGTCTTTCAGCGGTGCAAGGTCGATCAGCGCCTGGACGCGGCCCGGATTGGCCGCCAGGACCAGCACGCGTCCGCCGAGATAGATCGCCTCGACGACGCTGTGGGTGACGAACAGGATGGTCGTGCCGGTCCGGCGCCAGACGTCCAGAAGCTCGTCGTTCAGTCGTTCGCGGGTGATCTCGTCGAGCGCACCGAAGGGCTCGTCCATGAGCAGGATATCCGGCTCGCATTGAAGCGCGCGCGCAATGGCGACACGCTGGCGCTGGCCCCCGGACAATTGATGCGGATAGCGGTCGGCCAGATGCGACAGGCCGACCAGTTCGATCCAGTGGTCGGGCCGCGGATCGACCGATCGCGAGACGCTGTTCTTCCCCACCTGCAGCGGCAGCGCGACATTCTCCCTCACAGTCCGCCACGGCAGCAGCGTTGCGTCCTGAAAGACGAAGGCGACCTCGCGCCGCCGGCGCGCCTCCGAGGCTGTCCTGCCGAGCACCGACAGCCGCCCGTCGAGCGGCGGGAGAAGATCCGCCACCGCACGCAGCAAGGTCGACTTGCCACAGCCGGAAGGACCGAGAATGGTCAGAAACTCGCCCCTGCCGACGCTGAGATCGACGCCGGACAGAATGCGGGTCGTCTCGGCGGTGCCGGCATAGCCGACAGCCAGATCCTTGGTTTCGATGGCCGCTGCTTCAGGCAA
Protein sequences of DBSCAN-SWA_22 >NZ_CP054034|264735:265524|264735_265524_-|WP_138334271.1|DBSCAN-SWA MPEAAAIETKDLAVGYAGTAETTRILSGVDLSVGRGEFLTILGPSGCGKSTLLRAVADLLPPLDGRLSVLGRTASEARRRREVAFVFQDATLLPWRTVRENVALPLQVGKNSVSRSVDPRPDHWIELVGLSHLADRYPHQLSGGQRQRVAIARALQCEPDILLMDEPFGALDEITRERLNDELLDVWRRTGTTILFVTHSVVEAIYLGGRVLVLAANPGRVQALIDLAPLKDERGLCRRESLDVQETAAHLRQLLQQGSSAA |
1 | Planktothrix_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_23 |
269543 : 271094
Sequences of DBSCAN-SWA_23
Nucleotide sequences of DBSCAN-SWA_23 >NZ_CP054034|269543:271094|DBSCAN-SWA CATGACGGCCATTTCGTTGAAGCAGCCGGTGACGGCACGCGACGACATCATCCGCTTTGAAGGCATCGTCAAGCATTTCGGCGGCGCGCAGGCGCTTGCCGGTGCGTCGCTGATCGTCAGGCGCGGCACCATCCACGGTCTCGTCGGTCAGAACGGCGCCGGCAAGTCGACGCTGATCAAGCTGCTGGCAGGTCTTCACCAGCCGGATGGCGGCCGGATCGAGATCGAAGGGCAAACATTCGACAGGCTGACGCCGCATCTTGCCGAAGAGCTCGGCATCCACTTCATTCACCAGGACCGGCTGCTGGTGCCGACCTTCACCGTCGGCGAAGCCCTGTTCCTTGGCCGGGAACCACGCATCGCGGGCACACCGTTCCTCGACCGGCGACTGATGCAGCGCCGCGCATCTGACATTCTCAACGACTATTTCGGCATCCGGTTGCCGAACGCCGCCTTGATCGGCGAATTGTCGACCGCCGAAAAGCAGATCGTGCAAATCACCCGCGCCCTCCTCAATCAGCCAAAGGTGCTCGTCTTCGACGAGCCGACGGCCGCATTGGTGCGCCGCGAGGCCGATATCCTCTTCCGGCTGATCCGCCGTCTGCGCGACGAAGGCGTCACCATCATCTATATCTCGCATTACCTGAACGAAATCGAAGAGCTCTGCGACCACGTCACCGTGCTGCGCAACGGGCTGGACGTCGCCTCCCTGCCGATCGGCGACACCTCGGCCGGCGCGATTGCGCGGCTGATGGTCGAGCGCGACATCAAGGAGATGTTTCCAAAGCCGGAAGTGACGCCGGGCGAAGACATCCTCAAGGTCGAGCAGCTGTCGGCGCCGGGGAAATATAGCGACGTGAGTTTCACGCTTCGCCGCGGCGAGGTGCTCGGCCTCACCGGCCTGCTCGGCTCCGGCGCAAAGGAGCTGGTCCGCGCTCTCTTCGGTCTGGAGACCCCGGCTTCCGGCCATATCGAGATCAGCGGCAAGGCTGCCCGTTTCACCAATCCCACCCAGGCAGCCGGCCGCGAGATCGCCCTGGTGCCGGAAGATCGGCGTCGCCACGGCGTCGCGCTCGACCTCAGCGTCGCGGAAAATATCAGCCTTTCGAGCCTTGGCCGCTTCACGCGTTTCGGCTTTCTTGATCGCAGACGCGAACAGAGAGAAGTCGACGGTCTGATTACGCGGCTGCAGGTGAAGACCAATGGCCGCGATGCCCTGCTGCGCACGCTTTCGGGCGGCAACCAGCAGAAGATCGCCATCGCCAAGTGGCTGAGCCGCCGCTCCGAGGTCTATCTCCTCGACGAACCAACTGTCGGCGTCGATATCGGCTCCAAGGTCGAGATCTATACGCTGATCGGCGAACTGGCGGCGCGCGGCGCCGGCGTCATCGTGCTGTCTTCGGATCTGCCCGAACTGATCGGCATCACCGATCGCATCCTGGTGCTCTTCCGCGGCCGCGTGGCGCGGGAATTCATCTCGTCGGAGACCACGGCAGATGCGGTGCTCGCTCAATCGACCGGATCATCCGAAGGACAACGCCATGTCGGCTGA
Protein sequences of DBSCAN-SWA_23 >NZ_CP054034|269543:271094|269543_271094_+|WP_138334276.1|DBSCAN-SWA MTAISLKQPVTARDDIIRFEGIVKHFGGAQALAGASLIVRRGTIHGLVGQNGAGKSTLIKLLAGLHQPDGGRIEIEGQTFDRLTPHLAEELGIHFIHQDRLLVPTFTVGEALFLGREPRIAGTPFLDRRLMQRRASDILNDYFGIRLPNAALIGELSTAEKQIVQITRALLNQPKVLVFDEPTAALVRREADILFRLIRRLRDEGVTIIYISHYLNEIEELCDHVTVLRNGLDVASLPIGDTSAGAIARLMVERDIKEMFPKPEVTPGEDILKVEQLSAPGKYSDVSFTLRRGEVLGLTGLLGSGAKELVRALFGLETPASGHIEISGKAARFTNPTQAAGREIALVPEDRRRHGVALDLSVAENISLSSLGRFTRFGFLDRRREQREVDGLITRLQVKTNGRDALLRTLSGGNQQKIAIAKWLSRRSEVYLLDEPTVGVDIGSKVEIYTLIGELAARGAGVIVLSSDLPELIGITDRILVLFRGRVAREFISSETTADAVLAQSTGSSEGQRHVG |
1 | Acanthocystis_turfacea_Chlorella_virus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_24 |
278601 : 283208
Sequences of DBSCAN-SWA_24
Nucleotide sequences of DBSCAN-SWA_24 >NZ_CP054034|278601:283208|DBSCAN-SWA CTCAGGCCTCCCTGCCGACACGCCAACTGTCGGCGTAGATGCTGATGGCGGACGGATCGAAGGTGACGCGGGCATCCGCGGGAATGTCGGCGTCTTCAGGCACGACGATGGCGATCGGCTGGCCGGCAAAGCGGGCGCGCACGATCTTCTGCCGGCCGATATCCTCGACCTTGCTGACGGTGACCGGCATGCCCTCCCGTCCCAGGCGAATGAATTCGGGGCGGATGCCGAGTTCAATTTTGACCGTGCCTGATGTCTTCGGCGCGTAATCGAGCGTCAGCGTCTCGTCTCCGACCTTGACCGCGCTGCCCTCGATTGTCGCTGGCATGAAATTCATACCCGGCGAACCGATGAAGTAACCGACGAAGGTGTGGCTCGGCCGCTCGAAGAGCTCGGCCGGCGTGCCGATCTGGACGATCTGGCCGTCATACATGACGACCACCTTTTCGGCGAAGGTCAGTGCTTCCGTCTGGTCGTGGGTGACATAGACCATGGTGAAGCCGAACTGCTTGTGCAGCCGCTTCAGCTGCGAGCGCAGCACCCATTTCATATGCGGATCGATGACGGTCAGCGGCTCGTCGAACAGGATGGCGTTGACGTCGTTGCGCACCAGTCCGCGGCCGAGCGAGATCTTCTGCTTCTGGTCGGCCGTCAGGCCCTGCGCCTTGCGCCTGGCCCAGCCGGCAAGATCGATCATTTCGAGGATGTCGCGGACACGCCGGTCGACATCCGCCTCGGCCACGCCGCGGTTGCGCAGAGGAAAGGCGAGATTGTCGTACACGGTCATCGTGTCGTAGATGACGGGAAACTGGAAGACCTGGGCGATGTTACGGCTCTGCGTCGAAAGGTTCGTGACATCGTTGCCGTCGAACAGGATGCGGCCATGTGAGGGCTGCAGCAGGCCGGAGATGATGTTGAGCAGCGTGGTCTTGCCACAGCCGGAAGGACCGAGCAGCGCATAGGCGCCGCCATCGTTCCATTCGTGGTCGACTTCCTTCAGGGAATAATCCTTGTCGGACTTCGGATTGGCGCCGTAGGCGTGGCGGATATGGTCGAGCGTGATGCGTGCCATTGTGCTCTCCTATACCTTTCCGGCCGCAGCGATGGCGCGGCCGTCGCTGCCGAAGGCCATCAGGTGGCGGGTGTCGAGAAAAGCTTCGACCTCCATGTCGGGATCGATGTCGTGAATGCCATGCGCCAGCATCACCCAGCGCACACCGTCATATTCCAGGTGAACGAAGCTCTCGGAGCCGGTGATCTCGGAAACCAGCGTGCGCGCCTGCAGGCGGGCCGCATCGCCGGTCTGCGGCGCAAGGCCGAGATGATGCGGGTGGAAAGCGATGGTGACGGGACCATCTGGCACGACGGCCAGATGCGAGGGAACCGAGATCGTGACCCCGCTCGGACGCGTGAACACATTGCCGGATTTCGTGACGTCGAGGGTGTTCAGCGGCGGATCGGCGAAAATGCCGGCGGTGGCGAGATTGACGGGTCGGCGATAAACCTCGATCGTCGGCCCGAACTGGGTCACCCGGCCCTGGTTCAGCGTTGCCGTGTTGCCGCCAAGCAGGAGCGCCTCGGAAGGCTCGGTCGTCGCATAGACGAAGATCGCGCCGGATTGGGCAAAGATCTTCGGCAGTTCCTCGCGCAGTTCCTCGCGCAGCTTGTAATCGAGATTGGCGAGCGGCTCGTCCATCAGCACGAGGCTGGCATTCTTGACGAGTGCGCGCGCGAGCGCCGTGCGCTGCTGCTGGCCGCCGGACAGATTGAGCGGGGTGCGGTCGAGATAGGGCGTGAGCTTCAGCAGCTCGGCCGCCTTGCGCACCTCACGGTCGATCGTGGCGGCATCCTTGCCCGATATACGCATCGGCGAGGCGATGTTGTTGTAGACGGTCAGTGCCGGATAGTTGATGAATTGCTGGTAGACCATGGCGACGTTGCGCTTCTGCACCGGCATGCCGGTGACGTCAGTGCCGTCGAAATGGATCGAGCCGCCGGTCGGCCGGTCAAGCCCCGCCATCAGCCGCATCAGAGAGGTCTTACCCGCGAGCGTCGGCCCGAGCAGGACGTTCAGCGTGCCTCGCTCGAGAACCAGATCGGTCGGATAGATATGATAGTCCGCGCCCACCATCTTGGCAGCGTTTCGCAGTTCCAGCATTCCGATTCCCGTGCCTCCACGCATCGGGGACCACATGACTCCTACCCGGCCATCGCCGGCATGGCAGCCATGTACTCCTCCAGTGACTGGGCCTGCTCCTTCGAAAGGCGCAGACCATCCTTTGTTCTCCTCCACAGCACGTCTTCGGCGTGCCGGGCCCATTCATGTTCGACGAGATAGGTGACCTCTGCCTCGTAGAGATCGCCGCCGAACAGCCGTCCGAGATCGTCGATGCCGCTGGCATCGCCAAGCAGCGCCTCGGCCCTGGTGCCATAGCGGCGCACCAGCCGGCGGGCATGCCGATCGGCGAGGAACGAATAACGCCGCTTCAGCTTGGCGACCTCCGCTTCGTAACCCCTGACCGCGAAGTCGCCGCCCGGCAGATGGCTTTTGGCCGTCCACGGGGCGCCTTTGGCGCCGATCGAAGCACCGATCTTCTCCAGCGCATGTTCGGAAAGCCGGCGATAGGTGGTGAGCTTGCCGCCGAAGACGTTGAGCAGAGGCGCAGCGGCACCTTCGCCTTCCAGCTTCAGCACATAGTCACGCGTCGCCTCCTGCGCCTTCGAGGCGCCATCATCATAAAGTGGCCTGACCGCAGAATAGGTCCAGACGATATCCTCCGGCCTGACCGGCTCCTTGAAATATTCAGACGCTGCATTGCAGAGATAGATGGTCTCCTCCTCGGAGATCCGGACATCCCGGGGATCGGCGGTATAGTCGCGATCCGTGGTGCCGATCAGGGTGAAGTCGCCCTCGTAGGGGATGGCGAAGATGATGCGGTTATCGGGATTCTGGAAGAAATAGGCTCTCACATCGTCGAATTTCTTCTTCACGACGATATGGCTGCCCTGGACGAGGCGGACATGATGGGCTTCGTTCTGGCCGATGGCGGAACGAATGACATGGTCGACCCAGGGACCGGCGGCATTGACCAACATGCGGGCTTGATGGTTGTCATTGCGACCGGTGAGGGTATCGGTGGTCTCGATATGCCACAGGCCATTCTCGCGCCTTGCCGACACCACCTTGGTGCGCGGCATGATCAGCGCGCCCTTGTCGGCGGCATCGCGGGCGTTCAGCACCACCATGCGGGCATCGTCGACCCAGCCGTCGGAATATTCGAAGGCCTTGGAAAACAGCGCCTTCAGCGGCTTGCCGGCCGGATCGCGGCGCATGTCGAGCACGGCTGTCGGCGGCAGCAGCTTGCGACCGCCGAGATGATCGTAAAGGAAGAGGCCAAGCCGGATCAGCCAGGCCGGACGGATACCGCCCTTGTGGTAGGGCAGTACGAAACGCAGCGGCCAGATGATGTGCGGCGCCATCGCCCAGAGGATCTCGCGCTCCATCAGCGATTCACGCACTAGGCGGAACTCGTAATGCTCGAGATAACGCAGGCCGCCATGGATCAGCTTGGTGGCGCCCGACGAGGTACCCGAGGCGAAATCGTTCATTTCCGCCAGCGCCACCGAATAACCGCGCCCGACGGCGTCGCGCGCAATGCCGCAGCCGTTGATGCCGCCACCGATGACGAATATGTCGTGAATATTTCGGCCCAATTCCCCCTCCGAGCCCGCCGTCGATTTCGCATCGCACAAAACTTGCGCAATTGCGAAAGTAACAACAGTTAAAACGAAAGGAATACGAATGTCAAACGAATGTTTGCGAGGGACCGCTGTCTAGCGAAATCTGCCGTTCGTTTCGATCAGTTTCACATTGTTTTCGAGGCAGAGATTGCGGATCGACGGCACCGGACAGTGATCGGTGATGAAGGTATGAACCTGGGAAAGCTGGCCGATCCTGACTGGTGCGGTGCGTTCGAACTTCGTCGAATCGGCAACGAGAATGACATGTCGGGCATTGGCGATGATGGCCTGGGCGACCTTCACTTCGCGAAAATCATAGTCGAGAAGCGCGCCGTCGATATCGATCGCCGACGCGCCGATCACGGCGTAGTCGACCTTGAACTGGCGGATGAAATCGACGGCCGCCTCGCCGACAATCCCGCCGTCCGAACCCCGCACGACGCCGCCGGCAATGACGACCTCGATCGCCGGGAAAAGACGTAACCTATTGGCAACATTGATATTATTGGTGATGACCATCAGCTCGTGATGGTCGGCGAGCGCCTCGCCCACCGCCTCGGTCGTCGTTCCGATATTGATGAAGAGCGAGGCCCCGCTCGGGATCAGCTCGACCGCGGCAATGCCGATCGCCTGTTTCTCGGAAGCGGCGATCTGGCGGCGTGCTTCGTATTTGACGTTCTCCGTCCCGCTCGGGAATGTGGCGCCGCCATGGATGCGTGTCAGCACCTGCGCATCACAGAGATCGTTGAGGTCCTTGCGGATCGTCTGCGGCGTCACCGAAAAGCGGGAGGCGAGCTCCTCGACCAGCACCCTGCCGCTCGACTTCGCAATCGCCACGATTTCAGTTTGCCGGTCGGTCAAGAACAT
Protein sequences of DBSCAN-SWA_24 >NZ_CP054034|278601:283208|278601_279672_-|WP_138334282.1|DBSCAN-SWA MARITLDHIRHAYGANPKSDKDYSLKEVDHEWNDGGAYALLGPSGCGKTTLLNIISGLLQPSHGRILFDGNDVTNLSTQSRNIAQVFQFPVIYDTMTVYDNLAFPLRNRGVAEADVDRRVRDILEMIDLAGWARRKAQGLTADQKQKISLGRGLVRNDVNAILFDEPLTVIDPHMKWVLRSQLKRLHKQFGFTMVYVTHDQTEALTFAEKVVVMYDGQIVQIGTPAELFERPSHTFVGYFIGSPGMNFMPATIEGSAVKVGDETLTLDYAPKTSGTVKIELGIRPEFIRLGREGMPVTVSKVEDIGRQKIVRARFAGQPIAIVVPEDADIPADARVTFDPSAISIYADSWRVGREA >NZ_CP054034|278601:283208|279681_280758_-|WP_138334283.1|DBSCAN-SWA MLELRNAAKMVGADYHIYPTDLVLERGTLNVLLGPTLAGKTSLMRLMAGLDRPTGGSIHFDGTDVTGMPVQKRNVAMVYQQFINYPALTVYNNIASPMRISGKDAATIDREVRKAAELLKLTPYLDRTPLNLSGGQQQRTALARALVKNASLVLMDEPLANLDYKLREELREELPKIFAQSGAIFVYATTEPSEALLLGGNTATLNQGRVTQFGPTIEVYRRPVNLATAGIFADPPLNTLDVTKSGNVFTRPSGVTISVPSHLAVVPDGPVTIAFHPHHLGLAPQTGDAARLQARTLVSEITGSESFVHLEYDGVRWVMLAHGIHDIDPDMEVEAFLDTRHLMAFGSDGRAIAAAGKV >NZ_CP054034|278601:283208|280799_282314_-|WP_138334284.1|DBSCAN-SWA MGRNIHDIFVIGGGINGCGIARDAVGRGYSVALAEMNDFASGTSSGATKLIHGGLRYLEHYEFRLVRESLMEREILWAMAPHIIWPLRFVLPYHKGGIRPAWLIRLGLFLYDHLGGRKLLPPTAVLDMRRDPAGKPLKALFSKAFEYSDGWVDDARMVVLNARDAADKGALIMPRTKVVSARRENGLWHIETTDTLTGRNDNHQARMLVNAAGPWVDHVIRSAIGQNEAHHVRLVQGSHIVVKKKFDDVRAYFFQNPDNRIIFAIPYEGDFTLIGTTDRDYTADPRDVRISEEETIYLCNAASEYFKEPVRPEDIVWTYSAVRPLYDDGASKAQEATRDYVLKLEGEGAAAPLLNVFGGKLTTYRRLSEHALEKIGASIGAKGAPWTAKSHLPGGDFAVRGYEAEVAKLKRRYSFLADRHARRLVRRYGTRAEALLGDASGIDDLGRLFGGDLYEAEVTYLVEHEWARHAEDVLWRRTKDGLRLSKEQAQSLEEYMAAMPAMAG >NZ_CP054034|278601:283208|282434_283208_-|WP_003549541.1|DBSCAN-SWA MFLTDRQTEIVAIAKSSGRVLVEELASRFSVTPQTIRKDLNDLCDAQVLTRIHGGATFPSGTENVKYEARRQIAASEKQAIGIAAVELIPSGASLFINIGTTTEAVGEALADHHELMVITNNINVANRLRLFPAIEVVIAGGVVRGSDGGIVGEAAVDFIRQFKVDYAVIGASAIDIDGALLDYDFREVKVAQAIIANARHVILVADSTKFERTAPVRIGQLSQVHTFITDHCPVPSIRNLCLENNVKLIETNGRFR |
4 | Bacillus_virus(66.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|