Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP029562 | Mesorhizobium sp. Pch-S chromosome, complete genome | 5 crisprs | csa3,cas3,DEDDh,WYL | 0 | 1 | 3 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP029562_3 | 1497623-1497733 | Orphan |
NA
Consensus repeat of CP029562_3
|
1 spacer
spacers of CP029562_3
>3.1|1497653|51|CP029562|CRISPRCasFinder CCTGGTTCGCCACAGGCTTAAAAGGTACGAATGCCCGAGGCCATAGGGCGC |
CRISPR arrays and Neighbor proteins around CP029562_3
The CRISPR arrays of CP029562_3 >merge|CP029562|3|1497623-1497733|CRISPRCasFinder GGCGCTTCTATCTCCCCCCTTGCGGGGGAGCCTGGTTCGCCACAGGCTTAAAAGGTACGAATGCCCGAGGCCATAGGGCGCGGCGCCTCTATCTCCCCCCTTGCGGGGGAG >CP029562|3|3|1497623-1497733|CRISPRCasFinder GGCGCTTCTATCTCCCCCCTTGCGGGGGAG CCTGGTTCGCCACAGGCTTAAAAGGTACGAATGCCCGAGGCCATAGGGCGC GGCGCCTCTATCTCCCCCCTTGCGGGGGAG
>CP029562.1|QAZ42747.1|1496318_1497533_+|MFS-transporter MSTMMDSKGAAAAATPSARWALLALAIGAFGIGTTEFSPMGLLPVIAQGVDVSIPTAGLLISAYAIGVLVGAPFMTLAFSRFGKRTALMLLMAIFTIGNLMSAMAPGYFTLLAARLVTSLNHGAFFGLGAVVAASVVPKEKQASAVAAMFMGLTIANIGGVPAATWVGQQIGWRMAFAGTALIGLLAITALWLALPRGERGQMPDVKRELRVLTRPAVLLAMATTVLGAGAMFSLYTYVAPTLETLTGASGSFVTLALALIGVGFTIGNWLGGRLADWSLDGATKIFLAALALIMFMLPLLFASHIGVAIGLLVWGGAAFAIVPPVQMRVMEAAAEAPGLASSINVGAFNLGNALGAALGAAVISLGLGYAAVSIAGGLLAAAGLGLVFVGGRQKARTMAVCGA >CP029562.1|QAZ42746.1|1494977_1495802_+|2,5-didehydrogluconate-reductase MQTVTAHGAEIPALGFGVFRMSDAEVERIVPAALEAGFRHFDTAQIYQNEAALGRALDRAGAQREELFLTTKVWVDNYSEGRFAASVDESLDKLKVDQVDLLLLHWPAEKVPVAEQIAMLNAVQAAGKTRFVGVSNQNIAQMRESVERSPAPIVTNQVELHPYLPQPALVAAARAAGVAVTAYYGMADGAVPQDTLLQEIGAKHGKSAAQVGLRWLVQQGFVALSKTANPARVAENIAIFDFALDAADMAAIAGLARADGRLVSPPGLAPAWDV >CP029562.1|QAZ42745.1|1494694_1494886_-|hypothetical-protein MCSRAPRSLSLCLKQFRTENRYALFLELLWGRSAFVFHGNAETVRHFTWMTGPDGPRRRRRLP >CP029562.1|QAZ42744.1|1493691_1494594_-|LysR-family-transcriptional-regulator MDNRTGEMQVFLRVVETGSFSEAARLLRLTPSTVSKLVSRIEARLGVRLIERSTRALSLTAEGQLYYQRSQALLKELDQIEQELSYGAAATGGTVRVNVSVAIGVLGLEPILPEFWRAYPNIVVDLSLSDEIVDLYLDRTDIAIRVGTLPDSGMVARRIGTACRKIVAAPSYLARQGVPRTIEDLARHNCLGFNFRRAEPVWPLKQSGRIVDRVVHGSLLANNGETVRRMTLAGVGLGRMGDYHVRADLAAGRLVEVLADVVEPDAEEMHAVFLGGQRMPQRVRVFLDFVVPRLQGFLEG >CP029562.1|QAZ42743.1|1493069_1493642_+|hypothetical-protein MTNLCLPAKAEVEIVRELDVSRIGYLDEELASSEDGIMLNHIYFDTRGCEAADVELILEEELELLESLEEAGWNTPEASEIIDSHFSDWSELTGFDVGIGGAVLALSAAGATPITSCNGGTIGIEHHSSSVPHILFAGSATMNASAIHQAIEIADLGSVYSGEFGEIYADNVLKFPTFARALIEALMGKD >CP029562.1|QAZ42742.1|1491089_1492769_+|DNA-mismatch-repair-protein 
MLDQDKENFRFEISLSVLNHLGRNLYRNFITVLGEAISNAWDADAKNVWIDINRETSHFSIKDDGIGMSAEDFQTKFLKIGYSKRSDGATQTNRKRPYIGAKGIGKLALLSCAKRISIISKKDDTDYVGGVIDNSGLDKAITQDLTPEQYPLEALDLGLFKNLTRDHRHGTIIFFEGAKEQIKNSVAHIRKMIALSFKFALIDKEFSIFVNDKEVTFKDIQDVLDATEFTWLINEADDLFLPEMKKLKASPKRLTTTLSLTGFIASVEKPKNLKITGTDERATIDLFVNGRLREKNILRHIPSQRVVESYIYGQIHFDAMDTGATIDPFTSSREGVVEDDRNFAALLEYLKHDALPAIFDQWDDLRLGRGEEGDEDNPRKTKKQRRARDLYAAAREEYEPDEGTENKDEVDNWLDYLRSDAEYNITSYVDCFLSENLLREYIRTYKLPLTSGISKEISEWQGREAINLGAANISYAVRRDSDGLTYLGMDALAVTVEGSKIANGNQSLWTDAIQFKPVRNVVGHTGLLTKNAKTGLTLTFENIKSRVRTLISSKPKSSP >CP029562.1|QAZ47002.1|1489385_1490492_-|DNA-(cytosine-5-)-methyltransferase MNANGILQVDAGGFTTAVIDLFCGAGGLAYGLKSAGLTVKAGVDLDPSCKHPLEENTGAIFACRNVEDVTPADLERWFGEADVRVLAGCAPCQPFSTYSQSRKSVDGRWELLKQFQRLAIALKPEIVTMENVPGLANQQVWEDFIAALQEAEYKVSWCEVACEEYQVPQSRRRLVLLASLLGPIELKASESKLKLTVKDAIGELPRIAAGEKSTDDPLHASAGLGEVNLKRIRASAPGGTWRDWPEELRAPCHRRSTGKTYPSVYGRMEWDKPAPTMTTQCYGFGNGRFGHPEQDRAISLREAAIIQSFPKEYSFLAEGEEVTFERLGTLIGNAVPPKLGEAIGRSILAHVNAVRNGTAPLEGQLNLR >CP029562.1|QAZ42741.1|1488287_1489364_-|DUF262-domain-containing-protein MARQSRLHLISNEQLAAAEAEIVDRSRRIDFYMTEYSVELLAEKMAREEFVIPAYQREFTWEPKRKSRFIESLIMGLPIPFLFFWEMGNGKLEIVDGSQRLRTLHEYILGNMKIDQLDELPSLEGTRFTDLSPSRQRKIKNRSIRGIVLNEHADDQARFDMFERINTGSKVANTAEVRRGALRGPFLELVLELSRLDLFAQLAPVSGKAEKERIREELVTRFFAYGDGLEGYRERPAQFMFDYTKRMNERMQAEPHLIGAYRTRFERMLDFVQRAFPLGFRKPSNPNSTPRARYEAIAIGSHRAMEENPAIFDTVPDTAEWLASDVFSKKTTSDAANTRAKLEGRIEFVRAALLAAAQ >CP029562.1|QAZ42740.1|1487538_1488291_-|hypothetical-protein MSELSDRFDERFGEIVAYLELIDGIEKLVQSGVPRLGDNGPTVTAPQQRILNSSVYLQLYNLVEATVSNCLDAVSKAAMRRVAWSPGDLTSELRKEWVRYMARTHLPSAPDKRLEDAISLCNHLVAALPVAEFEIEKGGGGNWDDKAIRKVAARIGFELRVSRGVEREIKRKVRNDLGSLALIVELRNALAHGRLSFVDCGQDDSAAELRALADRVAAYLREVIAAFDSYLQEHRYIVPERRPEQEAAVV >CP029562.1|QAZ42739.1|1487111_1487507_-|very-short-patch-repair-endonuclease MAKVGPRDTKPEMIVRRILHALGRRFRLHRRDLPGTPDIVLPAARKAIFVHGCFWHRHEGCPKATIPKTRVDFWLDKFARNIERDRRKEHELRDAGWDVLTVWECETRHPLVLKITLRDWLGRTGSHFGCR 
>CP029562.1|QAZ42748.1|1498351_1499239_+|hypothetical-protein MPYHLPLSLIRYTVDQGARTAKIDTMDVVDKHTRYLAYRENGFADENVCVDRTSTGLLNKVYFGSQDRTPDILLNISQLLAEGFQADGARKSPQAEAASAGCSGTAVSPWMDPYDEEALAAFNRQLCGTRIKVPRELFHTGPAVQCPKDAVCFATKSDFFADLVKPDGAVVSPTNKIAIATPRDIGWIKVREAFFNKRITQLDFDNGVLTTMRVRKESEILGLSQLPLNVVERVLAVPGNAIGMAFGTYQEKLFYLQRRKELHEANSATAPAAPPAEPAVYTDLKCAAGAAAADK >CP029562.1|QAZ42749.1|1499264_1501514_+|hypothetical-protein MFDSKAAGFVLAISLGLAGPAAADEKVLVLRIKPGELTRETAQFIGENGKLTTVNSDRPRTIQNIVQSICATAPPAFIVAVQAANKNSAFDYATQIAAGQNILMPPCPPKQTTSLLARNVQPGDTVWGYYKTGTIGADFAPKYALDVPPDNYAPPLNDAQVVAMWDRDISDVTSDSISTKQVVGAFPLGPNNHLVRRNILDTLGVGAKFDSSGKLFAETGLAISADPDTSFVDMFQRANPRIGSADQLKPGEIVLVPDQTPQEYVLPLRDPAESTKLASLSPEAFTKKLGTPVTDVMETTDFRFFKRLADKCVPEDAARMSTVLAGARQLLEENWKQADQQSATVAPEMVFVDSGIFKPTELPAHPALLKSVDQSSPASLASSPSEAELEPPKLLPDAMHGTNVAALALGTSDFAKLARSIGYRMKVRAFRAFTESTTLALRPDGSITIVDGKADVVVHTVDRDRIFDAIKYSADSIVTLSFGREKPIDELRAKLDPLSQTLFIVAAGNDHKSLDTIPLFPGRHGGGEAFNIMTVASIDGDRKLSGFSNWSSDFVDIAAPGCGVATLSYDKDQSAFFGETVNGTSFSAPQVSYTAALIKAVRPSYTGAQIKARLLAGSDIEPALTGSVQEGRVLNMVKALALQHDVVDVRKANGERELRFGQLKPELKPDQFCKAAPDIGNLRILKVIPAFDERDAKPGTTTINFADPGGLVKSATCESKDFQVTLHELSGATVQFSMGEINDITPAWR >CP029562.1|QAZ42750.1|1501488_1502502_+|hypothetical-protein MTSRPPGAEALPPRKRFATQPCRNGAGRVAPTFLATTLLATTLLATLGLPSTAWAEGKPAAVERPEQLLGAPGSVSIDVNGSDVEVPYAITTDGYAVLEGDMIIGGAQELFQLQKMQEDEARKTQKLTLASVDAFQLFGLFDIAGRVWPKGIVAYRINANVDADGKKRLDAAMAMWTSTTKIQFVSAVAGTRAYVEFVHANNADNCSSEVGFLGRRQLIYMGDNCDAGNMAHEIGHALGLGHEQMRSDQDKFVTIKTANIMSGYLKFFTPKPWAYVDAGSYCYGSLMHYGPKMFSSNNFPTIVPKDPSARIGQRDRVASCDQRVIQAAYQAQFAARP >CP029562.1|QAZ42751.1|1502556_1503039_-|hypothetical-protein MKRLIAAALLAAIVTGTAAAEPQFADYPVKTHLKGKPVLPKFTGDTAQFRTRIRNGMKEGPNFAGHFSLIEIGCGGSCIFTFLIDARTGKVMDFTLGGEEFYQLQLKYRLDSTLLQADWMDTSVGSYDTCIRRFYDISSGTPKQVSQESRKIEQSGYCGE >CP029562.1|QAZ42752.1|1503331_1504684_-|HlyD-family-type-I-secretion-periplasmic-adaptor-subunit 
MKIDVRNKTRMGAKPAADTPAASDSMRTAAIAGWAIIATFFGGFGAWAVTAPLNGAVVANGYVRVEGNRKSIQHLDGGIVKQLNVREGDRVSEGDVLIRLDDSQARAEYNVLDQQLLVLRATEERLKAELADASGLTMPDDLKQAARDNPNVEGIWRSQIHQFDSRLALLAGQRAVIKEKIAQLEAQITGSEAEAKAFRQQYASVQAERESLTGLLKQGLISKSRYLQLERSGTGLEGQAAETAANIARARQAIAEQLQQMAGLDNERMTEVSKDLRDTQAKLLEVIPKLTNAKAVLSRIDIRSPYSGEVVGLTAFSVGGVIMRGEKIMDIVPDRDSLIIDAQIAVEDIASVHPDMTANVHLTAYKQRITPVVHGTVIQVSADRLIDKRNDAPYYVALVRIDEKELTGLPEVHLYPGMPTTVMIETVQRTALDYLVGPLAMQFEKGFRQK >CP029562.1|QAZ42753.1|1504680_1506429_-|type-I-secretion-system-permease/ATPase MLTEPKSSELRDVLRRCRRYFATALAFSLAINLLYLAGPLYMLQVYDRVISSGSHVTLVMLTIVLMFALACFAGLDFVRARVLTRASVRLDREMAGRLLTATMASSARGVPASSQTLRDFDTYRQFMTGPGIHAVFDLPWAPIYILVIFLMSPLLGWFSLASAVTLVALAVYGQWRVQRPMAESADLASRNYAFADMSLHNAEVVRAMGMMPGLLTRWNIDRDRVIERQVTASDRAATNQSLIRFLRLAMQSLILGLGAYLVIERLQTIGVMFAASILLGRALQPVEQIVGAWRNMIAARNARQRIEMLLATNPAAAPALALPRPEGRLSVEGLVYGLPRNPILRGVSLSLAAGEVLGIIGPSGAGKSTLARQIVGVLAPIAGTVRLDGADVSTWPREELGPHLGYLPQDIELFADTVAANIARFRDDGEDNDVIAAARLAGVHDIILRLPKGYETNVGAGGAVLSGGVRQRIALARAVYGNPAILVLDEPSSNLDSEGDTALLACIAACKQRGTTVIIISHRPNTFAVVDKLLVMKDGAVELFGPKNEVIGQLTRLATVRPEAEPAAPAGAQGATMARAAQ >CP029562.1|QAZ42754.1|1506567_1506915_-|hypothetical-protein MLKNGGVTQKDTNGDGSITAADGYVLNYNNAIEPGKDGFTTSVKANSAAWQTAVGGVPSGSSIFAAMNALTDSTTQNMVVSQNQSHVVMATDLGDHFGLLGVTLNSPDGYLQLHA >CP029562.1|QAZ42755.1|1506908_1512710_-|hypothetical-protein 
MTQESIPTRDTSTTDPVVTGATDPTIPEETTEGTSSVTPSQQLWVTLDQSDADYAPGETVGITASNVSTGGSLEFSVGHVSAGADGILGTADDVISGDLTGTGTPWVVTDGGAGDLDGLVNGAIQTSWYVNADALNQAFVLSAIDQASGQVATASFTDAAPPDASGAAADLTKDLSAVPGGDPTHNIGGALFQVYHVKSDFGQSAGTGTFDTFVQVQGPNNPPTEQGYNYDRSGTYGDVLSPQYNENTSGEHNTALLLSEIPIVTINGVNYREFLLDSNEATGANKEFISLDSLQIFQETSGTLGVGHVSPGANTPFTPGSGFGVAGEHLVYNMDASGNVWVAMNSTLSSGSGDSDIRVLVPDSAFLHDAAHQYIYVYSAFGFQGGDWQTNSGFEEWGTNPAALITFDISGKKYTDANGDGLVVGDVGLGGITIYIEKDGIAGLTAGDKSTVTAADGSWSFTGLDSSYDGMKVFEVLPNGYVQTLGQAGYTIDGISGQDQTGLNFANFEKFDISGKKFTDANGDGQTVGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGNDQTGLNFANFEKFDISGKKFTDANGDGQTAGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGNDQTGMNFANFEKFDISGKKFTDANGDGQTVGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGNDQTGMNFANFEKFDISGKKFTDANGDGQTVGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGHDQTGLDFANFAKFGISGTKFTDANGDGQTVGDVGLGGVTIFIDKNGDGLNNDGAANQTLTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGSDQTGLNFANFEKFDISGTKFTDTNGDGQTAGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGHDQTGLDFANFAKFGISGTKFTDANGDGQTVGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYIQTLGQAGYVITGTSGQNQTGLNFANFEKFDISGTKFTDANGDGQTAGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGNDQTGLNFANFEKFDISGTKFTDANGDGQTAGDAGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKKVYEVLPNGYVQTLGQAGYTIVGTSGNDQTGLNFANFEKFDISGKKFLDANGDGQTAGDAGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGNDQTGLNFANFEKFDISGKKFLDANGDGLTTGDAGLGGITIYIEKDGIAGLTAGDVSTVTAADGTWSFQNLDASYAGKKVYEVLPNGYVQSLGQAGYTIVGTSGNDQTGMNFANFVPGSIHGFKFNDMDANGKFDGTDVAMAGITIQLKGDVDGNGTIDTLTTTTDANGFFAFVNLHPGTYTISELFTDGNSWAATVDHNNDGVGDATTTVTITSGQELVAVKGEAGQLDPLHSEVVVGNTLTFGNHELGAVGLTPGFWYNHLYVWDAPIIGADTSNGGVDGKGVSLASKLAAAGTIEKADIASLLPGNVDVDKDNHKDLIVTGSNGKTLVIEWDDAREIVGASNGTGGDKLGDFARYAITTLLNDVGVPDFNAPNGVLTDIADWC >CP029562.1|QAZ42756.1|1513046_1514183_+|hypothetical-protein 
MGVVPKGLLRRPIEPAVVSDLYDAALDPQGLRGLAEIVMAVGQGSSAAVSLERGNSFEEVATFNLPDKALQDYAAYYRRINPCIPIAMARHPGNISRFSDLIAEQDLERTEFYTDFMQVHDTVRAMGAPGIAIGPNLLLQVGVHRSRGSTDFDGNDVARLQGMVPHLKRAMQLRRRLSGGLDINAGLAALEAFAFGCVICDAAGHVLFANRAAEALEAGGIVTLTTRQGLQARNPGQSRQLATSIGATAAGGTGDALILTARDGTRVFALTTPLPVRFGGQPGHVLVTFRSESAATTLDATALQQLFPLTTAEARLALALAGGHSLARYAAEHKVSDNTLRTQIASILHKTGTENQRELVRLLGLLPPVDRMAGVGNS >CP029562.1|QAZ42757.1|1514735_1515596_+|oxidoreductase MRYIRPLTIEDAVGQLVGASGTAAILAGGSDLLVRMKGGFVEPDLIVDIKAIAGLSEIREIAEGFSIGAAVPCAVLGENKVLKKAWPGVVEAAKLIGSKQVQGRCTIVGNLCNASPAADSVPALVAAGARAVVAGPSGKRTVAVESVPTAPGRTSLAKGEIIEAILLDNRAPRSGDAYLRFIPRTEMDIAVVSAGVNLTLDETGVIRTARVALGAAAPTVLLVEEAAEVLVGSKLDEATLERLAKVCAGACRPIDDKRGTIEFRRKVAGVLARRVATTAYERAGGK |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP029562_4 | 1518730-1518833 | Orphan |
NA
Consensus repeat of CP029562_4
|
1 spacer
spacers of CP029562_4
>4.1|1518756|52|CP029562|CRISPRCasFinder CGCTGTGCCTCGCGGGCGAGAATTCTGGAACGTCTCAGAGATTGAGCGCCGT |
CRISPR arrays and Neighbor proteins around CP029562_4
The CRISPR arrays of CP029562_4 >merge|CP029562|4|1518730-1518833|CRISPRCasFinder TTCACCCAACGCCCCTCATCCTGAGGCGCTGTGCCTCGCGGGCGAGAATTCTGGAACGTCTCAGAGATTGAGCGCCGTTTCACCCAGCGCCCCTCATCCTGAGG >CP029562|4|4|1518730-1518833|CRISPRCasFinder TTCACCCAACGCCCCTCATCCTGAGG CGCTGTGCCTCGCGGGCGAGAATTCTGGAACGTCTCAGAGATTGAGCGCCGT TTCACCCAGCGCCCCTCATCCTGAGG
>CP029562.1|QAZ42760.1|1518337_1518583_+|molybdopterin-synthase-sulfur-carrier-subunit MVAVTLWGSLAAVAGGKDKLDIEAKDIRELFRKLAEQYPAIEPFINRGIAVAIDGTIYRDTWSKKLPEGAEIFLLPRLAGG >CP029562.1|QAZ42759.1|1516073_1518338_+|oxidoreductase MNFDPRLTNRSFSAVGTRPIRPDGIDKVTGRARYGADFNMAGQLVGRVLRSPHAHAVIRGIDTSKAEALPGVKAVITAADLPDLTNGDSALYDILDNCMARRKALYDGHAVAAVAAIDARTARQALKLISVDYEVLPHVTDVDEAFQHGAPLIDDAIFTEGLVEKPVKPSNVVKRTQFGHGDVHKGFGEADFIVERSFRTEQTHQGYIEPHACVASVSADGMAELWVCTQGPFVYRQHCAQLLGMEASKLRVTSSEIGGGFGGKTHVWAEPVALALSRKAGRPVKLVMTRDEVFRASGPTSATSIDVRIGARKDGTITAAEATLRYSCGPYAGGWAEVGAMTAFACYRLDNVRTVGFEVLVNRPKTAAYRAPSAPMAAFAVESSIDELAGKVGMDAVDFRIRNAAQEGTRASYGPVYGPIGIGPTLEAVKKHPHMQAPLKKNQGRGMACGFWFNFGGQTCVDLNIGMDGSVSLAVGTVDVGGSRASLSLVAAEELGIDYGQVKAIVADTSSLGYNDMTDGSRGTFSSSMATISAARNAILILRERAAQMWDIPADDVVWEKGHALARGEKYGNLSPLSLKEIAAQSGTTGGPIAGHSELVADGAGVSFATHICDIEVDPETGATRVLRYTVVQDAGKAVHPTYVEGQYQGGAAQGIGWALNEEYIYGKDGRLQNAGFLDYRIPVCSDLPMIDTQILEIPNPNHPYGVRGVGETSIVPPLAAIANAVSNAVGVRMTHIPMSPPRILAALEVERKA >CP029562.1|QAZ42758.1|1515595_1516069_+|ferredoxin MAGIAVSTTINGDAVEYLCQPDETLLDVLRDRLGLTGAKEGCGTGDCGACSVIVDDRLVCSCLVLGAEAEGRRVETIEGMAHGDTLHPLQQKFLEHAALQCGICTPGFLIAAKDLLAKNPDPTEEEIRFGLAGNLCRCTGYDKIVRAVQDAASVMRG >CP029562.1|QAZ42757.1|1514735_1515596_+|oxidoreductase MRYIRPLTIEDAVGQLVGASGTAAILAGGSDLLVRMKGGFVEPDLIVDIKAIAGLSEIREIAEGFSIGAAVPCAVLGENKVLKKAWPGVVEAAKLIGSKQVQGRCTIVGNLCNASPAADSVPALVAAGARAVVAGPSGKRTVAVESVPTAPGRTSLAKGEIIEAILLDNRAPRSGDAYLRFIPRTEMDIAVVSAGVNLTLDETGVIRTARVALGAAAPTVLLVEEAAEVLVGSKLDEATLERLAKVCAGACRPIDDKRGTIEFRRKVAGVLARRVATTAYERAGGK >CP029562.1|QAZ42756.1|1513046_1514183_+|hypothetical-protein MGVVPKGLLRRPIEPAVVSDLYDAALDPQGLRGLAEIVMAVGQGSSAAVSLERGNSFEEVATFNLPDKALQDYAAYYRRINPCIPIAMARHPGNISRFSDLIAEQDLERTEFYTDFMQVHDTVRAMGAPGIAIGPNLLLQVGVHRSRGSTDFDGNDVARLQGMVPHLKRAMQLRRRLSGGLDINAGLAALEAFAFGCVICDAAGHVLFANRAAEALEAGGIVTLTTRQGLQARNPGQSRQLATSIGATAAGGTGDALILTARDGTRVFALTTPLPVRFGGQPGHVLVTFRSESAATTLDATALQQLFPLTTAEARLALALAGGHSLARYAAEHKVSDNTLRTQIASILHKTGTENQRELVRLLGLLPPVDRMAGVGNS 
>CP029562.1|QAZ42755.1|1506908_1512710_-|hypothetical-protein MTQESIPTRDTSTTDPVVTGATDPTIPEETTEGTSSVTPSQQLWVTLDQSDADYAPGETVGITASNVSTGGSLEFSVGHVSAGADGILGTADDVISGDLTGTGTPWVVTDGGAGDLDGLVNGAIQTSWYVNADALNQAFVLSAIDQASGQVATASFTDAAPPDASGAAADLTKDLSAVPGGDPTHNIGGALFQVYHVKSDFGQSAGTGTFDTFVQVQGPNNPPTEQGYNYDRSGTYGDVLSPQYNENTSGEHNTALLLSEIPIVTINGVNYREFLLDSNEATGANKEFISLDSLQIFQETSGTLGVGHVSPGANTPFTPGSGFGVAGEHLVYNMDASGNVWVAMNSTLSSGSGDSDIRVLVPDSAFLHDAAHQYIYVYSAFGFQGGDWQTNSGFEEWGTNPAALITFDISGKKYTDANGDGLVVGDVGLGGITIYIEKDGIAGLTAGDKSTVTAADGSWSFTGLDSSYDGMKVFEVLPNGYVQTLGQAGYTIDGISGQDQTGLNFANFEKFDISGKKFTDANGDGQTVGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGNDQTGLNFANFEKFDISGKKFTDANGDGQTAGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGNDQTGMNFANFEKFDISGKKFTDANGDGQTVGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGNDQTGMNFANFEKFDISGKKFTDANGDGQTVGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGHDQTGLDFANFAKFGISGTKFTDANGDGQTVGDVGLGGVTIFIDKNGDGLNNDGAANQTLTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGSDQTGLNFANFEKFDISGTKFTDTNGDGQTAGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGHDQTGLDFANFAKFGISGTKFTDANGDGQTVGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYIQTLGQAGYVITGTSGQNQTGLNFANFEKFDISGTKFTDANGDGQTAGDVGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGNDQTGLNFANFEKFDISGTKFTDANGDGQTAGDAGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKKVYEVLPNGYVQTLGQAGYTIVGTSGNDQTGLNFANFEKFDISGKKFLDANGDGQTAGDAGLGGVTIFIDKNGDGLNNDGAANQTVTAADGTWSFTGLDASYAGKTVYEVLPNGYVQTLGQAGYQIVGTSGNDQTGLNFANFEKFDISGKKFLDANGDGLTTGDAGLGGITIYIEKDGIAGLTAGDVSTVTAADGTWSFQNLDASYAGKKVYEVLPNGYVQSLGQAGYTIVGTSGNDQTGMNFANFVPGSIHGFKFNDMDANGKFDGTDVAMAGITIQLKGDVDGNGTIDTLTTTTDANGFFAFVNLHPGTYTISELFTDGNSWAATVDHNNDGVGDATTTVTITSGQELVAVKGEAGQLDPLHSEVVVGNTLTFGNHELGAVGLTPGFWYNHLYVWDAPIIGADTSNGGVDGKGVSLASKLAAAGTIEKADIASLLPGNVDVDKDNHKDLIVTGSNGKTLVIEWDDAREIVGASNGTGGDKLGDFARYAITTLLNDVGVPDFNAPNGVLTDIADWC 
>CP029562.1|QAZ42754.1|1506567_1506915_-|hypothetical-protein MLKNGGVTQKDTNGDGSITAADGYVLNYNNAIEPGKDGFTTSVKANSAAWQTAVGGVPSGSSIFAAMNALTDSTTQNMVVSQNQSHVVMATDLGDHFGLLGVTLNSPDGYLQLHA >CP029562.1|QAZ42753.1|1504680_1506429_-|type-I-secretion-system-permease/ATPase MLTEPKSSELRDVLRRCRRYFATALAFSLAINLLYLAGPLYMLQVYDRVISSGSHVTLVMLTIVLMFALACFAGLDFVRARVLTRASVRLDREMAGRLLTATMASSARGVPASSQTLRDFDTYRQFMTGPGIHAVFDLPWAPIYILVIFLMSPLLGWFSLASAVTLVALAVYGQWRVQRPMAESADLASRNYAFADMSLHNAEVVRAMGMMPGLLTRWNIDRDRVIERQVTASDRAATNQSLIRFLRLAMQSLILGLGAYLVIERLQTIGVMFAASILLGRALQPVEQIVGAWRNMIAARNARQRIEMLLATNPAAAPALALPRPEGRLSVEGLVYGLPRNPILRGVSLSLAAGEVLGIIGPSGAGKSTLARQIVGVLAPIAGTVRLDGADVSTWPREELGPHLGYLPQDIELFADTVAANIARFRDDGEDNDVIAAARLAGVHDIILRLPKGYETNVGAGGAVLSGGVRQRIALARAVYGNPAILVLDEPSSNLDSEGDTALLACIAACKQRGTTVIIISHRPNTFAVVDKLLVMKDGAVELFGPKNEVIGQLTRLATVRPEAEPAAPAGAQGATMARAAQ >CP029562.1|QAZ42752.1|1503331_1504684_-|HlyD-family-type-I-secretion-periplasmic-adaptor-subunit MKIDVRNKTRMGAKPAADTPAASDSMRTAAIAGWAIIATFFGGFGAWAVTAPLNGAVVANGYVRVEGNRKSIQHLDGGIVKQLNVREGDRVSEGDVLIRLDDSQARAEYNVLDQQLLVLRATEERLKAELADASGLTMPDDLKQAARDNPNVEGIWRSQIHQFDSRLALLAGQRAVIKEKIAQLEAQITGSEAEAKAFRQQYASVQAERESLTGLLKQGLISKSRYLQLERSGTGLEGQAAETAANIARARQAIAEQLQQMAGLDNERMTEVSKDLRDTQAKLLEVIPKLTNAKAVLSRIDIRSPYSGEVVGLTAFSVGGVIMRGEKIMDIVPDRDSLIIDAQIAVEDIASVHPDMTANVHLTAYKQRITPVVHGTVIQVSADRLIDKRNDAPYYVALVRIDEKELTGLPEVHLYPGMPTTVMIETVQRTALDYLVGPLAMQFEKGFRQK >CP029562.1|QAZ42751.1|1502556_1503039_-|hypothetical-protein MKRLIAAALLAAIVTGTAAAEPQFADYPVKTHLKGKPVLPKFTGDTAQFRTRIRNGMKEGPNFAGHFSLIEIGCGGSCIFTFLIDARTGKVMDFTLGGEEFYQLQLKYRLDSTLLQADWMDTSVGSYDTCIRRFYDISSGTPKQVSQESRKIEQSGYCGE >CP029562.1|QAZ42761.1|1519058_1519757_+|DUF899-domain-containing-protein MSTSSSELATKRRPRPGESEAYAQARQELLAQEIELQRHIDRVAEQRRKLPPGPAIEKDYRFKDVNGTEVGLGDLFGDKQTLVTYFWMYGPERERPCPMCTNLLGPLNANANDLKQRVALAILGRSPVERQVAFARERGWQHLQFHQTVGDDYALDFRGLDPAKGWEYPVLAVFQKSGDGGVRLFWKGEMTGEMADAGKDPRGGPDFAPLWSALDLTPEGRGDKWYPRLEYY >CP029562.1|QAZ42762.1|1519807_1521214_-|chromate-transporter 
MTAAAPDARLADRPAAHGIGFAEAVKVWARIAALSFGGPAGQIAVMHRILVEEKRWIGETRFLHALNYCMLLPGPEAQQLAIYIGWLLHRTKGGLVAGILFVLPGFVAIMALSWIYAIFGNAGLVQSLFLGLKAAVLAIVLEAVMRIGRRALRNAVMIALAAAAFIAIFVFRVPFPLIVIAAALIGYVGGRAGWAAFAAAGGHGKMGAHEVADADTALGEDIPAHALHAGRSSLKMAAILLMLWLAPVAVLLSLYGQANVFSQIAVFFSKMAVVTFGGAYAVLAYVAQQAVDTYGWLKPGEMLDGLGMAETTPGPLIMVTQFVGFMGAFRAPGAMHPLLAGTLGGLLTTWVTFTPCFLWIFLGAPYIEALRSNRALSAALATITAAVVGVILNLAVWFALHVLFGEVREIHALGATLDVPVLASVKLAALALAIVALLAVFRFRIGMLWVLAGCSLLGVLYGLAAGTI >CP029562.1|QAZ42763.1|1521210_1522035_-|sulfurtransferase MPSPTTISTDKLARLIGTPHCPALIDVRNDEDFALDPRLVPGSIRRSHRDVQTWYPTITRPSAVVICQAGRKLSEGSASWLRQLGTPAESLEGGIEAWFASGLPAVPAARLPPRDTAGRTVWVTRARPKIDRIACPWLIRRFVDPHAAFLFVAPQEVAGVADRFDATPFDVENVFWSHRGETCTFDTMVEEFGLGTEPLLRLAAIVRGADTARLDLAPEAAGLLAASLGLSRMHSDDLAQLEAGMTLYDAFYRWCRDATDESHNWPSARPGAAS >CP029562.1|QAZ47003.1|1522154_1522883_-|ABC-transporter-permease MWLYAWLQHAIAAQREIYLALAARIADFAQTGDWQPLLVYLPMGIVFGAIHAATPGHSKALLATYLTGSSASLRQAMLTTWVLSLVHISTAVLIALLALPLVSVTLGSVGRAPALEQASRGLLGLIGLWMLWQALRGTHGHGHREGTAMGFMAGLIPCPLTLFVMTFAITRGVPQAGIAFAATMLIGVGLTLSAVAFAAVFFRQHVLGALARWPRLIDWFGRSIQAAAGLVILVVAANAVRA >CP029562.1|QAZ42764.1|1522933_1524238_-|MFS-transporter MLAVLRDRTYRHLFAAQVIALVGTGLATVALGLLAYDLAGGQAGAVLGTALAIKMIAYVGVAPVASAFAERWPRRTMLVSLDLVRAAVALFLPFVTEIWQVYVLIFVLQSASAAFTPTFQATIPDVLPDEKDYTSALSLSRLAYDLESLLSPILAAALLTLISFHSLFGGTVIGFLASAALVLTVVLPSPKPSAPRGIYDRTTRGMRIYLATPRLRGLLAINMAVAAAGALVIVNTVVYVQAQFGLDQSRTALALAAFGGGSMIAALVLPRLLERVPDRTAMLCGTGLLAAATLLAALLPSFAWLLPLWFVLGIGYSVAQTPSGRLLRRSANPEDRPAIFAAQFALSHACWLLAYPLAGRVGAMIGLPATAVVLAAIAAAAAALAVRLWPADDPDMVEHSHDDLPASHPHLAGHRRHVHAYVIDDMHAAWPRRH >CP029562.1|QAZ42765.1|1524237_1524513_-|metal-resistance-protein MEPHRHETHPEIVKRLKRADGHLRSVVEMIEAGRPCLDIAQQLHAVEKAIAQAKKTLIQDHLDHCLDHVVGKLGQDQRASIDEFKEIAKYL >CP029562.1|QAZ42766.1|1524674_1524881_+|hypothetical-protein MGIVKIDEDLHEEVRRASTVMCRSINAQAEFWMKIGMLAEANPTLSFTEIVKRELATAQDRPAPSLVA >CP029562.1|QAZ42767.1|1524984_1525752_+|type-I-methionyl-aminopeptidase 
MVKTPDELALMRQSGRLLASVFEMLDGLELAGMSTLEVDTLVDRFITEDLAARPASKGQYGYRHVLNCSINHVVCHGVPDAAAIIRDGDIVNFDITLEKNGFIADSSKTYLVGDASPTARRLVRVAQEAMWKGIRAVRPGAFIGDIGYAIEKHAKKNGYSVVRDYCGHGIGREMHEEPQVLNHGRPGTGVRLQEGMVFTVEPMVNQGTRRVTTADDGWTVVTNDGKLSAQFEHTVAVTATGVEVLTLRRDEKLAA >CP029562.1|QAZ42768.1|1525974_1527318_+|chloride-channel-protein MRKRRLSSIGMVRRSRALVTSPRLWKPRLVFWAGAFAIGIISAGFAWLADEAQKVFAGVVGAGGWHGYIPLALTPAGFMLCAWLAFRFFPGSQGSGIPQAIAARHLRDDDERGRFLSLRMVWGKIVLTIVGLACGASIGREGPTVQVGASIMVQAARWGGMAQARGLILAGSAAGIAAAFNTPLAGIVFAIEEMGRSYQARTNGVVLSAVILAGLASLALVGTYTYFGVSHAYAAFPGDWPLVIACGIIGGAAGALFSSATLRITRRIRRWRTEFPLSRTLIVAGCAGLVVAVVGVLSSGATFGTGYAEARGAIEGHALPPLFFVAKFVVSLASTISGIPGGLFAPSLSVGAGLGSTLGQIFGADVGLAAILGMAGYFAGVVQAPMTAFVIILEMTGNHDNVIALMAASMLGYGTARLIAPEPLYHSLARLFLADAIRRRRVEQQAS >CP029562.1|QAZ42769.1|1527445_1528330_-|AraC-family-transcriptional-regulator MKAALQTYHARMLRVLDHIDRHLDGDLDLDALSRVAAFSKFHFHRQFMATFGLSVHRYVQLARMKQASQRLAAADAESVTEIAMDAGYDAPDAFARAFRQRLGQSPSSFRKSPDWEPWLVAFGPLYNARSKLMQIIFSQDDVTIRDEEPTPVAMLEHRGDRAMLGTTAERFRAWGKAAGLSAGKRSTFMVFRSERCPANPADYAMDLCLETDRPVDPDDPLMKAGVIPGGRCAVLRYPGNTNNLEPAATYLYREWLPASGEEVRDFPVYCRRRLALIPEVPAHEVVVELFLPLK |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP029562_5 | 1614320-1614403 | Orphan |
NA
Consensus repeat of CP029562_5
|
1 spacer
spacers of CP029562_5
>5.1|1614344|36|CP029562|CRISPRCasFinder GAAAACAAACATTTGGAGCATTTCCGCGTTTCCGTG |
CRISPR arrays and Neighbor proteins around CP029562_5
The CRISPR arrays of CP029562_5 >merge|CP029562|5|1614320-1614403|CRISPRCasFinder GGAAACGGAAACGCGCTGGTGAAAGAAAACAAACATTTGGAGCATTTCCGCGTTTCCGTGGAAAACGGAAACGCGCTGGTGAAA >CP029562|5|5|1614320-1614403|CRISPRCasFinder GGAAACGGAAACGCGCTGGTGAAA GAAAACAAACATTTGGAGCATTTCCGCGTTTCCGTG GAAAACGGAAACGCGCTGGTGAAA
>CP029562.1|QAZ42838.1|1613991_1614222_-|hypothetical-protein MNKSIPNDVFKPKPTRTESKDDTTSKAAKAIINDELAQRLAKTERLRQARLAQEAVAVVESKAAKPKARARKAGKA >CP029562.1|QAZ42837.1|1611445_1613767_-|membrane-bound-PQQ-dependent-dehydrogenase,-glucose/quinate/shikimate-family MIIVTSLLLALLGLLLGVGGIWLVMLDGSPAYLAIGVVFLVVALLLYRRSPLALWVYALLVIAALAWAIWEVGFDWWQLGPRGGLIILIGLWLLTPWVRRPLRTADGSLASPWPLAIPVLAAIVVAGYSMTTDPHDVAGELPKEGAAVASLGNLVPPGDWHQYGRTQFGQRYSPLDQINVANVASLKEAWRYQTGDVKLPDDIGETTYQVTPLKVGDTLYLCTPHNLAIALDAESGKEKWKFDAKPGLNPDRQHQTCRGVTYWLDTRVAAGQPCAARVYLPTSDARLIALDAASGKVCESFADKGALHLEAGMKYNPAGYYYSTSPPVAVDDKLIIGGAVNDNYSTQEQSGVIRAFDIRTGALLWNWDSGNPDDTRPIAEGATYTTNSPNSWSVFSVDERLGLVYIPLGNQVPDQLGMGRSENVERFSSSIVALDILTGKLRWVRQTVHHDLWDMDVPAQPVLFDLTRADGTIVPALVGPTKQGDIYVLDRRTGEPIIPVREIPAPGGAIPEDFSAPTQPISDLTFSPPPLQEKDMWGVSLFDQMMCRINFHRLNYEGRYTPPSLKGTIVYPGNFGVFNWGSVAVDPERQIMFGMPTYLAFTSRLVPRADIPPKGSNEKGSEQGLNRNEGAPYGVYMGPFLGPLGIPCQAPPWGYVAGVDLKSGKIAYKHKNGTVTDMTPLPLPFKLGVPGIGGPMLTRGGVAFLGAAVDDYLRAYDVTTGRQLWQARLPAGGQATPMTFATAGGKQYVVMVAGGHGSVGTKPGDYVIAYTLP >CP029562.1|QAZ42836.1|1610882_1611140_-|hypothetical-protein MAVPHHFGGIAYPDDLKFLKEIYDSVCLERGFHSDSAAAVDLARATMDLFTQGVTDEAEIRESLDAYLGRRSIAGELLRATGAPS >CP029562.1|QAZ42835.1|1610520_1610886_-|hypothetical-protein MSAFTPLTFRTARGYELNIARLNDVVKAISMAWPDKECELYRNAAHLAEKARLGWCTPRSAYDAFIVAARAQGCIVECRPLRDRVAEQLAAIEPENFNPLPTLHQTRPDSPDEELAKRPPG >CP029562.1|QAZ42834.1|1610153_1610474_+|hypothetical-protein MTQERNARDARIVKRAAEIWSAEGGPMNRHEAQWLMAMRQVDDEDLGIHVSTPESAGTLAPDSLRHRDALEPFGPAAIDEETSRSAKAASGLPAAIRVARWFFARR >CP029562.1|QAZ42833.1|1609470_1609704_-|hypothetical-protein MSSLVGARARWPPSIDSALETAADVAPYALVLVKARLIFTGCMRWHSRFRASTGRGKLLLLVFSKKEISHLIVYIAE >CP029562.1|QAZ42832.1|1608474_1608654_-|hypothetical-protein MAKGQQRSNREIRKPKQDKRLPKPESPFGKPAAGGNDALRDKARPDSSKQTPERREHRR >CP029562.1|QAZ42831.1|1607861_1608143_-|hypothetical-protein MNARINDTVQERQSLAIGVWDNEGGAQLPEGADRQYGRRIEVDQTWTVYHVFTGVPAQADGQTMTGLSRSKATDGMVSLNRRNEARRFAGQKP 
>CP029562.1|QAZ42830.1|1606821_1607568_+|Crp/Fnr-family-transcriptional-regulator MHGSFDTVDNNLLRALRPDDWAILQPHLEDWSAQSGALIHEPGDTVRHAYFPRGPSLISYLVVLRDGRGIETVLVGREGAVGGIVSQGRLPAYARAVVQVGASFWRIDVQDLEQAKSQSPNLRHLLTRYADCLLAQIFQSVACNAAHTIEQRTAKWLLAAMDRTGRHDLTLTQEHLAAMLGVGRSYLARVFHDMRGEGIIETKRGHVVVHKKSRLRSIACECNAAVTRHFDDVLTGVYPNGQEEAEKR >CP029562.1|QAZ42829.1|1605978_1606278_-|hypothetical-protein MTVLALVCSSLLCGPARAHDAPVGWSYDVECCSGLDCYQAPASDVKETRDGYLLSTGELIPYSDRRIRPSRDEFFHECKPGGQTASPRSLCLYVPNRGL >CP029562.1|QAZ42839.1|1614498_1614708_-|cold-shock-protein MASGTVKFFNSTKGFGFIAPDDGSNDVFVHVSAVERAGMRGLAEGQKLNFDVVADRKTGKSSADNLQAI >CP029562.1|QAZ42840.1|1614963_1615170_+|hypothetical-protein MMPSFYAASRDALFRFGRDLRVLPKEEPVAEMPTIVARSTMATGRCRDNDASGAARRAREMMERNHLI >CP029562.1|QAZ42841.1|1615234_1615933_-|hypothetical-protein MTAQEPVHVRIARALATEFGNSLWQANEPLPSETELARYFKVTRRTLRKALSIVEAEGLIAKNQGRNSLYRGRSIALSHDTIVDLPTAAREAGFRLTTKLLRLGEARAGLADARALSVKLGDTVGEICRLRLLDGRPVVQQRSVIPQNLLLGIRPADLERSSLYRELQHLHGIDGLVIGREQFIQSSATAEEAAFAGIEAGHPVVRVLRLVLADGKPVEYSNSILIGGYFRF >CP029562.1|QAZ42842.1|1615929_1616859_-|ABC-transporter-permease MSLISRFGTLFGLLAIIALFSILSPTGFAQPSNLINITQQMALLAIVALGVVFVMAVSEFDLSIGAVVSMAGITSVYLFGQGWSILPTIAVTLAAGFAVGCVNGLVVSAWRVPSFIMTLAMGTIIGGFTFWLSAGATLFGNIPPAFRDLGRGYLAGIPVPTLWALGAVLFVTLMLDWTELGRRMLALGGNREAARLTGVPVVPTTIKAFGLCSAMAAAAGLLLTAKLGSAHPTGGNGFLLQAYAAVYLGITAFRDGQPSAAGTLLGTAIIAVVANGLTILGIPNYMQDVLTGLIIIASVLVRNVGARAR >CP029562.1|QAZ42843.1|1616855_1618364_-|sugar-ABC-transporter-ATP-binding-protein 
MLEVAGLSKRFGGVRALDGVALGVAAGRVHALLGENGAGKSTLLKCLSGVHQPDGGSMLLDGREFAPLTPAHSEKAGLRFVHQELNLVPHFTAYENAWVGRRYPRRGGLIDWRAMRARFSETCERYGLEIDIDQPVGRMSIGRQQIVEILRALMDEARILVLDEPTAALSEKEAAVLHRIVRQLAAHGCAVIFVSHRLEEVMAIADDYTVLVNGATAGSGRIAETDRDGLVTMMAGGGFQAGPAASMTTTGPTVLALSKFAAAPGCRPVDLNVAAGEIVGIYGIVGSGRSSLLKAIWGAGAYSQGGISIDGRALPATGIASRIRAGVAFAPEDRRAAGLVMDHSILDNTLLPRLRLNRLVQALPLLSWRSARRDVGQLLAQRGVKYGSINDRMSTLSGGNQQKVLIGRWVRTACRLYLLDEPTRGVDVRSKAEIHALCQSLAGQGAAVVFATSDIEELVTLAGRVLVMAGGTITLDSPNRDLGRRTIVEAAVRAAPEQESHS >CP029562.1|QAZ42844.1|1618427_1619336_-|LacI-family-transcriptional-regulator MKRTLAALLALALSSTAAFAEDIAVLTPYLSSVTTNEMVETYKSEAASKGWSINVVDTRGDFQQLASRVEDVTNAGVKAIVLVSVDPNQIVDQVEAASAKGIPVIVLDGAVAKGVTVNITTDNFALGTILSDYLFKAMGDKGNIVKFFHSAHPGVRQRELALDKALAAHPDIKVIAEHYVQVPGPIDNARQAMETFLRQHGSQINGVWAAWDEPAIGALLAIQSDAADSKVLIAGIDGNPQALDLIRQCTNLVATVRQDFPAIARTAVSETAAALEGKKPEKPEIFVEAKVVDRASLGVTCN >CP029562.1|QAZ42845.1|1619509_1620955_-|xylulose-kinase MNGSTILVVDVGTSALKAVLFGPDGSILASAVEPIATRHGMNGEHEQDVEGWWRALGKVTRTLPGAGAVASMAFSGSMQNLIALHGDGRPAAPAVLYSDQRLDADEVSGLAAKLPADYGRRIGNRPDGAHTIFRLMARQRYELPATDVFWAFGAKDALTFRLTGRRVIDPTVASTTGLMDFSSRRWDRDLLGLAGVDVASLPAIEPANGIVGRITAGAAAETGLPAGIPVFNGAGDAAAATWGAMADEPGAAYCYLGTTGWVAATVHSDDDMLPRDIYTLADPVRADCAIIISPFLTAGAAMDWLSATTGQTIEKLCKRAASHDEAPGKPLFLPYLGGERAPFQDRDVRGAFLGLEQTSDAGALALAVMEGIAFAVRHNLEAAELPRSLLTAIGGGVRHPLQQQILADVLDRDIRIPADSEATTAAGIMRMVAGKAGVKADAIAQDTIVKPRPQRVQRHIRRYEAYLSATSMARTLAAGLE >CP029562.1|QAZ42846.1|1621327_1622311_-|AraC-family-transcriptional-regulator MTPDLTDISSRLALAPGIRSGSADGAVAALSTSYRPHRLHLANAASALDFRHHAIDCEEASVNFLRYGPELTVDAGSFDTFYMLEFPVSGGVDLAYGDRRLGSHAGTGLLLSPGAYIRSTWRADTSQVMLRLKRTFVERAWQSYTGDAERRTPVFRAEIDLGSVAGRRIMSLIGLIVTERLDESAAPASSMPLINAVLQTVFEHAPALSAPRVESALAGAVPHYVSGFRKLLDNPANLHLAIADLAALLHVTPRTLTAGMRRFTGLSPHEYLTGQRMKHARVLLNHGGISVAEAARKVGYANAGRFAASFRAHSGMNPSRLMDTRDS >CP029562.1|QAZ42847.1|1622505_1623756_+|tryptophan-synthase-subunit-beta 
MTAQFQPNSFRTGPDEQGMFGIFGGRFVAETLMPLILDLEREWNKAKHDPEFKAELSALSTFYAGRPSKLYFAEGLTRHLRDVASAKGLGGGAKIYFKREDLNHTGSHKINNCLGQILLAKRMGKKRIIAETGAGQHGVASATVAARFGYPCVVYMGATDVARQSPNVFRMKLLGAEVRPVTAGHGTLKDAMNEALRDWVTNVEDTYYLIGTAAGPHPYPELVRDFQSVIGTEARQQMLEQEGRLPDTIIAAVGGGSNAIGLFHPFLDDRQVEIIGIEAGGRGLDGVEHCASMNAGRPGVLHGNRTYLLQNEDGQILDGHSVSAGLDYPGVGPEHSFLRDTGRVDYQPILDDEALEAFQLCTRTEGIIPALESAHAIAHAVKIAPGMARDKIIIVNLSGRGDKDVHTVAKMMGMEI >CP029562.1|QAZ42848.1|1623757_1624573_+|tryptophan-synthase-subunit-alpha MTTTRIDRRMAKLKAEGRPALVTYFMGGDPDYDTSLSIMKALPKAGSDIIELGMPFSDPMADGPAIQAAGLRALKGGQTLKKTLAMAADFRTGDDETPIVLMGYYNPIYVHGVERFLTEARASGIDGLIIVDLPPEMDAELCIPALKAGINFIRLATPTTDDRRLPKVLENTSGFVYYVSMTGTTGSAVPDAIEVSAAIRRIKQHTDLPVCVGFGISSADDVAAFAGFADGVVVGSAVVRAIADGTRNATPVEDVSAFVAQLRGATATMPA |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP029562_6 | 4421204-4421289 | Orphan |
NA
Consensus repeat of CP029562_6
|
1 spacer
spacers of CP029562_6
>6.1|4421229|36|CP029562|CRISPRCasFinder AGCGGTTTTTCGTCCGGAATTGCGTGAGGCAACAGG |
CRISPR arrays and Neighbor proteins around CP029562_6
The CRISPR arrays of CP029562_6 >merge|CP029562|6|4421204-4421289|CRISPRCasFinder CAGAGCAATTCCAGGAAAAGTGCGTAGCGGTTTTTCGTCCGGAATTGCGTGAGGCAACAGGTCAAGCAATTCCAGGAAAAGTGCGT >CP029562|6|6|4421204-4421289|CRISPRCasFinder CAGAGCAATTCCAGGAAAAGTGCGT AGCGGTTTTTCGTCCGGAATTGCGTGAGGCAACAGG TCAAGCAATTCCAGGAAAAGTGCGT
>CP029562.1|QAZ45012.1|4420361_4420883_-|hypothetical-protein MKRLELAIESIILASRWLLVVFYLGLGLALAIYALSFGKKLYEFVSTVMVLGDTDTILKILGLIDAALVASLVVMVIISGYENFVSRFDEHEGKVHWLGTIDVGSLKVKVASTIVAISSIHLLQVFLNSSSYSTDQLMWLTIIHLTFVLSALMLAYIDRLMGHGKDDKSRSPD >CP029562.1|QAZ45011.1|4419472_4420339_+|short-chain-dehydrogenase MSLKGKTLFISGGSRGIGLAIALRAARDGANVTIAAKTDTPHPKLPGTIHTAVEEIEKAGGRGLPVLCDIREETQVAEAVARTVERFGGIDICVNNASAIQLTGTLETDMKRYDLMHQINTRGTFLVSKMCIPHLKLASNPHILNLAPPLDMKAKWFKGHVAYTMAKFGMSMCTLGMSAEFAPAGIAVNSLWPLTAIDTAAVRNLLGGETVAAMSRSPEIMADAAHAILTRSSREATGNFYIDEEVLRAEGVSDFSKYAPGAKGPLAGDFFVPDEVFARSETKVTGLF >CP029562.1|QAZ45010.1|4418690_4419275_+|helix-turn-helix-transcriptional-regulator MADRRLTVLIALGDAGRAERLSASLAAGEEFAPVAAGSGSGVDVAIVDNAGLQLDAPLVLLSSRPVAAVNGNAGRVFAVLPPNAEHGLIAAAARLAAAGYHAVRDEEALDAGFPAETGDHHGEHEILARPSLSPREAEVLALLAEGAPNKVIARRLDISVHTAKFHVAAILTKLGAANRTDAIAIAMRQGLVLV >CP029562.1|QAZ45009.1|4417812_4418682_+|serine-protease MSDFNLGAFSDAVADIAEGAAAAVASLNAHHGRTTSAFHWGAGYFVAAEEVLDADDEIELTLGSGERAKATLAGRDPSTGVVLLKTERTGLPVLTQAAALRPGNVAIALGNSDGAALAVLGTVGEVGPAWRSMRGGTIDRRINLAVGAGSRFEGGPVLDARGGLIGMLLFGPRRRALVIPFETIERTVAVLKEKGHVVRGYLGAALHPARDGDTIGAMVMGLDGSGPAKAAGLHVGDIVVAWNGEPVGGPRDLMRRLGPDSAGREVKLGILRGGSRNEVAVTIGEKPLN >CP029562.1|QAZ45008.1|4416690_4417644_-|serine-protease MALPALDPAPRDDTLLDAYSTTVADAVDRIGPAVCRIERIGGRAGHGSGFVISPDGLIVTNFHVVGDARAVRVAMPDGTVDEGRVLGADPDTDIALVRAGGSFANVAPLGDSKRLRRGQIAIAIGNPLGFEWTVTTGVVSALGRSMRATTGRLIDDVIQTDAALNPGNSGGPLVSSAGEVIGVNTAMIPGAQGIAFAVASNTANFVIAEIIRFGRVRRAFIGISADTTNLPRRAALLSQVTSSTAVRLRSIEPGGPAAKAGLKEGDIIAAIDGRPVTGVDDLVRMLDAERIGRETVCTVLRRTGISRIAVTPVPRTG >CP029562.1|QAZ45007.1|4415674_4416688_+|zinc-binding-alcohol-dehydrogenase-family-protein 
MRAIGYLEPQPISAPSSLVDIELPRPKALGRDILVEVKAVSVNPVDTKVRANAKPVDGQHRVLGWDAAGVVVETGPEASLFKPGDEVFYAGAINRPGSDAEFHLVDERIAALKPRSLDFAAAAALPLTSITAYEALFDRLDVTRPVAGAANAIVIVGAAGGVGSIAIQLLRALTDLTVIATASRPETVEWVKARGAHHVIDHSKAMADQVAALGVGAPAFVFSLTHSDAHAGEIAKLIAPQGRVALIDDPKSFDIMLFKSKALSLHWEMMFARPVYETADVIRQHELLTRVAGLVDAGKVKSTITDNYGNISAANLMRAHAQIESGTTLGKIVLAGW >CP029562.1|QAZ45006.1|4415204_4415567_-|transcriptional-regulator MGRIRHQDLRNSPACPVEGTLELIGGKWKGVILYHLIQKGTLRFNELRRKMNVTQRMLTNQLRELETSGLIDRKVYAEVPPKVEYSLTERGRSLEPIILALKRWGDDNILATDYVAPGNC >CP029562.1|QAZ47387.1|4414281_4415199_+|arginase MNLIRAPFNLGLRPLRPRHEPGTWRAPQALTEAGLIQALAPAQVIDLERPAYSTEPQPGTKLRNGPAIRSFNLRLADIVADTIGRGAFVLIIGGDCTILLGALAGARRSGPLSLVHVDGHSDFRHPGNDDDNLPLGSAAGMDLAAATGRGETLLTDWPGIAGPLLHDDQVMQIGERENRDTDYAWRDIEATDITQVDVFAAQELGAAGVLAKAEPVLASHGRHWVHFDVDALDQAVMPAVDSPGSPGIDPDQLIAILAALVARPGCVGMNVTIFDPDLDPNGELAVWLVNFLRKVFDGFNPPAGT >CP029562.1|QAZ45005.1|4413734_4414160_+|glyoxalase MKTTSYYPVLMTGDVAGTTAFYVEHFRFKPLFESDWYVHLQSSEDKRVNLGIVQGDHETIPVEGRGRASGLLINFEVKDVDAVHERIVAAGLPILRSLRDEAFGQRHFITRDPNGVLIDVIKPIPPSEEFLAQFAPEAVGK >CP029562.1|QAZ45004.1|4413063_4413666_-|TetR-family-transcriptional-regulator MQEANSRRSNRDRTEATRGELIAAARRLFTEKSYAETGTPEIVAAAGVTRGALYHHFADKQALFRAVVEAEAAAVAKTIEQATPRTLSARDALLAGGEAYVAAMSLPGRTRLLLLDGPAVLGRAEMDDIDGRHGNRTLREGLAATMRAGDLPKLPLEALTSLLGAAFDRAALAVDAGASAQDYRAVLAAVVDGLMRKTSS >CP029562.1|QAZ45013.1|4423818_4424268_+|YcgN-family-cysteine-cluster-protein MESEPFWKAKTLEEMSPAEWESLCDGCGKCCLSKLEDEDTGEIYWTSVACRLFDAGTCRCHDYTNRLAKVPDCVGLTPQNVRTISWLPSTCAYRLVAEGHDLYWWHRLLSGSAETVHEAGISMRGRVRASETDLAEPEDYFDYVLDEEP >CP029562.1|QAZ45014.1|4424463_4424862_+|hypothetical-protein MLTKIKAAALSAFVAFGALAAVPATAQADGIYLNLGSGEPRFGVYAGDRDQRDWRRDRWDRERGWDRRDRGWDRDRPGCSPEWALNKAERMGIWRARIVDVNRRVIKVAGRQDGERTMVVFGRERGCPVLYR >CP029562.1|QAZ45015.1|4424970_4425681_-|hypothetical-protein 
MTRAFLPLALAAAVAMPAIANAAEAPQPPRIIVSGEGEATVAPDLALLSLSVMREAKTAREALNANNDAMAAVIAAMKAAGIAERDLQTAGIQINPRYNYTNKPDGNQEAELIAYQVTNTLSVRVRDISKTGEILDKAVSLGVNQGGGISFTNENPATVVTEARKKAVADAIAKAKTLAEAAGVSVGKVLEITDQSYAPPPMPMNAKAYDAAGASVPVQAGENAYKVMVNVTFELK >CP029562.1|QAZ45016.1|4425910_4426105_-|hypothetical-protein MNGASAHELMAQFGWLKVEQAETCTRKADRKRLGVKSFGRVADQMENTISRTEKQVRGKSKKMQ >CP029562.1|QAZ45017.1|4426336_4426936_-|NAD(P)H:quinone-oxidoreductase MARILVLYYSSYGHTETMAHAISAGACQAGGQVVVKRVPDLVPPAIAAKAGYIVNQPVPTASVAELPSYDAIIFGTPTRYGMMAAQMKNFLDQTSQLWAQDRLVGKLGAVFTSTGLQHGGHEATILSFHTVLLHLGMIVVGLPYSYGGLTHMDALVGCSPYGASTLSGKRGERTPTEIDLEGAKFQGRHLASLAAKLAG >CP029562.1|QAZ45018.1|4427174_4427546_+|cupin MDNPAQNAGSAPVRVAKFNISDAPLSKLYPEKTIELGDVIDRRSGGTISIGYARYKSGEANDWKVTYDEALVITKGNFTVVYDGEDYTAALGEVIYLKKDTTIFYRANTDVELVYVTYPHWRP >CP029562.1|QAZ45019.1|4427903_4428293_-|hypothetical-protein MATMNPETLTLLTFLGRAAVSFYFLWSASFNTIRFARNVEDLARAGIRRGGAVLVGAGTIAMTVASLLFLNPATVVQGGVGVILFTLGSDLLFHRWWIYRDPNLRIVHQQFACEHLALSGGIVGLMVAG >CP029562.1|QAZ45020.1|4428336_4428831_-|hypothetical-protein MHIALPMTDEQRKLVALEYFNAIDNGSTSDGRPFLELFAEDAQVFFPKWGLANGRVEIGRLFGDVGATLTRIRHHTEDFNWIFSGSEIVVVEGRTSGQHLDGSWEMDQPIWGSGRFCDVFQIRSFMIQRCFVYLDPDYAAKDTGRYRWLKKTGRPAARSPYAGL >CP029562.1|QAZ45021.1|4428937_4429351_-|cupin MKRALIQICTALTLSVVATAALAGTAVTIKKFSIADAPLTRLYPEKPIELGDVIDRTSESTMTVGYGRYKSGESNEWTVTYDEALIITRGKFTVVNDGTEYTALPGEVIYLKNGTKVLYRADDDVELVYVTHPHWRQ >CP029562.1|QAZ45022.1|4429515_4430412_+|LysR-family-transcriptional-regulator MRPSLEELEAFLHVAETGSFTKAADRLGISKSVVSRRLSALERRLGAMLVSRTTHAVTLTELGSGFLERARRTLDDIDEAMEVVGAGAVAIRGLVRITAPTNLGTLHVLPALMTLMALHEGLEADLDLNDRYADIVAGGFDLAIRIGDLKDSALVSRRLGTVRRCVVCSPQYLERKGRPMTPDDLEHHECLAYTNMNPSDQWRFMVDGNWKATRTKYRLRTNDGNALLTAAVAGRGLVGLPNFIVQHDVAEGKLVRVLEDFALPEVGVFALFPSSARLPGRARAVVEHLAEHFQRAFI |
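Each neighbor-protein record above begins with a header that appears to follow the pattern `>contig|protein_id|start_end_strand|product`. The sketch below splits such a header into fields, assuming that layout holds for every record; the layout is inferred from the records shown here, not from a documented format, and the names `NeighborProtein` and `parse_neighbor_header` are hypothetical.

```python
from typing import NamedTuple


class NeighborProtein(NamedTuple):
    contig: str
    protein_id: str
    start: int
    end: int
    strand: str
    product: str


def parse_neighbor_header(header: str) -> NeighborProtein:
    """Split a '>contig|protein_id|start_end_strand|product' header (assumed layout)."""
    contig, protein_id, location, product = header.lstrip(">").split("|", 3)
    # The location field looks like '4420361_4420883_-': start, end, strand.
    start, end, strand = location.rsplit("_", 2)
    return NeighborProtein(contig, protein_id, int(start), int(end), strand, product)


record = parse_neighbor_header(
    ">CP029562.1|QAZ45012.1|4420361_4420883_-|hypothetical-protein"
)
# Distance from the end of this protein to the start of array CP029562_6
# (4421204, taken from the array header above).
gap_to_array = 4421204 - record.end
```

Parsed this way, the closest upstream neighbor ends a few hundred bases before the array starts, which is one quick way to check which genes flank an orphan array.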
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP029562_7 | 4809401-4809547 | Orphan |
NA
Consensus repeat of CP029562_7
|
1 spacer
spacers of CP029562_7
>7.1|4809446|57|CP029562|CRISPRCasFinder CGAAAAAACAAGGAGTCAGAGCGTTTCCGCGTTTCCGTAAAAACGGAAACGCTCTGA |
CRISPR arrays and Neighbor proteins around CP029562_7
The CRISPR arrays of CP029562_7
>merge|CP029562|7|4809401-4809547|CRISPRCasFinder
GAGCAATTCCAGGAAAAGTGTGGAACGGTTTTCCGCCCGGAATTGCGAAAAAACAAGGAGTCAGAGCGTTTCCGCGTTTCCGTAAAAACGGAAACGCTCTGAGAGCAATTCCAGGAAAAGTGTGAAACGGTTTTCCGCCCGGAATTG
>CP029562|7|7|4809401-4809547|CRISPRCasFinder
GAGCAATTCCAGGAAAAGTGTGGAACGGTTTTCCGCCCGGAATTG CGAAAAAACAAGGAGTCAGAGCGTTTCCGCGTTTCCGTAAAAACGGAAACGCTCTGA GAGCAATTCCAGGAAAAGTGTGAAACGGTTTTCCGCCCGGAATTG
>CP029562.1|QAZ45298.1|4807498_4809325_-|X-Pro-aminopeptidase MFQTFDPAGDPTVGKPHVARLRGWLSENGLDGFLVPRADEHQGEYVPDRAARLRWLTGFSGSAGVAIVLRDRAFIFVDGRYTLQVRAEVDLDIFAIESLIDNPPASWIPENLGKGARLGFDPWLHTVGEVKALKAAAAKSGAELIPAAENPIDIIWDDQPDAPIAPVELHPQAFAGELAKDKLARLASAIAKEGATHAVLTDPSSIAWAFNIRGGDVPHTPLALGFAILAADGKHQLFMDSRKFSRTVAAYLTQLCDPHEPAEFEGAIVDLAKNGAKIALDPVLAADRLRTLVEDNGGTVVSAADPARIPRAVKNQAEIAGSRAAHRRDGAAVAKLLCWLDRQEPGTLDEITVVKQLEQARRQTGEETQMPLRDVSFATISGAGPNGAIMHYRVSTETNRKLAQDELFLLDSGAQYQDGTTDITRTVPVGKPTEEMRERFTLVLKGMIGISMLRFPAGTRGSEIDAVARMALWRHGCDFAHGTGHGVGSYLAVHEGPQRIARTGTEKLLAGMMLSNEPGYYKEGHYGIRIENLIIVTPAEEIEGGDIAMHGFETLTLAPIDRRLIRTDLLTKAELEWMDAYHARVLAEIGPMVDGETLAWLEKATAPL >CP029562.1|QAZ45297.1|4807176_4807473_+|branched-chain-amino-acid-transport MSATFWIIIAAAIATYLTRVGGHLVISRFERVHPRVEAGLNAVPAAVLTTLVAPAVLSAGPAEWAALIVAGAVSLRGGLLSMFLAGAAVLILARQFMA >CP029562.1|QAZ45296.1|4806427_4807180_+|branched-chain-amino-acid-ABC-transporter-permease MSSEAIAERSRAGEFWDGVRLSTPVVVASAPFAMLFGALAVDNGFSVFEATLMSAAVYGGASQMVGIELFGQHVAPWLIVLSIFAVNFRHVLYSAGIGRRIAHWPIVQQVLGYFILTDPQYAISERKAESGQAVSFIWYLGLGLPVYVFWVAETALGAYFGRLISNTHSLGIDFLLPIYFFGLVMGFRKRPLWLPVVAVSAVASIIAYKTVGSPWHVSIGAVAGVLLAVILPPHHSGVGTKPGAEGEEQP >CP029562.1|QAZ45295.1|4805554_4806256_+|DUF2461-domain-containing-protein MAGTFSGFGQKAIPFLKALDFHQNREWFQENRDLFESELREPLGNLVETLSERFGAVGLGLRGDRKKSLFRINRDVRFSKDKRPYTTHVSAILSPDGTKMEQGVFFFHLGLDACFAAVAWWQPETAMLQAMRRAIETRPEQFRKLVAVLKKGKLELGTQGAMKRMPRGFEHVADADLAAAIRNRHFVVRQDVEPTSIHTPELADILLDFVIRARPLLDWGRAIQGTAVLPPQN >CP029562.1|QAZ45294.1|4803341_4805555_+|DNA-ligase-(NAD(+))-LigA 
MSEQNKPIEALSAEEAAAELERLAAEIREHDRRYHGEDAPIITDAEYDALRRRNAAIEEAFPDLVRSDSPTGSVGSAPAEGFAKVRHAVPMLSLAKAYTDQDVVDFLERAKRFFERDKDFDIAFTAEPKIDGLSASLRYENGVFVQGATRGDGAVGEDITANLKTIADIPHRLKGSGWPEVIEIRGEVYMTYAEFQALKERSAAAGGQDYVNPRNTAAGSLRQKDPSVTASRNLKFFAYAWGYTSEDPAPTQYEAVQKFAGWGFKVSPLMVRARSVDDLIAQYRLIEEQRSSLGYDIDGVVYKVDQLELQRRWGFVTGEPRWAVAHKFPAEQATTIVRKIDIQVGRTGTLAPVARLDPVTVGGVTVVNVTLHNEDYIKGFDSNGEPIREGNDIRIGDTVVIQRAGDVIPQIVSVVVDKRPAAAVPYEFPHACPVCGSAATREINEKTGKEDSRRRCTGELICPAQAVERLRHFVSRGAMDIEGLGAENIDLFFNAGLLETAADIFRLKDRRAEVQQALFERREAQAQAREAAKGATRKKVLTAEERTYEGLEKLFDAIDARREPELDRFIFALGIRHIGETTAAALSKTFSTAEEFIRVGKEASQADDPKTVFPSINGIGDTVIGALCDFFGNERNDDVLDALLQQVHPKPYIVSVSADSQVAGKTVVFTGSLEKMSRSEAKAMAERLGAKVAGSVSAKTDLVVAGPGAGSKLKLASELGIEVIDEDTWFERVGRSV >CP029562.1|QAZ45293.1|4801572_4803243_+|DNA-repair-protein-RecN MLSRLSIRDIVLIERLDIDFAPGLSVLTGETGAGKSILLDALSLALGARGDASLVRHGAAQGQVTAVFDVPRNHPAREILAENAIDDDGDIILRRLQTADGRTRVFVNDQPSSVTLMRDIGRALVEIHGQHDDRALVDPGAHRDLLDAFGGHLGVVHRTGESWRYWRQAEQDLARHRAKVEAAAREADYLRASVAELAKLDPQPGEETELAELRSVMMRAEKIAAEIHDAQDVLSGPASPLPQLASLLRRLQRKAGEAPGLLEDVVKSLDEAMISLDAAQSGVEAALRATEYDPQRLEKAEERLFALRAAARKHNVVVDNLAQLRDTMSADLADLDAGEEKLAALEKQAAVAREAYDQSASELSGLRQAVASGLAKAVMTELPALKLERAEFIVEISSEPANRLEEGIDQIEFWVRTNPGTRPGPMMKVASGGELSRFLLALKVALADRGSAPTLVFDEIDTGVGGAVADAIGQRLARLSTRVQVLSVTHAPQVAARAATHFLISKSGTDRVATGIAEMDRTARQEEIARMLAGASITDEARAAAERLLRENKVAS >CP029562.1|QAZ45292.1|4800639_4801512_+|outer-membrane-protein-assembly-factor-BamD MFFRQADQLKTPLSAAFLALSVLAAPLALSGCMSSEKDIDLSTYVEQTEPADVLYNQGLANMNAGRLDEAAKKFAAVDRQHPYSEWARKSMVMGAFTNYRQGKYDDAISAAKRYLTLYPSTDEAAYAQYIIGLSYYKQIEDVTRDQKEAKLTIQTMQELVTRWPDSEYVEDAKTKIRYANDQLAGKEMQVGRYYLERKEYLASIKRFRNVVENYSNTRHIEEALARLTEAYYALGLVDEAQTAAAVLGQNYPDSQWYKDSYKLLQSGGLQPRENAGSWISKAGKLITGGA >CP029562.1|QAZ45291.1|4799495_4800440_+|UDP-3-O-acyl-N-acetylglucosamine-deacetylase 
MGISLQDYQTTLKSRATLTGIGVHSGNTVTIHFLPADADTGIVFHLIDGEQPVREFRALVSEVGATDLCTMLGDPVGQHIATVEHLMAAVLGLGIDNMVIEIDGREVPILDGSSIEFVEAIDQAGIEILSVKRRFIRILKTVRIESGASWAEFRPYSGTRFEVEIDFESPAIGRQSFATDLTADAFRNEIARARTFGFMKDVERLWAAGYALGSSLDNSLVIGDDHRVINVGGLRYPNEFARHKTLDAMGDLALAGARFIGCFRSYRGGHRLNAAALRRLLSDRTAFEVVETTRRGRSGEMSAISAPVYAPWTI >CP029562.1|QAZ45290.1|4797540_4799226_+|cell-division-protein-FtsZ MTINLKKPDITELKPRITVFGVGGGGGNAVNNMITAGLRGVEFVVANTDAQALTMSKSERLIQLGAHVTEGLGAGSQPEVGRAAAEECIDEITDYLSNTHMCFVTAGMGGGTGTGAAPVVARAAREKGILTVGVVTKPFHFEGQRRMKTADLGIEELQKCVDTLIVIPNQNLFRLANDKTTFADAFAMADQVLYSGVACITDLMVKEGLINLDFADVRSVMREMGKAMMGTGEASGEGRAMAAAEAAIANPLLDETSMKGAKGLLISITGGRDLTLFEVDEAATRIREEVDQDANIILGATFDEDLEGVIRVSVVATGIDKSAEQIAAAPISIRAAQPKAPARPAAIPAAEVRTAPMPQAALHETRAADPVAEAIQQAEANAAAFAQQPRVAQAAPQDDFRPQSKLFQAPPAQPQPHPVMQQVAPQPAPQPVREMAPQQPVAAQPRMPRVEDFPPVVRAEVEAKQHPVDHEERGPMGLLKRLTNGLTRREEEPARLQPAQPREPKLRQAAPEVRRLAPQDAQLYAPRRGQADDHGRFAPQQRATHEDDQLEIPAFLRRQAN >CP029562.1|QAZ45289.1|4796160_4797468_+|cell-division-protein-FtsA MSWLGSNNDGASRRSGVLTVLDVGSSKVCCVVAKLKPCEEGHLLRGRSHRIQVIGIGHQKSHGVKSGVVIDLDRAEHAIRLAVDAAERMAGLTVDSLIVNMTAGRLKSETFNATINLGGHEVDENDIKRVLGAGAKQALKSEREVLHALPVGFSLDGERGVRDPRGMVGDTLGVDMHVMTGDSAPLRNLELAINRSHLSVERLIATPYAAGLAALVDDELEMGAACIDMGGGTTTISVFSEGKFVYGDSIPVGGNHVTLDLAKGLSTSLDAAERLKVMHGSALPGSSDDRDLVTIQPIGEEGEAPLQIPRSVMTRIIRARIEETLELLRERLSKSGYGNVVGKRVVLTGGASQLAGMPEAARRVLGRNVRVGRPLGVAGLPEAAKGPAFSTAVGLMIYPQVASFESHPAFGASRFKMTGTGGRLHRMSQWLRDSF >CP029562.1|QAZ45300.1|4809610_4810483_-|50S-ribosomal-protein-L11-methyltransferase MGQTRLYLIASKAEADRIFGVLETVFEDDGLPLAVLELDEERDLHEVSLYADGDIEPVEARLKAELTALGLARTVEREVLPDVDWVARSLEGLKPVRAGRFLVHGAHDRQKRHAGDIAIEIEAGLAFGTGHHGTTAGCLAMLERVAIREKPRNALDLGTGSAVLAIALARLAHIPVLATDIDPVAVDVAAANARLNHVSSLVETVTAPGFHHPIFTARAPFDLIVANILARPLMKLAPEMAKYLTPGGSIVLSGILDRQRDAVVAAYTGQQFRHVRTTHREGWVTIHMKR >CP029562.1|QAZ45301.1|4810521_4811100_-|cytochrome-c-oxidase-assembly-protein 
MMRFILVGILVLMVAMIGWMSFEWYHARFSGQPYGAPFALTDQAGKPVTEAAFRGQPTVLFFGFTHCPEVCPTTLAQMAGWLDKLGNEGKDIKVYFVTVDPERDTPDIMKSYVSNFSDRITGITGEPDKVRAMAKSFGIYFKRVDTGGGDYNMDHTASVLMLNRKGDWFGTIAYEENPDTALAKLKRLAKDG >CP029562.1|QAZ45302.1|4811310_4811814_+|CREA-protein MFSQTTRKLLGGALVACFVGVTGVALAQQVGKVGVDWLGNDIVVEAIKDPKVEGVTCHVSYFDRGVIDRLQKGNWFEDPSDSAISCQQTGPISIGDIDFSKGGEEVFKQGISLIWKKQVVNRIYDKANDTLVYLSHSRQVQDGSAKMAISTVPLYSQNVTWKNGKPQ >CP029562.1|QAZ45303.1|4811855_4812494_+|antibiotic-acetyltransferase MDRTEDLVPKDPEPRIHPTAELKSCRLGRYTAIGERVVLREVTVGDFSYFERHAEAIYATIGKFCSIAANTRINALEHPMERLTTHKVSYRPNEYFRWLGVDGDFRERRRSKPVSVGHDVWIGHGAVILPGVTVGNGAVIGANAVVTRDVPAYTIVAGIPAKPLRRRFSKDIAARIEALTWWDWPPEKLAKAIPDMQALEIENFLDHWERQA >CP029562.1|QAZ45304.1|4812760_4813069_+|hypothetical-protein MKKFALVLTVAVVSFSTVSSYAGGFGSRGNNNNSGGGLINISPGIALGDIGILNGSPILSGNNTQVGGILNGVGVGLLGVGTGVSKVTNSILGGGNKYKLGR >CP029562.1|QAZ45305.1|4813162_4814344_-|hypothetical-protein MARKRGGFVFLTAGAIFGLASATARAGETIEPTVTTSIERHYTTNAMESDRPIADWYSLLRSTLFRKWGDNDAYISLNAEGRATRHDSVAIEDDTAGGVQVQAFRRFAGGLELRGTLAYNATRDGDHIDIGPFTIGTRTLKQVASGQVQVGLDLGNQMALIVDAMTSLEKVGLTRFEDDIFLPARLDPDKRNVQIALKLIRTVGQFTFGVQGSALRSAVERLGDPLVGVSFNQFGLRGEITYTGVGGATAALALGAELLDGADDTYSRIRPAWQLSVVKPLPKGFELRGTWYGRYEIADSDDPLASWVNRGEFEISYKLRENLKLASGVACEFKRNLLWENVERKRVFYAEATYDASAKASVVLRVDVNRIHKTVIDIDENTVDTFIGLRAKI >CP029562.1|QAZ45306.1|4814727_4815009_+|hypothetical-protein MTDWLEILKQETATGERMGREVPDMLAHPDISAEQVLALFSALDKQADFAEKLKQALEKFGHDFPVITAAERLEERYADLAATVADKLKEMRK >CP029562.1|QAZ45307.1|4816787_4817426_+|TetR/AcrR-family-transcriptional-regulator MPRPTKTNPDRGDARNRLLDAARDVIRARGFTATSVEDLCQAAEVTKGAYFHHFKSKDALGIAAADDWSAGTTAFFAAAPYHAPDDPLERVLAYVAFRKSIIEGEIAQFSCLVGTMVQEVYATSPAIREACGRSILGHAATLEADIEAARTKHRVSGSWTAESLARHTQTVIQGAFVMAKAGNDPALARESLDHLDQYIRLLFGRPEEGQTS >CP029562.1|QAZ45308.1|4817422_4817953_+|hypothetical-protein 
MSTMIGCTCGKVQLEVGSTPMMSVECCCDSCREAGARLKKLPGAKQVVDRNGATPFVFHRKDHVRILRGADHLKEFRLTPQAGTRRVVASCCNTPVFLEFKGGHWLSLYAGLWPEGTLPPLELRTMTADLPDASVLPNDVPNLKQQSLAFYWRLFRAWAAMGFRSPRIAVNGEINA >CP029562.1|QAZ45309.1|4817945_4818260_+|hypothetical-protein MPDSADRIEIENVVSPHHRQKVDRTKYMAMRKALLAVLPNDPPGLTVAEAKDALLPDLPDDLWPGGAKAGWWLKAVQLDLEAKGVIGRAAMKPVRLFRRVEFNG |
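Spacer headers such as `>7.1|4809446|57|CP029562|CRISPRCasFinder` appear to encode the spacer ID, start coordinate, spacer length, contig, and detection tool. The sketch below parses one and derives the inclusive end coordinate, assuming that field order; the order is inferred from the rows on this page (for example, spacer `6.1|4421229|36` is listed with region 4421229-4421264), and the names `SpacerHeader` and `parse_spacer_header` are hypothetical.

```python
from typing import NamedTuple


class SpacerHeader(NamedTuple):
    spacer_id: str
    start: int
    length: int
    contig: str
    tool: str

    @property
    def end(self) -> int:
        # Inclusive end coordinate, matching the "start-end" ranges in the tables.
        return self.start + self.length - 1


def parse_spacer_header(header: str) -> SpacerHeader:
    """Split a '>id|start|length|contig|tool' spacer header (assumed layout)."""
    spacer_id, start, length, contig, tool = header.lstrip(">").split("|")
    return SpacerHeader(spacer_id, int(start), int(length), contig, tool)


spacer = parse_spacer_header(">7.1|4809446|57|CP029562|CRISPRCasFinder")
# The array CP029562_7 starts at 4809401, so the offset to the spacer start
# gives the length of the first repeat.
first_repeat_length = spacer.start - 4809401
```

The derived repeat length can be compared against the first repeat shown in the array record as a consistency check on the coordinates.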
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP032685 | Rhizobium sp. CCGE531 plasmid pRCCGE531d, complete sequence | 1099488-1099523 | 6 | 0.833 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP032690 | Rhizobium sp. CCGE532 plasmid pRCCGE532d, complete sequence | 1100487-1100522 | 6 | 0.833 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NC_020062 | Rhizobium tropici CIAT 899 plasmid pRtrCIAT899c, complete sequence | 453637-453672 | 6 | 0.833 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP050105 | Rhizobium leguminosarum bv. trifolii strain 4B plasmid pRL4b6, complete sequence | 553192-553227 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP020951 | Rhizobium sp. CIAT894 plasmid pRheCIAT894d, complete sequence | 536453-536488 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP025508 | Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvB, complete sequence | 462459-462494 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP050110 | Rhizobium leguminosarum bv. trifolii strain 3B plasmid pRL3b4, complete sequence | 553194-553229 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP022667 | Rhizobium leguminosarum bv. viciae strain BIHB 1217 plasmid pPR2, complete sequence | 175110-175145 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP050087 | Rhizobium leguminosarum bv. trifolii strain 23B plasmid pRL23b5, complete sequence | 461243-461278 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP050081 | Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b5, complete sequence | 298590-298625 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP050106 | Rhizobium leguminosarum bv. trifolii strain 4B plasmid pRL4b5, complete sequence | 183749-183784 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP049731 | Rhizobium leguminosarum strain A1 plasmid pRL12, complete sequence | 295127-295162 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP025506 | Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvE, complete sequence | 175599-175634 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP025506 | Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvE, complete sequence | 280777-280812 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP050111 | Rhizobium leguminosarum bv. trifolii strain 3B plasmid pRL3b1, complete sequence | 183933-183968 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP030763 | Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed3, complete sequence | 646931-646966 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP030763 | Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed3, complete sequence | 737805-737840 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP030764 | Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed4, complete sequence | 402560-402595 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP030764 | Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed4, complete sequence | 225298-225333 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP022668 | Rhizobium leguminosarum bv. viciae strain BIHB 1217 plasmid pPR3, complete sequence | 63705-63740 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP022668 | Rhizobium leguminosarum bv. viciae strain BIHB 1217 plasmid pPR3, complete sequence | 168884-168919 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP053206 | Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1D, complete sequence | 295459-295494 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP050088 | Rhizobium leguminosarum bv. trifolii strain 23B plasmid pRL23b6, complete sequence | 308146-308181 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP050088 | Rhizobium leguminosarum bv. trifolii strain 23B plasmid pRL23b6, complete sequence | 426416-426451 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP032685 | Rhizobium sp. CCGE531 plasmid pRCCGE531d, complete sequence | 103635-103670 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP032685 | Rhizobium sp. CCGE531 plasmid pRCCGE531d, complete sequence | 1454480-1454515 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP032690 | Rhizobium sp. CCGE532 plasmid pRCCGE532d, complete sequence | 103635-103670 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP032690 | Rhizobium sp. CCGE532 plasmid pRCCGE532d, complete sequence | 1454458-1454493 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP023000 | Rhizobium sp. 11515TR strain 10195 plasmid p11515TR-B, complete sequence | 53381-53416 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP054029 | Rhizobium sp. JKLM19E plasmid pPR19E02, complete sequence | 323392-323427 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP021820 | Sinorhizobium meliloti strain M162 plasmid psymB, complete sequence | 1587260-1587295 | 7 | 0.806 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP016289 | Rhizobium leguminosarum strain Vaf10 plasmid unnamed2, complete sequence | 128327-128362 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP050106 | Rhizobium leguminosarum bv. trifolii strain 4B plasmid pRL4b5, complete sequence | 425508-425543 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP025506 | Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvE, complete sequence | 417863-417898 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP050111 | Rhizobium leguminosarum bv. trifolii strain 3B plasmid pRL3b1, complete sequence | 425692-425727 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP030763 | Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed3, complete sequence | 89473-89508 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP022668 | Rhizobium leguminosarum bv. viciae strain BIHB 1217 plasmid pPR3, complete sequence | 305970-306005 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP032685 | Rhizobium sp. CCGE531 plasmid pRCCGE531d, complete sequence | 654304-654339 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP032690 | Rhizobium sp. CCGE532 plasmid pRCCGE532d, complete sequence | 653853-653888 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP024314 | Rhizobium sp. NXC24 plasmid pRspNXC24c, complete sequence | 1025530-1025565 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP023000 | Rhizobium sp. 11515TR strain 10195 plasmid p11515TR-B, complete sequence | 1457003-1457038 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP025013 | Rhizobium leguminosarum strain Norway plasmid pRLN1, complete sequence | 211090-211125 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP025017 | Rhizobium leguminosarum strain Norway plasmid pRLN5, complete sequence | 15307-15342 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NC_007765 | Rhizobium etli CFN 42 plasmid p42e, complete sequence | 152480-152515 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP021028 | Rhizobium sp. TAL182 plasmid pRetTAL182d, complete sequence | 167268-167303 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP020909 | Rhizobium etli strain NXC12 plasmid pRetNXC12c, complete sequence | 152285-152320 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NC_021908 | Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1d, complete sequence | 152276-152311 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP013516 | Rhizobium sp. N1314 plasmid pRspN1314e, complete sequence | 156470-156505 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NC_012848 | Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132501, complete sequence | 613268-613303 | 8 | 0.778 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP050105 | Rhizobium leguminosarum bv. trifolii strain 4B plasmid pRL4b6, complete sequence | 344230-344265 | 9 | 0.75 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP025508 | Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvB, complete sequence | 249321-249356 | 9 | 0.75 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP050110 | Rhizobium leguminosarum bv. trifolii strain 3B plasmid pRL3b4, complete sequence | 344232-344267 | 9 | 0.75 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP022667 | Rhizobium leguminosarum bv. viciae strain BIHB 1217 plasmid pPR2, complete sequence | 388322-388357 | 9 | 0.75 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP050087 | Rhizobium leguminosarum bv. trifolii strain 23B plasmid pRL23b5, complete sequence | 249332-249367 | 9 | 0.75 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP032685 | Rhizobium sp. CCGE531 plasmid pRCCGE531d, complete sequence | 1451867-1451902 | 9 | 0.75 |
CP029562_6 | 6.1|4421229|36|CP029562|CRISPRCasFinder | 4421229-4421264 | 36 | NZ_CP032690 | Rhizobium sp. CCGE532 plasmid pRCCGE532d, complete sequence | 1451845-1451880 | 9 | 0.75 |
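The Identity column in the table above is consistent with the fraction of matching spacer positions, (spacer length - mismatches) / spacer length, rounded to three decimals. This relationship is inferred from the listed values rather than stated by the source, and the function name `identity` is hypothetical. A minimal sketch of the check:

```python
def identity(spacer_length: int, mismatches: int) -> float:
    """Fraction of spacer positions that match, rounded to three decimals.

    Assumes identity = (length - mismatches) / length, as inferred from the table.
    """
    return round((spacer_length - mismatches) / spacer_length, 3)


# Mismatch counts that occur for the 36-nt spacer 6.1 in the table.
for mismatches in (6, 7, 8, 9):
    print(mismatches, identity(36, mismatches))
```

For the 36-nt spacer this reproduces the four identity values that appear in the table (0.833, 0.806, 0.778, and 0.75).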
1. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP032685 (Rhizobium sp. CCGE531 plasmid pRCCGE531d, complete sequence) position: , mismatch: 6, identity: 0.833
agcggtttttcgtccggaattgcgtgaggcaacagg-- CRISPR spacer agcggttttccgtccggaattgc--gagacaacaaaga Protospacer *********.************* ***.*****..
2. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP032690 (Rhizobium sp. CCGE532 plasmid pRCCGE532d, complete sequence) position: , mismatch: 6, identity: 0.833
agcggtttttcgtccggaattgcgtgaggcaacagg-- CRISPR spacer agcggttttccgtccggaattgc--gagacaacaaaga Protospacer *********.************* ***.*****..
3. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NC_020062 (Rhizobium tropici CIAT 899 plasmid pRtrCIAT899c, complete sequence) position: , mismatch: 6, identity: 0.833
agcggtttttcgtccggaattgcgtgaggcaacagg-- CRISPR spacer agcggttttacgtccggaattgc--gagacaacaaaga Protospacer ********* ************* ***.*****..
4. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP050105 (Rhizobium leguminosarum bv. trifolii strain 4B plasmid pRL4b6, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
5. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP020951 (Rhizobium sp. CIAT894 plasmid pRheCIAT894d, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaaaaacga Protospacer ********* *****************.. ** *.
6. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP025508 (Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvB, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
7. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP050110 (Rhizobium leguminosarum bv. trifolii strain 3B plasmid pRL3b4, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
8. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP022667 (Rhizobium leguminosarum bv. viciae strain BIHB 1217 plasmid pPR2, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
9. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP050087 (Rhizobium leguminosarum bv. trifolii strain 23B plasmid pRL23b5, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
10. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP050081 (Rhizobium leguminosarum bv. trifolii strain 31B plasmid pRL31b5, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
11. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP050106 (Rhizobium leguminosarum bv. trifolii strain 4B plasmid pRL4b5, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
12. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP049731 (Rhizobium leguminosarum strain A1 plasmid pRL12, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
13. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP025506 (Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvE, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
14. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP025506 (Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvE, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgagaaaacaaagg Protospacer ********* ************** **.. * ***
15. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP050111 (Rhizobium leguminosarum bv. trifolii strain 3B plasmid pRL3b1, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
16. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP030763 (Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed3, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
17. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP030763 (Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed3, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgagaaaacaaagg Protospacer ********* ************** **.. * ***
18. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP030764 (Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed4, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
19. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP030764 (Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed4, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtggagacaaaga Protospacer ********* ****************..* * **.
20. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP022668 (Rhizobium leguminosarum bv. viciae strain BIHB 1217 plasmid pPR3, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
21. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP022668 (Rhizobium leguminosarum bv. viciae strain BIHB 1217 plasmid pPR3, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgagaaaacaaagg Protospacer ********* ************** **.. * ***
22. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP053206 (Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1D, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
23. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP050088 (Rhizobium leguminosarum bv. trifolii strain 23B plasmid pRL23b6, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtgaaaacaaaga Protospacer ********* *****************.. * **.
24. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP050088 (Rhizobium leguminosarum bv. trifolii strain 23B plasmid pRL23b6, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgagaaaacaaagg Protospacer ********* ************** **.. * ***
25. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP032685 (Rhizobium sp. CCGE531 plasmid pRCCGE531d, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttccgtccggaattgcgagagtgcaaaga Protospacer *********.************** *** * **.
26. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP032685 (Rhizobium sp. CCGE531 plasmid pRCCGE531d, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttccgtccggaattgcgagaagacaaaga Protospacer *********.************** **.* * **.
27. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP032690 (Rhizobium sp. CCGE532 plasmid pRCCGE532d, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttccgtccggaattgcgagagtgcaaaga Protospacer *********.************** *** * **.
28. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP032690 (Rhizobium sp. CCGE532 plasmid pRCCGE532d, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttccgtccggaattgcgagaagacaaaga Protospacer *********.************** **.* * **.
29. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP023000 (Rhizobium sp. 11515TR strain 10195 plasmid p11515TR-B, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttccgtccggaattgcgagagaataaaag Protospacer *********.************** ***. * *.*
30. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP054029 (Rhizobium sp. JKLM19E plasmid pPR19E02, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtg-aggcaacagg CRISPR spacer agcggttttgcgtccgcaattgcgtgaaaacaacga- Protospacer ********* ****** ********* *..****..
31. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP021820 (Sinorhizobium meliloti strain M162 plasmid psymB, complete sequence) position: , mismatch: 7, identity: 0.806
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttccgtccggaattgcgggaaaacaaagg Protospacer *********.************** **.. * ***
32. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP016289 (Rhizobium leguminosarum strain Vaf10 plasmid unnamed2, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcatgaaaacaaaga Protospacer ********* *************.***.. * **.
33. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP050106 (Rhizobium leguminosarum bv. trifolii strain 4B plasmid pRL4b5, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcatgaaaacaaaga Protospacer ********* *************.***.. * **.
34. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP025506 (Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvE, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcatgaaaacaaaga Protospacer ********* *************.***.. * **.
35. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP050111 (Rhizobium leguminosarum bv. trifolii strain 3B plasmid pRL3b1, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcatgaaaacaaaga Protospacer ********* *************.***.. * **.
36. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP030763 (Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed3, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcatgaaaacaaaga Protospacer ********* *************.***.. * **.
37. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP022668 (Rhizobium leguminosarum bv. viciae strain BIHB 1217 plasmid pPR3, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcatgaaaacaaaga Protospacer ********* *************.***.. * **.
38. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP032685 (Rhizobium sp. CCGE531 plasmid pRCCGE531d, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttccgtccggaattgcgagaaaacaaaga Protospacer *********.************** **.. * **.
39. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP032690 (Rhizobium sp. CCGE532 plasmid pRCCGE532d, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttccgtccggaattgcgagaaaacaaaga Protospacer *********.************** **.. * **.
40. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP024314 (Rhizobium sp. NXC24 plasmid pRspNXC24c, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer ggcggttttccgtccggaattgcgtgaaaacaaaga Protospacer .********.*****************.. * **.
41. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP023000 (Rhizobium sp. 11515TR strain 10195 plasmid p11515TR-B, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggtttttcgtctggaattacgtgaaaacaaaga Protospacer **************.******.*****.. * **.
42. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP025013 (Rhizobium leguminosarum strain Norway plasmid pRLN1, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccgaaattgcgtgaaaacaaaga Protospacer ********* ******.**********.. * **.
43. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP025017 (Rhizobium leguminosarum strain Norway plasmid pRLN5, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcatgaaaacaaaga Protospacer ********* *************.***.. * **.
44. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NC_007765 (Rhizobium etli CFN 42 plasmid p42e, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgttcggaattgcgtgaaaacataga Protospacer ********* ***.*************.. *.**.
45. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP021028 (Rhizobium sp. TAL182 plasmid pRetTAL182d, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgttcggaattgcgtgaaaacataga Protospacer ********* ***.*************.. *.**.
46. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP020909 (Rhizobium etli strain NXC12 plasmid pRetNXC12c, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgttcggaattgcgtgaaaatataga Protospacer ********* ***.*************.. *.**.
47. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NC_021908 (Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1d, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgttcggaattgcgtgaaaacataga Protospacer ********* ***.*************.. *.**.
48. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP013516 (Rhizobium sp. N1314 plasmid pRspN1314e, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgttcggaattgcgtgaaaacataga Protospacer ********* ***.*************.. *.**.
49. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NC_012848 (Rhizobium leguminosarum bv. trifolii WSM1325 plasmid pR132501, complete sequence) position: , mismatch: 8, identity: 0.778
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggtattgcgtgaaaacaaaga Protospacer ********* ******* *********.. * **.
50. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP050105 (Rhizobium leguminosarum bv. trifolii strain 4B plasmid pRL4b6, complete sequence) position: , mismatch: 9, identity: 0.75
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtggaaacaagag Protospacer ********* ****************... * ..*
51. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP025508 (Rhizobium leguminosarum bv. viciae strain UPM791 plasmid pRlvB, complete sequence) position: , mismatch: 9, identity: 0.75
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtggaaacaagag Protospacer ********* ****************... * ..*
52. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP050110 (Rhizobium leguminosarum bv. trifolii strain 3B plasmid pRL3b4, complete sequence) position: , mismatch: 9, identity: 0.75
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtggaaacaagag Protospacer ********* ****************... * ..*
53. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP022667 (Rhizobium leguminosarum bv. viciae strain BIHB 1217 plasmid pPR2, complete sequence) position: , mismatch: 9, identity: 0.75
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtggaaacaagag Protospacer ********* ****************... * ..*
54. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP050087 (Rhizobium leguminosarum bv. trifolii strain 23B plasmid pRL23b5, complete sequence) position: , mismatch: 9, identity: 0.75
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttgcgtccggaattgcgtggaaacaagag Protospacer ********* ****************... * ..*
55. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP032685 (Rhizobium sp. CCGE531 plasmid pRCCGE531d, complete sequence) position: , mismatch: 9, identity: 0.75
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttccgtccggaattacgtggaaacaaaga Protospacer *********.***********.****... * **.
56. spacer 6.1|4421229|36|CP029562|CRISPRCasFinder matches to NZ_CP032690 (Rhizobium sp. CCGE532 plasmid pRCCGE532d, complete sequence) position: , mismatch: 9, identity: 0.75
agcggtttttcgtccggaattgcgtgaggcaacagg CRISPR spacer agcggttttccgtccggaattacgtggaaacaaaga Protospacer *********.***********.****... * **.
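The `mismatch` and `identity` values reported for each hit above appear to follow from a position-by-position comparison of the 36-nt spacer with its protospacer, with identity taken as matching positions divided by spacer length (for example, 29/36 ≈ 0.806 for 7 mismatches). The following is a minimal sketch of that calculation, not the database's own code: the function name `spacer_identity` is hypothetical, and it assumes ungapped sequences of equal length, so it does not cover gapped alignments such as hit 30.

```python
from typing import Tuple


def spacer_identity(spacer: str, protospacer: str) -> Tuple[int, float]:
    """Return (mismatches, identity) for two ungapped, equal-length sequences.

    Hypothetical helper: identity is assumed to be the fraction of matching
    positions, rounded to three decimals as in the listing above.
    """
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must be the same length (ungapped)")
    if not spacer:
        raise ValueError("sequences must be non-empty")
    # Count positions where the two sequences differ (case-insensitive).
    mismatches = sum(
        1 for a, b in zip(spacer.lower(), protospacer.lower()) if a != b
    )
    identity = round((len(spacer) - mismatches) / len(spacer), 3)
    return mismatches, identity


# Spacer 6.1 and the protospacer of hit 19 (NZ_CP030764), copied from the listing.
spacer = "agcggtttttcgtccggaattgcgtgaggcaacagg"
hit_19 = "agcggttttgcgtccggaattgcgtggagacaaaga"
print(spacer_identity(spacer, hit_19))  # (7, 0.806)
```

Applied to hits 32 and 50, the same function gives 8 mismatches (0.778) and 9 mismatches (0.75), which agrees with the reported values.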
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
5616153 : 5632245
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP029562|5616153:5632245|DBSCAN-SWA CGTGACCGTCACCAATATTGATATCGAGTTCCGGCACTTCCATCTGTTCTGCGGCCTTGGCGGTGGTGCCAAGGGCTTCAACAAGGGTACGGCGCGTGTCGGCAGCCTGAAGGCGCGGTTTCGCTGCATAGGCGGTATCGATGTCGACCCGGCATCCATTCGCGATTTTTCCCGCGCTGCCAAGGTCGAAGGCACCGTTAGGGACCTCTTTGACGAGGATCAATATCGAGCGTTTCATGGTGAGGCACCGCCTCCTGGATGGCGACCAGCGATGCCGGCTGATATCCGACGCGCGGCGCACGGCGAGCGGCCGCACGTCGTTTTCCTTTCCGCGCCATGTAAAGGTTTCTCGGGACTTCTACCTGAAAAGTCCTCGATGACCGACAAATATCAAGCACTGAACGGCCTGACACTTCGAGGCGTCTGGCTGATGCTTGAAGCTTGGAAGGACCATCCCGAGGGCTTACCCGAACTGATCATCTTCGAGAACGTCCCACGCATCATGACGCGCGGTAGGTTCCTTATCGATCAGATCGTCGCACTGCTTCGCGCTTATGGTTACGCCGTCGCCGAGACGACGCATGATTGCGGCGAAATAGGCGGCCTTGGCCAGAGCCGGAAGCGATACCTGCTTGTTGCCCGCCACGAAGCGAAGGTTCCTCCGTTCCTGTATGAGCCAGAAAAGCGGCGCCTACGCGGCGTCGGCGAAATCCTAGAACGGCTGCCTTTGCCGGGTGATCTCCGTGCCGGCCCAATGCACCGGTTGCCAGCTCTCCAGTGGAAGACATGGGTTCGCCTAGCCTTTGTTGAAGCCGGCTCCGACTGGCGCTCGCTGAACAAACTTGCGATCGAGAATGGTTCCCTACGCGATTTCGGCATCATGCCGGATCACGAATGGCAGGCCGGCGTTTTGGGTGTTCGCCAATGGCTAGATACCAGCGGCGTCGTCGCGGGCCGTTCCAGCCCAACAAACGGCGCTTTCAGTGTTGCAGATCCTCGGCCCGAAAATTTCCCAGCCGATCGCGGCGGACTGCTCGGGGTCGGGAAATGGGCGGAAACGGCATCCTTGATCACGTCCCAACGATCGCCGCAGCAAGGTCGCTACTCTGTTGCCGATCCGCGCTTCGTCGGTCGGGAAAACCAACAGTACGGCGTTCGCCAGTGGGATCGCCCCTCGGGTGCCCTTACCTCACAAAGTACACCAGGCGGAGGCACGTTCTCGGTTGCCGACCCTCGTATGGAAGGCAAACCGCGCTTCAATAACGTTTTCCGGATTGTGCCCTGGGCGGGAACTTCGCCTGCGGTTGCAGGGCCTGGAGGTCCAGCTGGCGGCTTGGCTGTTGCAGATCCCCGCGCCGAGGCTGACGCCACATACCAGCAAACGAAGTATCGTGTAACGGGGTACAAGGAGCCGGCCGGAACCGTCATCGCAGCATCCACCACAGGCAATGGCGCCTTCGCGGTCGCCGATCCAAGACCAGTCGACGATCTCCGTTCGGGTGCCCATGGCGTCCGGAAGTGGGGTGAGACTTCCGGAACAGTTCAGGGCGAAAGCCTGCCGTCGAATGGTGCCTTCGCAGTGGCCGACCCGAGGCCTGGCAAAGCGACGGAGGATTTGCAGGGCAATTGGTATGTAAACCCATTCGACCAAGCTCCTCATGCTGTCGTCGGGAATCGAAAATCAGGTGCCAGCGCCGTCGCCGATCCTCGCCCCGGATACGGAGAAAACACCCACCAGAACATCCTATCAGTCGGCGCATGGGAGAAGCACTCAAAGGCTGCAACAGGCGCAACGCATGTAGCCGGTGGCGCGCTGTCTGTTGCAGACCCGCGCCCTGCGGCTTTCGGTGACAAGCGAGAGAACTACCAGACAGGCGGCCACTATGGCGTGGTGCCATGGAAGGGTACGGCCTATGCCGTTCCAGG
CTTCGCCAAGCATGATCGTGGCAATTGGTCCGTGGCCGATCCCCGAGAAACCGAAGCCGAGCCTCTATTTGAGCTGCCGAAACCCGATGACCGCCTCGTCGCAGTTATCCGTGCGCTCGACGGTACTTGGCACCGGCCGTTCACCACGCTCGAGCTCGCAGCGCTGCAAAGCCTGGTCGATCCAGATGAAGTTGCCGAAGGATATTCGCTCGACGGCCAGTCGGATTCGGCATGGCGTGAGCGTATTGGCAACGCGGTTCCGCCGGACGCTGCCGAAGCCATTGCTGGCGTCATGGGGACCACCCTGCTCCTGGCAATGACCGGTGAGACGTTCATGCTTTCCTCGCAGCCGATCTGGGTTCGTGAGATCGCGACGGCCTTGGCTACCGACCAGCGCGGTAACATTCCTTGGGAGGCGGCAGAGTGAACGCACAACTCCCAAACGTCATCGACATGACGGAGGAGGCCGCATTAAGCGGCCCCTCCTTCAAAGGATCGCTAAGCAGTGACGAGTCCACCGGATCCTTGCTTGATCATCCACGGCTGGACCGCAGGTGCTTTCTCAAGGATGCCCTTTTTGACGCCAAAGGCGCGCATCGCGTCTCGGGCCACCTTGATCGGCTTAAGCCCATCATAAGCCATGAGGCACGTGCTCAGCGTGGTTTCGTAGATCGCATCACGCCTGTCTTCGGGCCATTCGTAGAGGAAGTCGATCGCGTCATCGATGCTTTCGATCTCCCTCACAAGGTGCTTACCCTCCTTCAGATAGATGGGTCTGTCGAAGGTCGTCTTGCTCATATTCTCCTCACTATTTGTCAGCATTTTCAACGACTAAGCCGACTTTTCTCGCCGGCTATAGGAGATTTGAGAAAGAGATTGCGGCATTTCAAGGGGGCATTCGCATGAACGCGCACTCCCGCGATTTCACACCGGCCCGTGAGGCACCAAACAACATTGAGGCGGAACAGGCGCTATTGGGCGCGATCCTGGTCAACAACGATGCGTTTCATCGTGTGGCCGACTTCCTCAAGGCTGCACACTTCTATGAGCCGCTGCACCGAAAAATCTATGACGTGGCATCTGAACGTATCCGGTCGGGCAACGCTGTTGATCCGATCCTGATCAAATCCTTCCTGCCGGCCGGCGAGATGGTCGGCGAGTTGACCGTGGCGCAATACCTGGCGCGGCTTGCCGTCGAGGCCGTCACAGTCATCAACGCCGCGGACTACGGGCGCGCTGTCTATGACATGGCACAGCGCCGCAGCCTGATCATCATCGGCGAGGAGATGGTGAACGTCGCCTATGACGCGCCACCAGATATGCCGGCAATGCAAATCGGCCGCGACGCTGAGGCACAGATCATCGAGGCCTTTGCCGAAGCGCCGGATGACGACAACTGCAACGTTGAAGATATCGTCGACGACATGGTTGCGGCGTTCTCGGCCATGGCGAAGAAGCCGGTGGTTCCACTGCCACTTCCTCAACTGCGGGAGACAATAGGCGGCGATATGGAGGGGGGCATGCTCATAGGCATGCTGTCGGGCTCAGGTGAGGGCAAGACATCTCTCGCACTCCAGATCGTCGGCGAGGCGCTGGCGAATGGCCATCCCGTCATGATCCTGTCCTTCGATCAGTCGAAGCAGCAGATCATCGATCAGATCGTCAGCCAGCGGACGGGGATTGAGAACACCCGCATTCGCGACCGGACGATGATGGACAAGGAGAAGGTTCGGTATATCGAAGCGCTCGCCGATGTCAGGACAACACCGCTGCGGATTAGGAAGTGCAATGGCTCGTACGATACCGCGGGTCATCTGGTCAGCTATGTGAAGAGGACGCTGCTTCCGCTCTGCCGGAGGCTGGGCAAGACCGGCCTTGTCGTTGTCGATCACGCCCGCAAGGTATGTCCACGCGATCCCAAAGCGCATGAGGGGCGCATCGCTGCCGAGATCAACGGCGTCTTCAAGCACTTCGCCGATGAACATGGCCTGGTCTGGCTC
AACCTTATGCAGAGATCGGCCTCAGGCGCCAAGCGCCGCAATCCTCGCCCAATCGATACCGACATCTTCGGCGGCGAGCAGGGCAGGGAGGATTATGACGCCGTCTTCTACCTCTACCGGGCGTGGAAGTACTGGCAGAACCAGATCAAGACTTCTGAGGACGATAAGGACGAAGACCGGATCAACGCTCGCTTCAATCGCGAGAAGTGGACCGAGGACCAGGCCGAAATCGGTGTCCTGAAATGGCGGTTCGGCGATCCGAACAAGCGCTATCGCGTCCGGTTCGAGGCGCAGTTCACCCGCTATGTCTCCATGCGCGAGGAGCCAGATCCTCAGCTTTTCGAGGAAACCCTCTGATGGCCAACACACCCGACTTTCTCCCCGACTATTCGAGCATGCCGTTCAACGGCCGCTACAGGCCCCTTTTTAAGTTGTTCAGTACCGACAACTGGAAATTCGTCCGCGAGAACGGCACGCCAGTGGAGCGCGACACGGCGCACCAAGCCATCCAGGCGGCCAAGGACTGTGTCAGGCGCATCCTTAACCCTGAAATCCGAGCCGAGCAGGCCGAGATCGTTGCTGACGTCCTAGGCGTCGAGGAATGGCGACGGGAGAGGGCGGCACTGGCTGCAGGTAATCAGGAGGCGGTGCTTGGTGCGGTGATTGTCAAAGGCCGGCAGGTGAAAGTCGAGCGGGTGCGGAGGCGGGCATGACGGAGCTTTCCGCACCGCAAGAGATGCTGCTTGGCGATGTCGTTCGACACGGCGCCGAAGTCGCGTCCCGCAATGCCAAGCCGGTTAGAGCCCTGCTGGCGGCAGGGCTCGTAGCCAACGGCGGACCAAATCGTAAGGGGCAGATGGTTCTGGTTCCCACGCCAAAGGGCAGGGAAGAATGGAACCGCATCGCTTCTGCCCGTTTGAGGGCGCAGGCATGAGCTTCGTCACTCCCGCATCCCAGATCATCGCCGGCGCTCGACCGATCGTCATCCCTCGCACCGAAGCGCAGCAGATCATCGCGGACGTGGCTCATGAGCATGGCGTGACCTATGCCGAGGTGTTGTCGGCCAGCCGCGCGCGGAAGCTCGTCGTGGCCCGTTATGACGCTATGGCCGCCGTCTACCGCGCAAAGCCGCAACTTTCGCTTATGCAGATGGGTAAGATGTTCCGGCGCGATCCGAAGACCGTCTGGCATGGCTTGATGCGACGGGGGCTGAAGTGAACGTCCACCAGCCCGCAGCATCCCTACACGGCCTTGTCTGCCCGTCCTGCCAGTCGCCAGACCTGCGGACGCTGGAGACACGCCCTACCGCTGGTGGTGTCAAACGGCGCCGGGAATGCGAGACATGCGGGCATCGGTTTACGACGGTGGAAAGGCTGAAGGAGCGGCCGGCGAAATGATCGATCCCCGCGTCCAGGCCCTGTGCGAGGAGTTCGAAGTCCGGATTGTGCCGAAGTCGGTATATCCGGGCCCGGGTGAGACGCGCGCCGTCGGTGCGCTGGCCAAGATCATCAATCGCCACGGCATGGAACACGCGCGGCTCGTGATGACGACGCTGGCAGAAACCGAGAACAACAAGGCAGCGCTCGAGGCGGCTGTGTTCGGCGCTGCGTCCGATCTGATCCGGGCAAGGCCCGAGTGGGTCGAGGATACCTCCAAATGGCTGGCGGTGTGGGATCGGTGCCCGGTTGGCGAGTTGCAGGCCTTGACGCACGAATTGCGCGGCCACGTATCGATGCGCAGCGCGCTCGCCGGCCTTATCTACGAACGGCTGTGGCGTGCGTTTGGTCCGCGTTCCATTCAACCAGATCTCTATGACGACAGGCGGCGAGCATGAACATCGGCGATATCGCGAACATCTTCATCCGAGCGGCCGAGATCGACCGCAACAGCCACGAGCACGTCGGGCCCCACGCTTTGCGAGCCCAACAGCTTCCCTATGTCCACGACCAGGCCGACAAGAACGGCTGGGGCAAGGCCAGGGAAAAGCGCGTCGTGAGGCG
GGGCAAGCTCATCCAAGGCGACTGGCTGGAGGAGGGCGACGATCCATTGGCGGCCGAGCGCAAAGCGTTCTTCGATCGAGTGGCGGCCATGCCGACTGCGGCTGAGATTACGCTTGTCGAAAGCCTGTTCGACTGGCTCCAGGCAACGGACGATGATGCCGAGCGCCGCGCGCTATGGGCATGGGCTCGTGCCAAGGCTGGCGGCAAGGCATTCCGTCGCTGGTGCTTCACTGTCGAAGGCATCCATCCCGAGACTGGTAGAAGGCGAAAAGATCGTGCCTTGGAGCGAATTTCGCGTCATCTTGCCGGGAAACAGCGTTTGCATGCACAAAACCCCGAAATCAGGGTGTTGTCATGCGGCCATGAAATCAGGGATGTTTCGGATACGATCGCCGAAGATGCGGGCAAACGAGACAACCTCAATTCATGGCTGGCAGATGATGCCTTTGCACCGTTCATGAGCAACGAGCCACAGGCGGCTTTCTCCTGGGCGAAGAAGCGGAACGAGATGCGTCGGCAGCGCGAGGCGAAGCGCCAGAAGCAGGCGGCTTAGCCCGGCAGCGGGGGAGTATGGACGAAAGCGTCATAAGTCCACCATGCGCCAACGCCGAAGGCGGCGAATGACGCAAGGCCGAGCAAAATCACGACGACGTTGGCGACGAGTCCTGCCATCTTTGTGGCAGGTGTGTCGTATCCATAAAGCCATTGGGACAAGTACGTGCCTCCGGAAACCAATCCGGCGAGGAGTGTTCCAAAGACGAATGGCGCGAGGCAGCCGGCGAAAGTCGGGATAGATGATGATCGGATGCTGGCTAGGTGCCCGAGAAATGCCAGCAGAGCTACTGAAGCCCCTCCGTTTATCGTTGTCATGCTGCGGATGGCGTTCTGGCCCAGTGTGATCACGGAGCGGAACATTTCCAAGTCGGTGGCGTGGGCGAGTTTGATGTTCTCAACGTGCTGCGTGAGTTCTGCCTTATATCGTTCGAGCTCGGCCGGCGACGGGTTCGGTTCCGGTGATTGTTCTACCCTGGCGAGGTAGTTGATCAGATTCTCGGTGGTCACCGCCTCTAGGCCGTTGGCCTGCAATGTTTCAATTTCGGACTTCAGTCGTCCCGCAAACTCTCTGATCCCCATGCCCATTCTCCCCAATGCCCGTCATGAGAGCTTCGCCCAGGCGTTGGCAAAGGGCAAGTCAGCGGGCCGGCGGTGAAGGGATTCGCCTAAAGCGCCTCGTTGAGGATCTGCTTCAGTCGCTGGGCAGAAGCTGATGGAAGATAAAACCCCATCCAACCGGCTCCTGGGTGCCTGATAGCCAGAACGACGCCATCACGATCTGGCCCGGGAGTGATCTTCCAAAACGGATTGACCAGCGCGGTGATGGCGGCGTTGGGATCAAGATCCGTGGGGACCTCGTCGGTCATATCCGCCCTTGCACGGATGAATGACGCGATGATCTCAGACAGTTCCTCGGGGCTGGTGAGGCTCCACGCTGTCATCTCTCCTGCTACATGGGTTTCAATCCGTGCCGCGCTACGATCGGGAGTGAGGCCGATGACGTACTCAGCTTTCTGCATCGGTGACCATCCATCGTTCGATTAGCCGCAACGTCGTCGAAGCAAAGGTAATCTCACGTGACGCTCACTGACAAGCAGCAGCGCTTCGTCGCCGAGTACGTCATCGACTTGAACGCCACACAGGCCGCAATCCGGGCTGGATACGCAGCAAAGACGGCAAATCGCGAAGGGTCGCGGCTGCTGTCAAATGTAGACATCGCAGATGCGATCGCCCGAAAAGCGGCAGAGAAGGCCGCAGCGCTCGATCTAAGCGCGGAGAGGGTGCTCAAAGGTCTGTTTGAGGAGGCGACCCGTACCGGCGAGGGAAGCTCCCACGGTGCCCGTGTGTCGGCCTGGGGACTACTCGGAAAATACCATTCGCTGTTCACCGATAAGATCGAGGCGAGCGTTACCGCCGATGTGACAGTGACTGACGCAAGAGGACAG
CTTGAATCTCTCATCGCTCGCCAGCTTGCCGCCGGAGGTAAGAAGCCAGGCGCTTAGCAGCCTGACCGATCAACAGTGCCGCGAACTTCTGCATGACTGGCGGTTTCTCGCCCGTCCAGAGCAACTGGAACCGGAAGGTGATTGGCAAACCTGGATGATCCTCGCCGGCCGCGGCTTCGGCAAGACGAGAACCGGCGCGGAGTGGACCCGAGAGCAGGTGAGGGCCGGTGCTACCAGGCTGCATCTCATCGCTCCGACGGCGTCCGACGCCCGCGATGTCATGGTGGAGGGCGAAAGCGGGTTGCTGGCGGTATGTTGGTCGGGAGACCGGACTTACGCCGGCGAGCCGATAGGCCGCCCATCCTATGAGCCATCCAAGCGCCGGCTGACGTGGGCCAATGGCGCCATTGCCACCCTGTTCTCAGCGGAGGAGCCAGAACGTCTTCGCGGTCCTCAGGCCGAAGCAATGTGGTGCGACGAGCTCGCGGCATGGAAGTATCTGCGCGAGACCTGGGACATGGCCATGTTCGGTCTTCGCCTCGGCGACCGACCGCGGACCTGCATCACCACCACACCGAAGCCGCTGAAGATCCTGCGTGAGATCATGCAGGACAAGCTCACGGTGGTGACGCGGGGCTCTACCTTCGCGAATTCTGCCAACCTTGCCCCGACCTTCCTCAAGGCGATCAAGGACAAATACGAAGGCACTCGCCTCGGCCGACAGGAACTTGAAGCGGAAGTGCTGGAGGAAGCCGAGGGGGCTCTATGGAGCCGCGCCTTGGTCGAGCAGTCCCTGCTTAAGGGCGCGTTGCCCGAAATGAAGCGGATTGTCGTCGCAGTCGATCCGGCAGTGACTGCCAAAGAGGAAAGTGCGGAGAGTGGCATCATTGCAGCCGGATTGGGGCGAGATGACCGCGGATACGTCCTGCAGGATGCATCGGGCCGCATGTCGCCAGGCAAATGGGCGGCAACCGCGGTGAGACTCTACCACGATCTCAAAGCCGACCGGATTGTGGCCGAGGGCAACCAGGGCGGTGATTTGGTCAAACATGCAATCCACACCATCGATGCCACCGTGCCAGTGACGATCGTCCACGCTTCTCGCGGCAAAGCAGCTCGAGCCGAACCTGTGGCCGCACTCTACGAACAGGGCAAGGTCAGCCACGTGAAAGGCCTAGCCGATCTCGAAGACCAGATGGTGAATTGGGAACCCCTTTCAGGGATGCCTTCTCCCGATCGCCTCGACGCTGCAGTGTGGGCCTTAACTGCGCTGATGCTGACCCAAGTTGTGCCGACCGCGGTTGTCGGCACCTATCAAACCGCACGGTAGCCATGGCACTCGAACCAAAGAACGATCCGTCGAACCCATCGGCGGATTACAAGGCTATGGCGCCTGATTGGGCGATGATTGCCGATATCCGCGCCGGCGCTCGCCGTGTGAAGGAGAAGGGCGAGCTCTACCTGCCTCGGTACGAGAACGAAGCTGTTTCGGCCTACAAGAAACGGCTGGAGGCGACGCCTTGGCGGCCTGAGTTTGTCGACGCGCTGCGCAATCTGTGCTCGAAGCCTTTCACTAAGGCTGTAGCGCTGCAGGGGACAGTGCCCGACGCGATCAAGGAGATTGCTGAGGATATCGACGGGGAGGGCAACGACCTTCACAGCTTCGCCCGGGCTCAGTTCGTAGAAGGCGTGGCAGCCGGCGTGGTCGGCATCTACGTCACCTATCCAGACAGCGAGCCGGCTAAGACGGTTGCCGAGGAGAAGAAGGCCGGCACGCGCCCCTATTGGGTTCCGCTCCTCGCCGAGAACATCCTCGCTCTCTACACGGTCAAGGTGAACGGGCGCGATCTGGTCCAGCATATCCGCCTTCGTGAATGCCATGTCGAGCGCGATGGCTACGCCGAGAAGGAAACGGAGCGGGTCCGCGTTATCGACATTGCGGCGCATGGCGCTTCGCCGACATGGGAGTTGTTTGAGAAGCAGGTCGACGCGACAACGAG
GGATGTCACTTGGGTCTCGGTCGGCCGCGGCGCGATCACACTCGACCTGATCCCGATCGTCCTGTTCTTCACTGGTGAGCGCTCCGGCATCTATCGAGTGAAGCCGCCGCTGATCGACTTGGCGGTCATGCAGATGGAGATCTATCGCGCACTGAGCCGCGAGGATGAGATCCTGACCTTCGCCGGATCTCCGATGCTCAAGGCGAAGGGGATGAACCCGCCTGCGCCGACCACCGCTCCGGTCGTCGTGGATGGCAGGGAACGACTGGTCGAGACACCCGCGCCCCAGATCACCGTCGGGCCGAAGACTGTGCTCTTCGCCCCGCCAAGCGAAAACAACCAGGCTGATTGGGATTTCATCCAGCCTGATGCAGCCAACATCACAGCGGTATCGGAAAGCGTCGACAACAAGATCGACCACTTTCGACGGCTCGCGCTTCAGCCAGCCACTCGAAAGTCGGGCAATCTGGTCGCCACGGTATCCGCGATCGACGCCGCCAAGGCGCACAGCGCCGTCGAGGTCTGGGCAAACGGGCTGAAGGACGTGTTGGAGCAGGCCTTCGTCTTCACCTGCATGTGGATGAAGATCGCGACCACGGTCGAGGTCTCGGTGCATACCGACTTCGGCATTGATGCCGGCGGCACCGATGAGACGGGCAATCTGCTCGAAGCCCGGAAGAATGGCGACCTGTCCCAGCGCACCCTGTGGGACGAAATGCAGCGCCGCGGCACGCTCGGGCCTCAGTTTGATCCTGAAGTGGAAGAGCAGCGCCTGCTCGAAGAAGTGCCCGGCGAAGACAGCCCCGACGATATTGAGGCTGCAACGACGCCTCGCAAGCCGGCAAACGCGGCCTGATCAACACAAGTTTTGGATTTTGCCTGCCCAACGGATGTTTGGCGGGTGTTTCGGCGCGGATGCGCCATAGCCGGGCGGATGCCCAGAAAGACCAGCAATGAAACTGAAACTCGTGACTGTCGAAGGGAAGACCTACGCCGAGGTGCAGGACGGCAAGCCCGTCTTTGTCCATGACGACGGCAAAGAGGTCGCATTCGATGCTGTCGGTACCGTCGCCACGATCACAAGGCTGAATGGCGAGGCCAAGACGCACCGTGAGGCCAAGGAAGCCGCCGAGACCAAGCTGAAGGCCTTCGAAGGCATCGAGGACGGCGAGGCGGCCCGTGCTGCACTGGAAACGGTCAAGAACCTGGATGCCGGCAAGCTCATGGAAGCTGGCAAGGTCGAGGAACTGAAGGCCGGTATCAAGAAAGCAGCTGAGGAATCCGTCGCCGCTGCGAACAAGGCCAATGCCGAAGCGCTGGCCGCTGAGAAGGCGCGTGGTGACAAGCTGGAGCTCGCCCTCAACGGTGAGATGATCGGCGGCAGGTTTGCTCGCTCCCAGTATGTCGGCGACAAGCTCGTCCTGCCTGGTCCTGCCGCTCAGAAGGTCTTCGGCGATCATTTCAAGATCGAGGATGGCAAGGTCGTGGCGTACGACGCCGCCGGCAACAAGCTGTTCTCTCGCGCTAAGCCCGGCGAAATCGCCGAGTTCGATGAAGCCATGGAGATCCTGGTCGATGCTTATCCCTACAAGGACAGCATCCTGAAGGGTACCGGCCACAAAGGCGACGGTTCGCGCGGCAGCCAGGGCAATGGGCAGGGCGGCGCTAAGACCATGTCCCGGCAGGCATTCGATGCGCTCGATCCCGCGTCCAAGTCCGCAAAGATCAAAGAGGGCTACACCCTCACTGAATAGCGTCCCCACGACGCTGTGACTGCCCCACGCGGATGCGGGACGGGGCGACCGGGCTGGATGGCCCAATTCCCCCAAACATCAATCCCAAATCTGAAAGGTAGCCACCATGGGTAATACTCTGACGGGCCTCGTCCCGACTATCTACAACGCGCTGGACGTGGTTTCGCGCGAATTGGTCGGGCTCATCCCGGCTGTGACCTCGGACATGACCTATGCGCGCGCCGCAGTTGGTCAGACCGTCATGTC
GCCGGTTACCCCGGCTGCAACGGCCACCGATATCACTCCGGCCGTGACGCCGCCGAACGATGGTGACCAGACCATCGGTAACGTGCCGATGACCATCACCAAAGCTCGCCGTGTCCCGATCCGCTGGAATGGCGAGGAAAAGCTCGGCCTCGACAACAACGGCGCCAGCTACAACACCATCCTTTCCAACCAGTTCCAGCAGGGCATGCGTACCTTGGTGAACGAGGTGGAAGGCGATTTGGCCGCACTGCACACCAAGGCCTCTCGCGCTTACGGTACCCCGGGCACCGCGCCTTTCGGCACCCCTGCTGACCTGAGCGATTCCGCAGGCGCTCTTCGCATCCTGGAGGAGAACGGTGCTCAGGGCCTCGACTTCCAGCTTGCCCTTGGCACCGCCGCCATGGCCAATCTGCGCGGCAAGCAGTCGGTACTGTTCAAGGTGAACGAAGCCGGCCGCGAAGACATGCTGCGAAACGGCATCACCGATCGCCTGCAGAACCTTGCGCTGCGCCAGTCGGCTCAGATCAAGACCTTCACGGCGGGTACCGGCGCAGCGGCCACCACCAACGCCACGGGCTATGCCGTCGGCGCCACTGCGATCACCCTGGCTTCGGCCGGCACCGGTACCATCCTCGCGGGCGATGTGATCAGCTTCGCTGGTGATACCAACAAGTACGTTGTAGCCGCAGGGGACACGGATGTGTCCAATGGCGGCGTCATCACCATTGCAGCCCCTGGCCTGCAGAAGGCAATCCCGGCTGCGGCCACGGCGATTACGGTGTCTGCAACGGGCGCGCGCAACATGTTCTTCGCTCGCTCTGCCATCGCTCTGGCCACTCGCGCTCCGGCGCTGCCGCCGCAGGGTGACTCGGCCATCGATCGCATGATCGTGACCGATGCGCTGACCGGCCTGAGCTTCGAAGTCTCGATGTACGCCCAGTACCGCCAGATGCAGTACGAAGTCGCGCTGGCGTGGGGCTGCGCCGCAATCAAGCCTGAACATATCGGCCTGCTGCTCGGCTAAGCCTTCCGTCATCGACGGGCGCGCTTCGGTGCGCCCGTTCCTCTTTCAATCAGGATCAACGACATGAAAACGATCAAGGTTCAGCCTTGGGGCGAAGATCAGGGCGAGTACGTGCTGATCAACGAGGAAGACTTCGACGCCAAGTTCCACGAGCCCTTCGATGAAGGTGCGCAGCCGACCGACGCGCCGAAAATCGAACTCGGTACCGACAGCGGCGAGGAGTTCAGCGACGAGCAGCTTCGCGACGCCATCGAAGCAGCTAGCGGTCAGCGTCCCCACCACATGACGGGTCGCGCCAAGCTGATTGCGCAATTCAACGAACTCAATGCAGAGGCGGCGGCCGAGTAGGGCCGAGTGAACTATGCTCATCGTCGAGGATGGCTCCGGCCTTGCCAATGCAGAAAGCTACGTCAGCGTGAGCGATGCCGTGGCGTATGCCGCTGCGCGCGCGCTCACGTTCCCTGCAAGCCCCACCGACAAGGCTGAAGCAGCGCTGCGCCGTGCAACAGCCTACATCGACAACACCTATCGCACGCGGTTCCCAGGCCAGCGGAAGGAGTTCCGCCTGCAGGCTTTGGAATGGCCTCGCGTCGGCGTTGTCGACATGAACGGGTTTCCGGTCACGAGCGACGAGATCCCGGTCGAGATCGTCCGGGCCGCTTGTGAAGCAGCGGTACGTGAACTGGCAGCGCCTGGTTCGCTCACGCCAGACGTAACGCCCGGCAAGGTGAAGAAACGCGCCAAGGTTGGAGACATCGAGGTCGAATATGCGGTGGGTGGCGGTGGCGTTGCGAGCCAGCAGCCGATCTCGCCGATTATCGACGGGATACTCGCTGCGCTGATCGGCATAGGGCAACCGTTCACAGCTTCGGCGGTGCGCGGATGACCGGCTTCTACGAAGAACTCCGCGGCGTGGCCGACGAACTGTTTGGCGAGTTCAAGCAGGGGAGCGTGCAACTCCGCCGCGTCACCACAA
CGCCGGGCCCGAACGAATGGGACCCGCCGACTGAGACGACTGAGACGTGGGAACTGGGCGCAGCGGTGAAGCGTGTCGACCAGCGTTACGAGAATGGCATCTTGATCGTGCAGACCGGCGACATCGTGACGTTTGCCGTGCCGCCAGTGGCGCCGATCCTGAGCGACGGACTCGTCATCGACGGCACTCTTCGGGCGATCACCAGCCTGCGCCCTACGCCGTCAGCGGGCACTGTAGTGGCCTGGGAGGTGTTCTGTGCCGCCTAAGTAAGTGTGTAAAAGATGGCTTAGAATAGAGCCGATGATATTGCGGACTCACTTGGATTTTGTTTACGTTACTGCGCGGAGAAAATTGCCAAGGGGGATTAGATGTGTCGAGCGCTGGCTATTTGCTTAACTGTGGTAGTCGCTGGGTGTACAACTGGTGGAGATATGAATGATTTAACTCCGGTTGGGTACACAGCGCTTAACCCGACAAGCACTGGTGTTAAAGTTGGCCACCTTTATTTTGAAGGACCTGGTAAACGTCCAGCCTTCAAGACGGCAAACGGTAGGCTTTACAGTTCCCTATGCTACATAGATTTTGAAAAGACACCGCCGCTTATCGATATTGGAAAACACGTAGTTGATGAGGGTGTGAGCGTTTCCAAACTAGATACGAGCGACTCGGGTAACCTTAGCGGTTCCGGGTCAATACCAACTCTGGGCGCTGCAGCGGCCGGCGCTAGTGCGGGGGCCAAGAAAGGCTACGTTGTCGAGAACTTGCATGTATTCACTCTTGCCGGCGACGGCGACGTAGTGGTGCGCAAATCCATTAAGCCGAATTGTCGTGATCAAATCCGAACCCTCAAAAGAGCAAACAAGAACGTTGTTCTCGCAACCGGTGCCACGCGTGCGGAAACGCTCTCTGAAACAAAGAGCTACAGCCTGCAAGGGAACATAGGTGGGGAATTCAAGGTCAAGATCGGGAACGAAAAAAAGGCAGCTACCTTTCAAGTGGACACCGGGGAAAAACAAGGCGGAGGGTTCACACACGCAAAAACCCTGACCAGAAAAAATGTCTATCTTTCAGTTAAGCTAGACCGGTTTCAACAATAGCGGGCAAAAGGCCTGCGCATGGCGGGCTCATGCTCAAACGCTTATCACCTCGCGAGCGCTTCGAACAGCTTGTCGCGACATGGGAGCCGCTGCTTCGCGCCGCGTTCATTGAGGCGATCGACGACATTCGGTCGAACATCGTCTTGCGACGGATTGTAGAGCGGCTGGAACGCGCTGACATTGCCGGTGCAATCCGAGCCCTCAACCTCGAGGAGGCGGCATTCCGGCCGCTGGAAGAGGCCATCCGGCAGGCATTCAACGGTGGTGGCATTGCTGCGGTCGAGCAGATGCCTCAACTGCGTGAGCCCGATGGGCATCAGGTCGTAGTACGCTGGGATGCTCGCAACCTGGCAGCCGAGAATTGGCTGCGTGAGCACAGCGCCACGCTGGTCACGAACATCGTCGAGGATCAAAAGGTTGCGATCCTATCCGCTTTGGAGGAGGGGCTGTCCCGCGGCGATAACCCAACCAAGACAGCGCTGCAGGTCGTCGGACGGGTCAACCGGGTCACAGGGAAGCGCGAGGGCGGCGTGATCGGGCTCACCACAGCCCAAGCGGAGTATGTCGCCACAGCCCGACAGGAACTGTCCCTCGGTGAGCCCGATCAGCTGAGAAACTACCTATCGAGGAACCGCAGGGACAAGAGGTACGATCGCACTATCCTTGCGGCAATGAAGACGGGCGAGCCGCTGCCGACCGAGACCATCGATCGGATCACATCGAGGTACGCTGACGGGCTGTTGAAGCTTCGCGCCGACACGATCGCGTTGCATGAGACATTCGAAGCTCTGAACACCTCCAAAGAGATCGCGTTCCGCCAGCAGATCGACAAGGGCAGGGTGCAGGCTCAGAACGTCACCAAGACATGGCGCCACACACCACAAGAGCATCCCCGAGCGCACC
ATGTGGCGATGAACGGCCAGAAAGTGCGGTTTGATCAGCCGTTCATGGCGCCGGATGGAACCGCAATCCCATATCCACATGCTCCCGGCGTTCCGGCGCGGCATACTCTCGGATGCAAGTGCTACTGCGATTTCCGAATTGATTTCGTCGCTGAACTGGTGCGCTGA
Protein sequences of DBSCAN-SWA_1 >CP029562|5616153:5632245|5629984_5630344_+|QAZ45950.1|DBSCAN-SWA MTGFYEELRGVADELFGEFKQGSVQLRRVTTTPGPNEWDPPTETTETWELGAAVKRVDQRYENGILIVQTGDIVTFAVPPVAPILSDGLVIDGTLRAITSLRPTPSAGTVVAWEVFCAA >CP029562|5616153:5632245|5616153_5618505_+|QAZ45936.1|DBSCAN-SWA MTVTNIDIEFRHFHLFCGLGGGAKGFNKGTARVGSLKARFRCIGGIDVDPASIRDFSRAAKVEGTVRDLFDEDQYRAFHGEAPPPGWRPAMPADIRRAAHGERPHVVFLSAPCKGFSGLLPEKSSMTDKYQALNGLTLRGVWLMLEAWKDHPEGLPELIIFENVPRIMTRGRFLIDQIVALLRAYGYAVAETTHDCGEIGGLGQSRKRYLLVARHEAKVPPFLYEPEKRRLRGVGEILERLPLPGDLRAGPMHRLPALQWKTWVRLAFVEAGSDWRSLNKLAIENGSLRDFGIMPDHEWQAGVLGVRQWLDTSGVVAGRSSPTNGAFSVADPRPENFPADRGGLLGVGKWAETASLITSQRSPQQGRYSVADPRFVGRENQQYGVRQWDRPSGALTSQSTPGGGTFSVADPRMEGKPRFNNVFRIVPWAGTSPAVAGPGGPAGGLAVADPRAEADATYQQTKYRVTGYKEPAGTVIAASTTGNGAFAVADPRPVDDLRSGAHGVRKWGETSGTVQGESLPSNGAFAVADPRPGKATEDLQGNWYVNPFDQAPHAVVGNRKSGASAVADPRPGYGENTHQNILSVGAWEKHSKAATGATHVAGGALSVADPRPAAFGDKRENYQTGGHYGVVPWKGTAYAVPGFAKHDRGNWSVADPRETEAEPLFELPKPDDRLVAVIRALDGTWHRPFTTLELAALQSLVDPDEVAEGYSLDGQSDSAWRERIGNAVPPDAAEAIAGVMGTTLLLAMTGETFMLSSQPIWVREIATALATDQRGNIPWEAAE >CP029562|5616153:5632245|5631204_5632245_+|QAZ45952.1|head|DBSCAN-SWA MLKRLSPRERFEQLVATWEPLLRAAFIEAIDDIRSNIVLRRIVERLERADIAGAIRALNLEEAAFRPLEEAIRQAFNGGGIAAVEQMPQLREPDGHQVVVRWDARNLAAENWLREHSATLVTNIVEDQKVAILSALEEGLSRGDNPTKTALQVVGRVNRVTGKREGGVIGLTTAQAEYVATARQELSLGEPDQLRNYLSRNRRDKRYDRTILAAMKTGEPLPTETIDRITSRYADGLLKLRADTIALHETFEALNTSKEIAFRQQIDKGRVQAQNVTKTWRHTPQEHPRAHHVAMNGQKVRFDQPFMAPDGTAIPYPHAPGVPARHTLGCKCYCDFRIDFVAELVR >CP029562|5616153:5632245|5618576_5618876_-|QAZ45937.1|DBSCAN-SWA MSKTTFDRPIYLKEGKHLVREIESIDDAIDFLYEWPEDRRDAIYETTLSTCLMAYDGLKPIKVARDAMRAFGVKKGILEKAPAVQPWMIKQGSGGLVTA >CP029562|5616153:5632245|5629165_5629450_+|QAZ45948.1|DBSCAN-SWA MKTIKVQPWGEDQGEYVLINEEDFDAKFHEPFDEGAQPTDAPKIELGTDSGEEFSDEQLRDAIEAASGQRPHHMTGRAKLIAQFNELNAEAAAE >CP029562|5616153:5632245|5629463_5629988_+|QAZ45949.1|DBSCAN-SWA 
MLIVEDGSGLANAESYVSVSDAVAYAAARALTFPASPTDKAEAALRRATAYIDNTYRTRFPGQRKEFRLQALEWPRVGVVDMNGFPVTSDEIPVEIVRAACEAAVRELAAPGSLTPDVTPGKVKKRAKVGDIEVEYAVGGGGVASQQPISPIIDGILAALIGIGQPFTASAVRG >CP029562|5616153:5632245|5623298_5623652_-|QAZ45942.1|DBSCAN-SWA MQKAEYVIGLTPDRSAARIETHVAGEMTAWSLTSPEELSEIIASFIRARADMTDEVPTDLDPNAAITALVNPFWKITPGPDRDGVVLAIRHPGAGWMGFYLPSASAQRLKQILNEAL >CP029562|5616153:5632245|5630713_5631175_+|QAZ45951.1|DBSCAN-SWA MSVSKLDTSDSGNLSGSGSIPTLGAAAAGASAGAKKGYVVENLHVFTLAGDGDVVVRKSIKPNCRDQIRTLKRANKNVVLATGATRAETLSETKSYSLQGNIGGEFKVKIGNEKKAATFQVDTGEKQGGGFTHAKTLTRKNVYLSVKLDRFQQ >CP029562|5616153:5632245|5627938_5629102_+|QAZ45947.1|coat|DBSCAN-SWA MGNTLTGLVPTIYNALDVVSRELVGLIPAVTSDMTYARAAVGQTVMSPVTPAATATDITPAVTPPNDGDQTIGNVPMTITKARRVPIRWNGEEKLGLDNNGASYNTILSNQFQQGMRTLVNEVEGDLAALHTKASRAYGTPGTAPFGTPADLSDSAGALRILEENGAQGLDFQLALGTAAMANLRGKQSVLFKVNEAGREDMLRNGITDRLQNLALRQSAQIKTFTAGTGAAATTNATGYAVGATAITLASAGTGTILAGDVISFAGDTNKYVVAAGDTDVSNGGVITIAAPGLQKAIPAAATAITVSATGARNMFFARSAIALATRAPALPPQGDSAIDRMIVTDALTGLSFEVSMYAQYRQMQYEVALAWGCAAIKPEHIGLLLG >CP029562|5616153:5632245|5621908_5622631_+|QAZ45941.1|DBSCAN-SWA MNIGDIANIFIRAAEIDRNSHEHVGPHALRAQQLPYVHDQADKNGWGKAREKRVVRRGKLIQGDWLEEGDDPLAAERKAFFDRVAAMPTAAEITLVESLFDWLQATDDDAERRALWAWARAKAGGKAFRRWCFTVEGIHPETGRRRKDRALERISRHLAGKQRLHAQNPEIRVLSCGHEIRDVSDTIAEDAGKRDNLNSWLADDAFAPFMSNEPQAAFSWAKKRNEMRRQREAKRQKQAA >CP029562|5616153:5632245|5618980_5620435_+|QAZ45938.1|DBSCAN-SWA MNAHSRDFTPAREAPNNIEAEQALLGAILVNNDAFHRVADFLKAAHFYEPLHRKIYDVASERIRSGNAVDPILIKSFLPAGEMVGELTVAQYLARLAVEAVTVINAADYGRAVYDMAQRRSLIIIGEEMVNVAYDAPPDMPAMQIGRDAEAQIIEAFAEAPDDDNCNVEDIVDDMVAAFSAMAKKPVVPLPLPQLRETIGGDMEGGMLIGMLSGSGEGKTSLALQIVGEALANGHPVMILSFDQSKQQIIDQIVSQRTGIENTRIRDRTMMDKEKVRYIEALADVRTTPLRIRKCNGSYDTAGHLVSYVKRTLLPLCRRLGKTGLVVVDHARKVCPRDPKAHEGRIAAEINGVFKHFADEHGLVWLNLMQRSASGAKRRNPRPIDTDIFGGEQGREDYDAVFYLYRAWKYWQNQIKTSEDDKDEDRINARFNREKWTEDQAEIGVLKWRFGDPNKRYRVRFEAQFTRYVSMREEPDPQLFEETL >CP029562|5616153:5632245|5620969_5621296_+|QAZ45940.1|DBSCAN-SWA 
MEPHRFCPFEGAGMSFVTPASQIIAGARPIVIPRTEAQQIIADVAHEHGVTYAEVLSASRARKLVVARYDAMAAVYRAKPQLSLMQMGKMFRRDPKTVWHGLMRRGLK >CP029562|5616153:5632245|5620434_5620791_+|QAZ45939.1|DBSCAN-SWA MANTPDFLPDYSSMPFNGRYRPLFKLFSTDNWKFVRENGTPVERDTAHQAIQAAKDCVRRILNPEIRAEQAEIVADVLGVEEWRRERAALAAGNQEAVLGAVIVKGRQVKVERVRRRA >CP029562|5616153:5632245|5621471_5621912_+|QAZ47543.1|DBSCAN-SWA MIDPRVQALCEEFEVRIVPKSVYPGPGETRAVGALAKIINRHGMEHARLVMTTLAETENNKAALEAAVFGAASDLIRARPEWVEDTSKWLAVWDRCPVGELQALTHELRGHVSMRSALAGLIYERLWRAFGPRSIQPDLYDDRRRA >CP029562|5616153:5632245|5622627_5623212_-|QAZ47544.1|DBSCAN-SWA MGIREFAGRLKSEIETLQANGLEAVTTENLINYLARVEQSPEPNPSPAELERYKAELTQHVENIKLAHATDLEMFRSVITLGQNAIRSMTTINGGASVALLAFLGHLASIRSSSIPTFAGCLAPFVFGTLLAGLVSGGTYLSQWLYGYDTPATKMAGLVANVVVILLGLASFAAFGVGAWWTYDAFVHTPPLPG >CP029562|5616153:5632245|5624079_5625408_+|QAZ45944.1|DBSCAN-SWA MNLSSLASLPPEVRSQALSSLTDQQCRELLHDWRFLARPEQLEPEGDWQTWMILAGRGFGKTRTGAEWTREQVRAGATRLHLIAPTASDARDVMVEGESGLLAVCWSGDRTYAGEPIGRPSYEPSKRRLTWANGAIATLFSAEEPERLRGPQAEAMWCDELAAWKYLRETWDMAMFGLRLGDRPRTCITTTPKPLKILREIMQDKLTVVTRGSTFANSANLAPTFLKAIKDKYEGTRLGRQELEAEVLEEAEGALWSRALVEQSLLKGALPEMKRIVVAVDPAVTAKEESAESGIIAAGLGRDDRGYVLQDASGRMSPGKWAATAVRLYHDLKADRIVAEGNQGGDLVKHAIHTIDATVPVTIVHASRGKAARAEPVAALYEQGKVSHVKGLADLEDQMVNWEPLSGMPSPDRLDAAVWALTALMLTQVVPTAVVGTYQTAR >CP029562|5616153:5632245|5627031_5627832_+|QAZ45946.1|DBSCAN-SWA MKLKLVTVEGKTYAEVQDGKPVFVHDDGKEVAFDAVGTVATITRLNGEAKTHREAKEAAETKLKAFEGIEDGEAARAALETVKNLDAGKLMEAGKVEELKAGIKKAAEESVAAANKANAEALAAEKARGDKLELALNGEMIGGRFARSQYVGDKLVLPGPAAQKVFGDHFKIEDGKVVAYDAAGNKLFSRAKPGEIAEFDEAMEILVDAYPYKDSILKGTGHKGDGSRGSQGNGQGGAKTMSRQAFDALDPASKSAKIKEGYTLTE >CP029562|5616153:5632245|5623709_5624135_+|QAZ45943.1|terminase|DBSCAN-SWA MTLTDKQQRFVAEYVIDLNATQAAIRAGYAAKTANREGSRLLSNVDIADAIARKAAEKAAALDLSAERVLKGLFEEATRTGEGSSHGARVSAWGLLGKYHSLFTDKIEASVTADVTVTDARGQLESLIARQLAAGGKKPGA >CP029562|5616153:5632245|5625911_5626934_+|QAZ45945.1|DBSCAN-SWA 
MNGRDLVQHIRLRECHVERDGYAEKETERVRVIDIAAHGASPTWELFEKQVDATTRDVTWVSVGRGAITLDLIPIVLFFTGERSGIYRVKPPLIDLAVMQMEIYRALSREDEILTFAGSPMLKAKGMNPPAPTTAPVVVDGRERLVETPAPQITVGPKTVLFAPPSENNQADWDFIQPDAANITAVSESVDNKIDHFRRLALQPATRKSGNLVATVSAIDAAKAHSAVEVWANGLKDVLEQAFVFTCMWMKIATTVEVSVHTDFGIDAGGTDETGNLLEARKNGDLSQRTLWDEMQRRGTLGPQFDPEVEEQRLLEEVPGEDSPDDIEAATTPRKPANAA |
19 | Sinorhizobium_phage(18.18%) | coat,terminase,head | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
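The keyword rule described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the DBSCAN-SWA implementation: the function name `phage_keywords_in` and the case-insensitive substring matching are assumptions, and the keyword list is copied from the sentence above.

```python
# Keywords taken from the report text; lowercase for case-insensitive matching.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def phage_keywords_in(annotation: str) -> list[str]:
    """Return the phage-related keywords found in a protein annotation.

    Hypothetical helper: uses plain substring matching, so a keyword such as
    'plate' would also match inside a longer word like 'template'.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


# Annotation fragments in the style of the FASTA headers in this report.
for header in (
    "QAZ45952.1|head|DBSCAN-SWA",
    "QAZ45947.1|coat|DBSCAN-SWA",
    "QAZ45950.1|DBSCAN-SWA",
):
    hits = phage_keywords_in(header)
    print(header, "->", hits if hits else "no phage keyword")
```

A protein would be colored when the returned list is non-empty. A stricter variant could match whole words only, at the cost of missing compound annotations.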
DBSCAN-SWA_2 |
5809416 : 5858276
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP029562|5809416:5858276|DBSCAN-SWA ATCAGGCTGTCGCTTCCGGCGCGGGTTCGACGTAACGCGGAAACACCGGTTGCGGCGCCGGCAGTGCCGTGCCGGGCACCAGTGCATATGCGTCCGCCAGATGCTCGAAGTCGCGCTTGTCATCGGCCACGGCCAGCGTATCGAGCAGCTTCGCGGCCGAGCCCGGGATATAGGGCTGGCACAGGATGCCGACACGCCGGATGATCTCCGCCGTCGTCCACAGCACCGTTTCCATGCGGGCGGGGTCGGTCTTCTTCAGCGCCCAAGGCTCCTGCGCCGCGAAATAGCGGTTGGCATCGGCGACGACCGCGAAGATCGCCGCCAGTGCCTGATGGATGGCCTGCTCGCTCATCGCCTTGCGTGCTGCCGCCAAGGCCGCGGTTGCCTGCTCGAGCATCGCGTTGTCGGCCTCCGCAAGCGTGCCACGCTGCGGCACCACACCGTTGCAGTTCTTCGCGATCATCGACAGCGAACGCTGCGCCAGATTGCCGAGATCGTTGGCGAGGTCGGCATTGGTGCGGTTGACGATCGCCTCATGGCTGTAGCTGCCGTCCTGGCCGAACGGCACCTCGCGCAGGAAGAAATAGCGAACCTGGTCGAGGCCGTAATGATCGACCATCGTGAACGGGTCGATGACGTTGCCGACCGATTTCGACATCTTCTCGCCGCGGTTGAACAGGAAGCCGTGTGCGAACACGCGCTTCGGCAGTTCGATGCCCGCCGACATCAGGAAGGCCGGCCAATAGACGGCGTGGAAACGCACGATGTCCTTGCCGATGATGTGCGTGGCCGGCCAGTAGCTCCAGGACGGCGACTGGGTGTCCGGATAGCCGGCGGCGGTGATGTAGTTGGTGAGTGCGTCGACCCAGACATACATGACGTGTTTTTCGTCACCCGGCACCGGGATGCCCCAGTCGAAAGTGGTGCGAGAGATCGACAGGTCCTTCAGGCCCGATTTGACGAAGGAGGCGACCTCGTTGCGGCGCTCGGACGGACCGACGAAGTCGGGCTGATTTTCATAGAGCGCCAGCAGTTTGTCCTGATAGGCTGAGAGCCGGAAGAAGTAGCTTTCCTCCTCGACCCATTCCACCGGTGTACCCTGCGGGCCATAGCGGACATTGTCGGCACGGACTTCGGTTTCTTCCTCGCCGTAATAGGCCTCGTCGCGAACCGAATACCAGCCGGCGTAGCCACCCTTGTAGATGTCGCCATTGGCTTCCATCGCCTTCCAGATGGCCTGGGAAGAAGCATGGTGGCGCTGTTCCGTCGTGCGGATAAAATCATCATTCGACGCATTGAGCGCCTGCGCCATCCGTCGGAATTCGGCAGCGTTGCGGTCAGCCAGCTCGCGCGGGGTGATTCCTTCCTTCTTGGCCGTCTGCAGCATCTTGATGCCGTGCTCGTCCGTGCCGGTCAGGAAGAACACGTCCTTGCCGTCGAGGCGCTGGAAGCGCGCAAGCGAATCCGTTGCGAGCAGTTCGTAGGCATGGCCGATATGCGGCTTGCCGTTCGGATAGGAAATCGCAGTGGTGATGTAGAATTTCTCACGGGACATTGTGGGCCGTCATGAATGTCTGAATGGAAGTTGTGGCGTGGATATCGCATCGGCATGGCGAAAGCCATCCCGAGAGGGCCGTCACATTCGCATCGCAGAATTCAGTCGGTCGATCATCGTGAGTGCGTGTTGTTTCTTGTCGAGATTGTAGGTCTCGGTCTCAGATATAGCGCGCTGCGCATCTTGCCAAGTGTCGGACAAAAGTTTGGCACGCCCCAGATCGCCAGCGATCGCGGAGGAACTTGCCGCCGAGGACAGCAATTCCAGGGCATGGCGGTTGAAGGTCTCGAACGGGATCGCCTGATCGCGACCGGCGACGGTTTCAGCCAGTTTGTAGGCGCCGGCGAGGTCCACCCGACGGGTCGAGACCA
GCTTTTCGAGTGCTGCGGCAATCTCCAGGCCGCCATATTGGGTCAGCAGGATGGCAGCCCGGGCGCTGCCCCCTGCCCGCTCCGCAAGCGCTGCACTTTGGGCGGCGTCATCCGGTGTGCCTTGACCGATTGCATCCAGCACCGTCATCAGGTCTCGCTCTCCAAGCGGCGCGAGCCTCAGCACCTGACAGCGCGAGCGGATGGTCGGCAGCAGGCTGCCCGGTGCATGCACGATGAGCACGAAAATCGTACGTGCCGGCGGCTCTTCAAGATTTTTGAGCAACGCGTTGGCAGCATTGGTGTTCATGTCGTCCGCTGGATCGACGATCACCACCCTGTAGCCGCCGTCGTGCGAGGTCATCGACAGGAACTGTCCGACCTTGCGGATCTCGTCGACTGTCACCACGCTCTTGAATGCCTTGGTACGGTCATTCATCGGCCGTGTGAGATGCAGGACGGAGGGATGCGCGCCCATCGCGATCTGCCGGTAGAGCCCCGACGCGGGATCAGGAACAACCAGGGTTTCCGGCGCCCGATTGCCGTCCGGGTTTTTCAGCAGATGATACGCGAGGTGGAAGGCGAGCGTGGCCTTGCCGATGCCTGCCGGCCCGCACAGCATCAGCGCATGCGGGAGCTTGCCCGCTCTGTAGGCAGCCGCCAGAGTTGTCGTGACGTCCGAGTGTCCGATTAGGCGCGGGTTCTCTGCGGGCTCCGGAACCTCATCCAGCGTATCGTGCTGCTCAGGCGCCAGCCGCTCGAAACTCATGCCGGCGCCTTGTGGCTGGTCTTGGCCGGCCTGGACTTCTCCAGCATCGCGAAAACGGCATCCGTGACCGCATGTTCGACCGTGTTGGCGTCACCGGAAGCGTCGATGACCACGCAACGCTTGGCCTCGGCCCTGGCGATGGCCAGGTAGGCATCGCGGCGCTGTTGATGAATGGCCAGGGTCTCCTTTTCGAAGCGGTCCGGCGTTTCGCCGGGATTGCGCCGGGCTGTTGCGCGGCGCAATCCTTCTTCCGGATCGATGTCGAGGATCAAAGTAAGGTTCGGCACCATGCCATTGATGGCCACGCGCTGCAGGGTATCGATGAACGCCGGGTCGAGGCCGCCGGTGGCGCCTTGGTAGACGCGCGAGGAATCGATGAAGCGATCGCAAAGCACGATCGCGCCACGCTCCACGGCAGGCCGGATCACCTGTTCTACATGATCGGAGCGCGCTGCGGCGAACAGCAGCGCCTCCATCTTCGGTCCGAAGGGCTCGGCCGCACCGGACAGCAGAACGTGTCGAATGGCCTCGGCCCCGGGTGAACCACCCGGTTCCCGGGTCACCAGGACATCGTATTTCTTGCCACGCAGCCGCTCGGCCAGCCGCTTGATCTGCGTCGACTTGCCGGCACCCTCGCCGCCCTCGAAAGTGATGAAAGATCCGTTCGCCAAACGCTGCGCCTCGGTTGTGGCCCCGACATCAAACATAGGTGTCCAACGCCGCTGCGGCGAGAGGACTAGCGCATCGGACGGAAAATCGGAAATCGATTTTCAACAAGCGCGATGCGCGAATCCAGGTGCCGGAATGTGCTTTGCCTGTCGCCGCAACCCGAGTCGCCCAGACGCACATCCGAACGACCCACTCATAGTGCCGGCCGACGGCAAGGTCCACAACTGACACACAAAGGCGTGCACAAGAATTGCTGAAGAGCGACGCCCCTGCCCGCCTAGCGCAGCCAGCCGACCGCCAGCTCCTTGACCGCATCCGCCGCGCGGCGCGGCAGCGAACCGACTTCCACCGTTTCGGCGGCGAACACCGGGGTTTCCTGACTCAACGTGTCGCCAATCCAGACGCGCAGCGTGCCGACAGGCTGGCCGACCTCGACAGGCGCCGGTAGCGGGCCGTTGTAGACGATACGCGCTGTAACCTTGTCGCGATTGGCGATCGGCAGGACGAGGTCTATCGCTCCCTTGGCTTTCAGCGCCACGCCGGATTTGACGCCGCCAAACACCGC
TGCTTCGCCAACAACTTCGTCCTTGGCGAAGACTTCGGTCTTTTCAAAGGAGCGCAGGCCCCAATCCAGCAGCTTGCGGACTTCCTCCGAGCGCTCGCGGTCGTTGGCCAGGCCACTCATGGCAGCAATCACCCGCTTGCCGCCCTGGTTGAGGGCACCGACAATGGCAAAACCATCCTGTTCACTGGCGCCGACCGCCAGTCCTTCAGCACCGGCATCCATCGCCAGCAACGGGTTGCGGTTGCGCTGGGTGATCTTGTTCCAGGTGAAATCCTTGAGGGCGAAATAGGGATAGAATTGCGGATACTCGCGCGAGAGGCGCAACGCCAGCAGCGTCATCTCGCGCACCGTCGTCTTCTGACCTTCCGCCGGCAGCCCGGTAGCATTGACGAAGGTCGAGATCGGCAATCCGATCTGACGTGCGCGCTCGTTCATCTGGAGAGCGAAATTTCCTTCCGAGCCGGCAAATCCTTCGGCGACGATGATGCAACCGTCATTGGCGGCCTGCACCGCGATGCCCTTCATCAGGTCTTCGAGCCGGATCGCCGACTTGAGCGCGGCAAACATGGTGGAGGTGCCCGAAGGCGCACCGCCCTTGCGCCACGCATTTTCGCTGACCACGAAGGTGTCGTTGAGCGTCATGCGGCCGCTCTTCACGGCATTGAAGGCCATTTCAAGCGTCATCAGCTTGGCGAGCGACGCCGGCTGGATCGGTTTGTCGGCCTGCTTGGAGAACAGCACCGTGCCGGTCTCGGCGTCGATCATATACGCGGTCGCCGCCTTGGTTTCGAACAGCTGCGCGGCCGCCGGCCATGCCGACAGCGCCAGCAGGGAAAGGAGAAACAGGCCTGCCAGAGAAGAGCAAAAGCGGGATTTGGCAATCATAATGAGACGTTATCAGGATCGTCCGTCACGGATCAATCGCCAGGAAGCATGGGGCACGACTTATCCGCCGTGCTCGTTACCCGCTCAGTCACGAACGGCGAAAGCGTCGGGCGCTCCGTGGCTCCAGGCCGCTTCCAGCATGCTGTCCACGCTCATGCGGCCGTCCTGGTAGAGGTTGACCGAATACCAGTCCTTGCCGTCGAGCTTGGCCTGATCGATCTCGACGCGGCCATAGGCCTTCAATGCGGCTGCGACCTGCTTTGCTTCCCTGGCGTTGTCGAATGTGCCCGCGGCCACATAGTCGTCGGAGGGATCCTCGGGCTGCTTCCAGCTGCCGAGTGCCGCCGCAGGCCCTTCCAGCGCAGCAAAGGCACTTGCGGAACGCTCCGTGCGCTCCGCGGCGTAGTTCAAGCTCGCCATCTCGAAGGGCAGTTTCGGGCTCACCGCGAAGTCCGGCCGTGCTGGCGCGATCGGGCCGAAATCCGGCAGCGCGACGCTGTCGGCCCCCATCGGCTGCGCAGCCAGAACAGGTTCGGGCAGCGACGTCGTATCCGTCAACTGGCCGGGGAATGGAATGGCAGCAGCGCTTGCACGCACTGGCGCGCTGGGCATGGAGCCGTTCATTGCAACCATCACGCCGGTCGGCAGGCCATCGGACGGATCAGGTATCCTGTTGCCGGGATGATAGGAGGCCATGAGGTAGGAATCGTCATTGCCTTCGAGTGGCGCGCGGCCGACATAGTCGACCTTGACCCGGGCAGTGCCGATCTTCGAATAGCCGAGCATATCGGCGGCGCGCTTGGACAGGTCGATCAGGCGGCCTTCGTGATACGGGCCCCGGTCGTTGACGCGCACGATGACCGAGCTGCCGTTGTCGAGATTGGTGACGCGGGCATAGCTCGGCAGAGGCATGGTCGGATGCGCGCCGGTCAGCTGCGACAGGTCATAGACCTCGCCGTTGGCGGTCATGCGGCTGTGAAAGGCCTCACCGTACCAGGAGGCGGTACCTGTCTTGGAGTAGCGCTTTTCTTCCTTCGGGTAGTACCACTTGCCGCGCACCTGATAGGGCTTGCCGAGCTGGTCGCGCCCTCCGCCGCGCCGCATGGCGACACGCG
GGCTGGCCTTCACGCCATATTCGGATTCGGCGAAATATTCCTTCGAGCGTTTCTTGCTCACCATGCTCTTCGGTTCGGACGAAGCACAGGCGGCAAGAACGCCGACCGAGACGGCCAGCAGCGCATAGGTGGAAAAACGACGAAGACGCGCCGGACTTGGCATACTGGAACTGTCCCCTCCGAACAACGGCAACGCTACTCAAACACGGGCGAAGCCGAAGCGCTTCGTGTGACAAACCTGTCCGACGCCCCGATCCCCAGAGCCCCTTTTTGCGCAGGAAATGCTTATCACACGCTTAACCGACTATGGTTCGAAACCGGCGAGATTGTGGCGAAATGACGGCATTGTGCCGTGACCCCCTAACCAATTGTGGAGTCTCCAGCTGGACTGGCGAGGTGTTGCCGCATCTCGAGGCCATTCAGAGCCACAGTTTCCCTTCCGCAGCAATCACGCCGCTGGCACTGATGCGTACCGCAAGGCCGTCGATTTCAACTTCGATAAGGCTCGGACGGCCGAGTTTCGTGCCCTGCTCGATAACGACGCGTTCAGGCGAAGCAAAACCGTGGTGGACGAGATGGGCAGCCAGGGGGCCTGCTGCCGTACCAGTCGCCGGATCCTCGCCGATACCCATTGTGGGATTGAAGAAACGAGCATAGGCGAGCGTCTCTACGGATGGGGGATCAAGGCTGAAAACATAGCAGCCTTCGGCTCCGGACTGCCTCAGGATCGCGCCGAGCGCGTCAACATCGGGCCTTGCGGCATCGACGGCTTCGCGATCCCTGAGCGCGACCATCAGATGGCCCGCGCCGGTATCGACCACCTGTATCGGCGGGCCGTCGCCATTGACGGTGTCTGGCGCAAGGCCAAGTGAAGCAGCCAGGCCGGCCAGATCTTCCACCTGGCTTCCCCATCGGGCGGGGGATTGTGCCATCGTCACGCGGATCGGTTGGCCGGGCGTCGCGTCGATCGATAAAGGAAAAAGTTGCCCGCCGATCTCCTGTGTGAAGTCTGTTCGCCTTTCCTTCAACTCCAGCCGTCCCGATTGCGCAAGCCAGAGCCATGCGCCGAGGGAGTTGTGTCCCCCTGCCCCGAACACCTCGTGCCCCGCAGCCGTGAAAGAGCGCAGCCGCCGGGTTGCGCCGGGAAGCGTCGCTTGCAGGACGAAAGTCGTCTCGACCAGGTTGAACTCACCGGCGATGGCGGCCAGCGTGTCGTCGGGCAGCTCGTCGCCGCCTTGGACGACGGCCAGCGGGTTGCCTGCAAGTGGGCTGGCGGCGAAGACATCGATGATGGCGAATGGCAATGACATTGCTGTTCTCGCGAAAGTTGGCGTTCAAGCGTTCAGTTTCGAGAAGAATGCGCGGATATCCTGCGCCAGCAGATCCGGTGCATCGGACGCGGCAAAATGACCGCCGCGCGGCATCTCGGTCCAATGCACGATGTTGTTGGCACGTTCGGCAAAGCTGCGCACGGAGCGGAAATCATTTGGGAAAACTGCGACGCCGGTCGGTGTCGGGTTCATCACTTCGCGGTAGCCGGCACCCGTCTGCTGGTCTTCGTAGTAGACGTTCGCCGCAGATCCACCTGTTCCCGTGAACCAGTAGAGCGATGCCTGCGCCAGGAAACGATCACGATCGACATTGTCGATGTTGTTTCCTTCGAAGCCGAACCAGAGTTCTGTGTTCCAGGCCATCTGGGCGACAGGCGAATCGACGAGCCCATAGGCAAGCGTACCGGGACGCTTCGCCTGTATCTGATGATAGCCGTTGTACTTCTCGAAATTGGCGAGGATCGCCATGCCCTCCATCTCGAAAGGCGAGAGCCTGTCCATTTCGCCATCCGCACCGGCGGGAAAGGCGAAGATCTGCAGCACATGCGTGCCGACGAGACCGTGCGGTTGTAGGATACCCAGTTCGCGCCCGATGCCGGAGCCAATATCGCCGCCATGGGCACCATAACGGTCATAGCCCAGTCGTTTCATGAGCGTGTCCCATGCTCT
GGCGATCCGGGCCGAATCCCAGCCGCCTTCACGCAGCGGACTGGAGAAGCCGAATCCGGGCAGCGAGGGGATCACCAGATCGAAGGCCTGTCCGCCACTGGTCGGGTTGGTCAGCGGCTCGATCAGGTCGAGGAACTCCATCACACTGCCCGGCCAGCCGTGGCTCAGGATCAGCGGAAATGCATCGGGATGCTGCGACCTGATGTGCAGGAAATGGATCGGCTGGCCGTCGATTCCTGTGATGAATTGCGGATAGCTGTTCAACCGGGCTTCATGCTCGCGCCAGCGGAAGTCGTTCATCCACCGGTCGACGAGTTCCATGACAAAGCCGACCGGTTGCCCACGCGAATGATCGTTGGTCGTGGCATATGGCCAGCGTATCTTGCCAAGCCGCGCCTTCAGGTCGGCAATCTGCGTATCGGGAATGGCGATCCGGAACGGCTGTATGGCGGTCGGGTCAGACGTGGCGACATGGTTGGCCTGGGACATTTTGTTCTCCTGCGGGGTGTTTCGATGCCAAGGAGTGTCGTTGTCACCGCGCACGAAAAAAATGGAGATTTTGCCTACGAGCCTATAGAATATCTATATGGTTGCTCTGAATCGCATTCCCCTGAACGGCCTGCGTGCCATCGAAGTGGCGGGCCGCCTCGGCTCGCTGGCAAAGGCGGCCGAGGAGATGGGCATCAGCGTTGGCGCGGTCAGCCAGCATGTCATCCGAACCGAGAAGCTGCTTGGGCGTCCGGTTTTCGAGCGCACAGCGCGTGGCCTGACACCGACGCCGTTCGGCGCACGGATGCTGGCCTCGCTGGAGCCGGCGTTCCGTTCTATCGCGCAGGCGCTGGATCAGGCCGGCGCGACCGATCGTACGGTGCTGGCTGTCACCACCACGCCTGTCTTCGCCACCCGATGGCTGGTGCCGCGGCTCAGCGCATTTCAGCGGAAGCGACCCGAAATCCAGGTGCGTATCGAAACCGGGCTCACATTGACCGACCTCGATCGGTCCGATCTCGATGTCGCACTGCGCATGGGCCGGGGCACGTGGCCGGGCACCCGTGCGGAATTGCTTTTGCGGCAGCGCATCTTTCCGGTCTGCGCCCCGGCGCTCGCCGCAAATCTGAGGACGCCGGCCGATCTGAAATCGGCCCCGGTGATCCGCTATCCTGCCATGGAGAACTGGACGGACTGGCTGGCCCCGCATGACATGGCCGAGGCCGATCTGCCGCAAGGGTTGAGTTTCTCCGACGCATCCCTGTCGCTCGAGGCGGCAACGGCCGGGCTCGGCGTGATGATCGCCTGGCAGGCGATCGCTGCCGACGCACTTGCGGATGGCAGGCTGGTCAAGCCGTTCGGGTGGGAGGTCGAGACCGGAATCGACCTCTGGTTCGTCGCGTCCGCGGCCCATGCCAACGATGCCAAGATCGTTGCCTTCAAGCGATGGCTGAAAAGCGAATTGGCGGGGCAGGAATGATGATGCGGTTGCCGATGTTTTCGCCGGTCGCTCCCCCGGATCCCGCTTGACGCCGATCGTTCTTAGCCTAGACGCCAGCGGTGCGATCCTCACGCAGAGAAGCGATCCATCCATGGCCGGGATGCATTGCCTCAAGCCAGCCGGTCAGGTCAGCTCTCTCCCCAGCATCGAGGCGGGGCAGCGCGGTACGAAAATCGAGATCATCCTTTTCCCGACGGTGCTTCGCCTTGAACAGCAGCACGATGGCCGGCGCCAGATAGGCAATTCCGGTGGAGTTCCTGCGGATCGCCGCAGTCCTCGGCACCCGGATCGCGGGTTCCCGTTTGTAGATCCAGAAATCAGGCGTTCCCCGCTCGATCATCATGTCCACGCGCCAGAAACCCGCGGTCATGTCGGCTCCCCAAAGCTGCCAGAGATCAGTGGCAAGTGCTGTCTTGGCAGGATGATGCGTCAGGTTACCGTCACGGGCGGCAAAGAATTCCAGATCCGACAGGAGATCGCGACAACGTCCGACCTGGTCCGGCAGAACAGC
AAACTCCAGATCCTCGTGTTCGCGGGTAAGACGCCCATGCCACAGGTCCAGGGCCCAGCCGCCAACCACATACCAACTGGGATCATCGTGCCCAAGCCGAGAAGCCAGATCCTGTGGCGACCATGGTTCCCAGGCATCTTCTGGAATAGGTTTCATGCCGACACTTGCATCGATTGTCGGGAAAGTGGCGGAGGGAGTGGGATTCGAACCCACGGTGACATCGCTGCCACGCCGGTTTTCAAGACCGGTGCCTTAAACCGCTCGGCCATCCCTCCAAATAGTTGAGTTCAAAGCACTTTTTGCCTTTCTGCGGTTTGCTCAAAAGGCCGTTTGCTACGCACTTTGCAACTATTTAGCTTTTCGACTTGCGCTTATAGCGGCCTGCAACGCGGCGTCAACTTGCTCGGCAGCATCTGCTTGCATTCCTGGCATGACATGCGAATAAAGATCGAGCGTGATGCCGATCGTGGAATGCCCCAGACGTTCACTGGCCACTTTCGGATGAACGCCGGCAGCAAGCATCTGCGATGCGTGGGTGTGTCGAAGATCGTGGAAACGGATACGCGGCAGCGAGGTCTTGCCCACGATGCGCACCCACTCATGCGTCAAGGAACGCGGCTGGATCGGCTGCCCGTCATACTGCGCGACGACGAAACTGTCCCCATCTGGACGAACACCTAGCCGTAGCTGCTCCTCTGCTTGTCTGACGCGATGGGCGCGCAATTCCGCCAGCACGGTTGACGATAGCGCTACGGTGCGTGCACGGCCTGATTTCGGTTCTTTGTACCTAACGCCGTCCTTTGTCTGTTCAGCGCTCTCCACAATTGAAAGTGCCTTCAAGTTGTCGCTGACATTACGCCAGCGAAGCGCCAGGATCTCCCCTCGACGCAAGCCGCACATGACGGCAAGCACAGTGGGGATAAACATGCGCGTCGGCCGAATTGCCTCCAGTAGCTCGGCAGTCTGTGCGGCGTCATAGGCGAGCATCTTCTTGCGCTCGACCTTCGGCGGCGTGGTTGCATTCGCTGGGTTCTTGCTGAGCCGTTCCCAGGTCACAGCCTGTCCCAGGGCTTTGATAAGCACTCGACGGGCATGGTGGACGGTCCGGGGTGAAAGACCGCCCTCCCCGTCCCTGCGGCCAGCTGTGAGCATCGTAGTAAGCGCGGTGTCGATCCGATCAGTCTTAAGCTTCGAGAGGATCACGTCCCCGATCTGCGGCGCGATGGTCTTCCGGCAAATCTCGGTGTAGCGCTCAAGCGTCTTTGGAGCGACGGACGGACCAACGAAGGCCAGCCATTCGTCTAGGAACTGCGCCACGGTCTGCTTGGATGGTGCCACATAGTTGCCGGCTTCAAGCTCGGCAATCAACCGCGCGCATTCGCTGTCAGCCTGCCGCTTCGTGCCTTTGAAGCTGTGCCATTTGCGTTTCTTCTTGCCGGTTTTCGGATCGCGTTCTCCGACATCAAGCACGATGGCCCAATGGCCGGGGCTCCGCTCGCGGACGTGACCTTTCATTGGTCCTTTTCCTCCATCCCAGAGATCATGCCCTCCAGCTCGCTCCGCTTCTCATCAGCAAGGCGCATGGCCTCCTTTAGATCGCCTCCCCTGGCGAGCGCCCACGTCTCAAGCCGCGCCACTTGCGCCCGATGCATCAGGATATTGGTTCGCCTATAGAGCTTTCGGTATTCGTCTTTCAGGAATTCCACTTCATCGGTTTGGTCTCCTGATTTGTTGCGTTCTTTCCATGCTGCGGTTGTCGACTTCATCGCCTCCGCGACTTTTCGGAGCGCGCTCATCAGTTCCTTTTCCTGATCCTCGTAACGCATCGCGATCTGGCATAGACGCCGGATCGCCTCCGATCTGTTCGGCACGCGATGTCTGAATTGCCAATCGTCGATCGCCCCAATTTCGTCTTTGGTGATGACCATCTGGAGGCGGATGCTCTCACTGTCGCCGAGCTTTGGTCGTGCCATTCGGGGATAACTCCACTTTTCATGGCA
AAAATACACACGAAACCTTGCGCGTCAATCTGGCGCGGGATAAATAGGTATTGTGTGTATGATTGGCATGAACGAAAGGAGAAACATCTTGACGCTGGATGAAGCACTATCCCGCCCGACCATTTCCGTTGTGGATGCCGGAGCGATTTTTTTCGGCCTTGCCCGCAACGCCAGCTATGACGCCGCCAAGAGCGGCGACATTCCGACCATCAGGATAGGAGGTCGCGTCTTGGTCCCTGTGGCACCTTTGGCCGAAAGCCTCGGGCTCAAGACTACATTCGGAAGGGCTGCGGCATGAGCAAGCTGTTCTTCAAAGTGTCCTGGCAACACAACATTGACCGTTACCGCGACTTGCAGAAACTGGCTCGAAAAGCCGGCATGTCCCTTCAGGTTAAGAACGTGAGCCGGACCCGCGGCATACCCGTCTTCCTGTTTTTCCTCACCGAGCCGGGCAAGAACGGCCAGCCCTTTCACAGTCTCGAAGATGCAGCTTCGGCGCTCAATGCGATGGCAGGCATGGGGAAGGTCTAAATGCGGTTACTCCTCGCCCTCCTCTGCTTCTATCACGCTTGGCGTGCCCGCCAGCACTCATCGGCCTATCGCCGACACATGGAGCGCCGGCAGGCGATCCTCCTCCGCACGTCGTACCCGGAGAAATCTGCATGAGAGCTTTTGCAGCCGTGCCCCCGACGGTCTGGCAAACCGACATTAAAAAGCTTCGTGGTGAGCCCGAAGCGGTGGCGGTCTACTTCCACCTCCTAACCAGCCCGCACACGAATATGATCGGCATGTACCCGCTCGATCTCCAATATGTAGCGATCGACTTAGGGAGCCCCTTGGAAGGGGCTTCGAAGGGGCTTCGAAGGGTCTGCGAGGCAGGTTTAGCCACCTATGACGAAGTCAACGAGATCGTTTGGGTTCACGACATGGCAATCTCGCAGGTGGCGCCCCGCCTGAGCCCGAAAGACAACCGAGTGCAGGCTGTCGCCAAACTACTCGCAACCCTTCCTATCTGCCCGATAACATTGAGTTTTTACACTCGATATCGTGATGTGTTTCACCTTCGGGATGCCCTCGTTTTGGAGGATTTCGAACGCGCATCCCGAAGCCCCTTCGAAGCCCCTTCGGAGCCCCTTCGAAGCAAGGAGAAGGAAAAGGAACAGGACAAGGACCCAGGACAAGGAAAGGAAAAGTTGGGCTCTGAGGGTGAAGAACTTGGCGCTACGCGCGTGAGTGATGAAAGAGAGGATCCGTACGAGCCGAGATCGTCCATCGAAGACGGAAGACGGTTTCTCATCGGTATCGGCGTTCCTGTGTCACGAATGGAGACGGCGCTCCAACGACTGATGCGAGGTGCGCTCTTCCCGTGCGATGTCGATGAATGGAAGCATGAGGCTGTGGCAATGCGAGGTGCGGCATGACCAAGCAACTTGAACTGTTCGCCTGGGCGGATGCTCGACCATCAAACGTAATCGATGCGGTACCCGCTCTTATCCGCAAGGCTGCGATCGAAACGATGTATGCACCGCCTCGCCCCAAGGACACGGGCGACGTCATCTCCTTAACTCCGAGGGCCGCATGAGCGAAGCCATCCAGAAGGCGGCTAAGTGGCTTGCCGAAACGCCAGAACGCCAGCGACCACGGCACTTGCTGGCCCACCTGCAGCATGAGTTCGGACTGACGGCCGCCCAGGCCGTCGCGGCAATCCGAGAAGCGAACTTAATCCGAGCGAGGGCACATTGATGATCGGTGATGACGACATCCCGGAAGACGTGAAGCGCGGTGTTGAAAAAGCGCTGTTCGGCAATCGAACAGGCTATCCGACCGTTTCAGTGAAAAAGAAATACCGATTTCCGGGGCCGAGCGAGCGTCAGTTGCGTGAGCGCCAAGACGCGCAGATGTCGGTCTATCAGGTGAAAGACCGAGGTGACGATTGAGCACCAGCCGGAAAGGCAAACTCGACGCCCTGCTCGACGGCCTGGGCATAAAGCTGGTGCCGG
TCAATCGCCGACGCCGAGCTGCGCAGAGCCATGCCCGAGCCACGATGCACGAGATCTGCAACCAGTATGACGAAGGCCATCTGACATTCGTCCTTCGCTGCATCAAGCAGACGAAGAACAATCGCGATGAGCTTTGGTCGGAAACGATTGGCGCAATCTCCGACATTCTCGCACAGCGGCAGGACTGGGCCATGGAGCGCGCTGGAGCAGTCCTTGAGGCCTTCGACCAAATCCCCCTCGGGGTGTTGCGTGGTGAAGCCGTAGCCCGTCGCCCTTGGCCTGTACGGGCCACACTCCGAACACTTATCTACAAACGCCTGGAGGCCATCCTTGACGAACCGGAACAGCGCCTTGCCGTCTAAGCCAATCGACTATGCCCTCGAGGCGGCCCTCGCCGATTTAATCTGGATCGTGAAAGGCCGTTTCGTTGAAGCGGCCGACACGATCGCCAACATGGACGTTGGGCAGCTTCGGCCATCTAAGGCCCGCTCGCTCTGGCCGGCGTATCAGGTCGACAATGTTGGCGGCCATCATCCCGGCTACGGCATCAATGGCAGCAACGTGCAGTACCGTCCTTCAAGCATCACGATATCGCGCGCCGAGGAAGTCATGTACGGTTGGCTGCTCGACTATGTCGACGGCGACGATGCCCGGAGGTTGCTCGGTCAGTGGGCCGCATGCCAGGCAACGCCGCGGCTGGCCGGATCATTCCGGGCCTTTTGCAAGAAAAGTGGCCTGTCGCGTAGTACAGCAGAGCGCCGTGTAGACGGTGCGTTCAAAGCTGTTGCGGCCGCAATCCTTAAAAACGCTCAATCGTTACAAGGCCCTAGCTGGTCCAGGGTGATGCCAATGATGCCGAATTCGCGTATTGATCCGGGCAAGATGGCGACGGTCACGCGGTGGATGGCGCCCGATGCAAAGCCGAGGCACATTCCAGAGATGCTTGAGCCTTTACGAGGAGAGGCAGCATAGCGCTCGCTCCTCAAGAGCCGAGCGCCGAAGTTCGTTCCGCCAGGCTCGCCAGCTTTTTGACGAAGTCCTGCGTTCCAAGAGGCTGCAAGGTCAGGTCCGCGAGAGCGGACAGAGTTGTGAGGCGATGCAGTGGCTGAGTGGTCTAAAGCTGAAGCCCGAAATGATAAGGGCCTGACCGTCTAGTTTCCTGGGCTGGCGACGGATAGCCAATAGCTTCGGGAACCGTGGGTTCAAATCCCTACCTGCATCGCCTCGCTACTCGCAGATATCGATCTTCAACCGGCTGAGTTTTTCGGCCCGTTGCATTGCCGCACCTTCACAAAGGACACCAGCATGAAACGCAGGTCATTCCTCGCGATGCTCGGCTTTGCGCCAATCGCAGGCGCTGCCACCGTGCAGGCTGTTGCAGCCAAACCTTCGCCGTCGGAACTTCTGATCGACGCCACGGGTGCGAACAGCCAGCTTGCCATCGACCTCGAAGCGCTGAATGCAGACGCCAGTCGCTTCCTTGAGTGGACGTCCGAAGGGATGCGCCTCACCGTGGCAAGGATCGACGGTGTGCGCCCCTATCTCTTCGACGCCGATGGGCGGTGCCTGAACCCGTGAGCACGAAGCTCAAGACCCTTGGCCAGCGTGTCGGCACGGTAAGCTCACGCTTCGCCAGCCAGGCACACGACGGCAAGGAACGCGACAGGCAGCGCCAGCAGACCAGCGAATGGCGCGCCTGGTACAAGACGGCCAGATGGCAGAAGCTTCGCCGCAAGGTGCTCAAGCGCGACTTCTACACCTGCCAGCGCACTGGAGTGCTGCTCATCGGCAAGCACCCGGCGCCCAACAGCCCAGTGGTCGACCACAAGAAGGCGCACCGCGGCAACCCGGAATTGTTCTGGGATGAGAAGAACCTACACGCGGTGAGCAAGGCCTACCACGACAGCACAAAGCAGGCCGAGGAAGCCAAGGACATCAAGGGAGTTTGGTACTGATGGTCTACCGCCCGCTGCCTCCTGTTAATCCCTGCCCGAACTGCG
GTGCCAAGCCGACAGTGCGCTATCGCTTGTTCAAGGGATGCCGAGCCGAATGCAAATGTGGCGTCTCTGGTCCATTCGTAGCCTGGAGCGGTTGGCCAGAGCACGATGGACTATCCAAGTGGCCAGTCGTAGCCGGTGAGGTGGATATGCCCCGGCCACCAGCGCCAGCAGCAATGAGGGTGAAGGACGTACGCCCCGCCGATCTCATCCGCGAGGCAGGCCACCTTCAGCGCTTGGAGATCAAGCCAGGAGAACGCTTCGTCCTGACCTGTGACTATGCGCTGAGCGAAGCTGATGCTGACCGAGTGCGACAGGCATGGGACGCATTCATGGGTGACGTCCCGCTCCTTGTCCTCACTGATGGAATCTCGCTCGGTGCAGTGTCTCATCCCGAGCCCGTCGAAGCCGAGCCGGCAGCGTGACCAAGGGGGGGTATGTCGACATAGGCAGACCCTCTCCCGCCTAGACCCGCGCCCCATACATTCGCAGAATTTTTTCTCTCGGCCGGGAATTCTGGCTGAAATTCGGTGATCCATGGAAAAACCCGAGGACAAGAACCCGTCCGGCAAGCGTGGCCGGCCGGCGTATCGCCCTGCGCTCAAAGATCGGCAGACCGTCGAGCAGATGAAGTTTTGCGGTGAGAGCGAGAACACGATCGCCAGGTCTCTGGGCATCGACGTCGGCACGCTGCGCAAGCACTTCGCCGAAGAACTTGAGAACGGTTACGCCAATCGGCGCCATGAGGTGATTGGCCTCCTTTTCGAAGCTGCAAGGTCGAAGAATGTTGCCGCGATCAAGAAGCTCGAGGAGATGGGCAGAGTGGCGAATGCGGCCGAAGCCGTGAATGGCCGTGGCAAGGGTGAGCGCCAGCCTAAGCTTGGCAAAAAGGAGGAGCGGCAGATTGCTGCTCGTAATGTCGGCGGCAAATTCGCCACGCGACAGCCGCCTCAACTGGTCGCGGTCAGCGGCGGCAAGAAGTGAAGGAGTGGAGCACCGCTTGCCCTGACTGGCGCGAGCGTATCGTCGAAGGCAGATCGCTGATCCCCTTCGATCCGCTGTACCCGGATCAGGCCGAGGAAGCGGTCAACATCTTCAAGTCGCTGCGTGTCGTTGACGTGGCCGGCAGCCCCACTTTCGGCGAATGTTGCGAGGAGTGGGTTTTCGACTTCGTTCGCGCGATCTTCGGCGCATACGACGCAGATACGGGCCAGCGCCAGATCGAAGAATTCTTCTTGTTGATCAGCAAGAAGAATTCGAAGTCGACAATTGCCGCCGGCATCATGCTCACGGCGCTGATCCTCAACTGGCGGCTGTCTGCCGAGCTGATGATCCTGGCACCGACGATCGAGGTCGCGAAGAACTCGTTCGAGCCAGCGCGCGACATGGTCTATGCCGACCCGGAACTGCTCGAACTCCTGAATGTGCAGGAGCACATCAAGACGATCGAGCATCGCGTTACGAAGGCAAAACTCAAGATCGTCGCCTCAGACAGCGAAACGGTCAGCGGCAAGAAGGCTGCGTTTATCCTCGTCGACGAGCTCTGGATATTCGGCGAGCGTGACGGCGCGGGAGCGATGCTGCAGGAGGCCACCGGCGGCCTTGCGTCGAGGCCTGAGGGCTTCACCATCTACCTGTCGACACAGAGCGACAAACCGCCGGCCGGAGTGTTCAAGGAGAAACTGGAATACGCCCGCGATGTCCGCGACGGCGAAATAAAAGATCCGCGCTTCCTGCCGGTGCTGTACGAGTTTCCGCCCGAGATGATCGAGGCCGAGGAACACAAGGACCCGGCCAACTTCTACATGACCAACCCGAACATGGGGCGGTCGGTCAGGCAAGACTATCTGGAGCGGAAGCTGCGTCAGATCCTGAAAGGTGAAGGCGAGGAAGGCGAAGACATCCAGACGTTCCTCGCTAAGCACCTCAATGTCGAGATCGGGCTTCGAAACCGTCGCGATAGATGGGGCGGCACCGACTTCTGGTCTGATGCTGCTGAAGAA
GGACTTACGCTCGAAACGCTGCTGGAACGCTGCGATGTCGTCACGATAGGCATCGACGGCGGCGGCCTGGATGACTTGCTCGGGTTTGCGGTGATAGGCCGTGAGAAGCTCACGCGTCATTATTTGGTGTGGTGTCGGGTTTGGGCGCATCCGATTGTGCTGAAGCGTCGGAAGGACATCGCCGAGCAGCTGCGCGACTTCGAAAAGGACGGTGACCTCGTGTTCTGCAAGACATCGACGCAGGACCTTGATGAGGTTGTCGCGATTTGCTGCAAAGTGCGCGACGCTGGCCTCCTCCCCGAAAAAGCGGGCATCGGCGTCGACAAACTGGGCCTTCCAGCCCTCTTGGATTCCTTGATCGCTGCAGGCTTCAAAACGGATGAGGAAGGCGGAACGGTCACCGGCATCGGTCAGGGTGGCTTCCTCAATGACGTGCTCGTGGGAGTGGCCCGCAAGCTCTCCGACGGCTCTCTGAAGCACGCCGGGCAAGCCGTTTTGCAATGGGCGGTTAGCAACGCAAAGGAAGTGCTGAAAGGATCTGCCCGCGCGGTGACCAAGCAGACCTCGGGAACTGCCAAAATCGACCCTTTCATTGCCTTGCTGAACGCGGCGAAGCTTATGAGCCGCAACCCGGAGGCCTACCGGAAGAAAAAGGCGACAGTCCGCATCATCGGCTGATCCACCCACAACCCGAACCTTTGGAGGTCGTCATGTCCGTGACGCGCCGCGCTTACTCGTCTCTCACCATCAAGTCAGTGAGTGATGAGAAGCGCATCATCCGTGGCATCGCGACAACGCCGACGCCTGACCGCGTTGGCGACATAATCGAGCCTCTTGGTGTCAAGTTCACCAATCCGCTCGCGTTGCTCTGGCAGCACCAGCATGACAAGCCGATCGGCACTGTCAAATTCGACAAGCCGACGAAAGCTGGCATCACCTTCGAAGCCGAAATCCCGATCGTAGAGGAAGACGGCACGCTGCGCGACCGCACCGAGGAAGCCTGGCAGTCGATCAAGCTCGGGCTCGTGCGTGCTGTGTCGATCGGTTTTCGCGCCATGGAATACGCTTTCCTCGACAGCGGCGGCATCCATTTCACCAAGACCGAGGCTTACGAGCTCTCGGCCGTCACCATCCCGGCGCAGCCCGACGCGGTGATGACCTCTATCAAGAACATGGACGCGGCTGGTGTCGCGATCATCAAGACCTTCGACCCGAACGCTCCCGCCGCGACTGGCGAACTCCAGCGTCCGACGAAGACCGCCCCCGGCGCTACGGGGAAATCAACCACTCCCGTCAACTTGCGCCCCAAGGAGGGCACCACCATGAAAACGCTCGCCGAGCAGATCACTGCTCTCGAAGCCAAGCGCGCTGCGCATGCGGCCCGCATGGAAGCCATCATGCAGAAGTCGGCCGATGAAGGCCGTTCGACTGACCAGGCCGAACAGGAGGAATTCGACGGCCTCTCGGATGACGTTGAAGTCATCGACGGCGACCTCAAGCGCCTCCGTGCACTTGAAAAGGCCAAGGCGGCTTCCGCAACCCCGATCGTCGCCAACCAGATCAAGACGGCGGAACTCGGCACTGCCATCCGCACCGGCGTTGCTCTGGAGCCGGCCAAACTGGAAAAGGGCATCCGCTTCGCCCGCTACGCGAAGTGTATCGCCATCGCTACCAAGACGAACCAGCCCATCACCGCTGTCGCCGAGACACTGTATGGCAAGGCCGATCCCGAGTTCATCGAGATCACCAAGGCCGCCGTGTCGGCGATGACCACTGGCAACACTGGCCAGCTTGTCGGCAACATCGGCGGCTTCGCTGACTTCATCGAGTTCCTGACCCCTCAGACGATCATCGGCAAGTTCGGTACCGGCAGCATCCCATCGCTGTCGAAAATCCCGTTCCGGACCCCGATTATCGGTGAACTCAGCGAGGCTGTTGCCCAGTGGGTTGGTGAAGGCAACGCGAAGCCGCTCACTCGCACCACTTACGGCCGCACCACC
CTTGATCCGCTCAAGGTCGCTGCGATCGCGGTGCAGACCATGGAACTCATCCGCGACAGTTCCCCGTCGTCTGACGTCCTGATCCGCAACGCGCTGGCCAAGTCGATCGTAACCCGGCTCGATCTGACCTTCTTGGATCCTGCCGTCGCTGCAATCGCCGGCATCCGCCCGGGCTCGATCCTGAATGGTATCGCTCCTGTTGCGCCTAGCGCCGCGACTGGTGCTGACGGTGTCCGTGAGGATGCCCAGAAGATCATGGCCGCCTTCGTGACCGCGAATAACCCGCTCACCTCCGGTGTGTGGCTGATGTCGGGTCTCACGGCGCTCAAGATCATCGGCTTCCTCAACCCGCTCGGCCAGCGCGAGTTCCCCGGCCTAACGCTGCAGGGCGGCACGTTCTTCGACCTGCCGGTGATCGTGTCCAACTATGTCGGCGACTACCTGACCCTGGTGAATGCCGAAGACATCCACTACGGCGACGAAGGCGGCATCGAGGTCGCGATGTCGACCGAAGCTTCGCTGGAAATGGACTCGGCACCGACCCACGACTCGGACACCCCGACCGCTGTGGAACTCGTCTCGATGTTCCAGACCAACAGCGTTGCGTTCCGCGCCGAGCGCACGATTAGCTGGGCGCGGCGCCGTCCGACTGCCGCGGCGTACGTCGTTCCGGCCTGGAACTGATCAAGGCCATAGCACCCCGGCTGCCGCGAGGTGGCCGGGGCATCCTTGCAATCGAGGCCAGCCATGAAAAAGTCCAGCTACATGACGCGCGCGCTCCAGTGCCGCGATCCCCGCTTTGCGGTCATCCTTGGCAAGCTCGGCTATGAGGCGGCAGACATGCGGGCGGCTCCGGCCGGTGGACAGGTCGACCTCCTTGCCGACCTGCGGGCCGAGTATGCGCTGATCGTCGGCAAGAAGCCTTTCCACGGCTGGGATGCCGCCACTCTTCGCGAGAAGATCGCAGCGGCGAAAGGCTGATCATGCGCCTGTTGGGTTTCAACATCACGAGGGCTGCAAAGTCGCTCTCGCCAGTCTCGCCGCGGGGTGGCCGCAGCGGTTGGCGCATTCTCGAATCGTTCGCCGGTGCCTGGCAGCGCAATGTCGAGGTGAACCGCGACAGTGTTCTGTCCAACTTCGCCGATTTCGCCTGCCGAACTCTGATCGCGTCCGATATCTCGAAGCTGCGCGTCAAACTGGTAAAGAACACAGGTGATGACGGCGACGACATTTGGCAGGAGACGAAAAATCCTGCCTATTCGCCCGTCATCCGCAAGCCAAACCACTTCCAGAACCGAATTCAGTTTTTCGAAAGCTGGGTGCTGTCCAAGCTTCAGAGCGGCAACACCTACGTGCTGAAGCAGCGCGACGGCCGAGGTGTAGTGGTCAGGCTCTATGTGCTCGACCCGAGCCTCGTCACTCCGATGATCGCGGATGATGGAAGCGTTTTCTATGAGCTTCAATCGGACAAGCTTCTGGGCTTGGGCTTGCCGGAAACGATCCGGGTGCCAGCGCGCGAGATCATCCACGATCGTTTCAACTGCTTCTATCATCCGCTTGTCGGCATCTCCCCGATCATGGCCGGTGGTCTCGCAGCCACGCAGGCGATCGCGATCCAGGATGCATCGGCGACGTTTTTCCAGAACGGTGCCCAGCCGAGTGGTGTGCTCTCGTCGCAACTTGAGATGGACGATGCCGACGCCGAAAAGCTGCAGGCCTATTTCAACGAGCAGTTCGGCGGGAAAAACCGTGGCAAGATTGCGTCGTCGGCGGCGGGCTGGCGTATCATGGCATTACCGCAAAGGCCGTGGACAGCCAGCTTGTCGAGCAACTGAAGTGGACGGCCGATGTCGTCTGTTCGGTCTATCATGTGCCGCCATACAAGATCGGCATGGGTCCGGTACCAGCGAACACCAACGTCCAGGCTCTCAACATCGAATATTATAGCTCGTGCCTCCAGTCGCTCATCGAGAGCATAGAGGCCGGTCTCGACGAAGGTC
TTGGGATGGATGGGGAGACCATCGGCACCGAGTTCGATACCGATGCGCTGATGCGGATGGACACGCTGACTCAGTACGAAGTCGCCATGAAGGCCAAGGGCATCGGCACCCTCGACGAGCAGCGGCGCATGATCGGCCTTGGCAAGATCAAGGGTGGCGACACCACGTACTTGCAACAGCAGGATCACAGCCTTGCGGCCATTGAAGCACGCGACCGTGCGCTGATCGACGGTGTCAACAATCCGCCGGCGCCACCACCCGCGAATGACAATCCTCCCGAGGTGCAGGCGGCCTATTTCGGCGCGATGCTGCGCAAAAACCTGATGGGTGATCGATGACACAGGAAGAACTCGACGCCCTGTCGGCGGCTATGGCCTCCGTCGTGAAAGAGCACGTCGACACGGCCACCCGCCCGCTGCTGGAGCGTATCGCCAGCCTTGAAGCGCGTGAGCCTGTGTCAGGCACCAGCGTAACCAGCGCCATCATCGATCGAGCCGGCAATCTGGTGCTGACGATGTCGGACGGCTCGACGAAGGATCTCGGCCCAGTCGTCGGCAAGGATGGCGAGCCGGGTAAGGACGGCAGCGACGGTCTGGCCTTCGAGGACATGACCGAGGAGCTCGAAGACGATGGCCGCACGATCATCCGCCGCTACAGCCGAGGCGACCAGGTCAAGGAGTTCCGGCATCAGGTTTCGGTGGTGCTCGACCGTGGCGTCTACAAGGACGGCCGGGAGTACGAGCGCGGCGACGGTGTGACCTGGGGCGGATCGTTCTGGATTGCGCAACAGAAAACGACAGAGAAGCCCGACGGCGGCGATTGCTGGCGTCTGGCTGTCAAGCGAGGCAAGGACGGCAAAAGCGCACCAGTGACCGCGCCTGTGGCGAATGGGCCTATCCGGGTCGGCAACCCGGCGAAGGAGGCTTAACCGATGGCCTCACTGGTTACGCTGGAGTTCGTCAAAGCCGCACTGCACATCATCGAGCGCGATGAAAACGGCGTGGTGCTCGATCACGAGGACGATGGCCTGATCCAGGGCTATATCGACAGTGTCGAGGAAGCCGTGTTGCGGTATCTGCGCAGGCTGGCAGTCACGCCACCATGGACAGCGGCAGACGCGCCCAAGGCCGTAAAGCAGGCGATCGTTCTGGGCGTGGCATCCCTCTACGACCCCGAGGCGCCGGAACTCCTTTCCGGCCTCGGCTCATCCGATCCGAAGAACCCGCTCGTCGGGCTCCTTTGCATGATGCGCAAGCCGACGGTGGCGTGAGGTGCTGTGATGTGGATCAGGTTCACGGAAGACTTCGACTTCTCTCCAGCGGCTCTCGGTGGCCGCGTCACCACTGCCTTCAAGGCGGGCATGGTGGAGAACGTCACGCGGGAGTGCAGTGAGGCAGCAATCGCGGCCAATCGTGCCGTACCCGCTTCCAGGCCTCGCACGAAGGATGATCAAGATGGCAATGCCGGCGAGCAGCAACAGCCTGCGTGAGCGGGTTGCCTTCGACAAGCGCGGTACCGGGTCGGACGGCGGCGGCGGTGTCACCACACCATGGCAAGAAACGTTCTCGCGTCGCGCCGCGTACGTTCATCGGAATACTGGTGAGGCGGTGATGGCTGATCGCCTTCAGAGCAAGCGAACACTGCTGATCCGGGTGCGCGCCGATAGCCAGACAAGAACGATCACGGGTGGCTGGCGCGCACGGGATGCGAGGACAGGCCAGGCCTACAACATCCTCGACGCTACCCCGACGCCTGATCGCAGGTGGGTTGATATCCTTGCGCAGACTGGCGGCCCGAACGGATGAAGATCGCCGGCAAACAGAAGTTTCTCGGCCAGATCGCCGCCTTGCCTCGTGCCATGAAAGACGAGATTCGCAAGGCGCTGGAAGTCTCGGCGGAGGAAACCACCGACCTCATGAAGCGGTTCGCTCCCGTAAAGAGCGGCGCTCTGAAGGCCAGTATCGGCTACACCTTTGGGACTTACACCCCTGACAATGCGA
ACGTGCGCGGCCTGAAAGCGAGTGGTTCCGACGCCACCGAGTTGACCGTAACTCTGCACGCTGGCGACAAAAAAGCGTGGTACGCCAGTCTCGTCGAGTTCGGCACGCGGGCGCATACGATCAAGGCAAAGCGGCCTGGTGGCCTGCTCAACATCCATGGTCGGTTGATCGAAGAGGTGCACCATCCCGGGGCAACCGCTTCGCCTTTCTTTTTCCCGGGTTATCGACTGGGCAAGAAACGAGCTCGAACGCGGCTCGCCCGGGCGGTCCGCAACGGTGCCAAGAAAGCTTTCAAATGATGATCGGCGACCAGTTGCAGAAGGCAATCTATGCCGCGCTCACGGCTGCGCCCGCGCTTGTCGATGGCAAGGTGTTCGACACGGTTCCTCCGGACACGGAGGCGCCTTACATCCACATCGGAAACGAGCAGGTGCTCGACGACAGCGACGATTGCGCCGAAGGCTGGGAGGTCTTCACCGACATCCACATCTGGTCCGAGCCGGAGACCGGCAGCAAGCTCGAAGTGAAGAGCATCGGGCCGGCCGTAGTGCAGCGCCTTACCCAGATCACCGCGATTGAGGGCTTTCGCGTCGTCATCGCCTCGATCGAAAACATTCGGTATCTCGACGATCCGGACGGTGTGTCCAAGCACGGCATCGTGACCTCTCGTCACGTGCTCGAGCCTGCTTAGCATTCCTCAACCCTCAGTCCCTGCAACCGCTGCCATAACTCTCGAAAGGAGAAGCCGATGGCTACCCCTCCCATGCCTGTGAAATCCATGAATGGCACTCAGCTGCTCGTCCAGATCGGCGACGGTGCCACACCTGAAGTCTTTGCCGCCGACTGCCTAATCAACACGCAGCGCGGCATCACCTTCAGTTCCGACACGAACGAAGATATCAATCCGGACTGCAACAACCCGGATGATCCCGCGTGGAAGGAAGTCACCAAGGACGGCCTTTCTGGCCAGATCGCCGGCGCCGGCCGCGTGCATACGCCGTCAATCAAGGACTGGTGGGAGTGGTTCATCTCCAAGGACACCAAGAACTGCCGCGTCCTGCTCAACAATGTCACGTTGGCCAACGGAGGCGGTTACTGGTCCGGCGCTTTCCACCTCACCAACTTCGAAGTGACCGGCGAGCGCAACCAGAAGGCTCAGGTGTCGGTGACCCTGATGTCGTCTGGCCCGATCGTCTGGGTGGATGCCGCCGCATGAACCGGCACGGCGCGATTGAAATCACCTGGGCGGGAGGTGATCACACTTTCCGCCTTGGTCTCGACGAGGTCGAGGAACTGGAAGCGGCAACCGAAATGTCGATCTTCCTGCTCTATTCGTCGATGGTGTCGGCCTCGCCATTCGCCAAGCTCAAGCATTATTCGGAGACAATCCGGATCGGCTTGATCGGCGGAGGCATGTCGCCTGTCGACGCCCGCTTGATGGTGCGGCGCTATGTCGACATCCGCCCCTTGGTGGAAAGCGTTGCCCTCGCCGCAGTCATCATCCGGGCAGCGCTGGAACGTGTTCACAGCAACGAAGTGGACACGTCATCGGGGGAAGCGGAAACGGTGGAGCCGAACGGCTCGACTTCGCCGTCATTCGAGGAAACGCCGTCCTGATGGGAATAGCCAATGTCGGCTCACTGACCCTCGGCCAGTACGGGGCGATCTGCCGGCAGTGGAACAAGGCGCATGGGCAAGTTTCAGCGCCGACGGAAGATGAGCTCGACCTGGCTGTCGCGAGGGCACGCGGGCTGCATTAGCGTTGGGCTTGCTCTTGGGTGATGTAGCCCTTCTCGGCGCAATACTGCTTCAGCGCTGGCAGGTTCTCATCAACGGCCGCGAGGCACCCGCGCCTCATGGTCTCCTGCTTTCCCAACTCTTCTGCAACAACTGCCTGTTGGTGCTGTTGCCAGAAATAGAAGCCGCCGACCGCGATCACAGTGACGCACGCGGCGGCCACCAATCCTTTCAACCAATTGTCCAAGGTGCCCCTCCATGGC
CATCGAAGCCGAACGCCTATTGGCCATATTCGAAGCACGTTTCACGTCGCTTGAGAAGTCGCTCGAAAAGGCGCGCACGAACGCCAACAAGGCATTCGGTGACATTGAGGCATCCGGCACCAGGGCTGAAGACGCTCTGTCAAAGGTCGGCGAAAAGGGATTGCCGGGTATCGATCGAGCATCGAAGTCCGTCGCCGCAATGAAGGGAAACACCGCGAACCTTGCTGCGCAATTGAACGACATTGGCGTTCAGTTGGCGGGTGGTCAGTCGCCGTTTTTGATTGCAATCCAGCAGGGTACGCAGATCAATCAAGCTCTCGGGCCGACGGGTGCGCGCGGGGCTGTTGCGGCGTTGGGCGGTGCCTTTATGTCCCTCTTGAACCCGGTCGGCCTTGCCACCTTGGCAATCATTGCGGGTGGCGGGTATGCCGCACAGTACTTCCTCGAAATCATATCGGGCGGGGAGAAGTCGGCGGAAACGCTTAAGAACGAAGCCGAGCTAATCCAGCGCGTCGTGAACAAATGGGGCGATGCGCTACCCGCTTTGCAAGCCTACGCCAAAGAGCGCGAAGCTCTCGCCGACAAGAAGGACCTGGAAGAAGCCACCAACATCGGTAAAGACAACGCTTTCGAGAAAACACGCGAGCAGGTCGACGGATTACGAGCCGACCTAGCCCTTCTGATCGGTGACCTTGGCTCGTTCGCCGGCCAGGAAACGCAAGTTGCTCGCTTGCAGGATGCTTTCTCGATCCTGCAGCAAAGGATCGAATCCCAAACAGCGACGGCTGAAGACGCAAAGCGAGTGCAAACTGTGCTTGCCGATGTCCTGAGCGGAACCGGCGTGCCTGCTGCCGGTTCCCTTGCAGACGAATTCAGTCGGCTGGCACAGGAAATTGCGAAGGCGAGCGCAAAGGCGTTTGAATTCCAAAGTCAGCAGGATGGTAGAATTCGCGCTTCCACCAAAGGCGGACCCGCAAGGTATCAGTCAGGGCAGATCGATCTTCCCGAAACCGCTCCTACACCAGATCGAACGCCCAACCGAGAGGACGTGTTCGCCGAGCAGGATCGATATCGCGAGCGTGCGGCACGTCGTGGGGCCCGTGGCCAGCGCCTGAACGCCGGTCAGCGCACCGACGAGGATCTGCGCGCGGTAAAGGACAGGACCGAAGCTCTGCGCCAGGAAGCGGCCATGATCGGCCTATCCTTTCAGGAGCAGGAGCGCCGCCGCATGGCGCTCGATCTGGAGCAGGAAGCGCTCAAGCGACTGCGCGAGGAAGCTGCCCGCAAAGGTCAGAAGGATCTCGAATCGATCCGGCTGTCTCCCGATCAGGTTGCAAAGATCAACGCGGTGTCCGAAGCCTATGCCCAGCAGGCGGAGGTCTTGCGGCAGGTGCGCGAGCGGCAACAGGAGGCCGAGCAGGCATCGGGTGAATTCTACGACACTTTCAAGAACGGCGCGATCGATGCCATCACCGGCGCGCGGAAGCTCAGCGATGTGCTGAAGGAGCTCGGCACGCGCTTCGCCTCGATGCTGCTCAACAGCGGTTTCGACATGCTGTTCAAGCCGAAGTCCGGCAATTCGTCGGGTGGCTCATTCGGCAGCGTCTTCGATTTGATCGGCAAGTTCATCACGGGCAGCTTCGCCAACGGCACATCCTCCGCGCCGGGTGGTCTTGCGGTGGTCGGCGAGCGCGGCCGAGAACTGGTCAACCTGCCGAGGGGCAGCCAAGTCGTGCCCAACGACATCACGGAAAAGCTGATCGGTGGCGATGGTGGCGGCTCACCGATCTACATCGGCGGCCCGACCATCAACGTCGATGCGCGCAATGCCCAACCCGGCGTCGGCGAGGAAGTCAGACGGGCAGTCAAGGATGCCACCGGCAACATGACGCAACTCGTTCGCAACGCGCTGCGCGAGATGAAGGTCAAGGGCATGAAGCCATGATCGTTGATCTTCCCGCCATCCGCTTTGCGCCGACCAGGCCGGAGCTTATCGATTC
CGTGTCGATGAGCCGCGCCGGTAATCGCGTCATCAGCGTCGTCGACTATGCTGATCCCTTCTGGCAGATCCCGATGCAAACGCTTCGAATGACTGCAAAGGAGCTTCAGCTACTCCTCGCCTTCCGCGATCGTGTCCGCAACGGCATGGTCACGGTGGTGCATCGTCCGACAGACAACTGCCTGCCTCAAGCCTACTGGGGCAATGCCGGTGCTATCCAACTCAATGACGGCACGCTGCAGTCTGTGACCAACGGTTTCAACGTCACGCTCAACAACCTGATCAACGGGTTGAAGATGATGCCGGGCGACCTGTTCTCGCTCAAGGCAGGCAACTACCGGTCAAGGCATCGTGTCATGACCGGCGGCACTGCGGCGGCCAATGCACTTGCCCTGACTGTCGAGCCATTCGTGCCGCCGTACATCACTCCCGGCGCCGTGGCGAAGTTCCTGAGGCCGGAACTCAACACCCGCGTCCTGCCGGGATCGTTCTCCGTTTCGGATGACCATTTCCCGGTGGCATCCTTCACTCTCGTGGAGGTGCCGCAATGAGTTATCCCGCCCGCCTCGAGCAGCTTCTCGACGAAGGCCGCATCGCGATCCGGGGCCTGATCCTCTACAAGTTCGGCAATGCGTGGATGGGTGTCTGGACCGGTAATTACGAGCTCGTCTACGAAGGTGTGACCTACGTGCCGAACCAACTCATCACGGCCGAGCCGCCAGACGGCGCGATGGGGATGGAGGCAACCGAGTTCGTTGTCACCATGCCGGCGCGTTCTGATTTCGGCATCACGCCTGACAAGCTCGCCGAAATCGAGAGCCAGGATTACAAGGGTCGGCCCTGCCAGCTGCGTGAAGCATACTTCGATCCGGACACGCGCGAACTGCTGCACGTCGAGGAACTGGCCTTTGGCTATGTCGACTACATCACGCACGTGACTGAAGGCGGCGAAATGCGGCTTGAGGGGCATGTGATCTCCGGTGCCCTGGACAACCACCGCGACGGCTACCGCTCGGCATCGCACGAGGATCAGCAGCTGATCTCTGAAGGCGATCGCGGTTTTGAATACGCCACGGTCATCAAGACCGAGAAATTCGCCATCGAACTCTGAGCATTCCGACATGTTGAAGATTTTGCCCCGGTTGCCCGACTGGGATCGCCGCCTTGCGCGCGTGACGGAAAAGCACCTTCGCCTTCCAGGCGAATGGGGTGTCTCCGACTGCCTGATGACGGCGATGGATGCTGTCGAGGCAGTGACCGGCTTCGATCCTGCCGCCAAGGTTCGTGGCACCTATTCGACCGAGCAGGGCGCGGCCAAGCTTTTGCGCCGTCGAAAGGCTGAGACCGTCGAAGAGATGCTTGCCAAGCTGTTCCCGACGCTGCCGTCCGCCTTCTCGGCGCTGCGTGGTGATCTGGTCGTTGTCGAGCGCAATGGCGTGCTGTCGGCAGGCTATGTCTGCGAATACGAGGTTGCGGTCAAAACCGAGACCGGCCTTGCCTTCGTTGGCATCACCGAAATCCGCTCGGCCTATCAGGTGGGTGCTCGGTAATGGCATTCCTTGGTCCCGTCATTGCAGGCGTCGGCTCCGTTCTCAGCGGTGCGTTCTCTTTCTTCAGCGGAAGCACGATCCTTGCCGGTGCGCTGAAGATCGGCCTTGGCCTTGCCGCGCAATACGCCATCGGCGAACTGTTCAAGCCGGACGTCCCGGCACAGGTCTCGCACCTCGAAACGCAGTATGGCCAGGATCTGCCGCGGAGCGTGGTGCTCGGCACACGAGGCCTTGCCGGTCATCACATCTACCGCAACGCCTATGGCAAGGGCGGGCGGACGGTGCAGGATCTCTATGTCCTGTCGCACTTCCGCATCTTCGGTGTTCCGCGCGTCCGATACGAGGGCAAGTGGTATTCGCTCGGCGGTGCCGTCGATGCCGAGCGGGGCCAGCGTATCCAGGGCATCGAGCCGGAAATCTGGGTCAAGGTCTATCAGGGTGATCTCGCCCA
GGCGGCTGATCCGGGACTGATCGCACGCGCCAACCCCGCCGGCCGATGGACCGTTGCCCATCGCCTTGCCGGTGTTGCCTACGCCGTTGTCACGCAGCAGCTCGATCGCGAGAAACTGCCGAACGCTTGGCAGGCGTTTTTCGAGGTTACCGGTGTCGCTTATGACTGGCGGCAGGATAGCTCGGCAGGCGGCTCGGGTGCGCAGCGCTGGACCGATCCGACGACGTGGGGACAGTCGAGCAATCCGATCGTCCTGCAATACAATCTGGAACGCGGCTTCTGGCTTGGCGATCAGCTGATTGTCGGCAAGGGCATTTCGACCGATCGCCTGCCCATCGGCAAGTGGACGCTGGCCGCCAATATCTGTGATGAGGTTGTCGGAACGGGTGCGCGTTACAAGGCTGCCTATATCGCCACAGCAGGCCAGGGCATCACCCATGCCGGCAACATGGCACCGCTTCAGGCTGCGTGCGCTGCGTCATGGTATGAGGGTCCGGACGGCGAATATCCGATCGCTGGCGCCAGCCAAGCCATCGTCGCCACCATCACCGATGACGACCTGGTCGACGGCGAAGACAAGAACTTCTCGCGCTACCGCACACGGTCGGAACTGGTCAACACGATCGCCGGCACCTATGCGTCGCCGGACGCCTTCTATGACGGCGTCCCACTTGCCACCCGCACTGATGCTCCAGCCTTTGCGGCAGATCGCGAACGGCTGGCAACGTCCATCCCCTATGACGCTGTCACTGATCCGAAAGTTGGCGACCGGCTGAACGACATCGCCCTGCGCGCCAGCCGGTATCAGGGCAACGGCGAAATCTGCCTTCGGCACAAGTTCCTCGGCCTCAAGGTCGGCCAGTGGTTCCAATGGCAGAGTGAAGACGAGGCCGGTGGCTTTACCAAGACGTTCCAGATCGTGCGCCGCCGGCTCGGTCCAATCGGGCCGAACTCATGCCGCCTTGTCTATGTCACCATACAGGAAGTCGGCGAAGGCATCTTCGACCCGACCGCCTATCAGACCAACCCGCCGGACGCGACGGGACCGGGTGCGCCGGACTATCAGAGCGAGCTCGTCAACTTCAACATTACGGCGATCAAGAACACGGCTTCCACCGGATCGGCTCGCGCGGGCATCCGTGCATCGTGGAACCCGATCGAAGATCAGACCATCACTGAAATTCGCATCGAGTATGGCCCAGTCAATGCGGTTGGTGACATACCGGCAAGCACTGTCCCTGCGCCGCCGGCGTTGAAGAACCAGACCATCCACGACCTCTATCAGGGCGTGATGTCCAATTCGCTTTGGGCAGTACGCTACATCTTGCGATCGCAGCCGGCGCGCGCCTTCTTCCCGTCTCCTTGGAAGTACGTCAAGACGCTGCAGGCGACGCTCGGCGACGACGACGTTTACCTCCCGGGCATGGTCGAGGAGGTCAACCGCAACACCGCGAACCTGCTGCCGCCGCTTCAGGAAGACATCCGCACGCTGATTGAGCGTGCTGAAAAATCCGCGCAGCAGTCGCTTGATCAGGATGCAGGGCAGTTCCTCAACAAGGTGTCGCAAGTCACTCAACTGAAGTCGGCATACGAGACCGTCACCGCCGAATACAAGTCAGTGGTGTTTGCCGCCACCGGGCCGAACTCTGCGATCGTCCAGCGCCTGGACACCTACGAAGCTCGCTTTGCCGATATGGCCAGCGCCACGGCGCTTAACCTGGTCAAGGTCCAGGTCGACCAGCAAGCTGGCCAGATCACGGCGCAGGGAACGCTGATCAGCGGCGTGCAGGCGACCGTCGGTGGCATCTCGGCGCAGGGTCTTTTCCGAACCTATGTCACCGCTACGGAAGCCGGCGCAGAAGCAACTGTCGGCCTTGCGGCATCTGCCACAGCGAACGGTTCGACCCGATCAGCAGCGATACTGCTTTCGGCCCGATCGGACGGGAAGACCATGGCCGGTCTGGTCGCGGACCTGATCTACTTCACCGATGGCGCGGGCA
GCAAGCTGTTCCCGGTCGCGATCGAGAACGGCACGCTGCGCGCAAACTTCGCCAATCTCGGCACCGTCACAGCGGCAATCCTTCAGTCCCCGAATGGCTTGCACATCTACAACGTGGCCAACGGCACACAGGAATGGTGGCGGGCATGACAACGTCTGCCGCGTTCTGGGATTGGGTCGTTGACCACTGGCGTGGCCGCTGGGTCTTGCCGGGGTACAGCGCGCGCGATGCGAACGTGCCTCGCAACAAGGTCGTGATGGATACCGATGATATCGGCATGCTCTCGAAGCTGGCGAATGGCCAATACACGCTGAGCTATGGCGGTAGCCTGGACACAGGGCCGGTGACAATCGCGTCGTGGACAGATCCCGGCTTCGTGCCGCTTTGCCTGTACCGCTTCTCGATCGCAGGTGGCGAGTGGGAAAATGTCTACACCATGCAGAACACAGGCAGCACCAGTGGAAACTACTACATCAAGGTGCATCGCACCGGCATCATCGCTTCGCTGAAGCTGATCAACAGTTCATCCCCGGTAACGATTGCCTGGACGGCGCTTCGGCTGAAGGTGATGTAGATGATCCGGCAAGCCTTCCGCGGCTTCGACGGCGGCGGCGTGTTCAAGTTCCGCATCTTGAAGCCAGGCAGCACCGCCGATGCTCGCTATGCGTCGGTCGACGACTGCCTGTTGCACGAGCAGATGCTGGCCACTCAGCCCTATTGGCTTGGTTTCGTGCCGTGTCCGTTCGCCGGCAGCACGACCAAAGATCCGCTCGATCAGACCACGAATGTCAGCATTCCGGACGCTGGCGTGACCGATCCGGAATACATGCTGTGGCCTCGCAACGTGAGCGGCCGGAATTGCTTTCCGCGTCCGGCGTCGGTCGGGGTCGGCAACAGTCAGGATGGCTATCCCAGCGAAGCATGGGCCATTCGCATGGTCAGCATCACCGCGAACACACTGACGTTGCGCTTCATCAAGCCAGCGCTCAGCACGCAAAGCCCGCTCGGCTGTTCCGTTGCTCTCTTCAAGAGAGGTTGAGCAGTGGCAACCCTCGAAGCTCGAATTTCGAACGCTGGCATCGAAATCGCCAAGCCTGGCTATGATGTGCGTACCGCCTCGCTGGCGAACATGGTGTTCTCGCCGAATCTCGTAGCGATGCGGGTGGCGTTCGAAGGGACGTTCACGGCAGGTCCGCATGGCGAGGGAAACCCCTACGCGGCGTACTACAAGGCGATCAAGTATTTCGACACGCCGTTTCCGAATGATCCTCCCTACGCCCTTGCGGCCGGCATAGGGAGTGACGGCACCAGCTATCAGGCGCCCTTCGTCATCAACTCGGCGGGTGGCAACACGCTCGAAGTCACGCCGCACTATGAACTGAACACCTACACCAACCGCATCGAACTCTATGTGATGCAGTGGACGGCCGGAGGCGTGATCCTGCCGCTCACGTGGAAGTTCTTCATCCTCCAGAACACCCTTTCCTAAACTTCGACAGGACACTACCGACATGAACCAGCCTGATCGGGCGGGGCAGGAAACTGCTGCTCCCGCGCAATTTCACGTGCATCCCGTGGCGGCGCTCACGGAGGCGAATGCACTCGTCGGATTCCTCCAGAACCGCAACCTGATGCTGGCGCACGATCTGCTCAGCCTGAAAGGCGAGAACGATGGCTATCGCCAGCGGATCAACGGTCTGGTCGAGCAGGTTCTCACCCTTGAGGCTCAGATCAAGGAACTGACTGCTGCCCCCATCGCGTCGGAGCCGGACGAGGAGGCATAACCATGGGCATCACCTATTACGATACAGGCGCCGCGACGCTGTCGGTCGGTTCCAAGACCATGACCGGACAGGGAACGCTCTGGGCCGGCTGGGTGAAGCCGGGCGATCAGGTCATGCCAGAGGAAGGCACCAACAATGTCGTTGATGTCGTGGTCAGCAATACCGAGATCACATTGCTGAAGCCGTACCATGGCGTCGCGCAGGCTGCGCGGCCAT
ACACCATCATGCGCACGCCCGATGTCGTGTTCACGGAAAGCCTTGCACGAAAGGTTCTGCAGGATATCTCCAGTTCCCCTCTTGTGGCGCTGGCCGGCTTGCCCGCTGTGTCACGATCGCTGATCGGCTTCGGCCCCACGCAAGTCGCGGAGACGGTTCCGCTTTCAGACAAAGTGAAAGCGTTCCTCGCCACCATCCCATCCGACGATGTCGTATCGTTGCTTGCGGCAGCAAATTTCGCGTCTATCCGCAATTCGTTGAATGTGGCTCCGAAACAGTCTGCCATTGACGACGCGACGGCGGGAGCAGGTCTTACGGTCGGAGCGTTTGGCTTGGGCGGTGTGATCGGGTTGTCGATACCCCTCAACGATTACAACAAGGCACTCCGCAGCGGCATCTACGCCGGCAGCGGTGCTTCGGCAGTCAATGGTCCGCCCAATGGGGCCGCTTATGGCCCCATGCTTGTCATGGCGCGGGCCTCGACGGTGGTCACCCAGATGGCAGGGTTTGGCGACACTGTTGGCAGCGCATTTACTCTTTATCTGCGTCACACCGGAGACGGAGGCGCGACCTGGACACCGTGGCGCCCGATTGTGCCTGAGAAAGGTGCCAACTCCAGCGGTTTTTACAGCCGCTTTGCCGATGGCACCCAAATCTGCTGGATGCAGGACCTGACTTTTGGGCCAATTACAACAGCTGCGGGCTCGATCTATTCCGGGGCCGCTGAGCAGACATTTTCCTTTCCAGCGGCCTTTGCGAGCGCCGGTCAGATTGATTGCCTTGTGATGTCTGACAGCACCAACATCTGGGGGGCGTGTTCAGTGCCGTCCAACACTGCCGTGAGAGCGCAGCTGTTCTGCTACGCATCCATCGCATCAAGCCGCAAGGCCAACATACTCGCGCATGGAAGGTGGTTCTGATGATCATCACTCTCTCACCCCAGCGCCGCGACGATACGCTTGTCGTGTCCAAAGCAGGCGACATCCTGACGATCAATAGCGAGCAGTTCGATTTCTCGACATTGCCCGATGGCGCAACGATCCCTCACGGGGACATCCCCTGCGAATGGATCGCTGGATCAGTCGATCGCATTGCCGGCGAGCTGCAAATAGCCTTGATCTTGCCACACGGACCGAACCCTTCTCAGGCGGAGGCATTTCCCGATCCGATCACAGTGATGAGCGACGGGCCGATTGCTCTGCCGCACGACGAAGAGGAGACAGCCAATGTGGACGCCTGATCCTTCCATCATCATCACGGCCGAACAGAAGGCAGCGGCTCTGATCCCGTCGTCGGTTTCGGCTCGACAGTTCAAGCTCCAGCTTCTTGCCGCAGGCCTGCTTGACCAAGTTGAAGCCTGGATTGCCGCACAGGACCAAGCAGTCCAGATCGCCTATGCGAACAGCGGTACATTCGTCCGTTCCGAGCCGATGATGCAGACCGGCTTTTCCGCGCTCGGCTTTACGCCTCAACAGATCGATGCGTTCTTCGTCGCGGCATCTCAGCTGTAAACTTCGGCCTTAGCCGCCAACTCCCACGATCCTCCAAGGAAACGCATCATGGACCGAAGCAAGTTCTTCGGGAGCCTCCGCGCGCGCTCGTCAGGCGTGTTCGGGACTTCCCTGTCTCAGCCTCAGGTCGATGGGCTGAAGGTTATACTCGATGCCGGCCAGAACGCCGGCCTGCCGCTCCGGCATCTTTCGTATGCCCTCGGCACCGCCTATCACGAGACCGGTGGCAGAATGCAGCCAGTCACCGAGAACCTGACCTATACGAGCGCCGCCCGCCTTCGTGAGGTCTGGCCCTCTCGCTTCACCAGCACCGCCGCAGCCATCCCCTACGTTCGCAACCCGCAGGGTCTCGCCAACCGCGTCTATGCCAACCGCATGGGCAACGGTTCGGAAGCATCCGGCGACGGCTGGCGATATCGCGGCCGCGCGCTGCCTCAGCTGACCGGCAAGGAGAATTACCGGAAGGCGTCAAAGCTGGTCGGCATTGAC
CTCGTCGCCAATCCGGAAGCGGCAAACGGCAGCGCGATATCGGCTCGCATCCTCATCGAGGGCATGCGGGCAGGAATGTTCACCGGCAAGAAGCTGTCGGATTTCATTTCTGGCGATCGAGCCGACTATGTCGGGGCCCGGGCAATCATCAACGCTGATGTGAAAGCGAACGGTCAGATCATCGCCGGCTATGCCCAAGCGTTCGAAACCGCCCTTCGCCAAGCCGGCTACATCGGTCAGGCACCGAAAACCATCACCATCCCGGCAGAGCGGCCCGTCAGCGCGCCTCAGGCCATTCCGGCCACTCCACAGCCCCCAGCGCCTCAACCCTCGCCAGCGGCCACGCCTGAGTCAAAGCGCAGCCTTCTGACAATCCCGCTCGACATCGTGTTCAACCGCAAAGGAGCCTGACATGGGCCCGTACGTCCGCATTCTCCTCCGCTATGGCGTGGGCGCCGTCATCGGCTACCAGATAGGGGACCAGCTTTCCAATGACCCGGATGTGGTGACCGTCGCCACGGTCGCGGCCACGGCCGCCGTCGGCGTTGCGACCGAGTTCGCCTATCACCTGGCGCGCAAGTTTGGCTGGGAGCGCTGACCATGCTGGCGCTCGTCCTCCAATATTGGCCGGTGATAGCTGGAGCGTTGGCGGCTGCATTCGCCGGTTGGAAGCTTCGGCAGTCAGGTGTCAACGCAGAACGTGCTCGCCAAGCCAAGGAGAAGGTCGCTGCCGCTGAGGACCGGCTCGAAATGGATCGCGAGGCAACCGACATCGAGCGGCGCGTCACCGGCATGACCGATGAGGAAGCCCGCAAGGAGGCGACGCGATGGGCAAGGCGTTGAGGCTCATCTGTGCAAGCCTAGCGTGCTTGTCTCTCACGGCGGCCTGTTCGGTGAGAGACAAAGGCCCGCTTCCATCAGATCCGCGCTCGATCTGGTGCGACCACAACTCACCGCGGCGGGATGCCAGATCGGACACGCCGCGCTCCGAACTCGATGAGATCAACAAGCACAACTTCCAGGGCGCAAAGTGGTGCGGCTGGAAGCCTTCGGAAGGATCGGGACATTGAGCGAGGAAGCCAACATGCCGAACGTCTCCGAGAAGATCGTGAACAGCGTCATCATCACCATGGCGGCAAGGCTCTCCATGGTGCTTGCAATCCCGACACTCACCTTCCTGTTCTGGCTCTACACCGGCTGGCAGGCTGAGAAATTCGACAAGGTCCAGGACCAGGTCGAGCAGACGCAGAAAACCGCACAGCAGGCCTCGGACCAATCCATCAGGCTTTCCGAGCGCCTGACAACCGTGGAGACAAAGCAGGCCGAGGCGACGGTCTCGAATGAGAGGTTTCAGAACGCGACGCTCAACCGGCTGGATAGGGTGCAAGACAGCATCGTCGGGCTGTCCAACACGGTCGCGGCGCTGACTGCCACCCTGCAGGCGATCGTCGAGGATAAGCGTCGCAGGCCGCCGTAAAGGGTGGTTGCGATTTCGGCTCAATGCAGCTATTGGCGCCCCGCGCTTGGCTGTCAGCGAGCGCTCTCAGGCATCAAAGGCCCGTTCCGGTTCACGCCGGAGCGGGCTTTTTTCTGCGTTTGAAGGCACCTGTATCACGCAGGTACCTGGAAGGTATTGCCATGATGGCAATCCCCCGTGTGCGCAAACACAAGCGCCGCCCCTAAGGGCGTTCTTGCATTTTTAAGCAAATCGGTTGATCGTATCCGTGGGTGTGGAAACCCGGACCGCGATAAGCAGGACATGATTGGCGTCATGTGCGCTGCTGACGCGCTAAGAGACCTTGGCGGGTCTCGATGCGCATAGGTAACGCCTTGGCGGGCGTTCCAACTCGGCTGAATATGGCCGGGGGCGAACGCTATGTCCAGAGCGAAAGCTTAAAATCGTTCGACCTGTCTCGCGGCAGGTTTCCACCCCCCGGCTACTGGCCCGCCGCCAGAGATAGGAACGGCACCTGTGGAAAGGTGAGCGTTCCCCGGCTGTG
GAAGGCCGGTTCTACCGCGAGTGAGACAATGACTTCCTACATCCCTGAGCACTATTACGGGCTTGTCCCGGGCAGCTACCGTGAGCCAGGTGGCGAGCGATATGGCTTGATCCCATGGCCGGCGCATATGAGGCCGCAACAGCAGCCGAGCTATGCTCCCCTGCCCGATATCGACCTTCGGCAACTCTTCGGCGAAACGGAAGGTATCGAACTCACCAAGAAGGGTGAGGCCTATGCCGAGGCGGCGAAGACTGACGTGTTCTTCCTGCGCATTACCCCGAAGGACCGGACCAGCATTGCCTATACGATCGAGAGCCTGATCAACCTCCTCGACGCTATGGAGCCTGATCCCGATCTGGAGCCAGATCTCGGCAGCGGCGACGACCGTGAAGGAAATGGCGGCGCTGAAGGCGGCGACGAAGATCGTGAACCATCCTTGGGCTGGTCAGCCGATGGTCAGGTCGGTCAAAACCCCGCTTGGTGCAATTGCTGGGGTGAACTCGAACATGACGATGCGGATGATGAATTCACTCTCGGATGGGAGAAGGGCTCAATGCCCTATGATCAGACGTATCTGAATATGTTCGGTACCGACGAGGCCGAAGACGTGTCGGAAGATGAAGGGGCCACCACGGGCGACGATGAGTACTCGCTCGGGACAACCGAGGAAATAGATCAAGAACGCCGGCAGGATGTGGCTGAAGGCTATTTCATCCCCGACGGAGAACCGTACTTGGGCTGGGCCGAATCGTTCGGCCGTGGCGTGGTCGGCGAACAGGACTGCCTCGACGATCGCGAACACGAGGATGAACGGGAACAAGATCAGGCGGAGATGGGCATCGCCGATCAGGATGCCCTGCAAGACCCTGGCATCTGGCCTTCGCACCTACCGACGTTCCCAGGCTGCAAGGGTGCAGTCGATTTCCAAGGCGACGGAGTTCGTGAGGCCAACGCGATGCTGAAAAAGGCCCGTGCCCAACAAGCAAGGCCGAGCGCGTCACCCAATATCATAAATCTCCGCAACTTCGCGCCCACGGATGACGACATCCTGTCGCTGGACATGCTGGCTCACTACCGCGCAGCCCATGGACAACGCCCATGCTGAACCGGCGCACAATCCTCACAGGAGCACCAGCATTGGTTTTGCTACCTGTGCCGTCTCCAGCGGCTGAGAATGCCGCGCAGCGCGTCCAGCGCCTTTGTAGGGAGCTTTCGGACGCGCTGGACGAATGGACCGGTGGTGACTTCATGGCGGAGGTCTTTCCCGCCAGCGTGGTCTTGGAAGGGTTCAAGCCCTACGCGTTTGATCAGCGTCGCTATGCTCTCGATCCAACCCGCAGGGTGTTGATCGACTGCGCTCGACAGTTTCAGGACCTGCGTGAGAGGGACGCATGACCACGTTGGACGTGATGTTCATGGGAACGGTGCTGGCCGGATGGCTGGCGCACTACCTGATCGCAGGATGATTTGAGCGGCCCGGCTGCTTCGGCGCTGCCGAAAACCCTTCAGCTACTTTAAGGTTTTTCCCGCGCCCGGCTGCTTCGGTGGCCGGGCTCTCCTCGGCAAGAACATGCGGTGAACGCTTTGCTACGCTTATTGCTACGCTACTTCGTAGCATGATGCAGCACAGAACGGGATTGAACGAGAGTCAATCCCTCTATCACATTGAAAAAACAGTATATGTAAGGATTGAGCGGGACGGCAAGGGACTGCTCTAAGCGAATTTCAAGACCGGTGCCTTAAACCGCTCGGCCATCCCTCCACTGTCCAGCCTGTAGGGGCAATTGCCTCAGCATGCCGAACCAGTCGAAACCGGCTTTAGCGCTTTCCTGTTCCCAGCGTCAACGGTATCGGCGTCGTCATCGCGATTGTCAGTGGCGGCAGGGCAATCAAACAATGGCTGAAGTGCCGGCTTTCCACGACCTTTCCACGCAGAGCGCTTGACGCCACAGGTCGATGCTCGTAGGGCCAAATTCATGAACCCCCGATGAGGAGCGCTT
CCATCGATGGAAATCTTCACCGCAGCAGGTTTGTCGGCTCTTCTCCAGGTCATTGCCATTGACCTCGTCCTTGCCGGCGACAACGCAATCGTCATTGGCCTTGCTGCGGCCGGACTTCCAAAGGACCAGCGCAAGAAGGCGATCCTCGTCGGCATCGCCGCAGCCACGGTGCTGCGCATCATTTTCGCCCTGATCACCCAATGGCTGCTCGCAATCGGTCCGATGCTGCTCATCGCCGGTGGCCTGCTGCTTCTGTGGGTATGCTGGAAGATGTGGCGCGAGCTGAGCGTCACGCACGAGGAAGAACACGAGGCCACCGAGGCCCTCGACGATATCGACAATGGGAAGGCGTCGTCCGGCCCCGCGCGGAAGACCTTTGCCCAGGCCGCCTGGCAGATCGTCATCGCCGACGTTTCGATGTCGCTGGACAATGTGCTCGCCGTTGCCGGTGCGGCGATGGAACATCCGACGGTGCTGATCATCGGCCTTGCGCTGTCGATCGCACTGATGGGGTTTGCGGCGTCCTTCGTCGCCCGGCTGCTGCATCGCTATCGCTGGATCGCCTATATCGGTCTGCTGATCATCCTCTATGTTGCCGTCAAGATGCTGCTTGACGGTGCAGTCGAGCAGTTCCCCGAACAATTCCAGTTCCTGTCGCCCTGGTTCGGCAAGGCCGCTGGTGCCGCCCATTGAGTGGCGCCTGAGCCAACCCCGGCCAGGTCGCTCGGCTGACGCCGTTTGCTTGACCTTCGAACTGTGCTATTGCGCCGCTGAATTGAGCGCCGCTTCGGGCGCACCAACAGAAAACCAGAGATTTCCGGGCCGGCCCTTCAAGGGTTTCGGCCCGTGTCCATTGGAACGCCACGATGACCGCAGCGCCCCGTATCAGCTTTGTCAGCCTCGGATGCCCGAAGGCACTTGTCGATTCCGAGCGCATCCTGACCCGGTTGCGCGCGGAAGGTTACGAGATAGCACGCAAGCATGACGGGGCAGACCTCGTCGTGGTCAACACGTGTGGTTTTCTGGATTCCGCGCGGGATGAATCGCTCGATGCCATCGGCACCGCGCTGAAGGAGAACGGCAAGGTCATCGTCACGGGATGCATGGGCGCGACGCCCGAGCTGATCCGCGAAAAGCATCCGAATGTCTTCGCCATCACCGGCCCACAGGCCTATGAAAGCGTGATGGCGGCGGTGCATGAAGCCGCCCCGCCTGCACACGATCCGTTCACCGATCTGGTGCCGCCGCAAGGCGTCAAGCTGACGCCGCGCCACTACGCCTATCTCAAGATTTCCGAAGGCTGCAACAACCGCTGCACCTTCTGCATCATTCCCGACCTGCGCGGTGACCTCGTCTCGCGCCCCGCGGCGGATGTGCTGCGCGAAGCCGAAAAACTTGCCGCCGCCGGTGTCAAGGAACTGCTGGTCATCTCGCAGGACACCAGCGCCTATGGCGTCGACATCAAATACGCCAGCAGCCCGTGGAAAGATCGCGAGGTCCGCGCCAAGTTCCTCGACCTGTCGCAGGAATTGGGTGAACTCGGCGTCTGGGTGCGCATGCACTACGTCTACCCCTATCCGCATGTCGCGGACGTCATCCCGCTGATGGCCGAGGGCAAGATCCTGCCCTATCTCGACATTCCCTTCCAGCACGCCTCGCCGCAGGTCCTGAAGAACATGCGCCGGCCCGCCCATGGCGAAAAGACGCTGGATCGCATCCGTTCGTGGCGCGAAGCCTGCCCGGATCTAGCCATCCGCTCGACCTTCATTGTCGGATTCCCCGGCGAGACCGAGGAAGATTTCGAAATGCTGCTCGACTGGCTGGACGAAGCTAGGATCGACCGCGCCGGCTGCTTCAAATACGAGCCGGTGAAGGGCGCGCGCTCCAACGATCTCGGCCTCGAGCAGGTCCCGGACGACGTCAAGGAGGCGCGCTGGCACCGCTTCATGCAGCGCCAGCAGAAGATCTCCGCCGCACAGCTGCAGAAGAAGGTCGGCAAG
CGCCTGCCCGTCATCATCGATGAAGCGAACGGCGCGGTCGCCAAGGGCCGTACCAGATATGATGCTCCGGAGATCGATGGTTCGGTGCACATCCAGTCGCGCCGGCCGCTCCGTGTCGGCGACATCGTCACCGTCAAGGTCGACCGGGCCGATGCCTACGACCTGCACGGCATGGCTGTCTGATCCACCGGTCGGGCTGCCCAGAACGCAGCCGGTCAGCTGAACCAAAAAAGGGCGCCGCAAGGCGCCCTTTTCCATTTGCGGCTTGCCGCGCGCTTACTGCGGCAGCTTGTCGTCCACGCCCTTGACGTAGAAATTCATGCCGAGCAGTGTCTTGTCGTCGGCAACTTCGCCGTCCTTCAGCCAGGCCGATCCATCCTGCTTGGCGACCGGACCGGTGAAGGGATTCCAGCCGTCGGCGATCTTCTTCTGAGTGGCTTCCGCCATCGCCTTCACGTCATCCGGCATGTTGGTGTAGGGCGCCATGAAGACCGCTCCGGCCTTGATGCCTTCCCAACGATCCTCCGGCTTCCAGGTACCGTCCAGCGCCTTCTCGACGCGCTCGATGTAATACGGAGCCCAATCGTCGACGATCGCGGTGTATTGCGACTTCGGCGCGAACTTGATCATGTCCGAAGCCTGTCCGAAGCCATGCAGGCCGCGTTCCTGTGCCACCTGCAGCGGCGCCGTGGAATCGGTGTGCTGGACGATGATGTCCGCGCCCTGGTCGAACAGTGCCTTGGCTGCGTCGGCTTCCTTGCCCGGATCGAACCAGGAGTTCACCCAGACGATCTTGGCCTTGAAGTTCGGATTGATCGACTGTGCGCCGAGCATGAAGGAATTGATGCCCATCACCACTTCCGGGATCGGGAAGGAGACGATGTAGCCGGCAACGCCGCTCTTGGACTGCTTGGCAGCGATCTGCCCCAGGATGTACCGGCCCTCATAGAAGCGGGCATTGTAGATACCGAGGTTCGGTGCGGTCTTGTAGCCGGTGGCGTGCTCGAACTTGATGTCCGGGAACTTGCTGGCGACCTTCACTTCGGCATCCATGAAGCCGAATGAGGTGCCGAAGATGAGCTTGCAGCCCTGGCGTGCCAGACGCTCGAACGCGCGCTCGGCATCCGGGCCTTCGGCGACGTTTTCGAGATAGGCGGTCTCGACCTTGTCGCCCAGCTTCTTCTCGACCGCGAGGCGGCCCTGGTCATGCTGGTAGGAGTAGCCGAAGTCGCCCACCGGGCCGACATAGACCCAGCAGGCCTTGAACTTATCCGCGGCTTCCGCGGAGACGGCAACGGACATCGCCGCCGCCGACGTCATCAGGGCAATCAGTAGTTTTTTCATGATGTTACCTCTCTGGATTAACCTTGCTATGCTTGTGTTTTTTTAGGTTGTCAACGATCCGGAACGAAAGCCTGTCCGAGCGAGGCCGGGGTGTTCATCATGGTCAGGCGCTTGTTGCGCGAGATGATGACGAGCACGACGATGGTGGCGAGATAAGGCAGCGACGACAGGAACTGCGAGGGTATTGGTATGCCGAAAGCCTGAGCATGCAACTGGCCGATCCAGACCGCGCCGAAGATGTAGGCGCCGGCAAGTACGCGCAGCGGCCGCCAGGACGAAAACACCACCAGGGCCAGCGCGATCCAGCCGCGGCCGGCCGTCATGTTCTCCACCCATTGCGGTGTATAGACCAGCGAGAGGTGGCCGCCGGCGAGGCCGGCGCAGGCGCCGCCGAACATCACTGCCAGATAACGGTACTTGATGACCTGGATGCCCAGCGCATGGGCCGAGGTGTGGCTGTCGCCGATCGAGCGCAGGGTGAGCCCGGTGCGGGTCTTGAACAGGAACCAGGATACGCCGAACACCAGCGCGATCGAGATATAGAAGATCGGGTCCTGGCCGAACAGCAATCTGCCGACCACCGGCAGGTCAGTCAGCACCGGAATGTAGAGATTGGGCAGGCGTTCGCCCGGCAGGCCGACGAAGCCGGTTCCAAG
CATGCCGGACAGTCCGAGCCCGAGAAGCGTCAATGACAGGCCGGTCGCGACCTGGTTGGTGGCGAGCGACAGCGTCATCACTGCGAACAGCAGCGAGAACAGCGCACCGACCACGATGGCCGCAAGAAGGCCGATCCAGGGCGAGCCAGTGAGGAGACCGGCACCAAAGCCGGCGACAGCGCCCATCACCATCATGCCTTCGACGCCAAGGTTCAGGACGCCGGAGCGTTCGACGACCAGTTCGCCGATGGCCGCGATCAGCAGCGGCGTTGCCGCCGTGGCGATGGTCAGGAGAATGCTGACAGTCATGTCCATGTCAGTGAGCCTCTTCCAGCTTCGGCGAAGCGTCAATTTTGGGGGCTGGTGCTAGGCCGACAAGCCGGATGCGATAGTGGATCAGCGTGTCGCAGCCCAGCACGAAGAACAGCAGCATGCCCTGGAAGACGCGCGCCACCTTGTCGGAGATGCCGAGTACGCTCTGCGCCGCCTCTCCGCCGAGATAGGTCAGGCCAAGCACCAGGCCGGCAGCGACGATGCCCAGTGGGTTCAGACGGCCAAGGAAGGCGACAATGATGGCCGTGAAGCCATAGCCTGGAGAAATCGAGGGCTGCAGCTGGCCGATGGCACCGGAAACCTCGGAGATGCCGGCCAGTCCGGCGAGAGCTCCCGAAAGCATGAAAGCGAAAAAGACCAAGCGGTTGAACGAGAAGCCGGCGAACCGCCCTGCCCGCGGACTTGAGCCCAGCACGCGGATTTCAAAGCCCTTCAGCATGCGGCTCATCATGAGCCAGACCAGGACCGCGGCGACAAGCGCGAAGATGAAGCCCAGGTTGGCTCGGCCGGACATCGGCATCAGTTCAGGCAGCACGGCTGACGGGTTGAACTGCACGGTCTGCGGGAAGTTGAACCCCTTGGGGTCACGCCAGGGGCCGCGCACCAGCCAGTCGAGGAAGAGCTGCGCGACATAGACCAGCATCAGGCTGGTCAGGATCTCATTGGTGTTGAACCGGGTTTTCAGGAATGCAGGGATTGCGGCATAGGCCGCACCGCCAACCATGCCCATCAGCAGCATGAGCGGCAGCACCAGAGGGCCCTGCAGGTCCGGATAGAGCACCGGTATGACGGAACCCACGATCGCCCCGAAGACGAACTGGCCCTCGGCGCCGATGTTCCAGATGTTGGAGCGGAAGCAGACGGACAGGCCGACCGCGATCAGGATCAGCGGGGCTGCCTTGATCGCCAGCTCGTGCAGCTGCCACACTTCGGTGATCGGCGAAATGAAGTAGGTGTAGAAGGCCGTCAGCGGATTGATGCCGAGCAGCGCGAAGAGGATAGCGCCCGCCACTATCGTCAATCCGAAGGCGATGAAGGGCGACAGCGCCGAAAACAGCACGGAACGTTGCGGGCGCTTGACCAGTTCCAGGCGCATTACGCGGTCTCCAGCTTGTGCTCGACGTGACCGGGTTCGGCGCCGCCCATCAGCAGGCCGATCCTTTCGAAGGTGGCTTCCGCGATAGGGATCGGCTTGGACAGTTCGCCATTGTGCATCACCGCGATGGCATCGGAAATCTCGAACAGTTCATCGAGGTCCTGGCTGATGACCAGAACCGCCGAGCCGCCACGCGCCAGTTCGATCAAGGCCTGGCGGATGTGGGCGGAAGCACCGGCATCGACACCCCAGGTCGGCTGGTTGACGATCATGACCGTCGGCTGGCGATCCAGCTCCCGGCCGACGATGAATTTCTGCAGGTTGCCCCCAGACAGCGCGGCAGCTTCCGGATCCGGCGCGCTCTTGCGCACATCCATGTTCTTGATGATGCGCTGCGCCGCTTCATAGACAGCGCCGTCCTTGACCATGCCGCCCGAGCCGACAAATGCCTTGCCGTCGGTCGCGTAGCGCGAGAGCAGCAGATTTTCCGAGAGCTTCATGCGCGGTGCCGCGCCATGCCCCAGCCGCTCTTCCGGAACGAAGGCTGCGCCCAGCAGACGACGACCAGTGA
TGTCGAGTTTGCCGGCGTCCTTGCCGCGGATGCGCACGGCATCGGCGCTGGTTTGCTGCACCTCGCCGGACAACGCCTCGAAAAGTTCGCCCTGGCCGTTGCCCGCAACGCCCGCGATGCCGAGGATCTCGCCTGCTTTCACGGAAAGCCAGATGTTCTTGAGCGGCATGGAAAACGGTCCGGCGGGCTTTCGGCTCAATCCGCAGACGCTGAGCAGTTCCTGCGCGCCTTCGAGCGTGATCGGGGTTCGTTCGGCAGATTTGACCTCGTTGCCCACCATCATGCGGGCCAGCGACGAAGCCGTTTCCTCGCGTGGGTTGCAATGGCCGACAACCTTGCCGTGGCGCAAAACCGTTGCCCGATCGCACAGGCGTTTCACTTCCTCCAGGCGGTGCGAGATGTAGAGGATCGACTTGCCTTCGGCGCGCAGCCGCTCGAGGGTTTCGAACAGCTTGTCCGCTTCCTGCGGCGTGAGCACCGACGTCGGCTCGTCGAGGATGATCAATTGTGGCGCCTGCAGCAGGCAGCGGATGATCTCGATGCGCTGGCGTTCGCCGACGGAAAGGTCGCCGACGAGCGAGCCGGGATCCAGCGGCAGGCCGTAAGAATAGGACAGCGCCCTGGCTTTTGCGGCGATCGAGCCGATCGGCGCGCCATCGTCTAGCGAGAGAGCGATGTTTTCTGCTGCGGTCAGCGCTTCAAACAACGAAAAATGCTGGAAGACCATGCCGATGCCGAGCTTCTTTGCCACGCTCGGACTGGTCATCCTGACCGGCTCGCCTTCCCAGCGGATTTCGCCGGAATGGGGCTCCAGCGATCCGAACAGCATCTTCACAAGTGTCGACTTGCCGGCGCCGTTCTCGCCGAGGAGCGCGTGGATCTCGCCTTTCTGGATGGTGAGGTCGACATGGTCGCACGCCGTCAGCGTGCCGAATACCTTCGTCAGCCCGCGAACCTCGAGCAGGCTCGTGGCGGCTTCTTGTGCACTCACAGTGCCTCCCATTCGGTTTTCCCGAATATGTTCTGTGTTCCCGCGATCATCGTAGGCAGCAGCGACTTAACTCCGATAGCGGCACGCAAATTGAAAGAGCTTCAAGAATTTCAGCAAAACGCCAGTTGAAGGCTCACGGAACAAGCACAGTCGTTCCTGTCGTCTTGCGTCCTTCAAGGTCGGCCTGGGCTCTGCCGGCGTCTTTCAAAGCATAAGTCTGGTTGATCTTTATTTCGACGACGCCCTTCGCCACGACGTCGAACAATGCCTTGGCCGACGCTTCGAGATCCTCGCGTCTGGCGTTGTAGACGAAGAGTGTCGGCCGGGTGGCGAACAGCGAACCCTTTTGCGAAAGGAGCGCGATGTTGAAAGGCGGGATCGGTCCGGATGACTGGCCGAAGCTGACAAAGGTGCCCAGCGGACGCAGACAGTCCAGCGAACCCGGGAACGTGTCCTTGCCGACCGAGTCATAGACCACATCGCATTTGCGACCATCGGTCAGTCGGGCCACTTCGGCGACGAAATCCTGCGTCCTGTAGTCGATGACATGGTCATAGCCATGTGCCTTGGCAAGCTCGGCTTTTTCCTGGGAACCGGCAGTGCCGATCACGGTGGCGCCGAGATGCTTGGCCCATTGTCCGAGGATCAGGCCGACGCCGCCGGCGGCGGCATGATAAAGGATCTTATCGCCCTGCCCGACCTTGAACGTGCGTCGCAGCAGGTATTCTGCCGTCATGCCCTTCAGCATCATGCCGGCAGCCTGTTCGTCGCTGATTCCGTCGGGGATCTTCACCACGCGGTCGGCGGCGATCACCCGCTGTTCGGCATAGGCCCCGGTATTGATCGCGTAGGCGATACGGTCGCCTTGCTTCAGCCAGCTGACGCCTTCGCCCAGTTCCAGCACCACACCGGCGGCTTCGCCGCCTGGTATAAGCGGAAGACCATTCGGGGCCGGATAGAGCCCGGTGCGAAAGTAGATATCGATGAAGTTGAGACCGATGGCCGTATG
TCTGACAAGGATCTGGTCGGGACCTAGCTGGCCTGGATCGGCGTCCTCATAGACGAGGGCTTCGGGACCGCCATGGGCGTGGATGCGAACAGCCTTGCTCATCTTCACACTTTCCTGGCGCCAAGGTTCGGGAAGAACTGCATGACGCCGCCGATACAAAACAGGTAGATGCCAGTGATGCTGATGCCGATCTTGGTGAAAAGGCTGACCTGTTCGCCCAGGACACCGATCTGATTGTAGAAACTGGCGAGGGCCGACTGCGCCAGCGCCAGCGCGCCGAAGGCGCACCAGAGCAGCGTGACGGTGAGGTTCACGGGCCGCAGCCGCACGACACGGACCGGGTGCAGGAAATTGATCGGCACGAAGGTGAGGATGCCGGCGATCACCACCACGGCGAAGGACACCCATTGCCCCGGTTCAATGACGAACAGCGTGAAGACCACCATGTTCCACACCACCGGGAACCCCTTGAAGAAGTTTTCCTTCGTCTTCATGCCGGTGTCGGCATAGTAGATCGCGCTCGACACAACGATGATCGCTGCCGACAGGAACGAAAGCCCCTCGCCCATGAAACCGCGCTGATACAGGGCGAAGGCCGGGATCAGGACGTAGGTCACGTAGTCGATGATGTTGTCGAGCAACTCGCCTGACCAGGTTGGCAGGATTTCCTTCACTTCCAGTTTGCGCGCGATCGGCCCGTCGATGCCGTCGACGAACAGGGCAAGACCAAGCCACCAGAACATCGCAGTCCAGCGTTGCTCGCTCGCCGCCACCAGCGACAGGAAAGCCAGGAACGAGCCGGATGCGGTGAGCAGGTGGACGGAGAACGCCCTCGCCTGAGGCCACGTCACCTTCTTCTTCGGACGTGGGATACGCTCGGCGATCTTTTTCGCGGTTGTCTTGTTCTTCACGGGCCCCGCCAATCCAGCTTGCAG
Protein sequences of DBSCAN-SWA_2 >CP029562|5809416:5858276|5822003_5822762_+|QAZ46127.1|DBSCAN-SWA MRAFAAVPPTVWQTDIKKLRGEPEAVAVYFHLLTSPHTNMIGMYPLDLQYVAIDLGSPLEGASKGLRRVCEAGLATYDEVNEIVWVHDMAISQVAPRLSPKDNRVQAVAKLLATLPICPITLSFYTRYRDVFHLRDALVLEDFERASRSPFEAPSEPLRSKEKEKEQDKDPGQGKEKLGSEGEELGATRVSDEREDPYEPRSSIEDGRRFLIGIGVPVSRMETALQRLMRGALFPCDVDEWKHEAVAMRGAA >CP029562|5809416:5858276|5845941_5846184_+|QAZ46157.1|DBSCAN-SWA MLALVLQYWPVIAGALAAAFAGWKLRQSGVNAERARQAKEKVAAAEDRLEMDREATDIERRVTGMTDEEARKEATRWARR >CP029562|5809416:5858276|5848493_5848736_+|QAZ46161.1|DBSCAN-SWA MPSPAAENAAQRVQRLCRELSDALDEWTGGDFMAEVFPASVVLEGFKPYAFDQRRYALDPTRRVLIDCARQFQDLRERDA >CP029562|5809416:5858276|5846407_5846818_+|QAZ46159.1|DBSCAN-SWA MSEEANMPNVSEKIVNSVIITMAARLSMVLAIPTLTFLFWLYTGWQAEKFDKVQDQVEQTQKTAQQASDQSIRLSERLTTVETKQAEATVSNERFQNATLNRLDRVQDSIVGLSNTVAALTATLQAIVEDKRRRPP >CP029562|5809416:5858276|5837282_5837891_+|QAZ46146.1|DBSCAN-SWA MIVDLPAIRFAPTRPELIDSVSMSRAGNRVISVVDYADPFWQIPMQTLRMTAKELQLLLAFRDRVRNGMVTVVHRPTDNCLPQAYWGNAGAIQLNDGTLQSVTNGFNVTLNNLINGLKMMPGDLFSLKAGNYRSRHRVMTGGTAAANALALTVEPFVPPYITPGAVAKFLRPELNTRVLPGSFSVSDDHFPVASFTLVEVPQ >CP029562|5809416:5858276|5854769_5856332_-|QAZ47564.1|DBSCAN-SWA MSAQEAATSLLEVRGLTKVFGTLTACDHVDLTIQKGEIHALLGENGAGKSTLVKMLFGSLEPHSGEIRWEGEPVRMTSPSVAKKLGIGMVFQHFSLFEALTAAENIALSLDDGAPIGSIAAKARALSYSYGLPLDPGSLVGDLSVGERQRIEIIRCLLQAPQLIILDEPTSVLTPQEADKLFETLERLRAEGKSILYISHRLEEVKRLCDRATVLRHGKVVGHCNPREETASSLARMMVGNEVKSAERTPITLEGAQELLSVCGLSRKPAGPFSMPLKNIWLSVKAGEILGIAGVAGNGQGELFEALSGEVQQTSADAVRIRGKDAGKLDITGRRLLGAAFVPEERLGHGAAPRMKLSENLLLSRYATDGKAFVGSGGMVKDGAVYEAAQRIIKNMDVRKSAPDPEAAALSGGNLQKFIVGRELDRQPTVMIVNQPTWGVDAGASAHIRQALIELARGGSAVLVISQDLDELFEISDAIAVMHNGELSKPIPIAEATFERIGLLMGGAEPGHVEHKLETA >CP029562|5809416:5858276|5825514_5825763_+|QAZ46133.1|DBSCAN-SWA MRVKDVRPADLIREAGHLQRLEIKPGERFVLTCDYALSEADADRVRQAWDAFMGDVPLLVLTDGISLGAVSHPEPVEAEPAA >CP029562|5809416:5858276|5838461_5838890_+|QAZ46148.1|DBSCAN-SWA 
MLKILPRLPDWDRRLARVTEKHLRLPGEWGVSDCLMTAMDAVEAVTGFDPAAKVRGTYSTEQGAAKLLRRRKAETVEEMLAKLFPTLPSAFSALRGDLVVVERNGVLSAGYVCEYEVAVKTETGLAFVGITEIRSAYQVGAR >CP029562|5809416:5858276|5833142_5833640_+|QAZ46140.1|DBSCAN-SWA MKIAGKQKFLGQIAALPRAMKDEIRKALEVSAEETTDLMKRFAPVKSGALKASIGYTFGTYTPDNANVRGLKASGSDATELTVTLHAGDKKAWYASLVEFGTRAHTIKAKRPGGLLNIHGRLIEEVHHPGATASPFFFPGYRLGKKRARTRLARAVRNGAKKAFK >CP029562|5809416:5858276|5846168_5846411_+|QAZ46158.1|DBSCAN-SWA MGKALRLICASLACLSLTAACSVRDKGPLPSDPRSIWCDHNSPRRDARSDTPRSELDEINKHNFQGAKWCGWKPSEGSGH >CP029562|5809416:5858276|5835097_5835325_-|QAZ46144.1|DBSCAN-SWA MDNWLKGLVAAACVTVIAVGGFYFWQQHQQAVVAEELGKQETMRRGCLAAVDENLPALKQYCAEKGYITQEQAQR >CP029562|5809416:5858276|5847468_5848446_+|QAZ46160.1|DBSCAN-SWA MRPQQQPSYAPLPDIDLRQLFGETEGIELTKKGEAYAEAAKTDVFFLRITPKDRTSIAYTIESLINLLDAMEPDPDLEPDLGSGDDREGNGGAEGGDEDREPSLGWSADGQVGQNPAWCNCWGELEHDDADDEFTLGWEKGSMPYDQTYLNMFGTDEAEDVSEDEGATTGDDEYSLGTTEEIDQERRQDVAEGYFIPDGEPYLGWAESFGRGVVGEQDCLDDREHEDEREQDQAEMGIADQDALQDPGIWPSHLPTFPGCKGAVDFQGDGVREANAMLKKARAQQARPSASPNIINLRNFAPTDDDILSLDMLAHYRAAHGQRPC >CP029562|5809416:5858276|5845753_5845939_+|QAZ46156.1|DBSCAN-SWA MGPYVRILLRYGVGAVIGYQIGDQLSNDPDVVTVATVAATAAVGVATEFAYHLARKFGWER >CP029562|5809416:5858276|5833636_5834032_+|QAZ46141.1|DBSCAN-SWA MMIGDQLQKAIYAALTAAPALVDGKVFDTVPPDTEAPYIHIGNEQVLDDSDDCAEGWEVFTDIHIWSEPETGSKLEVKSIGPAVVQRLTQITAIEGFRVVIASIENIRYLDDPDGVSKHGIVTSRHVLEPA >CP029562|5809416:5858276|5821635_5821872_+|QAZ46126.1|DBSCAN-SWA MSKLFFKVSWQHNIDRYRDLQKLARKAGMSLQVKNVSRTRGIPVFLFFLTEPGKNGQPFHSLEDAASALNAMAGMGKV >CP029562|5809416:5858276|5850207_5851524_+|QAZ46163.1|DBSCAN-SWA 
MTAAPRISFVSLGCPKALVDSERILTRLRAEGYEIARKHDGADLVVVNTCGFLDSARDESLDAIGTALKENGKVIVTGCMGATPELIREKHPNVFAITGPQAYESVMAAVHEAAPPAHDPFTDLVPPQGVKLTPRHYAYLKISEGCNNRCTFCIIPDLRGDLVSRPAADVLREAEKLAAAGVKELLVISQDTSAYGVDIKYASSPWKDREVRAKFLDLSQELGELGVWVRMHYVYPYPHVADVIPLMAEGKILPYLDIPFQHASPQVLKNMRRPAHGEKTLDRIRSWREACPDLAIRSTFIVGFPGETEEDFEMLLDWLDEARIDRAGCFKYEPVKGARSNDLGLEQVPDDVKEARWHRFMQRQQKISAAQLQKKVGKRLPVIIDEANGAVAKGRTRYDAPEIDGSVHIQSRRPLRVGDIVTVKVDRADAYDLHGMAV >CP029562|5809416:5858276|5830088_5830322_+|QAZ46136.1|DBSCAN-SWA MKKSSYMTRALQCRDPRFAVILGKLGYEAADMRAAPAGGQVDLLADLRAEYALIVGKKPFHGWDAATLREKIAAAKG >CP029562|5809416:5858276|5825875_5826322_+|QAZ46134.1|DBSCAN-SWA MEKPEDKNPSGKRGRPAYRPALKDRQTVEQMKFCGESENTIARSLGIDVGTLRKHFAEELENGYANRRHEVIGLLFEAARSKNVAAIKKLEEMGRVANAAEAVNGRGKGERQPKLGKKEERQIAARNVGGKFATRQPPQLVAVSGGKK >CP029562|5809416:5858276|5834553_5834958_+|QAZ46143.1|DBSCAN-SWA MNRHGAIEITWAGGDHTFRLGLDEVEELEAATEMSIFLLYSSMVSASPFAKLKHYSETIRIGLIGGGMSPVDARLMVRRYVDIRPLVESVALAAVIIRAALERVHSNEVDTSSGEAETVEPNGSTSPSFEETPS >CP029562|5809416:5858276|5841915_5842380_+|QAZ46151.1|DBSCAN-SWA MIRQAFRGFDGGGVFKFRILKPGSTADARYASVDDCLLHEQMLATQPYWLGFVPCPFAGSTTKDPLDQTTNVSIPDAGVTDPEYMLWPRNVSGRNCFPRPASVGVGNSQDGYPSEAWAIRMVSITANTLTLRFIKPALSTQSPLGCSVALFKRG >CP029562|5809416:5858276|5835336_5837286_+|QAZ46145.1|DBSCAN-SWA MAIEAERLLAIFEARFTSLEKSLEKARTNANKAFGDIEASGTRAEDALSKVGEKGLPGIDRASKSVAAMKGNTANLAAQLNDIGVQLAGGQSPFLIAIQQGTQINQALGPTGARGAVAALGGAFMSLLNPVGLATLAIIAGGGYAAQYFLEIISGGEKSAETLKNEAELIQRVVNKWGDALPALQAYAKEREALADKKDLEEATNIGKDNAFEKTREQVDGLRADLALLIGDLGSFAGQETQVARLQDAFSILQQRIESQTATAEDAKRVQTVLADVLSGTGVPAAGSLADEFSRLAQEIAKASAKAFEFQSQQDGRIRASTKGGPARYQSGQIDLPETAPTPDRTPNREDVFAEQDRYRERAARRGARGQRLNAGQRTDEDLRAVKDRTEALRQEAAMIGLSFQEQERRRMALDLEQEALKRLREEAARKGQKDLESIRLSPDQVAKINAVSEAYAQQAEVLRQVRERQQEAEQASGEFYDTFKNGAIDAITGARKLSDVLKELGTRFASMLLNSGFDMLFKPKSGNSSGGSFGSVFDLIGKFITGSFANGTSSAPGGLAVVGERGRELVNLPRGSQVVPNDITEKLIGGDGGGSPIYIGGPTINVDARNAQPGVGEEVRRAVKDATGNMTQLVRNALREMKVKGMKP 
>CP029562|5809416:5858276|5838889_5841490_+|QAZ46149.1|DBSCAN-SWA MAFLGPVIAGVGSVLSGAFSFFSGSTILAGALKIGLGLAAQYAIGELFKPDVPAQVSHLETQYGQDLPRSVVLGTRGLAGHHIYRNAYGKGGRTVQDLYVLSHFRIFGVPRVRYEGKWYSLGGAVDAERGQRIQGIEPEIWVKVYQGDLAQAADPGLIARANPAGRWTVAHRLAGVAYAVVTQQLDREKLPNAWQAFFEVTGVAYDWRQDSSAGGSGAQRWTDPTTWGQSSNPIVLQYNLERGFWLGDQLIVGKGISTDRLPIGKWTLAANICDEVVGTGARYKAAYIATAGQGITHAGNMAPLQAACAASWYEGPDGEYPIAGASQAIVATITDDDLVDGEDKNFSRYRTRSELVNTIAGTYASPDAFYDGVPLATRTDAPAFAADRERLATSIPYDAVTDPKVGDRLNDIALRASRYQGNGEICLRHKFLGLKVGQWFQWQSEDEAGGFTKTFQIVRRRLGPIGPNSCRLVYVTIQEVGEGIFDPTAYQTNPPDATGPGAPDYQSELVNFNITAIKNTASTGSARAGIRASWNPIEDQTITEIRIEYGPVNAVGDIPASTVPAPPALKNQTIHDLYQGVMSNSLWAVRYILRSQPARAFFPSPWKYVKTLQATLGDDDVYLPGMVEEVNRNTANLLPPLQEDIRTLIERAEKSAQQSLDQDAGQFLNKVSQVTQLKSAYETVTAEYKSVVFAATGPNSAIVQRLDTYEARFADMASATALNLVKVQVDQQAGQITAQGTLISGVQATVGGISAQGLFRTYVTATEAGAEATVGLAASATANGSTRSAAILLSARSDGKTMAGLVADLIYFTDGAGSKLFPVAIENGTLRANFANLGTVTAAILQSPNGLHIYNVANGTQEWWRA >CP029562|5809416:5858276|5842906_5843125_+|QAZ46153.1|DBSCAN-SWA MHPVAALTEANALVGFLQNRNLMLAHDLLSLKGENDGYRQRINGLVEQVLTLEAQIKELTAAPIASEPDEEA >CP029562|5809416:5858276|5814310_5815504_-|QAZ46118.1|DBSCAN-SWA MPSPARLRRFSTYALLAVSVGVLAACASSEPKSMVSKKRSKEYFAESEYGVKASPRVAMRRGGGRDQLGKPYQVRGKWYYPKEEKRYSKTGTASWYGEAFHSRMTANGEVYDLSQLTGAHPTMPLPSYARVTNLDNGSSVIVRVNDRGPYHEGRLIDLSKRAADMLGYSKIGTARVKVDYVGRAPLEGNDDSYLMASYHPGNRIPDPSDGLPTGVMVAMNGSMPSAPVRASAAAIPFPGQLTDTTSLPEPVLAAQPMGADSVALPDFGPIAPARPDFAVSPKLPFEMASLNYAAERTERSASAFAALEGPAAALGSWKQPEDPSDDYVAAGTFDNAREAKQVAAALKAYGRVEIDQAKLDGKDWYSVNLYQDGRMSVDSMLEAAWSHGAPDAFAVRD >CP029562|5809416:5858276|5817927_5818809_+|QAZ46121.1|DBSCAN-SWA MVALNRIPLNGLRAIEVAGRLGSLAKAAEEMGISVGAVSQHVIRTEKLLGRPVFERTARGLTPTPFGARMLASLEPAFRSIAQALDQAGATDRTVLAVTTTPVFATRWLVPRLSAFQRKRPEIQVRIETGLTLTDLDRSDLDVALRMGRGTWPGTRAELLLRQRIFPVCAPALAANLRTPADLKSAPVIRYPAMENWTDWLAPHDMAEADLPQGLSFSDASLSLEAATAGLGVMIAWQAIAADALADGRLVKPFGWEVETGIDLWFVASAAHANDAKIVAFKRWLKSELAGQE >CP029562|5809416:5858276|5821399_5821639_+|QAZ46125.1|DBSCAN-SWA 
MIGMNERRNILTLDEALSRPTISVVDAGAIFFGLARNASYDAAKSGDIPTIRIGGRVLVPVAPLAESLGLKTTFGRAAA >CP029562|5809416:5858276|5843127_5844261_+|QAZ46154.1|DBSCAN-SWA MGITYYDTGAATLSVGSKTMTGQGTLWAGWVKPGDQVMPEEGTNNVVDVVVSNTEITLLKPYHGVAQAARPYTIMRTPDVVFTESLARKVLQDISSSPLVALAGLPAVSRSLIGFGPTQVAETVPLSDKVKAFLATIPSDDVVSLLAAANFASIRNSLNVAPKQSAIDDATAGAGLTVGAFGLGGVIGLSIPLNDYNKALRSGIYAGSGASAVNGPPNGAAYGPMLVMARASTVVTQMAGFGDTVGSAFTLYLRHTGDGGATWTPWRPIVPEKGANSSGFYSRFADGTQICWMQDLTFGPITTAAGSIYSGAAEQTFSFPAAFASAGQIDCLVMSDSTNIWGACSVPSNTAVRAQLFCYASIASSRKANILAHGRWF >CP029562|5809416:5858276|5823692_5824313_+|QAZ46130.1|DBSCAN-SWA MPSKPIDYALEAALADLIWIVKGRFVEAADTIANMDVGQLRPSKARSLWPAYQVDNVGGHHPGYGINGSNVQYRPSSITISRAEEVMYGWLLDYVDGDDARRLLGQWAACQATPRLAGSFRAFCKKSGLSRSTAERRVDGAFKAVAAAILKNAQSLQGPSWSRVMPMMPNSRIDPGKMATVTRWMAPDAKPRHIPEMLEPLRGEAA >CP029562|5809416:5858276|5826318_5828019_+|QAZ46135.1|terminase|DBSCAN-SWA MKEWSTACPDWRERIVEGRSLIPFDPLYPDQAEEAVNIFKSLRVVDVAGSPTFGECCEEWVFDFVRAIFGAYDADTGQRQIEEFFLLISKKNSKSTIAAGIMLTALILNWRLSAELMILAPTIEVAKNSFEPARDMVYADPELLELLNVQEHIKTIEHRVTKAKLKIVASDSETVSGKKAAFILVDELWIFGERDGAGAMLQEATGGLASRPEGFTIYLSTQSDKPPAGVFKEKLEYARDVRDGEIKDPRFLPVLYEFPPEMIEAEEHKDPANFYMTNPNMGRSVRQDYLERKLRQILKGEGEEGEDIQTFLAKHLNVEIGLRNRRDRWGGTDFWSDAAEEGLTLETLLERCDVVTIGIDGGGLDDLLGFAVIGREKLTRHYLVWCRVWAHPIVLKRRKDIAEQLRDFEKDGDLVFCKTSTQDLDEVVAICCKVRDAGLLPEKAGIGVDKLGLPALLDSLIAAGFKTDEEGGTVTGIGQGGFLNDVLVGVARKLSDGSLKHAGQAVLQWAVSNAKEVLKGSARAVTKQTSGTAKIDPFIALLNAAKLMSRNPEAYRKKKATVRIIG >CP029562|5809416:5858276|5841474_5841915_+|QAZ46150.1|DBSCAN-SWA MVAGMTTSAAFWDWVVDHWRGRWVLPGYSARDANVPRNKVVMDTDDIGMLSKLANGQYTLSYGGSLDTGPVTIASWTDPGFVPLCLYRFSIAGGEWENVYTMQNTGSTSGNYYIKVHRTGIIASLKLINSSSPVTIAWTALRLKVM >CP029562|5809416:5858276|5813053_5814226_-|QAZ46117.1|DBSCAN-SWA 
MIAKSRFCSSLAGLFLLSLLALSAWPAAAQLFETKAATAYMIDAETGTVLFSKQADKPIQPASLAKLMTLEMAFNAVKSGRMTLNDTFVVSENAWRKGGAPSGTSTMFAALKSAIRLEDLMKGIAVQAANDGCIIVAEGFAGSEGNFALQMNERARQIGLPISTFVNATGLPAEGQKTTVREMTLLALRLSREYPQFYPYFALKDFTWNKITQRNRNPLLAMDAGAEGLAVGASEQDGFAIVGALNQGGKRVIAAMSGLANDRERSEEVRKLLDWGLRSFEKTEVFAKDEVVGEAAVFGGVKSGVALKAKGAIDLVLPIANRDKVTARIVYNGPLPAPVEVGQPVGTLRVWIGDTLSQETPVFAAETVEVGSLPRRAADAVKELAVGWLR >CP029562|5809416:5858276|5812103_5812814_-|QAZ46116.1|DBSCAN-SWA MFDVGATTEAQRLANGSFITFEGGEGAGKSTQIKRLAERLRGKKYDVLVTREPGGSPGAEAIRHVLLSGAAEPFGPKMEALLFAAARSDHVEQVIRPAVERGAIVLCDRFIDSSRVYQGATGGLDPAFIDTLQRVAINGMVPNLTLILDIDPEEGLRRATARRNPGETPDRFEKETLAIHQQRRDAYLAIARAEAKRCVVIDASGDANTVEHAVTDAVFAMLEKSRPAKTSHKAPA >CP029562|5809416:5858276|5844260_5844581_+|QAZ47563.1|DBSCAN-SWA MIITLSPQRRDDTLVVSKAGDILTINSEQFDFSTLPDGATIPHGDIPCEWIAGSVDRIAGELQIALILPHGPNPSQAEAFPDPITVMSDGPIALPHDEEETANVDA >CP029562|5809416:5858276|5856465_5857443_-|QAZ46167.1|DBSCAN-SWA MSKAVRIHAHGGPEALVYEDADPGQLGPDQILVRHTAIGLNFIDIYFRTGLYPAPNGLPLIPGGEAAGVVLELGEGVSWLKQGDRIAYAINTGAYAEQRVIAADRVVKIPDGISDEQAAGMMLKGMTAEYLLRRTFKVGQGDKILYHAAAGGVGLILGQWAKHLGATVIGTAGSQEKAELAKAHGYDHVIDYRTQDFVAEVARLTDGRKCDVVYDSVGKDTFPGSLDCLRPLGTFVSFGQSSGPIPPFNIALLSQKGSLFATRPTLFVYNARREDLEASAKALFDVVAKGVVEIKINQTYALKDAGRAQADLEGRKTTGTTVLVP >CP029562|5809416:5858276|5819689_5820856_-|QAZ46123.1|integrase|DBSCAN-SWA MKGHVRERSPGHWAIVLDVGERDPKTGKKKRKWHSFKGTKRQADSECARLIAELEAGNYVAPSKQTVAQFLDEWLAFVGPSVAPKTLERYTEICRKTIAPQIGDVILSKLKTDRIDTALTTMLTAGRRDGEGGLSPRTVHHARRVLIKALGQAVTWERLSKNPANATTPPKVERKKMLAYDAAQTAELLEAIRPTRMFIPTVLAVMCGLRRGEILALRWRNVSDNLKALSIVESAEQTKDGVRYKEPKSGRARTVALSSTVLAELRAHRVRQAEEQLRLGVRPDGDSFVVAQYDGQPIQPRSLTHEWVRIVGKTSLPRIRFHDLRHTHASQMLAAGVHPKVASERLGHSTIGITLDLYSHVMPGMQADAAEQVDAALQAAISASRKAK >CP029562|5809416:5858276|5809416_5810970_-|QAZ46114.1|tRNA|DBSCAN-SWA 
MSREKFYITTAISYPNGKPHIGHAYELLATDSLARFQRLDGKDVFFLTGTDEHGIKMLQTAKKEGITPRELADRNAAEFRRMAQALNASNDDFIRTTEQRHHASSQAIWKAMEANGDIYKGGYAGWYSVRDEAYYGEEETEVRADNVRYGPQGTPVEWVEEESYFFRLSAYQDKLLALYENQPDFVGPSERRNEVASFVKSGLKDLSISRTTFDWGIPVPGDEKHVMYVWVDALTNYITAAGYPDTQSPSWSYWPATHIIGKDIVRFHAVYWPAFLMSAGIELPKRVFAHGFLFNRGEKMSKSVGNVIDPFTMVDHYGLDQVRYFFLREVPFGQDGSYSHEAIVNRTNADLANDLGNLAQRSLSMIAKNCNGVVPQRGTLAEADNAMLEQATAALAAARKAMSEQAIHQALAAIFAVVADANRYFAAQEPWALKKTDPARMETVLWTTAEIIRRVGILCQPYIPGSAAKLLDTLAVADDKRDFEHLADAYALVPGTALPAPQPVFPRYVEPAPEATA >CP029562|5809416:5858276|5824903_5825296_+|QAZ46132.1|DBSCAN-SWA MPEPVSTKLKTLGQRVGTVSSRFASQAHDGKERDRQRQQTSEWRAWYKTARWQKLRRKVLKRDFYTCQRTGVLLIGKHPAPNSPVVDHKKAHRGNPELFWDEKNLHAVSKAYHDSTKQAEEAKDIKGVWY >CP029562|5809416:5858276|5828051_5830025_+|QAZ47562.1|capsid|DBSCAN-SWA MSVTRRAYSSLTIKSVSDEKRIIRGIATTPTPDRVGDIIEPLGVKFTNPLALLWQHQHDKPIGTVKFDKPTKAGITFEAEIPIVEEDGTLRDRTEEAWQSIKLGLVRAVSIGFRAMEYAFLDSGGIHFTKTEAYELSAVTIPAQPDAVMTSIKNMDAAGVAIIKTFDPNAPAATGELQRPTKTAPGATGKSTTPVNLRPKEGTTMKTLAEQITALEAKRAAHAARMEAIMQKSADEGRSTDQAEQEEFDGLSDDVEVIDGDLKRLRALEKAKAASATPIVANQIKTAELGTAIRTGVALEPAKLEKGIRFARYAKCIAIATKTNQPITAVAETLYGKADPEFIEITKAAVSAMTTGNTGQLVGNIGGFADFIEFLTPQTIIGKFGTGSIPSLSKIPFRTPIIGELSEAVAQWVGEGNAKPLTRTTYGRTTLDPLKVAAIAVQTMELIRDSSPSSDVLIRNALAKSIVTRLDLTFLDPAVAAIAGIRPGSILNGIAPVAPSAATGADGVREDAQKIMAAFVTANNPLTSGVWLMSGLTALKIIGFLNPLGQREFPGLTLQGGTFFDLPVIVSNYVGDYLTLVNAEDIHYGDEGGIEVAMSTEASLEMDSAPTHDSDTPTAVELVSMFQTNSVAFRAERTISWARRRPTAAAYVVPAWN >CP029562|5809416:5858276|5842383_5842830_+|QAZ46152.1|DBSCAN-SWA MATLEARISNAGIEIAKPGYDVRTASLANMVFSPNLVAMRVAFEGTFTAGPHGEGNPYAAYYKAIKYFDTPFPNDPPYALAAGIGSDGTSYQAPFVINSAGGNTLEVTPHYELNTYTNRIELYVMQWTAGGVILPLTWKFFILQNTLS >CP029562|5809416:5858276|5815760_5816651_-|QAZ46119.1|DBSCAN-SWA 
MSLPFAIIDVFAASPLAGNPLAVVQGGDELPDDTLAAIAGEFNLVETTFVLQATLPGATRRLRSFTAAGHEVFGAGGHNSLGAWLWLAQSGRLELKERRTDFTQEIGGQLFPLSIDATPGQPIRVTMAQSPARWGSQVEDLAGLAASLGLAPDTVNGDGPPIQVVDTGAGHLMVALRDREAVDAARPDVDALGAILRQSGAEGCYVFSLDPPSVETLAYARFFNPTMGIGEDPATGTAAGPLAAHLVHHGFASPERVVIEQGTKLGRPSLIEVEIDGLAVRISASGVIAAEGKLWL >CP029562|5809416:5858276|5820852_5821314_-|QAZ46124.1|DBSCAN-SWA MARPKLGDSESIRLQMVITKDEIGAIDDWQFRHRVPNRSEAIRRLCQIAMRYEDQEKELMSALRKVAEAMKSTTAAWKERNKSGDQTDEVEFLKDEYRKLYRRTNILMHRAQVARLETWALARGGDLKEAMRLADEKRSELEGMISGMEEKDQ >CP029562|5809416:5858276|5816675_5817830_-|QAZ46120.1|DBSCAN-SWA MSQANHVATSDPTAIQPFRIAIPDTQIADLKARLGKIRWPYATTNDHSRGQPVGFVMELVDRWMNDFRWREHEARLNSYPQFITGIDGQPIHFLHIRSQHPDAFPLILSHGWPGSVMEFLDLIEPLTNPTSGGQAFDLVIPSLPGFGFSSPLREGGWDSARIARAWDTLMKRLGYDRYGAHGGDIGSGIGRELGILQPHGLVGTHVLQIFAFPAGADGEMDRLSPFEMEGMAILANFEKYNGYHQIQAKRPGTLAYGLVDSPVAQMAWNTELWFGFEGNNIDNVDRDRFLAQASLYWFTGTGGSAANVYYEDQQTGAGYREVMNPTPTGVAVFPNDFRSVRSFAERANNIVHWTEMPRGGHFAASDAPDLLAQDIRAFFSKLNA >CP029562|5809416:5858276|5824588_5824783_-|QAZ46131.1|DBSCAN-SWA MASWLFAPVASIRSSDGEGLAATACTVAAPAIGAKPSIARNDLRFMLVSFVKVRQCNGPKNSAG >CP029562|5809416:5858276|5834089_5834557_+|QAZ46142.1|tail|DBSCAN-SWA MATPPMPVKSMNGTQLLVQIGDGATPEVFAADCLINTQRGITFSSDTNEDINPDCNNPDDPAWKEVTKDGLSGQIAGAGRVHTPSIKDWWEWFISKDTKNCRVLLNNVTLANGGGYWSGAFHLTNFEVTGERNQKAQVSVTLMSSGPIVWVDAAA >CP029562|5809416:5858276|5851617_5852685_-|QAZ46164.1|DBSCAN-SWA MKKLLIALMTSAAAMSVAVSAEAADKFKACWVYVGPVGDFGYSYQHDQGRLAVEKKLGDKVETAYLENVAEGPDAERAFERLARQGCKLIFGTSFGFMDAEVKVASKFPDIKFEHATGYKTAPNLGIYNARFYEGRYILGQIAAKQSKSGVAGYIVSFPIPEVVMGINSFMLGAQSINPNFKAKIVWVNSWFDPGKEADAAKALFDQGADIIVQHTDSTAPLQVAQERGLHGFGQASDMIKFAPKSQYTAIVDDWAPYYIERVEKALDGTWKPEDRWEGIKAGAVFMAPYTNMPDDVKAMAEATQKKIADGWNPFTGPVAKQDGSAWLKDGEVADDKTLLGMNFYVKGVDDKLPQ >CP029562|5809416:5858276|5811051_5812107_-|QAZ46115.1|DBSCAN-SWA 
MSFERLAPEQHDTLDEVPEPAENPRLIGHSDVTTTLAAAYRAGKLPHALMLCGPAGIGKATLAFHLAYHLLKNPDGNRAPETLVVPDPASGLYRQIAMGAHPSVLHLTRPMNDRTKAFKSVVTVDEIRKVGQFLSMTSHDGGYRVVIVDPADDMNTNAANALLKNLEEPPARTIFVLIVHAPGSLLPTIRSRCQVLRLAPLGERDLMTVLDAIGQGTPDDAAQSAALAERAGGSARAAILLTQYGGLEIAAALEKLVSTRRVDLAGAYKLAETVAGRDQAIPFETFNRHALELLSSAASSSAIAGDLGRAKLLSDTWQDAQRAISETETYNLDKKQHALTMIDRLNSAMRM >CP029562|5809416:5858276|5837887_5838451_+|QAZ46147.1|DBSCAN-SWA MSYPARLEQLLDEGRIAIRGLILYKFGNAWMGVWTGNYELVYEGVTYVPNQLITAEPPDGAMGMEATEFVVTMPARSDFGITPDKLAEIESQDYKGRPCQLREAYFDPDTRELLHVEELAFGYVDYITHVTEGGEMRLEGHVISGALDNHRDGYRSASHEDQQLISEGDRGFEYATVIKTEKFAIEL >CP029562|5809416:5858276|5853657_5854770_-|QAZ46166.1|DBSCAN-SWA MRLELVKRPQRSVLFSALSPFIAFGLTIVAGAILFALLGINPLTAFYTYFISPITEVWQLHELAIKAAPLILIAVGLSVCFRSNIWNIGAEGQFVFGAIVGSVIPVLYPDLQGPLVLPLMLLMGMVGGAAYAAIPAFLKTRFNTNEILTSLMLVYVAQLFLDWLVRGPWRDPKGFNFPQTVQFNPSAVLPELMPMSGRANLGFIFALVAAVLVWLMMSRMLKGFEIRVLGSSPRAGRFAGFSFNRLVFFAFMLSGALAGLAGISEVSGAIGQLQPSISPGYGFTAIIVAFLGRLNPLGIVAAGLVLGLTYLGGEAAQSVLGISDKVARVFQGMLLFFVLGCDTLIHYRIRLVGLAPAPKIDASPKLEEAH >CP029562|5809416:5858276|5823274_5823703_+|QAZ46129.1|DBSCAN-SWA MSTSRKGKLDALLDGLGIKLVPVNRRRRAAQSHARATMHEICNQYDEGHLTFVLRCIKQTKNNRDELWSETIGAISDILAQRQDWAMERAGAVLEAFDQIPLGVLRGEAVARRPWPVRATLRTLIYKRLEAILDEPEQRLAV >CP029562|5809416:5858276|5823083_5823278_+|QAZ46128.1|DBSCAN-SWA MIGDDDIPEDVKRGVEKALFGNRTGYPTVSVKKKYRFPGPSERQLRERQDAQMSVYQVKDRGDD >CP029562|5809416:5858276|5818876_5819497_-|QAZ46122.1|DBSCAN-SWA MKPIPEDAWEPWSPQDLASRLGHDDPSWYVVGGWALDLWHGRLTREHEDLEFAVLPDQVGRCRDLLSDLEFFAARDGNLTHHPAKTALATDLWQLWGADMTAGFWRVDMMIERGTPDFWIYKREPAIRVPRTAAIRRNSTGIAYLAPAIVLLFKAKHRREKDDLDFRTALPRLDAGERADLTGWLEAMHPGHGWIASLREDRTAGV >CP029562|5809416:5858276|5832786_5833146_+|QAZ46139.1|head,tail|DBSCAN-SWA MIKMAMPASSNSLRERVAFDKRGTGSDGGGGVTTPWQETFSRRAAYVHRNTGEAVMADRLQSKRTLLIRVRADSQTRTITGGWRARDARTGQAYNILDATPTPDRRWVDILAQTGGPNG >CP029562|5809416:5858276|5832272_5832611_+|QAZ46138.1|DBSCAN-SWA 
MASLVTLEFVKAALHIIERDENGVVLDHEDDGLIQGYIDSVEEAVLRYLRRLAVTPPWTAADAPKAVKQAIVLGVASLYDPEAPELLSGLGSSDPKNPLVGLLCMMRKPTVA >CP029562|5809416:5858276|5849347_5850034_+|QAZ46162.1|DBSCAN-SWA MEIFTAAGLSALLQVIAIDLVLAGDNAIVIGLAAAGLPKDQRKKAILVGIAAATVLRIIFALITQWLLAIGPMLLIAGGLLLLWVCWKMWRELSVTHEEEHEATEALDDIDNGKASSGPARKTFAQAAWQIVIADVSMSLDNVLAVAGAAMEHPTVLIIGLALSIALMGFAASFVARLLHRYRWIAYIGLLIILYVAVKMLLDGAVEQFPEQFQFLSPWFGKAAGAAH >CP029562|5809416:5858276|5857445_5858276_-|QAZ46168.1|holin|DBSCAN-SWA MQAGLAGPVKNKTTAKKIAERIPRPKKKVTWPQARAFSVHLLTASGSFLAFLSLVAASEQRWTAMFWWLGLALFVDGIDGPIARKLEVKEILPTWSGELLDNIIDYVTYVLIPAFALYQRGFMGEGLSFLSAAIIVVSSAIYYADTGMKTKENFFKGFPVVWNMVVFTLFVIEPGQWVSFAVVVIAGILTFVPINFLHPVRVVRLRPVNLTVTLLWCAFGALALAQSALASFYNQIGVLGEQVSLFTKIGISITGIYLFCIGGVMQFFPNLGARKV >CP029562|5809416:5858276|5844567_5844852_+|QAZ46155.1|DBSCAN-SWA MWTPDPSIIITAEQKAAALIPSSVSARQFKLQLLAAGLLDQVEAWIAAQDQAVQIAYANSGTFVRSEPMMQTGFSALGFTPQQIDAFFVAASQL >CP029562|5809416:5858276|5831675_5832269_+|QAZ46137.1|DBSCAN-SWA MTQEELDALSAAMASVVKEHVDTATRPLLERIASLEAREPVSGTSVTSAIIDRAGNLVLTMSDGSTKDLGPVVGKDGEPGKDGSDGLAFEDMTEELEDDGRTIIRRYSRGDQVKEFRHQVSVVLDRGVYKDGREYERGDGVTWGGSFWIAQQKTTEKPDGGDCWRLAVKRGKDGKSAPVTAPVANGPIRVGNPAKEA >CP029562|5809416:5858276|5852735_5853656_-|QAZ46165.1|DBSCAN-SWA MDMTVSILLTIATAATPLLIAAIGELVVERSGVLNLGVEGMMVMGAVAGFGAGLLTGSPWIGLLAAIVVGALFSLLFAVMTLSLATNQVATGLSLTLLGLGLSGMLGTGFVGLPGERLPNLYIPVLTDLPVVGRLLFGQDPIFYISIALVFGVSWFLFKTRTGLTLRSIGDSHTSAHALGIQVIKYRYLAVMFGGACAGLAGGHLSLVYTPQWVENMTAGRGWIALALVVFSSWRPLRVLAGAYIFGAVWIGQLHAQAFGIPIPSQFLSSLPYLATIVVLVIISRNKRLTMMNTPASLGQAFVPDR |
58 | Sinorhizobium_phage(25.0%) | capsid,integrase,terminase,holin,head,tail,tRNA | attL 5819578:5819617|attR 5849065:5849104 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_3 |
6192542 : 6205593
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP029562|6192542:6205593|DBSCAN-SWA ATCAGGTCGTAAGCGCCTTGCCGTTCGCTGGCTTCTCTTCGGCTGCATCAGGGGCCTGGCGCAGCTTGAAGAGGATGAGCAACGGCGCGGCGATGAAGATCGACGAATAGGTTCCGAACACCACACCGAACAGCATTGCCAGGGTGAACGAGCGGATCACCTCGCCGCCGAACAAGACCAGAGCCAGGAGTGCCAGGATCGTGGTCACCGCGGTCAGTGTCGTTCTGGACAGCGTCTCGTTGATCGCCGTGTTCAGAATCGTCGGCATCGGCGTCTTCTTGTATTTGCGCAAATCTTCGCGAACACGGTCATAGACCACGATCGTATCGTTCAGCGAATAGCCGATGATCGTCAGGATCGCGGCGATCGACGACTGGTTGAACTCGATTCCGGTTACGCAGAAGAAGCCGATCGTCATGACGATATCGTGCACGGTCGCCACGATGGCACCGACGGCGAACTGCCATTCGAAGCGGAACCAGACATAGATCAGGATGCCCACCAGCGCGACCAGCATGCCGATCGTGCCTTGAGTAGCGAGTTCGCTCGACACGGTCGGGCCAACAACTTCGACACGGCGGAAGTCGTATTTGTCCTGCAGTTCAGCGCGCACCTTGTCGATGACACTCTGTTCGGCGTTGTCGCCTTCCTCCTGCGCCGCCACGCGGATCAGGACATCGGTCGGACTGCCGAACTGCTGCACCTGCACATCGCCGATGTTGAGTTCCGACAGCTTGCCGCGGATATCGGCGATGTCCGCCGGTCCGCTCTTCGACTGCACTTCAATCGTCGAGCCACCCTTGAAGTCGACGCCATAGTTGATGTCGACGGTCATGAAGAGGACGACGGACATGATCGACAGCAGGGTCGACAACGCGAAGGTCCAGCGGCGCAAGCCCATGAATGGGAACTTTGTGCCAGGCTTCACGAAAGTGACCGGAGCCCGTGGCAGTTCCTTCGGCTTGGCACGGCGCAGCCAGATCGAGACCAGCAGACGCGTGAAGGTGAAGGCGGTAAAGACCGTGGTCAGGATGCCGATGGCGAACGTGACGGCAAAGCCCTTCACCGGGCCGGAGCCGAGCCAGAACAGCACGATGGTGGCAATGAAGGAGGTGACGTTGGAATCGACGATCGTCGCCAGCGCTTTCGCAAAACCGGTATCGATCGCCTGGATGACGGAGCGGCCCTGTCTACGCTCCTCGCGGATACGTTCGTAGATCAGCACGTTGGAATCTACCGCCATGCCGATGGTGAGCACGATACCGGCAATACCTGGCATGGTCAGCGTGGCGCCGAGCAGCGACAGCAATCCGACGATCATGGCGACGTGCACGACCAATGCGATGTTGGCCAGGAAGCCCAGGAAACCGTAGGCGACGAACATGAAACCGACGACCAGGATCGAGCCGATGATGCCGGCTACCTTACCGGCCTGGATCGAGTCCTGCCCGAGGCCTGGGCCGACTGTGCGTTCTTCGACGACGGTCAGCGTTGCCGGCAGTGCGCCGGCGCGCAGCAGGACCGCGAGGTCATTGGCGCTCTGGGCGGTGAAATTGCCGGAGATCTGGCCGGTACCGCCAAGGATCGGCTCGCGAATCTGCGGCGCCGAAATCACCTGGTTGTCGAGAATGATGGCGAAGAGCTTGCCGACATTCTGCTGGGTCGCCTGGCCAAAACGTGTGGCGCCCTTGGAATCGAAGCGGAACGAGACGACCGGCTCGTTGGTCTGCGAGTTGAATGTCGCCTGAGCATCCGTCAGGTTCTCGCCGGAAACGATGACGCGGTTCTCGATCAGGTACGGAACGGGCGGATCATCCTTCGAGTAGAGCACGCTCGAGCCAGCTGGCGGACGGCCATTGATGGCCTCCTGCACCGGCATCGTCTGATCCACCATCTGGAAGGTCAGCTTCGCGGTCTGACCAAGAA
TCTCCTTCAGCCGCTGCGGATCCTGCAGGCCGGGCACCTGCACGACGATGCGATCGTCGCCCTGGCGTTGCACGATCGGTTCGGTGGTGCCGAGTTCGTTGACGCGTCGGTCGACGACCTCGATCGACTGCGTCAGCGCGGTCGAGGAACGATATTTGACACCGGCATCCGTCATGGTGAACCGGAGCAGACCCGGCTCAGGCTCGTCCTCGGTCCACTCGGTGATGGTGCCACCGCTGAACAATCCCGACGTTATCGGCGCCACCAGCCCCTTCAACGCTTCCTTTGCCGCGTCAACCTGGGCCACATCACGGATGCGGACCTGAACCACGCGGCCCGAACCGGACAGACCGGTGTAGCCGATCTTGGCGTCGCGCAGCAGGGTGCGGATTTCATCACGGCTGGCTTCGAGCCGATCCTTGATCAGGTCGTTCTGATCGATCTGGAGTAGAATATGAGAACCGCCCTGCAGATCGAGACCCAGGGTCATCTGCTTCTTCGGCACCCAACTCGGCAACTTGGCCAAGGTGTCGGCCGGGATGAGGTTGGGCGCGGCAAGGACGAACGTGACGGCAACAGCCAGCCAGATCAGGATTGCCTTCCAGCGCGAAAAATAGAGCATGTTCTACGATCCAGTGCGTCGATCCGGCGTGCCAGGCACGCCGCTGCGCTTAGTTCTTGGCGTTCTGGTTGGCGACCGGCTCACCCTTGACCCGTACGTCGGCGATGGTGCTGCGCAGTGCGGTGACCTTCAGGCCGTTGCCGAGATCAACCTCGAGTTCGTGGTCGTCGACCACCTTGGTGACCTTGCCGACAAACCCGCCGCCGGTAACGACCGTGTCGCCGCGGCGGATTGCGGCCAGCATCTCACCGCGCTTCTTCATCTGCGTACGCTGCGGGCGGATGATCAGAAAATACATGATCACGAAAATGAGGACAAACGGCAAAATGCTGATGAAGACATCGGGGGTGGCGCCAACGCCCTGGGCGTATGCCGGTGTCACGAACATCGAAAACTCCTGAGACGTAGAGGAAAGCCGTTATCCGGCCCACGAAATTTGCGCGGAATATAGTCGGTAAAGCCACTATTGCAACCATTCCCCGCCCTGCTCCCACCGCTGTTATGCCACCGTGTCGCTTTGGTCGCACCAATGCTTGCAAAAAGCCGAGGCGCGTGAGAGAACCGTACGTTACGGTACGCATGTAAAATGCGTACCTCATCGTGATACCGAGGACGTCCCTTGCCCCAGGAAAATATCGAAGCGCTGTCTGAAAAACTCGACCGGTTGATTGAAGCCGTTGCCCGGCTGGCGCCGCCGCCACCGCCGAAAACCGACCTTGCCGCAGCCGACTGTTTCGTCTGGCAGGCAGATCCCGGCTATCTTGAGCCGATCTCCAGGGTCAACCGGGTCGACATTGCCCTCATCCGCAGCGTCGATCGGGTGCGCGACATCCTGTTCGACAACACCGAACGCTTTGCCTTGGGCTATGCAGCCAACAACGTGTTGTTGTGGGGTGCGCGCGGCATGGGCAAATCCTCGCTCGTCAAAGCTGTCCATGCTGCGATCAACGCATCCGGCAAAGCCGACGGTGCGCTCAAGCTGATCGAAATCCATCGCGAGGACATCGACACCCTGCCGAAGCTGATGACGATGCTGAAGGCAGCCCCCTACCGGTTCATCCTGTTCTGCGATGACCTCTCTTTCGATCACGACGACACCTCCTACAAGTCGCTCAAGGCTGCGCTCGAAGGAGGCGTCGAGGGCCGTCCTCACAACGTCATCTTCTATGCAACCTCGAACCGCCGACACCTGTTGCCGCGCGACATGATCGACAACGAGCGCTCGACGGCGATCAATCCGTCGGAAGCCGTCGAGGAAAAAGTGTCATTGTCGGATCGGTTCGGGCTGTGGCTTGGCTTCCACAAATGCTCACAGGACGACTATCTCGACATGGTCAACGGCTATGTCCGCCATTTCGGACTGGACATCGATGCGGAC
GCGCTACGAGCGGAAGCCCTGGAATGGGCAACTACGCGCGGCAGCCGGTCCGGCCGCGTTGCGTGGCAGTTCACGCAGGATCTGGCCGGCCGGCTTGGCAAGCCGCTCTAGTCCTCTGATTTGCCAGCCATAACCGAAAAAGCCCGCTCCTTGGGGAGCGGGCCAAGCTGCCGGGCCGGACTGGAGGTCGACGACCCGTAATCTTCTTGTTCTTCTATTCGAGCCCGACTTACTCGAGGAAAGTCTGCGGATCGACCGGCGAGGAATTCTTGCGCACCTCGAAATGCAGCTTTGGCGCATCGGTGGTGCCGCTCATACCGGAGCGGGCGATCTCCTGGCCACGCTTGACCTTCTGGCCGCGCTGCACCTCCAGCGAAGAAGCATGACCGTAAACCGTGACAAGGCCGTCTTCGTGGCGCACCAGCACCGTGTTGCCGAATTCCTTCAACCCATCACCAGCATAGATGACGACGCCGTTCTCGGCGGCCTTGATTGGCGTGCCTTCCGGAACCGCGATGTCGATACCGCCTGCCCCCGACTTGTAGCCGGAAATGACGCGACCGCGCACCGGCCAGCGCATCTTGCCAATCCCAGTCGAGTCAGGCGCTTCCGCAGCGTCGCTCTCGGCTTGCTGAATGACCTTCTGGTCCTTCTTTGGCGGCGTGTAGGTGGCGAGGGTTGCGCTCGGCTTCTCCGGCGCGGGGATCGATGCAGTCTTGACCGCATCGACCGGTGCGGCAGGCTTTGCAGCCGCCACGGTCGCAGCAGGCGCAGCGCCAGCTGCCGGGACCTTCAGCGTCTGACCGATGCGCAGCAGGCCGTCCTGCATACTGTTGGCCTGTTTCAGCGCCGTGGTCGAAACGCCGGTCTTACGAGCGATGGAAGACAGCGTATCGCCCTGCTGGACGGTGTAGCCGCCTGCGCCAGCCTTCGGCTGCACCGCAGCGACCTGAGCCGGCTTTGCCTTCGGATCGCTGGCTGCGGCGGATGCATCGACCTGAGCAGCGGCCTTGCTCTCCTTGACCTTCGGCTGCTGCGGCAACACGGCCACCTTTTCTACAGGCGCGTTGCCGGGCAGCGGCTTGTTGGCAGGTCCGTTGGTGCCGCGCGACGACTTCGCATCGGCGACGTTCGGATCGTTGTCGGGCGCGGAAACCGGCGCCTTGTTGGTGTAGACGTAGGTCGGGATAACCAGCTTCTGACCGCTCTTCAGACCATCGGCGGAGGCCAGGCCGTTGGTCTTCATGATGACATCCGCCGGCACACCGTAACGGCGTGAGAGGTTGTAGACCGTCTCACCTTCGCGTGCAGTGATCTGCGTGCCTCCGGCGCGTGACCAGCCCTTCACATCCTCGGCGGAAGCCTGCGCGCTGGCAATGGTGGAATTGACCGGCGCCTGCAGCTGATTGATGCCGCCAGTAACCGTCTGGTCGACCTTGGCCACCCTGGCCTGCCCCTGCGCGATCACCGGCTCGGCAGCGGCGTGTGCCTTGTTGACCACCGGAGCGGCGGCCTGTGCGACGGTCGGCAGCGGCTGTGACGAAACCGGCTGCAGGCTGGAGCGGGTCACCGAAGTGCGCGAATAATTGACGTTCGACGCCTGATACTGCGCATCGCCAGGGTATGGCTGGGCCGCCGCCTGCTTGGAGATGATCGAACGCTGATTGTCGGTCGAAGCTGTCGACATCGTATCGACACCACCGAAGCGAGCAACCTGGGAGCTGCACCCAGAAACCGCACCGATGACCAGCAACATGGCGCAGCCACGCGTCAGATTGCGCTTGTTTGCCTTCAAGGATTTGAGCCGCATCGCAATAACCCGCACTTACCCGGTACTCACTGACCCGGATTAAAGCGCGTTAATGTTACTGGTCGGTTAAGCCGTTAGAATCCGTCGAAATTTTTCTTAAAATGTTTGTCCGGATCGGGGCAGCGCGGGGGGCTGGTCAGATGAAGGCCGCAGCGCCCGGCAGGATAGGCTGGAAGCGTACCGTACCGAT
GTCCTCCCGCTCGAAGCGGCTGCCAACCTTGAGCAGCTTGGCCAGAACCTGCTCTCCCTCGTCCGGCCCGATCGGCGCGATCACGGTACCGCCGCTGGAGAGCTGGTCGAGCAGAAAACGCGGCAGCGAATCGAAGGCTGCCCAGACAATGATCCGGTCGAATGGTCCTTCGTTGACGAGACCGTTCGAGCCATCCGTGTGGCGCGCGAACACATTGGCGATGCCCAGCGCCTCGAAGCGCTGCTTGGCAAGCTCGATAAGCGACTTGTAGCGATCGACAGTGACCACCCGCGTGGCCAACCGCGACATGACCGCCGCGGTGAAACCTGAGCCCGTGCCGATCTCCAGCACGCGATTGCCAGGCTCGATGTTGAGCGCAGTGATGACCGCAGCCTGGAGGTCTGCCCCTTCCATCGCCTCACCGCATTCGATCGGCAGCATGCGGTCGGTCCAGGCAATGGAATGGAACTGGCCCGAGATGAACGCATGTCGCGGCGTCGCCTCGAAGGCAGCGATCAACGCTTTCGGGACAGCACCTTTGCCACGCAGGCGAAGCAGAAAGGCGGCAAACCCTTCGCGTTCGTCGGAACCGATCATGCAAGCGCCTTCGTCAGTTGGTCTCGAATTTCATAAGCGGTCAGATCGAGGTGAAGCGGTGTCACCGATACCAGCTTGTTGCGCATGGCGTGCAGGTCGGTGCCAGTCTTGCCTTCCGCCGGCTCGCGGCCGAAACGCAGCCAGTAATACGGCAGGCCGCGGCCGTCGCGGCGCTCATCAACCCATAACGAGTGCACCAGCTTGCCTTGCGACGTCACCACGGTGCCCTCGACCTCTTCCGGCGTACAATTCGGAAAATTGACGTTGAGCAGCACGCCTTCCGGCAAAGGTGTCGCAACAAGTTTCTTCAACAAAGCCGGGGCCAGCGCCTCGGTCGTCTCATAGGGAACGACCCGGTCCTCACCGACATATGAATAAGCCTGGCTGATGGCGATGGAACGCACGCCGAGCAGCGTGCCCTCCATGGCTCCGGCGACGGTGCCCGAATAGGTGACATCGTCGGCGATGTTGGCGCCCGAATTGACCCCCGACAGAATGAGATCCGGCAGCTCGGGCAGAATCTTTCGCGCGCCCATGATGACGCAGTCGGTCGGCGTACCGCGCACGGCATAGTGCTTGTCGCCGATCTTGCGCAGGCGCAGCGGTTCGGAAATCGACAGCGAATGCGCATAGCCGGACTGGTCTGTCTCGGGGGCCACCACCCACACATCGTCGGAAAGCGTGCGGGCGATGCGCTCGAGCGACGCCAGTCCCTCGGCATGGATGCCGTCGTCGTTGGTCAGAAGAATACGCATTAGTTCGATTCGATTCTTTCCAAGCCGCCCATGTAGGGCTTGAGGACTTCAGGAATGGTTACGCTGCCGTCCTCGTTCTGGTAATTTTCGATGACGGCAATCAGTGCGCGCCCGACAGCGACGCCCGATCCGTTCAGCGTGTGGACGAACAGGTTGCCCTTGCCTTCACGATCCTTGTAGCGGGCGTTCATGCGCCTGGCCTGGAAATCACCGCAGACCGAACAGGACGAAATCTCGCGATAGGCGTTCTGACCAGGCAGCCAGACCTCGATGTCGAAGGTCTTGCGCGCCCCGAAACCCATGTCGCCGGTGCACAGCGCCACGGTCCGGAACGGCAGTTCAAGCCGCTTCAGCACTTCCTCAGCGCATTGCGTCATGCGCTCGTGCTCGGCGATCGACGTCGCCTGATCGGTGATCGAAACCAGTTCGACCTTGTAGAACTGATGCTGGCGCAGCATGCCACGCGTGTCGCGACCGGCTGAACCGGCCTCCGAACGGAAGCAGGGCGTCAGTGCTGTGAGGCGCAGTGGCAGCCTGTCATGCGCGACGATCTCCTCACGCACCAGATTGGTCAGCGGCACTTCCGCGGTCGGAATGAGGCCAAGCCTGCTCTCACCATGCGGGGTGAAGAACAGATCCTCTTCAAAT
TTCGGCAGCTGGTTGGTGCCGAAAAGCACTTCGTCACGCACCAGCAGCGGTGGCTGCACCTCGGTGTAACCGTGCTCGGTCGTGTGCAGGTCGAGCATGAACTGGCCGAGCGCACGATCAAGCCTCGCCAACTGCCCGCGCAGAATGGTGAAGCGCGAACCCGACAGCTTGGCGGCGCGCTCGAAATCCATCAGGCCCAGCGCCTCGCCGACCTCGAAATGCTCCTTCACCCAGTTCGGCCGTATCGCGACCTTGCCGATCTTGTGCACCTCGACGTTGTCGTGCTCGTCCTTGCCGACCGGCACGTCATCAAGCGGCACGTTGGGTATGACGGCAAGGATACCCTCGAGAGCCTCGTCGAGCCGACGTTCCTCGGCTTCACCACCCTGAATGAAGCTTTTGATCTCGGCGACTTCAGCCTTCAGCTTTTCAGCCAGCTCGGCATCCTTCGTGCTGAGCGCCTTGCCGATCTCCTTGGACGCCGAATTGCGGCGCTCCTGTGCTTCCTGCAACCGACCGAGATGGGCGCGACGGGCTTCGTCCCTGGCGACAATGTCGTCAACCGTGGACTGAGCGTCGGCCTGCGACCACGAGCGCTTCACCAGCGCTTCAACCAAGGCCTGCGGGTTTTCGCGAATCCATCTGATATCGAGCATGTCCATCCATCCTGAAGAGGGAGCCGTGGGGTAACTCTCCTCGGCGCAGGATGCAAGGGCGGGAGGGCGGGGCGCCAGCCTCAGCGCCTGTATATGCCGATCTCGTCCATCAGATTGGGAGTGGTTGCCTTGTCCGCCGCGCCGCGGTCACGGCCAACAAGTCCATGGACATAGACCTCCAAAGCCTTGTTGAGCTTTTCGTTGGCTTTCCCGTCCGGGTCCGCAATCGCATCGATCAGTTCTTCGAGGGAGATCTTGATCTCATTCAACCCGAGTTTTTTTTCGATATCCTTGATGATCAGGTAAGCGTTCGGCTGCCGCTTCAATTCCGCGAAAGCCTGCCGCAAGGCCGCACCATATTCTTTCAAGGAAGCATAGTCGGATTGCTTAACCCCGAGCTCCTGGCCCGCACGCTCGATCAGGGAGACTTTCAGCGCCGTCACATCGAGGCTGTTCACGCTGAACATGGCCTCGACGGCCTTGGCCCGAACTTGCGTTGGCAATTCACCCGGCCCTGCGGCGCCTTTCCTTGTCGGATCGTGATGAAGTGCGTCAAGGATCTTGTCGCCGGCTGTCGCGCGTTTTTCCGGCTGCGGCGCAATGGGAGCCATCTGCAGGAAAAGCGACGAAACCTGGTTGAGTGCAATGGTCGAAAGCATGGTGATCTCGCTGACGCAGCCGCTCGTCATGAAGCGGAGCGCCGGAAATCGCGGCTCGAGACGTCATGCCTCGTGCAACTGCATCATTCAGCGTGCCGCAGGATGGAGGAATTTTACTAACGAAATCTCACCGATCGCCGTGACCAGAAGCACACGGGCAATGCGGCAGGCCCATCAGGATTGCTGCGGGGAATCCGTAGATGCCTCGGAGGGTTGTTCCTCGGCGGCAGCAGCTGCTTCCTCTTTCTCACGCGCCAGCTTTTCGCGCTTCTGGCTCTTCTCGATCATGCGCGAGGTTATGATCGCCACCTCGTACAAGATGATCGTGGGCAGAGCCAAACCGATCTGGCTGAGCGGATCGGGTGGCGTGAGCACCGCGGCCACCACAAAAGCGATGACGATTGCCCAACGGCGCTTGTCGACCAGCGCTTGCGAAGACAGCAGGCCGACACGCGTCATCAGGCTGGTCACCACCGGCAACTGGAAAACCAGTCCGAAGGAGAAGATCAGCGTCATGATGAGGCTGAGATATTCCGAAACCTTGGGCAGCAGCGAAATCTGCACCTCACTGTCGGTGCCCGCCTGCTGCATGGCGAGGAAGAACCACATCACCATCGGCGTGAAGAAGAAATAGACAAGTGCTGCGCCCATCAGGAACAGGATCGGCGAAGCGATCAGGAACGGC
AGAAAGGCAGCACGTTCGTTCTTGTAGAGGCCGGGAGCGATGAACTTGTAGATCTGCGTCGCAATCAGCGGAAACGCGATGACCATGCCGCCGAACATGGCGAGCTTGACCTGCGTAAAAAAGAACTCCTGTGGCGCGGTATAGATCAACTCGACCTTGTGTGGGTCGAGGCCGGCCCACTGCGTCGCCCATTTGAATGGCACGACGAGCAGGTTGAACAGCTTCTTGGCGAAGAAGAAGCAGACCAGGAAGGCGACAAAAAAGCCTGCAATCGACCAGATGAGCCGGCGGCGCAGCTCGATCAGATGCTCGATGAGCGGAGCCGACGACTTGTCGATCTCATCCTGTTCCGAAACGCTCACTTGGCGGTCCTCACCGTCTTCTTTGCAGGTGCTTTTGCAGTGGATGGCGCTTTCGCGGACTTCACTGGCGCCTTGGCTGCAGGAGCTTTCGCAACAGGTTTGGTTCGAGCTTTCGAACCGGCTGCCTTACGTGCTGCCTTCGTCGCGACAGGCGCGGCCGCAGCAGCAACAGCCGCTTTCACCGGTTTGGCGGCAGGACTGCTTTTTGCAGGTGTTGCAGAAGCCGCGGTGGCAACAGGCTCAGCAGCCGGAACGATAGGGGTTGCCGGCATGGTCTCCGGACCGTTCACGCCGGGCAGCGCGGTAGCGCCATTCTTCAGAGGTTCCGTCGGCTGCGGCTCAGCGGCAGCCGGTGCCGCCGGATCTTCCGCCGGCTTCGGTTTCATCATGTTGTCGACACCGGCGCGCACGTCGGCAGCGGCTTGCTCGAATGGATTGAGCTGCTTGCGGATTTGCGCAGCCGGATTGAGGCTGCGCAGCTCATCGACCGACTTCTTGACGTCGTCGAGTTCGGCTTCCTTCAGCGCTTCGTTGAACTGCTTCTGGAAATCACCCGCCATGGCGCGCAATTTGGCCACCGTGCGGCCGAAGGTACGCAACATGTTCGGCAAATCCTTCGGGCCGACAACCACGATCATGACGATCGCGATCACCAGCATCTCGGTCCAGCCGACTTCGAACATGACTATCGGTTCCGACTAGAGCAATTCCAGCAAAAGTGCGCAGCGGTTTTGCGTCCGGAATTGCGTAGAAACAAAGAAGTAAAGCGCTTCCGCGTTTCCGTAAAACGGAAACGTCCAGTGCCTGACGGTGTCTCCGGAAAGCGCCGGCTCAGCTCTTGCCAGCCTTTTCCTTCATGTTCGAGACGGTCTCGTCGGCGCGGTGCTCGACGGTGCGGGAATCGTCGGCTTCGTCGTCGGCCATGCCCTTCTTGAAGCTCTTGATGCCCTTGGCCATGTCGCCCATCAGCTCCGGAATCTTGCCCCGGCCAAACACCAGGAGCACGATGACCAGCACGATCAGCCAGTGCCAAATCGAAAAGGAACCCATTGTTTTTCTCTCTTGAACTGTTTCCCTGCCGCTGATCTATGCGTTTTCGCATCCGGATTCAAACGGAACCGCGACAGCTTCTATCACGTCCGGCCAATTGTCAGCCGGATTTGTTGAGATCATCCGAAAAATAGGCCGCTACCAAACAGCGCATGAACCGGTTTGCTGGCGCTGACCAATCACTCTTCCGTGACGCGCGGCGTGAGGAGACCAAGCTCTTCGAGATCGATATCGGTCAGCGGATCCTCGTCCTCGGTCAATTCGTCGGGATCGGCAGGCGGCAGCGGCACCGAGAAGTTGGCTGGCATGCGGGTGGAGAGCAGACCGGCCCCCTTCAGTTCTTCGATACCCGGCAGATCGCGGATTTCCTCCAGCGCGAAATGGTCGAGGAACCTGTCGGTGGTGCCATAAGTGACTGGTCGGCCCGGTGTCTTGCGTCGGCCACGCATCCGGATCCACTCCGTCTCCAGCAGCGTGTCCAGCGTGCCCTTCGAGGTTTCGACGCCGCGGATGTCCTCGATTTCAGCACGGGTGACCGGCTGGTGGTACGCGATGATGGCCAGCACCTCGAGCGCCGCGCGAGAGAGC
TTCTTCTGCTGAACCGAATCCCGGCTCATCAGGAAAGCGAGATCGCCCGCAGTGCGGAAGGCCCAGCTATCGCCGACGCGCACAAGGTTTACACCGCGGCGCGAATAGATGTCCTGCAGTTCGACCATCGCTGCCGCCACGTCGACACCGTCGGGCAGGCGCTGGGCAAGCTGCCTCTCAGTCACAGGCTCGGCGCTGGCAAAGACAATGGCCTCGGCCATACGCACGGCCTCGGCCTGATGCAGACGTTCCGCCGGATTCTTTACCGAGACTGCTTCGTCCTCGTCTTCCACCGTGAAGGGGATGACCGAAGCGTTGCTGCTTTCGTTCATTGCACGACCTCGACTGCCTTCGCGCGCTGACGCCCGCGCAGATAAAGCGGGGCGAACACTTCATCCTGGCGCATTTCCATGCTGCCTTCGCGCACGAGCTCCAATGTCGCCGCGAAGGAAGACGCCATGGCAGTGCGCTTTTCAGCCGGGCTCGCCAGATATTCAATCAGGAAGCTGTCCAGCGCTGTCCACTCCGCCGATGTACCCATCAACCGGGCAAGGATGGTGCGTGCTTCCTTGAGCGACCAGACCCCGCGCTTGGCGATGCGTACATTCGAGATCGCCTGACGCTGACGCTGGCCGGCATAGGCGGTGAGCAGATCGTAGAGCGTCGCAGAATACTCGTTGCGCTTCTCGACGATGACCATCTCTGGAATGCCGCGCTGGAAAACATCGCGGCCGAGCCGGTTGCGGTTGACGAGACGTGCGGCGGCATCGCGCATGGCCTCCAGGCGACGCAGGCGGAATTGCAGCACCGCTGCCATTTCCTCACCGCTCTCGCCCTCCTCGCCAGGCTGCTTGGGAATAAGCAGCTTGGATTTCAGGAAGGCCAGCCATGCAGCCATAACCAGATAGTCAGCAGCGAGTTCGAGCCGCAGCGCCCTCACCTTCTCGACGAAAGCAAGGTATTGCTCCGCCAGCGCCAGGATCGAAATGCGCGCGAGATCGACCTTCTGGTTGCGGGCGAGATGGAGGAGAAGATCGAGCGGTCCCTCGAAGCCGGCGACGTCAACGACCAGCTCCGGGTCGCCGGTGACGCGGTCGTCGTCACCGTCGGCCCAGAGGCGGTCCAT
Protein sequences of DBSCAN-SWA_3 >CP029562|6192542:6205593|6204058_6204799_-|QAZ46424.1|DBSCAN-SWA MNESSNASVIPFTVEDEDEAVSVKNPAERLHQAEAVRMAEAIVFASAEPVTERQLAQRLPDGVDVAAAMVELQDIYSRRGVNLVRVGDSWAFRTAGDLAFLMSRDSVQQKKLSRAALEVLAIIAYHQPVTRAEIEDIRGVETSKGTLDTLLETEWIRMRGRRKTPGRPVTYGTTDRFLDHFALEEIRDLPGIEELKGAGLLSTRMPANFSVPLPPADPDELTEDEDPLTDIDLEELGLLTPRVTEE >CP029562|6192542:6205593|6201203_6201812_-|QAZ46420.1|DBSCAN-SWA MTSGCVSEITMLSTIALNQVSSLFLQMAPIAPQPEKRATAGDKILDALHHDPTRKGAAGPGELPTQVRAKAVEAMFSVNSLDVTALKVSLIERAGQELGVKQSDYASLKEYGAALRQAFAELKRQPNAYLIIKDIEKKLGLNEIKISLEELIDAIADPDGKANEKLNKALEVYVHGLVGRDRGAADKATTPNLMDEIGIYRR >CP029562|6192542:6205593|6203660_6203879_-|QAZ46423.1|DBSCAN-SWA MGSFSIWHWLIVLVIVLLVFGRGKIPELMGDMAKGIKSFKKGMADDEADDSRTVEHRADETVSNMKEKAGKS >CP029562|6192542:6205593|6195708_6196578_+|QAZ46416.1|DBSCAN-SWA MPQENIEALSEKLDRLIEAVARLAPPPPPKTDLAAADCFVWQADPGYLEPISRVNRVDIALIRSVDRVRDILFDNTERFALGYAANNVLLWGARGMGKSSLVKAVHAAINASGKADGALKLIEIHREDIDTLPKLMTMLKAAPYRFILFCDDLSFDHDDTSYKSLKAALEGGVEGRPHNVIFYATSNRRHLLPRDMIDNERSTAINPSEAVEEKVSLSDRFGLWLGFHKCSQDDYLDMVNGYVRHFGLDIDADALRAEALEWATTRGSRSGRVAWQFTQDLAGRLGKPL >CP029562|6192542:6205593|6199818_6201123_-|QAZ47608.1|tRNA|DBSCAN-SWA MLDIRWIRENPQALVEALVKRSWSQADAQSTVDDIVARDEARRAHLGRLQEAQERRNSASKEIGKALSTKDAELAEKLKAEVAEIKSFIQGGEAEERRLDEALEGILAVIPNVPLDDVPVGKDEHDNVEVHKIGKVAIRPNWVKEHFEVGEALGLMDFERAAKLSGSRFTILRGQLARLDRALGQFMLDLHTTEHGYTEVQPPLLVRDEVLFGTNQLPKFEEDLFFTPHGESRLGLIPTAEVPLTNLVREEIVAHDRLPLRLTALTPCFRSEAGSAGRDTRGMLRQHQFYKVELVSITDQATSIAEHERMTQCAEEVLKRLELPFRTVALCTGDMGFGARKTFDIEVWLPGQNAYREISSCSVCGDFQARRMNARYKDREGKGNLFVHTLNGSGVAVGRALIAVIENYQNEDGSVTIPEVLKPYMGGLERIESN >CP029562|6192542:6205593|6192542_6195089_-|QAZ46414.1|DBSCAN-SWA 
MLYFSRWKAILIWLAVAVTFVLAAPNLIPADTLAKLPSWVPKKQMTLGLDLQGGSHILLQIDQNDLIKDRLEASRDEIRTLLRDAKIGYTGLSGSGRVVQVRIRDVAQVDAAKEALKGLVAPITSGLFSGGTITEWTEDEPEPGLLRFTMTDAGVKYRSSTALTQSIEVVDRRVNELGTTEPIVQRQGDDRIVVQVPGLQDPQRLKEILGQTAKLTFQMVDQTMPVQEAINGRPPAGSSVLYSKDDPPVPYLIENRVIVSGENLTDAQATFNSQTNEPVVSFRFDSKGATRFGQATQQNVGKLFAIILDNQVISAPQIREPILGGTGQISGNFTAQSANDLAVLLRAGALPATLTVVEERTVGPGLGQDSIQAGKVAGIIGSILVVGFMFVAYGFLGFLANIALVVHVAMIVGLLSLLGATLTMPGIAGIVLTIGMAVDSNVLIYERIREERRQGRSVIQAIDTGFAKALATIVDSNVTSFIATIVLFWLGSGPVKGFAVTFAIGILTTVFTAFTFTRLLVSIWLRRAKPKELPRAPVTFVKPGTKFPFMGLRRWTFALSTLLSIMSVVLFMTVDINYGVDFKGGSTIEVQSKSGPADIADIRGKLSELNIGDVQVQQFGSPTDVLIRVAAQEEGDNAEQSVIDKVRAELQDKYDFRRVEVVGPTVSSELATQGTIGMLVALVGILIYVWFRFEWQFAVGAIVATVHDIVMTIGFFCVTGIEFNQSSIAAILTIIGYSLNDTIVVYDRVREDLRKYKKTPMPTILNTAINETLSRTTLTAVTTILALLALVLFGGEVIRSFTLAMLFGVVFGTYSSIFIAAPLLILFKLRQAPDAAEEKPANGKALTT >CP029562|6192542:6205593|6196696_6198277_-|QAZ46417.1|DBSCAN-SWA MRLKSLKANKRNLTRGCAMLLVIGAVSGCSSQVARFGGVDTMSTASTDNQRSIISKQAAAQPYPGDAQYQASNVNYSRTSVTRSSLQPVSSQPLPTVAQAAAPVVNKAHAAAEPVIAQGQARVAKVDQTVTGGINQLQAPVNSTIASAQASAEDVKGWSRAGGTQITAREGETVYNLSRRYGVPADVIMKTNGLASADGLKSGQKLVIPTYVYTNKAPVSAPDNDPNVADAKSSRGTNGPANKPLPGNAPVEKVAVLPQQPKVKESKAAAQVDASAAASDPKAKPAQVAAVQPKAGAGGYTVQQGDTLSSIARKTGVSTTALKQANSMQDGLLRIGQTLKVPAAGAAPAATVAAAKPAAPVDAVKTASIPAPEKPSATLATYTPPKKDQKVIQQAESDAAEAPDSTGIGKMRWPVRGRVISGYKSGAGGIDIAVPEGTPIKAAENGVVIYAGDGLKEFGNTVLVRHEDGLVTVYGHASSLEVQRGQKVKRGQEIARSGMSGTTDAPKLHFEVRKNSSPVDPQTFLE >CP029562|6192542:6205593|6201956_6202829_-|QAZ46421.1|DBSCAN-SWA MSVSEQDEIDKSSAPLIEHLIELRRRLIWSIAGFFVAFLVCFFFAKKLFNLLVVPFKWATQWAGLDPHKVELIYTAPQEFFFTQVKLAMFGGMVIAFPLIATQIYKFIAPGLYKNERAAFLPFLIASPILFLMGAALVYFFFTPMVMWFFLAMQQAGTDSEVQISLLPKVSEYLSLIMTLIFSFGLVFQLPVVTSLMTRVGLLSSQALVDKRRWAIVIAFVVAAVLTPPDPLSQIGLALPTIILYEVAIITSRMIEKSQKREKLAREKEEAAAAAEEQPSEASTDSPQQS >CP029562|6192542:6205593|6202825_6203512_-|QAZ46422.1|DBSCAN-SWA 
MFEVGWTEMLVIAIVMIVVVGPKDLPNMLRTFGRTVAKLRAMAGDFQKQFNEALKEAELDDVKKSVDELRSLNPAAQIRKQLNPFEQAAADVRAGVDNMMKPKPAEDPAAPAAAEPQPTEPLKNGATALPGVNGPETMPATPIVPAAEPVATAASATPAKSSPAAKPVKAAVAAAAAPVATKAARKAAGSKARTKPVAKAPAAKAPVKSAKAPSTAKAPAKKTVRTAK >CP029562|6192542:6205593|6195138_6195477_-|QAZ46415.1|DBSCAN-SWA MFVTPAYAQGVGATPDVFISILPFVLIFVIMYFLIIRPQRTQMKKRGEMLAAIRRGDTVVTGGGFVGKVTKVVDDHELEVDLGNGLKVTALRSTIADVRVKGEPVANQNAKN >CP029562|6192542:6205593|6204795_6205593_-|QAZ47609.1|DBSCAN-SWA MDRLWADGDDDRVTGDPELVVDVAGFEGPLDLLLHLARNQKVDLARISILALAEQYLAFVEKVRALRLELAADYLVMAAWLAFLKSKLLIPKQPGEEGESGEEMAAVLQFRLRRLEAMRDAAARLVNRNRLGRDVFQRGIPEMVIVEKRNEYSATLYDLLTAYAGQRQRQAISNVRIAKRGVWSLKEARTILARLMGTSAEWTALDSFLIEYLASPAEKRTAMASSFAATLELVREGSMEMRQDEVFAPLYLRGRQRAKAVEVVQ >CP029562|6192542:6205593|6199060_6199819_-|QAZ46419.1|DBSCAN-SWA MRILLTNDDGIHAEGLASLERIARTLSDDVWVVAPETDQSGYAHSLSISEPLRLRKIGDKHYAVRGTPTDCVIMGARKILPELPDLILSGVNSGANIADDVTYSGTVAGAMEGTLLGVRSIAISQAYSYVGEDRVVPYETTEALAPALLKKLVATPLPEGVLLNVNFPNCTPEEVEGTVVTSQGKLVHSLWVDERRDGRGLPYYWLRFGREPAEGKTGTDLHAMRNKLVSVTPLHLDLTAYEIRDQLTKALA >CP029562|6192542:6205593|6198413_6199064_-|QAZ46418.1|DBSCAN-SWA MIGSDEREGFAAFLLRLRGKGAVPKALIAAFEATPRHAFISGQFHSIAWTDRMLPIECGEAMEGADLQAAVITALNIEPGNRVLEIGTGSGFTAAVMSRLATRVVTVDRYKSLIELAKQRFEALGIANVFARHTDGSNGLVNEGPFDRIIVWAAFDSLPRFLLDQLSSGGTVIAPIGPDEGEQVLAKLLKVGSRFEREDIGTVRFQPILPGAAAFI |
13 | uncultured_Mediterranean_phage(90.0%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|