Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_023036 | Mycolicibacterium neoaurum VKM Ac-1815D, complete sequence | 3 crisprs | WYL,cas3,csa3,cas4,DEDDh,DinG | 0 | 1 | 2 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_023036_1 | 1131556-1131638 | Orphan |
NA
Consensus repeat of NC_023036_1
|
1 spacer
spacers of NC_023036_1
>1.1|1131579|37|NC_023036|CRISPRCasFinder GGCTCCGACGGCCGAGGAGACCGCCGTCGACGCGCCC |
CRISPR arrays and Neighbor proteins around NC_023036_1
The CRISPR arrays of NC_023036_1 >merge|NC_023036|1|1131556-1131638|CRISPRCasFinder GTCCTCGACGAGCCGGAAGCCGAGGCTCCGACGGCCGAGGAGACCGCCGTCGACGCGCCCGTCCTCGACGAGCCGGAAGCCGA >NC_023036|1|1|1131556-1131638|CRISPRCasFinder GTCCTCGACGAGCCGGAAGCCGA GGCTCCGACGGCCGAGGAGACCGCCGTCGACGCGCCC GTCCTCGACGAGCCGGAAGCCGA
>NC_023036.2|WP_023985265.1|1130353_1131334_+|Ppx/GppA-family-phosphatase MRLGVLDVGSNTVHLLVVDARRGGHPTPMSSTKAALRLAEAIDSTGKLTRKGADKLVSTVDEFAKIATSSGCSELMAFATSAVRDATNSEAVLARVQAEAGVSLRVLSGVDESRLTFLAVRRWYGWSAGRIINIDIGGGSLELSSGVDEEPDVALSLPLGAGRMTREWLAEDPPGRRRVAMLRDWLSTELSEAGSVIQSAGTPDLAVATSKTFRSLARLTGAAPSGAGPRVKRTLTASGLRQLIAFISRMTTADRAELEGVSAERAPQIVAGALVAEASMKALGVETVEICPWALREGLILRKLDSEADGTALVETIPATPEGKRR >NC_023036.2|WP_019513802.1|1129515_1130280_-|hypothetical-protein MGGEVDLDFAREWVEFYDPEDSNHVISADMTWLLSRWTCVFGTPACQGTVAGRPDDGCCSHGAFLSDDDDRAMLDDAVTKLTDEDWQFRSKGLGRKGYLEDDEYDGKPNQRTRKYKGACIFLNRPGFAGGIGCALHSKALKLGVEPLTMKPEVCWQLPIRRTQDWVTRPDGSEILKTVITEYDRRGWGEGGADLHWYCTGDPAAHVGAKPVWQSYAPELTELLGEKAYAELAAMCRRRGQLGLIAVHPATRAAE >NC_023036.2|WP_019513801.1|1128812_1129499_+|response-regulator-transcription-factor MTSVLIVEDEESLADPLAFLLRKEGFEATVVSDGPSALAEFERAGADIVLLDLMLPGMSGTDVCKQLRSRSSVPVIMVTARDSEIDKVVGLELGADDYVTKPYSARELIARIRAVLRRGADNDDAGIADGVLEAGPVRMDVERHVVSVNGEQITLPLKEFDLLEYLMRNSGRVLTRGQLIDRVWGADYVGDTKTLDVHVKRLRSKIESDPASPVHLVTVRGLGYKLEG >NC_023036.2|WP_019513800.1|1127555_1128731_+|two-component-sensor-histidine-kinase MSVGSALLLAAALTVLALGIGVAVGMVAMRRITARRTERDIEEGGGITVSQMLSHIAAMSPMGIVVVDTFRDVVYMNDQAIELGLVRDRLLDDRAWQAVQRCLATGADVDIDLSPRKRQKSGRSGLAVRGHVRLLVEGAHQFAVVFVGDQSEQARMEATRRDFVANVSHELKTPVGAMGVLAEAMMASTEDPDTVRRFAEKIIIESVRLADMIGELIELSRLQGAEPLPDLESVDVDDVVAEAVSRYKVAADSAHIKITTDAPTGFRVLGDERLLVTAIANLVSNAIAYSPDGSDVSISRRRRGDEIEIAVTDRGIGIARADQERVFERFFRVDKARSRATGGTGLGLAIVKHVAANHNGSIRLWSQPGTGSTFTLSIPAIPEGRSADDEE >NC_023036.2|WP_019513799.1|1126692_1127436_+|phosphoglyceromutase MPTLILLRHGESDWNQKNLFTGWVDVDLTDKGRAEAVRGGKLLAEQGVLPDVLYTSLLRRAITTANLALDAADRHWIPVHRDWRLNERHYGALQGLDKAATKEKYGEEQFMAWRRSYDTPPPPIEKGSEFSQDADPRYAGIPGGAPLTECLADVVERFVPYFEQAIVPDLKAGKTVLIAAHGNSLRALVKYLDGMSDADIVGLNIPTGIPLLYELDENLKPTVAGGKYLDPEAAAAGAAAVAAQGAK >NC_023036.2|WP_019513798.1|1126174_1126669_+|YbjN-domain-containing-protein 
MSVTRIIEETLAANDLEYTQHKGVKGGLPGLVVALPGERRLKTNTILSVGEHSVRVEAFVCRRPDENFESVYKFLLKRNRRLYGVAYTLDNLGDIYLVGWMANSSVTADEIDRVLGQVLEAVDSDFNTLLELGFRSSIQKEWEWRVARGESLKNLEAFEHLIED >NC_023036.2|WP_019513797.1|1124858_1126178_+|D-inositol-3-phosphate-glycosyltransferase MRVVPEPSGLTEARRVAVLSVHTSPLSQPGTGDAGGMNVYVLQTALELARRGVEVEIFTRATSSLDEPVVQVAPGVLVRNVVAGPFEGLDKNDLPTQLCAFTAGVLRAEATHEPGYYDIVHSHYWLSGQVGWLASDRWAVPLVHTAHTLAAVKNAALADGDTPEPVLRSVGEQQVVDEADRLIVNTEIEARQLVSLHHADPASIDVVHPGVDLSVFTPGSRRHARAALGLAEDDKVVAFVGRIQPLKAPDVLLRAAAKVADLRVLIAGGPSGSGMDTPNGLVRLAAELGMTDRVTFLPPQSRDELVGIYRAADMVAVPSYNESFGLVAVEAQACGTPVVAAAVGGLPVAVRDGVTGALVDGHDAGDWATALRSVLAGDADRLSAAAVAHAATFSWAHTVDGLMDSYGRAITDYRSRHPRSAAPTRRTGRRFALRRGVRA >NC_023036.2|WP_110806960.1|1123530_1124814_+|ROK-family-protein MVSTATVVRQTPAAHAKRALLARHHIVAPSLKVAEVAAASVFGAARQRGPIARDAIARVTGLSIATVNRQVTALLDAGVLRERADLAVSGAIGRPRIPVEVNHEPFLTLGLHIGAKTTSIVATDLFGRTLDVVETPTPRGSQSAALAALAGSASRYLSRWHRRRPLWVGVASGGVVDSTSGYLDHPRLGWAEAPVGPVLAETLGLPVSVASHVDAMAGAELLLAVRRPNTQAGTSLYVYARETVGYALSIGGRVHSPASGPGTIAALPVSSELLGGSGKLESTVSDEAVLTAARAQRIIPAEGPTSTMATLLRAARGGHEGARALLAERARVLGEAVALLRDMLNPDDLVVGGQAFTEYPEGMELVERAFADRSVLGARDIRVTAFGNRVQEAGAGVVSLGGLYADPIAAMRRAQQRRSEAAVLGAS >NC_023036.2|WP_019513795.1|1122546_1123305_-|SDR-family-oxidoreductase MTTSTDKRRVAVVTGASAGIGEATAKTLASLGFHVVCVARREAPIRALAAEIDGTAIVADVTDPAAVASLAERLDRVDVLVNNAGGARGLESVAEADIEHWRWMWESNVLGTLQVTKALLGKLIDSGDGLIVTVTSIAALETYDNGSGYTSAKHAQGVLHRTLRSELFGKPVRLTEVAPGMVKTDFSLNRFDGDEGRAEKVYAGVTPLVAEDIAEVIGFVASRPSHVDLDLIVVRPRDQVTGATGSRINRRT >NC_023036.2|WP_045546369.1|1121263_1122550_+|L,D-transpeptidase-family-protein 
MGLGGAGLLAACAGKPAGTSQAEESAAAKAPTVTLTPDDAATDITPTSPAGVVVSDGWFQKIALTNANGKVVAGKLNRDRTEFTVSEPLGYGAEYTWSGSVVGQDGQAVPVTGGFRTVNPQTTVNGQFQLSDGQTVGVAAPIILQFDAAIADEHRADVEKALKVTTTPAVEGSWAWLPDEAGGSRVHWRTKDYYPTGTTVHVDADLYGVKFGPQAYGAADSTLDFTIGRRQVVKAEASSHRIQVLDGAGAVIMDFPCSYGEGDLDRNVTRSGIHVVTEKYEDFYMTNPAAGYANVRERFAVRISNNGEFIHANPASSGAQGNSNVTNGCINLSLTDAEQYFQTAMYGDPVEVTGTRIDLSYADGDIWDWAVPWSEWQAMSALSKDSPPSGIPVTAPVTPSGAPTPSGTPTSTPTSTSTSTAAPTTAGR >NC_023036.2|WP_019513805.1|1132393_1133239_+|sugar-phosphate-isomerase/epimerase MRPAIKVGLSTASVYPLRTEAAFEHAARLGYDGVELMVWAEAVSQDIDAIEAMSQRYGIPVLSVHAPCLLISQRVWGANPIAKLERSVRAAEQLGAQTVVVHPPFRWQRRYAEGFSAQVAALEAGSDVLVAVENMFPFRADRFWGTGKPSIERMRRRGGDPGPAISAFAPSYDPLDGGHAHYTLDLSHSATAGTDALELARRMGDGLVHLHLCDGSGASTDEHLVPGRGNQPAAQICRQLATSDFTGHVILEVTTSGARNAAERDALLIESLQFAREHLLR >NC_023036.2|WP_019513806.1|1133235_1134042_+|thioesterase-family-protein MSVLFSDAMRLETAGDGVYTGALNEHWTIGPKVHGGAMLALCANAARTEIGVPGVEPIVVSGNFLWAPDPGPLQVFTDVRKRGRRISLVDVELRQGERVAVRAAITLGVPEDDTVPLLSTNPVVPLMTPEPPPGLEPIGPGHPMADVVHLAHGCDIRPSLTTMAPRSDGGPPVIEYWVRPRGAAPDVLFALLCGDVSAPVTFGVNRLGWAPTVQLTAFLRAVPVDGWLRVLCTTTQIGQEWFDEDHVVVDASGRIIVQSRQLALVPAS >NC_023036.2|WP_019513807.1|1134078_1134930_+|pyrroline-5-carboxylate-reductase MARIAIIGGGSIGEALLSGLLRAGRQVKDLVVAEKHPDRARQLSETHQVLVTSVADAVENASYVIVAVKPGDVSAVTAEIAEAVAKADNDSDETVFVSVAAGVSTIFFENKLPAGSPVVRVMPNAPMVVGGGVSAVAAGRFATPEQLKEVAAIFDTVGDVLTVTETQMDAVTAVSGSGPAYFFLMVEALVDAGVAAGLSREVSTELVVHTMAGSAAMLLDRRDSAPNGVMDTSATALRAIVTSPGGTTAAGLRELERGGLRSAVADAVQAAKTRSEQLGITSE >NC_023036.2|WP_023985266.1|1135067_1135298_+|helix-turn-helix-domain-containing-protein MTSMNGPSARDSAGDGQPKAQFLTVAEVASLMRVSKMTVYRLVHNGELPAVRVGRSFRVHAKAVHDLLETSYFDAG >NC_023036.2|WP_003402602.1|1135367_1135469_+|AURKAIP1/COX24-domain-containing-protein MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK >NC_023036.2|WP_019513809.1|1135498_1136566_+|SDR-family-oxidoreductase 
MDELGSEPKVVLVTGACRFLGGYLTARLAQNPAIDHVIAVDAIAPSKDLLRRMGRAEFVRADIRNPFIAKVIRNGNVDTVVHAAAASYAPRAGGRATLKELNVMGAIQLFAAAQKAPSVRRVVLKSTSEVYGSSSRDPVRFTEDGSARRPPTDGFARDSIDIEGYARGLARRRPDIAVTILRLANMIGPAMDTALSRFLAGPVVPTVLGRDARLQLLHEQDALGALERATVAGRPGTFNVGADGIIMMSQAIRRSGRVALPVPRSALAVVDSLRRATRYTELDREQLNYVSFGRVMDTARMRNDLGYIPKWTTAEAFDDYVRGRGLTPIIDPNWVRSVEGRAVALAQRCGGLGTT >NC_023036.2|WP_023985267.1|1136591_1137656_+|acyltransferase-family-protein MAGESKAKVIPLRANSGRSTAARRAAQRADGARRHPSLLSDSDERASAEEIAAVVREIDEHRNNGAAAAPEDVPNELSKAISAIADFATRRMTGDYTVDEFGFDPHLNDNVVLPLLRGLFRNWFRVEVSGIENLPLDGAALVVANHAGVLPFDGLMASVAVHDHHPRQRALRLLAADLVFDMPVVGQAARKAGHTVACSSDAHRLLAAGELTAVFPEGFKGLGKPFKDRYKLQRFGRGGFVSAALRAQVPIVPCSIVGSEEIYPKIGDITLLARLLGLPYFPVTPLFPLAGPLGLVPLPSKWHIKFGEPISTDGYDEGAADDPMITFELTDHVRETIQHTLYQLLANRRNTFLG >NC_023036.2|WP_019513811.1|1137772_1138618_+|hypothetical-protein MRIPFVGAEAVASGELTPFALRRRYRPIYRGIYVPAEHEVSLRDRIVGIGLAAPDAVIAGVAASALYGAKWVDADEPIEVVMGGRRTQQGLIVRNDTLQPDEIATISGVRVTTPARTAFDLARYHPRDWAVARLDALARARRFSVEQVATIAERHPRARGVTRLRTTLPFVDGGAESPKETWLRLLFIDAGLPRPTTQFVVYDEEGRYVRRIDMCWTEFKVGAEYDGQQHLTSRYDYVNDVKIGRVLRRLDWRIQHVIKEDRPAEIISEARTTLLSRGWRP >NC_023036.2|WP_019513812.1|1138680_1139211_-|hypothetical-protein MGIAEEIVGTHYRYPDYFEVGREKIREFATAVKDEHPAHHSEEGAAENGHDSLVASLTFIAVAGRRVQLEIFNQFDVPVNLERVLHRDQKLVFHRPIKAGDKLWFDSYLDSVIESHGAILTEVRAEVTDDDGNPVLTSVITILGEAEHEGEADEVTAQIAAARDASIARMVANQNS >NC_023036.2|WP_019513814.1|1140076_1140328_+|glutaredoxin-family-protein MDHQVLLLTRAGCGLCATAAATLDALAAELGMRWESVDVDIAAEGGQPALRAEYGDRLPVVLLDGVEHSYWEVDEAQLRKDLS |
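The array record for NC_023036_1 can be cross-checked against its reported coordinates: the array is left repeat + spacer + right repeat, so its length should equal the span of CRISPR_location, and the spacer start given in the spacer header should equal the array start plus the repeat length. The following is a minimal Python sketch of that check. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the function name `check_array` is hypothetical. The two flanking repeats are passed separately because they need not be identical (the two repeats of NC_023036_3 differ at one base).

```python
def check_array(location, left_repeat, spacer, right_repeat, spacer_start):
    """Rebuild a one-spacer array and compare it with its coordinates.

    Assumes 1-based inclusive coordinates (an assumption about the page).
    """
    start, end = (int(x) for x in location.split("-"))
    array = left_repeat + spacer + right_repeat
    return {
        "array": array,
        # The rebuilt array should span exactly start..end.
        "length_matches": len(array) == end - start + 1,
        # The spacer begins right after the left repeat.
        "spacer_start_matches": spacer_start == start + len(left_repeat),
    }


# Values copied from the NC_023036_1 record above.
result = check_array(
    location="1131556-1131638",
    left_repeat="GTCCTCGACGAGCCGGAAGCCGA",
    spacer="GGCTCCGACGGCCGAGGAGACCGCCGTCGACGCGCCC",
    right_repeat="GTCCTCGACGAGCCGGAAGCCGA",
    spacer_start=1131579,
)
print(result["length_matches"], result["spacer_start_matches"])
```

Both checks hold for this record under the stated coordinate assumption: the rebuilt array is 83 bases long and the spacer starts 23 bases after the array start.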
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_023036_2 | 4393025-4393128 | Orphan |
NA
Consensus repeat of NC_023036_2
|
1 spacer
spacers of NC_023036_2
>2.1|4393052|50|NC_023036|CRISPRCasFinder CGATCCTCGCTCGTCCCTCGCTGCGATCCTCACTCACGCCCGTCTCCCGC |
CRISPR arrays and Neighbor proteins around NC_023036_2
The CRISPR arrays of NC_023036_2 >merge|NC_023036|2|4393025-4393128|CRISPRCasFinder GACATCGGTCGACTCCCGCGTTCGCTCCGATCCTCGCTCGTCCCTCGCTGCGATCCTCACTCACGCCCGTCTCCCGCGACATCGGTCGACTCCCGCGTTCGCTC >NC_023036|2|2|4393025-4393128|CRISPRCasFinder GACATCGGTCGACTCCCGCGTTCGCTC CGATCCTCGCTCGTCCCTCGCTGCGATCCTCACTCACGCCCGTCTCCCGC GACATCGGTCGACTCCCGCGTTCGCTC
>NC_023036.2|WP_019510494.1|4391616_4392561_+|2,3,4,5-tetrahydropyridine-2,6-dicarboxylate-N-succinyltransferase MTAASGVGLATIAADGTVLDTWFPAPELTGDGSTGTVRLSVAELPESLGALTGPDADRDVEVVAVRTSIADLDDKPVDTYDAYLRLHLLSHRLTKPHEANLDGIFGLLANVVWTNFGPAAVEGFELVRAKLRNRGAVAVYGVDKFPRMVDYVLPAGVRIADADRVRLGAHLAPGTTVMHEGFVNFNAGTLGNSMVEGRISAGVVVDDGSDVGGGASIMGTLSGGGKEVISIGKRCLLGANAGVGISLGDDCVVEAGLYVTGGTKVTTGDGQTIKAKELSGSNNLLFRRNSVSGAVEVVKRDGTGITLNEALHAN >NC_023036.2|WP_019510493.1|4390514_4391582_-|succinyl-diaminopimelate-desuccinylase MGLDLRADPITLTAALVDIPSESRHEQRIADEIESALRAQAPHFEVIRSGNAVLARTNLGRSSRVLLAGHTDTVPAADNVPSRRDGDLMYGCGTSDMKSGDAVFLHLAATIAEPAHDLTLVMYDCEEIESSANGLGRIERDLPAWLAADVAILGEPSGGFIEAGCQGTIRVVATAAGTRAHSARSWLGDNAIHKLGAVLDRLSRYQARSVDIDGCVYREGLSAVRIDGGIAGNVIPDAASVTINFRFAPDRSVEQAVAHVHEVLAGLDVTCETTDAAAGALPGLANPAAAALVAAAGGQVRAKYGWTDVSRFAALGIAAVNYGPGDPNLAHKVDEHVDITAITATTETLRAYLTA >NC_023036.2|WP_031601636.1|4388819_4390406_-|ABC-F-family-ATP-binding-cassette-domain-containing-protein MSIVLSHLSFNWPDGTVVFDELSSAFGRGRTGLVAPNGAGKTTLLRLIAGELTPTAGSLTVDGVLGYLPQNLPFLARHTVSDVLGITAVLTALDALAAGDASEQVFAAIGDDWDIAERSRAQLDRLGLADLELDRPLSTLSGGQVVSLGLTAQLLRHPDVLLLDEPTNNLDIDARQRLYAALDDFGGCLLVVSHDRVLLDRMDGIAELRNGEVSHFGGGFTDYQIAVQSAQELAENNIRNAEQELKRQKQQMQQARERADKRASTAKRNLKDAGLPKIVAGKLKRDAQQSAAKADDVHARRIGDARSRLDDAERALREDDLVALDLPETEVAAGRVFFSGTGLSSRVFADIDLDIRGPERIALTGGNGAGKSTLLRIIAGDLEPGGGVVQRGDGRIAYLSQRLDLLEERATVADSLAVSAPGLSITRRRHLLAQFLFRGDRIDLPIAALSGGERLRATLACVLFAEPAPQLLLLDEPTNNLDLASVAQLESALNAYRGAFVVVSHDRTFLDNIGIQRWLRLADGVLAT >NC_023036.2|WP_019510491.1|4388024_4388807_+|ESX-1-secretion-associated-protein MTDQFSVQTDGVRNYAQTHSDVNSGLVGLPALDGTGLNNSHGAIAASVSTALGTALSGRGGAMGATSTSASTISDLLQQAARAYAGGDKEGGRRLRAAADALDGGQPGAGGAGAAGAAGAGGADAMGQMGQIMGQVGQQVGQLAQSVTAPLQGLAQGLQQVPQQIMQGVQQAVQAAGGAGASGAAGGAGVKLPSGDELKDAEKAVAEKADTAQETDRAERGETGERTEATDAQGGQDGSGRAPVEAPAPAQPAPTRPQVD >NC_023036.2|WP_019510490.1|4386735_4387956_+|hypothetical-protein 
MPDLVDAAAAALARGDLAVAEEQARSALADGTSLPALLILAQALAWQGRGTDADTVLARVDPAGLGDADLIAWALPRAANQFWMLDQPERATAFLRAIRGRLSSAVTIDALLCTFAMNAGSPQRALDIAESVLSCDHAEDRAVGWAAAAAGLSAARMGRFDQVDGLAARAGAAGHPGVLRFTSTYGQITARLLTGDIGAADDVADGLVCDTGPSRAIALVLRADIALARGVLDEAVEALREAAPALSTTGYSWGQLAWMLLAQAHAQQGRAVDAAKALSRAESRHGLKSMLFAPELALAKAWTAAARRDQPGAVRAAREAARAALRGGQHAVALRALHDAVRLGDTRAAEAVAGVSCECVFGRLTAEHAQALSSGDIAGLESVAARWDGLGWGAAARDAARQAGRS >NC_023036.2|WP_045546546.1|4386120_4386711_-|TIGR00730-family-Rossman-fold-protein MEPRRNRRYVLPVPETRPWAVCVYCASGPTHPELLKLAGEVGRSIADRGWTLVSGGGNVSAMGAVADGARQHNGATIGVIPKALVHRELADVDADELVVTDTMRERKQVMEERADAFIALPGGIGTLEEFFEAWTAGYLGMHDKPIIMLDPFGHYDGLLTWLRGLVATGYVSEGALDRLVVTADVETALSACSPNH >NC_023036.2|WP_019510488.1|4384338_4386111_-|long-chain-acyl-CoA-synthetase MSDDTTRTSVGLLEIATKLPGFLRDAPAIARGVLTGMSARPSAKTSIGKVFQERAAQYGDRVFLKFDDQQITYRKANETVNRYAAVLAAKGVGHGDVVGIMLRNSPDSVLLMLATVKCGATAGMLNYHQRGNVLAHSIGLLNAKAIIAESDLVEPITESGVQTTGLTTLEEMRQAATTAPTTNPATTAAVLAGDKAFYIFTSGTTGMPKASVMTHYRWLRALAGFGGLGLRLNSSDTLYCCLPLYHNNALTVSVGSVLNAGAALALGKSFSASRFWDDVIRFDATAFVYIGEICGYLLNQPPKPTDRAHKVRVIVGNGLRPAIWDQFVERFGIPRVCEFYAASEGNTAFVNVFNVSKSTGICPSPVAYVEYDLESGEPARGPDGRLRKVKRGQPGLLLSKVSSFQPFDGYTDKSASEKKLVRDAFKDGDVWFNTGDLMRAQGFGHAAFADRLGDTFRWKGENVATTEVEAAISADSQVEEATVFGVEVPGAGGRAGMVALQLKDGQEFDGAALAKSVYAHLPGYAVPLFVRLVKELAHTSTFKSQKVELRKQGYGEEVEDPLYVLAGKDEGYVPFYPEYVDEVVEGKRPK >NC_023036.2|WP_019510487.1|4383411_4384287_-|dihydropteroate-synthase MQRTFLGRPVAGDRALIMAIVNRTPDSFYDRGATFSDEAAKEATHRKIADGADIIDIGGVKAGPGQTVDADEEIARVVPFIEWLRGTYPDQLISVDTWRAAVAKQACAAGADLINDTWAGADPGLPEVAAEFDAGLVCSHTGGAVPRTRPFRVHYGVTERGVVDDVIAEVTAAAERAQAIGVARDRILIDPTHDFGKNTHHGLSLLRHVKDLVNTGWPVLMALSNKDFVGETLGVGLTERLEGTLAATALAAADGAAMFRVHEVGPTRRVLEMVASIQGSRPPTRTVRGLA >NC_023036.2|WP_019510485.1|4382339_4383308_-|glucosyl-3-phosphoglycerate-synthase 
MTLISELTPELTDIDKTDAVVGHPWFADHSFGRPAWTVEELIEAKRGRTISVVLPALNEEETVASVVETITPLLGNLVDELIVLDSGSTDDTEIRAVAAGARVISRETALPELAPRSGKGEVLWRSLAATTGDLVVFVDSDLIDPDPMFVPKLLGPLLTVDGVHLVKGFYRRPLKVSGSEDANGGGRVTELVARPLLAALRPELTCLLQPLGGEYAGTRELLTSVPFAPAYGVEIGLLVDTYNRYGLDGIAQVNLGVRTHRNRPLTELASMSRQVIATLLNRCGIEDSGMGLTQFFADGDDYTPRTSGVSLADRPPMNTLRP >NC_023036.2|WP_023986126.1|4381938_4382295_-|DivIVA-domain-containing-protein MTLILMYLVVLILVGAVLFAIGSVLFGRGEQLPPLPKATTATVLPASGVTGADVDAVKFTQTLRGYKTSEVDWVLDRLGAELESVRGELAALRAAYGVEDPTTFPAEHEAAHARSEQS >NC_023036.2|WP_019510496.1|4393199_4394600_-|acyl-CoA-synthetase MLLTSLDPAAVAAGHDLADAVRIDGVSLSRSDLVGAGTSVAERVARAQRVAILATPTATTVLAVVGCLIAGVPFVPVPPDVGATERAHLLSDSGAQAWLGELPAETEGLPHIPVRMHARSWHRYAEPAPQSTAIIMYTSGTTGLPKGVKISRQAIAADIDGLVQAWQWTAEDTLVHGLPLFHVHGLVLGLLGSLRIGNRFVHTGKPSPAGYAEARGSLYFGVPTVWSRIAADERAARALAGARLLVSGSAALPVPVFEELVRLTGHAPVERYGSTESLITLSTRADGERRPGWVGLPLDGVQTRLVDEDGALVPHDGETIGHLQLKGPTVFTGYLNREDATAEAFDPEGWFRTGDVAVIDADGMHRIVGRESVDLIKSGGYRIGAGEIETVLLGYPGIDEVAVVGLPDADLGQRIVAFVVGDVEPQQVIDFVAEQLSVHKRPREVRIVESLPRNAMGKVLKKELAK >NC_023036.2|WP_019510497.1|4394623_4395163_+|NUDIX-hydrolase MARIERLSSREVYRNNWMTVREDAIRRPDGSEGIYGVIDKPTYALVIARDSDRFHLVEQYRYPIGLRRWEFPQGTAPDLADLEPEELAARELREETGLRAESLVRLGMLDVAPGMSSQRGWVFLATGLHEGAHEREHEEQDMRSEWFTAAQIEEMIRGGAITDAQTIAAWAMVLLSERN >NC_023036.2|WP_019510498.1|4395284_4395779_+|hypothetical-protein MLVRRLCAALAALMVAGLFPAPSAGAAAQWWNGRYQVVSYASQKNGTSVAARQPEGDLTALYTFATACGTACVATVVDGPAPSNPTIPQPQRYTWSAGKWTFSYNWQWECFRGEGLPRLYSPAQSWVTYTPQPDGSLQGSWYTDILSGPCRGNVLIPAAAFPAP >NC_023036.2|WP_019510499.1|4395791_4396739_-|proline-dehydrogenase MSVFTRVARPAILAAGRRDGLRRTAQRLPITRAVVHRFVPGDTVQDAMASVADLRDSGRMVSIDHLGEDVDDIATAQATVRAYLGLLDALHARAETASAIRPLEISLKLSALGQALERDGEKVALENARVICERAAAAGVWVTVDAEEHTTTDSTLTIVRDLRADFGWVGTVLQAYLKRTPADCADLADSRIRLCKGAYDEPASVAHRDAGEVTESYLRCLRILMKGPGYPMVASHDPAIIERVPGLAAEYGRGNDDFEYQMLYGIRDDEQRRLAGGGGRVRVYVPFGSQWYGYFVRRLAERPANLMFFLRALRD 
>NC_023036.2|WP_019510500.1|4396738_4398379_-|L-glutamate-gamma-semialdehyde-dehydrogenase MNAITGIAQVPAPTNEPVHEYAPGSPERTRLTAALNELSGNAIDLPHVIAGVHRMGGGESIDVVQPHRHRARLGTMTNAGHAEAQAAIEAAEDAKAQWAHLPFEERAAVFLRAADLLAGPWREKIAAATMLGQSKTAYQAEIDAPCELVDFWRFNVAFAREILAQQPVSGPGVWNRTDHRPLEGFVYAITPFNFTAIAANLPTAPALMGNTVIWKPAPTQTFSAYLTMQLLEAAGLPPGVINLLTGDGQAVSEVVLADPRLAGIHFTGSTATFRHLWRQVGTHVERYRSYPRLVGETGGKDFVLAHSSARPDVLRTALIRGAFDYQGQKCSAASRAYVPRSVWQQMGDDLLSATEALRYGDVTDLSNYGGALIDARAYAKNTRALQRAKSTPGLTIAVGGEYDDSEGYFVRPTVLLADDPSDESFATEYFGPILAVHIYPDGEFDRILTVVDQTAPYALTGAVIADDRTAIVTAQDRLRHTAGNFYVNDKPTGAVVGQQPFGGGRASGTNDKAGSPLNLQRWTSPRSIKETFVPPTRHEYPHMGDL >NC_023036.2|WP_019510501.1|4398463_4400017_+|helix-turn-helix-domain-containing-protein MSGVRLGQLLLALDATLVSLVEAPRGLDLPVASAALLDREDIQLGVAPAFGSADVFFLLGIDHPDTIRWLDQHGRSPVAIFAKHPSPEVIRRATRAGIAVVAVEPRARWERLYRLVDHVFDHHGAGSAHDSGTDLFGLAQSIAERTRGMVSIEDAESHVLAYSASNEEADELRRLSILGRAGPPEHLAWIARRGIFDALHAKPDPVRVAERPELGLRPRLAIGIFAATGDTRRAPAFLGTIWLQQGDRPLAEDTEEVLRGAAVLAGRLITRLTAKPSGHAVLVQDMLGLTGDPPEIEAISRELGIPATGRAAVIGIDSTTEGTRLADVLALSASAFRPDAQVAAAGGRVYVLFPDAGKGLPSWVRSTVSALRTELGLELRAVTAAQLEGLAGAAAARAEVDRVLDSAARRPGSLAAITSPAEARTTVLLDEIVTMIAADGRLVDPRIRALRADEPVLAHTLTVYLDSFGDVASAAAALHVHPNTVRYRVRRIEGILGASLAEPDVRLLMTLSLRATA >NC_023036.2|WP_023986127.1|4400041_4400854_+|endo-alpha-1,4-polygalactosaminidase MARVRRLLAVAVSSLAVSTTVVGPHASAAPAALPPTTGGFDYQLGGASDVPALAVVVRDSTAQPLAGAYNICYLNGFQTQPGADWSGDRGSALLRDESGTPVADADWPDEYILDPSTPSQRTTILQVLTPGLNRCAANGFDAVEIDNLDTFTRFPAIERAGAMELARSYIALAHGRGLAIGQKNAAELAGIGRGQLGFDFAVTEECAAYDECNAYTGPYGPHVLQIEYVDNLPAPFAAVCAAPDRAPLTILRDRDLTPPGAAGHVYQQCP >NC_023036.2|WP_031601638.1|4400870_4401959_-|succinyldiaminopimelate-transaminase 
MSATLPVFPWDTLADVTAAAKAHPDGIVDLSVGTPVDEVAPVIRDALAQASGVPGYPTTAGTSALRSAIHAALARRFGITDIAAEAVLPVIGSKELIAWLPTLLGVGAEDTVVIPELAYPTYDVGARLAGAQVMAADSLTQIGPQVPALIYLNSPSNPTGKVLGADHLRKVVGWARERGVLVASDECYLGLAWDAEPLSVLHPSICGGDHTGLLAIHSLSKTSSLAGYRAGFVAGDPAVVTELLAVRKHAGMMVPGPVQAAMVAALTDDEHIAVQRERYARRRALLLPALLAAGFTVDHSEAGLYLWATRGEPCRQTLAWLAQRGILVAPGEFYGPAGAQHVRVALTATDERIAAAVQRLGQ >NC_023036.2|WP_019510504.1|4401976_4402297_-|ferredoxin-family-protein MTYVIAEPCVDVKDKACIEECPVDCIYEGGRMLYIHPDECVDCGACEPVCPVEAIYYEDDVPDQWSSYTQSNADFFSELGSPGGASKVGQTDNDPQAIKDLEPKGE >NC_023036.2|WP_019510505.1|4402560_4403118_+|PadR-family-transcriptional-regulator MALPHAILVSLCEQVGSGYELAHRFDRSIGYFWSASHQQIYRSLRTMESEGWVQVREVAQRGRPDKKVYSVTPAGRAELAHWIAAPLSGRGSTVADNRTRDLAVKIRGCGYGDIEAVRAQAVALRAERAALLDTYRGFEKRQFPDPARLTGAELHQYLVLRGGIRAEEGAIEWLSEVVSALGGVQ |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_023036_3 | 4616428-4616510 | Orphan |
NA
Consensus repeat of NC_023036_3
|
1 spacer
spacers of NC_023036_3
>3.1|4616454|31|NC_023036|CRISPRCasFinder CGGTGCCCCCGAGACCGCGCCGGCTCCCGCG |
csa3 |
CRISPR arrays and Neighbor proteins around NC_023036_3
The CRISPR arrays of NC_023036_3 >merge|NC_023036|3|4616428-4616510|CRISPRCasFinder GGCAGCCAGGGCACCCCGTTGCCCGCCGGTGCCCCCGAGACCGCGCCGGCTCCCGCGGGCAGCCAGGGCACCCCGCTGCCCGC >NC_023036|3|3|4616428-4616510|CRISPRCasFinder GGCAGCCAGGGCACCCCGTTGCCCGC CGGTGCCCCCGAGACCGCGCCGGCTCCCGCG GGCAGCCAGGGCACCCCGCTGCCCGC
>NC_023036.2|WP_023986186.1|4615108_4615462_+|winged-helix-turn-helix-transcriptional-regulator MGHGVEGRTPPVASLDAAAAVKVAETLQALASPNRLLILTRLRQAPCTVTELSGAVGMEQSAVSNQLRLLRALGLVAGDRAGRNIIYRLYDAHVAQLLDEAVYHIEHLRLGARDDTA >NC_023036.2|WP_045546381.1|4613112_4615038_-|cadmium-translocating-P-type-ATPase MIGVERVERSIGAQRRRTAWSLESVRWAVGALMFFLAGLILQLGGAPEMVWWTAYLACYLAGGWQSAVDGLTALRRRRLDVDLLMVLAALGAAAIGQVFDGALLIVIFATSGALEDVATRRTEDSVRALLDLAPERAVRVDRGVETEVPVSELRVGDRILVRPGERVSADATVIEGASDVDQAGITGEPLPVAVHVGDEIFAGTLNGTGALAAEVVRDPSDSVVARIVAMVGQASATKARTQLFIEKIEQRYSMVMVGATLALFTVPLLFGADLRSALLRAMTFMIVASPCALVLATMPPLLSAIATAGRHGVLVKSAVVMERLATVQAVAFDKTGTLTTGRPRVVGEPPETVLALAAAAEQLSEHPLGRAIVAAARERALPIAPVTRFRALPGRGVSALVGGRRVEVVSPAALGGPAPESVAALESAGATAVVVTVDGECAGVIGLRDTVRQSAADVVADLHRCTGRVPLLLTGDNRCAAARLGDELGMEVRAELLPDQKVAAVRARAERVLVVGDGVNDAPAMAAAHVSIAMGRSGSDVTLQTADAITVRDDLATIPAALALARRARRVVFANLVIAASFIAVLVVWDLFWHLPLPLGVAGHEGSTVIVALNGLRLLRDRAWFMAVRQSRGGAESRCRA >NC_023036.2|WP_019510252.1|4611941_4612979_-|iron-ABC-transporter-substrate-binding-protein MAAKATALAGAAALAVALTACGSDSGQSDGAQAGASDQLVVYSGRSEDLVGPLLERFTEATGIGVEVRYAGSGELAAQLITEGDASPADVFLSQDAGALGAVSAAGLFAPIEAETLAAVPAAYSAADGTWVGVSGRARVLTYNPELAPTVPDTIDGLLDPQWRGKIGYAPSNASWQSFVTGLRVLRGEQGAKDWLEAFAAQDPRAFEGNGPMRDAINSGELPMGLTNHYYLYELIDSTSADDVVAKNQYMAAGDPGGLVNVAGVGVLKSAPHAEQANEFAAYLVGTAAQEYFATETAEYPLVEGVTPSAALPPLAQLQPPAVDLSQLDDVETTQELLVETGLLTN >NC_023036.2|WP_019510251.1|4610385_4611900_-|iron-ABC-transporter-permease MTLVAAAAVVVAGLLLPLWYLAQRANERGLGFVVRELIQPRTAALVGRSALLVVVVTVACVVLGLGFAVLIRRTDIPARRALTIALTLPLAMPSYLLSYLWVSTVPGIAGFWGAALVLTLVSYPLIMMPTLAALARSDPAQEEVARSLGLNGFAVLCRVTLRQARAAIAAGALLVALYVLSDFGAVAAMRYEAFTWVIYGSYRSGFNPARAAVLSLVLLVLAVALVLAEHRARGRAAASRIGSGAPRPAPVNRLGRWTVLAWLPVAVVLTAALIVPFVALGDWLLAGGVRFDAQRWWSALGATVWLSGVAAVVCTAAALPLGVLAARYRTRTTRMLEGAAYLSHGLPSIVVAIAMVSVGVLLLRPIYQREPLLILAYAVLFVPLAVGSIRSAVEAAPIRLEEVARSLGRSPLAAFCTVTARGAAPAVAAGAALVLLTCMKELPVTLLLHPTGTSTLATQLWGHSFVSDYAAAAPYAAALFVFAAIPTAVLGLWSADIGTGDGRD 
>NC_023036.2|WP_019510250.1|4609328_4610399_-|ABC-transporter-ATP-binding-protein MAVTEPATGLPSGIRVEGVTKSFVERTVLDGIDLEVPNGHITAVLGPSGCGKTTLLRIIAGFEEPDRGAVSVGGVPVVGAGTGRRDGSVPAHRRRVGLMPQEGALFPQLSVGRNVTFGLPRARRSDTAIAEHWLGVVGLDGLADARPHQLSGGQQQRVALARALAAEPSVLLLDEPFAALDAGLRVRVREEIATILRATQTTALLVTHDQAEALSLADSVALLIAGRVAQHGPPAQLYDRPVNLEVARFVGGTVELDGDIRGGILTCALGTHRPEVAPADGPVTVVVRPERVHVVDPACGAQAVVSECRFYGAELGVHVVLGDGTALVLRLPATQSCSAGQRVGLAVDGPMLAYPR >NC_023036.2|WP_081649901.1|4607361_4609341_+|propionyl-CoA-synthetase MPISSEQAWGYTGPGRGFRPLQPRCNVTQITVLPMSGYRAIFDASISDPETFWADAAKAVTWTREPHRVLDDSNPPFYRWFPDGELNTCANALDRHIDERGDQAALIYDSPVTGTKRTYTYRELLEATARFAGVLKGLGVTKGDRVVIYMPMVPEAVIAMLACARLGAVHSVVFGGFAGHELATRIDDARPTVVVSASCGIEPTRTVEYKPMLDTALELAEHSTPKCVILQREQHPCELVAGRDIDWAEAMATAEPVDPVPVAATDPLYVLYTSGTTGKPKGIVRDNGGHAVALLWTMRNIYDLNPGEVFWAASDVGWVVGHSYIVYAPLLFGATTVLYEGKPVGTPDAGAFWRVAAEHKVKALFTAPTAIRAIKKEDPDARHLGDYDLSGLKYLFQAGERLDPGTYEWASDKLGIPIIDHWWQTETGWAIAADPMGIEQLPVKPGSATVPMPGYDVRVVRPDGSECDAGEEGSIVVKLPLPPGTLPTLWGEDDRFVSSYLRAFDGYYLTGDGGHIDSDGYLFVVGRTDDVINVAGHRMSTGSIEAVLATHPAVAECAVIGVADDLKGQVPRALVVLKSGFSADGLDTELVEAVRNDIGAVASFKLVDVVAALPKTRSGKILRKTMRGIADGKDEPVPSTIEDPSVLEALKQTLRPHLG >NC_023036.2|WP_019510248.1|4606728_4607280_+|TIGR03086-family-protein MTDSSVAETYAGLADGMAGVIASVTPQQWDAASACEGWSARDVVAHLIDTQREFFQRHEFPLPTRPDLADPVAAWSAHTAAIGEILADPRVPARTFDGHFGPTTIGETLLRFYGFDLIAHRWDIAAATDSRYRFTDGELDRLEEGIAAFGDALRMEGVCGPAVEVGPDADRQTRVLAVLGRHG >NC_023036.2|WP_019510247.1|4605854_4606706_-|helix-turn-helix-transcriptional-regulator MASGLDKPDRPDNTVADEPAHLLDPAHRAAIHIARPAAPTDLDGLVRRFWFPVWRVPAGQTFTQQVLQYPVCLMVITDTYARFYGPASGLAGTPLTGDGWAAGVMFEPAAGTLITGGSVSRWTDTHVDLADQLGAAGAALTEQIRSIMADDPAAPRAQSAAVDCYAAFLRRFGPVDELGRTVNDIVAHIEDNPDVSRVADVCAQFGISERSLQRLTRHRIGLSPKWLVRRRRIQDASWRLRTGATTVAAVAADLGYADEAHLSRDFRRVTGQTPGAFAARYAD >NC_023036.2|WP_019510245.1|4603382_4604063_+|response-regulator 
MTRVLVIDDEPQILRALRINLSVRGYEVHTAATGAEALRAAADHRPDVVILDLGLPDMSGIDVLAGLRGWLTVPVIVLSARTDSSDKVEALDAGADDYVTKPFGMDEFLARLRAAVRRASAAIEDDQPVIETSSFTVDLAAKKVTKSGTEVHLTPTEWGMLEMLVRHRGKLVGREELLKEVWGPAYAKETHYLRVYLAQLRRKLEVDPSRPKHLITEAGMGYRFQE >NC_023036.2|WP_031601661.1|4600851_4603386_+|sensor-histidine-kinase-KdpD MGDDGLVTDRPKRGELRIYLGAAPGVGKTYAMLGEAHRRLERGTDLVAAVVETHGRSKTAELLEGIEIIPPRLVEYRGATFGELDVDAVLARRPQVVLVDELAHTNTPGSANPKRWQDIEQLLAAGITVITTVNVQHLESLNDVVAQITGIEQQEKVPDEVVRAADQIELVDITPEALRRRLSHGNVYAPDRIDAALSNYFRRGNLTALRELALLWLADQVDAALAKYRSDNKITATWEARERVVVAVTGDKESETLVRRASRIASKSSAELMIVHVVRGDGLAGVSAPMMGTVRDLAASLGASVHTVVGDDVPAALLDFAREMNATQLVVGTSRRTRWARILDEGIGAAVVQNSGTIDVHMVTHEQTGRATARSGNRNWRQHAASWLAAVVVPTALAAVAVLWLDRYLGVSGESALFFVGVLAVALLGGVAPAALSAVLSGLLINYFLAEPRYTFTISEPDSAITIAVLLMVAVAVAALVDSAAKRAREARRASQEAELLAHFAGSVLRGADPTALLERVREVYSQTAVSLLRERDGETHVVACAGKQPCVDVDSADTAIEVGDDEFWLLMSGRKLPAKDRRVLGAVAKQAAGLVRQRELISEAGRAEAVARADELRRSLLSAVSHDLRTPLAAAKASVSSLRSSDIDFSPEDTAELLATVEESVDQLTALVGNLLDSSRLAAGVVKPELRRVYLEEAVQRALLGISRSSKDSGWDRVKVDVGDAVALADPGLLERVLVNVIDNALRYGGDNPVRVNAGRVGERVLITVADEGPGIPRGAEEQLFAPFQRLGDQDNSTGVGLGLSVASGFVTAMGGTISATDTPGGGLTVVIELAAPQEGPQP >NC_023036.2|WP_131701285.1|4616830_4617487_-|hypothetical-protein MSTWNSGGGPPPIVPRPPSRGGPNVALIAGVAAAVLAIGGGVAYFVLSPSDPDEPVGQQTSVSAQSGASEGATTEQDEGDNDRLMKVLPRGYPDGACKPVARLDGALATIACTVNKDPGGPMSATYSLLVDSAALKAAIDNLETTSTVVDCPGRIQSPGPWRHNASLHEVSGTLMCGIQNDNPMLAWTNFDDQMFAVVQGRPAGPTLDNLYAWWSTHS >NC_023036.2|WP_019511881.1|4617519_4618464_-|Ppx/GppA-family-phosphatase MGVKVGAIDCGTNSIRLLIAEGGSPGLVDVHREMRIVRLGQGVDATGEFAPEALARTETALADYVALMREHDVARVRMVATSAARDAGNRDEFFAMTARLLGTVSDGAVAEVISGTEEAELSFRGAVGELDSTGAPFIVVDLGGGSTELVLGDGAGVSASFSANIGCVRIKERCLPSDPPPAEEIEAARTVVRAALDEALRAVPVERARTWVGVAGTFTTLAALAHRLPVYDPAAIHLSRTGFGDLSTVCADLLAMTAQERLALGPMHEGRADVIAGGAIVVQELARVMADRAGIDKLVVSEHDILDGIALSIA >NC_023036.2|WP_019511882.1|4618454_4618946_-|DUF501-domain-containing-protein 
MVEQADLDAVARQLGREPRGVLEIAYRCPNGEPAVVKTAPKLPDGTPFPTLYYLTHPALTAAASRLESSGLMREMTERLAEDPEVAAAYRRAHESYLAERDAIESLGTDFTGGGMPDRVKCLHVVIAHSLAKGPGLNPFGDEALAVLAVEPGMAGILDRKVWA >NC_023036.2|WP_023986188.1|4618938_4619646_-|septum-formation-initiator-family-protein MPDAKRPDPRRRGPAPRPGKAGGAGRPRASSVRRDPKAREPKAIESSKSRQADGAAGFADDTGGPDTVAEAIRRSVAETADTHSEQRFGSAARRAAILAAVVCVLTLTIAGPVRTYFAQRTEMNQLKMVEAQLRSQIADLEQQKIKLADPVFIAAQARERLGFVMPGDIPYQVQLPPTAAVEPDTGPEAPTAINTDPWYTSLWHTIADQPHGITPAVPPAPPAPGGTPTPVPAGG >NC_023036.2|WP_023986189.1|4619655_4620945_-|phosphopyruvate-hydratase MPIIEQVGAREILDSRGNPTVEVEVALIDGTVARAAVPSGASTGEHEAVELRDGGPRYLGKGVEKAVEAVLDEIAPAVIGLSADDQRLVDQALLDLDGTPDKSRLGANAILGLSLAVARAAAESAGLPLFRYVGGPNAHILPVPMMNIINGGAHADTGVDVQEFMIAPIGAPTFKESLRWGAEVYHSLKSVLKKQGLSTGLGDEGGFAPDLPGTKAALDLIGTAIEGAGFKLGTDVALALDVAATEFHTEGKGYAFEKETRTAAQMAEFYAGLLDTYPLVSIEDPLSEDDWDGWVELTTAIGDRVQLVGDDLFVTNPERLEEGIDKGAANALLVKVNQIGTLTETLDAVALAHNSGYRTMMSHRSGETEDTTIADLAVAVGSGQIKTGAPARSERVAKYNQLLRIEEALGDAARYAGDLAFPRFAPASQ >NC_023036.2|WP_031601662.1|4621068_4621809_-|lipoprotein MTAFWRARWVRAALVLVAALLLLASSCSWHRGEHIPDGVPPPRGAAVPAIDTNAAGRPADQLRDWAAELAPKTGIPEQALQAYAYAARVAEVVNPKCNLAWPTLAGIGMVESHHGTYKGADIAPNGDVRPPIRGVQLDGTMGNMEILDTDQGLLDGDPTMDRAMGPMQFIPETWRLYGVDANNDGVISPDNFDDAALSAAGYLCWRGKDLSTPRGWMEALRAYNLSNQYARNVRDWATTYADASIS >NC_023036.2|WP_019511886.1|4622062_4622998_+|EfeM/EfeO-family-lipoprotein MKRHFAWQLPIAAIALVLSACSNGDSNSATDTSSGGATSGASTSSSAAAAPNPLTEKAAVEYKAYATAQIDELVGAVKVFTDAVRAGDLKAAQEAYAPSRAPWERIEPIAGLVEKIDGKIDARVDDFAGVDDPGFTGWHRLEYLLFEKNTTEGGAPFADQLDADIAELKAQFPAVEVKPVDVATGAAELIEEVSEGKITGEEDRYAKTDLWDFDANVQGARDAIGKLNPALVQADPALLGKIEAGINSVFDTLGPLRRGDGWVLFCTENDPYPSARCPEVTVTPDVIDTLKSELAGLSENLSQVSGVLKLQ >NC_023036.2|WP_023986190.1|4622994_4624266_+|Dyp-type-peroxidase 
MNRPRGISRRGFVAGALGAGAAVGAAGLAGCGQEPAAPPDAARFVEFEGAHQAGITALPIPEQGLIASFNVHAKNRAQLKSTLQELTDEIRGLMAGRPPEQRDPAYPPVDSGILGEHPPPDNLSIVVGVGASLFDGRFGLADRKPRELETMPFLANDRLDPKLSHGDISIIFESGHNDTMQFALRQLMRRTRSDLVLKWMIDGYARGIGAGKAATQDGIQATTPRNLLGFKDGTANLDVSDAAVMDRHVWVGPDDVGPGREPEWTVGGSYQAVRIIRNFVEFWDRTQLVEQEALIGRSKVSGAPLGMAGEFDDPDYADDPDGLRIKLNAHIRLANPRTPQTDENLILRRGFNYSRGFDGAGRLDQGLAFVAYQRSLQKGFLTVQERLKGEPLEEYIMPVGGGFFFVLPGVTGGDRFLGDTLVD >NC_023036.2|WP_019511888.1|4624299_4624839_+|hypothetical-protein MSTPTTHAHGVRAVLVGISAAATTVGAHAAAAGTVPHGAALIAALLVCATSGAAAGSLTVSGRYAGVIVPALALGAAQLLSHLVLTVAGGHHGDMGLTPSMIAAHAVAAVLLGFAIAAVEHLYRVCASVLCWLRLFATAHAPAPAHRARRRTDNVVAQSVLLAPGLGMRAPPRGAVATV >NC_023036.2|WP_019511889.1|4624946_4626416_+|PepSY-domain-containing-protein MTIPDDTVDIDPTDTTPPSTLAHRRSWRPFVVRLHFYAGILIAPFILIAATTGGLYAMAPTIERIIYADILTVTPAGQALPLAEQVAAAQQAFPALTVTGMRPPAAADASTRVEFADPALDPELLRSVFVDPYTGRVLGDEATWLGYLPVSTWLDGFHRHLQLGEPGRVYSELAASWLWVVALGGLALWLTKAAAQRRRGRPGRILRVDRSSSGRARTMNWHGATGVWLLAGLLFLSATGITWSTYAGEHVTELRSAMDWKRPVLDTTLHPAAAVADGHGDHGGHGEHQGHGQHGADAPGAPAGAIDYEAVLQAAATAGVHQPVELALPTEAGNGVRVAELDKPYRLTTNVAAVDPATNTVSSEIDYWRDYSVVAMLADWGIRGHMGLLFGLANQLILLGVAVALVAVIIGGYRMWWQRRPTRGSGWAVGRPPLRGTWKRLPPWAIGTIVVTAVAIGWFLPLLGLSLAGFVLVDALVGAAKARKENADA |
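The neighbor protein records above use headers of the form `>contig|protein_id|start_end_strand|product`. A minimal Python sketch for splitting such a header and placing the protein relative to an array is given below. The names `NeighborProtein`, `parse_neighbor_header`, and `side_of_array` are hypothetical, and the sketch assumes that the protein and array coordinates share one coordinate system, which the page does not state.

```python
from typing import NamedTuple


class NeighborProtein(NamedTuple):
    contig: str
    protein_id: str
    start: int
    end: int
    strand: str
    product: str


def parse_neighbor_header(header: str) -> NeighborProtein:
    """Split '>contig|protein_id|start_end_strand|product' into fields."""
    # maxsplit=3 keeps the product intact even if it contains '|'.
    contig, protein_id, coords, product = header.lstrip(">").split("|", 3)
    start, end, strand = coords.split("_")
    return NeighborProtein(contig, protein_id, int(start), int(end), strand, product)


def side_of_array(protein: NeighborProtein, array_start: int, array_end: int) -> str:
    """Report where the protein lies relative to the array, by coordinate."""
    if protein.end < array_start:
        return "left"
    if protein.start > array_end:
        return "right"
    return "overlapping"


# Header copied from the NC_023036_3 neighborhood; array at 4616428-4616510.
protein = parse_neighbor_header(
    ">NC_023036.2|WP_023986186.1|4615108_4615462_+|"
    "winged-helix-turn-helix-transcriptional-regulator"
)
print(protein.protein_id, protein.strand, side_of_array(protein, 4616428, 4616510))
```

Here "left" and "right" refer only to lower and higher coordinates on the contig, not to transcriptional orientation.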
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_023036_3 | 3.1|4616454|31|NC_023036|CRISPRCasFinder | 4616454-4616484 | 31 | NZ_CP027859 | Streptomyces clavuligerus strain ATCC 27064 plasmid pCLA1, complete sequence | 44037-44067 | 5 | 0.839 |
NC_023036_3 | 3.1|4616454|31|NC_023036|CRISPRCasFinder | 4616454-4616484 | 31 | NZ_LR594676 | Variovorax sp. PBS-H4 plasmid 2 | 7664-7694 | 5 | 0.839 |
NC_023036_3 | 3.1|4616454|31|NC_023036|CRISPRCasFinder | 4616454-4616484 | 31 | NZ_CP032827 | Sphingomonas sp. YZ-8 plasmid unnamed2, complete sequence | 217701-217731 | 7 | 0.774 |
NC_023036_3 | 3.1|4616454|31|NC_023036|CRISPRCasFinder | 4616454-4616484 | 31 | NZ_CP026305 | Streptomyces lunaelactis strain MM109 plasmid pSLUN1, complete sequence | 19046-19076 | 7 | 0.774 |
NC_023036_3 | 3.1|4616454|31|NC_023036|CRISPRCasFinder | 4616454-4616484 | 31 | MT521990 | Microbacterium phage Bri160, complete genome | 13515-13545 | 8 | 0.742 |
NC_023036_3 | 3.1|4616454|31|NC_023036|CRISPRCasFinder | 4616454-4616484 | 31 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2326822-2326852 | 8 | 0.742 |
NC_023036_3 | 3.1|4616454|31|NC_023036|CRISPRCasFinder | 4616454-4616484 | 31 | NC_011368 | Rhizobium leguminosarum bv. trifolii WSM2304 plasmid pRLG201, complete sequence | 1168981-1169011 | 8 | 0.742 |
NC_023036_3 | 3.1|4616454|31|NC_023036|CRISPRCasFinder | 4616454-4616484 | 31 | NC_012811 | Methylorubrum extorquens AM1 megaplasmid, complete sequence | 405481-405511 | 9 | 0.71 |
NC_023036_3 | 3.1|4616454|31|NC_023036|CRISPRCasFinder | 4616454-4616484 | 31 | NC_010510 | Methylobacterium radiotolerans JCM 2831 plasmid pMRAD01, complete sequence | 92504-92534 | 9 | 0.71 |
1. spacer 3.1|4616454|31|NC_023036|CRISPRCasFinder matches NZ_CP027859 (Streptomyces clavuligerus strain ATCC 27064 plasmid pCLA1, complete sequence) position: 44037-44067, mismatch: 5, identity: 0.839
cggtgcccccgagaccgcgccggctcccgcg CRISPR spacer
caccgcccccgcgaccgcggcggctcccgcg Protospacer
*. .******* ******* ***********
2. spacer 3.1|4616454|31|NC_023036|CRISPRCasFinder matches NZ_LR594676 (Variovorax sp. PBS-H4 plasmid 2) position: 7664-7694, mismatch: 5, identity: 0.839
cggtgcccccgagaccgcgccggctcccgcg CRISPR spacer
cgccgaccccgagactgcgccggttcccgcg Protospacer
** .* *********.*******.*******
3. spacer 3.1|4616454|31|NC_023036|CRISPRCasFinder matches NZ_CP032827 (Sphingomonas sp. YZ-8 plasmid unnamed2, complete sequence) position: 217701-217731, mismatch: 7, identity: 0.774
cggtgcccccgagaccgcgccggctcccgcg CRISPR spacer
aggtgctcccgataccgcgccggctagccag Protospacer
 *****.***** ************  *  *
4. spacer 3.1|4616454|31|NC_023036|CRISPRCasFinder matches NZ_CP026305 (Streptomyces lunaelactis strain MM109 plasmid pSLUN1, complete sequence) position: 19046-19076, mismatch: 7, identity: 0.774
cggtgcccccgagaccgcgccggctcccgcg CRISPR spacer
cggtgcctccgagaccgccccggagtacggg Protospacer
*******.********** ****  . ** *
5. spacer 3.1|4616454|31|NC_023036|CRISPRCasFinder matches to MT521990 (Microbacterium phage Bri160, complete genome) position: 13515-13545, mismatch: 8, identity: 0.742
cggtgcccccgagaccgcgccggctcccgcg CRISPR spacer cgagatccccgagaccgagccggcccccgac Protospacer **. ..*********** ******.****
6. spacer 3.1|4616454|31|NC_023036|CRISPRCasFinder matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: 2326822-2326852, mismatch: 8, identity: 0.742
cggtgcccccgagaccgcgccggctcccgcg CRISPR spacer gcacgcagccgagcccgcgctggctcccgcg Protospacer ..** ***** ******.**********
7. spacer 3.1|4616454|31|NC_023036|CRISPRCasFinder matches to NC_011368 (Rhizobium leguminosarum bv. trifolii WSM2304 plasmid pRLG201, complete sequence) position: 1168981-1169011, mismatch: 8, identity: 0.742
cggtgcccccgagaccgcgccggctcccgcg CRISPR spacer atgtttggccgcgaccgcgccggctgccgcg Protospacer ** . *** ************* *****
8. spacer 3.1|4616454|31|NC_023036|CRISPRCasFinder matches to NC_012811 (Methylorubrum extorquens AM1 megaplasmid, complete sequence) position: 405481-405511, mismatch: 9, identity: 0.71
cggtgcccccgagaccgcgccggctcccgcg CRISPR spacer accggctcccgagaccgcgccggcgccagga Protospacer **.***************** ** * .
9. spacer 3.1|4616454|31|NC_023036|CRISPRCasFinder matches to NC_010510 (Methylobacterium radiotolerans JCM 2831 plasmid pMRAD01, complete sequence) position: 92504-92534, mismatch: 9, identity: 0.71
cggtgcccccgagaccgcgccggctcccgcg CRISPR spacer ctcgaacccccagaccgcgtcggctcccggc Protospacer * . **** ********.*********
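The mismatch counts, identity values and match lines listed above can be reproduced from each spacer/protospacer pair. The sketch below is illustrative only: the function name `compare_spacer` is hypothetical, and the symbol convention (`*` for an identical base, `.` for a transition A/G or C/T, a space for a transversion) is an assumption inferred from the alignments on this page, not taken from the tool's documentation. Identity is assumed to be the fraction of matching positions, rounded to three decimals.

```python
# Assumed convention, inferred from the alignments listed on this page:
#   '*' identical base, '.' transition (a<->g or c<->t), ' ' transversion.
TRANSITIONS = {frozenset("ag"), frozenset("ct")}


def compare_spacer(spacer: str, protospacer: str):
    """Return (match_line, mismatches, identity) for two equal-length sequences."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length (ungapped comparison)")
    marks = []
    mismatches = 0
    for s, p in zip(spacer.lower(), protospacer.lower()):
        if s == p:
            marks.append("*")
        else:
            mismatches += 1
            marks.append("." if frozenset((s, p)) in TRANSITIONS else " ")
    # Identity assumed to be matching positions divided by spacer length.
    identity = round((len(spacer) - mismatches) / len(spacer), 3)
    return "".join(marks), mismatches, identity


if __name__ == "__main__":
    # Hit 1 on this page: spacer 3.1 against NZ_CP027859.
    line, mm, ident = compare_spacer(
        "cggtgcccccgagaccgcgccggctcccgcg",
        "caccgcccccgcgaccgcggcggctcccgcg",
    )
    print(repr(line), mm, ident)
```

For a 31 nt spacer this gives 26/31 = 0.839 for 5 mismatches and 22/31 = 0.71 for 9 mismatches, in line with the values reported in the hit table. Leading and repeated spaces in the match lines shown on this page appear to have been collapsed during extraction, so a recomputed match line may be longer than the one displayed.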
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
2150586 : 2157289
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_023036|2150586:2157289|DBSCAN-SWA GTCAGCGCTTACCCCAGTCCCACGGCGGAGCGGTCAGCATGGTCTGGCCCTCGATGAGGGTTTCACCCCACTCCTTGACCAGTTCCACGGTCAGCGCGCCGGCCGCATCGAGCCGTTCCCGCATGACGGTCGAATGGGTGACGACGATGACCTGGGTTGCCGCAGCGGCGTTGCCGATCATGCCGGCCAGAGCGGGGATCAGGTCCGGGTGCAGCGAGGTTTCGGGCTCGTTGAGCACCATCAGCGACGGCGGTCGGGGACTGAGCAGCGCCGCGGCCCACAACAGGAAACGCAATGTGCCGTCGGAGAGCTCCGCGGCCCGTAGTGGGCGCAGCATTCCGCGCTGGCGCAGCTGCAACTCGAAAAGTCCGTCGTTGGCGACCACCGATACGCCTGCGCCGTCGAAGGCATCGGATATGGCGCCCTGCAGCTCGCCGTCGCCGAGCTCGATGATGGTCTGTAGCGCGGCGGCCAGGTCGGCGCCGTCGTCGGCGAGCACCGGCGTGCGGGTGCCCACCCTGGGCTGGCGGGCCGGTGCACCGGCGTCGGAGCGGAATCCGTCATAGAAGCGCCAGTCGCGCAGGCGGTTGCGGACGGCGTTGATCTCGGGGAGCGCCCCGGCGTACTCCGCGAGGACGCTGCGGTACAACGGCAATGCGCGGGACAGATCGGTGAACCCGCGCCCGGTCTCGTCGGCAACGTCGGCGTGCGGGCCGTTGCGGCGGACCAGCGCCGAGGACGGCCGCAGTCGCGGCCCGGCGAAGATGGCCTCGCGTTTGATCTCCGGATCGCGGCCGAAAAGCGAGGCCGGCCCGGCGTGCTGGGGCATCCCGAGATCGACCAGATAGCCGAATTCGTCTGAGGCGAAGCCGAGTTCCAGTGATACCGGGCGGGTGCGCACGGTGCCCTGGGTGCTTCCGGTCCGACGGGCTCCCGAGGTCTGCTCGGGTCCGGCCCACAGCGCTGATTGCAGGCCTCCTTCGCGGGCCAACGAGGCGATGACCTCCCCGCGGCCGCAATCCGCGAGCAGCATGAGGGCGCGGTACAGCGAGGACTTGCCGGTGCCGTTGGCGCCGGTGATCACGGTCAGCTGCTCCAGCGGCAACAGGACTTCGCGCAGTGATCGGTAGCCGCGGACCGCGATCGTGTTCAGCATGGCATCTCCAGTTCCAATCGGACACCGAGCAATCGGATGGGACGGTCCAGATCGAATTCGTCGAGCAGCGCCAATGCCACACGCTGGATCTCCGCAGGGTCGTTGCCCGGTGCGGCGAGCTTGCGGATCTTGGTGCGGGTGAAAAAGGTGGCGGTGCGCACGGTGACAGCGACCCGGGTGACGGTGCGGCCGTCCGCGCTGACCTCCGTCAGCGTCGTGGACGCCAGATCGACGACGGCCGAGTCCATATCGGCTCGATCGGTCAGATCGTGCGCGAACGTGACGGCGTGGCTGCGGGATCGTGGCACCCATGCCTGCGCGTTGATGTCGGTGTCACCACCGCCCTTGGCCAGCAGGAGAAGTTGCAGGCCGGTGCTGGGACCGAACGTCGAGGTGAGCGTCGTGGCGTCGGTTTCGGACAGCTCTGCGACGGTCGATATGCCCATGGCGGCAAGCTTTTTGGTGGTCTTGGGTCCGACGCCCCACAGTGCGTCGACGGGTCGGTCACCCATCTGCGCCATCCAATTCGACGCGGTGAGTGCATATACGCCATCGGGTTTGGCGAAACCGGTGGCGACTTTCGCCCGCTGCTTGTTGTCGCTGATCCCCACCGAACACCTCAGCCCGGTCTCGGCGGCGATGACGGTCCGGATGCGTTCGGCGAGCTCGAACGGATCGGCCACATCGGCGCCGAGGTAGGCCTCGTCCCAACCCCAGACCTCCAATGGATGGCCGAGATCGCGCAGCAGGCCCATCACCTGCTCGGAGG
CGGCATCGTAGGCCTGCGGATCCGCGGGTAGGAACGTCGCGTCGGGGCACTTGCGGGCCGCCGCCCGCAACGGCATGCCGGCGTGTACGCCGAACTCCCGCGCCTCGTAGGAGGCGCAGGTCACCACCTTTCGGGGTTCATGCGGATCACCGCTTCCACCGACGATCACCGGCGTCCCGACGAGTTCGGGGTGGCGACGCAGTTCCACCGAGGCCAGGAACTGGTCGAGGTCTATATGAAGGATCCAGCTCAGGTCCGTCGTCACGTGCCACAGAGTTTGCGGGCGACATCGTCCCAGCCGGCGGCGAGTTCCTCGAGCGTCTTGCCGCGGTCTGTCATCTCGTGGGCCACACGGTCGGCATCGAGCAGAGCGAACAGTGCATCGGTCTGGGCGTCGAGATCGCCTGTCGTCCCTGCGGATTCGAGCAGCATTCGGACATGGCGGCGGTGCAGGGTTGCCGGGGAGTTGTAGCGGGTATGCGGATCGCGTGCGGCATCGGAAAGCAATGCTCGGTGGGTGTGGACGAAGTGCAGTCGGCTGTATCCGTAGGCGAGCAGTCTCTCCAGTGGCGGGGCGCCGGGTCCCAGCGGGGGAGGGCCGAACATGAATGCCTGTTGCTCGGCGATCTCATCCTCGTCGAGCAGCACGATCATCAGACCGGACCTACTGCCGAAGCGACGGAACAGGGTGCCCTTGCCGACCCCGGCGGCCGCGGCGACATCGTCCATCGAGACGGCGTCGGCTCCGCGCTCGGCGATCAGGCGGCGCGCGGCATCGAGGAGCAGGGCGCGGTTGCGCGCCGCATCGCCGCGCTCGGCCGGGACGGCGGACGAGAGCGGCAGCACGGTGAGAGGTTCGGGTGCGCCCACACCTGCACTTTAACTCAGCCGGAATTATTCGGACCGTGGTCCGGTTATCCTGTGCAAGGATCACCCGACGACGGAAGGAACCTCGAACATGACCAACGTGCTGGTACTCATCGGAAGCCTGCGGAAGGCATCGATCAATCGGCAGCTCGCCGAGCTCGCCGTCGAATCCGCGCCCGAGGGCGTGACACTGCAGCTGTTCGACCGACTGGGTGAGCTGCCGTTCTACGACGAAGACATCGACAACGACGACGTGGCCGAGCCGGTGCGGGCGCTTCGCGAGGCCGCCGCTGCCGCGGACGCCGCGCTGGTGGTCACCCCGGAATACAACGGGTCGATCCCCGGTGTGCTCAAGAACGCCATCGACTGGTTGTCGCGTCCGTATGGCAGTGGCGCTCTGCAGGGCAAGCCGTTCGCCGTGATCGGCACCGCACTGGGTCAGTATGGCGGCGTCTGGGCACACGACGAGACCCGCAAGTCGTTGGCGATCGCCGGGCCGCGGGCGGTCGAAGACCTCAAGCTGTCGATCCCGTCGGCAACCCTCGACGGTAAGCATCCGCGCGAGCATGCCGAGGTCGCCGGTCAGGTACGCGAGGTGGTCGGCAAACTGGTCGCCGAGGTCGGCTGAACCGATTTGACAAGAACCGCCCGGGAGCGTTGCTCCCGGGCGGTTCTGTCGTTCCGGCGACCGCGTGGCAGGGGCATGGTTGTCGGACCCCGCCGATAGAGTGCGACGCACCCCCGGATGTGATCTGCGACACGCCGTCCGATGACACGCCCAGGGGCTTACGACCTGGGAATTTCACGAATGTTGTTCAGAACATCTTGTATCTCTTCGGTTTGTCGGACACTAGGTGTAGTGTCTGACGGACCGCGAGGCCCCAACGCCACAGGGGCCCGGCGGAGCGAGATCCACCCGGTACCGGCCACGCCGGACACCACGGATCCAGCTCCGGGGGACCGGAATTTCCGGGCCCTCCGGACCGCTGAATTGAAGTGAATTGCATAACCAGAGCAGTTGCCCGTCCCAGTCATGCACGAGCGTAGGAGCCGCTAGATGAACGTCACCGTGTACACCAAGCCCGCTTGCGTGCAGTGCAATGCCACCTACAAGGCGTTGGACAAGGCGGGTG
TCGCGTACGACGTCGTCGACATCACCCTCGACAACGAGGCCCGCGACTACGTGATGGCGCTGGGCTACCTGCAGGCACCCGTGGTCGTCGCGGGCAACGAGCACTGGTCCGGATTCCGTCCCGACCGCATCAAGGCGTTGGTCGGAGCATCCGCCGTCACTGCCTAGACCATCCAGCCGACCGAGAGGAGTGCAGACGTGAGCAACCTCGTCTACTTCTCCAGCGTCTCGGAGAACACCCATCGTTTCGTGCAGAAGCTCGAACTGCCTGCCATCCGGATTCCACTCAAGGAACGGATCCAGGTCGACGAGCCCTACGTACTGGTCCTGCCCACCTACGGCGGCGGACACGCCAACGGCCCGGATCCTGACCGCGGGGGCTATGTCCCCAAGCAGGTGATCGCCTTCCTCAACAATGAACACAACCGGTCGTTGATCCGCGGCGTCATCGCCGCGGGCAACACCAACTTCGGCGCCGAATTCGGCTACGCGGGGGTCGTCGTGTCTCGTAAGTGCGGCGTTCCATTTCTCTATCGCTTCGAACTCATGGGAACGACGGACGACGTCTTCGCCGTCCGCGCAGGATTACAAGACTTCTGGAAGGACCAGACGTGCCACCAACCGTCACAGCTGCAGAACCTGTAACCACCGGCGCTCACGCGCTCCCGGGGGAGACGGACTACCACGCGCTCAACGCGATGCTGAATCTGTATGACGCCGACGGCAAGATTCAGTTCGACAAGGATGTCCAGGCCGCGCGGGAGTATTTCCTGCAGCACGTCAACCAGAACACGGTGTTCTTCCACAGCCAGGACGAGAAACTCGATTACCTGATCGAGAAGGAGTACTACGAGCGCGAGGTGCTCGACCAGTACAGCCGCAATTTCGTCAAGAGTCTGCTGGATCGGGCATACGCCAAGAAGTTCCGCTTCCCGACGTTCCTGGGTGCCTTCAAGTACTACACCTCCTACACCCTGAAGACATTCGACGGTAAGCGTTACCTGGAGCGCTTCGAGGACCGCGTGGTGATGGTCGCGCTGACGCTGGCCGCCGGTGACACCACGCTGGCCGAAAAGCTGGTCGACGAGATCATCGACGGCCGCTTCCAGCCGGCCACCCCGACATTCCTGAACTCGGGCAAGAAGCAACGCGGCGAGCCCGTCTCCTGCTTCCTGCTGCGCATCGAGGACAACATGGAGTCCATCGGGCGCTCCATCAACTCGGCGTTGCAGTTGTCCAAGCGTGGCGGCGGAGTCGCGCTGCTGCTGAGCAACATTCGCGAGCACGGTGCGCCGATCAAGAACATCGAGAACCAGTCCTCGGGCGTCATCCCGATCATGAAGCTGCTGGAGGACTCGTTTTCCTATGCCAATCAGCTCGGCGCACGGCAGGGCGCCGGCGCGGTGTACCTGCACGCGCACCACCCCGACATCTACCGGTTCCTCGACACGAAACGAGAGAACGCCGACGAGAAGATCCGGATCAAGACGCTCTCGCTGGGCGTGGTGATCCCGGACATCACCTTCGAGTTGGCGAAGAAGAACGAGGACATGTACCTGTTCTCGCCGTACGACGTCGAGAAGGTGTATGGCGTTGCCTTCGCGGACATCTCGGTGACCGAGAAGTACCACGAAATGGTCAATGACGGTCGGATCCGCAAGACCAAGATCAAGGCGCGCGAGTTCTTCCAGACACTGGCCGAACTGCAATTCGAATCCGGCTACCCGTACATCATGTACGAGGACACGGTCAATCGTGCCAACCCGGTGGAAGGCAAGATCACCCACTCCAACCTGTGCTCGGAGATCCTGCAGGTTTCGACGCCGTCGCTGTTCAACGAGGACCTGTCCTACGCCAAGGTGGGCAAGGACATCTCGTGCAACCTCGGTTCGCTCAACATCGCCAAGGCGATGGACTCACCGGATTTCGCCCAGACCATCGAGGTGTCCATCCGCGCGCTGACCGCGGTCAGCGACCAGACCCACATCTGGTCGGTGCCCT
CGATCGAGCAGGGCAACAACGAGTCGCATGCCATCGGCCTCGGTCAGATGAACCTGCACGGATACCTGGCGCGCGAGCGGATCATGTACGGCTCCGAAGAAGGTGTGGACTTCACCAACATCTACTTCTACACCGTGCTCTACCACGCGATTCGTGCCTCCAACCGGCTCGCGATCGAACGCGGCCGGGCCTTCGGTGGATTCGAGAGATCCAAGTACAAGTCGGGGGAGTTCTTCGACAAGTACACCGATCAGGTCTGGGAGCCCGCCACCGACAAGGTGCGCACGCTCTTCGCCGATGCCGGTATCCGCATCCCGACCCAGGATGACTGGAAGCGGCTGAAGGAGTCGGTGCAGACCCACGGCATCTACAACCAGAACCTGCAGGCGGTCCCGCCGACGGGCTCGATCAGCTACATCAACCATTCGACATCGTCGATCCACCCGGTCGCGTCGAAGATCGAGATCCGCAAGGAAGGCAAGATCGGTCGCGTCTATTACCCGGCGCCGTACCTGACCAATGACAACCTGGAGTACTACCAGGATGCCTATGAGATCGGGTACGAGAAGATCATCGACACCTACGCGGCGGCGACCCAGCATGTGGATCAAGGGCTTTCGCTGACGCTGTTCTTCAAGGACACCGCGACCACCCGCGATGTCAACAAGGCGCAGATCTACGCGTGGCGTAAGGGGATCAAGACGCTGTACTACATCCGACTCCGCCAGATGGCCCTGGAGGGCACCGAGGTGGAGGGCTGCGTGTCCTGCATGCTGTGA
Protein sequences of DBSCAN-SWA_1 >NC_023036|2150586:2157289|2154434_2154677_+|WP_019514554.1|DBSCAN-SWA MNVTVYTKPACVQCNATYKALDKAGVAYDVVDITLDNEARDYVMALGYLQAPVVVAGNEHWSGFRPDRIKALVGASAVTA >NC_023036|2150586:2157289|2155120_2157289_+|WP_031601417.1|DBSCAN-SWA MPPTVTAAEPVTTGAHALPGETDYHALNAMLNLYDADGKIQFDKDVQAAREYFLQHVNQNTVFFHSQDEKLDYLIEKEYYEREVLDQYSRNFVKSLLDRAYAKKFRFPTFLGAFKYYTSYTLKTFDGKRYLERFEDRVVMVALTLAAGDTTLAEKLVDEIIDGRFQPATPTFLNSGKKQRGEPVSCFLLRIEDNMESIGRSINSALQLSKRGGGVALLLSNIREHGAPIKNIENQSSGVIPIMKLLEDSFSYANQLGARQGAGAVYLHAHHPDIYRFLDTKRENADEKIRIKTLSLGVVIPDITFELAKKNEDMYLFSPYDVEKVYGVAFADISVTEKYHEMVNDGRIRKTKIKAREFFQTLAELQFESGYPYIMYEDTVNRANPVEGKITHSNLCSEILQVSTPSLFNEDLSYAKVGKDISCNLGSLNIAKAMDSPDFAQTIEVSIRALTAVSDQTHIWSVPSIEQGNNESHAIGLGQMNLHGYLARERIMYGSEEGVDFTNIYFYTVLYHAIRASNRLAIERGRAFGGFERSKYKSGEFFDKYTDQVWEPATDKVRTLFADAGIRIPTQDDWKRLKESVQTHGIYNQNLQAVPPTGSISYINHSTSSIHPVASKIEIRKEGKIGRVYYPAPYLTNDNLEYYQDAYEIGYEKIIDTYAAATQHVDQGLSLTLFFKDTATTRDVNKAQIYAWRKGIKTLYYIRLRQMALEGTEVEGCVSCML >NC_023036|2150586:2157289|2152771_2153380_-|WP_019514552.1|DBSCAN-SWA MGAPEPLTVLPLSSAVPAERGDAARNRALLLDAARRLIAERGADAVSMDDVAAAAGVGKGTLFRRFGSRSGLMIVLLDEDEIAEQQAFMFGPPPLGPGAPPLERLLAYGYSRLHFVHTHRALLSDAARDPHTRYNSPATLHRRHVRMLLESAGTTGDLDAQTDALFALLDADRVAHEMTDRGKTLEELAAGWDDVARKLCGT >NC_023036|2150586:2157289|2150586_2151741_-|WP_019514550.1|DBSCAN-SWA MLNTIAVRGYRSLREVLLPLEQLTVITGANGTGKSSLYRALMLLADCGRGEVIASLAREGGLQSALWAGPEQTSGARRTGSTQGTVRTRPVSLELGFASDEFGYLVDLGMPQHAGPASLFGRDPEIKREAIFAGPRLRPSSALVRRNGPHADVADETGRGFTDLSRALPLYRSVLAEYAGALPEINAVRNRLRDWRFYDGFRSDAGAPARQPRVGTRTPVLADDGADLAAALQTIIELGDGELQGAISDAFDGAGVSVVANDGLFELQLRQRGMLRPLRAAELSDGTLRFLLWAAALLSPRPPSLMVLNEPETSLHPDLIPALAGMIGNAAAATQVIVVTHSTVMRERLDAAGALTVELVKEWGETLIEGQTMLTAPPWDWGKR >NC_023036|2150586:2157289|2151734_2152763_-|WP_031601416.1|DBSCAN-SWA 
MSWILHIDLDQFLASVELRRHPELVGTPVIVGGSGDPHEPRKVVTCASYEAREFGVHAGMPLRAAARKCPDATFLPADPQAYDAASEQVMGLLRDLGHPLEVWGWDEAYLGADVADPFELAERIRTVIAAETGLRCSVGISDNKQRAKVATGFAKPDGVYALTASNWMAQMGDRPVDALWGVGPKTTKKLAAMGISTVAELSETDATTLTSTFGPSTGLQLLLLAKGGGDTDINAQAWVPRSRSHAVTFAHDLTDRADMDSAVVDLASTTLTEVSADGRTVTRVAVTVRTATFFTRTKIRKLAAPGNDPAEIQRVALALLDEFDLDRPIRLLGVRLELEMPC >NC_023036|2150586:2157289|2154707_2155154_+|WP_019514555.1|DBSCAN-SWA MSNLVYFSSVSENTHRFVQKLELPAIRIPLKERIQVDEPYVLVLPTYGGGHANGPDPDRGGYVPKQVIAFLNNEHNRSLIRGVIAAGNTNFGAEFGYAGVVVSRKCGVPFLYRFELMGTTDDVFAVRAGLQDFWKDQTCHQPSQLQNL >NC_023036|2150586:2157289|2153468_2154005_+|WP_019514553.1|DBSCAN-SWA MTNVLVLIGSLRKASINRQLAELAVESAPEGVTLQLFDRLGELPFYDEDIDNDDVAEPVRALREAAAAADAALVVTPEYNGSIPGVLKNAIDWLSRPYGSGALQGKPFAVIGTALGQYGGVWAHDETRKSLAIAGPRAVEDLKLSIPSATLDGKHPREHAEVAGQVREVVGKLVAEVG |
7 | Escherichia_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
4030864 : 4081277
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_023036|4030864:4081277|DBSCAN-SWA CTCATTTCAGGTCGGCGAGTCGCGCGGTGATCCGTTCGACCTCCTCGGCCGCCAACTGTTGCCGACCCTTGATCTTCTCGACGACGTTCTCGGGTGCCTTGGCCAGGAAGTCGGCGTTGCCGAGCTTGCCCGTTGTCTGCGCGAGTTCCTTCTGCGCGGCGGCAAGATCCTTTTCCAGACGGCGCTTCTCGGCGGCCACGTCCACGGTGCCCGAGGTGTCCACCTCGACGGTCACCGTGCCATTGGTCAACCGCACCTCGACCGCTGCCGAGGCGGCGAAATCCTCGGTGGGGTCGGTGAGCCACGCCAGCGCGGTGACCGCGGGCAGCTGCGACTCGATATCGCTGGCATCCAGACCGGTCAACCGGGCCGGCACCTTCTGCCGATCGGCCAGGCCCTGATCGCTGCGGAACCGGCGCACCTCGGTAACCAGCTTCTGCATATCTGTGATGCGTCGGGTCGCGGTGGCGTCCACGCCGTGCGCGGTGACGGTCGGCCAGTCGGCGATGACGACCGACTCCCCGCCGGTGAGGGCCTTCCACAGCGTCTCGGTGACGAACGGCATGACCGGGTGCAGCAGCTTCAGCAGGGTGTCGAGCACCGCGGCCAGAACCGCGCCGGTGTGCGAATGACCCTGTGCTACCTGGACCTTGGCCAGTTCCAGGTACCAGTCGCAGAACTCGTCCCAGGCGAAGTGATACAGCGCCTCGCAGGCCCGGCTGAACTCGTATGCCTCCAGTGCGGCATCGGCCTCGGCACGCACCTCCTCGAGGCGCCCGAGGATCCAGCGATCGGCATCGGTGAGCTGATCGGCGCCGGGTAGATCGGTCAGCTCGGCGCCGTTCATCAGGGCGAACTTGGTGGCGTTGAACAGTTTGGTGGCGAAATTGCGGGAGGCGCGAGCGTGGTCCTCACCGATGGACAGGTCGCCGCCGGGGCTCGCGCCGCGGGCCAGGGTGAACCGCAGCGCATCGGCACCGAACTTCTCCACCCAGTCCAGCGGGTCGATGCCGTTGCCCTTGGACTTGCTCATCTTGCGGCCGAACTCATCGCGGATCAGGCCGTGCAGGAACACATTCTCGAACGGGACCTGCGGCCGGCCGGTGGCCGGGGCCGGCACGGCGTCATCTCCGGAGACGAAGGTGCCGAACATCATCATCCGCGCCACCCAGAAGAACAGGATGTCGTAGCCGGTCACCAGAACCGTTGTGGGATAGAACTTCTGCAGCTCGGGAGTAGCGTCCGGCCAGCCCATCGTGGAGAACGGCCACAACGCCGAGGAGAACCAGGTGTCCAGGACATCGGGATCCTGTTCCCACCCCGGCGGTGGGGTCTCGTCGGGGCCGAGGCAGACCTGCTGACCGTCGGGCCCGTGCCAGATCGGGATGCGGTGACCCCACCACAGCTGCCGTGAGATGCACCAGTCGTGCATGTTGTCGACCCAGGCGAACCAGCGAGGCTCAAGGCTGGCGGGATGGATCACGGTGTCGCCCTTGCGGACCGCGTCACCGGCGGCCTTGGCCAGGGCATCGACCTTGACCCACCACTGCAGGCTCAGCCGGGGTTCGATGGGTTCGCCGCTGCGCTCGGAGTGGCCGACGCTGTGCAGATACGGTCGCTTCTCGGCGACGATGCGCCCCTCGGCGGCCAATGCCTCGCGCACCGCCACCCGCGCTGCGAAGCGGTCCATACCGTCGAAGCGCGTCCCGGTCCCGGTGATGGCGCCGGTGGTGTCCAGGATGGACGGCATCGGCAGGTTGTGCCGCAACCCGATCTCGAAGTCGTTCGGGTCGTGGGCCGGGGTGACTTTCACTGCGCCGGTGCCGAACTCGGGGTCGACGTGAGCGTCGGCCACGATGACGATTTCGCGGTCCAGATATGGGTGGGGCAGCGTGGTGCCCACCAGCGCGCGATAGCGCTCGTCG
TCGGGATGCACCGCCACCGCGGTGTCGCCGAGCATCGTCTCCAACCGGGTGGTGGCCACCACGATGTGCGGCTCGTCATCGTTCATCGAGCCGTACCGGAACGAGACCAGCTCGCCCTCGACGTCCTCGTACTTGACCTCGAGATCGGAGATCGCGGTCTGCAACACCGGCGACCAGTTGACCAGCCGTTCGGCCTGATAGATCAGCCCCGCGTCGTAGAGCCGTTTGAAGATGGTGCGCACGGCCCGGGAGAGACCCTCGTCCATGGTGAACCGGTCGCGGCTCCAGTCGACACCATCGCCGAGGCGGCGCATCTGGCCGCCGATGGTGCCGCCGGACTCACGCTTCCAGTCCCAGACCCTCTCGATGAAGAGCTCGCGTCCGAGGTCCTCCTTGGTCTTGCCGTCGGCGGCGAGCTGCTTCTCCACCACCGACTGGGTGGCGATGCCGGCATGGTCCATGCCCGGCAGCCAGAGCACCTCGTAGCCCTGCATGCGGCGACGGCGGGTCAGCGCATCCATCAGGGTGTGGTCCAGGGCGTGTCCCATGTGCAGGCTGCCGGTCACATTCGGCGGCGGCAACACGATCGAATACGGCGGTTTGTCGCTGCCGGGATCGGCGGTGAAATACCCGGCGTCGACCCAACCCTGATACAGGTCGCTTTCTACCGCACCCGGCTCCCAGGATTTGGGCAGGGCATCGGCACGCGAAGAGGGGCTGGCAGTCACCTGGCCATTCTAGGAAGGCCGGTCGGGCAGATGCCGAGGGCCCACCGTTGGCCCGGTCCGGGAGCGGCTGCTCAGTCGAGTCGCGCCGTCTCCAGGGTGACCCCGGCCGCCGGTAGGCGAGTGAGCAGTGCCGAACCCGCCGCGGCGGCCGGGGTCAGCACCCCGCGCAGATCCGAGAGCTGGTCGCGGTCCAGCGCCAAGGCGAGCCCGCATTCGGCGAGCAGCACGGCGGTGGCCTTGTACCCCGGGTCGCCCTGCTGGGCGATGGTCGCGCGGTACCGGGCTCCGGTGCTGGTCGTGGTGTAGGTCTCGATGCGGTAGTGACCGTTGTCACGGGTGCGCTCGCTGGGTCCGCTGCCCGGCTTGGGCAGGATCCGCTCCAGCAGTCCCGCGGGCACCTTGTTGATATAGCGGCTGCCGAGGCTGAAGGTCGCCACACTTGCGGCGGTCTCCAGCGCCGCGGCGGCCGGTGCCAATACCGAGGAGCCCAGGCTCATCTGCTCGGCGTAGCGGAAGTGACGCCCGTATGCCCAGTCGAGCAGCGCGTTGCTGCGACGCACGATGCGCGAGTTCGGTGCGGCCATCGCGAAGGCACCGGTCCAGATGCCGTCCAGCTCGGGTGCGATATCGCGGCCGCGCCGCCACGGGTTGTCGGATTGATGGCCCAGTTCCGGTTCCGCGCCGCGGTCGGTGGTCAGCGTGTACGGATCTTCCAGGTCACGGCGGGCCTGCGGGTCTTCGGCCGCGGCCCGCATGAACTCGATCATCGAGGCGGCAGTACCGCCGGAGACCCCGCCGGCGAAGGTGCGCAACACCAGATTGGTCTCGGTCAGCTCGCCTGAGCCGTCGGCGGTCACCCGGTCGTACAACGCGTACACGCTGATATCCGATGGCACCGAGTCGAATCCGCAGGAATGCACGATCCGCGCGCCGGTGTCGACGGCCTGCTTGTGGAACTGTTCGGCGCTGTCCCTGATGAACAGCGTCTCACCGGTGAGGTCGGCGTAATCGGTGCCCGCTGCGGCGCACGCCTGCACCAGCGGGAGCCCGTACTTGGTGTAGGGGCCGACGGTGGTGATGACCACCTGGGTGCGGGCGGCCATATCGGCCAGTGTCGAGGGCGAGCCGGCGTCGGCCTCGACGAGCTGCCAGTCCTGGGCCTTGGGGCCCAGGCTGTCGCGCACGTCGCGCACCTTGGCCAGCGAGCGTCCGGCCAGCGCGATCCGGGCGCCCGAGGCGGCCAGCGCCAGGTACTGCGCGGTGAGCTTGCCC
GCGAAGCCGGTGGCGCCATACAGGACGATGTCGAATTCGCGCTGCGGCGAGGTCGGAGAGGTCGGCGAGGTCATGGAACCCGAAGCTACCCGAGGATCTCGGGTACAAACATGTCCCGCCCGAACAGGTGGTTGCCCAGGAAGGCCGAGATCACCTGGTACCAGATCTTGGTGTGCTGCGGAGCGGCCACCCAGTGCCCCTCGGACGGGAAGTACAGGAATTGGTGGGGGCTGGTGCCGTCCTCGTCGGCCGGAAGACCGGACCGGCTGAGCAGCTCGTACCACAGTCGCAGACCTTCCCCGATCGGGACGCGGTAATCCTTGTCACCGTGGATGACCAACATCGGGGTGCGGATCTCACCGACGTGGTGGTGCGGCGAATTCTGTTGTGTCATATGTTCGGTCATCTCGCGCTCCCACCAATACGCGCCATCGGTGGTAGGTCCGAACTGATCCAGCGCCCACAGGCTGGCATGACTGACGATCGCGGTGAACCGGTCGGTGTGCCCGGCGATCCAGTTGGCCATGTACCCGCCGAACGAGCCGCCCATCGCGGCGGTGCGCTGCCGGTCGATGCGCGGATGCTCGCAAGCGGCATCGGTGGCGGCCAGCAGATCGGTGTAGGGCGGCCCACCCCAGGCGCCCCACCCGCGCTGGATGAAATCCTGGCCATAGCCGGTGGACAGCGCCGGGTCCGGCAGCAGCACGGCATACCCCTCGGCAGCCAAGAGCCATGGGTTCCAGCGCCAGTGCCAGGTGTTCCAGCTGCCGAGGGGGCCACCGTGCACCCACAGCAGTAACGGCGCCGGCTCGGTTCCGCCGGGCAACACCAGCCAGGAGTGCACGGGGGTGCCGTCCGGTGCCCGAGCGGTGAGTTCCTCCACGGTGCCCGGCAGTTTCGGCAGCTCGACGCACGGCAGGATCGTGATGGTTCCGTCGGGGTCGATGCGCACCGGGTGCGGTGGCGCGAGGTAGGAGCTGCGCAGCGCGAAGATCACCCCACCCGGTGCGGTGCACACATCGCTGTAGGCCTGATCGTCCTCGGTCAACCGGGCCACCGTGCGGGTCGCCGGGTCCACCGCGAAGATGGGATGCCTGCCATTGTCGTCCGCGGTCACGATGAGCCTGCTCCCGTCGGCGCTCCATGTCACCGAGGCCGGCCACCGGTCCCATTCGGTCAGCTCGGTCCAGGCCTGGCCGATTTCGCCCCAGCACAACGTGATCCGTGGCGCCTGCTCGGGTGTCGAGGTGGTCTCTCGCAGAAAGGCGATGCGCCGACCGTCCGGGGCCAGTGCGGGCGCGTCGAGGTCTGCGCCGGGATCGTCGGCGATCACGGTGCGCTGCCGGGTCGCGGTGTCGATCCGCACGATCTGCGAGCGCAGCGCCACCCCGGCCACCGGCCCGTACCAGCTGCTGACCAGGAAGGTGCCGTCGCCGCTGATATCGAGCGCGGTGTCCCGCAGTGCCGCACCGGGCTCGGGTGTGAGATCGCGCCGTCCGTCGGTGTCGAACAGGTGCGGCAGTTCGGGGCCCAGATCGTGGTCCCAATGCCGCACCGGGTAGCCGCGGTGCAGAATCGCCGAGACCTTGTTGTCCTTGCGCAACGCCCGCAATCGCTCGTCCTCGTCGACCCCGGCGGCCGAGCGCAGCAGCGAGGTGGTGACCACCGTTCGCGGCGCGTCACCGGCGACATGCACCGCCGACACCCCACCGGGCAGCGACATCTCCTCGACGGCCTCGCCGCCCTCTTTGGGCAGTCGCCAGAGCGCTGCCGGCGCCGACTCCTGCCCCGATGTGGGCCGGCTGGCAAGAAATAACAGGTCACCGTCGGAGGTGAAGACCGGTGCCGACTCGCCCTTCCCGCCCCGGGTGAGCCGGCGGGCGGAGCCGATCCCGGCGGGGTCCAATTCCCAGATCGCCGTGCGATATTCGGTGCCCTCGGAGCTGAGCTCGCTGACGGTGGTGACCAGCCGGTCACCGGTCGGGCTGACGGCCAGCTCC
GCAACCCTTGGCAGGGCGAGATAGCTGTCGAGATCGTCGAACGGTGTCATGGTGCCACCGTAGTGGGCGGGCCGAGGCGGTAGCGGCGGCCGCTTCCCGCAACCATGCCGCACCGGCCGCGATCACTGCCCGACGGTCAAGCCCGTCGCGGTCGTCGACAGCGAGAACGGATTCACCGTCCAACTGGATGTCGTCGACGCTCGCTGGTGGGCCCTGTTCGTCCGCCAAGGACTGTCCGCCGGCTTCAGTTGGCGTGTCACCGAGCGCCGCTCGTCGTTCACGATCTACGACCGCAAGATCCGGTTGGATTGGGTTGCCGGCGTGCCGCGACTGGCGGGCACGATGGAGTATCAGGGCGGCCGCATACTGGGGTTCGCCAGGCAAAGGATCTGGGCACTGTCCGACCGAGGTGTGATCGAGCCGGTGGTCGACTACAGGCCCAATGCCAGGGAGGGCCGCGACCTGATCCGTCTGGTGGCCACCGAACTCGTTCTGCGCGAACGACTTCCGATGACGCTGAAACCCACGCCAGTGTGCGTGCTCATCACCCCCCGCCATCTTCGCCGGTTGGGGCATCGTCACCGTGGTGCAGCGGCTGCTCGGCATCCATATCTAGTCATCCAGTCCGGAAGGGATTTTCGGCGCGATGCACTCCGGGTGACGCAGCGGGCCGACCGATCGCGCCGAACGCCATGTCCTGACATCGGACTCTGGCCATAGCGCCATCGTATGACTCTTCGTGCGCGGTCTTCACAGGGATTTAACAGCCGATGACCACCGGGACACATCGGCGCGGTTACTTTCATCTCGGTACACCGTCAGTACCTCACGGCCACTGTGGATAAACCCCGGTCGCCCTCATCCCCCACACCCGAACGGCGGAGGTTGTTTTGACACCCGAAGTGGAGGACGCCCTCGCGACGATGGCGTCAGTGAACAATGAATTCTTCTACTGGATGTCCATCGCGCTGATGATGCTCATCCATGCCGGCTTCCTGGCCTACGAGGTCGGCGCGTCCCGGTCGAAGAACGTCCTGGCCACGGCAATGAAGAACCTGCTCGCCTTCGCGACCATCGTCGCATCGTTCTATTTCGTCGGCTGGTTCCTCTACAACGCCATGCCGAGTGGCTTCATCGAGTTCAACGACGCGGCCAAGGCCGCCTTGCCATGGGGCGACAACATGGGCCCCAACACCGCCGACTCCGCCAGTGGCATCTTCTGGGGCGCCTTCGCCCTTTTCGCGGCGACCACCGGCTCGATCATGTCCGGCGCTGTGCTCGAACGCATCCGGACAAGTGGCTTCCTGGTGCTGACGGTATTGGTCGGCTCGGTCACCTGGATCATCGGCGCCGCCTGGGGGTGGCACGGCGCCGGCTGGATGCTCACCAAACTCGGCTTCCACGACGTCGGCGCCGCGGGTTGTGTGCACATGATCGCCGGCTTCGCGACACTGGGCATCCTGATCAACCTGGGCCCGCGGATCGGACGGTTCGGCCCCGACGGCAAACCCGTCACAATCCGTCCGCATAACCTGCCGCTCACCATGGTCGGCCTGATGCTGATCTTCACCGGCTTCTTCGGCTTCCTGATGGGCTGCGTCATCTACGCCGGTGACGGATTCACCACCATCTACGCCAGCCCGACCACGTTGAGCGCCTTCGCCTTCAACACCCTGATGGGCCTGGCCGGTGGGATCATCGGCGCCTATCTGACCTCCCGCGGCGAGCCCTTCTGGACCATCTCCGGCGGCCTGTGCGGTGTGATCGGCGTGGCATCGGGCATGGATCTGTACCACCCTGCACTCGGGTTCGTCATCGCCTTCGGCGCAGGCGCGCTCGCACCGTTCATCGGGAAACTGTTGGAACGCTTCAAGATCGACGATGTGGTGGGTGCGGTGTCGGTGCATGGCGGCATCGGGCTGTACTCGGTGCTGATGGCCGGCGTGTTCCTGTCCGGATACCCGAACACCGACGGCAACCCGTCGGTGTCGCTGTGGGGT
CAGATGATCGGCGCGCTGGTGTTCGCCACGCTCGGCTTCGTCCCGACCTATGTCGTATCGCTACTGCTGAAGAAGGTCGGATTGCTTCGCATACCGGCCGCGGTCGAGGAACAGGGCCTCGATCTCAGTGAGGTGCCCGCGACGCCGTACCCCGAAGGCATCCCGATCACCACCATGCCGCTGAACGGCGGCAAGGCATTGCTCATCGCGGAGGCGAAGTAGATGTTCAGTCCCATCGAGAACTACGACGAAGCCGGTGTGGTCTTCACTTTCGGCGGATACTCGTTCGGCGTGTGGCTGTTCTTCATCCTCGCGGTGTTGCTGTTCGTCGGATTCTTCGTCCGGATGATCCAGCACGAGAACAAGGCTTATAAGGCGATCATCGAACACACTCCGGTGGAACCCGGACCGGCCGCCGAGGGGGAACCCACCGCGTACTGAGCCCGCTGCACGACCCCAGAGAATGCCCGGATCGCACAAGCGATCCGGGCATTCTCTTCCGTGCATCAGCCTGCGTCGCGCAACCACCGCACGGCACGGTCACGGTAGCCGATGATGCCCTTGTAGTCGGCCATGTCATCGGGCAGATCGGCGAGCACCGCGGCGGTGCGCACCCGCCACGGCTCGACCGCGGCGATATCGGCGAGCGGAAGCATGTAGGTGCGGATCAGGAAACAGATCGCACCGGACTCCGGTAGCCGGATCAGATGCTGGACCTCCACCCTCAGATGGACGAGCCTGCCGAACTCCTCGTCCCCGACCCCGGTGATCATCACCCGGTCCGGTCCCCACTCGGGGTAACGCTCGGTGGAGACGTCGAGTCGACGTCCGATGGTCAGTGTCCAGTTGGTGCGCCGGTAGGTTTCGCCGGGCTGCAACCGCATCAGGAACTCCCTGGCCCGGGTGATGACCCCGGTGGCCCGCAGCCGCGGGACCGGACCGTGGATCTCCAGGAAGGTCATGCCGACGTCGAAGCCGAACGACCAGTCGGCGGCGAAGGTCACCACTCCCGCGTCACCGAACAAATCGCCCTCGCGCTGATCCAGCAACACGATGTCCTCCTGGACCTGACCGGCGATATAGGCCAGCGGCTCGGCCGGAAGCGAACTGTCATCGCCGAGAACGAAGGTCTGGTCGATGCCGAGTCGCTGGTTGCGCCATCGCCAATGATCTCCGTCGCGTTGCAGCGACATGCTGTCGGGATACGCGGCCGCCAGCTCGCGCATCAGCGTCAGCATGACATCCCAGCATGCCGGTCGCATGTGCGGCAGCACCGCGTACCGGGACGGGTCGGCCAGGAGAACCGCTGCCCGCTCGGCCAATTCGTGCTCGTACTCGCTGTCGATATCGACCACCCGCTCGCCCCAGCGCCCCACCGGGGTGGTGACCGGTGCACCCGCCGGTTCCACGTTCGTGGTGTAGCGGTAGCTGTCCGCAGAGAAGGGATAGGGAAAGGTCTCGACCAGATCGGTCGACGAGATCATCCCGGTTGTCACAACGCCACCTCCAGAGTTCCGCCGTCACCGCGAGACACGCACGCCATCAGCGTGTCTCCCGCTCGCCTGTCGTCGTCGCCGAGAAAAAGGTCACGATGGGTGATCGCGCCACCGGCCACCGGTATCCGGCATTCACCGCATACTCCCTTGCGGCACAGGTTGGGAACGTCGAATCCGCGGCGTTCCAACGCCTCGAGCAACGACACCCCCGCCTCGACGACGAACTGCTCCCCCGTCGAGGCGATTCTCGCGGTGAAGGGTTCACCCGGATCCAGCGCAACGGAACCGAAGTGCTCAAGGTGTATCCGGCTGTCCGGCCAGCCGAGTTCGACGGCCGCGGCGGTCACATCGGCGATGAATGCGGCGGGGCCACACACGTAGAGGTGCGCACCGAACGGCTGGCCGGCGAGGGCGGTGCGCAGTGCCGAGTCGAATCCGGCCCGGCCGGTATGGATCGAGACATCGTCGGCGAGCCGGGTGACGGTGTCGACGTGCACCGCCCGG
CCGGGTCGGTGGACATACAGCAGCTCGGTGTGACGGCCCCATACGCGCGCCGACCGCAGGTGCGACACCATCGGCGTGATGCCGATGCCCGCGGCGACCAGTAGGTGCTTGCGGGCCCGCAGTACCGGCGCGAAGGCACTGCGCGGTGGACTGGCGATCACGGTGTCGCCGAGGGTGACCCGATCATGTAGCCACAGCGAGCCGCCGGATCCGCTGTCGCACCGGAGCACGGAGATCTCGTAGGAATCCGGCGCAGTGCCGTCCCCGGTCAGGGAGTAGGCGTTGGCGACCGCGCCACATTCCAGCACGATGTGACTGCCCGGGGTGAAGGACGGCAGCGGCGCGCCGTCGGGATCACTCAGTGCCAGCGTCCGGATGCCGGAGACGGTGTCATCGACGGCGGTGACGACGAGCTTGAGCGTTCTCATGCCGGCATCTCCTCGGCATCGGCCATGAATCCGAGGTAGGTGCCGCGTCGTCGCGAAACATGATAGTAGACAAGCAGATTGCGCCCACAACCGGCGCACGGCGTGACGTCCTCCAGGTCCACCACGACGTGATTGACGGTGGGGCAGTGCACGCACTGCACCGATCGCGCGGTCACCTCGGTGCTGGCGAACGTCATCTCGTCATCGGCGACGCCGAGAGCCACCGCGTCGGCCCGCAGACTCAAGCAGGCATACGCGGGACCGGCCACCATCAGCCGCCAGCCGACCGTTGCATCGGCCAGGTCGGCGCGCAGTGCGGTCACGGCGTCGGCGATATCGGCGGCACGGTGCATCCGCACCGCCGATTCCGCTGCCGCCGAAACGATCTGACGCTGCCACTCGTGAGCCACTGCGATGGCTGCAGAGTCGAAGGCGATGAGGGTCCAGGATCGTCCCCGGGTACCGGCCGGCGGTGTGGTGGGCGCGATGGCCCAGCCGGGCACACTGGTCACGTCCAGATCGGGTTTCATGAGCCCCTTTCGGTACACCATGAGTTGACTTAGTTGCCCTTGAGGAAACCAGTTCTGCGTTTCGTCGGTGTGACCCGAATGTCTCGGCTTGTTGACACCTACGCTCGGGGCCATGCCACAACGCTTCCTCGTCGTCGGCGCCGGGATCGCCGGCTTGGCCACCGCCGTCGCGTTGCGCCGGGTCGGTCACGACGTGCGGGTCATCGAGGGACGCGCCGAAAGCGAGGTCGGGACGGGTGCGGGCATCAGCATCTGGCCCAACGCGCTCGCCGCGCTCGATGTCCTCGGACTGGGCGATGAGGTCCGCGCGGCCGGTGGTCGGGTCGGCGCCGGCGCGGTCCGCTGGCGCGACGGCCGGTGGTTGCGACGTCCGGCCACCGACCGGATGGTGCGAGCACTCGGCGAACCGCTGGTGGTGATCCACCGCCGAGCGCTGACAGACATCCTCGCCGCGGCACTGCCCGCAGGCACCGTGACACATGACTGCGCGGCGGCACGGGTGCGTGTGACACCCGAGACCGCCGGCGTCGTGCTGTCCGATGGCGCGGTGCTCGACGCAGACGCGGTCATCGGAGCCGACGGTGTCGATTCGATGGTCGCTCGCAACCTCAACGGGACCCTGGCGAAGCGCTACGCCGGGTACACCGCCTGGCGCGGTATCGCCGAGCACGCGCTGGAACCGGACCTTGCCGGCGAGACGATGGGACCTGGGCTCGAGGTCGGCCACGTTCCGCTGGGCACCCGCCACACCTATTGGTTCGCCACCCAGCGCGCGCCGCAGGGCGCCACCGCGCCGGACGGCGAACTGGCCTACCTACAGCAGGTGTTCGGCGAGTGGCCGGAGCCCATCCCCGCGCTGTTGGCCGCGACCGACCCCGCGGCGGTACTGCGCAACGACCTCTACGACCGCGCCCGCGCCCGCCGATGGTCGAGCGGACGGGCGGTCATCGTCGGAGATGCCGCCCACCCGATGCGCCCTCACCTGGGACAGGGTGGCTGCCAGGGCCTGGAGGATGCGGCCATCCTCGCGGCCATGACGGCC
GAGGACGCCGATCTGCCGGCGGCCTTTGCCCGGTTCGCCGCATTCCGCAGCCCACGGGTGCTGGCTCTGGTCCGCGAGGCGCGCACCATCGGGCAGATCGTGAACCTGCGGCCCCCACTGCTGAGCGCGGCGGCCAGTAGGGCGTCGACGTTGGTGCCGGAGGCCCTGCTGACCCGCCACTTGGCGGGTATCGCGGGGCGCTCGGCGTTCGTCATGCCGGCGTAGGCGCACCGACGAGACGTCAGCGCGGCTGGAACGTGCGTCCCACGTTCTCCAGCGCCATGCACATCTGCCGGAAATCGAGGATGCCCAGCACGGCCCCGCCGCGGATCGCCTGGCATCCGGTGGCCGTCACCGGCGCGGGAGCCTGCGGCAGCGTCGGCGTCGTACCCCAGACCGCGGGTGTGGCGCCACGGGAGCCGCACTTGGGCCACGCCTTGAGCCCCTGGGAGCGCAGCACGTTCTCGGCAACCCGGATCTGCTCGGCGCGTGAGGCGCCGGCCGGCGAGCCGACGCCGCCGTTGGCGTTCCAGGTGGCCTGCTTGAACTGCAGACCGCCGTAATGACCGTTGCCGGAATTGACCGTCCAGTTCCCGCCCGACTCGCACTCGGCGATGGCATCCCAATTCACGGTGTCGGCGCTCGCGGTGGCCGTGAGAACCATCGGCGCCAGCGACAGGCCGGCGGCTGCGATGAGCACCCACAGTGCCCTGCGGGCGATGCTCTTGACGGTGCTGCTGAAGCGGATCATTGTGCGGTCCTTCCCCGACCGGTGACACGGACCGCGATTCACATGACGAGCGTTTGCGCGCCGCGGTTGTCTCGTTTGTCGAGTGACCGATATCACCCGTTACACAGGGGGCGGAAGTTTTAACAGTTCCTTATGTGACACCAGGTATTTGTGAGAAAACAATGGCGCCGGGCGGAAAGCCCGGCGCCATTGTGAAAAGTCGGCTAGGTAATTCAGCCCCGACGACCGCAGGACGGCCATGCACCGATGCCCTGGCTGCGCAGGACGTTCTCCGCGACGCGGATCTGCTCCTCGCGGCTCGCAGAGTGCGGCGCACCCGATCCGCCGTTGGACTGCCAAGTGCTCATGGTGAACTGCAGGCCACCGTAGTAGCCGTTACCCGTGTTGATCGCCCAGTTGCCGCCCGACTCGCAGGCCGCGACGGCATCCCAGTTCACGGAGTCCGCATGTGCGGTTCCGGTCGCCAACATCAGGGGGGCTGCACTGAGCGCCCCGGCAATGGCGGCCATCCCGAACGTCTTGCGGATATTCTTCAACGTCGATCCTCTCGCGAAACGCGCGCCAAAGAACACGCTCGCAACGAATGCGAACGCGGCATGCCTGAAGGGTCAGGCGGCGAATTTGGGGCGTGCCGTCTCGGTCGGGCACGACAGTGGGAGACAAATCGGTGTCTTGCCGGATGACTCCTCCGGCGACCAACGAGCGCTCAGCGACTCGCATCGGCCCACTCACACGCAGATGCTTGGATATGAAATTGTTTGCTCCCAGCGGAGTTGAAAATGAACGTACAAAAGTCTGCAGACGAAGTCACTTCGGTGTCGAACGTGTCGCGTTCAGATCACGAGATGATCACGACGACCGAAAGCCGATAAGTACGCAGGTCAACCGGGAATTAGTACCGCAAGACTAAACCGCAGGTCGAGGCGCTATCGGTGGGATAGATCACACGATTTTTGTGGCCCCGGCCACATATATGTGGGTGCCATGCGCTGGTTGCCGACCTGCGTACCGTTTTGTGTACCCGATATCGGCGTTTCGACACCCGCCCCACAGGTCGATTCACGGGCAGTTCGCTACACATCACCCTATCCCCGTTTATCGGGAACGCCGCGTAAACCATTGCCATGCAACGCTTTGCGCACAAGTCACCGAATCCGCCAACCACCGCCAGGCACAGCATAATTGGTACGGTGAAAGCCGAAAATCTCTATACCACAGCACTTTTCAATGTACGGCGATCAGACTTCAA
CGGGGCCGCATCCACGCGGCGACGCAAGGAGCGCATTCGAGCCCATTGCTGCGCACAGGCAACCGGAGCAATAGCGAGCAACCGCGAAATTCTGTCTGCCGAAAGCCAGTTCGCACACGCACCATGATTTGCGGAAGTGCCTGCGCTGCATCAGGTCTCGCGCCCCCACGGGGGTGATACACGGGCATGCAGATAAACGCCGCGCTCGAATTATCGTACTTTCCGTTGCATGACATTGACGCTGATAACGCCCGCAATGAGTCTTGTAAACAAATTCGGCCGCAAAGTCGCTTTAATTGCGAATTTGCTGGATCCGGTCACGTGGCTCAGACCGGCAGGTCGTCCGGGGTGTTGACATTGGTGAGCGCGCGCTGGGGTTCGGTGACGATGCGCTGGGTGTCGACGGCATCGACAAGTGCCCTCATGCTGCGCTCTCCCCCAGCCACCAGATCATCGGCGACGCCTGCCAACGAGGTGCGGTAGAGCCCGGCGAGGTAGTGGTCACGGCCATCCCAGACCAACACCACCGCGGCCGGGGTCGTCGCGGCGGGCTCCAAAAGCTCGTCGATGAACGAAGCGGTCAGGTGCGGCATGTCCACCGCGCAGAGAAAGGCCCATTCAGCGCCCGCATCGGCGGCCGCACGTAACCCCCTGGCCGTCGCGAGCAGTGGTCCCAGGCCGCGCACCTCGTCGCGCAGGATCTGGGCGGGCACCGTCGGCAACGCCTGACCCGGTGCCGCCACCACGAAGACCGGATCGCAGCGGGCGGATACCGTGTCGACCACGCGCTCGACGAGCGTGCGCCCCTCGAACACCAGGGTTGCCTTGTCGCGACCCATCCGTCGCGACGCGCCACCGGCCAGCACAACGGCGGCCGGTGCTGTGCGCGGGGTCACACCAGCAGGTTAGTCCACCGACCAGGTGTCTTTACCGCGAAGCAGCGATTGCAGCGCGGCCGTGTCGTGCACCTTGGCTTCCCGCGCGGCCGCCACCTGCTGACGAGCAGCATCGTCGTAGGTGGGCTTGTTCACCTGCCGGAAGATGCCCATGACCATGTGATCCAGGTTCTGCTCGCTCAGGCGCGACAGCGCGAAGGCGTAGGCCGGATCGTCGATGGTGGCGTCGTGCACCACGATCTGATCGGCAGGCACGTCGGCGGTCTTGGCGATTTCCAGACCGAAACCGGACTTCACCACGGCGTACTCACCATCGGCACCGAACGTGATCGGCTCACCGTGGGTGATGTTGATCAGCCGGTCCTCGGCACCCTCCTTGCGCAGGGCGTCGAAGGAGCCGTCGTTGAAGATCGGGCAGTCCTGCATGATCTCGACCAGGGCGGCACCCCGGTGCTGTGCGGCGCCGCGCAACACCTCGGTCAGACCCTTGCGGTCGGAGTCCAGCGCCCGGCCGACGAAGGTGGCCTCCGCACCCAGCGCCAGCGACACCGGGTTGAACGGGTAGTCCAGCGAGCCCATCGGGGTCGACTTGGTGACCTTGCCGACCTCGGAGGTCGGCGAGTACTGGCCCTTGGTCAGACCGTAGATCCGGTTGTTGAACAACAGGATGGTGATGTTGATGTTGCGGCGCAGGGCATGGATCAGGTGATTGCCACCGATGGACAGCGAGTCGCCGTCACCGGTGACGACCCACACCGAGAGGTCCTCGCGAGCCAGCGCCAGGCCGGTGGCGATGGTCGGGGCGCGGCCGTGGATCGAGTGGAACCCGTAGGTCTCCAGGTAGTAGGGGAACCGGCTGGAGCAGCCGATACCGCTGACGAACGCGATGTTCTCGCGTCGCAGGCCCAACTCGGGCAGGAAGTTGCGGATGGTGTTGAGGATGACGTAGTCACCGCAGCCGGGGCACCAGCGGACCTCCTGGTCACTGGTGAAGTCCTTGCCCTTCTGCGGCTGGTCCGTGGTCGGCACCAGCGCCGTCTTGCTGAGGGCTTCGGTCAGGCCCAGGTCAGAGCCGATGAGGTCGGTCATACGTTCGCTCCCACA
GATTCCACGGTGGCCGCGGCCAGCCGTGCGAACTTCGCCTTGTCAGTTTCCTTCTCGCGCAACGTGCCGTCCAGCGCCGAATCGATGATGCCCTCGACCTCGTCGGCCAGGAAGGCCATGCCCTCGACCTTGGTCACCGACTGGATATCGACCAGGTACTTGCCGCGCAGCAGCAGGGCGAGCTGGCCGAGGTTCATCTCCGGCACGACCACCTTGGGGTACTTGCGCAGCACCTCTTCGGTGTTGGCCGGGAACGGGTTGAGGTGACGCAACTGGGCATGCGCGACCTTGATGCCCTTGCGTCGCGCCCGCCGGCACGCCTCGCCGATGGGGCCATACGAGCTGCCCCATCCGAGCATCAGCAGCTCGGCATCGTCGGTCGGATCGTCGACCACGAGATCGGGCACGGTGATGCCGTCGATCTTGAGCTGGCGCAACCGCACCATCAGATCGTGGTTCTTGGGTTCATAGGAGATGTTGCCTGAACCGTTGGCCGCCTCGAGACCGCCGATGCGGTGCTCCAGGCCCGGGGTGCCCGGCACGGCGAACTGGCGGGCCAGCGTCTCGGGATCGCGGGCGTAGGGCTCGAAAGGCTCGCCGGACTTGGCGAACTTGTGCTCAATGGCCGGGTAGGTGCTGATATCCGGGATACGCCACGGTTCGGAACCGTTGGCGATCGCGCCGTCGGACAGGATGATCACCGGGGTGTGGTAACCGATCGCGATCCGCACTGCTTCCACCGCGATGTCGAAGCAATCCGACGGCGAGCACGGCGCCAGTACCGCGACCGGCGATTCGCCGTTGCGGCCGTAGAGCGCCTGCAACAGGTCGGCCTGCTCGGTCTTGGTGGGCAGGCCGGTGGACGGACCGCCGCGCTGGACATCGATGACGAGCAACGGCAATTCGGTCATCACGGCCAGGCCGATGGCCTCGGACTTCAGCGATACGCCCGGACCCGAGGTGCTGGTGACGCCGAGCGCACCGCCATAGGAGGCGCCGATGGCGGCACCGATACCGGCGATCTCATCCTCGGCCTGGAAGGTCAGCACGTTGAAGTGCTTGTACTTCGACAGCTCGTGCAGGATGTCGCTGGCCGGGGTGATGGGGTACGTGCCGAGCACGACCTGGATACCGGACAGGTGCCCCGCGGCGACCACACCGTATGCCAGTGCGGTGTTACCCGAGATCTGGCGGTACTCGCCCGACTTCAGCTTCGCCGGAGCGACCTCGTAGGTGGTGGCGAACGCCTCGGTGGTCTCGCCGTAGTTCCAACCCGCCTTGAGCGCCAGCACGTTGGCCTCGGCGATCTCGGGCTTACGGGCGAACTTCTCCCGGATGAATGCCTCACTGGCCTCCAACTCGCGGCCGTACATCCACGAGAGCAGCCCGAGCGCGAACATGTTCTTTGCGCGCTGGCCGTCCTTCTTGGTGGCGCCGATCGCCTCGACGGCGCCGAGGGTCAGGGTGGTCATGGCCACCGACTGCACGACGTAGTCGGACAGTTCCTCGTTCTCCAAGGGATTGGTGTCATAGCCGACCTTGGCCAGGTTGCGCTTGGTGAACTCGTCGGAGTTGGCGATGATCAGACCGCCGCGCGGCAGGTCGCCGACGTTGGCCTTCAGCGCCGCCGGGTTCATCGCGACGAGGACGTCGGGACGGTCGCCGGCGGTCAGGATGTCGTAGTCGGCAATCTGGATCTGGAATGACGACACACCCGGCAGGGTGCCCTGTGGTGCCCGGATCTCGGCGGGATAGTTCGGCTGGGTCGCGAGGTCGTTGCCGAAAAGGGCGGCCTCCGAAGTGAAGCGATCGCCGGTAAGCTGCATTCCGTCACCGGAGTCGCCGGCGAAGCGGATGACCACTTTTTCCAGCTTCTGCCGCGGAGCGCCGGTGCTGCCGTTCTGACCCACGACTTCCTGCCTTTCACACCCCGGCCGGTGGCGCGTGGCCAGGCGGGCGTTACGTCAAAAACTTTTGGTGTCACTGCGAGTAACCAGTAT
GGCACTTCTCTTAGGGTGTCCATTGCCACGCCGCGTGACGACACGCAGGTGGCGACGCGAAGAACTCGCTGGTCAGCGACGTGCAGGGTGTGGCGGTCCACACATCATGTGGTTTTGGTCACTCAAAGAGCGCGTCATTTCCTAAGGATCTGGTTACCGATCAGTAGCTCTGAGCGACGTACAGGCCCGGCGGAGTTGACACCTGTCAAGTCTGCGCAGGTCAGGCCTTGGTCGACGGCATCTGCGCCCGTTTTTGCAGCTCGCGGGCGACCAGTTCGCTGGCGATCTCCACCGCCCGAACCGGGTTCGCCCCCGCCGCACGATGCGCCACGTAGCTCGCGGTGAACATATCGCCGGCACCGGTGGTCTGCACGTCCATCACGCGCCATGCGGCAGGAACCCGAGTCACCGAGCCGTCCCGGTAGATGTCACAACCCTCGGACCCATAGGTGACCAGGATCTCCGGCACCCCGAGCCGGTGCGCGGCCGCGAGGTCGAACGCACCATCGGCGACGATCACGGCCTCGTCCTCGGCCAACTTGAGCACGCTGAGGTGACGCAGCAGCTCCGGGGAGAAATCACGGTCCAGCACCAGCGGCCCGACCCGATCGGCGCGCACCAGACCCTGACCGTCATAGGCGATACGGTGACCGCGCGCCGCCAGATGGGCGAGGGTGTCGGCCGGGAAATCGGTGCGCAGCAGCGGTGCCAGGTGGATCCAGGTGGTGCGCGGGTCGTCGGCATCGATATCGCCGGGACGCCATACCGGCCCGATCGCCGAGACGCTCATGTGCCGGTGGTCGGTGTCGTCATAGTCGAGGCTGAATGCGCTGGTGCGATCGGCGGGAAGCAGCCGAACCAGCGCGCCGAACCGGTCGAGCACCGGGCCGAAGAGTTCGTGATGACGTCGTTCGCCCATGGCCGCGATGTGCGTGCTGCCCGGTGCCGATTGCAGTGCCACCCCGGCAAATGATGCGCAGCCACCGGGACTCGGGGCGGCACCGTTGATGACGTCGATGGCGAGGTTGCCCAGCACCGTCACGCCCGGGACCAGCTGTGGCGCCATCAATCCGAATCCCCCGCCTGCGGTATGTCGTGTGGTCGACACACGGGTGACGTGGCCGAGCGGTGAAATGGTAGTCCGCATCGGGCGCTTTCAGCGATTCCGCAGCGCGGCCCCGTTCCGGCAGATCCCGCCCCGGAACGTCCTATCCAGGCACGTTGTGGGGCGTGACGACCTGGATCGGCGCCGGACGCACGGACTCCTCGGGCAGCGCGCGTTGCCGAGCCAGGATCACCCGCATGGCAAGGCGGGCATCGGCGCGCAGGTCGTTGTGCAGCACCACGGACAACTTGCCCTGGCGCAGCAGTCTGCGGTTGTCGACATCGAGATCGTGTGCGACGAATACCCGGCACGCGCGTCCGAGCTTCTCGAACGCGGCGACGGTCGCGGCGTTACCGCCCCCCGGCGAGTAGACGGCCTCGATACCGGGGTGGCGGCGCAGCGCGTCGAGCACCAACGCCTCGGTGTTGGCGTCGATGCCATCCCCCTCGCTGATCTCGACGATCTCGCGACGGGATCCCCGCAAACCTGCCCGGAACCCCACCTCCCGTTCGCCTTCACCACGGAATACCGTGCGGCTCAGGGTGATCAGCACATCGGCGGGCGCGTCACCCAGCCATTGGTCGACGAGGTACGCCGCCGTCAGCCCGGCACCGTGATTGTCGATACCGACGTAAGCGGTGCGATCGCTGTTGGCCACATCGGAGGTGTAGGTGACCACAGGCACCCCGGCGCCCACCAGCCGATCGACCTGTTCGGCGACCTCGGGCTCGTCCTGCGCCTTGAGGATGACGCCGTGGCTTCCCTTGATACGGCCGAGCACCTCGACCGTGCGGGCCGCCGAACCGGATTCCCAGAGGTGGAAGCGAGCCCGCACCGCGGCGGGCGCGAACGCCGGGAGCTCGGCCTCCACGGCTGCTCGGAACGCGTCGGAGAA
CCGCTGCGGCGTCTGCATCACGACGTCGATCAGGTAGCGACGGCCGTTGAGCCGCAATTGGGCGCGCTGCTTGTCCAGATCGGCGATGGCCTGCAGCACCTCGGCGCGGGTGTTCTCCCGGACCCCCGGACGGTCGTTGAGGACGCGGTCGACCGTCGCTTCGCTGAGACCGCACTGCTGGGCGATCTCGCGAACCTTGTATCGGTGCATCCCCTACCTCTTCCGCGAGCAGACGTAGACCTGCGCTCTGTGCGCCGATTTCGGCAGTTCTGCGTCTGTTCGCCGATATAGCGGTTGATGGTTTTTTGATGGCTTTCTGCTGTTGATTGCGGTCGGTGTCACAGCAAGACTAACCGTGGTGTGCGCCTTGGACGCACCGGACGAACGGAGAATCCCATGACCACCATCGGTGTGATCGGACTGGGCCGCATCGGCGCATTCCACACCGAAACCCTTGCCGGACTCGACGGGATCGACGGGCTGGTGATCACCGACGAACGTCCCGACGTGACCGCGGCGGTGGCCGCCAAGCACGGCGCCACGGCGGTCGGATCCGTCGAGGAACTGCTGGCGTCCGGCGTCGACGGGGTCGTGGTCGCCGCCGCCACCCCCGCACACGCGGACCTCACGCTCGCCGCGGTCGAACGCGGGATCCCCACCTTCTGCGAGAAGCCGATCGCCTCCACCGCCGCCGAGAGCGCGCGGGTCGCGGAGACCATCATGTGCACCGGGGTACCGGTGCAGGTGGGCTATCAGCGCCGGTTCGACGCCGCGTTCGCCGCGGCCAAGACCGCCGTCGACAACGGCAGCCTCGGCATCCTGCACACCGTGCGCAGCACCACGATGGATCCCGCTCCCCCGCCGCTGGACTACATCAAGGGCTCCGGCGGCATCTTCCGGGACTGTGCCGTCCACGATTTCGACGTCGTCCGCTGGATCACCGGCCAGCAGGCCGTCGAGGTCTACGCCACCGGGTCGGTGCAGGGTGACCCGCTGTTCGCCGAGTACGGCGATGTCGACACCGCCGCGGTGGTGGTGCGATTCGACGGTGGCGCACTGGGAGTCATCTCCAACGCCCGATACAACGCACGCGGTTACGACTGCCGTCTCGAGATCCACGGCTTCGACGATTCGGTGGCCGCAGGCTGGGACCAGGGTGCACCACTGCGCAATGTCGACCCGGCCAACGAGTTCCCCACCGGACCGGCGCACAACTTCTTCATGGATCGCTTCACCGAGGCGTTCCGCACCGAGCTGGCCGGTTTCCTTGAGGTCGCCAAGGGCGGCCCGGTCCGCGGCGCGACCGTCGCCGACGCCGTCGAGGTGGCCTGGATGGCAGAGGCCGCCACCGAGTCCCTGCGCCGTGGCACCCCGGTGGCACTCGAGTCGGTCAAAGCGAAGGTAGGCACCGCATGAGCATCAAACTTGCCGGAGCACCCATCTCCTGGGGCGTCTGCGAGGTGCCGGGATGGGGCCACCAGTTGGCCCCCGCACGGGTCCTCGCCGAGATGCGCGGTGTCGGCCTGACGGCCACCGAACTGGGCCCGGAGGGTTTCCTGCCCGCCGACCCCACGGAACTGACGTCCGTCCTGGCCGAGCACCAGCTCAGCTGCGTGGGCGGCTTCGTCCCGGTGATCCTGCACCAGGTCGACCATGATCCGGCCGAAGAGCTTGCCGGGCCGCTGGATTCGCTGATCGCCGCCGGCGCCGGCGTGGTGGTGCTGGCCGCCGCCACCGGTGCCGATGGCTACGACTCACGACCCGTGCTCGACGAAGCGCAGTGGAACACGTTGCTGGCCAACCTGGATCGGCTGGCCGGCATCGTCGCCGACCGCGGCCTGTTGGCGGTGCTGCACCCGCATGTCGGCACCATCGTGGAAACCCGCGCCGAAGTCGACCGGGTGCTGGGTGGTTCGTCCATCCCGCTGTGCCTGGACACCGGGCACCTCCTCATCGGCGGCACCGACCCGCTGGAGCTGGCCAAGGCCGTCCCGCAGCGC
ATCGCGCACGCCCATCTCAAGGACGTCGACGCGGCGTTGGCCGCCAAGGTGCAATCAGGCGAGCTCAGCTACACCGCGGCCGTCAAGGCGGGTATGTACACCCCGCTGGGCACCGGCGATGTGGATATCGAAGCCATCGTCGGTGTGTTGCGCGACAACGGGTTCGACGGTTGGTTCGTGATGGAGCAGGACACCATTCTCGACGGCGCGCCCGCCGGTGACGGTCCGGTGGCCGATGTGCGGGCCAGCGTCGCCTTCCTCAACGGCATCATGGCCTGATACCGAAAACTCCCAGACGACATCGCGACCCGACAACCGTCGGGTCGCGATGTCGTTGGTGCGGTCAGTGCGTCAGATACCTGTCGGCAAGCCCGGTCAACGTCTTCTCGATGCCCTGGGCATTCTGCTTCTTCATGCCCAAGAGCTCGATGACGAACGGCACCTTGGCCGAGGAATAGTCGAAGGTCTCGGTGACCTTGGTGATGCCGGGACTCACCTCGGCGAACTCCCAACGCCACTTGTGCCCGAGCGGGTGCTGCCACTCGACGATCTCGTTCTCCCTGGCCTTCGTCACCGTGGACGTGATCTTGTAGGGCAGCCCGTATTGGGTCATGCCGATCGTGAACTTGTCGCCCTCGGACACCCGGTGCGGACCCTTGACCTCGACATCGCGCACCGTTCCCGATCCGTCGATCTCGTGGTGCCGATGCGGGTCGGCGATCTGCTCGAACAGCGTGGCCACCGGTGCGGTGACCTGCACGCTACGGCTGACGGACTGCTTGCCTGCGTCTTCTGTCTTCAGTGTCGTCGTGGTCATGTCGCCGTTATACCCGCGCACCGGGTGGCCTACACGAGACGGCAGGCCGGGCGCGGGGTGAGTCAGCGACCGAGCTCGTGGCTGAGCGCCTCGAGCTCGTCACCGCCGGCCATCTGTTGGGTCAGATGCTCCAGCGTGATGTCGTCATAGGTGCAGTCCAGCTTCTGACGGCCCCGGTTGAGCAGTACGAAATGGTCGCCCACCATGTGGGCGTGGTGCGGGTTGTGTGTGATGAAGACGACACCGAAGCCGGCCTCCTTGGCCGCGGTGATGTATTTGAGCACCACACCGGACTGTTTCACACCCAGCGCCGCGGTCGGTTCGTCGAGAATCAGCACCCGCGCACCGAAGAAGACCGCCCGCGCGATGGCGACGCACTGGCGCTGCCCACCGGACAGCGACCCGATCGGCGCGTCGACATCGGGCAGTTCGATGCCCATCTTCGACAGCTCGGCCAGCGTGGTCGCCCGCATGGCGTTGGCGTCCAGCGACCACGGGAACGACTTCTTGCGCACCTCCTGGCCGAGGAAGAAGTTGCGCCACACCGGCATCAGCGGGACGACGGCGAGGTTCTGGTAGACGGTCGCGATGCCCTTGTCCAGGGCGTCGGCCGGTGAGGAGAACGTGGTGACCTCACCGTCGACCAGCAGTTCACCCTCAGTCTGCTGATGCAGGCCGGCGATGATCTTGATCAGCGTGGACTTGCCTGCACCGTTGTCGCCGAGAATGCCGGTGACCTCACCGGCATGCACGCGCAGGCTGATGTCCGCCAGCGCGGTGATGTTGCCATAGGACTTGCCGACATTGCGCAGCTCGACCAGTGGCACCTTCTTACCGCCGGAGGATGCGTTGCTCGGGGTCTCGACTGTTGCGGTCATCAGATCACTTCTTCGCTGCGTAGTTACGGAAGGCATTGTTGGCGATCACCGCGAACAGCAGCATCCCGCCGAGGAAGAACTTGAACCAGTCCGGATCCCAGCCCGCATAGACGATGCCCTGATTGGTCATGCCGAAGATGAACGCACCGATGGCGGCGCCGACCGCGGTGCCGTATCCACCGGTGAGCAGACAGCCACCGATGACCGCGGCGATGATGTAGAAGAACTCGTTACCGATGCCCTGCCCGGACTGCACGGTGTTGAACGCGAACAGCAGATGCATGCCGACGAACCAGGCGCAGAAG
CCGACGAACATGAACAGCCCGATCTTGACCTTCGTCACCGGAATGCCGATGGCACGGGCGCTTTCGGCATCGCCGCCGACGGCGAAGATCCAGTTGCCGATCCTGGTCTTGAACAGCACCCAGGTCGCGACGACGGTGAACACCAGCCACCACAGCACGGTCACCCGGATACCGACACCGAACACGGTGAAGCTGGAGGAGAAGACCTTCTGCGCGGACTCCCAGCCGGCCATATCGTTGACGCTCTGGGTGGCCACCTGTCCGGCAACCAGTTTCGTGACGGCCAGGTTGATACCGGCCAGCATGAAGAACGTGCTCAGCGTGATCAGGAAGCTGGGGATCTTGGTCTTCATCACCAGGAAGCCGTTGAAGAAACCCACCGCCAACGACAGCACCAGCGCCAACGCGGCACCGACCCAGAGATTCAGGTGCAGGTTGTAGGCCAGCATCGATGCCGCCAGTGAACTGAACGTCACCGCGACGCCGGCCGAGAGGTCGAACTCGCCGCCGATCATCAGCAGCGCCACACCGCAGGCCATGATGCCGATAGTGCTGCTGGCGTAGAGCACGGTGGCCAGCGAGGAGGCCTCCCGGAACGGCGCCGCCACCACCAGGAACGCGATGAAGATGCCGATGGCGCCGATACCGGCACCCATCTCCGGGCGGATCAGTATGCGCTGTAGACGATTTCGTTCCTTGACTCGTTCGTCACGGACGACCGTGTGGTTTTCGAGGGTCACCTCGGTCTGGGTGGACATCTCTGCTCTTTCCGACACTGCTAGCGAGTTCCGCCCTTGGCCAGTTCGGCCACGGCGTCGATGTTGGTGTTGTCGATGAAGGCGGGCCCGGTCAGGGTCGGCTTCCCGCCACCGATGACGTTCTTGTTGTTCAGGTAGAGCCACAGCGAGTCGACGGCCAGGTAGCCCTGCAGGTAGGGCTGCTGGTCGACCGCCCACTGGATATCGCCACCCTTGATGGCGTCCACCAGCGCCGCGTTGGTGTCGAAGGTGCCGATCTTCGCCGAGCTGTTGGCGTTCTTGGCCGAGGTGACCGCGGTCAGCGCGAACGGCGCGCCGAGCGCGACGATCATGTCGATGGCGGGATCCTGCTGGAGCTTGGCGGTGATGGTCGATTCGACCGAGGGCATGTCCTTGCCGTTGACATTGAGGATCTCGGTGGCCCCGAACGTGTTCTTCATCCCCGCACAGCGGGCCTCGAGGTCGACGTTGCCCTGCTCATGGATGATGCAGATGGCTTTCTTTGCACCATCTTTGGCGACGCGCTCGCCCGCGCCCTGGCCGGCGATGTAACCGTCCTGACCGAAGAACTCCTGCACACCCATGTCCTGCCAGGTGTCCATGCCGGCGTTGAACGCCACGACGGGGATTCCCTTGGCCGCGGCGGCCTGCACGGCGGCCCGCATGGCATCGGGTTTGGCCAGCGTGACCGCGATACCGTCGACATTCGCATCGACGGCGCTCTGGACCAGGTTGGCCTGGTTCGGAGCTTCCGGATCGCTGGAGTAGCGCAGCTCGACATTGTCCTTCTTGGCCGCCGTCTCGGCGCCCTTGCGGATGAGATCCCAGAAGGAGTCACCGGGGGCCTCGTGGGTGACCATCGCGATCGTGACCCGCGGGGTGTCGACGGTTCCGCCGCCCGAACCGCCGTTCCCGCCCGCGGTGTCCGGGTTGCCGCCGGTCGACGAGCAGGCCGCCATTCCCAGCGCCAGCACACCCGCCCCGGTCAGCGCTGCCCATCGGCGCAACGACCGGATTCCTCGACTGTTCGCGCGAGCGGTCATCCCTGGCTTCTCCTTCGTCCCGCATCCGGGGCTCGTGAACTTCGACGTCCAGGCGGTCGCGGTATCCCGGCGACGCGCCACCGTTGATGTAATACGGCTCACACCGGAAAGTCAACACTTTGTCCTGACATTAGGACTGCACGTCATATGCGAGCCGGTGGCGCCGGGCGTTCAGCTCACCGGTCGACGAGGGTTGTCT
CGAAGTAGTACCGCGACGCCCGGTAGCAGTGACTTCCGAATTCGACCGCGCGCCCCGAATCGTCGAAGGCCGTCCGGCTCATGGTCAGCAGCGGGGCCCCGACCTTCTCTTCGAGCAACCGAGCCTCGGTGCGCTGGGCCGGTCTGGCACCGATCCGCTGGCGCGCCAGCCGGATGTGCACCCCCCTGCCACGCAGCGCCTGATACAACCCGCTTGCCTCGAGTTCCCCGGCGTCCGGGGCGATCTCGGCCGGCAGGTGGTTGGTCATGATGGCCAGCGGCTCCCCGTTGGCATAACGCAAGCGCTGCACGGTGACGATCTCGTGGTCGGTGGACAAGTTGAGCTCGGCGGCGATCTCGTCATCGGCGACGCCGCGGCGGTACTCCAACAGCTGTGTTGTCGGATCCTGTCCGGATTTCGCGAGATCGTCGAAGAGGCTGGTCAATTCGACGCGCCGGTGCACCGGGTTCTGCACCACCTGCGTGCCGACGCCGCGCTTGCGGACCAGCAACCCCTTGTCCACGAGCTCCTGGATGGCGCGCCGGGTCGTCGGCCTCGACAGCGCCAATCGGCCGGCCAAGGACAGCTCGTTCTCGAATCTGTCACCCGGGGCGAGTTCGCCACTGCGGATCGCCGCCTCGATCGCCTGCGCGAGCTGGTAGTAGAGGGGCACAGGGCTGGACCTGTCGAGTTCGACCGCTAGGGGCACGTCCACTCCGATTTCGTCGGCGATGATCAGCAAGACTAACCGAACCTGCGCACTATGACCGAAAGTTCGAATGTCAGGACAAATTGTTGACAGGGGTAGGTGATCGGGGGCACAGTGCTTGCGACAGAGCCCTGCACCCACTTCGGGAGATCACGTGTCGTCACCACCCCAGCCATCAACCGATCCGTTCGATGTCATCGCCATCGGCCGCAGCGGTGTCGACATCTACCCGCTGCAGACGGGTGTGGGACTCGACGAGGTCGACACCTTCGGGAAGTTCCTCGGCGGCAGCGCGGCCAATGTCGCGGTGGCCGCCGCGCGACTGGGCAACCGCACCGCCCTGATCTCGGGAACCGGCGACGATCCGTTCGGACGGTTCGTCCGCGATGAACTGGCCCGCCTCGGGGTCGACAACCGCTATGTGGGCATCCACGGCCGCTATCCCACCCCGGTCACCTTCTGCGAGATCTTCCCGCCCGACGATTTCCCGCTCTACTTCTACCGCAAGCCCTCGGCGCCCGATCTGCAGATCGAAGCCGGTGACATCGATGCCGACGCCGTGCGCACCGCACGATTGTTCTGGTCCACGCTGACCGGCCTGTCCGAAGAGCCCAGCCGCAGCGCGCATTTCGCGGCCTGGGCCGCACGTGCCCGCACGCCGCTCACGGTGCTCGATCTCGACTACCGGCCGATGTTCTGGGCCGACCCCGCCGCAGCAGGTGAGCAGGCGGTGCGTGCGCTGGGCCAGGTCACCGTCGCCGTGGGCAACCGCGAGGAGTGCGAGATCGCCGTGGGTGAGACCGTTCCCCACCGCGCCGCCGATGCGCTGCTCGACCTGGGGGTCGAACTGGCCATCGTCAAACAAGGCCCGCGCGGTGTGCTCGGCAAGACCAGGCACAGTTCGGTGACGGTCGCACCCAACGACGTCGACGTCGTCAACGGCCTGGGTGCCGGAGACGCGTTCGGGGGCAGCCTCATTCACGGTCTGCTGCGGAACTGGCCACTGGAGAAGACCCTTCGCTACGCCAACGCCGCCGGCGCCATCGTGGCATCCCGGCTGGAATGTTCTACGGCCATGCCCACCGCCGCCGAGGTCGCCGAACTCGCCGAACAGAGCGCCGTGGAGGCCGTCAATGTCTGACGCACTGTGCCGTGACTACGCCGAGGTCACCGAGTTGCGCGCCGCCGACCCGGCGTCGGTGACGAAGGCCTGGCACGCCCGCACCACCCGGCCCACGGTGCGTGGTGACGGCAGGCTGATGATCGTGGCGGCCGACCACCCGGCACGCGG
CGCGCTGTCGGTAGGCACCCGGGTGACCGCCATGAACAGCCGCATCGACCTGCTCGACCGGCTGCGGACCGCGTTGGCCGATCCCGGCGTCGATGGTGTGCTGGCGACCGCGGATATCCTCGACGACCTGGTGTTGCTCGGTGCGTTGGAGGACAAGGTGGTGTTCTCGTCGCTGAACCGTGGCGGACTGGCCGGTTCGGTCTTCGAGCTCGACGACAGGATGACCGGCGCGACCGCGACCTCGACCGCCGATGCCCGCATGAACGGCGGGAAGATGTTGTGCCGCATAGATCTCGACGATCCCGGCACCGTCTCGACCCTCGCCGGCTGCGCGCAGGCCGTCGACCAGCTGGCCGCCCACGGATTGATCGCCATGCTCGAACCGTTCTTGTCCACCCGGGTCGACGGCAAGGTCCGCAATGACCTCTCCCCCGACGCGGTGATCAAGAGCGTGCACATCGCCCAGGGCCTGGGTTCGACATCGGCCTACACCTGGCTCAAGCTGCCCGTTGTGCCCGAGATGGACCGGGTGATGGAATCGACGACGATGCCCACCCTGCTGCTCGGCGGTGACCCGACCGACCCGGACGAGGCCTTCGCGACCTGGGCGTCGGCACTGGCCCTGCCCGCGGTGCGCGGTCTGATCGTCGGGCGCACCTTGCTCTACCCGCCCGATGACGATGTGGCTTCCGCGGTCTCGGGTGCCGTCGGATTGGTGCGGTGATGAACAGCTCCTGGTACATACCGGCGGGTTCGGCCGACGCGCCGTACTCGGTGGCCGTCACCCCGGAGTCAGCAGGCTGGTCCGAATGTGGGCTGCACGTTCTGGATCTGGGCACCGACGGTACGGTCGCGCTGCAGACCGGCGACACCGAGGTGATGATCCTGCCGCTGGCCGGCGGCGGCTCCGTCGAATGCGCTGACGACGTGTTCGAACTCGGCACGCGGACATCGGTTTTCGACGGCCCCGCCGATATGGTCTATCTGGGTGTCGGGCAGTCCTATGTGGTGACCGGACACGGGCGCATCGCCGTCTGCGCCGCCCGAGCCAGCCGGTCCCTGCCCAACCGGCGGCGGGCCGCCGCCGATGTGCCGGTCGAGTTACGCGGGGCGGGCAACTGCAGCCGGCAGGTACACAATTTCGGCACCGCGGACACCTTCGAGGCGGATTCACTGATCGCGTGCGAGGTCATCACCCCCGGCGGTAACTGGTCGAGCTATCCGGCCCACAAACACGATGAGGACAGCTACGTCGAGTCCCAGCTCGAAGAGATCTACTACTTCGAGATCGACGACAGCCCGGCCGGCACTCCGGGATTCGGCTATCACCGCGTGTTCGGCACACCTGCCCGGCCCATCGAGGTACTCGAAGAGGTCCGCACCGGCGATGTGGTCCTGGTGCCGCACGGCTATCACGGCCCGTCGATCGCCGCACCCGGTCACCACATGTACTACCTGAATGTCATGGCCGGATCCGGGCCCGAGCGGGCGTGGCGGATCTGCGACAACCCGGACCACACCTGGCTGCGCGCCAGCTGGGATCACCAAGAGATCGACCCGCGCCTGCCGATGCGCACGAACCGAGGAGTTTGACCGTGGTGTCCACCGCCCCGAAAGCAGCCGACAAACTCGCGGATACCGAAGCCACCGTCCGTCTCACCGTCGCGCAGGCCACCATCGCGTTTCTGGCCGCCCAGTATGTCGAGCGCGACGGTGATCGCACGCCGTTCTTCGCCGGATGCTTCGGCATCTTCGGCCACGGCAACGTTGCCGGTCTCGGCCAGGCATTGCTGCAGGACGAGATCGAGGCGCAGGCGGCCGGACGCGCACCACGCATGCCCTACGTGCTGGGCCGCAACGAACAGGCGATGGTGCACAGCGCCGTCGCCTATGCGCGGCAGAAGGACCGGCTGCAGACCTGGGCGGTGACCGCCAGCGTCGGACCCGGATCGACCAATATGCTCACCGGCGCCGCGTTGGCCACCATCAACCGGCTGCCG
GTCCTGCTGTTGCCCGCCGACACGTTCGCCACCCGGGTCAGCTCGCCGGTGCTCCAGGAGCTCGAACTGCCGTCCTCCGGTGATGTCACGGTCAATGACGCGTTCAAGCCACTGTCCCGCTACTTCGACCGGGTCTGGCGGCCCGAACAGCTGCCGGCCGCCCTGCTGGGGGCCATGCGCGTGCTCACCGACCCCGTCGAGACGGGCGCGGCCACGGTATCCATCCCGCAGGACGTCCAGGCCGAGGCCCATGACTGGCCGGAATCGCTTTTCGCCGAACGCACCTGGCACATCGCACGTCCGCTGCCCGAACGCGCGGTGGTCGCCCGGGCGGCCGCCCTCATCGCCGCGGCGCGCCGGCCGTTGATCATCGCCGGTGGCGGAGTGCACTATTCCGGCGCGGAGGCGGCGCTGACCGCCCTGGCCGAGCAGACCGGCATCCCCGTGGCCGAGAGCCAAGCCGGTAAGGGGTCGTTGCGCCACGATCATCCACAGAGCGTCGGTGCGGTCGGCTCCACCGGCAGCACCGCGGCCAACGCCCTGGCCACCGATGCCGACGTGGTGATCGGAATAGGCACTCGGTACAGCGATTTCACCTCCGCCTCGCGCACCGCCTTCAACAATCCACAGGTGCGTTTCGTCAACATCAACGTCGCATCCCTCGACGCGGTGAAGCAGGGCGGGGTGAGTGTCGTCGCCGATGCGCGCGAGGCCATCGAGGCGCTCGGCCCGGCGCTGGCCGATTACCGCGTACCCGACGAATACCGTTCTCACATAAGCGAGCTGGCCGGAGAGTGGGATGCCGCGGTCTCGGCTGCGTTCGCCACCGAGGACGGCGCGCAGTTGAACCAGAACCAGGTGATCGGACTGGTCAACTCCCTGTCGGACCCGCGCGACGTGGTGGTGTGCGCCGCCGGTTCGATGCCCGGTGATCTACACAAGTTGTGGCGATCACGTGACCGCAAGAGCTACCACGTCGAATACGGGTTCTCCTGCATGGGATATGAGATCGCCGGCGGTATCGGGGTACGCATGGCTGCGCCGGATCGCGACGTGTTCGTGATGGTCGGCGACGGTTCGTATCTGATGATGGCCACCGAGATCGCGACCGCGGTGCAGGAAGGCGTCAAGGTGATCCCGGTGCTCGTGCAGAATCACGGGTTCGCCTCCATCGGCGGGCTTTCGGAGTCACTGGGCTCGCAGCGCTTCGGTACCGCCTATCGGTACCGCGGCACCGACGGCCGCCTCGACGGTGACCGACTGCCGGTCGACCTGGCCGCCAACGCCGCCAGCCTCGGCGCCGATGTCATCAAGGTCGCCACGGCCGCCGAGTTCACCGATGCGGTCAAGGTAGCCAAGGCCGCCGACCGGATCACCGTCATCCACGTCGAGACCGACCCGCGCGTCTACGCGCCCGACAGCCACTCGTGGTGGGACGTGCCGGTGTCGCAGGTGTCGGCGTTGGAATCGACACAGCAGGCCTACCAGCGCTACACGGAATGGAAGAAGGTGCAGCGCCCGCTGATCGCGCCGTCGGACGGCTGATCGCCGGGTGCGGGGGCTGCGGTCACCAGAGGCCCGGACGGGCCTGACCCCCACACCCGGCGATGCTTCATGTCGCCGAACTGCGTCTACAGGATGTAGAGCATCTCCTGGTAGGTGGGCAGCGGCCACAGGTCATCGGCCACCACACCTTCGAGTGCGTCGGCGGCCGCCCGCACGGCATCCATGGCCGGCAGCAACACCTTCTGCGCATGGGCGGCCTCGTCGAGCGCGGACTCCGCGGAGTGGTCCGACAACCCGGCCTTGAGCGCACCGACGGCCGCGGTCAGCTCGGCAATCGGTGTCGAAACCGCCTCCAGCAGCGTGGTATCGGCATCGAAACCAGCCGCCTTGAGGGTGGCGACGTTCTGCGCCAGTTCGGTCTGGTAGCGGACGGCAGCGGGCAGGATCACCGTGGTGCCGAGTTCGAGCGCGAGCTTGGCCTCCACCGCGATGGT
CAGCGCGTACTGCTCCAAGCGCACCTCGTAGCGGCTGTGCAGCTCGCGCTCGTTGAACACCCCGTACTTCTCGAACACCTCGACGGCCTCCGGGGTGATCAGCTCCGGGATGGCATCCAGCGTGGTCTTGAGATTCGGCAGACCACGCTCGGCCGCCTCGGTCTGCCAGTTCTCCGAGTAGCCGTCACCGTTGAAGACCACCGCGCCGTGCTCGGTGATGATGTCGGTGAGCAGCTGCTGCACCGCGGTGTCGAATTCGGTACCGTCCGCGACGGCCTTCTCCAATACCGTCGCCATGTAATCCAACGAGTCGGCCATGATGGTGTTCAGCACGATCATCGGCACCGCCACCGTCTGCCCGGAACCGGGCGCACGGAACTCGAACCGGTTGCCGGTGAAGGCAAACGGGCTGGTGCGGTTGCGGTCGCCCGGATCGGTCGGCAGCTGGGGCAGGGTGTCGACACCGATGTGCATGACGCCCTTGCCCTTCGACGAGGTCGCCGCGCCCTTGGCGATCTGCTCGAACACATCGGCCAGCTGCGCGCCGAGGAAGATCGAGATGATGGCAGGCGGCGCCTCGTTGGCGCCGAGGCGGTGGTCGTTGGTGGCCGAGGCCACCGAAACTCGCAGCAGCCCGGGGAATTGGTGGACGGCACGGATGACGGCGGCGCAGAACACCAGGAACTGCGCGTTCTCGTGCGGGGTGTCACCGGGAACCAGCAGGGAACCGAGTTCGGAATTGCCGACCGAGAAGTTCACGTGCTTACCGGAACCGTTGACACCGGCGAACGGCTTCTCGTGGAACAGACATTCCATGCCGTGCTTGCGGGCGATGGTCTTGAACACGGTCATCAGCAGCTGCTGATGATCGGAGGCGATATTGGCCCGTTCGAACATCGGCGCCACCTCGAACTGCGCGGGCGCCACCTCGTTGTGCCGGGTCTTGGCCGGGATACCGAGCTTGAACAGCTCGCGCTCGGTGTCCATCATGAAGCCCAACACACGCTCGGGCACCGCACCGAAGTAGTGGTCGTCGAACTCCTGACCCTTCGGCGGCTTGGCGCCGAAAAGCGTGCGGCCGGCGTTGATCAGGTCGGGCCTGGCCAGGAAGAAGTGCCGATCGACAAGGAAGTACTCCTGCTCGGGACCGCAGAACGAGACGACCTTCTCCAGGTTCTTGTGCCCGAACAGCGTCAGGATGCGCTCGGCGTGCACACCCATCGCCTGCTGGCTGCGCAGCAGCGGGGTCTTGTAGTCCAGGGCTTCACCGGTCATCGAGACGAAAACCGTTGGGATGCAGAGCGTGTTCCCGTTCGGATTCTCCAGGATGTAGGCCGGGCTGGTGACATCCCAGCCGGTGTAACCGCGCGCTTCGAAGGTGCTGCGCAGGCCGCCGGAGGGGAAGCTCGACGCGTCGGGTTCGCCCTGGATCAGCGTCTTGCCTGCGAACTCGGCCAGCGTCTGACCGTCGGAGACCGGCTCAAGGAAGCTGTCGTGCTTCTCGGCGGTGAGCCCGGTCATCGGGTAGAACACGTGCGCGTAGTGGGTCGCCCCCTTGGACAGTGCCCAGTCCTTCATCGCCGAGGCGACGGCGTCGGCGACGGCCGGGTCGAGAGTGGCGCCCTTCTCGATGGTCGCCACGACCGATTTGAACACCGACTTGGGCAGCCGCAGCTGCATCTCGGCCTTGGTGAAGACGTTGGCGCCGAAGATCTCTCCGGGGGCCTCGGCGGGGTCGAAGCTGATGGCCGGCGGCACATAGGCCTCGACGTTGTTGATGGCCTTAAGCCGGACTGCGTTACCGCTCAATGTAGTTCCTATCGCTGCACGTGCTTGACCGGCCAACGGTAGGAATATTCGATGCCGATTCTGTTACGCGCGCGTCAACGTCTGAATGCGCGGGCGGCGCCGTGGAATCGTCAGGCTCAGAAAAATCTTCTGTCTCTTTGACCCCTGGTGTCACATTCCATCCGGATTTGTCGGGTGTAGCACGTATGTGG
CGCTCCTAGTGTTTTAGTGGGCGTCTTTGTGCTGTTGAGGGTTGATTTGCAGTGCAGTGCAAGAGGTTCAGCGGGTCGTTCGTGAAATGAAGGACGCGCACGCGGCGTGCCTCTAGGTTTCGGGGTTACCACACCAACCTCAACCAGAGGATCGATCCGCGTGCGCGTCAGTACTGCATTTAACCGTCTTCTTCAGATTCCTGGTGCATCCGTGGTCGAGGTGTCGATCGGCGACCGCGACGTCGAAGTCACCCTGCGCCCCACAGCCCGACTGCTGAGGTGCCCCTGCAGCAAACGCGTCCGGTCAGTCTATGACCGCCGCCGGCGGCGATGGCGACACCTCGATCTGGGCACCAAACGACTGTGGCTGACCTACGACATCCGCCGACTGCACTGCCCGGATTGCGGTGTGACCACCGAAGAGGTGCCCTGGGCCCGCCCGGGAGCCAGGTTCAGCCGAGACTTCGAAGACACGGTGCTGTGGCTGGCCCAGCGCACCGACCGCACGACAGTGTCCACACTGATGCGCTGCGCCTGGGAATCGGTCACCTCGATCATCAAACGCGGAGTCGCTGAGCTGCTCGACCAACGTCGACTAAGGGCCCTGTATCAGATCGGTGTCGACGAGATCTGTTACCGCCACCCGCACCGCTATCTCACCATCATCGGCGATCACACGTCTGGCACCGTCATCGATGTCCAACCCGGAAAGAGTCGCGAATCACTCGCTAAATTCTATACAAGCCAACCAGATTCGACCCTCGCCGGAATCCAAGCGGTCACCATGGACGTCAGCAGCGTCTACACCGCAGCCACCCAAGAGCACCTGCCCCAGGCAACGATCTGCTACGACGGATTCCACATTCTGCAATGGGTCAACCGTGCACTGGACCGTGTCTTCTCCGAGGCTGCCGCTGGGCCAGACCGAGTCCACATGTCCTCAACGCAATGGCGGACCACCCGCTGGGCGCTGCGCACCGGAGAGGACAAACTCCCCGACGACAAACGCGCCCTGGTCAACGAGATCGCCAAACAGAACCGGCACGTCGGGCGGGCATGGGCACTCAAAGAACAAGCTCGCGACCTCTACCGCTACGACCACGAACCCGGCGCTGCACGCCAACTCCTCAGAGCCTGGATCACCGCCGCCAAACGCTCCCGCATCCCGGCCTTCGCCGCACTGGGCAAACGGTTCGAGGTCTACACCGAACCCATCCTCGCTGCGATTGAACTCAAACTGTCCAACGCACTCGCCGAAGGCATCAACGCCAAGATCCGACTCATCAACGCACGCGGCTACGGACATCACTCAGCCGAAACACTGACCTCGATGATCTACCTGTGCCTCGGCGGGCTCCACATCAAGCTCCCCACGAAAACCTGAGGAGTAGCACGTATGTTCGAATCATGACCACGGCGACGGTGCGAGTGACGGAGGCGGTAGCGAATTTGCGTGCCGCCTTCGACGCGTTCGCCGCCACCGACCTCGAGTCGCTGAGCGGTGCCGAGTTGATCGCCGTCATGGACGAATACGAGATGTTGACCTGCCGGCTTCCCGCTCAGCGGCATCGTCTGCTGACGCAGTTGCAAGCCGAGACCACGGCCCGCGAGATGGGTGCGAAGTCCTGGAACGAGGTGCTGCGGATCCGGTGGCGGCTCTCGACTGCAGAGGCCGACCGGCGGCTGCATGAGGCCGCCGACCTGGGGCCGCGACGGTCGTTGACCGGCGAACCGCTGCCTCCGCTTCTGCCAGTGGTCGCAGTCGCCCAGGCGGCCGGGCTGATCACCGGCGAACACGTCAAGGTGGTACGTGCGGCGGTGCGGGACCTGCCCGGCTCGGTCAGTACCGCCGACCGAGAGAAGTTCGAGGTCGCCCTGGTGCGCGAGGCCGTCGGGGCGGGGCCCAAGGCACTCGCCGAGTCCGCGGCCGACCGGTTGTTCCTGCTCGATCAGGACGGCCCGGTGCCCGACGAGCGTGAGCGGCAACGCAAGCGTGGGGT
GATCATCGGCAACCAACGCCGCGACGGGACCACACCGATCACCGGGAACCTCGATCCGGAAGCGATGGCGGTATGGGAACCCCTCTTCGCGAAGTTCGCCGCCCCCGGGATGTGCAACCCCGCCGATGACCGACCATGCACTCTCGGTACGCCCAGCCAGGATCAGATCGACCACGACCACCGCACCCAGGCCCAACGCCGGCACGATGCGATGATCGCCATCGGACGCATCGCCCTGATGTCCAACCCCGGTCAGCTCAACGGCCTACCGGTGGCGGTGATCATCCGCACCACCGTGCAGGAACTGCACTCGCTGGCCGGGATCGGCACCTCCGGCGGCGGCACGAAGATCCCGATCCGCGATGTCATCCGGATGGCCGGGCACGCCTCTCACCACCTGGCCATCTTCGACGGTGCCACCGGCGCGGCCCTGAACTACTTCCGCACTCGGCGCACCGCCAGCGCGGCGCAGCGGATCATGCTGATCGCCCGCGACGGGGGTTGCACCAAACCGTGCTGCACCAAGGGCCCGTACTTCTGCCAGGCACACCATGGAAAAGCCGATTTCGCCCGCGGTGGCAACACCAATGTCGATGACATGACGCTGGCGTGTGGCTGTGACAACCGGATGGTCGACGAGAATGGCGGCTACACGACGAAGTTCAACGCCCGCAACGAATGCGAATGGCATCCACCACCGGCACTGGAGCACGGCCAGGCCCGGGTCAACTACCACCACCGCCCGGAACTCCTCCGGCACCGGCCGCACGACGGTTCGGAATGGGCTGAACGCGTCGGGCAACTGGATCGACTGCTGTTTCCAGCCGAGTACCCGCCCGAGGATGCGTCCGAGGACAGGTGCGATGACGCCGGCTGGCCCGCAGACCCCGGCTGGCATGAAGACGATGGCTGGCATGAAGACGACGGCGACGACCTCACCGAGGAGAGGTTGCTCGAACAGTTCCGCGCAGTCACCGCCGATATCGACTTCTGGGAACTGGTCACCGGCGACCACCACCACGGCGGCACCGGTCTGCGCGGACCGTGACTCACCAGCCGTCGAGGTGGCCCTCGATATTCAGTCCGGCGGCCACCTCGACCAGCGCGGACGCCACGATGGAGGCCGGCACCCGGTCGAGCACCACGAGACCGATTGCCGGCTGGGCGCCGTGCTGCACCATCGGTCGTGCGGAAAATCCCTGCGGCAAGCCGAGAGTCGGCAACCAGGCCGTCGAGGCGATGGTCGCCCGGCCCGAACCGGTGAGATGCGCGTACAGCGCGTCGACCGAATCCGCCTCCACGGCCGGGCGATACTGCACTCCCTCGGCCGCCATGTTGGCGTCCAAGATCCGGCGATTGCGCATGGTGGTGGTCAGTACGCACAACTCCAACCTCGCGGCATCCACCCAGGCCACCTCTGGGGCCGCCACCAACGGATGGTCCGCGGGAGCAAGCAGCACATACCGTTCGCGATACAGCTCCACCGAACGGGTGCCCGGCGGTGCCTCGTCGTCGAGATACGTCAGGCCCGCGTCGATCTCGAAATCGGCCAATCGCCGCGCGATCTCCCGCGACGACAGCGCCTCGATGCGCACCGACGCCGCCGGGTTACGCACCAGGAACTCCGCGGAGATGAACGGGCTGACCGGCACGGCCGTCGGAATGGCCCCGATGCGGGCGGTGACGGTCAACCGACCGCGCATCCGGTCGATATCGGTGAGCATGTCATCGCGTTCGGCCACGATGCGCTGGGCCCAGGCCACCACCCGGCGCCCCTCTTCGGTGAATCCCTCGAAGCGGTGGCCGCGCTGGACGATGACGATGCCGAGGTCCTTCTCCAGCCGGCGGATGGCCACCGACAACGTCGGCTGACTCACGTGGCACCGCGCCGCCGCACGGCCGAAATGCCGCTCGGCGGCCAGCGCCAGCAGGTAGTCCAGATGCTGCAGCTGGATATCGTTACCCATCGATAGCAAGCGTCTATCACTGCATGTGAAATG
GCAAATATGATCTGGCCTGCACCGGACGCGGCCGGGTCTAGGTTGGAGTCATGACCGATGTCGACGCCATGTACCACGAGGACGATCTGGTCGTCAGCTCCCCCAAGGACGAGGCCGCCGGCGTCAAGGCGGTGATGGTCAGCATGCATCGAGCCCTTGAGCAGATGGGCCCGCTGCGCACGGCGGCCACCCTGACCAGACTCAACCAGCGGCACGGCTTCGACTGCCCAGGATGCGCCTGGCCCGAGGAGCACGGCGGCCGCAAGGTCGCCGAGTTCTGCGAGAACGGCGCCAAGGCGGTCGCCGAGGAGGCCACCAAGCGGACCGTCACCGCCGACTTCTTCGCCAGGCACACCATCGCCGACCTGTCCGAGAAACCCGAGTACTGGCTGTCCCAGCAGGGCCGACTGACCGAACCGATGGTGTTGCGGCCGGGCGACGAGCATTACCGTCCGATCGGCTGGGACGAGGCCTACCGGCTGATCGCCGACGAGATGCGGGCCCTGGACAGCCCGCACGAAGCGGCGTTCTACACCTCAGGACGCACCAGCAACGAGGCCGCCTTCCTCTACCAGCTCCTGGTCCGCAGCTTCGGCACCAACAACCTGCCCGACTGTTCGAACATGTGCCACGAGTCGTCGGGCACCGCGTTGATCGATTCCATCGGTATCGGCAAGGGCTCGGTCACGGTCGAGGACCTCACCGTCGCCGACCTGATCGTCATCGCGGGCCAGAACCCGGGCACCAACCACCCGCGCATGCTCTCCATTCTGGAGAAGGCAAAGGCCAACGGCGCCAAGATCATCGCCGTCAACCCCCTCCCCGAGGCCGGGCTGATCCGGTTCAAGGATCCGCAGAAGGTGCGCGGCGTGGTGGGCGATGGTGTGCCCATCGCCGACGAGTTCGTGCAGATCCGCCTCGGAGGGGACATGGCACTGTTCGCGGGACTCGGCCGGTTGTTGTTCGAAGCCCAGGATGCGGCCGGTGACAGCGCACAGATCGTCGACCGGGACTTCATCGCCCAGCACACGTCCGGATTCGAGAGCTACGAGCGCCAAACCCGGGCGGTCGATCTGGACACCGTGCTGGCCGCCACGGGTATCGACCGCGCCCAACTCGAGAAGGTGGCCCGCATGATGGCCACCTCCCAGCGCATCATCGTGTGCTGGGCGATGGGTCTGACCCAGCACAAGCACGCCGTACCCTCGATTGCCGAGATCACCAATCTGCTGCTGATGCGCGGCATGATCGGTAAGCCCGGCGCGGGTCTGTGTCCGGTCCGCGGGCATTCCAACGTCCAGGGCGATCGCACCATGGGCATCTGGGAAAAGATGCCCGACTCGTTCCTGGATGCCCTCGACGCCCGTTTCGGCATCGTCAGCCCGCGTGAGCACGGTTATGACACCGTGGATGCCATTCGCGCCATGCGCGACGGCCGGGCCAAGGTGTTCATGGCGATGGGCGGTAACTTCGCCTCGGCCACCCCCGATACCGAGGTCACCGAGCACGCGTTGCGCAATTGCACGCTCACCGTGCAGGTTTCGACCAAACTGAATCGCAGCCACCTCGTGCACGGCCGGACCGCACTGATCCTGCCGTCGCTGGGCCGCACCGACCGCGATATCCAAGCCGGCGGCAAACAGCTTGTGTCGGTCGAGGATTCGATGTCGATGGTGCATCTCTCCCGCGGCAGCCTGCACCCACCCAGTGATCAGGTGCGCAGCGAGGTGTCCATCATCTGCCAGCTCGCCCAGACGCTGTTCGGGCCGGAGCATCCGGTGCCGTGGCGCACTTTCAACGCCGACTACGACACCATCCGCGACGCCATCGCCGCCGTCGTGCCCGGCTGTGAGGACTACAACCGCCGGGTCCGCCAACCGGACGGATTCCAACTCCCGCATCCGCCACGGGATTCCCGCGAGTTCCCGACGATCACCGGCAAGGCCAACTTCGCCACCTACCCGCTGGAATGGGTACCGGTGCCGCCCGGCCGTCTGGTC
CTGCAGACCATGCGCAGCCACGACCAGTACAACACCACCATCTACGGTCTCGACGATCGTTACCGCGGCGTGAAGGGTGGCCGCCGAGTGGTGTTCGTCAACCCGGCCGATATCGCCGCCTTCGGCCTGACCGAGGGTGACACGGTCGACCTGGTGTCCGAGTTCGAGGGTCAGGAGCGCAGGGCCGAGTCGTTCCGTGTGGTCGCCTACGCCACTCCGGTCGGCAACGCGGCGGCGTACTACCCGGAGACCAATCCGCTGGTACCGCTGGACCACGTTGCGGCCCGGTCGAATACGCCGGTGTCCAAGGCGATCGTGGTGCGCCTGGAGAAGGTGTGCGTCCCGGCGACGGGAGGCGGCCATGGGTAGGGTCACCACCCGGGTGCGGGCCCAACACGTGACCGCCGGCGGCGCCGGCAATCCAGCCACCGCCGAGCGCGCGGTCGCCCGGCCCGAGACGCTGGCCGTCGAGGAGCCGCTGGAGATCAGGGTGAACGGGACCCCGCTGACGGTCACCATGCGCACACCGGGCTCCGATGTCGAACTGGCGCAGGGATTTCTGCTCACCGAGGGATTGATCGGCCGCCGTGCCGACATCGCCACCGTTCAGTACTGCAGCGGGGCGGGACCGGACGGGCTGAACACCTACAACGTGCTGGACGTGACGCTGGCACCCGGTGTCACGCTGCCGGATGTCGATGTCACCCGCAACTTCTACACGACCTCGTCGTGCGGTGTCTGCGGTAAGGCATCGCTGGAGGCGGTGCGGTTGAGCAGCAAGCATGGCCCCGGCGACGATCCGGTGACCGTCAGCACCGAAATGCTGGCAGCGCTGCCCGACCGGTTGCGCGCACGCCAGAAGGTGTTCGCCGCCACCGGCGGCCTGCACGGGGCGGCGCTGTTCGATACCGACGGCGAGCCGCTGGTGGTCCGCGAGGACATCGGGCGGCACAACGCCGTGGACAAGGTCATCGGCTGGGCGCTGGAAGCCGACCGGGTGCCGTTGACGGGCACCGTCCTGCTGGTCAGCGGTCGCGCCTCGTTCGAGTTGACCCAGAAAGCGGTGATGGCCGGTATCCCGGTGCTGGCCGCGGTGTCGGCCCCCTCCTCGCTGGCGGTGGATCTGGCCAGCCAGTCGGGTCTGACGCTGGTGGCGTTCCTGCGCGGGGATTCGATGAACATCTACACCCGGCCCGACCGGATCGTCTGATTCCGGGCCGCAAGCCCGACACACAAGGAGCCAAACACAAAAGCCCCCGCCGATCCGGCGGGGCCTTTTGTGTACGGCGTGCTAGGCGCTCTTGTCGCGACGTTCGCTGCGCGACGGCTTGCGCGGCACGATGGTCGGCAGCACGTTGTCCTGCACGGTCTCCTTGGTGACGACGACCTTGGCCACGTCATCGCGGCTCGGGATGTCGTACATCGCCGGCTGCAGGACCTCTTCCATGATGGCGCGCAGGCCACGGGCACCGGTGCCGCGGTGGATGGCCTGGTCGGCGATGGCGTCCAGGGCCTCCGGCGTCATCTCCAGTTCCACACCGTCCATCTCGAACAACCGGGTGTACTGCTTGACCAAGGCGTTCTTCGGGGTGGACAGGATCTGGACCAGCGACTCCTTGTCCAGGTTGGTCACCGAGGCGACGACCGGGAGACGGCCGATGAACTCGGGGATCAGACCGAACTTGATCAGGTCCTCGGGCATGACCTCGGCGAAGTGATCCTGGGTGTCGATCTCGGCCTTGGAGTGCACCTCGGCGCCGAAGCCCAGGCCACGCTTGCCGACCCGGTCGGAGACGATCTTCTCCAGGCCCGCGAACGCACCGGCCACGATGAACAGCACGTTGGTGGTGTCGATCTGGATGAACTCCTGGTGGGGGTGCTTGCGACCGCCCTGCGGGGGCACCGAGGCCTGGGTGCCCTCCAGGATCTTCAGCAGCGCCTGCTGCACACCCTCGCCGGAGACGTCGCGGGTGATCGACGGGTTCTCGCTCTTGCGGGCG
ATCTTGTCGACCTCGTCGATGTAGATGATCCCGGTCTCGGCACGCTTGACGTCATAGTCGGCGGCCTGGATCAGCTTGAGCAGAATGTTCTCGACATCCTCGCCGACATAACCGGCTTCCGTCAGTGCCGTCGCATCCGCGATGGCAAACGGGACGTTGAGCATCTTGGCCAAGGTCTGCGCCAGGTAGGTCTTACCGCAGCCGGTGGGGCCGAGCATGAGGATGTTGGACTTGGCCAACTCCACCGGCTCGGCACGCGAGTCGCGTGACTTCTCCTGCGCCTGGATGCGCTTGTAGTGGTTGTAGACCGCTACCGCGAGCGTCTTCTTGGCGGTGTCCTGACCGATGACGTAGCCCTCGAGGAACTCGCGGATCTCGGCAGGCTTGGGCAACTCATCGAGCTTGACGTCGTCGGCATCGGCCAACTCCTCCTCGATGATCTCGTTGCACAAGTCGATGCACTCGTCGCAGATGTAGACACCGGGCCCTGCGATGAGCTTCTTGACCTGCTTCTGGCTCTTTCCACAGAATGAGCACTTCAGCAGGTCTCCGCCATCTCCAATGCGCGCCATGTGGGAGGGTCCTACTTCCTGTTTGCAATCACTCAAAGGTGGGTTTCACCGGGTGTGAACCCGACGCTACCCGTTCGTTCCGTCACGGTGCGACCGAAGGGCCGAATCGCGTCGGTGGTATTTGTTCGCGTGCAGGAGAACATATCTCTCTTGTTGCCTTGCCACCCGGCGGCACGCGGACCGTGTCCCTGGCGTGTCGCCGTCGTTACTCCGATCGCGGTGGAAACAACCAGGTCGCGCCGTGCAGTGCGCCGCTGTCCACAATACCGTCGGCGGCAACTGGCAAACATCGGCAGTTCGCGGTCAGCGGGGCCCCAGCCCACCCGGACCCACCGGTCCCGGCCCGACAGGGCCCGGTCCCACCGGCCCCGGCCCCACCGGGCCGACGCCCACGGGACCGACCACGCCGCCGACACCTACCGGCCCGGCGGGTCCGACGACGGGGTTCACCCCGTAGGGCAGACACTCCACCACCGCCGGGTTCCAATATGTGCCCACCGGGCACACCGGCTCCTCGGCCTGGCCCACCGCCGGGTTGGCCAGTGTCATCCCCATCGCCGCGGCGATTCCCGCCATGCTCCATGCCAGCTTGTGCGCGCTCTTCACAGCCCACTCCCCTCGGTCTTCCCCGACCTGACGACAGCTTCGCATCACCGACGCGCGAGCGCATCCTTTTTGACGCAGACGTCAAGCAAGATCACCCGGGACAGCACACACCCCGGTCCGCGAGGCGAACCGGGGTGTGCGTGAAAGCTCTGATCAGGCGTTCTGGGCCGAGAGCTTGCGGTACTCCAGCACCGTGTCGATGATGCCGTAATCCTTGGCCTCGGCGGCCGTGAGGATCTTGTCCCGGTCGGTGTCCTTGCGGATGACCGCAGGATCCTTGCCGGTGTGCCGGGCCAGGGTGGAGTCCATCAGGCTGCGCATGCGCTCGATCTCGGCGGCCTGGATCTCCAGATCCGACACCTGACCCTGGATGGCCCCACCGACGGCGGGCTGATGGATCAGCACGCGGGCATTCGGCAGCGCCAGCCGCTTACCGGGGGTACCGGCGGCCAGCAGCACCGCGGCGGCCGAGGCGGCCTGCCCGAGGCAGACCGTCTGGATGTCGGCGCGGACGTACTGCATGGTGTCGTAGATCGCCATCAACGAGGTGAACGAGCCACCCGGGGAGTTGATGTACATGGTGATGTCGCGATCAGGATCGAGCGACTCGAGCACCAGCAACTGGGCCATGATGTCGTTGGCCGAGGCGTCGTCCACCTGCACGCCGAGGAAGATGATGCGCTCCTCGAACAGCTTGTTGTACGGGTTGGACTCCTTGACACCGAAGCTGGAGTGCTCGATGAACGACGGCAGGATGTAGCGCGATTGCATCGGGTTCATTTGATACCTGCTCCTGGTCCCTCGCCGTTGACGCTGACACTG
GTGATGATGTGGTCGACGAAACCGTATTCCAGGGCCTCCGAGGAGGTGAACCAGCGGTCGCGATCGGCGTCGGTTTCCACCCGCTCCAGGGACTGACCGGTGAACTGGGCGTTGAGCCGGTTCATTTCCTTCTTGGTCAGTGCGAACTGCTCGGCCTGGATCGCGATGTCGGCGGCGCTACCACCGATACCTGCCGACGGCTGGTGCATCATGATCCGGGCGTGCGGCAGGGCGTACCGCTTACCCTTGGTGCCGGCGGCCAGCAGGAACTGACCCATCGACGCGGCAAGGCCCATCGCGTAGGTCGCGACATCGCACGGAGCCAGCACCATGGTGTCGTAGATGGCCATACCCGCGGTGACCGAGCCACCGGGTGAGTTGATGTAGAGGTGGATGTCCTTGGTGGGGTCCTCCGCGGCCAGCAGCAGAATCTGCGCGCACAACCGGTTGGCGATGTCGTCATCGACCTGGGTTCCCAGGAAGATGATCCGCTCGGCGAGCAGTCGCTCGTATACCGAATCGATGAGGTTGAGCCCCGACGTGCCACCACGCATGTCAGTCACGGCTGGATACCTGCTTTCTTCGAGTTGTCGTACTCGTTCACCGACATTAACGAACGCGCGGCGAAGAGCACGCCCCAACAGGCGCGCTTTCGCTCACAGCGTCACTCCGAGGAGTCGTCGCTCTTCTTGTCGGACTTTTTCTTGGCCTTCTTCTTATCCTTCTTCTCCGCCTTTTCGGCGGCAGCATCCTCGTCGGCAGCGGCCTCGGACTCGGACTCTGCGTCAGCGGCCGGAGCCTCGACAGCCACGTCCTCGGCCACCTCGACCTCGGCGGTCTCCTCGCCACCGGCCGGTCCGAAGAACTCGGCGGTGTCGACCACGTTGCCCTCGGAATCGGTGACCGTGGCGCCCTGCACGACGGCGGCGATGGTCAGGCCGCGGCGGACATCGGCGAACATCGCCGGCAGCTGGTTGTTCTGCTGCAGCACCTGAAGCAGCTGCTGCGGCTCGATGCCGTACTGCCGCGACATCAGCACCAGCCGCTCGGTCAGGTCGGCCTGGCCCACCTGGATGTCGAGCTTGTCGGCGACGGCGTCCATCAGCAGCTGGGTCTTGACGGCCTTCTCGGCCTCGTTGCGGGTGTTCGTGTCGAACTCCTCGCGGCTGCTGCCCTGCTCGGTGAGGCTCTCGTTGAACTTGTCCTCGTCGTGGTCGAGGCCGTGGATGGCGTTGTGCAGGGTGTCGTCGATCTGCGCCTGCACGATGGCCTCGGGCAGCGGCACCTCGACATCGGCGAGCAGCACCTCGAGCGCCTTGTCGCGGATCTGCTCGGCCTGCTGGATGCGCTTGACCCGCTTGACCTGCTCGACGAGGCTCTCCTTGAGCTCGTCGATGGTGTCGAATTCGCTTGCCAGCTGCGCGAACTCGTCGTCTGCCTCGGGCAGCTCGCGCTCCTTGACGGACTTGACGGTGACGGTGACCTCGGCCTCCTGGCCGGCGTGCGGGCCGGCGGCCAGCGTGGTCGTGAAGACCTTGGACTCACCGGTCTTGAGCCCGATGATCGCCTCGTCGAGGCCGTCGATGAGCTGGCCGGAACCGATCTCGTGGGACAGGCCCTCGGTGGCGGCCTCGGGCACCTCGGTGCCGTCGACGGTGGCCGACAGGTCGATGGAGACGAAGTCGCCGTCGGCGGCGGCGCGCTCGACACCGGTCAGGGTGCCGAAGCGGGCACGCAGATTCTGCAGTTCGGTGTCGACCTCGTCATCGGCGATCTCGATCGGGTCGACGGTGATCTTCAGCGCGGTCAGATCGGGCAGCTCGATCTCGGGGCGGATGTCGACCTCGGCGGTGAACACCAGCTCCTCGTTGTCCTCGAGCTTGGTGACCTCGATATCGGGCTGGCCGAGCGGCTGGATCTCCGCGGACGTGACGGCCTCGCTGTAGCGGCTGGGCAGCGCGTCGTTGACGACCTGCTCCAGGACCGCGCCGCGGCCGATG
CGAGCCTCGAGCAGCTTGCGGGGGGCCTTGCCGGGACGGAATCCGGGCAGCCGCACCTGGCCGGCCAGCTGCTTGAACGCGCGGTCGAAGTCTGGCTCCAGCTCGGTGAAGGGCACCTCCACGTTGATCCGGACCCGGGTCGGGCTCAACTTTTCGACGGTGCTCTTCACTGCGTTACTCCTCGTAGATGGTTCGGTGTTGCTCGTACGGTCGGGGTGACAGGATTTGAACCTGCGGCCTTCCGCTCCCAAAGCGGATGCGCTACCAAGCTGCGCTACACCCCGTGCCTCTTCGTGCCTGTCTTCGGTCGACACGCACGCACGCCGACCACGCGAGATACTACGGGCATGTGTCGCGACGCCATCAATTGGATTTGATCGCGCCTTCGCTAGTACAGTCTCCGATGCACCGCATGCGGGCGTAGCTCAATGGTAGAGCCCTAGTCTTCCAAACTAGCTACGCGGGTTCGATTCCCGTCGCCCGCTCCATGCAACGCAGGTCAGGGCCATTTCGGAGTTTGCCAGCAAACCCGAGGTGGCCCTGATCCCGCATTTTCCCCGCACAATCACTTTTCAAGGCTGGTTGCTGGCCCACGACTCAAGCACCGCCCGCGTGTCCGGCCCGACTGTGACGCGCTCCACATAGTGCCCCTCGGTCGTCGCCAGCTGCGTGTGCGAGAGCTGACGTTGCGCCGCCTCCACGCCGAGCCCGTCCCGGACCACCGTGGCCACCGTGCGGCGGAAGCTATGCGGTGTCACCCACCGGAGATCCTCATGATCGGCCAGCGCCTCCCGCAGAGCCGACCTGATGTTGGTCAGGGCCACCAGGCCGCCGTCCCGGTTGGCCAGCACCGTGCCCTCCGGGCCGGTCACCGCGTACAGCTCTGTGAGCACCTCGACGCCGAACGTCGGCAAGCTGACGGTGTGTGCTGGAGCGCCGCCTTTGCGTGAGTCCTGGCGATGCAGAGGCTTCCCAGCGACCCTGCCCGAGTCAACCACCGTCCCGGTCACCGTGACGGTTGGCGGCGTGCCGAGCAGGTCGACGTCCTCCCAGCGGATCGCCAGCACCTCCCCCGGCCGCGCCCCGGTGGCCGCGAGCAGCTCCACAAACGCTGGGAGCATCCGCCCGCGCCGCGGCCCCGGCCCCTTGCGGTTGCAGAACGCCGCGACGGCCGCGCGCACCCTCTCGAATTCCATCGTCGTGAGCGCGCGGGCCGGCTTCCGCTCGGCCTTATTGGTCCGGGTCTCCCGGATCGGATTGTGCGCGATCACGTCGAACCTGACGGCCAGCGAGTACATGCCCGAGAGGATGACGCGCATGTAGGTCGCCGGCGCTGGCAGCAGCGCCTTCAGGTAGGCATCCGCTCGGCTCGTCGACAGCTCTCCCACCCTCAACGCGCCGAGCTGACCGTCACCGTGCAGCCGCCAAGTGTCGCGGTAGAGCGTGGCGGTGCGCTCGCCGATGCCGTTCTCTGCGACCTTGGCCGGCAACCACACCTCGAACAGCTCGGTCAGCGTAGTGCGCTGATTGATCGCGCCCGTGGGCTGCCCGGCGGCAAGCTCAGCCTGGATCCGACGTTTCAGCTCCCGCCGCGCATCCTCGGCCGACTTCCGTGACGACGCCTCACGCTCACGCAGCTTTCCATTGTGCAGCCGAACATAGGTTGTGGCACAAAATAACTCGCCGCGTCGTCCCACGGTAATCTTGCCGTGCTCGCCCGGCGCCATCCGCTGCCTAGGCATGATCCCTCAGCACAGCACCGGGAATCCGACCAAGGTCCGCCTCAAGTTGACTGAGCTTGTCCTCAGCGCTGATCAGCTCGTCCATCGACATGCCCCGTTTCTCGGCCTGAGATGCGGTCTCGCCCTTTGCTATTCCTGCCCGCTCGATCATCCTTTCAATGTGGCGGTCTAGCCGCTCGTGACGGTGCCGAACTTCAGCTAACTCCCTGGATTTGAGGAGCAGTCGGGCGCCATCCGAACCACCCGACCTCTCCTCAC
CCGCAAACCGGCGGACTGCGTCAATCGACGAAATCGACTCACCAGGAACAACTTCCACCATGCCATCCGGAAGCTCCGGATACAGGAGAGCAATCGGCGGCACATCCAGCGCGGCCGCAATGACAATCAGTTCCGCGATGCTCAAGACGCTGCCGCGATGTCCGGAATCTAGCTTCGCAATCACCGTCGGCGACACCCTGTAGCCAAGTTCAGCCGTTCGAGCACTTAGCCATGCCGCCGACTTCCCTCCCCGGGCTTGTTTCATCGCCTTGCCTATGCGCTCAACAAGCTCGCTTGCCCAGCGCTGACCCGAATCTTCATTCGCCATGCGACGAGTATGACGGACTTTCTTGCGTCTGTAGAAGATCGTGTTAGTCTCGTCGCCAAGCGAAGATCAAACCGATAGATCTTCGTTACACGAAGGAATCGAGGGGATGAAAGCAATCGCCGAACCACAACCGGACGACGATGACCGACTCAGTCAACTTTGGCCGGTAGAAGCAGTGATGGCCCGGCTGTCAGTTGGCAAGTCGACCGTCTTCGCACTGATCACGAGCGGGGAGCTGCGCAGCGTCAAGGTGGGTCGACGCCGACTGATTTCCGAGGCCGCCATCCGAGAGTTCATCCAGAAGGTCGACAACGGGGGCAGCGCCGCCTGATGTGGCCACAAGAGACTCACGCCGATCTGGGGCTGGTCGACGACGACCACACCCCGGATATGCGAATGGCGCCGGTTGCAGCCGACGCCATTCCGAACGAACAACCCCCCAGCACATATCGGAAGGAAGTTCAGCACCAATGATGACACAGAGCACGAACACTGACCACGACCTGTTGGCCGAAGTCTCGACCCCGGACGGTGTGACCGCGGGCCCGTGGCGCACCGACAGCCAGGGCCATCTCGAACGTGACTTGTCCGACGGGCGCATCCAGCGCGGAAACGGTGATATCGCCGAAGCGGCGGCTGGCGATGCATGCAGAATCCGCGTCGGCACGGTTACAAGCTGGCACGAAGTGGCAACCCACCTGACGCCCACGCAGAACACTTTCCTGCGTGACTCGGAGTCCCGCGGCGCCGAACCCGAAGATTTGGCAGAATACGCCCGCGGGCTGGCAGAACAGAATGTGCGCGACGCTGCGGTATTCGGCGATCTACCTGTGCCGCCGGATGCCACGGCCGTGTACACGTCCAACCAAATGCCTGACGGTCGATACTCCCGTGATTTCGTCGGAGCTCGACGCAGGATAGGGCTGTTGTGCCTCAGCGTCGAAGGCACCCAGTTCGATGACGGTGCTGTCCGCCGCCGGCTGCATATAGGCTTCGACAGCACCGACGGATCAGAACCCGAGTTTGACAGTCATCAGGTGCATCAGCTGATTTTGGTGCTCGGCCAGCTCGCTAGCGCCATTGAGCAGACGCAATGAGCGCGAAGGAAACTCCCGAGGTCAGTAGAGTCCGACAGCGCGCGTACCGGCTCAACCTGCGACTGATCAAGAGTGGCGAGGCGGTGCAGCTTCGCGACCAGGACGGCACGATAAAGCACAGCGGCAGCCTCGCCAGTGCAGACGCCTACATGGAACGCATCAGGTCATGGCAACGCGGGGGCAGTGTTCCGAAAAGCGTTCCAGTCGAATGGGTCCCATTGATAGACGGATACCTGAAGGAACAAGACGCTGCAGGCTTCTCGCAGAGCACTGTCAAACTACGACGCGAACAGCTCGGCAACATGGCCCGCGAGATCGGCGTTACTCCCGATAACGCCACTCGCCAAACCCTTACAGCGTGGCTCGCCCGGCATCGCGAGTGGAAACCTGAGACCCGCCGCAGCATGTTCAGCACACTCAAGAGCTTCTGCATCTGGGCAAGTGACAACGGGTGGTTTGCGACCAATCCAGCAGCAGAACTTCCGAAAGTGCGATTACCGCCGCCAGCAGCGCGGCCTGCACCCGACGACATATGGCACACATCGATCGCCCGGGCCCGCGGTAACCCCAGGGTC
ACACTTCTGCTGCGGTTGGCGAGCGAAGTCGGCCTGCGCCGAGCTGAGGCCGCCCGTGTCAGCACAGACGATCTGATGGGCGGCCTCGGCCGAGCGCAACTGCTGGTGCACGGCAAAGGCGGGAAGAAACGCGTCGTTCCGATCAGCGATTCGCTCGCCGCTGCGATCGCAGCAGGCGCTGCTGGACACACAACAGGGGCACCTTCGACCGGCTGGCTGTTCCCCGGGTCCAAACCGGATGCTCACCTGACACCAAAGCAGGTGGGCGACCTCATCCGGACAGTGATGCCCGAGGGCTGGAGTATGCACACTCTGCGGCATCGGTTCGCCACCCGCGCGTACCGCGGCAGTCGCAACATCCGGGCTGTGCAAACGCTTCTCGGACACGCTTCGGTGGCCACGACGGAACGGTACACCGCGGTCGATGACGACGAGGTCCGGGCCGCTATGTTGTCCGCGTCCGACTGGGGGGACGTCTCCGACTCGTGA
Protein sequences of DBSCAN-SWA_2 >NC_023036|4030864:4081277|4039294_4040272_-|WP_019513224.1|DBSCAN-SWA MISSTDLVETFPYPFSADSYRYTTNVEPAGAPVTTPVGRWGERVVDIDSEYEHELAERAAVLLADPSRYAVLPHMRPACWDVMLTLMRELAAAYPDSMSLQRDGDHWRWRNQRLGIDQTFVLGDDSSLPAEPLAYIAGQVQEDIVLLDQREGDLFGDAGVVTFAADWSFGFDVGMTFLEIHGPVPRLRATGVITRAREFLMRLQPGETYRRTNWTLTIGRRLDVSTERYPEWGPDRVMITGVGDEEFGRLVHLRVEVQHLIRLPESGAICFLIRTYMLPLADIAAVEPWRVRTAAVLADLPDDMADYKGIIGYRDRAVRWLRDAG >NC_023036|4030864:4081277|4056765_4057497_-|WP_023986051.1|DBSCAN-SWA MPLAVELDRSSPVPLYYQLAQAIEAAIRSGELAPGDRFENELSLAGRLALSRPTTRRAIQELVDKGLLVRKRGVGTQVVQNPVHRRVELTSLFDDLAKSGQDPTTQLLEYRRGVADDEIAAELNLSTDHEIVTVQRLRYANGEPLAIMTNHLPAEIAPDAGELEASGLYQALRGRGVHIRLARQRIGARPAQRTEARLLEEKVGAPLLTMSRTAFDDSGRAVEFGSHCYRASRYYFETTLVDR >NC_023036|4030864:4081277|4041212_4041746_-|WP_019513222.1|DBSCAN-SWA MKPDLDVTSVPGWAIAPTTPPAGTRGRSWTLIAFDSAAIAVAHEWQRQIVSAAAESAVRMHRAADIADAVTALRADLADATVGWRLMVAGPAYACLSLRADAVALGVADDEMTFASTEVTARSVQCVHCPTVNHVVVDLEDVTPCAGCGRNLLVYYHVSRRRGTYLGFMADAEEMPA >NC_023036|4030864:4081277|4074742_4075342_-|WP_019511796.1|protease|DBSCAN-SWA MRGGTSGLNLIDSVYERLLAERIIFLGTQVDDDIANRLCAQILLLAAEDPTKDIHLYINSPGGSVTAGMAIYDTMVLAPCDVATYAMGLAASMGQFLLAAGTKGKRYALPHARIMMHQPSAGIGGSAADIAIQAEQFALTKKEMNRLNAQFTGQSLERVETDADRDRWFTSSEALEYGFVDHIITSVSVNGEGPGAGIK >NC_023036|4030864:4081277|4066195_4067836_+|WP_019511803.1|DBSCAN-SWA MTTATVRVTEAVANLRAAFDAFAATDLESLSGAELIAVMDEYEMLTCRLPAQRHRLLTQLQAETTAREMGAKSWNEVLRIRWRLSTAEADRRLHEAADLGPRRSLTGEPLPPLLPVVAVAQAAGLITGEHVKVVRAAVRDLPGSVSTADREKFEVALVREAVGAGPKALAESAADRLFLLDQDGPVPDERERQRKRGVIIGNQRRDGTTPITGNLDPEAMAVWEPLFAKFAAPGMCNPADDRPCTLGTPSQDQIDHDHRTQAQRRHDAMIAIGRIALMSNPGQLNGLPVAVIIRTTVQELHSLAGIGTSGGGTKIPIRDVIRMAGHASHHLAIFDGATGAALNYFRTRRTASAAQRIMLIARDGGCTKPCCTKGPYFCQAHHGKADFARGGNTNVDDMTLACGCDNRMVDENGGYTTKFNARNECEWHPPPALEHGQARVNYHHRPELLRHRPHDGSEWAERVGQLDRLLFPAEYPPEDASEDRCDDAGWPADPGWHEDDGWHEDDGDDLTEERLLEQFRAVTADIDFWELVTGDHHHGGTGLRGP >NC_023036|4030864:4081277|4043751_4044075_-|WP_019513219.1|DBSCAN-SWA 
MKNIRKTFGMAAIAGALSAAPLMLATGTAHADSVNWDAVAACESGGNWAINTGNGYYGGLQFTMSTWQSNGGSGAPHSASREEQIRVAENVLRSQGIGAWPSCGRRG >NC_023036|4030864:4081277|4043029_4043539_-|WP_019513220.1|DBSCAN-SWA MIRFSSTVKSIARRALWVLIAAAGLSLAPMVLTATASADTVNWDAIAECESGGNWTVNSGNGHYGGLQFKQATWNANGGVGSPAGASRAEQIRVAENVLRSQGLKAWPKCGSRGATPAVWGTTPTLPQAPAPVTATGCQAIRGGAVLGILDFRQMCMALENVGRTFQPR >NC_023036|4030864:4081277|4033587_4034862_-|WP_019513228.1|DBSCAN-SWA MTSPTSPTSPQREFDIVLYGATGFAGKLTAQYLALAASGARIALAGRSLAKVRDVRDSLGPKAQDWQLVEADAGSPSTLADMAARTQVVITTVGPYTKYGLPLVQACAAAGTDYADLTGETLFIRDSAEQFHKQAVDTGARIVHSCGFDSVPSDISVYALYDRVTADGSGELTETNLVLRTFAGGVSGGTAASMIEFMRAAAEDPQARRDLEDPYTLTTDRGAEPELGHQSDNPWRRGRDIAPELDGIWTGAFAMAAPNSRIVRRSNALLDWAYGRHFRYAEQMSLGSSVLAPAAAALETAASVATFSLGSRYINKVPAGLLERILPKPGSGPSERTRDNGHYRIETYTTTSTGARYRATIAQQGDPGYKATAVLLAECGLALALDRDQLSDLRGVLTPAAAAGSALLTRLPAAGVTLETARLD >NC_023036|4030864:4081277|4057651_4058638_+|WP_019513206.1|DBSCAN-SWA MSSPPQPSTDPFDVIAIGRSGVDIYPLQTGVGLDEVDTFGKFLGGSAANVAVAAARLGNRTALISGTGDDPFGRFVRDELARLGVDNRYVGIHGRYPTPVTFCEIFPPDDFPLYFYRKPSAPDLQIEAGDIDADAVRTARLFWSTLTGLSEEPSRSAHFAAWAARARTPLTVLDLDYRPMFWADPAAAGEQAVRALGQVTVAVGNREECEIAVGETVPHRAADALLDLGVELAIVKQGPRGVLGKTRHSSVTVAPNDVDVVNGLGAGDAFGGSLIHGLLRNWPLEKTLRYANAAGAIVASRLECSTAMPTAAEVAELAEQSAVEAVNV >NC_023036|4030864:4081277|4051184_4052204_+|WP_019513213.1|DBSCAN-SWA MTTIGVIGLGRIGAFHTETLAGLDGIDGLVITDERPDVTAAVAAKHGATAVGSVEELLASGVDGVVVAAATPAHADLTLAAVERGIPTFCEKPIASTAAESARVAETIMCTGVPVQVGYQRRFDAAFAAAKTAVDNGSLGILHTVRSTTMDPAPPPLDYIKGSGGIFRDCAVHDFDVVRWITGQQAVEVYATGSVQGDPLFAEYGDVDTAAVVVRFDGGALGVISNARYNARGYDCRLEIHGFDDSVAAGWDQGAPLRNVDPANEFPTGPAHNFFMDRFTEAFRTELAGFLEVAKGGPVRGATVADAVEVAWMAEAATESLRRGTPVALESVKAKVGTA >NC_023036|4030864:4081277|4074119_4074746_-|WP_019511797.1|protease|DBSCAN-SWA MNPMQSRYILPSFIEHSSFGVKESNPYNKLFEERIIFLGVQVDDASANDIMAQLLVLESLDPDRDITMYINSPGGSFTSLMAIYDTMQYVRADIQTVCLGQAASAAAVLLAAGTPGKRLALPNARVLIHQPAVGGAIQGQVSDLEIQAAEIERMRSLMDSTLARHTGKDPAVIRKDTDRDKILTAAEAKDYGIIDTVLEYRKLSAQNA 
>NC_023036|4030864:4081277|4049008_4049830_-|WP_023986048.1|DBSCAN-SWA MTVLGNLAIDVINGAAPSPGGCASFAGVALQSAPGSTHIAAMGERRHHELFGPVLDRFGALVRLLPADRTSAFSLDYDDTDHRHMSVSAIGPVWRPGDIDADDPRTTWIHLAPLLRTDFPADTLAHLAARGHRIAYDGQGLVRADRVGPLVLDRDFSPELLRHLSVLKLAEDEAVIVADGAFDLAAAHRLGVPEILVTYGSEGCDIYRDGSVTRVPAAWRVMDVQTTGAGDMFTASYVAHRAAGANPVRAVEIASELVARELQKRAQMPSTKA >NC_023036|4030864:4081277|4030864_4033516_-|WP_019513229.1|tRNA|DBSCAN-SWA MTASPSSRADALPKSWEPGAVESDLYQGWVDAGYFTADPGSDKPPYSIVLPPPNVTGSLHMGHALDHTLMDALTRRRRMQGYEVLWLPGMDHAGIATQSVVEKQLAADGKTKEDLGRELFIERVWDWKRESGGTIGGQMRRLGDGVDWSRDRFTMDEGLSRAVRTIFKRLYDAGLIYQAERLVNWSPVLQTAISDLEVKYEDVEGELVSFRYGSMNDDEPHIVVATTRLETMLGDTAVAVHPDDERYRALVGTTLPHPYLDREIVIVADAHVDPEFGTGAVKVTPAHDPNDFEIGLRHNLPMPSILDTTGAITGTGTRFDGMDRFAARVAVREALAAEGRIVAEKRPYLHSVGHSERSGEPIEPRLSLQWWVKVDALAKAAGDAVRKGDTVIHPASLEPRWFAWVDNMHDWCISRQLWWGHRIPIWHGPDGQQVCLGPDETPPPGWEQDPDVLDTWFSSALWPFSTMGWPDATPELQKFYPTTVLVTGYDILFFWVARMMMFGTFVSGDDAVPAPATGRPQVPFENVFLHGLIRDEFGRKMSKSKGNGIDPLDWVEKFGADALRFTLARGASPGGDLSIGEDHARASRNFATKLFNATKFALMNGAELTDLPGADQLTDADRWILGRLEEVRAEADAALEAYEFSRACEALYHFAWDEFCDWYLELAKVQVAQGHSHTGAVLAAVLDTLLKLLHPVMPFVTETLWKALTGGESVVIADWPTVTAHGVDATATRRITDMQKLVTEVRRFRSDQGLADRQKVPARLTGLDASDIESQLPAVTALAWLTDPTEDFAASAAVEVRLTNGTVTVEVDTSGTVDVAAEKRRLEKDLAAAQKELAQTTGKLGNADFLAKAPENVVEKIKGRQQLAAEEVERITARLADLK >NC_023036|4030864:4081277|4068838_4071157_+|WP_019511801.1|DBSCAN-SWA 
MTDVDAMYHEDDLVVSSPKDEAAGVKAVMVSMHRALEQMGPLRTAATLTRLNQRHGFDCPGCAWPEEHGGRKVAEFCENGAKAVAEEATKRTVTADFFARHTIADLSEKPEYWLSQQGRLTEPMVLRPGDEHYRPIGWDEAYRLIADEMRALDSPHEAAFYTSGRTSNEAAFLYQLLVRSFGTNNLPDCSNMCHESSGTALIDSIGIGKGSVTVEDLTVADLIVIAGQNPGTNHPRMLSILEKAKANGAKIIAVNPLPEAGLIRFKDPQKVRGVVGDGVPIADEFVQIRLGGDMALFAGLGRLLFEAQDAAGDSAQIVDRDFIAQHTSGFESYERQTRAVDLDTVLAATGIDRAQLEKVARMMATSQRIIVCWAMGLTQHKHAVPSIAEITNLLLMRGMIGKPGAGLCPVRGHSNVQGDRTMGIWEKMPDSFLDALDARFGIVSPREHGYDTVDAIRAMRDGRAKVFMAMGGNFASATPDTEVTEHALRNCTLTVQVSTKLNRSHLVHGRTALILPSLGRTDRDIQAGGKQLVSVEDSMSMVHLSRGSLHPPSDQVRSEVSIICQLAQTLFGPEHPVPWRTFNADYDTIRDAIAAVVPGCEDYNRRVRQPDGFQLPHPPRDSREFPTITGKANFATYPLEWVPVPPGRLVLQTMRSHDQYNTTIYGLDDRYRGVKGGRRVVFVNPADIAAFGLTEGDTVDLVSEFEGQERRAESFRVVAYATPVGNAAAYYPETNPLVPLDHVAARSNTPVSKAIVVRLEKVCVPATGGGHG >NC_023036|4030864:4081277|4060383_4062333_+|WP_023986052.1|DBSCAN-SWA MVSTAPKAADKLADTEATVRLTVAQATIAFLAAQYVERDGDRTPFFAGCFGIFGHGNVAGLGQALLQDEIEAQAAGRAPRMPYVLGRNEQAMVHSAVAYARQKDRLQTWAVTASVGPGSTNMLTGAALATINRLPVLLLPADTFATRVSSPVLQELELPSSGDVTVNDAFKPLSRYFDRVWRPEQLPAALLGAMRVLTDPVETGAATVSIPQDVQAEAHDWPESLFAERTWHIARPLPERAVVARAAALIAAARRPLIIAGGGVHYSGAEAALTALAEQTGIPVAESQAGKGSLRHDHPQSVGAVGSTGSTAANALATDADVVIGIGTRYSDFTSASRTAFNNPQVRFVNINVASLDAVKQGGVSVVADAREAIEALGPALADYRVPDEYRSHISELAGEWDAAVSAAFATEDGAQLNQNQVIGLVNSLSDPRDVVVCAAGSMPGDLHKLWRSRDRKSYHVEYGFSCMGYEIAGGIGVRMAAPDRDVFVMVGDGSYLMMATEIATAVQEGVKVIPVLVQNHGFASIGGLSESLGSQRFGTAYRYRGTDGRLDGDRLPVDLAANAASLGADVIKVATAAEFTDAVKVAKAADRITVIHVETDPRVYAPDSHSWWDVPVSQVSALESTQQAYQRYTEWKKVQRPLIAPSDG >NC_023036|4030864:4081277|4055566_4056589_-|WP_023986050.1|DBSCAN-SWA MTARANSRGIRSLRRWAALTGAGVLALGMAACSSTGGNPDTAGGNGGSGGGTVDTPRVTIAMVTHEAPGDSFWDLIRKGAETAAKKDNVELRYSSDPEAPNQANLVQSAVDANVDGIAVTLAKPDAMRAAVQAAAAKGIPVVAFNAGMDTWQDMGVQEFFGQDGYIAGQGAGERVAKDGAKKAICIIHEQGNVDLEARCAGMKNTFGATEILNVNGKDMPSVESTITAKLQQDPAIDMIVALGAPFALTAVTSAKNANSSAKIGTFDTNAALVDAIKGGDIQWAVDQQPYLQGYLAVDSLWLYLNNKNVIGGGKPTLTGPAFIDNTNIDAVAELAKGGTR >NC_023036|4030864:4081277|4080362_4081277_+|WP_081650101.1|integrase|DBSCAN-SWA 
MERIRSWQRGGSVPKSVPVEWVPLIDGYLKEQDAAGFSQSTVKLRREQLGNMAREIGVTPDNATRQTLTAWLARHREWKPETRRSMFSTLKSFCIWASDNGWFATNPAAELPKVRLPPPAARPAPDDIWHTSIARARGNPRVTLLLRLASEVGLRRAEAARVSTDDLMGGLGRAQLLVHGKGGKKRVVPISDSLAAAIAAGAAGHTTGAPSTGWLFPGSKPDAHLTPKQVGDLIRTVMPEGWSMHTLRHRFATRAYRGSRNIRAVQTLLGHASVATTERYTAVDDDEVRAAMLSASDWGDVSDS >NC_023036|4030864:4081277|4079586_4080213_+|WP_019511790.1|DBSCAN-SWA MMTQSTNTDHDLLAEVSTPDGVTAGPWRTDSQGHLERDLSDGRIQRGNGDIAEAAAGDACRIRVGTVTSWHEVATHLTPTQNTFLRDSESRGAEPEDLAEYARGLAEQNVRDAAVFGDLPVPPDATAVYTSNQMPDGRYSRDFVGARRRIGLLCLSVEGTQFDDGAVRRRLHIGFDSTDGSEPEFDSHQVHQLILVLGQLASAIEQTQ >NC_023036|4030864:4081277|4078522_4079116_-|WP_023986053.1|DBSCAN-SWA MANEDSGQRWASELVERIGKAMKQARGGKSAAWLSARTAELGYRVSPTVIAKLDSGHRGSVLSIAELIVIAAALDVPPIALLYPELPDGMVEVVPGESISSIDAVRRFAGEERSGGSDGARLLLKSRELAEVRHRHERLDRHIERMIERAGIAKGETASQAEKRGMSMDELISAEDKLSQLEADLGRIPGAVLRDHA >NC_023036|4030864:4081277|4064948_4066172_+|WP_023985132.1|transposase|DBSCAN-SWA MRVSTAFNRLLQIPGASVVEVSIGDRDVEVTLRPTARLLRCPCSKRVRSVYDRRRRRWRHLDLGTKRLWLTYDIRRLHCPDCGVTTEEVPWARPGARFSRDFEDTVLWLAQRTDRTTVSTLMRCAWESVTSIIKRGVAELLDQRRLRALYQIGVDEICYRHPHRYLTIIGDHTSGTVIDVQPGKSRESLAKFYTSQPDSTLAGIQAVTMDVSSVYTAATQEHLPQATICYDGFHILQWVNRALDRVFSEAAAGPDRVHMSSTQWRTTRWALRTGEDKLPDDKRALVNEIAKQNRHVGRAWALKEQARDLYRYDHEPGAARQLLRAWITAAKRSRIPAFAALGKRFEVYTEPILAAIELKLSNALAEGINAKIRLINARGYGHHSAETLTSMIYLCLGGLHIKLPTKT >NC_023036|4030864:4081277|4072079_4073360_-|WP_019511799.1|protease|DBSCAN-SWA MARIGDGGDLLKCSFCGKSQKQVKKLIAGPGVYICDECIDLCNEIIEEELADADDVKLDELPKPAEIREFLEGYVIGQDTAKKTLAVAVYNHYKRIQAQEKSRDSRAEPVELAKSNILMLGPTGCGKTYLAQTLAKMLNVPFAIADATALTEAGYVGEDVENILLKLIQAADYDVKRAETGIIYIDEVDKIARKSENPSITRDVSGEGVQQALLKILEGTQASVPPQGGRKHPHQEFIQIDTTNVLFIVAGAFAGLEKIVSDRVGKRGLGFGAEVHSKAEIDTQDHFAEVMPEDLIKFGLIPEFIGRLPVVASVTNLDKESLVQILSTPKNALVKQYTRLFEMDGVELEMTPEALDAIADQAIHRGTGARGLRAIMEEVLQPAMYDIPSRDDVAKVVVTKETVQDNVLPTIVPRKPSRSERRDKSA >NC_023036|4030864:4081277|4046769_4048698_-|WP_019513216.1|DBSCAN-SWA 
MGQNGSTGAPRQKLEKVVIRFAGDSGDGMQLTGDRFTSEAALFGNDLATQPNYPAEIRAPQGTLPGVSSFQIQIADYDILTAGDRPDVLVAMNPAALKANVGDLPRGGLIIANSDEFTKRNLAKVGYDTNPLENEELSDYVVQSVAMTTLTLGAVEAIGATKKDGQRAKNMFALGLLSWMYGRELEASEAFIREKFARKPEIAEANVLALKAGWNYGETTEAFATTYEVAPAKLKSGEYRQISGNTALAYGVVAAGHLSGIQVVLGTYPITPASDILHELSKYKHFNVLTFQAEDEIAGIGAAIGASYGGALGVTSTSGPGVSLKSEAIGLAVMTELPLLVIDVQRGGPSTGLPTKTEQADLLQALYGRNGESPVAVLAPCSPSDCFDIAVEAVRIAIGYHTPVIILSDGAIANGSEPWRIPDISTYPAIEHKFAKSGEPFEPYARDPETLARQFAVPGTPGLEHRIGGLEAANGSGNISYEPKNHDLMVRLRQLKIDGITVPDLVVDDPTDDAELLMLGWGSSYGPIGEACRRARRKGIKVAHAQLRHLNPFPANTEEVLRKYPKVVVPEMNLGQLALLLRGKYLVDIQSVTKVEGMAFLADEVEGIIDSALDGTLREKETDKAKFARLAAATVESVGANV >NC_023036|4030864:4081277|4059511_4060381_+|WP_019513204.1|DBSCAN-SWA MNSSWYIPAGSADAPYSVAVTPESAGWSECGLHVLDLGTDGTVALQTGDTEVMILPLAGGGSVECADDVFELGTRTSVFDGPADMVYLGVGQSYVVTGHGRIAVCAARASRSLPNRRRAAADVPVELRGAGNCSRQVHNFGTADTFEADSLIACEVITPGGNWSSYPAHKHDEDSYVESQLEEIYYFEIDDSPAGTPGFGYHRVFGTPARPIEVLEEVRTGDVVLVPHGYHGPSIAAPGHHMYYLNVMAGSGPERAWRICDNPDHTWLRASWDHQEIDPRLPMRTNRGV >NC_023036|4030864:4081277|4073663_4073966_-|WP_019511798.1|DBSCAN-SWA MKSAHKLAWSMAGIAAAMGMTLANPAVGQAEEPVCPVGTYWNPAVVECLPYGVNPVVGPAGPVGVGGVVGPVGVGPVGPGPVGPGPVGPGPVGPGGLGPR >NC_023036|4030864:4081277|4077360_4078530_-|WP_019511794.1|integrase|DBSCAN-SWA MPRQRMAPGEHGKITVGRRGELFCATTYVRLHNGKLREREASSRKSAEDARRELKRRIQAELAAGQPTGAINQRTTLTELFEVWLPAKVAENGIGERTATLYRDTWRLHGDGQLGALRVGELSTSRADAYLKALLPAPATYMRVILSGMYSLAVRFDVIAHNPIRETRTNKAERKPARALTTMEFERVRAAVAAFCNRKGPGPRRGRMLPAFVELLAATGARPGEVLAIRWEDVDLLGTPPTVTVTGTVVDSGRVAGKPLHRQDSRKGGAPAHTVSLPTFGVEVLTELYAVTGPEGTVLANRDGGLVALTNIRSALREALADHEDLRWVTPHSFRRTVATVVRDGLGVEAAQRQLSHTQLATTEGHYVERVTVGPDTRAVLESWASNQP >NC_023036|4030864:4081277|4039010_4039229_+|WP_019513225.1|DBSCAN-SWA MFSPIENYDEAGVVFTFGGYSFGVWLFFILAVLLFVGFFVRMIQHENKAYKAIIEHTPVEPGPAAEGEPTAY >NC_023036|4030864:4081277|4071149_4071998_+|WP_019511800.1|DBSCAN-SWA 
MGRVTTRVRAQHVTAGGAGNPATAERAVARPETLAVEEPLEIRVNGTPLTVTMRTPGSDVELAQGFLLTEGLIGRRADIATVQYCSGAGPDGLNTYNVLDVTLAPGVTLPDVDVTRNFYTTSSCGVCGKASLEAVRLSSKHGPGDDPVTVSTEMLAALPDRLRARQKVFAATGGLHGAALFDTDGEPLVVREDIGRHNAVDKVIGWALEADRVPLTGTVLLVSGRASFELTQKAVMAGIPVLAAVSAPSSLAVDLASQSGLTLVAFLRGDSMNIYTRPDRIV >NC_023036|4030864:4081277|4054487_4055546_-|WP_019513209.1|DBSCAN-SWA MSTQTEVTLENHTVVRDERVKERNRLQRILIRPEMGAGIGAIGIFIAFLVVAAPFREASSLATVLYASSTIGIMACGVALLMIGGEFDLSAGVAVTFSSLAASMLAYNLHLNLWVGAALALVLSLAVGFFNGFLVMKTKIPSFLITLSTFFMLAGINLAVTKLVAGQVATQSVNDMAGWESAQKVFSSSFTVFGVGIRVTVLWWLVFTVVATWVLFKTRIGNWIFAVGGDAESARAIGIPVTKVKIGLFMFVGFCAWFVGMHLLFAFNTVQSGQGIGNEFFYIIAAVIGGCLLTGGYGTAVGAAIGAFIFGMTNQGIVYAGWDPDWFKFFLGGMLLFAVIANNAFRNYAAKK >NC_023036|4030864:4081277|4052206_4053067_+|WP_023986049.1|DBSCAN-SWA MKLAGAPISWGVCEVPGWGHQLAPARVLAEMRGVGLTATELGPEGFLPADPTELTSVLAEHQLSCVGGFVPVILHQVDHDPAEELAGPLDSLIAAGAGVVVLAAATGADGYDSRPVLDEAQWNTLLANLDRLAGIVADRGLLAVLHPHVGTIVETRAEVDRVLGGSSIPLCLDTGHLLIGGTDPLELAKAVPQRIAHAHLKDVDAALAAKVQSGELSYTAAVKAGMYTPLGTGDVDIEAIVGVLRDNGFDGWFVMEQDTILDGAPAGDGPVADVRASVAFLNGIMA >NC_023036|4030864:4081277|4049996_4050998_-|WP_019513214.1|DBSCAN-SWA MHRYKVREIAQQCGLSEATVDRVLNDRPGVRENTRAEVLQAIADLDKQRAQLRLNGRRYLIDVVMQTPQRFSDAFRAAVEAELPAFAPAAVRARFHLWESGSAARTVEVLGRIKGSHGVILKAQDEPEVAEQVDRLVGAGVPVVTYTSDVANSDRTAYVGIDNHGAGLTAAYLVDQWLGDAPADVLITLSRTVFRGEGEREVGFRAGLRGSRREIVEISEGDGIDANTEALVLDALRRHPGIEAVYSPGGGNAATVAAFEKLGRACRVFVAHDLDVDNRRLLRQGKLSVVLHNDLRADARLAMRVILARQRALPEESVRPAPIQVVTPHNVPG >NC_023036|4030864:4081277|4040280_4041216_-|WP_019513223.1|DBSCAN-SWA MRTLKLVVTAVDDTVSGIRTLALSDPDGAPLPSFTPGSHIVLECGAVANAYSLTGDGTAPDSYEISVLRCDSGSGGSLWLHDRVTLGDTVIASPPRSAFAPVLRARKHLLVAAGIGITPMVSHLRSARVWGRHTELLYVHRPGRAVHVDTVTRLADDVSIHTGRAGFDSALRTALAGQPFGAHLYVCGPAAFIADVTAAAVELGWPDSRIHLEHFGSVALDPGEPFTARIASTGEQFVVEAGVSLLEALERRGFDVPNLCRKGVCGECRIPVAGGAITHRDLFLGDDDRRAGDTLMACVSRGDGGTLEVAL >NC_023036|4030864:4081277|4045693_4046773_-|WP_019513217.1|DBSCAN-SWA 
MTDLIGSDLGLTEALSKTALVPTTDQPQKGKDFTSDQEVRWCPGCGDYVILNTIRNFLPELGLRRENIAFVSGIGCSSRFPYYLETYGFHSIHGRAPTIATGLALAREDLSVWVVTGDGDSLSIGGNHLIHALRRNINITILLFNNRIYGLTKGQYSPTSEVGKVTKSTPMGSLDYPFNPVSLALGAEATFVGRALDSDRKGLTEVLRGAAQHRGAALVEIMQDCPIFNDGSFDALRKEGAEDRLINITHGEPITFGADGEYAVVKSGFGLEIAKTADVPADQIVVHDATIDDPAYAFALSRLSEQNLDHMVMGIFRQVNKPTYDDAARQQVAAAREAKVHDTAALQSLLRGKDTWSVD >NC_023036|4030864:4081277|4045114_4045684_-|WP_019513218.1|DBSCAN-SWA MTPRTAPAAVVLAGGASRRMGRDKATLVFEGRTLVERVVDTVSARCDPVFVVAAPGQALPTVPAQILRDEVRGLGPLLATARGLRAAADAGAEWAFLCAVDMPHLTASFIDELLEPAATTPAAVVLVWDGRDHYLAGLYRTSLAGVADDLVAGGERSMRALVDAVDTQRIVTEPQRALTNVNTPDDLPV >NC_023036|4030864:4081277|4067837_4068755_-|WP_019511802.1|DBSCAN-SWA MGNDIQLQHLDYLLALAAERHFGRAAARCHVSQPTLSVAIRRLEKDLGIVIVQRGHRFEGFTEEGRRVVAWAQRIVAERDDMLTDIDRMRGRLTVTARIGAIPTAVPVSPFISAEFLVRNPAASVRIEALSSREIARRLADFEIDAGLTYLDDEAPPGTRSVELYRERYVLLAPADHPLVAAPEVAWVDAARLELCVLTTTMRNRRILDANMAAEGVQYRPAVEADSVDALYAHLTGSGRATIASTAWLPTLGLPQGFSARPMVQHGAQPAIGLVVLDRVPASIVASALVEVAAGLNIEGHLDGW >NC_023036|4030864:4081277|4062419_4064597_-|WP_019513202.1|DBSCAN-SWA MSGNAVRLKAINNVEAYVPPAISFDPAEAPGEIFGANVFTKAEMQLRLPKSVFKSVVATIEKGATLDPAVADAVASAMKDWALSKGATHYAHVFYPMTGLTAEKHDSFLEPVSDGQTLAEFAGKTLIQGEPDASSFPSGGLRSTFEARGYTGWDVTSPAYILENPNGNTLCIPTVFVSMTGEALDYKTPLLRSQQAMGVHAERILTLFGHKNLEKVVSFCGPEQEYFLVDRHFFLARPDLINAGRTLFGAKPPKGQEFDDHYFGAVPERVLGFMMDTERELFKLGIPAKTRHNEVAPAQFEVAPMFERANIASDHQQLLMTVFKTIARKHGMECLFHEKPFAGVNGSGKHVNFSVGNSELGSLLVPGDTPHENAQFLVFCAAVIRAVHQFPGLLRVSVASATNDHRLGANEAPPAIISIFLGAQLADVFEQIAKGAATSSKGKGVMHIGVDTLPQLPTDPGDRNRTSPFAFTGNRFEFRAPGSGQTVAVPMIVLNTIMADSLDYMATVLEKAVADGTEFDTAVQQLLTDIITEHGAVVFNGDGYSENWQTEAAERGLPNLKTTLDAIPELITPEAVEVFEKYGVFNERELHSRYEVRLEQYALTIAVEAKLALELGTTVILPAAVRYQTELAQNVATLKAAGFDADTTLLEAVSTPIAELTAAVGALKAGLSDHSAESALDEAAHAQKVLLPAMDAVRAAADALEGVVADDLWPLPTYQEMLYIL >NC_023036|4030864:4081277|4037678_4039010_+|WP_023986045.1|DBSCAN-SWA 
MTPEVEDALATMASVNNEFFYWMSIALMMLIHAGFLAYEVGASRSKNVLATAMKNLLAFATIVASFYFVGWFLYNAMPSGFIEFNDAAKAALPWGDNMGPNTADSASGIFWGAFALFAATTGSIMSGAVLERIRTSGFLVLTVLVGSVTWIIGAAWGWHGAGWMLTKLGFHDVGAAGCVHMIAGFATLGILINLGPRIGRFGPDGKPVTIRPHNLPLTMVGLMLIFTGFFGFLMGCVIYAGDGFTTIYASPTTLSAFAFNTLMGLAGGIIGAYLTSRGEPFWTISGGLCGVIGVASGMDLYHPALGFVIAFGAGALAPFIGKLLERFKIDDVVGAVSVHGGIGLYSVLMAGVFLSGYPNTDGNPSVSLWGQMIGALVFATLGFVPTYVVSLLLKKVGLLRIPAAVEEQGLDLSEVPATPYPEGIPITTMPLNGGKALLIAEAK >NC_023036|4030864:4081277|4034873_4036838_-|WP_019513227.1|DBSCAN-SWA MTPFDDLDSYLALPRVAELAVSPTGDRLVTTVSELSSEGTEYRTAIWELDPAGIGSARRLTRGGKGESAPVFTSDGDLLFLASRPTSGQESAPAALWRLPKEGGEAVEEMSLPGGVSAVHVAGDAPRTVVTTSLLRSAAGVDEDERLRALRKDNKVSAILHRGYPVRHWDHDLGPELPHLFDTDGRRDLTPEPGAALRDTALDISGDGTFLVSSWYGPVAGVALRSQIVRIDTATRQRTVIADDPGADLDAPALAPDGRRIAFLRETTSTPEQAPRITLCWGEIGQAWTELTEWDRWPASVTWSADGSRLIVTADDNGRHPIFAVDPATRTVARLTEDDQAYSDVCTAPGGVIFALRSSYLAPPHPVRIDPDGTITILPCVELPKLPGTVEELTARAPDGTPVHSWLVLPGGTEPAPLLLWVHGGPLGSWNTWHWRWNPWLLAAEGYAVLLPDPALSTGYGQDFIQRGWGAWGGPPYTDLLAATDAACEHPRIDRQRTAAMGGSFGGYMANWIAGHTDRFTAIVSHASLWALDQFGPTTDGAYWWEREMTEHMTQQNSPHHHVGEIRTPMLVIHGDKDYRVPIGEGLRLWYELLSRSGLPADEDGTSPHQFLYFPSEGHWVAAPQHTKIWYQVISAFLGNHLFGRDMFVPEILG >NC_023036|4030864:4081277|4041858_4043013_+|WP_019513221.1|DBSCAN-SWA MPQRFLVVGAGIAGLATAVALRRVGHDVRVIEGRAESEVGTGAGISIWPNALAALDVLGLGDEVRAAGGRVGAGAVRWRDGRWLRRPATDRMVRALGEPLVVIHRRALTDILAAALPAGTVTHDCAAARVRVTPETAGVVLSDGAVLDADAVIGADGVDSMVARNLNGTLAKRYAGYTAWRGIAEHALEPDLAGETMGPGLEVGHVPLGTRHTYWFATQRAPQGATAPDGELAYLQQVFGEWPEPIPALLAATDPAAVLRNDLYDRARARRWSSGRAVIVGDAAHPMRPHLGQGGCQGLEDAAILAAMTAEDADLPAAFARFAAFRSPRVLALVREARTIGQIVNLRPPLLSAAASRASTLVPEALLTRHLAGIAGRSAFVMPA >NC_023036|4030864:4081277|4075452_4076958_-|WP_019511795.1|DBSCAN-SWA 
MKSTVEKLSPTRVRINVEVPFTELEPDFDRAFKQLAGQVRLPGFRPGKAPRKLLEARIGRGAVLEQVVNDALPSRYSEAVTSAEIQPLGQPDIEVTKLEDNEELVFTAEVDIRPEIELPDLTALKITVDPIEIADDEVDTELQNLRARFGTLTGVERAAADGDFVSIDLSATVDGTEVPEAATEGLSHEIGSGQLIDGLDEAIIGLKTGESKVFTTTLAAGPHAGQEAEVTVTVKSVKERELPEADDEFAQLASEFDTIDELKESLVEQVKRVKRIQQAEQIRDKALEVLLADVEVPLPEAIVQAQIDDTLHNAIHGLDHDEDKFNESLTEQGSSREEFDTNTRNEAEKAVKTQLLMDAVADKLDIQVGQADLTERLVLMSRQYGIEPQQLLQVLQQNNQLPAMFADVRRGLTIAAVVQGATVTDSEGNVVDTAEFFGPAGGEETAEVEVAEDVAVEAPAADAESESEAAADEDAAAEKAEKKDKKKAKKKSDKKSDDSSE >NC_023036|4030864:4081277|4079222_4079447_+|WP_019511792.1|DBSCAN-SWA MKAIAEPQPDDDDRLSQLWPVEAVMARLSVGKSTVFALITSGELRSVKVGRRRLISEAAIREFIQKVDNGGSAA >NC_023036|4030864:4081277|4053131_4053605_-|WP_019513211.1|DBSCAN-SWA MTTTTLKTEDAGKQSVSRSVQVTAPVATLFEQIADPHRHHEIDGSGTVRDVEVKGPHRVSEGDKFTIGMTQYGLPYKITSTVTKARENEIVEWQHPLGHKWRWEFAEVSPGITKVTETFDYSSAKVPFVIELLGMKKQNAQGIEKTLTGLADRYLTH >NC_023036|4030864:4081277|4058630_4059512_+|WP_019513205.1|DBSCAN-SWA MSDALCRDYAEVTELRAADPASVTKAWHARTTRPTVRGDGRLMIVAADHPARGALSVGTRVTAMNSRIDLLDRLRTALADPGVDGVLATADILDDLVLLGALEDKVVFSSLNRGGLAGSVFELDDRMTGATATSTADARMNGGKMLCRIDLDDPGTVSTLAGCAQAVDQLAAHGLIAMLEPFLSTRVDGKVRNDLSPDAVIKSVHIAQGLGSTSAYTWLKLPVVPEMDRVMESTTMPTLLLGGDPTDPDEAFATWASALALPAVRGLIVGRTLLYPPDDDVASAVSGAVGLVR >NC_023036|4030864:4081277|4053667_4054483_-|WP_019513210.1|DBSCAN-SWA MTATVETPSNASSGGKKVPLVELRNVGKSYGNITALADISLRVHAGEVTGILGDNGAGKSTLIKIIAGLHQQTEGELLVDGEVTTFSSPADALDKGIATVYQNLAVVPLMPVWRNFFLGQEVRKKSFPWSLDANAMRATTLAELSKMGIELPDVDAPIGSLSGGQRQCVAIARAVFFGARVLILDEPTAALGVKQSGVVLKYITAAKEAGFGVVFITHNPHHAHMVGDHFVLLNRGRQKLDCTYDDITLEHLTQQMAGGDELEALSHELGR |
43 | Streptomyces_phage(25.0%) | tRNA,transposase,integrase,protease | attL 4068674:4068690|attR 4081587:4081603 |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
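The keyword-based flagging described in the prophage legend above can be sketched as follows. This is only an illustration, not the tool's actual implementation: the function name `keyword_tags` is hypothetical, and the header layout (`contig|region|coords|protein_id|[tag|]DBSCAN-SWA`, with the tag field present only for flagged proteins) is an assumption inferred from the FASTA headers in this record.

```python
# Phage-related keywords listed in the legend (lower-cased for matching).
PHAGE_KEYWORDS = ("capsid", "head", "integrase", "plate", "tail", "fiber",
                  "coat", "transposase", "portal", "terminase", "protease",
                  "lysin", "trna")


def keyword_tags(header):
    """Return the phage-related keywords found in a protein FASTA header.

    Assumed layout (inferred from this record, not documented):
        >contig|region|start_end_strand|protein_id|[tag|]DBSCAN-SWA
    Headers without the optional tag field yield an empty list.
    """
    fields = header.lstrip(">").strip().split("|")
    if len(fields) < 6:
        return []  # no tag field, so the protein would not be colored
    tag = fields[4].lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in tag]


if __name__ == "__main__":
    example = (">NC_023036|4030864:4081277|4074742_4075342_-|"
               "WP_019511796.1|protease|DBSCAN-SWA")
    print(keyword_tags(example))
```

Matching here is a plain case-insensitive substring test on the tag field, so a short keyword such as 'lysin' would also match longer words that contain it; a stricter matcher may be preferable for free-text annotations.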
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|