Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_007511 | Burkholderia lata chromosome 2, complete sequence | 2 crisprs | csa3,PD-DExK,WYL,DinG,cas3,RT | 0 | 2 | 1 | 0 |
NC_007509 | Burkholderia lata chromosome 3, complete sequence | 0 crisprs | WYL,csa3,c2c9_V-U4,RT | 0 | 0 | 0 | 0 |
NC_007510 | Burkholderia lata chromosome 1, complete sequence | 2 crisprs | WYL,csa3,cas3,DEDDh,DinG,PrimPol | 1 | 1 | 3 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_007511_1 | 3151-3267 | Orphan |
NA
Consensus repeat of NC_007511_1
|
2 spacers
spacers of NC_007511_1
>1.1|3169|34|NC_007511|PILER-CR GGGAAATCAGGGTTTTCCTGCCCGGAACCCGCTG >1.2|3221|29|NC_007511|PILER-CR CCCAGATCACAGACTGTGATTCCAGCAAC |
CRISPR arrays and Neighbor proteins around NC_007511_1
The CRISPR arrays of NC_007511_1 >merge|NC_007511|1|3151-3267|PILER-CR AGGTGAGGTTTTTCGGGAGGGAAATCAGGGTTTTCCTGCCCGGAACCCGCTGAGGTGAGATTTTTCGGGGCCCAGATCACAGACTGTGATTCCAGCAACAGGTGAGACTTTTCGGGA >NC_007511|1|1|3151-3267|PILER-CR AGGTGAGGTTTTTCGGGA GGGAAATCAGGGTTTTCCTGCCCGGAACCCGCTG AGGTGAGATTTTTCGGGG CCCAGATCACAGACTGTGATTCCAGCAAC AGGTGAGACTTTTCGGGA
>NC_007511.1|WP_011353618.1|1193_2375_-|DNA-binding-protein MTLDEIRHAIRDELESLRANGARRQELSLHACKRLFFDLGIRPSAANVRDLTQTGSASDIPKDIDHFWERIRSASKIRLDGAAIPKAVEEKAGALLGALYEEALKAARDSLDGDREQVRAGVADAEQRLRDATVRQETLEGALARGEAKNEQLQARVTELEVQLASQTTHGSASEATLLTTVARLEKELAAAAGRVEAEQTHNATLRDRIDTLQAELQQRTEHYAQQIKDAVAEAERRVKPMLVELDSLRSMASTYQSGLRDVQRKEFDFLQQLSSAKARADRLEEQLRSQGDELERATRDVSALRASRGMNPEIAALIRRLADAGQLDPDAFAAIGTSLDQEIPVPAHCPHCDGEPELSHSEEGFEVACPECEHASGAWPSRFEAVARFAHT >NC_007511.1|WP_011353617.1|37_1057_+|tyrosine-type-recombinase/integrase MSLPASPDTPARPEETDLFDRGASDWIVSPEAAFDAWLAMQDYRRSSADVYRAQWGAFLTWMRAHQKNLATVDTATIANFVGELPIRKTQRMRYLRLIERVLDHIRRTEYASTNPARFIAQDGEANWRKARDNEPTGFLAPAERAALLAYLFSPIGVSGSAYWKERRDRALVAAFLGAGIKTGEARALTISCINTSGTSLQIESTHPDFARETHLASFAIALLEAWLTERKRQDIPGELVFPASHAGRPMHKATMLRAIDAIVESAGLTSSRTARASPQTLRNTYAAELFEHDVPPERVGKWLGFMRPISSNRLHRAWKNWRDGLADSNGDASDSDETH >NC_007511.1|WP_011353619.1|4196_5564_+|replication-initiation-protein MATTKRAKKTDVDVVSASSAELRKAVEAIAIQPKSGKITLLTRKLFNVLLAVAQQADDSGDTYRALLSDIVANSAFDSNDTALVKEHLRRMVSVQVEWSTGTSSQKPGRKWGISTLIADAEILEDPATRRVWVEFSFAPKIKKKLLDPVQYARLSLQFQSQLRSSAGLALYEICVRYLTNPSHLTMREPWEWWRPILSGTPDTEAGDEAKREYKYFKRDYLRPAIAEVNAVTNIFVELIEHREGRRVAEIQFRVTERKQPMLALDEHPNVFDSTLVDRMVKLGIPLKEAQTLYADSEENRIRAALQMTEQRMRSTTLPPVRSAPALFKDALKKGYAPPVESVDALPAGTPSPKIAAAQPDDLKARLLSEFAAFRRKEAKVLYEEQGDAEREVARESFESEALPTMGTHLRDDWRKRGLDSKLAETAFFDWLAQRTWGEPTDGDLLSFTLNQSRAA >NC_007511.1|WP_011353620.1|5654_6716_-|ParB/RepB/Spo0J-family-partition-protein MKPSQFAKGFQARPDITTSEKRTALDRLNAIDGIVKSETPTPAPTKSAKKDIAPPPAPELTLDPSIDESAQYRAWRLENRYAPGQVIELSLKAIKHSPFNPRHFYLKSSIAELAVNLAKQGQQQAIHVIPDYENPGTYFVSDGGRRVRALKEANKESVKAIVIDVPIGIQSYKLGYDLNVQRDSQTVFDNAVVWRRFLDDKHFQSQKELSEHLGLDESTVAVALSIGKLPEAIMQEMVARPDRFGSNMAYQVGRYHNARGTEATLRLINKIVSDDLSTRQVSDIVKGRVAAQETPKAAGRQRYAQRLEIKLGGKSVGDLKSYGEDRIELRLRGLPKDKRDAILEQLERMLLSE >NC_007511.1|WP_011353621.1|6739_7402_-|AAA-family-ATPase MAAEIIAVTQQKGGVGKSTIAMHLGAAFHEKGKRVLVIDADRQNTLVHWSSASGDSDTGIPFPVVNLAEADGQIHREIKKFINDYDIIVVDCPPSITEKVSGVVLLAASIAVIPTSSSPADYWSSVGLVKLIQQAQVMNEDLRAVFLLNKTEEKRMLTRELKRALEELGFPLLKTQIPTREAYKQAMALGQTVLQMNDRGGKLAAAEIRACADEIVAMLP >NC_007511.1|WP_011353622.1|8492_8852_+|arsenate-reductase-(glutaredoxin) MMITIYHNPRCSKSRETLALVESLNTTGAPLNVVEYLKTPPTVEELEALHRQLGRPVRDMLRDGEEPYKELNLARADLTDAEAYAAIAAHPILLQRPIVVYRGKAAIGRPPESVQALFE >NC_007511.1|WP_011353623.1|8939_9161_-|zf-HC2-domain-containing-protein MLPGKCTDVTRLLSDALDRHLTLHERLQVRIHLPTCSGCRAYRGQIALLRTAAKAAAGRGPAPDDDDTSSGEG >NC_007511.1|WP_011353624.1|9138_9738_-|RNA-polymerase-factor-sigma-70 MPSAYDDPVYLAQLRRDLLRFARLQLRDADAAEDAVQEALTAAWSHAGDFAGLSAHKTWVFGILRNKLIDVLRARQRTVSLSALDAELDGESVLDRELFKENGHWAAHAKPRPWPRPETLLQQQQFWTLFEACLDHLPEQIGRVFMMREFLDFEMTDICSELTLTTNHCSVLLYRARTRLRTCLSEQGLTTEDAAGEMY >NC_007511.1|WP_011353625.1|10105_10660_+|NAD(P)H-dependent-oxidoreductase MSYRIAVVVGSLRRASWNRALAHAVISLAPADFSFEFVEIGELPLYSQDYDADFPEVAKRFKQSIEAADALLFFTPEYNRSIPGVLKNALDWGSRPWGSNSWSGKPGAVLGTSPGATGTALAQQHLRNVLAYLDVKTLGQPEMFIKHDPARIDDQGQIVSEDTRKFLQGFVDRYTGWVRLLKSA >NC_007511.1|WP_041493139.1|10712_12971_-|hybrid-sensor-histidine-kinase/response-regulator MTSDRPADSHAPASPPADDWQDDGNYASGAPEHDFAVRRVTLIVLLVAAIVLPCIYVVVMAYNDLKTREAAASDVTMRTVRVAEEHALKVFDLSETLDARIVDLVQDMDDATVRNRESDIHEALNTIGGGYPQVAAVSIFGASGMLLANSLYYPAPYASIANRDDFAGIRDGKVIEHISRLMMGPLKLENIPVFNTGVARRHSDGSFAGMVSIALKSSYFNAFYRDLLGGASTPMTMALARSDGAVIASYPPPPALAHTDRAVTFGDARNDPRAGVVRVRHDGASSEIVAYRQVGSYPVYVTCAYRTSAIWHEWYEHLSVLFISMFAPSVALWSVIWLSLKRLKAEEEAWDRWQAEASMRRSIESAYRQSRKMQALGNLVGSVAHDFNNLLMIISSNVQIVRRRGVQHLDKELGAIERALKNGQSLTRQLLGVARKQPLHNETIDVGQWVGTCRELLKTSLGSKSSLVVAIEPGVWPIRVDVAELELAVINLAVNARDAMATGGRFTVGARNVTLRREDGFPLTGDFVQISLDDTGSGMAPDVLARAFEPLFTTKAQGMGTGLGLPQVFAFCERSGGLATIDSAVGAGTSVRLYLPRARAEDVVARPQAVVHDAGAGALAGLHVLLVEDNSEVAAGTEALLSLLGHRVTYAPTADDALRLIEGAAANDAFDLVISDIHMPGRLNGIDLAEAVEQRPEKLPVILVTGYAEELDRTRTVNVRVLSKPFDIALLDEILLGIREARDARHARHAGT >NC_007511.1|WP_011353627.1|13079_13421_-|hypothetical-protein MVSLALTTARRTSLGAMRSGAGNRLAAARGRRDAGARDAPRGERRRAHISVIGKARRDSLSKPEAMQLAIFTISNRIGPNTHHSENSRDHFRDKHAKFFLVYVCAAGFVAFSI >NC_007511.1|WP_011353628.1|13441_13810_+|response-regulator MPLPIVIADDSLLARKLLTKALPGGWDVEVTYAANGREALALYRDGKASVMFLDLTMPDMSGYQVLETLRHEDLNTFVIVVSADIQPQAQARVRELGAIAFVAKPVTSEALLPILKEYGLYA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_007511_2 | 225249-225568 | Orphan |
NA
Consensus repeat of NC_007511_2
|
4 spacers
spacers of NC_007511_2
>2.1|225275|52|NC_007511|CRISPRCasFinder GTTGACCGGCGCGACTGCACTGCCGGCTATGGCAGGTGACATGAACAACGCG >2.2|225353|46|NC_007511|CRISPRCasFinder CGGCGCAGCGGTCGGCGGTGCGCTCGGCGGCAGCACGGGCTCGGTG >2.3|225425|31|NC_007511|CRISPRCasFinder GGGTGGCGCGGTGACGTCGAACCGTCGCGAG >2.4|225482|61|NC_007511|CRISPRCasFinder CCTCGGCGGCGGCGCAGGTACCGCGGCAGGCAACGCAATGGGCGGCCGCACGGGCGGCCTG |
CRISPR arrays and Neighbor proteins around NC_007511_2
The CRISPR arrays of NC_007511_2 >merge|NC_007511|2|225249-225568|CRISPRCasFinder ATCGCCCCCGCTCTCGTCGTCGCCGCGTTGACCGGCGCGACTGCACTGCCGGCTATGGCAGGTGACATGAACAACGCGCTTGGCGGCGCGCTCGGCGGGGTAGCCGGCGCAGCGGTCGGCGGTGCGCTCGGCGGCAGCACGGGCTCGGTGATCGGCGGCGCCATCGGCGGCGGCGCGGGTGGCGCGGTGACGTCGAACCGTCGCGAGCGCACCGGCGCGATCATCGGCGGCGCCCTCGGCGGCGGCGCAGGTACCGCGGCAGGCAACGCAATGGGCGGCCGCACGGGCGGCCTGCTGGGTGCGGCGGTCGGCGGCGGCGC >NC_007511|2|1|225249-225568|CRISPRCasFinder ATCGCCCCCGCTCTCGTCGTCGCCGC GTTGACCGGCGCGACTGCACTGCCGGCTATGGCAGGTGACATGAACAACGCG CTTGGCGGCGCGCTCGGCGGGGTAGC CGGCGCAGCGGTCGGCGGTGCGCTCGGCGGCAGCACGGGCTCGGTG ATCGGCGGCGCCATCGGCGGCGGCGC GGGTGGCGCGGTGACGTCGAACCGTCGCGAG CGCACCGGCGCGATCATCGGCGGCGC CCTCGGCGGCGGCGCAGGTACCGCGGCAGGCAACGCAATGGGCGGCCGCACGGGCGGCCTG CTGGGTGCGGCGGTCGGCGGCGGCGC
>NC_007511.1|WP_011353822.1|224198_224858_+|isoprenylcysteine-carboxylmethyltransferase-family-protein MTVTRKVAVVSGVSTLVYLGLAVLGSGGFAAFFSHPPLTVVVVATLAMAVVAMFTEGNLSSGERENRDNRWVLAAFGVSGFLLAYLPALTDRLDFWTFGGDAVRWIGVVLYIAGGVLRIWPVFVLGKRFSGLVAIQPGHTLVTDGIYSRIRNPSYLGLVVNSVGWALAFRSGVGLLLVVLTMVPLVARIRSKEALLRAQFGAEYDAYCARTWRLLPGVY >NC_007511.1|WP_011353821.1|223871_224036_+|hypothetical-protein MTRLLITLALIGGLSGCYVAPPYGYAPAPAYYGYAPAYYAPPVSVGIGGNFRIR >NC_007511.1|WP_041493445.1|223196_223616_+|HIT-domain-containing-protein MSYDNNNPFAKILRGELPCVKVAEDDATLAIMDLMPQADGHVLVIPKEPAAQIFELSGDAAAAGIRMTQRVAAAVRAALEPDGVFIGQFNGAAAGQTVPHVHFHVIPRWEGAELKMHAREIADAATLEALAQRIRARFV >NC_007511.1|WP_041493444.1|221722_222697_-|hypothetical-protein MPTNGDVNPYGIAFVPRGVPSWSTLKPGDVVVSNFNAASNAQGTGTTIVKLTPGKTPATTFFQGSNLGLTTALTVLRSGFVLVGNVPAPDGKTVVAPGSLLVIGPQGNLVTQLSSAALLDGPWDMTVIDRGQRVTAFVSNVLNGTVSRIELAIGDNGVTMLPGSRVIASGYVNRTDPNALVVGPTGLAYDPNIDVLYVASTGDNAVFAIQNAASTNRNGGVGRMIYFDAAHLHGPLALALAPNGHLVTANGDAVNADPQQPSEIVEFTVDGRFVAQMQVDTVSGAAFGLAFGHGSKGQLEFAAVDDNTNTATVWTLRSDNNNAQ >NC_007511.1|WP_011353818.1|220129_221530_-|serine-hydrolase MTIPFKKLILRGVALAVVAAVGYTGYMLSRLAPIATGYAAKALCSGVYVSGRPAESVIDVDIMAGVHPLLKLVHPSIDPDHHRATATFAGFAEREADFRPGLGCTLALGPSPGALPAALPPLPDPPSTQPAPATPPAGVDAQKLQTALDRAFDEPDPARPRRTRAVVVMWRGQVIAERYAPGFTADTPLPGWSMTKTVTAALVGTLVAQHKLSLDTSALLPEWRGSGDPRAAITLDELLRMTSGLQFNEDYDDPLSDVALMLYAQSDTARFASAKPLAATPGTRWSYSSGTSAIVARVMREALGGSEDDYLALPRRALFAPLGMRSAVFEPDASGTLVSPSYMYASARDWARFGQLLLQDGVWDGQRLLPEGWVRYLTRTTPQSERQEYGAQLWVKVPEPFNDRDPHAVAMPADAFHAVGHEGQFVSVVPSRQLVVVRLGLSRPESAWNHEAFLARVLDALPAPGA >NC_007511.1|WP_011353817.1|219398_220022_+|glutathione-S-transferase MLHILGKIPSINVRKVLWLCTELNLPFEQEDWGAGFRTTNDPAYLALNPNGLVPVIKDDDFVLWESNTIVRYLANRYGGDALYPAEPQARARVDQWIDWQGADLNRSWVGAFLGLVRKSPDHQDPAGIAQSIAGWTKHMQVLNAQLKATGAYVAGNGFTLADIPIGLSVNRWFGTPFEHPDFPAVKRYIERLATREGFQKYAGSANP >NC_007511.1|WP_011353816.1|217826_219326_+|TIGR01777-family-protein MNTLPAYDWALNLLIVQGAMGAFDTLYHHELTQDLPHSPRARLELGIHAVRSVLYGLVFASIANVAFHGAWVAALAAVVVVEVVLTLWDFVVEDQSRKLPSTERVLHTLLAVNGGALFGMIAMQLAVWAHEPTALQALDLGWRGWVLSLFAVGVAISGIRDGIAACRIARHAPVANPFAGQAPGSVLVTGGTGFIGETLVNQLLDAGHVVTLLARDPLRAAYLFHGRVRSVTSVEQLQPHERFDTVINLAGAPVLGARWSKRRQAVLLASRVGVTESLMRWVETAEVRPRTWIQASAIGYYGVRPSDERLDESSNAGTGFMSELCRQWEQSAQPLERHGARSVVLRLGVVFGPGGALRPMLLPHYFGMGGRFGDGKQVMSWIHRDDVLRIIARAMANPGMRGVYNAVAPAPLTQREFVQVVSKVLHRPAWLHVPAAPLRIAMGEMAEVLLDGQRVMPARLHQDGFMFRFPTAEHALRDLTNHPHADFPRTACCLPRGRV >NC_007511.1|WP_011353815.1|216963_217830_+|DUF393-domain-containing-protein MSSSELVLYFDGRCPLCVAEIKRLEARDARHRLAFVDIAEPGFDPAPLGVDLPALNRELHARLPDGRMLTGVDSILAAQALTGRRWLVRLLRVPVVRTALAPLYRRFARNRQAVSRWLGYRAEAPCDGAACGSGRAAEPTANRPAGDAARRIVVTWMYGAAIAHLLVGIAVPWVAGTPWLDAYHRVIELHFWAGAAPDAARAQQVWWMSLIGATVQCASVWMLALVHLGNRLRRREVWGWLLAGLLVWAPQDMLFSLQAHVSGHVAIDAAALVAMVPPLVWLWRRDTV >NC_007511.1|WP_011353814.1|216392_216863_+|DUF2269-domain-containing-protein MNTYLVIKALHILSSVLLVGTGFGTAFYLFFANRTRSVPAIAAVSRLVVRADWWFTTPAVIFQPASGLWLAHTAGWPWDTPWLVASIVLYAIAGACWLPVVWLQVELAAMAKLAHANGDAGLPERYWRYAKRWELLGYPAFFAMLTVYFLMVLKPM >NC_007511.1|WP_011353813.1|215088_216396_+|SDR-family-oxidoreductase MNAVLPSISQHDCTGARSMTVLVCGANGFIGRALCAQLEAGGHRVLRGVRHAAGPCDVAIDFAKDVDPDAWLARLEGVDVVINAVGILADQRGATLDAVHRAAPCALFTACCRAGVRRVIQISALGVERGDTRYFASKVAADRFLQTLPIDYRIVRPALVYGTTGASARFFRMLASLPVQVLPAGGHQRLRPVHVDDLAEVVARCVDAPAAGRPVIDVVGNDEVEYREMLARYRAALGFPPAVRIALPGPLAGAAATLFGMLPGAMFTRDTWTMLRGGNTGDPAAATAMLGRPPRGIDGFIGAEAAALRRDALAMWRRPLLRGALAIVWIWTAIASAFIHPLHASLALLAPAHLTGLPALIALYAASALDFAFGIATVVAPSRRLWVAQAALIVAYSAVIAVTMPGLLAEPFGPVLKNVPILAILLILFSEEEQA >NC_007511.1|WP_157687225.1|226221_226461_+|hypothetical-protein MESIATFNWSIANDFAIVGDARVRHRTVPLAAPRRMIASATARSPAPQAWAAASEEPDEPVAKPGRPNEGACAFKPPRQ >NC_007511.1|WP_011353825.1|226550_227387_+|(S)-ureidoglycine-aminohydrolase MSKTTYYAPHGGHPPQTDLLTDRAMFTEAYAVIPKGVMRDIVTSWLPFWTNTRLWVIARPLSGFAETFSQYIVEVNPGGGSDKPEQDKNAEAVLFVVEGEAELTLQGKKHVLKPGGYAFIPPGADWTLHNVSDAAVRFHWVRKHYQAVDGIPLPEAFVTNEQDVEPIPMPGTNGAWVTTRFVDMSDMRHDMHVNIVTFEPGGVIPFAETHVMEHGLYVLEGKAVYRLNQDWVEVEAGDFMWLRAFCPQACYSGGPGRFRYLLYKDVNRHMNLLLNPAR >NC_007511.1|WP_081436647.1|227539_228610_-|hypothetical-protein MMTSSDGLTSPARLLRLASWVIAIIFAVFLNMLGSLVIRDMAFAPRGGPPVVEQFADAPAKARLDAARRDLQAQHDTLAGKADAMEVARGRAAKEYAAEKESFRNWLATRSVTGDSARDPDILARTRKLDTLQAVVINWQHQIDAIGDQQRALASQQTQVDTQIADADAAAERRFEKADRQYELQVFGLRLALTLPILLIAIWLFMRYRKMRYWPFVYGFGLFSLSAFFIELVPYLPNFGGYVRVLVGIALTVFAGLYMMKAFQRYAERKRLELQQDQGERARTIGYEKAVRSLEKKRCPSCDKQWNLGGDDSTFCVHCGLRLFNVCGCGGRNFFFFPHCHQCGATQGNESPAASD >NC_007511.1|WP_011353827.1|228765_229560_-|3'-kinase MFDRYLGLWGLVPDGGPILTASGGLLPVVWQARPAMLKVATCDEERRGNALMTWWNGQGAAQVWQHDSDAVLLERARPEPTLAGFSASGHDDDAMRIACNVVARLHAHRASEPPSVVPLHDWFYALLSNDADNAALRRSAAIARQLLVAPPVDDFVLHGDIHHGNILHFGERGWRAIDPKGLRGDRAFDYANLFCNPSHDIAVDPVRFEQRVTLVASAAQLDRRRLLQWILAWSGLSAVWLIEDGLSPDTRLQVAMLAAAALGV >NC_007511.1|WP_011353828.1|229798_231922_+|PBP1A-family-penicillin-binding-protein MTDHTASPPPPPAPPPRRRRVWRTLAGALLGLTVACAGIGAWTIQRIWTQLPSVEHLAVYRPALPLRIFSRDGDLLAEYGVERREFVPLERIPPLMRQALLAAEDAKFYQHGAVDFNGLARATFANVVTGQPGQGGSTITMQVARNFYLTRDKVLSRKLAEILMATKLEREYSKDKLLELYMNEIYLGERAYGFAAAANVYFGKPLDALSAGEAAVLAGLPKAPSAFNPVVNPARATARRNYVLGRMHALGQLDDATYRAAVDAPIALATTPPPGIIAAPYVAERARRMMVERFHDDAYTLGLDVTTTISMRDQRAAESSLARTLSRQPRAKRDARNGLEGALISLDAATGDMLALVGGADFNRNVFDHALQAYRQPGSSFKPFVYSAALEKGYFPGVLVDDTQRTLTHEETGARPWRPRNFANNYEGFIPVRRGLMRSKNLVAVSLMQATDARYVQQHAVHFGFDAQRNPASLPLALGAGAVTPLELASAYSVFANGGTRMEPRLILSVKQRHGGAIYEATAPAGERVVSARNAFVMDSMLRDVVKSGTARGALALRRDDAAGKTGTSNGSKDVWFAGYSSGIVAVAWLGYDTPRPMGRATGASLALPVWLDYMKTAVDGRTPVEATPPQDVALVDGDFVYAEYTRGTCTADVPSYIRSRFACGGAAAVSGASDAPGNNGKPAEPMPGAVDAAERERVLDLFRTED >NC_007511.1|WP_011353829.1|231930_232542_+|trimeric-intracellular-cation-channel-family-protein MHTLYLIAIVAEAMSGALMGMRRGMDRFGLALVGAVTALGGGTVRDVLLGHYPLGWIAHPEYLVITLVAATVASWVARHVARMKTLFVTVDAIGLAAFTIIGCDIGASTGAVPIIVVLAGAITGVCGGMLRDLLCNEMPLILREELYASVAFVTGGLYVGMQYVGIDAGFATVVALAAGFSMRMLAVRLGWKMRTFGAADLEH >NC_007511.1|WP_011353830.1|232630_233017_-|lactoylglutathione-lyase MPLPMTRIILYVQDVASLKSFYQQYFDLPVIEEIENEWVVLGAGAIELALHLAGPAFRHAAPADAEARAETSHVKFVFSIGQDIGAHRGRLAGDGVTVRDLKRYDGFAYTMYDGIDPEGNVFQVMQAD >NC_007511.1|WP_011353831.1|233136_233958_-|heme-ABC-transporter-ATP-binding-protein MLTAHHLDVARRHHAILRNLSLSIEPGRVTALLGRNGAGKSTLLKTFAGELTGGVAPNGVRVTGDITLNGEPLARIDAPRLACLRAVLPQAAQPAFPFSVDEIVLLGRYPHARRSGATSHRDRDIAWCALERAGADALVGRDVTTLSGGELARVQFARVLAQLWPDDEAIESGPRYLLLDEPTAALDLSHQHRLLETVRAVAREWQLGVLAIVHDPNLAARHADTIAMLADGTIVAHGSPRDVMTPAHIAQCYGFAVKMVETGDGAPPVMVPA >NC_007511.1|WP_011353832.1|233977_235066_-|iron-ABC-transporter-permease MPAHASPFPASSPASRSGAARIGTSRRFAPFALAALACLVCAMSVVALCVGAYRIPLAEAWAALTGAAAAQQARAVLFDIRAPRVALALLVGGGFGATGAAMQALFRNPLADPGLVGVSSGAALGATTMIVLGPALFAAHVSAAALPIAAFAGALAVAALVYRLAASRGRLALPLLLLAGIAINALVGAAIGLLTFVADDAQLRSLTFWSLGSLGGAQWSALAAVAPCVAIGCVLLARERDALNALQLGETEALHLGVPVQRLKRRVLVAVALAVGALVSCAGIIGFIGLVAPHCVRLACGPDQRVVLPGAALLGALLTLAADLAARTVAAPAEIPLGVLTALLGAPFFLALLWKSRGALGG >NC_007511.1|WP_011353833.1|235081_235999_-|ABC-transporter-substrate-binding-protein MSARPFDPRRRAVLASAAAGALAGALPGSVLAQVAPAAPKRVVVIGGALAETAFALGGAETPRYRLVGADTTCTYPDAAKRLPKVGYQRALSAEGLLSLRPDLVLASAEAGPPTAIAQVKNAGVAVTTFDERHDVESVRAKITGVAQALDVRDAGTALLQRFDRDWQDARDAVAARAPGGAQPPRVLFVLNHTGNQALVAGQRTAADAMIRYAGARNAMQGFDHYKPLTTEALAAAAPDVVLISDEGLAAVGGRAALLATPGFGATPAGRAQRVVALDALFLLGFGPRLPLAVTTLHRRLSDALA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_007511_2 | 2.3|225425|31|NC_007511|CRISPRCasFinder | 225425-225455 | 31 | NZ_AP014705 | Methylobacterium aquaticum strain MA-22A plasmid pMaq22A_1p, complete sequence | 1073398-1073428 | 8 | 0.742 |
NC_007511_2 | 2.3|225425|31|NC_007511|CRISPRCasFinder | 225425-225455 | 31 | NZ_CP030074 | Streptomyces sp. ZFG47 plasmid unnamed1, complete sequence | 871327-871357 | 8 | 0.742 |
NC_007511_2 | 2.3|225425|31|NC_007511|CRISPRCasFinder | 225425-225455 | 31 | NZ_CP022193 | Yangia pacifica strain YSBP01 plasmid unnamed3, complete sequence | 36974-37004 | 9 | 0.71 |
NC_007511_2 | 2.3|225425|31|NC_007511|CRISPRCasFinder | 225425-225455 | 31 | NZ_CP043499 | Rhizobium grahamii strain BG7 plasmid unnamed, complete sequence | 433713-433743 | 9 | 0.71 |
NC_007511_2 | 2.3|225425|31|NC_007511|CRISPRCasFinder | 225425-225455 | 31 | NZ_CP014311 | Burkholderia sp. PAMC 26561 plasmid unnamed4, complete sequence | 174204-174234 | 10 | 0.677 |
NC_007511_1 | 1.1|3169|34|NC_007511|PILER-CR | 3169-3202 | 34 | NZ_CP030128 | Indioceanicola profundi strain SCSIO 08040 plasmid unnamed2, complete sequence | 312113-312146 | 11 | 0.676 |
1. spacer 2.3|225425|31|NC_007511|CRISPRCasFinder matches to NZ_AP014705 (Methylobacterium aquaticum strain MA-22A plasmid pMaq22A_1p, complete sequence) position: , mismatch: 8, identity: 0.742
gggtggcgcggtgacgtcgaaccgtcgcgag CRISPR spacer accgtacgcggcgccgtcgaaccgtcgcgag Protospacer . .*****.* *****************
2. spacer 2.3|225425|31|NC_007511|CRISPRCasFinder matches to NZ_CP030074 (Streptomyces sp. ZFG47 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
gggtggcgcggtgacgtcgaaccgtcgcgag CRISPR spacer gaagagcgcggtgacgtcgaacggccgcggt Protospacer *.. .***************** *.****.
3. spacer 2.3|225425|31|NC_007511|CRISPRCasFinder matches to NZ_CP022193 (Yangia pacifica strain YSBP01 plasmid unnamed3, complete sequence) position: , mismatch: 9, identity: 0.71
gggtggcgcggtgacgtcgaaccgtcgcgag CRISPR spacer ccaaggcgcggagacgtcgaaccggcgctcc Protospacer . ******* ************ ***
4. spacer 2.3|225425|31|NC_007511|CRISPRCasFinder matches to NZ_CP043499 (Rhizobium grahamii strain BG7 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.71
gggtggcgcggtgacgtcgaaccgtcgcgag CRISPR spacer aaaggtgacggcgacgacgaaccgtcgcgag Protospacer ... * .***.**** **************
5. spacer 2.3|225425|31|NC_007511|CRISPRCasFinder matches to NZ_CP014311 (Burkholderia sp. PAMC 26561 plasmid unnamed4, complete sequence) position: , mismatch: 10, identity: 0.677
gggtggcgcggtgacgtcgaaccgtcgcgag CRISPR spacer attctcggcggtgacgtcgaccggtcgcgac Protospacer . . ************* * *******
6. spacer 1.1|3169|34|NC_007511|PILER-CR matches to NZ_CP030128 (Indioceanicola profundi strain SCSIO 08040 plasmid unnamed2, complete sequence) position: , mismatch: 11, identity: 0.676
gggaaatcagggttttcctgcccggaacccgctg CRISPR spacer cctgtttcagggttttccggcacggaacccgtaa Protospacer . ************ ** *********. .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1154712 : 1159250
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_007511|1154712:1159250|DBSCAN-SWA CATGATGTCTTCGCGTCCCCAATCGCCCACGCTGGGCAACGGCCAGGCCGACGAGTGCGTGACGCTCGTCGCCACCGCGTCGCTGCCCACGCGCTACGGTACGTTCACGTCCTACGCTTTCCGCGTATCCGGCAGCGATGCCGAACACCTTGCGCTCGTGATGGGCGACGTCGCCGGCGAGCAGTCCGTGTTGACGCGCCTCCATTCCGAATGCCTGACCGGTGATGTATTCGGCTCGTACCGCTGCGACTGCGGCGAGCAGCTCGATCTCGCATTGCGCTACATCGCGGCCGAAGACCGCGGCGTGCTGCTGTATCTGCGCGGCCACGAAGGGCGCGGGATCGGCCTGAGCAACAAGATTCGCGCGTACGCGCTGCAGGAGCAGGGGCGCGACACCGTCGAGGCGAACCTCGACCTCGGCCTGCCCGATGACGCCCGCGAATATGATTCGGCCGCCGCGATCCTCCGGATCCTCGGTGTGACGTCCGTGCGGCTGATGAGCAACAACCCGGCGAAGTTCGACACGCTCGCGAAGCACGGTATTCCGGTTTGCGAACGCGTCGCGCTTGCGGTGCCCGTGCGTGAGGAAAACGAGCGTTATATCCGCACGAAGCAGACGAAGTTCGGGCATTACTTCGAAGAAAACGAGTGACAGAATCAAGGGGCTCTTCGGAGCCCCTTGTCTCATCTAGGTTTTGTGCCTTAGAGGCTTTCTGGTAAAGGGTTTGAGCCGAAAGTTGCGTTTTCCAGGTGGCTTGTCTGCACTTGACGAAACAGGCGAAAACTTGGCACTCTAGCGTAGCATTTTGACGGGCCAGACCGAAAAGCTACGCTGCGGTCCCGGCATCCAATAAAAACCTACGCCGACCGCCGATGAGATTCGATGCACGCACCGCGAGCAAGCTTCCTGCGGGCCAACACCTGACCTTTGACGGATTCCCGGGGCTCCGCTTCCAGGCGAGTGAAAGCCGCCGCTCATGGATCTACCGCTACAAATCCCCCATTGACGACCGGATGCGCCAGGTCAAGCTTGGCGAATGGCCAGCTATGGGCTTTCCTGCTGCCATCGCAGAGTGGGAGCGAAAGCGGTCCGACCGTGACGCCGGCGCAGATCCTGCCGCGGCAAAGCGCGAGAAGCGCACTGCAGTCGCCATTTCTCGGGCCGTCGACGCCTATACCGTCAAGCAAGTGTGTTTCGACTATATGGGTGGCTATCTGGAGCCGAACCGCAAGGACAAGGGTGTGGTCGAGGTCCGACGAATGTTCAAAGCGATGCTCGGCCCAATCGAGACGGTTGCCGCCGCGTCGATTACCCGCGCTCAGGCTTTCGAGTTTCTCGATTCCTTTCGTGCCACGCCAGCGCTCGCGGCTCGCCTTCGTATGGAACTGGGTGGGGCGTGGGACTATGCAATGGATGCCGGACGACTGCCCGACGGCACGCCGAACTGGTGGCGGATGATTCTGCGCGGAAAGCTGCGCAGCAAGGGGCGCACGATCGACGGCGTCGCCATGGGTACAAAAAAGCGAGTTTTGAGCGAGGACGAAGTCGGCACTTTGCTCCGCTGGCTCCCGAATATGAGCCTAACGGTTTCCGACGCGATCACGTTGTACCTGTGGACCGGTGCCCGCGGAGGCGAAATCATTTCGATGGAGGCGCACGAAATCGCGGACGAAAGCGATGGCCTCTGGTGGACTGTGCCGAAGGAAAAGACAAAGAACTCGTGGCGCTCGAAGGCGGGAGACTTGCGGGTTCCGATCACCGGTCGCGCCGAGGCCATTGTGCGACGGCGCAAGGAACAGGCCGTGAACGGATTTCTGTTCCCGACCTCGACCGGCGAAATGATGAAACAGACCGTCATTTCGCATGGCGTCTACTACCATCAGCCATACTGCAAACAAGCTCCCAATCACAATCGGCCGCGCCTGACGGTAACGCACTGGTCGCCGCATGATCTGCGCCGTACCGCCCGCACGATGCTCGCGGCGCTTGGTTGTCCCCATGATGTCGCAGAGGCGGTGCTCGGCCATATTCAGCCAGGCGTGGCCGGCGTGTATAACCGGCACCATTACGACCGCGAGCGGCGTGAATGGCTCACGCGCCTTGGACAGTTTCTTGACGAGCTTGCGTTGAAGTATCCGAGGAAGTAGCGCGCCGGCGGCGATTGCTGTGGCCGGTGTTGGGTGGAGGCGCGAGATCCGATTCCGGGCACGCTTCCGCCCAGGCCTCAACTTCGCGCGTTAGCCAGGCAACTCGCCGATCGGAGAGCAGCCGCGGCTTCGGAAATTTGTTCTCCCTTACGAGTTTCTGGACGGTCGCCTCCGACAGTGAAAGAGCGGTCGCCACAGCGGGTAGATCGAGGTAGAGCGGTTTCATTTTGGTGGATGCGGACATGCTACGATTCCTCTGTCTCAAAAAGGCGTTGAATCAATGCAGAAGAAATTTGTAATGCGCGGGTACGAGATGAACTGCGAACCGCGTGTGACGGAGGACGGCAAGTACGCCGCCCAGGTCGAGGTAACGAAGCTGGGATTTAGCCGGGAGGCTGCGTTCCGGAATCTGGGTGAGTTTGATTCCGAAACTAAGGCCGTCGATTACGCGAGGCAATTCTCGGTTGAATGGCTTAGCCGGTACGGTTGAGCAAGCAGGGGCAACGTGAACAAAAACACGCCGCGGCGACGTAAGTCCGTGAAAGATTGGCTCGGCCTTGGTCTGTTGATCTGGGTTGTCGCGCCCCCACTTGCATCAATGGCCTTCTTGTTGTTGTTGATGCTCTGGTGTCGTGTCTTCGCAAGTTGCAAAATTTAGTGTGCCAATGGAGTTAGTGTTCAGCGCATTCCGCAAGGAGAGTTCGCCATGTCTAACTGGAAAATCGTCGATAAAACACACACTGCTCATTTGACGTTACGCCCGAGTTTGCAACCTGGTATGTTTGTTGTTGCAGTATGGATTTTGCCGAACGGTGTGCCGAAGGGCGATTCGCCACAATGGGCCCAGGAATTCCCTTCGGAGCTTGAAGCACGCAATGCTGGTGAGGATATGGCAACGGCCAAACTGCGAGCTCTCTCTCCTTGATCTGGGGCGCTGGAGACTCGTTTCGCAAGATTGGATTTGCTCGAGATCCTAGATTGGGGGGGGGGCATGGTGCGGATACAGACGCAGACGGGTGTTCGGAATGTTCAAGAGGACACTTACAAGGGCCGCAAAATTGAAGTGGTGACCGGTTACGATTCGCTCTCTGACAAGTGGCCTTTCCACATCTATATCGACGGCACTCACCTTGTCGGGCAGTGGAAAGCCGATCGAATGGAAGAAGCTTTCGATTCTGGATTTCAGATCGCTCAAGAGCAGCTAGATCGCGTCTAGATGCTCATGAAGTGGCGTTCGAGCTGTCTATCGCGCTAGGTGCAGCAGCGCGTGAATCTCGCCGCTCCGCGACGAACCGCTCAAGCCATTCGATAGCATCGGTCGGCGATCCAAGCGATCCCGTGACCTTCCTCCGCTTGTTCATCGCGCCGCACTGGCGAAAGGTTACTGGAATGGCACCGGCGGCGATCGCCGCGGCACGCTTGCTGATGGCGATGTCGAAGTGCTCGTCGCGCGTGCCGGGATGCTGAATCCATTTCGGGTTGACGCCGATCTCGCGCACCATGGCGAGCAGTTCCTCGGTCGTATCGGCAATGAGGTGCGACATCTTCATCCGGCCGAGTTTGCCGATCTCGTGCCGGTACATGTCGTCGACATAGACGGCCATCATGACTCCAGGAGCTGGTATGGGAGGTGGCGACCTGGAGGCGCAGAGTCAGGACATACAACTATCACTCGCCCGGATGCCAGCTTGATTTTTCCGGTTCCGTGCGTGCTACGCTGTCCTCGTGAAAGTATTCGGAGGCAGCTATGCCAACTTGGGAACAGGCAATGGAACGTGACGCGGCGTACGAAGCTCTCGTCACGCTTCAGCGGAAACGCAGAGCGGATTGGCAATCCGAGGGTCACCATCAAGATCAGCAGTACCGCATGGCAGAAGCTGCCGAATTCATGCGCGCGTACGTTCGGTACGAGATTGCATGGCGCCCATTTTCCGGGTACGGAAGCTTCCTTGACGCGTAAATCGCAGAGGTAAGCGACGGCCGCGATGGTTCCGATTTGGCGGCTGCTCACGATGCTTCTCCGGTGCGGGCTACGGCCGCATGCAATTTTGCGACTTCACCGGGGTTTTCGTCGCGCCACTGACGCCAGCTCGGGTCGGTGAGCTTCCACCGGATCCAGTCTGGGTGATCGGGCTTCGCTTCGAAAGGGGCGACGTACGGACCATCATCGATTTTCGTCGCGCACCGTTCACACATCCCGGTACCACTTAGGTGCGCGTGGCAGAAATACAGGCCGCAGCCTTCTTCGCCGCCATAGGGTTCTTCGTGTGCGCAGACATGCGCCAGGCCGCGATCGATCTTCGCATTGCATTCTGGGTGATCGCAGGTAGCCGGCACATCGTATCCGATGTCTCGCTTCCAGTTGTCGTCGTATCCGATTGACCAGCCCAT
Protein sequences of DBSCAN-SWA_1 >NC_007511|1154712:1159250|1157139_1157349_+|WP_011354636.1|DBSCAN-SWA MQKKFVMRGYEMNCEPRVTEDGKYAAQVEVTKLGFSREAAFRNLGEFDSETKAVDYARQFSVEWLSRYG >NC_007511|1154712:1159250|1158866_1159250_-|WP_041493237.1|DBSCAN-SWA MGWSIGYDDNWKRDIGYDVPATCDHPECNAKIDRGLAHVCAHEEPYGGEEGCGLYFCHAHLSGTGMCERCATKIDDGPYVAPFEAKPDHPDWIRWKLTDPSWRQWRDENPGEVAKLHAAVARTGEAS >NC_007511|1154712:1159250|1156803_1157085_-|WP_081436663.1|DBSCAN-SWA MKPLYLDLPAVATALSLSEATVQKLVRENKFPKPRLLSDRRVAWLTREVEAWAEACPESDLAPPPNTGHSNRRRRATSSDTSTQARQETVQGA >NC_007511|1154712:1159250|1154712_1155363_+|WP_011354634.1|DBSCAN-SWA MMSSRPQSPTLGNGQADECVTLVATASLPTRYGTFTSYAFRVSGSDAEHLALVMGDVAGEQSVLTRLHSECLTGDVFGSYRCDCGEQLDLALRYIAAEDRGVLLYLRGHEGRGIGLSNKIRAYALQEQGRDTVEANLDLGLPDDAREYDSAAAILRILGVTSVRLMSNNPAKFDTLAKHGIPVCERVALAVPVREENERYIRTKQTKFGHYFEENE >NC_007511|1154712:1159250|1155584_1156859_+|WP_011354635.1|integrase|DBSCAN-SWA MRFDARTASKLPAGQHLTFDGFPGLRFQASESRRSWIYRYKSPIDDRMRQVKLGEWPAMGFPAAIAEWERKRSDRDAGADPAAAKREKRTAVAISRAVDAYTVKQVCFDYMGGYLEPNRKDKGVVEVRRMFKAMLGPIETVAAASITRAQAFEFLDSFRATPALAARLRMELGGAWDYAMDAGRLPDGTPNWWRMILRGKLRSKGRTIDGVAMGTKKRVLSEDEVGTLLRWLPNMSLTVSDAITLYLWTGARGGEIISMEAHEIADESDGLWWTVPKEKTKNSWRSKAGDLRVPITGRAEAIVRRRKEQAVNGFLFPTSTGEMMKQTVISHGVYYHQPYCKQAPNHNRPRLTVTHWSPHDLRRTARTMLAALGCPHDVAEAVLGHIQPGVAGVYNRHHYDRERREWLTRLGQFLDELALKYPRK >NC_007511|1154712:1159250|1157565_1157784_+|WP_157687258.1|DBSCAN-SWA MSNWKIVDKTHTAHLTLRPSLQPGMFVVAVWILPNGVPKGDSPQWAQEFPSELEARNAGEDMATAKLRALSP >NC_007511|1154712:1159250|1157814_1158075_+|WP_157687260.1|DBSCAN-SWA MDLLEILDWGGGMVRIQTQTGVRNVQEDTYKGRKIEVVTGYDSLSDKWPFHIYIDGTHLVGQWKADRMEEAFDSGFQIAQEQLDRV >NC_007511|1154712:1159250|1158079_1158463_-|WP_011354637.1|DBSCAN-SWA MAVYVDDMYRHEIGKLGRMKMSHLIADTTEELLAMVREIGVNPKWIQHPGTRDEHFDIAISKRAAAIAAGAIPVTFRQCGAMNKRRKVTGSLGSPTDAIEWLERFVAERRDSRAAAPSAIDSSNATS |
8 | Burkholderia_phage(50.0%) | integrase | attL 1145859:1145872|attR 1158863:1158876 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_007510_1 | 864043-864269 | Orphan |
NA
Consensus repeat of NC_007510_1
|
3 spacers
spacers of NC_007510_1
>1.1|864066|49|NC_007510|CRT TTCGAACCCGCTGGCGCCGGTCCAAGGCGTCGTGAACCAAGTGACGGGC >1.2|864138|37|NC_007510|CRT CGGCGCACTGACGGGCGCACTCGGCACGGTCACGGGT >1.3|864198|49|NC_007510|CRT CTCGAGCCCGCTGGCGCCGGTCCAAGGCGTCGTGAACCAGGTCACGGGC |
CRISPR arrays and Neighbor proteins around NC_007510_1
The CRISPR arrays of NC_007510_1 >merge|NC_007510|1|864043-864269|CRT GCACTGGGCGGCATCGGCGGCGGTTCGAACCCGCTGGCGCCGGTCCAAGGCGTCGTGAACCAAGTGACGGGCGCACTGGGCAGCGGCAACCCGGCCGGCGCACTGACGGGCGCACTCGGCACGGTCACGGGTGCACTGGGTGGTATCGGCGGCGGCTCGAGCCCGCTGGCGCCGGTCCAAGGCGTCGTGAACCAGGTCACGGGCGCACTGGGCGGCATCGGCGGCGG >NC_007510|1|1|864043-864269|CRT GCACTGGGCGGCATCGGCGGCGG TTCGAACCCGCTGGCGCCGGTCCAAGGCGTCGTGAACCAAGTGACGGGC GCACTGGGCAGCGGCAACCCGGC CGGCGCACTGACGGGCGCACTCGGCACGGTCACGGGT GCACTGGGTGGTATCGGCGGCGG CTCGAGCCCGCTGGCGCCGGTCCAAGGCGTCGTGAACCAGGTCACGGGC GCACTGGGCGGCATCGGCGGCGG
>NC_007510.1|WP_011351122.1|860744_861989_+|FAD-binding-oxidoreductase MDFDVIVLGAGIVGVSSALHLQDRGLRVALVDRRAPGEETSHGNAGLIERSSVVPYAFPRRLGTLLRYARNRSVDLYWDYRALPAYAGWLARFWRESSPQRLAAAARDLLPLVAASVVEHDALLARTDAQPLVHDGGWIEAFRSPALFDAETRAQQRVADAHGLRMTVLDARALRAREPGVSDAFCGAFHWQDPKTVSSPGGLTKAYARLFERDGGTFALGDARTLVQVDDGWQVGTEHGPISARSAVVALGPWSDHVFEPLGYRIPLRAKRGYHMHYRPTRAPLNVPVCDTEEGFVVAPMEGGRLRLTTGVEIALRGAPPTGVQLARAEPLARDAFGIGERLDPEPWLGMRPCTPDMRPVIGPAPRHRHLWFAFGHCHHGLTLGPATGRLLAEMMTGAPTYIDPHPYRPARFG >NC_007510.1|WP_011351121.1|859897_860677_+|ABC-transporter-substrate-binding-protein MNWKLSLCAAAALACAAVTAHAEQTTLRFGIEAAYPPFESKTPAGQLQGFDVDIGNAVCAKLNMKCVWVENSFDGLIPALQARKFDAINSAMNITSKRRQSIDFTPAIYVVPIVMIAKHGSPLRPDVASLRGKHVGVLQGSSQEDFLKAHWATAGVAVVSYQDQDQIYADLVAGRLDAAVQEAQTAQDGFLDKPAGRDYQIVGEPLKDPATLGEGTGFGMRKNDKALQAKIVGALDALKKDGTLSALSQKYFKRDIVSK >NC_007510.1|WP_011351120.1|858555_859518_-|ornithine-cyclodeaminase-family-protein MTQTAPTLPLTVDEAAVRAALPSLDVLGTLRSMFASLAATRAVQPPQTLTLFPDQAGDFITYLGALADAQVFGAKLSPYVVTGGKPIVTAWTALMSMRTGQPLMWCDAGLLTVERTAGTTALAVDCLAPRDARHLAIVGAGAVGLAHLRHTAGLRDWETIRVYSPALAGDAAQQAALAQLDPRARAAASVEACVRDADVVMLCTSSGTPVLGDGMLTRPALVTSISTNVARAHEIPPAWLPDMDVYCDYRHTTPASAGEMQIAAAEHGWDAARIAGDLPALVAGTCPAPSYTRHAFFRSIGLGLEDIAIAHALYTHLARA >NC_007510.1|WP_041492752.1|857764_858559_-|PAS-domain-containing-protein MSGARARGRYNGATAAAAARGRTGMPANGAPARRPDALPGPRPPEMANPMRKKQSPVKNLLLTRYAPIADGIAALFFPYAEVVIHDLHDQTVLYLANNLSKREVGDDSALEEIDHSARERVIGPYEKLNWDGRRMRCVSNVLFDDEGRPAGMMCINFNIAVFDEVRATLDLFIKGAGVVAQPDELFRDDWQERINTFLHGWLRERQVGLNGLTREHRRELVEALYAEGAFRGKSAANYVANVLGMGRATVYKHLKHLKETQGDA >NC_007510.1|WP_041492751.1|856826_857558_-|DsbA-family-oxidoreductase MTTAPAPTARPTLTVEIWSDLICPWCWIGKRRFDEALAAFAHADRVDVALRAYRLMPGQPVEPVEAMLAGKYRMSPAQVDQMLRQVTDAAASVGLRYDLPGTLVGDTLDGHRLVKLAEATGRAHALTERLYRAYFCEHGSLFDHAELTEFAVEAGLERSAVDAVLRSDLYRNEVEADAARAAQIGGRGVPLFVFGGRYAVSGAQPADAFAQALDQAWRDGIVELDGSDAAACGPDGCALPARS >NC_007510.1|WP_011351117.1|855247_856687_-|DHA2-family-efflux-MFS-transporter-permease-subunit MTHGIHGEKRWYALIVLCLGVLMIVLDSTIVNVALPSISTDLHFTETALVWVVNAYLLTFGGCLLLGGRLGDLYGQRRMFLAGLVVFTLASLACGLAQSQTMLIAARAVQGIGGAVVSAVALSLIMNLFTEPGERARAMGVYGFVCAGGGSIGVLLGGLLTSSLSWHWIFLVNLPIGIAVYAMCVALLPRLRAPAGTARLDVAGAITVTASLMLAVYGIVGGNEAGWLSTQTVSLIGAAVVLLALFIAIEARAAHPLMPLSLFASRNVALANVIAVLWAAAMFAWFFLSALYMQRVLGYGPLQVGLAFLPANLIMAVFSLGLSARIVMRFGIRGPIAAGLLIAACGLALFSRAPVDGGFVWHVLPGMTLLGIGAGVAFNPMLLAAMSDVDPADSGLASGIVNTAFMMGGALGLAVLASLAAARTDALAAANAAPLDALNGGYHAAFAFGAAFAAAAALIGLALRIRRQGAVEGVGPAMH >NC_007510.1|WP_011351116.1|854117_854906_-|sulfite-exporter-TauE/SafE-family-protein MSLPHIDLLYSLSGLFVGILVGLTGVGGGSLMTPILVLLFGVHPATAVGTDLLYAAATKATGTLVHGLKGSIDWRITGRLAAGSVPAAAVTLWWLHTHGMNSPGTARMIQLVLGVALLLTSLALIFRPQLTAFAARNPLAPSPARTLWSTVLTGAVLGVLVSMTSVGAGAIGVTVLLLLYPALATTRIVGSDIAHAVPLTLVAGMGHWLLGSVDWSMLLSLLLGSLPGIVLGSLLSARAPERLLRNLLAATLVAVGIRLVLA >NC_007510.1|WP_011351115.1|851953_853789_+|peptidase-S1 MTTRKSLKDGFALFGTTLSVPLAAAAAAALLVTGCGGDDGSGPAASAAAAAATSAGASTSANTNATAAAADQPYVDNDVYGTGPNDAVTDSTEGAAVVHRTVTIGGKTIKYTATTGHLTTIDPTTSAPNAKMFYVAYTQDNPDPSKPRPVTFFYNGGPGSSSVYLLLGSYGPKRLQSSFPNFTPPAPYKLLDNPDSLLDRTDLVFINPVGTGYSTAIAPAKNKDFWGTDQDARSIDRFIQRYLTKYSRWNSPKFLYGESYGTARSAVVSWVLHEDGIDLNGITLQSSILDYANALSAPGTFPTLAADAFYWKKTTLNPTPTDLDAYMIQARNYADNTLAPLAQKPNPQDGGFVNVRLNLNLQTAQQMGSYIGTDPTSLIQTFGNPAALGNVPSSDDNPPYTFFLTLVPGTQIGQYDGRANFTGKGIAPYILPNSGSNDPSITNVGGAYTVLWNSYINTDLKYTSTSSFVDLNDQVFNNWDFSHTDPTGANKGGGNTLYTAGDLASTMSVNPDLKVLSANGYFDAVTPFHQTELTLAQMPLDPTLKAQNLTIKNYPSGHMIYLNDASRTALKGDLANFYDGILANRTALQRVLKLQMRTQQLKQQKLQQQGQ >NC_007510.1|WP_011351114.1|849250_851341_+|D-(-)-3-hydroxybutyrate-oligomer-hydrolase MTRLGWGRRMVFGAALAAVAILGACNGDESAERNRLPGFVSGSVRTTAYDGASDDLLTAGLGKTGLGSASAPGFANAARPTSAELRRLAIWSNYRALVDMSANGGYGRFWGPNVDLDGNDTLGEGKIPGTEYLAYSDDGSGSKNVTLLVQVPAGFNPAQPCIVTATSSGSRGVYGAISAAGEWALKRGCAVAYNDKGGGNGAHELMSDTITLIDGTLANAVLAGTSSLFTANVTSGDLATFNSRFPNRYAFKHAHSQQNPEQDWGHVTLQSVEFAYWALNEQFGPLIDGSRHGVRYRAGDIMTIAASVSNGGGASLAAAEQDTRGWITAVVVGEPQINVRMAPNVVVRSGGQPVPSFGRPLADYATLANLLEPCAAASASLAGAPYLSALPVATTQSIRTQRCATLAAAGLVSGADTQSQAADALAQLHAAGYLADSDLLQAPMWDSQAIPAIAVTYANAYTRSRVTDNLCNFSFATTNPATGAVAAPATSPMPAVFGVGNGVPPTAGIDLVFNTGAGVDHRLATPDASFAGALCLRQLWTNGMLGMPANVDAVRVNANLQGKPAIIVQGRSDSLVPVNHASRAYVAQNGISEAGRSQLVFYEVTNGQHFDAFLPVPGFDTRFVPVHYYNLQALNLMWKHLKNGAPLPPSQVIRTVPRGGTPGAAPALTSANLPPISAAPGANAITVGAGAIDVPL >NC_007510.1|WP_011351113.1|848748_849114_+|BON-domain-containing-protein MKSIVLRALGVAAVAACLSGSVYAQSSDAAATEAPAAATSAPKAAAKTAKKANRKLGYAVRKAISKESGENVSNLVVRSKGGAITLEGTMPAQDQIDKAEAAAKGVKGVTSVTNKLTVQQQ >NC_007510.1|WP_011351124.1|865583_865943_+|hypothetical-protein MKRILVAAALTVAIVRPAAAEPPRAGDGKLVDEAHMTLYVFDRDAPGTSACDRACAANWPPALADAYDKATGELSLVARDDGSKQWAYRGHPLYRWKQDRKPGDAGGDGIGGMWHVARP >NC_007510.1|WP_011351125.1|865970_866486_+|sigma-70-family-RNA-polymerase-sigma-factor MSYESDLLVWLPHLTRYARALTGDRAWADDLVQDTLERALNRPPRDCGNLRAWLLTLLRHRFIDQLRARHEIAVDDATAPWQTMAAPAGEIGGLVLRDVQRALYRLPVEQREVLLLVALEELSYRDAAEVLGVPVGTVMSRLARARGQMRALLSDEPSAHGTASLRVIGKT >NC_007510.1|WP_011351126.1|866482_867265_+|anti-sigma-factor MMDDPHKPSNERDDDASAQLLSALLDGELSGQERREVLERVLADPQEAERFAHYRAQRDALQALFPLPGAAPALFVQRRAPRRRAIAYAFAGLAAGLLIGVALHVGWVTFGSEPAFAARADVAYATYAADRDHPVEVGAGDPERLVAWLSARVGRPVRAPSLDEYGYVLIGGRLLPGEAGPAAQLMYQRADGARVTLYMTAYDARRLAPQAMSAGDRYTYFWSDRGMGYALSGQGDERRLRELAIDACGDLGGPTDAWKG >NC_007510.1|WP_041493018.1|867338_867749_+|quinol-oxidase MLVLVLVLVALIAGSALARADGDAPVRVPVDADGVQRVAIVGGSYFFRPNHVIVRAHVPVELTVSAEPGLVPHSFEIDAPQAGIAVHTELATTPKTFRFTPAQPGRFAYYCTHRLLFFRSHRERGMEGVLDVEAAP >NC_007510.1|WP_041492753.1|867745_868378_+|DUF1571-domain-containing-protein MIAALLAASLIGASMTADPVTVAQAHFDHVRSYRATIRSSARSGEHTEIRYAYLKPGFVRMDFVSPHHGAVLAYDPGDGKVRLRPFGAHAPPALTLSPSNPLVRDRSGHRVDRSDVGELLRNVHALQEGGATVTEGEEAVGGRTALRVSVTGAPAHVVDGVHRYRLWLDTEDGFPLKVVSFADDDDVPLETVTLDDVEIDVAFPERFFAP >NC_007510.1|WP_011351129.1|868422_868938_+|SRPBCC-family-protein MAEYRFSTTWRVDAPLAVVWDAIYQVDRWPDWWKGAVSTVEIEPGDARGVGALHRYTWKGALPYRLTFDMRVRRVEPPHALEGRASGAIEGDGCWSFIADGARTIVRYDWHIRTHVRWMNRLEPLGRPLFRWNHDVVMREGAKGLARLLGAAVETEGRTFRPLSDAGCADA >NC_007510.1|WP_011351130.1|868993_870043_-|ABC-transporter-ATP-binding-protein MSELRIHGLSKSFGSHTVLHDIDLTVRRGSRLALLGPSGSGKTTLLRVLCGFERAERGTVEIDGRRVAAPDVHLPPEQRRIGYVPQEGALFPHLSVADNIAFGLPRAARRRHHRVDELLEMVGLPASYASRAPQQLSGGQQQRVALARALAPEPSLVILDEPFSALDTSLRHETRSAVADALAAAGATSVLVTHDQPEAMTLGDEVAVMWHGRLVQTAAPQMLYRQPVTRDVASFVGQAVLLAGHVRSMRIACALGELPLVAPLPDGPADAMLRPEQIGIVPVTHADAARGHTARVEAAQFAGQDADVLVRLDADGDVLVRARVPGHRCPAVGERVALVVDGAVMAYAR >NC_007510.1|WP_011351131.1|870071_871670_-|iron-ABC-transporter-permease MSDAVSPTASTARDGEHTATRRRPSRALVAAAACGPLAILLPLGFTLYRAATFGVDDAIELLWRPLVGELLVNTLSITLAATLACTLLGTALAWFIERTDLPARPLWTALAAAPLAVPPFITSYAWVSLSLDLQDFLGALIVLTSAYFPLVYLPVAAALRELDPALEESARTLGCSPWHTFFRVVLPQLRPALCGGMLLVALGVLSEFGAFQLLRFRTFTTEIYAEYRTAFDGGGASLLGCVLIALCLLCLAIEARARGHARYGHTHRAARRVAMRYPLGYLRGPVTVAFAALAAATLGVPLAMIGYWLTQQGAAAVTPADVSPELLWQATLASSGYGLMAAAVTTLLALPLAFLLVRYPGRVATLLERTAMTVQGVPGIVVALAIVSITVRLLQPLYQSAPALVGAYAILFLPLAVVSVRAALSHVQVRLEETARSLGLGWRATLTRVVLPLAAPGLGAAATMVFISVVTELNATLLLSPIGTRTLATQVWSDTATLAFAAAAPYVALLVALSLGASGLLFLLLGRASVRD >NC_007510.1|WP_011351132.1|871672_872707_-|iron-ABC-transporter-substrate-binding-protein MNTSPVSRPLRHLISAAAAALMLTGALHAARAHAATLTLYNAQHEQVVNQLVKDFEAQSGITVKVRSGEGPALAAQLVAEGDRTPADVYFTENSPELVLLDRKGLFAKTDAATLQSVPARFNPADGNWVGVLARENVLVYNTAKIQPQQLPASLLDLAKPEWKGKVGVAPSDADFLPLVSAVLALHGEAATLQWLKGLKANAQIFDDDEGVTAAVNRGGVLTGIINNYYWDRLHAELGDKSTRSAIYHFGKGDVGGAVNVSGAAVLKASKHQPEAQKFVAYLVSERAQKLMAGGHISFEYPLHPGVAPDPILKPFNELSPPALTFEQLGDDSQAGKLLRQAGLL >NC_007510.1|WP_011351133.1|872773_874084_-|deferrochelatase/peroxidase-EfeB MADDSNQPPRPTRRGFLKAGGAAVAAGLAAASIPAAKAADAPAAAPAPAPASAHDGVEPFYGKHQSGIATPQQRHAYFAALDLTTAQRADVIALLKTWTDAAARMARGDTALPLATTGNDEVAPADGGDALGLGPARLTITFGFGPGMFALAGKDRFGLAKHRPAALVDLPRFNGDQLLPEKTGGDLFIQACADDAQVAFHAVRQLVRLGAKATQMRWGQAGFTSGKPGETPRNLMGFKDGTMNPPMSDPAAMDEFVWAGSEGPAWMNGGTYTVVRRIRITLEHWDNTELGFQEQVVGRHKYSGAPLGQKHEFEALDLDAADKDGNPVIPDNAHARLASPQLNNGAQILRRAYSYNDSTSFYIERWPPWRQQTEYDAGLMFVAHQRDPRKGFIPINEKLAKMDIMNQFTTHVGSAIFACPPGAQPGSYIGAALFEA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_007510_2 | 3000899-3000971 | Orphan |
NA
Consensus repeat of NC_007510_2
|
1 spacers
spacers of NC_007510_2
>2.1|3000922|27|NC_007510|CRISPRCasFinder AAAGCGCCAAGGGCGGTCGACCGCCTT |
CRISPR arrays and Neighbor proteins around NC_007510_2
The CRISPR arrays of NC_007510_2 >merge|NC_007510|2|3000899-3000971|CRISPRCasFinder GGCATGGGACCTCCCTAAGCGCTAAAGCGCCAAGGGCGGTCGACCGCCTTGGCATGGGACCGCCCTAAGCGCT >NC_007510|2|1|3000899-3000971|CRISPRCasFinder GGCATGGGACCTCCCTAAGCGCT AAAGCGCCAAGGGCGGTCGACCGCCTT GGCATGGGACCGCCCTAAGCGCT
>NC_007510.1|WP_011352990.1|2999997_3000660_+|adenylate-kinase MRLILLGAPGAGKGTQANFIKEKFGIPQISTGDMLRAAVKAGTPLGVEAKGYMDAGKLVPDALIIGLVKERLKESDCANGYLFDGFPRTIAQADAMKEAGVAIDYVLEIDVPFSEIIERMSGRRTHPASGRTYHVKFNPPKVEGHDDVTGEPLIQRDDDKEETVKKRLEVYEAQTKPLITYYGDWAQRGEENGLKAPQYRKISGLGSVEEIRERAFGALK >NC_007510.1|WP_011352989.1|2999003_2999789_+|3-deoxy-manno-octulosonate-cytidylyltransferase MTQPFIAVIPARLASTRLPNKPLADLGGKPMVVRVAERAREAGAQQVLVASDAQSVLDAARDHGFEAVLTRADHPSGTDRLAEVAATLGWSDDTVVVNVQGDEPLIDPVLVRDVASHLAAHPACAIATAAHPIHDAADVFNPNVVKVALDAQSVALYFSRAPIPWSRDAYQPHWPDVAAMPAPAFPVYRHIGLYAYRARFLRTYPTLAQAPIEQAEQLEQLRALWHGERIAVLITESAPEAGIDTPADLARVQALFQPSSK >NC_007510.1|WP_010090898.1|2998697_2998892_+|Trm112-family-protein MDARLLEILVCPICKGPLHYDRAAQELICNADKLAYPIRDGIPVMLVDEARQTVEGTPVDPAGR >NC_007510.1|WP_011352988.1|2997688_2998717_+|tetraacyldisaccharide-4'-kinase MSAPDGPLARLEARLTREWQRRGALAWALTPFACVFGLCAALRRTAYAQGWKKPVDVGVPVVVVGNVTVGGTGKTPTVIALVDALRAAGFTPGIVSRGYGANVKAPTAVTTASRASAGGDEPLLIARRTGAPVWVCPDRVAAAQALRAAHPDVDVIVSDDGLQHYRLARTVELVVFDHRLGGNGFLLPAGPLREPLSRHRDATLVNDPYSGALPPWPDTYSLALTPGAAWHLDQPALRRPLSQFANERVLAAAGIGAPERFFATLRSAGLAPATRALPDHYAFADNPFVDDAVDAILITEKDAVKLGASWRDARLWVVPVEAALDPRLIALVVEKLRGRSPA >NC_007510.1|WP_011352987.1|2995855_2997238_-|exodeoxyribonuclease-VII-large-subunit MPSDSPFAAPGATRGGDEVIPVSALNRAISTMLERSFPLLWISGEVSNFTRAASGHWYFSIKDQQAQMRCVMFRGRAQYAEFTPREGDRIEVRAVVTMYEPRGEVQLNVEAVRRTGQGRLYEAFLRLKAQLESEGLFAPERKRPLPAHPRAIGIVTSLQAAALRDVLTTLARRAPHIPVIVYPAPVQGAGSADKLVTAVETANARREVDVLLVCRGGGSIEDLWSFNDEALARAIAASELPVVSGVGHETDFTIADFAADLRAPTPTGAAELASPQRALLLREVGERQRALARGMERRLEQRAQQLDWLARRLVSPAERLQRQRTHVEQLAVRLASAASRPVRDARARFALAQLRWQRARPDPSQARQALAGLSQRLALALQRRHERDTARVSACAARLEVLSPQRTLERGYAALIDAQTGRAVRTPNALKPQRRLTVHLAEGSADVSLADVQPRLSDTI >NC_007510.1|WP_011352986.1|2995049_2995628_-|superoxide-dismutase MAHTLPPLPYAEDALAPTISLETIQYHYGKHHQAYVTNLNNLIPGTEFENLSLEEIVKKSSGGIFNNAAQIWNHTFFWNSLSPNGGGAPTGALGDAINAKWGSYDAFKEAFTKAAVGTFGSGWAWLVKKADGSLDIVSTSNAATPLTTADKALLTIDVWEHAYYIDYRNARPKFVEAFWNIVNWDFAAKNFA >NC_007510.1|WP_011352985.1|2993729_2994941_+|chromate-efflux-transporter MSTSTASAPSRHPWPVFVAFLRLGLTSFGGPVAHLGYFRTEFVTRRGWLTERAYADLVGLCQFLPGPASSQVGMAIGLARAGYAGMFAAWLGFTLPSALLMMLFALGVHATGAPIEAGALHGLRIVSVAVIAQAVWGMARTLCPDARRATLMAAAACVALLAPAAWTQVAVIVAAGLAGLVLLPQPARGAHDPLPLHVSPRAGMLWLALFAALLVVLPFAARALRSDTLAVVDAFFRTGALVFGGGHVVLPLLQAAVVAPGWVGDAAFLAGYGVTQAVPGPLFTFSAFLGASLRNAPNGWLGGTIALVSIFAPSFLLVAGTAPFWERLRRSTRMQAALAGVNAAVVGLLVAALYHPVWTDTIVAPRDLAAALVAFVALVFWRVPPWAVVIASAALGWGLGVAA >NC_007510.1|WP_011352984.1|2992935_2993643_+|Crp/Fnr-family-transcriptional-regulator MPSSLAPYLPQIEANPWFAALPPALRADLLARAALRRLPAGHALFRRGDPPCGLYAVLAGSLTIGAVDPQGKEALLTVAEPVTWFGEIALFDGQPRTHDAIALDDALLLHVPQAALLAILDATPQYWRQFALLMAQKLRLSFLTVESMSVMPAAQRLAARLLMIADGYGGISAGRTHIRLSQEKLAAMLSLTRQTTNQLLKALQADGVVRLHVGEIELVDVDALRRASGLPDSLR >NC_007510.1|WP_011352983.1|2992329_2992854_-|DUF962-domain-containing-protein MKTLEDHLAQYAAYHRDARNIATHLVGIPMIVFAVEVLLSRPALGMTAGVALSPALLLAVVFALFYLRLDLRFGIVMTVLFALSLWAAQALALLPTAQWLAIGIGAFVVGWIVQFVGHWFEGRKPAFVDDLVGLMVGPLFVVAEVAFFAGLRADVRREVERRAGPVHGGAHSHV >NC_007510.1|WP_011352982.1|2991290_2992337_-|2-dehydropantoate-2-reductase MSDAPVCVFGAGAVGCYLGGRLAAAGANVTLVGRARIGDAIHRHGLTLTDQRGYRATLAPADVVFETDPAAAAAARLVLVAVKSAATREAAAQLAGVLRPGTVVISFQNGLHNADVLREVLPQATVLAGMVPFNVIEREPGAFHQGSAGALAAEASPTLQPFAGAFARAGLPLALHRDMPAVQWAKLLLNLNNAVNALANLPLRDELAQRAYRRCVALAQREALHWLARAAIRPARLTPLPAGWIPAVLDLPDPAFRVLGGRMLAIDPLARSSMSDDLAAGRATEVEWINGEVVRLAARFGGQAPVNARLCALVHDAERAAARPAWRGEALWAELTAQVPRAGDVRAA >NC_007510.1|WP_011352991.1|3001005_3001764_+|SDR-family-NAD(P)-dependent-oxidoreductase MEIRGNVFLITGGASGLGAGTARMLAQAGGTVVLADLNDAAGTALAAELGGIFVHCDVSSEADAQAAVNAATRAGTLRGLVNCAGIAPAAKTVGKDGAHPLDVFAKTINVNLVGTFNMIRLAAAAMAATAPTADGERGVIVSTASVAAFDGQIGQAAYAASKAGVAGMTLPIARDLSRSGIRVMTIAPGLFETPMLLGMPQDVQDALGAMVPFPPRLGKPAEYALLVRQIVENPMLNGEVIRLDGAIRMQPK >NC_007510.1|WP_011352992.1|3001827_3002670_-|SirB1-family-protein MTRVLDYFSTLVADDDSLPVTEAALSLAQDAYPDLDLQGTLAELDMLAARLRRRFTDDADLKGRVAALNDFFFRELGFACNHNDYYDPDNSHLNAVLKRRRGIPISLSVLYLELAEQIGVPARGVSFPGHFLLRVTLPDGDLIIDPANGHSLSEAEMVEMLEPYVARAAGAVDSALRALLQPATSREIIARMLRNLKTIYLQTERWQRLLAVQQRLVILLPEHLDEVRDRGFAYARLDYLRPALEDLEQYLGERPEADDATVVESQVIELRQRMQRDGED >NC_007510.1|WP_011352993.1|3002678_3004229_-|murein-biosynthesis-integral-membrane-protein-MurJ MNLFRALLTVSGFTLLSRVTGLARETLIARAFGASQYTDAFYVAFRIPNLLRRLSAEGAFSQAFVPILAEFKNQQGHDATKALVDAMSTVLAWALAVLSVFGIVGASWVVFAVASGLHTDGQAFPLAVTMTRIMFPYIVFISLTTLASGVLNTYKSFSLPAFAPVLLNVAFIAAAVFVAPHLKVPVFALAWAVIVGGVLQFLVQLPGLKKIDMVPLIGLNPLRALRHPGVKRVLAKMVPATFAVSVAQLSLIINTNIASRLGQGAVSWINYADRLMEFPTALLGVALGTILLPSLSKAHVDADSHEYSALLDWGLRVTFLLAAPSALALFFFATPLTATLFNYGKFDAHTVTMVARALATYGIGLVGIILIKILAPGFYAKQDIKTPVKIAIGVLIVTQISNYVFVPLIGHAGLTLSIGVGACLNSLLLFIGLRKRGIYQPSPGWLRFFVQLIGAALVLAGLMHWLSISFDWTGMRAQPLDRIALMAACLVVFAALYFGMLWVMGFKYAYFRRRAK >NC_007510.1|WP_011352994.1|3004533_3004806_+|30S-ribosomal-protein-S20 MANSAQARKRARQAAKANSHNSALRSKFRTAIKAVRKAVDAGDQAKAADLFKAAVKTIDTIADKKIVHKNKAARSKSRLAAAVKGLQAAA >NC_007510.1|WP_011352995.1|3004959_3005280_-|DUF3579-domain-containing-protein MAETPPTEFFIQGITKDGKKFRPSDWSERLAGVMACFGPGASGRNARLQYSLYVRPTMLGDLKCVILDSRLRDVEPMAFDFVLNFAKDNNLVVTEACELPDYGEKK >NC_007510.1|WP_011352996.1|3005693_3006623_+|ornithine-carbamoyltransferase MTAKTIRHYLQFTDFSLEDYEYVLERTGILKRKFKNYETYHPLHDRTLAMIFEKSSTRTRLSFEAGIFQLGGHAVFMSTRDTQLGRGEPVEDSAQVISRMVDIIMIRTFEQEVIKRFADNSRVPVINGLTNEYHPCQVLADIFTYYEHRGPIAGKTVAWVGDANNMLYTWIEAAQILGFKLRLSTPPGYALDMKLVSPDSAPFYEVFEDPNEACKGADLVTTDVWTSMGFEAENEARMQAFADWCVDEEMMGHANPDALFMHCLPAHRGEEVTAGVIDGPQSVVWDEAENRLHVQKALMEFLLLGRLKH >NC_007510.1|WP_011352997.1|3006713_3007763_-|UDP-N-acetylmuramate-dehydrogenase MPMPLDDSTLSLLPDHPLAAHNTFGIAATARFAARITHASQFEALHRDPRVAHLPQLVLGGGSNVVFTRDFDGVVLLDEIAGRRVVREDDDAWYVEAGGGENWHAFVAWTLEHGMAGLENLALIPGTVGAAPIQNIGAYGLEMKAYFDSLVAVELATGCSERFDAARCAFGYRDSFFKREGRGRFAIVSVTFRLPKQWVPRLGYADVTRELDARGIAPDAATARDVFDAVVAIRRAKLPDPLVLGNAGSFFKNPVIDAAQFDALRARAPEVVSYPQPDGQVKLAAGWLIDRCGWKGRALGAAAVHDRQALVLVNRGGATGADVLALARAIQADVQTQFGVELEAEPVCL >NC_007510.1|WP_011352998.1|3007900_3008386_+|YajQ-family-cyclic-di-GMP-binding-protein MPSFDVVSEANMIEVKNAIEQSNKEISTRFDFKGSDARVEQKERELTLFADDDFKLGQVKDVLIGKLAKRNVDVRFLDYGKVEKIGGDKLKQIVTVKKGVTGDLAKKIVRLVKDSKIKVQASIQGDAVRVNGTKRDDLQSVIAMLRKDVTDTPLDFNNFRD >NC_007510.1|WP_011352999.1|3008466_3009105_-|glycerol-3-phosphate-1-O-acyltransferase-PlsY MQILLAALVAYLIGSVSFAVVVSGAMGLADPRSYGSKNPGATNVLRSGNKKAAILTLVGDAFKGWIAVWLARHLGLPDVAVAWVAIAVFLGHLYPVFFRFQGGKGVATAAGVLLAVHPVLGLATALTWLIVAFFFRYSSLAALVAAVFAPVFDVFLFGTGHNPVAWAVLAMSVLLVWRHRGNISKLLAGQESRIGDKKKAAADGGAQDGGKA >NC_007510.1|WP_011353000.1|3009213_3009705_-|Cys-tRNA(Pro)-deacylase MSKSRHVSETPATQLLRRHGVAFGEHPYDYVEHGGTGESARQLGVDEHSVVKTLVMEDEHAKPLIVLMHGDRTVSTKNLARQIGAKRVEPCKPEVANRHSGYLVGGTSPFGTRKAMPVYVEATILELPTIYLNGGRRGYLVSLAPAVLTSLLGAQPVQCASVD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NC_007510_1 | 1.1|864066|49|NC_007510|CRT | 864066-864114 | 49 | NC_007510.1 | 863934-863982 | 0 | 1.0 |
1. spacer 1.1|864066|49|NC_007510|CRT matches to position: 863934-863982, mismatch: 0, identity: 1.0
ttcgaacccgctggcgccggtccaaggcgtcgtgaaccaagtgacgggc CRISPR spacer ttcgaacccgctggcgccggtccaaggcgtcgtgaaccaagtgacgggc Protospacer *************************************************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_007510_2 | 2.1|3000922|27|NC_007510|CRISPRCasFinder | 3000922-3000948 | 27 | NZ_CP007794 | Azospirillum brasilense strain Az39 plasmid AbAZ39_p1, complete sequence | 993183-993209 | 5 | 0.815 |
1. spacer 2.1|3000922|27|NC_007510|CRISPRCasFinder matches to NZ_CP007794 (Azospirillum brasilense strain Az39 plasmid AbAZ39_p1, complete sequence) position: , mismatch: 5, identity: 0.815
aaagcgccaagggcggtcgaccgcctt CRISPR spacer ggagcgccatgggcggtcgatcgccat Protospacer ..******* **********.**** *
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
744669 : 753761
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_007510|744669:753761|DBSCAN-SWA AATGGGAAAGATCATCGGTATTGACCTCGGCACCACGAACTCGTGCGTCGCCATCATGGAAGGCAACCAGGTCAAGGTCATCGAGAACTCGGAAGGCACGCGCACCACGCCGTCGATCATTGCCTACATGGACGACAACGAAGTGCTCGTCGGCGCGCCGGCCAAGCGTCAGTCGGTGACCAACCCGAAGAACACGCTGTTCGCGGTGAAGCGCCTGATCGGCCGCCGCTTCGAAGAGAAGGAAGTCCAGAAGGACATCGGCCTGATGCCGTACACCATCGTCAAGGCCGACAACGGCGACGCATGGGTTGAAGCACACGGCGAAAAGCTGGCGCCGCCGCAGGTTTCGGCGGAAGTGCTGCGCAAGATGAAGAAGACGGCCGAAGACTACCTCGGCGAGCCGGTCACGGAAGCCGTGATCACGGTGCCGGCGTACTTCAACGACAGCCAGCGCCAGGCAACCAAGGACGCCGGCCGCATCGCGGGCCTCGAAGTCAAGCGGATCATCAACGAGCCGACCGCAGCCGCGCTCGCGTTCGGCCTCGACAAGGTCGAGAAGGGCGACCGCAAGATCGCCGTGTATGACCTCGGCGGCGGTACGTTCGACGTGTCGATCATCGAAATCGCGGACGTCGACGGCGAAATGCAGTTCGAAGTGCTGTCGACCAACGGCGACACGTTCCTCGGCGGCGAAGACTTCGACCAGCGCATCATCGATTACATCATCGGCGAGTTCAAGAAGGAGCAGGGCGTCGACCTGTCGAAGGACGTGCTCGCGCTGCAGCGCCTGAAGGAAGCCGCTGAAAAGGCGAAGATCGAGCTGTCGTCGAGCCAGCAGACCGAAATCAACCTGCCGTACATCACGGCAGACGCGTCGGGCCCGAAGCACTTGAACCTGAAGATCACCCGCGCGAAGCTGGAAGCGCTGGTGGAAGACCTCGTCGAGCGCACGATCGAACCGTGCCGCATCGCGATCAAGGACGCAGGCGTCAAGGTGTCGGACATCGACGACGTGATCCTGGTCGGCGGCCAGACGCGCATGCCGAAGGTGCTGGAGAAGGTGAAGGAATTCTTCGGCAAGGATCCGCGTCGTGACGTGAACCCGGACGAAGCTGTCGCAGTCGGCGCGGCGATCCAGGGCCAGGTCCTGTCGGGCGACCGCAAGGACGTGCTGCTGCTCGACGTGACCCCGCTGTCGCTCGGCATCGAGACGCTCGGCGGCGTGATGACGAAGATGATCAGCAAGAACACGACGATCCCGACGAAGCACGCTCAGGTGTACTCGACGGCGGACGACAACCAGGGCGCCGTGACGATCAAGGTGTTCCAGGGCGAACGCGAAATGGCAGCGGGCAACAAGCTGCTCGGCGAGTTCAACCTCGAAGGCATCCCGCCCGCACCGCGCGGCGTGCCGCAGATCGAAGTGACCTTCGACATCGACGCGAACGGCATCCTGCACGTCGGCGCGAAGGACAAGGCGACCGGCAAGGAAAACAAGATCACGATCAAGGCGAACTCGGGTCTGTCCGAAGCCGAAATCGACCAGATGATCAAGGACGCGGAAGCGAACGCAGCGGAAGATCACAAGCTGCGCGAGCTGGCTGATTCGCGCAACCAGGGCGACGCGCTGGTTCACAGCACGAAGAAGGCGCTGACCGAGTACGGCGACAAGCTGGACGCGGGCGAGAAGGAAGCCATCGAAGCGTCGCTGAAGTCGCTCGAGGAACTGCTGAAGGACTCGTCGGCCGACAAGGCTGCGATCGACGCGAAGGTCGAGGAGCTCGGCAAGGTGTCGCAGAAGCTCGGCGAAAAGATGTACGCCGACATGCAGGCCCAGCAAGCAGGTGCGGCCGGCGCAGCGGGTGCAGCGGAAGGCGCGGCCCACGCAGGCGGTGCACAACAGGCTGCCGACGACGTCGTCGACGCCGAGTTCAAGGAAGTGAAGAAGGACTAAGCCGGGTTGCCCCGGTGCCGCACGCCGCGCGCGCAACCCGCGCGCGGCGCGGCTGACGGGTTGGCTGACAAGCGGTCTGTCTTTCCTTCCGGATGGCCGGATCGACTCCACGCCTGGCGGGCTTCGCGGCCCTCCGGGCACATTTGTTTTTTGGCGGGTGCTGCACGAGCCGTCCGCGGACGGGCGCAGTGCCAGACGAGTCGAGAGACTTTACTGCAGGCGAAAGGAGCCGCCGCGTGCGCGATGCGCGGCTGCACAGTGAATCGATATGGCGAAACGGGATTACTACGAGGTTCTGGGCGTCGCGAAGAATGCGGGCGACGACGAAATCAAGAAGGCGTATCGCAAGCTTGCGATGAAGTATCACCCTGACCGCAATCCGGACAACAAGGATGCGGAAGAGCATTTCAAGGAGGTGAAGGAAGCCTATGAAATGCTGTCGGACGGCCAGAAGCGGGCAGCGTACGACCAGTACGGCCACGCGGGCGTCGATCCGAACATGGGCGGTGCGGGCGCACAGGGCTTCGGCGGTTTCGCGGACGCGTTCGGCGACATCTTCGGCGACATCTTCGGCCAGGCAGCGGGCGGTGCCGCGCGCGGCGGCCGTGGCGGCCCGCAGGTGTATCGCGGTGCCGACCTGCGCTACAGCATGGAAATCACGCTCGAGCAGGCCGCACACGGCTACGACACGCAGATCCGCGTGCCGAGCTGGGTATCGTGCGAGGTCTGCCACGGGTCGGGCGCGAAGCCCGGCACGAAGCCGGAAACCTGCCCGACCTGTCACGGCCAGGGCACGGTGCGCATGTCGCAGGGCTTCTTCAGCATCCAGCAGACCTGCCCGAAGTGCCACGGCACGGGCACCTACATCCCCGAGCCGTGCGTGCATTGCCACGGGTCGGGCAAGGTGAAGGAAACCAAGACGCTCGAAGTGAAGATCCCGGCCGGGATCGACGACGGGATGCGGATCCGCTCGGCCGGCAATGGCGAGCCGGGCATCAACGGCGGGCCGCCGGGCGACCTGTACGTCGAGATCCACATCAAGCCGCACTCGGTGTTCGAGCGCGACGGCGACGATCTCCACTGCCAGATGCCGATCCCGTTCACGACCGCCGCACTCGGCGGCGAGATCGAGGTGCCGACGCTGGCCGGCCGTGCGTCGTTCCCGGTGCCGGAAGGCACGCAGTCGGGCAAGACGTTCCGCCTGCGCGGCAAGGGCATCAAGGGGCTGCGTTCGAGCATCGCGGGCGATCTGTACGTTCACGTGCAGGTCGAGACGCCGGTGAAGCTGACCGACAACCAGCGCGACCTGCTCAAGCAGTTCGAGAAGTCGCTGGCCGAGGGCGGCGCGCGTCACAGCCCGCAGAGCAAGAGCTGGTTCGACCGTGTGAAGAGCTTCTTCGAGTAACTCGAGTAACAGCATGACTGAAGGTAACGAGAGCGCGTCGTTCGCGCTCTTGGACGATTGCGACTCGACCGCGCTCGCGCGGTCGAGTCGTTTGTATTCGGGGTTCGTGCGCGAACGTGTGTGCACGGATCCGGCTCGACTCGACGAGGTCGACGCAGCCGTGGCGCAGGATCTGCGCGACGGGCTGCATGCGGTCGTCGTCGGCGATTACGAATTCGGACGCAATCTGCAACGAGCGCAGCCGGGCCATGCCCCGCTGCGCTTTTTGCTGTTTGCGCGCTGCGAGCGCCTGTCGCGGGACGAAGTCGACGCGTGGCTCGCGCAGCGGGACGGCGGCGGCACGCCGTCGATCGCGGGCGTCGCGCATGTCGCGAAGAGCGTGTCGCGCGATGCGTTCGACGTGGCGATCGCCGCGGTGCACGACGCGCTGCGCGCAGGCGATTCGTATCAGGTCAACTACACGTACCGGCTGAACTTCGACGTGTTCGGCACGCCGCTCGCGCTGTACCGGCGGCTGCGTGCGCGTCAGCCCGTGCGCTACGGTGCGCTGATCGCGTTGCCCGACGGCACGTGGGTCGTGTCGTGCTCGCCCGAGCTGTTCGTCGAGAAGTACGGCGACGTGCTGCGCGCGCGGCCGATGAAGGGCACCGCGCCACGTTCGGCCGACCCGCGCGACGATGCGGCCGCGGCCACGTTCCTTGCGAACGATCCGAAGAACCGCGCGGAAAACGTGATGATCGTCGACTTGCTGCGCAACGACGTGTCGCGGATCGCGCGCACCGGGACGGTCCGCGTGCCGGCGCTGTTCTCCGTCGAGCCGTATGCGTCGGTGTGGCAGATGACGTCGACGGTCGAGGCCGGCTGGCGCGACGGAACGACGTTCGCGCAGATGCTGCGCGCGCTGTTTCCGTGCGGATCGATCACGGGCGCGCCGAAGCACAAGACGATGCAGCTGATCGATGCGATCGAGTCGACGCCGCGCGGGCTCTATACGGGCGCGATCGGCTGGCTTGACGCTGCGAAACAAGGCGCGGATTCCGACGCGCCAGGTGATCGCCTGGCAGGTTGCGGCGATTTTTGCCTGTCGGTCGCGATCCGTACGTTGACGCTCGATGCGGCCGGCGAAGGCGATGATCGTGGAGGTGCAACGCGAGCCGACGTCGAAGCACGCCAACCGGCAACGGCAATCGCCGGCCGGCGCCGCGGCACGATGGGTGTCGGCGCGGGCATCGTGCTCGACAGTGTCGCGGCCGACGAATATGCGGAGTGCGAATTGAAAGCGCGATTCCTGACGGATGCCGATCCCGGCTTCCAGCTGTTCGAAACGACTGCCGCCACGCGTGCGGACGGCATACGGCATCTCGATCGCCATCTCGCGCGGCTGCAGCGTAGCGCGGATGCGTTCGGCTTCCGTTTCGACACCGATGCATTGCGTCGCGAGATCGACGCGCGTTGTGCGGCGCTCGACGGCGACGGCGCATACCGGATGAAGCTCTCGCTCGCGAAGGACGGCACGATCGAGATCGTCGCGGCACCGCTCAAGCCGCTGCCGGCGGGGCCGGTCGGCGTGCTGCTGGCGTCCGCGCACGGCTTCGCACCGACCCGTACGAGCGATGCGCTGCTGCTGCACAAGACCACACGCCGCGCCGAATACGATCGCGCGTGGCAGGCGGCGGAGGCGCTTGGCGGCTTCGACATGCTGTTCGTCAACGAGCGCGGCGAGGTGACGGAAGGCGGGCGCTCGAACCTGTTCGTGAAGCTCGACGGCCAGTGGGTGACGCCGCCGCTCGAGTCGGGCGTGCTGCCGGGCGTGATGCGCGGCGTGCTGCTCGACGATCGTGCGTTCAGCGCGACGGAGCGGGTCGTGACCCGCGACGATCTCGCGCGTGCGGAGGCGCTGCTGCTGACCAACGCGCTACGCGGCGCGCTCGACGCGGTACTGAAGTGAAATAACCGCCAGATAATCGATCAGCGCTGGCAGCGCAGGAAAGAAAAAGCGCGGCGCCCCGCAAAGGGCGCCGCGCTTTTTTCATCGAGGCGCAGGCAAACGCATCGCCGGCGCCGCGACAGACGTCAGAACGAATGCTCGGGGCCGGGGAACGAACCGTCCTTGACCGCGCGCACGTAAGCTTCGACGGCCGCCTGGATGTTCGGCTCGCCCTGCATGAAATCCTTCACGAAGCGCGGCCGCTTGCCGGGGAACACGCCGAGCATGTCGTGCAGCACGAGCACCTGGCCCGAGCAGTCGGCGCCCGCGCCGATGCCGATCGTCGGCACGCGCAGCATGTGCGTGACCTCGGACCCGATCAGCGTCGGCACGGCTTCGAGCAGCACGACCTGCGCACCCGCCGCTTCGACCGCGCGCGCGTCGCGCAGCAGTTGCGCGGCGCCGGCTTCGGTCTTGCCCTGTACCTTGAAGCCGCCGAACGCGTGGACCGACTGAGGCGTGAGGCCCACGTGCGCACATACGGGCACCGCGCGCTCGACGAGGAAGCGGATCGTCTCGGCGAGCCATTCGCCGCCTTCGAGCTTGACCATCTGCGCACCGGCGCGCATCAGCTTGACGCTGCTTGCGAATGCCTCGGCCGGCGTGCCGTACGTGCCGAACGGCAGGTCGGCCATGATCAGCGCGCGCGGCTGCGCGCGGGCAACACAGGCGGTGTGATACGCGATGTCGTCGAGCGACACGGGCAGCGTGGTCGTGTGGCCTTGCAGCACGTTGCCGAGCGAATCGCCGATCAGCAGCACGTCGACGCCCGAACGGTCGAGCAGTGCGGAAAAGCTCGCGTCGTAGCAGGTGAGCATCGCGATCTTCTCGCCGGCGTCGCGCATTGCCTGCAGCTTGGGCACCGTGACGGCAGGCCGGCTCGATTCCTGGAGATAGGTCATGGGAAATCCGGTTCGAATGGGTGAAGAACGACAGGCGAGGACGCGTCAGCGCGTCGTCTCGCCCTTGACGAAGAATTCCTTGCGGCCGCGCATCGTCTCGATGCGCTCGACCAGCAGGGCGAGATCTTCGGGGGAATCCAGCGGGTTCAGGTGTTCCGCGGCGACCGTCAGCACCGGCGTGCGGTCGTAGTGGTAGAAGAATTCGTTGTACGCGTCGACGAGCGAGCGCAGGTACGCGTCGCTGATCTGCAGCTCCATCGGCAGCCCGCGCTTCTGGATGCGCGAGAACAGCACTTCAGGGCTCGCCTGCAGATAGACGACGAGGTCGGGCGACGGCGCCTGCGGCACGTCCACGTGCGTCGCAACCGAGCGGTAGAGCTGCCACTCGTCTTCCGGAAGGTTCAGCCGCGCGAAGATGTCGTTTTTCTGCGGCATGAAATCGGCGATGACCGGGCGGCCCGTTTCGAGCGCACCGATCAGCTCGCGCGCCTGCTGCGCACGCTGCAGCGCGAACGACAGCTGCACGGGCAGCGCGTAGCGCGCGGTGTCGCGATAGAAGCGTTCGAGGAACGGGTTGTCCTGCGGGCGCTCGAGCAGCGTCTGCATCGACCAGCGTTCGCCGAGCAGGCGCGCGAGCGTCGTCTTGCCGACTCCGATCGGGCCTTCGATCACGAGATAGCGATGCGGGGCGCGCAGGTCGGGCGCAGTAACGGTAAGGGGAGTCAGGGTCATCGGCAGCGGTTCTTGTCGGCGTCTTCGCTGGCAGCGAGCGCCTTCTGCATCAGGCACTGGCAGGTCTGCACCTTCTCGACCCGCTGATCAGCGACGGCGGCGAGGAAGGCATCGGCACGGCCGCGTGCGGGGATGTCGAGTGCCGGCTCGATCTCGACGAGCGGCACGAGCGCGAACGCACGGTCGGTGAGGCGCGGGTGCGGGACGATCAGGTCGGGTTCGTCGATCGAATCGTCGCCGAACAGCAGGATGTCGAGATCGAGCGTACGCGGCGCGTTGCGATACGGACGCTCGCGGCCGAAGTGATGTTCGATCTTCTGGCAAAGCGCGAGCAGCTCGCGGGCCGACAGCGTCGTGTCGAGCTTGACGACGCAGTTGTAGTAGTCGTCGCCGCCCGCTTCGAAGGGCGCCGTGCGATACAGGCTCGATTTGCCGAGGATCGAGATGGTGCGCTGCTGCGCAAGGCACACCACCGCGTCCTTCAGGGTCTGGCGCGCATCGCCGAGATTCGCCCCCAGTCCGATATATGCAACCGTCATGGCATCACTTCCTACGTCGTTCGCCGGCGAGCGCGTCAGTCGTCGGAACCGTCCGAGGCGTCCGGTGCCTGCTCGGCAGCACCTTCACCCGGCTTGCGGTTTCGCACGCCGCCGCGGCGGCGCCGCTTGCGGGGCGATTTTTCCTTCGTGCCGCCCTGCGTGAGCAATGCCTCGCGAGCGGCCGCGTCGCCTTCGATGAAATCCGTCCACCACTGTCCGACTTCCGCATCAAGCTCGCCGGATTCGCAGCGTAACAGGAGGAAATCATACCCCGCTCTAAACCTTTGGTGTTCCAGCAGCCGCATCGCGCTGCGGCCCGAGCGCTTCTCGAGGCGCAGCTGCAGGCCCCAGATCTCGCGCATGTCGGCCGAATAACGCTTGTGGATCGCGAGTTTCTCGGTCTGCATGTCGAGCACGTCGTCCATCGCGCGATGGAGCGCCGGCACCGGGATTTCGCCTTCGGCCGTGTATTGCTCGAAGCGCTGGCGCATGTCGTGCCACAGCAGCGTCGCGAACAGGAAGCCCGGCGAGACCGGCTTGCCGGCGCGCACGCGTGCGTCGGTGTTGTTCAGCGCGAGCGTGATGAACTTCTCGCCCTGCGGCTGTTCGAGCACGACGTCGAGCAGCGGCAGCAGCCCGTGGTGCAGGCCTTCCTTGCGCAGCTGCGTCAGGCACGCGAGCGCGTGGCCCGACAGCAGCAGCTTCAGCATTTCGTCGAACAGGCGCGCGGCCGGCACGTTGTTGATCAGGTCGGCGAGCGCGTTGATCGGCTCGCGCGTATGCGGCTCGATCTCGAAACCGAGCTTGGCCGCGAAGCGCACGACGCGCAGCATCCGCACCGGATCCTCGCGGAAGCGCGTGGCCGGATCGCCGATCATCCGCAACAGGCGGGCGCGCACGTCGGCCATCCCGTCGTGGTAGTCGAGCACCGTCTGCGTCGACGGGTCGTAGTACATCGCGTTGATCGTGAAGTCGCGGCGCGCGGCGTCTTCGTGCTGCTCGCCCCACACGTTGTCGCGCAGCACGCGGCCGCTCGCGTCGACCGCGTGCGTGCGGCGATCGAGTTCGTCGCGCTTCAGGCGCTTCGGCGGCTCGGCGGCGGCGGCCTCGGGCGGCGCATCGACCAGCGCGCGGAACGTCGACACCTCGATCAGCTCCTGGCCGAACTGCACGTGCACGATCTGGAAGCGGCGGCCGATCAGGCGCGCGCGGCGGAACAGGCGCTGCACCTCGGTCGGCGTTGCATCGGTCGCGACGTCGAAGTCCTTCGGCGCGATGCCGAGCAGCAGGTCGCGTACCGCACCGCCGACGATGAAGGCACGGAAGCCCGCCTGCTGCAGCGTGTCGGTCACGCGCACGGCATTCTTCGAAATCAGCGCCGGGTTGATGCCGTGCACGCTGGCCGGCACGACGGTCGGTTCATGGTTGCTGCGCGGTTTCTTCGCGCCGCCGCCGCGGGCGCCCTTCGGGGCGCGCGGGGTAGCAGGGGCCGCTTCGTCGGCCGGGGCCGTTGCGGGAGAGGTTTGCTCGGTTTCGTCCTGGCCGAGCAGCTTGCGGATGAATTTTTTGATCAC
Protein sequences of DBSCAN-SWA_1 >NC_007510|744669:753761|750100_750916_-|WP_011351027.1|DBSCAN-SWA MTYLQESSRPAVTVPKLQAMRDAGEKIAMLTCYDASFSALLDRSGVDVLLIGDSLGNVLQGHTTTLPVSLDDIAYHTACVARAQPRALIMADLPFGTYGTPAEAFASSVKLMRAGAQMVKLEGGEWLAETIRFLVERAVPVCAHVGLTPQSVHAFGGFKVQGKTEAGAAQLLRDARAVEAAGAQVVLLEAVPTLIGSEVTHMLRVPTIGIGAGADCSGQVLVLHDMLGVFPGKRPRFVKDFMQGEPNIQAAVEAYVRAVKDGSFPGPEHSF >NC_007510|744669:753761|751644_752187_-|WP_011351029.1|DBSCAN-SWA MTVAYIGLGANLGDARQTLKDAVVCLAQQRTISILGKSSLYRTAPFEAGGDDYYNCVVKLDTTLSARELLALCQKIEHHFGRERPYRNAPRTLDLDILLFGDDSIDEPDLIVPHPRLTDRAFALVPLVEIEPALDIPARGRADAFLAAVADQRVEKVQTCQCLMQKALAASEDADKNRCR >NC_007510|744669:753761|752222_753761_-|WP_011351030.1|DBSCAN-SWA MIKKFIRKLLGQDETEQTSPATAPADEAAPATPRAPKGARGGGAKKPRSNHEPTVVPASVHGINPALISKNAVRVTDTLQQAGFRAFIVGGAVRDLLLGIAPKDFDVATDATPTEVQRLFRRARLIGRRFQIVHVQFGQELIEVSTFRALVDAPPEAAAAEPPKRLKRDELDRRTHAVDASGRVLRDNVWGEQHEDAARRDFTINAMYYDPSTQTVLDYHDGMADVRARLLRMIGDPATRFREDPVRMLRVVRFAAKLGFEIEPHTREPINALADLINNVPAARLFDEMLKLLLSGHALACLTQLRKEGLHHGLLPLLDVVLEQPQGEKFITLALNNTDARVRAGKPVSPGFLFATLLWHDMRQRFEQYTAEGEIPVPALHRAMDDVLDMQTEKLAIHKRYSADMREIWGLQLRLEKRSGRSAMRLLEHQRFRAGYDFLLLRCESGELDAEVGQWWTDFIEGDAAAREALLTQGGTKEKSPRKRRRRGGVRNRKPGEGAAEQAPDASDGSDD >NC_007510|744669:753761|746890_748027_+|WP_011351025.1|DBSCAN-SWA MAKRDYYEVLGVAKNAGDDEIKKAYRKLAMKYHPDRNPDNKDAEEHFKEVKEAYEMLSDGQKRAAYDQYGHAGVDPNMGGAGAQGFGGFADAFGDIFGDIFGQAAGGAARGGRGGPQVYRGADLRYSMEITLEQAAHGYDTQIRVPSWVSCEVCHGSGAKPGTKPETCPTCHGQGTVRMSQGFFSIQQTCPKCHGTGTYIPEPCVHCHGSGKVKETKTLEVKIPAGIDDGMRIRSAGNGEPGINGGPPGDLYVEIHIKPHSVFERDGDDLHCQMPIPFTTAALGGEIEVPTLAGRASFPVPEGTQSGKTFRLRGKGIKGLRSSIAGDLYVHVQVETPVKLTDNQRDLLKQFEKSLAEGGARHSPQSKSWFDRVKSFFE >NC_007510|744669:753761|750961_751648_-|WP_011351028.1|DBSCAN-SWA MTLTPLTVTAPDLRAPHRYLVIEGPIGVGKTTLARLLGERWSMQTLLERPQDNPFLERFYRDTARYALPVQLSFALQRAQQARELIGALETGRPVIADFMPQKNDIFARLNLPEDEWQLYRSVATHVDVPQAPSPDLVVYLQASPEVLFSRIQKRGLPMELQISDAYLRSLVDAYNEFFYHYDRTPVLTVAAEHLNPLDSPEDLALLVERIETMRGRKEFFVKGETTR >NC_007510|744669:753761|744669_746622_+|WP_011351024.1|DBSCAN-SWA MGKIIGIDLGTTNSCVAIMEGNQVKVIENSEGTRTTPSIIAYMDDNEVLVGAPAKRQSVTNPKNTLFAVKRLIGRRFEEKEVQKDIGLMPYTIVKADNGDAWVEAHGEKLAPPQVSAEVLRKMKKTAEDYLGEPVTEAVITVPAYFNDSQRQATKDAGRIAGLEVKRIINEPTAAALAFGLDKVEKGDRKIAVYDLGGGTFDVSIIEIADVDGEMQFEVLSTNGDTFLGGEDFDQRIIDYIIGEFKKEQGVDLSKDVLALQRLKEAAEKAKIELSSSQQTEINLPYITADASGPKHLNLKITRAKLEALVEDLVERTIEPCRIAIKDAGVKVSDIDDVILVGGQTRMPKVLEKVKEFFGKDPRRDVNPDEAVAVGAAIQGQVLSGDRKDVLLLDVTPLSLGIETLGGVMTKMISKNTTIPTKHAQVYSTADDNQGAVTIKVFQGEREMAAGNKLLGEFNLEGIPPAPRGVPQIEVTFDIDANGILHVGAKDKATGKENKITIKANSGLSEAEIDQMIKDAEANAAEDHKLRELADSRNQGDALVHSTKKALTEYGDKLDAGEKEAIEASLKSLEELLKDSSADKAAIDAKVEELGKVSQKLGEKMYADMQAQQAGAAGAAGAAEGAAHAGGAQQAADDVVDAEFKEVKKD >NC_007510|744669:753761|748040_749975_+|WP_011351026.1|DBSCAN-SWA MTEGNESASFALLDDCDSTALARSSRLYSGFVRERVCTDPARLDEVDAAVAQDLRDGLHAVVVGDYEFGRNLQRAQPGHAPLRFLLFARCERLSRDEVDAWLAQRDGGGTPSIAGVAHVAKSVSRDAFDVAIAAVHDALRAGDSYQVNYTYRLNFDVFGTPLALYRRLRARQPVRYGALIALPDGTWVVSCSPELFVEKYGDVLRARPMKGTAPRSADPRDDAAAATFLANDPKNRAENVMIVDLLRNDVSRIARTGTVRVPALFSVEPYASVWQMTSTVEAGWRDGTTFAQMLRALFPCGSITGAPKHKTMQLIDAIESTPRGLYTGAIGWLDAAKQGADSDAPGDRLAGCGDFCLSVAIRTLTLDAAGEGDDRGGATRADVEARQPATAIAGRRRGTMGVGAGIVLDSVAADEYAECELKARFLTDADPGFQLFETTAATRADGIRHLDRHLARLQRSADAFGFRFDTDALRREIDARCAALDGDGAYRMKLSLAKDGTIEIVAAPLKPLPAGPVGVLLASAHGFAPTRTSDALLLHKTTRRAEYDRAWQAAEALGGFDMLFVNERGEVTEGGRSNLFVKLDGQWVTPPLESGVLPGVMRGVLLDDRAFSATERVVTRDDLARAEALLLTNALRGALDAVLK |
7 | Hokovirus(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1099634 : 1107922
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_007510|1099634:1107922|DBSCAN-SWA TATGAAAATCACCATCATCGGCACCGGCTATGTCGGTCTCGTCACGGGCGCCTGCCTCGCAGAGATCGGTCACGACGTCTTCTGTCTCGACGTCGATCCGCGCAAGATCGACATCCTGAACAACGGCGGGATGCCGATTCACGAACCGGGGCTGCTGGACATCATCGCGCGCAACCGCGCGGCCGGGCGCCTGCGCTTCTCGACCGACATCGAGTCGAGCGTCGCGCACGGCGAGATCCAGTTCATCGCCGTCGGCACGCCGCCCGACGAGGACGGCTCGGCCGACCTGCAGTACGTGCTCGAAGCCGCGCGCAACATCGGCCGCCACATGACGGGCTTCAAGGTGATCGTCGACAAGTCGACGGTGCCGGTCGGCACTGCGCAGCGCGTGCGCGGCGTGGTCGACGAGGCGCTTGCCGCCCGCGGTCTGGCAGGCAGCGTCGCGCATCGCTTCTCGGTCGTGTCGAACCCGGAGTTCCTGAAGGAAGGCGCCGCGGTCGAAGACTTCATGCGTCCGGACCGGATTATCATCGGCGTCGACGACGACGAGACGGGTACGATCGCACGCGAGAAGATGAAGAAGCTTTACGCGCCGTTCAACCGCAACCACGAGCGCACGATCTACATGGACGTGCGTTCGGCCGAGTTCGCGAAATATGCGGCGAATGCGATGCTCGCAACGCGCATTTCGTTCATGAACGAGATGTCGAATCTCGCCGACAAGGTCGGCGCGGACATCGAGGCCGTGCGCCGCGGGATCGGCTCCGATCCGCGCATCGGCTATCACTTCCTGTACGCCGGCGTCGGCTACGGCGGCTCGTGCTTCCCGAAGGACGTCCAGGCGCTGATTCGCACCGCAGGCGAGAACGGCCAGCCGCTGCGTATCCTGGAAGCCGTCGAAGCGGCCAACCATGCGCAGAAGGACGTGCTGATCGGCAAGATCGAGCAGCGTTTCGGCGCCGACCTGACCGGCCGCGAGTTCGCGGTCTGGGGCCTGGCGTTCAAGCCGAACACCGACGACATGCGCGAGGCGCCGAGCCGTCGCCTGATCGCCGCGCTGCTCGAACGCGGCGCGACCGTGCGTGCGTACGATCCGGTCGCGGTCGACGAGGCGCAGCGCGTGTTTGCGCTCGATTTCGGCACCGATCCGGACACGCTGGCGCGGCTGCATCTCGTCGAGACGCAGGACATCGCCGTGACGGGTGCGGACGCGCTCGTGATCGTGACCGAGTGGAAGGAATTCCGGAGCCCCGACTTCACGCGCCTGAAGGCCGAACTGAAGGCGCCGGTGATCTTCGACGGGCGCAACCTCTACGAGCCGGATGCGATGGCCGAACTGGGCATCGACTACTACGCGATCGGCCGGCCGTATGTCGATCCCCAGTCGTCCACCCGTGGCTGACCACACGATGAATACTCTCCGCGAAGTCGTTCCGGTGCCGCGCGAACAGCTCGCGCGCTCGCGCGTGCTTGTCGTCGGCGACGTGATGCTCGACCGTTACTGGTTCGGCAACGTCGATCGCATTTCGCCTGAGGCGCCGGTGCCGGTCGTGCACGTGCAGCGTCAGGAGGAGCGCCTCGGCGGTGCAGCGAACGTCGCGCGCAATGCCGTGACGCTCGGCGGCCAGGCCGGGTTGCTGTGTGTCGTCGGTTGCGACGAACCCGGCGAGCGGATCGTCGAGCTGCTCGGCAGCAGCGGCGTGACGCCGCATCTCGAGCGCGACCCGGCGCTGCCGACCACGATCAAGCTGCGCGTGCTCGCGCGCCAGCAGCAATTGCTGCGCGTCGACTTCGAAGCCATGCCGACGCACGAGGTGCTGCTCGCGGGGCTCGCGCGCTTCGATGCGCTGCTGCCGCAGCACGACGTCGTGCTGATGTCGGATTACGCGAAAGGCGGTCTGACGCACGTCACGACGATGATCGAGAAGGCGCGTGCGGCCGGCAAGTCCGTCCTCGTCGACCCGAAGGGCGACGACTGGGCACGCTATCGCGGCGCGTCGCTGATCACGCCGAATCGCGCGGAACTGCGCGAGGTGGTCGGGCAGTGGAAGTCGGAAGACGATCTGCGCGCGCGCGTTGCGAAGCTGCGCGCGGAACTCGGCATCGACGCGCTGCTGCTCACGCGTTCGGAAGAAGGGATGACGCTGTTTTCCGCCACCGGCGAATTGCACGCACCGGCGCTCGCGCGCGAGGTGTTCGACGTGTCGGGCGCGGGCGATACCGTGATCGCGACGGTCGCGACGATGCTCGGCGCAGGCGTGCCGCTCGTCGATGCCGTCGTGCTCGCGAATCGCGCGGCAGGCATCGTGGTCGGCAAGCTCGGCACGGCCACGGTGGACTACGACGAACTGTTTCACTGAGCGCATGCGGTGGCGCGACGAGCGCGCCGCACGCATGGCTCGCACTTTTCAGGCAGGACGATCATGACCCTCATCGTCACCGGCGCAGCCGGTTTTATCGGCGCGAACATCGTCAAGGCGCTCAACGAGCGCGGCGAGACGCGCATCATCGCGGTCGACAACCTGACGCGCGCGGACAAGTTCCGGAACCTCGTCGATTGCGAGATCGACGACTATCTGGACAAGACGGAATTCGTCGAACGCTTCGCACGCGGCGATTTCGGCAAGGTACGCGCAGTGTTCCACGAAGGCGCCTGTTCGGACACGATGGAAACCGACGGCCGCTACATGATGGACAACAACTTCCGCTACAGCCGCGCGGTGCTCGACACCTGCCTCGCGCAGGGCACGCAGTTCCTGTACGCGTCGTCGGCGGCGATCTACGGCGGCTCGACGCGTTTTGTCGAAGAGCGTGACGTCGAGGCGCCGCTGAATGTCTACGGCTATTCGAAATTCCTGTTCGACCAGGTGATCCGTCGCGTGCTGCCGACGGCGAAGAGCCAGATCGCCGGCTTCCGCTATTTCAACGTGTACGGCCCGCGCGAAACGCACAAGGGGCGCATGGCGTCGGTCGCGTTCCACAACTTCAACCAGTTCCGCGCGGAAGGCAAGGTGAAGCTGTTCGGCGAGTACAACGGCTATGCACCGGGCGAGCAGACGCGCGACTTCGTGTCGGTCGAGGACGTGACGAAGGTGAACCTGTTCTTCTTCGATCATCCGGAGAAGTCGGGCATCTTCAACCTCGGCACGGGCCGTGCGCAGCCGTTCAACGATATCGCGTCGACGGTCGTGAATACGTTGCGTGCGCTCGACAACCAGCCGCCGCTGACGCTTGCGCAGCAGGTCGAGCAGGGGCTGATCGAATACGTGCCGTTCCCCGATGCATTGCGCGGCAAGTACCAGTGCTTCACGCAGGCCGACCAGACGAAGCTGCGCGCGGCTGGCTACGACGCACCGTTCCTGACCGTGCAGGAAGGCGTCGACCGCTACGTGCGTTGGCTATCCGGCCAGGTTTAAGCGGAAGCGTTGCATCGATAGACTGGGTCTCACGGTCGGCAGCCGCCGGCCGTATTCATCGGAGATCCAGCACATGATCAGGAAATGGTTGGCCGCAGCGGCAATGCTCGGCACGATCGCGTCGGCCTGGGCGGCCGTCGACGTCAACACCGCGAACGAGGACGCGCTCGTCGGCATCAAGGGCATCGGCCCGGCACGAGCGAAGGCGATTCTCGACGAACGCGGCGCACGCGGTCCGTTCAGGAATGCGGACGATCTCGCATCACGCGTGAAAGGCATGGGCGGCCATACCGTCGAGCGGCTTCAGCGGGAAGGGCTGACGATCGGCGCGGCCGGTACGAGTGCCGCGCTGCCGGCCGAGGGCAAGAAGCCGGCGAAGCCCGCGACTCCACCCGCACGCACCGTACAGAAGTAGCGCGCCACGCGCGGCCGGCAACGAGCTCCTTCAGTCGCCGTCGTTGCGACGGCTTCCCGCGGCATGCCGCGGGTTTTTTGCGTGGGCACGGCGCAGGGCGGCCACGCGTTGCATGCCACGCGTATTCCATCGTCGCAATGGGGTTGCCGGCGCGAGCAGCGGTGCTGGTTTACAATCGTTTGATTGATCGGCCTCGTACTCACGGTATATCCATGGCTTACAAAACTATCGAAGACACGATCGGCAATACGCCGCTCGTCCAACTGGTCCGCTTGCCGGACGACGAAATCCGCGCGCGCAACAACGTGGTGCTCGCGAAGCTCGAAGGCAACAATCCGGCCGGTTCGGTGAAGGATCGTCCGGCACTGTCGATGATCAGCAAGGCCGAGGCGCGCGGGCGCATCAAGCCGGGCGATACGCTGATCGAGGCGACCAGCGGCAACACGGGTATCGCACTCGCGATGGCAGCGGCGATCCGCGGCTACAAGATGGTGCTGATCATGCCGGAGGACCTGTCGGTCGAGCGCCGCCAGAGCATGGCCGCCTACGGTGCCGAGATCATCCTGACGCCGGTGAAGGGCGGGATGGAACTGGCACGCGATCTCGCGGACCAGATGCAGCGCGAAGGCAAGGGCGTGATCCTCGACCAGTTCGCGAACCCCGACAATCCGATTGCGCACTACGAAGCGACGGGCCCGGAAATCTGGCGCGATACCGAAGGGCGCATCACGCACTTCGTATCGGCCATGGGCACGACGGGCACGATCATGGGCACGTCGCACTATCTGAAGGAACAGAATCCGGCGATCGAGATCGTCGGTGCGCAGCCGGAAGACGGTTCGCGCATTCCGGGCATCCGCAAGTGGCCGGAAGCGTACATGCCGACCATCTTCGATCGCAGCCGCGTCGACCGCGTCGAGAACGTGAGCCAGGCCGCGGCGGAAACGATGGCGCGCCGGCTCGCGTCGGTCGAAGGCATTTTCGCGGGCATTTCGTCGGGCGGCGCATGTGAAGTCGCGATGCGCATCGCGCGTCAGGTCGAGAATGCGACGATCGTGTTCATCGTCTGCGATCGCGGCGACCGCTACCTGTCCACGGGCGTGTTTCCTGCCTGAGCGCGGGCAGGCGCGCGTCGTGCGCGTGAGATAAAAAAAGCGCCGCATCTGCGGCGCTTTTTCTTTGGGCGGTCGGTGCTTACTGCGACTCGGTGTCGGCAGTGGCCGGCTGGAGTGCGCCGCTCGCTTCCATCTGTGCCTTCACGGCCTCGCCAAGCTGGTACACGGTGAGCGCGTAGAAGAAGCTGCGGTTGTAGCGAGTCAGCACATAGAAATTCTTGAGACCGAGCATGTATTCGGTCGCACGTCCCGGCGACGGCAGGTCGACCACCGTCACGGGCGTGCCGGCTTCAGTGGTGATGTTGACCGCCGGCTCGTTCAGCGTCATGCCCGCGCGCAGCAGTTGCGACAGCGCCCAGTGCGGTTCGGGCTGGCCGTCGGCGGCGGCCTGGGCGATCCCCAGGCTGCCCGTATCGGGCGTGATCTGCCAGACCACCGGGCGGTCGGTTTCCCAGCCGTGCTGCTTCAGGTAGTTCGCGACGCTCCCGATCGCATCGACAGGACTGCTGCGCAGGTCGACATGGCCCGTGCCGTCGAAGTCGACCGCATATTCGCGGATGCTGCTCGGCAGGAACTGCGGAATCCCGATCGCGCCCGTATACGAGCCGAGCACGGTGGTCGGGTCGAGCTGGTTGTCGCGCGTCCAGACCAGGAAGTCCTCGAGGTTCTTGCGGAACGTCGCCTGGCGGCTGTCGCGATTCGGCGTGTCCGGATAGTCGAACGAGAGCGTCGTCAGCGCGTCGAGCACGCGGAAATTGCCCATGTAGCGGCCGTAGATCGTCTCGACGCCGATGATGCCGACGATCACTTCGGGCGGTACGCCGAACTGTTCGGATGCGCGCTGCAGCGTGGCCTGGTTCGCCTTCCAGAATTTCACGCCCGCGTTGATACGGATCGGTTCGATGAAGCGCGAGCGATAGACGCGCCAGTTCTTGACCGTCGGCGACGCAGCCGGCTTCACGAGCTTGACGGCCGTCGCCGAATAGCTGATGCGCGAGAACAGCGCATGCAGGCTGGTGGAGTCGAAGCCGTTGCGGCTCACCATCTCGTCGATGAACGCGTCGACCTTCGCGTTGTTCGCGTAACGCTGCGGAACAATTTCCTCTTCGAAGGTCTGGCCTTGCGGCGTCGGCTGCTGCGGCTGGGCCTGTGCGACGAGCTTGCCCGCGGACTTCGCGGGTTGCGTCTGCGCGCCGGCAGGCGCGGTGCCGAGGGCCGCGACCACCGCGGCCGCAACGAGCGGCACGCGAACACGGAACAGCGCGGAGAGAAGGGCGGCAGGCTTGCTGGAATTCATGTCGAAGCGGGCGGACAAGGCGATTGGGTACGGCGCAGTATACCCGACGGATCCGGCGCACCGGGGCGTGGCAGGCCCCCGTTGTGGTAAGTTAGCGGCGAATTCGCGGCAGACGATGGACATCATTGATGGAGACCTGCGGCGCACCGTGCGCCGTGGATGACAGCTTATGGCAACCGCTTTCTATACGCACCCCGACTGCATGCTGCACGAGATGGGGGAATGGCATCCGGAATGCCCGGCTCGCCTGTCGGCGATCCAGGATCAACTGATCGCGAGCCGCATCGACGACCTGATCGTGCACGAAACCGCGCCGTTCGCGAGCGAGGTGGCGCTGGGTCGGGTGCATACGCAGGCGCACATCGACTACATCCGGAGCATGACGCCGGTCGACGGCTACGTCGAGATCGATCCCGATACGCTGATGAACCGCGATACGTGGCGCGCTGCGCTGCGTGCGGCCGGTGCCGCGATTGCTGCGACCGACGCGGTGATCGAAGGCCGCTATGCGAATGCATTCTGCGGCGTGCGCCCGCCCGGTCACCATGCGGAGCCTGCGCGCGCGATGGGTTTCTGCTTCTTCAACAACGTCGCGATCGCCGCGCGACATGCGCTCGACGTACACGGTCTCGAGCGAGTCGCGATCATCGATTTCGACGTGCATCACGGCAACGGCACCGAGGCCGCATTCGCGAATGACGAGCGCGTGCTGATGTGCAGCTTCTTCCAGCATCCGCTGTACCCGTTCTCGGGCGTCGATCACCAGGCGCCGAACATGGTCAACCTGCCGATGCCGGCGCGCAGCAACGGGATGGCGATCCGGGAAGCCGTCGACATGTTCTGGCTGCCGCGGCTCGATGCGTTCAAGCCGCAGATGCTGTTCGTGTCGGCGGGTTTCGATGCGCATCGCGAGGACGACATCGGCAACCTCGGCCTCGTCGAAGCCGACTTCGAATGGCTGACCGCGCAGATCGTCGACGTGGCGCGCCGGCATGCGCAGGGCCGCATCGTGAGCTGCCTCGAAGGCGGCTACAACCTGTCCGCACTCGGGCGCAGCGTCGTGGCCCACCTGCGCGTGCTGGCCGGCATCTGATTACCGATTGACCGGGGCCGGGCCGGGCACGCTCAGCGCGCCGGCGCGCGTGCGTGAATCCACGCGATCAGCGCGTCGATCACGCGGTCGCGTTCGAGATCGTTCATCGTTTCGTGAAAGCCGCCTTCGTACAGCGTCAGCGTGCGATCGGGCGAGCCGACGCGCGCGCCGAACGCCCGGCTGCCGTCGGGCTCGGTCAGCTTGTCTTCGGTGCCGTGATAGACGAGCACCGGAACGCGCAACCCACCACGGCCGCTTTCGATGCGTGCCATCGCGTCGAGAATCTCCGCACCGGTGCGTGCGGGCACCGCGCCGTGATGCACGAGCGGATCGGCGCGATTGGCCGCGACGATGGCCGGATCGCGCGACAGCAGCGCCGCATCGATCTTGATGGCAGGGAAGGTCGGCCATACGCGGCTGATGACGCGGCTCACCGCGAGCATCCAGCGCGGCACGTCGCGCCCCGGCGCGAGCGCCGGGCTCGACAGCACGAGGCCCGTCAATGCGTGGCCACGTGTCGGCGCGCGCTCGATTGCGTACAGCGCCGCGACCGCGCCACCCATGCTGTGCCCCATCAGGAACAGCGGCGAATTGCCGCGTGCGGCTTCCGCGACCAGCGCATCCGCATCGTTCAGGTAGCCGTCGAAACGCTCGACCCAGGCGCGCTTGCCGGGTGACTGGCCATGGCCGCGCAGGTCGATCGCGAGCACGTCGATGCCGGCCGCATTCAACCGGCCGGCGAGTGCGGCATAGCGGCCCGCGTGTTCGGCGAGGCCGTGCACGAGCGCGATCGTCGCGCGCGGTGGCGCGGTGCCGCCTCCGGCGGGCCACCGGTACGACGCCAGTTCGAGCCCGTCCACGGTACGGAGCCGGCCCATGGTCGGCGCGGGCGACGACGAAGGTGCCGGTCCGGCGGTGGCGGGTGCGGCGGCCGGCGTAGTGGTGGCGGTCAT
Protein sequences of DBSCAN-SWA_2 >NC_007510|1099634:1107922|1101042_1101993_+|WP_011351339.1|DBSCAN-SWA MNTLREVVPVPREQLARSRVLVVGDVMLDRYWFGNVDRISPEAPVPVVHVQRQEERLGGAANVARNAVTLGGQAGLLCVVGCDEPGERIVELLGSSGVTPHLERDPALPTTIKLRVLARQQQLLRVDFEAMPTHEVLLAGLARFDALLPQHDVVLMSDYAKGGLTHVTTMIEKARAAGKSVLVDPKGDDWARYRGASLITPNRAELREVVGQWKSEDDLRARVAKLRAELGIDALLLTRSEEGMTLFSATGELHAPALAREVFDVSGAGDTVIATVATMLGAGVPLVDAVVLANRAAGIVVGKLGTATVDYDELFH >NC_007510|1099634:1107922|1107001_1107922_-|WP_011351345.1|DBSCAN-SWA MTATTTPAAAPATAGPAPSSSPAPTMGRLRTVDGLELASYRWPAGGGTAPPRATIALVHGLAEHAGRYAALAGRLNAAGIDVLAIDLRGHGQSPGKRAWVERFDGYLNDADALVAEAARGNSPLFLMGHSMGGAVAALYAIERAPTRGHALTGLVLSSPALAPGRDVPRWMLAVSRVISRVWPTFPAIKIDAALLSRDPAIVAANRADPLVHHGAVPARTGAEILDAMARIESGRGGLRVPVLVYHGTEDKLTEPDGSRAFGARVGSPDRTLTLYEGGFHETMNDLERDRVIDALIAWIHARAPAR >NC_007510|1099634:1107922|1103676_1104579_+|WP_011351342.1|DBSCAN-SWA MAYKTIEDTIGNTPLVQLVRLPDDEIRARNNVVLAKLEGNNPAGSVKDRPALSMISKAEARGRIKPGDTLIEATSGNTGIALAMAAAIRGYKMVLIMPEDLSVERRQSMAAYGAEIILTPVKGGMELARDLADQMQREGKGVILDQFANPDNPIAHYEATGPEIWRDTEGRITHFVSAMGTTGTIMGTSHYLKEQNPAIEIVGAQPEDGSRIPGIRKWPEAYMPTIFDRSRVDRVENVSQAAAETMARRLASVEGIFAGISSGGACEVAMRIARQVENATIVFIVCDRGDRYLSTGVFPA >NC_007510|1099634:1107922|1099634_1101035_+|WP_011351338.1|DBSCAN-SWA MKITIIGTGYVGLVTGACLAEIGHDVFCLDVDPRKIDILNNGGMPIHEPGLLDIIARNRAAGRLRFSTDIESSVAHGEIQFIAVGTPPDEDGSADLQYVLEAARNIGRHMTGFKVIVDKSTVPVGTAQRVRGVVDEALAARGLAGSVAHRFSVVSNPEFLKEGAAVEDFMRPDRIIIGVDDDETGTIAREKMKKLYAPFNRNHERTIYMDVRSAEFAKYAANAMLATRISFMNEMSNLADKVGADIEAVRRGIGSDPRIGYHFLYAGVGYGGSCFPKDVQALIRTAGENGQPLRILEAVEAANHAQKDVLIGKIEQRFGADLTGREFAVWGLAFKPNTDDMREAPSRRLIAALLERGATVRAYDPVAVDEAQRVFALDFGTDPDTLARLHLVETQDIAVTGADALVIVTEWKEFRSPDFTRLKAELKAPVIFDGRNLYEPDAMAELGIDYYAIGRPYVDPQSSTRG >NC_007510|1099634:1107922|1106045_1106969_+|WP_011351344.1|DBSCAN-SWA MATAFYTHPDCMLHEMGEWHPECPARLSAIQDQLIASRIDDLIVHETAPFASEVALGRVHTQAHIDYIRSMTPVDGYVEIDPDTLMNRDTWRAALRAAGAAIAATDAVIEGRYANAFCGVRPPGHHAEPARAMGFCFFNNVAIAARHALDVHGLERVAIIDFDVHHGNGTEAAFANDERVLMCSFFQHPLYPFSGVDHQAPNMVNLPMPARSNGMAIREAVDMFWLPRLDAFKPQMLFVSAGFDAHREDDIGNLGLVEADFEWLTAQIVDVARRHAQGRIVSCLEGGYNLSALGRSVVAHLRVLAGI >NC_007510|1099634:1107922|1103122_1103464_+|WP_011351341.1|DBSCAN-SWA MIRKWLAAAAMLGTIASAWAAVDVNTANEDALVGIKGIGPARAKAILDERGARGPFRNADDLASRVKGMGGHTVERLQREGLTIGAAGTSAALPAEGKKPAKPATPPARTVQK >NC_007510|1099634:1107922|1102056_1103049_+|WP_011351340.1|DBSCAN-SWA MTLIVTGAAGFIGANIVKALNERGETRIIAVDNLTRADKFRNLVDCEIDDYLDKTEFVERFARGDFGKVRAVFHEGACSDTMETDGRYMMDNNFRYSRAVLDTCLAQGTQFLYASSAAIYGGSTRFVEERDVEAPLNVYGYSKFLFDQVIRRVLPTAKSQIAGFRYFNVYGPRETHKGRMASVAFHNFNQFRAEGKVKLFGEYNGYAPGEQTRDFVSVEDVTKVNLFFFDHPEKSGIFNLGTGRAQPFNDIASTVVNTLRALDNQPPLTLAQQVEQGLIEYVPFPDALRGKYQCFTQADQTKLRAAGYDAPFLTVQEGVDRYVRWLSGQV >NC_007510|1099634:1107922|1104658_1106002_-|WP_081436585.1|DBSCAN-SWA MMSIVCREFAANLPQRGPATPRCAGSVGYTAPYPIALSARFDMNSSKPAALLSALFRVRVPLVAAAVVAALGTAPAGAQTQPAKSAGKLVAQAQPQQPTPQGQTFEEEIVPQRYANNAKVDAFIDEMVSRNGFDSTSLHALFSRISYSATAVKLVKPAASPTVKNWRVYRSRFIEPIRINAGVKFWKANQATLQRASEQFGVPPEVIVGIIGVETIYGRYMGNFRVLDALTTLSFDYPDTPNRDSRQATFRKNLEDFLVWTRDNQLDPTTVLGSYTGAIGIPQFLPSSIREYAVDFDGTGHVDLRSSPVDAIGSVANYLKQHGWETDRPVVWQITPDTGSLGIAQAAADGQPEPHWALSQLLRAGMTLNEPAVNITTEAGTPVTVVDLPSPGRATEYMLGLKNFYVLTRYNRSFFYALTVYQLGEAVKAQMEASGALQPATADTESQ |
8 | Bacillus_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
2004235 : 2059218
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_007510|2004235:2059218|DBSCAN-SWA CATGCCGAATCTCGACGATCTGCCCGACGATGTCGCTGCATTGAAGGCCATGCTGGCTGAGGCTCGCGCGTCGGCAATCGAAAGAGAACTCGAAATCGAGCAACTGCGCCGCGAGATTGCCGAGAGCGATCTTGAGATTGCACGGCTGAAACTCCTGATCGACAAGCTCAAGCGAATGCAGTTCGGCCGCAAGTCGGAGCAGCTCGCCCGAGAGATCGAGAGGCTTGAATTGCGGCTCGAGGATCTGAGCGCCGGGAGCAGCGTCGCCGACGTGCAGCATGCGAAGGTCCGGCGAGAAAAGCCAGCGACAGGTGGCGAGTCGTCGGCCCGGGAGCCGCTGCCGCCACATCTGCCGCGTGAAGACCGCGTGCTCAAACCGGATTCGATCTGCCCCAAGTGCGATGGCACCATGCAGAGCCTCGGCGAGGACGTCTCCGAACAGCTCGCCCGCGTCGCGGCGATGTTCAAGGTGATTCGCACGATCCGGCACAAGATGGTCTGTCCCAGCTGTGGCCACATCGAGCAGCCGTCGATGCCGGGGTTGCCGATCGAGCGTAGCATCGCCCATCCGAGTCTGCTTGCCGACATCCTGGTATCGAAGTACGCAGATCATGCGCCGCTGTACCGCCAGTCGGAGATCGCCGCGCGCGACGGTGTGACGCTTGATCGCGCCAGCATGGGCCGCTGGGTCGGGCAATGCGAGGCGCTCTGCCGCCCGCTGACCGACGCACTGCGCCGGTACACGATGGCGGGCACGAAGCTGCACGCGGACGACACGCCGATCCCCGTGCTTGCGCCGGGCAACAAGAAAACGAAGACCGGACGACTCTGGGTGTACGTGCGCGACGACAGCCGTTCGGGCTCGACGGAGCCGGCCGCGGTGTGGTTCGCGTACTCGCCCGATCGCAAAGGCATCCATCCCCAGACCCATCTCGCTGGATTCGAAGGCATCCTGCAAGCCGATGCATATGGCGGCTTCGACGAGCTGTACGTGAACGGCAAAATCTGCGAGGCTGCGTGTTGGGACCACGCGCGGAGAAAATACTACGAGGTCCACGCTAGCACGCCGACGGACGAAACCAAGAGCTTGCTCGAAATGATCGGCGAGCTCTACAGCATCGAAGCCGACATCCGTGGCAAGCCGCCCGATGAACGAAAGCGCGTGCGGCATGAGAAAAGCAAGCCACTGCTGGAGGCCTTCGAAGCGAGGATTCGGGGCAAGCTCGCAACGCTGTCGCGCAAGTCGGAGCTGGCCGGCGCGATCCAGTACTCGTTGAATCACTGGAATGCTCTGACGTTGTTCTGCGAGGCCGGGCAAGCCGAGATCAGCAACGCGCTGGCCGAGAACGCTCTGCGCTGCGTGAGCCTTGGGAGGAAAAACTTCCTGTTTGCCGGCTCCGACAGCGGAGGCGAGCGGGCCGCCGCAATGTACAGTCTGCTTGGGACATGCAAGCTCAGCGGTATCAACCCGCGCGCCTATCTGGAATACGTCCTGACCTACATTGCTGATCACGTCGCCAATCGCGTTGACGAACTGTTGCCCTGGAACGTGGCTGACAAACTGAGGCTAGCCACTCCGCCCAAGGCGAACATCTGATGCATAAGTCAAGACCTGAGCTCCCCGTCAGCGTCAGATCAGGTCTGAAATTACTTCGCCGTCACTTCATCAAGGGTCGTGATCGACCCTTCAGAGACATCCGCCCGAATTCGGCAGGGCGACCGCTTCAGAGCTTGTATCGGACACTGACGACGAAATGTTGGCGCAAATCGAGGCAAAGCGCGACTCAATCGGCCAGGGGCGACGGAGTATTCATCTGTCCGGCTCCGGCGTGTGGTTACACGAAGTGAAGAGGCGCACCTAAAGTGCGCTTCTCTTTTGTCCCCCGACTCAGCGATCGGCTTGCAGCTGTATGAGGTACTTGTCGAGCGCGTCCATTGTCCCGCCCCAACCTTGAGACATGCTGGCGTGCGACGCGTCGAAGGTGGCTTCCTCTGCTTCACCGGCGTCGAGCGCGCGCCACTCAAGGTGCATTAATGTGCCACCCTCGACTTCGCTGAGCGTCGTCGTCGAAAGCGTATAAAGCGGCCATTGCGCGGACATCGGATGTCGCGTCACTCCGCCTTCGGCATCACTAAACTGCACGACGACAACAATACGCGTAGGGGGCGTCAATTCGCGGAACGTCCACTTGCCCCACATAGCGGGAGCGTCGGGCACGCCGCGCGGTTTCATCGCGTAGTGAAAGACGCCACCGACACGCGCATCCACTGTGCAGTGAGACATTTCCATCCCGGGTGGACTCATCCACGTACTCAAGTGCTCAGCTTCGGTCAACGCGGCGTATACGAGTTCACGCGGCGCAGCGAAGCGGCGCTCGATGACGAAAGGCGCGGAAAGTTTGTCAGTCATGGGGGCGAGTCTTCCTGGTTCTGGTTGAGGAAACTTTAGGTGTGGCTCTCGGCGCAGCGTTTACGTTGGAGTCCTTGACAGGATGCCGCTGCAGTTGCTGCAAGTAACTGTCGAGCTGGTCGAGACGTTGTTCCCACAGGGCGCGATACTGTAGTGTCCAGTCGGCGGCCTGCTTGAGTGGCCCAGGCTCGATACGACACGGGCGAAACTGGCGGTCGCGCCCTTGAGTGATCAGGCCGGCCCTCGTGAGCACGCGCAAGTGCTTGCTGATGGCCGGCGCACTCAAATCGAAAGGGCCTGCCAAATCGCCAACCGTGGCTTCGCCTTGCGCCAGCCTCGCGAGGATGGCTCGTCGCGTAGGGTCAGCCAGCGCGGCAAATGTGACGGTGAGAGGGTCCGCTTCCAATTATTTTCCCAATTGGTTAATTAGCTGATTGGTTAATCTTGATTCAGCCGAATCCGCTTGTCAAGCCGTCCATGCAGGGGCAGCCCGATCCGGATGACTCTGGTTGGTTGCACGGCCGACATTGGAGGGTCTCGACGCGAGCGCCGACCAGCGGCGACTCGTGGCCGAACTGAGTCAGACGCTCTGACGCTGTCTGTTGGGTCTCGTCCAGCCTTATACATCCGATTGCCAGGCAAATCTGGATAGATCGTGCCCATGATCAGCGATCGTGGACACGATCTAACGTGATCCACACGGAATTCTCCAAGCTCTCACATCATGCTCCACGCGCGCGTGAATCGGGGACTGGCCGCGCCGAGCCGCTTACCGTCGTCGGCGAACCGACCATTAATCTCTTCGGTACGACCTTTGCGGTGGTGGAATTCGATTGGGATTTCATGGCGGTCAGGCGGTCGAACGGGGGCAAGCTTCATTCGACAGGACGCGAAAGCCATGTATACATTCATGTCGACGATCACGGCTGGCGGCTGTTACACGTGCATTACTCGGAACCAACTGTCGCGGGGGGAGGGGGCGAGCGAAGGGTGGCCCTGAACCAGAATTAGTCCCGGTCGGACGTGGAGATTTCCGGCTTGTGGCCGTGAACCTAAGTTAGGAAACTCCCTAGATATCGGGTTACGGCCATCTAGGAGCGTGACTCGAGTGGATGCGCGTCTCGTTAACGTAAGCAAGCTCTTTATTCGAGTAGAGTCGGTGATGTCGCGCCGACGGTGATAGAGCTGCTCATCGCGGCGTTCTTCGTCTGTGAGTTGGCGGAGTCACGAGTCCGCAATCACTATGTACGCTAGTGTGACAAAGATTGGCCTTGCACACAAAACACTACCGCTTTAGAGACCAGAGCTTTTCAGAAGCTTCGATGGCGAAATATTCAGAGCGTCAGCGATTCGGATGAGGTTGAGGATGGTCAAATTGCGCTGGCCTCTTTCGATGCGGCCCATGTGACTTCGATCGATTCCGGTGAGAAGGGCAAGCGCTTCCTGCGAGACGCCAAACTCCAGCCGACGAGCACGGACCGTCGCGCCAATCGCGGCCAGATGTCCTGCGTCGTCCCGTTTGCCAGATACCCTTTGCATCCGGTGATGGTCGCGGTATGATGCTCGTTTGGCCACGGCATTTACGACCCATTTTCTCGGGCGCGGCCAGTTTCTCTAGCTGCAAGGCATATGTTTTTGATTCAGGATGGGAACCGTCTACCGCTACAGTCGGCGATGGCCGGTGCAGCTGTGTGGGCTGTTATGGAACAGTGGACACCTGGCGACTTACTGGTTCTACGGGTGTCTGAAAAGCGAGCACGCGGGATTCCCTGCGTGATTTGGACGGACCTCACATCGGGAGCGGCATCGCTGAAGCAGTTTCTCGACGGTAGTAACGTGCTGCGAGGGGCGGCGGTGGTCAGCACGCGCCGTGCGCGTCAGGTAGATGACTGGTATTTGGATCCACTGAGCGAAATCCGAGTCGGGGCGACCGAGCTCAGCGACGACGATCGCACACTGCTGACGGTGACGCAGTTCACAACGGTGACGGGGCGAAAGTTTTCCGTACCGAGTGGTTTCGCGACGCGCTACGCCCGCGGAAGGCGGGTATGGCAGGCGCATCGGTCGCGCTCGGACTCAGCATAGTGACCGAGCGGGGGGCTTTCCGATGGACAGGCCATGTGTGCGGCTTTGGCTCTCGTAATGAGACGGGAGGAAGTCCAAGTGTTCTGTGCTGAATATGCCCAGGGTGAGCGACTATGACGGATACACGCTTCCAAAACGGACCGACCTGGTCGGTACCTTCCAGCGGCCGTAGCAGTTGGAAAAATCGTTTCTTCGTCGACACCGAGTTCACGGACTTCGAGGCGCCTCAACTCATTAGTTTGGCCATCGTTGGGGAAAACGGGGCTGAATTCTACGGCGAATGCACGGACTACGATGCAACGCGCTGCAGTGACTTCGTTCGGGCCGTGGTGCTGCCGCAACTGGGGCGACTCAAGGGCCGTGCGATGCCATTTGATCAACTGAGTCAAGCGCTACAGGCTTGGGGCGAGAGCATCCCGACGACCTCGAAGCCGGTGCTTTGCTACGACCTTCAAACAGACTTAGATCTGCTGCGATCCCTGCTTGGTGGCTCGTTCCCCGAGGGATGGGCGCGTGAGGATGTCCGTGGCCAAATCGATCTTCGGCGGCTCGCCGAATACTTCGCGCGACACGGCGGCGAGCACCACGCGTTGCACGATGCTCGCGCAAACGCATACGCCTATATTGGTTGAACGACCCGATCGAAGCCGTTGCATGTTGGCCTGTATTTCATCCAATCGTGCCGTCTCTTTTGTCTCAGACGTTTTGCCCAGTTATCGGGTTCCACGGGACGTGTCTCGATGTTGTGATCGGCGGGGCTGGCGCTGGAAGGCTGCACCCACCTGACGCGCGCGCTTAGCCTCTTCCCCATGGAGATAGGTCGACGTGGTCGCCACCGAGGCGTGACGCAGGTTGTCTCGCACGATAACCAGCTCCACGCCGGCGTCTAGTGCATGGGTGGCATGTGTATGACGTAGCCAGTGGGGACTGGCATGCAGTAGCTTCTCGGCAAGCGGGGGATTGCGGGTGCTCACGCGCTCGGCCGTGTCCCGAAAACACTCACCTAACACCTGCCGCAACCGCGCTGCACTGATGCCGGCGTGGCGTTTCCCGCGACGCTCTGAATTGAGTGCGGTGACCAACGGTGTCGCCGGATGCCAGCGAGTGCGCATGACCGGCAGACCCCGCGCCTTCAGTGCGCGCCGCAGCGCGTCTCGCGCGAGGGGCGGCAGGGCCACGCGCCCCGATTTCGCCCCTTTTCCTGTTACCTGCAGCCATAACGCGCCGCGGGCGTCCGCCGTGATGTCGCCCAGCGTCGCCGCCACTAGCTCCTGTGCCCGCAGGCCCGTCGCATAGCCAAAATCCAGTACAAAGCGCAGCCGCTCTGCGGCCGGTGCGGTCCAGCCGGTCCGCTCGAGCTGGTCGGCTGCAGCCCGCACGATCTTCCACTCTCGACCATTCAACGCCCGCCCGGTATCAAACGGTGCACGGGCTGCGCCGCCACGCACTGTCACGCCTGCAAACGGATTGAGCACGACGTAATGCTGGTCAACGAGCCACCCGAAGAGGGCGCGCAGGACGGCCAGCGCATACGCGGTTGACCGCGCCGTCAGGTGTCCCGTGAACGGACGCCACGCGGGCGTCGTGCGCGGGCGCGGTGGACCGATCCAGCGGGCGGCGGGCGTCGGGTGGCGCAGGAAAGCCCGATAGGCGATCGCGTCGTCGGTGGTGAGCGACGAGAGGGGCTGGCGGCGCTCGACAATGGCCCACAACAACAGCCGTTCGGCCTCGCGCCGGTAGGCTCGCACCGTATGGGTGCCCTCGTGGAGTGCCAGCCATGCCTCGACCGCCTGGTGGTCGTTGGCGGCGGCGAGGCCGCAGGTGTCGTGCGGGGCCCGGAAGCGGCCCCGCGAACCGTCGAGGTCGGCGGGCAGCGCATCCAGGGACCATCGCGGGGCCAACTGAGCGGTCGCTGCCTCGTAAGGAACCAGCGTGGCGGCGCGCCGGCTCAGGTCCGGGTGTTGGGAGAGGAAGGCTTCGACGCGGTGTGCACCCTGGCCCCCCAGCCCCGGAATCGAACGCCACCAGCCCGGGCGGTGGACCCTCCGCACCAGCAGGTCGCCGAACGAGTCGATGCCGGCGACGCCGAGTGCGCGTACGATGCGCGGCGGCAGCCAGCGGGCCACCGGGTCGGTCGGCTGTGGCACCGGGATGGGGGCAGTCCGCAGGACGTCGAGCGCGGTGATGACCTGCGCCGCATGACGGGTACGGTCGGCGGCTGGGTGGGTGAACAACGGGATCAAGTCGTCGCGCTGGCGGGCGATCGCGCACGCGACGAGTTGGCGGCGGATCTGCCCCAGGAGCCCGCGTGACGACTGGCCGTCGGCCCTGCGGTGCGGCAGATAGCGCTCGACCGCCGCGCGGGCCGTGAGCCCGGCGTGCCAGGCGCGCAGGGCGGCCAACGAGGCGAGATTGGGCAGGGCGGTCGCTGGGAGATTTCCGTCGTCCGACCCTGCGGAGGAGGTCGATGCAGTACCCATGCAACCAGTTTAGGTCGCTGGGAAAGTACTCGCGATAAGACTATTTATCGCAAATTACCTATTTGCACATCCGGTCGTGAGGTGACATTTCACGACAGCAAGATGTGGATAGCAAGTCGAGCACGGCCGCAGCCACAGACGATTGCGCAAATAGGTAAATCAGCTCGCCCAGACTCTAGCGAGGGTAGGCGACGCAATACCTAGCGAATCACCCGTGTCGACAAGCAGATGTTTGCCAGATTGGATTCCTGATCGTTCCTTCTGCGGTGGCCCGCTCTGAATCCAATGTGCTGGTCAAGCTGGCACATCCTGACGCCGCGCGAATCTTCATCACGGATCCACAACTTGTCGTCTGGGATGAACGACTGCTCACGGTCAACGTTTCTCCTGTCTGATTGTCGACCGGCGATTTTTTGATTTTTCCGTCGGCAAGGAATCGACAGTCGCGGAAGGTACAGCACCGATACCAACGAGCTTATCCCGAGCGTCCCGGTACGCTTCATAGAATTTTGCCCACTTGCTGAAGCCGCCTAGTTCGCCGACAGCGCGCAACATAGCGAGGTGTGACGCGGTAAGCAACTGCACGGTAGCTTCGAGCTCGGCGATCCTGATATCCTTGTCCGCAAGCGACGCGGCGGTGTCAGTTCCAGACCGCTTCGCGACGCGGCCACGCCACCGCCGGTATTCGGACTGGCGCTGCTGATACTCGGCAAGTAAGCGACTGCGAGACTCACTGCGGGTAATCGACGAGGCCGCTTTGATCGATGGATGAAGCCGTGCAACAGCACGCGCCGTGATGTTTTCGTCGTCGGCCAGCAGCACATCAAGAATCTCGCGCATTGGCTCGTCGCCAGCTGATACCATTTCGTTCTTGTCAGTCATCAAGTACTCCACGTGGCGGCGGCAGTGCGAGATCAAGGCCATCGGGGAACGGATGGTCGCCTGGCTGCGTTGCGAGCAGTTTCTGAACGCCCGCAAGCCGCTTTTCCGCATGTTCGAGCTGGTTCTTCCAGCCGATGCTCGTTGAAGGCCGAACTTTAATCGTCTCCAGCGCCAGCGAGAGCTTGCCTTCGAGCCGGATGAGATTCTGCCGGTGCTCAGGGAGATCGGTCGCCGAGAGGTGACGGCAGCCGGCGAAGCACTCGAGATGCTTCGGGCAAGGGTCGACGGTGAACGAGTTCAGGCAATGCCCGTACGGAGTGGCATGGAATCCGTCGGCTTCGGCACGCAGGTATTCGTATGCAGCTGCGTCGCCATCTGCGGCCTGGATGCGACGGAATGCATCGACGATTGGACCGCTAGCTTTGCCGCCCTTGATGAGCCTAGCAACGGTGGTCGCTTTCGCGCCGAGCATGATCTCGATATCCTGCGGAATCTCGATCTGCTCGAGTTCCTCGGCCAAGCTGCGATGATCGTACTCGTAGCTTTGCGCGACACTGCGCCGGTTGAAGCGCTTCGAGATGATCGTGTCGGCCACGCCCAGCCGGAACAGCTCTGTATTCTGGAGATGGCGTAGCATGTGAGACTCCATGCGCAACGCCCGGTCGTCATCGGTCAGCCCATATTTCTCGAACAACGTGACACACGTTGGATGCTCTCCCAGCGCCAAACCGATAAGATCATGCGCAGGCCGTCGCACGGACATATATCGGCTAATGTCACACAGCAATTCGTTGCGCTCCTCAGCAAGCGAGCGCTTGGGCAGCGTGAACAGAAATTCCCAAGGTTGAATCACGCCGCTTTCCAGTGGCAACGGCGCAGTATCAGATGCCTTCGTCGGCGTCGCGAGCCGGATGTGCTCCTCGAGTTCGCCGACGCGCAGATACGCCGCATGCCATTCCATTCGTTCGAGCCGCTCAATTGGCGAACCGTCGCCGTGGCGGAAGCATAGACTGGTCTCGCCCTCAAGCATCGCTGTCTTCAGGCGATTCCCGAATTTGTAGACGGCCATGTCGAGCCGCAGGTCGCCACTGCGTCGCCGCTCATGCTGGTATTCGTGTAGTTCAGCAAGGACGGCGGGATCGAAGCCATTCCGATAGCGTTCGATGAACGCTTCCCGTTCAATTTTCAGCCAGAACGGATTGCCCATGAGGCGTGTATAGACTTCGATGACGGGCACGAGTCCGTCAACTGGATACCATGGCAGCAACCGACCCGTTTCGCACTGCAGCTTCAACGTCGCACGCAGGGGTTCGGTGATCCGACCCACATGATCAAGTGTTTCTGTCAACAGCGTCCGGAACATCTCAGGCACGGGCTGGGTCGCTTGACGCAGCACCCGACTGTCCGTCTCTTCCTCCTGCTGCTTTTCCGCGAAGTGGCGGACCATCAAGGATGTCGAGATACCGCCGGAGTCACCGGCCGGTCGTCCCTTCGAATCCAGGTACGTTCGCTCCCGCTTCCAGTCAACCGGCAGCAGTGCGACTTCGCCCGCACGCATGCCGGTCACAATCATCGTGCGTATCGCAGCGAACCGCAGATCGTCCACGAACGTCCGCGGCTGTTCGGTCATCACGATTCGCGTCAGCTCCCAGAACGCCCTGCGCTCCGGCAGACGCTCACCACGCTTGCGTGCTTCGAGGTCGTCGCGAAGCTCGTCCGATGACCACGTGTGTTTTGCCTTGATCGCGCTTTTACCTTTCATGCGCGGAACAGCAAGCGCCGGGTATAACGGACCCGCGTCGCAGATGTGCTGGGCATCAAGCACCGCCTTGACGATGCCGACCACCAGGTCCCCTAGCTTCCCGGTGGCCTGGATAGCCTTGCCAATGCGCACAGCAACGTGCATGTCATCGACGGTCAACTCCCAAGGCTCCTTGCTCACGCACGTAGCAACCACGCGTAACGGGCGAGCGATATTCTGAGCCACATGTCCAGTCGTGTTCCTCCTGAACAGCAACTGCTCGGCGACTGCCGCCTTAATCAGGTCCTGCCATGCAAAGGAGAGCGGCCGTCTGGCGAGTGGAATCAGGCCGCGCTCGACACGCTCTGCATTGACGATGGCCAATGCCTTCACTTCGGTGCCGAAGTCTCGTAGATAGTAGGTCGGAGGTGGCACATCGCTAACAGCGGTCGTCAGATTCCAGCCAACACCGTCTTGAGCGGAGCCTGACGGGTCAAGAGGTATTTCCCAGCACATCCCCTTTTCACGCGCGAGCGTTTTACCCAGTTCAATGAATGCAGAGTAGTGCGGATTCATCGGCCCTCGCTTTCCAACTCATCGATGACCGTCTGTATTTCGGCGACCGTGCGCTGCAACTGCAGGTAGGTCGGAGACCGGGTGTCGCCTCGAGAACTCTGCTCAAAGAACACGACCACCTCGCGCATCGAGGCCAGTACGCTCTCGTGCATCACCTTGTCATGCAATGGCATGAACTTCCTACACCCGTAGCACGACGTCACCGGGTTATAGGGGCACGCAGGCTGTCCCGATGTACAACCGCCGATGCCCGCGATCGGAATGCCGTGAGGCACGCCAGCAATCTGCTGTTCATCTTTAAGTAAGGTGAGCTCTTCCGGGGAGATGAACCGGTCGTGCGCGATCTTTGCCACGCGTCGGTAGATGTCCGATGCTCCTAGCGCGCGGTTGACCCGTTCAGCGTGGGAGGCCGACGTCGAGAAGTACACCAGTCCGGTCTGGACGTACGAATGCCCCATGAACTCGGCAAGCTCTTCGTGGCTCGCTCCGGCGTCAACAAGCCGCTGGGCCGCAGTATGCCGCAGATCGGTAGCTGTGCCGAGTTCATCGGAACCAATCAGCTTGCGCACAAGTGCAGCGATGCGAGCCCCAGCCTCGTAGTTTGACTGAACATTGAAGAAGTGTGCGTCGCCAGTGGCTCCGCCGACCCGCCGCCATTTTTCGAGCGCAACAAACAGGGGCGCCCACTCGCGCTTGACTCGACGCGTAAGGGGACGCTGCTTGGCTTTACCGCGCTGTTTGGCCATATGGAACGTCAGATGCACGGTTGGCAAGCCGTCTTGGGCATCGTGCCAGATGCGGACGTTTCGCATATTCAGCATCGCAATCTGGATGGGACGCATGGCAAACTGGTAGGCACACAGCAGCATCCCCGTATCGCAAAGCACCTCGTTTGAAACCAATGCCGGGGAGGTAAGGGCAGTGACGGTCTCGTCCAGATGCCGCACGACCGCGGCCTCCTCATCCGCGTTCAGAAACACATCACCCGAGTGAACACCCATGTAGGCATCGCGGGCCGGTAAGGGTAGGGTATTCGTGATGTAGGAACTGTAGGAGTCGGACCAATCTTGCAGCCGGTGACGGCACAGTAGCCGAAGCAGCGACTTTGCCAGGATGTAGGCCGCGAGAGGCATTTCCCGCACGCGCAACCCAGCCCAGACAGCCGCTATTCCCAAAGGCCCTGCGCGCACGATGTCAGCGATATCTTTCAGCGACATGTGTGTCGCCGCGATCAAGTAGTTCGCAGCGGTGGAAACGGTGAGATCCACGCTCAGGAGATACAGGAAGACGTGCTTTAACAGCAGGGCGTGCCTTTTCTCATGCCGAGTGAAGTCGACCCTGATCGTTCGGCCGTTGATTGCCAGTTCAAACAGCATCGCACTGGCAGGATCGTGAATCGAGTGTTGGGTGTCGTCAAAGTCATCGTAGTAACGAATGAATGGTGGCAAGACCGGCAGCGCCGCCAGGACACGATCGATGACATCGCCGGCGAATCGCGGCTCCGAAGACAGCCCTGAACTATGTGGCTTTCCCAAATCAGCTCCGAAGAGGTAAGGCGCGAAGCAGCGCAACCCGGTCATCGAAGGCGTCATTCCATACGCCGGCCAGCCTGTCCTCAAACACCGCACGTGCGTACCGGGACGCCATGGCCGATGTCTTGGACCAACCAAAGAATGTTCGCATCTTCTGGAGTGCTTCATCCATTGCATCGCCGTGCTCAAGCAACTGGTGCAATCGAACGACAGCGCAGGTATGGCGCAGATCATGGGGCGTTATCGCATCCTTGCCAGTGCGATTCTTCAGTTCCGCGAGTACTTCACGCGGCAGGCGGCAAGATATCCGCGCGAATATCTTGGTCAGCGCTTCGGTCGACAGAGGCGTATCACACTGGGAGTTCAGCAGATAGGAATGAGAAGGTCGGCCGCGATAGTTCTCGACGTAACGTTGGACCACGCCGGCTGTCAGCTCGCTAACAGGAACCTGCCGAACCGAATGCGCGTTCTTGATGCCCGGCTTCGAGTAGCGAGGGTCAACCTCCGAATCCTCGTATTCGTTTTCCTGGACGTTGAGCCAGTATCGCGTTCGCTGCTGCCTGTGGTCGTAAGCGCTTTTCACCGCGTCGGCCGGCAGAAGCAACGCTTCGCCTCGGCGTAGCCCTTGGTGCAACATCAACACAAAGGCAACAAAAACGCGCCAGCGCGTCCGTTCGCGTGCAAACGGATTTCGCCGGGAATCCGGGTCAAGCAACAGGTACAGCGCTTCAACAATGCTGGCGGGCAACGAGCGCACCATCTCGACGGAATTGTCCCTGCGCACGTGAAGCTGGCTGTAAAGCCCGGCGAGCCTGTGTAGCCGCGGTTCGATTTGTCTCAGCCGGTCATTGGCGGCTTCGCTTTTCGATAGCCACGTAACCACGGATGACACGAATTCAAGGCCCGTCTGCCAGCGCTTTTCATCGGCACCGGTCACCACAGACTGATTGCGGATCGAAACGAACCACGATTCGAGGATTTCAGCCAGTGCTTCGTGGTTTAACGTTCCCAGCGCGTCATCCAAGGCATTTGGACCGCCTAGCACGGCTGCATGCCGATACAAGTCTTCCAGATACCGGAGCTTTTTCGTGTGCGTGGACACGGCTAACTGAGAAGACGACATCGTAGACCAGACCGCCGCCCAATAACGCGGCAACCCGGTGTCATCAAGCAGCAGTGGGCCACGGAGCGACGGCGGAACCGCAACGCCGAACAGTTGTTGAAACACGGCAGCCGTCCGTTAAAAGACGCAACGGCTAAAGATACCATTAATTGTATGACGTATTGCACTGAAAGACCGGTTTGGGCCGCCTGGACGCAACGGCTCCAAACATACCAGTCCTGTCGATAACTATGATTATGTCAAATACGATATTCCAGCCCCAAACCCCGCGGCAAGCGGCCCCCAACCGACCGCTGCCGCACTTTTGAGCGCCTCCGAAACGCTCAGGCAACCGGATTCATCCACTCGGTCAGCCCTTCACGCGCATTCAATTCCAGGAACGTCGGCGTCCCGCGAAGGTGCTGGATATAGCGCACCAGATTCGGCCACGTCGTTGCCGGGCGCGGCATGTTTCGCGTCCAGCGCATGAGCATCAGCGCCAGGAAGTCGACCGTATTCAACCGGTCGTTGATCAAATACGTCCGACCGTCGGACAGCAGATTGTCGAGATGCGCCATCGACGCCTCGACTTGCCCGCGCGAAAACACCTTCACCATTTCAGCACATCGCGGATCACCATCCTTCTCCGCATAAAACCAGTTTCGCATTGCCGGCAAGACGGAATTGGCGAGATAGATCATCATCTCGAACCACTCGGGACGTTCAGGCGCGCCCGGATGCGGAGCCAGGCCCGGCTCCGGATGACGTTCGGCAAGCAGCATCAGCAAGGCGGCCGATTCGGTCCGCGGGATGCCGTCGACGACCAGCGTCGGCACTCGGCCGGCGGGATTCAGCCGCAGGTACTCGGAATCGTGCTGCGCCCCCGTATCGATATCGACGAGCCGCGTTTCGAACGGCACGCCGAGTTCGATCAGCATCCAGTGCACGGCCATGCTTGCCGCGCCGGGCGAGTAATACAGAACGTAGGACATGGTCTGAGTTCGTCTCTTTGAGTTGAGTCGTATGCACGACGATCCTATGTCATCAATGCAAGATAATCAGCCGGAAGCCGGAAGCGTTGCCCCCTGCCGCCCGTCGTCCGTTGGCCGCACTCATTCGAACGTCGCGCTTCGTGGTTCATCGGATCCGATCGCCACGCCATCAGAAATTTGCCGCTGAAGTTGCCGGTGCGCAATCAGAATGAACGGCACAAAGACCGCACTCACGGCCGCAATCACGAGATAAGTGCCGGGAATGGCAAAGCGGTCGGCCAGCAAACCGCCGGCAAACGAACCGACGGGATACCCCCCATATACGATTACCCGAAAAAAGCCGTTCACGCCCCCGAGCAAATGCTGCGGCACCAACTCTTGCCGCATCGTGATAACGATGGATGACCAGATCGTCGCGCCGATTCCGGTCAGGAACAAGGCTGCTCCAACGATGACAGGCGCTTTCGTGACGCTGGAGATCAGTATCAGGAAGAGAATGCCGGCGGCGTCGACGAACAGCAGCATCAGGTTGTTCTTGTCGATCCGTATTCTTTCCAGCACCAATGCAGCAGCCACCGCCCCGCTGGCGAGCACCGCCATCATAAAACCGTACGAGCCTTTCGTCAGGGTTCTGGGAACGGCTTCTATCAGGTAATACGGCTCGAGGGAAAGCCACGCTCCCCAGGCGACAGACATGCCGAAACCCACCGCAGCCAACGGCGCCAGCACCCTGCTCCTCAGCAGCCAGACCACTGGATCGAGCAGGGATTTCGGAATGGAGCGCCGGGTGACATGCTCCGGGACAGCCGGTCGCGCGCCCCTGAGTAAAAACGACAGCAAGCCTGCCGATACGGCATAAGACAAAGCGATGATCGCAAGACCCAGCACCGGTCGGAAAAGACTGAGAGTCGACCCGAGAACCGGCCCAAGAAATTCCGAGCAAACCGTATGCGTAACCGCCAGTCGCGTATTGCTTCTGACGCGATTCTCGGCCGTCGTCAGATCCAGCACCAGCTTCGGCAACGACACTTCGAATAGCACATCGAACAATCCATATACGAACGCAGCGAAATAGAGATGGTAGAGCCCCAGCCGGCCGTGTAACGCAACGATTGCGAGATAGGCAAGACGCCACGGACAAGATGACACCAGAATGGCACCGACCTGACTTGCAGACGATCAACCAGCACACCCGAGAGCAAGCCGAACAGCGGCCAGCCGAGCGTCAGGAAAGTGAACACGTAGGCGACGTCGCTGGGCGCGATCGCGAGGGACAGCGCAACGAACGGGAGAAAAATCTTCTGGCTGCCATCCGTGAAATTGGACAGCGCAACCGATCGCCAGAGTTGCCTGAACCGAGCTCCGCTCATGTCGGCTCACCTCCAAGGTCGCGTTCGGAATGGGTGCCGCCGCCACTGTCCATGCGCGTTGTTTCCGAAATCGACGTACGTCGCGGAGATCCATTTGTCACGAACGTCGGACGAAGCGCTCCACTCTTGCCGACATTGACGACCGGGTCATGCGATGCCCGGCATAGAAATCCAGCCGGCTCCCCGCCGACCATATACAGCCCCATGTCCGTGTATGCCGCAAAGCGGCCGGGGTCATCGAATACAGGGTTCAGCACCTCTTGCCGATCGAGTTCGACCCACTCCTGCACGATATACCGGTCGAGTGTGGGCGACGCAACGAGTTCACGCCATTCGGCATCCGAGGTATGGCGACCGATGTGAACACCCCGCCCGCGGGATTCGATCACCTGCTTGAGCACGCACTGGTTCTTTTCATGCAACAGTCGTTCCCGTCCGCCGAATTGCTGCAGGCCATCTGCGCTCAGCGACCAGGTTCGCGCAACGAGACGTTTGATCGCGTCGACTTCCGGGCTCGATAGATGGCGCTGGACTCGCTCGGTATGCAGCATCGCAAGCATCCGCTTGTTCTCGGCCAGCAATTGCGAGGCAAACGAGTTGACGTATTTAAAGGCACCATCACGGACCGCTTCGAGGAACGGTTGGCGCACGGCTTCCGGGCCATAAATCCCGAGATCCATGTCGCCGGCCTCCGTCACGACAGCATCGAATTTCAGATAGGCAACCGTGACCGGCGTTCCTTTGTGCGTAAGTTGCCTGCCGTCGTAGTCCAGATCCTGCACGTCGCAGCGGATTGCCCCATATCCACGCTCCTGCGCCAATTCGGTCAGCCGGTCCAGATCCGTGACAATGCGACGGTGCTTGCTGCTGATGAACGCCATGTTCAACGGCGCACCGCGATCGACGAGCGACGCCAGATAGTCAACAAAATAGAACTTGTTATCGACGGTCAAAGGGACATGAGTACCGGCGATATGTGCGTTGAAGAATGCCGACGCACGGAATGCCTGATTGATGTCGCCCACCGTCGTCAGTGCTCCCGGGCACCCCGCATTCGTCTCCATCATCCTGACCCCGCCTCCCTTGACGATCGCGAGATCAAATCTCGCGAGATCGATCAATCTCGACGTCGATCCCGGGAAACGAACCAGGTCTTCCAATCCGTGAAGCTCGGGGAAATGGCGCCATAGCGAGCCATCGCTGCTGACTATTCCGATCGCCTTCTCGAGCGCGACCAGAACTTGTCTCGCGTCTCGACAAATCTCGTCGTATTCCCGCTCGGGAATCAGCAGAGGCGTAGGAGATACCGGAAACGGGCGATTCTTGTCGAGGAAGGTCACGCTATTGTGCATGGCCGCCCTCAGGGCGCTATAGTCGTCGGACATGGACACGATCCCTTATTGCGAATACGTCAGAATTCCAGCAGGAGCGATCCCCATGACAGCCCCACTCCGAAGCCGGTCAACAAAACCTTGCGTCCGGCGAGCGCTCGCTCTTGCATGATTCGACCAAGCGCGAGGGGAATGCTGGACGAAGTGGTGTTCCCTCCGTCCTGAAGGAGGACCGGCACCTCGGTGTTAACGCCGAACGGCAGGCGCTCGATCAATAGCTCAAGCATCGGGCGAGAAGCTTGGTGCAATACGACAATATCGACATCGCTCGGTTGCACACCGTTCGACGCAAGGCATTTCGTCATGCTTTCCGGCACACGTTTGCTGAAATAGCTCGCCAGCTTTGCACCATTCATCCGGACCGAATGCGCGTCGGCCTCCGGAACGGGCGTATCGTCCAGCAAATGGCTGGTGCGCCGCGCATCTCGCCACTGCGGTGTATGAAGCGCGCGCCACTCGCCTCCGTCGGTCCCGAAATCCCCGACACAGATTGCCCCGCCGCGATCGCGTTCAAGCAGCGTGGCTGTCGCCGCGTCGCCGAAAATGGCGGCGGTCTTCGGTTGATCGAGTGTCAGGAGACGGGAATAAGGATCGTTCGTGACGATCACCGCCCGATTCCAGTTCTGCGCTTCCAGCATTCCCCGACACAACGACAAAACATAGGGAAACGCAGAACACCCCAGCGATACATCGAATGCGGCCGGATGGCCCTGATATGACAACATGCCGAGCAACTGCGCTGCACTGTGCGGCATACTGCGGCCCGGGTTCTGCGTGACATAAACGAGGAGATCGATATCGCCGTCGCCAAAACCGGTCTGCGCCTGCAACTTGGCAAAAGCATCTCTCGCCAGTGGAAGCTCGCACTGGTCACGCTCCAGTAAAAACCGGCGACGAATACCTGTCTTGGCCAGTACCTGTTCCTCGGTTAGCCCAACTCTGCTGCCGTAAGCCTGAGCCGTTAGCATCGTCTCCGGAAATGCCCAGGCGATTGACGATACGCCGATCATGTCGACACCCCTTTTAGCGTATCGACGGCGGCAACCGTGGTCAGGCCGATATTGCCGTCGGTCGGCAACGCGAAAACGGCATCGTCGTCTGCCTGAGCACCTTGAATCCTGACAAGCCAGCCGTCCGGCTGATCTCCGAACACGAAAGGGCCAGCAGTCCCACTGCAATGCCTGAGTCCCAGGCTGCCGTCGTGATCGCGAAACCAAAGAGGAAACTTGTCGGCGACGACAAATCGCTGCACGACCCAATCCTCTCCATCGTCCCGCTTCATCTTGGTGCGGGAAATGATGTCGTCCCAAATGGCATCGTCGGTCCGGCTGCCGATGAATACCTGCTTGCCTACGTGCGACGCGCAGCGCTTCAGGACCAGCGTGCCTTTTTGCGCTCTGACGAACTTCTCCAGATCCACGATCTCGCCATCGACCGTCGTGACCTTCCTGTCCATCAAACGTGTCCATGGCACAAGTGCCTTGAGTCGACCGAGTTCCGGGTCGTCTTGCGACTGCTGGTCGAGGAACTCGCTCACCAGTGCCAGCGCGCCCTTGTCGTCGACGCCGATATCCAGCGGATCACCGAGCGTGATCGTCCGTGTCGAGCGGGCCGCGTTCAACGCCTGTGCCATGCCTGCAAGCAAGATCGGCTCGTGAAGGAAACAGGCGTCCCGATAGAAAATGCCATATTCCGTTTGACCGTCGGTGAGCCACTCCCCTTGCCGGAGCGCCTCCGGAAAAACCGTTCTCGCTGGAACGCCCAAACGAGCACTGATCCGAGCGGCCAGATGATCGGATTGGTCCAGATAGAATCGGCTGAAATCGGCGAGTGGCGTGATGCACACCGATTGCCGTGATGCCTTGGTCAACTGGTTTTGAATGAAACCCAGTTGACTCTCGAAAGGATTCTGCAATTCGTACGCGAGAAGACTGCCAAGACCTGAATCGAGGTAACCGTCCGTCAATACGCCGACCTCCTGAATCCCCCCCATTCCGGAATCCATATTGAACTCCAGGACCCGGAGCTTGCCCCCTGCCAGAATGCAGTCGGGACGCCCCATCAGCGGCTGCCTTTCCGCATCCCTGTATCGTTCCACGATCGAACGGTCGTGCGCCCTGATTGGCGCACGGTCGTCGAGATGTGCAAGCGACACTGCTCGGGCAAGCTGCATGAGCACGCGCAAGTCGTGTGACGCCTGTTTCAGGAACGCGGCCTCCATCAACACGGGTGCAATCGGGATATCGCCGTACTCGTAGCAAAATCCGAATTCCGCGTAAGCAGCGAGGAGCGCATCGTGCCAGCCCAAATCCGTGTGCAGGCACTGTGCCTCATTCGCTTTCAGTTCGATGAGGGGGCTCGCGGAACTCATAGAAGGCCAATGCTCCGTCCCACTTCCGCGAAAGTGACAAGCGCGCTCGAGATATCCGATTCCGTCAGCGACGCGCTGGCGTTGACGCGGATGCGGGCGCGATTCATCGCAACCGCCGGGTAACAGATCGGATTAACGAAGACGCCGCGCTCATGCATGCCGCGCGCGAAATTGAAGGCGGTCGCCCGATCAGGCATCGCGAGCGGGATGATCGGCGTCTCGCTATGCATCACGTCGAAGCCGACTTCCCGCAATCCGTCGCGAATCTGCCGCTCGTTCCTGTGCAGCCGCACCTGGATGTCGGGTTCGCTTTCGAGTATTTCGACCGCCTTGATCAAGCCACCGGCAACCGAAGGCGGAATGGACGCCGAGAAAATGTAGCCGTTGGAGCCGTAGCGAAGCAGTTCGCCGATTTCCTCGCTGGCACACGCAAACCCGCCCACTCCCGGCAGTCCTTTGCTCAAACTGCCGATCAGGACGTCCACCTCCTTCTCGAGTCCGAAGTGCGCCAAGGTTCCTCGTCCGTTCGGCCCGCAGGACGCCGTCCCGTGGGCGTCGTCGACGATAACGAACGCGCGATATCTCTTTGCCAGCTCGACAATCTGCGGCAACGGCGCAAAGTCGCCGTCCATGCTGAATACGCCATCGGTAATGATGAACTTGCGATCCGAGAGCGGCGCGGATCGCAACAACCCTTCGAGCTGTGCAACGTCCTTGTGCGAATACTTCACCTGCTTCGCCCGCGACAGCCGAATACCGTCAGCGATCGATGCGTGATTGAGCTCGTCGCTGTAAATAACGTCCGTTTCGCCTGCCAGTGTCGAGATCACGGCGAGATTCGCTACATATGCGCTTGAGTACGTTACGCACGATTCGAAGCGCGTAAACGCCGCCAGCGCGTTTTCCAGATCGAGATGCAACGCGGTAGTTCCGTTCAGGATCCGCACCCCGCTGGTCGTCGCACCGTGACGATCCAGCGCCTCGCGTGCGGCATCAACGACTTCATGCCGGTCGTTCAGCCCCGAGTAGTTATAGGAGCCGAAATGAATCTGCGGCACATCCGGATGATCGCCATGCGAGAGCCGTGCAGTCACTGTCCTGCCCGGCACGCCGGAACACACCCGGCTCCAGACCTGCTTGTTGCCGCTTTGTGTCAACCATTCGTTCAGTTTATTGAGCGCCCGAACCCGGGTGGAGCCGTCAAAGATATTCAACGTCATTCGACTTCCCCGTTCGCGTTCGCCTCATTGGTAATGCGGTCAGCAATTCTTCCGATGCTTTTCAGCCGTCCCACCTGCTCCTCATCTATCTCGACGCCAAAGGTGTCTTCAATGAATATTGTCAACTCGAGAACAGTGAAAGAATCGATCGCCCCATCCGCAAAAATTTCCGCATCCGCATCGAACTCTCCGGCCTTGATTCCTCCATGCTTGACCAGAAAATCCACGATCTTTTGCGCAATCTCATGCTGCTGCATATCAGATTTCCTTTCGGTTATAGTTGCCGATCAATGTCCGTATGGTGTTGGCGGTCTGCTGGTGAACCTCATCAAATGCCGCGCGCAGCACATCTGGATCCCGACCTGCATCCATCATCCCGCCAAGCAAATGTGCGGAATATCGGATATGAAATTTCTCGTCGTTGTAGATGGCAGAAAACGTGTCGTTAAGCCGACTCATCAAGGGATCATTGAACGCGGCTGTAATGTGCAAAACATGCTGAACGTAGAACATCGTCCGGACTTCAGCGGCATGAATAAAGCACATGAACCGATAAAGATCGCCACCCCATTTTTCATGCAGATCCATATAGTAAGGATCGATGTAGTCTGGCGGGCCGAGCATTTCATCAGGGGCATAGTGACCAAGATCCCTGAGCAGCTTCAAAAAAAGCATCCCGTGACGCTCTTCGTCTTCAGCATGGCTGATCAGGATTTTCCGATGCTCCCCGCCCCGCTCCTCCGCAACCAGCCGCACCCAACGCCCTCCGTGATTTTCCGTGAACGCGTGCTCTGTCAGCTTATCTTTGAGCTCGATGGAATCGATCATCATGACCGCATTGGCAGCCATCAGATCAACCTTCGCATACTGAGCACCGTCGCTGGCTTTCGCCAGAACCCGCCAGAAAACATCGGGGATCTTCATGCGCTTTTCGTCAGCCAGCGTACGCACCAATGCGACAGACGAGCCTTCGATACGGGGAGGCACGAACATTTCAATTCCCGGAATATGAGATATTTCGATGCGACTGGAACGGATATCGAGAATCCGTAGTTCACAAAACTTGAAACCGGCGTTCTTTTCGATTCGTTCTACAAAAAATGGAACTCGATCAATTTCTTGCTCATTACATCCAGCATCACAACATACCTGCACCCGAAACGATCGAATTCTTTCAAAAACCATTCGATCCATAAATCACAAAATAAATTACCTCTCCAGCCTTCCAAGAACAAGCTGTCAATCCTCCCAGTACGAACTCGTGAAAGCACTCAATTCGTGCGTGTAATTAATAAAAATTTCCCACACAACGACGGAATCGCAATACGAACGAAAAGAAGCGGGACCCTGTCGAATGACAGATAGCTGGCCGCCCCATCGGGAGCGGCCGACAACGGAAAATGCGTCATGCGGTGAAAATGAAAGGGAGCGGATTACCCTGGCGAGTCGTTCGAGTTCACAGCGCCGAATCAAGCGTCACTGCAAACCGTCCTGTGCACCCGAAATGACACGGCCTCGTGGTGATTGACGGGGCCGCTGAGACCATTGGCAAGGAGTGCCGGGAGGCGACTTCCTTCTTCAACCCAAGTCACTCCCCCACCGCCTCCGCCTGCCGATCCTCCCGCTTCGCCACCGCCACACGCCGTCCCCGTCGCCCAACCGACGCCACGACCCGAAACGCCACCGGCGCGAACACGAGCCCGAACACCGTCGCCGCCAGCACGCCTCCGAACGCGCCCGTGCCGATCGACCGCCGGCTTTCCGCCCCCGCGCCCGTTGCCAGCACGAGCGGCACGACGCCCAGCAAGAACGCCATCGACGTCATCACGATCGGCCGGAATCGCGCCGCGGCCGCATCGACCACGGCCTGCCGCAACGGCACGCCGTGCCCGGCCAGGTCACGTGCATACTGCACGATCAGGATCGCGTTCTTCGCCGCCAGCCCGACCACGGTAATCATCCCGACCTTGAAGTACACGTCGTTCGGCATGCCGCGCGCAAGCGCTGCGGCGATCGCGCCGATCATGCCGAGCGGCACGATCGTCAGCACGGACAGCGGAATCGTCCAGCTTTCGTACAGCGCGGCCAGCGCCATGAAGACCGCGAGCACCGACAACCCGACGAGCAGCGGCGTCTGCCGCGCGGCGACCTGCTCCTCGCGCGCCGCATCGACCCAGTCGAAGCCGATCCCGGCCGGCAGCGCACCGGCAAGCCGCTCCATCTCCGCCATCGCCGCGCCCGAACTCGTGCCGGTCGCCGCCCGGCCGCTGATGTCGAGCGACGGATAGCCGTTGTACCGGTTCAGCATCACCGGCCCGATCGTCCAGTGCGGCGCGGCAATGGCCGACAGCGGCACCATGTCGCCGGTACGGTTCGGTACGGTCAGCGCCATCAGTTGCGTATCGGTCGCGCGTGCGACCGGATCGGCCTCGATGATCACCCGCCGCATCCGGCCCGACGCCGGGAAGTCGTTGATGTAGTTCGAGCCGAACGTGCCGCCCAGGAGCCCCGCAATCCGCTCGAACGGTACGCCGAGCGCATACGCCTTCGCACGATCGACATCAAGCTCGATACGCGGCGCGTCCGGCAGGTCCTCGAAATGCACGGCGGACAGCAACGGATCCGCCTTCGCGCGTTCGGCAAGCTGCTCGCGGGCCGCTTTCAGCGCGTCGAGCCCGACCCCGCCGCGATCCTCGAGCCGGAACGTGAAGCCGTCCGAATGTCCGATGCCGCGCACCGAAGGCGGCAGCGCCGCCTCGACGTCGCCGTCGAGGATCGCGCCGAACCGCCCGTTGAGCCGGTCGCGCAACGCCATCGCGTCGACGTCGCGCTTCGCCCAGTCCTTCAATTCCACGAACGCCATCCCGACGTTCTGCCCGCTGCCGACGAAACTCCAGCCGATCACGCTCGTCACGTTCGCAATCGCCGGCTCCCCGTGCAGGATCGCCTCCACACGCTCGACGACCGCGAGCGTACGCGCCTGCGTGGCGCCGGCCGGCAACTGGATCATCACCTGCAGTTGCCCCTGATCTTCGGTCGGCAGGAAGCCGCCCGGCATCATCCAGTACAGCAGCCCGCATCCGATCACGAGCGCGACATAGACGGCCACGACCATGCCGGTGCGGCGCACGGCAAACACGGCGACGCGACGATAACCGGTTTCGGCGCGCGCGAAGGCCGCGCCGAACCGGTCGGCCAGACGCGCGCCGATGCCGCGTCGCCGCGCACGGCCGGTGCCGCCATCGTGTCGCGCGACCGGCTTCAGCAGGTTCGCACACAGCGCGGGCGTGAGCGACAGCGCCATGAACGACGACACCAGCATCGACGCGATCATCGCCACCGCGAACTGCCGGTAGATGCCGCCGACGCTGCCCGGGAAAAACGCCATCGGCACGAACACGGCGGTCAGCACCGCGGTCACGCCGACGATCGCCCCGCCGATCCGCTTCATCGCCCGGCGCGTGGCATCGCGCGGCGACACGCCCTCCTCCATCACGCGGTGCACGCTCTCGACGACGACGATCGCATCGTCGACCAGGATGCCGATCGCCAGCACGAGGCCGAACATCGTGAACACGTTGATCGACAGCCCGAACGCCCACATCGCGACGAACGCGCCCATCAGCGTGACCGGGATCACGACGGTCGGCACCAGCGTGTAGCGCAGGTCGCGCAGGAACAGCCACATCACGCAGAACACCAGCACGACGGCCTCGACGAGCGTCAGCACGACTTCGTGGATCGCGATCTTGACGAAGTGCGCACCGTCGAACGGAATCTCGACCGCGACGCCCGGCGGCAATGTACGCGACAGCTCGGCGAGCCGTGCGCGGATTGCATTCGACGTTTCGAGCGCGTTGCCGCGCGGGCCGAGCTGGATGCCGACCGTCGCGGCCGGCCGGCCGTTCAGCCGCGAATAGAACGAATAGTCGTCACGGCCGATCTCGACGCGCGCGACATCCGCGACGCGCACCACCGAGCCGTCCTGTTTCGACTTCAGCACGATCCGGCCGAACTCCTCGGGCGACGTAAGCTGCCCCTTGACCACGACCGTCGCGGTGAGCTGCTGGCCGCGCACGAACGGCGCGTCGCCGATCGCGCCGGCCGTGACCGTCGCGTTCTGCGCGCTGATTGCCGCGATCACGTCGTCGGCGCCGAGATCGTATTCGCGCAACTTGGCTGGATCGAGCCAGACCCGCAGCGCCTCGTCGGCATCCCACAGCTCGGCGGCGCCTACCCCCGGCGCACGCTTCAGCTCGCGCAGCACGTAGCGGTTCAGGTAATCGCCGAGCTGCGCGGAATCGCGCGCGCCGTCGGTCGACGTCAGCGTGACGAGCATCAGGAACGTGTTCGCCGCCTTGAACACGCCGATCCCCTGCTGGACGACCTGCTGCGGCAGCCGCGGCTCGACCTGCTTCAAGCGGTTGTTGACGTCGACGAGCGCGATGTCCGGATCGGTGCTCGGCGAAAACGTCACGTCGATCTCGAGGTTGCCGTGGCCGTCGCTGCTCGTCTCGTAGTAGAGCAGCCCGTCGGCGCCGTCGAGGCTTTCCTCGATGATGCTGCCGACGTCGGCATCGACCGTCTCGGTCGACGCGCCCGGATAGGACGCGGTGATCACGACCCGCGGCGGCGCCAGGCGCGGGTACTGCGCGATCGGCAACTGCGGAATGGCCAGCACGCCCGCGACGACGATCGCGAGCGCGACGATCCACGCAAAGACGGGACGGTCGATGAAGAATGACGGCATCGGCAAGAGTGGGCGAACGCGCTACAGCGTGAGCTCGAACGAGCGCGCAAGCAGCCCCGGCAGCTGCGCGCCTCCGGTGGCGCGCTCGCCGTACGTGTCGTCGTTCGCGAGTAGCACCGGCGCCGCGCCGAGTTGCTCGAGCTCCGCCCCGAACAGCCGGTAGAAGCTGTCGTCGAACCGCATCGCGTCGATAGGTGCGACGATGCGCCCGCCTTCGACCCAGAACGTCGCGAACCGCGTCATCCCCGTCATCCGGCACGCGATCGGGTCCGAGAAGTTCAGGTACCAGAGGTTGCCGACGTAGAGCCCCGTGTCGAGCGCGGCCAGCACGTCGGCATCGGCGAGCGTACCGCCCGCGATCGTCAGCGACTGCGGCGATTCGCCGTTGTCCGCGCCGTTCGGCGTGCCGCCTTGCTCGCGCGCGGTGCGTGCGCAGACGAGCTGCCCGGCGCCCCGCCCTTCGACGACGAGCGGCACGCTGTCACGCCGGTAGCCGTCCGCGTTGAACGCCGGCGCGATACCCAGCGAGAAGTCCTCGGTGATCGACACACGCGGATCGAGCGCGACTTCGCCGGCGTACAGGCGATGCAGCGGGCTGCGGGCGCTCGCCACCGCACGCGCCGAAAAGCCGCTCCAGCCGAGCAGGCTCGTCATCTCGTGGACGGCGTCCGGCGCGAAATACGCGCGGTAGTGCCCGGGCGCAAGCGCCTTCGGCGTGCGGCCGAGCACGGGCAGCCGCGCGGCGGCCGCGTCGACCTTCGCCGCGAATGCCGCGTCGCTCCAGTCGTCGCCCGCATAGACGGTCTTGATCGCGCGACCGCTCGGGTCGTACAGCGACCAGCTGAAGTTGAAATTCTCGACCTCGTACCAGCCGCGGCTGCCGGTCGACGACGCGAAGCCGCGCGCCAGCGTGCCGCCTGCGTAAAAGCCGACGAAGTCGAGCCCGCGCGCGCATTCGGCCACGATTCGCGCGAGCCCGTCCGCATTCGGCAGGCGCCCGGTGCGGCGCGTTTCCTGCAGCCACGACGAGGTATCGAACAGCAGGTGCGGATCGTCCTGCGCATCGCGCAGGCCGTCGCGCAACACGGCAACCGCGTCTGCCAGTTCGCGCAGGTCGGCCGCCGGATCGCCGCAGACGGTCAGCGCCGAGTGCGCCTGCCGCGCCCCGTCGACGAGCCGCACGGACAGCCGCCCCTGCGTGACGCGCCCGGTCTGGCGAATCTTGCCGCCATTGAAGCGGATGAAGTCGGACGTTTCGCTGGAGAAACCGAGCAGCACGGTTTCGGTCGGCGTCGTCAGCCGTTGCGCGTCGTCGGCGAGCCGGGTGAAATGCGCGTGCCAGTCGATTCCGGCCTGGCGTGCCATCGGATTGAATCCCGTCATCACGCGCCTCCGAATACGTCGACGCCGGCGAACACGCAGGCCGGCGCGGCGTGGCCGACACGGATGATCTGCGCGGGCTCGCCCTTCCCGCAATAAGGCGTGCCGTAGATGCCGAACGTGCTCGCATCGCCGACCGCGCGCAGGCTGCGCCAGAACTGAGCGGAGATGCCGCGGTAGTTCGGGCGCTTGACGACCTGCGTGAGCCGGCCGTTCTCGATCAGTTGCCCGAATTCGCAGCCGAACTGGAATTTGTTGCGGTGGTCGTCGATCGACCAGGACGTGTTGGTGCGCATCAGGATCCCGTGCTCGGTGCCCGCGATCAGCGCACCGAGCGACTGGTCGCCGGGTTCGACGTTCAGGTTCGCCATCCGGTCGATCGGCGCGCGGTTCCAGCTCGACGCGCGCGAGTTGGCGACGCCCGGCAGGCCCGCGCGTTGCTGCGACAGCGCCCCGCCGAGCAACCGTTCGAGCACGCCGTTGCGAATCAGGTACTGCTTGCGCGCCGGCGTACCGTCGTCGTCGAACGCGTAGGACGCGGCTTCGGCCGCCGGCTCCGGATCGAACGTGACGTTCAGCAGCGGCGAGCCGTACTGATAGTGGCCGACCATCTCCGGCCTGACGAAGCTCGACCCGGCGAAGTTGCGCTCGTCGCCGAGAATCCGGTCGAGTTCGAGCGGATGACCGATCGACTCGTGGATCTGCAGCATCATCTGGTCCGGCATCAGCAGCAGATCGCGCCGGCCGGTCGGGCAGTTCGGCGCGGCGAGCAACTGCAGCGCCTCGTCGGCGACGCGTGCGCCGGCGCCGTCGAAGCCGTAGCGCGCCAGCACGTCGATGCCGCCCTGCGCGAGCGTGCCCGATTGACCGAGCGAGCGCGTCTGCGTGTCGCCGTCGCCGTGCGCGACGGCCTGCAGTTCGGGCATCAGGAAGCGGAAGCGCTGGTCGATCCGCACGCCGTCGCTCGTCACGTAACACTGCTCGGTGTGCACGATCATCACGCTCGCGGCGCGCTCGACGATGCGCGCGCCGAGGTTCGCCGCCGCGCATTCGTGCGCAAGGCGCTCGATCCATTCCGCGCGCGACGGCAGCGCCGCATCGACATCGGGCGACGCATACGTGCCGCTCGCCTGCGGGCGCGCGGCCTGGCGATGATCGACCAGCGCCCACGGCGCGCTGGCCCGCGCCCGCGCGGTCGCGATGTCGAGCGCTGCCTGCAGCCCGGCCGGGGACAGGTCGGGCGTCGCCGCATAGCCCGCGCCCGTGCCGGCCCAGGCCGTCAGCATCGCACCACGCTCGCGCGTGCGCGAAAACGGCTGCGCGACGTCGTTGCGCACCGCATGTTCGTCGATCGTCTCGTCGACGATGCGCAGCGACCAGAAATCGGCGTCGCTGCGCAACGCTTTCGCGGCTTGCACCGCTCGGGCGCCCGGCGCCCGGCTCTCGTCAATCATCGGTTCTCTCTTGCCTCTCCTGCGGGCCGCGCTCCGGCGCCAGCATGGACGACGCGAGCAACGCTCGCGTATACGGATGGGATGGCGCGTGCAGCACGTCGAGCGTCTCGCCTGCCTCGACCACGCGCCCCGCCTTCATCACGATGACCCGGTGAGCCATCGCGCGCATCACCGCCAGATCGTGGGTGATGAACAGGTAGCTGAGTTTGTACTTCTTTTGGAGATTCGTCAGCAGATTCAGCACCTGCTTCTGGATCGACACGTCGAGCGCACTCGTCGGCTCGTCGAGCACCAGCAGTTCCGGCTCCACCGCCAGCGCCCGCGCGATCGCGATCCGCTGCCGCTGCCCGCCGGAAAACTCGTGCGGATAGCGCAGCATCGCTTCGGGCGGCAGGCCCACCTCCTGCAACAGCGCGGCGATCCGCGCCCGCCGCGCGTCGCCGGCCACCTGCGGCCGGTGCACACCCAGCCCTTCGCCGACGATCTGCTCGACCGTCATGCGCGGCGACAGCGAGCCGAACGGATCCTGGAACACCACCTGCATGCGCCCGTACAGCGTGCGCCGGCCCTGCGTCGTGCACAGCGACGCCAGCGGCATGCCGTCGATCTCGATCCCGCCCGACGTCGGCCGCTGCAGGCCGAGCACGGCCGACGCGAGCGTCGATTTCCCCGAGCCCGACTCGCCGACGATCCCGAGCGTTTCGCCGCGCTTCAGGCTCAACTGCACGTCGTGCACGGCGCGGAACGTCGTCTTGCCGAGCACCGAGCGCCAACCCTTTGCCGCGATCCGGTAGTCGACCGCGAGGTGCTGCACGTCGAGGATCGTCTGCGCGCCGGCGGCGACCGGCTCGACCGCCCGTTGCGGCGCGCTGTCGAGCAGCCGGCGCGTATACGGATGCTGCGGCGCCGCGAACAGGGCGGCGGTCGTGTTGGTCTCCACCAGCACGCCCTTCTCCATCACCGCCACGCGCTGCGCGAAGCGCCGCACGAGGTTCAGGTCGTGCGTGATCAGCAGCACGGCCATCCCGCGCGCGGCGGCCTCCTGCTCCTGCAGCTCGATCAGCAGGTCGACGATCTGCTGGCGCACCGTCACGTCGAGCGCGGTGGTCGGCTCGTCCGCAAGCAGCAGCCGCGGCCGGCACGCCAGCGCCATCGCGATCATCGCGCGTTGCCGCTGCCCGCCCGACAGTTGGTGCGGAAAACTGTCGATGCGACGCTCCGGCTCCGGAATCCCGGTGCGCCGCAGCAGCGCGATCCCGCGCTCGCGAGCTTCGCCCGGGCGCAGCCCCTCGTGCAGCCGCAGGCTCTCGGCGATCTGCTTGCCGATCGTATACAGCGGATTGAGCGCGGTCATCGGCTCCTGGAACACCATCGCGATATCGGCACCGCGAATCCCGCGCATCTGCTGCTCGGTCTTTGCGAGCAAATCCTCGCCGTCGAACAGCATCCGCCCCGACAGCGTCGCGTGCTGCGCAAGCCGCAGGATCGACAGTGCGGTCACGCTCTTGCCCGACCCTGATTCGCCGACGAGCGCCACGCGCTCGCCGCGACCGATCGACAGGCTCAGGTCCTGTACGGCCGCGTTCGCGCCGAAATGCGCGGAAAACCCGTCGATCTGCAGCAACGGTCGCGTCATCACCGCCCTCCGCCGAATGCCGAACCGCGCGTGCGCGTGTCGAGCGCGTTGCGCAGCGCATCGCCCATGAAGGTGAGCAGCAGCAGCGTCACCACCAGCGCCGCGAATGCCGCAATCGAGATCCACCACGCATCCAGGTTGTTCTTGCCCTCCTGCAGCAGTTCGCCGAGGCTCGGCGTTGGCGGCGGCACGCCGAGGCCGAGGAAATCGAGGCTCGTCAGCGACAGGATCGCCGCGCTCATCCGGAACGGCAGGAACGTGATCACCGGCGTGAGGCTGTTCGGCAGCACGTGGCGCCAGATGATCTGTGTGTTCGTCAGCCCCATCGTGCGTGCCGCTTTCACGTAGTCGAGCCCGCGGTTGCGCAGGAATTCGGCGCGCACGTAGTCGGACAGCACGAGCCAGCCGAACATCGACAGCAGGATGAACAGCAGCCACAGCGACGGCGTGAAGATCGACGCGAAGATGATCAGCAGGTACAAGTCCGGCAGCGCGCTCCAGATCTCGATCAGCCGCTGCCCGACCAGATCCGTGCGGCCGCCGTAGAACCCTTGCAGCGCGCCCGTCAGCACGCCGACCAGCACGCCCGATACCGTCAGCGCGAACGCCATCAGCACCGACAGCCGGAACCCGTACAGCAATCGCGCGAGCACGTCGCGCCCGAACTGGTCGGTGCCGAGCCAGTTGCTCGACGACGGCGGCGCCGGATACGGCCGCGACGCGAAATAGTCGATCGTGTCGTAGCGGTAGCGGTTCGGCGGATAGATCGCGAAATTGCCGTGCGACTCGATCTTCGTGCGGATATACGGATCGAGATAGTTCGTCATCGCGGGAAAGTCGCCGCCGAACAAGGTCTCCGGATAGTCCTTCACGATCGGGAAGTAATAGTGCCCGTCGTAGCGCACGATCAGCGGCCGGTCGTTCGACAGCAGGTTTGCGCCGAGGCTGAGCGCGAACAGCACCGTGAAGATCACGAGGCTCCAGTAGCCGAGCGGCTGTGCACGAAAGCGCAGCCACGTGCGCCGCCACGGCGACGGCGAGGCCGCGCACGCGGCGCACGCCTGGTCAGTGGTCCAGGCGGTTGAACTGGATACGGGGGTCGACGAGGACATAGCAGATATCCGCGATGAGTTTGGTCAGCAGGCCGATCAGCGTGAACAGGAACAGCGAACCGAGCACGACCGGATAGTCGCGGCGGATCACCGAGTCGTACGACAGTTGCCCCATGCCGTCGAGCGAGAACAGCGTCTCGATCAGCAGGTTGCCGTTCAGGAACGCACCGACGAACGCGGCCGGCAGCCCGGTGAGCAACGGGATCGCCGCGTTGCGCAGCACGTGCTTCCACAGCACGTCGCGCTCCGGCGCCCCCTTCGCGCGTGCAGTCAGCACGTACTGTCGGCCGATTTCCTCGAGGAACGTGTTCTTCGTCAGGATCGTGACGATCGCGAAGTTGCCGATGACCGACGCCGTCACCGGCATCACGATGTGCCACAGATAGTCGAGCGCCTTGCCGATCAGCGTCAGGTCGTCGAAGTTGTCGGACGTGAGCCCGCGCATCGGGAACAGCTGCCAGAACGTGCCGCCGCCGAACAGCATCAGCAGCAGCACGCCGAGCACGAAGCCCGGCACCGCGTAGCCGGCCAGCACCAGCACGCTCGTCACGGTGTCGAACCGCGACCCGTTGCGCACCGCCTTCGCGATGCCGAGCGGCACCGACACCAGGTACGTGAGAATCACCGTCCACATCCCGAGCGTGATCGACACCGGCAGCTTCGAACGGATCACCGCCCACACGCTGCGATGCGCGAAATAGGACTGGCCGAGGTCGAACGTCGCATAGCTTTTCAACATCAGCACGTAGCGTTCGAGCGGCGGCTTGTCGAAGCCGAACTGCTTGCGGATCTGCTCGATCTGCTGCGGGTCGACGCCCTGGCTGCCGTGGTAGCCGCCGCCTCCGCCGCCCGCCTCCCCGCCGCGCGCACTGCCGTGGCGCAGCTGCGCGAGCACCTGTTCGACCGGGCCGCCCGGCACGAACTGCGTGACCGCGAACGTGATCGTCACCACGCCGATCAGCGTCGGGATCATCAGCAGCAATCGCCTGAGTATGTATGCCAGCATCGTCGTTCCTCCGTCAGGCCGCCGGCGCGGGTTTCGCCGCCGGCTGCTTCACGTACCAGTAGTCGATGATCCAGTCCTCGTACTGATAGGAATCCGGCACGATCGCCGGGTGGCCGAGCGTCGCCTTGTATGCGATCCGCGCATTCGGCAGGTAGTACTGCGGGATCAGCGCGTACAGGTTGATCAGCACTCGGTCGAGCGCGTGCGTGGCCGTCTCGAGATCGTCGAGCGTGTCGGCCGCGAGCGCCGCATGGATCAGTGCATCGACGGCCTTCGACTTCACGCCCGGATAGTTCTCGGAGCCCGGCTGCGATGCGGCCGCGCTGCCGAATCGTCGCGTCAGCTCCGCCCCCGGGATCGTCACCGGCAGGTAGATGTACGTCGTCATGTCGTACTCGAAATTGTCGAGGCGCTTCAGGTAGACCGCACTGTCGATCTCGCGCAGGTAGGCCTGGATGCCGAGCATCGCGAGCGCCTGCGTGTACGGCAGGATCAGGCGGTCCATGCCGGGCTGGTCGTCGATGATCTCGATCGTCATCGGCGTGCCGTTCGCGTCGCGCAGCGCGCCGTCGCGGTAATGCCAGCCGGCCTGCGCGAGCAGGTCGCGCGCTTCCTTCAGGTTCGCGCGCAACGAGTTCGGCGGCAGCGTCGACGGCTGCTGGACCATCGGGCCGAATACCTCGGGTGGCAGCGTCGAACGGAACGGTTCGAGCAGCGCGAGCTCCTTCTCGCTCGGCATGCCGGATGCCGCGAACGGGCTTGCCTCCCAGAAGCTGTTGGTCCGGCGGTATTGCCCGTAAAACATCATCCGGCTCATCCAGTCGAAGTCGAACGCCAGCGCAAGCGCGTGCCGCACCCGCACGTCCTGGAACATCGGCTTGCGCATGTTCATCAGGAAGCCCTGCATCTGCGCGGGGCCGTCGGGAAACTCGCCGCGCTTCAGCATGCCGTTGCGAAAATTCTTGCCGACGTACTTGCGCGCCCACTGCGTCGCGCTGTATTCCATGCGCGCGTCGATGTCGCCGGCCTTGAACGCTTCGAGCGCCGTGTACTGGTCCAGGTACAGCTTGAACGACACGTGCGCGAAGCGGAACATGCCGCGTCGCGACGGCAGGTTCGCTGCCCAGTAGTGCGGGTCGCGCACGTAGCTGATCTGCTTGTCGTTCTTGCGCTGGTCGATCAGGTACGGGCCGCTCGCGATCGGCGGCACGTTCGCGATCTGGTCGAACGGCGGGCGCGTGCCGTCCGCGCGCTGCCCCCACTTCGGCGAGAACACCGGCAGGTCGCCCGCGATCAGTGCGGCGTCGCGCTCCGCGTGCTTGAACTCGAAGCGCACCGTGTGGCCGTCGACGGCCACCGCACGCTTGATGATCGCGAACTGCGCGTTGTACAGCGGCGACGCCTGCGGGCTCGTCAGCGTGTCGAACGAATATTTCACGTCGGCCGCGGTGATCGGGTCGCCGTTCGAAAAGCGTGCGGCCGGGTTGATGTGGAACGTCGCCGACAGCCCGTCGGGCGCAACGTCGACGTCGTCGGCGATCAGCGCGTATTCGGAAGCGAGCTCGTCCCAACTGCGCTGCATCAGCGTGTCGAACATCAGGTTCTTGATATCCGGTGCGGGCGCGCCGCGCACGAGAAACGGGTTCAGCGAGTCGTAGCTCTGCGCTTCGTCGTAGTTTTCGAACTGGAGCGTGCCGCTGTCCGGCGCCTCCGCATTCGCGTAGTCGAAGTGCGTGAAACCCGGCGGATATTTCGGATGGTCGTATTGCGAGATCGCCGGCACGGCGAGCGCGGCCTGCGGCACGACGCCGCCCGACACGAACCACGCGGCGCACGCAAGCAGCGCGGCGCGCGTGGCGGCGCGCATCCGCATGTGCGGGCGCTTCAGGTGCGAGTGGCGCGGCACGCGTGCCCGTCGCCCATTCAACAGTCTCATGGTCAGTCTGTCCGGAAGCGTGCGGGCGAGATGCCTGCGCGCGGTCGTTACCTGTGCCGGACGCCGCGTGCCGACCGCGTCCCGCGGAGGGTTCGTGCTGGTCACGATCGGATAAGCTGCACCGCCGCGCGTCCCGGTTGCGGGCCGTCGTGGCGCCAGGCGAAAAAAGGCCGCCCCGAAAAGGGGGTAGTGGACGTGGAGCGGCCGGTCTAAAGCCTGTCAGTGCATCGCGATCGCCGTCAGAACTTGTACGTCGCGCCGATCTGCGCGAACCGGCCGATGTACGAATACACCGACGTGTCGTACCCGCTCTGGTCCAGCGTGCCGTTCGCGAAGATCGGGTCGTACGGCGGCGTGCGGTTGAAGAGGTTGTTGATGCCGCCGTAGATCGTCCAGTTCTTGAAGCCCGTGTACGACATCATCAGGTTGAACTGGCTGTACGAACCGACCTTCGACGGCGCGGGCGCCGGCGTGAGGTTCTGCGCGTACGGGCCCGTGAACTGCCACGTGAGCGCCGCATCGAACTTCCGGTAGTTCCAGTCCAGCGTCGTGTTGCCGCGCCAGCGCGGGAAGCTGCCGCCGAACGGCTGCGTGATCGTGAAGTTGTTGCCCGCGCCGTTGACCGGCGTGCCGCCCGGGAAGCCGATCTTGTAGCTGTTGATGTACGCCCAGTCGCCCGACAGCGTGACCGTGCCCCAGCCCTTGAGCGGCAGGCCCTGGCGGAACGTGCCTTCGAAGCCGTTGGTGTCGAGGTAGCCGAGGTTCTGGTACGGGATCACCTTGTACAGCAGCTGGCCCGTCGTCGGGTCGGTCGCGATCTGCGACGGCGTGCCCTGGCCGATCACGTTGTCGATGCGGATCTTGTACCAGTCGAAGCCGATATCCGTGTACCGGCTCGGCGACACCTGGAAGCCGATGTTGAAGTTGCGCGTGCGTTCCGGCGCGAGGTTCGGGTTGCCGACGGTGATCGACGTGTAGTTCTGGCCGGTGGCCGGGTCGATCTGGATGCCGAGCGTCTGCGACTTGCTGTCCTCGACGAAGGTCGGCGCGCGGAAGCCGCGGTTGTACGACGTATACAGCGTCAGTGCCTTGATCGGCTGGTAACGCAACGCGAAGCGCGGCGAGAAGGCGCCGCCCACGTCCGTGTAGTGGTCGTAGCGGCCCGCCTGGCTGAACGTCAGGTTCTCGAGGATCGGCACGTTGATCTGGTAATAGACGGCCGCGACGTTGCGCTGGCCATCCACCGTCTCGAGGTCCGGCGTGATGACGGCCCCGCTCAGGTATTCGGAGCCCGGTGTCAGCGTTTCGCTCTGGTGCGTGAACTGTGCACCGAAGCCGATGCCGACGTCGCCCGCCGGCAGGTGGAACAGGTTCGGCGTCGATACCGTCGCGTCGATCGTGTCGAGCTTCGAGATGCCGAGGTTGTTCGCCTCCTGATAAAGCCCGTTGAACGCGTTCGGCGTTGCCGCCGGGTTCGCGAAGTTCAGCGTGCCGTTCTGGTAGATGTTGTTCAGCGCGTTGACGTTCAGCTGGTTCGTGAACACGTTCGACACCGTGCTCTGCGAATGGCTGACCGACGTCGCCCAGTCCCAGTCACCGTACGGCAGCGTAAACGACCCCTTGATGCCGGCGGCCGCACGCCAGTAGTTGGCCCAGGTCTTCTGCGCGACCGTGTTCGGAAATGCGTAAGTAAGCGGCGTGGCCGCGCCTGTCGTGTTGTACGGGTTGCTGACCGGTACCGTGAAATTGAACGGTGACAGCAGCTTCGTCTGCGGATTCCAGACGAGCGCCGGGTTCTGCGAATTGCCGATCACGTTGTTCCACAAGCCGTCGTTGGTCGTGGTCGTGTTGTAGCTCTCGAGAAAATCCGCGAACGCGGTCGTCTTGTCGTCGATCTTGAAGTCCGCATGGAGCTTCGCGTTCAGGCGCTCGGTCATCGGCAGAATCGACGTGCTTTCAGCCGTGTTGTACCCGCAGACCGTGCCCGGCGACCCGGCCGACAGCGAATTCGACACCGCGGGATGCACCGAGCCGCCGAACGGGCAGCCGCCGCTCAGCGCCTGCGCGACGCCGCCGGGCATGTTCCAGTACGACGGCGCGAGCAGCGAGAAGCCGCCCGGCTTGCCCGTGAAGTCCTGGTTGCGCGTCGAATCGCGGTCGGCGAGCGTGAAGCCGTTCGACTTGTAGTAGCTCAGCGCGGCCGTCACGTTGAAGCGGTCCGCGTTCAGGTCGCCGAAACCGCCGAGCACGCCGAACTTCACCGTGCCGTCGCCGTTACCGGCGTTGATCGCGCTGCCGAGGCTGCCGTCGAGCTGCAGCCCGCGGAAATTGTGTTTCGTGATGATGTTGACGACACCGCCGATCGCGTCCGAACCATACTGCGACACGGCGCCCGTCTTCACGATTTCGATTCGCTCGATATCGTTGAGCGGGATCGTGTTGAGGTCGAAGAACGAATCGACGCTGTTCGAAAAGAATGCGTACGGTGCGACGCGCTGGCCGTCGACCATCACGAGCGTGTACTTCTCGGACAGGCCGCGCAGCGCGATGCCGGCCGCGCCTGCCGCAAAGTTGCCCGATTGCCCTTCACCCCAGCTGTTCGCCGAGTTCGCCGCCGCGTCGCGCAGGAAGTCGGTGACGGTCACCGCGCCGCTGGCCTGGATTTCCTTCGGCGTGATCGTCTGCACCTGCTGGAAGCCGGTCTTGTCGGCCTGCCGGATCAGCGAACCCGTCACTTCGAATCGCTTGATCTGCGCGACCTTGCCCTGGCCGTTCGCACCCGGTGCGGCGGGCGTCACCTCGGCTGCCGCGCCGGCCGCTGCGTCAGGTGCCGCGGGCGCCGCGGTGCCCGTGGCAGCCGCGGCACCCGGCGTGCCGCCCGCCTGTGCAACGGTCACGGGCGCGCCGCTCGCGACCGCATCCGCGATCGCGACGGCGCCCGGCGCCGGCTGGCTCTGGGCGAACGCCGGCGGCGCCAGTGCGGCCGTCAGCGCCAGTTCCGCCCAGACTATCCTCTTGATGGCCAACGCCAAAACTCTCTGCTTCATCTTGTTCCCTCTCGCTCGTAGTAAGACCCCACCTGCTGCGCGCCGGAACCTTGTGAATTCCTTGCGCGTCCTGGGTGCCCGTGTACCGGAACCCGCGATTGCCACGCTTTCGACTGCCACGACTCGAACGGATCGGCCTCGGGTCACCCGCACCATCCGGCATCACCGTCGTTTTCTATTTTGATTCGTCATACTAAACACCTACATTGAGCGCATTGCCTGCTTCTTCCCGATTCGACCGACAGCGTGTTTTGCATATGGCGAGCCCGTGCGCGCGCCGTACTGCGCACCGTCTGCCGTCGATCGGGTACTTCCCCCTCCTGCCGACCGCCTTCGCTAGAACAGCCGGGGCGTGCCGACCCAGACGATCACCGCCTCTTCGTCGGCCGTGTTGCGCCACGCATGCGGCATCGTCGATTCGTAATGCGCGGTATCGCCCTCGTTCAGCGTGAACGTGCAATCCTCGAGCGTCAGCGCCACCTGCCCGCGCAACACGTAGACGAACTCCTCGCCCGCATGCGTCGTCATCTCGGACGACAGCTGCCCGGCCGGCATCCGGACAAGAATCGCATCGAGCTTGCGGCCGTCCACCAGGTTCGTCAGCCTCGCGAACCGGCTGGCCGAGTTCGTGAACTGGAAATACTGCAGCGCGTTGCCACGGCATACCGAGCGCGCTTCCGTCGGCGTGTCGATGAAGTACTGCATCGTCACGCCCAGCGCCTTCGCGATCCGCACCAGCGACGTGATCGACGGCGTCGCCCGCCCGCGCTCGACCTGCGACAGGAACGGCTTCGAAATGCCGGCGATCGTCGACGTTTCGTCGAGCGTGAGTTTCAGGCGCTGGCGCAGCGCCCGGATCTTGTTGCCTAGCGACATGGCGACGAGCACCGACTCATCAGGGGGCAAAACCATGGAAAGACGGACCCTCGCAGTCGAAATCGGTTTGATGTCGTCAAGCCTCGACCGGTGCCGTGACACACCCGGCCCGGCGCGATCCCCCGCTGCACGCTGTCGCGGCGCGCACGCCACGCGCGGTGCGGCACGGCAGCCGCGTCGCGCGGCCCCGCAGCACTCGCCTATTTCATTCGGTTTACTAATTAACAAGGATGCCCCTTTTCATCGGCTGACCTGCCCGTCGTGCATCGGTGCGATCGCGGCGGCACACCGCTTGCCGACCGCGTCCGCGTTGCGCGCCGTCGTGCCCTTCCGTTGCGTTACCGGGATGCCACCACCGTCCAGAGGCGTGCCAGGTCAGCGGAACCGTCGGCCCCCTTGACCGTGCGCAGCACCTGGACCGCACGCGCCTTCTGGCCGGCCACGTAATACGCCTCGCCGAGCCGCAGCCGTGCCGCATCGGGATGCTCGAGTCCGCCCTTTGCGATCGCCTGCTCCATCATCGGCAGCCCTTGCTGCGCACGGCCCTCGAAGACGAGGTTCATGCCCGCGTCGATCGGCGCGGCCGGCGTCGTGGCGTCGCCGCCCGATTGCGCGCGCTTCGCGGCGAGTGCCTGCAGCCGCTTCTCGCGGTCGGCCTGCGCGTCCTTGCCGAGTACGCCGGATGCGAACCCCTGGTCGATCACCTGCTTGCCTTCGGCGGGCGACCCCGCGACCAGCGCAAGCTGAGTCATCTCCATGTATGCGTTGGCCGACGCAAGCGAGCCCGTTGCCCGGCGCAACCGGTAGATATCGAGGTCGAGCGACGGCAGATAGCCGGGGTTGCCGCGAATGGCCGTGACCATCTCGTCCCAGTACGCGGGGCTCGGATGGTAGGCCACCAGCAACCCGAGCGCACCGCGATACGCATTGCCGTCCTTCACCTTCTGCGCGCAGGTCGCCAGCATCTGCAACTGCCCTTCGTCCGGTGCGTGTCCGCCATTCGCCTGCGCGTCGGTGCTCGCTTTCAGCTGGCTCACGAGCGGTGCGCAATCGTTTGACAGGTAGTAAGACTGCGTGAGCAGCGTGCGCATTTCCGGATCCGAGCCACCGGCCTTCAGATAACGCTGCGCGACCCGGATCGCCAGCGGGTAGTTCTTCTGCTGGAAGTAGATGCCGGCCAGCGCCGCCGTGGTCCGCTGCTCGTCCTCGCCCGACAGCCGCCCCGAGTTCAGCACGCTCTCGTAGGCTTGCGCGGCCGTGCCCGAGTCACCGGCCGCCATCGCTGCCGCACCGCGCATCTCTTCCACCATATAGGTTTCGTACGGCGTCCGGTTCGGCACGGCAGCTGCCTGCGCGATCTTGCCCAGCGCATCGCGATACTTGTGCGCCCGATACAGCTCCTGTGCGGCCGCCAACGGTTTCGCCACGTCCGGCCGCAGCGCGTCCGCCGGCAGCGCCGGGCGACCGAGCAGGCCTGCCGCGCCGAGCGCCGTCACCGCCATCATCCGTTTCCATCGTCGCTGGTTCATCGGCTCGTCCGTGTCGTCATTGCATGAACTGTTCGTTACCGATCAGGCCGATCTTGGTCGCGCCCACGCGTTGCGCGGATGCGAGCACGGCGGCCACGTCCTTGTACGGCACCAGCTTGTTGGGGCGCAGATGGATCTCGGCCTGCACGGGCTCCGCCGCCACCTGGGTCAGTTTCGACTCGAGCGCCGCGCGGTCCGGCACCGGTGCGCCGTTCCATGTCGTCGTGCCGTCGAAGTCGATATCGATCTGCACGATTTCCGGCGGCGTGGCCGGCGGCGGCGGATTCCCCACCGGCAGGTCCATCTTTACCGAATGCATCTGAATCGGGATCGTGATGATCAGCATGATCAGCAACACCAGCATCACGTCGATCAGCGGCGTCGTGTTGATGTCGACCATCACTTCCGGTTCGGCATTGCTCCCGCCCGAAGGCACATTCATTCCCATCGTTGACTCCTGTCGACCTGCGTTAGTGCCTGAGCACTTACGCCGATCCCACCGTTCTCGGTCAACCAGCGTCAGCGCGCCAACACTTGCCGCCGGCCCCCTGTTCCACCACGATTCCCGCGCGCGACTACCCCCCGCGCGCCGGCGGCTCCGTAATGAACGACAGCTTCGCGATCCCCGCGCGCTCGCACATCGTGACCACCCGTCCGATGAACTCGTAGCGCGTGTTCTGGTCGCCCCGCACATGGACACTCGGCTGCGGCTGCTGCTGCGACACGCCCTTCAGCTTCGCGAGCAGCGTCTGTGCATCCACCAGCTGCTCGCCCCAGAAGAAATCGCCGTCGCGGTTCACCGCGATCTCGATGCTCTTCGGCGTCGTCTGCAGCGGCTGCACCGTCTCCTTCGGCAACTGAAGCTGGATCGTGTGCGTCACGACCGGGATCGTGATCAGGAAGATGATCAGCAACACCAGCATCACGTCGACGAGCGGCGTCGTGTTGATGTTGGCGATCACCTCGTCGCTATCGTCCTGCCCGACGTTCATGGCCATCGCGCGTCTCCGTCAGCCGTTCAGTTCGCGAGCGACGCGGCTGCCGGCGACGACGCCGCGCGTACCGGGCGCCGATTGCCGGCCAGCAGCACGGTATGGAGTTGCGCGCCGAAGTTGCGGACCCGCTCCATCACCGACTTGTTGCGGCGGACCAGGAAGTTGTAGCCGAGCACCGCCGGCACCGCGACGGCGAGGCCGATCGCGGTCATGATGAGCGCCTCGCCCACCGGGCCCGCGACCTTGTCGATCGACGCCTGGCCGGCGATGCCGATGGCCGTCAGCGCGTGATAGATACCCCAGACCGTGCCGAACAGCCCGACGAACGGCGCGGTGGAGCCCACCGTGGCCAGGAACGCGAGCCCGTCCTGCATCCGGTTCGACACGTTCGTGATCGAGCGCTCGACCGACACGTCGATCCACGTGTTGCGGTCCACCGCCTCCAGCAGCGCGTCTTCATGATGCTCGCCGGCCTCGATCGCCGTTTCGGCGATGAACCGGAACGGCGACGCCTCGTCGAGCAGCCTGGCGCCTTCTGCCAGCGACGGCGCGGTCCAGAGCTGCGCGTCCGCCAGCTTCGCGCGGCGATTCGCGCGCAACTGCTCCAGGAACTTCGTGATCATGATGTACCAGCTGCCCATCGACATGATCACCAGCAGGATCAGCACGAAACGGGCGACGAAATCGCCGTTCTTCCATAGCGCGCCAAGCCCGTACGGATTCTCGACTGCCTCCGTCGTGGCCGGCGCGGGCGGCGGAGCCGGCTCGGCGGCGGCCGGAACCGGGGCCGGTGCAGCCGACGGTGCCGCGGCCGGTGCCGATGTGCCGGCCGCGCCGCTTGCCTGCGCGTGGGCGAGTTGCGGTGCAACGAACCCGTCGATTGCGGCGACGGACATCAGGATGCTTGCCGCCAGCGCGGCCAGTGAACGCTTCGTCATACCCCACTCCAGTTATTTCGGTGAACTTAATACGTCAGTTCGGGATGGTTTTTCCCGGCCCGGATCGCACCGGGCCGCCCACTGCATCAATTCAGATTGAACGAGAACGGAACCTGGACCCGTACCGACTGGCCCTGCGCGACGCACTTGAATCGTTTCACGGTGTTGTAGGCCGCGCGATCGAGGATCGGGTCCGCCGACTGCGCCACGCGTTCGTTCGTGATGTTCCCTTCCGCATCCACGACGAATTCGATCGTCACGTCGCCCGTGATGTTGTTTTCCTGCGCTTCCTTCGGGTACTGGATCGACGCGCGCAACGTGTCCGAATTCGGGCAGACCACCCCCACGTCATGGCTGACGGGCTTGGCCGGCGCCGGGGGCGCCACGACCGGCGCCTGCACGGCCGGTGCGGACGGCACTGGAGCCGACTGGTGCGTGATCGTCGCCTGCGGCGGCGCCTGCACCGGTACTTCCGGCGGCGGGACGAACGGCGGTGGCGGCGGCGCGAATTTCGGCGGCGGGAGTTTGACCACGGGCATCGGCGGTGGCGGCGGTGGCTTGACCGGCTCGATGATGCGGGTTTCGATCGGGTGCTGGATCACCTGCACGACTTTCGTCGCGAGGCCGTTGAGCAGCGCGTAGATCAGCACTGCATGCAGCACCAGGACGATTCCGATGCCGCCGAACCGGCGTACCGGATTCTGTTGCTTCTTGCCGAACTCCCGCGGACGGCCCAGCGTAGCCAGTCCGCTGTTCGACGTAGCCAGATGTTCTTCGACTTTCATGCCCACCATCCATCCCCCAAGCGATTCGCCCAGTTTCGGATCGCTGCCGGCGGCAAAGCCAGGTCGGCATCGCCACCGCATCAACCCGAACATTTCATTTCAAATTCAATTAGCAATCCGAACTGAAAGAATTACATTCGAATTCCAATTCATTCTTTTTCAAGGAATCGTTCCGGCCGGCGCGGCCCGGTGTCTCTCCGGAAAACCACGCGCCTGCACGTTCAGGCAATTGTCAGCTGAGCGGGCGGCACCTTACATTTCGTATGACGATCCGCCGTTCGTTACACGGTGATGTCATTTGCGCCGTTCACTTGGGTGCAGAAGATAGCAAGGAAAAAATTTTGACGTCAAACTTTATTTGATACCCGAATACAAAGTTTGGAGGGTTTATTCAAATAATATTCCGGTAAGCACAAATCAAACTTTTATTATTAAAACGGTAATTGAATTGTATGGATATATGTGAAAACGGATATTAAGTTGTTTCAACATAAATAGGCGCATATTTCCTGCACTGCATCAAAAACAGCCGCCACGGAAGCAACCGGGACGGTGCGCCCCCTCCTCTCGTCAACGTCAGCATCGGTCGCGCAATGCGCTGTTCAACCACCTCGAGCTGGCCGAACGGCGATTCCTTGAAAGATCCAACACACCCTGGATATCCGAACGGTTAATCCGGAATGCGATTGAAAATTACGGACGATACGCCGACCAACGATCGACACGATAATTTCGCACATCGAATTAACTGTCGCCACCATACCCACTGCCGCGACACACAACCGTGTCTCACGGGCACCGGCCAGCGCCGACGCACCACGCGCGTCAGGAGTGCATGCCCTGCATCTTGTGCGAGAAATGCGCGCGTTCGGCCGTCTCCATCAGCATCAGGAATTCATCCTGCATCATGTGGATCAGCGCCTCGTGATCGCCGGCCTCGAAATAGACTTCCGGCTGCTGCGCGAGACGTTCCTCGAGAAACGTCTTCATCCCGTACGCCATGCACACCGGCGGCAACGCGCCCATGTCGCAATCCTTGAACAACTCGCGCAGCTCGACCTCCCGGGCGAGCACGAGATGGCGTCCCGTCTTCACCCAGAGATCCGACAGGCGCACGGCGTACGTGGTCGGCAATACGGCGGCGACGTAGCCTTCGTCGTCTTCGAGGAGCACGGTTTTCGCGAGGCGATCGCCGGGAATATGCGCGGCGGCGGCCGTCTCCATGCTCGTATGGCTATAGGGGTGGTACACCACTTCGTACCGCGATGACTTCTGACGCAGGCAATCCTGCAGGGTGGCTGACACAGGCATGGCACACCTCGTCTGGGCGAAACGGGCGAACGGCGTTCGCACCGGATTCGTTCAGCATAGGTCGCATTCCGGCGGATGGAAATCCTCACCGCTCCCGCACCTCGCACGCGCTCAACGCACGGGTTCAGTCCCCCGGATCCAGGCCCTTTTCGCGCAGCCATTCCCGATTGAAAAGCCGCGCGCGATAGATGTCGCCACGATCGCACAGCAAGGTCACGATCGTGTGCCCCGGGCCCATCCGGCGGGCCAACCAGACCGCTGCCGCGACGTTGATGCCGCTCGATCCGCCGACATACAGCCCTTCCTCGCGCAGCAGCCGGTAGACCATCGTCACGCACTGCTGATCGTCGATCCGCACCGCGTCGTCGATCGGCGTGCCCGCGAGGTTCATGGTGACGCGCGTCGATCCGATGCCTTCGGTGATCGAGCTGCCTTCTGCGCCGAGGTCGCCCGTTTTCACGTACCCATAGAGCCCGCTGCCGCACGGGTCGGCCAGCACGATCCGGACGTCCGGATTCTGCTCTTTCAGATAGCGGCTCACGCCGGCCAGCGTGCCGCCCGTTCCGGTCGCGCAGACGAACGCATCGATCGTGCCCGCCGTATCGCGCCAGATTTCGGGCCCGGTCGTCTCGTAGTGCGCCTGACGATTGACGACGTTGTCGAACTGATTGGCCCACACCGCGTTGTCGAGTTCGTCCGCGAGCCGCCCGGCAATCTTCTGATAGTTGTTCGGGTCACGGTAAGGCGCCGCCGGCACCGGGCGCACCTCGGCGCCGAGCGTGCGCAGGATCGCCAGCTTGTCCGGCGATTGTGTGTCAGGAATCACGATCACGCAGCGATAGCCGCGCGCCGCGCAGATATGCGCCAGACCGATGCCGGTGTTGCCCGCCGTACCCTCGACGACGGTGCCGCCCGGCTTCAGCACGCCACGCCGCTCGGCGTCGCGGATGATGTACAGTGCCGCGCGGTCCTTCACCGACCCGCCCGGATTCATGAATTCCGCCTTGCCCAGGATCTCGCAGCCCGTCTCCGCGCTGAGCTTCGTCAAGCGAATCAGCGGCGTGCGCCCGACGCAGTCGACGAAGCCTTGTCGCACGTCCATGTCAGCCTCCATTCCTTCGTGCAGCGCTCGCGCCACACACAACGAAGGATAGGCCGCCCTCGCGGCTTTCGTCGCCGGCCGCCGCGTATTAAGCTTGAAGAGCCCATTGCGGCGCACGCGCCGCATCGCGAGACGAGGTCCGCCATGCTGCAGCTGCACACTGTTGCCGTCGACAAGCCCGAGACCATGAACTTCATCCTCGGCCAGTCGCATTTCATCAAGTCCGTCGAAGACATCCACGAGGCGCTGGTCGGCACCGTGCCCGGCATCCGCTTCGGGCTCGCATTCTGCGAGGCGTCGGGCAAGCGGCTGGTGCGCCACTCCGGCACCGATGCGACGCTCGTCGAACTGGCCTGCCGCAACGCCAGCGCGATCGGCGCCGGTCACGCGTTCATCGTGTTCCTCGGCGACGGGTTCTACCCGGTCAACGTGCTCAATGCGATCAAGGCCGTGCCGGAGGTCTGCCGGATCTATTGCGCGACCGCGAACCCGACCGAGGTCATCGTCGCGGAAACCGCGCAAGGACGCGGGATCGTCGGCGTCGTCGACGGCTTCGGGCCGCTCGGTGTCGAAAATGACGACGACATCCGCTGGCGAAAGGACCTGCTGCGCACGATCGGCTACAAGGCCTAGCGACAGACTGCCCGGCCCCGGCCTGAAGACCTGGCTGCATCGAAGGTGCGATGCGGCCGAGATTTCAGGTCGAATGGCGCCTAGACCGTTAGTTGCGTTTTTGACGCAGGTACACGCCGCCATGCGTGTCGTGTCGCTGCCGTCCGCCTGCCCTATACTGCCCGCAGCAGACTGACTGACGGCGCGCTGACGCACCGGCCGCCACGATTCTTCCCGGAGCCACCATGCATTACCAGTTGACCTACGAACTCGTCGACGATTACCTGTCCCGTCGCGAACCGTTCCGCGCCCAGCACCTCGCGCTTGCGCAGGCTGCGACCGAGCGCGGCGAACTCGTGCTCGCCGGCGCACTGTCGGACCCGGCCGATCAGGCCGTGCTCGTGTTCGAAGGGGATTCGCCGGAGGCGGCCGAGTCGTTCGCGCGCGCCGATCCATACGTGCAGAACGGCCTCGTGAAATCGTGGCGCGTGCGTCCGTGGCGCGTCGTGATCGGCAAGCACGCGCCGCCCCGCGCGTGAAGCGAGACTACAGCCACCCTGCCCGCTTGAATCGCCACCACAGCGCGAAATCGGCGACGACCATCGCGGCGATGCAACCGTAGAAACCGTATTTCAGGTGCAGTTCGGGCATGTTCGCGAAGTTCATCCCGTAGATGCCGGCGATCATCGTCGGAATCGCGAACAGCGCGGCGAACGAGCCGAGCCGCTTCGTCACCTCGCTCTCGGCCAGCGAAATCATCCCGAGATTGACCTGGATCGCCGTGACCACCATCTCGCGCCGCCCTTCGATCGTCTTCACGATCCGCTGCAGATGGTCGTACACGTCGCGAAAGTACGGCTCCATCCCGCTGCAGACCTGCGGAATGCGCCCGCCGGTGAGCTTCGCAAGCGGCTCGATCAGCGGCGCCGTGTGCTGGTACAGCATCACGAGCCGGCGCTTCAGCGAATAGAGATCCTCGATGATTGCACGCGACGACGCGGGCGTGGCCTTCGCGAAAATGCGGTCCTCGAGTTCCTCGAGCTCGGTGCCGAGCGTTTCGAGGATCGGGAAATAGCGGTCGACGACCTGGTCCATCAGCGCGTAGAACACGAATGCCGAGCCTTCCTGGAGCAGATGCGGCTCGCGCTCGCAACGCTTGCGCACGTCGCGGAAATCCTGCTCGGTATGGTTGCGGATCGACAGCACGTAATTCGGTCCGACGAACACGTTCAGCTCGCCGACATTCAATTCATCGTCGTCGTCCAGCTCGACCATATGCAGCACCGCGAACAGCGAATCGCCGTACTCCTCGATCTTCGGTCGCTGATGCCCCTTGCGGGCATCCTCGAGCGCCAGTTCGTGCAGCCCGAACTCCTCGCCCATCAGGTCGAGTTCGGCAGGCGTCGGATCCTTCAGCGCGACCCACACGAAGCATTCGGGCTTCGACACGTAGTCGCTGATGGCGTCGATATCGATGTCGGCCAGCTTGCGGCCGTCCTGGTATGCAGCACAGTTGATCAGCATGTTGTCCGTGATGTTGCGGTTTGCGCATCAGTTTACCGGTTTGCTGCGCGGGCGGCCCGGCTTGCGCGGCACGGCGATATTGGTCATGATCGGGCTGCGTGCGCCGCAGCATGCGCACCCGGACACTTCGGACCACAAGCCGCACGGCCGTGCGGCCCAACCGTCAAGGAGACTGACCCATGTGGTATTTCTCGTGGATACTCGGCATCGGCGTGGCGCTCGGCTTCGGCATCATCAACGCGATGTGGCTGGAAGCGGAAGTCTTCTCGCGCGGCGACAAGCGCAACGGCGCGTCGGGCCCGCGCCCGGGAGGCAAGGCTTCGTGACGCACCTGGTGTTCTTCTGCGGTCACGCCGGCACCGGCAAAACCACGCTCGCGAAGCGGCTCATCGGGCCGCTGATGCGCGCGACCGGCGAGGCCTTCTGCCTGCTCGACAAGGACACGCTGTACGGCCGCTACAGCTCGGCCGCGATGGGCGCGCTCACCGAGGATCCGAACGATCGCGACAGCCCGCTCTACCTGAAGCACCTGCGCGATCCCGAATACCAGGGCCTGCTCGACACCGCACGTGAGAATCTCGCGCTCGGCATCGGCGCGCTCGTGGTCGGCCCGCTGTCGCGCGAAGTGCGCGATCGCCGCATTCTCGACCGTGCGTGGCTCGGCATCGCGCCCGACGTCACGCTGACGGTCGTCTGGGTCCACACGTCGGAAGACGTCGCGCACCAGCGCATCGTCGCGCGCGGCAACCCGAACGATGCCTACAAGCTCACGCACTGGGACGAATACCGGCAGCGCCGCTTCGTGCCGACCGGCCACGAATGCGACGGCATCGTGATGTTCGACAACACCGCGCCATCCGATGCCGACGTCGACACATTGCTGTACCGCATCGCGCCGCCACCGCCCGCGGCCACGATCGTCCCGCCGCTGCCGGCCTGACCGCCGCCCTTTTTTCGTTCCCCGCAAAAAACAACGCCGGAAGCGCTGCTGCGCTCCCGGCGTCGTACGGCCCTCGTGCCTGAGGCTGGCGGGTGGTCAGGCCACCGCGTGCACGCGCTCCATCGTGCCGCCTTCGTACAGCTTGCCGAAGCGGTTCGACAGGAACGCCTTCAGCGACACCTTTTCCTGCGGGATGAAACCGCTCTGCGGCAGCTTCTTCTCGCGGAACAGATCCAGCACCGCGCACATCGCGCCGGCCGTCGTGATCTGGATCGCGCTCATGTGCATCCCGCACACGTCCTTCGCGAAAATCTTGCGGGTGAACACGTCCTGCACCAGCTGGCCGTCCTTCACGCCCGTCACCGTGATGAACACGAGCACGACGTCCTGCTTCGTCGACGGCACCGCACGGCGCATGATCGACTTCAGCGTGTCACGGTCGGTCGACAGGCGCAGGTCTTCGAGCAGGAACTGCACGAGATCGCGGTGGCCCGGATAGCGCACCGACTTGTAGTCGAGCGTCTCGACCTTGCCCGACAGCGTTTCGCACAGCGTGCCCAGGCCGCCCGACGTGTTGAACGCTTCGTATTCGGTGCCGTCCAGCGAGAAGTGCTCGAGGCCTTCGAGCGGCTGCACCCATTGCTTGCGGCCGTCGCGGATCGCTTCGCACGGCTGGCAGTATTCGTTGATCAGGCCGTCGACGCTCCACGTCAGGTTGTACTTCAGCGCGTTGGTCGGATACTCGGGCAGCGCGCCGACGCGCATCTTCACGTCGCGCACTTCGCTGAAGCCGTTCACCAGTTCGTGTGCGGCGAGACCGATGAAGCCCGGTGCGAGGCCGCACTGCGGCATGAACGCACGGTCGGAGCCTTCGGCCAGTTCGCGGATCGCGTGGGTGGCGCGCACGTCTTCGGTCAGGTCGAAGTAGTGGACGCCGGCAGCCTTCGCGGCGGCGGCCACGTTGACCGCGAGGTAGTACGGCAGTGCGTTGACGAGCGCATCGAAGCCCTTCACGGCTTCGCGAATCGCGTTCGCGTCGGCCGAATCGACCCGTTGCGTCGCGATGCCTTCGCGCGACAGCTTCGCAAGCGCATCCGCATCACGGTCGAACGCGACGACTTCGTAGTCGCCCGTTTCACGCAGCATGTGAGCAATGGTGTGGCCGATCAGACCTGCGCCGACGATGGCGATTTTCATGCGCTTATCTCCTGATGATGGGTTGGTGTTGTGAACGGCGTTGAGCCATGAACACAGTTTAGGGACGCTCAAAATCAGTTCCAAGACGAAGAATGATGCGAAACGACCGATAATTTCGACGTTCCGACGACACATTTCGTCGAAACGTCGAAACGACGTTTTTTTCCCGCTGCCGGCCTCGCTATGGCTGGCCTAAGTCTCGGGTTTCAGACTGCATGCCGACCACCCGATTGCCAAGGAGGCATCGCATGCAATCCCTGCCCGCCCATCGCCCGCCGCACTGGCTGAACAAGGTGCCCGACGTCGCGCTGTCGTTCTGGATCATCAAGATCATGTCGACCACGGTCGGCGAAACCGGCGCGGACTTTCTCGCGGTCAACGCGGGGCTCGGCCAGACCGTCACGCGCGTCGCGATGGCGTCGCTGCTCGCGATCGCGCTGTGGCTGCAGTTGCGCACGCGCCGCTACGTGCCGTGGATCTACTGGCTCACGGTGGTGCTGGTCAGCATCGTCGGCACGCAGATCACCGACCTGCTCACCGACGGCCTGAACGTCAGCCTCTATGCGAGCACCGCCGCGTTTGCCGTCGGGCTCGCCGCGATCTTCTTCGTGTGGCATCGCGTCGAAGGCACGCTGTCGATCGACACGATCGTCACGCCACGGCGCGAGCTGTTCTACTGGGCCGCGATTCTCTGCACGTTCGCGCTGGGCACCGCCGCGGGCGATCTCGCCACCGAGGCGCTCGGTCTGGGCTTCACGCTCGGCGCGCTGTGCTTCGGCGGTCTGATCGCGGCCGTGTTCGCGGCGTGGCGGCTCGGTGCCAACGCGGTGCTGGCGTTCTGGATCGCTTACATCCTGACGCGCCCGTTCGGCGCGTCGCTCGGCGACCTGCTCACGCAGGCCCGCACCTATGGCGGGCTCGGCTTCGGCGCCGCCTGGACGAGCCTGCTGTTCCTCACCGTGATCGTGCTGCTCGTCGCCGTCGCGCAGTTCGGCAGCGGCCCGCGCACGCGTGCCGGTACCGCCGAATGACGCCCGTTCCTTTCCGTTCTTCCCATTCATCGAATCGAGGACTCGCCATGCGCATCCCCCATCCCGTTTCCCGTCTCGCCGTTGCGGCGGCCACGATTGCCCTGCTGAGCGCCGCCGCCGGCTGCTCGAAGCAGCAGGACGCGAACGCAGCGACGCCGGTGCCCGCCTCGGCCGCCACATCGCCCGCGACGGCCCACGCATCGAAGCTCGGCGACCTGTCGCAGTTCCGCACGATCGCCGTCGACGTGAACACGCTCGTCGCCAAGGGCGACCTGCCCGGCGCGAAAACGCGCATCAAGGATCTCGAGATCGCGTGGGATTCGGCCGAAGCCGGCCTCAAGCCGCGCGCCGCCGCCGACTGGCACATGGTCGACCAGGAGATCGACCGGGCACTCTCCGCATTGCGCGCCGATCATCCGACCCAGGCCGATTCGGCCGCCGCGATGAAGAACCTGCTCGCGGCCCTCGACCGGTTCAGCGGCAAATAACGATCGCAGCGCACCGCGGCACGTGACGCACGTGCGGGAATCGCATACGCTGGCGGCGTTTGCGCGGCACGGAGAACACGCATGCGAATACTGCTCGTCGAAGACGACCCGATGATCGGCGAGGCCGTCCATGCCGCGTTGAAGGACGCGTCGTACGCCACCGACTGGGTCACCGACGGCGTGCGCGCGTTGACCGCGTTCGCCGCGCAGCCTTACGACCTGGTCCTGCTCGACCTCGGCCTGCCCGGCCGCGACGGGCTCGACGTGCTGGCTGCCATCCGCGCGAAGGACGCCAGCGCGCCGCTGCTGATCGTCACCGCACGCGACGGGCTCGACGATCGCCTGGCCGGCCTCGACGGCGGCGCCGACGACTACATCGTGAAGCCGTTCGAGATGGCCGAGCTGCTCGCGCGGATTCGCGTCGCGATCCGGCGCCGGGCCGGTTCGGCTGCGCCGCTGCTGAGCAATGGCATCGTGTCGCTCGATCCGGCCACGCGCGAGGCCTCGGTCGACGGGCACGCACCGGTGCCGCTGTCGAATCGCGAATTCGCGCTGCTGCGCGCGCTGCTGGTGCGCCCGGGCGCAATCCTGTCGCGGCGCGAGCTCGAGGATCGCCTGTACGGCTGGGGCGAGGAAGTCGAAAGCAATGCGGTCGAATTCCTGATCCATTCGCTGCGCCGCAAGCTCGGCAGCACCGTGATCAAGAACGTCAGGGGTGCGGGATGGATGGTTTCAAGAAGCGGCTGAACGAATCGATCGGCTTCCGCCTGTCGGTCGCGCTGTCGGCCGCGATCCTGGTCGTCGCGACGGCCGCGGCCGCGTTCGCGTTTTCATCGGCATTCGACGAAGCGCACGAACTGCAGGACGACGTGCTGCGCGAGGTCGCGACGCTGCTCGACCGCGAGCACACGCCGTCGCTCCATGCCGACGGCACCGGGCCCGCGCGCGAAAGCGACGAGCTGTCGCGCGTGATCGTGCAGCCGCTCGGCGGCACACCGCAGCCAGGCAGCGACGGCACGCCGCCGCTCGCGCTGGCGCCGACGCTCGCCGACGGGCTGCATACGGTCGATACCGGCGGCAGCACGTATCGCGTGATGGTCCGGACCTTCGCGAACGGCGAGCGCATCGCGGTCGCGCAGGAAGCCGGCATGCGCGACGATGTCGCCCGCGAAAGCGCGTGGCGCACCGCATTGCCGCTGCTGATTCTCGTGCCGATCCTGCTGCTGCTCGTCGCCGATCTCGTGCGCAAGCTGTTCCGCCCGGTGGCCGTGCTGTCCGCCGGCATCGATGCGCGCGACAAGCACGACCTGCGCCCGGTGCCGGCCGAGCGCGTGCCGGTCGAGGTGCGGCCGTTCGTCGTCGCGATCAACCGGATGCTGGAGCGCGTCGCGCAGTCGGTTGCCGCGCAGCGCCGCTTCGTCGCCGACGCCGCGCACGAGTTGCGCTCGCCACTCACCGCGATGTCGCTGCAGGCCGAGCGCCTCGCCGAGGCCGACCTGCCGGACGATGCGCGCGCACGGCTCACCGCGCTGCGCGGCGGCCTCGACCGCAGCCGCCACCTGATCGGGCAGCTGCTCGCGCTCGCACGGGCGCAAAGCGCGCCGGCCGCGCCGCCCGGTAACGTGTCGGTCCATGCGGTCTATCGCCGCGTGCTGGAAGACCTGATGCCGCTCGCCGACGCGAAAGCCATCGACATCGGCATCGAAGACGGCCCGGATACCTCCGTGCCCGTCGACGAGCTCGAACTCGTATCGCTCGTGATGAACCTGGTCGACAACGCGATCCGCTACACGCCGCCGGGCGGACGCGTGGACCTGTCGACGCAACGCACGGACACGCACGCATGCGTGACGATCGCCGATACCGGCCCGGGCATCGCGCCGCACGAACGCGAGCGCGTATTCGATCCGTTCTACCGGGTGCCGGGCAATGCGCAGATCGGCTCCGGGCTCGGCCTGTCGATCGTGAAGATCGTCGCGGACCGGATCGGCGCCGGCATCACGCTCGCGTATGCGGACGAAGCGGAATCGCGCGGGTTGCGCATCTCGGTGCGGATTCCGCTGGTGCAGACGCCTGGCGCGACGCAGCCCGACAGCGCGGCCGATACGCCGGCGCAACACGCACCGCGCGACGGAGGCCATGCTTGAACGTCAGCCGCCGATTGTCCCGCGATCGATCTTGCGCGACAGGATGATCGACGTCGTCGTGCGTTCGACGCCTTCGAGCGTGCCGATCTGGTCGAGCAGGTCGTTGAGCCGTTCGGGCGAATCCGCGCGCAGCCACGCGACGTAGTCGTACTCGCCGCTCACCGCGCACAGCAGCTGCACCTCGGGCATCCGGTCGAGCTTGCGCAATACGTCCTTGCCGAACTTCGGCGTGAGGATGATGCCGACGTACGCGTAAATGCTCGCGTCGAGCACATCCTGCCCGAGCCGGACGCTGTAGCCGGCGATCACGTTGGTGCGCTCGAGCCGCGCGATGCGGGCGACGACCGTCGTGCGCGCAACGTCGAGCTGGCGCGCGAGATCCGCAACGCTGGCGCGCGCATCAGCCTGCAGCAGCGCGACGAGTTGCCGATCGAGATCGTCGAGTTGGTCGAGGCGGGGTGGACGCATGAGGCTGGGCCGGGGCACGACGCGCCGGCTGAGTGGAAGGTGACGCGATGATAGCGCCGAACGCCGGCCCGCCCAAACGCCATCGCCCGGCGCCGAGTCGAACCTTTCAATTTCGATTCGTATCGGATGTAAGCAAGTGTCGCATTCACTGCTGCGCCGCACGCAACGTGCTACTAATAGCAGGCGACGCGAGAAGCGCGCTTGCGGCAGTTGCCGCCGCGCCCGCTCGCAAGACACCACCCCGGAGAGCCAACCATGAAGAAAATCTCGCTCACCCTCGCTACGCTCGCCGTTGCAGCCAGCGCGTTCGCGCAAACCCCGGCACAGCCGCAGGCGCCCGCCCAGGCCACGACGGCCGCGGCTTCGGCCCCGAGCGCCGAGCAGCGCGAGGCGCGCCACGAAGCGCGCGTCGAGCAACGTATCAAGTACCTGCACGACCAGCTGAAGATCACGTCGGCGCAGGAACCGCAGTGGAAGACGTTTGCCGACACGATGCGCGACAACGGCGACACGATGGGCCGCCTCTATCGCGAACGGATGGCCAAGCACGATGTCTCCGCACTCGACGACATGAAGCAGTACGCGGAACTGTCGCAAGCGAACGCCGACGGCGCGAAGAAGCTCGCCGACTCATTTGCACCGCTCTACGAGAGCTTCCCGGCCGACCAGAAGGCACTGGCCGACACGACGTTCCGCAGCTGGCTGCATCACGGCGGCGAACACCGCGGCAAGGGCAAGACGAAGGGCAAGGAAGGCAAGGCAGCAGCCGCGCCGGCCGCCAGCGCACCCGCGCAGCCCTGACGCCCCCTCCGCGCCGGCGCCGTGCCGCCCGTCCGGGCCGAACGGCGCCGCGCCGCTTCAGGCGACCGGCGCGTCACACGGCGCGCCGGATTCGGGATAAGCTCCCGGCTTTTACAGCTTTCCCGCCCTCTCCCGCCATGCTCGAACTCAAGCAAATCGACCTGCGTACCGATCCGCAGGCCCGTCGCGTCGTCAAGGACGAAACCGTCACCGTTTCCTTCGCAGAGGCCGACGGCGAACTGATGAGCCTCGAAGGCCCGAACCGCTACGTCGCGGGCGACGCGCTGATCACGGGCTCGACCGGCGACCGCTGGGTCGTGTCGCGCGCGCGCTTCGACGCCAAATACCTGCCGGCCGATCCGGCACTCGCACACGGCGCACCGGGCGCCTACCGCAACCTGCCGGCCGTCGTGCTCGCCCGCCGGATGGACGAGCCGTTCACGATCGCCCGCTCGGAAAACGGCGACACGCTGCGCGGCGTCGCCGGCGACTGGGTGATGCAGTACGCGCCCGGCGACTACGGTGTCGTGCAGGCGCAGCGCTTCGCGCAGGTCTACCGCGACGCGTAACGCAGCGCGCCGGCATTGCGCCGGCGCGAACGATCAAGCAAAGTCTTCGTCGAGTTCGAGTTCCGCGTCGCGTTCGACGGCCTCGCCGGCCGCGTGACGCTCCTCGAGCGACCCGAGCATCGCTTCCGTATAGGCATGCAGCACCGCGTGGCTGCGTGCCCGTAGCATCGGCTTGAGCGCCAGGTAGCGCGCGAGCGCGGCCGGCACGATCCGGATATCGAGCGTCATCCGCTTCGGCGCCGAGCCGAAACGCATCGCCAGCCACTGCACGGACAGGAACCGCCCGCGTGCGTCGGTGTTTTCGGCGAAACGCGTCTGCTCCGGAAAGAGGTCGCAGATCACGCGTGCGAGCGCGTCGAATTCCGCGCTCTGGCATTCGATCAGATAGTCATCCATCCCGTCTCTCCCGTCAGGCTGCCGATTGCGACGGCCGGCATGCCTTCACCAGCACGACGACCAGGATCGCGGCCAGCACGAAATACACGGCCTCGACCCAGCCGGCCGGCAGCACGCGCACCTGGTACAGCAGCGCCGGTGCGGCCGCGAGCGCATGCAGCACGATCGCGAGCGGCAACACGCCCGCGCGGCCGGCACGCACGCCGCGCCAGACCAGCACCGACAGCGCGAGCTGCAGCACGAATGCCGCGCAGCGTTCGAGCAGCAACAGCAGGATCGACTGTGCCGACAGCGTCGCGAGCATCACGTGCAGCCGCACGACGGTATCGCCCGGCAGGTCGGCGAGCTGGGTCTCCAGCTGGCCGCGGCTCGCGAGCCACGCGAGATACGACCACTGGCCCCACACGAGGACGCCGACGAACCATGCTTCGGCGCCACCGTGACCGATCCCGTAGCCGATTCCGCGGCCGTCGCCGGCCGACGGGCCATAGCGGCGGTTCAGGAAACGCATCCCGAGATAGCGGCCGACTTCCTCGAATACGGCGGTTGCGAGCGCGCTGTACGCGACGAACGCGAGCGGCTGCGTGAGCCAGCCATCCGGCGGCGTCTGGCTCAGCACGAGCCCGTGGAACGCGCGTTCGAGGATCATCGCAAACAGCGTGAAGACGGCCACGCCGACGATCGTGTCGCGGCGGTTCAGCGCCAGCGGCTTGCGCAGGAGACGGAAGAGGACGAGCGGCAGGAGCGCGATGATCAGCGTGGCGGCGATCAGCACCGCCAGCGTGATGGGGGCGACAGTCAT
Protein sequences of DBSCAN-SWA_3 >NC_007510|2004235:2059218|2035737_2037597_-|WP_041493083.1|DBSCAN-SWA MRMRAATRAALLACAAWFVSGGVVPQAALAVPAISQYDHPKYPPGFTHFDYANAEAPDSGTLQFENYDEAQSYDSLNPFLVRGAPAPDIKNLMFDTLMQRSWDELASEYALIADDVDVAPDGLSATFHINPAARFSNGDPITAADVKYSFDTLTSPQASPLYNAQFAIIKRAVAVDGHTVRFEFKHAERDAALIAGDLPVFSPKWGQRADGTRPPFDQIANVPPIASGPYLIDQRKNDKQISYVRDPHYWAANLPSRRGMFRFAHVSFKLYLDQYTALEAFKAGDIDARMEYSATQWARKYVGKNFRNGMLKRGEFPDGPAQMQGFLMNMRKPMFQDVRVRHALALAFDFDWMSRMMFYGQYRRTNSFWEASPFAASGMPSEKELALLEPFRSTLPPEVFGPMVQQPSTLPPNSLRANLKEARDLLAQAGWHYRDGALRDANGTPMTIEIIDDQPGMDRLILPYTQALAMLGIQAYLREIDSAVYLKRLDNFEYDMTTYIYLPVTIPGAELTRRFGSAAASQPGSENYPGVKSKAVDALIHAALAADTLDDLETATHALDRVLINLYALIPQYYLPNARIAYKATLGHPAIVPDSYQYEDWIIDYWYVKQPAAKPAPAA >NC_007510|2004235:2059218|2041921_2043112_-|WP_011352156.1|DBSCAN-SWA MNQRRWKRMMAVTALGAAGLLGRPALPADALRPDVAKPLAAAQELYRAHKYRDALGKIAQAAAVPNRTPYETYMVEEMRGAAAMAAGDSGTAAQAYESVLNSGRLSGEDEQRTTAALAGIYFQQKNYPLAIRVAQRYLKAGGSDPEMRTLLTQSYYLSNDCAPLVSQLKASTDAQANGGHAPDEGQLQMLATCAQKVKDGNAYRGALGLLVAYHPSPAYWDEMVTAIRGNPGYLPSLDLDIYRLRRATGSLASANAYMEMTQLALVAGSPAEGKQVIDQGFASGVLGKDAQADREKRLQALAAKRAQSGGDATTPAAPIDAGMNLVFEGRAQQGLPMMEQAIAKGGLEHPDAARLRLGEAYYVAGQKARAVQVLRTVKGADGSADLARLWTVVASR >NC_007510|2004235:2059218|2029157_2030519_-|WP_011352148.1|protease|DBSCAN-SWA MTGFNPMARQAGIDWHAHFTRLADDAQRLTTPTETVLLGFSSETSDFIRFNGGKIRQTGRVTQGRLSVRLVDGARQAHSALTVCGDPAADLRELADAVAVLRDGLRDAQDDPHLLFDTSSWLQETRRTGRLPNADGLARIVAECARGLDFVGFYAGGTLARGFASSTGSRGWYEVENFNFSWSLYDPSGRAIKTVYAGDDWSDAAFAAKVDAAAARLPVLGRTPKALAPGHYRAYFAPDAVHEMTSLLGWSGFSARAVASARSPLHRLYAGEVALDPRVSITEDFSLGIAPAFNADGYRRDSVPLVVEGRGAGQLVCARTAREQGGTPNGADNGESPQSLTIAGGTLADADVLAALDTGLYVGNLWYLNFSDPIACRMTGMTRFATFWVEGGRIVAPIDAMRFDDSFYRLFGAELEQLGAAPVLLANDDTYGERATGGAQLPGLLARSFELTL >NC_007510|2004235:2059218|2011584_2012091_-|WP_011352136.1|DBSCAN-SWA MTDKNEMVSAGDEPMREILDVLLADDENITARAVARLHPSIKAASSITRSESRSRLLAEYQQRQSEYRRWRGRVAKRSGTDTAASLADKDIRIAELEATVQLLTASHLAMLRAVGELGGFSKWAKFYEAYRDARDKLVGIGAVPSATVDSLPTEKSKNRRSTIRQEKR >NC_007510|2004235:2059218|2056185_2056650_-|WP_006481526.1|DBSCAN-SWA MRPPRLDQLDDLDRQLVALLQADARASVADLARQLDVARTTVVARIARLERTNVIAGYSVRLGQDVLDASIYAYVGIILTPKFGKDVLRKLDRMPEVQLLCAVSGEYDYVAWLRADSPERLNDLLDQIGTLEGVERTTTSIILSRKIDRGTIGG >NC_007510|2004235:2059218|2024644_2025571_-|WP_157687183.1|DBSCAN-SWA MVFERIRSFRVQVCCDAGCNEQEIDRVPFFVERIEKNAGFKFCELRILDIRSSRIEISHIPGIEMFVPPRIEGSSVALVRTLADEKRMKIPDVFWRVLAKASDGAQYAKVDLMAANAVMMIDSIELKDKLTEHAFTENHGGRWVRLVAEERGGEHRKILISHAEDEERHGMLFLKLLRDLGHYAPDEMLGPPDYIDPYYMDLHEKWGGDLYRFMCFIHAAEVRTMFYVQHVLHITAAFNDPLMSRLNDTFSAIYNDEKFHIRYSAHLLGGMMDAGRDPDVLRAAFDEVHQQTANTIRTLIGNYNRKEI >NC_007510|2004235:2059218|2044130_2045045_-|WP_011352159.1|DBSCAN-SWA MTKRSLAALAASILMSVAAIDGFVAPQLAHAQASGAAGTSAPAAAPSAAPAPVPAAAEPAPPPAPATTEAVENPYGLGALWKNGDFVARFVLILLVIMSMGSWYIMITKFLEQLRANRRAKLADAQLWTAPSLAEGARLLDEASPFRFIAETAIEAGEHHEDALLEAVDRNTWIDVSVERSITNVSNRMQDGLAFLATVGSTAPFVGLFGTVWGIYHALTAIGIAGQASIDKVAGPVGEALIMTAIGLAVAVPAVLGYNFLVRRNKSVMERVRNFGAQLHTVLLAGNRRPVRAASSPAAASLAN >NC_007510|2004235:2059218|2034671_2035724_-|WP_011352152.1|DBSCAN-SWA MLAYILRRLLLMIPTLIGVVTITFAVTQFVPGGPVEQVLAQLRHGSARGGEAGGGGGGYHGSQGVDPQQIEQIRKQFGFDKPPLERYVLMLKSYATFDLGQSYFAHRSVWAVIRSKLPVSITLGMWTVILTYLVSVPLGIAKAVRNGSRFDTVTSVLVLAGYAVPGFVLGVLLLMLFGGGTFWQLFPMRGLTSDNFDDLTLIGKALDYLWHIVMPVTASVIGNFAIVTILTKNTFLEEIGRQYVLTARAKGAPERDVLWKHVLRNAAIPLLTGLPAAFVGAFLNGNLLIETLFSLDGMGQLSYDSVIRRDYPVVLGSLFLFTLIGLLTKLIADICYVLVDPRIQFNRLDH >NC_007510|2004235:2059218|2048390_2048876_+|WP_041493084.1|DBSCAN-SWA MQLHTVAVDKPETMNFILGQSHFIKSVEDIHEALVGTVPGIRFGLAFCEASGKRLVRHSGTDATLVELACRNASAIGAGHAFIVFLGDGFYPVNVLNAIKAVPEVCRIYCATANPTEVIVAETAQGRGIVGVVDGFGPLGVENDDDIRWRKDLLRTIGYKA >NC_007510|2004235:2059218|2025974_2029136_-|WP_011352147.1|DBSCAN-SWA MPSFFIDRPVFAWIVALAIVVAGVLAIPQLPIAQYPRLAPPRVVITASYPGASTETVDADVGSIIEESLDGADGLLYYETSSDGHGNLEIDVTFSPSTDPDIALVDVNNRLKQVEPRLPQQVVQQGIGVFKAANTFLMLVTLTSTDGARDSAQLGDYLNRYVLRELKRAPGVGAAELWDADEALRVWLDPAKLREYDLGADDVIAAISAQNATVTAGAIGDAPFVRGQQLTATVVVKGQLTSPEEFGRIVLKSKQDGSVVRVADVARVEIGRDDYSFYSRLNGRPAATVGIQLGPRGNALETSNAIRARLAELSRTLPPGVAVEIPFDGAHFVKIAIHEVVLTLVEAVVLVFCVMWLFLRDLRYTLVPTVVIPVTLMGAFVAMWAFGLSINVFTMFGLVLAIGILVDDAIVVVESVHRVMEEGVSPRDATRRAMKRIGGAIVGVTAVLTAVFVPMAFFPGSVGGIYRQFAVAMIASMLVSSFMALSLTPALCANLLKPVARHDGGTGRARRRGIGARLADRFGAAFARAETGYRRVAVFAVRRTGMVVAVYVALVIGCGLLYWMMPGGFLPTEDQGQLQVMIQLPAGATQARTLAVVERVEAILHGEPAIANVTSVIGWSFVGSGQNVGMAFVELKDWAKRDVDAMALRDRLNGRFGAILDGDVEAALPPSVRGIGHSDGFTFRLEDRGGVGLDALKAAREQLAERAKADPLLSAVHFEDLPDAPRIELDVDRAKAYALGVPFERIAGLLGGTFGSNYINDFPASGRMRRVIIEADPVARATDTQLMALTVPNRTGDMVPLSAIAAPHWTIGPVMLNRYNGYPSLDISGRAATGTSSGAAMAEMERLAGALPAGIGFDWVDAAREEQVAARQTPLLVGLSVLAVFMALAALYESWTIPLSVLTIVPLGMIGAIAAALARGMPNDVYFKVGMITVVGLAAKNAILIVQYARDLAGHGVPLRQAVVDAAAARFRPIVMTSMAFLLGVVPLVLATGAGAESRRSIGTGAFGGVLAATVFGLVFAPVAFRVVASVGRRGRRVAVAKREDRQAEAVGE >NC_007510|2004235:2059218|2008842_2009361_+|WP_011352134.1|DBSCAN-SWA MTDTRFQNGPTWSVPSSGRSSWKNRFFVDTEFTDFEAPQLISLAIVGENGAEFYGECTDYDATRCSDFVRAVVLPQLGRLKGRAMPFDQLSQALQAWGESIPTTSKPVLCYDLQTDLDLLRSLLGGSFPEGWAREDVRGQIDLRRLAEYFARHGGEHHALHDARANAYAYIG >NC_007510|2004235:2059218|2006122_2006644_-|WP_011352129.1|DBSCAN-SWA MTDKLSAPFVIERRFAAPRELVYAALTEAEHLSTWMSPPGMEMSHCTVDARVGGVFHYAMKPRGVPDAPAMWGKWTFRELTPPTRIVVVVQFSDAEGGVTRHPMSAQWPLYTLSTTTLSEVEGGTLMHLEWRALDAGEAEEATFDASHASMSQGWGGTMDALDKYLIQLQADR >NC_007510|2004235:2059218|2023162_2024386_-|WP_011352145.1|DBSCAN-SWA MTLNIFDGSTRVRALNKLNEWLTQSGNKQVWSRVCSGVPGRTVTARLSHGDHPDVPQIHFGSYNYSGLNDRHEVVDAAREALDRHGATTSGVRILNGTTALHLDLENALAAFTRFESCVTYSSAYVANLAVISTLAGETDVIYSDELNHASIADGIRLSRAKQVKYSHKDVAQLEGLLRSAPLSDRKFIITDGVFSMDGDFAPLPQIVELAKRYRAFVIVDDAHGTASCGPNGRGTLAHFGLEKEVDVLIGSLSKGLPGVGGFACASEEIGELLRYGSNGYIFSASIPPSVAGGLIKAVEILESEPDIQVRLHRNERQIRDGLREVGFDVMHSETPIIPLAMPDRATAFNFARGMHERGVFVNPICYPAVAMNRARIRVNASASLTESDISSALVTFAEVGRSIGLL >NC_007510|2004235:2059218|2006636_2007050_-|WP_011352130.1|DBSCAN-SWA MEADPLTVTFAALADPTRRAILARLAQGEATVGDLAGPFDLSAPAISKHLRVLTRAGLITQGRDRQFRPCRIEPGPLKQAADWTLQYRALWEQRLDQLDSYLQQLQRHPVKDSNVNAAPRATPKVSSTRTRKTRPHD >NC_007510|2004235:2059218|2020815_2021805_-|WP_011352143.1|DBSCAN-SWA MIGVSSIAWAFPETMLTAQAYGSRVGLTEEQVLAKTGIRRRFLLERDQCELPLARDAFAKLQAQTGFGDGDIDLLVYVTQNPGRSMPHSAAQLLGMLSYQGHPAAFDVSLGCSAFPYVLSLCRGMLEAQNWNRAVIVTNDPYSRLLTLDQPKTAAIFGDAATATLLERDRGGAICVGDFGTDGGEWRALHTPQWRDARRTSHLLDDTPVPEADAHSVRMNGAKLASYFSKRVPESMTKCLASNGVQPSDVDIVVLHQASRPMLELLIERLPFGVNTEVPVLLQDGGNTTSSSIPLALGRIMQERALAGRKVLLTGFGVGLSWGSLLLEF >NC_007510|2004235:2059218|2043128_2043560_-|WP_011352157.1|DBSCAN-SWA MGMNVPSGGSNAEPEVMVDINTTPLIDVMLVLLIMLIITIPIQMHSVKMDLPVGNPPPPATPPEIVQIDIDFDGTTTWNGAPVPDRAALESKLTQVAAEPVQAEIHLRPNKLVPYKDVAAVLASAQRVGATKIGLIGNEQFMQ >NC_007510|2004235:2059218|2012083_2014408_-|WP_011352137.1|DBSCAN-SWA MNPHYSAFIELGKTLAREKGMCWEIPLDPSGSAQDGVGWNLTTAVSDVPPPTYYLRDFGTEVKALAIVNAERVERGLIPLARRPLSFAWQDLIKAAVAEQLLFRRNTTGHVAQNIARPLRVVATCVSKEPWELTVDDMHVAVRIGKAIQATGKLGDLVVGIVKAVLDAQHICDAGPLYPALAVPRMKGKSAIKAKHTWSSDELRDDLEARKRGERLPERRAFWELTRIVMTEQPRTFVDDLRFAAIRTMIVTGMRAGEVALLPVDWKRERTYLDSKGRPAGDSGGISTSLMVRHFAEKQQEEETDSRVLRQATQPVPEMFRTLLTETLDHVGRITEPLRATLKLQCETGRLLPWYPVDGLVPVIEVYTRLMGNPFWLKIEREAFIERYRNGFDPAVLAELHEYQHERRRSGDLRLDMAVYKFGNRLKTAMLEGETSLCFRHGDGSPIERLERMEWHAAYLRVGELEEHIRLATPTKASDTAPLPLESGVIQPWEFLFTLPKRSLAEERNELLCDISRYMSVRRPAHDLIGLALGEHPTCVTLFEKYGLTDDDRALRMESHMLRHLQNTELFRLGVADTIISKRFNRRSVAQSYEYDHRSLAEELEQIEIPQDIEIMLGAKATTVARLIKGGKASGPIVDAFRRIQAADGDAAAYEYLRAEADGFHATPYGHCLNSFTVDPCPKHLECFAGCRHLSATDLPEHRQNLIRLEGKLSLALETIKVRPSTSIGWKNQLEHAEKRLAGVQKLLATQPGDHPFPDGLDLALPPPRGVLDD >NC_007510|2004235:2059218|2019400_2020789_-|WP_011352142.1|DBSCAN-SWA MSDDYSALRAAMHNSVTFLDKNRPFPVSPTPLLIPEREYDEICRDARQVLVALEKAIGIVSSDGSLWRHFPELHGLEDLVRFPGSTSRLIDLARFDLAIVKGGGVRMMETNAGCPGALTTVGDINQAFRASAFFNAHIAGTHVPLTVDNKFYFVDYLASLVDRGAPLNMAFISSKHRRIVTDLDRLTELAQERGYGAIRCDVQDLDYDGRQLTHKGTPVTVAYLKFDAVVTEAGDMDLGIYGPEAVRQPFLEAVRDGAFKYVNSFASQLLAENKRMLAMLHTERVQRHLSSPEVDAIKRLVARTWSLSADGLQQFGGRERLLHEKNQCVLKQVIESRGRGVHIGRHTSDAEWRELVASPTLDRYIVQEWVELDRQEVLNPVFDDPGRFAAYTDMGLYMVGGEPAGFLCRASHDPVVNVGKSGALRPTFVTNGSPRRTSISETTRMDSGGGTHSERDLGGEPT >NC_007510|2004235:2059218|2045131_2045830_-|WP_011352160.1|DBSCAN-SWA MKVEEHLATSNSGLATLGRPREFGKKQQNPVRRFGGIGIVLVLHAVLIYALLNGLATKVVQVIQHPIETRIIEPVKPPPPPPMPVVKLPPPKFAPPPPPFVPPPEVPVQAPPQATITHQSAPVPSAPAVQAPVVAPPAPAKPVSHDVGVVCPNSDTLRASIQYPKEAQENNITGDVTIEFVVDAEGNITNERVAQSADPILDRAAYNTVKRFKCVAQGQSVRVQVPFSFNLN >NC_007510|2004235:2059218|2058429_2059218_-|WP_011352176.1|protease|DBSCAN-SWA MTVAPITLAVLIAATLIIALLPLVLFRLLRKPLALNRRDTIVGVAVFTLFAMILERAFHGLVLSQTPPDGWLTQPLAFVAYSALATAVFEEVGRYLGMRFLNRRYGPSAGDGRGIGYGIGHGGAEAWFVGVLVWGQWSYLAWLASRGQLETQLADLPGDTVVRLHVMLATLSAQSILLLLLERCAAFVLQLALSVLVWRGVRAGRAGVLPLAIVLHALAAAPALLYQVRVLPAGWVEAVYFVLAAILVVVLVKACRPSQSAA >NC_007510|2004235:2059218|2004235_2005831_+|WP_011352128.1|transposase|DBSCAN-SWA MPNLDDLPDDVAALKAMLAEARASAIERELEIEQLRREIAESDLEIARLKLLIDKLKRMQFGRKSEQLAREIERLELRLEDLSAGSSVADVQHAKVRREKPATGGESSAREPLPPHLPREDRVLKPDSICPKCDGTMQSLGEDVSEQLARVAAMFKVIRTIRHKMVCPSCGHIEQPSMPGLPIERSIAHPSLLADILVSKYADHAPLYRQSEIAARDGVTLDRASMGRWVGQCEALCRPLTDALRRYTMAGTKLHADDTPIPVLAPGNKKTKTGRLWVYVRDDSRSGSTEPAAVWFAYSPDRKGIHPQTHLAGFEGILQADAYGGFDELYVNGKICEAACWDHARRKYYEVHASTPTDETKSLLEMIGELYSIEADIRGKPPDERKRVRHEKSKPLLEAFEARIRGKLATLSRKSELAGAIQYSLNHWNALTLFCEAGQAEISNALAENALRCVSLGRKNFLFAGSDSGGERAAAMYSLLGTCKLSGINPRAYLEYVLTYIADHVANRVDELLPWNVADKLRLATPPKANI >NC_007510|2004235:2059218|2018154_2019183_-|WP_011352141.1|DBSCAN-SWA MSSCPWRLAYLAIVALHGRLGLYHLYFAAFVYGLFDVLFEVSLPKLVLDLTTAENRVRSNTRLAVTHTVCSEFLGPVLGSTLSLFRPVLGLAIIALSYAVSAGLLSFLLRGARPAVPEHVTRRSIPKSLLDPVVWLLRSRVLAPLAAVGFGMSVAWGAWLSLEPYYLIEAVPRTLTKGSYGFMMAVLASGAVAAALVLERIRIDKNNLMLLFVDAAGILFLILISSVTKAPVIVGAALFLTGIGATIWSSIVITMRQELVPQHLLGGVNGFFRVIVYGGYPVGSFAGGLLADRFAIPGTYLVIAAVSAVFVPFILIAHRQLQRQISDGVAIGSDEPRSATFE >NC_007510|2004235:2059218|2050558_2050705_+|WP_011352166.1|DBSCAN-SWA MWYFSWILGIGVALGFGIINAMWLEAEVFSRGDKRNGASGPRPGGKAS >NC_007510|2004235:2059218|2054757_2056182_+|WP_011352172.1|DBSCAN-SWA MDGFKKRLNESIGFRLSVALSAAILVVATAAAAFAFSSAFDEAHELQDDVLREVATLLDREHTPSLHADGTGPARESDELSRVIVQPLGGTPQPGSDGTPPLALAPTLADGLHTVDTGGSTYRVMVRTFANGERIAVAQEAGMRDDVARESAWRTALPLLILVPILLLLVADLVRKLFRPVAVLSAGIDARDKHDLRPVPAERVPVEVRPFVVAINRMLERVAQSVAAQRRFVADAAHELRSPLTAMSLQAERLAEADLPDDARARLTALRGGLDRSRHLIGQLLALARAQSAPAAPPGNVSVHAVYRRVLEDLMPLADAKAIDIGIEDGPDTSVPVDELELVSLVMNLVDNAIRYTPPGGRVDLSTQRTDTHACVTIADTGPGIAPHERERVFDPFYRVPGNAQIGSGLGLSIVKIVADRIGAGITLAYADEAESRGLRISVRIPLVQTPGATQPDSAADTPAQHAPRDGGHA >NC_007510|2004235:2059218|2050701_2051319_+|WP_041492851.1|DBSCAN-SWA MTHLVFFCGHAGTGKTTLAKRLIGPLMRATGEAFCLLDKDTLYGRYSSAAMGALTEDPNDRDSPLYLKHLRDPEYQGLLDTARENLALGIGALVVGPLSREVRDRRILDRAWLGIAPDVTLTVVWVHTSEDVAHQRIVARGNPNDAYKLTHWDEYRQRRFVPTGHECDGIVMFDNTAPSDADVDTLLYRIAPPPPAATIVPPLPA >NC_007510|2004235:2059218|2047265_2048243_-|WP_011352162.1|DBSCAN-SWA MDVRQGFVDCVGRTPLIRLTKLSAETGCEILGKAEFMNPGGSVKDRAALYIIRDAERRGVLKPGGTVVEGTAGNTGIGLAHICAARGYRCVIVIPDTQSPDKLAILRTLGAEVRPVPAAPYRDPNNYQKIAGRLADELDNAVWANQFDNVVNRQAHYETTGPEIWRDTAGTIDAFVCATGTGGTLAGVSRYLKEQNPDVRIVLADPCGSGLYGYVKTGDLGAEGSSITEGIGSTRVTMNLAGTPIDDAVRIDDQQCVTMVYRLLREEGLYVGGSSGINVAAAVWLARRMGPGHTIVTLLCDRGDIYRARLFNREWLREKGLDPGD >NC_007510|2004235:2059218|2031962_2033606_-|WP_011352150.1|DBSCAN-SWA MTRPLLQIDGFSAHFGANAAVQDLSLSIGRGERVALVGESGSGKSVTALSILRLAQHATLSGRMLFDGEDLLAKTEQQMRGIRGADIAMVFQEPMTALNPLYTIGKQIAESLRLHEGLRPGEARERGIALLRRTGIPEPERRIDSFPHQLSGGQRQRAMIAMALACRPRLLLADEPTTALDVTVRQQIVDLLIELQEQEAAARGMAVLLITHDLNLVRRFAQRVAVMEKGVLVETNTTAALFAAPQHPYTRRLLDSAPQRAVEPVAAGAQTILDVQHLAVDYRIAAKGWRSVLGKTTFRAVHDVQLSLKRGETLGIVGESGSGKSTLASAVLGLQRPTSGGIEIDGMPLASLCTTQGRRTLYGRMQVVFQDPFGSLSPRMTVEQIVGEGLGVHRPQVAGDARRARIAALLQEVGLPPEAMLRYPHEFSGGQRQRIAIARALAVEPELLVLDEPTSALDVSIQKQVLNLLTNLQKKYKLSYLFITHDLAVMRAMAHRVIVMKAGRVVEAGETLDVLHAPSHPYTRALLASSMLAPERGPQERQERTDD >NC_007510|2004235:2059218|2014404_2015919_-|WP_167316025.1|integrase|DBSCAN-SWA MSSEPRFAGDVIDRVLAALPVLPPFIRYYDDFDDTQHSIHDPASAMLFELAINGRTIRVDFTRHEKRHALLLKHVFLYLLSVDLTVSTAANYLIAATHMSLKDIADIVRAGPLGIAAVWAGLRVREMPLAAYILAKSLLRLLCRHRLQDWSDSYSSYITNTLPLPARDAYMGVHSGDVFLNADEEAAVVRHLDETVTALTSPALVSNEVLCDTGMLLCAYQFAMRPIQIAMLNMRNVRIWHDAQDGLPTVHLTFHMAKQRGKAKQRPLTRRVKREWAPLFVALEKWRRVGGATGDAHFFNVQSNYEAGARIAALVRKLIGSDELGTATDLRHTAAQRLVDAGASHEELAEFMGHSYVQTGLVYFSTSASHAERVNRALGASDIYRRVAKIAHDRFISPEELTLLKDEQQIAGVPHGIPIAGIGGCTSGQPACPYNPVTSCYGCRKFMPLHDKVMHESVLASMREVVVFFEQSSRGDTRSPTYLQLQRTVAEIQTVIDELESEGR >NC_007510|2004235:2059218|2051415_2052516_-|WP_011352168.1|DBSCAN-SWA MKIAIVGAGLIGHTIAHMLRETGDYEVVAFDRDADALAKLSREGIATQRVDSADANAIREAVKGFDALVNALPYYLAVNVAAAAKAAGVHYFDLTEDVRATHAIRELAEGSDRAFMPQCGLAPGFIGLAAHELVNGFSEVRDVKMRVGALPEYPTNALKYNLTWSVDGLINEYCQPCEAIRDGRKQWVQPLEGLEHFSLDGTEYEAFNTSGGLGTLCETLSGKVETLDYKSVRYPGHRDLVQFLLEDLRLSTDRDTLKSIMRRAVPSTKQDVVLVFITVTGVKDGQLVQDVFTRKIFAKDVCGMHMSAIQITTAGAMCAVLDLFREKKLPQSGFIPQEKVSLKAFLSNRFGKLYEGGTMERVHAVA >NC_007510|2004235:2059218|2030518_2031970_-|WP_011352149.1|DBSCAN-SWA MIDESRAPGARAVQAAKALRSDADFWSLRIVDETIDEHAVRNDVAQPFSRTRERGAMLTAWAGTGAGYAATPDLSPAGLQAALDIATARARASAPWALVDHRQAARPQASGTYASPDVDAALPSRAEWIERLAHECAAANLGARIVERAASVMIVHTEQCYVTSDGVRIDQRFRFLMPELQAVAHGDGDTQTRSLGQSGTLAQGGIDVLARYGFDGAGARVADEALQLLAAPNCPTGRRDLLLMPDQMMLQIHESIGHPLELDRILGDERNFAGSSFVRPEMVGHYQYGSPLLNVTFDPEPAAEAASYAFDDDGTPARKQYLIRNGVLERLLGGALSQQRAGLPGVANSRASSWNRAPIDRMANLNVEPGDQSLGALIAGTEHGILMRTNTSWSIDDHRNKFQFGCEFGQLIENGRLTQVVKRPNYRGISAQFWRSLRAVGDASTFGIYGTPYCGKGEPAQIIRVGHAAPACVFAGVDVFGGA >NC_007510|2004235:2059218|2057588_2058020_+|WP_011352174.1|DBSCAN-SWA MLELKQIDLRTDPQARRVVKDETVTVSFAEADGELMSLEGPNRYVAGDALITGSTGDRWVVSRARFDAKYLPADPALAHGAPGAYRNLPAVVLARRMDEPFTIARSENGDTLRGVAGDWVMQYAPGDYGVVQAQRFAQVYRDA >NC_007510|2004235:2059218|2049100_2049394_+|WP_011352164.1|DBSCAN-SWA MHYQLTYELVDDYLSRREPFRAQHLALAQAATERGELVLAGALSDPADQAVLVFEGDSPEAAESFARADPYVQNGLVKSWRVRPWRVVIGKHAPPRA >NC_007510|2004235:2059218|2033605_2034718_-|WP_011352151.1|DBSCAN-SWA MSSSTPVSSSTAWTTDQACAACAASPSPWRRTWLRFRAQPLGYWSLVIFTVLFALSLGANLLSNDRPLIVRYDGHYYFPIVKDYPETLFGGDFPAMTNYLDPYIRTKIESHGNFAIYPPNRYRYDTIDYFASRPYPAPPSSSNWLGTDQFGRDVLARLLYGFRLSVLMAFALTVSGVLVGVLTGALQGFYGGRTDLVGQRLIEIWSALPDLYLLIIFASIFTPSLWLLFILLSMFGWLVLSDYVRAEFLRNRGLDYVKAARTMGLTNTQIIWRHVLPNSLTPVITFLPFRMSAAILSLTSLDFLGLGVPPPTPSLGELLQEGKNNLDAWWISIAAFAALVVTLLLLTFMGDALRNALDTRTRGSAFGGGR >NC_007510|2004235:2059218|2041043_2041619_-|WP_011352155.1|DBSCAN-SWA MVLPPDESVLVAMSLGNKIRALRQRLKLTLDETSTIAGISKPFLSQVERGRATPSITSLVRIAKALGVTMQYFIDTPTEARSVCRGNALQYFQFTNSASRFARLTNLVDGRKLDAILVRMPAGQLSSEMTTHAGEEFVYVLRGQVALTLEDCTFTLNEGDTAHYESTMPHAWRNTADEEAVIVWVGTPRLF >NC_007510|2004235:2059218|2007334_2007655_+|WP_011352131.1|DBSCAN-SWA MIHTEFSKLSHHAPRARESGTGRAEPLTVVGEPTINLFGTTFAVVEFDWDFMAVRRSNGGKLHSTGRESHVYIHVDDHGWRLLHVHYSEPTVAGGGGERRVALNQN >NC_007510|2004235:2059218|2024382_2024643_-|WP_041492849.1|DBSCAN-SWA MQQHEIAQKIVDFLVKHGGIKAGEFDADAEIFADGAIDSFTVLELTIFIEDTFGVEIDEEQVGRLKSIGRIADRITNEANANGEVE >NC_007510|2004235:2059218|2009442_2011209_-|WP_011352135.1|integrase|DBSCAN-SWA MGTASTSSAGSDDGNLPATALPNLASLAALRAWHAGLTARAAVERYLPHRRADGQSSRGLLGQIRRQLVACAIARQRDDLIPLFTHPAADRTRHAAQVITALDVLRTAPIPVPQPTDPVARWLPPRIVRALGVAGIDSFGDLLVRRVHRPGWWRSIPGLGGQGAHRVEAFLSQHPDLSRRAATLVPYEAATAQLAPRWSLDALPADLDGSRGRFRAPHDTCGLAAANDHQAVEAWLALHEGTHTVRAYRREAERLLLWAIVERRQPLSSLTTDDAIAYRAFLRHPTPAARWIGPPRPRTTPAWRPFTGHLTARSTAYALAVLRALFGWLVDQHYVVLNPFAGVTVRGGAARAPFDTGRALNGREWKIVRAAADQLERTGWTAPAAERLRFVLDFGYATGLRAQELVAATLGDITADARGALWLQVTGKGAKSGRVALPPLARDALRRALKARGLPVMRTRWHPATPLVTALNSERRGKRHAGISAARLRQVLGECFRDTAERVSTRNPPLAEKLLHASPHWLRHTHATHALDAGVELVIVRDNLRHASVATTSTYLHGEEAKRARQVGAAFQRQPRRSQHRDTSRGTR >NC_007510|2004235:2059218|2053594_2054035_+|WP_011352170.1|DBSCAN-SWA MRIPHPVSRLAVAAATIALLSAAAGCSKQQDANAATPVPASAATSPATAHASKLGDLSQFRTIAVDVNTLVAKGDLPGAKTRIKDLEIAWDSAEAGLKPRAAADWHMVDQEIDRALSALRADHPTQADSAAAMKNLLAALDRFSGK >NC_007510|2004235:2059218|2021801_2023166_-|WP_011352144.1|DBSCAN-SWA MSSASPLIELKANEAQCLHTDLGWHDALLAAYAEFGFCYEYGDIPIAPVLMEAAFLKQASHDLRVLMQLARAVSLAHLDDRAPIRAHDRSIVERYRDAERQPLMGRPDCILAGGKLRVLEFNMDSGMGGIQEVGVLTDGYLDSGLGSLLAYELQNPFESQLGFIQNQLTKASRQSVCITPLADFSRFYLDQSDHLAARISARLGVPARTVFPEALRQGEWLTDGQTEYGIFYRDACFLHEPILLAGMAQALNAARSTRTITLGDPLDIGVDDKGALALVSEFLDQQSQDDPELGRLKALVPWTRLMDRKVTTVDGEIVDLEKFVRAQKGTLVLKRCASHVGKQVFIGSRTDDAIWDDIISRTKMKRDDGEDWVVQRFVVADKFPLWFRDHDGSLGLRHCSGTAGPFVFGDQPDGWLVRIQGAQADDDAVFALPTDGNIGLTTVAAVDTLKGVST >NC_007510|2004235:2059218|2037899_2040707_-|WP_011352154.1|DBSCAN-SWA MKQRVLALAIKRIVWAELALTAALAPPAFAQSQPAPGAVAIADAVASGAPVTVAQAGGTPGAAAATGTAAPAAPDAAAGAAAEVTPAAPGANGQGKVAQIKRFEVTGSLIRQADKTGFQQVQTITPKEIQASGAVTVTDFLRDAAANSANSWGEGQSGNFAAGAAGIALRGLSEKYTLVMVDGQRVAPYAFFSNSVDSFFDLNTIPLNDIERIEIVKTGAVSQYGSDAIGGVVNIITKHNFRGLQLDGSLGSAINAGNGDGTVKFGVLGGFGDLNADRFNVTAALSYYKSNGFTLADRDSTRNQDFTGKPGGFSLLAPSYWNMPGGVAQALSGGCPFGGSVHPAVSNSLSAGSPGTVCGYNTAESTSILPMTERLNAKLHADFKIDDKTTAFADFLESYNTTTTNDGLWNNVIGNSQNPALVWNPQTKLLSPFNFTVPVSNPYNTTGAATPLTYAFPNTVAQKTWANYWRAAAGIKGSFTLPYGDWDWATSVSHSQSTVSNVFTNQLNVNALNNIYQNGTLNFANPAATPNAFNGLYQEANNLGISKLDTIDATVSTPNLFHLPAGDVGIGFGAQFTHQSETLTPGSEYLSGAVITPDLETVDGQRNVAAVYYQINVPILENLTFSQAGRYDHYTDVGGAFSPRFALRYQPIKALTLYTSYNRGFRAPTFVEDSKSQTLGIQIDPATGQNYTSITVGNPNLAPERTRNFNIGFQVSPSRYTDIGFDWYKIRIDNVIGQGTPSQIATDPTTGQLLYKVIPYQNLGYLDTNGFEGTFRQGLPLKGWGTVTLSGDWAYINSYKIGFPGGTPVNGAGNNFTITQPFGGSFPRWRGNTTLDWNYRKFDAALTWQFTGPYAQNLTPAPAPSKVGSYSQFNLMMSYTGFKNWTIYGGINNLFNRTPPYDPIFANGTLDQSGYDTSVYSYIGRFAQIGATYKF >NC_007510|2004235:2059218|2043687_2044110_-|WP_011352158.1|DBSCAN-SWA MAMNVGQDDSDEVIANINTTPLVDVMLVLLIIFLITIPVVTHTIQLQLPKETVQPLQTTPKSIEIAVNRDGDFFWGEQLVDAQTLLAKLKGVSQQQPQPSVHVRGDQNTRYEFIGRVVTMCERAGIAKLSFITEPPARGG >NC_007510|2004235:2059218|2052764_2053547_+|WP_011352169.1|DBSCAN-SWA MQSLPAHRPPHWLNKVPDVALSFWIIKIMSTTVGETGADFLAVNAGLGQTVTRVAMASLLAIALWLQLRTRRYVPWIYWLTVVLVSIVGTQITDLLTDGLNVSLYASTAAFAVGLAAIFFVWHRVEGTLSIDTIVTPRRELFYWAAILCTFALGTAAGDLATEALGLGFTLGALCFGGLIAAVFAAWRLGANAVLAFWIAYILTRPFGASLGDLLTQARTYGGLGFGAAWTSLLFLTVIVLLVAVAQFGSGPRTRAGTAE >NC_007510|2004235:2059218|2049401_2050379_-|WP_011352165.1|DBSCAN-SWA MLINCAAYQDGRKLADIDIDAISDYVSKPECFVWVALKDPTPAELDLMGEEFGLHELALEDARKGHQRPKIEEYGDSLFAVLHMVELDDDDELNVGELNVFVGPNYVLSIRNHTEQDFRDVRKRCEREPHLLQEGSAFVFYALMDQVVDRYFPILETLGTELEELEDRIFAKATPASSRAIIEDLYSLKRRLVMLYQHTAPLIEPLAKLTGGRIPQVCSGMEPYFRDVYDHLQRIVKTIEGRREMVVTAIQVNLGMISLAESEVTKRLGSFAALFAIPTMIAGIYGMNFANMPELHLKYGFYGCIAAMVVADFALWWRFKRAGWL >NC_007510|2004235:2059218|2007937_2008183_-|WP_041492845.1|DBSCAN-SWA MQRVSGKRDDAGHLAAIGATVRARRLEFGVSQEALALLTGIDRSHMGRIERGQRNLTILNLIRIADALNISPSKLLKSSGL >NC_007510|2004235:2059218|2054116_2054782_+|WP_011352171.1|DBSCAN-SWA MRILLVEDDPMIGEAVHAALKDASYATDWVTDGVRALTAFAAQPYDLVLLDLGLPGRDGLDVLAAIRAKDASAPLLIVTARDGLDDRLAGLDGGADDYIVKPFEMAELLARIRVAIRRRAGSAAPLLSNGIVSLDPATREASVDGHAPVPLSNREFALLRALLVRPGAILSRRELEDRLYGWGEEVESNAVEFLIHSLRRKLGSTVIKNVRGAGWMVSRSG >NC_007510|2004235:2059218|2015944_2017168_-|WP_041492846.1|integrase|DBSCAN-SWA MFQQLFGVAVPPSLRGPLLLDDTGLPRYWAAVWSTMSSSQLAVSTHTKKLRYLEDLYRHAAVLGGPNALDDALGTLNHEALAEILESWFVSIRNQSVVTGADEKRWQTGLEFVSSVVTWLSKSEAANDRLRQIEPRLHRLAGLYSQLHVRRDNSVEMVRSLPASIVEALYLLLDPDSRRNPFARERTRWRVFVAFVLMLHQGLRRGEALLLPADAVKSAYDHRQQRTRYWLNVQENEYEDSEVDPRYSKPGIKNAHSVRQVPVSELTAGVVQRYVENYRGRPSHSYLLNSQCDTPLSTEALTKIFARISCRLPREVLAELKNRTGKDAITPHDLRHTCAVVRLHQLLEHGDAMDEALQKMRTFFGWSKTSAMASRYARAVFEDRLAGVWNDAFDDRVALLRALPLRS >NC_007510|2004235:2059218|2046655_2047141_-|WP_011352161.1|DBSCAN-SWA MPVSATLQDCLRQKSSRYEVVYHPYSHTSMETAAAAHIPGDRLAKTVLLEDDEGYVAAVLPTTYAVRLSDLWVKTGRHLVLAREVELRELFKDCDMGALPPVCMAYGMKTFLEERLAQQPEVYFEAGDHEALIHMMQDEFLMLMETAERAHFSHKMQGMHS >NC_007510|2004235:2059218|2058053_2058416_-|WP_011352175.1|DBSCAN-SWA MDDYLIECQSAEFDALARVICDLFPEQTRFAENTDARGRFLSVQWLAMRFGSAPKRMTLDIRIVPAALARYLALKPMLRARSHAVLHAYTEAMLGSLEERHAAGEAVERDAELELDEDFA >NC_007510|2004235:2059218|2017386_2018034_-|WP_011352140.1|DBSCAN-SWA MSYVLYYSPGAASMAVHWMLIELGVPFETRLVDIDTGAQHDSEYLRLNPAGRVPTLVVDGIPRTESAALLMLLAERHPEPGLAPHPGAPERPEWFEMMIYLANSVLPAMRNWFYAEKDGDPRCAEMVKVFSRGQVEASMAHLDNLLSDGRTYLINDRLNTVDFLALMLMRWTRNMPRPATTWPNLVRYIQHLRGTPTFLELNAREGLTEWMNPVA >NC_007510|2004235:2059218|2056905_2057451_+|WP_011352173.1|DBSCAN-SWA MKKISLTLATLAVAASAFAQTPAQPQAPAQATTAAASAPSAEQREARHEARVEQRIKYLHDQLKITSAQEPQWKTFADTMRDNGDTMGRLYRERMAKHDVSALDDMKQYAELSQANADGAKKLADSFAPLYESFPADQKALADTTFRSWLHHGGEHRGKGKTKGKEGKAAAAPAASAPAQP |
50 | Stx2-converting_phage(20.0%) | transposase,integrase,protease | attL 2014914:2014931|attR 2021543:2021560 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|