Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP034953 | Escherichia coli strain MT102 chromosome, complete genome | 8 crisprs | csa3,PD-DExK,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4 | 0 | 12 | 8 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034953_1 | 652069-652209 | Orphan |
NA
Consensus repeat of CP034953_1
|
1 spacer
spacers of CP034953_1
>1.1|652110|59|CP034953|CRISPRCasFinder GACGCTTGTCGCGTCTTATCCGACCTACGGGAACACACATGTAGGGCGGATAAGGCGTT |
CRISPR arrays and Neighbor proteins around CP034953_1
The CRISPR arrays of CP034953_1 >merge|CP034953|1|652069-652209|CRISPRCasFinder CACGCCGCATCCGCCAGTGGCGCGGTGCAGATGCCGGATGCGACGCTTGTCGCGTCTTATCCGACCTACGGGAACACACATGTAGGGCGGATAAGGCGTTCACGCCGCATCCGCCAGTGGCGCGGTGCAGTTGCCGGATGC >CP034953|1|1|652069-652209|CRISPRCasFinder CACGCCGCATCCGCCAGTGGCGCGGTGCAGATGCCGGATGC GACGCTTGTCGCGTCTTATCCGACCTACGGGAACACACATGTAGGGCGGATAAGGCGTT CACGCCGCATCCGCCAGTGGCGCGGTGCAGTTGCCGGATGC
>CP034953.1|QAA88436.1|649851_651870_-|NADPH-dependent-2,4-dienoyl-CoA-reductase MSYPSLFAPLDLGFTTLKNRVLMGSMHTGLEEYPDGAERLAAFYAERARHGVALIVSGGIAPDLTGVGMEGGAMLNDASQIPHHRTITEAVHQEGGKIALQILHTGRYSYQPHLVAPSALQAPINRFVPHELSHEEILQLIDNFARCAQLAREAGYDGVEVMGSEGYLINEFLTLRTNQRSDQWGGDYRNRMRFAVEVVRAVRERVGNDFIIIYRLSMLDLVEDGGTFAETVELAQAIEAAGATIINTGIGWHEARIPTIATPVPRGAFSWVTRKLKGHVSLPLVTTNRINDPQVADDILSRGDADMVSMARPFLADAELLSKAQSGRADEINTCIGCNQACLDQIFVGKVTSCLVNPRACHETKMPILPAVQKKNLAVVGAGPAGLAFAINAAARGHQVTLFDAHSEIGGQFNIAKQIPGKEEFYETLRYYRRMIEVTGVTLKLNHTVTADQLQAFDETILASGIVPRTPPIDGIDHPKVLSYLDVLRDKAPVGNKVAIIGCGGIGFDTAMYLSQPGESTSQNIAGFCNEWGIDSSLQQAGGLSPQGMQIPRSPRQIVMLQRKASKPGQGLGKTTGWIHRTTLLSRGVKMIPGVSYQKIDDDGLHVVINGETQVLAVDNVVICAGQEPNRALAQPLIDSGKTVHLIGGCDVAMELDARRAIAQGTRLALEI >CP034953.1|QAA88435.1|649390_649807_+|type-II-toxin-antitoxin-system-antitoxin-HigA MIAIADILQAGEKLTAVAPFLAGIQNEEQYTQALELVDHLLLNDPENPLLDLVCAKITAWEESAPEFAEFNAMAQAMPGGIAVIRTLMDQYGLTLSDLPEIGSKSMVSRVLSGKRKLTLEHAKKLATRFGISPALFID >CP034953.1|QAA88434.1|649079_649394_+|mRNA-interferase-HigB MHLITQKALKDAAEKYPQHKTELVALGNTIAKGYFKKPESLKAVFPSLDNFKYLDKHYVFNVGGNELRVVAMVFFESQKCYIREVMTHKEYDFFTAVHRTKGKK >CP034953.1|QAA88433.1|647659_648796_+|23S-rRNA-(guanine(1835)-N(2))-methyltransferase-RlmG MSHLDNGFRSLTLQRFPATDDVNPLQAWEAADEYLLQQLDDTEIRGPVLILNDAFGALSCALAEHKPYSIGDSYISELATRENLRLNGIDESSVKFLDSTADYPQQPGVVLIKVPKTLALLEQQLRALRKVVTSDTRIIAGAKARDIHTSTLELFEKVLGPTTTTLAWKKARLINCTFNEPQLADAPQTVSWKLEGTDWTIHNHANVFSRTGLDIGARFFMQHLPENLEGEIVDLGCGNGVIGLTLLDKNPQAKVVFVDESPMAVASSRLNVETNMPEALDRCEFMINNALSGVEPFRFNAVLCNPPFHQQHALTDNVAWEMFHHARRCLKINGELYIVANRHLDYFHKLKKIFGNCTTIATNNKFVVLKAVKLGRRR >CP034953.1|QAA88432.1|647071_647575_-|M48-family-peptidase MSNLTYLQGYPEQLLSQVRTLINEQRLGDVLAKRYPGTHDYATDKALWQYTQDLKNQFLRNAPPINKVMYDNKIHVLKNALGLHTAVSRVQGGKLKAKVEIRVATVFRNAPEPFLRMIVVHELAHLKEKEHNKAFYQLCCHMEPQYHQLEFDTRLWLTQLSLGQNKI >CP034953.1|QAA88431.1|646302_646995_-|vancomycin-high-temperature-exclusion-protein 
MLRAFARLLLRICFSRRTLKIACLLLLVAGATILIADRVMVNASKQLTWSDVNAVPARNVGLLLGARPGNRYFTRRIDTAAALYHAGKVKWLLVSGDNGRKNYDEASGMQQALIAKGVPAKVIFCDYAGFSTLDSVVRAKKVFGENHITIISQEFHNQRAIWLAKQYGIDAIGFNAPDLNMKHGFYTQLREKLARVSAVIDAKILHRQPKYLGPSVMIGPFSEHGCPAQK >CP034953.1|QAA88430.1|645237_646224_-|Gfo/Idh/MocA-family-oxidoreductase MIRFAVIGTNWITRQFVEAAHESGKYKLTAVYSRSLEQAQHFANDFSVEHLFTSLEAMAESDAIDAVYIASPNSLHFSQTQLFLSHKINVICEKPLASNLAEVDAAIACARENQVVLFEAFKTACLPNFHLLRQALPKVGKLRKVFFNYCQYSSRYQRYLDGENPNTFNPAFSNGSIMDIGFYCLASAVALFGEPKSVQATASLLASGVDAQGVVVMDYGDFSVTLQHSKVSDSVLASEIQGEAGSLVIEKLSECQKVCFVPRGSQMQDLTQPQHINTMLYEAELFATLVDEHLVDHPGLAVSRITAKLLTEIRRQTGVIFPADSVKL >CP034953.1|QAA88429.1|643989_644955_-|TerC-family-membrane-protein-Alx MNTVGTPLLWGGFAVVVAIMLAIDLLLQGRRGAHAMTMKQAAAWSLVWVTLSLLFNAAFWWYLVQTEGRAVADPQALAFLTGYLIEKSLAVDNVFVWLMLFSYFSVPAALQRRVLVYGVLGAIVLRTIMIFTGSWLISQFDWILYIFGAFLLFTGVKMALAHEDESGIGDKPLVRWLRGHLRMTDTIDNEHFFVRKNGLLYATPLMLVLILVELSDVIFAVDSIPAIFAVTTDPFIVLTSNLFAILGLRAMYFLLAGVAERFSMLKYGLAVILVFIGIKMLIVDFYHIPIAVSLGVVFGILVMTFIINAWVNYRHDKQRGG >CP034953.1|QAA88428.1|642346_643591_-|serine/threonine-transporter-SstT MTTQRSPGLFRRLAHGSLVKQILVGLVLGILLAWISKPAAEAVGLLGTLFVGALKAVAPILVLMLVMASIANHQHGQKTNIRPILFLYLLGTFSAALAAVVFSFAFPSTLHLSSSAGDISPPSGIVEVMRGLVMSMVSNPIDALLKGNYIGILVWAIGLGFALRHGNETTKNLVNDMSNAVTFMVKLVIRFAPIGIFGLVSSTLATTGFSTLWGYAQLLVVLVGCMLLVALVVNPLLVWWKIRRNPFPLVLLCLRESGVYAFFTRSSAANIPVNMALCEKLNLDRDTYSVSIPLGATINMAGAAITITVLTLAAVNTLGIPVDLPTALLLSVVASLCACGASGVAGGSLLLIPLACNMFGISNDIAMQVVAVGFIIGVLQDSCETALNSSTDVLFTAAACQAEDDRLANSALRN >CP034953.1|QAA88427.1|641790_642342_+|YgjV-family-protein MTAYWLAQGVGVIAFLIGITTFFNRDERRFKKQLSVYSAVIGVHFFLLGTYPAGASAILNAIRTLITLRTRSLWVMAIFIVLTGGIGLAKFHHPVELLPVIGTIVSTWALFCCKGLTMRCVMWFSTCCWVIHNFWAGSIGGTMIEGSFLLMNGLNIIRFWRMQKRGIDPFKVEKTPSAVDERG >CP034953.1|QAA88437.1|652295_654647_-|alpha-glucosidase 
MKIKTILTPVTCALLISFSAHAANADNYKNVINRTGAPQYMKDYDYDDHQRFNPFFDLGAWHGHLLPDGPNTMGGFPGVALLTEEYINFMASNFDRLTVWQDGKKVDFTLEAYSIPGALVQKLTAKDVQVEMTLRFATPRTSLLETKITSNKPLDLVWDGELLEKLEAKEGKPLSDKTIAGEYPDYQRKISATRDGLKVTFGKVRATWDLLTSGESEYQVHKSLPVQTEINGNRFTSKAHINGSTTLYTTYSHLLTAQEVSKEQMQIRDILARPAFYLTASQQRWEEYLKKGLTNPDATPEQTRVAVKAIETLNGNWRSPGGAVKFNTVTPSVTGRWFSGNQTWPWDTWKQAFAMAHFNPDIAKENIRAVFSWQIQPGDSVRPQDVGFVPDLIAWNLSPERGGDGGNWNERNTKPSLAAWSVMEVYNVTQDKTWVAEMYPKLVAYHDWWLRNRDHNGNGVPEYGATRDKAHNTESGEMLFTVKKGDKEETQSGLNNYARVVEKGQYDSLEIPAQVAASWESGRDDAAVFGFIDKEQLDKYVANGGKRSDWTVKFAENRSQDGTLLGYSLLQESVDQASYMYSDNHYLAEMATILGKPEEAKRYRQLAQQLADYINTCMFDPTTQFYYDVRIEDKPLANGCAGKPIVERGKGPEGWSPLFNGAATQANADAVVKVMLDPKEFNTFVPLGTAALTNPAFGADIYWRGRVWVDQFWFGLKGMERYGYRDDALKLADTFFRHAKGLTADGPIQENYNPLTGAQQGAPNFSWSAAHLYMLYNDFFRKQ >CP034953.1|QAA88438.1|654663_655734_-|protein-YgjJ MKLITAPCRALLALPFCYAFSAAGEEARPAEHDDTKTPAITSTSSPSFRFYGELGVGGYMDLEGENKHKYSDGTYIEGGLEMKYGSWFGLIYGEGWTVQADHDGNAWVPDHSWGGFEGGINRFYGGYRTNDGTEIMLSLRQDSSLDDLQWWGDFTPDLGYVIPNTRDIMTALKVQNLSGNFRYSVTATPAGHHDESKAWLHFGKYDRYDDKYTYPAMMNGYIQYDLAEGITWMNGLEITDGTGQLYLTGLLTPNFAARAWHHTGRADGLDVPGSESGMMVSAMYEALKGVYLSTAYTYAKHRPDHADDETTSFMQFGIWYEYGGGRFATAFDSRFYMKNASHDPSDQIFLMQYFYW >CP034953.1|QAA88439.1|655867_657301_-|amino-acid-permease MSDTKRNTIGKFGLLSLTFAAVYSFNNVINNNIELGLASAPMFFLATIFYFIPFCLIIAEFVSLNKNSEAGVYAWVKSSLGGRWAFITAYTYWFVNLFFFTSLLPRVIAYASYAFLGYEYIMTPVATTIISMVLFAFSTWVSTNGAKMLGPITSVTSTLMLLLTLSYILLAGTALVGGVQPADAITVDAMIPNFNWAFLGVTTWIFMAAGGAESVAVYVNDVKGGSKSFVKVIILAGIFIGVLYSVSSVLINVFVSSKELKFTGGSVQVFHGMAAYFGLPEALMNRFVGLVSFTAMFGSLLMWTATPVKIFFSEIPEGIFGKKTVELNENGVPARAAWIQFLIVIPLMIIPMLGSNTVQDLMNTIINMTAAASMLPPLFIMLAYLNLRAKLDHLPRDFRMGSRRTGIIVVSMLIAIFAVGFVASTFPTGANILTIIFYNVGGIVIFLGFAWWKYSKYIKGLTAEERHIEATPASNVD >CP034953.1|QAA88440.1|657363_657813_-|beta-galactosidase-subunit-beta MRIIDNLEQFRQIYASGKKWQRCVEAIENIDNIQPGVAHSIGDSLTYRVETDSATDALFTGHRRYFEVHYYLQGQQKIEYAPKETLQVVEYYRDETDREYLKGCGETVEVHEGQIVICDIHEAYRFICNNAVKKVVLKVTIEDGYFHNK 
>CP034953.1|QAA88441.1|657809_660902_-|beta-galactosidase-subunit-alpha MNRWENIQLTHENRLAPRAYFFSYDSVAQARTFARETSSLFLPLSGQWNFHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGWQGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYLVGKHLTHINDFTVRTDFDEAYCDATLSCEVVLENLAASPVVTTLEYTLFDGERVVHSSAIDHLAIEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDANGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWEKVYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYEEDRDAEVVDIISTMYTRVPLMNEFGEYPHPKPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDHGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHARDLTRGELKVENKLWFTTLDDYTLHAEVRAEGETLATQQIKLRDVAPNSEAPLQITLPQLDAREAFLNITVTKDSRTRYSEAGHPIATYQFPLKENTAQPVPFAPNNARPLTLEDDRLSCTVRGYNFAITFSKMSGKPTSWQVNGESLLTREPKINFFKPMIDNHKQEYEGLWQPNHLQIMQEHLRDFAVEQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAADGQVNVALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWHYTQENIHAAQHCNELQRSDDITLNLDHQLLGLGSNSWGSEVLDSWRVWFRDFSYGFTLLPVSGGEATAQSLASYEFGAGFFSTNLHSENKQ >CP034953.1|QAA88442.1|661085_662069_-|transcriptional-regulator-EbgR MATLKDIAIEAGVSLATVSRVLNDDPTLNVKEETKHRILEIAEKLEYKTSSARKLQTGAVNQHHILAIYSYQQELEINDPYYLAIRHGIETQCEKLGIELTNCYEHSGLPDIKNVTGILIVGKPTPALRAAASALTDNICFIDFHEPGSGYDAVDIDLARISKEIIDFYINQGVNRIGFIGGEDEPGKADIREVAFAEYGRLKQVVREEDIWRGGFSSSSGYELAKQMLAREDYPKALFVASDSIAIGVLRAIHERGLNIPQDISLISVNDIPTARFTFPPLSTVRIHSEMMGSQGVNLVYEKARDGRALPLLVFVPSKLKLRGTTR >CP034953.1|QAA88443.1|662287_662620_+|tRNA-binding-protein METVAYADFARLEMRVGKIVEVKRHENADKLYIVQVDVGQKTLQTVTSLVPYYSEEELMGKTVVVLCNLQKAKMRGETSECMLLCAETDDGSESVLLTPERMMPAGVRVV >CP034953.1|QAA88444.1|662661_664152_-|putrescine-aminotransferase 
MITEFVFIPIFAIAAGVAQSLQYLNRYHVIREPPEHILNRLPSSASALACSAHALNLIEKRTLDHEEMKALNREVIEYFKEHVNPGFLEYRKSVTAGGDYGAVEWQAGSLNTLVDTQGQEFIDCLGGFGIFNVGHRNPVVVSAVQNQLAKQPLHSQELLDPLRAMLAKTLAALTPGKLKYSFFCNSGTESVEAALKLAKAYQSPRGKFTFIATSGAFHGKSLGALSATAKSTFRKPFMPLLPGFRHVPFGNIEAMRTALNECKKTGDDVAAVILEPIQGEGGVILPPPGYLTAVRKLCDEFGALMILDEVQTGMGRTGKMFACEHENVQPDILCLAKALGGGVMPIGATIATEEVFSVLFDNPFLHTTTFGGNPLACAAALATINVLLEQNLPAQAEQKGDMLLDGFRQLAREYPDLVQEARGKGMLMAIEFVDNEIGYNFASEMFRQRVLVAGTLNNAKTIRIEPPLTLTIEQCELVIKAARKALAAMRVSVEEA >CP034953.1|QAA88445.1|664458_665979_+|PAS-domain-S-box-protein MSSHPYVTQQNTPLADDTTLMSTTDLQSYITHANDTFVQVSGYTLQELQGQPHNMVRHPDMPKAAFADMWFTLKKGEPWSGIVKNRRKNGDHYWVRANAVPMVREGKISGYMSIRTRATDEEIAAVEPLYKALNAGRTSKRIHKGLVVRKGWLGKLPSLPLRWRARGVMTLMFILLAAMLWFVAAPVVTYILCALVVLLASACFEWQIVRPIENVAHQALKVATGERNSVEHLNRSDELGLTLRAVGQLGLMCRWLINDVSSQVSSVRNGSETLAKGTDELNEHTQQTVDNVQQTVATMNQMAASVKQNSATASAADKLSITASNAAVQGGEAMTTVIKTMDDIADSTQRIGTITSLINDIAFQTNILALNAAVEAARAGEQGKGFAVVAGEVRHLASRSANAANDIRKLIDASADKVQSGSQQVHAAGRTMEDIVAQVKNVTQLIAQISHSTLEQADGLSSLTRAVDELNLITQKNAELVEESAQVSAMVKHRASRLEDAVTVLH >CP034953.1|QAA88446.1|666132_666756_-|PadR-family-transcriptional-regulator MSHHHEGCCKHEGQPRHEGCCKGEKSEHEHCGHGHQHEHGQCCGGRHGRGGGRRQRFFGHGELRLVILDILSRDDSHGYELIKAIENLTQGNYTPSPGVIYPTLDFLQEQSLITIREEEGGKKQIALTEQGAQWLEENREQVEMIEERIKARCVGAALRQNPQMKRALDNFKAVLDLRVNQSDISDAQIKKIIAVIDRAAFDITQLD |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034953_2 | 980239-980632 | Unclear |
I-E
Consensus repeat of CP034953_2
|
6 spacers
spacers of CP034953_2
>2.1|980267|33|CP034953|PILER-CR,CRISPRCasFinder,CRT GGCAAAAACCGGGCAATCGCAAAAAGGCGTAAT >2.2|980328|33|CP034953|PILER-CR,CRISPRCasFinder,CRT TGTGTTTGCGGCATTAACGCTCACCAGCATTTC >2.3|980389|33|CP034953|PILER-CR,CRISPRCasFinder,CRT GACGTGGTCATGGGTGCTGCTGTTGCAGAGCCA >2.4|980450|33|CP034953|PILER-CR,CRISPRCasFinder,CRT GAGCAGATACACGGCTTTGTATTCCGTGCGCCC >2.5|980511|33|CP034953|PILER-CR,CRISPRCasFinder,CRT GAATAGCAATAGTCCATAGATTTGCGAAAACAG >2.6|980572|33|CP034953|PILER-CR,CRISPRCasFinder,CRT GGAGCCTGACGAGACTACTGAGGCCGTTCTGTC |
CRISPR arrays and Neighbor proteins around CP034953_2
The CRISPR arrays of CP034953_2 >merge|CP034953|2|980239-980632|PILER-CR,CRISPRCasFinder,CRT GTGTTCCCCGCGCCAGCGGGGATAAACCGGCAAAAACCGGGCAATCGCAAAAAGGCGTAATGTGTTCCCCGCGCCAGCGGGGATAAACCTGTGTTTGCGGCATTAACGCTCACCAGCATTTCGTGTTCCCCGCGCCAGCGGGGATAAACCGACGTGGTCATGGGTGCTGCTGTTGCAGAGCCAGTGTTCCCCGCGCCAGCGGGGATAAACCGAGCAGATACACGGCTTTGTATTCCGTGCGCCCGTGTTCCCCGCGCCAGCGGGGATAAACCGAATAGCAATAGTCCATAGATTTGCGAAAACAGGTGTTCCCCGCGCCAGCGGGGATAAACCGGAGCCTGACGAGACTACTGAGGCCGTTCTGTCGAGTTCCCCGCGCCAGCGGGGATAAACC >CP034953|2|1|980239-980632|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACC GGCAAAAACCGGGCAATCGCAAAAAGGCGTAAT GTGTTCCCCGCGCCAGCGGGGATAAACC TGTGTTTGCGGCATTAACGCTCACCAGCATTTC GTGTTCCCCGCGCCAGCGGGGATAAACC GACGTGGTCATGGGTGCTGCTGTTGCAGAGCCA GTGTTCCCCGCGCCAGCGGGGATAAACC GAGCAGATACACGGCTTTGTATTCCGTGCGCCC GTGTTCCCCGCGCCAGCGGGGATAAACC GAATAGCAATAGTCCATAGATTTGCGAAAACAG GTGTTCCCCGCGCCAGCGGGGATAAACC GGAGCCTGACGAGACTACTGAGGCCGTTCTGTC GAGTTCCCCGCGCCAGCGGGGATAAACC >CP034953|2|2|980239-980632|CRISPRCasFinder GTGTTCCCCGCGCCAGCGGGGATAAACC GGCAAAAACCGGGCAATCGCAAAAAGGCGTAAT GTGTTCCCCGCGCCAGCGGGGATAAACC TGTGTTTGCGGCATTAACGCTCACCAGCATTTC GTGTTCCCCGCGCCAGCGGGGATAAACC GACGTGGTCATGGGTGCTGCTGTTGCAGAGCCA GTGTTCCCCGCGCCAGCGGGGATAAACC GAGCAGATACACGGCTTTGTATTCCGTGCGCCC GTGTTCCCCGCGCCAGCGGGGATAAACC GAATAGCAATAGTCCATAGATTTGCGAAAACAG GTGTTCCCCGCGCCAGCGGGGATAAACC GGAGCCTGACGAGACTACTGAGGCCGTTCTGTC GAGTTCCCCGCGCCAGCGGGGATAAACC >CP034953|2|1|980239-980632|CRT GTGTTCCCCGCGCCAGCGGGGATAAACC GGCAAAAACCGGGCAATCGCAAAAAGGCGTAAT GTGTTCCCCGCGCCAGCGGGGATAAACC TGTGTTTGCGGCATTAACGCTCACCAGCATTTC GTGTTCCCCGCGCCAGCGGGGATAAACC GACGTGGTCATGGGTGCTGCTGTTGCAGAGCCA GTGTTCCCCGCGCCAGCGGGGATAAACC GAGCAGATACACGGCTTTGTATTCCGTGCGCCC GTGTTCCCCGCGCCAGCGGGGATAAACC GAATAGCAATAGTCCATAGATTTGCGAAAACAG GTGTTCCCCGCGCCAGCGGGGATAAACC GGAGCCTGACGAGACTACTGAGGCCGTTCTGTC GAGTTCCCCGCGCCAGCGGGGATAAACC
>CP034953.1|QAA88714.1|979227_979899_+|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAVIGRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >CP034953.1|QAA88713.1|978948_979089_-|hypothetical-protein MSEENKENGFNHVKTFTKIIFIFSVLVFNDNEYKITDAAVNLFIQI >CP034953.1|QAA88712.1|976704_978003_+|phosphopyruvate-hydratase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >CP034953.1|QAA88711.1|974979_976617_+|CTP-synthase-(glutamine-hydrolyzing) MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK >CP034953.1|QAA88710.1|973960_974752_+|nucleoside-triphosphate-pyrophosphohydrolase MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADSSAENSSEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >CP034953.1|QAA88709.1|973554_973890_+|endoribonuclease-MazF 
MVSRYVPDMGDLIWVDFDPTKGSEQAGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG >CP034953.1|QAA88708.1|973306_973555_+|MazF-MazE-toxin-antitoxin-system-antitoxin-MazE MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKEVW >CP034953.1|QAA88707.1|969964_971238_+|IS3-like-element-IS2-family-transposase MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEAVEYGRGKKVDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >CP034953.1|QAA88706.1|968309_969611_+|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKTLFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHDVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILAPQLEALLPKVRACLGSLQAMRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK >CP034953.1|QAA88705.1|965496_968253_-|two-component-sensor-histidine-kinase-BarA 
MTNYSLRARMMILILAPTVLIGLLLSIFFVVHRYNDLQRQLEDAGASIIEPLAVSTEYGMSLQNRESIGQLISVLHRRHSDIVRAISVYDENNRLFVTSNFHLDPSSMQLGSNVPFPRQLTVTRDGDIMILRTPIISESYSPDESPSSDAKNSQNMLGYIALELDLKSVRLQQYKEIFISSVMMLFCIGIALIFGWRLMRDVTGPIRNMVNTVDRIRRGQLDSRVEGFMLGELDMLKNGINSMAMSLAAYHEEMQHNIDQATSDLRETLEQMEIQNVELDLAKKRAQEAARIKSEFLANMSHELRTPLNGVIGFTRLTLKTELTPTQRDHLNTIERSANNLLAIINDVLDFSKLEAGKLILESIPFPLRSTLDEVVTLLAHSSHDKGLELTLNIKSDVPDNVIGDPLRLQQIITNLVGNAIKFTENGNIDILVEKRALSNTKVQIEVQIRDTGIGIPERDQSRLFQAFRQADASISRRHGGTGLGLVITQKLVNEMGGDISFHSQPNRGSTFWFHINLDLNPNIIIEGPSTQCLAGKRLAYVEPNSAAAQCTLDILSETPLEVVYSPTFSALPPAHYDMMLLGIAVTFREPLTMQHERLAKAVSMTDFLMLALPCHAQVNAEKLKQDGIGACLLKPLTPTRLLPALTEFCHHKQNTLLPVTDESKLAMTVMAVDDNPANLKLIGALLEDMVQHVELCDSGHQAVERAKQMPFDLILMDIQMPDMDGIRACELIHQLPHQQQTPVIAVTAHAMAGQKEKLLGAGMSDYLAKPIEEERLHNLLLRYKPGSGISSRVVTPEVNEIVVNPNATLDWQLALRQAAGKTDLARDMLQMLLDFLPEVRNKVEEQLVGENPEGLVDLIHKLHGSCGYSGVPRMKNLCQLIEQQLRSGTKEEDLEPELLELLDEMDNVAREASKILG >CP034953.1|QAA88715.1|981271_982750_-|sugar-kinase MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFAGHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIPRHMLFDVQMPGTVLGHITPQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDARAQDLSPEDLLNKKASCVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNHFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDNMVRVKDIFIPIESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNVDSIQSWSNA >CP034953.1|QAA88716.1|982776_984054_-|MFS-transporter MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQIAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSVLGLLTLTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGMVIVFTFLLFQKIRTADSAPAMASSK 
>CP034953.1|QAA91862.1|984372_985158_+|SDR-family-oxidoreductase MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVGITAEGAPQKIIAACCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASPASNYVNGHLLVVDGGYLVR >CP034953.1|QAA88717.1|985227_986682_+|FAD-linked-oxidoreductase MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYILEDMKTGFNILREIMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKVTGEGIAEIVARYPQCQRVDSKLIETWFNNLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >CP034953.1|QAA88718.1|986775_988113_+|MFS-transporter MNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLASIAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >CP034953.1|QAA88719.1|988090_988870_+|electron-transfer-flavoprotein-subunit-beta/FixA-family-protein MNILLAFKAEPDAGMLAEKEWQAAAQGKSGPDISLLRSLLGADEQAAAALLLAQRKNGTPMSLTALSMGDERALHWLRYLMALGFEEAVLLETAADLRFAPEFVARHIAEWQHQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDALFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAEMPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWQDYLRQRMQP >CP034953.1|QAA88720.1|988866_989727_+|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein 
MNIAIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSVRKSHWGNALTATLQTEKRPLCLSLARQAGAAKNATLPSGMQQLNIVPGALPDWLVSTEDLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKVVLEALVTNIHADCQ >CP034953.1|QAA88721.1|989874_990450_-|glycerol-3-phosphate-responsive-antiterminator MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >CP034953.1|QAA88722.1|990466_990727_-|ferredoxin-family-protein MSVARNLWRVADAPHIVPADSVERQTAERLINACPAGLFSLTPEGNLRIDYRSCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >CP034953.1|QAA88723.1|990717_991989_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERRITHESLSLLTPDGVTTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGRICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034953_3 | 1006183-1006945 | TypeI-E |
I-E
Consensus repeat of CP034953_3
|
12 spacers
spacers of CP034953_3
>3.1|1006212|32|CP034953|PILER-CR,CRISPRCasFinder,CRT CTTTCGCAGACGCGCGGCGATACGCTCACGCA >3.2|1006273|32|CP034953|PILER-CR,CRISPRCasFinder,CRT CAGCCGAAGCCAAAGGTGATGCCGAACACGCT >3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT GGCTCCCTGTCGGTTGTAATTGATAATGTTGA >3.4|1006395|33|CP034953|PILER-CR,CRISPRCasFinder,CRT TTTGGATCGGGTCTGGAATTTCTGAGCGGTCGC >3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT CGAATCGCGCATACCCTGCGCGTCGCCGCCTGC >3.6|1006519|32|CP034953|PILER-CR,CRISPRCasFinder,CRT TCAGCTTTATAAATCCGGAGATACGGAAACTA >3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT GACTCACCCCGAAAGAGATTGCCAGCCAGCTT >3.8|1006641|32|CP034953|PILER-CR,CRISPRCasFinder,CRT CTGCTGGAGCTGGCTGCAAGGCAAGCCGCCCA >3.9|1006702|32|CP034953|CRISPRCasFinder,CRT GGGGGCGCATGACCGTAAACATTATCCCCCGG >3.10|1006763|32|CP034953|CRISPRCasFinder,CRT GGAGTTCAGACATAGGTGGAATGATGGACTAC >3.11|1006824|32|CP034953|CRISPRCasFinder,CRT CCCGGTAGCCAGGTTTGCAACGCCTGAACCGA >3.12|1006885|32|CP034953|CRISPRCasFinder,CRT GCAACGACGGTGAGATTTCACGCCTGACGCTG |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around CP034953_3
The CRISPR arrays of CP034953_3 >merge|CP034953|3|1006183-1006945|PILER-CR,CRISPRCasFinder,CRT GAGTTCCCCGCGCCAGCGGGGATAAACCGCTTTCGCAGACGCGCGGCGATACGCTCACGCAGAGTTCCCCGCGCCAGCGGGGATAAACCGCAGCCGAAGCCAAAGGTGATGCCGAACACGCTGAGTTCCCCGCGCCAGCGGGGATAAACCGGGCTCCCTGTCGGTTGTAATTGATAATGTTGAGAGTTCCCCGCGCCAGCGGGGATAAACCGTTTGGATCGGGTCTGGAATTTCTGAGCGGTCGCGAGTTCCCCGCGCCAGCGGGGATAAACCGCGAATCGCGCATACCCTGCGCGTCGCCGCCTGCGAGTTCCCCGCGCCAGCGGGGATAAACCGTCAGCTTTATAAATCCGGAGATACGGAAACTAGAGTTCCCCGCGCCAGCGAGGATAAACCGGACTCACCCCGAAAGAGATTGCCAGCCAGCTTGAGTTCCCCGCGCCAGCGGGGATAAACCGCTGCTGGAGCTGGCTGCAAGGCAAGCCGCCCAGAGTTCCCCGCGCCAGCGGGGATAAACCGGGGGGCGCATGACCGTAAACATTATCCCCCGGGAGTTCCCCGCGCCAGCGGGGATAAACCGGGAGTTCAGACATAGGTGGAATGATGGACTACGAGTTCCCCGCGTTAGCGGGGATAAACCGCCCGGTAGCCAGGTTTGCAACGCCTGAACCGAGAGTTCCCCGCGCCAGCAGGGATAAACCGGCAACGACGGTGAGATTTCACGCCTGACGCTGGTGTTCCCCGCATCAGCGGGGATAAACCG >CP034953|3|2|1006183-1006701|PILER-CR GAGTTCCCCGCGCCAGCGGGGATAAACCG CTTTCGCAGACGCGCGGCGATACGCTCACGCA GAGTTCCCCGCGCCAGCGGGGATAAACCG CAGCCGAAGCCAAAGGTGATGCCGAACACGCT GAGTTCCCCGCGCCAGCGGGGATAAACCG GGCTCCCTGTCGGTTGTAATTGATAATGTTGA GAGTTCCCCGCGCCAGCGGGGATAAACCG TTTGGATCGGGTCTGGAATTTCTGAGCGGTCGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CGAATCGCGCATACCCTGCGCGTCGCCGCCTGC GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAGCTTTATAAATCCGGAGATACGGAAACTA GAGTTCCCCGCGCCAGCGAGGATAAACCG GACTCACCCCGAAAGAGATTGCCAGCCAGCTT GAGTTCCCCGCGCCAGCGGGGATAAACCG CTGCTGGAGCTGGCTGCAAGGCAAGCCGCCCA GAGTTCCCCGCGCCAGCGGGGATAAACCG >CP034953|3|3|1006183-1006945|CRISPRCasFinder GAGTTCCCCGCGCCAGCGGGGATAAACCG CTTTCGCAGACGCGCGGCGATACGCTCACGCA GAGTTCCCCGCGCCAGCGGGGATAAACCG CAGCCGAAGCCAAAGGTGATGCCGAACACGCT GAGTTCCCCGCGCCAGCGGGGATAAACCG GGCTCCCTGTCGGTTGTAATTGATAATGTTGA GAGTTCCCCGCGCCAGCGGGGATAAACCG TTTGGATCGGGTCTGGAATTTCTGAGCGGTCGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CGAATCGCGCATACCCTGCGCGTCGCCGCCTGC GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAGCTTTATAAATCCGGAGATACGGAAACTA GAGTTCCCCGCGCCAGCGAGGATAAACCG GACTCACCCCGAAAGAGATTGCCAGCCAGCTT GAGTTCCCCGCGCCAGCGGGGATAAACCG CTGCTGGAGCTGGCTGCAAGGCAAGCCGCCCA 
GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGGGCGCATGACCGTAAACATTATCCCCCGG GAGTTCCCCGCGCCAGCGGGGATAAACCG GGAGTTCAGACATAGGTGGAATGATGGACTAC GAGTTCCCCGCGTTAGCGGGGATAAACCG CCCGGTAGCCAGGTTTGCAACGCCTGAACCGA GAGTTCCCCGCGCCAGCAGGGATAAACCG GCAACGACGGTGAGATTTCACGCCTGACGCTG GTGTTCCCCGCATCAGCGGGGATAAACCG >CP034953|3|2|1006183-1006945|CRT GAGTTCCCCGCGCCAGCGGGGATAAACCG CTTTCGCAGACGCGCGGCGATACGCTCACGCA GAGTTCCCCGCGCCAGCGGGGATAAACCG CAGCCGAAGCCAAAGGTGATGCCGAACACGCT GAGTTCCCCGCGCCAGCGGGGATAAACCG GGCTCCCTGTCGGTTGTAATTGATAATGTTGA GAGTTCCCCGCGCCAGCGGGGATAAACCG TTTGGATCGGGTCTGGAATTTCTGAGCGGTCGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CGAATCGCGCATACCCTGCGCGTCGCCGCCTGC GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAGCTTTATAAATCCGGAGATACGGAAACTA GAGTTCCCCGCGCCAGCGAGGATAAACCG GACTCACCCCGAAAGAGATTGCCAGCCAGCTT GAGTTCCCCGCGCCAGCGGGGATAAACCG CTGCTGGAGCTGGCTGCAAGGCAAGCCGCCCA GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGGGCGCATGACCGTAAACATTATCCCCCGG GAGTTCCCCGCGCCAGCGGGGATAAACCG GGAGTTCAGACATAGGTGGAATGATGGACTAC GAGTTCCCCGCGTTAGCGGGGATAAACCG CCCGGTAGCCAGGTTTGCAACGCCTGAACCGA GAGTTCCCCGCGCCAGCAGGGATAAACCG GCAACGACGGTGAGATTTCACGCCTGACGCTG GTGTTCCCCGCATCAGCGGGGATAAACCG
>CP034953.1|QAA88735.1|1005792_1006077_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLPV >CP034953.1|QAA88734.1|1004873_1005791_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTWLPLNPIPLKDRVSMIFLQYGQIDVIDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVRLAAQVGTLLVWVGEAGVRVYASGQPGGARSDKLLYQAKLALDEDLRLKVVRKMFELRFGEPAPARRSVEQLRGIEGSRVRATYALLAKQYGVTWNGRRYDPKDWEKGDTINQCISAATSCLYGVTEAAILAAGYAPAIGFVHTGKPLSFVYDIADIIKFDTVVPKAFEIARRNPGEPDREVRLACRDIFRSSKTLAKLIPLIEDVLAAGEIQPPAPPEDAQPVAIPLPVSLGDAGHRSS >CP034953.1|QAA88733.1|1004258_1004858_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL >CP034953.1|QAA88732.1|1003597_1004272_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVSQ >CP034953.1|QAA88731.1|1002503_1003595_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGEA >CP034953.1|QAA88730.1|1002008_1002491_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRLLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA >CP034953.1|QAA88729.1|1000507_1002016_+|type-I-E-CRISPR-associated-protein-Cse1/CasA 
MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGGPSNG >CP034953.1|QAA88728.1|997426_1000093_+|CRISPR-associated-helicase/endonuclease-Cas3 MEPFKYICHYWGKSSKSLTKGNDIHLLIYHCLDVAAVADCWWDQSVVLQNTFCRNEMLSKQRVKAWLLFFIALHDIGKFDIRFQYKSAESWLKLNPATPSLNGPSTQMCRKFNHGAAGLYWFNQDSLSEQSLGDFFSFFDAAPHPYESWFPWVEAVTGHHGFILHSQDQDKSRWEMPASLASYAAQDKQAREEWISVLEALFLTPAGLSINDIPPDCSSLLAGFCSLADWLGSWTTTNTFLFNEDAPSDINALRTYFQDRQQDASRVLELSGLVSNKRCYEGVHALLDNGYQPRQLQVLVDALPVAPGLTVIEAPTGSGKTETALAYAWKLIDQQIADSVIFALPTQATANAMLTRMEASASHLFSSPNLILAHGNSRFNHLFQSIKSRAITEQGQEEAWVQCCQWLSQSNKKVFLGQIGVCTIDQVLISVLPVKHRFIRGLGIGRSVLIVDEVHAYDTYMNGLLEAVLKAQADVGGSVILLSATLPMKQKQKLLDTYGLHTDPVENNSAYPLINWRGVNGAQRFDLLAHPEQLPPRFSIQPEPICLADMLPDLTMLERMIAAANAGAQVCLICNLVDVAQVCYQRLKELNNTQVDIDLFHARFTLNDRREKENRVISNFGKNGKRNVGRILVATQVVEQSLDVDFDWLITQHCPADLLFQRLGRLHRHHRKYRPAGFEIPVATILLPDGEGYGRHEHIYSNVRVMWRTQQHIEELNGASLFFPDAYRQWLDSIYDDAEMDEPEWVGNGMDKFESAECEKRFKARKVLQWAEEYSLQDNDETILAVTRDGEMSLPLLPYVQTSSGKQLLDGQVYEDLSHEQQYEALALNRVNVPFTWKRSFSEVVDEDGLLWLEGKQNLDGWVWQGNSIVITYTGDEGMTRVIPANPK >CP034953.1|QAA88727.1|996333_997068_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >CP034953.1|QAA88726.1|994546_996259_+|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit 
MSEKHPGPLVVEGKLTDAERMKHESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPARPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKIAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD >CP034953.1|QAA88736.1|1007027_1008065_-|aminopeptidase MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEIFDKAGIAVLSVEATNWNLGNKDGYQQRAKTPAFPAGNSWHDVRLDNHQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >CP034953.1|QAA88737.1|1008316_1009225_+|sulfate-adenylyltransferase-subunit-2 MDQIRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >CP034953.1|QAA88738.1|1009226_1010654_+|sulfate-adenylyltransferase-subunit-CysN MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEETFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMPWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFAGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >CP034953.1|QAA88739.1|1010653_1011259_+|adenylyl-sulfate-kinase 
MALHDENVVWHSHPVTVQQRELHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >CP034953.1|QAA88740.1|1011308_1011632_+|DUF3561-family-protein MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >CP034953.1|QAA88741.1|1011825_1012137_+|cell-division-protein-FtsB MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNDDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR >CP034953.1|QAA88742.1|1012155_1012866_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSRFAQLPLANHPQITVVDGGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >CP034953.1|QAA88743.1|1012865_1013345_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >CP034953.1|QAA88744.1|1013341_1014391_+|tRNA-pseudouridine(13)-synthase-TruD MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >CP034953.1|QAA88745.1|1014371_1015133_+|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVTCSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034953_4 | 1537358-1537475 | Orphan |
NA
Consensus repeat of CP034953_4
|
1 spacer
spacers of CP034953_4
>4.1|1537389|56|CP034953|CRISPRCasFinder TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA |
CRISPR arrays and Neighbor proteins around CP034953_4
The CRISPR arrays of CP034953_4 >merge|CP034953|4|1537358-1537475|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGCTGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAACCGAGCCGTAGGCCGGATAAGGCGTTTACGC >CP034953|4|4|1537358-1537475|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGC TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA CCGAGCCGTAGGCCGGATAAGGCGTTTACGC
>CP034953.1|QAA89218.1|1536130_1537261_-|ribonucleoside-diphosphate-reductase-1-subunit-beta MAYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEALHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAVGLDLPFQTRSNPIPWINTWLVSDNVQVAPQEVEVSSYLVGQIDSEVDTDDLSNFQL >CP034953.1|QAA89217.1|1535876_1536131_-|ferredoxin-like-diferric-tyrosyl-radical-cofactor-maintenance-protein-YfaE MARVTLRITGTQLLCQDEHPSLLAALESHNVAVEYQCREGYCGSCRTRLVAGQVDWIAEPLAFIQPGEILPCCCRAKGDIEIEM >CP034953.1|QAA89216.1|1535172_1535823_+|lipopolysaccharide-kinase-InaA MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMTHHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRALLVTEDMAGFISIADWYAQHAVSPYSDEVRQAMLKAVALAFKKMHSINRQHGCCYVRHIYVKTEGNAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEPIPKADWEQVKAYYYAM >CP034953.1|QAA89215.1|1533633_1534710_+|glycerophosphodiester-phosphodiesterase MKLTLKNLSMAIMMSTIVMGSSAMAADSNEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDNLVVLHDHYLDRVTDVADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTFEEEIEFVQGLNHSTGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQCFDADELKRIKNELEPKMGMELNLVQLIAYTDWNETQQKQPDGSWVNYNYDWMFKPGAMKQVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKLPEYTPDVNQLYDALYNKAGVNGLFTDFPDKAVKFLNKE >CP034953.1|QAA89214.1|1532270_1533629_+|glycerol-3-phosphate-transporter MLSIFKPAPHKARLPAAEIDPTYRRLRWQIFLGIFFGYAAYYLVRKNFALAMPYLVEQGFSRGDLGFALSGISIAYGFSKFIMGSVSDRSNPRVFLPAGLILAAAVMLFMGFVPWATSSIAVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLFLLGMAWFNDWHAALYMPAFCAILVALFAFAMMRDTPQSCGLPPIEEYKNDYPDDYNEKAEQELTAKQIFMQYVLPNKLLWYIAIANVFVYLLRYGILDWSPTYLKEVKHFALDKSSWAYFLYEYAGIPGTLLCGWMSDKVFRGNRGATGVFFMTLVTIATIVYWMNPAGNPTVDMICMIVIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGSVAASAIVGYTVDFFGWDGGFMVMIGGSILAVILLIVVMIGEKRRHEQLLQERNGG 
>CP034953.1|QAA89213.1|1530369_1531998_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-A MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRNHGLLHSGARYAVTDAESARECISENQILKRIARHCVEPTNGLFITLPEDDLSFQATFIRACEEAGISAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDPFRLTAANMLDAKEHGAVILTAHEVTGLIREGATVCGVRVRNHLTGETQALHAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRINQHVINRCRKPSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAPVMAKTRILRAYSGVRPLVASDDDPSGRNVSRGIVLLDHAERDGLDGFITITGGKLMTYRLMAEWATDAVCRKLGNTRPCTTADLALPGSQEPAEVTLRKVISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVENLNVNSLLDLRRRTRVGMGTCQGELCACRAAGLLQRFNVTTSAQSIEQLSTFLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL >CP034953.1|QAA89212.1|1529120_1530380_-|glycerol-3-phosphate-dehydrogenase-subunit-GlpB MRFDTVIMGGGLAGLLCGLQLQKHGLRCAIVTRGQSALHFSSGSLDLLSHLPDGQPVTDIHSGLESLRQQAPAHPYSLLEPQRVLDLACQAQALIAESGAQLQGSVELAHQRVTPLGTLRSTWLSSPEVPVWPLPAKKICVVGISGLMDFQAHLAAASLRELGLAVETAEIELPELDVLRNNATEFRAVNIARFLDNEENWPLLLDALIPVANTCEMILMPACFGLADDKLWRWLNEKLPCSLMLLPTLPPSVLGIRLQNQLQRQFVRQGGVWMPGDEVKKVTCKNGVVNEIWTRNHADIPLRPRFAVLASGSFFSGGLVAERNGIREPILGLDVLQTATRGEWYKGDFFAPQPWQQFGVTTDETLRPSQAGQTIENLFAIGSVLGGFDPIAQGCGGGVCAVSALHAAQQIAQRAGGQQ >CP034953.1|QAA89211.1|1527933_1529124_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-C MNDTSFENCIKCTVCTTACPVSRVNPGYPGPKQAGPDGERLRLKDGALYDEALKYCINCKRCEVACPSDVKIGDIIQRARAKYDTTRPSLRNFVLSHTDLMGSVSTPFAPIVNTATSLKPVRQLLDAALKIDHRRTLPKYSFGTFRRWYRSVAAQQAQYKDQVAFFHGCFVNYNHPQLGKDLIKVLNAMGTGVQLLSKEKCCGVPLIANGFTDKARKQAITNVESIREAVGVKGIPVIATSSTCTFALRDEYPEVLNVDNKGLRDHIELATRWLWRKLDEGKTLPLKPLPLKVVYHTPCHMEKMGWTLYTLELLRNIPGLELTVLDSQCCGIAGTYGFKKENYPTSQAIGAPLFRQIEESGADLVVTDCETCKWQIEMSTSLRCEHPITLLAQALA >CP034953.1|QAA89210.1|1526841_1527741_-|ISNCY-family-transposase 
MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAYYSDVLWSVETSDGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHQDKGYDRVPLVVPLLFYHGETSPYPYSLNWLDEFDDPQLARQLYTEAFLLVDITIVPDDEIMQHRRIALLELIQKHIRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYLLQCGDTSRFTRFIEEIAERSPLQKERLMTIAERLRQEGHQIGWQEGMHEQAIKIALRMLEQGFEREIVLATTQLTDADIPNCH >CP034953.1|QAA89209.1|1526643_1526829_-|hypothetical-protein MTIAERLRQEGHQIGWQEGKLEGLHEQAIKIALRMLEQGFDRDQVLAATQLSEADLAANNH >CP034953.1|QAA89219.1|1537494_1539780_-|ribonucleoside-diphosphate-reductase-1-subunit-alpha MNQNLLVTKRDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADLISRDAPDYQYLAARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFKQMDTFIDHDRDMTFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAACLFSNYPRETRLQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINATSSAIVKYVSQRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMWHLEVESLLVLKNNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFFADQEEFERLYTKYEKDDSIRKQRVKAVELFSLMMQERASTGRIYIQNVDHCNTHSPFDPAIAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELEELAILAVRALDALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQYYLLKASNELAKEQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTHGLRNSTLSALMPSETSSQISNATNGIEPPRGYVSIKASKDGILRQVVPDYEHLHDAYELLWEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLTAYKFGVKTLYYQNTRDGAEDAQDDLVPSIQDDGCESGACKI >CP034953.1|QAA89220.1|1540475_1544228_+|AIDA-I-family-autotransporter-YfaL 
MRIIFLRKEYLSLLPSMIASLFSANGVAAVTDSCQGYDVKASCQASRQSLSGITQDWSIADGQWLVFSDMTNNASGGAVFLQQGAEFSLLPENETGMTLFANNTVTGEYNNGGAIFAKENSTLNLTDVIFSGNVAGGYGGAIYSSGTNDTGAVDLRVTNAMFRNNIANDGKGGAIYTINNDVYLSDVIFDNNQAYTSTSYSDGDGGAIDVTDNNSDSKHPSGYTIVNNTAFTNNTAEGYGGAIYTNSVTAPYLIDISVDDSYSQNGGVLVDENNSAAGYGDGPSSAAGGFMYLGLSEVTFDIADGKTLVIGNTENDGAVDSIAGTGLITKTGSGDLVLNADNNDFTGEMQIENGEVTLGRSNSLMNVGDTHCQDDPQDCYGLTIGSIDQYQNQAELNVGSTQQTFVHALTGFQNGTLNIDAGGNVTVNQGSFAGIIEGAGQLTIAQNGSYVLAGAQSMALTGDIVVDDGAVLSLEGDAADLTALQDDPQSIVLNGGVLDLSDFSTWQSGTSYNDGLEVSGSSGTVIGSQDVVDLAGGDNLHIGGDGKDGVYVVVDASDGQVSLANNNSYLGTTQIASGTLMVSDNSQLGDTHYNRQVIFTDKQQESVMEITSDVDTRSDAAGHGRDIEMRADGEVAVDAGVDTQWGALMADSSGQHQDEGSTLTKTGAGTLELTASGTTQSAVRVEEGTLKGDVADILPYASSLWVGDGATFVTGADQDIQSIDAISSGTIDISDGTVLRLTGQDTSVALNASLFNGDGTLVNATDGVTLTGELNTNLETDSLTYLSNVTVNGNLTNTSGAVSLQNGVAGDTLTVNGDYTGGGTLLLDSELNGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAKVGGYLNNLRAANQAFMMERRDHAGGDGQTLNLRVIGGDYHYTAAGQLAQHEDTSTVQLSGDLFSGRWGTDGEWMLGIVGGYSDNQGDSRSNMTGTRADNQNHGYAVGLTSSWFQHGNQKQGAWLDSWLQYAWFSNDVSEQEDGTDHYHSSGIIASLEAGYQWLPGRGVVIEPQAQVIYQGVQQDDFTAANRARVSQSQGDDIQTRLGLHSEWRTAVHVIPTLDLNYYHDPHSTEIEEDGSTISDDAVKQRGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW >CP034953.1|QAA91875.1|1544355_1545078_-|bifunctional-3-demethylubiquinone-3-O-methyltransferase/2-octaprenyl-6-hydroxy-phenol-methylase MNAEKSPVNHNVDHEEIAKFEAVASRWWDLEGEFKPLHRINPLRLGYIAERAGGLFGKKVLDVGCGGGILAESMAREGATVTGLDMGFEPLQVAKLHALESGIQVDYVQETVEEHAAKHAGQYDVVTCMEMLEHVPDPQSVVRACAQLVKPGGDVFFSTLNRNGKSWLMAVVGAEYILRMVPKGTHDVKKFIKPAELLGWVDQTSLKERHITGLHYNPITNTFKLGPGVDVNYMLHTQNK >CP034953.1|QAA89221.1|1545224_1547852_+|DNA-gyrase-subunit-A 
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAPTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITDAGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE >CP034953.1|QAA89222.1|1548000_1549689_+|DUF2138-domain-containing-protein MSGEKKAKGWRFYGLVGFGAIALLSAGVWALQYAGSGPEKTLSPLVVHNNLQIDLNEPDLFLDSDSLSQLPKDLLTIPFLHDVLSEDFVFYYQNHADRLGIEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETVPVYQLRYNGNNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIAGDLLSGKKRWQASFGLEERTAEKTPVRQRIVVSARWLGFGYQRLMPSFAGVRFEMGNDGWHSFVALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQENDKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGAHESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYFFRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYINPQGIAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLAQMEPGAAWQWLPITWQPL >CP034953.1|QAA89223.1|1549685_1550309_+|DUF1175-domain-containing-protein MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYRLNFLAR >CP034953.1|QAA89224.1|1554847_1556497_+|DUF2300-domain-containing-protein 
MNWRRIVWLLALVTLPTLAEETPLQLVLRGAQHDQLYQLSSSGVTKVSALPDSLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESITRDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLTTLKPETSVTVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWFADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQVASGQCVEVELFARYPLKKITAEKSTTAVNPGVLNGRYRVTFTNGNHITFVSHGETTLLSEKGKLKLQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLTIPDSSATQRVSASPATTGARTMTAWTQDLIYAGDPVHYHGSRATEGTLSWRQATAQAGQGERYDQILAFAYPDNSLSRWGAPRSTCQLLPKAKAWLAKKMPQWRRILQAETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRLDLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD >CP034953.1|QAA89225.1|1556501_1557278_+|DUF2135-domain-containing-protein MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPAEGEDASFSQSINYPASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDGSFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWDTDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPIHGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGELTLVKSFDW >CP034953.1|QAA89226.1|1557351_1558536_-|acetyl-CoA-acetyltransferase MKNCVIVSAVRTAIGSFNGSLASTSAIDLGATVIKAAIERAKIDSQHVDEVIMGNVLQAGLGQNPARQALLKSGLAETVCGFTVNKVCGSGLKSVALAAQAIQAGQAQSIVAGGMENMSLAPYLLDAKARSGYRLGDGQVYDVILRDGLMCATHGYHMGITAENVAKEYGITREMQDELALHSQRKAAAAIESGAFTAEIVPVNVVTRKKTFVFSQDEFPKANSTAEALGALRPAFDKAGTVTAGNASGINDGAAALVIMEESAALAAGLTPLARIKSYASGGVPPALMGMGPVPATQKALQLAGLQLADIDLIEANEAFAAQFLAVGKNLGFDSEKVNVNGGAIALGHPIGASGARILVTLLHAMQARDKTLGLATLCIGGGQGIAMVIERLN >CP034953.1|QAA89227.1|1558566_1559889_-|TIGR00366-family-protein MIGRISRFMTRFVSRWLPDPLIFAMLLTLLTFVIALWLTPQTPISMVKMWGDGFWNLLAFGMQMALIIVTGHALASSAPVKSLLRTAASAAKTPVQGVMLVTFFGSVACVINWGFGLVVGAMFAREVARRVPGSDYPLLIACAYIGFLTWGGGFSGSMPLLAATPGNPVEHIAGLIPVGDTLFSGFNIFITVALIVVMPFITRMMMPKPSDVVSIDPKLLMEEADFQKQLPKDAPPSERLEESRILTLIIGALGIAYLAMYFSEHGFNITINTVNLMFMIAGLLLHKTPMAYMRAISAAARSTAGILVQFPFYAGIQLMMEHSGLGGLITEFFINVANKDTFPVMTFFSSALINFAVPSGGGHWVIQGPFVIPAAQALGADLGKSVMAIAYGEQWMNMAQPFWALPALAIAGLGVRDIMGYCITALLFSGVIFVIGLTLF |
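The coordinates in the CP034953_4 record above appear to follow a simple convention: the array location is a 1-based inclusive range, and the spacer header fields read `id|start|length|contig|tool`. This reading is inferred from the numbers in the record itself, not taken from CRISPRCasFinder documentation. A minimal Python sketch, using the hypothetical helper name `check_array`, that tests the convention against the repeat, spacer, repeat segments listed for this array:

```python
def check_array(location, segments, spacer_header):
    """Check a repeat/spacer/repeat array against its reported coordinates.

    Assumes 1-based inclusive coordinates and a spacer header of the form
    '>id|start|length|contig|tool' (conventions inferred from the record).
    """
    start, end = (int(x) for x in location.split("-"))
    repeat1, spacer, repeat2 = segments
    _, spacer_start, spacer_len, _, _ = spacer_header.lstrip(">").split("|")
    return {
        # The three segments should tile the reported range exactly.
        "array_length_ok": len(repeat1) + len(spacer) + len(repeat2) == end - start + 1,
        # The spacer should begin right after the first repeat.
        "spacer_start_ok": start + len(repeat1) == int(spacer_start),
        # The header's length field should match the spacer sequence.
        "spacer_length_ok": len(spacer) == int(spacer_len),
    }


result = check_array(
    "1537358-1537475",
    (
        "CCGAGCCGTAGGCCGGATAAGGCGTTCACGC",
        "TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA",
        "CCGAGCCGTAGGCCGGATAAGGCGTTTACGC",
    ),
    ">4.1|1537389|56|CP034953|CRISPRCasFinder",
)
print(result)
```

All three checks hold for this record. The two repeat copies are compared only by length here, since they differ at one position and an exact-match test would reject them.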
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034953_5 | 2135099-2135222 | Orphan |
NA
Consensus repeat of CP034953_5
|
1 spacer
spacers of CP034953_5
>5.1|2135142|38|CP034953|CRISPRCasFinder CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA |
CRISPR arrays and Neighbor proteins around CP034953_5
The CRISPR arrays of CP034953_5 >merge|CP034953|5|2135099-2135222|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA >CP034953|5|5|2135099-2135222|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>CP034953.1|QAA89739.1|2134658_2134964_-|monooxygenase MATLLQLHFAFNGPFGDAMAEQLKPLAESINQEPGFLWKVWTESEKNHEAGGIYLFTDEKSALAYLEKHTARLKNLGVEEVVAKVFDVNEPLSQINQAKLA >CP034953.1|QAA89738.1|2132928_2134533_-|FAD-NAD(P)-binding-protein MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENSKMMLANIASIEIPPIYCTYLEWLQKQEDSHLQRYGVKKETLHDRQFLPRILLGEYFRDQFLRLVDQARQQKFAVAVYESCQVTDLQITNAGVMLATNQDLPSETFDLAVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTSLSGLDAAMAVAIQHGSFIEDDKQHVVFHRDNASEKLNITLLSRTGILPEADFYCPIPYEPLHIVTDQALNAEIQKGEEGLLDRVFRLIVEEIKFADPDWSQRIALESLNVDSFAQAWFAERKQRDPFDWAEKNLQEVERNKREKHTVPWRYVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLALREAGIIHILALGEDYKMEINESRTVLKTEDNSYSFDVFIDARGQRPLKVKDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEDIRGRVAFGALPWLMHDQPFVQGLTACAEIGEAMARAVVKPASRARRRLSFD >CP034953.1|QAA89737.1|2132104_2132917_+|hypothetical-protein MIITQADLREWRIGAVMYRWFLRHFPRGGSYADIHHALIEEGYTDWAESLVEYAWKKWLADENFAHQEVSSMQKLATDPGERPFCSQFARSDDHARIGCCEDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSRISSAGDSSRIANTGMRVRVCTLGERCHVASNGDLVQIASFGANARIANSGDNVHIIASGENSTVVSTGVVDSIILGPGGSAALAYHDGERVRFAVAIEGENNIRAGVRYRLNEQHQFVEC >CP034953.1|QAA89736.1|2131315_2132101_+|thiosulfate-reductase-cytochrome-B-subunit MNPSQHAEQFQSQLANYVPQFTPEFWPVWLIIAGVLLVGMWLVLGLHALLRARGVKKSATDHGEKIYLYSKAVRLWHWSNALLFVLLLASGLINHFAMVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGDNGHHYRIRRQGWLERAAKQTRFYLFGIMQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGLLCLYPQAVGDVFPGVRYWLLQTHFALAFISLFFIFGHLYLCTTGRTPHETFKSMVDGYHRH >CP034953.1|QAA89735.1|2130650_2131319_+|4Fe-4S-dicluster-domain-containing-protein MSFTRRKFVLGMGTVIFFTGSASSLLANTRQEKEVRYAMIHDESRCNGCNICARACRKTNHVPAQGSRLSIAHIPVTDNDNETQYHFFRQSCQHCEDAPCIDVCPTGASWRDEQGIVRVEKSQCIGCSYCIGACPYQVRYLNPVTKVADKCDFCAESRLAKGFPPICVSACPEHALIFGREDSPEIQAWLQQNKYYQYQLPGAGKPHLYRRFGQHLIKKENV >CP034953.1|QAA89734.1|2129939_2130587_+|YdhW-family-putative-oxidoreductase-system-protein 
MGKMNHQDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHSAAGQLVARAVFLSPPYLVAEEELSVLLESIKQNGDYADIACLTGSKDDYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAPYYFQEAQIEAAIAAMDVAPEYADIRQVESSTAVLYLFSERFMTYGKAYGLCEWFEVEQFQNP >CP034953.1|QAA89733.1|2127833_2129936_+|aldehyde-ferredoxin-oxidoreductase MANGWTGNILRVNLTTGNITLEDSSKFKSFVGGMGFGYKIMYDEVPPGTKPFDEANKLVFATGPLTGSGAPCSSRVNITSLSTFTKGNLVVDAHMGGFFAAQMKFAGYDVIIIEGKAKSPVWLKIKDDKVSLEKADFLWGKGTRATTEEICRLTSPETCVAAIGQAGENLVPLSGMLNSRNHSGGAGTGAIMGSKNLKAIAVEGTKGVNIADRQEMKRLNDYMMTELIGANNNHVVPSTPQSWAEYSDPKSRWTARKGLFWGAAEGGPIETGEIPPGNQNTVGFRTYKSVFDLGPAAEKYTVKMSGCHSCPIRCMTQMNIPRVKEFGVPSTGGNTCVANFVHTTIFPNGPKDFEDKDDGRVIGNLVGLNLFDDYGLWCNYGQLHRDFTYCYSKGVFKRVLPAEEYAEIRWDQLEAGDVNFIKDFYYRLAHRVGELSHLADGSYAIAERWNLGEEYWGYAKNKLWSPFGYPVHHANEASAQVGSIVNCMFNRDCMTHTHINFIGSGLPLKLQREVAKELFGSEDAYDETKNYTPINDAKIKYAKWSLLRVCLHNAVTLCNWVWPMTVSPLKSRNYRGDLALEAKFFKAITGEEMTQEKLDLAAERIFTLHRAYTVKLMQTKDMRNEHDLICSWVFDKDPQIPVFTEGTDKMDRDDMHASLTMFYKEMGWDPQLGCPTRETLQRLGLEDIAADLAAHNLLPA >CP034953.1|QAA89732.1|2127186_2127813_+|ferredoxin-like-protein MNPVDRPLLDIGLTRLEFLRISGKGLAGLTIAPALLSLLGCKQEDIDSGTVGLINTPKGVLVTQRARCTGCHRCEISCTNFNDGSVGTFFSRIKIHRNYFFGDNGVGSGGGLYGDLNYTADTCRQCKEPQCMNVCPIGAITWQQKEGCITVDHKRCIGCSACTTACPWMMATVNTESKKSSKCVLCGECANACPTGALKIIEWKDITV >CP034953.1|QAA89731.1|2126522_2126732_+|fumarate-hydratase-FumD MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRIQRSELEKQAMETVINALVK >CP034953.1|QAA89730.1|2124553_2125966_-|pyruvate-kinase-I MKKTKIVCTIGPKTESEEMLAKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKTAAILLDTKGPEIRTMKLEGGNDVSLKAGQTFTFTTDKSVIGNSEMVAVTYEGFTTDLSVGNTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNLPGVSIALPALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVIEIREHLKAHGGENIHIISKIENQEGLNNFDEILEASDGIMVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATQMLDSMIKNPRPTRAEAGDVANAILDGTDAVMLSGESAKGKYPLEAVSIMATICERTDRVMNSRLEFNNDNRKLRITEAVCRGAVETAEKLDAPLIVVATQGGKSARAVRKYFPDATILALTTNEKTAHQLVLSKGVVPQLVKEITSTDDFYRLGKELALQSGLAHKGDVVVMVSGALVPSGTTNTASVHVL >CP034953.1|QAA89740.1|2135536_2136793_+|hypothetical-protein 
MGSDAKNLMSDGNVQIVKTGEVIGATQLTEGELIVEAGGRAENTVVTGAGWLKVATGGIAKCTQYGNNGTLSVSDGAIATDIVQSEGGAISLSTLATVNGRHPEGEFSVDKGYACGLLLENGGNLRVLEGHRAEKIILDQEGGLLVNGTTSAVVVDEGGELLVYPGGEASNCEINQGGVFMLAGKASDTLLAGGTMNNLGGEDSDTIVENGSIYRLGTDGLQLYSSGKTQNLSVNVGGRAEVHAGTLENAVIQGGTVILLSPTSADENFVVEEDRAPVELTGSVALLDGASMIIGYGAELQQSTITVQQGGVLILDGSTVKGDSVTFIVGNINLNGGKLWLITDAATHVQLKVKRLRGEGAICLQTSAKEISPDFINVKGEVTGDIHVEITDASRQTLCNALKLQPDEDGIGATLQPA >CP034953.1|QAA89741.1|2136833_2138207_-|multidrug-efflux-MATE-transporter-MdtK MQKYISEARLLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGRRERIAHQVRQGFWLAGFVSVLIMLVLWNAGYIIRSMENIDPALADKAVGYLRALLWGAPGYLFFQVARNQCEGLAKTKPGMVMGFIGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFLAMVSYIKRARSMRDIRNEKGTAKPDPAVMKRLIQLGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLMVGVCMATLTAIFTVSLREQIALLYNDNPEVVTLAAHLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFYITFTAYWVLGLPSGYILALTDLVVEPMGPAGFWIGFIIGLTSAAIMMMLRMRFLQRLPSAIILQRASR >CP034953.1|QAA89742.1|2138421_2139063_+|riboflavin-synthase MFTGIVQGTAKLVSIDEKPNFRTHVVELPDHMLDGLETGASVAHNGCCLTVTEINGNHVSFDLMKETLRITNLGDLKVGDWVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQIWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARENAMNQPGTEA >CP034953.1|QAA89743.1|2139102_2140251_-|cyclopropane-fatty-acyl-phospholipid-synthase MSSSCIEEVSVPDDNWYRIANELLSRAGIAINGSAPADIRVKNPDFFKRVLQEGSLGLGESYMDGWWECDRLDMFFSKVLRAGLENQLPHHFKDTLRIAGARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPFMQYSCAYWKDADNLESAQQAKLKMICEKLQLKPGMRVLDIGCGWGGLAHYMASNYDVSVVGVTISAEQQKMAQERCEGLDVTILLQDYRDLNDQFDRIVSVGMFEHVGPKNYDTYFAVVDRNLKPEGIFLLHTIGSKKTDLNVDPWINKYIFPNGCLPSVRQIAQSSEPHFVMEDWHNFGADYDTTLMAWYERFLAAWPEIADNYSERFKRMFTYYLNACAGAFRARDIQLWQVVFSRGVENGLRVAR >CP034953.1|QAA89744.1|2140541_2141753_-|Bcr/CflA-family-multidrug-efflux-MFS-transporter 
MQPGKRFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPASAVSASLSLFLAGFAAAQLLWGPLSDRYGRKPVLLIGLTIFALGSLGMLWVENAATLLVLRFVQAVGVCAAAVIWQALVTDYYPSQKVNRIFAAIMPLVGLSPALAPLLGSWLLVHFSWQAIFATLFAITVVLILPIFWLKPTTKARNNSQDGLTFTDLLRSKTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVIGLSYVPQTIAFLIGGYGCRAALQKWQGKQLLPWLLVLFAVSVIATWAAGFISHVSLVEILIPFCVMAIANGAIYPIVVAQALRPFPHATGRAAALQNTLQLGLCFLASLVVSWLISISTPLLTTTSVMLSTVVLVALGYMMQRCEEVGCQNHGNAEVAHSESH >CP034953.1|QAA89745.1|2141865_2142798_+|LysR-family-transcriptional-regulator MWSEYSLEVVDAVARNGSFSAAAQELHRVPSAVSYTVRQLEEWLAVPLFERRHRDVELTAAGAWFLKEGRSVVKKMQITRQQCQQIANGWRGQLAIAVDNIVRPERTRQMIVDFYRHFDDVELLVFQEVFNGVWDALSDGRVELAIGATRAIPVGGRYAFRDMGMLSWSCVVASHHPLALMDGPFSDDTLRNWPSLVREDTSRTLPKRITWLLDNQKRVVVPDWESSATCISAGLCIGMVPTHFAKPWLNEGKWVALELENPFPDSACCLTWQQNDMSPALTWLLEYLGDSETLNKEWLREPEETPATGD >CP034953.1|QAA89746.1|2142794_2143820_-|HTH-type-transcriptional-repressor-PurR MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVIDNAFEGGYMAGRYLIERGHREIGVIPGPLERNTGAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRPTAVFCGGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREEPQSIEVHPRLIERRSVADGPFRDYRR >CP034953.1|QAA89747.1|2144118_2144208_+|YnhF-family-membrane-protein MSTDLKFSLVTTIIVLGLIVAVGLTAALH >CP034953.1|QAA89748.1|2144373_2145543_+|MFS-transporter MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRSALIFLMAIFTLGNVLSAIAPDYMTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTLANIGGVPAATWLGETIGWRMSFLATAGLGVISMVSLFFSLPKGGAGARPEVKKELAVLMRPQVLSALLTTVLGAGAMFTLYTYISPVLQSITHATPVFVTAMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEFGAAISMVVWGAATFAVVPPLQMRVMRVASEAPGLSSSVNIGAFNLGNALGAAAGGAVISAGLGYSFVPVMGAIVAGLALLLVFMSARKQPETVCVANS >CP034953.1|QAA89749.1|2145704_2146286_-|superoxide-dismutase-[Fe] 
MSFELPALPYAKDALAPHISAETIEYHYGKHHQTYVTNLNNLIKGTAFEGKSLEEIIRSSEGGVFNNAAQVWNHTFYWNCLAPNAGGEPTGKVAEAIAASFGSFADFKAQFTDAAIKNFGSGWTWLVKNSDGKLAIVSTSNAGTPLTTDATPLLTVDVWEHAYYIDYRNARPGYLEHFWALVNWEFVAKNLAA |
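The neighbor-protein entries in this record use FASTA-style headers of the form `>contig|protein_accession|start_end_strand|description`, with hyphens standing in for spaces in the description. A short Python sketch for splitting such a header into fields follows; the function name `parse_protein_header` and the field names are illustrative choices, and the layout is inferred from the headers shown here, not from a published format specification:

```python
from typing import NamedTuple


class ProteinHeader(NamedTuple):
    contig: str
    accession: str
    start: int
    end: int
    strand: str
    description: str


def parse_protein_header(header: str) -> ProteinHeader:
    """Split a '>contig|accession|start_end_strand|description' header."""
    # Limit the split to 3 so a '|' inside the description, if any, is preserved.
    contig, accession, coords, description = header.lstrip(">").split("|", 3)
    start, end, strand = coords.split("_")
    return ProteinHeader(contig, accession, int(start), int(end), strand, description)


hdr = parse_protein_header(
    ">CP034953.1|QAA89749.1|2145704_2146286_-|superoxide-dismutase-[Fe]"
)
print(hdr.accession, hdr.start, hdr.end, hdr.strand)
```

The description is kept verbatim because names such as `2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase` contain genuine hyphens that cannot be told apart from substituted spaces.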
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034953_7 | 3061743-3061887 | Orphan |
NA
Consensus repeat of CP034953_7
|
1 spacer
spacers of CP034953_7
>7.1|3061795|41|CP034953|CRISPRCasFinder TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC |
CRISPR arrays and Neighbor proteins around CP034953_7
The CRISPR arrays of CP034953_7 >merge|CP034953|7|3061743-3061887|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGCTGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTCGTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC >CP034953|7|7|3061743-3061887|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC
>CP034953.1|QAA90573.1|3060393_3061677_+|putative-acyl-CoA-thioester-hydrolase MNTFSVSRLALALAFGVTLTACSSTPPDQRPSDQTAPGTSSRPILSAKEAQNFDAQHYFASLTPGAAAWNPSPITLPAQPDFVVGPAGTQGVTHTTIQAAVDAAIIKRTNKRQYIAVMPGEYQGTVYVPAAPGGITLYGTGEKPIDVKIGLSLDGGMSPADWRHDVNPRGKYMPGKPAWYMYDSCQSKRSDSIGVLCSAVFWSQNNGLQLQNLTIENTLGDSVDAGNHPAVALRTDGDQVQINNVNILGRQNTFFVTNSGVQNRLETNRQPRTLVTNSYIEGDVDIVSGRGAVVFDNTEFRVVNSRTQQEAYVFAPATLSNIYYGFLAVNSRFNAFGDGVAQLGRSLDVDANTNGQVVIRDSAINEGFNTAKPWADAVISNRPFAGNTGSVDDNDEIQRNLNDTNYNRMWEYNNRGVGSKVVAEAKK >CP034953.1|QAA90572.1|3059765_3060242_+|kinase-inhibitor MKLISNDLRDGDKLPHRHVFNGMGYDGDNISPHLAWDDVPAGTKSFVVTCYDPDAPTGSGWWHWVVVNLPADTRVLPQGFGSGLVAMPDGVLQTRTDFGKTGYDGAAPPKGETHRYIFTVHALDIERIDVDEGASGAMVGFNVHFHSLASASITAMFS >CP034953.1|QAA90571.1|3058417_3059707_+|adenosylmethionine--8-amino-7-oxononanoate-transaminase MTTDDLAFDQRHILHPYTSMTSPLPVYPVVSAEGCELILSDGRRLVDGMSSWWAAIHGYNHPQLNAAMKSQIDAMSHVMFGGITHAPAIELCRKLVAMTPQPLECVFLADSGSVAVEVAMKMALQYWQAKGEARQRFLTFRNGYHGDTFGAMSVCDPDNSMHSLWKGYLPENLFAPAPQSRMDGEWDERDMVGFARLMAAHRHEIAAVIIEPIVQGAGGMRMYHPEWLKRIRKICDREGILLIADEIATGFGRTGKLFACEHAEIAPDILCLGKALTGGTMTLSATLTTREVAETISNGEAGCFMHGPTFMGNPLACAAANASLAILESGDWQQQVADIEVQLREQLAPARDAEMVADVRVLGAIGVVETTHPVNMAALQKFFVEQGVWIRPFGKLIYLMPPYIILPQQLQRLTAAVNRAVQDETFFCQ >CP034953.1|QAA90570.1|3057290_3058331_-|biotin-synthase MAHRPRWTLSQVTELFEKPLLDLLFEAQQVHRQHFDPRQVQVSTLLSIKTGACPEDCKYCPQSSRYKTGLEAERLMEVEQVLESARKAKAAGSTRFCMGAAWKNPHERDMPYLEQMVQGVKAMGLEACMTLGTLSESQAQRLANAGLDYYNHNLDTSPEFYGNIITTRTYQERLDTLEKVRDAGIKVCSGGIVGLGETVKDRAGLLLQLANLPTPPESVPINMLVKVKGTPLADNDDVDAFDFIRTIAVARIMMPTSYVRLSAGREQMNEQTQAMCFMAGANSIFYGCKLLTTPNPEEDKDLQLFRKLGLNPQQTAVLAGDNEQQQRLEQALMTPDTDEYYNAAAL >CP034953.1|QAA90569.1|3056139_3057294_-|8-amino-7-oxononanoate-synthase 
MSWQEKINAALDARRAADALRRRYPVAQGAGRWLVADDRQYLNFSSNDYLGLSHHPQIIRAWQQGAEQFGIGSGGSGHVSGYSVVHQALEEELAEWLGYSRALLFISGFAANQAVIAAMMAKEDRIAADRLSHASLLEAASLSPSQLRRFAHNDVTHLARLLASPCPGQQMVVTEGVFSMDGDSAPLAEIQQVTQQHNGWLMVDDAHGTGVIGEQGRGSCWLQKVKPELLVVTFGKGFGVSGAAVLCSSTVADYLLQFARHLIYSTSMPPAQAQALRASLAVIRSDEGDARREKLAALITRFRAGVQDLPFTLADSCSAIQPLIVGDNSRALQLAEKLRQQGCWVTAIRPPTVPAGTARLRLTLTAAHEMQDIDRLLEVLHGNG >CP034953.1|QAA90568.1|3055397_3056153_-|malonyl-ACP-O-methyltransferase-BioC MATVNKQAIAAAFGRAAAHYEQHADLQRQSADALLAMLPQRKYTHVLDAGCGPGWMSRHWRERHAQVTALDLSPPMLVQARQKDAADHYLAGDIESLPLATATFDLAWSNLAVQWCGNLSTALRELYRVVRPKGVVAFTTLVQGSLPELHQAWQAVDERPHANRFLPPDEIEQSLNGVHYQHHIQPITLWFDDALSAMRSLKGIGATHLHEGRDPRILTRSQLQRLQLAWPQQQGRYPLTYHLFLGVIARE >CP034953.1|QAA90567.1|3054727_3055405_-|ATP-dependent-dethiobiotin-synthetase-BioD MSKRYFVTGTDTEVGKTVASCALLQAAKAAGYRTAGYKPVASGSEKTPEGLRNSDALALQRNSSLQLDYATVNPYTFAEPTSPHIISAQEGRPIESLVMSAGLRALEQQADWVLVEGAGGWFTPLSDTFTFADWVTQEQLPVILVVGVKLGCINHAMLTAQVIQHAGLTLAGWVANDVTPPGKRHAEYMTTLTRMIPAPLLGEIPWLAENPENAATGKYINLALL >CP034953.1|QAA90566.1|3052127_3054149_-|excinuclease-ABC-subunit-B MSKPFKLNSAFKPSGDQPEAIRRLEEGLEDGLAHQTLLGVTGSGKTFTIANVIADLQRPTMVLAPNKTLAAQLYGEMKEFFPENAVEYFVSYYDYYQPEAYVPSSDTFIEKDASVNEHIEQMRLSATKAMLERRDVVVVASVSAIYGLGDPDLYLKMMLHLTVGMIIDQRAILRRLAELQYARNDQAFQRGTFRVRGEVIDIFPAESDDIALRVELFDEEVERLSLFDPLTGQIVSTIPRFTIYPKTHYVTPRERIVQAMEEIKEELAARRKVLLENNKLLEEQRLTQRTQFDLEMMNELGYCSGIENYSRFLSGRGPGEPPPTLFDYLPADGLLVVDESHVTIPQIGGMYRGDRARKETLVEYGFRLPSALDNRPLKFEEFEALAPQTIYVSATPGNYELEKSGGDVVDQVVRPTGLLDPIIEVRPVATQVDDLLSEIRQRAAINERVLVTTLTKRMAEDLTEYLEEHGERVRYLHSDIDTVERMEIIRDLRLGEFDVLVGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNVNGKAILYGDKITPSMAKAIGETERRREKQQKYNEEHGITPQGLNKKVVDILALGQNIAKTKAKGRGKSRPIVEPDNVPMDMSPKALQQKIHELEGLMMQHAQNLEFEEAAQIRDQLHQLRELFIAAS >CP034953.1|QAA90565.1|3051027_3051936_+|uridine-diphosphate-N-acetylglucosamine-binding-protein-YvcK 
MRNRTLADLDRVVALGGGHGLGRVLSSLSSLGSRLTGIVTTTDNGGSTGRIRRSEGGIAWGDMRNCLNQLITEPSVASAMFEYRFGGNGELSGHNLGNLMLKALDHLSVRPLEAINLIRNLLKVDTHLIPMSEHPVDLMAIDDQGHEVYGEVNIDQLTTPIQELLLTPNVPATREAVHAINEADLIIIGPGSFYTSLMPILLLKEIAQALRRTPAPMVYIGNLGRELSLPAANLKLESKLAIMEQYVGKKVIDAVIVGPKVDVSAVKERIVIQEVLEASDIPYRHDRQLLHNALEKALQALG >CP034953.1|QAA90564.1|3049641_3050631_-|GTP-3',8-cyclase-MoaA MASQLTDAFARKFYYLRLSITDVCNFRCTYCLPDGYKPSGVTNKGFLTVDEIRRVTRAFARLGTEKVRLTGGEPSLRRDFTDIIAAVRENDAIRQIAVTTNGYRLERDVASWRDAGLTGINVSVDSLDARQFHAITGQDKFNQVMAGIDAAFEAGFEKVKVNTVLMRDVNHHQLDTFLNWIQHRPIQLRFIELMETGEGSELFRKHHISGQVLRDELLRRGWIHQLRQRSDGPAQVFCHPDYAGEIGLIMPYEKDFCATCNRLRVSSIGKLHLCLFGEGGVNLRDLLEDDTQQQALEARISAALREKKQTHFLHQNNTGITQNLSYIGG >CP034953.1|QAA90574.1|3061910_3064172_-|hydratase MIKLSEKGVFLASNNEIIAEEHFTGEIKKEEAKKGTIAWSILSSHNTSGNMDKLKIKFDSLASHDITFVGIVQTAKASGMERFPLPYVLTNCHNSLCAVGGTINGDDHVFGLSAAQRYGGIFVPPHIAVIHQYMREMMAGGGKMILGSDSHTRYGALGTMAVGEGGGELVKQLLNDTWDIDYPGVVAVHLTGKPAPYVGPQDVALAIIGAVFKNGYVKNKVMEFVGPGVSALSTDFRNSVDVMTTETTCLSSVWQTDEEVHNWLALHGRGQDYCQLNPQPMAYYDGCISVDLSAIKPMIALPFHPSNVYEIDTLNQNLTDILREIEIESERVAHGKAKLSLLDKVENGRLKVQQGIIAGCSGGNYENVIAAANALRGQSCGNDTFSLAVYPSSQPVFMDLAKKGVVADLIGAGAIIRTAFCGPCFGAGDTPINNGLSIRHTTRNFPNREGSKPANGQMSAVALMDARSIAATAANGGYLTSASELDCWDNVPEYAFDVTPYKNRVYQGFVKGATQQPLIYGPNIKDWPELGALTDNIVLKVCSKILDEVTTTDELIPSGETSSYRSNPIGLAEFTLSRRDPGYVSRSKATAELENQRLAGNVSELTEVFARIKQIAGQEHIDPLQTEIGSMVYAVKPGDGSAREQAASCQRVIGGLANIAEEYATKRYRSNVINWGMLPLQMAEVPTFEVGDYIYIPGIKAALDNPGTTFKGYVIHEDAPVTEITLYMESLTAEEREIIKAGSLINFNKNRQM >CP034953.1|QAA90575.1|3064354_3065788_-|anion-permease 
MNKKSLWKLILILAIPCIIGFMPAPAGLSELAWVLFGIYLAAIVGLVIKPFPEPVVLLIAVAASMVVVGNLSDGAFKTTAVLSGYSSGTTWLVFSAFTLSAAFVTTGLGKRIAYLLIGKIGNTTLGLGYVTVFLDLVLAPATPSNTARAGGIVLPIINSVAVALGSEPEKSPRRVGHYLMMSIYMVTKTTSYMFFTAMAGNILALKMINDILHLQISWGGWALAAGLPGIIMLLVTPLVIYTMYPPEIKKVDNKTIAKAGLAELGPMKIREKMLLGVFVLALLGWIFSKSLGVDESTVAIVVMATMLLLGIVTWEDVVKNKGGWNTLIWYGGIIGLSSLLSKVKFFEWLAEVFKNNLAFDGHGNVAFFVIIFLSIIVRYFFASGSAYIVAMLPVFAMLANVSGAPLMLTALALLFSNSYGGMVTHYGGAAGPVIFGVGYNDIKSWWLVGAVLTILTFLVHITLGVWWWNMLIGWNML >CP034953.1|QAA90576.1|3065863_3066916_-|4-oxalomesaconate-tautomerase MKKIPCVMMRGGTSRGAFLLAEHLPEDQTQRDKILMAIMGSGNDLEIDGIGGGNPLTSKVAIISRSSDPRADVDYLFAQVIVHEQRVDTTPNCGNMLSGVGAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQTPNGVVEYEGSARIDGVPGTAAPVALTFLNAAGTKTGKVFPTDNQIDYFDDVPVTCIDMAMPVVIIPAEYLGKTGYELPAELDADKALLARIESIRLQAGKAMGLGDVSNMVIPKPVLISPAQKGGAINVRYFMPHSCHRALAITGAIAISSSCALEGTVTRQIVPSVGYGNINIEHPSGALDVHLSNEGQDATTLRASVIRTTRKIFSGEVYLP >CP034953.1|QAA90577.1|3067099_3068053_+|LysR-family-transcriptional-regulator MKHELSSMKAFVILAESSSFNNAAKLLNITQPALTRRIKKMEEDLHVQLFERTTRKVTLTKAGKRLLPEARELIKKFDETLFNIRDMNAYHRGMVTLACIPTAVFYFLPLAIGKFNELYPNIKVRILEQGTNNCMESVLCNESDFGINMNNVTNSSIDFTPLVNEPFVLACRRDHPLAKKQLVEWQELVGYKMIGVRSSSGNRLLIEQQLADKPWKLDWFYEVRHLSTSLGLVEAGLGISALPGLAMPHAPYSSIIGIPLVEPVIRRTLGIIRRKDAVLSPAAERFFALLINLWTDDKDNLWTNIVERQRHALQEIG >CP034953.1|QAA90578.1|3068093_3069089_-|6-phosphogluconolactonase MKQTVYIASPESQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSPDKRYLYVGVRPEFRVLAYRIAPDDGALTFAAESALPGSPTHISTDHQGQFVFVGSYNAGNVSVTRLEDGLPVGVVDVVEGLDGCHSANISPDNRTLWVPALKQDRICLFTVSDDGHLVAQDPAEVTTVEGAGPRHMVFHPNEQYAYCVNELNSSVDVWELKDPHGNIECVQTLDMMPENFSDTRWAADIHITPDGRHLYACDRTASLITVFSVSEDGSVLSKEGFQPTETQPRGFNVDHSGKYLIAAGQKSHHISVYEIVGEQGLLHEKGRYAVGQGPMWVVVNAH >CP034953.1|QAA90579.1|3069243_3070062_+|pyridoxal-phosphatase 
MTTRVIALDLDGTLLTPKKTLLPSSIEALARAREAGYQLIIVTGRHHVAIHPFYQALALDTPAICCNGTYLYDYHAKTVLEADPMPVIKALQLIEMLNEHHIHGLMYVDDAMVYEHPTGHVIRTSNWAQTLPPEQRPTFTQVASLAETAQQVNAVWKFALTHDDLPQLQHFGKHVEHELGLECEWSWHDQVDIARGGNSKGKRLTKWVEAQGWSMENVVAFGDNFNDISMLEAAGTGVAMGNADDAVKARANIVIGDNTTDSIAQFIYSHLI >CP034953.1|QAA90580.1|3070062_3071121_-|molybdenum-ABC-transporter-ATP-binding-protein-ModC MLELNFSQTLGNHCLTINETLPANGITAIFGVSGAGKTSLINAISGLTRPQKGRIVLNGRVLNDAEKGICLTPEKRRVGYVFQDARLFPHYKVRGNLRYGMSKSMVDQFDKLVALLGIEPLLDRLPGSLSGGEKQRVAIGRALLTAPELLLLDEPLASLDIPRKRELLPYLQRLTREINIPMLYVSHSLDEILHLADRVMVLENGQVKAFGALEEVWGSSVMNPWLPKEQQSSILKVTVLEHHPHYAMTALALGDQHLWVNKLDEPLQAALRIRIQASDVSLVLQPPQQTSIRNVLRAKVVNSYDDNGQVEVELEVGGKTLWARISPWARDELAIKPGLWLYAQIKSVSITA >CP034953.1|QAA90581.1|3071123_3071813_-|molybdenum-ABC-transporter-permease MILTDPEWQAVLLSLKVSSLAVLFSLPFGIFFAWLLVRCTFPGKALLDSVLHLPLVLPPVVVGYLLLVSMGRRGFIGERLYDWFGITFAFSWRGAVLAAAVMSFPLMVRAIRLALEGVDVKLEQAARTLGAGRWRVFFTITLPLTLPGIIVGTVLAFARSLGEFGATITFVSNIPGETRTIPSAMYTLIQTPGGESGAARLCIISIALAMISLLISEWLARISRERAGR >CP034953.1|QAA90582.1|3071812_3072586_-|molybdate-ABC-transporter-substrate-binding-protein MARKWLNLFAGAALSFAVAGNALADEGKITVFAAASLTNAMQDIATQFKKEKGVDVVSSFASSSTLARQIEAGAPADLFISADQKWMDYAVDKKAIDTATRQTLLGNSLVVVAPKASVQKDFTIDSKTNWTSLLNGGRLAVGDPEHVPAGIYAKEALQKLGAWDTLSPKLAPAEDVRGALALVERNEAPLGIVYGSDAVASKGVKVVATFPEDSHKKVEYPVAVVEGHNNATVKAFYDYLKGPQAAEIFKRYGFTIK >CP034953.1|QAA90583.1|3072752_3072902_-|multidrug-efflux-pump-associated-protein,-AcrZ-family MLELLKSLVFAVIMVPVVMAIILGLIYGLGEVFNIFSGVGKKDQPGQNH |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034953_8 | 3317549-3317645 | Orphan |
NA
Consensus repeat of CP034953_8
|
1 spacers
spacers of CP034953_8
>8.1|3317576|43|CP034953|CRISPRCasFinder ATCGCATCAGGCATTGTGCACCAATTGCCGGATGCGGCACCGG |
CRISPR arrays and Neighbor proteins around CP034953_8
The CRISPR arrays of CP034953_8 >merge|CP034953|8|3317549-3317645|CRISPRCasFinder TTGTAGGCCTGATAAGATGCGTCAAGCATCGCATCAGGCATTGTGCACCAATTGCCGGATGCGGCACCGGTTGTAGGCCTGATAAGACGCGTCAAGC >CP034953|8|8|3317549-3317645|CRISPRCasFinder TTGTAGGCCTGATAAGATGCGTCAAGC ATCGCATCAGGCATTGTGCACCAATTGCCGGATGCGGCACCGG TTGTAGGCCTGATAAGACGCGTCAAGC
>CP034953.1|QAA90796.1|3316416_3317484_+|5-(carboxyamino)imidazole-ribonucleotide-synthase MKQVCVLGNGQLGRMLRQAGEPLGIAVWPVGLDAEPAAVPFQQSVITAEIERWPETALTRELARHPAFVNRDVFPIIADRLTQKQLFDKLHLPTAPWQLLAERSEWPAVFDRLGELAIVKRRTGGYDGRGQWRLRANETEQLPAECYGECIVEQGINFSGEVSLVGARGFDGSTVFYPLTHNLHQDGILRTSVAFPQANAQQQAQAEEMLSAIMQELGYVGVMAMECFVTPQGLLINELAPRVHNSGHWTQNGASISQFELHLRAITDLPLPQPVVNNPSVMINLIGSDVNYDWLKLPLVHLHWYDKEVRPGRKVGHLNLTDSDTSRLTATLEALIPLLPPEYASGVIWAQSKFG >CP034953.1|QAA90795.1|3315910_3316420_+|5-(carboxyamino)imidazole-ribonucleotide-mutase MSSRNNPARVAIVMGSKSDWATMQFAAEIFEILNVPHHVEVVSAHRTPDKLFSFAESAEENGYQVIIAGAGGAAHLPGMIAAKTLVPVLGVPVQSAALSGVDSLYSIVQMPRGIPVGTLAIGKAGAANAALLAAQILATHDKELHQRLNDWRKAQTDEVLENPDPRGAA >CP034953.1|QAA90794.1|3315070_3315793_+|UDP-2,3-diacylglucosamine-diphosphatase MATLFIADLHLCVEEPAITAGFLRFLAGEARKADALYILGDLFEAWIGDDDPNPLHRKMAAAIKAVSDSGVPCYFIHGNRDFLLGKRFARESGMTLLPEEKVLELYGRRVLIMHGDTLCTDDAGYQAFRAKVHKPWLQTLFLALPLFVRKRIAARMRANSKEANSSKSLAIMDVNQNAVVSAMEKHQVQWLIHGHTHRPAVHELIANQQPAFRVVLGAWHTEGSMVKVTADDVELIHFPF >CP034953.1|QAA90793.1|3314573_3315068_+|peptidylprolyl-isomerase-B MVTFHTNHGDIVIKTFDDKAPETVKNFLDYCREGFYNNTIFHRVINGFMIQGGGFEPGMKQKATKEPIKNEANNGLKNTRGTLAMARTQAPHSATAQFFINVVDNDFLNFSGESLQGWGYCVFAEVVDGMDVVDKIKGVATGRSGMHQDVPKEDVIIESVTVSE >CP034953.1|QAA90792.1|3313014_3314400_-|cysteine--tRNA-ligase MLKIFNTLTRQKEEFKPIHAGEVGMYVCGITVYDLCHIGHGRTFVAFDVVARYLRFLGYKLKYVRNITDIDDKIIKRANENGESFVAMVDRMIAEMHKDFDALNILRPDMEPRATHHIAEIIELTEQLIAKGHAYVADNGDVMFDVPTDPTYGVLSRQDLDQLQAGARVDVVDDKRNPMDFVLWKMSKEGEPSWPSPWGAGRPGWHIECSAMNCKQLGNHFDIHGGGSDLMFPHHENEIAQSTCAHDGQYVNYWMHSGMVMVDREKMSKSLGNFFTVRDVLKYYDAETVRYFLMSGHYRSQLNYSEENLKQARAALERLYTALRGTDKTVAPAGGEAFEARFIEAMDDDFNTPEAYSVLFDMAREVNRLKAEDMAAANAMASHLRKLSAVLGLLEQEPEAFLQSGAQADDSEVAEIEALIQQRLDARKAKDWAAADAARDRLNEMGIVLEDGPQGTTWRRK >CP034953.1|QAA90791.1|3312457_3312979_+|metal-dependent-hydrolase 
MPTVITHAAVPLCIGLGLGSKVIPPRLLFAGIILAMLPDADVLSFKFGVAYGNVFGHRGFTHSLVFAFVVPLLCVFIGRRWFRAGLIRCWLFLTVSLLSHSLLDSVTTGGKGVGWLWPWSDERFFAPWQVIKVAPFALSRYTTPYGHQVIISELMWVWLPGMLLMGMLWWRRR >CP034953.1|QAA90790.1|3312137_3312350_+|ribosome-associated-protein-YbcJ MATFSLGKHPHVELCDLLKLEGWSESGAQAKIAIAEGQVKVDGAVETRKRCKIVAGQTVSFAGHSVQVVA >CP034953.1|QAA90789.1|3311269_3312136_+|bifunctional-methylenetetrahydrofolate-dehydrogenase/methenyltetrahydrofolate-cyclohydrolase-FolD MAAKIIDGKTIAQQVRSEVAQKVQARIAAGLRAPGLAVVLVGSNPASQIYVASKRKACEEVGFVSRSYDLPETTSEAELLELIDTLNADNTIDGILVQLPLPAGIDNVKVLERIHPDKDVDGFHPYNVGRLCQRAPRLRPCTPRGIVTLLERYNIDTFGLNAVVIGASNIVGRPMSMELLLAGCTTTVTHRFTKNLRHHVENADLLIVAVGKPGFIPGDWIKEGAIVIDVGINRLENGKVVGDVVFEDAAKRASYITPVPGGVGPMTVATLIENTLQACVEYHDPQDE >CP034953.1|QAA90788.1|3310256_3310799_-|type-1-fimbrial-protein-subunit-FimA MKLRFISSALAAALFAATGSYAAVVDGGTIHFEGELVNAACSVNTDSADQVVTLGQYRTDIFNAVGNTSALIPFTIQLNDCDPVVAANAAVAFSGQADAINDNLLAIASSTNTTTATGVGIEILDNTSAILKPDGNSFSTNQNLIPGTNVLHFSARYKGTGTSASAGQANADATFIMRYE >CP034953.1|QAA90787.1|3309344_3310037_-|molecular-chaperone-FimC MMTKIKLLMLIIFYLIISASAHAAGGIALGATRIIYPADAKQTAVWIRNSHTNERFLVNSWIENSSGVKEKSFIITPPLFVSEPKSENTLRIIYTGPPLAADRESLFWMNVKTIPSVDKNALNGRNVLQLAILSRMKLFLRPIQLQELPAEAPDTLKFSRSGNYINVHNPSPFYVTLVNLQVGSQKLGNAMAAPRVNSQIPLPSGVQGKLKFQTVNDYGSVTPVREVNLN >CP034953.1|QAA90797.1|3317678_3318572_-|carbamate-kinase MKTLVVALGGNALLQRGEALTAENQYRNIASAVPALARLARSYRLAIVHGNGPQVGLLALQNLAWKEVEPYPLDVLVAESQGMIGYMLAQSLSAQPQMPPVTTVLTRIEVSPDDPAFLQPEKFIGPVYQPEEQEALEAAYGWQMKRDGKYLRRVVASPQPRKILDSEAIELLLKEGHVVICSGGGGVPVTDDGAGSEAVIDKDLAAALLAEQINADGLVILTDADAVYENWGTPQQRAIRHATPDELAPFAKADGSMGPNVTAVSGYVRSRGKPAWIGALSRIEETLAGEAGTCISL >CP034953.1|QAA90798.1|3318568_3319384_-|DUF2877-domain-containing-protein 
MTIIHPLLASSSAPNYRQSWRLAGVWRRAINLMTESGELLTLHRQGSGFGPGGWVLRRAQFDALCGGLCGNERPQVVAQGIRLGRFTVKQPQRYCLLRITPPAHPQPLAAAWMQRAEETGLFGPLALAASDPLPAELRQFRHCFQAALNGVKTDWRHWLGKGPGLTPSHDDTLSGMLLAAWYYGALDARSGRPFFACSDNLQLVTTAVSVSYLRYAAQGYFASPLLHFVHALSCPKRTAVAIDSLLALGHTSGADTLLGFWLGQQLLQGKP >CP034953.1|QAA90799.1|3319394_3320654_-|DUF1116-domain-containing-protein MFTSVAQANAAVIEQIRRARPHWLDVQPASSLISELNEGKTLLHAGPPMRWQEMTGPMKGACVGACLFEGWAKDEAQALAILEQGEVNFIPCHHVNAVGPMGGITSASMPMLVVENVTDGNRAYCNLNEGIGKVMRFGAYGEDVLTRHRWMRDVLMPVLSAALGRMERGIDLTAMMAQGITMGDEFHQRNIASSALLMRALAPQIARLDHDKQHIAEVMDFLSVTDQFFLNLAMAYCKAAMDAGAMIRAGSIVTAMTRNGNMFGIRVSGLGERWFTAPVNTPQGLFFTGFSQEQANPDMGDSAITETFGIGGAAMIAAPGVTRFVGAGGMEAARAVSEEMAEIYLERNMQLQIPSWDFQGACLGLDIRRVVETGITPLINTGIAHKEAGIGQIGAGTVRAPLACFEQALEALAESMGIG >CP034953.1|QAA90800.1|3320663_3322331_-|acyl-CoA-synthetase-FdrA MIHAFIKKGCFQDSVSLMIISRKLSESENVDDVSVMMGTPANKALLDTTGFWHDDFNNATPNDICVAIRSEAADAGIAQAIMQQLEEALKQLAQGSGSSQALTQVRRWDSACQKLPDANLALISVAGEYAAELANQALDRNLNVMMFSDNVTLEDEIQLKTRAREKGLLVMGPDCGTSMIAGTPLAFANVMPEGNIGVIGASGTGIQELCSQIALAGEGITHAIGLGGRDLSREVGGISALTALEMLSADEKSEVLAFVSKPPAEAVRLKIVNAMKATGKPTVALFLGYTPAVARDENVWFASSLDEAARLACLLSRVTARRNAIAPVSSGFICGLYTGGTLAAEAAGLLAGHLGVEADDTHQHGMMLDADSHQIIDLGDDFYTVGRPHPMIDPTLRNQLIADLGAKPQVRVLLLDVVIGFGATADPAASLVSAWQKACAARLDNQPLYAIATVTGTERDPQCRSQQIATLEDAGIAVVSSLPEATLLAAALIHPLSPAAQQHTPSLLENVAVINIGLRSFALELQSASKPVVHYQWSPVAGGNKKLARLLERLQ >CP034953.1|QAA90801.1|3322647_3323697_+|ureidoglycolate-dehydrogenase MKISRETLHQLIENKLCQAGLKREHAATVAEVLVYADARGIHSHGAVRVEYYAERISKGGTNREPEFRLEETGPCSAILHADNAAGQVAAKMGMEHAIKTAQQNGVAVVGISRMGHSGAISYFVQQAARAGFIGISMCQSDPMVVPFGGAEIYYGTNPLAFAAPGEGDEILTFDMATTVQAWGKVLDARSRNMSIPDTWAVDKNGVPTTDPFAVHALLPAAGPKGYGLMMMIDVLSGVLLGLPFGRQVSSMYDDLHAGRNLGQLHIVINPNFFSSSELFRQHLSQTMRELNAITPAPGFNQVYYPGQDQDIKQRKAAVEGIEIVDDIYQYLISDALYNTSYETKNPFAQ >CP034953.1|QAA90802.1|3323718_3324954_+|allantoate-amidohydrolase 
MITHFRQAIEETLPWLSSFGADPAGGMTRLLYSPEWLETQQQFKKRMAASGLETRFDEVGNLYGRLNGTEYPQEVVLSGSHIDTVVNGGNLDGQFGALAAWLAIDWLKTQYGAPLRTVEVVAMAEEEGSRFPYVFWGSKNIFGLANPDDVRNICDAKGNSFVDAMKACGFTLPNAPLTPRQDIKAFVELHIEQGCVLESNGQSIGVVNAIVGQRRYTVTLNGESNHAGTTPMGYRRDTVYAFSRICHQSVEKAKRMGDPLVLTFGKVEPRPNTVNVVPGKTTFTIDCRHTDAAVLRDFTQQLENDMRAICDEMDIGIDIDLWMDEEPVPMNKELVATLTELCEREKLNYRVMHSGAGHDAQIFAPRVPTCMIFIPSINGISHNPAERTNITDLAEGVKTLALMLYQLAWQK >CP034953.1|QAA90803.1|3324964_3325750_+|(S)-ureidoglycine-aminohydrolase MGYLNNVTGYREDLLANRAIVKHGNFALLTPDGLVKNIIPGFENCDATILSTPKLGASFVDYLVTLHQNGGNQQGFGGEGIETFLYVISGNITAKAEGKTFALSEGGYLYCPPGSLMTFVNAQAEDSQIFLYKRRYVPVEGYAPWLVSGNASELERIHYEGMDDVILLDFLPKELGFDMNMHILSFAPGASHGYIETHVQEHGAYILSGQGVYNLDNNWIPVKKGDYIFMGAYSLQAGYGVGRGEAFSYIYSKDCNRDVEI >CP034953.1|QAA90804.1|3325977_3327123_-|glycerate-3-kinase MKIVIAPDSFKESLSAEKCCQAIKAGFSTLFPDANYICLPIADGGEGTVDAMVAATGGNIVTLEVCGPMGEKVNAFYGLTGDGKTAVIEMAAASGLMLVAPEKRNPLLASSFGTGELIRHALDNDIRHIILGIGGSATVDGGMGMAQALGVRFLDADGQALAANGGNLARVASIEMDECDPRLANCHIEVACDVDNPLVGARGAAAVFGPQKGATPEMVEELEQGLQNYARVLQQQTEINVCQMAGGGAAGGMGIAAAVFLNADIKPGIEIVLNAVNLAQAVQGAALVITGEGRIDSQTAGGKAPLGVASVAKQFNVPVIGIAGVLGDGVEVVHQYGIDAVFSILPRLAPLAEVLASGETNLFNSARNIACAIKIGQGIKN >CP034953.1|QAA90805.1|3327144_3328446_-|uracil/xanthine-transporter MFNFAVSRESLLSGFQWFFFIFCNTVVVPPTLLSAFQLPQSSLLTLTQYAFLATALACFAQAFCGHRRAIMEGPGGLWWGTILTITLGEASRGTPINDIATSLAVGIALSGVLTMLIGFSGLGHRLARLFTPSVMVLFMLMLGAQLTTIFFKGMLGLPFGIADPNFKIQLPPFALSVAVMCLVLAMIIFLPQRFARYGLLVGTITGWLLWYFCFPSSHSLSGELHWQWFPLGSGGALSPGIILTAVITGLVNISNTYGAIRGTDVFYPQQGAGNTRYRRSFVATGFMTLITVPLAVIPFSPFVSSIGLLTQTGDYTRRSFIYGSVICLLVALVPALTRLFCSIPLPVSSAVMLVSYLPLLFSALVFSQQITFTARNIYRLALPLFVGIFLMALPPVYLQDLPLTLRPLLSNGLLVGILLAVLMDNLIPWERIE >CP034953.1|QAA90806.1|3328502_3329864_-|allantoinase-AllB 
MSFDLIIKNGTVILENEARVVDIAVKGGKIAAIGQDLGDAKEVMDASGLVVSPGMVDAHTHISEPGRSHWEGYETGTRAAAKGGITTMIEMPLNQLPATVDRASIELKFDAAKGKLTIDAAQLGGLVSYNIDRLHELDEVGVVGFKCFVATCGDRGIDNDFRDVNDWQFFKGAQKLGELGQPVLVHCENALICDELGEEAKREGRVTAHDYVASRPVFTEVEAIRRVLYLAKVAGCRLHVCHVSSPEGVEEVTRARQEGQDVTCESCPHYFVLDTDQFEEIGTLAKCSPPIRDLENQKGMWEKLFNGEIDCLVSDHSPCPPEMKAGNIMKAWGGIAGLQSCMDVMFDEAVQKRGMSLPMFGKLMATNAADIFGLQQKGRIAPGKDADFVFIQPNSSYVLTNDDLEYRHKVSPYVGRTIGARITKTILRGDVIYDIEQGFPVAPKGQFILKHQQ |
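The coordinates reported for CP034953_8 can be cross-checked against the array sequence itself. The sketch below assumes that positions are 1-based and inclusive, and that the array is given as alternating repeat / spacer / repeat segments, as in the `>CP034953|8|8|...` record; the function name `spacer_coordinates` is illustrative and not part of any tool named on this page.

```python
from typing import List, Tuple


def spacer_coordinates(array_start: int, segments: List[str]) -> Tuple[List[Tuple[int, int, int]], int]:
    """Walk alternating repeat/spacer segments and return spacer positions.

    Assumes 1-based inclusive coordinates and that segments[0] is a repeat,
    segments[1] a spacer, segments[2] a repeat, and so on.
    Returns ([(spacer_start, spacer_end, spacer_length), ...], array_end).
    """
    spacers = []
    pos = array_start
    for index, segment in enumerate(segments):
        if index % 2 == 1:  # odd positions hold spacers
            spacers.append((pos, pos + len(segment) - 1, len(segment)))
        pos += len(segment)
    return spacers, pos - 1


# Segments copied from the CP034953_8 array record (repeat, spacer, repeat).
cp034953_8 = [
    "TTGTAGGCCTGATAAGATGCGTCAAGC",
    "ATCGCATCAGGCATTGTGCACCAATTGCCGGATGCGGCACCGG",
    "TTGTAGGCCTGATAAGACGCGTCAAGC",
]

spacers, array_end = spacer_coordinates(3317549, cp034953_8)
print(spacers, array_end)
```

Under these assumptions the single spacer starts at 3317576 with length 43 and the array ends at 3317645, which agrees with the `>8.1|3317576|43|...` spacer header and the `3317549-3317645` location listed for this array.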
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034953_9 | 3457789-3457933 | Orphan |
NA
Consensus repeat of CP034953_9
|
1 spacers
spacers of CP034953_9
>9.1|3457832|59|CP034953|CRISPRCasFinder CGGAGCACTTATTGCCGGATGCGGCGTGAACGCCTTATCCGGCCTACGGTTCTGGCACC |
CRISPR arrays and Neighbor proteins around CP034953_9
The CRISPR arrays of CP034953_9 >merge|CP034953|9|3457789-3457933|CRISPRCasFinder TTTTGCAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCATCGGAGCACTTATTGCCGGATGCGGCGTGAACGCCTTATCCGGCCTACGGTTCTGGCACCTTTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAT >CP034953|9|9|3457789-3457933|CRISPRCasFinder TTTTGCAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAT CGGAGCACTTATTGCCGGATGCGGCGTGAACGCCTTATCCGGCCTACGGTTCTGGCACC TTTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAT
>CP034953.1|QAA91942.1|3456527_3457712_+|MFS-transporter-AraJ MKKVILSLALGTFGLGMAEFGIMGVLTELAHNVGISIPAAGHMISYYALGVVVGAPIIALFSSRYSLKHILLFLVALCVIGNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSKIIKPGKVTAAVAGMVSGMTVANLLGIPLGTYLSQEFSWRYTFLLIAVFNIAVMASVYFWVPDIRDEAKGNLREQFHFLRSPAPWLIFAATMFGNAGVFAWFSYVKPYMMFISGFSETAMTFIMMLVGLGMVLGNMLSGRISGRYSPLRIAAVTDFIIVLALLMLFFCGGMKTTSLIFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAFNLGSAVGAYCGGMMLTLGLAYNYVALPAALLSFAAMSSLLLYGRYKRQQAADTPVLAKPLG >CP034953.1|QAA90915.1|3453255_3456402_+|exonuclease-subunit-SbcC MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAICLALYHETPRLSNVSQSQNDLMTRDTAECLAEVEFEVKGEAYRAFWSQNRARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRSMLLSQGQFAAFLNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTELEKLQAQASGVTLLTPEQVQSLTASLQVLTDEEKQLITAQQQEQQSLNWLTRQDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIAEHSAALAHIRQQIEEVNTRLQSTMALRASIRHHAAKQSAELQQQQQSLNTWLQEHDRFRQWNNEPAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALAAITLTLTADEVATALAQHAEQRPLRQHLVALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKTICEQEARIKTLEAQRAQLQAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEGATLRGQLDAITKQLQRDENEAQSLRQDEQALTQQWQAVTASLNITLQPLDDIQPWLDAQDEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQLLLTTLTGYALTLPQEDEEESWLATRQQEAQSWQQRQNELTALQNRIQQLTPILETLPQSDELPHCEETVVLENWRQVHEQCLALHSQQQTLQQQDVLAAQSLQKAQAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLVTQTAETLAQHQQHRPDDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIRQQLKQDADNRQQQQTLMQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQGLTLDNLVHLANQQLTRLHGRYLLQRKASEALEVEVVDTWQADAVRDTRTLSGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDALDALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSKLESTFAVK >CP034953.1|QAA90914.1|3452056_3453259_+|exonuclease-subunit-SbcD 
MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETAQTHQVDAIIVAGDVFDTGSPPSYARTLYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDIMAFLNTTVVASAGHAPQILPRRDGTPGAVLCPIPFLRPRDIITSQAGLNGIEKQQHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIYIGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGSPIPLSFDECGKSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVSQEPPVWLDIEITTDEYLHDIQRKIQALTESLPVEVLLVRRSREQRERVLASQQRETLSELSVEEVFNRRLALEELDESQQQRLQHLFTTTLHTLAGEHEA >CP034953.1|QAA90913.1|3451177_3451867_-|phosphate-response-regulator-transcription-factor-PhoB MARRILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGIQFIKHLKRESMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRISPMAVEEVIEMQGLSLDPTSHRVMAGEEPLEMGPTEFKLLHFFMTHPERVYSREQLLNHVWGTNVYVEDRTVDVHIRRLRKALEPGGHDRMVQTVRGTGYRFSTRF >CP034953.1|QAA90912.1|3449824_3451120_-|phosphate-regulon-sensor-histidine-kinase-PhoR MLERLSWKRLVLELLLCCLPAFILGAFFGYLPWFLLASVTGLLIWHFWNLLRLSWWLWVDRSMTPPPGRGSWEPLLYGLHQMQLRNKKRRRELGNLIKRFRSGAESLPDAVVLTTEEGGIFWCNGLAQQILGLRWPEDNGQNILNLLRYPEFTQYLKTRDFSRPLNLVLNTGRHLEIRVMPYTHKQLLMVARDVTQMHQLEGARRNFFANVSHELRTPLTVLQGYLEMMNEQPLEGAVREKALHTMREQTQRMEGLVKQLLTLSKIEAAPTHLLNEKVDVPMMLRVVEREAQTLSQKKQTFTFEIDNGLKVSGNEDQLRSAISNLVYNAVNHTPEGTHITVRWQRVPHGAEFSVEDNGPGIAPEHIPRLTERFYRVDKARSRQTGGSGLGLAIVKHAVNHHESRLNIESTVGKGTRFSFVIPERLIAKNSD >CP034953.1|QAA90911.1|3448098_3449418_-|branched-chain-amino-acid-transporter-carrier-protein-BrnQ MTHQLRSRDIIALGFMTFALFVGAGNIIFPPMVGLQAGEHVWTAAFGFLITAVGLPVLTVVALAKVGGGVDSLSTPIGKVAGVLLATVCYLAVGPLFATPRTATVSFEVGIAPLTGDSALPLFIYSLVYFAIVILVSLYPGKLLDTVGNFLAPLKIIALVILSVAAIVWPAGSISTATEAYQNAAFSNGFVNGYLTMDTLGAMVFGIVIVNAARSRGVTEARLLTRYTVWAGLMAGVGLTLLYLALFRLGSDSASLVDQSANGAAILHAYVQHTFGGGGSFLLAALIFIACLVTAVGLTCACAEFFAQYVPLSYRTLVFILGGFSMVVSNLGLSQLIQISVPVLTAIYPPCIALVVLSFTRSWWHNSSRVIAPPMFISLLFGILDGIKASAFSDILPSWAQRLPLAEQGLAWLMPTVVMVVLAIIWDRAAGRQVTSSAH >CP034953.1|QAA90910.1|3446649_3448023_-|proline-specific-permease-ProY 
MESKNKLKRGLSTRHIRFMALGSAIGTGLFYGSADAIKMAGPSVLLAYIIGGIAAYIIMRALGEMSVHNPAASSFSRYAQENLGPLAGYITGWTYCFEILIVAIADVTAFGIYMGVWFPTVPHWIWVLSVVLIICAVNLMSVKVFGELEFWFSFFKVATIIIMIVAGFGIIIWGIGNGGQPTGIHNLWSNGGFFSNGWLGMVMSLQMVMFAYGGIEIIGITAGEAKDPEKSIPRAINSVPMRILVFYVGTLFVIMSIYPWNQVGTAGSPFVLTFQHMGITFAASILNFVVLTASLSAINSDVFGVGRMLHGMAEQGSAPKIFSKTSRRGIPWVTVLVMTTALLFAVYLNYIMPENVFLVIASLATFATVWVWIMILLSQIAFRRRLPPEEVKALKFKVPGGVATTIGGLIFLLFIIGLIGYHPDTRISLYVGFAWIVVLLIGWMFKRRHDRQLAENQ >CP034953.1|QAA90909.1|3444676_3446494_-|maltodextrin-glucosidase MMLNAWHLPVPPFVKQSKDQLLITLWLTGEDPPQRIMLRTEHDNEEMSVPMHKQRSQPQPGVTAWRAAIDLSSGQPRRRYSFKLLWHDRQRWFTPQGFSRMPPARLEQFAVDVPDIGPQWAADQIFYQIFPDRFARSLPREAEQDHVYYHHAAGQEIILRDWDEPVTAQAGGSTFYGGDLDGISEKLPYLKKLGVTALYLNPVFKAPSVHKYDTEDYRHVDPQFGGDGALLRLRHNTQQLGMRLVLDGVFNHSGDSHAWFDRHNRGTGGACHNPESPWRDWYSFSDDGTALDWLGYASLPKLDYQSESLVNEIYRGEDSIVRHWLKAPWNMDGWRLDVVHMLGEAGGARNNMQHVAGITEAAKETQPEAYIVGEHFGDARQWLQADVEDAAMNYRGFTFPLWGFLANTDISYDPQQIDAQTCMAWMDNYRAGLSHQQQLRMFNQLDSHDTARFKTLLGRDIARLPLAVVWLFTWPGVPCIYYGDEVGLDGKNDPFCRKPFPWQVEKQDTALFALYQRMIALRKKSQALRHGGCQVLYAEDNVVVFVRVLNQQRVLVAINRGEACEVVLPASPFLNAVQWQCKEGHGQLTDGILALPAISATVWMN >CP034953.1|QAA90908.1|3444090_3444672_+|ACP-phosphodiesterase MNFLAHLHLAHLAESSLSGNLLADFVRGNPEESFPPDVVAGIHMHRRIDVLTDNLPEVREAREWFRSETRRVAPITLDVMWDHFLSRHWSQLSPDFPLQEFVCYAREQVMTILPDSPPRFINLNNYLWSEQWLVRYRDMDFIQNVLNGMASRRPRLDALRDSWYDLDAHYDALETRFWQFYPRMMAQASRKAL >CP034953.1|QAA90907.1|3442927_3443998_-|tRNA-preQ1(34)-S-adenosylmethionine-ribosyltransferase-isomerase-QueA MRVTDFSFELPESLIAHYPMPERSSCRLLSLDGPTGALTHGTFTDLLDKLNPGDLLVFNNTRVIPARLFGRKASGGKIEVLVERMLDDKRILAHIRASKAPKPGAELLLGDDESINATMTARHGALFEVEFNDERSVLDILNSIGHMPLPPYIDRPDEDADRELYQTVYSEKPGAVAAPTAGLHFDEPLLEKLRAKGVEMAFVTLHVGAGTFQPVRVDTIEDHIMHSEYAEVPQDVVDAVLAAKARGNRVIAVGTTSVRSLESAAQAAKNDLIEPFFDDTQIFIYPGFQYKVVDALVTNFHLPESTLIMLVSAFAGYQHTMNAYKAAVEEKYRFFSYGDAMFITYNPQAINERVGE >CP034953.1|QAA90916.1|3457956_3458865_-|fructokinase 
MRIGIDLGGTKTEVIALGDAGEQLYRHRLPTPRDDYRQTIETIATLVDMAEQATGQRGTVGMGIPGSISPYTGVVKNANSTWLNGQPFDKDLSARLQREVRLANDANCLAVSEAVDGAAAGAQTVFAVIIGTGCGAGVAFNGRAHIGGNGTAGEWGHNPLPWMDEDELRYREEVPCYCGKQGCIETFISGTGFAMDYRRLSGHALKGSEIIRLVEESDPVAELALRRYELRLAKSLAHVVNILDPDVIVLGGGMSNVDRLYQTVGQLIKQFVFGGECETPVRKAKHGDSSGVRGAAWLWPQE >CP034953.1|QAA91943.1|3458989_3459901_+|recombination-associated-protein-RdgC MLWFKNLMVYRLSREISLRAEEMEKQLASMAFTPCGSQDMAKMGWVPPMGSHSDALTHVANGQIVICARKEEKILPSPVIKQALEAKIAKLEAEQARKLKKTEKDSLKDEVLHSLLPRAFSRFSQTMMWIDTVNGLIMVDCASAKKAEDTLALLRKSLGSLPVVPLSMENPIELTLTEWVRSGSAAQGFQLLDEAELKSLLEDGGVIRAKKQDLTSEEITNHIEAGKVVTKLALDWQQRIQFVMCDDGSLKRLKFCDELRDQNEDIDREDFAQRFDADFILMTGELAALIQNLIEGLGGEAQR >CP034953.1|QAA91944.1|3459978_3460062_-|hypothetical-protein MTQRPWSKLQRKTHNIAALKIIARRSE >CP034953.1|QAA90917.1|3460547_3460832_-|pyrimidine/purine-nucleoside-phosphorylase MLQSNEYFSGKVKSIGFSSSSTGRASVGVMVEGEYTFSTAEPEEMTVISGALNVLLPDATDWQVYEAGSVFNVPGHSEFHLQVAEPTSYLCRYL >CP034953.1|QAA90918.1|3460903_3461581_-|AroM-family-protein MSASLAILTIGIVPMQEVLPLLTEYIDEDNISHHSLLGKLSREEVMAEYAPEAGEDTILTLLNDNQLAHVSRRKVERDLQGVVEVLDNQGYDVILLMSTANISSMTARNTIFLEPSRILPPLVSSIVEDHQVGVIVPVEEMLPVQAQKWQILQKSPVFSLGNPIHDSEQKIIDAGKELLAKGADVIMLDCLGFHQRHRDLLQKQLDVPVLLSNVLIARLAAELLV >CP034953.1|QAA90919.1|3461838_3462030_-|protein-YaiA MPTKPPYPREAYIVTIEKGKPGQTVTWYQLRADHPKPDSLISEHPTAQEAMDAKKRYEDPDKE >CP034953.1|QAA90920.1|3462079_3462604_-|shikimate-kinase-AroL MTQPLFLIGPRGCGKTTVGMALADSLNRRFVDTDQWLQSQLNMTVAEIVEREEWAGFRARETAALEAVTAPSTVIATGGGIILTEFNRHFMQNNGIVVYLCAPVSVLVNRLQAAPEEDLRPTLTGKPLSEEVQEVLEERDALYREVAHIIIDATNEPSQVISEIRSALAQTINC >CP034953.1|QAA90921.1|3462786_3463245_-|YaiI/YqxD-family-protein MTIWVDADACPNVIKEILYRAAERMQMPLVLVANQSLRVPPSRFIRTLRVAAGFDVADNEIVRQCEAGDLVITADIPLAAEAIEKGAAALNPRGERYTPATIRERLTMRDFMDTLRASGIQTGGPDSLSQRDRQAFAAELEKWWLEVQRSRG >CP034953.1|QAA90922.1|3463364_3464174_+|pyrroline-5-carboxylate-reductase 
MEKKIGFIGCGNMGKAILGGLIASGQVLPGQIWVYTPSPDKVAALHDQFGINAAESAQEVAQIADIIFAAVKPGIMIKVLSEITSSLNKDSLVVSIAAGVTLDQLARALGHDRKIIRAMPNTPALVNAGMTSVTPNALVTPEDTADVLNIFRCFGEAEVIAEPMIHPVVGVSGSSPAYVFMFIEAMADAAVLGGMPRAQAYKFAAQAVMGSAKMVLETGEHPGALKDMVCSPGGTTIEAVRVLEEKGFRAAVIEAMTKCMEKSEKLSKS >CP034953.1|QAA90923.1|3464190_3465306_-|diguanylate-cyclase-AdrA MFPKIMNDENFFKKAAAHGEEPPLTPQNEHQRSGLRFARRVRLPRAVGLAGMFLPIASTLVSHPPPGWWWLVLVGWAFVWPHLAWQIASRAVDPLSREIYNLKTDAVLAGMWVGVMGVNVLPSTAMLMIMCLNLMGAGGPRLFVAGLVLMVVSCLVTLELTGITVSFNSAPLEWWLSLPIIVIYPLLFGWVSYQTATKLAEHKRRLQVMSTRDGMTGVYNRRHWETMLRNEFDNCRRHNRDATLLIIDIDHFKSINDTWGHDVGDEAIVALTRQLQITLRGSDVIGRFGGDEFAVIMSGTPAESAITAMLRVHEGLNTLRLPNTPQVTLRISVGVAPLNPQMSHYREWLKSADLALYKAKKAGRNRTEVAA |
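In the spacer hit table that follows, the Identity column appears to be derived from Spacer_length and Mismatch as (length - mismatches) / length, rounded to three decimals. This is an inference from the listed values, not a documented formula, and the helper name `spacer_identity` is hypothetical.

```python
def spacer_identity(spacer_length: int, mismatches: int) -> float:
    """Fraction of spacer positions matching the protospacer, to 3 decimals.

    Assumption: identity = (length - mismatches) / length, using Python's
    built-in rounding; the source page does not state its formula.
    """
    if spacer_length <= 0:
        raise ValueError("spacer_length must be positive")
    if not 0 <= mismatches <= spacer_length:
        raise ValueError("mismatches must lie between 0 and spacer_length")
    return round((spacer_length - mismatches) / spacer_length, 3)


# (Spacer_length, Mismatch) pairs taken from rows of the hit table.
for length, mismatch in [(40, 0), (38, 2), (33, 4), (32, 5), (32, 10)]:
    print(length, mismatch, spacer_identity(length, mismatch))
```

With this formula, a 33-nt spacer with 4 mismatches gives 0.879 and a 32-nt spacer with 10 mismatches gives 0.688, matching the corresponding rows.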
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP034953_6 | 6.1|2799478|40|CP034953|CRISPRCasFinder | 2799478-2799517 | 40 | NZ_CP041417 | Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence | 47951-47990 | 0 | 1.0 |
CP034953_5 | 5.1|2135142|38|CP034953|CRISPRCasFinder | 2135142-2135179 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 113727-113764 | 2 | 0.947 |
CP034953_2 | 2.2|980328|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 980328-980360 | 33 | NZ_LR134258 | Klebsiella aerogenes strain NCTC9644 plasmid 5, complete sequence | 3574-3606 | 4 | 0.879 |
CP034953_2 | 2.2|980328|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 980328-980360 | 33 | LR134281 | Klebsiella aerogenes strain NCTC9793 genome assembly, plasmid: 6 | 3567-3599 | 4 | 0.879 |
CP034953_2 | 2.2|980328|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 980328-980360 | 33 | KY271401 | Klebsiella phage 1 LV-2017, complete genome | 21043-21075 | 4 | 0.879 |
CP034953_3 | 3.8|1006641|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006641-1006672 | 32 | NC_021229 | Arthrobacter nicotinovorans pAO1 megaplasmid sequence, strain ATCC 49919 | 65474-65505 | 5 | 0.844 |
CP034953_2 | 2.2|980328|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 980328-980360 | 33 | KY653119 | Morganella phage IME1369_02, complete genome | 18216-18248 | 6 | 0.818 |
CP034953_3 | 3.1|1006212|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006212-1006243 | 32 | NZ_CP009293 | Novosphingobium pentaromativorans US6-1 plasmid pLA4, complete sequence | 152196-152227 | 6 | 0.812 |
CP034953_3 | 3.2|1006273|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006273-1006304 | 32 | KY883647 | Vibrio phage JSF33, complete genome | 9760-9791 | 6 | 0.812 |
CP034953_3 | 3.8|1006641|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006641-1006672 | 32 | NZ_CP017422 | Arthrobacter sp. ZXY-2 plasmid pZXY21, complete sequence | 208287-208318 | 6 | 0.812 |
CP034953_2 | 2.3|980389|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 980389-980421 | 33 | NZ_CP007129 | Gemmatirosa kalamazoonesis strain KBS708 plasmid 1, complete sequence | 755172-755204 | 8 | 0.758 |
CP034953_3 | 3.1|1006212|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006212-1006243 | 32 | NZ_CP007130 | Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence | 750410-750441 | 8 | 0.75 |
CP034953_3 | 3.2|1006273|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006273-1006304 | 32 | MN855762 | Bacteriophage sp. isolate 505, complete genome | 4840-4871 | 8 | 0.75 |
CP034953_3 | 3.2|1006273|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006273-1006304 | 32 | NC_020548 | Azoarcus sp. KH32C plasmid pAZKH, complete sequence | 224460-224491 | 8 | 0.75 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | NC_013856 | Azospirillum sp. B510 plasmid pAB510b, complete sequence | 375744-375776 | 8 | 0.758 |
CP034953_3 | 3.8|1006641|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006641-1006672 | 32 | MK113951 | Phage 5P_3, complete genome | 11967-11998 | 8 | 0.75 |
CP034953_3 | 3.8|1006641|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006641-1006672 | 32 | AP017924 | Ralstonia phage RP12 DNA, complete genome | 11643-11674 | 8 | 0.75 |
CP034953_3 | 3.12|1006885|32|CP034953|CRISPRCasFinder,CRT | 1006885-1006916 | 32 | NZ_AP018516 | Acetobacter orientalis strain FAN1 plasmid pAOF1, complete sequence | 48296-48327 | 8 | 0.75 |
CP034953_3 | 3.1|1006212|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006212-1006243 | 32 | MN234174 | Mycobacterium phage Efra2, complete genome | 35614-35645 | 9 | 0.719 |
CP034953_3 | 3.1|1006212|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006212-1006243 | 32 | MN234165 | Mycobacterium phage Yunkel11, complete genome | 35570-35601 | 9 | 0.719 |
CP034953_3 | 3.1|1006212|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006212-1006243 | 32 | MN234201 | Mycobacterium phage Guanica15, complete genome | 35571-35602 | 9 | 0.719 |
CP034953_3 | 3.2|1006273|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006273-1006304 | 32 | NZ_CP015585 | Roseomonas gilardii strain U14-5 plasmid 1, complete sequence | 104261-104292 | 9 | 0.719 |
CP034953_3 | 3.2|1006273|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006273-1006304 | 32 | NZ_CP054618 | Azospirillum oryzae strain KACC 14407 plasmid unnamed4, complete sequence | 142898-142929 | 9 | 0.719 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | NZ_CP010957 | Sphingobium sp. YBL2 plasmid 3pYBL2-3, complete sequence | 26182-26214 | 9 | 0.727 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052797 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N18S2039 plasmid pN18S2039, complete sequence | 45808-45839 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052795 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0125 plasmid pN19S0125, complete sequence | 282589-282620 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | NZ_CP047882 | Salmonella enterica subsp. enterica serovar Infantis strain 119944 plasmid pESI, complete sequence | 94965-94996 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052804 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S973 plasmid pN17S0973, complete sequence | 304288-304319 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP038508 | Salmonella enterica subsp. enterica serovar Infantis strain FARPER-219 plasmid p-F219, complete sequence | 112376-112407 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052802 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S976 plasmid pN17S0976, complete sequence | 315682-315713 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052788 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0611 plasmid pN19S0611, complete sequence | 203378-203409 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052840 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S024 plasmid pN16S024, complete sequence | 127648-127679 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052786 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0641 plasmid pN19S0641, complete sequence | 215302-215333 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052838 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S097 plasmid pN16S097, complete sequence | 214483-214514 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | NZ_CP028316 | Salmonella enterica subsp. enterica serovar Typhimurium var. 5- strain CFSAN067217 plasmid pSC-31-2, complete sequence | 108893-108924 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP051676 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1234 plasmid pN16S1234, complete sequence | 83669-83700 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052783 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0679 plasmid pN19S0679-1, complete sequence | 194119-194150 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052836 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S103 plasmid pN16S103, complete sequence | 18410-18441 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | NZ_CP022063 | Salmonella enterica strain FDAARGOS_312 plasmid unnamed3, complete sequence | 64615-64646 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052781 | Salmonella enterica strain CVM N19S0949 plasmid pN19S0949, complete sequence | 169480-169511 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052834 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S041 plasmid pN17S0041, complete sequence | 6457-6488 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052793 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0388 plasmid pN19S0388, complete sequence | 25758-25789 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052779 | Salmonella enterica strain 19TN07GT06K-S plasmid pN19S1233, complete sequence | 140403-140434 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052832 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1040 plasmid pN17S1040, complete sequence | 160727-160758 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | NZ_CP031362 | Salmonella enterica subsp. enterica serovar Heidelberg strain 5 plasmid p3, complete sequence | 140152-140183 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052830 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1105 plasmid pN17S1105, complete sequence | 193709-193740 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052828 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1126 plasmid pN17S1126, complete sequence | 126974-127005 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052826 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1245 plasmid pN17S0637, complete sequence | 110984-111015 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | NZ_CP016409 | Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502916 plasmid pFSIS1502916, complete sequence | 94916-94947 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052824 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1265 plasmid pN17S1265, complete sequence | 91497-91528 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052822 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1349 plasmid pN17S1349, complete sequence | 110984-111015 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | NZ_CP016407 | Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502169 plasmid pFSIS1502169, complete sequence | 94916-94947 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052820 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1442 plasmid pN17S1442, complete sequence | 94916-94947 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | NZ_CP016413 | Salmonella enterica subsp. enterica serovar Infantis strain CVM44454 plasmid pCVM44454, complete sequence | 94916-94947 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | NZ_CP016411 | Salmonella enterica subsp. enterica serovar Infantis strain N55391 plasmid pN55391, complete sequence | 94916-94947 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052816 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1598 plasmid pN17S1598 | 165317-165348 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052814 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S349 plasmid pN17S0349, complete sequence | 99109-99140 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | NZ_CP022662 | Salmonella enterica subsp. enterica strain RM11065 plasmid pRM11065-2, complete sequence | 54379-54410 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052812 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S376 plasmid pN17S0376, complete sequence | 1671-1702 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052810 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S535 plasmid pN17S0535, complete sequence | 212751-212782 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052808 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S637 plasmid pN17S0637, complete sequence | 306376-306407 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052806 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S816 plasmid pN17S0816, complete sequence | 164579-164610 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052791 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0552 plasmid pN17S0637, complete sequence | 168074-168105 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052818 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1509 plasmid pN17S1509, complete sequence | 190524-190555 | 10 | 0.688 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | CP052799 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S990 plasmid pN17S0990-1, complete sequence | 6457-6488 | 10 | 0.688 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | CP046443 | Pseudomonas coronafaciens pv. coronafaciens strain B19001 plasmid unnamed2, complete sequence | 31933-31965 | 10 | 0.697 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | NZ_LT963392 | Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP1, complete sequence | 103013-103045 | 10 | 0.697 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | NZ_LT963392 | Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP1, complete sequence | 110510-110542 | 10 | 0.697 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | NZ_CP034079 | Pseudomonas syringae pv. pisi str. PP1 plasmid pPP1-1, complete sequence | 48454-48486 | 10 | 0.697 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | NZ_CP034080 | Pseudomonas syringae pv. pisi str. PP1 plasmid pPP1-2, complete sequence | 39480-39512 | 10 | 0.697 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | NC_005918 | Pseudomonas syringae pv. maculicola strain ES4326 plasmid pPMA4326A, complete sequence | 31117-31149 | 10 | 0.697 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | NZ_CP047262 | Pseudomonas syringae pv. maculicola str. ES4326 plasmid pPma4326A, complete sequence | 30966-30998 | 10 | 0.697 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | NZ_CP026560 | Pseudomonas amygdali pv. morsprunorum strain R15244 plasmid p3_tig5, complete sequence | 19118-19150 | 10 | 0.697 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | NZ_LT963406 | Pseudomonas syringae pv. avii isolate CFBP3846 plasmid PP4, complete sequence | 54820-54852 | 10 | 0.697 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | LT985193 | Pseudomonas syringae strain CFBP 2116 genome assembly, plasmid: PP2 | 32077-32109 | 10 | 0.697 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | NZ_LT963393 | Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP2, complete sequence | 50597-50629 | 10 | 0.697 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | NZ_LT985210 | Pseudomonas syringae pv. cerasicola strain CFBP 6110 plasmid PP1, complete sequence | 105842-105874 | 10 | 0.697 |
CP034953_3 | 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006457-1006489 | 33 | NZ_LT985211 | Pseudomonas syringae pv. cerasicola strain CFBP 6110 plasmid PP2, complete sequence | 84272-84304 | 10 | 0.697 |
CP034953_3 | 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006580-1006611 | 32 | NZ_CP028970 | Aminobacter sp. MSH1 plasmid pUSP2, complete sequence | 156123-156154 | 10 | 0.688 |
CP034953_3 | 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006580-1006611 | 32 | NZ_CP053984 | Achromobacter pestifer strain FDAARGOS_790 plasmid unnamed, complete sequence | 21888-21919 | 10 | 0.688 |
CP034953_3 | 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006580-1006611 | 32 | NC_010935 | Comamonas testosteroni CNB-1 plasmid pCNB, complete sequence | 28766-28797 | 10 | 0.688 |
CP034953_3 | 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006580-1006611 | 32 | JX469826 | Uncultured bacterium plasmid pB12, complete sequence | 11283-11314 | 10 | 0.688 |
CP034953_3 | 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006580-1006611 | 32 | JN106171 | Uncultured bacterium plasmid pAKD26, complete sequence | 11289-11320 | 10 | 0.688 |
CP034953_3 | 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006580-1006611 | 32 | NC_016968 | Comamonas testosteroni plasmid pTB30, complete sequence | 11287-11318 | 10 | 0.688 |
CP034953_3 | 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006580-1006611 | 32 | NC_016978 | Comamonas testosteroni plasmid pI2, complete sequence | 11272-11303 | 10 | 0.688 |
CP034953_3 | 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006580-1006611 | 32 | NZ_CP017760 | Cupriavidus necator strain NH9 plasmid pENH91, complete sequence | 67078-67109 | 10 | 0.688 |
CP034953_3 | 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006580-1006611 | 32 | NZ_CP053554 | Diaphorobacter sp. JS3050 plasmid pDCNB, complete sequence | 4235-4266 | 10 | 0.688 |
CP034953_3 | 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006580-1006611 | 32 | NC_019263 | Delftia acidovorans plasmid pLME1, complete sequence | 11288-11319 | 10 | 0.688 |
CP034953_3 | 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006580-1006611 | 32 | NC_019264 | Delftia acidovorans plasmid pNB8c, complete sequence | 11288-11319 | 10 | 0.688 |
CP034953_3 | 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006580-1006611 | 32 | NC_019283 | Delftia acidovorans plasmid pC1-1, complete sequence | 11288-11319 | 10 | 0.688 |
CP034953_3 | 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006580-1006611 | 32 | NC_006830 | Achromobacter xylosoxidans A8 plasmid pA81, complete sequence | 11350-11381 | 10 | 0.688 |
CP034953_3 | 3.8|1006641|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006641-1006672 | 32 | NC_002580 | Propionibacterium freudenreichii plasmid p545, complete sequence | 2898-2929 | 10 | 0.688 |
CP034953_9 | 9.1|3457832|59|CP034953|CRISPRCasFinder | 3457832-3457890 | 59 | MT230312 | Escherichia coli strain DH5alpha plasmid pESBL31, complete sequence | 97-155 | 10 | 0.831 |
CP034953_2 | 2.2|980328|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 980328-980360 | 33 | MF158039 | Shigella phage Sf12, complete genome | 4974-5006 | 11 | 0.667 |
CP034953_2 | 2.2|980328|33|CP034953|PILER-CR,CRISPRCasFinder,CRT | 980328-980360 | 33 | MF158042 | Shigella phage Sd1, complete genome | 937-969 | 11 | 0.667 |
CP034953_3 | 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT | 1006334-1006365 | 32 | NZ_CP026128 | Acinetobacter baumannii strain ABNIH28 plasmid pABA-1fe1, complete sequence | 49165-49196 | 11 | 0.656 |
CP034953_9 | 9.1|3457832|59|CP034953|CRISPRCasFinder | 3457832-3457890 | 59 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 40375-40433 | 11 | 0.814 |
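
The identity values in the table above appear to follow from the spacer length and the mismatch count: a 32 nt spacer with 10 mismatches gives 22/32 ≈ 0.688, and a 59 nt spacer with 11 mismatches gives 48/59 ≈ 0.814. The sketch below assumes that relationship. It also assumes that the match line under each alignment marks identical bases with `*`, transitions (a/g or c/t) with `.`, and transversions with a space, which is consistent with the gap-free records shown here. The function names are hypothetical and are not taken from the database or from any of the tools named in the table.

```python
# Illustrative helpers only; the names are hypothetical and the marking
# convention ('*', '.', ' ') is an assumption inferred from the records.
PURINES = {"a", "g"}
PYRIMIDINES = {"c", "t"}


def identity(spacer_length: int, mismatches: int) -> float:
    """Fraction of matching positions, rounded to three decimals."""
    if spacer_length <= 0:
        raise ValueError("spacer_length must be positive")
    return round((spacer_length - mismatches) / spacer_length, 3)


def match_line(spacer: str, protospacer: str) -> str:
    """Build a match line for two equal-length, gap-free sequences."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length")
    marks = []
    for a, b in zip(spacer.lower(), protospacer.lower()):
        if a == b:
            marks.append("*")   # identical base
        elif {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            marks.append(".")   # transition
        else:
            marks.append(" ")   # transversion
    return "".join(marks)


def count_mismatches(spacer: str, protospacer: str) -> int:
    """Number of positions not marked '*' in the match line."""
    return sum(1 for mark in match_line(spacer, protospacer) if mark != "*")
```

Applied to spacer 3.3 and its Salmonella plasmid protospacer, `count_mismatches` returns 10, and `identity(32, 10)` gives the 0.688 reported in the table. Alignments that contain gap characters (`-`) are outside the scope of this sketch.
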
1. spacer 6.1|2799478|40|CP034953|CRISPRCasFinder matches to NZ_CP041417 (Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence) position: , mismatch: 0, identity: 1.0
gcgctgcgggtcattcttgaaattacccccgctgtgctgt CRISPR spacer gcgctgcgggtcattcttgaaattacccccgctgtgctgt Protospacer ****************************************
2. spacer 5.1|2135142|38|CP034953|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 2, identity: 0.947
cggacgcaggatggtgcgttcaattggactcgaaccaa CRISPR spacer cagacgcagaatggtgcgttcaattggactcgaaccaa Protospacer *.*******.****************************
3. spacer 2.2|980328|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LR134258 (Klebsiella aerogenes strain NCTC9644 plasmid 5, complete sequence) position: , mismatch: 4, identity: 0.879
tgtgtttgcggcattaacgctcaccagcatttc CRISPR spacer ggggttcgcggcgttaacgctcaccagcatttc Protospacer * ***.*****.********************
4. spacer 2.2|980328|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to LR134281 (Klebsiella aerogenes strain NCTC9793 genome assembly, plasmid: 6) position: , mismatch: 4, identity: 0.879
tgtgtttgcggcattaacgctcaccagcatttc CRISPR spacer ggggttcgcggcgttaacgctcaccagcatttc Protospacer * ***.*****.********************
5. spacer 2.2|980328|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to KY271401 (Klebsiella phage 1 LV-2017, complete genome) position: , mismatch: 4, identity: 0.879
tgtgtttgcggcattaacgctcaccagcatttc CRISPR spacer ggggttcgcggcgttaacgctcaccagcatttc Protospacer * ***.*****.********************
6. spacer 3.8|1006641|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NC_021229 (Arthrobacter nicotinovorans pAO1 megaplasmid sequence, strain ATCC 49919) position: , mismatch: 5, identity: 0.844
-ctgctggagctggctgcaaggcaagccgccca CRISPR spacer tccgctcg-gcaggctgcaacgcaagccgccca Protospacer *.*** * ** ******** ************
7. spacer 2.2|980328|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to KY653119 (Morganella phage IME1369_02, complete genome) position: , mismatch: 6, identity: 0.818
tgtgtttgcggcattaacgctcaccagcatttc CRISPR spacer aggttgtgcggcgttaacgctgaccagcatttc Protospacer * * ******.******** ***********
8. spacer 3.1|1006212|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP009293 (Novosphingobium pentaromativorans US6-1 plasmid pLA4, complete sequence) position: , mismatch: 6, identity: 0.812
ctttcgcagacgcgcggcgatacgctcacgca CRISPR spacer ctcacgcagacgcgcggcgacacgctcattct Protospacer **. ****************.*******. *
9. spacer 3.2|1006273|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to KY883647 (Vibrio phage JSF33, complete genome) position: , mismatch: 6, identity: 0.812
cagccgaagccaaaggtgatgccgaacacgct CRISPR spacer aagccaaagccaaagctgatgccgaaactgct Protospacer ****.********* ********** .***
10. spacer 3.8|1006641|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP017422 (Arthrobacter sp. ZXY-2 plasmid pZXY21, complete sequence) position: , mismatch: 6, identity: 0.812
-ctgctggagctggctgcaaggcaagccgccca CRISPR spacer tccgctcg-gcaggctgcaacgcaagccgcccc Protospacer *.*** * ** ******** ***********
11. spacer 2.3|980389|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP007129 (Gemmatirosa kalamazoonesis strain KBS708 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.758
gacgtggtcatgggtgctgctgttgcagagcca CRISPR spacer tccgtggtcgtgggtgctgctgttgctggagcg Protospacer *******.**************** *.. *.
12. spacer 3.1|1006212|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP007130 (Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence) position: , mismatch: 8, identity: 0.75
-ctttcgcagacgcgcggcgatacgctcacgca CRISPR spacer ggcttcacgaa-gcgcggcgatacgctctcgct Protospacer .***.*..* **************** ***
13. spacer 3.2|1006273|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to MN855762 (Bacteriophage sp. isolate 505, complete genome) position: , mismatch: 8, identity: 0.75
cagccgaagccaaaggtgatgccgaacacgct CRISPR spacer aagccgaagccaaaggtgatttcgagctggtc Protospacer ******************* .***.* *..
14. spacer 3.2|1006273|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NC_020548 (Azoarcus sp. KH32C plasmid pAZKH, complete sequence) position: , mismatch: 8, identity: 0.75
cagccgaagccaaaggtgatgc---cgaacacgct CRISPR spacer tagcagaagccgaaggtgatgccggcgagcag--- Protospacer .*** ******.********** ***.**
15. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NC_013856 (Azospirillum sp. B510 plasmid pAB510b, complete sequence) position: , mismatch: 8, identity: 0.758
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ctgatcgcgcattacctgcgcgtcgccgacgcg Protospacer * .********* ************** *
16. spacer 3.8|1006641|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to MK113951 (Phage 5P_3, complete genome) position: , mismatch: 8, identity: 0.75
ctgctggagctggctgcaaggcaagccgccca CRISPR spacer gagcatcggctggctgcaaggcaagctgcccc Protospacer ** .******************.****
17. spacer 3.8|1006641|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to AP017924 (Ralstonia phage RP12 DNA, complete genome) position: , mismatch: 8, identity: 0.75
ctgctggagctggctgcaaggcaagccgccca CRISPR spacer gcggaagcgctggctgcacggcaagcggccca Protospacer .* .* ********** ******* *****
18. spacer 3.12|1006885|32|CP034953|CRISPRCasFinder,CRT matches to NZ_AP018516 (Acetobacter orientalis strain FAN1 plasmid pAOF1, complete sequence) position: , mismatch: 8, identity: 0.75
gcaacgacggtgagatttcacgcctgacgctg CRISPR spacer tcaacgacggtaagatgtcacgcctaaagaat Protospacer **********.**** ********.* *
19. spacer 3.1|1006212|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to MN234174 (Mycobacterium phage Efra2, complete genome) position: , mismatch: 9, identity: 0.719
ctttcgcagacgcgcggcgatacgctcacgca CRISPR spacer gatcaccagacgcgcggcgtcacgctcacggc Protospacer *. ************* .*********
20. spacer 3.1|1006212|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to MN234165 (Mycobacterium phage Yunkel11, complete genome) position: , mismatch: 9, identity: 0.719
ctttcgcagacgcgcggcgatacgctcacgca CRISPR spacer gatcaccagacgcgcggcgtcacgctcacggc Protospacer *. ************* .*********
21. spacer 3.1|1006212|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to MN234201 (Mycobacterium phage Guanica15, complete genome) position: , mismatch: 9, identity: 0.719
ctttcgcagacgcgcggcgatacgctcacgca CRISPR spacer gatcaccagacgcgcggcgtcacgctcacggc Protospacer *. ************* .*********
22. spacer 3.2|1006273|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015585 (Roseomonas gilardii strain U14-5 plasmid 1, complete sequence) position: , mismatch: 9, identity: 0.719
cagccgaagccaaaggtgatgccgaacacgct CRISPR spacer cagctggagccaaaggtgatgcccgtgcggat Protospacer ****.*.**************** . * *
23. spacer 3.2|1006273|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP054618 (Azospirillum oryzae strain KACC 14407 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.719
cagccgaagccaaaggtgatgccgaacacgct CRISPR spacer gcgccgaaggcaaaggtgttgccgaggccgag Protospacer ******* ******** ******. **
24. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP010957 (Sphingobium sp. YBL2 plasmid 3pYBL2-3, complete sequence) position: , mismatch: 9, identity: 0.727
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ctcgacccgcataccttgcgtgtcgccgcctcg Protospacer * . * ********.****.**********
25. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052797 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N18S2039 plasmid pN18S2039, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
26. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052795 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0125 plasmid pN19S0125, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
27. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP047882 (Salmonella enterica subsp. enterica serovar Infantis strain 119944 plasmid pESI, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
28. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052804 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S973 plasmid pN17S0973, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
29. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP038508 (Salmonella enterica subsp. enterica serovar Infantis strain FARPER-219 plasmid p-F219, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
30. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052802 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S976 plasmid pN17S0976, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
31. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052788 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0611 plasmid pN19S0611, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
32. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052840 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S024 plasmid pN16S024, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
33. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052786 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0641 plasmid pN19S0641, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
34. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052838 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S097 plasmid pN16S097, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
35. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP028316 (Salmonella enterica subsp. enterica serovar Typhimurium var. 5- strain CFSAN067217 plasmid pSC-31-2, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
36. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP051676 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1234 plasmid pN16S1234, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
37. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052783 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0679 plasmid pN19S0679-1, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
38. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052836 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S103 plasmid pN16S103, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
39. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP022063 (Salmonella enterica strain FDAARGOS_312 plasmid unnamed3, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
40. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052781 (Salmonella enterica strain CVM N19S0949 plasmid pN19S0949, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
41. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052834 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S041 plasmid pN17S0041, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
42. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052793 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0388 plasmid pN19S0388, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
43. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052779 (Salmonella enterica strain 19TN07GT06K-S plasmid pN19S1233, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
44. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052832 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1040 plasmid pN17S1040, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
45. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP031362 (Salmonella enterica subsp. enterica serovar Heidelberg strain 5 plasmid p3, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
46. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052830 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1105 plasmid pN17S1105, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
47. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052828 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1126 plasmid pN17S1126, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
48. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052826 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1245 plasmid pN17S0637, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
49. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP016409 (Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502916 plasmid pFSIS1502916, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
50. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052824 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1265 plasmid pN17S1265, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
51. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052822 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1349 plasmid pN17S1349, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
52. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP016407 (Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502169 plasmid pFSIS1502169, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
53. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052820 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1442 plasmid pN17S1442, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
54. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP016413 (Salmonella enterica subsp. enterica serovar Infantis strain CVM44454 plasmid pCVM44454, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
55. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP016411 (Salmonella enterica subsp. enterica serovar Infantis strain N55391 plasmid pN55391, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
56. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052816 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1598 plasmid pN17S1598) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
57. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052814 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S349 plasmid pN17S0349, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
58. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP022662 (Salmonella enterica subsp. enterica strain RM11065 plasmid pRM11065-2, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
59. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052812 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S376 plasmid pN17S0376, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
60. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052810 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S535 plasmid pN17S0535, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
61. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052808 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S637 plasmid pN17S0637, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
62. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052806 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S816 plasmid pN17S0816, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
63. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052791 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0552 plasmid pN17S0637, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
64. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052818 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1509 plasmid pN17S1509, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
65. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP052799 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S990 plasmid pN17S0990-1, complete sequence) position: , mismatch: 10, identity: 0.688
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tactccctgtcggttgtgtttgatagcgcctc Protospacer .***************. ******..*..
66. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to CP046443 (Pseudomonas coronafaciens pv. coronafaciens strain B19001 plasmid unnamed2, complete sequence) position: , mismatch: 10, identity: 0.697
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ttcgctgcgcatctcctgcgcgtcgccgccggt Protospacer . ...****** .**************** *.
67. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LT963392 (Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP1, complete sequence) position: , mismatch: 10, identity: 0.697
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ttcgctgcgcatctcctgcgcgtcgccgccggt Protospacer . ...****** .**************** *.
68. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LT963392 (Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP1, complete sequence) position: , mismatch: 10, identity: 0.697
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ttcgctgcgcatctcctgcgcgtcgccgccggt Protospacer . ...****** .**************** *.
69. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP034079 (Pseudomonas syringae pv. pisi str. PP1 plasmid pPP1-1, complete sequence) position: , mismatch: 10, identity: 0.697
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ttcgctgcgcatctcctgcgcgtcgccgccggt Protospacer . ...****** .**************** *.
70. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP034080 (Pseudomonas syringae pv. pisi str. PP1 plasmid pPP1-2, complete sequence) position: , mismatch: 10, identity: 0.697
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ttcgctgcgcatctcctgcgcgtcgccgccggt Protospacer . ...****** .**************** *.
71. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NC_005918 (Pseudomonas syringae pv. maculicola strain ES4326 plasmid pPMA4326A, complete sequence) position: , mismatch: 10, identity: 0.697
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ttcgctgcgcatctcctgcgcgtcgccgccggt Protospacer . ...****** .**************** *.
72. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP047262 (Pseudomonas syringae pv. maculicola str. ES4326 plasmid pPma4326A, complete sequence) position: , mismatch: 10, identity: 0.697
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ttcgctgcgcatctcctgcgcgtcgccgccggt Protospacer . ...****** .**************** *.
73. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP026560 (Pseudomonas amygdali pv. morsprunorum strain R15244 plasmid p3_tig5, complete sequence) position: , mismatch: 10, identity: 0.697
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ttcgctgcgcatctcctgcgcgtcgccgccggt Protospacer . ...****** .**************** *.
74. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LT963406 (Pseudomonas syringae pv. avii isolate CFBP3846 plasmid PP4, complete sequence) position: , mismatch: 10, identity: 0.697
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ttcgctgcgcatctcctgcgcgtcgccgccggt Protospacer . ...****** .**************** *.
75. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to LT985193 (Pseudomonas syringae strain CFBP 2116 genome assembly, plasmid: PP2) position: , mismatch: 10, identity: 0.697
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ttcgctgcgcatctcctgcgcgtcgccgccggt Protospacer . ...****** .**************** *.
76. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LT963393 (Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP2, complete sequence) position: , mismatch: 10, identity: 0.697
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ttcgctgcgcatctcctgcgcgtcgccgccggt Protospacer . ...****** .**************** *.
77. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LT985210 (Pseudomonas syringae pv. cerasicola strain CFBP 6110 plasmid PP1, complete sequence) position: , mismatch: 10, identity: 0.697
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ttcgctgcgcatctcctgcgcgtcgccgccggt Protospacer . ...****** .**************** *.
78. spacer 3.5|1006457|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LT985211 (Pseudomonas syringae pv. cerasicola strain CFBP 6110 plasmid PP2, complete sequence) position: , mismatch: 10, identity: 0.697
cgaatcgcgcataccctgcgcgtcgccgcctgc CRISPR spacer ttcgctgcgcatctcctgcgcgtcgccgccggt Protospacer . ...****** .**************** *.
79. spacer 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP028970 (Aminobacter sp. MSH1 plasmid pUSP2, complete sequence) position: , mismatch: 10, identity: 0.688
gactcaccccgaaagagattgccagccagctt CRISPR spacer acgtcaccccggaagcgattgccagcacacgc Protospacer . ********.*** ********** .* .
80. spacer 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP053984 (Achromobacter pestifer strain FDAARGOS_790 plasmid unnamed, complete sequence) position: , mismatch: 10, identity: 0.688
gactcaccccgaaagagattgccagccagctt CRISPR spacer aggtactgacgaatgagaatgccagccagctt Protospacer .. * . **** **** *************
81. spacer 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NC_010935 (Comamonas testosteroni CNB-1 plasmid pCNB, complete sequence) position: , mismatch: 10, identity: 0.688
gactcaccccgaaagagattgccagccagctt CRISPR spacer aggtactgacgaatgagaatgccagccagctt Protospacer .. * . **** **** *************
82. spacer 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to JX469826 (Uncultured bacterium plasmid pB12, complete sequence) position: , mismatch: 10, identity: 0.688
gactcaccccgaaagagattgccagccagctt CRISPR spacer aggtactgacgaatgagaatgccagccagctt Protospacer .. * . **** **** *************
83. spacer 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to JN106171 (Uncultured bacterium plasmid pAKD26, complete sequence) position: , mismatch: 10, identity: 0.688
gactcaccccgaaagagattgccagccagctt CRISPR spacer aggtactgacgaatgagaatgccagccagctt Protospacer .. * . **** **** *************
84. spacer 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NC_016968 (Comamonas testosteroni plasmid pTB30, complete sequence) position: , mismatch: 10, identity: 0.688
gactcaccccgaaagagattgccagccagctt CRISPR spacer aggtactgacgaatgagaatgccagccagctt Protospacer .. * . **** **** *************
85. spacer 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NC_016978 (Comamonas testosteroni plasmid pI2, complete sequence) position: , mismatch: 10, identity: 0.688
gactcaccccgaaagagattgccagccagctt CRISPR spacer aggtactgacgaatgagaatgccagccagctt Protospacer .. * . **** **** *************
86. spacer 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP017760 (Cupriavidus necator strain NH9 plasmid pENH91, complete sequence) position: , mismatch: 10, identity: 0.688
gactcaccccgaaagagattgccagccagctt CRISPR spacer aggtactgacgaatgagaatgccagccagctt Protospacer .. * . **** **** *************
87. spacer 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP053554 (Diaphorobacter sp. JS3050 plasmid pDCNB, complete sequence) position: , mismatch: 10, identity: 0.688
gactcaccccgaaagagattgccagccagctt CRISPR spacer aggtactgacgaatgagaatgccagccagctt Protospacer .. * . **** **** *************
88. spacer 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NC_019263 (Delftia acidovorans plasmid pLME1, complete sequence) position: , mismatch: 10, identity: 0.688
gactcaccccgaaagagattgccagccagctt CRISPR spacer aggtactgacgaatgagaatgccagccagctt Protospacer .. * . **** **** *************
89. spacer 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NC_019264 (Delftia acidovorans plasmid pNB8c, complete sequence) position: , mismatch: 10, identity: 0.688
gactcaccccgaaagagattgccagccagctt CRISPR spacer aggtactgacgaatgagaatgccagccagctt Protospacer .. * . **** **** *************
90. spacer 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NC_019283 (Delftia acidovorans plasmid pC1-1, complete sequence) position: , mismatch: 10, identity: 0.688
gactcaccccgaaagagattgccagccagctt CRISPR spacer aggtactgacgaatgagaatgccagccagctt Protospacer .. * . **** **** *************
91. spacer 3.7|1006580|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NC_006830 (Achromobacter xylosoxidans A8 plasmid pA81, complete sequence) position: , mismatch: 10, identity: 0.688
gactcaccccgaaagagattgccagccagctt CRISPR spacer aggtactgacgaatgagaatgccagccagctt Protospacer .. * . **** **** *************
92. spacer 3.8|1006641|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NC_002580 (Propionibacterium freudenreichii plasmid p545, complete sequence) position: , mismatch: 10, identity: 0.688
ctgctggagctggctgcaaggcaagccgccca CRISPR spacer ccctgagagctggctgccacgcaagccgctgg Protospacer *. . .*********** * *********. .
93. spacer 9.1|3457832|59|CP034953|CRISPRCasFinder matches to MT230312 (Escherichia coli strain DH5alpha plasmid pESBL31, complete sequence) position: , mismatch: 10, identity: 0.831
-cggagcacttattgccggatgcggcgtgaacgccttatccggcctacggttctggcacc CRISPR spacer tcagtgcac-gatcgccggatgcggcgtgaacgccttatccgtcctacggttctgtgctc Protospacer *.* **** **.**************************** ************ .*
94. spacer 2.2|980328|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to MF158039 (Shigella phage Sf12, complete genome) position: , mismatch: 11, identity: 0.667
tgtgtttgcggcattaacgctcaccagcatttc CRISPR spacer attgtttgcagcattaacgctccccaagtgccg Protospacer *******.************ ***. ..
95. spacer 2.2|980328|33|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to MF158042 (Shigella phage Sd1, complete genome) position: , mismatch: 11, identity: 0.667
tgtgtttgcggcattaacgctcaccagcatttc CRISPR spacer attgtttgcagcattaacgctctccaagtgccg Protospacer *******.************ ***. ..
96. spacer 3.3|1006334|32|CP034953|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP026128 (Acinetobacter baumannii strain ABNIH28 plasmid pABA-1fe1, complete sequence) position: , mismatch: 11, identity: 0.656
ggctccctgtcggttgtaattgataatgttga CRISPR spacer tttgaactgtcggttgtaattggcaatgtatc Protospacer . ****************..*****
97. spacer 9.1|3457832|59|CP034953|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 11, identity: 0.814
cggagcacttattgccggatgcggcgtgaacgccttatccggcctacggttctggcacc- CRISPR spacer ggtacggctttttgccggatgcggcgtaaacgccttatccggcctacggtt-tggtgcga Protospacer * * .*** ****************.*********************** ***..*
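The `mismatch` and `identity` values in the hits above appear to be related by identity = (spacer length − mismatches) / spacer length, rounded to three decimals. This relation is inferred from the listed numbers (for example 32 nt with 10 mismatches gives 0.688); the report does not state it. The Python sketch below recomputes both values for hit 57. The function names `count_mismatches` and `identity` are illustrative and not part of any tool named in this report. The mismatch counter assumes an ungapped alignment, so it does not apply directly to gapped hits such as 93 and 97.

```python
def count_mismatches(spacer: str, protospacer: str) -> int:
    """Count differing positions between two ungapped, equal-length sequences."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have equal length (ungapped alignment)")
    return sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))


def identity(spacer_length: int, mismatches: int) -> float:
    """Fraction of spacer positions that match the protospacer.

    Assumed formula, inferred from the values listed in the report.
    """
    if spacer_length <= 0:
        raise ValueError("spacer_length must be positive")
    return (spacer_length - mismatches) / spacer_length


# Spacer 3.3 and the protospacer from hit 57 (32 nt, no gaps).
spacer_3_3 = "ggctccctgtcggttgtaattgataatgttga"
protospacer_57 = "tactccctgtcggttgtgtttgatagcgcctc"

mismatches_57 = count_mismatches(spacer_3_3, protospacer_57)
identity_57 = identity(len(spacer_3_3), mismatches_57)
print(mismatches_57, f"{identity_57:.3f}")
```

For hit 57 this counts 10 mismatches, and 22/32 = 0.6875 agrees with the reported 0.688 after rounding. The same formula reproduces the other listed values, for example 23/33 ≈ 0.697 and 48/59 ≈ 0.814.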
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1020414 : 1027553
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP034953|1020414:1027553|DBSCAN-SWA ATTAACTCCTTAATTCCGCAATTTCACCTGCGGTCAGATAACGGATCGGGCGGTCACCGAGAATAAAAATCAGCTTTGCCGTTTCCTCCAGCTCTTCCATATTGTTGGCGGCTTCTTGCAGGCTTTCACCGCAAACCACTGGGCCATGATTTGCCAGTAAAAAAGCCTGATTGTCTGCTGCCAGTTCCGCCAGATCCTGTGCGATGCGTTTATCGCCCGGTCGGTAATAAGGCACCAGCGGGACATTTCCCATCCGCATCACCACGTATGGTGTGAACGGACGAATAACGTTGCTGCTGTCCAGCCCTTGCAGGCAGGAAAGCGCCGTCGACCATGTGCTGTGCAAATGCACCACCGCTTTACAGCGCGGATTGTTGCGATACAGCGCCAGATGAAAGAGCACCTCTTTCGAGGGTTTGTCACCACTTAACCATTCGCCATCCGCGGCGACTTTGGAAAACCGCTGCGGATCGAGATTGCCCAGGCATGAACCTGTCGGTGTCGCCAGTAAATTCCCGTCAGGTAAAAGCAGCGACAGATTGCCAGCCGAACCGGTTGCATAGCCGCGCTGAAAGAATGAACTGGCAATCCGCGTCATCTCCTCTCGCAAAGACTGCTCTACTTTTGCGAAATCGCTCATGATAAAAACTCTCTTTGGGCTCGTGAAAAAAAGGCGTCATCACCGAAGTTGCCAGATTTAAGGGCGAGTGAGACAGGCTTATCCAGTGCGTTTACCCACGGCACGCCGGGGAAATGGTTGGGCCAATATGAAACCCTTTTATTCCCAGGCTCTGTGTGACTACGCCGGAGGTCTCACCGCCTGCGACAATAAAGCGTGTCACGCCTTCCGCTGCTAACCGCGCCGCTAGTTGAGAAAACAGTGTTTCTACTGCCTGACTGGCTTTTTGTGCACCGTATTGCTGTTGAATTGCTGCCAATGCGTCAGTGCTGGCGGTGGCAAAAACCAGTGGAGCAAGTACACTTTCCTGGCCCAGAACCCACTCTGCCAGTTCGTGTGCATAAGCGGCCAGAGTTTCAATTGAGAGGCAGCGTGCCACATCAACTTCACGGGCTGGTGCAATTTGACGGTAATGTGCTACCTGGCGGTTGGTCATTTGAGAGCATGAACCGGAGAGCACTACGCCGCGCCCAGCGAGCGGACGCCCTGCTTTGCGAGCCTGGTTACCGTTTTCTTGCGCCCACTGCCGGGCCAGGCCAATCGCCAGACCAGAACCGCCCGTTACCAGTGGGGCATCGCGCAAGGCTTCTCCCTGAATTTCCAGATGGTGTTCGGTCAGCGCATCAAGCACCGCGTAGCGGTAGCCCTCTTGCTGTAAGCGAGCCAGCTCTTGACGAACGGCATCCACACCTTGTTCGAAAACATGTGCCGAAACGACGCCGCAGCGCCCTGTGGATTGCGCTTCAACCAGACGGGGAAGATAGCTGTCGGTCATGGGATTTACCGGGTGATGGCGCATCCCGGATTCGGCCAGCAGTTGATTCATTACGAACAAATACCCCTGATAAACCGTACGTCCGTTGACCGGCAGGGCCGGAGAGAAGACCGTAAACGGCGTGTCGAGAGCATCCATTAAGGCATCGGTAACCGGGCCAATATTACCTTTCGCCGTACTGTCGAAAGTAGAGCAGTATTTGAAATAGATCTGTTTGCAACCTTGCTGTTGCAACCAGCTCAGAGCCGCCAGCGATTGCTGTGTGGCTTCAACCACTGGACAGGAGCGCGTTTTCAGGCTGATCACCAGTGCGTCGATTGCTTCCGGCATTTTACCTGTTGGAACACCGTTAATTTGTACCGTTGGTAGACCGTTTTCCACCAGAAAACTGGCGATATCCGTCGCGCCGGTAAAATCATCGGCGATAACGCCAATCTTGATCATGATTTCGCTCCCGGTAGAGTGATGCCAG
AGAAAATCTTGATAACTGCGCTATCGTCTTCTTTCCCGTAACCCGCGTTACTGGCGCTGGTGAACATATTCAATGCTGTTGAGGCCAATGGCAGCGGGAAGTGCAGGGCTTTGGCTGTATCGGCAACCAGACCAAGATCCTTAACAAAAATATCGACGGCTGAATGCGGGGTGTAATCGCCATCCACCACATGACGCATCCGGTTTTCGAACATCCAGGAATTTCCGGCGGCATTGGTCACGACGTCATACATCACATCCAGCGGGATCCCCGCACGGGCTGCAAGTGCCATCGCTTCGGCTCCGGCAGCAATATGTACGCCCGCTAACAACTGGTGAATAATTTTTACGGTCGAACCTAGTCCCGGTTCTGCACCTATGCGATAAACTTTTCCGGCAACGGCTTCCAGCACGGGTGCCAGTCGTTCAAAGGCAATATCGCTACCGGAGGCCATGACAGTCATTTCACCGTTAGCGGCTTTTACTGCACCACCAGAAACTGGCGCATCCAGCATTTCCAGATCGAATCCAGCCAGAGCGGTAGCAATTTCTTGCGCATCAGCACTAGCGATAGTGGAAGAAACCATTACTGCCGTACCGGGTTTCAGATGTTGTGCAACGCCTGTTTCACCAAACAGCACCTGTTTAACCTGGGCCGCATTGACCACCAGCACCAGCAGTGCGTCCAGTTTTTCGGCAAACGTCGCGGCGTTATCAGAAACCCCGCAAGCACCTGCCTCTTTCAACGTAGCGCAGGCATTGCTGTTCAGGTCTGCGCCCCAGGTAGAAAGACCTGCGCGGACATATGACAGTGCTGCTCCCATTCCCATTGACCCTAAGCCAACGATACCGACATGAAACTCAGATCCCGTTTTCATCTGCTCTCCTTGTTAATTTAAGTGATATTTTGTTTGATATTGTGAATATAAGCGCTGGAAGATAACGGTATGGTGATCTGATTCACATAAATTAACATTGTGTGTTATTTTATGTGAACTAAGCGTTAGTTGCGCCGCGCACGTTTCGCAGGCAAATAGCGTAGAATGTCAGCAGGACAAAGGAAGGAGCAAAAGTTGATACCCGTAGAGCGTCGCCAAATCATCCTTGAGATGGTAGCTGAAAAAGGCATTGTCAGTATTGCTGAACTAACGGACAGAATGAATGTGTCACATATGACCATTCGTCGGGATTTACAAAAACTGGAGCAGCAGGGAGCCGTTGTGCTGGTGTCCGGAGGCGTCCAGTCTCCGGGACGCGTGGCGCATGAACCTTCTCATCAGGTAAAAACTGCGCTGGCAATGACGCAAAAAGCGGCTATTGGCAAGCTGGCGGCAAGTCTTGTTCAGCCGGGAAGTTGTATCTATCTGGATGCGGGAACGACCACGTTAGCGATAGCACAGCATCTGATTCACATGGAGTCACTGACTGTGGTCACAAACGATTTCGTTATTGCGGACTACTTGCTCGACAACAGTAATTGCACAATTATTCACACTGGCGGTGCAGTGTGTCGGGAAAACCGTTCCTGTGTCGGGGAAGCCGCTGCGACCATGCTGCGCAGCCTGATGATTGATCAGGCTTTTATTTCTGCATCGTCATGGAGTGTGCGGGGGATTTCTACGCCAGCGGAAGATAAAGTCACGGTGAAACGGGCGATTGCCAGTGCCAGCCGCCAGCGAGTTTTGGTCTGTGATGCGACGAAATATGGTCAGGTGGCGACATGGCTGGCGTTACCATTAAGCGAGTTTGATCAGATTATTACAGACGACGGTTTGCCGGAGAGTGCCAGTCGCGCGCTGGTGAAGCAGGATCTCTCTTTGCTGGTAGCGAAAAATGAATAATGGCCTGCAATAACACCGGGTTACTCATGCTTCACAGAAGAAGCATGAGACTACTTTATTTTATAAAATGACAGCCGCCCGCTGTTCGGCGATCCGGTATCAATATAAATCTGGTTAGCGAACGTCTGAATGTTATCAAACATCATATGTCCAAATATAA
AATAATCAGCGCCGTTTATTTGTTGTAACTCGCCATTAAGCGATTTCTGCACACGATCAACAGGCCAGAGTAATTCGCTCTCCGCTATTTCTTTACCAAAGAGATATTCACTCCCCGGATAATCTGCATGTGCGATGGCATATTTTATGTTGTCGTTAGTGATTTCAATAATATGTGGAAGGTGATGGAATTTCAGCAACAGATCTATTGCCTCTTGTTGCTCTGAATCATTTAAATCGAAAAACCAGTCACCACCGCTGGCAAGCCACATATTGCCATCGCCAGTTTCGAATGCCTCAAGCGCCATCGCTTCGTGGTTGCCTTTAACCGACGTAAACCAGGGTTGGTTTAGCAGGCGCAGGACGTCAAGACTCTCCGGTCCACGATCAATATTATCGCCGACAGAAATAAGTAAGTCGATTTTGGGGAAAAAAGAGAGTTGATGTAAGCGGGATTGTAATAACTGATATTCACCATGAATATCACCAACGACCCATATATGGCGATAGTGATGGGCATTGATTTTTTGATAGCGTGTAGATGGCATGGTTTTACCCTGTAAAATAAGCTTTCCTATTATACAGGGTGTTTTTATTTTATTCGTCAGTTGTCGTTAATATTCCCGATAGCAAAAGACTATCGGGAATTGTTATTACACCAGGCTCTTCAAGCGATAAATCCACTCCAGCGCCTGACGCGGGGTGAGTGAATCCGGATCAAGATTTTCCAGAGCTTCGACCGCAGGCGAAGTTTCTTCTGGTACTGACAGCAAAGACATTTGCGTACCATCCACTTGCGTAGCGGCGGCGTTCGGCGAAATGCTTTCCAGCTCACGCAGCTTTTGCCGTGCGCGCTTAATAACCTCTTTTGGCACGCCTGCCAGAGCTGCAACCGCCAGGCCGTAGCTTTTGCTCGCCGCGCCATCCTGCACGCTGTGCATAAAGGCAATGGTGTCGCCGTGCTCCAGTGCATCGAGATGCACGTTAGCGACGCCTTCCATTTTCTCCGGTAACTGGGTCAGCTCGAAATAGTGGGTAGCAAATAACGTCAATGCCTTAATCTTATTCGCCAGATTTTCCGCGCACGCCCACGCCAGCGACAGACCATCGTAGGTGGACGTTCCACGCCCGATCTCATCCATTAACACCAGACTGTATTCGGTGGCGTTATGTAAAATATTGGCGGTTTCAGTCATCTCCACCATAAAGGTTGAGCGCCCGGACGCCAGGTCATCTGCCGCGCCTACGCGGGTAAAGATGCGATCGATAGGTCCAATCTCGACTTTTTGTGCCGGTACATAGCTGCCGATGTAGGCCATCAGCGCAATCAGTGCGGTCTGGCGCATATAGGTACTTTTACCGCCCATGTTCGGACCGGTGATGATCAACATGCGGCGCTGCGGCGACAGATTCAGCGGGTTGGCGATAAATGGCTCATTCAGTACTTGTTCAACTACCGGATGGCGACCTTCGGTAATGCGAATGCCCGGTTTATCAATGAAGGTCGGGCAGGTGTAGTTCAGGGTATAGGCCCGTTCCGCCAGGTTAACCAGCACGTCGAGTTCCGCCAGCGCGCTCGCGCTCTGTTGCAACGCTTCCAGATGCGGCAACAGCAGGTCGAACAGCTCTTCATAAAGCTGTTTTTCCAGTGCCAGTGCTTTGCCTTTTGAGGTGAGAACTTTATCTTCGTACTCTTTTAGCTCTGGAATGATGTAGCGCTCGGCGTTTTTCAGCGTCTGGCGACGCATGTAGTTGATGGGTGCCAGATGGCTTTGCCCACGGCTGATTTGAATGTAGTAGCCGTGCACCGCATTAAAGCCAACTTTCAGCGTGTCCAGGCCGGTACGTTCACGCTCGCGGACTTCCAGACGCTCCAGATAATCGGTCGCGCCGTCAGCCAGCGCGCGCCACTCATCCAGCTCTTCGTTATAGCCCGATGCGATAACACCACCGTCGCGTACCAGCACCGGCGGTGTGTCGATGATTGCTCGCTCCAGCAGA
TCGCGCAGCTCGGCAAACTCGCCCATCTTCTCACGTAGCGCCTGTACCGGTGCACTATCGACAGTTTCTAACTGCGCACGCAGCTCCGGCAGTTGCTGGAAAGCGTGGCGCATACGGGCCAGATCGCGTGGGCGAGCAGTTCGTAAAGCCAGACGTGCCAGAATACGTTCCAGGTCGCCGACCTGACGCAGTACCGGCTGTAGCCCGGCGGTGAAATCCTGCAATGCGCCAATAGTTTGCTGGCGCTCAAGCAACACGCGGGTATCGCGCACTGGCATATGCAGCCAGCGTTTCAGCATACGGCTGCCCATCGGCGTGACGGTGCAGTCGAGCACAGAAGCCAGCGTATTTTCCGCACCACCCGCCAGGTTCTGGGTGATTTCCAGATTACGACGCGTCGCGGCATCCATAATGATGCTGTCCTGCTCACGTTCCATGGTGATGGAACGAATATGCGGCAGAGTCGTACGTTGGGTATCTTTCGCATACTGCAACAGACAACCGGCAGCACAAAGTCCGCGCGGCGCGTTCTCGACGCCAAAACCGACCAGATCGCGGGTCCCAAATTGCAGATTCAACTGCTGGCGCGCGGTGTCGATTTCAAACTCCCACAGCGGGCGACGGCGCAGGCCGCGACGGCCTTCAATTAACGACATTTCAGCAAAATCTTCTGCATACAGCAGTTCCGCAGGATTAGTGCGTTGCAGTTCTGCCGCCATCGTTTCGCGGTCAGCCGGTTCGCTCAGGCGAAAACGCCCGGAACTGATATCCAGCGTCGCGTAGCCGAAACCTTTGCTGTCCTGCCAGATAGCCGCCAGCAGGTTGTCCTGACGCTCCTGCAACAGGGCTTCATCGCTGATGGTGCCTGGCGTAACGATACGCACAACTTTGCGCTCAACCGGACCTTTGCTGGTCGCCGGATCGCCAATTTGTTCGCAGATGGCAACGGACTCTCCCTGATTCACCAGTTTGGCGAGATAGTTTTCCACCGCATGGTAGGGAATCCCCGCCATCGGGATCGGCTCTCCCGCCGAAGCACCGCGTTTGGTCAGTGAAATATCCAGCAGTTGCGACGCGCGTTTTGCGTCGTCATAAAACAGTTCATAAAAATCACCCATCCGGTAAAACAGCAGGATCTCGGGATGCTGGGCTTTCAGCCTGAGATACTGCTGCATCATGGGCGTATGGGCGTCGAAATTTTCTATTGCACTCAT
Protein sequences of DBSCAN-SWA_1 >CP034953|1020414:1027553|1021144_1022311_-|QAA88752.1|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQARKAGRPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSIETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPACRG >CP034953|1020414:1027553|1022307_1023216_-|QAA88753.1|DBSCAN-SWA MKTGSEFHVGIVGLGSMGMGAALSYVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNAAQVKQVLFGETGVAQHLKPGTAVMVSSTIASADAQEIATALAGFDLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGAEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS >CP034953|1020414:1027553|1024991_1027553_-|QAA88756.1|DBSCAN-SWA MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAGLQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV >CP034953|1020414:1027553|1024229_1024886_-|QAA88755.1|DBSCAN-SWA 
MPSTRYQKINAHHYRHIWVVGDIHGEYQLLQSRLHQLSFFPKIDLLISVGDNIDRGPESLDVLRLLNQPWFTSVKGNHEAMALEAFETGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDNIKYAIAHADYPGSEYLFGKEIAESELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGSPNSGRLSFYKIK >CP034953|1020414:1027553|1020414_1021053_-|QAA88751.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRFSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS >CP034953|1020414:1027553|1023411_1024179_+|QAA88754.1|DBSCAN-SWA MIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGSCIYLDAGTTTLAIAQHLIHMESLTVVTNDFVIADYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSVRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALVKQDLSLLVAKNE |
6 | Escherichia_phage(83.33%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_2 |
1408134 : 1419344
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP034953|1408134:1419344|DBSCAN-SWA TTTACCCATTGGCGCGGCTTAAGAGCTTATTTTTGAATTCACAATGGTCACGATATAACCATCTTGCTCGCCCGTGGATAACTTTGGCTTTTGGCAGGTCGCCTGACTTAATCCGGTCATAGATGAAGGTTTTACCAAAGCCAGTATCAGCCATGATGAATTTCAAATCAACCAGTGAATCAGGCTGTAGTTCGTGTTGCATGAGTGCTATCTCCGAATAGGGAATCGAACCTGCAAATCAGGTAATAAAAATACGCTCTATGACGGCGATGGTAGATCAGGATATTTTAAGAAACTGACAGGCCTCATCGAGTGTGAGGCTGTATGGCTCTATTATTTCACCTCTTGTTGTGACATTGTTGAAAAATGGATACCAGCTCGTTGCTGCCAGATGATCCAACCGAGAGTCATATCCCATGCCATGTATTCGTTATCGCCGTTTTTTGCTCTCCGACGATCTACTAAGTCACCGAAACGCTTTTCCATGAATAATTCATAGGCTTCGCGTTCATCTGGCTCTACTTCCAGAGATACGAGTGCGATTTCATAAGCACGGCGCTCAATATCGTCTCGAACCTCTAGGCTGCTGATTCGTTCTTTGATTTCTTTAATCAGTTCTTTATTGGTAAATGTGGTCATTATGCTCCAGCCTCCGGTGCTTTTGGCATTACTGCCCAGTGAGTGATATTGACGTTTTCAAGGTCCCCAACCTGAAATGTCCACTGCCATTCTCCGGTTTCTTTTTGTCCCCAGGTGTACCAGAGAGAACGCCAGCCAATCAGCCAGCCTTCTCCGTTAGCATCAAATAACAGAACACTTTCATTTGCTGGTGGCAGTTCAGCTGACACTGGTATTATTTTGTTTTCCAGTGCCGCACATTTAGCTTCAAGCGCGTCGAATTTACGTACCAGGTACTCAGCATTTGTTTCGTTCACTTTCAGATCTCGCGGTACACATTTCCCGCGAAGAAACCCTTCCATTTCGAAAACATTCATGCGCATTTGCGTAACTCCGATAAATCGTTAAAACGTTCCATAAACATCCCGTAGGCATGACCCGGTGCCAGTGGAATCACGTTGAACATCTCTGTTGCCGGGATGCCTTCCAGTACAGGCCAGAAAGAGCCATCATCAAGCCCGAGATCACGGCGTTCGGTTGCCAGCATAATGAGATCGGCATATTTCACTGGCGTGCTCATAACAGGAGGTAACCCGTATTTCTCACGGATTACTGCATCTATTTTTTCTTCCATCCGTTTATAGTCAGGAAGAAGGCGTTTCAGTGGTGCGGGGATGTCCTGGCAATACGCTTCTGTTGCATCATGCATTAATGCTTCAAAAGCAAATTCCTGCGGCACCAGCTGGCTGCAAAGCACCGCATGTTGGGCGACACTGTAGAAGTGAGAAAGATGACCGGCAAAGCGGCAGATATTTGAAAGGGAAACCGCGATATCGTTAATCACGATGTCGTCTTTATTTATCTTGTCATAATAAAAATGCTTCCCGGAAAAAGTTTTAATAAATGACATTTTGTTCTCCACGTATATGCGCTGCACCGCGCTGAGTTTGGGTAAAAGGAAGCCCTCACCATCCGGTGATTATTGAGTTAATTACGTTTCCATAAATGCCCCCGCAGGGGCATTTGCAGTAATGAAATCAGGCGGTGAAAGTACCAATAAAGGTTTCTACTTTGCTGTCTTTGAATTTCTCAACAAGCAGATCACGAAATTCGTTAGCCATATCTTCCTGCACCGCTTCCAGCTGAATAATGCGCAGAACCAGTACAGGACGATCGCCAGTGATAATGCTGAGGCGTAATTTAAACGGACGTTCTTTCAGACCTTCAAACGGAACGCATTTAAATTCAAATGCCACTGGCATAATGTCTTTGGTCTTCGCTTCGACAGATTCCATCAGGGAGCGTTTG
CCGCTGAAGTCATTATCTTCAAAATCAGCGGTCTGGTTTGCTTCAATCGTGATTTTACGGATTGCCGCAGCCGCTTTTGTTGCCTGAATGGCGTCACCATTAGCATCAAAGCCCACAAGGTAGTCGGCCCAGTCTTCAATCCATTCTGCCAGTGATTTCTGGGAGTTACGCTCGCCGTTAACAGACAACAGGGCAGAGAACGGTGCTGTCTTTTTCAGTTTGAGAGTGGCGGTGTTATCTGCGTGACCTGGTTCATCAATAGTACCCAGGTTAAGCACACTGACGGCACGCATATTATCAGCATCGATAAAGCAGCGGGTGCCTTCATCTGCAAGATCTTTAGAATAACGGGTAAAGTCATCGATGCTGGCAGTGGAAAGCGCACCACGGAAACGGAAGCGATTTAAATTAAATTTTTCCAGATCATGAATGCGGAAATTCTCAGGCAATGCCACAGCATCGGCACCAATCTTACTGATAATTTCATTAACACCCTGAGCAGAAATAAGGGCATGGATTTGATTAATTGCGGTTGCGTCTAAGTTCTGAGACATAATAAGTCCTCACTATATTAAGATATTCAGTGATGAGATAAATAATTAGTTAATTAAGAATGATATTAATGACCTGCTGCGCGGAGTTTTCCGTCAGGCTCACCGGCAAGAGTCAGTAATTGTCCCTGGTCTTCCTGCAGAATAGTCAGGCGACCACCGCGATTGACATACATCGGCGTTTCGGTGGTGTCTTCTTCGGAAATTTTCCCACGGTTAGTCGGGCGAACATATGAGAGTTTGTGTTTGATTTTCAGACGGTTCTCATCAAATGGTTCGATTTCCAGGTTGAGTGAGACCTTACCTTTGGTTTTCGTGTTCATCACACCGGAAGCGACTTCACTGAGAACTGCGCCGATTTTGGTTTCAAATACGCCGCCGTCCAGCTCCCCGATAAATGCCTGCACATCAGTACTGCGTTCGCTAGCCATTTTGCTGCTCCTCATCATAACCGCAATGCGTCGAATGTTTATCAGCTTAACGTTGCGAAGCTTCAGGCAGCGGCATTTTCTCAACTGTCAGACTCTGACCCGTCAAAATCTGACGCATCAAAATCTGACCCGTCAAAATTTGATGCGTCGAAATCTGGCAAAAAATCGGGTTTTCACCCGTCAGAATCTGGCGGGGATCCGTCAGTAAAATCAAAACATGATCCGTCAGATAAAAAACCTTCTCGTCCGGACGCTTCGCAACCGGACACGCAGACGGATGAACAGGATTTTTTAACTCGCCATCCTGATGCGGTTGTATTCAGCCCTAAAAAGCGCCAGTGGGGAACGCAGGATGATTTGACCTGCGCACAGTGGCTCTGGAAAAAAATCATCGCCCTGTACGAGCAGGCCGCCGAATGTGACGGCGAGGTGGTTCGTCCCAAAGAACCGAACTGGACAGCCTGGGCAAACGAAATTCGCCTGATGTGTGTGCAGGATGGTCGTACTCACAAACAAATCTGCGAGATGTACAGCCGCGTCAGCCGCGATCCGTTCTGGTGCCGTAACGTGCTCAGCCCGTCGAAGCTGCGGGAAAAATGGGATGAGCTTTCCCTGCGCTTATCGCCGTCCGTCAGCACGTACACAGAAAAACGCGAAGACCCGTACTTCAAAGCCAGTTACGACAATGTGGACTACAGCCAGATCCCGGCAGGATTCAGGGGGTGAGCATGAGTCTTTTGAATGACGTTCAGAAATTCATTGAAGCCCATCCGGGCTGTACTTCCGGAGACATTGCGGATGCTTTTGCAGGTTACTCACGGCAGCGCGTTCTGCAGTCAGCAAGCAAGTTACGTCAGAGTGGGCGTGTGGCTCACCGTTGTGAAGGAGATACACACAGACATTTCCCGCGCCTGACTGAGAGAGCGCAGGATCCGGAACCACAACCAGTTCGTGAAACCAGACCTGTGCGCAATTTCTATGTCGGCACTAACGACCCGCGGGTGATTTTGTGCCTGAC
CCGCCAGGCGGAAGAACTGGAGTCCAGGGGCTTATACCGTCGTGCTGCAACGGTGTGGATGGCGGCATTCCGTGAAAGCCACTCCCAGCCAGAACGAAACAATTTTCTGGCGCGTCGTGAACGGTGTTTACGGAAAAGCAGTAAGCGGGCTGCATCAGGTGAAGAGTGGTATCTCTCAGGGAATTACGTGGGGGCTTAATGAGTAATAAATATTGCCAGGCGCTGGTGGAGCTGCGGAACAAACCAGCCCATGAACTGAAGGAAGTGGGCGATCAGTGGCGCACGCCGGACAACATTTTCTGGGGAATTAACACCCTGTTTGGCCCGTTTGTTCTGGATCTGTTTACTGATGGTGATAACGCCAAATGTGCCGCGTATTACACTGCGGAAGACAACGCGCTGGCGCATGACTGGTCAGAACGTCTTGCGGAGCTTAAAGGTGCTGCCTTTGGTAATCCCCCGTACCCGTGGTAACCCCCAGACCGGCACGCCTGCCACCGATCTGGATGATGACTACTTTGACATGTTGCAGGAGGAGCTTTGCAGCGTTGTGGAGGCCTCCGGTGCCAGCCTGGAGAAGGGGCGGCATGACCAGCTGCTTACCGCGCTTCGTGCGCTGCTGTTAAGCCGCAAGAATCCGTTTGGCGATATCAAATTGGACGGCACGGTGCAAAAGGCTCTCGAAAACCTTGGTTTGGGAGAAGCGGCTAAAAGAAATGTAGGGACAGGGGCGAATCAGATACCTGATATGGGTAGTTTTATGCTTTCTGCTTCAGTTCCTGGATATCAAAAATTGCCATCCGGTTTAATTATTCAATGGGGGCCAATTGATGTTCCGCTGACGTCTCAGGACACAGTAACCTATTTTCCGATTGCATTTCCGAATAGATGTCTGCGGGTATTTGCGACTCAGGATTACACTCCCGGCAGTGCAAATGTTGGCTATATAGCTTGTGCCGGTTATAACCAGGACCCGGTGAAATTTATTTCCAGAGCAGGCGTACCCGGTATCGGTGCTTCATTTTTCGCATTAGGGTGTTAATTTCTTTTAACTTTATGGAGTGAAAAATGAATTACATATATTCCGCGACTACAAACTCTTTCTATCCCTTGGAGATGAAAGAGGATTACACGCAAGCTGGCTCATGGCCAGATGATGCTGTTGAAGTTGATGAGCAAGTGTATATTGAGTTTTCCGGATTACCGCCGAAAGGAAAAATCCGTATCGCTGGAGAAAATGGTTTTCCTGCATGGTCTGAAATTCCACCACCAACACATGAGGAACAGATTGCTGCAGCCGAACTGAAAAAGCAGCAATTGATTAATCAGGCCAACGATTATATGAACAGTAAACAATGGGCTGGTAAAGCGGCTATTGGTCGTCTGAAAGGTGAGGAACTGGCGCAATATAATTTGTGGCTGGATTATCTGGACGCACTGGAGCTGGTCGATACTTCCAGTGCGCCAGATATTGAATGGCCTACTCCTCCGGCAGTTCAGGCCAGATGACATCCGGCGCGGTGCTGGTATCTGTTGCCGTCACCGCGTCAATGTAATCCAGCACAGTGTTAAGTCGGGAGGTCTCTGCCTGCATCAGCTTCCGCCCAGCCTGTAATTTCAGTTGAATCAGACTAATGGAAGCCATTGCAGTATCAATCAGCGACTGGCGCTGTGCTTCTGCTGCATCTACTGCGACGCTATGCTGTGCCTCGGTATCCGTCACCCATTTCTCACCATCCCATTTATCGTATGGCGTTAATGGGGCGATAGTCGCGGGACGCATACAAACATAACCCCTGGATGTTATGTGTCTATCGAGAATCAAAGTGGAAGTCCTTATGTTAAGTATAAGACAGGAATCACTTATAACGATAATCAGTTAGTATATGTATCTATCACTGTTGATGATAATATCAGCACTTGGTTCTGGAGGGGGTTTGTTGTGGGCAATGATGCATTTAAGTTATCGTCTGCAGATAGAGGAGATATTACAATAAACA
ACGAATCAGGGCATTTGATAGTCAATACCGCAATTCTATCAGGAGATATAGTCACTCTAAGAGGAGGAGAAATTAGGTTGGTATTATAGCTTGTGCGCGCCATGATTGGCGCGCAATTTAAACTTAGTGCTTTACATCGCTATTGTCTTGATTTCTTTGAATTATTTTATAAATTAAAAAAACGACTGTTATGTATAAGCAAAGGTCGAACGAAAAATACATTCCAAATAAATGCTTGCTTAAATCTCTATATCCTTCCCCGAAAAATGACACATAAAATTGAGATATTCCAAAAAGAGATACTACAAATAAAGATGCCTTTATTTTATTATTTCTAATAAAAATAGAAGCAATAAAAAATAATAACAATGATATAAATCTAATGTTTTTAAATATATTGTCTTTTATGTTAGTAATAGTCGTTAGTATGTTTGATTCTCCATATATTACGTGTAGTTTTTTATATACATGGAAATAATTTTCTTTATACTGAGACATCACACCATCATCAAATGGAAGTTTGAAGATGGTGCTTGGTTTGCTAACCAATAAAAAGAGTGCATTCGAAAACGTTTCATCTTTATGAGATTCGAAACATTCCGTTCCAACTTCTGTTGGGGTTGCGCCAAATGATATGTCGAATTTATTACCCCAGGCATCTAACCCAACACACTTATCATCAACATACGATGGCATTTTATACCCGTTGTTTTTCATATAAAGATAACTACCGAAGTATGTTGCATGGTACTTATTTAATTCAACAACACCTTTTGAATAAGATATTGCAAATATTGACGCAAGCAAGCACACCACGCATATTACAGATTTAATAATTAGTTTGTGTCTATCAAAAAAAATATAATATGAATACACTATTAGTGGGGTTAATATAAATTGATTTTTAGCAGTAGATATTATTAGCAACGAAAAAAATAGCAAAAGCATAGATTTATTGTTTTTGCATGTTAATGAATACACCAAAAATGGAAGGAAAATTATAACTATTTGCTCTTGATAGAATGATGCAAACATTGACAGTGTTGATGAAGAACACATAAGAAAAGCTAAAATAAAGAATGTAAATAAAGTTACATACCTTTCTGTTTTTATATATGAAGTAAATATCGCATATAATGAATAAATATATATTACTTTTAGAAATAACGATAACACCCTAGCATCAAGATATTCAGTAAAAATGCTAATAAACAGGACGTATGTATACAAAATATAACTATATGAACTTATATACTCATAGCTATCAATTGAATCAATATGACCTTTCAATTTATACAATAAAGGTAAGTCATATGTAAATGCTGGTATGTCTTCAATTAATGGCTTAATGGCTCTTCCGAAATCAGATGTATTTAACATCATTATGTTTTTAGATAAGGCGCAAATAATCAAAACAAAAGATATATACAATGATACTTTTATTGCTTTATTCATTTTTTGACTCTCTTGATGATGTATTTCGGGCGTTTTTTGGTTTCAATGTATGTGCGTCCAATATATTCACCTAATACTCCTATTCCAATCATCTGAATTCCACCTAAAAACAGTATTGAAACAAGTAGTGAAGGATATCCCCTAACAGCATTTCCAAATATGATAGTATCTAAAATCATCCACGCCCCATAAATAAATGCTACACTGGCTACCACTAACCCTATGTATGTCCAGATGCGAAGAGGGAATGTGGAAAAGCTTGTAATACCCTCAAGTGCTAAATTCCAAAGTTTCCATCCATTAAATTTTGTATCTCCAGCAATTCTTTCCGCTCGCACGTATTCAACAATATCTGTCTTTCCTCCTACCCAGCTCAGAATACCTTTCATGAAAAGGTTTCGTTCTGGCATAAGTTTAATATTTTCGACAACATCACGGCTCATCAGCCTGAAATCACCAACATTCTCTTCAATTTTAGGATTGCTTATTTTATTGTGGAGCTTATAGAACCACTCAGCCGTTTTTCGCTTCAGGCGTCCATCAGT
TGAGCGGTCAGATCTTTTAGCAAGAACCATATCAGCACCTGCTTGCCATTTTTCAATAAGATGAGGAATAACCTCAATCGGGTCTTGCAGGTCAACATCAATTGGGATTATCGCATCCCCGGTTGCATGGTCTAACCCTGCAAACAATGCTGGTTCTTTACCAAAGTTGCGTGTAAATGACAGCGGAACAACTAGAGGATCTGAAACAGCCAGAGCATTAATGATTGACTCCGTAGCGTCTTTGCTGCCGTCATTTATGAAAACGATTTCCACTTCATATGACTTCAATTCTTCGAATTCACGTACCGTTTTATAAAAAATTGGTATCGCTTCTTCTTCATTGAAGACAGGAACTACAAGAGATATCTTCATTTCGCATCCCTAAAGACAATGAACTTTGAATAGACGAAACCGCACACCAGGCTGATGGCGGAGAAGGTGACAAGAGTTATCATCGGGGGAAGTGCGCATCTATCAGCAGCCCATCCAACAGTAGCACTCAGTGTCCCCATGAACCCAACATATAGCATGTAGCGCATCGTTGTAGTCGATGCCTTGAATGTGAATTTTGCATTCGCGAAGAAGCTAAAGCTCACAGCCACAACGAAACCTGCGAAGTTTGCAAGAGCTTGGTTTGTATGCGCGACATAGATACAAACACCAAAAACCACCCAGTGTATAAGGGTGTTCAGCACACCAATAGAGGTGTACTTTGCAAATAGCTTTAACATTTCTTCTATCAGCTAATAATCAAAGGAATGAAGTCTATCATCCAAGTCTTAATCGATCGATACTTGCTGTGGTTGATGAGACAAAACTTATACACACAAAGCTTTGCACTGGATTGCAAGACTTTGTGCTATTCGATAGTTGTTAAGGTCGCTCACTCCACCTTCTCATCAAGCCAGTCCGCCCACCATTGCATCATTTCTCTGCGTTTATCGAGATACTGAGCATGGTTGTAAATCCCACGCACAGATCCGCCGTTTGCATGTGCCAGTTGCACTTCAATAGCGTCAGCAGGCCATTCGTGCTCGTTCATAATCGTGCTGAATTCATGCCTGAATCCGTGACCGCTTTCCAGACCCTCATAGCCGATTTGTTTGATCACAAGCAGTACCGCGTTCTCGCAGATTGGCTTCTTCTTATCGTTGCGCCCGGCAAAAACAAACTCTGAGACTGGTTTGGTGATGGAGCTTAGCGTAGTGAGAAGTTCAACTACCTGGTCTGACATAGGAACCACATGAATTTTGCGTCCTTTCATCACACTGGCGTCGATGGTGATAATCCTATTTTCAAAATCGACGTTCTTCCATAGCATGGAACGAAGCTCTTTCGTTCTTAGGGCTGTGTAGCGTAAAACTTTGGTCGCAATGAGCGATACGATACTTCCTGAAAATGTTGCCAGTGCTTTGTTGAATGCCGGGATCTGGTCTGCAGGAAGAAACGGGAAGTTCTTCTTGCGGTATCCCTTCATGGCGTCAGCAAGGTCAGGTGCCGGGTTATATTTAGCCCTTCCGGTGACAATAGCGTAACGGAAAACCTCGCCGCATCTTCTGCGTGCTTTGTTGGCTCGTTCCATTGCACCGCGATCTTCAAACCTGCGGATTACTTCCAGCAGTTGCATCGGCTCAATATCCTGAATTTCAAGGCCGCCAATGATAGGTAAAATGTCGTCGTCAAACATTTTGGCAAGTTCAGTTGCATACCCTACTGACCATACTTGCTTCTTGTGCTCGTACCATTCCTTGTAAATCGCACTAAAGGAATTGTTGTTAGACGAAGCCTTTTTCGCCTTTACAGGATCGATGCCAACCGAGATGTCTTTCCTCGCAGTCCATGCTTTATCCCTTGCCTCCTGCAAAGTCATAAGCGGATATTTTCCGACGGTCAGGATTTTCTCCTTACCGTCAATCTTGTAGCGAAGCTGCCATACCTTTTTCCCTGACACAGGGACATAAAGGTACAGGCCATTACCATCGAGAAGGCGGTATGGTTTT
TCTTTCGGCTTTGCTGCTTCAATCTGCTTAACGGTGAGCATGGGTAAAAATCCGGTGGGTAAAATTATTTTATCCACTTTTTACCCGTCATGGAGTGCGGCTGTCAACGATCTGAAGCGAACCATGACGAACTGTAAATCTACGGAATGCTTGATATTCAGGGGATTTTGCGGACTGGTACGGATGGGAGCGAACTGATAAATGGTGTCCCCTGCAGGAATCGAACCTGCAATTAGCCCTTAGGAGGGGCTCGTTATATCCATTTAACTAAGAGGACAATGCGGCATGAGTATACCCGCTAATGGAGTGCGGGGTAAGTACGCTGCCGCTCGATTGCTTAAACCCTCGCCATTTATGCCGGGTTTTTATAATTTTTCTTAATGTTTTCCGCACGTTCTGCTTTTTGGCGTGCTTCTGCTTTACGCTTATTGCTCATGTCGTTACGAATCTGTGCATGACTCATTAACGCGAAGATAAAGGTGCCGCCGCAGATGTTCCCCGCTAAAGTAGGTAGTGCGAAGGGCCAGATGAAATCGCTCCAGTGCAGCGTACCGTTAAACACCAGATAGAGGATTTCAACAGAACCGACCACGATATGGGTGGTGTCACCCAGGGCAATAAGCCAGGTCATCAATATAATCACCACAATCTTTGCCGCACCCGCTGCAGGAAACATCCAAACCATAGTGGCGATCAGCCAGCCGGAAATGATCGCGTTGGCAAACATCTCGCTGGGGGTGTTCTTCATCACATCCATGCCGATTTTGACAAATGCATCGCGAGTTTCTTCATTGAAGATAGGCATATATTCAAATGCCCACGCCGCAATACCTGTCCCGAGAATATTACCCAGCAGCACGACGCCCCATAACCGTATAAGTAAGCCGACGTTGCTCATTGTCGGTTTTTGCATGACGGGTAGTACCGCAGTCACGGTATTTTCGGTAAATAATTGCTGGCGGGCCATAATGACGATAATAAAACCAAAGGTATAACCGAGATTCTCCAGCAAGAAGCTGCCCGGCACACCTTCCAGTTCGACTTGAAATATCCCTTTTGCCAGTAACGAAGCGCCCATCGACAGACCCGCCGCAATGGCTGACCACAGTAGCGCCATTGCGTCGCGTTCCAGCTCTTTTTCACCATCCTGGCGGATATGCTCATGAATTGCCATCGCCCGGGAGGGGAGTCGGTCTTCATCTATTTCTATTTTTTTGCCGCGCTCTTTTTCTTCGCTCTCAACTTCAATTTCGTCGCTGTGTTGATCAATTTTGTCGTTGTCCAT
Protein sequences of DBSCAN-SWA_2 >CP034953|1408134:1419344|1414182_1415514_-|QAA89104.1|DBSCAN-SWA MNKAIKVSLYISFVLIICALSKNIMMLNTSDFGRAIKPLIEDIPAFTYDLPLLYKLKGHIDSIDSYEYISSYSYILYTYVLFISIFTEYLDARVLSLFLKVIYIYSLYAIFTSYIKTERYVTLFTFFILAFLMCSSSTLSMFASFYQEQIVIIFLPFLVYSLTCKNNKSMLLLFFSLLIISTAKNQFILTPLIVYSYYIFFDRHKLIIKSVICVVCLLASIFAISYSKGVVELNKYHATYFGSYLYMKNNGYKMPSYVDDKCVGLDAWGNKFDISFGATPTEVGTECFESHKDETFSNALFLLVSKPSTIFKLPFDDGVMSQYKENYFHVYKKLHVIYGESNILTTITNIKDNIFKNIRFISLLLFFIASIFIRNNKIKASLFVVSLFGISQFYVSFFGEGYRDLSKHLFGMYFSFDLCLYITVVFLIYKIIQRNQDNSDVKH >CP034953|1408134:1419344|1408771_1409134_-|QAA89095.1|DBSCAN-SWA MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPANESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA >CP034953|1408134:1419344|1410678_1411041_-|QAA89098.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRLKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGH >CP034953|1408134:1419344|1409124_1409661_-|QAA89096.1|DBSCAN-SWA MSFIKTFSGKHFYYDKINKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFNVIPLAPGHAYGMFMERFNDLSELRKCA >CP034953|1408134:1419344|1408466_1408772_-|QAA89094.1|DBSCAN-SWA MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEPDEREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTLGWIIWQQRAGIHFSTMSQQEVK >CP034953|1408134:1419344|1413866_1414148_+|QAA89103.1|DBSCAN-SWA MSIENQSGSPYVKYKTGITYNDNQLVYVSITVDDNISTWFWRGFVVGNDAFKLSSADRGDITINNESGHLIVNTAILSGDIVTLRGGEIRLVL >CP034953|1408134:1419344|1409788_1410613_-|QAA89097.1|DBSCAN-SWA MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAIRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEDMANEFRDLLVEKFKDSKVETFIGTFTA >CP034953|1408134:1419344|1415510_1416431_-|QAA89105.1|DBSCAN-SWA 
MKISLVVPVFNEEEAIPIFYKTVREFEELKSYEVEIVFINDGSKDATESIINALAVSDPLVVPLSFTRNFGKEPALFAGLDHATGDAIIPIDVDLQDPIEVIPHLIEKWQAGADMVLAKRSDRSTDGRLKRKTAEWFYKLHNKISNPKIEENVGDFRLMSRDVVENIKLMPERNLFMKGILSWVGGKTDIVEYVRAERIAGDTKFNGWKLWNLALEGITSFSTFPLRIWTYIGLVVASVAFIYGAWMILDTIIFGNAVRGYPSLLVSILFLGGIQMIGIGVLGEYIGRTYIETKKRPKYIIKRVKK >CP034953|1408134:1419344|1416427_1416790_-|QAA89106.1|DBSCAN-SWA MLKLFAKYTSIGVLNTLIHWVVFGVCIYVAHTNQALANFAGFVVAVSFSFFANAKFTFKASTTTMRYMLYVGFMGTLSATVGWAADRCALPPMITLVTFSAISLVCGFVYSKFIVFRDAK >CP034953|1408134:1419344|1413127_1413568_+|QAA89102.1|tail|DBSCAN-SWA MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQAR >CP034953|1408134:1419344|1416942_1418100_-|QAA91874.1|integrase|DBSCAN-SWA MLTVKQIEAAKPKEKPYRLLDGNGLYLYVPVSGKKVWQLRYKIDGKEKILTVGKYPLMTLQEARDKAWTARKDISVGIDPVKAKKASSNNNSFSAIYKEWYEHKKQVWSVGYATELAKMFDDDILPIIGGLEIQDIEPMQLLEVIRRFEDRGAMERANKARRRCGEVFRYAIVTGRAKYNPAPDLADAMKGYRKKNFPFLPADQIPAFNKALATFSGSIVSLIATKVLRYTALRTKELRSMLWKNVDFENRIITIDASVMKGRKIHVVPMSDQVVELLTTLSSITKPVSEFVFAGRNDKKKPICENAVLLVIKQIGYEGLESGHGFRHEFSTIMNEHEWPADAIEVQLAHANGGSVRGIYNHAQYLDKRREMMQWWADWLDEKVE >CP034953|1408134:1419344|1411763_1412258_+|QAA89100.1|DBSCAN-SWA MSMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA >CP034953|1408134:1419344|1411398_1411767_+|QAA89099.1|DBSCAN-SWA MTCAQWLWKKIIALYEQAAECDGEVVRPKEPNWTAWANEIRLMCVQDGRTHKQICEMYSRVSRDPFWCRNVLSPSKLREKWDELSLRLSPSVSTYTEKREDPYFKASYDNVDYSQIPAGFRG >CP034953|1408134:1419344|1412582_1413101_+|QAA91873.1|DBSCAN-SWA MLQEELCSVVEASGASLEKGRHDQLLTALRALLLSRKNPFGDIKLDGTVQKALENLGLGEAAKRNVGTGANQIPDMGSFMLSASVPGYQKLPSGLIIQWGPIDVPLTSQDTVTYFPIAFPNRCLRVFATQDYTPGSANVGYIACAGYNQDPVKFISRAGVPGIGASFFALGC >CP034953|1408134:1419344|1412257_1412533_+|QAA89101.1|DBSCAN-SWA MSNKYCQALVELRNKPAHELKEVGDQWRTPDNIFWGINTLFGPFVLDLFTDGDNAKCAAYYTAEDNALAHDWSERLAELKGAAFGNPPYPW 
>CP034953|1408134:1419344|1408134_1408335_-|QAA89093.1|DBSCAN-SWA MQHELQPDSLVDLKFIMADTGFGKTFIYDRIKSGDLPKAKVIHGRARWLYRDHCEFKNKLLSRANG >CP034953|1408134:1419344|1418411_1419344_-|QAA89107.1|DBSCAN-SWA MDNDKIDQHSDEIEVESEEKERGKKIEIDEDRLPSRAMAIHEHIRQDGEKELERDAMALLWSAIAAGLSMGASLLAKGIFQVELEGVPGSFLLENLGYTFGFIIVIMARQQLFTENTVTAVLPVMQKPTMSNVGLLIRLWGVVLLGNILGTGIAAWAFEYMPIFNEETRDAFVKIGMDVMKNTPSEMFANAIISGWLIATMVWMFPAAGAAKIVVIILMTWLIALGDTTHIVVGSVEILYLVFNGTLHWSDFIWPFALPTLAGNICGGTFIFALMSHAQIRNDMSNKRKAEARQKAERAENIKKNYKNPA |
17 | Enterobacteria_phage(50.0%) | integrase,tail | attL 1404444:1404460|attR 1421354:1421370 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
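The keyword rule in the legend can be illustrated with a minimal Python sketch. The keyword list is copied from the legend. The matching rule (case-insensitive substring search) and the function name `phage_keywords_in` are assumptions made for illustration, not the tool's documented behavior.

```python
# Keyword list copied from the legend; the matching rule below is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keywords_in(annotation: str) -> list[str]:
    """Return the legend keywords found in an annotation string.

    Hypothetical helper: uses a case-insensitive substring test, which can
    over-match (for example 'plate' inside 'template').
    """
    lowered = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


# Labels as they appear in the DBSCAN-SWA_2 protein headers of this record;
# the empty string stands for a header that carries no label.
for label in ("integrase", "tail", ""):
    hits = phage_keywords_in(label)
    print(repr(label), "->", hits if hits else "not flagged")
```

A stricter variant could match whole words only, which would avoid the substring over-matching noted in the docstring.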
DBSCAN-SWA_3 |
1663847 : 1673288
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP034953|1663847:1673288|DBSCAN-SWA AATGATTGAATTTAGCCATGTCAGCAAACTGTTCGGCGCACAAAAAGCCGTTAACGATCTCAATCTCAATTTTCAGGAAGGGAGTTTTTCGGTGCTGATTGGCACATCTGGCTCCGGCAAATCCACCACCCTGAAAATGATTAACCGCCTGGTGGAGCATGACAGCGGAGAGATCCGCTTTGCCGGAGAAGAAATTCGCTCGCTGCCAGTACTGGAGTTGCGCCGCCGGATGGGCTATGCCATTCAATCTATTGGCCTGTTCCCCCACTGGAGCGTGGCGCAAAACATCGCTACCGTGCCGCAATTACAAAAATGGTCGCGGGCGCGGATTGACGATCGTATCGACGAATTAATGGCGCTACTGGGGCTGGAGTCAAATTTGCGTGAGCGTTATCCGCATCAGCTTTCCGGTGGTCAGCAGCAACGTGTGGGAGTGGCGCGTGCACTGGCTGCCGATCCGCAAGTCTTACTAATGGATGAACCTTTTGGCGCACTGGACCCGGTAACGCGCGGCGCGTTGCAACAAGAGATGACGCGCATTCACCGTTTGCTGGGGCGTACCATTGTGCTGGTCACTCATGATATTGATGAGGCGCTACGGCTGGCAGAACATCTGGTATTGATGGATCACGGTGAAGTAGTGCAGCAGGGCAATCCGCTGACGATGCTGACTCGTCCGGCGAATGATTTTGTCCGCCAGTTTTTTGGACGTAGTGAACTGGGTGTGCGCCTGCTTTCGTTACGTAGTGTGGCGGATTACGTGCGTCGCGAAGAACGAGCAGATGGTGAGGCACTGGCAGAAGAGATGACGCTACGCGATGCGCTCTCTCTGTTTGTTGCGCGGGGATGCGAGGTGCTGCCGGTGGTGAACATGCAGGGCCAGCCTTGCGGCACGCTGCATTTTCAGGATCTGCTGGTGGAGGCGTAAGCGTATGAAGATGTTGCGCGATCCGCTGTTCTGGCTCATTGCTCTGTTTGTGGCGCTGATTTTCTGGCTGCCTTACAGCCAGCCGCTGTTTGCTGCCTTGTTCCCACAACTGCCACGACCCGTTTATCAGCAAGAAAGTTTTGCAGCTCTGGCACTGGCTCATTTCTGGCTGGTGGGAATTTCGAGTTTGTTTGCGGTGATCATTGGCACTGGTGCCGGAATTGCTGTCACTCGCCCGTGGGGCGCGGAATTTCGCCCACTGGTGGAAACTATTGCCGCCGTTGGACAGACTTTTCCGCCCGTCGCAGTGCTGGCGATCGCCGTTCCGGTGATCGGCTTTGGTCTGCAACCAGCGATTATCGCCTTGATCCTTTACGGTGTGCTGCCCGTCCTGCAGGCGACACTTGCCGGGCTGGGAGCGATTGATGCCAGCGTGACAGAAGTTGCGAAAGGTATGGGAATGAGTCGTGGTCAGCGAGTGCGTAAGGTCGAGCTACCGCTGGCGGCTCCGGTGATTCTGGCGGGCGTGCGAACTTCGGTGATTATCAACATTGGTACGGCGACGATCGCCTCAACGGTAGGGGCCAGCACGCTGGGTACGCCCATCATCATCGGGCTTAGCGGATTTAATACCGCGTATGTGATCCAGGGGGCGTTACTGGTGGCACTGGCGGCGATCATCGCAGACCGCCTGTTTGAAAGGCTGGTGCAGGCGCTTAGCCAGCACGCAAAATAAAGGTATAACCTGCGAGCATGACGCCACCAATTCCGCCTAACGCCATAAACAGGAACAGGGCGATGACCCCAATTTTAGCTATGCGCATAATGCACTCCTTATGTTAACGAAAGGATTGTACAGTAAAGCGCATTTGTTAACGAATCATTAAATGCCGAGTGGGAAAATATCATGGCCTTGTTCTTGCCAACTGGTGAGTTGCTGCTGTTGGGCGGAGGTTCGATTTTCACCGCACCACACCAGCAATGTACGGCCTTCGAAT
AGTTCAGGGCGTAGTTGATTGAGCGAGTGGGCGAGGACATCAATGCGCCATCCTTGTTGACTGGCAATCCAGCCCTCCAGCCACAGACGGGTGGTATCCTGAATATTCCAGCCAACCACCAGCGCATCTTTACCCTGTTTTTTACGTGCCGAAGCCAGACAAATGGCGATGTAGTTGATCAGTACGCCGTCGAGGATCGCCAGCAGCGCCTGGAGAGTCGGTTGTTGGCACTGAAGCCGTCGGCGCAGAGGAATAAACAGATGTGTGGTGAGTGTCTGGGCGGGGTAATCCTGACCGCGCTCTTTGATCCACGTTCGCAGGCTATGTAGATTGCCGCTTTGCAGGTAAGTCAGTAATGTTTCTTGCTGATCGCGCCAGCCGTTCTGCACATCAACATTTTCATTACTGAGCAGCATTTTAACTTTGCTGACCTGCACGCCGTTGTCGATCCAGCGTTTGATCTCGCGGATCCGGTCAATATCGGCATCGTTGAACAGCCGATGACCGCCGTCTGTCCGTTGCGGTTTCAGCAATCCGTAACGCCTCTGCCACGCGCGTAACGTGACAGGATTAATATCACAAAGCAACGCCACTTCACCAATTGTGTAAAGCGCCATCGTCTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCAGTTTTGCGAACCAGGTAGTTTTGCCCGTTTTTTGTGCATCTATAGGGTGATTTTATTTTTGCCAGGCGATTTTGAGTGATCGTACTCACGAATTCTCATTTTTCTGCAAGAGTTCAAAGAAAGTTAAACGCAGGCAATGTATGTTACGCGTTTTAAAGGGAAGTGTGGTTTGCGGGTATGTACGATTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTTTTTTTAGTCATTGCATGGTTAATGAGTAAAACGCCATTATTCATACCGTTAATGCAGGTCACGGTTCGTCTGCCGCATAAATTTCTCTGCTACATCGTCTTTTCCATCTTCTGCATCATGGGCACCTGGTTTGGGTTGCACATTGACGATTCTATTGCCAATACCCGTGCGATAGGCGCGGTAATGGGCGGCTTACTCGGCGGTCCGGTCGTCGGTGGGCTGGTTGGGCTGACCGGCGGCTTACATCGATATTCGATGGGGGGCATGACCGCGTTAAGTTGCATGATCTCGACCATCGTTGAAGGATTACTCGGTGGCCTGGTACACAGCATCCTGATCCGCCGCGGGCGCACTGATAAAGTCTTTAACCCCATTACCGCCGGTGCCGTCACGTTCGTCGCTGAAATGGTGCAAATGCTGATCATCCTTGCGATCGCCCGACCTTATGAAGATGCGGTGCGTCTGGTGAGTAATATTGCTGCGCCAATGATGGTCACCAATACCGTCGGCGCGGCGCTGTTTATGCGTATATTGCTCGATAAACGCGCGATGTTTGAAAAATACACTTCGGCTTTTTCTGCCACTGCGCTGAAAGTGGCAGCCTCGACGGAAGGCATTTTGCGCCAGGGGTTTAACGAAGTGAACAGCATGAAAGTGGCTCAGGTGCTGTATCAGGAACTGGATATTGGTGCAGTCGCGATTACCGATCGAGAGAAATTGCTGGCCTTTACCGGAATTGGTGACGACCACCATTTACCCGGCAAACCGATTTCTTCAACTTATACTTTAAAAGCGATTGAAACCGGTGAAGTGGTCTACGCTGATGGCAACGAAGTACCTTACCGTTGCTCTTTGCATCCGCAATGCAAACTGGGGTCGACGCTGGTAATTCCGTTGCGTGGTGAAAATCAGCGGGTGATGGGCACCATCAAATTGTATGAAGCCAAAAACCGTTTATTCAGTTCAATCAACCGCACGTTGGGTGAGGGGATTGCGCAATTGCTTTCGGCGCAGATCCTTGCCGGGCAATATGAGCGGCAAAAAGCGATGCTCACCCAGTCAGAGATCAAACTGCTTCACGCCCAGGTGAATCCCCATTTTTTGTTTAATGCGC
TTAACACCATTAAAGCGGTGATCCGCCGCGACAGCGAACAGGCCAGCCAGCTGGTGCAGTATCTTTCCACTTTTTTCCGCAAAAACTTAAAGCGGCCTTCGGAGTTTGTTACTCTCGCCGACGAAATTGAACATGTGAATGCTTATCTGCAAATTGAAAAGGCGCGCTTCCAGTCGCGGTTGCAGGTCAACATTGCTATTCCGCAAGAATTATCCCAGCAGCAATTGCCCGCGTTTACCCTGCAACCGATAGTGGAAAACGCCATTAAACATGGGACATCACAACTGCTGGATACAGGGCGAGTGGCAATCAGCGCCCGACGTGAGGGGCAACATTTGATGCTGGAGATCGAAGACAATGCCGGTTTGTATCAACCGGTAACCAATGCCAGTGGGCTGGGGATGAATCTGGTGGATAAGCGTTTACGTGAACGGTTTGGCGATGACTATGGAATAAGCGTCGCCTGTGAGCCTGATAGTTACACCCGAATAACGTTACGACTACCATGGAGGGACGAGGCATGATTAAAGTCTTAATTGTCGATGATGAACCGTTAGCACGGGAGAACCTGCGTGTATTTTTGCAGGAGCAGAGCGATATTGAAATCGTTGGAGAGTGTTCAAACGCCGTGGAAGGGATCGGCGCGGTGCATAAACTGCGCCCGGATGTGCTGTTTCTCGATATCCAGATGCCGCGCATCAGTGGTCTGGAAATGGTGGGGATGCTTGACCCGGAACATCGCCCGTATATTGTTTTTCTCACTGCGTTTGACGAATACGCAATTAAAGCCTTTGAAGAACATGCCTTTGATTATCTGCTGAAGCCAATTGATGAAGCGCGACTGGAGAAAACGCTGGCGCGATTGCGTCAGGAGCGCAGCAAGCAGGATGTTTCGCTGTTACCGGAAAATCAACAGGCGCTGAAATTTATCCCTTGTACGGGGCATAGTCGGATTTATTTGCTGCAAATGAAAGATGTGGCATTTGTCAGCAGTCGGATGAGCGGTGTCTACGTTACCAGCCACGAAGGGAAAGAGGGCTTTACCGAATTGACATTACGTACCCTGGAAAGTCGTACACCACTACTGCGCTGCCATCGTCAGTATCTGGTTAACCTCGCGCATTTACAGGAGATTCGTCTGGAAGATAACGGCCAGGCCGAGTTGATTTTGCGTAATGGCTTAACCGTGCCGGTCAGCCGCCGTTATCTGAAAAGCTTAAAAGAGGCGATTGGCCTGTAAAAGACTGCTAAAATGGCTTTTTGCCTCATCAACACCTGAAGGCCTCATGCTAAGTAACGATATTCTGCGCAGCGTGCGCTACATTTTGAAAGCCAATAATAATGACCTGGTGCGTATTCTGGCGCTGGGTAATGTCGAAGCCACCGCGGAACAGATCGCCGTCTGGCTACGTAAAGAAGACGAAGAGGGTTTTCAGCGTTGTCCGGACATTGTTTTGTCGTCATTCCTCAATGGCCTGATTTATGAAAAACGCGGCAAGGATGAGTCTGCTCCGGCACTGGAGCCGGAACGTCGCATTAATAACAACATCGTGCTGAAAAAATTACGCATCGCGTTTTCGCTGAAAACCGATGACATTCTGGCTATCCTCACCGAACAGCAGTTCCGCGTTTCGATGCCGGAAATTACAGCGATGATGCGCGCACCGGATCATAAAAACTTCCGCGAATGCGGCGATCAATTTTTACGTTATTTTCTGCGTGGACTGGCAGCGCGCCAGCATGTGAAAAAAGGCTAAAAAATTGGCGGTCATGTTTAGAGCATGACCGCCAACCGATTATTTCACTTCTTTAAAACCAGCGGCTTTCATCACCAGTTCCATTTGCGCCATAGTGATACCTTTTTTGGCATCTTCAGCAGAAACGTTGATTCCTGAAATACCCTGCAGGGCTTTAAAATCCACTTTTTCCATATCGATAGTCACGTTTTCCTGCGCGTAGGTATCGGTATAGGTTAATTTTTCTTCAACACCCGCGATGTT
TTTGTATTTGGCGCTTAACGGCTCAAGTGTCTTGGCAGCATCTTCTTTGGTGGTTGCACCAATGGAGGCAAATTGAATTTTGGTTTCAGAAGATTGCTTAAGCACCTTGTCACCTTTGTAGACATAGGTAATGGCAATTTCAGTGCCGTTCAGATTGGCGCTGAATTTCTTCGATTCTTCTTTGTCACCGCAGCCAGCAAGAGAGAAAACCAGAACAGATGCAACAACGAGGGAAAACAGCTTATTGAAAGCCTTCATGTAAAACTCCATTTTATTTAATCAAGGAACTGGTGACTCTCACCAGGGGCTATATAGGATATGCCTAATACCGTGGCGTGAGCAGTCCGGAACTGGAGTAGAACTCTTAGTAAAAAGCACTATTTCATCCTTGTTGCTGAAGCATGGGGAATAATTGTTCGCAAAGCAAAACACCGTTATTCATTGCTTCTACCCGTGCCTCGCTTTCTGTATTACGAAATTGTCCCAACACATGTGCCAGCCGATAAAAACCCACCGCGGTGAGGTCATTCGCCAGCAACTATGCCTGACCAATAGCACTCTGTTCCTGATAGCGCCAGCCGTTATGGAGCAGTTGAATAAGTAACGCCTGGCAGCGTATCAGCAACTGATGAGCAGTAGACGGCACAGGCAAAACGCTGGCAGAAGGTAGCGGTGCCACAGGCGTAGTTTCTGCGTCCAGCGCCCAGGCACGGGTTTTTGTCATCATCACCCGTGGTTCCAGTGTCAATTGCCCTTCAACAAAACTGACAAAGCCAGAAACCAGACACACGGGGTCGTCTGTTTGTTGCAAAAGCGCCGCCATGCGTTCAACGGCATAAGGTGCGCTGGCTGAGGCTGGTAATGATAACGTCAGCACATTATCTTCCCCTTCGCCGCTAATGACCTGCGCATCCAGCGTCTGGCGGCTGCTGTCCCAACCGAGCGAAATACACTCAGCGACCGGCAGAATAAATAAGTTATCGACCTGATTAAGAGGCCGTATGCAGGCGGGGGGACGCTGGCGTAAATATTCCCGCAAAGCCACAATGCCCGGCTGGCGTAACGGCGCGCTCAACATTTGCCAGGCATCAGGCGACAGCGGCACAACGCTGCTTAAGCGGTTGCGGGTAGCTAACAGCAGCTCGCCATCGGCACTGCGTTTTGCTGCTTGTGAAACAATTTGCCCGCCCGCCAGTGCGCCAGCCTGAAAACTAAACAGCCGACGCGTAGCTGCCGGTGAGTTTTCCTGTTCACTTCGCGGCCAACTGCGCGAAAGGTGCAAAATACTGCCGGTGTCGGGATCGGTAAACCAGATGCGTAAACCATAATGCTCAATATCCTGCCAGCAACGCATACCTAAAGACACCAGCCGCAGATGATCAAGCTTTGCTTCTCCGGCAATGCCAGAGCCAACGACCGTGCGCCACGGCACAGGAGGAACTTCACCAATACTGTCGCGCCGGGCCATCTCTTGTGCGCAATTTAATCGACTGTTTAATGCCGCAAGCTGATGTAAGCATTCTCCGGCATTATAGTGGCTGGCGCGGGCGTGGAAGGCATCAACGCTGGCGCGCAGTTGCCGTAGCGATTCACTCACCCAGCGCCAGTTGCAGGTCTCTGCCGCCTGCAATGCGCGGTTGAATGCTGCCTCGTAATGGATGAGCGGCTGGCTGATGCCGCCAAGCCATAATGTCTGGCTTAATTGCTGAACATATTGACGACACGCGTTGCCTTCTTCGCTGGCAAACGGATCGTCAGATGATGTGACGTGTTCGCTGCGCATCTGCCAGATTAAATGGTTAAATTCTGCTTGCTGCGCTTTGGCCTCGACGAAGGCCTGTACCGCCAGTACGACATGTTCGCAAAGTGTGCCTTCAATACAATCACAACGGGCGAAACGAATACTGCTGCGGGAATAAAAACGCACATCGCTCATCGGTAAGCGGGCAGAGGGAATTTCACCCGGCGCACAGAACAACTCAATGGTGATGCCTTTAGCG
ACCAGCGCCTGTGCGCGTTTGCGGGTAGCATCGGGAAGGGTAGCCAGTTCTTCCAGCCAGATTGCCGGATCCCACTCTTCTTCTTTTTCCGTAGACTGAGTGGTGGCACAAAGTCGTTGATAACTTAACACCAGCATCACGCGATGACGGCACATACCGTTGGCCCCGCAACTGCACTGAGCCTCTTTCAGTGCCTGGCCGTTCGCCAGCTGGGTACGGACACCGTCACTGAAGGTGGCGATTAAAGCGTCGTTCTCATGGCTGATCTCCGGGACGTTGCCATTTTCCAGTTCCTTAAGACTGCGCTTAACAAAACCGGCATTGCTTAACGCCGTCAGGGCCTGCGGTGTCAGTTCTAATAATTCCGGACGTAGTGAATTCATGACTGAAGATTCTCCGCAAGCCATGATGCCAGCTCGCCCGGCGTCATGGCGGCTATTTGTGCGCCGACATTAACCAGCGCCTGGGCCGTATCGCGGTCATAGCAAGGTGTTGCGGTGCTATCGAGCGCTGCCAGTCCCAGCACTTTGATGCCGCTCTGGACACACTTTTTCACCTGATGCGTCAGTAATGATGATGAACCCCCTTCGTAAAAATCGCTCACGAGGATAATGACGCTTTTCGCTGGTTGTTCAATAAGTTGCCGACCATACTCCACGGCACTGGCGATATTGGTCCCGCCGCCCAACTGTACTTTCATTAATAACTCTACCGGATCGGCAACGTCTGCCGTGAGATCAACGACGCTTGTGTCAAACGCCACCAGATGGGTACGAATGCCGGGTAACTGCCACAAACAGGCCGCCATCACCGCAGAGTGGATCACCGAATCGACCATCGATCCGCTTTGATCAACCAGTAAGACCAGTTGCCATTGTTCGCTTTGGCGTTTAATGCGGCTGTTAAAGCGGGGGGATTCGATATACAACTTGCCGTGTTGCGGGTGCCAGTGTTGCAGGTTGGCGCGCAGAGTACTTTTGAAATCAAAGTTTCGCGCCAGTGGAATAAATGAACGGCGACGGCGATCGCGGACACCAGAAAAAGCCTGACGAACTTCCTTTGCCAGTCGAGCCATAATTTCTTCAACAACCTGGCACACTATCCGGCGGGCGGCAGCCAGTACTTCGGGGTTCATCAGATGTTTGGTGTGCAAAACAGCGCGTAGCAGGCTTTCAGAAGGCTGCATACGTTCCAGCACGTCGAGATTCGTCACCACATCTTCAATGCCGTAGCGCAGCACGGCATCGCTTTCCAGCCGCTCAATCACCTGCTGCGGAAACAGCGTGTGAATACTGTTGATCCACTCAGGGGTGGTGAGATTTGAGCCACCTAATCCACCGGAACGTTCACCACGCTGGAGCCGTTCAGGATCGCGCCCATACAGCCACTCCAGCGCGTGGTCTATCTGCCGGGCGTTGTCATCCAGCCCACAAAGCGTCGTTTCTGCCGCTTCGCCAAGAATTAATCGCCAGCGTTGTAGCTCACGGGTGGTCAGAAGATCGTTCAGTTCAGACAT
Protein sequences of DBSCAN-SWA_3 >CP034953|1663847:1673288|1663847_1664774_+|QAA89313.1|DBSCAN-SWA MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGEIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMALLGLESNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLTRPANDFVRQFFGRSELGVRLLSLRSVADYVRREERADGEALAEEMTLRDALSLFVARGCEVLPVVNMQGQPCGTLHFQDLLVEA >CP034953|1663847:1673288|1668292_1669012_+|QAA89318.1|DBSCAN-SWA MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >CP034953|1663847:1673288|1672151_1673288_-|QAA89322.1|DBSCAN-SWA MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGGSNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLAAARRIVCQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKSTLRANLQHWHPQHGKLYIESPRFNSRIKRQSEQWQLVLLVDQSGSMVDSVIHSAVMAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTAQALVNVGAQIAAMTPGELASWLAENLQS >CP034953|1663847:1673288|1670310_1672155_-|QAA89321.1|DBSCAN-SWA MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENDALIATFSDGVRTQLANGQALKEAQCSCGANGMCRHRVMLVLSYQRLCATTQSTEKEEEWDPAIWLEELATLPDATRKRAQALVAKGITIELFCAPGEIPSARLPMSDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVEAKAQQAEFNHLIWQMRSEHVTSSDDPFASEEGNACRQYVQQLSQTLWLGGISQPLIHYEAAFNRALQAAETCNWRWVSESLRQLRASVDAFHARASHYNAGECLHQLAALNSRLNCAQEMARRDSIGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDIEHYGLRIWFTDPDTGSILHLSRSWPRSEQENSPAATRRLFSFQAGALAGGQIVSQAAKRSADGELLLATRNRLSSVVPLSPDAWQMLSAPLRQPGIVALREYLRQRPPACIRPLNQVDNLFILPVAECISLGWDSSRQTLDAQVISGEGEDNVLTLSLPASASAPYAVERMAALLQQTDDPVCLVSGFVSFVEGQLTLEPRVMMTKTRAWALDAETTPVAPLPSASVLPVPSTAHQLLIRCQALLIQLLHNGWRYQEQSAIGQA >CP034953|1663847:1673288|1665490_1665598_-|QAA89315.1|DBSCAN-SWA MRIAKIGVIALFLFMALGGIGGVMLAGYTFILRAG 
>CP034953|1663847:1673288|1665657_1666389_-|QAA89316.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNGWRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKKQGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFEGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFPLGI >CP034953|1663847:1673288|1669568_1670030_-|QAA89320.1|DBSCAN-SWA MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKLTYTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK >CP034953|1663847:1673288|1666610_1668296_+|QAA89317.1|DBSCAN-SWA MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILAIARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDEA >CP034953|1663847:1673288|1664778_1665510_+|QAA89314.1|DBSCAN-SWA MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWGAEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRVRKVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQHAK >CP034953|1663847:1673288|1669058_1669529_+|QAA89319.1|DBSCAN-SWA MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQRCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLKTDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAARQHVKKG |
10 | Enterobacteria_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
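The protein headers in this record, for example `>CP034953|1663847:1673288|1663847_1664774_+|QAA89313.1|DBSCAN-SWA`, appear to pack the contig ID, the prophage region, the gene coordinates with strand, the protein accession, an optional keyword label (such as `tail` or `integrase`) and the predicting tool into pipe-separated fields. The Python sketch below splits such a header under that reading. The field names and the `ProteinHeader` class are hypothetical, inferred from the headers shown here rather than taken from a format specification.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ProteinHeader:
    """Fields inferred from the headers in this record (names are assumptions)."""
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    keyword: Optional[str]  # e.g. 'tail' or 'integrase'; None when absent
    source: str


def parse_protein_header(header: str) -> ProteinHeader:
    """Split a '>contig|start:end|gstart_gend_strand|accession[|keyword]|tool' header."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")
    contig, region, gene, accession = fields[:4]
    keyword = fields[4] if len(fields) == 6 else None
    source = fields[-1]
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProteinHeader(contig, region_start, region_end,
                         int(gene_start), int(gene_end), strand,
                         accession, keyword, source)


plain = parse_protein_header(
    ">CP034953|1663847:1673288|1663847_1664774_+|QAA89313.1|DBSCAN-SWA")
labeled = parse_protein_header(
    ">CP034953|1408134:1419344|1413127_1413568_+|QAA89102.1|tail|DBSCAN-SWA")
print(plain.accession, plain.strand, plain.keyword)
print(labeled.accession, labeled.strand, labeled.keyword)
```

Keeping the keyword optional lets the same parser handle both labeled and unlabeled headers, which occur side by side in the protein lists.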
DBSCAN-SWA_4 |
1765356 : 1774027
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP034953|1765356:1774027|DBSCAN-SWA CATGCCATTTAAAAAACTCTCCCGACGCACGTTCCTGACGGCAAGCTCGGCGCTTGCCTTCCTCCATACCCCTTTCGCCCGCGCGCTTCCCGCCCGACAAAGCGTTAACATTAACGACTACAACCCACACGACTGGATCGCCTCATTTAAACAAGCCTTCAGCGAAGGGCAAACAGTCGTCGTGCCTGCCGGATTGGTTTGTGACAATATCAACACCGGCATCTTTATCCCTCCCGGTAAAACGTTACACATTCTTGGAAGCTTGCGCGGCAACGGCAGAGGGCGATTTGTCTTACAGGACGGCAGCCAGGTGACAGGGGAGGATGGCGGCAGTATGCATAACATCACCCTGGATGTGCGTGGTTCTGACTGCACCATCAAAGGGCTGACTATGAGCGGCTTTGGCCCGGTGACGCAGATTTATATCGGCGGCAAAAACAAACGGGTCATGCGCAACCTGATCATCGATAACCTGACCGTTAGCCACGCTAATTACGCCATCTTACGCCAGGGATTTCATAACCAGATTATCGGTGCCAACATCACCAATTGTAAGTTCAGCGACTTACAAGGCGACGCCATTGAATGGAACGTGGCAATTAACGACCGTGATATTTTGATATCTGACCATGTCATCGAGCGCATCAACTGTACCAACGGCAAAATCAACTGGGGCATCGGCATAGGCCTTGCGGGAAGCACTTATGATAACAACTACCCGGAAGACCAGGCAGTGAAAAACTTTGTCGTGGCGAATATCACGGGATCGGATTGTCGGCAGTTGATACATGTTGAAAATGGTAAACATTTTGTTATTCGTAATATCAAAGCCCGCAATATCACGCCGGATTTCAGTAAGAAAGCGGGCATTGATAACGCGACAGTCGCTATTTACGGTTGTGACAATTTCGTGATTGATAATATTGAAATGATTAATAGCGCCGGGATGTTAATCGGCTATGGGGTAATTAAAGGCAAATATCTCTCGATACCGCAAAATTTCCGGGTGAATGATATTCAACTGGATAATACTCATCTTGCTTATAAATTGCGCGGCATTCAAATCTCCGCCGGGAATGCTGTTTCCTTTGTGGCGCTAACTAACATTGAGATGAAGCGTGCGTCGCTGGAGTTACACAACAAACCGCAACATTTTTTTATGCGAAATATCAATGTGATGCAGGAATCCTCAGTTGGACCCGCATTGAGCATGAACTTTGACATGCGCAAAGACGTTCGTGGTGTTTTTATGGCGAAAAAAGAAACACTGCTGTCTCTTGCAAATGTTCATGCGGTGAATGAAAAAGGACAAAGCTCCGTCGATATCGACAGGATTAATCACCATATTGTTAATGTGGAGAAGATTAACTTTAGATTGCCGGAACTGAGGGAGTAGATTTGCGACCATTCCTGGAAAAATGGAGTCATACTTAGGAACAATGCTACTGCAATCCACAACGAAGCGGCGTAACATCACAAGTAATTCAGTAATCAATTCAGGGTAATTGATGCTGGCGAAAAAAATCGAACAAGCTATAATTCAGCAACCATTTTACAGGTGGATGAAATAATGACGAATTTAAAAGCAGTTATTCCTGTAGCGGGTCTCGGGATGCATATGTTGCCTGCCACTAAGGCGATACCCAAAGAGATGCTACCAATCGTCGACAAGCCAATGATTCAGTACATTGTTGACGAGATTGTGGCTGCAGGGATCAAAGAAATCCTCCTGGTAACTCACGCGTCCAAGAACGCGGTCGAAAACCACTTCGACACCTCTTATGAGTTAGAATCACTCCTTGAGCAGCGCGTGAAGCGTCAACTGCTGGCGGAAGTACAGTCCATCTGTCCGCCGGGCGTGACCATTATGAACGTGCGTCAGGGCGAACCTTTAGGTTTAGGCCACTCCATTTTGTGTGCGCGACC
TGCCATTGGTGACAACCCATTTGTCGTGGTACTGCCAGACGTTGTGATCGACGATGCCAGCGCCGACCCGCTACGTTACAACCTTGCTGCCATGATTGCACGTTTCAACGAAACGGGCCGCAGCCAGGTGCTGGCAAAACGTATGCCGGGTGACCTCTCTGAATACTCCGTCATCCAGACTAAAGAGCCGCTGGACCGTGAGGGTAAAGTCAGCCGCATTGTTGAATTTATCGAAAAACCGGATCAGCCGCAGACGCTGGACTCAGACATCATGGCCGTAGGTCGCTATGTGCTTTCTGCCGATATTTGGCCGGAACTGGAACGTACTCAGCCTGGTGCATGGGGACGTATTCAGCTGACTGATGCTATTGCCGAGCTGGCGAAAAAACAATCCGTTGATGCAATGCTGATGACCGGCGACAGTTACGACTGCGGCAAAAAAATGGGCTATATGCAGGCGTTTGTGAAGTATGGCCTACGCAACCTGAAAGAAGGGGCGAAGTTCCGTAAAGGTATTGAGAAGCTGTTAAGCGAATAATGAAAATCTGACCGGATGTAACGGTTGATAAGAAAATTATAACGGCAGTGAAAATTCGCAGCAAAAGTAATTTGTTGCGAATCTTCCTGCCGTTGTTTTATATAAACCATCAGAATAACAACGAGTTAGCAGTAGGGTTTTATTCAAAGTTTTCCAGGATTTTCCTTGTTTCCAGAGCGGATTGGTAAGACAATTAGCGTTTGAATTTTTCGGGTTTAGCGCGAGTGGGTAACGCTCGTCACATCATAGGCATGCATGCAGTGCTCTGGTAGCTGTAAAGCCAGGGGCGGTAGCGTGCATTAATACCTCTATTAATCAAACTGAGAGCCGCTTATTTCACAGCATGCTCTGAAGTAATATGGAATAAATTAAGTGAAAATACTTGTTACTGGTGGCGCAGGATTTATTGGTTCAGCTGTAGTTCGTCACATTATAAATAATACGCAGGATAGTGTTGTTAATGTCGATAAATTAACGTACGCCGGAAACCGGGAATCACTTGCTGATGTTTCTGATTCTGAACGCTATGTTTTTGAACATGCGGATATTTGCGATGCACCTGCAATGGCACGGATTTTTGCTCAGCATCAGCCGGATGCAGTGATGCACCTGGCTGCTGAAAGCCATGTTGACCGTTCAATTACAGGCCCTGCGGCATTTATTGAAACCAATATTGTTGGTACTTATGTCCTTTTGGAAGCCGCTCGCAATTACTGGTCTGCTCTTGATAGCGACAAGAAAAATAGCTTCCGTTTTCATCATATTTCTACTGACGAAGTCTATGGTGATTTGCCTCATCCAGATGAAGTAAATAATACAGAAGAATTACCCTTATTTACTGAGACGACAGCTTACGCGCCAAGCAGCCCTTATTCCGCATCCAAAGCATCCAGCGATCATTTAGTCCGCGCGTGGAAACGTACATATGGTTTACCGACAATTGTGACTAATTGCTCGAACAACTATGGTCCTTATCATTTCCCGGAAAAGCTTATTCCACTGGTTATTCTTAATGCACTGGAAGGTAAGGCATTACCTATTTATGGCAAAGGAGATCAGATCCGCGACTGGTTGTATGTTGAAGATCATGCGCGTGCGTTATATACCGTCGTAACCGAAGGTAAAGCGGGTGAAACTTATAACATTGGTGGGCACAACGAAAAGAAAAACATCGATGTAGTGCTCACTATTTGTGATTTGCTGGATGAGATTGTACCGAAAGAGAAATCTTATCGTGAGCAAATCACTTATGTTGCTGATCGTCCGGGACACGATCGCCGCTATGCTATTGATGCTGAGAAGATTGGTCGCGCATTGGGATGGAAACCACAGGAAACGTTTGAGAGCGGGATTCGTAAAACGGTGGAATGGTACCTGTCCAATACAAAATGGGTTGATAATGTGAAAAGTGGTGCCTATCAATCGTGGATTGAACAGAACTATGAGGGCCGCCAGTAATGAA
TATCCTCCTTTTTGGCAAAACAGGGCAGGTAGGTTGGGAACTACAGCGTGCTCTGGCACCTTTGGGTAATTTGATTGCTTTTGATGTTCACTCTACTGATTATTGCGGTGATTTTAGTAATCCTGAAGGTGTAGCTGAAACCGTAAGAAGCATTCGGCCGGATATTATTGTCAATGCAGCCGCTCACACCGCAGTAGACAAAGCAGAATCAGAACCGGAGTTTGCACAATTAATTAACGCAACAAGTGTCGAAGCGATTGCGAAAGCAGCAAATGAAGTTGGAGCCTGGGTTATCCATTACTCGACTGATTACGTCTTCCCTGGAAATGGCGATATGCCATGGCTGGAGACGGATGCAACCGCACCACTAAATGTTTACGGTGAAACCAAGTTAGCCGGAGAAAAAGCGTTACAGGAATATTGCGCGAAGCATCTTATTTTCCGGACCAGCTGGGTCTATGCAGGAAAAGGAAATAACTTCGCCAAAACGATGTTACGTCTGGCAAAAGAGCGTGAAGAATTAGCGGTTATTAACGATCAGTTTGGTGCGCCAACAGGTGCTGAACTGCTGGCTGATTGTACAGCACATGCCATTCGTGTCGCACTGAATAAACCGGATGTCGCAGGCTTGTACCATTTGGTAGCCAGTGGTACCACAACCTGGTACGATTATGCTGCGCTGGTTTTTGAAGAGGCGCGCAAAGCAGGCATTCCCCTTGCACTCAACAAGCTCAACGCAGTACCAACAACAGCCTATCCTACACCAGCTCGTCGTCCACATAACTCTCGCCTTAATACAGAAAAATTTCAGCAGAACTTTGCGCTTGTCTTGCCTGACTGGCAGGTTGGCGTGAAACGAATGCTCAATGAATTATTTACGACTACAGCAATTTAATAGTTTTTGCATCTTGTTCGTGATGGTGGAGCAAGATGAATTAAAAGGAATGATGAAATGAAAATGCGTAAAGGTATTATTTTAGCGGGTGGTTCTGGTACACGTCTTTATCCTGTGACTATGGCTGTCAGTAAACAGCTATTACCTATTTATGATAAACCGATGATCTATTACCCGCTCTCTACACTGATGTTGGCGGGTATTCGCGATATTTTGATTATCAGTACACCTCAGGATACTCCTCGTTTTCAACAATTGCTGGGTGACGGTAGCCAGTGGGGCCTGAATCTTCAGTACAAAGTGCAACCTAGCCCAGATGGCCTCGCGCAGGCATTTATCATCGGTGAAGAGTTTATTGGTGGTGATGATTGTGCTTTGGTTCTTGGTGATAATATCTTTTACGGTCACGATCTGCCGAAGCTAATGGAGGCCGCTGTTAACAAAGAAAGTGGTGCAACGGTATTTGCCTATCACGTTAATGATCCAGAACGCTATGGTGTCGTTGAGTTTGATAAAAACGGTACGGCAATCAGTCTGGAAGAAAAACCGTTAGAACCAAAGAGTAATTACGCCGTTACAGGTCTGTACTTTTATGATAACGACGTGGTTCAGATGGCGAAAAACTTGAAGCCGTCTGCACGTGGTGAGTTAGAAATTACAGATATTAACCGTATTTATCTTGAGCAGGGACGTCTGTCTGTCGCGATGATGGGGCGTGGCTACGCGTGGCTGGACACGGGGACTCATCAGAGTCTGATAGAAGCAAGTAATTTTATTGCGACAATTGAAGAGCGCCAGGGATTGAAGGTTTCCTGTCCTGAAGAGATTGCATTTCGTAAAGGTTTTATTGATGTTGAGCAAGTAAGAAAATTAGCTGTACCACTAATAAAGAATAATTATGGGCAGTATCTTTATAAAATGACGAAGGATTCAAATTAATGAATGTGATTAGAACTGAAATTGAAGATGTGCTAATTCTGGAGCCAAGAGTATTTGGTGATGATAGAGGTTTCTTTTATGAGAGCTTTAATCAATCAGCATTTGAACATATTCTAGGCTATCCGGTCAGCTTTGTTCAAGACAATCACTCACGTTCATCAAAAAA
TGTACTCAGAGGCCTTCACTTTCAACGCGGCGAGTACGCACAAGATAAACTTGTACGCTGCACTCATGGAGCAGTTTTTGATGTTGCTGTTGATATTCGACCCAATTCGGTATCCTTTGGTAAATGGGTTGGTGTTCTGCTTTCAGCTGATAATAAGCAGCAGTTGTGGATACCAAAAGGGTTTGCTCATGGCTTTTTGGTTCTGTCTGATATCGCTGAATTTCAATATAAAACTACAAACTATTATCATCCTGAAAGCGATTGTGGAATATGTTGGAATGATGAACGCATTGCAATTGATTGGCCCCAAACATCAGGGTTAATCCTTTCGCCAAAAGATGAAAGGCTCTTTACGTTAGATGAGCTTATCAGATTAAAATTAATTGCATGAATACGAATAAATTATCTTTAAGAAGAAACGTTATATATCTGGCTGTCGTTCAAGGTAGCAATTATCTTTTACCATTGCTTACATTTCCATATCTTGTAAGAACACTTGGTCCTGAAAATTTCGGTATATTCGGTTTTTGCCAAGCGACTATGCTATATATGATAATGTTTGTTGAATATGGTTTCAATCTCACAGCAACTCAGAGTATTGCCAAAGCAGCAGATAGTAAAGATAAAGTAACGTCTATTTTTTGGGCGGTGATATTTTCAAAAATAGTTCTTATCGTCATTACATTGATTTTCTTAACGTCGATGACCTTGCTTGTTCCTGAATATAACAAGCATGCCGTAATTATATGGTCGTTTGTTCCTGCATTAGTCGGGAATTTAATCTACCCTATCTGGCTGTTTCAGGGAAAAGAAAAAATGAAATGGCTGACTTTAAGTAGTATTTTATCCCGCTTGGCTATTATCCCTCTAACATTTATTTTTGTGAACACAAAGTCAGATATAGCAATTGCCGGTTTTATTCAGTCAAGTGCAAATCTGGTTGCTGGAATTATTGCACTAGCTATCGTTGTTCATGAAGGTTGGATTGGTAAAGTTACGCTATCATTACATAATGTGCGTCGATCTTTAGCAGACGGTTTTCATGTTTTTATTTCCACATCTGCTATTAGTTTATATTCTACGGGAATAGTTATTATCCTGGGATTTATATCTGGACCAACGTCCGTAGGGAATTTTAATGCGGCCAATACTATAAGAAACGCGCTTCAAGGGCTATTAAATCCTATCACCCAAGCAATATACCCAAGAATATCAAGTACGCTTGTTCTTAATCGTGTGAAGGGTGTGATTTTAATTAAAAAATCATTGACCTGCTTGAGTTTGATTGGTGGTGCTTTTTCATTAATTCTGCTCTTGGGTGCATCTATACTAGTAAAAATAAGTATAGGGCCGGGATATGATAATGCAGTGATTGTGCTAATGATTATATCGCCTCTGCCTTTTCTTATTTCATTAAGTAATGTCTATGGCATTCAAGTTATGCTGACCCATAATTATAAGAAAGAATTCAGTAAGATTTTAATCGCTGCGGGTTTGTTGAGTTTGTTGTTGATTTTTCCGCTAACAACTCTTTTTAAAGAGATTGGTGCAGCAATAACATTGCTTGCAACAGAGTGCTTAGTTACGTCACTCATGCTGATGTTCGTAAGAAATAATAAATTACTGGTTTGCTGAGGATTTTATGTACGATTATATCATTGTTGGTTCTGGTTTGTTTGGTGCCGTTTGTGCGAATGAGTTAAAAAAGCTAAACAAAAAAGTTTTAGTGATTGAGAAAAGAAATCATATCGGTGGAAATGCGTACACAGAGGACTGTGAGGGTATCCAGATTCATAAATATGGTGCACATATTTTTCATACCAATGATAAATATATATGGGATTACGTTAATGATTTAGTAGAATTTAATCGTTTTACTAATTCTCCACTGGCGATTTATAAAGACAAATTATTCAACCTTCCTTTTAATATGAATACTTTCCACCAAATGTGGGGAGTTAAAGATCCTCAAGAAGCTCAAAATATCATTAATGCTCAGA
AAAAAAAGTATGGTGACAAGGTACCTGAAAATTTGGAGGAGCAGGCGATTTCATTAGTTGGGGAGGACTTATACCAAGCATTGATAAAGGGTTATACGGAGAAGCAGTGGGGAAGAAGTGCAAAAGAATTGCCTGCATTTATTATTAAGCGAATCCCAGTGAGATTTACGTTTGATAACAATTATTTTTCCGATCGCTATCAAGGTATTCCGGTGGGAGGCTACACTAAGCTTATTGAAAAAATGCTTGAAGGTGTGGACGTAAAATTAGGCATTGATTTTTTGAAAGACAAAGATTCTCTAGCGAGTAAAGCCCATAGAATCATCTACACTGGACCCATTGATCAGTACTTCGACTATAGGTTTGGAGCGTTAGAATATCGCTCTTTAAAATTTGAGACGGAACGCCATGAATTTCCAAACTTCCAAGGGAATGCAGTAATAAATTTCACTGATGCTAATGTACCATATACCAGAATAATTGAGCATAAACATTTTGACTATGTTGAGACAAAGCATACGGTTGTTACAAAAGAATATCCATTAGAGTGGAAAGTTGGCGACGAACCCTACTATCCAGTTAATGATAATAAAAACATGGAGCTTTTTAAGAAATATAGAGAGTTAGCTAGCAGAGAAGACAAGGTTATATTTGGCGGGCGTTTGGCCGAGTATAAATATTATGATATGCATCAAGTGATATCTGCCGCTCTTTATCAAGTGAAAAATATAATGAGTACGGATTAA
Protein sequences of DBSCAN-SWA_4 >CP034953|1765356:1774027|1766925_1767819_+|QAA89394.1|DBSCAN-SWA MTNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEILLVTHASKNAVENHFDTSYELESLLEQRVKRQLLAEVQSICPPGVTIMNVRQGEPLGLGHSILCARPAIGDNPFVVVLPDVVIDDASADPLRYNLAAMIARFNETGRSQVLAKRMPGDLSEYSVIQTKEPLDREGKVSRIVEFIEKPDQPQTLDSDIMAVGRYVLSADIWPELERTQPGAWGRIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKGIEKLLSE >CP034953|1765356:1774027|1770233_1771115_+|QAA89397.1|DBSCAN-SWA MKMRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFIGGDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDKNGTAISLEEKPLEPKSNYAVTGLYFYDNDVVQMAKNLKPSARGELEITDINRIYLEQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAFRKGFIDVEQVRKLAVPLIKNNYGQYLYKMTKDSN >CP034953|1765356:1774027|1768191_1769277_+|QAA89395.1|DBSCAN-SWA MKILVTGGAGFIGSAVVRHIINNTQDSVVNVDKLTYAGNRESLADVSDSERYVFEHADICDAPAMARIFAQHQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEAARNYWSALDSDKKNSFRFHHISTDEVYGDLPHPDEVNNTEELPLFTETTAYAPSSPYSASKASSDHLVRAWKRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKALPIYGKGDQIRDWLYVEDHARALYTVVTEGKAGETYNIGGHNEKKNIDVVLTICDLLDEIVPKEKSYREQITYVADRPGHDRRYAIDAEKIGRALGWKPQETFESGIRKTVEWYLSNTKWVDNVKSGAYQSWIEQNYEGRQ >CP034953|1765356:1774027|1772923_1774027_+|QAA89400.1|DBSCAN-SWA MYDYIIVGSGLFGAVCANELKKLNKKVLVIEKRNHIGGNAYTEDCEGIQIHKYGAHIFHTNDKYIWDYVNDLVEFNRFTNSPLAIYKDKLFNLPFNMNTFHQMWGVKDPQEAQNIINAQKKKYGDKVPENLEEQAISLVGEDLYQALIKGYTEKQWGRSAKELPAFIIKRIPVRFTFDNNYFSDRYQGIPVGGYTKLIEKMLEGVDVKLGIDFLKDKDSLASKAHRIIYTGPIDQYFDYRFGALEYRSLKFETERHEFPNFQGNAVINFTDANVPYTRIIEHKHFDYVETKHTVVTKEYPLEWKVGDEPYYPVNDNKNMELFKKYRELASREDKVIFGGRLAEYKYYDMHQVISAALYQVKNIMSTD >CP034953|1765356:1774027|1765356_1766751_+|QAA89393.1|DBSCAN-SWA 
MPFKKLSRRTFLTASSALAFLHTPFARALPARQSVNINDYNPHDWIASFKQAFSEGQTVVVPAGLVCDNINTGIFIPPGKTLHILGSLRGNGRGRFVLQDGSQVTGEDGGSMHNITLDVRGSDCTIKGLTMSGFGPVTQIYIGGKNKRVMRNLIIDNLTVSHANYAILRQGFHNQIIGANITNCKFSDLQGDAIEWNVAINDRDILISDHVIERINCTNGKINWGIGIGLAGSTYDNNYPEDQAVKNFVVANITGSDCRQLIHVENGKHFVIRNIKARNITPDFSKKAGIDNATVAIYGCDNFVIDNIEMINSAGMLIGYGVIKGKYLSIPQNFRVNDIQLDNTHLAYKLRGIQISAGNAVSFVALTNIEMKRASLELHNKPQHFFMRNINVMQESSVGPALSMNFDMRKDVRGVFMAKKETLLSLANVHAVNEKGQSSVDIDRINHHIVNVEKINFRLPELRE >CP034953|1765356:1774027|1769276_1770176_+|QAA89396.1|DBSCAN-SWA MNILLFGKTGQVGWELQRALAPLGNLIAFDVHSTDYCGDFSNPEGVAETVRSIRPDIIVNAAAHTAVDKAESEPEFAQLINATSVEAIAKAANEVGAWVIHYSTDYVFPGNGDMPWLETDATAPLNVYGETKLAGEKALQEYCAKHLIFRTSWVYAGKGNNFAKTMLRLAKEREELAVINDQFGAPTGAELLADCTAHAIRVALNKPDVAGLYHLVASGTTTWYDYAALVFEEARKAGIPLALNKLNAVPTTAYPTPARRPHNSRLNTEKFQQNFALVLPDWQVGVKRMLNELFTTTAI >CP034953|1765356:1774027|1771668_1772916_+|QAA89399.1|DBSCAN-SWA MNTNKLSLRRNVIYLAVVQGSNYLLPLLTFPYLVRTLGPENFGIFGFCQATMLYMIMFVEYGFNLTATQSIAKAADSKDKVTSIFWAVIFSKIVLIVITLIFLTSMTLLVPEYNKHAVIIWSFVPALVGNLIYPIWLFQGKEKMKWLTLSSILSRLAIIPLTFIFVNTKSDIAIAGFIQSSANLVAGIIALAIVVHEGWIGKVTLSLHNVRRSLADGFHVFISTSAISLYSTGIVIILGFISGPTSVGNFNAANTIRNALQGLLNPITQAIYPRISSTLVLNRVKGVILIKKSLTCLSLIGGAFSLILLLGASILVKISIGPGYDNAVIVLMIISPLPFLISLSNVYGIQVMLTHNYKKEFSKILIAAGLLSLLLIFPLTTLFKEIGAAITLLATECLVTSLMLMFVRNNKLLVC >CP034953|1765356:1774027|1771114_1771672_+|QAA89398.1|DBSCAN-SWA MNVIRTEIEDVLILEPRVFGDDRGFFYESFNQSAFEHILGYPVSFVQDNHSRSSKNVLRGLHFQRGEYAQDKLVRCTHGAVFDVAVDIRPNSVSFGKWVGVLLSADNKQQLWIPKGFAHGFLVLSDIAEFQYKTTNYYHPESDCGICWNDERIAIDWPQTSGLILSPKDERLFTLDELIRLKLIA |
8 | Enterobacteria_phage(28.57%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_5 |
2220778 : 2230742
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP034953|2220778:2230742|DBSCAN-SWA GTTATATTTTTTCGATCTCGACCAGATTAGTGTGCTGCGGGTTTCCCTTCGCCAGTGGTGAAGGGCGCAGAGTGGTTAGCGTATTCACACAGCCGCCATGGTCGATTTTATCGCCAGACATATTGGCCTCGTGCCAGGCTCCCTGGCCCATAGCGCTAACTCCAGGGAGAATACGTGGTGTTACTTTGGCTGGTAGCCGAACTTCGCCACGATGGTTAAACACCCGCACCATATCGCCGTTGGCAATCCCACGTTTCTGCGCATCTATAGGGTTGATCCACACCTCCTGACGGCAGGCAGCCTTCAGGAGATCAATATTGCCGTAGGTCGAGTGAGTACGGGATTTGTAATGGAAACCAAACAGTTGCAGTGGGAAGGTTCTACGTTCAGGGGAGTTCCAGCCTTCAAAGGTTGAGGCATAAACTGGCAATGGGCTTATCACTTCATCTTTTTCCAGTTCCCAGGTACGGGCAATTTCCGCCAGCCTGCTGGAATAAATTTCAATCTTACCGGAAGGCGTTTTAAGTGGATTTGCCTCGGGGTCGTCACGAAATGCTTTGTAGGCGACAAAATGGCCATTGGGATCTTTACGCTTATAGATACCCATTTTTTTCAGTTCGTCGTAAGACGGTAACGCCGGATCTTTGGCAAGCATTTTGGCGTACAGATGTTGTAACCATTGTTCCTGCGTGCGACCTTCTGTGAACTTTTGATAGACGTCAGGTCCAAGACGTTTCGCGACTTCACTCAGGATCCAGTAAATCGGTTTGCGTTCGAATTTTTCGCTGGTGACAGGCTGGAGGAAAATGAGATATCCCATGTTACCGGCGTAGTCGTTAGGAATAATATCTTCCTGCTCAACGGTCATCAGGTCTGGCAGCAGAATGTCGGCATATTTTGCCGATGAGGTCATAAAGTTTTCGATGACCACAATCATTTCGCATTTCGATTCGTCCTGCAGAATTTCATGCGTTTTGTTGATGTCAGAATGCTGATTAACGAGGGTATTTCCCGCGTAGTTCCAGATGAACTTAATGGGCACATCCAGTTTATCTTTGCCGCGGACGCCGTCGCGGATTGCCGTCATTTGCGGACCATGATCGATAGCATCTGTCCAGCTGAAGCAGGAGATTGACGTTTTGACCGGATTATCCAGCACCGGCAGGCGTTCTATGGTAATGGTATAGGTCGATTCACGCGCGCCACTATTTCCGCCGCTGATGCCGACATTGCCCGTCAAAATAGGTAACATAGCAATAGCGCGTGCAGTCAGTTCGCCGTTTGCCTGGCGTTGTGGCCCCCAGCCCTGGCAGATATAAGCGGGTTTTGCTGTGCCAATTTCACGCGCCAGTTTGATGATACGGTCCTCCGGGATACCGGTAATTTGCGAAGCCCACTGCGGCGTTTTCGCTGTTTTATCGTCACCTTCACCAAGAATATAGGCTTTATAGTGACCATTTTTGGGTGCATCTGCGGGTAAGGTTTTTTCGTCATAGCCGACGCAGTATTTATCGAGAAAAGGTTGATCAACGAGATTTTCGTTAATCAATACCCAGGCAATACCCGCAACCAGCGCGGCATCGGTGCCCGGGCGAATAGGGAGCCATTCGTCTTCACGACCGGCAGCCGTATCGGTATATCGCGGATCGATAACAATCATTTTGGCGTTCGATTTCTCGCGCGCTTTTTCAAGAAGATAAGTGATGCCACCACCGCTCATGCGGGTTTCTGCCGGGTTGTTACCAAACATCACGACCAGCTTGCTGTTTTCAATATCCGTGGTGCTGTTGCCATCATTACTGCCGTAGGTGTAGGGCATGGCACAGGAAATTTGCGCAGTGCTGTAGGAGCCATACTGGTTGAGTGAACCGCCGTAGCAGTTCATCAGGCGTTTGACCGCCGAGGCTGATGGCGAAGAGCGGGTCA
TATTGCCGCCAACGATCCCCGAAGAGTACTGAATATATACAGCCTCATTGCCATATTGTTCGACGGTTTTCTTCAGGCTACTGGCGATAGTATCCAGGGCTTCATCCCAGCTAATCCGTTCGAATTTGCCTTCGCCGCGTTTGCCCACGCGTTTCATTGGGTAATTCAAGCGATCGGGATGATTAATACGCCGGCGGATGGAGCGACCGCGCAAACAGGCGCGTACCTGATGGTTGCCGTACTCATCGCTGCCGGTATTGTCAGTTTCCACCCAGGTCACTTCATTATCTTTAACATGTAGACGAAGTGCACAGCGGCTACCACAGTTGACGGAACAGGCACCCCAGACCACTTTTTCGCGGGCCTGTTGTACCGCTGCTGCTGCATTGCGCAGGGTAAACGGCAAAGAAAAACCGCCTGCAGCCAGCGCCAGAGAACCTATCGCGGTAGATTTAACGAGTGTTCTGCGGCTGATGCCCACCATTCGTTCATTTTTGGACATAACTCACTCCCTGTTCTTTATCGTTATATAAAAGTTTATATATTGAATATTTAGCGCGCTAACAATAGAGGGAGTCTACCCATTTTGGGTTAAGAATTATTAATCCATATCAATAGAAGGGTATGAGTAATAAGGTGGGATTATGTTGTATGTTCAAATCGCCGGATTTGTCATATCCGGCGTTCAGTCGATAATGTGTTACTGCGGTTCGGCAGGCGCGCCATCCTGGCTAGACTGCGCGGGAGCAGAGACGTTACCGCTGGTGGTGCGGGTATAGAGAATTTTATGCGTATCATTAGCGCAATGGCCGACGACCTGGGAATCAGGCTGATCAACCTGGTCATTGGGTACAATACTTAACGTGAAGCTGCTTTCGGGTACGCCATTATTGATAATGCGCTGTGATATATCGCTCTGTATGCGCTCACAGGATCCCGGCGCGGCGAGTACCGCGGGTGAGGCGAGGGCGAGCAGAAGCGCGGCACAGCAGGTTGAGAGTTTCATCATAAGCTCCTTACGCGAAGATAACTTCTTTAAGCATAGCATTTAACGTGTAAAGTACTGTATTTGCTACTATGATTGAGAATCATCTCTACTCTCTGGTGACTGTTGTGAAATACAAATTACTACCATGCTTACTCGCGATATTCCTCACAGGATGTGACCGCACAGAGGTAACACTTTCATTTACCCCTGAGATGGCCAGTTTCTCTAATGAATTCGATTTTGATCCGCTGCGTGGTCCGGTAAAAGATTTCACTCAGACATTAATGGATGAGCAAGGTGAAGTGACGAAACGTGTTTCTGGGACTTTGTCGGAAGAAGGCTGTTTTGATTCACTCGAATTACTGGATCTGGAAAATAATACCGTGGTCGCTCTGGTACTGGACGCCAATTATTACCGTGATGCCGAGACGCTGGAGAAGAGAGTACGTTTACAGGGAAAATGCCAGCTAGCAGAATTACCTTCTGCCGGGGTGAGTTGGGAAACCGATGATAATGGCTTCGTGATTAAAGCCAGCAGCAAACAAATGCAGATGGAATATCGCTATGATGATCAGGGTTATCCGCTGGGTAAAACCACGAAAAGTAACGACAAAACATTATCTGTCAGCGCCACGCCATCAACGGATCCGATCAAAAAATTAGATTACACAGCGGTTACTTTACTGAATAATCAACGGGTTGGTAATGTAAAACAGAGCTGTGAATATGACAGTCACGCTAATCCGGTGGACTGTCAGCTAATCATTGTTGATGAAGGAGTAAAACCCGCCGTCGAACGGGTTTACACCATCAAAAATACGATCGATTATTATTAATGCTATTGTGCGGTCGGCTTCAGGAGAGTCTGACCCGGTGTTTTGTGCTCTGCCAGATACTGATGCTGGAATATACACATGCGAATGGCATTACGATATTGACCATTAATAAAGAACTCGTGCATCAATTCACCTTCAACCGAAAAGCCAAGCTTGCGGTAAATGTGAATCGCTTT
TTCATTCTCTTTATCAACGATCAGATACAGCTTATAGAGATTGAGAACGGTAAAGCCATAGTCCATTGCTAATTTGGCGGCACGGGTTGCCAGACCTTTCCCCTGATACTCCGGGGAGATAATTATCTGAAATTCTGCGCGGCGATGAACATGGTTAATTTCCACCAGCTCCACCAGACCGGCTTTTTCGCCGTCACATTCCACCACAAAGCGCCGTTCGCTCTGATCGTGAATATGCTTATCATACAGATCAGAGAGTTCAACAAAGGCTTCGTAGGGTTCCTCAAACCAGTAACGCATCACACTGGCGTTATTGTCGAGTTGATGTACATAGCGTAAATCTTCACGCTCCAGCGGGCGTAGCTTAACACTGTGGGCGCTTGGCATAACGTGTCCTTACATTCCTTAAATCAATAACAGGTTAGGGGGTAATAACGCGGCCAGTTCGACGGTCCAGGCAGCGCAAAGTATTGGGCTCCCAGTAGGCATTGATGTTGGCGCTTTGCTCACATTTATCGCGGTTATCAAAAGCGGCGTCGGCTTTATCCCACTCTTTTTCAGTGCGTTTATTCACTTTCTGGCGCAGATTGCGCGTGTCATTCCATTGCTCTTTTTCCATAGCGGCGTGCTGGCGGCTTTGTGCACTGTCGCCAGACTCAATCACCAGTTTGTTAGTTTCGGCATGAACAGTTGTGCTCAATGCCAGTGCGCAAGGCAGCAGAATAGCGAGCAGGCCGATTCGTTTGCTGAGAGTGATTTTCATAATTCATTCCCTGTATGAATGATTAAAGGTGATTCTACACCATCCACTGCGGACGCAAAACGTACCAGGAGGGTGTTTATATTGATGATATTATGTCGCCCTATAACTATACATGATGTCAATAAGAGACAAAGATGATTAAAACAACGTTACTATTTTTTGCTACTGCGCTGTGTGAAATTATTGGATGCTTTCTGCCCTGGTTGTGGTTAAAACGAAACGCCAGTATCTGGCTGTTGCTTCCGGCGGGGATTTCACTGGCGCTGTTTGTCTGGTTGTTAACGTTGCATCCAGCGGCGAGTGGGCGTGTTTACGCGGCTTATGGTGGCGTTTATGTCTGCACGGCGTTGATGTGGCTGCGCGTTGTGGATGGCGTGAAACTGACTCTTTATGACTGGACGGGTGCGTTGATTGCGCTTTGCGGCATGTTGATCATTGTTGCGGGCTGGGGGCGCACGTAGGAACATAAATCCATTTTATCAATAAGATAAGAGGAAGTGTCAGCTGACAAAAGGTATTCTATTTCATCTTTTGTCAACCATTCACAGCGCAAATATACGCCTTTTTTTGTGATCACTCCGGCTTTTTTCGATCTTTATACTTGTATGGTAGTAGCTCAGTTGCGTAGATTTCATGCATCACGACAAGCGATGCAAGGAATCGAACATGAAGATCGTAAAGGCTGAAGTTTTTGTTACCTGTCCGGGGCGTAATTTCGTCACATTAAAAATCACCACTGAGGACGGTATTACGGGCCTTGGGGATGCCACCCTCAATGGACGTGAGCTTTCCGTGGCCTCTTATTTGCAGGATCACCTTTGTCCGCAGCTTATTGGTCGCGATGCGCACCGTATCGAAGATATCTGGCAGTTTTTCTATAAAGGTGCTTACTGGCGTCGCGGTCCGGTTACGATGTCGGCCATTTCAGCGGTTGATATGGCGCTGTGGGATATTAAAGCCAAAGCTGCCAACATGCCGCTTTACCAGTTACTCGGCGGCGCGTCTCGTGAAGGGGTGATGGTTTATTGCCATACCACCGGTCACAGTATTGATGAAGCTCTGGATGATTATGCCCGTCATCAAGAGCTTGGATTCAAAGCCATCCGCGTGCAGTGCGGAATCCCTGGTATGAAAACCACCTACGGCATGTCGAAAGGTAAAGGTCTGGCTTATGAACCCGCAACCAAAGGACAGTGGCCGGAAGAGCAGCTGTGGTCGACGGAGAAATACCTCGATT
TCATGCCGAAATTGTTTGACGCGGTACGTAACAAGTTTGGTTTTAATGAACATTTGCTGCATGACATGCACCATCGCTTAACGCCTATTGAAGCGGCGCGCTTTGGTAAAAGCATTGAAGATTATCGCATGTTCTGGATGGAAGACCCGACGCCTGCGGAAAACCAGGAATGCTTCCGTCTCATTCGCCAACATACCGTCACACCCATCGCAGTGGGTGAAGTCTTCAACAGCATCTGGGACTGCAAACAACTGATTGAAGAGCAACTCATCGATTATATCCGCACCACGCTGACCCATGCAGGCGGAATTACCGGTATGCGCCGGATTGCCGATTTTGCTTCGCTGTATCAGGTACGTACTGGCTCACACGGTCCTTCCGATTTGTCACCAGTCTGCATGGCTGCGGCGCTGCACTTTGATCTGTGGGTCCCCAATTTCGGTGTCCAGGAATACATGGGTTATTCCGAACAAATGCTCGAAGTCTTCCCGCACAACTGGACTTTCGATAACGGCTATATGCATCCGGGAGACAAACCGGGTCTTGGTATCGAATTCGATGAAAAGCTGGCGGCGAAATATCCCTATGAACCTGCTTATCTACCAGTCGCACGTCTGGAAGATGGCACGCTGTGGAACTGGTAAGGAGTAAGATAATGAAAAGCATATTAATTGAAAAACCGAATCAACTGGCGATTGTCGAACGTGAAATACCCACCCCGTCAGCGGGTGAAGTACGAGTAAAAGTGAAACTTGCCGGAATTTGTGGTTCAGATAGCCATATTTATCGTGGGCATAATCCTTTTGCGAAATATCCGCGCGTCATTGGTCATGAATTCTTTGGCGTCATTGATGCAGTGGGTGAAGGCGTGGAAAGCGCCAGAGTCGGTGAACGTGTTGCTGTCGATCCGGTGGTCAGCTGTGGGCATTGCTATCCGTGCTCTATAGGTAAACCGAACGTTTGTACGACACTGGCTGTATTAGGTGTGCACGCTGACGGTGGTTTCAGTGAATATGCCGTGGTTCCGGCAAAAAATGCGTGGAAAATTCCTGAAGCAGTGGCCGATCAATATGCGGTAATGATCGAACCTTTTACCATTGCGGCTAACGTAACCGGACATGGTCAACCGACTGAAAATGATACCGTTCTGGTTTATGGTGCCGGTCCAATCGGCCTGACGATCGTTCAGGTATTAAAAGGCGTCTATAACGTTAAAAATGTGATTGTTGCCGATCGCATTGATGAACGACTGGAAAAAGCGAAAGAGAGCGGGGCTGACTGGGCGATTAATAACAGCCAGACACCGCTTGGCGAGATTTTCACTGAAAAAGGCATCAAGCCGACATTAATTATCGATGCGGCTTGTCATCCTTCTATCCTGAAAGAGGCCGTAACGCTGGCTTCTCCAGCGGCACGTATTGTATTGATGGGGTTCTCCAGTGAACCGTCTGAAGTGATTCAGCAAGGAATTACCGGAAAAGAACTCTCTATTTTCTCTTCACGCTTAAATGCAAATAAATTCCCGATCGTTATCGACTGGTTAAGTAAAGGGTTAATTAAACCAGAAAAATTAATTACCCATACGTTTGATTTCCAGCATGTTGCTGATGCCATTAGTTTATTTGAACAGGATCAAAAACATTGCTGCAAAGTCTTACTCACTTTTTCTGAATAATACCAATAACGGCGAGTAAGTAGTACGCATCTTACCTCTTTTTTAGAGATAACCATTATGACAATAGAAAAACACGAAAGAAGCACTAAGGATTTGGTGAAAGCAGCAGTATCGGGATGGCTGGGCACTGCGCTTGAATTTATGGATTTCAAGAGTCATGCGTGTTAACTATTTGATAAATATTAAATTAATTTTTCATTGCTTCGTTATGGGGCATGGTTGGGGCAAACTCGCTTAACTGTGTATTTAACAAAGCTACCTGTGCATTATTGTTTTCAGACATCCATTTTCCGTATACCTGAAATACCATTTGCGCATCTGCATG
GCCCATCTGGTTTGCTATAAATGCCGGGTTAGCACCAGCTGTCAGCGACCAGCAGGCATAAGTATGTCTCGACTGATATGATTTTCGATGGCGGAGTCCGGCACGTTTTATCGCTGCGTCCCACATCTGCCTTATTGAGTCAACGGTAAAATGGTCACCATAATTTTTTACTCTCGCTGACACTTCAGGTTGAAAAACAAAGGTGCATTTTTGTTTTTCTGTTCTGCCATACTCTCTGAGGTGAACATCAATGATATGCTCTTTGCTCAGTCTCGTTAATGTCATCTGACTCCGGAGAGCGTCGATTGCTGGCTTAATAAGATGAATGACCCGATTGGTTCCCGCCTGTGTTTTTGGTACCGTGAAACGGTCTTTTGCTAAATTTCTCCTGATCATCATTGTTCCATTTTTCAGATCTATGTCCTCCCATCCAAGTGCACACAGCTCACCAGGGCGAACTCCAGTATAAACAGAAACACACCATAAATTTTTTGCTTGCTGATTTCTGCACGCATCGATAAGACGGATAAATTCTTCCCGCGAAAGAGGATCCGGAATGGTTCTTGATTCCTTTAATGGCGAGATCCCCTTAAACGGATTATCTGCCAGGTAACCGTTATCAACACCAAACTGGAACACGGCGTTAAGATTTGTCATGTAATTATTTACAGTTACAGCCGATCTCCCTGGTTGTGTAACAATATAGTTACTTTTGGGGATTAGACTGGCCCCCTGAATCTCCAGACAACCAATATCACTTAAATAAGTGATAGTCTTAATACTAGTTTTTAGACTAGTCATTGGAGAACAGATGATTGATGTCTTAGGGCCGGAGAAACGCAGACGGCGTACCACACAGGAAAAGATCGCAATTGTTCAGCAGAGCTTTGAACCGGGGATGACGGTCTCCCTCGTTGCCCGGCAACATGGTGTAGCAGCCAGCCAGTTATTTCTCTGGCGTAAGCAATACCAGGAAGGAAGTCTTACTGCTGTCGCCGCCGGAGAACAGGTTGTTCCTGCCTCTGAACTTGCTGCCGCCATGAAGCAGATTAAAGAACTCCAGCGCCTGCTCGGCAAGAAAACGATGGAAAATGAACTCCTCAAAGAAGCCGTTGAATATGGACGGGCAAAAAAGTGGATAGCGCACGCGCCCTTATTGCCCGGGGATGGGGAGTAAGCTTAGTCAGCCGTTGTCTCCGGGTGTCGCGTGCGCAGTTGCACGTCATTCTCAGACGAACCGATGACTGGATGGATGGCCGCCGCAGTCGTCACACTGATGATACGGATGTGCTTCTCCGTATACACCATGTTATCGGAGAGCTGCCAACGTATGGTTATCGTCGGGTATGGGCGCTGCTTCGCAGACAGGCAGAACTTGATGGTATGCCTGCGATCAATGCCAAACGTGTTTACCGGATCATGCGCCAGAATGCGCTGTTGCTTGAGCGAAAACCTGCTGTACCGCCATCGAAACGGGCACATACAGGCAGAGTGGCCGTGAAAGAAAGCAATCAGCGATGGTGCTCTGACGGGTTCGAGTTCTGCTGTGATAACGGAGAGAGACTGCGTGTCACGTTCGCGCTGGACTGCTGTGATCGTGAGGCACTGCACTGGGCGGTCACTACCGGCGGCTTCAACAGTGAAACAGTACAGGACGTCATGCTGGGAGCGGTGGAACGCCGCTTCGGCAACGATCTTCCGTCGTCTCCAGTGGAGTGGCTGACGGATAATGGTTCATGCTACCGGGCTAATGAAACACGCCAGTTCGCCCGGATGTTGGGACTTGAACCGAAGAACACGGCGGTGCGGAGTCCGGAGAGTAACGGAATAGCAGAGAGCTTCGTGAAAACGATAAAGCGTGACTACATCAGTATCATGCCCAAACCAGACGGGTTAACGGCAGCAAAGAACCTTGCAGAGGCGTTCGAGCATTATAACGAATGGCATCCGCATAGTGCGCTGGGTTATCGCTCGCCACGGGAATATCTGCGGCAGCGGGC
TTGTAATGGGTTAAGTGATAACAGATGTCTGGAAATATA
Protein sequences of DBSCAN-SWA_5 >CP034953|2220778:2230742|2225600_2225927_+|QAA89823.1|DBSCAN-SWA MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLTLHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLTLYDWTGALIALCGMLIIVAGWGRT >CP034953|2220778:2230742|2228435_2228546_+|QAA89826.1|DBSCAN-SWA MTIEKHERSTKDLVKAAVSGWLGTALEFMDFKSHAC >CP034953|2220778:2230742|2229469_2230742_+|QAA89827.1|transposase|DBSCAN-SWA MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEAVEYGRGKKVDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >CP034953|2220778:2230742|2228565_2229456_-|QAA91899.1|integrase|DBSCAN-SWA MGCLEIQGASLIPKSNYIVTQPGRSAVTVNNYMTNLNAVFQFGVDNGYLADNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAKNLWCVSVYTGVRPGELCALGWEDIDLKNGTMMIRRNLAKDRFTVPKTQAGTNRVIHLIKPAIDALRSQMTLTRLSKEHIIDVHLREYGRTEKQKCTFVFQPEVSARVKNYGDHFTVDSIRQMWDAAIKRAGLRHRKSYQSRHTYACWSLTAGANPAFIANQMGHADAQMVFQVYGKWMSENNNAQVALLNTQLSEFAPTMPHNEAMKN >CP034953|2220778:2230742|2225124_2225466_-|QAA89822.1|DBSCAN-SWA MKITLSKRIGLLAILLPCALALSTTVHAETNKLVIESGDSAQSRQHAAMEKEQWNDTRNLRQKVNKRTEKEWDKADAAFDNRDKCEQSANINAYWEPNTLRCLDRRTGRVITP >CP034953|2220778:2230742|2223816_2224527_+|QAA89820.1|DBSCAN-SWA MKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY >CP034953|2220778:2230742|2220778_2223205_-|QAA89818.1|DBSCAN-SWA 
MSKNERMVGISRRTLVKSTAIGSLALAAGGFSLPFTLRNAAAAVQQAREKVVWGACSVNCGSRCALRLHVKDNEVTWVETDNTGSDEYGNHQVRACLRGRSIRRRINHPDRLNYPMKRVGKRGEGKFERISWDEALDTIASSLKKTVEQYGNEAVYIQYSSGIVGGNMTRSSPSASAVKRLMNCYGGSLNQYGSYSTAQISCAMPYTYGSNDGNSTTDIENSKLVVMFGNNPAETRMSGGGITYLLEKAREKSNAKMIVIDPRYTDTAAGREDEWLPIRPGTDAALVAGIAWVLINENLVDQPFLDKYCVGYDEKTLPADAPKNGHYKAYILGEGDDKTAKTPQWASQITGIPEDRIIKLAREIGTAKPAYICQGWGPQRQANGELTARAIAMLPILTGNVGISGGNSGARESTYTITIERLPVLDNPVKTSISCFSWTDAIDHGPQMTAIRDGVRGKDKLDVPIKFIWNYAGNTLVNQHSDINKTHEILQDESKCEMIVVIENFMTSSAKYADILLPDLMTVEQEDIIPNDYAGNMGYLIFLQPVTSEKFERKPIYWILSEVAKRLGPDVYQKFTEGRTQEQWLQHLYAKMLAKDPALPSYDELKKMGIYKRKDPNGHFVAYKAFRDDPEANPLKTPSGKIEIYSSRLAEIARTWELEKDEVISPLPVYASTFEGWNSPERRTFPLQLFGFHYKSRTHSTYGNIDLLKAACRQEVWINPIDAQKRGIANGDMVRVFNHRGEVRLPAKVTPRILPGVSAMGQGAWHEANMSGDKIDHGGCVNTLTTLRPSPLAKGNPQHTNLVEIEKI >CP034953|2220778:2230742|2224529_2225090_-|QAA89821.1|DBSCAN-SWA MPSAHSVKLRPLEREDLRYVHQLDNNASVMRYWFEEPYEAFVELSDLYDKHIHDQSERRFVVECDGEKAGLVELVEINHVHRRAEFQIIISPEYQGKGLATRAAKLAMDYGFTVLNLYKLYLIVDKENEKAIHIYRKLGFSVEGELMHEFFINGQYRNAIRMCIFQHQYLAEHKTPGQTLLKPTAQ >CP034953|2220778:2230742|2226132_2227347_+|QAA89824.1|DBSCAN-SWA MKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQCGIPGMKTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFNEHLLHDMHHRLTPIEAARFGKSIEDYRMFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYEPAYLPVARLEDGTLWNW >CP034953|2220778:2230742|2227358_2228378_+|QAA89825.1|DBSCAN-SWA MKSILIEKPNQLAIVEREIPTPSAGEVRVKVKLAGICGSDSHIYRGHNPFAKYPRVIGHEFFGVIDAVGEGVESARVGERVAVDPVVSCGHCYPCSIGKPNVCTTLAVLGVHADGGFSEYAVVPAKNAWKIPEAVADQYAVMIEPFTIAANVTGHGQPTENDTVLVYGAGPIGLTIVQVLKGVYNVKNVIVADRIDERLEKAKESGADWAINNSQTPLGEIFTEKGIKPTLIIDAACHPSILKEAVTLASPAARIVLMGFSSEPSEVIQQGITGKELSIFSSRLNANKFPIVIDWLSKGLIKPEKLITHTFDFQHVADAISLFEQDQKHCCKVLLTFSE 
>CP034953|2220778:2230742|2223403_2223709_-|QAA89819.1|DBSCAN-SWA MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGAPAEPQ |
11 | Escherichia_phage(16.67%) | integrase,transposase | attL 2222680:2222693|attR 2235824:2235837 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_6 |
2233946 : 2253144
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >CP034953|2233946:2253144|DBSCAN-SWA ATCACTCACCTGAGTTTCTTTCCAGCCAGCGACGGGCACCATTTTCGGTTTTAAACGTTTTGCTTTTGGTATACGTCATTGCGGTGAACGTGCCGTCCTGGTTTGGAAACACGCCGTACACCAGAGATTCGTTGTTGCCAAGATCGATAGTATCCATGCTGACCTCATTTCCCCTTAACGCTGGGGTAGCGGAACTGTTTGCTGAGAACACCGTGCGGTGTGTTGATGCAAACAAGATTAGCCATGACTAACATATCGGTCAAGTGATTTTGTATGCTATAGCTAACATAATTGATGTGGTAAAAGATAACTCATTGATGATGTTATCTTTTATTTGTCCGCTGACGGGCTTTTAGTAATTCTTCAAAGAGTTTATTGAAGTTTTTTACTCGAGCTCGCATTTCGGCGAGCTGGGTATCCTGTTCTGATTCTGGCAGTGCATTAAACAGCTCAAGGAGCTCTAGTTCTTTGGGGGATAAGGCAACTGGCTTCTCAACAGGTGGTGTTGGTTGCTTGTCTTCATCGCCAAATAGAATCCATGTTGGTGAGCATTGCAATACTTTACTGAGGGCAAAAAGGTTCTTCCCTGTAGGTTCACTATCACCCCGTTCCCATTGTGATACAGACACATGGGAGATTTTCAGGGCTTTAGCAAGAGACCTTTGGGTGTGTTTGAGGTTTTTCCGACGATACCTGATGCGTTCGCCGATAGTTAAATTTTTTGTTTCCATAGTTAGCTAATGCTAAATCGTATTGACTATGTTTTTGTTAACATCTATCTTGTTAGTTATGACTAACAATAAAGGTGTTTTAAATGCTTAAAACTGACGCTCTTTTGTATTTCGGTTCAAAAACAAAACTTGCACAAGCAGCAGGTATTCGTTTGGCTTCGCTTTATAGCTGGAAAGGGGATTTAGTTCCCGAAGGTCGCGCGATGCGTCTACAGGAGGCATCTGGCGGGGAGCTTCAGTATGATCCCAAAGTTTATGATGAATATCGTAAGACGAAGCGGGCGGGGCGGTTGAACAATGAAAATCACTCCTGAACAGGCTCGTGAGGCTCTGGATGCCTGGATATGTCGACCAGGAATGACACAGGAGCAGGCGACGATATTAATCACTGAAGCATTCTGGGCTTTGAAAGAGCGCCCGAACATCGATGTTCAGCGTGTCACATATGAAGGTGGCGCGATTGATCAGCGAGCGCTTGGCGTTAATCGAGTGAAGATATTTGAACGCTGGAAGGCTATCGACACCAGGGATAAGCGTGAAAAGTTCACGGCGCTAGTGCCTGCAATTATGGAGGCTACCACTGGATGATGAAAAGAGTGTCCATGAACGAGTGGATAATGCCCGTATTGATTTACGCTCAAAATACTATGTTAAACCTAAAGCTGACCATCCCTGGCTTACGCGCCGAACGCAAAGTCATCAGCAAGTTAAGCCCCCGAAGTTACCTAAAAAGAAGCCTGATCCCGATAAAAAAGATTGAAACCAAGATCGATTCGGTTGAGTGCATATCCATTCATAGGGTAGATTCTTAAGTCGCGTTTCTGGTGTTCATTTTCGGGTGGTTTGTTACTTGTTTTACCGGGGATATGCCAGAAACGCGCTGAGTCAGTCTGGGCGGTGCGCGTAATGAGGCGTTATGGTAAATAGCCTATGCTAATGTCCGCTAAGAGCAAGAAGCGGAAGTTGGCAGTTTTGTGGACTGTCCCCACAAAAGTGACTACAGAAATAGTTGCAATTCATAATTGATCATGGGTTGTCAGTTAAACTCGTGGCGATTTAAATAGACTAATTGGGAGTGCGTCCATTACTTATATCTTGTAATGTTAACTATCAGAAATGATACAAAGATAATATGTCTTTAAAGAAAAGGCTGATGGCGAAAAGTGGCCCGATGAGGGCCACAATACGGCTGTCA
CTTAGACGTAAATATCAATGGTGCCAGCGGTATTTGTATCGTCTTTTTTCTCTTCTTTTTTATCAGGCTGAACTGTCGCGTCTTCATTCTTTTTCTCTGCCTGCTGCCTTAACAACTGCTCCAGTTGAGCCCAGAGGCTTTCAATTTGCTTCTGTACCAATGCAGCCATTTCTTTTTTCTGCTGTGTCGTCATCCCCTCTTCCGATGAGATTTTCCCAAGCTTTTCAGTCAGCACCTGAATTTGTCTTGTGATTTTGGCTATTTCTGATGTTCCTTCCGGGGCGGAGTTGTTTGAAATAACGGTTGAGGTATTTCCCTGAATTGTGACAGACATAGATTTCTCCTTTTAAAAAAGCACTATCGGCATGCACAAAAAAATCTTTAATCGTATTTCTTGTGTCATTAATTGTTTGATGTTCAGATTGTTTTCCTCGCGGGCTGGCGCGCCTCAGAAAGTAAAGCTTGTTGACAGGGGTAAACGTTCGGCAATAATTTTCTGCCGCATGCGGGTGTTGCATAAAACGTGTTACGTTCCTTTATCGACAGGTCAGGTCACCGCTCACCCGCCGACGAGAAAGCAACACTGACATGCTAAAGCAAAAAATAGATGAATAAGTTGAGTTGTGCATATGTAGCCTGACCGTCACAAAGTATATGGTGTCTGTACCAGTAAGATGATGGCCGGACTCTTTAAAAACGAGCTGACCTGCACAATACAGGATGGACTTAGCAATGGCTGCTCCTGGCACAAAGCGGACAGTGATCACCGTTCTTACGACTACTTTCTGACTTCCTTCGTGACTTGCCCTAAGCATGTTGTAGTGCGATACTTGTAATGACATTTGTAATTACAAGAGGTGTAAGACATGGGTAGCATTAACCTGCGTATTGACGATGAACTTAAAGCGCGTTCTTACGCCGCGCTTGAAAAAATGGGTGTAACTCCTTCTGAAGCGCTTCGTCTCATGCTCGAGTATATCGCTGACAATGAACGCTTGCCGTTCAAACAGACACTCCTGAGTGATGAAGATGCTGAACTTGTGGAGATAGTGAAAGAACGGCTTCGTAATCCTAAGCCAGTACGTGTGACGCTGGATGAACTCTGATGGCGTATTTTCTGGATTTTGACGAGCGGGCACTAAAGGAATGGCGAAAGCTGGGCTCGACGGTACGTGAACAGTTGAAAAAGAAGCTGGTTGAAGTACTTGAGTCACCCCGGATTGAAGCAAACAAGCTCCGTGGTATGCCTGATTGTTACAAGATTAAGCTCCGGTCTTCAGGCTATCGCCTTGTATACCAGGTTATAGACGAGAAAGTTGTCGTTTTCGTGATTTCTGTTGGGAAAAGAGAACGCTCGGAAGTATATAGCGAGGCGGTCAAACGCATTCTCTGAACCAAAGCATGACATCTCTGTTTCGCACCGAAGGTGACACTTCTGCTTTGCGTTGACAGGAGAAGCAGGCTATGAAGCAGCAAAAGGCGATGTTAATCGCCCTGATCGTCATCTGTTTAACCGTCATAGTGACGGCACTGGTAACGAGGAAAGACCTCTGCGAGGTACGAATCCGAACCGGCCAGACGGAGGTCGCTGTCTTCACAGCTTACGAACCTGAGGAGTAAGAGACCCGGCGGGGGAGAAATCCCTCGCCACCTCTGATGTGGCAGGCATCCTCAACGCACCCGCACTTAACCCGCTTCGGCGGGTTTTTGTTTTTATTTTCAACGCGTTTGAAGTTCTGGACGGTGCCGGAATAGAATCAAAAATACTTAAGTAGCGCGCAGGGATAAGAGGGATGGTCCCTTAAAGGGGAGAGCTAATTATCCGGAAGGATTCTGATGATGAACATCGAAGAACTGCGTAAAATTTTTTGTGAAGATGGCCTCTATGCTGTGTGCGTTGAAAATGGAAATCTTGTTAGTCATTACCGCATTATGTGTTTGCGAAAGAATGGGGCTGCGTTAATTAATTTTGTGGATGCTCGGGTCACGGACGGATTTAT
CTTGCGCGAAGGTGAGTTTGTCACTTCATTACAGGCATTGAAAGAGATCGGAATAAAAGCTGGCTTTTCTGCTTTTTCAGGAGAATAAACTCATCTACAATCTTGCGCGGGGCTGAACTCCCGCTGAGTAACACCGTGCCACCGGAGAAAACCGATGGCACGCAACGCAAAATATTACAATTCTGATAATTCGCCCGTTCTTGCCTGCACGCACGGGCGGTATTCTCACGCATTCAAGTCTGAATGGTTCCAGCACCCTCCATGCACTGCAGAACAGGCCGAATGGCTGATTCATTCTTACCGCAGGCGCGGGTTCGAGGTTAAGAAAGCTCTCAGTCTCGACTATCGGCACTGGATAATCTCTGTCAGGCTGCCTTATTCCGAACGCCCACCACGTGCGTCCCGCACTTTCCAGCAACGGATCTGGAGGTAACGTGCGGGTATTACTTAGACCTGTTCTGGTGCCTGAGCTTGGGCTGGTGGTCCTTAAGCCGGGCCGTGAATCCATACAGATATTTCATAATCCTCGAGTGCTGGTGGAACCGGAACCAAAAAGCATGCGTAATCTGCCATCCGGAGTCGTTCCTGCCGTTCGCCAGCCGCTGGCGGAAGACAAAACATTGCTGCCGTTTTTTAGTAACGAACGGGTGATTCGTGCTGCTGGCGGCGTTGGCGCATTGTCCGACTGGCTATTACGTCATGTTACATCCTGCCAGTGGCCTAATGGCGATTACCATCACACTGAAACAGTCATTCACCGTTATGGTACCGGCGCAATGGTGTTGTGCTGGCACTGCGACAACCAACTGCGTGACCAGACATCGGAATCACTGGAGCTGCTTGCTCAACAAAATCTGACAGCATGGGTGATTGACGTCATCCGTCACGCAATAAGCGGTACGCAGGAGCGGGAATTATCTTTGGCTGAATTATCCTGGTGGGCGGTCTGCAATCAGGTGGTGGATGCACTACCTGAGGCTGTATCGCGTCGTTCGCTGGGATTACCAGCGGAAAAAATCTGCTCGGTGTACCGCGAAAGCGACATCGTACCGGGAGAGCAGACCGCCACCAGCATATTGAAACAACGCACAAAAAATCTTGCACCGTTGCCTTACGCCCACCAGCAACAAAAATCACCACAGGAAAAGACGGTGGTAAGCATCACCGTTGATCCAGAGTCTCCGGAATCTTTCATGAAGCTGCCTAAACGTCGCCGCTGGGTTAAGGAGAAATACACACGTTGGGTTAAGACACAGCCGTGTGCTTGCTGCGGTATGCCAGCCGACGATCCGCATCATCTGATTGGTCACGGGCAGGGCGGAATGGGAACAAAAGCACATGATCTCTTTGTGTTGCCTTTGTGCAGAAAGCATCACAACGAGCTGCATACGGATACAGTGGCATTTGAAGATAAGTATGGCTCCCAACTGGAGCTGATATTTCGTTTTATCGATCGCGCGCTGGCAATTGGCGTACTGGCGTAAGTGGAGAACGAGCATGAACCTTGAAGCCTTACCAAAATATTACTCCCCAAAATCTCCAAAATTGAGCGATGACGCTCCAGCGACAGGCACCGGTTGTTTAACAATTACGGATGTAATGGCAGCGCAGGGGATGGTGCAGTCGAAAGCACCACTTGGGTTGGCCTTATTTCTGGCAAAAGTTGGTGTTCAGGACCCTCAGTTTGCGATTGAAGGCCTGCTAAATTACGCGATGGCACTGGATAACCCGACATTGAACAAATTGAGTGAAGAAATCCGGTTACAGATTATTCCTTACCTCGTGAGTTTTGCCTTTGCTGATTACTCCAGGTCTGCGGCAAGTAAGGCTCGCTGTGAGCATTGTTCAGGTACGGGATTTTATAATGTATTGCGCGAAGTGGTGAAACACTACAGACGCGGGGAATCTGTAATCAAGGAAGAATGGGTGAAGGAACTATGTCAGCATTGCCATGGTAAGGGCGAAGCCAGCACAGCGTGCAGAGGGTGTAAGGGTAAAGG
GATTGTTCTGGATGAAAAAAGAACCCGGTTTCATGGCGTACCGGTATATAAGATTTGTGGGCGTTGTAATGGAAACCGGTTTAGTCGTTTACCGACCACGCTGGCACGACGTCATGTCCAGAAGCTGGTACCAGACCTGACCGATTATCAGTGGTATAAGGGGTATGCGGACGTCATTGGTAAACTGGTAACAAAGTGCTGGCAGGAAGAAGCATACGCGGAAGCGCAATTGAGGAAGGTGACGAGATAAATGATTTTTGCTGAAGATGGCGACATGATGTTTGCATTTTTCAAAAAATATGGATAAAATTTTTTCAACGATGGGCTTTGTATACCCGACGTTAAGAAAAAGTAGAAAACCCGCTGATGAGCGGGTTTTGTGCTTTAAATGGGGCAATGGTAATGTTGAATCTCATCCCGGGACTCATGTCTGTTAACTTATTATTTAGCTGGTGACTTGGTTATTTGCCTGATGTTTAAAATGTTTTCTTCCAGTACAATGTCCCTAAACACAATGAGTCTGCTTATTATATTATTAGCAGAGCTATTACGGCCAAAGTACAGCATAAGCTTTTAAAGCCAATCAACCAGTCATCAAGACAGACGGGGTTATTCATAAAAACTCTCCATGTGTGATCCGATGGGGCCTGAAATTAAAGCTTTAATATAGCTCATGAAAGGTAAACATTGGCAGCTGAAGGGCCACGCAGACCATTTATCCGGCAAAATTCCACGCGTAATCCGGTGGTAATTTCTTCTGCATCGCGGAGATTGAGCGCTGAAACATGAAGCTGGACATCGATACGACCATCGGATGGGGTGATAAGACCCTTGCCGCTTTTGCCGTCAAAGGTTTTGACAATTCCTGTCATTTTACGGGACAAAAAAATTCCTTAATACTGATAACTTGGCGCACTATACACACGTTCCTGAAGAAAGCTATAGTTTTTTGATGGGGTTGAAGATGGCTGGATGTCTAAAATAAACATTGCTTCATATGTTCAACTATGCGTTAATGATTGCGTCGGTTTGAAGAACAGACGATATACGAAGTAGTTTACTAAAGCAGTTCTCATTTCAGGTGTTATTCACTTATTCCTTCTTTGAGTCTCTCCAATTAAGTACGAAGTCGTTTCTGTTATGCAAACCATTTATGCCGAAAGGCTCAAGTTAAGGAATGTAGAATGTCAAATAAAATGACTGGTTTAGTAAAATGGTTTAACGCTGATAAAGGTTTCGGCTTTATTTCTCCTGTTGATGGTAGTAAAGATGTGTTTGTGCATTTTTCTGCGATTCAGAATGATAATTATCGAACCTTATTTGAAGGTCAAAAGGTTACCTTCTCTATAGAGAGTGGTGCTAAAGGTCCTGCAGCAGCAAATGTCATCATTACTGATTAAAATTCATCGCTCGTCTGTATACGATAACGAAGAAGGCTGATGCCTGAGTAGAGATACGGACAGAGTAGTGAATATTGGATCTCTTTAATAAAAAGTAAGGAGGTCCAATACATGAAACAATGGCTAGCATATTTGGCAAAATCTTAATCAGGAAAAGTATGCTAACCATTGTGGTGAAGTGCAGGTTTGCTGCATGAATAGTTTTACAGCAGAAGCTAACTGCTGGCATGGCAAAACAAAGTGCGTAAGTGGATGACTCCCACAAAAAGCACCACAATCTCAAACCCGCTCAGGCGGGTTTTTTATTATCTGCTTTAAATATATTATTAAAATATAAAAAATACTTGTTACTAATAAAATCAATCAGGCTACAGCTTTAAGATTTGTCTGGAATACTTTGTTGCAATGAGGGCAGATCAAAAGGGCACCTTTTTGTACTCTTGAAAAACTGTGTTCTGACTCTTGGGTGCAGTTTGGGCAGGAACATTTAACGAGATAATTACGGCGTGATTTTGAGTTTTTACGTTCTGACATAGGCTTTTCCTGTATAAATGGCCGTATACAGTACACTAAATATGAAAACATTTCTCGTATTATTAT
TTTATATATGACTTTCTTTCAAAATAATTACCCACATTTTTAATGTGTATGTTTTTTTAGCGCCGTTGAGAACAACGTGTGCTGTCAAAACTACCCCGTAGACTCCGATCTTTTCAAACATATTGCACCATCCGTGTACATCGGGGTGAGGATATGAAATCAATGGATAAGTTAACAACAGGTGTTGCCTATGGCACATCGGCGGGTAATGCTGGTTTCTGGGCATTGCAGTTACTCGATAAAGTAACTCCGTCACAGTGGGCTGCAATCGGTGTGCTGGGTAGCCTGGTTTTTGGCCTGCTGACGTATCTGACAAATCTTTATTTCAAGATTAAAGAAGACAGGCGTAAGGCTGCGAGAGGAGAGTAATCCAATGACTCAAGACTATGAACTGGTTGTGAAAGGAGTCCGTAATTTTGAGAATAAAGTTACGGTAACTGTAGCCTTACAGGACAAAGAACGCTTTGACGGTGAAATTTTTGACCTGGATGTCGCCATGGACCGTGTTGAAGGAGCTGCGCTGGAGTTTTATGAGGCAGCAGCCAGAAGGAGCGTCCGGCAAGTCTTCCTGGAAGTAGCAGAAAAATTGTCAGAAAAAGTTGAGTCTTATCTGCAGCATCAGTACTCCTTTAAGATTGAAAATCCTGCCAATAAGCACGAGCGTCCTCATCATAAATATCTATGAACACAAAAATCAGATACGGCCTGTCGGCTGCCGTTCTGGCGCTGATTGGTGCTGGCGCATCTGCTCCTCAGATACTTGACCAGTTTCTGGACGAAAAAGAAGGTAACCACACAATGGCATACCGCGATGGTTCTGGCATATGGACCATCTGTCGGGGTGCCACAGTGGTGGATGGAAAAACCGTTTTTCCCAATATGAAACTGTCGAAGGAAAAATGCGACCAGGTCAACGCCATTGAGCGTGATAAGGCGCTGGCATGGGTGGAGCGCAATATTAAAGTACCACTGACCGAACCACAAAAAGCGGGTATCGCGTCATTTTGTCCCTATAACATTGGCCCCGGTAAGTGTTTCCCGTCGACGTTTTATAAGCGGCTGAATGCTGGTGATCGTAAAGGTGCATGCGAAGCGATTCGCTGGTGGATTAAGGATGGCGGACGCGATTGCCGCATTCGTTCAAATAACTGTTACGGTCAGGTTATTCGTCGTGACCAGGAGAGCGCATTAACCTGCTGGGGGATAGAACAGTGAATCAGATATTCATGGTGATTTTTCTCGTGTTGTCAGGATTTATCGTCGGAAATGTCTGGAGCGACCGAGGATGGCAAAAAAAATGGGCGGAACGTGATGCTGCCGCATTATCACAAGAGGTAAATGCTCAATTTGCTGCTCGAATAATTGAACAGGGGCGAACTATAGCCCGTGATGAGGCTGTTAAAGATGCGCAACAGAAATCTGCTGAAATTTCTGCCAGGGCTGCTTATCTGTCTGATAGTGTTAACCAGTTGCGTGCCGAAGCAAAAAAATATGCCATACGCCTTGACGCAGCGAAGCATACCGCAGATCTTGCCGCTGCCGTCAGAGGCAAAACAACCAAAACCGCCGAAGGAATGCTCACCAACATGCTCGGAGATATTGCAGCAGAAGCTCAGCTTTATGCTGAAATTGCTGACGAACGCTACATCGCAGGAGTGACTTGTCAACAGATCTATGAATCTTTAAGAGATAAAAAGCATCAAATGTAGGGTAATATTAAATCGGAACATTTACATCGCGGAATGTAAAATTTAAATAAAAAGGACTCTTCCATGAGCCAAAATTCCTGAAATCTTAAGGGTAAGATAAAAGGTCTTAATCAGAATGACACGTTTTATTAATAAATAAAGCTATTCTTTCATTGCTGTGTTTTTCTTTACAAAAGTAATCCTTGCTATGGGTGGTTAATCATGCGTTAATGGTGTTCTGGTTTGTTACAAATTTATCTGAAGCAGTCATTGTTATAATTTTATTATTTGTACCTCTTGAGATTTCCTTGT
TGGTTTTTCTCTCTGATATTTTTTTTCGGACCATTCTGCCCAAGGGCTAATTTCTTCAAAAGGTAATAATTATGTCTAACAAAATGACTGGTTTAGTGAAATGGTTTAACCCTGAAAAAGGTTTTGGTTTCATCACGCCGAAAGATGGCAGCAAAGATGTGTTTGTCCATTTCTCAGCAATTCAGAGCAACGATTTCAAAACATTAACTGAGAATCAGGAAGTTGAATTTGGTATTGAGAACGGACCTAAAGGTCCTGCCGCTGTTCATGTAGTGGCGCTTTGAGGTAGACAATATTACAAACCATATTCACTTTAGATGCCCGTGTTGTCATGGTTCCCAGTATAGAACATCATCTTTTGATGTTTCTGACATGAATCCTTTCGGGGCAAAATGTATCTTTTGTAAATCAATGATGATTACATTTGATAATATTTCACAATACTTAAATGCCAGCCGTCTGTCGTTGGATTTAAAAAAGTGAAAATGAAGGCTCCTTCGGGAGCTTTTTTGCTTGGTGTCTATTCGATGGATACTCACATACTACGGTAACATCATGAAAAAAATCATAGTTTTTTTAACTCTGAACCAGCAGTGGTAGTGCCAGCGATGACTGGAGTTAACACCATCATGCGTGAATATCCAAATGGCGAAAAAACACACCTTACTGTAATGGCCGCAGGGTTTCCATCTCTGACCGGAGATCATAAAGTCATTTATGTAGCCGCGGATCGACATGTTACTTCAGAAGAAATTCTGGAAGCAGCAATAAGGCTCTTGAGTTGATTTGATGCTATTGCATTGATAATTCAGGAAAATTCTCTTTGTCTGTTTGTGTAAAATTTAGACTATCGTATGTTGATTATTGCGATGTTTCATCTTATCTTTTACACGTTTGCACCATATAATCGACTTACTGTGTAACTGGAAAGTCATAACAGACTAAAAGAGGAAATGATGAATATTGAAAACTTAAAAACAAAAGCAGAAGCAGATATTTCTGAATATATAACAAAAAAAATTATTGAACTTAAGAAAAAGACCGGGAAAGAAGTTACCAGTATTCAGTTTACCGCACGGGAAAAAATGACGGGTCTTGAAAGCTATGATGTCAAGATTAATTTAATCTGATGTATTCAATAATAAAATTTATCCATAAACCTCGTTTTTACGGGGTTTTGTTATATTTGAATGGTTCCGAATATCTAAATCACAATTGTTGATGGTTTTTATTAAACCAATGCAGTCCGGCTCAGGAGTGAGAGAAGCCGGACGTTATGGTTTAGCGTGGTAAGATCTGTGTAGTTTTCTGGATGCTTTCAGTAAATAGTAATGAATTATCAAAGGTATAGTAATATCTTTTTTGTTCGTGGATATTTGTAACCCACCGAAAAACTCCTGCTTTAGCAAGGTTTCTTCTGTATTCCTGAAATGTGATCTCTCTGGATTTCAGCTTATTAGAGGTCGTTTCTATAAGATGCCTATCCTTTGAAAATTTGACAGACACAATGTTTTTTAGGCCCTTTAATAACACTGTATTATCATTTTTTAATACAATATGAACATTCTCTGTGGCTAAATAGTAAATGTAATGTGAGACATTGTGACGTTTTAGCTCAGAATAAAACCATTGATAGTTTAAATCGTTTCGAACTTTATCAAATATTTGTTTAAAAATGACTACCTGATCCATAGATAAACCTTCCATGTGATATGAGGGGGCGTAGTCTGCACGATTATCTAAATTGCTTCAATCTGGTCTGACCTGTTTTCTGAGCAATTCAGTAATGTCACTCTTTTCTTTGTTTGCTTCAGAAGAAACTCTTTTTTCTGAGCACAGTCTCCGGCGGCAGGCTTCAATGACCCAGGCTGAGAAATTCCCGGACCCTTTTTGCTCAAGAGCGATGTTAATTTGTTCAATCATTTGGTTAGGAAAGCGGATGTTGCGGGTTGTTGTTCTGCGGGTTCTGTTCTTCGTTGACATGAGGTTGCCCCGTATTC
AGTGTCGCTGATTTGTATTGTCTGAAGTTGTTTTTACGTTAAGTTGATGCAGATCAATTAATACGATACCTGCGTCATAATTGATTATTTGACGTGGTTTGATGGCCTCCACGCACGTTGTGATATGTAGATGATAATCATTATCACTTTACGGGTCCTTTCCGGTGATCCGACAGGTTACGGGGCGGCGACCTCGCGGGTTTTCGCTATTTATGAAAATTTTCCGGTTTAAGGCATTTCCGTTCTTCTTCGTCGTAACTTAATGTTTTTATTTAAAATACCCCCTGAAAAGAAAGGAAACGACAGGTGCTGAAAACGAGCTTTTGGGCCTCTGTCGTTTCCTTTCTCTGTTTTTGGCCGTGGAATGAACAATGGAAGTCAACAAAAAGCAGCTGGCTGACATTTTCGGTGCGAGTATCCGTACCATTCAGAACTGGCAGGAACAGGGAATGCCCGTTCTGCGAGGCGGTGGCAAGGGTAATGAGGTGCTTTATGACTCTGCCGCCGTTATAAGATGGTATGCCGAAAGGGATGCTGAAATTGAGAACGAAAAGCTGCGCCGGGAAGTTGAAGAACTGCGGCAGGCCAGCGAGACAGATCTCCAGCCAGGGACTATTGAGTACGAACGCCATCGACTTACGCGTGCGCAGGCCGACGCACAGGAGCTGAAAAATGCCAGAGACTCCGCTGAAGTGGTGGAAACCGCATTCTGTACTTTCGTGCTGTCGCGGATCGCAGGTGAAATTGCCAGTATTCTCGACGGGATCCCCCTGTCGGTGCAGCGGCGTTTTCCGGAACTGGAAAACCGACATGTTGATTTCCTGAAACGGGATATCATCAAAGCCATGAACAAAGCAGCCGCGCTGGATGAACTGATACCGGGGTTGCTGAGTGAATATAACCGCGCTGACAGACAATACGCAGGGGGCAGCAGGTCTTGAGTTATACGAGGTGTATAACAACGGATATCCAACAGCGTATGGAAATATCATTCACCTGAAAGGGATGACAGCCGTTGGCGAAGGTGAGTTACTCATCGGCTGGAGTGGTACAAGCGGTGCTCATGCTCCGGCATTTATTCGTTCACGACGGGATACGACCGACGCAAACTGGTCGCCGTGGGCGCAGCTTTACACCTCGGCTCATCCTCCTGCAGAGTTTTATCCAGTCGGTGCACCAATCCCGTGGCCATCAGATACCGTTCCGTCTGGTTATGCCCTGATGCAGGGGCAGACTTTTGACAAATCTGCATACCCGAAACTTGCAGTTGCTTATCCGTCAGGCGTGATCCCTGATATGCGTGGCTGGACGATTAAGGGCAAGCCCGCCAGTGGTCGGGCCGTATTATCTCAGGAACAGGACGGCATTAAATCGCACACCCACAGCGCCAGCGCATCCAGTACGGATTTGGGGACGGAAACCACATCGTCGTTTGATTACGGAACCAAATCCACGAATAACACCGGGGCGCATACCCATAGTATTAGCGGGACCGCAAATAGTGCCGGTGCGCACCAACACAAGAGTTCCGGTGCATTTGGTGGCACGAACACGAGCATTTTCCCTAATGGTTATACCGCGATTTCAAATCTAAGCGCGGGGATTATGAGCACAACAAGCGGTAGTGGCCAGACTCGTAATGCAGGGAAGACATCATCAGATGGTGCTCATACCCACTCGCTGTCCGGCACTGCTGCAAGCGCAGGCGCGCATGCACATACTGTCGGTATTGGTGCTCATACGCACTCCGTTGCGATTGGTTCACATGGACACACCATCACCGTTAACGCTGCTGGTAACGCGGAAAACACCGTCAAAAACATCGCATTTAACTATATTGTGAGGCTTGCATAATGGCATTCAGAATGAGTGAACAACCACGGACCATAAAAATTTATAATCTGCTGGCCGGAACTAATGAATTTATTGGTGAAGGTGATGCATATATTCCGCCTCATACAGGTCTGCCAGCAAACAGTACCGATATTGCACCGCCAGAT
ATTCCGGCTGGCTTCGTGGCTGTTTTCAACAGTGATGAGTCATCGTGGCATCTCGTTGAAGATCATCGGGGTAAAACGGTTTATGACGTGGCTTCCGGCGACGCGTTATTTATTTCTGAACTCGGTCCGTTACCGGAAAATGTTACCGGTTATCGCCGGAAGGAGTTTCAGAAGTGGAACGGCACAGCCGGTGAAGGATACGGAAGCAGAAAACTGTTCCGGATCGGGAGGCGGAAGAAACAAAAACAACCTGATGCAGGTAGCCAGTGAGCATATTGCGCCGCTTCAGGATGCTGCAGATCTGGAAATTGCAACGGAGGAAGAAATCTCGTTGCTGGAAGCATGGAAAAAGTATCGGGTATTGCTGAACCGTGTTGATACGTCAACTGCACAGGATATTGAATGGCCAGCACTGCCGTAGGGTAAAACATATAAATTCTATAATTAGATGTATCTTTCCATTTACGGCAAGGAAGGGGCTTGGAAGACGTAAAGCATCTCACACCGAGATTATTTTTTATATGTCAGGTGTCTGAAGTTTTGCTTTGGCTCTTAAAATGGTTTGCCGCGAGGTTTTGAATTCCCGGGCAATGGCACTTATACTTACACCTGACTTAATTCGTTCGAATACCGCCTGTTTCTGTTCTTCATTTAACACAGGTGGTCGACCAAAACGTTTCCCTGCGCCGCGGGCTCTTACTATCCCGGAATGAGTGCGTTCAAGTAAAAGGTCTCGTTCAAATTCAGCGACTGCTGAAATTACTTGCATCATCATTTTTCCTGTTGGACTGGTCAGGTCAATGCCCCCCAATGCTAAGCAATGCACTCTGATACCTGTTTCGGTCAGTTGTTCCACTGTTTTCCTGATATCCATTGCATTACAACCAAGGCGATCCAGTTTTGTCACAATCAATTGATCACCACATTTCAGGCGAGCAAGCAACCGGTTAAAACCGGACGCTCACTGGTTGCTGCTGAGCCGCTAATGTGTTCTTCGATTATTTGCTGAGGTTTGATTTTAAAACCTGCACTTTCGATTTCCCGGCGTTGATTTTCGGTGGTCTGATCCAGCGTTGATATCCGACAGTAAGCAAAAATTTGAGACATAGTGAGACTCTATACGAAATTGGTGTTCATATCATAATGCATCTCAGAAAATAATTATGATTATTTTTGTGCATATTTGTATGTACACGTTCGAAAATAAACGAATGCGTATGCAACCCGTAATTTTGGTGAGACCCAAAATCGATTTTGTGAAAAATGGCTTTAACTCGGTTTGTTTTTCGAGTTCCGGGCGGACTCAAGGAAGAAGAATAGTGTTGCGTGTTATTTTAACCAGATTTCAAGTTGTTTGGTCGTGGAAAAGTGGAGCAAAATGTTGTTAAAGTGGAAAAATGATAAAAAGTAAGTTTATTATATTACATTTTACCATTTAAATTTTGGTTGTCTTTAAGAACTGATATCGCTGTTTGTAATAATTCTTTGTTATCCAGCCATGATTTTTTCTTTATGTTTCCTTCAATGTAATCAAGCAATGTTCTGGTATTGATAGGTCTTCCCTGTTTTGCTACTTCCACTACAGCATCCCCTAGGATAATTCTTACTTCAGGAAGCTGCGCAGGGAACCACTTTAGGGTGTCTTTGATTTCATGAAGATATTCCTTAAAATATTATTGATTTTCATTGCGATATTGTATGTCTGATTCAGGATATGTTGACTTATACATCGGTTTTGTCTGGGTTATTGGATATGCCAATCCCTAATTTTATTAGGGCATGACTAAAAATGCTGAATATGATAAGGAGAGATGTGATTATCAGTATGCTGTTCATATAGCCTCGAATTAGTAATGTGTTATATATGATATAGTTGACAATTTTTATCTTGGGTGTTCTTAAAGTTCGTAGATAAACATTGTCGTTTCAGGTATACAGGAATGCTAACAGGTGGCAGCAAAAATCAGGCGGTTTATGGCGCAAGCTGAAGCGGCAACTGC
AAACTATCTTATGTAGAGACTCTACACGGATTGGGTTTAAAAGTATACATAGATAACAGTTTTTATCTGAAAAAGAAAAATATCAAGGTGATATAGCCTATATGCCTTTGATGCGGAGGAATGAATGTGATGGGAGTGATGTATCTGAATAGTTGAAAAACCGCAGACACACCTTATGCAAGAACGTGCTGCGATTGGCTGGTAAATTTTTCGATAGTGTGAGTATTGAATGATTTCCAGCCGTTCTTGATTTTACGCATAAATCCATGAAAAAACTACTTATCTGTTGGGGAGTTTTTTTGGGGCATATATGGGACAGAAATAGGCCCCAGATAGACACTGAGATCGAACTTAGGATGCTTTTTAAAAAAATGCAACTATCTGAAAAAACCTAGAAAACGCCAAGGAAACCACAGGATGGGAAAAAACACCTGTGAATTATGGATTTCCAGTTATATTCGCTCGGCGCAGCGTTAGTGTTTCATGAAATATTTTTTCCTGAATCATCAACGGCAATGGCGTTAATTCTGGCAATGGGAACCTACGGTGCAGGTTATGTGGCGCGTATTGTCGGAGCATTTATTTTCGGCAAAATGGGCGACAGAATAGGGCGTAAAAAAGTGCTCTTTATTACCATCACCATGATGGGGATCTGTACCACCTTAATTGGTGTGTTACCGACCTATGCACAGATTGGTGTTTTTGCACCCATCTTGCTGGTGACGTTGCGTATTATTCAGGGGTTGGGTGCAGGTGCGGAAATTTCCGGTGCCGGTACGATGCTGGCGGAATATGCGCCAAAAGGTAAGCGCGGAATTATCTCCTCATTTGTGGCTATGGGAACTAACTGCGGAACCTTGAGCGCAACGGCAATCTGGGCCTTTATGTTCTTCATTCTCAGTAAAGAGGAACTGCTGGCGTGGGGATGGCGTATACCGTTCCTGGCGAGTGTTGTCGTGATGGTCTTTGCTATCTGGTTGCGTATGAATCTGAAAGAAAGCCCGGTCTTTGAGAAGGTTAACGACAGTAACCAACCGACAGCAAAACCTGCACCTGCTGGTAGCATGTTCCAGAGCAAATCCTTCTGGCTGGCAACAGGGCTGCGTTTTGGTCAGGCGGGTAACTCCGGGTTAATTCAGACTTTCCTTGCAGGCTATTTAGTGCAGACGTTATTGTTTAACAAAGCAATTCCAACAGATGCATTGATGATCAGTTCGATTCTCGGCTTTATGACCATTCCGTTCCTTGGTTGGTTATCCGATAAAATTGGTCGCCGGATCCCGTATATTATTATGAATACCTCCGCGATTGTGCTGGCATGGCCAATGCTTTCTATCATTGTAGATAAAAGCTATGCCCCGAGCACCATTATGGTTGCACTGATTGTGATTCATAACTGTGCGGTGCTGGGATTATTTGCTCTGGAAAACATTACCATGGCAGAAATGTTCGGCTGTAAAAACCGCTTTACCCGGATGGCTATTTCTAAAGAAATTGGTGGTCTTATCGCTTCCGGTTTTGGTCCTATCCTGGCGGGTATTTTCTGCACCATGACGGAATCCTGGTATCCGATCGCCATTATGATCATGGCATATTCAGTGATTGGTTTAATCTCTGCGCTGAAAATGCCAGAGGTGAAAGACCGTGATTTAAGTGCGCTGGAAGACGCTGCGGAAGATCAACCGCGTGTTGTAAGAGCTGCGCAACCTTCCAGAAGTCTGTAAACCCTTAATCCCTTCTCTTACCGGAGAGGGGATTTTTATTCATATAACAAAACATATAGCTTGCCATATTTATATTTAAGGAATTGTTATGGGAAATAATTTGTTATCAGCAAAAGCGACACTCCCTGTTTATGATCTTAATAACCTGGCTCCAAGAATTGTTCATTTAGGCTTTGGTGCATTTCACCGTGCGCATCAGGGTGTGTATGCCGATATTCTTGCTACGGAACATTTCAGTGACTGGGGATATTATGAGGTCAACTTAATCGGCGGCGA
ACAGCAAATTGCCGATTTACAACAGCAAGATAATCTTTATACCGTTGCGGAAATGTCGGCCGATGTGTGGACGGCTCGCGTCGTTGGCGTCGTTAAAAAAGCCTTGCACGTACAGATAGATGGCTTAGAAACCGTGTTGGCAGCGATGTGTGAACCGCAAATCGCGATTGTCTCTCTGACAATCACCGAAAAAGGGTATTTCCACTCTCCGGCGACCGGACAGTTAATGCTCGATCACCCGATGGTAGCTGCCGACGTGCAAAATCCCCACCAGCCGAAAACAGCAACAGGGGTGATTGTTGAGGCGCTGGCTCGCCGTAAAGCGGCAGGACTTCCCGCATTTACCGTCATGTCATGTGACAACATGCCAGAAAACGGTCATGTTATGCGTGACGTTGTCACTTCCTACGCACAAGCCGTTGATGTAAAACTGGCACAATGGATCGAAGATAACGTGACTTTCCCATCAACAATGGTGGACCGTATTGTGCCCGCAGTGACAGAGGATACGCTGGCGAAAATCGAACAACTTACCGGTGTGCGCGATCCTGCGGGCGTTGCCTGTGAACCTTTCCGCCAGTGGGTAATAGAAGATAACTTTGTTGCCGGACGTCCGGAATGGGAAAAAGCGGGAGCCGAACTGGTTAGCGATGTGCTGCCTTATGAAGAGATGAAGTTGCGCATGCTCAACGGCAGTCATTCATTCCTGGCGTATCTGGGGTATCTTGCAGGATATCAGCACATTAATGACTGTATGGAAGATGAACATTATCGTTATGCGGCGTATGGCTTGATGTTGCAGGAACAAGCGCCGACGTTGAAAGTGCAGGGCGTTGATTTGCAAGATTACGCTAACCGATTAATTGCACGCTATAGCAACCCGGCGTTACGTCATCGAACCTGGCAGATTGCGATGGATGGTAGCCAGAAATTGCCACAGCGGATGTTGGATTCTGTTCGCTGGCATCTGGCGCATGACAGCAAGTTCGATCTGCTGGCGCTGGGCGTCGCGGGTTGGATGCGTTATGTCGGTGGTGTTGATGAACAGGGAAATCCGATAGAAATCAGTGACCCACTGTTACCTGTTATTCAGAAGGCTGTACAAAGTAGTGCCGAAGGGAAAGCGCGCGTCCAGTCATTGCTGGCGATTAAGGCGATCTTTGGTGATGATTTGCCAGACAATAGTTTGTTTACTGCAAGAGTGACGGAAACGTACTTGTCTTTATTAGCGCATGGCGCGAAAGCGACCGTGGCAAAATATTCCGTGAAGTAA
Protein sequences of DBSCAN-SWA_6 >CP034953|2233946:2253144|2236976_2237264_+|QAA89838.1|DBSCAN-SWA MAYFLDFDERALKEWRKLGSTVREQLKKKLVEVLESPRIEANKLRGMPDCYKIKLRSSGYRLVYQVIDEKVVVFVISVGKRERSEVYSEAVKRIL >CP034953|2233946:2253144|2236737_2236977_+|QAA89837.1|DBSCAN-SWA MGSINLRIDDELKARSYAALEKMGVTPSEALRLMLEYIADNERLPFKQTLLSDEDAELVEIVKERLRNPKPVRVTLDEL >CP034953|2233946:2253144|2241055_2241271_+|QAA89846.1|DBSCAN-SWA MSNKMTGLVKWFNADKGFGFISPVDGSKDVFVHFSAIQNDNYRTLFEGQKVTFSIESGAKGPAAANVIITD >CP034953|2233946:2253144|2244827_2245001_+|QAA91903.1|DBSCAN-SWA MNIENLKTKAEADISEYITKKIIELKKKTGKEVTSIQFTAREKMTGLESYDVKINLI >CP034953|2233946:2253144|2244500_2244656_+|QAA89852.1|DBSCAN-SWA MREYPNGEKTHLTVMAAGFPSLTGDHKVIYVAADRHVTSEEILEAAIRLLS >CP034953|2233946:2253144|2246762_2247725_+|QAA89856.1|tail|DBSCAN-SWA MNITALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA >CP034953|2233946:2253144|2240542_2240755_-|QAA89845.1|DBSCAN-SWA MSRKMTGIVKTFDGKSGKGLITPSDGRIDVQLHVSALNLRDAEEITTGLRVEFCRINGLRGPSAANVYLS >CP034953|2233946:2253144|2248388_2248778_-|QAA89857.1|DBSCAN-SWA MTKLDRLGCNAMDIRKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMQVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQAVFERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI >CP034953|2233946:2253144|2246242_2246812_+|QAA89855.1|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIRWYAERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIPGLLSEYNRADRQYAGGSRS >CP034953|2233946:2253144|2250311_2251595_+|QAA89861.1|DBSCAN-SWA 
MDFQLYSLGAALVFHEIFFPESSTAMALILAMGTYGAGYVARIVGAFIFGKMGDRIGRKKVLFITITMMGICTTLIGVLPTYAQIGVFAPILLVTLRIIQGLGAGAEISGAGTMLAEYAPKGKRGIISSFVAMGTNCGTLSATAIWAFMFFILSKEELLAWGWRIPFLASVVVMVFAIWLRMNLKESPVFEKVNDSNQPTAKPAPAGSMFQSKSFWLATGLRFGQAGNSGLIQTFLAGYLVQTLLFNKAIPTDALMISSILGFMTIPFLGWLSDKIGRRIPYIIMNTSAIVLAWPMLSIIVDKSYAPSTIMVALIVIHNCAVLGLFALENITMAEMFGCKNRFTRMAISKEIGGLIASGFGPILAGIFCTMTESWYPIAIMIMAYSVIGLISALKMPEVKDRDLSALEDAAEDQPRVVRAAQPSRSL >CP034953|2233946:2253144|2234759_2234990_+|QAA89833.1|DBSCAN-SWA MLKTDALLYFGSKTKLAQAAGIRLASLYSWKGDLVPEGRAMRLQEASGGELQYDPKVYDEYRKTKRAGRLNNENHS >CP034953|2233946:2253144|2238025_2238304_+|QAA89841.1|DBSCAN-SWA MARNAKYYNSDNSPVLACTHGRYSHAFKSEWFQHPPCTAEQAEWLIHSYRRRGFEVKKALSLDYRHWIISVRLPYSERPPRASRTFQQRIWR >CP034953|2233946:2253144|2237335_2237491_+|QAA89839.1|DBSCAN-SWA MKQQKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >CP034953|2233946:2253144|2243082_2243580_+|QAA89850.1|DBSCAN-SWA MNQIFMVIFLVLSGFIVGNVWSDRGWQKKWAERDAAALSQEVNAQFAARIIEQGRTIARDEAVKDAQQKSAEISARAAYLSDSVNQLRAEAKKYAIRLDAAKHTADLAAAVRGKTTKTAEGMLTNMLGDIAAEAQLYAEIADERYIAGVTCQQIYESLRDKKHQM >CP034953|2233946:2253144|2243942_2244155_+|QAA89851.1|DBSCAN-SWA MSNKMTGLVKWFNPEKGFGFITPKDGSKDVFVHFSAIQSNDFKTLTENQEVEFGIENGPKGPAAVHVVAL >CP034953|2233946:2253144|2233946_2234102_-|QAA89831.1|DBSCAN-SWA MDTIDLGNNESLVYGVFPNQDGTFTAMTYTKSKTFKTENGARRWLERNSGE >CP034953|2233946:2253144|2245152_2245563_-|QAA89853.1|DBSCAN-SWA MDQVVIFKQIFDKVRNDLNYQWFYSELKRHNVSHYIYYLATENVHIVLKNDNTVLLKGLKNIVSVKFSKDRHLIETTSNKLKSREITFQEYRRNLAKAGVFRWVTNIHEQKRYYYTFDNSLLFTESIQKTTQILPR >CP034953|2233946:2253144|2234268_2234676_-|QAA89832.1|DBSCAN-SWA METKNLTIGERIRYRRKNLKHTQRSLAKALKISHVSVSQWERGDSEPTGKNLFALSKVLQCSPTWILFGDEDKQPTPPVEKPVALSPKELELLELFNALPESEQDTQLAEMRARVKNFNKLFEELLKARQRTNKR >CP034953|2233946:2253144|2239368_2240121_+|QAA89843.1|DBSCAN-SWA 
MNLEALPKYYSPKSPKLSDDAPATGTGCLTITDVMAAQGMVQSKAPLGLALFLAKVGVQDPQFAIEGLLNYAMALDNPTLNKLSEEIRLQIIPYLVSFAFADYSRSAASKARCEHCSGTGFYNVLREVVKHYRRGESVIKEEWVKELCQHCHGKGEASTACRGCKGKGIVLDEKRTRFHGVPVYKICGRCNGNRFSRLPTTLARRHVQKLVPDLTDYQWYKGYADVIGKLVTKCWQEEAYAEAQLRKVTR >CP034953|2233946:2253144|2235872_2236205_-|QAA89835.1|DBSCAN-SWA MSVTIQGNTSTVISNNSAPEGTSEIAKITRQIQVLTEKLGKISSEEGMTTQQKKEMAALVQKQIESLWAQLEQLLRQQAEKKNEDATVQPDKKEEKKDDTNTAGTIDIYV >CP034953|2233946:2253144|2242244_2242556_+|QAA89848.1|DBSCAN-SWA MTQDYELVVKGVRNFENKVTVTVALQDKERFDGEIFDLDVAMDRVEGAALEFYEAAARRSVRQVFLEVAEKLSEKVESYLQHQYSFKIENPANKHERPHHKYL >CP034953|2233946:2253144|2242552_2243086_+|QAA89849.1|DBSCAN-SWA MNTKIRYGLSAAVLALIGAGASAPQILDQFLDEKEGNHTMAYRDGSGIWTICRGATVVDGKTVFPNMKLSKEKCDQVNAIERDKALAWVERNIKVPLTEPQKAGIASFCPYNIGPGKCFPSTFYKRLNAGDRKGACEAIRWWIKDGGRDCRIRSNNCYGQVIRRDQESALTCWGIEQ >CP034953|2233946:2253144|2244165_2244354_+|QAA91901.1|DBSCAN-SWA MTNHIHFRCPCCHGSQYRTSSFDVSDMNPFGAKCIFCKSMMITFDNISQYLNASRLSLDLKK >CP034953|2233946:2253144|2249593_2249707_-|QAA89860.1|DBSCAN-SWA MNSILIITSLLIIFSIFSHALIKLGIGISNNPDKTDV >CP034953|2233946:2253144|2237707_2237959_+|QAA89840.1|DBSCAN-SWA MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDARVTDGFILREGEFVTSLQALKEIGIKAGFSAFSGE >CP034953|2233946:2253144|2236407_2236713_-|QAA89836.1|DBSCAN-SWA MSLQVSHYNMLRASHEGSQKVVVRTVITVRFVPGAAIAKSILYCAGQLVFKESGHHLTGTDTIYFVTVRLHMHNSTYSSIFCFSMSVLLSRRRVSGDLTCR >CP034953|2233946:2253144|2249292_2249520_-|QAA89859.1|DBSCAN-SWA MKDTLKWFPAQLPEVRIILGDAVVEVAKQGRPINTRTLLDYIEGNIKKKSWLDNKELLQTAISVLKDNQNLNGKM >CP034953|2233946:2253144|2240398_2240488_-|QAA89844.1|DBSCAN-SWA MNNPVCLDDWLIGFKSLCCTLAVIALLII >CP034953|2233946:2253144|2242024_2242240_+|QAA89847.1|lysis|DBSCAN-SWA MKSMDKLTTGVAYGTSAGNAGFWALQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDRRKAARGE >CP034953|2233946:2253144|2248798_2248978_-|QAA89858.1|DBSCAN-SWA MSQIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPVLTGCLLA >CP034953|2233946:2253144|2251683_2253144_+|QAA89862.1|DBSCAN-SWA 
MGNNLLSAKATLPVYDLNNLAPRIVHLGFGAFHRAHQGVYADILATEHFSDWGYYEVNLIGGEQQIADLQQQDNLYTVAEMSADVWTARVVGVVKKALHVQIDGLETVLAAMCEPQIAIVSLTITEKGYFHSPATGQLMLDHPMVAADVQNPHQPKTATGVIVEALARRKAAGLPAFTVMSCDNMPENGHVMRDVVTSYAQAVDVKLAQWIEDNVTFPSTMVDRIVPAVTEDTLAKIEQLTGVRDPAGVACEPFRQWVIEDNFVAGRPEWEKAGAELVSDVLPYEEMKLRMLNGSHSFLAYLGYLAGYQHINDCMEDEHYRYAAYGLMLQEQAPTLKVQGVDLQDYANRLIARYSNPALRHRTWQIAMDGSQKLPQRMLDSVRWHLAHDSKFDLLALGVAGWMRYVGGVDEQGNPIEISDPLLPVIQKAVQSSAEGKARVQSLLAIKAIFGDDLPDNSLFTARVTETYLSLLAHGAKATVAKYSVK >CP034953|2233946:2253144|2238305_2239355_+|QAA89842.1|DBSCAN-SWA MRVLLRPVLVPELGLVVLKPGRESIQIFHNPRVLVEPEPKSMRNLPSGVVPAVRQPLAEDKTLLPFFSNERVIRAAGGVGALSDWLLRHVTSCQWPNGDYHHTETVIHRYGTGAMVLCWHCDNQLRDQTSESLELLAQQNLTAWVIDVIRHAISGTQERELSLAELSWWAVCNQVVDALPEAVSRRSLGLPAEKICSVYRESDIVPGEQTATSILKQRTKNLAPLPYAHQQQKSPQEKTVVSITVDPESPESFMKLPKRRRWVKEKYTRWVKTQPCACCGMPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHNELHTDTVAFEDKYGSQLELIFRFIDRALAIGVLA >CP034953|2233946:2253144|2245620_2245854_-|QAA89854.1|DBSCAN-SWA MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSEKRVSSEANKEKSDITELLRKQVRPD >CP034953|2233946:2253144|2244356_2244422_+|QAA91902.1|DBSCAN-SWA MKAPSGAFLLGVYSMDTHILR >CP034953|2233946:2253144|2235286_2235436_+|QAA89834.1|DBSCAN-SWA MDNARIDLRSKYYVKPKADHPWLTRRTQSHQQVKPPKLPKKKPDPDKKD |
35 | Enterobacteria_phage(33.33%) | lysis,tail | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotations match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
2445686 : 2476017
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >CP034953|2445686:2476017|DBSCAN-SWA AATGAAAAGCAAAGTACTGGCACTTTTAATTCCTGCCCTGCTCGCCGCAGGTGCTGCACATGCAGCCGAAGTTTATAATAAAGACGGCAACAAATTAGATCTGTATGGCAAAGTTGATGGCCTGCATTATTTTTCTGATAATTCAGCGAAAGATGGCGACCAGAGCTATGCGCGTCTGGGTTTTAAAGGCGAAACCCAAATTAACGATCAACTCACTGGCTACGGTCAATGGGAATACAATATTCAGGCAAACAACACTGAATCTTCAAAAAACCAGTCATGGACCCGTCTGGCATTTGCCGGGCTGAAATTTGCAGATTACGGTTCTTTCGATTACGGACGTAATTATGGCGTAATGTACGACATCGAAGGCTGGACCGATATGCTGCCTGAATTTGGCGGTGACTCTTATACCAATGCAGACAACTTTATGACTGGTCGAGCCAATGGCGTCGCGACTTATCGTAATACTGATTTCTTCGGTCTGGTAAATGGTCTGAACTTCGCGGTGCAGTATCAAGGTAACAACGAAGGAGCCAGTAATGGTCAGGAAGGCACCAACAACGGACGTGATGTTCGCCATGAAAACGGTGACGGCTGGGGTCTTTCCACAACATATGATTTAGGCATGGGCTTTAGCGCTGGTGCGGCATACACCTCTTCTGACCGCACCAATGACCAGGTTAACCATACTGCGGCGGGTGGTGATAAAGCAGACGCGTGGACTGCTGGGCTAAAATACGATGCTAACAATATTTACCTGGCAACCATGTATTCAGAAACGCGTAATATGACCCCGTTTGGCGACAGCGATTATGCTGTGGCAAACAAAACCCAGAATTTTGAAGTCACTGCACAGTACCAGTTTGATTTTGGTCTGCGTCCGGCAGTCTCTTTCCTGATGTCTAAAGGCCGTGACCTGCACGCTGCGGGTGGTGCAGACAACCCGGCAGGTGTTGATGATAAAGATCTGGTTAAATACGCCGATATTGGCGCGACTTACTATTTCAATAAAAACATGTCCACCTACGTTGACTATAAAATCAACCTGTTGGATGAAGATGACAGCTTCTACGCTGCCAATGGCATCTCTACCGATGATATTGTCGCTTTAGGTCTGGTTTATCAGTTCTAAATCCTCCTGCCCGCTGTTATGGCGGGCTTTTTCTGCTTATTCTTCCTCCTTTGATCTAAATTAAAAATGTGAACTCCGTCATTACACAAAAAGTGTCATCTGGCGTTACACTTTATGCGGATACTAAAACAGGAGGTTTTATGAACAGAACGATTCTTGTCCCTATCGATATTTCCGATTCAGAATTAACTCAACGCGTGATTAGCCACGTTGAGGAAGAGGCAAAGATTGATGATGCAGAGGTTCATTTCCTGACGGTAATACCTTCACTGCCCTACTATGCCTCTCTGGGTTTAGCGTATTCCGCAGAATTACCGGCAATGGATGACCTGAAAGCGGAAGCCAAATCGCAACTGGAAGAGATCATTAAAAAATTTAAACTGCCAACCGACAGAGTGCATGTCCATGTTGAGGAAGGCTCGCCCAAAGACCGCATTCTGGAATTGGCGAAGAAGATCCCCGCTCATATGATCATCATTGCTTCCCATCGACCGGATATCACCACTTATCTGCTCGGTTCCAACGCCGCAGCTGTAGTGCGTCACGCAGAGTGCTCCGTGCTGGTTGTGCGCTGACACTAACGCCCGCACATTGCTGCGGGCTTTTTGATTCATTTCGCAAATGTGCTGACATTTTCCCCTCTAATCCGTACCATACACGCCACAGTTTTTTATATCAGATTTCTTATGCTGGGCGTTCCGGCATGACGCATACTCTTCTGATGCCATATAACGAATTGAGTCGCTTTTAAATGTCGCAAAATCAAGAAATTAGTAAGAAAGAACAATACA
ACCTGAACAAGTAAGGGCAAAAATCACAACTCTCTGACTCATAAGTATTTTACTTATTTTTCAATGTGTTAAATATCATCAGCGACAACAATAAGCTACGATAATCCACTCTTTTTTTGCCCCTTTTTTGCCCCTTTTGCAGCGTTTTGCCCCATTTTTGCCACCGAAAAAAATTCCAAAACGTCTCAATCAAAGTCAAACCAACCGCAGCACGTTCTTGCATACGACGTGACTGCGGTTTTTCAACTATTCAGATACATCACTCCCATCACATTCATTCCTCCGCATCAAAGGCATATAGGCTATATCACCTTGATATTTTTCTTCTTCAGATAAAAACTGTTATCTATGTATACTTTTAAACCCAATCCGTGTAGAGTCTCTGCATAAGATAGTTTGCAGTTGCCACTTCAGCTTGCGCCATAAACCGCCTGATTTTTGCCGCCACCTGTTAGCATTCCTGTATACCTGAAACGACAATGTTTATCTACGAACTTTAAGAACACCCAGGATAAAAATTGTCAACTATATCATATATAACACATTACTAATTCGAGGCTATATGAACAGCATACTGATAATCACTTCGCTCCTTATCATATTCAGCATTTTTAGTCATGCTCTAATAAAATTAGGGATTGGCATATCCAATAACCCAGACAAAACCGATGTATAAGTCAACATATCCTGAATCAGACATACAATATCGCAATGAAAATCAATAATATTTTAAGGAATATCTTCATGAAATCAAAAGACACCCTAAAGTGGTTCCCTGCGCAGCTTCCTGAAGTAAGAATTATCCTAGGGGATGCTGTAGTGGAAGTAGCAAAACAGGGAAGACCTATCAATACCAGAACATTGCTTGATTACATTGAAGGAAACATAAAGAAAACATCATGGCTGGATAACAAAGAATTATTACAAACAGCGATATCAGTTCTTAAAGACAACCAAAATTTAAATGGTAAAATGTAATATAATAAACTTACTTTTTTATCATTTTTCCACTTTAACAACATTTTGCTCCACTTTTCCACGACCAAACAACTTGAAATCTGGTTAAAATAACACGCAACACTATTCTTCTTCCTTGAGTCCGCCCGGAACTCGAAAAACAAACCGAGTTAAAGCCATTTTTCACAAAATCGATTTTGGGTCTCACCAAAATTACGGGGTTGCATACGCATTCGTTTATTTTCGAACGTGTACATACAAATATGCACAAAAATAATCATAATTATTTTCTGAGATGCATTATGATATGAACACCAATTTCGTATAGAGTCTCACTATGTCTCGAATTTTTGCTTACTGTCGGATATCAACGCTGGATCAGACCACCGAAAATCAACGCCGGGAAATCGAAAGTGCAGGTTTTAAAATCAAACCTCAGCAAATAATCGAAGAACACATTAGCGGCTCAGCAGCAACCAGTGAGCGTCCTGGTTTTAACCGGTTGCTTGCTCGCCTGAAATGTGGTGATCAATTGATTGTGACAAAACTGGATCGCCTTGGTTGTAATGCAATGGATATCAGGAAAACAGTGGAACAACTGACCGAAACAGGTATCAGAGTGCATTGCTTAGCATTGGGTGGCATTGACCTGACCAGTCCAACAGGAAAAATGATGATGCAAGTAATTTCAGCAGTCGCTGAATTTGAACGAGACCTTTTACTTGAACGCACTCATTCCGGGATAGTAAGAGCCCGCGGCGCAGGGAAACGTTTTGGTCGACCTCCTGTGTTAAATGAAGAACAGAAACAGGTGGTATTCGAACGAATTAAGTCAGGTGTAAGTATAAGTGCCATTGCCCGGGAATTCAAAACCTCGCGGCAAACCATTTTAAGAGCCAAAGCAAAACTTCAGACACCTGACATATAAAAAATAATCTCGGTGTGAGATGCTTTACGTCTTCCAAGCCCCCTTCCTTGCCGTAAATGGAAAGATACATCTAATTATAGAATTTATATGTTTTACCCTACGGCAGTGCTGGCCA
TTCAATATCCTGTGCAGTTGACGTATCAACACGGTTCAGCAATACCCGATACTTTTTCCATGCTTCCAGCAACGAGATTTCTTCCTCCGTTGCAATTTCCAGATCTGCAGCATCCTGAAGCGGCGCAATATGCTCACTGGCTACCTGCATCAGGTTGTTTTTTGTTTCTTCCGCCTCCCGGATCCGGAACAGTTTTTCTGCTTCCGTATCCTTCACCCAGGCTGTGCCGTTCCACTTCTGAAACTCCCCTTCCGGCGATAACCAGGTAACATTTTCCGGTAACGGACCGAGTTCAGAAATAAATAACGCGTCGCCGGAAGCCACGTCATAAACCGTTTTACCCCGATGATCTTCAACGAGATGCCACGATGACTCATCACTGTTGAAAACAGCCACGAAGCCAGCCGGAATATCTGGCGGTGCAATATCGGTACTGTTTGCTGGCAGACCTGTATGAGGCGGAATATATGCATCACCTTCACCAATAAATTCATTAGTTCCGGCCAGCAGATTATAAATTTTTATGGTCCGTGGTTGTTCACTCATTCTGAATGCCATTATGCAAGCCTCACAATATAGTTAAATGCGATGTTTTTGACGGTGTTTTCCGCGTTACCAGCAGCGTTAACGGTGATGGTGTGTCCATGTGAACCAATCGCAACGGAGTGCGTATGAGCACCAATACCGACAGTATGCGCGTGTGCACCTGCGCTTGCAGCAGTGCCGGACAGTGAGTGGGTATGTGCGCCAGCAGATGATGTTGCATAGTTTTGATTATGCACAACAGACAATCTTGTTGATGCACTACCAGCACCGGAGTTAGCACTAGCCGTGTTCACGTTGGCTAGTGAGTGTGTGTGTGCTCCAGCCGAGTTTGTAGAGCCGCTCACACTGTGTGTATGTGCCCCGGTGTTATTCGTGGATTTAGTGCCGTAATCAAACGACGATGTGGTTTTCGTCCCCAAATCTGTACTGGATGCGCTGGCGCTGTGGGTATGCGATTTAATGCCGTCCTGTTCCTGAGACAATACGGCCCGACCACTGGCAGGTTTGCCCTTAATCGTCCAGCCACGCATATCAGGGATCACGCCTGACGGATAAGCGGCTGCAAGTTTCGGGTAAGCAGATTTGTCAAAAGCCTGCCCCTGCATCAGGGCATAACCAGACGGAACGGTATCTGATGGCCACGGGATTGGTGCGCCGACTGGGTAGCTTTCTGGTGGAAGATTTTTCGAGGTATAAACTTCTGCCCAGTCTTCCTCAAAACCATAACCGTCTCTTGAAGAACGGTAGAACAGACCACCATTTCTGTAATGCGCCTTCATCTGCAGGGTCCGGCAACTTCCGACTCCGGTATAGAAGTTAACCAGAATATAGCTGTCGCCAGAGCGGGTGACATTGTAAGCGCCAGATTCGGCATTCCATGGAACGCCACCATCCGCATCGGCATATGTATCCGTTGCCCTTCTGGCAAAAGCAGCCACATGCGCGGCGGTTAAAGTAATATCTTTGGAACCATCAAACTCAACACCAGAAACCCGTCTTGGCGTTTGCAGCTTTGTTGCTGTTAATGCATTACCGTTCAGACTTGCGGACAGTTTGGTTCCAATAACCAGTTCGCCGGTTGCGTTATCAATAGCAAACGGTCTTAATGTATTCCAGCCACCATAAACATCACCTTGATTGGTAAGCAGCAGGTAAGTTTTAGCGCCATCATTACGCCATAATGCACCATACTCCCCACCTATCATTCGAATCTGATTACCACCACGCGCTACAATTTCGTCTGTGGCAAAAAGTTTTTTGCACGACAAGTTATCGTTAACGATTAACGAATGAGACTCATAAAAACCACGCCCACTCTTAAAATCAAGGATAACGTCCGCCGCGATACATTCAGTCGCCGGATTTGTTGCCCCAAACTTATAGGTCGTATCATTAACAACGAGATCAGCACCAGGTGCGGATATTGACAGGCCATCTTCAATAAACGCAAAAAC
AGGGAAAGCAGCGCCATCAACATAGAACACAGAGCGCAAATCATCGCCCTTATTACTCATCATTATTGAGTGAATGGCTCGTTCATTGTTTTGATATTGCCAGAACATGCCATAAGCATAACGCCCCCTGTCAGTCCAGCCACCAGGCATAACAAATCCGTTAAACTCGCAGTTATTCATCGGATCGCCTGCGGTTCGCGTTGCCGTGGTGATAATGACTCTTGATGCCAGTTCGCTTACTGAGCCAGCAGAACGCATAACAACAACAGGGTAATATTTTCCAGATGTTGCACCTGCAGGAGCGTTAACCCGCACATAACGCATACCACGCTTATCAGCAAAGTCTGTTTTACTGACCGCGTTAATGTTGTTCAGGAAGCATCCCTTATCGGGTATATCAGCGCCGTTCTGGTCTTTCTGCAGACGTTTCTCTGCATTGTCATAGGCTGATTTTACTGCCTTTGGCGTTGCCGCCAGCGTTTCAGACGTACTGTTGGTCGCACTGCTGAGCTGTACTATCCCCTTTTTCGTCGTACTTGCATCCTCAAGCGCCACGGCGGATGCAATATCCTCTGCCCGTTTAGCTGCTGTCTCGGCGCGCGTTGCCGCGGATTCCGCCGTACTTTTGCTCTGAGCTGCCGCCGTCGCACTGCCAGCAGCCTCTGTCGCCTTCGTGGATGCCGTCGTGGCGCTGCTCTTCGCTGCTGACGCTTGTCTGGTCGCCTCATCTTTTGAAGCAGACGCCGATGATGCCGATGACGCCGCCGAACTGGCTGACGATGCGGCAGCCGTTTTTGAGGATTCTGCGCTTGTTTCCGACGCTTTCGCGTTCGTTTCGGATGTCTTCGCTGCGGAAGCAGACCTCGCTGCTGCGCTGGCCTGTTCAGTGGCTTCGCCAGCCTTCGTTGTGGCTGTTGAAGCAGACGATGCGGCGCTTTCTGCCGATTTTCCGGCGGCGGTGGCACTGGCTGAGGCCTGCCCGGCACTTGTTGACGCTGCACTGGCAGACGACGCAGCCGCTGTTTTTGAGCCTGCCGCAGCCGAGGCGCTCTGTCCCGCTGCCGTTTCAGAAGACCTGGCGTTCGTCTCGGACGTTTTTGCCGCCTTCGCGGAATTTCCTGCCGCCGTTGCCGAGGAAGCTGCACTACTGGCGCTTGATGATGCGTTCGTTTCTGATGATTTTGCCGCTTCTTTTGAGGCCGCCGCATCCCGGGCCGAGGTCGCAGCTTCTGATGCCTTCGTGGTCGCGGTGGATGCAGATGTGGCTGCTGATTGTAGTGACGCTGAAGCATTCGTTTCTGACGTTTTCGCCGCACCGGCACTGGTGGCCGCCGCGCTTTTTGAGGACTCTGCAGCGGCAGCACTTTTTGATGCTTCAGTGGCCTTTGTTGATGCCGTTCCTGCGCTGGAAGACGCTGACTGAGCCGACGACGCGGCCTGTCCGGCTGACGTGCTGGCTGCGCGTGCTGAGTCCGCAGCATCAGCCGCATGGGTTGCCGCCTCACGGGCTGATGTGCTGGCATCACTGGCTGACTTCTTCGCGGCTGCCGTGTTCTGTGCCACCGCGGACGCGTTACGCGCCACCTCTTCCACCATCAGTTCAAAACGGCGCAGTGCCTCCGGACGGGCATCATCCTCCGTCATGGCACCGAGAAAATCATTCAGCGTACCGGGTTGAGAATCTTCATACACGGTAATGGTCCCGGCATGTGACGGCGGGAATCCTTCCACCAACAGAATAACGCTGTACTGACCGTACTCAACGTCCATGCTGTAACGCCCGGCTTCATCCGGATTTTCTGAGGCCAGCGTGTTCACCACCACCGTGGTGCTGTTACGTTTTGCTTTCAGCTGGATTGTGCAGTTCTGTACCGGTTTTCCTGTGCCGTCTTTCAGTACACCTGAAATCTTTACTGCCATATTCACCCCACAAAAAAGCCCGCCTGAACCGGCGGGCTGTCATAACACTGTGTTACCTGGCTAATCAGAACTTATAACCG
ACACCCACGATGAAACCGTCAGTGCGCCAGTCGCCACTGCCGGGGCCTTCATAAGCAATATCAATGGCCACGGATTCGGTCGGGTTAAACTGCACGCCAGCTCCCCACGCCAGAGACGTGTTGCTGTGGCGACCGTCATCACTTCCGGTCAGCACGTCGTGCGTTTTCCCCTTGGGAAGGTGCGAACAAGTTCCTGATATGAGATCATCATATTCATCCGGAGCGCATCCCAGAGGGACATCATGAGCCATCAACTCACCTTCGCCGATAGTGAATTCAGCACTAAGCGCCGTCAGACCCGAAAAGAGATTTTCCTCTCCCGCATGGAGCAGATTCTGCCATGGCAGAATATGACCGCTGTCATCGAGCCGTTTTATCCCAAGGCGGGCAATGGCCGACGGCCCTATCCGCTGGAGACCATGCTGCGTATTCACTGCATGCAGCATTGGTACAACCTGAGCGACGGTGCCATGGAAGATGCCCTGTACGAAATCGCCTCCATGCGCCTGTTTGCCCGATTATCCCTGGATAGCGCCCTGCCGGACCGCACCACCATCATGAATTTCCGCCACCTGCTGGAGCAGCATCAACTGGCCCGCCAATTGTTCAAGACCATCAATCGCTGGCTGGCCGAAGCAGGCGTCATGATGACCCAAGGCACTTTGGTGGATGCCACCATCATTGAGGCACCCAGCTCTACCAAGAACAAAGAGCAGCAACGCGATCCGGAGATGCATCAGACCAAGAAAGGCAATCAGTGGCACTTTGGCATGAAGGCCCACATTGGTGTCGATGCCAAGAGTGGCCTGACCCACAGCCTAGTCACCACCGCGGCCAACGAGCATGACCTCAATCAGCTGGGTAATCTGCTTCATGGAGAGGAGCAATTTGTCTCAGCCGATGCCGGCTACCAAGGAGCGCCACAGCGCGAGGAGCTGGCCGAGGTGGATGTGGACTGGCTGATCGCCGAGCGTCCCGGCAGGGTAAAAACCTTGAAGCAGCATCCGCGCAAGAACAAAACGGCCATCAACATCGAATACATGAAAGCCAGCATCCGTGCCAGGGTGGAGCACCCGTTTCGCATCATCAAGCGGCAGTTCGGCTTCGTGAAAGCCAGATACAAGGGGCTGCTGAAAAACGATAACCAACTGGCGATGTTATTCACCCTGGCCAACCTGTTTCGGGTGGACCAAATGATACGTCAGTGGGAGAGATCTCAGTAAAAACCGGAAATAACGCCAGAAATGGTGGAAAAAATAGCCTAAATAGGCTGATTCGATGTGTTTGCGGGAAAAAAATCGGCCCAGATCCGCGAAATTTTAATCAGCGAGTCAGCTTGGGAAGAAATGACCTGCTTATTCGCACCTTCCCTTGTTGTCAGTTACGCGGAGATAATCCCCGGAGAAAGTCGACACACGGCTGTAAGCCATACCCGCCATCGCATACGCGCTGAACCATTCATTCACGCGCACAGACGGCCCCACCATCACGCTGAACCAGCGGTTACGCACGGAATCTTCATGCCAGCGGGTATCGCTGTAACGGGTAAGCTGGCGATTCTTGTCTCCTGCATAGCTGAATGGCCGCCCTGCTGCATTAACACCATAAATGGCGACTGACCAGTGGACAACCCAACAACAATATCCGTCATTTGTGCAGGCAACATGCGCATAGCAAAAGCCGTTTGTTTTGCCGACATTCCGGTTTTGCTTAATTGTGATTGAGTAACCTCAAGCTCACTCCGCATAGCACGAAGTTTTCCAGAAAGCTCCTCATACATTTCAGGAGAAAGCATCCCCTTAGCTTTTGCTTCATTGAGCTGTTTCTGTTGTTCTACCAGACGATTAAAAGCAGTTCCGACAGGATCAAGTTGAGCAATCAGACGTTGCAAAGCAACAACCTGTTCATCATGCGCTTTTGCTGCTTCTCGCTCTGCCTGAGCCTCTCCGGTAAGCTCTCGCCGTGTTTCCTGTATTTTTCGGCTATAATTCTCAAACTGA
GAACCATTTATTTTCCCGGATGCAAACGCAGCATTAAGTTCATCATGCTGTTGTTCAAGATTTCTTAGCGCCGCAGCCAGAGGGTCGATCTTGTCCAGCATTCTTTGAAAGGCCTGAGCCTGCGCTTCCTGCTGAGCGGCAGCAAGTTTTCCGGCCTTCTCGGCTTCTCTCTGCGCTTGCGCAACCCCGCTCAATTCCTCTGTGGTTTCATTAAGTTTACGGACAAGAAATTCATATTCTTCTTTATCAATAAGCCCTTTATCGAAAAATTTCTTTAATTCAGAATAGCGTCGACCGACAGTATCAATTGCGGCACCAACTGGATCAATAGCTGCTTTTAATTTTGCGAGCGCGTTCTTCTCATCTTCTGTTGCCTTAGTCACTTTCCCTGCGCTATTTGCAGCAGTTTCCCCAGCCTGCGTCATTTTGACTAATGAGGAGGTCAGATTGTCAGCATTATTTTTCGCTCCAGTGCTATCAATAATTATTGCGAGACGCGAGGTTTGCTCTGCCATTTATTAAAACTCCTGACAACAAAAAACCCACCGCGAAGTGGGTTTCAGGCGACATAATAGTAGATATAGCGATTACGAGGCCACGCAATGCTTTTCTCCAGGAGCATCATCGATTTAATTAAAGACACCATCACATCTCTGTAACAGAGTGTACGTAATTAACAACTACACACACTGCTCCTGAAAATACTGGTCATCCAGTGCAAAGATCACTGCTTCAAATTCATCGCGCTCAATCAATACCGGATGAGTGGCTAAATATTCATTTATCTCTGTCAGAGATAAAGGCAAAGGCACCCCAGCCATTCCAGCATAACGTCGGGCACGGGATATTACCGAATAGGCGTACAACAACTCCTTAAGCACCGGGTCTATTTCTGGTTCCGGTATCGGTGGCAACCTGAGTTTTTCTCGCTTCCATCTTGCCTTTTCCCCCCTTTCTCCCCCGAACTCCGATAACCACCGCTGGGCAGCTATGGCTTTTTTATCGTATCCTGCTTCTGCTGCTCTTTACCCTGGGCGATGCTGGCTGCTTCTGCAAGGATCTGCCAGTACAACTCTGGATTCTGCTTAAGCAGCGCGATCCCTCGTTCTGCCGTATATTCCAGTGCAACCTCAACACCATTAACCAGTTCACCAACCCCTTTCCAGTCTTTCAGCAGATAACGAGCGGCATTATCAATGAGTAAATCATCAACAGAATCCACCTCGGAAACCTTTGAAATATCAAACTCCTTCGTTCCGACGTGCAAACTGGCATCCATTTTCTCAATGTGGCGACGGATTAATGCATTACGGGAGCGATACTGATCGTTATCGCTGCTTCTGCTCATCCGTCAGGCCAGACAGAACAATGACCGGAACAGAATCCATTTTGAGCATTTCAGCCGCCATAACACGACCGTGACCCGCAATAATTTCGCCCTTTTCGTCAATCAGCACCGGATTAGTCCAGCCGAATTGCTTAATACTTTCTACCAGTTGTGCCACCTGCTCAGTACTGTGCGTCCTGGCGTTGTGCGCATACGGTGACAATTCTTGTAATGGGCGATAGACTATCTTTAATTTCTCGCTCATACAGCCTTGCTTTATGAATAAAACGCACCCCAGCAGCCAGTGCTACTGGGGACGGAGGTGTTGCTGGTAAAGTTAGGTATTGGATCAATGAGTGAGTCAACATAATATTAAACTCACAATTATAAATCAGCCATATATTAGGAGCGCCAAAAAAAACCTGAAAACAATATAATAACAGGATAAATTTCAAGGCGACCAAGAATCATAGCTATGCACATTAAGCATTTTGCAATGTCATTAAGCACTCCGAATGACGATGCAGTAGCCCCAAAACCTAATCCCATATTATTAATACATGCAGCCACTGTTGCAAATGATGTAAGAAAATCATATCCCATACCATTTAACACCAGTATAAAAAACACCGTGAAGAGAGTATAAAGAAAAAAGAAACTCCATACAGACCTC
ATTACACGATCTGTAACTATCTTCCCTCCTACATTTACACTCAACAACGCTCTGGGATGAGAAAGCTGATTTATCTCGTGTTTGCTTTGTTTGAAAAGTATAAGAAATCGAAGTGACTTAATTCCACCACAAGTTGAACCTATACATCCCCCAAAGAAACTTGACAACAGCAAAAACACTATCGTGTGCGTGGGCCAACTTGCATAATCCTGCGTAGCTAAACCATTATCAGTGAGCATGGAGCTGGCAAGAAAAAACGAATGAATAAAACTTCCATGCAAGTCATACATACCTATATGCCAGACCTGGAAAGAGGTAACAATGATCACCCCTAAGGCTATTAACAGAAAGAAACGAAGTTCAATATCTCTGATTAAAGGTTTTATCGTTTTCCTGCTAATAACAATATACCAAAGAGTGAAGTTGAAAGCCGATAGCAGGGAAAAAGAACCAGCCACCAGCTCAACCAAATAGTTATTAAAATATCCGATACTCTCGCTATGAGTTGAGAAACCACCAAGCGAAACTGTGGAAATCCCGTGACAAATAGCATCAAACAAAGGCATTCCTGCAAGTCTATAACAGACAATACAAGCAATACCTAATAAAGAATAAGTTATCCACAGTGTCCGTGACGTATCGGCCAGGCGGGGAGTGAGTTTGTCATCCTTAAATGGCCCCGGCATTTCTGACTGATAAAGCTTTGCACCACCAATACCCAATAATGGCAATACAGCAACCGCCAGAACAATAACTCCTAAACCACCTATAAAATTTAACTGTGACCGATAGTACAAATATGCCCGAGGTAATGAACTAACATCATCAATTACAGTTGCTCCTGTTGTTGTTATTCCAGAAACCCCTTCAAACAGAGCATCAATGAACGTTAAATTAAGTTCTGAGTCAATCCATAAAGGGAATGCACTAATAACAGAAAACAAAATCCAAAACATTACAATTATAATAAACCCATCACGGGTACGTAATTGAATGCCAGATTTCTTAGTTGTATACCACGCTCCGCCACCAATGCAAAAAAATATAACGAAAGTTATAAAGAAAACAAACAGGCTTTTTTCTTTATAAAACAATGCTACAACCATTGGTGGCAACATTGAAAGACTATAGAGCCAAACCAGGAACCCACACATATGAGTAACAACTCTTACATGAGATGTATTCATATCTAAATATTCTTTCAATTATAACCACCTTGCTGCAATATTATGATTATACTGTATAAAATTTAACTCCTCTTAGATCTTACTTCACTGTTCCTTATGAAACAATCATCAAAATGAATCATATTGTAGTTAAGATTTTACTTTAAACACTGCTCGGTTATGTATTGCTGAGCACCTTCAAGTTGGGCCTGCATCATTACCAGTCGTTCCCGGAGGGTGAAATAATCCCGTTCAGCGGTGTCTGCCAGTCGGGGGGAGGCTGCATTATCCACGCCGGAGGCGGTGGTGGCTTCACGCACTGACTGACAGACTGCTTTGATGTGCAACCGACGACGACCAGCGGCAACATCATCACGCAGAGCATCATTTTCAGCTTTCGCATCAGCTAACTCCTTCGTGTATTTTGCATCGAGCGCAGCAACATCACGCTGACGCATCTGCATCTCAGTAATTGCCGCGTTCGCCAGCTTCAGTTCTCTGGCATTTTTGTCGCGCTGTTCTTTGTAGGCGATGGCGTTATCACGGTAATGATTAACACCCCATGACAGGCAGACGACGATGCAGATAACCAGAGCGGAGATACCGATGCTGCTTTCGGCTCTGCTGGTAAATTATCGCCCGGTATGCAGTAACGAAATTTACCGCCCTGATTTACGCGAATCAGACGACCTTTGCTGATTGCCATTGCCAGCGTTGAAGCCACTTTGCGTGATGTGGTACCAAACAATGTAGCCAGCTCATCAGCCGTTTGTGGTCCTCGTTGTTCAATCGTCGCGGTTAAATCGCACTCTGAGATTTTCGCTACTGTTGCTG
TGGTGATTTCTTCCGGCAGTTCTGCCTGCGCTGGCTGTTCCTGCTGAACATTGTTATCAGCCACACGCCAGGTGTACGCGCTTTTATCAACAAAACCAGCCTTTTTCAGTTCCCATAGTTCGTTCAGCACTTCTTCACGACTGATATCAAGTCGCGCAGCAAGTTCTATGGATGTGGCTTTTCCCATTGCTTTCAGTGCGTCAAAAACAGTCTCCATTAAATTTTTCTCCCGGTAAAAATTACTTCGCAATTCCTGGCTGGACGACATTCGGACGCCAGCTCTCCCAGTTAAAATTCACCCATCGCCCGCCGTTCATGGTCATGCGATCCATAATCCGCTCGCCGAGCAATGTTTTCATGGCCTCATAGTTCAGGTTTGTCAGCATCCCCACGCTGCGCATCGACGCTGTCCGGCGATCAACAATCTGGTGCAGTACCACCTGCTCGTTTTTCGTCTCGCGCTGAATGCCAATTTCATCAAGAACCAGCAGATCCACTTCGCACAGTTCCCGCAAAAATTTTTCGCCTGACTGCCCATCGTCATAGCTGGCGTGCAGGGCACTCATAACATCAGCCACGGTAACCACAATCACTGTCTGACCGTCTTTCAGCAGGCGATTCCCGATAGCTGCCGCTAAGTGGTTCTTCCCGGTACCAGGTTTTCCGCTGAACGCAAAATTTGTACACCCGGTCATCAGTTCATCAGCGATGGATTTCGCCTGACTCAACGCGTATCGCTGCCCTTCGTTCTGCACCTGGTAATTCGAAAACGAGCATTTGCGGTGCAATGGCTGGATGCCAGAGCGATTCAGAATTTTTTCCACCCGCAACTGACGATTCTGACGGTTGATCTCCTCACAACGTTTCTGGCCTTCGGAAAGTTGCCACTCGCGCCACTCCGCTACCGTCTTGAATGGCGCGGTTACATGTGACGGGGCCAGTCTGCGGATACGTTCAAGAACATCGCCTGTCGCAATATTTTTCATGGTCAGTTACCCCCTGAAGCCTGGCGGGATCGCACTATCCGGTAACGAGACGGTGTTAACCTGTCGGAGTAACGTCTCAGGTCGAACACCTTTCGGCGCGAACAAGCCCTGGTATTCATTGGCGATGCTGTGTCGAATCACCTGCTCAGGTGAAAAACCCTGCTGGCGGAATTTTTCCAGCTCCCGTATCGCCCCGTTAGCGCCCTGCTCCGTTCGAATCGGTTTTCGCAATGCCTGGCGAAATTCAACCCACTCACGCCAAAGCGAGACAGAAATCCAGTTCGGCAAAGTAATATCCAGAGGGTCAAACTTTTTGACACCTCGATTCCCCCGGGGGGGATTTAGGGGGGGATCTGTTTTTAGATCTTTATCTGTATCTTTATTAGTTGCCTTTGTGTTGACATCATGTTCAAACACCACTTCAACATCTGTTTGAACACCTGTTAAATTTCTCTCTTGTTTTGTTTGAACATCTGCTTCCTTTCTGCTTCTTCTGGCCTGAACAGATGCTTTTCCTGCGGCTGATTTTTTGGTTAATTTTTCCCTGACTGATGCCAGATCTTCCTCAATCCGAAGATGCACCCATTCCTCGCCGTTATCGCAAAAAAACTCCTGCAAGGATGGTTCAACATCAGCCCATCGCTCGTTAGTCAGACGGGCAATTTTTGCCAGCCTGTTTTTAGGTATTGGCTTTCCTGTTTGCCAGTAATTGAACATCAGCAACAAATACGCACCATGCTCCTCTGCTGACAAATGCATGGTGTCAGCCAGGTAATCAGCTATGTACAGTTGCATGTATGGTAATGCGGCCATAATTGCCCCGTATGATGCTGCCCGGTGGCTTAGAATAAGCACAAACAGCATGGAAACTTTTGCTTAATGAACAATGACAGAATCGTCGGAAGAACCGCCGCCGCTGAAATGCGCTTTCCGGTAAACGGCTTGGACTGCATCATCATGCGCATCAATTGCCGTACTTAACGCTTCCTGCGCCGCCAGTAATGCACGGCGTTC
CAGGGTATCGAAGATGCAGAGTCGGTGACGCAGCTCGCGCGGAAGGATTGCCAGAATTGCTGGGATCAGCTTCTGAATTTTTTCTCTTTGCGTTTTCGTTTCACCTTTCAACCAACGGTGATAGATATTCTGCTGATTGTTCCAGTCCTTGCCTGGAACCAGGGGCAATTCGCCGCCCCCCTGGCGCAGATATTCTTCAGTAATTGCATTGGCTACCCATGCCTGCCCTTTTTCGGCTGCTAGGGCAAACAACACTGATTCGATGTGCTCATGCTTGATTTTCATGAATCATTTGCCTCTTGATGTTTCAGGTATGATCAAATGAGGATTTGTTACTGTCATTTAGTTGCTTCACTGACATATTCTGCGAACAACATGCCGAACGTCGTAAATATGACCAGTCAATATCAGGACGAAGTTCTTCGCACAGAACCTCACCTCTTGTTGCACGTTCAATTGCTGGACATCTCTCGGCAGGCAATTGACGTACCCCTTTGATCCATTGATTTACGCTTGGAGGTGATACACCTAAAAGCCTAGCCATTGCTGATTGCCCACCGACAACAGCACAAGCTTGCTTGAATGAATAGTTCTCTTTTTTCATCGAATGAACTCCAAAAACACACAGAAATATTAGGCGACGCCTAACGCAATTGTCAATAGGCTGTGCCTAATGCAGTAAGGGTAGGGATTGCCTAATGTAATGCGCATAGGAGAATATTAAGCAATGCTTAGTGGTAAAGACTTAGGCCGAGCGATAGAGCAGGCCATTAACAAAAAAATCGCATCGGGATCCGTCAAATCAAAGGCGGAGGTCGCACGCCACTTTAAAGTCCAACCACCATCAATTTATGACTGGATTAAGAAAGGCTCTATAAGTAAAGATAAACTTCCAGAATTATGGCGTTTCTTTTCTGATGTTGTTGGTCCAGAGCATTGGGGGCTTAACGAATACCCCATACCAACCCCCACCAATTCAGATACAAAAAGTGAACTTTTAGATATAAACAACCTTTATCAAGCAGCCTCTGATGAAATAAGAGCGATTGTAGCTTTCCTGTTATCTGGAAATGCTACAGAACCAGATTGGGTTGACCACGATGTTCGCGCCTACATAGCAGCGATGGAAATGAAAGTGGGTAAGTATCTGAAAGCTCTTGAATCTGAACGGAAAAGCCAGAACATCACAAAAACTGGAACTTAAACTTATATGGTCTGACGGAAAACTCCTGGATTCCGTTATTTAACCCCCCCATCACTTTCTGCTGTCGCCATCACCTATTAGGTTACGCTCAAAACATTAGGCATAGCCTATTGACAATCAATTAGGCATTACCTATAGTTCCAGCATACCACCCACCCCGCCCCACAGAACGCCGGGCAATACTTCGAGTTACCAGGCAGTGGTAAGGGGTTAAGTAGCCAGCCCGAGGCGTATGAACATGACGGCGGGATTCAAATTTTGCAGTGCAGCAGTTAGTTCCGCCACCCGGCGTTAAGGGGAGAGATAAGATGGTGCATTACGAAGTAGTTCAGTATTTGATGGATTGTTGCGGTATCACTTACAACCAGGCTGTGCAGGCTTTACGCAGCAACGACTGGGATCTCTGGCAGGCAGAAGTCGCTATACGTAGCAACAAGATGTGAGATTCGCAAAATGCAAAAAATCGACCTCGGCAACAACGAATCCCTGGTGTGCGGCGTGTTCCCCAACCAGGATGGAACGTTCACTGCCATGACGTATACCAAAAGCAAAACATTTAAAACCGAAACTGGTGCGCGCCGATGGTTGGAGAAGCACACAGTAAGCTAACGATTAAAACGTCTACTCCTGCTGTTCCAGAATAACTTCATAAAATGGGAGTATTTTTCGGTGACGAGATAATAAGAACAGTTTGCGCTATCACTCTGATGTTGAATGATGCCCTTCCGTTCTAATTTTTTCATAACCGGGTTACGGCAAGGAGAAGTGATAATAAGATTTCCTGTTTTAAGGAAAT
CTTTAAATACAGCGATTTCTTTCTCAGATAAACGAAGCAATACTCGTTGCTCTGGTAGTAATGAATAATGCTTTTGAATATGTGCTCGCAATCTTGAGAAGGAAATGGCGACCACGAAAGAAAAGGCAAAAACGATAATCTGAAAGAGCCAAGGTATTTCAGTATAAGCATTGAATGCGACAGTAAACTCTTTCGGTATCAGCCAGAGAGTGAGACCAAAAATGATAATCGTATACATAAGTCTTTCGAGTGGCTCGTTAGCAAAAAGTTTCAACAATGGAGTAAATACATCCAACATATCAATAACTCTCAACTGTAAGGGTATTGAAATGTTAACACAAGCTCTCGCTGTAGGGGTATAGCCGAGACCACCGAAGCCCGGAGGTGGTGAAATAAAACCGGGCACAACACGAAGGCGCATTTCCGATATCCATAAAGAGTCGGTCTTGTCTGTTAAATTTAAATGGTGGGAGTGCGCCTCCGGTTGTAAATAACGACATTGCTGTGTGTAGTCCTGGCGGCATCAGTTTTTTTCTTGAAGTTCGGCTGATGTCCGCCCTTTTTAAAGTGAATTTTGTGATGCGGTGAATGCGGCTAAGCGCACGTGGCACAGTTAAAAGTCATGTTAGTCCTTATTGGTTTGGGTGGGAAAGCCGACTGTAATTGTTAACTGGTTGCAGTCACCTGGAGGCACCAGACACCGCATCAACAAAGTTCATTTGTAAAAATGGAGATAATTATGATTGCACATCACTTCGGAACTGATGAAATACCACGTCAGTGTGTGACTCCTGGCGATTATGTTCTTCATGAAGGCCGGACATATATTGCCTCGGCAAACAATATTAAAAAGCGAAAACTATATATTCGTAACCTGACCACAAAAACATTCATTACTGACCGCATGATTAAAGTCTTCCTCGGTCGTGATGGTTTACCTGTAAAGGCGGAGTCATGGTGATGACTAAGAAAATAAAATGTGCTTACCACCTTTGCAAAAAAGACGTTGAAGAAAGCAAAGCTATTGAAAGAATGCTTCACTTCATGCACGGGATTTTATCAAAAGACGAACCGAGAAAATATTGCAGTGAAGCTTGTGCCGAAAAAGACCAGATGGCACATGAACTTTAATTAATTGACTATTCGAAACTGAATTTATGCCAGAAATGGCAGGTATTCGCTCAACCTTAATTAAGGAGAAAAACATGATTACCAATTATGAAGCCACTGTTGTAACTACCGATGACATTGTTCACGAGGTGAATCTGGAAGGAAAGCGCATTGGCTACGTAATTAAAACAGAAAATAAAGAAACCCCATTCACTGTGGTTGATATCGATGGTCCATCAGGCAACGTAAAAACACTTGATGAAGGTGTCAAAAAAATGTGCCTGGTGCATATCGGAAAGAATCTGCCCGCAGAAAAAAAAGCCGAATTTCTGGCAACTCTAATTGCAATGAAATTAAAAGGTGAAATCTGAAAGAAATAGCCTGCGTATGGCGCAGGCTATGAACAGTGTGTATCCGGCAAGATCATTCACTGAACAAAACGAATTTTAATCTGAGTTGAGGTTAAAAAACAATGAGCACAAAACCACTCTTCCTGTTACGGAAAGCGAAAAAATCATCCGGTGAACCTGACGTCGTCCTGTGGGCAAGCAACGATTTTGAATCGACCTGTGCCACTCTGGACTACCTGATCGTTAAGTCAGGTAAAAAACTGAGCAGCTATTTTAAAGCTGTTGCCACGAATTTTCCTGTCGTTAATGACCTGCCCGCTGAAGGTGAGATCGATTTTACCTGGAGTGAACGCTATCAACTCAGCAAAGACTCCATGACATGGGAACTAAAACCGGGAGCAGCACCAGACAACGCTCACTATCAAGGCAATACCAACGTCAACGGCGAAGACATGACTGAGATTGAGGAGAATATGCTACTCCCAATTTCTGGCCAGGAACTGCCCATTCGTTGGCTTGCTCAACACGGCAGCGAAAAAC
CGGTAACGCACGTTTCACGCGACGGACTCCAGGCATTACACATTGCTCGGGCTGAAGAACTACCGGCTGTTACTGCCCTGGCTGTTTCCCACAAAACCAGCCTGCTCGACCCGCTGGAAATTCGCGAACTCCACAAACTGGTTCGTGACACTGACAAAGTTTTCCCTAATCCTGGTAATTCAAACCTGGGACTGATAACTGCTTTTTTCGAAGCATACCTGAACGCTGACTACACCGATCGAGGACTGCTGACAAAAGAGTGGATGAAGGGTAATCGTGTTTCACACATCACTCGCACGGCTTCCGGTGCTAATGCTGGCGGCGGAAACCTCACCGATCGCGGCGAAGGTTTCGTACACGATCTGACGTCACTGGCGCGCGACGTAGCCACTGGCGTACTGGCCCGTTCAATGGATCTGGACATCTATAACCTTCATCCGGCACACGCTAAACGCATTGAGGAAATTATCGCTGAAAATAAACCGCCCTTTTCTGTTTTCCGCGACAAATTCATCACCATGCCTGGCGGGCTGGATTATTCCCGCGCCATCGTGGTTGCGTCCGTAAAAGAAGCACCAATTGGGATCGAGGTCATCCCCGCGCACGTCACTGAATATCTGAACAAAGTACTGACTGAAACCGATCATGCCAACCCTGATCCGGAAATCGTGGATATTGCCTGCGGTCGCTCCTCTGCCCCGATGCCGCAGCGAGTAACAGAAGAAGGAAAACAGGATGATGAAGAAAAACCGCAACCATCTGGAACAACGGCAGTTGAACAGGGAGAGGCTGAAACAATGGAACCGGACGCAACTGAACATCATCAGGACACGCAGCCGCTGGATGCTCAGTCACAGGTAAATTCTGTTGATGCGAAATATCAGGAACTGCGGGCAGAACTCCATGAAGCCCGGAAAAACATTCCATCAAAAAATCCTGTCGATGACGATAAATTGCTTGCTGCATCACGTGGTGAATTTGTTGACGGAATTAGCGACCCGAACGATCCGAAATGGGTAAAGGGGATCCAGACTCGCGATTGTGTGTACCAGAACCAGCCAGAAACGGAAAAAACCAGCCCAGATATGAATCAACCTGAGCCAGTAGTGCAACAGGAACCGGAAATAGCCTGCAATGCCTGCGGCCAGACTGGCGGGGATAACTGCCCTGACTGTGGTGCGGTGATGGGCGACGCAACATACCAGGAAACATTCGATGAAGAGAGTCAGGTTGAAGCTAAGGAAAATGATCCGGAGGAAATGGAAGGCGCTGAACATCCGCACAATGAGAATGCTGGCAGCGATCCGCATCGCGATTGCAGTGATGAAACTGGCGAAGTCGCAGATCCCGTAATCGTAGAAGACATAGAGCCAGGTATTTATTACGGAATTTCGAATGAGAATTACCACGCGGGTCCCGGTATCAGTAAGTCTCAGCTCGATGACATTGCTGATACTCCGGCACTATATTTGTGGCGTAAAAATGCCCCCGTGGACACCACAAAGACAAAAACGCTCGATTTAGGAACTGCTTTCCACTGCCGGGTACTTGAACCGGAAGAATTCAGTAACCGCTTTATCGTAGCACCTGAATTTAACCGCCGTACAAACGCCGGAAAAGAAGAAGAGAAAGCGTTTCTGATGGAATGCGCAAGCACAGGAAAAACGGTTATCACTGCGGAAGAAGGCCGGAAAATTGAACTCATGTATCAAAGCGTTATGGCTTTGCCGCTGGGGCAATGGCTTGTTGAAAGCGCCGGACACGCTGAATCATCAATTTACTGGGAAGATCCTGAAACAGGAATTTTGTGTCGGTGCCGTCCGGACAAAATTATCCCTGAATTTCACTGGATCATGGACGTGAAAACTACGGCGGATATTCAACGATTCAAAACCGCTTATTACGACTACCGCTATCACGTTCAGGATGCATTCTACAGTGACGGTTATGAAGCACAGTTTGGAGTGCAGCCAACTTTCGTTTTTCTGGTTGCCAGCACA
ACTATTGAATGCGGACGTTATCCGGTTGAAATTTTCATGATGGGCGAAGAAGCAAAACTGGCAGGTCAACAGGAATATCACCGCAATCTGCGAACCCTGTCTGACTGCCTGAATACCGATGAATGGCCAGCTATTAAGACATTATCACTGCCCCGCTGGGCTAAGGAATATGCAAATGACTAAGCAACCACCAATCGCAAAAGCCGATCTGCAAAAAACTCAGGGAAACCGTGCACCAGCAGCAGTTAAAAATAGCGACGTGATTAGTTTTATTAACCAGCCATCAATGAAAGAGCAACTGGCAGCAGCTCTTCCACGCCATATGACGGCTGAACGTATGATCCGTATCGCCACCACAGAAATTCGTAAAGTTCCGGCGTTAGGAAACTGTGACACTATGAGTTTTGTCAGTGCGATCGTACAGTGTTCACAGCTCGGACTTGAGCCAGGTAGCGCCCTCGGTCATGCATATTTACTGCCTTTTGGTAATAAAAACGAAAAGAGCGGTAAAAAGAACGTTCAGCTAATCATTGGCTATCGCGGCATGATTGATCTGGCTCGCCGTTCTGGTCAAATCGCCAGCCTGTCAGCCCGTGTTGTCCGTGAAGGTGACGAGTTTAGCTTCGAATTTGGCCTTGATGAAAAGTTAATACACCGCCCGGGAGAAAACGAAGATGCCCCGGTTACCCACGTCTATGCTGTCGCAAGACTGAAAGACGGAGGTACTCAGTTTGAAGTTATGACGCGCAAACAGATTGAGCTGGTGCGCAGCCTGAGTAAAGCTGGTAATAACGGGCCGTGGGTAACTCACTGGGAAGAAATGGCAAAGAAAACGGCTATTCGTCGCCTGTTCAAATATTTGCCCGTATCAATTGAGATCCAGCGTGCAGTATCAATGGATGAAAAGGAACCACTGACAATCGATCCTGCAGATTCCTCTGTATTAACCGGGGAATACAGTGTAATCGATAATTCAGAGGAATAATTCAGCCTGGCGGTGTAATGCACCGCCAACTTGAAATATTTTTTATGAGAAAAATTATGAGATATGACAATGTTAAACCATGTCCATTTTGTGGTTGTCCATCAGTAACGGTGAAAGCCATTTCAGGATATTACCGAGCGAAGTGTAACGGATGCGAATCCCGAACCGGTTATGGTGGAAGTGAAAAAGAAGCACTCGAAAGATGGAATAAACGAACCACTGGAAATAATAATGGAGGTGTTCATGTATAAAATTACCGCCACTATTGAAAAGGAAGGTGGCACTCCTACTAACTGGACAAGATATTCAAAATCTAAACTAACGAAATCAGAATGCGAAAAAATGCTCTCAGGTAAAAAAGAAGCAGGCGTTTCCAGAGAGCAGAAAGTAAAACTGATAAATTTTAATTGCGAGAAACTTCAGTCCTCGAGAATTGCATTGTATTCAAATTAAAACTTCATAGCTGATTATTAATAATCAACATCGGGCGTCAATTTCAGTCTAACATTGGCGCCTGCCAGAGGTGATGCGATGGCACAAGTAATCTTTAATGAAGAGTGGATGGTTGAATACGGCCTGATGCTTCGCACTGGTCTGGGGGCCAGACAAATTGAAGCATACCGCCAGAACTGTTGGGTGGAGGGCTTCCACTTCAAACGAGTATCTCCTTTAGGTAAGCCAGACAGCAAACGAGGGATTATCTGGTACAACTATCCAAAGATAAATCAGTTTATCAAAGACTCATGATATGTCTAAATTACCAACAGGTGTCGAGATTAGAGGTAGATACATTCGCATCTGGTTCATGTTTCGAGGAAAACGATGTCGGGAAACATTAAAAGGCTGGGAGATTACAAACAGTAATATTAAAAAGGCCGGAAATTTAAGAGCGCTGATAGTTCATGAAATAAACTCCGGTGAATTTGAGTATTTAAGACGTTTTCCCCAGTCCAGCACTGGGGCAAAAATGGTGACAACGAGAGTCATAAAAACGTTCGGAGAGCTTTGTGATATC
TGGACAAAAATTAAAGAGACAGAGTTAACAACAAACACAATGAAGAAAACGAAATCACAATTAAAAACACTTAGAATAATAATTTGTGAAAGTACCCCGATATCACATATTCGTTATAGCGATATCTTAAACTACCGGAATGAACTGCTGCATGGAGAAACGCTTTACCTGGATAATCCAAGATCCAACAAAAAAGGAAGAACCGTGCGCACAGTTGATAACTATATCGCCCTGCTCTGTTCGCTGTTGCGTTTTGCGTATCAGTCGGGATTTATATCAACCAAACCATTTGAAGGAGTAAAAAAATTACAGCGAAACAGAATAAAGCCTGATCCGTTATCTAAAACAGAATTCAATGCATTAATGGAAAGTGAAAAAGGACAGAGCCAGAACTTGTGGAAATTTGCCGTTTACTCAGGACTTCGTCACGGGGAACTGGCAGCTCTGGCGTGGGAGGATGTGGATCTCGAAAAGGGAATAGTGAATGTCAGAAGAAACCTGACGATACTTGATATGTTCGGTCCCCCAAAAACAAATGCCGGGATCCGAACAGTAACACTACTGCAGCCTGCTCTTGAAGCACTGAAGGAGCAATACAAACTGACCGGGCATCATCGCAAAAGCGAAATCACCTTTTATCATCGGGAGTACGGCAGAACCGAAAAGCAAAAACTGCATTTTGTTTTCATGCCCAGGGTGTGTAACGGAAAACAAAAACCTTATTACTCGGTAAGCAGTTTGGGGGCAAGGTGGAATGCAGCAGTAAAACGTGCTGGTATTCGCCGCCGTAATCCGTACCATACGCGGCATACTTTTGCCTGCTGGCTGTTGACGGCAGGAGCGAACCCGGCATTTATAGCCAGCCAAATGGGGCATGAAACTGCGCAGATGGTGTATGAAATTTACGGTATGTGGATTGATGACATGAACGACGAACAGATAGCCATGTTGAATGCGCGGTTATCGTAGTTGCAAAGTTTGCCCCCAATTTGCCCCATTTAGTACCAGAGAACTGAAATAATGCAAGAAAATCAACAAATTACAAAGAAAGAACAATACAACCTGAACAAATTACAAAAACGTCTGCGTCGTAACGTGGGCGAAGCCATTGCTGACTTCAATATGATTGAAGAAGGCGATCGCATCATGGTTTGCCTCTCCGGGGGTAAAGACAGCTATACCATGCTGGAGATTCTGCGCAATTTGCAGCAAAGCGCGCCAATCAATTTTTCGCTGGTGGCTGTTAACCTCGATCAAAAGCAACCGGGCTTCCCGGAACACGTTCTGCCCGAGTATCTTGAAAAGCTGGGCGTTGAGTACAAGATTGTTGAAGAGAATACTTACGGTATCGTGAAAGAGAAGATTCCAGAGGGCAAAACCACTTGCTCACTGTGTTCTCGCCTTCGTCGCGGTATCCTTTATCGTACCGCAACGGAACTGGGGGCGACGAAGATCGCGTTGGGTCACCATCGTGACGATATCCTGCAAACGTTGTTCTTAAATATGTTCTACGGCGGTAAGATGAAAGGTATGCCTCCGAAACTGATGAGCGATGATGGCAAACATATCGTTATTCGTCCGCTGGCCTACTGCCGCGAGAAAGATATTCAGCGATTTGCCGATGCAAAAGCGTTCCCGATTATTCCGTGCAACCTGTGCGGTTCACAGCCTAACCTGCAACGTCAGGTGATTGCTGACATGTTGCGTGACTGGGATAAACGTTATCCAGGGCGTATCGAGACGATGTTCAGCGCGATGCAGAATGTGGTGCCGTCGCATCTGTGCGATACCAACCTGTTCGATTTCAAAGGCATCACCCACGGTTCTGAAGTGGTTAACGGGGGTGATCTGGCGTTTGATCGCGAAGAGATCCCACTACAACCGGCGTGCTGGCAGCCAGAAGAAGATGAAAATCAGTTGGATGAGTTACGGCTGAATGTGGTTGAAGTGAAATAACCAGGATAGCGCCCGATGCGCAAGCGTATCGGGCTACTCTTATG
GAGGCCGGATAAGACGCGGCCAGCGTCGCATCCGGCAATCCCGAATAAGATGTTTACTCTTGCACCCGGCAATTCAACATTTCATTATTTTAATAACCGCACCCGGCACGTTTTTCCTTTAATCTTCCCGCCCTGTAACTGTTTCCATGCTTTATGAGCAACAGCCTGACGGACCGCGACATAGACATGCGCCGGATGCACGGCGATTTTGCCAATATCTGCGCCATCAAGCCCGATATCTCCTGTCAGTGCACCTAATACATCACCCGGGCGCATTTTGGCTTTTTTCCCGCCATCGATACACAACGTTGCCATTTCTGCTTCCAGCGTCGCAATGGAACTATTAGCTGGCGGCGTTTGCCAGTTAAGTTTTATCTGCAACATGTCAGAAATGATATTGGCCCGCTGTGCTTCTTCCGGAGCACAGAAACTGATCGCCAGACCGCTATTTCCTGCACGAGCTGTACGACCGATGCGATGTACATGAACTTCAGGGTCCCACGCCAGCTCAAAGTTCACCACCAGCTCAAGCGATTTAATATCCAGACCACGCGCAGCAACATCAGTCGCGACCAGTACACGGGCGCTACCGTTAGCAAAACGTACCAGGGTCTGATCGCGATCGCGTTGCTCCAAATCGCCGTGTAATGACAATGCACTTTGCCCTACTTCATTCAGCGCGTCGCAGACAGCCTGGCAATCTTTTTTGGTATTGCAAAACACCACGCAAGAGGATGGCTGATGCAAGCTTAATAACCGTTGCAACAGAGGAATTTTGCCTTTGCTGGATGTCTCATAAAATTGTTGTTCAATGGGTGGCAAAGCATCTGTTGAGTCAATTTCAATCGCCAAAGGATCGCGTTGCACTCGTCCGCTGATTGCAGCGATGGCTTCCGGCCAGGTTGCCGAAAACAGAAGCGTCTGTCGAGATGCAGGCGCAAAACGGATGACATCATCAATGGCATCGCTAAATCCCATATCCAGCATGCGGTCGGCCTCATCCATCACCAGCGTATTCAACGCATCCAGTGATACCGTGCCTTTTTGCAGGTGATCCAGCAAACGCCCCGGCGTTGCCACGATAATATGCGGCGCATGTTGCAACGAATCACGCTGCATACCGAACGGTTGACCACCGCACAACGTCAAAATTTTGGTATTTGGCAGAAAACGCGCCAGCCGACGCAATTCACCTGCCACCTGATCCGCCAGTTCACGCGTAGGACACAGCACTAAAGCCTGGGTTTGAAATAGCGACGCATCAATTTGCTGTAACAAGCCGAGGCCAAAAGCCGCCGTTTTGCCGCTGCCGGTTTTCGCCTGCACGCGAACATCTTTTCCGGCAAGGATCGCCGGAAGCGCGGCGGCCTGCACCGGCGTCATGGTTAAATAACCCAACTCATTAAGGTTCGTGAGTTGGGCGGGAGGCAAAACATTCAGGGTAGAAAAAGCGGTCACAATCTATTCTCGTGGTCATCGACGCAAAGTTAGCAGGCGCGTATCCTCGCAGATCTACGCTCACGATGCGACAATTTAATCGGTTCTTCATCGGGTGGTGGGTCAGGCATGGGTTGCGGGCGAGGGATCGGATCGGGCACTGGAACAGGATCGCCAGGAATCGGTTCAGGGACAGGAATTTGCAAATAAATAAGTGTCGTCATATTTCCCTCTGGTCATTGGGTGGACTCTTAAAGGGTAGACGCTGATAAATAACAGGCAAAAAAAAGCCGACTCATCAAAGTCGGCGTCGTACGAATCAATTGTGCTATGCAGTAATTCAAAAAAGGAAGTAAGACAATATGGAGCGCAACGCCCATCGCTTGACGTTGCATTCACCTGCAAGAGAGATATTGCCCTGAATGGGTAGAGAGTTTATTGACTTCGCTCAAACTTTGCGGCGTTTTTGTATACAGACAGCCGGAAAAATTGCTTTTGTTACAACCATTTACTACGATGCAACCATAAAGCAACACCACCAATAAGAACAACTAACAGAATAC
AAAAAATTGAAAATCCGAATTGCCACCCGCCGCCAGGGATCCCACCAAGGTTGACGCCAAATAACCCGGTCAGAAAGGTACTGGGTAAAAAGACCATTGCCATCAACGACATTGTATAGGTACGACGAGCTAAATTTTCCTGCATCACCTGAGCGATTTCATCCGCCATCACGCCAGTCCGTGCTATACAGGCGTCGATTTCGTCAAGGCCGCGCCCAAGGCGATCGGCAATATCCTGCATCCGACGGCGTTGGTCATCGCTCATCCACGGCAAACGTTCACTGGCAAGACGAGCATAAACATCACGTTGCGGTGCCATATAGCGACGCATCACAATTAATTGTTTGCGCAGCAGAGCCAGGAATCCACGCGGTGGAATTTGCTGATCAAGGAGATTATCTTCAAGGTCGATAATTTTATCGTGCAGCTGCTCGATAAATTCACTGGAATGATCGGTCAACGCATCGCACACATCCACCAGCCATCCCCCGCAATCGGTCGGACCCGTGCCCTCTTCCAGATCGCTCACCACATCGTCCAGCGCCAGCACTTTGCGTTGTCGGGTCGAAACAATTAACCGCCCGTCCATATATACACGCATGGCGACCAGTTGATCGGGGCGTTCATCGGTGCTGCCGTTTATACAGCGCAATGTAATCAGCGTGCCTTCACCGAGACGGCTGACTCGGGGACGCGTGCTCTCGCCCGCCAGCGCATCACGTACGTTATTGGGAAGCAGCGGTGTTGTCGCCAGCCATTGGGCGCTATCATGGTGTACATAATTAAGGTGGAGCCAACAGGGATGCGCTTCATCAATCACATCTGTATTTTCCAGCGGTTTAACGCCGCCTCTACCATCCAGCATCCAGGCAAATACTGCATCCGGGACATTAACGTCCGATCCCTTAATCGCTTCCACAGTGCCTCCATCATCAACGCATTATTTTGTAGTCTAGCCTTCTGGCCCTGTTACGCAACATCTCATCACCCCATTACCCTGAAATGATTAATAAAATTCTGTCTAAATTGAATACAAAAAGCAAAATGCTTTTCCGTATACAAACCGTGTGAAGTGTTAAATAGCGTCTATCATTATCAGAATTATCTGATCATATGACGTGGCTTTTTTGCGATCGGATAGCAACAAAAATTGATAAAAATAACGGGATCTCAATGATTACGCACAACTTCAATACCCTGGACTTACTCACCAGTCCTGTCTGGATCGTTTCGCCCTTTGAGGAACAGTTAATTTATGCCAATAGCGCGGCGAAACTGTTGATGCAAGACCTCACGTTTAGTCAGCTACGAACCGGACCCTATTCCGTCTCCTCACAAAAAGAACTGCCGAAATACCTCTCCGATCTGCAAAACCAACACGATATTATCGAAATCCTCACTGTTCAGCGTAAAGAAGAGGAAACAGCATTGAGCTGTCGGCTTGTTTTGCGAAAGCTGACAGAAACAGAACCGGTGATTATTTTCGAAGGTATCGAAGCGCCGGCAACGCTGGGTTTAAAAGCCAGTCGCTCGGCAAATTATCAGCGCAAAAAACAAGGTTTTTATGCGCGCTTTTTTCTGACTAACTCTGCACCAATGTTGTTGATTGACCCGTCACGAGATGGACAAATCGTCGATGCTAACCTCGCCGCGCTCAATTTCTATGGTTATAACCATGAAACGATGTGCCAGAAACATACCTGGGAAATAAATATGCTCGGGCGTCGCGTCATGCCTATCATGCATGAAATCTCGCATTTACCCGGTGGTCATAAACCTTTGAATTTTGTTCATAAACTGGCGGATGGTTCGACTCGTCATGTGCAGACCTATGCCGGACCGATTGAAATTTATGGCGACAAGCTCATGTTATGTATTGTGCATGATATTACTGAGCAAAAACGGCTGGAGGAGCAGCTGGAACATGCTGCTCACCATGACGCGATGACCGGATTACTGAATCGGCGACAGTTTTATCACATTACGGAACCAGGCCAAAT
GCAGCATCTCGCCATCGCTCAGGATTACAGCTTGTTGCTCATCGACACCGATCGTTTTAAACACATTAACGATCTCTATGGGCATTCTAAAGGTGATGAGGTGTTATGCGCCCTCGCCCGCACCCTCGAAAGTTGCGCTCGCAAAGGCGATTTGGTGTTTCGTTGGGGAGGCGAAGAGTTTGTCTTATTGCTACCAAGAACCCCACTGGATACCGCGCTTTCGCTGGCTGAAACTATCCGCGTAAGCGTGGCAAAAGTGAGTATTTCGGGCTTACCACGCTTTACCGTCAGCATTGGTGTGGCGCATCACGAAGGAAATGAAAGCATCGATGAACTGTTTAAACGCGTTGATGATGCTTTGTATCGGGCGAAAAATGATGGACGCAACCGCGTGCTGGCGGCATAA
Protein sequences of DBSCAN-SWA_7 >CP034953|2445686:2476017|2462338_2462815_+|QAA90030.1|DBSCAN-SWA MLSGKDLGRAIEQAINKKIASGSVKSKAEVARHFKVQPPSIYDWIKKGSISKDKLPELWRFFSDVVGPEHWGLNEYPIPTPTNSDTKSELLDINNLYQAASDEIRAIVAFLLSGNATEPDWVDHDVRAYIAAMEMKVGKYLKALESERKSQNITKTGT >CP034953|2445686:2476017|2465193_2467794_+|QAA90035.1|DBSCAN-SWA MSTKPLFLLRKAKKSSGEPDVVLWASNDFESTCATLDYLIVKSGKKLSSYFKAVATNFPVVNDLPAEGEIDFTWSERYQLSKDSMTWELKPGAAPDNAHYQGNTNVNGEDMTEIEENMLLPISGQELPIRWLAQHGSEKPVTHVSRDGLQALHIARAEELPAVTALAVSHKTSLLDPLEIRELHKLVRDTDKVFPNPGNSNLGLITAFFEAYLNADYTDRGLLTKEWMKGNRVSHITRTASGANAGGGNLTDRGEGFVHDLTSLARDVATGVLARSMDLDIYNLHPAHAKRIEEIIAENKPPFSVFRDKFITMPGGLDYSRAIVVASVKEAPIGIEVIPAHVTEYLNKVLTETDHANPDPEIVDIACGRSSAPMPQRVTEEGKQDDEEKPQPSGTTAVEQGEAETMEPDATEHHQDTQPLDAQSQVNSVDAKYQELRAELHEARKNIPSKNPVDDDKLLAASRGEFVDGISDPNDPKWVKGIQTRDCVYQNQPETEKTSPDMNQPEPVVQQEPEIACNACGQTGGDNCPDCGAVMGDATYQETFDEESQVEAKENDPEEMEGAEHPHNENAGSDPHRDCSDETGEVADPVIVEDIEPGIYYGISNENYHAGPGISKSQLDDIADTPALYLWRKNAPVDTTKTKTLDLGTAFHCRVLEPEEFSNRFIVAPEFNRRTNAGKEEEKAFLMECASTGKTVITAEEGRKIELMYQSVMALPLGQWLVESAGHAESSIYWEDPETGILCRCRPDKIIPEFHWIMDVKTTADIQRFKTAYYDYRYHVQDAFYSDGYEAQFGVQPTFVFLVASTTIECGRYPVEIFMMGEEAKLAGQQEYHRNLRTLSDCLNTDEWPAIKTLSLPRWAKEYAND >CP034953|2445686:2476017|2460603_2461461_-|QAA90027.1|DBSCAN-SWA MLFVLILSHRAASYGAIMAALPYMQLYIADYLADTMHLSAEEHGAYLLLMFNYWQTGKPIPKNRLAKIARLTNERWADVEPSLQEFFCDNGEEWVHLRIEEDLASVREKLTKKSAAGKASVQARRSRKEADVQTKQERNLTGVQTDVEVVFEHDVNTKATNKDTDKDLKTDPPLNPPRGNRGVKKFDPLDITLPNWISVSLWREWVEFRQALRKPIRTEQGANGAIRELEKFRQQGFSPEQVIRHSIANEYQGLFAPKGVRPETLLRQVNTVSLPDSAIPPGFRG >CP034953|2445686:2476017|2448173_2448287_+|QAA90015.1|DBSCAN-SWA MNSILIITSLLIIFSIFSHALIKLGIGISNNPDKTDV >CP034953|2445686:2476017|2464816_2465092_+|QAA91914.1|DBSCAN-SWA MITNYEATVVTTDDIVHEVNLEGKRIGYVIKTENKETPFTVVDIDGPSGNVKTLDEGVKKMCLVHIGKNLPAEKKAEFLATLIAMKLKGEI >CP034953|2445686:2476017|2474784_2476017_+|QAA90044.1|DBSCAN-SWA 
MITHNFNTLDLLTSPVWIVSPFEEQLIYANSAAKLLMQDLTFSQLRTGPYSVSSQKELPKYLSDLQNQHDIIEILTVQRKEEETALSCRLVLRKLTETEPVIIFEGIEAPATLGLKASRSANYQRKKQGFYARFFLTNSAPMLLIDPSRDGQIVDANLAALNFYGYNHETMCQKHTWEINMLGRRVMPIMHEISHLPGGHKPLNFVHKLADGSTRHVQTYAGPIEIYGDKLMLCIVHDITEQKRLEEQLEHAAHHDAMTGLLNRRQFYHITEPGQMQHLAIAQDYSLLLIDTDRFKHINDLYGHSKGDEVLCALARTLESCARKGDLVFRWGGEEFVLLLPRTPLDTALSLAETIRVSVAKVSISGLPRFTVSIGVAHHEGNESIDELFKRVDDALYRAKNDGRNRVLAA >CP034953|2445686:2476017|2456939_2457203_-|QAA90023.1|DBSCAN-SWA MSEKLKIVYRPLQELSPYAHNARTHSTEQVAQLVESIKQFGWTNPVLIDEKGEIIAGHGRVMAAEMLKMDSVPVIVLSGLTDEQKQR >CP034953|2445686:2476017|2448355_2448589_+|QAA90016.1|DBSCAN-SWA MKSKDTLKWFPAQLPEVRIILGDAVVEVAKQGRPINTRTLLDYIEGNIKKTSWLDNKELLQTAISVLKDNQNLNGKM >CP034953|2445686:2476017|2449593_2450169_-|QAA90018.1|tail|DBSCAN-SWA MAFRMSEQPRTIKIYNLLAGTNEFIGEGDAYIPPHTGLPANSTDIAPPDIPAGFVAVFNSDESSWHLVEDHRGKTVYDVASGDALFISELGPLPENVTWLSPEGEFQKWNGTAWVKDTEAEKLFRIREAEETKNNLMQVASEHIAPLQDAADLEIATEEEISLLEAWKKYRVLLNRVDTSTAQDIEWPALP >CP034953|2445686:2476017|2446960_2447395_+|QAA90014.1|DBSCAN-SWA MNRTILVPIDISDSELTQRVISHVEEEAKIDDAEVHFLTVIPSLPYYASLGLAYSAELPAMDDLKAEAKSQLEEIIKKFKLPTDRVHVHVEEGSPKDRILELAKKIPAHMIIIASHRPDITTYLLGSNAAAVVRHAECSVLVVR >CP034953|2445686:2476017|2456599_2456959_-|QAA90022.1|DBSCAN-SWA MSRSSDNDQYRSRNALIRRHIEKMDASLHVGTKEFDISKVSEVDSVDDLLIDNAARYLLKDWKGVGELVNGVEVALEYTAERGIALLKQNPELYWQILAEAASIAQGKEQQKQDTIKKP >CP034953|2445686:2476017|2463420_2463909_-|QAA91913.1|DBSCAN-SWA MLDVFTPLLKLFANEPLERLMYTIIIFGLTLWLIPKEFTVAFNAYTEIPWLFQIIVFAFSFVVAISFSRLRAHIQKHYSLLPEQRVLLRLSEKEIAVFKDFLKTGNLIITSPCRNPVMKKLERKGIIQHQSDSANCSYYLVTEKYSHFMKLFWNSRSRRFNR >CP034953|2445686:2476017|2459850_2460597_-|QAA90026.1|DBSCAN-SWA MKNIATGDVLERIRRLAPSHVTAPFKTVAEWREWQLSEGQKRCEEINRQNRQLRVEKILNRSGIQPLHRKCSFSNYQVQNEGQRYALSQAKSIADELMTGCTNFAFSGKPGTGKNHLAAAIGNRLLKDGQTVIVVTVADVMSALHASYDDGQSGEKFLRELCEVDLLVLDEIGIQRETKNEQVVLHQIVDRRTASMRSVGMLTNLNYEAMKTLLGERIMDRMTMNGGRWVNFNWESWRPNVVQPGIAK >CP034953|2445686:2476017|2459267_2459828_-|QAA90025.1|DBSCAN-SWA 
METVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVSPLWLSASSSACHGVLIITVITPSPTKNSATKMPEN >CP034953|2445686:2476017|2445686_2446820_+|QAA90013.1|DBSCAN-SWA MKSKVLALLIPALLAAGAAHAAEVYNKDGNKLDLYGKVDGLHYFSDNSAKDGDQSYARLGFKGETQINDQLTGYGQWEYNIQANNTESSKNQSWTRLAFAGLKFADYGSFDYGRNYGVMYDIEGWTDMLPEFGGDSYTNADNFMTGRANGVATYRNTDFFGLVNGLNFAVQYQGNNEGASNGQEGTNNGRDVRHENGDGWGLSTTYDLGMGFSAGAAYTSSDRTNDQVNHTAAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPFGDSDYAVANKTQNFEVTAQYQFDFGLRPAVSFLMSKGRDLHAAGGADNPAGVDDKDLVKYADIGATYYFNKNMSTYVDYKINLLDEDDSFYAANGISTDDIVALGLVYQF >CP034953|2445686:2476017|2450168_2453531_-|QAA90019.1|DBSCAN-SWA MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA >CP034953|2445686:2476017|2470631_2471567_+|QAA90041.1|tRNA|DBSCAN-SWA 
MQENQQITKKEQYNLNKLQKRLRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQKQPGFPEHVLPEYLEKLGVEYKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIQRFADAKAFPIIPCNLCGSQPNLQRQVIADMLRDWDKRYPGRIETMFSAMQNVVPSHLCDTNLFDFKGITHGSEVVNGGDLAFDREEIPLQPACWQPEEDENQLDELRLNVVEVK >CP034953|2445686:2476017|2461918_2462215_-|QAA90029.1|DBSCAN-SWA MKKENYSFKQACAVVGGQSAMARLLGVSPPSVNQWIKGVRQLPAERCPAIERATRGEVLCEELRPDIDWSYLRRSACCSQNMSVKQLNDSNKSSFDHT >CP034953|2445686:2476017|2463268_2463424_+|QAA90032.1|DBSCAN-SWA MQKIDLGNNESLVCGVFPNQDGTFTAMTYTKSKTFKTETGARRWLEKHTVS >CP034953|2445686:2476017|2453595_2453811_-|QAA91911.1|DBSCAN-SWA MSGTCSHLPKGKTHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVAIDIAYEGPGSGDWRTDGFIVGVGYKF >CP034953|2445686:2476017|2473546_2474530_-|QAA90043.1|DBSCAN-SWA MEAIKGSDVNVPDAVFAWMLDGRGGVKPLENTDVIDEAHPCWLHLNYVHHDSAQWLATTPLLPNNVRDALAGESTRPRVSRLGEGTLITLRCINGSTDERPDQLVAMRVYMDGRLIVSTRQRKVLALDDVVSDLEEGTGPTDCGGWLVDVCDALTDHSSEFIEQLHDKIIDLEDNLLDQQIPPRGFLALLRKQLIVMRRYMAPQRDVYARLASERLPWMSDDQRRRMQDIADRLGRGLDEIDACIARTGVMADEIAQVMQENLARRTYTMSLMAMVFLPSTFLTGLFGVNLGGIPGGGWQFGFSIFCILLVVLIGGVALWLHRSKWL >CP034953|2445686:2476017|2464350_2464572_+|QAA90033.1|DBSCAN-SWA MIAHHFGTDEIPRQCVTPGDYVLHEGRTYIASANNIKKRKLYIRNLTTKTFITDRMIKVFLGRDGLPVKAESW >CP034953|2445686:2476017|2463123_2463258_+|QAA90031.1|DBSCAN-SWA MVHYEVVQYLMDCCGITYNQAVQALRSNDWDLWQAEVAIRSNKM >CP034953|2445686:2476017|2468839_2469049_+|QAA90038.1|DBSCAN-SWA MYKITATIEKEGGTPTNWTRYSKSKLTKSECEKMLSGKKEAGVSREQKVKLINFNCEKLQSSRIALYSN >CP034953|2445686:2476017|2467786_2468596_+|QAA90036.1|DBSCAN-SWA MTKQPPIAKADLQKTQGNRAPAAVKNSDVISFINQPSMKEQLAAALPRHMTAERMIRIATTEIRKVPALGNCDTMSFVSAIVQCSQLGLEPGSALGHAYLLPFGNKNEKSGKKNVQLIIGYRGMIDLARRSGQIASLSARVVREGDEFSFEFGLDEKLIHRPGENEDAPVTHVYAVARLKDGGTQFEVMTRKQIELVRSLSKAGNNGPWVTHWEEMAKKTAIRRLFKYLPVSIEIQRAVSMDEKEPLTIDPADSSVLTGEYSVIDNSEE >CP034953|2445686:2476017|2458994_2459180_-|QAA90024.1|lysis|DBSCAN-SWA MRKLKMMLCVMMLPLVVVGCTSKQSVSQCVKPPPPPAWIMQPPPDWQTPLNGIISPSGNDW 
>CP034953|2445686:2476017|2468652_2468847_+|QAA90037.1|DBSCAN-SWA MRYDNVKPCPFCGCPSVTVKAISGYYRAKCNGCESRTGYGGSEKEALERWNKRTTGNNNGGVHV >CP034953|2445686:2476017|2448905_2449496_+|QAA90017.1|DBSCAN-SWA MSRIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPGFNRLLARLKCGDQLIVTKLDRLGCNAMDIRKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMQVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQVVFERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI >CP034953|2445686:2476017|2461473_2461896_-|QAA90028.1|DBSCAN-SWA MKIKHEHIESVLFALAAEKGQAWVANAITEEYLRQGGGELPLVPGKDWNNQQNIYHRWLKGETKTQREKIQKLIPAILAILPRELRHRLCIFDTLERRALLAAQEALSTAIDAHDDAVQAVYRKAHFSGGGSSDDSVIVH >CP034953|2445686:2476017|2469344_2470580_+|QAA90040.1|DBSCAN-SWA MSKLPTGVEIRGRYIRIWFMFRGKRCRETLKGWEITNSNIKKAGNLRALIVHEINSGEFEYLRRFPQSSTGAKMVTTRVIKTFGELCDIWTKIKETELTTNTMKKTKSQLKTLRIIICESTPISHIRYSDILNYRNELLHGETLYLDNPRSNKKGRTVRTVDNYIALLCSLLRFAYQSGFISTKPFEGVKKLQRNRIKPDPLSKTEFNALMESEKGQSQNLWKFAVYSGLRHGELAALAWEDVDLEKGIVNVRRNLTILDMFGPPKTNAGIRTVTLLQPALEALKEQYKLTGHHRKSEITFYHREYGRTEKQKLHFVFMPRVCNGKQKPYYSVSSLGARWNAAVKRAGIRRRNPYHTRHTFACWLLTAGANPAFIASQMGHETAQMVYEIYGMWIDDMNDEQIAMLNARLS >CP034953|2445686:2476017|2453853_2454834_+|QAA90020.1|transposase|DBSCAN-SWA MSHQLTFADSEFSTKRRQTRKEIFLSRMEQILPWQNMTAVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMRLFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQGTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSGLTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVDWLIAERPGRVKTLKQHPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGFVKARYKGLLKNDNQLAMLFTLANLFRVDQMIRQWERSQ >CP034953|2445686:2476017|2457340_2458798_-|QAA91912.1|DBSCAN-SWA 
MNTSHVRVVTHMCGFLVWLYSLSMLPPMVVALFYKEKSLFVFFITFVIFFCIGGGAWYTTKKSGIQLRTRDGFIIIVMFWILFSVISAFPLWIDSELNLTFIDALFEGVSGITTTGATVIDDVSSLPRAYLYYRSQLNFIGGLGVIVLAVAVLPLLGIGGAKLYQSEMPGPFKDDKLTPRLADTSRTLWITYSLLGIACIVCYRLAGMPLFDAICHGISTVSLGGFSTHSESIGYFNNYLVELVAGSFSLLSAFNFTLWYIVISRKTIKPLIRDIELRFFLLIALGVIIVTSFQVWHIGMYDLHGSFIHSFFLASSMLTDNGLATQDYASWPTHTIVFLLLSSFFGGCIGSTCGGIKSLRFLILFKQSKHEINQLSHPRALLSVNVGGKIVTDRVMRSVWSFFFLYTLFTVFFILVLNGMGYDFLTSFATVAACINNMGLGFGATASSFGVLNDIAKCLMCIAMILGRLEIYPVIILFSGFFWRS >CP034953|2445686:2476017|2464571_2464742_+|QAA90034.1|DBSCAN-SWA MTKKIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHEL >CP034953|2445686:2476017|2471695_2473069_-|QAA90042.1|DBSCAN-SWA MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTGSGKTAAFGLGLLQQIDASLFQTQALVLCPTRELADQVAGELRRLARFLPNTKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTLVMDEADRMLDMGFSDAIDDVIRFAPASRQTLLFSATWPEAIAAISGRVQRDPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQPSSCVVFCNTKKDCQAVCDALNEVGQSALSLHGDLEQRDRDQTLVRFANGSARVLVATDVAARGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEAQRANIISDMLQIKLNWQTPPANSSIATLEAEMATLCIDGGKKAKMRPGDVLGALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKAWKQLQGGKIKGKTCRVRLLK >CP034953|2445686:2476017|2456291_2456492_-|QAA90021.1|DBSCAN-SWA MLKELLYAYSVISRARRYAGMAGVPLPLSLTEINEYLATHPVLIERDEFEAVIFALDDQYFQEQCV >CP034953|2445686:2476017|2469127_2469343_+|QAA90039.1|DBSCAN-SWA MAQVIFNEEWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYNYPKINQFIKDS |
36 | Escherichia_phage(43.33%) | lysis,tRNA,tail,transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
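The protein headers listed for each prophage region appear to follow a fixed pipe-delimited layout: contig, region start:end, protein start_end_strand, accession, an optional keyword (for example `tail` or `transposase`), and the tool name. The following is a minimal sketch for pulling those fields apart, assuming that layout holds for every header; it is inferred from the headers on this page, not from a documented specification. The names `ProteinHeader`, `parse_header` and `region_keywords` are hypothetical helpers introduced only for illustration.

```python
from typing import Iterable, List, NamedTuple, Optional


class ProteinHeader(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    start: int
    end: int
    strand: str
    accession: str
    keyword: Optional[str]  # e.g. "tail"; None when the header carries no keyword field
    tool: str


def parse_header(header: str) -> ProteinHeader:
    """Parse one '>'-prefixed header (field layout is an assumption, see lead-in)."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected header layout: {header!r}")
    contig, region, location, accession = fields[:4]
    # A sixth field means a keyword sits between the accession and the tool name.
    keyword = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    start, end, strand = location.split("_")
    return ProteinHeader(contig, region_start, region_end,
                         int(start), int(end), strand,
                         accession, keyword, fields[-1])


def region_keywords(headers: Iterable[str]) -> List[str]:
    """Collect the distinct keywords found in a set of headers, sorted."""
    found = {parse_header(h).keyword for h in headers}
    found.discard(None)
    return sorted(found)
```

Applied to the headers of one region, `region_keywords` should give the same set of keywords that the region's summary row lists (for DBSCAN-SWA_7: lysis, tRNA, tail, transposase), which is a quick consistency check on the parsed data.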
DBSCAN-SWA_8 |
3259833 : 3304196
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >CP034953|3259833:3304196|DBSCAN-SWA GTTAGTTCTTCTTTTCGGATCCGGCACTTCTGGGGGGGAAATCCAGCGATGGCTGGATTATGTCGTCAATTAAAAATGCGGCGAGTAGATTAGCAAATATCCACGCTTTCGCGAGTTCAGGTTCCTTTGCACGCAAAGCATCCAGGTGCAGCAAACTTTTGAGCCGCTTAAAAGCCAGTTCAATTTGCCATCGCAGACGGTAACAATCAGCCACTTGCTCTGCTGAATATTCATCTTCCGGTAATGATGTTAGCAATAGCACATGGCCCGCTGCTTCCAGCGTTTCCGCCTGAACTACTCGTCCTTTTCGACGATTCTCGCTGAGCAGTCGGGTTTTACTGATTAATGCTTTTTCGGGAGGAAGTGATACGGCAATGAGACGTGCCGGAAAGGGAGCTCCGGCTTTTTTATTACCTGAATTGCCTATCATTACAGTGGTTTCACCGTTCTTACCGCAATCCAGCCCGCGCAGAAAACCCATCATGTCAAAGCGCATTCCTTCTGCAGTTAACCAGCGCAATCCTCGCCAGTGAACCCGGACGATATAATCAGCTTCTCCAAAAGCAAGTGAGCGGATACATTCGGGACGCGAACCGAATCCCCGGTCAGCAATGCGTATCTCGTCTGCCGTTTGCGCAAATCGGTCCAGCCGTTCAGCGTCTCTGCTGTCGGTTAGCTCAAAATCAGTGAACTGACAGGTATGAGGATCATATCCCATATGTAGTCGCCATTCAGCGCTGCCGCCCCCGGGCGCACTGATTGCTGTTCCATCGACAAGACGCAATCTCTTTCCGCTTGTACAACCCGTAACTGCGGCGCGTACAGCAAGTGTTTGTGCGGCAAGTATGCCAAACCAGTCGGCGGCATTCCGCAGCCGCTTCAGGAGAGCCACGTCAGATAATGTTGCAACGTCATGGAGCTGAGCCCATGCAGTGACTTCACGTAATGACATCCCCCCGGGGCCGTAAGCCAGCCCCAGACGTAGCAGAGTTGCAGCATCACGAATTTCGCGGCGGCGGGTTAGAGCCCCGGCATTACGTGCCGAAGTATCCAGTTCTTCGGGCTTACCAATATGGGCCAGAATTGCTGACCAGTTATCGTGAGAGTAATTCATCGGCACGTTAAATCATATCAGGCGTAATACCACAACCCTTAAGTTAGCGCTTATGGGATTACTCCCCGCACAACGGCTACTTCTTCGGTTCGTAAGCGAGAACAGCCTTAAACTCAATATTACGTTCCTTCACCGTAAACTCACACAGCGAGTCTCCGACCAGAAGCGTAAATCCCAGCACCGTTAAACACAGCACTATGACTGCCGCAAGGGCATATTTCGTCAGCATGTTTATATTGCCTCCATGCGAAGAAGGGACTACCATCCAAATTGTTCGGGTTTGGAGTGGCAGCCCCAGGGTTGATAGAAATATCTCCTGGGGCTTCATCTTTCCAGACCTCAGAGTAAACCTGAAACCGGAAAGCTTCAGGCACCCGCCGTTATCTTACCCTTCCTCAACAAAATAGCTGTTTATTTATACAGTGAGTCCTGTTTCGGACTTCTCACTCCTCATTTAATCAACTGAAGCTGAACCCGTTTTTTTCTGCATGACTCAATCGTCAAAATTCTTCGCCACCAGTCAGCACTGCCATAACTTGTCACATTGCAAAAGTAGCCAAATTGCTTCAGGCACCATCAGTTCACCATCACCAAAAAAAAGAAACATTCCAGAACTTCCGCATCTCGCCCAGCCGCTCCGACTTCTATACTGAATAGAAAACGCCAACATAAGAGAAACCTATGCCATTACCCGATTTTCATGTTTCTGAACCTTTTACCCTCGGTATTGAACTGGAAATGCAGGTGGTTAATCCGCCGGGCTATGACTTAAGCCAGGACTCTTCAATGCTGATTGACGCGGTTAAAAATAA
GATCACGGCCGGAGAGGTAAAGCACGATATCACCGAAAGTATGCTGGAGCTGGCGACGGATGTTTGCCGTGATATCAACCAGGCTGCCGGGCAGTTTTCAGCGATGCAGAAAGTCGTATTGCAGGCAGCCACAGACCATCATCTGGAAATTTGCGGCGGTGGCACGCACCCGTTTCAGAAATGGCAGCGTCAGGAGGTATGCGATAACGAACGCTATCAACGCACGCTGGAAAACTTTGGTTATCTCATTCAGCAGGCGACCGTTTTTGGTCAGCATGTCCATGTTGGCTGCGCCAGTGGCGATGACGCCATTTATTTGCTGCACGGCTTGTCACGATTTGTGCCGCACTTTATCGCCCTTTCCGCCGCGTCGCCATATATGCAGGGAACGGATACGCGTTTTGCCTCCTCACGACCGAATATTTTTTCCGCCTTTCCTGATAATGGCCCGATGCCGTGGGTCAGTAACTGGCAACAATTTGAAGCCCTGTTTCGCTGTCTGAGTTACACCACGATGATCGACAGCATTAAAGATCTGCACTGGGATATTCGCCCCAGTCCTCATTTTGGCACGGTGGAGGTTCGGGTGATGGATACCCCGTTAACCCTTAGCCACGCAGTAAATATGGCGGGATTAATTCAGGCTACCGCCCACTGGTTACTGACGGAACGCCCGTTTAAACATCAGGAAAAAGATTACCTGCTGTATAAATTCAACCGTTTCCAGGCCTGTCGCTATGGGCTTGAAGGCGTCATCACCGATCCGCACACTGGAGATCGTCGACCGCTAACGGAAGATACCTTGCGATTGCTGGAAAAAATCGCCCCTTCCGCACATAAAATTGGTGCATCGAGCGCGATTGAGGCCCTGCATCGCCAGGTCGTCAGCGGTCTGAATGAAGCGCAGCTAATGCGCGATTTCGTCGCCGATGGCGGCTCGCTGATTGGGCTGGTGAAAAAGCATTGTGAGATCTGGGCCGGTGACTAAGCCGGGAATTGCCTTACAGAAGAATATCCCTGACAATGGTGTTTTAACTTACATTTAACAATGTTATGAAACACCCTTTAGAAACCTTGACCACCGCAGCAGGCATTTTGCTGATGGCTTTCCTCTCTTGCCTGCTGCTGCCCGCCCCCGCACTGGGGCTGGCGCTGGCACAAAAACTGGTGACCATGTTCCATCTGATGGATCTTAGTCAGCTTTACACTTTATTGTTTTGTCTGTGGTTTTTAGTGCTGGGCGCTATTGAGTATTTTGTTCTGCGCTTTATCTGGCGACGCTGGTTCTCGCTGGCGGATTAAACGTCGATTATCGCCGTACTTCGCGGCATACTTTGCTTATTCTCCTTCGCCTAAAGGAACGTTTATGGATAAGCAATCACTGCACGAAACGGCGAAACGCCTGGCCCTTGAGTTACCCTTTGTCGAGCTTTGCTGGCCTTTTGGCCCGGAGTTCGATGTTTTTAAAATTGGCGGCAAGATTTTTATGCTGTCGTCGGAGCTACGCGGCGTCCCCTTTATCAATCTGAAGTCCGATCCACAAAAATCCCTGTTAAATCAGCAAATATACCCAAGCATTAAGCCAGGGTATCACATGAATAAAAAGCACTGGATTTCAGTGTATCCCGGCGAAGAAATCTCCGAAGCGTTACTTCGCGATCTGATCAACGATTCGTGGAATCTGGTGGTTGATGGTCTGGCTAAACGCGATCAAAAAAGAGTGCGTCCAGGCTAAAGCGGAAATCTATAGCGCATTTTTCTCGCTTACCATTTCTCGTTGAACCTTGTAATCTGCTGGCACGCAAAATTACTTTCACATGGAGTCTTTATGGATATCATTTCTGTCGCCTTAAAGCGTCATTCCACTAAGGCATTTGATGCCAGCAAAAAACTTACCCCGGAACAGGCCGAGCAGATCAAAACGCTACTGCAATACAGCCCATCCAGCACCAACTCCCAGCCGTGGCATTTTATTGTTGCCAGCACGGAAGAAGGTAAAG
CGCGTGTTGCCAAATCCGCTGCCGGTAATTACGTGTTCAACGAGCGTAAAATGCTTGATGCCTCGCACGTCGTGGTGTTCTGTGCAAAAACCGCGATGGACGATGTCTGGCTGAAGCTGGTTGTTGACCAGGAAGATGCCGATGGCCGCTTTGCCACGCCGGAAGCGAAAGCCGCGAACGATAAAGGTCGCAAGTTCTTCGCTGATATGCACCGTAAAGATCTGCATGATGATGCAGAGTGGATGGCAAAACAGGTTTATCTCAACGTCGGTAACTTCCTGCTCGGCGTGGCGGCTCTGGGTCTGGACGCGGTACCCATCGAAGGTTTTGACGCCGCCATCCTCGATGCAGAATTTGGTCTGAAAGAGAAAGGCTACACCAGTCTGGTGGTTGTTCCGGTAGGTCATCACAGCGTTGAAGATTTTAACGCTACGCTGCCGAAATCTCGTCTGCCGCAAAACATCACCTTAACCGAAGTGTAATTCTCTCTTGCCGGGCATCTGCCCGGCTATTTCCTCTCAGATTCTCCTGATTTGCATAACCCTGTTTCAGCCGTCATCATAGGCTGCTGTTGTATAAAGGAGACGTTATGCAGGATTTAATATCCCAGGTTGAAGATTTAGCGGGTATTGAGATCGATCACACCACCTCGATGGTGATGATTTTCGGTATTATTTTTCTGACCGCCGTCGTGGTGCATATTATTTTGCATTGGGTGGTACTGCGGACCTTCGAAAAACGTGCCATCGCCAGTTCACGGCTTTGGTTGCAAATCATTACCCAGAATAAACTCTTCCACCGTTTAGCTTTTACCCTGCAGGGGATTATCGTCAATATTCAGGCGGTATTCTGGCTGCAAAAAGGCACCGAAGCGGCAGATATTCTGACTACCTGCGCGCAGTTGTGGATCATGATGTATGCGCTGCTTTCAGTCTTCTCGTTGCTGGATGTTATTTTGAATCTGGCGCAGAAATTCCCGGCAGCATCTCAGTTACCGCTGAAAGGGATATTTCAGGGGATTAAACTGATCGGCGCGATTCTGGTCGGCATTTTGATGATCTCGCTGCTGATTGGTCAGTCGCCAGCGATTCTGATCAGCGGTCTTGGTGCAATGGCTGCCGTGCTGATGTTGGTATTTAAAGATCCGATTCTTGGTCTGGTGGCAGGTATTCAGCTTTCCGCGAACGATATGCTGAAACTGGGCGACTGGCTGGAGATGCCGAAATACGGCGCGGATGGCGCGGTGATCGATATTGGGTTAACCACCGTCAAAGTGCGTAACTGGGACAATACCATTACCACTATTCCCACCTGGTCTCTGGTTTCTGACTCCTTTAAAAACTGGAGCGGGATGTCAGCATCTGGCGGGCGACGTATTAAGCGCAGTATCAGTATTGATGTCACCAGTATTCGTTTTCTTGATGAAGACGAAATGCAACGTCTGAATAAAGCGCATTTGTTAAAGCCTTATTTAACCAGCCGCCATCAGGAAATTAATGAGTGGAATCGCCAGCAAGGTTCTACGGAGTCGGTATTAAATCTGCGCCGAATGACCAATATTGGAACCTTTCGTGCCTATCTGAACGAATATCTGCGTAACCATCCGCGGATTCGTAAAGATATGACCTTAATGGTACGCCAACTGGCTCCAGGTGATAACGGTTTACCGCTCGAGATCTACGCGTTTACTAACACCGTGGTGTGGCTGGAATATGAAAGCATTCAGGCTGATATATTCGATCACATATTTGCGATTGTCGAAGAGTTTGGTCTGCGACTTCATCAGTCGCCAACCGGCAATGATATTCGCTCTCTGGCGGGTGCATTTAAGCAGTAATTAAAAAAACCGCTCTCATCGAATGGATGAGAGCGGTTTCGGATGGTTGACATCGTTTTGTCGGATGTAGCGTGAATGCCTTATTTCCGACGCAGCGTTTTAAATGCCATAAACAGGAATACAATCCACACCGGCAGCAGGATCGCTGACAAGCGCATATCAT
CCATCGTGCACATCAGCAGCAAAATCATGCCGAGGAAGGCAATGCAGAGATAGTTGCCGAACGGATAGAGCAGCGCCTTAAACTGTGTTTCACGCCCCTGACGTCGCATCGCTGCACGAAAACGCAGATGCGCCAGACAGATCATAATCCAGTTCAACAGCAGCGTTGCTACCACCAGCGCCATCAGCAGACCAAACGCTTTTTGCGGCAGCAGATAGTTGATTAACACCACCAGCGAAGTGATCGCTCCGGAAAGCATCAGCGAGTTAATCGGCACACCGCGACGGCTGACGCGAGTCAAAAACTTCGGCGCATTACCCTGCACAGAAAGGCCAAACAGCATGCGGCTGTTAGAGTAAACCCCGCTGTTATACACTGACAGCGATGCTACCAGAATGACGAAGTTCAGCGCAGAAGCTACCACGTTGCTGTCGAGATTATGGAAAATCATCACAAACGGGCTACTGTTGGATTTCACTTCCACCCACGGATAGAGCGCCAGTAAAACCACCAGTGAACCGATGTAAAACAGCAGGATGCGATACACCACCTGATTTACCGCTTTTGGAATGCTTTTTTCCGGATCGCGCGCTTCAGCGGCAGTAATCCCAATCAGCTCCAGACCGCCGAAGGAGAACATAATTACCGCCAGCGACAAAATCAGCCCATTCCAGCCGGTGGCGAAGAAACCACCGTAGCGCCAGAGGTTGTCGATACTGGCTTTCTCGCCGCCGTGACCAGAAAACAGCAGCCACAGGCCAAAGCCGATCATACCGATGATTGCCAGCACTTTAATCAACGCAAACCAGAACTCGGTTTCGCCATATAAGCGCACGTTCACCAGGTTAACGGCGTTGATGATAATAAAGAAGGCGGCAGCCCAAATCCACGTTGGAACATCCGGGAACCAGTACTGCATATAGATGCCCGCAGCGGTCAGCTCTGCCATTCCCACCAGCACGAACATTACCCAGTAGTTCCAGCCAGAGAGGAAGCCCGCAAACGGTCCCCAGTATTTATAGGCAAAGTGGGCAAATGAACCGGATACCGGCTCCTCAACCACCATTTCGCCAAGCTGGCGCATAATCAGGAAAGCGATGATCCCGGCGACGCCGTAGCCCAGCAATACAGCCGGACCCGCCATCTGAATCGCCGGGCCAATGCCAAGAAACAGACCAGTACCAATTGCGCCACCCAACGCAATCAGTTGAATATGACGGTTATGTAATCCGCGATGAAGCGTCGGCTCTTGATTCGACGCAGTATCTTCCGATACGGTTGACGCGTTTTTCACGCCTTTCCCCTGTGTGTCTTTTTTGTTGAGGGGCACCTTTTAACATTTAGTGCCCATCGTCGCAAGACACAATCCACACGGTTAAACGGGGTATCCTGCTTTTATTTCCGTACCCGATGTCGGTGCAGCCACATCAGCTTATACGCCGCCGGGATAATAAACAGCGACAGCAAAGGTGCGGTGATCATGCCGCCAATCATCGGCGCGGCAATCCGGCTCATCACCTCTGAACCAGCCCCCGTTCCCCACAGAATCGGCAGCAGACCGGCGATAATCACCGCCACCGTCATCGCTTTCGGGCGCACGCGCAGGACCGCGCCGTGATATAACGCCTCATCCAGCTTCTGCTCGCTGAATGTTTGTGGATTATTCAACGACGGCACGGCCTCTATGGCGTGACGTAAATACATCAGCATCACCACGCCAAATTCGGCGGCGACCCCGGCGAGGGCGATAAAGCCAGTGCCCGTCGCCACGGAAAGATGAAAGCCCATCCACCACAGCAACCAGATGCCGCCCACCAGCGCAAACGGTACGCTGCTGATAATCAGCAACGCTTCGCCCACCCGACGGAACGCCAGATACAACAGCACGAAGATAATCATCAACGTCATCGGCACCATGAGTTTAAGCTTATGGTTGGCGCGCTCCAGTAGCTCGAACTGCCCGGAGAATGCCACGCTGGTGCCCGGTTTTAACTGCACTT
TTTCAGCTATCGCTTTTTGCAAATCGTGAACCACCGACACCATGTCACGATCGCGGGCATCGATATAAATCCAGCTCGTCGGGCGCGCATTCTCGGTTTTCAGCATCGACGGTCCGGTAGAGACTTTAATGTCGGCCACGTCTGCCAGGGTGATTTGCTGCTTCATCGGCGTCAGGATCGGCAGCTGGCGCAGTGCCTGCGGACTATCGCGCCAGCTTTGCGGATAACGCAGATTAATTGGATAACGGGCAATCCCTTCCACCGTTTCGCCAACCATCGCCCCGCCCACCGCAGAAGTCACAAACAACTGCACATCCGCCACCGTCATACCGTAACGTGCGGCTTTTTCACGGTTAATCTCAACGTTGATATAGCGCCCACCTTCCAGCCGCTCGGCAAGCGCAGAAGCTACGCCTGGCACCGTTCGCGCTACTTCTTCAATTTGCTCAGCCATCGCGTCGATATCCGCCAGCACAGTGCCGGAAACTTTAATGCCGATGGGGCTTTTAATGCCGGTTGAGAGCATATCGATACGGTTACGAATTGGCGGCACCCACAGATTCGCCAGCCCCGGCAGCCGCACGGTGTTATCCAGTTCCTCAATGATTTTGTCCATCGTCATGCCTGGCCGCCACTGCTCCTGCGGCTTAAGCTGGATGGTCGTTTCTACCATCTCCAGCGGAGCAGAATCGGTGGCGGTTTCCGCTTTCCCGGTTTTGCCAAATACCCGCGCCACTTCAGGTACGCTCATAATTAGCTTGTCGGTTTTTTGCAGCATACTCGCCGCCTCTGCTGCGGAAATCCCCGGCAGCGTCGATGGCATATACAACAAGTCGCCTTCATTGATCTGCGGTAAAAATTCCCCGCCAACTTTATTGAGCGGCCAGAGAACCGTCAGCACCGAAAGCGCCGCCACCAGCAGCGTGGTTTTCGGCCAGTGCAGTACTTTCAGCAACAGCGGATGATAAACACGAATCAAAAAGCGATTGAGCGGGTTACTGCTTTCCGGCGGAATTTTGCCACGGATCCAGTAGCCCATCAGGATCGGGATCACTACGATCGCCAGCAGCGCCGCACCCGCCATCGCATACGTTTTGGTGAACGCCAACGGGCCAAACAGACGCCCTTCCTGCCCTTCCAGGGTGAAGATCGGGATAAACGACAACGTGATAATCAGCAGACTGATAAATAGCGCCGGCCCCACTTCAACAGACGCATCGGTGATCACCTGCCAGCGCGTTTTATTATCCAGCGTGGCGTCAGGATGCTGGTGCTGCCACTCTTCCAGCCGTTTATGCGCATTCTCGATCATGACGATAGCAGCATCGACCATCGCCCCGACGGCAATCGCAATGCCACCCAACGACATAATATTGGCATTCAGTCCCTGGAAGTGCATGACAATAAAAGCAATACACAACCCCAGCGGCAACGAAATAATCGCCACCAGCGCCGAGCGCACATGCCAGAGAAACAGCGCACAGACTACCGCCACCACAATAAACTCTTCCAGCAACTTGCCGCTGAGGTTGTCGATGGCGCGGTCAATGAGCTGGCTGCGATCGTATGTTGTAACTATCTCCACGCCTTCCGGCAGACTACTTTTCAGCGTTTCCAGTTTGTCCTTCACGGCGGCGATCACTTCTCGGGCGTTTTTGCCGGATCGCAGGATCACCACCCCGCCCGCCACTTCGCCTTCGCCGTTTAGTTCGGCAATGCCCCGGCGCATCTCCGGGCCAATCTGGACCTTCGCAACATCGCGCAGATAAACGGGCACGCCATTTTCACTGGCTTTTAAAACGATGTGATTAAAGTCGTCGAGCGTTTGCAGATAGCCGCTGGCGCGCACCATATATTCCGCTTCCGCCAGTTCGATCGACGAACCGCCCGCTTCCTGGTTTGAAGCATCCAGCGCGCTTTTTACTTCGGCGAGACTGATGCCATACTGCGCCAGGCGCTGGGGATCGATAACCACCTGATACTCTTTCACCACACCGCCC
ACCGACGCCACTTCCGCAACGTCAGGGATGGTTTTCAGCTCATATTTGAGAAACCAGTCCTGTAATGAGCGTAAATCGGCCAGATCGTGCTTACCGCTGCGATCCACCAGTGCATATTCATAGATCCAGCCAACACCCGTGGCATCTGGCCCCAGCTCGGCGCTGACTCCCGCAGGCAGCTTACCCTGTACCTGGTTGAGGTACTCCAGCACCCGCGAACGCGCCCAGTACGGATCGGTGCCATCTTCGAAAATGACATACACATAAGAGTCGCCAAACTGTGAGAAACCGCGCACAGTCTTCGCGCCAGGCACCGACAACATGGTGGTGGTTAGCGGATAAGTCACCTGATTTTCAACGATTTGTGGTGCCTGACCGGGATAGCTGGTTTTAATAATAACCTGCACATCGGAGAGATCCGGCAGCGCATCCACTGGCGTATTAATGATGGTCCAGGTGCCCCAGATGCTCAGAAACAACGCGCCCATCAGCACCAAAAAACGGTTCGCCACCGAGCGACGAATAATCCATTCAATCATTGGTTATTCCCTCAATGCGCATGGGTAGCACTTTCAGAGCGCATCCGCTCCAGTGCGCCAGAAATATTGGCTTCGGAATCAATCAGGAACAGGCCGCTGGAAACCACCTTTTCACCTTCCGCCAGACCAGAGCGTAATGCGGTGACGCCTTGCGATGCCTGGAAAACAGCAACGCGTTTCGGTACAAAGCGCCCGTCGGCATCAACGGTAATCACCCGCTGTTCGCTGCCGGTATCAATCAGCGCTTGTGACGGAATGAGCAGCATCGGTTCGCTGGCGGTGTTGAGTTGCAACCAGGCGTTCATGCCCGGTTTTAGCGCCTCGTCGGCGTTGTCGACTTCCAGACGCAGCTGCAGCGTGCGGGTCGCGGCATCCACGCCAGGTAGCAGCGTCCATTTGCGGATGGTGAGTGTTTTATCCGGTCGCGCCGGAACGGTGAGGGTAAACTGCGAGGCATCTTTCACCAGCCAGGCGATAGACTCCGGGATCGCAGCAGTGACCCACACCGGGTCCATACCCTGAATTTTCGCTACCACGTTATCTTTGGCGATATTCATTCCCGCGCGCAGATCAAACGCGGTGATCACGCCATCAATGGGCGCTTTGAGCGTAAAGCGAGTCTGGATTTTTTGCGTGGCGATCAGGCGGCGAATATCCGCCTCCGGCATTCCCGCCAGTCGCAGCCGCTCAAGAATGCCTTCAGTCTGGGTCGCCGTACCGCCGGTTTCGCGCAGCAGTAAATACTCACTCTGCGCTTCCACCCAGTCAGGAATGGTCAGGTCGAGAAGCGGTGTGCCCTTTTGTACTTTATCGCCCACGGTAAGCGGATACACCTTGTCGATAAACCCGGCAGCGCGAGCCTGCACAATGGCATACTGATACTCGTTGTAACTGACATTCGCCGGGAAACTCTGGGCAAAAGTCAGCGGTCCGCGCGTGACGGTAGCCGTTTTCACCCCCAGATTCTGCGTCTGAGTCGGGTCAATGCGCACACCAGACGCAGAACTCTCTTCATCGGCATATTTCGGCACCAGATCCATATCCATAAACGGCGATTTACCTGGTTTATCGAACCGCGTATTGGGATACATTGGGTCGTACCAGAATAAGATTTTACGTTCTGCGGTCGACGTTTTTTCTGCAGGCGGTTCCGCCTTTGCAACCCAGGTAAAACCTGCCGCAGAAATAATACCGCCCGCGATCATGCTGCCGATAATAAGCGCGATTTTTTTCATCTCATTAAACCTGGGTTACTGGCTGACTTTAATATCCTGTAATAAAGAAAGGTTGCCCTGCTGGACAAAATTAAACGCCACTTTGTCGCCGGTTTTAATTTCACTCATTTTCGTCTGCGGGGTGATGGTAAAGCGCATGGTCATCTCCGGCCAGTTCACGGCAGCAATCGGATCGTGATGGATGGTGATTTTTTTGCTTTCCAGATCGATACCCTTTACCACGCCA
GTGGCGCTAATAACCTGTGGTTGTGCTTCGCTCATGGTTTCATGATGATGTTCGTTAGCCTGGGCATTAAAGCCAATAACGGTAAACAGACTGAACATTGCGACTTGCAGTGCTTTTTTCATTTATTCTCTCCTGGAGTTAAAAGTCATACTCTAAATAATTCGAGTTGCAGGAAGGCGACAAGCGAGTGAATCCCCAGGAGCTTACATAAGTAAGTGACTGGGGTGAGCGAACGCAGACGCAGCACATGCAACTTGAAGTATGACGAGTATATTGCTGTCAACCGCCACCAAGTGCGGTATACAAAGAAATTTCGTTAACCTGACGGGCATAATTCAGATCAAGTAAAGTTTGTCGGGTTGCAAATAAAGAACGCTCGGCATCCAGCACTTCCAGATAACTTACTGCGCCGTGCTGATATAATGCCCGCGCCCGTTGCAAAGTAATTTGCAGCGATGCCAGATAACGCTGCTGGGCGCTAATTTGATCGTTCAGGCTTTGACGTAATGCCAGCGCATCTGCCACTTCTTTAAAGGCGTTCTGGATTTTCTGTTCATAATTCACCACCGACTGCTGCTGGCGAATTTCGGCGATATCCAGATTGGCCTGGTTGCGTCCGGCATTAAAAATGGGGATCTCAATTTTGGGTATAAAATTCCACATCCCGCTGCTGGCGTTAAATAATGACGATAGATCGCTACTGGCGGTCGATATTCCGCTGGTCAGGCTTATAGACGGAAAAAATGCCGCGCGTGCAGCGCCAATATTGGCATTAGCCGCCATTAACGCGTGTTCAGCTTCCATAATATCCGGGCGCTGCAATAAGATTTGCGACGACAATCCCGCCGGTAATTTAACGCTTTGCAGGCTGTCGCTGTTTACTGTCTGCGCTTGCGGCAGCTTGCCGTAGCTTCCCAGCAACAGTTGCAATGCATTATTCGCCTGCGCCAGTTCCCCCTGACGTTTAGCGATGTCGCTGCGGGTACTTTCTATCACCCCGCGAGCCTGTTCCAGCGCCAGAACATTACTGCTACCTGTCAACAGTTGTTTTTCGACAAACGCATATGACTGCTGATAATTACGCAGCGTTTCTTCGGCTATTTGCAATTGCGCATACGCCAGTTGCTGATTGAAATAGCTTTGCGCGACATTAGAAACCAGCAGAATATGCACCGCGCGCTGAGCTTCCTCAGTGGCTAAATAATTTTGTCGCTCGGCTTCGCTCATGTTCTTTAAGCGACCGAAAAAATCGAGGTCAAAGCTGGCGTTAAGGCCGGTCGAAAACTCCCGTGTCGTGGCGGTATTCCCTTTAAGATTACCGCTCCAGCTGCCGCTGCCCTCGCCATTGAGCTGTGGGTAGCGGTCGGCATCGGTCAGACGATATTGCGCCCGTGCCTCCTGCACTTTCAGCGTCGCCATGCGCAAATCCCGATTATTCACCAGCGCCTCGCTAATCAGCGTCTTCACCTGATTATCAACAAAAAAGGTGCGCCAGCCCGCGTTCTGATAGTTATCTGCTGCGTTAACCAGGCCGTTCTGGCTGAGTGAGAACTGCTGCGGCACGGGCATTGCCGGACGCTGATAATCCGGTGCCAGTGAACAACCGGTTAGCGCAAGGGCCACACAAAATGGCAGAAGTTTACAAGGAGACATAGGCTCATAATTTCTGGTGATTTTATGCCGCCAACTTTACTCGCCAGGCTCTGATTTTCCGGTGACAGAAAAATGACAAAATTGTCATTTTGCCAATAAGCGATTGCCATCTGATCCCGCTACTCTAGAATTGCCCGGGCAACATGCGGAGGAAATATGAAACTGTTGATTGTCGAAGATGAAAAGAAAACCGGAGAATACTTGACCAAAGGGTTAACCGAAGCCGGTTTTGTGGTCGATTTGGCCGACAACGGGCTGAATGGCTACCATCTGGCGATGACCGGTGATTATGATCTGATAATCCTCGATATTATGCTGCCGGACGTGAACGGCTGGGATATCGTGCG
CATGTTACGCTCCGCCAATAAAGGGATGCCGATTCTGTTGCTTACCGCGCTTGGCACCATTGAACATCGCGTCAAGGGGCTGGAGTTGGGGGCAGATGACTACCTGGTGAAGCCATTCGCTTTTGCTGAACTGCTGGCGCGGGTGCGCACATTACTGCGGCGCGGGGCGGCGGTGATTATCGAAAGTCAGTTTCAGGTTGCCGATTTGATGGTCGATCTCGTCAGCCGCAAAGTCACCCGCAGCGGCACGCGCATCACTTTGACCAGTAAAGAGTTTACTCTGCTGGAGTTCTTCCTTCGCCATCAGGGCGAAGTGCTGCCCCGCTCGCTTATCGCCTCGCAGGTATGGGACATGAATTTTGACAGCGATACCAATGCTATTGATGTGGCGGTGAAGCGGCTGCGCGGCAAAATCGACAACGACTTTGAGCCGAAGCTAATTCAGACCGTGCGCGGCGTGGGTTACATGCTTGAGGTGCCGGATGGTCAGTAAGCCATTTCAGCGCCCGTTTTCGCTGGCAACCCGCCTGACCTTTTTTATCAGCCTGGCCACCATCGCGGCGTTTTTCGCCTTTGCATGGATCATGATCCACTCAGTAAAAGTTCATTTTGCCGAGCAGGATATTAATGATTTAAAAGAGATTAGCGCCACCCTTGAACGGGTACTAAATCACCCTGACGAAACGCAAGCCCGACGCTTAATGACGCTGGAAGATATCGTCAGTGGTTATTCCAATGTGTTGATTTCCCTGGCAGATAGTCAGGGTAAAACGGTGTATCACTCCCCCGGTGCGCCGGATATCCGCGAGTTTACGCGTGACGCCATACCCGATAAAGACGCTCAGGGTGGCGAGGTGTATCTCCTTTCCGGCCCGACGATGATGATGCCAGGCCACGGTCACGGGCATATGGAACACAGCAACTGGCGGATGATTAACTTGCCGGTTGGCCCGTTGGTGGACGGCAAACCGATTTATACGCTCTACATCGCGCTTTCGATCGATTTTCATCTTCATTACATAAATGATTTGATGAATAAACTTATTATGACCGCATCGGTAATCAGCATCCTGATCGTCTTTATCGTACTGTTGGCGGTACATAAAGGTCACGCGCCGATCCGCAGCGTCAGCCGTCAAATCCAGAATATTACCTCGAAAGATCTCGACGTTCGCCTCGACCCGCAGACCGTGCCCATTGAGCTGGAACAGCTGGTACTGTCGTTCAACCATATGATCGAGCGTATTGAGGATGTCTTTACCCGCCAGTCCAATTTCTCAGCGGATATCGCTCACGAAATTCGCACACCAATTACGAATCTCATAACGCAAACCGAAATCGCCCTCAGCCAGTCGCGCAGCCAGAAGGAGCTGGAAGATGTGCTCTACTCTAATCTCGAAGAGCTGACGCGAATGGCGAAAATGGTCAGCGATATGCTGTTTCTCGCTCAGGCCGATAACAACCAGCTAATCCCCGAAAAGAAAATGCTCAACCTGGCGGATGAAGTCGGCAAAGTGTTCGATTTTTTCGAGGCGTTAGCGGAAGATCGCGGCGTGGAGTTGCGGTTTGTTGGCGACAAGTGTCAGGTCGCGGGCGATCCGCTGATGCTGCGTCGGGCGCTAAGCAACCTGCTTTCGAACGCCCTGCGTTATACGCCAACCGGAGAGACAATTGTAGTGCGCTGTCAGACGGTCGATCACCTGGTGCAAGTTATCGTCGAAAACCCCGGTACGCCCATTGCGCCCGAGCACTTACCGCGATTGTTTGACCGTTTCTATCGCGTTGACCCCTCCCGCCAGCGAAAAGGTGAAGGTAGCGGTATTGGCCTGGCGATAGTGAAATCGATTGTTGTCGCGCATAAAGGCACGGTTGCGGTAACGTCGGATGCGCGGGGGACAAGGTTTGTTATCACATTACCCGCTTAATCCTTCAGCAAACGGCAACTTTTATAACCAGTGTAAAAATAACGTGCCGCAATAGTCATACCTATTAATGGTAAA
AAGCTGTCACAATTCATAAAAAACCTTAATATACGCCACCCTAAACATAACCAGCGTTAATGTAAGGTTTTTGTGTGGACTGGCTTCTTGATGTTTTTGCTACCTGGCTCTACGGCTTAAAAGTAATCGCGATAACGTTAGCGGTCATCATGTTCATCAGCGGGCTGGACGATTTTTTTATTGATGTCGTCTACTGGGTACGCCGCATTAAACGCAAGTTGAGTGTTTATCGCCGCTACCCGCGAATGAGTTACCGCGAACTGTATAAACCAGATGAAAAACCGTTAGCGATTATGGTTCCGGCGTGGAATGAAACGGGCGTCATCGGCAATATGGCCGAGCTGGCGGCGACCACGCTCGACTACGAAAACTATCATATCTTTGTTGGCACCTACCCCAACGACCCCGATACTCAGCGTGATGTTGACGAAGTGTGCGCTCGCTTCCCGAATGTGCATAAGGTAGTCTGCGCGCGTCCTGGCCCCACCAGCAAAGCCGACTGTCTGAACAACGTGCTGGACGCCATCACCCAATTTGAGCGTAGCGCCAATTTCGCTTTTGCTGGTTTTATTCTGCATGACGCCGAAGATGTGATTTCACCGATGGAATTGCGTCTGTTCAACTATCTGGTCGAGCGTAAAGATCTGATTCAGATCCCGGTGTATCCGTTCGAACGCGAATGGACGCACTTCACCAGCATGACTTACATTGATGAGTTTTCAGAGCTGCATGGCAAAGATGTTCCGGTGCGTGAAGCCCTCGCCGGACAAGTGCCCAGCGCAGGCGTCGGCACCTGTTTCAGCCGCCGCGCCGTGACCGCACTGTTAGCTGACGGTGACGGTATTGCTTTCGACGTGCAGAGTCTTACTGAAGATTACGACATTGGCTTCCGCCTGAAAGAAAAAGGTATGACGGAAATTTTTGTCCGTTTTCCGGTGGTGGACGAAGCCAAAGAACGCGAGCAGCGTAAATTTTTACAGCACGCGCGGACATCAAACATGATCTGCGTGCGCGAATATTTCCCCGATACCTTTTCGACTGCGGTTCGACAAAAATCCCGCTGGATCATCGGCATTGTTTTCCAAGGCTTTAAAACCCATAAATGGACCTCCAGCCTGACGCTGAACTACTTTCTCTGGCGCGACCGCAAAGGGGCAATCAGTAACTTTGTCAGCTTCCTCGCGATGCTGGTGATGATCCAGCTTTTGCTGTTGCTGGCGTATGAAAGTTTGTGGCCCGATGCCTGGCATTTCCTTTCTATTTTCAGCGGCAGCGCATGGTTAATGACCCTGCTGTGGCTAAACTTTGGTTTGATGGTTAACCGCATCGTGCAGCGGGTGATTTTCGTTACTGGCTACTACGGCCTGACGCAGGGGCTGCTTTCCGTCCTGCGTCTTTTCTGGGGCAACCTGATTAACTTCATGGCCAACTGGCGCGCGCTAAAACAGGTACTTCAACACGGCGATCCACGTCGCGTGGCGTGGGATAAAACAACGCATGACTTCCCCAGCGTGACTGGCGATACCCGCTCGTTGCGCCCGTTAGGTCAAATTCTGCTGGAAAATCAGGTCATCACTGAAGAACAACTCGATACAGCACTGCGTAATCGCGTCGAAGGTCTACGCCTGGGCGGTTCAATGCTGATGCAGGGGCTGATTAGCGCCGAGCAGCTGGCACAGGCGCTGGCAGAGCAAAACGGCGTGGCGTGGGAATCCATCGATGCCTGGCAGATCCCTTCCTCGCTGATTGCCGAAATGCCGGCCTCCGTGGCGCTGCATTATGCGGTACTGCCGCTGCGTCTGGAAAATGACGAGTTAATTGTCGGCAGTGAAGATGGTATTGACCCGGTTTCGCTGGCGGCCCTGACGCGTAAAGTCGGACGCAAAGTGCGTTACGTCATTGTTCTGCGGGGACAAATTGTCACAGGGTTACGTCACTGGTATGCACGCCGACGCGGTCACGATCCGCGGGCAATGTTGTACAATGCGGTTCAGCATCAG
TGGCTCACGGAACAGCAGGCCGGTGAAATCTGGCGGCAATATGTGCCGCATCAGTTCCTGTTCGCCGAAATACTGACCACGCTCGGTCATATTAATCGTTCAGCAATTAACGTGTTGTTATTGCGCCATGAACGCAGTTCTCTGCCGCTCGGCAAGTTTTTGGTCACCGAAGGCGTTATCAGCCAGGAAACGTTGGATCGCGTCCTGACAATTCAACGCGAATTACAAGTTTCGATGCAATCACTATTACTCAAAGCAGGTTTAAACACAGAACAGGTTGCGCAACTGGAGTCCGAAAATGAAGGAGAATAACCTTAATCGCGTCATCGGATGGTCTGGTTTACTGCTGACGTCTTTATTGAGTACCAGCGCACTCGCAGACAATATCGGCACCAGCGCAGAAGAGCTGGGGCTGAGCGATTATCGCCATTTTGTTATTTATCCCCGTCTCGATAAAGCGCTGAAGGCACAGAAAAATAACGACGAAGCAACCGCCATCCGCGAATTTGAATATATACACCAGCAGGTGCCGGATAATATTCCGCTGACTTTATACCTTGCGGAAGCCTATCGCCATTTTGGTCATGATGACCGGGCGCGGCTGTTGCTTGAGGATCAACTGAAACGTCACCCAGGAGATGCCCGACTTGAGCGCAGTCTGGCGGCTATTCCGGTTGAAGTGAAAAGCGTTACGACAGTTGAAGAACTGCTTGCCCAGCAAAAAGCGTGCGATGCTGCGCCGACCCTGCGTTGTCGCAGTGAAGTCGGGCAGAATGCCCTGCGGCTGGCACAGTTACCTGTCGCCAGAGCGCAACTGAACGATGCGACGTTTGCTGCATCGCCGGAAGGAAAAACGCTGCGAACCGATCTGCTGCAACGGGCAATCTACCTGAAACAATGGTCCCAGGCAGATACGCTATACAATGAAGCACGCCAGCAGAACACATTAAGCGCGGCAGAACGCCGTCAGTGGTTTGACGTGCTTCTTGCCGGGCAGCTGGACGATCGGATCCTGGCACTGCAATCACAGGGGATCTTCACCGATCCTCAGTCATATATTACTTACGCGACCGCGCTGGCTTATCGTGGCGAAAAAGCACGCCTCCAGCATTATCTCATTGAAAATAAGCCACTATTTACCACGGACGCACAAGAGAAAAGTTGGCTCTATCTGTTATCTAAATACAGCGCTAACCCCGTTCAGGCGTTGGCGAATTATACGGTACAGTTTGCCGACAACCGCCAGTATGTTGTTGGCGCGACGCTACCGGTGCTGTTAAAAGAAGGTCAGTACGACGCAGCGCAAAAACTGCTCGCCACCCTCCCCGCCAATGAAATGCTTGAGGAGCGTTATGCTGTCAGCGTGGCGACCCGTAACAAGGCTGAAGCTCTGCGTCTGGCACGATTGCTGTATCAGCAAGAACCGGCAAATCTTACCCGCCTGGATCAACTAACCTGGCAACTGATGCAGAACGAGCAGTCACGCGAAGCTGCCGATTTATTGCTGCAACGCTATCCTTTCCAGGGCGATGCGCGTGTCAGCCAGACTTTAATGGCGCGACTGGCGTCTCTGCTGGAAAGTCATCCTTACCTGGCAACGCCGGCGAAGGTGGCGATTTTATCGAAACCCTTACCGCTGGCGGAGCAACGTCAGTGGCAAAGTCAGTTGCCGGGTATTGCAGATAATTGCCCGGCAATAGTTCGCTTGCTGGGCGATATGTCGCCTTCCTACGATGCCGCCGCCTGGAACCGTCTGGCAAAGTGTTATCGGGACACGCTACCCGGTGTGGCGTTGTATGCATGGCTTCAGGCCGAACAACGACAACCGAGCGCCTGGCAACATCGTGCGGTAGCCTATCAGGCGTATCAGGTTGAGGACTACGCCACCGCACTGGCGGCCTGGCAGAAAATCAGTCTTCACGACATGAGCAATGAGGATCTGCTTGCTGCTGCCAATACCGCCCAGGCGGCAGGAAATGGTGCGGCTCGCGATCGCTGGC
TACAACAGGCAGAAAAACGTGGACTGGGAAGCAATGCCCTCTACTGGTGGCTGCATGCGCAACGTTACATTCCTGGTCAGCCGGAACTCGCACTGAACGATCTCACGCGCTCAATCAATATTGCGCCTTCTGCCAACGCTTACGTTGCGCGGGCGACAATTTATCGCCAACGTCATAATGTCCCGGCCGCGGTGAGTGATTTGCGCGCCGCGCTGGAACTGGAACCGAATAATAGCAACACCCAGGCAGCGCTTGGTTACGCCTTGTGGGATAGCGGTGATATCGCACAGTCGCGGGAAATGCTCGAACCGGCGCATAAAGGGCTTCCGGACGATCCGGCACTGATCCGACAACTGGCCTACGTGAACCAGCGTCTGGATGACATGCCTGCGACGCAGCACTACGCCCGGCTGGTGATTGATGACATTGATAATCAGGCGCTGATAACCCCACTGACCCCAGAACAAAATCAACAACGCTTCAATTTCCGCCGTTTGCATGAGGAGGTCGGTCGCCGCTGGACGTTCAGTTTCGATTCTTCCATCGGCTTGCGTTCCGGCGCAATGAGTACCGCTAACAATAATGTCGGCGGCGCAGCGCCAGGGAAAAGCTATCGTAGCTACGGACAACTGGAAGCCGAGTACCGCATCGGACGCAATATGCTGCTGGAAGGCGACCTGCTCTCAGTTTATAGCCGCGTCTTTGCCGATACCGGAGAAAACGGGGTGATGATGCCGGTGAAAAATCCGATGTCCGGCACCGGTCTGCGCTGGAAGCCGCTGCGCGATCAGATCTTTTTCATCGCCGTCGAACAGCAGTTGCCGCTGAACGGCCAAAATGGCGCATCCGATACCATGCTGCGCGCCAGCGCCTCATTCTTTAATGGCGGCAAATACAGCGACGAATGGCACCCGAACGGTTCAGGCTGGTTTGCCCAAAACCTGTACCTCGATGCGGCGCAATATATCCGCCAGGATATTCAGGCGTGGACGGCAGATTATCGCGTCAGCTGGCATCAGAAGGTAGCTAACGGACAGACTATTGAGCCTTACGCTCACGTTCAGGACAACGGCTATCGTGATAAAGGCACTCAGGGCGCGCAGCTTGGCGGAGTCGGGGTCCGCTGGAATATCTGGACCGGCGAGACGCACTACGACGCCTGGCCGCACAAAGTCAGTCTCGGCGTCGAGTATCAACATACCTTTAAGGCGATTAATCAACGTAACGGAGAGCGCAACAACGCGTTTCTCACCATTGGAGTGCACTGGTAAATGCGTAAGTTCATTTTCGTATTGCTGACACTGCTTTTGGTCAGCCCTTTTTCCTTTGCGATGAAAGGTATTATCTGGCAACCACAAAACCGAGATAGTCAGGTTACCGATACCCAGTGGCAGGGGCTGATGAGTCAGTTACGTTTGCAAGGCTTCGATACCCTTGTTTTGCAATGGACCCGTTACGGCGATGCATTTACCCAGCCAGAACAGCGCACGTTATTGTTTAAGCGGGCCGCAGCTGCGCAACAGGCTGGTCTGAAGCTTATTGTCGGGCTGAACGCCGATCCGGAATTTTTTATGCACCAGAAACAGTCGTCCGCAGCGCTGGAAAGCTATCTTAATCGCCTGCTGGCTGCCGATCTCCAGCAAGCCAGATTATGGAGCGCCGCGCCTGGCATAACGCCGGATGGCTGGTACATCAGCGCGGAAATTGACGACCTGAACTGGCGCAGCGAAGCCGCCCGTCAGCCTTTGCTAACATGGTTAAACAACGCGCAGCGGCTGATTAGCGATGTTTCAGCAAAACCGGTTTATATCAGTAGTTTTTTCGCCGGAAACATGTCGCCCGATGGCTATCGCCAACTGCTGGAACACGTTAAAGCAACCGGCGTTAATGTCTGGGTACAGGATGGCAGCGGCGTGGATAAACTGACCGCTGAACAGCGTGAACGTTATTTACAGGCCAGCGCCGATTGCCAAAGTCCCGCCCCTGCCAGCGGCGTTGTT
TATGAACTTTTTGTCGCCGGCAAAGGCAAAACCTTTACAGCGAAACCGAAACCGGACGCAGAAATTGCCTCGCTGTTAGCGAAACGTTCCTCTTGCGGTAAAGACACTCTCTATTTCTCTCTGCGCTATTTGCCCGTCGCGCACGGCATTCTCGAGTATTAAATCTCCTCCAGGTAAGTCGGGTACGACCTGGCTTACCTCTTTCGCTCTCATCAGAATTCCATCAAGATTAATTGCTAAAAAGCGTGCTAAAGAAAGAAGATCGTTATCGGCGAACGGGCCGATCACTCATGCAGGAAGCATAGATCCTTTGAGCGGAGTTTCTGATATCTGAGGAGTGCGAAATGCAATTGAGCAGCAGTGAACCTTGCGTGGTGATCCTGACCGAAAAAGAGGTAGAGGTAAGCGTCAATAACCATGCTACGTTTACCCTTCCGAAAAACTACCTGGCCGCCTTCGCGTGCAACAATAACGTCATTGAACTCTCAACGTTAAATCACGTATTAATCACCCACATCAACCGTAACATCATCAACGATTATCTGTTGTTTTTAAATAAGAACTTAACCTGTGTAAAGCCCTGGTCGCGGCTGGCAACCCCGGTTATCGCTTGTCATAGCCGTACACCGGAAGTGTTCCGGCTAGCCGCCAACCACAGCAAGCAGCAACCCAGCAGACCCTGCGAAGCGGAGTTGACGCGCGCATTGCTTTTTACCGTATTGTCTAACTTTCTTGAGCAATCGCGGTTTATTGCCCTACTGATGTATATCTTACGCAGCAGCGTCCGCGACAGCGTCTGCCGCATTATTCAAAGCGATATTCAGCATTACTGGAATCTGCGAATTGTCGCCAGTTCGCTATGTTTAAGCCCCAGCCTGCTCAAAAAGAAATTAAAAAACGAAAATACCAGCTATAGCCAGATTGTCACAGAGTGTCGTATGCGTTACGCCGTACAGATGTTATTGATGGATAACAAAAATATCACTCAGGTGGCGCAATTATGTGGCTATAGCAGCACGTCGTACTTTATCTCTGTTTTTAAGGCGTTTTACGGCCTGACACCGTTGAATTATCTCGCCAAACAGCGACAAAAAGTGATGTGGTGAAGGGCAAAGCGGAAACGGATAAGACGGGCATAAATGAGGAAGAAATGGCGCGCCCTGCAGGATTCGAACCTGCGGCCCACGACTTAGAAGTTCCTAGAACGACATTTTAAGTCAACAACTTACCGCGCCATCTCTGCGCTCACACGTCCCACTACCTCAAAACATGTAAAGCCTTGCAAGCCATTGCGAGGCCTTATGTGTCTCAGTTTTGTCCCTCTTTTTTGTACTAAAAAACATAGTAATTGAGGATAAACCTCATGCTATTTTCGCTTATATGCCTCTAAAGGCATGGCACTTAAATAGATAAAAGCACCACAAAAGCATAAAAAAACCACACAGTAAAACCGAAATATGAAACAATAACAGATAATTAAACCAAAAACAGATAGCGCATTGTGATAATCATTCAATACTAAACAAAATATAAACAGTGGAGCAATATGTAATTGACTCATTAAGTTAGATATAAAAAATACATATTCAATCATTAAAACGATTGAATGGAGAACTTTTATGCGGGCGAAACTTCTGGGAATAGTCCTGACAACCCCTATTGCGATCAGCTCTTTTGCTTCTACCGAGACTTTATCGTTTACTCCTGACAACATAAATGCGGACATTAGTCTTGGAACTCTGAGCGGAAAAACAAAAGAGCGTGTTTATCTAGCCGAAGAAGGAGGCCGAAAAGTCAGTCAACTCGACTGGAAATTCAATAACGCTGCAATTATTAAAGGTGCAATTAATTGGGATTTGATGCCCCAGATATCTATCGGGGCTGCTGGCTGGACAACTCTCGGCAGCCGAGGTGGCAATATGGTCGATCAGGACTGGATGGATTCCAGTAACCCCGGAACCTGGACGGATGAAAGTAGACACCCTGATACACAACTCAAT
TATGCCAACGAATTTGATCTGAATATCAAAGGCTGGCTCCTCAACGAACCCAATTACCGCCTGGGACTCATGGCCGGATATCAGGAAAGCCGTTATAGCTTTACAGCCAGAGGTGGTTCCTATATCTACAGTTCTGAGGAGGGATTCAGAGATGATATCGGCTCCTTCCCGAATGGAGAAAGAGCAATCGGCTACAAACAACGTTTTAAAATGCCCTACATTGGCTTGACTGGAAGTTATCGTTATGAAGATTTTGAACTCGGTGGCACATTTAAATACAGCGGCTGGGTGGAATCATCTGATAACGATGAACACTATGACCCGGGAAAAAGAATCACTTATCGCAGTAAGGTCAAAGACCAAAATTACTATTCTGTTGCAGTCAATGCAGGTTATTACGTCACACCTAACGCAAAAGTTTATGTTGAAGGCGCATGGAATCGGGTTACGAATAAAAAAGGTAATACTTCACTTTATGATCACAATAATAACACTTCAGACTACAGCAAAAATGGAGCAGGTATAGAAAACTATAACTTCATCACTACTGCTGGTCTTAAGTACACATTTTAAGAACGCCAACTAAAATTTCCCCGAGGTGAAAATCGCCCCGGGGAATAACTAGCCATTTCAATGTAACAATTAACCCTTAAAATAAACCCAGAAGGTTATTAACTAAATCACATAGAAAACCATCAATTATAGTATGTATAAAATAGGCGACAGCAACCCAATTACAAATTAATGGTTCCAGAATATCACATCAAAAAAAACGCTGTATAATATTATAATTAACATGTAGACAACTTGTAATAAACATTATCAGTCAATTGTTTTGTTTATTCCATCTGTGACGCCGATTATTTTCTCAAAATAATGAGATGGCGTGACACCATAATAATCTTTAAATGCACATATGAAATATGAAGTACTGTTATAGCCACATTTCTGGGCTACGACATTGATAGAATAAGAGTTTGAAGTTATGAGTTTTTTTGCATACCTCATCCTAGTATCTCTCAATATTTCAGTAAATGACGTTCCTTCATCCCTTAATCTTTTTTTTATTAAACTTTCACTCGTATAAATCAATTCCGCAATATCTTTTAAATGCCATTGCCGCTCAATATTAAAACTGATTATTCCAGTAATTTTACAGGTAAATGTATTTATATTTGTTAGTATAAATGAATTTACACTTTCGCGTTTTTTGAACATGGCAAGTAAGGATATACATAGTCTTTCTTTTAACCAAAGGGAGTGTGAGTCTGCTATTTTAATCCCTTCAAACAGAGAAAAAACAAGCGATAATGGAGGTTCCTCTTCAGCAATATAGCCATTCTTATCAAGAGTAAATTTGCCAGGCAGCTCATTATTCACGTCGATAAAAAAGGATAAACATGTTTTCTTATCTATATCAACAATTCTTAGTTTAGAGGGGCATACTGGTAACTCCCTTCTAATTTTGTCGCTTACAATAAACAATGAATTTTTTTTGAACGAGATAACTCTCCTGTTTATAATTAAATCAAATGATTGACAGATGAAAACTACGGAGCAAACATAATCCATCTTGCACCTATCATAAAATTAAAACAAGTTGATAGTAGTCAAATAACAACCAATTAAATACACAATCATAATCAGGATGATGTGCATTTATATTTTTATACACAAAATTATAGTTTGCAAATTTTAATAAATTTCATTTAAGATTAAATTATTATATGTATGTTGTTTTTTATTCTAACTTATTTCAAAGTTACATTTTTCAACGCTTACTATGCTTTTTATTAACATAAACTCACTACAACGCACCTGAAACCTCTTGCTATATATATGTCAACCGTTTGAATTTAAAATAAAAAGAGTATCATTTTTACTTGCATTTCTTATCAAGTCACATTCAACAACAGTAAAAAAACATTATTAGAACCATTCAATTAACAAAAAACCAACATCCAGCTTGCTTAATTTTTCTTTATTAAACGATATTGAAA
ATCAATTGATAAAATACATCTAAACAACCTTTTGGGGCGCAAAAGCATAACATCAAACAAACAAATAACACACCGAAAAAACTCACAATTAATAACCTATGATATACATACTGTTTATTATGGTTGAATAAGCCACTCGATATCTGGTGCTACGGAAGTGTCCACACGGTTTAGCAGCACCCGATACTTTTTTCAGGCTTCCAGCAACGATCTTTCTTCCTCCGTTGCGATTTCCAGATCTACAGCATCCTGCAGTGGCGCAATATACTCACTGAATTCCTGGATGTAGAACTGTGTGGTGACGGTCTTCCAGCCATTCGGCTCCTGCTGTATTGAAGCATACCAGGCTATTTCAATATCGCTATGCTGCGGCAGCATTTAACCCCTTGTAATTCATTGCCATAATTGATTTAATTCACAAATAAAACTATAACATGGTGAAATTAATAAAAAAACACAGATGATGGGGCTAGAATTTACACACCACTTACCCTAAAGCTTTATGACTGGTGGGTTTTGGGAGTATCAAATCGGCTTGCATGGGGATGTCCTACAAAGGAACACCTTCTTCCACACTTTCTGGAACATTTAGGTAACAACCATCTGGATATTGGTGTTGGAACTGGGTTTTACCTTACTCACGTACCTGAGAGTAGTCTGATATCTTTAATGGATTTGAACGAAGCTAGCCTGAACGCGGCATCTACAAGGGCTGGGGAATCAAAAATTAAACATAAAATTAGCCATGATGTTTTTGAACCTTATCCCGCGGCGTTACATGGTCAATTTGATTCCATTTCCATGTCTTACCTTCTTCACTGCCTGCCTGGAAATATATCTACAAAAAGCTGTGTAATACGCAATGCGGCGCAGGCCTTAACTGACGATGGAACTCTATACGGAGCCACAATTCTTGGCGATGGAGTTGTGCACAATAGCTTCGGTCAAAAACTGATGCGCATTTACAATCAGAAAGGCATCTTTTCAAACACAAAAGATTCCGAAGAAGGCTTAACACATATACTCTCAGAGCATTTCGAGAATGTTAAAACCAAGGTTCAAGGTACTGTAGTAATGTTTTCCGCTTCAGGAAAAAAATAGCATCCAACCGCAGCATGTTCTTGCTTAAGACGTGCTGCGGCATAATCCCAATGATTACTCCCTGACAGGGTTCGTAGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATATTTTTTCCAGGCTTCCAGCAATGAGGTTTCTTCCTCCGTTGCGATTTCCAGATCTACAGCATCCTGAAGTGGCGCAATATGCTCACTGGCTACCTGCATCAGGCTGTTTTTTGTTTCTTCCGCCTCCCGAATCCGGAACAGTTTTTCTGCTTCTGCATCTTTCACCCAGGCTGTACCGTTCCACTTCTGAAACTCCCCTTCCGGCGATAACCAGGTAACATTTTCCGGTAACGGACCGAGTTCAGAAATAAATAACGCGTCCCCTGACGCTACGTCATAAACCGTTTTACCCCGATGATCTTCAACGAGATGCCACGATGACTCATCACTGTTGAAAACAGCCACGAAGCCAGCTGGAATAAGGGTGTTGCGCTGCTTATGCTCTATAAAGTAGGCATAAACACCCAGCAGCATTTTGGAATAACCGACACGGGCAGACTTCACCACATTCACCTCACGGATGTAGTCGCTGCCCATCGCATTCATGATGGCCCGCTGAAAGGGCAGTGTTTCCCAGCGCCCTTCCTGGTATGCGGATTCTTTCGGGAGATAGTAATTAGCATCCGCCCATTCAACGGCGGTCTGTGGCTCCGGCCTGAACAGTGAGCGAAGCCCGGCGCGGACAAAATGCCGCAGCCTGTTAACCTGACTGTTCGATATATTCACTCAGCAACCCCGGTATCAGTTCATCCAGCGCGGCTGCTTTGTTCATGGCTTTGATGATATCCCGTTTCAGGAAATCAACATGTCGGTTTTCCAGTTCCGGA
AAACGCCGCTGCACCGACAGGGGGATCCCGTCGAGAATACTGGCAATTTCACCTGCGATCCGCGACAGCACGAAAGTACAGAATGCGGTTTCCACCACTTCAGCGGAGTCTCTGGCATTTTTCAGCTCCTGTGCATCGGCCTGCGCACGCGTAAGTCGATGGCGTTCGTACTCAATAGTCCCTGGCTGGAGATCTGTCTCGCTGGCCTGCAGCAGTTCTTCAACCTCCCGGCGCAGCTTTTCGTTCTCAATTTCAGCATCCCTTTCGGCATACCATTTTATGACGGCGGCAGAGTCATAAAGCACCTCATTACCCTTGCCACCGCCTCGCAGAACGGGCATTCCCTGTTCCTGCCAGTTCTGAATGGTACGGATACTCGCACCGAAAATGTCAGCCAGCTGCTTTTTGTTGACTTCCATTGCACATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAAAGCTCGCTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGGGGGTATTTTAAATAAATACATTAAGTTACGACGAAGAAGAACGGAAACACCTTAAACCGGAAAATTTTCATAAATAGCGAAAACCCGCGAGGTCGCCGCCCCGTAACCTGTCGGATCGCCGGAAAGGACCCACAAAATGATAATAATTATCATCTACATGTCACAACGTGCATCTACGCCATCAAACCACGTCAAATAATCAATTATGACGCAGGTATCGTATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACAATACAAATCAGCGACACTGAATACGGGGCAACCTCATGTCAACTAAGAACAGAACCCGCAGAACAACAACCCGCAACATCCGCTTTCCTAACCAAATGATTGAACAAATTAACATCGCTCTTGAGCAAAAAGGGTCCGGGAATTTCTCAGCCTGGGTCATTGAAGCCTGCCGTCGGAGGCTAACGTCAGAAAAGAGAGCATATACATCAATTAAAAGTGATGAAGAATGAACATCCCGCGTTCTTCCCTCCGAACAGGACGATATTGTAAATTCACTTAATTACGAGGGCATTGCAGTAATTGAGTTGCAGTTTTACCACTTTCCTGACAGTGACAGACTGCGTGTTGGCTCTGTCACAGGTTAAGTAGTTTGAATGATTAGCAGTTATGGTGATCAGTCAACCACCAGGGAATAATCCTTCATATTATTATCGTGCTTCACCAACGCTGCCTCAATTGCCCTGAATGCTTCCAGAGACACCTTATGTTCTATACATGCAATTACAACATCAGGGTAACTCATAGAAATGGTGCTATTAAGCATATTTTTTACACGAATCAGATCCACGGAGGGATCATCAGCAGATTGTTCTTTATTCATTTTGTCGCTCCATGCGCTTGCTCTTCATCTAGCGGTTAAAATATTACTTCAAATCTTTCTGTATGAAGGTTTGAGCACGTTGGCCTTACATACATCTGTCGGTTGTATTTCCCTCCAGAATGCCAGCAGGACCGCACTTTGTTACGCAACCAATACTATTAATTGAAAACATTCCTAATATTTGACATAAATCATCAACAAAACACAAAGAGGTCAGACCAGATTGAAGCGATAAAAACGATAATGCAAACTACGCGCCCTCGTATCACATGGAAGGTTTTACCAATGGCTCAGGTTGCCATTTTTAAAGAAATATTCGATCAAGTGCGAAAAGATTTAAACTGTGAATTGTTTTATTCTGAACTAAAACGTCACAATGTCTCACATTATATTTACTATCTAGCCACAGATAATATTCACATTGTGTTAGAAAACGATAACACCGTGTTAATAAAAGGACTTAAAAAGGTTGTAAATGTTAAATTCTCAAGAAATACGCATCTTATAGAAACGTCCTATGATAGGTTGAAATCAAGAGAAATCACATTTCAGCAATACAGGGAAAATCTTGCTAAAGCAGGAGTTTTCCGATGGATTACAAATATCCACGAACATAAAAGATATTAC
TATACCTTTGATAATTCATTACTATTTACTGAGAGCATTCAGAACACTACACAAATCTTTCCACGCTAAATCATAACGTCCGGTTTCTTCCGTGTCAGCACCGGGGTGTTGGCATAATACAATACATGTACGCGCTAAACCCTGTGTGCATCGTTTTTAATTATTCCCGGACACTCCCGCAGAGAAGTTCCCCGTCAGGGCTGTGGACATAGTTAATCCGGGAATACAATGACGATTCATCGCACCTGGCATACATTAATAAATATTAACAATATGAAATTTCAACTCATTGTTTAGGGTTTGTTTAATTTTCTACACATACGATTCTGCGAACTTCAAAAAGCATCGGGAATAACACCATGAAAAAAATGCTACTCGCTACTGCGCTGGCCCTGCTTATTACAGGATGTGCTCAACAGACATTTACTGTTCAAAACAAACAGACAGCAGTAGCACCAAAGGAAACCATCACCCATCATTTCTTCGTTTCTGGAATTGGGCAGAAGAAAACTGTCGATGCAGCTAAAATTTGTGGCGGCGCAGAAAATGTTGTTAAAACAGAAACCCAGCAAACATTCGTAAATGGATTGCTCGGTTTTATTACTTTAGGCATTTATACTCCGCTGGAAGCGCGTGTGTATTGCTCAAAATAATTGCATGAGTTGCCCATCGATATGGTCAGCTCTATCTGCACTGCTCATTAATATACTTCTGGGTTCCTTCCAGTTGTTTTTGCATAGTGATCAGCCTCTCTCTGAGGGTGAAATAATCCCGTTCAGCGGTGTCTGCCAGTCGGGGGGAGGCTGCATTATCCACGCCGGAGGCCGTGGTGGCTTCACGCACTGACTGACAGACTGCTTTGATGTGCAACCGACGACGACCAGCGGCAACATCATCACGCAGAGCATCATTTTCAGCTTTCGCATCAGCTAACTCCTTCGTGTATTTTGCATCGAGCGCAGCAACATCACGCTGACGCATCTGCATGTCAGTAATTGCCGCGTTCGCTAGCTTCAGTTCTCTGGCATTTTTGTCGCGCTGGACTTTGTAGGCGATTGCGTTATCACGGTAATGATTGACCGCCCATGACAGGCTGACGATGATGCAGATAATCAGAGCGGATATAATCGCGGTTACTCTGCTCACTGTTGCCCCCACAAACAGACTTCACGCTCAATCTCACGACGAGTCATCAGGCCTTTCCATTGCTTACCGCCAGCGTATGTCCAGCGACGCAGCTGATCACATGCGCCTTTGATATCGCCCTGGTTTATTTTGCGAAGAAGCGTCGATGTTCTAAAATTGCCAGCACCCACGTTGTAAACGAATGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATATACGGGTTAATTTGTCTGGCGACAGTGGCAAGGTCTTTATTCAAGAGTGCTTTGCATTCTGCTTTGGTATACGTTTTACCGAGCATGATGTCTTTTCCGGTGTGTCCGTGACATACAGTCCATACACCAACAATATCTTTGTATGGTATGTAGCTGACACCTTCCAGACCATCGTTACCACTTGGGCCAGTGATTAACACTGATGCTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACGGCTTTTCGTAATGATGGAGGCATTATTCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAGGCTACCCAGCACACCTATTGCTGCCCACTGTGAGGGAGTGACTTTATCTAGCAGCTGTAAAAACCAGTACCCGGCACTACCTGCTGAGGTGCCATAGGCGACACCCGTTGTTAACTTATCCATGGATTTCATAACCCCACCTCGCAGACAAAGCGGGTGTAAATTGAGGGAATACTACGAAACGTAACAGACTCGGAGTCAGTGAATAACTCAGGTATTGGGTTATCAGCTAATATCGAGACTCAAAAAATGGAAAAACCCGCTCGACGGCGGGTTT
AAGCTGTGTGACGAAGTAACCACTCTTAACAGCATAACCAATTTTTTACGTACGTAAACCACTAAATGATATTTGAGAGAATGCTACCGAGTATTGAAAACACCACTACAAATACATAAGCAAATCTCAACAAATAACCAAAAAATAATTTCCAGTGTTATTTTTAGCCGGTTTAAATTGAACCTTCAAATTATAGAGCACTTATAAATAACAGCCGTTAATATAAATTGGCTAATAGATTTATTTTTATTCAGCCAAGAGCCATGAATAGGATTCGATAGAAAAAAGTTCAGATAAAAATAGAGATCTACTTCACAAATCAAACGAGAAACCAAAACTTACATCTTGAAATAATCACATTGATTAGATGAATATTTATCGCGCAGTGACATCATTTTTTAATAATAGTTCAAAAAAAGGGCTCACGATGAAAAAATTAACAGTGGCAATTTCTGCTGTAGCTGCATCAGTACTAATGGCGATGTCTGCTCAGGCAGCTGAAATTTATAATAAAGACAGTAACAAGCTGGATCTGTACGGGAAAGTTAATGCTAAGCACTACTTCTCCTCTAATGATGCAGATGATGGTGATACTACTTATGCCCGTCTTGGCTTCAAAGGTGAAACCCAAATCAACGATCAACTGACTGGTTTCGGTCAGTGGGAATATGAATTCAAAGGCAACCGCGCTGAATCTCAAGGTTCCTCCAAAGATAAAACCCGTCTTGCCTTCGCTGGCCTGAAATTCGGTGACTACGGCTCCATCGATTACGGCCGTAACTACGGTGTAGCATACGACATCGGTGCGTGGACTGACGTCCTGCCAGAATTCGGTGGTGACACTTGGACTCAAACCGACGTGTTCATGACTCAACGTGCAACTGGTGTTGCAACCTATCGTAACAACGACTTCTTTGGTCTGGTTGATGGTCTGAACTTTGCTGCTCAGTACCAAGGCAAAAACGATCGTAGCGATTTCGATAACTACACTGAAGGTAACGGTGATGGCTTCGGTTTCTCTGCTACCTATGAATACGAAGGATTCGGTATCGGTGCAACTTATGCGAAATCTGATCGTACCGACACTCAAGTTAATGCAGGGAAAGTTCTTCCTGAAGTATTTGCTTCCGGTAAAAATGCAGAAGTTTGGGCCGCAGGTCTGAAATATGACGCTAACAACATTTACCTGGCCACTACCTATTCTGAAACCCAGAATATGACTGTATTTGCTGATCACTTCGTTGCTAATAAAGCCCAAAACTTCGAAGCTGTTGCACAATATCAGTTCGATTTCGGTCTGCGTCCGTCCGTTGCTTACCTGCAATCTAAAGGTAAGGATCTTGGAGTATGGGGCGATCAGGACTTAGTCAAATATGTTGATGTAGGTGCAACCTATTACTTCAACAAAAATATGTCTACTTTCGTTGATTACAAAATCAACCTGCTTGACAAAAATGACTTCACTAAGGAAGGTGCGAACAAGTCCCTGATATGAGATCATGTTTGTCATCTGGAGCCATAGAACAGGGTTCATCATGAGTCATCAACTTACCTTCGCCGACAGTGAATTCAGCAGTAAGCGCCGTCAGACCAGAAAAGAGATTTTCTTGTCCCGCATGGAGCAGATTCTGCCATGGCAAAACATGGTGGAAGTCATCGAGCCGTTTTACCCCAAGGCTGGTAATGGCCGGCGACCTTATCCGCTGGAAACCATGCTACGCATTCACTGCATGCAGCATTGGTACAACCTGAGCGATGGCGCGATGGAAGATGCTCTGTACGAAATCGCCTCCATGCGTCTGTTTGCCCGGTTATCCCTGGATAGCGCCTTGCCGGACCGCACCACCATCATGAATTTCCGCCACCTGCTGGAGCAGCATCAACTGGCCCGCCAATTGTTCAAGACCATCAATCGCTGGCTGGCCGAAGCAGGCGTCATGATGACTCAAGGCACCTTGGTCGATGCCACCATCATTGAGGCACCCAGCTCGACCAAGAAC
AAAGAGCAGCAACGCGATCCGGAGATGCATCAGACCAAGAAAGGCAATCAGTGGCACTTTGGCATGAAGGCCCACATTGGTGTCGATGCCAAGAGTGGCCTGACCCACAGCCTGGTCACCACCGCGGCCAACGAGCATGACCTCAATCAGCTGGGTAATCTGCTGCATGGAGAGGAGCAATTTGTCTCAGCCGATGCCGGCTACCAAGGGGCGCCACAGCGCGAGGAGCTGGCCGAGGTGGATGTGGACTGGCTGATCGCCGAGCGCCCCGGCAAGGTAAGAACCTTGAAACAGCATCCACGCAAGAACAAAACGGCCATCAACATCGAATACATGAAAGCCAGCATCCGGGCCAGGGTGGAGCACCCATTTCGCATCATCAAGCGACAGTTCGGCTTCGTGAAAGCCAGATACAAGGGGTTGCTGAAAAACGATAACCAACTGGCGATGTTATTCACGCTGGCCAACCTGTTTCGGGCGGACCAAATGATACGTCAGTGGGAGAGATCTCACTAAAAACTGGGGATAACGCCTTAAATGGCGAAGAAACGGTCTAAATAGGCTGATTCAAGGCATTTACGGGAGAAAAAATCGGCTCAAACATGAAGAAATGAAATGACTGAGTCAGCCGAGAAGAATTTCCCCGCTTATTCGCACCTTCCCTAAAGCACTCGGTGTAAGCACTGATGACATCGTTGCTGTAGGTCTGGTTTACCAGTTCTAATCTGATTACGAAAAAGATATGTTGCGGGAGGCGTTGCCTCCCCAACATATAAGTGGCTCCCTCAAGCCACTTCCTTTAGAAGCACAACCTTGCTTCTAACTATATAAACCTTCTGTTATATATTACCCTTTATTTTTGGGGGCGTCTCAACGCCCCATTTTTAATAATTTTTAGTAAACAATTGGCATATTAATTAGAGTTATTAACAACGATATCCATCTCTAACCGGATATCTAATGCCATTAACATCCCTTCAATTATGCCCTCAGCCTTCTGTAACCTTTTCCCGATATAACCATCAGAGCAGCAATGCTTACCTGCCAGTGACATGAATGTCATACCGACTACATAATAATCTACTAATAAATCGTGCAAATCGCTGTTGTTCTTTTTCAGACGGGCCATGCACCCGCAAATGATCATCGCGTCATCGTCACAACATTGCGGGCGAGATTTTACTTTTGAAGTAATTAATCCCTTAAAACCGGCGGCAATGGACGACCAGGTCACATCTTCATGATTATTAGCCGCCCACGCTCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTACTCCACAAAAATCAGACCAGAACGCCAATTACAAGCAAAAATCAACAAAACAGTATTAGTTGATTGTTATCTCTGACTTCATACTCCTGCTCCTGTCAGGGTTTTGGCGTAATTCTTCAGTATTCGGTAATCGGTCAAAACAGAACCGGGGAAACGATATAAGCGCAGACGCCCCCAGCGGTGGCGAAGAAGTTCTGCCATATTAAACTCAAACATCATTCATTCCCCATTTCGGTGATGGTCAGTTCCAGCCTCCCACCTTTGGTAACAGGCATCTTCACAACGCGGTAATCAACGACCTGAGCATCATCCAGCCAGAAACCTGCTTTAGTGAGTGCGTCAAAAGCGGCTTTTTGCAGATTATCCAGGTCACGGCGACGGCGATCCGGCATGTGGCACTCAATGCGGATTTTCACAGGCATAGCCAGGCCGATATCCAGCATTGCGTTTTTAATGATTCGGGCGACGTTATCGCGGTATGCCTGCCCCTCTGCGCTGACGTGCGTGCGCCCGCGATTATGGCGGTAATAGCGATTATTGCTCGGAGGCCAGGGTAATGTGATGCTGTAGGTATTCACGCCTTAATAACCCCCTCTTTCAGCCAGATAACCTGTGTTCTCGCCATACCTTCCAGCGCGCATTCTTTTGCATATCCAGCGTCAACAAAATGCGTGCGGCGGTCGATTTCGTCGTGGC
AGGCAGAACATGCAATGGTGGCAATCAGGTCTGGCGGTTTCGTACCGGTGCCGCACAATCCAGTCAGCCGGATATGTGCCAGTACAGACGTTTCAGGGTTGCCATTACATACGCCAGGGATTCTTACCTGGCATTCCCGACCACGCGCTGCTTTTCTCAAATCAGCCATGACTCCTCCTTGCTGCCAGTCGCAACCATTTTTTATCAACCAAGCTGGCGGTATATCCGAGCAGTGTTGGTATTTCGGATGGCTTCAGCTCAGGTTTACGCTTACGACGATTTGGTACTCTGTAGATGTGTCCGTTCATGACACGAATAAGCGGTGTAGCCATTACGCCTCCTGCTTGTCGCGCAGCAGCTGGAACTCGCAGCTCTGTGGAATAGTCAGGTGGCAACCAATATTCATCGCCCAGGCTTCAACCTTACACAGGAAGACATACATCTCTCCGGTATCAAGATCGGAGGTATGGCGTAACGACTGGATAGTGGTGATATCACCGGTTACGACATCAACCAGGTCTTTGGTTTCATAACCGAGATATGTGTGTTTGAGAGCATCTTTTACCCAAGCTGGAGTGGCGAACGTTTTACCCCTGCTGATGAGGTATTCACTGATTTCGCTGTACCACATGTGGCTGAGTGCATTCTGGGAAAGACTGCGTTTCTCACGCCACGGTTTAAGCACCATGCGAAAGCATTTGCCCTCCTCCAGATAAGGCTGGATCTGCCGACCGATAGCGGTGAAGTTGCCGCGATGTAATTTGATGCCGTCTTGTGAGAGGTTCACGCTTCCCCTCCGCAGAGGTCAAACGCTAGATGCAAAGAATTGCAGGTGCATTTCTGCATCTGTGAAGGGAGAAGAGAGTTTGGATTGTATGTGCGCATAAACGTCCCCGTTTAGCGCAGAAGTCACCGGAGTTGTTCAGGCTCCGGTGACATAATTATGCCGTGTTGATTTCCCAAAATCAAAATCGATAGAATTGCTCCTTCTTAAAACACTTTTACTCTCTGGAAGCTTTTCTTATCTCTCTTGGTGTTATATTAAAACGATTATGAAATCTTTCAGTAAAACGAGAAGGACACTTATAACCATTTTCTCTGGCAATCTCGCTTATAGGTTTTACCGTCGTTTGTATAGCAGACAACGCATTATTTAACCTCACATCGTCCAGTATACTTTGGAAACTTACCCCCTCGCTTGCTAGACGGCGATGTAATGTAGAAACAGAAATGTAGAGATATCGAGCAACCTTGTTTGCTGTCCATTTTGTGCCGGGTTCGGATAGCAGCAGGTTATAACAACGACTTATCAATGATTGTTTACTATATGATAAAAGTAAATGATTAACATGATTCACTCCTAACGAAAGTAGAACGCCCATTGCTAAGTGCTCCTGAATTTTAGTTGAGAAGCCTCGGGAAACAGATGTTTTTAGTTGCTCCCAACAATATATTAACTCAGGATTCTGAGGTAAAAAGAAACTTGTTTTGTTACGTATTTGATCAGTTACCGTATAAAGTTTTTGGAAACTCTCAATTAAATCAATGGGTAAGTAAAGCATTTCTGCAAGATAAAGCCCTGCTTCAGGATAATTCTCAATATAAAATTCATAACCACAAGGAAATAATATTATTTGATTATTATCAACAGTTAAAGTATGCGTCTCCCAATTGATAACTTTCTTTCCCTGACGGATACGACACAAAGCTGGCATAAGAGGCTTAACCCTATGAATCTCATGATGTTTATGCATCCGTATTTCTTCGATCTTTAAGTTAGTCTTACCTCTTGCCAGCATACTCTCACCCTACTTTATCTCATAAACTGGTGTTATCTCAGCGGTTGCGATTTTATTAGCATTAAGCATATAACCAACTAACGCTCCGCTGGAGTTAGAATCTACAGGAATCTTTTCAGTTTTTAGAGCCCATACTTTAAACTGGTAATGATGTGGTTTATCTCCTTTAGGAGGACATGCGCCACCAAACCCAGCATAGCC
AAAATCATTTCGGCCTTGAACAGCACCAGTCGGCAGTTTTGTTCCATCACGTCTCCCTGCATCAACGGGCAAATATGTTACTGTTGCTGGAATATTAACAACAGTCCAATGCCACCAACCACTGCCTGTAGGTGCATCTGGATCATATACAGTTACGGCAAAGCTTTTGGTACCTTCAGGAACACCAGACCAGGTTAATGAGGGCGATGTATTACCACCTTCACACCCAAATCCAGAAAAGACATGAGACGTTGTAAGTTGCTCTCCTGTTTTTATTTCATTACTAGTGACCTGAAATGCTGCAGCCTGCGCAGAAAATGTTATGAATGCCAATACAGTTGAAACGATAAGTGTTTTCATAAAAACCTCTTTGTTATGACCTATCGTTATTTTATTTGATATTCCTTTATCTCATTATGCATAAAGGCGCAATGTTCATGCAAAAGCAATCACAATTGTACCCCCAACCCAATTATTTGCCACAATATACACAAAGCACATTGATACTATCTAAAAACTCTGCTTTATTATTAGTAACACCTACGAAAGTCGGTGTTATTTTTTAACCTACCATTCAAAATACGTGACATACACCATTTTGCTCATAATAATTTGTCACGTATTTTCAGTATTTGAATCTGCGACCAAGAGTTCTCACCTAACAAATGATTAAGATTGTATAGCTCATTTACTACCCCAATACAGCCGTACAAAACTCGCTTGTGGGAGCAAACAAAGTAATTACCCATTAAGTTTCGTCAAAGATAATTAATTCTGTCTTGCACTTTATCACCATAGCATAACTTAAAATCCGAGATCATTATTTAGAAATAAATCTCACCATCAACCATATATTTGAGAGCACTTATCGCCTGCTGGGCGGATATTACTTTCATTAAAGGATAGTGTTTAAAAACAATGCCATTCATAAAATAGATATCACAGGTTTTATTATCCGTATTAATTATGATTTTTTCGAATGTTTTATAGGCAAGTGTACGGCATAACTCTCGTCCATTTTTACTGGTTAAGTCAATAGCATAAAAATCACTGAATGAATTTACACCTTTACTCTTCAAAGTTTTCAATGATACCGAAGCCCTTCGTAATTCCTTATCTAATAGTCTTATTTTCTCTGCTATAGCGGTAACTTCAGGCGCGACAGACAATGCAACGATTAAATTATTAATTTTCATCTGAAGCTCAATAATTTTTAACTCTAAAGTTTCATTAGCATCTTTCTTGTTTTCAACTGGTTGAATTTTGCTACAATTAAAAAGCAATTCATTAATGATATTATAATCAACCAAATCTCTTTTTATTGATGGCCTGTCACATCGATGTAATCTTCTCATCGGACAAACATAATAGCCATGCAAACTTCCAGATACCGCATGAACAATCATGGTATTACCACAAGCCTCACACTTCATAACTGTTCGAAGTAGATTTATTAGCATAGGATTCTTGCTACTATTGCTAATACCAAAAGGTGCCAACCGAATTTCCTGTACAGCGTAAAACAAATCATCTGATATGACTCTGGGATAATAGCCAGCGATTTCACTTATCCCTTTCCCTCTTGCACGATATGAAGGTACGCAAATACCTATCAGAGCTTTATTCGCTAATAATTTTTCAATTACAGAAGGTCCCCATGCACTTTCTTTTCCTGAGAAATTCTTTACAGCATGATCATTTAAATACTTGGCTATTGCATTCAATGAGCGCCTTTCCATCCTGAGTTTAAAAATTAGCTCAATAGTTTTCACCCTGTCGGGGTCTGGAACAAAAGCCGTTCTTTTGTCATCTAAGGAGAGCCATCTCGGACAAGACGCCGTCATAATCGTACCTGATTCCAGTGCATCCTGCCGTTTTTTCTTCCATGATAATTTAACCCGACTTGACTTTATCTCGCTTTCTTCATTTGCCCTTTGTGCTATAAGTATGGCTTTTATTAATGAATATGGCTCATTCAAAGAGTCAATATTATAGACTGT
ATTGTCGCAAAGAGTTATAACATCAATACCGTGATTCAAAATCAATTTCAGACGTTCAATCGCTTCACCGACTTTTTCTCTTGAAAGTCTGTCCAGACTTTCAACTAACAATGTAGTTCCTGGCAATATATAACCATGCTCTATAGCATCTAAAAATTCCGAAAAAGCTCCTGATTGTGCATGCTTTCCTTTGAATGCACTTAATCCTAAATCTTCATATGTTATGGTATCAAGATAATAATCACTATTTACCTTTAACCATTCAGCAATAAGTCTTCTCTGTCGGTTTAATGAGTCGCCAGACATCTGACCTGGTGATGAAAATCGCATATATGCTATGGCTTTTTTCATGGTGACACCTGCTAACGTATGCTTTTATAAACCTTAGTGGTGGGATATAATTTTTGTTTATTTTTTATTTAAAAAGACAATTAAGGTCACATTATCTTGAATATACAACAATAATCGTATTGCAATTTTCTTACGCCATAATCTTGAAAGCACAAAAGAATACATAAAAAATAAAGACATTAACAAAAAGCATAAAACGAGGCTCATATAAATATAAGAGCCTCCATATTTTAGTCGTTTAGAAACAAATTATTTTAATGTGGTGTGCTTCGTGACAATAAATTAATAATCAACACACCGGCACAAATCAACATCATGCCTATAATGGCTGGCAGGTCCAGCCGTTGGCCGAAAAATCCCCATGACAGTAAGCTAATCAGGACAATACCGACTCCTGACCAGATAGCATAAGCAATCCCTGTAGGAATATAAGCCAGCGTCTGAGCTAATAACCAGAATGATGCACAATAACAAATAATTGTACCAACAGATGGCCATAACCGTGTAAAACCTTCTGAAAACTTCATTAAGGTTGTACCAATGACCTCTGCAAGTATTGCACCACCAAGATAAATATAAGGGTTCATAGCATATTCTTTCCTGTTCAAACTGGAGAGAATTGTACTACAGTTTGAACTCAACTCACCTGTTTCATCATTGTGTTCCCATTGATGTTCTTTTATATACCCTCAATACCCGTTTCATCGCGGCACTCTGGCGACACTCCTTAAAAATCAGATTCGTGCTCACCTTTCCTTCCCATTCTTCTCTGGTAGCGAACCGGTAATACACCGTTCGCCAGACCTTACCATCAACGACCAGGATTCCTGCCCGCGCCATTTTAGCCGCAGCCTGATTTATGCTGGTTGATCCTACCCACGTAATATGGACACAGGCCTAAGCGAGGTTCTTGTTTTCAAATTGTTCCGGACTGAGGCCGCCACACCAACTGTGCCGCCGCCACCGATTGTAATCACATTCGATATAATTAAACACCGTTGCCCGCATTATTTCCCGGCTGATAAAGTGTTCTCCATGGATACATTCCACTTTCAGCGAATGAAAGAAGCTTTCCACGCAGGCATTATCGTAGCAGCAACCTTTTGCGCTCATACTTCCACGCAGATTATGCCGCTTCAGTTGCGCCTGATAATCTGCTGAACAGTACTGGCCTCCACGGTCCGTGTGAACGATAACGTTCCGGGGCCTCTTACGCCGCCACAGCGCCATCTGCAGGGCATCGCAGGCCAGTTGCGCCGTCATGCGTGGCGACATTGACCAGCCAATAACGGCACGTGACCACAGGTCAATGACCACTGCCAGATACAGCCAGCCTTCATCTGTACGTAAGTACGTGATGTCTCCTGCCCACTTCTGGTTCGGGCCACTGGCGTAAAAATCCTGCTCCAACAGATTTTCTGACACAGGCAGGCCGTGTGCGCGGTAGCTGACCGGGCTGAACTTCCGGGAGGCCTTTGCCCTCAGTCCCTGACGGCGCAGGCTTGCCGCCACGGTTTTTACGTTAAAGGGGTAACCCTGAGCACGCAGTTCATCCGTCAGGCGTGGGGCACCGTAACGCTGTTTTGACCGGGTAAAAGCCGCGAGGACAACGCTGTCGCAGTGTTGGCGGAACTGCTGACGCGTGCTTA
TCCTTGTCCGCCGCTGACACCACGTATACCAGCCGCTGCGGGCCACCCGGAGCACGCGGCACATTGCTTTGATGCTGAACTCAGCCTGATGTTTTTCAATAAAGACATACTTCATTTCAGGCGCTTCGCGAAGTATGTCGCGGCCTTTTGGAGGATAGCCAGCTCTTCATCCCGTTCTGCCAGCTGGCGTTTGAGACGTGCAATCTCGGTAGACATCTCCAGTTCACGTTCAGAAGACGTCTGCTGATTTTGCTGTTTACTGCGCCAGTTGTAGAGTTGTGATTCATACAGGCTGAGTTCACGGGCTGCGGCAGTAACACCGATGCGTTCAGCAAGCTTCAGGGCTTCACTGCGAAATTCAGGCGAATGCTGTTTACGGGGTTTTTTACTGGTTGATACTGTTTTTGTCATGTGAGTCACCTCTGACTGAGAGTTTACTCACTTAGCCGCGTGTCCACTATTGCTGGGTAAGATCAGATTACGGTTGCGCCTGTTACCGCGGCAACGTCCTGTGCACAGAAGCTCTTATGCGTCCCCAGGTAATGAATAATTGCCTCTTTGCCCGTCATACACTTGCTCCTTTCAGTCCGAACTTAGCTTTAATTTCTGCGATCTTCGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGCCCATAACAGGAAGTTGTTTTACTGGTTCAGGTATCGTCTCACCACGGTTAATTCGCGCTGTCATACAGGTCAGTTCATCGGCAGCCTTGCGCCGTAATTCCGCGTCAGCCAGCGCATTGGCCCGCATGTTCTGGTACAAGTTGGTAACCAACCAGTAATGCGCGTTCGATTTCCACGGATAAGACTCTGCATCCGGATACAGGCCACGCTTCCGGCAATACTCGTACCTCCCGGGATTTCATGAAATTCCGGCTCGGTGGTTTCGAGGCAATAAAATCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACGCGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGCATGAAGCGTGAAGGCCTGCATTATGTCGTGATTGAGCGGAATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTACCTCACAACACGGCAAGCCTGCATTGCGGCGCTTCAGTCTCCGCTGCATACTGTCCAGGTGAGCGCGGGTGATGGCATAACAGAGGAAAGAAAATGTCACTCTTCCGCAGAAATGAAATATGGTATGCCTCGTATTCGCTCCCGGGCGGGAAACGAATTAAGGAATCTCTTGGCACAAAGGACAAACGGCAAGCTCAGGAGTTGCACGACAAGCGAAAAGCAGAACTCTGGCGAGTAGAAAAGCTAGGGGATTTACCTGATGTCACTTTTGAAGAGGCCTGCCTAAGATGGCTTGAGGAAAAAGCTGATAAAAAATCTCTCGATTCAGATAAAAGCCGGATTGAGTTCTGGCTTGAACATTTTGAGGGTATAAGGCTTAAAGATATCTCGGAGGCAAAGATTTACTCTGCTGTAAGCAGAATGCATAACAGAAAGACGAAAGAAATATGGAAACAGAAAGTTCAGGCCGCCATCAGGAAAGGTAAAGAACTGCCTGTTTATGAACCAAAGCCAGTATCAACTCAGACAAAGGCAAAGCATCTTGCCATGATAAAGGCCATTCTCCGTGCTGCAGAACGCGACTGGAAGTGGCTGGAAAAAGCGCCTGTCATCAAGATACCAGCGGTCAGAAACAAGCGAGTCAGATGGCTGGAAAAGGAGGAAGCAAAACGCCTTATTGATGAGTGCCCCGAACCACTGAAATCTGTCGTCAAGTTTGCGCTGGCAACTGGTCTGAGAAAGTCGAACATCATAAATCTGGAATGGCAACAAATCGACATGCAGCGACGAGTTGCCTGGGTGAATCCAGAAGAGAGCAAA
TCAAACCGCGCCATTGGTGTGGCGCTGAACGATACCGCCTGTAAAGTGTTGCGTGATCAAATAGGCAAGCATCACAAATGGGTGTTTGTACATACCAAGGCGGCTAAGCGAGCAGATGGAACATCAACGCCTGCGGTCAGGAAGATGCGCATCGACAGCAAGACATCATGGCTATCAGCTTGTCGTCGTGCAGGAATTGAAGATTTCCGTTTCCATGACCTCAGACACACCTGGGCAAGCTGGCTGATTCAGTCAGGCGTCCCATTATCAGTGCTTCAGGAAATGGGCGGATGGGAGTCCATAGAAATGGTTCGTAGGTATGCTCACCTTGCGCCTAATCATTTGACAGAGCATGCGAGGAAAATAGACGACATTTTTGGTGATAATGTCCCAAATATGTCCCACTCTGAAATTATGGAGGATATAAAGAAGGCGTAA
Protein sequences of DBSCAN-SWA_8 >CP034953|3259833:3304196|3297566_3298118_-|QAA90775.1|DBSCAN-SWA MKTLIVSTVLAFITFSAQAAAFQVTSNEIKTGEQLTTSHVFSGFGCEGGNTSPSLTWSGVPEGTKSFAVTVYDPDAPTGSGWWHWTVVNIPATVTYLPVDAGRRDGTKLPTGAVQGRNDFGYAGFGGACPPKGDKPHHYQFKVWALKTEKIPVDSNSSGALVGYMLNANKIATAEITPVYEIK >CP034953|3259833:3304196|3272037_3273411_-|QAA90746.1|DBSCAN-SWA MSPCKLLPFCVALALTGCSLAPDYQRPAMPVPQQFSLSQNGLVNAADNYQNAGWRTFFVDNQVKTLISEALVNNRDLRMATLKVQEARAQYRLTDADRYPQLNGEGSGSWSGNLKGNTATTREFSTGLNASFDLDFFGRLKNMSEAERQNYLATEEAQRAVHILLVSNVAQSYFNQQLAYAQLQIAEETLRNYQQSYAFVEKQLLTGSSNVLALEQARGVIESTRSDIAKRQGELAQANNALQLLLGSYGKLPQAQTVNSDSLQSVKLPAGLSSQILLQRPDIMEAEHALMAANANIGAARAAFFPSISLTSGISTASSDLSSLFNASSGMWNFIPKIEIPIFNAGRNQANLDIAEIRQQQSVVNYEQKIQNAFKEVADALALRQSLNDQISAQQRYLASLQITLQRARALYQHGAVSYLEVLDAERSLFATRQTLLDLNYARQVNEISLYTALGGG >CP034953|3259833:3304196|3284580_3285330_-|QAA90754.1|DBSCAN-SWA MDYVCSVVFICQSFDLIINRRVISFKKNSLFIVSDKIRRELPVCPSKLRIVDIDKKTCLSFFIDVNNELPGKFTLDKNGYIAEEEPPLSLVFSLFEGIKIADSHSLWLKERLCISLLAMFKKRESVNSFILTNINTFTCKITGIISFNIERQWHLKDIAELIYTSESLIKKRLRDEGTSFTEILRDTRMRYAKKLITSNSYSINVVAQKCGYNSTSYFICAFKDYYGVTPSHYFEKIIGVTDGINKTID >CP034953|3259833:3304196|3296759_3297557_-|QAA90774.1|DBSCAN-SWA MLARGKTNLKIEEIRMHKHHEIHRVKPLMPALCRIRQGKKVINWETHTLTVDNNQIILFPCGYEFYIENYPEAGLYLAEMLYLPIDLIESFQKLYTVTDQIRNKTSFFLPQNPELIYCWEQLKTSVSRGFSTKIQEHLAMGVLLSLGVNHVNHLLLSYSKQSLISRCYNLLLSEPGTKWTANKVARYLYISVSTLHRRLASEGVSFQSILDDVRLNNALSAIQTTVKPISEIARENGYKCPSRFTERFHNRFNITPREIRKASRE >CP034953|3259833:3304196|3259833_3260946_-|QAA91940.1|transposase|DBSCAN-SWA MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPRSAGSEKKN >CP034953|3259833:3304196|3288565_3288760_+|QAA90758.1|DBSCAN-SWA 
MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSEKRAYTSIKSDEE >CP034953|3259833:3304196|3282102_3282864_+|QAA90752.1|DBSCAN-SWA MQLSSSEPCVVILTEKEVEVSVNNHATFTLPKNYLAAFACNNNVIELSTLNHVLITHINRNIINDYLLFLNKNLTCVKPWSRLATPVIACHSRTPEVFRLAANHSKQQPSRPCEAELTRALLFTVLSNFLEQSRFIALLMYILRSSVRDSVCRIIQSDIQHYWNLRIVASSLCLSPSLLKKKLKNENTSYSQIVTECRMRYAVQMLLMDNKNITQVAQLCGYSSTSYFISVFKAFYGLTPLNYLAKQRQKVMW >CP034953|3259833:3304196|3288924_3289131_-|QAA90759.1|DBSCAN-SWA MNKEQSADDPSVDLIRVKNMLNSTISMSYPDVVIACIEHKVSLEAFRAIEAALVKHDNNMKDYSLVVD >CP034953|3259833:3304196|3261022_3261175_-|QAA90736.1|DBSCAN-SWA MLTKYALAAVIVLCLTVLGFTLLVGDSLCEFTVKERNIEFKAVLAYEPKK >CP034953|3259833:3304196|3300363_3300696_-|QAA90778.1|DBSCAN-SWA MNPYIYLGGAILAEVIGTTLMKFSEGFTRLWPSVGTIICYCASFWLLAQTLAYIPTGIAYAIWSGVGIVLISLLSWGFFGQRLDLPAIIGMMLICAGVLIINLLSRSTPH >CP034953|3259833:3304196|3262811_3263060_+|QAA90738.1|DBSCAN-SWA MKHPLETLTTAAGILLMAFLSCLLLPAPALGLALAQKLVTMFHLMDLSQLYTLLFCLWFLVLGAIEYFVLRFIWRRWFSLAD >CP034953|3259833:3304196|3263124_3263493_+|QAA90739.1|DBSCAN-SWA MDKQSLHETAKRLALELPFVELCWPFGPEFDVFKIGGKIFMLSSELRGVPFINLKSDPQKSLLNQQIYPSIKPGYHMNKKHWISVYPGEEISEALLRDLINDSWNLVVDGLAKRDQKRVRPG >CP034953|3259833:3304196|3283377_3284331_+|QAA90753.1|protease|DBSCAN-SWA MRAKLLGIVLTTPIAISSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGRKVSQLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDQDWMDSSNPGTWTDESRHPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRDDIGSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVESSDNDEHYDPGKRITYRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNNNTSDYSKNGAGIENYNFITTAGLKYTF >CP034953|3259833:3304196|3300166_3300316_-|QAA90777.1|DBSCAN-SWA MSLVLCFLLMSLFFMYSFVLSRLWRKKIAIRLLLYIQDNVTLIVFLNKK >CP034953|3259833:3304196|3295919_3296090_-|QAA90771.1|DBSCAN-SWA MATPLIRVMNGHIYRVPNRRKRKPELKPSEIPTLLGYTASLVDKKWLRLAARRSHG >CP034953|3259833:3304196|3296541_3296643_-|QAA90773.1|DBSCAN-SWA MRTYNPNSLLPSQMQKCTCNSLHLAFDLCGGEA >CP034953|3259833:3304196|3290900_3291398_-|QAA90763.1|DBSCAN-SWA 
MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIMLGKTYTKAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSTLLRKINQGDIKGACDQLRRWTYAGGKQWKGLMTRREIEREVCLWGQQ >CP034953|3259833:3304196|3295277_3295640_-|QAA90769.1|DBSCAN-SWA MNTYSITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE >CP034953|3259833:3304196|3295140_3295281_-|QAA90768.1|DBSCAN-SWA MMFEFNMAELLRHRWGRLRLYRFPGSVLTDYRILKNYAKTLTGAGV >CP034953|3259833:3304196|3273567_3274251_+|QAA90747.1|DBSCAN-SWA MKLLIVEDEKKTGEYLTKGLTEAGFVVDLADNGLNGYHLAMTGDYDLIILDIMLPDVNGWDIVRMLRSANKGMPILLLTALGTIEHRVKGLELGADDYLVKPFAFAELLARVRTLLRRGAAVIIESQFQVADLMVDLVSRKVTRSGTRITLTSKEFTLLEFFLRHQGEVLPRSLIASQVWDMNFDSDTNAIDVAVKRLRGKIDNDFEPKLIQTVRGVGYMLEVPDGQ >CP034953|3259833:3304196|3291397_3291613_-|QAA90764.1|lysis|DBSCAN-SWA MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >CP034953|3259833:3304196|3298582_3300109_-|QAA90776.1|DBSCAN-SWA MKKAIAYMRFSSPGQMSGDSLNRQRRLIAEWLKVNSDYYLDTITYEDLGLSAFKGKHAQSGAFSEFLDAIEHGYILPGTTLLVESLDRLSREKVGEAIERLKLILNHGIDVITLCDNTVYNIDSLNEPYSLIKAILIAQRANEESEIKSSRVKLSWKKKRQDALESGTIMTASCPRWLSLDDKRTAFVPDPDRVKTIELIFKLRMERRSLNAIAKYLNDHAVKNFSGKESAWGPSVIEKLLANKALIGICVPSYRARGKGISEIAGYYPRVISDDLFYAVQEIRLAPFGISNSSKNPMLINLLRTVMKCEACGNTMIVHAVSGSLHGYYVCPMRRLHRCDRPSIKRDLVDYNIINELLFNCSKIQPVENKKDANETLELKIIELQMKINNLIVALSVAPEVTAIAEKIRLLDKELRRASVSLKTLKSKGVNSFSDFYAIDLTSKNGRELCRTLAYKTFEKIIINTDNKTCDIYFMNGIVFKHYPLMKVISAQQAISALKYMVDGEIYF >CP034953|3259833:3304196|3290117_3290411_+|QAA90761.1|DBSCAN-SWA MKKMLLATALALLITGCAQQTFTVQNKQTAVAPKETITHHFFVSGIGQKKTVDAAKICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSK >CP034953|3259833:3304196|3296089_3296545_-|QAA90772.1|DBSCAN-SWA MNLSQDGIKLHRGNFTAIGRQIQPYLEEGKCFRMVLKPWREKRSLSQNALSHMWYSEISEYLISRGKTFATPAWVKDALKHTYLGYETKDLVDVVTGDITTIQSLRHTSDLDTGEMYVFLCKVEAWAMNIGCHLTIPQSCEFQLLRDKQEA >CP034953|3259833:3304196|3302231_3302327_-|QAA90780.1|DBSCAN-SWA MTGKEAIIHYLGTHKSFCAQDVAAVTGATVI 
>CP034953|3259833:3304196|3302649_3302913_+|QAA90781.1|DBSCAN-SWA MKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVIERNEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP034953|3259833:3304196|3290501_3290684_-|QAA90762.1|lysis|DBSCAN-SWA MRKLKMMLCVMMLPLVVVGCTSKQSVSQCVKPPRPPAWIMQPPPDWQTPLNGIISPSERG >CP034953|3259833:3304196|3275832_3278070_+|QAA90749.1|DBSCAN-SWA MDWLLDVFATWLYGLKVIAITLAVIMFISGLDDFFIDVVYWVRRIKRKLSVYRRYPRMSYRELYKPDEKPLAIMVPAWNETGVIGNMAELAATTLDYENYHIFVGTYPNDPDTQRDVDEVCARFPNVHKVVCARPGPTSKADCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELRLFNYLVERKDLIQIPVYPFEREWTHFTSMTYIDEFSELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLADGDGIAFDVQSLTEDYDIGFRLKEKGMTEIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSTAVRQKSRWIIGIVFQGFKTHKWTSSLTLNYFLWRDRKGAISNFVSFLAMLVMIQLLLLLAYESLWPDAWHFLSIFSGSAWLMTLLWLNFGLMVNRIVQRVIFVTGYYGLTQGLLSVLRLFWGNLINFMANWRALKQVLQHGDPRRVAWDKTTHDFPSVTGDTRSLRPLGQILLENQVITEEQLDTALRNRVEGLRLGGSMLMQGLISAEQLAQALAEQNGVAWESIDAWQIPSSLIAEMPASVALHYAVLPLRLENDELIVGSEDGIDPVSLAALTRKVGRKVRYVIVLRGQIVTGLRHWYARRRGHDPRAMLYNAVQHQWLTEQQAGEIWRQYVPHQFLFAEILTTLGHINRSAINVLLLRHERSSLPLGKFLVTEGVISQETLDRVLTIQRELQVSMQSLLLKAGLNTEQVAQLESENEGE >CP034953|3259833:3304196|3287631_3288177_-|QAA90757.1|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELLQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIPGLLSEYIEQSG >CP034953|3259833:3304196|3295636_3295927_-|QAA90770.1|DBSCAN-SWA MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIKA >CP034953|3259833:3304196|3303032_3304196_+|QAA90782.1|integrase|DBSCAN-SWA MSLFRRNEIWYASYSLPGGKRIKESLGTKDKRQAQELHDKRKAELWRVEKLGDLPDVTFEEACLRWLEEKADKKSLDSDKSRIEFWLEHFEGIRLKDISEAKIYSAVSRMHNRKTKEIWKQKVQAAIRKGKELPVYEPKPVSTQTKAKHLAMIKAILRAAERDWKWLEKAPVIKIPAVRNKRVRWLEKEEAKRLIDECPEPLKSVVKFALATGLRKSNIINLEWQQIDMQRRVAWVNPEESKSNRAIGVALNDTACKVLRDQIGKHHKWVFVHTKAAKRADGTSTPAVRKMRIDSKTSWLSACRRAGIEDFRFHDLRHTWASWLIQSGVPLSVLQEMGGWESIEMVRRYAHLAPNHLTEHARKIDDIFGDNVPNMSHSEIMEDIKKA 
>CP034953|3259833:3304196|3286913_3287657_-|QAA90756.1|terminase|DBSCAN-SWA MNISNSQVNRLRHFVRAGLRSLFRPEPQTAVEWADANYYLPKESAYQEGRWETLPFQRAIMNAMGSDYIREVNVVKSARVGYSKMLLGVYAYFIEHKQRNTLIPAGFVAVFNSDESSWHLVEDHRGKTVYDVASGDALFISELGPLPENVTWLSPEGEFQKWNGTAWVKDAEAEKLFRIREAEETKNSLMQVASEHIAPLQDAVDLEIATEEETSLLEAWKKYRVLLNRVDTSTAPDIEWPTNPVRE >CP034953|3259833:3304196|3281029_3281920_+|QAA90751.1|DBSCAN-SWA MRKFIFVLLTLLLVSPFSFAMKGIIWQPQNRDSQVTDTQWQGLMSQLRLQGFDTLVLQWTRYGDAFTQPEQRTLLFKRAAAAQQAGLKLIVGLNADPEFFMHQKQSSAALESYLNRLLAADLQQARLWSAAPGITPDGWYISAEIDDLNWRSEAARQPLLTWLNNAQRLISDVSAKPVYISSFFAGNMSPDGYRQLLEHVKATGVNVWVQDGSGVDKLTAEQRERYLQASADCQSPAPASGVVYELFVAGKGKTFTAKPKPDAEIASLLAKRSSCGKDTLYFSLRYLPVAHGILEY >CP034953|3259833:3304196|3286803_3286941_-|QAA90755.1|capsid|DBSCAN-SWA MAYEPCQGVIIGIMPQHVLSKNMLRLDAIFFLKRKTLLQYLEPWF >CP034953|3259833:3304196|3286232_3286859_+|QAA91941.1|DBSCAN-SWA MYTPLTLKLYDWWVLGVSNRLAWGCPTKEHLLPHFLEHLGNNHLDIGVGTGFYLTHVPESSLISLMDLNEASLNAASTRAGESKIKHKISHDVFEPYPAALHGQFDSISMSYLLHCLPGNISTKSCVIRNAAQALTDDGTLYGATILGDGVVHNSFGQKLMRIYNQKGIFSNTKDSEEGLTHILSEHFENVKTKVQGTVVMFSASGKK >CP034953|3259833:3304196|3270308_3271532_-|QAA90744.1|DBSCAN-SWA MKKIALIIGSMIAGGIISAAGFTWVAKAEPPAEKTSTAERKILFWYDPMYPNTRFDKPGKSPFMDMDLVPKYADEESSASGVRIDPTQTQNLGVKTATVTRGPLTFAQSFPANVSYNEYQYAIVQARAAGFIDKVYPLTVGDKVQKGTPLLDLTIPDWVEAQSEYLLLRETGGTATQTEGILERLRLAGMPEADIRRLIATQKIQTRFTLKAPIDGVITAFDLRAGMNIAKDNVVAKIQGMDPVWVTAAIPESIAWLVKDASQFTLTVPARPDKTLTIRKWTLLPGVDAATRTLQLRLEVDNADEALKPGMNAWLQLNTASEPMLLIPSQALIDTGSEQRVITVDADGRFVPKRVAVFQASQGVTALRSGLAEGEKVVSSGLFLIDSEANISGALERMRSESATHAH >CP034953|3259833:3304196|3293257_3294274_+|QAA90766.1|transposase|DBSCAN-SWA MFVIWSHRTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMVEVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMRLFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQGTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSGLTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVDWLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGFVKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH 
>CP034953|3259833:3304196|3264347_3265595_+|QAA90741.1|DBSCAN-SWA MQDLISQVEDLAGIEIDHTTSMVMIFGIIFLTAVVVHIILHWVVLRTFEKRAIASSRLWLQIITQNKLFHRLAFTLQGIIVNIQAVFWLQKGTEAADILTTCAQLWIMMYALLSVFSLLDVILNLAQKFPAASQLPLKGIFQGIKLIGAILVGILMISLLIGQSPAILISGLGAMAAVLMLVFKDPILGLVAGIQLSANDMLKLGDWLEMPKYGADGAVIDIGLTTVKVRNWDNTITTIPTWSLVSDSFKNWSGMSASGGRRIKRSISIDVTSIRFLDEDEMQRLNKAHLLKPYLTSRHQEINEWNRQQGSTESVLNLRRMTNIGTFRAYLNEYLRNHPRIRKDMTLMVRQLAPGDNGLPLEIYAFTNTVVWLEYESIQADIFDHIFAIVEEFGLRLHQSPTGNDIRSLAGAFKQ >CP034953|3259833:3304196|3261627_3262746_+|QAA90737.1|DBSCAN-SWA MPLPDFHVSEPFTLGIELEMQVVNPPGYDLSQDSSMLIDAVKNKITAGEVKHDITESMLELATDVCRDINQAAGQFSAMQKVVLQAATDHHLEICGGGTHPFQKWQRQEVCDNERYQRTLENFGYLIQQATVFGQHVHVGCASGDDAIYLLHGLSRFVPHFIALSAASPYMQGTDTRFASSRPNIFSAFPDNGPMPWVSNWQQFEALFRCLSYTTMIDSIKDLHWDIRPSPHFGTVEVRVMDTPLTLSHAVNMAGLIQATAHWLLTERPFKHQEKDYLLYKFNRFQACRYGLEGVITDPHTGDRRPLTEDTLRLLEKIAPSAHKIGASSAIEALHRQVVSGLNEAQLMRDFVADGGSLIGLVKKHCEIWAGD >CP034953|3259833:3304196|3278056_3281029_+|QAA90750.1|DBSCAN-SWA MKENNLNRVIGWSGLLLTSLLSTSALADNIGTSAEELGLSDYRHFVIYPRLDKALKAQKNNDEATAIREFEYIHQQVPDNIPLTLYLAEAYRHFGHDDRARLLLEDQLKRHPGDARLERSLAAIPVEVKSVTTVEELLAQQKACDAAPTLRCRSEVGQNALRLAQLPVARAQLNDATFAASPEGKTLRTDLLQRAIYLKQWSQADTLYNEARQQNTLSAAERRQWFDVLLAGQLDDRILALQSQGIFTDPQSYITYATALAYRGEKARLQHYLIENKPLFTTDAQEKSWLYLLSKYSANPVQALANYTVQFADNRQYVVGATLPVLLKEGQYDAAQKLLATLPANEMLEERYAVSVATRNKAEALRLARLLYQQEPANLTRLDQLTWQLMQNEQSREAADLLLQRYPFQGDARVSQTLMARLASLLESHPYLATPAKVAILSKPLPLAEQRQWQSQLPGIADNCPAIVRLLGDMSPSYDAAAWNRLAKCYRDTLPGVALYAWLQAEQRQPSAWQHRAVAYQAYQVEDYATALAAWQKISLHDMSNEDLLAAANTAQAAGNGAARDRWLQQAEKRGLGSNALYWWLHAQRYIPGQPELALNDLTRSINIAPSANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNTQAALGYALWDSGDIAQSREMLEPAHKGLPDDPALIRQLAYVNQRLDDMPATQHYARLVIDDIDNQALITPLTPEQNQQRFNFRRLHEEVGRRWTFSFDSSIGLRSGAMSTANNNVGGAAPGKSYRSYGQLEAEYRIGRNMLLEGDLLSVYSRVFADTGENGVMMPVKNPMSGTGLRWKPLRDQIFFIAVEQQLPLNGQNGASDTMLRASASFFNGGKYSDEWHPNGSGWFAQNLYLDAAQYIRQDIQAWTADYRVSWHQKVANGQTIEPYAHVQDNGYRDKGTQGAQLGGVGVRWNIWTGETHYDAWPHKVSLGVEYQHTFKAINQRNGERNNAFLTIGVHW 
>CP034953|3259833:3304196|3271547_3271880_-|QAA90745.1|DBSCAN-SWA MKKALQVAMFSLFTVIGFNAQANEHHHETMSEAQPQVISATGVVKGIDLESKKITIHHDPIAAVNWPEMTMRFTITPQTKMSEIKTGDKVAFNFVQQGNLSLLQDIKVSQ >CP034953|3259833:3304196|3274240_3275683_+|QAA90748.1|DBSCAN-SWA MVSKPFQRPFSLATRLTFFISLATIAAFFAFAWIMIHSVKVHFAEQDINDLKEISATLERVLNHPDETQARRLMTLEDIVSGYSNVLISLADSQGKTVYHSPGAPDIREFTRDAIPDKDAQGGEVYLLSGPTMMMPGHGHGHMEHSNWRMINLPVGPLVDGKPIYTLYIALSIDFHLHYINDLMNKLIMTASVISILIVFIVLLAVHKGHAPIRSVSRQIQNITSKDLDVRLDPQTVPIELEQLVLSFNHMIERIEDVFTRQSNFSADIAHEIRTPITNLITQTEIALSQSRSQKELEDVLYSNLEELTRMAKMVSDMLFLAQADNNQLIPEKKMLNLADEVGKVFDFFEALAEDRGVELRFVGDKCQVAGDPLMLRRALSNLLSNALRYTPTGETIVVRCQTVDHLVQVIVENPGTPIAPEHLPRLFDRFYRVDPSRQRKGEGSGIGLAIVKSIVVAHKGTVAVTSDARGTRFVITLPA >CP034953|3259833:3304196|3294671_3295055_-|QAA90767.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLITSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVVGMTFMSLAGKHCCSDGYIGKRLQKAEGIIEGMLMALDIRLEMDIVVNNSN >CP034953|3259833:3304196|3301006_3302169_-|QAA90779.1|transposase|DBSCAN-SWA MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWRSKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILPKGRDILREAPEMKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCDSVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAVVIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQYCSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISREIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENKNLA >CP034953|3259833:3304196|3292185_3293253_+|QAA90765.1|DBSCAN-SWA MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYARLGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYGVAYDIGAWTDVLPEFGGDTWTQTDVFMTQRATGVATYRNNDFFGLVDGLNFAAQYQGKNDRSDFDNYTEGNGDGFGFSATYEYEGFGIGATYAKSDRTDTQVNAGKVLPEVFASGKNAEVWAAGLKYDANNIYLATTYSETQNMTVFADHFVANKAQNFEAVAQYQFDFGLRPSVAYLQSKGKDLGVWGDQDLVKYVDVGATYYFNKNMSTFVDYKINLLDKNDFTKEGANKSLI >CP034953|3259833:3304196|3265675_3267052_-|QAA90742.1|DBSCAN-SWA 
MKNASTVSEDTASNQEPTLHRGLHNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGYGVAGIIAFLIMRQLGEMVVEEPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELTAAGIYMQYWFPDVPTWIWAAAFFIIINAVNLVNVRLYGETEFWFALIKVLAIIGMIGFGLWLLFSGHGGEKASIDNLWRYGGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPEKSIPKAVNQVVYRILLFYIGSLVVLLALYPWVEVKSNSSPFVMIFHNLDSNVVASALNFVILVASLSVYNSGVYSNSRMLFGLSVQGNAPKFLTRVSRRGVPINSLMLSGAITSLVVLINYLLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRQGRETQFKALLYPFGNYLCIAFLGMILLLMCTMDDMRLSAILLPVWIVFLFMAFKTLRRK >CP034953|3259833:3304196|3289416_3289827_+|QAA90760.1|DBSCAN-SWA MAQVAIFKEIFDQVRKDLNCELFYSELKRHNVSHYIYYLATDNIHIVLENDNTVLIKGLKKVVNVKFSRNTHLIETSYDRLKSREITFQQYRENLAKAGVFRWITNIHEHKRYYYTFDNSLLFTESIQNTTQIFPR >CP034953|3259833:3304196|3263586_3264240_+|QAA90740.1|DBSCAN-SWA MDIISVALKRHSTKAFDASKKLTPEQAEQIKTLLQYSPSSTNSQPWHFIVASTEEGKARVAKSAAGNYVFNERKMLDASHVVVFCAKTAMDDVWLKLVVDQEDADGRFATPEAKAANDKGRKFFADMHRKDLHDDAEWMAKQVYLNVGNFLLGVAALGLDAVPIEGFDAAILDAEFGLKEKGYTSLVVVPVGHHSVEDFNATLPKSRLPQNITLTEV >CP034953|3259833:3304196|3267153_3270297_-|QAA90743.1|DBSCAN-SWA 
MIEWIIRRSVANRFLVLMGALFLSIWGTWTIINTPVDALPDLSDVQVIIKTSYPGQAPQIVENQVTYPLTTTMLSVPGAKTVRGFSQFGDSYVYVIFEDGTDPYWARSRVLEYLNQVQGKLPAGVSAELGPDATGVGWIYEYALVDRSGKHDLADLRSLQDWFLKYELKTIPDVAEVASVGGVVKEYQVVIDPQRLAQYGISLAEVKSALDASNQEAGGSSIELAEAEYMVRASGYLQTLDDFNHIVLKASENGVPVYLRDVAKVQIGPEMRRGIAELNGEGEVAGGVVILRSGKNAREVIAAVKDKLETLKSSLPEGVEIVTTYDRSQLIDRAIDNLSGKLLEEFIVVAVVCALFLWHVRSALVAIISLPLGLCIAFIVMHFQGLNANIMSLGGIAIAVGAMVDAAIVMIENAHKRLEEWQHQHPDATLDNKTRWQVITDASVEVGPALFISLLIITLSFIPIFTLEGQEGRLFGPLAFTKTYAMAGAALLAIVVIPILMGYWIRGKIPPESSNPLNRFLIRVYHPLLLKVLHWPKTTLLVAALSVLTVLWPLNKVGGEFLPQINEGDLLYMPSTLPGISAAEAASMLQKTDKLIMSVPEVARVFGKTGKAETATDSAPLEMVETTIQLKPQEQWRPGMTMDKIIEELDNTVRLPGLANLWVPPIRNRIDMLSTGIKSPIGIKVSGTVLADIDAMAEQIEEVARTVPGVASALAERLEGGRYINVEINREKAARYGMTVADVQLFVTSAVGGAMVGETVEGIARYPINLRYPQSWRDSPQALRQLPILTPMKQQITLADVADIKVSTGPSMLKTENARPTSWIYIDARDRDMVSVVHDLQKAIAEKVQLKPGTSVAFSGQFELLERANHKLKLMVPMTLMIIFVLLYLAFRRVGEALLIISSVPFALVGGIWLLWWMGFHLSVATGTGFIALAGVAAEFGVVMLMYLRHAIEAVPSLNNPQTFSEQKLDEALYHGAVLRVRPKAMTVAVIIAGLLPILWGTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKLMWLHRHRVRK |
49 | Enterobacteria_phage(56.0%) | integrase,transposase,protease,capsid,terminase,lysis | attL 3282908:3282954|attR 3304210:3304256 |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a specific phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|