Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP016043 | Edwardsiella hoshinae strain ATCC 35051 chromosome, complete genome | 4 crisprs | DEDDh,cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,csa3,DinG | 0 | 9 | 3 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP016043_1 | 873489-874065 | TypeI-E |
NA
Consensus repeat of NZ_CP016043_1
|
9 spacers
spacers of NZ_CP016043_1
>1.1|873517|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT TGACCCGTTTCTCCTTGGCGCGCTCAATAACCG
>1.2|873578|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT TGGCATCCTGCCCATAGCGCCCTCGGGTATCCG
>1.3|873639|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT CATGAAAGTCCCCTCCGGAACTAAGACTGAGGA
>1.4|873700|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT TGTGGCCCCCTCCAGCGACTCATACTCCTGACC
>1.5|873761|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT TAAACTAGGGACTGATTATTATTTTTATATACG
>1.6|873822|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT TGGCGCTCCGCCTGCTACCGATGCAGATGACCG
>1.7|873883|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT CGGGCCCGGCAAGCACTGGGCCTAACTCAATCG
>1.8|873944|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT TGCCGTTATCAGGCGCATGGTGGCGGAGTATCC
>1.9|874005|33|NZ_CP016043|CRISPRCasFinder,CRT CTGATCCAGATCGCTCTGGCCATTTACCGCCTG |
cas2,cas1,cas5,cas7,cas6e,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around NZ_CP016043_1
The CRISPR arrays of NZ_CP016043_1 >merge|NZ_CP016043|1|873489-874065|PILER-CR,CRISPRCasFinder,CRT GCTTTCCCCACGACCGTGGGGGTGTTTCTGACCCGTTTCTCCTTGGCGCGCTCAATAACCGGCTTTCCCCACGACCGTGGGGGTGTTTCTGGCATCCTGCCCATAGCGCCCTCGGGTATCCGGCTTTCCCCACGACCGTGGGGGTGTTTCCATGAAAGTCCCCTCCGGAACTAAGACTGAGGAGCTTTCCCCACGACCGTGGGGGTGTTTCTGTGGCCCCCTCCAGCGACTCATACTCCTGACCGCTTTCCCCACGACCGTGGGGGTGTTTCTAAACTAGGGACTGATTATTATTTTTATATACGGCTTTCCCCACGACCGTGGGGGTGTTTCTGGCGCTCCGCCTGCTACCGATGCAGATGACCGGCTTTCCCCACGACCGTGGGGGTGTTTCCGGGCCCGGCAAGCACTGGGCCTAACTCAATCGGCTTTCCCCACGACCGTGGGGGTGTTTCTGCCGTTATCAGGCGCATGGTGGCGGAGTATCCGCTTTCCCCACGACCGTGGGGGTGTTTCCTGATCCAGATCGCTCTGGCCATTTACCGCCTGGCTTTCCCCACGACCGTGGGTGGACGTA >NZ_CP016043|1|1|873489-874004|PILER-CR GCTTTCCCCACGACCGTGGGGGTGTTTC TGACCCGTTTCTCCTTGGCGCGCTCAATAACCG GCTTTCCCCACGACCGTGGGGGTGTTTC TGGCATCCTGCCCATAGCGCCCTCGGGTATCCG GCTTTCCCCACGACCGTGGGGGTGTTTC CATGAAAGTCCCCTCCGGAACTAAGACTGAGGA GCTTTCCCCACGACCGTGGGGGTGTTTC TGTGGCCCCCTCCAGCGACTCATACTCCTGACC GCTTTCCCCACGACCGTGGGGGTGTTTC TAAACTAGGGACTGATTATTATTTTTATATACG GCTTTCCCCACGACCGTGGGGGTGTTTC TGGCGCTCCGCCTGCTACCGATGCAGATGACCG GCTTTCCCCACGACCGTGGGGGTGTTTC CGGGCCCGGCAAGCACTGGGCCTAACTCAATCG GCTTTCCCCACGACCGTGGGGGTGTTTC TGCCGTTATCAGGCGCATGGTGGCGGAGTATCC GCTTTCCCCACGACCGTGGGGGTGTTTC >NZ_CP016043|1|1|873489-874065|CRISPRCasFinder GCTTTCCCCACGACCGTGGGGGTGTTTC TGACCCGTTTCTCCTTGGCGCGCTCAATAACCG GCTTTCCCCACGACCGTGGGGGTGTTTC TGGCATCCTGCCCATAGCGCCCTCGGGTATCCG GCTTTCCCCACGACCGTGGGGGTGTTTC CATGAAAGTCCCCTCCGGAACTAAGACTGAGGA GCTTTCCCCACGACCGTGGGGGTGTTTC TGTGGCCCCCTCCAGCGACTCATACTCCTGACC GCTTTCCCCACGACCGTGGGGGTGTTTC TAAACTAGGGACTGATTATTATTTTTATATACG GCTTTCCCCACGACCGTGGGGGTGTTTC TGGCGCTCCGCCTGCTACCGATGCAGATGACCG GCTTTCCCCACGACCGTGGGGGTGTTTC CGGGCCCGGCAAGCACTGGGCCTAACTCAATCG GCTTTCCCCACGACCGTGGGGGTGTTTC TGCCGTTATCAGGCGCATGGTGGCGGAGTATCC GCTTTCCCCACGACCGTGGGGGTGTTTC CTGATCCAGATCGCTCTGGCCATTTACCGCCTG GCTTTCCCCACGACCGTGGGTGGACGTA >NZ_CP016043|1|1|873489-874065|CRT GCTTTCCCCACGACCGTGGGGGTGTTTC TGACCCGTTTCTCCTTGGCGCGCTCAATAACCG 
GCTTTCCCCACGACCGTGGGGGTGTTTC TGGCATCCTGCCCATAGCGCCCTCGGGTATCCG GCTTTCCCCACGACCGTGGGGGTGTTTC CATGAAAGTCCCCTCCGGAACTAAGACTGAGGA GCTTTCCCCACGACCGTGGGGGTGTTTC TGTGGCCCCCTCCAGCGACTCATACTCCTGACC GCTTTCCCCACGACCGTGGGGGTGTTTC TAAACTAGGGACTGATTATTATTTTTATATACG GCTTTCCCCACGACCGTGGGGGTGTTTC TGGCGCTCCGCCTGCTACCGATGCAGATGACCG GCTTTCCCCACGACCGTGGGGGTGTTTC CGGGCCCGGCAAGCACTGGGCCTAACTCAATCG GCTTTCCCCACGACCGTGGGGGTGTTTC TGCCGTTATCAGGCGCATGGTGGCGGAGTATCC GCTTTCCCCACGACCGTGGGGGTGTTTC CTGATCCAGATCGCTCTGGCCATTTACCGCCTG GCTTTCCCCACGACCGTGGGTGGACGTA
>NZ_CP016043.1|WP_083274988.1|873149_873455_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MLIVLANDLPPAVRGRMKLWFIEPRPNTFVSGIKDSVADTVIEYLYQHCSPAAGVVIFKSVARTPGYQIHTIGSPTKTLCEITGLQLVVEKRLEQQVNYNM >NZ_CP016043.1|WP_070244588.1|872287_873169_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MSGGGQRLFVKITRESLPQVKDKYPFIYLERGRLEIDDSSLKWLDAEGQVVRLPVATLNAILLGPGTSLTHEAVKTAAAANCAICWVGEDSLLFYAAGFLPTANTRNLNHQMRLACNKKSSLEVARRMFAYRFPDADLAGKGLKEMMGMEGSRVRALYQQKAQQYGVGWRGRQYIPGKMEISDTTNRVLTSVNAALYGILCSALHAMGYSPHMGFIHSGSPLPFVYDLADLYKENLCIDLAFSLTREMAGRYEKALVSSRFRERVIELNLLASVARDIPQLLGGVNIDADSAS >NZ_CP016043.1|WP_070244587.1|871570_872284_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MNQPYLLLWLEGPLQSWGHDSRFGRRETLHFPTKSGVLGLVCAALGAGGPQISLLAQFADLDMQVHSFARRHKNGELAPREPLLRDFHMVGSGYDDKDPWQSLLIPKTSEGKKAVGGGTKMTYRYYLQDQAFAVLLQIPNALLTEVAQALQNPVWDLSLGRKTCVPSEFIFQGQFANRDDALTAAFNLAEQKQRTQDFMVIQGAVEGGELLILNDVPLQFGQHKRYRDRQVTLINEG >NZ_CP016043.1|WP_070244586.1|870523_871567_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MNNALKNTRIEFHILQSFPVTCLNRDDVGAPKTAVVGGVTRARVSSQCWKRQVRLAMQAFGVKLGIRTKKVADLVTEHCIKLGAAQEPAQACASKIAELLADDTLLFISDSEAEALADYAREQGFDSNKIKDKELAKRAKKNRNPALDALDIALFGRMVAKAADMNVEAAASFSHAISTHKVANEVEFFTALDDRQEESGSAHMGSLEFNSATYYRYISLDLGQLADTMSGDELNKEQLKQAIAVFTKALFVAVPAARQTTQSGASPWEFAKVLVRKGQRLQVPFEEPVKAAGHGFLVPSVAALKGYISKKEALTGSLFGKLGDYEWGEDETFSLDHLIAKLQNHVE >NZ_CP016043.1|WP_070244585.1|869874_870531_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MTLYASVLRLDRAAVKALRVTDLYSLHRVVYALFEDVRSEAQKQASVPSGIQWVDKGGDHRCRQILLLSDRLPQAGEYGEVESRPLPDDFLSHRHYRFAVTVSPTRRDNQSRQLKPVKGREAIADWFIERAATNWGFYIAPERIQVDDVRVAQFKGKAERAITLQQATLNGYLTVTDPERFALSVASGVGRGRAFGCGLLQVVPLIDPPLFLTRNHHE >NZ_CP016043.1|WP_070244584.1|869353_869878_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MNMQPQASQRREVDFVAYICQRCQKDKGFAARLCRADNPATEYQSWDTLAAFGINLEWAEERQPFALIAAAVARSDQACNGTLPLGQAIALAFSEGRESDQAKARLRRLLACDDTEEVCRILRPLLMLIRSRVNQPLDYAALLVDLRWFHRSADRAKARWAQQFYGRQEKEITA 
>NZ_CP016043.1|WP_070244583.1|867872_869357_+|type-I-E-CRISPR-associated-protein-Cse1/CasA MEHCFNLIDEPWIPVTDNGRVGLKDIFTHPEYRALGGNPVQKAAILKLLQAIAQAASTPQDLKAWQQLGWQGMAERVCCYLAQWRDRFYLYGPIPFLQMPAIAKAAIKSFGTVQPDVATGNTTVLTQSQAEQPLDDGERALLLITQMGFALGGKKTDNSVVLTPGYSGKSNDKGKPSTGKPGPSVAYMGLLHNYCLGSSLLESIWLNLFTEAEIVDLTLYPSGLGIAPWEKMPEGEDCHVAKQLKGSLMGRLVPLCRFCLLADEGLHYSEGIAHGNYKEGVFDPSVAIDISGKEPKVRWADPERRPWRELTSLLGFIDQGGKSLDCYQIKLALRKAKKQVARFAIWSGGLRVSSNAGEQYVSGSDDMVESLCWLSPSHVNELWFNRFQTEIGQLDGLAKTLYGCVMSYYKAQMMDGESLAKQASNLFWQLCERQSQALIDGCDEVKARQQLRRQFARYTTQVFDQFCPHQTARQMDAWAKTKPNLSVYLQQEQS >NZ_CP016043.1|WP_070244582.1|865223_867869_+|CRISPR-associated-helicase-Cas3' MINRRRKSTDTESAIAAVPFELCPAKTYKDRQGVPHLGRSVFNHCQIVGQTAKALLERIPATIRYPLFPRGSALQAALHDIGKISPTFFLKLQCAVEGEDSPWLQRMSQFRGIQEREWGGHAGVSELALAAITNNPFVPSVAGQHHGFNPPEVMLTADAPPLGGAPWQTERCKLVEALQRAMGEALPVITTPAQARILAGLTSVADWIGSGPHFEDPAIPWQPRIEQALDDAGFILPKVRNGLTFGDIFASEDGVPYQPNEPQQLLHQYTQGVGVYVLEAPMGLGKTEAALYAAYKMLEEGRATGIYFALPTQLTSNKLLDRFNGYLKQILTEESPHRHSLLLHGNAWLANHALGEEGKPGRSWFNCAKRGLLAPFAVGTLDQALMAAMNVKHGFVRAFGLVGKVVILDEVHSYDAYTGVILDELIRLLRTLHCTVIILSATLSQARRSELLGQPAQQDAYPLISVSPGITPSPLQELSVTPEEPRTVYLQCKAMADQTVLEEVLKRASQGQQVLWIENTVAEAQERYLDLATRAQELGVACGLLHSRFTALHRQKNEAHWVGCYGKVGRAARREQGRILIGTQVLEQSLDIDADFLVSRMAPSDMLFQRLGRLWRHEQTPRPPEAIREAWILAPDLDAARQDPYQAFGATAHVYSPYLLCRSLEVWLVQVKVGMVSLPEHIRTLVESTYRERSEDDAMARWKRELFEGSHRRKGVNTLRQLARLTLSKGGKTLPEAKAQTRYSEQESGDLLLLSGLSLNNHDQATTLTFLDGEQIVIPWHGHRLTPAEWRNRAARVTQHLVSCCLSQLPRPAERLWCQKTGLGHVLYLGNPDQDDAAISIALVATDHQLHAVDGRSAPLSDRLSYRYRDDIGLIITQYKE >NZ_CP016043.1|WP_070244389.1|864083_864638_+|hypothetical-protein MERFILSGTCFYELNGLRYLLQEAGYPVFDEVAVKTFGPDDVFVLALSAEPLLGWGRHVRYIRHCRRRLPCRMVVLVPPSLGTLRVFDGTCPVISGHLPRAELISQLLTLCRDALALPREPQSPFRLLNVKQGGSRRQLLQQYRENTPLRRMAKSDYYHRGRLLDVLGIEKMQTLSIVGQELLS >NZ_CP016043.1|WP_070244581.1|861467_862325_-|bifunctional-methylenetetrahydrofolate-dehydrogenase/methenyltetrahydrofolate-cyclohydrolase-FolD 
MTAKIIDGKTIAQQVRSEVAAQVQQRLAEGKRAPGLAVIMVGDDPASRIYVGSKKRACEEVGFLSRAYALPENTHQAELLALIDTLNADAAIDGILVQLPLPAGIDNSKVLERIRPDKDVDGFHPYNLGRLCQRTPKLRPCTPRGIITLLERCGIETQGMDAVMVGASNIVGRPMALELLLAGCTTTITHSRTRDLQQHVERADLIVAAVGKPNFIPGAWVKPGAVVIDVGINRLESGKVVGDVDFAGAAQRASWITPVPGGVGPMTVATLMQNTLQACEAFHDC >NZ_CP016043.1|WP_070244590.1|876180_876384_+|type-II-toxin-antitoxin-system-prevent-host-death-family-antitoxin MLIYTSTQARAKISAVLDAVSRGEVVEITRRNGAVAVVISKAEFEVYQKVKLDNECDSQIIAFQLSR >NZ_CP016043.1|WP_024523759.1|876905_877451_-|helix-turn-helix-transcriptional-regulator MTQVTVYTDNNLLANFICDLILNIEENASILEYQPLKLLRCENEIIVFNLIRSSHNIVATINFLNKYKIRLSRMIVSMIVPSKLIDLCLELSLFKISYLLTEKSTPNDYARLLHGNIPSAAPRKNILSCRERTILQLLLQEYSPQGVANELHISYKTVCAHKLNIMKKLQLKNLSGIFMYC >NZ_CP016043.1|WP_081702253.1|877610_878099_-|hypothetical-protein MHDKVKAYSDQLNQAESAARLEADKTELSARLAGDKQEQATRLESDQKEQSARAQADYAESIARANGDKQTLASANHYTDEKVNRTEKRLNAGLAGIAAISSIPYVNGNTFSYGVGVGNYRNGNAAAMGMQYKISHNINARLNASWDSSHNTAVGFGLAAGW >NZ_CP016043.1|WP_156774553.1|880313_880478_+|hypothetical-protein MLIAHIPALPLRYRGKRGSVGLVLEATDGLCQLYALRLQRGGGFANGGGWLLGT >NZ_CP016043.1|WP_024523133.1|881010_881247_-|hypothetical-protein MSQKNRQVRFILLFIIFFIVLIAVGNIIVKRRISPQLVETEVRNIKLTAEAQSIIKAQITREPAQQRAISESVRSLPQ >NZ_CP016043.1|WP_024523134.1|882302_882635_+|acid-resistance-protein MKVNTTFLGASALALTLALAGSACAQEPTMTTVTTPETMTCHEFTQMNPKAMTPVMVWVVNQDRQYKGGDYVDWQKIQTVMVPKVMKICKEQPGKKVIEFRNQVQDLISD >NZ_CP016043.1|WP_070244593.1|882739_884617_-|bifunctional-glutathionylspermidine-amidase/synthase 
MSVETHHNDAPFGTLLGYAPGGVAIYSSDYSTLDPRVYPDEASLRSYIDDEYMGHKWQCVEFARRFLFINYGVVFTDVGMAYEIFSLRFLRQVVNDNLLPLYAFANGSPRPPVAGALLIWQKGGEFKGTGHVAVITQLRGDKVRIAEQNVIHAPLPPGQQWTRELTLQRENGRYTIQDTFDDTEILGWMIHTEDARDSLAQPTLAPQAMAIHAARRPDRALFEGRWLNEDDPVELSYVQANQGHVINHDPSQYFTISESAEQELIKATNELHLMYLHATDKVLRDDSLLALFDIPKILWPRLRLSWQQRRHCMITGRLDFCMDERGLKVYEYNADSASCHTEAGLILQHWAERGDGVNGYNPGEDLLNELAGAWRHSHAHPFVHIMQDEDEEESYHALFMQRALSQAGFDSKIVKGLAPLRWDATGQLIDDEGRLVTCVWKTWAWETAIEQVREVSDAEYAAVPIRTGKPEKQVRLIDVLLRPEIMVFEPLWTVIPGNKAILPILWSLFPNHRYLLNTDFVPNTALARSGYAVKPIGGRCGSNIDLVSRHEEVLDTTSGKFHDQKNIYQQLWCLPQVADKHIQVCTFTVGGSYGGACLRSDRSLVIKKESDIEPLVVLKDSAFLR >NZ_CP016043.1|WP_156774603.1|885079_885628_+|lipid-IV(A)-palmitoyltransferase-PagP MHHYISALASLCAFFTWGASASTPSLVETLRANVVQTWQQPQHHDFYLPAITWHARFAYSREKIESYNERPWGAGFGQSRWDEKGNWHGLYLMAFKDSFNKWEPIGGYGWEATWRPIADSDFHWGAGYTLGVTMRDNWKYIPIPVVLPMASLGYGPLTMQMTYIPGTYNNGNVYFAWLRFQF >NZ_CP016043.1|WP_083275060.1|885765_887001_+|SpoIIE-family-protein-phosphatase MVAENSLLADTVLIVDDSPGYRRLLATILARWQYRVIEAEDGEQALACLARHQVHIVISDWEMPLMDGATLCRAIRAQDYGHYVYLILLTIRQSSEDLVAGMEAGADDFLTKPLNQGQLRSRLHAAQRIIQLESTLAARNATLAHAYQQIESDLQAAAAMQRSLLPSHDQTINGYHADWLFLPSTYVSGDLLNYFMLDAHHLGFYCVDVAGHGVSAAMLAQSVAREFTSALLTHSLLFRSPDTSPAAPQAVVSELNRRFCLEPQDDGIVRYFTLIYGVLDTRDGRLRLCQAGHPTPLWFQADGGLRRVGDGGLPIGLFDWATYEDHALLLAPGDRLCLYSDGISECYSPQGEQFGEARLCQVLQAPRPTSVPATLARLAEALAQWHSPAAVTPRQPFADDISLLMITRCAD >NZ_CP016043.1|WP_070244596.1|887024_887363_+|STAS-domain-containing-protein MNIAVEEWEGVTVVSPLIRRLDASVAGIFRQEVVTLIEQGHHQLLLDFSQVDFIDSSCLGALVSLLKLLNNRGDLRLCGLNDNILGMFRITRMDRVFHIGVDRQQALARQFG |
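The spacer records for NZ_CP016043_1 appear to follow a FASTA-like layout: `>array.index|start|length|contig|tools`, then the sequence. The sketch below reads such records in Python, assuming that field order, which is inferred from the records above rather than from a documented schema. The names `Spacer` and `parse_spacers` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Spacer:
    spacer_id: str      # e.g. "1.1" (array number, spacer index)
    start: int          # start coordinate on the contig
    length: int         # declared spacer length in nucleotides
    contig: str         # contig accession
    tools: list[str]    # detection tools that reported the spacer
    sequence: str


def parse_spacers(text: str) -> list[Spacer]:
    """Parse '>id|start|length|contig|tools SEQUENCE' records (assumed layout)."""
    spacers = []
    for chunk in text.split(">")[1:]:
        tokens = chunk.split()
        header, sequence = tokens[0], tokens[1]
        spacer_id, start, length, contig, tools = header.split("|")
        spacers.append(
            Spacer(spacer_id, int(start), int(length), contig, tools.split(","), sequence)
        )
    return spacers


# First two spacers of NZ_CP016043_1, copied from the table above.
records = (
    ">1.1|873517|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT TGACCCGTTTCTCCTTGGCGCGCTCAATAACCG "
    ">1.2|873578|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT TGGCATCCTGCCCATAGCGCCCTCGGGTATCCG"
)
spacers = parse_spacers(records)

# The gap between consecutive spacer starts is one spacer plus one repeat.
repeat_length = spacers[1].start - spacers[0].start - spacers[0].length
print(repeat_length)
```

For these two records the gap works out to 28 nt, which matches the length of the repeat `GCTTTCCCCACGACCGTGGGGGTGTTTC` shown in the array listing.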
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP016043_2 | 991737-991821 | Orphan |
NA
Consensus repeat of NZ_CP016043_2
|
1 spacer
spacers of NZ_CP016043_2
>2.1|991761|37|NZ_CP016043|CRISPRCasFinder CGTCGCCGTGGCCGCTCGCCAGGCCTTCATAGTCGAT |
CRISPR arrays and Neighbor proteins around NZ_CP016043_2
The CRISPR arrays of NZ_CP016043_2 >merge|NZ_CP016043|2|991737-991821|CRISPRCasFinder CCGTTATATCCGCAGGCGGGGAGGCGTCGCCGTGGCCGCTCGCCAGGCCTTCATAGTCGATCCGTTATATCCGCAGGCGGGGGAG >NZ_CP016043|2|2|991737-991821|CRISPRCasFinder CCGTTATATCCGCAGGCGGGGAGG CGTCGCCGTGGCCGCTCGCCAGGCCTTCATAGTCGAT CCGTTATATCCGCAGGCGGGGGAG
>NZ_CP016043.1|WP_024523228.1|991109_991646_-|GNAT-family-N-acetyltransferase MQHTILQAGPELQLIPAHPRFAQALFTLVEQNREYLCRFLSWPHSMVHVDNLSQSLHAQAEAHHAGSARHYIIHYQQHCIGVIALNSIDHQRRCAPIGYWLAQSYQGKGLISASLQALMQHYVARREVTRFVIQCISDNLRSNAVARRNGFTLVETRHQACELEGIRYDQNTYLRRFD >NZ_CP016043.1|WP_024523227.1|990476_990815_-|phnA-family-protein MQLPHCPKCHSEYTYQDNDMYICPECAYEWNDATDAAPEEESLIVKDANGNLLADGDSVTVIKDLKVKGSSSMLKIGTKVKNIRLVEGDHNIDCKIDGFGPMKLKSEFVKKN >NZ_CP016043.1|WP_070244633.1|988956_989676_-|LuxR-family-transcriptional-regulator MIADNESLAQDIKLFIDQSLSYYGQLQFAYLLLNKKNPTEITIISNYPDEWVKLYQEHHYQQIDPVVICALRRTSPFLWDEKITVNAKLNLSKIFSLAKKYQVNKGYTFVLHDAENQLAMLSMMVDEHSLNIVEQNQGALQMLLINAHERFMAGQRQIQAQQHNIKNNNMENIFSARENEILYWASMGKTYQEIALILGIKSGTVKFHIGNVVKKLGVLNAKHAIRLGVELQLIKPVER >NZ_CP016043.1|WP_024523225.1|988307_988976_+|GNAT-family-N-acetyltransferase MFEIFAVDYQSLSRTTSSELFSLRKDTFKDRLNWAVNCSDGMEFDEYDSEHTTYLLGVKDNDVVCSVRLIETRYHNMIVGTFFQYFAKAEIPASSQFFESSRFFVDKYRARTLLANQYPLCHLLFLAMINYTLASGHQGIYTIVSQPMLRILNRSGWNVSVVEQGISEKDQAIYLVYLPADAENQRSLIERINQELQHPWQEALLQAWPLVLEEETISAQPV >NZ_CP016043.1|WP_070244632.1|986967_987711_+|type-2-GTP-cyclohydrolase-I MDNWLLEQQINQLLNVAEIQDYAPNGLQVEGRREVRRVITGVTACQALLDAALQAEADAVLVHHGYFWKNETPTIRGMRRQRLKTLLANDINLYAYHLPLDAHPQLGNNAQLAQLLKITPQGLIAPLLPYGDLAEPCSAGEMIGRLERKLHHSVLHSGDNAPALIRRVAWCTGGGQGFIEQAASFGVDAFITGEVSEQTIHTAREMGLHFFAAGHHATERGGVRALGAWLAQEYGLEVTFIDIANPA >NZ_CP016043.1|WP_083274991.1|985849_986911_+|3',5'-cyclic-nucleotide-phosphodiesterase MTGDESIWDDRESKQMLTKRILWCGVVGAWLLYGLLALPALAGFQVVALGSGGGLSGDNLPAYLIRHERDTGYVALDAGSTLPGIAKALAQGAFPEASAERAAPWTPQGYVLRELITAYFISHPHLDHVAGLLLAAPEDSRKPIYTLASSAETLRTHYFNWKSWPNFSDAGQGQRLGTYRIHSVRPAQRFSLGNSGMSAQVYPLSHAGVTSAMILLERAGEYFAYFGDTGADSVEQSNHLDRIWRRLGPLLASGALKGMIIETSFSDAVPPSHLFGHLTPRLLNQELVQLARYSGDSAALQGFPVVIAHIKPSLRAGETAEQTIMAQLAAGNRSGVRFHHLRQGEHALFSGRE >NZ_CP016043.1|WP_070244631.1|984432_985863_+|deoxyribodipyrimidine-photo-lyase 
MTTHVVWLRNDLRMNDNRALHAACAAPHARVLALYVATPRQWQMQDMAPRQAQFIWQNLRLLQAELAARHIALHCLSVADYDAQRQAVAQFCDQHQATALFFNRQYELNERRRDAALCATLPIPCHRFDDALLLPPGTVLTGSGQMFKVFTPFKRAFLSLLSQHDIAPLPAPAPRAPQPAAPLLTTPFDYPAAAIDATLFPAGECAALARLAHFCHNDLATYALRRDFPAQAGTSLLSPYLTLGILSPRQCVAAMRERLSCAPNAASGSEAWLNELIWREFYRHLLVAWPDLCRHRPFIAWTARIRWRDDAQGLQHWQQGMTGFPLIDAAMRQLNHCGWMHNRLRMLTASFLVKDLLIDWRLGERYFLSQLLDGDLAANNGGWQWAASTGSDAAPYFRIFNPTTQGQRYDALGLFIRRWLPALRDVPDSEIHHPQRWALRQRRVLDYPDPLVDHARARRDTLLAFQRARAADDDGG >NZ_CP016043.1|WP_024523221.1|983469_984423_+|DUF523-and-DUF1722-domain-containing-protein MSSAIPVGISACLLGQTVRFDGGHKRLALACETLAPFFHFLPVCPEMGIGLPSPRPALRLMRREASSEIALVDSRDPSLDYTAAMQAFSARQLPQLHALCGFILCARSPSCGMERVKLYSGQEARKSGVGLFAAALMAAMPWLPVEEDGRLSDALLRENFIARVYALHEFNQLWRQGLTRGALVAFHSRYKLLLLAHSQSDYRALGRLVAAIAQYDSLAQFAADYRVRLMALMRQPATRRNHTNVLQHVQGYFSPRLSAAQRAELSELILQYRQGTQPLLAPLTLLKHYLREYPDDYLASQRYFSPYPDVLRLRYGH >NZ_CP016043.1|WP_024523220.1|983172_983376_+|YbfA-family-protein MPYPPYSWSRILLRRCCVILVGALALPVMLWRKDRARFYSYLHRVWCKTSDKPVWLSESEKVKPDFF >NZ_CP016043.1|WP_024523219.1|982519_983020_+|lactoylglutathione-lyase-family-protein MAYPRSFSHIGISVTDLARAVDFYTSVMGWYLVMPPTEIREDDSAIGVMCNDVFGPGWGSFRIAHLATGDKIGIELFQFPNSEARVNNFEFWKNGVFHFSVQDPDVEGLAARIVAAGGKQRMPVREYYPGEKPYRMVYMEDPFGNIIEIYSHSYELTYSAGAYQNV >NZ_CP016043.1|WP_024523229.1|991857_993141_-|citrate-synthase MADNKATLTVGSERIELDVLSGTLGYDEIDIRKLGSHGYFTFDPGFTSTASCESQITYIDGDEGILLHRGYPIDQLAKHSSFLEVCYILLYGEPPTQAEYDTFKTTVTRHTMIHEQITRLLQGFRRDSHPMAVMCGVTGALAAFYHDSLDISNERHREIAAFRLLSKMPTVAAMCYKYSLGQPFIYPQNDLSYAGNFLRMMFATPCEEYQVNPVLERAMDRILILHADHEQNASTSTVRTAGSSGANPFACIAAGIASLWGPAHGGANEACLKMLEEINHVDHIPAFIKRAKDKNDSFRLMGFGHRVYKNYDPRATVMRETCHEVLKELGMNDNLLEVALELEHIALNDPYFIEKKLYPNVDFYSGIILKAMGIPSTMFTVIFAIARTVGWIAHWNEMHEDGLKIARPRQLYTGYARRDFSSQLERR >NZ_CP016043.1|WP_024523230.1|993780_994170_+|succinate-dehydrogenase-cytochrome-b556-subunit MGNTVKKQRPVNLDLPTIRFPITAIASILHRVSGVIVFVSIAILLWLLGLSLSSAEGFAQASALVDGLLVKLVLWGILTALAYHICGGLRHLLMDFGYLEETFSVGCRSAQVAFAVTVLLSICAGVWLW 
>NZ_CP016043.1|WP_024523231.1|994163_994511_+|succinate-dehydrogenase-membrane-anchor-subunit MVSNASALGRNGVQDWLLLRASAIIMTLYVIYLLAFIAVAGPLNYGLWLDFFSSRLTQVFTLLTLLCVLVHAWIGMWQVLTDYVKPLALRLLLQLAIVVVLVVYLGYGTLVVWGI >NZ_CP016043.1|WP_024523232.1|994511_996278_+|succinate-dehydrogenase-flavoprotein-subunit MKLPVREFDAVVIGAGGAGMRAALQISQQGFRCALLSKVFPTRSHTVSAQGGITVALGNTHEDNWEWHMYDTVKGSDYIGDQDAIEYMCKTGPDAILELEHMGLPFSRLDDGRIYQRPFGGQSKNFGGEQAARTAAAADRTGHALLHTLYQQNLKNHTTIFSEWYALDLVKNADGAIVGCTALCIETGEVVYFKSRATILATGGAGRIYQSTTNAHINTGDGIGMALRAGVPLQDMEMWQFHPTGIAGAGVLVTEGCRGEGGYLLNKHGERFMERYAPNAKDLAGRDVVARSIMIEIREGRGCDGPWGPHAKLKLDHLGKEVLESRLPGILELSRTFAHVDPVKEPIPVIPTCHYMMGGIPTRISGQALTQDTNGADQVIPGLFAVGEIACVSVHGANRLGGNSLLDLVVFGRSAGLHLAQSLQEQGPLRQASESDIEASLARLHRWNGTRKGEDPAQIRKDLQSCMQHNFSVFREGEAMAQGLAELKTIRERLASARLDDTSSEFNTQRIECLELDNLMATAFATAMAANYRTESRGAHSRFDFPQRDDANWLCHTLYLPESERMVQRHVNMQPKLRAAFPPKVRTY >NZ_CP016043.1|WP_024523233.1|996296_997013_+|succinate-dehydrogenase-iron-sulfur-subunit MKLEFSIYRYNPDVDRAPHMQDYTLEAEEGRDMMLLDALIRLKEQDPTLAFRRSCREGVCGSDGVNMNGKNGLACITPISALRRGGRKIVIRPLPGLPVVRDLVVDMGQFYAQYEKIKPYLINDGRTPPAREHLQSPEERAKLDGLYECILCACCSTACPSFWWNPDKFVGPSGLLAAYRFLIDSRDTASAQRLEGLDDAFSVFRCHGIMNCVSVCPKGLNPTRAIGHIKSMLLHREA >NZ_CP016043.1|WP_070244634.1|997355_1000163_+|2-oxoglutarate-dehydrogenase-E1-component 
MQNGTMKAWLDSSYLAGANQSYIEQLYEDYLTDPDSVEHSWKLLFQQLPANGLPPDQFHSQTRDYFRRLAKDPARFGQRFNDPQTDAKQVKVLQLINAFRFRGHQQANLDPLGLWKQEPVPDLDPAFHHLSAEDFDETFNVGSFAVGSETMRLADIYRALQQTYCGTIGAEYMHLTNTDEKRWLQQRLESVMGQPSFNPQEKRRFLRELTAAEGLERYLGAKFPGAKRFSLEGGDALIPMLKELIRHAGLHGTREVVLGMAHRGRLNVLINVLGKHADELFDEFAGKHKDHLGTGDVKYHMGFSSDMATEGGPVHLALAFNPSHLEIVSPVVMGSVRARRDRLDRTRSDIVLPITIHGDAAITGQGIVQETLNMSQARGYEVGGTVRIVINNQIGFTTSNPLDARSSQYCTDIGKMVQAPIFHVNADDPEAVAFVTRLALDFRNTFKRDVFIDLVCYRRHGHNEADEPSATQPVMYQKIKKHPTPRKLYADRLMAQGIVSLEEATEMVNLYRDALDSGGCVVEEWRPMTMHSVTWEPYLHHEWDEPYPHAVETQRLQDLARRISRVPEEVEMQPRVAKIYADRAAMAEGSQPFDWGGAETLAYATLVDEGIPVRLSGEDCGRGTFFHRHAVIHSQKDGALYVPLENVHHAQGDFKVWDSVLSEAAVLAFEYGYASAEPRTLTIWEAQFGDFANGAQVVIDQFISSGEQKWGRLCGLVMLLPHGYEGQGPEHSSARLERYLQLCAQQNIQVCVPSTPAQVYHMLRRQALRGMRRPLVVMSPKSLLRHPLAVSSLEALAQGSFQPAIGEIDALDPQQVKRVVMCSGKVYYDLLEQRRKNGQENVAIVRIEQLYPFPHQAVQAVLAAYTQARDFVWCQEEPLNQGAWYCSQHNLREVIPFGAVLRYAGRPASASPAVGYLSVHREQQQALVDDALHVE >NZ_CP016043.1|WP_024523235.1|1000222_1001443_+|2-oxoglutarate-dehydrogenase-complex-dihydrolipoyllysine-residue-succinyltransferase MSSVEILVPDLPESVADATVATWHKQVGESVARDEVLVEIETDKVVLEVPALDAGVLEAILEPEGATVGARQLLGRLRPADVSGVAIGSGPQVAQATPAERHTAALDGGNNDALSPAVRRLVAEHDLDPAALQGSGVGGRLTREDVEKHLSAQPVTPPSAELPRAAASAAPLTAEREKRVPMTRLRKRVAERLLEAKNSTAMLTTFNEVNMQPIMALRSQYGEAFEKRHGVRLGFMSFYVKAVLEALKRYPEVNAALDGEEVVYHNYFDISIAVSTPRGLVTPVLRDVDTLSMAEIEKRIKTLAVKGRDGKLTVEELTGGNFTITNGGVFGSLMSTPIINPPQSAILGMHAIKDRPMAVNGQVVILPMMYLALSYDHRQIDGRESVGFLVTVKEMLEDPTRLLLDI >NZ_CP016043.1|WP_024523236.1|1001522_1002689_+|ADP-forming-succinate--CoA-ligase-subunit-beta MNLHEYQAKQLFARYGLPTPVGYACSTPRQAEEAASKIGSGPWVVKCQVHAGGRGKAGGVKCVARKDEIRAFAEQWLGKRLVTYQTDAQGQPVRQILVEGATEIAHELYLGAVIDRSSRRVVFMASTEGGVEIEQVAQQTPHLIHRVALDPLTGPQPYQGRELAFKLGLSGKQAQQFGQIFLGLATLFLQCDLTMAEINPLVITPQGDLLCLDGKLDVDSNALFRQPALREMEDPEQNDAREAHAAQWELNYVALEGNIGCMVNGAGLAMGTMDIVKLHGGAPANFLDVGGGATKERVTEAFKIILSDEHVKAVLVNIFGGIVRCDLIADGIIGAVAEVGVHVPVVVRLEGNNAELGTQILADSGLNIIAATSLTDAARQVVAAVEGK 
>NZ_CP016043.1|WP_024523237.1|1002688_1003567_+|succinate--CoA-ligase-subunit-alpha MSILINRETRVICQGFTGSQGTFHSEQALAYGTRLVGGVTPGKGGGEHLGLPVFNTVREAVQATAASASVIYVPAPFCKDSILEAIDAGITLIICITEGIPTQDMLLVKAKLDQCPGVRMIGPNCPGVITPGECKIGIMPGHIHQPGRIGIVSRSGTLTYEAVKQTSDVGLGQSTCVGIGGDPIPGSSFIDILALFQADPQTDAIVMIGEIGGNAEEEAAAYIKQHVSKPVVAYIAGVTAPKGKRMGHAGAIIAGGKGTADEKFAALEAAGVTTVRSLAEIGQTLLRVLERA >NZ_CP016043.1|WP_024523238.1|1004256_1005819_+|cytochrome-ubiquinol-oxidase-subunit-I MFDIVELSRLQFALTAMYHFLFVPLTLGMAFLLAIMETVYVLTGKQIYKDMTKFWGKLFGINFALGVATGLTMEFQFGTNWSYYSHYVGDIFGAPLAIEGLMAFFLESTFVGLFFFGWDRLGKVQHMLTTWLVALGSNLSALWILVANGWMQNPIASDFNFETMRMEMVSFSELVLNPVAQVKFVHTVSAGYVTGAMFILGISAYYLLKGRDLAFAKRSFAIAAAFGMASVIAVILLGDESGYEMGDVQKTKLAAIEAEWETQPAPASFNLIALPDQQTESNHYAVQVPYLLGLIATRSLDTPVIGLKDLMKEHEVRIRNGMKAYQLLQELRTGNTDPAVRDAFNHAKQDLGYGLLLKRYTDNPAQASEEQIAKATKDSIPEVAPLYFAFRIMVGCGILMLLVIFASFYSVVRGRVGEKRWLLRAALLGIPLPWIACEAGWFVAEYGRQPWAIGEVLPTAVANSSLTAGDLWFSIILICGLYTLFLVAELYLMFKFARLGPSSLKTGRYHFEQTHAVDAQ |
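The neighbor-protein records use a second FASTA-like header: `>contig|protein accession|start_end_strand|description`, with hyphens standing in for spaces in the description. The sketch below is one way to split such a header, assuming that layout, which is inferred from the records above. `NeighborProtein` and `parse_neighbor` are hypothetical names.

```python
from typing import NamedTuple


class NeighborProtein(NamedTuple):
    contig: str
    protein_id: str
    start: int
    end: int
    strand: str
    description: str
    sequence: str


def parse_neighbor(record: str) -> NeighborProtein:
    """Parse '>contig|protein|start_end_strand|description SEQUENCE' (assumed layout)."""
    header, sequence = record.lstrip(">").split()[:2]
    contig, protein_id, location, description = header.split("|", 3)
    start, end, strand = location.split("_")
    # Descriptions use hyphens in place of spaces; they are left unchanged here
    # because genuine hyphens (e.g. "N-acetyltransferase") cannot be told apart.
    return NeighborProtein(
        contig, protein_id, int(start), int(end), strand, description, sequence
    )


# One neighbor of NZ_CP016043_2, copied from the table above.
record = (
    ">NZ_CP016043.1|WP_024523220.1|983172_983376_+|YbfA-family-protein "
    "MPYPPYSWSRILLRRCCVILVGALALPVMLWRKDRARFYSYLHRVWCKTSDKPVWLSESEKVKPDFF"
)
protein = parse_neighbor(record)

# Compare the nucleotide span implied by the coordinates with the codon count.
span_nt = protein.end - protein.start
codons = len(protein.sequence) + 1  # +1 for the stop codon
print(span_nt, 3 * codons)
```

For this record both values are 204. That suggests, but does not prove, that `end - start` is the gene length including the stop codon. It is worth checking on other records before relying on that coordinate convention.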
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP016043_3 | 1740587-1740980 | Orphan |
I-B,III-A,III-B
Consensus repeat of NZ_CP016043_3
|
6 spacers
spacers of NZ_CP016043_3
>3.1|1740615|33|NZ_CP016043|CRISPRCasFinder CGCTTCACAGAGTAGCTGGCCGACTTCGGTCTA
>3.2|1740676|33|NZ_CP016043|CRISPRCasFinder,PILER-CR TCATGTCATGAACGTTCATGCGGCTTTTCCTTG
>3.3|1740737|33|NZ_CP016043|CRISPRCasFinder,PILER-CR TTTTACCTCCTTCGCAAATTTGTGTCCCCAGCG
>3.4|1740798|33|NZ_CP016043|CRISPRCasFinder,PILER-CR ACTGCCTGCACTGGCAAGAGCTACGTGTACTGA
>3.5|1740859|33|NZ_CP016043|CRISPRCasFinder,PILER-CR GCATCGTCCAGGCGACGACCACGCGCAAAGGGA
>3.6|1740920|33|NZ_CP016043|CRISPRCasFinder,PILER-CR GCTGTGGTGCTGCTGGCAATGGCATCCAGTGAG
>3.7|1740615|37|NZ_CP016043|CRT CGCTTCACAGAGTAGCTGGCCGACTTCGGTCTAGAAC
>3.8|1740676|37|NZ_CP016043|CRT TCATGTCATGAACGTTCATGCGGCTTTTCCTTGGAAA
>3.9|1740737|37|NZ_CP016043|CRT TTTTACCTCCTTCGCAAATTTGTGTCCCCAGCGGAAA
>3.10|1740798|37|NZ_CP016043|CRT ACTGCCTGCACTGGCAAGAGCTACGTGTACTGAGAAA
>3.11|1740859|37|NZ_CP016043|CRT GCATCGTCCAGGCGACGACCACGCGCAAAGGGAGAAA
>3.12|1740920|37|NZ_CP016043|CRT GCTGTGGTGCTGCTGGCAATGGCATCCAGTGAGGAAA |
CRISPR arrays and Neighbor proteins around NZ_CP016043_3
The CRISPR arrays of NZ_CP016043_3 >merge|NZ_CP016043|3|1740587-1740980|CRISPRCasFinder,CRT,PILER-CR GAAGCACCCTCACGGGCGTGGGGAAGACCGCTTCACAGAGTAGCTGGCCGACTTCGGTCTAGAACCCCCCCCACAGACGTGGGGAAGACTCATGTCATGAACGTTCATGCGGCTTTTCCTTGGAAACACCCCCACAGGCGTGGGGAAGACTTTTACCTCCTTCGCAAATTTGTGTCCCCAGCGGAAACACCCCCACAGGCGTGGGGAAGACACTGCCTGCACTGGCAAGAGCTACGTGTACTGAGAAACACCCCCACAGGCGTGGGGAAGACGCATCGTCCAGGCGACGACCACGCGCAAAGGGAGAAACACCCCCACAGGCGTGGGGAAGACGCTGTGGTGCTGCTGGCAATGGCATCCAGTGAGGAAACACCCCCACAGGCGTGGGGAAGAC >NZ_CP016043|3|3|1740587-1740980|CRISPRCasFinder GAAGCACCCTCACGGGCGTGGGGAAGAC CGCTTCACAGAGTAGCTGGCCGACTTCGGTCTA GAACCCCCCCCACAGACGTGGGGAAGAC TCATGTCATGAACGTTCATGCGGCTTTTCCTTG GAAACACCCCCACAGGCGTGGGGAAGAC TTTTACCTCCTTCGCAAATTTGTGTCCCCAGCG GAAACACCCCCACAGGCGTGGGGAAGAC ACTGCCTGCACTGGCAAGAGCTACGTGTACTGA GAAACACCCCCACAGGCGTGGGGAAGAC GCATCGTCCAGGCGACGACCACGCGCAAAGGGA GAAACACCCCCACAGGCGTGGGGAAGAC GCTGTGGTGCTGCTGGCAATGGCATCCAGTGAG GAAACACCCCCACAGGCGTGGGGAAGAC >NZ_CP016043|3|2|1740591-1740980|CRT CACCCTCACGGGCGTGGGGAAGAC CGCTTCACAGAGTAGCTGGCCGACTTCGGTCTAGAAC CCCCCCCACAGACGTGGGGAAGAC TCATGTCATGAACGTTCATGCGGCTTTTCCTTGGAAA CACCCCCACAGGCGTGGGGAAGAC TTTTACCTCCTTCGCAAATTTGTGTCCCCAGCGGAAA CACCCCCACAGGCGTGGGGAAGAC ACTGCCTGCACTGGCAAGAGCTACGTGTACTGAGAAA CACCCCCACAGGCGTGGGGAAGAC GCATCGTCCAGGCGACGACCACGCGCAAAGGGAGAAA CACCCCCACAGGCGTGGGGAAGAC GCTGTGGTGCTGCTGGCAATGGCATCCAGTGAGGAAA CACCCCCACAGGCGTGGGGAAGAC >NZ_CP016043|3|2|1740648-1740980|PILER-CR GAACCCCCCCCACAGACGTGGGGAAGAC TCATGTCATGAACGTTCATGCGGCTTTTCCTTG GAAACACCCCCACAGGCGTGGGGAAGAC TTTTACCTCCTTCGCAAATTTGTGTCCCCAGCG GAAACACCCCCACAGGCGTGGGGAAGAC ACTGCCTGCACTGGCAAGAGCTACGTGTACTGA GAAACACCCCCACAGGCGTGGGGAAGAC GCATCGTCCAGGCGACGACCACGCGCAAAGGGA GAAACACCCCCACAGGCGTGGGGAAGAC GCTGTGGTGCTGCTGGCAATGGCATCCAGTGAG GAAACACCCCCACAGGCGTGGGGAAGAC
>NZ_CP016043.1|WP_070244877.1|1740285_1740561_+|PerC-family-transcriptional-regulator MRGKPIVWRHHNLRQIQEHMLNAERCEQRGLWRRAGHEWMQVIEHCTDDVLVEYAVQQRNYCAQMGVFGSASIDPRMVAANQCDEPLQDEG >NZ_CP016043.1|WP_070244876.1|1739561_1740296_+|DNA-replication-protein MKNLVSAVQRRDAAALSRMAGQPLQERVVNGNAEKLVDVLFENLLLLFPASRNTVFAAPDEVAAMKRQWITAFAEGGITTLEQVKAGVSMARQHGGDFWPSCGRFMEWCREGVRSAGGLPSDDEVLAEFHRYARDKARFASPEAFDWAHPVMYWVVLDVRQRMYRYNYTEAEVLRAIKAQMQRWERNIRAGQRIPTPVKQLVHVQRPPAIADQLDPTGGAGFYQVGVAFLEQIRQRLRGGEHEG >NZ_CP016043.1|WP_070244875.1|1738461_1739565_+|replication-protein-O MSSLIQILDRPIAYNPALAKLRAGKVKAGPVAAVFLSQMIYWHNRMGGEWMYKTQADITTETALTRDEQETARKRLVALGVLDEARRGVPATLHYRINVARLEALLLEAATPVATPAPTAKTRTRDIQNSEPSQPGPAHSDQSRMVQSQNVETPQSGLVQPRKLDCGDAANKNVETPQTSMGEPTEQVCGDPANFHTGDYTENTQENKKPSCPDAAQPDEPDSDHDFLSRHPEAVVFSAKKRLWGRQEDLTCAEWIWGRIVRLYELAAEDDGEVVRPKAPNWTVWANEVRLMCHQDGRTHRQICELFGRVNRDPFWCRNVLSPAKLREKWDELVIRLGAPGAGAQDRSLKTLLGADWNTEQGWESVL >NZ_CP016043.1|WP_083275005.1|1738291_1738465_+|DUF4222-domain-containing-protein MRIPKQGSYYQDRNGVVVRITGYERESQRVLYRRPGYEWGCASPLVVFNAKFRRYQG >NZ_CP016043.1|WP_156774564.1|1738127_1738292_+|hypothetical-protein MHTPTAGLLRLRCRVLPEREDYRYEVNVLGRWWPCNYTLARWTVEYCRQGWGGM >NZ_CP016043.1|WP_070244874.1|1737647_1738097_+|hypothetical-protein MGDIKDAVKAMCESMPGGRAAMAGALGMSPTSFNNRLYEKNGCKFFDRHDLEAMEDLSNTHHLADYFAARRGRITVRVQSRDELDPVELFTLATLTAAHKGQVDLAIQHSISDGIINSSEERDILALHSQYVAARDAYVRAIIALHKAQ >NZ_CP016043.1|WP_070245634.1|1737285_1737471_+|hypothetical-protein MLKQTVVKHFGSQRAVAQALQVSDSAVSQWKTLIPERAALKLHRITAGKLKYSPCFYQKSS >NZ_CP016043.1|WP_167352271.1|1736556_1737189_-|helix-turn-helix-domain-containing-protein MGSRILKRRKELKLSQVTLSKAVGVSNVAISQWERDETAPRGDALLALARELLCPAEYLVNGTPADTPLAIPVALHPKGKYPLLSWTLVNHGSLAIRSYTREKAEHWYSTTVDCSAASFWLTVEGDSMTATAGLSIPEGTAILVDPDRNPTNGKLVVAASLSDDEAIFKRYILDVGKKYLKPLNTMYSMVEINDNYEIIGVVVEARIAIP >NZ_CP016043.1|WP_070244872.1|1734771_1735650_-|hypothetical-protein 
MIKTPTDLESYMQYVYSCLLNLQNEGVVVSRRAILKGKSTNHEIDVFYQFERAGVIHKVAIECKYLSRPVEKKDVMVFRGRLEDIGNIQGIMVSKFGYQKGAYEYAKHYDIDLKTIDDVPSLNIITAEQVKSGGLPSKNNIGQPFWILMEKYLDNVDAIYYGVNDDCDGKFTIPLFLSKRDAIHFLKKKKLQKKFAIRGLTQRNLEVLIGFGKVGGCKFYLMPSPYNSENNGGIIISPDALKFNYLISEITEDEYSEDYFVKPKRKHLSLARIVEELMDSRSLELLKNIKRK >NZ_CP016043.1|WP_156774563.1|1732044_1734210_+|hypothetical-protein MSNLIELLTILSKTDWSYGITMNRPIAFADLFQQIRESEGTGFLDRAHQRSFSLNIFKMNALELLWLSQRVRDPKQCIALMAEKNREAGIQAHRELNRHIHNFVSSSLTLVEHTRVFMRTHYAGTEILTIYENKARDTFANSPVAQFVQGLRNYMLHRGLPNSSMFMKFEAAPSEMGGVGTMETGVSYDTSSLLDWKDWKSVARTYIEQAGEHLDIHEVTQEYLALVSRFHDWLDSTLEKYHHSDLQEVSQLKIKLNKISPDNEPTLQTNPSDSLPIEPFMFTSAHAAELELISFDIFGKVKEIRIPHEVDDFATERPITLITGQDIIGDVISWVQDVNGTMSIIFFKYNEKTYGLVESDYKFLNELIDVVMKAAWARTKISRKFVETTFFNWVRQQFPSVQIPFSEALSDAVQKSVMNIEVLAPIANMEVEQGFDFGPVRIDSITANTIENLRSGAPLPSPEQEPDVRQFFEKLKNDIQGYAAVIVSVEAEKKFAAERAFQIAQDAVGLLSFFSPAASCSYIFNSVALAGTEYLPRSKLIVKYEGGFSHTECILPKNIGYWRLSKRKFAEINSELLKAAASLVVSEGLSEFALAVRASILTYSKGTTLIVLQDRLRYCLFALESILLKHDMEPRAYSVINRMCSILVSGGAVGEDVKAVIQQIYWLLDQPQLTELGDRENSLIATFISYTYYILQVVLGNVEHFSYKIQFLDEVDRVDKC >NZ_CP016043.1|WP_083275066.1|1740983_1741880_+|transcriptional-regulator MILLKNILNQLQSKQHKNVIQKIERLDCSPEFASAEFSAHVKNIQAGAVNRDSKYYEMTKDGFVFLVMGFTGKKAAAFKEAYIAEFNRMEAMLRQPHSLPTVHLTIEQQGTLKALVKSRVDALPQNKRAKAAISLWSALKSHFGVSYKAIAADQFTDALSLVARLTLDGEALVPLTNRSRYHFPLECADPHDRGLANAWMTPRVILDIRNRAPELELLEALEQDGHDITGAKIRIHAMYDITGQFVAMQKELATVRSYLSTLNDMLKGRSEERGLNVCFAEPNKGRLFGGFRERGFTR >NZ_CP016043.1|WP_083275006.1|1741809_1742289_+|DUF968-domain-containing-protein MSALPSQIRAVCLVAFGNEALRDSDKVWAWESPHLQFIEVNMVEGIIMLVGDEAPLAGCMLRPKLLRWESSKYTRWVKTQPCCGCGNPADDPHHIINSGLGLGGIGTKTHDLFVIPLCRRCHDELHHDVGGWEQRNGSQLVLLVQFLNRALGIGAIIKA >NZ_CP016043.1|WP_070244878.1|1742567_1743122_+|hypothetical-protein MANGSYGLNLEEIGQSVRNNLQLIIESQGLPLAVGPLTDEDFRILSGGFGELEWDYALTKYGNDPNKFEFCIKLVKQVTETVPSGVALCVYGIDDRVFRIHMIERFCRDDESHPLKGRMVALAIMAAFIFCKAVDAIDVFIMEPVAELVDYYHSFGFVEHESCSYVLRASVNELVSAFEMFAQK 
>NZ_CP016043.1|WP_167352257.1|1743299_1743470_+|hypothetical-protein MKKQVAKSEVRFDTQKAFAGMGAAVELLMRAAPNVLEHKVSGPEKQGKARMRKAAA >NZ_CP016043.1|WP_070244879.1|1743591_1743930_+|phage-holin,-lambda-family MHHNPGSWLEWKELLWGWWQGETPVGGVLLAILTAAVRVTYLGGGWKQTALEGALCGALTLTVVATLDYFNLPKSLTPAIGGAIGFIGVQQVQHFALYILHRKLGLPTDKER >NZ_CP016043.1|WP_070244880.1|1743932_1744484_+|glycoside-hydrolase-family-108-protein MALTKDQIFDALLGREGGYVDHPHDKGGPTKWGITEKVARAHGYTGDMRNLTRAQALKIYESDYWSGPRFDQVAELSARVAAELCDTGVNMGTSVPSKWLQRWLTAFNDGERLYPDISADGVIGPRTLSALRAYLDARGEEGEQVLLRALNCSQGDRYLALAEQRVQNESFLYGWVRERVTLS >NZ_CP016043.1|WP_070244881.1|1745013_1745556_+|DUF2514-family-protein MFNNLWKPLALIALVALLLWGVSTWRYASGYAAGKRLAEQAWQLKWETRNRDEETARANRERGERAEEQRRWQAMIKVKQNADQQLEQIKADAARSTADVERLRRTLSQLRQQLADRSPCRVSTAGGASSASAAAGFLFADVLGESLQRNAALAAYADRARAAGLACERLYDAVTQSRAQ >NZ_CP016043.1|WP_156774565.1|1746957_1748108_-|IS3-family-transposase MTKSVSTSKKPRKQHAPEFRNEALKLAERIGVAAAARELSLYESQLYNWRTQQQQQLSSSDRENELAAENARLKRQLAERDEELVIPPKGSDILCEAPEMKYVFIEKHQAEFSVKIMCRVLRVARSGWYAWRLRRHQLNRRQQFRLVCDAAVRQAFSDAKQRYGTPLLADELPRYNIKTSATSLHRQGLRAKAARKFSPVSYREHGLPVSGNLLKQDFTASGPNQKWAGDITYLRTDGGWLYLAVVIDLWSLAVIGWSMSSRMTAQLACDALQMALWRRKRPENVIVHTDRGGQYCSADYQSLLKRHNLHGSMSAKGCCYDNACVESFFHSLKVECIHGERFIRREIMRTTVFNYIECDYNRWRRHSAGGGLSPEQFENENLA >NZ_CP016043.1|WP_156774566.1|1749440_1750736_+|hypothetical-protein MMGLYLDLDKNDVAELGRCNDFFVRDVKPIAERVENIRLYKKENIEINEYDLLSYHCYVYWVRFYALYVDRVDELDRGTRYNQSVLGEKLIFSREQYKNDALGFLSNLCRVLYEYNFITGGLESNNNRTIGRSDLDNLADKYHHRSAESQQFAWIRDVMPILIAQYIVTQPNFIDAIKMADDVKKQVDEIEVRITTKLNRSFFTIENEKKEIKIHVDSAKKEINDHLDSKMAAVREIEGKILAAREHIESDNKNIDRLKEIISNYRSEFNFVGLSQAFEKIRKIKRRGFIYATLCYIVLGVAMLAAPVGAFWLHLTTPSFFSQGLSGLLSLLPLATIELIFFYFFRLSYIEVKSLKIQILQIDVRLSLCAFIHSYMDFRKMNGGDISELLKCFDTMIFSPIQANEGNIPSMFDGSEAIANFLSKVVTGKGQ >NZ_CP016043.1|WP_070244885.1|1750857_1751676_+|hypothetical-protein 
MPIAIEQLIKMFDPRSVSAECLHLIRAVPGITREQILGAFAAVAQRHPLGFDLLLARYREDRQAEQRARRAAADRVCRSPHPPYGTAVCQLAVTVALGRALPAQQVVLAALLRKHGPRATLAAKQLADIQRQQKGLEKARVTLSEDDWRYRRNLAQYDALAGRSVALRRALADWADAEAARSPHCPRCRGSGQLLRPQPHCCDTCGGRGKISVTAEHFRRSLVGEGMVITPERWRAEYQPWVNDTLNGLYQEMQLAGDALSIRLALEGQAVA |
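The NZ_CP016043_3 entry reports 6 spacers but lists 12 records. CRT places the repeat/spacer boundary 4 nt differently from CRISPRCasFinder and PILER-CR: it calls a 24-nt repeat with 37-nt spacers, where the other two call a 28-nt repeat with 33-nt spacers. Grouping records by start coordinate is one way to reconcile the calls. The sketch below does this for four of the records above. The tuple layout and variable names are illustrative choices, not part of the source format.

```python
from collections import defaultdict

# Four records for NZ_CP016043_3 copied from the table above: two spacer
# positions, each reported once at 33 nt and once (by CRT) at 37 nt.
records = [
    ("3.1", 1740615, "CGCTTCACAGAGTAGCTGGCCGACTTCGGTCTA"),
    ("3.2", 1740676, "TCATGTCATGAACGTTCATGCGGCTTTTCCTTG"),
    ("3.7", 1740615, "CGCTTCACAGAGTAGCTGGCCGACTTCGGTCTAGAAC"),
    ("3.8", 1740676, "TCATGTCATGAACGTTCATGCGGCTTTTCCTTGGAAA"),
]

# Collect every call that shares a start coordinate.
by_start = defaultdict(list)
for spacer_id, start, sequence in records:
    by_start[start].append((spacer_id, sequence))

for start, calls in sorted(by_start.items()):
    shortest = min(calls, key=lambda call: len(call[1]))[1]
    for spacer_id, sequence in calls:
        # Bases beyond the shortest call belong to the neighbouring repeat
        # under the other tools' boundary definition.
        extra = sequence[len(shortest):]
        print(start, spacer_id, len(sequence), sequence.startswith(shortest), extra or "-")

unique_positions = len(by_start)
```

In this subset each 37-nt CRT call starts with the corresponding 33-nt call, and the extra four bases (`GAAC`, `GAAA`) are the first four bases of the following 28-nt repeat. The four records therefore collapse to two spacer positions.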
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP016043_4 | 1746286-1746863 | Orphan |
I-B,III-A,III-B
Consensus repeat of NZ_CP016043_4
|
9 spacers
spacers of NZ_CP016043_4
>4.1|1746315|32|NZ_CP016043|CRISPRCasFinder,CRT CAAAGGCCTGCCCCATCCAAGGGATAAACTCC >4.2|1746376|32|NZ_CP016043|CRISPRCasFinder,CRT CGGCATAGGTGTTTTTTGCCTCGCTCTCTTTG >4.3|1746437|32|NZ_CP016043|CRISPRCasFinder,CRT GCATCATGAATGTCATAGCAGACTCTGACATA >4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT GCAGCGCCCGCGCTGCTCTATCAGCCTAATCG >4.5|1746559|32|NZ_CP016043|CRISPRCasFinder,CRT GCCAGCTTTAGGATGTTGCGTGAGGTGTCGGC >4.6|1746620|32|NZ_CP016043|CRISPRCasFinder,CRT GAATCCAGATCTAACAACTTTGGCCACAACAT >4.7|1746681|32|NZ_CP016043|CRISPRCasFinder,CRT GGTGGTATCACCATGAATCACAACGGATCTAC >4.8|1746742|32|NZ_CP016043|CRISPRCasFinder,CRT TATACAAACACTTACCAACGGTCTATTTTTGT >4.9|1746803|32|NZ_CP016043|CRISPRCasFinder,CRT TGCTTCATAGGTCGCCAAATCATGCAATGGCA >4.10|1746315|33|NZ_CP016043|PILER-CR CAAAGGCCTGCCCCATCCAAGGGATAAACTCCA >4.11|1746376|33|NZ_CP016043|PILER-CR CGGCATAGGTGTTTTTTGCCTCGCTCTCTTTGA >4.12|1746437|33|NZ_CP016043|PILER-CR GCATCATGAATGTCATAGCAGACTCTGACATAA >4.13|1746498|33|NZ_CP016043|PILER-CR GCAGCGCCCGCGCTGCTCTATCAGCCTAATCGA >4.14|1746559|33|NZ_CP016043|PILER-CR GCCAGCTTTAGGATGTTGCGTGAGGTGTCGGCA >4.15|1746620|33|NZ_CP016043|PILER-CR GAATCCAGATCTAACAACTTTGGCCACAACATA >4.16|1746681|33|NZ_CP016043|PILER-CR GGTGGTATCACCATGAATCACAACGGATCTACA >4.17|1746742|33|NZ_CP016043|PILER-CR TATACAAACACTTACCAACGGTCTATTTTTGTG >4.18|1746803|33|NZ_CP016043|PILER-CR TGCTTCATAGGTCGCCAAATCATGCAATGGCAA |
CRISPR arrays and Neighbor proteins around NZ_CP016043_4
The CRISPR arrays of NZ_CP016043_4 >merge|NZ_CP016043|4|1746286-1746863|CRISPRCasFinder,CRT,PILER-CR GGAAACACCCCCACAGACGTGGGGAAGACCAAAGGCCTGCCCCATCCAAGGGATAAACTCCAGAAACACCCCCACAGACGTGGGGAAGACCGGCATAGGTGTTTTTTGCCTCGCTCTCTTTGAGAAACACCCCCACAGACGTGGGGAAGACGCATCATGAATGTCATAGCAGACTCTGACATAAGAAACACCCCCACAGACGTGGGGAAGACGCAGCGCCCGCGCTGCTCTATCAGCCTAATCGAGAAACACCCCCACAGACGTGGGGAAGACGCCAGCTTTAGGATGTTGCGTGAGGTGTCGGCAGAAACACCCCCACAGACGTGGGGAAGACGAATCCAGATCTAACAACTTTGGCCACAACATAGAAACACCCCCACAGACGTGGGGAAGACGGTGGTATCACCATGAATCACAACGGATCTACAGAAACACCCCCACAGACGTGGGGAAGACTATACAAACACTTACCAACGGTCTATTTTTGTGGAAACACCCCCACAGACGTGGGGAAGACTGCTTCATAGGTCGCCAAATCATGCAATGGCAAGAAACACCCCCACAGACGTGGGGAAGAC >NZ_CP016043|4|4|1746286-1746863|CRISPRCasFinder GGAAACACCCCCACAGACGTGGGGAAGAC CAAAGGCCTGCCCCATCCAAGGGATAAACTCC AGAAACACCCCCACAGACGTGGGGAAGAC CGGCATAGGTGTTTTTTGCCTCGCTCTCTTTG AGAAACACCCCCACAGACGTGGGGAAGAC GCATCATGAATGTCATAGCAGACTCTGACATA AGAAACACCCCCACAGACGTGGGGAAGAC GCAGCGCCCGCGCTGCTCTATCAGCCTAATCG AGAAACACCCCCACAGACGTGGGGAAGAC GCCAGCTTTAGGATGTTGCGTGAGGTGTCGGC AGAAACACCCCCACAGACGTGGGGAAGAC GAATCCAGATCTAACAACTTTGGCCACAACAT AGAAACACCCCCACAGACGTGGGGAAGAC GGTGGTATCACCATGAATCACAACGGATCTAC AGAAACACCCCCACAGACGTGGGGAAGAC TATACAAACACTTACCAACGGTCTATTTTTGT GGAAACACCCCCACAGACGTGGGGAAGAC TGCTTCATAGGTCGCCAAATCATGCAATGGCA AGAAACACCCCCACAGACGTGGGGAAGAC >NZ_CP016043|4|3|1746286-1746863|CRT GGAAACACCCCCACAGACGTGGGGAAGAC CAAAGGCCTGCCCCATCCAAGGGATAAACTCC AGAAACACCCCCACAGACGTGGGGAAGAC CGGCATAGGTGTTTTTTGCCTCGCTCTCTTTG AGAAACACCCCCACAGACGTGGGGAAGAC GCATCATGAATGTCATAGCAGACTCTGACATA AGAAACACCCCCACAGACGTGGGGAAGAC GCAGCGCCCGCGCTGCTCTATCAGCCTAATCG AGAAACACCCCCACAGACGTGGGGAAGAC GCCAGCTTTAGGATGTTGCGTGAGGTGTCGGC AGAAACACCCCCACAGACGTGGGGAAGAC GAATCCAGATCTAACAACTTTGGCCACAACAT AGAAACACCCCCACAGACGTGGGGAAGAC GGTGGTATCACCATGAATCACAACGGATCTAC AGAAACACCCCCACAGACGTGGGGAAGAC TATACAAACACTTACCAACGGTCTATTTTTGT GGAAACACCCCCACAGACGTGGGGAAGAC TGCTTCATAGGTCGCCAAATCATGCAATGGCA AGAAACACCCCCACAGACGTGGGGAAGAC 
>NZ_CP016043|4|3|1746287-1746863|PILER-CR GAAACACCCCCACAGACGTGGGGAAGAC CAAAGGCCTGCCCCATCCAAGGGATAAACTCCA GAAACACCCCCACAGACGTGGGGAAGAC CGGCATAGGTGTTTTTTGCCTCGCTCTCTTTGA GAAACACCCCCACAGACGTGGGGAAGAC GCATCATGAATGTCATAGCAGACTCTGACATAA GAAACACCCCCACAGACGTGGGGAAGAC GCAGCGCCCGCGCTGCTCTATCAGCCTAATCGA GAAACACCCCCACAGACGTGGGGAAGAC GCCAGCTTTAGGATGTTGCGTGAGGTGTCGGCA GAAACACCCCCACAGACGTGGGGAAGAC GAATCCAGATCTAACAACTTTGGCCACAACATA GAAACACCCCCACAGACGTGGGGAAGAC GGTGGTATCACCATGAATCACAACGGATCTACA GAAACACCCCCACAGACGTGGGGAAGAC TATACAAACACTTACCAACGGTCTATTTTTGTG GAAACACCCCCACAGACGTGGGGAAGAC TGCTTCATAGGTCGCCAAATCATGCAATGGCAA GAAACACCCCCACAGACGTGGGGAAGAC
>NZ_CP016043.1|WP_070244881.1|1745013_1745556_+|DUF2514-family-protein MFNNLWKPLALIALVALLLWGVSTWRYASGYAAGKRLAEQAWQLKWETRNRDEETARANRERGERAEEQRRWQAMIKVKQNADQQLEQIKADAARSTADVERLRRTLSQLRQQLADRSPCRVSTAGGASSASAAAGFLFADVLGESLQRNAALAAYADRARAAGLACERLYDAVTQSRAQ >NZ_CP016043.1|WP_070244880.1|1743932_1744484_+|glycoside-hydrolase-family-108-protein MALTKDQIFDALLGREGGYVDHPHDKGGPTKWGITEKVARAHGYTGDMRNLTRAQALKIYESDYWSGPRFDQVAELSARVAAELCDTGVNMGTSVPSKWLQRWLTAFNDGERLYPDISADGVIGPRTLSALRAYLDARGEEGEQVLLRALNCSQGDRYLALAEQRVQNESFLYGWVRERVTLS >NZ_CP016043.1|WP_070244879.1|1743591_1743930_+|phage-holin,-lambda-family MHHNPGSWLEWKELLWGWWQGETPVGGVLLAILTAAVRVTYLGGGWKQTALEGALCGALTLTVVATLDYFNLPKSLTPAIGGAIGFIGVQQVQHFALYILHRKLGLPTDKER >NZ_CP016043.1|WP_167352257.1|1743299_1743470_+|hypothetical-protein MKKQVAKSEVRFDTQKAFAGMGAAVELLMRAAPNVLEHKVSGPEKQGKARMRKAAA >NZ_CP016043.1|WP_070244878.1|1742567_1743122_+|hypothetical-protein MANGSYGLNLEEIGQSVRNNLQLIIESQGLPLAVGPLTDEDFRILSGGFGELEWDYALTKYGNDPNKFEFCIKLVKQVTETVPSGVALCVYGIDDRVFRIHMIERFCRDDESHPLKGRMVALAIMAAFIFCKAVDAIDVFIMEPVAELVDYYHSFGFVEHESCSYVLRASVNELVSAFEMFAQK >NZ_CP016043.1|WP_083275006.1|1741809_1742289_+|DUF968-domain-containing-protein MSALPSQIRAVCLVAFGNEALRDSDKVWAWESPHLQFIEVNMVEGIIMLVGDEAPLAGCMLRPKLLRWESSKYTRWVKTQPCCGCGNPADDPHHIINSGLGLGGIGTKTHDLFVIPLCRRCHDELHHDVGGWEQRNGSQLVLLVQFLNRALGIGAIIKA >NZ_CP016043.1|WP_083275066.1|1740983_1741880_+|transcriptional-regulator MILLKNILNQLQSKQHKNVIQKIERLDCSPEFASAEFSAHVKNIQAGAVNRDSKYYEMTKDGFVFLVMGFTGKKAAAFKEAYIAEFNRMEAMLRQPHSLPTVHLTIEQQGTLKALVKSRVDALPQNKRAKAAISLWSALKSHFGVSYKAIAADQFTDALSLVARLTLDGEALVPLTNRSRYHFPLECADPHDRGLANAWMTPRVILDIRNRAPELELLEALEQDGHDITGAKIRIHAMYDITGQFVAMQKELATVRSYLSTLNDMLKGRSEERGLNVCFAEPNKGRLFGGFRERGFTR >NZ_CP016043.1|WP_070244877.1|1740285_1740561_+|PerC-family-transcriptional-regulator MRGKPIVWRHHNLRQIQEHMLNAERCEQRGLWRRAGHEWMQVIEHCTDDVLVEYAVQQRNYCAQMGVFGSASIDPRMVAANQCDEPLQDEG >NZ_CP016043.1|WP_070244876.1|1739561_1740296_+|DNA-replication-protein 
MKNLVSAVQRRDAAALSRMAGQPLQERVVNGNAEKLVDVLFENLLLLFPASRNTVFAAPDEVAAMKRQWITAFAEGGITTLEQVKAGVSMARQHGGDFWPSCGRFMEWCREGVRSAGGLPSDDEVLAEFHRYARDKARFASPEAFDWAHPVMYWVVLDVRQRMYRYNYTEAEVLRAIKAQMQRWERNIRAGQRIPTPVKQLVHVQRPPAIADQLDPTGGAGFYQVGVAFLEQIRQRLRGGEHEG >NZ_CP016043.1|WP_070244875.1|1738461_1739565_+|replication-protein-O MSSLIQILDRPIAYNPALAKLRAGKVKAGPVAAVFLSQMIYWHNRMGGEWMYKTQADITTETALTRDEQETARKRLVALGVLDEARRGVPATLHYRINVARLEALLLEAATPVATPAPTAKTRTRDIQNSEPSQPGPAHSDQSRMVQSQNVETPQSGLVQPRKLDCGDAANKNVETPQTSMGEPTEQVCGDPANFHTGDYTENTQENKKPSCPDAAQPDEPDSDHDFLSRHPEAVVFSAKKRLWGRQEDLTCAEWIWGRIVRLYELAAEDDGEVVRPKAPNWTVWANEVRLMCHQDGRTHRQICELFGRVNRDPFWCRNVLSPAKLREKWDELVIRLGAPGAGAQDRSLKTLLGADWNTEQGWESVL >NZ_CP016043.1|WP_156774565.1|1746957_1748108_-|IS3-family-transposase MTKSVSTSKKPRKQHAPEFRNEALKLAERIGVAAAARELSLYESQLYNWRTQQQQQLSSSDRENELAAENARLKRQLAERDEELVIPPKGSDILCEAPEMKYVFIEKHQAEFSVKIMCRVLRVARSGWYAWRLRRHQLNRRQQFRLVCDAAVRQAFSDAKQRYGTPLLADELPRYNIKTSATSLHRQGLRAKAARKFSPVSYREHGLPVSGNLLKQDFTASGPNQKWAGDITYLRTDGGWLYLAVVIDLWSLAVIGWSMSSRMTAQLACDALQMALWRRKRPENVIVHTDRGGQYCSADYQSLLKRHNLHGSMSAKGCCYDNACVESFFHSLKVECIHGERFIRREIMRTTVFNYIECDYNRWRRHSAGGGLSPEQFENENLA >NZ_CP016043.1|WP_156774566.1|1749440_1750736_+|hypothetical-protein MMGLYLDLDKNDVAELGRCNDFFVRDVKPIAERVENIRLYKKENIEINEYDLLSYHCYVYWVRFYALYVDRVDELDRGTRYNQSVLGEKLIFSREQYKNDALGFLSNLCRVLYEYNFITGGLESNNNRTIGRSDLDNLADKYHHRSAESQQFAWIRDVMPILIAQYIVTQPNFIDAIKMADDVKKQVDEIEVRITTKLNRSFFTIENEKKEIKIHVDSAKKEINDHLDSKMAAVREIEGKILAAREHIESDNKNIDRLKEIISNYRSEFNFVGLSQAFEKIRKIKRRGFIYATLCYIVLGVAMLAAPVGAFWLHLTTPSFFSQGLSGLLSLLPLATIELIFFYFFRLSYIEVKSLKIQILQIDVRLSLCAFIHSYMDFRKMNGGDISELLKCFDTMIFSPIQANEGNIPSMFDGSEAIANFLSKVVTGKGQ >NZ_CP016043.1|WP_070244885.1|1750857_1751676_+|hypothetical-protein MPIAIEQLIKMFDPRSVSAECLHLIRAVPGITREQILGAFAAVAQRHPLGFDLLLARYREDRQAEQRARRAAADRVCRSPHPPYGTAVCQLAVTVALGRALPAQQVVLAALLRKHGPRATLAAKQLADIQRQQKGLEKARVTLSEDDWRYRRNLAQYDALAGRSVALRRALADWADAEAARSPHCPRCRGSGQLLRPQPHCCDTCGGRGKISVTAEHFRRSLVGEGMVITPERWRAEYQPWVNDTLNGLYQEMQLAGDALSIRLALEGQAVA 
>NZ_CP016043.1|WP_070244886.1|1751760_1752042_-|type-II-toxin-antitoxin-system-RelE/ParE-family-toxin MTYKLSFEKRALKEWKKLAPPIQSQLKKKLIERLENPHVPAARLSGRANRYKIKLRASGYRLVYEVNDSEIILLVIAIGKRADNEVYQTADSR >NZ_CP016043.1|WP_070244887.1|1752031_1752283_-|type-II-toxin-antitoxin-system-Phd/YefM-family-antitoxin MAYQILTTTAASITDLKRNPMGTVAEGDGNAVAILNRNEPAFYCVPPELYAYYLELAEDAALNRIADERLEDAEFVSVSIDDL >NZ_CP016043.1|WP_024522083.1|1760943_1762458_+|cyclic-diguanylate-phosphodiesterase MSQEGIKKIITKKLLVAFASGVSVLVILTLCLLIFSIKSLYKDTTLKVNFARQHIDGILDHAKNAAQSTSHLLGHACSERAINALIHQVTLTPNVRSIELFSKNGGYCTSLYKEVSGIDREKIEKVSGLYLLAGDEATPSLPVLFYNDKLAQGAVLVGVDGYFIANTLRVINTFPSVYFAVGGEILSADGRVTPRFQRIPEGYHAITSDYGYTIIYILTKHTILANLTENYMLGIYLSLLLAVVAMLGVFLRLNRPLSITELIRNGLRNNEFVPYIQPIIDLQTNSVTGGEILIRWNRPGIGIIPPNQFIPSAEDSGLIVPMTRQLILDTREALRGRLSQPVHIGFNISQKYLQHRSIVADCEQFLEAFAGHQLELTLELVERDEIAAKREVKANFERLKGLGVTFALDDFGTGYSTYSYLQKFHVDYIKIDKSFIQMIGLDEISSHIVNNVIELAGSLHLKIIAEGVETAQQEAYLKAHDVLYLQGYRYSRPIPLETFIQRYL >NZ_CP016043.1|WP_083275068.1|1762505_1762949_+|hypothetical-protein MKIKVNDIAFPLRRMLLLGVSWSLAALTQAQGDANAALRSPQAGVLCDRYFCADAQGVSRSLTVRYLGLRAAQRVFSPGAFDHTAFTFANGIFCDTRARLCWEDRYYGSDGKHSAAISARYTALLFPPTPSRAKVATGEVSPDKSGH >NZ_CP016043.1|WP_070244888.1|1763046_1763772_-|hypothetical-protein MQRYSFIALALLSCASLSPVYATADNSITRHALQFAKGQSATSVHGSIKGSEVIDYTLIAAQGQQMDVTLKGGNATYFNLLAPGSHAEALFNGAIAGDRFQGALPAKGQYTVRLYQMGAAKDTTTAHPFTLVISIKGDAARDATPPHSASGTLPCAQHSGQPMGQCPFRVMRQANGDATLTLTLPDQRQRTLFFSHGKPLSADLSQADGDMRFTWQQQDDLLLIRCGQERYEIPSAAITGG >NZ_CP016043.1|WP_024522080.1|1763953_1764418_-|hypothetical-protein MLTAIPMHGARIAGHLARAPQLAFFNTNGEEVARYANPAASEQCSGKKQLLALLRQGQIRRLVVRNVGQHMAQRLLALGIEIRLAHGGEWQAAYCQEECDLARLSDASQARPPRKPHHTAHSCGCGGTAQVTTAATRLSPRQGGVPHIIRCRQG >NZ_CP016043.1|WP_024522079.1|1764430_1764709_-|DUF134-domain-containing-protein MPRPKIPRRICSHPQHRCFKPNGIPLPQLEQVLLARDEFEALRLVDREGLQQQQAAAEMGVSRQTLANILKRARFKLLDCLSNGKALMIDES |
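The spacer records listed for NZ_CP016043_4 appear to follow the pattern `>spacer_id|start|length|contig|tools SEQUENCE`. A minimal sketch of reading such records is given here, assuming that field order and assuming the end coordinate is inclusive (`start + length - 1`, which agrees with the `Spacer_region` values reported for spacer 4.4). The `Spacer` class and `parse_spacers` function are hypothetical helpers, not part of any tool named on this page.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Spacer:
    spacer_id: str    # e.g. "4.4" (array number, then spacer index)
    start: int        # start coordinate on the contig, as listed
    length: int       # spacer length in nucleotides, as listed
    contig: str       # contig accession
    tools: List[str]  # prediction tools that reported this spacer
    sequence: str

    @property
    def end(self) -> int:
        # Assumption: inclusive end coordinate.
        return self.start + self.length - 1


def parse_spacers(text: str) -> List[Spacer]:
    """Parse '>id|start|length|contig|tools SEQUENCE' records from flat text."""
    spacers = []
    for record in text.split(">"):
        parts = record.split()
        if len(parts) < 2:
            continue  # skip empty fragments
        header, sequence = parts[0], parts[1]
        spacer_id, start, length, contig, tools = header.split("|")
        spacers.append(
            Spacer(spacer_id, int(start), int(length), contig,
                   tools.split(","), sequence)
        )
    return spacers


# Two records copied from the spacer list of NZ_CP016043_4.
example = (
    ">4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT "
    "GCAGCGCCCGCGCTGCTCTATCAGCCTAATCG "
    ">4.13|1746498|33|NZ_CP016043|PILER-CR "
    "GCAGCGCCCGCGCTGCTCTATCAGCCTAATCGA |"
)
parsed = parse_spacers(example)
for s in parsed:
    # The listed length should equal the actual sequence length.
    print(s.spacer_id, f"{s.start}-{s.end}", len(s.sequence) == s.length, s.tools)
```

The two records describe the same locus: the PILER-CR call is one nucleotide longer than the CRISPRCasFinder/CRT call, which is why both appear as separate spacer entries.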
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP016043_3 | 3.1|1740615|33|NZ_CP016043|CRISPRCasFinder | 1740615-1740647 | 33 | NC_026611 | Edwardsiella phage GF-2 DNA, complete sequence | 41783-41815 | 0 | 1.0 |
NZ_CP016043_3 | 3.1|1740615|33|NZ_CP016043|CRISPRCasFinder | 1740615-1740647 | 33 | MH898687 | Edwardsiella phage Edno5, complete genome | 39330-39362 | 0 | 1.0 |
NZ_CP016043_3 | 3.2|1740676|33|NZ_CP016043|CRISPRCasFinder,PILER-CR | 1740676-1740708 | 33 | NC_026611 | Edwardsiella phage GF-2 DNA, complete sequence | 39212-39244 | 2 | 0.939 |
NZ_CP016043_3 | 3.7|1740615|37|NZ_CP016043|CRT | 1740615-1740651 | 37 | NC_026611 | Edwardsiella phage GF-2 DNA, complete sequence | 41779-41815 | 3 | 0.919 |
NZ_CP016043_3 | 3.7|1740615|37|NZ_CP016043|CRT | 1740615-1740651 | 37 | MH898687 | Edwardsiella phage Edno5, complete genome | 39326-39362 | 3 | 0.919 |
NZ_CP016043_3 | 3.6|1740920|33|NZ_CP016043|CRISPRCasFinder,PILER-CR | 1740920-1740952 | 33 | KC139516 | Salmonella phage FSL SP-016, partial genome | 43535-43567 | 5 | 0.848 |
NZ_CP016043_3 | 3.8|1740676|37|NZ_CP016043|CRT | 1740676-1740712 | 37 | NC_026611 | Edwardsiella phage GF-2 DNA, complete sequence | 39208-39244 | 6 | 0.838 |
NZ_CP016043_3 | 3.4|1740798|33|NZ_CP016043|CRISPRCasFinder,PILER-CR | 1740798-1740830 | 33 | NC_011880 | Cyanothece sp. PCC 7425 plasmid pP742501, complete sequence | 145824-145856 | 8 | 0.758 |
NZ_CP016043_1 | 1.2|873578|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT | 873578-873610 | 33 | NZ_CP022081 | Burkholderia cepacia strain FDAARGOS_345 plasmid unnamed1, complete sequence | 11227-11259 | 9 | 0.727 |
NZ_CP016043_1 | 1.2|873578|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT | 873578-873610 | 33 | NZ_CP023519 | Burkholderia cepacia strain FDAARGOS_388 plasmid unnamed1, complete sequence | 109783-109815 | 9 | 0.727 |
NZ_CP016043_1 | 1.2|873578|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT | 873578-873610 | 33 | NZ_CP012984 | Burkholderia cepacia ATCC 25416 strain UCB 717 plasmid pBC25416 | 162232-162264 | 9 | 0.727 |
NZ_CP016043_1 | 1.2|873578|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT | 873578-873610 | 33 | NC_008545 | Burkholderia cenocepacia HI2424 plasmid unnamed1, complete sequence | 62979-63011 | 9 | 0.727 |
NZ_CP016043_1 | 1.2|873578|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT | 873578-873610 | 33 | NZ_CP034556 | Burkholderia cepacia ATCC 25416 plasmid unnamed1, complete sequence | 61242-61274 | 9 | 0.727 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | NZ_CP041049 | Citrobacter sp. CF971 plasmid pBM527-3, complete sequence | 12881-12912 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | NZ_KX863568 | Citrobacter freundii strain AtetA plasmid pLNU-11, complete sequence | 4297-4328 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | NZ_KJ541068 | Serratia marcescens strain A4Y426 plasmid pG5A4Y426, complete sequence | 4340-4371 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | NZ_KJ541070 | Escherichia coli strain A4Y413 plasmid pG5A4Y413, complete sequence | 4277-4308 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | NZ_KJ541071 | Escherichia coli strain A4Y217 plasmid pG5A4Y217, complete sequence | 4277-4308 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | NZ_KJ541069 | Serratia marcescens strain A4Y201 plasmid pG5A4Y201, complete sequence | 4340-4371 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | NZ_LN832561 | Paracoccus aminovorans isolate JCM7685 plasmid III, complete sequence | 2714-2745 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | NZ_LR130553 | Escherichia coli strain MS14386 isolate MS14386 plasmid 2 | 24449-24480 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | NZ_CP016184 | Escherichia coli strain EC2 plasmid pEC2-4, complete sequence | 23639-23670 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | NZ_CP016183 | Escherichia coli strain EC2_1 plasmid pEC2_1-4, complete sequence | 138969-139000 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | NZ_CP047573 | Escherichia coli strain 2EC1 plasmid p2EC1-2, complete sequence | 36066-36097 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | LC542613 | Klebsiella quasipneumoniae subsp. similipneumoniae MS2H7 plasmid pMS2H7VEB-1 DNA, complete sequence | 1367-1398 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | LC542924 | Klebsiella pneumoniae MS2H5 plasmid pMS2H5VEB-1 DNA, complete sequence | 1367-1398 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | NZ_MK731977 | Escherichia coli strain ENV103 plasmid pSGMCR103, complete sequence | 2787-2818 | 9 | 0.719 |
NZ_CP016043_4 | 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT | 1746498-1746529 | 32 | MN945901 | Mycobacterium phage Ximenita, complete genome | 36904-36935 | 9 | 0.719 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | NZ_CP041049 | Citrobacter sp. CF971 plasmid pBM527-3, complete sequence | 12881-12913 | 10 | 0.697 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | NZ_KX863568 | Citrobacter freundii strain AtetA plasmid pLNU-11, complete sequence | 4296-4328 | 10 | 0.697 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | NZ_KJ541068 | Serratia marcescens strain A4Y426 plasmid pG5A4Y426, complete sequence | 4339-4371 | 10 | 0.697 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | NZ_KJ541070 | Escherichia coli strain A4Y413 plasmid pG5A4Y413, complete sequence | 4276-4308 | 10 | 0.697 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | NZ_KJ541071 | Escherichia coli strain A4Y217 plasmid pG5A4Y217, complete sequence | 4276-4308 | 10 | 0.697 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | NZ_KJ541069 | Serratia marcescens strain A4Y201 plasmid pG5A4Y201, complete sequence | 4339-4371 | 10 | 0.697 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | NZ_LN832561 | Paracoccus aminovorans isolate JCM7685 plasmid III, complete sequence | 2714-2746 | 10 | 0.697 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | NZ_LR130553 | Escherichia coli strain MS14386 isolate MS14386 plasmid 2 | 24448-24480 | 10 | 0.697 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | NZ_CP016184 | Escherichia coli strain EC2 plasmid pEC2-4, complete sequence | 23639-23671 | 10 | 0.697 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | NZ_CP016183 | Escherichia coli strain EC2_1 plasmid pEC2_1-4, complete sequence | 138968-139000 | 10 | 0.697 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | NZ_CP047573 | Escherichia coli strain 2EC1 plasmid p2EC1-2, complete sequence | 36065-36097 | 10 | 0.697 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | LC542613 | Klebsiella quasipneumoniae subsp. similipneumoniae MS2H7 plasmid pMS2H7VEB-1 DNA, complete sequence | 1367-1399 | 10 | 0.697 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | LC542924 | Klebsiella pneumoniae MS2H5 plasmid pMS2H5VEB-1 DNA, complete sequence | 1367-1399 | 10 | 0.697 |
NZ_CP016043_4 | 4.13|1746498|33|NZ_CP016043|PILER-CR | 1746498-1746530 | 33 | NZ_MK731977 | Escherichia coli strain ENV103 plasmid pSGMCR103, complete sequence | 2786-2818 | 10 | 0.697 |
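The `Identity` column in the table above is consistent with `(Spacer_length - Mismatch) / Spacer_length`, rounded to three decimals. The sketch below checks that relationship against the distinct (length, mismatch, identity) combinations that occur in the table. The function name `identity` is a hypothetical helper, and the formula is inferred from the reported values rather than taken from the database's documentation.

```python
def identity(spacer_length: int, mismatches: int) -> float:
    """Fraction of spacer positions that match the protospacer (3 decimals)."""
    return round((spacer_length - mismatches) / spacer_length, 3)


# (Spacer_length, Mismatch, Identity) combinations copied from the table.
reported = [
    (33, 0, 1.0),
    (33, 2, 0.939),
    (37, 3, 0.919),
    (33, 5, 0.848),
    (37, 6, 0.838),
    (33, 8, 0.758),
    (33, 9, 0.727),
    (32, 9, 0.719),
    (33, 10, 0.697),
]

for length, mismatches, expected in reported:
    value = identity(length, mismatches)
    print(length, mismatches, value, value == expected)
```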
1. spacer 3.1|1740615|33|NZ_CP016043|CRISPRCasFinder matches to NC_026611 (Edwardsiella phage GF-2 DNA, complete sequence) position: , mismatch: 0, identity: 1.0
cgcttcacagagtagctggccgacttcggtcta CRISPR spacer cgcttcacagagtagctggccgacttcggtcta Protospacer *********************************
2. spacer 3.1|1740615|33|NZ_CP016043|CRISPRCasFinder matches to MH898687 (Edwardsiella phage Edno5, complete genome) position: , mismatch: 0, identity: 1.0
cgcttcacagagtagctggccgacttcggtcta CRISPR spacer cgcttcacagagtagctggccgacttcggtcta Protospacer *********************************
3. spacer 3.2|1740676|33|NZ_CP016043|CRISPRCasFinder,PILER-CR matches to NC_026611 (Edwardsiella phage GF-2 DNA, complete sequence) position: , mismatch: 2, identity: 0.939
tcatgtcatgaacgttcatgcggcttttccttg CRISPR spacer tcatgtcatgcacgttcatgcggcttttcctta Protospacer ********** *********************.
4. spacer 3.7|1740615|37|NZ_CP016043|CRT matches to NC_026611 (Edwardsiella phage GF-2 DNA, complete sequence) position: , mismatch: 3, identity: 0.919
cgcttcacagagtagctggccgacttcggtctagaac CRISPR spacer cgcttcacagagtagctggccgacttcggtctattcc Protospacer ********************************* *
5. spacer 3.7|1740615|37|NZ_CP016043|CRT matches to MH898687 (Edwardsiella phage Edno5, complete genome) position: , mismatch: 3, identity: 0.919
cgcttcacagagtagctggccgacttcggtctagaac CRISPR spacer cgcttcacagagtagctggccgacttcggtctattcc Protospacer ********************************* *
6. spacer 3.6|1740920|33|NZ_CP016043|CRISPRCasFinder,PILER-CR matches to KC139516 (Salmonella phage FSL SP-016, partial genome) position: , mismatch: 5, identity: 0.848
gctgtggtgctgctggcaatggcatccagtgag CRISPR spacer gttctggtgctgctggcagcggcatccagtgcg Protospacer *.* **************..*********** *
7. spacer 3.8|1740676|37|NZ_CP016043|CRT matches to NC_026611 (Edwardsiella phage GF-2 DNA, complete sequence) position: , mismatch: 6, identity: 0.838
tcatgtcatgaacgttcatgcggcttttccttggaaa CRISPR spacer tcatgtcatgcacgttcatgcggcttttccttattgg Protospacer ********** *********************. ..
8. spacer 3.4|1740798|33|NZ_CP016043|CRISPRCasFinder,PILER-CR matches to NC_011880 (Cyanothece sp. PCC 7425 plasmid pP742501, complete sequence) position: , mismatch: 8, identity: 0.758
actgcctgcactggcaagagctacgtgtactga CRISPR spacer actgcctgcactggcaaaacctacggttggcaa Protospacer *****************.* ***** *. ..*
9. spacer 1.2|873578|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP022081 (Burkholderia cepacia strain FDAARGOS_345 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.727
tggcatcctgcccatagcgccctcgggtatccg CRISPR spacer tggcatcgtgcccatagcgccatcggcggcgtt Protospacer ******* ************* **** .. .
10. spacer 1.2|873578|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP023519 (Burkholderia cepacia strain FDAARGOS_388 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.727
tggcatcctgcccatagcgccctcgggtatccg CRISPR spacer tggcatcgtgcccatagcgccatcggcggcgtt Protospacer ******* ************* **** .. .
11. spacer 1.2|873578|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012984 (Burkholderia cepacia ATCC 25416 strain UCB 717 plasmid pBC25416) position: , mismatch: 9, identity: 0.727
tggcatcctgcccatagcgccctcgggtatccg CRISPR spacer tggcatcgtgcccatagcgccatcggcggcgtt Protospacer ******* ************* **** .. .
12. spacer 1.2|873578|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT matches to NC_008545 (Burkholderia cenocepacia HI2424 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.727
tggcatcctgcccatagcgccctcgggtatccg CRISPR spacer tggcatcgtgcccatagcgccatcggcggcgtt Protospacer ******* ************* **** .. .
13. spacer 1.2|873578|33|NZ_CP016043|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP034556 (Burkholderia cepacia ATCC 25416 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.727
tggcatcctgcccatagcgccctcgggtatccg CRISPR spacer tggcatcgtgcccatagcgccatcggcggcgtt Protospacer ******* ************* **** .. .
14. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to NZ_CP041049 (Citrobacter sp. CF971 plasmid pBM527-3, complete sequence) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
15. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to NZ_KX863568 (Citrobacter freundii strain AtetA plasmid pLNU-11, complete sequence) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
16. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to NZ_KJ541068 (Serratia marcescens strain A4Y426 plasmid pG5A4Y426, complete sequence) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
17. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to NZ_KJ541070 (Escherichia coli strain A4Y413 plasmid pG5A4Y413, complete sequence) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
18. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to NZ_KJ541071 (Escherichia coli strain A4Y217 plasmid pG5A4Y217, complete sequence) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
19. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to NZ_KJ541069 (Serratia marcescens strain A4Y201 plasmid pG5A4Y201, complete sequence) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
20. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to NZ_LN832561 (Paracoccus aminovorans isolate JCM7685 plasmid III, complete sequence) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
21. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to NZ_LR130553 (Escherichia coli strain MS14386 isolate MS14386 plasmid 2) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
22. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to NZ_CP016184 (Escherichia coli strain EC2 plasmid pEC2-4, complete sequence) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
23. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to NZ_CP016183 (Escherichia coli strain EC2_1 plasmid pEC2_1-4, complete sequence) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
24. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to NZ_CP047573 (Escherichia coli strain 2EC1 plasmid p2EC1-2, complete sequence) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
25. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to LC542613 (Klebsiella quasipneumoniae subsp. similipneumoniae MS2H7 plasmid pMS2H7VEB-1 DNA, complete sequence) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
26. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to LC542924 (Klebsiella pneumoniae MS2H5 plasmid pMS2H5VEB-1 DNA, complete sequence) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
27. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to NZ_MK731977 (Escherichia coli strain ENV103 plasmid pSGMCR103, complete sequence) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccagcgcccgcgcagctctttcagctcggcgg Protospacer ************ ***** *****..... *
28. spacer 4.4|1746498|32|NZ_CP016043|CRISPRCasFinder,CRT matches to MN945901 (Mycobacterium phage Ximenita, complete genome) position: , mismatch: 9, identity: 0.719
gcagcgcccgcgctgctctatcagcctaatcg CRISPR spacer ccggcgcccgcgctgcgcgatcagcccgacat Protospacer *.************* * *******..*.
29. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to NZ_CP041049 (Citrobacter sp. CF971 plasmid pBM527-3, complete sequence) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
30. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to NZ_KX863568 (Citrobacter freundii strain AtetA plasmid pLNU-11, complete sequence) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
31. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to NZ_KJ541068 (Serratia marcescens strain A4Y426 plasmid pG5A4Y426, complete sequence) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
32. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to NZ_KJ541070 (Escherichia coli strain A4Y413 plasmid pG5A4Y413, complete sequence) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
33. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to NZ_KJ541071 (Escherichia coli strain A4Y217 plasmid pG5A4Y217, complete sequence) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
34. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to NZ_KJ541069 (Serratia marcescens strain A4Y201 plasmid pG5A4Y201, complete sequence) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
35. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to NZ_LN832561 (Paracoccus aminovorans isolate JCM7685 plasmid III, complete sequence) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
36. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to NZ_LR130553 (Escherichia coli strain MS14386 isolate MS14386 plasmid 2) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
37. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to NZ_CP016184 (Escherichia coli strain EC2 plasmid pEC2-4, complete sequence) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
38. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to NZ_CP016183 (Escherichia coli strain EC2_1 plasmid pEC2_1-4, complete sequence) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
39. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to NZ_CP047573 (Escherichia coli strain 2EC1 plasmid p2EC1-2, complete sequence) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
40. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to LC542613 (Klebsiella quasipneumoniae subsp. similipneumoniae MS2H7 plasmid pMS2H7VEB-1 DNA, complete sequence) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
41. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to LC542924 (Klebsiella pneumoniae MS2H5 plasmid pMS2H5VEB-1 DNA, complete sequence) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
42. spacer 4.13|1746498|33|NZ_CP016043|PILER-CR matches to NZ_MK731977 (Escherichia coli strain ENV103 plasmid pSGMCR103, complete sequence) position: , mismatch: 10, identity: 0.697
gcagcgcccgcgctgctctatcagcctaatcga CRISPR spacer ccagcgcccgcgcagctctttcagctcggcggc Protospacer ************ ***** *****..... *
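Each alignment above is an ungapped, position-by-position comparison of a spacer with its protospacer, followed by a marker line. The markers look like `*` for a match, `.` for a transition (purine to purine, or pyrimidine to pyrimidine) and a blank for a transversion. That reading is inferred from the listed alignments and is an assumption, as are the helper names `match_line` and `count_mismatches`. Runs of blanks in the marker lines seem to have been collapsed in the text above, so the regenerated line can be longer than the one shown.

```python
PURINES = {"a", "g"}
PYRIMIDINES = {"c", "t"}


def match_line(spacer: str, protospacer: str) -> str:
    """Return one marker per aligned position: '*', '.' or ' '."""
    if len(spacer) != len(protospacer):
        raise ValueError("ungapped comparison needs equal-length sequences")
    markers = []
    for s, p in zip(spacer.lower(), protospacer.lower()):
        if s == p:
            markers.append("*")          # identical bases
        elif {s, p} <= PURINES or {s, p} <= PYRIMIDINES:
            markers.append(".")          # transition (assumed meaning of '.')
        else:
            markers.append(" ")          # transversion (assumed meaning of ' ')
    return "".join(markers)


def count_mismatches(spacer: str, protospacer: str) -> int:
    """Number of positions where the two sequences differ."""
    return sum(marker != "*" for marker in match_line(spacer, protospacer))


# Alignment 3: spacer 3.2 against Edwardsiella phage GF-2 (NC_026611).
spacer = "tcatgtcatgaacgttcatgcggcttttccttg"
protospacer = "tcatgtcatgcacgttcatgcggcttttcctta"

print(spacer, "CRISPR spacer")
print(protospacer, "Protospacer")
print(match_line(spacer, protospacer))
print("mismatches:", count_mismatches(spacer, protospacer))
```

For this pair the count agrees with the reported `mismatch: 2`.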
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1736556 : 1748108
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP016043|1736556:1748108|DBSCAN-SWA ATCAGGGGATTGCGATTCGGGCTTCAACGACAACACCGATGATTTCATAATTATCGTTTATCTCAACCATGGAATACATCGTATTTAAAGGTTTGAGGTATTTCTTACCAACGTCGAGGATATACCGTTTGAAGATAGCCTCATCGTCAGAGAGGGAAGCGGCGACCACAAGTTTCCCATTCGTTGGGTTTCTATCTGGATCGACGAGGATAGCGGTACCTTCGGGAATGCTAAGGCCCGCTGTGGCTGTCATGCTGTCGCCTTCTACCGTTAACCAGAATGAAGCGGCTGAGCAGTCAACTGTCGTGGAGTACCAATGCTCAGCCTTCTCTCGTGTATAGGAGCGAATGGCGAGGCTTCCGTGATTGACTAGCGTCCAACTGAGTAGGGGATACTTTCCCTTGGGGTGTAGCGCGACAGGAATAGCGAGAGGTGTGTCGGCAGGGGTACCATTGACTAAGTATTCTGCCGGGCATAACAACTCCCGAGCGAGCGCCAATAGCGCATCCCCTCTTGGTGCGGTTTCATCCCGTTCCCATTGGGAGATGGCAACGTTAGATACGCCGACAGCCTTGCTGAGCGTAACCTGGCTGAGCTTCAGCTCTTTCCTCCGCTTTAGGATGCGGGAACCCATTGTAGTCATCGTCATGTTAAGCGATCTTAAACGTTATTGACTGAAGATTCCTTATAGCTTATAAGTGAAGAAGACTTTAATATGGAGGCACCACAGATGCTGAAGCAGACCGTGGTGAAGCACTTCGGTAGCCAGCGTGCTGTGGCCCAGGCGCTCCAAGTGAGCGACTCGGCTGTCTCTCAGTGGAAGACATTGATTCCAGAGAGAGCGGCATTGAAGCTGCACAGAATAACAGCTGGGAAGCTGAAGTATTCACCTTGTTTTTACCAAAAGTCTTCTTGATTCACCGCATCACCGCTCTTTAACCAACGAGCCACCCGGCTCAATGGAAGCCGCCGACAACGCGGATACCAACACCGAATAACCAATCTCTATTGGCTGTGGCTTTGCCACGCCTATCAAACACTGGCGCCATCAGGGCACTTAGCCTATTGGCGTGACCACCCGGGGGAAAACCCATGGGAGACATCAAAGATGCCGTAAAGGCGATGTGTGAATCGATGCCTGGCGGGCGAGCGGCGATGGCCGGCGCGCTGGGGATGTCACCAACGAGCTTTAACAATCGGCTATATGAGAAAAACGGCTGCAAGTTCTTCGATCGCCATGACTTAGAGGCGATGGAGGATCTATCGAACACCCACCATTTAGCGGACTACTTTGCGGCGCGTCGCGGCCGCATCACGGTACGAGTGCAATCGCGCGACGAGTTAGATCCCGTCGAACTGTTTACTCTGGCGACGCTGACGGCGGCGCATAAGGGGCAGGTTGATCTTGCGATTCAGCACTCCATCAGCGACGGCATTATCAACTCATCTGAAGAGCGGGACATCCTTGCGCTGCACAGCCAGTACGTGGCGGCACGCGATGCATATGTCCGCGCAATCATCGCCCTGCATAAGGCGCAGTAGCAGCGGCGATCCTGTCACGGAGGATTAGCCATGCACACACCAACTGCCGGACTACTCCGGCTACGTTGCCGCGTTCTGCCCGAGCGCGAGGATTACCGCTATGAAGTGAATGTACTGGGGCGCTGGTGGCCCTGCAACTACACGTTAGCTCGCTGGACGGTAGAGTACTGCCGTCAAGGGTGGGGAGGTATGTGATGCGCATACCTAAGCAGGGAAGCTATTACCAAGATCGCAATGGCGTGGTGGTTCGCATCACAGGTTATGAAAGGGAAAGCCAGCGGGTTCTCTACCGTCGTCCTGGCTATGAGTGGGGTTGTGCATCACCGCTGGTGGTGTTTAACGCCAAGTTCAGGAGGTACCAGGGATGAGCAGCTTGATTCA
GATATTGGATCGCCCCATCGCCTATAACCCAGCGCTGGCGAAACTGAGGGCAGGGAAGGTCAAGGCGGGCCCGGTTGCGGCCGTATTTCTCTCCCAGATGATCTACTGGCATAACCGTATGGGTGGCGAATGGATGTATAAGACCCAGGCGGATATTACGACTGAAACCGCGTTAACTCGCGATGAGCAGGAAACCGCCCGCAAGCGTTTAGTTGCGCTAGGTGTGCTGGATGAAGCGCGCCGTGGCGTTCCTGCGACCTTGCATTACCGCATCAATGTAGCACGCCTTGAGGCGTTGCTATTGGAAGCGGCTACACCCGTAGCAACGCCTGCGCCGACAGCTAAAACCAGAACGAGGGATATTCAAAATAGCGAACCCTCGCAACCCGGCCCGGCACACTCCGATCAATCCAGAATGGTGCAATCCCAGAATGTGGAAACCCCGCAATCTGGATTGGTGCAACCCCGCAAACTAGATTGCGGTGATGCCGCAAACAAGAATGTGGAAACCCCGCAAACAAGTATGGGGGAACCCACCGAACAAGTCTGTGGGGATCCCGCAAACTTTCATACAGGAGATTACACAGAGAATACACAAGAGAATAAAAAACCTTCTTGTCCGGACGCTGCGCAACCGGACGAGCCTGACAGCGATCATGATTTTTTGTCTCGTCATCCAGAGGCGGTGGTGTTCAGTGCCAAGAAGCGTCTGTGGGGCAGGCAGGAGGATCTGACCTGCGCCGAGTGGATTTGGGGGCGTATCGTCCGGCTGTACGAGCTGGCCGCTGAGGATGACGGGGAGGTCGTCAGGCCGAAGGCGCCCAACTGGACGGTTTGGGCCAACGAGGTGCGCCTGATGTGCCATCAGGATGGGCGTACTCACCGCCAGATTTGCGAGCTGTTTGGCCGAGTAAACCGTGATCCGTTCTGGTGCCGCAACGTACTGAGCCCGGCGAAGCTGCGCGAAAAATGGGATGAGCTGGTGATCCGCCTGGGGGCACCGGGAGCGGGTGCGCAGGATCGCTCACTGAAAACCCTGCTGGGGGCAGACTGGAATACGGAGCAGGGATGGGAGAGCGTGCTATGAAAAATCTGGTTTCTGCCGTACAGCGCCGGGATGCAGCCGCACTGTCCCGCATGGCGGGTCAGCCTCTTCAGGAGCGGGTGGTGAACGGTAACGCTGAAAAGCTGGTGGATGTGCTGTTTGAAAATCTGCTGCTGTTGTTCCCGGCGTCGCGTAATACGGTGTTTGCCGCGCCGGATGAGGTGGCGGCGATGAAGCGCCAGTGGATCACGGCGTTCGCGGAAGGGGGGATTACGACGCTGGAGCAGGTCAAGGCGGGTGTCAGCATGGCCCGCCAGCACGGTGGGGATTTCTGGCCCTCCTGCGGGCGCTTCATGGAATGGTGTCGTGAGGGTGTACGGAGCGCAGGTGGATTGCCCTCTGACGATGAGGTGCTGGCTGAGTTCCACCGTTATGCTCGCGATAAGGCCCGCTTCGCCTCTCCGGAGGCGTTCGACTGGGCACACCCGGTGATGTACTGGGTGGTGCTGGATGTGCGCCAGCGCATGTACCGTTACAACTACACCGAGGCCGAGGTACTGCGGGCGATCAAGGCGCAGATGCAACGTTGGGAACGGAATATCCGTGCTGGCCAGCGTATCCCGACACCGGTGAAGCAGCTGGTGCATGTGCAGCGTCCACCCGCGATAGCCGATCAGCTTGATCCCACCGGGGGGGCCGGATTTTATCAGGTCGGTGTGGCGTTTCTGGAGCAGATCCGCCAGCGGTTACGGGGAGGTGAGCATGAGGGGTAAGCCCATCGTATGGCGTCACCACAACTTGCGCCAGATCCAGGAGCATATGCTCAATGCCGAACGGTGCGAGCAGCGAGGCTTATGGCGTCGCGCCGGCCATGAGTGGATGCAGGTTATCGAGCATTGCACCGATGACGTGCTGGTGGAGTACGCTGTCCAGCAGCGTAATTACTGCGCACAGA
TGGGGGTGTTCGGGAGCGCAAGTATCGATCCGCGAATGGTGGCGGCTAACCAGTGCGACGAGCCTTTGCAGGATGAGGGGTAGAGGGGGCGTATTGTAATCTGAGCCGGAAGCACCCTCACGGGCGTGGGGAAGACCGCTTCACAGAGTAGCTGGCCGACTTCGGTCTAGAACCCCCCCCACAGACGTGGGGAAGACTCATGTCATGAACGTTCATGCGGCTTTTCCTTGGAAACACCCCCACAGGCGTGGGGAAGACTTTTACCTCCTTCGCAAATTTGTGTCCCCAGCGGAAACACCCCCACAGGCGTGGGGAAGACACTGCCTGCACTGGCAAGAGCTACGTGTACTGAGAAACACCCCCACAGGCGTGGGGAAGACGCATCGTCCAGGCGACGACCACGCGCAAAGGGAGAAACACCCCCACAGGCGTGGGGAAGACGCTGTGGTGCTGCTGGCAATGGCATCCAGTGAGGAAACACCCCCACAGGCGTGGGGAAGACCCCATTATATTGTTAAAGAACATACTAAACCAGCTACAGAGTAAGCAGCACAAGAACGTTATCCAGAAGATTGAAAGGCTGGATTGCTCCCCGGAGTTTGCGTCGGCTGAATTTTCAGCCCACGTGAAAAACATCCAGGCTGGCGCGGTAAACCGGGACTCCAAATACTACGAAATGACCAAAGACGGTTTTGTCTTTCTGGTCATGGGCTTTACCGGTAAGAAGGCCGCTGCGTTCAAAGAGGCCTATATCGCCGAGTTCAACCGCATGGAGGCGATGCTTCGTCAGCCTCATTCGCTCCCTACGGTTCATCTGACTATCGAGCAGCAAGGCACGCTCAAGGCGTTGGTCAAATCTCGGGTCGATGCCTTGCCACAGAATAAACGCGCTAAGGCGGCCATCAGTCTGTGGTCGGCGCTCAAGTCGCACTTCGGTGTAAGCTACAAGGCGATCGCTGCTGATCAGTTCACCGATGCGCTCTCTCTGGTTGCTCGACTAACGTTGGATGGGGAGGCGCTGGTGCCGCTCACCAATCGCTCCCGCTACCATTTCCCTCTGGAGTGCGCCGACCCACATGATCGTGGTTTGGCCAATGCTTGGATGACGCCGCGCGTTATTCTGGATATTCGTAATCGAGCCCCGGAACTAGAGCTGCTTGAGGCACTGGAGCAGGATGGCCATGACATTACCGGCGCCAAGATCCGCATCCATGCCATGTATGACATCACAGGACAGTTTGTGGCGATGCAGAAAGAGTTGGCTACGGTACGCAGCTATCTCTCAACGCTGAACGATATGCTTAAAGGGCGCAGTGAGGAGCGTGGCTTGAATGTCTGCTTTGCCGAGCCAAATAAGGGCCGTCTGTTTGGTGGCTTTCGGGAACGAGGCTTTACGAGATAGTGATAAGGTGTGGGCGTGGGAAAGCCCACACCTTCAATTTATCGAGGTTAATATGGTTGAGGGAATCATTATGCTGGTGGGCGACGAAGCGCCACTCGCCGGGTGTATGCTACGGCCCAAGCTGCTGCGCTGGGAGAGTAGTAAATACACGCGCTGGGTGAAGACACAGCCGTGTTGCGGGTGCGGAAACCCTGCCGATGACCCACATCACATCATCAATTCGGGATTGGGGTTGGGGGGCATAGGAACTAAGACGCATGATTTGTTCGTGATCCCGCTGTGCCGGCGGTGCCATGACGAGTTACATCATGATGTGGGTGGCTGGGAGCAGAGGAACGGCAGCCAGTTGGTGTTGTTGGTGCAATTTCTGAATAGGGCGTTGGGGATAGGAGCCATTATAAAAGCGTAAGGTGTGGGGTGAGTGGATGAGTGATATGTACGGATGATGAGGCCCCCGTTATTAATATACGTGACTATTTTTATATAGTTTTAGATGGAGCACTTACGCAAAGTACGCAAACAAGATAAAAATGATAACCGCGTAGACATATTGATATCTGAGGGGGGCGGCATGTTGTTATTGGCTATACGGTATACG
TTTCTGCCTGCGTATCATGGTGTATCTATATGGTCATGGCTAACGACTTCATGTAATATACTGAAAATTCAACGTGCAGATGGTGCTATATGGCAAATGGTAGCTATGGGTTAAACCTTGAAGAGATTGGGCAATCTGTCCGTAATAATCTTCAATTAATCATAGAAAGTCAGGGATTACCACTTGCTGTTGGCCCATTGACTGATGAAGATTTTCGTATACTGTCTGGTGGGTTTGGGGAGTTGGAGTGGGATTATGCGCTAACTAAGTATGGAAACGACCCGAATAAATTTGAATTTTGTATTAAGTTAGTTAAGCAAGTCACGGAAACTGTCCCATCAGGTGTTGCGCTATGTGTTTATGGCATTGATGATCGGGTGTTCCGCATTCATATGATTGAGCGATTTTGTCGCGATGATGAATCTCATCCGTTGAAAGGGCGAATGGTTGCATTAGCAATAATGGCGGCGTTTATATTTTGTAAAGCTGTCGATGCTATAGATGTATTCATCATGGAGCCTGTCGCGGAGCTAGTAGATTACTACCATTCATTTGGATTTGTTGAGCATGAATCCTGCTCCTATGTGTTGCGTGCATCTGTTAACGAACTTGTGTCTGCGTTCGAGATGTTTGCCCAAAAATGAGTGTACTCATGGTACAGAAGTGCTACGATAACCATCCGTTTGACGCACTCGTCATCGAGCGTCGTCAACACTCATCAACGTTCCTCCGTTGTTAGTAGATCTGGTCAACATGAACTAAGCTCATCAGTAGATGTCACAGTACATGAGCTGAAGAAGTGGTGTATAGGGGTAGTTAACATGAAAAAACAAGTAGCTAAATCAGAGGTGAGGTTTGACACTCAGAAGGCGTTTGCTGGTATGGGTGCGGCTGTTGAACTTTTGATGCGTGCTGCCCCAAATGTTCTCGAGCATAAGGTTTCCGGTCCTGAGAAGCAGGGGAAAGCTCGTATGCGTAAAGCCGCAGCATAAAATCCTACCTTTTTTATAAAAGCCTCGCTTGATGCGGGGCTTTTGTATTCTACCGACAACCCTGGCAAATGCCGGGTTTTTTTTATCCAAATTTCCCCAACGCGGGGTAATGAGATGGCATATGCACCATAACCCAGGAAGTTGGTTGGAGTGGAAGGAGTTGCTGTGGGGCTGGTGGCAAGGGGAGACCCCGGTAGGCGGCGTATTACTGGCCATTCTGACGGCGGCTGTCCGGGTGACCTACCTGGGCGGCGGCTGGAAGCAAACGGCGTTAGAGGGAGCGTTATGCGGTGCCCTGACACTGACCGTGGTGGCGACGCTGGACTATTTTAACCTCCCCAAGTCGCTGACCCCGGCGATCGGCGGTGCCATTGGTTTTATCGGTGTGCAGCAGGTACAGCATTTTGCCTTGTACATCTTGCACCGCAAGCTGGGACTACCGACAGACAAGGAGCGGTAATTATGGCACTCACCAAGGATCAAATTTTTGATGCCTTACTGGGGCGTGAAGGAGGCTACGTCGATCACCCTCACGACAAGGGTGGGCCGACCAAGTGGGGGATCACGGAAAAAGTCGCTCGAGCCCACGGCTATACCGGCGATATGCGCAATTTAACGCGGGCGCAGGCGCTGAAAATCTATGAAAGCGACTACTGGTCGGGACCCCGTTTTGACCAGGTGGCGGAGCTCTCTGCGCGGGTGGCTGCCGAACTGTGCGATACCGGCGTCAATATGGGGACATCGGTGCCCAGTAAGTGGCTACAGCGTTGGCTGACCGCCTTTAACGATGGCGAGCGCTTGTACCCGGATATCAGTGCCGATGGGGTGATTGGGCCACGGACATTGTCGGCGCTGCGTGCCTACCTGGATGCCCGAGGAGAAGAGGGTGAGCAGGTGCTGTTACGGGCACTCAATTGTAGCCAGGGTGATCGTTATCTGGCGCTGGCCGAGCAGCGGGTGCAGAACGAGTCGTTTCTGTATGGCTGGGTTCGGGAGCGGGTGACGCTG
TCTTAACCATCTCTGAAATATATCCAAAAGCCTCGGCTATGCCGGGGCTGTTTTATATCTGCGTATCGCAACGCATATCACCAAGAGCCTTTCAGGATGAACCTTGAGGAGCCGGCTGGCTGTCGGAGCCTTCTTGGGGCCGTCTTCCTGTGCGAACAAGGTTCATCACTAAAAGGTAAAGCCGATATGAACACATCTGTAGCGTCTTCTTATACCAATGCATCAAGTAATCATCATGTCGTTAATGAGTTCGCTGACATTGTTCCTTTATCAGCGGCCGGATCGGTGAGCGTGAAACCAATATTGTTAGTGCCAGAGCACTGCATGGTGCTTTGGGGGTTGGCCGAGACTTCACCAACTGGATTAAAGGGCGGGTTAGTCAGTATGGCTTCGTGGTCGGAGTTGACTACATCGCTGTTGAAAATTTGATCTCGCCAAAACGGGCGAGCGCAAAATCTCGTCAGCAGATAGAGCATGATTACTTTTTGACCCGGCTGTCTCAGCGCTGGATGGCGCGGCGTCTGGGAGTCGCATAATGTTCAATAATCTCTGGAAGCCACTAGCTCTCATTGCGTTGGTGGCTTTGCTTTTATGGGGCGTCTCGACCTGGCGCTATGCATCCGGGTACGCCGCTGGTAAGCGCCTGGCTGAGCAGGCGTGGCAGCTTAAGTGGGAGACGCGTAACCGGGATGAGGAGACCGCCAGGGCAAACCGGGAGCGGGGCGAGCGGGCTGAGGAACAACGCCGCTGGCAGGCCATGATTAAGGTGAAACAGAATGCGGATCAACAACTGGAACAGATTAAAGCCGATGCTGCTCGCTCTACCGCTGATGTTGAGCGCCTGCGGCGTACGCTTTCTCAATTGCGGCAGCAGTTGGCAGACCGTTCCCCCTGCCGAGTTTCCACCGCTGGTGGAGCCAGCTCGGCAAGCGCCGCTGCCGGATTTCTGTTTGCCGACGTGCTCGGCGAATCTCTCCAACGCAATGCAGCGTTGGCAGCCTATGCTGACCGGGCCCGAGCCGCCGGCCTTGCCTGTGAGCGGCTCTATGATGCCGTGACGCAGTCGCGAGCGCAGTGAGGGCTGTGTTTTACGCGCATGTTGTGGCTGTGCTGTGCTAAAAAAATGCTGACCGTGCGGTCATCGCTAGGATAACCATCATGACAAAACCGGACTGGGCGGCGATTGCAGTGGCGTTCCAGGACGGTGAGCTCTCCCTGCGAGCTATCGGAGCTCAGTACTTCGTATCCGAAGGCGCCATTCGAAAAATGGCCAAAAAACATGGCTGGGTACGCGGTGAAAAAAACGGTACGCAAAAAAGTACGCAGGTACGCAAAAAAGGTACGCAGAAAAAACAGAGGAAAAAATGCGTACCGATAAAAAGTGAGCCTAGACAGGGTGCTGTGACGCGGTACGCAGGCGTGCATCCCGCGATTTCTTCCGGAGAAAAACCAATTCGCGGCTCACGTCACGCGCCGCCCATCCGTCCTTTTCTGCCGCACAATACGGCGGCGGTGACTCATGGCGCTTATGCTCGCCGCATGCTGTGGCCCGATGACATTATGCAGGACGCGCAGTTGCTGCAACTGAACGACGAGTTATTGCTGTTGCGGGCGGCCAATTTGACGGCAGCGACGAATATCGGACGCTGGATGACGCAGTTAGAGCAGGCTGAGCCGGAGCTGAGCCAGAACTTGCGCGACAATATCGGAGCGGCAGAGCGCGGTATTTTACGTAATACGGCGCGCATCGAATCGTTGGAGCGAACCTTACGGGTGAATGCCTTGAATGAGGCGACGACCGCTAAACGGAAACACCCCCACAGACGTGGGGAAGACCAAAGGCCTGCCCCATCCAAGGGATAAACTCCAGAAACACCCCCACAGACGTGGGGAAGACCGGCATAGGTGTTTTTTGCCTCGCTCTCTTTGAGAAACACCCCCACAGACGTGGGGAAGACGCATCATGAATGTCATAGCAGACTCTGACATAAGAAACACCC
CCACAGACGTGGGGAAGACGCAGCGCCCGCGCTGCTCTATCAGCCTAATCGAGAAACACCCCCACAGACGTGGGGAAGACGCCAGCTTTAGGATGTTGCGTGAGGTGTCGGCAGAAACACCCCCACAGACGTGGGGAAGACGAATCCAGATCTAACAACTTTGGCCACAACATAGAAACACCCCCACAGACGTGGGGAAGACGGTGGTATCACCATGAATCACAACGGATCTACAGAAACACCCCCACAGACGTGGGGAAGACTATACAAACACTTACCAACGGTCTATTTTTGTGGAAACACCCCCACAGACGTGGGGAAGACTGCTTCATAGGTCGCCAAATCATGCAATGGCAAGAAACACCCCCACAGACGTGGGGAAGACCCCATTATATTGTTAAAGAACATACTAAACCAGCTACAGATTGTGGTTGGTTTACTGGTCGAATTGATCCTACCCACATAATGTGGACACCGCCCTAAGCGAGGTTTTCGTTTTCAAATTGTTCCGGGCTGAGACCGCCACCGGCACTATGGCGACGCCACCGATTGTAATCACACTCGATATAATTAAACACCGTCGTCCGCATAATTTCCCGGCGGATAAAGCGTTCTCCGTGGATGCACTCCACTTTCAGCGAATGAAAGAAGCTTTCTACACAGGCGTTATCGTAGCAGCAACCCTTTGCACTCATGCTGCCATGCAGATTATGTCGTTTCAGAAGGCTCTGGTAATCTGCTGAACAGTACTGGCCCCCACGGTCAGTGTGAACGATAACGTTTTCCGGGCGTTTACGACGCCATAGCGCCATTTGCAGTGCATCGCAGGCCAGTTGCGCTGTCATTCGTGACGACATCGACCAGCCAATAACGGCGAGTGACCACAGGTCAATGACGACTGCCAGATACAGCCAACCTCCATCTGTACGTAAGTAAGTGATGTCACCCGCCCATTTCTGGTTCGGGCCGCTGGCGGTGAAGTCCTGCTTCAGCAGGTTTCCCGATACCGGCAGGCCATGTTCCCGGTAGCTGACCGGACTGAACTTCCGCGCAGCCTTTGCCCGAAGCCCCTGACGGTGCAGGCTGGTTGCGCTGGTTTTGATGTTGTACCGGGGCAGTTCATCAGCCAGAAGCGGTGTACCATACCGCTGTTTTGCGTCACTGAACGCCTGCCGGACAGCTGCATCACAAACCAGACGGAACTGCTGGCGTCGATTGAGCTGATGGCGTCGCAGGCGCCATGCATACCAGCCACTGCGGGCAACCCGAAGCACACGACACATGATTTTGACGCTGAACTCAGCCTGATGTTTTTCGATAAAGACATACTTCATTTCAGGCGCTTCGCAAAGTATGTCGCTGCCTTTTGGAGAATGACCAGCTCCTCGTCTCTCTCCGCCAGTTGGCGTTTGAGACGGGCATTTTCAGCGGCGAGTTCATTTTCGCGATCAGAGGATGAGAGCTGTTGCTGCTGTTGAGTGCGCCAGTTATAGAGCTGCGATTCATACAGACTAAGTTCACGGGCAGCGGCAGCAACACCGATGCGTTCTGCAAGCTTCAGGGCTTCGTTGCGAAATTCAGGCGCATGTTGCTTACGTGGCTTCTTACTGGTTGATACGGATTTTGTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP016043|1736556:1748108|1743591_1743930_+|WP_070244879.1|holin|DBSCAN-SWA MHHNPGSWLEWKELLWGWWQGETPVGGVLLAILTAAVRVTYLGGGWKQTALEGALCGALTLTVVATLDYFNLPKSLTPAIGGAIGFIGVQQVQHFALYILHRKLGLPTDKER >NZ_CP016043|1736556:1748108|1743299_1743470_+|WP_167352257.1|DBSCAN-SWA MKKQVAKSEVRFDTQKAFAGMGAAVELLMRAAPNVLEHKVSGPEKQGKARMRKAAA >NZ_CP016043|1736556:1748108|1743932_1744484_+|WP_070244880.1|DBSCAN-SWA MALTKDQIFDALLGREGGYVDHPHDKGGPTKWGITEKVARAHGYTGDMRNLTRAQALKIYESDYWSGPRFDQVAELSARVAAELCDTGVNMGTSVPSKWLQRWLTAFNDGERLYPDISADGVIGPRTLSALRAYLDARGEEGEQVLLRALNCSQGDRYLALAEQRVQNESFLYGWVRERVTLS >NZ_CP016043|1736556:1748108|1745013_1745556_+|WP_070244881.1|DBSCAN-SWA MFNNLWKPLALIALVALLLWGVSTWRYASGYAAGKRLAEQAWQLKWETRNRDEETARANRERGERAEEQRRWQAMIKVKQNADQQLEQIKADAARSTADVERLRRTLSQLRQQLADRSPCRVSTAGGASSASAAAGFLFADVLGESLQRNAALAAYADRARAAGLACERLYDAVTQSRAQ >NZ_CP016043|1736556:1748108|1746957_1748108_-|WP_156774565.1|transposase|DBSCAN-SWA MTKSVSTSKKPRKQHAPEFRNEALKLAERIGVAAAARELSLYESQLYNWRTQQQQQLSSSDRENELAAENARLKRQLAERDEELVIPPKGSDILCEAPEMKYVFIEKHQAEFSVKIMCRVLRVARSGWYAWRLRRHQLNRRQQFRLVCDAAVRQAFSDAKQRYGTPLLADELPRYNIKTSATSLHRQGLRAKAARKFSPVSYREHGLPVSGNLLKQDFTASGPNQKWAGDITYLRTDGGWLYLAVVIDLWSLAVIGWSMSSRMTAQLACDALQMALWRRKRPENVIVHTDRGGQYCSADYQSLLKRHNLHGSMSAKGCCYDNACVESFFHSLKVECIHGERFIRREIMRTTVFNYIECDYNRWRRHSAGGGLSPEQFENENLA >NZ_CP016043|1736556:1748108|1736556_1737189_-|WP_167352271.1|DBSCAN-SWA MGSRILKRRKELKLSQVTLSKAVGVSNVAISQWERDETAPRGDALLALARELLCPAEYLVNGTPADTPLAIPVALHPKGKYPLLSWTLVNHGSLAIRSYTREKAEHWYSTTVDCSAASFWLTVEGDSMTATAGLSIPEGTAILVDPDRNPTNGKLVVAASLSDDEAIFKRYILDVGKKYLKPLNTMYSMVEINDNYEIIGVVVEARIAIP >NZ_CP016043|1736556:1748108|1738127_1738292_+|WP_156774564.1|DBSCAN-SWA MHTPTAGLLRLRCRVLPEREDYRYEVNVLGRWWPCNYTLARWTVEYCRQGWGGM >NZ_CP016043|1736556:1748108|1742567_1743122_+|WP_070244878.1|DBSCAN-SWA 
MANGSYGLNLEEIGQSVRNNLQLIIESQGLPLAVGPLTDEDFRILSGGFGELEWDYALTKYGNDPNKFEFCIKLVKQVTETVPSGVALCVYGIDDRVFRIHMIERFCRDDESHPLKGRMVALAIMAAFIFCKAVDAIDVFIMEPVAELVDYYHSFGFVEHESCSYVLRASVNELVSAFEMFAQK >NZ_CP016043|1736556:1748108|1737647_1738097_+|WP_070244874.1|DBSCAN-SWA MGDIKDAVKAMCESMPGGRAAMAGALGMSPTSFNNRLYEKNGCKFFDRHDLEAMEDLSNTHHLADYFAARRGRITVRVQSRDELDPVELFTLATLTAAHKGQVDLAIQHSISDGIINSSEERDILALHSQYVAARDAYVRAIIALHKAQ >NZ_CP016043|1736556:1748108|1737285_1737471_+|WP_070245634.1|DBSCAN-SWA MLKQTVVKHFGSQRAVAQALQVSDSAVSQWKTLIPERAALKLHRITAGKLKYSPCFYQKSS >NZ_CP016043|1736556:1748108|1740983_1741880_+|WP_083275066.1|DBSCAN-SWA MILLKNILNQLQSKQHKNVIQKIERLDCSPEFASAEFSAHVKNIQAGAVNRDSKYYEMTKDGFVFLVMGFTGKKAAAFKEAYIAEFNRMEAMLRQPHSLPTVHLTIEQQGTLKALVKSRVDALPQNKRAKAAISLWSALKSHFGVSYKAIAADQFTDALSLVARLTLDGEALVPLTNRSRYHFPLECADPHDRGLANAWMTPRVILDIRNRAPELELLEALEQDGHDITGAKIRIHAMYDITGQFVAMQKELATVRSYLSTLNDMLKGRSEERGLNVCFAEPNKGRLFGGFRERGFTR >NZ_CP016043|1736556:1748108|1740285_1740561_+|WP_070244877.1|DBSCAN-SWA MRGKPIVWRHHNLRQIQEHMLNAERCEQRGLWRRAGHEWMQVIEHCTDDVLVEYAVQQRNYCAQMGVFGSASIDPRMVAANQCDEPLQDEG >NZ_CP016043|1736556:1748108|1738291_1738465_+|WP_083275005.1|DBSCAN-SWA MRIPKQGSYYQDRNGVVVRITGYERESQRVLYRRPGYEWGCASPLVVFNAKFRRYQG >NZ_CP016043|1736556:1748108|1739561_1740296_+|WP_070244876.1|DBSCAN-SWA MKNLVSAVQRRDAAALSRMAGQPLQERVVNGNAEKLVDVLFENLLLLFPASRNTVFAAPDEVAAMKRQWITAFAEGGITTLEQVKAGVSMARQHGGDFWPSCGRFMEWCREGVRSAGGLPSDDEVLAEFHRYARDKARFASPEAFDWAHPVMYWVVLDVRQRMYRYNYTEAEVLRAIKAQMQRWERNIRAGQRIPTPVKQLVHVQRPPAIADQLDPTGGAGFYQVGVAFLEQIRQRLRGGEHEG >NZ_CP016043|1736556:1748108|1738461_1739565_+|WP_070244875.1|DBSCAN-SWA MSSLIQILDRPIAYNPALAKLRAGKVKAGPVAAVFLSQMIYWHNRMGGEWMYKTQADITTETALTRDEQETARKRLVALGVLDEARRGVPATLHYRINVARLEALLLEAATPVATPAPTAKTRTRDIQNSEPSQPGPAHSDQSRMVQSQNVETPQSGLVQPRKLDCGDAANKNVETPQTSMGEPTEQVCGDPANFHTGDYTENTQENKKPSCPDAAQPDEPDSDHDFLSRHPEAVVFSAKKRLWGRQEDLTCAEWIWGRIVRLYELAAEDDGEVVRPKAPNWTVWANEVRLMCHQDGRTHRQICELFGRVNRDPFWCRNVLSPAKLREKWDELVIRLGAPGAGAQDRSLKTLLGADWNTEQGWESVL 
>NZ_CP016043|1736556:1748108|1741809_1742289_+|WP_083275006.1|DBSCAN-SWA MSALPSQIRAVCLVAFGNEALRDSDKVWAWESPHLQFIEVNMVEGIIMLVGDEAPLAGCMLRPKLLRWESSKYTRWVKTQPCCGCGNPADDPHHIINSGLGLGGIGTKTHDLFVIPLCRRCHDELHHDVGGWEQRNGSQLVLLVQFLNRALGIGAIIKA |
16 | Enterobacteria_phage(23.08%) | holin,transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
1940538 : 1949855
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP016043|1940538:1949855|DBSCAN-SWA TTTAACTCGGCTGATAGTAGAGCCAGTCGCGATCGCCGCCGCGGACGCGGCTGAAGGGGACACCGAACAACGCGCTCAAGGTCGCCAGCTGCATCACCGTCGCCGTCGGCCCCTGTGCCGTCACCCGCCCCTGCCCAACCATCCATACCTCATCGGCATGAAACAAGGTATGGTTAAGATCATGGGCGCTGGCGATCACGGAGATCCCGCTGGCACATAACTCGGCGAGCAGTCCATCTAACGCCGCCTGTTGGCGGATATCCAGCCCGCTCATCGGTTCGTCTAACAGCAACAGGCGAGCGTGGGGATTGAGCGTCGGCCAGATCTGGAGTAGCACCGCGGCCAGACGGACCCGCTGCCATTCGCCCCCAGAGAGCTGGCTTAAGGGCCGGCGCAACTTGTCCGTCAGACTCAAACGCGCGCTCAGCGCGGCTAAGACCTGATCCACCGCCGCCGGGGCCGCCGGCGGGGCGTACAGCCCCAGGTAGGCAAAGACCGGCATCAAGCCGCTCGGCAATTGCTGCTGCACCAGACAGGCACGCCGCCGCGCCTGTTGGGCACTCGACAGCGCGAACAGGGCCTGCCCATCCAGCCACACCTCACCGCTGCCCTCCAGCAGTCCGGCCAGACGCAGCAAGAGCGTACTCTTGCCCGCGCCGTTGGGGCCGATCACATGCACCAGCCGCCCGGCGGCCAGTTCAGCGTCGACAGGCGCCAGACGCCCGGCAACCGCCAGGCGTGTGATACGCAGCATCGGCTCAGCCATCCTGCCTCCCCGCCGGGCGGCGTAGCAGCATCCAGATAAACAACGGCGCCCCGATGGAGGCGGTCACCACCCCGATGGGCAGCTCGGCGGAGAATAACGCCACCCGCGCCAGCACGTCCGCCAGCAGCAGGACCGCCGCCCCGGCCAAGGCACAGGCCGGTAGCAACATGCGATGTGCGGTGATCCCCCACAGGCGTAACATATGCGGCACGACCAACCCGACGAAACCGATCACCCCGGCCAACGCCACGCTCACCCCGACCAGCCAGCCGATGGCCAATACCAGCAGACTCCGCCAATAGGCCATGCCTAGCCCAAGCTGGCGCGCCGGCAACTCGCCCAGCGCCAACCGATCCAGCGTCGGCCCGCAACGCAGTAACCAGGACAAGACTGGTAATAACGCCACCACCAGCCAGCCCTGACGCCAATCGACGCCACTGAAGCCCCCCATCAGCCAATAGAGCAACTGACGTAGATCCAGGCTAGTGCTGAAATAGACCGCCCAGGTCATCACCGCGCTACAAGCGATACCCAACGCGACACCGATCAGCAACAGACGGGCGTTATGGATACGCCCACGACGGGCGAATAACAGTAATAACAAGGTCGCCGCCAACGCGCCGACGATGGCCAGTGCGCTCAATTGCCAGAGCGGCAACAGGCTCGGCCCCAGCAACACCGCGATCACCAGGGCAACGCCGGCACCGTTGGAGACGCCAAGCAAGCCCGGCTCCGCCAGGTGGTTATCGAACAGCGCCTGCATCACCGCGCCACTGACGGCCAGAGAGGCACCGACCGCCATCACCGCCAAGATGCGCGGCAGGCGTAGCTGGCCGACGAAGAGTTGCCCTTGCGGCGAGAGCCATTGCGTCGGCCATAACCAGACGTCGCCGGCACTCAATCCCAGCAATAGCGCGCCGAACAACAACACCACCAGCCCCAGCAAACGCCGCCGATCATGGGAACGTTGGCGGCGTAGTAAGGCGCTGAGACTGACTTCCGCTTCAGCAGGAGGCATCATCGTCATAGGGCTCGACAAAGATCCCCTGTTGGCAGCCGATCTCGTTGAAGAACCAAATTCCATTCGGGTAGTCGTCCAACGCCACCAGATACATGGTCCCCTCGCTGAACGGCTCCAACGCCAGGATACGCCCAAT
GCGGCGCGGACCTCCGTCCGTCTTTACGCTAACCCTATCGTTAACTTGCATGATTCATGCCTCTGTGTGCGCCGCGCAAACGGCTGAATATCGCAAGAAAAAGGCCACCTAAAGGCGGCCTTTTGTCCTGTTTGCCTGATTTAGTCCTTGGGTGTAGCGTTCTCGACCCGGCTCTTTAACTTCTGCCCTGGACGGAAGGTCACCACACGGCGCGCCGTAATCGGGATATCTTCCCCGGTTTTCGGGTTGCGCCCCGGCCGCTGGTTCTTGTCGCGCAGATCGAAGTTGCCGAAACCGGACAACTTCACCTGCTCGCCATTCTCCAGGGCGCGGCGAATCTCTTCGAAAAACAGCTCGACGAGGTCTTTGGCATCCCGTTTACTCAGCCCAAGCTTTTCAAACAGGTGTTCAGACATTTCAGCTTTTGTAAGCGCCATAGGTTTAATCCCTCAAGGATGCTTGGAATCGCTGTTTTAACGCCTCTACACATTTGGCGACGGTAGCGGCAATTTCCTCTTCTTCCAGTGTACGCGTGGTGTCCTGCAGCACCAGACTGATCGCAAGACTCTTATAACCCTCTGCAACACCCTTACCACAATACACGTCAAATAAGTTTACGCCAACTACCTGATTTGCGCCAACTTTCTTACATTCGGCCAAAACATCTTCCGCCGGCACGTTTTCGGCCACAACAACCGCGATATCGCGGCGGTTTGCCGGGTAGCGAGAAATCTCGCGCGCCTGTGGAATTCGGCGATCAGCCAGGGCATCCCAACGGACTTCAAACACCACGGTGCGACCATTGAGATCGAGTTTACGCTCCAACTCAGGATGTACAACCCCAATGAAACCAATACGTTCACCTTGCAGGTACAACGCGGCGCTCTGCCCCGGATGTAGCGCATCATGCGCCTCAGCACGGAACTGCACCTCATCCAGCTTACCCGTCAGCTCCAACACCGCCTCCAGGTCGCCTTTCAGATCGTAAAAGTCGACGACCTGGCGCTCCATGCCCCAGTGCTCTTCGTTACGCGGGCCGGCAATCACCCCGGCCAGCATCATATCTTGGCGGATCCCTAAGTTGGCGTGCTCGTCCGGGACAAAACGCAAGCCGGTTTCAAACAGGCGCACACGCGTCTGCTGGCGATTCTGGTTATAGACCACCGCGCCCAGCAAACCGCTCCACAACGACAGCCGCATCGCCGACATATCGACGGAGATCGGGCTCGGCAGGATCAACGCCGGCTGTTGCGGATGCAGCAATGCTTGCACCTTAGGATCGACGAAGCTGTAGGTAATCGCCTCTTGATAGCCGCGGTCGACCAGCAGGGTCTTCACGCGTTTCAGCGGCAGATCCGCCTCGCGGTGCGCGGTCATCACCAGATCGGCACGCAGCGGCACGTCGGGGATATTATTGTAGCCGTAAACCCGCGCTACCTCTTCGATCAGATCTTCTTCGATGGCCATGTCGAAACGCCAGCTCGGCGCCAGCGCCTGCCAGCCCCCGTCGATGACCTTCACCTGGCAGCCGAGGCGTTGCAGGATAGCCAACACCTGGTCATCGGCAATATGATGACCGATCAGGCGATCCAGCTTATTGCGGTGCAGGGTAATGTGCGCCGGCTGCGGCAGGTAGGCTTCGTTGCTCACCTCGATAACCGGGCCGGCTTGGCCGCCACACAGCGCCAGCAGCAGTTCGGTGGCGCGCTCCATGGCCCGCCGCTGCAACTGCGGATCAACGCCACGCTCATAGCGGTGTGAGGCATCGGTGTGCAGGCCGTAACGACGGGCGCGGCCCGTGATGGCCAGCGGGTTGAAGTAGGCACACTCCAGCAGTACATCACGGGTTTCCCCATTGACGCCGGAATGCTCCCCGCCGAAGATCCCCCCCAGCGCCAACGGCTGCTGATGGTCGGCAATCACCAACGTGTCGCTGCTCAAGCTGGCCTGATTGCCATCGAGCAGAGTCAGGGTCTCCCCTTCACGCGCCATACGCACCA
CGATGCCTCCCTGCAACCGGTTCAGGTCGAAGGCGTGCATCGGCTGCCCCAGCTCTAACAGGACATAGTTAGTGACGTCGACCACCGGATCGATGCTGCGGATGCCGCAGCGACGCAGCTTCTCGCTCATCCACAACGGCGTCTTCGCCGTCACATCGATGCCGCGCACCACACGCCCCAAATAACGCGGACACGCCTCCGCCGCCTGCACCTCGATCGGGAAGCGATCGTCGATGCTGACCTCGACCGGGGTCACTGCCGGCTCCGTCAGAGAGAGGCCATTCAGCACCGCCACATCGCGAGCAATGCCCAGAATACCGAGGCAATCGGCACGGTTCGGCGTCACGCTGATCTCAATGCTGTTATCGTCCAGCTTCAGGTAATCACGCAGATCGCACCCCAGCGGCGCATCCGCCGGCAGCTCGATGATGCCGTTGTGATCATCGCTGATCCCCAGCTCCGAGAAAGAGCACAGCATCCCTTCCGAGGGTTCACCACGCAGCTTGGCCGCCTTAATCTTGAAATCGCCCGGCAGCACCGCGCCGATGGTGGCGACCGCGACACGCAGCCCCTGGCGGCAGTTCGGCGCGCCACAAACGATGTCCAGCAGGCGCCCTGCGCCGACATCGACCTTAGTCACCCGCAGCTTATCGGCGTTAGGGTGTTGTTGGCAGGCGACCACCTCGCCGACCACCACGCCGTGGAAGGCGCCAGCCACCGGCTCCACGCCGTCCACTTCCAGGCCGGCCATGGTGATTTGTTCGGACAGCGCCGCGCTGTTAATGGCAGGATTCACCCACTCGCGCAACCAAAGTTCACTGAATTTCATGTGTTAATCCCGCCTTATTTGAACTGTTTGAGGAAACGCAGATCGTTTTCGAAGAATGCGCGCAAATCGGTGACGCCGTAACGTAACATCGTCAGGCGCTCCATGCCCATCCCGAAGGCGAAACCGGAGTACACTTGCGGATCGATGCCGACGTTGCGCAGGACGTTGGGGTGTACCATGCCGCACCCCAACACTTCCAACCACTTCCCATTCTTACCCATCACATCCACTTCAGCAGAGGGTTCGGTAAACGGGAAATAGGAGGGACGGAAACGGATCTGGAGATCCTGCTCAAAGAAGTTGCGCAGGAAGTCATGCAGCGTCCCTTTCAGATTGGTGAAGCTGATGTCTTTGTCGACGATCAGCCCTTCCATCTGATGGAACATCGGGGTATGGGTCTGATCATAGTCGTTACGATACACGCGACCCGGCGCGATGATGCGGATCGGCGGCTGTTGATCGCGCATGGTGCGGATCTGCACCCCGGAGGTCTGAGTACGCAGCAAACGCGTAGCGTCAAACCAGAAGGTATCGTGATCGGCGCGTGCCGGGTGATGCGCCGGAATGTTCAGCGCGTCGAAGTTGTGGTAGTCATCCTCGATCTCCGGCCCAGTGGCAACGGCAAAGCCCAATTCGGCGAAGAAGGATTCGATGCGTTCGATCGTACGGGTGACCGGATGCAGGCCGCCATTTTCCATGCGACGCCCCGGCAGCGAGACGTCGATGGTCTCTTCGGCCAGGCGGGCATTCAGCGCCGCCGCTTCCAACGCCGCTTTACGGGCGTTTAGCGCTTCCTGTACCGCCTGCTTAGCCTGGTTGATCACCGCCCCGGCCGCCGGGCGTTCTTCTGCCGGCAGCTCGCGTAGTGACGTCATTTGCAACGTTAAATGACCCTTCTTCCCCAGGAACTCGACACGCACGTTATCCAGCGTGGCGACGTCCTGGGCACTCTCTACGGCTGCCTTCGCCTGGGCAACCAACTCGGCGAGATGTGACATGACTTTCCTCTTCTTCCGGCCAGTATGGCCCGATGGTCGATGTTGAGCGATGAGTCGCTCATCGTATGACGGCACGCGCCCTCTTCGTCTGCGGAGCCGTGGCACCGCGCTAAAAAAAAAGCCTCCACGAGGGAGGCTTTTGGCGCTGTTTTCCGTTTCTTTTCTTACGCGCA
AAGCCTCCTGACAATCAGGCGCTAAAGTAAAAAAAGAAGCGGAAAATAGCAGCGTTCATGCTTGCGTTACCTTGAACATTCAGAACTTATTGAAAACTCTGCATCTATTGAAAAGCTCTCGCCAGAGAATGTCAATAAATTTCGTCTAACCGCCGGCGACAACGCGCTCAAGACGCCCGACCGCATCAGACAGCAAAAGAGGGAGCAAGCTCCCTCTTTCGTCTGACTTATGCCAGAGCCGCTTTCGCTTTTTCAACCAGGGCGGTGAACGCCACTTTGTCGAATACGGCGATGTCGGCCAGGATCTTACGGTCGATCTCAACAGAGGCCTTCTTCAGACCATTGATGAAACGGCTGTAAGAGATGCCATTCTGACGAGCAGCCGCGTTGATACGCGCGATCCACAGCTGACGGAACTGACGCTTACGCTGACGACGGTCACGGTAAGCGTACTGACCCGCTTTGATCACTGCCTGGAAGGCAACACGATATACGCGCGAACGGGCACCGTAGTAGCCTTTCGCCTGCTTCATTACTTTCTTGTGACGTGCACGGGCAATCACACCACGTTTTACGCGAGCCATATGCTCTCTCCTAAAGTCTTATTCTTGATTCAAAAAAATTACTTATGCGTACGGCAGGCACGCGACTACCAGACCCAGATCGCCTTTGGAGACCTGAGACTTCGGACGCAGATGACGCTTACGCTTGGTAGATTTTTTGGTCAGAATATGACGCAGGTTAGCATGTTTACGCTTGAAACCACCACCGGCGGTTTTCTTAAAGCGCTTGGCCGCGCCACGCACAGTTTTAATCTTAGGCATTTCGATTTCCACTTCGCATTGTTAATTACAACGAATCAGTGAGGCGAACGGTTCCACAGACGCCGTAGCGTACCGTGGAACCCTTACTTGATAGCCTTACTGTTTCTTCTTGGGGGCGAGCACCATGATCATCTGGCGGCCTTCGATCTTCGTTGGGAAGGATTCGACCACGGCCAGTTCACTCAGATCGCTCTTGACGCGGTTAAGCATCTCAATACCGATCTGTTGGTGCGCCATCTCACGACCGCGGAAACGCAGTGTGATCTTGGCTTTATCGCCATCTTCCAGAAAGCGAACCAGGCTGCGGAGTTTTACCTGGTAGTCGCCATCATCGGTACCAGGACGGAATTTAATTTCCTTCACCTGGACAACTTTCTGTTTCTTCTTCTGCTCTTTGCTAGCCTTACTCTTCTCATAGAGGAATTTGCCGTAATCCATGATTCGGCAAACCGGCGGTTCAGCGTTGGGGCTGATTTCGACCAGGTCGACGCCAGCTTCCTCGGCCTTCTCAAGAGCTTCTCTCAGACTGACAATACCAATCTGCTCGCCATCCACACCTGTCAAGCGAACCTCTTGGGCGCGAATTTCCCCGTTAATACGATTAGGACGCGCCGTTTGAACTCGTTTTCCGCCTTTAATACCTTATTCCTCCAACTGATGAAGACTACGGCTGCGAATCTCTTGCAGCAGCTTCTCGATTACGTCGTCGATGCTCATCACGCCCAGGTCTTTCCCACGGCGTGTACGCACGGCCACTTTACCGGCCTCGACTTCTTTGTCGCCGCAAACCAGCATATAAGGAACACGACGTAAAGTATGTTCGCGAATTTTAAAGCCAATCTTCTCGTTTCTCAAGTCTGATTTTGCGCGAATCCCTGCGGCTTGCAGTTTTTTGTTCAATTCTTCGACATATTCAGCCTGAGAATCAGTGATATTCATCACTACGGCCTGCACCGGTGCTAACCACGTCGGGAAGAAGCCGGCGAACTCTTCGGTCAAGATCCCGATGAAACGCTCCATAGAGCCGAGGATAGCGCGGTGAATCATCACCGGAACCACACGCTCATTGTTTTCACCGACATAAGACGCGTTCAGACGGCCCGGCAGGGAGAAGTCGAGCTGCACGGTACCACACTGCCAGGCGCGATCCAAGCAGTCATGCAAAGTGAATTCGAT
CTTCGGCCCGTAGAAGGCCCCCTCACCCGGCTGGTATTCGAACGGAATGCCGTTCTCTTCCAGCGCGGCGGCCAGATCCTCTTCGGCACGATCCCAGATCGCATCGCTACCGATACGCTTCTCCGGACGCGTGGAGAGTTTCACCACGATCTTCTCGAAGCCAAAGGTGCTGTACATGTCGTACACCATCTTGATGCAGCTATTGACCTCGTCGCGGATCTGCTCTTCAGTACAGAAGATATGCGCATCGTCCTGGGTGAAGCCGCGCACACGCATCAGGCCATGCAGCGAACCGGACGGCTCGTTACGGTGGCAACTGCCGAATTCGGCCATACGCAACGGTAGATCACGGTAGGACTTCAACCCCTGGTTGAAAATCTGCACGTGTCCCGGGCAGTTCATCGGTTTGATGCAGTACTCACGGTTCTCCGAAGAGGTAGTGAACATCGCATCCTTGTAGTTTTCCCAGTGACCGGTCTTCTCCCACAGCACCCGATCCATCATGAATGGGCCTTTGACCTCTTGGTACTGGTATTCTTTCAGCTTCATCCGCACGAAAGCTTCCAGCTCGCGGAAGATGGTCCAGCCGTCGTTATGCCAGAACACCATGCCCGGCGCCTCTTCCTGCATATGATACAGGTCGAGTTGCTTACCGATCTTACGATGGTCGCGCTTGGCGGCCTCTTCCAGACGTTGCAGGTAGGCGTTCAGTTGCTTCTTGTCCGCCCAGGCGGTGCCGTAAATACGCTGCAACATCTTATTGTTGCTATCGCCGCGCCAGTAAGCGCCAGACGATTTCTGTAACTTAAAATGGTGACAGAAGCGCATGTTCGGCACGTGCGGCCCACGGCACATGTCAACATACTCTTGGTGATGATACAGACCAGGCTGATCATCGTGGCTGATGTTTTCATCGAGGATGGCTACCTTGTACGGCTCATCCCGCGCGACGAAAGCGTCGCGTGCTTCCTGCCAGCTGACTTTCTTCTTGATGACGTCGTAATCTTTTTCCGCCAGCTCATGCATGCGTTTTTCCAGGCGATCGAGATCTTCCTGGGTCAGGGTATGCTCCAAATCGACATCGTAATAGAAGCCGTTGTCGATCACCGGGCCGATAGCCATCTTGGTTTGTGGCCACAATTGTTTGATGGCGTGGCCCAGTAGGTGGGCGCAGGAGTGACGCAGGATCTCCACCCCTTCCTGATCCTTGGCGGTAATGATGGCGACGGTGGCGTCGCGCTCGATCAGATCGCACGCATCGACCAACTCACCGTTGACGCGACCGGCAATGCAGGCTTTAGCCAGACCGGGGCCGATATCGCGGGCGATATCCATAACGGAAACGGGATGATCGAAGTGACGCTGACTGCCGTCAGGAAGCGTAATAACAGGCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP016043|1940538:1949855|1947926_1949855_-|WP_024523381.1|tRNA|DBSCAN-SWA MPVITLPDGSQRHFDHPVSVMDIARDIGPGLAKACIAGRVNGELVDACDLIERDATVAIITAKDQEGVEILRHSCAHLLGHAIKQLWPQTKMAIGPVIDNGFYYDVDLEHTLTQEDLDRLEKRMHELAEKDYDVIKKKVSWQEARDAFVARDEPYKVAILDENISHDDQPGLYHHQEYVDMCRGPHVPNMRFCHHFKLQKSSGAYWRGDSNNKMLQRIYGTAWADKKQLNAYLQRLEEAAKRDHRKIGKQLDLYHMQEEAPGMVFWHNDGWTIFRELEAFVRMKLKEYQYQEVKGPFMMDRVLWEKTGHWENYKDAMFTTSSENREYCIKPMNCPGHVQIFNQGLKSYRDLPLRMAEFGSCHRNEPSGSLHGLMRVRGFTQDDAHIFCTEEQIRDEVNSCIKMVYDMYSTFGFEKIVVKLSTRPEKRIGSDAIWDRAEEDLAAALEENGIPFEYQPGEGAFYGPKIEFTLHDCLDRAWQCGTVQLDFSLPGRLNASYVGENNERVVPVMIHRAILGSMERFIGILTEEFAGFFPTWLAPVQAVVMNITDSQAEYVEELNKKLQAAGIRAKSDLRNEKIGFKIREHTLRRVPYMLVCGDKEVEAGKVAVRTRRGKDLGVMSIDDVIEKLLQEIRSRSLHQLEE >NZ_CP016043|1940538:1949855|1942901_1945289_-|WP_024523385.1|tRNA|DBSCAN-SWA MKFSELWLREWVNPAINSAALSEQITMAGLEVDGVEPVAGAFHGVVVGEVVACQQHPNADKLRVTKVDVGAGRLLDIVCGAPNCRQGLRVAVATIGAVLPGDFKIKAAKLRGEPSEGMLCSFSELGISDDHNGIIELPADAPLGCDLRDYLKLDDNSIEISVTPNRADCLGILGIARDVAVLNGLSLTEPAVTPVEVSIDDRFPIEVQAAEACPRYLGRVVRGIDVTAKTPLWMSEKLRRCGIRSIDPVVDVTNYVLLELGQPMHAFDLNRLQGGIVVRMAREGETLTLLDGNQASLSSDTLVIADHQQPLALGGIFGGEHSGVNGETRDVLLECAYFNPLAITGRARRYGLHTDASHRYERGVDPQLQRRAMERATELLLALCGGQAGPVIEVSNEAYLPQPAHITLHRNKLDRLIGHHIADDQVLAILQRLGCQVKVIDGGWQALAPSWRFDMAIEEDLIEEVARVYGYNNIPDVPLRADLVMTAHREADLPLKRVKTLLVDRGYQEAITYSFVDPKVQALLHPQQPALILPSPISVDMSAMRLSLWSGLLGAVVYNQNRQQTRVRLFETGLRFVPDEHANLGIRQDMMLAGVIAGPRNEEHWGMERQVVDFYDLKGDLEAVLELTGKLDEVQFRAEAHDALHPGQSAALYLQGERIGFIGVVHPELERKLDLNGRTVVFEVRWDALADRRIPQAREISRYPANRRDIAVVVAENVPAEDVLAECKKVGANQVVGVNLFDVYCGKGVAEGYKSLAISLVLQDTTRTLEEEEIAATVAKCVEALKQRFQASLRD >NZ_CP016043|1940538:1949855|1941295_1942321_-|WP_070245635.1|DBSCAN-SWA 
MPPAEAEVSLSALLRRQRSHDRRRLLGLVVLLFGALLLGLSAGDVWLWPTQWLSPQGQLFVGQLRLPRILAVMAVGASLAVSGAVMQALFDNHLAEPGLLGVSNGAGVALVIAVLLGPSLLPLWQLSALAIVGALAATLLLLLFARRGRIHNARLLLIGVALGIACSAVMTWAVYFSTSLDLRQLLYWLMGGFSGVDWRQGWLVVALLPVLSWLLRCGPTLDRLALGELPARQLGLGMAYWRSLLVLAIGWLVGVSVALAGVIGFVGLVVPHMLRLWGITAHRMLLPACALAGAAVLLLADVLARVALFSAELPIGVVTASIGAPLFIWMLLRRPAGRQDG >NZ_CP016043|1940538:1949855|1946687_1947044_-|WP_024523383.1|DBSCAN-SWA MARVKRGVIARARHKKVMKQAKGYYGARSRVYRVAFQAVIKAGQYAYRDRRQRKRQFRQLWIARINAAARQNGISYSRFINGLKKASVEIDRKILADIAVFDKVAFTALVEKAKAALA >NZ_CP016043|1940538:1949855|1947086_1947284_-|WP_005293643.1|DBSCAN-SWA MPKIKTVRGAAKRFKKTAGGGFKRKHANLRHILTKKSTKRKRHLRPKSQVSKGDLGLVVACLPYA >NZ_CP016043|1940538:1949855|1947380_1947923_-|WP_070244941.1|DBSCAN-SWA MKGGKRVQTARPNRINGEIRAQEVRLTGVDGEQIGIVSLREALEKAEEAGVDLVEISPNAEPPVCRIMDYGKFLYEKSKASKEQKKKQKVVQVKEIKFRPGTDDGDYQVKLRSLVRFLEDGDKAKITLRFRGREMAHQQIGIEMLNRVKSDLSELAVVESFPTKIEGRQMIMVLAPKKKQ >NZ_CP016043|1940538:1949855|1946474_1946519_-|WP_106120997.1|DBSCAN-SWA MNAAIFRFFFYFSA >NZ_CP016043|1940538:1949855|1942307_1942511_-|WP_024523386.1|DBSCAN-SWA MQVNDRVSVKTDGGPRRIGRILALEPFSEGTMYLVALDDYPNGIWFFNEIGCQQGIFVEPYDDDASC >NZ_CP016043|1940538:1949855|1942600_1942897_-|WP_005285818.1|DBSCAN-SWA MALTKAEMSEHLFEKLGLSKRDAKDLVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRPGRNPKTGEDIPITARRVVTFRPGQKLKSRVENATPKD >NZ_CP016043|1940538:1949855|1945303_1946287_-|WP_024523384.1|tRNA|DBSCAN-SWA MSHLAELVAQAKAAVESAQDVATLDNVRVEFLGKKGHLTLQMTSLRELPAEERPAAGAVINQAKQAVQEALNARKAALEAAALNARLAEETIDVSLPGRRMENGGLHPVTRTIERIESFFAELGFAVATGPEIEDDYHNFDALNIPAHHPARADHDTFWFDATRLLRTQTSGVQIRTMRDQQPPIRIIAPGRVYRNDYDQTHTPMFHQMEGLIVDKDISFTNLKGTLHDFLRNFFEQDLQIRFRPSYFPFTEPSAEVDVMGKNGKWLEVLGCGMVHPNVLRNVGIDPQVYSGFAFGMGMERLTMLRYGVTDLRAFFENDLRFLKQFK >NZ_CP016043|1940538:1949855|1940538_1941303_-|WP_024523388.1|DBSCAN-SWA 
MAEPMLRITRLAVAGRLAPVDAELAAGRLVHVIGPNGAGKSTLLLRLAGLLEGSGEVWLDGQALFALSSAQQARRRACLVQQQLPSGLMPVFAYLGLYAPPAAPAAVDQVLAALSARLSLTDKLRRPLSQLSGGEWQRVRLAAVLLQIWPTLNPHARLLLLDEPMSGLDIRQQAALDGLLAELCASGISVIASAHDLNHTLFHADEVWMVGQGRVTAQGPTATVMQLATLSALFGVPFSRVRGGDRDWLYYQPS |
11 | Brazilian_cedratvirus(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_3 |
3205796 : 3249648
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP016043|3205796:3249648|DBSCAN-SWA ATTAACCGAGCGCATTCACTGGCGGTAACTCAGCCAGAGGCCAACGCGGACGCACGGTGATCGACAAGTCGGCATCGACCCCGCTCTTCAATCGTACCATACCTGCGTAGGCGATCATCGCGCCATTGTCGGTGCAAAACTCAGGCCGGGCGTAGAACACTGCGCCACCACGCTGTTGCATCATCTGCGCTAGGCGCGCGCGCAACGCGCGGTTAGCACTCACACCACCGGCCATCACCAGGCGCTGAAAGCCCGTCAACTCCAATGCCCGGCGGCATTTGATCGCCAACGTCTCTACCACCGCATCCTCGAAGGCGCGCGCAATATCGGCACGCGTCTGCGCATCATCGCCATTGGCATGAATGGTATTGGCGGCAAAGGTTTTTAGACCCGAAAAACTTAAATCTAACCCCGGACGATCGGTCATTGGACGCGGGAAAACGAAACGGCCAGGTACACCCTGCTGAGCCATCTGTGACAGCACCGGCCCACCGGGATAATCCAATCCCAACAGTTTGGCAGTCTTGTCAAAGGCTTCGCCGGCGGCATCATCGATCGACTCGCCCAAGAGGCGATACTCACCAATGCCAGTAACACTGATAAGCTGAGTGTGACCGCCAGAGACCAACAGCGCAACAAACGGGAATGCTGGCGGCGTATCCTCCAACATCGGCGCTAACAGATGCCCTTCCATATGATGCACCGGCACCGCAGGCAACCCCCAGGCAAAGGCCAACGCACGCCCGACGGTAGCGCCAACTAATAGCGCCCCCACCAGTCCCGGTCCAGCGGTATAGGCTACGCCATCCAGATCGGCCGAGGTCAGCCCCGCCTCATGCAGCGCCACCTGGATCAGCGGTACGGTCTTACGCACATGGTCGCGCGACGCCAGTTCCGGCACCACGCCACCATAGTCGGCATGCAGTTTAATCTGGCTATATAGCTGATTGGCCAGGATCCCTTTTTCATCGTCGTAAATGGCGATACCGGTTTCATCACAGGAGGTTTCGATCCCCAGAATACGCATGACGGCTCCAACAACTCGTTCTGAACAACTCAAAGTGGCGCATAATAGCACAATTCTGCTATACTACTTCGCTTTCTTACTGCGGCCCAACGCCGTTGAAGGATGAGACAGCGCTTTACAATCGGGTGCTGTTCGGATTAAAATTCCGCACCATTTTTGAACAGCTGGCAGATGGCCAGCGAAAAGACCGATTTTTATCGAGGTGAGAGGCACATGCCGGTAATTAAAGTACGTGAAAACGAGCCGTTCGACGTAGCTCTGCGTCGCTTCAAACGTTCCTGCGAGAAAGCAGGTATCCTGGCTGAAGTTCGTCGTCGTGAATTCTATGAAAAACCGACCACCGAACGTAAGCGCGCTAAAGCGTCCGCAGTTAAACGTCACGCCAAGAAACTGGCTCGCGAAAACGCACGCCGTACTCGTCTGTACTAATATCGGCGGGGTTTTCCCCGCTCGTTAGTAGCTCACAGATCATGTCGTTGTTGGTAGTATTCGGACAATGACATATGAAGGCCGTGCCTTCCGAAAGGAATGCGCGGCTTGTTCTCGTTTAGAGGCTGTAAAACTGTTTGCCCTCTTCCCTGCGGGGACGCCCGGCAGACAACCACTGTAAAGGGCTTATGGCTGGACGAATCCCACGCGTATTTATCAACGATCTGCTGGCTCGAACCGATATCGTCGAGCTGATCGATGCGCGGGTGAAACTCAAGAAGCAAGGCAAGAACTATCATGCCTGCTGTCCATTCCATAACGAGAAGACTCCCTCGTTCACCGTTAACGCGGACAAACAGTTTTACCATTGCTTTGGCTGCGGTGCCCACGGCAACGCCATCGATTTTTTGATGAACTATGACCGGCTTGAATTTGTCGAAAGCATTGAAGAGTTGGCCACTATGGCG
GGACTGGAAGTGCCCTATGAGTCTGGCACTGGACCGAGCCAAATGGAGCGCCATCAGCGCCAAAATCTCTACCAGTTAATGGAAGGGTTATGTCGCTACTACCAGCAGGCGTTACGCCAGCCTGACGCGACACAGGCGCAGCACTACCTCGCCGAGCGCGGTATGAGTCAGAGCATCATGGATCGCTTTGCCATCGGTTATGCACCGCCGGGTTGGGATAATGCGCTCAAGCGTTTTGGCCGTGAGCCCGCGGGACGTAGTGCGCTTAACGATGCCGGGATGCTGGTCAATAACGACAACGGTCGCAGTTATGACCGCTTCCGTGACCGCATCATGATCCCCATCCGCGACAAACGCGGGCGGGTAATCGCCTTTGGCGGGCGCGTATTAGGTAACGGTACGCCTAAGTACCTTAACTCGCCCGAGACAGAGATTTTCCATAAAGGCCGCCAGCTATTCGGCCTGTATGAGGCATTACAACACGCAGCGCAACCGGAGCGTTTGCTGGTGGTCGAAGGGTATATGGATGTCATCGCCCTGGCGCAGTATGGCATCAATTACGCCGTTGCGTCGTTAGGCACCTCTACCACCGCCGAACATATACAATTACTGTTCCGCCATACTGACAGCGTAGTGTGCTGTTACGATGGCGATAACGCTGGCCGTGAAGCGGCTTGGCGTGCGCTGGAGACCGCGCTGCCCTATCTGAGCGATGGCCGCCAACTGAAATTTATGTTTCTCCCCGAGGGCGAGGATCCGGATACATTGGTGCGCCAAGAGGGGACGGACACTTTCGAGCAAAGGATGAACGCGGCGCATCCTTTGTCGACTTTTCTGTTCGACAGCCTGCTGCCCCAGGTCGACTTGAGCACCCCGGATGGTCGGGCCAAACTCAGCGCCCTTGCGTTACCGCTCATCGGCCAGGTACCCGGTGAGACCTTGCGCCTCTACCTGCGCCAGGCGCTCGGACAGAAGTTAGGCATACTCGACGATAGTCAGTTGGAAAAGCTGCTGCCACAGCGTAATTCGGCGGTAAAGAATTATCAGCCGCCCAGGTTAAAACCGACGACTATGCGTATACTAATAGCATTGCTGGTGCAAAATCCGGCGCTAGCCGCCGAAGTGCCAACACTAGAAGGATTGCGCCAACACTCGCTGCCGGGTCTGCCGCTGTTTATCGAGCTGGTGGAGACCTGCCTGGCACAACCGGGGCTAAGCACCGGACAGTTGCTAGAGTTGTATCGCGATAATAAATATGCGCCACAGCTTGAAACATTAGCAACATGGAACCACATGATCATAGAGCCAATGGTGCTCGAAACGTTCGTCGACACCCTTGGCAGCCTATATGATGCCGTGCTGGAGCAACGGCTGGAACATCTTATCGCCCGCGCCAGAACCGAGGGTCTGACGCCCGCGGAGCGCGAAGAGGTGCGTTCACTGAACGAGGCGTTAGCCAAAAAGCATTAATGCATAATAGACACCGCGTCCTGAATGGAGGCGGTGATTAGACATCATCATAAATTTAGCGGCTTGATTGCCGATACCACGCAGGATGTAACACCTGCATGCCGCAATACCGGGGAGCGGCGACGAATGAAAGTAACCCGCGTACGGTCCTGATGATTCGGGCCTTAAGCATCGTTGGCTGGCATAGATAATCCGACCAACACCAATTAAAACTCTGAAGTGTGGATACCGTCTTATGGAGCAAAACCCGCAGTCACAGCTCAAACTTCTTGTTACCAAGGGCAAGGAGCAAGGCTACCTGACCTATGCTGAGGTCAATGACCATCTGCCGGAAGATATCGTCGACTCCGATCAGATCGAAGACATCATCCAAATGATTAATGACATGGGCATCCAAGTGATGGAAGAAGCACCGGATGCCGATGACCTGATGCTGGCGGAAAACATTGCCGACGAAGACGCCGCCGAAGCGGCGGTACAGGTTCTGTCTAGCGTCGAATCCGAAATTGGCCGTACCACCGACCCGGTGC
GCATGTACATGCGTGAAATGGGTACCGTCGAACTGCTGACCCGTGAAGGGGAGATCGATATCGCCAAGCGAATCGAAGACGGTATCAACCAGGTGCAGTGCTCCGTCGCCGAGTATCCGGAGGCCATCACCTACCTGCTGGAGCAGTACGATCGCGTCGAAGCGGGCGAAGCCCGTCTCTCCGATCTGATCACCGGCTTCGTCGATCCTAACGCCGAAGAAGAGATGGCACCGACTGCAACTCACGTGGGCTCCGAGTTGCCGGAAGAAGAGCTGAACGACGACGATGAAGACGAAGATGAAGATGGCGACAGTGATGACAGCGACGATGACAACAGCATCGATCCCGAGCTCGCACGGCAAAAATTCATCGAACTGCGCGAGCAGCATGAGAAGACGCGTCTGGCCATCAAAGCCGAAGGACGCAGCAGCGCCAAAGCGCAAGAAGAGATCTTAAACCTGTCGGAAATCTTCAAGCAATTCCGCCTGGTACCGAAGCAGTTCGACTACCTGGTCAATAACATGCGTGAGATGATGGATCGCGTGCGTGGCCAAGAGCGACTGATCATGAAGCTGTGCGTCGAGCAGAGCAAGATGCCGAAGAAAAACTTCATCACCTTGTTCACTGGCAACGAAACCAGTATCACTTGGTTCGAGGCCGCGTTAGCGATGGGCAAGCCTTGGGGTGAGAAACTCAAGGATATGCGTGAAGAGGTCGAGCGCAGCCTGCAAAAACTGCATCAGATCGAAGAAGAGACCGGTCTGACCATCGAACAGGTCAAAGACATCAACCGCCGTATGTCTATCGGCGAAGCGAAAGCGCGCCGCGCCAAGAAAGAGATGGTCGAGGCCAACCTGCGTCTGGTTATCTCCATCGCCAAGAAGTACACCAACCGCGGCTTACAATTCCTGGATCTGATCCAAGAGGGTAACATCGGTCTGATGAAGGCGGTTGATAAGTTCGAATACCGTCGTGGTTATAAGTTCTCAACCTACGCCACTTGGTGGATTCGCCAGGCGATTACCCGCTCCATCGCCGATCAGGCGCGTACTATCCGCATCCCGGTACATATGATTGAGACCATTAACAAGCTCAACCGTATCTCACGCCAGATGCTGCAAGAAATGGGACGCGAGCCGACGCCGGAAGAGTTGGCCGAACGCATGATGATGCCGGAGGATAAGATCCGCAAGGTGCTGAAGATCGCCAAAGAGCCGATCTCGATGGAAACCCCGATCGGTGACGATGAAGATTCGCATCTGGGTGATTTTATCGAAGACACCACGCTGGAGCTGCCGCTGGATTCCGCCACCTCCGAGAGCCTGCGCTCCGCAACCCGTGACGTGTTGGCCGGACTGACCGCACGCGAAGCCAAGGTACTGCGTATGCGTTTCGGCATCGACATGAACACCGACCATACCTTGGAAGAGGTCGGTAAACAGTTCGACGTTACCCGTGAGCGTATCCGCCAGATCGAGGCCAAGGCGCTGCGCAAGCTGCGCCATCCCAGCCGTTCCGAAGTCCTGCGCAGCTTCCTGGACGACTAATCGCGTCCTCTGCTGTTGTCTTTCCCCCCGGCCATCATCGCCGGGGGTTTTCTTTTAGGGCCGAACCACACGCAACCGTGGCACGCTCACCTAACGCAATGCCTGGTATAACGCCTGATAGTGAGCGACTAATTCAGCCAACGTTGCCCGATTCAGGCCGCTCGGATTGGGTAACAGCCAGACCTCACTCTCGCCGATGCGTTGTGCCTGTCGTCCCCAACTAGGGCGTCGCACGCCGAACGCCTGGGTGAAAGCCTGTTTGCCTAGTATCGCTAACGCCTGCGGCTGATACTGTTGTATCTTCGCCCGCAACGCCTCACCTCCAGCATACAGCTCGGCTGCCGTCAGCTCCGCCGCCGTCACCGTCGGACGTTGCACCAACGCCGTGATTCCGCAGCCAGTCTCCAACAGATGACGCTCCTCCGACGGCGCCAGTTGGCGCGCGGTGAAGCC
CGCCAGATGGATTACCTGCCAGAAGCGATTATGCGGATTGGCGAAATGGTATCCCGTATGGGCCGAGCTTAGACCCGGGTTAATGCCACAAAACACCACCCGCAGCCCAGGAGCTAGAATATCTACGCTCATCACTGCACCTTACAATTATCCGAATGACACAAATCTTATCGTAATTACAAGGGGCTAGCCATGTTCAAAAAAACGGCTAGCGCGGTCTAGGCACTGGATCTCGCCCCGTGATTTACTCTATAATCCCCGCCACTCGGCCCCTTAGCTCAGTGGTTAGAGCAGGCGACTCATAATCGCTTGGTCGTTGGTTCAAACCCAACAGGGGCCACCAAATTTTAGCTTTAAAATCATATAATTAAGCCACTCGAAAGAGTGGCTTTTTTGTTATCTCGTTTTTAAGTGGCGATAAAATGGCGGTGGATTTTTCACAGCCATATTTATGGCGATAAAAAACTGCTTTCGCGGGCTCTTCTCACATCCACAAAGGTTGCTGTCCACTCCGCTCCGGGTGCGGCACCGCCGGATTCACTTTACCCGGCAAAACAATGGAAGCCTGAAATGTCTCCATTGTCTTAAATGTATGCGAGCAATTGACATTCTGGCACTGGTGATAGCGCTCTTTGGTATTCTCACTTAAATAGCGACTTGATCTAGCATGTGCCGCATAACCACAGATTGGACAGTGAAACATATTCCCCCCAGACCATGTATCTATGAGAAATATTACTCCATTATTCACATAATGTGAAAACAATTCACCAAAAGCTAATCATCTTCAACGTACTCTACATCAGATAATTTAACCTCCAGCTCTAGCGATGTTGTATAGCCAGCATCGCTGATGCTGTTCACCACTTTTGTGATTGTCCATCCCTGTTCATCGATAATGGATTTAAACCCCGATACGCGAGCCGGTGCATCTGGGTAAAGGTCCGGCCTTCCCATTGCTAGGGTTATAGAGAACTCGGCCACGCCGCGCTGTAGTTTGTCCCACTTTGCCTGAGCTGCTCGCTGGGCTTGAGCCTTGGTTGAGTAAACCGTCGTCAGCGCCAAAACGTTATCAGGATCACCGGCCATATACTCCCCCTCGCGAGCCTCCTTTTCTTTCTTCTTTTTCTTGGCCGCTGGCTTCGCTTTTGGGTGCTCTAACGCGCGCAGATGCTTCTCTTTTGGTTTACGTTTGAGCTTCACCTTCTTCTCTTGTTCTTTAGGGTCACTCGTATGTAGCCACCGCGCCGTTACACCCGTGTATGCCGCTCTATCAGCAATCGCAAATTGATGCCGGTCACCATCAGTTCGCTCAATAACCACGGAGGGGATCGGTTTTCCGCTGGCCGTCATATTGGATCCCGCTTTCAAAAATAAAAGCTTGCCCGCCTTAACTGATACCTCGGCGCCATTTCTCTCCGCTAGGCGTGCCAGAAAGCAGGCATCTGACTCCTGTGACTGGTCTATATGAGGTATTTTTATCCCCCGTAACCCTTCCGCCACTGCGGCCTGCAGCTTGTTACGCTGTGCGACAGTCGTGACAATATCGCCGAGCGTCGTATCGTGATACGACATCTCGCGCCGACTGTTCAACGAGCCCCTGAAATCAGCACTACGCGCTCGAATAGTGAGCGTATCCGGCGCCCCTCTATGCTCGATTTCATCGACAGTAAAATCCCCCTTGCCGATCAACCCGTGCCCACTCCAGCCGATAAACAGTGTCAGTACCGCACCGCGCAGCGGTAGCTCGATTAGCCCATCACTATCATCCAGCTCGATATCTACCCTGTCAGCCTCAAACCCTCGATTATCCGTATGCACCAAGCTGATCAGGCGATCGGCAATATTGCGGGTGATATCCTTTCCTCCCAACGTCAACATGATTGCCGGTGCCAAGCTGGCCCCCGCATCAAGCGTTAATCCGGTAAGCATCAAAAAAGCCCCTCAATGTTTGCCACCATCTTATTGGCCGCCGCTGCCGCATTCCCAATCATGCCCTC
TGCCTGTTTTTTCAGATCGCCAAACATCGTCACAAATGACTCATCCACTCGAGTCAGCGTCAGAGAAAACTCAATACGCCGCGGAGCCCCATCGGCAAAAAAAACGGTTCTGGTTTCATGGATCGAGGAGGCAACAAACATTCCAAAAATCGTGCCATCACCACCGATAAGTGGCCACGCCTTACCCTCTGCCGCCATGGCATCTAACGTCAGCATGGAAAGGCGCCCGCCGGTTATCTCCGGCAACAAGACGCCTGATAATGTGATCCGCTCTTCATTGACTCCAAGAAACTGTATCGCCGGCCGCTTACCGACGCGGCTATTACTTGGATAACGGTAATCAACATCACGCTGTAGACTTTGATATGGCGTGGTCTGGAGCTGAAACACAAACATCCCCAACGTCAGCATCATCGCGTGCTCTCCTAATAATCGATATTCATATTGGAGCGCAGACGCGCCCGGCGTTCCCGCTGCTCTGCCTGTAATCCCTCTCTAATCTGGCGTTGCACATCGATCGGGCTGCTGGCACCGTTAACCGGAATCTCGTAGTGGTGTTGGCTCTGGTCAATATAACTGCTACCCGTTGGAGCACTTACTGGTTGATATCCACCGGTTAAAATGCCACCTGTCGGTGAGTAGCCTCGACCGTTCGCACCGGTGGCATAGACATTGGCCTTTTCCGCGGCCTTATCCAGATCACTGGATTTGCTATCGATAACCCCCAATTTTTCCAGCACCCAATCGATCCCACTTCGTAGCTTATCGATCGCATGAATCGGCGCCAGGATGGCATCTGCCAGCGATTGCCCGAATAACTGCCCCGCATTCCGACATATGTCTAGCGCTTCTTGGCTAGCTTTAACCGGAGCAATTAGATCACTAAACCATGTCCATACGGCTTTAAGCTTCTCTCCCAACATGTCGAACATTGGAGTTAGCGGTGAGAAAATTTCCGCTACTGGTGCAAATGCTTCTCGCATCCCGTCAATTACACCGCCGAAAAAGGCGCTAATCGGCTCCCAGTATTTACGTACCAGTAGCGCACCAGCTACGATAGCCGCTCCCAGCGCCACCATCGGTAATGATATGGCGGCCAGCGCTGTTCCAATGGCTCCGCCCGCAACGCTAAAGGCTACACTCAGCGCCCCCGCAGCGGCGATGATGGCGTTAATACCAGCAATCACCGGCCAGGCCACCAACCCAATCGCGCCCACCAAGGCGATAACGCCTAATGCCGCGCCACCAATAGCGAGAATTGAGCGTGCCATCCCTTGGTTTTTCTGGATCCACGCGTCGAGGCGCAACACATATTTTGTGGTTGTTTGGAGCAACTTTCGCAACGAGCTTTCCTGCTGGTCAAACAGGTCCGTCCCTACTGCCTCATAGGCTGACTGAAACTCCTTAAAGTCCCCACCAAGGTTATCCTGCATAACTGCGACTAGCTGCGCCGTCTTCCCATCCGAGGCTTTCAGAGTTGCCGCTAGTTGATCAAGCTTTCCGCTGGTCGCATCTGTAAGCAATACGTTTGCCGCAGAGCTGGCCTCTTCGCCGAATATCACCTTCATGTATTCGGCCCGCTGTCCTGACCCAAGTTTATGGCGCGTAAAGCTAGCCTGAATTTCCCTAAGAATGGTAAAAATCGGACGAGTATTCCCTTTACTGTCCATCGTCTTAACGCCCAGTTCTTTTAAGGCGGCAAACGCTTGTCCCACCGGCGCCTGCAAGCGACTCAGCACTGCGCGGCTCCCGGTACCTGCCATTGACCCGGTTATCTTGGCATCGTGCAGCGCACCCACCATAGCGGCAGTCTCCTCGATACTGACCCCCGCGTTTTTAGCCACAGGTGCGACATAGGTCAAAGCATCGCTAAGCTCTTCAAAATTTACCGCCGTTTTGTTCATAACGGTCGATAGCACATCGCCAATATGTGCAACCTTATCGTTGGCGAGCTGAAACGCAGACCGCATTCCCATCAACAGCGCGGCGTTTTCCTCCAT
GGTGCGCCGGTTTGCTAGCGCCATCTCCAGCGTTACCGGCGTCGCAGCCTGAATAGCGGCAGCATCCCCACCCGCTTTAGCGATGATGATTTGTGCCCCGGCGGCATCGTCAGCGGATGCGGCAGTGTTATCTCCTAACTGGCGCGCCTGTTTGCGTAGTGCCTGCATCTCTGGCGATTGTTTGCCAACCCCGAGCACAGCCTGTAGCTCTGAGTTTTTCTGCGCAAAGTCATACCCCGGTTTCAAAATGGCGACGCCCGCCATGGTGCCGGACGTAGCAATACCAACCCCGGCGGCGCCGACTGCACCAGCCGTTCCGGCAATTTCCTTTCCTGCCTGATACCGTCGCTTTACCGCATTCAGACGCGCCTGCTGCGCACTGACGCGCGCCAGTGCATCACGCTGACGGTTAAGTTGTAGCGTTGTCTCGCTGATGGAGTTTTTCAGCCGGCGCTCATCAGCGGCGAGGGTGCGAGTATTGATCCCCGTCTTAGCCAACTCTTGCCCCTGGCGCTGTACCGCCATACGCAGACTATTCTCCTTGGCCTGGAGATCGGCGGCCGCTCGCTTGGCCGCCTCCAATACCTTGGCATGCGCGCGTGTAGGCCGCTCTGTGGCCTTGAATTGCTCCTCCAACGCATTGGCCTCATCCTTGGCCTTTTTCAGCGCCTGGCTGGTTACAGCAAGCTGGGCGCTGGTTTTACGAAATCCCTCAATTTTCCCCGCCTGATTATTCAGCTCGCGGAGTGTTTGCTGCGTGCCGCGCAGATCCGCCGACAGCGATTTCCCCGCGCCCTGAATGTGCTTAAACGGGCGCGTGGCCTGGTCAATGGCCTTGAGCAACACCTCAATCTTTACGTTATTACTCATCGGTGGTTTGTCCGCTTCGGGTGAGCGCCTTACCGCGCCAGAAGATAAGCTCGGCCAGGCTCATGGGATACAGTTCTGATGGCGGCCAGTGAAAAATCACTGCGATATCCGCCATCAGATCATCGACCGACATGTCGTGCGGAATGTCTAGCGTCCCCAGCTCTAGCCGAAAAAATTAATCACCTTGGCCGTGATGGCGGTCATATCAGGCAGCGACAGGGAGGCCACCTCCTGTTCGGTCAGGCTGGGCATGGTCATGCGAGGTAGCAACTTGATCAGCGCATCAACCTCAGCAGAGGCGACCGCAGCCAGTGACAGCCCGCGCAGGTGTCCGGTATTCGGGGTGATGAGGGTGACCTGTTCGATGGTCTGCCCGGCACGTTTCACCGGGTGTTTCAGGGTGATGGTGTTCGGTGCTTCTTGCGCCAGCTCGGTCACGGTTTTCTCTTTTGCCATGGTGATTCTCTCTTATTCGTCACGGATAAAATGGCGGCCAGTTATCCCGACCGCGGGTGATTACAGGCCGATATTGCGGCGGTGCTGCTCCAGACGGTCAACGCTGCCGACCTTTTCCACCATGTTGATGGTGTCGATCTCGATCATGTCCTTGCCGTCGATGGTCAGCTTGTAATAGGTGCATTGGGTGCTGATTTTTACCTCGGTATCTTCACCCTGCTTGGCGTCGCCGGCGTCGATCTCCTTATGGCGGCCGCGCATGACCACCTCGACCGCTACAATGTCGCCGGTGTCGTCACGCTGATAAGAGCCAGCAAAGCGCAGCGGGACCGCGCTGGCACTCGGGGCGGCATACTGTGACCATAGGGTTTCATCCGGTAATCCGCCCATTGTCCACTCGACAACCAGCGCATCATCGTCCAGCCCCAAATCGACCGGCGCCGCGCCATTCATACCGCCGCCGCGGTAGTTCTCCAGCTTGCGGGTCAGTTTCGGCAGCGTGACCGACTTCGCCACCCCCATATAGCTCAGGCCGTCATTGAACAGGTTCAGGTATTTCAGTTTGCGGGGTAATGCCATGGTGCAGGCTCCTTAGCTGTTGACCGACTGCGCCAGATTCACCAGGTATTTATCGGTGATGCGTTGGCGTAGGGTCAGGTTTTCCAGCGGCGGCA
CCGGCGTATAGTCGTAATCGATATACAGCTTGCCAGCCTTGAGTGTTTCTTTGCTGTTGGCCGCCTCATCGAACCAGGCGTCGCCGTCGATGATGTAGCCATTGGATTTCAGCTCGCGGAACTTCGCACGGATCCCCTCAACGATGTCTTTAATCAGCGTCGGGGTAACCGGTTTATCTACCGCCCACATGTGCGCCTCTGCCATGGTGTCGGCCAGTACCTGGGCGGTGCGGGTGTAGTTCTCAAACAGGAACAGCGGATCATCCGAGCAGCAACGGTTACCCCAGAAGCGGAAACCATCCTTGCGCACCAGCGTGGTAACGCCAGCCTCATTCAGCAGATCGGCATCCGTTCCCGGCGCCTGCAAATCCCAATACACCGATGCGCTGATGCCGGTGACGCCATTTACCCCGACGTTGGACAGCGTTTTATGCCAGCCGATCGTTTGGTCGATATGCGCACGCAGCCCCAGCGCCCGCGCCGTCGCCCAGGCGGTGGTCGCCGCATTGGTCTTGGTGTCCCAGGCGAGAAAATCGGGCCAGATAACCATCAGTTCGCGCTGGCTGAAGTTTTTGCGGTAGGCGATCGCCTCGGAAATACTCTTGCAGTCCCAGGCACTGACATAACCAAAGGCGCGTAGCTTCTGGCAGATGGCCCCCAACGCCGTGGCGACAGCCTGCGTATCAAGCCCCGGCACGCCGAGAATACGCGGCTTGACCCCGGTCACCACCGCCGCATCCAACAGCGCTTTCATGCCGGTGTATTTCCCGTTTTCATCCGTCGTACCGATGACATTACTGACGGTGGCCGCCAGCTTCTCCTCTTCGGTGCTGCCTTCACCGTCAGCGACACGCACCACCACCGTGACCGGCTTGGACTGGTCAGCGATAGCCTGCAACGCCGCGGCGAGAGTTCCCTTTGTCCCCGCCTTGCCGACGGCGCCCTGCACATCGGTGATCAGCACCGGCACATTAAGTGGAAATGCGGTGGCGTCCGCATCACTGGCCGTACAGACCATGCCGATAATGGCCGTCGAGACCGTTGAAATGACGCGGGTGCCGTCGTTAATTTCGAGCACCTGCACGCCATGGTGAAAATCACTCATCGGATTTACTCCATCGTGGATAGGTGCGGTTATTTTCTGATGCGGGTAGGCGCTGGGCGAGTGATAGCGGATGGATGAGCGGTGGCACAACGGGAAGTGAAAGCGCCCCGAATGGGGCGCCTGGTGTATCAGCATTTTTTAGGCGGTGGCAGTACTCTATCCGTTACGGTTGCCGCGCAGGGGATATAACCCGGCCAGCCTTGACGGCGCCGCTGGCATCCTTTGCAGTGGCACGGCCCGATTATCTTTTTCATGCTGGTTTAGACGGCCAAGTAATATCTGGTGCCAAACTAATATTAACGTTATCGATCTGCTGAGCATAGCGCATCCATTCCGTCAGCGTTTCCTTATCCTTTTCGGTGATAATGCCCAGAGCCAGCTGGGTTTGCCACAGTTTGGTTACAGTTTCCACCTCTGATAGCAATGCCACTTTTTCCTGCTCAGCCGCCTTTATCAAATCGTCTTTTGTCGGCGGTGGGTTAATAATCTCCATGGCTTCACTCTCTGATATTGGCTGCATACCTTCCTCAATACAACTATCTTGGCTACCATCAGATGCAAATGCATACACCACACCTTGGTTATCTTTAAAGTATTTCATCAAAATAACTCCGACCACACTGATAATGATGGAGCACCACCAGCAGAGGCTTTTATGACATATCTGCCGCCTGGAGGCACGATTGCACTTACCCCTCCAGGAATCCACCTATCACCTAGAGATGCCGTTGATGATGCAATAACACCATCAATCTCAATAACTGCGGTACCGCCGTTTGGCGGGCGAACCTCAGCAAAGACAACGATCGGCTTTTGATAAGCATTCGTATAAACGACCCCGATCCTCCGCTCGGAGGTTACACTATGCCATGTTTGATTTATTCCTACAGGATAG
GAAAATCGCACATCATCACCCGCCGCAACAGTACCAGAGACAGAGCCAATATTTTTTGATGCTGAGTCACCTAATCCCAGCTCCTCAGGCGACGGCTTGTTTATGCTGCTGTATTGAACTACCCATGGATTGTATCTTTCCGGCCCGTGCCATGTTTGGCGCGTAGCTATATGTCCGGGATGGCTGATATACAATTGCTCAATGGCAGTGCCACTTTTCGTAACAATCATAAAGCCATAGCCATAAATTGGTTTGTTATTTACTCTAGGTATATCGGCGACAGAGCTCGTATCATCAATAGCAATGCTGTAAATCCCCGGGTTAATACAACTCCCAAATGTTCCGCCATTTCTCACTGAACCAACAGTATTTTTTGACCAGGCATTGGAGGCCAAACTGATTGTTTCTCGCAAGCCAAGATATTGAACAATCCCTGCGATATCTTTACCACTAAGGGCTGTTAATGTGCTATCGAGCGGTTGTTTCCCTGCTAACTGGTTAATAATTTCGTTAGCAAAGTTGGGATTATTATTCAGTGCAGCCGCAATTTTCTGTAGCGTATCCAATACTCCCGGCGCTGAGCCGACGAGCGCCGCGATTTTTGCCGCGACAAATTCTGCGTTGGCCATTTCCAATCCGCGCGCACTCAACGTGGGAGTCGGTGTAGTCGGGGTCCCGGTAAATGATGGGCTATTGAGTGGTGCTTTAGTTACCGCTAAATCCCGTACTTTTTTAACGGAAGCCGGCGTGGCCGCCTTGTTCTCTGCGGCACTATTATCGTCACTGCTCAGTTGCACAAAGCCCTTGGCCGATAACGTTGCGTCTGGGTGATTACGTGATCGCGCATGCGCCTCCAACTGGCTATCGATATAGTTGCGCACCTCAATGACGTGATCATCGACATACTGGCGCGTGACCAGCACGACAGCCGGATCAATTTTTATTGTCACGGCGTCAGTACTGGATACCGTCAATACCATTCGAATGGTCTGGGTGCGGCCACTGCCCTCCTGCAATAGAGGCTTATAGGTCTCCGGGCAGTTAGCGACGGCAATCAGGACGCCATCGCTGTCATAGAGCCCAATTTCTCTAACCCACCATCCCCCCTCGCTTTCCGGAATGACCTGTTCGGCGATGATTGCACTGGTATTGTTAGGGTCAACGCTCAACATATTCAGCGCGGCACGGCGGCGCTCGCCTCGTAGCGCGGTTTGCGCCGGGTTTGGAGTTTGCAACGTCCCCCCGCCATCCCCCACCGCCATATGGGTCAAGTTAAGCTTCGTCCCGAGCGCCGTCGCGTTAGCCATTTTCGCGGCGCCCAGATTAGTCAGAATGGCAAAATATTTTGCGCTCATGCGGTTACTCTCAGGTTGTCAATCAGGTGAATGGCCGAGGCCGGATAGTGATTACCACCGACCTCGATCAGCTCCGGTGTGTAGGGATACACGGTTAGTGCATCCCCGTCGTAGCAGGAGGCGCCCACATATCCCCGCCCGCTACTCGTCAAACTGATGGTAAGGCCGGTCAGGTGCCGACTAGCCGGTTTAGCATCGGCAACCAGGCGCTCAAGCTCCCGAAACATCTCCTCGGTGATCCCGGTCTCCAGCACACCAATGACAAGGCGGAATGTGCCCGGCGTTTCGTTGAGCTGGAACCATTCGCGCACCTCAATCAGATAGCCAAGCGGCTCCACTACACGCCGGATAGCGCCAATCGTCCCCTTGTGACGGTGAACAAAGTGCGCCACCGCCACCACGTTGCGCTTGGTTTCCTCCGGCCAGTGCTCATTCCACCGATCAACCGAGAACGCCCAGGCCAAATAGGGCAGCAAATTAACGGGGCAGGTGCGCCAGTTCCACAGGGTACGCAATGGCACCGGCACGCGCGCCAGCTCGGCACAGGCTTTGGCAGCGGCCAACTCCAGCACCGATGACCCCGTCGGCAACAGGCGATCGTTATTCATCCGTCCCCCCAATGGTCAGCTGATAGGCGCTGCAATA
GGAGGCCTGGCTCTTATCCAGCACGATATCAGCCGTTGGACTGGCCAACGCTACACGCTGCACCCCCTCAACATGCAGCGCGGCGTAAATTGCCGACAGGCGAATATCCCGACCGAGGCGGTGCTGGGCGTTAATGTAAGCCTTGAGCTTGGCCTCTGCCGCCTGGCGGATGGGCTCCGCTTCCGGCCCCGGATAGAGGTATAGGGTGGCGTTAATCTGGTAGGGCACGATAGTGGCGGACTTTACGGTGACGCGATCGGCGATGGGTCGCACATCCTCATCGTTTAGCGCCTTGGCTACGATAGCGAGTAGCTCCTCACTCGCCGTACCATTCCCCTCGCGTGACAGCACCGCAATCGTGACACAGGCCGGCGCCGGGCTATCGACCGAGACATCGGCGACCCGGCCGTCGGCGCTGCGGCCATGATATTCATAGGCGCCAACCGGCCCCGCCACGCTTAACCCCTCAAATGCCTGTTGCGCCCGCAGGCGCAAATCATCGTCAGACTCCATCACCGCCGGTGTCGGCGGTAGGGTCGTCTCGTCGGCAGGCACAATCGTCAGGCGCTTAGTGTTTAAATTTGCCGCCAACATATCCATATCGCTACCGGAGGAATAGGCCAACATAACCGCCCGCGCCGCCTCGTTTACCCGCTGGCGCAGCAGCAATTCACGATAGGCATTTTCCTCCAGCAGCTTGACGATCGGCTCTGACTCCAGCGTCAAGGTATGTGCGATGGCATCACGTTGCTCTTCGGGATACAGCGAAATCAGCGTCGTCTTGCGCTCCGCCAGCAGGCTTTCATAGTCCAGCTCCTCGATAACATCAGGCGCAGGCAGTTGGCTCAGGTCAATGATCGGCATCGTATCAACTCAATGGCACGGTTAATGAGAAAGCGCCGCCCCCGGTCAGGGCGCTGACGCCAGTGATATCGACAAACAACCCGCCGGGGGTCGATTGTTCAAAACGGATAGCTGTCAGGCGTACACGCGGCTCCCAACGCAAGATGGCCATGTAGCAGGCGGCCATAATTTGCAGGCGTAATGCCGGGCTTTGCGGTTGGTCAATTAGCGCCGACAACAAGGAGCCATACTCCCGGCGCATGATGCGTGAGCCGATCGGCGTGATGAGGATGTCGCGCACGCTCTGTCTGATATGCGCGGCATCCTCCAGTGAAAGTCCCGACGCCTGGCTCATGCCGACATAACGCGCAGTCATTGTGTCCCCTCCGTCCAACTTACCCCCGATTGCACACCGCCATGACTGTGTTTATCCACCCGCACCCCGTTGGAGGTAAATGCGCCGCCGCTATGTTCAATATCGCCGCGCATCGTCCCGCCTTTTTGCACCTCCAGCGTTCCGGTTGTCAGTTTGTTGGTACACACCACCTCGGGCGTATCCAGCAAAATGCGGGTCTCCGCTTTCACGGTGACCAGCGGCACGGTGGCCGTAATCGATGTGGATGCCGTGATGTCGGCTGTCTGGATCCCGCTGGCCGTTAGCGCACCGGTTTTGGGCTCGTACTCGATGACCGCACCATCGGGAAAATCAACACGCCATGCCTGTTCAGAGTCCGATGGCGCCGGGTGTTCATCGGAGTAAATGGCCGGCAGCACAAAGGCGGTATCCAGTTCACCCCCTACGGCCAGCAGGATCACCTGCTCACCGACGGAGGGCGCCCACCAGGTGCGCGCGCTTCCGGCGCGCAGCGTCAACCACTGCAACCAGTCTGTTTTGACGCCCCCGGTTTGTACCCGGCAACGCCCCGACTGGGTGTTGACCTCGACGATCACCCCGGTGCGGATCAGGTTGCGCAGTAAGCGCAGTATCTCTGTGATGTTGGCTTGTAGTTTCATGATGAAAGAATGCCGCTGGCAGGGCGTCAGCGGCAATTTGTTGGTGATGGGTGAACGGTGGGACAACAGCTAGTTTTTCAGATGGGAGAGGAGCACCTCTTCAACGATTTTTTTGTCCTCGTCGGTAAAGCCCAGCAATGGGC
GAGCCTCATACTGCACTTCCGCACTGTGGCGGTTTGGCTTGTCTTTCAGACCGTACTGATGGACCTGCGCAATGCGCTGGACCCGTCCGGCAAACGTCACCGTGGCCGCATCAGGATCCGCCGTTGCCAGCATGTAGCGATTGGTGCGCAGCTTGGCGAACATGGCTCGCCGTACCCGCCCACGTTTTCCCCTTACCTGTTGGGGTTTGCGCGCTGCGTAGGGCGTTCCGTCTGGCGCCTGCTGTGCCCTGATGCGACGCTGTTGGCTGGCTCGCAGTCGGCGGGCAATCTCCGCCGCAATCTTGCGCCGCTCAGTGGCGGTCAGGGTGATGATCAACCCCGCTAGGGTGTCGTTAAAGGGTGAGAACTCACTCATGCCACTGACTCACTAACTCGCCGTTGATGTAGAGCTCCGTCGGCCGCTCGACGGGCTCCGGTTCGGGCGGCTCGCCCACATTGCGCACATACAGACGGGCTCCCTCCTGTTGCACCAGGGTGCGCTCGGTCAGCTTTAGACTGATACTTAAATCCTGGCTGTCGCCGGTATTGATGTCGCTGATATAGGTGAATCCCTTCCCGTCGCCTGTCGTGAAGATATCGGGCTGATTCTCCCGCAGCCATGCCGCCAGCGGTACCAGAATCAGATTTATGTCGCCGCGATAGTCCAGGATCAGCACATTCAATGTGTACCGGTTCTCAAAGGAGAGCGATGGCGCCAGCGTGGCGGCGATCATCCCCTCATCGATAAAGATCCGCATCATATCCGGGTTTTCGGCCAGTACCGGCAACGCCTTATAGAGCGCCTGTTTCAGACTGTTCGGCTTTAGCACGCAGTTCCTCCTGACAGTGTTTAACGGTCTCCACTTGTAGCGCGCACTGCACCAGCGCGGCCTCTAACTGTCGATTATCGTCACTCAGATCGCCGTTGCTGGTCGGGTGGCTCCCCGGTATCCGGCAACTGCTCACGGCGGGACAGCCACTGTAAATAATCGTCGGGGTTATCGAAGGCGGGGCGGGTGTGCAGCCGGACAACCACATCAGGCAGAGCAGTCCGCCACCACGAACGCAGGGTCTCATTTTCATTGAGTAGCCTCGTGATGGTTTTATTTCGGTGGGTGGCGAGGGTGTTGGCGGCATCGAGCTGCTGGCGCAGGACAACCTGATCCCGCTCTTGCCTGCGGCTCACGGCGGAGGCCGTACTTAGCTGATTTTTCAGCATGGTAATCTGCGTTTTTTGCGCGGATGCCACTTTATTGGCCGTATCAAATCCCCGTCGTAGGTTGCTGTTTTCATGGCGCAGCCACAGCGCCGCCCCGAGCGCTAGCGACATGACAATCAGGAACAGGGTTAGGCGCCACGTCATGCCACCACCCCGCCAGCCTGGTGATATACGCTAACCAGCGTGTCGAGTGCGTGCTCTCGCTGACCGTAGCCCGCCCCCGGCAACGATGCCCAAATGCCCCGACACTTGGAGATAGCGCGCTCGATATCGCCGCGCTTGATATCTTCCAGCGCTCGACGCTCGCAGATCAGCTGTATCGCCAACTTATCCTGCGATGCCGGGCTGAAATCGGGCAGGTTGAGCTGCTTTTTATAGTGCGGCCAATACAGGTAAAGCTGCTGATAGCGTCCCGAGGCCGTCGACCGCTCCCCACGACGGTTAAATACCTTGGCGCGGCGACCATGTGCAAACGGGTGATCGCTGTAGTCGGTAAACACTTCCGGCCCCCCATCCATGCCCGTAACAATCACGTCGTACCCTCGGTTACGCGTCAGCGGATGGGTCGCCGTTCCCTCAGCAAAGGCGATCATATCGAGGAACGCCGCCACGTTCGGATGCAGATTAATGGCGCTCATCGCTGCCCCCTTTGGCCGCATTGCCCCCTTTACCAATCCGACGCTGGATCAGGATCTCCACTACCTGATAGCCAGCGATGCCCAACATGGCGCCGATACCATTCACCGCCGTCGGCGAAAGGTCGGGGAACTGCACCAGCGCC
ACCCCGGCGACCATCGAGACAAAGCCCCCCAAAAGACAGCGACCGATGAACAGCCGCGGGGATACCGGCTCGCCGCCGACCAGTACCTTACCGATGACGATCATCACGCCGATCACAAACAGCGAGAGCACACTCTTATCGGTTTCGTTCATGCGTTAATCCCATAAGTTAATGGTTTCAGATATTGGCGCCGCCTCAATTTCGGGTAACTCGACGACGGTACCGTGTGGCAGGATGGCGCCGAGCGAGGCCAGTCCCGGATTCGCCTCCAGCACCGCCTCGACGACCCCTTGTGTGCGGCCGTAATAGGTGGCGCACAGGGCATCAAGCGTTTCCCCCTGTTGTGCAATCACTTTCATAGCTGGCTCACAATGCAGCGGGGGCGCCCCTGCAACCGAGACACGGACCAGCGCATATCCCGCCACAGCTCATCGATCACCGCCTCGACACTCTCGGCCTTTTTCTCCCCCTTACCGCTGGCGTCATACCCGCGATAGCGCTCATACAGCGAGGCCGCGGTCATGGCGCTGACGGCACGTAGGTAGTGGAAACAGCACACACTCTCGCCATCGATGGTCTCTGCCGGCACGGCGTCCAGGGTCTTGAACCCCAGCAACATCTGCCGCTCGCGGAACTCGACCAACTCTGCGTTAACCTCAGCGATACCGGCACAGATGGCATCGCGCATACGGGCAGGGGTGACCGTGTATTCCAACCGCATCAGATCACGGATACGGCGCGGCTCAACGTCAGGAAAAAAGAAGGTGTTTTTAATTACCGGTTCACGCTGCGAGTCCGGTGGAATGATCACCGCATCACCGGCAGGCATCCCGCTTTGGTCAATAATCACTGTCGTCATGACAACCTCGAATGGGTGGGCGGTGGACGCAGGCCGACCGCGAGGTCATGCCTGCATTGGCCTGCGTGCCGCCCTGGCGCGTGGCGCATTCTGTTAACTGGCGGCTTTCACCTTGGGCGGGCGCCCGCGTCTGGCCGGTGTCCCGGTTACGCTGCGCGGACGCGCCGCCGTCTTTTTCTTGGCTGGCTTAGCTGGTTGTTTCACCGGCTTTGGCCGGATCGCGCGCTCCAGCGCCTGAATCTCCTTTTTCACCCCGGCGTTGTCGTCCAACTGCATTGCGCGCTGGAGCTGTGCCAAGGCGGCAGCGTTATCCCCCTGGTCTCGCAACAACAGGCCGACGGTTTTATGCAGGCGGGCGCGCACGTTGTCGGGCATATCCTGATCTACCGTCAGCGCGATGGCTCGCTCCAGCGTATCGATCGACAAAGCGGCACCGGCGGCGCGTAGACGTGCGGCAGCAGCGGCGGTTTCTTCCACCACGACGCACCCGGTCGTGCGGCTATGCTTTTCCGGCATTCTCAGGTTATGGCGTAAGGCATACTCCGCGATATCCAAGGCGCCGGTGATGTCATCCACATCAAACCGCCACAGCATGACGGTCATCACGATGTCATCCTGCGCACCACGGCCAGCGGCCAACACACCGCTGATCCACGGCCCATAAAACGGCAGCATGTCGCGCTTATGCAGCGCCTTGGTCTCAACGGATTGGATACCTTTCAGCTTGCGGCGATCAGCGGCCAGTTTGACGAGCATCTGCTCATACTCGCTGGCATGGCGCAGCGGGGTGTTTTCCCGCTGCGCAGCCAACGTGGCCGAGACCCGCATTACATGACACTCTGCGGGACTCAACATGGCTTAGCCCCCTGCGTTCTCAACGGGTGCGCCTTGTTCCGCGCTCAGTGATGCCACCGCAGGCTTAGCGGGGAAGGTGCCAAACTGGATGTTTTCGATCATCGCGGCACAGCCGTAATCCTCTATCACATAGTCAATTTTCAGTGACTCGTAGTTTTCGACCCGATCGAACTTGGCGTTCTCGTCGATGTGGCGGCGGTGGCTGCTGTCCATCACGTAGATGGAGAGGTTATCCAGTCGGGTGACCATCAGCGCGTTGTCCGGGAAATAGGGGACACGGACCGCT
GGCAGGTTACCGATGCGCTTCTGGCTGACGATAACGTCTGCCGCCATCGCCTCGCTGTTACCCTGCTTCTTATTCACCAGCGGGAAATACTTGTCGGCCAGCAGCTTGCGACCGCAAATCACGACTAGCTCCGGGTCTTCGCTATGCCACGGATCCAGCAGGGTATCGGTGGCATTCATGACGGCCGCATCGAGGTTCTCATAGTCACCATTGGCACCGATGCGCACCACGGCTGAGACCACCTCTCCCGAGGTGTTGGTGATTTTGTCCAGGACACGTTTCGGCGCCTCATTGCGCATCTTCTGCAACCAGCCGACAGCCACATCCTGCAACAGCGGGTTGGTCTTGCGGTTGGAGGTGGCGGCGCGCGACGTTCCGTTAAACCCGGCCATGATGTAATCCAGCCCCATGCGCTTGGCGATGGCGTCACGCAAACGCAGCTGGAAGTCTTGATAGCGGGCCCACAGGTCGAGGGTGTTATAACGGATATGGAAGTCGAAGTTGACCTGTTCACAGCGGTATTTGCGGGAGGCCAGCGCCATAAAGTCCGCGGTCGCGCGGCCCGTGCCACCGTCCGTATCGGCGGTACTGGCGATAGAGCCGGTCACGCCCAAGCCGATTTTTTCCCCCTCCTGCTCATCGACCGGCACCATGTTGATGCGGGTCAGAAACTCGGAGGTGTCCTGCACCGAGGTGATCAGCGTCTGCGTGACGGAGGGCTCAACGGCGAATTTATTTTTCAAATCGTCGACGGCCACGCCATTCAGCTCAGCGACACGGGTCAGGTACTGATTAAACTTGAAACGGGTATTCTTGCGCATAGTGTCTCTCTTGTTGTTTTACCCCAATAAAACCCACGACCAGGCGGGCGGCGGCATAGCCCCGATCAGCAGTTGGTTAACAGGGTGTCTTCACCATTGCCACCGGTAGCCGGTTGGCGACGTGACTGGATTGGGTTTTCGGTGTTGTCCAGGGCGGCGCGCAGCGCAGCAACCTGTTGGCGTTCCTCGTCCAACTCACCGCGCAGGGTTTTGGTAACCTCCTCCAACGCCTGAAAACGTTGCTCGGTGCTATCGCTGTTGCTTTGCACCTGTTCGGCAATGGTGGTGACGGCCTCATGCACATCGGCAAAGCGATCGTCATCATGCCGGGCTTTACGCGAGAACAGGCCGCGCACCTTGTCAGCCAAGGAATCGAGGACATGGCTCTGCTCGATAAACTCCAGTTCGGCGAGAGAGGCCACGGAGAACAGGTCGCCTGGCTCCTCCTTGCGATTGGCCAGCGGGTTGGCGGTGGCATTGGCGCAAAATTCCAGGTATTCCGTGCCGAGACTGGCCGGGTCATCCGTTACCGCTAGGCCGACCAGGTAGGCTTTGCCGGTATTGGCAAAGTTGGGGCGGATCTCCATGGAGGTGTAGACCTTCTGCCCATCACGCACCCACTCGACCAGCGTATCGGTCGGGGCAATCTTGCCGTAGAGCGCCCACTTGCCATTCAGCGCGCTGTCATCGTCGATTTTCTCGGCCTTAAGCTCCACCACATCCCCCAGGCGCTTAAAGTCACCGGTCGGCAACAGGCCGCGGATGTGCTCCAGATTGATGCGGCAGCCATAGACACGCGGATCAAAGGTCTCGGCCATCTGTTGGATGTCATTGGCATCGATAACGCGGCCATCGCAGGTGTCACCCTCGACACCGATGCGGAACCACTTAGAAATTTTTTTTGCCATCGTCAGTCCTGCGTTGGTAGTTGAATGTCGGGGTTAGTTTCCCGACTCACCCCCGTAGCCGCCAGCGGTGACGGATGGGTTAGCGCTGGCACAACAGGCACTTAAGGCAGATGCAGCGGCGCTTCCGTAGCCTTTCCCTCGTAGAATGAGCGAGGGATGACATGACCATCACCACAGACACATCACTATTGCGCGATCCGCGCCGACAGGCGGCGCTGCTGTATTGGCAGGGGTTTTCCATCGCGCAGATCTCCGAAATGCTGCAACAGAA
ACGCCCGACCGTGCAGAGCTGGAAGCAGCGCGACGCCTGGGAGGACACGGCGCCGATCACCCGCATCGAAAACAGCATCGAGGCACGCTTGGTGCAACTGGTTCTCAAGGACAAAAAAGAGGGAGCGGACTACAAGGAAATTGACCTGCTCGGGCGCCAGATCGAACGCCTGGCGCGGGTGAACCGCTACAGCCAGACCGGCAACGAGGCCGACCTCAATCCCAACGTGGCCAATCGCAACAAGGGGGAGCGCAAGAAACCGAAAAAGAACTTTTTCAGCGACGAGGCGATCGAAAAACTGGAGGAAGTATTTTTCGACCAATCTTTCGAATACCAGTTGCAGTGGTATCGCGCCGGGCTGGCTCACCGTATCCGCGACATCCTCAAATCGCGCCAGATCGGCGCGACGTTCTACTTCTCCCGTGAGGCGTTACTGCGTGCCCTAAAAACCGGCCATAACCAGATTTTTTTGTCGGCCAGTAAAACGCAGGCTTACGTCTTCCGTGAATACATCATTCAGTTCGCCCGGTTGGTCGATGTCGACCTGACCGGTGACCCGATTGTCATCGGCAATAACGGGGCAAAGCTGATTTTCCTTGGCACCAACTCCAACACGGCGCAGAGCCATAACGGCGACCTGTATGTCGATGAAATTTTCTGGATCCCCAACTTCCAGAAGCTACGCAAGGTTGCCTCCGGCATGGCCTCACAGAAGCACTTGCGATCGACCTACTTTTCGACGCCCTCCACGTTGGCACACGAGGCGCACTCGTTCTGGTCTGGCGACCTGTTCAACAAGGGACGCGCCAGCGCCGCCGATCGCATTGAAATCGACATCAGCCACAGCGCCCTGGCCGGTGGCCTGCTGTGCGCAGATGGGCAGTGGCGGCAAATCGTCACCATCGAGGATGCCCTGGCCGGTGGCTGCACCTTGTTCGACCTCGATCAGCTCAAGCGTGAAAACAGCGCCGACGACTTTAAAAACCTGTTTATGTGCGAGTTCGTCGACGACAAGGCCAGCGTGTTCCCGTTCGAGGAGTTACAGCGCTGTATGGTCGACGCCCTGGAGGCGTGGACGGACGTTAACCCCTATGCCGACCATCCCTTTGATCGCCCGGTCTGGATTGGGTACGACCCCTCACATACGGGCGACAGTGCGGGCTGTGTGGTACTGGCGCCGCCTGCGGTACCGGGTGGTAAATTCCGCATGCTGGAGCGCCACCAGTGGAAAGGCATGGATTTTTCTACCCAGGCCGAAGCCATCCGGGCGCTGACGGAGAAATACCGCGTCGATTACATCGGCATCGATGCCACGGGGATCGGGCAGGGGGTTTTCCAGTTGGTGCGCGAGTTCTACCCGGCGGCGCGCGAGCTGCGCTACAACCCGGAGGTAAAAACCGCCATGGTGCTGAAAGCCAAGGACACCATCGCTAACGGACGCCTGGAATATGACGTCAGCTACACCGATGTGACCGCCTCGTTTATGGCCATCCGCAAAACCATGACCGCCAGCGGTCGCAGCACCACCTATGACGCCAGCCGCAGTGATGAAGCCAGCCACGCCGATCTGGCCTGGGCGACTATGCACGCGCTGCTAAACGAACCCCTGACCGCCGGCAGCGGCCGCCCCTCATCGTCTATTTTGGATATCAACTGATGAAAAAACGCAAAAATCGTCCGACTAAAAAAATGGTTGCTGAATCCGCCGCGGCGGGCAACGGCGCGATGGCGTTTACCTTCGGTGAACCGACCGCCGTGCTCGATAAGCGCGATATCCTCGACTACGTGGAGTGCATCAGCAACGGCAAATGGTATGAACCGCCGGTTAGCTTCTCAGGGCTTGCCAAAAGCCTGCGCGCAGCAGTACATCACAGCTCGCCGATTTACGTGAAGCGCAATATTCTGACCAGTACCTACATCCCGCATCCGATGCTATCGCAGCAGGATTTCTCGCGTTTCGTGCTGGACTATCTGGTGTTTGGCAACGCCTTTTTTGAACT
GCGCCAAAGCGTCACCGGTAAAGCGCTACGCCTGGAGACGTCGCCGGCCAAATACACCCGGCGCGGGGTTGAGGAGGATGTTTACTGGTACGTGCAGTCGTTCACTCAGCCGCACCGCTTTGACCCCGGTGCGGTTTTCCACCTGATGGAGCCGGATATCAACCAAGAGCTTTACGGCATGCCGGAATACTTGTCCGCGCTTAACTCCGCCTGGCTGAATGAGTCGGCCACGCTGTTCCGCCGGAAGTATTATCAAAACGGGGCACATGCCGGGTACATCATGTACGTTACCGATCCGGCGCAAAATGCTACTGATGTGGAAGCGCTGCGCGATGCAATGCGCAGCTCGCGCGGCATTGGCAACTTTAAGAACCTGTTTTTTTATGCCCCGAACGGAAAAGGCGACGGGATAAAAATCATCCCGCTCAGTGAGGTGGCCACCAAGGATGATTTCTTTAACATCAAGAAAGTTAGCGCCGCCGATCTGCTCGACGCCCACCGTATCCCCTACCAACTGATGGGCGGAAAACCCGAAAACGTCGGCTCCGTTGGGGATGTCGAGAAAGTTGCCCGGGTGTTTGTCCGCAACGAGCTAACCCCGCTACAGGCGCGGATCGCCGAGATTAACCAGTGGCTGGGTGAGGAGGTGATCCGGTTCAAGAAGTACAGCCTAGACGATAGCGAGGAGTAATACCGATGCCGTCTGCGGGCGGCTTTTTTACATCACCACACAGAATCCCCTCAGACGCGCCACACGCAGCGCAACACCTATAACACCCAACGACACCAACAAAATCAACCAAGCGCACAGCGCCGCGCTGGCGCACTCCTGCGCGGCATATTTTACCGCGTTGCGCGCAATGCTATCCCCGCCTCGCCTGCCCGCTTCGTGGGTCGATTTTAATGCAGGTGCATTTACCCTGCTGTGCTTACTGGTTGCCCATCTTGGACTGTGAACAGACAACTTTTTTTTGCATGCATTTTGATGCAGAATTACTCATGCTGATTTAATTTTTAGTTACATTCTAGAGCTACACGCAGCCACTGCCCCCAACATGTCGGCAACAGTGGGAAAAGGGATAAATTATGGTTTCGTCAAACGTTAAAATCACAGCTTTTCCAGCCAAAAGATTTTTTGTGGAAATGCTAACTCGAGACATTGAGTTGTCTGACTCTATATTGGATTTATTAGATAACTGTTTGGACGGGGTATTACGGAAGAACAATTTCACACCCGAACAGACCTTTGGTAAATCAGACGTATACAATGGTTACCATGCGCACATTGAGTTTGACGAAAACTGCTTCAAGATAGTTGATAACTGTGGTGGCATTCCAGGAGAGCTTGCGGAGAACTATGCATTTAGATTAGGCAGACCTTCGGAAAGGGAAGCTGAAGATCTTCCTACCATTGGTGTATATGGTATTGGTATGAAACGCGCCATATTTAAGATGGGAACATCAGCTCAAATTAAAAGCAAGACAGATACGGAACAATTCTCAGTTAATATATCACCTGAGTGGATGACTGATGACAATAATTGGTCATTAGATCTTGAGAGAAGTGATGTAGATTTAAGTGAAACAGGTGTCAGTATTGCCATAAATGATCTAAGAAATGACATCAAGGCTTCATTATCAAAAGACCGCGATTTTGAAAGCGATTTAATAAACATTATTGCTAATCACTACAGCTTAATAATCAAGAAAGGTTTTGAGGTTAAGATTAATGGGAAGGTTGTAAAGCCAAATAGCACAACTCTTATTTTTGACGAGAGTTCTATTAAGGATAATACGGATGGCATAGCCCCATATATATATAAAAACGAATCTAATGGTGTTTCAATAAAGGTAGCTGTTGGTTTCTATCGTAATTTGCCAAGTGATGAGGAAGAAGAACAATTATTGTCAGGTCGTTCAACTACAGAAAAAGCAGGGTGGACTATCATCTGTAATGACCGTGTCGTTCTTCATGCCGATAAATCAAAATTGACTGGATG
GGGAGAGGCGGGTGTTCCTCAATATCACACCCAATTTATTGGTATTTCAGGTGTTGTGATATTTACATCCTCAAAAGCTGAACTATTACCTATAACGACAACTAAACGTGGTGTTGATGGTAATTCAGAACTATATCTTTCCACAAAAGACTTCATGAGAGAGGGACTCAAATACTTTACTGATTTTACCTATAAATGGAAAGCTAATAACGAAGAAAGAAAGCAGCTTATAAGCACTGCTTCGAATATGGTATCTACGACTGAAACTGATTTCGCAAAATCAATTCCACAGGAAAAATGGTCGACAGTACGGCGTTCCATCGGTGGTCAAGTTTTTAAACCAAAACTCCCGATGCCACGGGAAACCGATCCTTTACGTCAGATAAAGTTTAGCCGTCGACTTAGTGAAATAAAATTAGTTTCAGAATTCATCTTTGATGATGCGACTCAACCACCAACTGAAGTTGGTCAATATTGTTTTGATGAATATTTAAAAAAGGCTAAACAATGAGCACCGGTGGAAGTATCCCATATCATTTAAGGCAAAATAAGGCCATTGAAAGGAACCTGTTTATAGAGTCCTTGAGAAGATTGAATAATTACACCAATATATCCGAGTATGAATATATTGGTTTCGGAGGGCCATTCCTTGAAGATTTTAAACAGGTTCATAATCTTTTAAAAGTCAATAAGATGATATCTATTGAAGGTGATGAAAATGTATATCGCCGTCAACAGTTTAATAAACCACTATCATGCATAGATCTAGGGGAAGAACCTGAAATGAGTGGTGATTTCATCAATCGCTATAATTTTGATGAAAAAACCATAATTTGGCTAGATTATGCAATGCCTTCCGAACTCAATGCTCAGTTGAATGAAATTGTTAATTTAATTACTAAGTTAAAACCAAAAGATATTTTTAAAGTCACATTGAACGCACATCCTGAAACATTAGGCAAAGATCCGGGTGAGCGAGACCCAAGACCCTATAGATATAGAAAGATAAATGAAATTTTAACTGAAAGCTTTATGCCAGTTGACACTACCGAAGAAGATGTCGGATTTAAAAAATATCCTACATTACTGATTAATGCATTGAAACGAGCTGTCGGGAATGGTTTAAAAGGGCGTAACGATATAAGAATTCACCCTTTAACATCTTTCGTATATAAAGATGGACAACAAATGGTCACTTTAACGGCAATCGTTTTGGACAATACAGACGAAGAGGAAGCTAAATTTATTGATTCCTCGAGAATAAGAAATTGGCCATTTTATGCTGGGGAATGGCGAAAACCAAAAGATATTAACGTTCCTGCAATGTCCTTAAAAGAAAGGATTCATATTGAATCGTTATTGCCTGAAGCTACAGTTGAGAATATTCATGAAGAATTAGGCTTCTATATTGGATCCAGTACAGCTGGAGCCAATATAGACCTTAATAACTTTATCGAATACTACAAGGTAGTACCATGGTATTCGAAAGTTCTTTTTTAACACTTAAGCAATAATTATAGATAGGTAATAGCATAGCCTCAGAAACTAATGGACTAACGCTATTACCTATTTGTCTAAAACTATGCCATTTAGTAGGATGGAATCGAAACCAATCGGGAAAACCTTGCAACCTAGCAGCTTCCCGGGGTGCAATGACACGTGCTTGCGTCGGGTGAATTGGCCTGACAGCTTGATAACTTCCTTTGTCACTACCTGTTCCGGCCCTCAATGTAGGGCAAAAACCATTAGGATCAAGTCGCTGAGATCTTGAAATCTTATCTGTTTCACCAAAAGAAAGACTGCCATATCTCTTTATTATTTCGTCTGAGTGGACTGTGCCCAAGAATCCTGATACCAGACCATCCTCTAGTTTTTTTAAAGATTCGGCATCTCCAACTCTGTCAGGGATATGCCCCCAAAGCCTATCATAGAAATATCCTTTTCTATCCATTTTCACTTTTCGCCAACCTTGTGATTCTTCTTGCCATTCTTTTTTAATT
ATTCGCGGCAAGCCGTATAAGGCGTCTTTAATAAAAGTTTGCTCAATAATATTTTTTGGGAAGAAATCAGACTCCTTTAACTGGCTTGAATAATCCTTTCTGAAACCAATAAAGAATATACGAGTCCTAGTTGTCGGCGCACCATAATTTGAAGCGTTAACTTTAATCGGATGCAATAAAGTGTAGCGATCACTAACTAATGAAAATGCTTTTTCTCTTACAGAATTATATTTTTCATTCATAATTCCCGGAACATTTTCAGCCAAAAAACAAATTGGAGACAATTCACTCACTAGTCGAAAGAAATGTACATACAACTCATTTCTTGTATCATCAGCATTACCTTTACCTATAGAACTAAACCCTTGACATGGTGGGCCGCCCACAACGCAATCTATTTCTTTGACATTACATCCAGATAAAATATCTTGAGCAGTAAGTTTACTAACATCCTTATGGAGATGTGCAGACTCAGGGAAATTTAATTTGTGAGAAAAAATAGCATGCTTATCTATTTCAACGGCCCCAGCTAAATTAAAGCCCGCACGCGTTGCCCCCAAGCTTAAACCGCCTACCCCTGAGTATAAATCAACCACATTCATATTTAGATAACCATGCCCAACAGTTCATTTAATGATCCTGTATCATACCACCACTTCTTCTTTTTACTATAGAACTCATTTTCTTTTACCATTTTCTTGATATAACAGACAAACGTCTTGTCGATGAATGTGACTCCACAGGGCTCTCATTTTGCTTACATTATCGCCAGCTCTCATCTTCCCAAACTTCCTGAAGGATACCGTCCAACGCTTCTCGGTCTGACTCCTGGTCGAACCCAACCAGTCCTACTCCTGTCATTGACCCTTTTTTACCGTCACTCTCGTTGATGGGAAAATTGATTGTACGCGTCGGGTCAACTCGCACTGCAAAGCATCGACAGCTTGCTGCCCGATTTTTTGATCCTTATCCAACGTGATGTTAACCCTCACTTCGCCCCTTTTTTTAATCTTTGTTCAACAGGAGCGGGAGCGAAAAAAACTGAAAAAGAGTTGTTTTTCATCAAGTTCCCTCTCGCAATTTCCGCAATTAAATTCAAAGCAATTTCACGATCTCTTTCCTGACAAACGCCCTCTGTCGTCAGACGCGCAATCATCTCGACCCGTTCAATCATGACTTGCTCGTTTAACTCTCTATCCACACAACCTCCAATACGGAATACTGTATAAATACACAGTATCACGTATCGATAAAAAGATGAAAGAAAAAGTTACGCTATAAAATGACGTATGTGCATGATATGGATATGAATTAGTTACAGTCTCAACTTAGTAACTGACGCTAACCCCGCGACTCGATTTAGGATTTGTCTGGCCTGCGCTCGCTGTGACGGTGCTGTCGGGAAAATTTCACCAGTTGATGAACCACGGCACCATTTGCCGTTTATGCAACTTTTGCCACCGGCTATCAGGTGCAGGGCCTCACCCCGGCTAATGGTTTCGCCGGTAGTGAGCTGAATCTCGTCTATTGTTCTGTCAATGGCTGCAATTTGTTTATCCGTTCCGTGGATAAAATCACGCCTAGTGACCCGTTTTTTGTCCCTGAGTCTGGCTGTTAGCTTCCGCCTTTCACTTCGACTCAACGGTTTGGATAAATCCAGCTCCGGCAGATCGCTTTCGCTTCCCGTACAGTTATTGACAGAACTCCGAGAGGGCGCGGTGGCGCCCTTAACGTCAACCCCCAAATCCTGCGCACGCTTTGGCACAATCTTCCATTGTGTGGCACGGGTGATGATAGGGACGTCGCTACCTACGCTAGCGTCGTAAACACCACGGATGCGGATCTGCTCCTCTCCGTACTGGTTGTAGGTTTCCGCGCACTCATACCAGGTACGCACCTGCAAATCGTCACGGCGTACAAATGGCCCACCTTGGGCATTGACATAATCCGCCCAGCGACCAGCATCGGCCGCATCATGCACGGCTGCAAACTCAATGCTG
AGGCCGTGGGCCGTGTCTGTATCGGATAGTCGGCGTAGCTCACGGTAAACCGTCACCGGCGCACCACCGATAAACTGGAATTGACGGATATGCCAACGAGCCGCCCACGCAGAAACCGCTGGCGCCGTTTCTTTTAGTAGCTCACCGCTTTCATCATCGGTTTCACCATCAAGGGCATATCCGTCAATGTTTTTAGAGATGTATTTAGCGACGTAGCCGGTCGCGCTACCTTTTTCTGGATCGATAGCTTCCGCATGAAACCTGGCTTTGGCCGCCTTATCACTTGCCAGCTCATACGCATCTTCTGCGCAGGCATAGTTGCGGATAATTTCACGCACTCGGTCTGCATCCTCTGGGCGCATAAACATCAGCATGTGCCAGTGCGGGGTCCCGTCATGATGTGGCTCCGCGACACGAATCCCGAATATCCGTAACTCTGCGCGGTGAAGCTTGGCGCGGATTTTTGACCACAGGCCGGTGAGATAGGATTGCGTTGTCGATGGGCTAGCGCCGCTCCATTTATGGTTCCGGTATCCTGCCTTGGTTGTCGCGTGATATTTCGACGGTGCGGTGATGGTGTAAAACTCACCCACATATCCCAACTGCGTGCAGATATCTTCGAATCCACGAATACGAGTCATTAGCTCGCAGCGCCGAATAGCCGGGTTTGCCACTGAACCATCATATTTTTCGATCAGGCTGATACGGTTTCCGTCCTCATCCTCCAGCTCCATTCCTTTCAGAAATTCACGGATGCGGCGGCGCTGCTCCCGCCATTCGCTTACACACGTATGGCTGGCATAGGGCGTTTTTTTCTTACTGACATTACCGGTAGCGATCTGTAGATGTTCCCGCCATGCCGCAGCAACGCGGCGCAGCCGGCCACGCCACCATTTCTCATTGATCATGCGCTGAGTCGCCGGACCAATATCGTCGGCCATCACATAGCTAGTCGTGATCCGCTTCCACAGCGGCGCCTCCACACGAAATAAATCCGCAATCGTGGCAGCGCGCATGTAAAGGACATGCAGATTTTTTAACTCACCGACATCACCTAATGCCTCATCCTCCACGGCAAGATCACTACGGATAAAGTTGGCAATATCACCGGCCAGCAATTCAATATCCGATTTTGACATGTCAGGCAGGCGGTTATAACGAAACACTAACCCCATCAACAGCGATGCCGTATTCGGCACGTTCTCGTTTTTCGAGCGGCCACGGAATAATTTAACGGCCATAGCCGGATCGATATCGGCGATTCGGTACTGCGCCGATACCTGCTCTATGCGCGGCAATGCCCTTTTGCAGAATGAGATCAAAAAGGCATTGGCTCGCCGAGGATCTCCATTACGCTCCAACTGCTCACACGCCTGACGAACAGGAAACCGAACACACTCCAACTGACGACTGATAGCGTCTTGAGCATGCAGAAAAGCCGCAATCTCTCGATCGCGGCGATGTATTTCTTCATAAGTTGGATATGGGCTAGCAATGGCTTCACGCGGCACGTTCCAGCCATAGGGATAACTCATTCACATACTCCCGCATAAACACTGGTGCATACCGCGGTACTGTTCATCTCAGCCAGCATGTCAAACTGGCGTCCGCCTCGTGTAGTCAGTGCCCAATCTCGATAGCTCTCAATTCCATGAGTAGATAGGCTGACACACTCAATTCGGCGCTCTGTCTTTGCAGGGTCTTGCGTGGAGGGGAAAAATGTTGAATTACCACGGCGTGAACATTCCGCTACTAGGTGTTCCCATGCTGCTACGCGCGCAATCTCATCAGGCCAGCGCTGAAATATTTCTGCCAACTCACTTTTGCGCGCATGGATGCATGGCATACAGCCAACACGGCTACATCCTTGCAGATACAGTGGATTCGGCTTAATGCCGTGACGTTTAGCGATAGCGAAAACATCTTCATGTTTCCACTCGATGATTGGGCGATAGACATGTAGCCCCGGCGTATTGTCTGCATCCGTTTCCCACACTGGC
AATAATGCTCTTGCAGGAGATTCTTGGCGCCGGACGCCTTGCCAGCTGATTACCTCATCGTACTGTTCCAAGGCGGGAGCGATGACTTGAGTGCGCACCGGTTCATGCTTCAGCTCGAAAGTGCAAAAGCGAACCCTCGTTGATGGAAACCGACCCTTCCACATACACAGATCGAGGAACGGGATCCCCGTCGGCTTGAGTACGTCTAACGCACGCTGCACACGTTCTGCCGCGGCACTTGGTGATAGCCCACACTTCTCGACGAGAGAAATTGGCCATTTCTCTGCGATAAACTGCCGCTTGTTTTCAATCTGGCGAGAGAAGTCCGCTTTAACGCGAGTTATTTTCCCCAGCTTAGACTCCAAGTAGTCCAAATACTCCATTGTCTGGGGATGTTCATGGCCTGTGTCAGCAAAGACACGTGTGGTAGTAACACCGTTTTCAATAGCCCATAGCCATTGAGCTAAGCTGTCCTTTCCTCCTGACACACTGATAATATTGATGGTGTCATCCGCAAAGCAGCGCTGATCGATGATAGTTGGCGTGCTCACATCAAACCCCCTGATAGTGTTTGCTTTTCAGCTCAAGAATTTCCTGGCACGTAATGCAGCACCGCACCCCCAGCACCGCTCGGCGGCGTGCTTCTGGGATTGGGTCGCCACATTCCTCACAAAATGAGGCTGACGGGTGGACTGCCCGATCCGTCACGCGCTGTAAGTTGCGCGCCAGTTCCTCCTCTGCGCGCCGTTGCGCCATATCGATCGCATCACTCATCAGTGCAGCTCCCGCGCTTGGTTCTGGTAGTATTCGCATTCCCCGCGCAGCAGCTCTGCCGCATCGACCGCGCTCAGGCGCTCAGACTGAATGTGATTGGCAATGCACTCTAAACGCGCCGCCATCACTACTGCACGGTTACGGCGCTCGTCGGTACGAACCTCAGCCAACATGCGCGTTAATGCGCTTCTTGCTGACGTGCTCGGCATTTCGAACTCTTTTTTCATTAGGTCTCTCCTGTTTTCAGGCAAAGCGATGCCCGGCGGGTTTACGCCAGATTTTTTTACTGGTAATTAATTCGGCATTGAGAGCCGTCGAGGAAATAGGCTCACAACTGCCCGTAGCTGGTTCATCGCTGCGATCAGCGCTGCCTGCTCGGCAGTGGTCAACTCACTAAAATCAACGTCGTGTCGCGCTGCTGGAATGTTTGCTAAGTAGAGAATCGCCCCTAATGCCCGGCGGTTTTCTTTGTGGAGCGGATCCCGCACATCCTTCATCTCGTCAAAGAAGCGATGCAACTCTGCGGTTGAGTCCTCCTTGAAAACCTCGCGGCGGATAGCGGCAATGCGGTTCAACGCACTAACCCGGCTTCCTGCGCTTATCGGAACAGCGCGCGCAGCTTCTGTGAAAGCCATGACTCCCCCTGTTTCGCCCCAGAAAGTGATGCCAGCAATTCAGCCTGTGATCTGCACGGGTGCCAGCGCTTGCCGTTTTCACCAGTGATCCAGCCGTGCCCGTAGTGCATTGATGGGCTCTGTTTTTTCAGAAATGACGCTAACGATGGCTCCATGATCATCACCTCACAGCAAGCCAAAGCTGGCGCCCATACCCGTTACGGTATCGACCACACTGTTCATTGCCGGATTGGACTGTAATCGCGCATGTAATGTCAGTGCTGCGAGCGACAGGCAACGGATTCCGGCGTTAACACTTTCAACGACAGTGTGTTTGTGTGAACGCGTGATCGCCCCCGTCATCGCCACACCAGCCACTTTCCCGACCTCTGCTGTCGCGCTCAAGACGTAGGCTTGCAACTTCTCCGGCGCCACCTCATTCACGGGTACACACGGCAGACAATGGAGCTGAGTTAAAAAGCCGTCAACGAGCGTGGCGTCTTCTGTCACATCCGTCAGGTTTAGGATCTCAATCCAAGTGAGTGCATGAGGCTGCTCTGGGTTCAGCTTGTTACGCAGTGTCTGCGCTGACATTCCGACCTTTGCGGCTAG
CTGTGCCAGGTTATGCCGCAAAGCGAAAGCGCGGCATGCTTCGTCAAAGTGCGGATGTTTGGAAACCTGAAAATCAAACATGTTGCATCCTTACAATTCACATAAAGTGAATCAAGCGCCAATGACTAGCTGAAAACGGGAATGACCCAACGCTTTACGCATTTGTTCTTCTTTCCAGCGCGCGTAGTAGATACGAATCGGGCCACCTGCTTTCTTACAGCCTTTGCGGATAGTGCGAGGTTCGATTGGCACACAAGGGTTGTCGCCGGTAGTCCAGCGGTAAGCAGTACGTTCAGAAACACCCTCAAGCTCTGCGAACTGCTGCAGAGTAACGATAGGTGCAGGCACTTTGATGATTGCGATTTCAGAAGCCATATTGCATGATTCCCCATTGGTAAATATTTCCCATTGATAGCCTAAGTTTTGCCAACGTTTGCCATCAATGAACTTCACAATTGAGGACTCTATTCAACAAATGAGTAGCTGTCAACATGGAAATCACAAATGAAGATTAGAGACTACACATTTGAGGCTGTGTCTATACTGGATCGCATATGCGAAATCTATGGTTTTCGCCAGAAAATTCAGTTAGCAGACCATTTTGAAATTTCCGCAAGCTCTCTTTCAAACCGCTATACACGCGGCACTATCTCATATGATTTTGCAGCTATATGCGCGCTGGAAACTGGTGCCAGCCTAAAATGGTTACTGACAGGAGAGGGAGAGCAGTACAGCGAAAACGCAATTAACTCCAACGTAAAAGAAATCCCATCATTCACTTTAAGTGAAGAAAGATTAACTGAAGACTCCCCTTTGAGTATTGACCCTAAGTTAATCAACAAGCCGGAATCTGATTGCTACGTAGTTCGTGCTGATGGAAAACTTCACTTCATAGATAGAGACTCAACTCTCTCCGATGGCTTGTGGTTAGTTAACATTGATGATGCGATAAGCATTCGAGAGCTAACCAAACTCCCCGGCAAGAAACTCCATGTAGCAGGTGGCAAAGTTCCTTTTGAGTGCGGGATAGATGAGATTAAGACTCTTGGTCGAGTAATCGGGTTTTATAGCGATGCAAGCTAATCGCTTTTAATATTATTAATAAGGAAATTATGATGGATACTATAATTCCATTTTTGTTATTAGGCTTGTCGGTTTTCTCAATGGTTATTTACTTAAAATCACCTGAAAAACTGATTATGCGTGCGGTTAAAGGAATTACAGCTTTTATCTTCCCGGTTGGAGCTTTAGGCGCATTTTTGAGTGGTGATTATGCAACAGCGTTAACAATTTCCGTTATTGTCTTTCTCATTTCACTTCGTCGTATGAATATAAACAAAAAAGACTCATTTCCATCAGTTGCAACTGAACAAAGTACCCATGTGAATAAGTCAAAGCCATTCTCTGGCAATCACAACACAAAGGATTGGTTCAAAAATATTTCCTTCAGTTACACGGATTCTAACGGCAATTCATCTTATAGAGAGGTTGACATAAAGGAAATTAATGAACAAAGCATGACAGGCTATTGTCACTCACGCAGACAACTACGAACATTCCGATTAGACCGAATTGATAATAGCGAAATTGTAATCCGCGACACCGGCGAATTAATCAATGTTTATGACTGGATTGTCCAGCTATATGAGGAATGAGGTTAACTAATGACCGTGCGTAAAAATCCTGCTGGCGGTTGGATTTGTGAGCTTTATCCAAACGGGGCAAAAGGCAAGCGAATCAGAAGGAAATTCGCCACCAAAGGTGAGGCTCTGGCGTTTGAGCAGTACACCGTTCAAAACCCGTGGCAGGAAGAAAAGGAAGACAGGCGCACGTTAAAAGAGCTGGTTGATTCATGGTATAGCGCTCATGGCATTACACTGAAAGACGGCTTGAAACGCCAGTTAGCCATGCACCATGCTTTTGAGTGTATGGGCGAACCACTCGCACGCGATTTCGATGCGCAGATGTTTTCCCGCTACCGAGAAAAACGGTTAAAAGGT
GAGTATGCCCGTTCAAACAGAGTGAAAGAGGTATCGCCTCGCACGCTTAATCTTGAGCTAGCCTACTTCCGGGCAGTGTTCAATGAGCTAAACCGCCTCGGAGAATGGAAGGGGGAAAACCCACTGAAAAACATGCGCCCATTCCGCACAGAAGAAATGGAAATGACCTGGCTAAACCACGACCAGATTTCGCTACTGCTCGGAGAGTGCAAACGGCATGACCACCCTGATTTAGAAACCGTGGTCAGAATCTGTCTTGCCACTGGCGCCCGGTGGTCTGAAGCTGAGGGTCTGAAAAAAAGCCAGCTCGCGAAATACAAAATCACATACAGCAACACGAAAGGCAGAAAAAACCGCACTGTTCCAATCAGCAAAGAGCTCTACGGGTCTCTGCCTGATGATAAAAAAGGTCGGTTATTTAGCGATTGTTATGGCGCGTTCCGGTCAGCTCTGGAAAGAACAGGTATCGAACTACCGGCAGGACAGCTTACCCACGTTTTGCGCCACACCTTTGCCAGCCACTTTATGATGAATGGTGGTAATATTCTAGTTTTGCAGCGCGTGTTAGGCCATACCGACATAAAAATGACAATGCGGTATGCACACTTTGCTCCAGATCATTTAGAAGATGCCGTTAAATTTAATCCGTTAAGCAACATTCAATTATAAAGCCCATATTAAGGAGTACATATGGCAGCAACATCTTATACATGCCAATTATATGAGAGTGAAGGTATCGCGAAAATAATTTTATTTTCCGGATTGGAATTTGACTTGATAATACATCCATATGTATATGGGGAACCATTAGTAACCACACAGCCAGTCTATTTCCTTGAGCAATTGGGTTCAATAGCAAAATTAAGAGTTGAGCACCCTAAAAAAATCACAGAGCTCGAAACCGAATATCTGATAGATAATTATTTATTTGAATACAGTCTTTTATATAGCACATCTCGCCTTTGTTCAAAGATAAGCACACCAGCATTTTGGGCTCCGGATTTTAGCGACTTTTACCAATATCATGATCAGCGTAGAACAAGGGCATTGACTCTTGATCCATTAAATGACGAGTCAATCATTTCAATTCAAGACCTAGAAGGAAATGACTGGCCTTTTACAGACTACTGTATACCTAAAGAGTTCCTTGATGAAGCCCTTACGGAATCAGCCACAAAAATTCTAAAACTACATGAGCAGAAGCTAATAACAATGATTACCCCCTCTCCTGAAAGGGATGGGTATTATGGCAAACTGAGGTTACTAGAAAGCGAAGGGACTTTAATCGATTTTTCCCAAGCTGTTCAACTAGACGCCATAAGTGAATACAATATAAAAGATAAAGATACATATGAATTATCATCAGAAATTGATCCTAGCAATAAAATTGAGCTTCCAATTGAAAGAGTAACTACAACAGAAAAATACTCCCCTACGCTGCTGTCATATTATTTTTCTGGCCTTAGAGAACGAAATCCATTAATTAGCTTTACGGGTTTTTATAATGTTCTAGAGTATTATTTAGAAGAAGCTCCTGTAATTTTAGGAGTTCCTCCACTGAAAACAGAAAGAGAGAATCTTCAAAAAGTAGTTGAACTACTCACAGATCAAAATGAGCTTTATACCAAACTCAACTCTTTCAATAGCACATTAAGAGCAAAACTACAAAATGATATTATCTCATCCTCACAAGTCAAAATTAAAGGATTACGAATAATTAATCGTTCATCATTGTTAAAGGATGTAAGTAATTGGCTGTACGGGATACGTTGTGCTGTAGTCCATTCTAAAAAAAGTAGAAAAGGGAGAGTAGAGGCTATCTTTGAACCGTACTCAAAAGAATCTGAAAATGTCACTCCAGCGCTGGAGGTCATTAAATGGCTTGCTCAAAAATGTATCGTTAAGGATAATCAACTCTCGAAACAAACCGTATAAAAATTGTCCATGATTTGACCGAACTTAGCTTCGCCATCACCATCTGATGGCGATAAAGTGGCGGT
AGAAATGGCGGATAATGAGTAAACATTGGCAAACACTGGCAATCTATGTCAATGATAAATAACGCAAACTATTGATTTTCGGTTGTCCCTATAGGAACTCATAATCGCTTGGTCGTTGGTTCAAACCCAACAGGGGCCACCAAATTTTAGCTTTAGATTCAGTCATTTAAGCCACCTCTAACCGGGTGGCTTTTTTGTTGCTTGGAAATGATTGCCCCTATTTTGTCCCTCAGTCCTCAAAGAGGCGCCATGAACGCTTACCGCGTCACGGTGAGCATAAACCGCCAGGTGCTCCCGCTCACACTGCGCTAACGTCACCGTCCCGGCTAGCGCGCCGATGAACGGTTCGGCAACCTCCGGCACCAGCCTGATAGCTTAGCCCCCTCGGTGGCAAGGTATGGTTAGGCGTCGCATGCGCACGCAATCGGTGCTGTCGAGATAGTCCTCCATGCTGGTGTCCTTAATCACGCTGGGGTAAGGCTGATTTTTGTCCTACTCAGCGATGAAAAGATCAGCAATGATTGATTAGTATCAACTTTACTTTCATTCGGAAGCTATTTAATCTTCATAAGAAGAACGTTGGAGTCTTAATGATGATATTTAATATTCAGCGTTACTCCACACACGATGGTCCCGGGATCCGCACGGTGATCTTTTTTAAAGGGTGTTTGCTCGCCTGCCGCTGGTGCCAGAACCCGGAGAGTTTAGCGCGCGCTCCAGAATTGCTCTATGACGCCCGCAGCTGTTTGGCGGCGTGTACGCAGTGCCAGACAGCTGCGCCCAACGTCGCGGTACGCGAAGGACGGGGGATAAGACTGCTACGCGCACACGCAGATGCGCGCGCCATTGCAGCGCTACGTGACTGCTGCCCCAGCCAGGCGTTGACGGTCTGTGGCGAAGAGATGTCCGTCGCGGCGATCCTCCAGCAGATCGAACGCGATCGTCCTTTTTATCAACGCAGCGGCGGCGGCATTACCCTCTCCGGTGGGGAACCCTTCATGCAGCCACACTTAGCCGAAACGCTGCTGCGTCGCTGCCAGCAAGCCGGGATCCACACCGCTGTCGAAACCTGCCTGCATGTCCCTTGGCGCTATGTGGCCCCGGCGCTGCCGTGGACAGATCTGTTTCTGGCCGATCTCAAACAGGTGGATAGGGAGCGCTTCCGTACCTGGACCGGCGGTAGCGTAGGACGCGTCATGGATAACCTGCGTCGCCTGTCCGCCCGCGGTAAAGCCCTGATCCTGCGTGTCCCATTGATCCCTGGCTTCAACACCGATCTCAACGACATTCAACGCATCATCGATTTCGCCGCCAGTGAGCTGGCGACCAGCACGATCCACTTTCTGCCCTACCACACCTTGGGGCGCAACAAGTACCGCCTGCTCGATCGCCCCTATCTGGCCCCGGAACAGGCATTCAACGATCCCGCCTTGCTGGATGCCGCTTGCGCCTATGCAAGTCAATGCGGCTTGAGCGCGACACTGGGAGGATAGCATCATGACCACTCTGCACCTGACTACTCTGAGCGCACGCATCCGGGCGCATAAAGCCGCCCTGATCCACATCGCCACGCCGTCCATCTGTACCGAGCGAGCCCAACACTATACCGAGGCCTACCAGCGTCATCTGGACAAACCGCTCCCAGTGCGACGCGCCTTGGCGCTGGCGGAGCACCTGGCCCGCCGAACTATCTGGATCGATCACGACGAACTGATCGTCGGCAATCAAGCCAGCCGCGTGCGCGCCGCACCGATCTTCCCGGAATACACTGTCAGTTGGATCGAACAAGAGATTGATGAGCTGGCCGATCGTCCCGGGGCCAGGTTCAACGTCAGCGAGCAGGATAAAGCCGTGTTACACCGTCTCTGTCCCTGGTGGCGCGGGCAAACTGTGCAGGATCGCTGCTACGGCATGTTCACCGACGAGCAGAAGGCGCTACTCGCCAGCGGGATCATCAAGGCGGAAGGCAACATGACCTCCGGTGACGCGCATTTGGCCGT
CAACTATCCCTTACTGTTGACGCTGGGGATTGCGGGGTTGCGCGCGCGGGTCGCCGCACGCCGACGCCGTATCAACCTGACGCAGCTCGACGATCTGCACGGTGAGCAGTTCCTCAAGGCGATCGACATCACCCTGGTGGCGGTCAGTGAACACTGCCTACGTTTTGCTGCCCTAGCGCGGCGCATGGCCGCCGACGAGCAACGGCAGAGCCGGCGCGCCGAACTGCATACTATCGCCACCAACTGTGAACACATCGCCCAACAGCCACCGCAAACTTTCTGGCAAGCGCTACAACTATGCTATTTCATTCAATTACTGTTGCAGATCGAGTCCAACGGCCACTCCGTCTCCTTTGGCCGAATGGATCAGTACCTCTATCCCTGGTATCGCCGCGAGGTCGAGCTGGAGCATAGCCTCGATCGCGAACAGGCCATCGAGCTTCTACAAAGCTGCTGGCTGAAGCTGCTGGAGGTTAACAAAATACGCTCCGGTAGCCACTCCAAGGCCTCCGCCGGCAGCCCGCTGTACCAGAACGTCACCATCGGTGGACAAAATTTGGTCGACGGCGTCCCCCAGGACGCGGTTAACCCGCTCTCCTATGCCATTCTGGAGTCCTGCGGACGCCTACGCTCCACCCAGCCTAACTTGAGCGTGCGCTACCATGCCGGGATCAGCGACGACTTCCTCGATGCCTGCACCCAGGTGATCCGCTGCGGCTTCGGTATGCCGGCCTTCAATAACGATGAGATCGTCATCCCCGCCTTTATCGATCTGGGCGTCGCGCCACAGGATGCCTACCAGTATGCTGCCATCGGCTGCATCGAAACCGCCGTTGCTGGGAAGTGGGGCTACCGCTGTACCGGAATGAGTTTTATCAACTTCGCCCGCGTCCTGCTGGCGGCACTCGACAACGGCAAAGACGCCACCAGCGGTCAAGTTTTCCTGGCGCAGCCGCAGGCGCTGTCACAAGGGAACTTCGCCGACTTCGCCCAAGTCATGGCCGCCTGGGAGCGCCAGATCCGCTACTACACACGCAAGTCCATCGAGATCGAATACGTGGTAGATACGGTACTGGAAGAGAATGCGCACGACATCTTATGCAGCGCTTTGGTGGATGACTGCATTGAGCGCGCCAAAAGCATCAAACAAGGCGGGGCCCACTATGATTGGGTCTCAGGCCTTCAGGTCGGTATCGCTAATCTGGGTAACAGCCTGGCAGCGGTGAAGACGCTGGTGTTTGAACAGGGACGCGTCAGCCAACAAGCGCTGGCACGGGCACTGGCTGAAGACTTTAACGGCTCGCAGCATGAACAGCTACGCCAGCGCTTACTCAACGGTGCGCCGAAATATGGCAACGATGACGATCGCGTAGACGCCTTACTGGCACAGGCCTATGGCTACTACATCGACGAGTTGTCTCATTACCATAACCCTCGCCACGGTCGGGGACCGATCGGCGGGGGCTACTACGCGGGCACCTCCTCCATCTCGGCCAACGTCCCTTTCGGAGCGGCCACCCTGGCTACCCCGGATGGCCGCAAGGCGCACACGCCACTGGCCGAAGGCGCCAGCCCGGCCTCCGGCAGCGACCGTCTCGGGCCGACAGCGGTGATGGGATCGGTCGCTAAACTACCAACCGCCGCCATCCTCGGAGGGGTACTGCTTAATCAGAAACTGAACCCCGCCACGCTGGAGAGTGAGCGCGACCGCCAAAAACTGATACAACTGCTGCGCACTTTCTTCGAGGTACACCAAGGCTGGCATGTGCAATACAACATCGTCTCACGCGAGACGCTGCTGGATGCCAAGGCTCACCCGGAGCGCCACCGTGATCTGGTGGTACGTGTCGCCGGCTATTCGGCCTTCTTCACCGCACTGTCGCCGGATACCCAGGACGACATCATCGCTCGTACTGAACACCAACTGTAA
Protein sequences of DBSCAN-SWA_3 >NZ_CP016043|3205796:3249648|3244408_3245653_+|WP_083275040.1|DBSCAN-SWA MAATSYTCQLYESEGIAKIILFSGLEFDLIIHPYVYGEPLVTTQPVYFLEQLGSIAKLRVEHPKKITELETEYLIDNYLFEYSLLYSTSRLCSKISTPAFWAPDFSDFYQYHDQRRTRALTLDPLNDESIISIQDLEGNDWPFTDYCIPKEFLDEALTESATKILKLHEQKLITMITPSPERDGYYGKLRLLESEGTLIDFSQAVQLDAISEYNIKDKDTYELSSEIDPSNKIELPIERVTTTEKYSPTLLSYYFSGLRERNPLISFTGFYNVLEYYLEEAPVILGVPPLKTERENLQKVVELLTDQNELYTKLNSFNSTLRAKLQNDIISSSQVKIKGLRIINRSSLLKDVSNWLYGIRCAVVHSKKSRKGRVEAIFEPYSKESENVTPALEVIKWLAQKCIVKDNQLSKQTV >NZ_CP016043|3205796:3249648|3222933_3223575_-|WP_070245292.1|plate|DBSCAN-SWA MKLQANITEILRLLRNLIRTGVIVEVNTQSGRCRVQTGGVKTDWLQWLTLRAGSARTWWAPSVGEQVILLAVGGELDTAFVLPAIYSDEHPAPSDSEQAWRVDFPDGAVIEYEPKTGALTASGIQTADITASTSITATVPLVTVKAETRILLDTPEVVCTNKLTTGTLEVQKGGTMRGDIEHSGGAFTSNGVRVDKHSHGGVQSGVSWTEGTQ >NZ_CP016043|3205796:3249648|3235168_3236320_-|WP_020316745.1|DBSCAN-SWA MNVVDLYSGVGGLSLGATRAGFNLAGAVEIDKHAIFSHKLNFPESAHLHKDVSKLTAQDILSGCNVKEIDCVVGGPPCQGFSSIGKGNADDTRNELYVHFFRLVSELSPICFLAENVPGIMNEKYNSVREKAFSLVSDRYTLLHPIKVNASNYGAPTTRTRIFFIGFRKDYSSQLKESDFFPKNIIEQTFIKDALYGLPRIIKKEWQEESQGWRKVKMDRKGYFYDRLWGHIPDRVGDAESLKKLEDGLVSGFLGTVHSDEIIKRYGSLSFGETDKISRSQRLDPNGFCPTLRAGTGSDKGSYQAVRPIHPTQARVIAPREAARLQGFPDWFRFHPTKWHSFRQIGNSVSPLVSEAMLLPIYNYCLSVKKELSNTMVLPCSIR >NZ_CP016043|3205796:3249648|3246311_3247211_+|WP_070245674.1|DBSCAN-SWA MIFNIQRYSTHDGPGIRTVIFFKGCLLACRWCQNPESLARAPELLYDARSCLAACTQCQTAAPNVAVREGRGIRLLRAHADARAIAALRDCCPSQALTVCGEEMSVAAILQQIERDRPFYQRSGGGITLSGGEPFMQPHLAETLLRRCQQAGIHTAVETCLHVPWRYVAPALPWTDLFLADLKQVDRERFRTWTGGSVGRVMDNLRRLSARGKALILRVPLIPGFNTDLNDIQRIIDFAASELATSTIHFLPYHTLGRNKYRLLDRPYLAPEQAFNDPALLDAACAYASQCGLSATLGG >NZ_CP016043|3205796:3249648|3209423_3211265_+|WP_024523601.1|DBSCAN-SWA 
MEQNPQSQLKLLVTKGKEQGYLTYAEVNDHLPEDIVDSDQIEDIIQMINDMGIQVMEEAPDADDLMLAENIADEDAAEAAVQVLSSVESEIGRTTDPVRMYMREMGTVELLTREGEIDIAKRIEDGINQVQCSVAEYPEAITYLLEQYDRVEAGEARLSDLITGFVDPNAEEEMAPTATHVGSELPEEELNDDDEDEDEDGDSDDSDDDNSIDPELARQKFIELREQHEKTRLAIKAEGRSSAKAQEEILNLSEIFKQFRLVPKQFDYLVNNMREMMDRVRGQERLIMKLCVEQSKMPKKNFITLFTGNETSITWFEAALAMGKPWGEKLKDMREEVERSLQKLHQIEEETGLTIEQVKDINRRMSIGEAKARRAKKEMVEANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKLNRISRQMLQEMGREPTPEELAERMMMPEDKIRKVLKIAKEPISMETPIGDDEDSHLGDFIEDTTLELPLDSATSESLRSATRDVLAGLTAREAKVLRMRFGIDMNTDHTLEEVGKQFDVTRERIRQIEAKALRKLRHPSRSEVLRSFLDD >NZ_CP016043|3205796:3249648|3216750_3217044_-|WP_005295823.1|tail|DBSCAN-SWA MAKEKTVTELAQEAPNTITLKHPVKRAGQTIEQVTLITPNTGHLRGLSLAAVASAEVDALIKLLPRMTMPSLTEQEVASLSLPDMTAITAKVINFFG >NZ_CP016043|3205796:3249648|3227435_3228539_-|WP_070245297.1|capsid|DBSCAN-SWA MRKNTRFKFNQYLTRVAELNGVAVDDLKNKFAVEPSVTQTLITSVQDTSEFLTRINMVPVDEQEGEKIGLGVTGSIASTADTDGGTGRATADFMALASRKYRCEQVNFDFHIRYNTLDLWARYQDFQLRLRDAIAKRMGLDYIMAGFNGTSRAATSNRKTNPLLQDVAVGWLQKMRNEAPKRVLDKITNTSGEVVSAVVRIGANGDYENLDAAVMNATDTLLDPWHSEDPELVVICGRKLLADKYFPLVNKKQGNSEAMAADVIVSQKRIGNLPAVRVPYFPDNALMVTRLDNLSIYVMDSSHRRHIDENAKFDRVENYESLKIDYVIEDYGCAAMIENIQFGTFPAKPAVASLSAEQGAPVENAGG >NZ_CP016043|3205796:3249648|3223644_3224094_-|WP_070245293.1|DBSCAN-SWA MSEFSPFNDTLAGLIITLTATERRKIAAEIARRLRASQQRRIRAQQAPDGTPYAARKPQQVRGKRGRVRRAMFAKLRTNRYMLATADPDAATVTFAGRVQRIAQVHQYGLKDKPNRHSAEVQYEARPLLGFTDEDKKIVEEVLLSHLKN >NZ_CP016043|3205796:3249648|3239249_3240254_-|WP_070245673.1|DBSCAN-SWA MIDQRCFADDTINIISVSGGKDSLAQWLWAIENGVTTTRVFADTGHEHPQTMEYLDYLESKLGKITRVKADFSRQIENKRQFIAEKWPISLVEKCGLSPSAAAERVQRALDVLKPTGIPFLDLCMWKGRFPSTRVRFCTFELKHEPVRTQVIAPALEQYDEVISWQGVRRQESPARALLPVWETDADNTPGLHVYRPIIEWKHEDVFAIAKRHGIKPNPLYLQGCSRVGCMPCIHARKSELAEIFQRWPDEIARVAAWEHLVAECSRRGNSTFFPSTQDPAKTERRIECVSLSTHGIESYRDWALTTRGGRQFDMLAEMNSTAVCTSVYAGVCE >NZ_CP016043|3205796:3249648|3222586_3222937_-|WP_047059219.1|plate|DBSCAN-SWA 
MTARYVGMSQASGLSLEDAAHIRQSVRDILITPIGSRIMRREYGSLLSALIDQPQSPALRLQIMAACYMAILRWEPRVRLTAIRFEQSTPGGLFVDITGVSALTGGGAFSLTVPLS >NZ_CP016043|3205796:3249648|3225077_3225575_-|WP_070245296.1|DBSCAN-SWA MSAINLHPNVAAFLDMIAFAEGTATHPLTRNRGYDVIVTGMDGGPEVFTDYSDHPFAHGRRAKVFNRRGERSTASGRYQQLYLYWPHYKKQLNLPDFSPASQDKLAIQLICERRALEDIKRGDIERAISKCRGIWASLPGAGYGQREHALDTLVSVYHQAGGVVA >NZ_CP016043|3205796:3249648|3236706_3236919_-|WP_070245302.1|DBSCAN-SWA MDRELNEQVMIERVEMIARLTTEGVCQERDREIALNLIAEIARGNLMKNNSFSVFFAPAPVEQRLKKGAK >NZ_CP016043|3205796:3249648|3240270_3240492_-|WP_070245304.1|DBSCAN-SWA MSDAIDMAQRRAEEELARNLQRVTDRAVHPSASFCEECGDPIPEARRRAVLGVRCCITCQEILELKSKHYQGV >NZ_CP016043|3205796:3249648|3224628_3225081_-|WP_070245295.1|lysis|DBSCAN-SWA MTWRLTLFLIVMSLALGAALWLRHENSNLRRGFDTANKVASAQKTQITMLKNQLSTASAVSRRQERDQVVLRQQLDAANTLATHRNKTITRLLNENETLRSWWRTALPDVVVRLHTRPAFDNPDDYLQWLSRREQLPDTGEPPDQQRRSE >NZ_CP016043|3205796:3249648|3226073_3226577_-|WP_005281582.1|head|DBSCAN-SWA MTTVIIDQSGMPAGDAVIIPPDSQREPVIKNTFFFPDVEPRRIRDLMRLEYTVTPARMRDAICAGIAEVNAELVEFRERQMLLGFKTLDAVPAETIDGESVCCFHYLRAVSAMTAASLYERYRGYDASGKGEKKAESVEAVIDELWRDMRWSVSRLQGRPRCIVSQL >NZ_CP016043|3205796:3249648|3211355_3211850_-|WP_024523600.1|DBSCAN-SWA MSVDILAPGLRVVFCGINPGLSSAHTGYHFANPHNRFWQVIHLAGFTARQLAPSEERHLLETGCGITALVQRPTVTAAELTAAELYAGGEALRAKIQQYQPQALAILGKQAFTQAFGVRRPSWGRQAQRIGESEVWLLPNPSGLNRATLAELVAHYQALYQALR >NZ_CP016043|3205796:3249648|3232804_3234229_+|WP_070245301.1|DBSCAN-SWA MVSSNVKITAFPAKRFFVEMLTRDIELSDSILDLLDNCLDGVLRKNNFTPEQTFGKSDVYNGYHAHIEFDENCFKIVDNCGGIPGELAENYAFRLGRPSEREAEDLPTIGVYGIGMKRAIFKMGTSAQIKSKTDTEQFSVNISPEWMTDDNNWSLDLERSDVDLSETGVSIAINDLRNDIKASLSKDRDFESDLINIIANHYSLIIKKGFEVKINGKVVKPNSTTLIFDESSIKDNTDGIAPYIYKNESNGVSIKVAVGFYRNLPSDEEEEQLLSGRSTTEKAGWTIICNDRVVLHADKSKLTGWGEAGVPQYHTQFIGISGVVIFTSSKAELLPITTTKRGVDGNSELYLSTKDFMREGLKYFTDFTYKWKANNEERKQLISTASNMVSTTETDFAKSIPQEKWSTVRRSIGGQVFKPKLPMPRETDPLRQIKFSRRLSEIKLVSEFIFDDATQPPTEVGQYCFDEYLKKAKQ >NZ_CP016043|3205796:3249648|3221069_3221681_-|WP_070245290.1|tail|DBSCAN-SWA 
MNNDRLLPTGSSVLELAAAKACAELARVPVPLRTLWNWRTCPVNLLPYLAWAFSVDRWNEHWPEETKRNVVAVAHFVHRHKGTIGAIRRVVEPLGYLIEVREWFQLNETPGTFRLVIGVLETGITEEMFRELERLVADAKPASRHLTGLTISLTSSGRGYVGASCYDGDALTVYPYTPELIEVGGNHYPASAIHLIDNLRVTA >NZ_CP016043|3205796:3249648|3217635_3218823_-|WP_070245288.1|tail|DBSCAN-SWA MSDFHHGVQVLEINDGTRVISTVSTAIIGMVCTASDADATAFPLNVPVLITDVQGAVGKAGTKGTLAAALQAIADQSKPVTVVVRVADGEGSTEEEKLAATVSNVIGTTDENGKYTGMKALLDAAVVTGVKPRILGVPGLDTQAVATALGAICQKLRAFGYVSAWDCKSISEAIAYRKNFSQRELMVIWPDFLAWDTKTNAATTAWATARALGLRAHIDQTIGWHKTLSNVGVNGVTGISASVYWDLQAPGTDADLLNEAGVTTLVRKDGFRFWGNRCCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDKPVTPTLIKDIVEGIRAKFRELKSNGYIIDGDAWFDEAANSKETLKAGKLYIDYDYTPVPPLENLTLRQRITDKYLVNLAQSVNS >NZ_CP016043|3205796:3249648|3240491_3240719_-|WP_070245305.1|DBSCAN-SWA MKKEFEMPSTSARSALTRMLAEVRTDERRNRAVVMAARLECIANHIQSERLSAVDAAELLRGECEYYQNQARELH >NZ_CP016043|3205796:3249648|3237033_3239079_-|WP_167352273.1|DBSCAN-SWA MERNGDPRRANAFLISFCKRALPRIEQVSAQYRIADIDPAMAVKLFRGRSKNENVPNTASLLMGLVFRYNRLPDMSKSDIELLAGDIANFIRSDLAVEDEALGDVGELKNLHVLYMRAATIADLFRVEAPLWKRITTSYVMADDIGPATQRMINEKWWRGRLRRVAAAWREHLQIATGNVSKKKTPYASHTCVSEWREQRRRIREFLKGMELEDEDGNRISLIEKYDGSVANPAIRRCELMTRIRGFEDICTQLGYVGEFYTITAPSKYHATTKAGYRNHKWSGASPSTTQSYLTGLWSKIRAKLHRAELRIFGIRVAEPHHDGTPHWHMLMFMRPEDADRVREIIRNYACAEDAYELASDKAAKARFHAEAIDPEKGSATGYVAKYISKNIDGYALDGETDDESGELLKETAPAVSAWAARWHIRQFQFIGGAPVTVYRELRRLSDTDTAHGLSIEFAAVHDAADAGRWADYVNAQGGPFVRRDDLQVRTWYECAETYNQYGEEQIRIRGVYDASVGSDVPIITRATQWKIVPKRAQDLGVDVKGATAPSRSSVNNCTGSESDLPELDLSKPLSRSERRKLTARLRDKKRVTRRDFIHGTDKQIAAIDRTIDEIQLTTGETISRGEALHLIAGGKSCINGKWCRGSSTGEIFPTAPSQRAQARQILNRVAGLASVTKLRL >NZ_CP016043|3205796:3249648|3225561_3225870_-|WP_005281588.1|holin|DBSCAN-SWA MNETDKSVLSLFVIGVMIVIGKVLVGGEPVSPRLFIGRCLLGGFVSMVAGVALVQFPDLSPTAVNGIGAMLGIAGYQVVEILIQRRIGKGGNAAKGGSDERH >NZ_CP016043|3205796:3249648|3234225_3235218_+|WP_064169842.1|DBSCAN-SWA 
MSTGGSIPYHLRQNKAIERNLFIESLRRLNNYTNISEYEYIGFGGPFLEDFKQVHNLLKVNKMISIEGDENVYRRQQFNKPLSCIDLGEEPEMSGDFINRYNFDEKTIIWLDYAMPSELNAQLNEIVNLITKLKPKDIFKVTLNAHPETLGKDPGERDPRPYRYRKINEILTESFMPVDTTEEDVGFKKYPTLLINALKRAVGNGLKGRNDIRIHPLTSFVYKDGQQMVTLTAIVLDNTDEEEAKFIDSSRIRNWPFYAGEWRKPKDINVPAMSLKERIHIESLLPEATVENIHEELGFYIGSSTAGANIDLNNFIEYYKVVPWYSKVLF >NZ_CP016043|3205796:3249648|3243382_3244387_+|WP_070245311.1|integrase|DBSCAN-SWA MTVRKNPAGGWICELYPNGAKGKRIRRKFATKGEALAFEQYTVQNPWQEEKEDRRTLKELVDSWYSAHGITLKDGLKRQLAMHHAFECMGEPLARDFDAQMFSRYREKRLKGEYARSNRVKEVSPRTLNLELAYFRAVFNELNRLGEWKGENPLKNMRPFRTEEMEMTWLNHDQISLLLGECKRHDHPDLETVVRICLATGARWSEAEGLKKSQLAKYKITYSNTKGRKNRTVPISKELYGSLPDDKKGRLFSDCYGAFRSALERTGIELPAGQLTHVLRHTFASHFMMNGGNILVLQRVLGHTDIKMTMRYAHFAPDHLEDAVKFNPLSNIQL >NZ_CP016043|3205796:3249648|3207035_3207251_+|WP_005295814.1|DBSCAN-SWA MPVIKVRENEPFDVALRRFKRSCEKAGILAEVRRREFYEKPTTERKRAKASAVKRHAKKLARENARRTRLY >NZ_CP016043|3205796:3249648|3213649_3214132_-|WP_070245284.1|tail|DBSCAN-SWA MMLTLGMFVFQLQTTPYQSLQRDVDYRYPSNSRVGKRPAIQFLGVNEERITLSGVLLPEITGGRLSMLTLDAMAAEGKAWPLIGGDGTIFGMFVASSIHETRTVFFADGAPRRIEFSLTLTRVDESFVTMFGDLKKQAEGMIGNAAAAANKMVANIEGLF >NZ_CP016043|3205796:3249648|3212202_3212421_-|WP_005281637.1|DBSCAN-SWA MFHCPICGYAAHARSSRYLSENTKERYHQCQNVNCSHTFKTMETFQASIVLPGKVNPAVPHPERSGQQPLWM >NZ_CP016043|3205796:3249648|3241292_3241799_-|WP_070245307.1|DBSCAN-SWA MFDFQVSKHPHFDEACRAFALRHNLAQLAAKVGMSAQTLRNKLNPEQPHALTWIEILNLTDVTEDATLVDGFLTQLHCLPCVPVNEVAPEKLQAYVLSATAEVGKVAGVAMTGAITRSHKHTVVESVNAGIRCLSLAALTLHARLQSNPAMNSVVDTVTGMGASFGLL >NZ_CP016043|3205796:3249648|3240785_3241127_-|WP_070245306.1|DBSCAN-SWA MAFTEAARAVPISAGSRVSALNRIAAIRREVFKEDSTAELHRFFDEMKDVRDPLHKENRRALGAILYLANIPAARHDVDFSELTTAEQAALIAAMNQLRAVVSLFPRRLSMPN >NZ_CP016043|3205796:3249648|3207439_3209188_+|WP_024523602.1|DBSCAN-SWA 
MAGRIPRVFINDLLARTDIVELIDARVKLKKQGKNYHACCPFHNEKTPSFTVNADKQFYHCFGCGAHGNAIDFLMNYDRLEFVESIEELATMAGLEVPYESGTGPSQMERHQRQNLYQLMEGLCRYYQQALRQPDATQAQHYLAERGMSQSIMDRFAIGYAPPGWDNALKRFGREPAGRSALNDAGMLVNNDNGRSYDRFRDRIMIPIRDKRGRVIAFGGRVLGNGTPKYLNSPETEIFHKGRQLFGLYEALQHAAQPERLLVVEGYMDVIALAQYGINYAVASLGTSTTAEHIQLLFRHTDSVVCCYDGDNAGREAAWRALETALPYLSDGRQLKFMFLPEGEDPDTLVRQEGTDTFEQRMNAAHPLSTFLFDSLLPQVDLSTPDGRAKLSALALPLIGQVPGETLRLYLRQALGQKLGILDDSQLEKLLPQRNSAVKNYQPPRLKPTTMRILIALLVQNPALAAEVPTLEGLRQHSLPGLPLFIELVETCLAQPGLSTGQLLELYRDNKYAPQLETLATWNHMIIEPMVLETFVDTLGSLYDAVLEQRLEHLIARARTEGLTPAEREEVRSLNEALAKKH >NZ_CP016043|3205796:3249648|3242222_3242801_+|WP_070245309.1|DBSCAN-SWA MKIRDYTFEAVSILDRICEIYGFRQKIQLADHFEISASSLSNRYTRGTISYDFAAICALETGASLKWLLTGEGEQYSENAINSNVKEIPSFTLSEERLTEDSPLSIDPKLINKPESDCYVVRADGKLHFIDRDSTLSDGLWLVNIDDAISIRELTKLPGKKLHVAGGKVPFECGIDEIKTLGRVIGFYSDAS >NZ_CP016043|3205796:3249648|3216580_3216721_-|WP_070245286.1|tail|DBSCAN-SWA MSVDDLMADIAVIFHWPPSELYPMSLAELIFWRGKALTRSGQTTDE >NZ_CP016043|3205796:3249648|3219073_3219424_-|WP_083275038.1|DBSCAN-SWA MKYFKDNQGVVYAFASDGSQDSCIEEGMQPISESEAMEIINPPPTKDDLIKAAEQEKVALLSEVETVTKLWQTQLALGIITEKDKETLTEWMRYAQQIDNVNISLAPDITWPSKPA >NZ_CP016043|3205796:3249648|3221673_3222582_-|WP_070245291.1|plate|DBSCAN-SWA MPIIDLSQLPAPDVIEELDYESLLAERKTTLISLYPEEQRDAIAHTLTLESEPIVKLLEENAYRELLLRQRVNEAARAVMLAYSSGSDMDMLAANLNTKRLTIVPADETTLPPTPAVMESDDDLRLRAQQAFEGLSVAGPVGAYEYHGRSADGRVADVSVDSPAPACVTIAVLSREGNGTASEELLAIVAKALNDEDVRPIADRVTVKSATIVPYQINATLYLYPGPEAEPIRQAAEAKLKAYINAQHRLGRDIRLSAIYAALHVEGVQRVALASPTADIVLDKSQASYCSAYQLTIGGTDE >NZ_CP016043|3205796:3249648|3205796_3206822_-|WP_070245282.1|tRNA|DBSCAN-SWA MRILGIETSCDETGIAIYDDEKGILANQLYSQIKLHADYGGVVPELASRDHVRKTVPLIQVALHEAGLTSADLDGVAYTAGPGLVGALLVGATVGRALAFAWGLPAVPVHHMEGHLLAPMLEDTPPAFPFVALLVSGGHTQLISVTGIGEYRLLGESIDDAAGEAFDKTAKLLGLDYPGGPVLSQMAQQGVPGRFVFPRPMTDRPGLDLSFSGLKTFAANTIHANGDDAQTRADIARAFEDAVVETLAIKCRRALELTGFQRLVMAGGVSANRALRARLAQMMQQRGGAVFYARPEFCTDNGAMIAYAGMVRLKSGVDADLSITVRPRWPLAELPPVNALG 
>NZ_CP016043|3205796:3249648|3214143_3216588_-|WP_070245285.1|tail|DBSCAN-SWA MSNNVKIEVLLKAIDQATRPFKHIQGAGKSLSADLRGTQQTLRELNNQAGKIEGFRKTSAQLAVTSQALKKAKDEANALEEQFKATERPTRAHAKVLEAAKRAAADLQAKENSLRMAVQRQGQELAKTGINTRTLAADERRLKNSISETTLQLNRQRDALARVSAQQARLNAVKRRYQAGKEIAGTAGAVGAAGVGIATSGTMAGVAILKPGYDFAQKNSELQAVLGVGKQSPEMQALRKQARQLGDNTAASADDAAGAQIIIAKAGGDAAAIQAATPVTLEMALANRRTMEENAALLMGMRSAFQLANDKVAHIGDVLSTVMNKTAVNFEELSDALTYVAPVAKNAGVSIEETAAMVGALHDAKITGSMAGTGSRAVLSRLQAPVGQAFAALKELGVKTMDSKGNTRPIFTILREIQASFTRHKLGSGQRAEYMKVIFGEEASSAANVLLTDATSGKLDQLAATLKASDGKTAQLVAVMQDNLGGDFKEFQSAYEAVGTDLFDQQESSLRKLLQTTTKYVLRLDAWIQKNQGMARSILAIGGAALGVIALVGAIGLVAWPVIAGINAIIAAAGALSVAFSVAGGAIGTALAAISLPMVALGAAIVAGALLVRKYWEPISAFFGGVIDGMREAFAPVAEIFSPLTPMFDMLGEKLKAVWTWFSDLIAPVKASQEALDICRNAGQLFGQSLADAILAPIHAIDKLRSGIDWVLEKLGVIDSKSSDLDKAAEKANVYATGANGRGYSPTGGILTGGYQPVSAPTGSSYIDQSQHHYEIPVNGASSPIDVQRQIREGLQAEQRERRARLRSNMNIDY >NZ_CP016043|3205796:3249648|3241829_3242093_-|WP_070245308.1|DBSCAN-SWA MASEIAIIKVPAPIVTLQQFAELEGVSERTAYRWTTGDNPCVPIEPRTIRKGCKKAGGPIRIYYARWKEEQMRKALGHSRFQLVIGA >NZ_CP016043|3205796:3249648|3241090_3241291_-|WP_083275039.1|DBSCAN-SWA MMIMEPSLASFLKKQSPSMHYGHGWITGENGKRWHPCRSQAELLASLSGAKQGESWLSQKLRALFR >NZ_CP016043|3205796:3249648|3229608_3231375_+|WP_070245299.1|terminase|DBSCAN-SWA MTITTDTSLLRDPRRQAALLYWQGFSIAQISEMLQQKRPTVQSWKQRDAWEDTAPITRIENSIEARLVQLVLKDKKEGADYKEIDLLGRQIERLARVNRYSQTGNEADLNPNVANRNKGERKKPKKNFFSDEAIEKLEEVFFDQSFEYQLQWYRAGLAHRIRDILKSRQIGATFYFSREALLRALKTGHNQIFLSASKTQAYVFREYIIQFARLVDVDLTGDPIVIGNNGAKLIFLGTNSNTAQSHNGDLYVDEIFWIPNFQKLRKVASGMASQKHLRSTYFSTPSTLAHEAHSFWSGDLFNKGRASAADRIEIDISHSALAGGLLCADGQWRQIVTIEDALAGGCTLFDLDQLKRENSADDFKNLFMCEFVDDKASVFPFEELQRCMVDALEAWTDVNPYADHPFDRPVWIGYDPSHTGDSAGCVVLAPPAVPGGKFRMLERHQWKGMDFSTQAEAIRALTEKYRVDYIGIDATGIGQGVFQLVREFYPAARELRYNPEVKTAMVLKAKDTIANGRLEYDVSYTDVTASFMAIRKTMTASGRSTTYDASRSDEASHADLAWATMHALLNEPLTAGSGRPSSSILDIN >NZ_CP016043|3205796:3249648|3242830_3243373_+|WP_167352265.1|DBSCAN-SWA 
MMDTIIPFLLLGLSVFSMVIYLKSPEKLIMRAVKGITAFIFPVGALGAFLSGDYATALTISVIVFLISLRRMNINKKDSFPSVATEQSTHVNKSKPFSGNHNTKDWFKNISFSYTDSNGNSSYREVDIKEINEQSMTGYCHSRRQLRTFRLDRIDNSEIVIRDTGELINVYDWIVQLYEE >NZ_CP016043|3205796:3249648|3224086_3224548_-|WP_070245294.1|tail|DBSCAN-SWA MLKPNSLKQALYKALPVLAENPDMMRIFIDEGMIAATLAPSLSFENRYTLNVLILDYRGDINLILVPLAAWLRENQPDIFTTGDGKGFTYISDINTGDSQDLSISLKLTERTLVQQEGARLYVRNVGEPPEPEPVERPTELYINGELVSQWHE >NZ_CP016043|3205796:3249648|3225873_3226077_-|WP_024523586.1|tail|DBSCAN-SWA MKVIAQQGETLDALCATYYGRTQGVVEAVLEANPGLASLGAILPHGTVVELPEIEAAPISETINLWD >NZ_CP016043|3205796:3249648|3231374_3232409_+|WP_070245300.1|portal|DBSCAN-SWA MKKRKNRPTKKMVAESAAAGNGAMAFTFGEPTAVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLRAAVHHSSPIYVKRNILTSTYIPHPMLSQQDFSRFVLDYLVFGNAFFELRQSVTGKALRLETSPAKYTRRGVEEDVYWYVQSFTQPHRFDPGAVFHLMEPDINQELYGMPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVTDPAQNATDVEALRDAMRSSRGIGNFKNLFFYAPNGKGDGIKIIPLSEVATKDDFFNIKKVSAADLLDAHRIPYQLMGGKPENVGSVGDVEKVARVFVRNELTPLQARIAEINQWLGEEVIRFKKYSLDDSEE >NZ_CP016043|3205796:3249648|3212495_3213650_-|WP_070245283.1|DBSCAN-SWA MLTGLTLDAGASLAPAIMLTLGGKDITRNIADRLISLVHTDNRGFEADRVDIELDDSDGLIELPLRGAVLTLFIGWSGHGLIGKGDFTVDEIEHRGAPDTLTIRARSADFRGSLNSRREMSYHDTTLGDIVTTVAQRNKLQAAVAEGLRGIKIPHIDQSQESDACFLARLAERNGAEVSVKAGKLLFLKAGSNMTASGKPIPSVVIERTDGDRHQFAIADRAAYTGVTARWLHTSDPKEQEKKVKLKRKPKEKHLRALEHPKAKPAAKKKKKEKEAREGEYMAGDPDNVLALTTVYSTKAQAQRAAQAKWDKLQRGVAEFSITLAMGRPDLYPDAPARVSGFKSIIDEQGWTITKVVNSISDAGYTTSLELEVKLSDVEYVEDD >NZ_CP016043|3205796:3249648|3247215_3249648_+|WP_024523565.1|DBSCAN-SWA 
MTTLHLTTLSARIRAHKAALIHIATPSICTERAQHYTEAYQRHLDKPLPVRRALALAEHLARRTIWIDHDELIVGNQASRVRAAPIFPEYTVSWIEQEIDELADRPGARFNVSEQDKAVLHRLCPWWRGQTVQDRCYGMFTDEQKALLASGIIKAEGNMTSGDAHLAVNYPLLLTLGIAGLRARVAARRRRINLTQLDDLHGEQFLKAIDITLVAVSEHCLRFAALARRMAADEQRQSRRAELHTIATNCEHIAQQPPQTFWQALQLCYFIQLLLQIESNGHSVSFGRMDQYLYPWYRREVELEHSLDREQAIELLQSCWLKLLEVNKIRSGSHSKASAGSPLYQNVTIGGQNLVDGVPQDAVNPLSYAILESCGRLRSTQPNLSVRYHAGISDDFLDACTQVIRCGFGMPAFNNDEIVIPAFIDLGVAPQDAYQYAAIGCIETAVAGKWGYRCTGMSFINFARVLLAALDNGKDATSGQVFLAQPQALSQGNFADFAQVMAAWERQIRYYTRKSIEIEYVVDTVLEENAHDILCSALVDDCIERAKSIKQGGAHYDWVSGLQVGIANLGNSLAAVKTLVFEQGRVSQQALARALAEDFNGSQHEQLRQRLLNGAPKYGNDDDRVDALLAQAYGYYIDELSHYHNPRHGRGPIGGGYYAGTSSISANVPFGAATLATPDGRKAHTPLAEGASPASGSDRLGPTAVMGSVAKLPTAAILGGVLLNQKLNPATLESERDRQKLIQLLRTFFEVHQGWHVQYNIVSRETLLDAKAHPERHRDLVVRVAGYSAFFTALSPDTQDDIIARTEHQL >NZ_CP016043|3205796:3249648|3217104_3217623_-|WP_070245287.1|tail|DBSCAN-SWA MALPRKLKYLNLFNDGLSYMGVAKSVTLPKLTRKLENYRGGGMNGAAPVDLGLDDDALVVEWTMGGLPDETLWSQYAAPSASAVPLRFAGSYQRDDTGDIVAVEVVMRGRHKEIDAGDAKQGEDTEVKISTQCTYYKLTIDGKDMIEIDTINMVEKVGSVDRLEQHRRNIGL >NZ_CP016043|3205796:3249648|3226670_3227432_-|WP_024523584.1|terminase|DBSCAN-SWA MLSPAECHVMRVSATLAAQRENTPLRHASEYEQMLVKLAADRRKLKGIQSVETKALHKRDMLPFYGPWISGVLAAGRGAQDDIVMTVMLWRFDVDDITGALDIAEYALRHNLRMPEKHSRTTGCVVVEETAAAAARLRAAGAALSIDTLERAIALTVDQDMPDNVRARLHKTVGLLLRDQGDNAAALAQLQRAMQLDDNAGVKKEIQALERAIRPKPVKQPAKPAKKKTAARPRSVTGTPARRGRPPKVKAAS >NZ_CP016043|3205796:3249648|3228604_3229447_-|WP_070245298.1|capsid|DBSCAN-SWA MAKKISKWFRIGVEGDTCDGRVIDANDIQQMAETFDPRVYGCRINLEHIRGLLPTGDFKRLGDVVELKAEKIDDDSALNGKWALYGKIAPTDTLVEWVRDGQKVYTSMEIRPNFANTGKAYLVGLAVTDDPASLGTEYLEFCANATANPLANRKEEPGDLFSVASLAELEFIEQSHVLDSLADKVRGLFSRKARHDDDRFADVHEAVTTIAEQVQSNSDSTEQRFQALEEVTKTLRGELDEERQQVAALRAALDNTENPIQSRRQPATGGNGEDTLLTNC |
48 | Erwinia_phage(27.5%) | terminase,head,capsid,tail,lysis,holin,plate,portal,integrase,tRNA | attL 3212016:3212074|attR 3245815:3245873 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
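The keyword tags described above also appear in the protein FASTA headers of this record, for example `>NZ_CP016043|3205796:3249648|3222933_3223575_-|WP_070245292.1|plate|DBSCAN-SWA`. The following is a minimal sketch of how such a header could be split into its parts. The field layout (contig, prophage region, `start_end_strand`, protein accession, optional keyword, tool) is inferred from the headers shown here, not from any documented specification. The function name `parse_header` and the dictionary keys are hypothetical.

```python
def parse_header(header):
    """Split one pipe-delimited FASTA header into named fields.

    Assumed layout (inferred from this record, not a documented format):
    contig | region_start:region_end | start_end_strand | accession | [keyword] | tool
    """
    fields = header.lstrip(">").strip().split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")

    contig, region, locus, accession = fields[:4]
    region_start, region_end = (int(x) for x in region.split(":"))
    start, end, strand = locus.split("_")

    # A sixth field means a phage-related keyword sits before the tool name.
    keyword = fields[4] if len(fields) == 6 else None

    return {
        "contig": contig,
        "region": (region_start, region_end),
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "accession": accession,
        "keyword": keyword,
        "tool": fields[-1],
    }


if __name__ == "__main__":
    tagged = ">NZ_CP016043|3205796:3249648|3222933_3223575_-|WP_070245292.1|plate|DBSCAN-SWA"
    print(parse_header(tagged))
```

Headers without a keyword, such as the one for WP_083275040.1, yield `keyword` set to `None`, so tagged and untagged proteins can be separated with a simple filter.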
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|