Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_017386 | Ketogulonicigenium vulgare WSH-001 plasmid 1, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NC_017384 | Ketogulonicigenium vulgare WSH-001, complete sequence | 2 crisprs | DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,csa3,WYL | 0 | 7 | 5 | 0 |
NC_017385 | Ketogulonicigenium vulgare WSH-001 plasmid 2, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_017384_1 | 1461978-1462671 | TypeI |
I-C
Consensus repeat of NC_017384_1
|
10 spacers
spacers of NC_017384_1
>1.1|1462009|35|NC_017384|PILER-CR CCCCGATCCAACCGCGAAGTGGGCCAGCCGCGATG >1.2|1462075|36|NC_017384|PILER-CR CACTGTCACTCCGACTTGGAGAGCATCTACAATGAC >1.3|1462142|34|NC_017384|PILER-CR CAGTTGGCGTGCGGGCGTCCACATCTGAATGATG >1.4|1462207|35|NC_017384|PILER-CR CTCATTGTAGATGGTGGTATCGAAACGCGACACCT >1.5|1462273|35|NC_017384|PILER-CR CTCCCATAAAAAAACCCGCCTCTAAGGGCGGGCTG >1.6|1462339|34|NC_017384|PILER-CR CTGCTTCCGGTAAAACTCCACCGCAGCGTGGAAC >1.7|1462404|42|NC_017384|PILER-CR CGATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGCGGTCGCT >1.8|1462477|37|NC_017384|PILER-CR CAGCGACCAATGGCGCGTAGCGACACCCTATCAGGAC >1.9|1462545|36|NC_017384|PILER-CR CTACGCTATGACCCAGATCCTCCGGGGGCGCGCAAC >1.10|1462612|35|NC_017384|PILER-CR CCCCAGCAGGCGCGGCGCAGACGACAGGCGGGACG >1.11|1462010|34|NC_017384|CRISPRCasFinder,CRT CCCGATCCAACCGCGAAGTGGGCCAGCCGCGATG >1.12|1462076|35|NC_017384|CRISPRCasFinder,CRT ACTGTCACTCCGACTTGGAGAGCATCTACAATGAC >1.13|1462143|33|NC_017384|CRISPRCasFinder,CRT AGTTGGCGTGCGGGCGTCCACATCTGAATGATG >1.14|1462208|34|NC_017384|CRISPRCasFinder,CRT TCATTGTAGATGGTGGTATCGAAACGCGACACCT >1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT TCCCATAAAAAAACCCGCCTCTAAGGGCGGGCTG >1.16|1462340|33|NC_017384|CRISPRCasFinder,CRT TGCTTCCGGTAAAACTCCACCGCAGCGTGGAAC >1.17|1462405|34|NC_017384|CRISPRCasFinder GATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGC >1.18|1462471|36|NC_017384|CRISPRCasFinder AGCGACCAATGGCGCGTAGCGACACCCTATCAGGAC >1.19|1462539|35|NC_017384|CRISPRCasFinder,CRT TACGCTATGACCCAGATCCTCCGGGGGCGCGCAAC >1.20|1462606|34|NC_017384|CRISPRCasFinder,CRT CCCAGCAGGCGCGGCGCAGACGACAGGCGGGACG >1.21|1462405|35|NC_017384|CRT GATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGCG >1.22|1462472|35|NC_017384|CRT GCGACCAATGGCGCGTAGCGACACCCTATCAGGAC |
cas2,cas1,cas4,cas7,cas8c,cas5,cas3 |
CRISPR arrays and Neighbor proteins around NC_017384_1
The CRISPR arrays of NC_017384_1 >merge|NC_017384|1|1461978-1462671|PILER-CR,CRISPRCasFinder,CRT GTCGCTCCCCCCACGGGAGCGTGGATAGAAACCCCGATCCAACCGCGAAGTGGGCCAGCCGCGATGGTCGCTCCCCCCACGGGAGCGTGGATAGAAACACTGTCACTCCGACTTGGAGAGCATCTACAATGACGTCGCTCCCCCCACGGGAGCGTGGATAGAACCAGTTGGCGTGCGGGCGTCCACATCTGAATGATGGTCGCTCCCCCCACGGGAGCGTGGATAGAAACTCATTGTAGATGGTGGTATCGAAACGCGACACCTGTCGCTCCCCCCACGGGAGCGTGGATAGAAACTCCCATAAAAAAACCCGCCTCTAAGGGCGGGCTGGTCGCTCCCCCCACGGGAGCGTGGATAGAAACTGCTTCCGGTAAAACTCCACCGCAGCGTGGAACGTCGCTCCCCCCACGGGAGCGTGGATAGAAACGATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGCGGTCGCTCCCCCACGGGAGCGTGGATAGAAACAGCGACCAATGGCGCGTAGCGACACCCTATCAGGACGTCGCTCCCCCCACGGGAGCGTGGATAGAAACTACGCTATGACCCAGATCCTCCGGGGGCGCGCAACGTCGCTCCCCCCACGGGAGCGTGGATAGAAACCCCAGCAGGCGCGGCGCAGACGACAGGCGGGACGGTCGCTCCCCCCACGGGAGCGGGGATAGAAAG >NC_017384|1|1|1461978-1462670|PILER-CR GTCGCTCCCCCCACGGGAGCGTGGATAGAAA CCCCGATCCAACCGCGAAGTGGGCCAGCCGCGATG GTCGCTCCCCCCACGGGAGCGTGGATAGAAA CACTGTCACTCCGACTTGGAGAGCATCTACAATGAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAC CAGTTGGCGTGCGGGCGTCCACATCTGAATGATG GTCGCTCCCCCCACGGGAGCGTGGATAGAAA CTCATTGTAGATGGTGGTATCGAAACGCGACACCT GTCGCTCCCCCCACGGGAGCGTGGATAGAAA CTCCCATAAAAAAACCCGCCTCTAAGGGCGGGCTG GTCGCTCCCCCCACGGGAGCGTGGATAGAAA CTGCTTCCGGTAAAACTCCACCGCAGCGTGGAAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAA CGATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGCGGTCGCT CCCCCACGGGAGCGTGGATAGAAACAGCGAC CAATGGCGCGTAGCGACACCCTATCAGGACGTCGCTC CCCCCACGGGAGCGTGGATAGAAACTACGCT ATGACCCAGATCCTCCGGGGGCGCGCAACGTCGCTC CCCCCACGGGAGCGTGGATAGAAACCCCAGC AGGCGCGGCGCAGACGACAGGCGGGACGGTCGCTC CCCCCACGGGAGCGGGGATAGAAA >NC_017384|1|1|1461978-1462671|CRISPRCasFinder GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC CCCGATCCAACCGCGAAGTGGGCCAGCCGCGATG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC ACTGTCACTCCGACTTGGAGAGCATCTACAATGAC GTCGCTCCCCCCACGGGAGCGTGGATAGAACC AGTTGGCGTGCGGGCGTCCACATCTGAATGATG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TCATTGTAGATGGTGGTATCGAAACGCGACACCT GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TCCCATAAAAAAACCCGCCTCTAAGGGCGGGCTG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TGCTTCCGGTAAAACTCCACCGCAGCGTGGAAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC GATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGC GGTCGCTCCCCCACGGGAGCGTGGATAGAAAC AGCGACCAATGGCGCGTAGCGACACCCTATCAGGAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TACGCTATGACCCAGATCCTCCGGGGGCGCGCAAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC CCCAGCAGGCGCGGCGCAGACGACAGGCGGGACG GTCGCTCCCCCCACGGGAGCGGGGATAGAAAG >NC_017384|1|1|1461978-1462671|CRT GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC CCCGATCCAACCGCGAAGTGGGCCAGCCGCGATG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC ACTGTCACTCCGACTTGGAGAGCATCTACAATGAC GTCGCTCCCCCCACGGGAGCGTGGATAGAACC AGTTGGCGTGCGGGCGTCCACATCTGAATGATG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TCATTGTAGATGGTGGTATCGAAACGCGACACCT GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TCCCATAAAAAAACCCGCCTCTAAGGGCGGGCTG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TGCTTCCGGTAAAACTCCACCGCAGCGTGGAAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC GATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGCG GTCGCTCCCCCACGGGAGCGTGGATAGAAACA GCGACCAATGGCGCGTAGCGACACCCTATCAGGAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TACGCTATGACCCAGATCCTCCGGGGGCGCGCAAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC CCCAGCAGGCGCGGCGCAGACGACAGGCGGGACG GTCGCTCCCCCCACGGGAGCGGGGATAGAAAG
>NC_017384.1|WP_013384683.1|1461510_1461801_+|CRISPR-associated-endonuclease-Cas2 MLVLVTYDVNTLSDGGKKRLRQVARACEDWGQRVQFSVFEIELDPAQWTKLRARLESIIDAKTDSLRYYFLGTNWERRIEHVGAKPAKDLNGPLII >NC_017384.1|WP_013384682.1|1460471_1461506_+|type-I-C-CRISPR-associated-endonuclease-Cas1 MKKLLNTVYVTTEGAALKKDGENLVAEIEGSEKARVPLHMVASVVTFGPIFVSPALIGTCAERGITIALMDRIGRFQARIEGPVSGNVLLRRAQYRTADDAVDVVRSIVLGKLANQRAVIRRGLRDYGDEMAAPVRDALERASDRIEMILRRVQVKDDSIDLLRGAEGEAATLYFGVFNHLIRSPDATLHWTGRSRRPPLDPMNALLSFLYTLLTHDCRSACEAVGLDPAVGFLHRDRPGRPSLALDLMEELRAPLADRLALSLVNRRQLRAGDFRQMDNGAVLLTDEARKTVLTAWQERKKEERLHPFLNEKAPFGLVPYLQAQMLARHLRGDIEAYPPWFWS >NC_017384.1|WP_013384681.1|1459830_1460475_+|CRISPR-associated-protein-Cas4 MGAEEDAIPLSALQHAVYCLRQAALIHLERLWVANRFTAEGDVLHAVADKGGARRARGVRRVMSLPLASARLNLIGTADLVEFIPGPAGEVAFPVEYKRGKPKLHRADEVQLCAQALCLEEMTGQPVPEGALFYAHTKRRVTVPFDTELRALTQNAAQSLADILASRITPAPTAHKSRCRACSLHEACRPETYARPVLAWRDQMLARSLKDISE >NC_017384.1|WP_013384680.1|1458877_1459828_+|type-I-C-CRISPR-associated-protein-Cas7/Csd2 MTTLSRRHDFVLIFDVTNGNPNGDPDAGNQPRLDPETGHGLVSDVSLKRKIRNYVELAHEGKDGHHIYVQEGAILNEKHRAAYIAKRPGDEKAKTDKKLNPKDDAEAKELRDWMCANFFDVRTFGAVMSTGINCGQVKGPVQMTFATSVERILPAEITITRMAATNEAEKKKAEEGSDGDQRTENRTMGRKHIVPYGLYVAHGFVSAKFAERTGFSEADLDLLLEALKNAFEHDRSAARGEMATRKLIVFKHENALGNAPAHELFDRVRIGRNLAGEFRPIGDSRLDNQPPARKFGDYLIEIDREALPAGVDIIEL >NC_017384.1|WP_013384679.1|1457111_1458881_+|type-I-C-CRISPR-associated-protein-Cas8c/Csd1 MSVLSSLSRAYERLPDAPPYGFSSEKIGFCIVLNADGSVHDVIDKRQDDKKRSPAMLLVPQAVKRTAGIAPNFLWDKTAYVLGVTAGEGKRTADEHAKFREMHVQWLAGTQDEGLLALLRFLDAWTADRFTAPVWPEDMRDQNIVFALASEYRERYIHERSGAKEIWQRLGTEGASDPQICLVSGDPAPIARLHPSIKGVWGAQTAGASLVSFNLDAFTSYGHEQGDNAPVSEVATFQYTTALNRFLEKDSGHRLQIGDASTVFWADASGLISEMAESLFAGMFDTPEAAHDDDSIETKKIAAKLERIRRGERLDEVEPQLTQGVRFHVLSLAPNAARLSVRFYWENDFAQLTRNYQAYLEDTKIEPPPRDGWPPLWRYLVELAVLNKRENIPPNLAGEWTRAILTRTPYPMTLLATALMRIRSDGEVNALRAAVLKSVLVRNFNKEAPVALNPTFSDSKGYLLGRLFAAYEEVQREALGRNVNATIKDKFYGAASASPQKVFATLDSGAQNHFTKLRKINAGRAVNLDQLIMSIMDQMSPDKDPIPAFLNAPEQGLFGLGYYHQRSDFFRKRDTKTDSVTETQPETTA >NC_017384.1|WP_013384678.1|1456350_1457115_+|type-I-C-CRISPR-associated-protein-Cas5 MTYGIRLHIWGTHGLFTRPELKVERASYDVITPSAARGILEAIHWKPAIRWIVDRIHVLEPIRFQSIRRNEVGHKAPAGKIRAAMNRGDLADLQILVDQDRQQRASNVLVAPAYVIEAHFDLTAKAGPDDTVGKHLDIFNRRAAKGQCFNQPSLGTREFVAHFALVPPDAVIPSPDGHSWGAMLHAGQTNGAPRPSRAPEAETSDLGFGTPRDLGFMLWDIDHMAPGRPSMFFRATLRDGIVEIPAPGSPDIKR >NC_017384.1|WP_013384677.1|1453731_1455969_+|CRISPR-associated-helicase/endonuclease-Cas3 MSENRIYAHSGKHADRSDWQLLSDHHDAVSALANARGVKIGLPLAASLAGALHDFGKNDPAFDQVLQGRDIRVDHSTAGGWLALKQAAPKDQPTAEAIAYTILGHHAGLPDKLNDQNSCMTARIETFNRTTNPIPQNLIAQEPPDFAPISHELFAKCAGPNPAFNLSLATRMVFSCLVDADFRDTESYYAKIDAYDRPRDWPSLPDLLPDLRERFDNHMNRLPDTGDVNPIRRHILHHVRAQASQRPGLFTLSVPTGGGKTLASMGFALDHAMAHGHRRIIYAIPYSSVTEQTAGTFRDLFGDVVLEHHAAIDANERDAGQREKLQLAMEDWAAPIVVTTNVQLFESLFSARPSRARKLHNIAGAVIILDEAQCLPRHLLLPTLRMIESLCTHYGCTVVLCTATQPAFDSRQLKEGGLALDDRELAPQPELLAQQLRRAHIRQGGEMSDQDLIDALAGTEQGFVIVNSRAHALELFKAAEQADIKGLVHLTTRQYPSDRRALIAEIREKLKSNQPCRLIATSLIEAGVDLDFPVGWRAEAGLDSCVQAAGRVNREGRRPLAGSILTVFSASDRPMPAAIKTLADAMRSTARRFDDLLNPAAMRDWFEHVYWQAGPKLLDAADIIGKLRVGRGGTDFAFRTIAEAYRMIDTTMVPVIIPGDAIAQREIDRLEVEAVSSGTLARALQHYTVQIPEKSREALRTNGKGDFAAQHLRGAQFFVLTDASLYRRDSGLWWEGADYIADSIW >NC_017384.1|WP_013384676.1|1451581_1453321_-|PQQ-binding-like-beta-propeller-repeat-protein MNPTTLLRTSAAVLLLTAPAAFAQVTPITDELLANPPAGEWINYGRNQENYRHSPLTQITADNVGQLQLVWARGMEAGAVQVTPMIHDGVMYLANPGDVIQALDAQTGDLIWEHRRQLPAVATLNAQGDRKRGVALYGTSLYFSSWDNHLIALDMETGQVVFDVERGSGEDGLTSNTTGPIVANGVIVAGSTCQYSPYGCFISGHDSATGEELWRNHFIPQPGEEGDETWGNDFEARWMTGVWGQITYDPVTNLVFYGSTGVGPASETQRGTPGGTLYGTNTRFAVRPDTGEIVWRHQTLPRDNWDQECTFEMMVANVDVQPSAEMEGLRAINPNAATGERRVLTGAPCKTGTMWSFDAASGEFLWARDTNYTNMIASIDETGLVTVNEDAVLKELDVEYDVCPTFLGGRDWSSAALNPDTGIYFLPLNNACYDIMAVDQEFSALDVYNTSATAKLAPGFENMGRIDAIDISTGRTLWSAERPAANYSPVLSTAGGVVFNGGTDRYFRALSQETGETLWQARLATVATGQAISYELDGVQYIAIGAGGLTYGTQLNAPLAEAIDSTSVGNAIYVFALPQ >NC_017384.1|WP_013384675.1|1450636_1451542_+|LysR-family-transcriptional-regulator MLDIRRLRYFKVIAECGSMAAAARQLNIAQPALSGHIAQIEEHYALQLFQRHARGVTLTPAGEALLRHARRILENLSEAEAELRHLSPVTTRAPVRLGLLPSWGASLAPAIIQATQLALPDIALRIVEMRHDESLDAIRQQNIDLAVVLEDTAPAQTQLLGSEALLYVSHVAVADRMSFRDVAALPLILPSAANLLRHQLDKAAKAAHVVLGPVMEIDGQDTIKSAVKAGVAGSIMSWNSIRNECLDHSLSACLIDDPEITRNVYLRRGEHVPANLAEAFFVVLRQVAEDNSYSRLRHART >NC_017384.1|WP_193365312.1|1449899_1450565_+|4-carboxy-4-hydroxy-2-oxoadipate-aldolase/oxaloacetate-decarboxylase MAHVRRSFTRPSPEAIDAIRPYSPATVHEAQGKLGALDSRIKPLRHGWQICGSAITAQCHIGDNLMIFEAINLAKPGDVLVLSAGNNPEQGGFGDVLAAACRGKGIAGLIIDAGVRDGRGLRAGDFPVFSLGLCVKGTSKDTLGTVNMPVMVGNQLITPGDIIVADDDGVVVVRQEPFALAKACEAREASEAKLIEMHLSGRMEIEDRYDMMRAKGCVWED >NC_017384.1|WP_013384684.1|1462797_1464249_-|aldehyde-dehydrogenase MDLLGLLINNETLPASGGKTFTRKNPISGEVATEAAAATTDDAQRAADAAAAAFPEWSRSSPKTRRTVLLKAADMLEANGPQFVAAMGAEIGATAGWAMFNVTLAADMLREAASLVTQIKGEIIPSNRPGSTAMAVRQPAGVVLAMAPWNAPVILGVRALATPLACGNTVVMKTSELCPRTHHLIVSSLLQAGLPAGVLNAVSNAPEDAAEIVEALIAHPAVRRVNFTGSTRVGRIIAEKAGRYLKPALLELGGKAPFVVLDDADLDEAVAAAAFGAYMNQGQICMSTERIVVMESVADAFVEKLAVKARTLIAGDPREGKTPLGSVVDVSAAQRIEQLIKDATSKGAVLAAGGRIDGTLMDAALLDHVSPAMRIYGEESFGPVVTVVRVGSIDEAVRVANDTEYGLSSAVFGGDVNRALAVARRIESGICHVNGPTVHDEAQMPFGGTKASGYGRFGGNWGIHEFTELRWITVQDGHIHYPI >NC_017384.1|WP_013384685.1|1464276_1464993_-|ABC-transporter-ATP-binding-protein MLKIANLSLRYGRHLALQCVNVAVARGETVVILGANGAGKSSALKAVGGIVRPDAGSVVTLDDVPLLGAPAHQIVDRGLALVPEGRGVFADLTVAENLLLGANPKRARAGEGARRDFVYTLFPRLAERRRQTVRTMSGGEQQMVAIGRALMSNPDYLLLDEPSLGLAPIVVAELFAALRRVKETGVAILLVEQNVALSLSLADRGYLMEAGRIVGEGTADTLRNDPAVQNAFLGGSAA >NC_017384.1|WP_013384686.1|1464985_1465714_-|ABC-transporter-ATP-binding-protein MTVLLQVDGLKKQFGGLMAVNDLSFTVAEGEILALLGPNGSGKTTVMNLISGALPATAGRIQLDGVQISGLPAHRIARLGVARTFQLVRILPSLTVAENVIAALAFRAQPLWGDDAARAAEALLAEVGLAGRGGEYAADLTYIDQKRMELARALGAAPKLLLLDEWLSGLNPTELRVGIALILSLKARGMTIMLVEHIMEAVRALCPRTVVMNAGRKIADGPTNDVLADPAVVAAYLGGVDA >NC_017384.1|WP_013384687.1|1465710_1466637_-|branched-chain-amino-acid-ABC-transporter-permease MSARTLTGLGLAALALVMLAWLPSQLDAYGVGLLLGMTGYVTLATAWALFSGPTRYISLATVAFFGIGAYTVAVLSEAMPYPMVLITAALVGGAVALVVGLATLRLAGVYFVIFSFGLAELVRQLVTWYEVNVTGTLGRYIFLPITAQQIYWQLLALCALTFLIGWLIARSRLGLALRVIGDDEAVAAHTGINIAGAKLALFVISATLITLVGAIQAPRWTYVEPAMVFNPTTSFLTVIMALLGGAHRLWGPILGAIPLFLLFEWLSANFPNHYAIILGLLFITIVFLVPKGVLALVESAFARRRRLQ >NC_017384.1|WP_013384688.1|1466633_1467509_-|branched-chain-amino-acid-ABC-transporter-permease MLLTALISGLVLGGTYALVAMGLTLQYGISRIMNLAYGEFVIFAAFCAFVLFTAYGISPLVGMLLVLPAAFVVGYVLYGVMMQPLVKRSGKTGALDVDAILATFGLMFVVQGVMLVTFGANYTSYSYMNFGVNILGASIAANRLLAFVLAVVIGGALYMVLARTRWGTAIRAVAVAPEAAPLVGINVNHMARLAFALGMVLAAAGGIMISMYQTFNASMGVVFAMKALIIVIMGGQGNLLGALVAGLMLGLVETFVATYIDPGLTLAATYAIFLAVLLWRPSGLFGKGGRA >NC_017384.1|WP_013384689.1|1467608_1468832_-|amino-acid-ABC-transporter-substrate-binding-protein MIMFSRRLALRSFGATIAVAAAMTGFSASAQQTSIKIGYAVSLTGGNAGGAGITTLPNYRLWVSEVNAAGGLELPDGTRLPIEVVEYDDRSSTEEVVRAIERLATQDQVDFILSPWGTGFNLAVAPLLDRFGYPQLASASVTDRADEFAQRWPRSFWLLGGGADYAGGLADVLATATASGVMNGDVAIISVADGFGIDLINAARPAFAAAGLNIVMDRTYPPGTTDFSPMLNEAKSSSATAFVAFSYPPDTFALTQQAQVADYNPAVFYLGVGAPFPTYLGANGANAEGVMSLGGIDTSNAAMMDYRARHEAFAGQPPDSWASMITYAGLEILQEAIKRAGLDRDAVSAEIASGSFTTILGETQLQNNQLRDLWLTGQWQDGTFVAIEPTDRPGASAAIVPKQPWAN >NC_017384.1|WP_013384690.1|1468998_1469877_+|helix-turn-helix-domain-containing-protein MTLRPTHSHLDMLQEVAAQVIATTLDTAAWRMLGRRNRIFVLSAGTGSITYKGESHLLAAPGLVWVPAGAPAQLSLDAGSKGAWLAISDRAILQVDLAGNIAEDMRRFAQRPQFGRKISREMAARLIGLMALMAEELQRSEAGMQEMIRHHLSILAILLWRGSDLRPIAARPAPRVIFSEFLRLVDQHMRAHWRVSDYARYLGVSIDRLTSTVQRDTGQPPLAIIHTRLHAEACQMLETSAMQIAEISASLGFPDPAYFSRFFKRISGYSPRDYRNGLYREVTSGQAWAAWP >NC_017384.1|WP_060486309.1|1470034_1470799_-|MBL-fold-metallo-hydrolase MQITQVRNATLVVEYANTKFLIDPLFAAQGAFPGFAGSASSQLANPLVPLPIAQEQLIAVDAVIVTHLHEDHWDAAASAALPKDMPLFAQNEEDADKIRNEGFTDVRVLTQQSQFNGIGLCKTGGQHGTDATLDVIPLGEVCGVVFSHPQYATLYIAGDTIWNDHVQTAIDSHQPDAIVLNIGNAVFMGYDPIIMGLEDAVAVHRAAPNAILIASHMEAINHCILSRQTLYDYAKANGFDTRLLIPADGETVSV >NC_017384.1|WP_014537828.1|1471050_1472013_-|GlxA-family-transcriptional-regulator MSQNEVLEVGLLIYPGVQMASVLGMTDLFEMANHVNGKDAAKTIRISHWKTTEDDGAPARVFDSFPLAESEPAVLVVPPAFGVPITPEVAQVYAPWLRQRHGGGTALGSVCTGSFLLAETGLLDGRRITTHWTVDAYLRARFPKVALDADQVIVEDGDIMTGGGAMSWIDLGLRIVDRLLGPAIMAETARSLLVDPPHREQRYSSTFAPRLNHGDAAILKVQHWLQATSAKEGDVERLANVAGLEGRTFLRRFKKATGLTTTEYFQRIRVGRAQELLQAGSQSIDQIAWDVGYSDPGAFRKVFTRIIGLSPGDYRKRLRT >NC_017384.1|WP_044008052.1|1472349_1473261_+|polysaccharide-deacetylase-family-protein MRSDTGQYRFAPMRGRPRLHWPGNARMAFWVAPNIEHYELDPPVNPNRSPYARVQPDVLNYGWRDYGNRVGFERMARVMADRGIRGSVSLSVAVIEHFPDIIAQCNDLGWEMFSHGVYNTRYFYGMTEDQQRQVIRDSRESLARVGQTLDGWLTPAITPSLATQDLLAEEGVRYTLDYFHDDQPMPVKVRNGRLISVPYSIEMNDVPMVNWQNASPTAMLDSLKAQFDRLYAEGAGNPNVMCFATHPFLLGQPHRISVLTEFLDYVKSHDQVWYPTAREIADYYYTHHYDRVNTWLASLEESA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_017384_2 | 1749281-1749380 | Orphan |
NA
Consensus repeat of NC_017384_2
|
1 spacers
spacers of NC_017384_2
>2.1|1749306|50|NC_017384|CRISPRCasFinder TGGTATACGGAACTGAACACCGCGCAGCATGTGGGCGCGCCCGGCTTTAT |
CRISPR arrays and Neighbor proteins around NC_017384_2
The CRISPR arrays of NC_017384_2 >merge|NC_017384|2|1749281-1749380|CRISPRCasFinder GGACGAGGGCAAAGCCGCCTTCCGTTGGTATACGGAACTGAACACCGCGCAGCATGTGGGCGCGCCCGGCTTTATGGACGAGGGCCAAGCCGCCTTCCGT >NC_017384|2|2|1749281-1749380|CRISPRCasFinder GGACGAGGGCAAAGCCGCCTTCCGT TGGTATACGGAACTGAACACCGCGCAGCATGTGGGCGCGCCCGGCTTTAT GGACGAGGGCCAAGCCGCCTTCCGT
>NC_017384.1|WP_162467563.1|1748340_1748505_+|hypothetical-protein MAETIPVSRKYTCPPMARLTNFISPQGPQAEAAFFLDAGRKVKDHKNNSSELSY >NC_017384.1|WP_013384834.1|1747145_1748354_-|ROK-family-protein MVSAMTTAEGATEKRKRAIGANPERNRAHNRSLVLNLLREHGQIGRAAMARHTRLTQQAVGNIIDELLLEGMVIETGRLRVGRGQPARQFALNPCGPVSLGVEIAAGHLAIVFQALTGAIRARSIVPLADTAPAPVIAALVAQIEKLKSEAGAPEIIGMGVVMPGPFEIEGISAVGPATLKGWAGLDPAALIADATGIGTVVYENDATAAAVFESLHGVGRGLRDFCHVYFGVGLGLGLIHDGRPLRGAFGNAGEIGQIAVPPRGGGAAAALEDRASVFALRDFLRETRGAPDDLDLLASLDPAEDPALQDWIARAADQLSPVLAILENIFDPETITLGGLMPRPIIEAMIDCLQPLPVTVSSRSARGLPRLMLAQTGPYTAALGAAAMPFMDQNTTATLRQ >NC_017384.1|WP_013384833.1|1744772_1747127_-|PIG-L-family-deacetylase MLTDRNRLFRRIADPRMVRLARALGRLGSTVTMMNTGAHPDDEQTELLAWFSFGRNMRVVIACSTRGEGGQNALGPERGAALGLVRSRELEESARVIDADIAWLGHGPVDPVHDFGFSKDGKDTLERWGRARVIDRLVRAYREYRPDIVLPTFLDVPGQHGHHRAMTEAAEAALALAADPTYEVDGLAPWRVAKYYLPAWSGGGSTYDDELPPPPATVDVRVEGFDPVSGMRYDQIGEASRGYHASQGMGTWRATPRRHWALHGTAPEGDILDGLPATLGALADVAGAPAELALAASEIAKARAAFPDDQAMITALVAAHKAFSAPMGADFDALHGHRIRQKIAEVEAALALAAGVDVAAWLQDPLVPGRSATLAVWVNAGHATLRGIAAAASDGIAQKGGSEERDGLHLLTLAVPADLSPASRYLPGWARLGGNGILAASVTVEVGGITFALPVDLEEEPLLQPAASVTPSADAVLVNLNDPQPVHFAMTGTASAASLGLGSIDGLTVENADGHVTLTASGLAPGKTRLPISVAGAAGWQAKPINYGHIGRLAQVVPAGVDILALDLKIPDGRVGYIGGGADRVGLWLERMGVDVVDLDAEAFDAARANGFAGFDTLVVGIFTFGLRPDLAAATADLRAWVHAGGNLVTLYHRPWDNWKPDETTPAHMVVGSPSLRWRVTRPGAPVTILEPDHDLLAGPNTITHADFDGWDKERGLYFLSSWDQVYQPLLAMSDPDEQPLLGSLVTGRIGKGRHTHTALVLHHQMDRLVPGAFRLMANLIQPA >NC_017384.1|WP_013384832.1|1743884_1744772_-|DMT-family-transporter MKPATALDDQTSAAEMRQGMTWVLLDMALVSAMTVMVKKGGVDFPAVQMVFFRSLVGLVAVLPLVLRHWRVIRQTRNVKRNVFRVTCNAVALSCNWGALTILPLATANAIGFLRPLIVMVMAIFLLSERVTGWRWAGAALGLMGVGVMLLPSLTGMGEAQDHLLGYAFAGGAILFGAMATIQTRALKGENTTVMMVFYTVGLTLFTAIPAFFVWQPVALHHLPHLLGIGIIAQVAQYCYLRGYQLAPASKLAPLGYLSLIFATVMGYVFFDEVPTVYTAGGAIVIIIGLIVARRA >NC_017384.1|WP_014537902.1|1743263_1743878_+|nuclear-transport-factor-2-family-protein MKKLIASFAIAASIAAPVFADTEVRPGVFFAGTVETAGSDRAVMQDLIFDLATAWAVCDRDAMANAITDDVSFSYPTSAVNGREAIMADLEAFCGAATDTSLYFPADAFYIDVDTGRIAAEVQFRTFQRGNRQVVNDVWIATVTDGKVSVIKEYLDGRVKDLQAQGVLQLEESPDFLTPWPPRTEAWASCFPIVRAAPTNDCVQ >NC_017384.1|WP_013384829.1|1742419_1743259_+|bifunctional-5,10-methylenetetrahydrofolate-dehydrogenase/5,10-methenyltetrahydrofolate-cyclohydrolase MTTIFTGFDLAADILQGVRADIATLGRAPVCVTLFDDSSAPARAYLNRQITLARGAGIDLRPMGYADAQLAQLAADARVDAIATLYPLPSGLTPMGAAQAIGGGKDIDGQHPNHAGPLLLGDGTLRPAATAQASLICARAILGDLAGAEIVLIGASRLIGRPLAMLLLDAGATVTTCHIQTRDLARHTRAADLVISAAGVPALLTADNIAKGGRILDLAIIPKDGSLVGDADLPSLMGHAALVSAVPDGVGPVTTACLFANIAAAAKSRAMNLPLLQQD >NC_017384.1|WP_060486272.1|1741631_1742423_+|helix-turn-helix-domain-containing-protein MPATDENRALFEDDKEISLTLARGLDLIEAFAGDERRLSIPELAARTGMNRTVVRRLVRTLEKKGYASADRGQYELTPHILRLIRGFIEGRSLPQIVHPLLRAAAEDIGESVSFAMLDDTEAVYVAHAFLPARFTLNMVTVGSRAPLLPTAVGRVIVAFLPDIERSAILSRLSPQAHTPQTETDAARLDAIFADCRRLDYCMADGEYVEGVASLAVPVFDGMRRVTGALSIIFPTHGHDATEIAEKLAPRMQATASALGSALQ >NC_017384.1|WP_044008068.1|1740787_1741645_+|bifunctional-5,10-methylene-tetrahydrofolate-dehydrogenase/5,10-methylene-tetrahydrofolate-cyclohydrolase MMALLLDGDALAAKLRQQMTERVAASGIRPVMATVLVGDNPASESYVARKHKDCREIGIEALRIRLPAGASPEQVLAEVARLNDDPSVDGFFVQFPLPEGHDEQAIAAAIRPDKDIDGLHPENLGRLITGKGGIPPCTPMAVLSLLRGYNVPLAGKHVVIIGRGLLVGRPLAMVLSAPGVDASVTLLHSQTPDIAAFTRNADVVIAAAGHPELIRADMIRAGATVVGVGITYGDDGAMVSDIAADVSAIAGAVTPAHGSVGSLTRAMLLQNLINLALEKHSHARN >NC_017384.1|WP_013384825.1|1739640_1740786_+|hypothetical-protein MNRILTALTASVAAFAGTTASAGEFATAHYTTPLADVCPSPFYIQKDWLAQAEHGGLYQMIGAGGTMESGAYRGPLGATGIELAILEGGGGIGLGDGETAYSALFNGNSKAGVIPHLGFQELDNAYIFSNLFPVVGVFVPLDIAPSGLIWDTGTYPDGFHSVDDLKAFGESGAGMIYVSTITRTFGLWLVEQGVSRDAFVEGYRGDLENFVANNGTWLNQGFVTTEVFNLSNGMNWAKPVDAVTVNELGYPTITGMVSVAQPRLEELAPCLELLVPIMQQAAVDYINDPAEVNQLIADFTAGGFSASWWRATPELNAYSAAAQRDRGIVGNGNNATIGDFDLDRAAAMLELVKPMLDDRANPDVTVDDVVTNRFINPEIGL >NC_017384.1|WP_014537901.1|1738724_1739585_+|ABC-transporter-permease-subunit MTEIAPKIRTEAYAPTPVAPPRTPAQKFAATFLPPLIMGVLVVLLYWVVRESLPAHRQFLMPSASGMWDKALSQPAVWAELGSRSLTTLTIALTGLAFSIPIGMALGIIMFRFFVMERAVYPFLVALQSIPIMAIIPLIQSALGFGFMPKVLIVILFTFFAIPTTLLLGLKSLDQGVLNLFRLQGASWWTMLRKAGLPSSAPALFAGFRISTSMAVIAAVTSELFFMAGRGGLGQMLVNAKTDFKYEQMYAALIASATLSISIFVVFTLVGNRIFASWYETAERKS >NC_017384.1|WP_013384836.1|1749869_1750805_+|sugar-ABC-transporter-permease MSVTDPNGAAEGRPGLWNRLGIRTKHVLWAWAFLAIPVLFYVVIRFYPTFDAFWLSLTDGNIRRGPSFIGLENYARMYADPVFWKVFGNTFLYLLIGTPVSLVISFTIAYYLDRVRFMHGLIRALYFLPYLTTAAAMGWVWRFLYQPVPIGMINSFLTSIGLEQQPFLRSTDQALMAATIPAIWAGLGFQIIIFMAGLRAIPSSFYEAARIDGLGEWAILRKITLPLLKPTTIFLVVLSSIGFLRIFDQVQSLTANDPGGPLNATKPLVMLIYQTAFSSFRMGYASAQTVILFLVLLLISLLQLWLLRDKK >NC_017384.1|WP_013384837.1|1750801_1751650_+|carbohydrate-ABC-transporter-permease MSASTELAANRRNIRPGRVIAWTLLILGGFLMALPILYMFSTSLKPASDTFDLRLIPAAPTLANYIDILQDGRFIRWFYNSMIIAVAVTASNVFFDSLVGYTLAKFDFRGKNIVFIAILSTLMIPTEMLVIPWYMMSAKLGWLDSHWGIMFPGMMTAFGTFLMKQFFEGVPNDFLEAARVDGLNEFTIWWKIAMPMVLPAISALAIFTFLGNWTAFLWPLISTTSPDLYTLPVGLNSFAVGEAVRWERIMTGAALATIPTLLVFLALQRFIVRGVMLAGLKG >NC_017384.1|WP_013384838.1|1751653_1753171_+|argininosuccinate-lyase MSNPNDPRLTDGSVFPDPVYKETVLRPLFDGAKTHHVAAFGAIDRAHLVMLAETGILPAADAGKIAVARAALDTEIDPATLTYTGEVEDYFFLIEKELKARVGAELGGRLHTARSRNDIDHTLFKLGLRARLNLLIEQAIALHGAIVAKAEAESATLIVAYTHGQPAQPSTLGHYLSAMAEILARDIQRLFEAYRIVNLSPMGAAAITTSGFPINRERVAELLGFAAPLQNSYSCIASVDYITSTYSAMELMFLHLGRPIQDLQFWTSFEVGQIYVPNALVQISSIMPQKRNPVPIEHLRHLASQTVGRAHSMLTIMHNTPFTDMNDSEGETQETGYQAFEVAGRVLTLLAALVAQIKVDPARVASNIRRSCITITELADSTVRREGLSFREGHEIAAAVARAVVAAEGDLTTDGYAPFVTAFKHATGRDPQIDAAAFAQITSPEYFVAVRDRTGGPAPEALAQAISGYKTQNAGFAAQLATLIATQSAADADLATAFNLLKESA >NC_017384.1|WP_013384839.1|1753170_1754232_+|sn-glycerol-3-phosphate-ABC-transporter-ATP-binding-protein-UgpC MAKIELEGLVKDYGKVRAVHGIDLQIEDGEFVVFVGPSGCGKSTTLRMIAGLEDISGGALKIGGKVVNQLEPKQRNIAMVFQNYAIYPHMTVGQNIAFGLYTSKLPKAEKDRLVREAGETLGLTPYLDRRPAALSGGQRQRVAIGRAMVRSPSAFLFDEPLSNLDAQLRGQMRIEIKRLHQRLGTTIVYVTHDQVEAMTMADKIVVMRDGRILQVGSPLDLYENPVDVFTARFIGSPSMNVIEGESDGVNLRLGNSTLPGFGANLPAGKVMVGLRPHDLKVGVPGDATLEAVVTAIEPLGAETLVHMEVAGQPLVGSAPGRVLPVVGSTVTASVTRGVLYVFDAQTEKALGRA >NC_017384.1|WP_013384840.1|1754228_1755134_+|ribokinase MTSKNNGEGVLSLGRIYADLAFAELDAPPTPGREVYAQSFSLTPGGGAVITAAHLVAAGRPAHLLARLGTDPIAVAIASELTALDLDLTYVERAADAGPQLTVAIVTPEDRAFITRRSPRGMPSQAAAALHGAGLRHLHIAEYATLAENPALIVTAKMAGLTISLDPSWDESLIHGPALLAASSGVDVFFPNMDEATALTGKTAPAAALDILAQHFPVVALKCGSAGAMLAVGSTRFSVTAPKTVVVDTIGAGDSFNAGFLDAWLSGLAPEEVLRRAVQRGSQSVMAAGGTGCLSQMKSAS >NC_017384.1|WP_013384841.1|1755146_1756490_-|Ktr-system-potassium-transporter-B MKTAVKGWAARFLSLPPPLVVAGIYIATITMGASLMMLPMAQAMPMRWSDAFFMATSAVTVTGLAVVDVGSHLSLLGQAVLVTLVQLGGLGLMTFAVLILEIVGRPVGLMGEAYLREDLKQNALWRVGRLVRRIAVVVFAIEAVGIAILCLSFIPDLGFWPGLWAAIFHGIGAFNNAGFSIFRTGLMEYVADPIVNLVIPALFITGGIGYFVLHDLIYKRRWRYWSLNTRIMLAGTAVLIPWSVLMFAALEWTNPATLGGLDGIWPRIAASWFQGVTPRTAGFNTLDISGIHDSTAMLFISLMLIGGGATSTAGGIKVTTFVVMILATIAFFRRQTQLHIFGRGIGPDEVLKVMAIVAVSLVLVFCGVFLLSLSHDGHFLDIAFEVASAFSTTGLSRNYTPELNDFGRCVIMVIMFIGRLGPLTLGFFLATQLSPRVRYPQERIHIG >NC_017384.1|WP_013384842.1|1756493_1757156_-|TrkA-family-potassium-uptake-protein MARTEQSFVVIGLGAFGAAVASELARFGNRVMGIDLDERRVAQMVSVLPTALILDATDEIALREAGVDRYDVALVAIGQNIEASILATMNLRLLGLETVWVKAASRVHHRILVKIGADRVILPEQEMGRHIAQMLNNPVVQDYVSLGNGFNVVSIELPKALDGATPKSLGLVGREEPRLMAAMRGTQQLDIANPDLRFAPNDKLILLGRRVVLQAFSDGL >NC_017384.1|WP_013384843.1|1757270_1757618_-|pore-forming-ESAT-6-family-protein MIATSAFAQAPAMPDMDEAVAAANNQLGVLEYCAAEGHIESTAVEVQERLLQVLPPASDPTAVEAAYAAGKEGTIAVSGTEMSLADAATGQGTDVAALCQQLGSMVEQAGASLPN >NC_017384.1|WP_162467564.1|1757972_1758395_+|hypothetical-protein MKKLLGLLALCALAACVDADGRHVLGVSDEQQTFSLTLTGQGTDGARCWAEGHEGTSESLNDVFGRPQVRMKGAIQTGHVRCRMADGTTYMSLINQTVGLGWWSNIAMVRYVEGSMFLRATMDVPHGRRIYQSGMIRLPS >NC_017384.1|WP_013384845.1|1758869_1759076_-|hypothetical-protein MNTAPFVRIALRYIAGGLVSYGILTPEGATAFASDPQVIAQASIVLGAATAALTEGFYALAKRWGWRT |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | NC_017903 | Escherichia coli Xuzhou21 plasmid pO157_Sal, complete sequence | 24723-24756 | 6 | 0.824 |
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | NC_011148 | Salmonella enterica subsp. enterica serovar Agona str. SL483 plasmid, complete sequence | 37052-37085 | 7 | 0.794 |
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | NZ_KU963390 | Escherichia coli strain ECO37 plasmid ECO37P2, complete sequence | 30385-30418 | 7 | 0.794 |
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | NZ_CP010193 | Escherichia coli strain M8 plasmid B, complete genome | 33332-33365 | 7 | 0.794 |
NC_017384_1 | 1.16|1462340|33|NC_017384|CRISPRCasFinder,CRT | 1462340-1462372 | 33 | NZ_AP022566 | Mycolicibacterium alvei strain JCM 12272 plasmid pJCM12272, complete sequence | 45464-45496 | 7 | 0.788 |
NC_017384_1 | 1.5|1462273|35|NC_017384|PILER-CR | 1462273-1462307 | 35 | NC_011148 | Salmonella enterica subsp. enterica serovar Agona str. SL483 plasmid, complete sequence | 37052-37086 | 8 | 0.771 |
NC_017384_1 | 1.10|1462612|35|NC_017384|PILER-CR | 1462612-1462646 | 35 | NZ_CP007130 | Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence | 539756-539790 | 8 | 0.771 |
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | NZ_CP049247 | Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed1, complete sequence | 18758-18791 | 8 | 0.765 |
NC_017384_1 | 1.20|1462606|34|NC_017384|CRISPRCasFinder,CRT | 1462606-1462639 | 34 | NZ_CP007130 | Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence | 539756-539789 | 8 | 0.765 |
NC_017384_1 | 1.5|1462273|35|NC_017384|PILER-CR | 1462273-1462307 | 35 | NZ_CP049247 | Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed1, complete sequence | 18757-18791 | 9 | 0.743 |
NC_017384_1 | 1.5|1462273|35|NC_017384|PILER-CR | 1462273-1462307 | 35 | NZ_CP019707 | Pantoea alhagi strain LTYR-11Z plasmid pPALTYR11Z, complete sequence | 62089-62123 | 9 | 0.743 |
NC_017384_1 | 1.6|1462339|34|NC_017384|PILER-CR | 1462339-1462372 | 34 | NZ_AP022566 | Mycolicibacterium alvei strain JCM 12272 plasmid pJCM12272, complete sequence | 45464-45497 | 9 | 0.735 |
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | NZ_CP019707 | Pantoea alhagi strain LTYR-11Z plasmid pPALTYR11Z, complete sequence | 62089-62122 | 9 | 0.735 |
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | NZ_CP007740 | Bacillus methanolicus MGA3 plasmid pBM69, complete sequence | 16414-16447 | 9 | 0.735 |
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | NZ_LT222315 | Pseudomonas cerasi isolate Sour cherry (Prunus cerasus) symptomatic leaf plasmid p58T3 | 108477-108510 | 10 | 0.706 |
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | NZ_LT963398 | Pseudomonas cerasi isolate PL963 plasmid PP3, complete sequence | 75521-75554 | 10 | 0.706 |
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | CP034538 | Pseudomonas poae strain CAP-2018 plasmid unnamed | 107853-107886 | 10 | 0.706 |
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | NC_008738 | Marinobacter hydrocarbonoclasticus VT8 plasmid pMAQU01, complete sequence | 62733-62766 | 10 | 0.706 |
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | NC_007678 | Salinibacter ruber DSM 13855 plasmid pSR35, complete sequence | 14814-14847 | 10 | 0.706 |
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | NZ_AP022559 | Geobacillus subterraneus strain E55-1 plasmid pGspE55-2, complete sequence | 821-854 | 10 | 0.706 |
NC_017384_1 | 1.22|1462472|35|NC_017384|CRT | 1462472-1462506 | 35 | NC_021289 | Burkholderia insecticola plasmid p1, complete sequence | 984065-984099 | 10 | 0.714 |
NC_017384_1 | 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT | 1462274-1462307 | 34 | NZ_CP029542 | Streptomyces sp. NEAU-S7GS2 plasmid unnamed1, complete sequence | 9497-9530 | 11 | 0.676 |
1. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to NC_017903 (Escherichia coli Xuzhou21 plasmid pO157_Sal, complete sequence) position: , mismatch: 6, identity: 0.824
tcccata-aaaaaacccgcctctaagggcgggctg CRISPR spacer -gcgacacaaaaaacccgcctctaagggcgggtta Protospacer * *.* ************************.*.
2. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to NC_011148 (Salmonella enterica subsp. enterica serovar Agona str. SL483 plasmid, complete sequence) position: , mismatch: 7, identity: 0.794
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer cgacacaaaaaaacccgcctctaaggacgggtta Protospacer . **.********************.****.*.
3. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to NZ_KU963390 (Escherichia coli strain ECO37 plasmid ECO37P2, complete sequence) position: , mismatch: 7, identity: 0.794
-tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer gcttcat-aaaaaacccgcctctaaaggcgggtta Protospacer ...*** *****************.******.*.
4. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to NZ_CP010193 (Escherichia coli strain M8 plasmid B, complete genome) position: , mismatch: 7, identity: 0.794
-tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer gcttcat-aaaaaacccgcctctaaaggcgggtta Protospacer ...*** *****************.******.*.
5. spacer 1.16|1462340|33|NC_017384|CRISPRCasFinder,CRT matches to NZ_AP022566 (Mycolicibacterium alvei strain JCM 12272 plasmid pJCM12272, complete sequence) position: , mismatch: 7, identity: 0.788
tgcttccg--gtaaaactccaccgcagcgtggaac CRISPR spacer --ccgccgacgtgaaacaccaccgcagcgtggaag Protospacer *. *** **.**** ****************
6. spacer 1.5|1462273|35|NC_017384|PILER-CR matches to NC_011148 (Salmonella enterica subsp. enterica serovar Agona str. SL483 plasmid, complete sequence) position: , mismatch: 8, identity: 0.771
ctcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer gcgacacaaaaaaacccgcctctaaggacgggtta Protospacer . **.********************.****.*.
7. spacer 1.10|1462612|35|NC_017384|PILER-CR matches to NZ_CP007130 (Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence) position: , mismatch: 8, identity: 0.771
ccccagcaggcgcggcgcagacgacaggcgggacg CRISPR spacer catcgacgggcgcggcgcagacgacaggcaagacc Protospacer * .*..*.*********************..***
8. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to NZ_CP049247 (Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.765
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer gggcaacaaaaaacccgcctcgaggggcgggctt Protospacer ** ************** *.*********
9. spacer 1.20|1462606|34|NC_017384|CRISPRCasFinder,CRT matches to NZ_CP007130 (Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence) position: , mismatch: 8, identity: 0.765
cccagcaggcgcggcgcagacgacaggcgggacg CRISPR spacer atcgacgggcgcggcgcagacgacaggcaagacc Protospacer .*..*.*********************..***
10. spacer 1.5|1462273|35|NC_017384|PILER-CR matches to NZ_CP049247 (Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.743
ctcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer tgggcaacaaaaaacccgcctcgaggggcgggctt Protospacer . ** ************** *.*********
11. spacer 1.5|1462273|35|NC_017384|PILER-CR matches to NZ_CP019707 (Pantoea alhagi strain LTYR-11Z plasmid pPALTYR11Z, complete sequence) position: , mismatch: 9, identity: 0.743
ctcccataa-----aaaaacccgcctctaagggcgggctg CRISPR spacer -----aaaaggcttaaaaaaccgcctctaagggcggtctt Protospacer * ** ***** **************** **
12. spacer 1.6|1462339|34|NC_017384|PILER-CR matches to NZ_AP022566 (Mycolicibacterium alvei strain JCM 12272 plasmid pJCM12272, complete sequence) position: , mismatch: 9, identity: 0.735
ctgcttccg-----gtaaaactccaccgcagcgtggaac CRISPR spacer -----cccgccgacgtgaaacaccaccgcagcgtggaag Protospacer .*** **.**** ****************
13. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to NZ_CP019707 (Pantoea alhagi strain LTYR-11Z plasmid pPALTYR11Z, complete sequence) position: , mismatch: 9, identity: 0.735
tcccataa------aaaaacccgcctctaagggcgggctg CRISPR spacer ------aaaggcttaaaaaaccgcctctaagggcggtctt Protospacer ** ***** **************** **
14. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to NZ_CP007740 (Bacillus methanolicus MGA3 plasmid pBM69, complete sequence) position: , mismatch: 9, identity: 0.735
tcccataaaaaaacccgcctctaagg-gcgggctg CRISPR spacer aaccataaaaatacccccctctaaggtgtttgtt- Protospacer ********* **** ********* *. *.*
15. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to NZ_LT222315 (Pseudomonas cerasi isolate Sour cherry (Prunus cerasus) symptomatic leaf plasmid p58T3) position: , mismatch: 10, identity: 0.706
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer caaaacgaaaaaacccgcctattagggcgggttt Protospacer . *..************* * ********.*
16. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to NZ_LT963398 (Pseudomonas cerasi isolate PL963 plasmid PP3, complete sequence) position: , mismatch: 10, identity: 0.706
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer caaaacgaaaaaacccgcctattagggcgggttt Protospacer . *..************* * ********.*
17. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to CP034538 (Pseudomonas poae strain CAP-2018 plasmid unnamed) position: , mismatch: 10, identity: 0.706
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer acttcacaaaaaacccgcctttaatggcgggttt Protospacer *.. *************.*** ******.*
18. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to NC_008738 (Marinobacter hydrocarbonoclasticus VT8 plasmid pMAQU01, complete sequence) position: , mismatch: 10, identity: 0.706
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer agaaaacaaaaaacccgcctcaatgggcgggttt Protospacer * ************** * *******.*
19. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to NC_007678 (Salinibacter ruber DSM 13855 plasmid pSR35, complete sequence) position: , mismatch: 10, identity: 0.706
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer agtcagtaaaaaacccgcctttacgggcggggca Protospacer .** *************.** ******* ..
20. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to NZ_AP022559 (Geobacillus subterraneus strain E55-1 plasmid pGspE55-2, complete sequence) position: , mismatch: 10, identity: 0.706
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer gaaaaataaaaaacccgcctcaaaaggcggggtc Protospacer * ************** **.****** *
21. spacer 1.22|1462472|35|NC_017384|CRT matches to NC_021289 (Burkholderia insecticola plasmid p1, complete sequence) position: , mismatch: 10, identity: 0.714
gcgaccaatggcgcgtagcgacaccctatcaggac CRISPR spacer acgacgaatggcgcgtcgcgacacccatctgcaac Protospacer .**** ********** ********* ... .**
22. spacer 1.15|1462274|34|NC_017384|CRISPRCasFinder,CRT matches to NZ_CP029542 (Streptomyces sp. NEAU-S7GS2 plasmid unnamed1, complete sequence) position: , mismatch: 11, identity: 0.676
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer cgtaacgaaaaaacccgcctcggagggcgggact Protospacer . . *..************** .******** .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
175167 : 183561
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_017384|175167:183561|DBSCAN-SWA TTTAGTGGCCCGTTGCCCCAGCCGCAGGCAGCGCGGCCGACCAGCTGAGATTGCGATGCGCATCGCGCAGGCGCAGCAATGACGAGACATGGCTATCCGCGTCCAGCGCCGTCATCACACGGTGCAGATGTTCGGCATCGCGCACATCGACATCAATGCGCAGGCTGTAGAAATCCAGCTTGCGATCGAGGAAATCGAGGTCCGAGATATTGGCGTCCTGCTCGCCAATCAGCGTGCAGACGCGGCCCAGAACGCCCGCATCATTGGCGACAGACAGTTCCAGCGAGACGGTGTTGATCGCGCGGTGCTGCCCCTCTTGCCAGCGCAGATCGACCCAGCGATCGGGCTGATCCTCGTATTCCGACAGGGCCGGGCAGTCGATGGCATGGACGATAACACCCTGACCGCGGAATGTGATGCCGACGATCCGCTCGCCCGGCACCGGCTGGCAGCAGGGCGCACGGCGGAAACTTTGGTCGGGGCTAAGGCCGACGACCGCTTTTTCCGCGTCGATTTCATTCGCATCGCGCAGCTTCAGATCGGGATAGATGGCGCGCACAACCTCGCGCGCGGTAATTTCGGCGGCTCCGACGCGCAGCAAAAGCTGCTCGGCATTCTCGAAAGCCAGCGCGCGCGCGGCGGTGGCCAGCGCTTTGTCAGTGGATTTTTTCCCTGCGTTTTCAAAGGCCACGCGGGTCAGCTCGGTGCCAAGCTTGATGAACCGGTCACGGTCCTTTTCGCGCAGCCAACGGCGGATCGCGGATTTCGCGCGGCCGGTGACGGCAATGTCGATCCATGTCGCCTGCGGGGTCTGGCCGTCGGCGATGATAATCTCGACCGATTGACCGTTCTTCAGCCGTGTCCACAGCGGCACGCGCAGCCCGTCGACCTTGGCGCCGACGCAGGCATGGCCGATGCGGGTGTGGATCGCATAGGCGAAATCAATCGGGGTCGCGCCTTGCGGCAGCTTGATTACTTCGCCCTTGGGGGTAAAGCAAAAGACCTGATCTTGATACATCTCAAGCTTGAACGTCTCGAGGAATTCGTCGTGGTCCTGATCCTCTTCGAACCGCTCGGAGAGTTGCGCGATCCAGCGGGCGGGATCGACGACAAAGCGGTTCTGCACCGGCTCGCCGTCGCGATAGGACCAGTGGGCCGCGACGCCTGCCTCGGCCACTTCGTGCATTTCGCGGGTGCGGATCTGCACCTCGACCCGCTTGCCGCCGCGCGCGGACACAGCCGTATGGATCGAGCGGTAGCCGTTCGATTTGGGCTGGCTGATGTAATCTTTGAACCGGCCGGGGACCGAGCGCCAGCGTTGATGGATCGCCCCCAGCGCGCGATAGCAATCGACATCGGTTTTCGTGATCACGCGAAAGCCATAGATGTCCGACAATTGCGAGAAGCTTTGATCCTTCTCCTGCATTTTGCGCCAGATCGAATAGGGCTTTTTCGCGCGGCCATGCACCTCGGCGGGAATGCCGGATTTCTCGAACTCGGCCAGCAGGTCGGTTTTGATCTGATCGACCAGATCGCCCGATTCCTGTTGGATCAGGCTGAACCGCTGGATGATGGAATCGCGCGCCTCGGGGTTGAGGACGCGAAAGGCCATATCCTCCAGCTCTTCACGCATCCATTGCATCCCCATGCGGCCGGCAAGCGGCGCATAGATATCCATGGTTTCGCGGGCTTTTTTCACCTGCTTTTCCGGCCGCATCGAGGCGATGGTGCGCATATTGTGCAAGCGGTCGGCCAGCTTGACCAATGTCACCCGCAAATCGCGCGAGGTTGCCATGATCAACTTGCGGAAATTCTCGGCCTGTTTGGTTTCAGCCGAATGCAGTTGCAGATTGGTCAGTTTGGTGACGCCATCGACCAGATCGGCAATGACCGCGCCAAAGCGGGCCTCGACCTCGGCATAGGTGGCGCGGGTGTCCTCGACCGTATCATGCAGCAGGGCGGTGATGATCGTGGCATCATCCATCTGTTGCTGCGCCAGGATCATCGCGACGGCTACGGGATGGGTGAAATAGGGCTCGCCCGAATGGCGGTACTGGCCGTCATGCATCAGACGGCCGAATTCCCAGGCATCGCGCAGCAGGGATTCATTGGTGGCGGGATTATAGCTGCGCACGCGCGCGATCAGATCGTCTTGCGTGATCAGCGGTTCGGGCACGACTTGCGCCGCGCCCGCAGGCAGGGTGGGGGCGTTGTTATCCACCCCCGTCGACAAAGCCACATCAGACCGCGTTACATCAGGCGGGCCCTGATCATTCGCCGCCGCGCTGGCTTTGTCACTGCCTTCCATGGGTGTGGCCTCAGCGGCCCTCTTGCGCTTGCAGCAGCTCGCGCAGCAGGCGCTCTTCGGTCATGTCGTCCTGGGCGGGACGATCCAGCTCGGCGCTCATCAGCAGAGCCATCGAATCGTCTTCGGGCTCGTCGACTTCGATCTCGTGCTGGTTGGCTTCGATCATCCGCTCGCGCAGGTCATCGGCCAGCTGGGTTTCTTCGGCGATCTCGCGCAGCGATACAACGGGGTTCTTGTCGTTGTCGCGCGGCACAGTCAGGGCCGACCCTGCAGCGATCTCGCGTGCACGATGTGCGGCCAGCATGACCAGTTCGAAACGGTTCGGAATCTTGTCTACACAATCTTCGACGGTGACACGTGCCATGCGACTACTCCCCTTCGGTGGGTTTGACGGGTCTTGCCGGTAGCTACCCGCACGAATCCGATATATCGTGCAGCGACCGGGGGAAAACATTCATTTAGAGGGGGCAGACGCGGAACGCAAGGGCCGCAGGCAGGATAGCGAGGATATCCGTGTCAAATTCGCCGTTGGGGAGGGGTATCCGTTAAAACCGCGCGGTCTTATTTTAGAGTTGCTCTAAATTTGGTTAACGCAACTTTATTATAGACGAGGATTTTCCGCGTGTTTTCGGTAATTTGGGGGAACTTTAGACTTTATGCAGCATTAGATTCGCGTTTTACTTGCACCGCGAGGTGAAGTTGTTAACTAATTTGTTATCCGCTGCACATCGCACTGATGTTTCATTTCCTGTTTCCGTAGCAGACCGCGATTGGAAGGATAATATATGTTTTACCGGGACGAGCGGATTGCGCTCTTTATCGATGGGGCGAATCTGTATGCCGCATCGAAATCCTTGGGGTTTGATATCGACTACAAACTGTTGCGTAGCGAGTTCATGCGACGCGGACGGCTGATCCGTGCTTTCTACTATACAGCGCTGCTTGAGAACGAGGAATATTCCCCGATCCGACCATTGGTTGACTGGCTGCATTACAACGGCTTTTCAATGCGTACAAAGCCCGCGAAAGAGTTTCAGGACGCCCAAGGCCGCCGCAAGATCAAAGGCAATATGGATATCGAGCTGACCGTCGATGCGATGGAATTGGCGCCCCATGTCGATCACATTGTTCTGTTCTCGGGCGATGGCGATTTCCGTCCATTGATCGAGGCGCTGCAGCGGCGCGGCGTGCGCGTATCGGTTGTATCCACCGTGCGTAGCCAGCCGCCGATGATTGCCGATGAATTGCGCCGTCAGGCTGATAATTTCATCGAACTGGACGAGCTGCGCGATGTGCTGGGTCGCCCGCCGCGCCCCGATGCCCGCCCCGGTATGCCGCGCGACGAGACGGTCGAGACGCAAAGCCTGCTGGATTAAACCACGCCCGGCGCCCTTTACGGGGGCGCCGCATCGGCTTATCTGAAAGGGCAACGCTGTCTCGGAGCCTCGCCCATGTCCCTGCCTCCTCTTACGGTCTATCTGGCCGCACCGCGCGGCTTTTGTGCCGGTGTGGATCGCGCGATCCGTATTGTGGAAATGGCGCTGGAAAAATGGGGCGCGCCGGTTTTTGTGCGCCACGAGATTGTTCATAACAAATATGTCGTCGACGCGCTGCGCGCCAAAGGTGCAGTCTTTGTCGAGGAATTGGATGAATGTCCCGAAGATCGCCCTGTGATCTTTTCGGCGCATGGCGTGCCGAAATCTGTCCCGGCCGAGGCGGTGCGCCGCAATATGATCCATGTCGATGCCACCTGTCCGCTGGTGACAAAGGTGCATAACGAGGCCGCCCGTCATCATACCAACGGTTTGCAGATGATCATGGTCGGTCACAAGGGCCACCCCGAGGTCATCGGCACCATGGGCCAACTGCCCGATGGCGAGGTGATGCTGGTCGAGACGCTCGCCGATGTCGCGACGGTTCAGGTGCGTGACCCCGCGCGCCTGGCGATGATCACACAGACCACATTGTCGGTCGATGATACTGCCGAGATTGCCGCCGCGCTGAAAGCCCGCTTTCCGGCGATCAATGTCCCCGCGAAAGAGGACATTTGCTATGCCACCACCAATCGGCAAGAGGCGGTCAAAGTGATGGCCCCCAAATGCGATGCGATCCTTGTGGTCGGCGCGCCCAATTCCTCGAACTCGAAACGTCTGGTCGAGGTCGGCAGCCGCGCCGGTTGCGATTACTCGCAGCTTGTCCAGCGCGCGGATGAGATTGATTGGCGGGCCTTGCAGGGCATCCGCACATTGGGTGTAACCGCCGGCGCCTCGGCCCCCGAAATTTTGATCGAAGAAGTGATCGATGCGTTTCGCGCCCATTATGACGTAACGGTTGAACTGGTCGTGACCGCCGAAGAACGGGTAGAGTTCAAAGTTCCCAAAGTCCTGCGCGAGCCTGCCTGATATGCCTGAATTCATCTGCTTTACCGACGGCGCCTGTTCGGGCAACCCGGGGCCCGGCGGTTGGGGCGTTTTGATGCAGGCGCGCGAGGGTGGGGCCGTGGTCAAAGAGCGACCGCTCTGCGGCGGCGAGGCGATGACCACCAATAACCGCATGGAATTATTGGCCGCGATCAATGCTTTGGAAAACTTTACGCGTTCCAGCACCATCACCATCGTGACCGACAGCGTCTATGTGAAAGACGGCATTGGCGCGTGGCTGTTCAACTGGAAGCGCAACGGCTGGCGTACCTCGCAGGGCAAGCCGGTCAAGAATGATGATCTGTGGCGGCGTCTGGATGCCGAGGTGCAGCGCCATCAGGTGACGTGGAAATGGGTCAAGGGTCACGCGGGCCATCCCGAGAATGAACGCGCGGACGAACTCGCCCGCCAAGGCATGGCCCCGTTCAAGGCCGCCCGCGCGCTTTAACGCTGGAAAGCGCAGGGCTTGCCTGCTAGCCAGTCGGGCGATGCATCTAAAGGCCCGTCCATGTCCAATTATATCCTGACCGTTACCTGCGCTACGACCCGTGGCATTGTTGCTGCTGTCTCGGGGTTCTTGGCGGAAAACGGTTGTAATATCACCGATTCCGCGCAGTTCGACGATGTGCTGACGGGCAAGTTTTTCATGCGTATCAGCGTCACCAGCCAAGAGGGCGCGACGCTTGCCGATCTGCAAAGCCGCTTTGCAACTGTGGGTGCGCGCTTTGGCATGGAATTTGCCTTTTTTGACGCCAGCGAACGGGTCAAGGCGGTGATCATGGTCAGCCGTTTTGGCCATTGTCTGAATGATCTGCTGTATCGCCAGCGCATCGGCGCGCTGCCCATTGATATCGTGGGGGTGATCTCGAACCACTTCGAATATCAAAAGCTGGTGGTGAACCACGATATCCCCTTCCACCACATCCGCGTCACGCCCCAGAACAAGCCCGAGGCCGAGGCCGCCCAGATGCAGATCCTGCGCGAGACCGGTGCCGAGCTGGTGGTGCTGGCCCGTTATATGCAGATCCTGTCGGACGAGATGTGCCGCGAGATGTCGGGGCGGATCATTAATATCCACCACTCGTTCCTGCCCAGTTTCAAAGGGGCAAACCCCTATAAACAGGCGTACGAGCGGGGCGTAAAGTTAATTGGCGCGACGTCACATTATGTAACGGCAGATCTGGACGAAGGCCCGATCATCGAACAAGATACGGTCCGCGTGACTCATGCGCAGTCGCCCGAGGATTACGTCAGCCTTGGTCGCGATGTTGAAAGTCAGGTTCTGGCGCGTGCGATCCACGCGCATATTCACCGCCGTGTCTTTATCAACGGCAATAAAACCGTCGTCTTCCCGGCCTCGCCCGGATCTTATGCATCGGAACGGATGGGATGAAATATCTATTAGCCGCATTATTGCTGGTGCCTTTGCCCGCCTTTGCGCAGGGCGAGGATAAGGGCGATTGCCCTGATGCCTCCTCGACATCGGAAATCGTTGTTTGCCTGAACGAGCTGTATTCAACGGCGCAGCTGGAAATGCAGGTGCGGCTGGACGGGTTGGTCGCGGGCATGGCATCCAGCAATCGCGTTGCGGCGCTGAATGCGGCGCAAGCTGTCTGGAAAGCCTTTCGTCAGTTGGAATGCGAATCGCAGGCGCTGATCGCCGAAGGTGGCACCCTTGCCAATGTGCTGGGGGCCAGTTGCTATCTGCATATGACGCGCGACCGGATTGTGGCGTTGAACGCCTACGATCAGACCAATTAAAGCCGCGCGGGCAGTGTCAGGCCCAGCAGCGCGTCGGCCACCGGCATGGCGCGTTTGCCCTCGCGCTGGATGTCCAGCACGCGCAGGGCGCCGCTGCCGCAGGCCACGGTAAAGCCGTCTAGCACGGTGCCAGCCGCTGCCGTGCCGGGCACCGCCTCGGCGCGCAGCAGTTTCACCCGCTCATCCCCGATCATGCACCACGCACCCGGAAAAGGGGACAGCCCGTTGATCTGGCGGGCAACCGTGGGGGCGGGGCGCGTCCAGTCGACCAGCGCTTCGGCCTTGTCGATCTTGGCGGCATAGGTCACGCCATCCTCGGGCTGTACCTGCGGGATCAGGCCGCCCAGCCGTTCCAGCGTATCGATAATCATCCTTGCGCCCATCTGGGAAAGGCGCTGGTGCAGATCGCCGCTGGTATCCGTCGCACCGATCGGGGTTGCCGCGCGCAGCAGCACCGGGCCGGTATCAAGGCCCGCCTCCATCTGCATGATGCAGACGCCGGTTTCCGCATCGCCCGCCATGATTGCGCGCTGGATCGGGGCGGCACCGCGCCAACGCGGCAAAAGGCTCGCGTGGATGTTCAGGCAGCCATGTTTGGGCGCATCCAGCACCACCTGCGGCAAGATCAGGCCATAGGCGACCACGACCGCGACATCGGCATTCAGCGCCGCAAATTCGGCTTGCTCATCCGCGCCGCGCAGCGATTTGGGGTGGCGCACCATCAGCCCCAGGCTTTCGGCGCGGGCATGGACCGGCGTGGGGCGGTCTTTTTTGCCGCGACCGGCAGGGCGGGGCGGCTGGCAATAGACGCAGGCGATCTCGTGCCCGGCCGCGACCAGCGCCTCCAAAACCCCAACGGAAAAATCAGGGGTGCCCATAAAGACGACGCGCATCAGCCGAACTTCCTTGCCTTGCGCAAAAACATGTCGCGCTTGATTTTGCTAAGGCGGTCAAAATACATCTTGCCCGCCAGATGGTCGATCTGATGCTGCATCGAGGTGGCCCAAAGCCCGACCAGATCGCGTTCCTCGACCTCGCCCCATGCGTTCAGGAACCGCACCGTGACGGCACGGGGTCGGCTGATGACAGCGCTGATCCCCGGCAGATTTGGCGACGCTTCCTCGTGCTCGCGCAGTTGCACCGAGGCGTGCAAAATCTCGGGGTTGGCCATGCGGATCGCCTGACCGCGCGCATCCGATGCATCGACCACGGCCAGCGCCAGCGGCACGCCCAGTTGCACCGCGGCAAGGCCGACGCCCGGCATCGCATCCATCGCCTCGACCATCTCGTCCCATAGGGCGGTGATCTCGGGGGTGATCGCCTCGACCTGGGCGGCGGGTTTGCGCAGCACGGGGGCGGGCCACATCACAAAGGGACGGTGCATCTATTCGCCCCGCGCGCGTTCACGCTTTAGCTTTTCCATCTTGCGGGTGATCAACTGCCGCTTCATCGGACCCAGATAGTCGATGAACAGCTTGCCGTCCAAATGGTCGATCTCGTGCTGGACGCAGGTGGCCCATAGGCCCTCCATCTCGCGGTCCTGTTCATTGCCGTTCAGATCCAGCCAGCGCACTTTGACCGAGGCGGGACGCTCGACCTCGGCATATTGGTCGGGGATCGACAGACAGCCTTCCTCATAGACCGAGCGGTCGTCCGAAGACCAGACGATCTGCGGGTTCACCATGACCAAGGGCTGCGGCGCGTCCGGGTCTTTGGCACAATCCAGCACGATGATGCGCTGCAATTGACCGACCTGCGGCCCCGCAAGGCCAATGCCGGGCGCGTCATACATGGTTTCCAGCATGTCATCGGCCAGCCGACGGATCTCGTCCGAGATGTCGGGCAGCGGCTTTGCGATGGCGCGCAGGCGCGGATCGGGGTGGATAAGAATAGGGCGCGTTGTCAT
Protein sequences of DBSCAN-SWA_1 >NC_017384|175167:183561|179877_180342_+|WP_013383423.1|DBSCAN-SWA MPEFICFTDGACSGNPGPGGWGVLMQAREGGAVVKERPLCGGEAMTTNNRMELLAAINALENFTRSSTITIVTDSVYVKDGIGAWLFNWKRNGWRTSQGKPVKNDDLWRRLDAEVQRHQVTWKWVKGHAGHPENERADELARQGMAPFKAARAL >NC_017384|175167:183561|181283_181655_+|WP_013383425.1|DBSCAN-SWA MKYLLAALLLVPLPAFAQGEDKGDCPDASSTSEIVVCLNELYSTAQLEMQVRLDGLVAGMASSNRVAALNAAQAVWKAFRQLECESQALIAEGGTLANVLGASCYLHMTRDRIVALNAYDQTN >NC_017384|175167:183561|178925_179876_+|WP_014537458.1|DBSCAN-SWA MSLPPLTVYLAAPRGFCAGVDRAIRIVEMALEKWGAPVFVRHEIVHNKYVVDALRAKGAVFVEELDECPEDRPVIFSAHGVPKSVPAEAVRRNMIHVDATCPLVTKVHNEAARHHTNGLQMIMVGHKGHPEVIGTMGQLPDGEVMLVETLADVATVQVRDPARLAMITQTTLSVDDTAEIAAALKARFPAINVPAKEDICYATTNRQEAVKVMAPKCDAILVVGAPNSSNSKRLVEVGSRAGCDYSQLVQRADEIDWRALQGIRTLGVTAGASAPEILIEEVIDAFRAHYDVTVELVVTAEERVEFKVPKVLREPA >NC_017384|175167:183561|177484_177838_-|WP_013383421.1|DBSCAN-SWA MARVTVEDCVDKIPNRFELVMLAAHRAREIAAGSALTVPRDNDKNPVVSLREIAEETQLADDLRERMIEANQHEIEVDEPEDDSMALLMSAELDRPAQDDMTEERLLRELLQAQEGR >NC_017384|175167:183561|182547_183039_-|WP_013383427.1|DBSCAN-SWA MHRPFVMWPAPVLRKPAAQVEAITPEITALWDEMVEAMDAMPGVGLAAVQLGVPLALAVVDASDARGQAIRMANPEILHASVQLREHEEASPNLPGISAVISRPRAVTVRFLNAWGEVEERDLVGLWATSMQHQIDHLAGKMYFDRLSKIKRDMFLRKARKFG >NC_017384|175167:183561|180402_181287_+|WP_013383424.1|DBSCAN-SWA MSNYILTVTCATTRGIVAAVSGFLAENGCNITDSAQFDDVLTGKFFMRISVTSQEGATLADLQSRFATVGARFGMEFAFFDASERVKAVIMVSRFGHCLNDLLYRQRIGALPIDIVGVISNHFEYQKLVVNHDIPFHHIRVTPQNKPEAEAAQMQILRETGAELVVLARYMQILSDEMCREMSGRIINIHHSFLPSFKGANPYKQAYERGVKLIGATSHYVTADLDEGPIIEQDTVRVTHAQSPEDYVSLGRDVESQVLARAIHAHIHRRVFINGNKTVVFPASPGSYASERMG >NC_017384|175167:183561|175167_177330_-|WP_193365330.1|DBSCAN-SWA MITQDDLIARVRSYNPATNESLLRDAWEFGRLMHDGQYRHSGEPYFTHPVAVAMILAQQQMDDATIITALLHDTVEDTRATYAEVEARFGAVIADLVDGVTKLTNLQLHSAETKQAENFRKLIMATSRDLRVTLVKLADRLHNMRTIASMRPEKQVKKARETMDIYAPLAGRMGMQWMREELEDMAFRVLNPEARDSIIQRFSLIQQESGDLVDQIKTDLLAEFEKSGIPAEVHGRAKKPYSIWRKMQEKDQSFSQLSDIYGFRVITKTDVDCYRALGAIHQRWRSVPGRFKDYISQPKSNGYRSIHTAVSARGGKRVEVQIRTREMHEVAEAGVAAHWSYRDGEPVQNRFVVDPARWIAQLSERFEEDQDHDEFLETFKLEMYQDQVFCFTPKGEVIKLPQGATPIDFAYAIHTRIGHACVGAKVDGLRVPLWTRLKNGQSVEIIIADGQTPQATWIDIAVTGRAKSAIRRWLREKDRDRFIKLGTELTRVAFENAGKKSTDKALATAARALAFENAEQLLLRVGAAEITAREVVRAIYPDLKLRDANEIDAEKAVVGLSPDQSFRRAPCCQPVPGERIVGITFRGQGVIVHAIDCPALSEYEDQPDRWVDLRWQEGQHRAINTVSLELSVANDAGVLGRVCTLIGEQDANISDLDFLDRKLDFYSLRIDVDVRDAEHLHRVMTALDADSHVSSLLRLRDAHRNLSWSAALPAAGATGH >NC_017384|175167:183561|178259_178850_+|WP_013383422.1|DBSCAN-SWA MFYRDERIALFIDGANLYAASKSLGFDIDYKLLRSEFMRRGRLIRAFYYTALLENEEYSPIRPLVDWLHYNGFSMRTKPAKEFQDAQGRRKIKGNMDIELTVDAMELAPHVDHIVLFSGDGDFRPLIEALQRRGVRVSVVSTVRSQPPMIADELRRQADNFIELDELRDVLGRPPRPDARPGMPRDETVETQSLLD >NC_017384|175167:183561|181651_182548_-|WP_013383426.1|tRNA|DBSCAN-SWA MRVVFMGTPDFSVGVLEALVAAGHEIACVYCQPPRPAGRGKKDRPTPVHARAESLGLMVRHPKSLRGADEQAEFAALNADVAVVVAYGLILPQVVLDAPKHGCLNIHASLLPRWRGAAPIQRAIMAGDAETGVCIMQMEAGLDTGPVLLRAATPIGATDTSGDLHQRLSQMGARMIIDTLERLGGLIPQVQPEDGVTYAAKIDKAEALVDWTRPAPTVARQINGLSPFPGAWCMIGDERVKLLRAEAVPGTAAAGTVLDGFTVACGSGALRVLDIQREGKRAMPVADALLGLTLPARL >NC_017384|175167:183561|183039_183561_-|WP_013383428.1|DBSCAN-SWA MTTRPILIHPDPRLRAIAKPLPDISDEIRRLADDMLETMYDAPGIGLAGPQVGQLQRIIVLDCAKDPDAPQPLVMVNPQIVWSSDDRSVYEEGCLSIPDQYAEVERPASVKVRWLDLNGNEQDREMEGLWATCVQHEIDHLDGKLFIDYLGPMKRQLITRKMEKLKRERARGE |
10 | Synechococcus_phage(50.0%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
696004 : 707603
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_017384|696004:707603|DBSCAN-SWA CATGTTCGGATTTGGCGAGAAAAAACAACCGGCGCCGCTGGTCGAGGTCAAAGCCTCGGCGGCGGGAAAGGTGGTGGGTTTCGGCACAGCCGGTCGCACCGGATTCCAGCCGCGCGAGGGATCATCGCTGGTGCGCGCGGGCTTTGCCGCCAACCCCATCGGCTTTCGTGCCGTGCGCCTGATCTCTGAGGCGGCCTCGGCCCTGCCGCTGATCTTGCAAGACGCGACGCAGCGCTATGACACCCACCCCGTGCTGGATCTGCTGGCCCGCCCCAATCCCGCCCAAGGCCAGCTTGAACTGTTCGAGGCGATCTACGCCCAGCTTTTGTTGACCGGAAATGCCTATGTCGAGGCCGTGTCGGATGGCGCGCTGCCCACGGAACTGCATGTCCTGCGCAGCGATCGTATGTCGGTCGTGCCGGGGCCCGATGGCTGGCCCACCGGTTATGACTATGCGGTGGCCGGGCGCAAACACCGCTTTGACGCCGCCGCGATCTGCCATATCCGTGCCTTTCACCCCCATGACGACCATTACGGCCTGTCGCCCCTGACGCCCGCGGCGGCGGCGGTCGAGGTGCATAATGCCGCCTCGCGCTGGTCGCGGGGGCTTTTGGAGAACGCGGCGCGGCCTTCGGGCGCGATTGTCTTTCGCGGCGCCGATGGCAACGGCACCTTGTCCAACGGCCAATTCGACCGGCTGGTGGCCGAGATGGAAAGCCAGCATCAGGGCGCGAGGAATGCCGGACGCCCGATGCTGCTTGAGGGCGGCCTCGATTGGAAGCCGATGGGCTTTTCGCCCTCGGACATGGAATTCCTGCAAACCAAAGAGGCCGCCGCGCGCGAGATTGCCACCGCCTTTGGCGTGCCGCCCATGTTGCTGGGCATCCCCGGGGACGCGACCTATGCCAATTACCAAGAGGCGAACCGCGCCTTTTACCGCCTAACCGTGCTGCCTTTGGCGTCGCGGGTCACGGGTGCGCTGGTGAATTGGCTGGACGATTTTGCGGGCACCTGGCTGGATCTGCGGCCCGATCCCGACCAGATCGCCGCCTTGCAAACCGAACGCGACGCCCTGTGGGCACGCGTGGGCGCCGCAAGCTTCCTCTCGACCGCCGAAAAACGCGCCTTGCTCGGCCTTCCCGGAGAGCCAGATGGAGCCGCGTGAGCGCCCCTTCCTCTGCGCCCCCGGCCTGAAGATCGAGGCGCAGGAGCGGCTGGTCGCGCTGCAATTCCAGCAATTGCAGCAACAGCTTGCCCGGGTAGAGGCGCTGATCGAGCGGTTAGAAAAACGTCTGTGGCTGACGGTTTACGGCGTTTTGGGCGCGATTCTGGCGCAGGCGTTTCAATCATTCCTGTCGGTTGCTCCCGGTTAAATGGAGGGGATTTTGGATCTCGAATATAAATACGCAACGCTATCCGCGCCCGATCCTGCGGGCGTGAGCGTCGCAGGCTATGCCTCGGTCTTTGGGCTGCGCGATCAAGGCGGCGATATCGTGCAAAAGGGGGCCTTTGCCGCTTCGCTCGCGCGTCTTGCCGCTGCTGGCACCAAGGTGCGGATGCTGTGGCAGCACGACCCCAGCCTTCCTATCGGCGTCTGGGACGAGGTCACCGAGGATGCCACCGGCCTGCGCGTCAGCGGCCGCCTGCTGCCCGAGGTCGCCAAGGCCGCCGAAGTTTCCGCGCTGCTGGCGGCCGGTGCGATCGACGGCCTTTCCATCGGCTATCGCACCCTGCGCTCTACCAAATCGGACACAGGCACCCGCCTCTTGCACGAGGTCGAGCTGTGGGAGGTGTCGCTGGTGACATTCCCCATGCTGCCCAACGCCCGCGTACACACCAAAACCGACGCCGCGCTGATCGCTGCTTTGCGCAGCGCCCGCGCCACCATCCGCAACCTCTAGGAGCCGCCATGGATACGCCCTCCGTAACCGACGAGATGAACGGTTTCATCTCTGATTTCAATGTCTTTGCAGGCGAAGTGAAACAACGTCTTGAACAGCAGGAGACCCGCATGACCCGTCTTGACCGCAAATCCGCCTACCGTCCCGCCTTGTCCGCTGCCGTCGACACCGACGCGCCGCATCAAAAGGCCTTTGACGCCTATTTGCGCTCGGGCGACGATGACGGCCTGCGCCATATCGAGATCGAGGGCAAGGCGATGTCCACCGCCGTCGCCGCCGATGGCGGCTATCTGGTCTCGCCGCAAACCGCGCAAACGATCCAGTCGGTGCTCAATGCCACCGCCTCGATCCGCGCGATCTCGAGCGTCGTGAATGTCGATGCCAGCAGCTATGACGTGCTGGTCGACCGGACCGAGCCGGGCGCGGGCTGGGCGAGCGAGACCGGCACCGTTGCAGAAAGCACGACCCCGGTCATCGACCGCATTTCGATCCGCCTGCATGAACTCTCGGCGCTGCCCAAAGTCTCGCAGCGCCTGCTCGATGACAGCGCCTTTGATCTTGAAGACTGGCTCGCCACCCGCATCGCGCAGCGCTTTGCCCGCGCCGAGGCGGCCGCTTTCGTCAATGGCGATGGCGTCGATAAGCCGAACGGCTTCCTGACGGTGACGAAAGTCGCGAATGCCAGCTTTAGCTGGGGCAACCTCGGCTATGTCGCAACCGGCTCGACCGCAGCGCTGCCTGCGGATTCGATCGTCGATCTGGTCTATGCGCTGGGCGCCGAATACCGCGCCGGCGCCAGCTTTGTGATGAACTCCAAAACCACCGGCGTTTTGCGCAAGCTGAAAGACAGCGACGGCCGCTTTTTGTGGTCTGACGGCCTTGCCGCGGGCGAGCCTGCGCGCCTGATGGGCTATCCCGTTCTGATTGCCGAGGACATGCCCGATATCGCCGCCAATGCCTTCCCCGTCGCCTTTGGCAATTTCACCGCTGGCTACACGATTGCCGAACGCGCCGACCTGCGCGTCTTGCGCGACCCGTTCTCGGCCAAGCCGCATGTGCTGTTCTATGCGACCAAACGCGTCGGCGGTGCGGTCACCGATTATGCCGCGATCAAGCTGCTGCGCTACGCGACCGCGTAAATCCAACGGGCGGGGCCATCAGCCCCGCCATCCCCCCAATGATCACACGGAAGGGCTGGGCGCGATGATGCTGGTCGAAGAAACAACGGTGGATGATGCCGCCTTGCCGGTCGCGGCGCTGGGTGCCTTCCTGCGCCTCGGCTCTGGCTTTGGCACGGATGGGTTGCAAGATGACCTGCTGCGCGCCTTCCTGCGCGCCGCCCTTGCCGCGATCGAGGGGCGTATCAACAAAATCCTGATCGCTCGCAGCTTTGCGCAGCAGATGACATCCGGTCAGGCGATGGCGGTCGGCCCTTTGCGCGCGGTGCTGTCGGTGACGGTGGATGGCACGGCGCAGCCCTATGCGCTGGCGGGCACAACAATCACCGCGCCGACAACGGCCAAGCTGACCGTCCGCTATGACGCCGGGCTGGCGGATGATTTCGCCGCGCTGCCCGCCGATTTGCAACAGGCGGTGCTGATGCTGGCGGCGCATTATTACGAATACCGCCAGGACCCGGCGCTGGACGGCGCCTGTATGCCCTTTGGCGTCTCGGCCCTGACCGAACGCTATCGCACGCTGCGTCTGTCGATGGGGGCCCGCACATGAGGCCGCCGCGCATGAACCGCGCCCTAATCCTGCAGGCCCCGACCCGCACCCCCGATGGCGCGGGCGGCTATACGCAAGGCTGGCAGACCCTCGGCACGCTTTGGGCTGCGGTGACACCTGCCACGGGGCGCGAGGCGGCGGCGCTGGGCGCGGCACTGGCGCGCGTGCCGGTGCGCATCACCCTGCGCGCCGCGCCCGCCGGCGATCCCCGCCGCCCCATCGCGGGCCAGCGGCTCACCGAAGGGCCGCGCAGCTTTCTGATCCTAGCGGTGCAAGAGACCAGCGCCCGCCTGCTGACCTGCATTGCCGAGGAGGAGCTGGTGCGATGACCCAATCCCTTGCCCTGCAACAGGCGCTGTATACACGGCTGACGGCCGCGCTCGATGGCGTGGACATCTATGACGCGCTGCCCAGCGGCCCCGTGCCCGCGCTTTACGTCGCCCTCGGCCCCGAGGAGGTCGAAGACCTTTCCACGCATGAGGGCGCGCTGACCATCCACGAGGTGAAAATCTCGGTCATCGCGACGGGTGGCGGCTTTGGCAGCGCCAAGACCATTACCACCGCCATCACCGAGGCGCTGGCCGCGCCGCTCACCCTGCCGTCCTTTACCGCCAGCCCCGCGCAATTCCTGCGCGCCTCGGCCAAGGGCACATCCGCCTCGGGGGCCGAGCGCCGCATCGACCTGTTCTTCCGCATCCGTATCGAACCCTAAAGGAGCCTCATATGAGCGCGCAAAACGGCAAGGATCTCCTCATCAAGATCGACATGACGGGCGATGGCCTGTTCGAGACCGTCGCGGGGCTGCGCGCCTCGCGGATCAGTTTCAACGCCGAAACGGTGGATGTCACCACGATGGAAAGCCAGGGCGGCTGGCGCGAATTGCTGGCCGGGGCGGGGATGCGTTCGGCCTCCGTGTCCGGCGCGGGCGTGTTCCGCGACCAATCCACGGACGAGCGGATGCGCGCCCTGTTCTTTTCCGGCGAGGTGCCCGCCTTTCGCATCATCATCCCGCATTTCGGCGCGATCGAGGGCCGCTTTCAGATCACCGCGCTGGAATATGCCGGCACCTATAATGGCGAGGCCACCTATGACGTGACCCTCGCCTCGGCAGGCGCACTGACATTCGAGGCCGAGGTATGAGCGCCAATCCTCATGCGGGCGAGGTCGAAATCCCGCTCGACGGCGTCATCCACATCGGCCGTCTGACCCTTGGCGCACTGGCGCGGCTGGAGGCGGATCTGCAATCGGGCAATCTGCCCGATCTGGTCGCGCGGTTTGAAAGTGGCGATATCCGCACGGCGGATGTGCTGGCGCTGATCGTTGCGGGCCTGCGCGGCGGCGGCTGGCAGGGCACGGCGGCGGATCTCGAGCAGATCGACGTGGGCGGCGGGCCGCTGGCAGCGGCGCGTATTGCAGGCCAGCTTTTGGCCCGCGCCTTTGCAAGCGCGGGATAAGCCCATGGATTTCCCCGGCCTTTTGCGCCTTGGCCTGCAGCATCTGCGCCTGAAACCCGCCGAATTCTGGGCGCTGACGCCGATTGAACTGATGCTGATGCTGGGCCTTGCAGCAGGCAGCCAACCCATGGCGCGCGCGCGCCTTGATGCGCTCGTCCGTGCCTATCCCGATCACGCCCCACTCCAGGAGGCCACCGATGGCTGACACCACCACCGCCGAGCTTCAGCAAACCCAATCCGTCACCGCCGCCTTCAACGCGGGCCTGCGCGAGATGCGCGGCACGCTATCGGCGACCTCGCGCGATGTGGCGGGGCTGGAACGCGGCCTCTCGCAGGGGCTGCGCCGCGCCTTTGACGGGCTGGTGTTCGACGGCGACCGCCTCAGCACGGCGGTCAGCAGCATCGCGCAAAGCGTTCAAAACGCCGCCTATAACGCCGCCATGCGGCCCATCACCGACAAGATCGGCGGCTGGTTGGCCAGTGGCATCGAAAGCCTGATGCCTTTCGCTGAGGGTGGCACCTTCACCCAAGGCCGCGTCATGCCCTTTGCCAAAGGCGGTGTTGTCACCGCCCCCACCACCTTTCCGATGCGCGGCGGCACCGGGCTGATGGGCGAGGCGGGGCCCGAGGCGATCATGCCGCTGACGCGCGGCGCCGATGGCCGTTTGGGCGTCGCGGCGCAAGGCGGCGGCGGCGTCAATCTGGTGATGAACATCCAGACGCCCGATGCCGCCAGTTTCCACCGCTCGCAAAGCCAGATCGGCGCGCAGGTCTCGCGCCTTGTGGCGCGCGGCAACCGCAACCGCTGACAGGGGGGCATCATGGCCTTTCACGACATCCGCTTTCCCGCCGCCATCAGTTTTGACTCGCTCGGCGGCCCGACGCGGCGCACGGAAATCGTCACGCTGACGAGCGGCTATGAACAGCGCAACACCGCTTGGGCCCATTCCCGCCGCCGCTATGACGCAGGCGTCGGCCTGCGCTCGTTGGATGATGTCGCGCAGCTCATCGCCTTTTTCGAGGCGCGCGGCGGGCAATTGCATGCCTTTCGCTGGAAAGACTGGTCGGATTACAAATCCTGCGCGCCGTCCGCCGCGATTTCCGAAATGGATCAGACGCTTGGATATGGCGATGGCGCAACCGCCGACTGGCCGCTGGTGAAAAACTATGTCTCGGGCGAGGGCGCTTATGCCCGCCCGATCACCAAACCTGTCGCCAATACCGTCCAGATCGCCGTCGCCGGTCAAAAGCTGGACGAGGGGACGGATTACACGCTGAACCTTGGCCTTGGCCGCGTGATCTTTGCCAGCCCGCCTGCGCCGGGGGCCGAAATCAGCGCGGGCTTTGAATTCGACGTCCCCGTGCGGTTTGAAACCGACACGATCCAGATCTCGGTCTCGTCCTTTCGGGCGGGCCAAATCCCCTCCGTCCCCCTGATCGAGGTGCGCCCATGAGCGACCATTCCACCACCCGCTGCACCGCCTGGGCCATCACCCGCACGGACGGGCTGCAACTGGGCTTTACCGATCACGATGGCGATCTGACCTTTGCGGGCCTGACGTTCCGCGCGGGCGCGGGCATGAGCGGCGCGGCACTGGTGCAAGGCGCAGGCCTTGCTGTCGACAATACCGAAGGCTTTGGCATGATCACCGATGATGCCGTGGGCGAGGGCGACCTGCGGGCAGGCCGTTTCGACGGGGCCGATATCCGCATCTATCAGGTCAACTGGCGCGCCCCTGCCGACCGCAGCCTGATCTTTCACGGCACTTTGGGCGAGATCACCCTCGAGGATGGCGCATGGCGGGCCGAGCTGCGCGGCGCGGCCGAGGCGCTGTCCCGTCCCATCGGGCGCAGCTATCAGCGCGGCTGCGCGGCGGTGCTGGGCGATGCCGCCTGCGGCTTTGACCTCGATACCCCCGGCTTTGCGATGGATGCCGCGCTGATCGCGGTGGATGACACCACGCTGACCATCGCCGCGCCGGACCTTGATCCGCGCTGGTTCGAACGCGGCGTTGTGAAAATCACCACGGGGGCTGCGGCTGGCCTCTCGGGCATGATCAAATCCGACGCCAGCTTGGGCGCACAGCGGCTGATCTCGCTCTGGTCGCCCTTGGGGGTGCAACCGCAGGCAGGCGATCAGATCCGCCTGCTGCCCGGCTGCGACAAACGCCTCGCCACCTGCCGCGCGAAATTCGGCAATCTGCACAACTTTCGCGGCTTCCCCCATATCCCGGGCGAGGATTGGCTGATCGCCGCCCCTAAAACAAACGGCAGCGGCGAGAGCCTGTTCCGATGACACCTGACACCCTTGCCCGCGCATGGATCGGCACGCCTTTCGTGCACGGCGCTAGCCTGCAAGGGGTCGGGGCCGATTGCCTTGGCCTCATCACCGGCCTTTGGCGGCAGATTTACGGCCCCGCGCCGTGGCCGCTGGACTACAGCCCCGACTGGTCCGTAACGCTGGGGCCAGATGCGATGGCGCGCGCCGCCGACCGCTATCTGCCGCGCGCAGCGCAGCTTTATCCCGGCGCGCTGATGCTGTTGCGCCTGCGCCCGCATTTGCCGCCCGCGCATCTGGCCATCTGCGCGGGGCCGACGTTCATCCATGCCTTCCACGCGGGCGGCGTCGTTGAAAGCCCCCTCAGCCTGCCGTGGCGCCGCCGCATCGCAGGCCTTTACCATTTCGCCCCTAAACAGGAGTCCTAACATGGCAACGCTGGTTCTTTCTGCCGTTGGCGCCTCGGTCGGTGCCTCGATCGGGGGCGGGATTTTGGGCCTCTCCTCGGCCGTCATCGGCCGCGCTGTCGGCGCTGTTGCAGGCAGCCTGATCGACCAGCGCATCTTGGGCGGCGGCGCGCAGCCCGTTGAAACCGGCCGCATCGACCGCTTTCGCGTCACAGGTGCGTCCGAAGGCGCCGCGATGGCGCGCCTTTATGGCCGGATGCGTGTCGGCGGGCAGGTGATCTGGGCGACCAAATTCATGGAAACCAGCACCCAGACCCGCGCTGGCAAAGGTCAGCCAAAGACCACCACTTTCAGCTATACCACCTCGCTGGCCATTGCGCTGTGCGAGGGGCCGATCAATGGCATCGGTCGCATCTGGGCCGACGGGACCGAGATTGCGCCCACCGACCTAAGCCTGCGCCTTTACCACGGCCATATGGATCAACTGCCCGACCCCCGTATCAGCGCGGTCGAGGGGGCAGACAACACCCCCGCCTATCGCGGCACCGCCTATGTGGTGATCGAGGACCTCGACCTTGGCCCCTATGGCAATCGCGTGCCGCAGTTTTCATTCGAGGTGATCCGCAACGATCCCGCCCGCGATGATACATTCGCAGGCGCGGTGCAGGCTGTCGCCATGATCCCCGGCACCGGCGAATATGCCCTGTCGGATACACCCGTCGCCCTGCGCTATTCCTATGCCGATGAAGGCACACAGAACGAAAACACCCCCAGCGGCCAAAGCGACTTTCTGACGGCGCTTGACCAGTTGAATACCGAACTGCCGCGCGTGACATCCGTCTCGCTGGTGGTCTCTTGGTTCGGCGATGACCTGCGCGCGGGCCAGTGCAAGGTGCAGCCAAAGGTCGAACAGACCGCCTTTGATGCCCCCGACCAGCCGTGGCGGGCAGGCGGCATCACGCGCAGCGCCGCGGCAACCGTGCCCCGCGTCGGCGGCTCGCCCATCTATGGCGGCACGCCGTCCGATGCCGCAGTCATCAGCGCCATCCGCACCATCCGCGCGCGCGGGCAGGAAGTGATGTTCTATCCCTTCATCCTGATGGATCAACTGGCGGATAACACCCTGCCGAACCCCTGGACGGGGCAGGCTGGTCAGCCGCCCTTGCCATGGCGCGGCCGCATCACGACCAGCCTTGCCCCGGGTCAACCCGGCACAACCGATGGCACAGCCGCCGCACGGGCCGAGGTTGCGGCCTTTTTCGGCACCGCCACCCCCGCGCATTTCACCCGCACCGGCGAGCGGGTGCATTATACCGGCCCCAATGAGTGGTCGCTGCGCCGCTTCATCCTGCATTACGCGCATCTATGCGCGGCGGCGGGCGGCGTCGACAGTTTCTGCATCAGCTCGGAAATGGTGGCGCTGACGCAAGTGCGCGACGATATCGGCTTTCCCGCCGTCAGCGCACTCATGGCGCTGGCCGCCGATGTGCGCAGCATCCTCGGCCCCGATACCCTGATCACCTATGCCGCGGATTGGAGCGAATATCACGGCTACCAACCGCTTGGGACGGGCGACAAGCTGTTCCACCTCGACCCGCTTTGGGCGCATGAGGATATCGACTTCATCGGTATCGACAATTACATGCCGCTGTCGGATTGGCGCGACGGCGATAGCCACTTGGACGCGCAGGCGGGCGCGATCTATAACCTCGATTACCTGACCGCCAATGTCGCAGGCGGCGAGATGTACGATTGGTTCTACCACTCGCCCGAGGCACGAGATGCGCAAATTCGCACTGCAATCACAGATGGTTACGATCAGCCTTGGATGTGGCGCGTGAAGGATATCTTAGGGTGGTGGAGCCATGCGCATTTCGACCGCGTGGACGGCGCGCAGGGCCCGCAAAGCCCTTGGCTGCCGCGTTCCAAACCGATCCGCTTTACCGAAATCGGCTGCGCCGCCATCGACAAAGGCACCAACCAGCCGAACAAATTCCTCGATCCGAAATCCTCGGAATCGGCGCTGCCGTACTATTCCAACGGCCTACGCGACGACTTTATCCAGCTTCAATATCTGCGCGCCCTAAACCGCCATTTCGCCGATCCCTCGCAGAACCCGACCTCTGAAATCTACGACGGCCCCATGGTCGAAATGGACTACGCGCATGTCTGGGCGTGGGACGCGCGGCCCTATCCGTGGCTCCCCGCGCGCGGCGATCTGTGGTCGGATGGCGCGAATTACGACCGTGGCCATTGGCTGAACGGCCGCGCCGGCGGGCAGGCTTTGGCCGCCGTCGCGGATCAGATTTGCACGGATGCGGGTCTGTCCGCGAACACCGATGCGCTTTGGGGCATGGTGCAGGGCTATGCGATGGACCGGATCGAGACGGGGCGCGCCGCGCTGCAACCGTTGATGCTGGCGCATGGGTTCGATGCGGTCGACCGCGACGGTGCGTTGCACCTGATCACCCGCCACGGCCGCCCCATCGCCACGCGCGAGATGGATGATCTGGTGGCCCACGACGCCCCCGCGCTGGTGCGGACCCGTCTGCCCGAGGCCGAGCTCGCCGGTCAGGTCCGCGTGGCTTTTGTCGCAGCGGGCGGCGATTTCAGCATCGGCGGGGCCGAGGCGACGCTTGCCGATACGCCGCGCGATACGGTCTCGACCTCGGACCTGCCGCTGCTGATGTCGCGCGCCGACGCCACCCGCGCCGCCGAGCGCTGGCTGCTGGAATCCCGTCTCGCGCGCGAGGTGGCGACATTCACGCTGCCGCCCTCGGACGCATGGCTGCGCGTGGGTGATGTGCTGACGCTTGCGGGCGATGACTACCGCATCGACCAGCTAGAGCGGGCCGAGGCGCTGGGCATCACCGCCACCCGCACCAGCCGCAGCCTGTTCCTGCCCCATGATGCGGTGGAGGATATCCCCCAGCCCGCCGCCTTTGCGCCGCCGATGCCGGTCGCGGCAACCTTCCTTGATCTGCCGTCCGAGACCGGCCCCAGTTTCGCTGTCGCCCTCACTTCGGCCACATGGCCGGGCGAGGTCGCGATCCACGCCGGGCCGCCGTTGGTGGAACTCGCCCGCAGCGCCGCCCCGGCGGTGGTGGGCGAGACGCTGAACGATCTATCGGCCGCCCGCGCAGGGATCTGGGATCGCGGCCCCGCGCTGCGGGTGCGGCTGGTGTCCGGCACGCTCGCCTCGCACCTGCCCGAGGCATTGCTATCGGGCGCGAACCTTGCCGCTATCGGCGATGGCACCAGTGATATCTGGGAGGTCTTTCAATTTGCCGAGGCCGCGCTGGTGGCCCCGAACGAATACGCCCTCAGCCTGCGCCTGCGCGGTCAGGGCGGCAGTGATGGGGTGATGCCGCCCGTCTGGCCCGCAGGGTCGCGGTTCGTGCTGTTGGACAATCGCCTGACGCCGCTTGATGTGCCGCGCGGTGTGTCGCGCGACTGGCATTGGGGGCCGGTGCAACGCCCGATGAGCGACCGCACTTGGCGGCAGGCCAACCGCGCCTTTACCGGCGTGGGCTTGCGTCCCTATGCGCCCTGCCATCTGCGCGTGAGTGATACCGCCGTGACATGGCAACGCCGCACCCGCAGCGGCGGCGACAGTTGGGACGGCATCGACGTGCCGCTGGGTGAGGAGCGCGAGCTGTACCGCCTGCGCATGTATCAATCCGGCGCGCTGCTGCGCGAGGTGATGCTGGACACGCCCGCCTTCGCCTATCCCGCCGCCATGCGCGCGGCAGATGGGGCGGGTGTGACGGTCGAAGTGGCGCAGATGTCCCAAGTCTTCGGTGCGGGGCCCGCGCTGGTTGGGGCGATCTGA
Protein sequences of DBSCAN-SWA_2 >NC_017384|696004:707603|699628_699961_+|WP_013383951.1|head,tail|DBSCAN-SWA MRPPRMNRALILQAPTRTPDGAGGYTQGWQTLGTLWAAVTPATGREAAALGAALARVPVRITLRAAPAGDPRRPIAGQRLTEGPRSFLILAVQETSARLLTCIAEEELVR >NC_017384|696004:707603|697376_697904_+|WP_013383947.1|head,protease|DBSCAN-SWA MEGILDLEYKYATLSAPDPAGVSVAGYASVFGLRDQGGDIVQKGAFAASLARLAAAGTKVRMLWQHDPSLPIGVWDEVTEDATGLRVSGRLLPEVAKAAEVSALLAAGAIDGLSIGYRTLRSTKSDTGTRLLHEVELWEVSLVTFPMLPNARVHTKTDAALIAALRSARATIRNL >NC_017384|696004:707603|699957_700344_+|WP_014537611.1|DBSCAN-SWA MTQSLALQQALYTRLTAALDGVDIYDALPSGPVPALYVALGPEEVEDLSTHEGALTIHEVKISVIATGGGFGSAKTITTAITEALAAPLTLPSFTASPAQFLRASAKGTSASGAERRIDLFFRIRIEP >NC_017384|696004:707603|700355_700772_+|WP_013383953.1|tail|DBSCAN-SWA MSAQNGKDLLIKIDMTGDGLFETVAGLRASRISFNAETVDVTTMESQGGWRELLAGAGMRSASVSGAGVFRDQSTDERMRALFFSGEVPAFRIIIPHFGAIEGRFQITALEYAGTYNGEATYDVTLASAGALTFEAEV >NC_017384|696004:707603|699107_699632_+|WP_014537610.1|DBSCAN-SWA MMLVEETTVDDAALPVAALGAFLRLGSGFGTDGLQDDLLRAFLRAALAAIEGRINKILIARSFAQQMTSGQAMAVGPLRAVLSVTVDGTAQPYALAGTTITAPTTAKLTVRYDAGLADDFAALPADLQQAVLMLAAHYYEYRQDPALDGACMPFGVSALTERYRTLRLSMGART >NC_017384|696004:707603|703793_707603_+|WP_013383960.1|DBSCAN-SWA MATLVLSAVGASVGASIGGGILGLSSAVIGRAVGAVAGSLIDQRILGGGAQPVETGRIDRFRVTGASEGAAMARLYGRMRVGGQVIWATKFMETSTQTRAGKGQPKTTTFSYTTSLAIALCEGPINGIGRIWADGTEIAPTDLSLRLYHGHMDQLPDPRISAVEGADNTPAYRGTAYVVIEDLDLGPYGNRVPQFSFEVIRNDPARDDTFAGAVQAVAMIPGTGEYALSDTPVALRYSYADEGTQNENTPSGQSDFLTALDQLNTELPRVTSVSLVVSWFGDDLRAGQCKVQPKVEQTAFDAPDQPWRAGGITRSAAATVPRVGGSPIYGGTPSDAAVISAIRTIRARGQEVMFYPFILMDQLADNTLPNPWTGQAGQPPLPWRGRITTSLAPGQPGTTDGTAAARAEVAAFFGTATPAHFTRTGERVHYTGPNEWSLRRFILHYAHLCAAAGGVDSFCISSEMVALTQVRDDIGFPAVSALMALAADVRSILGPDTLITYAADWSEYHGYQPLGTGDKLFHLDPLWAHEDIDFIGIDNYMPLSDWRDGDSHLDAQAGAIYNLDYLTANVAGGEMYDWFYHSPEARDAQIRTAITDGYDQPWMWRVKDILGWWSHAHFDRVDGAQGPQSPWLPRSKPIRFTEIGCAAIDKGTNQPNKFLDPKSSESALPYYSNGLRDDFIQLQYLRALNRHFADPSQNPTSEIYDGPMVEMDYAHVWAWDARPYPWLPARGDLWSDGANYDRGHWLNGRAGGQALAAVADQICTDAGLSANTDALWGMVQGYAMDRIETGRAALQPLMLAHGFDAVDRDGALHLITRHGRPIATREMDDLVAHDAPALVRTRLPEAELAGQVRVAFVAAGGDFSIGGAEATLADTPRDTVSTSDLPLLMSRADATRAAERWLLESRLAREVATFTLPPSDAWLRVGDVLTLAGDDYRIDQLERAEALGITATRTSRSLFLPHDAVEDIPQPAAFAPPMPVAATFLDLPSETGPSFAVALTSATWPGEVAIHAGPPLVELARSAAPAVVGETLNDLSAARAGIWDRGPALRVRLVSGTLASHLPEALLSGANLAAIGDGTSDIWEVFQFAEAALVAPNEYALSLRLRGQGGSDGVMPPVWPAGSRFVLLDNRLTPLDVPRGVSRDWHWGPVQRPMSDRTWRQANRAFTGVGLRPYAPCHLRVSDTAVTWQRRTRSGGDSWDGIDVPLGEERELYRLRMYQSGALLREVMLDTPAFAYPAAMRAADGAGVTVEVAQMSQVFGAGPALVGAI >NC_017384|696004:707603|697154_697376_+|WP_013383946.1|DBSCAN-SWA MEPRERPFLCAPGLKIEAQERLVALQFQQLQQQLARVEALIERLEKRLWLTVYGVLGAILAQAFQSFLSVAPG >NC_017384|696004:707603|702536_703382_+|WP_013383958.1|DBSCAN-SWA MSDHSTTRCTAWAITRTDGLQLGFTDHDGDLTFAGLTFRAGAGMSGAALVQGAGLAVDNTEGFGMITDDAVGEGDLRAGRFDGADIRIYQVNWRAPADRSLIFHGTLGEITLEDGAWRAELRGAAEALSRPIGRSYQRGCAAVLGDAACGFDLDTPGFAMDAALIAVDDTTLTIAAPDLDPRWFERGVVKITTGAAAGLSGMIKSDASLGAQRLISLWSPLGVQPQAGDQIRLLPGCDKRLATCRAKFGNLHNFRGFPHIPGEDWLIAAPKTNGSGESLFR >NC_017384|696004:707603|701090_701291_+|WP_013383955.1|tail|DBSCAN-SWA MDFPGLLRLGLQHLRLKPAEFWALTPIELMLMLGLAAGSQPMARARLDALVRAYPDHAPLQEATDG >NC_017384|696004:707603|701907_702540_+|WP_013383957.1|DBSCAN-SWA MAFHDIRFPAAISFDSLGGPTRRTEIVTLTSGYEQRNTAWAHSRRRYDAGVGLRSLDDVAQLIAFFEARGGQLHAFRWKDWSDYKSCAPSAAISEMDQTLGYGDGATADWPLVKNYVSGEGAYARPITKPVANTVQIAVAGQKLDEGTDYTLNLGLGRVIFASPPAPGAEISAGFEFDVPVRFETDTIQISVSSFRAGQIPSVPLIEVRP >NC_017384|696004:707603|703378_703792_+|WP_013383959.1|DBSCAN-SWA MTPDTLARAWIGTPFVHGASLQGVGADCLGLITGLWRQIYGPAPWPLDYSPDWSVTLGPDAMARAADRYLPRAAQLYPGALMLLRLRPHLPPAHLAICAGPTFIHAFHAGGVVESPLSLPWRRRIAGLYHFAPKQES >NC_017384|696004:707603|696004_697168_+|WP_013383945.1|portal|DBSCAN-SWA MFGFGEKKQPAPLVEVKASAAGKVVGFGTAGRTGFQPREGSSLVRAGFAANPIGFRAVRLISEAASALPLILQDATQRYDTHPVLDLLARPNPAQGQLELFEAIYAQLLLTGNAYVEAVSDGALPTELHVLRSDRMSVVPGPDGWPTGYDYAVAGRKHRFDAAAICHIRAFHPHDDHYGLSPLTPAAAAVEVHNAASRWSRGLLENAARPSGAIVFRGADGNGTLSNGQFDRLVAEMESQHQGARNAGRPMLLEGGLDWKPMGFSPSDMEFLQTKEAAAREIATAFGVPPMLLGIPGDATYANYQEANRAFYRLTVLPLASRVTGALVNWLDDFAGTWLDLRPDPDQIAALQTERDALWARVGAASFLSTAEKRALLGLPGEPDGAA >NC_017384|696004:707603|701283_701895_+|WP_013383956.1|tail|DBSCAN-SWA MADTTTAELQQTQSVTAAFNAGLREMRGTLSATSRDVAGLERGLSQGLRRAFDGLVFDGDRLSTAVSSIAQSVQNAAYNAAMRPITDKIGGWLASGIESLMPFAEGGTFTQGRVMPFAKGGVVTAPTTFPMRGGTGLMGEAGPEAIMPLTRGADGRLGVAAQGGGGVNLVMNIQTPDAASFHRSQSQIGAQVSRLVARGNRNR >NC_017384|696004:707603|700768_701086_+|WP_013383954.1|DBSCAN-SWA MSANPHAGEVEIPLDGVIHIGRLTLGALARLEADLQSGNLPDLVARFESGDIRTADVLALIVAGLRGGGWQGTAADLEQIDVGGGPLAAARIAGQLLARAFASAG >NC_017384|696004:707603|697912_699043_+|WP_013383948.1|capsid|DBSCAN-SWA MDTPSVTDEMNGFISDFNVFAGEVKQRLEQQETRMTRLDRKSAYRPALSAAVDTDAPHQKAFDAYLRSGDDDGLRHIEIEGKAMSTAVAADGGYLVSPQTAQTIQSVLNATASIRAISSVVNVDASSYDVLVDRTEPGAGWASETGTVAESTTPVIDRISIRLHELSALPKVSQRLLDDSAFDLEDWLATRIAQRFARAEAAAFVNGDGVDKPNGFLTVTKVANASFSWGNLGYVATGSTAALPADSIVDLVYALGAEYRAGASFVMNSKTTGVLRKLKDSDGRFLWSDGLAAGEPARLMGYPVLIAEDMPDIAANAFPVAFGNFTAGYTIAERADLRVLRDPFSAKPHVLFYATKRVGGAVTDYAAIKLLRYATA |
15 | Paracoccus_phage(37.5%) | tail,protease,head,portal,capsid | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
752663 : 774711
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_017384|752663:774711|DBSCAN-SWA GTTAATTTCGCGTTTTCCACATTTGCCCAGAATGTTTTCCAATGTTCTGATCTTGTTCCGTTCCGACAACGGCGCGGCGACGGTCAGCTGACTTGGTATAGTGATCGACCTCTTTCAGTGTTTCGTGCCCCGTCCAAGCGCTAATCTGGTGCGCCGTGGCACCGCCCTCTGCCAATTCGGTGGCGAGGCTTTTGCGCAGCCCGTGCGCGCTTTTCTTAAAACCAGCTTCCATCGCAGCCTCGCGTATCATCGTTCCGAGAGATTTTTCCGACCGGGCTTTGCCGCGTGCATCTAGATAGGTCATCTGACCCGCAAGTGCCTCGAGGGCGTGGTGCATGATGGCCCGGTCTTCGGAAAGGTGCGCTGCGAAATCAGGTAGGGGGCACGACCATGGAATATGTGCCGGACCGTCAGTCTTAACCTGCCGATATGTCAGCACGCCACCACGCACCATTCCCGGCCCAAGGCGAACAGCATCGCTGATCCGCGCGCCGCTAAAGCGCAGAAGCTCGAAGGCAGCACGTGTTGCTGATCCCATTGGCCAGCGTGTGCGAAAAGCTGCGATTTCATCCTGCGTCCATGGTGGGTGGCCGCCTTTGCTCTTTGCCTTAGGCATGGCGACGTCGAGCGCCTCGTTGCGCTCCAGCAGATTGGTCGCCACCCCGAAACTACAGAGGAAGCGCCAAGTGCGAAGTCGGTGGCCTTGTGAAGTGCTTTTCGGAAGGTTTGCCCGGATATGCCGCGATCTAAGTCCCCGCGCGGGAAGCTCGCCGAATTCTTCCACCAGCGCATCGCAATGGCGGCGGAACATGGCGCGGGTAGCTAGTTTCGATTGGCTATAATGGTCTGACTTTAGCGCAGCTCGCACCAACGCTGCAAAAGATCCAGCGGCTGCTGGTGACTTCTCGGGGTTGGCGGACTTTGCTTCTGCGTAGGCCCTTAGGAAGGCTGGATCGTCCATCGGCAAGTCCGGCAGCTTCACCAAAGGAAAGCCTCGGCGCTGAAAGTAGCGATAGGTCTTTCCCCTCCGTACCACCGTCTTAATCCCTTCAAGCGTCACTACGCTGCCCCTATAGCTTTGTCCCATGCGTCCTCTCCGGTTTCGGGTTCGGACGCTAAGTCTGTTTGTGTGTCTGTTGCAAGGATCATTACGCTACCATCACGCATGATGCGCAGCGCTATGACCGGCTTTCCTGCGCCTTCAAGCGCAGCAATGCCGCGCTTGATAGCTGCTTGGGTTGGAGTGGATACTGTCATTTGATCCTCTCAATCAAAGTTCAGCTCAGCAAACCCCTGCCTGCGCGCATTGAACGCAGCGAGGTTTGCCGATGGGTGTGGGCGGTCGATCCGGCGCAACCAGTTCCGCTCTTTCAGGATGCTGCGGACTGTGGCGCGATGGATCTTTAGGCGCTCGGCCAGCTGGCGCTGCGAGATCGCGCCGCCGGTTATTTTGATCATGCCGTCGATGCGGCAGGCATGAGCTTCGCGGCGGAAGGGGCGGTCGATCAAGTTCATGCTGCCGCCTCCAAGGCGTATCCGCCCCACTGTGACGCACACGCTCGAGCAATGCCCGCGAATGTTTGACTGCGGAATTTCCAGCGATCGGGACCGGGCGGCGCGCGATGCACGGCCTCCCATCGCTTCGCCTCGTCGCTGCCAGTCGGCGGCAGCGTTAGCTTGTCAGTCGGGCTCAGCCGGGGCAGGCCCCGCAAATAAAAGCCCGTGGCCTTCTTTGCCCGATCCCCAAACCACCATGGCTGCACGATCTGTGGGCGCGGCAGGTTCGCTGGCATCCTTTCACGAGCATGTCGGTGCATGATTGGATTTTCCACGGCGACCCGATCAATTGGGGCTCGCCAGCATGTCGTGAAAAGCGCCACGCCCTCGTCTAGCTCGCGCCACATATCCGCTAGAGTGCGTCCAGGTGGCGGGTTATGCAGCCAGCGGACGCCGCTGTTACAAAGCCTAGTGCACGGCGGGTGCGCGACGATCAGCAGATCCCAGCCGTCGCCCAAGTGGTCACGAACATCACCACGGATGTGGCGGTTGCTGCCGTCATCTGCGGGGAGGAGGTCACAAGACCAAGTGTCGTGTCCAAGCTCGGAGAATGCGCGGCGCATCACGCCCGATGTTTCGCAGCCAATGAGGACGCGCATTATGCACCGCCTTTCAGGGCTGCAAGGATTCGGGACGTGTGGTGGGAGTTCGCGAATGCCTGCGCATCCCGCAGAGCACCAAAGCGGCCAATCGGTATCGACCAAAACAGCAACTCGAATTGGTGCCGTTCCGCCCAAATTGCATAGTCACCAGCAACTAGTTCGTCGGGGTGTGCGTCCTCGTCATTGCGCGGTCCCCAAGCAAGCGACTTGACCTCGCGCGCCTCGGCCAGTTCAGCGCGAAGGCGGGTGATCTCGGCGGCAGAGTCGTTGCAGCAGTCGACCATCCATAGCAATGTGGTGCTCGATGATCCCTTGAGGCCAAGGAAATTCATCAGCGCTTTGGCCCTGCGCTCAGCCCCATTAAGACGTATCAACAGATCGGCGCTCATTCCTGCGATCCTTCCAGGGCTGCGCGGGAAGGCTCAACCGCCTCCAGCATGTGCATAACCATGTCGCGGAAGTTGTCGTAGCCAGCGATGACGATGGCATCGATGACCTGACAGGCTTCCAGCGCCGGTTTCAGTTGCGCTTTCAGCCGGTCAATCTCGGCTTTCATCTGGTCGCGTGTCGGCACCATTTTGGTATCTGCGGCGGCTGGCACCTGGGTAATTGGAATCTTGCCGGTGCCGTCGCAGGCGGCGCAATCCTCGTCGCACAGCCAACCAATAACTTGGCCGTGGCCGTCGCATTCTGTGCAGTGATTGATGTGCGGCTTGGTCAACTCGTCAATTCGCTTTTGCATGGCGCGAATGGTCAGGCCGGCATGGTGCATGGCCTCGACGCGGTTATCGCGGCGTTGGCCAATTAGTTCGCCAGCGACTTGCAGCACATCGGCGCGGTCGGTAGAAACCTCGCGGGCCGCGACCGGCTGTTTGGCCTCGGCAGCCAGCGCTTCGCCGCGCAGGCGTTCGGGTGCGAGGTTGTAAGGGTTTGGCGCTTGGGCCTCATTCGTCGTGCGCGGATCCTGCGCGGTCGGTGCGTGATCGCGCATTATGCTGCCTCCGCATTCTGGTTGGTTAGATGCTGGATCACGGCCTCAATCCCGCCGACCGTGCGGACGGTTTCGCGGACGCCCGTGGGCAGCTGGACGCCGAAACGGTTCTCGATCAGCAGGTAAAGCTGTGCGAGCATGTCAGGCGTCAGGCCAAGGTCGGCCACCAGTTCCGACGCCGATGTGACCTCGGCCAGCATTTTGCTGCTGACCAGTGCGGCCAGTTGCTTCACCGCGCTGGAATTGTGGGTATTGAATAGGTTGGACGGGTTAGCACCGGCCAGCAATTGGGCAGCGGTTAATGCCTTTGCATGTGCGTCCATTGCGAGCTGGTGTAGGCGCTCATAGGCCTCGGCCACCTTATCGCGGTGCTGGTGCAATGTGCGCTCGAGCGAGCGGCTGGTGTCCAGATCCTGCAGGATGAACCAAATGTCTTGCAGTCCGACCCCAATGGAAATGAGAAACCGGATCAGCGCCAACCGCTCCAGCACGTCAGGCTTGAAATAGCGAAGGTTGCGGCCAGCCTTCAGTGCTTGACCGGGTAGGCGGCGGCGAATGCCATTCGCGTCTGTTGCGGGGGGATCAGCTCAAGGCTTTCATAGTGACGGATGGCACGCGGGCTAAGGCCTGTCACGCGCTTGATATCCTGCGTTGTGATTAGCTTGGTCTGCATTGTCGCCCCGGATCAAACTGCATTGCGGCGCAGACGCGCGGCGACGGCAGCGCTGATCCGCGGTGCGGCGTCATCCAGCACTTGGCGCAGGCTTGGCTGGGCGTGGGTGCGCAGGCCGCTGCCTTCGCGGCGCATGACGTGGCCGGTCAGCATGAGATTTTCGGGGTTCAGGCGGATGCCGCGCTCTTCGCAAAGGGCCTGCCAGACGTGCTGTTTCGCGGCCACGGGGCCGTGCAGAATAGAGTCGGAAGTCGCCGCGTGGGCGAGTTGGGCGAAGTTAGCGGGGGTGGAGGTCGTCTGCATTTGATGCTCCATCGGTGTATCGATGGGGCAGTGGTATTGGAGTAAAAAAACTCCGTCAACTAAATAGAGTTATTTTACTCCACTTTTTATTGACGTGATTACTCGGACGTGCAGAATCGGGGCGAGAAAAAAACCGCCACTAGGGCGGGGTGTGCTATCAGGCGCTTAGAGCGGCGCTACCAGAGGGGGCATGATGTTTGGCTTGGAATTTTTGGAGCTCACGCTCACGCTGCAGGTTTCGCTCGGCGCTGGTTACCTAGCGTATGCGGTTGCATACGCAGGCTATCGAAAGGATCATCGAGCGGAAGATGCAATCTTCATCACAATAGCCTTTTCCGCGCTGGCGCAGCTTTTATTCGTTGTTCTAGAACATGCGCAGTTTTTGCCGATAATTTCTGTGCCTAGCGTGTTCTTAACCACAATTGCGGCAGGCGCATTATGGCGCAAATTTGGCATGAGCTTCTGGCAGACGTTGATGAGATGGTCTGGTGTCCATCGCGATGATGGGAACCTGCGTATCTGGCCAATTCTTGTACATGGAAACCATGAGATCAACCAGTGTCAGGTATACCTAAAGGATGGGCGCGTTCTTAACTTATTGAATGGGCCCGCATATATTGGCCAGCCGTGGGACGGGCTACTTTTGGGTGACGATGGTTCGGTTTTGATGGTTGTCGAAAACGAATACATGCCGAACGGATCCGTTATTGAGCATGGCGCGAATTTTGACGCTGAGTGGGGTACTTCACTGACCTATGTTCCGGCCAGTGAAGTAAGTCGTGTGATTTTTACTCTGAAGAAGTCGTAGGTGTATTTTTCTTTGCTGGCGGGGGACCGGACGTTTTGACAGATTTCAGTGGCGATGCGTTGTTCGTCACTTCGGCCGCTTTGTTAGAGATTACGCGCGCGTGGCCAGATGGTGTAGTTGGTTTTGCCATATATGAACTCCTAACTGGATTTTCTTACTAGTTCGGGTTGCAAATAATGCTTGATTGGCGCCGCCCATTTCAGGCGTCGATTGAACATTGGTTCGGCTGTCATGTTCAGGGAAAGCAGGTGGAATAGGTGTGGCTCATTCCCTTGTTTCACCTGCTTCAGCCAAACCTTACCTTCCTCGCATTCGGCTACGCAGATCTTATTGAGGGCTTCGGAGGGGACGCCGATAGTGTCGCGTGAGTAAAACACCACGCCGCCTGATTCGTAGAAAGGTTCCATGCTGTCGCCCGTCACCTCGACCGCCACGATGCCGTGTGTCGGAAGGTGATTTGGAGTCTCGACGTGGTAGATGCCATCGCCCTTGGCAAAGTCGTCCATCAGTTCGACTTGTGCGCCCGCGCCGATGCGGCCTGCTACAGCGGTTTTACGACCTGACCCGCTAAAGTTGCCATCATAGAAGTCTTCAAGGGTTACTCCAAACGCTGCCGCGATTTTCATTGCGTCGTCAACGTTGGTGGTTTTCGCCTTGCCTTGCCTTACATTTTTCATCTGGTCGTAAGACACGCCAGCTTGGGCGCAAACACTACGTAGTGAGCGTCCCGTTGTCGCGATGGCATGTTCTAGGGCGGAGGCGAAGGTGCTTTGCATGGAGGTAAATTACTCCAGGCTCAAAGCTATAGCATCGGGTATAAAAACCCTTGACGACTGGAGTATAATAACTCCATACCGTAAGCCATGGAACAATTCGTTACTGAAGTTGAGGCATACGCGAAAGCGGTAGGCAGGCCCCCACAGTCGATTTTGAAATCGGCGGTTCGAGCATCTTGGCGCGTGTGGCAGGCGTGGCTTGATGGGAAATCCAGCCCAACGATGGCTGTTGCCGACCGCGTTCGCGCCTACATGGCTGCAAACCCGCCCGCTGCCCAAAATGAAAGGGCGGCCTGATGCAGCAGACCGCCCCTCACAACCTAGCATCAAATGTTTCGGTTCGTTCGACACGAGACCAACATAAGGGGGTGCCGTTGGAAAAGTCTTTCCGAAAATCTCGCCAACCGAAAAACGGCGAGGATGCAGAGCGGGCGCGGTTCGCGCGGCTGCTGTGGCGGGCGTTCCCTGATGCCAACTCCGAAAATGAGCTGGCCGAGCTCGCGGCGATGGTGCTGACCTCGGAATCGCGCCCTGTCGATAAGCGGACGGTGCGCAATTGGCTGCGGTGCGAGAACGCGCCGCACTTTCGCTATGTGATCAAGGTGATCGCGCTTGTGGGGGCCGAGGCTGTTTTCCAAATGATTGACCCGGAGGTGGGCGAATGAAGCGGATCCTCTGGCGGATTGCACAACGTTACCTGCTTGCGCGCGCGGCTCGTGCCCGCCGCGCAGGCAACACGATGGCGGCGGCTAAGTTTAAGTCGCGCTCGGAAAAGTTTTTCCACAAGATCAAGGGGGCAATGCGGTGAGCTGGCGTCAATATAAGTTTGCTAGCTGTGGTGCGCGATCAATGAACGATATCGTTGTCGCGACGCAGAAATCCGATGCGAAAGCTATTTTCGCAAGTGTGGCAGAACTTTCTGATGCTGCCTCCCTGAAGCATGCTGCGTTGTCCAGATGCGAAACAGTACGGGCAGAGGTAGTGACTTGGCTCAGTGCCTTCCGGGGATCGTGGTGCCATGACAATGCCGCCCGCAGGTGTCGCTGTCAGCGCGTAGACATCTGCGTCCATGATCTCGACATGACGCTCGACGTCCAGAAGCGCCTCCTTAGCGCGCTGAAGATTATCCTAGCTAAGTTTGGCGTTGTCGCTAAGCAACTCATTTGTCTCGCAAAGCAATCTTGCGGCTTTCTCGATGCCATTGATGGATCTCAGGATTTCATCTCTCTTGGCGTTGGCTTCCGCAGTGAGAAGAGACGATATTTCTTTCGCAGCCGCTCTCATCTCATCGGCAGCTTTTACGCTTTCGTCCGTCAGTTTTCGGAGACGGTGATATTTCTTTCGGTCTTGGGTCATTGGTTGCTCGCCTCACGACTTTACCGCGCTATCTCGACGGAAACGGGTGTGACGGCGATGCGTTCCATCACTTTGAGGGCGTTGGCGTGAAAGCGTTACCACTGAAACAGACAGACGCGGATCGTGCTGTCGCAGATCTGGCGTTTGCGGTGACCTCGGCCGAAATCAAAGAGATTGTCGAAGAATATGAAACGCTGGCCGAAGAGCGTTCCGGCGTCGGAGACCGTATGAGCGAGGTGATGGCGCGCGCGAAGGCAAGAGGATACGACACAAAGATTTTGCGTCGGATCATCGCGCTGCGCAAGCGTCATGCCGATGACTTGGCTGAGGAAGAAACCATCCTCGAAATCTACAAATCAGCATTGGGAATGTGATGACCGCAATTTCAAGCCTGCGGCCTGTCGCTGTGGATGAGATCGGGGACTATCCGTTCTCGGTCGAGGATCGCTTCGAAAGCCACTATTTTATGGCATGGGAACGGCGGCGCTGGCTCAATTCCGACATGCGCCTGCGTGGTGATCCGGAATGCCGCGCCCTGTTCTTTGATCTGATCAATATCAGCTATGATCAGTCGCCGGTCGGCACCTTGCCAACGGATCATAAGATCTTGGCGAAATTGCTGTTCGTTACTGATGACCGCTTTGAGCAGCTGTGCCGCCAAGAATTCGGGCCGCTGCACAAGTGGCGCAAGTTCAACTGCGAAGGCGAAATCCGCCTGGGCCATCCAATGGTTCTAAAGACGTTGAATGAGGCTGTCGCGCGCCGTGCAGAGAACCAAGCGCGCACCGATGCAGCCAGCAACATCAAGCGCCTTCAGCGTATGCGCAGCATGTTGGCGGGTTACAGCAAAGATCTGGCCTCTAATGAGGCGGCAGTGCGCTGGATTGATGAATGGCTGATCGAACAAGGATGCACAAAGCGTAGCTCTGACTGGCTGGAGCGGGCCATGTCGGCATGGTCGGGGCACATGCATCGCCTCTCAGGTCGTGGGCGTAGTATCTAAACGAAACTACAGTCTATCGTATGTGTCCACATGGACAGACGAAGACAGACGAAGACAGATCGAGACAGACGAAGACAGTCCCGGACTGTCCACCACCATAAGGACAAGGACAATGACAAGATAAATACAAATACCCGTGCAGCGACAGTTGGACGGTCAAGCTGTGGATAAGTCGCGCCGAACAAGCCGCATGCAGGTGGCAATCGGGCATTGCTGAGAAAAGAGGAAAGGCTGATGGGGTTCGACACAAACCAACCAGAGACTAATCGCGATCGGGTCAGGCGCTTGCTGCTGGTGCCGCTGGGGTTCCGGCACCCGCGCAAGACCGAAGAGGCAGTGGGCCGCGCATCGCTGGACGCGATTGCAGATGAAATGGCGTATCTGCCTGATGACGTGCTGATCGCGATGGTTGAGGCCATGCGGGCGCGGGGCGAGGGGGCCAGCAAGGATTTCTGGCCGTCGCGCGCCTCGTTCATCGGGGTGGCCGAATGGCTGCATCCGCGCCCGCTGGAATTCTCGCCCAAGCTGCTCAGCTGGTTCGCCTCGATCGAGGGGCCGCGCGCGATTGAGAACGGGACGCTCGTCGAAACGTGGCAATTCTTTGAACAGAAAAAGCGCCCGCCGTCGCTGCCTGAGGATCGCAAGCGCGTCGAGGTTGTCGCGCAGCACAACGCGCGCAAGCTGCAACTGATTGCGGAACGTCGCGAACGTGGCCGCGAACAGTTTCCCGACGATCTGGCATGGGAACGCTGGTATCTGGATCGGCGGCTGTATTGCGAGCAAGCCGTGGCGGATGCGCGGGCCGGAAAGGCGGTGGCATGATGGATGCCCGTCAACTCGTTCCCGATGCCGATTACGAAATCGAATACGTGGCGGCAAAGGATCCGCGCCAGCCGCCCGAGCCAAAGCTGAAACCCATTGCGGGGCGCATGAAGGTCGCGGATGTGTTCGACCTGATGGCGCTGCAAGCGGGTCGTCGGGGCGGGGCCTTGGCACTGACGCCAACGCAAATCGCAATGGCGCGCACCTATAGCGCGTTGGTCGAGCGGCATGAGGCGGGCGGGATTAAATGTTCGTCTGCCGAGGCGGGCGGTGGCGGTGGTGGCGGCTCTGCCGATAGCTTCACGCAGGCGCGGTTGGATGCATCGCGCGAACTCGACCGGATCCGCGCCCGGATCGGTGACGGCGTTGCGTTGGAGGTGCGTCGTCGGCGCAAGGTGGATGACAAGGCCGTAACGGTGAAGTCGCAAATCACCGATCGCAAATTGGTGGATGCCGTTTGCCTCGCGGATCAAAGTTTGGCGTCATTGCTTCGCTCGCATGGCTGGTCGCCGGATGCTACTACGCGTGGATCGCTGATTAAGGCGCTGGGGGCCGCATTGGATCGGATGACTGGGCCTACTGCGCGGCACCGACATGCCACCTTACGGTGCGGCGTGGTGGCGCGTAGCCCCTTCGCCAAATAGGGGGTTGACGCCTTATCCCTTCCGACGATAGGAAATAAGCACTATCCATAATTGCGCCCACGGGAAACCGGCGGGCGCTTTTGCGTTCAGGGGTTTGGTGGTGTCGCCCGCAAGGACAGATAACAACAGAACATCTTGCAGGCGACTTGGGTAGATTTGGCGTTCCGACTAAAAGCCGCCTGCGCGAGGATTGGAACAGCTAACGCAGGCGGCTGATATTGGCGGAGTAAGCCAATTGCGATGGCAACATAAGCGACCGTGCCTGCCATCGAATCTCAGGTTAACGACGGCGGCTGACCTCGTCTACAGCATTAACCATCAGTGCTGCCTTATACTGCTGGTGCCGTCTGCATGGGAAAGAACTGTCTATGCAGACGGCATAGCGCGGCAAGGAAAGCCAAGGGTGACGGCGGGCTAGGTGCATACGTAGGACCCGCCGTCGACGTTCAGGCTATTTGGTCTTGAAGTGCTCGTCGATCTGCTTTTCGGCCTCTTCTTTGACAATGCCGTAGCGTTCTTGGATCTTTCCGGCCAGTTTTTCGCGCTGGCCGTCGATCTCAGTCAGATCGTCGTCGGTGAGTTGACCCCATTTGGTCTTCACATCACCCTTTGCTTGGGCCCATTTGCCCTTCACGGTATCCCAGTTCATGCCGTCCTCCATTGGTTAAGAGTTCACAAGATTGGTCTTCACGTGAGTTCATCGTCGTTAATATTTTGCCTAAACCGTAATCACATTTTTTGAGCGAGCTAAAACGACGCTTTTTTGCCGATTGGTCAGTTTCGCGTTTCTGGAGGGTGCAATGGCATCAGGTCTTTCTGTGCTGACCCACGGCAATGGCGTAACTCTGACGGGTGATCGCGCGCTGATCCGGTCTTTGACCGATCTCTCGGATCGCGATCTGAATATTGCCGCGACATGGGCGCTGAATGATACGGCGGCAGATGTGCTGACCGATGTGCAGCAGCGCATGGGGCAGGTGTTCGACCGGCCAACGCCTTTTGCGCTGAACGCATTTCAGGTGCGCAAGGCGCGGCCCAATGACTTGCAGGCAGCGGTGCAGGAACGGCCCTCGGTCGGAAAGCGCCATTTCCTGAAAGTGCAGGAACTTGGCGGCGTTCGCCCGCAAACCGGCCTTGAACGCAATCTAGGTTTCAAGCTGCCTTACGCTGGGCATTTGCAAACTGTGACGCCGGGGCCTGCGGCGCTGCTCGATGCGCGCGGCAATTGGTCGACGGGTGAACGCAACCGCGTCCTCTCTAGCATTCGCGCCCAGAGCGATAAGCACATGAACAGCAAGCGCGGCGCGGCCAAGGTGAAGCGATCGAGCGCTGCGCAGTTCTTTGTGCCGCGCGCCGGATCGAAACTGTCGCCCGGTGTATGGAAGCGCACCAGTCGCGGGGCCAAGCTGCAAAAGGTGCTGAACTTCACGGATCGCTCGGCGGTCTACAAGCCGCGCCTGAAGTTCCTCGATGGTGCCGAGGTGGTGGCCTCTCGCCAGATGCCGCTGCACCTTCGCCGGACGCTAGCGCAAATGGTCGCCAAGCGGGCCGTCAAGGTCTGAAAAAATCCGCGCGGGTCCTTCCTAGGGTTCGCTTGCATGCGGGTAATTCGCGCCCCGTGCCGTGCGATTTTTTTGGTTTTCCATTTGCAAATAAGGGGTTGTTGTTGGGGTTTGCCATGGCCGACGATCAAACTGTTACGCTCGTCGCGGTCGAGGTCAGCGCCGAGTTTCAAAAGATCCTTGCTGACTATCCGCTGCCCGCGACGGTGCAGGACGCGGATATGAACCAAGAGGAACTGGCCTCGGCCCTTAACCAATCCGTCAACACGATTGCAAAGTGGATTCGGCAAGAGGGCATGCCGGTCGCCCAGGCGGGCGGCAACGGTAAATCCTATGTGCTGCGGCTGTCGCATTGCTGGGCGTGGCTCAAGGCGCGGGATGCGGATCGCGACTTGCGCAGCCAACACAACAAACAGCAGGCGGCTGCGCTGCAAGCCGAAATGCTGGGTCTCGATGTGTCAGACCCAAACGCGCACATGACGCCGAAAGCGCGCCGCGAAATGGCCGAGGCTGATCTTGTTTGGAACAAAGCGCAGCGCGAACGCCGGACGCTGGTGCAGTTGGACGAGGTGCATGACCTGTTGGAGAGCGTGCTGACCATGGTGCGCGATGGGATCGAGGCGATGCCCGATCTGCTGGAACGGGAACTGAACCTGAAACCCGATCAGGTCGCGGCGGCTGTTGCTGTCGGGCACGATATCCTGACCAGCCTGACGGAAAAGATTGAAGCCGCCGAACTGCAAGAACGCACGGTCGCTGACCTGCCGGATCGGCAATTGTGGATGAATTAGGTCTCGTCAGATTGCTTGCGCGCGGCCTCGATCAGTTGCGCGATATGGCGTCTATCCTCTTTGAGCATGCGCTTGAGCTGCTCCATCTGCAAATTTAACGAACTGTTGTCTCGCTCGATCGCGGCAATCTTGGTGGAGAGCTTCAGCATTCTCTCCTGCAAATCCTTAATCTCCATGGACCGATCCTGTCTAACCTGTTGGGTTACCTACAACTACAAGCTGAATTGGATCAGATTGGATCGTTCTGTCTACTCCCAATTCACGTTGGCTAGAGCAGTGCAGCCACAATGGATGAAATTAAATTAAGAGGGCTTTTATCGATGCAGATGCGCGGGTTCGAGCCATTGATGCCTTATGCGGATCCGCGCGTAGCGCTGAAGGCGGCGCTGCCTGCGCTTCGTCCTGCCGAGCGGATCTCGGTCACCGATGCCGCCGAAAAGTATATGCGCGTGAATGTTTCGGGAACGTGGAAAAAGTTCAGCCGCGACGTGACGCCTTACATGACCGAGCCGAGCGACATGATCCCTTCGCGGATGTATCGCGGGCTGGTGCTGTGCGGGCCGTCGCAGTCGGGCAAGACCCAGATGCTGCAATCGGCGGTGGCCTATACGATTGCCGCGAACCCCGGACGGGTCGGGCTGTTCCAGATGACACGGGATGCGGCGCAGCTGTTTGAGCGCGAAAAGATCGCGCCGATGATCGCGAACTCGCCTGAGCTGCGGGCGAAGCTGGCGAAGGGGCGCGGGTCGGATACGATCTTTCAAAAGCTGTTTCTGGGCGGCACGCATCTGACGCTGAACTGGCCGACGATTACGCAGCTCAGCTCGACCACGATCCGGCTGGTGCTGGGCACGGATTATGACCACTTCCCCGAGAGCGTGGACGGCGAGGGCGATGCCTATTCGCTGATGCGGGCGCGGGCCAAGACCTACATGTCGCGGGGCATGGTGGTGGTTGAAAGCAGTCCAGGCGCGCCGATCACGGATGAACATTGGCAGCCGCAATCAGCGCATGATGTGCCGCCGGTGGAATACGGGGTGCTGTCGCTCTATCCGAACGGGACGCGGGGCCGGTGGTATTGGACATGCCCGTGCTGCGGCGAGGAATTTGAACCGACCTTTGCGCGCCTCGTCTATCCCGAAGGGGCCGAGCCTGCGGAAGCCGGTGACGCTGCCGAGATGCGGTGCCCGCATTGTCGCCAGACCTTCGGGCATGGCCTCAAGCGCGCGCTGAACAGTGAAGGACGCTGGCTGCATGAAGGCCGCGAGGTTGACGAATACGGACGCCCGCGCCTTGTCACCATCGACAGCGGCTTAGTGCGGCGCACCGATATGCTTAGCTACTGGCTGAACGGGGCAGCCGCCGCGTTCTCGACTTGGGCGGAACTGGTGGAAACCTATGAGAACGCCAAGCGCGCCTTTGACAAGACCGGCGACGAGGAAAAGCTAAAGACCGCAGCGAACACGGGGCAGGCGATGCCCTATATGCCGCGCTCTGCCTCGGATGAAATGACGCTCTCGCTGCAAGGGCTGAAGGATAAGGCGGAAGGGCATGATCTGCCGCGCGGGATCGCGCCGCATTGGGTGCGTTACCTCACGGTATCGGTGGACGTGCAAAGCACCTACTTCTCGGTGGGTGTCACGGGCTGGGGCGAGAACGGTCGGCACTGCCCGATTGACCGCTTTGATATTGTGAAGCCGCCCAATCAGGACGCGGATGCGCCGGAGCGGACGCTAAAGCCGTTCGAGATTGCCGCCGATTGGGATGCGCTGGTGGATCTGGCGACGCGCGAATGGCCGGTTGATGGGGCCAATGGATCGCTGCTTGCACGCTCGATCGCGATCGACATGCACGGCGGCGGCTCGACCACGGAAAACGCCTATCGGTTCTACCGCAAGCGCCGCAAGGCGGGCGAGGGGCAGCGCTGGTATCTGACGCGCGGCGAGGGCGGTTTGAAGAAACCCGATCGCGTGTGGCTGAAAGCGCCGGAACGGTCGAACAACGTCAAGCGCAAGGCGGCAGCGGATATCCAGATCCTGCACATGGCGACGGATCGGTTGAAAGATGCGGTGGCGGCAAGTCTTCGCATGCAGGACGGCGGCACCAATGATTGCGAGGTGCCGGGCTGGATGACCACCGATGAGCTGGGCGAGCTGATGGCCGAGCGGCGCGGCAAGTCCGGCTGGGAAAAACGACCCGGCGTTGTGCGAAACGAGAGTTTCGACCACCTCGTCCAAGCAAGGGCCCAGCATCTGATCAAGGGCGGCGAGCGGGTCAATTGGGACGATCCACCGGAATGGGCGGTTTTGGGTAACCAGAACCGCTTTTTTGTTGCCCCGATCAAACTCGCCCCGCTTGCGGATCCGGAAGCTGCGGTGCCGGTGCCCGCTAAAAGAAAACCGCGCCCGCAAGCGCAGCCTGCCAAGCCCGAAACCGGCGGCTGGATCAAACACCGGAAAGGAAGCTGGCTTTGAGCAAGGTCAATACGCAAATCGAGCAGATCGAGGACATGATCGCGAAGGGTGTGCTTTCGCTGGAGCAAAACGGCGAAAAGGTCACGTTACGTAGCTTTGACGAAATGCAGCGCACCCTGACGTGGCTCTACCGCAGGCGCGATGGCGACACGCGCAAGACGTTCCACACGCCAAAGTTCTCGCGGGGGTGATATGGGCTTCATGAACAAAGTCGCGCTCGCGGTCGCCCCGGCATGGGCGGCGCGGCGTGAACGCTCGCGCCTTGTCGCGATGCATTATCGCGCCGCACGGCTTGGGCAGCGTTCGGATAGTCTGCGCCCCACGAACTCCGATGCCAATGTGGCGGGGCAGGCGCGACAGCGGGTGTCGCAATATATGCGGGATCTGGTGCGCAATGCGCCTTTTGCGGCGCGGGCGCAGGCGGTCATCGCGAACAATGTGGTGGGCGACGGGATCATCCCGAAGGTCACTATGAATGCGCAGGTCGCGGATCGGGATCTCGGCCAGCGCCTGAAGGATCGCGGCATGCACCATATCGAGACCTCACTCGATACCGTGCATATCGACAAAAAGGGGCTGCAAAATCTTTACGGCTTGCAGCGACAGGTAATCAACACGGTCGTGGAATCGGGCGAGGCGCTGATCAGGCGGCACAAGCTGCCGGTTGAAAATGGCGCGGGGCTGCCGCTGAAAATCGAGGTGCTTGAACCCGAATACCTCGACAGTGGTCGCATGTGGGCGGGCGAGGGCGAGAATATCGTGCGGGACGGGATCGAATACGATCCGGTTGGCAACCGTGTCGCCTATTGGCTGTTCAGCCAGCATCCCGGCGGCGAGTTCGTGCCCAGTAAATCGCAGATCACTTCGGTTCGGGTGCCTGCCGATGAGATTATTCACGTCTTTCGACCGGATCGCCCGGGACAGGATCGCGGTATGAGTTGGCTCGCGCCGGTGGCGGAAAAGATGCTCATGCTCGACGATTACGAGGACGCGCAGCTGATGCGGCAGCGGATCGCGGCTTGCTTTGCCGCGTTCCGTAAAACGGGGCCGGATGCGAAGGCATCGCCTGAAATCAGCAAGACGCTGGTTCCCGGCACAATCGTCGATATCGGGCCGGAGGAGGATGTGGTTTTTGCTGCGCCGCCGACGGCATCGGGGGATGACCAGTTCAAAGGCACGATCCTGCGCGGCATCGCGATGGGGCTTGGCATCTCTTACGAGGCGCTGACCGGCGATCTGTCGGGCGTCAACTTCAGCTCGGCCCGCATCGGACGGCTGGAAATGGATCGCAATGTTAGCAGCTGGCAATGGCTGATGATGGTGCCGCAGATGCTGCAACCACTCGGTTGCTGGATCATGGAAGCATGGGCCGAATACGAGGCGCAGACGCGGGAGATAAAGGATCTGCCGCTTGGCGACTTCATCGACGGCGGCGGGCTTGTTCTGACGTGGGTGCCGCCGGTGCGGCTGATGATCGACCCTTCGGGCGAGTTCGCGGCGTTCAACACGGCGGTGCGCTCGGGGTTCATGAGCCGTCAGGGCGTGGTGCGGACAACCGGCATCGATCCCGAAAGGCTGATGCAGGAGCAATATCAAGACATGCGCGAGGCGGATGAGCTGGGGCTGATCTTTGACAGCGACCCGCGCAAGGACATCTCGCGGCAGATCCTCAGCGAGCAGGCGGAAAGGGCACGCAATGAATGAGATCTACCTGTCCGGCACGATCGGGCAGTCCTTTTGGGATGAGGAATTCTTCACCGAAAGCTCCGTCCGCGACGTGCTGGCGGGTCTAAGCGGCCCTCTGACGGTTAATTTGAACTCGGGTGGCGGGATCGCGACCGAGGGGCAGGCGATCTATACGCTGCTGAAGGAATATCCGGGCGAGGTGACGATTGTTGTGAACGGCATCGCGGCCTCTGCCGCCTCGCTGATCGCGATGGCCGGTGATCACATCATTATGCCCGAAGGCGCGATCATGATGATCCACGATCCGGCCAGCTGGGCGGTCGAGGGGCGCGGCACTGAGGCCGATCACCTCAAAGCCGCCAAGCATCTGGGCGTTATCGCCAATGCCTATGCGGCGATTTACGCCGCGCGGGCAGGTATCAGCCCCGAGGCTGCCCGCAAGATCATGCAGGTCGAGACCTATTACGATGGGCCGCAAGCGGTCCGCGCGGGGTTTGCGACCGCGACAGATGACAGCGCGGCGGCAGCAGCTGCGGCGCTTTTTGATTACAATCTTTACGCTCATGCCCCCGCAAGTTTGCGGGCGGTGGGTGACGCAGTCGGTCGGCGTCAGTCCAAGCGCGCCGTCGCGGCGATGATGGCCGGATCTAACAATGCTCAAAAGGAGGCTTTCATGCCCAAACCCCGTGCGACCCGTGCAAATGCAACCATGAACGATGATCAGGCTCCGACGCTGGAGGATGATCTGGATCCGGCGCTGGATGACCAGAACGTTGACGACATCACCGCCAGCGACGACGACGACATTACGGCCAGCGACGACGATGAAATCACCGCCGATGACGGCGATGATCTCGATGATCTGAATGCATCCGGGGCAGATCCCGATGACGAACTTAAAACCGCGATGCGGGCAACGGCGGTGATGAAGCTGTGTAAGGCGCGCGGCATCAGCACGGCGGTTGCGCATCACTATATCGCGGCGGGCATGTCCTCGGATGACGTGCGCCGCCTCCACCCTGCGAACAAGGATGGCCGCATGCCCCAGACCACCGCCTTCACGCCGCGCGCCCGCATTGTGCGCGATGAGGTCGACACCCGCCGCGCGGCGCTGACCGGTGCGATCTTTGCCAAAATGCAGCAAAAGCGCGGTCGCAAAGTCGAGGTGCAGGGCGCATCGCGCCAGTATATGGATGCGGGTCTGATCGAGATGGCATCGGTCGCAACGGGTCGCAAGCTGCCGCGCTTTGCCATGTCTTATGGTGTGCGCGAACAGTTCATGATGGATGCGATGCATTCCACCTCGGACTTTCCGGCGATTTTCCAAAATGCGCTGCATAAGGTGCTGCTCGACACCTACAGCAACTTCACACCGACCTATCAGCGCGTGGCCGAGCGCAAGGACTTTAACGACTTCCGCCCGATGCCACTGGTGATGACCGGCAGCTTGCCGATGCTGAAGCCGATTACCGAGACCGGTGAAATCAAGCACGGCACCTTCTCCGATCGCGGCGAGATGGCGACAATCGAACGCTATGGCGTGATTGTGCCGATTGACCGGGCGATGATCGTGAATGACGAACTGGGCGCGATTGCTCGCGTTATGGAGCGCTATGGCCGCACGGTTGGCGTGTTCGAGGAACGCACCTTCTATTCGCTGGCATTCTCTGGCCTCATGTCTGACGAAAAGCCGATCTTCCATACGGATCACAAGAACGTCGCGCCGAACGGCTCGGATATTACGGACGACGCAGTCAGCGAGGCACGCACCTATCTGCGTGAGGCGAAAGGGCTGGACGGCGAGCCGCTTTACCTGAACCCGAACTTGCTGCTGGTCGGCCCCAAGAACGAGACAGCGGCGCAGCGCCTGCTGGCGCAGACCACGCCCGCCAACGCTGACGACGTCAACGTCTTTAGCGGCCAGCTGGGGCTGGTGGTTACGCCGGAGATCACCGGTCGGGAATGGTATGTGTTTGACAATACAAACCCCTGCTGGACTTACGGCTATCTCGAGGATGCCTCGGCCCCGCGTGTCAGCACGCAGGAGCTGTTCGATCAGCAGGGCATGAAACTGAAACTGGAGCATGACTTTGGCGTGGGTCACGCCCGCTATGAGAGCGGCTTCAAGAACATCGGCCGCTAAAAGCCACTGCCTATTCACTGACAAGAGCCGCCTTCGGGCGGCTTTTGTCGTTCACCTCTTAGATGTCAGGAAAGACATATGAAAAACTCCATTCAACGCGGCCATTCCTTGGCCATCATTGCAGCGGCTGCGGTGGAAAGCGGCCAATTGGTCGTCGCGGGTCGCATGGTCGGCGTCGCGGGTGCGGATGCAGACGCGGGTGATCGGGTCGAGATCCACCTGGAGGGCGTTTATGAGTTGACCAAAACACCGACGCAGGCATGGTCTGTCGGGGCACTGGTCTATGCCGTGCCCGCTACGGGCGTTCTGACCACGGCGGCATCTGGCAACGTGCTCGTTGGGGTGGCGGTCGAGGCGACCGTCAATCCATCTGCCACGGGCATTGTTCGGCTCAATGGGTCGTTCTCGCCCGCTGCGGCCTAGGCCATGACGGCGCTGTTTGATGGTGTCGCGGGGCTGCTGACCGGCACCTTCGGCACGGTTGTCGAATACCGGCGCGACGGGCGCATGGAGTGGACGGATATTCCGTCCACCTTCCGCGAACAGCCGATCGAGGTCACGGATGCCGAGGGGCGCGATGTGCTGATCATCGCGCCGACTTGGCGGGTCATTCACCGGCTGGTGCCGGATCTCAAGCGCGATGATCTGGTGCGCCCCGCCTCGGGTAAGGTCTACCGCGTCGTAAACTTTGACCCGACTGGCTCGCCTGCTGCGGATGCCCAAGTGCTGTGTCAGCTGGAGGAATGGTATGGATAGCCCTGCCAGTATCCGCAAAGCCTATCGCGACACGGCGGCGGCGGCCATGGCGGCGGCGGATCGGTTCGCCGGTTTTGCGCGCCTGCCGTCTTGGGTTGAGGCGGTGAATGATAGCGTGCTGCCTGCCTTTGGCGCATTCACGCTGTCGGATGCCTTCGGGCGCACCACCTTGGATGGCGAACATCAGTGGGATGTGAAGCTGACGGTCGCCGCCAAGCGCACCGGGCTGCCCTCGGACCTCGATGACGACCTCGATCTCGATCTGGAAACGATTATCGCCACGATCATCACCGCGCTGCACGGCGCGAGCCTCTGCGGCGCGCAGATCCAGTGCGACGTCGAGACCAGCCAAACAGTTGTGAATGCCGACGGCGGCAAGCGCGTCGGCACGGTCGCGGTGACATTCAATTGCCGCTGCTACTGGATCTTTTAACAGGAGGGCAACATGCCTGCATCCACTGCCACCACCGGCTTCAGCACCCGTTTCGGGCGCAAGACCGGCACCGGCAATACCTATGCCTATTTCGCCGAGGTCTCGAACATTAACGGCATCGGCATGACGCGCGAGGCCATCGACGCAACCCATCTGGAAAGCCCTGATGGCTACAACGAGTTCATCGCCGGTATGAAGACTGGCAAGCCGGTGACGATCACACTGAACTTTGTGCCTGCCGCCACTTCGGACGTGACGGATGCCTTTGCAACCGGCTCGGGCGAGTTCCGCATTCTGTTTCCGGGCGGGACGGTGGCGTTCGACTTTACCGGGATCGTCACCGAATTTGCGCTTGGTGATATCAGCAATGACAAGCTTACTGCATCGCTAGTCATGCAGCCCTCTGGCAAGCCCACGCTCGGGTTGGTGGCAGGTGGCTGATGGCACAGCAACTTAAGGGTGAAGTGACCGCCAAGTATGATGGTCAAAGTTATAAGTTGGTGCTGAACTTCAACGCCCTGTGCGACTTTGAGGACGTGACTGGGCGCAACGGTTTAACGCTCGTTGATCAGATCGAGGCGGGCGAACCGATTAATGCGCGCGATATGCGCTACCTGATCTGGGCCGCACTGCGGCAACACCACCTCGATGCCACGGTCGGGCTGGCAGGCTTGATTATCACTCATGATCAGAAAGCCATCAGAAAGCTGACGGGCGCAGAGCCGGGAAAGTCCCGGGAGGCGGAGGAGTTGGAGCCAGCGCAAGCGCCGGGGGAGGCTGTAGAGCCGCCGAACCCGGATCCGCCCAAGCCGAAGCGCAAGACCCGGCAACAATAGAAGATCTGCTGCGCGCCTATGTCTCGGCGGGGTTTGACCCTGCCGGGTTCTGGGCGCTCACACCCGCGCTCTACCGTATCCATATGGCAGGCGCAGTGGAGCGGATTGAGCGCGAGGCCGAGGCGCTGACCCGGCAGGCATGGCTGATCGCCAAGCTCGTGGTGGTGCCGCTGGGCGGCGATATCCGGCCCTATGATCAGATATTCACCAAATCCCGCGCGGGCACGCCAATGCCACCTGCAGCGATGCAGGCGGTGCTGATCGCGCTTGCCAGTGCATGGGGTGCAGAGATGCCGCCCGATGTGCGCGACCACCTCAAGCAACAGGAAACAGGCAATGGCCAGTCCGTCGATCATCGGTAATCTGCGCGTGAACCTCGGGCTGGACGCGCGCCAGTTCCAGCAGGGCGCGCGCGGGTCGACCACGCAAGTCAATACGCTGCGCAGCGCGCTGGGGCCGCTGACCACCGGCCTTCGGGGCGTCGGTGCTGCCATGCGCACCGCTTTCCAATTTGCGGGCTTCACCTCGCTTGTCGGGCTTGCCGTGACACTCGGGCAAAAGTTCTTCGATCTGGTGAAGGTGACGGGGGGGATCGGTCAGGCGTTTGGTTATGTCCGCGATATCGGGATCGAGGCATGGGGCCGCGTCGGGACGGCGGTCGATGTGATGGGCAAGGGTATTACCGCCGCGCTCAAGGGCATGCAGTCGGGCTTTGCTACCATGGCCTCGCATATTGTCACCGGCGGGGTGGATTTTGCGAACCGCTACATCGGCATCTATCGCGGGGCCTTTGAGGCGGTCAAAGCGATCTGGGGCCAGCTGCCCGGCGCGATCGGCGATCTGGCCTATGGCGCGGCCAACGCCATGATTGGCGGTGTTGAGGCGATGCTGAACGGCGTCATTGGCCGCGTGAACAGTTTTATCGCGGGGGTGAACAGCGCGCTGGCGCTGATCCCCGACTGGATGGGCGGCGAGAATGCTCGCATTGGCATCATCGGTGAAGTCGAGATTGAGCGGATCCAGAACCCCCATGCGGGCGCGGCGCAAGAGGCTGGCGCGGCAGCGGCGGGAGCTTTCGCGGCAGGGTTCAATTCCAACACCTTCGATGGCAGTGGCGTGTCGGCGGCGCTGGCGCAAACGGCGGCGGATGCGGCAGAGGCGGCGCGGACTGCGCTGGCCGATGCATCGGCAGGCTTTGCCGATGTGATCGCCCCGATGCAAAGCCTCGAGGCGATCCGCGATCTTTTTGCCGAGATCTCGGGTGGCGGCGGTGGCAGCGCCAGTGGCGCGGCCAGTGCGATCAGTGAGGTGACCGACGCCACCAATGATCTGAGCGATGCGGCCAAGTCGGGTGCCTCGCTTATGTCGGATACGTTCCTGGGCCTCGTGACCGGCACAAAGTCTTTGAAGTCCGCCCTTGGGGATCTCGCTTTGCAGATCGCCAAGACCTTCGCGCAGCAGGGCTTTGCGCAGCTTGCGGCGGGCGGCGGGCTTTGGGGTGGGATCGCCTCGGGCCTTGGCAGTTTGATCGGGGCCAATGCCAACGGCACCAATAATTGGCGCGGCGGGTTGACGCAGATCAACGAGCGCGGCGGCGAGATTGTCGATCTGCCGTCCGGCACCCGGATCATCCCGCATGATGTGTCGAACCGCATGGCCGATGGCATGGGGCGCGGCAGTGTGCGGCAGATCAATATCGACGTGACCGGCGCGCGCGGCAATGCCGAGATTATGGACATGGTGCGGGCGGGTATGGCGCAGGCTGTGGCAACAATGGATGCCATGCTGCCGATGCGCGTCAATGAAATCGCGGCAAACCCGCGCATGGGGTAA
Protein sequences of DBSCAN-SWA_3 >NC_017384|752663:774711|756978_757596_+|WP_044008018.1|DBSCAN-SWA MMFGLEFLELTLTLQVSLGAGYLAYAVAYAGYRKDHRAEDAIFITIAFSALAQLLFVVLEHAQFLPIISVPSVFLTTIAAGALWRKFGMSFWQTLMRWSGVHRDDGNLRIWPILVHGNHEINQCQVYLKDGRVLNLLNGPAYIGQPWDGLLLGDDGSVLMVVENEYMPNGSVIEHGANFDAEWGTSLTYVPASEVSRVIFTLKKS >NC_017384|752663:774711|770931_771276_+|WP_013384024.1|DBSCAN-SWA MKNSIQRGHSLAIIAAAAVESGQLVVAGRMVGVAGADADAGDRVEIHLEGVYELTKTPTQAWSVGALVYAVPATGVLTTAASGNVLVGVAVEATVNPSATGIVRLNGSFSPAAA >NC_017384|752663:774711|772054_772483_+|WP_013384027.1|capsid|DBSCAN-SWA MPASTATTGFSTRFGRKTGTGNTYAYFAEVSNINGIGMTREAIDATHLESPDGYNEFIAGMKTGKPVTITLNFVPAATSDVTDAFATGSGEFRILFPGGTVAFDFTGIVTEFALGDISNDKLTASLVMQPSGKPTLGLVAGG >NC_017384|752663:774711|762648_762846_-|WP_013384016.1|DBSCAN-SWA MNWDTVKGKWAQAKGDVKTKWGQLTDDDLTEIDGQREKLAGKIQERYGIVKEEAEKQIDEHFKTK >NC_017384|752663:774711|760194_760731_+|WP_060486302.1|DBSCAN-SWA MAWERRRWLNSDMRLRGDPECRALFFDLINISYDQSPVGTLPTDHKILAKLLFVTDDRFEQLCRQEFGPLHKWRKFNCEGEIRLGHPMVLKTLNEAVARRAENQARTDAASNIKRLQRMRSMLAGYSKDLASNEAAVRWIDEWLIEQGCTKRSSDWLERAMSAWSGHMHRLSGRGRSI >NC_017384|752663:774711|755811_756303_-|WP_014537625.1|DBSCAN-SWA MLERLALIRFLISIGVGLQDIWFILQDLDTSRSLERTLHQHRDKVAEAYERLHQLAMDAHAKALTAAQLLAGANPSNLFNTHNSSAVKQLAALVSSKMLAEVTSASELVADLGLTPDMLAQLYLLIENRFGVQLPTGVRETVRTVGGIEAVIQHLTNQNAEAA >NC_017384|752663:774711|756497_756710_-|WP_013384006.1|DBSCAN-SWA MAAKQHVWQALCEERGIRLNPENLMLTGHVMRREGSGLRTHAQPSLRQVLDDAAPRISAAVAARLRRNAV >NC_017384|752663:774711|757736_758372_-|WP_014537628.1|DBSCAN-SWA MQSTFASALEHAIATTGRSLRSVCAQAGVSYDQMKNVRQGKAKTTNVDDAMKIAAAFGVTLEDFYDGNFSGSGRKTAVAGRIGAGAQVELMDDFAKGDGIYHVETPNHLPTHGIVAVEVTGDSMEPFYESGGVVFYSRDTIGVPSEALNKICVAECEEGKVWLKQVKQGNEPHLFHLLSLNMTAEPMFNRRLKWAAPIKHYLQPELVRKSS >NC_017384|752663:774711|753928_754177_-|WP_013384002.1|DBSCAN-SWA MNLIDRPFRREAHACRIDGMIKITGGAISQRQLAERLKIHRATVRSILKERNWLRRIDRPHPSANLAAFNARRQGFAELNFD >NC_017384|752663:774711|754817_755210_-|WP_013384003.1|DBSCAN-SWA MSADLLIRLNGAERRAKALMNFLGLKGSSSTTLLWMVDCCNDSAAEITRLRAELAEAREVKSLAWGPRNDEDAHPDELVAGDYAIWAERHQFELLFWSIPIGRFGALRDAQAFANSHHTSRILAALKGGA >NC_017384|752663:774711|761016_761553_+|WP_013384014.1|DBSCAN-SWA MLLVPLGFRHPRKTEEAVGRASLDAIADEMAYLPDDVLIAMVEAMRARGEGASKDFWPSRASFIGVAEWLHPRPLEFSPKLLSWFASIEGPRAIENGTLVETWQFFEQKKRPPSLPEDRKRVEVVAQHNARKLQLIAERRERGREQFPDDLAWERWYLDRRLYCEQAVADARAGKAVA >NC_017384|752663:774711|771601_772042_+|WP_013384026.1|DBSCAN-SWA MDSPASIRKAYRDTAAAAMAAADRFAGFARLPSWVEAVNDSVLPAFGAFTLSDAFGRTTLDGEHQWDVKLTVAAKRTGLPSDLDDDLDLDLETIIATIITALHGASLCGAQIQCDVETSQTVVNADGGKRVGTVAVTFNCRCYWIF >NC_017384|752663:774711|755206_755812_-|WP_014537624.1|DBSCAN-SWA MRDHAPTAQDPRTTNEAQAPNPYNLAPERLRGEALAAEAKQPVAAREVSTDRADVLQVAGELIGQRRDNRVEAMHHAGLTIRAMQKRIDELTKPHINHCTECDGHGQVIGWLCDEDCAACDGTGKIPITQVPAAADTKMVPTRDQMKAEIDRLKAQLKPALEACQVIDAIVIAGYDNFRDMVMHMLEAVEPSRAALEGSQE >NC_017384|752663:774711|773214_774711_+|WP_014537637.1|tail|DBSCAN-SWA MASPSIIGNLRVNLGLDARQFQQGARGSTTQVNTLRSALGPLTTGLRGVGAAMRTAFQFAGFTSLVGLAVTLGQKFFDLVKVTGGIGQAFGYVRDIGIEAWGRVGTAVDVMGKGITAALKGMQSGFATMASHIVTGGVDFANRYIGIYRGAFEAVKAIWGQLPGAIGDLAYGAANAMIGGVEAMLNGVIGRVNSFIAGVNSALALIPDWMGGENARIGIIGEVEIERIQNPHAGAAQEAGAAAAGAFAAGFNSNTFDGSGVSAALAQTAADAAEAARTALADASAGFADVIAPMQSLEAIRDLFAEISGGGGGSASGAASAISEVTDATNDLSDAAKSGASLMSDTFLGLVTGTKSLKSALGDLALQIAKTFAQQGFAQLAAGGGLWGGIASGLGSLIGANANGTNNWRGGLTQINERGGEIVDLPSGTRIIPHDVSNRMADGMGRGSVRQINIDVTGARGNAEIMDMVRAGMAQAVATMDAMLPMRVNEIAANPRMG >NC_017384|752663:774711|772482_772878_+|WP_044008021.1|DBSCAN-SWA MAQQLKGEVTAKYDGQSYKLVLNFNALCDFEDVTGRNGLTLVDQIEAGEPINARDMRYLIWAALRQHHLDATVGLAGLIITHDQKAIRKLTGAEPGKSREAEELEPAQAPGEAVEPPNPDPPKPKRKTRQQ >NC_017384|752663:774711|768720_770853_+|WP_014537635.1|protease|DBSCAN-SWA MNEIYLSGTIGQSFWDEEFFTESSVRDVLAGLSGPLTVNLNSGGGIATEGQAIYTLLKEYPGEVTIVVNGIAASAASLIAMAGDHIIMPEGAIMMIHDPASWAVEGRGTEADHLKAAKHLGVIANAYAAIYAARAGISPEAARKIMQVETYYDGPQAVRAGFATATDDSAAAAAAALFDYNLYAHAPASLRAVGDAVGRRQSKRAVAAMMAGSNNAQKEAFMPKPRATRANATMNDDQAPTLEDDLDPALDDQNVDDITASDDDDITASDDDEITADDGDDLDDLNASGADPDDELKTAMRATAVMKLCKARGISTAVAHHYIAAGMSSDDVRRLHPANKDGRMPQTTAFTPRARIVRDEVDTRRAALTGAIFAKMQQKRGRKVEVQGASRQYMDAGLIEMASVATGRKLPRFAMSYGVREQFMMDAMHSTSDFPAIFQNALHKVLLDTYSNFTPTYQRVAERKDFNDFRPMPLVMTGSLPMLKPITETGEIKHGTFSDRGEMATIERYGVIVPIDRAMIVNDELGAIARVMERYGRTVGVFEERTFYSLAFSGLMSDEKPIFHTDHKNVAPNGSDITDDAVSEARTYLREAKGLDGEPLYLNPNLLLVGPKNETAAQRLLAQTTPANADDVNVFSGQLGLVVTPEITGREWYVFDNTNPCWTYGYLEDASAPRVSTQELFDQQGMKLKLEHDFGVGHARYESGFKNIGR >NC_017384|752663:774711|764546_764726_-|WP_014537634.1|DBSCAN-SWA MEIKDLQERMLKLSTKIAAIERDNSSLNLQMEQLKRMLKEDRRHIAQLIEAARKQSDET >NC_017384|752663:774711|761549_762197_+|WP_013384015.1|DBSCAN-SWA MMDARQLVPDADYEIEYVAAKDPRQPPEPKLKPIAGRMKVADVFDLMALQAGRRGGALALTPTQIAMARTYSALVERHEAGGIKCSSAEAGGGGGGGSADSFTQARLDASRELDRIRARIGDGVALEVRRRRKVDDKAVTVKSQITDRKLVDAVCLADQSLASLLRSHGWSPDATTRGSLIKALGAALDRMTGPTARHRHATLRCGVVARSPFAK >NC_017384|752663:774711|771279_771609_+|WP_014537636.1|DBSCAN-SWA MTALFDGVAGLLTGTFGTVVEYRRDGRMEWTDIPSTFREQPIEVTDAEGRDVLIIAPTWRVIHRLVPDLKRDDLVRPASGKVYRVVNFDPTGSPAADAQVLCQLEEWYG >NC_017384|752663:774711|759221_759818_+|WP_044008019.1|DBSCAN-SWA MNDIVVATQKSDAKAIFASVAELSDAASLKHAALSRCETVRAEVVTWLSAFRGSWCHDNAARRCRCQRVDICVHDLDMTLDVQKRLLSALKIILAKFGVVAKQLICLAKQSCGFLDAIDGSQDFISLGVGFRSEKRRYFFRSRSHLIGSFYAFVRQFSETVIFLSVLGHWLLASRLYRAISTETGVTAMRSITLRALA >NC_017384|752663:774711|772973_773240_+|WP_013384028.1|DBSCAN-SWA MERIEREAEALTRQAWLIAKLVVVPLGGDIRPYDQIFTKSRAGTPMPPAAMQAVLIALASAWGAEMPPDVRDHLKQQETGNGQSVDHR >NC_017384|752663:774711|759814_760102_+|WP_013384010.1|DBSCAN-SWA MKALPLKQTDADRAVADLAFAVTSAEIKEIVEEYETLAEERSGVGDRMSEVMARAKARGYDTKILRRIIALRKRHADDLAEEETILEIYKSALGM >NC_017384|752663:774711|759033_759180_+|WP_014537630.1|DBSCAN-SWA MKRILWRIAQRYLLARAARARRAGNTMAAAKFKSRSEKFFHKIKGAMR >NC_017384|752663:774711|762997_763759_+|WP_013384017.1|DBSCAN-SWA MASGLSVLTHGNGVTLTGDRALIRSLTDLSDRDLNIAATWALNDTAADVLTDVQQRMGQVFDRPTPFALNAFQVRKARPNDLQAAVQERPSVGKRHFLKVQELGGVRPQTGLERNLGFKLPYAGHLQTVTPGPAALLDARGNWSTGERNRVLSSIRAQSDKHMNSKRGAAKVKRSSAAQFFVPRAGSKLSPGVWKRTSRGAKLQKVLNFTDRSAVYKPRLKFLDGAEVVASRQMPLHLRRTLAQMVAKRAVKV >NC_017384|752663:774711|752663_753749_-|WP_013384001.1|integrase|DBSCAN-SWA MGQSYRGSVVTLEGIKTVVRRGKTYRYFQRRGFPLVKLPDLPMDDPAFLRAYAEAKSANPEKSPAAAGSFAALVRAALKSDHYSQSKLATRAMFRRHCDALVEEFGELPARGLRSRHIRANLPKSTSQGHRLRTWRFLCSFGVATNLLERNEALDVAMPKAKSKGGHPPWTQDEIAAFRTRWPMGSATRAAFELLRFSGARISDAVRLGPGMVRGGVLTYRQVKTDGPAHIPWSCPLPDFAAHLSEDRAIMHHALEALAGQMTYLDARGKARSEKSLGTMIREAAMEAGFKKSAHGLRKSLATELAEGGATAHQISAWTGHETLKEVDHYTKSADRRRAVVGTEQDQNIGKHSGQMWKTRN >NC_017384|752663:774711|763863_764550_+|WP_162491175.1|DBSCAN-SWA MGFAMADDQTVTLVAVEVSAEFQKILADYPLPATVQDADMNQEELASALNQSVNTIAKWIRQEGMPVAQAGGNGKSYVLRLSHCWAWLKARDADRDLRSQHNKQQAAALQAEMLGLDVSDPNAHMTPKARREMAEADLVWNKAQRERRTLVQLDEVHDLLESVLTMVRDGIEAMPDLLERELNLKPDQVAAAVAVGHDILTSLTEKIEAAELQERTVADLPDRQLWMN >NC_017384|752663:774711|756338_756485_-|WP_014537626.1|DBSCAN-SWA MQTKLITTQDIKRVTGLSPRAIRHYESLELIPPQQTRMAFAAAYPVKH >NC_017384|752663:774711|767216_768728_+|WP_013384022.1|portal|DBSCAN-SWA MGFMNKVALAVAPAWAARRERSRLVAMHYRAARLGQRSDSLRPTNSDANVAGQARQRVSQYMRDLVRNAPFAARAQAVIANNVVGDGIIPKVTMNAQVADRDLGQRLKDRGMHHIETSLDTVHIDKKGLQNLYGLQRQVINTVVESGEALIRRHKLPVENGAGLPLKIEVLEPEYLDSGRMWAGEGENIVRDGIEYDPVGNRVAYWLFSQHPGGEFVPSKSQITSVRVPADEIIHVFRPDRPGQDRGMSWLAPVAEKMLMLDDYEDAQLMRQRIAACFAAFRKTGPDAKASPEISKTLVPGTIVDIGPEEDVVFAAPPTASGDDQFKGTILRGIAMGLGISYEALTGDLSGVNFSSARIGRLEMDRNVSSWQWLMMVPQMLQPLGCWIMEAWAEYEAQTREIKDLPLGDFIDGGGLVLTWVPPVRLMIDPSGEFAAFNTAVRSGFMSRQGVVRTTGIDPERLMQEQYQDMREADELGLIFDSDPRKDISRQILSEQAERARNE >NC_017384|752663:774711|758746_759037_+|WP_013384009.1|DBSCAN-SWA MEKSFRKSRQPKNGEDAERARFARLLWRAFPDANSENELAELAAMVLTSESRPVDKRTVRNWLRCENAPHFRYVIKVIALVGAEAVFQMIDPEVGE >NC_017384|752663:774711|764870_767024_+|WP_013384020.1|terminase|DBSCAN-SWA MQMRGFEPLMPYADPRVALKAALPALRPAERISVTDAAEKYMRVNVSGTWKKFSRDVTPYMTEPSDMIPSRMYRGLVLCGPSQSGKTQMLQSAVAYTIAANPGRVGLFQMTRDAAQLFEREKIAPMIANSPELRAKLAKGRGSDTIFQKLFLGGTHLTLNWPTITQLSSTTIRLVLGTDYDHFPESVDGEGDAYSLMRARAKTYMSRGMVVVESSPGAPITDEHWQPQSAHDVPPVEYGVLSLYPNGTRGRWYWTCPCCGEEFEPTFARLVYPEGAEPAEAGDAAEMRCPHCRQTFGHGLKRALNSEGRWLHEGREVDEYGRPRLVTIDSGLVRRTDMLSYWLNGAAAAFSTWAELVETYENAKRAFDKTGDEEKLKTAANTGQAMPYMPRSASDEMTLSLQGLKDKAEGHDLPRGIAPHWVRYLTVSVDVQSTYFSVGVTGWGENGRHCPIDRFDIVKPPNQDADAPERTLKPFEIAADWDALVDLATREWPVDGANGSLLARSIAIDMHGGGSTTENAYRFYRKRRKAGEGQRWYLTRGEGGLKKPDRVWLKAPERSNNVKRKAAADIQILHMATDRLKDAVAASLRMQDGGTNDCEVPGWMTTDELGELMAERRGKSGWEKRPGVVRNESFDHLVQARAQHLIKGGERVNWDDPPEWAVLGNQNRFFVAPIKLAPLADPEAAVPVPAKRKPRPQAQPAKPETGGWIKHRKGSWL >NC_017384|752663:774711|767020_767215_+|WP_013384021.1|DBSCAN-SWA MSKVNTQIEQIEDMIAKGVLSLEQNGEKVTLRSFDEMQRTLTWLYRRRDGDTRKTFHTPKFSRG |
31 | Rhodobacter_phage(22.22%) | tail,protease,portal,integrase,terminase,capsid | attL 746480:746496|attR 774902:774918 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
898310 : 905523
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_017384|898310:905523|DBSCAN-SWA CATGACCACGCCGCGCCAAGCCGCCCCGCGTCAGGGGCTTTTGATCATCTTGTCCTCGCCTTCGGGCGCGGGAAAATCGACGCTGTCGCGCCGTCTGATGGCCTGGGATGAGACGCTGCGATTCTCGGTCTCGGCCACGACGCGTGCGCCGCGGGCGGGCGAGGTGGATGGCGAACATTACCATTTCATGACGCGCGACGGCTTTGGCCAGTTGATCGCCACCGATCAGATGCTGGAGCATGCCGAGGTCTTTGGCAATTATTACGGCAGCCCGCGCGGTCCGGTGGAAATGGCGATGGCGCAGGGCCGCGATACGCTGTTCGACATCGACTGGCAGGGCGGGCAACAGATCCGCAACTCGCCGCTGGGGGCGGCTGTTGTGTCGATTTTCATCCTGCCTCCCTCGATTGCCGAGCTGGAAAGCCGTCTGCGCGCGCGCGCGCAAGACAGCGAAGAGGTGATCGCGAAACGTATGCGCGAAAGCATGAACGAAATCAGCCATTGGGCGGAATACGATTACGTGCTGGTCAACGAGGATCTGGATCAGGCCGAGGCGCAGCTGATCACCATCATTCAGGCCGAACGCGCCCGTCGCAGCCGCCAACCTTGGCTGAATGGCTTTACGCGCGGTCTTCAGCAGGAATTTACATTGCGTGGCACGTCGTCATAATTCCGTTTCGTTTCGCCCGGATAAGCGCGAGACGAAGGAGACCGACATGCCCGACAATCTGAACGCCATCATTGGCGCGCTGGATTTGCCCGCTTTGATTATCTCGCCCGAGGGTCAGATCATCGCGCATAACGCGGCGGCGCGGGATTTGATCGGGATGGATATGGTCGGGCTGCCCCATGCGGCGGTGCTGCGCCAGCCTGCGGTCAGTGCTGCGGTTGATCTGGTGCTGTCGGGCGCGCCTGAATCGCGGGCGCGACTGACCCAGCGCGGTGCCTCGCGCGATTCGATCTGGCAGATGCGCGCCGCCGCATTCGAGGGGCAGCGCCGCGCCATTTTGGTGACCTTTACCGATCTGACCGCAGTCGAGGAGGCGAACCAGATCCGCCGCGATTTCGTCGCCAATGTCAGCCACGAGCTGCGCACCCCGCTGACCGCGATCATGGGCTTTATCGAGACATTGCGCGGCCCCGCCCGCGATGACCCCGGCGTGCGCGGCCGTTTTCTGGATATTATGGAGCGCGAGGCCAACCGTATGGTGCAGCTGGTCGATGGCCTGCTTTCGCTGTCGCGGGTCGAGGTGGATGAACGCGTCCGCCCGACGACACCCGTGGATCTCAAGGCGCTGGCCGAAGAAACCATTGCCGCGCTAGAGCCTTTGGCCGCGCAGGGCAATAATACAGTCACCCTGCACGCCGAACCCGGCAATTGGATTGTCCCCGGCGATATCGGGCAGTTGCATCAGGTGCTGCGCAATCTGGTGCAGAATGCGCTGAAATACGGCGGGCCGGACAAGAATGTCGTGATCGCGTTGCATCCGGCGCAGTTCGATGTGGCTTTGCGCGCGACAGCCGTACGGATCGACGTGCAGGACGAAGGCCCGGGGATCGAGGCGCACCACATTCCCCGCCTGACCGAGCGGTTCTATCGCGTCGATGCGCATCGCGCCCGCACCGTGGGCGGCAGCGGCCTTGGCCTTGCGATCGTCAAGCATATCGTGAACCGCCACCGCGGCCGGTTGGCGATTTCCAGCACACCCGAAAAGGGCAGCACGTTCAGCGTGCTATTGCCGCAAGAGTGATCCGTTATACGTCTTGCGCGGAATTTCCCCCTCGCCATCCCTTGACCTTTGAAGGCGATGGGTCACATATCAGCTTACCAAGGCATTGAAAGGATTAGTCCCATGGCTTTCACCCTTCCGGAACTTCCCTACGCCCACGACGCACTTGCCGCCAAAGGCATGTCGCGTGAGACGCTGGAATACCACCACGACCTGCACCACAAGGCCTATGTCGACAACGGCAACAAGCTGATCGCCGGCACCGAGTGGGAGGGCAAGACCCTCGAAGAGATCATCACCGGCACCTATAATGCCACTGCTGTTGCGCAAAACGGCATCTTCAACAACATCAGCCAGCTTTGGAACCACAACCAGTTCTGGGAATGGCTGTCGCCCGAAACCGTCGCCATCCCGGGCGAGCTGGAAAAGGCCCTGACCGAGTCCTTTGGTTCGGTTGCCAAGTTCAAAGAAGAGTTCTCGGCCGCTGGTGCCGCCCAATTCGGTTCGGGCTGGGCATGGCTGGTCAAAGACAAAGACGGCAGCCTGAAAGTCACCAAGACCGAAAACGGTGTGAACCCGCTGGTGTTTGGTCAAACCGCATTGCTGGGCGTCGACGTGTGGGAACACTCGTATTACATCGACTTCCGCAACAAGCGTCCGGCCTATCTGACCAACTTCCTCGACAATCTGGTCAACTGGGAAAAGGTTGCGTCGGCGCTTTAAGCCGCCTACAGATTTCGAACCAGACCCGGCGCTGTGGCGCCGGGTTTTTTCTTGGGTGAGACATGCGCGACTTGCAGCAATCCTCGGGCGTTCTGGCCCTGATCGGCACCTATCTGCTCTGGGGCTTTATCGCGATCTATTTCGGGGCGGTCTCGCATGTGCCGCCGATGGAAGTGCTGGCCTATCGCGTGTTCTGGGCGGCAGTGTTTTATGGGTTGATCCTGCTGGTGCAGGGGCGGTTCAGCGAAGTGCCCACGGCCATGCGCGATCCGCGCAAGCTGCGGCTGATGCTGCTGGCCGGGCTGATGATCGCCGCGAACTGGCTGCTGTTCATCTTTGCCGTCAGCAATGGCCACGCGACCGAGGCCTCGATCGGTTATTACATCCTGCCGCTGATCGCCGCTGTCACCGGCTTTGCCGTCTTTCGCGAAAAGCTGGGGCGCTGGCAGATCGTCGCACTGCTGATCGCGGCCAGCGGCGTGCTGGTGCTGACGCTGGGCCTTGGGCGCGCGCCATGGGTCAGCCTGCTGCTGGCGGGCACGTTCGTCATCTATAATGTGATCAAAAGAACGCTAAAGGATGTGCCCTCGCTGGTCTCGGTGATGGCCGAGGTGATCTTGCTGGTCATTCCCGCCGCGCTTTATCTGGTGTTTTTCGGCGAGACGCTGTGGCAAGCGCCGCTGTCGCCCGCCTGGTGGCAGGATCAACTGCTGCTGATGTTGGCGGGGCCGATTACGGCCATTCCGCTGATCTTGTTCGGTTATGGCGCGCAACGGGTCAGTATGGCCACAACGGGGATCATCTCTTACATGAACCCCACGATGCAACTCCTTGTTGCAACGCTTTATTTCCACGAGGCGCTGACGATCTGGCACGGTGTCGCACTGGCGCTGATCTGGCTGGCCCTTGCGGTCTATACCGGGGCCAGCCTGCGGGCGCATCACGCCGCGAAGTAAGGCTTCAGCGCGGCTTCGATCTCTTCGACCGTGTTCACCACCGTGGTGTAGGACAGCAGGCTGCTTTCGGCAAAGCCCTGCGCCACGATGTTTTCCAAAAGCTGCACCAGCGGGTCCCAGAACCCGTCGACATTCAGCAAAAAGATCGGCTTTTGGTGCAGGCCGATCTGGCGCCAGGTCAGCACCTCGAAATACTCGTCCAGCGTGCCTGCGCCGCCGGGCAGGACGACGATGGCATCCGCGTTCATGAACATGACCTTTTTGCGTTCATGCATCGTCTCGGTGATGATCAGCTGATCCAGTTGGCGGCGACCAACCTCGCGCTTCATCAGATGGGTGGGGATGACGCCCACGGCCGCGCCGCCGGCCTCTTGTGTGGCGCTGGCGACAAGGCCCATCAGGCCGACATCGCCCGCGCCGTAAATCAGGCCCCAGTCATTGCGCGCGATCATCGCACCCGTGGCACTGGCCAGTTCTGCATAATGCGCCAGCGTGCCATAGCGAGAGCCGCAGTATACGCAGATTGATTTCTTGGTCAGCGCCATGAATTTACCCCTAAACTGTTTCCTTCAAGTCTCAGGTCCCAGATTAGTTGGGGCGGGCGGGTAACTCAACGCATGTCGAACTCGTCACGATGACAGGATTGTGTCACATCCTGTTCAAATTCTTGATGGGATTCGGATAGGCAGTGTCAAAGACGGTGAAAATTATCCTTGGGTTGCTGGCGGCTGCGGTTGCCTTTGTGGTGCTGATCCTGTTCGGCACAAGCGCGCGCATGACGCCCGGCACCCAAGTTCCCGCAGCAGCAGCAGAGGCACCCGCCGTCGCACCGCCAGAAACAGCAACCGCTGAGGCCGATACGGCCGAGCAACAGTCCGCCGCCGCAACGCCTGATCCGGCTGCGGATGCTGCGCCCGAACCTGAGCCCGTCGCGGATGAACCCGCCGCCCAGCCCGCACAGGACGCGCCGCGCCTTGATGTCGTGCGTCTGGCCGGAGATGGTCTGGCTGTTGTCGCCGGAAATGCCGCACCGGGCAGTGATGTGACGATTGTGGTGGATGGCGCGCCATCGACCACGGTGACCGCCAGCGCGGATGGCAGTTTTGCTGGCGTTGTCGATATCGGCGCCAGCGATGCGCCGCGCGTCATCGGCTTGCAAACAGAAACGGCCGACGGTCCCGTTGCCAGCTTGTCCGAGGCGATCCTTGCGCCGAACCCGCCCGCAGCAGCGCCTGAACCTGTCGAAGTGGCTGATGGTGAGGCGGCGGGTCAAGTGCCGACCGAAGGCGAACCCGCAGATGAACCCGCAGAACAATTGCCGGTGCAACAGGCCGCGCCGACGATCCTGATCACCGATGCCGATGGTGCGCGCGTGATCGCCGCGCCTGCGCCGCTCTCTGCCGATGCGCCGGCGCAGCTGCTGCAGACAATCTCCTATAATGCCGCCGGGAATGTGCTGCTGTCCGGCCGCGCTGCCGGTGTCGGTGGGCGCGTTGCGATCTATGTCGATAACGTGCTGCTGGGCTTTGCCGCGGTCGCCGATGACGGCAGTTGGAGCCTGCAACACGACGGCATCGACCCCGGTCGCCACACGTTGCGCGTCGATTGGCTGGACACGGAGGGCCGTGTGCGCAACCGTGTCGAGACGCCTTTCCTGCGCGAGGACGAAGGCGCGCTGGCGCAGGCCGTTGCAACCGAGGCCACTGCCGCCACCACCGGCATCGCCAGCCGCACCGTGCAACCCGGCAATACGCTCTGGGCTATTGCGCGCGAACGTTACGGCAGCGGCATTCTTTATGTGCAGGTGTTCGAGGCGAACCGCGACCGCATCCGCGATCCCGATCTGATCTATCCGGGCCAGATTTTCGACCTGCCGGATCTGCCGGAATCGCCCGATCGCCCTGCGACGCCGTAACCGCGCAGACGCACTGGCGCATCCCGGTCCAAGGGACTAGGGTGCGCCTCCTTGGGAAAAGGGGGCAACATGCACAGGCCGCGTATTTCAATCACTGACGCTTCGGCAAAGCCGGTGCAGGGCCTTTCCATCCTGCGGCGCGTGCTGCCCTATTTGTGGCCCGATGGCGCGAATTGGGTGAAATACCGGGTGGTTGCGGCCCTTGTCCTGCTGATGATCGCGAAACTGATCACGGTTGCGACGCCTTTGTTTTACAAATGGGCGGTCGATAGCCTGTCGGGCGTGGTCAGCGGGCCCGCGGGCATGATGGCGCTGGGGGCGGTGGGCCTGACCGTCGCATATGGCGGCGCGCGTCTGCTGACGGTCGGCTTTCAGCAGCTGCGCGATGCGGTCTTTGTGCGCGTCGGCCAGCGGTCGCTGCGGATCATCGCGGGACAGGCCTTTGCGCATATGCATCAACTTTCGATGCGCTATCACATCACGCGCAAGACCGGCGGCCTTAGCCGCATTATGGAGCGCGGGATCAAAGGCGTCGACTTCCTGCTGCGGATGTTCGTCTTTTCGCTGGGGCCCTTGGTGCTGGAGCTGGTGCTGGTCTGCGCCACCTTATTCTTTCTGTTCGACGTGCGTTTCCTGCTGGTCGTTGCGGGCACGATTGCCGCCTATATCGTGTATACATTGCTGGCGACCGAATGGCGCGTCCGCATCCGCCGCAAGATGAACGAGCAGGACAATGACGCCAATCAAAAGGCCATCGACAGTCTGCTGAACTTTGAAACGGTGAAATATTTCGACGCCGAGACGCGCGAAGTGAACCGCTATGATTCGGCGATGGAGAAATATGAGGATGCCGCCGTCAAAACTGGTGTCTCGCTGGCCGCGCTGAATTTTGGCCAATCGCTGATCATCACCCTTGGCCTGACCGCCGTCATGGTGCTGGCCGCGATGGGGGTGCAGGATGGCACGATGACGGTTGGCGATTTCGTCATGGTCAACGCCTATATGATCCAGGTCACCCAGCCGCTGAACTTCCTTGGCACGATCTATCGCGAAATCCGGCAGGCTCTGGTCGATATGGGTCAGTTGTTCGATCTGATGGGCGAGGCGGCCGAGGTCAAGGACAAGCCGGGCGCGCCCGCGCTGCGGATCACCGGCGGCGAAATCCGCTTTGAGGACGTCCGCTTTCACTATGCCCCGGATCGCGAGATTTTGAAGGGCGTCAGTTTCACCGTGCCCGCGGGCAAAACGCTGGCGCTGGTCGGCGCGACGGGATCGGGGAAATCCACCATCGGGCGGCTGTTGTTCCGGTTCTATGACGTGACGGGCGGACGCATCTTGATCGACGGCCAAGATATTCGTGACGTGACCCAGGAATCGCTGCACCGCGCCATCGGTGTGGTGCCGCAAGACACCGTGCTGTTCAACGACACCATCGGCTATAACATCGGCTATGGTCGCGCAGGCGCCACGCAGGCGCAGATCGAGGATGCTGCCCGCGCCGCGCAAGTCCATGATTTCATTGCCAGCCTGCCCGAAGGCTATGAGACGCAGGTGGGCGAGCGGGGCTTGAAACTCTCGGGCGGGGAAAAGCAGCGCGTGGGTATTGCCCGCACGCTTTTGAAAGATGCGCCGCTGTTGCTGCTGGACGAGGCGACATCTGCACTCGACACCAATACCGAGATGGGCGTGCAAGAGGTGCTGGCGCGGGCCGAGGCGGGGCGCACCACGATTTCCATCGCGCACCGCCTGTCCACCATTGCCGACGCCGACGAGATCATCGTTCTGGATCACGGCGCGGTGGTCGAACGCGGCAGTCATGGCGGGCTTTTGGCCCAGAACGGCCGCTATGCCCAGCTATGGGCGCATCAGCAAGCTTCCGACCAAGACTGA
Protein sequences of DBSCAN-SWA_4 >NC_017384|898310:905523|903717_905523_+|WP_193365337.1|DBSCAN-SWA MSITDASAKPVQGLSILRRVLPYLWPDGANWVKYRVVAALVLLMIAKLITVATPLFYKWAVDSLSGVVSGPAGMMALGAVGLTVAYGGARLLTVGFQQLRDAVFVRVGQRSLRIIAGQAFAHMHQLSMRYHITRKTGGLSRIMERGIKGVDFLLRMFVFSLGPLVLELVLVCATLFFLFDVRFLLVVAGTIAAYIVYTLLATEWRVRIRRKMNEQDNDANQKAIDSLLNFETVKYFDAETREVNRYDSAMEKYEDAAVKTGVSLAALNFGQSLIITLGLTAVMVLAAMGVQDGTMTVGDFVMVNAYMIQVTQPLNFLGTIYREIRQALVDMGQLFDLMGEAAEVKDKPGAPALRITGGEIRFEDVRFHYAPDREILKGVSFTVPAGKTLALVGATGSGKSTIGRLLFRFYDVTGGRILIDGQDIRDVTQESLHRAIGVVPQDTVLFNDTIGYNIGYGRAGATQAQIEDAARAAQVHDFIASLPEGYETQVGERGLKLSGGEKQRVGIARTLLKDAPLLLLDEATSALDTNTEMGVQEVLARAEAGRTTISIAHRLSTIADADEIIVLDHGAVVERGSHGGLLAQNGRYAQLWAHQQASDQD >NC_017384|898310:905523|900824_901718_+|WP_014537686.1|DBSCAN-SWA MRDLQQSSGVLALIGTYLLWGFIAIYFGAVSHVPPMEVLAYRVFWAAVFYGLILLVQGRFSEVPTAMRDPRKLRLMLLAGLMIAANWLLFIFAVSNGHATEASIGYYILPLIAAVTGFAVFREKLGRWQIVALLIAASGVLVLTLGLGRAPWVSLLLAGTFVIYNVIKRTLKDVPSLVSVMAEVILLVIPAALYLVFFGETLWQAPLSPAWWQDQLLLMLAGPITAIPLILFGYGAQRVSMATTGIISYMNPTMQLLVATLYFHEALTIWHGVALALIWLALAVYTGASLRAHHAAK >NC_017384|898310:905523|899025_900060_+|WP_013384143.1|DBSCAN-SWA MPDNLNAIIGALDLPALIISPEGQIIAHNAAARDLIGMDMVGLPHAAVLRQPAVSAAVDLVLSGAPESRARLTQRGASRDSIWQMRAAAFEGQRRAILVTFTDLTAVEEANQIRRDFVANVSHELRTPLTAIMGFIETLRGPARDDPGVRGRFLDIMEREANRMVQLVDGLLSLSRVEVDERVRPTTPVDLKALAEETIAALEPLAAQGNNTVTLHAEPGNWIVPGDIGQLHQVLRNLVQNALKYGGPDKNVVIALHPAQFDVALRATAVRIDVQDEGPGIEAHHIPRLTERFYRVDAHRARTVGGSGLGLAIVKHIVNRHRGRLAISSTPEKGSTFSVLLPQE >NC_017384|898310:905523|898310_898979_+|WP_044008153.1|DBSCAN-SWA MTTPRQAAPRQGLLIILSSPSGAGKSTLSRRLMAWDETLRFSVSATTRAPRAGEVDGEHYHFMTRDGFGQLIATDQMLEHAEVFGNYYGSPRGPVEMAMAQGRDTLFDIDWQGGQQIRNSPLGAAVVSIFILPPSIAELESRLRARAQDSEEVIAKRMRESMNEISHWAEYDYVLVNEDLDQAEAQLITIIQAERARRSRQPWLNGFTRGLQQEFTLRGTSS >NC_017384|898310:905523|900162_900762_+|WP_013384144.1|DBSCAN-SWA MAFTLPELPYAHDALAAKGMSRETLEYHHDLHHKAYVDNGNKLIAGTEWEGKTLEEIITGTYNATAVAQNGIFNNISQLWNHNQFWEWLSPETVAIPGELEKALTESFGSVAKFKEEFSAAGAAQFGSGWAWLVKDKDGSLKVTKTENGVNPLVFGQTALLGVDVWEHSYYIDFRNKRPAYLTNFLDNLVNWEKVASAL >NC_017384|898310:905523|901702_902257_-|WP_044008154.1|DBSCAN-SWA MTKKSICVYCGSRYGTLAHYAELASATGAMIARNDWGLIYGAGDVGLMGLVASATQEAGGAAVGVIPTHLMKREVGRRQLDQLIITETMHERKKVMFMNADAIVVLPGGAGTLDEYFEVLTWRQIGLHQKPIFLLNVDGFWDPLVQLLENIVAQGFAESSLLSYTTVVNTVEEIEAALKPYFAA >NC_017384|898310:905523|902406_903633_+|WP_014537688.1|DBSCAN-SWA MSKTVKIILGLLAAAVAFVVLILFGTSARMTPGTQVPAAAAEAPAVAPPETATAEADTAEQQSAAATPDPAADAAPEPEPVADEPAAQPAQDAPRLDVVRLAGDGLAVVAGNAAPGSDVTIVVDGAPSTTVTASADGSFAGVVDIGASDAPRVIGLQTETADGPVASLSEAILAPNPPAAAPEPVEVADGEAAGQVPTEGEPADEPAEQLPVQQAAPTILITDADGARVIAAPAPLSADAPAQLLQTISYNAAGNVLLSGRAAGVGGRVAIYVDNVLLGFAAVADDGSWSLQHDGIDPGRHTLRVDWLDTEGRVRNRVETPFLREDEGALAQAVATEATAATTGIASRTVQPGNTLWAIARERYGSGILYVQVFEANRDRIRDPDLIYPGQIFDLPDLPESPDRPATP |
7 | Bacillus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1677675 : 1695440
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_017384|1677675:1695440|DBSCAN-SWA TTTACCCCGGCAGATCCGGCCACCTCACCTGCGCCGGATCTGTAAGGTTGCCGGGCAGATCTCGCAGCGCCTGCCGATAGGCGAGCCACGCGGCGCGCGCCTCATCGCTTTGCGGGTAGTCTGGCTGGCTGCGATAGTCGCAGGCCGCGAGCAGCCGCGTGCGCTCCATCCGCAGCTGCGCCCATTGGTAGGTATCGAGATCAGGATTGATGACCCAATCCCATGCGACAGGGTCCCAGACATGGCGATCATCCGGGCGCGGCGGCAGCTCGACCGGCATCCCCCGCACCCAATGCGTCGCAGCCGACCAGTATCCCTGCAGAAGTCCAAAGCCCGCAGGGACACTGATTGTCGGATCGGCGACCACAAACGCCGCGATCTCTCCGGTATCCAGCCGATAAAGGGTAAACGGCACCTCTTGCTGCATGGCTATCTCCGGAATTGGTGCGCCGACAGATAGCGCTGGTAAACATTCGAGGCGTGCACCACCGTCCGCGCCATCAAAGTGTAGGTCGTCGGACCTTGGCCTGTATCCCAGTCGACAAAGTTAATGACCGATGACTGCCGCCATGCGCCACCATAACCGTAGGACCCACCGACCTGCTGACCATTGCGCAGGAGCCACACAACGATCCGCATGTCCGTCGAGCTGCCATCAAGCTGTGCATTCGCGGTGATCATCGTGGCGAGGCCGCGCCGATCGACAGTCAGCTCGAGCAGCGGGTAGTCCGCCGAGTTGGTGACAAAGGTCGGACTGCTTGGCTCCCAATACGCATAAGCCGGGACGGTCACCGCATTGCCCGCGATCTGCAATGTGTTGACCGCAAGATCCGCGATCGCGCCGTTTGCCGCGGTGACGGAATTGGCCGCCATCTTGTCCGCAGTGACGATTTGGGACGTTAGGTGTTGGGTTTGGATGCCGCCCTCGACGATGAGCAGCACGGCGTTGCGCTCCAGCAGTTGCGCTGACCCGATGTTGAGGACAGGCCGACCATAACTGGGGTCGATGTAGACCTCGAGCTTTGCCGTTTTCGCAGTGGCGGGAGAGGTGCCACTGACACTGAACCGGTTGATCCACGCAGCGCCATGCAAAGCGATGTGGCCGAGCTCTCCGACGTAGTTTCCATCCCTGTCCAGATAGGTAATCCGGAACAAAGTCCTGCTCCCACCGCTCACCCAGTAGCCCAATTCGAACGAATAGGACGTGGCGCTTTTGATCGGAAACACTTCGCCACTTAGAACCCAGGCCCAAGGATTCCCTGTGACCGTGCTAGGCGCGGTATTCAGCTGGATCCCTACCCCTTGCGGCCCGTAATTATTGCCATTTTGCGTATCGACAAAGCGCTGCTGGAATACGACGCCTTGCGCGGCGTTCCAGAACCACGAGCGCCCGCCCTGCCCGACCGGCGTGCCGAGATCCTGATAGAGATAATCCGGCACCAAGGAGCCGCCAGTGATCAGCACATGATGCGCGCGAAGTGCACCTGTGGCGATCTCTCGCGCCCCAACGGCGCCCGCCATGAACGCAGCACCCGTCAGGGTGTTGACGACGATATCGCCGCCATCGGTCTTATTAGACCATGCGCCAAGCTCGCTATCCCAACGATAGATCTTTTGATCTGTCGTCAGCACCAGCATGTCGCCGACCGAGCCTGTGGCCGGCAGCACATCGACCACCTCTGGCACCGTGAGGCCCGCCGCAAACTTTGTTTTATCAAGCGACGCCGCAGAGACCCCAGCAAAGACATTCTGCGTCCAAACCCCCGCCGCTGCATCCCAGCGCCAGATCTCGCCCGTGGTGCGGATCATCACCAGCTGATCGGCCCGCGCCCCCGCATCGGGCAGCGCATCCACAGGCTGAATGCCCTGCGCCGCCGCCATCTCCTCAATCCGGGCGACAAAGCCGTCCGGCAGATCGCCCTCCTGCATCTGGATCGCGGGCGTGGTGACCGTGATCCAGCCCGACCAGACATAGCCGCCGGTCAGCTCGGACATCAGCGCACCGCGCACCTCGTAGACAGACGCCGGTGCAACGCCGGTGATCCGCCACTGATAGGGCCGATCATAGGGGCGCATGACGTCAAAGGTCGGCTCGCTTGCTCCCAGACGCCTTGCCTGAATGCGCGCTTGCGCGATGCCTATGGCATCGCCCGCGCAGCCGACGCGGATCGCAGCGACCCGCCCCACGCCTGCATCATCGGCTACCTCGTCGCCGACCGCCGTCCAGCCGTCGATGGCCTGCACCCACGGGATCTCCGGCACCGGGGCCACGCTATCATAGGGCAGCTCGAAGCTCGGCGACCAATCGTAATCGGCGGGATCGACCTCGCGCAGGCGCACGGTGACATTCATGCCGGGCAGCTTGGTCACCTGCTCAACGACGAACTGCTTTGCGTCATAGCCGTTGCGGGCGCTGGTCCAGCTGATCGTGTCCACCAGCGGCTCAAGCGCATAGGCGCCGGGATGCAGGCTGAACTCCACGATAGTATCGCGGCGATAGTCCTCGAGCTGCGCCCGCATCAGCCGTTGCACCTGGTGCGGATAGGGCACCGCCGGATAGCTGAGCGAGGTCGGCAGATGCCGCCCGCCGTCAGCCGCCACGCCCGCTTCCGAGATATACTCGGGCGCATCGCGCGAGGCCCACTTGGCGGTGGGGTCCGGGAAGGTCGCCGTGATCGCATTGAACGTCTGGCTTGCCGGCGCAAAGGGCGTCAGGCTCTGCCCCTCGCTGATGACGATATCCGCATCGGTGATGGCAAACACCGGCGCGGCCGGCAGGCCGACCACGGGCTTTAGCATCCCCCCGACCTCGGCATAGCGCATGTTTGCGCCCTTGCCGATTTCCTCCATCACCCCCAGCGGCTCATCGCTGACCGTGATTTGCAGACCGGCGCGGTAGGCCGGTTCCGTGCCGCCCTCGGCCAGCGCGACCGGCATGTCGCAGGCGTTCATTGCCGCCATCCATTCGGCCAGCGGCAGACGCCACTGCGCCATGTTCTTGCCGCCGAACAGCCATTCATCGCCCCAGTAGATCCCGCGTGCGATATTGTAGCTGATCACCGCCGCATTGCGGCTGGGCTGATAGGTCGCGCGCTGGCCCCAGCGCTGCGCGCCCTGCCCGCCCGCCGTGCTGTCATAGCGCGGGTCATAGAGCGGCAGCGGCTCCGGCTCCCATAGATAGGTCGGATAGCTGGTCAGCGTCTCGCTATCAAAGCGCGTGGTGACGATGACATAGGTCTTGCCGGTGCCGATATGGTTGGCCGTCCACGGGTAATCTGCATCCTCGCCAAATGCCCACAGCAGGAACGGATCAGCGGCGGTTTGCGTGCCGTCGATCCATTTGACCCAGATCCGCGCGCCGAGGTCGCCCTCGCCCTCGTCATCCTTCATATTCGACAGCGGCCAGCCGACAACGGTATGGGTGGCAGGTACTGCGCCAATATCATGCGCGGGCAGCGAACGGATACTGGACGGGCCAAAGAGCGAAAGACCCGTGACGACTGTTCCGACTTTGCCCGCGACCAGGTCGGCGCGCTCATCATTGACCCATGCGCCCGCAAAGCCCTGTGGCAGGCTCGAGATCTCGATCACCTCGGTGATGAACCGCGTCTCGCGGCCCCATTTGGCGATAAACCGGCGCTTGCCCGCCGTCGCGAAGTCGCCGGCGGTGAAGGTCAGCGCGGTATCGTCGCCGAATTCGACATCGAACTTCACATCGTAATTTTGGCCCTGCTGCGCCGCCACGCCGGTCAGCGCCGCGCCAAGGATGCCGGTGCCGAGGCTGACAACCGCGCTAGCCAAAAACGCGCCTATGGTGGCACCCGCCCAGCCAAAGACATTGACGGCAAGCCAGCCCGTCACCGGGTCAGCCGCAGCCGGGGCGGCCACGCTCATCAGCGCGACCACCGTCAGGACCAGCGTCCAGATGCGTGTCATATTAACCTACCTTGAAAGCCCGCTTGATCTGGTCGCGCGGAACCCGCGCGTGACCTGTCTTGTCCAGCACGATCACGCTGGAGATATCGATGACCCCGAGCGCGCCGATCTGCCCCTCGCCGTCCAGCAGCGCCAGATCACCGATCTGGGCGAAGGCCGGATGCACTTCTGGCAGCAGCGCCGCCATCGCATCGCCCAGGCTCTCATAGCCCGCCTTGCGCAGCACACGCAGCGCGCCAGTGGGTGTTTTGTAATTGGCCCAGGCGGGGCGCAGATCTTCGCCCGTGATCGCCTCGACCGCGCCGGCCGCCAAGCCCAGCCCGCAATCATGGCGGCCCCACTCAAACGGATGATCCCGCTGGATATCGAGCGCATGGCTGAGACGCGCCCGCCAGTCCGGCAGGCGGGTCAGCGATCCTTTTTGTACCATTGGACAGACCTCGATCCGATGACCGCGGCGAATTCGCAAAAGCGGTCGATAAGGTTACGACGCTTCTGATGGGCGTCGGATGATTTGGCGGGGTTGCGCGCGGTCAGCTGCCACATGATCTCGGAGCGGATCTGATAGGTGATGCCGCCCTCGCCGTTTTCGCCCGGCGTATTGATCGGCCCCGCGTCGATGATGCCGACCCATTGCAGCTGGGGCGCGGCCGTAAACGCGCCGCCTGTCATGGTGGTGGCGTGGATCTCGCAATAGGCCAGCCGCAGATCATAGCCCCGCGCCAGCAGCTGCGCGGCATCCACCCCTTGGCTGGTCGAGACCGCCACCGGGTTATCGGTCAGATCGCCGACATAGGTCAGATCCTCGACCTTGAGGTTTACCCCGCCGAAGTAGTGCCGCGAGACGTTACTGCCATCGGGCTGCGTTAGCGTCAGCGTGATATCCTCATCGCCAGACCACAGCCCGATCGGCACTTCCGCGCCGCCAGCACGCGGCCGCGCCAGCACCCAGAAAAAGTAAACGGGGGCAATGCCCCCGTCCCGCGCCGCCGTAAGGCTGGCCGCAAATTGTGCGTCGTATGTGCGCATTATCATTTTCCTAGAAAGGCTTAGGCAAGGGTAGACATCTGACAAACCCTTGAGTCATTATCTGGCCACGCAACCTATAAGGGAGTGACACGCCATGATTGACGGTTTTTATAGTGTTGAGTTCGAGACGGTGTTGGGTTCCGGCGGAGGCGTCGTCGTCCTCGAGGACGGCAGCCTGCGCGGCGGCGATTCCAAGCGATATTTCCTTGGAAGCTATCGCATAGAAGATCAGAAGCTGTTGGCCGATGTGCACGTCGGAACACATATGGACAAACTCGACATTCCGCCGGTCTTCGGGGTTAATGAACTCGACCTGAAGATCACCGGAAAATTGACAGCTAGTGCAGCAATTGAAGGCACGGCGCGTAGTCCCCAACGCCCGGACAGCGTCATGGTGTTCAACATGAAGCGCATATCGGGCTGATCATCGCTTTTCCAACATGGTGACGCTCGCGCCGGTCGCCGCTGAACCGCGCCGATAGCTATAGGGCGTGTATCCGTCGTCCGGCACGAACATGCGGCAGACAGGGCGCAACAGTTCGATCTGCGCCCCGACTTGCAGCGACAACGGCGGATAGGGATAGACCGCCACCTGCACACTGCCACCCGCCACCGCGCCGCCGTCCGCGATCTCGCCCAGGTAATAGCGCCCCGTGCCCCAAGGCGCACTGAACCGATCGCCGGCCGCGAACTCAAACCCCGCAGGCAAGCCCGAGAATGTGATCCGTGTGCGATCAGCCGAGAAGCCCGCGACCCGCACGTCGGCATTCACCAGATGCGCCGGCGCGCCAATCGCTGGCCCGCTGTAGGTCGGATCGGCCCACAGAAAAGCCCGATTGGTGCCGAGCGCCCGGAACCGCGCATCGGTGCTCCGCGCGCGGCCGATGCAGTCGCGCGAGAGGTTATTGAGGTCAAAGCTGGCTGTCCACAGCGGCGGCGCGAGCTCAGCCGCCCACCGCCGCCCGTCGCCACCGCCGGACATCTCGTCATAGCGTTGCAGCTTCAGACTGACCTCTGCGCGCGTGGCGATCAGATCGCCGAGGAAAGCCAGCGGGTAGAGATCCGGCAGGGTCATCTGGTCTTCCTTCTTTCTACCATTGCATTGCGGACTTTGACATTGAACCGCTTGTCATAATCGCTGAGCACTTGTGTCATAGCGGCTCTGGCGCCGACGGCGGCGCGCTCTTCGATCGCGGCATCCCCTTGCGCTCCGCTGACATCGACCGTGATCGCCACCGACATGGACTGCGGGCCGGCATGGTCAGCAGTGGTCAAGCTGGACATCGCCGGAAGTCGCATGCCGACCAGCCCCCCTTCCGAGTAGCCAGGTATGCCCGCAGTCTGACGTAGCATTTCTGCGGCCGCCGGCCCGCCCACGCGACTTACATCATCCTGCGACCATACGACCTCACCTTTATGCACAATGCCCGCAGGCTGATACCGCCCGCCGGGTCCAGTGTATCCGCCGGTGTCATAGCTAAGGCCGCTGCCGATCGCATTAATCAGGCCGCCGCCCCATGAGGTCGAGCCAATCAGCCCCATCATCCCCTTGACCATCTGCACCTTAGCAATCTGCATCAGCAAATCCGCAACAGCCTCGCGCGCCGATTTCGACCCATCGATGATCGAGCCAAAGAAATCTTCCAATGTCGCGCGCCCGGTATCGGATGCCTCCTTGATCTTGCCGAGCTTTTCGGCGGCTTCCTCAGCCCCGAGGCCTGCCGTCACATAGGCCTGTGCAAGTGCATCGATCTCCGCCATCAGTTCCGGCGTCATCGCGCGACCTTGCTGTTGCGCCGCGTGCAACAGCTCAGCCCGCTTACGTGCGAACTCCAGCGCATCACCATAGTCTCGCGTGCCATCAGCCACAGAAAGAATTGCAGCAGCCTCGATTTGGAAAGCCTGGGTCTCCTCCCGGATCGATTTGACTGCGTCCGTATAAGGCGTGTCATCGCGCCTGCGTCCCGATCGGCCTTGCTCGCGCCTTGCGTCCTGCGCCGCGAGATTACCAGCAGCGATATGACGGATCTGGGCCTCGGTCAGCTGCACCTCATCACTGAGCGATTGCCGACGAACGCTTGCGATCTCATTTTCCAGCGCAAGTGCTTCACGTGTCAGACCGTTGCGACGATTGGCCTCTGTGACGTAATCCGCAGCCTCGCGCTCGATCCGCTGCCCTTCGCTACGTGACGCCCCATACTCCGTCGCAGCGGCGGCGCGTGCCGCGAGCGACGGATCAGATGCCAGTGCGGCGGCCGCTGACGCGCGATCAGCTGTCCTAATCAGATCAGCAAGCGCTCCCGTCATCTGTGCAATACGACCCAGAAATGGTGCAACGTTTGGATCGGCTTCACCGATGTCTTGCAAAGCCTGCTGCGCCTCGTACGCGGACATAGTTCCTTCTTGCAATGCGACCCTGACAGCCATCAATTGCGCGATGGTTTCGCTCGCGATGTCATCATATTGAAGACCCTTCAAATAGCGATCCAACTCGCGCACGGCCTCCAACTGATCAGAAAGAATGTCGCTCATACCAGACTGCGCAATTGCATCGCGCATCGCACTGGTGGCACGCACATGATTTGCAAGAAGGCTGATGATCTCCCGCCCATCCTCTGTGAAGCTCGACGTATCAATGCTCTCTAGAGAACGGATCACATCATCAGCGCTGATGCGAAGGGCGCCGAACTCTACTGCCAAGTCCCGAACTGCGCTGAGGGCATCCTTTTCTTCACCACCCCGACCCCAACCACCGAACAAGCCGCCCGCGCTCCCGGCGCGATCGACGATCGATTGCAAGTCAACCGTCGAGACTTTCTCCAATTCCTCACGCAAGTCGCGCAAGTTTGCCAGTCGTCCGTCCAATCCATCAAGAGCGGAGCCCACCGCATCAATCGCACTCGCGGCCGGCGGCGCAACAAGTCCTAGGCGCTCCAGCTCGTCACTTACACGCTGCGTCCGGCCTTCGGCTTTCAGTGCAGCATCCGAATACATCACCAGCCCGCCGGCAACGACACCACCGAGCAAAATGCCCAGAGGCCCTGCCGCAGCACTCATGCCGCCAAGCGCCGCAGCAAGACCGCCCACGCTCGAGGCTGCGCGCATGGCTGTAACGTATTTCATGATCTCAACGGCACCGGTGCCAAAACGCACCGCAAGAGCAGCGACAGAACGACCGAGTACCGCCCCAGCCAAAACCGCCGCGACTTTCAGACCTTTATCTGCAACCGTGTCGAAGTTATCCGCAATGATCGTCAACGCATCGGCGATGCGCGCCGAGATCCCCACCGCATCATCACCGCGACCGATATATTCCAGCAGCGCGTTGTTCAGCAGCATGAAGCCATCCTGGATCGTCGCAGGCATGGCCGCAGCCTCATCGCGCAGCTTTGCCATCTGCGAGGTGATCCCGAGCAGCTCCGCCCGTCCGATCTTGCCATCCCGGCCCATCTTGCGCAGCTCGAGTGTGGTGACGCCCATCGATGCTGCGAGCGCCTGTGCCACCCGCCCCCCGGACGCAATGACGGTGTTGAGGTTGTCGCCCTGCAACGTCCCGAGCGCCATCGCGTTCGACAGTGCATTCATCACACGCGAGGCGACATCGCCTTTCGCACCGGAGACAACCAGAGCATTGTTTAGGCTCTCGACGAAGTCGAGCTGGGTATTTGTCGCGACTCCGAGATCGCCAAGCACAGACGCGAAGCTAATATAGCTGTCCGCCGTCATGCGCAGATCGGAGTAAGTCCGGCGCGCCATGTCGCTGACACGCTCCATGACATCGGTGCCGCGATCCATCGACCCAGCCGCGTTATTCACGCGACTGGTAATATCCGTCCAGGCGCTGGTCATCGCTATCAGTTGGCGGCCGCCAAGCGCCGCCGCCAAAGGCGCAAGGATCGAGCCTGCCCGCGCTTTGATATTGTTGAGGCTGCGCCCGATCTGTGTCTCCATATCCTTGAAGGATCGGGTTAAAGTCGCATTCTGGCGCTGGAACTTTGTTGTCGCGTCGCGTTCCATTTTCGCCAGACGGCCCTCGAGCCGCACAATGGATTGGGTGAACTGTTTCTCGGACAACCCAACTGGGAGCTCAAGCGGATCTTCTGAAGCCATCATTTCACCCCAAGTGCCTTCAGGCGATCGAGAGTCACCTCGCCGCCACCTTCCGTTTTGATGCCGTTAAATCGCTTCCAGGCGGCGAAGGCGCTGTGGAACTCCCACAGGCTCATGCGCTGGACCTGTTCGGGGGCGAACCCCATCACGGCCCCGCTGCCATAGAGGAGCGAGAACCTTAACCGTTCTCGTCCGTCATCTCCGCCGCCACTGGCTTTTCCGGCTGATCGTGATCCTTGCCGGAGAAAAAGCCTGTTACGATGCCGTAGCAGATCTTTACGAGCTCCGCGAAATCGGCCTCCTCCATCGCGCGCAAAGCGAGCGCGCGCGCAGCATCCCCTTCCATACCGCCGCCGATCAGTCCCAGGCGAATGCAATCAATGACCTCGCGCACCCGCACCGGCGAGAAGCCGAGCGAGCCACGCTCGATCCCCTGCCGGCAGCGGAAGCGGAAATCCGCGATGCCTTGCGGCGTTAGATCGTCAAGCGCTTCGGCCTCGCCAATACGAAGCCGGAACGCATGCTCGCCGCAGGTCCAATTGATGACCTGTGCCTCCGCCATTAGGACCCGACCTTCAGGCTGCGGGTTGGCGTTTTTGCGAAGCGTGCCTCAACCGAGGCGGAAACCACCGCGCCCTTGACGCGTGACTGGCCAAGGCTGGTGATGATGATCGGGCCAGTCTCGTATTCGATCTCGCCGGCGCCTGCGTTCAGATTGCCGATCCGCGCCAGCATTTCAGTGCTGTCGCCCGCATAGAACTTTTTCAGGAAGGTATCATACCCACCCTGCGTCCAGTTGCCGGTGCCCGAAAAGCTGACCGCAAGGCTCTGCACCCGCACGCTTACGTTGTTCGGTTTGCTCTCGTCGTCGCAATCATCGACCGTTTCGGTCTCACTGGTCGATGCCGTCCGTGTCACATCGGCGCCCATGATCACGCAATTACGTGCCCAGGTAGTGCCGTTGTCTTCAGAGAACTCGAACACGAGCTCGTGGTATTCTTCATGAAAAGGCGCTGCCATTTTGGCGATCTCCTCGTGTTGCAAACAAAAATGCAGCCGGTCAGGGCTGCGCAGGGTCGTCAGGGGTTTTGTGCGTGCTGCGGCGCGGAGGTGGCACTGTGGCACAACCCGCACTGACCGCCGCATCTATAAACTCTTGCGGGAAGTGCTGAGGATCTGGACCCGCCTTGGCTGACCATCCCGCATTGCGCAGACGGCTGGTGTAGTGGAAGTCACGGTGGAAGATCGCTTTAGCCATCATAGCGCTCCATCTGGAATTCGTAGCGAAGCACACCGTGCAGCGTCAGTCCGTTGGGATCGCGCAGGACCTGGCTAAATGGATCGCCCCGTGCCACGATTGGATGTCGATCTGTCTGGACTTGACCCATGATCCGCCGCAGCCGTTGCATGATCTCTTCGCAATGCGCGCGGCCCGTCCGGTTCGACCAGGCATCGAGCTGCAAAGATACATCCTCGATCGCTTGGCACCCGCCTTCTTGTGAGACGCTGTCCCAAGGCCCGAAGCTGATGTAGCCATTAGCAGCTCCCCAAGGGCGATCCGGCACCCGGTCATAGATGCCGGCAATGATTGCCATAAGTGCAGCATCCGCCCGCACCGCTGCCTCAATGGCGTCTTGCAGTTCCGTCGCTGGTGATGACATCAGCCACCTTCCTTCCATGCTTTGCGCACCGCCCGCCGTATTGCTGCCATGGCCTGCTTGCGGCGCAAACGCTTTGCGGGGTTGAAGAAAGGCTTTGCGGGCATTGATTGCGTGCCGACCTCTTGGAGCTTCGCGTTTTGGAAGCGCTCGCCTTGCTTATTGGTAACGATCGTCGCGTCGTTACCCGCCCTTACAACGACCCCAATGAAGCCTGATCTGCCCTGGCTGGTCATGACTGCATCAGCCTCCTCAACGCGGATCGACGCTGCCAGATCCCCATCCGATGCACCCGCGAATGCACGGGCAAGGCTCGCGATCTCTTCGCCCTCTTTGCGGGCTTGGGACCGCCCCGCCTCGATCGCGGCCTTGGCATGGGTGCGCAATTTACGGCGCACAGCGTCGAGTCCCTTAACCATATGCCACCCCCGCCTCCACCAGCAGATAGATCCAGGCGCGATCACTGATCGGATCGACCTCGCGGATATTGTAAACGCCTGTCTGCGGCACATCGCCCTCGTAGGTGGCGCGCCGCATGTCGATCATCCGCCATGATTGCAGGATCTCGCGCGCAGCGGCGCATTGGCGGATCTTGATCTTGAACACAGACCGCCCTTCCAGACGCGCCGCATAAACCGCCTCGCCGCCGCGCGCATAGATGAATTCGGCCCGGCATTCGTATTCATGGGTCCAAATGCCAGTGCGCATATCCGGCGACTCGAACTGAACCCGCTCGATCAGTTTGCCCGCAGTGCTCATCCGAATGCGAAGCTCCTGTAGGTTGCGACCATGTCGCGCACGCCAAGCGGCACCTGGGCGAGACCTGCACCAACGGCTTCGCGATTTAGATACCAATGCGCGACCAGCATGCGCATCGCCTGCACGAGGCGCGCCGGCACATTACCGGGCCCGAGGCCCGCCACGAACACCACCCTCGCCGACGCGACATTCTCCGCGCCGTATACGATCAGGCGGCCATCGCGCAGAGCGTAGCGGCTGGCGTCGAGCGTCACTGCGGCGCCCAGATCATCGACATAGCCTACGCTGGTGATCTCAGGATCGGGATAACCCGCGAAGCCGAGATCAAAGCGCGATCCCGTTGCGGTGAACTCCGCGCGCTCGAGGCGCGTCTGACAGACCGATTGCACCCAATCCACCGCCGCGTCGATGTAGAGGCTGATCAGCGCATCATCATCGCCGAAGTCATCGGCGCGGCAATGCTTCTTGGCATCCTCGAGGGTGATAATCACGCCCGTTGCCTCGCTCGTTTGCTGAATATCCATGCTCACCCTCCAGCCAGCAGAGCGGGGCCCAGCGGCCCCGCCCCAGTTGGTTACTCGCCGCCGGCGGCCGTCAGATCGCCATAGACGATGCCTTCGGGACGCAGCGTTTCCAGCTGCAGCCGTTCTTCCAGAAGGATGGTAACCAGGTTCTTGACGAAGTTGTCGCGATCCTCGGTCGAACGGCGCACTTCGATCCCCTTGCGCTGCCAAATCAGCGTGTTGCCGATAAAGCCGCCGACGATGAACTTGCCCTGCGGCAGGCCCTTGGTGCGGACGACCGGCAGACCCCATGCGGTATTGCCGGCAAAAGCCGGGTGCAGATAGCGACCATCCGCATCTTTCGCCAGATCCAGCGCGGCAGCGTCCAGATGGTTCATCACGATGGCGGAGGCCAGCAGATCCGCCTCGGCCACCTGCGCAATAGCGATGCGGATATCATCCATGGCGTTGACCGGCGTAATGCCGGGCACCAGCCCGCTGTCATAGGTGGTGGAGTTCGCCAGCAGGCCATCGATGCGGCCAGTGGTGCCGGGGCCATTCAGCAGCTCGCCCTCTTCCTTCAGCTGCAAGCCGTAGAGGCCGCGCTGATTGATATAGGCTTCCATGCCGTCAACATCGTCCAGTGTCTCTTCTGTGACCCGGAAGAAGTGCGCCATCTTCACCATCGGCGCTGCCTTCGGCTCGAAGTCCAGCTCGGACTGCGGCTTCTGCGCGCCTTCGGCAACCACGCCGGCATTGTTGGTGTAGCCGCTCTCTTGCAGATATTCGATCACGGCGGCCGTGGTCGCGGCCGTGGGGATGATATCGCGCAGGAACAGAGCCTGATTGACCGGCTGGATCAGCCCGCGCGAGCCGCGGCGTACGCCGCCCGGCAAGTCGATGGTGCCAAAGCTGCCGGTGGTCACATCCTTGACCTCGAAACGCGCCTTGCGGTCATCCTTCAGGGTCTTGAACTCGTCATGCTCGGCGATGATCCGGCCCAGCGACTTACCTTCCAGATCAGCGCCGCGATGCGACAGCTTTTTGGTCAGGTCGGCGACCTGATCGCCCAGTTCGGTCAGGCCCTTGCGGCTGTCCTCGACGTGACCCTTGATATCGGTGATATCCTCGCCGGAGGCTTTCTTCTGGTCCAGCACCTTCAGTTGGTCGCTGAGATCAACCTGCGCCTTCTTCACCTCGGCCAGCGTTTTGCTCGCGTTATCGAGGGCCTCTTTGATTTCCAAGTCCATGGTTTATCCTTTCGATGGACGGTTAGAACTTGAAGGCGTCCTTGATCGCCTTCGCGGTCTCCGAAGCGGACGCGTCGCGCTGTCCATCGCCCAGAGCTTCTGGCGCGCGGGCGGCAATCGCCTTGCGCAGCCAAACCGGGAGGCCCGCGTCACGCAGGGCCACCTCCACGGATTTCTTCAGGGGCGCGAAGTCACCATCCTTCGCCGCCCGCATGATCTGTACCGAGGATTTCACGGCATCGACGCCAGAGCCGTCTTCCATCGGAAAGGTCACGACCGAGATCTCCCACAGGTCGATCTCTTTGAGGTGCCGGATACCGTCGATGATATCGGCGTCTTTGGTGCGATAGCCGATCGACAATCCCTCGATGGCGCCCATCTTCATCAGCGCATGGGTCTCGCGGCCCTTGACCGTTTCCAGCGCCAGACGTCCTTCGACGCGCAGCCCCTTCTCGTCCTCACTGATCGACGTCCAAACCCCGATCGGCTGCGCTGCATCATGCTGCCACAGCAGCTTCACCGACTTTTGGCCCGCGATCGATTTGGCATAGGCCCCCGGGCGCACCATATCGCCGCCCTGATCGACCACATTGAAGCGCGAGGCATAGCCGACGAACACGCCTTCTTCGGTCGCGGCCTTTGCATCGAGCCGCGCCAGCTTAAATTCCATTGCCATGATCTGCCCCTATTCGGTGATTGGCGCGCCGCCGGCCGCGGGCAATGCGTCACCGCCCTCGACCGGGTTCTTGCCCAGCCATTCCAGAATGCTGTTCGGCGTTTCCCAGGCGGCATTGTTGCCCAGGGACTTGGCCGCGTAATCAGCGCGCGCGGCCAGATCCATGCGGTAGAACTGGGTCTCGTCAAAATCGACGTATTCATCCGCACCCAGCAGCGAGAACCGGATCGCTTCCTCCCAGCGCTTAACCCAAGGCCCCAGCGTGATCGTCACGTGGTAATCCATCGCATCCGAGATCCGCGTCAGCGACTGCCCCGCCGCGTCATGCGCCAAGAAGATCGGGTGGATGCCATAGGCACGCGCCACCTCCTCGATAATGAATCGCCGCGTCTCGAGCAGCTGCAGCTCGGCCTGCGTCGGGATGATGCTGTTGTATTTGGTGCCGCTATCAAACACCGGCGTGTTCGGCAGCTTGTCCTTCAGTGCTTCCTTCACCATCTTGGCAGAGTCCTTGGAAAGCTGCTGATCGGTGGTGATGTAGCCCGGCACGGCCTTCTTTCGGCCATCCTCCATCTGGCGATCCTCGAGGGTCAGCGCGAGGCGTAGCACCTTGCCGATCTCGGCCGTGATGTTCAGCCCCTCGATCTCGTTCCACCTCGGGCAGGTCACTTCAATGAAGTCGCGGCGCGTTAGGTTCGACAAGAAGCCAAGGCCGGGGATATTGGCATCGTACCAGACCTTGCCGGTGTCATAATCGCGGCGGATCGTGACCCCGCCGTCATTGATCGGGATCAGCGCGCGGATGCGCCCGCGATAGCCGCGATGGATATAGGCCCGCCCTACGCCCCGGAACACCGCCCACATTGTCAGGGTCTCGACAAACTCAGTCGGCGTCATCCAGTCGTTGGGCGCGAGCGTCAGGCGCTCGGTCAGCTCGCCCGCGCGGATCGGCGTGCGGATCGCGCGGCCCAGCCGGTCATGGCCCAGTTTGCCAACCGTGATCGGCAGGCTTGCCACCCCTTCCGCGATCCGCATGCCGGCCGCCAACGAGGCAGTGACCCGCAGCTGCTTCTGGTGCTCGGACACAGCGCGCTCGACGATGTTCTGTTGGTAAAACCGATTGCCCGTGCCGGGATCCGTGTTCTTTTGTCCGAAGGGCCACAGCTTCATGTGAACAATACCCCGCCTTCCATGTAGCTGCGCCGCCCCTTGCGGTCGGCCTTGGTGGCGCCCACGCCCATTGCAAGCGCGACCATGCCGTCGATACGGCCGCGCTGACGCTGTTTGTCGAATGCCTGATTGCCCTGTCCGTCCGAGATCAGCACCGTGTTCGCGGCGCAGACATGGGTCATCTTGTTCTCGGCGATCAGGATCTCGCGCTTGAGGATCTTGTCCGTCATGTGCGTGATCGAATGTGGCATGCACAGCTGCCGCCCCTCGAAAGCAATGCGTGTGCCCTGGGCGTGTGTCACGATCTTCAGGCCCGTGCCCTCAGGCTCATCGGGCCCCATGTACCGCCAGACCTCGAACGAGATCTGCTCACAGGCCGCGATAAAGTCGGAGACATAGGCGGGGTCAACCACCAGCGCCTCAACCTCTTGCGCGACGCACAGCTCCTTCACCTGCTGCGCGACGAAGGTGTAATCGATCACCTCGCCCCGCACCGCGATCAGGTCACCGCCCGCGATATAGGATTTATAGGGCGTGCGATCCTCGGCCTCGCGCCGCTCTAACCCGCCCTCAGTGGTCCAATACCAAAGCTTTGCCGTCAGCTGGTCATCATCATCGCGCCAGACCGCAGAAAGCGCAGTCAGGTCGTTCTTCTTGGACAAATCCAGCGCCAGCACACAGGGCGTGCCGCGCATATCGTCCTCCGCCACCGGCCCCAGCACCGCGCGCCAGGCAGCTTCGTCAGCCAGCCAAAAGCCAGACGATCCGACAGGGCGCCCGAAGTAGAGCCGCTCGGTCGCCAGCCGCTCGGACGGCATGTGCTTCGCAGTCTCGACCCGCCGGCGCACGTTGTCGATCGGATAGGTGACGCCAAGCGCAGGCAGCGCCTTCACCCAGCAGCTCTCATCGCTGAACGGGTCATCGTCGACGTCGACCCGGGCGATATAGCTAAAGGCGCTATCGTCCTCGATCATCCCCAGCGCGACGCGCTGATAAAACTCGCTATAATCCGTGCCCACCGCCTGATCGGCCGCTGGCGTATTGGTGCCAAGGATCATCAGTGGATCGCCTGGCATTTTGTCGATCGCCGCCTTCCAAAGCTGGATCGCCTTGTCAGTGCGCATCTCGTGCACCTCGTCGGCAAAGACCGCGATCGGCTTCGGCCCCGAGATGCTGTCGGCCGAGGCGACCGGCAAGAACTTAGCGCCGCTCTCTGGCACCTCGATCTTCCACGCGTTGTCGCCGACGCCGCGGATCTTCACCTTGGCGACGCTTTCGAGCGTCGCGCCATCCTTGCCCGGGATCGGCGCGCGGCACAGCGCAACCGCATCCGAGAAAAGGACCTTAGCCTGGTCCTTGTCGTTGGCGATGGCATAGGCCTCGGCGCGCGGCACGCCGTTGAAGCCGATCATGTAAAGCCCGATCGCACCCATCAGCGGTGATTTGGCCTGGCCCTTCCCTGTCTCGATCCAAGCCTGGCGAAAGCGCCGTGTGCCATCGGCATTGCGCCAACCGAACAAGCTGCCCACCACAAACTGCATCCATTCCAGCAGATGGAACGGTTCGCCCGCCTTCGCGCCGGCCGTGACCGTGAACATGGCCGGAAAGAACCGGAAGGCCCGCGCCGCCTCATCTATATCGAAATTCAGCCCGCGCGCCGCGCCCTCCCGCAGATCGTTCAAGTGCCGCTGGCACGCAGCCCGCACGAACTTGCCTGCGATGATATCGCCGGACAAAACCCTATTGGCGTAATCCGTTGTAAGATCCGAGGAACTCATCTGCCGGGGCGCTCCCCTTCTTCGGCTTCTCGGCCGGCGGCCGCGCCTGTGCCTCCCCGAACATCGCCCGCTCCAGCTTCAGCATCCGGTCATTCAACTTCTCGACGGCCGACCAGGTAAAACTGAACACCTCGCCCCCGTTCGGCCCGCGCGTCACCGGGCCCGCCTCTGCCGCCTCCGGATAGAGCGCCTCGAACTCCACCTTCGCGCGCACATAGCGGTCCGCACGCGCCAGATTCACCCGGCTGAGTGCGGAAACCTCTTCAAGTGCGGCAATCACCTCTTTCCAGAGCAGTTTTGCGAGTGAAACACGCGCCTCATCACCCCTAAAAACAGCGTCATAGCGCGGCATTTGCGGCTTTTTTGCGGCCATCGGTCACCTCAAGAATGCCCCCCGACCCCTACCCCCTTTTTGAACCGCTTCTCAGTGCAAACGAAGGGGGGACGCGGGTGTGGCGCCAGACGGGGGGTCTTTAAAATCTAGGGGGGTGGGCCGGCCCTCCCCGACTACGATGCGCCCAATGCCCCTGAGCGCGCCCAGGGATGCGCGGGATCGATCGGCCTGCCATCCTCGTCGTGCCCAGCGATGAAGCCCCTGTGCGTCGCCTCCTGAATGGAGCGGTCGTGACACTGCTTGCACACGGCCATCAAATTGTCCGGGTCGAGGAACAGATCAATGTCGCCCTTGTGGTCTCGCTTGTGGTGCGCCGTCGCCGCGTTCGGTGGCAGCGGCTCATCCCTCTTCGGCTGCGCCAGCATCACCCCGCAACCAGGCCACTGACAGGTCCACATATCCCGCACAAAGATCTGCCGCCGCAGCATCTTCCATGCTTTGAGGTCATACAGGCGACGGTGGCCGTCAGTGCGACGGAAGCGTGACAGGCGCGATGTGATCAT
Protein sequences of DBSCAN-SWA_5 >NC_017384|1677675:1695440|1689740_1690919_-|WP_013384772.1|capsid|DBSCAN-SWA MDLEIKEALDNASKTLAEVKKAQVDLSDQLKVLDQKKASGEDITDIKGHVEDSRKGLTELGDQVADLTKKLSHRGADLEGKSLGRIIAEHDEFKTLKDDRKARFEVKDVTTGSFGTIDLPGGVRRGSRGLIQPVNQALFLRDIIPTAATTAAVIEYLQESGYTNNAGVVAEGAQKPQSELDFEPKAAPMVKMAHFFRVTEETLDDVDGMEAYINQRGLYGLQLKEEGELLNGPGTTGRIDGLLANSTTYDSGLVPGITPVNAMDDIRIAIAQVAEADLLASAIVMNHLDAAALDLAKDADGRYLHPAFAGNTAWGLPVVRTKGLPQGKFIVGGFIGNTLIWQRKGIEVRRSTEDRDNFVKNLVTILLEERLQLETLRPEGIVYGDLTAAGGE >NC_017384|1677675:1695440|1678103_1681610_-|WP_013384759.1|DBSCAN-SWA MTRIWTLVLTVVALMSVAAPAAADPVTGWLAVNVFGWAGATIGAFLASAVVSLGTGILGAALTGVAAQQGQNYDVKFDVEFGDDTALTFTAGDFATAGKRRFIAKWGRETRFITEVIEISSLPQGFAGAWVNDERADLVAGKVGTVVTGLSLFGPSSIRSLPAHDIGAVPATHTVVGWPLSNMKDDEGEGDLGARIWVKWIDGTQTAADPFLLWAFGEDADYPWTANHIGTGKTYVIVTTRFDSETLTSYPTYLWEPEPLPLYDPRYDSTAGGQGAQRWGQRATYQPSRNAAVISYNIARGIYWGDEWLFGGKNMAQWRLPLAEWMAAMNACDMPVALAEGGTEPAYRAGLQITVSDEPLGVMEEIGKGANMRYAEVGGMLKPVVGLPAAPVFAITDADIVISEGQSLTPFAPASQTFNAITATFPDPTAKWASRDAPEYISEAGVAADGGRHLPTSLSYPAVPYPHQVQRLMRAQLEDYRRDTIVEFSLHPGAYALEPLVDTISWTSARNGYDAKQFVVEQVTKLPGMNVTVRLREVDPADYDWSPSFELPYDSVAPVPEIPWVQAIDGWTAVGDEVADDAGVGRVAAIRVGCAGDAIGIAQARIQARRLGASEPTFDVMRPYDRPYQWRITGVAPASVYEVRGALMSELTGGYVWSGWITVTTPAIQMQEGDLPDGFVARIEEMAAAQGIQPVDALPDAGARADQLVMIRTTGEIWRWDAAAGVWTQNVFAGVSAASLDKTKFAAGLTVPEVVDVLPATGSVGDMLVLTTDQKIYRWDSELGAWSNKTDGGDIVVNTLTGAAFMAGAVGAREIATGALRAHHVLITGGSLVPDYLYQDLGTPVGQGGRSWFWNAAQGVVFQQRFVDTQNGNNYGPQGVGIQLNTAPSTVTGNPWAWVLSGEVFPIKSATSYSFELGYWVSGGSRTLFRITYLDRDGNYVGELGHIALHGAAWINRFSVSGTSPATAKTAKLEVYIDPSYGRPVLNIGSAQLLERNAVLLIVEGGIQTQHLTSQIVTADKMAANSVTAANGAIADLAVNTLQIAGNAVTVPAYAYWEPSSPTFVTNSADYPLLELTVDRRGLATMITANAQLDGSSTDMRIVVWLLRNGQQVGGSYGYGGAWRQSSVINFVDWDTGQGPTTYTLMARTVVHASNVYQRYLSAHQFRR >NC_017384|1677675:1695440|1688409_1688805_-|WP_013384769.1|DBSCAN-SWA MRRKLRTHAKAAIEAGRSQARKEGEEIASLARAFAGASDGDLAASIRVEEADAVMTSQGRSGFIGVVVRAGNDATIVTNKQGERFQNAKLQEVGTQSMPAKPFFNPAKRLRRKQAMAAIRRAVRKAWKEGG >NC_017384|1677675:1695440|1686748_1686895_-|WP_013384765.1|DBSCAN-SWA MGFAPEQVQRMSLWEFHSAFAAWKRFNGIKTEGGGEVTLDRLKALGVK >NC_017384|1677675:1695440|1691604_1692765_-|WP_014537879.1|portal|DBSCAN-SWA MKLWPFGQKNTDPGTGNRFYQQNIVERAVSEHQKQLRVTASLAAGMRIAEGVASLPITVGKLGHDRLGRAIRTPIRAGELTERLTLAPNDWMTPTEFVETLTMWAVFRGVGRAYIHRGYRGRIRALIPINDGGVTIRRDYDTGKVWYDANIPGLGFLSNLTRRDFIEVTCPRWNEIEGLNITAEIGKVLRLALTLEDRQMEDGRKKAVPGYITTDQQLSKDSAKMVKEALKDKLPNTPVFDSGTKYNSIIPTQAELQLLETRRFIIEEVARAYGIHPIFLAHDAAGQSLTRISDAMDYHVTITLGPWVKRWEEAIRFSLLGADEYVDFDETQFYRMDLAARADYAAKSLGNNAAWETPNSILEWLGKNPVEGGDALPAAGGAPITE >NC_017384|1677675:1695440|1683063_1683714_-|WP_013384763.1|DBSCAN-SWA MTLPDLYPLAFLGDLIATRAEVSLKLQRYDEMSGGGDGRRWAAELAPPLWTASFDLNNLSRDCIGRARSTDARFRALGTNRAFLWADPTYSGPAIGAPAHLVNADVRVAGFSADRTRITFSGLPAGFEFAAGDRFSAPWGTGRYYLGEIADGGAVAGGSVQVAVYPYPPLSLQVGAQIELLRPVCRMFVPDDGYTPYSYRRGSAATGASVTMLEKR >NC_017384|1677675:1695440|1686927_1687311_-|WP_013384766.1|DBSCAN-SWA MAEAQVINWTCGEHAFRLRIGEAEALDDLTPQGIADFRFRCRQGIERGSLGFSPVRVREVIDCIRLGLIGGGMEGDAARALALRAMEEADFAELVKICYGIVTGFFSGKDHDQPEKPVAAEMTDENG >NC_017384|1677675:1695440|1682733_1683063_+|WP_013384762.1|DBSCAN-SWA MIDGFYSVEFETVLGSGGGVVVLEDGSLRGGDSKRYFLGSYRIEDQKLLADVHVGTHMDKLDIPPVFGVNELDLKITGKLTASAAIEGTARSPQRPDSVMVFNMKRISG >NC_017384|1677675:1695440|1689162_1689690_-|WP_013384771.1|head,tail|DBSCAN-SWA MDIQQTSEATGVIITLEDAKKHCRADDFGDDDALISLYIDAAVDWVQSVCQTRLERAEFTATGSRFDLGFAGYPDPEITSVGYVDDLGAAVTLDASRYALRDGRLIVYGAENVASARVVFVAGLGPGNVPARLVQAMRMLVAHWYLNREAVGAGLAQVPLGVRDMVATYRSFAFG >NC_017384|1677675:1695440|1683710_1686752_-|WP_013384764.1|tail|DBSCAN-SWA MMASEDPLELPVGLSEKQFTQSIVRLEGRLAKMERDATTKFQRQNATLTRSFKDMETQIGRSLNNIKARAGSILAPLAAALGGRQLIAMTSAWTDITSRVNNAAGSMDRGTDVMERVSDMARRTYSDLRMTADSYISFASVLGDLGVATNTQLDFVESLNNALVVSGAKGDVASRVMNALSNAMALGTLQGDNLNTVIASGGRVAQALAASMGVTTLELRKMGRDGKIGRAELLGITSQMAKLRDEAAAMPATIQDGFMLLNNALLEYIGRGDDAVGISARIADALTIIADNFDTVADKGLKVAAVLAGAVLGRSVAALAVRFGTGAVEIMKYVTAMRAASSVGGLAAALGGMSAAAGPLGILLGGVVAGGLVMYSDAALKAEGRTQRVSDELERLGLVAPPAASAIDAVGSALDGLDGRLANLRDLREELEKVSTVDLQSIVDRAGSAGGLFGGWGRGGEEKDALSAVRDLAVEFGALRISADDVIRSLESIDTSSFTEDGREIISLLANHVRATSAMRDAIAQSGMSDILSDQLEAVRELDRYLKGLQYDDIASETIAQLMAVRVALQEGTMSAYEAQQALQDIGEADPNVAPFLGRIAQMTGALADLIRTADRASAAAALASDPSLAARAAAATEYGASRSEGQRIEREAADYVTEANRRNGLTREALALENEIASVRRQSLSDEVQLTEAQIRHIAAGNLAAQDARREQGRSGRRRDDTPYTDAVKSIREETQAFQIEAAAILSVADGTRDYGDALEFARKRAELLHAAQQQGRAMTPELMAEIDALAQAYVTAGLGAEEAAEKLGKIKEASDTGRATLEDFFGSIIDGSKSAREAVADLLMQIAKVQMVKGMMGLIGSTSWGGGLINAIGSGLSYDTGGYTGPGGRYQPAGIVHKGEVVWSQDDVSRVGGPAAAEMLRQTAGIPGYSEGGLVGMRLPAMSSLTTADHAGPQSMSVAITVDVSGAQGDAAIEERAAVGARAAMTQVLSDYDKRFNVKVRNAMVERRKTR >NC_017384|1677675:1695440|1682018_1682639_-|WP_013384761.1|DBSCAN-SWA MRTYDAQFAASLTAARDGGIAPVYFFWVLARPRAGGAEVPIGLWSGDEDITLTLTQPDGSNVSRHYFGGVNLKVEDLTYVGDLTDNPVAVSTSQGVDAAQLLARGYDLRLAYCEIHATTMTGGAFTAAPQLQWVGIIDAGPINTPGENGEGGITYQIRSEIMWQLTARNPAKSSDAHQKRRNLIDRFCEFAAVIGSRSVQWYKKDR >NC_017384|1677675:1695440|1692761_1694543_-|WP_013384775.1|terminase|DBSCAN-SWA MSSSDLTTDYANRVLSGDIIAGKFVRAACQRHLNDLREGAARGLNFDIDEAARAFRFFPAMFTVTAGAKAGEPFHLLEWMQFVVGSLFGWRNADGTRRFRQAWIETGKGQAKSPLMGAIGLYMIGFNGVPRAEAYAIANDKDQAKVLFSDAVALCRAPIPGKDGATLESVAKVKIRGVGDNAWKIEVPESGAKFLPVASADSISGPKPIAVFADEVHEMRTDKAIQLWKAAIDKMPGDPLMILGTNTPAADQAVGTDYSEFYQRVALGMIEDDSAFSYIARVDVDDDPFSDESCWVKALPALGVTYPIDNVRRRVETAKHMPSERLATERLYFGRPVGSSGFWLADEAAWRAVLGPVAEDDMRGTPCVLALDLSKKNDLTALSAVWRDDDDQLTAKLWYWTTEGGLERREAEDRTPYKSYIAGGDLIAVRGEVIDYTFVAQQVKELCVAQEVEALVVDPAYVSDFIAACEQISFEVWRYMGPDEPEGTGLKIVTHAQGTRIAFEGRQLCMPHSITHMTDKILKREILIAENKMTHVCAANTVLISDGQGNQAFDKQRQRGRIDGMVALAMGVGATKADRKGRRSYMEGGVLFT >NC_017384|1677675:1695440|1690941_1691595_-|WP_013384773.1|head,protease|DBSCAN-SWA MAMEFKLARLDAKAATEEGVFVGYASRFNVVDQGGDMVRPGAYAKSIAGQKSVKLLWQHDAAQPIGVWTSISEDEKGLRVEGRLALETVKGRETHALMKMGAIEGLSIGYRTKDADIIDGIRHLKEIDLWEISVVTFPMEDGSGVDAVKSSVQIMRAAKDGDFAPLKKSVEVALRDAGLPVWLRKAIAARAPEALGDGQRDASASETAKAIKDAFKF >NC_017384|1677675:1695440|1695050_1695440_-|WP_060486270.1|DBSCAN-SWA MITSRLSRFRRTDGHRRLYDLKAWKMLRRQIFVRDMWTCQWPGCGVMLAQPKRDEPLPPNAATAHHKRDHKGDIDLFLDPDNLMAVCKQCHDRSIQEATHRGFIAGHDEDGRPIDPAHPWARSGALGAS >NC_017384|1677675:1695440|1677675_1678101_-|WP_013384758.1|tail|DBSCAN-SWA MQQEVPFTLYRLDTGEIAAFVVADPTISVPAGFGLLQGYWSAATHWVRGMPVELPPRPDDRHVWDPVAWDWVINPDLDTYQWAQLRMERTRLLAACDYRSQPDYPQSDEARAAWLAYRQALRDLPGNLTDPAQVRWPDLPG >NC_017384|1677675:1695440|1681611_1682040_-|WP_013384760.1|DBSCAN-SWA MVQKGSLTRLPDWRARLSHALDIQRDHPFEWGRHDCGLGLAAGAVEAITGEDLRPAWANYKTPTGALRVLRKAGYESLGDAMAALLPEVHPAFAQIGDLALLDGEGQIGALGVIDISSVIVLDKTGHARVPRDQIKRAFKVG >NC_017384|1677675:1695440|1688818_1689166_-|WP_013384770.1|head,tail|DBSCAN-SWA MSTAGKLIERVQFESPDMRTGIWTHEYECRAEFIYARGGEAVYAARLEGRSVFKIKIRQCAAAREILQSWRMIDMRRATYEGDVPQTGVYNIREVDPISDRAWIYLLVEAGVAYG >NC_017384|1677675:1695440|1694505_1694916_-|WP_014537880.1|DBSCAN-SWA MAAKKPQMPRYDAVFRGDEARVSLAKLLWKEVIAALEEVSALSRVNLARADRYVRAKVEFEALYPEAAEAGPVTRGPNGGEVFSFTWSAVEKLNDRMLKLERAMFGEAQARPPAEKPKKGSAPADEFLGSYNGLRQ >NC_017384|1677675:1695440|1687310_1687769_-|WP_013384767.1|DBSCAN-SWA MAAPFHEEYHELVFEFSEDNGTTWARNCVIMGADVTRTASTSETETVDDCDDESKPNNVSVRVQSLAVSFSGTGNWTQGGYDTFLKKFYAGDSTEMLARIGNLNAGAGEIEYETGPIIITSLGQSRVKGAVVSASVEARFAKTPTRSLKVGS >NC_017384|1677675:1695440|1687999_1688410_-|WP_013384768.1|DBSCAN-SWA MSSPATELQDAIEAAVRADAALMAIIAGIYDRVPDRPWGAANGYISFGPWDSVSQEGGCQAIEDVSLQLDAWSNRTGRAHCEEIMQRLRRIMGQVQTDRHPIVARGDPFSQVLRDPNGLTLHGVLRYEFQMERYDG |
20 | Paracoccus_phage(27.27%) | tail,protease,head,portal,terminase,capsid | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|