Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_017571 | Shewanella baltica BA175, complete sequence | 4 crisprs | RT,cas3,DEDDh,DinG,WYL,csa3 | 0 | 0 | 6 | 0 |
NC_017570 | Shewanella baltica BA175 plasmid pSBAL17501, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NC_017572 | Shewanella baltica BA175 plasmid pSBAL17502, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_017571_1 | 1400916-1401025 | Orphan |
NA
Consensus repeat of NC_017571_1
|
1 spacer
spacers of NC_017571_1
>1.1|1400955|32|NC_017571|CRISPRCasFinder CTTCAGCACCCACAACACTCAGATAACCTTCT |
CRISPR arrays and Neighbor proteins around NC_017571_1
The CRISPR arrays of NC_017571_1 >merge|NC_017571|1|1400916-1401025|CRISPRCasFinder TATCATTCGCTTGCAGCGATAGGGGATTAGTCAGCGACTCTTCAGCACCCACAACACTCAGATAACCTTCTTATCATTCGCTTGCAGCGATAGGGGATTAGTCAGCGACT >NC_017571|1|1|1400916-1401025|CRISPRCasFinder TATCATTCGCTTGCAGCGATAGGGGATTAGTCAGCGACT CTTCAGCACCCACAACACTCAGATAACCTTCT TATCATTCGCTTGCAGCGATAGGGGATTAGTCAGCGACT
>NC_017571.1|WP_006082669.1|1400022_1400913_+|LysR-family-transcriptional-regulator MLDIHWLKTFVTLAEHKHFGKAATALHMTQPNVSLHIKQLEQSTRVKLIDRNPFRLTQAGFRLLESSQKTLMELQICQADLNAINDLSQGTLTIAASDIISRLLLIQPFQLFKAEFPGIDFSLLNTTSSQASELVKNAEADLGFVIAQKESQPLHFTELQQIKWCAIGDNLQQWQQANLSVDTPLAEQPTLILLGHDTRTRELLDPALPSLNLPNYRIMEVGSVDAQIDWAEAGFGVAIVPEFSVYTKANLTTKVTPLPQFPTTSLGYIVRQNQILSKAIKQLLYWVNQEIIRSQR >NC_017571.1|WP_006082670.1|1398625_1399882_-|adenylosuccinate-synthase MPSIVVVGANWGDEGKGRIVDYLAADAAASIRFQGGNNAGHTVVNDYGTFKLHQLPSGIFNPNCIAVLGPGMVISPEKLSIEIAEVQATGVDVKLCISDRATLCLPLHALEDTLEEQRLGDLAYGSTRQGIAPAYGDRVMKKGILVGWLNQPEVLQERIQFMLDWKMPQLKALYPSCDFNQTAAEMTAWLLEVTAPWRPFICNVTEPLKALQKQDASLLFEAQLGAGRDLVYGEYPYTTSSNVTAAYAGIGSGLPALRPERVIAVAKAFSSSVGTGTLVTAMEEQDAFRENANEFGATTGRPRDMGYFDAVATRNGVELQAATEIALTKIDCLSGMKDLKICVAYEGEHTENPIWPQTAALSPVYEQMPSWSEDITGCRTFESLPVAAQHYVERIEALMGVRVSMVSVGPERDQMIIR >NC_017571.1|WP_006082671.1|1397485_1398466_+|chemotaxis-protein-CheV MSEIKKSEILTESGTNELEIIEFHLHKILPNGGHKVCHYGINVAKVREVIRVPETSDYPNAQPHMVGVFSLREKLIPLVDLAGWLGISTPEDLTHKVVIVTDFNKMINGFLIDSVRSIHRVSWEQVESPSQFLEAGENDCVVAVVRRDGMLIMILDFEKIIADINPELSMDKYDVTLDRSVLINDKMLAKREAKTILIVDDSAFIRKMIENTLRSAGYNIITAKDGGDALEMLMEFESLAEQDNASISDFVSAVITDVEMPRMDGMHLVKRLRESKAYQQMPIVMFSSLMSEDNRTKAMALGANDTITKPEIGRMVGMIDKYVFEE >NC_017571.1|WP_006082672.1|1396675_1397107_-|thioesterase-family-protein MPQFSPTFSFPVQIYYEDTDFSGVVYHPNFLKYFERAREHVIGAERLSALWQQNQLGFAVYRSDMLCHDGVEFADIIDVRTKFYFESKYRTVWQQEIWRPQGKKPAVTATIEMVCMNQARQLAPMPADLIAVLSQGFVSNEQK >NC_017571.1|WP_006082673.1|1394844_1396338_+|bifunctional-diguanylate-cyclase/phosphodiesterase 
MFEIGLVVASLILLIFSLYILQAKYRSQRSQQKFLAQLCLQLEQQKIERQALDLGTVPQEFAPLYHTLDELLKALPASAGKDKLTGLVNRVGLKRALTSMMPLTQGTLVLIDIYRFRYVNDLFGFVFGDILLKQFAERLNSLSLSPRLIARLNGDEFCLYYEQALTEEQLIHLRGRLQVPFSIKNTPVSVKLQIGCVHLQEHHADTSQMLRRVDLALKKARHSRSAIAYYAENDDIQQLRELRIIDSLPKGLQRNQVYMVYQAKQDIATGRCFQVEALMRWEHDELGVISPGEFIPLAEYAGMIDLVSQWALDQVLAQQAKWRAAGIQLCVAVNLSTRDLDSETLPQEIAARLAHYKLPPESLMIEITESTLMADLTKAVETLDKIRALGVRLAIDDFGTGHSSLAYLKHLPVNEVKIDKAFLKDLEQDKPSEYILEASINIAKKLGYEVTVEGIETPEIRGILVAMGVDTLQGMYYAKPMRAAELEMNWARLHNAT >NC_017571.1|WP_006082674.1|1393908_1394751_+|HDOD-domain-containing-protein MSTEHLLLVSLLKKLKDDALVLPTLPEVAMRVQEVVGRSDASLKQVAEVIGQDAAISARIIKVANSALYSRGVPAESINSAVSRIGLIQIKSIATSVAMEQLFISTNEMVWEVMDEVWRTSIDVTAAACAMLQMYNKRHSTSRLNFDTLTLAGLVHNIGALPVLTEAEAQPHLFTSIDQLRALVRKMQGPLGRAVLKSWDFSTEVMEVVERWADLPYLPDHVTYLDFIRAAAFYTGELRAGTELEQRLGVFVARGLPVTTDDLASDEFMEKFHSIRESYQ >NC_017571.1|WP_006082675.1|1392198_1393566_-|tRNA-5-hydroxyuridine-modification-protein-YegQ MFKPELLSPAGTLKNMRYAFAYGADAVYAGQPRYSLRVRNNDFKMENLATGIQEAHALGKKLYVVSNIAPHNAKLKTYIKDMEPVVAMNPDALIMSDPGLIMMVREAFPDQVVHLSVQANAINWASVKFWQTQGIKRVILSRELSLDEIEEIRQRCPDIELEVFVHGALCMAYSGRCLLSGYINKRDPNQGTCTNACRWKYDVHEAQQNDSGDIIAMPNAVQIETPTTLGAGAPTDQIYLLQEANRPGEYMPAFEDEHGTYIMNSKDLRAIQHVERLTKMGIDSLKIEGRTKSFYYVARTAQLYRRAIEDASLGKDFDRSLMNQLEGLAHRGYTEGFLRRHVHDEYQNYDYGYSVSDTQQFVGELTGKRNLAGLAEIEVKNKFSVGDSVELMTPQGNISLTIEQLENRKAEAVAAGLGSGHTVYLPVPKEVDLSHGILLRNLPQGQDTRNPHELG >NC_017571.1|WP_006082676.1|1391942_1392194_-|YfhL-family-4Fe-4S-dicluster-ferredoxin MALLIDDSCINCDMCEPECPNQAITMGEEIYEIDPDRCTECVGHYDKPTCVSVCPIDCIDPDPNRVESNDELLVKFAVLTQKA >NC_017571.1|WP_006082677.1|1389553_1391443_-|S8-family-peptidase 
MHKKHLIAVAVATGLAYFPVNANEYQATMVSVPQSKAIKDTYIVVFNTPSVLNLSNNNTIAEFAVQQAESLVNQYDVRVMKNFGNVLNGVLINASAQQVKALLKDPNVKYVEQDQVMSVTPMMEANADQPSPTWGIDRIDQRNLPLDNNYHTDYDGSGVTAFVIDTGVLNTHNEFGGRASSGYDFIDNDYDATDCNGHGTHVAGTIGGSTYGVAKNVNVVGVRVLNCSGSGSNSGVIAGINWVKNNASGPAVANMSLGGGASQATDDAVNAAVAAGITFVVAAGNDNSNACNYSPARAADAITVGSTTSNDSRSSFSNYGTCLDIYAPGSSITSSWYTSNSATNTISGTSMASPHVAGVAALYLDENPNLSPAQVTNLLKTRATADKVTDAKTGSPNKLLFSLANDDGGCGNDCPVDETQLQNNVGIAISGATGSATYYYIDVPANAASLGINLAGGSGDADIYVSQGQKPTTTSYQCRPYQNGNNESCNFTAPTAGRWYVMVQGYSNYANAQLTASYNLNGGGNCTDTNCLTNGVPVTNLSGATGTETLYKIVVPANSQLNITTSGGTGDVDLYVKAGAVPTTTSYDCRPYKNGNNESCSITVTQAGTYHVMLRGYANYSGVQLSASY >NC_017571.1|WP_006082678.1|1388901_1389435_+|SCP2-sterol-binding-domain-containing-protein MSAVFSAKIAQQLLEFTPKAIGFPLAKLPFSFKAKAISQLLGLLLAEQAADDELGFLTDKWVAIQVEDLNLAFEVSFNGKWQVRELTDAQVTFCANSAELILVAAAKEDPDTLFFQRKLSIEGDTELGLEVKNLLLSIEFASMPTPIRLSIAKLAAVIERLQIEAKPTVILAKSPSY >NC_017571.1|WP_037392912.1|1402453_1402888_-|PACE-efflux-transporter MKTTERIFNALLFEAIALAIIIPVSALISGKGTSELVIVGIALSLYTVVWNYFYNLGFDRVFGANRERRTLKTRILHTFGFEGGLIFISIPTIAWFLQIGWLAAMGLEAVFLIFFFFYSTLFHWCYDKYQPYKTWFTMQATKVK >NC_017571.1|WP_006082666.1|1402978_1403866_+|LysR-family-transcriptional-regulator MFSLEQLEAFIETVESGSFSNAARQLKKAQSVISQHVMNLEIDCGVELFDRSGRYPVLTAEGSKLLPYARATILQHRRLKHTALSLFGRQSSEVCLAMDEGIPIQRISEIIQLLESDFPNIELEFLSASSIDIIDMVHSGRASTGLIFSELSMPSSIDFESIGVIEFDLYVASSHPLAKSVAPNMDSLRLYRQLCIRSRNTKTSSFQQAFSPDVWYADNYYVLLELALQGHGWCLLPEHMASSSVSSGNLTRVPIEFEQIGWYANVDVIQHQDKSSLPLLKRLRQLFRELLVGQK >NC_017571.1|WP_011847480.1|1403975_1404599_-|TetR/AcrR-family-transcriptional-regulator MMSTVTQTPCVGRPRAFDTDDALAKALEVFWRKGFEGTSLTDLTQAMGINKPSLYAAFGNKEQLFLKAIELYEQRPCAFFYPALEQKTAYLVVESMLLGAASSLVDKSHPQGCLIVQGALTCSEAGQAIKDTLINRRRDGEIALCERLQRAKDEGDLPADAEPLLLARYVGTVLQGMAVQATSGICPNELRKVAELVLANFPRNDES >NC_017571.1|WP_006082664.1|1405226_1406465_+|efflux-RND-transporter-periplasmic-adaptor-subunit 
MKSNKPLRIMMLTAVAAFVLSACGEQTAQQGPAPSAPMVDVAQVLHERVTEWDEFTGRLQAPESVTLIPRVSGYIDSVNFKEGALVKKGDVLFRIDPSVFEVEVARLKADLASALSAEQLATNDFERAHKLFDQKAVSAELLDTRESNKRQTAAAVASVKAALMRAELDLAYTQVQAPIDGRVSYANVTAGNYVTAGQSVLTSLVSTASMYAYFDVDEQTYLKYVKLTAEKKRNDPRAGDNPVYMALANERDYQHIGIVDFVDNAMDKQTGTIRVRATFDNEDNSLLPGLFARLRTAGSGAYEGILIDDKAVGTDLNNKFVLVVGADGLVEYRGVTLGEKVQGLRIVTQGLEAEDKIVVNGMQRVRPKMQIDPHMVEMVDSDKLEALRQAQLILDKNQDTLTAQAVETASRG >NC_017571.1|WP_006082663.1|1406478_1409637_+|multidrug-efflux-RND-transporter-permease-subunit MLSQFFIKRPIFAAVLSLLFFITGAIAVWQLPITEYPEVVPPTVVVTANYPGANPKVIAETVASPLEQEINGVEDMLYMSSQATSDGRMTLTITFAIGTDVDRAQTQVQSRVDRAMPRLPQEVQRLGIVTEKSSPDLTMVVHLLSPDNRYDMLYLSNYAALNVKDELARIKGVGAVRLFGAGEYSLRIWLDPNKVSALGLSPADIIAAVREQNQQAAAGSLGAQPSGSADFQLLINVKGRLTELSEFEDIIVKVGQNGEVNRLKDVARIELGATSYALRSLLDNKDAVAIPVFQASGSNAIQISDDVRAKMAELSQSFPEGLEYEIVYDPTVFVRGSIEAVVKTLLEAVLLVVLVVVLFLQTWRASIIPLVAVPVSLVGTFAFMHLLGFSLNALSLFGLVLAIGIVVDDAIVVVENVERNIAAGLSPVAATQKAMKEVTGPIVATTLVLAAVFIPTAFMSGLTGQFYKQFALTITISTFISAINSLTLSPALSALLLKSHDAPKDALTRLMDKLFGAWLFTPFNRLFSRASDGYGWLVRKVIRFGGIIGLVYLGMVALTGVQFANTPTGYVPGQDKQYLVAFAQLPDAASLERTDTVIKKMSEIALNHPGVAHSIAFPGLSINGFTNSPNSGVVFVALDDFESRKSPALSANAIAGQLNQEFAGIQDAFIAIFPPPPVQGLGTIGGFRLQIQDRANLGYEALYQVTQQVMYKAWADPQLAGIFSSYQVNVPQLELDIDRTKAKQQAVSLDQIFQTLQTYMGSTYVNDFNRFGRTYQVNMQADEAFRQSPQQISQLKVPNLNGDMIPLGSFINVSQSSGPDRVMHYNGFTTAEINGGPAAGVSTGQAQAAIEKILAETLPNGMTYEWTELTYQQILAGNTGLLVFPLVILLVFMVLAAQYESLSLPLAIILIIPMTLLSALSGVLIYGGDNNIFTQIGLIVLVGLATKNAILIVEFAKEKQDHGMAPMDAILEAARLRLRPILMTSIAFIMGVVPMVFSTGAGAEMRQAMGVAVFAGMIGVTVFGLILTPLFYYALAKRGSKKAEEQKELTVS >NC_017571.1|WP_006082662.1|1410219_1410897_+|methylamine-utilization-protein MILNRALFALVLSVSASLTPQVMAAELNIKLVDLQQQAIADTVVELIPAVMPDKAMPVGHYEMSQKNRTFIPFVLAVPKGAKVDFPNLDRTRHHVYSFSEAKPFELKLYVGQAEAPILFDKPGLVALGCNIHDYMQAFIYVADSPIVAVTDEKGEIQFKDLTAGQYKVKLWHPWQKAASEPTELMLTSGANQLSLSLDIERQAKPSAPPSGFGHYDAQGKPLHAK >NC_017571.1|WP_006082661.1|1410886_1413193_+|EAL-domain-containing-protein 
MLNSFRARLILVFFIVLSLVQFATAFSVLTATERDNFLQQQKSLDIGANVFLEVLSNRGIQLSQSLSVLSADFGFKRAIATGEQETIESVLSNHGSRIGADAAVLLSPKGELLTSSLPGLTQDDIQSLFDMTTSNNNALAILNFDHASYQFVLQPVKAPTLIAWVGMGFLLDEKVAQQAKAITGIDVSFVNQATGRTEIASTLSDDEKRSVVEQASLFPSLLKMPSENIPVDYLSMALKLYDHNGSQLALLHQSNLKWQQSYQHLRNNMLLIFALTLALAIVIAIWLSGSLTEPVHQLVTYARKIGQGKQPGSIQGAPAELRVLAKSLSLMRDDIEAREKDLVYQSRHDSLTGLLNRFAAKQHLADLKHELAGAMVLLDIKHFRHINDIIGFANADTLLVLFARRLEQLAPTPDLLARLDGDSFLLLYSQGIRPEHLLKSLDMLESPFPIQGSNISLTVRAGLLEIDGNGADIDVLVRRAEIALNQACLEEQRIVSYRQGADEKYQRELTIIRDLPIGLAQNQLYLVFQPKVDLHLNQCTGAEALIRWQHPTLGFIPPDEFIQLAENSGNIDIVSQWVLQQAIQQLVAWQQRGMLLKLAINLSAHDLVDTRLPNQIASLLKDNNLPSGALCIEVTEGAVMKDAQTVVSVLQRFRDMGVSVAIDDFGTGHSSLAYLKILPVNEVKIDRSFIKDMLTNSQDVMIVNTSIQLIHGLGFTVVAEGVEEPEGVDILRNLNCDIIQGYVFSKPLKAAEFDLWFEAFNHANPSDT >NC_017571.1|WP_037392910.1|1413278_1414148_+|DUF3034-family-protein MNALKLNSLAWNCAKALPCLFLLLGTVPAFAEGSRVVATGGGTTIEGSAGGGIVPWAVINGYGSSDEWSATAMATGVYVDDFSLKVIGASLSFDNRFELSVARQTFDLDTMGGELGQDIFGVKYKLAGELLYTAMPQITLGAQYKRVDDFAIPQAVGARDDWGLDVYIAASKVFFDAVAGRNLLLNGTVRATKANQTGLLGFGTLASNDYQFVLEASAAVLLTDNVALGIEYRQKPNELAFAREDDWQDVFLAWFINKHLSVVTAYANLGSIAGFADQQGWYVSVEGTL >NC_017571.1|WP_006082659.1|1414144_1414621_+|group-1-truncated-hemoglobin MSRLLLKLLPVAALFFFTGCAVQTAESSAPQVNSNNGNSSQQTLYQELGGEAGLSAIVDGLLARIATDPRIVHHFQETDIALFRERAIEHFCLIADGGCVYHGENMALSHQGLNITQADFDALVGHLIESMKEQHIPLSTRNALLKLLAPMYSDITYH >NC_017571.1|WP_006082658.1|1414700_1415174_-|NYN-domain-containing-protein MKKIALFVDVQNIYYTCREAYGRQFNYRKLWQHLGYEGDIALAVAYAIHKGDDGQLKFQDALKHIGFEVKLKPFIQRSDGSAKGDWDVGITIDIMEAASEVDTVILLSGDGDFDLLLQKIHQKYGVETQVYGVPTLTAKSLRDAASQFHPIDEALLL |
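
The spacer and array records above can be cross-checked against each other. The following is a minimal sketch, assuming that the spacer header fields are `index|start|length|contig|tool`, that coordinates are 1-based and inclusive, and that the array is laid out as repeat, spacer, repeat. These field meanings are inferred from the NC_017571_1 record and are not documented on the page; `parse_spacer_header` and `check_array` are hypothetical helper names.

```python
def parse_spacer_header(header):
    """Split a spacer header such as '>1.1|1400955|32|NC_017571|CRISPRCasFinder'.

    The field meanings (id, start, length, contig, tool) are an assumption
    inferred from the record, not a documented format.
    """
    spacer_id, start, length, contig, tool = header.lstrip(">").split("|")
    return {
        "id": spacer_id,
        "start": int(start),
        "length": int(length),
        "contig": contig,
        "tool": tool,
    }


def check_array(array_start, array_end, left_repeat, spacer, right_repeat, header):
    """Check that a one-spacer array is internally consistent."""
    info = parse_spacer_header(header)
    array = left_repeat + spacer + right_repeat
    return {
        # Inclusive coordinates: length is end - start + 1.
        "array_length_ok": len(array) == array_end - array_start + 1,
        # The spacer should begin right after the first repeat.
        "spacer_start_ok": info["start"] == array_start + len(left_repeat),
        # The declared spacer length should match the sequence.
        "spacer_length_ok": info["length"] == len(spacer),
    }


# Values copied from the NC_017571_1 record.
repeat = "TATCATTCGCTTGCAGCGATAGGGGATTAGTCAGCGACT"
spacer = "CTTCAGCACCCACAACACTCAGATAACCTTCT"
header = ">1.1|1400955|32|NC_017571|CRISPRCasFinder"

checks = check_array(1400916, 1401025, repeat, spacer, repeat, header)
print(checks)
```

The same check can be applied to the other single-spacer arrays by substituting their coordinates, repeats, and spacer header; arrays whose flanking repeats differ slightly need the left and right repeat passed separately.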
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_017571_2 | 2089205-2089470 | Orphan |
NA
Consensus repeat of NC_017571_2
|
2 spacers
spacers of NC_017571_2
>2.1|2089266|58|NC_017571|PILER-CR CCTGTTGTTTATAATGAGTCGAGCCCGATGAGCTGTTTCTCGTTATAAACACATATAG >2.2|2089385|61|NC_017571|PILER-CR GCTACCATGCTGCTCCATCCCGCGTCCGATATCTATCTGTTTTACTTTATTATTTTGCTTT |
CRISPR arrays and Neighbor proteins around NC_017571_2
The CRISPR arrays of NC_017571_2 >merge|NC_017571|2|2089205-2089470|PILER-CR TGCCGAAGCCGGATTTGAACCGACGACCTTCGCCTGTTGTTTATAATGAGTCGAGCCCGATGAGCTGTTTCTCGTTATAAACACATATAGAAAGTGGTTGCGGGAGCCGGATTTGAACCGACGACCTTCGGGTTATGAGCCCGACGAGCTACCATGCTGCTCCATCCCGCGTCCGATATCTATCTGTTTTACTTTATTATTTTGCTTTAAATTGGTTGCGGGAGCAGGATTTGAACCTACGACCTTCGGGTTATGAGCCCGACGAG >NC_017571|2|1|2089205-2089470|PILER-CR TGCCGAAGCCGGATTTGAACCGACGACCTTCGCCTGTTGTTTATAATGAGTCGAGCCCGAT GAGCTGTTTCTCGTTATAAACACATATAGAAAGTGGTTGCGGGAGCCGGATTTGAACC GACGACCTTCGGGTTATGAGCCCGACGAGCTACCATGCTGCTCCATCCCGCGTCCGATATC TATCTGTTTTACTTTATTATTTTGCTTTAAATTGGTTGCGGGAGCAGGATTTGAACCTACG ACCTTCGGGTTATGAGCCCGACGAG
>NC_017571.1|WP_006082079.1|2088107_2088521_+|hypothetical-protein MGLILLPLLILWLGVGIYAIRIGYQVLAGTSQLSYTLSVCAIAMVALLLYLYFGFAQFKENKALWAFEIPMFFAANKLAFGVMILGLLLHWFGQGVLTSAYLKPLPFIMIFTVSFGAMAGVILSDTFMAKFEIQKTH >NC_017571.1|WP_006082080.1|2087351_2087843_+|GNAT-family-N-acetyltransferase MEKITIRHAEKRDAAAIQAVYACPNAYTGTLQLPWPSTDKWESRLAATSNHISRYVAEIEGEIVGELGFEVYEQPRRRHVASFGMGVKDSYQGRGVGSALINAMLELTDKWMNIKRIELTVYTDNHAAIGLYKKFGFVIEGESKDFAFRNGEFVSVYHMARIK >NC_017571.1|WP_006082081.1|2086323_2086923_+|recombination-protein-RecR MKFSPLLDELIQSLRCLPGVGPKSAQRMAFQLLERDRKAGLKLASALSSAMSDVGHCQSCRTYTEETLCPICASHKRGTSSTICVVETPADVLAIEAGGHFTGRYFVLLGHLSPLDGVGPEELGLALLERHLASGDVAELILATNPTVEGEATAHFIADMARRHKVMISRIAHGVPVGGELEYVDSTTLALSFNGRLPL >NC_017571.1|WP_006082082.1|2085981_2086311_+|YbaB/EbfC-family-nucleoid-associated-protein MFGKGGMGNLMKQAQMMQEKMAKVQEEIARMEMVGESGAGLVKVTMTGAHTVRKVEIDPSLMEDDKEMLEDLIAAACNDAARRIEENQKTKMAEVTGGMQLPPGMKMPF >NC_017571.1|WP_037392567.1|2082299_2085758_+|DNA-polymerase-III-subunit-gamma/tau 
MSNHDMLSFKVLGAHAPALVSHTLGSSMSYQVLARKWRPATFDQMVGQSHVLHALTNALSQQRLHHAYLFTGTRGVGKTSLARLFAKGLNCEQGVTATPCGVCGSCVEIAQGRFVDLIEVDAASRTKVDDTRELLDNVQYRPTRGRFKVYLIDEVHMLSRSSFNALLKTLEEPPEHVKFLLATTDPQKLPVTVLSRCLQFNLKSLTQGEIGTQLNHILTQEQFPFDAEALKLLAKAANGSMRDALSLTDQAIAFGGGNVMLTQVQTMLGSIDEQHVVALLKALTDADIGVLMHTCAQVLAYGADAQEVLRSLLELLHQITLTQFAPAAAQQSLYSAQIRAFAEQLTPEQVQLYYQILLTGRKDLPHAPDPKSGLEMALLRAVAFVPEKPVKRWQPDAVAEIRLPEGQTPVAATAAAQAPHVNEQQFAEPHTAETHTAETHTAEPHSTEPHIAETVPAEKKTALIELTSANQHVETAQAAAVTTAEASIEPADTVEQALDAELEADAALIAEQAVILSQAQSQGFNADSLNAYTVNAEVEPQTQTVSTAQAPSIHISEPVADIEASVASLDAHNSEVPELIGATAESSIVNPAMAETVVGSTSADNNSAAENTLDNNPTANNTLEQNGLDEHSPYGTEAEHSYPAMGAYAQESAPLDSYQDAYVEFSSGSYNDDDFHHSFDSHAVPDDVQHSANVQHSDSVQHSADVQSTASIQSALGMQGSVDSQSAAMAQVSPQTLTPISIPTLASASLADDDILSAVLAARDSLLSDLDALSVKDGDEKKSSLDSKLKTANSKANGISLSKPVSKSDVSQTSASFSADPAADLDPDLTIDFDDDFDLDLEPMALHQSVPSAASVSGSVPAEPVKPAAYDRPPWEAAPEVASTAELIETTQVDSGNTDIADAVMAQGSDINDDSASSCQGSLDTSNTANDGQNNAADGNDSQSAEQDSGSKSQALQTQARHQDKVQATALTPASTTALTQTASAVERTQEREVALSTLPISGHPLDLHWYKLMASLEVGGRVRQLAVNSVCQTQSDPLPLLLKPNQKHLAADVAIVQLEQALSAALGNPRRVQVVIGIDAQRETPLELRKRFHQELLQQAHQSLIHDDNVQWLIQRMGAELDADSLVYPPELLNLRSQQIQALPELTEAAS >NC_017571.1|WP_086010671.1|2081710_2082262_+|adenine-phosphoribosyltransferase MAMNTETLSLIKQSIKTIPNYPKEGILFRDVTSLLENAAAYKATIDLLVEHYRGQGFTKIVGTEARGFLFGAPLALELGVGFVPVRKPGKLPRATISQSYELEYGHDSLEIHTDAINPNDKVLVVDDLLATGGTIEATVKLIRQLGGEVKHAAFVISLPDLGGEARLTALGLELVKLCEFEGE >NC_017571.1|WP_006082085.1|2081074_2081443_+|YbaN-family-protein MVLKRGLFLLLGLTALALGLLGIVLPLLPTVPFILLAAFCFARSSERLHHWLMTHPWFADALTQWQEQRAMRKGLKRKAMVVSALSFSVSIIVVPILWVKGLLLVMALVLLWFLKGIPEIEG >NC_017571.1|WP_006082087.1|2079973_2080720_+|putative-DNA-binding-domain-containing-protein MDFKQVQQSFIDYIRDPSRPLPADTDVRRMQVYRELFFNNVLGFVSNGFPVLKSLYSEEEWLALVQSFFSQHDCQSPIFIDIAGEFLEFLQQEYQPTANDPVFMLELAHYEWLELAVAVAQASGDESQLSPAQIPTQALCLSQTAKVAQYHFEVHHIRQDYRPQQQLDTPVFFCLYQDADCEVCFLQLNPLSAQVLAFLQAQGQASFKEILDWLTITYPQMAPEIIAQGCSQLLEQLAAKGIVRGRQS 
>NC_017571.1|WP_006082088.1|2079104_2079974_+|DUF692-domain-containing-protein MNDQKNAAQVGLGLRREMLSEFCESVPEAINFFEVAPENWMTLGGKFGRQFRELTEQHTFYCHGLSLSIGSPEPLDLAFVKNIKTFMDLHQIQVYSEHLSYCSGQGHLYDLMPIPFTDEAVKHVAARVKQVEDILERPLILENVSFYAAPGAHMSELEFVNTVLQEADCKLLLDVNNIYVNSINHQYDADAFLQAMPTERIAYLHIAGHYKQAEDLLIDTHGAAINDPVWALLQRCYALHGVKPTLLERDFNVPTTAELLLELNQIHAYQAAAPSFHIQNRSHVVKRIA >NC_017571.1|WP_006082089.1|2078558_2078990_+|hypothetical-protein MNSVKKTAVAVALGSVVMSSAFAVNAQTNPFGFEAMDAGYQIVGSEGKCGEAKCGADMKKAAAEKAKEGKCGEAKCGAEMKKAAADKAHEGKCGEAKCGADVKKAAMAKAHEGKCGEAKCGADMKKTAEKVDAKVEAVKKEMK >NC_017571.1|WP_006082078.1|2089750_2091880_+|bifunctional-23S-rRNA-(guanine(2069)-N(7))-methyltransferase-RlmK/23S-rRNA-(guanine(2445)-N(2))-methyltransferase-RlmL MLNFFAAAPKGFEYSLAQELTEFGATEVKESVAGVYFTASLALAYRITLWTRLASRIVLVIYKGSCESAEQLYNAAYCVDWPAHFSNKSTFSIDFHGTGGFLNNTQFGALKIKDAIVDRFRDDDIERPNVSRVDAEFKVDAHFRNGVITIAMNFSGPSLHQRGYRSTTGEAPLKENLAANMLVRSGWQAAPSTLLDPFCGSGTVLIEAALMAADIAPGLQRSRFGFEHWRRHDKAVWQEIVEEAKARASLGVKRCEIKFYGSDIDSRLVALAKRNAENAGVLELIEFQVADALTIAPPAESGYLITNPPYGERLGNVSELLQLYYQLGDKFKKEFGGWKVAMLCSDIELVSSLKLKADKQMKMFNGALECAFNIYTLHANSTRRDTPVLPDGVDIADIAPAFANRIKKNAKLLEKWAKKEGIDSYRIYDADIPEYNVAVDKYLDYVIIQEYMAPATIPEAVTKRRLSDVLLALPSAIGINPNKMIMKTRERQKGTSQYQKLDERKLELITTEYGAKFKLNLTGYLDTGLFLDHRLTRRLVGQKSKGRRVLNLFSYTGSASVHAALGGAKSVTTVDMSNTYIAWAKDNFALNGLQGKQYEFVQSDCMQWIRDCNEQYDLIFIDPPTFSNSKRMEDSFDVQRDHVNLLASLVKLLSPTGELVFSNNKRKFKMDIETLTKMNINVTNIDDVTLPMDYKRNPHIHNTWLITHA >NC_017571.1|WP_006082077.1|2091872_2092127_+|glutaredoxin-family-protein MPELTQAERDNRTYLLYHTDGCHLCELAAALLDAADIGYQAIDICDDEYLAQRYGVSIPVLKAWDDRELHWPFNATQLQEFTGA >NC_017571.1|WP_006082076.1|2092128_2094054_+|ABC-transporter-ATP-binding-protein 
MSLVRINSGSLAYGYTPLLQKADFTIQRGERVCIVGRNGAGKSSLLKVLSGDVLLDEGEFNIAGNVSVSRLQQDPPKAEQGTVYAYIAAGLKEVGEALERYHQLSHDVAHADPEQMDRMLNEMQGLQETLDHYNGWQLDSRIQQNCELLGLDPDKSLSELSGGWQRKVALARALVSEPDLLLLDEPTNHLDIDTIEWLEKFLLDYQGAIVFISHDRGFIARMATRIVDLDRGVVTSWPGNYQMYLDGKQEWLRVEAEKNALFDKRLADEEVWIRQGVKARRTRNEGRVRALKALRDERSERLNRQGNAKMAVSDTERSGKLVFDVQDLNFNLPDKNLVKNFNTTVIRGDRIALIGPNGCGKSTLIKLLIEKLQPQSGEIKVGTKLEIAYFDQYREALDPEQTVEDNVGEGKKTITINGQDRHILSYLQDFLFSPMRARTPVKALSGGEKNRLLLAKLLIRPANLIILDEPTNDLDIETLELLESLLTEYQGTLLLVSHDRAFIDNTVTSSWWYAGNGHWSEYVGGYQDAVNQGAKFYSEEPSSQKAVEAPAVETKAVEVKAAEPAKAVKKLSYKLQRELESLPTVMEQLEADILALQTTIGHSDFYSQAQDKVNQVLSQLADKEKQLEVCFERWEELESLK >NC_017571.1|WP_006082075.1|2094082_2095918_+|DUF3466-family-protein MKLQLDKALSLVALGVLGVLNSAHAAPVYEIVNIDSYDLQGTLEGTRSGYALGVNANDELVGISKGKKKLSSSDVEGGVIDIADGIAPEETITYSIDKAIIANNFAFVAKENDDTSKPWLPTFDSINGTTPPSDTAVINSVDTFYYGINNAGIKVGSMTAPEKKTENTATANVADDYWYYRDYEFRGVAKSGSTEIPLVPPYTQFVNADKTKTVELGGWSAATAINNNNLVAGYASTAISKYGSDRVNYCLGTDNTLPLDICVQREQYPNSTGTRNIQYQTRAYVWQIDNDVATGSELPLGLTPATDNTLTFTAQALGLNDNGIVVGRSHVYRNGNSKALAQDAAYWAKDTEGNYQYHWIPMGSSIENSIAYGVNDNGIVVGSYRSYIQGYRRDKFFVFDTNTPDVAYVTPNDFASTTTDLSSKPKDINNKGQVVGYIETTYDKEKPRPKAGFLYEKSTGEFNNLNKLLTCESKGYEKASDGSWARHQIEVQDGSGKILQYNADILVVEGSSINEEGTIVGTAFIRKPSYQFDKDGNIVIGENGLPLFELSGSGEPVTAYIPRMVVLKPASSGEACTVEDNTDTGNFERSGAATLAWFFALPLVWFRRRIR >NC_017571.1|WP_006082074.1|2096105_2096282_+|ribosome-modulation-factor MKRQKRDRLDRAFSKGFQAGVGGRSKELCPYANLDSRSQWLGGWREGVDGRVNGLFNK >NC_017571.1|WP_006082073.1|2096405_2096921_-|bifunctional-3-hydroxydecanoyl-ACP-dehydratase/trans-2-decenoyl-ACP-isomerase MNKANSFNKEELIACGHGKLFGPNSPRLPVDNMLMMDRIVTINDNGGEFGKGEIVAELDINPDLWFFDCHFITDPVMPGCLGLDAMWQLVGFYLGWEGAEGKGRALGVGEVKFTGQVLPGAKKVTYKLNIKRTIHRKLVMGIADAILEVDGRQIYSATDLKVGVFSDTSTF >NC_017571.1|WP_006082072.1|2096995_2098729_-|AAA-family-ATPase 
MNSLLIPTSSLAPEFSLPTLSETAANLSALLLGQERTVDAFKLHQAIVDQHLYLADFPSIDRQLMIKACIDSLAPLSPAYLVATRPIDKAVTFQWQSDKPQENQGSIAEKTTHYRYLSGNIRRADLIGRMAQTGTTSQYQAGALAQCHYVFICAESLWKREGLWELVMQILTHKEYQISSNLSPIPLNCKIVLVGAGMIYSQVRTEDWQFQRHFTLLAELASEIDLVRYKESQYVAWLQAVAQSVDVTLEQSSLAPLFRYSARLTEHQRRLSLSMLEFAQLMMQAKAYRGKPSINASSIDHALTQANYRHNSSEEYSGQSFDDNFINLPTSGAMVGQINGLTVIDAIDYSYGEPARITASVHYGDGEVADIERKSELGGNIHAKGMMILSACLYRIFGRDAPLHLNANIVFEQSYQEIDGDSASLAEYCCLMSAIAEQPIIQSLAITGALDQFGNVQAIGGINEKIEGFFNLCERRGLTGEQGVIMPKSNVLQLNLNPKVITAVGKGLFHIYAIEHMDQAVELLMQMPAGVADEDNDFPHDSLYGLVQERLDKLAGNGEEEIGLITRLLAKLGFFRR >NC_017571.1|WP_006082071.1|2099928_2101926_+|methyl-accepting-chemotaxis-protein MLLPTRISQRLTMGSIVLLLLTGLAVFGVMSLRGQPRVVAASQELIEQTGSSIVRQLALKLASIEGITLSLAHLAEVLPHEQALYLNSLPNLIDNNGDLTIAGGGIWPEPDKFVAGSTRHSFFWARGSDNLLAYSDDYNAQSGPGYHNDPWYTGARTSSVSQCLWSEAYQDTVTKVAMVTCSVPYRLTGTFAGVATIDLKLDDLAKFLTEHGNVTDGYAFALDQAGNILHFPEANKSDTMLRFTDLVGQLPWLAPVEAALKQTSINNVISVSLDKDGRLNQASEVSLFVMPDTGWVIGLVTPKTRVTGLARELTFDILLFLLPLLAILLYLAWLSGKKLLAQLEETTDQIASLGSGNLGANVELQIQRDDEIGALRRAVNTYAGTLRAMLDLITQEAKKVQFEATQLSGLSHRLAQRAEQQRLESAQLAAAITQMSSSAMEVANNTNDCAATAQSSLIVVREGQTRVASSNASIEALAGEIANATKVISQLAQDSQKVGAVLDVIKAISEQTNLLALNAAIEAARAGDQGRGFAVVADEVRTLAGRTQDSANEISTMINALQSASRLAVQAMQTGESRTVHAVAEAEGAASSLSSTVQSFDDISQRAQQIALAAQQQSQVTQEINELAVRINSISEDNSLDATALDALSLEMQKLSDRLININRA >NC_017571.1|WP_006082070.1|2103268_2103913_+|UvrY/SirA/GacA-family-response-regulator-transcription-factor MISIYLVDDHELVRTGIRRILEDERGIKVVGEAPDGETAVQWARQNEADVILMDMNMPGMGGLEATRKILRYQPHAKIIVLTVHTEDPFPSKVMQAGASGYLTKGATPPEVLQAIRQVSRGQRYLSPEIAQQMALSQFNPADENPFKSLSERELQIMLMITNGEKVNDISEQLNLSPKTVNSYRYRLFAKLGISGDVELTRLAIRYKMLDAGHF >NC_017571.1|WP_037392563.1|2104016_2105846_+|excinuclease-ABC-subunit-UvrC 
MSNEFNAQSFLRTVSSSAGVYRMYDVKNDVIYVGKAKDLKKRLTSYFRKNLANVKTQALVSHIHHIDVTLTHSETDALLLENDYIKQYMPKYNVLLRDDKSYPYILLSQHEHPRLAYHRGPQREKGHYFGPYPNGGAVRESLHLMQKLFPIRQCDDLYYKSRSRPCLQYQLSRCSAPCVGKVSNADYDEQVKLASLFLKGKDQQVISALVDKMELAAERQAYEQAARFRDQIMALRKVAEQQEVSNNKGDMDVIGVHYSSGIACFHLLFIREGKIFGSRSYYPSVPAQTDMDEVLRSFILQFYLNADIQRTIPKEVVISHNFEELHELEAAVSEALDKKFSIKTNVRADRASFLRLAVTNATNAVVTRLSHKNTVEQRFVLLEEILELSTPIQRMECFDISHTMGESTVASCVVFNREGPHKGEYRRYNIEGITPGDDYAAMKQAVSRRFDKIEAGGKIPDILFIDGGLGQLRIAQKIVDEKFVHLDKAPQLIGVAKGEGRKPGLETLILGDTETSFSLEGDSPALHLIQHIRDESHRFAIAGHRNRRQKTRNTSTLESIPGIGPKRRKALLQHLGGLQEVKGASVAELVKVPGISIEMAQTIHDALRG |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_017571_3 | 3324127-3324197 | Orphan |
NA
Consensus repeat of NC_017571_3
|
1 spacer
spacers of NC_017571_3
>3.1|3324153|19|NC_017571|CRISPRCasFinder ATGCTTTGTCCCCTCTTCG |
CRISPR arrays and Neighbor proteins around NC_017571_3
The CRISPR arrays of NC_017571_3 >merge|NC_017571|3|3324127-3324197|CRISPRCasFinder AATAAGTGGGGACAGCCATGTCTTTTATGCTTTGTCCCCTCTTCGAATAAGTGGGGACAGCCATATCTTTT >NC_017571|3|2|3324127-3324197|CRISPRCasFinder AATAAGTGGGGACAGCCATGTCTTTT ATGCTTTGTCCCCTCTTCG AATAAGTGGGGACAGCCATATCTTTT
>NC_017571.1|WP_006081043.1|3323728_3324106_+|DP-EP-family-protein MFAPQGLAQFVKVNVTLENGEPVFIYTDASGEQCPGDVTITQAGTVTYLLNDQTGKGLKFVGVGFVTPFDEIVDAVTISSDGMLVQLVDLDRTPGKTKFQFVLSNTTNTLLVLSPDPEIINRPQN >NC_017571.1|WP_006081044.1|3320479_3323707_+|winged-helix-turn-helix-domain-containing-protein MSDSTFFFGEWQINPSANSLLLGKQVKQLEPKAMDVLLFLCQRAGEVISSDEIVSHCWPGVDTGDNPLHKIINQLRRALGDSATDPTYIETIRKRGYRTLAEVRFPIGHEATASPQSWQGGSPFPGLQAYKANYAEVFFGRSEQISTLLNRISQQIIYGRAFCLVLGPSGSGKSSLINAGVVPNLMKGGGYNGIGVASFSSLDFADVSKGQLLTDLASAMLDWEINDTPVFEGMSADTLAVQLVEDIQGIINQCTQALKVQPFTQPFFALFIDRLEVLLSSPLFSDTERTVFVELLEQLATSKAVIIISACRNDFYPLLVGFPSLMAGKSRGAHFDLAPPTRTELLQMIRLPAVAANLSWEVDSETAMPLDEMLCSDAASNPDALPMLQYTLQALYLQRSADDKLLVSVYHALGGIEGAIGKNAEQAISHLSDAEKASLPRILSLLVTLREDEKSITSRTARWSQLQSTAETALVQAMVDSRLFVSHLQNGEPCFSIAHEALLRRWPRATAWISEHSDSLSIKSRLQHLSTRWLSEAKHSAYLLAEGKPLKEAQSLSQNPLFDLDDPETAFITASTKRANMLRWTRRLTVTLLCVLTLTSIIMSVRSIEAEKLALQKRLAAEDLLGFMVGDFADKMRGIGRMDLLDGISNKALEYFSDFSSPDDEKYLSFDARFQHGQTLEAMGEVAYSRNKIDEARSALIAAQEKMLPLLKLQPENLALLKTLGANAFWLGQLKYDVSDWAASRPFYEQYLKYSQTMYALAPEDKDALMELSYAHNTLGSVSMKQQDFAKAQQDFEESLRLKLLALAKAPEDSQLIADVANARSWLASAAVSQGDVLSAIQIHIQLQQELGKNIKQPYILDRLSASHQILADLYGYQNQPEQALEQTKLGLEAISTALEKDPKNEIWIKQKYYQKFNILALSHADIDEVKNLKSLLYSDDELNSSSKKDEIYASFFLAAAKNLQQQDKPTESYEFAIQAKKEFLKLSKKYHQNTTYISKLSESILLEAKISKDKHTQYTLCLYSKNLLSPIIERDKSPNFTVSYIKSLDCLGEVINDETLKKLLLESQILNISF >NC_017571.1|WP_006081045.1|3318654_3319983_+|DUF4785-family-protein MHTLSKLASVLALSGLLAACQDEATPIDPKTNASPMYAKGLTLAAPSNTDIVDSNIASPTLAPINTSTDYISFITPLYGEYQASAPQLEQASQSDEYWVNVTGAQLNAGVGLTMSQASSLVRIAPRGDTSSGALMHAEAIAPERVQIQHVSPNQSPQGKAANSSGINTNESLVKSMANADALASAGLTDDSSALQMSAKATPGQYRLQISQPLTPSANYLVNVKEKGSPYQLSVKASSAIAADAQTLGLELALSQSDNTFVPQATLKQADGDMQPLTMVKQGETWQAVIPAGVALSSSNAGFSEVEITVQTQVDGRPVQRTVKSVFKSYVNSASIKPEVLTVWDKGLPNQINFELSVAEAGRFGLSGTLTGTNAEGQKVAILRTQAANWLTPESPKLKLMLDPKLIQASGLQPPFELNELELQDQGQMARLSYQATALILTR >NC_017571.1|WP_006081046.1|3317785_3318643_+|hypothetical-protein 
MKIKKALMLLGLLAVSPATLAATGVAFIHGTGHQTNALADYWTPEFVNSVRQGIPVPNNYTVINCDFDQYMWEESAAGCLATQLNSFIQTKNIDDLILLTHSNGGNVVRWILSNPTWDSRYPAVINATSKVIALAPSSAGTPLADAVNQGNVFETSLGWLLGYGSNAVKQQQVGWMANYNNTMLYGTAGRPSLGKPFEVVVGSDVDSAIWDSDSYCAGYQYQVALETTQNWLDSCSDGFLNCSSQKAAGTVWFTDKQKTRGQEPLSHQQSRRACFNLDTLIRNHI >NC_017571.1|WP_011846399.1|3316538_3316886_-|hypothetical-protein MVSCLSGLQCRTNFTIKNITEYMLPDTKEAFYLHLDGKSPNLIIRPAFEVFSGELATIAGVHAKYDYFHNAEMTRFPKRLHKSLTETHYGLAFSFDTVEAVQQFVSRLGAIVKGA >NC_017571.1|WP_006081048.1|3315712_3316414_-|TetR/AcrR-family-transcriptional-regulator MSLIPGRPRGVVPFTAQPESPPILGRPKGNSDARQRLITAALSLFSHRSYPTVSTREIAREAEVDAALIRYYFGSKAGLFEQMVRETLEPVLTRLREISAAEAPNNVGEIMQTYYRVMAPNPGLPRLIIRVLQEGDGSEPYRIILSVFEQVLTLSRQWLESTLVNSGLLKEGVDPDLARLSFVSLMVFPLIAPPVLMRQFGFFDADAALLQRLALHNMQVFTQGLLCEPRSQS >NC_017571.1|WP_006081049.1|3314576_3315716_-|HlyD-family-efflux-transporter-periplasmic-adaptor-subunit MSSTKGLTMLNDQDFSRNRFSKNNSSNSLSNKRLSSRKLSNSNLSNNKLTNQTALAKRVATGVVLGCLLLQLAACSKESPSVLGTVERDRLTLTAPVGELITQVNVVEGQQVKAGEVLLTLDSTSANARLALRQAELEQAKAKLSEAVTGARLEDIERAKAVLDGANASVKEAQRAFERTNRLYATKVLSQADLDTARAARDTSLAKQAEAEQSLRLLENGTRSEQLEQAKAAVAAASASVAIEQKALADLSLVAARDAVVDTLPWRVGDRIAAGTQLIGLLASEDPYVRVYLPATWLGRVKAGDKVNIRVDGREMPIAGTVRNIRSQPAYTPFYALNERDRARLMYLTDITISAAGQDLPTGMALAVELDVQPKVKQP >NC_017571.1|WP_006081050.1|3313623_3314580_-|ABC-transporter-ATP-binding-protein MSHADFAIETQGMTRAFGGVNAVENLDLAIPKGTIYGFLGPNGCGKSTSIRMLTGLLSPTSGEIRVLGETLPGAEEKLRRRIGYMTQKFSLYDNLSVRENLEFVAQIYGLNRRASKLRIAELVTLYDLVGREKQMAGSMSGGQKQRLALAAATLHHPELLFLDEPTSAVDPENRREFWERLFDLCAQGTTILVSTHYMDEAERCHGLAILERGIKRADGSPQQLMAAMGARVVEISGDDLRNLKQSLISESAVLSAAQIGSRLRVLVRSDIEDPLAWLKPRVASRTMEEVRASLEDVFVTCTGERLAGVSKGETDHVA >NC_017571.1|WP_006081051.1|3312512_3313634_-|ABC-transporter-permease 
MWRRIFAIVVKELRQLSRDRMTFGMIVMIPLVQLMLFGYAINTDARHLPAGLVNLSDSAYSRSLVQAVEATQVVDFKQRYLSAAEAEAAITRGEVKAVLYLPADLDERLVRHPSFAGQQYLAQPVGQWLVDGSDTVVASTIRSLRQMPLDEVAGRAQKTVMPSFEVVQYFNPEQRSVVNIVPGLLGVILTMTMVMFTSAAIVREREQGNMEFLITTPVRPLELMLGKIVPYVIVGFVQVTIILSAGHLLFDVPIRGGIDSIALAAMLFICASLTLGLVISTIAKTQLQSMQMTVFILLPSILLSGFMFPYEAMPIAAQWIAEALPATHFMRMSRAIVLRDAQVMDLQFDALWMIGFTCIGLFIASMRFSKRLD >NC_017571.1|WP_193377961.1|3311431_3312058_-|signal-peptidase-I MLPLIVLRGFFFEPFSIPASSMKPTLAPANHVLVSKYGFGNYRYLGFQLAKSTPSVTPARGDILVFQYPANPAIDYVKRVIGLPGDRIIYRDKTIFVQKACNVSREACAGLDSQYDLIDKTLLPELSTETQAVYQESLDDIHYQVLLLRHQKELVDRYYVQENTLRGEWLVPAGQYFVLGDNRDNSVDSRYFGFIPQDLIIGKVIYIW >NC_017571.1|WP_006081042.1|3324600_3325647_+|hypothetical-protein MTSARRMLIDANTTPFYHVINRCVRRAFLCGEDKLTGRSYEHRRGWIVDKIKALSAIFCIDICAYAVMNNHYHLVLKIDVEEAKTLSPKEVISRWCQITKGHTVATKYMNGEALIDGERLLLDGLITEWHERLSSISWFMRCLNEEIARKANREDECKGAFWEGRFKSQALLDEQALLACMMYVDLNPIRAGIADSLQSSDFTSIQERISALKQVNNIIISPQAPVQSAENKPQTINKSLAKFDAATHSSQQTGIPFHFADYLELIDWTGRAIRHDKKGFIDNQRPKLLNELGIASDAWLSSAKDFRRQYSGISGRWDAMCAFKLQQGGKWCRGKSSSEALHPNQPLG >NC_017571.1|WP_006081041.1|3325953_3326982_+|diguanylate-cyclase MTSSPKQTRSQHPLSLVGEDNGLQCTAQQGLVEPALPLTFFEHASIETIQQFIQHLTEPVIIVNAQGYIYFSNSKGADLLTCPQASLKGQDWRNFLTEHHQARYDNLLSNDIKLGKNCGIPVQHCAEEITLITASGKAKDVELSISYIPSHEPLFVMMMHDLTQHKAENQKLRKLAATDSLTGLANRRYFDEMLHHYWEECTGKLRPISVVIIDVDYFKVFNDQFGHIQGDECLRKIAKVIADIVPIDIGLAARYGGEEFALILPSHNAKMALTIAQKVQQGINDLRFTEQGLRDYVSVSASQGIASEINGQFRTSLAMLCAADTALYRAKADGRDRINTSL >NC_017571.1|WP_006081040.1|3327129_3329724_+|FAD-binding-protein 
MNKLEASRNIRAVIEKSLPVSTSPLAIFTYQCWVKTSAGGRVFHYKTDREDTYFAVDINSLGHIECVINTTLLRVEAISTLGGLNDGQWHLLSICHEYHEINCYVDGEITHTQLNNLSLVADNEFLIENVSLPNHRDTHFDGEIFGITLLNRLLTRTEIVDYYRNPAQQKSITENNLYIYQHQDVYNFRDENVIPLQRQKVLLVIFNDTEYQFIKTTADSNAYNQQLPKIIPPHERRAYIIECDYNTWPNFNYVVNYAASNQADITINIEVFKSLTAYRSNIKVTVANELERDCLIMQSTEEELSAEVRISENLVITQAKNFVNFINEVRAHIPADNIITAGHYYSEQQFSVSTGKQMIAYQQACQLFNRRLQKKPLAIIKCTSTEEVKIAYKAAIDYNLPISVRSGGNDHEGESTETNTIVLDLLKMDSLTLDPITGIAAIGPGNRFINLTTALAKKGVMIPHGTSGNVALAGFIMGGGSGPWTRKYGMCCESLLQAEIVLGIGETQVVSVANKPELLWALKGGGGLSYGIVTRFFVQTFPLPPCLLKFELEWNCYDKQTQELIEHTPTKDILQRWEAIINADSTGCLIGTNLKINAKHFPAEQHKTPDIDTESIKHNCLMYGFWEGNSASLNHFIQTQFNEFDLVPNDIRIEGMGGLTKAYGENLMANWERESFHHLQADLQGINRSPTPPDLDEPAPHKVTSRLVNQTGLRDGYKPLLESLTSRYVLEGNRQLGLFSYVTLGAIAGDYYRTMGEEQKSRSAFPYKDRQYTIQYQTWWNNALQEKQQLQDSQVFTRINRALDWIDASRNYDIPNTSGAFISFKDKAIPTDVYFDHNYAALKRIKAAYSQDSFNHFRSRKSII >NC_017571.1|WP_006081039.1|3329829_3330708_+|EamA-family-transporter MPSSALAIIFALLAAILMGTIGVLARFAALPAEHITFYRLLLGALFLLAYMLLTGKGHQIRHRLSKRNLVNGAMLAGFMAFYIEAIQYTQMANVIMIIYLAPVVSAIFAHYFFAEKLTRLSMASVVVALMGFMLMIPTGPQQASNHTELLGYFYALLALLTYCGFMLINRKPSLSSPYQSTLVQLSVGALCLLPLVINAPLTPSLPQIAWLIAIGFFPGFLAILLAVKALRQLPAVTFGTLAYVEPVIVVTLAWWIFGETLSPMQLSGVGLIILAGISQGFISQRKVRVAIE >NC_017571.1|WP_006081038.1|3330883_3331408_+|lipocalin-family-protein MRKLLLVISIFLLSGCLGMPKLVQPVNDFELNKYLGKWYEIARLDHSFERGLSQVSAEYSLKDDGGVMVINRGFSAAKNEWKEAEGKAYFVNGDSEGYLKVSFFGPFYGSYVVFELDHENYQYAFISGPDTDYLWLLAKTPTVPPEVLQKFVEMSKARGFDTDSLIYVQQELAP >NC_017571.1|WP_080561616.1|3331537_3333130_-|ABC-F-family-ATPase 
MITTANITMQFGAKPLFENISVKFGGGNRYGLIGANGCGKSTFMKILCGDLDPSSGNVSLDVNERLGKLSQNQFGYEEFNLIDTVIMGHRELWKVKQERDRIYSLPEMSEEDGIAVANLEMDFAEMDGYTAESRAGELLLGVGIGIESHFGLMSEIAPGLKLRVLLAQALFSDPDVLLLDEPTNNLDIDTIRWLQDMLNQRSSTMVIISHDRYFLNSVCTHMADLDYGELRVYPGNYDEYMQAATQARERLLSDNAKKKAQISELQTFVARFSANASKAKQATSRARQIDKIKLDEVKASSRVNPFIRFEQEKKLFRNALVIENLTKGYDKPLFKDLDLIVEVGERIAVLGENGIGKTTLLRTLIHDIPQDAGTIQWSENANIGYYAQDHESDFANEMTLFEWMSQWRKPEDDDQSVRGILGRMLFGSDDIKKSVKVLSGGEKGRMLFGKLIMQKANILVLDEPTNHMDMESIESLNNALEMYEGTLIFVSHDRAFVSSLANRILEVTHTGVNDFRGTYDEFLASKGIEV >NC_017571.1|WP_006081036.1|3333442_3335677_-|copper-translocating-P-type-ATPase MSMTRLYVANMNCAGCVAKIEKAFDAQAGVEARVNLADKQVTVEGKMSADTALTVMENAGYPAQVVVDAKVAAEEKRLEDATEYRLRMRQAIAALVVGVPMMLWGLLGGEMMINSPAMQLGWGIMGLVTLALLVTTGGHFYQGMWRALKAKTTNMDTLIVLGTSTAWAYSMLVVIMPSAFPMDTRHVYFEASVMILGLINLGHGLELKARGKTSEAVQRLLGLQSTTAIRITDKGDEKVEISQLKLGDKLRLRPGDRVALDGEVETGQSLLDEAMLTGEPIPVLKNRGDSVSAGTVNGNGSLVYRVTAGQEDTKLAKIIALVQQAQTSKLPIGRLADRISAVFVPTVVAIALLAAAIWYFVGPAPALSHALVVLTSVLIIACPCALGLATPMSIMVAVGRAAQMGVLVKNGEALQTASKVDCVVLDKTGTVTLGKPQVTDFTLVQTLSDADKEALLGEIASLEQHSEHPLAGAIVSYAKQSLSQLPDTQAFTNHQGKGIEGKVDGVSLAIGNLALMTVLDIVNYDGSALDPKATLSFANQGKTPIYVAKAGKLVATIALADPIKPDAKDAIAAMLQRGIRVVLLTGDNPQTAQAVADQVGITEVIAGVLPEQKQQHVLDLQKQGHVVAMVGDGINDAPALMSADVGIAMGSGTEVAIESADITLLSQQLIVIANLLALSRATITNIKQNLFGAFVYNSLGIPVAAGVLYPLTGMLLSPVIAGAAMALSSLTVVTNANRLRRQKL >NC_017571.1|WP_006081035.1|3335826_3336318_+|MerR-family-DNA-binding-protein MKIGEVAKQTGLSVKSIRYYHDIGLVCGERNEAGYRVYRHQDIEALKFVHQCRDLGFSLEDCKSLLGLKGNESRNAEDVKQLTRTHLAYVEDQINKLQQLRAQLQHMVAQCEGGDQPHCTIIDSLSHEDHETDEDKSQKDQSLHDAGSECCRKASRNSNGQES >NC_017571.1|WP_006081034.1|3336332_3338381_-|S9-family-peptidase 
MKLHGYLWVLGSLFCLGACERSNKETPHLSDNGERIVAPYGSWLSPLSANDVFEQADNVAELQSVDGAIYFSESDGDKQGQVGIKRLDKKGNIADVVPPSFNVKSAVHEYGGAAFLGIGQSLFATKAQDQLFYRFAPNQPPLPLTPNGTRHADCIAYPKGSRIICVREDHRQPGEPKASLVTINLNFAGEGDTFVSGHDFISSPAISPDNSQLAWITWEHPYMPWDNSILWLGDLDRKGQLHNIRKVNTPTDSSVTQPLFSPDGQLYVVSDISNWWNIYRVTAQNTLAPVLNKKAEFAVADWYLGNHNYAFESEKTIIASYMLGSQAALIRLHLDSGLTESLAVDFGEITQVIRGEDAVYFVGAKATPEKGIYKVIGRGTELVYTPNLPKLDPNYISRAKNITFKTGVNQQAYGYFYGPVNPNYTAPHDTRPPLIVMLHGGPTARASLAYRSDIQFWTSRGFAVLDVNFRGSSGFGRAYRQSIYGKWGQADVEDAVNAARYLVDKGWVDGQKLAIRGVSAGGLTVMSSLAFYDVFQAGVSYAGISDFEQLAKSTHKFETGYLDQLIGPYANNQARYRELSPLYHLEGLNEPLLIFQGLRDQIVPPQQSRQIYEALKVRGVPTAYIAYQDEPHGGWKPEHRAAGLETELAFYGQVFNFTPAGKLATLVLDNAAALKPTLVRNQ >NC_017571.1|WP_006081033.1|3338444_3338687_-|hypothetical-protein MSWQDKALWLEKITKRMMLIVGVLGLIVIYCGFFFLLFSGRSVAVIPWFFLISPWVCIYFGLTQVQQVQVVNWFLKKFKK |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_017571_4 | 4371090-4371195 | Orphan |
NA
Consensus repeat of NC_017571_4
|
1 spacers
spacers of NC_017571_4
>4.1|4371116|54|NC_017571|CRISPRCasFinder ACAACTGAGCTAATGTCGCCTTTTGTATCGCTGCTACATTTGCCTTCAACATTA |
DEDDh |
CRISPR arrays and Neighbor proteins around NC_017571_4
The CRISPR arrays of NC_017571_4 >merge|NC_017571|4|4371090-4371195|CRISPRCasFinder ACAAAGGTATCGTCTATCAACCTTTAACAACTGAGCTAATGTCGCCTTTTGTATCGCTGCTACATTTGCCTTCAACATTAACAAAGGTATCGCCTATCAACCTTTA >NC_017571|4|3|4371090-4371195|CRISPRCasFinder ACAAAGGTATCGTCTATCAACCTTTA ACAACTGAGCTAATGTCGCCTTTTGTATCGCTGCTACATTTGCCTTCAACATTA ACAAAGGTATCGCCTATCAACCTTTA
>NC_017571.1|WP_006080046.1|4369572_4370028_-|hypothetical-protein MRIQQLRDLLEYVANCRLDMAQLYGRLNNQADSARVKMMLDYFESHQKTMAEKLRDYIEEAPHRILDTWYKDFTFEDFTKRCQDTMLPANMNEDDVLNLHLDLENRLIGLLEKTVNSTTAEDARNALANLIRVEKTQQQRLVHSTIRMDDI >NC_017571.1|WP_006080048.1|4367817_4369302_-|bifunctional-ADP-dependent-NAD(P)H-hydrate-dehydratase/NAD(P)H-hydrate-epimerase MAQDLSQFPKALFTQAQVRQAELSAVSQGASSLYELVERAGAAAFECLTKHNLNASSVFFLAGSGNNGADALVCARLARASGMAVSVMMTSVAGTPECQQALAHYLKDGGELLPKAVAPILAAKIIVDGLLGTGVRDAARDDMAEYIRAINDNAAWVLSLDLPSGVIADTGAVAGVAVMADVTLCFGGWKQGLLTGKARHYSGELEFAALGLTPFFAEASAQRVGKETLKDYFAARARDSHKGQSGKVTVIGGDMGMAGAVRLASEACLRAGAGLVTVISRPEHQLTVNVSRPELMFWGCDLVDMEVYLRLGWAQVIVLGPGLGKHDWGYNLFKAVGLSDKPCVLDADALNLLSNEPRQQTNWVLTPHPGEAARLLGCSVAEIEQDRFAAVRAIQQKYGGVVLLKGAGTVIFDGKQMVVAPVGNPGLASGGCGDVLSGIIGALMAQGMDNMQATVLGVVVHGCAADLAAIQGERGMLASDLMPFIRQLVNSDLL >NC_017571.1|WP_006080050.1|4367256_4367715_-|tRNA-(adenosine(37)-N6)-threonylcarbamoyltransferase-complex-ATPase-subunit-type-1-TsaE MTELTFFLDNEDDTIAVGQKLARHVQAPLTLYLTGDLGAGKTTLSRGLIQGLGHKGAVKSPTYTLVEPYELEGVEVYHFDLYRLNDPEELEFMGIRDYFTDKSLCIVEWPDKGEGLLPDADVHMHLSYQNSGREIRIEALSESGEKLLKAIK >NC_017571.1|WP_006080051.1|4365826_4367248_-|N-acetylmuramoyl-L-alanine-amidase MIKNNPYFQIIILSLCSFLSVAAHAANQLESVRIWAAPESTRIVFDLSEAPDYTYFSLDGPNRLVVDLKKSATKVALKNLENNSKLVKGVRVSKSPTKGDLRLVIDLVKPLNASLFSLPVTAPYGNRLVVDLEDKTLTTATAVVSSTPVKTVTQAAQSSRDIVIAIDAGHGGDDPGSIGPSGVYEKKVALEIARRVSSKINDTPGMRAVMIRTGDYFVNLNKRSELARNSKADLLISIHADAFTSPNPRGASVWVLSMRRANSEIGRWLEQKEKHSELLGGAGEIIQNTDNEQYLAMTLLDMSMNSSMAIGHSVAGDILKDLGGVTDLHKSRPESASLAVLKSPDIPSILVETGFISNPKEERLLSSSRHQENIANAIYKGVSRYYHNNPPADTLLAQRGGASSSSAKSTKERTPTAYVGSSTASNVKHKVSRGESLSAIAQRYRVPMSSIKQANGMKSDVVQLGQTLVIPQS >NC_017571.1|WP_193377954.1|4363899_4365816_-|DNA-mismatch-repair-endonuclease-MutL 
MMGIQILPPQLANQIAAGEVVERPASVVKELVENSLDAGASRVDIEIDKGGSKLIKIRDNGSGIPKDELALALSRHATSKLHTLDDLEAILSFGFRGEALASISSVSRLTLTSRTAEQTEAWQAHAEGADMAVKIMPAAHPVGSTIEVVDLFFNTPARRRFLKSDKTEFTHIDEWLKRIALVRGDIHFTLTHNGKTVRNYRPAMNEAQYLQRLTQVSGRPFAEQALKIECQHDDLRLSGYLQSPWSPVISDTHYFYVNGRLIRDRLVNHAVRQAFAQKAELEQPGYVLMLDIDPHQVDVNVHPAKHEVRFHQSRYVHDYILQALQSALEEAGELDFVHSSSLDEVEDVFVDAPISATEISAPFVLGADSAQVNVPADTLESAQPLVASAVQVKSAGAGREGASFGTQTNTFGSMATPRDNSRGNYSAGESRQRTELPSKAAIASYGALLQTPSYSVKDQDYQPSLPMPAILDSQYWVMATADKLSLLPIKSVALATRCQEIEAKLATGLIGQPLLMPVSVAADADWQAVLDEHDTLIRQLGLELTIRYQQLIIKKVPPYIRESQLAKVIPEWLQSLRFETPAPSALAFWLAKHSLTGFVSAPEIWAAFSQLAEEKKQLIANKAILLPWQSWLEEQASE >NC_017571.1|WP_006080053.1|4362980_4363907_-|tRNA-(adenosine(37)-N6)-dimethylallyltransferase-MiaA MNKELQPKVIFLMGPTASGKTALALELAEKHNCEIISVDSALIYRGMDIGSAKPSADELARGPHRLIDIRDPSESYSAADFRADAIAEIEQIVSMGKTPVLVGGTMMYFKALLEGLSPLPSADEAIRAEIQAEADEKGWEALHDQLRVIDPVSAERIHPNDPQRLSRALEVYRISGKSMTELTQTKSAPLPYDVVQFAIAPRERKVLHDLIAQRFAIMLKQGFLEEVTELKARGDLHLDLPSMRCVGYRQCWQYLDGEFDYDTMVEKAVAATRQLAKRQLTWLRSWPELNWLESGAEGNLVTLMRQCR >NC_017571.1|WP_006080054.1|4362615_4362888_-|RNA-chaperone-Hfq MAKGQSLQDPFLNALRRERVPVSIYLVNGIKLQGQVESFDQFVILLKNTVSQMVYKHAISTVVPARPFNVTGHQNAQGGYGAQDDTPSGE >NC_017571.1|WP_006080055.1|4361284_4362592_-|GTPase-HflX MFDRYEAGETAVLVHIDFSDEERREDLVELQLLVESAGARSVGVITGSRRSPDRKFFVGSGKAEELAALVAATEANVVIFNHALSPAQERNLEQVCHCRVLDRTTLILDIFAQRARTHEGKLQVELAQLRHMSTRLIRGWTHLERQKGGIGLRGPGETQLETDRRLLRGRIKNINKRLERVDKQREQSRRARKRSDLSTVSLVGYTNAGKSTLFNALTSSDVYAADQLFATLDPTLRKLDLPDGAVILADTVGFIRHLPHDLVAAFKATLQETRQAELLLHIVDCADENMADNFDQVQSVLKDIEADEVMQLVVCNKIDLLEDVTPRIEYNDIGKPVRVWVSAQKRLGFDLLLKAITELIGEVIKELTLRIPATAGHYLGQFYRLDAIQQKEYDDLGNCILSVRLSDADWRRLAKQSQGELETFIYEDSIDEAVC >NC_017571.1|WP_006080057.1|4360108_4361248_-|FtsH-protease-activity-modulator-HflK 
MAWNEPGNKGNDPWGNKGGNDKGPPDLDEVFRNLSKRFGGKGNGSGQSFSSFSLIIILAVAVVVWGLSGFYTIKEAERGVALRFGKHAGEIGPGLHWKATFIDQIYPVDIQSVRSIPASGSMLTSDENVVKVELDVQYRILDAYSYLFSAVDANASLREATDSALRYVIGHNKMDDILTTGRDAIRRDTWKELERILEPYKLGLAVVDVNFLPARPPEEVKDAFDDAISAQEDEQRFIREAEAYAREIEPKARGEVERMAQQANAYKEREVLEARGKVARFELLLPEYQAAPDVTRKRLYLDTMQQVMTDTNKVLIDAKNNGNLMYLPLDKLIQQKPATTELEAKPQQNVNSSSVSTTRANEPLSGRPSRDDRTRQGRE >NC_017571.1|WP_006080058.1|4359211_4360105_-|protease-modulator-HflC MGRLSVILIAVLLGIGLSSLMVVNEGERAIVARFGEILKDNVDGNRVTRVYGPGLHIKVPVIDKVKLLDARIQTLDGAADRFVTSEKKDLMVDSYVKWRIADFEKYYLSTNGGIKSNAESLLQRKINNDLRTEFGRRTIREIVSGKRDELQNDALENASESAKDLGIEVVDVRVKQINLPANVSNSIYQRMRAERQAVAKEHRAQGKEQSEIIRATIDANVTVKIAEAERKALTIRGEGDALAAKIYSDAYSKDAEFFGFVRSLEAYRASFSGKSDIMVLEPDSEFFKYMKSTAPKK >NC_017571.1|WP_006080044.1|4371324_4371870_-|oligoribonuclease MTADVNNLIWIDLEMTGLEPDVDRIIEIATLVTDQELNIIGQGPVIAIHQSDDVLAAMDDWNQKHHGESGLIDRVRASQETEAQAVAKTIAFLEQYVPKGASPMCGNSIGQDRRFLNRYMRELEDYFHYRNVDVSTIKELVKRWSPETMAGFKKQNTHQALQDIQESIAELQYYRSKVFKI >NC_017571.1|WP_006080042.1|4371982_4373047_+|small-ribosomal-subunit-biogenesis-GTPase-RsgA MSKKKPLSQGQLRRMRANHEKRLNRDSGEKNTPELQDSSLGQEQPGTVISRFGQHADIETESGQVVRCNIRRAVTSLVTGDKVIVRLAIESQANSGIAGIVEAVHPRRSSLTRPDLYDGVKIIASNIDQILIVSSVLPSFTTQIIDRYLVAAEDTDIPPIIILNKIDLLNPDEAQAIDEALQRYKDIGYPVYKVSSKLGDGIDTIKDILKDKVSVFAGQSGVGKSSIINALLPDAELVIGDVSDNSGLGQHTTTTAKLLHLPSGGDLIDSPGVREFALWHLPAQRVGWCFIEFREYLGACKFRDCKHGDDPGCALQEALSQGKISADRFHNYHKIIASLDEQRHARHFRAATDE >NC_017571.1|WP_006080041.1|4373109_4373988_+|phosphatidylserine-decarboxylase MDKVKIALQYMLPKHLLSRLVGKLAAAEAGALTTAAIKWFIKQYKIDMSEAAQSEPEAYKSFNAFFTRALKPGIRPLDMDADIMVHPVDGAVSQLGPIKNGRIFQAKGHHYSSLTLLGDQAEDAKRFEGGDFATIYLAPKDYHRIHMPIKGTLSKMTYVPGELFSVNPLTAKNVPGLFARNERVVAIFETELGPLAMVLVGATIVASIETVWAGTVTPPTGKQVFTWEYPTQGPDAITLDKGEEMGRFKLGSTVVMLFAKDAIATFAEGVEAEAVTRMGQAFANLKDVKQAD >NC_017571.1|WP_006080040.1|4373994_4374876_+|EamA-family-transporter 
MSSAKPSGLIELHLAVLLFGGTALFSKLIPLSALDITFLRCIVAATVLGLLVKLSRRRLTLASKQDYLVAIGLGVIVSLHWVTYFAAMQLSSVAIGMIAFFTYPVMTVIAEPLLTGSKIKLLDMISGVLVLIGVILLIPEANLGNDTTLGIAIGIVSAILFTARNLLHKRYFSQYSGPHAMFYQTLVAVVFLMPWHQTELNSISLEVWGLIILLGVVFTAAPHALFTSALRQLSAKTVGLVSCLQPFYGAMLALIILGEELNLNTVIGGTIIVATAIFETHQSHQSQRRKKNA >NC_017571.1|WP_037392785.1|4374944_4378148_+|mechanosensitive-ion-channel MPRILLFVIAFFSFSVLANSPLQLDKRLGMNKQDQQQNLSIDEQINDLKINIEKLAASATSFQQAQIDFEQHKKRTNQALKEAYKPLSLEKGTDLSQQASMAYIRLSELKESESTLGRQVNDLLQRHNDLPTVIASARQSVAQNKKAELAPLDTPTGELQQTQRLFFEQSLATNEAELASSQKRIELTQLQLQLVRQQLVQQEALIETINKAINKQRQQQTDATLAKNLVNTGKVVDPITRNIADTNQIYGQKLQTLTLQINNVVEQQEQSETQYQSQAKQLANIQEQITWVKMNSAFGERFLQMLQSLPKPPNHEKLQTLIADARLDRYHLEQQQALNDQELEQPGLYNEQQIKLLHSQQSLISQLMQSYDQYLSELGKLKVNYQQLSQQHLTLKNTLNEHLFWVPNAASISKLWLTDLERSAVWLVQQAQWDELGKAWEEQNDYWSWWIILLVLCLVVQDLITPKFNRLLNHYTTYVGNVTQDKFIYTLKALACSVSYALIKPFPVIMAGLIFYQSDRNFVQAVGMGILAIGLVYLLYRFIFILTLDKGVLVGHFKRPSKLIQAGQSRLRHFVLVASPLLGLMGFTEVLDTSLVRNSIGRGAFIVFCIVLFLLYKDILILSRKNSDAKKDGKNKKLIQKILWTLLISVPLLSAGLAFRGYYFTAFQMLLQLQLSIVLGLGFLLLYQLIKRWMLIERRRIAFDRAKAKRAERLAQREKGETHHLSADGLDTYEEPVVDLETISSQSLGLVRSLLLLAFLASLIGLWTQTHTALFSFLDGITLWTTNTSVNGLEQQLPITMKSLMFGLIVVGFSLMIATNLPGLLELMILQRLDLSPGTGFAITTVSRYLVVFLGLLIGFSNLGMEWSKLQWLIAALSLGLGFGLQEIFANFISGLIILFEKPVRIGDTVTIRDLTGTVSKIQIRATTIIDWDRKEIIIPNKAFITEQLINWSLSDPITRVIVYVSVARDSDPAKVEATLYQAVQECEDALASPEPEVWFAGFGKHTQDYEVRAYAKDMSARWPLRHDLHKRVTKKLKENNLELAYPQMEIHIKNGQSREQQGIIRS >NC_017571.1|WP_006080038.1|4378162_4378885_+|glycerophosphoryl-diester-phosphodiesterase MLIFAHRGASGYQAENTLAAMAKALELGAQAVELDVHNVEGELVVFHDRRLDNKSSGSGVIHLVSKDYLASINVKGEPIPTLWQVLELIAGRCIVNIELKGINTVKPLIDLYPKALSQLGFIQEQLLISSFNHPYLRELKQALPEALVAPLLASIPLDNAAVVGELNAYSLHLDLSFISQDLVDDAHNRGAKVYVYTVDHSDDIHALKQMGVDGVFSNYPDRAMQALAQVATADYSGYFE >NC_017571.1|WP_006080036.1|4378905_4379238_-|DUF3392-domain-containing-protein MDQLVALIAQLGNMLRPWAFDIATAMVVCLILVFSADVNRILRRHLLGHSFVLRTFVFIIVNAFGYGLLIVKATPWVARHIVAMPSHWMFLLIVAMFIFIGYWAQRNSQA 
>NC_017571.1|WP_006080035.1|4379332_4380268_+|D-2-hydroxyacid-dehydrogenase MGHKLLLLTKANQQYRDLIEAQELPDLEILDDNPAGIIDADIWLAEPKLAAPLLPHAKNLQWLQSSFAGIDALMSPRGRKDYQLTNIKGIFGPLMSEFVFGYLLAHIRGHGFYRQQQQQKQWQVQPPIRHSSLQGMRLLILGTGSIAQHLAKTAKHFGMHVTGINRSGNTVEGFDAIEAMANLAQTLTQADVVVNLLPSTPATQSLLNADTLGKLKDDAVLFNVGRGDALDLDALNIQLIAKPAQQAVLDVFAQEPLPNSHPIWERGNAIITPHISAPSHPAQIVEIFSQNYRRYISGETLQNRIDFSKGY >NC_017571.1|WP_006080033.1|4380372_4381947_-|methyl-accepting-chemotaxis-protein MRTNLPVTQREYDYPADWILLSTTDTQSHITYANQSFCTVAGFELDAMLGKPHNMVRHPDMPPHAFEDMWRCIKKGEPWKGIVKNRCANGDHYWVDAYVSPISVNGKVVEFQSVRTKPTREQVKRAETAYAELNKKGSVSALKRTMSMPVKLLLLALLAMLPMLYLALQIGLMGFIALAISILVLLVGGNLVLSRYYAIVAKAKHIYDNPLMAHIYSGSSDDLGAIDLALQMQTSELKSILGRASDSCYNVSEQAKISAAKGKQIQSTSQAQLSEIEQVATAMQEMTATLGDMSSNCADAASATQMASAETINGDKTVSSTISSIQAMAEQLQQTSKVITELEAHSKDIGTVLDVIQSIAGQTNLLALNAAIEAARAGEQGRGFAVVADEVRALAQRTHDATKEIQSMINLLQQGTNKAVVSMQQGVVAASQCISTADLAGTALRTIREAISTITDMTHHIASAVEEQSSVANEMNRSVVNVSQFTHSSHQLGSEMVDLNDEVTREMDSHSVLVDQFLKRSFRV >NC_017571.1|WP_006080031.1|4382208_4382400_+|bacterioferritin-associated-ferredoxin MYVCLCHAITDTQIKNAVSQGDSSLADVKKRLGVADQCGKCARMAVQIIQNQLEREPNYYEVA |
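The NC_017571_4 record above lists the array as repeat, spacer, repeat, with a location of 4371090-4371195 and a spacer header of `>4.1|4371116|54|...`. A minimal sketch of a consistency check on those numbers follows. It assumes the coordinates are 1-based and inclusive, and that the second and third header fields are the spacer start and length; neither is stated in the record, and the function name `check_array` is hypothetical.

```python
# Segments copied from the NC_017571_4 record, in order: repeat, spacer, repeat.
SEGMENTS = [
    "ACAAAGGTATCGTCTATCAACCTTTA",
    "ACAACTGAGCTAATGTCGCCTTTTGTATCGCTGCTACATTTGCCTTCAACATTA",
    "ACAAAGGTATCGCCTATCAACCTTTA",
]


def check_array(start, end, segments, spacer_starts):
    """Check array length and spacer start positions against the segments.

    Assumes 1-based inclusive coordinates (an assumption, not stated in
    the record) and that segments alternate repeat, spacer, repeat, ...
    Returns (length_matches, spacer_starts_match).
    """
    array = "".join(segments)
    length_matches = len(array) == end - start + 1

    computed = []
    position = start
    for index, segment in enumerate(segments):
        if index % 2 == 1:  # odd indices hold spacers
            computed.append(position)
        position += len(segment)

    return length_matches, computed == spacer_starts


if __name__ == "__main__":
    print(check_array(4371090, 4371195, SEGMENTS, [4371116]))
```

Under these assumptions, both checks pass for this record: the two 26 bp repeats and the 54 bp spacer add up to the 106 bp span, and the spacer begins 26 bp after the array start.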
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
132542 : 168306
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_017571|132542:168306|DBSCAN-SWA CTTAGCCTTGGCAATCTAGCGGCAGATATTCGTCGGGAATGTCACTACTGCTACACACCCAAGTGATATTGCTTTGCTCATCAACTTGTGGCGAAAGATAGACTTCGTAGCCTTGATAATCACCGAGCGCTTCGCCTCCAACAAATCCGACTATGCCATCGGAATAGAGATTGAAGGTCGACAACTGCTCGTCGGTAAACCCGAGCACTTCTAACTCAGTTGGCCAAGCTTTATTACCTTGGTAGTAATGTGTTAGTACTTCCGTTATCGGAGCTAATGACTGATGCAACTGGTCAATATGTTCAATGGCGAGTGCGGCATCGGGGACTGTGGCAGGCCCTGTGTTCATTTGCGGCGCCATGATAACTGGATCGGTATAGCTGCTGGCTGTACTCATCTTTTCTTGATATTGCTTGTATGCTGGCAACGCAACAGCGGCGGCAACTCCAATAATCGCAGCAAAAATAATGAGTCCACCGCCTGCCGTGCCCATGCGTGGCATAAAGAGGCTAAATACAAATGCCCACGCACTGCGTGATGGCGGCGTGTATTCAGTTCCCTGCGCTTTGGCGTCGACTCTTGCCATACGCTTAGTCAGCCAAGGATAGCTGCCGTTAAGCTCGTGCAGTGACATCCAAAAACCACTCGATTCGCGAGCTTGGCGGATATATTTATCCACATTCATACGTTTCCATTGCTCTACGCCAGCGGCTAAGACGGCGACGGCATTGGTGGCAGAACGTAAGCTGTTGCAGCAACGTAACCCGTGGAGGTCGCAGGTATATTCACAGGCGCGAGAATAAGCCGCACCCACTAAAGGCAACCAAGTGGCGAAGACTAAAAAGGGCGCTTTGCCAATATGATTGCGGCGAATATGGCCTAATTCGTGGCCGATATAGAAGTTAAGTGCATCTTTATCGGACTCAAGCGCATCCACAATCGATGAAAACAACACGATATAGTTACGGCCGAGGAAGCGGGTTGCCAGCGCGTTGAGCATGCCATCGGCGGCCAACAAGTAAGCCCGTGGTGGCTCTTTCATTTCTAAACGATCGCAGCATGCTAAATATTGTTTGTGTAATTCAGGGAATTGCTCTGCATTGATTTCGACAGCCGTACCTTTTAGGTAGCTGATAAAAGCGGAGTGGCTGAAAAGATAAATAATGAAAAACATCAACACATAGAGCAGGGCCATGCCCAAAGTGCCTATGATCAATGCTGCCCAAACGATCCCTGAGATAATCGCTAACAGAGTGAACAGCGATTTTTCCTTCGAATAGACAATTTCTGTCATTTGGATGGGATTATCTTTCCCTGCAGTTGCGGTAATAACTTGTTCCATGTTGATCCTTTTCCTAGTGAAGTGACAAGCGGCGTGAAAATAGTCAATTTCGCTCGAAAATTCAACCACTATCACGCGTAATGCCAAAATTAACCGTTTTGTTATTCGGTTAAATCAGTATTAAGCGCAACTGACGTAGGCAGTTTGTGTCAGGATCTTGTGGCTTTTTGCCAAAGGCAGTTAAGGTGTCTTTAACGCCGTGCCCGCTTTAATGGCTGCTTGAGCGAGGCTGACAATTTGCTTCTCTGCGGTATTGGAAAGGCTTTCATGAACTTTGCTTAGCTCATTCATTTGTTGATCAAGCACGTCTATCGATGCGCTCAGACTGTCGATCTCCGCCTGTATCTTAGGATCTTCATTCTCGGTCGCAACATTGAGTAAAAGTTGCTGAGTTTGGCTGTTGATCGCGTTGCCTTTTTGCTGCAGCTGTTCGCCTATCGCCTGCATTTTTTCTCCCTGTTGCTGCATAGGCATGATGGCGGTTTTGATTTGCGCCACTATGGCGGGATCTGTGATCACGTAAGGCGTTCCTGCGGTTTTCACCCACAGATAATCGGCATGGTAAGGATTGCCTTGGCGTTCAAG
TTGCTCCCAATCGTCACTGTGACCAGCGCCGATAGTCATTTCGCCCTCTTGCTGAACATAAATCCATGAGAAATCAGGCTGACTTTGGCTGAGCACTGATTGGCTGATGACTGATATGCCTCCATCGGCACTATTGCTCGTTGTTTGGCTCTTTGTTTGAGTCGCCCAAATGGCGGTTGGGACCAGAGCGAGAATAAACAGGCCGCCCATCAGTTTAGAGTGCGTATTCATAGTGCTTCCTTGTATGGCTGTGTGGATCGGCGGGTTATGCCAGAGTCGCGTTAGTCTGCCACAGCAGAATCTGTGCCATAAAATTTTCATATTACTTATCAATCTAATGGCTATTTAAGTGTCCGCCGACCGCTGCCAAGATAGCGCGAGAGTGTCCGGCAATGTGTCCGATCAGGCTGGTGGACACGAGCTGGTTATGTAGGGATAAGCCAAAGGCTTTTTCTCAATGAGTCGTTTTTACACTCGCTCGTCAATGTTTGTTCAACCCTGCAGTAGCGCTAAGTATTAAGGTCGTAATGAAAGCTTTTGGGAGCATCAAGGCGAAGCAAAAGTGATTTGGGGCTACGAAGCGCCAGAAATGACCTGTAAACTGGTGACCAAATAGTGTTAATTGAAGGTCGCCATGAAACAAGCTCTCGTGCATATCGCCTTGGTCGTAAGAGATTACGACGAGGCGATTGATTTTTATGTGAATAAATTGAAATTCGAACTGGTTGAAGACACTTATCAAGCCGAGCAAGATAAGCGCTGGGTCGTCGTTGCCCCGCCGGGTTCTAAGGGCGCGTCTATTTTGTTAGCGCGGGCTTCTAAGCCAGAGCAGTTTGATTTTATTGGTAATCAGGCGGGTGGGCGAGTGTTCTTATTCTTGAATACCGATGACTTTTGGCGTGATTATCGCCGTATGGTCGCCGATGGTGTCGAGTTTGCGCGCGAGCCGCAAGAACAAGACTACGGCACTGTCGCTGTGTTTAAAGATCTCTACGGTAACTTATGGGATTTACTGCAATTGAATCCGAATCATGTCATGGCTAAGCGTATGAGTTAAGATTCAGTATTTCGTATGAATTATCGCGCAAGTTATTTAATGGGGTCAAATGCTGAGAATACTCTTTGTGTATGACCTCTGTTAAATTCATCGACAATTCTCAGTTATTGATCATTTTTAGTGGGGATTCACTGTAAATTTCAATTTATTCTTGTGGAACGGATAAGTTTTATAATGGATTGTTCAATGCCATTATGTTTTTATTTAAAATTCAATGCCTATCGGATAAATTACTCTTTTGTTTTTATCATTACCCATTCACGTTTTACGTACTTTAGTATTTAACTCACATTTAAATTAATTAACGTCAGCTTTTGTTGACAGTTTTATTCATCACTTATTTGCATGGGCGATTATTGTCAATTAGCCTCTTTAATGTTTTTAGCTATAAATTTTTAGCTATAAAATTAGTCGTGAATATCATTACGTCTAAAGCAAGCTGGCGTTTAAAAATAATATTTAATCTGAGTAACTTACTAAAGGAATTTGATAAGTATGAATATTTTTAAAATACGTAGTGCGGCATTATGGATAGGCCGCGCGTTAAGTGTCTCCGCGATAATAGCGTTACCATCGATGGCATCTGACATGCCAACTAGTCAGTATCATATTGATTCAGATGAAATAAAAATGGTAGATATGCCCAGTGTGTTACTGCCATTTAATAATCTGATGTTTTTATATGGCGTAGATGCATCGCAGTTTGATTTAGCTGACTTTATTTACGTTAATGCCCCTGACTTAATTGATAAAGAAGAGGCGATTACGCATTGGGCTGGATATTATAGTATTAACCCCAAAGTCATACTGACTTTGATGGAAATGCAATCTCAGTTAATTTCGTCTCCGACAGAGGAAGCGCTTAACCGACCTTTGGGAGCATTATCGGATAAGCAGGGCTTTGATGAACAACTTCAAGATGTATTGGCTCAGTTATCACAAC
GTTTTTATGCCTATGAAGAGTCTCAATTAAAAGGCCTATACCCGCCAAGTACCGACGCAGTTAATGCATCAAGTTTCGCATTATTGGCTTTATTAAACGGTCGTAGGATAGAGCAACAGCAGCATGCCGTTATGTCGGGTGAGCACGCTTTGGGATTGGACCCATTTATTGAGCAGTTTAGATTATTATTTGGTAATACTGACCGTGAGTTGTTAATGAGTTCCGTTGCTCAAAATCCGCCAGTAGCTGATTCGACTCAGTCGATGCAACAAGTCGTCCCATTGGCTAATATCACCGCGAGCAGTCTGCCACCCAGTAACATGCTGCAAATGCCTTGGCGACAGGGCTATTCATGGCAATCCAATGGTGCCCATTCGCATACTGGGTCTGGCTATCCGCTATCATCAATCGATGTCTCCTACGATTGGCCGCAATGGGGCAGTCCGACGTATTCGGTTGCTTCCGCACATGGCGGAACTGTCAATGTGTTGTCTCGCTGCCAAGTACGGGTGACAAATGCTAACGGTTGGGCGACTAACTATTATCATATGGATCAAATTACAGTGCGTAATGGTCAATATGTAATCCAAAATACCGTGATGGGCATTTATGCAAATAATAAAAATGCAGCCTTATGTGAAGGCGGGAGTTCAACTGGGCCCCATTTGCATTTTTCTTTATTAAAGGATGGCCGCCATGTGTCGTTGCAGGATGTACATTTGGGTCAATATAGAGTCAACATTGGTAGTTACAACTATGACAATAACTGTAGTCGTTTCAATCTGTTTGATGTAAGTAACAATCGTACAATGTGCGCTTGGGCGCCGCTGTATAATGCGGGAAGTCTCTAATCCATGATAGAAAGCCCCTCAATCGAGGGGCTGATATTATGCGGGGCACAGTTGATCGCTAAGCTAGCTGGACCCTCAAATTATTTGGATATCCCCCCAGTTTTAAGTACAACTAACGTGAGTTAATCTCAGTGATCTTGAGGTTAATTAACTGCTTCTGCCATTGAGCTTTATCTATGCCTTCGACCCAAATCACGCTGCTTTTGCGCTGCTCGCCCTTGCGCAGATAAGTTTCAGGCATGCGGAAAATGGCTTCCCATACTTCGAGTTTTTCGGTTTTGAGCAAAGTGCCATTATCGGCGAAGAAGCTGGCTTCCACTTCGACTAAGGTCACGTGTTCATTGCTGGTACTGGCAAGATCGAAACTGAGTCCAAGTTTATCGCCTTTCCATTGACCTTCGGCAAAGCTGACTTTGATGCCATCTTTACCGGTTGAGCCTAGTAGCTCAGGTTGGGCAATTGCGGTGGCAGTCAAGGTGGTGACTGGCGCAGCCACAGTTTGAGTGGCGAGAGCAACACTTGGGACGCTGGCAATATTGCTTGAGGTGCTCACTGCCATCGCAGCTAAGGCACTGGGCGCAGCTTTGGTTTCTGTGATCACATATTCCCAGGTAAAGTCGTCTTTGAGTCTGACTTGGGCGCCATTTTCTAGGGTGATTTGAGCTACGTCGGCGGCCATAACCGATGATGTTAGCAATAACAATGAGCTAAGTGCGCTAAGGCGTAATGAAGTTTGCATTAATTTTCTGGCTCCGATATCCGTCTGAGTCAATATTCTGCCCTAACATAAACAATTTGCCGCATATTAGTCACTCTCTAAGTCGTGTTTTGATGAATAAAGTGGATTTAGCTGGAAGATACTGACTCGAATACAGATTGAGCTCACTGAGTGTGGCACTATATTGATAGCGCGGTGACAGGACACAGTCAGATAGCGGCAATGATCAATGAATTAAGCGTGAGGAGATTAATATGCAGATTAAACATGGAATTTATGGCGTCGTTATTAGCTTGGCATTGAGCGCGGCAGCATGGGCCGATGAAGCGCCTAAAGTGGGCTGCGCGGCGAAGTTAGAAGCCATCAGTGCCGAACTTGCCCAAGCGAAAGCGGCGGGCAATAAGCATAAAGTCGATGGGCTCGAAAAGGCC
TATTACGAAGTCTCAACCCATTGTGACGATGACAGTTTATATGCCGAGCGCGTCGCTAAAGTTGCGGCCTTAGAAGAAAAACTGACTGAGCGCCAAAACGAGTTGGCTAAAGCCATAGAAGAAGGTCGTTCTATGGATAAAATCAATAAGAAACGCAATAAGGTTGCCGAAGCGGAATTGGAGCTTGCTAAGGCGCGAGCCGAGCTAACGCAATAAATAGCTCAAGCCCAGTCGCTAGGAGCTTAGCCCGCGTAACCTATGCTTAAAACTGTGGATTCTGTGCTGGAGTTATAGCCCGCGCTCAGGATGAAATGTTCTGTGATGCGGTATCTTAGCCCTGCTTCTGCGCCCCAACGGGTGGTTTTATCTTCTTCCCATGTGGGTTTGCCATCGACAATTTTGGGTGAAATGTTATTCGTGTAAGACACAGTGCTGATATAGCCGCTGCCGCCCGCATAAAAGCTTAAGTCGGCAGTGAGGCTGTAGTTTAACCCTAAGCGAAAGAGGCTTTCTTGGCGATATGACAGCGCGGGTTCTGGCACTATTACATCGGTTTCACGGCTGCGGGCATAGCCGAGGTAATAGCCCCATTGCTGGTAATTATTATCGAAATGATAGGGTGCTAATGTAATGCCGTAGAGATTGTCTGCGGCGGGAACGTAATCCAGCATGAGGGCGATATAAGAAGGATTGGCGGTTTTCTCTGATTTATCTGACATCACCAATGGCTTCGAAGTGCTATATTCGGCAGCTTGAAGTGGTTGGCTTAGGCTCAATAGTCCAAAGCTAAGACAGGCTAGCATAGATGAATAACGACTCATGGGATGATTTCCTTGGGAACAGGTTGAAGATTGTTCATTTTCCTTTGTGCTCGGTCAATTCAAAATGCTAAAATCTCTTTTAAAATAAGTGTTTGTAAATATATTGTGTTTGGATATCAGTATTTTCTAGGGTGACATTAACCGAGGTCAACTCGGCGGTTACCTTTGTATGGGTTTGTTGCGCTAGCGCCTACGTTGGAGTGGAATTGTGGATCAGGAAATAAGGCTCGAAATTTCGGCATGGTTAGCGGGCTTTGGGATTGATAGCCAACCCTCGGACGGCATATCAACCAGCATCATGATCATTGCCTGCTTACTGCTCGCGGGCATCGCTTATTTCATTGTTCGACGTGTAGTCATACGTGCGGTCAACATGGTGATCCTGCGCTCCAAAGTTACCTGGGACGACGTGTTCATGCGTTACAAGGTGCTCGAAAAGCTGGCCATGCTAGTGCCTGCAATCGTGCTCAATCTATTAGTGCCTATCGCCTTAACCGAACATCAAATCTTAAGTAATTTAGTCGATCGGCTGCTGAGTATTTGGCTGGTGGTGCTGATGATCCGCGCTATTTATGCGGGTTTAGATGCGGTCGATGAAATTTCAGATGTGAATCTGGTGAGCCGGCGTTTACCCGTTAAAAGCTTTGTGCAGTTAACTAAGCTGTTTTTATTCTTTGTCGGTATTATCGTTTCTATCTCCATCTTGGCCGATCAGTCGCCTGTGTACTTCCTCAGCGGCTTAGGTGTGGCGACGGGTTTTGTGATGTTAGTGTTCCGCGACACTATCTTAGGTTTCGTGGCGGGCATTCAGTTAGCGGCTAACCGTATGGTGAGCAAGGGCGACTGGATCCAGATGGACAAATACGGCGCCGACGGTGCGGTAGAAGAAGTCTCGCTGACCACGGTCAAAGTGCGCAACTGGGACAAAACTATCACTATGATCCCAGCCTATGCCTTAGTGTCGGATGCGTTTCGCAACTGGCGGGGTATGTCTGAGTCGGGTGGTCGTAGGATCAAACGCGCCATCAATATCGATATCAACAGCATTAAGTTTTTGAGCGAGGAAGAGCGCGAGCGTCTGAGTAAGATCAATTGCTTAAAAGAATATTTCCCCGCCAAAATCAACGAAATCCGTGAGTCGAACGCCAGAGTGTCTGACCTCGACATGAAAGTAAACGG
TCGCCATTTGACTAACGTCGGCACCTTCCGCGCTTACCTGCAGGAATACCTACAGCGTCACGATAAAATCCATAAGGAAATGACCTTAATGGTGCGCCAATTAGCGCCGACCACTGAGGGATTACCGATCGAGGTTTATATCTTCACTAACGATACCCGCTGGGCATTTTATGAGGCGATACAAGCGGATATCTTCGACCATATTTTTGCGATATTGCCTGAGTTTGGTCTGCAGGCATTCCAAGCGCCAACTGGCAACGATATTCGTAGCTTAAAGTCGGTTAAGGTTGACTAGTGTTAATGACTGTCGACGATTATTGATTTAATTTCGAGAGATAATAATCGCGGCTAAGGTGAGAAAGACACCGCCGCTGGTTCTGTCAAACCAATGCAGCTTGTTGCTCGCCTTTAGGCTGGGCGCAAGCACATTGGCCATGCTGGCGTAAATCATCACAAAGCTAAAATCCACCACGGCCCAAGTGGCTGCTAGTATAATAAGTTGCGGTAATTGCGGCGCGGCAAGATCGATAAATTGTGGAAACAGCGCGGCAAAAAACAGTAAATCCTTGGGATTACTGATCCCCACCAGAAACGCCTGCTGATAAAGTTGCCTTGGCGTGCCTTTGCCTTTTAATTGGCTAACCTCAAGGGTTTGACTTTGGTTTTTGGTCAACAACAGCTTAATACCTAAAAACACTAAATAGGCCGCACCGCACCATTTTAGCAGGGTAAAACCGTATTCTGAGGCGCTGATAATGGCGCCTAATCCCGCCGCTGATGCCATCATCAATACTAATGCGGCACTCACACTGCCAAGACCAGTGGCCACGCTGCGCATCTTGCCGAAATGGATCCCGTGGGACATAGACAACATAGCGATGGGGCCTGGTGAGATCCCAATCAATACAATGGCGAAGAGATAGAGTAACCAAGTTTCGGGTTGCATGTGTGCCGCTCTAAATGGGGAAAATATTAATGCTTCGCACTCGCCACAAAATCAGGGATCACCTTAGGATGGCGGCCTGTAATGGTGCGATAAAAGCTTAAAATCTCGTGCATGTCGGTCGCGATATCACCACTGGGTTGCAGTGGCGTACGGATAACGACAGTGCGGCGGCTAAAGTCTAAGCCAACGGGAGTGATAGGCACGTGAGCCTTGCAGGCAATATGGTAGAAGCCGCACTTCCAGCGTGTTACTGGGCTGCGAGTGCCTTCGGGCGCCAACGCTAGCTTATAATCGGCTTTCGACTCAAACAGCTGTACGGCAGCATCGACTAAGTTATTGTTTTTGCGCCTATCCACTGGGCTGCCGCCAATCGCGCGGAAGAACCAGCCCCAAGGCGGGATAAAGAGTTGGTGTTTACCGAGAAAGTGAATTCGAGTGCCAAGCGCCCCACGGGCTAAGATACCCACGATGAAGTCCCAGTTACTGGTATGCGGCGCGACTATGATGATGTATTTAGCGCAGTCGGGCAGTTGGCCTTCTATCTGCCAGCCTGAAAGCTTAAGTAACCAACGGCAAAGTGGTGTAAACATAGTTCGGTGCCCAGAGTCGAAAAATTAATAATATCATCAGCTTTTTTAGATTTTAAATTGGCTTAAGCAAGATAGTCTGTGCTGAATTCGCTATAGTACAGTTTAACCTGATTGTGGGGGAACAATACTATGTTGAAAAATGGACTATTAAACAAGGTTAGCTGGCAGCGTGTTTTTGCGGCACTTTTACTCTTAGTGACTTGCCATGTGAGTGCGGTGCCGACGGATGAAATTACGCAGATGCTCAAAGGCCAAGAAGACGCCTGGAATCGTGGCGATCTCGATGCTTATATGCAAGGTTACTGGAAGAGCGAGCAACTGCGCTTTGTCTCTAACGGTAAGTTCCGTTTCGGTTGGGAGGATACGCTCGCTGCCTATAAGAAAAATTATCCCAATAAAGAAGCACTTGGTGAGCTGAAGTTCACGATCAAAGAAATCAAAATGCTCAGTAATTACGCCGCTAT
GGTCGTTGGGCGCTGGGATTTACGGCGGATAAAAGATGCTCCAACTGGCGTGTTTACCCTGTTGATTGAGAAAATTGACGATCGCTGGGTGATCACTATGGATCATAGTTCGGACTAAGCCACGGCTTTAATTGAACAATAGCGCTGACTAAAGTGCGCCGTTTCAATACAAGGCTGCACAGTTGTTCAGTTTAGATTTGCGTTAATAAACAGTCTGGCTTAGCGGGGGGCTCGGCGGGCAGGGTTAAATGGAAATCCTGCTGTCCGAGCTGTCGACCTTTGCCGTCGTATCTCACTTCGGTAAAATGCATGCCGCCCTCAGCATCTTCAAGCAGAATGCTGGTGGAGCGCGTGCCATAGTCGGGGTGACGAATATAAATCGCGGCTAAGCGCCGTTCCCACTCGATCCCCACTCCCGTGTTAGGCAATTCATTATCTTGTGGTTGGGAGTCGTCTTTCATCAACTGTAACAGCGCTTGCACTTCTAGGTTGTCGGACTGATTGATCACGGCTTCTAGCGCTTGCTGTCCCTTCGCCATTTTCGGCCAAATATCATCGAGTGCACCATTGCTAATGGCATGAAACCCATCGGCGAGTTTGACTGTGTCTTTATTGATACTGTTAAAACAATATAAGTCAGTACCTTGGCCGAAGACGAGATTGAATGGCTGATAATGGTCACTGTGCTCAATGAGCCAATTCGGGCACACGAGCGACCCAGAGTTGAGCGTCATAGTGATGAGTTCGCCACGGCTACGCATGGCTTCTGGGTTCTTTTGCGGTACGCGCAGATTCGTCACTGCGGCGACTTGTCCCTGTTTATTCACCCCAAACCAAGTGCCGCCCGCCTGCAGATCTTTACCTGCAAGTATATTTTCTTCTGGTGGCCAAAAGTGCGCTGGCGCGGTCGGTCTATGGTGAAACTCATCGCGGTTGGCACAGATGATTAAGGGATACTTAGGATGGGCATTAATGGCGACAAAGAGTATGCACATAGCAGGCTAAATTAATCCAAAGGTAGAATAACTTAGCCTGTATGCTCACAAATGAACAGTGTTAGTGGTGCGTGTGCTTATTTGCTTTTGCTTTATCGTGTGTATGCTCATCGGAATGATCATGGGCACTAGGCGTTTCAGCAGTGCTCTGCACAGATGTGTGCGAAGCGCTAGGTTCATGGCTATGTGCATGGGAATGGCCATGCGCTTGGCTGTGCGAATGAGCTGCCGGTTTGCTCAGCATTAAGCGGCGTAGAAATTTACGTGGGCCTAAGCGCAGTAAGCTGGCAGCAAATAATCCCGACAGGATCAGTAGCGCGACTAAGTTCACCACTTCCGGCAAAGTGAACAGGGAGAACCACAGTGGCAAACGGAGTGCTGCGACTAAACTCAGGGCAATAATACTCAGTAATATTGCCCTTTGCGGCCATGTCATTAACTTAAGCTGCGCTATGTTAAGCACAGGTGCGGCAATCAGCGGCATAATAATCGCGATAGGGCTCCAGCCACTATAAGCCAGTGCAAGAGCCAGTACCGCTGCGCCTAAGTTGCAGAAACGCATAGGTAAGAATACGAGTAACAGCACTATAACCTGCAGCAGCGGATTGCTTAATGGCACTGAAGGATGGCCGATTAAATTCACTAGTACTAGGCTGAGTAAAATCCACGGTGCGCTGCGATCGACTAAATGTGCAAAGCCGAAACGTAGGGAATTACTGGGTAATTTAGTATGTGGATCGGTAATAGTGACGTTAGCGTGGCTTAAATAAGCGCTGAGAATAAAGACCACAAGTAACTGGAATAACGCTAACCAAGGACCGAGCAACAGGGCGGTGATCACTAAGGCTTCTGGCCCTGCTAAGCGTTGGAACCAGCGACGGGCTAAGCTGCTATCCTGCGGTGTCAGACCTAATTTAAAGCGCAGCGCCGCCGCACCATAGCTCAGCAGTAATATGGGCGCGAGTGTCAGTAGCCAAGTCATTAATTGTTCTGTGCTGTGATCAT
GGTGGGCATGTTCGTGGCCGCCAGAATCCATCAAGAGCAGCATCACGAGCAAGCCTATGCCGAGCAAACTGCCGATGCCAGCTTGGTATTCGTATTTGCCTTGCTTGTCGGTATCGTGCTGACCGTGGGGTTGATGCAGCACCACATGCAAAATAGAACCGGTAACAAAGGCTTGTAAATAAACGGTATTTTCTAAACTGAGCTGAGTGATTAATTGCTCGCCTGCAAAGTAACCTACACCTGTAAGCAGCATCATAGCGACCAGCACTAAGCTAGCCCAGCGGGTGCCGACTTGAGGTTTGAGTAACCACCAAATGGCGAGGCCGACGGGGAGGCGATGCATAATGACCCCGAGGGCAAGTAAGATCGAACTGCCGTCTTGCTGCGCCAATACCATAGCGCCACCGTCGGTGATGGTGTGCAGTAGCAAGCCGCCAATCCCTAGGGACAAGGTGAGATTATGGGTGATTTCTGAATAACGGTGGAAAAGGCGTTCGCTAGCGGTCGGTCCCCAGATACCTAGGAATACAAAGACTACGGCTAATAAACCGCCATGCTGCAGTAGCTCTGGCAAGATGTGGATCAGCACTAGTCCGCCTAATGAGACGAATATAAAGCCATCGAGCCCCTTCTGTAATCCGCTGCCCGATGAAAAGTAGCGATAAAACAGTGGCCCAATTAAGAGTGCGATACAGCTAGCAAGAAGATATAGCATGGATTCCACGGAAAATTTGCTGAAAAAACGCCATAGTATACCAGCTGTAAGGCAAAGTGAGCAGTGACTTGCTGGCAATTTACCAAACTGACATTTTGCGCTGGGTTTTTCGGGGGCGAATTTTATTCACTTTGGCCATTTTGCGGAGAAATTTTCAGATCTAAGCCGAGAAAAGGTTATACAATATGGGCTTATGTTGGCTTGGGTGTTGATACGTTAAATGGATATATTTTCTGCTGCGGTAATGTTGTTTTTGATCATGGACCCCCTGGGGAATTTGCCGATTTTTGCGTCGATTCTACGGCATATCGATCCCAAGAAACGTCGTAAGGTATTGATCCGCGAGTTACTCTTCGCCTTAGTGATCATGCTGTCTTTCCTTTATGCGGGTGAGGCCATCTTAAGTTTCTTAAATCTTCGCTCTGAATCTGTGAGTATCGCGGGCGGGATTATTCTGTTTCTTATCGTTATGAATAACAGATTTATATAACAATTCTTCCTTGTTCTCTCCTTTCTTAGCTTCAAAATCCTCCTTTGTAAATAATTTGAACATAAAACTGTTGGAGGATTTGCACTTGTAGTATTGTGTTTAGGATAACAATATAATATATAACATTTGGGGTAACGCGTTATGTTTCCGATGGCCATGGCGATAGTACAACCTCTAATTACTAAAAAAGGTCAGGAACTAACTGTTCCATTGATAATGATCGGCGATGGTTCCGATATTGTTATTTTGTCTTCTTATGCTCGTTATGCAGCCGACAATGCTCACAAATCATTAAGTTGGCATGTTGAATCAGCTCGTTCTATCGAACTTCTATTTAAGTACAAATCAGCTACGCAAGATCAATTCCCATCAGTAAGGTCTATGTTTGAGAATTTTACTGAAGCACTGCTACATGGCACTCATGAGAATGGAAAAGATAAGACAGACCTCTGGTGGAATCCTTATAGCGCGAAACAGGTAAGTCGGTATATTTACTATTTAACGTCTTATAGCGAGTGGCTTTATGTTGATACTGAGCAAAAGACTGAGTTGTTAAATCCCCAAATTGCATCTTCAAAATCTGAATACTTGATGAACATGGCTGCATACCATCATCGCAAAAACAATAGCTTTTATAAGCACCTCAAAAACGACAATCAAGCCAGACAAGATGCGAAAACAAGTTACGCAGTCAAGTTGAGAGACCGTGCAAATCACAATGTAACTAGCCCAGAACATTGCTTTCCCTCAGAAATAACTAATGATGTTATTCAAGGGTTTACTCTGCCAGGCTCA
AGGTCACATGATCCGATTTACAAACGCTTAGACTTAGCAAAGGTTTTAATTTTTATGCTAACGCGATTTGGTGGTGTTCGTATTTCCGAACCCTTTCATCTATACATCAATGATATCCAGCCACATCCATCCGAAGAGAGGCAAATGATAATTAAGATTCATCATCCATCTGAAAGCGTGGCTCCTGTGAAATGGGCGGAACAGTGGGTTACCAGAGAGGTTTATCTTCGGGAGAATTTTGGTTTAACACCGAGGCATCTAAAAAAATCGACTGGTTGTTATAAAGCGGGTTGGAAAAATCCTGCACTCCATAAAGTCGCTCATGGTAATAAGAAAACAGAGCTTTTCTTTTATGTTGAATTTCTTAATGCAAGTGAACGAGAATTATTTTATCACTTGTGGTTCTTGTACTTAAAGAAGCAAAGGCCAAGAAACAATTACTCTCCTTTTGCCTTTTTAAGTAAAAAGACAGGAATGCCACTCACTATGGGAGCTTACGATGCAAAACTTAAAAAGGTGATCAATAAATTGGGGTATGAATATTCTAAAGCAGCAGGTACTACGCCCCATGGTTGTCGACATTTGTTTAAAGCGGAAGCTAAAGCGCGTGGCATCAGTACACAAGTCATTAGAGAGTTGCTTCATCATAAATCGCTGATGAGCCAGGAAGAATACGCAAAACCATCCATAGAGCAAGTTCGGCAAGTGATGAAAGAGAAAGAAAATGAAATGAGGGAAAAAATACAGGCGCAACACAATCAACTAATGGATCAGCTTAAATTAGAGGTGGATAATAATGGAAACTAAAAAGTATCCAACTCTCGCAGCAGCCAGTGAAGCGGCAATTAAGCTACTGAATAGTATGGGGGTTGCGCTAACGTCCACAAATTACACTAAACACTACAAGGCAGACCCAAGACTGCCAGCGTCCCCTCATGCTAGCTATAAAAACGATTGGGTAGGCTGGTATAGCTACTTAGGCACGACTGCGCCAGAAGATAAGTACCCAACTCTCGCAGCAGCCAGTGAAGCGGCAATTAAGCAACTGAATAGCATGGGTGTTGCGCTAACGTACACAAATTACACTAAACACCACAAGGCAGACCCAAGACTGCCAGCGAACCCCGACTACACCTATAAAAACGATTGGGTGGGCTGGTATGGCTACTTAGGCACGACTGCGCCAGAAGATAAGTACCCAACTCTCGCAGAAGCCAGTGAAGCGGCAATTAAGCTACTGAATAGCATGGGGGTTGCGCTAACGGCCAAAAAATACATTAGACACTACAAGGCAGACCCAAGACTGCCAGCGACCCCCAGCGACTTCTATAAAAACGATTGGGTGGGTTGGTATCGCTACTTAGGCACGACTGCGCCAGAAGATAAATACCCAACTCTCGCAGCAGCCAGTGAAGCGGCAATTAAGCTACTGAACAGCATAGGTGTTGCAAAAATAATAGAAAATTATAAGAAACACTACAAGGCAGACCCAAGATTGCCAGCGAGCCCCGACTACATCTATAAAAACGATTGGGTGAGTTGGTATAGCTACTTAGGCACGACTGCGCCAGAAGATAGGTACCCAACTCTCGCAGCAGCCAGTGAAGCGGCAATTAAGTTACTGAATAACATGAGTGTTGCGCTCACGTCCAAAAATTACGCTAGACACTACAAGGCAGACCCAAGACTGCCTGCGACCCCCAGCGACTTCTATAAAAACGATTGGGTGGGTTGGTATAGCTACTTAGGCACGACTGCGCCAGAAGATAGGTACCCAACTCTCGCAGAAGCCAGTGAAGCGGCAATTAATCTACTGAATAACATGAGTGTTGCGCTAACGTCCAAAAAATACGTTAGACACTACATGGCAGACCCAAGACTGCCTGCGACCCCCAGCAACTTCTATAAAAACGATTGGGTGGGTTGGTATAACTACTTAGGCACGACTGCGCCAGAAGATAAGTACCCAACTCTCGCAGCAGCCAGTGAAGCGGCAATTA
AGCTACTGAATAGCATGGGAGTTGCGCTAACGTACACAAATTACACTAAACACTACAAGGCAGACCTAAGACTGTCAGCGACCCCCTACTACACCTATAAAAACGATTGGGTGGGCTGGTATAGTTACTTAGGCTCGACTGCACCAGAAGATAAGTACCCAACTCTCGCAGAAGCCAGTGAAGCGGCAATTAAGCTACTGAATAGCATAGGTGTTGCAAAAAAAATAGAAAATTATAAGAAACACTACAAGGCAGACCCAAGATTGCCAGCGAGCCCCGACGAGATCTATAAAAACGATTGGGTGGACGGGCCTCACTTTCTCAATCTTCTTGATTATTACAATTCTCCACAAGAAGCTTATGACGCGGCGATAAATATTGTTGGTATAAGCGAAAAGCTAACCCTTCCAAGGTATGGCGGATTGCGAACTATTGACTCTAGACTGCCAGCAAAACCTCATATTTTCTATGGGTTAGGTTCATGGCAACAGTTTATTGCTTGCCGTTTTTACCCCTCTATTGAAGAATGCGCGAAAGCTGCTTTAAAGGTTATTCCAGAGGACAGTCGTACAGTAAATGGGTACGCCAAAAATTACAAAAAAGACCCAAGACTTCCATCTCGACCAGAACTTATATATTCCGCATGGGAGTCTTGGAAAGTCTTTTATGAAGGTAAAAATGCCAGACCTTACAGTGTTGAAAAGTGCATTGAAATAGCGAATGAAAATGCACTTTGGGATGTTGAAAGTTATGAGGAATTCAGAAAAAAAAACGACAGCCGCTTGGTATCTCATCAAACCTTGACATCTAAGACCTGTAAAGATTTTAGGGGGTCGTTAGGTATTGGATATTTCACACTAGAAGAGGTAAAAGACTATTGTCAGAAAAATCGTATTGCTACGATGCCAGAATATAAAGTTCATGCCCAAAAGCACCCTCGCTTAAAAGTAACTTTAACCCCAAAGAGTTTGCCAAGTTACACCAACTTCAATGATGTAAAATGGCGAATAGAGGAAGGACAGGAGCTAGAAGATTTAGGCTTAGAAAGATGGTATGAAATTTATCGAGCCTACAGGAATAATACGCCAGAATTAGGGCCTACAGCAAAGGTAGCTATCATGGCCTTTTTTACTTTTTTTGGTTATTTTTTTATTAAAAACAAAGAAATTGGAAGTATTTTTTCTGAGGCAATAACCGCGCCTGACCTCAATCAATTTCTTGAGTCTGATAAAGTTTATCCAACTAACAAAACGGCCTACCTAACACACATTTTTGCCTTTTTAAGCTTTGCATTCAATAGAACTTGCACAATCAAGGATGAAGATGATTCTGATGTTGTTGTGCTTGAGGGATATAAGCTGCCTTACGTCGAATTATTCACCAAGGTTTCAAATGACAAAAAAATCGCTTACAGTCAAACAATTAAACCAGCACTCCCATATGGCTATATTGTTAAGGCGAGAGATGCATTGGTTCCTGAAAGAGCACAACACTTTTCAGACTTACTTTTTGCTCATACTCTTAACCAGTCTGACTGGATTGAAGTTAGTGAAGAATTTTACAATCAAGAAAAGGATATCGCCGAGGGAGATAATAGGGGGGATCCTGATTGCATTGTGCGTCAAAGGAAGGTCACTGGGCGCACAAAAGGACGAAGCGGCGTTTACTCAACGATCTATGAGATGTGGTGTCCTGTTCGAGCAATGGCATTGTATTTTTTAATATCACATCCACTTCGTGGGCAGCAGATATGTTGGATGGACAGTGGTGAAACCGATAAATATCGCCTGGAATACAACTCAAATGCACCACAGGATGCTCAGTTTAAGTGGATAGAAAACACCTCTCCGCTGCGGCAAAAGTCAATCAAAAAGCGAGATACAGGAATTTTCTGGAAGACCGGACACCAAAATGAATTCGGTATGTTTACCAACACCAATAAAACAGGACAACCGTTTACGTCAGTGTACTTTCCTCTTGAGCTCGCTAAATGGTTGGTT
AAGTTGCGTGATTGGCAGGAAAAATACAATGCGATCTCAAAGCCGACTTCGTGGAGGGATCTCGCTTTTAATACAATAAGCAAGCCTTCAGAGGGCGTTTTAAGAAAGAGAGGTTCAACGTCTTTCTTATTTAGGTTGCCAAAGCAATCGCAGTTAAAAGTCGGTCGTACAGAAGGCTCACCTATGAACCCTGACGCACTAGTTCCAGCACTTGCTAAGCTTTTACAACATGTCGAAGATGAAAACCTTCCCCTCACTTGTATACAAAATGGAAACCTTAAGTCTTATTACAGCAAGCATGCAATGCGAGTCAGTTTAATTACGGCTTATGTTGTCCACGGTGATGTACCTTTGGAAGTGATGGTAAAAATTGTTGGTCATGCAACCATTTTAATGACTTTGTATTATACGAAAATAGAGCAAGGTGAAATCAAACGTATACTCAATAATGCAGAAAAAAAAGCACTGCAAAATCATGCTAAGCAAGTTGAGGATGCGATCCTTCAAGGCGAGATAAACAGTATAAAAAGTGAATTGATTATCAATGAAAATTCTGTTCTTCATACTCCTGATTACCCTTTAGCATCGCTTCAATTTACCGACTATGGTATTTGCCCAATGGCATTTAATGGTTGTGATACTGGAGGTTCGGTGGTTGATTCGAAAGGGGGTGAGGTTGAAGCACCTGTAAAGCCTGGTTGGTTAGGTGAGAAAAATTGCTTTCGATGCAAGCACTTTGTCACAGGCTCTGCTTTTATGGCGGGGCTTGCCGCAAAAGGAAATGAAATCGCTAGCGCGAAATTAAGATTGAGTCAGGCGATTCAGGACTTGGAGACTAAATGTGATCAGTTAATGCGAGACGCTTCAGATCTTGAAGGTTTGGGTGAAAAAAGTGCTGCGGCCAGAAAGCTAAGTGACAAAAAACTTATTGAACGTGATATTGAGAGCAAAACTTCATTATTAGATCTCTACATGTGCGATATTTTTGCGATTTATAAGCTGATTATACAATCTATCGAGCTTCTTTCAAATACAGAAAATACAGAAGGAACAAAGGGTGTGCCGCTCCTGTTAAATAGTTCAAGTTTATCAATTACTATTGACGAGGTTGATGAATACCAAGTGCTTAATGAGGTCTGTCAGAACCGAGAGGTATTCGTTTCAGGCATGGCTGACCATGACAACCTCAAACGAGCACAGATGTTAGATCGAATGCTAAAGAAGAATGGCATGGAGCCTGCACTTTTTGAGCTTCCTCCAGCGGTGCAGTTAAAGGTCGGGAATCAAATGACAAACCTCATGTTAGAGCGCCTAAAAGGAGATTGGAACGGCGTAAACGCCATTATCAAAGGACAAAAGCCACTTAATTTTATTGAATCAAAAGACAACAGCCGAGAGATGAGTTTAGCTGATGAATTGAAGATGACTATCGCCCCAGCTCTTCGCAATAAAATAGAATTTATGGAGGAGTAAAATGAATTTTGATCGCGCATTAAAAGAACTAAAACAAAGTGTAAAATCGCCAAGAACCTTGAAAAGTTTAGAAATAATTCAGGAGGTTTGCCGAGAAGAAGTGTTAAATAAATCCTTCGACTTTACAGTAACTAATATCGGCAAAAAGAGTGGCGAACGCGGGGGGGTGAAAACGCAACCTATCAGAAATAAATCCGGTCAACTTTACCGCGACCTAATCAAACTATACTCGGAAGAATATGAGCAGGACAAGAATGTAAAGGCTTCCTCATTAACTAATGAAGACTCATGGGTTCATGGGATTAATGATATCGCAACAAAGTATCTGATCGAGCTCACGCTTCGTGAAAACAAAGAACTAAAAGCGAAGAACAACATGCTCAGAATTAATTGGCGGCAGTACGAACCTGTAATTATTGAGGAGGTCAATCAATACGAAAAAAAATCAGAAGCGCAAGTTGAAGGTAAATTAATTGTCTCCCCCACCCGATTATCACTAACAGAAAAAGTATTGTTTGAAGATTTGG
TGAGTGAGACTAAATGGCGCTCTAAAGGGTGTTACATGGAAAATGGGAATGTCTATCACGAAGAAAGTGATAAACCAATATTCACTCAAGAACATATAGCCTTAATAAAAAAAGTGCTTGAACAGGGGTAAAGTGCCAATGCAGTGTTAAAGGTATGCTTTATTGACACTTTATTTGTTGTGACATTGTGAGATAGTGACTGGTAAGGCCTAGGAATACTAGACACTTTCTAACGTCCTAAAGCACTGTCCGGTTCATTGCCGGCTAAAGTGTGAGTAAGCGTGCGCTTCTATCGCTTGTCATCGTTTCCCTGTCAATACCGTTCACGTCTGCGCTGTCTTATCTGTCTCATTTCACGTTCTGCCCGTCAAGGGAGGCTTCCTATAACAAAAGTCACTCTTCACTTCGATAAAACGATGTTTAATCAGGGTAAAAAACACTCTACAATATTTGAATGATAGCATTAGTGTACTTTGCTCAATGGAGTCTTATGGTGACACCATATGACTAAGTTTTATGGTGACACCATAAGACTAAGTTTTCATAAGTCACATTTGTTTTTGGTAGTAATTACAATACTCAGAACCTTTTTAAAGTGGAGACAATTAATTATGGTTACGGAAGCTCCAAGTGCGAGTGAATTATTTCGAAGAGCATGTGATATTTACGATGGTGACCTCGTCCTTGCGGAACAACTGTTCAACAGTAATATTCCCGCGCTTGGGAACAGGAAACCCAATGAGTTACTTAGTTCACCTGATGGTCGTAGATTGCTTGATGACTTACTTCTTAAAATTGAGTATGGAGAATTTTCTTGAAAATTAGTGCCCGTCTTAGTACAGAACTTCCCCAATCAGTGTTTACAAGTTGCTGTAAAGCCAACTACTGTCTTACATAAAAGCAATAAGGCAAGATCTACCCCTTATTTTATTGCTGGGATAAGGTGGGGACTGCAAAAGAAATTTTCATCTTTAGTCGATTTAGGCAGTACGTAGCCGTCAACGGTGCCGAAAGTAACTGGGATGTTTCTTCATGTTTTGGGGTCACAAGTTCTTTTTGACAAGCATTCTTTCATCTTTTGTGGCTTTCTAACCTGAAATACCGATTGTCATGAAGATCTACTGTGTTTTTTCTGAATTATTGGAGGGATTTGATTGGGCTACGAAAGAAGCTTTCATCTTTCCGATAATTCGATGAAATTAATGCTTTAATGCGAAACAGACCTTCATCTTTCCGATAATTCGTTTGAAATTAATGATTTAATGTGAAAGAGACCTTCATCTTTCCGATAATTCGGTTGAAATTAATGCTTTAATGCGAAAGAGTCCTTCATCTTTCCGATAATTCGGTTGAAATTAATGCTTTAATGCAAAACAGACCTTCATCTTTCCGATAATTCGGTGAAATTGATGTTTTAATGCGAAAGAGACCTTCATCTTTAGATTATTTAGATAAAAAATTTAGTGTGTTCCGGTTCTTGTGTGCTATTCACAGGAACTGATCGAATCAGGAGTAGAGGATGCACATTAAGTTTAAAAAATAAAATAACACCACGTTGCAACTGCAAAGTTGAGAATGAGGTTCTGTGAAATAAAAGCTAGTCCAATTGTACCAATTTGATTAATCTTGTCTCAATAACTGGATTTTTTGAGATAAAATTTATGTCAGGTCAGTATCTATCACCGCCTGAGTGTCCGTTTAGCACTATCAGTGATGTTATATTTGAGATCCGTAATCATGGTGATGATCCTAATAAATACATGGATTACATCAGTTCATTTCATATCAAGTATCATTCCTTTGATGAGTTTAGACGCAGCGTTAAGCCTAGCCTAAATGTTAATTTCTGTTGGTGGTTAGTTCGGCTACATAGAGACAGTTCGCTCATCAATCTCACCCCGTTAGAAAATAAGCCGTTACCACTACACCTGCCTGAGTCCGTAAACCGAGCGCTATCGCAATCTCCTGACTTCATGACATATTGCCGATTTAATCACTTG
CATGAGTTTAGTCGGGTTTTGTCACAGATTGATCGCTACACCACGCATAGTGCGATGCTTGGGGTGCTTGAGCAAATTGGAGAAAGCGTACATTTCGAGCACTTTAAAAATAATTTGATTGATGATGAGGCTATATCAAGTAGTCAGCTAGAGGGCGCTGCAACAACGACGCTGATTGCGAAAGAAATGCTCCGAAAGAAGCGTAGGCCTTGTACTGATGGTGAGCGGATGATTGTTGGGAATGCTGACTTAATGAATGCAGCGTGGGACCTCAAGGATGAGGCATTCAGTATTGATTTGATAAAAACCTTACATAGAATGGGTACGCAGCACTTAGAGCAAAAAAACTACACCTCTGGGATATTTAGAACCACAGATGATATTGCAGTTGTGAACGCTGATGGTGAGGTCGTTCATCAGCCACCAAAGCATGATGCTATTGAGTGCCTATTATCCCGCGTCGTGAATTGGTTAAACTCGACTGACGATCACATTCACCCGGTTATACAGGCCATCACTCTTCATTTTGTGATTGGCTATGTTCATCCGTTCCGTGATGGTAATGGACGGATAGCAAGAGCTTTGTTTTATTGGTATCTGTTTAAGTGTGGCTATACGGGGTTCCGCTATATTGCGATTTCAGCACTGTTGAAAAAATCGCCCATCAAATACGGTCAGAGCTATCTGGATTGTGAGCATGACAACTTAGATTTAACCTATTTTATTCGATATCAGTTGAGTATTATCAATCGTGCCATTACTGAATTCGTTTCTGCATATGAGCGTGCTGCATATCGAATGACTGAATTGGCCGAGCAGTATTCTGATTTAGATGATATTGAAAAAAAAATTATTGGTTTTGTTGCTGGTGAGCCTCGTTATCGAAAAAAACCGACAATTACAGCCAGGGAGGCCGAAAAACTAATGGGTCTTAGTTATAACACGGTTAAGCAAAGACTTGATGACATGGTTACTTCTGGAATACTCAGACGTGAAAAAATAGGTAAAGCGACAGAATATTTTTTAGGATGAATATAATTTGCTGCGTTGTCACGTAAAGACCGACGGATGAGTTTTAACGGAACATACTACTTTACGAAAATTGTTGTTTGGCAGCCTCTCGTTGCGGTGTGCGTTGCGTTAATCTTAGGTTTCCTAGTGGTAATAGCTCTCAATCTTAACGTGCTTTAGCCTCATTCCTAGAGTCAGTAAGATTCTGATGATTTTCCTTAGAATGAGTATGGGGAGCTATTTACACACAGATAGTCTGAAATCCCAAATTCATGAAAAATTGATTTTTTGGTTTTAATTCAAATCCTACTTTTCATATATTTCTGGGTAATGAAAATTTTTCAAATAAAAAATTCAAAATCAATCCTTTATTTGAGCTTCTAATTTTTTTATCAAAGCTATTTTCTGCTCTTTAGGTAACTGATCTATCAGACAAGCAAGTTTCGCAGATGCATCATCTTCACAAAAAAAATAGTTTAACGGCACCCCTAGTTCGTCTGCAATACGCTGTAACGTGTTGATGTCAGGTGTGTGACGACCGAGTTCGTAATGATTCATACGGCTACTAGCAGAATTTTGGTGAATTCCGATTTTTGCACCCAAGTCCGCCTGAGTAATACCTGCTTTTGTTCTTGCTTCTTTAAGGCGTTTTGGGACCGTATTTTCTTTGTGCACGCATATTTCATCTGGATAACCTTGAGTAACTAAGATTGTCTAAGTTTTACTGTAATGAGTATACTTAGGAATCCTAAGTATTTGGTAGGGAAATCTACTATGTCTAGAGCCGTATCATTGCATCAACACTTTTATAACTTATTGAAAGTTAGAAAAATGAACCATTTTACGGTAACAGAGTTCCGTGATGCAGTAATTGCAGATTCGCAAACGTCATGGTGCAAAGATGAAGCTCGAAAATTTGTATACCGCCGCTTATTAGTTCTCGTCAGGAAAAGGCTGCTGTATCGAACTGGGAAAAAACATTCACCAGA
TACTCTATACTCGACAACGAAACTATTTTCACAAACTGCGTTTAAGGTGAAGAACACCATTCTGGCAGCTGGAAAAGGCGGCTTCTTAAAAGTCCAAAAAAACTCAAGTGATTCATTTTTGACTGTCTTGCAAAAAGAAAAGGTTATGTACGAAACTGATTTGAGATTTGCTACTGAGGAAATGGAGGTTTTTGAGTGTCTTATACAGCGTTTTCCAGAACAAAAGGAATTCATCGAACCGTTTTTTCACGAGGCAAAAGTCCGTTCTGCGCAGCTTTATAGCCATGTCAGTGCTCTGTCAAAGATATTAAATATTAATGCCAATGGTTTATGCCAATACTAAATGCATCTTTCTTGGTGGGCTTGATTTCGTTATATGTCTGTTTAAATTGACAATATCGTAGCGTTTTATAGATTAGAGAATATTTGGTACAACGATGCGGAATCGTTTTGTACTGAAAGGGCAAATCTAGCGTGCCCATTCACCCTCGAATATCGCGTTTCTATAAATGACAAGAGCCACATTAAGTGGCTCTATTACGCGTTACCTGCGGTCAACACGCTATGCTCAGAGGACAGGGGTATTTGAATTCATTTGTCTACGAGTGGAGCCGAAGCCGCGTTTCTCATAACAAGCGTTAGTAGTATATTCTTGGAACCGGAACCTGTCGAATTTAAATCTGGCTATTCTTGCTGAGGGTGCTGACAAAGTTATGTCCAAAAGTGTGACAGGATAGCTTAGCAATAAGAAACTAAGCACAAATCAGTCGACCAAATTTTAGTAATACCACACGGTCGGGAGTTGCTACTCTAATAGTGACGATTCAAAGAGAAGAATTGATGATGAAGAATACTAAACTCAGTATATTGCTCTCAAAACGTGACGTAACGGCACCAATACCTGATGACTTAGCTGAATGGCTTGATTCTCCTTCTGTTGGAAATGAGATTATCACGGATTGCGAAGATACGCCTCTGAGTTCTGATGATGAGTATCGTGCAGCATTAAAGAAAATTGAAGAATTGTTTGATGCAATACCTAATACCGCTGAAGGTTCTGAGCTCGAAAAATGGATTTCATTAGTTGAAACATATGAAAATGAACACTTCCCAATGTAACTCACAGTTGCCCCAAAACGTGTTTCGCACCACCAAGCTTGCCCTACTGCATCACGCTTTATTGACCAAATTTTAGTAATGCTACTGAGTCACTAGACTAATAGTGAGGATTCTCTATGGTATATTAAAATATCTATAGTTATTTTAGTTAGGAAAAGTTAATGGCGTTTAGGGCAAGAAAAATTAGAGGTATTTCACTATTTGAAGGCGAAAGCTCAGCGAATAACGCTTTTACACTTGTCATTGGAAACAACGGATGTGGTAAAACTCAGCTTTTGTTGGATATATGCAACTACTATCAAATGCTATACGGTGAGCTGATAAGGTCGAAATCTGCTGACTTACAAGTAATGCGTAGAGATTTTTATAAGAATGACTATAGATGGGATATGGTTGAGAGAGCTTTTGGGTACCCAATACCTCAAAAACTCATTTGTGCATCAACTAGCCAATTCGAGAAGTTTTCCGAAAACTGGAAACTTAAGAATGACTTTGTCCAAGGCGGGTATTATGCCTACATTGGTTCTAAGCCTTACATTCCTAATCGCCTACCTTCTACACGCATAGCCTCAACAGCTTTAAACCAACTTCTTGCTAGGGATACTGAAGTGGTCAAGTAAAATTGGCCACGGTTTTAGAGTCATTCCAATATAAACGTTCTGACTCATTTGGCGTTAACCCACCATTATATTGATGGGGTCTGGTCTCACTGTAATATCCAGTGATGTAATTTGTGATCTCTTTATCTGCTTCTGTAAAATTCCCATAACCACTGGCGGGTATCCACTCAGATTTCAAACTTCTAAAGAAACGCTCCATTGGTGCGTTATCCCAACAATTCCCGCGGCGACTCATACTTTGCTCAATCTGAAGACGCCATA
TTAATTGTCTGAATTTCAAGCTTGTGTAATGCGAACCTTGATCGCTGTGATATATCACATTCTCAGGCTTACCCCTTAACTCAAATGCCATTTTCAAAGCTTTACAGGTCAGCTCACTATCTGGTGATAATGACAAAGCCCATCCAACAGGTTTGCGGGAAAATAAGTCTAAAACAACGGCTAGATAGGCCCAGCGATTACCGACCCAAACATACGTCACGTCGCCACACCACACTTGATCTGGTTGAGTTACTGCAAACTGTCGATTAAGCGTATTGGGGATTGCAACATGCTCTTGTGTCGCTTTCCGATAAGAATGCTTAGGCTGCTGACAGCTTAATAACCCAAGCTTTTTCATTAGATTACGGGCTCGATAACGACTGAGACCAATCCCTTTGTTAGTCACTAATTGCGCTATCGTTCGCGCACCGGCAGAGCCATGACTAATACGATGTGCGGCCTTCACTTCACTGCATAAAATAGTAAAGTCTGTATCAATATGTTCTTTTCTTTTTAGCCAATACTTATAGCTACTTCGATGCACTTTGAATGCATTACAAATTGTGAGAATAGAGTGGCTCTGCTTGAGTGCGCTGATTAGCGTAAATTGTTCATCGAGTCTGACATCAAGAGAGCGGTAGCCTTTTTTAGGATAAGATTGTGCTCCTCAAGACGAGCTATTTGCTTCTTCAGCTCACGGATTTCAATTTGTTCAGGGGTAATCGGAGATGCTTTTGGTGTGCCACCCTGGCGTTCTAGTTTTAACTGTCTCACCCATTTATCCATAGTGGACTTACCTACGTTCATCGCCTTTGCAGCTTCGATGATGCTATGGTTTTTATCGAGGACTAATTGGGCTGCTTCAAGTTTGAATTCAGCACTAAATGTTTTTCTGGTTGTCGATCTCATGTGTTCACCTAAAGTCATCTACGAGGTAATAATATCACCTCTATTTAGGTGGCCAAAATTAGTGTGCCACTACAAAGTGATTAAAGTAGTATGCGGCGGCTTTTGTATTGCGTTGCTCACCCCTTAACATGGCGTTAAAAGTCTATGAGCCATCCACTATTGTCAATGCCAGAACGAACAGAAGAATCAGATGCGGTTTACGCTGTGCTGGGAAGAGCATTGGCATACGCTACAGAATTTGAAAGTAACTGTCGTACTTTGGCCCATCTTTTTGATATTGAAAAATCAGACAGTGAGTTCTCTTACGAGATCTATAAGTTGGTCAAGTCAGGTACTTTGCATGCAAAGATAAAGTTACTCATTGATGTGCATGGATTGCCTGACTGGGTAGAAGAAAAGGTACATGAGGCAAGAAAAGCTAGAAACTTTATTGCTCATGAAGCGGCGGAGGATCATAAGAGAATGATGTCAACACCACAGCTAATGAAAAGCTTCGAAACTACCATCATGCAGATGACAGGGGAAATAGCTGATGGAAATCACATCGTGCTTGATGTGACGAGAATGCTAAAGGAAGGAAAACGTGTCCATGGGTCAGATGTTGTTGCATATTCGTTCGCTGTGGCTTCGTGGATTAGCCGTGAAAACTTTTAACAAGGCGCTGCTTCGGACAAATTTTCCGCTTGGCTTCAATTTACCTCAGAGCGCGGCATTAATCCATAGAAATATTAATCCGAAACCAACGAGTTAGAGTAAAAAATCTAACAAGGTTTTGCCCCAAAGCAATTCAGCGCTTCATTTTTAGACTCCGCTTTAATTTTATTTATGCATGAGATTTTATAATCCATATTTGTTCAGTTAAGAGCGCTATTTCCTATACTTCAAGTGTCTATTCATTACAACCTGAGGAGGTTTGTATGGACGCTTGGAATAAAGGTCATCCAATGGGACAAAAACGACCTTTTAAATTGGAAGAGATTTGGCGCATTCGTACTCGTTTGGAGTTAGAACATAATATTGCTGAGTTAGCGTTACTTAATCTGGCGATTGATTCAAAATTGCGAGCCTGTGACTTATTACACTTGAAGGTC
CAAGACATTACCTCATCTGGCATAATCAATCCGCGAGTGCGATTAACACAGCGTAAAACCCATCGTGACGTTCAGTTTGAAATTACTCCAAAAACTCAGCACAGCCTGACATCTTGGTTGCTCGAAAAGGGATTAACCGCTGAGGATTATCTATTTCCTAGCCCAAGGTGTATTCATCAGCCGATGAGTTATAGCTATTACCGCTCGAGGGTAAAAAACTGGGCGAGTCAATTAGGGCAGAATAGTGAATTGTATGGCACGCACTCGTTGCGTAGAACCAAGGTGACATTGATTTATCGTCGAACCAAGGATATTCGGGCGATTCAAATCTTGCTTGGACATGTCAAGTTAGATAACACCGTACGCTATTTAGGTGTCGAGATTGAAGATGCACTGCGGATTTCAGAAGCTATTGAAGTATAAATTTATCAAGTGTGATTATGATTAAACCATCATAGCTGCACTGTTTTGAAAGGAAAATGAAATTAAACAAAGTGCTTTCACTTTGTTGCTAAGGTGTGTTTATTTTTATCCAGAAATGAGTTAGTGAATCAGACTTACCTCATCTGATTCACACGCTAACATCTTTAATGTTATGTATTTTTGGGTTTATAAAGCCGTAGCCGAATTCAGTCGAACACTATGAATGTTAAATTTCAGTTTTTATAATTTCAAAAATCAAAATTTACAATTTGTACTCAATAATGCTGTTTACCGTAACAGGCTAACCCCGCTGACTTGATTACGGCCATGTTCTTTCGCAAGGTACAATTGCTTATCTGCTGATTCAATAAGTGTTTGCCATTTTAGATGCCGATGGGGCTGCATGCTCGCAACGCCAGCACTAATCGTGACCTCTTGGAAACTGGAGCCTAAATGCTCGATGTGCAAGGCTTGCACTGCGTTAACGATGAGTTGTGCGATGTGCATCGCGCCTAAATGCTGAGTGTCTGGTAATAGGCAAATAAACTCCTCACCACCATAGCGCGCCACTAAATCGTTCGGGCGGTGTATCGCTGTGCGTAGTGCTTTTGCAACGAGACGTAGACATTGATCTCCTTCCTGATGACCATAGCGATCATTGAAAAGTTTAAATGAATCGACATCTAACATCACCACTGACAATGGTGCACTATTTCTGCAGCAGTACTTCCATATTTCGGGAAGTCGTTGCTCAAACAGGTGCCGATTTGCAACCCCTGTTAGCCCGTCCAGTAAAGCAATCGATTGGAGTAAATCGGCTTGTTGTTTGAGGATAAACTGATTATTCACCCTCGTGTTCGTGATAATCGGATTGATGGGTTTATGAATAAAATCAGCGGCGCCAAGCTGAAAGCCTTTAACCTCTTCTTGTTCATCAAAATGCGCGGTAACAAAGATAATGCAGATATTGGCTGTCGCGGGATCGGCTTTGAGCCGTTGACACACTTCATAGCCCGTCATACCGGGCATTTCTATATCCAAAAGTACTAAATCAGGCAGTTCATTCTGACAGACGTTGATGGCTTGCTCTCCGTTTGTGGCCATAAACATTTCATAATCTTCATGAAAAAGCTGATGCAATATTTTTATATTGAGGGGTTGGTCGTCAACGACAAGAATTTTTCCCTTCATGCGATTGGCTTGCGGGATGATTTCCGTAAGATGCATTATTTATTCTCCAACATGATGATGCGTAAAGTGTCTGATGCCTTATCGAATTCGAGTGCTTGCACTTGTCCCTGTAGTGTTATCCACTGTGGATGGTGTGGTAATTTTTTGATTAATCGGTCAACGAAAGTCATGGCTTCTAAATTGTTCGCTTTTAAGAGGAGCATGAGGGCGTTCAGTTCAGTCCAAGCACTCGGCATTAACGCTTGAGAATCGATGGCCTGTGTCATTTTATCTTGAGCTAACACAGTCGATTCAGGTTGAGGGATGTAGGTAGATAGTTGCTGAGTACTTTCATTGATTAATAATTCTAATTGTTCGGTCCACACTTTTATCTCAATAAACTCAAG
GTCATTTTGTTTGAGTTGTTTCTCTAAAAAAGCAGCATAAGCGGCTAAACGCAGTGCACCAAAATTACTGGCAAGACCTTTGATGCCATGTCCCACGGCCGCCGCAGTGGCATACTCAAATACTTTAGTAGACTGCTTGAATAGATTTAGTTGTTTCAACATTTCAGGTGCAAAACCTTTCGCCATTTTCTGGAAAAAAGCCTGATTACCGCCAAAACGACGCAGTATTAGCTGGATGTCATCCAGTAATGGCATATCGTCGAGTGGGACTTGTAACTCCGCTGGTTGATGCTGTTGTGCTAATGAAGGGGCTTCTTCCTGTCCTACTAAACGTAAAATGCTGGGGAGTAATATTTGCATGTCGATTGGCTTACCCACATGATCATTCATACCTGCATCGAAGCAATCCTGTTTATCGCTTTGGGATGCATTGGCAGTCATCGCTAAAATGGGTAGCTCCGCAAATCGGCTATCAGCGCGGATAAGCCGTGTTGCGGCTAGGCCGTCCATATCGGGCATTTGTATGTCCATGATCACGATATCGAATACGTTTCCACTTTCTAACACTTGAAACACTCCTTCAACGCCACCGTTGGCAAGTTGCACGTTTGCTCCTTCATAACTCAGCAGCTCATCAATCACTTGTCGGTTGAGTTGGTTATCTTCAACCACTAAAATGGCGAGTGTAGATAACCGTCGTTGCGAGTGTGGCGTAGGATTCACTGTCATCGTTTTGCCTTCGATAGCATTGATTATCGCCTCGGCTAAAATCTGTGATGTGACCGGTTTAGTTAAAAAATTCACGAAGGGGACATTTTTTATATGCTGACTTTCGCTAATCACCTCATGGCCGTAGGCGGTGAGCATAACGATGAGCGGAATAGGGCTATCTGAGTGAGAGTTACATATCAAATCAGCCGTTTGTAGCCCATCAAGATCTGGCATACGCCAATCCATTAGTACGACATCGAATGGTCTTGCGTTTGTATTGGCTTGCTTTACTTTTTCAAGCGCTTGATATCCCCCCGAGGCGGTTTCTGCACTGCAGCCAAAGTAAGTCAAAATCTTATGTAAAATTTCGACGGTAAGCGGGTTATCATCGACAATTAAAATATGATACCCACTGAGGTCAATGGCGTGAGTAGCTTCAACTTCCATGACGCAGAACGTTACATCAAACCAGAAGCGGCTACCCACACCTACACGGCTGGTGACGTTTAACTGGCCTCCCATGAGTTCGACTAACTTTTTGCTTATCGCTAATCCTAGCCCTGTACCGCCAAATCGCCTTGATGTTGACGATTCGGCTTGTTCAAAGCCCGTAAAAATGCGCTCTATTTGCTCTTCGCTAATACCAATACCAGTATCTGAAATTGAAATCTGCACAGTAACGGTATTGACCTCATGACGTAAGCATTCCATGGCGACGGTGACTTGACCGTGGCGTGTAAATTTAAGGGCATTACCTGCTAAATTAACCAAGATCTGTTGTAGCCTCAGTTGGTCAGCTAAGAGCCATGCGGGTAAAGCCGGATCAAGGTCAAACATGACCTCGACATTTTTGTTGCCATGGTTACCCGACAGCACTATTGCAAGGTCTCGCATAAGCAGTTCGATTGAACAAGGGTGCAGATCGAGTGTTAATTTACCGGCATCGATTTTTGAGAAATCGAGAATATCATTCAGTAATCCTAGCAATGACTTCGCCGCAGTTTGGGTTTTGGAGACATAATCCTGTTGTTGCACTGACATTGAGGTATGTTGCATTAATTGCAGCATCCCTAAGACCGCATTCATTGGGGTGCGAATTTCATGGCTCATATTGGCTAAAAATGCGGACTTTGCCGCATTAGCCGCATCAGCTTCATGTTTAGCCTCTCTCAAGGTTTCTTCTAATTGCCGCTGTTCGGTAATATCTAAATTGATACCGATAACTTTTGTGACCTGACCCGTGACATCGCGTTCGATTTGGGCGGCAGCTTGCACGTAACGAATAGTGC
CATCGGTTCTGACGACTCTAAAAATGGGATCATATTGGTCTTTTTCTTCTATTGCTGCTTTTAGCTTCGCCTCGGCGAAGTCAATATCATCTGAATGGATCCGCATACGCCAATGTTCATAGGTTAATCCTTTATCCTGTAAACTTAGGGGGTGATCGTAAATAGCAAACATCCGCTCATTCCATTCGAGGGAGTTATCGCGTAAATCCCAGCTCCAAATCCCAAGCTTGGCGACTTCCGCAGCCTTACTTAAGTGATTACTCGCAGTGATAAGTGCTTTTTGTTGTTGTAGTACTTGGGAAATATCAATCGCAATGCCTAAATAACCAATAATCTCGCCGACATTATCCAGCATGGCCGTGACAGAGAGAGAGACTTGGCATTGACTGGTGTCTTTTCGCACATATGTCCAAGTGCGCGTTTCAGAACCGAGGGTTCTCGCTTTATAGACGAACACATCAAAGCCTTGAATATCTTCACCATACTCACTGGACAATTCAGCCGATCTTGCCGCGACTTCCTCGGGCAAATGTAGCGGTGCCGGTGTTGATAGCCCGATCATATCCACGGCGGCGTAACCGAGTAATTGCTCAGCCCCACGATTGAAAATCGTGATCACGCCCTTTTTGTCCGTTGCAATGATGGACATTTCGGAGGCGGCATCGAGTACGTTCGTTAATAACGAACCTAACTTATTTTTTTCAATTTGCGCCAATTTACGCTCTGTGATGTCGGTTCTTAATGCGACAAAACGTTCTATATTGCCGTTTTCATTAAAGACAGGACCAATCACAGTATCAAACCATTTCAGCATTTGGTGTCGGTCACGATTACAAATCTCACCATGCCAAGATAGGCCCGATTTTATTTGGGTCCACATGGTGCGCCAGAATGCAGCATCATGCTCACCAGAGTTCAAAATGGCATGGGTTTCACCTAACAATTGTTTGCGTGAATAACCACTTACGTGACAAAAATTATCATTCACCTCTAAAATGACGCCATTCGGGTCGGTGGCAGAGTAGAGCAGCTGTTGATTGATGGTGTCGAGCAACGTCTGGTTTTCGAGTAATGCTTGCTGGAGCGCATTGGTACGCTTGGTGACTTCTTGTTCTAAACGTGCATTCAAGCTTAATATATATCGTTCTGCTTCTTGCTGCGTCGTAATATCGCGGATCGTTTGGCTAAGTCCCACAATGGCACTATCCTCATCATATATAGGCAGCGACGTTGTCGAAGTCGATACATGGCTGCCATTCTGTCGTTGATGGGTTGATATTTGATTTAATACCGTTTTTCCTGAAAACACCTCAGCAAACAACACCTGCTCTTCTAAAACAATGCTTGGTGGGATAATCAAATCGCAACTTGAATGACCAATAACGTCTGTCTGTGTATAGCCAAATATCTCCTCAGCACCTTGGTTCCAACTAGTAATATTGCCCGTTAAGTCATAGCTGATAATGGCATCGAGGGAGTGCTCCAGCATGCTAGCGCGCCTTGCCTGTTCAAGTTGCACTTGCTGCTTACGCTGTAGACTGATAGACCATATAGCGATAAAGGCCGCAAGCAATAAGCTTAATAAACTGCCGCTCAGTAATACTAAGCTTGGTTGATTAAGGTGTAATGACTGAATAAAGGCCGGGTAGGCAATGACTTCGACCTGCCATTTGCGGCCAAAAATTTCTTTCGTCAGTGTGTGGCTATATGCCGATAATGGAGACGCATCATCAGGGTGTGTTTCAAAAAAACTGATGGGACGCTCAGTTTGGGTGATATCGCTAAGCAGTAGCTTCGTCATCTTTTGATTTAATGCTAACCCTGTGAGCACTTCATTTGTCACTAAGGGCGCATAACTCCAACCATACCCTGCTGCAAGTCGTTCTTGTGGTGTTTTAGGCACAGAAATTGTTCGATAAATAGGTTGTAGTATCAAAAATGATTGTAGGGGTTTCCTCGTAGCCTGCACTAAAGTAATTGGCGCAGACAAACGTACTTCA
CCGGATAACATAGCTGCATCTGCCGCTGCTTTACGATGTGCCTCTGATGCAACATCTAACCCAATGGCAGCACGGTTTCTGTCTATCGGCTCAATGTATTCAATCACATACTTTTCGCCGCTATGGGGGGTCAGTTGCCGAATATTGAAGTCCGGCCAATCGTCATTTTTGGCTCTTGTTAAAAATGCGGCTTCATCTGAATTGGGCACTCGGCGAATAAAGCCAAATCCACGGGCTCCGGGGAACTCTTGATCCACATCCCTCGATAAACTATAGCGATGGAAGCTAGCACGACTAATACCATACTCGCCTGCGCTTAGAATCGAACCTCGGGCACCGCGTAAACCATATTGATATAAGGTTATACGATTAAGCACGTTCTCACTGATTTGCTCCGTGTTTTCGCGCAATGCTTGGCCAATCGTTTGGGCGTTCATGCGTCCTGTCAACCATGTGAATAGTGCGCTTAGCAGTAATCCAATCAACAGGACTAATAAGCTCCATTTCGAAGCATTTTTATAATTCATAGAAAGGAGCATAAATGCCTAATAATTTATAATCAGAGAGTTATGATTTTGGACTAAGAAGATTTTTTTTTCCACTTGTAGTAACACACTAATTTGGGGCACCTAAATAGAGGTGATATTTAGATGAACAGATGAGATCGACAACCCCAAAAACATTTAGTGCTGAATTTAAACTTAAAGCAGCCTAATTAGTCTCGATAAAACCATACTACCGGCGCCGCGAAGGCGATAAACGTGGGTAAATCAACTATGGATAAATGGGTTAGACCAGAATTCAGATAATAACTGCGACACTCTAATCGAGGTGTAGAAGAAGCCAACTGCTGAAAAGTGGTACGCTTTATCTCGCCAAAGTAAAAGAGTATCACCATGAATTATCAACAGTTGACCGAAGCAAGACGATACCAGATTTCCGTGCTGTTAGAACAGGGCTTTTCGATAGCACACATAGCTAAATGCATTGATTGTCATCGCTCTAGTGTCTATAGGGAAGTTAAACGATGCCAAGGGCAATCACGCTATCAGCCTGATATTGCACACAAGCAGGCTGTCACAATGAGGCGTCAGTCGAGTAAATATGCGATACCCTCACTGCGAATAGAGAGCATCATCCTTTTGCTGCAATTGGATTGGAGCCCGGAGCAAATTTCGCAGGTGTTGCGTCTGGTGAAGGAGCCCGTCAGTCACGAGTGGATTTATCGCTATATTGCACGCGATAAACGCCGTGGTGGCAAACTTTATCGTCATTTACGTCAAGGACATAAGCGTTATCGGCGCGGTAAAGTAGAGCAAGCTCCTACCATAAAAAACGCCGTATCCATTGACGACAGACCAGCCATTGTTGATAGCCGAGAGCGATTTGGTGACTGGGAAATCGACACCGTATTGGGGCTGCATGGAACGGGTTCGATTGTAACCCTATTAGAGCGTAAGACTCGGTTTTATTTGATAAAGAAAGTCAATTCGAAGTCAGCGGCAGATGTAACGAAAGCGACGATTGAGTTACTGATGCCCTATAAGGCGCATGTGCTGACAATCACGGCAGATAATGGACGCGAATTTGCTCACCACCAAGAGATAGCAGAGGCGCTAGATACACAGGTGTATTTTGCGCACCCTTATCGTTCTTGTGAACGAGGAGCGAATGAAAACGCCAATGGATTACTGCGGCAGTACGTCAAAAAAGGGGTTGATTTGAGGCTAGTGAGTGATGAATTAATCGAATTTGCTCAGAATAGAATTAACTATCGACCCAAAAAGTGTTTGAAGTTTAAGCAGCCTGCGGTGGTATTTCACCAATTAGCTGCTTAA
Protein sequences of DBSCAN-SWA_1 >NC_017571|132542:168306|167364_168306_+|WP_014615270.1|transposase|DBSCAN-SWA MNYQQLTEARRYQISVLLEQGFSIAHIAKCIDCHRSSVYREVKRCQGQSRYQPDIAHKQAVTMRRQSSKYAIPSLRIESIILLLQLDWSPEQISQVLRLVKEPVSHEWIYRYIARDKRRGGKLYRHLRQGHKRYRRGKVEQAPTIKNAVSIDDRPAIVDSRERFGDWEIDTVLGLHGTGSIVTLLERKTRFYLIKKVNSKSAADVTKATIELLMPYKAHVLTITADNGREFAHHQEIAEALDTQVYFAHPYRSCERGANENANGLLRQYVKKGVDLRLVSDELIEFAQNRINYRPKKCLKFKQPAVVFHQLAA >NC_017571|132542:168306|139497_140763_+|WP_006083709.1|DBSCAN-SWA MDQEIRLEISAWLAGFGIDSQPSDGISTSIMIIACLLLAGIAYFIVRRVVIRAVNMVILRSKVTWDDVFMRYKVLEKLAMLVPAIVLNLLVPIALTEHQILSNLVDRLLSIWLVVLMIRAIYAGLDAVDEISDVNLVSRRLPVKSFVQLTKLFLFFVGIIVSISILADQSPVYFLSGLGVATGFVMLVFRDTILGFVAGIQLAANRMVSKGDWIQMDKYGADGAVEEVSLTTVKVRNWDKTITMIPAYALVSDAFRNWRGMSESGGRRIKRAINIDINSIKFLSEEERERLSKINCLKEYFPAKINEIRESNARVSDLDMKVNGRHLTNVGTFRAYLQEYLQRHDKIHKEMTLMVRQLAPTTEGLPIEVYIFTNDTRWAFYEAIQADIFDHIFAILPEFGLQAFQAPTGNDIRSLKSVKVD >NC_017571|132542:168306|156253_156811_+|WP_006083696.1|DBSCAN-SWA MSRAVSLHQHFYNLLKVRKMNHFTVTEFRDAVIADSQTSWCKDEARKFVYRRLLVLVRKRLLYRTGKKHSPDTLYSTTKLFSQTAFKVKNTILAAGKGGFLKVQKNSSDSFLTVLQKEKVMYETDLRFATEEMEVFECLIQRFPEQKEFIEPFFHEAKVRSAQLYSHVSALSKILNINANGLCQY >NC_017571|132542:168306|138291_138684_+|WP_006083711.1|DBSCAN-SWA MQIKHGIYGVVISLALSAAAWADEAPKVGCAAKLEAISAELAQAKAAGNKHKVDGLEKAYYEVSTHCDDDSLYAERVAKVAALEEKLTERQNELAKAIEEGRSMDKINKKRNKVAEAELELAKARAELTQ >NC_017571|132542:168306|154129_155500_+|WP_006083698.1|DBSCAN-SWA MSGQYLSPPECPFSTISDVIFEIRNHGDDPNKYMDYISSFHIKYHSFDEFRRSVKPSLNVNFCWWLVRLHRDSSLINLTPLENKPLPLHLPESVNRALSQSPDFMTYCRFNHLHEFSRVLSQIDRYTTHSAMLGVLEQIGESVHFEHFKNNLIDDEAISSSQLEGAATTTLIAKEMLRKKRRPCTDGERMIVGNADLMNAAWDLKDEAFSIDLIKTLHRMGTQHLEQKNYTSGIFRTTDDIAVVNADGEVVHQPPKHDAIECLLSRVVNWLNSTDDHIHPVIQAITLHFVIGYVHPFRDGNGRIARALFYWYLFKCGYTGFRYIAISALLKKSPIKYGQSYLDCEHDNLDLTYFIRYQLSIINRAITEFVSAYERAAYRMTELAEQYSDLDDIEKKIIGFVAGEPRYRKKPTITAREAEKLMGLSYNTVKQRLDDMVTSGILRREKIGKATEYFLG >NC_017571|132542:168306|148194_151941_+|WP_193377958.1|integrase|DBSCAN-SWA 
MGWYSYLGTTAPEDRYPTLAEASEAAINLLNNMSVALTSKKYVRHYMADPRLPATPSNFYKNDWVGWYNYLGTTAPEDKYPTLAAASEAAIKLLNSMGVALTYTNYTKHYKADLRLSATPYYTYKNDWVGWYSYLGSTAPEDKYPTLAEASEAAIKLLNSIGVAKKIENYKKHYKADPRLPASPDEIYKNDWVDGPHFLNLLDYYNSPQEAYDAAINIVGISEKLTLPRYGGLRTIDSRLPAKPHIFYGLGSWQQFIACRFYPSIEECAKAALKVIPEDSRTVNGYAKNYKKDPRLPSRPELIYSAWESWKVFYEGKNARPYSVEKCIEIANENALWDVESYEEFRKKNDSRLVSHQTLTSKTCKDFRGSLGIGYFTLEEVKDYCQKNRIATMPEYKVHAQKHPRLKVTLTPKSLPSYTNFNDVKWRIEEGQELEDLGLERWYEIYRAYRNNTPELGPTAKVAIMAFFTFFGYFFIKNKEIGSIFSEAITAPDLNQFLESDKVYPTNKTAYLTHIFAFLSFAFNRTCTIKDEDDSDVVVLEGYKLPYVELFTKVSNDKKIAYSQTIKPALPYGYIVKARDALVPERAQHFSDLLFAHTLNQSDWIEVSEEFYNQEKDIAEGDNRGDPDCIVRQRKVTGRTKGRSGVYSTIYEMWCPVRAMALYFLISHPLRGQQICWMDSGETDKYRLEYNSNAPQDAQFKWIENTSPLRQKSIKKRDTGIFWKTGHQNEFGMFTNTNKTGQPFTSVYFPLELAKWLVKLRDWQEKYNAISKPTSWRDLAFNTISKPSEGVLRKRGSTSFLFRLPKQSQLKVGRTEGSPMNPDALVPALAKLLQHVEDENLPLTCIQNGNLKSYYSKHAMRVSLITAYVVHGDVPLEVMVKIVGHATILMTLYYTKIEQGEIKRILNNAEKKALQNHAKQVEDAILQGEINSIKSELIINENSVLHTPDYPLASLQFTDYGICPMAFNGCDTGGSVVDSKGGEVEAPVKPGWLGEKNCFRCKHFVTGSAFMAGLAAKGNEIASAKLRLSQAIQDLETKCDQLMRDASDLEGLGEKSAAARKLSDKKLIERDIESKTSLLDLYMCDIFAIYKLIIQSIELLSNTENTEGTKGVPLLLNSSSLSITIDEVDEYQVLNEVCQNREVFVSGMADHDNLKRAQMLDRMLKKNGMEPALFELPPAVQLKVGNQMTNLMLERLKGDWNGVNAIIKGQKPLNFIESKDNSREMSLADELKMTIAPALRNKIEFMEE >NC_017571|132542:168306|157311_157587_+|WP_037392962.1|DBSCAN-SWA MKNTKLSILLSKRDVTAPIPDDLAEWLDSPSVGNEIITDCEDTPLSSDDEYRAALKKIEELFDAIPNTAEGSELEKWISLVETYENEHFPM >NC_017571|132542:168306|160294_160891_+|WP_006083692.1|integrase|DBSCAN-SWA MDAWNKGHPMGQKRPFKLEEIWRIRTRLELEHNIAELALLNLAIDSKLRACDLLHLKVQDITSSGIINPRVRLTQRKTHRDVQFEITPKTQHSLTSWLLEKGLTAEDYLFPSPRCIHQPMSYSYYRSRVKNWASQLGQNSELYGTHSLRRTKVTLIYRRTKDIRAIQILLGHVKLDNTVRYLGVEIEDALRISEAIEV >NC_017571|132542:168306|132542_133883_-|WP_006083716.1|protease|DBSCAN-SWA 
MEQVITATAGKDNPIQMTEIVYSKEKSLFTLLAIISGIVWAALIIGTLGMALLYVLMFFIIYLFSHSAFISYLKGTAVEINAEQFPELHKQYLACCDRLEMKEPPRAYLLAADGMLNALATRFLGRNYIVLFSSIVDALESDKDALNFYIGHELGHIRRNHIGKAPFLVFATWLPLVGAAYSRACEYTCDLHGLRCCNSLRSATNAVAVLAAGVEQWKRMNVDKYIRQARESSGFWMSLHELNGSYPWLTKRMARVDAKAQGTEYTPPSRSAWAFVFSLFMPRMGTAGGGLIIFAAIIGVAAAVALPAYKQYQEKMSTASSYTDPVIMAPQMNTGPATVPDAALAIEHIDQLHQSLAPITEVLTHYYQGNKAWPTELEVLGFTDEQLSTFNLYSDGIVGFVGGEALGDYQGYEVYLSPQVDEQSNITWVCSSSDIPDEYLPLDCQG >NC_017571|132542:168306|157748_158207_+|WP_006083694.1|DBSCAN-SWA MAFRARKIRGISLFEGESSANNAFTLVIGNNGCGKTQLLLDICNYYQMLYGELIRSKSADLQVMRRDFYKNDYRWDMVERAFGYPIPQKLICASTSQFEKFSENWKLKNDFVQGGYYAYIGSKPYIPNRLPSTRIASTALNQLLARDTEVVK >NC_017571|132542:168306|162117_167007_-|WP_014615278.1|DBSCAN-SWA MLLSMNYKNASKWSLLVLLIGLLLSALFTWLTGRMNAQTIGQALRENTEQISENVLNRITLYQYGLRGARGSILSAGEYGISRASFHRYSLSRDVDQEFPGARGFGFIRRVPNSDEAAFLTRAKNDDWPDFNIRQLTPHSGEKYVIEYIEPIDRNRAAIGLDVASEAHRKAAADAAMLSGEVRLSAPITLVQATRKPLQSFLILQPIYRTISVPKTPQERLAAGYGWSYAPLVTNEVLTGLALNQKMTKLLLSDITQTERPISFFETHPDDASPLSAYSHTLTKEIFGRKWQVEVIAYPAFIQSLHLNQPSLVLLSGSLLSLLLAAFIAIWSISLQRKQQVQLEQARRASMLEHSLDAIISYDLTGNITSWNQGAEEIFGYTQTDVIGHSSCDLIIPPSIVLEEQVLFAEVFSGKTVLNQISTHQRQNGSHVSTSTTSLPIYDEDSAIVGLSQTIRDITTQQEAERYILSLNARLEQEVTKRTNALQQALLENQTLLDTINQQLLYSATDPNGVILEVNDNFCHVSGYSRKQLLGETHAILNSGEHDAAFWRTMWTQIKSGLSWHGEICNRDRHQMLKWFDTVIGPVFNENGNIERFVALRTDITERKLAQIEKNKLGSLLTNVLDAASEMSIIATDKKGVITIFNRGAEQLLGYAAVDMIGLSTPAPLHLPEEVAARSAELSSEYGEDIQGFDVFVYKARTLGSETRTWTYVRKDTSQCQVSLSVTAMLDNVGEIIGYLGIAIDISQVLQQQKALITASNHLSKAAEVAKLGIWSWDLRDNSLEWNERMFAIYDHPLSLQDKGLTYEHWRMRIHSDDIDFAEAKLKAAIEEKDQYDPIFRVVRTDGTIRYVQAAAQIERDVTGQVTKVIGINLDITEQRQLEETLREAKHEADAANAAKSAFLANMSHEIRTPMNAVLGMLQLMQHTSMSVQQQDYVSKTQTAAKSLLGLLNDILDFSKIDAGKLTLDLHPCSIELLMRDLAIVLSGNHGNKNVEVMFDLDPALPAWLLADQLRLQQILVNLAGNALKFTRHGQVTVAMECLRHEVNTVTVQISISDTGIGISEEQIERIFTGFEQAESSTSRRFGGTGLGLAISKKLVELMGGQLNVTSRVGVGSRFWFDVTFCVMEVEATHAIDLSGYHILIVDDNPLTVEILHKILTYFGCSAETASGGYQALEKVKQANTNARPFDVVLMDWRMPDLDGLQTADLICNSHSDSPIPLIVMLTAYGHEVISESQHIKNVPFVNFLTKPVTSQILAEAIINAI
EGKTMTVNPTPHSQRRLSTLAILVVEDNQLNRQVIDELLSYEGANVQLANGGVEGVFQVLESGNVFDIVIMDIQMPDMDGLAATRLIRADSRFAELPILAMTANASQSDKQDCFDAGMNDHVGKPIDMQILLPSILRLVGQEEAPSLAQQHQPAELQVPLDDMPLLDDIQLILRRFGGNQAFFQKMAKGFAPEMLKQLNLFKQSTKVFEYATAAAVGHGIKGLASNFGALRLAAYAAFLEKQLKQNDLEFIEIKVWTEQLELLINESTQQLSTYIPQPESTVLAQDKMTQAIDSQALMPSAWTELNALMLLLKANNLEAMTFVDRLIKKLPHHPQWITLQGQVQALEFDKASDTLRIIMLENK >NC_017571|132542:168306|151942_152599_+|WP_006083700.1|DBSCAN-SWA MNFDRALKELKQSVKSPRTLKSLEIIQEVCREEVLNKSFDFTVTNIGKKSGERGGVKTQPIRNKSGQLYRDLIKLYSEEYEQDKNVKASSLTNEDSWVHGINDIATKYLIELTLRENKELKAKNNMLRINWRQYEPVIIEEVNQYEKKSEAQVEGKLIVSPTRLSLTEKVLFEDLVSETKWRSKGCYMENGNVYHEESDKPIFTQEHIALIKKVLEQG >NC_017571|132542:168306|142133_142586_+|WP_006083706.1|DBSCAN-SWA MLKNGLLNKVSWQRVFAALLLLVTCHVSAVPTDEITQMLKGQEDAWNRGDLDAYMQGYWKSEQLRFVSNGKFRFGWEDTLAAYKKNYPNKEALGELKFTIKEIKMLSNYAAMVVGRWDLRRIKDAPTGVFTLLIEKIDDRWVITMDHSSD >NC_017571|132542:168306|137431_138058_-|WP_006083712.1|DBSCAN-SWA MQTSLRLSALSSLLLLTSSVMAADVAQITLENGAQVRLKDDFTWEYVITETKAAPSALAAMAVSTSSNIASVPSVALATQTVAAPVTTLTATAIAQPELLGSTGKDGIKVSFAEGQWKGDKLGLSFDLASTSNEHVTLVEVEASFFADNGTLLKTEKLEVWEAIFRMPETYLRKGEQRKSSVIWVEGIDKAQWQKQLINLKITEINSR >NC_017571|132542:168306|138710_139289_-|WP_006083710.1|DBSCAN-SWA MSRYSSMLACLSFGLLSLSQPLQAAEYSTSKPLVMSDKSEKTANPSYIALMLDYVPAADNLYGITLAPYHFDNNYQQWGYYLGYARSRETDVIVPEPALSYRQESLFRLGLNYSLTADLSFYAGGSGYISTVSYTNNISPKIVDGKPTWEEDKTTRWGAEAGLRYRITEHFILSAGYNSSTESTVLSIGYAG >NC_017571|132542:168306|142659_143463_-|WP_006083705.1|DBSCAN-SWA MCILFVAINAHPKYPLIICANRDEFHHRPTAPAHFWPPEENILAGKDLQAGGTWFGVNKQGQVAAVTNLRVPQKNPEAMRSRGELITMTLNSGSLVCPNWLIEHSDHYQPFNLVFGQGTDLYCFNSINKDTVKLADGFHAISNGALDDIWPKMAKGQQALEAVINQSDNLEVQALLQLMKDDSQPQDNELPNTGVGIEWERRLAAIYIRHPDYGTRSTSILLEDAEGGMHFTEVRYDGKGRQLGQQDFHLTLPAEPPAKPDCLLTQI >NC_017571|132542:168306|143524_145183_-|WP_006083704.1|DBSCAN-SWA 
MLYLLASCIALLIGPLFYRYFSSGSGLQKGLDGFIFVSLGGLVLIHILPELLQHGGLLAVVFVFLGIWGPTASERLFHRYSEITHNLTLSLGIGGLLLHTITDGGAMVLAQQDGSSILLALGVIMHRLPVGLAIWWLLKPQVGTRWASLVLVAMMLLTGVGYFAGEQLITQLSLENTVYLQAFVTGSILHVVLHQPHGQHDTDKQGKYEYQAGIGSLLGIGLLVMLLLMDSGGHEHAHHDHSTEQLMTWLLTLAPILLLSYGAAALRFKLGLTPQDSSLARRWFQRLAGPEALVITALLLGPWLALFQLLVVFILSAYLSHANVTITDPHTKLPSNSLRFGFAHLVDRSAPWILLSLVLVNLIGHPSVPLSNPLLQVIVLLLVFLPMRFCNLGAAVLALALAYSGWSPIAIIMPLIAAPVLNIAQLKLMTWPQRAILLSIIALSLVAALRLPLWFSLFTLPEVVNLVALLILSGLFAASLLRLGPRKFLRRLMLSKPAAHSHSQAHGHSHAHSHEPSASHTSVQSTAETPSAHDHSDEHTHDKAKANKHTHH >NC_017571|132542:168306|135102_135525_+|WP_006083714.1|DBSCAN-SWA MKQALVHIALVVRDYDEAIDFYVNKLKFELVEDTYQAEQDKRWVVVAPPGSKGASILLARASKPEQFDFIGNQAGGRVFLFLNTDDFWRDYRRMVADGVEFAREPQEQDYGTVAVFKDLYGNLWDLLQLNPNHVMAKRMS >NC_017571|132542:168306|141440_142004_-|WP_006083707.1|DBSCAN-SWA MFTPLCRWLLKLSGWQIEGQLPDCAKYIIIVAPHTSNWDFIVGILARGALGTRIHFLGKHQLFIPPWGWFFRAIGGSPVDRRKNNNLVDAAVQLFESKADYKLALAPEGTRSPVTRWKCGFYHIACKAHVPITPVGLDFSRRTVVIRTPLQPSGDIATDMHEILSFYRTITGRHPKVIPDFVASAKH >NC_017571|132542:168306|134063_134699_-|WP_006083715.1|DBSCAN-SWA MNTHSKLMGGLFILALVPTAIWATQTKSQTTSNSADGGISVISQSVLSQSQPDFSWIYVQQEGEMTIGAGHSDDWEQLERQGNPYHADYLWVKTAGTPYVITDPAIVAQIKTAIMPMQQQGEKMQAIGEQLQQKGNAINSQTQQLLLNVATENEDPKIQAEIDSLSASIDVLDQQMNELSKVHESLSNTAEKQIVSLAQAAIKAGTALKTP >NC_017571|132542:168306|140790_141414_-|WP_006083708.1|DBSCAN-SWA MQPETWLLYLFAIVLIGISPGPIAMLSMSHGIHFGKMRSVATGLGSVSAALVLMMASAAGLGAIISASEYGFTLLKWCGAAYLVFLGIKLLLTKNQSQTLEVSQLKGKGTPRQLYQQAFLVGISNPKDLLFFAALFPQFIDLAAPQLPQLIILAATWAVVDFSFVMIYASMANVLAPSLKASNKLHWFDRTSGGVFLTLAAIIISRN >NC_017571|132542:168306|161179_162118_-|WP_006083691.1|DBSCAN-SWA MHLTEIIPQANRMKGKILVVDDQPLNIKILHQLFHEDYEMFMATNGEQAINVCQNELPDLVLLDIEMPGMTGYEVCQRLKADPATANICIIFVTAHFDEQEEVKGFQLGAADFIHKPINPIITNTRVNNQFILKQQADLLQSIALLDGLTGVANRHLFEQRLPEIWKYCCRNSAPLSVVMLDVDSFKLFNDRYGHQEGDQCLRLVAKALRTAIHRPNDLVARYGGEEFICLLPDTQHLGAMHIAQLIVNAVQALHIEHLGSSFQEVTISAGVASMQPHRHLKWQTLIESADKQLYLAKEHGRNQVSGVSLLR 
>NC_017571|132542:168306|145814_147275_+|WP_006083702.1|integrase|DBSCAN-SWA MFPMAMAIVQPLITKKGQELTVPLIMIGDGSDIVILSSYARYAADNAHKSLSWHVESARSIELLFKYKSATQDQFPSVRSMFENFTEALLHGTHENGKDKTDLWWNPYSAKQVSRYIYYLTSYSEWLYVDTEQKTELLNPQIASSKSEYLMNMAAYHHRKNNSFYKHLKNDNQARQDAKTSYAVKLRDRANHNVTSPEHCFPSEITNDVIQGFTLPGSRSHDPIYKRLDLAKVLIFMLTRFGGVRISEPFHLYINDIQPHPSEERQMIIKIHHPSESVAPVKWAEQWVTREVYLRENFGLTPRHLKKSTGCYKAGWKNPALHKVAHGNKKTELFFYVEFLNASERELFYHLWFLYLKKQRPRNNYSPFAFLSKKTGMPLTMGAYDAKLKKVINKLGYEYSKAAGTTPHGCRHLFKAEAKARGISTQVIRELLHHKSLMSQEEYAKPSIEQVRQVMKEKENEMREKIQAQHNQLMDQLKLEVDNNGN >NC_017571|132542:168306|158199_159377_-|WP_086010601.1|transposase|DBSCAN-SWA MRSTTRKTFSAEFKLEAAQLVLDKNHSIIEAAKAMNVGKSTMDKWVRQLKLERQGGTPKASPITPEQIEIRELKKQIARLEEHNLIPKKGYRSLDVRLDEQFTLISALKQSHSILTICNAFKVHRSSYKYWLKRKEHIDTDFTILCSEVKAAHRISHGSAGARTIAQLVTNKGIGLSRYRARNLMKKLGLLSCQQPKHSYRKATQEHVAIPNTLNRQFAVTQPDQVWCGDVTYVWVGNRWAYLAVVLDLFSRKPVGWALSLSPDSELTCKALKMAFELRGKPENVIYHSDQGSHYTSLKFRQLIWRLQIEQSMSRRGNCWDNAPMERFFRSLKSEWIPASGYGNFTEADKEITNYITGYYSETRPHQYNGGLTPNESERLYWNDSKTVANFT >NC_017571|132542:168306|155839_156154_-|WP_006083697.1|DBSCAN-SWA MHKENTVPKRLKEARTKAGITQADLGAKIGIHQNSASSRMNHYELGRHTPDINTLQRIADELGVPLNYFFCEDDASAKLACLIDQLPKEQKIALIKKLEAQIKD >NC_017571|132542:168306|153079_153286_+|WP_006083699.1|DBSCAN-SWA MVTEAPSASELFRRACDIYDGDLVLAEQLFNSNIPALGNRKPNELLSSPDGRRLLDDLLLKIEYGEFS >NC_017571|132542:168306|159521_160031_+|WP_006083693.1|DBSCAN-SWA MSHPLLSMPERTEESDAVYAVLGRALAYATEFESNCRTLAHLFDIEKSDSEFSYEIYKLVKSGTLHAKIKLLIDVHGLPDWVEEKVHEARKARNFIAHEAAEDHKRMMSTPQLMKSFETTIMQMTGEIADGNHIVLDVTRMLKEGKRVHGSDVVAYSFAVASWISRENF >NC_017571|132542:168306|136020_137319_+|WP_006083713.1|DBSCAN-SWA 
MNIFKIRSAALWIGRALSVSAIIALPSMASDMPTSQYHIDSDEIKMVDMPSVLLPFNNLMFLYGVDASQFDLADFIYVNAPDLIDKEEAITHWAGYYSINPKVILTLMEMQSQLISSPTEEALNRPLGALSDKQGFDEQLQDVLAQLSQRFYAYEESQLKGLYPPSTDAVNASSFALLALLNGRRIEQQQHAVMSGEHALGLDPFIEQFRLLFGNTDRELLMSSVAQNPPVADSTQSMQQVVPLANITASSLPPSNMLQMPWRQGYSWQSNGAHSHTGSGYPLSSIDVSYDWPQWGSPTYSVASAHGGTVNVLSRCQVRVTNANGWATNYYHMDQITVRNGQYVIQNTVMGIYANNKNAALCEGGSSTGPHLHFSLLKDGRHVSLQDVHLGQYRVNIGSYNYDNNCSRFNLFDVSNNRTMCAWAPLYNAGSL |
28 | Shigella_phage(33.33%) | protease,transposase,integrase | attL 142060:142074|attR 172044:172058 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1430538 : 1438105
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_017571|1430538:1438105|DBSCAN-SWA ACTACTTAATCATGCGCTTGTATTTCAAGCGGTGTGGCTCAACCACATCGGCGCCATAGGTGTTTTTCAACCATGCTGAGTATTCTGTGTAGTTACCTTCGTAGAAGTTCACTTGGCCTTCGTCACGGTAATCCAGAATGTGGGTCGCGATACGGTCGAGGAACCAACGGTCATGCGAAATAACCATAGCGCAACCTGGGAATTCCAGAATCGCTTCTTCCAGTGCGCGCAGGGTTTCAACGTCTAAGTCGTTGGTGGGTTCGTCGAGTAATAAGACGTTACCGCCCGCTTGCAGCAGTTTGGCTAAATGCACGCGGTTACGTTCACCACCAGACAGAGTGCCGATTACTTTTTGCTGATCGCCACCACGGAAGTTGAAGCGGCCTACATAGGCGCGGCTCGGGATTTCCATGTTGTTGATACGCATGATGTCTTGACCACCGGAGATTTCTTCCCAAATGGTGTTTTTATCATTCATTGAATCGCGGAACTGCTCAACCGATGCTAATTGCACAGTATCACCCAGCTCAATCGTGCCGCTGTCTGGTGTTTCAGCGCCGGATAACATGCGGAACAGGGTCGATTTACCCGCGCCGTTGGCACCGATAATACCGACGATCGCGCCCTTAGGCACAGAGAACGACAGGTTGTCGATGAGTACGCGGTCACCATAAGATTTGGTCAGGTTATTCACTTCAATAACCTTGTCACCTAAACGTGGTCCGGGCGGAATAAACAGCTCGTTGGTTTCGTTACGTTTTTGGTAATCGTTAGTGTTCAGTTCTTCAAAGCGCGCCATACGGGCCTTGCCTTTAGACTGACGGCCTTTGGCGCCTTGGCGTACCCATTCCAATTCCTTGGCAATGGTTTTTTGGCGAGCGCTTTCGGTGGCAGATTCTTGTTTCAGACGGGCGTCTTTCTGCTCAAGCCATGAAGAATAGTTACCTTCCCACGGAATACCTTCACCGCGGTCGAGTTCTAAAATCCAGCCAGCGGCATTGTCGAGGAAATAACGGTCATGGGTAATCGCGACCACAGTACCAGCATATTCCTGCAAAAAGTGCTCAAGCCACGCCACAGATTCAGCATCCAAGTGGTTGGTCGGTTCGTCCAGTAACAGCATTTCTGGCTTTTCAAGCAGCAAACGACATATCGCTACGCGGCGACGTTCACCACCGGATAACACTTCGATTTTTTCATCCCAATCAGGCAGACGCAGGGCGTTAGCGGCGCGCTCAAGGACGTACTCTAAGTTATGGGCATCTTGGGCTTGGATAATGGCTTCGAGTTCGCCTTGCTCTTTTGCAAGGGCGTCGAAATCGGCGTCCGGCTCGGCGTAGGCCGCGTAAACAGCATCGAGGCGAGTTAGGGCATTTTTTGCCTCTGCAACCGCTTCTTCAATCGCTTCACGCACGGTTTGATTCGGATCGAGTTTAGGTTCCTGCGGCAGGTAACCGATTTTCAATCCCGGCATTGGGCGCGCTTCACCTTCAATTTCGGTGTCGATACCCGCCATAATGCGCAGTAGTGTCGACTTACCAGAACCGTTGAGGCCGAGTACGCCAATTTTGGCGCCGGGGAAAAAGCTTAAAGAAATATCTTTAAGGATCTGCTTCTTAGGCGGAACAATTTTACCCACCCGCAGCATGCTGTAAACAAACTGAGCCATTTTTCTCTATCTTAAGTGACTAATGATGCGACTAATTCTACTCGATTGCGCGCCGAACTCAACTCGACAATGAGCCAAGTTCAAGCTAAATCCATAGGAATTTCATGTTTCAAATACTGGGATTGTGGTCGCGTTCCGAGTAGAATACCGCGCAACCCTACCTATAAGAATTGGCAGCTGGAGTAAGCAATGCTGAAAAAAGATATGAATATCGCAGATTATGATCCGGAACTGTTTAAGGCAATTCAGAATGAAA
CTCTGCGTCAAGAAGAGCATATTGAGCTGATTGCTTCTGAAAACTACACCAGTCCACGCGTGATGGAAGCGCAAGGTTCACAGTTAACCAATAAGTACGCTGAAGGCTATCCAGGCAAGCGTTACTACGGTGGTTGTGAGTATGTGGACGTAGTTGAAACCTTAGCGATTGAACGTGCGAAGGAACTGTTTGGCGCAACTTACGCTAACGTACAACCTCATTCAGGTTCACAGGCAAACAGCGCGGTTTACATGGCATTGTTAAAACCTGGCGATACCGTACTGGGCATGAACTTAGCCCACGGTGGTCACTTGACCCACGGTTCACCCGTTAACTTCTCAGGCAAACTGTACAACATCATTCCTTACGGTATCGACGAGTCAGGTAAAATTGACTACGACGAAATGGAACGTCTGGCCGTAGAACATAAGCCTAAGATGATGATCGGCGGTTTCTCTGCCTATTCTGGTATCGTTGACTGGGCAAAAATGCGTGAAATCGCAGACAAAATAGGTGCTTACCTATTTGTCGATATGGCGCACGTTGCCGGTCTTATTGCTGCTGGCGTTTATCCAAACCCAGTACCACACGCACACGTTGTGACATCAACGACTCACAAAACCTTAGCGGGTCCTCGTGGTGGCGTGATCCTGTCTGCTGCTGACGATGAAGACTTGTACAAAAAGTTGAATTCAGCGGTATTCCCAGGCGGTCAAGGTGGCCCATTAATGCACGTTATCGCCGGTAAAGCGGTTGCGTTCAAAGAAGCACTAGAGCCAGAATTTAAAGTGTACCAACAGCAAGTTGTTAACAACGCTAAAGCTATGGTTGAAGTGTTCCTTGAGCGCGGTTACAAAATCGTCTCTGGCGGAACCAGCAACCACTTAATGTTAGTTGACCTGATTGGTCGTGACTTAACGGGTAAAGAGGCCGATGCCGCCTTAGGTAGCGCTAACATTACCGTGAACAAAAACTCAGTGCCAAACGACCCACGTTCACCCTTTGTGACCTCAGGTGTGCGTATTGGTACGCCAGCCATTACGCGCCGCGGCTTTAAAGAAGCGGAATCGAAAGAGCTGACTGGCTGGATTTGTGACATCCTTGATGATGCAAGCAATCCTGCTGTGATTGAGCGTGTTAAGGGCCAAGTATTGGCACTGTGTGCACGTTTCCCTGTTTACGGCTAATTTGCTGACGCATCTACGCTAAGTAGATGACTGGCTGAGAGTGATCGCTTGCTGACTAGGTAATTACATGAGGTACTTACATCAGGTACTTAGCATCCGATAGTTAACCTCAGCCCATAAATAGGATAAAATCCCATGGCCGCATCAAGCGGCCATTTTTATTGGTAAAATCTTAGGCTTAAATCCATTGCGTTTAGCTGAGATTTTAACAGGTAAAGCCATTAACGCTTTACTCACATTCACTGGCAGGAGGCTCAATGCATTGTCCATTTTGCAGCGCGACAGATACTAAAGTCATCGATTCTCGATTAGTGGCGGAAGGCCATCAAGTGCGCCGTCGCCGAGAATGCACCGAATGCCATGAAAGATTTACCACCTTCGAAGGCGCCGAGTTAGTCATGCCTAGGGTGATTAAACGTGATGGTTCACGCCAACCCTTCGATGAAGAAAAACTCCAAGGTGGCATGCTGCGCGCGGTTGAAAAACGCCCCGTGTCTATGGATGAAATCGAGCAAGCGCTTAGCAAAATCAAATCTACCCTGCGCGCCACTGGCGAGCGGGAAGTGCCCTCAGAAATGGTCGGTAACTTGATGATGGAGCAATTGATGAGCCTAGATAAAGTCGCCTACATACGTTTTGCTTCTGTGTACCGCGCCTTTGAAGACGTGTCTGAATTTGGTGAAGCCATCGCCAAACTGCAAAAGTAGTTATTCGATTTTTGTGGCCTGTCGTTCAGTCTTCTGGTTGCGCTAGCTTTAAAAGAGGCTTTTGATTAATTGCTTCTGCAAGACAGGTTTTAAACACAGATTTACA
ACACTGGTGAAAGCCGAAACCTTTAACAACAGGGTTGAACATGAGCTTGAACATAAGTTGGTCCGTACTCGATACCCAAATGATGAGCCGCGCGATTCAATTGGCGCGCAAAGGCTTTTACACCACGAGACCCAATCCCAGTGTGGGCTGCGTTATCGTTAAAGATAATCAGATTGTCGGCGAAGGTTATCATCAAAAAGCGGGTGAGCCCCACGCCGAAGTGCATGCGCTACGCATGGCTGGCGAACACGCTCGCGGCGCAACAGCCTATGTCACCTTAGAGCCCTGCAGCCATTATGGTCGCACGCCACCCTGTGCCTTGGCGCTGATCAATATTGGCGTTAAGCGTGTGGTGGTGGCGGTGGAAGATCCAAATCCGCAAGTGGCTGGTCGCGGTATTCAAATGCTGCGCGACGCCGGCATTGAAGTGGATGTGGGATTACACCGCGATGAAGCTTATGCCTTGAATTTAGGCTTTATGAAGCGGATGGAATCAGGTTTACCCAGAGTTACAGTCAAATTAGCTGCGAGTCTCGATGGTAAAACCGCGCTGTCTAACGGTGTGTCTAAATGGATTACCGGGCCTGAATCTCGCCGCGACGTGCAGCGCTTACGCTTACGTTCCTGCGCGTTGGTCACTGGGATTGAAACCATACTGGCTGACGATCCCTCACTCAATGTGCGATATCAAGAACTTGGCGGCATAAAAGACAGCGTAACCGAGGCGCAATTACTGCAGCCATTAAGAGTCATACTCGACAGTCGCGCCCGTTTGCCGTTATCGGCTGCCTGTTTGGCTATCGTATCGCCGATACTCTTGGTCTCGACCGTGGCTTATCCCGCAGAGTTTCAAGCACAATTGCTCTCCCATGTGAGTTGCCTCGTGTTACCTGCAATTGCAGGTCGCGTATCCTTACCTGAACTATTAAGCTATTTAGGCCAAAGTTGTAATCACGTGCTGGTGGAAGCGGGTGCAACCTTAGCGGGCGCTTTTTTAGCAGAGGGGCTGGCCGATGAATTAATGCTATACCAAGCGATGAAAATCCTTGGTGGGCAAGGGCGCAATCTGCTGCAGCTACCCGATTATAAAACCATGGCGCAAATTCCTGCATTAACTTTGGTGGATGAGCGCAAATTAGGGCCAGATACCCGTTTAATCTTGAGCGTTAAGCTTGATACCTCTTTAATTAATTAAGTGAGTTAACCATGTTTACTGGGATTATTGAAGCCGTTGGCACCCTGCGCAAACTGGAGCGTAAAGGCGATGATATTCGTCTTACCGTGGCGAGCGGCAAACTGGATTTAAACGATGTGCGTTTAGGCGACAGTATTGCCACCAACGGTGTGTGCTTAACGGTTGTGCAGCAATTAGCCGACGGTTATGTGGCTGATGTCTCTGCCGAAACAGTGAGTCTAACCGGTTTTGCTAACTACAAAGTGGGCACTAAAGTGAATCTTGAAAAAGCCGTCACCCCAACGACACGCTTAGGCGGGCACATGGTCAGCGGCCATGTAGACGGGATTGCTATGGTTGAGCAACGTCTAGTGCGTGGTCAAGCCATCGAGTTTTGGTTAGCCGCACCTGCAGAGCTTGGGCGTTACATTGCCCATAAAGGTTCCATCACCATTGATGGCGTGAGTTTGACGGTTAACGAAGTGGACGGCAGTCGATTCCGCCTGACCATAGTGCCGCATACCGCAGGGGAAACCACGTTAATCGACCTGCAAGCGGGCGATAAGGTGAATATCGAAGTGGATTTAATCGCCCGTTATCTTGAACGCTTGATGAATTATGATGGCAAAGACAGCAAGAGCGGTGGTGTCACCATGGAAATGTTAGCCCGTGCCGGCTTTGTGCGTTAGTGCACTGGTAATCGTACTGCTAAGGTATAGAATTTCATAACAACAGCAAAATCATAAGGTCCTACAATGGCGCTGCACAGTATAGAAGAGATCATCGAAGATATTCGTCAAGGCAAAATGGTTATTTTGATGGATGA
CGAAGACAGAGAAAACGAAGGTGATTTGATCATGGCGGCCGAAATGGTAACGCCAGAAGCGATTAACTTTATGGCGAAATATGGCCGTGGACTCATTTGCCAGACCATGACCAAAGCCCGTTGTCAGCAGTTAAATCTGCCCTTAATGGTGACGAATAACAACGCCCAGTTCTCGACTAACTTTACGGTTTCTATTGAAGCAGCCGAAGGCGTGACTACCGGTATTTCGGCCCACGACCGCGCGGTAACGGTAAAAGCGGCCGTGGCTAAAGAGGCTAAAGCGTCTGATTTAGTGCAACCTGGGCATATCTTCCCGTTAATGGCACAGGACGGCGGCGTATTAACCCGCGCAGGCCACACTGAAGCTGGTTGTGATTTAGCCCGTCTTGCGGGACTTGAGCCATCGGGCGTTATCGTTGAGATTTTGAACGAAGACGGCACTATGGCACGCCGCCCAGATTTAGAGATTTTCTCCGAGTTGCACGGTATCAAAATCGGCACTATCGCGGCATTGATCGAGTATCGCAACACTAAAGAAACCACGGTTGTGCGTGAAGCTAAATGCAAACTACCGACCCGTTTCGGCGAGTTCGATATGGTGACTTTCAGAGACACTATCGACAATCAACTGCATTTTGCCTTAGTCAAAGGTGAGGTGAAGAGCGATTGTTTAGTGCGCGTGCATCTGCAAAACACCTTCAACGATTTACTCCATTCAGAGCGCGATCAGCAACGCAGCTGGCCACTCGAAAAGGCGATGGAGCGTATTTCAGCAGAAGGTGGCGTATTGGTTTTACTAGGGAATCAAGAACATCCCTGTGAAATCCTCTCTAAGGTGAAAGCCTTTGAAGCCGAAGATCAAGGCCAAGCGCCTGCTTCTGCAAAATGGCAGGGGACGTCGCGCCGCGTGGGTGTGGGCTCGCAAATCCTCGCTAGCCTTGGCGTGACTAAGATGCGTCTGCTCAGCTCGCCTAAACGTTACCATTCACTTTCGGGCTTTGGCCTTGAAGTGACTGAGTATGTGGCGGACTAAAACCCAAAGTTACACTGAAGTAAAATTAACAATTTCTGATAATTTGCATAGGGCTCTGGGCAATTGGCTGTGATATCATGTCGCCACTTTTCGCCCCGAGCTGGGTGCTTTAGCTAAATTAGGTAAGATAATGAACGTAGTTCAAGGTAATATCGAAGCGAAAAATGCCAAAGTTGCGATTGTAATTTCGCGTTTCAACAGCTTTTTAGTTGAGAGCTTGCTTGAAGGTGCACTTGACACGCTGAAACGTTTTGGCCAAGTCAGCGACGACAATATCACTGTTGTCCGTGTACCAGGTGCGGTTGAGTTGCCGCTAGCTGCGCGTCGCGTAGCTGCCAGTGGTAAATTCGACGGTATTATCGCATTAGGTGCGGTGATCCGTGGTGGTACTCCTCATTTTGATTTTGTTGCAGGCGAATGTAATAAAGGTCTAGCTCAAGTTGCTTTAGAATTTGATCTGCCTGTTGCTTTCGGTGTTTTGACCACAGATACCATTGAACAAGCTATTGAACGTTCAGGTACCAAAGCGGGTAACAAAGGCGGCGAAGCTGCTTTAAGTCTGCTTGAAATGGTCAATGTTCTGCAAGAACTTGAACAACAGTTGTTATAG
Protein sequences of DBSCAN-SWA_2 >NC_017571|1430538:1438105|1435668_1436325_+|WP_006082641.1|DBSCAN-SWA MFTGIIEAVGTLRKLERKGDDIRLTVASGKLDLNDVRLGDSIATNGVCLTVVQQLADGYVADVSAETVSLTGFANYKVGTKVNLEKAVTPTTRLGGHMVSGHVDGIAMVEQRLVRGQAIEFWLAAPAELGRYIAHKGSITIDGVSLTVNEVDGSRFRLTIVPHTAGETTLIDLQAGDKVNIEVDLIARYLERLMNYDGKDSKSGGVTMEMLARAGFVR >NC_017571|1430538:1438105|1437625_1438105_+|WP_006082639.1|DBSCAN-SWA MNVVQGNIEAKNAKVAIVISRFNSFLVESLLEGALDTLKRFGQVSDDNITVVRVPGAVELPLAARRVAASGKFDGIIALGAVIRGGTPHFDFVAGECNKGLAQVALEFDLPVAFGVLTTDTIEQAIERSGTKAGNKGGEAALSLLEMVNVLQELEQQLL >NC_017571|1430538:1438105|1432395_1433649_+|WP_006082644.1|DBSCAN-SWA MLKKDMNIADYDPELFKAIQNETLRQEEHIELIASENYTSPRVMEAQGSQLTNKYAEGYPGKRYYGGCEYVDVVETLAIERAKELFGATYANVQPHSGSQANSAVYMALLKPGDTVLGMNLAHGGHLTHGSPVNFSGKLYNIIPYGIDESGKIDYDEMERLAVEHKPKMMIGGFSAYSGIVDWAKMREIADKIGAYLFVDMAHVAGLIAAGVYPNPVPHAHVVTSTTHKTLAGPRGGVILSAADDEDLYKKLNSAVFPGGQGGPLMHVIAGKAVAFKEALEPEFKVYQQQVVNNAKAMVEVFLERGYKIVSGGTSNHLMLVDLIGRDLTGKEADAALGSANITVNKNSVPNDPRSPFVTSGVRIGTPAITRRGFKEAESKELTGWICDILDDASNPAVIERVKGQVLALCARFPVYG >NC_017571|1430538:1438105|1430538_1432206_-|WP_006082645.1|DBSCAN-SWA MAQFVYSMLRVGKIVPPKKQILKDISLSFFPGAKIGVLGLNGSGKSTLLRIMAGIDTEIEGEARPMPGLKIGYLPQEPKLDPNQTVREAIEEAVAEAKNALTRLDAVYAAYAEPDADFDALAKEQGELEAIIQAQDAHNLEYVLERAANALRLPDWDEKIEVLSGGERRRVAICRLLLEKPEMLLLDEPTNHLDAESVAWLEHFLQEYAGTVVAITHDRYFLDNAAGWILELDRGEGIPWEGNYSSWLEQKDARLKQESATESARQKTIAKELEWVRQGAKGRQSKGKARMARFEELNTNDYQKRNETNELFIPPGPRLGDKVIEVNNLTKSYGDRVLIDNLSFSVPKGAIVGIIGANGAGKSTLFRMLSGAETPDSGTIELGDTVQLASVEQFRDSMNDKNTIWEEISGGQDIMRINNMEIPSRAYVGRFNFRGGDQQKVIGTLSGGERNRVHLAKLLQAGGNVLLLDEPTNDLDVETLRALEEAILEFPGCAMVISHDRWFLDRIATHILDYRDEGQVNFYEGNYTEYSAWLKNTYGADVVEPHRLKYKRMIK >NC_017571|1430538:1438105|1434502_1435657_+|WP_006082642.1|DBSCAN-SWA 
MSLNISWSVLDTQMMSRAIQLARKGFYTTRPNPSVGCVIVKDNQIVGEGYHQKAGEPHAEVHALRMAGEHARGATAYVTLEPCSHYGRTPPCALALINIGVKRVVVAVEDPNPQVAGRGIQMLRDAGIEVDVGLHRDEAYALNLGFMKRMESGLPRVTVKLAASLDGKTALSNGVSKWITGPESRRDVQRLRLRSCALVTGIETILADDPSLNVRYQELGGIKDSVTEAQLLQPLRVILDSRARLPLSAACLAIVSPILLVSTVAYPAEFQAQLLSHVSCLVLPAIAGRVSLPELLSYLGQSCNHVLVEAGATLAGAFLAEGLADELMLYQAMKILGGQGRNLLQLPDYKTMAQIPALTLVDERKLGPDTRLILSVKLDTSLIN >NC_017571|1430538:1438105|1433906_1434356_+|WP_006082643.1|DBSCAN-SWA MHCPFCSATDTKVIDSRLVAEGHQVRRRRECTECHERFTTFEGAELVMPRVIKRDGSRQPFDEEKLQGGMLRAVEKRPVSMDEIEQALSKIKSTLRATGEREVPSEMVGNLMMEQLMSLDKVAYIRFASVYRAFEDVSEFGEAIAKLQK >NC_017571|1430538:1438105|1436391_1437495_+|WP_006082640.1|DBSCAN-SWA MALHSIEEIIEDIRQGKMVILMDDEDRENEGDLIMAAEMVTPEAINFMAKYGRGLICQTMTKARCQQLNLPLMVTNNNAQFSTNFTVSIEAAEGVTTGISAHDRAVTVKAAVAKEAKASDLVQPGHIFPLMAQDGGVLTRAGHTEAGCDLARLAGLEPSGVIVEILNEDGTMARRPDLEIFSELHGIKIGTIAALIEYRNTKETTVVREAKCKLPTRFGEFDMVTFRDTIDNQLHFALVKGEVKSDCLVRVHLQNTFNDLLHSERDQQRSWPLEKAMERISAEGGVLVLLGNQEHPCEILSKVKAFEAEDQGQAPASAKWQGTSRRVGVGSQILASLGVTKMRLLSSPKRYHSLSGFGLEVTEYVAD |
7 | Staphylococcus_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
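The keyword rule described in the caption above can be illustrated with a minimal sketch. The keyword list is copied from the caption; the matching rule (case-insensitive substring search), the function name `matched_keywords`, and the example annotation strings are assumptions made for illustration and are not taken from DBSCAN-SWA itself.

```python
# Keywords quoted from the caption; everything else here is a hypothetical sketch.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation: str) -> list[str]:
    """Return the phage-related keywords found in a protein annotation.

    Assumption: a case-insensitive substring match. The actual tool may
    use a stricter or different rule.
    """
    lowered = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


if __name__ == "__main__":
    # Hypothetical annotation strings, not taken from this record.
    for text in ("phage tail fiber protein", "alanine--tRNA ligase", "recombinase A"):
        print(text, "->", matched_keywords(text))
```

Plain substring matching over-matches (for example, 'lysin' is also found inside 'lysine'), so a real filter would likely need word boundaries or a curated annotation vocabulary.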
DBSCAN-SWA_3 |
1472248 : 1483381
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_017571|1472248:1483381|DBSCAN-SWA CATGATCCGCATCTTAGTGAGCAACGATGACGGGGTTAATGCTCCTGGGATCAAAGCCTTAACTGAGGCGTTGACCGAAATTGCGACTGTGCTGACAGTGGGGCCAGATCGTAATTGTTCTGGTGCGAGCAATTCATTAACCTTGACTAATCCATTAAGAATTAATAGGTTAGATAACGGTTACATTTCTGTTCACGGTACACCGACTGATTGTGTGCATTTGGCTATCCGAGAATTGTACGATGGCGAACCCGATATGGTGGTGTCTGGTATCAATGCGGGCGCGAACATGGGCGATGATACCTTGTATTCGGGGACTGTCGCCGCCGCGATGGAAGGGCGCTTTTTAGGTTTTCCTGCAGTAGCGATTTCGCTCAATGGCCGAAAATTCGAACATTATCAATCGGCAGTGGTTTATGCTCGGCGGATAGTTCAAGGCTTGTTGGCCCAGCCTTTGGCAAAGGATCAGATTTTAAATGTGAATGTGCCAGACTTGCCGCTTGATCAAATTAAAGGCATTAAAGTGACTCGCCTTGGCGCTCGTCATAAGGCTGAAGGCATAGTACGAACGCAAGATCCCGCTGGACGTGAGATCTTCTGGCTCGGCCCACCAGGACAAGAGCAGGACGCCACAGAGGGCACTGATTTTCATGCTATCGCCAATGGCTATGTGTCAATTACGCCTTTGACGGTTGATTTTACTGCCTACGGACAACTAACGGCATTGCAAAATTGGGTAGATAAAATATGACTCGAGTTGCCTTAACATCGGCGGTGAACTTAGCTAAAAAGCTTCATGATGCGGGGATCCGCAATCAAGCCGTGTTAAAGGCGATATCGCATACGCCGCGGGAAATGTTTCTCGATAATGCGCTAGCCCATAAAGCCTATGAAAATACCGCTTTACCGATTGGCCAAGGGCAAACCATCTCGCAGCCTTATATCGTCGCCCGTATGACCGAACTCTTACTGCATAAGATGCCCCAGCGGGTATTAGAAGTGGGCACGGGATCGGGTTATCAGGCGGCAATTTTGGCGCAATTAGTGCCGCAGCTTTGTACTATCGAGCGAATAAAAGGCTTACAAATACAAGCGAGACAGCGCTTAAAACGATTAGATTTGCATAATGTGTCGTTTAAATATGGTGATGGCTGGTTAGGTTGGGCGAATCGTAGCCCGTTCGATGCCATTATGGTGACCGCTGCCGCTTCGACTATACCTGAGGCCTTACTGTCTCAATTAGCCGAGGGCGGAGTGTTAGTTCTGCCAGTCGGCGAAGATACCCAGCAATTGATGCGAATCACTCGTACTGCGGACCGCTTCAGTTCGGAAACAATTGAAACGGTTAAATTTGTACCTTTGATTAATGGAGAGTTAGCTTAACCTTGTGTTGTGCTACTGTCTCCACTGAACAGCTTGTCTTGCAACTAGGGCATTGTTTTCTCATCATAAACAGTAATACAGCCTCACCAAGGAGTTTTTATTGTTGAATGCGGGTTTAGTCTTAAACCTCTGCTTTATATTTCTGCTTGCAGGCTGTAGCTTTCAGGCCAGCCGACCCGCGCCCGTCGAAAATATTTCCCATAGCTACTCTAAAAACAACAAAGGCCACATTAAATCGAATTCATATAAAGTTAAGAAAGGCGACACGCTTTACTCTATTTCTTGGGGAGCAGGAAAGGATTTTTCTGAAATAGCTAAACTTAATCAGTTAGATAACCCTTATACAATTTACCCGGGACAGATTTTATATTTAACGAATGCTAAGTCGAGTAGTCAGTCCAAATCTACAACTTTAGATGGAAGGAAAACGAGCTCCTCGGGTAATAATTCTAGTAAAAATGATAACAAACAAACAGATAATCAAATTTCACAAACCGACGCTAAGTCAAGTAATGATCTGAAAAAAACACTTGATCAGAAAG
CTAAGCCCGCGTACCTTGTAACAAGTTCCCAACAAAATGTTAACTCGATCATGGTTCCACCGACTTCGACACTACCAACCAGTGTGAGTCAGTGGGAGTGGCCAATAAGAGGAAAGTTAATCGGGACTTACTCTGCCAATGAGCAGGGAAATAAAGGCATTAAGATCGCGGGAAATCGGGGAGATATCATCAAAGCTGCTGCAGATGGGCGGGTGGTATATGCAGGAAGTGCTCTTAGGGGTTACGGTAATTTAGTGATTATTAAACATAGTGACGATTACCTTAGTGCTTATGCTCATACGGACAAGATCTTAGTCAAAGAAAAGCAACATGTCCTCGCAGGGCAGACAGTTGCAAAAATGGGCAGTACAGGTGCCAATCGGGTAATGTTACATTTTGAAATTCGTTACCATGGACAGTCTGTAAACCCACTTACTTATTTACCCAAACAATGATTGCTTGGGGCCTTGTTGGCAAATGGAGGATAGGAAATTGCACTAATTCAGCAGTTTAATCAGGTTTGGGAGATTTAATCATGAGCCGCATAAAAAGCACTGCCGCAGAAGCACTAGTAGATTTATCCGTAGACACCGTTGACTTCGACCTTGAAAAAGAAGAAGTTGCATCCGATTTAGTTCAAGAGCTAGGAATAGAACAACAGGTTCAAGATGACCTGCAAAAAAATCTTGATGCCACTCAACTTTACCTTGGTGAGATTGGTTTTTCCCCACTGCTTAGCGCAGAAGAAGAAGTTTACTTTTCCCGTAAAGCCTTAAAAGGCTGTGCAAAATCCCGTAATCGCATGATCGAAAGTAATTTACGTCTGGTCGTAAAGATTGCCCGTCGTTACAACAATCGCGGCTTAGCACTGCTTGATCTTATCGAAGAAGGCAACTTAGGCTTAATTCGTGCCGTTGAAAAGTTTGATCCAGAAAGAGGTTTCCGTTTTTCAACCTATGCGACTTGGTGGATCAGACAGACCATTGAGCGCGCCATTATGAATCAAACTCGTACAATCCGTTTGCCTATTCATGTTGTTAAAGAACTCAACGTTTACTTGCGCACCGCTCGCGAATTAGCACAAAAACTCGATCACGAGCCAACGGCTGAAGAAATTGCCGAAAAACTACAAGTGTCGAGTGCTGATGTCAGCCGTATGCTTAAGCTGAATGAGAAAATCACCTCTGTCGATATGCCTTTGGGGGGCGATAATGACAAGGCGTTGCTGGATATTTTAGCCGATGACGATAACGTTGGCCCTGACTATAAAGTGCAAGATGAAGATATTTCTAATTCAGTGGTCAAATGGCTCAATGAGTTAAACACTAAACAAAGAGAAGTGTTAGCCCGCCGTTTTGGTCTGTTAGGTTATGAACCTTCTACACTCGAAGATGTGGGTGCTGAGATTGGTTTGACTCGTGAACGTGTACGTCAAATCCAAGTGGAAGCATTGAAACGTTTACGCGACTTATTAGGATCTCAAGGTTTATCTGTTGAAGCATTATTTCGAAACTAATAAATGAGAAACTAAGATCAGTTGTTCGTTGATCTCTATGCCTAGCTCCTAGCTAAGGTCTAGTGATTTAGTCTAGTGCTTAGGTCTAGTCGATAAAAGTAAAACGCCCACTAAAGAGTGGGCGTTGTTGTTTCTAATCCTTTATTGCTGATCATATCGCTTGGCGAATATCACCCGTTCACAGAGCTTTAGCTCAGACGTTTCAGCTCATAGAGCAAATCAAGTGCCTGCTTTGGCGTTAAGTTATCGGGATTAATGCTACTTAACTTCGTTAACGCTGGATTCTCAACTGGTTCAGGTAACGCGAGTAAGCTTTGAATAGGCGTCCTCGTTCCTTCAGCTTGATGATCGCGACTCTCCAATTGCTGCAACTTATGTTTAGCGGCTTTAATCACTTTGTTTGGTACGCCGGCAAGCGCCGCAACTTGTAAGCCGTAGCTTTTACTGGCGGCGCCTTCTTGCACTGCATGCATAAAGGCGATGG
TATCGTCGTGCTCAATGGCATCAAGGTGCACATTGTAAACACCAGCCATTAATTCAGGTAACTGAGTTAATTCAAAATAATGGGTGGCGAATAGGGTCATAGCGCCGACTTGCTGGGCTAAATACTCTGCTGCCGACCAAGCAAGTGATAAACCATCATAGGTGGATGTGCCGCGGCCGATTTCATCCATTAACACTAAGCTACTGGCGGTGGCGTTGTGTAGAATGTTGGCGGTTTCTGTCATTTCTACCATAAAGGTTGAACGGCCAGAGGCCAGATCGTCAGAGGCGCCAATACGGGTAAAGATACGATCGATAGGGCCTATCAGTGCGTGATCTGCTGGCACAAAACAGCCAATGTGGGCCATCAGCGTGATCAAAGCGACTTGACGCATATAGGTCGATTTACCGCCCATGTTAGGTCCGGTGACAATCAACATACGTCTTTGATTGTGCAAGGTGACTGGGTTAGCGATAAACGGTGTTTGACTCACACGTTCCACCACTGGGTGGCGACCAGCCTCTATCTGTACGCCGATATCTTGGCTCAGCTCTGGGCAAGTGTAGCCTAAGGTTTCGGCGCGCTCAGCAAAGTTACTTAACACATCAAGCTCAGCCGCCGCTCGAGCAAAAGCTTGTAATTCATGTAATTTTGGCAGAATAAGATCAAATAATTGTTCCCATAACTGCTTTTCGAGTGCCAGTGCTTTACCTTGGCTCGAGAGCACTTTTTCTTCGTACTCCTTAAGTTCGGGCGTGATATAACGCTCCATATTCTTAAGGGTTTGACGTCGTTGGTAATTGAGTGGCACTTGCGAGGATTGCAAGCGACTCACTTCGATGTAATAGCCGTGTACACGGTTATAGCCGACTTTAAGTGTGTTGATACCTGTACGTTCTTTTTCTCTGGCTTCGAGTTGTACTAAGTAATCGCTCGCACCTTCGCTAAGACCGCGCCATTCATCTAATTCGCTGTTATAGCCTTCACGGATCACTCCGCCATCGCGGATAAGCATGGGAGGATTATCGACTATCGCGCGCTCAAGTAAGGCTTGTTCAGCGGGGAACTCACCTAAGTGTTGCCGTAATTGAGTCGTGTGCGGCGCGCTCAGTGTGCTTAAACTCTGTTGTAATTCAGGTAGTAAGCCCAGCGCTTGACGTAAACGGGCAAAGTCTCTTGGGCGAGCGGTACGCAGCGCGAGTCTTGCCATGATACGTTCAATATCGCCAAGTGCTCTTAACTGTTCATGCAAACCTTCGTGGGCGGCGGTATCGAGTAGTTCAGTCACCGCCTGTTGGCGTGCTTTGATGTGTTTTGGATCTCGCAGTGGCTGATGGATCCAGCGTTGCAACATGCGGCTGCCCATAGGCGTCGCCGTGTTATCCAATACTGCGGCTAAGGTATTATCGCGGCCACCGGCAAGGTTTTGAGTGAGTTCTAGATTTCGGCGCGTTGCCGCATCGAGCACTATGCTATCGGTTTGATTAAAGCGAGTAATGGCATTGATATGGGGTAGGGCTGTGCGCTGTGTGTCTTTGACATATTGCATCAAGCAACCCGCAGCCTGCAGTGATAAACGTGCATCTGCGATGCCAAAACCATGCAAGTCTTTCGTGCCAAACTGAGCCAGTAGTAACTTGATGCTAGTGTCGTAATCAAATTCCCATTCAGGGCGACGACGTTTACCTTTAAAACCGTTTAGTAGGCCTAGTTCGCCAAAGTCTTCACTGTAGAGAATTTCAACGGGATTAGTGCGTTGCAATTCAGCTTCTAATGACTCTCGCGTGTCGAGCTCGGCAATCACAAAACGTCCGGATGAAACATCCAGAGTGGCATAACCAAAACCGATTTTACCTTGATAAACAGCTGCCAATAGATTGTCTTGGCGTTCCTGCAGCAGGGCTTCGTCGGTTAAGGTGCCAGGCGTGACGATACGCACCACTTTACGCTCAACAGGACCTTTAGAGGTTGCTGGGTCGCCAATTTGTTCACATATCGCA
ACCGATTGACCAATTTGAACTAATTTCGCAAGATAGCCTTCCACAGCATGGTAAGGCAGTCCCGCCATGGGGATAGGATCGCCACCACTTTTACCGCGCGCCGTGAGTGAAATACCTAACAGCTCAGAAGCACGTTTAGCATCGTCATAAAACAACTCGTAGAAGTCACCCATGCGATAAAACAGCAACATGTCGTGATGTTCTGCTTTCATGGTCAAGTACTGACGCATCATAGGAGTATGTTTTTCTAAATCATCGGTATCAATTACATTCATTAAATCGTTATCGTCCGTTGTGACTGCCGTGGTTATGTGGGATTAAGCGCTATGGAAGCAGAAGACTGCTGTACAATTTTTAGCTAAAAAGACCACGCAAAATTTGCTGCGCCAATCTTAGCAATAATTTACCGAAAATTGAGCCCTTGATGACGGATATTCGTCAAAGGGATGCAATATCCGATTTGATATTCGCTAAAACCTTAGCCATATTAAAAAATCCCATTAGCTAAAAATAATTCAGCCATAGGTCTTGATACTGTATGATTGTACAGTATACTAGTCAGCAGAATTAAGACAAAAAGCGCGGCTTTTTGAATTACCCGTTTGTAAATTGAATATTACTTTTGACAGCTTAAGGCTTGTCTTGATGAGGGAACTGGAATGAAGGTCGATCCAAATAAAGAGAAAGCACTTGCTGCGGTATTGATCCAAATTGAGAAACAATTTGGTAAAGGCTCCATCATGAAGCTGGGCGAAGATCGCTCTATGGATGTTGAGACCATTTCTACCGGTTCTCTGTCTCTGGACGTTGCTTTAGGTGCTGGCGGTTTGCCAATGGGTCGTATCGTTGAGATCTATGGTCCTGAATCATCAGGTAAAACGACACTGACTTTAGAAGTGATTGCCGCAGCGCAACGTGAAGGTAAAACCTGTGCCTTTATCGATGCAGAGCATGCACTCGACCCTATCTATGCTAAAAAATTGGGCGTAGATATTGATAACCTTCTGTGTTCACAACCAGATACCGGCGAGCAAGCGCTTGAGATTTGTGATGCATTAACTCGCTCAGGCGCTGTTGACGTTATCGTCGTCGACTCAGTGGCGGCATTAACGCCTAAAGCTGAAATCGAAGGCGAAATTGGTGATTCTCACATGGGCCTAGCGGCACGTATGATGAGCCAAGCTATGCGTAAACTTGCTGGTAACTTAAAGCAATCTAACACCTTACTTATCTTCATCAACCAAATCCGGATGAAGATTGGTGTGATGTTCGGTAACCCAGAAACGACAACCGGTGGTAACGCGCTGAAGTTCTATGCTTCTGTTCGTTTAGACATTCGTCGTACTGGTGCCATTAAAGACGGCGATGAAGTTGTCGGTAACGAAACTCGCGTTAAAGTGGTGAAAAACAAGGTTGCAGCACCGTTCAAGCAAGCCGAATTCCAAATCCTTTACGGCCAAGGTATTAACCGTACTGGTGAGTTAGTTGACTTAGGCGTAGCCCATAAGTTGATTGAAAAAGCGGGTGCTTGGTACAGTTATAAAGGTGATAAAATCGGTCAAGGTCGTGCTAATGCGGGTAAATATTTGACTGAAAACCCAGCCATTGCTACTGAAATTGACAAGACATTACGTGAGTTACTTTTGAGCAATCCTAGTGCACTGGCTGCTAAAGATGATGCTAGCACGGAAGATAATGTTGATTTAGAAACAGGCGAAGTATTCTGAGTCAGTCAGCCCGTCAAATTGCGGTCGCGCTGTTAGCGCGCCGCGATTATTCCCGCTTACAAATCCGCGATAAATTGCTCGAAAAAGGCTTCGAACTTAACGATATCGAACCTGTGCTCGATGCATGTGAATCCTCCGGTTTTATCAATGACAATCGTTATGCCGAATTGTTGGTCCGTAGCCACATTAGTCGAGGCCACGGCGCGATTCGTATTCGCCAAGCGATAGCGCAAAAAGGCCTGTCTAAAGATTGCATTGAAGCGGCGATAACCA
ATAGTGATTGTGACTGGTTTGAGCTAGCAAAAGATAAAGCCACTAAAAAATATGGAATTCCCCGCGTAACAGAAGTGAAGGGCTCTAAAGCCCGGGATCTCATCGCGAAGGAAAAGGCCAAACGAGTGCGGTTTTTAATGGGCCAAGGCTTTAGCTACGATCAAATCACTTATGCGTTAGATTCAGATCCCAATGATCTCGACGACGATTATTAATCGCCTTAATTCATCGATTGCGATGTAACCTTAAGACTGCAAAAGCTTCTTTAACAAGTCCTCAGAAAATCTCCTCTAAAATGTCATCAAATGGGCTTTGTTGCTTTCGTGCCCGACACTTTACCCACTTGCTCAAATACTGTATTTTTTGCACCTAACCCGCAGAGTCTGCCATTGTCCTTTATTATTGCTGTTTTGCCTCTAGCTACTCTGGTATGGGCTATTCACACTAAAATATTAAATTCCTGATGTGTATATCACCAATCAAGGCTGGTCGTCCTTACTCAGTCTGCTTATAATGCTGGGCAAATAGAAATACGGCTAGCCTAAGGCGTAGCTGGTAATGCTTACCCAATTCAGGATGATTTCATGTATCAAACTACAGCAGAGCTTAGAAGCGCTTTCCTCGAATTTTTCCGCAGCAATGGTCATCAAGTTGTGGACAGTAGTTCATTAGTGCCAGGCAACGATCCCACACTTTTATTTACCAATGCAGGCATGAACCAGTTCAAGGACGTGTTTCTTGGCATGGACAAACGCAATTATACGCGTGCGACCACTGCTCAACGTTGTGTACGCGCCGGTGGTAAGCATAACGATTTAGATAACGTCGGTTATACAGCACGTCATCACACCTTTTTCGAAATGCTGGGTAACTTCAGTTTTGGTGATTATTTTAAAGAAGAAGCGATTCGTTTTGGCTGGACTTTCTTAACTGAAACCTTAAAGCTGCCGAAAGAACGTCTGTGCGTCACGATTTATCAAACAGATGATGAAGCCTTTGAAATCTGGAATAAAAAGATTGGTGTTGCTGCCGAGAACATTATCCGTATCGGCGACAACAAAGGTGCCGCTTATGCGTCAGATAACTTCTGGCAAATGGGTGATACCGGTCCTTGCGGCCCATGTTCTGAGATTTTCTATGACCATGGCGACCATATTTGGGGCGGCCGTCCTGGCAGTCCTGAAGAAGATGGCGACCGTTTCATCGAAATCTGGAACATAGTGTTCATGCAGTACAACCGTCAAGCATCGGGTGAAATGTTGCCTTTGCCTAAGCCATCGGTTGATACCGGCATGGGGATTGAGCGTATTGCCGCGATTATGCAAGGCGTACATTCTAACTACGAAATCGATATCTTCCGTACGCTGATCGCTAAAGCGGCAGAAATCATCGGCGTAAGCGATTTAACCGAGAAGTCACTGCGCGTCATTGCCGACCATATCCGTTCATGTGCATTCCTTATTGCCGATGGTGTTATGCCGTCGAACGAAGGCCGCGGTTATGTGCTGCGCCGCATTATTCGTCGCGCCGTTCGCCATGGTAACAAGTTGGGCGCGACCGAAGCTTTCTTCTATAAGCTGGTTCCGTCTTTGATTGCGGTGATGGGCGATGCGGCCAAAGGGTTAGCTGAGACTCAAGCGATTGTTGAAAAAGCACTGAAAGCGGAAGAAGAGCAATTTGCTCGTACGCTTGAACGTGGTTTAGGCATTTTAGATGCGGCCTTAAACGAGTTAAAAGGTGACACCTTAGATGGTGAAACCGTATTTAAGCTGTATGACACCTATGGCTTCCCAATGGATTTAACCGCTGACGTGTGTCGCGAGCGCAACATCATTGTCGATGAAGCTGGTTTCGAAGTCGCCATGGCTGAACAACGTAGCCGCGCACAAGCTGCCGGTAACTTTGGTGCAGATTATAACGCGGCACTGAAAATTGATGCTGAAACCGCATTCAGTGGTTACACCGAATTAGCGGGTCAAGCTAAGGTCACCGCGATTTAT
CAAAACGGTGAATCAGTGACGGCTATCAAAGCGGGTGATGAAGCCGTTGTCGTGCTCGACGTGACACCTTTCTACGCCGAATCAGGCGGTCAAGTGGGCGATAAAGGCCAACTGGTTGCGAGCGGTGTTGAATTTACCGTTAACGACACGCAAAAGTACGGTCAAGCAACGGGCCATCAAGGTGTTTTAGCTACAGGTAACTTGAGTGTTGGCCAAGTTGTTGAAGCAAAAGTCGACAAAAAACTGCGTCACCGTACTCAGTTAAACCATTCTGTTACGCATTTACTGCATGCCGCACTGCGTCAAGTGCTCGGTACTCATGTTTCGCAAAAAGGTTCTTTGGTTGATCCTGAGCGTTTACGTTTTGACTTCTCCCATTTCGAAGGCGTAAAAGCCGCGGAATTAAAAGAAGTTGAAGAATTAGTTAACACTCAAATTCGTCGTAACCATGAGCTGAAAACCGCTGAAATGGGCATAGATGAAGCTAAAGAGAAAGGCGCTATGGCACTCTTTGGTGAGAAATATGATTCACAAGTGCGTGTTGTGACTATGGGCGATTTCTCAATCGAATTGTGTGGTGGTACCCATGTGGGCCGTACGGGCGATATCGGCTTATTCAAAATCACCTCTGAAGCGGGTATTGCTGCGGGTGTACGTCGTATCGAAGCCGTTACTGGCGCTGCGGCTATGGCCTATGTGGCGCAGCAACAAGCTGAACTGGAAGAAGCTGCAGCCTTATTGAAAGGCGATGCCAACTCAGTCGTTGCTAAGTTAAAAGCCCAGCTCGACAAAATGAAGCAACTCGAAAAAGAAATGGCGCAGTTGAAAGACAAGCTAGCTGCCGCAGCGAGTGCTGATTTAGTGGGTGATGCTATTGTGGTTAATGGCGTTAACGTGCTGATCAAAAAGTTAGACGGCGTTGAGGCAAGTTCATTACGCGGTTTGCAAGATGAATTAAAACAAAAATTGAAATCGGCCATTATCGTTCTTGGCACTGCGCAGGAAGGGAAAGTTAACCTGATCGCCGGTGTGAGTAACGATCTTATCGGCAAAGTCAAAGCGGGTGAGCTAGTCGCTATGGTCGCTGCACAAGTGGGCGGTAAAGGCGGCGGTCGTCCTGATATGGCTCAAGCGGGCGGTAGCCAGCCAGAAAATCTGGATGCGGCATTGGCTCAAGTATTACCTTGGATTACTGAGCGTCTAGCTTAA
Protein sequences of DBSCAN-SWA_3 >NC_017571|1472248:1483381|1473730_1474627_+|WP_006082609.1|DBSCAN-SWA MLNAGLVLNLCFIFLLAGCSFQASRPAPVENISHSYSKNNKGHIKSNSYKVKKGDTLYSISWGAGKDFSEIAKLNQLDNPYTIYPGQILYLTNAKSSSQSKSTTLDGRKTSSSGNNSSKNDNKQTDNQISQTDAKSSNDLKKTLDQKAKPAYLVTSSQQNVNSIMVPPTSTLPTSVSQWEWPIRGKLIGTYSANEQGNKGIKIAGNRGDIIKAAADGRVVYAGSALRGYGNLVIIKHSDDYLSAYAHTDKILVKEKQHVLAGQTVAKMGSTGANRVMLHFEIRYHGQSVNPLTYLPKQ >NC_017571|1472248:1483381|1480756_1483381_+|WP_006082604.1|tRNA|DBSCAN-SWA MYQTTAELRSAFLEFFRSNGHQVVDSSSLVPGNDPTLLFTNAGMNQFKDVFLGMDKRNYTRATTAQRCVRAGGKHNDLDNVGYTARHHTFFEMLGNFSFGDYFKEEAIRFGWTFLTETLKLPKERLCVTIYQTDDEAFEIWNKKIGVAAENIIRIGDNKGAAYASDNFWQMGDTGPCGPCSEIFYDHGDHIWGGRPGSPEEDGDRFIEIWNIVFMQYNRQASGEMLPLPKPSVDTGMGIERIAAIMQGVHSNYEIDIFRTLIAKAAEIIGVSDLTEKSLRVIADHIRSCAFLIADGVMPSNEGRGYVLRRIIRRAVRHGNKLGATEAFFYKLVPSLIAVMGDAAKGLAETQAIVEKALKAEEEQFARTLERGLGILDAALNELKGDTLDGETVFKLYDTYGFPMDLTADVCRERNIIVDEAGFEVAMAEQRSRAQAAGNFGADYNAALKIDAETAFSGYTELAGQAKVTAIYQNGESVTAIKAGDEAVVVLDVTPFYAESGGQVGDKGQLVASGVEFTVNDTQKYGQATGHQGVLATGNLSVGQVVEAKVDKKLRHRTQLNHSVTHLLHAALRQVLGTHVSQKGSLVDPERLRFDFSHFEGVKAAELKEVEELVNTQIRRNHELKTAEMGIDEAKEKGAMALFGEKYDSQVRVVTMGDFSIELCGGTHVGRTGDIGLFKITSEAGIAAGVRRIEAVTGAAAMAYVAQQQAELEEAAALLKGDANSVVAKLKAQLDKMKQLEKEMAQLKDKLAAAASADLVGDAIVVNGVNVLIKKLDGVEASSLRGLQDELKQKLKSAIIVLGTAQEGKVNLIAGVSNDLIGKVKAGELVAMVAAQVGGKGGGRPDMAQAGGSQPENLDAALAQVLPWITERLA >NC_017571|1472248:1483381|1474707_1475688_+|WP_006082608.1|DBSCAN-SWA MSRIKSTAAEALVDLSVDTVDFDLEKEEVASDLVQELGIEQQVQDDLQKNLDATQLYLGEIGFSPLLSAEEEVYFSRKALKGCAKSRNRMIESNLRLVVKIARRYNNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMNQTRTIRLPIHVVKELNVYLRTARELAQKLDHEPTAEEIAEKLQVSSADVSRMLKLNEKITSVDMPLGGDNDKALLDILADDDNVGPDYKVQDEDISNSVVKWLNELNTKQREVLARRFGLLGYEPSTLEDVGAEIGLTRERVRQIQVEALKRLRDLLGSQGLSVEALFRN >NC_017571|1472248:1483381|1475876_1478447_-|WP_006082607.1|DBSCAN-SWA 
MNVIDTDDLEKHTPMMRQYLTMKAEHHDMLLFYRMGDFYELFYDDAKRASELLGISLTARGKSGGDPIPMAGLPYHAVEGYLAKLVQIGQSVAICEQIGDPATSKGPVERKVVRIVTPGTLTDEALLQERQDNLLAAVYQGKIGFGYATLDVSSGRFVIAELDTRESLEAELQRTNPVEILYSEDFGELGLLNGFKGKRRRPEWEFDYDTSIKLLLAQFGTKDLHGFGIADARLSLQAAGCLMQYVKDTQRTALPHINAITRFNQTDSIVLDAATRRNLELTQNLAGGRDNTLAAVLDNTATPMGSRMLQRWIHQPLRDPKHIKARQQAVTELLDTAAHEGLHEQLRALGDIERIMARLALRTARPRDFARLRQALGLLPELQQSLSTLSAPHTTQLRQHLGEFPAEQALLERAIVDNPPMLIRDGGVIREGYNSELDEWRGLSEGASDYLVQLEAREKERTGINTLKVGYNRVHGYYIEVSRLQSSQVPLNYQRRQTLKNMERYITPELKEYEEKVLSSQGKALALEKQLWEQLFDLILPKLHELQAFARAAAELDVLSNFAERAETLGYTCPELSQDIGVQIEAGRHPVVERVSQTPFIANPVTLHNQRRMLIVTGPNMGGKSTYMRQVALITLMAHIGCFVPADHALIGPIDRIFTRIGASDDLASGRSTFMVEMTETANILHNATASSLVLMDEIGRGTSTYDGLSLAWSAAEYLAQQVGAMTLFATHYFELTQLPELMAGVYNVHLDAIEHDDTIAFMHAVQEGAASKSYGLQVAALAGVPNKVIKAAKHKLQQLESRDHQAEGTRTPIQSLLALPEPVENPALTKLSSINPDNLTPKQALDLLYELKRLS >NC_017571|1472248:1483381|1479973_1480387_+|WP_006082605.1|DBSCAN-SWA MLEKGFELNDIEPVLDACESSGFINDNRYAELLVRSHISRGHGAIRIRQAIAQKGLSKDCIEAAITNSDCDWFELAKDKATKKYGIPRVTEVKGSKARDLIAKEKAKRVRFLMGQGFSYDQITYALDSDPNDLDDDY >NC_017571|1472248:1483381|1472248_1472998_+|WP_006082611.1|DBSCAN-SWA MIRILVSNDDGVNAPGIKALTEALTEIATVLTVGPDRNCSGASNSLTLTNPLRINRLDNGYISVHGTPTDCVHLAIRELYDGEPDMVVSGINAGANMGDDTLYSGTVAAAMEGRFLGFPAVAISLNGRKFEHYQSAVVYARRIVQGLLAQPLAKDQILNVNVPDLPLDQIKGIKVTRLGARHKAEGIVRTQDPAGREIFWLGPPGQEQDATEGTDFHAIANGYVSITPLTVDFTAYGQLTALQNWVDKI >NC_017571|1472248:1483381|1478831_1479899_+|WP_006082606.1|DBSCAN-SWA MKVDPNKEKALAAVLIQIEKQFGKGSIMKLGEDRSMDVETISTGSLSLDVALGAGGLPMGRIVEIYGPESSGKTTLTLEVIAAAQREGKTCAFIDAEHALDPIYAKKLGVDIDNLLCSQPDTGEQALEICDALTRSGAVDVIVVDSVAALTPKAEIEGEIGDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRTGAIKDGDEVVGNETRVKVVKNKVAAPFKQAEFQILYGQGINRTGELVDLGVAHKLIEKAGAWYSYKGDKIGQGRANAGKYLTENPAIATEIDKTLRELLLSNPSALAAKDDASTEDNVDLETGEVF >NC_017571|1472248:1483381|1472994_1473630_+|WP_006082610.1|DBSCAN-SWA 
MTRVALTSAVNLAKKLHDAGIRNQAVLKAISHTPREMFLDNALAHKAYENTALPIGQGQTISQPYIVARMTELLLHKMPQRVLEVGTGSGYQAAILAQLVPQLCTIERIKGLQIQARQRLKRLDLHNVSFKYGDGWLGWANRSPFDAIMVTAAASTIPEALLSQLAEGGVLVLPVGEDTQQLMRITRTADRFSSETIETVKFVPLINGELA |
8 | uncultured_Mediterranean_phage(33.33%) | tRNA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
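The protein FASTA headers in the prophage records above appear to share a pipe-delimited layout: contig, prophage region (`start:end`), gene coordinates and strand (`start_end_strand`), protein accession, an optional keyword such as `tRNA`, and the tool name. The sketch below parses that layout. The field names and the `ProteinHeader` class are hypothetical labels inferred from the headers shown on this page, not a documented format.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ProteinHeader:
    # Field names are assumptions inferred from the headers on this page.
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    keyword: Optional[str]
    tool: str


def parse_header(header: str) -> ProteinHeader:
    """Parse a header such as
    '>NC_017571|1472248:1483381|1480756_1483381_+|WP_006082604.1|tRNA|DBSCAN-SWA'.
    """
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")
    contig, region, gene, accession = fields[:4]
    # A sixth field, when present, sits between the accession and the tool name.
    keyword = fields[4] if len(fields) == 6 else None
    tool = fields[-1]
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProteinHeader(contig, region_start, region_end,
                         int(gene_start), int(gene_end), strand,
                         accession, keyword, tool)


if __name__ == "__main__":
    h = parse_header(
        ">NC_017571|1472248:1483381|1480756_1483381_+|WP_006082604.1|tRNA|DBSCAN-SWA"
    )
    print(h.accession, h.strand, h.keyword, h.gene_end - h.gene_start)
```

Only the gene field is split on underscores, so contig identifiers that themselves contain an underscore (such as `NC_017571`) are left intact.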
DBSCAN-SWA_4 |
2351831 : 2362132
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_017571|2351831:2362132|DBSCAN-SWA TATGAAGCTTCCTATCTATTTAGATTATGCCGCCACCACGCCTGTTGATCCAAGAGTCGCGGAAAAGATGTTCCAATGCATGACAATGGACGGCATCTTTGGTAATCCAGCCTCACGTTCACATCGCTACGGCTGGCAAGCTGAAGAAGCGGTGGATATTGCTCGCAATCAAATTGCCGAGTTAATTAACGCCGATCACCGCGAAATTGTGTTTACCTCTGGCGCGACGGAATCGAACAACTTAGCGATTAAGGGTGTTGCTCATTTCTATCACAAGAAAGGCAAGCACATCATCACCAGTAAGACTGAACATAAAGCCGTTCTTGATACCTGTCGCCAATTAGAGCGCGAAGGTTTCGAAGTGACTTATTTAGAGCCTGCGGCTAACGGTATCATTCCGATGGAACGTTTAGAAGCGGCAATGCGTGACGACACTATCCTTGTCAGCATCATGCACGTGAACAACGAAATTGGTGTGATCCACGATATCGATGCAATCGGTGAGCTATGCCGTTCAAAGGGCATTATTTTCCACATGGATGCAGCGCAAAGTGCAGGCAAAGTGCCTATCGATGTGCAAGCCACTAAAGTGGATTTGATCTCGATTTCTGGTCACAAAATGTATGGCCCTAAAGGCATCGGCGCGCTATATGTTCGTCGTAAGCCACGCATTCGCTTAGAAGCGCAAATGCACGGTGGTGGTCATGAGCGTGGTATGCGTAGCGGTACGTTACCAACGCATCAAATCGTAGGTTTAGGTGAAGCTGCTGCAATCGCAAAAGCGGAAATGGCCTCTGATGATGCTCGTATCGGTGCATTACGTGACAAATTATGGAATGGCATCAAGCACATTGAAGAAACCTATATCAATGGTGATGCGATTGAGCGCGTTAGCGGTAGCCTCAACGTTAGCTTCAACTATGTTGAAGGCGAGTCGTTAATGATGGCACTGAAAGATTTAGCGGTTTCATCGGGTTCGGCTTGTACCTCAGCCAGCCTAGAGCCTAGCTACGTATTACGTGCGCTTGGCTTAAACGATGAAATGGCACATAGCTCGATTCGTTTCTCTATCGGCCGTTTCACAACCGAAGAAGAAATTGATCACGCTATCGAAGTAATTACTCAATCTATTGATAAATTAAGAGAAATGTCTCCTTTGTGGGAAATGTTTAAAGATGGAATCGACCTGAACCAAGTTCAATGGGCACATCATTAATTCTGACCTTTACAATAGATAATTTTGGAGCAGTACCATGGCTTACAGTGAAAAAGTGATAGATCATTATGAGAACCCACGTAACGTTGGTTCTTTTGATAAAAACGATCCTTCGGTCGTGACCGGTATGGTAGGCGCGCCTGCTTGCGGTGACGTGATGAAACTTCAGTTGAAAATTGGTGCTGATGGCATCATTCAAGACGCTAAGTTCAAAACTTACGGTTGTGGTAGCGCGATTGCGTCTAGCTCACTGGTAACTGAGTGGGTTAAAGGCAAGACTATCGAACAAGCTGCGGCGATTAAGAACACAGATATCGCTGAAGAATTGGCATTGCCACCGGTGAAGATCCACTGTTCAATTTTGGCTGAAGATGCCATTAAAGCTGCTATCGACGAGTACAAATCGAAACAAGCTAAGTAACATCTGGAGTTAAGATGGCGATTACAATGACCCCAGCGGCAGCCGATCGTGTCAGATCTTTCTTAGTCAATCGAGGCAAAGGTGTAGGCCTGCGTCTCGGCTTAAGAACATCAGGCTGTTCTGGTATGGCTTATGTACTTGAGTTCGTTGATTCTTTAAATGATGACGATGAAGTGTTTGATATCGAAGATGTGAAAATCATCATCGATGCCAAGAGCCTCATCTATCTTCAAGGGATCGAGCTGGATTTTGTTAAAGAAGGGCTGAACGAAGGCTTCCAATTTAACA
ATCCTAACGCAAAAGGTGAGTGTGGTTGCGGTGAGAGTTTCACTGTTTAACCGCTAAAGTTGACGCATGAATTATTTCGAGCTGTTTAAATTTTCCCCTGCCTTCGATATTGATACCGCCTTACTTGCAGAACGCTATCGCGAACTGCAACGGGCGGTTCATCCCGATAAATTTGCCAATGATACTGAGCAGCAAAAATTGCTGTCGGTGCAGCGCACGGCGCAAGTCAATGATGGTTTTCAAACCTTAAAAGATCCCATTCGCCGCGCCGAGCATATGTTGTCGCTGCGCGGCATTGAATTAAGCCATGAAACAACCACAGTGAAAGATACTGGCTTCTTGATGCAGCAAATGGAATGGCGCGAAGCGTTGGAATACATACGCGATAGTGCCGATCCTCAAGCGAGTATTGACGAGTTATATCAATCGTTTGCGCAGTACCGCGCGCAACTCACTCAGCAATTGACTCAACTGTTAACCAGTGAGCAGGCTGAAGATGCATTGTTAGCGGCGGATCAAGTTCGCAAACTCAAATTTATGGCAAAATTACACGACGAGTTAACCAGAGTCGAAGACGCTCTGTTAGATTGATTTTCCGTGTTTATGTTACTTTGCTGACTCAAGCACGCTTGAGTTGGCATTACTAAGTTGGATATATATGGCCCTTTTGCAGATAGCTGAGCCCGGTCAAAGTGCCGCGCCGCACCAACATAGACTTGCCGTTGGCATCGATTTAGGTACGACCAATTCTTTGGTGGCGGCTGTTCGAAGTGGAGAGACGGCAACCCTGCCGGACGAACTTGGACAGCATTCATTACCTTCTATCGTGCGTTATACCCAAGATTCAGTCGAAGTTGGCGCCCTTGCGGCGTTGAGTTCGGCGCAGGATCCGCAAAACACGATCGTTTCGGTTAAGCGTTTTATGGGCCGCAGCCTGGCTGATATCAAAGCCGGCGAGCAATTATTCCCCTACGAGTTTGCCGAAAGCGAAAACGGTTTACCTTTATTCGTGACTCCCCAAGGCCAAGTGAATCCCGTGCAAGTGTCTGCGGAGATTTTACGTCCGCTGATTGCCCGTGCTGAAAAGACCTTAGGCGGCGAGCTGCAAGGTGTCGTAATAACCGTACCGGCTTATTTTGATGATGCCCAGCGCCAAGGCACGAAAGATGCCGCAGCTTTGCTGGGTGTTAAAGTGCTACGTCTGTTGAATGAACCGACGGCTGCTGCGATTGCCTACGGCTTAGACTCTAAGCAAGAGGGCGTAATTGCCATCTATGACTTAGGCGGCGGTACCTTCGATATTTCTATTTTGCGTTTGAATCGCGGTGTATTCGAAGTGTTAGCCACCGGTGGTGATTCAGCGCTTGGTGGCGATGATTTCGATCATTTACTGCAAGCACATATGCAACAAGTGTGGCAGCTTAGCGACATCGACTCACAATTAAGCCGTCAACTGTTGATTGAATCGCGTCGAGTCAAAGAAGCCTTAACGGATGCAGCAGAAACTGAAGCAAAAGTGATCCTTGCTGATGGGACTGAACTCACGCAAATCGTTACTAAAGCTGAATTTGATGCCATGATTGCGGCGTTGGTTAAGAAGACCATTGCTAGCTGTCGCCGTACCCTGCGTGATGCGGGCGTGACGACAGATGAAGTGCTTGAAACTGTGATGGTTGGCGGTTCAACGCGCGTGCCATTAGTGCGTGAACAGGTTGAAGCTTTCTTCGGCAAACCACCACTGACGTCTATCGATCCCGACCGTGTTGTCGCCATTGGTGCGGCCATTCAAGCCGACATTTTAGTGGGTAATAAACCTGAATCTGATTTGCTGCTCCTCGATGTGATCCCTTTGTCATTAGGCATAGAGACCATGGGCGGTTTAGTGGAAAAAGTGGTGTCGCGTAATACGACGATTCCGGTTGCGCGAGCACAGGAATTTACCACTTTTAAAGATGGTCAAACGGCCATGGCATTCCATGTGGTGCAGGGCGAGCGGGAAC
TCGTTGCCGATTGCCGCTCACTGGCGCGTTTTACTCTGAAAGGTATTCCGCCGTTAGCCGCAGGCGCTGCCCACATTCGTGTGACTTTCCAAGTGGATGCCGATGGTTTACTCAGCGTGACAGCGATGGAGAAATCCACCGGCGTGCAATCTAGCATTCAAGTTAAGCCGTCTTTTGGTTTATCGGATACTGAAATTGCCACTATGCTGAAAGATTCGATGAAGTATGCCAAAGACGATATCGGTCGTCGTATGCTGGCCGAGCAGCAAGTGGAGGCGGCGAGGGTACTCGAGTCTTTACATGCCGCGTTAGCGAAAGATGGCGACTTGCTGAATGCCGATGAGCGTGGACAAATCGATGCCACTATGGCCAATGTGGCGCAAGTTGCCGCGGGCGATGATGCCGATGCGATTAAGCTCGCGATTGAAAAACTGGATGAGCAAACCCAAGATTTTGCTGCCAGACGTATGGACAATTCTATTCGAGTGGCATTCAAAGGCCAGTCGATCGACAACATATAGGTGATGTAATGCCCCAATTAGTCTTTCTTCCCCATGCCGAGTTGTGCCCAGATGGTGCAGTGCTTGAGGCGAATGTTGGTGAGACCATTCTTGATGTTGCGCTGCGTAATGGCATCAATATCGAGCATGCATGTGAAAAGTCCTGTGCTTGCACGACGTGTCACTGTATCGTGCGTGAAGGCTTTAATGATCTTGAGCCAAGTGATGAGCTAGAAGATGACATGTTAGACAAAGCTTGGGGACTTGAGCCCGAAAGCCGTCTATCTTGTCAGGCGAAAGTGGTCGACACTGACATGGTGATTGAGATCCCTAAATACACGGTTAACATGGTCAGCGAAGGCAATTAATGCTGCCGCGATAATTGTTAAAACCAGTCCACGTGACTGGTTTTTTGCTTTTGGCGTTGAGCATTTGTGCAACTAATCTTGTCTATGCCTCTAGTTGTTTGATTTATCCCTGCGCAGGCGTATTATGCGCTCAAAATTAATGGTTGCCGGAGACCAGAATGAGAATAGCGATCTTATCGCAAGGGCCTGAGTTATATTCAACGAAGCGACTCGTTGAAGCGGCTCAATTACGTGGTCATGAAGTACACGTAATCAATCCCCTTGAATGTTATATGAATATCAACATGCGGCAATCGAGTATCCACATTGGTGGGCGAGAGTTACCCGCATTTGATGCTGTGATCCCCCGCATCGGGGCGTCGATTACCTTTTATGGCTCTGCGGTATTGCGTCAGTTTGAGATGATGGGCGTATATGCTTTAAATGATTCTGTTGGGATCTCGCGCTCACGTGACAAATTGCGTTCTATGCAGCTGATGTCACGCCGCGGCATTGGCTTACCCATTACTGGATTTGCCAATAAACCGAGTGATATCCCCGACTTGATTGATATGGTGGGTGGCGCGCCTCTGGTGATTAAATTACTCGAAGGCACCCAAGGTATCGGCGTGGTATTAGCCGAGACCCGTAAAGCGGCTGAGAGCGTGATTGAAGCCTTTATGGGCCTTAAAGCCAACATCATGGTGCAGGAATATATTAAAGAAGCCAATGGCGCAGACATTCGCTGTTTCGTGCTCGGCGATAAAGTGATTGCCGCCATGAAGCGTCAAGCTATGCCCGGTGAGTTCCGCTCTAACTTGCACCGTGGTGGCACGGCAAGTTTAGTCAAATTAACCCCAGAGGAACGCTCAGTCGCCATTCGTGCGGCTAAGACCATGGGGCTCAATGTCGCTGGCGTTGACTTGCTACGCTCAAACCACGGCCCTGTGATCATGGAAGTGAACTCTTCACCCGGTCTTGAAGGAATTGAAGGCGCCACCACGAAAGATGTGGCGGGTGCTATTATTGATTTTGTTGAAAAAAATGCGATTAAAGTGAAAAAGGTGACGCAAGCACAGGGTTAGTGTCCTTGCTGTCGAGTATTGACTACGGAATATCAACACTTGATTCACCTCCTTATGGGTAACGCCGT
TTAGCAAGAACTTACCCATTAAAGCGAACCGAATAGGGGAAAACGATGACATTAACCTGGATAGATTCGTTAGATATTGCGTTGGAATTACTCGAAGCACACCCAGAGGTGAATCCTACTCAGCTGCATTTCACCCAATTGTATGAGTGGGTGTTAGCCTTGGATGATTTTGCCGATGATCCCAAGCACTGTAATGAAAAAATACTGGAAGCGATTCAGCAGTGTTGGATTGATGAAAAGTAGTGAGCACAGCGGTGATACGGCTTTTTATGTGAGGCATCTTGTCCTTTTGACGTTTAACTAAGGTGATTTACCTCACGTATTTGCAGGGGCGCAGAGAATATTTGGCGAGAGTTTGATCACAGTCAAGTTCATTTTTAGCGGATATCGTACCTTGTTGTATATTGGTTAAGGCATTTAAATGGCTTAACTTTGTTATCGCAACTGCAAGGATGTTCACATGAAACTTTTGCTTGTATTGTTATGTATGCTTGTCCCTTCAATGGCTTGGGCTCATCCAGGGCATGGCGGTGTCGGTTTATTCCATCACTTGTTAGATCTTGCTCCCGCAGTCATTTTAGTCGCTTTGATTGCTTGGGCAGGTATTTGGGCAAAAAATAGAAAATAATCCCATTTTTTAATGAGTTAATGGACCCTGCTGAGGATTTATCCTCCTAGCGGGGTCTTTTGCGTGATCAAGCTGATTACGAATAAAGCGAATGTGCGCAATATCTCAGACTAATTTGCTATAAAGATAGGATTTAGCTTTTAAATTTCGTATAATCCGCGCGAATTTTTTTAAACTTGCTGACAAAGGAAGCTAGTTATGGCGATCGAACGCACATTTTCTATTATCAAACCTGATGCTGTTGCAAAAAACCACATCGGTGCTATCTACAATCGTTTTGAAACTGCTGGTCTTAAGATCGTTGCATCAAAAATGTTACACCTGACTAAAGAACAAGCTGAAGGTTTCTATGCTGAGCATAGCGAGCGCGGCTTCTTCGGTGCTCTGGTTGCATTCATGACTTCTGGTCCTATCATGGTCCAAGTATTAGAAGGCGAAAACGCTGTTTTAGCTCACCGTGAAATTTTAGGTGCAACTAACCCAGCTCAAGCGGCTCCTGGTACTATCCGTGCTGATTTCGCTGAAAGCATTGACGAAAACGCGGCTCACGGCTCTGATGCTGTTGAATCTGCTGCTCGTGAAATTGCTTACTTCTTCAGCGCTGAAGAACTGTGTCCACGCACTCGTTAATTCGATGCTAAGACTGAAAAAGGAGCCCTAGGCTCCTTTTTTATTGGCTGCAATTTAGCGAATACTGCTTGCCGTGCGCAATGAATCTAATGCAGCGATACCAATTACATCTCGCTTCACCGTTTTAAAGTTGAATTAGCGTTTGTTTTTATTGGTCTTTTAAAGTTCTGCACATTTTTTTAATTTTCTTGCTTGAAATTCTCTCTCCCGTCCTTATATCTATTTCAGGATCGCTCGATGAGGGTCCTAGTGTGAATTCTGTGCCGTAAGGACAGAAGATTTAATTCGGCAGACTCTATAAACGTTTTCATCCTGAAAACAGTCCGCCAATTCTCCATATTGCTTAAGACAAGGATATGAATATGCGTACTTATGATTTAACACCACTTTACCGCAGTGCCATTGGTTTTGACCGTTTAGCGCAATTGGCAGAACATGCCGCCGCCAATAATGGCAACTCAGGTTATCCTCCATATAACATCGAATTACTCGGCGAAAATCGTTATCGCATAACGATGGCCGTAGCCGGATTCTCAATGGATGAACTTGAGATCAGCAGTGAAGGTGAAAAATTACTGGTGAAAGGCAATAAAGCCGAAAGCCAAACCGAGCGTAAATATCTGTATCAAGGCATAGCTGAACGCGGTTTTGAGCGTACTTTCCAACTGGCAGACTATGTCACTGTGCTCGGTGCAAGTTTAGAAAATGGCTTACTCAATGTTGATTTAGTGCGTGAAATTCC
TGAAGCATTGAAACCACGCAAAATTGAGATCACCTCATCGCGCTTGTTAGACAGCCAGTCATAATTGATGCTGTGATAATCGCTTAAAAAACAAGGGGAGCATTATGCTCCCCTTGATGTTTTTGTGCATATGCGAAAATGTCCGCGAGATTTCCGTGAGAAAGTTACGCTTTCATGGCCTTTTCACCACGGGCCAAACCCACCACGCCGGAGCGTGAGACTTCGACCAGTTTGGTGACTTCCCCCAGCGCATGAATAAAGGCATCGAGTTTGTCTGACGTGCCCACCATTTGGATGGTGTATAAATTGGCGGTGACATCGACAATTTGACCGCGGAAAATATCCGCCATACGTTTCACTTCTTCCCTAAATTCACCTTGGGCTCGTACCTTAACAAGGGCGAGTTCGCGTTCAATATAAGCAGACTCGGTAATATTCGATACTTTTAACACATCGATTAACTTATGTAGCTGCTTCTCTATCTGCTCTAACACCATTTCATCGGCAACGACTGTGATGTTAAGCCGCGACAGTGTGATATCGTCCGTTGGCGCCACCGTTAAACTTTCAATGTTATAACCGCGCTGTGAAAACAAACCGACCACACGAGACAGGGCGCCGGATTGGTTTTCTAATAATACAGATATAATTCGACGCATTAGCATTTCTCCGTCTTGGTCAGCCACATTTCATTCATCGCACCACCGCGGATCAACATGGGGTACACGTGCTCAGTTTCATCCACCATGATATCGATAAAGACCAGTCTGTCTTTCATCGCCAGCGCTTCAGCCATTTTTGACTCAAGTTCGGCGGGATCGCTGATCGTCATGCCAACATGACCATAGGCCTCGGCAATTTTGGCGAAGTTAGGCACTGAATCCATATAGGAATGTGAGTGACGGCCAGAGTAAATCATGTCCTGCCATTGTTTTACCATGCCAAGGAAACGATTGTTTAAGTTGATGATTTTAACCGGCGTATCATATTGCAGCGCGGTCGAAAGCTCTTGAATGTTCATCTGGATGGAACCATCACCCGTGACACACACAACAGTAGCATCAGGCATGGCCATTTTTACGCCCATCGCGGCGGGCAGGCCGAATCCCATTGTCCCTAGGCCACCTGAGTTGATCCAGCGGCGCGGCTTATCGAACGGATAGTACAAGGCGGCAAACATTTGGTGCTGGCCAACGTCAGAGGCGACATAGGCATCGCCATTGGTCAGCTTATACAAGGTTTCAATCACTTGCTGTGGTTTGATGCGGCCACTGTTTTTGTCATAGGCCAGACAATCGCGACTGCGCCATTGTTCAATGTCATTCCACCAAGTTGCAATGGCATCGTTGTCGTTATGTTCGTTCGATTCATCTAACAGTGCCAACATGCTGTCTAAAATACTGTCGGCCGAACCCACGATAGGAATATCCACGTGGATAGTTTTCGAAATAGAAGAGGGATCGATGTCGATGTGCAAAATGGTCGCGTTCGGACAGTATTTTTCCACATTATTGGTGGTTCTATCGTCGAAGCGTACACCAATACCGAAGATCAAATCGCAGTTATGCATTGTCATGTTGGCTTCGTAGCGACCGTGCATCCCAAGCATGCCTAAACTGTTTTTGTGAGTGCTTGGAAAAGCGCCAAGCCCCATCAGCGTGCTGACGACGGGAATATTTAATCTCTCAGCTAATTGCAGAATTTGCTTATCACAACCCGAAATAATTGCGCCGCCACCGACATAAAGTACCGGTTTTTTTGCGGCTAACAGGGCTTGAAGACCACGGCGGATCTGACCTTTATGGCCTGATGTCGTTGGATTGTACGAGCGCATTTTGACGCTTTCTGGGTAGCAATATTCGTGCAGCAGGGCGGGGTTTAAACAATCTTTTGGCAGATCGACAACCACAGGACCGGGACGGCCAGTAGAGGCAATGTAAAAAGCCTTTTTAATAATTTCAGGAATTTCAGTCGGATCTTTCACTAAAAAGCTGTGTTTC
ACTACAGGGCGAGAGATACCTATCATGTCGCATTCTTGGAAGGCATCGTTACCGATAAGATTGCTCGGGACTTGACCCGATAACACCACAAGGGGGATAGAATCCATGTAAGCGGTGGCAATACCGGTAATCGCGTTGGTCGCGCCTGGGCCTGAGGTGACTAATACCACGCCCACTTTACCCGTGGCACGGGCGTAGCCATCGGCCATGTGTACGGCGGCTTGTTCGTGGCGAACGAGTATGTGTTCAATACCGGGGATAACGTGCAGAGCGTCGTAGATATCTAAAACTGAACCGCCAGGATAGCCAAAAATGTGCTTTACGCCTTCATCGATCAAAGAACGCACTATCATGCTGGCGCCGGATAACATCTCCAT
Protein sequences of DBSCAN-SWA_4 >NC_017571|2351831:2362132|2353821_2354346_+|WP_006081854.1|DBSCAN-SWA MNYFELFKFSPAFDIDTALLAERYRELQRAVHPDKFANDTEQQKLLSVQRTAQVNDGFQTLKDPIRRAEHMLSLRGIELSHETTTVKDTGFLMQQMEWREALEYIRDSADPQASIDELYQSFAQYRAQLTQQLTQLLTSEQAEDALLAADQVRKLKFMAKLHDELTRVEDALLD >NC_017571|2351831:2362132|2356781_2357687_+|WP_006081851.1|DBSCAN-SWA MRIAILSQGPELYSTKRLVEAAQLRGHEVHVINPLECYMNINMRQSSIHIGGRELPAFDAVIPRIGASITFYGSAVLRQFEMMGVYALNDSVGISRSRDKLRSMQLMSRRGIGLPITGFANKPSDIPDLIDMVGGAPLVIKLLEGTQGIGVVLAETRKAAESVIEAFMGLKANIMVQEYIKEANGADIRCFVLGDKVIAAMKRQAMPGEFRSNLHRGGTASLVKLTPEERSVAIRAAKTMGLNVAGVDLLRSNHGPVIMEVNSSPGLEGIEGATTKDVAGAIIDFVEKNAIKVKKVTQAQG >NC_017571|2351831:2362132|2359375_2359819_+|WP_006081847.1|DBSCAN-SWA MRTYDLTPLYRSAIGFDRLAQLAEHAAANNGNSGYPPYNIELLGENRYRITMAVAGFSMDELEISSEGEKLLVKGNKAESQTERKYLYQGIAERGFERTFQLADYVTVLGASLENGLLNVDLVREIPEALKPRKIEITSSRLLDSQS >NC_017571|2351831:2362132|2358581_2359013_+|WP_006081848.1|DBSCAN-SWA MAIERTFSIIKPDAVAKNHIGAIYNRFETAGLKIVASKMLHLTKEQAEGFYAEHSERGFFGALVAFMTSGPIMVQVLEGENAVLAHREILGATNPAQAAPGTIRADFAESIDENAAHGSDAVESAAREIAYFFSAEELCPRTR >NC_017571|2351831:2362132|2359919_2360414_-|WP_006081846.1|DBSCAN-SWA MRRIISVLLENQSGALSRVVGLFSQRGYNIESLTVAPTDDITLSRLNITVVADEMVLEQIEKQLHKLIDVLKVSNITESAYIERELALVKVRAQGEFREEVKRMADIFRGQIVDVTANLYTIQMVGTSDKLDAFIHALGEVTKLVEVSRSGVVGLARGEKAMKA >NC_017571|2351831:2362132|2357800_2357998_+|WP_006081850.1|DBSCAN-SWA MTLTWIDSLDIALELLEAHPEVNPTQLHFTQLYEWVLALDDFADDPKHCNEKILEAIQQCWIDEK >NC_017571|2351831:2362132|2358215_2358383_+|WP_006081849.1|DBSCAN-SWA MKLLLVLLCMLVPSMAWAHPGHGGVGLFHHLLDLAPAVILVALIAWAGIWAKNRK >NC_017571|2351831:2362132|2360413_2362132_-|WP_006081845.1|DBSCAN-SWA 
MEMLSGASMIVRSLIDEGVKHIFGYPGGSVLDIYDALHVIPGIEHILVRHEQAAVHMADGYARATGKVGVVLVTSGPGATNAITGIATAYMDSIPLVVLSGQVPSNLIGNDAFQECDMIGISRPVVKHSFLVKDPTEIPEIIKKAFYIASTGRPGPVVVDLPKDCLNPALLHEYCYPESVKMRSYNPTTSGHKGQIRRGLQALLAAKKPVLYVGGGAIISGCDKQILQLAERLNIPVVSTLMGLGAFPSTHKNSLGMLGMHGRYEANMTMHNCDLIFGIGVRFDDRTTNNVEKYCPNATILHIDIDPSSISKTIHVDIPIVGSADSILDSMLALLDESNEHNDNDAIATWWNDIEQWRSRDCLAYDKNSGRIKPQQVIETLYKLTNGDAYVASDVGQHQMFAALYYPFDKPRRWINSGGLGTMGFGLPAAMGVKMAMPDATVVCVTGDGSIQMNIQELSTALQYDTPVKIINLNNRFLGMVKQWQDMIYSGRHSHSYMDSVPNFAKIAEAYGHVGMTISDPAELESKMAEALAMKDRLVFIDIMVDETEHVYPMLIRGGAMNEMWLTKTEKC >NC_017571|2351831:2362132|2356284_2356623_+|WP_006081852.1|DBSCAN-SWA MPQLVFLPHAELCPDGAVLEANVGETILDVALRNGINIEHACEKSCACTTCHCIVREGFNDLEPSDELEDDMLDKAWGLEPESRLSCQAKVVDTDMVIEIPKYTVNMVSEGN >NC_017571|2351831:2362132|2353481_2353805_+|WP_006081855.1|DBSCAN-SWA MAITMTPAAADRVRSFLVNRGKGVGLRLGLRTSGCSGMAYVLEFVDSLNDDDEVFDIEDVKIIIDAKSLIYLQGIELDFVKEGLNEGFQFNNPNAKGECGCGESFTV >NC_017571|2351831:2362132|2354413_2356276_+|WP_006081853.1|DBSCAN-SWA MALLQIAEPGQSAAPHQHRLAVGIDLGTTNSLVAAVRSGETATLPDELGQHSLPSIVRYTQDSVEVGALAALSSAQDPQNTIVSVKRFMGRSLADIKAGEQLFPYEFAESENGLPLFVTPQGQVNPVQVSAEILRPLIARAEKTLGGELQGVVITVPAYFDDAQRQGTKDAAALLGVKVLRLLNEPTAAAIAYGLDSKQEGVIAIYDLGGGTFDISILRLNRGVFEVLATGGDSALGGDDFDHLLQAHMQQVWQLSDIDSQLSRQLLIESRRVKEALTDAAETEAKVILADGTELTQIVTKAEFDAMIAALVKKTIASCRRTLRDAGVTTDEVLETVMVGGSTRVPLVREQVEAFFGKPPLTSIDPDRVVAIGAAIQADILVGNKPESDLLLLDVIPLSLGIETMGGLVEKVVSRNTTIPVARAQEFTTFKDGQTAMAFHVVQGERELVADCRSLARFTLKGIPPLAAGAAHIRVTFQVDADGLLSVTAMEKSTGVQSSIQVKPSFGLSDTEIATMLKDSMKYAKDDIGRRMLAEQQVEAARVLESLHAALAKDGDLLNADERGQIDATMANVAQVAAGDDADAIKLAIEKLDEQTQDFAARRMDNSIRVAFKGQSIDNI >NC_017571|2351831:2362132|2353083_2353467_+|WP_006081856.1|DBSCAN-SWA MAYSEKVIDHYENPRNVGSFDKNDPSVVTGMVGAPACGDVMKLQLKIGADGIIQDAKFKTYGCGSAIASSSLVTEWVKGKTIEQAAAIKNTDIAEELALPPVKIHCSILAEDAIKAAIDEYKSKQAK >NC_017571|2351831:2362132|2351831_2353046_+|WP_006081857.1|DBSCAN-SWA 
MKLPIYLDYAATTPVDPRVAEKMFQCMTMDGIFGNPASRSHRYGWQAEEAVDIARNQIAELINADHREIVFTSGATESNNLAIKGVAHFYHKKGKHIITSKTEHKAVLDTCRQLEREGFEVTYLEPAANGIIPMERLEAAMRDDTILVSIMHVNNEIGVIHDIDAIGELCRSKGIIFHMDAAQSAGKVPIDVQATKVDLISISGHKMYGPKGIGALYVRRKPRIRLEAQMHGGGHERGMRSGTLPTHQIVGLGEAAAIAKAEMASDDARIGALRDKLWNGIKHIEETYINGDAIERVSGSLNVSFNYVEGESLMMALKDLAVSSGSACTSASLEPSYVLRALGLNDEMAHSSIRFSIGRFTTEEEIDHAIEVITQSIDKLREMSPLWEMFKDGIDLNQVQWAHH |
13 | Faustovirus(12.5%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_5 |
2606330 : 2617856
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_017571|2606330:2617856|DBSCAN-SWA TTTGTCTCAAGGAAATAGTGTACGCACGCTCACTGGCCTGCAACGTCTGCTGGAAGGCGGACTCATCATTTGTTGTGTCCTCGCCACTTATATTTTACTCGCATTGACGAGTTTTAGTCCGTCTGATCCTGGTTGGAGTCAGTCGCATTTTCAAGGCGATATTAAAAACTGGACCGGAGCAGTAGGGGCTTGGATTGCCGATATTCTGTTATATTTCTTCGGCGTCACGGCTTACATCATGCCAATCATTGTCGCGTCGACCGGCTGGCTGCTGTTCAAACGCGCCCATCATTTGCTCGAAGTTGATTACTTTTCTGTGGCCCTGAGATTGATTGGATTTTTGCTGATCATCCTTGGGTTTTCTGCGTTGGCGAGTATGAACGCCAACAATATCTACGAGTTTTCGGCAGGCGGGGTGGCGGGCGATGTGATCGGCCAAGCCATGCTGCCGTATTTCAACAAACTCGGCACCACCTTATTGCTGCTGTGCTTCTTAGGTTCTGGCTTTACCCTGTTGACGGGGATCAGCTGGCTGACCGTGGTCGAGAAAATCGGCTTAATCTCTATTTGGATTTATAAGAAACTTAAATTATTACCGCAAGCGATGAAACGTGAGCGTGAAACCGAAGATACCCGTGGTTTTATGTCTGTGGTCGATAAGTTTAAACAGCGCCGCGAATCACAATACGTGCTTGAGAAGCCGCCAGTCGTTGCCACGCCAAAGGTGCGTGAACGTCATATTGGCCGCAGAGCCGAAATAACCCCAACCCTGTCTACTGCAGCAGATGATGGGTTTATTACCGAAAGCATTAATACTGAAGAAGTTGCGCCGCAAAAAACCAAGCTATCGGCCCTTGCAAAAATTTTGAGTTTGAATGGCAGTAAAAGCAAAAATGCACAAAGGGTTGAACCTCAAATCGATCAAGAGGATTTTGCCGCCCATGGCAATTTTGAAGCGCCACCTTGGTTAGCTGAACCTCAACATGCAAGAAATGATGAGCAAGAAAGAGCCGATTTCAATTCTCATTCATTAGATCACGACGACAATGAGCCAGTGTTTAACAGCCAAACCTTGGCGGAAGATGACGATGAAAGCTTAGGCTTTACCGATGATGACGTCATTGATTTTGATACAAAAGCCTCAACGGGCGCCGTGAATCAAGCCCAGCGTAAGAAGCAAGATCAAAAAGCGAAGATTGTGGATGGCATTGTGGTGTTACCGGGACAAGAAGATAAACCCGCGCCAAAAAAACCAATGGACCCGTTACCCAGTATCAATTTGCTTGATGTGCCAGATCGTAAGAAAAACCCGATCAGCCCAGAAGAGTTAGATCAAGTCGCTCGTCTAGTTGAAGTGAAGTTAGCCGATTTCAACATCATCGCCAATGTGGTGGGGGTGTATCCAGGTCCTGTGATCACGCGCTTCGAATTAGAATTAGCGCCTGGCGTTAAAGCGTCAAAAATCTCTAATCTGTCGAAGGATTTAGCCCGTTCACTGCTGGCTGAAAGTGTGCGCGTGGTTGAAGTTATCCCGGGCAAGGCTTATGTCGGACTCGAACTGCCGAATAAATTCCGTGAAACAGTGTTTATGCGCGATGTGTTAGATTGCGCCGCCTTTACCGACAGTAAATCCAACTTAACTATGGTGTTAGGACAAGATATTGCCGGCGACCCAGTCGTGGTTGATCTTGGAAAAATGCCGCATTTATTAGTTGCGGGAACCACAGGTTCTGGTAAGTCTGTGGGCGTAAATGTGATGATCACCAGCTTACTGTATAAGTCTGGCCCAGAAGATGTGCGCTTTATCATGATCGATCCCAAAATGTTGGAACTCTCTGTCTATGAAGGTATTCCGCATTTGCTCTGTGAAGTGGTTACCGATATGAAAGAAGCCGCCAATGCACTGCGTTGGTGTGTGGGCGAGA
TGGAACGCCGCTACAAGCTGATGTCAATGATGGGCGTGCGTAACATCAAGGGCTATAACGCCAAAATTGCTGAGGCGAAAGCCAATGGCGAAGTGATTTTAGATCCTATGTGGAAGTCATCCGACAGCATGGAGCCAGAAGCGCCTGCGTTAGATAAGCTGCCATCGATTGTTGTCGTGGTCGATGAATTTGCTGACATGATGATGATTGTCGGTAAGAAAGTTGAAGAGTTGATTGCCCGTATCGCGCAAAAGGCCCGCGCTGCAGGCATACATTTGATTCTGGCAACCCAAAGACCGTCAGTGGATGTGATTACTGGCTTGATTAAAGCTAACATTCCGACACGTATGGCGTTCCAAGTGTCATCGCGTATCGATTCGCGCACCATTTTAGATCAACAGGGTGCTGAAACCTTACTCGGTATGGGGGATATGTTGTTCTTGCCACCGGGTACTGCGGTTCCAAATCGTGTCCATGGCGCCTTTGTTGACGATCATGAAGTTCACCGCGTTGTTGCCGATTGGTGCGCCCGTGGTAAACCGCAATATATCGATGAAATTCTCAATGGTGCCAGCGATGGTGAGCAAGTCTTACTACCGGGTGAAACTGCTGAAACCGATGAAGAATATGATCCACTCTACGATGAAGCCGTCGCCTTTGTGACTGAAACCCGTCGCGGCTCGATTTCAAGCGTACAGCGTAAATTTAAGATTGGTTATAACCGTGCAGCGCGCATTATCGAACAGATGGAAGCCCAAGGTATCGTTTCGGCTCAAGGCCACAACGGTAACCGTGAGGTCTTAGCGCCGCCGGCACCAAAACATTATTAAGCCGCGCTGAGGCGTGGCTTTGATACACCCAAGGGTTTGACAGGGTAAGGAATAAAATGAAAAAACGATTATGCGCTGTGTTGTTAGCTTCACCTTTGCTATTTAGCGCGGCTGTGTTTGCCGATGATGTGCAGCAGTTAAGAGACAAGTTAATTGGTACTGCTTCACTAAAAGCCGATTTCAAACAAACCGTGACTGACGTGAATAAAAAGGTCATTCAGACGGGTTCTGGTATCTTTGCATTGGCCTATCCAAATCAGTTTTATTGGCATTTAACTCAGCCAGATGAATCGCAAATTGTCGCCGATGGTAAAGATTTATGGATCTACAATCCCTTCGCCGAAGAAGTGGTCATCATGGATTTTGCCGAGGCAATCAATGCTTCGCCTATTGCTTTGTTAGTCCACCGCGACGATGCCACTTGGTCACAGTATTCCGTGACCAAGCAACAAGACTGCTATGAGATCAAACCTAAAGCGATTGATTCGGGCATATTGTCCGTCAAGGTGTGTTTCAAAAATGCCCAGTTAGCCAACTTTAATGTTGCCGATGACAAAGGTAACTTGAGCCAATTTGATTTGAGCAATCAGCAAGCGATTACTGATAAAGACAAAGCGCTGTTCAGCTTTGTGCTGCCTGACAATGTCGATGTTGATGATCAACGTCGTAAAACAGCGCACTAGGCGATAGCGTGAGCAGTTTATCCTTTAATTTCGCCCCCGACTTTCGTCCCTTGGCCGCACGTATGCGGCCAAGAACGATCGCCGAGTACATAGGTCAAGCCCATTTACTGGGTGAAGGCCAGCCGCTACGCAAAGCATTGGAAGCGGGACGCGCCCATTCCATGATGTTGTGGGGGCCGCCGGGCACAGGTAAAACGACCTTAGCCGAATTAATCGCACATTATTCAAATGCGCACGTTGAACGCATCTCTGCGGTCACCTCTGGCGTCAAAGATATTCGCGCGGCGATTGAGCAAGCGAAAGCCGTGGCTGAATCCCGTGGTCAACGCACGTTATTGTTTGTCGATGAAGTCCACCGATTCAATAAAAGCCAGCAGGATGCCTTTCTGCCATTTATTGAAGATGGCACTGTGATTTTTATCGGTGCGACCACTGAAAACCCATCCTTTGAAATCAACAATGCCTTGCTCTCGCGGGCACGGGTTTA
TCTTATCAAGCGCTTAAGCCATGATGAGATTGCCCATATAGTGACTCAAGCCTTAAGCGATACCGAGCGCGGCTTAGGCCAACGCCAATTTGTGATGCCAACCGATGTGCTCACCACACTGGCGCAACTTTGTGATGGTGATGCCCGTAAAGCTTTAAATCTCATCGAGTTGATGAGCGATATGCTCGCCGATGGCGGCACCTTTACCACTGAAATGTTGATCCAAGTGGCGGGGCACCAAGTTGCCGGATTCGATAAGAACGGCGATCAGTTTTACGATTTAATCTCAGCCGTCCATAAATCAATCCGCGGCTCAGCACCCGATGCGGCGCTGTACTGGTTTTGTCGAATATTAGAAGGCGGCGGCGATCCGCTTTATGTCGCAAGGCGCTTACTGGCGATTGCCTCTGAAGATGTCGGCAATGCCGATCCTGCGGCGATGACCATAGCGCTTAATGCTTGGGATTGTTTTCACCGTGTTGGCCCAGCCGAAGGTGAGCGAGCAATAGCACAAGCCATTGTTTATCTTGCCAGCGCGCCTAAGAGTAACGCTGTCTATACCGCATTTAAGGCCGCGCGTGAGTTAGCTCGCGATACTGGGCAAGTCGAAGTGCCGCACCATTTACGCAATGCACCGACTCAGTTAATGAAAGACATTGGCATTGGAGCAGGGTATCGATATGCCCACGATGAAGCCAATGCCTATGCCAGTGGTGAAAATTATTTCCCCGAATCCCTGCAAACAGCGCAGTTTTATTTTCCGACTGAGCGAGGGTTCGAGAAGCGAATCAAAGATAAGTTGGCGCAATTAGCCCAGTTAGATCAAGCCAGCGAGCATAAAAGATATGAATAATCTCTTACTTGTGGCGCTAGGTGGTTCAATTGGGGCTGTTTTTCGCTATCTTATTTCAATATTCATGATCCAAGTATTTGGCAGCAGTTTTCCTTTTGGTACACTGTTGGTTAATGTCCTCGGTTCATTTTTAATGGGCGTAATTTACGCACTGGGGCAAATGAGTCATATCAGCCCAGAACTCAAAGCGCTGATCGGTATTGGCCTGTTGGGCGCTTTGACAACGTTTTCGACTTTCTCTAATGAAACCTTATTGCTGCTGCAAGAAGGGGATTGGCTGAAGGCGACTTTGAATGTGGTTTTGAATCTAAGTCTATGTTTATTCATGGTGTACTTAGGTCAGCAACTGGTTTTTTCTCGCATTTAACTATTAAGAATATATCACATGTTAGATCCTAAATTTTTGCGCAACGAATTAGAAGTTACCGCTGAGCGACTGGCCACCCGTGGCTTTATTTTAGACATAGCTCACCTCACTCAATTAGAAGAAAAGCGTAAGTCACTGCAAGTGACTACCGAAGAATTACAAGCGTCGCGTAATGCCATCTCCAAATCCATTGGCCAAGCCAAAGCCCGTGGTGAAGATGTTGAAGCGATCATGGCGCAAGTGGGCGATTTAGGTTCACAGCTGGATGCCAAAAAGATTGAACTGGCGGCCGTGCTTGAAGAAGTCAATACCATTGCCATGTCAATGCCAAACCTGCCAGACGAATCCGCGCCAATCGGTGCCGATGAAACGGAGAACGTTGAAGTTCGTCGCTGGGGCACGCCACGCACGTTTGATTTCCCAATTAAAGATCATATCGATTTAGGCGAAGGCCTAAACGGTTTAGATTTTAAAAATGCAGTGAAAATCACTGGCTCACGTTTTATCGTTATGAAAGGCCAAGTTGCCCGCTTAAACCGCGCCATTGGTCAGTTCATGCTGGACTTGCACACCACAGAGCACGGTTATACAGAAGCCTATGTGCCGTTACTCGTTAACGAAGCAAGCTTACTGGGCACAGGTCAACTGCCTAAGTTTGGTGAAGACTTGTTCCACACTAAACCTGCCACCGAAGAAGGCCAAGGCTTAAGCTTGATCCCAACGGCCGAAGTGCCATTAACCAACTTAGTGCGCGATAGCATTGTTGATGAAGACGAGCTG
CCGATTAAATTAACCGCGCATACCGCATGTTTCCGCAGCGAAGCGGGCTCATACGGCAAAGATACTCGCGGTCTGATCCGTCAGCATCAGTTCGATAAAGTTGAAATGGTGCAAATCGTTAAGCCGGAAGATTCAATGGCGGCGCTCGAAGCGCTAACAGGTCATGCTGAAACGGTATTGCAACGTTTAGGTCTGCCATACCGTACTGTTATTTTATGTACTGGTGATATGGGCTTTGGCTCAAGCAAAACCTACGACATCGAAGTCTGGTTACCGGCACAAAACACTTATCGTGAAATTTCTTCTTGCTCAAACATGAAGGATTTCCAAGCCCGTCGTATGCAAGCACGTTACCGCGTGAAAGCTGATAATAAGCCCGCGTTACTGCATACCTTAAACGGTTCCGGTTTAGCTGTGGGTCGTACCTTAGTGGCTATTTTAGAAAACTATCAAAATGCCGATGGCAGTATCACAATCCCAGAAGTGCTGCGTCCCTACATGGGCGGCTTGACTAAAATAGGCTAATGGAACGCAAAATAATACTATGGATAAAGGCCAGCGCTATCTCCGTTGGCTTTCCTATACCGCTGTCATTGCAGTATTTTTTGCGGTGATGTTGGCGACCATAGGCAAAGCCGTCTGGGTTGTGTTAGAAAACGTCAAATAACCCTATTAACTTTGTTCAAGTAAAAAGAAAGGCATGTTTCCCAAGGGAACATGCCTTTTTGTTTGTGCCATATTGATGTTTAATGCTCACATTTGACGCTTACGTACGATATTGAGTTAAGTGTGATGACTTAACCCTATACGATTAAATGCATTTAGCTGGCTTTGGCAGGCCTGCAATTTTGGTCGCTTGTTTTGCTGGGCCAATAGGGAACAAGGTATATAAGTACTTGGAATTGCCTTTGTCTGGCCCTAAAGCTTGGCCAATAGCTTTGACTAATACGCGAATAGCAGGGCTCGTTTTGTATTCAAGATAAAAATCGCGAACAAAGTTAATCACTTCCCAATGGGCTGGCGTAAGCACGATTTGCTCTTCTGCCGCCAATAGTAAAGCCATGTCCGGTTGCCAATCGTTGAGATCTTTTAAATAACCTTGATGGTCACGGGCGATTTCTGCGCCATTAAATTGCAATGGATTTACCACGTTATCACCTTGTCATGGAGTAATGATTGAGCCACAAACTCATTGTAATCGATAAGCGTCACGTTCTTTAAACGCTCAGATAGGCCGCGAGCGACGACATCGTCTTTTAAGACCATGAGCTTGAAGGGCGATAATGCCATTGCCCATTGACGCAGCAGTAACGCGTTCACGCCATCGCTCGATAGTAGAATCGCATCTTCCTTACAGGCGTAACGTAAGCAGAGTTTCAACGCACTATCTCGGCTGGGAGATGTTTGAATATGATGTAAAATCATCAGAATACTAAGACCTCATCAACCTGTTTTAAATGGGCAGCGATGGCCTCATCGTTGACCACAGTAACAGGAACTGACAGTAAGCCATGGCTTAAGCCGTAGTCGGCTAAGGATTGCTTACAGACAAATACCGATTCGATATCGTACAGCGGCAGTGCTTTAAAGGTGGAGATGTAATCTTTAGCGCCGATAAGCTCAGGCTGTTGATCTTTAATCAAATGCAGCACACCTTCATCGACAAACACTAAACTTACTTCTTGCTCAAAGCTGGCGCTCAGCAGGGCGAGATCTAAACCTTCGCGGCCACGAACAGTACCGTGTGGCGAACAGCGAAATAGGATACAAAATTTTTTCATAGTCACCCTGAGAGTTAAAAACTGATCAAACGATCGGCCGATTCAATGCCGGTGACAAGCTCGCCTAAACCGCCCATGATAAATGACTCGCCCACATTCCAGTGGCTTAGGCCATTCTCGCTGGCATCTTGCTTGCTTACAATCCCACGGCGCTGCGCTGCTGAAACGCAATTGACTAACGGGAGCTGATGGGATTGCGCGAGCTGTTTCCAGTCACTG
ATCACATCGTTTTCATCGGAGGCGGGTAAGCTTAAATCTGTGGAGTTATACACGCCGTCTTGATAAAAAAACACACAGATAATCTGATGCCCAGCCAAAAGTGCAGCTTGGGAAAAGCGTAAGGCGTTAACACTCGCCGACGTGCCATAAGCGGGCCCGTTCACTTGGATAATAAATTTGCTCATTTAAAATAAAAATGGCCCTAAAAGTAGGGCCATTCTAACGCAAGTTAAGCTAAAAATCGCTAACTAGGATTAGTCATCGTTGCCTATGCCCATAAGGTGCAACAGCGCGATGAAGAGGTTTAAGAAATCTAAGTATAATGAGATAGTTGCGCGAATGTAGTTCGTCTCGCCACCGTTCACAATACGGCTAGTATCAAACAGGATAAAACCTGTCATCAATAATGCAATACCGGCGTTAATCGCCATAAAGGCAATGCTGTTACCCATAAAGATGTTAATTACGGCAGCGGCAATCACCACAATCAAGCCAGCAAACAAGAATCCACGTAGAAAAGAGAAATCTTTCTTAGTGGTCACCGCATAGGCTGATAACGCAACGAAAATCACTGACGTTAACCCCAGTGCTTGCATGATCAGTTGTGGACCGTTTGCCATGCCTGCGTAGTGATTCAGCATGTAGCCCAGTGACGCGCCTTCCATACCAGTAAAGGCAAATACCCAGAAGATACCCGCTGCAGATTCGGCTTTACGTAAGGTTACAAACAACAGCACTAGGCCACCAATGGATAAGCCAATGGAGAGTAATGGACTGATATTTAATGCCATTGCCAAGCCAGCACACAGGGCAGAGAAGGCCAATGTCATAGACAACAGCATATAAGTGTTTTTAAGAAGTTTGTTCACTTCCAGTGTTGACGCACTTGTCGAATATAAGGTTTCTTGGGTCATGTTTATCTCCGTTGACACATTTCCAAACTGATCATTTTGATTTTTTACAGCACACTAAGTTCCCGTAAAATCCTAAAACGAACTTAACTGCATTCATGATACCGTACCTGAGATACGGGTGAAAGTCATAGAACCATGAATAACAAATTGTTAATCTTAAAAATTTTGCCTCATCTCACATATCAATCTTAAGTGCTAAACGTTTAGGATTGAGATGGCTGCATCAGTATCTTTATCCACTAAGCACATGAATGCTTGGTTAAAACTTAATTAGCTTGATAAATAGGCCTGCTGAACTTAGGTGCACTTACAGCAAATTTCAAGTTCAATGCCGATTTCAAAGACAAGTTCAATGCCAAGTATAAGGACTAGATCAGCATATTGAGTTTAAACCAAGCTTAAGCCAGCCACTCTTTATGCAATTTTTCAGTATCGCCCAGATATTCCAGCACCCAAGCGAGCGCAGGCGACATCTTGTCGGCATTCCACGCGAGACAACAAGGGCTGGTCTGCTTAGGATTTTCCAGTTGTTTCTCAACCAGCGCACCCGCTTTAATAAACACACTTGCTAAGTGCACTGGCATATAACCAACGCCTAAACCTTCGCGGAAACAGTTGATGGCGCGGATCCAATCGGGTACAACGAGGCGACGTTGATTTTCGAGCAACCAAGTCATACGTTTTGGAATTTCCCGAGAGGTGTCTTCTAGGCAAATCGAGGGGAAAGGGCGCAGCTCATCGTCGCTTAGCGGACGGTCGATATTGGCCAATGGATGATTTTTGCTGACTAAAAAAGCCCACTCAATATCGCCCATGTCTTTATATTGGTACACGCCGCCGACCGGAATTGCGGTTGTTGCACCAATTGCAATGTCACTGCGGCCCGTGGCGAGCGCTTCCCAAACCCCGTTAAAGACCTCAATGCGAATAATCAATTCAATATCATGGAAATGACGGTAGAAGTCAGCAATCAATACACTGATCCTATCGGCACGGACGATATTATCCAGCGCGATGGACAAGGTCGGTTGCCAACCGTTTGCTACGCGCTGCGTGCCACGCTTCATCTCATCCATCTGCGTAAGC
AAGGTTCTGGCTTGCTTAACAAAATGTTCACCCGCAGGGGTTAAGGTTACGCTGCGATGGTGACGTTCGAAAAGAATGACGCCGAGTTCTTCTTCTATTTGCTTGACCGCATAGCTCACGGCAGAGGGCACTTTATGCAGCCGATTCGCCGCCGCGGTGAAACTTCCTACACGAGCCACAATATCTATGAGTTCTAGCGCTTGTTCTGAGAGCATAGCTGACGTCCGTTCCAATGAAAATAATTGATAGCACAAGTCAAAAATAAACGTTTCCAATAAATTTAACAAGTCATTAGACTGCACGCTTTAGTCAATCCGACCCGATTATTTTATGAAAACGTCTAGTAACACCTTTGTAAACATAAAGTTCTTTATGTTTCTATTCTATTTAGCCATGTTGAGCATGCTCGGCTTTATTGCCACTGACATGTATTTGCCTGCTTTCAAAGCAATTGAAGGCTCACTTAACAGTACCCCATCACAGGTGGCGATGTCGCTGACCTGTTTCCTGGCGGGTCTTGCCTTAGGTCAGCTACTTTATGGCCCCTTAGTCAATAAAATCGGCAAACGTTGGGCACTGATTTTTGGCCTAGTGCTATTTGCCGTTGCCAGCGTGTTTATTGCCAACAGTGATTCTATCCTAATGCTCAATACCGCGCGCTTCTTCCAAGCGATTGGCGCCTGTAGTGCAGGGGTTATTTGGCAAGCGATTGTGGTTGAGCAATATGATGCCGATAAAGCACAGGGTATTTTCAGTAACATTATGCCACTGGTCGCATTATCGCCTGCATTAGCGCCGATCCTTGGCGCTTACATTCTGAACGAATTAGGCTGGCGTGCAATCTTTATCTCTTTGTGTGTTATCGCTTTTCTACTGATATTGATGACATTGTACTTTGTGCCAGGCAAGCGTCAGCATACACACAGTGAACATGCTGCGGTATCTTATGGGCAAATTTTGAAAAACACCCGCTATTTAGGCAACGTCGTGATCTTCGGCGCCTGTTCCGGTGCTTTCTTTGCTTATCTAACCGTATGGCCGATTGTGATGGAAATGCACGGCTATCTGCCAACTGAAATCGGCCTTAGCTTTATTCCACAAACCATCATGTTTATTGTCGGTGGTTATGCGAGTAAGTTACTGATAAAACGCATTGGCGGGCATCAAACACTGAATATCTTGTTGTCGATTTTCGCAGCCTGCGTTATTTCGATCATCATGTTCACGCTGATTTTCCCAGCAAGCACGATTTTCCCACTGCTTATTTCGTTCTCTATTTTGGCCGCAGCTAATGGCGCCATTTACCCGATAGTCGTGAACAGCGCGCTACAGCAGTTTAGTCAGAATGCCGCTAAGGCCGCAGGATTACAGAACTTCTTGCAGATCAGCATTGCCTTTGGTGCATCTAGCTTAGTCGCTATGTGGGCGAGCCTAGGTGAAGTAGCGATCGGTTGGGGTATCCTATGTTGCGCTATCTTAGTTGTGGTCGGCTATGTGCTGAAGAGCCAACAAGGGTGGGGCGATTTTGCCAAACATTTCACTAAGCCAGATCCTGCACGTGTTGGCATAGTGGCCGATGTGAAGCAAAATCCAGCGGATTGA
Protein sequences of DBSCAN-SWA_5 >NC_017571|2606330:2617856|2614049_2614439_-|WP_006081639.1|DBSCAN-SWA MSKFIIQVNGPAYGTSASVNALRFSQAALLAGHQIICVFFYQDGVYNSTDLSLPASDENDVISDWKQLAQSHQLPLVNCVSAAQRRGIVSKQDASENGLSHWNVGESFIMGGLGELVTGIESADRLISF >NC_017571|2606330:2617856|2615566_2616469_-|WP_011846842.1|DBSCAN-SWA MLSEQALELIDIVARVGSFTAAANRLHKVPSAVSYAVKQIEEELGVILFERHHRSVTLTPAGEHFVKQARTLLTQMDEMKRGTQRVANGWQPTLSIALDNIVRADRISVLIADFYRHFHDIELIIRIEVFNGVWEALATGRSDIAIGATTAIPVGGVYQYKDMGDIEWAFLVSKNHPLANIDRPLSDDELRPFPSICLEDTSREIPKRMTWLLENQRRLVVPDWIRAINCFREGLGVGYMPVHLASVFIKAGALVEKQLENPKQTSPCCLAWNADKMSPALAWVLEYLGDTEKLHKEWLA >NC_017571|2606330:2617856|2606330_2609084_+|WP_006081648.1|DBSCAN-SWA MSQGNSVRTLTGLQRLLEGGLIICCVLATYILLALTSFSPSDPGWSQSHFQGDIKNWTGAVGAWIADILLYFFGVTAYIMPIIVASTGWLLFKRAHHLLEVDYFSVALRLIGFLLIILGFSALASMNANNIYEFSAGGVAGDVIGQAMLPYFNKLGTTLLLLCFLGSGFTLLTGISWLTVVEKIGLISIWIYKKLKLLPQAMKRERETEDTRGFMSVVDKFKQRRESQYVLEKPPVVATPKVRERHIGRRAEITPTLSTAADDGFITESINTEEVAPQKTKLSALAKILSLNGSKSKNAQRVEPQIDQEDFAAHGNFEAPPWLAEPQHARNDEQERADFNSHSLDHDDNEPVFNSQTLAEDDDESLGFTDDDVIDFDTKASTGAVNQAQRKKQDQKAKIVDGIVVLPGQEDKPAPKKPMDPLPSINLLDVPDRKKNPISPEELDQVARLVEVKLADFNIIANVVGVYPGPVITRFELELAPGVKASKISNLSKDLARSLLAESVRVVEVIPGKAYVGLELPNKFRETVFMRDVLDCAAFTDSKSNLTMVLGQDIAGDPVVVDLGKMPHLLVAGTTGSGKSVGVNVMITSLLYKSGPEDVRFIMIDPKMLELSVYEGIPHLLCEVVTDMKEAANALRWCVGEMERRYKLMSMMGVRNIKGYNAKIAEAKANGEVILDPMWKSSDSMEPEAPALDKLPSIVVVVDEFADMMMIVGKKVEELIARIAQKARAAGIHLILATQRPSVDVITGLIKANIPTRMAFQVSSRIDSRTILDQQGAETLLGMGDMLFLPPGTAVPNRVHGAFVDDHEVHRVVADWCARGKPQYIDEILNGASDGEQVLLPGETAETDEEYDPLYDEAVAFVTETRRGSISSVQRKFKIGYNRAARIIEQMEAQGIVSAQGHNGNREVLAPPAPKHY >NC_017571|2606330:2617856|2611099_2611474_+|WP_006081645.1|DBSCAN-SWA MNNLLLVALGGSIGAVFRYLISIFMIQVFGSSFPFGTLLVNVLGSFLMGVIYALGQMSHISPELKALIGIGLLGALTTFSTFSNETLLLLQEGDWLKATLNVVLNLSLCLFMVYLGQQLVFSRI >NC_017571|2606330:2617856|2613065_2613404_-|WP_006081642.1|DBSCAN-SWA MVNPLQFNGAEIARDHQGYLKDLNDWQPDMALLLAAEEQIVLTPAHWEVINFVRDFYLEYKTSPAIRVLVKAIGQALGPDKGNSKYLYTLFPIGPAKQATKIAGLPKPAKCI 
>NC_017571|2606330:2617856|2609775_2611107_+|WP_006081646.1|DBSCAN-SWA MSSLSFNFAPDFRPLAARMRPRTIAEYIGQAHLLGEGQPLRKALEAGRAHSMMLWGPPGTGKTTLAELIAHYSNAHVERISAVTSGVKDIRAAIEQAKAVAESRGQRTLLFVDEVHRFNKSQQDAFLPFIEDGTVIFIGATTENPSFEINNALLSRARVYLIKRLSHDEIAHIVTQALSDTERGLGQRQFVMPTDVLTTLAQLCDGDARKALNLIELMSDMLADGGTFTTEMLIQVAGHQVAGFDKNGDQFYDLISAVHKSIRGSAPDAALYWFCRILEGGGDPLYVARRLLAIASEDVGNADPAAMTIALNAWDCFHRVGPAEGERAIAQAIVYLASAPKSNAVYTAFKAARELARDTGQVEVPHHLRNAPTQLMKDIGIGAGYRYAHDEANAYASGENYFPESLQTAQFYFPTERGFEKRIKDKLAQLAQLDQASEHKRYE >NC_017571|2606330:2617856|2611492_2612779_+|WP_006081644.1|tRNA|DBSCAN-SWA MLDPKFLRNELEVTAERLATRGFILDIAHLTQLEEKRKSLQVTTEELQASRNAISKSIGQAKARGEDVEAIMAQVGDLGSQLDAKKIELAAVLEEVNTIAMSMPNLPDESAPIGADETENVEVRRWGTPRTFDFPIKDHIDLGEGLNGLDFKNAVKITGSRFIVMKGQVARLNRAIGQFMLDLHTTEHGYTEAYVPLLVNEASLLGTGQLPKFGEDLFHTKPATEEGQGLSLIPTAEVPLTNLVRDSIVDEDELPIKLTAHTACFRSEAGSYGKDTRGLIRQHQFDKVEMVQIVKPEDSMAALEALTGHAETVLQRLGLPYRTVILCTGDMGFGSSKTYDIEVWLPAQNTYREISSCSNMKDFQARRMQARYRVKADNKPALLHTLNGSGLAVGRTLVAILENYQNADGSITIPEVLRPYMGGLTKIG >NC_017571|2606330:2617856|2613397_2613679_-|WP_006081641.1|DBSCAN-SWA MILHHIQTSPSRDSALKLCLRYACKEDAILLSSDGVNALLLRQWAMALSPFKLMVLKDDVVARGLSERLKNVTLIDYNEFVAQSLLHDKVITW >NC_017571|2606330:2617856|2609140_2609767_+|WP_006081647.1|DBSCAN-SWA MKKRLCAVLLASPLLFSAAVFADDVQQLRDKLIGTASLKADFKQTVTDVNKKVIQTGSGIFALAYPNQFYWHLTQPDESQIVADGKDLWIYNPFAEEVVIMDFAEAINASPIALLVHRDDATWSQYSVTKQQDCYEIKPKAIDSGILSVKVCFKNAQLANFNVADDKGNLSQFDLSNQQAITDKDKALFSFVLPDNVDVDDQRRKTAH >NC_017571|2606330:2617856|2616584_2617856_+|WP_006081635.1|DBSCAN-SWA MKTSSNTFVNIKFFMFLFYLAMLSMLGFIATDMYLPAFKAIEGSLNSTPSQVAMSLTCFLAGLALGQLLYGPLVNKIGKRWALIFGLVLFAVASVFIANSDSILMLNTARFFQAIGACSAGVIWQAIVVEQYDADKAQGIFSNIMPLVALSPALAPILGAYILNELGWRAIFISLCVIAFLLILMTLYFVPGKRQHTHSEHAAVSYGQILKNTRYLGNVVIFGACSGAFFAYLTVWPIVMEMHGYLPTEIGLSFIPQTIMFIVGGYASKLLIKRIGGHQTLNILLSIFAACVISIIMFTLIFPASTIFPLLISFSILAAANGAIYPIVVNSALQQFSQNAAKAAGLQNFLQISIAFGASSLVAMWASLGEVAIGWGILCCAILVVVGYVLKSQQGWGDFAKHFTKPDPARVGIVADVKQNPAD 
>NC_017571|2606330:2617856|2613678_2614035_-|WP_006081640.1|DBSCAN-SWA MKKFCILFRCSPHGTVRGREGLDLALLSASFEQEVSLVFVDEGVLHLIKDQQPELIGAKDYISTFKALPLYDIESVFVCKQSLADYGLSHGLLSVPVTVVNDEAIAAHLKQVDEVLVF >NC_017571|2606330:2617856|2614508_2615168_-|WP_006081638.1|DBSCAN-SWA MTQETLYSTSASTLEVNKLLKNTYMLLSMTLAFSALCAGLAMALNISPLLSIGLSIGGLVLLFVTLRKAESAAGIFWVFAFTGMEGASLGYMLNHYAGMANGPQLIMQALGLTSVIFVALSAYAVTTKKDFSFLRGFLFAGLIVVIAAAVINIFMGNSIAFMAINAGIALLMTGFILFDTSRIVNGGETNYIRATISLYLDFLNLFIALLHLMGIGNDD |
12 | uncultured_Caudovirales_phage(28.57%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_6 |
4887006 : 4900743
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NC_017571|4887006:4900743|DBSCAN-SWA ATTACGCCTCCTCCAACATGATAATCCGTAGCGTATCTAATGCCTGATTAAAATCGAGCGCTTGCACTTGCCCATGCAATATGGACCATTGTGGATGGCCTGATAACAACTCGGCTAACTGGTCAGTTAAGGCCATAGCCTCTAAATTGTTCTCTTTTAATAGGTTCATTAGGGTATTAAGGTCCGTCTTAACACTTTGCACCGATACCGAAGAATACACGGCTTTTTCTACATTATCGCTAATAACTGCATTAAATTCTTCCACCTCAAAGTAGGTTGAAAGCTGCTGACTACTCTGATTAATGAGTTCCTCTAACGTATCCGTCCAGCGCTTAATATCAATCAGTTCAGGACATTCTTGCTTAAACTGTTTCTCTAAGTAAGCCGCATGGGCAGCTAAACGTCGTGCGCCAAAATTACTGGCAATGCCTTTAATACCATGGCTAATCGCCGCCGCCGTTGAGTGATCAAATATCTTAGTCGACTGCTTAAAGAGCGCGAGTTGCTTCATCATTTCAGGCGCAAAACTTTTGGCCATTTTTTGGAAGAAAGACTGATTCCCCCCAAAACGACGCAGGATCAGGCGAATATCATCGAGTAATTGCTCATCACTCTGCTCCTGCGTCGCTTCAGGCCAGTGCCCTTCAAATTCAGCAGAGGTCATATCCTCACGGCCAACTAAGCGCAAAATACTCGGTAGTAATAATGGCATATCGATGGGTTTGCCAACATGGTCATTCATACCCGCATTGAGACACTCTTGCCTATCGGACTGTGAAGCATTAGCTGTCATCGCAAGTATCGGTAATGCTGCGAAACGTTCATCAGCTCGGATACGTCGAGTCGCCTCCAATCCATCCATATCGGGCATTTGCATGTCCATGATGACAATATCGAATAAGTCACCACTCTCGAGCACCAAGGAAACACCTTCCATGCCGCCTTCTGCCAATACGACATTCGCGCCCTCATAACTCAATAGCTCATCGATAACTTGTCGATTGAGTTGATTATCTTCCACCACTAATATGGTTAGCCCTGCTAAAAATCGCTGTGAACGCGGCTGAGGGTGCGCCTCCATTGTCTTACCTTCGATGGCATTTAACACAGCTTCAGCCAAGATCTGCGATGTCACAGGTTTAGTGAGGAAATTAACGAAAGGGACATGTCTAATTTGCTGACTCTCGGCAATCACTTCATGACCGTAAGCCGTTAACATCACCACCAGCGGAGTCTGGTTATTCGCGTCTGAGTTACGCAGCATTTCGGCGGTTTGCAGCCCATCAAGGTCTGGCATACGCCAATCCATCAACACCACATCAAACGGTGTCGATTTTTCATGTGATTGTTTAACCTTCGCGAGAGCCGAATATCCGCCTAAGGCAGTTTCAACCTCACACCCAAATCCGGTCAAAATTTTCTCTAAAATCTCTACCGTCAATTCATTATCATCAACCACTAACACACGGTATCCGCTAAGATCTGTGTGCCGAGTTTCTTCGACTTGTATAATAGGAAACGTCATATCAAACCAGAAACGACTTCCAACACCCACCTTGCTGGTCACCTGTAACTCACTGCCCATAAGGCCGATCAGCCTTTTGCTTATTGCTAGCCCGAGCCCTGTGCCCCCAAAACGGCGTGATGTTGAGGATTCTGCCTGTTCAAATCCGGTAAAAATCCGCTCAATCTGCTCGTCACTGATACCAATACCCGTATCGGTAATTGAAATTTGTACTGTGACTGTATCAGCCTCATGTCGTAAACATTCAATGCCGACAATCACTTGACCGTGGTGAGTGAATTTAAGGGCATTACCTGCGAGGTTAACGAGGATCTGTTGCAATCTAAGTTGATCTGCTAATAACCAAGGAGGTAAAGCCGTGTCGAGATCAAACATCACCTCGACATCTTTATCACCATGG
TTACCCGACAGTACAACGGCAAGGTCCCGCATCAGCAATTCAATCGAACAAGGATGAAGATCGAGTACTAACTTGCCTGCATCAATTTTAGAAAAGTCGAGAATGTCGTTCAGTAGTCCTAATAAGGACTTAGCGGCCGTTTGAGTTTTATTAACATAATCTTGCTGCTGCCTAGACAAAGAGGTGTATTGAATTAATTGCAGCATGCCTAATACGGCATTCATTGGGGTGCGAATTTCATGGCTCATATTGGCTAAAAATGCAGACTTAGCCGCACTCGCCGCATCGGCCTCCTGCTTAGCTTCTCGCAAGCTTTCTTCCAATTGTTGCTGTTCAGTGATATCTAAATTAATCCCAATCACTCTAATCGCATGACCCTGTTTATCGCGCTCAACTTGGGCGGCTGCCTGAACGTAGCGAATCGAGCCATCTGTCCTTACCACCCTAAAAATAGGGTCATATTGCGCCGTTCCTTCCACAGCGCCTTTCAATTTAGCATCAGCAAAAGCAATATCGTCAGGATGAACCCGCATCGACCAGTGTTCATAGGTTAATCCTTGTTCACGAAGACTTTCAGGCTGATCGTAAATCGCAAACATACGTTCATTCCATTCAAGCGAATTACCGAGCAGATTCCAAGTCCAGATACCTAATTGAGCCACTTCGGCTGCCTTACTTAAATGGTTACTGGCCGTCATCAGTGCTTGTTGCTGCAGTAACATCAGTGAAATATCGACGCCAATGCCTAAATAACCAATAATCTCGCCAACACTATCACGCATAGCAGTGACAGAAAGTGACACTTGGAACTGGCTACCGTCTTTACGCACATAGGTCCAATTGCGAGTTTCAGAACCTTCGGCTTTTGCTTTATAAACAAAAACATCAAAGCCTTGAATGTCTTTACCGTATTCAGCGGATAACTCTGCAGATCTTATCATCACCTCCTCGAGCAAGTGCAGAGGCGCAGGGGTAGATTGACCTATCATTTCCTCGGCTGAATAACCGAGTAAGCGCTCTGCTCCACGGTTAAAAATGGTGATATTCCCTTGCACATCAGTGGCTATGATGGACATTTCAGAGGCGGCATCGAGCACGTTAGTCAGTAATGACCCTAAGTTATTTTTCTCAATCTGTGTCAACTTACGGTCGGTAATATCAATCCTTAATGCCACAAAACGTTCAATTTTACCGTATTCATCAAAAACGGGACCAATCACAGTATCGAACCATTTCAACGTTTGATGCTTATCAAGGTTGCAGATTTCACCGTGCCAAGACTGTCCCGAGTTTATCTGTTGCCACATAGATCGCCAGAAAGACACATCATGTTCACCTGACTGAAGTATTGCATGGGTTTTCCCCACCAGTTGCTCACGCGAATAGCCACTAATGCGGCAAAAGTTATCATTCACCTCTAAGATGACGCCATTGGGGTCAGTAACGGAATAGAGTAGCTGTTGATTAATAGTGTCGAGCAAGGTTTGGTTTTCGAGTAACGCCTGCTGCAACGCATGGGTACGCTTGGTGACTTGCCGTTCGAGACTCGCATTCAAGCTTAAAATATGTCGTTCAGCTTCAAGCTGCGCTGTAATATCGCGGATCGTTTGGCTCATGCCAGCTATCTTGCCGTGCTCGTCATAAATAGGCACGGCCGTAGTTGAGGTCGATAAATGACTACCATCTTGCCGTTGATGGCTAGTTATCCGATTTAAAGAGGTTTTACCCGCTAACACTTCGGCGAATAATGCTCTTTCTTCCTGCACTATGCTGGCAGGAACAATCAATTTACAACACGACTGGCCAAGAGCGTGCGCCTCGGTATAGCCAAATAGCTGCTCCGCACCTTGGTTCCAACTGGTAATTTGGCCTTTAGTGTCATAACTGATAATGCCATCGAGAGAATGCTCCAACATGCTAGCGCGCCTTGCCTGTTCAAGCAGCACCTGCTGCTTACGCTGCAAACTGATGGACCACATGGCAATTAATGCGGCTAAAAG
CAGGCTAAACAAACTGCCACTCAACAGCACTAAACTGGGTTGATTAAGATGCAATGACTTAATAAATGATGGATAAGCGATGACTTCTATCTGCCATTGACGACCAAAAATGTCTTTCGTCATTTTATGGCGATATGCAGATAATGGAGACGCCTCATCAGGGTGTGTTTCAAAAAAACTGATGGGACGCTCAGCTTGGGTGATATCGCTAAGCAACAGCTTAGTCATCTTCTGATTCAACGCTAATCCCATTAGAACTTCGTTCGTCACTAAGGGGGCATAACTCCAACCATACCCAGCTGAAAGTCTATCCTGTTGTGATTTAGGTACTAAACCCGTGCGATAAATAGGCTGCAGAATTAAGAATGATTGTAGCGGCCTACCCGTAGCCTGCACTAATGTGATGGGTGCAGATAAGCGCACTTCACCGGATAACATAGCCGCATCGGCAGCGGCTTTACGATGTGCCTCTGACGCAATATCTAAGCCAACGGCAGCTTTATTGCGGTCTACAGGCTCAATATATTCAATAACGTATCTTTCGCCATCATGAGCGTTAAGTTGGCGAATATTAAAGTCAGGCCAATCATCTTGACGAGCTCTTGCTATAAATGCCGCTTCATCGGCACTGGGTACTCGACGAATAAAGCCAAATCCACGGGCGCCGGGAAACTCATGGTCCACATCCCGCGTTAAACTATAACGATGGAAGCTAGCACGACTAATATCATGCTCACCCGCAGTGACTATCGTGCCGCGTGCACCGCGTAAACCATATTGATATAAGGTGATACGGTTAATCACGTTCTCACTGATTTGCTCCGCGTCCTCGCGCAAGGCTTGCTCAATCTTTTGGGCGTTCATGCGCCCCGTCAACCCTGTTAATAACGCGCATAGCACCAATCCAATCAACAGGACTAATAAGCCCCATTTCGAAGCATTTTTATAACTCATGGACAAGTGCATAAACGCCTAATAATTTTTAATCAGAGAGTTATGATTTTGGACTAATAAAAATCCTTTTGCCACTCAGCGAGTGGCAACAAAGGATAAATGCGCCAAAAGTTGCTGATGCCCATTTAGTTACAATTTCACCCTTGGCGTGAATAAACGCGAGTATTCCCATGCAATTTCATTTGCTTAGTATTAACATTGGGCAATAAATGTCAGCATTTATCGCATTCAAGCCACTAAGGTGAGCTAGTCGCCTAGGTTGAATGCGGCTATATTAGGCAGATTCGTTTTCAATTTCAGCCAGTTGGTTAATCTTAACTGTTTCTCGTCAGTAGAATAAGCACCAATCTGAGCAATTAAAGGACATCCCATGGGACAAGAAACCTCGAAAATCCTCGTCGTCGATGATGATATGCGCCTGCGAGCGTTACTCGAGCGTTACCTGATGGAGCAAGGTTATCAGGTGCGCAGCGCGGCCAATGCCGAGCAGATGGACCGCCTATTAGAACGCGAAAACTTCCATCTACTCGTGCTCGACTTGATGTTACCCGGTGAAGATGGCCTATCTATCTGCCGCCGTTTGCGCCAGCAAGGTAATCCTATTCCGATCGTGATGCTGACGGCCAAGGGCGATGAAGTCGACCGTATTATTGGCCTAGAGTTAGGCGCCGACGATTATCTGCCAAAACCTTTTAATCCACGGGAATTACTCGCACGGATCAAAGCCGTAATGCGTCGCCAAACCCAAGATGTGCCCGGTGCGCCCGCCCAGCAGGAAGCGCAGATCAGCTTCGGAGAGTTTTCGCTAGATTTAGCCACCCGTGAGATGTACCACGGCGACGAATCGATTGCGCTCACCAGCGGTGAGTTTGCCGTGCTTAAAGTGCTCGTCACCCATCCACGCGAGCCCTTGTCTCGGGATAAGTTGATGAACCTCGCCCGTGGTCGCGATTATTCGGCGCTAGAACGCTCCATCGACGTGCAAGTGTCACGTTTGCGGCGTTTAATCGAAAAAGATCCTGCGAATCCGCGTTACATTCAAACTG
TGTGGGGATTAGGTTATGTGTTTGTGCCCGATGGCGCAGCGCGTCGATGAGCCGCATTGCCATCTTAATCTCCTGCGGTTATTGAGCCCCCGATGAAAGCTAAGTTTTGGTGGCGGTTTGTGCCTCGCAGCGCGTTTAGCCAAACCGTGATGCTGATTGGTTGTCTATTGCTGATCAATCAGCTGGTGTCTTATGTCACAGTTGCCGTTTATGTGCTTAAGCCAAGCTACCAGCAAATCAATCAGTTAATCGCTCGCCAAGTCAAATTGCTGTTTGTCGATGGCATAGATATTGGCCGCGAACACTTAACCATAGTCGATGCCCTCAATGCCAAAGTCCACGACGATGGCATGAAGATCTACAATCAACAGCAAGCGCGTGAAGCTGGCATTGAGCAAGCGACCTACTATGGTTTCTGGTCGGCGCAGATGTCGGAATATCTCGGCGGCGACGCCGAAGTGCGCGTCACCCACGGCAGTGTATTGCAGATTTGGATCCGTCCACCACAGGCGCCCTCGATTTGGATAAAAGTGCCGCTTATCGGCCAGAATGTTTCAGACCTTTCGCCGCTTACCCTCTATTTAATGGTCATTGGTGCCCTCAGCGTCGCCGGGGGTTGGTGGTTTGCCCGCCAACAAAACCGTCCACTTAGGCGACTACAAAAAGCCGCGATTGCCGTTTCCCGCGGTGAATTCCCCGATCCCTTGCCGCTTAATGGTTCGAGTGAAATTGTCGAAGTGACCAACGCCTTCAACCAGATGGCGCACAGCATGAAACAACTGGAGCAGGACAGAGCGCTATTGATGGCGGGGATTTCCCACGACTTGCGCACGCCGCTCACCCGTATTCGTCTCGCATCTGAGATGATGGTTGAGGAAGATCAATATCTTAAAGATGGCATAGTCAACGATATCGAAGATATGGACGCCATCATCAGCCAGTTTATTGCGTATATTCGCCAAGATCAGGAAACCAGCCGTGAACTAGGGCAAATCAATAAACTCATTCAAGATGTCGCTCAAGCAGAAGCCAATCGCGCCGGTGAAATAGAAGTGGTGTTAACTGACTGCCCCGAAGCCCAATTCCAAGCGATTGCCATTAAGCGGGTACTGAGCAACTTAGTCGAAAATGCCTTTCGCTATGGTTCTGGCTGGATCCGCATTAGCTCGCAGTTCGATGGCAAACGTATCGGTTTTACTGTTGAAGATAATGGTCCAGGAATCGACGAGTCACAAATCACCAAACTATTTCAACCCTTTACCCAAGGTGATATCGCCCGTGGCAGTGTCGGTTCAGGCCTAGGTCTCGCCATCATCAAACGGATTATCGATAGACACCAAGGGCAAGTCACCCTCTCCAACCGCGCCGAAGGCGGTTTAAGAGCCCAAGTGTGGCTGCCGCTGGAATAGCGCGAGATCATATCTCTGCAAATGAAATGACAACGCTGACATTCAAGTGGGCGCTTGTCATATTAATGTCACGCTAAATTCATAAACTCAGAACAGTTAACTGGCTCTGACAATTATGAAGATTGCAAAATGAAAATTTCAACGCTGTCGCTCTCTGCTTCTGCCCTGTTGTTATTGCTCGCGGGATTATTAGCGGCCGTAGTGCTGTGGAGTAGTGATCAGCGACAACATATAGAACAACAAACACTCACACTGCAAAGCATACAAGAAGACTTCCTCGTCGGCGTACGCCGCGATCTCGACGGCTATCTCGCCAGCGGCAACTCTAGCCAACTTGAACAAGCTAAAACTAAACTCAGCGCCATTAAGAACCAGCTCACAGAGCTTAATCTCGCCACCATGGGCGCTACCGATAATGACTTGCAAGCAAGTCTCAGCAGCTTCATTCAAGATTTAGATACCAAGTACCGCGCAGCGGGTAAACTCGCTGGCAACCCCAGGCAACTGCTGGCCCACGCCGAATCTGAAATGCTCGACTATAACCGTCGTTTGGGCAGCTATGCCGATAAAGGCTTAGCCACAAATGCGGAG
GTCGCCGAACAATATCTGCAATTGAGCCGCGATTTACCTTCTATCGTCTATCAACTTTCCCAGCTGACCGATGGCTATCTGATCGGCAAAAATCAGCAACTCAAGGGCATTTTAGACAGCACCAGCAAAGAGCTAAATACATGGCACGACACCCTCAGCGCACTGCCGTTAATCGGCGTATATGAGCAGCAAGAAGCCGATGAATTTGCCCTCGGTGCCAGCGAACCAGAACAGATTGAAGTCGGCGAAAGTGACCGCAGTGAACTGCTCAGTCTCGCCAATCGATACAACAAAGAAGTCGCCAATACCCACCAATTGCTGCAAGCCAACCAAGAGATGCAAGATCAACTCATTCAAGCAATTAGCAATGTGGAACAACAGCTTATTGGCTTAGGTGAGGCTCAGGCGGCAAAGAATCAACAGCTGAAATATGAGTTACAGCTAATCCTCTACACTATGGTTTCCATCATGGCCTTGTTTGCTATAGGTTATTTAATCCTGCAGCAAAATCGCGTCGTCAAACCGCTGAAACGTCTTAATCAAGCCTTTATGAAATTAAGCGAATCGAACAGCCGCGAGCGTTTAGATATTAATCGTCGCTGCGAGACGGGCCAAATCGCAGGTCATTTCAACCAGTTGCTGCAAAGGTTTGAACAGGAAGACGAAGCACAGCGCCAGCAAATCACTAAGGTTTCTCAATCATTAAGCCAACTGGTTGCGCGAATTACTCAGCTGTCGCAGCACACAGAGCACACCCAAACGATTGTGGCCGATACGCAAACACAGACCGAACATATCCGCAGCCTCGCCAATGAGGTGAGCCACACTTCGGCACTCGTGGAGGACAGCGCCGCCGAAACCATGCGCCAAATGCAGTCGAGCCAAACCGAAGCAGAAGCGGTACTGAGCGCAACTGAGCAGACGCAAACCGCCGTTGGCCTTTGCCATGCTTCCCTTGAAAGCCTGAATAACTCAGTGACGGATGTGTCGAAAATCATTGATGTAATTGGCAATATCGCCGAACAAACTAACTTACTCGCCCTCAATGCCGCCATTGAAGCTGCGCGGGCGGGAGAGCAAGGTCGCGGCTTTGCCGTGGTTGCCGATGAAGTGCGAAATTTAAGTCAGCGCACCCAAGTGTCCTTAAACGAAATCGTGAAGATTCTGCAGCAACTGACCCAATCGAATCATGCCCTAAGTGAGAGTGTCGATGGCATAGCTCAAGCCACCAGCAGCCAAAAACTGCGCGCTCAAAGCCTATGGCAAGTGGCACAAACGGTGCAAAACCAAGCGAGCGACATGGCCAATACCGCTAAACAAGGCTCGCTCAATGCCAAGGAGCAAGTCGATTATCTCGATCAATTTGTGCGGACTATGGATAGCTTAAAAGAACAGGCGCAAACCAGTTCACAACAGAGTGAAGTCATTGCCCAAGAAGTGCAGCAGAGCGTCGAAGACATTGAAACCAGTTTAGGCATTGCCGATGCTGGTAACTTACCGCCACAGTCGCGGGCCGCTTAACCCACTAAAATACCAATAAAAATGGGAGCCTAAGCTCCCATTTATTTTGATGTCCGATTACGTTCGGACCTTAATCATTACAGTGCAGCTAAAACCACTTCTGCTTTACTGGCTTCAAAAGCCTTTGGCTCTTCAACATTCAGCAGAGTCACTACGCCGTTATCTATGATCATGGCGTAACGTTGTGAACGCACGCCACCAAAACCAGCAGTGTCCATTTCTAAGCCTAACGCTTTAGTAAAGCTGGCATCGCCATCGGCTAGCATCATCAATTCAGAGGCATTTTGCGCTTCGCCCCACGCTTTCATCACGAAGGCATCGTTCACAGCAACACAGGCAATCAAATCCACGCCTTTCGCTTTAAATTGATCGGCTAATACCACATAGCCAGGTAAATGCGCTTCAGAACAGGTCGGAGTAAACGCACCGGGTACAGCAAACAATACCACTTTTTTACCAGCGAACAGTTCAGTAACTTG
GTGATTCACCATGCCATCTTTCGTTAGTTGGCCTAATGTAGCTGCTGGTAATGTTTGACCTTGAGCAATCATATTTATCTCCATTTAGTTAAATTAACCTGACCATACTAGCCCTATTTACCTGAGATAAACACAGAGAGAAAAAAGGGTGATTAGCCGCGAGTACGATTCATCCGCAGCCCAGCACACAAACCGAGGAACAACAGACCCAATACCGAGAAGGCACCGCCACTTTCTTCGACGACGATAACTTCAGGCTCTCTGTCTCGATCGCTACTTTCCAGTGGCAACGCATACAAACCATCGGTTTCATCGGCACTGATGGTCGCGACAATATCGCTGTAACCCACTTCGTACACATCGATTAGCACATCGTAGTGGTCCGTCGCATAACCCGTGTAGAGCGTGGTCAGCACTTCATAATCGTCCTGTGTTGAATCGCCATAAATAGTAAACACGTCCGTGGTGTAGTAATGCACCCAAGGGCCGCCATCACGACTGAGATAAAGCTCGGCAAATAAGTCAGCTCTTTCATTAAGATACGCGCCGTTAACATCGACGTCGAAGGTCACGCTAAAGGTTTGATAAAACCCATCGTAGTCAAAGTCTTCGAACAAACGACTGCTGGCATCAAAAATCGAAAAGCTGTGATACACAGGGGCGCGGTAGGGATCTTCACTGGTCGCGCTCGAACTCGGCATGCCTTGGCTCGCATGTTTCGCAGTCACTTGCTCACGCGTCATCGGCGATGCGCCCATTAGTTGCACGCGGTTTGCCGTCGGCGCTTTAGAGGATAACGCGGGCGCCAAGGATTTAACGGCTGCAGCAGCGGCTTCAGTGACATTTTTTGCGGGCGCAGCTTGTTGTAACAGGGCTAAAGCTTGCTGTTCCTGCTCGGCTGCTTGCTCGCTGTTTTCAGCTTTTTTGGCAATGCCGACACTGGCCGCAGTAAATGGCGTTAAGCTTTGCGATGATTCCAGTTCGCGGGGCTGAGCGCTGACGGCAGTGGACGCTAAAAACAGCGCGGCAATGGCCGTCGCTTTAACGAAATGGGTGTTAACCTGACTGTTAACTGTGGCTGCCATACCTTGTTTATTGAGTGTGTTCATCTTGAATACCCCATGGATAACCCGTTCAAATTGGAGCCATTAAAGTGCAATGAAGGTGAACATAAGCTGAACATCAACATTTAGTGTTCTTAGCGCGACAATACGTTCAGCCTTAGTTCATCTGGATTAGCCGACACTAGGGCATATTGAATGCGAGCATAGGAATAGACTCGCTGACAGAGCTTCAGTTCTGTCACGAATAACAAAATGACTACCAAAGTTAATTTTGATGGAATTGTGCCAACCAATTGGCTGATCTCTATGTGTTTGTTAAGGTGATGAGGTATTCTTAGTCGACTCGATAAGTTATCCATCTCACCCATTAACTTAAGATCGCCGAACATTGCGGGTACCAGATATGAAACTTGGTTTGACATTGGCCCTAATCGCTTGCTTGTTTGCCTCCTTTGGCAGCATGGCAGGCAATGATAGACAGAATGATCGCAACCAAGGTGCGAAGAATGAGCAGCGCCGTCTGGCGGTCAATAGCCCAGATCAAGCTGTAGCTATGGTGCAACGCCAATATCAAGGCAAGGTACTGAGCGTGCAATCCAGCGGCTCTGGGTATCGGGTTAAGCTCCTCAATAATGACGGCCAAGTGTTTTCAGTCTCAGTGGATGCCGCAACTGGCCGAGTGTCGAGGAACTAACTATGCGATTGTTATTGGTTGAAGATGATTTAGAACTGCAGGCTAACTTAAAACAACATCTGCTCGATGCCCATTACAGTATCGATGTGGCGAGCGATGGCGAAGAAGGCTTATTTCAAGCACTCGAATACAACTATGATGCGGCGATCATCGATGTCGGTTTGCCTAAGCTCGATGGTATCGCGCTTATCCGCAGCGTACGCGAACAGGAACGTGACTTCCCAATCCTGATTTTAACCGCG
AGGGACAGTTGGCAAGATAAAGTAGAAGGACTCGACGCCGGTGCCGACGACTACCTGACGAAACCTTTTCATCCCCAAGAACTGGTCGCGCGGCTAAAAGCCTTGATCCGTCGCTCTGCCGGCAAGGCCAGCCCTTTGATTTATAACGGTCCCTTTAGCCTAAATACCAGCAGCTTAGAAGTGCGTAAAGGCGAAGAACTGGTTAACCTCAGCGGCTCTGAATACAAGTTATTCGAATTTTTTATGCTGCATCAGGGTGAAGTGAAATCCAAAACCGTCCTCACAGAGCACATCTACGATCAGGATTTTGATTTAGATTCTAACGTCATCGAAGTCTTTATCCGCCGGTTACGTAAAAAACTCGACCCTGATAATCAATACAACCTGATTGAAACCCTGCGCGGCCAAGGTTATCGCCTAAAAGCGTTAACGCCCGAGCCAACTGCTGATGAGTAAACGTGCCATTGTGTGGCAAGTCTTGAACTCCCTTAAAGCGCGCTTAGTGATTAGCGCGCTATTGTTTATCTTAGTGTTATTACCTTTAATCGGCGTCGCCTTAAACGATGCCTTTACCGAGCAAGTTAAAAGCGCGACCAAAAATGAACTCAGCGCCTATGTCTATTCGATACTCGCGGTGACTGAAGTCGAGAATAAACAAATCTCTATCCCAGAATTAGTACTCGAAAATCGTTTTAACCTCATTCAATCTGGGCTGTATGCCATTGCGACCACTGAAGATGCCAGCGGCAAACAAACGATCGTGTGGCACTCGCAATCTTTTATGGGCATGGTGCCACCGCCGCATTTTACCATCCCAGCAACCGGTCAAAGTGCCTTTGAGCAAATCGAGCTCGCCGAACAACCCCATTTGATTTATAGCTTTAGCGTCAGTTTCGCCAGTCAAAATCAAAACGTGCCTGTGACGATTCACATCATTAAGGACGAACGCGAATTTCAGCAGCAAATCGATCAATTTAATCAACAGCTTTGGACTTGGCTGCTGATCTTAATGTTCGTCATGCTGGTGTTCCAACTGAGTTGGCTAGTGTGGACACTGCGGCCATTGGCGCGCTTTACCCAAGAATTACATTCCGTCGAACAAGGAAAGTCGATGCAGCTAAGTAGTCAGTACCCCACTGAATTACAAGCCGTTGCTCGGCAGCTCAATATTTTGCTCAATACCGAACAAACCCAGCGCAAACGCTATCGTAATGCCTTGGCCGATCTTGCCCACAGCCTCAAAACACCGCTGGCGGTGATCAAGAGTCAGGCCGACTTAAGCGAAACCTCCAGCGAGCAAGTATCGGTGATCAGTCGCATTATTGGCCACCAGCTGAAACGCGCCCAAACGGCAGCGGCAGCCTCGTGGCATTTAGGGATTCGCATCGATGACGTCGCCGCTAAGTTATTACGCACCTTAGCTAAAATTTATCGTGAGCCGCAAATCAACCTCAGCGGCGAGATGGCTGACGAAGCAGTATTCAAAGGTGATGAAGCGGATCTCACGGAAATTCTCGGTAACGTGCTCGACAACGCCTGCAAAGCGGCAAAGTCCACGGTTAAATTAACCGTGACCGGCGATGCCTATCAATTGCTGATCTGCATCGAAGATGATGGGCCAGGGATCAGTGAAGCGCTGCAAAATCAAATCTTTGAACGCGGTATTCGCGCTGATTCCTATCATCAGGGTAATGGTATTGGGCTTGCGATCGTGCGCGATTTAGTCGACAGCTATAACGGCAGAATTTCAGTCTCACGTTCAGAAACCTTGGGCGGCGCCAAGTTCAGCATCAGCTTTATGCACTCAATTTAA
Protein sequences of DBSCAN-SWA_6 >NC_017571|4887006:4900743|4892254_4892980_+|WP_006079507.1|DBSCAN-SWA MGQETSKILVVDDDMRLRALLERYLMEQGYQVRSAANAEQMDRLLERENFHLLVLDLMLPGEDGLSICRRLRQQGNPIPIVMLTAKGDEVDRIIGLELGADDYLPKPFNPRELLARIKAVMRRQTQDVPGAPAQQEAQISFGEFSLDLATREMYHGDESIALTSGEFAVLKVLVTHPREPLSRDKLMNLARGRDYSALERSIDVQVSRLRRLIEKDPANPRYIQTVWGLGYVFVPDGAARR >NC_017571|4887006:4900743|4893022_4894339_+|WP_006079506.1|DBSCAN-SWA MKAKFWWRFVPRSAFSQTVMLIGCLLLINQLVSYVTVAVYVLKPSYQQINQLIARQVKLLFVDGIDIGREHLTIVDALNAKVHDDGMKIYNQQQAREAGIEQATYYGFWSAQMSEYLGGDAEVRVTHGSVLQIWIRPPQAPSIWIKVPLIGQNVSDLSPLTLYLMVIGALSVAGGWWFARQQNRPLRRLQKAAIAVSRGEFPDPLPLNGSSEIVEVTNAFNQMAHSMKQLEQDRALLMAGISHDLRTPLTRIRLASEMMVEEDQYLKDGIVNDIEDMDAIISQFIAYIRQDQETSRELGQINKLIQDVAQAEANRAGEIEVVLTDCPEAQFQAIAIKRVLSNLVENAFRYGSGWIRISSQFDGKRIGFTVEDNGPGIDESQITKLFQPFTQGDIARGSVGSGLGLAIIKRIIDRHQGQVTLSNRAEGGLRAQVWLPLE >NC_017571|4887006:4900743|4896528_4897002_-|WP_006079504.1|DBSCAN-SWA MIAQGQTLPAATLGQLTKDGMVNHQVTELFAGKKVVLFAVPGAFTPTCSEAHLPGYVVLADQFKAKGVDLIACVAVNDAFVMKAWGEAQNASELMMLADGDASFTKALGLEMDTAGFGGVRSQRYAMIIDNGVVTLLNVEEPKAFEASKAEVVLAAL >NC_017571|4887006:4900743|4894468_4896451_+|WP_006079505.1|DBSCAN-SWA MKISTLSLSASALLLLLAGLLAAVVLWSSDQRQHIEQQTLTLQSIQEDFLVGVRRDLDGYLASGNSSQLEQAKTKLSAIKNQLTELNLATMGATDNDLQASLSSFIQDLDTKYRAAGKLAGNPRQLLAHAESEMLDYNRRLGSYADKGLATNAEVAEQYLQLSRDLPSIVYQLSQLTDGYLIGKNQQLKGILDSTSKELNTWHDTLSALPLIGVYEQQEADEFALGASEPEQIEVGESDRSELLSLANRYNKEVANTHQLLQANQEMQDQLIQAISNVEQQLIGLGEAQAAKNQQLKYELQLILYTMVSIMALFAIGYLILQQNRVVKPLKRLNQAFMKLSESNSRERLDINRRCETGQIAGHFNQLLQRFEQEDEAQRQQITKVSQSLSQLVARITQLSQHTEHTQTIVADTQTQTEHIRSLANEVSHTSALVEDSAAETMRQMQSSQTEAEAVLSATEQTQTAVGLCHASLESLNNSVTDVSKIIDVIGNIAEQTNLLALNAAIEAARAGEQGRGFAVVADEVRNLSQRTQVSLNEIVKILQQLTQSNHALSESVDGIAQATSSQKLRAQSLWQVAQTVQNQASDMANTAKQGSLNAKEQVDYLDQFVRTMDSLKEQAQTSSQQSEVIAQEVQQSVEDIETSLGIADAGNLPPQSRAA >NC_017571|4887006:4900743|4898397_4898688_+|WP_006079501.1|DBSCAN-SWA MKLGLTLALIACLFASFGSMAGNDRQNDRNQGAKNEQRRLAVNSPDQAVAMVQRQYQGKVLSVQSSGSGYRVKLLNNDGQVFSVSVDAATGRVSRN 
>NC_017571|4887006:4900743|4898690_4899386_+|WP_006079500.1|DBSCAN-SWA MRLLLVEDDLELQANLKQHLLDAHYSIDVASDGEEGLFQALEYNYDAAIIDVGLPKLDGIALIRSVREQERDFPILILTARDSWQDKVEGLDAGADDYLTKPFHPQELVARLKALIRRSAGKASPLIYNGPFSLNTSSLEVRKGEELVNLSGSEYKLFEFFMLHQGEVKSKTVLTEHIYDQDFDLDSNVIEVFIRRLRKKLDPDNQYNLIETLRGQGYRLKALTPEPTADE >NC_017571|4887006:4900743|4887006_4891896_-|WP_006079509.1|DBSCAN-SWA MHLSMSYKNASKWGLLVLLIGLVLCALLTGLTGRMNAQKIEQALREDAEQISENVINRITLYQYGLRGARGTIVTAGEHDISRASFHRYSLTRDVDHEFPGARGFGFIRRVPSADEAAFIARARQDDWPDFNIRQLNAHDGERYVIEYIEPVDRNKAAVGLDIASEAHRKAAADAAMLSGEVRLSAPITLVQATGRPLQSFLILQPIYRTGLVPKSQQDRLSAGYGWSYAPLVTNEVLMGLALNQKMTKLLLSDITQAERPISFFETHPDEASPLSAYRHKMTKDIFGRQWQIEVIAYPSFIKSLHLNQPSLVLLSGSLFSLLLAALIAMWSISLQRKQQVLLEQARRASMLEHSLDGIISYDTKGQITSWNQGAEQLFGYTEAHALGQSCCKLIVPASIVQEERALFAEVLAGKTSLNRITSHQRQDGSHLSTSTTAVPIYDEHGKIAGMSQTIRDITAQLEAERHILSLNASLERQVTKRTHALQQALLENQTLLDTINQQLLYSVTDPNGVILEVNDNFCRISGYSREQLVGKTHAILQSGEHDVSFWRSMWQQINSGQSWHGEICNLDKHQTLKWFDTVIGPVFDEYGKIERFVALRIDITDRKLTQIEKNNLGSLLTNVLDAASEMSIIATDVQGNITIFNRGAERLLGYSAEEMIGQSTPAPLHLLEEVMIRSAELSAEYGKDIQGFDVFVYKAKAEGSETRNWTYVRKDGSQFQVSLSVTAMRDSVGEIIGYLGIGVDISLMLLQQQALMTASNHLSKAAEVAQLGIWTWNLLGNSLEWNERMFAIYDQPESLREQGLTYEHWSMRVHPDDIAFADAKLKGAVEGTAQYDPIFRVVRTDGSIRYVQAAAQVERDKQGHAIRVIGINLDITEQQQLEESLREAKQEADAASAAKSAFLANMSHEIRTPMNAVLGMLQLIQYTSLSRQQQDYVNKTQTAAKSLLGLLNDILDFSKIDAGKLVLDLHPCSIELLMRDLAVVLSGNHGDKDVEVMFDLDTALPPWLLADQLRLQQILVNLAGNALKFTHHGQVIVGIECLRHEADTVTVQISITDTGIGISDEQIERIFTGFEQAESSTSRRFGGTGLGLAISKRLIGLMGSELQVTSKVGVGSRFWFDMTFPIIQVEETRHTDLSGYRVLVVDDNELTVEILEKILTGFGCEVETALGGYSALAKVKQSHEKSTPFDVVLMDWRMPDLDGLQTAEMLRNSDANNQTPLVVMLTAYGHEVIAESQQIRHVPFVNFLTKPVTSQILAEAVLNAIEGKTMEAHPQPRSQRFLAGLTILVVEDNQLNRQVIDELLSYEGANVVLAEGGMEGVSLVLESGDLFDIVIMDMQMPDMDGLEATRRIRADERFAALPILAMTANASQSDRQECLNAGMNDHVGKPIDMPLLLPSILRLVGREDMTSAEFEGHWPEATQEQSDEQLLDDIRLILRRFGGNQSFFQKMAKSFAPEMMKQLALFKQSTKIFDHSTAAAISHGIKGIASNFGARRLAAHAAYLEKQFKQECPELIDIKRWTDTLEELINQSSQQLSTYFEVEEFNAVISDNVEKAVYSSVSVQSVKTDLNTLMNLLKENNLEAMALTDQLAELLSGHPQWSILHGQVQALDFNQALDTLRIIML
EEA >NC_017571|4887006:4900743|4899378_4900743_+|WP_006079499.1|DBSCAN-SWA MSKRAIVWQVLNSLKARLVISALLFILVLLPLIGVALNDAFTEQVKSATKNELSAYVYSILAVTEVENKQISIPELVLENRFNLIQSGLYAIATTEDASGKQTIVWHSQSFMGMVPPPHFTIPATGQSAFEQIELAEQPHLIYSFSVSFASQNQNVPVTIHIIKDEREFQQQIDQFNQQLWTWLLILMFVMLVFQLSWLVWTLRPLARFTQELHSVEQGKSMQLSSQYPTELQAVARQLNILLNTEQTQRKRYRNALADLAHSLKTPLAVIKSQADLSETSSEQVSVISRIIGHQLKRAQTAAAASWHLGIRIDDVAAKLLRTLAKIYREPQINLSGEMADEAVFKGDEADLTEILGNVLDNACKAAKSTVKLTVTGDAYQLLICIEDDGPGISEALQNQIFERGIRADSYHQGNGIGLAIVRDLVDSYNGRISVSRSETLGGAKFSISFMHSI >NC_017571|4887006:4900743|4897082_4898039_-|WP_006079502.1|DBSCAN-SWA MNTLNKQGMAATVNSQVNTHFVKATAIAALFLASTAVSAQPRELESSQSLTPFTAASVGIAKKAENSEQAAEQEQQALALLQQAAPAKNVTEAAAAAVKSLAPALSSKAPTANRVQLMGASPMTREQVTAKHASQGMPSSSATSEDPYRAPVYHSFSIFDASSRLFEDFDYDGFYQTFSVTFDVDVNGAYLNERADLFAELYLSRDGGPWVHYYTTDVFTIYGDSTQDDYEVLTTLYTGYATDHYDVLIDVYEVGYSDIVATISADETDGLYALPLESSDRDREPEVIVVEESGGAFSVLGLLFLGLCAGLRMNRTRG |
9 | Bacillus_phage(42.86%) | NA | NA |
Homologous phage analysis in the prophage region
Bacterial proteins shown in color are those whose annotation matches a specific phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
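The keyword-based highlighting described for the prophage region can be illustrated with a minimal sketch. It assumes that a protein is flagged when its annotation text contains one of the listed keywords as a case-insensitive substring; the actual matching rule used by the database is not stated here, and the names `PHAGE_KEYWORDS`, `matched_keywords` and `is_phage_related` are hypothetical.

```python
from typing import List

# Keywords taken from the description of the prophage view.
# Stored in lowercase because matching below is case-insensitive (an assumption).
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def matched_keywords(annotation: str) -> List[str]:
    """Return the phage-related keywords found in a protein annotation.

    Plain substring matching is assumed, so 'baseplate' matches 'plate'.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


def is_phage_related(annotation: str) -> bool:
    """True if the annotation would be highlighted under this sketch."""
    return bool(matched_keywords(annotation))


if __name__ == "__main__":
    # Illustrative annotations only, not taken from the report.
    for name in ("Phage tail fiber protein", "tRNA-Arg", "response regulator"):
        print(name, "->", matched_keywords(name))
```

Substring matching is the simplest choice and can over-match (for example, any annotation containing "head"), so a stricter word-boundary rule may be closer to what a curated pipeline would use.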
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|