Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_010001 | Lachnoclostridium phytofermentans ISDg, complete sequence | 7 crisprs | csa3,DEDDh,cas3HD,DinG,WYL,c2c9_V-U4,cas3 | 0 | 0 | 4 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_010001_1 | 288524-288631 | Orphan |
NA
Consensus repeat of NC_010001_1
|
1 spacers
spacers of NC_010001_1
>1.1|288550|56|NC_010001|CRISPRCasFinder GGGTTCCGATAATAGAACTTATGATTGCCGGAACACGTTTTTGGTTCCGATAATAA |
DinG |
CRISPR arrays and Neighbor proteins around NC_010001_1
The CRISPR arrays of NC_010001_1 >merge|NC_010001|1|288524-288631|CRISPRCasFinder AACTTATGATTACCGGAACATGTTTTGGGTTCCGATAATAGAACTTATGATTGCCGGAACACGTTTTTGGTTCCGATAATAAAACTTATGATTATCGGAACATGTTTT >NC_010001|1|1|288524-288631|CRISPRCasFinder AACTTATGATTACCGGAACATGTTTT GGGTTCCGATAATAGAACTTATGATTGCCGGAACACGTTTTTGGTTCCGATAATAA AACTTATGATTATCGGAACATGTTTT
>NC_010001.1|WP_012198262.1|287320_288520_+|hypothetical-protein MSEVNKNLEAIPNPPTVKKGKGKGQRAKYSNVIIFLVLIYVLTIASMVKPVKGYSESENRVLEGRPKFSLESLFNGTFISKYETFVTDQFVSRDAWIGIKTRTELAMLKKDINGVYIGKDGYLIEKVDNSDLEMEQVNRNEKRLYAFINKYKEQLGDEHVFAMIAPTAFEILKDKLPPYASGFDQGAFLDRLDEALANQFIDLRETLTEHKKDYIFYRTDHHWTTLGAYYAYVEWANKIGETPMSQDEFEIKKISNDFLGTIYSKINLKLSSDDMYLYDSGKNYTVEYNMDGVKKNSLYEMSHLDTKDKYSVYLGGNNPVVKIDSDNHNGKKLLIIKDSYAHSFAPFAANHFETTYMVDLRYFNMPMSRFIEENGITDVLVLYNVNTYVKEKSLDNMVR >NC_010001.1|WP_012198261.1|285890_287309_+|MBOAT-family-protein MVFSSLPFLFRYLPIVLILYFIAPRKYRNAVLFFTSLVFYAWGEPIYVLLMLFSTLVDYTHGMLVHHFKQKGEITKAKIALSSAMIINISLLGFFKYSDFAISNINALTGSDIGLLKLVLPIGISFYTFQTMSYTIDIYRGEAEVQKNIISFGAYVVLFPQLIAGPIVQYKTIAKQLQERREDFDQFSYGVLRFMSGLGKKVLLANNIGILWDRISVTPNGELTVVTAWLGITAFAFQIYFDFSGYSDMAIGLGNMLGFQFLENFNYPYMSKSITEFWRRWHISLGTWFRDYVYIPLGGNRCGLGKQIRNIAIVWFLTGFWHGASWNFIMWGVYFGVILILEKFVLLKFLNKLPSFLSHIYAIVLVWIGWAIFAFDDFSKGINYIKAMFGVNTIGFINDNARYLLMNYAIILIVLILGSTDLPKRVANRLVGEHSEKKTTAVVQGLFIVGVFVISVAYLVDASYNPFLYFRF >NC_010001.1|WP_012198260.1|285207_285843_+|hypothetical-protein MKTNREFIEGIYKKAELLRQQKENSKSESWHLRFLRFNREKKRVPAFAASLATFALFALVIITGSQAGKSPNIDNIENKHLRTVEGQNPIANVSAYGIDEDVSNENINTVLGVITEVVDIQNQKYINIQVSKLLCGEGTPDYITITEGLPLTITTESLKEMNVIVSVKPILGQEEYALIDENSIYFYAKEENNQNYYQAIDGTIVSEDSFK >NC_010001.1|WP_012198259.1|284606_285221_+|RNA-polymerase-sigma-factor MNNADNFSSVGKDISSKEGVQSEQDKIIYQNFLDGDMEAFEELVIKHKDRLIYFIQRLVNNLTIAEDLAQDAFVEVLVHKERYHFQVSFKTYLFTIGRNKAIDYIRKNKRMMLVEDYPESYDEENRMEENIIRKEESKLLYDAMKKLKPDYKAAISLIDLEQMSYAEAAKVLKKSDAQMKVLIYRARKSLAKLMEKEGYSYENK >NC_010001.1|WP_012198258.1|283703_284357_+|DUF4358-domain-containing-protein MRNKKLMALSLVAVLAFTACGKKETNEPTPTPTVAPTETPAATETPTETPAEPGTDGSGEELGATELTSNETLDKIHEEVKAAYGDNYLPNMPFTVENLDEMFGIKADWYDAAIAEGPMMSAHVDKLIGIHVTEGNLENVQNALNEYQKKIATDIQYPMNLPKVQASVVETAGDYVFFVMLGTIDEMKYTEDSDMIKAFGEQNQIAVDIIKKNIEAK >NC_010001.1|WP_012198257.1|282191_283226_-|sugar-kinase MLTVNENREFDALALGEILLRLSAPSNERIVRGDTFEKCAGGAELNVVSGISMMGLRTGIISKVPQNDIGTYVKNHLRFCGVSDDCLIFDESRDARLGIYFYENGAYPRKSSVVYDRRNSSINTISMDDIPESTFSSTKLFHTCGITLALSPQTRDVTEECIKKFKEQGALISFDVNYRANLWDEATAKEYIERILPYVDILFVSEETSRRTFGKTGTIKEIMKSYTEDFNIKIVATTERIVISPKKHTFGSTIYNAVEDKFYEEAPYQNIEVIDRIGSGDAYVSGVLYGLLAYDDCQKALEIGNAASAVKNTIPGDLPSTDLKEIQKIISSHQNIGPQSEMNR >NC_010001.1|WP_012198256.1|280344_282126_+|DUF885-domain-containing-protein MKDLKKRGKQGLVTIALSLSILVTGCANKEKPKDLTFEDYSNQMFQEIVSSSAITYSQFIEDPENFGITEYDHVLATLSKKEYDKSIKQCEEDLAQLLKFDYDTLTTAQKIDYDITKGMLERSIASKDSYYYSEPLSPLDGDHITLSGIVSLYGNRYFQTLVEKEKGNKKEVEKFFEIYEMIGKYFNEVAQYEKEKAKAGLFMNSSRAEVVRKACLSVVNNNASDYKKTFQEEVTKLSFLSDSEKKELIEQSDSLVEKHIVPAYQKLVDTMNDLKDQGGKSKGFYETEAGKIYYENLLKSTCSVNATPEELMKLLEENLAVFVNEKDQILADHPNIENEIVISARQWPDAESITKMLSNKAKEDFPDADLAWGVKEMPTCMNSFAGGLFYPFAIDSTLKEEYIYLGTMNAPGTLSFLQVLAHEGVPGHLFHYNYLNDIGTTDYRKVLAWAGTGLVGYLEGWTTYVEEIGYSYGGLSDVQAREAQLNRLIEITLVTMVDIGVNYYGWENDKISEVISQYAPQYLIMSTYIKSIVEESPGLYSSYAVGYLYTKHIIDAINEKSGGTMSKKEVHEKYLSVGPVTYDILMRELGVAQ >NC_010001.1|WP_012198255.1|279074_279992_-|diacylglycerol-kinase-family-lipid-kinase MYHFIINPHSKTGKAKELWQGLRQRLENESINYKEYFTTGHGHATQIAKEICTIDNERKTIVIVGGDGTANEVINGIDNYEDVLLGYIPMGSSNDLARGLLLPKNPAEALDRVLNPRKIRAVDHGQVTFEDGLPRRFSVSSGIGYDAAICQVAQTTKIKNFLNKIGIGKLTYFLIGVKEIFANKPCDATVIADGITYSVKNLIFMASLIHKCEGGGLLMAPDASDNDRKLSICLVSNIPKLKILFVMPTIFLGKHTKIKGVQMITCSSVSIHTQSPLYVHTDGEVLGEHTDLTLRCTSEQVNIIT >NC_010001.1|WP_012198254.1|276344_278903_+|ATP-dependent-DNA-helicase MPDTDLKSIKISVRNLVEFIMKSGDLDNSVGKRDPDAMQEGSRLHRKIQRRMGPEYKPEVALRVTVPVSREDIEFELIIEGRADGIITNIEPTKEDNPILEEKPTLEEKPTLEEKPILEGKPILEGNPILEENPTLGEHHPSEEHPSKQNGAEGNIHVIIDEIKCVYADISQITEMIPVHRAQALCYAYIYAKERVLDTISIQITYCHLETEAIRILSEELKFKELSNWFQNLIQEYCKWAAWQIKWMESRNESIKQIEFPFEYRPGQRDLVTGVYRTIIRDKKLYIEAPTGVGKTISTVFPTVKAMGEGFVSKIFYLTAKTITRTVAEDTYQLLLERGLSMKLVTITAKDKICILDKPNCNPAACERAKGHYDRVNDAVFDLLTSESRISRELIEQYAMKHCVCPFEMCLDVTLWADGIICDYNYAFDPNVYLRRFFENDKKQDYVFLIDEAHNLVDRAREMYSAMLYKQDFLTVKGIVKDKSKTMVKRLEACNEVMLRLKRGCDDIEVLQDVNDLVLPLLRLMSEYEEFFKEYGDFEGREVVSQLFFDLRKFLAIHDILGEDYLIYSDYDERGEFRVKLLCMDPARNLLTCLNKGRSSIFFSATLLPITYYKEQLGGSEEDYAIYAPSPFEVSKRLLMIAKDVSTKYTRRGQDEYERIVSYIEGFVNAKVGNYFVFFPSYQMLQQIAQLSEDRIPNLLLQKTSMGELEKEEFLAAFEENPTNTKVGYCVMGGIFSEGIDLKKDRLIGAVIVGTGLPQVGNERELFRGYYDDRNGSGFDHAYLYPGINKVLQSAGRVIRTVEDKGAILLLDERFLNSQYKNLFPREWEQYDIVNQEKMQELLEDFWSQKNE >NC_010001.1|WP_012198253.1|275695_276031_-|DUF1292-domain-containing-protein MDKHGDDCNCSSDEFFHDQVTLTLEDDTEVVCDIIAVFPCGEKQYIALLPEDAGEEGEVFLYEFIQNGDEIELESIEDDAEFEAVSEAFDEFIDSEEFDEMFGDEEAEDEE >NC_010001.1|WP_012198263.1|288695_289409_-|1-acyl-sn-glycerol-3-phosphate-acyltransferase MKRILLMLLRSFFNLPIWFFQLKRLCNIEKHDRFERYAWLHKNAPVANRRGRVTIDCHGLENLPKEDGYILFPNHQGLFDALAFLETHERPFVTVMKKEVKDIFFLRDVIKLLQAEIIDREDIRQSMTVIKNMTTRVKGGENFVIFAEGTRSKNGNQIGEFKGGSFKSAMNARCPIVPVALIDAFKAFDTNSIKKITVQIHYLKPLYYDDYKGMKSTEIAELVENMIKETIAKFAQE >NC_010001.1|WP_012198264.1|289501_290617_-|RNA-polymerase-sigma-factor-RpoD MEEQVNTFEARLKELIAFANDNKGVIEVDKVNDFFKELNLNVRQIDKIYEYLEANNIVVLNPTDEDEPNEDALLELEDDSDMIGDTEDLSAMTSTISDDPVKQYLKEIGSYPLLSVAEEIELAKKIEAGDNMAKQILAESNLRLVVSIAKRYVGRGLSFLDLIQEGNLGLIKAVDKFDYNKGYKFSTYATWWIRQAITRSIADQSRTIRIPVHMSEVINKTYRVSRNLLQELGREPSEQELADAMNLPIEKVREILKVSADPISLDTPIGEEDDSHLGDFIKDDTIMGPEDAASYAVLQDQISKLLDTLTEREQRVLILRFGLQDGRSRTLEEVGKEFNVTRERIRQIEAKALRKLRHPSRARMLKGYELN >NC_010001.1|WP_012198265.1|291401_291824_+|30S-ribosomal-protein-S12 MPTFNQLVRKGRKTMEKNSQAPALQKGFNSLRKKTTDASAPQKRGVCTAVRTATPKKPNSALRKIARVRLSNGIEVTSYIPGEGHNLQEHSVVLIRGGRVKDLPGTRYHIVRGTLDTAGVAKRRQARSKYGAKRPKEAKK >NC_010001.1|WP_081428460.1|292017_292533_+|30S-ribosomal-protein-S7 MNDCYLTVDIKEGSNVPRKGHIQKRDVLADPIYNNKTVTKLINNIMLDGKKGTAQKIVYGAFEKVAEKSGKDATEVFEEAMNNVMPVLEVKARRIGGATYQVPIEVRPDRRQALGLRWLTMFSRKRGEKTMVDRLAGEILDAAANTGSAVKRKEDMHKMADANKAFAHYRW >NC_010001.1|WP_012198267.1|292843_294961_+|elongation-factor-G MAGREYPLERTRNIGIMAHIDAGKTTLSERILYYTGVNYKIGDTHEGTATMDWMEQEQERGITITSAATTCHWTLELEHKKAPGALEHRINLIDTPGHVDFTVEVERSLRVLDSAVGVFCAKGGVEPQSETVWRQADKYNVPRMAFVNKMDISGANFFNVVDMIKSRLGKNAVPIQLPIGKEDTFKGVIDLFEMKAYYYLDDKGEQIEIKEIPDDMKDQAEEYRAAMIESICETDDDLIEAFLEGNEPSNEELKKALRNATISVQIIPVLCGSAYRNKGVQKLLDAVIEFMPAPTDIEDIKGFDEEGNEIHRISSDEEPFAALAFKIMADPFVGKLAFFRVYSGTLNAGSYVLNATKNKKERVGRILQMHANKREDLDKVYSGDIAAAVGFKFTSTGDTICDEKHPVVLEAMEFPEPVIDVAIEPKTKAGQDKMGEALAKLAEEDPTFRVRTNEETGQTIIAGMGELHLEIIVDRLLREFKVEANVGAPQVAYKEGFTKEVDIDSKYAKQSGGRGQYGHCKVKFSPMDVNGEKVFEFVSTVVGGAIPKEYIPAVQAGIEDAMKCGVLGGYPVLGVRANCYDGSYHEVDSNEMAFKIAGSMAFKDAMHKAGPILLEPIMRVEVTVPDDYMGDVIGDISSRRGRIEGTEDNNGSKIIRGFVPLSEMFGYSTTLRSKTQGRGAYSMFFSTYEPVPKNVQEKVLSNKTK >NC_010001.1|WP_012198268.1|295199_296393_+|elongation-factor-Tu MGKAKFERNKPHCNIGTIGHVDHGKTTLTAAITKTLHDRLGTGEAVAFDKIDKAPEERERGITISTSHVEYESKARHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGVMAQTKEHILLSRQVGVPYIVVFMNKCDMVDDPELLELVEMEIRELLSEYEFPGDDTPIIQGSALRALEDPNSQWGDKILELFDAVDTWIPDPQRATDKPFLMPIEDVFSITGRGTVATGRVERGVLHVSEEVEIVGVKEETRKVVVTGIEMFRKLLDEAQAGDNIGALLRGVQRTDIERGQVLCKPGTIKCYKKFTAQVYVLTKDEGGRHTPFFNNYRPQFYFRTTDVTGVCNLPEGVEMCMPGDNIEMNIELIHPIAMEQGLGFAIREGGRTVGSGKVATIIG >NC_010001.1|WP_012198269.1|296630_297458_+|hypothetical-protein MNMQNKTVFHRKFGKGIIVDLNQNKLSVQFDAGKKIFVYPDAFRQFLVLMEKDGKSYVDGMLKELDRKEEISNRIARKMERHNQLIDKLKLHPSSQIVIRFVENDKATFLEDKIINTGLIQTGKTKGSPVRPSRLHQNSACILTERNEEEDESSRTIFGIAMVEEDFLGTDCKDGKVTLHSQYVLLLPDHLQKLKFWNYYTDERYPEKLVWKSGEFRYCSNLISAQILKDIMSLPLENDAAALAEEFYHYFCEINQIEESTLPLPLGKLLEEGIK >NC_010001.1|WP_012198270.1|297865_299836_+|EAL-domain-containing-protein MILGGLSSRRGYYDEHNTRNKRKIRFFVIMGIGILSIVAFLLSLKGMLHSEAEKKLLEYTGLSADYIKKSEAGKQFMEEKWGSGIFTIIPGAKNPTYFQGNAHSYVVTIKGEAIGAFSESGKDALSIYGNNVIDSIETWEDTKVYQEIIDKKGLVILTKLANGEKYYIAFTSPSWLENGYIISIVSYQEIIQEIQSVLKMAIVIVLFSLLAIILAFFYSILHRNRIKKRRMDMGAVDKITGLPNPLLHKKKVKEKLTKGNESYAYVTFCIDNFELIYELSGKQYCEKLLKQIASKIQIMLVDGELFTRYQNDEFGMLLEYHGELNLRKRLVEMFKYAGDLPQEDNNFCSITFQCGVCEMKKNMDVKDLIQYARQVRNNEVNGYTPNIEFYNKKEEKGEQPKIEEITNALSHNEFLVYLQPILQLDTKRIAGAEALVRWNHKMDGILPPNVFLPMLEEDGSIVKLDMYVLEEVCEYLRDWMDKGKRAVPISVNLSGKHLERPEFITELVEIVDYYQIPHELLEFEFSEVNLYGAMDMMKNAIQKLRELGFLIAIDQFGAGFSSLQLLKELPIHVLKIDKKLIMNLEDSEFSNQEKTIVMHILSFAKARNLTVIAEGVETKEQQDLLIDQQCDMMQGFYYQKPMPSEEFERLLDASGA >NC_010001.1|WP_041703052.1|299815_300406_-|hypothetical-protein MNTTKMERKKITHEIHLVSNYRALLIAKHIGTLLLILYFAPAAITGFDESPALYILLLHNILPAVFFFLFTNKNNSNTKSPRILKPSAYDEVKEEPKKKHTLSFSFAKEMEADTPLLPQLKKKYQYSRVKYQSNSISFLLTCFFLYLWQQQDLTQTNFYLYRYMPVAILAVIVLTRFICIIFYECYIHYSLRSGGI >NC_010001.1|WP_012198272.1|300817_301807_+|MreB/Mrl-family-cell-shape-determining-protein MLGCDIGIDLGTASVLVYIKGKGVVLKEPSVVAFDRDTNKIKAIGEEARLMLGRTPGNIVAVRPLRQGVISDYTVTEKMLKYFIQKAVGKQRFRKPIISVCVPSGVTEVEKKAVEDATYQAGARDVAIIEEPIAAAIGAGIDISRPCGNMIVDIGGGTTDIAVISLGGTVVSTSIKIAGDDFDEAIVRYMRKKHNLLIGERTAEDIKIKIGSAYRRPEVVAMDVRGRNLVTGLPKTISVTSEETEEALKETTSQIVEAVHSVLEKTPPELAADIADRGIVLTGGGCLLYGLEELIEEKTGITTMTAEDPMTAVAIGTGKYVEFLSGKKD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_010001_2 | 1152785-1152926 | Orphan |
NA
Consensus repeat of NC_010001_2
|
1 spacers
spacers of NC_010001_2
>2.1|1152831|50|NC_010001|CRISPRCasFinder TTGGAACATGTTTTTGTTCCGATAATATAACTTATCAGAAGTTTGCTTTT |
CRISPR arrays and Neighbor proteins around NC_010001_2
The CRISPR arrays of NC_010001_2 >merge|NC_010001|2|1152785-1152926|CRISPRCasFinder TAAACTTCTGATAAAAGGTGCGCACTTTGCACCGTCAATCCGATTATTGGAACATGTTTTTGTTCCGATAATATAACTTATCAGAAGTTTGCTTTTTAAACTTCTGATAAAAGGTGCGCACTTTGCACCGCCAATCCGATTA >NC_010001|2|2|1152785-1152926|CRISPRCasFinder TAAACTTCTGATAAAAGGTGCGCACTTTGCACCGTCAATCCGATTA TTGGAACATGTTTTTGTTCCGATAATATAACTTATCAGAAGTTTGCTTTT TAAACTTCTGATAAAAGGTGCGCACTTTGCACCGCCAATCCGATTA
>NC_010001.1|WP_012198930.1|1150708_1152709_-|ABC-F-family-ATP-binding-cassette-domain-containing-protein MILACKNISKSFGTTPILDKVAFHVNEREKVAIVGINGAGKSTLIKIIMGELTADEGEIIFAKGATVGYLAQHQDLSTDSTIYEEVLAIKSDIIKMEETIRRLEIDMKSATGAELERMLSSYSRLTHDFELKNGYAYQSEVIGVLKGLGFTEEDFNKKVSTLSGGQKTRVALGKLLLSTPDIIFLDEPTNHLDMESIAWLETFLVNYSGAVVVIAHDRYFLNKVVSKVVELDNTKATMFEGNYSDYAMKKEQLRETMIRHYLNQQREIKHQEEVIAKLRSFNREKSIKRAESREKMLDKIDRLDKPVTVNDKMHIALEPNIISGNDVLTVTDLRKSYGSLTLFDQLNFEVKRGEKVAIIGNNGTGKTTILKIINQIINADAGDVKLGAKVFVGYYDQEHHVLNMDKTIFDEIQDTYPNMDNTRVRNILAAFLFTGDDVFKLIKDISGGERGRVSLAKLMLSDANFLIMDEPTNHLDITSKEILENAINNYTGTVLYVSHDRYFINRTASRILDLTNQTFLNYIGNYDYYLEKKPEMEWRAFGNNNGYNQSDANDNILNGLRINKHLDQANKELQTSSFPQEPVSENKLDWQQQKEEQAKLRKRQNELKKVEDEITRLEARNEELEVLLADSSIYTNSSKLIEVHKEKKELEERLEVLMEQWEELSE >NC_010001.1|WP_012198929.1|1149783_1150539_+|DUF3221-domain-containing-protein MKSKIYLYLFILLFTGILVGCTKKPFSIENTKFKAVVIDNNNGLLVKPDVDSNEFRLADKISIGANSAMIFNQDKKEVDLNEIQIGDVVKITYDGIILESYPAQITAVYIEIFESNLLIDGYITIIDDIYKLDSGLNSDINMIALNLTEATNLSEIDKEILLMKLYEMYGLEVKESTFDQLVKEGLINEEELYFPTGIIITISNSEYNEGKQTLEYVINKWRSGLGAIGYKGKAKFDGEEWIISKKSMWIS >NC_010001.1|WP_012198928.1|1148790_1149330_-|cupin-domain-containing-protein MKIGAKIKELRVQKSLTQEELADRAELSKGFISQLERDITSPSIATLVDILQCLGTNLEAFFTDTTSEQVVFKRGDYFEKVDNELNNKIEWIIPNAQKNMMEPILLTLEPGGSTYPDNPHEGEEFGYVISGSITIHIGNKTHRVKKGESFYFTPNKNHYIAATGKTGATLLWVSTPPSF >NC_010001.1|WP_041703992.1|1147702_1148776_-|ABC-transporter-ATP-binding-protein MDNKLIDLINITKRYGNNVVIDDLNLYIRENEFLTLLGPSGCGKTTTLRIIGGFEQPDQGRVIFDGKDITKLPPNERQLNTVFQKYALFTHMTIEENIAFGLKIKKKSRQYIKDKISYALKLVNLDGFENRMPDSLSGGQQQRIAIARAIVNEPKVLLLDEPLGALDLKLRQDMQYELIRLKNELGITFVYVTHDQEEALTMSDTIMVMNQGYIQQIGTPEKIYNEPKNAFVADFIGESNIINATMVQDRLVNILGANFPCVDVGFGKIQPVDVVIRPEDIDLVAPEAGIITGRVTSLIFKGVHYEMTVMANGFEWLVHSTDLSPVGAEVGIKVDPYDIQIMNKPESEDEEAVGVNE >NC_010001.1|WP_012198926.1|1146816_1147710_-|ABC-transporter-permease MNNQEESKVTAQEATVTLVRKSHHFSGKSLLTFPYILWMGAFIIIPLIMVVYYGFTTKANNSFTLENIKLIADPVNQKALYLSLKLSLISTLICLLLAYPLALILKSMKLKSNSFVVFVMILPMWMNFLLRTIAWQNILENNGIINTLLKALNLPTVNIINTPTAIVLGMVYNFLPFMILPIYNTLAKIDDNVINAARDLGANGWITFRKIIFPLSIPGVISGITMVFVPSLTTFVISNILGGSKIVLIGNVIEQQFQKVGNWHAGSGLSTVLMVFILISMAILAKYDKESEGTNVW >NC_010001.1|WP_012198923.1|1143167_1144637_-|catalase MDRRNEKKCCNYLTDSLGRPIPNDTNSLTVGSDGPVLLQDVHLIDKISHFDRERIPERVVHAKGTGAFGYFQPYCDWTDYTCAEFLKNPNCKTKVFVRFSTVIGSKGSADTVRDPRGFAVKFYTTDGIYDIVGNDLPVFFIRDGIKFPDVIHSLKPSPDNNLRDPQRFWDFVSLSPEATHMVTWLYSDRGTIKDFRHVDGFGVNTYIWVNECGKRVYIKYHWKTQQGLQTIDRFEAEQLAGSDPDVAVRTLYESIANGFYPSWELCVQMMDPDMIECLDFDPLDDTKVWPEDQFPLMPIGLMTLDCNPENFFAEVEQAAFCPGNIVPGVELSADKMLQGRSFSYFDTQRHRLGPNFAQLPINRSISCINNNQRDGQGTYIFNPNPINYSPNSLNCGFPKVAEVCQSEPECVCGYIARIPIKNPCDFKQAGERYESLSCEERCHLIDNIAVELYKCNQDIIDRVLCFFFKAHQEFGEQVECAIDYYRQMC >NC_010001.1|WP_012198922.1|1142073_1143030_+|D-2-hydroxyacid-dehydrogenase MKIVIMEANTLGNDVDLGMFQEFGDVVIYGESNPLENAERIKDADVIIVNKIPMNEDILKGATKLKLICLTATGTNNIDFTYTEKRGISVANVKGYSTQSVVQHTFALLFYVYEKLAYYDQYVKSGDYTRSDIFSNFDVKFHELYGKTFGIIGLGEIGQGVAKIAELFGCKVVYYSTSGKNLNSDYERVDLQTLLKISDVVSIHAPLTKATTNLIGEAELEMMKPDAILLNLGRGAIVNQEALANALLAGKIGGAGLDVLTVEPMLADNPLLKVKDSTRLIITPHIAWATVEARNRCAKEVYFNIKSYLSGEPRNIVE >NC_010001.1|WP_081428497.1|1140678_1141920_+|HAD-IA-family-hydrolase MIKNIIFDIGQVLAEFRWRDYIDELTIKEEYKERLAKATVLSPYWNEVDRGVLSKEEIMKRCISIDPEIEKEIKLFFDDTSQLVEEFEYSEELVKDLKSQGYHIYILSNYGRENFSYVKNVFRFLKHVDGAVISYEEQHIKPEPQIYEALISRYGIVPEESVFLDDLAGNLEGAKTFHFHTICFHSLWQAKKELRNLGVMVEEREFDSIIFDLDGTMWDSTENAAIVWKEIAKKDSRITDEVTGPKLKALYGLPLEDIARGLFLSVPEDVAIETMEKCVVAQCPYLAEHGGILLGKIEETLKELSKKYRLFIVSNCKSGYIEAFLEAHKLGQYFDDFECPGGTGKLKADNIRIVMKRNQLRNPIYVGDTGGDGDAAHQAKIPFVYARYGFGEATEYEYVIDSFDQLTTLRMTE >NC_010001.1|WP_012198920.1|1139973_1140453_-|S-ribosylhomocysteine-lyase MKPIASFTIDHLKLLPGVYVSRKDSAGDAIITTFDLRMTRPNFEPVMNTAEMHAIEHLAATFLRNHAVFGSKIIYFGPMGCRTGFYLLLSGDYTSAEIIPLMKELFTFISEFEGEIPGAAAIHCGNYLDMNLPMAKFLAKRYLTEVLDSITEEQLEYPN >NC_010001.1|WP_012198919.1|1138602_1139820_-|hypothetical-protein MEETNQAEKSPDNITSFSGQSRTSKMIQAAAPYLDAGTRKTADLFIKFNDFMDMIRTFRQQGGLGLFGRKKADTKDDTVSATGLPGLQGLFPGLQGLFGSGKGEGSINFEGILRSIRPYCTPPEISLVDNVLNIFSMKRVMDMYQNMSGMMNMPGMNNMQGMNNMQGMNNMQDMMNNLPNIMNMMNMMNTMSGSPFAGATSGSPTQNAPSQDYNSQGYPPPNASAPPPSSMNYDNSYYESTSPTPYDLLYQAMYGQGPPPDATTNTAGQTVSEASNMQMPGNSPTSGNVQTNWPNYNVPPPVNLPPYDMGNMNRDVTSAPYYTNNAPYRATRSAAEAARKGNSPKQAQNVSAASNTQSGQSGNHSPARQNNQQMFDMLASMVPPEQKNTFDTMKKMFESGMFMPT >NC_010001.1|WP_012198931.1|1153270_1153918_+|redox-sensing-transcriptional-repressor-Rex MYDKTISSAVIKRLPRYYRYLGELLENDVVRISSKELSEKMNVTASQIRQDLNNFGGFGQQGYGYNVEYLYTEIGKILGLDKKYNVIIIGAGNLGQALANYTDFERRGFYICGIFDVNPRLIGISIRGIEIRLIDELEEFMKTNTVNIAALTIPKAKAPQVAADLVSLGIHAIWNFAPTDLNLPKDVMVENVHLAESLMRLSYNLKAAEESGESI >NC_010001.1|WP_012198932.1|1153914_1154682_+|hypothetical-protein MRLGKRFGDKSEPKTKSLSDLDDLDEDFDLEFENVKVSSYVEPSRMEPRKINRSKVFSEMSLRQIFRNRKIPIVTLDERFINLFPEEKMSGVQRRLRDELVELMKDQSRVLDDIKGLKRYKSQLMQEIMDNMEVDHTPIGRLKERKLAKNQKLIEDINQKLLIAEDSLEKLPGEIAAKNEELMVESLQSCYGNIFEKNLRKKSLEDEIRETEVKLRNLKKQKLEIEKDYRGTYTYLYDMLGTEMMRKIDEEQDLL >NC_010001.1|WP_012198933.1|1154706_1155075_+|holo-ACP-synthase MIFGIGTDMIEINRVVKACERKTFLTKIYTEQEQKLLLSDIRKAASNFAVKEAVVKMFGTGFRAIAPNEIEVLRDNLGKPYVNLYGNAEILAKEHNVERIHVSITNTKELVSAYVIGEIIRE >NC_010001.1|WP_012198935.1|1155088_1156636_+|bifunctional-ADP-dependent-NAD(P)H-hydrate-dehydratase/NAD(P)H-hydrate-epimerase MRYALDAVQMKNLDKKTIEQIGIPAMVLMERAALYVAEQVREHAKPTDKIIAVCGTGNNGGDGIAAARILHLWGYHVTIGIIGEMEKFSKECREQWKIAKNLGLSIRTEWEITEYNIVIDGIFGIGLGKPVSGEYAKVIQSINQSDCYVVSIDIPSGISASNGQVFGCAVKANETVTFGEQKLGLLLYPGATYAGKIHIADIGFAKEKLDSLTYTYYETTDLDKLPIRMPYSNKGSYGRVLVIAGTESMTGAAYFSAAAAYRMGAGLVKILSAKKAIPVLQGMLPEALFAAYDEEDYEEQVNKALEFATVIVIGPGLGVEAIAKKLLLKVCKEAKVPLIVDADGINLLAMLADEMIPDVLQLTDEVERLHQRIYYIKEILPEGTILTPHLKELSRLTLYPLNKIPCNLIDIASYCTYNNLMIYVLKDSRTIVASKDLRYINVSGTHGMATGGSGDALTGIIAGLIAGGLEAGKAATLGVYLHGLAGEEAAKVKSTYSMLAGDMIEALPEVLRNHD >NC_010001.1|WP_012198936.1|1156628_1157798_+|alanine-racemase MIEETYNQRYLRVSANINLDAIIHNVAEARKNIKKETGIFAVIKADGYGHGAVPIARAIDNDVEAYAVAIVEEGIELREAGITKPILILGYTAPELLTEIVQYDLTQTVFQLSMAEKLDEIARTLGKVAKIHIKLDTGMSRIGYQPTAESIDEIVRMKKLSNLMLEGIFTHMACADMTDKTSAKKQFELFTAFVNQLEEQGVKLPIQHISNSAGTIDLPEMNLSMVRFGISLYGLYPSEEVDKNHLSLEPAMELKTHISFVKELEPGHGIGYGSTFVTKKTMTIATVPVGYGDGFPRQLSNVGRVLVHGEFAPIVGRICMDQFMIDVTDIPEVKQGDIVTLVGRDGDNIIPVEEPADLAGSFNYEFVCNVGKRIPRVYYQNGKPVSIRH >NC_010001.1|WP_029501644.1|1157930_1158278_+|type-II-toxin-antitoxin-system-PemK/MazF-family-toxin MIIKRGDIFYADLRPVIGSEQGGVRPVLIIQNDTGNKHSPTVICAAITSKMNKAKLPTHVEIDADKYGIVKDSVILLEQVRTIDKSRLKEKVCHLDQDILKRIDKALLISFALDT >NC_010001.1|WP_012198938.1|1158766_1160230_+|HlyC/CorC-family-transporter MDGHPIRGLVLILVLVALNAIASAAEAAIENVNEALAEKRAEEGDKKAKRLVRLLDTPHRYINVIEILLTLASLLIGMTYSFQLYRVIEKLVETSTLPEAMAITTSIAMVLVTILITYLIVLFGMLLPRKLALKYADSCAFKMAGMILTCSHLFAPIIWLLEKNTNGILRLFGIRPSDLEDNVTEEEIMSMVNEGHEQGVLEAEEAEMISNIIEFNEKAAKDIMTHRKKMIAINSALCIEDALRFMLDENYSRFPLYDGDIDNIVGLLHLKDVMLYFLDPRLKVEPLSKVAREPYFIPDTQSIDVLFHDMQTKKIHMAIAIDEYGQTAGIVAMEDILEEIVGDIQDEYDDEEELYTRLEDDSYLLSGEASLEDLEDILSLPFAEEDIKNYDTLNGLIVSLLDHIPGDDERATIRYCGYEYELMEIQNRMITSVRVRKIPEEELKASDNEDNQVSQRLGAAMTDAIDTTDEKILSNVEDIILEKKKDK >NC_010001.1|WP_012198939.1|1161111_1161885_+|threonine/serine-exporter-family-protein MNYKLLVDTAVLAGEIMLRSGAETYRVEDTIYRILKTSGFDRCDVFVVSTGIIVTLADSSIDAISQVRRVAERQTDLGNIYYANDISRKLCSGEIDLETANEKLSELTKTVRYPVWLAYLCLIIAAPGFAILLGANFIECFLAMWNGIFIMVSNIMSKRLKINRFVTNMMICAVMAISTTGIVNLFHLNAEMELIIAGAIMPLLPGVALTNGIRDTLQGDYVSGAARLVEAFVTAASLAVGIGAGLALAKVLLGGIV >NC_010001.1|WP_012198940.1|1161881_1162484_+|threonine/serine-exporter MIVQIIGAFIAVFALALAFGVPRKFLVYSSIVGAIDWLVYLISLERGLGLAMSVFVSTLVIAFISHAFARKFKAPVTVFLIPGILPLVPGVGTYRIVYYLILEDGANASYYFYQTLQIAGMIAIGIFIIDTFFKFFQKPLIKAGVCEVAEDTLPQGSDSLEDSTGHSPEEEERRMEQDLRARAEALRKKMKEREKDDLGL >NC_010001.1|WP_012198941.1|1162541_1163021_+|23S-rRNA-(pseudouridine(1915)-N(3))-methyltransferase-RlmH MKITVVCVGKIKEKYLTMAIEEYSKRLSRYCKLEIIELADEKTPDNASPAEELQIKKKEGERILKNIKDNAYVIALAIEGKMLSSEELADKMQLLGVNGESHLAFVIGGSLGLDSEVLDRADFKLSFSKMTFPHQVMRTILLEQVYRGFRIMSGEPYHK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_010001_3 | 1568588-1568684 | Orphan |
NA
Consensus repeat of NC_010001_3
|
1 spacers
spacers of NC_010001_3
>3.1|1568612|49|NC_010001|CRISPRCasFinder CTCCAATATAAAATAAAAAGTGATTACTGTGTTGGTTCCAATTATAAAA |
CRISPR arrays and Neighbor proteins around NC_010001_3
The CRISPR arrays of NC_010001_3 >merge|NC_010001|3|1568588-1568684|CRISPRCasFinder TAAAAAGTGATGAATGTATTAGTTCTCCAATATAAAATAAAAAGTGATTACTGTGTTGGTTCCAATTATAAAATAAAAAGTGATGAATGTGTTAGTT >NC_010001|3|3|1568588-1568684|CRISPRCasFinder TAAAAAGTGATGAATGTATTAGTT CTCCAATATAAAATAAAAAGTGATTACTGTGTTGGTTCCAATTATAAAA TAAAAAGTGATGAATGTGTTAGTT
>NC_010001.1|WP_012199273.1|1566937_1567600_+|response-regulator-transcription-factor MNHILVVEDDEKLRNGLVLSLSSNNQEVKAAPSIKSAKELLKIHKFDLLILDCNLPDGNGIEFCREISGATEIPIIFLTVNDTEIDIVSAFRVGATDYVTKPFSIMVLRERVKAALRRNKCRDDIYKDDYYYFNFTALEYKIEGVEVILSVVEQKIIKLLVCNKNKIIPRERLIDLVWSCNEEFIDDNALTMAIKRLRFKIGNEAIKTVYGLGYMWVGKI >NC_010001.1|WP_012199272.1|1566224_1566848_+|response-regulator-transcription-factor MRIVIVDDDRLVCSSLKVILEMDSEIKVEAIGNDGREAITLYEKYTPDVLLMDIRMSTMSGIDAAEKILVKHKDAKILFLTTFSDDEYIVKALKIGAKGYLLKQDYDSIQPALKAVSMGQSVFGSDVITKLPDLMQKKKEFPYGAYQLTKKEYELLTMVAEGLSNKEIADVMFLSEGTVRNYLSNLMLKLDVRDRTQAAIFYYKQMQ >NC_010001.1|WP_012199271.1|1565059_1566136_+|two-component-sensor-histidine-kinase MNSIYDRLLILICCSFFLDINTNIKYAVIALLVAMAISEFNYVMEKELFLYGSYICYFLLCFFMPQFFAFLPLLLYEAVCFRKKTLSIAVVFTAGYQLVQFDSTFKCLFFIMLHIISVTFAVRTMKQSQLKAQLFKLQDTTKESNMELEARNQELIRQQDTEIYLATLKERNRIAREIHDNVGHMLSRSILIVGAAIAVNKNEESNELLCGLKDTLSDAMNSIRLSVHDLHDGAIDLRTSVEQLVNDFSFCKVELDYDMGNVVNRNVKYCFLTILKESFSNMIKHSNATKVEVLLREHPGMYQLLIKDNGTGGKKTSEEGIGLMNMKDRVNALGGNITITSEKGFRIFVMIPKTQEEK >NC_010001.1|WP_012199270.1|1562953_1564849_+|ABC-transporter-ATP-binding-protein MQNKPVIKRKGVLKRLIKTLFEFYPVMLPIVVVCIVFNAVISSIPSIFMQNIISSVESTWQTGDWKSVSGHIAGLVGLLATCYVLSLAASFAFNRMMAIITQGSLKKLRVKMFHGMQNLPIQYFDTHNHGDIMSYYTNDIDTLRQMVSQSIPQLMTSGIIAITVFCIMLYFSVWMTLVVLVGVFFMYKITKKVGGGSAKYFIRQQAALGKVEGYVEETMNGQKVVKVFCHEEECKAGFDEINDALFADAERANKYANTLGPILNNIGNILYVIVALFGGFLLLTDAPNVSISGFAISISIVVPFLNMTKQFSGNINQVSHQINAVVMGLAGASRIFELIDELPEEDEGNVTLVNAREENGAIVECKERTGIWAWKYPHDDGTVSYTKLTGDVRMFDVDFGYVENKTILHNITLYAEPGQKIAFVGATGAGKTTITNLINRFYDIADGKIRYDGININKIKKSDLRRSLGVVLQDTNLFTGTVMDNIRYGKLDATDGECIEAAKLAGAHDFITRLPDGYQTPLTSNGSNLSQGQRQLLAIARAAVADPPVMILDEATSSIDTRTEAIVQRGMDALMKGRTVFVIAHRLSTVKNSDVIMVLEQGHIVERGNHDQLIAEKGKYYQLYTGAFELE >NC_010001.1|WP_041704100.1|1561218_1562964_+|ABC-transporter-ATP-binding-protein MYKVLAKSIREFKKHSIKAPVFVSFEVMMECTIPFITAKLVNQIKAGCDFGVIARYGLLLVVMALLSLMFGTIAGTACATASTGFARNLRKDLFYRIQTYSFENIDRFSASSLVTRLTTDVSNVQNAYMMIIRTAIRCPLMLIFSFTMAFVMGGKMAFIFLFVVPVLGFGLFFIIRKVMPLFKKVFRKYDVLNNSIQENVKGMRVVKSYVREDYEKSKFEVAAGDVCADFTRAEKILAFNNPLMQFCLYTVMVFVLYFGSYTIITSRGLDLDVGQFSALLTYSFQILSSLMMLSMVFVMITIASESASRIVEVLQEESTLTSPELSLKEVKNGSIDFEQVSFKYSKKAERMSLEEINLHIKSGETIGIIGGTGSSKSSLIQLIPRLYDATKGVVKVGGEDVKKYDLDSLRNQVAVVLQKNVLFSGTIKENLRWGNKEATEEELVEACKLAGADEFISRFPDGYDTYIEQGGANVSGGQKQRLCIARALLKNPKILIMDDSTSAVDMKTDALIRKSLKEFIPETTKIIIAQRTASVEDADRIIVMEGGTINAIGTHAELIRSNNIYQEVYLSQNKVGDQDAE >NC_010001.1|WP_012199268.1|1560164_1560710_+|hydrolase MKREEAWKLLTEFNKEEFHLEHAQIVEQTMKYFAKKLGYNEEEDFWGIVGLLHDLDFEQFPDEHCIKEQEIMRERGVDERIIHAAASHGYGITVDIKPEHEMEKILYAVDELTGLIGAVVIMRPSKSVQDLELKSVKKKYKSKGFAAGCSREVIERGADILGWTLDELLQETIDALKTFRD >NC_010001.1|WP_012199267.1|1558777_1559959_+|aminotransferase-class-I/II-fold-pyridoxal-phosphate-dependent-enzyme MKPLSERTANFSDSVIRRMTRISNQYDAINLSQGFPDFNPPKEITDRLANIAGEGPHQYALTWGAENFRYALAKKQEQFSGMKINPDTEIVVTCGSTEAMMAAMMTVTNPGDKVIIFSPFYENYGADVILSGAEPIYVPLKPPAFSFDANELEDAFKKGVKALILCNPSNPCGKVFTYDELKIIADLAIKYDTYVITDEVYEHIIYEPNQHIYMATLPGMRERTIICSSLSKTYSITGWRLGYVIASPFVIERVKKVHDFLTVGAAAPLMEAAVVGLNFGEEYYKELQKHYTQKKDLFIGGLSDLKLNFTDPQGAYYVLVDVSEFNVKDDVRFCEWLAREVGVGAVPGSSFFKEEVNHLIRLHFAKKDETLIGALDRLTDLRKKAIQSNGYFK >NC_010001.1|WP_012199266.1|1557320_1558700_+|FAD-binding-protein MAEVNVKILNSIINDGDRILIDKIDDSYLSDALGRIKGHADVVLFPVNVDEVSKIMRYAWENQIPVTPRGAGTNLVGSTVPVEGGIVLDLTRMNQIIEFDEETMTATVEAGVVLADFQEYVEAKGCFYPPDPGEKTATIGGNISTNAGGMRAVKYGVTRDYVRGLEVVLANGEILWVGSKNVKDASGLSLKNLIVGSEGTLAIITKCILKIIPKPEVTLSVLLPYRDVKTAIPGVLTIIKENANPTAIEFIERDVIKLGEDYTGLSFPYPKAGAYILMTFDGRSLELEGNVERVKKSAIKQGALDVLILDSEELLMNVWKIRGCFVKAVEAVSEQEPVDLVVPVNKIVEFISYVSEYEKKSGMRMIRFGHAGDGNIHLCMVRGNRSDDKWEKELQEHLNAIYQKAFLLGGLTSGEHGIGLSKRIFYLKETAPQNLELMRQMKRAFDEREILNRHKTYLA >NC_010001.1|WP_012199265.1|1556432_1557260_+|metal-ABC-transporter-substrate-binding-protein MRKLKKLGILALGLTLAFAVTGCGKKDAAKDNKVVKVGVVGESNEMWVPVIEELKKEGIEVQLVTFTDYNTPNAALNGGEVDLNAFQHYAYLNKEKDNNGYKIDSIGDTFISAMNIYSKKIDNLSGIKEGDKVAVPNDATNEGRALKVLEAAALIELNKAAGDSPEVKDITANPFNLELVEVDAANVYALLPDVTIAVINCNYALDNGLNPGKDSLFQDSVSIYAGKNYVNLIAARTEDLDNEVYKKIVKAYQSDAVKDVYADTFKGSYLAAWEE >NC_010001.1|WP_012199264.1|1555389_1556349_+|L-lactate-dehydrogenase MAKPRKVIIIGAGHVGSHAGYALAEQGLAEEIIFIDIDREKAKAQALDIYDATVYLPHRVKVKSGDYSDAADADLMVIAVGTNPDKNKGETRMSTLTNTALIIKEVAWHIKNSGFDGMIVSISNPADVITHYLQHLLQYSSNKIISTSTVLDSARLRRAIADAVEIDQKSIYGFVLGEHGESQMVAWSTVSIAGKPILELIKEKPEKYGQIDLSKLSDEARAGGWHILTGKGSTEFGIGASLAEVTRAIFSDEKKVLPVSTLLNGEYGQHDVYASVPTVLGIHGVEEIIELNLTPEEKGKFDASCRTMKENFQYALTLS >NC_010001.1|WP_012199275.1|1568747_1569437_+|ABC-transporter-ATP-binding-protein MLIEIKNLKKTYGIGETTVHALKGINLSIEQGEFIAIVGTSGSGKSTLLNLIGGLDYPTEGNILINDRDIYALKPDELTIFRRRSIGFVFQSYNLVPILNVYQNIMLPLQLDNVRPDKKFLELIINTLGISEKKNSLPNNLSGGQQQRVAIARALIAHPQVILADEPTGNLDSKTALEVILLLKQLNETYGQTIIMITHDEEIAQIATRRIHIEDGRLISDTKEVFGHE >NC_010001.1|WP_012199276.1|1569429_1571748_+|FtsX-like-permease-family-protein MNKGLIVLASHIIKSKKIRTLAISVSIMLTAILFITVGGITSCIYQSLEISKQLATGSNFHAVIDEVPISKKKEIEEHRLVKNSYVVNHLGQATIGSTKTDEYCEIYSCSDSTILNHMFMNIIEGSYPVNDSQILIDEEYLLKHNIPLNVGSEIYLYNIYSEETRYILSGYYQSTADNTATRPAFTISNDDKETTIYLLLNNPINIEGKIKKIISDVQLVPNYQVNEAFNLAKTHFFNVQSVSIIIFVFLVILSCGFLAIYNIYYIALTGEIKFYGLLETLGTTTKQLKKLVFYQVTMIYCFSFPIGLLLGYFIGWKIISPIFMSLSGKEYIYSFHFSIFIFTVLFTYLTIIISAILPIKRITNMSCISALNEEGIKNCNNSRILMKDRISLWYFAIKNLKRNLKKAIISIISIAISIILFLFTMSMANILLEDSRVQTYDFCIDELKELIDIKKEIHLLLERDLENIQQIPGIKAVIPIYTKKISKGVEDIIIYGIPNEAIEKFKTQWFIGKFDKELFKTGTNAIIYKYEKTSENTTFDTENNIIELDMLKNPYGIQAFEKGNRPILSNFTILNFYSIYDYALYIPFDQFNHEFSDYHIESINIQAEKGYEDIILRQLKSMFDSNIQIRDRREQLSELSERLMALKVTGYSMSVILAFIGILNYLNVTICSLYERRREFALLNIVGMTQKQIFLCLLLECLYYVILAVMISILFGTICFKIIYLIIGMDVKMQFSSIIGMGLILFLTTIFTNFLVYFRMKKILPIEALRSC >NC_010001.1|WP_012199277.1|1571955_1573083_+|exonuclease-SbcCD-subunit-D MKFMHLSDLHIGKRVNEFSMIEDQTYILQKILELADEEKPDAVLIAGDVYDKNLPTIEGVNLLDDFLSDLHKRKIPVFMISGNHDSAERLNFASRILRNNEVYIAGTYQGEIARYTLNDGHGPVNIYLLPFVKPAIASVYHEGIESYHDAVKAILAAAKVNKAERNILVAHQFVTAGDISPECCDSENISVGGLDNVDVSVFDDFEYVALGHLHGPQRIGRDTVRYAGSPLKYSFSEAKQKKSVTMVTIDTKGEIKQEYIPLIPLRDMRQLKGPIDELLNPKNYHNGNTKDYIHATLTDEEEIYDAIGRIRSIYPNVMRIEFDNSKTKPNETAKLVAEDVIRKDPLCLFEEFFKNQNNVSMSDEQNEIMKKLLFE >NC_010001.1|WP_012199278.1|1573126_1576243_+|SMC-family-ATPase MKPLELAISGFGPFKGEVNVPFEKIGESGLFLISGDTGAGKTTIFDAIAFALFGCASGENRTTDSMRSDYATGDDKTYVKLVFSHKGRRYEVERNPLYQRAKKRGDGFTEEKPNATLIKWDGSVVAGYQPVTNEIMEILSIDYKQFKQIAMIAQGEFMKLLTASSEERGVIFRKVFQTGNYEAMQKKLKSMASELRGECDQLERSMVQYLSGILLSKENEVLEEWKRKPDIHKINDLLELLELDLEEDKSRYDTLEVENKELSGKLVELTTKITLVEEQEKKKQELEQRRLLLEDLRKQSEQIKLNQIDLQNAKKALYQVKPVADAYHKSRVETENLVREIIEQKKRFDIVSEETKKKQAEYHEHEKDKARLEELAIAINQCKEELAQFENLKQLEIKIQSNLKNQDAIVSNEKKIEEQGKLLKGDHSELTKELQGYLSIDQEILECIQIGKDLKSKITKLKSLLDELNRIELESSNLKVLQQEYFKKENSYQTANKEYQTLELAYFREQAGLLAMNLKGEEPCPVCGSTKHPKKAECSKEAPTEAMLNQAKVKLESETVVLNQQSLSVSNQNTKISLMWDNLCVVCEELFEESFGKDKIKDRITEELSRSEEAFLQKNEEYRVLKKNQERRDWCNKRTTEISAALEENVQNIQNLNQEKIAIATVLGQMEGSREQILERRKYATKEECENKHTALLLESNQLRSNLERLEKEFHELRSQWSALKAVIEDNEIKEAKQKVTLEEEEKAYQRKLTETSFDSEESYLACLWTEDKIEQTQKMIEDYEKQVSEQHLMIEKLVNEIKETDSVDIQILKDSRDEINAQKSVCERQKEEVNRRIRNNDQIYKDAKKQLEAKGEIQRKYLSINELSKTANGELTGKVKIAFEQYVQAFYFDTVIEEANKRLRKMTFSQYTLHRADSVNLRSQGGLEIFVLDHYTGKQRTVKSLSGGESFKAALALALGLSDVIQSYAGGIELDSMFIDEGFGSLDSESLEQAIETLISLTSGNRLVGIISHVTELKERIDKKILIHKTMEGSYIK >NC_010001.1|WP_012199279.1|1576392_1577199_-|phosphotransferase MEDMLGKLVGSGGTSNVYEWGNNEVIKIYKPRIEENTINNEMYIGQFLNKFSLNIPKCIGSIDYNGKKALIYERIYGNVMAEPLLKGVYDIELANKFAQMHYDIHKKTIEELPSQNEFLKKRILELKDTLGEKATLSLLNLLDDIPNDFKLCHGDYQPLNIIGEANEYIVIDWNGACIGNPILDVAWSYMTLNSPVVEYLLGDLVSDLFSKFAKDYLSYYCKLSGIKQVSVLKCLPIVATRRLYDNNMNDNENSRIEREWLFSFIRKI >NC_010001.1|WP_012199280.1|1577526_1578117_-|dipicolinate-synthase-subunit-B MKLSGKNVGVALTGSFCTFAKTIQEIQNIVNEQANVIPIFSFNAQTIDSRFGKAADFMEQITKITGNKPVLTIAGAEPLGPKGMIDIMIIAPCTGNTLAKFCNGITDTPVLMAAKGHLRNQKPLVISLATNDALGINFKNVGYMLNCKNVYFVPFGQDDFNKKPNSMISNTSLIIPTLELAMEGKQIQPIIESPEG >NC_010001.1|WP_012199281.1|1578133_1579018_-|dipicolinate-synthase-subunit-DpsA MSQLSKIVFLGGDLRQYYMIKQLMEAGFPVAVYGLDRGEFGDTIYEATTLKEALSFGNIVICPIPVSKNQVDIVSKQTIPDLNLDKLKENLTEGHTLFGGCFNKSMSEFCDKKNIRLYDFMEIESVSIANAIATAEGTIAEAIQRSPVNLHKNECLVLGFGRCAKILADKLKGMGAKVSVGARKEEALAYIDAYGYENIPISELSKHLHRFPFIFNTIPAMVLDSALISYVRKDAVIIDISSKPGGVNFDYCNQLGINASLCLGLPGIYAPKASATILVTALFNCISGSASSKD >NC_010001.1|WP_012199282.1|1579430_1580060_+|histidine-phosphatase-family-protein MNIFLIRHGRQSSQLCNVDVDLAVEGREQAKLLGKRLSEYGIDCLYTSDLLRARETAEIAKIYLGNVDYRIRTELREIDFGRMTGNSDEYNNMAFADFKKKRMELSEDLPFPGGECGQDVVDRVRDVLEEMIHSGKQRIAVVTHGGVIRSIVTDILGMPQSKKLLFAVSLENTSITQLRFDRDYQRFYLERFNDFTHLEANNNLLRRNW >NC_010001.1|WP_012199283.1|1580099_1580870_-|Cof-type-HAD-IIB-family-hydrolase MIKIIASDMDGTLLLNGCQQVSDRAISIIKQLHDKDILFVAASGRQYPNLYRNFKDVAKHMAFICENGSLVMYQDKVLYKSVMEPKLAKELFQTIYEREGCEVLASGQNTSYLLPKTDSYVHRMKNIVKNNVVVINSFEEIPEDIIKISVYEVDGISHSASYFTSLFGNKLKATISGEQWLDFVNPFVNKGAALSHLLDYLSLSPDEAMAFGDNYNDLEMLSLVSYGYVMDNAVPDIKNRYSYKTSLVEDTLEKLL >NC_010001.1|WP_012199284.1|1581245_1582583_+|MATE-family-efflux-transporter MFTKKNLIKLLVPLVIEQLLAVTVGMADTIMIAKRGEEAVSGISAVDAICVLLIGLFSALATGGAVVAAQFIGQKNREKANEAANQLVLSVAFLSVILMVISLIGNEAILHLIYGKLSPLTMQNAKTYFYIVAVSFPFIAIYNAGAALFRAMGNSKISMMTSLWMNIINIVGNSILIFGFGMGVAGAAISTLLSRMIAAIIVIYRLRNQENAICIEYNFRLGYQPEMIRRILKIGIPNGLENSIFQFGKLLVGSLIATYGEVGMTANAIGNSVASFNCIPGSAIGLAMITVVGQCVGAGKLDEAKKYTWKLLKYASISMLVLNIIVLLSINPIVNLFEAQAATKELATKLLIYHCICCIIIWPSAFTLPNALRAANDVKYTMFTSISSMWIFRVGFSFVLAQTFGLGVFGVWVAMTIDWVFRAILFLSRMISGGWKKHARMEHAR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_010001_4 | 3365564-3365671 | Orphan |
NA
Consensus repeat of NC_010001_4
|
1 spacers
spacers of NC_010001_4
>4.1|3365588|60|NC_010001|CRISPRCasFinder CGCTGTAACACATCCAGCCTAATATACAATTCTATTATAAGCACGGTAATGAATCCACCC |
CRISPR arrays and Neighbor proteins around NC_010001_4
The CRISPR arrays of NC_010001_4 >merge|NC_010001|4|3365564-3365671|CRISPRCasFinder TAATTCATAATTTCATTACATGACCGCTGTAACACATCCAGCCTAATATACAATTCTATTATAAGCACGGTAATGAATCCACCCTAATTCATAATTTCATTACATAAC >NC_010001|4|4|3365564-3365671|CRISPRCasFinder TAATTCATAATTTCATTACATGAC CGCTGTAACACATCCAGCCTAATATACAATTCTATTATAAGCACGGTAATGAATCCACCC TAATTCATAATTTCATTACATAAC
>NC_010001.1|WP_012200774.1|3362997_3365094_-|polyribonucleotide-nucleotidyltransferase MYKSFSMELAGRTLSVDVGRVAAQANGAAFMHYGDTVVLSTATASDKPREGIDFFPLSVEYEEKLYAVGKVPGGFNKREGKASENAILTSRVIDRPMRPLFPKDYRNDVTLNNLVMSVDPDCSPELTAMLGSAISVAISDIPFDGPCATTQVGLVDGELVFNPTAAQKAVSDLALTVASTRDKVIMIEAGANEVPEDKMLEAIFAAHEVNQEVIQFIDKIVAEFGKEKHGYISCEIPEEMFAAIKEVVTPEQMEEAVFTDVKQVREENIREIKNKLAEVFEESNPEWLHLIDEAVYKYQKKTVRKMILKDHKRPDGREIHQIRRLAAEVDMLPRVHGSGMFTRGQTQILTVTTLAPLSEAQKLDGLDEAEKSKRYMHHYNFPSYSVGETKPSRGPGRREIGHGALAERALIPVLPSEAEFPYAIRTVSETMESNGSTSQASVCASTLSLMAAGVPIKKQVAGISCGLVTGDTDDDYLVLTDIQGLEDFFGDMDFKVAGTHDGITAIQMDIKIHGLTRAIIEEAIRRTKEAREYIINEVMTPAIAEPRTEVGKYAPKIIQIQIDPQKIGDVVGQRGKTINAIIEQTGVKIDINDEGAVSVCGTDKDMMDKAINMIRTIVTEFEEGQVFEGKVISIKEFGAFLEFAPGKEGMVHISKISKERINHVEDVLTLGDVVKVVCLGKDKMGRISFSIKDYKEEN >NC_010001.1|WP_012200773.1|3361897_3362767_-|sugar-specific-permease-EIIA MAQYDYIAIWVGGIALAVLVIMIIAFLIYRKRHVNHHSMKEQKSRKMKDLPAKKKMVKNVQGNDSQTKKITPVVTLDSGNNTSGNQRLDGARISHRRNHPDRVAEESMWKEKKEQEEDEILKKMENERKNKVFMIYSPCNGEMGDAVENVTDAKECGLDYPGVIIAPSDDKVYAPINGRISWKSENPNMVSIQSDTGVEVLLSVLKEDEVLQTEVFTMKTAQGAYIGMGEQLCQFTQGLIRKGNRIYKMKMELSSYQEGQLLLVKRFSYISHGDKIITLKTERNAVETA >NC_010001.1|WP_081428552.1|3360352_3361735_-|insulinase-family-protein MGLIMVKVNVLKNGIKVVTEELSYLRTVSFGVWIRVGSAKENKENNGIAHMIEHMLFKGTKTKTAKEIADIIASIGDDVNAFTSKEQTCYYGTTITESLSILVELIADMLCNSLLSEEDLRKEKRVIYEEIDMYEDSADDMVHEILQQNVFKDQPLGYIISGAKKNVRSFKRMQLIDFMAKHYVAENIVISVAGNFSEKELMDQLERCFGGIRGTNPKALNSLTLLKKKKDELLLAPYEEKFQKKHDDIPSYHTCFCQRHKDNEQLHINLAYPSIPLGSDESVVFAVVNSMLGGSNNSRLFQRIREELSLVYSIYTYGSAFEKAGLYHLDITVNPQQAFRVLRETKLVMDEFLTTPITKEELDTHKAQVKTEFILGSESAKARMNSNAKSVLVRGYVKTLDEIIEELNRLSAEDIIRFANKVWGESSASLCVIGAESGVSFRALKKEYQNLFFINPNTKA >NC_010001.1|WP_012200771.1|3358909_3360187_-|O-acetylhomoserine-aminocarboxypropyltransferase MEYNKLSTICVQAGYTPKNGEPRVLPIYQSTTFKYDSADTVGKLFDLQEEGFFYTRLANPTVDCVEKKIAALEGGIGAMCTSSGQSATLLAILNICNAGDHIISSSAIYGGTTNLLAVTLKKLGIEVTFVNPDATKEELEVAVKENTKLYFAETLANPSLVVIDIKLWAEVARQNGVPLFIDNTFATPINCRPLEFGANIVIHSTSKYMDGHASALGGVIVDGGNFDWNNGKFLGLTTPDESYHGVIYTEFAGKAAFITKARTQLMRDMGVMPSPNNAFLLNLGLETLHLRVKRHCENALIVAKWLSENDKITWVNYPSLEGNKYYALAKEYMPNGTSGVISFGVRGGREAAMKFMDQLKLAAIVVHVADARTSVLHPASTTHRQLSDEQLISAGVSADLIRMSIGIEDVADIIADINQALDSVE >NC_010001.1|WP_157668816.1|3357416_3358253_-|peptidase-S14 MLSAENQEVTNKELVKETPLKDDANNTQKKSPTGDKQVKGNIKKEKLEEENLKNENEKLKGQKIQDYGQATLEDNGKNHKIHLLSIIGEIEGHECLSQNAKTTKYEHVLPQLATIEDDTETDGLLILINTVGGDVSCGLALAEMIASLSKPTVSLVIGDSHSIGVPLAVATNYSFIVPTGTMIVHPVRMSGMVIGAPQTYDYFKLIQDRIVGFVSSHSKIKKEKLEQLMLNTGMLSKDLGTILVGDEAVAEGIINEVGGIKQAIEKLHQMIEEKNSNR >NC_010001.1|WP_012200769.1|3356415_3357237_-|undecaprenyl-diphosphate-phosphatase MDFIELLKVIFLGIVEGITEWLPISSTGHLLLVDEFLKVNLSKDFMSMFNVVIQLGAILAVVVLFFKKLWPFSKEEKNFIKKDTFTLWFKIVVACIPGIVMIPFDSKIEDLFFNPQTIATTLILYGILFIIIENRNAGKQPKVAKLSDITYQMAFMIGLFQILAMIPGTSRSGATIIGAMLFGASRYVAAEFTFFLAIPTMFGASLLKLLKFGFTFTGAEIVALITGMLTAFIVSIIVIKFLMGYIKKNNFKVFGWYRIVLGAIVAGYFLLAR >NC_010001.1|WP_012200768.1|3353394_3356235_-|DNA-translocase-FtsK MASKQTGTRSKQSQRQTSSKPKTSNSKRTNQTKGKPTASRSSKTQRNKQVQEYLAENESIRDEVILIVTALTSFLLLLSNFDLCGPVGKQIKTFFFGLLGHFTYLFPFALFFFIAFAVSNRGSVIARRKIIGSIVLIFTLTSLIQLLEGYNGEMKYFDYYLQSAKNSNGGGLIGGTLVSILCPLFGTIASVIILIVMLLLCFIFITGKALLTLMREKGEQKLNDHRQLRENYAKEFKQLDMEDETYGEERRKPRIVNLQKQANEKVKSFFDQDEDDEDDLKYDEMEDGPVNFLEELKRRGKDKKQNQKKEVVEEPISVFEMTEIKSEQNDGLNSETNFSPSEDMLQEVNSIYEDELNRKFGQNEDNNEVEINTSYEVKNIKPLNANTEFYKDDAVKETKDQNVNVDSNLKDVSAEASVDSSSHMPEGNNDNKAKPKEVKAESGSEDILTVDQKLEPLKKYEFPPIELLGKPKANQRGMSDKDLKETAIKLQKTLESFGVRVTITNISCGPAVTRYELQPEQGVKVSKITGLSDDIKLNLAAADVRIEAPIPGKAAVGIEVPNKENSAVMLRELLESKEFNSHPSDIAFAVGKDIGGQAVVTDIAKMPHLLIAGATGSGKSVCINTLIMNILYKANPADVRLIMVDPKVVELSVYNGIPHLLIPVVTDPKKASAALNWAVMEMTDRYKKFAEYGVRDLKGYNEKVAEIAHLNDPAFTKLPQIVIIVDELADLMMVAPGEVEDAICRLAQMARAAGLHLIIATQRPSVNVITGLIKANVPSRIAFSVSSAIDSRTILDGSGAEKLLGKGDMLFFPSGYPKPVRVQGAFVSDKEVSAVVDFLKSQNHQITYNEEINDKIKNAQVSSAAGGASGGNDRDEYFIEAGKFIIEKDKASIGMLQRVYKIGFNRAARIMEQLSDAGVVGPEEGTKPRKILMSMEEFEQYVDEYV >NC_010001.1|WP_041703702.1|3352712_3353135_+|GNAT-family-N-acetyltransferase MRFSLWPHHNENELYNEMLQILEGKTFYKNELSWTVFVAVRENGSLGGFIEITIYPQLDLCDSKPIGYIEGWYVDEDLRNSGVGKRLVDIAQKWAVENECTEIASDVEVDNKVSQLAHQALGFNKYHEANECIFYKKSLI >NC_010001.1|WP_041703701.1|3352390_3352645_+|hypothetical-protein MSKFVKGIIAMIFFYIFNMFMTIIGQVIFFGDSFTLSYHLFTYTGLMTLCGVIVVCTCIIIEKLNEIKNLYNRVDIENNTKINE >NC_010001.1|WP_012200766.1|3351403_3352327_+|hypothetical-protein MKKRKLLVIFLFCLLALPFPFSLISWIGRSHLETTYQSDMPMLTISEYDIYVDTRTATSAELARSGVSYEAVAAIKSNDIEDELTRLSALPDEELSNRGYNTGQIEILHDYTGERIETNPKLRGIFADVKCNFYQYTANNISLSLKIVWEWTNKPMLSGISITDIVVIRWQGTNTAGLPMNLALNSSGSSCMINYYNPYESYQSQSSVSISTTDPYGHAYAKFPMSNGIANGSSYAKTGTLITKIDRTGTDAIKEAAFVFAYGHTTVALTNPSLSLPDPFGIYFSFGVTTMCKEVIRMNSSGIITRY >NC_010001.1|WP_012200775.1|3365703_3366978_-|bifunctional-folylpolyglutamate-synthase/dihydrofolate-synthase MNYQEAIKYLRSYKRNTGELSLKNLNKLLDYMDHPEKKLKFIHVAGTNGKGSTCKMLSSILRCAGLKVGLFTSPFLETENEQIQINGEVISNEDFAKVCKKVKDFTTYLMLDEIPTEFELTTAMAFQYFYDTKCDLVVLEVGLGGELDATNVIETPLVSVLTNIGIDHVDYLGTTLKEIACKKAGIIKENGIVVSYEQEKEVEEVIKLTCEERHNKLVFAEFSELKLHQENLSRQKFSYKQNTNLSLSLIGEHQRKNAAVALEVIAQLQTLGYKISENAISEGMNYVTWPGRFEVLCKQPLVILDGGHNVQCVEAFSEVLKQFIPGKKAIVILGVLADKDYKGMIPYLVPFTKRFIAVTPKNTRALPSEQLAEELSKHHPLVSHNATPVEGIMAALRDAREDDIICVIGSLYMAAEIRDCFIGE >NC_010001.1|WP_012200776.1|3367215_3367779_-|folate-family-ECF-transporter-S-component MLNQEKNVKNKDLKKGKKVFTLETFIVLALLVAIEVILTRFLSLKEWNIRFSFGFIPVVIAAILYGPIASATVAACSDFLGAILFPMGAYFPGFTITAFISGIVYGLFLHKKQSLPNIVGAAVVNQFFCGLVINSYWLSIISGKSTFWGLIPIRSIQSAVMSIVIISVTYVISKTIVPIIKKAIVIM >NC_010001.1|WP_012200777.1|3368014_3369376_-|MATE-family-efflux-transporter MNLIHEMKQDKAFLKKAAMIAIPIALQGLLNNVLNFVDTLMISRLDTTTVAAVGIANKIFFVVSLLLFGICSGSCILTSQYWGMRDIKNIKRVVGLSMLLGVTSAFLFTLVSFLKPQLVMSIFTNSEPTIIIGAKYLKIVCISYVITAVTQIFMSALRSVNQVKLPVVISLVAIVTNVILNYVLIFGKFGFPELGVEGAAIATLIARIVEVVAMILLVYYKKSPVSISVSHLFFYDKDLYSIYFKTASPVIMNEFMWGLGITMYSLAYGRMGDNAMAAITITQNIEQILQVVFMGISNATAVILGNELGAGKLKDAELHAKFILILQAMVTVVIIALGIVFMNPMIAVFHMEPVVSASIRKCLLVFLAYLFFKVFNTVNIVGILRSGGDTKAALFLDVTGVWLIGIPMAFLGGLVFHFPIEAVYAMVLSEEIYKMILGIPRYRKKKWLRNIVA >NC_010001.1|WP_012200778.1|3369632_3369899_-|30S-ribosomal-protein-S15 MISKEKKQEIINAYGRNANDTGSPEVQIALLTERIAELTEHLKINKKDHHSRRGLLKMVGQRKGLLEYLKKTNLEGYRELIARLGLRK >NC_010001.1|WP_012200779.1|3370083_3371748_-|Na/Pi-cotransporter-family-protein MKMESLLALLAGLGLFLYGMKLMSDGLEKAAGARLRSILEMCTKNQFIGMIVGILFTAVVQSSSATTVLVVSFVNAGLLNLMQATGVILGANIGTTVTAQLIAFNLSAVAPVFLMVGVCMVMFVKKPMVKRIGEVVLGFGMLFFGMSIMSGSMDSLRSSEQVMNLIASMDNPFLGVLVGFVITAIVQSSSATVGIVLVMASQGLIPLNICFYIILGCNMGSCVSALLASIGSKKTAKRAAWIHLLVNIIGSFAIFVILLFFENQIKDFIIAISGGNTNEVVDGVSQTIARQVANTHLIFKVFEVAICFPITKYIAKAATLIVPGEDKKVDNMHLEFITDFTSFQTTAAVPNAINEIVRMAQITFHNLSIALSSLLNSDEKQISEVYETESSINYLSREITNYLVNANQYSLPIDDRKVLASLFHVVNDIERIGDHAENVADFAKQAIEGNLHFSSEAVEEINKMATAVQKLLSYSIEMFENKNREYLEEILKFENSIDDMERRFQKNHVVRLTKNACSAETGMIFSDLLSNLERVADHGTNIAFSILDEDPEDI >NC_010001.1|WP_012200780.1|3372247_3373639_-|PLP-dependent-aminotransferase-family-protein MLIIPLNLGNKVPLYEQIYEFIKKEIKTGKLPVATKLPSSRNLAQSLQISRSTVELAYQQLISEGYIESIPKSGYYVQGIADLIQITERKKALGKEKVEKVKRLRYDFSPFAVDLSEFPFHTWRKLSNQCMNDMNQSLFLLGENQGDHSLREAIVAYLHSSRGVKVEASQVIVGAGADYLLVLLSQIFGNDQIIAMENPVYKRAYRIFQGVPYPIQPITVNTDGISIEELMNTDATVVYVTPSHQYPLGAVMPIKRRLELLQWAAKGDNRYIIEDDHDSEFRYKGKPIPSLQGIDENDRVIYLGTFSRAIAPAIRMGFMVLPQRLYQVYKDKYSFYASTVSRIDQAIVCEFLNGGYFERHVNKMRKRYKMKHDLLLHELKSYEDNITITGENAGLHIVVSFHTSLTEEEILKKVRKKEIELYPLSKHYITDYKPTYPTFLMGFANLSEELIIEGVNLLMKELF >NC_010001.1|WP_012200781.1|3373607_3374561_-|bifunctional-riboflavin-kinase/FAD-synthetase MEYIYGSTDFKYHNTCVTLGKFDGLHRGHQLLLSELAKFEQQGLTSVMFTFDYHPGNLFSEKEIDLIYTEEEKKELLSRLGPKVLISYPFTEETASMEPEDFIKEVLIGKLDAKAIVIGADYRFGRKRKGDAALLKKYSIMYGYELVICEKLTYHDNVISSTRIREELKNGQMESVNEMLGHPYTIMGTVVTGNKIGRTIGVPTVNLLPAEHKLLPPNGVYASIIKFQENTYYGVTNIGYKPTVGAGQKLGVETHIFGFTGDLYGKIIEVELYRYERPETKFTSIEELKQRVQLDIQNVKEFFARGCYADNTTQSRE >NC_010001.1|WP_012200782.1|3374688_3375612_-|tRNA-pseudouridine(55)-synthase-TruB MNGIINVYKEKGFTSFDVCAKLRGILKQKKIGHTGTLDPDAEGVLPVCVGNATKLCDLLTDKDKVYEAVLTLGIITDTEDMTGEVLERRLVTATYDRVLEVVEQFTRTYDQIPPMYSAIKVNGQKLYELARQGKVIERKPRTVTIHAIDILGVTPLEEQPEIVHEVRMRVSCSKGTYIRSLCRDIGEALQCGGCMKSLIRTQVSIFTLENTLRLAEIEECVKNQTLEQVLMPVDKLFLSMPKVVVKKESCKFLYNGNQLVEDNFTWEKVSDQINIDKIRVYDSEDVFTGIYEYDEKKNCYQPVKMFL >NC_010001.1|WP_012200783.1|3375686_3376661_-|bifunctional-oligoribonuclease/PAP-phosphatase-NrnA MKRFEEDIRKANRIGITGHVRPDGDCTSSCLALYNYLQENYNADRTKTIDLHLEPIAEPFRFLTSSNCIQSDYKDEEPYDLFFALDCGSLDRLGAAQYYAMQAKKTVNIDHHISNTGFASVTLMVSDSSSTCEVLYDLFDVDKISKATAEALYLGIVHDTGVFKHSNTTEKTMAIAGKLITLGATPNKIIDETFYQKTFVQNQVLGRCLLESILLLDGKIIVSSISKRAQNFYNVVPSDLDGVIDQLRITKGVEVAIFLREDDVQEYKVSMRSNGIVDVSKIAVFFGGGGHILAAGCSMKGSLHDVINNLTIGIEHQLKNAEKC >NC_010001.1|WP_012200784.1|3376672_3377083_-|30S-ribosome-binding-factor-RbfA MRKNSIKNTRINQEVQKELSMLISRELKDPRINPMTSIVAVEVAPDLKTAKVYISVLGDELSQKNTLAGLKSAAPFLRGQLARGINLRNTPELLFVVDQSIEYGVSMSKLINEVNAGNHKASDEEESDDKGHEDEQ |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_010001_5 | 3417248-3417362 | Orphan |
NA
Consensus repeat of NC_010001_5
|
1 spacers
spacers of NC_010001_5
>5.1|3417279|53|NC_010001|CRISPRCasFinder AAGGAAATAATAATTCCGAATTAAGGAGAATGGTAAAATGGAGGCTTTCGAAT |
CRISPR arrays and Neighbor proteins around NC_010001_5
The CRISPR arrays of NC_010001_5 >merge|NC_010001|5|3417248-3417362|CRISPRCasFinder ATAACGAAGTTTAACTATTGAAACAATTCAGAAGGAAATAATAATTCCGAATTAAGGAGAATGGTAAAATGGAGGCTTTCGAATATAACGAAGTTTAACTATTGAAGCAATTCAG >NC_010001|5|5|3417248-3417362|CRISPRCasFinder ATAACGAAGTTTAACTATTGAAACAATTCAG AAGGAAATAATAATTCCGAATTAAGGAGAATGGTAAAATGGAGGCTTTCGAAT ATAACGAAGTTTAACTATTGAAGCAATTCAG
>NC_010001.1|WP_012200815.1|3415540_3416119_-|zf-HC2-domain-containing-protein MDCLNAQRLITPFIKDELSMTELEGFLAHVKECPVCREELEVYYALLTAIKLLDEDKEMSNNFTEELNRKIRSCEEHIRRNKRNKVNRRIVFMLVVVGVTIVSSLSIRKLTEIPAAPTKPPYILRYSGIPRRYDPMFRIRTDYDTMACEYVKKVKDGRLEFYRKNREEYEIVRQIYVNQRELIEIDNIENSD >NC_010001.1|WP_041704515.1|3412479_3415452_-|DNA-polymerase-I MNDIINNKQDSKEGDYLLVIDGSSLLSTQFFGNLPKEIMFAKTMEEKEKYFPKIMQTATGVYTNAVYGFLRVLLKIIKDQKPTYLAVAWDISRNTFRREIYPDYKGNRGETLEPLKDQFKLCQHVLKEMGIVQFMDERYEADDFSGTLCQKFEEEVPIRVMTKDNDYLQLITERTNLWLIHSTAKKTDELYEKYGLSKKELNVPDRTFLFTPELVEKEFGIEPSSVPSLKGIGGDSSDNIKGVPGVGEATAVALIKEYKTVENLYEILNNLDETGKKEINEYWKTLGIKRTPINALLKISDTELVGEKAAILSKTLATIKKDIDLKDLGLEQLRIHINTENAQKCFNELEFKTIKMDNAEVEDSSINNLRFEADKIKITSNLEEVETLFSNLIKLWEKNQKKLKKTKKSRNDKSDSKITIKEIKKPEYASEDAVGIKLIMENKSLVGISVYYGSEASFIIPCEGFITPDFLTSKLNGLLEKKITLAIFDIKKYLPYLNANEESPCFDVTIAGYLLEPDASTYEYQTIAEKYLELDLPSEKEVFSGQTYASLSLLDQDQYKKAACYESYVAHHIYPVLLKLLSERGLLPLFAGIEMPLVYTLYDMEQRGIRVDTNGLKDYSDQLGVSIVELEKQIFELVGVEFNINSPKQLGEILFQRLGLSYGKKTKTGYSTSAEVLEKLSSEHPVIKLILQYRQLTKLKSTYADGLVSYVEGDGRIHGTFNQTIAATGRLSSTEPNLQNIPIRMELGRKIRKVFIPEDGYLFLDADYSQIELRLLAHMSNDARLIEAYRQAQDIHRLTASEVFHTPFDEVTSAQRSNAKAVNFGIVYGISSFSLGQDLDITRKEAEEYINKYFMTYPGVKTYLDGLIEEGKETGVVKTLYGRIRPVPNLTNSNFMKRSAEERIAMNSPIQGTAADIMKLAMIHVNQVLKERKLKSRLLLQIHDELLVETHESEVEEVAKIMKEEMQQAASLSVPLEVEVANGNNWYEAK >NC_010001.1|WP_012200813.1|3411738_3412359_-|dephospho-CoA-kinase MERHGYFMKVIGLTGGIGSGKSRVADLLQREFLVYVIYTDDIARDQMKQGGCSYEKVVKQFGTEILDEGGEIDRNKLAKIIFQKEDLVKLLNSLTHPNVHLEVLHQIKEAKSKGKLYSAIIVETALLFEAGYQDFCDEIWYVHAPIGDRMKRLKESRGYSEEKIESIIKKQKSEEFFLKNSTVIIENGNDVLQDELRLQCERYLTT >NC_010001.1|WP_012200812.1|3409246_3411358_-|cell-division-protein-FtsA MDAITYPENMVFGLDIGTRSIVGTVGYKQNEHDFIVVSQSVRYHETRAMLDGQIHDINKVAETIREVKKDLEKQLGKKLKEVCIAAAGRVLKTVTVKAEYNLINEGIISEEHIRTLELNGVEKAYEELRKEMNSGDGNFYCVGYSVVHYYLNDYVMTNLEDHKGSKIGVDLLATFLPEEVIEGLYAAVGKAGLEVVNLTLEPIAAINVAIPDKFRLLNIALIDVGAGTSDICITKDGSIIAYGMIPKAGDALTNILMQRYLVDFKTAETMKTSILKKKTVSYKDIMGLSNKVTREEIYEAVRDEIDHITAQIAEQILYLNGGKSVSAVFVVGGGGKLPYFVEALSSKLNLPKERVALRGEEVLNMVQFLQKEIKKDPLLVTPIGICLNYYENRNNFIYAMVNGERIKLYDNSHLTIVDAALAIGFPNELLFPRRGKALHYTINGSERLARGESGEGAIIILNGKQVSLNASITQNDIIQITESTAGADATLMVAKLPEYKSTITFAVNHKEVICPKYALANDILVSDTYQIKDGDRLELLNHYTLEQLLEFMDLPYRKGITINHQSAKPQDRVYENFTIYYPLHEDIAASYEEVAAMISEEDFKEYSLGEDALLKDDFLRDGISEEVDEKKVEKASSISVYVNKTSIILKGKDKFILVDILDVYPFDLSTAHGSKVVLKINGDEAEFTSPLNNQDVIEMYWEK >NC_010001.1|WP_012200811.1|3407492_3409214_-|sensor-domain-containing-diguanylate-cyclase MSDDLLVFRKKFSRGLVLVTSVFWLYTLIQLVGNLSEHASFGILVTSGALIIDILAEHTKLFKKRIAWMILRSIELISFSICFFVTIGSINSMFFGIELIAVMLQLLMLTDFLDVYSRAITLTTMSLPAIIYLISIILLKPERQEEFFGMVCAYLSLIFVVMLISELISEVFIATDKRIFEIRRFSEQTKETNEALRNQQEKFRKVNEELGIQKIMLEAAYHKINSANTENQTLYQVIRYISTELEIGNLMKLITEAIYEAMGLDVCTIILEPDIAGNKQVTYEIHSRLGKGFYEQMSNRIEQGCIEEYMKEEGNYIDNQVQPGKYSFLKDRKINSLLIVPLVREKKVIGALLCGHSQFEYFNGNIIFFETVVAQLLVAIHNASLYSKMQQMAIRDSLTGIYNRGQLNVILEQYTKRASEQNKSLSVALLDIDLFKKINDTYGHLFGDEVIKMVASKLQEVANCFHGIAARYGGEEFVIVLPDIGILDFYHIVTSLKETIDTTTLYFNEDEINVKVSVGISSYPETSLSCQQLLNRADGAMYYSKRNGRNSITVDNDIIQDYVRKNKETRGEL >NC_010001.1|WP_012200810.1|3406582_3407377_-|MBL-fold-metallo-hydrolase MKMCSIASGSSGNCIYIGSNETNLLVDAGVSGKRIESGLLSAGVDPNSLDGILITHEHSDHIQGIGVLARRYKLPIYGTVETINAMLRLSSVGRIEESQLRFVKPDEALCIGDILVEPFSISHDASNPVCYTFTNGGHKIGMATDLGTYDSYTISKLCGAEVLYLEANHDVNMLMVGSYPYHLKQRILGERGHLSNETSAKLICELLHDDLQHVLLAHMSKENNYAELAFETVRYEVEQSVATSSKMPVITVANRDIPSEMVII >NC_010001.1|WP_012200809.1|3406174_3406447_-|ACT-domain-containing-protein MKKTIITVVGHDCVGIIAKVCTYLANNKINILDISQTIVSGYFNMMMIVDTIESSKDFSQLADELEEIGKEIGVVIKAQREDIFDMMHRL >NC_010001.1|WP_012200808.1|3404791_3406156_-|PFL-family-protein MINFNEVLETNKMIEQENLDVRTITLGISLLDCISSNLEELNQNIYDKITTVAKDLVTTGKKIERQFGIPVVNKRISVTPIAMIGASACKTPSDFVTIAKTLDRAANTVGVNFIGGYSALVSKGMTSSERLLIESIPEALAVTERVCSSVNVGSTKTGINMDAVKLLGQIMLDTAEYTKEKDSLGCAKLVIFCNAPDDNPFMAGAFHGVTEADAIINVGVSGPGVVKTALESVRGEDFGTLCETIKKTAFKITRVGQLVAMEASKMLNIPFGIVDLSLAPTPAVGDSVAEILQEIGLEYPGAPGTTAALALLNDSVKKGGVMASSYVGGLSGAFIPVSEDQGMIDAVRAGCLTLEKLEAMTCVCSVGLDMIAIPGDTKATTISGIIADEMAIGMINQKTTAVRLIPVIGKKVGDIAEFGGLLGYAPIMPVNNFSCDNFVNRGGRIPAPIHSFKN >NC_010001.1|WP_012200807.1|3403223_3404684_-|aminoacyl-histidine-dipeptidase MEGVYQQLSSMDYKNVLKYFVEISAVPRGSGHNEKISEYLVNFAKDHNLKYVQDETLNVIIYKEATPGYENHTPVVIQGHMDMVCLKAEDSNHDFLTEGLELIVEGNSIRANKTTLGGDNGIAIAFGLALLSDENLEHPALEVLITTDEETGMDGAKALNPDHLKGRYMINVDSEEEGTVLVGCAGGLRFYAELPLNFTEKEGKRVKLVIRGLKGGHSGAEIHNNRTNATILLARAIMELKEKYDFLLCDMKGGDKDNAIPSLAQAEAIVSAEEVDAFVASVKELEEKYQKELLASEPNVKFECQIFEEEKAKVIHPSSMMKVLFAILQAPNGVQVMSSEIAGLVESSLNLGIFAIEDDLAIFHYSVRSGKSSYKYFISDKLSFMFGFLGAEYESNADYPAWEYKKDSKLRDLFLNVHKELFNKDAEVMSIHAGLECGLISEKIPDMDIISIGPDMKDIHTPMEQLDIPSTIRVYQTVEKLLQKMK >NC_010001.1|WP_085953463.1|3401613_3402696_-|cell-envelope-like-function-transcriptional-attenuator-common-domain-protein MGNNKNNKKNNRKKVLTITFSILGALTLVIGLIVGTPAGRKLIYNAVGGYVSGRIDNVDSENKKPSNIFGDNKDDDIENTDPNLRKEKYVANFLISGIEEIGGGGRTDSMMIVSVNKKDNTIKLTSIMRDCYVEIPGHSPNKLNAAYSLGGMDLLVDTIQQNFKIKIDGYATVNFNAFESIVDILGGVDIELGSAEANYLNTTNYISNPAYRKVRTGMNHLNGNQALGYSRVRKVVTLGGANNDFGRTLRQRRVLNAIFEEYKSKNLFELMSIMDQVLPFVKTDLSGSEISDLLQAVVENRIFTIENHRIPANEYYTAARNERGAVLILDFEANIKELYRVIFLDEEVTPTPEVLIDIPN >NC_010001.1|WP_012200816.1|3417679_3418021_-|hypothetical-protein MDRNFENPMERNRIDNERRDEIERRNGFERRNEEERRNGFERRNEEERRNGFERRNEDERRNEFDRRNEPEMREEFRRRMDFDRRHENERRDEFDRRHDFDRRFPFWWLFFVR >NC_010001.1|WP_012200817.1|3419248_3419596_+|hypothetical-protein MDNPNQNLSNNAKQTHVHEIQGSVEIAEQNDPHSHRFATISGEAIPYGMDHYHEVSFKTDFFREHYHEFQGHTTTAIPIGNSHLHYLESVTTANAGHKHGFRFATLIDDPTSGQH >NC_010001.1|WP_012200818.1|3420321_3421581_-|glycosyltransferase MAYYSFVMVCYNNWNFTVKAVKSFFDYLNPIHQNKGIELIIVNNGSNDETEAGIEEFRIKFKEVSEIKTVHLEKNLGYIAGVNIGLSYCSGEIITLLNNDLIFCPGWFDSLANIFDADLTVGAATPLLTNGSGAENIELEYKNPEMKLAFFKSKETMNYYAEKIMEKNHKAIINSNRLVGTCIAFRKDILLLVGGMDFWFGIGMFDDDDFSIRINLAGYKTVIVGGSFVYHIGSATFSKYTQINNAAVISNKKKFLRKWKIKCTENAEGLYSRDDVHLRTNYIRKKHFIPFEFSQFKKPLEISSAKKTDIKRILFVADWTNLKSGWVKELERNLLHTDAKEEINLWIPSEYFSKNEVENEVNKINSTDSKVNYIEKDINPEDLLEFLSSFDTVIPVTDDFVNRYIIYLAKQLNIEVARL >NC_010001.1|WP_012200819.1|3421709_3422930_-|alpha-clostripain-like-protein MQNDQQKEWTILFYLNGNNELQPEMLQSKLFIEKEGSDGTVNIVIQYSFVEKHIIEIIRPKYRFNNDAEGQSGVIRYSAAGPDSTFHEELRNINMADPMCFYNFLEWGITNYPAQKYILVLGGHVFQYIGLMPDYSQDLPYLMGYPEMVNVLNLIKKNIGKKIDLLVLDTCYVNRIEMLYELGKEPDPPVSNVLTYINGGPASGLPYDLLIKTIKKINSTVTDKFLLSKLMENLNYDLIAYEIDHNKLESIKNLYSDLADCRLNFNSSSCSFPYELLTNVDENLPWFDLRKKLQDYMPELTICYNNISRKPFGHFYVSAQTISDQHKIDLYHRLAFAKKNSWSKLLYGLHSPINTSDEAETNVHPTILKSNDLYALISAMNPNLGLNENNEILSKLIEYKGWKWKN >NC_010001.1|WP_012200820.1|3423036_3429723_-|chromosome-segregation-ATPase-like-protein MPYNDKYYNKYGLIKNDNDYNDEDDDRHLCHDGNDRYDCGEDDDRYDCHEDDDRYDCHEDDDRYDCDEDDDRYDCHEDNDRYDCHEDDDRYDCDEDDDRYDCDEDDRYDCDKDCDEICDNDECCFLCDTELIICLLTNRKFGLKEIKREVRRIETIVGIIETIVVEIQTQVAVIDTNVARIESTIGEVATQVAIIDTNVARIETSIIEIQTQVAVIDTNVARIESTIGEVATQVAIIDTNVARIETSIIEIQTQVAVIDTNVARIESTIGEVATQVAIIDTNVARIETSIIEIQTQVAVIDTNVARIESAVAEIETQVSVIDTNVAVIETIVGRIETSVVEIQTQVSVIDTNVARIETSIIEIQTQVAVIDTNVARIESAVAEIETQVSVIDTNVSVIETIVGRIETSVVEIQTQVSVIDTNVARIETSIIEIQTQVAVIDTNVARIESAVAEIETQVSVIDTNVSVIETIVGRIETSVVEIQTQVSVIDTNVARIETSIIEIQTQVAVIDTNVARIESAVAEIETQVSVIDTNVSVIETIVGRIETSVVEIQTQVAVIDTNVARIETSIIEIQTQVAVIDTNVARIESAVAEIETQVSVIDTNVSVIETIVGRIETSVVEIQTQVSVIDTNVARIETSIIEIQTQVAVIDTNVARIESAVAEIETQVSVIDTNVSVIETIVGRIETSVVEIQTQVSVIDTNVARIETSIIEIQTQVAVIDTNVARIESAVAEIETQVSVIDTNVSVIETIVGRIETSVVEIQTQVAVIDTNVAKIETAITEIETQVAVIDTNVAKIESAVAEIETQVSVIDTNVAKIETAITEIQTQVAVIDTNVAKIETAITEIQTQVSVIDTNVARIETSIIEIQTQVAVIDTNVARIESAVAEIETQVSVIDTNVSVIETIVGRIETSVVEIQTQVSVIDTNVARIETSIIEIQTQVAVIDTNVARIESAVAEIETQVSVIDTNVSVIETIVGRIETSVVEIQTQVAVIDTNVAKIETAITEIETQVAVIDTNVAKIESAVAEIETQVSVIDTNVAKIETAITEIQTQVAVIDTNVARIETSITEIETQVAVIDTNVAKIESAVAEIETQVSVIDTNVAKIETAITEIQTQVAVIDTNVAKIETAITEIQTQVAVIDTNVARIETSIIEIQTQVAVIDTNVARIESAVAEIETQVSVIDTNVSVIETIVGRIETSVVEIQTQVAVIDTNVARIETSIIEIQTQVAVIDTNVARIESAVAEIETQVSVIDTNVAVIETIVGRIETSVVEIQTQVSVIDTNVARIESAVSEVETQISVIDTNVSVIETIVGRIETSVVEIQTQVSVIDTNVARIETSIIEIQTQVAVIDTNVARIESSVAEIETQVSVIDTNVAVIETILVEIQTQVAVIDTNVARIESAIGEIPTQVATIDTNVATIETIVSRIESSVAEIETQVSVIDTNVAVIETIVGRIETSVVEIQTQVSVIDTNVARIETSVIEIQTQVAVIDTNVARIESSVAEIETQVSVIDTNVAVIETILVEIQTQVAVIDTNVARIESAIGEIPTQVATIDTNVATIETIVSRIESSVAEIETQVSVIDTNVAVIETIVGRIETSVVEIQTQVSVIDTNVARIETSVIEIQTQVAVIDTNVARIESSVAEIETQVSVIDTNVAVIETILVEIQTQVAVIDTNVARIESAIGEIPTQVATIDTNVATIETIVSRIESSVAEIETQVSVIDTNVAVIETILVEIQTQVAVIDTNVARIESAIGEIPTQVATIDTNVATIETIVSRIESSVAEIETQVSVIDTNVAVIETILVEIQTQVAVIDTNVARIESAIGEIPTQVATIDTNVATIETIVSRIESSVAEIETQVSVIDTNVAVIETILVEIQTQVAVIDTNVARIESAIGEIPTQVATIDTNVATIETIVSRIESSVAEIETQVSVIDTNVAVIETILVEIQTQVAVIDTNVARIESAIGEIPTQVATIDTNVATIETIVSRIESSVAEIETQVSVIDTNVAVIETILVEIETQVAVIDTNVARIQTEVSVIDTNVAKIETIVTIIESSVAEIQTQVSVIDTNVAIIETLLGHLNEIETQVAIIDTNVARIETLLGIGSCTRLTTGPVLRDNGTNSIVVKVLNNSIATVTDVSSILFNIETCPKLAVETVVFDPLPPKCSDHFVFGLTENIQDEFEIEFLGVTTEIFVSVAARHESPNAPFTSNSIVEPNSFRFSELSCTNDPQ >NC_010001.1|WP_012200821.1|3429862_3430933_-|glycosyltransferase-family-2-protein MITISLCMIVRNEEDNISKCLISVRDIVDEIIIVDTGSTDKTKEIVGLFTNEIYDFEWINDFSAARNFSFSKATKDYILWLDADDVLLEADRIKLKRVKEILDPSIDVVMMNYNYAFDEKGNVLLSHFRERLLKRAKNFLWNDPIHEFISFEGKVVNSDITITHKKSHMNNRRNLNILEAMLAEGKEFSPRNMFYYAREKLNVNEYEGAIEYFNKMLDSEKGLPADCISSCIYLAKAYKAKNDRKNMLKALIRSFEYDTPRAEICCQLGYYYKDIEDYKRAIFWFDLAMKLEKPESKWGPILHEYWGFIPCIELCLCYYKLGNIDEAIKFNDKAAEYKPEHPSVLQNKKAFGNIKN >NC_010001.1|WP_012200823.1|3433802_3434108_-|hypothetical-protein MPGPPTTPDAIIILLDINVVGGTFADSTIALTTQVINNGEAPIQIGLRYLLDYMIDFDDGPTFQQLGPNGPILVNETQFVLPTFEDYEIEDNDVSPNHCCL >NC_010001.1|WP_012200824.1|3434216_3435191_-|glycosyltransferase-family-4-protein MKIVQVAPDVYPIPPVNYGGIERVMYDLIEELVRRGHEVFLYAPKGSNTSARLIPYQHEKSWSQHEILKYVSATLPEDIDIIHDHTHASIIGRVGLPVPTVCTEHFSANCPVKYPVYASRTVQERYGGNQGFFIHHGIRLEDFEFKESKEDYLFYIGKLDESKGPQFAIKVSERTNKMLILAGPIHDTAYFDKAIAPVIKANPNIIFIGEVGGRRKQDLLKNAACVLFPTLCQESFGLVAIEAMACGTPVLSFPSGAVPEVLQGVPDFICTNVDEMVQKVLSGDYPKPQLLRDYVKNNFSIELMADRYIKVYMQVLALEHLYYS >NC_010001.1|WP_012200825.1|3435345_3436719_-|glycosyltransferase MENPVTSIIILAHNNFESLRKCIDSIRKYTTDGTYEIIVVDNHSTDGTAQWLQSQQDIRAIINTDNVGCPRGYNQAINIALGDAVLLMNNDIIVTPNWLKNLIQCLYSADDIGAVGPITNNCPIQQLPVKYSSIEEMFEFAKTYNISNPETWEERLKLISFCLLIKKSAIEKIGLLDEGFTPGNFEDDDLSFSLRIARYKLMLCKDTFIHNFGYITFKDYGSQSLETFKLNQKKFEDKWGFNSLYSTFARQELINFINKPKTQSFAVLDVGCACGNTLLQIKNTYPNSILYGIELNKGASEIAKTVANVTADNIESLDVHFDENYFDYILFGDVLEHLVDPWQVLLNIKRYLKPDGKILASIPNVMHISIVKKLIHGNWTYEDAGILDRTHMRFFTLKEIHKMFRDSGYSDIYVAGKLLMESKEDFELIENLCKLSNPDLSKQFEIYQYLIKASVKP >NC_010001.1|WP_012200826.1|3437021_3437582_+|ferritin-like-domain-containing-protein MHYNDYYRYNESRYDYDEPEFYDIRITANNNNNNNTATINEDIYSYPENFSNAIALIEEAIAGEEEDRLFYTYLINNAPTAEDRQIISGIRDNELRHHSLFLKLYSELTGQTAPQLPGERFVPPSSYCEGLQRSIIGEESAVAKYRQILFAMQNRVHINMLTEIITDEIRHGILYTYLYSKNNCNI |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_010001_6 | 3488497-3488597 | Orphan |
NA
Consensus repeat of NC_010001_6
|
1 spacers
spacers of NC_010001_6
>6.1|3488528|39|NC_010001|CRISPRCasFinder TATCTACTAATCTATCCTATTTCTATCTACTAATCTATC |
CRISPR arrays and Neighbor proteins around NC_010001_6
The CRISPR arrays of NC_010001_6 >merge|NC_010001|6|3488497-3488597|CRISPRCasFinder CTATTTTTATCTACTAATCTATTCCTATTTTTATCTACTAATCTATCCTATTTCTATCTACTAATCTATCCTATTTTTATCTACTAATCTATCCTATTTTA >NC_010001|6|6|3488497-3488597|CRISPRCasFinder CTATTTTTATCTACTAATCTATTCCTATTTT TATCTACTAATCTATCCTATTTCTATCTACTAATCTATC CTATTTTTATCTACTAATCTATCCTATTTTA
>NC_010001.1|WP_012200857.1|3486358_3488224_+|hydroxylamine-reductase MSQMFCFQCQETAGNKGCTLNGVCGKTAALANMQDLLIYVSKGLSEVTTKLRLEGGNISSEVNHYITLNLFTTITNANFDDEVFYQRVKETLAMKENLINQLNNKENLSEAALWTLPVDSTKDEIESMIAKSNSDEVGVLATKEEDVRSLRELITYGLKGLSAYVKHANALGYDEEAIAIFMQETLAKLLDDTLTIDELIALTMETGKFGVDGMALLDKANTTTYGNPEITKVNIGVGTNPGILVSGHDLSDLEQLLIQTEGTGIDVYTHSEMLPAHYYPNLKKFKHLKGNYGNAWWKQNEEFEKFNGPILMTTNCIVPPRASYKDRLYTTGAAGYVGCQHIDGESGSKKDFSVIIKHALQCEAPVEIETGEIIGGFAHAQVLALADAVVGAVKSGAIKKFVVMAGCDGRAKSRNYYTDFAKALPNDTVILTAGCAKYKYNKLDLGDIGGIPRVLDAGQCNDSYSLALIALKLKEVFELSDINELPIIYNIAWYEQKAVIVLLSLLYLGVKNIHLGPTLPAFLSPNVANVLVNNFGIAGIQTVEEDMDLFFGKDSSESTSDEITKDTVIGDILKINPESASTLMEAGMHCLGCPASQMETLEEACSVHGIDVEELLNKLNA >NC_010001.1|WP_012200856.1|3484851_3486042_+|hypothetical-protein MERKVGTVSRGVRCPIIREGDNLSTIVVNSVLDAAESEGFSLREKDVIALTESIVARAQGNYASVSAIATDVKNKLGGETIGVIFPILSRNRFAICLKGIAMGAKKVVLMLSYPSDEVGNELVSLDQLDEAGVNPYSDVLTLERYRELFGENKHPFTGVDYVDYYSSIIKDAGADVEIVFSNQPKTILEYTKNVLTCDIHTRMRTKRILKAAGAENVIGLDDILTSPIDGNGYNENYGLLGSNKSTEDQIKLFPRECFDLVKDIQTQIKEKTNQHVEVMVYGDGAFKDPVGKIWELADPCVSPAYTEGLEGTPNEVKLKYLADNDFKNLSGDALTEAISERIKHKEDNLVGNMASQGTTPRRLTDLIGSLCDLTSGSGDKGTPVVLIQGYFDNFTN >NC_010001.1|WP_157668819.1|3476823_3478179_-|aminotransferase-class-V-fold-PLP-dependent-enzyme MSNSSDNIRNMMFGLDALVELDNNKMVPAINLDNAATTPPFKEVIQEIERQLMYYGSIGRGKGQKSENSTEVYTNGRDIVKDFVGANSDIYTVFYINNATDGINKLASAFIESPEDIVLSTRMEHHANDLPWRERTKTVYAEVDKKGRLIVDDIKRLLKAYNGRIKYVTVTAASNVTGYVNDVHYIAKLAHQYGAKIIVDGAQIVAHRAFNMLGQTLEENIDFFVFSAHKMYSPFGGGAVVGLTDVLNKHIAKFYGGGMVEAVCDYSVRYLPAPDRYEAGSPNYPGVVGMLRAMEVLKCIGFDYIKNHEQILLRRALDGLMKLPGVILYGDNENIADRVGIAVFTLRGIKNEEVANFLAGYRAIAVRHAAFCAHPYVRRLTGGSDTSGSFCYPLEGMVRISFGIYNNETDVDTFLATIKELLYSEYLRHFARVKNNSVQLSDRLCIPYDRA >NC_010001.1|WP_012200854.1|3475325_3476234_-|chemotaxis-protein-CheV MENNILLESGTNELEVLEFTIGGNSYGINVAKIKEILPFVSPTPVPNAYPTVEGIYMPRDFIMTIIDLRKTLNLHQETEQDGKDMIIVTNFNNLHVGFHVNKVLGIHRISWGDISKPDATLSHAGMGVATGIIKISNKLILLLDFEKIVADISPETSLKVSEMDLLKNRRRCDLPIIIAEDSHLLNQLLVDCLAKAGYTNITRTENGKEAYDLLVKYKQEGIVDKKVSLIITDIEMPVMDGHHLTKLVKEDTKLSKIPVVIFSSLVNEDMRRKGESLGANAQLSKPEIGQLVAKIDELLLSE >NC_010001.1|WP_012200853.1|3473424_3475293_-|hypothetical-protein MNRCKICHAEVKDNSEYCLNCKELGLDHSYFNTLSESMEALHNSEDITGMDENYIPEDYSIFQTESEDLTNNKELQSMLNHKVFEEQEILKYSSNNDEIENETMNPNFIQDTKVENELIQSEDKFLLIEEQDAINSTKQKTDFTLEQELEEDEFDDSVENLIANLSLSEIAATTEEDISEREKMNLSDTLNDNFDNNNYKQAEGEFEVGSTPDFDGNQDILDLLNEINRTPEDNSQEDYASDVLSIDDFMDDEESKVDPMLSLYSESDDLLNSSVNDIGGIYQDALGGISDLEDVGIDEELLKLIPDMPNQDDVQPKEIENLEVKSKKSKKSEKVKNSRKKKNLFARAFGNVKEEYSEEEKEQLKQDIINDAKEKDAKAQELEKEKKATKAKKDADKLAAKKKAKDDSIKAKQAKVEKAKVKKEEKERLSKEVQELIDEIDENEGRINRIGASFVFALFASIALFVVIGTNVYTYVVNIQNATKNFDMQRYNEAYNQVYGLDIRDADIEIYDKIMTVMFVNKQLNSYNNYSAIDMYPQALDSLLKGLERYDKYYDLATKLDIQTDLDYVRDRIISELSSKFYLSVDEAYNIINSPTQLDYSMAVYNVIFEKLDNKLVRKSEK >NC_010001.1|WP_012200852.1|3472388_3473003_-|imidazole-glycerol-phosphate-synthase-subunit-HisH MIAIIDYDAGNLRSVQKALQFIGEEVVITRDHDEIMNSGKVILPGVGAFGDAMQKLHSYHLINTIKEVADCGKPLLGICLGQQLMFEGSEESEGIEGLGLLPGKIIRIPEGGGLKIPHIGWNNLNITQGDSLYQDITGTPYVYFVHSYYLKSEDRSIVAATTEYGTLIDASVEKNNIYACQFHPEKSGEIGLKILKNFASLEER >NC_010001.1|WP_012200851.1|3471603_3472380_-|imidazole-glycerol-phosphate-synthase-subunit-HisF MHTKRIIPCLDVHNGRVVKGTNFLNLRDAGDPVLVGAEYGQAGADELVFLDITASSDARTIKLDMVRKVAETVFIPFTVGGGIRSIEDFKLILREGADKIAVNTAAIMNPTLISEAADKFGSQCVVVAIDAKCRPDNSGWNIYKNGGRIDMGIDAVEWAMKANELGAGEILLTSMDCDGTKNGYDLELTKQISENVSIPVIASGGAGTKEHFYEALTRGKADAVLAASLFHYKELEINDLKEYLRMKEVSVRLEDRSC >NC_010001.1|WP_012200850.1|3470924_3471602_-|uracil-DNA-glycosylase MSMIQNDWLDSIGEEFHKPYYKQLYDFVKEEYSQTTIYPLAENIFNAFHFTPLSKVKVLILGQDPYHNVNQAHGLSFSVLPEQKDIPPSLQNIYKELQSDLGCFIPNNGYLKKWADQGVLLLNTVLTVRAHQANSHQGRGWEQFTNAIIQAVNQQDRPIVYLLWGKPAQSKIPMLTNPKHLILKAPHPSPLSSYRGFFGSKHFSQTNEFFNANGLEPIDWQIENI >NC_010001.1|WP_081428555.1|3468153_3470802_+|response-regulator MFSSVKIIFHTAQRTCARMEIIMYSLMLAIQIIALLTNFTVILVLLVKKPFRGQAIFLALCAAVLVQCFGYTLEITSTTLDSAMMSIKIQYLGSAYVNILFLSFLFDYCKLKKSRLLFCLLFLINTLILIAVVTCEYHPYYYTDVQFVQEGSFPHVIFTKGILYHLFKSEVLLINFMILFIVISHYFRQGKERRRQELNFVGACLFPSVTCASYFMSFFKEYDPSSASFVISGLLVLIAIYRHQLFDIIHTARDSVIEVMDEALVVVDADFHLLDFNPAAKKLFPELKIEVLNSPLNKLSNELDCLFHQNQIYEFQKENRTYNAHLNKIYYNDDIVGHSAWIFDITESNNYMKNLIEMREQAEKANSAKSIFLAHMSHEIRTPLNAIIGLTDILLHKDTDFELHNDILNIKHAGGTLLSLINDVLDLTKIESGKLTLVDEPYKLTSVVHEVINIIGVKLMSKPVSLQVSISDQIPKYFYGDELRLRQVLINLMNNAVKFTERGTISLQVELSSFDLDTQTAQLIFHVRDTGLGIAKDDQKRIFHSFEQGSVGSDVLVEGSGLGLTICKRIIESAGGAIKVKSELGVGSDFSFTLPQKVYSQDQLNSSSTLKPGAIYKITPPFTAPNVKALVVDDNRLNLKVASGLLKLFDISVTVALSGAECLKLIQKETYQIIFLDHMMPQMDGLETLREIRSLSSVYYQTVPVIALTANAISGNKEMFLTSGFNDYLSKPIAISHLEALLKRWLPSSLVKLQGSKISEPLYQEVADFDNIDYQSGLVNCANQTDVYLAAVKQFLHDAGTTTEQLANAKDVGDALLFTTVVHGLKSAAKTLGAIELSRISLKLEESGHKQCFDEIEELYPSFKAEYQNAISSFTNFIKEYS >NC_010001.1|WP_012200848.1|3463754_3468050_+|2-hydroxyacyl-CoA-dehydratase MLKSNYSLGIDIGSTTVKIAILDINNQMVFSDYERHFANIQGTLADLITRAKSALGDLTVAPVITGSGGLAISKHLNVPFVQEVVAVATSLKDYAPQTDVAIELGGEDAKIIYFTNGIEQRMNGICAGGTGSFIDQMATLLKTDAAGLNEYAKNYQAIYPIAARCGVFAKSDIQPLINEGATKPDLAASIFQAVVNQTISGLACGKPIRGNVAFLGGPLHFLSELKNAFVRTLNLTKEQTIAPEHSHLFAATGSAMNHNPEVTTSLQTLINHLTTGISLDFEVHRMDPLFDNEEAYEMFLHRHNTHTVKKGDLSTYQGNCYLGIDAGSTTTKVALVGEDGSLLYSFYSNNNGSPLKTTIKAIKEIYTLLPENANIVRSCSTGYGEALIKSALMLDEGEVETVAHYYAAAFFDPKVDCILDIGGQDMKCIKIKSGTVDSVQLNEACSSGCGSFIETFAKSLNYEVADFAKIALFAKNPIDLGSRCTVFMNSKVKQAQKEGATVADISAGLAYSVIKNALYKVIKIADPKDLGSHIVVQGGTFYNDAVLRSFELTSGCTAIRPDIAGIMGAFGAALIAREHYSQEETTMLPIERINELKFDSSMARCKGCTNSCLLTINKFTGGRQFISGNRCEKGVGKEKNKDNIPNLYEYKLHRYMDYEPLAKDLAPRGVVGIPRVLNMYENYPFWFTFFTKLGYRVELSPDSTRKIYELGIESIPSESECYPAKIVHGHIMWLIKQGIPYIFYPCVPYERKEIPDAGNHYNCPIVTSYGENIKNNMEEIKSENICYQNPFLSFENKEILTNRLVEYLLAEQRMASQSAFDYTSELACTNNFSNSKITETEIRAAASLAWAELEEAREDMKKQGEQTMEYLRKTGRMGIVLAGRPYHVDPEINHGIPELINSYGVAVLTEDSISHLGTVERPMIVVDQWMYHSRLYTATSYVRTQPNLQLIQLNSFGCGLDAVTTDGVSDILATAGKIYTVLKIDEVNNLGAARIRIRSLLSAVSDRNRKHIETKVESPAYHRVVFTKEMRKDYTLLAPQMSPIHFDFLEPAFNSCGYHLEVLNNDNKAAVDAGLKYVNNDACYPSLMVVGQIMDALLSGKYDPNKVAVMITQTGGGCRATNYIGFIRRALKKAGFEQIPVVSISTSGIEKNPGFEINYDMIVRAVQALVYGDIFMKVVYRTRPYEQVPGSANALHAHWKDICAKSVQNGKWKEFRKNCRGIIEAFDTLPLDESIKKPRVGIVGEILVKFLPAANNYLVDLLEAEGAEAVMPEMVDFFLYCSYDANFKAQYLGKKKIDAFYNNMIIRFLEFARKEARKAFKESKRFNPPKYINELADLAEPIVSIGNQTGEGWFLTSEMVELINSGVPNIVCAQPFACLPNHIVGKGVIKELRHRYPDSNIVAVDYDPGASEVNQLNRIKLMLATAVKNLK >NC_010001.1|WP_012200858.1|3488736_3490245_+|hydroxylamine-reductase MGNNMDLEYEMFCYQCEQTAGGKGCTKQGVCGKTAEIANLQDLLVFQIKGISCYAKEMIERGEYIDKSIVILIENILFTTLTNVNFDASVHVELLKETQKVKESLRNHVGEIHNNTAQATYNLPDTKTDMLKDAPLAGIMYDNALDPDIRSLRQTIVYGVKGISAYGHQARSLGYYSDQVDNFYILALEAVTDDKLSVEELIRWTMRIGEMAIEVMKKLDEANTNTYKNPTPHKVNVNIRKGPFIIVSGHDLKDLEMLLIQTKGKGINIYTHGEMIPSHGYPNLKKYPHLVGNFGGAWQDQQKQFDNLPGCILMTTNCLMKPRESYKDRIYSTNVVGWDGVKHIKKDEDGEKDFSEIIQQALELGGFLEDEEPHEILVGFGHHATLSYAEKIVEAVKSGELRHFFLIGGCDGARPGATALADAFQTDVNGLPLSLIVSWYEQKAVADLLALLSLGIKSIYLGPSLPAFLSPNVLQYLVDTFDIRAISTAEDDIKTCLKQSIA >NC_010001.1|WP_041703710.1|3490640_3492038_-|alpha-glucosidase/alpha-galactosidase MKYQSNMVSDLQIAYIGGGSRGWAWTFMTDLAREPKLSGTVRLFDIDKSAAEQNMFIGNSITQREDAIGKWNYETKETLEEALTGADFIVISILPGTFDEMESDVHTPERLGIYQSVGDTAGPGGIIRALRTIPMFVDIAEAVKKYAPKAWVINYTNPMTLCVKTLYHVFPEIKAFGCCHEVFGTQKVLKGIAEQVLGIEDIPRNEVHVNVLGINHFTWFDYASYQGIDLFPIYRDYVKEHFEEGFIENDANWANTTFACSHRVKFDLFQKYGLIAAAGDRHLAEFVPGDWYLKDPENVKSWKFGLTTVDWRKEDLKQRLEKSHRLVSGEEKVDLKASGEEGILLIKALCGLERVVSNVNIPNTNRQIPNIPDSVVVETNAIFERDAIRPIIAGEMPDSILHLTIPHIQNHELVLKAALTCDKELVKQAFANDPLVKGRATAEEIDLLVEDMIQGSIKYLPEGWK >NC_010001.1|WP_012200860.1|3492313_3493177_-|NLP/P60-protein MTLSEQRQLLVESLKRREYKNTYTQDSKLRLNVYENPRGYGDCSSTMFTTYKMISGINIGSYSSSQAQNKLGIIVDVAKSELPDESNLLPGDLIFFNYEKAKQNSSNWGTWKDRYLHVGHVEMYIGDGKTIGHPSGFGPRIIDMRTYCNRMFKSGETYSISKRFVFNVNSYDAKFTGIESGFYSWCMNLQNEIKVKVDGIPGPEVLSKVPLIKFGTKGKVVELVQQRLIDLGYDIGKYGKNKDGIDGIYGLKCQEAIKSIDQWVLLKNVGTDITVGTDEWKFLLNIA >NC_010001.1|WP_012200861.1|3493176_3493494_-|phage-holin MNEILFSAIQIIVVILLGLVSRYVIPWLKVKLDTEKASQILAWIQTAVTAAEQIISGESKGIEKKAFVTEYMNKLLKEKGISITEEQLNLLIESAVKALNTKGGL >NC_010001.1|WP_012200862.1|3500320_3500605_-|hypothetical-protein MVEIRIESCPFCGSNEMGWGYQSAQGAVMTGKSGYAGSKVEHLICTECGSIVHSRVAKPELFKNVIKDKAKPRIRRRKAAKVNETNSDVATMDK >NC_010001.1|WP_041703711.1|3500993_3501404_-|PH-domain-containing-protein MLVGECNMEVVYKEKKRTKLFGLPLHFVTYRIGPDKINIQSGFLTIVEDDAYMYKVQDVRLTRSFLERVFCLGTVTCYTGDKTHPELKLIHIRKSSSIKDYLMEASEEARRKRRAMHILDGQEQKDVDEDDVEDEY >NC_010001.1|WP_012200864.1|3501424_3502117_-|hypothetical-protein MLKKRLLLSLAVVSMFGLTGCASSIRLTENENNIIAEYLSGVLLSQQRSYDQALIEPSPTPIPVATVTPTPSAEKPSTVSNKGNTNGHQTGANIQANSDFTEVIGIKNLTIEYTGYDIVNSFSDEYFSLDASKGKQLMVIKFNVKNTSKNATKLQLTDAGIQYQLDIDMGTILKPQLTFILNDLRYIDLEIGGKETKEAIVIFEVPKKQEMKAANLIISKDEKTAIIKLK >NC_010001.1|WP_012200865.1|3502212_3503712_-|HAMP-domain-containing-protein MKHSLRLKITFLLTISLALTIFLCWGLNKSFLTDYYQYSKIKSLDSVFYEVNNTFNESSQKGLTQEQLVMMDSMISKNNASAYISDMDLGLVYRSNGTDRDTQRVKKSMKAYLYGNTPDTLIDKIKWIKTVDNKYDIYIQHDALMQMNYIDLIGILDTGFIVFIRTNMENLQASSAISNNFLAYVGIFVTVVGTIVMYFISRSFTKPILVLENIAKKMSNLDFNAKYEGKSQDEIGQLGNSINMLSEKLEQTISELKVANIELQSDIEDKVQIDEMRKEFLSNVTHELKTPIALIQGYAEGLKDNISEDEQSREFYCEVIIDEAMKMNKMVKKLLSLNQLEFGNNQPEIIRFDIVSLINSVIQSTDILCKQKEIRIIFEEKQPCYVWADEYMIEEVVTNYVSNAINHADGAKIVEIKLIHMENVVRVAVFNTGELIPEEDLEKVWIKFYKVDKARTREYGGNGIGLSIVKAIMNAHNKECGVVNHSNGVEFWFELDITS >NC_010001.1|WP_012200866.1|3503711_3504386_-|response-regulator-transcription-factor MERLKVLVVDDESRMRKLVKDFLSRSNYDVLEAENGEQAVDIFFEQKDISLIILDVMMPKMDGWQVCKEIRKYSKVPIIMLTAKSDEKDELLGFELGVDEYISKPFSPKILVARVEAIVRRSIQTLDETMEIGGIVIDKAAHEVKIDDLAIELSVKEFELLTYFITNRGVALSREKILNNVWNYDYFGDARTIDTHVKKLRSKMGDKGDYIKTIWGMGYKFEVV >NC_010001.1|WP_012200867.1|3504706_3505006_-|hypothetical-protein MFEFDGYRYATVKEMDLAKKEAESIAYIKGRTDFKDREKLKKLYEGLIEKQSFVTPTGINFLREVQRELNAFSDKAVSPVFVTVPTEMKKGSYVRSTSF |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_010001_7 | 3508360-3508473 | Orphan |
NA
Consensus repeat of NC_010001_7
|
1 spacers
spacers of NC_010001_7
>7.1|3508393|48|NC_010001|CRISPRCasFinder AATTTCCTTTATGTTTATTTATTTTAGTATATTATTTCTGTCTTGAAA |
CRISPR arrays and Neighbor proteins around NC_010001_7
The CRISPR arrays of NC_010001_7 >merge|NC_010001|7|3508360-3508473|CRISPRCasFinder ATTGACAAGTATGTCACATTGGGGTGATTTAATAATTTCCTTTATGTTTATTTATTTTAGTATATTATTTCTGTCTTGAAAATTGACAAGTATGTCACATTGGGGTGATTTAAT >NC_010001|7|7|3508360-3508473|CRISPRCasFinder ATTGACAAGTATGTCACATTGGGGTGATTTAAT AATTTCCTTTATGTTTATTTATTTTAGTATATTATTTCTGTCTTGAAA ATTGACAAGTATGTCACATTGGGGTGATTTAAT
>NC_010001.1|WP_157668730.1|3508121_3508274_-|hypothetical-protein MGKKLLVILSVVVLASNLLCFASVSNETKSTNQIQLCDQAGLIVNLPVKL >NC_010001.1|WP_012200869.1|3506209_3506770_+|thymidine-kinase MSKLYFKYGCMNSSKSANLLMIRHNYEEQGFNILLLKPSIDDREGKSIIKSRIGIEAECIMVKPLDSIKDIFQKNPADIIMVDEAQFLTKDQVDELYDISFQNNVLCFGLLTDFQQRLFEGSQRLIELAESLQEIKTVCACGRRATMNVRFDEHGNVITRGEQVDIGGNDKYRAMCKYCYNNLTKK >NC_010001.1|WP_157668820.1|3505059_3505851_-|peptidylprolyl-isomerase MILCGAIAISLVGCGLINKGNSEGKSSITQEEPGKEGVVEEPTLTQEAKKLYQFKDVKKGDTIAEINVKDYGTMKIKLFGKEAPKAVENFVTHAKDGYYDGVTFHRIIEEFMIQGGDPLGTGFGGESIYGEPFEDEFSNDLYPFRGALCMANSGSNTNGSQFFIVQADSEQVNLLKDLAKEYYDLSFIDYVQKAYGVKLSSNELNQFITYGGTPWLTRKHTVFGQVIEGFDVLDAIANTEKADDQGTPKNPVVIENIKISEVE >NC_010001.1|WP_012200867.1|3504706_3505006_-|hypothetical-protein MFEFDGYRYATVKEMDLAKKEAESIAYIKGRTDFKDREKLKKLYEGLIEKQSFVTPTGINFLREVQRELNAFSDKAVSPVFVTVPTEMKKGSYVRSTSF >NC_010001.1|WP_012200866.1|3503711_3504386_-|response-regulator-transcription-factor MERLKVLVVDDESRMRKLVKDFLSRSNYDVLEAENGEQAVDIFFEQKDISLIILDVMMPKMDGWQVCKEIRKYSKVPIIMLTAKSDEKDELLGFELGVDEYISKPFSPKILVARVEAIVRRSIQTLDETMEIGGIVIDKAAHEVKIDDLAIELSVKEFELLTYFITNRGVALSREKILNNVWNYDYFGDARTIDTHVKKLRSKMGDKGDYIKTIWGMGYKFEVV >NC_010001.1|WP_012200865.1|3502212_3503712_-|HAMP-domain-containing-protein MKHSLRLKITFLLTISLALTIFLCWGLNKSFLTDYYQYSKIKSLDSVFYEVNNTFNESSQKGLTQEQLVMMDSMISKNNASAYISDMDLGLVYRSNGTDRDTQRVKKSMKAYLYGNTPDTLIDKIKWIKTVDNKYDIYIQHDALMQMNYIDLIGILDTGFIVFIRTNMENLQASSAISNNFLAYVGIFVTVVGTIVMYFISRSFTKPILVLENIAKKMSNLDFNAKYEGKSQDEIGQLGNSINMLSEKLEQTISELKVANIELQSDIEDKVQIDEMRKEFLSNVTHELKTPIALIQGYAEGLKDNISEDEQSREFYCEVIIDEAMKMNKMVKKLLSLNQLEFGNNQPEIIRFDIVSLINSVIQSTDILCKQKEIRIIFEEKQPCYVWADEYMIEEVVTNYVSNAINHADGAKIVEIKLIHMENVVRVAVFNTGELIPEEDLEKVWIKFYKVDKARTREYGGNGIGLSIVKAIMNAHNKECGVVNHSNGVEFWFELDITS >NC_010001.1|WP_012200864.1|3501424_3502117_-|hypothetical-protein MLKKRLLLSLAVVSMFGLTGCASSIRLTENENNIIAEYLSGVLLSQQRSYDQALIEPSPTPIPVATVTPTPSAEKPSTVSNKGNTNGHQTGANIQANSDFTEVIGIKNLTIEYTGYDIVNSFSDEYFSLDASKGKQLMVIKFNVKNTSKNATKLQLTDAGIQYQLDIDMGTILKPQLTFILNDLRYIDLEIGGKETKEAIVIFEVPKKQEMKAANLIISKDEKTAIIKLK >NC_010001.1|WP_041703711.1|3500993_3501404_-|PH-domain-containing-protein MLVGECNMEVVYKEKKRTKLFGLPLHFVTYRIGPDKINIQSGFLTIVEDDAYMYKVQDVRLTRSFLERVFCLGTVTCYTGDKTHPELKLIHIRKSSSIKDYLMEASEEARRKRRAMHILDGQEQKDVDEDDVEDEY >NC_010001.1|WP_012200862.1|3500320_3500605_-|hypothetical-protein MVEIRIESCPFCGSNEMGWGYQSAQGAVMTGKSGYAGSKVEHLICTECGSIVHSRVAKPELFKNVIKDKAKPRIRRRKAAKVNETNSDVATMDK >NC_010001.1|WP_012200861.1|3493176_3493494_-|phage-holin MNEILFSAIQIIVVILLGLVSRYVIPWLKVKLDTEKASQILAWIQTAVTAAEQIISGESKGIEKKAFVTEYMNKLLKEKGISITEEQLNLLIESAVKALNTKGGL >NC_010001.1|WP_012200871.1|3509559_3509970_-|response-regulator MNILIVDDVAFIRIGIKSSLSKYRNLYMFDAGTYEEAVKILDEEKIDLIFLDLNLNTNSQTKLEHENGLDIVRYLMEKEIDMPYVAILSGTVNESKMREAYNLGITNIVSKPFSTESLMSIIDEVHDVMYQVPLPR >NC_010001.1|WP_157668731.1|3510196_3510319_-|cyclic-lactone-autoinducer-peptide MTKLLLLAVTAFVAIAEATSVYPCLIWILGQDEMPEELIE >NC_010001.1|WP_041703713.1|3510426_3511002_-|hypothetical-protein MNHFTYNNLEQLLITTKGYEPIRAKRAVYQTKNFLRSLIYSLLIAFIFFWFHCLKEAVLVMIILKLYRGYSGGIHVKNYMLCFFSSLLLVCAIIVITKALPLTIELEIILWLINLILWYRYVPQGTYARPIRKMELKKELKFKFFIAMVLTFSIRFLWMEIYSMCLFSMLLILSLTTPMAYKIFKVQHDRI >NC_010001.1|WP_012200873.1|3511057_3512362_-|signal-transduction-histidine-kinase MLHDTIFLLIDCLVLAFFIRSLFRKKGICRLAGFTIVSFGLSYYKLNMDMNLPFYMGEILMIFIPVAVIIILTYCLYQRNLVVSITTGILITVVIVFLQILALLITNFSLYLLSITLSIEIHRDICQILYMLGMLITAYYMWINQDNIYEKVIRYCETKSERTQRYVKYIKFGITLFLMLTFTVLGEGIYDKLGISNESFMLICFTILLAATIFLLTYYESIITSYRNRQIEERNKLNEIHQDFVDNINYFGHSYNNMMQAVNFFVNCEELKIEDVRTVLKDLLEWDEKNKINYKLKYINIPNTVVASILSMKQDYARELGVNLKVIYDGSSNVKINSKIFVDLINIIVDNAIEVAHFTEDKTVYINLIFDDNRFEFTTKNFKNYDKNGKLLKYGTSKHIGLRNIEEMVRKNISINYDIIDGEGEFEIRLIINN >NC_010001.1|WP_012200874.1|3512762_3512966_+|cold-shock-protein MNKGTVKWFNAQKGFGFITNSETGEDVFVHFSGIASEGFKSLEEGQNVTFEITKGARGMQATNVSIA >NC_010001.1|WP_012200875.1|3513221_3514868_-|putative-manganese-dependent-inorganic-diphosphatase MITNAKKVIVIGHKNPDTDSICSAISYAALKRKLTGNDYVAKRAGQINSETQYILERFKITPPEYVADVKTQVRDIEIRETEGVDDTLSLKKAWSLMRKNNVATLPITEKGKLKGIITTGDITTSYMEVYDNRILAEAKTPYINILETLEGTLLVGDEHTIFEQGKVLIAAANPDLMEDYIEENDLVILGNRYESQLCAIEMKAGCIVVCEGAKVSMTIMKLAKERGCTIISTPHDTYTVARLMNQSMPISQFMIQDNLITFRTDAYVDEIKNVMAKQRNRDFPILDHKGIYRGMISRRNLLNMERKQVIMVDHNEKDQAVDGIEDAEILEIIDHHRLGTIETMKPVFFRNQPLGCTATIVYLMYCENRVEIEPSIAGLLCAAIISDTLMYRSPTCTKFDIEAAEHLAKIAGVDVTEFAGEIFEAGSNLKSKSADEIFYQDYKDFSVGDTTFGVGQINSLNALELSEIKDRLYPYLEKAREEHGVDMIFFMLTNIIRESTELLCVGSMANQVVENAFHVKEVSNGYKLDGVVSRKKQLIPAIVAAMQE >NC_010001.1|WP_012200876.1|3514881_3515346_-|SsrA-binding-protein-SmpB MAKEGIKLIANNKKARFDYFIEETYEAGVVLHGTEVKSLRMGKCSIKESFMRIENGEVYVYNMHISPYEKGNIFNKDPLRVKKLLLHKFQINKIVGQIQQKGYTLVPLTIYLKDSLVKMEIGVARGKKLYDKRQDIAKKDQKREAEKDFKVKNL >NC_010001.1|WP_012200877.1|3515368_3517519_-|ribonuclease-R MEKEILNNKKELLLQVITDRSYRPMKFRELSSLLQVPKDERDDLKIVMDSLISDGKIMLDGNGRYKETNGNIKTGIFSGTTRGFGFVKIEGEENEEDIFIPESETKGALNKDRVQIAIFEEQSGRRREGAVISILERNVTELVGTFQKSKNFGFVIADNTKFNSDVFIPKEHTKGAVNGHKVLVQLTDYGSETKNPEGKIIKIIGHINDPGVDVVSVILENGLPTEFPDEVMKQVERIGEEVSSADIGGRVDLRNLQTVTIDGEDAKDLDDAITLSKKGDIYQLGVHIADVSNYVTEDSPLDKEALKRGTSVYLVDRVIPMLPHKLSNGICSLNAGSDRLALSCMMEIDEKGNVVGHRIAETVINVDRRMTYTSVKKIIEDHDEAEIEEYKELVPMFELMLELADILREKRRKRGSIDFDFPESKIILDSDGRPTDIKPYERNKATKIIEDFMLIANETVAEDFFWQELPFVYRTHENPDLEKIQKLSVFINNFGYTMRIGQDEIHPKELQKLLIKIDGKPEEALISRLTLRSMKQAKYTTTCDGHFGLSTKYYSHFTSPIRRYPDLQIHRIIKENLRGGLKEKRINHYESILNEVARQSSLAERRADESEREVEKLKKVEYMSQFIGQTFEGVISGVTSWGMYVELPNTVEGMIRLADMHDDYYIYDEEHYLLTGEHTKKIYKLGEAVVIRVEDTDKLMRTINFSIVGRANRIEE >NC_010001.1|WP_012200878.1|3518072_3518318_-|preprotein-translocase-subunit-SecG MEILRAIVTVLYVLICLGLVVVVLMQEGKSAGLSGSINGVADTYWGKNKGRSMEGALVKITKLLGALFIVISIVLNMNWGL >NC_010001.1|WP_012200879.1|3518397_3519942_-|2,3-bisphosphoglycerate-independent-phosphoglycerate-mutase MSKKPTVLMILDGYGLNEKTEGNAIALAKKPVLDKLMKDYPFVKGNASGMAVGLPEGQMGNSEVGHLNMGAGRIVYQELTRITKEIQDGDFFENTQLIKAVENCKKNNTALHLFGLLSDGGVHSHITHLYGLLELAKRHGLENVYVHAFLDGRDTAPTSGKSFMEALEAKMAELGVGRIASVTGRYYVMDRDNRWDRVEKAYAALVDGEGVEAANAVEAVAASYAEGVNDEFVLPTVVVKDGKAIAPIKANDSIIFFNFRPDRAREITRAFCTDDFDGFVRKSGRLPLTYVCFSEYDVTIPNKSVAFEKVSITNTFGEYLAEHGKTQARIAETEKYAHVTFFFNGGVEAPNEGEDRILVNSPKVATYDLQPEMSANAVADKLVEAITSLKYDVIIVNFANPDMVGHTGISDAAIKAVEAVDACVGRAYDALLSVDGQMFICADHGNAEQLVDYTNGEPFTAHTTNPVPFILINYDDSYTLREGGCLADIIPTLIEMMKMEQPKEMTGKSLLIKK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1006945 : 1019230
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_010001|1006945:1019230|DBSCAN-SWA GTTATATATCGCCATATAGCTTTTGGCTTCGCAATAAATCCAAAGCATATTCTTGAACATCAAAGCTATTTTGATGTTTCTTTAGATTCGCGATAAAACAGCTTGGACATACTTTAGAACCATTGTAAGCGGGATAAAAAAAACAACACTCACACTCTGAAATCGACCAAACAAGTTGATTAATTTCAGTATTAAAGAAATGTTCGTAAATTTGCTTATCGGTATCTGGAAATTGTAGCTTTGCTTTTCGCAACCAATATTTATAATCTGAGAAGGCATAGCCAGCTGCTTCTTCAGATAAATCAAAACGTTTTATAACGTCCGCTAAATTTTTGCATCCAGAATAGTGTATGACAATTCTAGGGGCAAGTAAATTTGTAGCAAAAAAATTCGCCTCTCGTTCTTCGGCTTCAGAGCGATTTTCTTTATGGTTAAGCATGATGTGACCTAGTTCGTGTATAAGAGAAAACCGTATTCTTTTATCAAGTATTTTATCATTATAGTAAATCTCATTTTTTAGGGTAAAAGCTTCTTCACTGACTAATAAACAATGTTCCTGTTTTTTTCGACTAAGCGAAGAGTAGCTCTTACAAGTAATTCCGCACTTACTTATCAAATCAAGACAGTCAAGGGGGAATGAGGAAATATTGCAGATTTGGTAAACCTCCAATATTTTTTCTAAAATATTTTGTTTGTTCAAATACTCACCTTCTAATTATTCATCGGATAAGAGGGTTCTAATGATGTCCTGTTTCTGTTCTAAAGTTAACCTTTTTCCATTTCGTGCGATTAAAGATTGAATATCTTCATACGTAGGTTCGTATGACTGTTCACTGTTACCTTCTGCCATAACTTCCATCTGCTCTACTGTAATACCTAAGTTCTTACATACAGTAATTACAGTATTTACACTAGCTTTACCAACGCCGTTTTTTAGCATGGTATAAAGTGTTGTATATGGAATTCCACATTTTTCAGCAAAAGATTTTAAACTATATCCTTTTTCTTTTATTAACTGTTCTAATATCTTAGCTTTCTCCATAGGGACACCTCTTTCTAATAACTTGATTATATACGAAATAACGAATTTTGTAAACAATATATTTAATAATTTTGAAAATAAATACGAAAATCCGAATTTTTATCTTGACATAGTTCGAAAATACAAATATAATTACTAGTACAAATACGGAATAACGTATTAAAAGAATAGAGGGTATTGAAAATAAATATTATTAGCTAAAACTTAATAAGGAGAGTGAAAATAATGAGTATTGCAGTTGCAGAGGATATTGCAAAAAGAGCGGCCAAGGAAGCATTAATGGAATATGTAAGAGAGAGTAAAGAGGTTGCGAAAAAGGAAGTTAGAAAGAAGACAAGAAAGTTAATGGCGAATTACAATAGTATAAAAGCTCACGTGGAGGAGGGGGTATCAGAAGCGATGGAGATGGAGATTGATTTTGTTCGTGAGGATTTGGACGAAGATGATCTTTATATAATGAGCATTCGTAGAAGTAGAATTCGCAGTATGATTATGATGTCACATATAGATAAATGTCTTGACTTATTAAAGGCGGAACAAAAACGAAGAGGAACCCCAGAGAAATATGAAATATATAATGGTCACTATATTAAGGAAAGAACTTATGAAGAATTAGCGGAAAGGTTTCATTGTTCCGAACGTACTGCTATGAGGTGTGTTTCTGAGTTAGATGATATGATATCTGTTCTTCTATTCGGAATGGAAGGTATTACCTACGACTGACAAAAGGCTGTCATTATGTTGTCATTTACATGTCATTAAGGCCGTGTTAAACTTGTATCATGAAAAGTTGTATGGTTTTAGAAATACACTAAATTTGTATTTATGTTATAGCAAAAGTAACGATAATCCTACTAGGAAGAACCTAGCTATGGTATTTGTTATTTTGAAACTAAGATTTTGATACATCAAAAAACATGAGAAAAGAGTGATATCTCGACTGGCAAAAGGCTGTCATAATGATGTCATTTACATGTCATTAAGCTCATGTTAAACTTGTATCATGAAAAGTTATACGATTTTCTAAAACTTAAAATTTTGCTACTTTTCGAAAGAGAGAGGAAAGTAATAGTTTATAAAGTATTCTGTCTTAGGAGTTTTCTAAGATAGAGTACTTTTTTTATTGCCAGACAATATTACTAGTACTATATAAGAATTCAAATCAGAATCTATCTTGTAGTAATTCTATGAATGTAAAGATATAATGGATATTACAAAAAAAGAATGCTTTGGAACTTAATAATTATTTGGGTTCTATCCTCTCTCTAATTTGTTTCATGTAAGTATAATTTTCGTTATGTTATAACGATTGATTGAATGAGGTTTTTTTATGTATGATTACAGGATTGTGATTGTATGGCTGTTTCTCTATATTAAGTGAGTCGTACAGTTAATGTTATAAGGAATTGCAGTTGATTTTATATAAGGCAGAGATACTCGATATTTTTTCAAGTATCTCTGCATTTTTTATTCAAGGATAAATTGTATTTTTAAGTACATTTCCTGTATCTCAGAGAAGAAAAAACAATTAATTTCATAACTTACATGAGGAATTTATATATTCAAAATCTATGTAAAAGCATGGAATACTTGAAAGGAGGTGAAACAATGAGAGTATACGAAGCGACGGTTGATAAGAGGCCAAGGGAATGTTGTGCATGCCCCATTGACGGTAAAGTACGAGCCAGATGGCCATGTGGCACTGTAATTAGGGTTAATTTTAATGGAAGTTCAAAATATATTAAAATCCCTAATGATAAGTGTTCACTTAGGCTTGAGAAACAATAGGGCAAAGTTTGGAGGTAGGAATGATTCGTAACCTATTAGATGGTGTTACCAATGAACTACTAAAATTTTCTACAGATGCAAGGATTTATGTAGGAAGTGAAAAACAAGAAAAGATACTTCCTTGTTATGTGGTAAAGCCTTCTACTATTCAATATGGAAGTGGCAGAGAAGGAAGGCGAACAAAAACATATACTCTTCAGGTTTTTTATTATCCAGAGGGTGATATCACTACAGAGGCTGTACAGATAGAAGACGAGCTTGGAGAGATATTAGAAAAAATAACAGTAAATGATATTGTAATAAAGAATTCAGGAATAAAGTCTGAAATGGTGGAGGGGGCTGTAATGTGTACTGTAGATTATATCGTTAACTATACCAAAACAGATTCAAAACCACAGACTATGAAGTCCTTATTACAGAAAGAGAGGATTGAATAGGTTGAAGAAAAAAGAAACAAGCAATGTATCTCAAAAATTTTATTCGAAATCCTCCCTGCTTGCGGCGAAAAGATATAAAGAAAGCCGCGATTTAGTTGAGGGATTATTACAAGAGGATAGAGAATATGAGATTTCAGAAGTAGATCAGATGATTGAAAATTATTTGAAAGGGAAGGTAGAGTAATATGCTAGGTGGAGGAACATTTACAAGTCAAAATAAGAAAATTCCAGGAACATATATTAATTTTGTTAGTGCCGCGAAGGCAGAAGGGGTATTATCAGAAAGAGGTATCGTAGCATATGCGGTAAACCTCGATTGGGGTAATGATACGGAAGTTTTCCAGATTAGTAAGGAGGAATTCCAGAGAAATTCCTTGAGTTTATTAGGTTATGAGTATGCGGCAGAAGCACTAAAGCCATTCCGAGACTTATTTCGACATGCAACAAGAGTTTTGTTGTATCGTTTAAATCCAGGAACAAAGGCCACAAATAATTATGCAACAGCGAAATATTCTGGTACCAGAGGAAATAGTCTTAAGGTTGTGATTGGTACAAATCCAGATGAGGAAGATAGATGGGATGTGAGTTTATATCTTGGAACGACATTGGTTGATGAACAGCGAGGTATAGCAAGTGCACAGAATCTTGTAGACAATGATTATGTCGTTTATCAAAAGGACGCAACCCTTACACAGACTGCCGGTACTCTTTTAGCAGGTGGTACAAACGGTGAAGTAACTGGTGAGAAACATAAGAGTTTCCTAAATGCGATTGAAGGATATCATTATCACATCTTAGCATGTGATTCTATGGATGAACCAACGAAGGAATATTATGTAGCATTTACAAGAAGATTACGTGAAGAATTGGGAATAAAATTTCAAACAGTTTTATTTGGTAAAGCTGCTGATTATGAAGGTATTATTAATGTAAAAAATAATGCAACTCTAATCCCTTGGGTTGCTGGTGCACAAGCTGGATGTGGTATTAATAAGTCCATCACTAATATGACTTATGATGGGGAGTTAACTATTAAAGATACTTATTCTCAGGCAAAATTAGAAGACTCTATCGAAGCTGGCGAATTCGTATTCCATAAGGTGGGACAAGAATACCGTATATTGGTTGATATTAATTCTAAGACAAGTGTGACACCAGAGAAAGGTCAGGATTTTAAGAAGAACCAAACGATTCGTATACTTGATCAGATTGGAAATGACGTTGCTTCCATTTTCAATAATAAATACAATGGTAAGATTGCGAATGATGTATCAGGAAGAGTATCTTTCTGGTCAGATCTTGTAACTTATTTTAAGCAGCTGGTTAGCATCCGTGCGATTGAGGATTTTGACAGCCAGGATGTTGAAGTATTACCAGGAACGGAAAAGACAGATGTTATTGCAAACAGTGTGATAACACCGGTTAGTAGCATGGAAAAACTTTATATGACAGTGATTGTACAGTAGGAGGGTATGGGTATGAATCAAATTACAATGAATGCGAAAGACACAATTTCATCATCACTAGCGGAATGTTTTGTTACAATTGGGGAAGAAAGATTTAATGCTTTTCATTTTACACAGTTCGAGGCAAGCTTTAAAAAGTTAAAAAAGAAGGTCCCAATCTTAGGTAGTACGGGGAAAGGAAATAAAACAACAGGCTGGGAGGGTACGTTTAAAGCCACGATGCATTATAATTCCTCAATCTTTCGCCGTATGCTGATGGCATATAAGAATACAGCCGAAGATGTATATTTTGAAATTCAAGTTACAAACGAAGATCCTAATTCCAGTTCAGGAAGACAAACCATCGTATTTAAAAATTGTAATATGGATGATGGTATTTTAGCAAAATTCGATGCATCCAGTGATGATACATTAACCGAAGAAGTGAATGGAACCTTCGATGACTTTGAGATGCCAGAAGAATTTAAGGTATTAAAGGGTATGGTTTTATAGGGAATGTGATAGTTTTCTAGGGACTAGTGTTATGATATAGGGGTTGTTTATGTATGGCAGTTATTGATATATAGGCGTGAAAGACGCTCTATTTTTTTATAGATATTATTTATGCGGAGCTAATTGGGATTCATGGGAGTGAAAAGAAACCCAAATTAGCTCTGCTTTTTTTAGAATACATTAAAAAAGCTATGTTTACATGATAAATAGTGTTCATGATTGAAAGGAGATTATTATGAGTTTACAGTTATTTATGAAAAAAAACAAAAAGGTGAAAGAAAATGTGTTTTATGCACCAACAAAGTCCTTACTAGATGAAAATGGCCTTCCTTTAAATTGGGAGTTTCGTCACGTATCCACCAAAGAAGATGAAGATATTCGAGAGTCCTGCACTATGGATGTGCAAATTACAGGAAAGCCAGGGGCATATCGCAAGAAAATTGATACCAACGCTTATATTGCTAAATTAGTTGCAGCCTCTTGTGTCGTTCCAAACTTAAATAATGCAGAGCTTCAAGATAGTTATGGCGTTAAGAAACCAGAAGATTTGTTAAAAGAGCTGGTGGATGATCCGGGAGAATACCAAGATCTTTTTGTATTTATTCAAAAGTACAATGGTTTTGATACTTCGATGGAGGAAGAAGTTGAAGAAGCAAAAAACTAATCAATGGAGAGGATAGTGAGGCTTCTTATGCTCACTATTGTCTCCATAAGTTTCATATGCTTCCTTCTGTATATTTGAGTTTAGATCGGCAGGAAAAGGCGTTTATTATAGCATCGATACAGATAAAAATTGATCACGAGAAAAAAGAATATCAGAAGATGGAAAGTAAAGCTAGGAGGTAGGCGATGGCAGATATATCTGCATTTTTTAATTTAAGTCAAAAAATTTCCGATACGGTCATAAATCAAATTACAAACGTGTATTCCAAATCAATAAGTGCTTCCGTTACTCATACCGTAAATAAGGTTATAGAGAACTCTGTCGTCAATGTCGATAGAACAATTAATAATATAGAAAAAATGGATCGCTCTATTAATATAAATATCGGTAGATTCAGAAAGTTTAAAGAAGAGACGCAAGAGCCAATAAAAACGGAAAGCTATGGTGAAGCTATTGAAAAGATAGGTATGGTTGTAGAAGCAATTAATAAAATGGGTGAAGTCCTTCAAAGAATTGAGACGCAAGAGTCAATAAAAACGGAAAAATTTGATGAAGCTATTGAAAAGATAGATAAGGTCGATGAATCAATTAATAAAATGGGTGAATCCCTTCAAAGAATTGAGACGCAAGAGTCAATAAAAACGGAAAAATTTGATGAAGCTATTGAAAAGATAGATAAGGTCGATGAATCAATTAATAAAATGGATGAATCCCTTCAAAGAATTGAGACGCAAGAGTCAATAAAAACGGAAAAATTTGATGAAGCTATTGAAAAGATAGATAAGGTCGATGAATCAATTAATAAAATGGGTGAATCCCTTCAAAAAGCTGAGATGCAAGAGCCAATAAAAACGGAAAGCTTTGATGAAGCTACTGAAAAGATAAGTAAGACCGAAGAAGCAATTAATAAAATAGAGGAAGCCCTTCAAAAAACCGAGGTGAAGAGCGAGGGGACAGGGGCAAAATTAAAAAAGAGTTTTAGTTCCATATTTAGTTCAGTAAGAGATAATTTAGGAAATAATTTTGCTGCGGTGGGAAAAGGGATAGGCTCTGTTGGAAATATAATTAATAGTGTGACATCCTTTGGTACCAAATATTTAGATAAGGTTGAAAATAGTAAGATATTAAAGACAGCAGATGCACTAGCTCAAACCAGAACAAAATTAACAGCAATGACAGGTAGTCAAGCAGAGGCTGATCAATTTCAACAAAGAATTTTTGATTCCGCACAAAATTCCAGAACTTCTTATGAGTCAACAGCCAATATGGTTCTTGGGCTAAGTGCAAAGGGCTCCTTTTCAAATAAGGAGCAGATTGTTACCTTTACTGAACTTGTTAATAAAAACAGTGTATTAGGAGGGGCAAGCGCTGAAGGTACGAAAGGCGTACAAACAGCAGTTACAGAAGCTATGGTTTCTGGAACACTTAGCGGAGAAGGATTTAATAATGTATTAGAAAATGCTTATCCAATTATAGAAAACATAGCAGCATACCTTAACAAACCAATAGAAGCAGTTCAAAAAATGGGTGCACAAGGTGAAATCAGTGGTGAATTCTTAGCAAATGCTATGTTTGCTTCTGCACAAAAAACGAATGAAGAGTTTAGCAAAACTCCTATGACCTTTGAACAATTGATTAGTTCAATAAAAGATAAAGCTCTGATGGTATTTCAACCAGTATTACAAAAGATAAGTGAATTGACACAAAATCAAGAGTTTATGAACATGATACAAAATATTATGAGTGGGTTAACTTTTGTGGGCGATTTGGCATTAAGGATTGTTGGCGTATTAATAAATGCTGCAAGTGCAATTGTTGATAATTGGTCCTGGATTGCTCCTATGATTCTTCTAATTGCAGTTGCTTTTGGAATATGGAAGTTATCTGTTCTACTAAGTAGTTTTAGTATTAAAGAATTAACTGCTTCCTTGCTGGCATGCCCATTGGTATGGATTATTGGTATTATTATGGCTATTATAGCAGTCATCAAGATCGTAATAGATCACATAAATAAGGTTGGAGATAAGACATACACTGTAGCAGGCGTTATTTGCGGAATTTTAGGTGGAGTGGGAGCCTTTGTTTGGAACTTATTTTTGGGATTAGGAGATTTTATTCTTAGTTTTGTAAATCTCATTGCAAATGCATTTATAGGAGTTGCGAACTTTTTTGCTAATGTATTTAAGAACCCGATATCTTCCATTATCTATTTATTTCAAGGAATGGCTGACGGAGTATTAGGTATTCTGGAAGGTATTGCAAATGCGATTGATTTTGTCTTTGGTAGTAATTTTGGTGGAACAGTTGCTGGTTGGAGAAGTGGACTAAAAGACATGGCTGATGCAGCGGTTCAAAAATTAGCACCAGATGAAAAATATGAACAAAAAATTGATTATCTTAATTTATCCATGGAAAGCTTTGGCCTTACGAGAGCAGAATATTCAGATTGGTGGGATAAGGGGAATGAATTTGGTAATAAAATCAATGATCTCTTTAAAGGAAGCACGGGAGATGACAAGAGTTTCGATGATACTTGGGATGGAATTCTAAAAAATACAGATAAAATCGCTCATAATACGGAACTTCAACCAGATGATTTGTCCTATTTACTCGAACTTGCAGAGCGTGATGCAATCAACCGTTTCACAACAGCGGAAGTTAAAATTGATATGGGTGGTGTTTATAATACGGTATCAAGCAAACAGAATCTGGATGGAATCGTAGAGTATCTGACGGATAAGTTACGAGACGAACTTAATAATACTGCAAGAGCTTGTAACGCTTAAGGGGGGAGTTATGTATAAAGTGTTTTTAAGTGATATGCTATTGCCTGTCACACCATCAAAATTTGTTACAAAAATTAAGAATCAAAATAAAAGCATTCAATTATTGAATGAAGAAGAAATAAATCTTATAAAACCAGCCGGTCTATCGGAATTTAGCTTCTCTTTTCTTCTACCAAATGTAAGATATCCTTTTGCTTTGTATGATTCCGAATTTCATATGGCCAATTGGTATACAGAGAAGTTAAAAATCCTTAAAAACGGTAAATTTGCCTTTCCTTTTATCGTATCGAGAATGTCAAGTAAGGGCGTGTGGATGTTTCATACCGATACTATTGTTACATTAGAAGATTATAGCATCACAGAAGATTGCGATAATGGGGTAGATCTGATAGTAGATGTTACTTTAAAAAACTACCAGCCTTATGGTGTGGTCATTGTTCCCTTAAAAAATAAAGGGGAAGCAGTGAAGAAAAATGTACGTATGAAAAGCAAAACAATGCCATCAACATACACAGTTAAGCCTGGAGACACGTTATGGAAAATAGCAAAGGAATTATTAGGGGATGGTTCAAAATGCTATAATCTTGCGAAATTAAATAACATAAGTAATCCGAATCTTATCCGAGTTGGGCAGGTGCTACGAATCGAAAATGTGAGTACTTCCACGCAAAGTGCAACACGGAATGTATCCTTAGCAAATAAAAGTACTAGTTATAACTCTGCATTATATAATGATGTGACTTTGTTAACTCGTCTTATAGGGTACAATAATGTACCGATTCCTGAAGCAAATGAGAAAGGACTGATGAAAGTTTCATCAATCAAAAAGCCTGCTATCATCACAAGACCCAATAGCTGTGTAAGATAAGTGGAGGTGATTTTATGCTTAGTTTAGCGATAGCAAATGGGGATTATCTATATTATCCATCGGTGCAAGGTGGTGTGACTTGGGATACAGAGCGAAAATCTTCCCCGGGAGTACTAAAATTTAATATTGTAAAAAGTCCTCTTATTAAAGTAGAAGAGGGAAATGCGGTTCTGTTTCGGGAAGATGATAAGGATATCTTTTTTGGTTTTATCTTTAGTCGTTCTGAAACAAAGGATGATTTAATTCAATTAACAGCCTATGATCAATTAAGATATCTAAAAAATAAAGATAGTTACGTTTATTCGAATTGGTCCACAGGGGAATTAGTCAAAAAAATAGCAAGAGATTTTTGCATGAACGTTGGTGAAATAGCCAATACGGGAGTGAAATTATCCCGAACCGAAAGTGATACAGAATTATTTGAAATGATCAATAACTCTCTTGCCGAGACAACACTTAAAACAGGAGAGCTTTTTGTTTTATATGATGATTTTGGTAAGCTTTGTTTAGAGAATAAAAATCTAATGCTCCTTGATGTTTTGATTGATGTTAATACAGCAGGAGATTATTCCTTTACAACGAGTATTGATGAAAATACGTATAATTCCATTAAATTAACGTGTGAAGATCAAGATTCAGGCAAACGTAGTGTCCATCTAAAAGACGATCTTGAGAACATAAAGAAGTGGGGTGTTTTACAGTTTACAGAAAATGTGAATAACAAAAATACGATGAAGGAACGAGCAGAAGGTCTGTTAAAGCTTTATAATACTCCTAAAAGAACTCTGCAGGTGAAGAACTGTTTTGGGGATAGCAGAGTAAGAGCGGGAACTAGTGTGGTAGTTCCCCTACTGGATGAAAATGGCGTTATAAGTCCACGGTTTATGATGGTTGAGAGTGCAAAGCATACGTATGCCAATGATGAACATTTTATGGATTTAAATTTAAGGGGAGGAATTATGAATGGGTGATTTAATACAAATCATTAAGCAGGCTGCGTTTGATGCGATTGAAGCGTCGAAACCAGCCTGTTTTTTGTATGGTGTCGTCATAGAAACAAAACCTTTAAATATTCAGATAGATCAGAAGTTAATCTTAACTTCTGATTTTTTGCTTTTACCAGAATACCTTACGAATCATGAAATCATATTAAAAAGCTCAGATGGTTTAAAGTCAAAGTTTCTTTTAGAGAACGGACTTAAAAAGGGGGAAAACGTAATTTTGTTACAGCAAAAAGGAGGACAACGATTTCTTGTACTCGATAGGATGGTGATATCATGATTCCGGAAATTGATATGTCGATTCAAAATGTAAAATTAACAAACCAACCAACAAAAACATACGCACTCGTTGGAGATAAGATTGTTGGGATGATCGATGATGTAGAAGCGATACGACAGGCAATTTATCTTACCCTTAGCGTGGAACGTTATGAATATCTCATTTATAGCTGGAGTTATGGAGTTGAATTGAAAGAACTGATTGGTAAGGATGTTGCATTTGCTTACCCGGAAATTAAAAGGCGTGTCGTAGAAGCCTTGATACAAGATGATAGGATTCTGGATGTTGATAATTTTACCTTTCAGAAAGAGAAAGAAGGCGTTTTAGTTGTTTTTACTGTCCACACCATATACGGAGATTTGTTAGAAGAAATGGGGGTGATGATTTAATGTATGAAGAAATGACCTATGAAACTATTTTAAGTAATGTTCTAGCAAAAGTACCAGCAGATATAGATAAAAGAGAAGGATCTATGATATATACAGCGTTAGCTCCAGCATGCATAGAATTAGCGCAGTTGTATCTTGAACTTGATTTAATCTTAAATGAAACATTTGCAGACACTGCATCAAGAGATTATTTAGTGCGTAGAGCATTGGAGAGAGGGATAAAGCCGAAAGAAGCTACTTATGCAGTTGTAAGGGGAGAATTTAATATAGATGTACCAATCGGGTCAAGATACAGCCTAGATAAATTCACTTATATTACGAAGAAGAAACTATCGGATGGTGTATATGAGCTGGAATGCGAGGTTTCTGGGAGTACGCCGAATGGTTCCATTGGTAAGTTAATACCGATTGAGTATATTGATGGCTTGGAAACCGCTATGATAACGGAAATTTTAATACCCGGTGAGGACGAGGAAGATACAGAAATATTTCGAAAGAGATACCTTGATAGTTTTGATGCACAAGCCTTCGGTGGAAACCGTACCGACTATAAGGAAAAGGTATTGAGATTATCCGGAGTAGGTGCGGTTAAGGTATATAGGGCGACGAATGTATCAGGAGAAGAAGCGGGTGGGAATGTTAAATTAACTATCTTGGATTCTTCCTTAAATAAGCCTAGTGGCGTTTTGGTAAATATGGTCCAGACAGCAGTTGACCCAACAAATAATTCTGGGGATGGAGAAGGGTTTGCCCCTCTATGGCATTTTGTACATGTAATTGGAGCAGAGGAAACAAAAATAGACATTACAACTTCAATCACTTATCAGGCAGGATATAAGTTTGAGGATCTAAAAAGTTACATTGAAGAAGTTATCGATGGATACTTTAAGAATTTAGTAAAGTCATGGCAGGAGAGTGACACTCTAGTTGTGAGAATCTCACAGATTGAGAGCGCAATACTTGGAATCACTGGAATCATTGATGTAAATAATACTTCTATCAATGGAGGTAAGGTTAACATCGAACTAAATCCAGACTCTATACCAGTAAGGGGGTCTTTTAATGAAAACTAA
Protein sequences of DBSCAN-SWA_1 >NC_010001|1006945:1019230|1010396_1011677_+|WP_012198817.1|DBSCAN-SWA MLGGGTFTSQNKKIPGTYINFVSAAKAEGVLSERGIVAYAVNLDWGNDTEVFQISKEEFQRNSLSLLGYEYAAEALKPFRDLFRHATRVLLYRLNPGTKATNNYATAKYSGTRGNSLKVVIGTNPDEEDRWDVSLYLGTTLVDEQRGIASAQNLVDNDYVVYQKDATLTQTAGTLLAGGTNGEVTGEKHKSFLNAIEGYHYHILACDSMDEPTKEYYVAFTRRLREELGIKFQTVLFGKAADYEGIINVKNNATLIPWVAGAQAGCGINKSITNMTYDGELTIKDTYSQAKLEDSIEAGEFVFHKVGQEYRILVDINSKTSVTPEKGQDFKKNQTIRILDQIGNDVASIFNNKYNGKIANDVSGRVSFWSDLVTYFKQLVSIRAIEDFDSQDVEVLPGTEKTDVIANSVITPVSSMEKLYMTVIVQ >NC_010001|1006945:1019230|1018156_1019230_+|WP_012198825.1|plate|DBSCAN-SWA MYEEMTYETILSNVLAKVPADIDKREGSMIYTALAPACIELAQLYLELDLILNETFADTASRDYLVRRALERGIKPKEATYAVVRGEFNIDVPIGSRYSLDKFTYITKKKLSDGVYELECEVSGSTPNGSIGKLIPIEYIDGLETAMITEILIPGEDEEDTEIFRKRYLDSFDAQAFGGNRTDYKEKVLRLSGVGAVKVYRATNVSGEEAGGNVKLTILDSSLNKPSGVLVNMVQTAVDPTNNSGDGEGFAPLWHFVHVIGAEETKIDITTSITYQAGYKFEDLKSYIEEVIDGYFKNLVKSWQESDTLVVRISQIESAILGITGIIDVNNTSINGGKVNIELNPDSIPVRGSFNEN >NC_010001|1006945:1019230|1010212_1010395_+|WP_041703176.1|DBSCAN-SWA MKKKETSNVSQKFYSKSSLLAAKRYKESRDLVEGLLQEDREYEISEVDQMIENYLKGKVE >NC_010001|1006945:1019230|1009794_1010211_+|WP_012198816.1|DBSCAN-SWA MIRNLLDGVTNELLKFSTDARIYVGSEKQEKILPCYVVKPSTIQYGSGREGRRTKTYTLQVFYYPEGDITTEAVQIEDELGEILEKITVNDIVIKNSGIKSEMVEGAVMCTVDYIVNYTKTDSKPQTMKSLLQKERIE >NC_010001|1006945:1019230|1006945_1007644_-|WP_012198813.1|DBSCAN-SWA MNKQNILEKILEVYQICNISSFPLDCLDLISKCGITCKSYSSLSRKKQEHCLLVSEEAFTLKNEIYYNDKILDKRIRFSLIHELGHIMLNHKENRSEAEEREANFFATNLLAPRIVIHYSGCKNLADVIKRFDLSEEAAGYAFSDYKYWLRKAKLQFPDTDKQIYEHFFNTEINQLVWSISECECCFFYPAYNGSKVCPSCFIANLKKHQNSFDVQEYALDLLRSQKLYGDI >NC_010001|1006945:1019230|1015653_1016487_+|WP_157668773.1|DBSCAN-SWA MLLPVTPSKFVTKIKNQNKSIQLLNEEEINLIKPAGLSEFSFSFLLPNVRYPFALYDSEFHMANWYTEKLKILKNGKFAFPFIVSRMSSKGVWMFHTDTIVTLEDYSITEDCDNGVDLIVDVTLKNYQPYGVVIVPLKNKGEAVKKNVRMKSKTMPSTYTVKPGDTLWKIAKELLGDGSKCYNLAKLNNISNPNLIRVGQVLRIENVSTSTQSATRNVSLANKSTSYNSALYNDVTLLTRLIGYNNVPIPEANEKGLMKVSSIKKPAIITRPNSCVR >NC_010001|1006945:1019230|1007659_1007986_-|WP_012198814.1|DBSCAN-SWA MEKAKILEQLIKEKGYSLKSFAEKCGIPYTTLYTMLKNGVGKASVNTVITVCKNLGITVEQMEVMAEGNSEQSYEPTYEDIQSLIARNGKRLTLEQKQDIIRTLLSDE >NC_010001|1006945:1019230|1012404_1012833_+|WP_012198819.1|DBSCAN-SWA MSLQLFMKKNKKVKENVFYAPTKSLLDENGLPLNWEFRHVSTKEDEDIRESCTMDVQITGKPGAYRKKIDTNAYIAKLVAASCVVPNLNNAELQDSYGVKKPEDLLKELVDDPGEYQDLFVFIQKYNGFDTSMEEEVEEAKN >NC_010001|1006945:1019230|1017450_1017768_+|WP_012198823.1|DBSCAN-SWA MGDLIQIIKQAAFDAIEASKPACFLYGVVIETKPLNIQIDQKLILTSDFLLLPEYLTNHEIILKSSDGLKSKFLLENGLKKGENVILLQQKGGQRFLVLDRMVIS >NC_010001|1006945:1019230|1016501_1017458_+|WP_012198822.1|DBSCAN-SWA MLSLAIANGDYLYYPSVQGGVTWDTERKSSPGVLKFNIVKSPLIKVEEGNAVLFREDDKDIFFGFIFSRSETKDDLIQLTAYDQLRYLKNKDSYVYSNWSTGELVKKIARDFCMNVGEIANTGVKLSRTESDTELFEMINNSLAETTLKTGELFVLYDDFGKLCLENKNLMLLDVLIDVNTAGDYSFTTSIDENTYNSIKLTCEDQDSGKRSVHLKDDLENIKKWGVLQFTENVNNKNTMKERAEGLLKLYNTPKRTLQVKNCFGDSRVRAGTSVVVPLLDENGVISPRFMMVESAKHTYANDEHFMDLNLRGGIMNG >NC_010001|1006945:1019230|1013018_1015619_+|WP_012198820.1|DBSCAN-SWA MADISAFFNLSQKISDTVINQITNVYSKSISASVTHTVNKVIENSVVNVDRTINNIEKMDRSININIGRFRKFKEETQEPIKTESYGEAIEKIGMVVEAINKMGEVLQRIETQESIKTEKFDEAIEKIDKVDESINKMGESLQRIETQESIKTEKFDEAIEKIDKVDESINKMDESLQRIETQESIKTEKFDEAIEKIDKVDESINKMGESLQKAEMQEPIKTESFDEATEKISKTEEAINKIEEALQKTEVKSEGTGAKLKKSFSSIFSSVRDNLGNNFAAVGKGIGSVGNIINSVTSFGTKYLDKVENSKILKTADALAQTRTKLTAMTGSQAEADQFQQRIFDSAQNSRTSYESTANMVLGLSAKGSFSNKEQIVTFTELVNKNSVLGGASAEGTKGVQTAVTEAMVSGTLSGEGFNNVLENAYPIIENIAAYLNKPIEAVQKMGAQGEISGEFLANAMFASAQKTNEEFSKTPMTFEQLISSIKDKALMVFQPVLQKISELTQNQEFMNMIQNIMSGLTFVGDLALRIVGVLINAASAIVDNWSWIAPMILLIAVAFGIWKLSVLLSSFSIKELTASLLACPLVWIIGIIMAIIAVIKIVIDHINKVGDKTYTVAGVICGILGGVGAFVWNLFLGLGDFILSFVNLIANAFIGVANFFANVFKNPISSIIYLFQGMADGVLGILEGIANAIDFVFGSNFGGTVAGWRSGLKDMADAAVQKLAPDEKYEQKIDYLNLSMESFGLTRAEYSDWWDKGNEFGNKINDLFKGSTGDDKSFDDTWDGILKNTDKIAHNTELQPDDLSYLLELAERDAINRFTTAEVKIDMGGVYNTVSSKQNLDGIVEYLTDKLRDELNNTARACNA >NC_010001|1006945:1019230|1008211_1008709_+|WP_012198815.1|DBSCAN-SWA MSIAVAEDIAKRAAKEALMEYVRESKEVAKKEVRKKTRKLMANYNSIKAHVEEGVSEAMEMEIDFVREDLDEDDLYIMSIRRSRIRSMIMMSHIDKCLDLLKAEQKRRGTPEKYEIYNGHYIKERTYEELAERFHCSERTAMRCVSELDDMISVLLFGMEGITYD >NC_010001|1006945:1019230|1009594_1009774_+|WP_041703174.1|DBSCAN-SWA MRVYEATVDKRPRECCACPIDGKVRARWPCGTVIRVNFNGSSKYIKIPNDKCSLRLEKQ >NC_010001|1006945:1019230|1017764_1018157_+|WP_012198824.1|DBSCAN-SWA MIPEIDMSIQNVKLTNQPTKTYALVGDKIVGMIDDVEAIRQAIYLTLSVERYEYLIYSWSYGVELKELIGKDVAFAYPEIKRRVVEALIQDDRILDVDNFTFQKEKEGVLVVFTVHTIYGDLLEEMGVMI >NC_010001|1006945:1019230|1011689_1012169_+|WP_012198818.1|tail|DBSCAN-SWA MNQITMNAKDTISSSLAECFVTIGEERFNAFHFTQFEASFKKLKKKVPILGSTGKGNKTTGWEGTFKATMHYNSSIFRRMLMAYKNTAEDVYFEIQVTNEDPNSSSGRQTIVFKNCNMDDGILAKFDASSDDTLTEEVNGTFDDFEMPEEFKVLKGMVL |
15 | Clostridium_phage(54.55%) | plate,tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
3592260 : 3637196
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_010001|3592260:3637196|DBSCAN-SWA TATGATTTTTGTAGGAATTGACGTTGCTAAAGATAAGCATGATTGCTTTATCACCAACTCTGACGGTGTGGTCTTATTCAAAGCCTTTACCATACCAAACAATCTAGAAGGGTTCAATAATCTTTATCAAAAGATAAAATCCGTTATGGAAGATGAACACAAAGTAAAAGTAGGCCTAGAAGCCACTGGACACTATAGTTACAATCTTCTTGGATATCTCATCGATAAAGGTTTCCCGACCTACGTTATCAATCCGTTACATACCAATCTGTACAGAAAAAGTCTAAGCCTTAGACAGACGAAAACGGATAAGGTAGATGCCCATACAATTGCTTCTATGCTTATGTCTAATGTGAGCTTAAAGTCCTACTCAGACACATCATACCACAACGAAGAATTAAAATCATTAACTCGCTATCGATTTGACAAAGTAAAAGAACGTTCAGCTCTCAAAGTTTCTGTATCAAGACTTGTTTGTATTCTTTTCCCCGAATTAGAAAAGCTTGTTACAACACTTCATATGGCATCAGTTTATGCCTTACTTTATGAATTTCCTGGAGCCCAACAAGTAGCTTCTGCACATCTAACCCGACTATCAAATCTTCTTGAAGAAGCTTCAAAAGGTCGTTATCGCAAAGATATCGCCATTTTGTTTAGAGAAGCTGCAAGAAATTCCATTGGTTCAAATATGCCTGCAAAATCTCTCGAACTTAAGCACACTATTAGGCTTATTCAGGAACTCGATTCTGAGATTGATGAAATTGAAAACGAAATCAATCGTATCATGGACGAAATCAACTCCCCAATACTTAGCATTCCAGGAATCAGCTACCGTATGGGGGCCATAATCATCGTGAGATTGGTGATTTTAACAGATTTGACTCTCCAGATAAGATATTGGCTTATGCTGGTTTATCCCCCTCAACTTATCAATCCGGTCAATTAGATAGCTCCCATTCTCGCATGGAAAAACGTGGTTCAAGATATCTAAGATATGCTTTATTCAATGCTACTAAATATGTTTGCCATTGGGATCCGGCATTTAGTACATACCTAGCTAAAAAACGCGCTGAAGGAAAACACTATAATGTTGCTATATCTCATGCAGCCAAAAAGTTAGTTCGAGTCATTTATCAACTCGAAAAATCCGGACAACAATACATAAAGTCAGCCTAATCATCTCTAATTTCATATCTTCTTTTTGAGCACCGATATCGATGCTCTATTTGTCATGCAGTTTACAAGGTTCTGATTGCAATGAAAAGTCATTGCCACCTTATTCATAAGCATATCTGTTTTTTAACTTTTTACTTGACTTCTTTTAATAGTAGTCTCTTTTTATATTAAGCTATTATGCTTCGTTTAGTAATTGTATGAATTCTTCTTCTGTAATAACAGGAATTCCAAGCTCTCTTGCTTTCTTATTTTTCGATGAACTTGAAGTATTATCGTTGTTAATCAGATAATTAGTTTTCGCTGTAACAGAACCAGTCACTTTTCCCCCGTGCTGTTCGATTACATCCTTTACTTCATTTCGATTTGTAAAATGTTCTACAGAACCAGTGATAACAAAGGTAAGTCCTTTTAGTTTTTCAGAACCCCCTGCTGCCTCTTCTTTTTCAAATGTTATTTCCTTTAATAAATCTTCTATAATAAGTTTATTTTCCGGTTTTTGAAAATAAGACACAAATGCTTCTGCTATGACATAACCGATCTGTGGTATCTGTGTAAGCTCATCAACCGTAGCATTTGATACTTTATTAAAATCATAACCAAACTCTTTACAAATCAGTTTCGCATTTGCTAATCCAACATTTGCTATGCCTAAGCTATATATAAATTTAGCTAAAATTGGCGTTCTCGCTTTTTCCACAGAGTCTACAAGATTTCGGAATGACTTCTCGCCAAAGCCCTCCATGGTTGTAATCTCTTCTTTAAAATCTTTCACGTGAAAAATGTCTGCTAGTTCCTTGATAAGCCCTTTTGCAATCAATTTCTCTATGGTAGCCTCTGATAGACCTTCAATATTCATAGCATCACGGCTTACAAAATGGGTAAAAGACTTTATTTTCTTTGCAAGACACTCTGGATTCATACAGTACAATGACTTCACATCGCTATCTTGTTTTATCATCGTATCCCCACCGCAAACCGGACAAGTCTTTGGTATTGGTAAATGTCCACTTCTTGTTAGGTTATCTGCAATTTGCGGTATAATCATGTTTGCTTTATAAACCGTTACCTTATCACCTAATCCAAGCTCTAGTCCCTCCATAATGCTGATATTATGAAGGCTCGCACGACTAACTGTTGTTCCTTCTAATTCTACAGGCTCAAAGATAGCGACTGGATTGATCAGTCCTGTTCTTGAAGCACTCCATTCAATTTCTTGTAAGGTAGTTTCCTTAATTTCATCTCTCCATTTAAATGCTATTCCGTTTCTTGGGAACTTCGAAGTTCTTCCAAGTGATTCACCATATGCAATATCATTAAAAAATAGCACTAGACCATCTGATGGAAAATCATTCGTGATAATATGGCTTTCAAAATACTCCATCGTTTCTGCCATATTCTCTGCGGTTACAAGTTTTTCTTCCACCACATCAAATCCTAGCTCTTTTAACCAGTTGAATTGCTCCATCATCGTTTTGAAACGATTCATCTCATCCATTCGTATAAGAGCAAATGCGAAGAAATTAACATTACGCTTTGCTGTTATCGCGTTATTTAACTGTCGTACAGAACCGCTACAAAGATTTCTTGGGTTCTTATATTTTGCATCTGCGTCCGGGATTTGTGCATTTATCATTTCAAAATCCGAGTAACGAATTACTGCTTCTCCTCGGATAATCAGTTCACCTTCATATGGAATTGTTAGTGGAAGATTTTTAAATACCTTTGCATTGTTCGTTACAACTTCTCCTACTTCACCATTTCCTCTTGTTACAGCTTTTACTAACTCACCATTTCGATAAGTTAAAACAATGGTGAGACCATCCATCTTCCAGGAAAGAATTCCTTCTTTATTACCAAGCCAATCAATTAAAGCTGGTACTTCTTTTGTCTTATCAAGAGAAAGCATCGCAGACTCGTGACGCTCCTTTGGTAACTCGCTTAATATTTCATAACCAACTTTTCTAGTCGGACTTCCTGCTAAGACAATTCCAGTTTCTTCTTCAAGCTTTTTTAACTCATCATACAATTTATCATACTCAAAATTGCTCATGATTTCCCGGTCTTCCTGCTCATAGACTCTTGCTGCCTCGTCAAGAAGTTCGGTTAATTGGGTGATACGTTGTTTTTTTCCCTGCATTGTCTTGCTCCTTTTTTATCATTTCTCTAATACTCTTCATGTGTCCAATATTTAATCTTGTGAAATGTCTTTATCTCACACTACCTAAATACACACTTACTTTATTACATTCTGATCTTATTTATACTTCACTTATTTCACACTGTCTCTGCTACAACTACCTTTATTCCATAAAGCCTTATTGTTTCAACTTACCTTTTGTAATCTGATTTCTTAGTTGCAAACCGATTACTACTTGCATTTCCCTTTGAGTAGCTCCTAGGCTGCTCTGTTTTGCCTCTTGATGCGAACTTATTATCCTCTGATTTGATCTTAGAAACGAACCTTTTATCCTCAGATCTGTTCTTTGCGCCAAACTTATTCCCCTCTGAACTATTCTTTGAACTAAACTTATTTCCTTCTGAGCTATTCTTTGAACCAAACTTATTTACCCCAGAACTATTCTTTGAACCAAGCTTATTTACCCCAGAACTATTCTTTGAATCAAACTTATTTCCTTCTGAGCTATTCTTTGAACCAAACTTATTTCCCTCAAAGCTATTCTTTGAGCCAAACTTATTTCCTTCTGAGCTATTCTTTGAACCAAACTTATTTCCCTCAAAGCTATTCTTTGATCCAAACTTATTTCCCTCTGAGCTATTCTTTGATCCAAACTTATTTCCTTCTGATTTATTCTTTGAAGCGAATTTATTTACCTCTGCTTTATTTCTTGTAGCATCGCTTCTTTTCTTATCATCTTTCTTTTCTGCGAAGTATTTATCTTTCGTTCTTCTCTTTCCAGTAGCTACATTGTTACCGTGTGTCTTAGCTATATTTGCCACGTTCACTCTTTCAATTGGGAATTCATCATCTGGTTCTAAATTTAAGGCTTCATTGTTTGATAACTCAATTTGTTCCTTTATTCCCTCAAGCTCTTTATCCGTTAAGTTACGATATCCACCAACTTTTAGCCTTCCAAGATTAACATTCATAATTCGTATACGTACTAGTTTAGTAACCTTATAATCAAAATGTTCGCACATACGACGAATCTGACGATTCAATCCTTGTGTTAAGATAATTCTGAACGTAACCTTATCAATTGCCTCTAATGTACAAGGTTTGGTTACTGTATCAAGGATTGGTACTCCTCCTGCCATTCCTTTTAAAAATGCAGCAGTAATCGGTTTATTTACTGTTACGATGTATTCTTTTTCGTGATGATTTCCCGCACGCAGGATTTTATTAACGATATCACCATTATTCGTTAAGAAGATAAGGCCTTCGGAATCTTTATCAAGTCTTCCGATTGGATAAATACGCTGACTGTATTTAATATAGTCTATAATATTATCTGGTTCCCTTTTCTCTGTCGTACACACGATACCCTTTGGTTTATGAAATGCAATAAGAACCATCTTTTCTTCTCTACTTACTTCTTTCTTACCAAGCACAACCTTCTGCCCTGGTAATACTTTGCTTCCCATCACAGCAACTTCGCCATCAATGGTAACCTCGCCTTTTGCTATTCTAGTGTCTGCTTCTCTACGAGAACAGATTCCAGATTCACTAAGATATTTATTAATACGGACAGCTTTCTCTCCCGTCTCTTCTGTATCAAATGATCGTTTTGTAACCTTCGCCATGTTTCTTATTTTCCTTTCCATCTGAGAATCTCATAAGGTTTCTAAAATTTCGTGGAATCTAATCCAAATAAAAAATTTCATAGAACCTATTTCTAAGAAATAAGCCTTCCAATAAATTTTTTAAAGTCAATTCTTCAAAGAAATATAACCTTATTTTATTTTCTCACCAGTTGTACTATTATATCATTATTTTCTTATCATTCCAATCATTTCTTATATTCCTATTTACTAGTGTTTACTATTCATTAGACCCCATGAATATCCCCCTTATGAGGTTATTACTAATCAATATGTGGCTGTCAATGTGCATTCTTTCAGTTAATCTTTATTTCACTATATAATTTCATATTTCACGAAATATGAAAAGCTTATTTTCATACTTGGTGAACTTAATAGTTATATTTACTGAAATGTTTAGTAATATTTGTGTCTTTTGCTAATTTACACAATTTGTCATTTATTGTAAACTTATAATATGTTAAGTATCTCTTAAATTTTAAATCCGGATAAGAGCTACTTGATATATTCTATTACTTATAGTTGAGTTTTTTATGAAAGGAGTCGTACTATGAAAAAATTATTTAAACGTCTTATTGCATTTACATTTGTCGCTGCTTTACTTTTTACATCATTAGCACCTTGCTCTAATGTATTCGCTGCAACTAATACTGGTTCTGATTCACACGATATGGTATTTAAAGCAAATTCTAATACTTTCGCTAAAGTTACTGTTACAGTAAAATATACATATACCGGAAGTAGTGCTGAAATAACTTCCATTACGAGATCCCTAACTACATATGATACTTCAACTTATACCGTGCAGTACGATAGTATTGCTTCATCATATAGTGGAGCAAGTGGAACTATAACCTATAAAATTTATAAAAATGGTGTATATTATAGTTTTGCAACTTTACTAGTATCTGTAAGCCGTGATGGAACTGTTAGCTGGAACACTGGAACTTGATTTTATTTAGTATTATTAAAAATCCGCCGGTTAGGGAGTAATCCTTAACCGGTCCTTTTTAATTTTTAGTATGCGTTTTTAATAGAAATAAAACCCTATTGTTGTTCCAATTATGTGATACATACCTGCATTTCATGAAATATTTTAATTCGTTTTTTGACTTTACAAATCTTTCATTTGTCGATATAATATTGGAGTACTATAAACAATTTAATATAAATAGCCTTTCTTAGGCATATCTTGCATCTGTAGCTCAGTGGATAGGGCAGCAGCTTCCGAAGCTGAAGGTCGTAGGTTCGAATCCTATCAGGTGCATTTTTTATACTCAAAATTAGCCTTTCTTTTATTTTTTCATAAAAAATCTCATTTTCTTCTTTGATATTACCTTTCCCCATGTAAATTTTTGTTACATCTTTGTTACAAATTATTAATCTAATGTTACCAAAATATGAGATATCAGTTGATTTTTGGGACATCTTTTGGTATGATAAGTAAGGTTAAATAATTCATAATATTATACTAGTAATTTATTGTACATACAAACACCCATCGCATGCCAAGCAAGAAAAAGCAAGCTTCACGAACTTTTCTTTGTCATGAATAAGAGTGTGGAGTGCTTTTATTTATTTCTTTCTAACTTCGATGTATACCACATCTACTAATTGAAGCAGTACTGATATAATTTAAAATTAATGTTTTTAATTTTCGCACTTACACTCATTACATACCAAATCTATTTACAGATGTGGTACAGCCATAATTATATAGTGTGATATGCTATTGATTTCATACTATACTCTTGATCAACATAGCAATACCATCTCAAAGTAGAACTTAAGGAGTAACCCTATGAGATTAAAGTATAAAAAATTGTTGTTATTATTCACGATGGGTATCTTTGGAATTGGAATGGTCACTATCTCCTTTCAAGTTTCGCCGGATGCATTACAAGCTGCCTTTATAAAAGAGGGAGACCGAGATTCTTTAGAACCTACTAGTTCTCCCGATTCACAAGCAATCACTCCTACAGCATTTGCAACAAATACACCAATACCAGAGAATCCGAATGCACTTAAGAAAGATGCTTATCCGGAAATCAATGAATTAATAGAAACATATTATAAGGCAAAAGTTAGTACAGATGCCAATTCCATTGAAATTTTAAAAAACTGTGTTACAGATGCAAGTCTTCTTGATTTTGAACGTATTTCGAAAAAAGTTGAATATATTACAGATTATAAGAACTTCCATTGTTATAGCAAACCCGGAACTGGTGAAATTGATTACATTGTCTATGTATGTTATGATATTATGATCACCAAGATACAAACCGGAGCCCCAAGTGCTGATGTATTCTATATTACCTATCAGGATGGAAAGCCACATATCTTTTTGGGTAATGTATCTCAGAAAACTAAAAATTATATTGATAAAACAAATTTAGACGAAGACGTTCAAAAACTAAGTAAAGAAGTTGACGATAACTTAGCCAAAGCAATTGAAATGGATGAAGATTTAAGGGAATTTTACGAAAATCTAACTGCACAAACTTCAAATGTTGAACCAAGTGAAACACCAGCTCCATAAATTATTTATATTCCCTTAAACATAAATGATTTTAAGCGTTCGTGTTGAAGTCCAAGCGGATATTCGCAATTAGCTTACGCTACCAGCTCTACCTCATTGCCACTACGTGGCATTTTAAAACATGCAAGTCCTTACCCCCGCTTATAGAAGCGGAGAACTTACCTGATGAGGTTAAATTATAATATAAGCGGGAACAAATCTTAAATTCCTTATGAAGATACAAAGCGTATCTTAGGAATATATATTTGTTCCCGCTTTTCTAGACTGAATTACATTAAGAACTAATAAACAAATTATCAAGTACCCAAATCTAACAATCCATCTCTTATCGATTCATAGCTGTATACGGTATCCGAAAGAATCTCTTCCATCGCAACCTGTGTCTGATTAATTTCTTTTACCATTTCCTCCAAGTGTTCTCTATCCCTATCTTCTCCGACTGGTACTAAAATTAGCTCATGGATACTGGATGGCAAAAGGAAAAAATCTGATCCTAATTGTTTATAAAAGAGTTCCAGTAATCCTTGGTATAACAAGCAACTAGCACCATTAATTCCACAACTATTCGTTAAAACATACATAGATGAATTGAAATTGGTTGGTAAGGTTTCTACTTCTTTTTTTGGTGTCCATGACTCATGCTTAATTGCATCATAAACAATGTCTTCCATAGGTCGTATGAGAGCTGGAAAGTTCTTTTTAGTATTTTCCATAGCCACATTTCGTAACTCTTCTACGGACACGCCCCAGTTCTTTCGATGTTCTTCTGTAATTCGAACACTTCCAATACCCTCTTCATCCCTTCTGATTAAGCAATGAAAGGTAACTGCAAATTCTAAAACGCGGTAATGCGGGATGTTTTGTAATAATAACCGATTTTTTTCATAATTTACTAACCGATAGATTATCCTTGATTTTACTTCCTCAAACTCCATTCGAGCTGGTATCGCAATCCTTCCCGTCTCCTCTCTCGCTCCATCATAAGCTTTTATTATCTCATCTACTATGTCAGTTACAGACTGACCCTTTTGATAATTCTCGTAATAAGGATTTAAATAAACACTTGGCATTATCTTATCCGTATTACAACGTATCATAAGACCATCTAGTAACAAACCGTTGTTTTTTCGAATCTGATTTAAATCAGTTTCATATCCTTCTCCTAGACGTTTTTTCACTTCCTCCTTCATATAGTTTAGGAAGTTTTCATAGGACATTAAATTAGACACCTAGAACATCTCCTTTCCTTGATTTCCATGAGTTATCTCCACTTTTGAACTTCACATAAATTTATCAAATTAAGAATTCGAAATTTAAACCTTAGATAATAAAAATAAATATAAACTTTCAAAAAAATTTCAATGAATTTCCAAGAAAGCCATCCTCAAAAAACTTTCATAATAAGTTTTCAGATGAACTTTCAATAAGCTTTCAAACGAACTTTCAAAATGAACTTCATATTTCTATATAAAGATTTAAATTTTTTGAATAAAGAAAAGCAACCATCGCAAGCTTTTCCTTGCTATGTAACATTAAGTGGCAAAACCACAAAATGCATTTTGCTGAATTCTTTAGTACACAGACTGAGGTTCGAATAGAAATTGGGGTAATTTACGATAATGTAATTTTAGACTTTAATAGCAAAATATTTTGATTCATTCGTAATGATTATTTTGCTTTAAAGAAATGAAATTTAAATAAGATTCTACTTTGATTTACTACGAAAGCCAAAAGTGGGTGTATATCCATCATGGTAGCTTAATCGTACCAAAAGTAGATAAAAAAAACAATCGCAAGTTTCCATTGCGATTGTAGAAATTTTTGATTAAAATTCAACAAATTAAATAAATGAACCTCTGGTACTTCTGTTATACTTCGCATCACTCATACTAATTGCTACCTGACCTGGTAGGTAACAGGCCCACATTCCTACGCCCGCTATTTGTACTAAGATATCCAAAACCTTTGATTTATTGGAAAGATACATAGGCAGATTATATAACCCTACTTGTATGTCACATAAGAAAAAGAGTACCATTCCAAGGCAAAATAAACCATATCGAATCTGTCCATTCATATAATGCTTCTTTGGATATAGAAGGATTAAAAAGATGAGGTTTCCGACAAAATTCACGATGTAGAATACACTTGCGAATAAGAGTAGATCTAAAACAAAACCATTAAGTTGCAAGACAACGATAACTACAGCACTTACAAATACTCTTGTAGCAAGGTGAAGAAAGAAATTTAAGAAGCCTTTCTCATGTCTACTTCTATGAACACGATTTTCCATACGAAGTGAATACCGTGATAAGCTGAACATACGATAAAAATAAATTAGCTGAACGATACAAAATGCAATAACTCCATATTGAAATTTTGTTGTAAATAATAAAAAAACATCCGCTATTACTGTAAATACAAAGGCAATATTTAATAAGATTCGATCTGCTATTGCTTGATAACATAGTGTTCTTATCACTGTCAAAACAAAGCATAACACAATCGATGTAAACTTTATAATCGAGGACACAATCACTGGACCTCCAAAGATGTCCAGTGCTAAGAATGCTAGATATAAAATTCCAAGGCTACAAACAATGATCTTACTTAACCTATTCCCCAAACGATAATTCATATGCTTCCTCCAAATCTAAGGAAAGACATTCTGCCTGTTTCCTACGAATTTTCGCTATTTAATTTCTTCAGTATCTCAAATGCTATTTCTTTAGGTACAGGCTTACCGTAGTAATATCCTTGCGCTTTTCCACAGTTGATTTCTTTTAAGAATTTTTCTTGTTCCGATAATTCCACACCTTCTGCAATCACTTCAATATTTAAATGTTTTGCTAGATCAATCATCGTATGAACAATCTTTTGATCAGAATTATTCTCTAATATCGTATCTAAAAAGCTTTTGTCAATTTTTAAATTATTCACTGGTAATCGCTTTAGGTAATTTAAAGAAGAGTATCCTGTCCCGAAATCATCTAATGCGAAAGTAATACCGATTTCCTTAAGTTTTGATATCGTAGAAATTGAAAATTCCAGATCATCTAACGCAACGGTCTCTGTAATTTCAAATTCCAAACCTCTTGCATCTGCTTTCGTCTCCTCCAGTATTTCATATACCATAGGGAGAAAATCTTTGTCCTTAAATTGTCTCGCTGACAAATTCACAGCAATCACGATATCACGATAACCACTCTCTTGGAGCGAACGCAGCTGCATACAAGACTCATAAAGCATCTTCTTTCCAATTGCAACTATCATTCCATTATCTTCAGCAAGTGGAATAAAATCAATTGGTGCAATAATTCCCTTCTCTGGATGTTGCCAACGGGCTAATGCTTCGAATCCAACCACACGGTCATTTACCAGATTAATCTGCGGCTGATAATATACAACAAATTGATTCTCTTCGATTGCCTTACGAAGTTCTGATTGCATCTCTAACTTCTGCATCATCTTAAGATTAATTGAATCATCATAATAACAGAAATTATTTTTTCCACGTTCTTTTGCTTCGTACATAGCAGAATTCATATTCTTTACAAGGCTTTGTGTTGTCTTACCATCCTTTGGCGCAATCGCAACTCCGATACTGACCGTAATAAAAAATTCTCTCGTTGCAACTCGAAATGGTGTTGCAAAAATTTGCTGTACTTCCAAAATTCTTTCTTCTAACTCTGATAAATCTGTTAGATTCTGAATTAATATTGCAAACTCATCTCCACCGATTCTTGATAAATAATCATTTTCTTTGATGAGTTCTTCTAAGCGATGCATCACATCAATTAACATCTCATCGCCATAGGAGTGGCCTAACGTATCATTGATGTCTTTAAAATTATCAATATCGATATCTAAGATTCCAACGATTTCATTCTGACGTATTGTCGCCATGATACCATCCAATAGTTCAACAAATGCAAGCCGATTTGGCAGCGATGTCAAATAATCGGAATACGCCATACGTTTCACGATTTCTTTACTCTTTTTCAGTTCTTCATATTTACTATGTAATGCATCTTTGGTTGCAGTAACCTCTGATAATGCCTTTTCTAATTCTTTATTGCTCTCTTTTAGTTTCTGATTTACATTAATAGCAGCTTTCTCTGCCTTGGTTCGCTTCCTTATATTCACAAACAATAAAAGAAAAAAAATCAAAAACACCACAATTGTCACAAGTAAAATATATAATAAATTCCGCCCAGACCCAATGTCTATCTGATTAACGGTCGTATCTGCAGTTGTATTCATTAAAATAAGTTCGGTATCTGTAGAAGTTGTATTCATCGCTGATAATACAATTTGAAGCAATCTGCTTTTTCCCATTCTTATTCTTATGCTCTCTTGGAGTTATATTTCATACAAGAAATCACGCTTATGACTCCTTGTTTTTCTACGGAGCATTCCTTTCTACACAATATGTCTTCTATTGTGTTACAATCTGTCTTCTAATACGTAACCTCAAAATCCGTACCTTTATAAAATTACCTTATCAGATATATACTAACATAATTACCAAGTTCTTTCAATTATAAAATTACAGATTTTCGCAAAATATTACGTTATAAGTATTAGAAAAGCGAATATTATGGCTCATTTTGTTCATAATAAAGGAAATATTCAATTACCTCGACATATTTATCGATTCTAATAAATTTTCTATTTAATTTAAATTCTTGTGTCTATCAGAAAATAGTGATAAGTCTATACCTTCTGAAACTCAAAATAGTATGTTTTGTGATCTTTTTCTTGTCATGAATGAAAAAACTGGGGAAACCAAGTCTATCGCATGAAGCATAGCTTCACTTTAGAATTTTTGGTTTCCCCAGTTTTTTAATACACGATAAATTCGCAATCCAATTCACTTATCATGAACAAGAAAATACACACTTCGAGAACTTTTCCTTGTTATAAATATTTATTATCACATGATAAAATATATCGCTACTTATAATTTATGATAAAAATTCATCAATTCCTTTCGCTGCCGCCTTACCTGCTCCCATTGCTAGGATAACAGTTGCAGCTCCTGTTACAGCATCACCACCGGCATATACACCTTCTTTTGTGGTCTTGCCGTCAGTTTCTTCTGCGATAATACATTTATGAGAATTAATCTTAAGACCTTCGGTTGTAGAAGAAATGAGTGGATTAGGACTTGTTCCTAAAGACATAATAACAGTATCAAGCTCTAATACGAATTCAGAATCTGGTACCTCTACTGGTCTTCTTCTTCCGGATGCATCTGGCTCGCCAAGTTCCATCTTTACACAGCGCATTCCTTTCACCCATCCAGTCTCATCTACAAGAATTTCTGTTGGATTTGTTAATAAATCGAATATGATTCCTTCTTCTTTTGCATGATGTACTTCTTCTACTCTTGCTGGAAGTTCTTCTTCACTACGACGATAAACAATGTGAACCTCTGCCCCAAGACGTAATGCAGTTCTTGCAGCATCCATTGCAACATTACCACCGCCAACAACTGCTACCTTTTTACTGGAGACAATTGGTGTATCATAGGATTCATCAAACGCCTTCATCAAATTGCTTCTTGTTAAGTATTCATTTGCTGAGAATACACCATTTGCATTCTCACCAGGAATACCCATGAATTTTGGCAGACCAGCACCAGAACCAATGAATACAGCATCAAAGCTTTCCTCATTAAATAATTCATCAATCGTGGTTGATTTTCCAATTACTACGTTTGTTTCGATTTTAACACCAAGAGCTTTAACATTCTCAATTTCTGTCGCTACTACAGTTGATTTTGGCAAACGAAACTCTGGAATACCATATACCAAAACACCGCCTGGCTCATGAAGTGCTTCAAAAATAGTTACGTCATAGCCAAGTTTTGCAAGATCACCAGCACAGGTTAAACCAGCAGGGCCAGAACCAATTACCGCTACTTTCTTACCCTTCTTCTCCTTTGGAGGTTCCGGTTTAATTCCATTTTCTCTTGACCAATCTGCAACGAAACGTTCTAATTTACCAATGGATACTGGGTCTCCCTTAATGCCACGAATACATAAAGCTTCACACTGGGATTCCTGAGGACAAACACGGCCACAAACTGCAGGTAATGCAGAATATGTGCTGATTACTCGATAAGCTTCTTCAATGTTACCAGCTTCCACCTCTTTAATAAAGCCAGGAATGTCAATGCTAACTGGACATCCTAAAACACACTTTGGGTTCTTACATTTTAAACATCTCGTTGCCTCAAGCATAGCCTCTTCTTTATTATATCCAAGACAAACCTCCTCAAAGTTTTTCGCTCTGACACTAGGTTCTTGCTCTCTTACCGGTACTCTTTTTAATACGTCCATGTGATTTTCCCCCTATTCCTCGCTGCCGCAGTATCCGCATCCGCCATGATGAGTATCGCCTTCTTTTTCACGAAGCTCTTTTCTTCCTTCCTTGGTCTTATACATCTGCGATCTCTTCATTGCTTCGTCAAAGTCAACCAAATGACCATCAAACTCAGGGCCGTCAACACAAGCAAACTTTACCTGACCACCAACCGTAATTCGGCATGCGCCACACATACCAGTACCATCTACCATAATAGGATTCATACTGACAACAGTCTTAATTCCTAGTTCTTTCGTTAATAGGCAAACGAATTTCATCATGATGATTGGGCCGATTGCAATGACAAGATCGTATGATTTTCCTTGATTTTGTACTAACTCTTTGATCATATCATTTACATTTCCCTTAAACCCATAGGAGCCATCATCTGTTGCTACATATACATTTGCTGCAACTTCCTTCATCTGATCTTCTAAGATGATGAAATCATTGCTACGTGCACCAATAATGCAATCTACATTACATCCAATACTCTTCATCCACTTTACCTGCGGATAAACCGGAGCAGTACCTACACCACCGGCAACAAAGAGAATCTTCTTTTCTTTTAACTCCTCAACTGGTTCGTCAATTAATTCAGACTTACAACCAAGTGGGCCTGTAAAATCTCTAAAGTATTCACCAACCTCGTACTTAGCCATCATCTGAGTTGATGGACCTACCGCCTGAAATACGATGGAAACAGTTCCTTCTTCCCTATCATAGTCGCAAATGGTTAGTGGAATTCTTTCTCCTTTTTCATCCATCTTAACAATCACAAATTGACCTGGATAGCAGCTCTTAGCAACACGTGGAGCTTCTACATCCATTAAATAGATGTTGGAAGCTAGAAGCACTTTTTTAGTAATCTTGTACATACCATCTCTCCTAACAATTTTTTCATCCGTTTTGCCTGGATTTTATGTAGTTATTCTACATGAATTTCTAACCTTAAGCAAGAAACAGTTAAGAAACACATGTTAAATAAATAACAATTAATTTGATATCATTGGTTATTATAGACCATCGGGTTAAAAAAATCCAGTATTAATTACAAGTTTTATATAATAGCATATTTGTGCTATTCTGTCCATAGCACTGTAAACTTACATTCAAGCAATTCCTTGTTTTAAATCACTTCACCAGAATTTAAGATGTATATCTATACACTTAAAGTAATATATATTACTAAAATTTACCGATTATTGTTAATAAGTTAACATTCAAATTTACATGTCTGACTATTTCTATGCACACATTTTCTATTCATTACATAAATTAATGTGACAATAAAAAGAACTTGCTATCGGGAGGTTACATACCATGTGCGGAATTGCTGGTTTCTATCATCCTAGACAAAACTACTTAGAGAAAGAATCCTACTATAAAACAATTTTAAATTGCATGACCAAACGTCTTTATCATCGCGGTCCGGACGAACAGGGAATCTATCTGAATGAACACATAGGCTTGGCTCATGCGCGTCTATCCATCATTGACTTGGTTTCAGGCGGACAGCCAATGCTACGCTCTTTTGGAGAAAGGACGTACGTCATCGTATACAATGGCGAAATATATAATGCAGAGGAATTAAAAAAAGAGTTGCAACAACAAGGCATGAGTTTTCAAACGACTTGTGATACTGAAGTCATACTACTTGGTTTTATTGCTAATGGACCTGATTTCGTAAAAAAACTGAATGGTATCTTTGCTTATGCAATCATTGATGTTGCAAAGAATTCCCTCTATCTCTTCCGCGATCAAGCTGGGGTAAAACCTTTATTCTATACTTTATATGAGGATACCTTAATCTTTTCCTCTGAGATAAAAGGATTGTTTGAGTACCCAGGATTTACACCGAAAGTTACGAGCGAAGGGTTAAATGAAATATTCTCAATCGGTCCTGCCAAAACCCCTGGATGCGGTGTTTTTGATAAGGTCAAGGAAGTTTTACCAGGGGAAATGGTTTGCTATAACCAAAGTGGTTTTACGAAAGAACTCTATTGGAAACTGGTGAGCAAGCCACACGAGGATTCTTATGAAGAAACCATGGAACGAACTGGGTTCCTAGTTACGGATGCGATACGCCGCCAGATGGTCAGTGATGTTCCAATCTGTACCTTTCTCTCAGGTGGGATAGACTCTAGTATTGTAACTGCAGTTTGTGCAAATGAATTAAAGAAAAAGAATAAGCAACTTGACACCTTTTCCTTTGACTTCGTGAACAATGAAGAATTTTTTAAAGCTAATAAATTTCAGCCATCAAGAGATCTGCCTTATGCACTGAAAATGGCGGAGCATTTTAACACCAATCATCATCTTCTAGAATGTGATAATGTTATGTTAGCGAAGCGACTACATGATTCCGTACTTGCAAGAGATTTGCCAGCTATGGCGGATATCGATTCCTCTATCCTACATTTTTGTTCTCTCGTCAAACAATATGATAAAGTAGCTCTGACAGGGGAATGCGCAGATGAAATCTTTGGAGGCTATCCTTGGTTTCATAGTGAGGAGGCCAAAAAATCTCATTCCTTTCCTTGGTCAAGAGACCTAAGTGCAAGAAAGCAGTTATTAAAGGATGAATTTCTTCAGTGTCTTCATATGGACGAATATGTTCAGAATGCGTATGAAACAACAGTGAGTGAAACACCTTATCTCGCAGAAGATTCCGAGGACGAAAGAAGACTTAGAGAAATCTCTTATCTGAATCTTAAGTGGTTTATGCAAACCTTATTAGATCGTATGGATCGAACCAGTATGTATAGTGGATTGGAAGCTAGAGTACCTTTTGCAGACATAAGAATCATTGAGTATCTATGGAATGTTCCATTTTCTATGAAAGCTCCGGATGGCATCGTAAAAGGGTTATTAAGAATGAGTTGTAGCGGTTTGTTACCCGATGATATTCTCTGGCGAAAGAAATCTCCTTATCCAAAAACCTATGATCCAGGATATGAATCCTTAGTTGCTACACAATTATTAGAAGTAATGAATGATAGTTCTTCACCAATCGTTTCATTTATTGATAAGAAAAAACTGGATTCCTTCCTTCATACTCCATCTGATTATGGGAAACCATGGTATGGCCAACTTATGGCTGGTCCGCAAACACTAGCATACCTCTTAATGATAAATGATTGGTTAGAAACTTATCATATTGAAACAGTAATATAATACCTATAAAAATTAAAAAGATATCAAAGCTAAATTATATTAAAGCTAAGATTATATTAAAACCAAATAAAACAAGATAATGCTTTGCAAACTTTTCCTTAGAATAAAAGAAAGAGGGAATTTACAAGCCAACACATTTCCTATTGTGGTGGTTCTCATATATTCCCTCACCTTTTAATATTTTCGAAATTAACTATTAAAAAGCCGATAGTTTATTCTTCATTTATAGAGCTTTTGAAGATTTAATTTCAAAACTTAGAACTCAACATGGAAGCTAACATAATTTTCTCCAAGAATTATTTCATTCTCCACATAGATATCCCCGCCCCAATGCTTCATAATGCGATTTGCATTATATAAACCATATCCTTGACTCTCCTTCGTATTTTTTGTAGAGTAACCACGCTCAAAAAATCTAGATAGTTCATTGATTGGAATCTTTTCATGGCGGTTCTTAATGATAAAAATTAAGCGGTCATCCTCGGAGGTTAGATAGATTTTTACGTTTCCTTCTTCTTTACCACAAGCTTCAAAAGCATTATCAATCAAAGTTCCAAGTACTTCTACTGCATCCATCTCCGGCACATTCGTAAAGATATCTTTGCTCCCCACAATCAATTCAACTTGAACATTCGGGGGTGAAGCTATAATTTTACTATACAAAAACCCCGCTAACACTTTGTTACTGATTTTTAGCAACCCTTGATAATTATGTTCTGGTAAGGATATAAGCTCCCTAATATACTCTGCCTGTGATTTGACAAGCTCCTCATAATTATCTATGGTTAAATGCATACTTAGGAGGGCATTTAAGTGGTTGTCAAACTCATGCTGCCTTGCTCGTATTTCTTTAATAAATTCCTCCAAAGGCTTTACATATAATTTATACAACCGCAGTTCATCTTCCTGTTTTAGATAAATCATTAACTTTTGCTTCCATAATAGATATATAGCTAAAAGAACAATCAATACGATCAATAGAATGGCCAGTGATTTAAATATTGTAATATTATTCATTCATCCATCTCCTCATATTACTTTTATAGGTAATCCCTATCTCTATTGTTTTATGAATGCCCTTTAGCTTTATCAATTGATTTACAGTATCCACGTAGTCAATATAATCGGTGTTTACAACAAACATACGATGACATTGAAGAAACTGCTCACTTGGAAGCTTCTCTAACAGTTGTCTTATCGTTAAATACTTAATATCTAATATCTCCTCTTTTAAGTAAAGACTCACTCCTCTTGGAATCGCCTCAATACAGATGATATCTTCAATGTGAATTCGGTAATTAACTCCAGCCTTTTTGACGGTTAATTGCTCTCTTTTCATCGGGGTTTTTTCCATCGCTAATTTTGATAAAAGTTCGAGTACCTGATTCTTTTCATATGGTTTAACCAAGTATGAATAACATTGAGTCTCACGGTAAGATATTAATTCTAGCTCTGCTATTGATGTGATAAAAACAATCGGTGTAAACGCATAGAATGGTATCTCTCTTATTTTTTTAGCAAATAAAATTCCTTGTTTATTCTCTTTCTCAAATGCATCTAGGTTAATATCCAAAAGAAATAATGCTATTTTAACTTCACTTTTTAGTATTCTAATTGCTTCATCGTAAGTGTATGCAAAAAGAGCATGTAAGTCCAGCTTACTCTCTTTTATGATTGCTCCCAACGCCTCTGCATTTTGCTTACAATCCTCTAAGATTAGTACACTGGTCATAACATGCTCCTTTGATACAATTTCTTAATTAGAAATTATTTTACAGTTCGTCTTTCGCACCAACCTCAGTAAGGCTCTTAACAAGTCCTGCCATTCCTTGAATTTCGGATGGAATAATAATCTTTGTAGCCTTACCATCTGCAGCTTTTGCAAATGCTTCTAAACTCTTTAATTGAATAACACCCTTCCCTGGATTAGCTTCATTTAACATACGAATACCATCTGCATTTGCTTTTTGAATTGCTACGATAGCTTCTGCTTGACCTTCTGCTTCGCGAATAGTAGCTTCTTTCTTAGCTTCCGCGCGTAAAATCTGAGATTCTTTATCAGCTTCAGCCTCTAGGATTACAGATTCTTTCTTTCCTTCCGCTACAAGAATTGCGGACTTTTTCTGACCTTCTGCAATTAAAATAGACTCACGACGTTCACGCTCTGCCTTCATCTGTTTCTCCATTGCATCCTGAATTGCAGCTGGAGGAATGATATTCTTTAACTCAACACGAGTTACTTTTATCCCCCATGGGTCAGTAGCCGCATCAAGAGATACCCTCATCTTCGTATTAATAATTTCACGTGAGGTTAATGTTTCATCCAATTCCAAGTCACCGATAATATTACGAAGTGTTGTTGCTGTTAAGTTTTCAATTGCCATCATCGGATTTTCAACACCGTATGCAAATAACTTTGGATCAGTAATTTGGAAAAATACTACGGTATCGATTCTCATTGTTACGTTATCTTTTGTAATAACTGGCTGCGGTGCAAAATCGGCAACCTGCTCTTTTAAAACAACTTTTCTTGCGATTTTATCAATTAGTGGAACTTTTAAGTGTACACCAACACTCCATGTTCCTTGATATCCACCAAGACGTTCTACTACATAGGCATACGCTTGAGGTACGATCTTTACGCAAGATGCTAATACTAATAAAATAATGATTCCTAGCCCTATTAAAAGGTAAATCTGTGCATTACCCATATTAATTCTCCTCCCTATATTCTGTTACAATTAGTTTCACACCAACAATATTAACAATTCTAACCTTCGTTCCTGGTTCATAGATAACACCATCTGTTTCAGAACGTACGGTCCATTCTTGGCCATTGACTAATGCTTCCCCTGTTTGATTGTCATTGTCAACCTGTTGTGTTATTTTAACAACTTTTCCAATGAGTCCCTCATAATTCGTTTTCACTCGGCTTTTATTCAGATGTTTGACTGCAACTGGTCTTGTAAAAAATAAAAGCAGTAGAGATACCAGTATAAATAGAATCATCTGAATCCAGAAGTCCACTCCGAGCAATGATGCAATGAAACCTATAAGAGCACCACCAGCAAACCAAACGGTAGTCAGTCCAAGGGTCGCAATTTCTATGGCAAGCAATAAGGCTAAAACGATAAGCCAACATAATGATACCATAAACTTCACCCCTATATGCAATTGCATATTCCTTTCCTCTTTGATATATACAGAATATACTAAGCAAAGAAAGGAATCAATAGATTTGCGTGAATTAGGTTATTTTCAGCTTGAATGATAAAAGTTGAATGATTTAATAATAAACAATGACTTTTTTTACTTACTGGTTAATAAACATTAGCTTAATTTTTCAAAATGGTTATCTTCAATACTATTTTTTAGATTAATTTCACGCATCTTCTCTTTATCTAAACTATAATGTTTCATTGAAAGTAAAGAAAGTAACCATCCCCCTATTGGTATGATACAATAGCATAAAATCGCCACCATTTTTAACGAGTTAGTTAGCTTATCTTCTATTTGTGGTAAGGCATTTCGAAAGCCAATCATAGCAACCACAAGTCCTACAAAAGCAGTCCCAAAAGCAGAAACAACTTGGTCAATTAACGAAAATAGTGCTCCCATGATACCAGGAATATAATTCCCGCTACGATACACCTCATAATCAGAGCAATCTGCTATCATTGGTACGACAATATTATTGCTTACTGTCTTACAACCATTTAATAAGATAAAGATTCCAAAAAATAAGAGTGAAATCATATTCCATTTTGTTAAACCAATTTGAGTTAAATTACCAAATAATAATAAGAATGTCATAATGATTTGAAAACAAATCGCTAACCATGTAAAAGATTCAAAGGCCTTCTTCTGTCCTAAACGCTGAGCCACCATAACACCAATTGACACAACGAGTAAATTTGGAATGGCGGTTACTAAACCAATCTGACCTGACAATTCATAGTTTTTCATCATGATTCCAAAAATTATAACACCAACTGTTATATTACTGTAAACCATAGAGGTGAATTTATTCACTGAAGCAGCCGTTACTAACATTCTGATTGGCTTATTATGTTTCATGATGCTAACATAATCTTTCATATTAATCTTTGTGACTTGATCAGAACTTCCATAGTATTCTTTTCTATCTTTACTCCATATTCCAATTACTGCACAAATCGTACAAATTCCACCGATTAGTACAATCCAGAGCGTTAATTCTTGATATAACTTAGGATTTTTAAACCCGCCATACTTCTTCGCAAGAAATGTTTGCGCATAGAGGGCCGTTCCGCCGTAGCTTGCAGTAATAAAAAGAGAGTCAAAATAGGTGGATAAAGGTCTCATCTTCGGATTATTCGTCATTACTGTCTGGCCTGCTTTTGCTACCAGCATCTGAAATGTGTAACCAAAAACAAAGATAATATAAATTAAGACAAAGATCGGTAACCTGATAGCTTTTGGTATTGTGTAAGAACAATAGTATAATAACAAAGAGCTAATCGCCATTAATGCATTCCCAAGCACCATAAAAGGTCTAAATTTTCCGTATTTTCCTTCCGTTCTGTCTACGACATAACCGATAAATGGATCAATCAGTCCGTCAAAGACTCTAAGTGCTGTTAATACAACTGATGTTAATACGACCGAAAATCCTATAATACCATTCACAAAATATGCGGTATATTCCATTAAAGCCAAATATAATGTGGAAGCTGCTGTATTAAGAGAAAACAGTGCAATCAGAAAAGCACTTGCATTATTATAACGTGGATCTAATTTTTTTGTACTAAATATCATATAGCCCTCACAACTAGAATTTATTTAAATGACCTATCATCGAAAGCCAATGGAATTGTAATCACATTATTTCAACCAATGGTAACATACTTTGCAATAACATGTCAAATTCTTCCTCACTTTCTATACTATCCAATATTTCTTTGGAACATACATCTCATAATTTCTTAATTTACTCATATATTTTTTTGAAGAGGATTTGGAGTGAATGCAATGCTATTCCATACAAGATCCATACAAGATACCTTAAAAGCTCTCAAAGTTAATGCTTCAACTGGGCTTAGCACAAAAGAGGCACAAAAACGGCAACAGGAATACGGCAAAAATCAACTGGAGGCAAAAAAGGGAAAAAGCATTCTCTCCCGCTTCCTTTCGCAATTTAAGGATTTTATGATTATAGTATTAATCGCTGCGGCTGTCGTATCCTTTTTTATCTCCCTGCTAAAGGGTCATGCAGATTACATAGACCCCATCATAATTTTCGCCATAATATTTTTAAATGCAATCTTGGGAGTTATACAAGAAGAAAAGGCTGAAAAATCACTCGAAGCACTTAAGAAAATGTCAGCACCAACCGCAGAAGTATTACGTGATAGTAAACGAATCACACTTCCCTCCACGGAACTTGTACCGGGAGACATTATTTACTTAGAAACAGGACACTACATACCAGCAGATGCACGCCTCATTACTAGCATTAATCTTCGCGTTGATGAGTCTGCGCTTACTGGAGAGTCACATCCAGTTGAAAAAGATGCCAATGTAATTCTAAAAGAAAACACGATGCTAGGAGATAGAAAGAACCTAGTGCCTGCAACCGGTGTAATCACCTTTGGTCGTGGGATTGCGGTTGTTACTGCAATTGGTATGGGTACTGAAGTTGGTACAATTGCGCGAATGATAATGGAAGATGAGACTCCAGAAACACCACTCCAAAAACGTCTTGAAAAGACTGGGAAAGCTCTAGGTATTGCAGCTTTAGGTATTTGTATCGCTATATTTTTACTTGGTACCTTACAAGGACGTGAATTATTTGATATGTTTATGACTAGCGTAAGTTTAGCAGTCGCCGCAATTCCAGAAGGGCTACCAAGAGTAGTATAGTATAAAGTGCTTGGGTAATGGGCACTTTTTTATTGCCTAATCATATCTAAAATGTTACTATATGACCATAATATGGAACCGGAGGTGGCTATGAAATACGGTATTAGAAAACCAAGTTTAAAAAAGAGTTTTAAAGCTCGCACAACTGGAAAAGCAAAACGAAAGATAAAGAAAGCTCTTATACCTGGATACGGTAAGAAGGGTATGGGATGGTTAAAGAATCCAAAGAAAGCAGCATATAATAAAGTTTATAATAAAACAAGTGTTAGTCTGTCTTCTATATTAAAAAACTTGTTCAAATAAAATAAGGCCCCTCAGCATAACCGAGGGGCTATTTTTTATATTCCGAGCAAACCTAGTATTTCATGTGCTCTTTTATCTGTTGCAGGTCCATAGATACCGTCTGGTGTAACTCCGATTAGTTGCTGCAGCTTCTTAACATCATCCCCACGCATGTATGGAGTGGTAAGCTTAAGCTGACGTTTTTCTTGCTCTGTATAAGTAATCCATGGGAGCTTACCATGCTTACACCACTTGCCTTGATAGCTTAATTTTGTAACGCCAACCTTATTAAGACTAGGTGCACACTCGATACACTCACCATTTCCAAGGTAAAGACCAACGTGTCCATCAAACCAGATCAATTCCATTGGCTTAATATTAGACATATCCGTACTTACATCTTCGCATAAATTAATAAGACCGTTCGCATTAGTATCTGGGACAGTATTAGAGTTGTAAACACCTTGCTTTCCATTAAACAAACCCCATAAAATCGCTTTAATTAGGTTGGAACAGTCGAAGGCATAGTAACCTTTGCCTACTAACTTTTTAAGCTCATTAATACGTGATGGTGTGTACCATGAAGGTAGTTGCTTAGCTTTCTGGTCGATAAATGCATCTGTAACTAGCTGTCCTGTTCCGCCGAGCGCATAACAGGTCTTGTGGTTATTTACGATGTCTAATACTTTCGCTTGCAGTTCCTGATCAGTCATAAGTTTTTTATATTCAGCCATTACTATTCATCCTCCTTCTTATCATGGTTTCCTACTTGAATTAATGCATCTTTTAATTTTTCTGGTACCGGCAAACCGATAACCGAAATATTTTCTAATATACTTACACCTTCATTATAGATATAGAAAGAAATAACAGCAAGTCTTATGGCGCTTCCGGTCTTAATTACATAAATATCTAATATATTGCTAATACCAACTAAACAGAAAATAATAACTTTCTTGAATATTCCCCTAAATCCTACCTTGCTTGATAGCTTTTTCTCAATAATCGCTACCATTACCCCGGTAAGATAATCCACGACAACAAACCAAATGAGTGCTGTCATAAAACCGTCTACACCACCAAGAAACCATCCTAAAAACGCTCCAAGCAAAGATATAATATATTGTAAGCCTGTAATAAACTTCTCCATTCTTCCTCTTTCCGCATACAAAAAAGAGCGCCTAAGCGCTCTATGTATGCAATGATTATTCTAATCCTACTTGATCCATCTTATCTACTAACTCCTGGTACTGCTCTGCTGTAATTCTGTTGTTCATTACATAGAGGTCCATAAGAGAAAGCATTTGTTCTCTTGTTTTCTTCTGATTTGTGATTACTATCATACAGTTATTAAATGTATTCATAGTCTTTCTCCATTCTATGCTAGACCAAGTTCGAGTAATGATAAACGTTCATCTATCTCCAACTCGGATTCTGCCTTGTTCAATAGCATTGCATCAAAACGTTTATTAATCTCACTAATATCTTCTGGTGCGTTCGCCCAATTTGGATTTAAAACAAATTCTCCATCGATATAGAAATACTTATTGGGTATTACATCGATTGGAACATCGTCAACATCGGTTACAGTAAACCAATTATCGATGTAATATATATTACCAACTTTCCATTTTTCTTCACCTTCGAAAGTTCCAAATGTAATTTCTGTACTTATAGCTATAATTAGTTTCCTTGAATCTAATAATAATTTCACTATTCTCCCTCCAATACTGCGCATATGGGTGATTGGTTGCTATTTGCCGGTAAGTTTGTATAGTTTAAATCAGCTCCTGTCGTGGTATAAATACAATTTGAAACTGTGCTATTGTTACCAACTTGATACATCCTACCTGATTCATCATCAACGATATAATCATCAATTCTATGATTTATTCCGTCTGCATAAAGATCTAAATTCCAAGTAATTGCATCTGATGACCTCCAAATAGTGCCAACTCCATTCATTGCCACAAACCTGCCTTGTAGAATATTGAGATACCAAGGAGTACTGATACCCTTTGTAGTTAGCATGGTCCATGTTGTTCCATCTTGTGATCTGTGTATACAATTACCTGTTGAATTTTTACACGCATAATAATATCCGTTAAAAAAGCATATATCAGTAAAATAATTGTCAGAATAGGTACTTGTTACTTTTACATAATTTATTCCATCAGTAGACTTGAATATTGTTTCATCACTTGTTCCCAAGGCAAAAAATACACCATTAATACACCTTAATTTATTTACTATTGTTAGTACGCCAGTTTGTACTAACGAACTCTTTGTAAAGTTTATTCCATCGTAGGAATACAGGACTCCTTGGTTTTCATCAAACAGTACAAGTATATTTCCATTTGTAGCCATTGTCTTATTATTGCTCTTTTGCCCATTACTAGTATTTGCATACACCTGCGTCCAAGTTACACAGTCTTTAGATGAATAGATAGAATAAGAGTTATCATAAATGTAAAACTTACCTTTGAAAAAAGTTCCACCAAATGCCCTTAGATAACTGGTTGGTACAAAATTGGAAACAGTAAAATTTATTCCGTCTTCACTATAAGCAATTTGGGCATATCCATAACCTACTAATACTTTTTTTCCGTTACTCCAAAATTTTATTGCACCTTTATATCCATTTTTTAGCTTTGTTGTACCAATAAACCCTTTTGGTAATTTTGTTAATTGCTTAAATCCTGCCATACAATCCCTCCTTATGTTGTAAAGTAAATCGTATTGGCATCCTTAGTAGATAAAGCATCGTAGGCGGCTTGAGTAAGTACTAGAAAGTGCAAATCATCTACAGTATGTGCGTTACCACCATTAGCAGGTAGTTTTTCAGGAAGCCCTGTAATCATTTTTGCATCATGTGTCTCTGGGTGTGCATAAACTGTATCGGTGAACTTTGCATTTGATGGAACGTCTGATTTTACTGTGTGATTATTAACAGTATCAGCATTTCCACCATTAGCCGGCAAACTTTCTGGTAAGTCTGTGATATCACTGGCCTTATGCTTATGTTCTTCTATTTTATCCATGTTGTCATTGAAGTCTTTTATATTATAAAAGTCCGTTGGTTCCGGCTTTTTGAGATTTAGCTTTTCTGTATATTTTGCCATTATAGATTTTCCTCCTCTCTAATCTGACCATATGTGTAAGCAGACAATTGATTGTGTGTGAAACCGCCCACCATATTATGAGTGTTATATAATAAGGAAATTGATAAGATTAAATTGCAAGGAGTAATCCTGTCCAGGAGTTCTGCAACAGCGCTGAAGTTTTTCTTTGCCGACTGTGCAATCCTTACATTTAAAATATATTTAGCTTTATCCATGACAATTGAATATCCATCTTTACCGCATAAGTCTGCAAGCCGTTCCTCCAAGCTTCGATACGTGTAAGGCAATTGCTCGTTCACTCTGGCCAGAATCCTAAAATTACGTTCCTGAAGTGTGTCTGTATCCAACGGTTTGATAGCAAGCATCGTTTCCCATCGAGAGCAACCGTAATTATCCAAGTAGCCAATAAATTGATTGTTTAATGCAAGGCCAAGCGAAACTGAAAGATTATCTATTTCTGGATTTTCTACAGTTGTTATTGCCTTAAACTCACGCACTTCTTGGAGCACTCCTGGTACATAGCTGATAAGATTAGCTGCCATTTAAAGACCCCCTAACCGGTATTGCGTTTTTATCAAGCTCAATGTTAACAGCACTACCATTCAACATCGTACCAGTGACATCAACTATTCCTGGAATGCCTAGTAGGGCAACATCGATACGGCTAATTCTCACTACCAATCCGGAGCTGCCCTCATCCTGCCAGCTCTTGGCTAAGTCTTTGAAATAGCCATCGATTGCCGAAGCAATATAGCTTTTTACGTCCTCATAGGTATATCCGCTTTGGTAAGTAATTGTGGTTGTAATATTAATAACTGTTTCGTCCGCACCGGTGACATGAACAAAATGCCATAGAGGAGCTAGTCCTAAACCATCACCGCCATTTGTAATCGGATCTATGGTAGACTGAACACTATCTACTAAAGTTGCGCTAGGCTTTGCATGATCGGAGTTAAGTATCACTAGCTTAACATTTCCTCCGCTTTCTTCTCCTGAAACATTAGTTGCACGATATATTTTTACAGCCCCTACACCATCAAGACTCAATACCTTTGCTTTATAATCCGCACGATTACCACCAAATGCCTGAGCATCTAAGCTGTCAAAGTATCGTTTTCTGAATACTTCTGTATCTTCTTCCTCTTCGCCAGGAATGAGTATTTCGGTGATACTTGCACTTGTCAACCCATCGATGTATTCCACTGGTAACAATACCCCTGTTGAGCTGTTAGGCGAACTTCCAGGAGTCTCACAGGTGAGCTTGAATGCTCCTGCAGATATCTTCTCTGTTGCCCTATAACTAAATTTATCCAGAGTAAATCTAGCGCCGATTGGTATATCAATATTAATTACGCCTTTGACAACTGCATAAGTTGCTTCTTTTGGTGTCAGACCTCTCTCTGCAGCTCTGCGAATAAGGAAGTCTCTCGAAGCAGTATCAGCGAAGGATTCATTCATAATTCCTTCGAGCTCTACATACATCTGCGCTAACTCTGCACATACCGGAGCAACTGCAGTATAAATCATACTACCTTCTCGTTTATCAATTCCGGGTGGAATTCTGGCAAGGGCTCGATTTAAAATTACTTCATATGTCATATCCTCATACATTATTAGATATTCACCTCCTTCTCCTGTTCAATATCACCATAAGCGGTATGAACCGTGAACTTGACCGTCACACTTCCTTTGTTAGATGCAAAACTAAATTCATCCACACCAAGAATACGGTCATCTTGCAGCAATGCTTCTTCAATGCGACCTATTAATGTAGGGTATACATACGCAGGGTCTTTTCCGAAAAGGTCTTCTAGCTCAACGCCGTAATCCCAGCTATAGATAAGATGTTCATAGCGCTCCACACTCAAAGCCATATAAACGGATTGCTTAATCGCCTCCAGTCCATCCACAGTGCCTATAACTTTATTTCTAGATAGTGCAAATGTTTTACTTGGCATTTCTGTGATTTCAAGAGTGGTTATATCAAAATCAATTTGTGGTACCATTAGTTATCCACCACCTTATCTAACAAAAGGTACTGTTGTCCACCTTTTTGCTGTAAGAGTACCACCTTATCATCAATTTGGAGAGCATTTTTAAATGTATAGGTAGTTTCGGCGCCCATGAACTCGACTGTCATCTTATGGTCTGTCAAGTGTTCTGGTACAACTAAAAATCCGCTTGTAAGCACTATTTTTTGGTTGATAGATACTGTAAGCGGATTAATAGATGTGACTTTTCCATATAGGAATTGCGTAGGCTTGGTAGCCTCTACAGCGTCACATGCTGCCTTTTTAATTAACTGCACTAAATTAGCTATTGATAACACCTCCTGTTAACTTCAAGTCCATAAAATGAGCGTCTGCTTCATAAGTATGCTTTGCACTCTCGACAAACATAGGATGAAACTTCTTTTCACCATTAGTATCAAGCATCGGTATAATAATGCTAAACCCTGCTCTTACTCTACTGTCACCAAGACAATTCTTTATCTGTAAAGTGCGTTTCGGCGTATTATGCATACTTAGGAGCTTTTCAACTTTATCCTTAGCTATATTTTGGTTTTGAATCTGCTCGCTGTACTGTAGTACACCCCACTTTTCAATCGTACTACTGTCATAAACCTTGTATACTTCACGGGTTCCCGCTTTATTATTTTCGTATACAAGTTTGATGCTGTTATAAGTGCTCTCATCAATACTTCTCGTAAAGACATAATCCTCTGCAGAAGATTCATCGATTAATATATTTAGAATCAAATCACTAGCTTTTCTTAAACATAGCTTTCCATAATCATCATACATAACATAAAGATCTTTTGTTATTTCTACTGTATGTGCAATTGAGTTGTTAAGCATATCAAATAGTTCTGTATCTACTTCTGCTAATTTAAGCTTAATTCCAGTATCATCAACTGTACCAATCTTTAGCTTAAAATCTGTTGCTATACGCTTTAATAGGTCCCCAGTACTCCAAGCAGTATAAACATAAGTATCTTTGTTTTTTAGGTACCGAAGCTGATCAAATGCAATTATCTCTAGCAAATCGTCTTTTGTTTCCCTGATTGTAAAGATATACCCATAAAATACACCGAGTTCATCGTCAACCAAAGCGATTTGACACCCTTCTTCTATCTTCATTGATTCATCCTTAATGACTGACAGCTTAATTGATCCAGGCGTCGATTTACGTTCCGTGGTCCATACTAGCCCACCACTCACTACGGGGAAATACAGTTTGTTTTTTATTATTAAGCTGACCTGGTACATTTAAGCCTCCTAACCAATCTTTTCTGAGATCATAACTGCCTTGTCCTTCGTTACCAGATAATTTTGCCTGAAGTATACCAGTATTACTAAGTGGGGCGGAAGTTGTTGCTTTTTTCATAGTATCACTCTTTCCACCCAATGCAGTAGTAGTGATAGACGACGTACTCGAACTACCTGTTTTCTGAGATGTGGATGTGCTTTTTTGCGTTGTTGCCGGAACATCCTGAATCTTTAAAACCTGTCCAACAAAGATAATATTAGGGTTTTTGATATTGTTTAATTTTGCTAGGTTCCAACACTTTGAACCGTCTCCTAGTAACTTCTTAGCAATGGCCCAAAGAGTATCACCATTCGCAACGGTATAAGTTTTTGGGATTGTCTTGGAAGAGGATCTAGTTTTCTTCTCAACTACACCATCCTTTACATTTTCAAGTATGACGTATTTAACTCCATAATCGATGTATTGCTTGAGTTTGACATCAACAACTAAATCGAACCCGTTTTTAGAATCTTCCGTTATGGAGTAGTCTTCGAGGGTAACATCCATCTCTTGCTTGAACCACTCTCTGCCCTTGATTTCTCTTGATACTAACAGTGTAGTATAAAATTTATTAACTTTCAGTTCCTGTAATCTGTTAAGATACCAGTTTGGCGATTTGTAACTACCAATGACAAAAGGATATTTATTTCCTGGTAGTATCATTTGAAAATTAATATCAGTAAGACCAGGAGATTTTACTAAATTGATCTCCTGGCCATTAAGTAACGTTAACGTTTTATTCTGATTGTTTATTTTTGTGACAATTTTTGAAGGCGTAATTGGCAGAAGCATATCCCATAAATAAACTTTATACATCTATACCTCCTAACCTTTGTTATAGGCTTGGGCTGTATTATTAAGCTCCTCGCCAATCTTTTCAGATAGATAGTCTATCATTCCGTCAAGGTCTTGATTGTTTGTTACGGTATTGTTTACGCCGCCCATCTCAACCTTTATTGATGCTGTGGTAAATCTATTGATAGAATCTCGTTCTGCAGCGTCTCGAAGATATTTCAATTCTTCATTTGTTGCAGTTACAGAATTCTTTATATCACCTGTGTTATCAGCAATTTTACTTACGTTTCCAGGAACATATGAAAAATCACCATACTGATCTGTCATTCCACCAAACATTCCACTTATTTTATCATCAATACCTTCACCGAATTTATATCCTGCGTCCCAAGCGTCACCATATGCAATTCTTGCATCGATAGTTGGTGCATTTCTGTCAAGAGTTATTGCTTTATCATTTTTACCCCAAGCAAGAACACTGTCCTGTAATGAATTAAGTCCTCCTGTCCAGTTTGTACCGAAGATTGCATCTATTATTTTAGTGACCACTTTTCCAAGTGAAAGGAACCATGAAATAATCTGTCCAATCAGGTTAGCAACTGAATCACCGAAGCTATTGAAACCGCCATCGAAGACATTCAGAACCCACTCGATTATTCCAATCCAAGGCTCAACAAAAAGCGTCCATAATAATTGAATAATCCCATTGAGTACGCCTATAACAGTGTTCCATATGAATGCACCCAGCCAGGCAATCGAACCTGCAATAATACCCGTTGCAGAAACAGTAGAACCTGTGACCTTATTAATAGCAGCCACAACCGCATACAGAACAGCAATGATTGCAATAATTGCAATGATTATCCATGTAATAGGCGAAGCAAGAAGTGCAGCATTAAAACCATATTGTGCGGCTGTAGCACTTGCTTTTGCCTTAGCCTCTGCCATCTCTGCCGCTGTAAGTGCTGTATTTGCTGCGGCCGCTGCATATGCTCTTACAGCTGCAATACCCTTAGATATGTTACTTATTACTTCAAGTGCATTTGTAATTGCCATTGCAATTGAATATGCTGTTAATGCAGCAACAATACCATAAATTATTGGTGAAATTATAGGCCAGTTATCAGCTATAAAGCTATACATTGTGCTTGCAACTTCGATTACTCTATTGATTACAAATATAATCTTTTCAATCCCTGACGCAAATCCATGAAGTACTTGTTCTATTTGTGGCATATTTGAACGAATCGTGTCAAACAATGAAAGCACAGCCGGGTATAATTCCGCTCCAATTTCCCTACGAATTCCAACAAAATCATTCTTCATCTGTATAATCACTCCTTGTGGAGTATTTGCCATCTGCACAGCCAGATCTGCCCATGACTGATTGATTACATCTTCAATTACTAAGGCTTTTTGCATGTCAGTGCCTTTTTCGATTATTTTTTGCTGTGCTTCCGACAATTCAAATCCTTTTTTCTTTAATCCATCATATGTCCCATCTAGGGCCTTACCAAGTTGTGTTGCATACTCAACCATCTGCTGGTAGCCAACCTCACCACCACCTGACATACCTGCAGCATAATTTGCCAGTGTACCCATCATGGATTGAATAGCTTTTGTATCCTTAATATAAGTTGATAATTCTGCGGCGCCACCAATCATTGTTTCATCGCCATAAAGCGAATATCCTTGAACGGTGGAAGCTGTATTTTTCAATTTATCAAATGCATCATCCGCAGCACCTATATTACTAAGTACAGTTTGTAATTGCCTTTCTGCATTTGTTGAAACATTAGTTAAATCTAATGACTCATGAACCCAATTAATCGAGCTCTTAGCAGCTGCTATTCCCAAAAAGGCACCAACAGCATTTTTTATTGTGGAACCTAGGTTGCCTGCATAATTAGTTCCGTTCTGGACTTCTCGATTAAATCCTTGCTGTGCTTCTTCATTCCTCTTAATGCTGCCATGAATTTCTTCCATTTGTTTGTTTGCTAAATCAACGGACCTTCTTGCATTATCAATTGCGCTAGTATCAAAACCTCGATTTATTGCACCATCAACATCTTGTAATGAGCCAATAACATTATCAACAGCACTTATGATATTGTTCAATGGGCTTGACATTCTATCAACTATGTTTATTTGGGTCGAAATGCCTGCCATTCACATTCACCCCCAATTCTATTTCTTTTTTATTTTTGCTCTTTCCTTTTTCTCATTCTCAACCCTGATTTTAATAGATGCGATAATGAAGGCTTTTTCATTTTCATCCATAGCCGCATATGTTGAAGGCAATATTTTAAGTTTGTGAAGACAGTAATACGCATAAGTGGCATCAGAGTCTTCGTTTATTAGTTTTTTGCTTCTTCCACCTTGTCATCCATAGTGGTTGTGAATCCATTGAACTGCTGAACGAAATTAGCAAAGTCTCCGTATTCGCCAGGATCATCTACCATTTCTTTTAAAAGTTCTTCAGGAGTCATGACGCCGTAAGAATCTTGAAGCTCCTTGTCATTTAGATTTGGTTCAATAACAGCTGCACAAATCATTTTTGCAATGTACTTAGAGGTGTTTACCTTTGGTCTAAACATGTTAGGCTTACCAGTTACAGGCACATCCATTGTGCAGTCCTCTCTAATTGCATCATTTTCTCTTGTCGTCAATGGCTTAATCGACCAAAGCAAAGGATTTCCTTTTTCATCTGTTAACGACTTGGTCGCAGGGTAAGTAGTATTTTCCTTCTGTACCTTGTTCTGTTTCATAAATCTATTAAAATTTGACATAATATTTTCACCTATCCTTTTATTTTTTTAAAAGAAAAGACCCCTGTTAATTAAATCAGGGGTCAAAGTATTAAAGCATACCACTAAGAATTTTGAAAGCTTCAGGCATATTGAAGTCTTCAAAAGTGAAATCCATATCTTCATCAAGATATTCACCATCGGCATCAAACTTTGCCAATATACCGCCATCGATATTACAGTCAATGAAAACCATTGTCTGCCTTCCGGCTGCACTTGTTTTGTCCTCGTTGGTTACTTGAATTTCAAAATAAACATCTTCGCCTGTGTTCTTGAAGTCAATCATCATCTGTCTGAAAATACTTGTATTGTAATGGAAGGTTGCTGAACCAGTTCCTTTCCATCCAGTAGACTTGTTTCCCGCACCAGTCTTACCAAGAATAGGTATTTCTGTTTTAGTCTTCTCAAACTTTGCTTCGAAATTGATCGCTTGCATAAAATTGTATCTATTTCTTCCGATTGTAACGAAACATTGTGCTAACTTAGCAGAAACGGCATCCTTACCAATCATAGTTACATTATCCATTCAGATATTCCTCCTTTCCTATTGAACCGTAACTGTCATATAGAGCTGGCCCATTGCATTAACAACAGTTACAACATCACTGACAACAACTGATTTTTTGGTTGCCCCCTGTGTTATAACAACATCAGAATCCTGGAAATTCTCAATTGCTCTAATGTTTTGAAGTGCTTTGTGGTGCTTCACAACATCTGACCAAAGACTAATTCTTCCTGATGCATCGTTTGGAATTGCTCCCAAATACTTTGTATTAAACAGAACTGCAATATCGTTTGCAATCTGATCAATTACCCTGATGGTCTGGTTTTCTTTGAAGATATCACTCTTTGTATCTGATATAGTGACCATGGAATTAATATCAGCAAGAACACGAACATTAGGACCTACCTTGTGTAGTGTGAATTCACCTGCTTTGATCGCTGCTTCAAGCTGTGACTGTGTGTAGTTTGTATCAATGGTGAATTCGCCATTGTAGCTCTTGTTTAGGTTAGATTTATTGACTTCGCATCCGGCAATAATACCAGTCACCCAATAAACAAGTGATGCTTCTGACCATCCAGCATTTATTACTTTGTTTTTAACGTTTACAACACCTTCATAATCATAAGCTTTATGGTAGGTTACCGCTTGGAACTTTACACCAAGCTCATCACGCATTCTTTTTGCATATGCTGCATATAAGCCATTCACTTCGTCATCAGTTGAAATAACGCCGATTGCATTGAATGAGTAGGATTCTGCTTTTGCAAGAAATGCTGTATGTGAAGCACCGTCAATCACTCCATTGGACCCATTTGAAAGCGCAGTACCTGCAGTAACAGTAAGTACTGCGTCTGATTTGAATTTCACAAAATCATTTGCAACCAGTTTATCCGCTGTCGCAACCGTCTGCGCATCAACAATCAAGGTTCCAAGATAAGTTTTTACATCGAAGTTATCCGGTTCATCAGCGTTTAGTGATATTACTATCTTTAAATCATTACCACGGGTACCTGAATGCTTTGCAGTTGCATATTCATTAGTTGCCTTCTCTCCATTACTAGTCAACTTGTATGCATAGAGCATCTTTGCATTTAAGAATAAATCTCTTAAACCCTTCATCTTTGCATCTGTGTACTCATACCCAAATATCCTCAAAGATTCTTCTTTAAAATCAGCGTTGGTTACTTCAAATATTTCACCATCGATGCCCCAGTTAAGATCAAGGCCAATTGCAGCAATGCCCCTATCAGTCAAGCTTTCACTTGCCGATATTGCAGAAATAAAATTGATGTAAGCGCCAGGAAGGACTTTGTTTTGCGTAACAAACGTTCCGCCACCTAATGACATACCTATTTCACCTTACCTTTCATAAACTTTTCAATCAGTGCATCCACTTCTTCACGTGAGTATAATCTTTCATCATGAAGAATCGCACTGATAATATCTTTTTTGTCATTGTATCTTTTTGAAGATAACAATTGTTCCTTTGTAAACATTGGAACGTTTTCTTCTATTTTTTTAATTGCCATCTGTCATCATCCTTTCGCATCAGATTTGACTGTTACTGTATCCATTTCGTCTGCTCTATCAGCATCTTTGTAAACAAATAGATTAAAATTAACCATAAAGTTTAAAACTCCATCCACTATTTCACCGTTCATTCTTGTTCCCATGATCAAGTCTTCATTGATAGTGATTGTTTCCAAGCAATCAAATAATCTATCAAGGACATCAGCGCACTCGATTCTAGTTTTATCTGTTGAAGGAAAGTATTGAATGCAAAACTGATTTGTTCTGAAATATTTTTTCCCAAGGAATAGTTCATTGGTAGGGTTCAAGCACAAAATAAAAAAACAAGGCTCTTCTAAACCTTGTTCAATAGATTCTGTGTGTATGCCGTAGCCGTCACCAAACTCTTGGTCGATTGCTTTGCTGATTCCATAAATAATTTCATTTATCACTTGAAACATCCCCCCAGCAACTTACTTATTTTCTTTTCAAGGATTGCAGGTGCGGCAGTTTTAATTTCTTGTTCAGATATTGTCAACATAAACTTACCTTCCACCCAGCCTTTGTGATTGCTTGTTCTATGCCCAAACTCTACATAAGAAGCGTATTCCGTAGGATTTACAATTTCAATTACATAAGTGTCGCCCAAATGATTAACCTTTAAAGACTGTGCATATTCAGTTGCGTTCTTTCCTGCAGTCCAACCTCTTCTTAATGTGCCGCCATTCTTTCCAGTACTTGAGGGATACTGACCGACTGGTGTTCTTTTAATTACCTTCGCAAGAAGTCTTGCAGCCAATTCCTTTGCACACGATTCAATAAATAAGCTCAATTGTTCATCTGTTGCTTCAAGATTTGCTTTCAGCTTTTCCCATTCACTAAAATTACAACCACCATTTCTACCCATTATGCCCACCCCTTAAATAGCTCTAAAATAATCTCTTGATGCGTATCATACACGGCTGGAACACCGCTGTTTTTATATTCTACCATTCTTCCTTTGCTGCTAATGACCAACTTAGATCCAGCCTTTATGTCAAGTTCCGGAGCAATGAAAACCTTTATAACCTGTTCAACAGCATTAGCAGTATCCGTCTGGTTTGCCTTATTCTTACTCGAATATGATAGCCGACACGACTTATCCAAGTGTATAATAACCTCTTGGTTGGATGTTACCTTTGTAACAGGGTCCTTTTCCTTTCGGTATTCAATAACAGTACAGGTGGAATCATAAAGGATTTCTATAGCCTTCCTGGCTTGCTTAACAGCTATCCTATTCATCGGAATCCCACCTTTCTAAATCTTCTTAGCTGAGCTTTATAATCCTTTAACAGGTGGTCCTTGAATTCAGTCGTAGGGCTTTTAAAACTGGTTGATGCATCTCCTTCGCTTATGGAGGACACTGTCCCCAGGGGCGTACTTTCTTCCCCTGGAGATTCATTCCTGTATAGATCCATAGCCATCCGATAAGCTGTAGTTTCAAGTGCCACCGGTAGCTCTTTTAGATTGCAGTAATTTAAAATCGTGTCCTGAGTATCGTCCAGAGCAAATTGAAGAGGAATGTCTTTCGTTTTATCGCAAGACTTCACCCCCAACAACTTTTTAAGTTTTGACAACTCCAACATATTGTCACATCCTATCCGATTTTATGCTTAAAGGCTACGATACGAATCTGCTTAGGCTCATACACTGGCTTCCAGTTTTCAGCATCTCTTAATTCTTCCCTGGAAGGCCCTTCAGTAAGCTCAACATTTGTATTCGTGAACTTAACTCCACGTGGATGAAGGATATTGGTCTTTCTGTTGATTAAATAATCTACACCAGAACCTTTTCGTTTTGCTCTGTCAGTTTCAGTTGGTACAAAGCCCACTGGTGAGCCGTTACCAAGTGCAATTGCGCCGGCACCAAACAGATAAGATGTAAATACCATATTAGCTCCAGAACCTTCATAAGGGCAGCCATCATCGATGATAACTCTCTTACCCTGATATATATTAATTGGGCCGCCAGTTGATGGCTGAATAGTTTCAAGTAGGTTTTGCTTTCTTAAAAAGGCTTCAGTAGCGCTATGGACTGCTACAGCTGTTAACTGCTCTTTTGCATCTCCTAAACATTGTTGTGCGTCAATGAACGCTGGTCCAGACCACTTTGCCTTATCTCCAGTTAAAGCGGAAATATCTGAAATGTTGCTAGCCATTCTTGTAATAGGATCAGCACCTTCTTTATAAGAACCAAATACACCTTTGAGCAATGCAATCAGTTCTTTTTGCATATCTCTCGCCCAAAATCCTGCAACTAAATCCCCTATAGCTTTCATTGGATCACTGCCGGCCAATGCTGCAGATAAATCTGTTGCTCCCCACATCTTTGCACGTCTAATAATAACTGCCACGTCCTTGTTGGATGTAATTTTGTTTTCATCCAAATCTTTATCTTCAATGACTTGCTCAGATTCTCCGTTTAAATCTTCAAAGAATGGCATATTAACCGTAGGAGCTGCTTGAGATGCTAAGGCATCAAATTCACTATTATTTACTACAATTCCACTCTGTAACAGTGCGGATAATTCCATAGTTCGATTAACTACATATGGATTAAATAATTCTGGTACAATTACGTCACTTAATCTTGTTATTGCTGGCATTATATCACCTTTACCTTTCTTAAATTGTTATTCCGGCAGCACTTGCTAACTCCTTAGCTTGTGCCGGGTTTTCTTTTAATAGTTTCCCTTGCTCCGTCAAGTTGAAAGAATCTTTTGCAAATGGATTTTTTCCGGTATAACTTCCTCCACCTGCAGGATTGTAATCAGTTCCTGATTTGGCATTCTTAAAAATATGTGGAAGAGATTCTTTGTATGGCTTTATGGAATCTTCAAGGCCTATCAGATTACCGTCCTTATCATAGTTAAACTTATCAATTCCACCATGCTTGTAAATCAGATAGTCAGCATCAGTAACCCCTAAATCCTTCAGCTTATCCTTTAAGGCATACTCCTTCTTAACCTTTTCAGATTCAGCCTTAAGTGTTGCTATAGTAGTTTCGTGAGTCTTAATGGTTGTCTGCAATGTCTCGTTGTCACCATTGCTCTTCTTTAAATCAGTAATTGTATTATTTGCAGTCTTAAGTTGTTCATTGACAGTATTAAACTCTGTCTTTGGTACTGCATGCTTTGGGAACTCTGTATTGAGTTCTTTTGTAAGAGCTTCTGTATCTAGTACTCCCTCTTTTGTGTGTTTTGCTATTAAGTCTAATAACCATTGCATAGATCATTCCTCCATAGATTTTTATTCCCGCTCTCCGGGTGTTGGGATTCAGCCGTTTATTCTCCGGCAGAGTGACGATAGTTTTGCGTCATTCCGGACATAAAAATAAGACGTATACGCTACGTCTCAATGCGAGATAAGAAGGATCACATCCTTTCTACTTCACAACAAAATTCTCCCATTTCTTATATGCATCAACATATGTTTCCTGCTTATCTCCATTATGGGTGATTTCGTAGTACATTCCATCAGAAACAGTGGTACTTGCAAGTGCTTTGTTATTCTGCAACGTCTTACATGACCAGACGATGAACACGTCACTTTCTATAATCCGTGCATGGTCGGTCCTGTCTGCATTCTTGTTGAAATACTCAACCACAATCTGTTTTACTAGCTTCAAAAAATCATCATTTCCCATATAGTCACCTTAACCTTTCTTTAACAGACATCTACTGTCTGGAGCCTTCTCGCTCTGCACTGCATGCCCTTTATGCACTGTCTCAACTCGACCACACTCATAATCCTTGCGTCGTTCAGCGCATATCGGGCATCCGGCGCAGGTCTTTGGTCTTTTATCTACTGTTGCTGATAGTATTTTCATATCGCTTTAGCCTCCTTAAATGCCTTTATCATCTTAGGAAACTGAATTGCTATCCAATCAACCATTTCTTCGTTTGTCGCCCATCCCGTAGTAGTACACGAGTTGTCCCACAGTCCGCTTTCAGCCAGAAACGCATGCACGATTTCGTGTCTTAGCGTTTGCTTTTCGATCTCTGCGTGGTATTCTCGCGATTCTTCTTCATAACCGGGATATGTAGCAATGTTACAAACAACAATTTCTTTCATGACTACATCACAGTAACCAACTATATCTCTTTTTTTGAAATCCTTATCTTCGTTGTAACTGCGTCGTTTTATTTCATATTCAGTCCCTAATACATTTACCTTCATAATCCTCCTCCCAGGCATAAAAATACCACCTACCGTTTACGATAGATGGTACGTTAATTGGATATAACTTCTATAGTTTCTATTTCTGATTCTAAAATTTCAACCAGTCCTTTGTTTGAAGCATCTCTAATAGTTATTTCTGCTTCTTCTGGCTCATTATCCAGTGCAGAAGTGTAACTGTCATAAGTGCCCTCAACCACTATGCCTCCAACACATAGAATCTTTAATTTTGTATTCTGATTTAATTCGTTTATGTTTTTTAATGCGCTTAACAGATCCATTTGTATCATTCCTTTCCATGCGGAACTACGTGAACACCTTTTTTACTGTAGTGTATCTTCGCCTTAGTTGTGACGTATTCAATGCCAGTCAGATTATCAATACTGATACCTATTTCCTTCTTAAAGTCTACAACCTCTTTATTGTTCCAGTTCCCTCTGTTATCTCTCTTGATTTCTCCTGTACCAGCATACTTGTTGACTAAATCTTGGGTTTCCTTCTCAGTTATAGTTAAGTAACTACGACCTTCTATATAATTATTATGTCCTTTAATATGCTTGCCCTGTTTACCAGTTTCAATTATTTTTACGGTATCATCAGAACGGATATTATTACGTACTTTCTGGTCCTTATATGCCAGTTGCAACATATTCCATTTCTCATCATTAGTATACTTCAAATTTTGGAAATCATCAAGGTTTTTAGGAGTGTCTTTTCCAAGGATACTTTTATACTTGTCATACTGTTTCTGATCAGCAAGCGTATTATTTGACTTTTTCTCTGCAAGGTCTGCTTGAGGATTGCCCTTAACGTACTTCTCATGCCATTCCTTATATGTCATATCCGCAGGCACATTATGTGTCTTTCCGGTAACCGGATCTCTTGCTGCTCTTTCCTCTCCTTCGGTGAATTCGTCGTTAAAGTAAGGGCATGTGCAAGTGCGGCAATTACAATGGAAAGGTGGCGCAGTAACGTTGACTTTGTACTCACTTATAGGAAAATGGGTGCCATCCATCCCTATGCAAATATCGCTTGTTCTAAGGTCAAGTGTTGCTACATTTTGAAACTCTTCAATTTCAAGATTTTTATAACATTCCTTCTGAGCCATTGAATGTATAGCTGCTGTTTCAGTCATTACCAAGTTACCGGCTTTGCTCTTGCTGACATCCATCTTCTTGGCAAGTGCTGCAATAGTCTTACCAGGATCACTTCCACGTATCAGCTGTTGCGTAAGCTCCGTATGTAGCTCACGTATAAGCTTTTCTTTGTCTTGCCATATACGACTACTAAAATCCTTGCCATCAGCAGCCCAAGGCTTACGAAGGAAACTACTTACCAAATTATCGTCTAGTGCATGAAGATCCTTACCTACTCCAGTGCCTTTCGCAATCTCAAAAGCAGATTTATAGTAGGATGATTTAAAAGCTTTATTCAACAAATCCGAGACACCGCCCTCGTACTCAACAAACAGCTTTTCCACCGACTGCTGTAATTGGAGTTTAACTGTTTCTAGCCTGGTAATGTGTACTCTAGCCGAAGCGTTTTCAAGTTCCTTCATCCATTTCTGATCAAGAGCATTTTCCTTACCGTACTTAATATATTCATCTACGGTCCAACGGAACTCTTTGACTTCTTTCCGATTCAGATACCGCTTAGCTTGAGCATAGCTAATTTCGTTATTCTCTGCAACTCGATAATACCATGTAGAAACATCTTTTTCTAATTCACTTGTAGCTCGTCTAAACTGCTCTTGTAGATCATCGTAATATGCCTTACTCTTTCGATACGTATCGTCCTCTAACTTTTCAAATCGCTTTTTCCAGTACTCTCTACTTTTCGCCATCGTCTACGTCACCACCTTGTTGCAGGGCGTAATCATAACCGTTTTCCATGGATAGCTCCTTTTGTATCTGCTTCGCTTCTTGGTCAGCATCTTCTACCCACGGGTGGTTTTTATGAATAGTCTTATCAGATACAATCCCTTGGCTTTGTGAAGCAATTGTGGCAAGCTCTAAATCGTTTGTAATAGCAGTACGTGTCCATGTTTGAATGATTGTCTTACACTCTGTATTAAGATACTTACATATTGCCCGTACAAACTCTCCAAATGCAAGTTTAAACTCTGTTTCCATTAGACCCGCTTTTAATTCCAATAAAGAATATAAATATTTAAGAGCTACCCCCGAAGCATTACCAAACGCCTCCGGTTGTGGGTCAACTCCCTGTCCCTGTTCGAATATAGCTTTTCTTGTGATATCCAGTAATTTCTCTCTGGCTTCTATAGGTATGTCTATGGTCAAAGTCTGTATACCTGCATGAACATTTTCACCATCATCATCCAGCTTTACAGTTTTATACTTCTTCAAGTCCTGTAAGAACGGCGCCAGATCAGTTCCACCATACCCGGTCAATACAAAGATAATCTCCTGGATATCTTCCAAGTCATTAACAAATCCTGAAAATACTTTGTCATATACATCAATCAACGCTTTAATATTTTCAAGGTCACTTGAATCAACATTATTGTTGAAGAAAGGTATAAAAGGCACCCTGCCATAATCATGAATAAATGTATCTTGCTGTGAGGATTCACCGTCTACAAAAGAACAAAACATTGTATAAGACTGTAGCCCATCATCCAAAGTATCGGAAGTCTTTTTACGGAAGGACTGACACTCAGTATCATTCCAGTATTGATATATGGTATATAGTTCTCCATCATTATCATCCAGCTGTTCGTATACTCTTAAGACTCCACTAAGTTGCTTGTTTAAATCAGCTGTCCATATCGGTATGATCTGCTTTGAATCGACCACACCATATTCGAATTTTTTTTCTGAGTTTATCCAATAATGAAGCCATGCTACACTACAGTTAGATGCATTAATACATAGGTCCTTTGTAGTCTTGGCGTATCTATCTCCCAGAACTTCTGTAACCCGTTTGTTGGAATCGTCATCGCCGATATCGAAAAGAGGCGGTGCAGTGAACATGTACGCTGCCTTTTGATTAACTAATAACCCGTGGAAGTTCCTCGGTATCCGATTGTCTGCATTACGGAGAGGGTTTTCAGTCTCATCCGATTCTGTCCTGGGCTTATTCTTTAGGACATCATTCTTGTTCCGGTAATATCTTTCAGCTGTAGCAGCATTAGTAATAAAATTTGTGTGCCCTGTTGTATACCTCTTAATCACCTTTTTCATTGTTTCTAAATCCATGTTTCTCACCTGCCTTTATCACCTGCTTTATTCAGTTTTTGCCTAATATTTTCAATCAGTGCATCAATGATACAGCCTCCAGCCAATAGGCAAAATATCATACATGCTATACACACGATTAAATCTACCCAATACTGATTTATAAGTATTTTAGGTAGATAGGAGAATATATCATTTAATAGATTTACCAACCTAGCTAAGACTGATATTTCAATTAAAATTGCAATGTTTTTACCCATATTACCTCCCTTTATTTTAAAACCGATACACCATTGCCTTTATCAATGCGCTCAGCAATTCCTGTTGTAGCGTCTGGCGCATCATCATGTTTGTTCTTGCCTTCACGCTGATATTTAGACATAGCACTATAATAATCGGGCCATCTATCACGCCAGTTTTTAGGGAAATAAATGTGGTCCATAACCCATGTACTGTTAGAAAGTATTCTAGCAATCTTATTCTTACTTTGATGGAACCACCGAATCTTGGTTTTATTACTCTTAAATGTTTCTCTTAGGATTCTTACTACCGAACGGGCAAATCCTCGACCACCATTATTACTTTCTATATCAGCTACGTTTACACCGTGTTCATGGATTCTTTTAGCGGTTTCTGTTTCAGTAACTTCCATCGGTGCATTGGTATAGTAAATATCTAGGACATAAGCCTCTTTATTATAAACACCATAAATAATATTACAGAGATAATCTGTTCCCTCGTCTGCCGTATCGCAATATGATTTAATAGCAGTGAATAGGAGATTGCCTTTATCATCCATAGGAAACTTTTCGTATGTTTTAAAGCTACTATATAATTTGCCTTTCAGATCAATAGGTTCCTGCTGATAGTTAGCGCTTGCGATATCCTCGCCCATAGCCTTTATCTTGCCAAGATAAGATTTATAGGAAAGCACTTCACTACAAAGCATTTGTTTCTTTTCTTTATCAACGAGAGCTTTCATGCTTACATGACGAATCTTTGCCCCTTGCTCCTTGTAATGCTCTAACGCTCTTCCAGCCAGATCATCAGAGGCCCATCTAGTCATAATAATGATTATCTTCCCGCCCTCTTCCAGACGGGACAGCATTGTATTGGTGAACCATTCCCAGTGTTTTTCTTTTACAGCCTCATTATTTGCTTCCTCCGCATTCTTGATAAGGTCATCGATTATCATAAGCGAACAACCGAACCCTGTTGCAGTACCAGTTGGAGAAGTAGCTAAGTAATTGTTATATCCACCCTCTAAACTCCAAAGATTCATAGCACCATCACCACGCTTTATAGACACCCCAGGAAACACGTCTGAGTACACAGGCTTATATTTATCTGCTTTCTGTTCCATAATGCTGTTACGGACGTTTTTAGAGAACATGGTGGACAGGGTTTCATTGTATGATCCAGTCATTATCTTTTGTGTTTGGTCCTTACCAAGCACCCACTCAACCAAATTACCAACTGTCCTAGACTTCCCGTGTCTTGGAGGCTCATTAACTATCATTACTTCATCGTCAGAATCTAAGAACTCTTGGAACTCATTGCATAACTCAACCAAGTACTTTCTATCAGACTTATAGAAGTCAGGAGCTTTTAACTGACAATAAAAAAAGAACTCACGTCTTGCAAGTTCTATCTTTGCACCTAGCTTTATTAACTCTTTATCCATCATTTATCAGCTTCTTTAAGTCATCGGTTGATATCCCCTCATAAGGATTGTTAATATTACCACTGACTTCTATGTCTTGCTTGTCACGCCACTCATTAGGCTTGCGGTTCTTTAACCAGAAGATTTGAGCAGTAGTATCTGGTGTAACATCCTTTGTCTTTCTTTCAACCAACACTGTTTTAGTCTGTGGAAATTGCTCTCTTACCAACATAAGATCTGAATCAGTAGCCTCCGGATGCTCATACTTATAATGATTCATGAAAGCCTCAAGTTTTTCATAATACTCTACCTGATCCATTTTAACACTTATATATTTATCCTCATTGTAGGAATAACCAAGAGCCCTTTTCAAAAGTGCATTCTCAACTAAAATGTCAACAACTTCCTTGCCCCTTTTTAGGGCCTCACATATCTCACCATACCTGTTTTTCCATTCATACAATGTCTTTGCCGTGATTCCTAAGTTATGAGCTATCTGTTCATCGGTTAAACCATCTCTAGCATATGCTTCAAGCCTAAGCAAGCCATCTGGTGTTAACCAATATTCATATTTGCCTTTTGCCAT
Protein sequences of DBSCAN-SWA_2 >NC_010001|3592260:3637196|3625571_3626045_-|WP_012200965.1|tail|DBSCAN-SWA MDNVTMIGKDAVSAKLAQCFVTIGRNRYNFMQAINFEAKFEKTKTEIPILGKTGAGNKSTGWKGTGSATFHYNTSIFRQMMIDFKNTGEDVYFEIQVTNEDKTSAAGRQTMVFIDCNIDGGILAKFDADGEYLDEDMDFTFEDFNMPEAFKILSGML >NC_010001|3592260:3637196|3606113_3607004_-|WP_012200943.1|DBSCAN-SWA MYKITKKVLLASNIYLMDVEAPRVAKSCYPGQFVIVKMDEKGERIPLTICDYDREEGTVSIVFQAVGPSTQMMAKYEVGEYFRDFTGPLGCKSELIDEPVEELKEKKILFVAGGVGTAPVYPQVKWMKSIGCNVDCIIGARSNDFIILEDQMKEVAANVYVATDDGSYGFKGNVNDMIKELVQNQGKSYDLVIAIGPIIMMKFVCLLTKELGIKTVVSMNPIMVDGTGMCGACRITVGGQVKFACVDGPEFDGHLVDFDEAMKRSQMYKTKEGRKELREKEGDTHHGGCGYCGSEE >NC_010001|3592260:3637196|3627559_3627985_-|WP_012200968.1|DBSCAN-SWA MINEIIYGISKAIDQEFGDGYGIHTESIEQGLEEPCFFILCLNPTNELFLGKKYFRTNQFCIQYFPSTDKTRIECADVLDRLFDCLETITINEDLIMGTRMNGEIVDGVLNFMVNFNLFVYKDADRADEMDTVTVKSDAKG >NC_010001|3592260:3637196|3616691_3616850_-|WP_157668734.1|DBSCAN-SWA MNTFNNCMIVITNQKKTREQMLSLMDLYVMNNRITAEQYQELVDKMDQVGLE >NC_010001|3592260:3637196|3630956_3631217_-|WP_012200974.1|DBSCAN-SWA MGNDDFLKLVKQIVVEYFNKNADRTDHARIIESDVFIVWSCKTLQNNKALASTTVSDGMYYEITHNGDKQETYVDAYKKWENFVVK >NC_010001|3592260:3637196|3635218_3636631_-|WP_041704576.1|terminase|DBSCAN-SWA MDKELIKLGAKIELARREFFFYCQLKAPDFYKSDRKYLVELCNEFQEFLDSDDEVMIVNEPPRHGKSRTVGNLVEWVLGKDQTQKIMTGSYNETLSTMFSKNVRNSIMEQKADKYKPVYSDVFPGVSIKRGDGAMNLWSLEGGYNNYLATSPTGTATGFGCSLMIIDDLIKNAEEANNEAVKEKHWEWFTNTMLSRLEEGGKIIIIMTRWASDDLAGRALEHYKEQGAKIRHVSMKALVDKEKKQMLCSEVLSYKSYLGKIKAMGEDIASANYQQEPIDLKGKLYSSFKTYEKFPMDDKGNLLFTAIKSYCDTADEGTDYLCNIIYGVYNKEAYVLDIYYTNAPMEVTETETAKRIHEHGVNVADIESNNGGRGFARSVVRILRETFKSNKTKIRWFHQSKNKIARILSNSTWVMDHIYFPKNWRDRWPDYYSAMSKYQREGKNKHDDAPDATTGIAERIDKGNGVSVLK >NC_010001|3592260:3637196|3617199_3618192_-|WP_012200955.1|DBSCAN-SWA MAGFKQLTKLPKGFIGTTKLKNGYKGAIKFWSNGKKVLVGYGYAQIAYSEDGINFTVSNFVPTSYLRAFGGTFFKGKFYIYDNSYSIYSSKDCVTWTQVYANTSNGQKSNNKTMATNGNILVLFDENQGVLYSYDGINFTKSSLVQTGVLTIVNKLRCINGVFFALGTSDETIFKSTDGINYVKVTSTYSDNYFTDICFFNGYYYACKNSTGNCIHRSQDGTTWTMLTTKGISTPWYLNILQGRFVAMNGVGTIWRSSDAITWNLDLYADGINHRIDDYIVDDESGRMYQVGNNSTVSNCIYTTTGADLNYTNLPANSNQSPICAVLEGE >NC_010001|3592260:3637196|3597765_3598167_+|WP_041703716.1|DBSCAN-SWA MKKLFKRLIAFTFVAALLFTSLAPCSNVFAATNTGSDSHDMVFKANSNTFAKVTVTVKYTYTGSSAEITSITRSLTTYDTSTYTVQYDSIASSYSGASGTITYKIYKNGVYYSFATLLVSVSRDGTVSWNTGT >NC_010001|3592260:3637196|3607449_3609303_+|WP_012200944.1|DBSCAN-SWA MCGIAGFYHPRQNYLEKESYYKTILNCMTKRLYHRGPDEQGIYLNEHIGLAHARLSIIDLVSGGQPMLRSFGERTYVIVYNGEIYNAEELKKELQQQGMSFQTTCDTEVILLGFIANGPDFVKKLNGIFAYAIIDVAKNSLYLFRDQAGVKPLFYTLYEDTLIFSSEIKGLFEYPGFTPKVTSEGLNEIFSIGPAKTPGCGVFDKVKEVLPGEMVCYNQSGFTKELYWKLVSKPHEDSYEETMERTGFLVTDAIRRQMVSDVPICTFLSGGIDSSIVTAVCANELKKKNKQLDTFSFDFVNNEEFFKANKFQPSRDLPYALKMAEHFNTNHHLLECDNVMLAKRLHDSVLARDLPAMADIDSSILHFCSLVKQYDKVALTGECADEIFGGYPWFHSEEAKKSHSFPWSRDLSARKQLLKDEFLQCLHMDEYVQNAYETTVSETPYLAEDSEDERRLREISYLNLKWFMQTLLDRMDRTSMYSGLEARVPFADIRIIEYLWNVPFSMKAPDGIVKGLLRMSCSGLLPDDILWRKKSPYPKTYDPGYESLVATQLLEVMNDSSSPIVSFIDKKKLDSFLHTPSDYGKPWYGQLMAGPQTLAYLLMINDWLETYHIETVI >NC_010001|3592260:3637196|3593117_3593435_+|WP_085953449.1|transposase|DBSCAN-SWA MGDFNRFDSPDKILAYAGLSPSTYQSGQLDSSHSRMEKRGSRYLRYALFNATKYVCHWDPAFSTYLAKKRAEGKHYNVAISHAAKKLVRVIYQLEKSGQQYIKSA >NC_010001|3592260:3637196|3593610_3595572_-|WP_012200935.1|DBSCAN-SWA MQGKKQRITQLTELLDEAARVYEQEDREIMSNFEYDKLYDELKKLEEETGIVLAGSPTRKVGYEILSELPKERHESAMLSLDKTKEVPALIDWLGNKEGILSWKMDGLTIVLTYRNGELVKAVTRGNGEVGEVVTNNAKVFKNLPLTIPYEGELIIRGEAVIRYSDFEMINAQIPDADAKYKNPRNLCSGSVRQLNNAITAKRNVNFFAFALIRMDEMNRFKTMMEQFNWLKELGFDVVEEKLVTAENMAETMEYFESHIITNDFPSDGLVLFFNDIAYGESLGRTSKFPRNGIAFKWRDEIKETTLQEIEWSASRTGLINPVAIFEPVELEGTTVSRASLHNISIMEGLELGLGDKVTVYKANMIIPQIADNLTRSGHLPIPKTCPVCGGDTMIKQDSDVKSLYCMNPECLAKKIKSFTHFVSRDAMNIEGLSEATIEKLIAKGLIKELADIFHVKDFKEEITTMEGFGEKSFRNLVDSVEKARTPILAKFIYSLGIANVGLANAKLICKEFGYDFNKVSNATVDELTQIPQIGYVIAEAFVSYFQKPENKLIIEDLLKEITFEKEEAAGGSEKLKGLTFVITGSVEHFTNRNEVKDVIEQHGGKVTGSVTAKTNYLINNDNTSSSSKNKKARELGIPVITEEEFIQLLNEA >NC_010001|3592260:3637196|3620920_3621877_-|WP_012200961.1|DBSCAN-SWA MYQVSLIIKNKLYFPVVSGGLVWTTERKSTPGSIKLSVIKDESMKIEEGCQIALVDDELGVFYGYIFTIRETKDDLLEIIAFDQLRYLKNKDTYVYTAWSTGDLLKRIATDFKLKIGTVDDTGIKLKLAEVDTELFDMLNNSIAHTVEITKDLYVMYDDYGKLCLRKASDLILNILIDESSAEDYVFTRSIDESTYNSIKLVYENNKAGTREVYKVYDSSTIEKWGVLQYSEQIQNQNIAKDKVEKLLSMHNTPKRTLQIKNCLGDSRVRAGFSIIIPMLDTNGEKKFHPMFVESAKHTYEADAHFMDLKLTGGVINS >NC_010001|3592260:3637196|3629169_3630177_-|WP_012200972.1|DBSCAN-SWA MPAITRLSDVIVPELFNPYVVNRTMELSALLQSGIVVNNSEFDALASQAAPTVNMPFFEDLNGESEQVIEDKDLDENKITSNKDVAVIIRRAKMWGATDLSAALAGSDPMKAIGDLVAGFWARDMQKELIALLKGVFGSYKEGADPITRMASNISDISALTGDKAKWSGPAFIDAQQCLGDAKEQLTAVAVHSATEAFLRKQNLLETIQPSTGGPINIYQGKRVIIDDGCPYEGSGANMVFTSYLFGAGAIALGNGSPVGFVPTETDRAKRKGSGVDYLINRKTNILHPRGVKFTNTNVELTEGPSREELRDAENWKPVYEPKQIRIVAFKHKIG >NC_010001|3592260:3637196|3618203_3618608_-|WP_012200956.1|DBSCAN-SWA MAKYTEKLNLKKPEPTDFYNIKDFNDNMDKIEEHKHKASDITDLPESLPANGGNADTVNNHTVKSDVPSNAKFTDTVYAHPETHDAKMITGLPEKLPANGGNAHTVDDLHFLVLTQAAYDALSTKDANTIYFTT >NC_010001|3592260:3637196|3636623_3637196_-|WP_012200980.1|DBSCAN-SWA MAKGKYEYWLTPDGLLRLEAYARDGLTDEQIAHNLGITAKTLYEWKNRYGEICEALKRGKEVVDILVENALLKRALGYSYNEDKYISVKMDQVEYYEKLEAFMNHYKYEHPEATDSDLMLVREQFPQTKTVLVERKTKDVTPDTTAQIFWLKNRKPNEWRDKQDIEVSGNINNPYEGISTDDLKKLINDG >NC_010001|3592260:3637196|3601594_3602392_-|WP_012200940.1|DBSCAN-SWA MNYRLGNRLSKIIVCSLGILYLAFLALDIFGGPVIVSSIIKFTSIVLCFVLTVIRTLCYQAIADRILLNIAFVFTVIADVFLLFTTKFQYGVIAFCIVQLIYFYRMFSLSRYSLRMENRVHRSRHEKGFLNFFLHLATRVFVSAVVIVVLQLNGFVLDLLLFASVFYIVNFVGNLIFLILLYPKKHYMNGQIRYGLFCLGMVLFFLCDIQVGLYNLPMYLSNKSKVLDILVQIAGVGMWACYLPGQVAISMSDAKYNRSTRGSFI >NC_010001|3592260:3637196|3610312_3611035_-|WP_012200946.1|DBSCAN-SWA MTSVLILEDCKQNAEALGAIIKESKLDLHALFAYTYDEAIRILKSEVKIALFLLDINLDAFEKENKQGILFAKKIREIPFYAFTPIVFITSIAELELISYRETQCYSYLVKPYEKNQVLELLSKLAMEKTPMKREQLTVKKAGVNYRIHIEDIICIEAIPRGVSLYLKEEILDIKYLTIRQLLEKLPSEQFLQCHRMFVVNTDYIDYVDTVNQLIKLKGIHKTIEIGITYKSNMRRWMNE >NC_010001|3592260:3637196|3628439_3628814_-|WP_012200970.1|DBSCAN-SWA MNRIAVKQARKAIEILYDSTCTVIEYRKEKDPVTKVTSNQEVIIHLDKSCRLSYSSKNKANQTDTANAVEQVIKVFIAPELDIKAGSKLVISSKGRMVEYKNSGVPAVYDTHQEIILELFKGWA >NC_010001|3592260:3637196|3620221_3620614_-|WP_012200959.1|DBSCAN-SWA MVPQIDFDITTLEITEMPSKTFALSRNKVIGTVDGLEAIKQSVYMALSVERYEHLIYSWDYGVELEDLFGKDPAYVYPTLIGRIEEALLQDDRILGVDEFSFASNKGSVTVKFTVHTAYGDIEQEKEVNI >NC_010001|3592260:3637196|3592260_3593205_+|WP_085953448.1|transposase|DBSCAN-SWA MIFVGIDVAKDKHDCFITNSDGVVLFKAFTIPNNLEGFNNLYQKIKSVMEDEHKVKVGLEATGHYSYNLLGYLIDKGFPTYVINPLHTNLYRKSLSLRQTKTDKVDAHTIASMLMSNVSLKSYSDTSYHNEELKSLTRYRFDKVKERSALKVSVSRLVCILFPELEKLVTTLHMASVYALLYEFPGAQQVASAHLTRLSNLLEEASKGRYRKDIAILFREAARNSIGSNMPAKSLELKHTIRLIQELDSEIDEIENEINRIMDEINSPILSIPGISYRMGAIIIVRLVILTDLTLQIRYWLMLVYPPQLINPVN >NC_010001|3592260:3637196|3634973_3635207_-|WP_041703720.1|DBSCAN-SWA MGKNIAILIEISVLARLVNLLNDIFSYLPKILINQYWVDLIVCIACMIFCLLAGGCIIDALIENIRQKLNKAGDKGR >NC_010001|3592260:3637196|3616864_3617200_-|WP_012200954.1|DBSCAN-SWA MKLLLDSRKLIIAISTEITFGTFEGEEKWKVGNIYYIDNWFTVTDVDDVPIDVIPNKYFYIDGEFVLNPNWANAPEDISEINKRFDAMLLNKAESELEIDERLSLLELGLA >NC_010001|3592260:3637196|3631396_3631750_-|WP_049762400.1|DBSCAN-SWA MKVNVLGTEYEIKRRSYNEDKDFKKRDIVGYCDVVMKEIVVCNIATYPGYEEESREYHAEIEKQTLRHEIVHAFLAESGLWDNSCTTTGWATNEEMVDWIAIQFPKMIKAFKEAKAI >NC_010001|3592260:3637196|3612633_3614100_-|WP_012200949.1|DBSCAN-SWA MIFSTKKLDPRYNNASAFLIALFSLNTAASTLYLALMEYTAYFVNGIIGFSVVLTSVVLTALRVFDGLIDPFIGYVVDRTEGKYGKFRPFMVLGNALMAISSLLLYYCSYTIPKAIRLPIFVLIYIIFVFGYTFQMLVAKAGQTVMTNNPKMRPLSTYFDSLFITASYGGTALYAQTFLAKKYGGFKNPKLYQELTLWIVLIGGICTICAVIGIWSKDRKEYYGSSDQVTKINMKDYVSIMKHNKPIRMLVTAASVNKFTSMVYSNITVGVIIFGIMMKNYELSGQIGLVTAIPNLLVVSIGVMVAQRLGQKKAFESFTWLAICFQIIMTFLLLFGNLTQIGLTKWNMISLLFFGIFILLNGCKTVSNNIVVPMIADCSDYEVYRSGNYIPGIMGALFSLIDQVVSAFGTAFVGLVVAMIGFRNALPQIEDKLTNSLKMVAILCYCIIPIGGWLLSLLSMKHYSLDKEKMREINLKNSIEDNHFEKLS >NC_010001|3592260:3637196|3609558_3610320_-|WP_012200945.1|DBSCAN-SWA MNNITIFKSLAILLIVLIVLLAIYLLWKQKLMIYLKQEDELRLYKLYVKPLEEFIKEIRARQHEFDNHLNALLSMHLTIDNYEELVKSQAEYIRELISLPEHNYQGLLKISNKVLAGFLYSKIIASPPNVQVELIVGSKDIFTNVPEMDAVEVLGTLIDNAFEACGKEEGNVKIYLTSEDDRLIFIIKNRHEKIPINELSRFFERGYSTKNTKESQGYGLYNANRIMKHWGGDIYVENEIILGENYVSFHVEF >NC_010001|3592260:3637196|3627373_3627553_-|WP_012200967.1|DBSCAN-SWA MAIKKIEENVPMFTKEQLLSSKRYNDKKDIISAILHDERLYSREEVDALIEKFMKGKVK >NC_010001|3592260:3637196|3620613_3620937_-|WP_012200960.1|DBSCAN-SWA MLSIANLVQLIKKAACDAVEATKPTQFLYGKVTSINPLTVSINQKIVLTSGFLVVPEHLTDHKMTVEFMGAETTYTFKNALQIDDKVVLLQQKGGQQYLLLDKVVDN >NC_010001|3592260:3637196|3618607_3619150_-|WP_012200957.1|DBSCAN-SWA MAANLISYVPGVLQEVREFKAITTVENPEIDNLSVSLGLALNNQFIGYLDNYGCSRWETMLAIKPLDTDTLQERNFRILARVNEQLPYTYRSLEERLADLCGKDGYSIVMDKAKYILNVRIAQSAKKNFSAVAELLDRITPCNLILSISLLYNTHNMVGGFTHNQLSAYTYGQIREEENL >NC_010001|3592260:3637196|3621794_3622733_-|WP_012200962.1|DBSCAN-SWA MYKVYLWDMLLPITPSKIVTKINNQNKTLTLLNGQEINLVKSPGLTDINFQMILPGNKYPFVIGSYKSPNWYLNRLQELKVNKFYTTLLVSREIKGREWFKQEMDVTLEDYSITEDSKNGFDLVVDVKLKQYIDYGVKYVILENVKDGVVEKKTRSSSKTIPKTYTVANGDTLWAIAKKLLGDGSKCWNLAKLNNIKNPNIIFVGQVLKIQDVPATTQKSTSTSQKTGSSSTSSITTTALGGKSDTMKKATTSAPLSNTGILQAKLSGNEGQGSYDLRKDWLGGLNVPGQLNNKKQTVFPRSEWWASMDHGT >NC_010001|3592260:3637196|3627981_3628440_-|WP_012200969.1|DBSCAN-SWA MGRNGGCNFSEWEKLKANLEATDEQLSLFIESCAKELAARLLAKVIKRTPVGQYPSSTGKNGGTLRRGWTAGKNATEYAQSLKVNHLGDTYVIEIVNPTEYASYVEFGHRTSNHKGWVEGKFMLTISEQEIKTAAPAILEKKISKLLGGCFK >NC_010001|3592260:3637196|3633579_3634968_-|WP_012200978.1|portal|DBSCAN-SWA MDLETMKKVIKRYTTGHTNFITNAATAERYYRNKNDVLKNKPRTESDETENPLRNADNRIPRNFHGLLVNQKAAYMFTAPPLFDIGDDDSNKRVTEVLGDRYAKTTKDLCINASNCSVAWLHYWINSEKKFEYGVVDSKQIIPIWTADLNKQLSGVLRVYEQLDDNDGELYTIYQYWNDTECQSFRKKTSDTLDDGLQSYTMFCSFVDGESSQQDTFIHDYGRVPFIPFFNNNVDSSDLENIKALIDVYDKVFSGFVNDLEDIQEIIFVLTGYGGTDLAPFLQDLKKYKTVKLDDDGENVHAGIQTLTIDIPIEAREKLLDITRKAIFEQGQGVDPQPEAFGNASGVALKYLYSLLELKAGLMETEFKLAFGEFVRAICKYLNTECKTIIQTWTRTAITNDLELATIASQSQGIVSDKTIHKNHPWVEDADQEAKQIQKELSMENGYDYALQQGGDVDDGEK >NC_010001|3592260:3637196|3622742_3624881_-|WP_012200963.1|DBSCAN-SWA MAGISTQINIVDRMSSPLNNIISAVDNVIGSLQDVDGAINRGFDTSAIDNARRSVDLANKQMEEIHGSIKRNEEAQQGFNREVQNGTNYAGNLGSTIKNAVGAFLGIAAAKSSINWVHESLDLTNVSTNAERQLQTVLSNIGAADDAFDKLKNTASTVQGYSLYGDETMIGGAAELSTYIKDTKAIQSMMGTLANYAAGMSGGGEVGYQQMVEYATQLGKALDGTYDGLKKKGFELSEAQQKIIEKGTDMQKALVIEDVINQSWADLAVQMANTPQGVIIQMKNDFVGIRREIGAELYPAVLSLFDTIRSNMPQIEQVLHGFASGIEKIIFVINRVIEVASTMYSFIADNWPIISPIIYGIVAALTAYSIAMAITNALEVISNISKGIAAVRAYAAAAANTALTAAEMAEAKAKASATAAQYGFNAALLASPITWIIIAIIAIIAVLYAVVAAINKVTGSTVSATGIIAGSIAWLGAFIWNTVIGVLNGIIQLLWTLFVEPWIGIIEWVLNVFDGGFNSFGDSVANLIGQIISWFLSLGKVVTKIIDAIFGTNWTGGLNSLQDSVLAWGKNDKAITLDRNAPTIDARIAYGDAWDAGYKFGEGIDDKISGMFGGMTDQYGDFSYVPGNVSKIADNTGDIKNSVTATNEELKYLRDAAERDSINRFTTASIKVEMGGVNNTVTNNQDLDGMIDYLSEKIGEELNNTAQAYNKG >NC_010001|3592260:3637196|3611075_3612014_-|WP_012200947.1|DBSCAN-SWA MGNAQIYLLIGLGIIILLVLASCVKIVPQAYAYVVERLGGYQGTWSVGVHLKVPLIDKIARKVVLKEQVADFAPQPVITKDNVTMRIDTVVFFQITDPKLFAYGVENPMMAIENLTATTLRNIIGDLELDETLTSREIINTKMRVSLDAATDPWGIKVTRVELKNIIPPAAIQDAMEKQMKAERERRESILIAEGQKKSAILVAEGKKESVILEAEADKESQILRAEAKKEATIREAEGQAEAIVAIQKANADGIRMLNEANPGKGVIQLKSLEAFAKAADGKATKIIIPSEIQGMAGLVKSLTEVGAKDEL >NC_010001|3592260:3637196|3614304_3615204_+|WP_041703718.1|DBSCAN-SWA MNAMLFHTRSIQDTLKALKVNASTGLSTKEAQKRQQEYGKNQLEAKKGKSILSRFLSQFKDFMIIVLIAAAVVSFFISLLKGHADYIDPIIIFAIIFLNAILGVIQEEKAEKSLEALKKMSAPTAEVLRDSKRITLPSTELVPGDIIYLETGHYIPADARLITSINLRVDESALTGESHPVEKDANVILKENTMLGDRKNLVPATGVITFGRGIAVVTAIGMGTEVGTIARMIMEDETPETPLQKRLEKTGKALGIAALGICIAIFLLGTLQGRELFDMFMTSVSLAVAAIPEGLPRVV >NC_010001|3592260:3637196|3595763_3597197_-|WP_012200936.1|DBSCAN-SWA MAKVTKRSFDTEETGEKAVRINKYLSESGICSRREADTRIAKGEVTIDGEVAVMGSKVLPGQKVVLGKKEVSREEKMVLIAFHKPKGIVCTTEKREPDNIIDYIKYSQRIYPIGRLDKDSEGLIFLTNNGDIVNKILRAGNHHEKEYIVTVNKPITAAFLKGMAGGVPILDTVTKPCTLEAIDKVTFRIILTQGLNRQIRRMCEHFDYKVTKLVRIRIMNVNLGRLKVGGYRNLTDKELEGIKEQIELSNNEALNLEPDDEFPIERVNVANIAKTHGNNVATGKRRTKDKYFAEKKDDKKRSDATRNKAEVNKFASKNKSEGNKFGSKNSSEGNKFGSKNSFEGNKFGSKNSSEGNKFGSKNSFEGNKFGSKNSSEGNKFDSKNSSGVNKLGSKNSSGVNKFGSKNSSEGNKFSSKNSSEGNKFGAKNRSEDKRFVSKIKSEDNKFASRGKTEQPRSYSKGNASSNRFATKKSDYKR >NC_010001|3592260:3637196|3615294_3615507_+|WP_041703719.1|DBSCAN-SWA MKYGIRKPSLKKSFKARTTGKAKRKIKKALIPGYGKKGMGWLKNPKKAAYNKVYNKTSVSLSSILKNLFK >NC_010001|3592260:3637196|3604721_3606101_-|WP_012200942.1|DBSCAN-SWA MDVLKRVPVREQEPSVRAKNFEEVCLGYNKEEAMLEATRCLKCKNPKCVLGCPVSIDIPGFIKEVEAGNIEEAYRVISTYSALPAVCGRVCPQESQCEALCIRGIKGDPVSIGKLERFVADWSRENGIKPEPPKEKKGKKVAVIGSGPAGLTCAGDLAKLGYDVTIFEALHEPGGVLVYGIPEFRLPKSTVVATEIENVKALGVKIETNVVIGKSTTIDELFNEESFDAVFIGSGAGLPKFMGIPGENANGVFSANEYLTRSNLMKAFDESYDTPIVSSKKVAVVGGGNVAMDAARTALRLGAEVHIVYRRSEEELPARVEEVHHAKEEGIIFDLLTNPTEILVDETGWVKGMRCVKMELGEPDASGRRRPVEVPDSEFVLELDTVIMSLGTSPNPLISSTTEGLKINSHKCIIAEETDGKTTKEGVYAGGDAVTGAATVILAMGAGKAAAKGIDEFLS >NC_010001|3592260:3637196|3599016_3599754_+|WP_012200938.1|DBSCAN-SWA MRLKYKKLLLLFTMGIFGIGMVTISFQVSPDALQAAFIKEGDRDSLEPTSSPDSQAITPTAFATNTPIPENPNALKKDAYPEINELIETYYKAKVSTDANSIEILKNCVTDASLLDFERISKKVEYITDYKNFHCYSKPGTGEIDYIVYVCYDIMITKIQTGAPSADVFYITYQDGKPHIFLGNVSQKTKNYIDKTNLDEDVQKLSKEVDDNLAKAIEMDEDLREFYENLTAQTSNVEPSETPAP >NC_010001|3592260:3637196|3600050_3600983_-|WP_041703717.1|DBSCAN-SWA MSNLMSYENFLNYMKEEVKKRLGEGYETDLNQIRKNNGLLLDGLMIRCNTDKIMPSVYLNPYYENYQKGQSVTDIVDEIIKAYDGAREETGRIAIPARMEFEEVKSRIIYRLVNYEKNRLLLQNIPHYRVLEFAVTFHCLIRRDEEGIGSVRITEEHRKNWGVSVEELRNVAMENTKKNFPALIRPMEDIVYDAIKHESWTPKKEVETLPTNFNSSMYVLTNSCGINGASCLLYQGLLELFYKQLGSDFFLLPSSIHELILVPVGEDRDREHLEEMVKEINQTQVAMEEILSDTVYSYESIRDGLLDLGT >NC_010001|3592260:3637196|3616222_3616636_-|WP_012200953.1|holin|DBSCAN-SWA MEKFITGLQYIISLLGAFLGWFLGGVDGFMTALIWFVVVDYLTGVMVAIIEKKLSSKVGFRGIFKKVIIFCLVGISNILDIYVIKTGSAIRLAVISFYIYNEGVSILENISVIGLPVPEKLKDALIQVGNHDKKEDE >NC_010001|3592260:3637196|3612015_3612456_-|WP_012200948.1|DBSCAN-SWA MVSLCWLIVLALLLAIEIATLGLTTVWFAGGALIGFIASLLGVDFWIQMILFILVSLLLLFFTRPVAVKHLNKSRVKTNYEGLIGKVVKITQQVDNDNQTGEALVNGQEWTVRSETDGVIYEPGTKVRIVNIVGVKLIVTEYREEN >NC_010001|3592260:3637196|3631803_3632031_-|WP_012200976.1|DBSCAN-SWA MDLLSALKNINELNQNTKLKILCVGGIVVEGTYDSYTSALDNEPEEAEITIRDASNKGLVEILESEIETIEVISN >NC_010001|3592260:3637196|3619139_3620219_-|WP_012200958.1|plate|DBSCAN-SWA MYEDMTYEVILNRALARIPPGIDKREGSMIYTAVAPVCAELAQMYVELEGIMNESFADTASRDFLIRRAAERGLTPKEATYAVVKGVINIDIPIGARFTLDKFSYRATEKISAGAFKLTCETPGSSPNSSTGVLLPVEYIDGLTSASITEILIPGEEEEDTEVFRKRYFDSLDAQAFGGNRADYKAKVLSLDGVGAVKIYRATNVSGEESGGNVKLVILNSDHAKPSATLVDSVQSTIDPITNGGDGLGLAPLWHFVHVTGADETVINITTTITYQSGYTYEDVKSYIASAIDGYFKDLAKSWQDEGSSGLVVRISRIDVALLGIPGIVDVTGTMLNGSAVNIELDKNAIPVRGSLNGS >NC_010001|3592260:3637196|3615542_3616220_-|WP_012200952.1|DBSCAN-SWA MAEYKKLMTDQELQAKVLDIVNNHKTCYALGGTGQLVTDAFIDQKAKQLPSWYTPSRINELKKLVGKGYYAFDCSNLIKAILWGLFNGKQGVYNSNTVPDTNANGLINLCEDVSTDMSNIKPMELIWFDGHVGLYLGNGECIECAPSLNKVGVTKLSYQGKWCKHGKLPWITYTEQEKRQLKLTTPYMRGDDVKKLQQLIGVTPDGIYGPATDKRAHEILGLLGI >NC_010001|3592260:3637196|3630196_3630799_-|WP_012200973.1|DBSCAN-SWA MQWLLDLIAKHTKEGVLDTEALTKELNTEFPKHAVPKTEFNTVNEQLKTANNTITDLKKSNGDNETLQTTIKTHETTIATLKAESEKVKKEYALKDKLKDLGVTDADYLIYKHGGIDKFNYDKDGNLIGLEDSIKPYKESLPHIFKNAKSGTDYNPAGGGSYTGKNPFAKDSFNLTEQGKLLKENPAQAKELASAAGITI >NC_010001|3592260:3637196|3628810_3629158_-|WP_012200971.1|DBSCAN-SWA MLELSKLKKLLGVKSCDKTKDIPLQFALDDTQDTILNYCNLKELPVALETTAYRMAMDLYRNESPGEESTPLGTVSSISEGDASTSFKSPTTEFKDHLLKDYKAQLRRFRKVGFR >NC_010001|3592260:3637196|3626063_3627371_-|WP_012200966.1|DBSCAN-SWA MSLGGGTFVTQNKVLPGAYINFISAISASESLTDRGIAAIGLDLNWGIDGEIFEVTNADFKEESLRIFGYEYTDAKMKGLRDLFLNAKMLYAYKLTSNGEKATNEYATAKHSGTRGNDLKIVISLNADEPDNFDVKTYLGTLIVDAQTVATADKLVANDFVKFKSDAVLTVTAGTALSNGSNGVIDGASHTAFLAKAESYSFNAIGVISTDDEVNGLYAAYAKRMRDELGVKFQAVTYHKAYDYEGVVNVKNKVINAGWSEASLVYWVTGIIAGCEVNKSNLNKSYNGEFTIDTNYTQSQLEAAIKAGEFTLHKVGPNVRVLADINSMVTISDTKSDIFKENQTIRVIDQIANDIAVLFNTKYLGAIPNDASGRISLWSDVVKHHKALQNIRAIENFQDSDVVITQGATKKSVVVSDVVTVVNAMGQLYMTVTVQ >NC_010001|3592260:3637196|3625069_3625501_-|WP_012200964.1|DBSCAN-SWA MSNFNRFMKQNKVQKENTTYPATKSLTDEKGNPLLWSIKPLTTRENDAIREDCTMDVPVTGKPNMFRPKVNTSKYIAKMICAAVIEPNLNDKELQDSYGVMTPEELLKEMVDDPGEYGDFANFVQQFNGFTTTMDDKVEEAKN >NC_010001|3592260:3637196|3602433_3604092_-|WP_049762398.1|DBSCAN-SWA MGKSRLLQIVLSAMNTTSTDTELILMNTTADTTVNQIDIGSGRNLLYILLVTIVVFLIFFLLLFVNIRKRTKAEKAAINVNQKLKESNKELEKALSEVTATKDALHSKYEELKKSKEIVKRMAYSDYLTSLPNRLAFVELLDGIMATIRQNEIVGILDIDIDNFKDINDTLGHSYGDEMLIDVMHRLEELIKENDYLSRIGGDEFAILIQNLTDLSELEERILEVQQIFATPFRVATREFFITVSIGVAIAPKDGKTTQSLVKNMNSAMYEAKERGKNNFCYYDDSINLKMMQKLEMQSELRKAIEENQFVVYYQPQINLVNDRVVGFEALARWQHPEKGIIAPIDFIPLAEDNGMIVAIGKKMLYESCMQLRSLQESGYRDIVIAVNLSARQFKDKDFLPMVYEILEETKADARGLEFEITETVALDDLEFSISTISKLKEIGITFALDDFGTGYSSLNYLKRLPVNNLKIDKSFLDTILENNSDQKIVHTMIDLAKHLNIEVIAEGVELSEQEKFLKEINCGKAQGYYYGKPVPKEIAFEILKKLNSENS |
49 | Clostridium_phage(55.56%) | plate,portal,transposase,tail,terminase,holin | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
3649174 : 3659730
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_010001|3649174:3659730|DBSCAN-SWA ACTAATTAAATGGTAGTTCTTCGTCAATGCCGTCCGGAATGTTCATAAACCCATCACCGCTAGCTTCACTTGGTGCAGGTCTTGATGATTGCTGATAGTCACTTCCGGAGCTCTTACTTTCTGCAAATTCCATCTCGTCTACAAGAATTTTTGTAGAGTAAACTTTTTGACCATCTTTATTGGTATAATTCTCATTCTGAATACTACCAACAATAGCAATCTTACTTCCCTTGGCAAGATACTTCTCAGCAAACTCGGCGTTCCGTCCAAAACATACACAGTTGAAAAAGTCTGCATCCGGTTCACCATCACGCTTAAATCGTCTGTTTACTGCAATTCCGTAGGAGGCAATCGCTGTTCCATTTGCTTGTCCATATCGAATCTCAGGATCTTTGGTCAATCGACAAATCATCATAACTTTATTCATTAGGCACCCACCTTCTTATCTCTCAATGGTTTTAATCTCTTAATTGCATCTTCTAAATTTTCCGGAGTAAGCTTTTCTACTGAATCAATCTTGTAGAACATTAACATGCTCTTAACGCTAGCACCGGTACGTGCTAGCTCTGCGTTGAGTTCAGCCATCTTGATAGCATCTAATTCAGTAGATGGCTCTTGTTCCTCTTGTTTCTTTTCTGTTATTGAATCTTTAGGCTTATTCTCTGGTTTTGGCTTGTTATTCTTTTGCTCCGGTTTTGCATTCGTTTTCCCTCTTGGGTTAACATCATGAAACTCTTCGTCTGGATCCTTCATTTCCTCTGTTGGAATACAGAAGACTTGGAAACAAGCGTATTTAAATGCAATAGACATCGCTTTATTAGTTGCCTTGTCACCACTATCCATGCCTTCACCAATCACAATAGCTTCAATGTTTGAACCATCCTCCGCATAGAAGGTATACTTCACCTTACAAATCGAGTAAATAAGGTTCCCTCCCTTGGTAGTTGTTCGTTCTTCTCTTGTCTGTTCCAAAATCTCCGGTACAACAAATAATTTATGCTTAATTAGTGCCGGATTAATTGCATTCATTACTGCATCAATTCCCCTGAACATAAATCCTTGTTGTTGGTTCCTGGAGTTCTTACTAATGGCTCCAATATCCTCCATGACACTACTAATTGCCTTATAAATCATTCCTTCCATTCTCAATACCTCCTATAACTTTGCAAACTGTACACCAAGCTCAGAAAGAATCTTCTCAACTTGCTTATGTTGTTCTTCAGTTCCTTTAACCATATATCTAACAGTAGTCACTTGACTAACTGGCGTGTGTTTCACTTCAAAAACAGATTCTTCTTTTTTCACTTCTTCTGCAGGAAAAGGTGTCTCGTCAGCGATAACAGGGAAAGGTGAACCAGGAAAATCTACTTCCTCTACAATAGGAAATGGAGATACCTCTTTCTCAATAACTCTTTGAGATTCCTCACGCCTTCTAATCTCTTCCTGTTGCCTACAAACTTCCTCCTGTACCTTTGCTCTTTCCTCTGCGCGGATCCGATCTATCTCTGACTGGCGTTCACGTTCTGCAGCAAGCTCGCGCTCTCTTTGTTCCCTGGAAAGACGCTCTTGCTCCTCACGAGCCTTACGCTCTTCTTCTCGCTTTCTGCGTTCCTCTTCCGCTCGTATACGACGCTCTTCGGCTTCAAGAATAGCTTTTTTCTGTATGTCATAATCATTAATAATCTTCATAGCATCAGATAATTTCTTGGTCTGTTTATAAACTGCTATTGCTTTTTCTTCAACCTCTGATTTCCTTGCAGCTATCGCCTCTAAGTCAGCTAAACGATCAGATAAGGCTGTATCAATATCATTCTTGATAGATTTAAAAGATGTATTAACATTGAGCCAGGAATCCTTAAAAACTTCTTCAAACTCTAAGGTTTCAGATTGACCGCCCATCTTCTGCAGATAATAATCGAAAGCTTTTTGTCTTTTCTCAGCCTTCTGTTTTTCGTCAAATTCCTTAACTTGCTTATTGATTAACTCAATAGGCTCATTTATAATCTCTACAAGCTCCTTAGCCTTTTTCTCGAACTCCTCATACGGAATCATGTAGTTACTTTTTACTTCCTTGCGCTTATCATCGATTGCTTTAGCTATCTTGCGAAGAGTAGCAATATCTTTCTTACGCTCGGCAAGGACTTCTTCCGTTACTTCCATAGATTTATAGATTTCCATCTGTTCAGAAAGGCTCTGCTTGATTTCATCAAAGTTGAAGGAAATCTCTGCAGGCTTCTGTGTTACTATTACTTGAAGTTCATTCAATGTTAATCTACCCCCTCATATAAACTATTAGAAATGTCAATTTGCTCTTTACTAAGCTCACTTATTGCAGATGCGATTCTTAGTTTGGTTTCTTTGCATGGCATATAACCATACTTTATATACCGGATCATTCTTTCGAATGTACTCATTGGATATAAAATTTTGTCGTCAGTAACCAGTCTTTTAAGATGCAAGTGTTCAAAAAACTTATCATCGCAAGCTATCATATATTCGATATGGTTATCATCTGATTCTGCATCTTTTACTTGCTCTTTAAAATAAGCAAATTTAGTTATCGTGAAATCAAACTGGCTTATTATGTTTCTAGCACTTCCGAATACTTTGCAACATAGTTCCAATGTGATGCCTGTTTCAACGTGCTTATAAGCTTTAACGTTTTTATTCTCGTAATAAAAATGATATTTTCCGCCAGTGTTATTCTCGCCGCACATTTTATCAAAGTGTTCCACTGCTGAATCAAAGTCCACTTTGCTTTCAAAAAAGATATCTAAATCCTTCACTTTCTCATGATTGAAAATATTTTTGAAAGAACCACCTGCGATAAATCCTTTGTAACCTGTCATAAACTGGTCCAACCATGACAACATATAAAAGTTATCTCTTTCAAAACTATTAATCATTTGACTCACCATCTTCCTCAGTAACTCGACTCGCCCACATGTCAGCAAAATGGATAAGTATCTGAAGAGGTGTTTCATTGTCCTTAAGGCTACGTCCCAAACCAACATACATTCCATTATGATAAAGGATTGCGAATTGCTCCTCTTCAGTGAGTTGAATAAATCTACTCGCCTCAACTACGGACACAACTTCATGGTCAACCTTGAGCAAATCTGAGCTCTGTGCATAAGGCTTTGCAGTCGATCGTTTCCCACTCTGCAGAATATTCTCAACATACAATGGTTTTTCGAATTGACCAATCTTACCTAAATCATGAAGTAGCGCAGCAATGATTATGGAATCACCAAGTCTACACATTTCTTCAAATCCAAAAATCGTAACCGCTGTATCATTCGCGATATTGTACACATTTAAGCTATGTTCGGCTAATCCACCTTCTTTGGATAGATGATTACCTCCACTACATGGAGCTGTAAAAAAACCGTTCTTATCCATATGATTAAACAAGTTACTGATACCTTCTCTACCGGTACTCATAAGAAGATTGAATATCTTCTCCCTTACTTCTTCCTTGCTAAAATCATTCTTATATCCCATTCGAATATACCTCCTTATCATGTTTTCCGTAGTCGTTTGGAACAACTACAAAACCTAATCTACTTTCAGCACAATCGCACTTCTCACCGTGGTCCAAATTGGCTTGGCAGTGATGGCATGTCCAATAATCTTTCATAACTTTATATCTCATCGTTAAACCACTTAATGCATTCGGTACATAAGGTAATATCATTCAGAGTAACCATTCTCTTTTCTTTACCTAGTTTCTGGCCACATACTCTGCAGGTACACTCATCTTGATGCTTTTTAAGTACTATGCTGTCATAGTCCACATAAATTTCCAGTGGATCTCCTTCTTTGATGTCCTGTAATTTTCTCATTTCCATTGGAAGAACGATTCTTCCTAATCCATCGATATGTCTAACAATTCCTGTATTCTTCATTTACAGATCCTTCCCGGCATGTTATAATGCCTTTAGATTATTTTTGTTTGTGCGCTAAGCTTGGACGGCCATCCAGCTTGGCGTTTTTCATTTATTAACAGCTCCTGTGTTCATTTTTATTTTCTTCCTCATCCTAGAGTTACACCATGGACACGTGTAATTATCTTTTGTGTATCCTAAATGTGCTGAAACATTCCAGGGCACGCCACAATACCTACACCAAAAATATGTAGCCAAGTTATTACCTCCTTTTTTAACGGAACTCTAATTCCCATTTCACTTCCCAATAAAGCCTTTCAGATAACTTTCTAGCCAACCTACCATGCTGATTGTTGATTAGGAATCTTACAAGCTTACGTTTCACGTTCTCACCATCCTTTCTTATTGTAATAATTTGTCGTTTCTCCTATAATGTACTTACAGGCCCTGCCAAGCTGAGTACATAAGAAAGGAGAAATACTTATGAGTAAAACTGATAAAGAGCTGACTGCTGAAATTATAATTGCTTACATAAATGCTAATCCAACGCAAGCTTCGTATGTCGGCGGCAATTCGCACAGCAAAGTAGAAAAATTCGTAAATATTGATGGATTATGCAATGGTATTAAAGCCATCCATTCGACACTTGCTGATTTAGAGAAACCATCAGTTGAATGATTTCTGCTATCGTAACTATGTTTTTTCTTATCTGTTCAGCATCAACATTAATGTTTTCAAGATTCCATTCTGCTAGGGCATTGATTTGTTCTAGCAGAACATCTTTAAGTGTATTAATATCTATTCCACTATTATCCATCTCTTCCACCTCCTCTCAGCTTGTCCTACGCTTAACTGCCTAAGCAGTTTTATCATCTGTCGGCTTTTCATTACTAGCAAAAACCAATTCTGTTATCTCATATCCTTCTTGCTCGCAATATAATTCTATAAGAAGCCTCATTGCCTTCTTAGCATCAGGTTTTCCAACAAATGTGAAGGTTGGTTCTTTCAAATCCGCTTCCTCCTATTCTCTTTGTACAATGTATGTAGTACTGGTTGTACATGTTGCGAAAAATATTTGTACAAAGCTTTCCTTGACTAAAAGTATATTGACTGTTAAAATATCAGAACATTTGTTCTTATGAAACAAGTTTTTCACGGATAAACAAATAGTCATAATCATATTCCGGAAAAAATACTTTTTTAATTTTGTACGCTTCGTCAAAATAAAATCCGCAGTTTACAACTCCATTTAATTTATCACTCATCGTTGCTTGTCTACATTCAAGTAAATTCGCTAACTGCGTTGATGTAATACCTTTCGATTTCATAGCGTTTAATAAATTTGCATACAACATAACAGTGCTCCTTTCTATTTTTGTTTGCGGTAGTCCGTAAACTATATCTAAAATATACTCGGTTTACCGCACGTTGTCAATCCAAAATTCGCGTATGTCCGCATTATTACGGTTTTCCGTAAATTTAGTATTTACAAATCTTGTTATATCGTATATAATGTTGATATCTTAGATAAGAGGTGTCATTATGCGTAAAGCAAAAGTATTAGAAAAATTAATTAAAGAAAGCGGTTATACTGTAAGGGCATTCGCACAGAAATGCGGTATCCCAGAAAGCACTCTTTACACTATTTTAAAAAATGGCGTGGGAAGAGCTACAATGGATAACATTCTCACAATTTGTAGAAACCTTGGAATCAAGGTTGAAGATCTTGAGCGTATGGCTGAAGGCGAAGTAGAGGAATCGCAACCCTCTTATGATGAATTAATCACAGTTTACACACGAAGTAAGAAGAATCTGTCACAAGAAGAGAAAATGAGGTTGGCTAGAATAATACTAGAGGATGACGAAGATTGACAAATAACGAGATAATAGCAGGAACATTATTCGTGTTTAATCATTGTAGCATTAATAATTTTCCATTAGACTGTTGGAATATTGTTAATCAATACGGATTCAAAGTTAAGAAATACTCAGAGCTGAAGCCAAAGAAGCTAGAAGCTTGCTTGGAATTAAGTGAAGATGCAAACATCATAGGTGATACGGTATATTATAACGAGAATAAAAGTCATAATAGAGTTCGATTCTCATTAATGCATGAATTAGGTCATATAGTCTTAGAATCTAACGATGAATCAGAATGTGATAAGTTCTCCAGCAATACTATAGCTCCACGCATGGCTATACATTATTCCAAATGTAGGAATGCCAATGATGTTGCAAAGATTTTTATTTTGTCTGACCAAGCAGCTAACATAGCTTTTGATGATTATAGGCGCTGGAGACGAAATGTAACAATATATGGAATGTCAGACCTGGATAAACAGATGTATGAACATTTTTATAATGATGATGCTAAAAAGTTTGTATGGTCTTTTGAAAGATGTGACTTTTGTTTCACAGATTATGCTTACAATGAACAATCATTATGCGAGACTTGCAGGAGATTCGAGATTAGAAAACTGATAAAAGCACAAAGAATCGATGGTCAAGAACGTAATTTAGACCTATCCAGAAGTAATTGGCTATATGGTCAATTATAAAAACATACCATGCACTACCATGACCAAGAGTATGTCCTAATAAATTCAAAAATATATTTAATACTATACGAAGGGTGGTCATCTTTATGGAAAACTATGGAGTAGCAGCGCCAAGGAAGAAAAATAAAGGCTGTTTAATTGGAATAATAGTTATTATACTGTTCCTTGGAGGATTAGGATTTGGTTTTTATCGAATAGTTCAAAATCCTGAAGAATATGGGGCTAAAACGAAAAAATCTGAATTGGCAACATTGCTTGATGTTTCAGATGAACAGGAAGCTAATATTTTAAAGATATTTAAAGAGTGCGGTATCGATGATGTGAAGTCTGTAAAACCATTTAATGCAGGCGAAAAAATGTCATCTTATTCTCTTTCAAGTGCTGACACATCGAATATTGTTGTTTGGGTATCCAATGACAAGAAAGAAGTTCAAGAGATTTATTTTAACGATTATGACATATATAAAGATGGTAAATGCGTTTCAAAAATAACAGACTATATATTAACCAAGGATGAGAAGACAACATACCAAACAGCATCACAATTACTGATAAAAGACACGCTGACAGTTCCTTCATCTGCAAAATTTCCATCTATATATGATTGGAAGTTTGGAAAAATCGACGGTGTTATAATAGTACAATCATATGTGAATAGTAAAAATGCGTTCGGAGTGGAGATAAAGAACGAATTTCAAATTAAATTTGATGAAAATGGCAATCCAATATCAATCATAATCAACGGTAAAGAACTACTACAATAATAAAAAACAGCCCCAGTGCTACCAACACCGGGACTGCTCATAAGATATTATACTAGGTGAACTAAATATAATACCTGTTCCAACAAACATATTATATCAGCTCACCCGGTAAAAAACAACACTACCGGGCATTTTTATGCCCATTTACAGCATAGAATTCAATAGAAAGGGTTGATATAATGATTAGAGTAGCTTTATACATTCGTGTAAGTACAGAAGAGCAGGCACTTCATGGTTTCTCTCTGGAAGCTCAAAGAGAAGCTCTGACTAAATATGCCAAAGAACACGACATGGAAATCACCGGAGTATATATCGATGAAGGTATTACAGCAAGAAAAAAGTACAATCGACGTAAAGAGTTTATGCGTCTAATTGAAGGAGTGAAAGAAAAGAGCTTTGATCTTATCTTATTCACCAAGCTAGACAGATGGTTTCGTAATATTTCAGACTATTATAAAATTCAAGAAACATTAGAAACCTATGAAGTAAATTGGAAAACTATATTTGAGAGTTACGATACAAGCACAGCTTCTGGAAGGCTTCACATAAACATTATGCTTTCCGTAGCGCAGGATGAAGCAGATAGAACTAGTGAACGTATCCGATCAGTCTTCGAAAATAAGATTAAAAACAAAGAGATTGTTACGGGTAAGCAGCCGTATGCATACGTTATAAATAATAAAACTATCGAAGTTGACGAAGATAAAGCTGAATTGATCCGTGACATTTTCAAATATTACAATGATGTACGTTCCGTAAATGCTACTATGAAGCATATGAATCAAAAATACAATCTTCATAAAAGTTATGATTTTTATCGCGAAACTTTACGTACACGTAAATATACAGGTGATTTTAGAGGTGTTCCTGATTATTATCCTCAAATCATCTCTACAGATCTGTTTGATAGTGTTCAGGATTCCAAGTCAAGTTATATCCGCGAAAACCAAACAATGCGCCTTTATATCTTCAGTGGGCTTATTCAATGTAAAGAATGTGGACTAAAAATGAATGGAATGCAGACTGTCACTTCCGCAAATAAGTTCATGACCTACAGATGCCGAAACGCTGCTACTTACCACAGGTGTTCAAATAGATTATCCATACGTGAAGATAAGATTGAACTTTACCTGCTCAATAACATCCAATTACTTTATGATACTCATGTAGAGAAATTAAGAATACAGAAAAAGAATAAAAAGAAGAAACACATTGATAAATCAAAAATAAAATTGAAATTAAAAAAATTGAAAGAACTATACGTAAATGATTTGATTGATATAGATGATTATAAAAAAGATTATAATGAATATATGAAAATACTATCTGAAGCAGATCTCGAAGATACCGCGGTAACTATCCAAGAAGATACCGAAAAGATAGGAATAATGATTGCTGACAACCAACTCGACACATATAACAAATTAGATCGAAAGAATCAACGTAGATTTTGGAGAGGTATTATTAAAGAGATAATTATTGATAATGAACACAACATTGATGTCGTGTTTTAAGTTTACCCAATGCTATACTACCTAAGCTATACGTGAAGGTCTGCCAGCCATAGTAACAATTATGCTTTCTCTTGGTGTGCAGAGGATGGCTAAGAAGAATGCAATTATCAGAAAATTACCTGCTGTAGAAACACTTGGTAGCGCAACCTTTATCTGCTCTGATAAAACTGGAACTCTGACTCAAAATGTTATGACAGTAACTGATATAGCTTCGATAAAAGGTATGGAACCAGAAAATAAAGAGTTTGGGAACCAGCTACTAGAATATGCTGCTTTATGTAACGATTGCTATCCTTCCAGTAATTCCTCCGAAATCATTGGGGAACCTACGGAAAAGGCGTTATTAGTTGCAGCAGTAAGAAATGGATATGATAAGAAAACTCTTGACAAAAAGCTTCCAAGAATCAGAGAAATTCCTTTTGATTCTGCAAGAAAATTAATGACAACAGTACACCAAGTTGCAGATGGGCGTTTTCTCATTATTACAAAAGGTGCATATGATGTCTTACTCCTACATTGTAACAAGGTATATAATAATGGTGAAATAGAAAATTTTTCTCAGTCACATAAAGCAAAATTTGATCGAAGCAATCTCTTAATGGCAGAAAAAGCTCTTCGTGTTATCGCTGTCGCTTATAAGTATGTTGACCGTAATCCAAACCAGATGACGGATTCTTCCCTAGAACAAGATCTAACATTACTTGGGCTACTTGGTATGATTGATCCTCCAAGAGAAGAAGTAAAAGGTGCCGTTTCCATGTGTAAATCCGCTGGAATCACCCCTGTAATGATTACGGGAGATCATATCCTAACAGCTTGTGCAATTGCAAAAGCACTTGGTATAATTACGGAAGCAGAAGCCGAAAGTGTAAAGCCCCAACAAAGTAAGCTCTATGGAAATAAAAATCGTGGTGAAAATTTCAAGGCTTGTGCTATCACTGGGGAACAATTAAGTCATATGTCTGATAAAGAATTAGAAGAAAATATCTATCAATATAAAGTATTTGCACGTGTTTCCCCCGCTCACAAAGTACGTATTGTAAAGGCATTGCAAAAGCGTGGTGAAGTTGTAGCAATGACGGGTGATGGTGTAAATGACGCACCTGCATTAAAGGCTGCTGATATCGGATGTGCTATGGGTAAAGGTGGGACAGATGTTGCCAAAAATGCTGCTGATATGATTTTAGCAGATGATAACTTTGCAACCATAGTTGCCGCGGTAAAAGAGGGACGAGGAATTTATGATAACATTCGTAAATCCATACACTTCTTACTTTCCAGCAATATAGGAGAAATAATCACCATATTTATTGCAATTTTATTTGGCCTTCCAGCACCACTACTGGCGGTTCAGTTACTTTGGGTAAACCTTGTTACTGACTCGCTACCTGCGATCGCACTTGGTGTTGAACCAGCGCCGGATGACATCATGAAAAAACCTCCGATTTCTCCAAAGAAAGGAATGTTTTGCGATGGTCTTGTCTTTAAGATAATATTTGAAGGTGCAATGATTGGTTCACTTGCTCTTGTTGCTTATACTTTAGGTGGAAGAACGATGGCATTTACCGTATTAAGCCTTTCACAGCTCTTTCATGCTTTTAACATGCGAAGTGAGCACTCAATCTTTAAAATTGGAGTCTTTCGCAATAAGCAGATGGTGCTATCCTTCCTTGTATGTTCCTTTTTGCAAATCGCAGTTGTTTCCTACGAACCTCTTACGAAAATTTTTAGAGTTACACCAATGTTGCCATTCCAATGGGTTATTGTTTCCATTCTTTCCGTCATACCAATTATTATTGTAGAATTACAAAAAGCTGTATCAAGTAGAGCATGA
Protein sequences of DBSCAN-SWA_3 >NC_010001|3649174:3659730|3652071_3652677_-|WP_012200999.1|DBSCAN-SWA MGYKNDFSKEEVREKIFNLLMSTGREGISNLFNHMDKNGFFTAPCSGGNHLSKEGGLAEHSLNVYNIANDTAVTIFGFEEMCRLGDSIIIAALLHDLGKIGQFEKPLYVENILQSGKRSTAKPYAQSSDLLKVDHEVVSVVEASRFIQLTEEEQFAILYHNGMYVGLGRSLKDNETPLQILIHFADMWASRVTEEDGESND >NC_010001|3649174:3659730|3656561_3657896_+|WP_012201005.1|DBSCAN-SWA MIRVALYIRVSTEEQALHGFSLEAQREALTKYAKEHDMEITGVYIDEGITARKKYNRRKEFMRLIEGVKEKSFDLILFTKLDRWFRNISDYYKIQETLETYEVNWKTIFESYDTSTASGRLHINIMLSVAQDEADRTSERIRSVFENKIKNKEIVTGKQPYAYVINNKTIEVDEDKAELIRDIFKYYNDVRSVNATMKHMNQKYNLHKSYDFYRETLRTRKYTGDFRGVPDYYPQIISTDLFDSVQDSKSSYIRENQTMRLYIFSGLIQCKECGLKMNGMQTVTSANKFMTYRCRNAATYHRCSNRLSIREDKIELYLLNNIQLLYDTHVEKLRIQKKNKKKKHIDKSKIKLKLKKLKELYVNDLIDIDDYKKDYNEYMKILSEADLEDTAVTIQEDTEKIGIMIADNQLDTYNKLDRKNQRRFWRGIIKEIIIDNEHNIDVVF >NC_010001|3649174:3659730|3652666_3652828_-|WP_157668740.1|DBSCAN-SWA MRYKVMKDYWTCHHCQANLDHGEKCDCAESRLGFVVVPNDYGKHDKEVYSNGI >NC_010001|3649174:3659730|3657957_3659730_+|WP_012201006.1|DBSCAN-SWA MLSLGVQRMAKKNAIIRKLPAVETLGSATFICSDKTGTLTQNVMTVTDIASIKGMEPENKEFGNQLLEYAALCNDCYPSSNSSEIIGEPTEKALLVAAVRNGYDKKTLDKKLPRIREIPFDSARKLMTTVHQVADGRFLIITKGAYDVLLLHCNKVYNNGEIENFSQSHKAKFDRSNLLMAEKALRVIAVAYKYVDRNPNQMTDSSLEQDLTLLGLLGMIDPPREEVKGAVSMCKSAGITPVMITGDHILTACAIAKALGIITEAEAESVKPQQSKLYGNKNRGENFKACAITGEQLSHMSDKELEENIYQYKVFARVSPAHKVRIVKALQKRGEVVAMTGDGVNDAPALKAADIGCAMGKGGTDVAKNAADMILADDNFATIVAAVKEGRGIYDNIRKSIHFLLSSNIGEIITIFIAILFGLPAPLLAVQLLWVNLVTDSLPAIALGVEPAPDDIMKKPPISPKKGMFCDGLVFKIIFEGAMIGSLALVAYTLGGRTMAFTVLSLSQLFHAFNMRSEHSIFKIGVFRNKQMVLSFLVCSFLQIAVVSYEPLTKIFRVTPMLPFQWVIVSILSVIPIIIVELQKAVSSRA >NC_010001|3649174:3659730|3654602_3654932_+|WP_012201002.1|DBSCAN-SWA MRKAKVLEKLIKESGYTVRAFAQKCGIPESTLYTILKNGVGRATMDNILTICRNLGIKVEDLERMAEGEVEESQPSYDELITVYTRSKKNLSQEEKMRLARIILEDDED >NC_010001|3649174:3659730|3655704_3656382_+|WP_012201004.1|DBSCAN-SWA MENYGVAAPRKKNKGCLIGIIVIILFLGGLGFGFYRIVQNPEEYGAKTKKSELATLLDVSDEQEANILKIFKECGIDDVKSVKPFNAGEKMSSYSLSSADTSNIVVWVSNDKKEVQEIYFNDYDIYKDGKCVSKITDYILTKDEKTTYQTASQLLIKDTLTVPSSAKFPSIYDWKFGKIDGVIIVQSYVNSKNAFGVEIKNEFQIKFDENGNPISIIINGKELLQ >NC_010001|3649174:3659730|3653685_3653877_-|WP_041703723.1|DBSCAN-SWA MDNSGIDINTLKDVLLEQINALAEWNLENINVDAEQIRKNIVTIAEIIQLMVSLNQQVSNGWL >NC_010001|3649174:3659730|3649174_3649600_-|WP_012200995.1|DBSCAN-SWA MNKVMMICRLTKDPEIRYGQANGTAIASYGIAVNRRFKRDGEPDADFFNCVCFGRNAEFAEKYLAKGSKIAIVGSIQNENYTNKDGQKVYSTKILVDEMEFAESKSSGSDYQQSSRPAPSEASGDGFMNIPDGIDEELPFN >NC_010001|3649174:3659730|3654928_3655618_+|WP_157668742.1|DBSCAN-SWA MTNNEIIAGTLFVFNHCSINNFPLDCWNIVNQYGFKVKKYSELKPKKLEACLELSEDANIIGDTVYYNENKSHNRVRFSLMHELGHIVLESNDESECDKFSSNTIAPRMAIHYSKCRNANDVAKIFILSDQAANIAFDDYRRWRRNVTIYGMSDLDKQMYEHFYNDDAKKFVWSFERCDFCFTDYAYNEQSLCETCRRFEIRKLIKAQRIDGQERNLDLSRSNWLYGQL >NC_010001|3649174:3659730|3654196_3654412_-|WP_157668821.1|DBSCAN-SWA MYANLLNAMKSKGITSTQLANLLECRQATMSDKLNGVVNCGFYFDEAYKIKKVFFPEYDYDYLFIREKLVS >NC_010001|3649174:3659730|3652817_3653081_-|WP_012201000.1|DBSCAN-SWA MKNTGIVRHIDGLGRIVLPMEMRKLQDIKEGDPLEIYVDYDSIVLKKHQDECTCRVCGQKLGKEKRMVTLNDITLCTECIKWFNDEI >NC_010001|3649174:3659730|3650328_3651438_-|WP_012200997.1|DBSCAN-SWA MNELQVIVTQKPAEISFNFDEIKQSLSEQMEIYKSMEVTEEVLAERKKDIATLRKIAKAIDDKRKEVKSNYMIPYEEFEKKAKELVEIINEPIELINKQVKEFDEKQKAEKRQKAFDYYLQKMGGQSETLEFEEVFKDSWLNVNTSFKSIKNDIDTALSDRLADLEAIAARKSEVEEKAIAVYKQTKKLSDAMKIINDYDIQKKAILEAEERRIRAEEERRKREEERKAREEQERLSREQRERELAAERERQSEIDRIRAEERAKVQEEVCRQQEEIRRREESQRVIEKEVSPFPIVEEVDFPGSPFPVIADETPFPAEEVKKEESVFEVKHTPVSQVTTVRYMVKGTEEQHKQVEKILSELGVQFAKL >NC_010001|3649174:3659730|3649599_3650307_-|WP_085953465.1|DBSCAN-SWA MIYKAISSVMEDIGAISKNSRNQQQGFMFRGIDAVMNAINPALIKHKLFVVPEILEQTREERTTTKGGNLIYSICKVKYTFYAEDGSNIEAIVIGEGMDSGDKATNKAMSIAFKYACFQVFCIPTEEMKDPDEEFHDVNPRGKTNAKPEQKNNKPKPENKPKDSITEKKQEEQEPSTELDAIKMAELNAELARTGASVKSMLMFYKIDSVEKLTPENLEDAIKRLKPLRDKKVGA >NC_010001|3649174:3659730|3651440_3652079_-|WP_012200998.1|DBSCAN-SWA MINSFERDNFYMLSWLDQFMTGYKGFIAGGSFKNIFNHEKVKDLDIFFESKVDFDSAVEHFDKMCGENNTGGKYHFYYENKNVKAYKHVETGITLELCCKVFGSARNIISQFDFTITKFAYFKEQVKDAESDDNHIEYMIACDDKFFEHLHLKRLVTDDKILYPMSTFERMIRYIKYGYMPCKETKLRIASAISELSKEQIDISNSLYEGVD >NC_010001|3649174:3659730|3653916_3654069_-|WP_157668741.1|DBSCAN-SWA MKEPTFTFVGKPDAKKAMRLLIELYCEQEGYEITELVFASNEKPTDDKTA |
15 | uncultured_Caudovirales_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
3877668 : 3885061
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_010001|3877668:3885061|DBSCAN-SWA TATGGATACGCTTGAGAATATGAAAAATGCGATTAATTATATTGAAGATAATCTTGAAGCTGAAATTGATTACGTAAAGGTTGCCCAAATAGCATTATGTTCCCAATATCACTTTCAACGCATGTTTGCTTTTCTTATCGGCGTACCGCTATCCGAGTATATACGCCGTCGCCGATTAACGCTTGCTGCCTTTGATTTGCAGAATAGTAATGAAAAAATTATCAATCTTGCTTTGAAGTATGGCTATAACTCGCCTGACTCGTTTTCTCGGGCATTTATGGCAATGCATGAAGTTACACCCTCAAAAGCAAGAGAAAAGGGCATATCGCTAAAAGCGTATCCTCGTGTAACGTTCTCCTTATCAATAAAAGGAGTGGTTGAGATGAACTACAGAATTGAGCAAAAAAATTCATTTACTGTTGTAGGTGTAAAACAGAGGTTTTCACATATTAATGGTTTAGGTGAAAGTATTGGTAAAATGTGGAGCGAGACACCACAAGAAACTATTTCACAAATTGCTGGACTTGGAAACGGGTTAGTAGGCGTGTACAGTGGAATGTATGAAGATAATACAACTGATTACTATATTGCCGCGATAACTGAAAGCGATTCCCCAGAAACTTTGTGCAAGCTTGAAATACCATCTCTTACATGGGCAATATTTGAAATAATCGGCCCTATGCCTACTGCAATGGCAGAAATATGGGGACGAATTTTTTCCGAATGGTTTCCTACATCTGGATATGAGCATGCAGAAGCACCAGAAGTAGAATGGTACTCAAACGGTGACTTGAGTTCGTCTGATTATAAAAGCGAAATATGGATACCCGTTATTAAAAAGTAATGTTCTGTGTCTAATAAGCATTATGCTAAGTAATTATGTGTTACATAATATTTCGCTAATGCAAATCAAAAGCAATAACTTCTAGAATAAAAAAGCACACATCCCATATAACAAGAATGTGTGCAACCATGTGTGCAGATAATAACAGAAGCCGCAAACCTCCTATTTATCAGTATTTGTGTGTGCGGGGATTTGTTCGAATCCCGTCTCGTGCTTACTTTTAAAATGATGGAAGCCTAGTAAATTGAACTGACCCCTGTCAAGTAGACAAGTAAAATAATTAAAAATGTATTAGATAATGCGATCACATTTGTGGTCGCATTTCTTATGCGATTCGTTCCTTTTTCTTTGCTTCTAATTCTGCTTTATAAGCAAGAATTCTTTTGTTTTCTGGTATTGGATATTTAACTGGAAACTCTGAGTTGATAGCCGCCTTCCGAACTTCCATTGGTGCTAAGTTACTAAAGCGTTCCTGATATCGTCCATTGTTATAGTATTCTATATATTCAGCGATTGCCTGACAAAGTTCGTTTTCATCAGCGAAATCTGATATATAATACATTTCAGATTTGATAATTCCCCAAAGACCTTCTGTTGGACCATTATCAATACAGCAACTAACACGAGACATAGATTGAGTCATTCCTTGATTCAAAAGTTTCTTCTGAAAAACTTTGTTTGTATTTTGAAATCCTCTATCACTATGAAAGAGGGGTGTGACATCTGGATTAGATTTAAGTGCTAAGTCATATGTATCAAAAACAAGCTTGTTATCATTGTGGTTGCTTAGTACATATGACACTACTGACTTATCATAAAAATCTAGAATGGCACTTAAATGTACTTTCCTATTACTCATAGGCACTTTGAATTCTGTAACATCTGTTCCCCATTTCTCATTAGGTTTTGAAGCCTCAAATTCCCTATGTAAAATATTTTCAGCAGTTGTTTCTGGTGTTGAATAACGATACGGTTTTCGCTGCTTTCTAATTACAGATTTGATGCCATATACACGCATAATACGTTGTATTCTACCTTTACTATAATGCTTGTTTTTATCTCGGTTAATGTAATTACGCATTCTACGATAGCCTAAAGTATGTTTGAACTTTTCATCGTATTCCTTGATCCATAGAGCTATCTTTTCGTTTTCAATCTCTTCTTTTGGCTGTTCACGATTCAACCACTTATAGTATCCTGTACGTGAAACATTTAATTGCTGACACATCCAGCTTATATCCCAATCTTTTTCTGTTTTGAAAAACTGAATAGTAAGATATTTAGGCTCCATTTTTCCCTTGGCTAGAATCGCCTCCTTTCGAATTCCTTCACTTTTTTTAACAGTTCTACAATCATATCTCTTTCTTCAAGCTGGCGTTTCAGACGTTTGTTTTCCCTCCGAAGTTTCTCTAGCTCATCAACTTCTTCATCAGATTTGTGATGACCACGTTTATCTGCCAAGCCCTCTTCACCACTAGCATTGTATTTGCGAACCCAGCTATAGACTTGACTATAAGAAACATCATATTGTTCAGCAGTATTCTTATAATCACATCCATGTTCGATACAGTATGCAACGATTTCCTTACGTTCTTCAATTGTTACTTTTCTTCTTGCTTCAGCCATATAGACCTCCTGTTTCGGATTATAATCCTTTAGTTCCATATCGCTATTATACACTGAAATCCATCTTGCTAAAGAAGAAACCGCACAAATATTATATTTAGCCATGAGTTCTGGGAGCGAGCCATTTCCAGCAATATATTCCTCGACAACCATGGTTTTAAATTCTTTTGTATAACTTTTATTCCCGCTAGTTGAAATAAACGCTTCTGCTCCTTGATTTTGAAATAATAGAACCCATTTACGTACAGTTTCTTTACTAGCAATTTTCAGTTCTGTACATATCTCTTGTAGGTTCTTTGATCCAGTTAAATAATCTTCAACAGCCTTTAATTTTTGAGCTGGTGAAGAAGGTGATTTAGACATAATAAAATCCCTCCAAAGTAGTTTTGGTTATTTACCATGTCTACTTTAGAGGGATCATATCAAATTGCTAGACTTCCTTATTTTTTGCTTTGCTATATCTTTAATTACACGTGATTACACTAACTACATAGTTTTTCTGTATACTCTTTGAAAGCTTCTTCCTCAATTCTCGGTTTTTCCTCTGTAAGAAATGCTTGAGTGATTCTGATTTCTTTTTCTGTGTCTTTGAAAATATCTTACCTGTTATCTTGATACGTTCAAATGGACTTTCTTTTATATAATCTTTATCTACAGCTAATTTACAAGAGACTTCAAACACGCTTCTCTCACATTCAATTTATTTTTCACCTAAAATCATGCCTTTTTCGATTTTTCTGCCGTTTACAAAATCTTTTGCGGACATTATTTTCCCGCCCTCTAACTGTAATTCTAAAATTATTATTGTATATCCTATTGCAAATACCTCTATGTTATTTCTGATATCTATAACTTCACCGCTTTTTTTATCTTTAATTGCAGACACATCTTTGTAATCCGAATAAATCGCTTTTAATATTTTCAACTTTTTGCCATTAAGCCACGTAAAAGCGGCCGGTGACGGTGAAAGTCCTCGAATTTTGTCATATATAACTTTTGCCGGTATATTCCAGTCAATTTCACACATATCATCATCTATTTTTTTTGCGTAAAAAGTGTCGCTTTCCGGCTGTTTTTCACGTTTTACCGTACCGTTTTGTATTTGATTTATAGCTTCAATCAATAATTTACCGCCGATTTCTGCCAATCTGTCATGAAGTTCGCCGAATGTTTCATGTTTGCCGATTCCGGTTGATTCTTTCAATATCATATCACCTGTGTCAATTCCCTCGTCCATGTACATAATAGTGATACCTGTGACTTTTTCACCGTTCATGATTGCGGCATTTATTGGCGAAGCTCCGCGATATTTCGGCAGAAGTGACCCATGAACATCCACACACCCATATTGGGGAAAATCTAATATATATTTCGGGATTAATTTGCCATATGCCGCAACTATGATAATATCCGGTGCAATTTCTTTTATCAGATCAAAAAATTCTGCACTTTTCAAAGTCTGCGGCTGGTATACAGGAATATTTTCACTCAACGCATATTCTTTCACCGGAGGCGGGGTCAAAGTCATCTTACGCCCGACAGGTTTATCCGGCTGGGTGACAATGCCAATGACTTCGCACTTGTTTTTTATCAGATATTCAAGAGAAACCCTCGCAAATTCAGGAGTACCCATAAACAATATTTTCATGAATATCCTCCTTTCCGTTTTCTTCATTTTTCTCATTTCATTCGTAAAATATAAATAACCTACACTACAACTTCAAATTTGTCGTTCGTAATTGCTAACGATCATGTACTATCGCTTGATTAAACAAAATATACTCTAACCTATGAATCATATCACTTATAAATAATAACATCAATATTTCAAACAAAAATTTTCCAAAAAAAGTTGTAAACAGGCTGTAGACACTTTGTGGACAGTGTATTTTCCTCCTTATTATATAGTGATTACGGAAAGGGGTAAAGTTCTAATCCCATCTCGTGCTTAACTTAAAAGTATCGCAAATTCTTATAAATCAAGGGTTTGCGATTTTTTTTGCTTGAAAAACACGTGTGCAGGATATGTGCAACATCCTTTTACGGACTCAAACCATTATAGTTCTTTGTTCATAAAGGAGCGGTTCTGCTCGATTTCATGGTAATTATAGATATAACCCCGAATAGTTTCCCCATAGTGGATTGACTTTCTAAATTAAGGAAATTATAATTAGTTAAACTATTAGGAGTAAATAACATAATCTATAGTTTTCAAAGGTGAACTGCAGATTATTTATATTTTGAGCAGGAGGAAAAATAATGCTAAGAACAATGTTTCAGATGTATCTCAAGAATAGTGTTGAAGCAGTCGAAATGTATCAAAAGGCTTTTGATGCTAAGTTATTATATGAGAGTAAGAACGAAGATGGAAGCTATCTTCATGCTGAATTAGACGCTTGTGGGCAAGTATTAGCTATATCAGAGGCACTTGATGAGAGGAAAATCGGTAATACTATGCAATTTTGCTTCCACTTTGGTGAAGGCAATGAAGAAATTGTAAAGAGAGCATATGAAGTTTTAAAAGATGGTGCAAAAATTAATCAACCATTGGGACCATGTTTTTTTAGTTCTTGTATGTTTGGTATTGTAGATAAATTTGGAGTTGATTGGTGTATTTTTGTTTAACGAATCTACGTACAAATTATATATCCATATTACTAAGTTCGTTGGTGTTATATATTACGAATCAAAATTTTTTATATAATGGAGAGATAGGATTATGAAAGATACTAAAGTATTAGCTCAAAATAAGACAAGTTGGGATTTCATAGCCGATGAATGGTTTGGATCAACTTCACTACCTACATACGGTCCAACTTTACCTAATGAAGGTACATTGAATTTGTTTGATTCATTAGATAATAAAAAAGTTTTAGAGATTGGTTGCGGCAGTGGACATTCTTTGCTATATACAGCTAAACAGGGGGCTAAAGAGTTATGGGGTTTAGATTTATCTTCAAAACAGATAGAGAATGCTGAAAAACTGCTATCAGAAAATAACGTAATTGCAAATTTATTTGTGTCACCTATGGAAGATAACCCAGGAATACCAGAAAATTACTTTGATTTTGTTTACTCGATATATGCATTTGGGTGGACTACCGATTTAAAACAATCCATAGATTTAGTTCATAAATACTTAAAGAAATCTGGTGTTTTTATATTGTCTTGGGATAATCCATTGATGCAATGTATAGAAGCTGAAGGGAATAAGTATACAATTTTCAGGTCGTATTTGGATGAAGCTACGATTGATTTATCAAAGGGTAATCAAGCTATGAAAATAAAGAACTGGAAATTATCTTCATATATCAATGAACTGGCATCTGCAGGATTTAAAATTGACAAGTTGATTGAAGAAACTGATAAGGATATTCTTGCCCAGGAATATGATTTTACACTAAAATACTATTCAAAGCATAAAGCTAAATTAATTAATACATCGTTTATTATCAAAGCAATTAAATTATAATCGACAAACAGGAAGTTGTAGTATAGATTATTTATATTTTAGGAGGTATTAGCGTTATGGATGATGAATTAAAAATCAAAATGTACTCATTTACGGTGGATTGCAAAGACCCTCATGAATTAGCAAAATTTTATGCAGCGTTGCTCAAGTGGGAAATAATGTTTATCAATGAAGAATGGGCATGTGTATACGCCCCAGGAACCAATCAGGGGACATATCCTTGTATATTGTTTCAACAAAATCCTGAGTATAAACCTCCTGTGTGGCCGGAAGAGCCTGAAGCTCAACAGCAAATGGCACATATAGACTTCGCCGTTAATGATTTAGAAAAAGCAGTTCAATATGCAATCCATTGTGGAGCTACAATCGCAGATGAGCAATTTTCTAATAATTGGAGAGTTATGCTTGACCCCGCCGGACACCCTTTTTGCTTATGCCAAATGAAATCAATTGTCGAGAGTGCCGATTTTGCGTTGTTATAGAAAATCATAAATTGGTTATGTAGACTAAATGCGGCACAAAAAGTCGAGTGAATATAATTCTAATAATATATTATATCAATAGGGGAAGAGAATGGATTGGATTAAAAAATTTAATGAGGTTATAAAATATATAGAAGATAATCTTAAAGGTGAAATTTCATATGACACTATATCCCAAATTGCAGGATGTTCCATTTATAATTTTCAAAGAATGTTTTCATATATTGCTGACAAGCCACTATCGGAATACATTAGAAACAGACGTTTAACACTGGCAGCCTTTGATATTATGAACAGCAAAGACCGAATCATTGATATTTCATTTAAATATGGATATGAGTCTCAAGATGCCTTTTCTCGTGCTTTTCGAAGTTTTCATGGTGTCTTGCCTTCCGCTGCAAGAAATGAAACCGTCCAATTAAAATCCTGTCCAAAACTCTCCTTCCAAATCAATATTAAAGGAGAGAATTATATGAATTATCAAATTGTACAGTTTCCTGCATTCAAGGTAGTGGGAATAACTAACCGTATAAACACTTCAGAAGCATTTAAAATTGTACCGCAGATTTGGGAGAACGCTTGGAAAGATGGAACAATGAACCGCTTTATTGAACTTTTAAAAAAAACAGATTATCGTCCTGCTGGCTTTTTAGGCATATGTGCAGATGGAAAATGGGGAAACTCAGAAGAAATGGATTATATTCTTGCTATCACAAACCATGTAGATGTTCCAGAATGCAACTATGTTTCTCCTCCTGATGGGATGAAAGAATTCTGCTATCCAGCATCTACTTGGGTTGTCTTTGATGCTGATGGGGAACTTCCAAGTGCTGTTCAGAAAATTCATAAGCAGTTTTATTCAGAGTGGTTACCCAATTCAGGATATGAACTAGCAGATCTTCCTGTTATTGAATCTTATATGCAAAACAATCATCAAGAAGTTTGGATAGGAATTAACAAAAATAAATGA
Protein sequences of DBSCAN-SWA_4 >NC_010001|3877668:3885061|3877668_3878511_+|WP_012201180.1|DBSCAN-SWA MDTLENMKNAINYIEDNLEAEIDYVKVAQIALCSQYHFQRMFAFLIGVPLSEYIRRRRLTLAAFDLQNSNEKIINLALKYGYNSPDSFSRAFMAMHEVTPSKAREKGISLKAYPRVTFSLSIKGVVEMNYRIEQKNSFTVVGVKQRFSHINGLGESIGKMWSETPQETISQIAGLGNGLVGVYSGMYEDNTTDYYIAAITESDSPETLCKLEIPSLTWAIFEIIGPMPTAMAEIWGRIFSEWFPTSGYEHAEAPEVEWYSNGDLSSSDYKSEIWIPVIKK >NC_010001|3877668:3885061|3880837_3881785_-|WP_012201182.1|tRNA|DBSCAN-SWA MKILFMGTPEFARVSLEYLIKNKCEVIGIVTQPDKPVGRKMTLTPPPVKEYALSENIPVYQPQTLKSAEFFDLIKEIAPDIIIVAAYGKLIPKYILDFPQYGCVDVHGSLLPKYRGASPINAAIMNGEKVTGITIMYMDEGIDTGDMILKESTGIGKHETFGELHDRLAEIGGKLLIEAINQIQNGTVKREKQPESDTFYAKKIDDDMCEIDWNIPAKVIYDKIRGLSPSPAAFTWLNGKKLKILKAIYSDYKDVSAIKDKKSGEVIDIRNNIEVFAIGYTIIILELQLEGGKIMSAKDFVNGRKIEKGMILGEK >NC_010001|3877668:3885061|3882856_3883609_+|WP_012201184.1|DBSCAN-SWA MKDTKVLAQNKTSWDFIADEWFGSTSLPTYGPTLPNEGTLNLFDSLDNKKVLEIGCGSGHSLLYTAKQGAKELWGLDLSSKQIENAEKLLSENNVIANLFVSPMEDNPGIPENYFDFVYSIYAFGWTTDLKQSIDLVHKYLKKSGVFILSWDNPLMQCIEAEGNKYTIFRSYLDEATIDLSKGNQAMKIKNWKLSSYINELASAGFKIDKLIEETDKDILAQEYDFTLKYYSKHKAKLINTSFIIKAIKL >NC_010001|3877668:3885061|3878836_3879802_-|WP_041703741.1|transposase|DBSCAN-SWA MEPKYLTIQFFKTEKDWDISWMCQQLNVSRTGYYKWLNREQPKEEIENEKIALWIKEYDEKFKHTLGYRRMRNYINRDKNKHYSKGRIQRIMRVYGIKSVIRKQRKPYRYSTPETTAENILHREFEASKPNEKWGTDVTEFKVPMSNRKVHLSAILDFYDKSVVSYVLSNHNDNKLVFDTYDLALKSNPDVTPLFHSDRGFQNTNKVFQKKLLNQGMTQSMSRVSCCIDNGPTEGLWGIIKSEMYYISDFADENELCQAIAEYIEYYNNGRYQERFSNLAPMEVRKAAINSEFPVKYPIPENKRILAYKAELEAKKKERIA >NC_010001|3877668:3885061|3884182_3885061_+|WP_012201186.1|DBSCAN-SWA MDWIKKFNEVIKYIEDNLKGEISYDTISQIAGCSIYNFQRMFSYIADKPLSEYIRNRRLTLAAFDIMNSKDRIIDISFKYGYESQDAFSRAFRSFHGVLPSAARNETVQLKSCPKLSFQINIKGENYMNYQIVQFPAFKVVGITNRINTSEAFKIVPQIWENAWKDGTMNRFIELLKKTDYRPAGFLGICADGKWGNSEEMDYILAITNHVDVPECNYVSPPDGMKEFCYPASTWVVFDADGELPSAVQKIHKQFYSEWLPNSGYELADLPVIESYMQNNHQEVWIGINKNK >NC_010001|3877668:3885061|3883665_3884091_+|WP_012201185.1|DBSCAN-SWA MDDELKIKMYSFTVDCKDPHELAKFYAALLKWEIMFINEEWACVYAPGTNQGTYPCILFQQNPEYKPPVWPEEPEAQQQMAHIDFAVNDLEKAVQYAIHCGATIADEQFSNNWRVMLDPAGHPFCLCQMKSIVESADFALL >NC_010001|3877668:3885061|3879813_3880500_-|WP_012200193.1|DBSCAN-SWA MSKSPSSPAQKLKAVEDYLTGSKNLQEICTELKIASKETVRKWVLLFQNQGAEAFISTSGNKSYTKEFKTMVVEEYIAGNGSLPELMAKYNICAVSSLARWISVYNSDMELKDYNPKQEVYMAEARRKVTIEERKEIVAYCIEHGCDYKNTAEQYDVSYSQVYSWVRKYNASGEEGLADKRGHHKSDEEVDELEKLRRENKRLKRQLEERDMIVELLKKVKEFERRRF >NC_010001|3877668:3885061|3882396_3882762_+|WP_012201183.1|DBSCAN-SWA MLRTMFQMYLKNSVEAVEMYQKAFDAKLLYESKNEDGSYLHAELDACGQVLAISEALDERKIGNTMQFCFHFGEGNEEIVKRAYEVLKDGAKINQPLGPCFFSSCMFGIVDKFGVDWCIFV |
8 | Streptococcus_phage(50.0%) | transposase,tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|