Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_010695 | Erwinia tasmaniensis Et1/99 plasmid pET09, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NC_010699 | Erwinia tasmaniensis Et1/99 plasmid pET45, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NC_010694 | Erwinia tasmaniensis Et1/99, complete genome | 4 crisprs | cas1,cas3f,cas8f,cas5f,cas7f,cas6f,cas3,DEDDh,DinG,RT,csa3 | 0 | 23 | 6 | 0 |
NC_010696 | Erwinia tasmaniensis Et1/99 plasmid pET35, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NC_010693 | Erwinia tasmaniensis Et1/99 plasmid pET46, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NC_010697 | Erwinia tasmaniensis Et1/99 plasmid pET49, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_010694_1 | 631766-631858 | Orphan |
NA
Consensus repeat of NC_010694_1
|
1 spacers
spacers of NC_010694_1
>1.1|631790|45|NC_010694|CRISPRCasFinder AGCATCGCGCCTATCCCTGAAGCGCGCCGCAAAGCGCTGGACGGC |
CRISPR arrays and Neighbor proteins around NC_010694_1
The CRISPR arrays of NC_010694_1 >merge|NC_010694|1|631766-631858|CRISPRCasFinder GTGACTCTGTGCGTGACCTGTCAGAGCATCGCGCCTATCCCTGAAGCGCGCCGCAAAGCGCTGGACGGCGTGACTCTGTGCGTGACCTGTCAG >NC_010694|1|1|631766-631858|CRISPRCasFinder GTGACTCTGTGCGTGACCTGTCAG AGCATCGCGCCTATCCCTGAAGCGCGCCGCAAAGCGCTGGACGGC GTGACTCTGTGCGTGACCTGTCAG
>NC_010694.1|WP_012440311.1|631382_631610_+|DUF2732-domain-containing-protein MRNTETRSFNTDSNALAVLLTDAKKEERKDRALAVSIRLEALAIHITKVGMSGTEAAELLRREATRFENESQELH >NC_010694.1|WP_012440310.1|630973_631315_+|DUF5347-domain-containing-protein MAIEGPTATIPLSPGERLEGLNHIAELRAKVFGLDIEPELERFIKDMRAPRDVNHKQNERALAAIFYMAKIPAERHGVNISDLTTDEKRELIKAMNHFRAVVSLFPKRLTMPN >NC_010694.1|WP_012440309.1|630809_631010_+|DUF2724-domain-containing-protein MLTKEPSLASLLVKQSPAMHYGHGWIMGKDDKRWHPCPSQNELLAGLSTTKQGKSWLLKALRQLFH >NC_010694.1|WP_012440308.1|630292_630802_+|phage-regulatory-CII-family-protein MFDYCVSKHPHFDEACRTFALRHNMAKLAERAGMNVQTLRNKLNPEQPHQITPSEIWLLTDLTEDSTLVDGFLAQIHCLPCVPMNEVAKEKLPHYVMSATAEIGRVAAGAVSGDVKTTAGRRDVISSINSVTRLMALAAVSMQARLQSNPAMASAVDTVTGLGASFGLI >NC_010694.1|WP_012440307.1|629998_630262_+|hypothetical-protein MASEIAIIKVPAPIVTLQLFAELEGVSERTAYRWTTGDNPCVPIEPRKIRKGCKKAGGPIRIYYARWKEEQTRKALGHSRFQLVIGS >NC_010694.1|WP_012440306.1|629290_629869_-|phage-repressor-protein-CI MGIQKNTLEPLTILDRIISVYGFTQKLQLANHFEMSPSSLQNRYTRGTISYDLAAFCSLETGASLRWILTGEGPQFEGSPSITDPKNMDLYTLNNGILDKNSILSIDSNILNKQISKGIAVRAEGKLHFVDQEAPHSDGLWLVDIESANSIRELTILPGRRLHVAGGKVPFECNFDDIRLLGRVVGIYSEIN >NC_010694.1|WP_012440305.1|628262_629291_-|tyrosine-type-recombinase/integrase MAIRKHPSGVGWLSEIYPNGAKGKRIRKKFATKGEALAFEQFTVQNPWQEEREDRRTLKELVDAWYSAHGITLKDGIRRQQAMHHAFGCMGEPLARDFDAQMFSRYRERRLKGEYARSNRVKEVSPRTLNLELAYFRAVFNELNRLGEWKGENPLKNMRPFRTAEMEMAWLTHDQIALLLDECKRHDHPDLETVVRICLATGARWSEAESLKKSHLAKYKITYTNTKGRKNRTVPISKELYESLPDDKKGRLFSDCYGAFRSALERTGIELPAGQLTHVLRHTFASHFMMNGGNILVLQRVLGHTDIKMTMRYAHFAPDHLEDAVRLNPLNHQLNNYTTAIN >NC_010694.1|WP_012440304.1|626925_628248_-|hypothetical-protein MAGYFYDFTKEHSFHGEFWSAPHDNKDRFSAKIEYTPYNGLVLDYCISDSDSPRTCQRLYGVLNTGEPCTLIGSFDFLQGSMHFGKLRVLTGKHYFKAIIFNGIYTEEDSVEYCDIALHGMQEFIHPQGFISQLKYSTKPILSIHGSEWKIDVINNATFSMIGDSLVNIIDCQHEEAFNKFTKDFWSTKKEYPKAFFSIRKNLKFFLRYANTINDSIIKHIDDIWKLTGLFSILLDKPVIPDELNIKFKGKQKNNPCLFSNGIEQRTIDLALSTINHHFLPLNWKQIDMGEVISKWLNMSDEYNPLSVTYQYETGLRTLHQAHADIILYATQLESINLTLSAKNEDKYIGPINKYASIDLKNKLEAIFSKFNKKTIGENITIVRGELAHVGRPKKLMKVMSIDDYIKIGLYLKITITAHLLSQLGLTKEQIERYQSKVAP >NC_010694.1|WP_042958615.1|626624_626819_+|hypothetical-protein MTEFIDTFYLFNLEHEVGSENLKTFQTLADKYSHLLSEAEKEVEEKEAEAFYGIRPSDYEFLTE >NC_010694.1|WP_012440303.1|625326_626058_+|hypothetical-protein MDTVIAFLSLALFIAFIVGLIKPSLVMMPNRKRSSALYLGGCLALSFIGSILWPTEKSQRVAKADVPAVKAEPAPPTFEYADKTLKEYRNELKETRHDIVKDYVNFKSVPASSTDAFYACMSEYSFTKDDALKLGDVLGWCFNHFEKDPQSLNNKINLDTFKGNFSGWDGSYRPLEKLIKASMNDDSSYKHISTVYHLILNKDPYAVVKTTFRGTNAYGGVVKQTVAARVNVRTGEVLSILDN >NC_010694.1|WP_042958617.1|631902_632139_+|hypothetical-protein MKTILKRVGSKSATMPERVKSLYRRFDINHINARRSIGVAAGEGKRVAEVIAVSTSTVCTGHNPSCTPRCNVVAGARR >NC_010694.1|WP_042959250.1|632174_634550_+|replication-endonuclease MKPGGTDDAAWAFPWNAPKKAINPYLDRPEVKPSALSDPIALFAAENEGAKQRRAALSDEAWNRYFYNESRDPVLKEMEQERLTGRARLIHEQHRFNPDLVIIDNVRAEPAFISKPLMQRIAYFQQLDRPKACSRYLRDTITPCLQRLERVRDSQASASFRFMASRDGLDGLLVLAEMNQHQVKRLATLVGAHMSLCLEEAGSALFTADEVKPQEIRRVWERVAAEAMRLDVIPPAFEALRRKKRRRKPVPYELIPGSLARMLCADWWYRKLWQTRCEWREEQLRAVCLVSKKASPYVSYEAVVHKREQRRKSLAFFRAHELVSENGDTLDMEEVVNASASNPAHRRNEMMACVKGLELIGEMRGDCAVFYTVTCPSRFHATLSNGRPNPTWSSATVRESSDYLVNTFAAFRKAMHRRGLRWYGVRVAEPHHDGTVHWHLLCFMRKKERRSISALLRKFAIREDRAELGNNTGPRFKSELINPRKGSPTGYIAKYISKNIDGRGLAGEISKETGKSLRDNAENVNAWASLHRVQQFRFFGIPGRQAYRELRLLAGQAGRAQGDKKAGAPVLDNARLDAVLAAADVGCFATYIMKQGGVLVPRKNHLIRTAYALNDEPGTYGDRGIRIYGIWSPLVAGRICTHALKWKKVRKAVDVQEATADQGGSAAPWTRGNNCPLVENLNKSGGELPDIKTMDEKELQEYLHNMGQKERRELTARLRLVKPKRKKAYKQTISDHQRLQLEAELSSRGFDGSESEIDLLLRGGSIPSGAGLRIFYRNHCLQEDGKWRQWY >NC_010694.1|WP_012440314.1|634697_634886_+|hypothetical-protein MQDYFLESLKLQRIDFFIKLVAASECSEEEKRLAIQWVSELTDELMAKIRSHEYCRSMDVIS >NC_010694.1|WP_012440317.1|635552_636014_+|hypothetical-protein MDTTEQLNGTYFYGGLSNLNAGELFYWIMVDVTAEHFTGATAATGNVIAAAAIYAGRNNVAVSGKLANATPGTSWASIQSRRLLQKYKLPFPLPTIVGNPFKMKIIMTKKLGTFVGRTVPVIGWAIVASDVAIIGWKSVNRYNTIASAEDKIW >NC_010694.1|WP_042958620.1|636007_636307_+|DUF1493-family-protein MVMDDNEKAVFALVEEYNGHWFWLRKRFRLTPATDLNKDFRMAPEDAAELLETFADRFSVDPKEINFGRYFPADNGKAEKPLTIQLLIDSARAGHWIDK >NC_010694.1|WP_012440319.1|636343_637399_-|phage-portal-protein MSKRRNRTRTQSVPQPDNMTSGAASEAFTFGDPIPVLDRRELLDYVECVINDRWYEPPVSVDGLARTFRAAVHHSSPISVKCNILASTFIPHPLLSQQAFTRFAMDYLIFGNAYLEKRISRLGNTLKLEPSLAKYTRRGLDLDTYWYAHYGLNTEPYEFTKGSVFHLMEPDINQEIYGVPGYLSAIPSALLNESATLFRRKYYLNGSHAGFIMYMTDPAQSQQDVDNIRSAMKSAKGPGNFRNLFMYSPNGKKDGIQIIPLSEVAAKDEFLNIKNVSRDDMLAVHRVPPQLMGIIPNNTGGFGDIEKASRVFVRNELIPLQARMKELNDWLGLGQEVIRFAPYNLDLEDGN >NC_010694.1|WP_012440320.1|637398_639165_-|terminase-ATPase-subunit-family-protein MTTTIAPADLDPRRQALLLYFQGYRIARIAEMLGEKPATVHSWKKRDKWGSYGPLDQMQLSTAARYCQLVMKEVKEGKDYKEIDLLARQSERHARIGKFNNGGNEAVLNPNVENRNTGPRKPPKKNVFSDAQIEKLQDIFHSTMFGYQRQWWEAGNKYAVRNLLKSRQIGATFFFAREALIDALTTGRNQIFLSASKAQAHVFKQYIVEFAREADVDLKGDPMTLDNGACLYFLGTNARTAQSYHGNLYLDEYFWIPKFQELQKVASGMALHKKWRETYFSTPSSLTHSAYPFWSGAQFNRGRAKADRVDIDLSHASLAAGRLCADGQFRQIVTVEDAVRGGCDLFDLEQLRTRYSPEDYQNLLMCVFMDDLASVFQLAMLQKCMVDSWEVWDDFEALALRPFGWKEVWIGYDPAKGTQNGDSAGCVVIAPPAVPGGKFRILERHQWRGMDFRAQADAIKTLTQQYNVTYIGIDSTGVGLGVYENVKAFFPQVKEFVYNPTVKNALVLKAYDTMATGRLEFDASHLDIAQSFMSIRKATTSSGNRPTYETSRSEEVSHGDLAWATMHALANEPLQGQAAHTQNIVEMY >NC_010694.1|WP_012440321.1|639309_640173_+|GPO-family-capsid-scaffolding-protein MAKKAKRFRIGVEGATTDGRTIVRSWLEQMAANYDPAVYTAVINMEHIKGYTPDSAFRRFGVVDALDTEEISDGLLKGKLGLYAVINPTDELVTMTGNMQKLFTSMEIRPEFADTGEAYLIGLAVTDDPASLGTEILQFSASAGANPLANRKQHPNNLFTAATETVIEFEDVADDKPSLFSRVSALFSNKQKSDDARFGDVHKAVELVATEQQEFSQRIETALSEQASSLQAQFTEGLSAEVAAREQLQADFSQLQERLSREDGRQDFRPRTPGNGSGNSQDVRTDC >NC_010694.1|WP_012440322.1|640216_641386_+|phage-major-capsid-protein,-P2-family MKNNTRFKLNAYMSVLAEINKINLSALNSKFTVESSIAQTLETKIQESSAFLQAINITPVDEQSGERLGLGIGQTIAGTTDTTQKEREPTDPTYIDGDGYKCTQTNFDTALPYSKLDMWAKFSDFQVRIRDVIVKRQALDRIMIGFNGLKREKTSNRVQNPLLQDVNIGWLEKIRQEKPSQVISQRIDNSGKVVAGNITIGKGGVFNNLDAVVMGAVSEKIAVQYQDDTELVVICGRQLLADKYFPIVNKDQPNTEALAADLIISQKRIGGLPAVRASFFPADALLITRLDNLSIYWQEETRRRSIIDNPKRDRIENFESVNEAYVVEDYDCTCLIENIEMLDQEPEPEAGQMSDAEIARIASVAASVVKAMSESGTPHAQAGTDTAGE >NC_010694.1|WP_012440323.1|641389_642031_+|phage-terminase MTNPFRAHTRFIQAQEAAQRGSNSRHAKGYDLMLLQLNEDRRRLKGIQSNVNKAQVKIEVLPKYAAWVEGVLSVDGAQQDDVIMYVMLWRIDAGDYAGALTIGRHALKHGWVMPIGKRTTSTVLAEEMADAAKAAILAETPFDADLLLQTLESVDGEDMPDQSRARLHKSIGWAQTGNSPVSALNHLKQALQLDERCGVKKDIEQLERKLRNS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_010694_2 | 790320-790471 | Orphan |
NA
Consensus repeat of NC_010694_2
|
1 spacers
spacers of NC_010694_2
>2.1|790374|44|NC_010694|CRISPRCasFinder GCCGGAGGCGGTGAGTATTTGAAGTGCTCTTTTTACACCGAACA |
CRISPR arrays and Neighbor proteins around NC_010694_2
The CRISPR arrays of NC_010694_2 >merge|NC_010694|2|790320-790471|CRISPRCasFinder GAGTGCAGCGAGCAAAGGTGAGGCAAGGCAAAAATTCACGAAAAAGCGCAGCTTGCCGGAGGCGGTGAGTATTTGAAGTGCTCTTTTTACACCGAACAGAGTGCAGCGAGCAAAGGTGAGGCAAGGCAAAAATTCACGAAAAAGCGCAGCTT >NC_010694|2|2|790320-790471|CRISPRCasFinder GAGTGCAGCGAGCAAAGGTGAGGCAAGGCAAAAATTCACGAAAAAGCGCAGCTT GCCGGAGGCGGTGAGTATTTGAAGTGCTCTTTTTACACCGAACA GAGTGCAGCGAGCAAAGGTGAGGCAAGGCAAAAATTCACGAAAAAGCGCAGCTT
>NC_010694.1|WP_012440456.1|789075_790236_+|Na+/H+-antiporter-NhaA MNLFLKKLLKNDATGGVVLIVAAAFAMFLANNDSTRHAYQAMLTLPVQFRFGALDINKDLLLWINDALMALFFLMIGLEVKRELMMGSLKGRERAMFPLIAALGGMLAPGLIYAAFNHQDAQAIHGWAIPTATDIAFALGILALLGSRVPAALKMFLMALAVIDDLGAIVIIALFYTSELSLISLTVAAASIAVLAVLNGCGVRKTSVYLAVGMVLWVAVLKSGVHATLAGVIVGLFIPLKKQEGHSPAIELAHGLHPWVSWLILPLFAFANAGISLSGVSLNGLFSAVPLGITLGLFIGKPLGITLICWLAVKLKIAALPENTRLIDIAAVGVLCGIGFTMSIFIASLAFDGAHEELVTLAKLGILSGSVISALVGYTLLRVKLR >NC_010694.1|WP_012440455.1|788579_788870_+|lipoprotein MKKILVATTLAVLLSGCAQQTFQMKHNQVAAPKQVTTHHFFVSGIGQQKTVDAAAICGGAAKVERVEVQETFVNVLLRVVTLGIYTPREARVYCEL >NC_010694.1|WP_012440454.1|787232_788375_+|molecular-chaperone-DnaJ MAKRDYYEILGVAKSADEREIKKAYKRLAMKFHPDRNQGDKESEGKFKEIKEAYEILTDGQKRAAYDQYGHAAFEQGGMGGGGHGGFGGGGADFSDIFGDVFGDIFGGGRRQQRAARGADLRYNMELTLEEAVRGVSKEIRIPTLEECGVCHGSGAKAGTKPQTCSTCHGAGQVQMRQGFFTVQQACPTCHGRGSVIKDPCNACHGHGRVEKSKTLSVKIPAGVDTGDRIRLSGEGEAGEQGAPAGDLYVQVQVRKHHIFEREENNLYCEVPINFVMAALGGEIEVPTLDGRVNLKVPAETQTGKLFRMRGKGVKSVRGGAQGDLLCRVVVETPVSLNEKQKTLLRELDESFGGPSGEKNSPRSKTFFDGVKKFFDDLTR >NC_010694.1|WP_012440453.1|785208_787122_+|molecular-chaperone-DnaK MGKIIGIDLGTTNSCVAIMDGGKARVLENAEGDRTTPSIIAYTQDGETLVGQPAKRQAVTNPQNTLFAIKRLIGRRFQDEEVQRDIKIMPFKIVGADNGDAWLDVKGQRVAPPQISAEVLKKMKKTAEDYLGEAVTEAVITVPAYFNDAQRQATKDAGRIAGLDVKRIINEPTAAALAYGLDKGQGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATNGDTHLGGEDFDSRMINYLVAEFKKDQGIDLHNDPLAMQRLKEAAEKAKIELSSAQQTDVNLPYITADATGPKHLNIKVTRAKLESLVEDLVTRSIDPLKVALQDAGLSVSDINDVILVGGQTRMPMVQAKVAEFFGKEPRKDVNPDEAVAVGAAVQGGVLAGEVKDVLLLDVTPLSLGIETMGGVMTSLITKNTTIPTKHSQVFSTAEDNQSAVTIHVVQGERKRAADNKSLGQFNLDGIQNAPRGMPQIEVTFDIDADGILHVSAKDKNSGKEQKITIKASSGLNDEEIEKMVRDAEANAESDRKFEELVQTRNQGDQAAHSTRKQLDEAGDKLPAEDKAPIEAALTELNTALKGEDKAEIEAKIQALMEVSTKLMEFAQQQQAAGGAADAAEGAKKDDDVVDAEFEEVKDSKK >NC_010694.1|WP_042958656.1|784895_785093_-|hypothetical-protein MPDKMGPGNTASRGRVKKFFTFWQNNRLLTPCRVAIIIRGLLSGMPRRPPTLLASIWRGSVKWET >NC_010694.1|WP_012440451.1|783462_784050_+|molybdopterin-adenylyltransferase MNTLRIGLISVSDRAANGIYQDLGLPLLEEWLGQALVSPFEIEKRLVPDEQPMIEQAICDLVDERFCHLVLTTGGTGPARRDVTPDATLAVADREMPGFGEQMRQISLNFVPTAILSRQVAVIRKQSLVINLPGQPKSIKETLEGLKDEEGNSQVAGIFASVPYCIQLLEGPYIETNPQIVVAFRPKSARRETNI >NC_010694.1|WP_012440450.1|782313_783267_+|transaldolase MTDKLSSLRQVTTVVADTGDIAAMERYKPQDATTNPSLILSAAQIPEYRKLIDASIAWARDQSSDKDEQVSYAADRLAVNIGLEILKLVPGRISTEVDARLSYDTEGSIAKARSLIKLYNDAGISNDRILIKLASTWQGIRAAEQLEKEGINCNLTLLFSFAQARACAEAGVFLISPFVGRILDWYKANTDKKEYAGSEDPGVISVSEIYQYYKQHGYETVVMGASFRNVAEIIELAGCDRLTISPALLKELAETEGSIERKLSYRGEVKARPAKMTESEFLWQHNQDPMAVQKLAEGIRNFAIDQGKLDKMIADLL >NC_010694.1|WP_012440449.1|781269_782046_-|peroxide-stress-protein-YaaA MLMVISPAKTLDFASPLATERFTQPALLAESQKLINVARKLSPADIASLMHISDKLAVLNAERFNDWQPAFTPDNARQAILAFKGDVYTGLQAETFGEEDFTFAQQHLRMLSGLYGLLRPLDLMQAYRLEMGIKLANPAGKDLYSFWGDKLTTALNEALAQQGDNLLINLASDEYFRSVKPKRLEADIIKPVFLDEKNGKFKVISFYAKKARGLMCRYIIQNRLTKVEQLKKFDLDGYAFDGDTSSNNELVFKRREMA >NC_010694.1|WP_012440448.1|779925_781212_+|threonine-synthase MKLYNLKDHNEQVSFAQAVKQGLGKQQGLFFPLELPEFELTEIDDMLEMDFVTRSSKILSAFIGDEIPPHQLNERLKTAFTFPAPVVDVTDDIAALELFHGPTLAFKDFGGRFMAQMLSYVSGADEQITILTATSGDTGAAVAHAFYGMENVRVVILYPQGKISPLQEKLFCTLGGNIETIAIDGDFDVCQSLVKQAFDDEELKKAIGLNSANSINISRLLAQICYYFEAVAQLPQEKRNQLVISVPSGNFGDLTAGLLAKSLGLPVKRFIAATNANDTVPRFLADGQWTPNATVATLSNAMDVSQPNNWPRVEELFRRKTWRLGDLGYGAVNDETTKAAMRELADLGYLSEPHAAIAWRLLRDGLQDGEFGLFLGTAHPAKFKESVETILERTLPLPDALAERADLPLLSHSMKAEFAELRAFLLKK >NC_010694.1|WP_012440447.1|778992_779922_+|homoserine-kinase MVKIYAPASIGNVSVGFDVLGAAVSPVDGTLLGDCVSVEAAAEFSLRNEGRFVSKLPADPKDNIVYQCWDRFCSAIGQRVPVAMTLEKNMPIGSGLGSSACSVVAGLMAMNEYCNRPLNNNELLILMGELEGRVSGSVHFDNVAPCFLGGMQLMLEENDIISQPVPGFNDWLWVMAYPGIKVSTAEARAILPAQYRKEEIIRHGRYLGGFIHACHTQQPLLAAKLMQDVIAEPYRTKLLPGFAQARQAAADIGALACGISGSGPTLFAVCNQPDTANRMADWLSQHYLQNDEGFVHICRLDTAGARKLG >NC_010694.1|WP_012440458.1|790605_790869_-|30S-ribosomal-protein-S20 MANIKSAKKRAVTSEKRRKHNASRRSMMRTFIKKVYAAIATGDKAAAQNAFNEMQPLVDRQAAKGLIHKNKAARHKANLTAQISKMA >NC_010694.1|WP_012440459.1|791184_792123_+|bifunctional-riboflavin-kinase/FAD-synthetase MKLIRGIHNLRAQHRGCVLTIGNFDGVHRGHLALLAQLCAEGRERNLPVMVMLFEPQPLELFAAEKAPARLTRLREKLRYLEQAGVDAVLCVSFDRHFAAYSAQRFITDLLVNRLGVQLLAVGDDFRFGAGRQGDFLLLQKAGVEYGFDVISTQTFCDNGKRISSTAVRQALAEDNLPLARSLLGRPFSISGRVVHGDALGRTIGFPTANLPLRRTVSPVKGVYAVEVLGLGPRALPGVANIGTRPTVAGLRQQLEVHLLDVTIDLYERHIEVVLLDKIRDEQRFNSLDALKEQIANDVVTARRFFGQSTSV >NC_010694.1|WP_012440460.1|792159_794976_+|isoleucine--tRNA-ligase MSDYKSTLNLPETGFPMRGDLAKREPGMLQRWYDDKLYSIIREAKKGKKTFILHDGPPYANGSIHIGHSVNKILKDIIVKSKGMAGYDSPYVPGWDCHGLPIEHKVEQTIGKPGEKVSAAEFRAACRQYAAEQVEGQKADFIRLGVLGDWDRPYLTMDFKTEANIIRALGKIIGNGHLHKGAKPVHWCLDCRSALAEAEVEYYDKTSPSIDVMFDAVDKDAVQAKFGAAHVNGPISLVIWTTTPWTMPANRAISLHPEFDYQLVQVEGRALILAKDMVDSVMKRVGVTQWTVLGDVQGAALELMGFQHPFLAHVSPVVLGEHVTLEAGTGAVHTAPGHGPDDYVIGQKYGIETANPVGPDGSFLPGTYPTLDGLNVFKANDTIVELLREKGALLHLEKLHHSYPHCWRHKTPIIFRATPQWFISMDQKGLRAQSLKEIKGVQWIPDWGQARIESMVANRPDWCISRQRTWGVPMALFVHKDTEQLHPDSLELMEKVALRVEQDGIQAWWDLDARELMGADADNYVKVPDTLDVWFDSGSTSYSVVDARPEFGGSAPDLYLEGSDQHRGWFMSSLMISTAMKGKAPYRQVLTHGFTVDGQGRKMSKSLGNTVSPQDVMNKLGADILRLWVASTDYSGEIAVSDEILKRSADSYRRIRNTARFLLANLAGFNPETDKVKPEEMVVVDRWAVGRALAAQNDIVASYEAYDFHEVVQRLMQFCSVEMGSFYLDIIKDRQYTAKADGLARRSCQTALWYIVEALVRWMAPIMSFTADEIWGYLPGKRAQYVFTEEWFDGLFSLEDNQPMNDAYWAELLKVRGEVNKVIEQARADKRVGGSLEASVTLYADAQLAEKLTSLGEELRFVLLTSGAEVADYAGAPDDAQQSETVKGLKIALRKAEGEKCPRCWHYTSDIGQNAEHADMCGRCVTNVAGSGEERKFA >NC_010694.1|WP_042958658.1|794972_795482_+|signal-peptidase-II MMSKPVLSTGLRWLWLVLVVIAIDFVSKQWIMNNLMLHESMPVMPFFNFFYAHNYGAAFSFLADKGGWQRWFFAGIAVAIVVVLLVMMYRSKASDRLNNIAYALIVGGALGNLFDRAYHGFVVDFIDFTIGDWHFATFNIADCGICIGAALIVLEGFINPTSKRSEHKG >NC_010694.1|WP_012440462.1|795485_795956_+|FKBP-type-peptidyl-prolyl-cis-trans-isomerase MSDSVQSNSAVLVHFTLKLADGSTAESTRNNAKPALFRLGDGSLSPALENHLIGLSVGGKAAFALEAQDAFGSISPDLIQYFSRRDFVDAGEPEIGAIMLFSGMDGNEMPGVIREISGDSITVDFNHPLAGQTIHFDIDVLEIDPHLEMSNADPVG >NC_010694.1|WP_012440463.1|795936_796890_+|4-hydroxy-3-methylbut-2-enyl-diphosphate-reductase MQILLANPRGFCAGVDRAISIVERALEMYGAPIYVRHEVVHNRYVVNSLRERGAIFIEEIDEVPDGSILIFSAHGVSQAVRAEAKARALTMLFDATCPLVTKVHMEVARASRKGTEAILIGHAGHPEVEGTMGQYNNPQGGMYLVEQPGDVQNLQVKDEDNLCFMTQTTLSVDDTSDVIDALRARFPKIVGPRKDDICYATTNRQEAVRTLARDADVVLVVGSKNSSNSNRLAELAQRAGKLARLIDSAEDIQEAWVKGVSCVGVTAGASAPDILVQQVIQRLNELGGVDAVELIGREENIIFEVPKELRVEVKQLD >NC_010694.1|WP_012440464.1|797147_797969_+|4-hydroxy-tetrahydrodipicolinate-reductase MSNAEIRIAIVGAAGRMGRQLIQAVVLAEGARLGAALVRSGSSLVGTDAGELAGCGALGITLTDDLEAVANDFDVLIDFTRPEGTLHYLAFCRQHHKAMVIGTTGFDDAGKAAIEAAAQDIAIVFAANFSVGVNVVLKLVEKAAKVMGEYADIEIIEAHHRHKVDAPSGTALAMGEAIADAMSWDLKQHAVYAREGFTGEREAQTIGFATVRAGDIVGEHTAMFADIGERVEISHKASSRMTFAKGAVRAAIWLDGRKKGLYDMRCVLNLHDL >NC_010694.1|WP_012440466.1|798411_799587_+|glutamine-hydrolyzing-carbamoyl-phosphate-synthase-small-subunit MTVYSLEDVLIKSALLVLEDGTQFHGRAIGAIGSAVGEVVFNTSMTGYQEILTDPSYSRQIVTLTYPHIGNVGTNSADEESTQVHAAGLVIRDLPLIASNYRNEEGLSEYLIRHNIVAIADIDTRKLTRLLREKGAQNGCIIAGDALDAAVALEKARAFPGLKGMDLAKEVTTPEAYTWLQGSWTLEEELPKAKAESDLPYHVVAYDYGVKRNILRMLVDRGCRLTVVPAQTRAEDVLKLNPDGVFLSNGPGDPEPCDYAITAINRLLETDIPVFGICLGHQLLALSSGARTVKMKLGHHGGNHPVKDHDNNTVMITAQNHGFAVDDSHLPANLRVTHTSLFDHTVQGIHRTDKAAFSFQGHPEASPGPHDAAPLFDHFIELIEAYRSTAK >NC_010694.1|WP_012440467.1|799601_802826_+|carbamoyl-phosphate-synthase-large-subunit MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSNPATIMTDPEMADATYIEPIHWEVVRKIIEKERPDAVLPTMGGQTALNCALELERQGVLAEFGVTMIGATADAIDKAEDRRRFDVAMKSIGLDTARSGIAHTMEEALAVAADVGFPCIIRPSFTMGGTGGGIAYNREEFEEICERGLDLSPTNELLIDESLIGWKEYEMEVVRDKNDNCIIVCSIENFDAMGIHTGDSITVAPAQTLTDKEYQIMRNASMAVLREIGVETGGSNVQFSVNPENGRLIIIEMNPRVSRSSALASKATGFPIAKIAAKLAVGYTLDELMNDITGGLTPASFEPSIDYVVTKIPRFNFEKFAGTNDRLTTQMKSVGEVMAIGRTLQESMQKALRGLEVGANGFDPKVDLNDPEALTTIRRELKDAGSDRIWYIADAFRAGLTVEDVFALTNVDRWFLVQIEELVQLEQQVAQEGVSGLSYDFLRTLKRKGFADARLSALAGVPESEIRQLREQHNLHPVYKRVDTCAAEFSTDTAYMYSTYEEECEANPHQDRDKIMVLGGGPNRIGQGIEFDYCCVHASLALREDGFETIMVNCNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVPVIGTSPDAIDRAEDRERFQQAVERLGLKQPANATVTAIEMAVEKAAIIGYPLVVRPSYVLGGRAMEIVYDEIDLKRYFNTAVSVSNDAPVLLDRFLDDAVEVDVDAICDGEQVLIGGIMEHIEQAGVHSGDSACSLPAYTLSAEIQDVMREQVKKLAFELGVRGLMNVQFAVKDNEVYLIEVNPRAARTVPFVSKATGMPLAKVAARVMAGKTLAAQGMTKEIIPPYYSVKEVVLPFNKFQGVDPILGPEMRSTGEVMGVGRNFAEAFCKAMLGAQSNMKKSGRALLSVREGDKKRIVELARRLQEFGFELDATAGTASVLTAAGIEVRQVNKVHEGRPHIQDRLKNGEYAYIVNTTAGRQAIEDSKLIRRSALQYKVHYDTTLNGGFATANSLNASATEQVISVQEMHAQIVS >NC_010694.1|WP_012440468.1|802951_803566_-|LysE-family-translocator MLETSLFVATIVALGMLSPGPDFFLIVKNAARYRRSAAMMSALGVNCAVASHMAYCVAGLAVVITTTPWLFMLLKYAGAAYLIYIGIQALMSRGNGTMNINNVTLEETSLKKAFLQGYLCNLLNPKATLFFLSIFTQVLNVNSGISEKLLYAGIILGLSAIWWPSLVLLMQSGPVRRGLAKAQRVVDKLLGGVLIALGIKVALS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_010694_3 | 934259-935308 | TypeI-F |
I-F
Consensus repeat of NC_010694_3
|
17 spacers
spacers of NC_010694_3
>3.1|934287|32|NC_010694|CRISPRCasFinder,CRT ATCAGAACCCCGTTCACAATTGCGTGTTTCAG >3.2|934347|32|NC_010694|CRISPRCasFinder,CRT ACTGGTTCGCTGCACGGGTCAAACTCAATTTC >3.3|934407|32|NC_010694|CRISPRCasFinder,CRT ATCAGAACCCCGTTCACAATTGCGTGTTTCAG >3.4|934467|32|NC_010694|CRISPRCasFinder,CRT TCAAGAAAATCAAATGGCCGGACAAGGTAAAG >3.5|934527|32|NC_010694|CRISPRCasFinder,CRT CAGATGACCAGCTAATAAGCCTCTCATCATCA >3.6|934587|32|NC_010694|CRISPRCasFinder,CRT AGTTTTTGGTTTGGTCGCCATATAGAATTATT >3.7|934647|32|NC_010694|CRISPRCasFinder,CRT TTAACCCCGGCACCAATACCGATAGAGTCATA >3.8|934707|32|NC_010694|CRISPRCasFinder,CRT AAAACGTGTTCATGAATCTCGGAACGGCTAGT >3.9|934767|32|NC_010694|CRISPRCasFinder,CRT TGTTTAGCGGTATCTCCGCATAGCGCATGGAA >3.10|934827|32|NC_010694|CRISPRCasFinder,CRT TTGCCGGGTAAGGCAAGGCAATGGCTAAAAGA >3.11|934887|32|NC_010694|CRISPRCasFinder,CRT CATCGTGGACGCCGCCCAGAGCATCACCAGCT >3.12|934947|33|NC_010694|CRISPRCasFinder,CRT CATCAACCTGATGGACTCCATGCTGCCCAAAAC >3.13|935008|32|NC_010694|CRISPRCasFinder,CRT AGTAGGGCGGGATAGTGCCGCATTTAATAGCC >3.14|935068|32|NC_010694|CRISPRCasFinder,CRT TTACCCGATGCTTCAATGAATCCAGACGTACC >3.15|935128|33|NC_010694|CRISPRCasFinder,CRT AGTTTGCGATTAGCCAAATCATGTCAGCAATAA >3.16|935189|32|NC_010694|CRISPRCasFinder,CRT AGACGCTGAACGAGCTTATTAACGTCTTAGAG >3.17|935249|32|NC_010694|CRISPRCasFinder,CRT AAAGACGGCACGTTTTTCACCAAAGACGATTT |
cas1,cas3f,cas8f,cas5f,cas7f,cas6f |
CRISPR arrays and Neighbor proteins around NC_010694_3
The CRISPR arrays of NC_010694_3 >merge|NC_010694|3|934259-935308|CRISPRCasFinder,CRT GTTCACTGCCGCACAGGCAGCTTAGAAAATCAGAACCCCGTTCACAATTGCGTGTTTCAGGTTCACTGCCGCACAGGCAGCTTAGAAAACTGGTTCGCTGCACGGGTCAAACTCAATTTCGTTCACTGCCGCACAGGCAGCTTAGAAAATCAGAACCCCGTTCACAATTGCGTGTTTCAGGTTCACTGCCGCACAGGCAGCTTAGAAATCAAGAAAATCAAATGGCCGGACAAGGTAAAGGTTCACTGCCGCACAGGCAGCTTAGAAACAGATGACCAGCTAATAAGCCTCTCATCATCAGTTCACTGCCGCACAGGCAGCTTAGAAAAGTTTTTGGTTTGGTCGCCATATAGAATTATTGTTCACTGCCGCACAGGCAGCTTAGAAATTAACCCCGGCACCAATACCGATAGAGTCATAGTTCACTGCCGCACAGGCAGCTTAGAAAAAAACGTGTTCATGAATCTCGGAACGGCTAGTGTTCACTGCCGCACAGGCAGCTTAGAAATGTTTAGCGGTATCTCCGCATAGCGCATGGAAGTTCACTGCCGCACAGGCAGCTTAGAAATTGCCGGGTAAGGCAAGGCAATGGCTAAAAGAGTTCACTGCCGCACAGGCAGCTTAGAAACATCGTGGACGCCGCCCAGAGCATCACCAGCTGTTCACTGCCGCACAGGCAGCTTAGAAACATCAACCTGATGGACTCCATGCTGCCCAAAACGTTCACTGCCGCACAGGCAGCTTAGAAAAGTAGGGCGGGATAGTGCCGCATTTAATAGCCGTTCACTGCCGCACAGGCAGCTTAGAAATTACCCGATGCTTCAATGAATCCAGACGTACCGTTCACTGCCGCACAGGCAGCTTAGAAAAGTTTGCGATTAGCCAAATCATGTCAGCAATAAGTTCACTGCCGCACAGGCAGCTTAGAAAAGACGCTGAACGAGCTTATTAACGTCTTAGAGGTTCACTGCCGCACAGGCAGCTTAGAAAAAAGACGGCACGTTTTTCACCAAAGACGATTTGTTCACTGCCGTACAGGCAGCCCAGAAA >NC_010694|3|3|934259-935308|CRISPRCasFinder GTTCACTGCCGCACAGGCAGCTTAGAAA ATCAGAACCCCGTTCACAATTGCGTGTTTCAG GTTCACTGCCGCACAGGCAGCTTAGAAA ACTGGTTCGCTGCACGGGTCAAACTCAATTTC GTTCACTGCCGCACAGGCAGCTTAGAAA ATCAGAACCCCGTTCACAATTGCGTGTTTCAG GTTCACTGCCGCACAGGCAGCTTAGAAA TCAAGAAAATCAAATGGCCGGACAAGGTAAAG GTTCACTGCCGCACAGGCAGCTTAGAAA CAGATGACCAGCTAATAAGCCTCTCATCATCA GTTCACTGCCGCACAGGCAGCTTAGAAA AGTTTTTGGTTTGGTCGCCATATAGAATTATT GTTCACTGCCGCACAGGCAGCTTAGAAA TTAACCCCGGCACCAATACCGATAGAGTCATA GTTCACTGCCGCACAGGCAGCTTAGAAA AAAACGTGTTCATGAATCTCGGAACGGCTAGT GTTCACTGCCGCACAGGCAGCTTAGAAA TGTTTAGCGGTATCTCCGCATAGCGCATGGAA GTTCACTGCCGCACAGGCAGCTTAGAAA TTGCCGGGTAAGGCAAGGCAATGGCTAAAAGA GTTCACTGCCGCACAGGCAGCTTAGAAA CATCGTGGACGCCGCCCAGAGCATCACCAGCT GTTCACTGCCGCACAGGCAGCTTAGAAA CATCAACCTGATGGACTCCATGCTGCCCAAAAC GTTCACTGCCGCACAGGCAGCTTAGAAA AGTAGGGCGGGATAGTGCCGCATTTAATAGCC GTTCACTGCCGCACAGGCAGCTTAGAAA TTACCCGATGCTTCAATGAATCCAGACGTACC GTTCACTGCCGCACAGGCAGCTTAGAAA AGTTTGCGATTAGCCAAATCATGTCAGCAATAA GTTCACTGCCGCACAGGCAGCTTAGAAA AGACGCTGAACGAGCTTATTAACGTCTTAGAG GTTCACTGCCGCACAGGCAGCTTAGAAA AAAGACGGCACGTTTTTCACCAAAGACGATTT GTTCACTGCCGTACAGGCAGCCCAGAAA >NC_010694|3|1|934259-935308|CRT GTTCACTGCCGCACAGGCAGCTTAGAAA ATCAGAACCCCGTTCACAATTGCGTGTTTCAG GTTCACTGCCGCACAGGCAGCTTAGAAA ACTGGTTCGCTGCACGGGTCAAACTCAATTTC GTTCACTGCCGCACAGGCAGCTTAGAAA ATCAGAACCCCGTTCACAATTGCGTGTTTCAG GTTCACTGCCGCACAGGCAGCTTAGAAA TCAAGAAAATCAAATGGCCGGACAAGGTAAAG GTTCACTGCCGCACAGGCAGCTTAGAAA CAGATGACCAGCTAATAAGCCTCTCATCATCA GTTCACTGCCGCACAGGCAGCTTAGAAA AGTTTTTGGTTTGGTCGCCATATAGAATTATT GTTCACTGCCGCACAGGCAGCTTAGAAA TTAACCCCGGCACCAATACCGATAGAGTCATA GTTCACTGCCGCACAGGCAGCTTAGAAA AAAACGTGTTCATGAATCTCGGAACGGCTAGT GTTCACTGCCGCACAGGCAGCTTAGAAA TGTTTAGCGGTATCTCCGCATAGCGCATGGAA GTTCACTGCCGCACAGGCAGCTTAGAAA TTGCCGGGTAAGGCAAGGCAATGGCTAAAAGA GTTCACTGCCGCACAGGCAGCTTAGAAA CATCGTGGACGCCGCCCAGAGCATCACCAGCT GTTCACTGCCGCACAGGCAGCTTAGAAA CATCAACCTGATGGACTCCATGCTGCCCAAAAC GTTCACTGCCGCACAGGCAGCTTAGAAA AGTAGGGCGGGATAGTGCCGCATTTAATAGCC GTTCACTGCCGCACAGGCAGCTTAGAAA TTACCCGATGCTTCAATGAATCCAGACGTACC GTTCACTGCCGCACAGGCAGCTTAGAAA AGTTTGCGATTAGCCAAATCATGTCAGCAATAA GTTCACTGCCGCACAGGCAGCTTAGAAA AGACGCTGAACGAGCTTATTAACGTCTTAGAG GTTCACTGCCGCACAGGCAGCTTAGAAA AAAGACGGCACGTTTTTCACCAAAGACGATTT GTTCACTGCCGTACAGGCAGCCCAGAAA
>NC_010694.1|WP_012440581.1|932443_934072_+|multicopper-oxidase-CueO MQRRDFIKLTAALGAASALPGWSRALTAAEQRPLLPIPTLLTPDARSEISLTAQAGSSSWRGSRVSTWGYNGPLLGPAIQLERGKEVNITVYNRLPEATTVHWHGLELPGNVDGGPQARIEPNRSRRVTFTPDQPAATCWFHPHQHGRTGYQVAQGLVGLVLVNDPESGKLLLPKRWGIDDIPVILQDKRLSADGSRIDYQLDMMSAAVGWFGDTMLTNGAIYPQHGVPRGWLRLRLLNGCNARALNLATSDKRPMYVIASDGGLLGEPVQVSELPMMPGERYEVLIDTADGKAFDLQTLPVRQMGMTLEPFNQPLPVLSLVPLLVQASGTLPDKLVDLPAVPSSQGLNTRWLQLMMDPELDRRGMQALMDKYGHASMAGMSMEAHGGDKKAGAHHDEMPEMDHGGMAGMAGMAGMDHGHSAAKKAYDFHNGNQINGVAFNMDKPSFEVRQGVYEKWTISGEGDEMLHPFHIHGTQFRILTENGKPVAAHRSGWKDTVRVEGGRSEVLVRFDHQADKASAYMAHCHLLEHEDTGMMLGFTVA >NC_010694.1|WP_012440580.1|931925_932273_-|YacC-family-pilotin-like-protein MKKSIKALLLLGLLGCSGSSFAIGEPEAEDLADLTAVFVYLKNDCGYQNIPDSQIRRALLFFAEQNRWDLSNYTSFNMKALGEDSYRDLSGIAIPNDTKCKSLARDSLNLLAWVK >NC_010694.1|WP_012440579.1|930971_931829_-|polyamine-aminopropyltransferase MATNEMWYETLHTGFGQYFSVDKIIYREKTDHQDLVIFENAALGRVMALDGVVQTTERDEFIYHEMMTHVPLLAHGAPKRVLIIGGGDGAMLREVCRHKNIEQITMVEIDAGVVTFCRQYLPNHNAGAYDDARFKLVIDDGVNFVNQTSDKFDVIISDCTDPIGPGESLFTSEFYQGCRRCLNQDGIFVAQNGVCFLQQDEAVNSHRKLSHYFGDVSFYQAAIPTYYGGIMTFAWASDNPALRQLDMATLTARFSEAGLNCRYYNPAIHTGSFALPQYLLNALAD >NC_010694.1|WP_012440578.1|930146_930950_-|adenosylmethionine-decarboxylase MKLQKLKLHGFNNLTKSLSFCIYDICYANTEAERDGYIAYIDEQYNANRLTEILSETCSIIGANVLNIARQDYEPQGASVTILVSEEPIDPRDIDTSEHPGPLPNSVVAHLDKSHICVHTYPESHPEGGLCTFRADIEVSTCGVISPLKALNYLIHQLESDIVTIDYRVRGFTRDVNGVKHFIDHEINSIQNFMSEDMKSMYDMMDVNVYQENMFHTKMLLKEFDLKHYLFNTKPEDLSAQEHKRITDLLWKEMREIYYGRNIPAIG >NC_010694.1|WP_012440577.1|929679_930042_+|YacL-family-protein MEYEFLKDVTGVVKVRMSMGHEAIGHWFNDEVNGHPEILAEVEAAIAGVKGSERQWQRVGREYTLLLDEEEVMIRANQLGFEGDDMEEGMNYYDEESLSFCGVEDFLAIIAAYRAFLLGR >NC_010694.1|WP_012440576.1|926910_929508_+|bifunctional-aconitate-hydratase-2/2-methylisocitrate-dehydratase MLEEYRKHVAERAAQGIVPKPLDASQMAALVELLKAPPAGEEEFLSDLLINRVPPGVDEAAYVKAGFLAAVTKGEATSPLVTPEKAIKLLGTMQGGYNIHALIDALDNDKLAPLAAESLSHTLLMFDNFYDVEDKAKAGNPHAKKIIQSWADAEWFLKRPKLAEKITVTVFKVTGETNTDDLSPAPDAWSRPDIPLHALAMLKNAREGIEPDDAGNVGPIGQIDALQKKGFPLAYVGDVVGTGSSRKSATNSVLWFMGDDIPYVPNKKGGGVCLGGKIAPIFFNTMEDAGALPIEVDVDRLNMGDVIDIYPYKGEVRHHDTDEVLANFELKTEVLLDEVRAGGRIPLIIGRGLTTKARESLGLPHSDVFLQAKDVAASTRGFSLAQKMVGRACGVEGVRPGAYCEPKMTSVGSQDTTGPMTRDELKDLACLGFSADLVMQSFCHTAAYPKPVDVTTHHTLPDFIMNRGGVSLRPGDGIIHSWLNRMLLPDTVGTGGDSHTRFPIGISFPAGSGLVAFAAATGVMPLDMPESVLVRFKGKMQPGITLRDLVHAIPLYAIKAGLLTVEKKGKKNIFSGRILEIEGLPDLKVEQAFELSDASAERSAAGCTIKLGQDPIIEYLNSNIVLLKWMISEGYGDRRTIERRIQGMEKWLADPQLLEGDAEAEYAAVIDIDLAEIKEPILCAPNDPDDARLLSDVQGTKIDEVFIGSCMTNIGHFRAAGKLLDSHKGQLPTRLWVAPPTKMDAAQLTEEGYYSVFGKSGARIEIPGCSLCMGNQARVADGATVVSTSTRNFPNRLGNGANVFLASAELSAVASLLGKLPTPDEYQAFMDRVDKTAVDTYRYLNFDQLTQYTDKADAVIFQTAV >NC_010694.1|WP_042958683.1|925380_926139_-|winged-helix-turn-helix-domain-containing-protein MRAYFEPKNCILSNDTKSIKITIQEARCLEYLIKHEGEFIRREMLQQECWIKRGVTVSDSAVRQSLYRLRRAFEDAGLPNLTLTTQARKGHILQKGSIALIHSGAKTDAYTDNSINPSVLNSINNDACGNFKPHFSVTSLLLIAKLLLLSALLFLAGFYSYQKIMLTDIKYHHSEEKEGRLYFYRKNQTYPQSAIERIHYWLRNKHVNYDNLKFIYLNNAWSGHISFYLCKGEMGSAGSDCTSIMIIGEHHP >NC_010694.1|WP_012440574.1|924913_925381_-|hypothetical-protein MKFFVFFASVIIFISGWYFFPWVLISTMEDCISKKIVIYDEPDRYIISRSTWYSWRDDNEHRYSAQILIDGPQGKLETFSSERVIETEYRFNFDSINLSTIKSFRIAGQLTSDPLTEKYIDPQAKEGFTGLIHLFRYKDNSLLFGFKGIPLSLCL >NC_010694.1|WP_042958682.1|923491_924274_-|enoyl-CoA-hydratase/isomerase-family-protein MNADNPITPLVLCDRPAEHVVRLTLNRPARRNAYNAQMVSELEQWLNWCERQSAVRTVILTGSGEAFCSGADLHEAFTHGGEGLRNSRGGYHPLQHLPRRKIWIAALNGHAIGGGLEMALACDFIVASEDSRIALPEVQHGLLPLGGAISQLAARLPPNIARELLLTGETMEAQRALALGLFNQVVNAERLADTALALAERLNQAAPLAVQACNALLNQALAADDSQQGDRELQQLQRSEDYQESLRAFAARRAPRWQGR >NC_010694.1|WP_042958679.1|922284_923490_+|thiolase-family-protein MAAAQSLLNYQPEDDRQPVIVVACRTPIGRAYGSLASVSPEALLAPLFDRLIAALPGGFTAIDEVIIGNATGGGGNIARLAALAAGVPLTVPAVTVDRQCGSGLEAVINACRLVQARAGECYLAGGVESVSNAPWRVEKPTTLKQMPRFYPRARFSPDEIGDPEMGIAAENVARQCGISRERQDSFALRSHQRALAAAQQGAFLEEIVALDVNHQRVENDECPRPDTSLARLAALPPVFAADGSVTAGNCCPLNDGAALLLVMSRRRARECGFTQGLLFADACSAGVDPNLLGLGPVPATQKLLRRQPGLTLDRVEAIEFNEAFAAQVLASVDALGIDEHRINPQGGAIALGHPYGASGAIMVTRLFSQLVSQRQSEGYGLAMLGIAGGLGLSALFKGMRL >NC_010694.1|WP_012440585.1|935432_935963_+|DUF2778-domain-containing-protein MALHGSFVLNGADYSPLSFPGVGTFMAFSGSGDNRNRAGCAHIPTVGPLPTGKYWIVDRSQGGLLSQSLSASKDLFNKVFRDAQFGHSDWFALWRDDMSIDDWTWINSVRRGNFRLHPGTISEGCVTLYRNSDFALLRNMLLRTPLVDVPCMRNLKARGSIEVSSHAYGDTCPTTR >NC_010694.1|WP_042958685.1|935928_936204_+|hypothetical-protein MRMATPARRLAKTVMFIALFCLFARLIDSSQFIGLATANAFAAWLHGSASQENYDDLWFFVDVTLSVLSAVVAYHMVMLLGRKLRASSGHK >NC_010694.1|WP_012440586.1|936763_937741_+|type-I-F-CRISPR-associated-endonuclease-Cas1 MEMIKPSDLKTILHSKRSNIYYLQYCRVLVNGGRVEYVTDEGKQSLYWNIPIANTTVVMLGTGTSITQAAMREFARAGVLVGFCGGGGTPLYAANEVEVDVSWLNSQSEYRPTEYLQHWVSFWFDEQKRLSAAIAFQRVRISQIRQAWLGSKMMREHKFAISEPHLTGILDRFEQGLARCDNNTDLLALEAVMTKALYKLAAQAVSYGDFVRAKRGGGIDAANRFLDHGNYLAYGLAAVACWVIGLPHGLAVLHGKTRRGGLVFDVADLIKDALILPQAFLAAMAGEGEQEFRQRCLSSLQNAEALDTMIAALEATAREHSQVGK >NC_010694.1|WP_012440587.1|937737_941031_+|type-I-F-CRISPR-associated-helicase-Cas3 MNVLLIAQCNKRALEESRRILDQFAERKGDRSWQTAITQQGLLTLRKLLRKTARRNTAVACHQIKSNGQSELLWIVGNLRRFNAQGAVPTHTTSRDVLKSADENSWHSVEAVSLLAAIAGLFHDFGKANSLFQQMLVGKKGVKRSQPYRHEWVSLRLFCAWVAGRDDRVWIAALSQIEPQDEQAMLAGLEKEGLMDTTNPFAPLPPVARVVAWLILSHHRMPVYPKKNGSSESASYLPPDLEHCDGWLTEQLDALWNAENHHDQGWTPADFKAQWQFPQGTPMRSGLWCGKARKMAQRLLAQPAWLAQIDINQRFSCHMARLALMLADHVYSAQPATPGWQDADCLLYANTDRDSGSLKQRLDEHNIGVAQNALLLARSLPHLRKTLPAITRHKGFKKRSTDERFRWQDNAWQKTCELRDRAFQQGFFGINMASTGCGKTFANARIMYALSDEQKGCRFSVALGLRTLTLQTGDALREKLNLEQDDLAVLVGSQAVTQLHQLAKDNPVSHDTGSESAEALPEENQYISYEGSLDDGRLSRWLQKSPRINKLLSAPVLVTTIDHLIGATEGLRGGRQIAPMLRLLTSDLVLDEPDDFDIDDLPALCRLVNWAGMLGSRVLFSSATLPPALVLALFNAYRSGREIFQHACGLPVDGNICCAWFDENAVLTEELRLPQAFMQQHKEFVANRVSWLAKQPVLRRGWIAPVAPPARDEATIYSHMAQVILQSMMTLHHAHHQRHKELPKTISVGVVRFANINPLVAVAQQLLATEAAEDTHIHYCVYHSRHPLAMRSHFEQRLDATLTRHQSDAIWQVAEIAAALEQHPQQHHLFVVLATSVAEVGRDHDYDWAIAEPSSMRSLIQLAGRVQRHRQEEAQSENIHILQQNICSLKERDSQKPTYCKPGFEQKGYMLASRDLQKILDKEQYQTISAIPRIQSRQKVGKGPLFANLADLEHRRLMVELQGKQKEPNEYCAALWWREQASWCGEMQRRKPFRQSPPEDMHFMLIAEEGDRPEIWQPDDGPSGRKKSMVAYPDLTFAAGVSAWITPDYQQVWQQLAERLTMELEEVSLRFGEIVLRTKPESKEWHFHPLLGAFQAE >NC_010694.1|WP_012440588.1|941108_941387_+|BrnT-family-toxin MDICYDPDKDVKNRRKHGYSLADSALLDWDEMVVYEDNRQPFDEIRLIGLTYGLARLGNRIFSVCFTEHEEVYRIISLRLATRKEIQRYAET >NC_010694.1|WP_012440589.1|941373_941670_+|BrnA-antitoxin-family-protein MPKLKPGTVFPTTEEDAKIYAAVADDEDSMLLEDPQLKLTPLKKRGRPQKAQPKIAVSVRYTPEVISAFKASGAGWQTRMDVALQDWLKTHQPTEIKL >NC_010694.1|WP_012440590.1|942113_943457_+|type-I-F-CRISPR-associated-protein-Csy1 MLRETLASFITSYIAARKTAKLEAFDKESAKKLAVLASEDEISVLRQQLQQQRAELEQKYQPQAWLSDAASRAGQIKLVTHAAKFTHSDVRGSSIFSSGSGQHETYLSTATLQKPALDAVGNAAALDIARLLQSEVEGDSLIASLQRGDYSALESLTDNPELCASWISGFKQVLVDRQPASHKLAKQIYFPIADGQYHLLSPLFSSSLAHALNQRITEAKFSEQAKTARAALKAKSWHDAPVVAYPDTAITQFGGTKPQNISYLNSVRGGKVWLLPCAPPVWQTLSKPPAKHKSIFNSSNDFSRQSWPVIQRMSRFLRRVERLDSTLDIRQQRLAMTDEIIDILFNYVAGIQNQTESIGWSAHPDCVLKRSQQLWLDPWRGDKEFQFEREGGDWKSEVARDFGHWLSRHLHSDKLNMGETERRHFSTAPLFKQRLRELEKDLAEDLP >NC_010694.1|WP_012440591.1|943453_944395_+|type-I-F-CRISPR-associated-protein-Csy2 MSALIVLRHLRVENANAIAGITWGFPAITHFLGFTHALSRKLQQSHNMTLSGCGVICHQQQVHAYTSGRDYQFALTRNPLTKEAKTAAFNEEGRMHMTVSLLMECHGSIAGGEQGAAELKQTLANLCQRLRLAGGTVISIGQVQISGWPQDDGETRKIMRRLLPGFALLDRSALLAQHHDQQPQPEMLDAWLDFAALKMQADDGATPADGNVQWQYQPKPGAGYLVPLMTGYRAISPLYPPGEVANSRDTETPFCFTEAVYGVGEWRGLHRIDDLRHLFWRYHHQDDYYLCRGEETACDQDYPDDADDDINYN >NC_010694.1|WP_012440592.1|944418_945423_+|type-I-F-CRISPR-associated-protein-Csy3 MAKSAIKTASVLAFERKLSNSDAIMLAGKWQDKQNWTPIKIQEKAVRGTISNRLKNAIASDPQKLDAEIQKPNLQRVDVAALPYNCDSLKVCFTLRVLGGLATPAVCNDRAYQAALAAVIDGYIARHGFSTLAARYAENIANGRFLWRNRLGAGRVAVQVTSGEKRWQFDGHNYSLRAFSQPQGDLLELAQAIEQGLSGDSFALFNVEAQVYLGNGQEVFPSQELVLDSNSKKSKLLYQIDDTAAIHSQKIGNALRTIDSWYPDADELDVGPISVEPYGSVTSRGIAYRQPIKKMDFYTLLDNWVTKDKQPDLEQQHYVMAILIRGGVFGEKSE >NC_010694.1|WP_012440593.1|945431_945986_+|type-I-F-CRISPR-associated-endoribonuclease-Cas6/Csy4 MDRYQDIRVRVDAEMTAPVLLAQVFMRLHQVLMRAANGRIGISFPDVKLTLGDRIRLHGTLDDLSSLQQSGWDKGLTDYIACSAIDPVPPGAAWRTVRRVQVKSSAERLRRRSVNKGWLNEAEAAERINVLSEQRSDLPYLQIKSGSNGHAWRLFIEHGPLVSVPVNGGFSSYGLSATATVPWF |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_010694_4 | 946115-947582 | TypeI-F |
I-F
Consensus repeat of NC_010694_4
|
24 spacers
spacers of NC_010694_4
>4.1|946143|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT TTCAACAAGAAGCGCGATGAAGAAATTGCTGC >4.2|946203|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT GTATTGACTGAATCGGCAAATTCCCATCAGGT >4.3|946263|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT TTTGAAACTGGCGAGAGAGTCGGCGTGAAACA >4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT AACTCGTCTAGCCAACGCCGCCCGCCGCGCTC >4.5|946383|33|NC_010694|PILER-CR,CRISPRCasFinder,CRT AACTATGAGGCACTCATTAATGTCTTTGTGCGG >4.6|946444|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT TGGCATCGCTGAAGCTGGGCCTGAATCATGAC >4.7|946504|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT GGAGAAATGGAAAGCATTCATGACCATGAAAC >4.8|946564|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT CTTCTGGGCCTGTCCAGTCAGTTTACGACCTA >4.9|946624|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT TGTTCGGTGCTGCGAATTCCAGTGTGGCTTAT >4.10|946684|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT TCAGAACCCCGAATTGCTTCGTCGATATAGTC >4.11|946744|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT TGTTAAATGAACACCCAAGATTTTGCCTACGT >4.12|946804|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT ACACCAACTTGGCCCGTTTCCCACACCAACTT >4.13|946864|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT TGGCATGGTGTACCGCCTACCAGTACATCGGG >4.14|946924|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT ATGAATATAAATTCCGTTTCCGGGTCTTTCTC >4.15|946984|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT ACCCAGGTGCTTACCCCAGAGAACTAACAAGT >4.16|947044|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT CACAGGCAGTCTGATTTGCACTGACATTCTGA >4.17|947104|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT CGGCAAACTTTAATAGCTGCATGCGGATTCCT >4.18|947164|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT GCGTTCCGAACATTGAAAATCTCCGCATCATC >4.19|947224|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT TTCCAGCTCACGCTCCGTCCAGTCACGCATGG >4.20|947284|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT TTCGGGAACGGCTCGGAATAACTGTTGTGGCT >4.21|947344|31|NC_010694|PILER-CR,CRISPRCasFinder,CRT AACGTCTGATTGGTATCGCATTCCACGCTGC >4.22|947403|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT TGTGATACCCGGAAGCGCTTTTAATTCTGCGG >4.23|947463|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT GGTAACGATGGGTATGAGATTAACTGCGGAGA >4.24|947523|32|NC_010694|CRISPRCasFinder,CRT ATGAAAATCGGCGAAAAAATTAAACAGATCCG |
cas6f,cas7f,cas5f,cas8f,cas3f,cas1 |
CRISPR arrays and Neighbor proteins around NC_010694_4
The CRISPR arrays of NC_010694_4 >merge|NC_010694|4|946115-947582|PILER-CR,CRISPRCasFinder,CRT GTTCACTGCCGTACAGGCAGCTTAGAAGTTCAACAAGAAGCGCGATGAAGAAATTGCTGCGTTCACTGCCGTACAGGCAGCTTAGAAGGTATTGACTGAATCGGCAAATTCCCATCAGGTGTTCACTGCCGTACAGGCAGCTTAGAAGTTTGAAACTGGCGAGAGAGTCGGCGTGAAACAGTTCACTGCCGTACAGGCAGCTTAGAAGAACTCGTCTAGCCAACGCCGCCCGCCGCGCTCGTTCACTGCCGTACAGGCAGCTTAGAAGAACTATGAGGCACTCATTAATGTCTTTGTGCGGGTTCACTGCCGTACAGGCAGCTTAGAAGTGGCATCGCTGAAGCTGGGCCTGAATCATGACGTTCACTGCCGTACAGGCAGCTTAGAAGGGAGAAATGGAAAGCATTCATGACCATGAAACGTTCACTGCCGTACAGGCAGCTTAGAAGCTTCTGGGCCTGTCCAGTCAGTTTACGACCTAGTTCACTGCCGTACAGGCAGCTTAGAAGTGTTCGGTGCTGCGAATTCCAGTGTGGCTTATGTTCACTGCCGTACAGGCAGCTTAGAAGTCAGAACCCCGAATTGCTTCGTCGATATAGTCGTTCACTGCCGTACAGGCAGCTTAGAAGTGTTAAATGAACACCCAAGATTTTGCCTACGTGTTCACTGCCGTACAGGCAGCTTAGAAGACACCAACTTGGCCCGTTTCCCACACCAACTTGTTCACTGCCGTACAGGCAGCTTAGAAGTGGCATGGTGTACCGCCTACCAGTACATCGGGGTTCACTGCCGTACAGGCAGCTTAGAAGATGAATATAAATTCCGTTTCCGGGTCTTTCTCGTTCACTGCCGTACAGGCAGCTTAGAAGACCCAGGTGCTTACCCCAGAGAACTAACAAGTGTTCACTGCCGTACAGGCAGCTTAGAAGCACAGGCAGTCTGATTTGCACTGACATTCTGAGTTCACTGCCGTACAGGCAGCTTAGAAGCGGCAAACTTTAATAGCTGCATGCGGATTCCTGTTCACTGCCGTACAGGCAGCTTAGAAGGCGTTCCGAACATTGAAAATCTCCGCATCATCGTTCACTGCCGTACAGGCAGCTTAGAAGTTCCAGCTCACGCTCCGTCCAGTCACGCATGGGTTCACTGCCGTACAGGCAGCTTAGAAGTTCGGGAACGGCTCGGAATAACTGTTGTGGCTGTTCACTGCCGTACAGGCAGCTTAGAAGAACGTCTGATTGGTATCGCATTCCACGCTGCGTTCACTGCCGTACAGGTAGCTTAGAAGTGTGATACCCGGAAGCGCTTTTAATTCTGCGGGTTCACTGCCGTACAGGCAGCTTAGAAGGGTAACGATGGGTATGAGATTAACTGCGGAGAGTTCACTGCCGTACAGGCAGCTTAGAAGATGAAAATCGGCGAAAAAATTAAACAGATCCGGTTCACTGGCGTACAGACCGCCTTAAAT >NC_010694|4|1|946115-947522|PILER-CR GTTCACTGCCGTACAGGCAGCTTAGAAG TTCAACAAGAAGCGCGATGAAGAAATTGCTGC GTTCACTGCCGTACAGGCAGCTTAGAAG GTATTGACTGAATCGGCAAATTCCCATCAGGT GTTCACTGCCGTACAGGCAGCTTAGAAG TTTGAAACTGGCGAGAGAGTCGGCGTGAAACA GTTCACTGCCGTACAGGCAGCTTAGAAG AACTCGTCTAGCCAACGCCGCCCGCCGCGCTC GTTCACTGCCGTACAGGCAGCTTAGAAG AACTATGAGGCACTCATTAATGTCTTTGTGCGG GTTCACTGCCGTACAGGCAGCTTAGAAG TGGCATCGCTGAAGCTGGGCCTGAATCATGAC GTTCACTGCCGTACAGGCAGCTTAGAAG GGAGAAATGGAAAGCATTCATGACCATGAAAC GTTCACTGCCGTACAGGCAGCTTAGAAG CTTCTGGGCCTGTCCAGTCAGTTTACGACCTA GTTCACTGCCGTACAGGCAGCTTAGAAG TGTTCGGTGCTGCGAATTCCAGTGTGGCTTAT GTTCACTGCCGTACAGGCAGCTTAGAAG TCAGAACCCCGAATTGCTTCGTCGATATAGTC GTTCACTGCCGTACAGGCAGCTTAGAAG TGTTAAATGAACACCCAAGATTTTGCCTACGT GTTCACTGCCGTACAGGCAGCTTAGAAG ACACCAACTTGGCCCGTTTCCCACACCAACTT GTTCACTGCCGTACAGGCAGCTTAGAAG TGGCATGGTGTACCGCCTACCAGTACATCGGG GTTCACTGCCGTACAGGCAGCTTAGAAG ATGAATATAAATTCCGTTTCCGGGTCTTTCTC GTTCACTGCCGTACAGGCAGCTTAGAAG ACCCAGGTGCTTACCCCAGAGAACTAACAAGT GTTCACTGCCGTACAGGCAGCTTAGAAG CACAGGCAGTCTGATTTGCACTGACATTCTGA GTTCACTGCCGTACAGGCAGCTTAGAAG CGGCAAACTTTAATAGCTGCATGCGGATTCCT GTTCACTGCCGTACAGGCAGCTTAGAAG GCGTTCCGAACATTGAAAATCTCCGCATCATC GTTCACTGCCGTACAGGCAGCTTAGAAG TTCCAGCTCACGCTCCGTCCAGTCACGCATGG GTTCACTGCCGTACAGGCAGCTTAGAAG TTCGGGAACGGCTCGGAATAACTGTTGTGGCT GTTCACTGCCGTACAGGCAGCTTAGAAG AACGTCTGATTGGTATCGCATTCCACGCTGC GTTCACTGCCGTACAGGTAGCTTAGAAG TGTGATACCCGGAAGCGCTTTTAATTCTGCGG GTTCACTGCCGTACAGGCAGCTTAGAAG GGTAACGATGGGTATGAGATTAACTGCGGAGA GTTCACTGCCGTACAGGCAGCTTAGAAG >NC_010694|4|4|946115-947582|CRISPRCasFinder GTTCACTGCCGTACAGGCAGCTTAGAAG TTCAACAAGAAGCGCGATGAAGAAATTGCTGC GTTCACTGCCGTACAGGCAGCTTAGAAG GTATTGACTGAATCGGCAAATTCCCATCAGGT GTTCACTGCCGTACAGGCAGCTTAGAAG TTTGAAACTGGCGAGAGAGTCGGCGTGAAACA GTTCACTGCCGTACAGGCAGCTTAGAAG AACTCGTCTAGCCAACGCCGCCCGCCGCGCTC GTTCACTGCCGTACAGGCAGCTTAGAAG AACTATGAGGCACTCATTAATGTCTTTGTGCGG GTTCACTGCCGTACAGGCAGCTTAGAAG TGGCATCGCTGAAGCTGGGCCTGAATCATGAC GTTCACTGCCGTACAGGCAGCTTAGAAG GGAGAAATGGAAAGCATTCATGACCATGAAAC GTTCACTGCCGTACAGGCAGCTTAGAAG CTTCTGGGCCTGTCCAGTCAGTTTACGACCTA GTTCACTGCCGTACAGGCAGCTTAGAAG TGTTCGGTGCTGCGAATTCCAGTGTGGCTTAT GTTCACTGCCGTACAGGCAGCTTAGAAG TCAGAACCCCGAATTGCTTCGTCGATATAGTC GTTCACTGCCGTACAGGCAGCTTAGAAG TGTTAAATGAACACCCAAGATTTTGCCTACGT GTTCACTGCCGTACAGGCAGCTTAGAAG ACACCAACTTGGCCCGTTTCCCACACCAACTT GTTCACTGCCGTACAGGCAGCTTAGAAG TGGCATGGTGTACCGCCTACCAGTACATCGGG GTTCACTGCCGTACAGGCAGCTTAGAAG ATGAATATAAATTCCGTTTCCGGGTCTTTCTC GTTCACTGCCGTACAGGCAGCTTAGAAG ACCCAGGTGCTTACCCCAGAGAACTAACAAGT GTTCACTGCCGTACAGGCAGCTTAGAAG CACAGGCAGTCTGATTTGCACTGACATTCTGA GTTCACTGCCGTACAGGCAGCTTAGAAG CGGCAAACTTTAATAGCTGCATGCGGATTCCT GTTCACTGCCGTACAGGCAGCTTAGAAG GCGTTCCGAACATTGAAAATCTCCGCATCATC GTTCACTGCCGTACAGGCAGCTTAGAAG TTCCAGCTCACGCTCCGTCCAGTCACGCATGG GTTCACTGCCGTACAGGCAGCTTAGAAG TTCGGGAACGGCTCGGAATAACTGTTGTGGCT GTTCACTGCCGTACAGGCAGCTTAGAAG AACGTCTGATTGGTATCGCATTCCACGCTGC GTTCACTGCCGTACAGGTAGCTTAGAAG TGTGATACCCGGAAGCGCTTTTAATTCTGCGG GTTCACTGCCGTACAGGCAGCTTAGAAG GGTAACGATGGGTATGAGATTAACTGCGGAGA GTTCACTGCCGTACAGGCAGCTTAGAAG ATGAAAATCGGCGAAAAAATTAAACAGATCCG GTTCACTGGCGTACAGACCGCCTTAAAT >NC_010694|4|2|946115-947582|CRT GTTCACTGCCGTACAGGCAGCTTAGAAG TTCAACAAGAAGCGCGATGAAGAAATTGCTGC GTTCACTGCCGTACAGGCAGCTTAGAAG GTATTGACTGAATCGGCAAATTCCCATCAGGT GTTCACTGCCGTACAGGCAGCTTAGAAG TTTGAAACTGGCGAGAGAGTCGGCGTGAAACA GTTCACTGCCGTACAGGCAGCTTAGAAG AACTCGTCTAGCCAACGCCGCCCGCCGCGCTC GTTCACTGCCGTACAGGCAGCTTAGAAG AACTATGAGGCACTCATTAATGTCTTTGTGCGG GTTCACTGCCGTACAGGCAGCTTAGAAG TGGCATCGCTGAAGCTGGGCCTGAATCATGAC GTTCACTGCCGTACAGGCAGCTTAGAAG GGAGAAATGGAAAGCATTCATGACCATGAAAC GTTCACTGCCGTACAGGCAGCTTAGAAG CTTCTGGGCCTGTCCAGTCAGTTTACGACCTA GTTCACTGCCGTACAGGCAGCTTAGAAG TGTTCGGTGCTGCGAATTCCAGTGTGGCTTAT GTTCACTGCCGTACAGGCAGCTTAGAAG TCAGAACCCCGAATTGCTTCGTCGATATAGTC GTTCACTGCCGTACAGGCAGCTTAGAAG TGTTAAATGAACACCCAAGATTTTGCCTACGT GTTCACTGCCGTACAGGCAGCTTAGAAG ACACCAACTTGGCCCGTTTCCCACACCAACTT GTTCACTGCCGTACAGGCAGCTTAGAAG TGGCATGGTGTACCGCCTACCAGTACATCGGG GTTCACTGCCGTACAGGCAGCTTAGAAG ATGAATATAAATTCCGTTTCCGGGTCTTTCTC GTTCACTGCCGTACAGGCAGCTTAGAAG ACCCAGGTGCTTACCCCAGAGAACTAACAAGT GTTCACTGCCGTACAGGCAGCTTAGAAG CACAGGCAGTCTGATTTGCACTGACATTCTGA GTTCACTGCCGTACAGGCAGCTTAGAAG CGGCAAACTTTAATAGCTGCATGCGGATTCCT GTTCACTGCCGTACAGGCAGCTTAGAAG GCGTTCCGAACATTGAAAATCTCCGCATCATC GTTCACTGCCGTACAGGCAGCTTAGAAG TTCCAGCTCACGCTCCGTCCAGTCACGCATGG GTTCACTGCCGTACAGGCAGCTTAGAAG TTCGGGAACGGCTCGGAATAACTGTTGTGGCT GTTCACTGCCGTACAGGCAGCTTAGAAG AACGTCTGATTGGTATCGCATTCCACGCTGC GTTCACTGCCGTACAGGTAGCTTAGAAG TGTGATACCCGGAAGCGCTTTTAATTCTGCGG GTTCACTGCCGTACAGGCAGCTTAGAAG GGTAACGATGGGTATGAGATTAACTGCGGAGA GTTCACTGCCGTACAGGCAGCTTAGAAG ATGAAAATCGGCGAAAAAATTAAACAGATCCG GTTCACTGGCGTACAGACCGCCTTAAAT
>NC_010694.1|WP_012440593.1|945431_945986_+|type-I-F-CRISPR-associated-endoribonuclease-Cas6/Csy4 MDRYQDIRVRVDAEMTAPVLLAQVFMRLHQVLMRAANGRIGISFPDVKLTLGDRIRLHGTLDDLSSLQQSGWDKGLTDYIACSAIDPVPPGAAWRTVRRVQVKSSAERLRRRSVNKGWLNEAEAAERINVLSEQRSDLPYLQIKSGSNGHAWRLFIEHGPLVSVPVNGGFSSYGLSATATVPWF >NC_010694.1|WP_012440592.1|944418_945423_+|type-I-F-CRISPR-associated-protein-Csy3 MAKSAIKTASVLAFERKLSNSDAIMLAGKWQDKQNWTPIKIQEKAVRGTISNRLKNAIASDPQKLDAEIQKPNLQRVDVAALPYNCDSLKVCFTLRVLGGLATPAVCNDRAYQAALAAVIDGYIARHGFSTLAARYAENIANGRFLWRNRLGAGRVAVQVTSGEKRWQFDGHNYSLRAFSQPQGDLLELAQAIEQGLSGDSFALFNVEAQVYLGNGQEVFPSQELVLDSNSKKSKLLYQIDDTAAIHSQKIGNALRTIDSWYPDADELDVGPISVEPYGSVTSRGIAYRQPIKKMDFYTLLDNWVTKDKQPDLEQQHYVMAILIRGGVFGEKSE >NC_010694.1|WP_012440591.1|943453_944395_+|type-I-F-CRISPR-associated-protein-Csy2 MSALIVLRHLRVENANAIAGITWGFPAITHFLGFTHALSRKLQQSHNMTLSGCGVICHQQQVHAYTSGRDYQFALTRNPLTKEAKTAAFNEEGRMHMTVSLLMECHGSIAGGEQGAAELKQTLANLCQRLRLAGGTVISIGQVQISGWPQDDGETRKIMRRLLPGFALLDRSALLAQHHDQQPQPEMLDAWLDFAALKMQADDGATPADGNVQWQYQPKPGAGYLVPLMTGYRAISPLYPPGEVANSRDTETPFCFTEAVYGVGEWRGLHRIDDLRHLFWRYHHQDDYYLCRGEETACDQDYPDDADDDINYN >NC_010694.1|WP_012440590.1|942113_943457_+|type-I-F-CRISPR-associated-protein-Csy1 MLRETLASFITSYIAARKTAKLEAFDKESAKKLAVLASEDEISVLRQQLQQQRAELEQKYQPQAWLSDAASRAGQIKLVTHAAKFTHSDVRGSSIFSSGSGQHETYLSTATLQKPALDAVGNAAALDIARLLQSEVEGDSLIASLQRGDYSALESLTDNPELCASWISGFKQVLVDRQPASHKLAKQIYFPIADGQYHLLSPLFSSSLAHALNQRITEAKFSEQAKTARAALKAKSWHDAPVVAYPDTAITQFGGTKPQNISYLNSVRGGKVWLLPCAPPVWQTLSKPPAKHKSIFNSSNDFSRQSWPVIQRMSRFLRRVERLDSTLDIRQQRLAMTDEIIDILFNYVAGIQNQTESIGWSAHPDCVLKRSQQLWLDPWRGDKEFQFEREGGDWKSEVARDFGHWLSRHLHSDKLNMGETERRHFSTAPLFKQRLRELEKDLAEDLP >NC_010694.1|WP_012440589.1|941373_941670_+|BrnA-antitoxin-family-protein MPKLKPGTVFPTTEEDAKIYAAVADDEDSMLLEDPQLKLTPLKKRGRPQKAQPKIAVSVRYTPEVISAFKASGAGWQTRMDVALQDWLKTHQPTEIKL >NC_010694.1|WP_012440588.1|941108_941387_+|BrnT-family-toxin MDICYDPDKDVKNRRKHGYSLADSALLDWDEMVVYEDNRQPFDEIRLIGLTYGLARLGNRIFSVCFTEHEEVYRIISLRLATRKEIQRYAET >NC_010694.1|WP_012440587.1|937737_941031_+|type-I-F-CRISPR-associated-helicase-Cas3 MNVLLIAQCNKRALEESRRILDQFAERKGDRSWQTAITQQGLLTLRKLLRKTARRNTAVACHQIKSNGQSELLWIVGNLRRFNAQGAVPTHTTSRDVLKSADENSWHSVEAVSLLAAIAGLFHDFGKANSLFQQMLVGKKGVKRSQPYRHEWVSLRLFCAWVAGRDDRVWIAALSQIEPQDEQAMLAGLEKEGLMDTTNPFAPLPPVARVVAWLILSHHRMPVYPKKNGSSESASYLPPDLEHCDGWLTEQLDALWNAENHHDQGWTPADFKAQWQFPQGTPMRSGLWCGKARKMAQRLLAQPAWLAQIDINQRFSCHMARLALMLADHVYSAQPATPGWQDADCLLYANTDRDSGSLKQRLDEHNIGVAQNALLLARSLPHLRKTLPAITRHKGFKKRSTDERFRWQDNAWQKTCELRDRAFQQGFFGINMASTGCGKTFANARIMYALSDEQKGCRFSVALGLRTLTLQTGDALREKLNLEQDDLAVLVGSQAVTQLHQLAKDNPVSHDTGSESAEALPEENQYISYEGSLDDGRLSRWLQKSPRINKLLSAPVLVTTIDHLIGATEGLRGGRQIAPMLRLLTSDLVLDEPDDFDIDDLPALCRLVNWAGMLGSRVLFSSATLPPALVLALFNAYRSGREIFQHACGLPVDGNICCAWFDENAVLTEELRLPQAFMQQHKEFVANRVSWLAKQPVLRRGWIAPVAPPARDEATIYSHMAQVILQSMMTLHHAHHQRHKELPKTISVGVVRFANINPLVAVAQQLLATEAAEDTHIHYCVYHSRHPLAMRSHFEQRLDATLTRHQSDAIWQVAEIAAALEQHPQQHHLFVVLATSVAEVGRDHDYDWAIAEPSSMRSLIQLAGRVQRHRQEEAQSENIHILQQNICSLKERDSQKPTYCKPGFEQKGYMLASRDLQKILDKEQYQTISAIPRIQSRQKVGKGPLFANLADLEHRRLMVELQGKQKEPNEYCAALWWREQASWCGEMQRRKPFRQSPPEDMHFMLIAEEGDRPEIWQPDDGPSGRKKSMVAYPDLTFAAGVSAWITPDYQQVWQQLAERLTMELEEVSLRFGEIVLRTKPESKEWHFHPLLGAFQAE >NC_010694.1|WP_012440586.1|936763_937741_+|type-I-F-CRISPR-associated-endonuclease-Cas1 MEMIKPSDLKTILHSKRSNIYYLQYCRVLVNGGRVEYVTDEGKQSLYWNIPIANTTVVMLGTGTSITQAAMREFARAGVLVGFCGGGGTPLYAANEVEVDVSWLNSQSEYRPTEYLQHWVSFWFDEQKRLSAAIAFQRVRISQIRQAWLGSKMMREHKFAISEPHLTGILDRFEQGLARCDNNTDLLALEAVMTKALYKLAAQAVSYGDFVRAKRGGGIDAANRFLDHGNYLAYGLAAVACWVIGLPHGLAVLHGKTRRGGLVFDVADLIKDALILPQAFLAAMAGEGEQEFRQRCLSSLQNAEALDTMIAALEATAREHSQVGK >NC_010694.1|WP_042958685.1|935928_936204_+|hypothetical-protein MRMATPARRLAKTVMFIALFCLFARLIDSSQFIGLATANAFAAWLHGSASQENYDDLWFFVDVTLSVLSAVVAYHMVMLLGRKLRASSGHK >NC_010694.1|WP_012440585.1|935432_935963_+|DUF2778-domain-containing-protein MALHGSFVLNGADYSPLSFPGVGTFMAFSGSGDNRNRAGCAHIPTVGPLPTGKYWIVDRSQGGLLSQSLSASKDLFNKVFRDAQFGHSDWFALWRDDMSIDDWTWINSVRRGNFRLHPGTISEGCVTLYRNSDFALLRNMLLRTPLVDVPCMRNLKARGSIEVSSHAYGDTCPTTR >NC_010694.1|WP_157861836.1|948071_948779_+|PAS-domain-containing-protein MEFSLNKDMDIRSRSFDALISYMEHSNEFWYIKDHNSRFIYMNDYGLHYSGLPKGFNPEGKLDSECPVYWSEIADIIQANDRNVMESQKVIPTLMTFMYGGKEKLIQPFLADVTPLVKEGKSIGVVGRAKKLEIYSMYHLENNKCPESISFGKPTDLFTDREFDVVFFALQSLSAKEIAKKLSISHNTVENYLHSIYDKIGVSALNQLIEYCRKNGYDKYAPNRFINPNPYMPLI >NC_010694.1|WP_012440597.1|949746_950397_+|helix-turn-helix-transcriptional-regulator MDIYAQRSGSLKKIVLVTDDGYFYLGLKYSCLSNLTMTTLGFDRFMKESVCADAMLIIDMLSWSFFKSSNETSFYEKMIKNRRPEDIVMLTSNIFQEIITDMLYPGLCKVDRKLSFSFFSELASNQEKINLAKWCPKFERKRGLTNREMNIILEIFRGGKETEISLQLNICPKTVSAHKLSALSKVGCKNISHFFLLGRPFYRDLKLLLNTKSRSL >NC_010694.1|WP_012440598.1|950751_952032_-|cystathionine-gamma-synthase-family-protein MASSHSKKTHIGQRELQPETQMLNYGYDPALSEGAVKPPVFLTSTFVFNSAEEGRDFFDYVSGRREPPTGEGNGLVYSRFNHPNSEIVEDRLAIYERTESAALFSSGMSAIATTLLTFVRPGDTILHSQPLYGGSETLLGKTFSNLGVAAVGFADGIDEASVQAAADKAMAQGRVSAILIESPANPTNSLVDIALMKRVADRIERQQQHRPVVACDNTLLGPVFSRPTEHGADISLYSLTKYVGGHSDLIAGAAIGNRALIRQVKALRSAIGTQLDPHSSWMIGRSLETLALRMERANDNAAAVAGFLRSHPKVEQIHYLPFLSPDSAAGKIFSAQCSGAGSTFSFDIRGGQDAAFRFLNNLQLFKLAVSLGGTESLASHPASTTHSGVALDVRERIGIKSTTVRLSIGIENKDDLLEDLRLALEG >NC_010694.1|WP_012440599.1|952285_952822_+|hypoxanthine-phosphoribosyltransferase MKHTVEVMISEAEIASRITELGLQISEHYRNSGSDMVLVGLLRGSFMFMADLCRAIDVSHEVDFMTASSYGNSTTSSRDVKILKDLDEDIRGKDVLIVEDIIDSGNTLSKVREILSLRGPKSMAICTLLDKPSRREVDVPVEYVGFAIPDEFVVGYGIDYAQRYRHLPYVGKVVLLDE >NC_010694.1|WP_012440600.1|952934_953597_-|carbonate-dehydratase MKDISTLISNNRQWSRLLKEEDPGFFERLSLAQKPRFLWIGCSDSRVPAERLTGLEPGELFVHRNVANLVVHTDLNCLSVVQYAVEVLEVEHIIICGHYGCGGVQAALENPELGLIDNWLLHIRDLWYKHSALLGELPPEKRVDKLCEINVIEQVYNLGHSTIMQSAWKRGQQVNLHGWVYGIQDGYLRDLEVSATNRETLEQRYRHGIANLLNDPDLNP >NC_010694.1|WP_012440601.1|953789_954716_+|ABC-transporter-ATP-binding-protein MTYALELEKLTKTYQGGVQALRGIDLAVEAGDFYALLGPNGAGKSTTIGIISSLVNKTAGKVRVFGYDLQKDMVNAKRQLGLVPQEFNFNQFETVMQIVVSQAGLYGVEKAVALQRAEKYLTQLDLWDKRHERARMLSGGMKRRLMIARALMHEPKLLILDEPTAGVDIELRRSMWSFLQQLNAQGTTIILTTHYLEEAEMLCRNIGIIQHGELVENTSMKGLLAKLKSETFILDLAAKSPLPRLEGFQYRLTDTTTLEVEVLREQGMNSVFSQLSHQGVQVLSMRNKANRLEELFVDLVNGRKGDKA >NC_010694.1|WP_012440602.1|954712_955483_+|ABC-transporter-permease MTHLYWVALKSIWGKEVNRFARIWIQTLVPPVITMTLYFIIFGNLIGSRIGEMHGFSYMQFIVPGLIMMAVITNAYANVASSFFSAKFQRNIEELLVAPVPTHVIIAGYVGGGVARGVCVGVLVTAISLFFVPFHVHSWLMVAVTLLLTAILFSLAGLLNAVFARTFDDISLIPTFVLTPLTYLGGVFYSLSLLPPVWQMVSKLNPIVYMISGFRYGFLGINDVPLGFTLGVLVAFILVFYALVWGLIQRGRGLRT >NC_010694.1|WP_012440603.1|955558_955939_-|aspartate-1-decarboxylase MNRTMLQGKLHRVKVTQADLNYEGSCAIDQDFLDASGILQYEAVDIYNVNNGQRFSTYAIAAERGSKIISVNGAAARCACEGDLLIICSYVQMSDEQAREWQPKVAYFEGDNQMKRVAKAVPVQVA >NC_010694.1|WP_012440604.1|956002_956857_-|pantoate--beta-alanine-ligase MLIIETLPMLRREVRRWRQDGKRVALVPTMGNLHDGHMTLVDEARERADIVIVSIFVNPMQFERADDLARYPRTLQEDCEKLNRRGVDLVFSPAPADIYPHGVDGQTFVDVPSLSTLLEGASRPGHFRGVSTIVSKLFNLVQPDLACFGEKDYQQLALIRKMVADMGYDIDIIGVPTVRAKDGLALSSRNGYLTAEERKIAPLLSKVMQQIAERLGQGERHVEEMMISAENTLAENGLRADGLAIVDADTLLPLNVDSQRAVILMAAWLGKARLIDNQQVDLTQ >NC_010694.1|WP_012440605.1|956872_957667_-|3-methyl-2-oxobutanoate-hydroxymethyltransferase MKPTTVSTLRQWKQQGEKFASITAYDFSFARLFADEGIQVMLVGDSLGMVVQGHDSTLPVTLADIVYHTEVVRRGAPAALLLADLPFMSYATPEQTFDSAARLMRAGANMVKLEGGKWLAETVKQLTERAVPVCGHLGLTPQSVNIFGGYKVQGRDAEAADLLLEDALALEAAGMQLLVLECVPVALAKRVTEALSIPVIGIGAGNATDGQILVMHDAFGITGGHIPKFAKNFLAETGDIRAAVRQYVEEVKAGSYPAEQHSFQ |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_010694_3 | 3.2|934347|32|NC_010694|CRISPRCasFinder,CRT | 934347-934378 | 32 | FQ482085 | Erwinia tasmaniensis phage phiEt88 complete genome | 39444-39475 | 0 | 1.0 |
NC_010694_3 | 3.2|934347|32|NC_010694|CRISPRCasFinder,CRT | 934347-934378 | 32 | NC_015295 | Erwinia phage phiEt88, complete genome | 39444-39475 | 0 | 1.0 |
NC_010694_3 | 3.12|934947|33|NC_010694|CRISPRCasFinder,CRT | 934947-934979 | 33 | NZ_CP028352 | Pantoea vagans strain PV989 plasmid pPV989-94, complete sequence | 40457-40489 | 1 | 0.97 |
NC_010694_3 | 3.12|934947|33|NC_010694|CRISPRCasFinder,CRT | 934947-934979 | 33 | NZ_HG813238 | Erwinia amylovora strain 692 plasmid pEA68, complete sequence | 65570-65602 | 1 | 0.97 |
NC_010694_4 | 4.14|946924|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946924-946955 | 32 | MN602881 | Erwinia phage Midgardsormr38, complete genome | 44923-44954 | 1 | 0.969 |
NC_010694_3 | 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT | 934467-934498 | 32 | MN602881 | Erwinia phage Midgardsormr38, complete genome | 371-402 | 2 | 0.938 |
NC_010694_3 | 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT | 934467-934498 | 32 | JX403939 | Pseudomonas phage YMC/01/01/P52_PAE_BP, complete genome | 16126-16157 | 2 | 0.938 |
NC_010694_3 | 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT | 934467-934498 | 32 | MT261384 | Salmonella virus PAT1, complete genome | 23658-23689 | 2 | 0.938 |
NC_010694_3 | 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT | 934467-934498 | 32 | MK511012 | Pseudomonas phage BR153, partial genome | 23029-23060 | 2 | 0.938 |
NC_010694_3 | 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT | 934467-934498 | 32 | MT580116 | Salmonella phage 65FD, complete genome | 19326-19357 | 2 | 0.938 |
NC_010694_3 | 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT | 934467-934498 | 32 | KU310943 | Pseudomonas phage YMC11/07/P54_PAE_BP, complete genome | 31285-31316 | 2 | 0.938 |
NC_010694_3 | 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT | 934467-934498 | 32 | NC_016762 | Pseudomonas phage phi297, complete genome | 24008-24039 | 2 | 0.938 |
NC_010694_3 | 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT | 934467-934498 | 32 | MT580117 | Salmonella phage 66FD, complete genome | 10164-10195 | 2 | 0.938 |
NC_010694_3 | 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT | 934887-934918 | 32 | NZ_CP019065 | Rahnella sp. ERMR1:05 plasmid unnamed3, complete sequence | 58536-58567 | 2 | 0.938 |
NC_010694_3 | 3.12|934947|33|NC_010694|CRISPRCasFinder,CRT | 934947-934979 | 33 | NZ_CP019065 | Rahnella sp. ERMR1:05 plasmid unnamed3, complete sequence | 64152-64184 | 3 | 0.909 |
NC_010694_3 | 3.12|934947|33|NC_010694|CRISPRCasFinder,CRT | 934947-934979 | 33 | NC_013973 | Erwinia amylovora ATCC 49946 plasmid 2, complete sequence | 68864-68896 | 3 | 0.909 |
NC_010694_3 | 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT | 934887-934918 | 32 | NZ_CP018918 | Serratia marcescens strain UMH5 plasmid unnamed2, complete sequence | 74897-74928 | 6 | 0.812 |
NC_010694_4 | 4.13|946864|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946864-946895 | 32 | NZ_CP023152 | Mycobacterium chimaera strain FLAC0070 plasmid pFLAC0070_1, complete sequence | 35691-35722 | 6 | 0.812 |
NC_010694_4 | 4.13|946864|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946864-946895 | 32 | NZ_AP012556 | Mycobacterium avium subsp. hominissuis TH135 plasmid pMAH135, complete sequence | 105666-105697 | 6 | 0.812 |
NC_010694_4 | 4.19|947224|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 947224-947255 | 32 | NZ_CP028352 | Pantoea vagans strain PV989 plasmid pPV989-94, complete sequence | 43873-43904 | 6 | 0.812 |
NC_010694_3 | 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT | 934887-934918 | 32 | NZ_CP024583 | Roseomonas sp. FDAARGOS_362 plasmid unnamed2, complete sequence | 164460-164491 | 7 | 0.781 |
NC_010694_3 | 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT | 934887-934918 | 32 | NZ_CP030127 | Indioceanicola profundi strain SCSIO 08040 plasmid unnamed1, complete sequence | 92430-92461 | 7 | 0.781 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP013003 | Caulobacter henricii strain CB4 plasmid pCB4, complete sequence | 86196-86227 | 7 | 0.781 |
NC_010694_4 | 4.14|946924|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946924-946955 | 32 | NZ_CP032928 | Agrobacterium tumefaciens strain 1D1460 plasmid pAt1D1460, complete sequence | 99960-99991 | 7 | 0.781 |
NC_010694_3 | 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT | 934887-934918 | 32 | NZ_CP024583 | Roseomonas sp. FDAARGOS_362 plasmid unnamed2, complete sequence | 206708-206739 | 8 | 0.75 |
NC_010694_3 | 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT | 934887-934918 | 32 | NZ_LR594690 | Variovorax sp. WDL1 plasmid 2 | 577582-577613 | 8 | 0.75 |
NC_010694_3 | 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT | 934887-934918 | 32 | MN857473 | Teseptimavirus S2B, complete genome | 35174-35205 | 8 | 0.75 |
NC_010694_3 | 3.12|934947|33|NC_010694|CRISPRCasFinder,CRT | 934947-934979 | 33 | NZ_CP015640 | Pseudomonas lurida strain L228 plasmid unnamed, complete sequence | 36948-36980 | 8 | 0.758 |
NC_010694_4 | 4.2|946203|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946203-946234 | 32 | JF974301 | Vibrio phage VD1, *** SEQUENCING IN PROGRESS ***, 5 unordered pieces | 39823-39854 | 8 | 0.75 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP021082 | Deinococcus ficus strain CC-FR2-10 plasmid pDFI1, complete sequence | 386903-386934 | 8 | 0.75 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | LN997843 | Streptomyces reticuli genome assembly TUE45, plasmid : II | 729824-729855 | 8 | 0.75 |
NC_010694_4 | 4.6|946444|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946444-946475 | 32 | NZ_CP010326 | Pantoea sp. PSNIH1 plasmid pPSP-3a9, complete sequence | 321157-321188 | 8 | 0.75 |
NC_010694_4 | 4.10|946684|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946684-946715 | 32 | NZ_CP012641 | Massilia sp. WG5 plasmid unnamed 1, complete sequence | 41485-41516 | 8 | 0.75 |
NC_010694_4 | 4.23|947463|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 947463-947494 | 32 | NZ_CP046723 | Pantoea agglomerans strain ASB05 plasmid pASB05p1, complete sequence | 41284-41315 | 8 | 0.75 |
NC_010694_4 | 4.23|947463|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 947463-947494 | 32 | NZ_CP034470 | Pantoea agglomerans strain CFSAN047153 plasmid pCFSAN047153_1, complete sequence | 254893-254924 | 8 | 0.75 |
NC_010694_4 | 4.23|947463|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 947463-947494 | 32 | NZ_CP034475 | Pantoea agglomerans strain CFSAN047154 plasmid pCFSAN047154_1, complete sequence | 551853-551884 | 8 | 0.75 |
NC_010694_4 | 4.23|947463|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 947463-947494 | 32 | NZ_CP031650 | Pantoea agglomerans strain TH81 plasmid unnamed1, complete sequence | 445600-445631 | 8 | 0.75 |
NC_010694_3 | 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT | 934467-934498 | 32 | NZ_CP045534 | Bacillaceae bacterium C02 plasmid unnamed1, complete sequence | 26389-26420 | 9 | 0.719 |
NC_010694_3 | 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT | 934467-934498 | 32 | NZ_CP042928 | Bacillus cereus strain G1-1 plasmid unnamed, complete sequence | 50750-50781 | 9 | 0.719 |
NC_010694_3 | 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT | 934467-934498 | 32 | NZ_CP040341 | Bacillus cereus strain DLOU-Tangshan plasmid unnamed1, complete sequence | 252495-252526 | 9 | 0.719 |
NC_010694_3 | 3.6|934587|32|NC_010694|CRISPRCasFinder,CRT | 934587-934618 | 32 | MT104465 | Pseudomonas phage MR1, complete genome | 33219-33250 | 9 | 0.719 |
NC_010694_3 | 3.7|934647|32|NC_010694|CRISPRCasFinder,CRT | 934647-934678 | 32 | LR134127 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 7 | 63782-63813 | 9 | 0.719 |
NC_010694_3 | 3.9|934767|32|NC_010694|CRISPRCasFinder,CRT | 934767-934798 | 32 | NZ_LR723678 | Arsenite-oxidising bacterium NT-25 plasmid 2 | 103966-103997 | 9 | 0.719 |
NC_010694_3 | 3.9|934767|32|NC_010694|CRISPRCasFinder,CRT | 934767-934798 | 32 | NZ_FO082821 | Rhizobium sp. NT-26 plasmid NT26_p1, complete sequence | 185187-185218 | 9 | 0.719 |
NC_010694_3 | 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT | 934887-934918 | 32 | NZ_CP015585 | Roseomonas gilardii strain U14-5 plasmid 1, complete sequence | 262811-262842 | 9 | 0.719 |
NC_010694_3 | 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT | 934887-934918 | 32 | NZ_CP024587 | Roseomonas sp. FDAARGOS_362 plasmid unnamed3, complete sequence | 136275-136306 | 9 | 0.719 |
NC_010694_3 | 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT | 934887-934918 | 32 | NZ_CP021082 | Deinococcus ficus strain CC-FR2-10 plasmid pDFI1, complete sequence | 54420-54451 | 9 | 0.719 |
NC_010694_3 | 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT | 934887-934918 | 32 | MN694277 | Marine virus AFVG_250M238, complete genome | 33546-33577 | 9 | 0.719 |
NC_010694_3 | 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT | 934887-934918 | 32 | MK422450 | Klebsiella phage ST13-OXA48phi12.4, complete genome | 6992-7023 | 9 | 0.719 |
NC_010694_3 | 3.12|934947|33|NC_010694|CRISPRCasFinder,CRT | 934947-934979 | 33 | NZ_CP037915 | Sphingomonas sp. AAP5 plasmid p150, complete sequence | 35204-35236 | 9 | 0.727 |
NC_010694_3 | 3.14|935068|32|NC_010694|CRISPRCasFinder,CRT | 935068-935099 | 32 | NZ_CP006684 | Melissococcus plutonius S1 plasmid pMEPL_178, complete sequence | 121602-121633 | 9 | 0.719 |
NC_010694_3 | 3.14|935068|32|NC_010694|CRISPRCasFinder,CRT | 935068-935099 | 32 | NZ_AP021886 | Melissococcus plutonius strain DAT1033 plasmid pMP1, complete sequence | 121601-121632 | 9 | 0.719 |
NC_010694_3 | 3.14|935068|32|NC_010694|CRISPRCasFinder,CRT | 935068-935099 | 32 | NZ_AP018525 | Melissococcus plutonius strain DAT585 plasmid pMP1, complete sequence | 121423-121454 | 9 | 0.719 |
NC_010694_3 | 3.14|935068|32|NC_010694|CRISPRCasFinder,CRT | 935068-935099 | 32 | MW084976 | Bacillus phage Kirov, complete genome | 101796-101827 | 9 | 0.719 |
NC_010694_3 | 3.17|935249|32|NC_010694|CRISPRCasFinder,CRT | 935249-935280 | 32 | NZ_CP044976 | Hydrogenophaga sp. PBL-H3 substr. PBL-H3(B2) plasmid pPBL-H3_B2-1, complete sequence | 193979-194010 | 9 | 0.719 |
NC_010694_4 | 4.1|946143|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946143-946174 | 32 | MK376341 | Pseudomonas sp. strain ANT_H7B plasmid pA7BH1, complete sequence | 1113-1144 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | CP023013 | Ralstonia solanacearum strain T110 plasmid unnamed, complete sequence | 1843238-1843269 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP032323 | Azospirillum brasilense strain MTCC4035 plasmid p2, complete sequence | 445244-445275 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP049790 | Ralstonia solanacearum strain 202 plasmid unnamed, complete sequence | 1441391-1441422 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP049794 | Ralstonia solanacearum strain 204 plasmid unnamed, complete sequence | 1230456-1230487 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP007795 | Azospirillum brasilense strain Az39 plasmid AbAZ39_p2, complete sequence | 651297-651328 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP049792 | Ralstonia solanacearum strain 203 plasmid unnamed, complete sequence | 1011932-1011963 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP016915 | Ralstonia solanacearum strain CQPS-1 plasmid unnamed, complete sequence | 2038052-2038083 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP015851 | Ralstonia solanacearum strain YC40-M plasmid, complete sequence | 562084-562115 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP022783 | Ralstonia solanacearum strain SL3755 plasmid unnamed, complete sequence | 1850606-1850637 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP022795 | Ralstonia solanacearum strain SL2330 plasmid unnamed, complete sequence | 1845453-1845484 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP022482 | Ralstonia solanacearum strain HA4-1 plasmid HA4-1MP, complete sequence | 1258689-1258720 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | CP023015 | Ralstonia solanacearum strain T25 plasmid unnamed, complete sequence | 1846823-1846854 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP052077 | Ralstonia solanacearum strain FJAT445.F50 plasmid Plas1, complete sequence | 1813603-1813634 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP052079 | Ralstonia solanacearum strain FJAT445.F1 plasmid Plas1, complete sequence | 1814252-1814283 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP052125 | Ralstonia solanacearum strain FJAT1452.F1 plasmid Plas1, complete sequence | 1813628-1813659 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP052123 | Ralstonia solanacearum strain FJAT1452.F50 plasmid Plas1, complete sequence | 1813628-1813659 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP052081 | Ralstonia solanacearum strain FJAT442.F50 plasmid Plas1, complete sequence | 1813628-1813659 | 9 | 0.719 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP052083 | Ralstonia solanacearum strain FJAT442.F1 plasmid Plas1, complete sequence | 1813628-1813659 | 9 | 0.719 |
NC_010694_4 | 4.6|946444|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946444-946475 | 32 | NZ_CP021082 | Deinococcus ficus strain CC-FR2-10 plasmid pDFI1, complete sequence | 376024-376055 | 9 | 0.719 |
NC_010694_4 | 4.8|946564|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946564-946595 | 32 | NC_007974 | Cupriavidus metallidurans CH34 megaplasmid, complete sequence | 473061-473092 | 9 | 0.719 |
NC_010694_4 | 4.8|946564|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946564-946595 | 32 | NZ_CP046333 | Cupriavidus metallidurans strain FDAARGOS_675 plasmid unnamed3 | 574159-574190 | 9 | 0.719 |
NC_010694_4 | 4.19|947224|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 947224-947255 | 32 | MN694676 | Marine virus AFVG_250M673, complete genome | 25946-25977 | 9 | 0.719 |
NC_010694_3 | 3.7|934647|32|NC_010694|CRISPRCasFinder,CRT | 934647-934678 | 32 | MG592483 | Vibrio phage 1.110.O._10N.261.52.C1, partial genome | 1417-1448 | 10 | 0.688 |
NC_010694_3 | 3.7|934647|32|NC_010694|CRISPRCasFinder,CRT | 934647-934678 | 32 | MG592605 | Vibrio phage 1.239.O._10N.261.52.F6, partial genome | 1339-1370 | 10 | 0.688 |
NC_010694_4 | 4.1|946143|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946143-946174 | 32 | JN035618 | Gordonia phage GTE7, complete genome | 14328-14359 | 10 | 0.688 |
NC_010694_4 | 4.1|946143|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946143-946174 | 32 | KF879861 | UNVERIFIED: Nocardia phage NOC1, partial genome | 12537-12568 | 10 | 0.688 |
NC_010694_4 | 4.1|946143|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946143-946174 | 32 | NC_028673 | Gordonia phage GMA7, complete genome | 14269-14300 | 10 | 0.688 |
NC_010694_4 | 4.3|946263|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946263-946294 | 32 | NZ_CP030842 | Acidisarcina polymorpha strain SBC82 plasmid pACPOL2, complete sequence | 58281-58312 | 10 | 0.688 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP032342 | Azospirillum brasilense strain MTCC4038 plasmid p3, complete sequence | 198674-198705 | 10 | 0.688 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP033321 | Azospirillum brasilense strain Cd plasmid p3, complete sequence | 632563-632594 | 10 | 0.688 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | NZ_CP033315 | Azospirillum brasilense strain Sp 7 plasmid p3, complete sequence | 4952-4983 | 10 | 0.688 |
NC_010694_4 | 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946323-946354 | 32 | KY653127 | Corynebacterium phage IME1320_01, complete genome | 12352-12383 | 10 | 0.688 |
NC_010694_4 | 4.7|946504|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946504-946535 | 32 | AP018319 | Nostoc sp. HK-01 plasmid plasmid1 DNA, complete genome | 406090-406121 | 10 | 0.688 |
NC_010694_4 | 4.12|946804|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946804-946835 | 32 | NZ_LT991956 | Enterobacter hormaechei subsp. steigerwaltii isolate C309 plasmid pC309-p2 | 67604-67635 | 10 | 0.688 |
NC_010694_3 | 3.14|935068|32|NC_010694|CRISPRCasFinder,CRT | 935068-935099 | 32 | NZ_CP016317 | Bacillus cereus strain M3 plasmid pBCM301, complete sequence | 22531-22562 | 11 | 0.656 |
NC_010694_4 | 4.7|946504|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946504-946535 | 32 | NZ_CP024793 | Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence | 191289-191320 | 11 | 0.656 |
NC_010694_4 | 4.7|946504|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946504-946535 | 32 | NZ_CP024793 | Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence | 313344-313375 | 11 | 0.656 |
NC_010694_4 | 4.7|946504|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946504-946535 | 32 | NZ_CP024793 | Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence | 498900-498931 | 11 | 0.656 |
NC_010694_4 | 4.7|946504|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946504-946535 | 32 | NZ_CP026693 | Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 plasmid pNLP1, complete sequence | 2767-2798 | 11 | 0.656 |
NC_010694_4 | 4.7|946504|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946504-946535 | 32 | MK047638 | Phage NG54, complete genome | 35388-35419 | 11 | 0.656 |
NC_010694_4 | 4.11|946744|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT | 946744-946775 | 32 | NZ_CP024793 | Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence | 242879-242910 | 11 | 0.656 |
1. spacer 3.2|934347|32|NC_010694|CRISPRCasFinder,CRT matches to FQ482085 (Erwinia tasmaniensis phage phiEt88 complete genome) position: , mismatch: 0, identity: 1.0
actggttcgctgcacgggtcaaactcaatttc CRISPR spacer actggttcgctgcacgggtcaaactcaatttc Protospacer ********************************
2. spacer 3.2|934347|32|NC_010694|CRISPRCasFinder,CRT matches to NC_015295 (Erwinia phage phiEt88, complete genome) position: , mismatch: 0, identity: 1.0
actggttcgctgcacgggtcaaactcaatttc CRISPR spacer actggttcgctgcacgggtcaaactcaatttc Protospacer ********************************
3. spacer 3.12|934947|33|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP028352 (Pantoea vagans strain PV989 plasmid pPV989-94, complete sequence) position: , mismatch: 1, identity: 0.97
catcaacctgatggactccatgctgcccaaaac CRISPR spacer catcaacctgatggactccatgctgcctaaaac Protospacer ***************************.*****
4. spacer 3.12|934947|33|NC_010694|CRISPRCasFinder,CRT matches to NZ_HG813238 (Erwinia amylovora strain 692 plasmid pEA68, complete sequence) position: , mismatch: 1, identity: 0.97
catcaacctgatggactccatgctgcccaaaac CRISPR spacer catcaacctgatggactcgatgctgcccaaaac Protospacer ****************** **************
5. spacer 4.14|946924|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to MN602881 (Erwinia phage Midgardsormr38, complete genome) position: , mismatch: 1, identity: 0.969
atgaatataaattccgtttccgggtctttctc CRISPR spacer atgaagataaattccgtttccgggtctttctc Protospacer ***** **************************
6. spacer 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT matches to MN602881 (Erwinia phage Midgardsormr38, complete genome) position: , mismatch: 2, identity: 0.938
tcaagaaaatcaaatggccggacaaggtaaag CRISPR spacer tgaagaaaatcaaatggccggacaaggtgaag Protospacer * **************************.***
7. spacer 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT matches to JX403939 (Pseudomonas phage YMC/01/01/P52_PAE_BP, complete genome) position: , mismatch: 2, identity: 0.938
tcaagaaaatcaaatggccggacaaggtaaag CRISPR spacer tcaagaagatcaaatggccggacaaggtgaag Protospacer *******.********************.***
8. spacer 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT matches to MT261384 (Salmonella virus PAT1, complete genome) position: , mismatch: 2, identity: 0.938
tcaagaaaatcaaatggccggacaaggtaaag CRISPR spacer tcaagaaaattaaatggccggacaaggtgaag Protospacer **********.*****************.***
9. spacer 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT matches to MK511012 (Pseudomonas phage BR153, partial genome) position: , mismatch: 2, identity: 0.938
tcaagaaaatcaaatggccggacaaggtaaag CRISPR spacer tcaagaagatcaaatggccggacaaggtgaag Protospacer *******.********************.***
10. spacer 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT matches to MT580116 (Salmonella phage 65FD, complete genome) position: , mismatch: 2, identity: 0.938
tcaagaaaatcaaatggccggacaaggtaaag CRISPR spacer tcaagaaaattaaatggccggacaaggtgaag Protospacer **********.*****************.***
11. spacer 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT matches to KU310943 (Pseudomonas phage YMC11/07/P54_PAE_BP, complete genome) position: , mismatch: 2, identity: 0.938
tcaagaaaatcaaatggccggacaaggtaaag CRISPR spacer tcaagaagatcaaatggccggacaaggtgaag Protospacer *******.********************.***
12. spacer 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT matches to NC_016762 (Pseudomonas phage phi297, complete genome) position: , mismatch: 2, identity: 0.938
tcaagaaaatcaaatggccggacaaggtaaag CRISPR spacer tcaagaagatcaaatggccggacaaggtgaag Protospacer *******.********************.***
13. spacer 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT matches to MT580117 (Salmonella phage 66FD, complete genome) position: , mismatch: 2, identity: 0.938
tcaagaaaatcaaatggccggacaaggtaaag CRISPR spacer tcaagaaaattaaatggccggacaaggtgaag Protospacer **********.*****************.***
14. spacer 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP019065 (Rahnella sp. ERMR1:05 plasmid unnamed3, complete sequence) position: , mismatch: 2, identity: 0.938
catcgtggacgccgcccagagcatcaccagct CRISPR spacer catcgttgacgccgcccacagcatcaccagct Protospacer ****** *********** *************
15. spacer 3.12|934947|33|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP019065 (Rahnella sp. ERMR1:05 plasmid unnamed3, complete sequence) position: , mismatch: 3, identity: 0.909
catcaacctgatggactccatgctgcccaaaac CRISPR spacer gattaacctgatggactcaatgctgcccaaaac Protospacer **.************** **************
16. spacer 3.12|934947|33|NC_010694|CRISPRCasFinder,CRT matches to NC_013973 (Erwinia amylovora ATCC 49946 plasmid 2, complete sequence) position: , mismatch: 3, identity: 0.909
catcaacctgatggactccatgctgcccaaaac CRISPR spacer cgttaacctgatggactccatgcttcccaaaac Protospacer *.*.******************** ********
17. spacer 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP018918 (Serratia marcescens strain UMH5 plasmid unnamed2, complete sequence) position: , mismatch: 6, identity: 0.812
catcgtggacgccgcccagagcatcaccagct CRISPR spacer cattaccgacgccgcccacagcatcaccaact Protospacer ***... *********** **********.**
18. spacer 4.13|946864|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP023152 (Mycobacterium chimaera strain FLAC0070 plasmid pFLAC0070_1, complete sequence) position: , mismatch: 6, identity: 0.812
tggcatggtgtaccgcctaccagt-acatcggg CRISPR spacer gggcatggtgtaccgccatccagtcacgttgg- Protospacer **************** ***** **.*.**
19. spacer 4.13|946864|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_AP012556 (Mycobacterium avium subsp. hominissuis TH135 plasmid pMAH135, complete sequence) position: , mismatch: 6, identity: 0.812
tggcatggtgtaccgcctaccagt-acatcggg CRISPR spacer gggcatggtgtaccgccatccagtcacgttgg- Protospacer **************** ***** **.*.**
20. spacer 4.19|947224|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP028352 (Pantoea vagans strain PV989 plasmid pPV989-94, complete sequence) position: , mismatch: 6, identity: 0.812
ttccagctcacgctccgtccagtcacgcatgg CRISPR spacer ttccagctcgcgctcagtccagtcgttcatag Protospacer *********.***** ********.. ***.*
21. spacer 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP024583 (Roseomonas sp. FDAARGOS_362 plasmid unnamed2, complete sequence) position: , mismatch: 7, identity: 0.781
catcgtggacgccgcccagagcatcaccagct CRISPR spacer gaccatcgacgcagcccagggcatcaccagca Protospacer *.*.* ***** ******.***********
22. spacer 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP030127 (Indioceanicola profundi strain SCSIO 08040 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.781
catcgtggacgccgcccagagcatc--accagct CRISPR spacer gatcgaggatgccgcccagagcatcggggcag-- Protospacer **** ***.*************** . ***
23. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP013003 (Caulobacter henricii strain CB4 plasmid pCB4, complete sequence) position: , mismatch: 7, identity: 0.781
aactcgtct--agccaacgccgcccgccgcgctc CRISPR spacer --ccaggctgaagccgccgccgcccgccgcgctc Protospacer *. * ** ****. *****************
24. spacer 4.14|946924|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032928 (Agrobacterium tumefaciens strain 1D1460 plasmid pAt1D1460, complete sequence) position: , mismatch: 7, identity: 0.781
atgaatataaattccgtttccgggtctttctc CRISPR spacer atgaatatgaattccggttccggcttgtgatc Protospacer ********.******* ****** *. * **
25. spacer 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP024583 (Roseomonas sp. FDAARGOS_362 plasmid unnamed2, complete sequence) position: , mismatch: 8, identity: 0.75
catcgtggacgccgcccagagcatcaccagct CRISPR spacer gaccgtggacgcggcccagggcatcacctcgg Protospacer *.********* ******.********
26. spacer 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_LR594690 (Variovorax sp. WDL1 plasmid 2) position: , mismatch: 8, identity: 0.75
catcgtggacgccgcccagagcatcaccagct CRISPR spacer ctgcgttgacgccgcccagatcatcatgggcg Protospacer * *** ************* *****. .**
27. spacer 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT matches to MN857473 (Teseptimavirus S2B, complete genome) position: , mismatch: 8, identity: 0.75
catcgtggacgccgcccagagcatcaccagct CRISPR spacer gcgcctggacgccgtccagatcatcaccctct Protospacer * *********.***** ******* **
28. spacer 3.12|934947|33|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP015640 (Pseudomonas lurida strain L228 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.758
catcaacctgatggactccatgctgcccaaaac CRISPR spacer catcaacctgatggattcgatgcttgccccggc Protospacer ***************.** ***** ** ..*
29. spacer 4.2|946203|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to JF974301 (Vibrio phage VD1, *** SEQUENCING IN PROGRESS ***, 5 unordered pieces) position: , mismatch: 8, identity: 0.75
gtattgactgaatcggcaaattcccatcaggt CRISPR spacer gtggctactgaatcggccaattcccagcagaa Protospacer **. . *********** ******** ***.
30. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP021082 (Deinococcus ficus strain CC-FR2-10 plasmid pDFI1, complete sequence) position: , mismatch: 8, identity: 0.75
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer ttccgatcaagccaaagccgcccgccgcgcac Protospacer *. .** ****** ************** *
31. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to LN997843 (Streptomyces reticuli genome assembly TUE45, plasmid : II) position: , mismatch: 8, identity: 0.75
aactcgt-ctagccaacgccgcccgccgcgctc CRISPR spacer -acgggcgctggcccacgccgcccgccgcgcgg Protospacer ** *. **.*** ****************
32. spacer 4.6|946444|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP010326 (Pantoea sp. PSNIH1 plasmid pPSP-3a9, complete sequence) position: , mismatch: 8, identity: 0.75
tggcatcgctgaagctgggcctgaatcatgac CRISPR spacer tggcagcgctgacgctgggcctggcgcctttc Protospacer ***** ****** **********. * * *
33. spacer 4.10|946684|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012641 (Massilia sp. WG5 plasmid unnamed 1, complete sequence) position: , mismatch: 8, identity: 0.75
tcagaaccccgaattgcttcgtcgatatagtc CRISPR spacer accgcacccagaattgctttgtcgatatccgc Protospacer * * **** *********.******** *
34. spacer 4.23|947463|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP046723 (Pantoea agglomerans strain ASB05 plasmid pASB05p1, complete sequence) position: , mismatch: 8, identity: 0.75
ggtaacgatgggtatgagattaactgcggaga CRISPR spacer cgtaacgtagggtatgagattaacgaagctga Protospacer ****** *************** . * **
35. spacer 4.23|947463|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP034470 (Pantoea agglomerans strain CFSAN047153 plasmid pCFSAN047153_1, complete sequence) position: , mismatch: 8, identity: 0.75
ggtaacgatgggtatgagattaactgcggaga CRISPR spacer cgtaacgtagggtatgagattaacgaagctga Protospacer ****** *************** . * **
36. spacer 4.23|947463|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP034475 (Pantoea agglomerans strain CFSAN047154 plasmid pCFSAN047154_1, complete sequence) position: , mismatch: 8, identity: 0.75
ggtaacgatgggtatgagattaactgcggaga CRISPR spacer cgtaacgtagggtatgagattaacgaagctga Protospacer ****** *************** . * **
37. spacer 4.23|947463|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP031650 (Pantoea agglomerans strain TH81 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
ggtaacgatgggtatgagattaactgcggaga CRISPR spacer cgtaacgtagggtatgagattaacgaagctga Protospacer ****** *************** . * **
38. spacer 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP045534 (Bacillaceae bacterium C02 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
tcaagaaaatcaaatggccggacaaggtaaag CRISPR spacer ccaagtaaatgaaatggccggacaatattggt Protospacer .**** **** ************** .* ..
39. spacer 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP042928 (Bacillus cereus strain G1-1 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
tcaagaaaatcaaatggccggacaaggtaaag CRISPR spacer ccaagtaaatgaaatggccggacaatattggt Protospacer .**** **** ************** .* ..
40. spacer 3.4|934467|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP040341 (Bacillus cereus strain DLOU-Tangshan plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
tcaagaaaatcaaatggccggacaaggtaaag CRISPR spacer ccaagtaaatgaaatggccggacaatattggt Protospacer .**** **** ************** .* ..
41. spacer 3.6|934587|32|NC_010694|CRISPRCasFinder,CRT matches to MT104465 (Pseudomonas phage MR1, complete genome) position: , mismatch: 9, identity: 0.719
agtttttggtttggtcgccatatagaattatt CRISPR spacer cgtttttggtgtggttgccatatatgcctcct Protospacer ********* ****.******** . .* .*
42. spacer 3.7|934647|32|NC_010694|CRISPRCasFinder,CRT matches to LR134127 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 7) position: , mismatch: 9, identity: 0.719
ttaaccccggcaccaataccgatag-agtcata CRISPR spacer ataaccccggccccgataccgataaccgttgc- Protospacer ********** **.*********. **...
43. spacer 3.9|934767|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_LR723678 (Arsenite-oxidising bacterium NT-25 plasmid 2) position: , mismatch: 9, identity: 0.719
tgtttagcggtatctccgcatagcgcatggaa CRISPR spacer agacacgcggtatcgccgcatagcggatgtca Protospacer * . ******** ********** *** *
44. spacer 3.9|934767|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_FO082821 (Rhizobium sp. NT-26 plasmid NT26_p1, complete sequence) position: , mismatch: 9, identity: 0.719
tgtttagcggtatctccgcatagcgcatggaa CRISPR spacer agacacgcggtatcgccgcatagcggatgtca Protospacer * . ******** ********** *** *
45. spacer 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP015585 (Roseomonas gilardii strain U14-5 plasmid 1, complete sequence) position: , mismatch: 9, identity: 0.719
catcgtggacgccgcccagagcatcaccagct CRISPR spacer gacggtggacgcggcccagggcatcacctcgg Protospacer *. ******** ******.********
46. spacer 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP024587 (Roseomonas sp. FDAARGOS_362 plasmid unnamed3, complete sequence) position: , mismatch: 9, identity: 0.719
catcgtggacgccgcccagagcatcaccagct CRISPR spacer gacggtggacgcggcccagggcatcacctcgg Protospacer *. ******** ******.********
47. spacer 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP021082 (Deinococcus ficus strain CC-FR2-10 plasmid pDFI1, complete sequence) position: , mismatch: 9, identity: 0.719
catcgtggacgccgcccagagcatcaccagct CRISPR spacer gttcgtggccgccgcccagagcctcggcgcca Protospacer ****** ************* **. *. *
48. spacer 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT matches to MN694277 (Marine virus AFVG_250M238, complete genome) position: , mismatch: 9, identity: 0.719
catcgtggacgccgcccagagcatcaccagct CRISPR spacer cgccaaacccgccgccgggagcatcaccagct Protospacer *..*. . ******* .**************
49. spacer 3.11|934887|32|NC_010694|CRISPRCasFinder,CRT matches to MK422450 (Klebsiella phage ST13-OXA48phi12.4, complete genome) position: , mismatch: 9, identity: 0.719
catcgtggacgccgcccagagcatcaccagct CRISPR spacer aatcgtagacaccgcccagagcattaaacgaa Protospacer *****.***.*************.* *
50. spacer 3.12|934947|33|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP037915 (Sphingomonas sp. AAP5 plasmid p150, complete sequence) position: , mismatch: 9, identity: 0.727
catcaacctgatggactccatgctgcccaaaac CRISPR spacer ctcgatgatgatggactacatgctccccaaaaa Protospacer * . * ********* ****** *******
51. spacer 3.14|935068|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP006684 (Melissococcus plutonius S1 plasmid pMEPL_178, complete sequence) position: , mismatch: 9, identity: 0.719
ttacccgatgcttcaatgaatccagacgtacc CRISPR spacer gaaccagatggttcaatgaatccagaaggcat Protospacer *** **** *************** * .
52. spacer 3.14|935068|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_AP021886 (Melissococcus plutonius strain DAT1033 plasmid pMP1, complete sequence) position: , mismatch: 9, identity: 0.719
ttacccgatgcttcaatgaatccagacgtacc CRISPR spacer gaaccagatggttcaatgaatccagaaggcat Protospacer *** **** *************** * .
53. spacer 3.14|935068|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_AP018525 (Melissococcus plutonius strain DAT585 plasmid pMP1, complete sequence) position: , mismatch: 9, identity: 0.719
ttacccgatgcttcaatgaatccagacgtacc CRISPR spacer gaaccagatggttcaatgaatccagaaggcat Protospacer *** **** *************** * .
54. spacer 3.14|935068|32|NC_010694|CRISPRCasFinder,CRT matches to MW084976 (Bacillus phage Kirov, complete genome) position: , mismatch: 9, identity: 0.719
ttacccgatgcttcaatgaatccagacgtacc CRISPR spacer actgctgttgcttcaatgattccagacttacg Protospacer . *.* *********** ******* ***
55. spacer 3.17|935249|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP044976 (Hydrogenophaga sp. PBL-H3 substr. PBL-H3(B2) plasmid pPBL-H3_B2-1, complete sequence) position: , mismatch: 9, identity: 0.719
aaagacggcacgtttttcaccaaagacgattt CRISPR spacer tactacggcacgtttttcaacgaagacctgat Protospacer * *************** *.***** *
56. spacer 4.1|946143|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to MK376341 (Pseudomonas sp. strain ANT_H7B plasmid pA7BH1, complete sequence) position: , mismatch: 9, identity: 0.719
ttcaacaagaagcgcgatgaagaaattgctgc CRISPR spacer gagatagagaagcgggatgcagaaattgctga Protospacer * .******* **** ***********
57. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to CP023013 (Ralstonia solanacearum strain T110 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
58. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032323 (Azospirillum brasilense strain MTCC4035 plasmid p2, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer acagcatctggccgacgccgcccgccgcggcg Protospacer * *.***.***.*************** .
59. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP049790 (Ralstonia solanacearum strain 202 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
60. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP049794 (Ralstonia solanacearum strain 204 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
61. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP007795 (Azospirillum brasilense strain Az39 plasmid AbAZ39_p2, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer acagcatctggccgacgccgcccgccgcggcg Protospacer * *.***.***.*************** .
62. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP049792 (Ralstonia solanacearum strain 203 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
63. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP016915 (Ralstonia solanacearum strain CQPS-1 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
64. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015851 (Ralstonia solanacearum strain YC40-M plasmid, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
65. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP022783 (Ralstonia solanacearum strain SL3755 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
66. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP022795 (Ralstonia solanacearum strain SL2330 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
67. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP022482 (Ralstonia solanacearum strain HA4-1 plasmid HA4-1MP, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
68. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to CP023015 (Ralstonia solanacearum strain T25 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
69. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP052077 (Ralstonia solanacearum strain FJAT445.F50 plasmid Plas1, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
70. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP052079 (Ralstonia solanacearum strain FJAT445.F1 plasmid Plas1, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
71. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP052125 (Ralstonia solanacearum strain FJAT1452.F1 plasmid Plas1, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
72. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP052123 (Ralstonia solanacearum strain FJAT1452.F50 plasmid Plas1, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
73. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP052081 (Ralstonia solanacearum strain FJAT442.F50 plasmid Plas1, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
74. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP052083 (Ralstonia solanacearum strain FJAT442.F1 plasmid Plas1, complete sequence) position: , mismatch: 9, identity: 0.719
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer catcgaacaagccgacgccgcccgtcgcgctc Protospacer *.. . * ****.**********.*******
75. spacer 4.6|946444|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP021082 (Deinococcus ficus strain CC-FR2-10 plasmid pDFI1, complete sequence) position: , mismatch: 9, identity: 0.719
tggcatcgctgaagctgggcctgaatcatgac CRISPR spacer atgaggtgctgcagctgggcctgaagcatgtc Protospacer * . .**** ************* **** *
76. spacer 4.8|946564|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NC_007974 (Cupriavidus metallidurans CH34 megaplasmid, complete sequence) position: , mismatch: 9, identity: 0.719
cttctgggcctgtccagtcagtttacgaccta CRISPR spacer cttcggggcctggccagtcagttttctgagag Protospacer **** ******* *********** * . .
77. spacer 4.8|946564|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP046333 (Cupriavidus metallidurans strain FDAARGOS_675 plasmid unnamed3) position: , mismatch: 9, identity: 0.719
cttctgggcctgtccagtcagtttacgaccta CRISPR spacer cttcggggcctggccagtcagttttctgagag Protospacer **** ******* *********** * . .
78. spacer 4.19|947224|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to MN694676 (Marine virus AFVG_250M673, complete genome) position: , mismatch: 9, identity: 0.719
ttccagctc----acgctccgtccagtcacgcatgg CRISPR spacer ----aaccttgaaacgcgccgtccagtcaagcatgg Protospacer *.*.. **** *********** ******
79. spacer 3.7|934647|32|NC_010694|CRISPRCasFinder,CRT matches to MG592483 (Vibrio phage 1.110.O._10N.261.52.C1, partial genome) position: , mismatch: 10, identity: 0.688
ttaaccccggcaccaataccgatagagtcata CRISPR spacer ccagtatgagcaccaacgccgatagagtcata Protospacer ..*.. . .*******..**************
80. spacer 3.7|934647|32|NC_010694|CRISPRCasFinder,CRT matches to MG592605 (Vibrio phage 1.239.O._10N.261.52.F6, partial genome) position: , mismatch: 10, identity: 0.688
ttaaccccggcaccaataccgatagagtcata CRISPR spacer ccagtatgagcaccaacgccgatagagtcata Protospacer ..*.. . .*******..**************
81. spacer 4.1|946143|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to JN035618 (Gordonia phage GTE7, complete genome) position: , mismatch: 10, identity: 0.688
ttcaacaagaagcgcgatgaagaaattgctgc CRISPR spacer gagaacaagaagcgcgagcaagaaatgaagtc Protospacer ************** ******* . *
82. spacer 4.1|946143|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to KF879861 (UNVERIFIED: Nocardia phage NOC1, partial genome) position: , mismatch: 10, identity: 0.688
ttcaacaagaagcgcgatgaagaaattgctgc CRISPR spacer gagaacaagaagcgcgagcaagaaatgaagtc Protospacer ************** ******* . *
83. spacer 4.1|946143|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NC_028673 (Gordonia phage GMA7, complete genome) position: , mismatch: 10, identity: 0.688
ttcaacaagaagcgcgatgaagaaattgctgc CRISPR spacer gagaacaagaagcgcgagcaagaaatgaagtc Protospacer ************** ******* . *
84. spacer 4.3|946263|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030842 (Acidisarcina polymorpha strain SBC82 plasmid pACPOL2, complete sequence) position: , mismatch: 10, identity: 0.688
tttgaaactggcgagagagtcggcgtgaaaca CRISPR spacer aagtaaactggcgagaaagtcggtgtgcatgt Protospacer ************.******.*** *
85. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032342 (Azospirillum brasilense strain MTCC4038 plasmid p3, complete sequence) position: , mismatch: 10, identity: 0.688
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer ggccgcgctggccgacgccgcccgccgcgccg Protospacer ..*. **.***.****************.
86. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP033321 (Azospirillum brasilense strain Cd plasmid p3, complete sequence) position: , mismatch: 10, identity: 0.688
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer ggccgcgctggccgacgccgcccgccgcgccg Protospacer ..*. **.***.****************.
87. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP033315 (Azospirillum brasilense strain Sp 7 plasmid p3, complete sequence) position: , mismatch: 10, identity: 0.688
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer ggccgcgctggccgacgccgcccgccgcgccg Protospacer ..*. **.***.****************.
88. spacer 4.4|946323|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to KY653127 (Corynebacterium phage IME1320_01, complete genome) position: , mismatch: 10, identity: 0.688
aactcgtctagccaacgccgcccgccgcgctc CRISPR spacer cggcaacctagccgacgacgcccgccgcgccc Protospacer . . ..******.*** ************.*
89. spacer 4.7|946504|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to AP018319 (Nostoc sp. HK-01 plasmid plasmid1 DNA, complete genome) position: , mismatch: 10, identity: 0.688
ggagaaatggaaagcattcatgaccatgaaac CRISPR spacer atagaaattgaacgcattcatgaccccagatt Protospacer . ****** *** ************ ...* .
90. spacer 4.12|946804|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LT991956 (Enterobacter hormaechei subsp. steigerwaltii isolate C309 plasmid pC309-p2) position: , mismatch: 10, identity: 0.688
acaccaacttggcccgtttcccacaccaactt CRISPR spacer cttccatcttggcccgtttccctcaccgttcg Protospacer . *** *************** ****. ..
91. spacer 3.14|935068|32|NC_010694|CRISPRCasFinder,CRT matches to NZ_CP016317 (Bacillus cereus strain M3 plasmid pBCM301, complete sequence) position: , mismatch: 11, identity: 0.656
ttacccgatgcttcaatgaatccagacgtacc CRISPR spacer cctgaagatgcttcaatgactccagaggttat Protospacer .. ************* ****** ** .
92. spacer 4.7|946504|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP024793 (Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence) position: , mismatch: 11, identity: 0.656
ggagaaatggaaagcattcatgaccatgaaac CRISPR spacer atagaaatagaacgcattcatgacccacgttt Protospacer . ******.*** ************ . .
93. spacer 4.7|946504|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP024793 (Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence) position: , mismatch: 11, identity: 0.656
ggagaaatggaaagcattcatgaccatgaaac CRISPR spacer atagaaatagaacgcattcatgacccacgttt Protospacer . ******.*** ************ . .
94. spacer 4.7|946504|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP024793 (Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence) position: , mismatch: 11, identity: 0.656
ggagaaatggaaagcattcatgaccatgaaac CRISPR spacer atagaaatagaacgcattcatgacccacgttt Protospacer . ******.*** ************ . .
95. spacer 4.7|946504|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP026693 (Nostoc sp. 'Lobaria pulmonaria (5183) cyanobiont' strain 5183 plasmid pNLP1, complete sequence) position: , mismatch: 11, identity: 0.656
ggagaaatggaaagcattcatgaccatgaaac CRISPR spacer atagaaatagaacgcattcatgacccacgttt Protospacer . ******.*** ************ . .
96. spacer 4.7|946504|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to MK047638 (Phage NG54, complete genome) position: , mismatch: 11, identity: 0.656
ggagaaatggaaagcattcatgaccatgaaac CRISPR spacer acgccggcagaaagcagtcatgaccaggaaac Protospacer . . ....******* ********* *****
97. spacer 4.11|946744|32|NC_010694|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP024793 (Nostoc flagelliforme CCNUN1 plasmid pNFSY08, complete sequence) position: , mismatch: 11, identity: 0.656
tgttaaatgaacacccaagattttgcctacgt CRISPR spacer caataaatgaacccccaagatttagcgaggca Protospacer .. ********* ********** ** .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
625326 : 694971
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_010694|625326:694971|DBSCAN-SWA CATGGATACCGTTATAGCATTTTTATCTCTGGCTCTATTTATTGCTTTTATCGTGGGTTTAATCAAGCCGTCGCTGGTTATGATGCCTAACCGTAAGCGCTCCAGTGCGCTTTATCTTGGTGGCTGTCTGGCGCTGAGCTTTATTGGCTCAATCTTATGGCCGACTGAAAAAAGCCAGCGTGTAGCAAAAGCTGATGTACCGGCAGTAAAAGCGGAACCGGCTCCGCCAACGTTTGAATACGCCGATAAAACCCTCAAAGAATATCGCAATGAGCTAAAAGAAACCCGGCACGATATCGTTAAAGACTATGTTAACTTCAAAAGTGTACCGGCCAGCTCTACTGATGCCTTTTATGCTTGTATGAGTGAGTACAGTTTTACTAAAGATGATGCGTTAAAACTCGGTGATGTGTTGGGGTGGTGTTTCAACCACTTCGAGAAAGATCCACAATCGCTGAATAATAAAATCAACCTTGACACATTTAAGGGTAATTTTAGCGGTTGGGATGGATCCTATCGCCCGTTAGAGAAGCTGATAAAAGCCAGCATGAATGATGATTCCTCTTATAAACATATTTCAACGGTCTACCATCTTATTTTGAATAAAGACCCGTATGCCGTTGTAAAAACAACGTTTCGCGGCACTAATGCTTATGGTGGTGTGGTCAAACAGACCGTAGCGGCACGCGTCAACGTGCGAACGGGTGAAGTCCTTTCGATACTCGACAATTAAGTTTAATTTCGTGAGTGCTAATATAATGTTAGCACTCACGTTTAAATGCGGTTAAGGGGGCGCAACGAAATATCAAGTAGACTTAAAATAAAATGCAATTGATCGTTTAATGAAGTATTAACAAAGTATAAATTACGACAGGTAGGAGAGACGCGGAAATATAAATTTGAAATTTTGCATTTACCACGTTTTCTTGAGTTAGTGCTGAAGTTATATATCGTCTCTGTGGACGAATATCTGCTGTTCACTAAATGCTATAGCCCTATTAGAGATAAAGCTAAGTGAGACGCAATAGACCTTTTGGATGCGCATAATTCTCTTACCAAAGAAGGTGCTGATTTAGGTAAATTCATTGATGGTGTTGCCCGTCCTTACACCGTTACATTAGACCCGGCGCTAAGTTTCACTAAGTGCGAAAAATGCAGTAAAACAGGTGTCAATTTGTAGATATTGTTTTTTGTAATGATACCGACTGGCTTAAGAATCTAAAGAAGGGTATTGGGCATTATCAATTCAGATTTACGCCAAAAAAAAATCGATTGCATATTGGCCGTCTTGTCCGAGGTGTGACTGAGTTTATCGATACATTTTATCTTTTCAACCTGGAGCATGAAGTTGGTTCGGAAAACCTAAAGACATTTCAAACATTGGCTGATAAATATTCCCATTTACTCAGTGAGGCTGAAAAGGAAGTAGAGGAAAAGGAGGCGGAAGCATTTTACGGCATAAGACCAAGCGATTATGAGTTCCTGACGGAATAACCGGAAATCAATGGTTTGTGTTATTAATCATTGACATAGGTTGCCATTGTTTGCCAATGATTACTCGTTATCCGCCATTTCTACCGCCACTTTATCGCCACTTATCCTACGGGGCAACCTTGCTTTGATAGCGCTCAATCTGCTCTTTTGTTAAACCTAATTGTGATAGCAAATGCGCGGTAATAGTTATCTTGAGATATAACCCTATTTTGATATAGTCATCTATCGACATTACTTTCATGAGTTTTTTTGGTCGGCCGACATGAGCGAGTTCGCCTCTAACTATAGTAATATTTTCTCCAATTGTTTTTTTATTGAATTTTGAAAATATTGCTTCTAATTTGTTTTTTAAGTCGATTGACGCATATTTATTTATAGGCCCTATATATTTGTCTTCATTCTTCGCTGAGAGAGTTAAGTTAATGGATTCTAACTGAGTGGCATATAATATAATATCCGCGTGAGCTTGGTGAAGAGTCCTAAGTCCCGTTTCGTATTGATATGTTACCGAGAGAGGGTTATACTCATCGCTCATATTTAACCATTTGCTAATAACTTCTCCCATATCAATTTGTTTCCAATTCAATGGTAGAAAGTGATGATTGATAGTCGAAAGTGCGAGATCTATAGTTCTCTGTTCTATACCATTAGAAAAAAGACAAGGGTTGTTTTTTTGTTTTCCTTTGAATTTTATATTTAGCTCATCCGGTATGACTGGTTTGTCTAATAATATTGAAAATAGACCAGTCAATTTCCATATGTCATCAATGTGTTTTATGATGCTATCATTAATGGTGTTGGCGTATCGTAGAAAGAACTTAAGGTTTTTTCTTATGGAGAAAAATGCTTTAGGGTATTCTTTTTTTGTTGACCAAAAGTCTTTGGTGAATTTATTGAAAGCCTCCTCATGCTGGCAGTCTATGATGTTTACAAGGCTATCCCCAATCATACTAAATGTAGCATTGTTGATGACGTCTATTTTCCATTCGGAGCCGTGAATAGATAGGATTGGTTTTGTAGAGTATTTTAGTTGAGATATGAATCCTTGGGGGTGAATAAATTCCTGCATGCCATGAAGAGCTATATCGCAGTATTCCACACTATCTTCTTCAGTATAAATTCCGTTAAAGATTATTGCTTTAAAATAGTGTTTGCCAGTTAAAACCCTTAACTTACCAAAATGCATTGAGCCTTGAAGGAAATCAAATGAACCAATCAGAGTGCATGGCTCGCCAGTGTTCAATACTCCGTATAACCGCTGGCATGTTCTCGGACTATCACTATCTGAAATGCAATAGTCAAGTACCAATCCATTGTAGGGAGTGTATTCTATTTTGGCAGAAAATCTGTCTTTATTATCATGGGGGGCACTCCAAAACTCTCCATGAAAAGAGTGCTCTTTTGTGAAATCATAGAAATAACCAGCCATGACTACCTCATTATTTAATTGATTGCCGTTGTATAATTGTTCAGTTGATGATTAAGTGGGTTTAATCTAACAGCATCCTCTAAATGGTCGGGGGCAAAGTGTGCATATCTCATGGTCATTTTAATATCGGTATGGCCAAGCACGCGCTGCAAGACCAGGATATTACCACCATTCATCATAAAATGACTGGCGAAAGTATGGCGCAAAACGTGGGTTAGTTGTCCAGCCGGTAATTCGATACCTGTTCTTTCCAGCGCAGACCTGAATGCGCCATAGCAGTCACTAAATAACCGACCCTTTTTATCATCAGGCAGAGATTCGTAAAGCTCTTTGCTGATGGGAACGGTGCGGTTTTTTCTGCCTTTGGTGTTGGTATAGGTAATTTTATATTTCGCAAGGTGGCTTTTTTTCAGGCTCTCAGCTTCAGACCATCGAGCTCCGGTGGCGAGACAGATCCTGACCACGGTTTCTAAATCTGGGTGGTCATGCCGCTTGCACTCGTCTAGCAGTAGTGCAATCTGGTCGTGGGTTAGCCAGGCCATTTCCATTTCTGCCGTACGGAAAGGGCGCATATTTTTTAGTGGATTTTCGCCTTTCCATTCCCCGAGGCGATTTAGCTCATTGAACACCGCGCGAAAGTAGGCCAGCTCAAGGTTAAGCGTGCGGGGCGAAACCTCCTTCACCCTATTTGAACGGGCATACTCACCCTTCAACCGTCTTTCCCGGTAGCGGGAAAACATCTGCGCATCGAAATCGCGTGCGAGCGGTTCGCCCATGCAGCCAAAAGCGTGGTGCATTGCTTGCTGCCGTCTGATGCCATCTTTAAGGGTGATGCCATGTGCGCTATACCACGCATCGACAAGCTCTTTTAGAGTGCGACGATCTTCTCTTTCTTCCTGCCACGGATTTTGGACAGTGAATTGTTCAAACGCCAGCGCTTCGCCTTTGGTTGCGAATTTTTTCCTGATGCGTTTGCCTTTTGCACCATTCGGATAAATTTCGCTTAACCAGCCTACGCCAGATGGATGTTTTCGAATCGCCATTAATTAATCTCGCTATATATCCCTACCACGCGACCTAATAATCTGATGTCATCAAAATTGCACTCAAAGGGCACTTTGCCCCCTGCTACATGGAGTCTTCTACCGGGCAGGATGGTTAACTCACGAATGCTGTTTGCTGACTCAATATCGACCAGCCAAAGACCATCTGAATGTGGGGCTTCTTGATCAACGAAGTGCAGCTTCCCCTCTGCGCGAACAGCAATCCCTTTTGAAATCTGCTTATTCAAAATATTGGAGTCGATGCTCAATATGGAATTTTTATCGAGAATTCCATTATTAAGAGTGTATAGGTCCATGTTTTTTGGATCTGTAATGGACGGGCTTCCTTCAAACTGCGGTCCTTCACCGGTGAGTATCCATCTCAAACTTGCACCTGTTTCAAGCGAACAAAAAGCCGCGAGATCGTAGGAGATAGTGCCACGTGTGTAGCGGTTCTGCAGGGAGCTTGGGGACATTTCAAAGTGATTTGCGAGCTGTAATTTTTGAGTGAAACCATAAACAGAAATAATCCTGTCAAGGATAGTTAGAGGCTCAAGAGTGTTTTTTTGTATGCCCATTAGTGTTTTCCATGTTGACCAATGCTCAAAAGTTAATTACATTGCTCACAAGTGAAGTTTGAATGTTGGCAAACATTGTGGAATGAGAGGCCAACATTGTTCAAAAAAGCAAATAGGGAATCATGCACCATGGCATCTGAAATCGCAATCATCAAAGTACCTGCGCCCATTGTCACCCTGCAGCTGTTTGCAGAGCTGGAAGGTGTTTCCGAGCGTACTGCATATCGCTGGACAACTGGTGACAACCCTTGCGTGCCAATCGAACCGCGCAAAATCCGTAAGGGTTGTAAAAAAGCCGGTGGTCCTATCCGTATCTATTACGCACGTTGGAAAGAAGAACAAACGCGTAAAGCTCTGGGGCATTCACGTTTTCAACTTGTTATAGGCTCTTAATTCACTTTATGGCAAAAGTAAGGGTGTAACATGTTTGATTATTGTGTTTCCAAACATCCGCATTTTGACGAAGCATGCCGGACTTTCGCACTGCGTCACAATATGGCGAAGCTGGCAGAACGCGCGGGAATGAACGTTCAGACGCTGCGCAATAAGCTGAACCCGGAGCAACCGCATCAGATTACGCCGTCGGAAATCTGGCTGCTTACCGATCTAACAGAGGACTCCACGCTGGTTGACGGTTTTCTGGCTCAGATTCACTGCCTGCCATGCGTCCCGATGAACGAAGTAGCAAAAGAGAAGCTGCCGCATTACGTCATGAGCGCTACTGCTGAAATCGGACGTGTTGCTGCCGGCGCCGTATCTGGTGATGTGAAAACCACCGCAGGCCGTCGCGATGTTATCAGCAGCATTAACTCTGTTACTCGTCTGATGGCGCTGGCTGCCGTTTCCATGCAGGCACGTTTGCAGTCTAACCCGGCGATGGCTAGCGCGGTCGATACCGTGACGGGTCTCGGCGCATCGTTCGGTCTGATCTGAGGTGGTTATGCTGACTAAAGAACCCTCCCTTGCATCACTTCTCGTAAAGCAAAGCCCTGCAATGCACTACGGTCACGGCTGGATCATGGGGAAAGATGACAAGCGCTGGCATCCGTGCCCATCTCAGAATGAGTTGCTGGCTGGCCTGTCCACAACCAAACAGGGGAAATCATGGCTATTGAAGGCCCTACGGCAACTATTCCATTAAGCCCCGGTGAACGCCTGGAGGGACTGAACCATATTGCAGAGTTGAGGGCTAAAGTGTTTGGTCTGGATATTGAGCCGGAGCTAGAAAGGTTTATTAAAGATATGCGCGCCCCACGCGACGTAAATCATAAACAGAATGAGCGGGCACTGGCAGCCATTTTTTATATGGCAAAAATTCCAGCAGAACGTCACGGCGTCAATATTAGTGATCTGACGACTGACGAAAAGCGGGAGCTGATTAAAGCAATGAACCATTTTCGTGCAGTGGTGAGCTTATTTCCCAAACGGCTAACCATGCCGAATTAATCCAAACAGAAATTTAATGGCGTAAACCCGCCGGGCTTTTTATTGCCCGAAATCAGGAGAGTCAATTATGCGTAATACCGAAACCCGTAGTTTTAACACTGATAGCAATGCGCTGGCCGTATTGCTGACCGATGCCAAAAAAGAAGAACGCAAAGACCGCGCGCTCGCTGTTTCCATCCGTCTTGAGGCGCTGGCTATCCATATCACTAAAGTGGGGATGAGCGGTACCGAAGCCGCCGAACTACTACGCCGTGAAGCCACCCGCTTTGAGAACGAATCGCAGGAGCTTCACTAATGGCAGACTCAATGGATTTAGTGCAGCAGCGCGTTGCGGAAAATCTGGCGTGTGAACTCGCCAACATCATCCGCCGCCCGGTAATGCCAAGCGCGTTTTTCTGTGAATGCTGTGATGCCGCTATCCCTGAAGCGCGCCGCAAAGCGCTGGACGGGGTGACTCTGTGCGTGACCTGTCAGAGCATCGCGCCTATCCCTGAAGCGCGCCGCAAAGCGCTGGACGGCGTGACTCTGTGCGTGACCTGTCAGGGTATCGCCGAGCGTAAAAGTAAACACTACAAAGGTGCTGGGCTGTGAAAACCATTCTGAAACGGGTGGGCAGCAAATCCGCCACGATGCCGGAGCGGGTTAAAAGCCTGTACCGCCGCTTTGATATCAATCACATAAACGCCAGGCGCAGTATTGGCGTGGCGGCGGGTGAAGGTAAGCGGGTAGCGGAAGTTATCGCGGTATCAACATCCACGGTTTGCACTGGCCATAATCCGTCCTGCACGCCGCGCTGCAACGTGGTGGCCGGGGCGCGGCGGTGAGCGCAGGCAAGCCTGATTTAGCCCACGGGGATAACATAAAACCAGGCGGCACAGACGATGCCGCCTGGGCTTTTCCGTGGAACGCCCCAAAGAAAGCGATCAACCCCTATCTGGACAGGCCAGAGGTTAAACCATCCGCGCTTTCAGACCCGATCGCTCTTTTCGCCGCTGAAAATGAAGGGGCAAAGCAACGGCGCGCGGCACTGAGCGATGAGGCCTGGAACCGCTATTTCTACAATGAATCACGCGATCCCGTGCTGAAGGAGATGGAACAGGAGCGGCTGACGGGCCGCGCCAGACTGATCCATGAGCAGCACCGGTTCAACCCCGACCTGGTCATTATCGATAACGTGCGCGCTGAGCCCGCATTTATCAGCAAGCCGCTGATGCAGCGCATTGCGTATTTCCAGCAGCTGGACAGGCCCAAAGCCTGCTCACGCTATCTGCGCGACACCATCACCCCCTGCCTGCAACGACTGGAGCGCGTGCGTGACAGCCAGGCGTCGGCGTCTTTCCGCTTTATGGCCAGTCGTGATGGCCTGGACGGGCTGCTGGTGCTGGCGGAAATGAACCAGCACCAGGTCAAGCGCCTCGCGACCCTTGTTGGTGCGCATATGAGCCTCTGTCTGGAAGAGGCCGGCAGCGCGCTGTTTACCGCCGATGAGGTGAAGCCGCAGGAGATCCGCAGGGTATGGGAGCGCGTGGCGGCTGAAGCGATGCGCCTTGACGTTATCCCGCCCGCCTTTGAAGCGCTGAGGCGCAAAAAGCGCCGCCGTAAGCCGGTGCCTTATGAACTTATTCCCGGTTCGCTGGCGCGCATGCTGTGCGCCGACTGGTGGTATCGCAAGCTGTGGCAGACGCGCTGCGAATGGCGCGAGGAGCAGCTGCGCGCCGTGTGCCTGGTCAGCAAAAAAGCATCGCCCTACGTCAGCTATGAGGCGGTGGTACACAAGCGCGAGCAGCGCCGCAAATCGCTGGCGTTTTTCCGCGCCCACGAGCTGGTGAGTGAAAACGGCGACACGCTGGACATGGAAGAGGTGGTGAACGCCAGCGCCAGCAACCCGGCACACCGGCGCAATGAGATGATGGCCTGCGTGAAGGGGCTGGAGCTGATTGGCGAAATGCGCGGCGACTGCGCGGTATTTTATACCGTCACCTGCCCGTCGCGCTTTCACGCCACGCTGAGCAACGGCAGGCCAAACCCGACGTGGAGCAGCGCCACGGTGCGCGAGAGCAGCGATTACCTGGTCAATACCTTTGCCGCCTTCCGCAAGGCGATGCACCGGCGCGGGCTGCGCTGGTACGGGGTGCGCGTGGCCGAGCCGCATCACGACGGCACGGTACACTGGCACCTGCTGTGCTTTATGCGCAAAAAGGAGCGCCGCAGCATCAGCGCGCTGCTGCGCAAGTTTGCCATCCGGGAAGACCGGGCGGAGCTGGGTAACAACACCGGGCCGCGCTTCAAATCGGAACTGATTAACCCGCGTAAGGGCTCGCCAACCGGCTATATCGCCAAATACATCAGCAAGAACATTGACGGGCGCGGGCTGGCCGGGGAAATCAGCAAAGAAACCGGCAAATCCCTGCGCGATAACGCGGAAAACGTCAACGCCTGGGCGTCGCTGCACCGCGTGCAGCAGTTTCGTTTCTTTGGTATTCCGGGGCGTCAGGCGTATCGTGAGTTGCGCCTGTTAGCCGGGCAGGCGGGCAGGGCGCAGGGTGATAAGAAAGCGGGCGCGCCGGTGCTGGACAATGCGCGGCTGGACGCGGTACTGGCGGCGGCGGACGTCGGCTGCTTTGCCACCTATATCATGAAGCAGGGCGGCGTACTGGTTCCGCGCAAAAATCACCTGATCCGCACCGCCTACGCGCTGAACGACGAACCCGGCACCTACGGCGATCGCGGCATCCGCATTTACGGCATCTGGTCGCCGCTGGTGGCGGGGCGTATCTGCACCCACGCGCTGAAGTGGAAAAAGGTGCGCAAGGCCGTTGACGTTCAGGAGGCGACGGCCGACCAGGGCGGTTCTGCCGCCCCTTGGACTCGTGGCAATAACTGTCCCCTTGTGGAAAATCTGAACAAATCAGGGGGTGAATTACCCGATATTAAAACCATGGATGAGAAGGAGCTGCAGGAATATCTCCACAACATGGGCCAGAAGGAACGGCGGGAGCTGACAGCCAGGTTAAGGCTGGTAAAACCGAAGCGGAAAAAAGCATATAAACAGACTATTTCGGATCATCAGCGTCTGCAGCTTGAGGCAGAACTGAGTTCCAGAGGGTTCGATGGTAGCGAGTCAGAGATTGACCTGCTTCTGCGCGGCGGCAGTATTCCGTCAGGTGCGGGGCTACGTATTTTTTACCGCAACCACTGTCTACAGGAGGATGGCAAATGGCGTCAGTGGTACTGATACCGCGGCTTTAACAATTATTGCTCTTATTGATCCGAATCAGAGCGATCCTAATTGACATATAAAAAATGTTTTACATTCACCAAACCCTAATATACTGTATTTATATACAGTTATTTTGTATCCGTAGTAGTGATAGGAGGGAAAATGCAGGATTATTTTTTGGAGTCTTTGAAGCTCCAGCGCATTGATTTTTTTATCAAGCTTGTAGCGGCTAGTGAGTGTAGCGAAGAAGAAAAGCGGCTGGCTATCCAGTGGGTGTCCGAACTGACCGACGAGCTGATGGCGAAAATCCGTAGCCATGAATACTGCCGGTCGATGGACGTAATTAGTTAAAGGGGAATCTGTATGCGCATTGAAATAATGATCGATAAAGAGCAGAAGCTTAGCCAAGCTACACTGGACACCCTTGAATCCGAGCTTTACCGTAATTAGCGCCCTCTATATCCCAAAACAGCAATTCGTATCCGCAAGGGCAGCGCCAATGGCGTTGAGCTGAGCGGATTAAAACTGGATGAAGACAAAAAGCGAGTGATGGAAATTATGCAGCAGGTATGGGAGGACGACAGCTGGTTACATTAGCGAACGTTGCGGACGATAAAACTGGTTTTTACCGTCGGCAAGGTTGAACAACGAGCCACGCGAGGCGTTAGTGCCGTTGTGCATGACTATGCCGCATGAAATCGCATGATCGTTTGAGGATCGTTTTTGCCCCGGCCCGTCATATATGGCGGGCTTTTGCTTATGTCATGCACCTGCATGAAAACCACTGCACAAAGCGGGCAGGCGTGGCGGGGATACGAGCGCGCGCCATGACGTTCAGGCCATGACAATGCGGCCCAATTTCCGGCCCGCTGATGTGGGATCGGCTTTCGAGTGGCGCTGAGAAGGATAGGGCGGTACGCGTTGTAGGATGCGATACATGCGTGATACAGGGGGGGCTGGAGTTTCAAAGTCGTTGCGGTTAAAGTTTCATCATGCTTTTTATTGCGAGGTGTGGTGATGGATACAACCGAACAGCTTAACGGAACCTATTTTTACGGCGGCCTTTCGAATCTTAACGCCGGTGAGCTTTTTTACTGGATTATGGTTGATGTGACCGCAGAGCATTTTACGGGCGCGACAGCGGCAACAGGCAACGTTATCGCCGCTGCTGCTATTTATGCCGGTCGTAATAACGTCGCCGTGTCCGGCAAACTTGCAAACGCTACTCCTGGCACATCGTGGGCATCTATTCAGTCACGCAGGCTGTTGCAAAAATATAAACTGCCTTTCCCGCTGCCGACCATAGTTGGAAATCCGTTTAAAATGAAAATTATAATGACTAAAAAGCTAGGCACATTTGTTGGCAGAACAGTGCCAGTCATTGGCTGGGCTATAGTGGCGTCAGACGTAGCGATCATAGGCTGGAAGTCGGTTAACCGTTACAATACGATAGCCAGTGCGGAGGATAAAATATGGTGATGGATGATAATGAAAAGGCAGTATTCGCACTTGTTGAAGAATATAACGGTCATTGGTTTTGGTTGCGTAAGCGCTTTCGGCTGACACCTGCAACGGATCTGAATAAGGATTTCAGGATGGCACCGGAGGACGCCGCCGAGCTGCTGGAAACATTCGCGGACAGATTTTCTGTCGACCCGAAAGAAATAAATTTCGGGCGTTATTTTCCGGCAGACAATGGCAAGGCGGAAAAGCCGCTGACCATCCAACTGTTAATTGATTCAGCCCGCGCCGGTCACTGGATTGATAAATAAAAAAGCGCCCGCAGGCGCTTTGTTTTCCACTGTATATTAGTTGCCATCTTCCAGATCGAGGTTATAAGGCGCAAACCTGATAACCTCCTGGCCGAGTCCGAGCCAGTCATTTAACTCCTTCATACGTGCCTGCAGCGGAATGAGTTCGTTTCGTACAAACACGCGGCTGGCCTTTTCAATATCACCAAACCCGCCGGTATTGTTTGGGATTATCCCCATTAGCTGCGGCGGTACGCGATGCACGGCCAGCATGTCATCACGGCTCACGTTTTTGATATTCAGGAACTCATCTTTGGCCGCCACCTCTGACAGCGGGATGATCTGGATGCCGTCTTTCTTCCCGTTCGGGCTGTACATAAACAGGTTGCGGAAGTTGCCAGGGCCCTTTGCGCTTTTCATGGCGCTGCGGATATTGTCCACGTCCTGCTGGCTCTGCGCCGGGTCAGTCATATACATGATAAAACCCGCATGGCTGCCGTTCAGGTAATACTTGCGGCGGAACAGCGTGGCCGACTCATTAAGCAGCGCAGACGGGATGGCCGACAGGTAGCCCGGCACGCCGTAAATTTCCTGATTGATATCCGGCTCCATCAGGTGAAAGACGCTGCCCTTTGTAAACTCATACGGTTCAGTGTTAAGGCCGTAATGCGCATACCAGTAGGTGTCGAGGTCAAGCCCGCGACGCGTATATTTTGCAAGCGAGGGTTCCAGTTTCAGCGTGTTGCCGAGGCGGCTGATCCGCTTCTCCAGATAGGCATTGCCGAAAATCAGGTAATCCATCGCAAAGCGGGTAAAAGCCTGCTGGCTCAAAAGTGGGTGCGGGATAAATGTGCTCGCCAGAATATTGCATTTTACGCTTATAGGTGAGCTGTGATGCACGGCGGCGCGGAACGTGCGCGCCAGTCCGTCAACGCTTACGGGCGGCTCATACCAGCGATCATTGATAACGCACTCCACGTAATCGAGCAATTCGCGGCGGTCCAGCACCGGGATCGGGTCGCCAAAGGTAAAAGCCTCCGACGCTGCCCCGCTGGTCATGTTATCCGGCTGCGGCACGGACTGTGTGCGCGTGCGGTTTCTGCGTTTGCTCATTAATACATCTCCACAATATTCTGCGTGTGTGCCGCCTGTCCCTGCAGCGGCTCATTTGCCAGCGCGTGCATGGTCGCCCAGGCTAAATCGCCGTGGCTGACTTCCTCGCTGCGACTGGTTTCATAGGTCGGACGATTGCCGCTGGACGTAGTGGCCTTGCGGATAGACATAAACGACTGCGCGATGTCGAGATGGCTGGCGTCAAACTCCAGCCGCCCGGTTGCCATCGTGTCGTAGGCTTTGAGCACCAGGGCATTTTTAACCGTCGGGTTGTAAACAAACTCTTTTACCTGTGGGAAAAAGGCTTTCACGTTCTCATAGACGCCGAGACCGACGCCGGTTGAGTCGATGCCGATATAGGTCACGTTGTACTGCTGCGTAAGGGTTTTAATCGCGTCGGCCTGCGCACGGAAATCCATGCCGCGCCACTGGTGACGCTCAAGGATGCGAAACTTGCCGCCCGGCACTGCAGGCGGGGCAATGACCACGCAACCGGCGCTGTCGCCGTTCTGCGTACCCTTCGCCGGATCGTAACCAATCCAGACCTCCTTCCAGCCAAACGGGCGCAGCGCCAGCGCTTCGAAGTCGTCCCACACTTCCCAACTGTCAACCATACATTTCTGCAGCATGGCAAGCTGGAACACCGACGCGAGGTCATCCATAAACACGCACATCAGCAGGTTTTGATAGTCTTCAGGGCTGTAGCGCGTGCGCAGCTGTTCCAGGTCGAACAGGTCACAGCCACCGCGCACGGCATCCTCAACGGTGACAATCTGGCGGAACTGGCCGTCAGCACACAGGCGGCCAGCGGCAAGCGATGCGTGACTTAAATCAATATCAACGCGGTCAGCTTTTGCGCGGCCCCTGTTGAACTGCGCGCCAGACCAGAACGGATAGGCGCTGTGTGTGAGGCTGGACGGCGTGGAAAAATAGGTTTCGCGCCATTTTTTGTGCAGCGCCATGCCAGAGGCAACTTTCTGCAGTTCCTGAAATTTCGGTATCCAGAAGTATTCGTCCAGGTACAGGTTGCCGTGATAACTCTGCGCCGTGCGGGCGTTGGTGCCGAGGAAGTACAGGCACGCGCCATTATCCAGCGTCATCGGGTCGCCTTTCAGGTCTACGTCCGCCTCGCGGGCAAACTCCACGATGTACTGCTTGAATACGTGCGCCTGCGCTTTACTGGCCGACAGGAAAATCTGGTTTCGTCCGGTGGTCAGCGCATCGATCAGCGCTTCGCGGGCAAAAAAGAACGTCGCCCCAATCTGACGCGATTTCAGCAGATTGCGGACGGCATATTTATTGCCCGCTTCCCACCACTGGCGCTGATAGCCGAACATCGTGCTGTGAAAGATATCCTGCAGTTTTTCGATCTGCGCGTCGCTGAATACGTTTTTTTTAGGCGGTTTGCGTGGGCCTGTGTTGCGGTTTTCGACATTGGGATTTAAAACCGCCTCATTGCCGCCGTTGTTAAATTTCCCGATGCGCGCATGGCGCTCTGACTGCCTGGCCAGCAGGTCGATTTCTTTGTAGTCCTTTCCTTCCTTCACCTCTTTCATGACCAGCTGACAGTAGCGCGCAGCAGTGGATAGCTGCATCTGGTCAAGCGGGCCATATGAGCCCCACTTGTCGCGCTTTTTCCAGCTGTGAACGGTTGCGGGTTTCTCTCCCAGCATTTCAGCAATGCGGGCGATGCGGTATCCCTGAAAGTAGAGCAGCAAAGCCTGTCTGCGGGGATCAAGGTCTGCGGGGGCGATTGTCGTTGTCATGTCCCCAAAATACGGGCCGCCCGTTTCCTTTTCCGCCGCCCGTGATTGTGTGAATTACGGTACATCCTCGCCGCGTTGTTTCAGTGCCCCTGTCGCCGCAAACATAGGGACTCACAGAGTTTTTATCTAACCGGAGCCTGGACAATGGCAAAGAAAGCAAAGCGTTTCCGCATCGGGGTGGAAGGTGCCACCACGGACGGGCGCACCATCGTGCGCAGCTGGCTGGAACAGATGGCGGCCAATTACGATCCGGCCGTTTACACCGCCGTGATCAACATGGAACACATCAAGGGTTACACGCCTGACAGCGCGTTTCGCCGTTTTGGTGTGGTCGATGCGCTGGACACTGAAGAAATCAGCGACGGCCTGCTTAAGGGTAAGCTGGGCCTGTACGCGGTGATCAACCCGACGGATGAGCTGGTTACGATGACCGGCAACATGCAGAAGCTTTTTACCTCGATGGAGATTCGCCCGGAGTTCGCCGACACCGGCGAGGCGTATCTGATCGGCCTGGCCGTTACCGACGATCCGGCCAGCCTAGGAACCGAAATACTGCAGTTCAGCGCCAGTGCAGGTGCTAACCCGCTGGCAAACCGCAAGCAGCACCCTAACAACCTGTTTACCGCTGCCACCGAAACCGTGATCGAGTTTGAGGACGTGGCCGACGATAAGCCGTCCCTATTCAGCCGGGTTTCCGCGCTTTTCAGCAACAAGCAGAAATCTGATGACGCCCGTTTCGGTGACGTTCACAAGGCCGTCGAACTGGTCGCTACCGAGCAGCAGGAATTCAGTCAGCGCATCGAAACCGCCCTGAGCGAGCAGGCCAGCAGCCTGCAGGCGCAGTTTACTGAGGGGCTGAGCGCGGAAGTCGCAGCCCGTGAGCAGCTGCAGGCGGATTTCAGCCAGCTGCAGGAGCGGTTGAGCCGTGAAGACGGGCGCCAGGACTTCCGCCCGCGCACGCCCGGTAATGGCAGCGGCAACAGCCAGGACGTGCGCACCGACTGCTGATGCAGTTGCGGCAAACCCCATTAACGAACAGAGAACACAAACAATGAAAAACAATACCCGTTTTAAGTTAAACGCCTATATGTCGGTACTGGCGGAAATCAACAAAATCAACCTGTCGGCCCTTAACAGCAAATTTACTGTTGAGTCGTCCATAGCGCAGACGCTGGAAACCAAAATCCAGGAGTCATCCGCATTCCTGCAGGCGATCAACATCACACCAGTCGATGAGCAAAGCGGCGAGCGTCTGGGGCTGGGTATCGGTCAGACCATCGCAGGCACAACCGACACCACGCAGAAAGAGCGTGAGCCAACCGACCCGACCTACATCGATGGCGACGGCTACAAATGCACACAGACCAACTTTGACACGGCGCTGCCTTATTCAAAGCTGGACATGTGGGCGAAGTTTTCCGACTTTCAGGTACGCATCCGTGATGTCATCGTGAAGCGCCAGGCACTGGACCGCATCATGATCGGCTTTAACGGCCTCAAGCGTGAGAAAACCTCAAACCGGGTACAGAACCCGCTGCTGCAGGACGTAAATATCGGCTGGCTGGAAAAAATCCGCCAGGAGAAACCTTCGCAAGTAATCAGCCAGCGAATCGACAACAGCGGCAAGGTTGTCGCCGGTAACATCACCATTGGCAAAGGGGGCGTATTTAATAACCTTGATGCCGTAGTGATGGGGGCTGTGTCTGAAAAAATCGCCGTGCAGTATCAGGATGACACTGAGCTAGTTGTGATCTGTGGCCGCCAGCTGCTGGCCGATAAATATTTCCCGATCGTCAACAAAGACCAGCCCAACACCGAAGCGCTGGCCGCTGACTTGATTATTAGCCAGAAGCGTATCGGCGGCCTGCCTGCGGTGCGTGCGTCGTTCTTTCCGGCAGATGCGTTGCTGATTACGCGCCTCGATAACCTGTCCATTTACTGGCAGGAAGAAACGCGCCGCCGCTCCATCATCGACAACCCGAAACGGGATCGCATTGAAAACTTTGAGTCGGTCAACGAGGCGTATGTGGTTGAAGACTACGACTGCACCTGCCTGATTGAAAACATTGAAATGCTTGATCAGGAGCCAGAACCGGAAGCGGGACAGATGAGCGACGCCGAAATCGCGCGCATTGCCTCTGTTGCGGCAAGCGTCGTCAAGGCTATGAGCGAATCCGGCACGCCCCATGCGCAGGCTGGCACCGACACCGCAGGAGAATAAACCGTGACCAATCCTTTCCGCGCGCACACGCGCTTTATCCAGGCACAGGAGGCCGCCCAAAGGGGCAGCAATAGCCGACATGCGAAAGGCTATGACCTGATGCTGTTGCAGCTCAACGAAGACCGCCGCCGCCTCAAGGGCATTCAGTCCAATGTCAACAAAGCGCAGGTAAAGATTGAGGTACTGCCGAAGTATGCCGCCTGGGTAGAGGGCGTGCTGAGCGTCGACGGGGCGCAGCAGGATGACGTGATCATGTACGTCATGCTGTGGCGTATCGATGCCGGTGATTATGCCGGTGCGCTGACGATTGGCCGCCATGCCCTTAAGCATGGCTGGGTGATGCCGATCGGCAAGCGCACCACATCTACGGTGCTCGCCGAGGAAATGGCTGACGCCGCAAAGGCCGCCATTCTGGCCGAAACCCCGTTTGATGCTGATTTGTTACTGCAGACGCTGGAATCTGTTGACGGGGAAGACATGCCGGATCAGTCCCGCGCGCGCCTGCACAAATCCATTGGCTGGGCGCAAACCGGAAACAGCCCGGTGTCCGCACTGAATCATCTTAAGCAGGCCCTGCAGTTGGACGAAAGGTGCGGGGTGAAAAAAGACATTGAGCAGCTGGAGCGGAAACTCCGCAACAGCTGATAACCGAACGTGCCCCGCGCACGGGCGGCACGGGGTGGCGAAAGGCACTGCCACATCAAAACCCCGTCCACCGCCCACTTATTCAGGAGGAATCGGCATGAAGTTTGTTGCGCCCGAACAGGCACCGGAACAGACGGAAGTCATCAAAAATACGCCGTTCTGGCCTGATGTGGACCTGTCGGAATTTCGTAGCGTGATGCGCACTGATGGCACGGTAACGCAGCCGCGTTTAAAGCAGGTCGTACTGACGGCAATTTCCGAGGTAAACGCTGAGCTGTACGACTTCCGCAACCGTCAGCAGACGCTGGGCTATCGGGCACTGGCTGAGGTTCCGGCAGAAATGCTGGACGGCAAAAGCGTGCGTATCCGGCACTACCACACCGCCGTTTTTTGCTGGGCACGCGCCGTGCTCAATGAACGTTATCAGGACTATGACGCCACGGCGTCTGGCGTGAAGCGAGGTGAGGAGCTGGCGGAGGCCAGCGGCGATCTGTGGCGCGATGCCCGTTGGGCCGTCAGCAGGTTGCAGGATGCACCGCACTGTACGGTGGAGCTTATCTGATGAAAGTGCGTGCGCATCAGTATGACACGGTGGACGCGCTTTGCTGGCGTTATTACGGGCGCACGCAGGGGGTCACTGAGCAGGTTCTGCAGGCAAATCCGGGGCTGGCTGAGTACGGCCCATTTTTACCGCACGGGCTGCAGGTGGAACTGCCGGACATTACGGCGTCAACCACGGCGCAGACCGTCCAGCTATGGGACTGAATTATGACGCTTGAACGAATCAGCGCCTTTATCACTTACTGCATCGCCGTGCTGCTTGCATGGCTGGGCGATCTGTCGCTCAAGGATGCGTCAACGGTTGGCGGAGTACTGATTGGTGTGCTGATGCTGGCTATCAACTGGTACTACAAACACCAGTCTTTCAAATTGTTACGTGGCGGCAAAATTTCGCGGGGGGAATATGAATCCTTCAATCGTTAAGCGCTGCCTTGTCGGGGCGGTACTGGCTATTGCCGCCACGCTGCCCGGTTTTCAGTCGCTTCATACCTCTGTTGAGGGGCTGAAGCTGATCGCCGATTACGAGGGATGCCGTCTGCAGCCTTATCAGTGCAGCGCGGGTGCGTGGACTGACGGGATCGGCAATACGTCCGGTGTGGTGCCGGGAAAAACCATCACGGAACGGCAGGCGGCGCAGGGACTTATCACCAACGTGCTGCGCGTGGAACGGCAGCTTGAAAAGTGTGTGGTGCAGCCGATGCCGCAAAAAGTCTATGACGCGGCGGTGTCGCTTGCTTTCAACGTGGGCACCGGCAACGCATGCAGCTCTACGCTAGTTACGTTGCTGAATCAGCAGCGCTGGGCTGACGCCTGCCATCAGCTGCCGCGCTGGGTGTATGTCAAAGGTGTGTTTAATCAGGGGCTGGATAACCGCCGCGCGCGGGAAATGGCTTGGTGCTTAAAAGGAGTATAGATAAATGAAATGGTTAAAAGTTACTGGCTGCCGGTTTCGGTTCTGGTGCTGCTTGTAATGGTAGATGTGATTTTTCCTGCATCTTATGCGGCTTTTCCGCTGGCGCTGATCATCTGGTTTGAGTATGCCGCATTTTCACTGGTCTGCTTTGTAGGGCTTTACTCCTGCACGCTGACGGGGAGTGACCGGCTGCAGGTACGCCATTTGCTGGGGAGGGTGTTGGAGTTGTTGGGCAGAATACCTCTCACTTGGTATCAGCGTCTCGTTATTGCCTTTGTTATGTCGCTTGCCGGATGGAAGCTCACGGGGATGGTTTTTGTTTTTACGGTTGCCATGAGTCTGGTAATTCAAGATGAGCTTAAGGCTATGCGGGAATGAATCGTTTACTTGCGTTGGTTCTGGCGCTGGCACTTGCGGCGCTGGGCTGGCAGTCATGGCGGCTTAACAATGCCAGCCACACCATCGAGATGCAGGGCGCGGCATTGAAAAGCAAAATGCAGGAACTGACGAAGAAAAACAGCCAGCTGATCGGCCTGTCCATTTTGACCGAAACCAACAGCCGGGAGCAGATGCGGCTCTATGCAGCGGTGGAGGATACCGCCGCACTGCTGCGTAGCCGTCAGCGCCGGACAGAGGAGTTAAAACGTGAAAATGAGGATTTGCGCCGTTGGGCTGACACTCCTTTGCCTGCTGACATTATCCGGCTGCGGGAGCGTCCGGCCCTCGCCGGAGGTGCAGCTTACCGTGAGTGGCTGTCCCAGAGTGACGCAGTGCCGTCTGGAAAGGTCAGCACCGCGCAGTAACGGCGATCTGAATGCGGCGCTGGATGAAACCGAGGCCGCCTGGGCGGTCTGTGCTGACAAAGTGGACACGATTATTGCATGTCAGGAGCGAAACAGTGAACAAACCGCAGTCCTTACGCAGCGCCCTGAATAAAGCGGTTGCTTATGTCCGTGATAACCCGGACAAGCTGCACCTTTTCGTTGATAACGGCTCACTGGTGGCAACCGGGGCCAGCTCCATGTCATGGGAATACCGCTACACCCTGAACGTAGTGATCGAGGATTTCAGCGGAGACCAGAATCTGCTGATGGCTCCCGTGCTGCTGTGGCTCAGTGCCAGTCAGCCGGATGCCATCAACAACCCGGCTCTGCGCGAAAAACTGTTTACCTTTGAAGTGGATATTCTGCGAAACGATGTGTGCGATATCAGCATGAGTTTGCAACTGACAGAGCGCGTGCTGGTCAGCACTGACGGTACTGTGTCGAGCGTTGAAGCGGTGCCGGAACCGAACAAACCCGAAGAAATGTGGACGGTGAAACGTGGATGAGCTGCAGAGGGTGGATGACTGGCTGACGGCGCTGTTGGCAAATCTGGAGCCTGCCGCACGCAGCCGTATGATGCGGCAACTGGCGCAACAGCTACGCCGGACGCAGCAGCAGAACATCAGGATGCAGCGCAATCCTGACGGCAGCGGCTATGAGCCGCGCCGGGTGACAGCCCGCAGCAAAAAGGGCCGCATCAAACGTAAGATGTTTGCAAAGCTGCGTACCACAAAATACTTGAAAACCGCCGCCAGTGCGGACTCTGCCAGCGTGCAGTTTGATGGCAAAGTGCAGCGCATCGCCCGTGTTCACCATTATGGCCTGCGCGCTCGCGTCAGCCGCAAAGGCCCGGAGGTCCGCTATGCGGAGCGCCGTCTATTGGGCGTGAATGATGAAGTGGAAACCGTCACCCGTGACACCCTGCTGCGCTGGCTGGCGGGGTGATCTTTGTGCCACCGCTGATACAAGCGCCCGCGCTGACTCCCTTTTCCCTCTGATGGCAACCTTTCGTTATGAATGCACAACTGACCGAAATCATGCGCCTTATCACCAACCTGATCCGCACCGGCACCGTGACCGAAGTGGACCGGGATAACTGGCTGTGCCGGGTGAAAGTGGGTGAGCTTGAAACCAACTGGATTAACTGGCTGACGCTGCGTGCCGGTGGAGCCCGCACATGGTGGTGCCCGTCGCCGGATGAGCAGGTGGTGGTCCTGAGCATGGGCGGCAATCTGGAAACTGCGTTTGCGCTGCCCGCCATCTATTCCAGTCAGTTTGCGCCGCCGTCGGATTCCGTTGCTGGCTGCGTGACGGAGTTCCCGGACGGGGGCAGGTTTGAATATGAACCCGCCACCGGACGGTGGCATGTCCGGGGTATCAAATCCTTGGTGATCGAGGCGGCGGACACTATCACCCTCAAAACAGGTGAGTTTGTGGTGGAGGCTGATACCACGCGCATTAACAGCGAAATGGTAATCAACGGTGGCGTCATCCAGGGCGGCGGCGCGATGAGTTCCAACGGGATCGTGGTTGATAAACACGGTCACACCGGCGTTAAGTCCGGCGGCGATACGTCAGGAGGTCCGGTATGACGCTGTATATCGGCATGAGCCAGGGCAACGGCAAGGTCATTACTGATACGGACCATCTGCGCCAGTCAGTGCGGGATATTCTGCTGACTCCGCAGGGCAGTCGGATTGCCCGCCGGGAATATGGTTCCCTGCTATCCGCCCTGATTGATCAGCCGCAGAACCCGGCGCTACGCCTGCAGGTCATGTCTGCGGTCTATGTGGCGCTGAGTCGCTGGGAGCCACGGCTTACGCTGGATTCCATCACCATCAGCAGCAATTTTGACGGCTCCATGGTGGTTGAGCTTACCGGGCAGCGCAATAACGGCGCGCCGGTTTCCCTTTCGGTATCTACAGGAGCAGACAATGGCAGTGATTGACCTTTCCCAGCTGCCCGCGCCGCAGATAGTGGACGTGCCGGATTTTGAGACGCTGCTTGCTGAGCGCAAGGCCGCTTTTGTGGCCCTTTATCCGGCGGATGAGCAGGACGCGGTACGGCGCACGCTGGCGCTGGAATCTGAACCTGTCACTAAGCTGCTGCAGGAAAGCGCCTACCGCGAAACCCTGCTGCGCCAGCGTATTAACGAGGCCGCGCAGGCTGTCATGGTGGCGTATGCCATCGGTAGCGATCTCGAACAGCTGGCAGCCAACTATAACGTGAAGCGTCTGACGGTAACGCCTGCCGACAACAACGCGATGCCGCCGGTCGCGGCAGTAATGGAAAGCGACGAGGCGCTGCGCCCGCGTATTCCGGCTGCATTTGAGGGATTGTCCGTTGCGGGACCGACGGCAGCCTATGAGTTTCACGCCAAAAGCGCGGACGGGCGCGTGGCGGATGTTAGCGCAACCAGCCCGGCACCGGCGCAAGTGGTGCTTACTGTACTGAGCCGTGAAGGTGACGGTACGGCAGGAGCTGACCTGCTGGCGGTGGTTGATCAGGCGCTTAACAGCGAGAACGTGCGCCCGGTGGCAGACCGCCTGACGGTGCGCAGCGCAGAAATTATCCCGTATAGCGTGGATGCGACGATCTTTCTTTATCCGGGGCCGGAAGCAGAGCCGGTGATGGCGGCGGCCAAAGCCAGCCTGCAGAAGCACATCGCCAGCCAGACGCGGCTGGGGCGCGATATCCGCCGCAGCGCGATTTATGCCGCGCTGCACGTTGAGGGCGTCCAGCGTGTGGAGCTGGCCTCCCCGCTGGCAGATGTGGTGCTGGATAAGACGCAGGCAGCGTCGTGTACGGAATGGAGCGTGACTAACGGGGGCACGGATGAATAGCCTGCTGCCGCCCGGTTCATCGCCACTTGAGCGCCGACTGGCGCAGACCTGCAGCGGGATATCCGATCTGCAGGTGCCGCTGCGCGATTTGTGGAACCCGGCAACATGTCCGGTCAACTTTCTGCCGTATCTGGCGTGGGCGTTTTCCGTTGACCGCTGGGACGAAAGCTGGGTGGAGAGCGTCAAGCGCCGTGTGGTGCAGGATGCTTTCTATATTCACCAGCATAAGGGGACAGCCAGCGCAGTGCGGCGCGTGGTGGAGCCGTTCGGCTTCCTGATCCGCATCATTGAGTGGTGGCAGACCGGCGAGCAGCCGGGCACGTTTCGCCTGGACATTGGCGTACAGGACCAGGGCATCACGGAAGAAACCTATCTGGAACTGGAGCGCCTGATAGGTGACGCCAAACCCTGCAGCCGCCATCTGATCGGCATGTCCATCAACCTGCAGACCAGCGGCCCATATTTTGTGGGCGCAGCCACTTACATCGGCGAAGAAATCACGATCTACCCGTATATCAATGAAACCATTATTTCCGGCGGCACCGCTTATGAGGGCGGGGCGGTCCATGTTATTGACACAATGAGAGTGAATCCATGAGCGCAAAATTTTATACCCTGCTGACGGAGATCGGCGCGGCGAAACTGGCAAGCGCCGCCGCGCTCGGTGTCCCGCTGAAAATTACCCAGATGGCGGTGGGTGACGGTGGCGGCGTGCTACCCGCACCCAGCGCACAACAGACAAGGCTGGTTGCTGAAAAGCGTCGTGCTGCTCTCAATATGCTGTATATCGATCCGCAGAACAGCAGCCAGATTATTGCTGAACAGGTGATCCCCGAAACGGAGGGGGGTTGGTGGATCCGTGAAGTCGGTCTGTTCGATGAGACCGGCGCGCTGATTGCAGTGGGTAACTGCCCGGAGAGCTACAAGCCGCAGCTGGCGGAGGGAAGCGGGCGCACGCAGACCGTGCGCATGGTGATGATTACCAGCAGCACCGACAATATCACCCTGAAAATTGACCCGGCAGTGGTGCTGGCAACCCGCAAGTATGTGGATGACAAGGTGCTGGAGCTAAAGGTATATGTGGATGACATTATGGCGAAGCATATCGCCGATAACGATCCCCATACGCAGTACGCGCCGAAGTCCAGTCCAACATTCACTGGCATGCCAAAAGCGCCGACGGCGGCGACAGGCAATAATTCTACACAGCTTGCCAACACGGCTTTTGTGCAGGTGGCAATTGCCGCGCTTGTTGATTCCTCTCCGGGGGCACTGGATACGCTGAACGAACTGGCTAAAGCGTTAGGTAACGATCCTAACTTCGCCACCACCATGACCAATGCGCTTGCTGGAAAGATGGATAAATCAGCTAATGGCGCTGACATCGCTGATATTTCAGCATTTTTGAATAATCTTGGTCTGGGCGCTGGCTCAGCACTTCCCGTTGGTGTGCCTGTCCCGTGGCCTCTTGCAGCAGCCCCTGATGACTGGATTAAGTGCAATGGTGCGACTTTTGATAAGGCTAAATATCCGAGGCTTGCTATGGTTTATCCTTCCGGTAGCTTACCTGATTTGCGTGGAGAATTTCTGCGTGGCTGGGATGACGGACGCGGTGTTGATGGTGGAAGAGTAATTTTAAGTTCTCAGGATGCTTTAGTTGCAGGGCATGTTCATACGTTAGCCAGAATGTGGTCCTCATCGGATGAAACGAATAGTTCCGTCAAGCATTTAGGCGTATCTAACAATATACACAACACAACGAAAGATAATATGGGGAACGGGATTCTCGAGGAAGCTGATGGCGGATTAGGGATTGTTGTTGGTTGTGGCGCTGGCGGCAATTTTGCGTCTACCACGGCAATAAAAAGCAACTCATCATCAACAGATAACAGACCACGAAATATTGCATTTAACTTTATCGTGAGGGCTGCATAATGCAAAGCGCAGTAATAGAAAATGGCTTCGCTGTCATCGCGGGGGAAATTGATGTATTCAATTATGACGGCCTTACACGGGTTTATTTGACACAAACAACGGAATTTATTCCGGTGGGTGTCAGCATTCCGGCAAATGCCTGTACGGATAAGCCACTGGCAGCAAAAAAAGATTATGTTGTCTGCCGGAACAGTCAGTTGACCGGATGGGAATATCTGGCGGATCATCGCGGCGAAACGGTCTGGAATATCAGAACCGGGGCAGAGCAGCAAATTACCGTGCCGGGAGATTATCCTGCTGATACCACTATCTACTCACCATCGACACCGTATGATAAATGGAACGGTGAACGCTGGGTAACAGATGAGGCGGCGAAAGCAGAGGCAGACATTGCCGAAGCAGCAGCAGCAAAAGCAGTGCTTATCAAAAGCGCCGCTGCGAAAATAGAACCCCTACAGGATGCAGTTCAGCTGGATATGGCAACCGATGAGGAAAAGAGCCGCTATGACGCCTGGCGTAAATACCGTGTATTGCTGACGCGCGTGGATATATCGCAGGCACCTGACATTAACTGGCCTGAACCTCCAAAAGATTAATCCTGCCCCCCGCCTGCGGGGTTTTTTTTACTCATCCCATTGTGCCATTACCCACACATAGCCCGGCGCGTGCGCCGCGTGCATATCAACCAGAACATAGGCTCACCCCTGTAAACCGGAGAGACTGCCTTATGGCTCAGGATTACCACCACGGGGTGCGCGTTGTTGAAGTCAACGAGGGCACCCGATCCATTACCACGATGAGCACCGCCATCGTGGGCATGGTCTGCACCGGCGATGATGCCGATGCGTCCATGTTTCCCCTCAATAAGCCGGTCCTGCTGACCAATGTGCTGACCGCCAGCGGAAAAGCGGGCGAGTCCGGCACGCTGGCCCGTTCGCTGGATGCGATTGCCGATCAGGCTAAACCCGTGACCGTCGTTGTGCGCGTGGCTCAGGGAGAAACCGAAGCGGAAACCACCTCCAACATTATCGGCGGTGTGACTGCTGACGGTAAAAAAACGGGTATGAAAGCACTGCTTTCGGCGCAGTCGCAGCTGGGCGTCAAGCCGCGCATTCTCGGTGTGCCGGGGCATGACACGCAGGCGGTTGCTACTGAGTTGCTGAGCGTGGCGCAGAGTCTGCGCGGGTTTGCGTATCTGTCCGCCTATGGCTGCAAAACGGTGGAAGAAGCGATTGCCTACCGTGGCAATTTCAGTCAGCGCGAGGGGATGTTGATCTGGCCTGACTTCATCGACTTTGACACCGTGCTGAATGCAGACGCGACGGCTTACGCCTCCGCCCGTGCGCTCGGCCTGCGCGCCAAAATTGATGAACAGACCGGCTGGCACAAAACCCTGTCCAACGTGGGCGTGAATGGTGTCACCGGCATTTCTGCTGATGTGTTCTGGGATCTGCAGGAGCCGGCAACCGATGCGGGACTGCTGAACCAGAACGACGTCACCACGCTTATCCGCAAAGACGGTTTCCGCTTCTGGGGTTCCCGCTGCCTCAGTGACGATCCGCTGTTTGCCTTTGAGAACTACACCCGCACGGCGCAGGTGCTGGCGGACACCATCGCAGAAGCGCACATGTGGGCGGTGGATGGCGTGCTCAACCCGTCGCTGGCCCGTGACATTATCGAAGGTATCCGCGCCAAGCTGCGCAGCCTGAAGACGCAGGGCTACATCATCGGCGCAGACTGCTGGCTGGATGAGTCGGTGAACGATAAAGACTCCCTGAAAGCCGGGAAACTCACTATCGACTACGACTACACGCCGGTGCCGCCGCTTGAAAATCTGATGCTGCGCCAGCGCATCACCGATCGGTATCTGCTGGATTTCTCCAGTCAGGTCAGCGCGTAAGGGGACACTATGGCTTTACCACGCAAGTTAAAACACCTGAACCTGTTCAACGACGGGAACAACTGGCAGGGGATCGTTGAGTCTCTGACCCTGCCAAAATTCACCCGCAAGTTTGAGAAGTATCGCGGCGGCGGGATGCCGGGCGCGGTGGACGTGGATATGGGGCTGGATGACGGCGCACTGGACACGGAATTTTCAATCGGCGGCACCGAACTGCTGTTATTCAAGCAGATGGGCAAGGCCACGGTTGACGGTATCCAGCTGCGTTTCACCGGCTCCATTCAGCGTGACGATACCGGCGACGTGCAGGCCGTTGAGCTGGTTGTGCGCGGGCGTCATAAAGAAGTGGATTCCGGCGAGTGGAAAACCGGAGAGAGCAGCACCACAAAAGTCAGCAGCACCAACAGCTACGCGAAGCTGACCATCAACGGTGAGGTGCTCTATGAGGTCGATCTGGTGAACATGGTTGAAATCGTTGGCGGCGTGGACCTGCTGGAAGAACACCGTAACGCCCTGGGCCTTTAACTTTAACGGCGCGTGCAGTCGCGCCAGTATTTCATTAACAGGAAACGAACATGAGCGACAAACTGACTGAAAAGACCGTAAAGCTGGATACCCCCATCATGCGCGGCAAAACTGAAATTACCGAAATTGTGCTACGTAAGCCGCAGTCCGGCGCACTGCGTGGCACCCGTCTGCAGGCCATTATGGATATGGATGTGGGCGCGATGATGACTGTGATCCCGCGTATCTCCACGCCGACGCTGACCGCGCAGGAAATGGCAGAACTTGACCCCGCCGACCTGACAGCAATGGCTGTAGAGATGGTTACTTTTTTGTTGCCGAAGTCGGTGCTTGCCGATTTACCGACAACCTGACGGTTGATGATCTGGTGGCAGACATTGCCACCATCTTTCACTGGTCGCCGTCCGTCACTGACGTTATGCCGCTGACTGATGTGCTGGAGTGGCGGCATAAGGCAATTCAGCGAAGCGGGGCCAGCGATGAGTGATAATAACCTGCGACTGCAGGTCGTTCTGGGGGCGGTGGATAAGTTAACCCGCCCGTTTAAAAATGCGCAGGCTGGCTCTAAGGAGCTGGCATCAGCTATTCGGCAAACCCGCGATCAGATTAAGAAGATGAGTGATGCTGGAGGTCAGCTTAACTCTTTCGATCGGCTAACTCAGAGTGTTAGCCGTACTGGCACCGAACTAGATCAGGCGAGGTTACGCGCTCAAATGATGACGCGCGAAATGTCATTATTGGAATCCCCAACCAAAAAACAAACGCAGGCGCTTGAAGCTCAGTGGCGTGCGGTTTCACGTCTTGAACAAAAACAACAGCAGGAAACTCGTCAGATGGCTGTAGCCAGGGCTGAGCTTTATCGGCTGGGGTTATCTGCTGGGGGCGGAGCACGCGAGACGGCGAGGATTACGCGAGAAACTGAGCGATATAACCGACAGTTAGCTGAACAGGAGCGCAGGCTACGTGAAGTTGGCGAACGTCAGCGAAAGCTCAACGCTATCAAAGCCAGGGCTGAAAAGACCCACGAGTTAAGGAACTCTTTGGCAGGTAATGGTGCAGGGGCGATGGCGGCTGGGGTAACTACCGGCATGACGTTGCTGGCTCCAGTAAAAGCCTATTCAGAATCAGAAAATGCAGCGAATCAGCTTGCCGGTTCAATGATGGGACCGGGCGGGAAGGTAGCGCCTGAATTTGAAAAAATTAACCGGCTTGCGGTTGCTTTGGGCGATAAGCTGCCGGGAACAACGGCCGACTTTCAGAATATGATGACGATGCTGCGCCGTCAGGGGATGTCAGCGCAGGTCATCCTAGGCGGCTTGGGTGAGTCAGCCGCCTATCTTGGCGTGCAGTTACAGATGGCTCCCACTGCGGCAGCAGAGTTTGCAGCTAAGTTACAAGATGCTACTCAGACTTCCGAAAAAGACATGATGAGCCTGATGGATGTGATCCAAAAAGGGTTCTACGCGGGGGTAGATTCAGGAAATATGCTGCAGGGGTTCTCAAAAATCAGCAGCGCGATGGACATTATTCATAAAAAGGGATTGGACGCAGCTAAAACATTTGCCCCTCTATTAGTTATGGCCGATCAGGCTGGTATGGCTGGAGAGTCTGCTGGTAATGCCTACCGAAAAGTATTTCAGTCCGTCATGAATACGGAAAAAGTGAAGGATGCTAACGATGAGCTAAAAGGCACTGGCGTTAGGTTCGAGTTTACTGATGGCAAAGGTGAGTTCGGTGGCCTGGAAAAAATGTATACGCAGTTGGCTCAACTCCAAAAGCTTAATACTGAGAAAAGGTTAGCTACGCTAAAAGGTATTTTTGGTGATGATGCGGAAACGCTGCAGGTGCTGAATATTATGATTACTAAAGGCATCTCAGGGTATAGCGAAACGGCGTCAAAGCTACAAAATCAGGCTTCGCTGCGAGAGCGTGTTGATGCCTCTTTAAATACGCTTGGTAATAAATGGGAAGCCGCTACAGGTTCCTTTACCAATGCTATGGCTAGCATCGGTGAAACAGTTGCCCCAGCATTAAAAAAGCTGTCTGACTGGCTGGGTGAATTGGCGTCACGTCTGGATGGTTTTGTTAAGCGACACCCGCAATTAACGTCAGCATTGTTTACGCTGGCAGCAGGGTTTGCCATTGTTGCCACTGCCGCGGGGGGTGTTTCACTGGCATTGGCGTCAGTGCTGGGGCCGATGGCAGTAGTGCGAATGAGCGCAGGAGTTATGGGGCTAAAATTTTCATCTGTATTTGGTCTTATTGGGAAAGCAATCAGTTCTGTTGGCAAGTCAGTTGTATGGCTTGGCCGATTGATGTTTGCAAACCCTATATTGGCTGTCATTGGGCTGATCGCCGCTGGCGCTATTTATATCTGGCAGAACTGGGACACGCTCGGGCCAAAATTCAAGGCCATGTGGGATGCCGTATGTAATGCCACAGCTACGGCATGTGATTGGATTAAAGAAAAAGCCAGCGCCACATGGGAGGGGATTAAGTCACTGTTCTTTAATTATACCTTACCGGGATTAATTGCCAAAAATTGGGATGCAATACAATCTGGCGCTTCTGAAGCGTGGGCCAATATAAGACAATCTATTAGCGATAAATGGAATTCGATCCTGGCTGATGCCGCCGCGCTTCCTACGAAATTTCAGGATATGGGCAGCGCCATTATTGACAGTATCCTCAATGGAATTAATACCAAATGGGAGACACTCAAAAGCAAGTTTTCCTCAGTCACCGATTATCTGCCTGACTGGATGACCGAAAATAATAAAACACAAGGAAAAGCACAGGTGCAGGTGGTTGGTGGTGCAGCGGCTGCTGCGGTTCCGTTTGCCGGGATGTATGACAGTGGTGGAGTTATTCCGCACGGTCAGTTCGGTATTGTTGGGGAGAACGGGCCCGAAATTGTGAACGGTCCCGCAAATGTGACCAGCAGACGACGAACTGCCGCGCTGGCTTCCGTTGTTACAGGCGTCATGGGCGTAGCGGCAGCACCTGCGGAGGCCGCTCCGCTGCATCCGTACAGCCTACCGGTAATAGCATACAAACAAAGTCAGCCTGCGAAATCTGCCAGCGTACCGCCTGCGATCCGTTATGAGATTAACGCGCCCATTCATATCACTGCCCAGCCTGGGCAGAGTGCGCAGGATATTGCCCGCGAAGTCGCACGGCAGCTTGACGAGCGTGAGCACAAAGCAAGGGCTAAAGCGCGCAGTAATTTCAGCGATCAAGGGGGATATGATTCATGATGATGGTGCTGGGGTTATATGTCTTCATGCTGCGTACAGTGCCGTATCAGGAGCTGCAATATCAGCGAAGCTGGCGACACGCCGCCAACAGTCGGGTGAACCGCAGACCATCAACGCAGTTTCTTGGCCCGGATAACGATTCACTGACGTTATCTGGCGTACTGTTGCCGGAAGTCACCGGCGGCAGGCTGTCATTGCTGGCACTGGAGCAAATGGCAGAGCTGGGCAAAGCATGGCCCTTGATTGAGGGAAGCGGGACCATTTACGGCATGTTTGTGATCGAGAGCCTGAGCCAGACCAAATCGGAATTTTTTGAAAGCGGAATGCCTCGCCGTATTGAATTTACGATGACCCTGAAAAGGGTTGATGAGTCGCTGTCTGATATGTTCGGCAGCCTCAGCGATCGGCTCAGTAACCTGCAAGACTCTGCAACGTCTGCGATAGGTAATATTAAAAATACGGTTGGAGGGTTACTGCAGTGAATTTTAGTTCCGATCATTTCGACCTGAACAGAAGAAGCCCGGCTTTCAGTATCACAATTGAAGGTAAGGACGTGACTACCGCGCTGGATGCGCGCCTGATGAGTCTGACGCTCACCGACAACCGGGGTTTTGAGGCTGACCAGCTTGATCTGGAGCTGGACGACGCAGACGGGCAGATCGTTCTGCCGCGACGTGGTGCCGTTATTCAGCTGGCGCTAGGATGGAAAGGCCGGCCGCTTTTCCCAAAAGGGGCATTCACCGTGGATGAGATTGAGCATAGCGGGGCCCCTGACCGCCTGACCATTCGCGCACGTAGCGCCGATTTTCGTGAAACCCTCAATACCCGCCGTGAAAAGTCATGGCACCAGACAACCGTAGGCGATGTGGTAAAGGAAATCGCCTCACGGCACAACCTAAAGATGGCGCTGGGTAAAGACCTGACGGACAAAGCTGTTGATCACATTGACCAGACAAATGAAAGCGATGCCAGTTTCCTGATGAAGCTGGCGCGTCAGTATGGGGCGATTGCTTCCGTAAAATATGGAAACCTGCTGTTTATCCGGCAGGGGCAGGGAAGAACGGCGAGCGGTAAGCCACTGCCGGTAATCACTATTGAGCGCAAAGCCGGTGACGGCCATCGTTTTACCCTGGCCGATCGTGGTGCTTATACCGGCGTGATTGCCAGTTGGTTGCATACCCGTGAACCCAAGAAAAAAGAAACAACCAGGGTTAAGCGCCGCCGTAAGAAAACCACCAAACCTAAAGAACCGGAAGCGAAGCAGGGGGATTATCTGGTGGGAACATATGAAAACGTGCTGGTTCTTAATCGTACCTACGCCAACCACAGCAACGCTGAGCGAGCGGCAAAAATGCAATGGGAGCGCCTGCAACGAGGGGTTGCGTCATTCTCCCTGCAACTCGCTGAGGGCCGGGCAGATCTCTACACCGAAATGCCGGTAAAAGCGCGGGGCTTTAAACAGCCTATTGATGATGCTGAATGGACCATTACAACTCTGACGCATAGTGTAAGTGCGGATAATGGTTTCACTACCAGTCTGGAGCTTGAGGTTAAAATTGATGATTTAGAACTCGAATAGTAGTAAATTCACTAAAGTGAATTAAAAAACGTCAAATGAGGTTCTGATCATGATGAATTGCCCTAAATGTGGACACGCAGCGCATACAAGAAGCAGCTTTAGGGTATCGGACCATACAAAAGAACGTTATTGCCAGTGCCAAAATATTAACTGTGGTGCTACTTTTGTTACTCATGAAACCGTTGTGAGATACATTGCTACCCCCAACCTAATTGACCACGCCCCACCACACTCATCAATGGGTGGACAGGGGCATATGAGCTTTTAAAGCCACTGATTAGGTTTATATTTTTTCGATTAAACTGGTTCTGTATTCTTCAGCTTGCTTGAAGTCCATGGAACCAGTTTTTTTACACTCCATCCCACCGCCATTTATTTTGTAACCTTGATCGGACGTGACACTCCATAGAGTAATTTTTTTGATTGTCCCTGGACTCCACTTGTTCATAAAGTAGTCGTCACATATCGCGCTGAACAAGGATTTCGCAGCGTCTATCATTAGCGCTTGCTTGTTGTATCTGATGGCCAGCTCACCACTTTCCAGTACATGAGTTCTGGTATCGTAACCAGCTATCAGATTTTCGATGGAGTTGGGTATTTTGTCCGCGAATGCGCTCTGAGACGTGATCAGTAACAGAGCTAATAAAGGTTTTTTCATGCGAGATCGTCCTTTAAAAGTAAAAGCATGTGAAAAATCACCGCCATTTCACCGCCACTCGGAAACGAGATAACAAAAAAGCCACCTTGAAAGGTGGCTTAATTTACTGATTTAACAGGTGAAATTTGGTGGCCCCTGCTGGACTTGAACCAGCGACCAAGCGATTATGAGTCGCGTGCTCTAACCAACTGAGCTAAGGGGCCTGAGGCGCGGAATTATAATGTAACTACGCGGTACGATCCAGTATTTAACATCCGTTTGATGTATTTTTAGACAAACTATCCTGCCGGGAAGCAGTTGAATATTAAACTTTATCTTGCAGCACGATTCACGCATACCTGTAGCATTAGAGAACAATGCCCTACATTACCCAATACCGCGAAGTAACGTTTTATCTGCCTTCCTGTCATTGGAATTGATCTTGCGGTAGACAGCACTGCTCCAGTGACGGCTGCCGGCATGTTTTTCGCCCCGCCAGCCAAGTTTTCTTAGCTGGCGGGAAATAAGTTTGTTCATGATGCCGTGCCCCATCAGCAAAACATTGCCGTGCTGTGCCAGCTCGCTCAGCCGCCCTGCTGCCTTTACCGCCCGCTGTTTAGCCTGCGCGTAAGATTCAACCTTTCCGCTATAGCCAAGCAGCCATATGACGCGCAGCAGCAGCAACCATACAAAGGGTGGCAGGGTGGGAAATGCCAGCGGTATCACCGGAAGCGCCACCTCGCTGTAAAGACTGTCAATACGGCTTGCAGTCTTGCCGAGCCTTTCCAGCGAAGAGCGTGCCCGAGGTAGCGGGCTGGTGACGATGATATCCGCCTGTGCTGCAAGACGCAGACTGCTGTCGGTAGGTTTATCGCAAATTTCGGCGAGATCGTAGGCATCGCACCATTGCGCCATTGCCAGCGCAGAGCGGCGACCGGTTAGCGATCGATGAGGTTTTCCGTGGCGCATTAGTGTGATAGTCATATTTATCCGCCTGACTGTTGAGGGTCTTTTTTATCTTAGCTGCGGCCAGGCACCTAAATAAGCCTGTGATTCTTAATGCTGTCATCGTTTTGTCATGTCGGGTCGTTATCTTTGATGTGCAAAGCCACAACATGTAAGAATTTTCTTTGTTTATCCACTTTATTCCTGAATATAAAAAAAAATATTTAGTTATAAAAGTTTTAATTCAATGATTTTATTGGTTTTAGTGTTTTTGCCTCAAAAAAACCGCTCTTATATTGGTAAGTTAACTTGATTTATCTACACGCGTAGATACAATGACACCGTAGTGAAAATTCACGGTGTTTGATAGTTGTCAGCTACTACAAGATCAAATTTGTTTGACGTTACTTATGCGGCTGGTCATTTTAGACCGTCGGCGTTAAGCAAATATAGCGTTCCAACACCAGGGAACTTTATAGCGATCCTCAGTTATAGCTTTTTAACGGCGCATCATCACTGGTGAGTTTTCCTCCGCGGAGGAGTGCGTACCAGGGAGGGTGCCAAACAGAGCTCCTCACCTCAGGGAATTTCTCATGTCTACACCTCAAAACGTTGCTCAGGATAGTGCTATACAGGCTGATTCTGCGGCCGATGAACGTTTAACGACTCGCGAAGGACGTAAAGATTTCTGGCGTGCCACCTTCTCATGCTGGCTTGGCACCGCGATGGAATATGCCGATTTTGCACTGTACGGACTGGCCGCCGGTATTATTTTTGGCGATGTCTTCTTCCCCGCGTCAACGCCTGCTATGGCGCTGCTTTCCACCTTTGCTACCTTCTCCGTGGGTTTTGTGGCCCGGCCGATTGGCGCTTTATTCTTCGGCTGGCTCGGTGACCGTAAGGGGCGCAAAGTGGTGATGGTCTCTACCATTATCCTGATGGGTGCCTCCACGACGCTTATCGGGCTCATCCCCAGCTATGCCTCAATTGGCCTGTGGGCTCCCGCCTGTTTAGTCCTGTTACGTTTTACCCAAGGTTTTGGTGCCGGGGCCGAACTCTCCGGGGGAACGGTAACGCTGGGCGAATATGCCCCGAAACAGCGGCGTGGCTTGGTTTCGTCGATCATTGCTCTTGGTTCTAACAGCGGTACGCTGCTGGCTTCCCTGGTGTGGCTGGCAGTGATTCAAATGGATCAGCAGTCTCTGTTAGAGTGGGGATGGCGTATTCCTTTCCTCTGTAGTTCCCTGATCGCGCTGGTCGCGTTGTGGATCCGCCGTAACCTGAAAGAAACCCCGGTATTTGAACGCAAAAAGGCCGAAATGGAAGCGCAGCGCGCGCGGATACGGGTTACTCAGCCTCCTGTGCAGGACACGCGTGGTTTTTGGCGCCGTAGCCGGGCCTTTCTGACGATGGTCGGTTTGCGTATTGGTGAGAATGGCCCGTCTTACATCGCCCAGGGTTTTATTATCGGCTACGTGGTCAAAATCCTGGCGGTGGACAAGTCGGTCGCAACCAGTGCGGTCATGATTGCCTCCCTGCTTGGCTTCCTGATCGTACCGCTGGCCGGCTGGCTTTCGGATCGTTTCGGTCGCCGTATTACCTATCGCTGGTTCTGTCTGCTGTTAATCCTTTATGCTTTCCCTGCCTTTATGCTGCTCGACTCGCGTGAGCCTGCCATTATTATCGCCACCATCGTGACGGGAATGGGGCTGGCATCGTTAGGGATCTTTGGCGTGCAGGCGGCGTGGGGGGTAGAAATGTTTGGTGTGCATCACCGCTACACGAAAATGGCCACGGCGAAAGAGGTCGGCTCTATCCTTTCCGGTGGAACGGCACCGCTGGTGGCGGCAGCACTTCTCTCGTGGACCGGGCACTGGTGGCCGATTGCCACCTATTTCGCGGTGATGGCCGCTATCGGTTTCCTGACCACATTCGTGGCGCCGGAAACGCGCGGGCGCGATCTTAACGCCGCGGAAGATGCGATATAACCACGCATTAATACGCCAGAAACTTTTGGGCCGGAGACTTTCTCTGGCCTTTATTATTTTGATATTAAAGGGAGACCCGTCGTGCCGAAACAGATTAATCAGCGCGCCACCCGAGCTGACGTAGCGAAGGAAGCGGGAACGTCGGTGGCGGTGGTCAGTTACGTTGTGAATAACGGACCACGCCCGGTAGCACTGGCAACGCGCGAAAGGGTGCTGGCTGCGATTAAGAAAACCGGCTACCGCCCCAACAACGTGGCGCGTGCCCTGGCTTCAGGAACGACCAAAACTTACGGGCTGGTCGTGCCAAACGTCAATAATGCCTTCATTGCTTCCCTTGCCCATGAACTGCAGCAGGAAGCGCTGGCGAACGACATGGTCATGTTGTTGGGTGATGCCGGCGACGATCGCAAACGCGAGCTGCAGCTGATCAACAATCTTTTGAGTCAGCAAATTAACGGGCTGATCTACATCAGCGTCGATCGTCATCCTTATATTGATGTTTTGCAGGCCAGCGGTACGCCATTCGTTATGCTCGACAGGGTTGATCCGTCTTTGCAGGTTAACGTTTTACGCGTGGATGAACGTGAAGCCGCGCGTCAGGTGACCTCACATTTACTGAGTCACGGCTATCAGGACGTCGGTATCATCTGCGGCCCGCTGGAAAGGCTCAATTCACAGGACAGGCTGCACGGGTGGCGCGAGGCGCTGGCAGAATATGGCGTTCATGAGCGTCCGGAATGGGTTTTTTCGACTCCTTACACTCGAGAGGGCGGTTACACTGCGGCAAAACGTATGTTAGAGAGCGGCACCCTTCCGCGGGCACTTTTTGCCACCAACGAGGCGCAGGCAATCGGCTGTATTCGCGCGTTGTACGAGCACGGTGTGCGGGTACCGGAGCAGATAGCGTTGGTGTGCTTCAATGGAACCGATCAATCTGCCTACCACCTGCCTTCACTGACTACCGTACGCCAGCCGGTGCGTGAGATGGCAAAGGCCGCCATCAAAATGTTGGTGAACTGGAAAGGAGAAACCACGCTGCGTGAGTTTTCTCATCAACTTGAAATTGGCGAGTCTTGCGGTTGCAGGCCGTCCTGATATTAAAAAGATAAAGATAAATGAAACGTTTAATTATTGATTGCGATCCGGGGAATGGTATTACGGGAGCCAACGTTGACGACGGGCTGGCTATCGCGCTGGCGCTGTCCGCGCCGGAGGTGTCACTTGAGCTTATCACCACCGTAGCGGGTAATACCCAAAGCGAGATCGGCTACAGTGTGGCAAAAGATTTGATCGAACGCTTAGGGCAGTCGGTGCCGGTGATTAAAGGTGCGGATGCCGCATTGAGTGAACCGAGTGCCCCGTGGCGAGCCTCGCTGGATTTGCGGGTGCACAGCCACCAGCTGGCACATTTATGGCAGGGAGTGCGTCAACCGCAGAGCTATTCACCCCCGCCAGTGGAAGCCGCCGATGCAATAGGGCAGCTTATTTGCGCTCATCCGGGGGAGATCACGCTGGTGGCCATTGGCCCACTGACCAACGTTGCCCTGGCGCTGGATCGCTATCCGCAGATGGCCGATGCGGTGCAGGAAATTGCCATTATGGGCGGTGTATTTGCGCTGGACGACTTTATCAAAGACACCAATTTTGGTATTGACCCGGAGGCGGCGCACAGGGTATTGACCAGCGGTGCGAATATCACGTTGGTGCCGATGGATGTCACCAGCCAAACGCTGATGACCCATCAGGATCTTAACCGCATTGAGCAGATAGATACGCCGCTGGCGCGCTTCGTTACCGAAACCCTGCGTCCGTGGATCGATTATTCCATACAAACCCGACGTCTGGCGGGTTGCTGGATACATGATGCGTTGGTGGTGGCCTGGTTACTGAATAAGCAGGTAGCAACTGCGGCAGACTACTTCGTTAATGTGGAGCTGCGGGAAGGGATGACGCGCGGTAAAGCATGGCGCTTTCGTCAACCGCTGCGGCTTGACGTCGGTATCGGCCAGCCGGAGGGCAGACCGGTGCAGGTGTTGAAAACGGTGGATAACTCACTGCTGCTGGCAATGCTGGAGCAAAGTCTGGCCCTGCCGTTGAGGTAGGTTATCCCCGTTGCCGGTCGTGCGGGCCTGTGATTGAGGCCCGCACCGTATCACTTCATAGACTGCCGCCAGCGCCAGCGATCGATAGCCAGTAGGATCAGCAGGCTGATGCCCATTAACGGTAAGCTCACCGCGACCAATATCGTCAATACCAGCAGCAGCAGCCGGGAGCCAGGCGCTAACAGCTGCCAGCAGCCGATCAGCGTATCCAGCGGAGAGATTTGTCCCTGCTGCGGCCGGCGCAGCCACCACATGCGATAGCCGAACGCGATCATCAGGCACAGTCCCAGACCAAACAGCGCCAGTACCAGCTGGTTAGCGACACCAAACAGCACCCCCATATGCGCATCGACTCCCCAGCGTGTCAGTTTGGCCAGCAGGCCGTACTGCGCAAACTTCACCTTATCGATAACCGTATTGTTGCGCGGGTCCACCGCTACGCTATCCACCTGAGTTGGCCAGCTGCGGTCCGTTTCGCTCACCGTCCAGGCTTTATCCGCACGGTAAGCCGGACGGATCTCCAGTTTCGCCGCATCAATGCCTGCGTCACGCGCCGCGCGCAGTACGGTATCAAAATTGAGCGGATTGACGGCAATCGACTTACCGCCGTGCATCATCCTGCTATGCCCATGATGCTCTGCATGCTCATCCATCGGCATATCCATCGCCATGGCAGGTTGCAATGAGGTATTCACCACCGGCGTCATCCAGCCAAAATGACTGCGCATCAGCGAGATATTATCCCCGGCCCAGCGTGACCAGGTTAACCCGGTTGCAGAGAAAAACACCAGTCCGAGCAGCAGCAGCAGCCCAAGCGTGCTGTGCCAATGACGAAAGCGGCCGAGGCGCTGGGCCCGGGTGGGCTGCTTATCGGACGTTATCCGGCGCACTCTGCGGCGGCTTGCCCACAGCACCGCACCACCCAGTGCGGCAACCCACAGCCAGCTGGCCGCCAGCTCACTGTAATGGCGGCCCACATCGCCCAGCAGCAAATTACGGTGCAGATAATCCAGCCAGGTGCGCAGAGGTAATATACCGCTGGTGCCGTATGCGGTTAACTCTCCGCGTACCTCAAGGGTTTTTGGATCGATAAACACCGCGCGAGTTTCCGAGGGGGCAAGATCGTCACGGTAAAACATCACGCGGGTGGTCTCACCCTGATGAGGGGCGGGGCGCACGGCGGCAATCTTCATATTGCTACCGATAAACTCGCCGGCACGCGCCACCTGGGCGGAAAGCGGCTGGTCAATGCCGGTTGACGTGGTAAAGAGCTGATGATGATAAAACGCGTTTTCCAGCTGTGGCGTCACGACATACAGGGTGCCGGTGAGTGCGGCAACGAAGATAAACGGGGCGATAAACAGCCCGATGTAGAAATGCAGACGACGTAGCAGCGACAGCAGGCTGCTTCGGGATGAGGTGGCGTGCGCAGACACGTGCGCGGGTTGCTGTGACATGAGCTCTTCCATCCGCCTGTTTAACAGGCATAAACAAGGTTCTGTGGGTTTTCCACAGGTCAATATGTAAGGAATTAACGTACCGGCAGGCAGCGGGGTGGGGCGCGCGGATGGAAGAAGTGGGGGGCCAAAAGGGAAATCACCCTCTGGCACGGCGGCACATCCGGCGCGGCTACCGCCTGCAATGATGACCACAGCAGCGGGGATGGAAACAGATCAAGGGGCAGATGCACCAGCAGCACACAGTATCCACAGGCGGCATCATCCATGATTGGCGAGGGTGTGTGTGTCGTGTTGCTGATGGCAACGTCTGCCCCGACCGGCGGTTGATGATCCGCCGGCATCGCCATATCCATCTCCATACCCTGATGCAGCATCATGCCGCTGCCGCCATGATGCTGCGCCAGTGATGTTGAAATCACCGGAGCAATAAACAGCAGTAGCATGGCCAGCAGGGCCGCGCATGCGGCAAAACGGCGATGAATGGCAGAGTATGGGATTAAAGACATCCTTTTTCAGCACCAGGAATAATCTGCAAGATTGTACTGGAATTTAGTGCCAGAGGTGAAAAGTAATACCGCCATACCTCTTTATTTTGGCTAAACGGCGTACCGTGGCGTGCCCCACAACCGTCAGCAGCGGCAGCAGGATGTTGGGCAATTTACGCAGATCCCTCCGGTAACACAATGTCGCTTACATTTACATTCTGACGCTAAGGCTGACAAAAGCACCGGGAGTGATTGAAAGATTACATGATATAAACAAATGGTTAACATATCTTTCCGCACTCAAACAGGACGCAGCCTCATGTTCAACTGGACATCCACCCAGCGCAATGTGGCGTTTGCCAGCTTTGCCAGCTGGGCGCTGGACGCATTTGACTTCTTTATTCTGGTTTTTGTGCTTAGTGATATAGCCGCTAACTTTAGCACCAGCGTTTCTGATGTCTCACTGGCGATAATGCTGACGCTGGCAGTGCGACCTGTCGGCGCACTGCTGTTTGGCCGCCTGGCGGAAAATTATGGCCGCAGGCCGATCCTGATGGTCAATATCATCACTTTTACCGTCTTTGAGCTGCTGTCAGCCTGGTCACCAACGCTGACATGGTTCCTGTTTTTCCGCGTGGTTTATGGCGTAGCGATGGGCGGCGTATGGGGCGTGGCTTCATCGCTGGCGATGGAAACCATTCCCTATCGTTCAAGGGGATTGATGTCAGGCATCTTTCAGGCGGGCTATCCCTGCGGTTATCTGCTGGCTTCAGTCATTTTTGGCCTGTGCTATTCGCTGGTGGGCTGGCGTGGGATGTTTCTGATTGGCGCACTACCGATCCTGCTGCTGCCGTTTATCTGGTTTAAAGTACCGGAATCACCAATATGGCTGGCGGCGCGCCAACGCAAAGAGAGCGTGGCGCTGCTGCCGGTGATCAAAAGCCACTGGAAGCTGTGTGCCTATCTGGTGCTGTTGATGGCATGCTTTAACTTCTTCAGCCACGGTACGCAGGACCTCTACCCGACCTTCCTCAAGGTGCAGCATGGGATGGAACCCCACATTATCAGCATGATAGCCGTCTGCTACAATATTGCGGCGATGCTGGGCGGGGTCTTCTTTGGCGTGCTGTCGGAGAAGATTGGGCGGAAAAAAGCCATTATGATCGCCGCTATTCTTGCGCTACCGGTATTGCCGCTATGGGCCTTCTCCAGTGGTTCCTGGGCTATCGGCATCGGCGCATTTCTGATGCAGTTTATGGTGCAGGGGGCATGGGGCGTCGTGCCGACTTATCTGAGCGAGCTGGTGCCGGCGAATACGCGTGCGGTGCTGCCGGGTTTTGTCTACCAGCTGGGAAACCTGATCGCTTCGGTCAATGCCACCCTGCAATCCGGCATAGCGGAAGCGCACGGTAATAATTACGGATTGGCGATGGCCATTGTTGCCGGAACGGTGGCGGTGCTTATCTGTCTGATTGTTGCCGTTGGCCGTGAAACGCGCGGTATTAACATGTCGAACCCACCTTGAGCAGGACGCAGAACGGGTCGCCCGGACAGCTCGCCGTAAATCATGATTTTAGGTGTCAATCAGTAAAATCGATAGCGGTCAGGTCACAATGTTTTTTGTTAAAAGAGCTGGCGTGAAGCGCCGTGCACAGGCGGTTTTTATACGATATCGGCAGGCTTGCCGGGCGCTCTTTTTGACGCATCATGGAGACTAAAAACGGTATTACGACCCTGGCTTTTCGCCAGATAGAGAGCCCGATCGGCATGAGCTAACGCCTTGTCAAAATCATTGGCAATTAGCGGCGCAACGCCAACGCTGATGGTGACGTGAGTCGCCACCTTGTCGTTAAATCTGTGGGGGATTTCAAGATCAAGTACGAACTGGCGAACACGCTCGGCCTGCTTCAAGGCAATGGCTTCATTCACGTTGGTCAGTATGACCAGAAATTCTTCTCCGCCATAGCGGGTAACAATATCCCGGGATCGCACCGCGTCGCGAATGGCCACCGATACGCGCGTCAGCGCCTGGTCGCCCATCGCGTGGCCGTAATTGTCGTTATAAGATTTGAAGTTGTCGATATCCAGCAGCAGGACATAGTGATTGCCGGTATGGTTATCGATGACATTTTCCAGCCGGTTCTGTAGCCCACGCCGGTTATACAGGCCGGTCAGAGGGTCAAGCATGCTCAGATCGCTGTAGGTTTCTTTCTCTTCATAAAGCTGTCTGACCAGGCGCCGGGTGAATTTATCTCGCTTGCGCAGCATCAGATGATGCAGGCTAAAACCCATCAGCGGCAGAATAATAATGAACAGGATGATAAGCGTACTCTGACCCCGATCCAGCGCCAGTACCACGACAGAGCCAGGCGCCGTGTGCAGCAAGAAGGGCAGCAGATAATCTCCCAGCGCAATCGCGCTGATAAAGAACACGCTGACCAGCGCCACCAAAAGATAACCGCTTTCGAGATGCAGCTCCTGCTGATAGCGCAGGGTGATATGCCATGCCCAAAGTAACCCGCTCAGTGCCGCAGCGATATTTAAGAGTGGGAATATGGCTACGGGCTTTAACAACATCCACGTCAGTAAGATGATACTGATGATGATAATGGCAATGGTCGGCGTGGTGACCGTCAGTGTGTCATGATGAATGACGCGGAGCACGCAAAAAATAGCCGTGGCGGTGTTAAGAAGAAGAAATAGCATCAACGATAGCCGGTGCTTGCTGCCCAGTAACTCATCGTAGCTTTGTACATTCATTATGTTTGCTCGTGACTGCCCTGAAATATCAAAGTGTTACTACCGTTAAGCCACCGTTCTCTATGCTGAATAGCCTAAGCTCATCACATTAGGATTTTTCTTATTTGTGATCTCGCTCAATCTATCATTTAAATACAGAGCTGTCATCCGCAAGCGTCGCACCCAATATTAGTTGCTGCAAAATGATAATGATTATCATATGATATTGGTAATCATTATCATTCATATTTGTAAGGGAACCTGTGATGTTGGGCAAAAGACTGGATAGTGGCTGGGGTGTACTGGTGCCCTGTGCAATGATGCCATTGCTGGCATTAATGGAGCTGTCATTCAGCGAATGGCGACTGCTGATGGTGGTGGCGTTTCTCGCAACGGTGATCATGTTGTTTCATAAGCGTTTGCGTCATTACCTGTTGCTACCGTCCTGCATTGCTCTGGCAGGCGGACTGGCGGCCATTTCAGTGAATTTTAACGGAGTATAATCACAAGGATACGGGGATGAGGAAAATCACGGAAAAATTAGCTGTCGGAGGGGTGTTGCGGTATCAAAAGCGGGCAATCAGAGGGATTCAACATGGTGCGAAGAGAGGGACTTGAACCCTCACGTCCGTTAAGACACTAACACCTGAAGCTAGCGCGTCTACCAATTCCGCCACCTTCGCGTCCTGTTGAACTTATTCATATCACCGCATTGGTGCGAAGAGAGGGACTTGAACCCTCACGTCCGTAAGAACACTAACACCTGAAGCTAGCGCGTCTACCAATTCCGCCACCTTCGCGCAGATGCTGCGTAATATGAATATCATGGTTTTTGGTGCGAAGAGAGGGACTTGAACCCTCACGTCCGTTAGGACACTAACACCTGAAGCTAGCGCGTCTACCAATTCCGCCACCTTCGCATACCCGCAACATAACAAGGTTATATCGCAACCACGGAGGCGCATTCTAGAGGTTTTCCCCGGTACGTCAACAGTTATTTCTTGAGGTTAAAGCAAACGCTGTAAAAAGCATCGGACAGGGGGGAACCTGAAAACAGGCAGGGCGGAAACGGTATCCGCCCTGAAACTGAGGGTTACTTTTTGGCCGCGCGGCTCTTGACGGCACGATAGACCTTAAAGCGCCCGTTTTGCAGCAGCACTTCATGGCTGCCGAAGGTCTCATCCAGCACCTGGGGGTAAGGCAGGAAGGCGTTAGCAACGATACGCAGCTCACCGCCGGTATTCAGATGGCTGACTGCGCCACGGATCAGCGTCTGCGCCGCATCCAGACTGGTTTGCACACCGTCATGGAACGGCGGATTAGAAATAATCATATCGAAGCGACCGCTGATATCGGAATAGACGTTGCTGGCAAACACGTCGCCTTCCAGCTGGTTGGCCGCCAGGGTGGCTTTGCTGGAGGCAATCGCCGCCGCGTTAACGTCGGTCAATGTCAGGCGTACTTTCGGTGAGAAGCTGGCCAGCATGGCGGAGAGCACGCCGGCACCGCAGCCGACATCCAGCACTTTGCCTTTCATATGCGGCTTGAGGGTGGACAGCAGCAGCTGGCTACCGATATCAAGACCATCACGACTGAACACCCCTGGTAATGTTTTGACGGTGAGTTCGCCCAGCGGATACTCATCCCAGAACGTGTCGGCATCAAAGGTGGGACGGCTGTCGAGACGGCCATGATAAAGACCACAGCGGCGCGCGCTGTCGATTTTTTCCAGCTTTGCCCAATCGGCGACCATCTGTTCAGCGCTACGCACGCCGCTGCGATTTTCGCCAACCACAAAAATATCGCTGCCGACAGGCAGCAGAGCCAGCAGGTTTTGCAGCTGATACTGTGCTTCTGGCTTGTTCTTTGGCCAGTAATAGACCAGGGTATCACAGTCAGCTAACGTCTCTGCGCTGGCAAAGAGACCATAATGGGCATTTTCCCCCAGTACACGACTGAGGATCTGCCAGTGATGATATTGTTGGGTGTGCACCCGACTCAGGGCGGTTTCCAGCTGGGCGGGCAGGTCATCCTGCAGATCGCCAGCAAACAGCACGCGGCGTTGAGTAAATTCATCACTGTGGCGCAGTATCACTTCACTGGCCGGGGTAAATGCGGACATCGATGGCTCCTTATAAATCAGAGCGGCGATTATAGTTGTTTGTTGGCGCATAATCATCGGCTTTGTTAGCATACGTCCGGCACACTTATAGCCGCATGTGCGCGCGCCGCCCTTCTGCAACGCCAGCTATTGCGGCTTAAGGCGGCGTAAACGGGCGATCGGACAGGAAAAAATGATGTCTCCCAGACGTGACTGGTTACTACAGCAAATGGGCATAACGCAGTATACGCTGCGGCGTCCGCGCGCCCTGCAGGGGGAGATCGCGGTCACGCTACCCGCTGAAACCCGGCTGGTGATTGTGGCGGACAATCCGCCCATGCTGCATGATCCGCTGGTGGCGGACGTCCTGCTCGCACTGAATCTGCGTCAACCGCAGGTGCAGGTTCTTACCCCAGACCAGCTGGCGATGCTGCCGGACGACGCCCGCTGTCACAGCTGGCGTTTAGGTCTGGATGCCCCGGTAACGTTAGCCGGAACCCAAATTGCTTCCCCCGTACTGGCGGAGCTTTATAATAATGCCGAGGCCAAACGGGCGCTCTGGCAGCAAATTTGCCACTATGAATCAGATTTCTTTACTCACTCCGCACGATCTTGACGCCGCGTTTGCCATTGAGCGGCGTAGCCACGCTTTTCCCTGGACAGAAAAGACCTTTGCCAGCAATCAGGGTGAACGCTATCTCAACCTCCGACTGACGGTTGACGGCGTGCTGGCCGCTTTCGCCATTACTCAGGTGGTACTGGACGAGGCGACGCTATTTAATCTGGCGGTCGATCCGACATTTCAGCGTCGCGGATTGGGCCGTGAGCTGCTACAACACCTGATCTGCGAGCTGACGCAGCGCAACGTCATGACCTTATGGCTGGAGGTGCGCGCTTCAAATCGTGCCGCCATCGCACTTTATGAGCAGCTGAATTTTAATGAGGTCTCTATTCGCCGTAACTATTATCCGACGACCAGCGGTAAAGAAGATGCGGTCATTATGGCGCTTACCATCTAACGGGGATCTGACCGATGCTTAAAAACTGGGACTGGATTTTATTTGACGCGGACGAGACGCTGTTTCACTTCGATGCGTTTGCGGGCTTACAGCGTCTGTTTCAGCGCTACGATATTTCATTTACGCGTGCCGACTATGATGACTATCAGGCGATCAACAAGCCGCTATGGGTTGACTATCAAAATGGTGCCATCAGCGCGTTGCAGTTACAGCACCAGCGCTTTGAGGGCTGGGCCGCGAAGCTTGATGTGACGCCACAGGATCTTAACGGCGGTTTTCTGAGGGCAATGGCTGAAATTTGCACGCCTCTGCCTGGGGCAGCGGAGCTGATCAACGCGTTACAGGGTAGGGTCAAAATCGGCATTATGACTAACGGCTTTACCGCGCTGCAGCAGGCACGGCTTGAACATACCGGTTTCTCCGGGCTCTTCGACCTGCTGGTGATATCCGAGCAGGTGGGATATGCCAAACCGCATCCGGCGATCTTTGATTATGCGCTCGGCCAGATGGCTAACCCGCCACGCGATCGGGTACTGATGGTAGGAGATAACCCTGATTCAGACATTCTCGGCGGCATCAATGCCGGAATGGCAACATGTTGGCTGAACAGCGATGGCCGCAGCAGGCCGCAGGGGATAAAACCAGACTGGGAGGTCACCTCCCTGACTGAATTGCAGGGGTTACTGGGCGCGTAAGCGCCTTTTTTTGTGCTGTAGAACCATTCATTAGGGAATCTTAACCTCTTTTTTTGGTTTTTTGTGCTAACAACAAATGATCTCTATGTTATTAATGCTAAAAATTTGTTTTATTGCACCTTATTTTAGCAATAGGTGCTTTCCCGCCATTTATCTCACCTCGCTCGATTTTTTCCACGTTACGGCATTCAAAGTTTTTTTAGAAAACTCCTGCCGTGCAAACGCGCTTCATTGATAGGTCAGGGCCGTCTAAAGGTGAAGTTTAGCTTATTGACATAGGATGAGTCCTAACTAATCTCAGACTTTATGTGTAATTTTCATGCAGTGCCTGCATAAGAATGTCTTAACAAACATCCCGCCCCAGCTCTTCGTTGAATTATTGTTTTTTCTCTGTAAAATTGCCGAAAAACTTACAATTCAGGATGAATGCTTTTGGTCATCGGGATGGCTAATGTCTATCTCGCGCGGTAGCTATGAACATTAATGTTTAACATCTCTTCCCGACTTCCTTGATATTACGCGCAGTACGTTGCATTTGGTCAATTCAGCCACTGCATGGCAAATGATAGTTTCGATAGCCATTGCCGCTGCTGTTCAGATGGCCCATTACAAAGTGCATCTAAAGGTTACCTTTCGTTACCGTACGGCGAGACCTGCGTGTCATGTGACAACCCACGTTTTGTGAGTAATTTTACGCTGAAAATTAATCATTCATAACGTTTCGCGCCACCCGATAAAATCGACACGCTGCCTGCAAACCATCATCTGTTCTGAAGTAAATTGTGGTCGTACACGGCGAGCGAAAGCGGCTGTCGCAGCCCGCAAAATAGTCAGTTCGCTATGGCGATAAATGCTGCCACTGCGCAAATTTACCGGGTCGTCATCCTTCTCTCTTTTCGGCCTGATTACCGTCTTTTTTCTGCTGTTGTACTTCGGGTGTGAGTCAACGCCCGTTACTCCGTTATTTTTGAGAAAACGTAAGAGGATTATGGTGAAAATTCGTATCGCATTAAGTCTGCTGTTTGTATTGAGCGTCGTTGGTTGTAAAGCCCCTGCACCGAAAATCACCGACGATACGGTGGTTTCCAGCACGGTTGACGGCGTCACGCTGAGCTATCGTCACGCGATCACGCCGCCGCAAAGCTTTACGCCGGTTGGCGAAGAGTATCGAGCGCTGTACGCGGCCTCGGTAATGAGTCGCCCGAATTTTGGCGGCAAGCTGGTACGCAATCTGGACAACGGTCAGACCTTTACCGTGCTGGGATCGGTTGAAAACAACTGGTTCGCCATTGCCGATGCGGGACAGGAGCAGTTGATTGGTTATGTACCGCTGCGTGCCGGTGTGAAAAGCGCCCTGTATGACCAAACGCTAAAAGCGGATCAGCGCCGCAAGCGTGTGCGTGCCCCGGCGAAGAAGAAGACCTGCGTTGCGGTAGACGGCGACAGCAAAGCCTGCCAGAACAACAACAACGGTACCTGGATCATTGACTGATAACGCGTTAATGGCAGCATGACTATGCACAAAATTTCGATTTCTCGCCTGGTAAACCAGGCGTTTTTACTGGCAGCCCTTATCCTGCTGGCAGGTTGTGTTTCAACTTCACGCAGCGTTCCCACCCAGTACAGCCTGGTCTTTCAGGCACATCCTCAGATCAACGATTCGGCCCCTCTGAAAGTACGGGTGCTGCTGCTGAAGTCAGACGCGGATTTTATGGCCGCCGATTTCTACTCGTTGCAAAATAATCCGCAGGGCGTGCTGGGGCAAAATCTGTTGAACAGCGAGCAGTTTTTCCTGATGCCGGGACAAACGGGTAAAAAGCTGCTTGGGCAGACCAGCCCGGAGGCGCGCTATATCGGCATTATGGCCGAATATCAGGCGCTTGATGGCAAAACGTGGCGGATCTCACTGCCGGTTCCTCTTCCCGCCGAACGCCGTTTTTATCAATTTTGGCAAGGGAATACGGATAATCTGCGCGCCGACATCATCGCCGACATCAACGGTGTCCGTGTGGTAAACCCCCGCGATTAGCGCGGACACAGGAGTCATCATGAGCAAAGCAGAAAAAGTAGTCTGGACCGAAGGAATGTTTCTGCGCCCGCATCATTTCCAGCAGAGCGAAAACTACCTGCAAAGCACGCTGCGTGACTGGGGGCAGGCACAGCGCCCCTGGCTATGGGGCCTGCATGATATTGAATTTGATGAGTCGATGCTGCGCCAGGGCAAAGTCGCGCTGCTCTCCGCCAGCGGCCTGCTGCCGGACGGCACGGCGTTTGCCTTCAGCAATGGGGACGATGCCCCAGCGCCGCTGTTGATCCCCGATAACCTCACCCAGGCCAAAGTGGTGCTGGCGCTGCCGGCGCGGCGCGGCGGGCGTGAAGAGGTGATCTTCAGTGAGTCTGGTGATTCGCTGGCGCGCTTTATCAGCTTCGAACGCGAAGTGGATGATTTTAACGCGATGGCGGTCGGCCCGGCGGCGGTCCAGTTTGGCCGTTTACGTCTGCGGCTGATGCTGGAAAGCGAACTGAGCGCCGAATGGACCGCTATCGGCGTGGCGCGCATCGCTGAAAAGCGCAATGACCATCAGTTACGCCTCGACGGCAGCTATATTCCACCGATGCTCAATGCCATCAATCAGTCACAGCTGATGGAATACATTGGCGATATCCACGGCCTGCTGGTGCAGCGCAGCCAGCAGATCGGTCAGCGCCTGCAGCAGCCGGGGCGCTTCAACACCGCTGATATGGTCGATTTTATGCTGCTGACGCTGATTAACCAGCAGCTGGGGCATATCAGCCATGTGAAAAGCCTACCGCTGATCCATCCTGAAACGCTATTCCACGGCTGGCTGACGTTTGCGGCTGAACTGACCAGCTGGATGCCATCGCGTACCCCGGACGGTGCACTACCCACTTACCAACACGATGACCTGGCGGGCTGCTTCAGCCAGCTGGTGCTGCTGCTGCGACAGGGCCTGTCGCAGGTGATGGAAGAGAACGCGCTACAGCTGCCGCTCACCGAGCGTTCTCACGGCCTGAACGTGGCTACGGTGCCTGAGTCCTCCATGGTGCGCGAGTTTGGCTTTGTGCTGGCGGTGCGGGCCAACGTGCCGGCCGAAGCGATACAAACCCACTTCCCGGCGCAGATGAAGGTGGCCCCGGTGACCAAAATCCGCGACCTGGTTCAGCTGCAGCTGCCCGGCATTATGCTGCGGGCCATGCCGGCCGCGCCGCCGCAGATCCCGTGGCACGCTGGCTACAACTATTTTCAGCTCGATAAGGGCAGCGAACTGTGGCAGGAAATGGAGAAATCCGGCACCTTCGCGCTGCACCTGGCCGGTGAGTTTCCGGGGCTGGAAATGGAGTTCTGGGCAATTCGCAGTCACGCAGTCTGAGCAGGGACTCAAGCACAATGAAACAGGAAAAACATTCTGATGCCGCCCAGGCGGGTATCGGCGCTGCCAACGGTCACAATCCACTGGTCGCCGCCGCCAACCCGCTGCTGAACGCCATTCCCCAGATCCGCCATTCGGTTTCCCATCCCGATCCGGCCGGACTGCGCCAGCGGCTGGTCGATGAGATCCGCCAGTTTGAAATGAACTGCCAGCGCGCCGGGCTGCCGTGGGAAGTGATTATCGGCGCGCGTTACTGCCTGTGTACCGCGCTGGATGAAGCGGCCGCGCTCACCCCGTGGGGCGGGCGCGGCGTCTGGCCCGGCAATGGGCTGCTGGTCACCTTTCATAACGAAACCTGGGGCGGGGAGAAATTTTTCCAGCTGCTGGCGCGCCTGTCGCAGACGCCGCGCGAACAAATTGCCCTGCTGGAGCTGATTAACTTCTGCCTGCAGCTGGGGTTTGAGGGACGTTACCGGGTGATGGATAACGGGCGTTCCCAGCTGGAAACCATCAGACAGCGCCTGCTACAGATGATCCGCTCGGTACGCGGTGGCTATGCGCCGCCGCTGTCGGTGCATCCGGAAGACCATCCGGTAACGCGTAAACTGTGGCGTCCGGTAGTACCGCTGTGGGCCTGCACCGCCGTGGTGGGCTTCCTCGCCTGCCTGCTCTATATCCTGCTCAACTGGCGGTTAGGTGACTACACCAGTCCGGTGCTGGCCTCGGTCTACCAGACCTCGTTACCGGAAGTGAGTATTCACAATCCGGCTCCGCCGCCGCCTGCCGCGGTTAATCTGAAAACCTTCCTGAAGCCCGAAATCGAACAGGGGCTGGTGGCGGTGCGCGACGAGGCTGACCGTAGCGTGGTGACGCTAAAAGGTGACGGCCTGTTTACCTCTGCTTCTACCGAAGTGCGCGGGCGCTACGCCGAGGTGCTGGACCGCGTTGCCGCCGCCATGAACAACGTCAGCGGCCGCATCCTGGTGGTCGGCTACAGCGATAACGTGCCGATCCGCAGCGCGCGCTTTGCCTCCAACTACGAGCTTTCCCTGGCGCGGGCGCAGTCGGTTTCCGGGCAGCTGCGGCAGCATCTGAGCCAGCCGCAGCGGGTGAAGGCGGAAGGACGCGGCGAAAGCAGTCCGCTTGCCCCGAACAATAGCGCTGAGAACCGCGCCCGCAACCGCCGTGTAGAAATCACGCTGCTGGTGTCACCGGGCAATACCCGGGTCGAGCTGAACGACATGGCGCAAGGAAACTAACCGATGTTGAATATGCTGTTTGCCGTTTTAACCCATCGCCTGCTGTGGGGCTTTGTCGGGATCACCGCGCTGTCGTTTATTATCTGGGTCATCGGCCCGGTATTTTCGATTGCAGACTCGCGGCCGCTGGAGCCGGAAGTTAACCGCCAGATCAGTATCGGCCTGCTCTATCTGGTGTGGGCGCTGGGTAACCTGGTGCCGCGCCTGTATAACGCCTGGCTGAACCGTAAGCTGATGGGCAGCCTGAAAACCACCCCCGGCGAACAGCCGGACGGTGATAATCCGCGTCTGACCAGCGAAGATCGGGTGCTGGCGGAACGCTTTAGCGAAGCCTCCGAGCTGCTGAAAAAGGCGCACTTCTCCCATGCCGGTAGCCGTTCGCCGTTCTGGGCGCAGCGCTTCAGCCGCCAGTACCTTTACCAGCTGCCGTGGTATGTGATTATCGGCGCGCCGGGAGCCGGGAAAACCACCGCGCTGGTCAACTCCGGGCTGCAATTTCCGCTGGCCGACCGCTTTGGCAAATCCGCGCTGCGCGGCATTGGCGGCACGCGTAACTGTGACTGGTGGTTTACCAATGAAGCGGTGCTGCTCGATACCGCCGGACGCTACACCACCCAGGAGAGCCAGCAGCAGCAGGATGCCGGAGAATGGCACGGCTTTATTAACCTGCTGCGCAAATACCGGGGTCGCCAGCCGATCAACGGCGTGATCGTGACCGTCAGCGTCTCCGATCTGTTGACCCAGTCGGCAGAGGCGGCACGGCAGCAGGCGGTGGCGCTACGCCAGCGTCTGACCGAGCTGCATGAACAGCTGGGCATTCGTTTCCCGGTGTATGTGCTGGTCACCAAAGCCGATCTGCTGAAAGGCTTCCGCGCCTACTTTGCCAAATTCGACAAGGCCCAGCGCGAACAGATTTGGGGCTTCACCTTCCCGTGGGAACGGGCAAAGATGAGCGATTTTGACCTACAGGCGGCCTTCACCCAGGAGTACGCCCTGCTGCAACAGCGGCTGGATGCCGGGCTGCCGGATACGCTGCTCCAGGAGAGCGATAGCCAGGCGCGCGCCGAAAGTTTCCTGTTCCCGCAGGAGTTCGCCGCGCTGCGTCCGCTGCTGGCGGAGTACTTGGACACGGTGTTTGCCCTTTCAGATTTTGAAACCCAGTTCTCACCGCGCGGCATTTACTTTGCCAGCGGCACCCAGGAAGGGCTGCCGTTTGACCGGGTGATGGGCGAGCTTAACCGCGCCTTACAGCTGCCGCAGCAGGATGCGGCCAGCGGCCAGAGCGGAAGCTGGGACCAGACCAGCAAATATGCGCCGATCCCCGGCAACAAGGGGCAAAGCTTCTTCCTGAAAGACGTGCTGCAAAAAGTGATCTTTGCCGAGGCCGGGCTGGCGGGCAGCAACCGCTGGTGGGAGCTGCGCAACCGGGCGCTGCTGTGGTCGGGCTATATCGCCCTTGCCGCCGTAATGGTCATCAGCGCGCTGCTGTGGTTCACCAGCTACGGCAACAACAAAAGCTATCTGCAACAGGTGCAGGCGCGGGTGCCGGAGGTGGCGCGGCAAAGCGCTACGCTAGAGGGCAGCGAGCAGGGCGATCTGTTTGCGCTGCTGCCGTTCCTTAACAGCCTGCTGAAGCTGCCGGAAAGCAGTGAGTTTGACCTGAACTCGCCGCCGATAAGCCGCCGTATGGGGCTGTATCGCGGGGCCGAGGTCAGCGATGCCACCCAGGCGCTGTATCAGAAATCACTGAAACAGCTGCTGCTGCCGCAGGTGGCGCAGCTGATCACCGGCTGGCTGCGCAACGATAACGGCAGCGACGCCGACTACAGCTATGAAGCGCTGAAAGCCTATCAGATGCTGTATCAGCCGCCGCACTATGACGGCAAGTTCCTGCATGCCTGGCTAATGCTCAACCTGCAACGCAACCTGCCGCAAAACGTCACGCAGACGCAGCTAAAACAGCTGGAGTGGCACCTCAGCCAGCTGCTGGAAAATCAGATCCAGTCCTCTCCCTATGCGCGGGATGATGCGCTGGTAAAGCGCGAGCAGGCGCTGATTAATCAGATGCCGCTGTCACAGCGCGTCTGGGGGCGGCTCAAGCGCCTGCTGGAACGTGATGAAAGCCTGAAAGCGGTGTCGCTGGCCTCGCTCGGCGGGCCGCAGAGCGAGCTGGTATTCTCGCGCAAAAGCGGGCGCTCAATTGCCGATGGCATTCCCGGCCTGTTTACCCCGGACGGCTACTGGCAGAGCCTGGATAAACATATCGCGCCGGTCACCACAGCCCTGCATGACGATGACCGCTGGGTGCTGGGCGCACCGTCCAGCGGCGAAAGCCAGCAGCAAACCGATGCCGCGGTACGCCAGCTGTATATCGGCGACTATATCCGCCAATGGGACAGCCTGCTACAGGACATTCAGCTGAATAACAGCGCCGATCTCAGCCAGCGCATCAACAGCGCGCGCCTGCTCTCCAGCAATAACTCGCCGCTGCGCAGGCTGGTGATTAACCTCAGTCGCTACCTGGTGCTGGAAAAGCTGCCCACGGATGAGAAGCCGCCGGGCAAAGAGAAGGACGCAGAGGCCGATAACAGCGCCACCCGCACCCTACAGGCGCTGTTCCGCTCACGGCAGAACAGCACGGCGGCGGCGGCAGAGCAGGCACCTGAACAGGCGGTGGCCAACCACTTTGCGCCGGTTATCGAACTGGCGCAGCCGCTGGAGCAGGGCGGTAAAACCATCGCCTTTGATGACTTTCTTCGCCAGATAGATGACCTGTATCGCTATCTGACGGCGGTGCAGGATGCCGCCAACAGCGGCATGCCGCCGCCAGCGGGTGATGCCATCAGCCACCTGCAGGCCAGCGCCGGACGCTTACCCGGTTCGCTGCAAACCATGTTCTCCACGCTCGCGGTCGGTGCCAGCAGTGACGCCCAGCGCCGCGAACTGGAAAACGTGCGCAAACGCATCAGCAGCGAGGTCGGCGGCTTCTGCCGTCAGGCGATTGCCGGACGCTACCCGCTGGTGCGCTCGGCGCGCAGCGAAGTGACCCCGGATGACCTGGCGCGCATGTTTGCCCCCGGCAGCGGCCTGATGGACAGCTTCTTCCGCGATAATCTTGCCAACAAAGTCGATACCACGCAGTCGGCCTGGCGCTTTACGCCGGGCATTGACGGCAAGGGCATCGGCGGCGAAGACATTCTGCGCCCGTTCCAGCAGGCGCAAAGCGTGCGTGATGCGTTCTTTGCCAACGGTGCCACCACCCCGGCGTTTCGCGTGACGGTACGCACGCTGCGCATGGATAACGATATTCTTAACCTGACGCTGGACGTCGACGGCCAGCTGCTGCGCTACAGCCACGGGCCGCAGGCGGTACAGCTGATGAACTGGCCGGGCAGCGGCGGCACCAGCCAGGTGCGCATGCAGCTTGGACTGGCGAACGGCACCACCTCGACGCTGGTGACCAACGGCGTCTGGGCGCTGAATCGCTTCTTTGACCGGGCGCAGCTGGCGCCGGGCAGCAGCAGCCTGAGCCGCCAGGCCACCTTTAACGTAGACGGCCACCGCGTGACGCTGGAGTTCACTCCAAACAGCATCCGCAATCCGTTTCAGCTTTCCGGGTTCGCATGCCCGTAGCCGCAAGGAAATGATGATGAGCGAGACGCCCGCGATCGGCTGGTACGGCAAATTGCCCAGTGCCGGTGATTTTTTAAAACGCCGCTTCCCGGAGGCGCTGTTTAACTCATGGAGCCACTGGTTCCAGCTGGGGCTGCTGGACTGGAAGCAAAGCGAAGAACAGCGCCCGGACGGCGGCCGCCAGTTCGGCAACGCCCCGGTATGGAACTTCGTTGTTCCCCCGCTGCTCGGCAGTCGTCTGGTGCAAATGGGCTGTCTGCTGCCGGCGCGCGACAGCGTGGGGCGGCAGTATCCGGTCTGCGCGCTGCTTAGCTATAACCTGACGCAGTGGTCACCCCAGCGCCTGGCGCGGGCGGGTGAATGGTATCAGCAGCTGGGGCGCACCCTGCTACAGGGCGTACGCAACGGCTGTTCGGCTGAACGGCTCGACGAGGCGCTGCTGGCGATCCCCGCCCCACCGGAACCGGAGCACGGTGACGCCTCCGGCATTCTCGAGGCGATTGGCTACGACGATCGGCAGGATACGCTGGGCTGGCGGCAGGCGGCGGACTGCTTCAGCCCGCAGCAGTACACCAGCTTCTGGTGGACCAACCGCACCGATGGCTACCCGCTCTATACGCATCTGCACAGCGGCAACTTTACCAGCCAGCTGTTTAGCATGCTGTTCGACCCGGCGGAGGGCGCGAAGCCCGGCCGCAACGGGCTTTATCCTCCGATGTTTGAGTAACCGAGTAAGGAACCGCGATGGATCTTGAAGCGCTGCTGGCCCCGATCACGCCTGACCGCCCCTGTGGCGATAACCTTGAATATGACGCCGACTACATGGCGATGGACCAGGCCAGCGCCGGTAAAGCCGAACAGCAGTTTGGCGATACCATCATTCCGGCGGAACCGGCGGACTGGAACAAAGTCGAGCGGCTGGCGCTGGATCTGCTGGGGCGCAGCAAAGATCTGCGCGTGATGCTGGCGCTGACCCGCGCCTGGACCCAGCTCAAAGGGTTGAGCGGCTACGCCGACGGCCTGCATCTGATCCAGCAGGCGCTGCTGCTTTACTGGCAGCCGCTGTGGCCGTCACTGGAAGAGGACGGTATAGAGGACCCATTTTACCGCCTCAACGCGCTGGCGGCGCTGGGTGATAAATCGGCGCTGACCGCCGCGCTGCGCCAGGCTTCGTTGCTACGCTATGCCACCGATGAAATCAGCCTGCGCGATGCCAGTGGGCTGCTCGACGGCAGTAAAACCGAATGCCCCGGCTATCCCGGCGGCCGCGCCCGCTTACAGGATGAACTGGCGCGGGGAGGGCAGCCCGGCATTGAGGCGGTCGTCAAGATAAGTGAGCGGTTACTGACTATTCGCGAAACTTTGACTGAACGGCTGGGCGCGGGGGCGTTGCCGGAAATGGATCAGCTGCTGAAAACCATAAATAGCGTGGCTGCCGCCTGTCAGGCCACCGACCTCAGTACCCTGATCCCAGCGGTTGAAACTGCCGCCAGCAGCGCCACGTCTGCGCCAGCCGCCGCGCCTGGCGCACAGCAGCACGCCGACTGGCGCAGCGTACAGCTTAGCTCGCGCAGCGATGCGCAGCTGATGCTGGAAAAAGTGAAACAGTATTTTAGCCAGCACGAGCCAAGCCATCCCGCCCCGCTGATGATCGAGCGCGTACAACGCATGATAGAGCTGGACTTTATGGACATTATCCGCGATCTCGCCCCGGACGGCGTACACCAGCTGGAAACGATTTTCGGGCGTCGCGACCACTCATAGCCTGCATCGCCAGGCCTTAACCGCGTCACAATTTATAACGCGCACACACAATGGAGCAAACCATGGCAGTCAGTAAATCCAGTGGGCAAAAATTCATTGCCCGTAACCGCGCACCACGCGTGCAGATTGAGTACGACGTGGAAATCTATGGTGCGGAACGCAAAATCCAGCTGCCCTTCGTGATGGGCGTGATGTCCGACCTGGTCGGCAAACCGCTGGAGGCGCAGCCGGGCGTCGATGAACGTAAGTTTCTTGACATCGACGTCGACAACTTCGACGAACGCATGAAGGCGCTGAAGCCCCGCGTGGCGTTCCAGGCAGAAAACACCCTGACCGGCGAAGGGCGTCTCAACATCGATCTGACCTTCAACAGCATGGAAGACTTCTCGCCGGATGCGGTGGCACGCAACGTCGAGCCGCTTAACCAGCTGCTGGATGCCCGCACCCAGCTGGCTAACCTGCTGACCTATATGGACGGCAAGAACGGCGCGGAAGAGCTGATTGGCAAAATCCTCCAGGATCCAACGCTGCTGAAATCACTCAGCCATCTGCCGAAAGCTGACGATGCGCCAGCCGACGACCACGGCAACAAGGAGTAAGCGATGAGCAATCAATCCCAACAGTCAGGCGATTTGCAGCAGCAACAGGGTTACAGCGAGGACGCGTTCAGCGCGCTGCTGAATAAAGAGTTTCGCCCGAAAAGCGACCAGGCGCGCGCGGCGGTGGAGAGCGCGGTGAAGACCCTGGCGCAGCAGGCGCTGGAAAATACCGTTACCGTCTCCAACGATGCCTACCGCACTATCCAGGCACTGATCGCCGAGATCGACGAGAAGCTCTCGCTGCAAATCAACCAGATTATCCACCATGAAGACTTCCAGCAGCTGGAAGGCGCGTGGCGCGGTCTGAGCTACCTGGTCAACAACACCGAAACCGACGAGATGCTGAAAATCCGCTTTATGAGCATCTCGAAACAGGAGCTGGGCCGTACCCTGAAGCGCTACAAAGGCGTCAGCTGGGATCAAAGCCCGATCTTTAAGAAGATCTATGAGGAAGAGTACGGCCAGTTCGGCGGCGAACCTTTTGGCTGCCTGGTGGGCGACTACTACTTCGATCACGGCCCGCAGGACGTCGAGCTGCTGAGCGAAATGGCGCGTATCGGCTCCGCCGCGCACTGTCCGTTTATCACCGGCACCGCGCCGGGCGTCATGCAGATGGAGTCCTGGCAGGAGCTGGCCAATCCGCGTGACCTGACCAAAATCTTCCAGAACACCGAATACGCCGCCTGGCGCAGCCTGCGTGAATCGGAAGATGCGCGCTACCTCGGCCTGGTGATGCCGCGCTTCCTGTCGCGCCTGCCTTATGGCATCCGCACCAACCCGGTGGACAGCTTCGACTTTGAAGAGCAGACCGACGGCGCTAACCACGGTAACTACACCTGGACCAACGCCGCCTACGCGATGGCCGCCAATATCAACCGCTCGTTCAAAGACTTTGGCTGGTGTACCTCGATCCGTGGCGTCGAGTCCGGCGGGGCGGTGGAAAACCTGCCGTGCCACACCTTCCCGAGCGACGACGGCGGCGTGGATATGAAATGTCCGACCGAAATCGCCATCAGCGATCGTCGCGAAGCCGAGCTGGCGAAAAACGGCTTTATGCCGCTGGTACACCGCAAGAACTCCGACTTTGCCGCCTTTATTGGCGCACAGTCGCTGCAAAAACCGGCGGAATACCACGACGCCGACGCCACCGCCAACGCCCGACTGGCGGCGCGCCTGCCGTATCTCTTCGCCTGCTGCCGTTTTGCTCACTACCTGAAGTGCATCGTGCGCGACAAGATTGGCTCCTTCCGCGAGCGTGACGAGATGGAGCGCTGGCTGAACGACTGGGTGATGAACTACGTTGACGGTGACCCGGCTAACTCCTCGCAGGAAACCAAGTCACGCAAGCCGCTGGCTTCTGCCGAGGTGCAGGTGCAGGAGATCGAAGACAACCCGGGCTACTACGCCGCCAAGTTCTTCCTGCGCCCGCACTACCAGCTGGAAGGTCTGACCGTGTCGCTGCGTCTGGTGTCTAAACTGCCGTCGCTGAAATCGAACGACGCGTGATAAAGCGGGGTGGGGTGACAGAAAGGGGCGGTGATACCGCCTCTTTAGCGGGAGGAAAGCGGGTGTTGAGAAAATCGCTGCGCGGGTTATCGCGTATAGCGCTGACGTATCTGGCTTTCCTCGGTTTTATTGCCTGTGCTGGCTATTATGTTTTGATATTCGACTGGCATATTTCCGCGACTGCTACCGCATTGCATATTGCCCTGATGATCGTTGCTATAGCTGCCTTGATTGGCATTTATGGTATTGCAGAAAAACTCAGGTCAGCTTAACGGCCCACAGAAAATAACCAGCGCCTGGCCTGCTGGTAAATAAAGCAGGCTAAGCGTCATCAGGTTTAGCGATGAATAGTGGAAATATAATTGATCGCTATATCTTTCTGAATATCTCCCGCTTGGCGATATTCGGAAGCAGGGCAGAGGTAAATATAGAATATTTATTTTTATATTTATCTGTATGCTATATCAGTGCTTTATCTAAGCCCAAAGGTTTCCTGAAGGAAGCAGGGTAAAAGCAATAACTTAACTAATGAGAGTAGATAACCATGGCTATTGATATGTATTTGAAAGTTGACGGTATAACCGGTGAATCTAAAGATTCTAACCACACCGGCTGGATCGATGTGACTTCTTTCTCCTGGGGCGCTACCCAGCCGGGCAACATGTCCGTAGGCGGCGGCGGCGGTGCCGGTAAAGTGAACTTTAACGATCTGCACATCAATGCAAAAATCGATAAGGCGGTTACCGCGCTGCTGAAAAACTGCGCCAGCGGTAAGCACGTAGGCAAAGTCGAGGTCTCCGTATGTAAAGCGGGCGGCACTCAGATTGAGTACACCCGTATTACTCTGGAAGACGTGCTGGTCACCAACGTGCAGTTCGTCGGTTCCGACGGTGACGATACTCTGGGCGTAACCTACGCATTCCAGGCTGCCAAAGTGAAACAGCAGTACTGGGAGCAGAGCACCTCCGGCGGTAAAGGCGCAGAAAGCAGCGCTGGCTGGAATATTAAAGAGAATAAAGAAGCTTAATTATTCATCCCAATTGCCCATTCATGACTGAATGGGCAAGCTCATTTCTTCAGTGCTTTTAGTCGCATAAGGAAAGTTTATGGTAATAAGACCCGCTTTTTTACAGGCATGGCAGCGCTTCAGTGAGATAAATATCGATGTTTCATCTGTTGGAAAAAAGATAGGTGGAAATGTCGGTGCAAATATCACTCTAGGCGAACAGGACCCCGCACAGGGATTTACCAATGCCTGCGCGATAAGAATGAGTTACACGTTAAACTACTCTGGGGCTAAGGTCGAAAGGGGCGTCTGGAAAACGGTTTCAGGAGATGATAAAAATTGGTACATATACCGGGTGAAAGACCTGTTGACGTATATGCATAAAAAATACGGTAAACCTGACAAGATAGTAAAAAATCCAAAGCCCGGTGATTTTCAAAATCTGAAAGGGATATTGGTTTTTGCAGTCAATGGCTGGAGTGATGCAAGCGGGCATGCAACGTTGTGGAATGGGTCGGTTTGTTCCGATCACTGTTATTTCCCGATTTCGAATGAGGGGTCAATATGGTTATTAAAATAGGATGGGTTTTACTGGCTGCTGCATTATTATTTTCTATCCAAGCTGGCGCTAAGGCAGCCTATACCCCAAGTCAGTATTTAAAAAACTATGCGCTAAGTACCTGTATTTCTCAGGGGTATCAGAGTAAAGAGGTAAAAGAGGATGCCGCTGCTGCCGCTCGAGGCTATCTGGAATTTGGCGACTATTCCCTCGCAGCACACACGGCGGTTAGAAATCTGGGTAAGGCGTTTCTGGCTAAGGAATATACCAGTCAATCTGGTCAGCCGATGACGCTGGCAAAATGCATTGATTTCTATCATAGCGAGCAGTTGGATAAGCTGGTGAAGCAGTTTAAAGGTAAGCAGGACGATTGAATCGCCGTCTTATTCTCCATCATCTGGAATGTATCACCACCGCTATTGGCGAATGATGATATATATTGAATGCACCAAGGGTATAAATATGAATAGTCATTCAAGAAAGCATGTATAACAGCATAAATAAGCCCAAGTGGAGATAGATATGAACAAGCTATTATCACTTTCTCTTTTGACCTTTGCACTACAGGCGAGCACAGCTTATGCCATCGAAGATTGTTCTAAGGGCACAGGATTGATGCTGAAGTTAATGCCTGCGCAGAAAAAAACAGAATAGAGGCTGAAGCTGATTTAAACAAAGAGTATATTGCGGCTAAAAAACGCGTTGAGTCAACTTATGATAGTGACCCGGCCGAAAAGAAACAATATCTTGATACCCTGCTGGCAGCGCAGAGAACCTGGCTTAAATACCGTGAAAATGATTGCAAGCTGGCAGGTTTCGCAGCAGATGAAGGAACAAACGTTAGAATCGCCTTTATTAATATGTGTATCACCGATGCTAATCTTGAAAGAATAAAAAAACTGAAAGAGATCCCTTATGGGTAATCTATTTATTTACCTCTGCCGCAGACTGGCTATATTTGCTAAAAGCGTGTTTTTCTCTCTTTTAACGATGTTCTTCCTGATCCTCAGCGCAGAGGCAGACAATAAATATAGCCCACAAACGAATCTGAAAAACTACGCATTAAGCACCTGTCTGTCTCAGGGCTATCAAAACAAAGAGATAAAAGAGGATGCCGCCGCCGCTGCCCGTGGCTATCTGGAATTTGGTGATTATTCCCTGGCAGCACATACTGCGGCAAGAGAGCTGGGTAAGACGTTTCTGGATAAGCAGTATGGCAGTCAGTCAGGCGCACCGATGACGCTGGCGAAGTACATTGATTTTTATCACAGCCAGCAGCTGGATAAGCTGGTGAAAGAGTTTAAAGATAAAAGGGATGACTAATACTGTGATCGTTTTTCCCGACGCTACTGATTACTCAACATAATTGGCTGAACTGCTGTGAATGCCATGCAGTATGGAGACGGTACTGGCATGAAGAAGAGAGTTAGCCCCATGCCCGAACGAGCAACAGCCGTATTGTAAGGTTGTATAAAACGCGTATTGCAAAAAATAAATCTGTGTTTTATGAATAATGAAATTCTTCGACAAGAGTATTTATACCACAGGAATAGTGACACAAATATTCAATTAAATAGCCGAATTGAGAATGTATCGACGATTCTTCGGTCAGGGAAGATGAAGGAGTATTGAAATGAAATTAACCGTGGGTGATGTGAGGGGACTGACATCAGGTGAAATTAGCATGGCAATTTCGGTGTTTGGCAATGCAATAAATTATGCTTTTGTAAAAGTTCATAACGATAGTTATTTGCCATTTAACTTACAGTCAAACCGGGTGGCTATGACGCCTAATGGGGAAATCTACTTCAGGGAACCTTATTACAAGGATGACTTCTCTATTGCTCCAGCTGGAGACAAACACTGGTTTATTCATGAGATGGTTCATGTTCTTCAGCATCAAATAGGTATGAATGTCAGGCTCAGAGGAACTTTCAGTTGGGCGGCCAGCTATGAATATTCACTCCCTCCCGATAAGTTATTATCTGATTTTGGTATGGAACAGCAGGCATCAATAATATCTGACTATTATTACGTGAAGAGTTTCGGATTAAATAACTTTGACCGACTTTCTGGCTTTAGGGGGATTGTGGGGCCGGATTTAAAGCTAAAATATCAAAACACACTGCGAGTTTTCCTAACAAACCCACGCAGCCGGAGCGCTTTCTTATGAAAAAACCAGTGATTATCCCGATTGTGGCGCTACTTCTCACGGCTTGTCAGCTTGAAAAACCGCAGTTCGAACGGATGCAGGTCAGGGTTAACCACAACCAGCCGTGCTTTATTATCCCGCAGAATGCGGCCGATCGTCACGCGGCACTGACCAGCAATGGCCCGATGGTTTCCTGGTTTGATCAGCAGCAGTGGCAGGTAATTTCTCCGTCATCGGTGACAGCAGAAGATCGCGCCGTCAACGCCGGAGAATGTACTCAATGGCCTGGTATCAACTGGGATCCCGGCAGCTACAACGTTTTCATGCGCGTCAATGACCGCGCTTCTGGAGACATTATCCGCTATCGCGCTGATTTTTCTCTGCTAAGAAATCAACAAGGAGAACTTTCTCTCGGTTCGCAGTAATATTCAGACCATCCGCCTTCACCTCATTTCAGCCCCAAATCCAGCGCGTAAACTCCGCCGCTTTCCCCCAGAGCGGCGCAGTTTAACCGCCGTATTACCCTGCATCACCCATGCAGACGGGTCACTACCTAAACGCAGGAAAACGTTATGCGATTTACGATTATAACGAGCAAACCGGGCCAGCAGCCGCCGCAGAGCAGCTGCGACTTTCTGCCGCCCGGCGGCACTATTGGCCGTGGAGCCGATAACAACCTGGTGCTGCCGGATGACGACCGCACCATTTCACGCCTTCAGGCTATCGTACATATCAGCGCGACGGGCGAGTGCCACATCACCAATCGTGGCAACGTGACGCGCGTCCACCTGAATGATATTCCGCTGGAGCGCGGTCGCCAGGTCGAGCTCCAGGATGGCGATATCATCGGTATTGATGACTATCGCCTACAGGTCAGCGAACTGAGCGCCGCCGCGCTGCCGCTCAGTCAGCCGGTGTCCGCGCCCGCAGTGACGCGCCCGCAGCCGGTCGCCGCGCCGGTCGCTCCGGCGGCAGCTGCTGCACCCGCAGCGATCCCCAGCGAGATCTGGGACAGCCTGGCGCAGGAGTTCTCGATTTCCGATAACCTCTCCAGCCGCAGCAAAGCCAAACCGCCAGGCCAGGATAACCCGCTGACCGCTCCGGCCGCCGCCGAGCGCAACCCTGCCGATCCGCTGGCACAGCTGAAGCAGAACAGCGATCCGTTAAACCTCGATCGCCGCGATCGAACTCCCGACAGCCTGTTTAATCAGGATCCGCTGTTTAAACAGGACAGTATTTTTGATGACAGTACCCCCAGCACGCTGTTCCAGCCGCCGGGAGCCAAAAGCAGCGTCACGCATGACGCACGAGGGGATGAGCAGGATCCGCTGGCGCTGTTCGGCGCAGGCGCGGCGGCACAGGTCGATCGTGACGATCCGCTTGGCCTGATGATGGGCAGCGCGGTGCCGTTGACGCCGCCAGAGGATAAGCCGCAGCCGGGATCGCAGCCGCCGGTTAGCACGCCAGCGCCAGAACCCGTGCCGCCTGCCGGCGGGAAAAAAGCCTCCTCCGAACAGGAAGATCTGCCGCAGCCGGACAGTGATTTCAGCCTGTTTGGCGACGACAAAACGCCGCAGTCCGGTGAAGCCAGCTCGCCGCTGTTTGACGTGCCGCCAGCGGCTGCGGCCCAGCCCACGGCGTCCTCCTCCAGCGCTGACTACGGCGGTATTACCCTGCCGACCCCGCAGGCGCGTGCGCGCAGCAGCACTCCGCCACCGAAAGGGCGGCTCAGTATCGATCCGGTGGCGAACAGCGCCGCGCACAGCAGCGACGGCGGTGGTGAAATGCTGGACGGCGATCTGATGGCGGCGCTGATCGGCGGGATGGGGCTACAGGATCTGCAACCTACCCCGCGCTTCGATCAGCATGCGATGCAGGAGCTGGGGCAGATGCTCAGCATGTTCTCACAGGGCACCGTAGCGCTGCTCTCTTCACGATCGATCCTCAAGCGCGGCGTCAAAGCGGAAATGACGGTGATCCTCGACGAGGCCAACAACCCGTTCAAGCTGCTGCCGTCCGGCAAATCGGTGCTGATGCAGATGTTTGGCAGCCGCATGCCGGGCTTTATGCCGCCCAAGCAGTCGGTGCGCGATGCGCTGGTCGATTTACAGGCGCACCAGCTGGGGATGATCGCCGGTATCCGGGCGCTGATCGCCTCGATGCTGCAATCGTTTAACCCGCAGCAGCTGGAGGAGGAGGCGCGTCAGCAGGGCGTCACCTCGCGTCTGTCGCTGCCGGGCAGCCGCAAGGCGGCATTGTGGGACCACTTCAGCAAACGCTATAGCGAGACTGCCGGTGAGATTGAAGATGACTTCCATACGCTGTTCGGCGAAGCCTTCCTGCACGCCTACGATATGGAAGTCAACCAGTACAAAGACTCACAGATCAGATCGGAAGAATGATGAATATCACCTTAGCCTCAATGTCCAATCAGGGAGCGCGCGCCAGTAATCAGGATCAGGTCGGTGATATTGTCGGCGACCGCTCGGCCTGCTTTGTGGTGTGTGATGGCGTGGCGGGCCTGCCCGGCGGCGATATTGCCGCCAGCGTGGCGCGCGACACCCTGTTACAGCGTTTTGACGGTCAGCAGCACCTTAACGCCCAGCTGATCCGCCAGTATGTTAATGATGCCAACAGCGCCATCCGTCAACGGCAGAAAGCCGATCCGCCCCATCATCGTATGGGCACCACGCTGGTCAGCCTGTTTATTGACCGCGATTACCAGCTGGCCTACTGGGCGCATGCCGGTGACAGCCGCCTTTATTTATTCCGGCGCGGCTATCTCTATCATGTCACCACCGACCATAGCCTGGTGCAGCAGATGAAGGATGCCGGGCACCAGACCGACGGCATCAACGGCAACCTGCTCTATTTTGCCCTCGGCATGGGCGATGAAGACCGGGATGCCAGCTACAGCGACGTGGTGCCGATAGAAGATGGCGATGCGTTTCTGCTGTGTACCGATGGCTTCTGGCACGGCGTGTCGCAGCACCATATGCAGCAGGCGCTGCATATGGTCAACACGCCGCAGGAATGGCTGACGTTAATGCAGCAGATGATTAAAAAGAATGATGAACAGTCCAACGACAAGCAGGACAACTACAGTGCGGTTGCCGTCTGGATCGGTGAGCCACAGGACACCACGCTGCTGCATTCCCTGTCGGAAGCGGCACAATTTATTGCTCTGCGCGATTAATACAGGAAAAGGAACTTATGAAATATTGGCTGTCAGGTGCGTTCACCTTGATGATCGCGTCCAGCGCTTGGGCGCAGGATCACCGGCTGGTTCAGTCCCCGGCGCTTAAGCTGGATATCTGGATCGATAACGTGAAAAGTACCAGTGCCGAAAGCTGGTGCGCGCGTACACTGCCGCTGCGTATTGTTGCCAATGGCAAAAAAGATCCGGCGCTGCTGGATGACTACCTGCCGAAGGTGGGCAGCTTTTTGCAGAAGCAGTGCCCCGCGCTCAGCCAGATTAACTGGCAGATGAATGACGACGGCGGCAAAAAGCTGGCCGTAGGAAGCGCCAGCAAAGCACAGGGCTGGGCGGTGAAGACCGCGCTGGAAACGCCGGTGACTACGCCGCCAGTGACGCCAGAAACGCCGCCGCCAGCGCAAACGCCAGCGGTTGCATCAACCCCTCGCGCAGAGGATCTCTCACCCGCGGCGGATACCACCCCGTGGGTGCAGTTCAGCCTGTTGGACGGCTGTCATTTCCGCACGTTCTGGCGCGGCAGCAGCCAGACCAGCGCGCTGTTTGTCCCCGCCAAAGGCGGCGTCAGCTGCGGCAGCGACGGCTGGCTGAAGGGATCGGGCATTACCACCCAGTTGGGGCACGGCGCGGCGAAAAACCTGGCGATGACCTTCCTGCAGGGCTTCCCGATCGCCGGACTGAATGACAAGGCGCTGGGTAACAGTCTGCACATCGTGACGGTCAATAATCAGCGCATGGTGCTGAACGACAGCAGGCTGGCGGACAGCTGGATGGTGTTGCCTTATGCCCCCGAGCTAAACGGCTGGCAGGCGAACGGCGTGCTGGTGGTGCAGATCCCGGCGGCAGAGGCGGCGGATAACGGCACGTTGCAAAAACGCCTGAATGAGGTGCGTAACCTGTGGTCGCCGCTGCTGAAAAACAGTACCGATCTGACCATCAAACTGGTCGATGCGCTGCATCCGCAGCTCCAGGACCCGGCCGCCGGTGCCTTCCGTACCCTACATTAAGGACACGCCATGCACTCTCTACACCATCTGATGGACGGCCAGTCGCTGGCCGAAACCCTGGCGAGCCTCGAAAGCCAGATTAAACGCCAGCCCGCCGATGCCGATCTGCGCGCCAGCTTTGTCCAGCTGCTGTGCCTGGCGGGCAACTGGAGCCGGGCGCAGACCCAGCTGAAAAGCTGGCTTGCGCTGAAGCCGCAGGCACAGCCCACGGTGACGCTGCTGGAGCAGGCAATTAACGGCGAACGCCAGCGCGCGGCGGTATTCGCCGGTGAAGCCGCGCCACGGATGCCGGGAGCGGCCTGGGGTTGGGCAGAGCAACTGCTTGCAGCGCTGGCGGCCGATGCCGCAGGCGACACTGCCCGGGCGCAGCAGTTGCGCGCCGCCGCCTTAGACGAGGCGCAGCTCAATCCCGGCCAGCTTACGCACCAGAACCAGCAGCACGCCTTTGACTGGCTGACCGACGGTGACGGGCGGCTCGGCCCGATCTGTGAACTGATTGCCAACGGACAGTACAGCTGGCTGCCGTTCAGCGCCATCAGCGAAATGCGCTTTCAGGCACCGGTCAGCGTCACCGACTTGGTATGGCGGCATACGCTGGTCAGGCTGGTGGATGGCAGCGAGCAGGTGTGCCAGATCCCGGTACGCTATCCGTTCGGCGAACAGGCCGACGACCGCTACCGCCTGGCCACGCTCACCGAATGGCAGCCGCTGGCGGGCGACGACCCGCAGTTCAGCGGCCACGGTCAGAAATGCTGGTTCAGCGCCGATGCCGAATTCCCGCTGCTGGGCATGGAGACGCTGGCGTTTAGCGAAGCAGAGCCAGGCTTATGAGCCAGCACAGCGACGAGGGGCACAAGCTGCTGCACGGCGGTTACCGCCAGCGCCGCCAGCAGGAAGGGCTGAACGTGCGCGACAAAATGCAGCCATCGCTGCTCGACCGCTTAACCGACCACGCGCCGGACAACCCGCGTGAAGCGGCGAACAGCACCCTGATCTCGCACAGCGCGCTGCGGCGCAACGTGCTGCGCGATCTGCAATGGCTGTTTAATACCATTAACAATGAAGCGCAGCAGGATTTGTCGCCGTTCCCGCACGTGCAGCGCTCGACGGTGAACTTCGGCATTGCTCCGCTGGCGGGACAGCGCATGTCGGATATTGAATGGCAGGATATCCAGCGCAAGTTGAGCGCGGCGATCCTCAACTTTGAACCGCGCATTCTGCCCGCCGGGTTACAGGTGCGCTGCGTGTCCGACATCGGCTCGCTCGACCTGCACAACGTGCTGTCGATTGAGATCAAGGGCCGCCTGTGGTGCGTGCCGTACCCGCTGGAGTTCCTGTTTCGTACCGATGTGGATCTGGAAAACGGCCACTTTGCTCTGCGCGATATCGGTTAACCAAGAAGGTCAATTTCATGGACAGTAAGCAGCTCGACTATTACAACCGTGAGCTGGCGTATCTGCGCGAAATGGGCGCGGAGTTTGCCGGGCGCTATCCGAAGGTGGCTGGCCGACTCGGCATACGTGGCGACGAGGTCGCCGATCCCTACGTTGAGCGGCTGATGGAGGGCTTTGCCTTTCTTACCTCCCGCGTGCAGCTGAAAATGGACGCCGAGTTTCCGCGTTTCTCCCAGCGCCTGCTGGAGATGATTGCGCCGAACTACCTGTCGCCAACGCCGTCGATGGCGATTGCCGAACTGCAGCCGGACAGCCGTAAAGGGGATATCAGCAACGGTTTTCTGGTGCCGCGCGGCACCATGATGGACAGCCAGGCGTTGAAGCAGAATGGCGTCACCTGTAGCTACACCACCGCCCATGACGTGATGCTGCAACCGGTGCGCATCAAAAGCGTCGAGCTGGGCGGCATTCCGGCCGACGTGCCGCTCGGCGAGCTGGGGCTAAGCCAGCGCGGCGCGGTCAGCGCCTTGCGTATTCGTATTGCCTGTGATGAGAGCGTCACGCTTTCCCATCTCAGCTTTGACAGCCTGATGTTCTTTCTTAGCGGGCCGGATATGCAGGCGCTGCAACTGCTGGAGCTGGTGATGCAGCACTACGTGGGCGGCTATTGCCAGAGCCTGGGCAGCAGCCCGCAGCGCACGCCGCTGGGTGATGATGCGCTACGGCAGGAGGGGTTTGCGGCAGATCAGGCGCTGCTGCCAAACGATCTGCGCAACTTCGATGGTTATCGTCTGTTGCAGGAGTATTTCGCCTTCCCGGCGCGCTTCCAGTTTATCAGCATCGGCCAGCTGCGCCCGCTGCTCCAGGCCTGTGACCCGCAGGCGCGCGAATTCGATATCGTGCTGCTGCTGGACAGCGCCAACCCAGGCCTGGAGCGGGTGATTGACCACAGCCATCTGGCGCTGCACTGCACCCCGGTGATTAACCTGTTTCCAAAGGTGGCGCAGCGGCAAAAGCTCAGCGACAGCGTGCACGAATACCACCTGGTGGTGGATAACATCCGTCCGCTGGACTACGAAATTTACTCGGTGCAGAAGCTGTACGCCAGCGGCGAGCGCCAGCGCGAGGAGCAGCAGTTTCGCCCATTCTGGAGCACCTTCAGCGATGACGGCGGCAACTACGGGGCCTATTTCTCGCTGCGGCGCGAACAGCGCACCCTGTCCGAGCGTGCGCAGCGCTTTGGTACGCGCACAGGTTATGTGGGTTCGGAAGTGTTTCTTTCGCTGGTGGATGAGCACAACTCGCCGTGGCATGAAGATTTGCGCTATCTCACCGCCGAGGTGATGTGCACCAGCCGCGACCTGCCGTTACTGCTGCTGCAACAGCAGGGGCAGTTTGTGATGCCTGATTCGATCCCGGTGCGCCAGCTGACGCTGCGTAAAGGGCCAACCCCGCCGCGCCCGGCGCTGGCGGAGGGGAAAATCGCCTGGCGGCTGGTCAGTCACCTGCAAATGAACTATCTCAGCCTGATGGACGGCGACGACGGCCAGGGCGCGGCGGCGCTGCGCCAGATGCTCGGGCTGTATGCCAACCTCGCTGAAGCCGCCGCGGCGCGCCAGATCGACGGCATTCGCCACTGCCAGCTCAAGCCGGTGTTCCGGCGCGTGCCGGAGCCAGGCCCGATAGTCTTTGCCCGTGGCATCGGCATTGAGCTGGAGGTGGACGAGCAGGCGTTTTCTGGCAGCAGCCCGTGGCTACTCGGCAGCGTGCTGGAGCGGGTGTTCTCACGTCTGGTGGCGATCAACACCTTTACCGAAATGACGCTCACCAGCCAGCAGCGCGGGGATGTCGGCTACTGGGCGCCACGCATGGGCAAAAGGACACTGATATGACGCAGCAGCCCGCCGCGCAGGCGGAGAAGATCCACCGCCTGCATCGTCTGCCGCCGGACTTCTGGGCGGCGCTGTGCGCCACGCCGTGGCGCTACGATTTGTTCCAGCTGCTGCGGCGGCTGGATGCGCAGGGCGGCCAGCGCTATCCGCTGGGGCGTGCGCCGCTGCCGCGCCACGAGCCGCTGCGCATCGGACAGCGGCCGTCACTGGCCTTTGCCCCGGCGGCCATCGCCAGCGTCACGCCGCGTGACGGCAGCGCGCTGCACGATGTGTCGATCTACAGCTTTGGCCTGTTTGGCCCCAACGGCCCGCTGCCCGTCCATCTCACGGAATATGCGCGTGAGCGCAGCGACCACCATCAGGACAACAGCCTCAGTGCCTTTACCGATCTGTTCCACCATCGCCTGACGCTGCTGTTTTACCGCGCCTGGGCCGATGCGCAGCCCACGGTGGCGCTGGACAGGGCGGAAAAACGTCAGTTTGAGCGCTGGCTCGCCAGCCTGATCGGCATGGGGCAGCCGGGGCAGCTCAGGCAGGGCAGCCTTAGCCCGCATTCACGCCTGGCGCTGGCCGGGCATCTGACCCGACAGGCGCGCGATGCCGAGGGGCTGCAAAAAATTCTCAGCCACTACTTTCAGGTGCCGGTGCGGCTGGTGGAAAACGTTCCCCACTGGCAGCCGGTCGATCGCCGCGACCGCGCCAGCCTGGGGGCCGGGCGGCACAAGCCGCGACTCGGCGTGTCGGCGTTTCTCGGCGTGGCGGTGCGCGACGTGCAGCACAAGTTTCGTCTGGAGCTGGGGCCGCTATCGATGGACAGCTACCGGCGCTTTCTGCCCGGCGAACCCTGGGCGCAGCAGCTGCGCGACTGGGTGCGTCAGTACCTGGGCATTGAGTTTCTCTGGGAGGTGCGGCTTATTCTTGATGCCGCCGAGGTGCAGGGCGTCACCCTCGGCGGCGCATCGCGGCTGGGATACAGCAGCTGGCTGGGACGGCCGGCTATCCCACAGCACCGCAGCGATCTCACCTTTAGCCCGGAGCCACAGGAAAGTGTTTAG
Protein sequences of DBSCAN-SWA_1 >NC_010694|625326:694971|640216_641386_+|WP_012440322.1|capsid|DBSCAN-SWA MKNNTRFKLNAYMSVLAEINKINLSALNSKFTVESSIAQTLETKIQESSAFLQAINITPVDEQSGERLGLGIGQTIAGTTDTTQKEREPTDPTYIDGDGYKCTQTNFDTALPYSKLDMWAKFSDFQVRIRDVIVKRQALDRIMIGFNGLKREKTSNRVQNPLLQDVNIGWLEKIRQEKPSQVISQRIDNSGKVVAGNITIGKGGVFNNLDAVVMGAVSEKIAVQYQDDTELVVICGRQLLADKYFPIVNKDQPNTEALAADLIISQKRIGGLPAVRASFFPADALLITRLDNLSIYWQEETRRRSIIDNPKRDRIENFESVNEAYVVEDYDCTCLIENIEMLDQEPEPEAGQMSDAEIARIASVAASVVKAMSESGTPHAQAGTDTAGE >NC_010694|625326:694971|675193_678823_+|WP_012440363.1|DBSCAN-SWA MLNMLFAVLTHRLLWGFVGITALSFIIWVIGPVFSIADSRPLEPEVNRQISIGLLYLVWALGNLVPRLYNAWLNRKLMGSLKTTPGEQPDGDNPRLTSEDRVLAERFSEASELLKKAHFSHAGSRSPFWAQRFSRQYLYQLPWYVIIGAPGAGKTTALVNSGLQFPLADRFGKSALRGIGGTRNCDWWFTNEAVLLDTAGRYTTQESQQQQDAGEWHGFINLLRKYRGRQPINGVIVTVSVSDLLTQSAEAARQQAVALRQRLTELHEQLGIRFPVYVLVTKADLLKGFRAYFAKFDKAQREQIWGFTFPWERAKMSDFDLQAAFTQEYALLQQRLDAGLPDTLLQESDSQARAESFLFPQEFAALRPLLAEYLDTVFALSDFETQFSPRGIYFASGTQEGLPFDRVMGELNRALQLPQQDAASGQSGSWDQTSKYAPIPGNKGQSFFLKDVLQKVIFAEAGLAGSNRWWELRNRALLWSGYIALAAVMVISALLWFTSYGNNKSYLQQVQARVPEVARQSATLEGSEQGDLFALLPFLNSLLKLPESSEFDLNSPPISRRMGLYRGAEVSDATQALYQKSLKQLLLPQVAQLITGWLRNDNGSDADYSYEALKAYQMLYQPPHYDGKFLHAWLMLNLQRNLPQNVTQTQLKQLEWHLSQLLENQIQSSPYARDDALVKREQALINQMPLSQRVWGRLKRLLERDESLKAVSLASLGGPQSELVFSRKSGRSIADGIPGLFTPDGYWQSLDKHIAPVTTALHDDDRWVLGAPSSGESQQQTDAAVRQLYIGDYIRQWDSLLQDIQLNNSADLSQRINSARLLSSNNSPLRRLVINLSRYLVLEKLPTDEKPPGKEKDAEADNSATRTLQALFRSRQNSTAAAAEQAPEQAVANHFAPVIELAQPLEQGGKTIAFDDFLRQIDDLYRYLTAVQDAANSGMPPPAGDAISHLQASAGRLPGSLQTMFSTLAVGASSDAQRRELENVRKRISSEVGGFCRQAIAGRYPLVRSARSEVTPDDLARMFAPGSGLMDSFFRDNLANKVDTTQSAWRFTPGIDGKGIGGEDILRPFQQAQSVRDAFFANGATTPAFRVTVRTLRMDNDILNLTLDVDGQLLRYSHGPQAVQLMNWPGSGGTSQVRMQLGLANGTTSTLVTNGVWALNRFFDRAQLAPGSSSLSRQATFNVDGHRVTLEFTPNSIRNPFQLSGFACP >NC_010694|625326:694971|686981_688811_+|WP_012440375.1|DBSCAN-SWA MRFTIITSKPGQQPPQSSCDFLPPGGTIGRGADNNLVLPDDDRTISRLQAIVHISATGECHITNRGNVTRVHLNDIPLERGRQVELQDGDIIGIDDYRLQVSELSAAALPLSQPVSAPAVTRPQPVAAPVAPAAAAAPAAIPSEIWDSLAQEFSISDNLSSRSKAKPPGQDNPLTAPAAAERNPADPLAQLKQNSDPLNLDRRDRTPDSLFNQDPLFKQDSIFDDSTPSTLFQPPGAKSSVTHDARGDEQDPLALFGAGAAAQVDRDDPLGLMMGSAVPLTPPEDKPQPGSQPPVSTPAPEPVPPAGGKKASSEQEDLPQPDSDFSLFGDDKTPQSGEASSPLFDVPPAAAAQPTASSSSADYGGITLPTPQARARSSTPPPKGRLSIDPVANSAAHSSDGGGEMLDGDLMAALIGGMGLQDLQPTPRFDQHAMQELGQMLSMFSQGTVALLSSRSILKRGVKAEMTVILDEANNPFKLLPSGKSVLMQMFGSRMPGFMPPKQSVRDALVDLQAHQLGMIAGIRALIASMLQSFNPQQLEEEARQQGVTSRLSLPGSRKAALWDHFSKRYSETAGEIEDDFHTLFGEAFLHAYDMEVNQYKDSQIRSEE >NC_010694|625326:694971|637398_639165_-|WP_012440320.1|terminase|DBSCAN-SWA MTTTIAPADLDPRRQALLLYFQGYRIARIAEMLGEKPATVHSWKKRDKWGSYGPLDQMQLSTAARYCQLVMKEVKEGKDYKEIDLLARQSERHARIGKFNNGGNEAVLNPNVENRNTGPRKPPKKNVFSDAQIEKLQDIFHSTMFGYQRQWWEAGNKYAVRNLLKSRQIGATFFFAREALIDALTTGRNQIFLSASKAQAHVFKQYIVEFAREADVDLKGDPMTLDNGACLYFLGTNARTAQSYHGNLYLDEYFWIPKFQELQKVASGMALHKKWRETYFSTPSSLTHSAYPFWSGAQFNRGRAKADRVDIDLSHASLAAGRLCADGQFRQIVTVEDAVRGGCDLFDLEQLRTRYSPEDYQNLLMCVFMDDLASVFQLAMLQKCMVDSWEVWDDFEALALRPFGWKEVWIGYDPAKGTQNGDSAGCVVIAPPAVPGGKFRILERHQWRGMDFRAQADAIKTLTQQYNVTYIGIDSTGVGLGVYENVKAFFPQVKEFVYNPTVKNALVLKAYDTMATGRLEFDASHLDIAQSFMSIRKATTSSGNRPTYETSRSEEVSHGDLAWATMHALANEPLQGQAAHTQNIVEMY >NC_010694|625326:694971|685890_686430_+|WP_012440373.1|DBSCAN-SWA MKLTVGDVRGLTSGEISMAISVFGNAINYAFVKVHNDSYLPFNLQSNRVAMTPNGEIYFREPYYKDDFSIAPAGDKHWFIHEMVHVLQHQIGMNVRLRGTFSWAASYEYSLPPDKLLSDFGMEQQASIISDYYYVKSFGLNNFDRLSGFRGIVGPDLKLKYQNTLRVFLTNPRSRSAFL >NC_010694|625326:694971|667817_668846_-|WP_012440355.1|DBSCAN-SWA MSAFTPASEVILRHSDEFTQRRVLFAGDLQDDLPAQLETALSRVHTQQYHHWQILSRVLGENAHYGLFASAETLADCDTLVYYWPKNKPEAQYQLQNLLALLPVGSDIFVVGENRSGVRSAEQMVADWAKLEKIDSARRCGLYHGRLDSRPTFDADTFWDEYPLGELTVKTLPGVFSRDGLDIGSQLLLSTLKPHMKGKVLDVGCGAGVLSAMLASFSPKVRLTLTDVNAAAIASSKATLAANQLEGDVFASNVYSDISGRFDMIISNPPFHDGVQTSLDAAQTLIRGAVSHLNTGGELRIVANAFLPYPQVLDETFGSHEVLLQNGRFKVYRAVKSRAAKK >NC_010694|625326:694971|678839_679550_+|WP_042958634.1|DBSCAN-SWA MSETPAIGWYGKLPSAGDFLKRRFPEALFNSWSHWFQLGLLDWKQSEEQRPDGGRQFGNAPVWNFVVPPLLGSRLVQMGCLLPARDSVGRQYPVCALLSYNLTQWSPQRLARAGEWYQQLGRTLLQGVRNGCSAERLDEALLAIPAPPEPEHGDASGILEAIGYDDRQDTLGWRQAADCFSPQQYTSFWWTNRTDGYPLYTHLHSGNFTSQLFSMLFDPAEGAKPGRNGLYPPMFE >NC_010694|625326:694971|629290_629869_-|WP_012440306.1|DBSCAN-SWA MGIQKNTLEPLTILDRIISVYGFTQKLQLANHFEMSPSSLQNRYTRGTISYDLAAFCSLETGASLRWILTGEGPQFEGSPSITDPKNMDLYTLNNGILDKNSILSIDSNILNKQISKGIAVRAEGKLHFVDQEAPHSDGLWLVDIESANSIRELTILPGRRLHVAGGKVPFECNFDDIRLLGRVVGIYSEIN >NC_010694|625326:694971|629998_630262_+|WP_012440307.1|DBSCAN-SWA MASEIAIIKVPAPIVTLQLFAELEGVSERTAYRWTTGDNPCVPIEPRKIRKGCKKAGGPIRIYYARWKEEQTRKALGHSRFQLVIGS >NC_010694|625326:694971|651016_651532_+|WP_012440338.1|tail|DBSCAN-SWA MALPRKLKHLNLFNDGNNWQGIVESLTLPKFTRKFEKYRGGGMPGAVDVDMGLDDGALDTEFSIGGTELLLFKQMGKATVDGIQLRFTGSIQRDDTGDVQAVELVVRGRHKEVDSGEWKTGESSTTKVSSTNSYAKLTINGEVLYEVDLVNMVEIVGGVDLLEEHRNALGL >NC_010694|625326:694971|692041_693916_+|WP_012440380.1|plate|DBSCAN-SWA MDSKQLDYYNRELAYLREMGAEFAGRYPKVAGRLGIRGDEVADPYVERLMEGFAFLTSRVQLKMDAEFPRFSQRLLEMIAPNYLSPTPSMAIAELQPDSRKGDISNGFLVPRGTMMDSQALKQNGVTCSYTTAHDVMLQPVRIKSVELGGIPADVPLGELGLSQRGAVSALRIRIACDESVTLSHLSFDSLMFFLSGPDMQALQLLELVMQHYVGGYCQSLGSSPQRTPLGDDALRQEGFAADQALLPNDLRNFDGYRLLQEYFAFPARFQFISIGQLRPLLQACDPQAREFDIVLLLDSANPGLERVIDHSHLALHCTPVINLFPKVAQRQKLSDSVHEYHLVVDNIRPLDYEIYSVQKLYASGERQREEQQFRPFWSTFSDDGGNYGAYFSLRREQRTLSERAQRFGTRTGYVGSEVFLSLVDEHNSPWHEDLRYLTAEVMCTSRDLPLLLLQQQGQFVMPDSIPVRQLTLRKGPTPPRPALAEGKIAWRLVSHLQMNYLSLMDGDDGQGAAALRQMLGLYANLAEAAAARQIDGIRHCQLKPVFRRVPEPGPIVFARGIGIELEVDEQAFSGSSPWLLGSVLERVFSRLVAINTFTEMTLTSQQRGDVGYWAPRMGKRTLI >NC_010694|625326:694971|628262_629291_-|WP_012440305.1|integrase|DBSCAN-SWA MAIRKHPSGVGWLSEIYPNGAKGKRIRKKFATKGEALAFEQFTVQNPWQEEREDRRTLKELVDAWYSAHGITLKDGIRRQQAMHHAFGCMGEPLARDFDAQMFSRYRERRLKGEYARSNRVKEVSPRTLNLELAYFRAVFNELNRLGEWKGENPLKNMRPFRTAEMEMAWLTHDQIALLLDECKRHDHPDLETVVRICLATGARWSEAESLKKSHLAKYKITYTNTKGRKNRTVPISKELYESLPDDKKGRLFSDCYGAFRSALERTGIELPAGQLTHVLRHTFASHFMMNGGNILVLQRVLGHTDIKMTMRYAHFAPDHLEDAVRLNPLNHQLNNYTTAIN >NC_010694|625326:694971|643879_644308_+|WP_012440328.1|lysis|DBSCAN-SWA MNRLLALVLALALAALGWQSWRLNNASHTIEMQGAALKSKMQELTKKNSQLIGLSILTETNSREQMRLYAAVEDTAALLRSRQRRTEELKRENEDLRRWADTPLPADIIRLRERPALAGGAAYREWLSQSDAVPSGKVSTAQ >NC_010694|625326:694971|652011_654789_+|WP_012440341.1|tail|DBSCAN-SWA MSDNNLRLQVVLGAVDKLTRPFKNAQAGSKELASAIRQTRDQIKKMSDAGGQLNSFDRLTQSVSRTGTELDQARLRAQMMTREMSLLESPTKKQTQALEAQWRAVSRLEQKQQQETRQMAVARAELYRLGLSAGGGARETARITRETERYNRQLAEQERRLREVGERQRKLNAIKARAEKTHELRNSLAGNGAGAMAAGVTTGMTLLAPVKAYSESENAANQLAGSMMGPGGKVAPEFEKINRLAVALGDKLPGTTADFQNMMTMLRRQGMSAQVILGGLGESAAYLGVQLQMAPTAAAEFAAKLQDATQTSEKDMMSLMDVIQKGFYAGVDSGNMLQGFSKISSAMDIIHKKGLDAAKTFAPLLVMADQAGMAGESAGNAYRKVFQSVMNTEKVKDANDELKGTGVRFEFTDGKGEFGGLEKMYTQLAQLQKLNTEKRLATLKGIFGDDAETLQVLNIMITKGISGYSETASKLQNQASLRERVDASLNTLGNKWEAATGSFTNAMASIGETVAPALKKLSDWLGELASRLDGFVKRHPQLTSALFTLAAGFAIVATAAGGVSLALASVLGPMAVVRMSAGVMGLKFSSVFGLIGKAISSVGKSVVWLGRLMFANPILAVIGLIAAGAIYIWQNWDTLGPKFKAMWDAVCNATATACDWIKEKASATWEGIKSLFFNYTLPGLIAKNWDAIQSGASEAWANIRQSISDKWNSILADAAALPTKFQDMGSAIIDSILNGINTKWETLKSKFSSVTDYLPDWMTENNKTQGKAQVQVVGGAAAAAVPFAGMYDSGGVIPHGQFGIVGENGPEIVNGPANVTSRRRTAALASVVTGVMGVAAAPAEAAPLHPYSLPVIAYKQSQPAKSASVPPAIRYEINAPIHITAQPGQSAQDIAREVARQLDEREHKARAKARSNFSDQGGYDS >NC_010694|625326:694971|690639_691461_+|WP_012440378.1|DBSCAN-SWA MHSLHHLMDGQSLAETLASLESQIKRQPADADLRASFVQLLCLAGNWSRAQTQLKSWLALKPQAQPTVTLLEQAINGERQRAAVFAGEAAPRMPGAAWGWAEQLLAALAADAAGDTARAQQLRAAALDEAQLNPGQLTHQNQQHAFDWLTDGDGRLGPICELIANGQYSWLPFSAISEMRFQAPVSVTDLVWRHTLVRLVDGSEQVCQIPVRYPFGEQADDRYRLATLTEWQPLAGDDPQFSGHGQKCWFSADAEFPLLGMETLAFSEAEPGL >NC_010694|625326:694971|645917_646277_+|WP_012440332.1|DBSCAN-SWA MTLYIGMSQGNGKVITDTDHLRQSVRDILLTPQGSRIARREYGSLLSALIDQPQNPALRLQVMSAVYVALSRWEPRLTLDSITISSNFDGSMVVELTGQRNNGAPVSLSVSTGADNGSD >NC_010694|625326:694971|644827_645274_+|WP_012440330.1|DBSCAN-SWA MDELQRVDDWLTALLANLEPAARSRMMRQLAQQLRRTQQQNIRMQRNPDGSGYEPRRVTARSKKGRIKRKMFAKLRTTKYLKTAASADSASVQFDGKVQRIARVHHYGLRARVSRKGPEVRYAERRLLGVNDEVETVTRDTLLRWLAG >NC_010694|625326:694971|679567_680587_+|WP_012440365.1|DBSCAN-SWA MDLEALLAPITPDRPCGDNLEYDADYMAMDQASAGKAEQQFGDTIIPAEPADWNKVERLALDLLGRSKDLRVMLALTRAWTQLKGLSGYADGLHLIQQALLLYWQPLWPSLEEDGIEDPFYRLNALAALGDKSALTAALRQASLLRYATDEISLRDASGLLDGSKTECPGYPGGRARLQDELARGGQPGIEAVVKISERLLTIRETLTERLGAGALPEMDQLLKTINSVAAACQATDLSTLIPAVETAASSATSAPAAAPGAQQHADWRSVQLSSRSDAQLMLEKVKQYFSQHEPSHPAPLMIERVQRMIELDFMDIIRDLAPDGVHQLETIFGRRDHS >NC_010694|625326:694971|689622_690630_+|WP_012440377.1|DBSCAN-SWA MKYWLSGAFTLMIASSAWAQDHRLVQSPALKLDIWIDNVKSTSAESWCARTLPLRIVANGKKDPALLDDYLPKVGSFLQKQCPALSQINWQMNDDGGKKLAVGSASKAQGWAVKTALETPVTTPPVTPETPPPAQTPAVASTPRAEDLSPAADTTPWVQFSLLDGCHFRTFWRGSSQTSALFVPAKGGVSCGSDGWLKGSGITTQLGHGAAKNLAMTFLQGFPIAGLNDKALGNSLHIVTVNNQRMVLNDSRLADSWMVLPYAPELNGWQANGVLVVQIPAAEAADNGTLQKRLNEVRNLWSPLLKNSTDLTIKLVDALHPQLQDPAAGAFRTLH >NC_010694|625326:694971|683236_683719_+|WP_012440369.1|DBSCAN-SWA MAIDMYLKVDGITGESKDSNHTGWIDVTSFSWGATQPGNMSVGGGGGAGKVNFNDLHINAKIDKAVTALLKNCASGKHVGKVEVSVCKAGGTQIEYTRITLEDVLVTNVQFVGSDGDDTLGVTYAFQAAKVKQQYWEQSTSGGKGAESSAGWNIKENKEA >NC_010694|625326:694971|645342_645921_+|WP_012440331.1|plate|DBSCAN-SWA MNAQLTEIMRLITNLIRTGTVTEVDRDNWLCRVKVGELETNWINWLTLRAGGARTWWCPSPDEQVVVLSMGGNLETAFALPAIYSSQFAPPSDSVAGCVTEFPDGGRFEYEPATGRWHVRGIKSLVIEAADTITLKTGEFVVEADTTRINSEMVINGGVIQGGGAMSSNGIVVDKHGHTGVKSGGDTSGGPV >NC_010694|625326:694971|649834_651007_+|WP_012440337.1|tail|DBSCAN-SWA MAQDYHHGVRVVEVNEGTRSITTMSTAIVGMVCTGDDADASMFPLNKPVLLTNVLTASGKAGESGTLARSLDAIADQAKPVTVVVRVAQGETEAETTSNIIGGVTADGKKTGMKALLSAQSQLGVKPRILGVPGHDTQAVATELLSVAQSLRGFAYLSAYGCKTVEEAIAYRGNFSQREGMLIWPDFIDFDTVLNADATAYASARALGLRAKIDEQTGWHKTLSNVGVNGVTGISADVFWDLQEPATDAGLLNQNDVTTLIRKDGFRFWGSRCLSDDPLFAFENYTRTAQVLADTIAEAHMWAVDGVLNPSLARDIIEGIRAKLRSLKTQGYIIGADCWLDESVNDKDSLKAGKLTIDYDYTPVPPLENLMLRQRITDRYLLDFSSQVSA >NC_010694|625326:694971|643514_643883_+|WP_012440327.1|DBSCAN-SWA MVKSYWLPVSVLVLLVMVDVIFPASYAAFPLALIIWFEYAAFSLVCFVGLYSCTLTGSDRLQVRHLLGRVLELLGRIPLTWYQRLVIAFVMSLAGWKLTGMVFVFTVAMSLVIQDELKAMRE >NC_010694|625326:694971|642128_642593_+|WP_042958622.1|head|DBSCAN-SWA MKFVAPEQAPEQTEVIKNTPFWPDVDLSEFRSVMRTDGTVTQPRLKQVVLTAISEVNAELYDFRNRQQTLGYRALAEVPAEMLDGKSVRIRHYHTAVFCWARAVLNERYQDYDATASGVKRGEELAEASGDLWRDARWAVSRLQDAPHCTVELI >NC_010694|625326:694971|646263_647172_+|WP_012440333.1|plate|DBSCAN-SWA MAVIDLSQLPAPQIVDVPDFETLLAERKAAFVALYPADEQDAVRRTLALESEPVTKLLQESAYRETLLRQRINEAAQAVMVAYAIGSDLEQLAANYNVKRLTVTPADNNAMPPVAAVMESDEALRPRIPAAFEGLSVAGPTAAYEFHAKSADGRVADVSATSPAPAQVVLTVLSREGDGTAGADLLAVVDQALNSENVRPVADRLTVRSAEIIPYSVDATIFLYPGPEAEPVMAAAKASLQKHIASQTRLGRDIRRSAIYAALHVEGVQRVELASPLADVVLDKTQAASCTEWSVTNGGTDE >NC_010694|625326:694971|642995_643505_+|WP_012440326.1|DBSCAN-SWA MNPSIVKRCLVGAVLAIAATLPGFQSLHTSVEGLKLIADYEGCRLQPYQCSAGAWTDGIGNTSGVVPGKTITERQAAQGLITNVLRVERQLEKCVVQPMPQKVYDAAVSLAFNVGTGNACSSTLVTLLNQQRWADACHQLPRWVYVKGVFNQGLDNRRAREMAWCLKGV >NC_010694|625326:694971|684262_684631_+|WP_012440371.1|DBSCAN-SWA MVIKIGWVLLAAALLFSIQAGAKAAYTPSQYLKNYALSTCISQGYQSKEVKEDAAAAARGYLEFGDYSLAAHTAVRNLGKAFLAKEYTSQSGQPMTLAKCIDFYHSEQLDKLVKQFKGKQDD >NC_010694|625326:694971|663571_664006_-|WP_012440351.1|DBSCAN-SWA MSLIPYSAIHRRFAACAALLAMLLLFIAPVISTSLAQHHGGSGMMLHQGMEMDMAMPADHQPPVGADVAISNTTHTPSPIMDDAACGYCVLLVHLPLDLFPSPLLWSSLQAVAAPDVPPCQRVISLLAPHFFHPRAPPRCLPVR >NC_010694|625326:694971|664304_665510_+|WP_012440352.1|DBSCAN-SWA MFNWTSTQRNVAFASFASWALDAFDFFILVFVLSDIAANFSTSVSDVSLAIMLTLAVRPVGALLFGRLAENYGRRPILMVNIITFTVFELLSAWSPTLTWFLFFRVVYGVAMGGVWGVASSLAMETIPYRSRGLMSGIFQAGYPCGYLLASVIFGLCYSLVGWRGMFLIGALPILLLPFIWFKVPESPIWLAARQRKESVALLPVIKSHWKLCAYLVLLMACFNFFSHGTQDLYPTFLKVQHGMEPHIISMIAVCYNIAAMLGGVFFGVLSEKIGRKKAIMIAAILALPVLPLWAFSSGSWAIGIGAFLMQFMVQGAWGVVPTYLSELVPANTRAVLPGFVYQLGNLIASVNATLQSGIAEAHGNNYGLAMAIVAGTVAVLICLIVAVGRETRGINMSNPP >NC_010694|625326:694971|635552_636014_+|WP_012440317.1|DBSCAN-SWA MDTTEQLNGTYFYGGLSNLNAGELFYWIMVDVTAEHFTGATAATGNVIAAAAIYAGRNNVAVSGKLANATPGTSWASIQSRRLLQKYKLPFPLPTIVGNPFKMKIIMTKKLGTFVGRTVPVIGWAIVASDVAIIGWKSVNRYNTIASAEDKIW >NC_010694|625326:694971|656417_656636_+|WP_012440344.1|DBSCAN-SWA MMNCPKCGHAAHTRSSFRVSDHTKERYCQCQNINCGATFVTHETVVRYIATPNLIDHAPPHSSMGGQGHMSF >NC_010694|625326:694971|688810_689605_+|WP_012440376.1|DBSCAN-SWA MNITLASMSNQGARASNQDQVGDIVGDRSACFVVCDGVAGLPGGDIAASVARDTLLQRFDGQQHLNAQLIRQYVNDANSAIRQRQKADPPHHRMGTTLVSLFIDRDYQLAYWAHAGDSRLYLFRRGYLYHVTTDHSLVQQMKDAGHQTDGINGNLLYFALGMGDEDRDASYSDVVPIEDGDAFLLCTDGFWHGVSQHHMQQALHMVNTPQEWLTLMQQMIKKNDEQSNDKQDNYSAVAVWIGEPQDTTLLHSLSEAAQFIALRD >NC_010694|625326:694971|639309_640173_+|WP_012440321.1|capsid|DBSCAN-SWA MAKKAKRFRIGVEGATTDGRTIVRSWLEQMAANYDPAVYTAVINMEHIKGYTPDSAFRRFGVVDALDTEEISDGLLKGKLGLYAVINPTDELVTMTGNMQKLFTSMEIRPEFADTGEAYLIGLAVTDDPASLGTEILQFSASAGANPLANRKQHPNNLFTAATETVIEFEDVADDKPSLFSRVSALFSNKQKSDDARFGDVHKAVELVATEQQEFSQRIETALSEQASSLQAQFTEGLSAEVAAREQLQADFSQLQERLSREDGRQDFRPRTPGNGSGNSQDVRTDC >NC_010694|625326:694971|636343_637399_-|WP_012440319.1|portal|DBSCAN-SWA MSKRRNRTRTQSVPQPDNMTSGAASEAFTFGDPIPVLDRRELLDYVECVINDRWYEPPVSVDGLARTFRAAVHHSSPISVKCNILASTFIPHPLLSQQAFTRFAMDYLIFGNAYLEKRISRLGNTLKLEPSLAKYTRRGLDLDTYWYAHYGLNTEPYEFTKGSVFHLMEPDINQEIYGVPGYLSAIPSALLNESATLFRRKYYLNGSHAGFIMYMTDPAQSQQDVDNIRSAMKSAKGPGNFRNLFMYSPNGKKDGIQIIPLSEVAAKDEFLNIKNVSRDDMLAVHRVPPQLMGIIPNNTGGFGDIEKASRVFVRNELIPLQARMKELNDWLGLGQEVIRFAPYNLDLEDGN >NC_010694|625326:694971|626624_626819_+|WP_042958615.1|DBSCAN-SWA MTEFIDTFYLFNLEHEVGSENLKTFQTLADKYSHLLSEAEKEVEEKEAEAFYGIRPSDYEFLTE >NC_010694|625326:694971|657392_657989_-|WP_012440346.1|DBSCAN-SWA MTITLMRHGKPHRSLTGRRSALAMAQWCDAYDLAEICDKPTDSSLRLAAQADIIVTSPLPRARSSLERLGKTASRIDSLYSEVALPVIPLAFPTLPPFVWLLLLRVIWLLGYSGKVESYAQAKQRAVKAAGRLSELAQHGNVLLMGHGIMNKLISRQLRKLGWRGEKHAGSRHWSSAVYRKINSNDRKADKTLLRGIG >NC_010694|625326:694971|654785_655271_+|WP_012440342.1|tail|DBSCAN-SWA MMMVLGLYVFMLRTVPYQELQYQRSWRHAANSRVNRRPSTQFLGPDNDSLTLSGVLLPEVTGGRLSLLALEQMAELGKAWPLIEGSGTIYGMFVIESLSQTKSEFFESGMPRRIEFTMTLKRVDESLSDMFGSLSDRLSNLQDSATSAIGNIKNTVGGLLQ >NC_010694|625326:694971|685172_685580_+|WP_012440372.1|DBSCAN-SWA MGNLFIYLCRRLAIFAKSVFFSLLTMFFLILSAEADNKYSPQTNLKNYALSTCLSQGYQNKEIKEDAAAAARGYLEFGDYSLAAHTAARELGKTFLDKQYGSQSGAPMTLAKYIDFYHSQQLDKLVKEFKDKRDD >NC_010694|625326:694971|651899_652019_+|WP_012440340.1|tail|DBSCAN-SWA MADIATIFHWSPSVTDVMPLTDVLEWRHKAIQRSGASDE >NC_010694|625326:694971|665647_666745_-|WP_012440353.1|DBSCAN-SWA MNVQSYDELLGSKHRLSLMLFLLLNTATAIFCVLRVIHHDTLTVTTPTIAIIIISIILLTWMLLKPVAIFPLLNIAAALSGLLWAWHITLRYQQELHLESGYLLVALVSVFFISAIALGDYLLPFLLHTAPGSVVVLALDRGQSTLIILFIIILPLMGFSLHHLMLRKRDKFTRRLVRQLYEEKETYSDLSMLDPLTGLYNRRGLQNRLENVIDNHTGNHYVLLLDIDNFKSYNDNYGHAMGDQALTRVSVAIRDAVRSRDIVTRYGGEEFLVILTNVNEAIALKQAERVRQFVLDLEIPHRFNDKVATHVTISVGVAPLIANDFDKALAHADRALYLAKSQGRNTVFSLHDASKRAPGKPADIV >NC_010694|625326:694971|630973_631315_+|WP_012440310.1|DBSCAN-SWA MAIEGPTATIPLSPGERLEGLNHIAELRAKVFGLDIEPELERFIKDMRAPRDVNHKQNERALAAIFYMAKIPAERHGVNISDLTTDEKRELIKAMNHFRAVVSLFPKRLTMPN >NC_010694|625326:694971|631609_631906_+|WP_012440312.1|DBSCAN-SWA MADSMDLVQQRVAENLACELANIIRRPVMPSAFFCECCDAAIPEARRKALDGVTLCVTCQSIAPIPEARRKALDGVTLCVTCQGIAERKSKHYKGAGL >NC_010694|625326:694971|672587_673931_+|WP_012440361.1|plate|DBSCAN-SWA MSKAEKVVWTEGMFLRPHHFQQSENYLQSTLRDWGQAQRPWLWGLHDIEFDESMLRQGKVALLSASGLLPDGTAFAFSNGDDAPAPLLIPDNLTQAKVVLALPARRGGREEVIFSESGDSLARFISFEREVDDFNAMAVGPAAVQFGRLRLRLMLESELSAEWTAIGVARIAEKRNDHQLRLDGSYIPPMLNAINQSQLMEYIGDIHGLLVQRSQQIGQRLQQPGRFNTADMVDFMLLTLINQQLGHISHVKSLPLIHPETLFHGWLTFAAELTSWMPSRTPDGALPTYQHDDLAGCFSQLVLLLRQGLSQVMEENALQLPLTERSHGLNVATVPESSMVREFGFVLAVRANVPAEAIQTHFPAQMKVAPVTKIRDLVQLQLPGIMLRAMPAAPPQIPWHAGYNYFQLDKGSELWQEMEKSGTFALHLAGEFPGLEMEFWAIRSHAV >NC_010694|625326:694971|672055_672568_+|WP_042958632.1|DBSCAN-SWA MHKISISRLVNQAFLLAALILLAGCVSTSRSVPTQYSLVFQAHPQINDSAPLKVRVLLLKSDADFMAADFYSLQNNPQGVLGQNLLNSEQFFLMPGQTGKKLLGQTSPEARYIGIMAEYQALDGKTWRISLPVPLPAERRFYQFWQGNTDNLRADIIADINGVRVVNPRD >NC_010694|625326:694971|636007_636307_+|WP_042958620.1|DBSCAN-SWA MVMDDNEKAVFALVEEYNGHWFWLRKRFRLTPATDLNKDFRMAPEDAAELLETFADRFSVDPKEINFGRYFPADNGKAEKPLTIQLLIDSARAGHWIDK >NC_010694|625326:694971|682688_682964_+|WP_012440368.1|DBSCAN-SWA MIKRGGVTERGGDTASLAGGKRVLRKSLRGLSRIALTYLAFLGFIACAGYYVLIFDWHISATATALHIALMIVAIAALIGIYGIAEKLRSA >NC_010694|625326:694971|641389_642031_+|WP_012440323.1|terminase|DBSCAN-SWA MTNPFRAHTRFIQAQEAAQRGSNSRHAKGYDLMLLQLNEDRRRLKGIQSNVNKAQVKIEVLPKYAAWVEGVLSVDGAQQDDVIMYVMLWRIDAGDYAGALTIGRHALKHGWVMPIGKRTTSTVLAEEMADAAKAAILAETPFDADLLLQTLESVDGEDMPDQSRARLHKSIGWAQTGNSPVSALNHLKQALQLDERCGVKKDIEQLERKLRNS >NC_010694|625326:694971|644403_644835_+|WP_012440329.1|tail|DBSCAN-SWA MNKPQSLRSALNKAVAYVRDNPDKLHLFVDNGSLVATGASSMSWEYRYTLNVVIEDFSGDQNLLMAPVLLWLSASQPDAINNPALREKLFTFEVDILRNDVCDISMSLQLTERVLVSTDGTVSSVEAVPEPNKPEEMWTVKRG >NC_010694|625326:694971|625326_626058_+|WP_012440303.1|DBSCAN-SWA MDTVIAFLSLALFIAFIVGLIKPSLVMMPNRKRSSALYLGGCLALSFIGSILWPTEKSQRVAKADVPAVKAEPAPPTFEYADKTLKEYRNELKETRHDIVKDYVNFKSVPASSTDAFYACMSEYSFTKDDALKLGDVLGWCFNHFEKDPQSLNNKINLDTFKGNFSGWDGSYRPLEKLIKASMNDDSSYKHISTVYHLILNKDPYAVVKTTFRGTNAYGGVVKQTVAARVNVRTGEVLSILDN >NC_010694|625326:694971|671530_672031_+|WP_042959253.1|DBSCAN-SWA MKIRIALSLLFVLSVVGCKAPAPKITDDTVVSSTVDGVTLSYRHAITPPQSFTPVGEEYRALYAASVMSRPNFGGKLVRNLDNGQTFTVLGSVENNWFAIADAGQEQLIGYVPLRAGVKSALYDQTLKADQRRKRVRAPAKKKTCVAVDGDSKACQNNNNGTWIID >NC_010694|625326:694971|662090_663509_-|WP_042958626.1|DBSCAN-SWA MEELMSQQPAHVSAHATSSRSSLLSLLRRLHFYIGLFIAPFIFVAALTGTLYVVTPQLENAFYHHQLFTTSTGIDQPLSAQVARAGEFIGSNMKIAAVRPAPHQGETTRVMFYRDDLAPSETRAVFIDPKTLEVRGELTAYGTSGILPLRTWLDYLHRNLLLGDVGRHYSELAASWLWVAALGGAVLWASRRRVRRITSDKQPTRAQRLGRFRHWHSTLGLLLLLGLVFFSATGLTWSRWAGDNISLMRSHFGWMTPVVNTSLQPAMAMDMPMDEHAEHHGHSRMMHGGKSIAVNPLNFDTVLRAARDAGIDAAKLEIRPAYRADKAWTVSETDRSWPTQVDSVAVDPRNNTVIDKVKFAQYGLLAKLTRWGVDAHMGVLFGVANQLVLALFGLGLCLMIAFGYRMWWLRRPQQGQISPLDTLIGCWQLLAPGSRLLLLVLTILVAVSLPLMGISLLILLAIDRWRWRQSMK >NC_010694|625326:694971|631382_631610_+|WP_012440311.1|DBSCAN-SWA MRNTETRSFNTDSNALAVLLTDAKKEERKDRALAVSIRLEALAIHITKVGMSGTEAAELLRREATRFENESQELH >NC_010694|625326:694971|660019_661033_+|WP_012440348.1|DBSCAN-SWA MPKQINQRATRADVAKEAGTSVAVVSYVVNNGPRPVALATRERVLAAIKKTGYRPNNVARALASGTTKTYGLVVPNVNNAFIASLAHELQQEALANDMVMLLGDAGDDRKRELQLINNLLSQQINGLIYISVDRHPYIDVLQASGTPFVMLDRVDPSLQVNVLRVDEREAARQVTSHLLSHGYQDVGIICGPLERLNSQDRLHGWREALAEYGVHERPEWVFSTPYTREGGYTAAKRMLESGTLPRALFATNEAQAIGCIRALYEHGVRVPEQIALVCFNGTDQSAYHLPSLTTVRQPVREMAKAAIKMLVNWKGETTLREFSHQLEIGESCGCRPS >NC_010694|625326:694971|661053_662040_+|WP_012440349.1|DBSCAN-SWA MKRLIIDCDPGNGITGANVDDGLAIALALSAPEVSLELITTVAGNTQSEIGYSVAKDLIERLGQSVPVIKGADAALSEPSAPWRASLDLRVHSHQLAHLWQGVRQPQSYSPPPVEAADAIGQLICAHPGEITLVAIGPLTNVALALDRYPQMADAVQEIAIMGGVFALDDFIKDTNFGIDPEAAHRVLTSGANITLVPMDVTSQTLMTHQDLNRIEQIDTPLARFVTETLRPWIDYSIQTRRLAGCWIHDALVVAWLLNKQVATAADYFVNVELREGMTRGKAWRFRQPLRLDVGIGQPEGRPVQVLKTVDNSLLLAMLEQSLALPLR >NC_010694|625326:694971|669021_669441_+|WP_012440356.1|DBSCAN-SWA MSPRRDWLLQQMGITQYTLRRPRALQGEIAVTLPAETRLVIVADNPPMLHDPLVADVLLALNLRQPQVQVLTPDQLAMLPDDARCHSWRLGLDAPVTLAGTQIASPVLAELYNNAEAKRALWQQICHYESDFFTHSARS >NC_010694|625326:694971|642799_643015_+|WP_000171565.1|DBSCAN-SWA MTLERISAFITYCIAVLLAWLGDLSLKDASTVGGVLIGVLMLAINWYYKHQSFKLLRGGKISRGEYESFNR >NC_010694|625326:694971|631902_632139_+|WP_042958617.1|DBSCAN-SWA MKTILKRVGSKSATMPERVKSLYRRFDINHINARRSIGVAAGEGKRVAEVIAVSTSTVCTGHNPSCTPRCNVVAGARR >NC_010694|625326:694971|626925_628248_-|WP_012440304.1|DBSCAN-SWA MAGYFYDFTKEHSFHGEFWSAPHDNKDRFSAKIEYTPYNGLVLDYCISDSDSPRTCQRLYGVLNTGEPCTLIGSFDFLQGSMHFGKLRVLTGKHYFKAIIFNGIYTEEDSVEYCDIALHGMQEFIHPQGFISQLKYSTKPILSIHGSEWKIDVINNATFSMIGDSLVNIIDCQHEEAFNKFTKDFWSTKKEYPKAFFSIRKNLKFFLRYANTINDSIIKHIDDIWKLTGLFSILLDKPVIPDELNIKFKGKQKNNPCLFSNGIEQRTIDLALSTINHHFLPLNWKQIDMGEVISKWLNMSDEYNPLSVTYQYETGLRTLHQAHADIILYATQLESINLTLSAKNEDKYIGPINKYASIDLKNKLEAIFSKFNKKTIGENITIVRGELAHVGRPKKLMKVMSIDDYIKIGLYLKITITAHLLSQLGLTKEQIERYQSKVAP >NC_010694|625326:694971|630809_631010_+|WP_012440309.1|DBSCAN-SWA MLTKEPSLASLLVKQSPAMHYGHGWIMGKDDKRWHPCPSQNELLAGLSTTKQGKSWLLKALRQLFH >NC_010694|625326:694971|656651_657026_-|WP_012440345.1|DBSCAN-SWA MKKPLLALLLITSQSAFADKIPNSIENLIAGYDTRTHVLESGELAIRYNKQALMIDAAKSLFSAICDDYFMNKWSPGTIKKITLWSVTSDQGYKINGGGMECKKTGSMDFKQAEEYRTSLIEKI >NC_010694|625326:694971|630292_630802_+|WP_012440308.1|DBSCAN-SWA MFDYCVSKHPHFDEACRTFALRHNMAKLAERAGMNVQTLRNKLNPEQPHQITPSEIWLLTDLTEDSTLVDGFLAQIHCLPCVPMNEVAKEKLPHYVMSATAEIGRVAAGAVSGDVKTTAGRRDVISSINSVTRLMALAAVSMQARLQSNPAMASAVDTVTGLGASFGLI >NC_010694|625326:694971|683798_684278_+|WP_012440370.1|DBSCAN-SWA MVIRPAFLQAWQRFSEINIDVSSVGKKIGGNVGANITLGEQDPAQGFTNACAIRMSYTLNYSGAKVERGVWKTVSGDDKNWYIYRVKDLLTYMHKKYGKPDKIVKNPKPGDFQNLKGILVFAVNGWSDASGHATLWNGSVCSDHCYFPISNEGSIWLLK >NC_010694|625326:694971|666990_667227_+|WP_049778741.1|DBSCAN-SWA MLGKRLDSGWGVLVPCAMMPLLALMELSFSEWRLLMVVAFLATVIMLFHKRLRHYLLLPSCIALAGGLAAISVNFNGV >NC_010694|625326:694971|655267_656368_+|WP_012440343.1|DBSCAN-SWA MNFSSDHFDLNRRSPAFSITIEGKDVTTALDARLMSLTLTDNRGFEADQLDLELDDADGQIVLPRRGAVIQLALGWKGRPLFPKGAFTVDEIEHSGAPDRLTIRARSADFRETLNTRREKSWHQTTVGDVVKEIASRHNLKMALGKDLTDKAVDHIDQTNESDASFLMKLARQYGAIASVKYGNLLFIRQGQGRTASGKPLPVITIERKAGDGHRFTLADRGAYTGVIASWLHTREPKKKETTRVKRRRKKTTKPKEPEAKQGDYLVGTYENVLVLNRTYANHSNAERAAKMQWERLQRGVASFSLQLAEGRADLYTEMPVKARGFKQPIDDAEWTITTLTHSVSADNGFTTSLELEVKIDDLELE >NC_010694|625326:694971|647164_647770_+|WP_012440334.1|tail|DBSCAN-SWA MNSLLPPGSSPLERRLAQTCSGISDLQVPLRDLWNPATCPVNFLPYLAWAFSVDRWDESWVESVKRRVVQDAFYIHQHKGTASAVRRVVEPFGFLIRIIEWWQTGEQPGTFRLDIGVQDQGITEETYLELERLIGDAKPCSRHLIGMSINLQTSGPYFVGAATYIGEEITIYPYINETIISGGTAYEGGAVHVIDTMRVNP >NC_010694|625326:694971|686426_686834_+|WP_012440374.1|DBSCAN-SWA MKKPVIIPIVALLLTACQLEKPQFERMQVRVNHNQPCFIIPQNAADRHAALTSNGPMVSWFDQQQWQVISPSSVTAEDRAVNAGECTQWPGINWDPGSYNVFMRVNDRASGDIIRYRADFSLLRNQQGELSLGSQ >NC_010694|625326:694971|691457_692024_+|WP_012440379.1|plate|DBSCAN-SWA MSQHSDEGHKLLHGGYRQRRQQEGLNVRDKMQPSLLDRLTDHAPDNPREAANSTLISHSALRRNVLRDLQWLFNTINNEAQQDLSPFPHVQRSTVNFGIAPLAGQRMSDIEWQDIQRKLSAAILNFEPRILPAGLQVRCVSDIGSLDLHNVLSIEIKGRLWCVPYPLEFLFRTDVDLENGHFALRDIG >NC_010694|625326:694971|669403_669844_+|WP_012440357.1|DBSCAN-SWA MNQISLLTPHDLDAAFAIERRSHAFPWTEKTFASNQGERYLNLRLTVDGVLAAFAITQVVLDEATLFNLAVDPTFQRRGLGRELLQHLICELTQRNVMTLWLEVRASNRAAIALYEQLNFNEVSIRRNYYPTTSGKEDAVIMALTI >NC_010694|625326:694971|680649_681186_+|WP_012440366.1|DBSCAN-SWA MAVSKSSGQKFIARNRAPRVQIEYDVEIYGAERKIQLPFVMGVMSDLVGKPLEAQPGVDERKFLDIDVDNFDERMKALKPRVAFQAENTLTGEGRLNIDLTFNSMEDFSPDAVARNVEPLNQLLDARTQLANLLTYMDGKNGAEELIGKILQDPTLLKSLSHLPKADDAPADDHGNKE >NC_010694|625326:694971|647766_649107_+|WP_012440335.1|tail|DBSCAN-SWA MSAKFYTLLTEIGAAKLASAAALGVPLKITQMAVGDGGGVLPAPSAQQTRLVAEKRRAALNMLYIDPQNSSQIIAEQVIPETEGGWWIREVGLFDETGALIAVGNCPESYKPQLAEGSGRTQTVRMVMITSSTDNITLKIDPAVVLATRKYVDDKVLELKVYVDDIMAKHIADNDPHTQYAPKSSPTFTGMPKAPTAATGNNSTQLANTAFVQVAIAALVDSSPGALDTLNELAKALGNDPNFATTMTNALAGKMDKSANGADIADISAFLNNLGLGAGSALPVGVPVPWPLAAAPDDWIKCNGATFDKAKYPRLAMVYPSGSLPDLRGEFLRGWDDGRGVDGGRVILSSQDALVAGHVHTLARMWSSSDETNSSVKHLGVSNNIHNTTKDNMGNGILEEADGGLGIVVGCGAGGNFASTTAIKSNSSSTDNRPRNIAFNFIVRAA >NC_010694|625326:694971|681189_682692_+|WP_012440367.1|DBSCAN-SWA MSNQSQQSGDLQQQQGYSEDAFSALLNKEFRPKSDQARAAVESAVKTLAQQALENTVTVSNDAYRTIQALIAEIDEKLSLQINQIIHHEDFQQLEGAWRGLSYLVNNTETDEMLKIRFMSISKQELGRTLKRYKGVSWDQSPIFKKIYEEEYGQFGGEPFGCLVGDYYFDHGPQDVELLSEMARIGSAAHCPFITGTAPGVMQMESWQELANPRDLTKIFQNTEYAAWRSLRESEDARYLGLVMPRFLSRLPYGIRTNPVDSFDFEEQTDGANHGNYTWTNAAYAMAANINRSFKDFGWCTSIRGVESGGAVENLPCHTFPSDDGGVDMKCPTEIAISDRREAELAKNGFMPLVHRKNSDFAAFIGAQSLQKPAEYHDADATANARLAARLPYLFACCRFAHYLKCIVRDKIGSFRERDEMERWLNDWVMNYVDGDPANSSQETKSRKPLASAEVQVQEIEDNPGYYAAKFFLRPHYQLEGLTVSLRLVSKLPSLKSNDA >NC_010694|625326:694971|669858_670539_+|WP_012440358.1|DBSCAN-SWA MLKNWDWILFDADETLFHFDAFAGLQRLFQRYDISFTRADYDDYQAINKPLWVDYQNGAISALQLQHQRFEGWAAKLDVTPQDLNGGFLRAMAEICTPLPGAAELINALQGRVKIGIMTNGFTALQQARLEHTGFSGLFDLLVISEQVGYAKPHPAIFDYALGQMANPPRDRVLMVGDNPDSDILGGINAGMATCWLNSDGRSRPQGIKPDWEVTSLTELQGLLGA >NC_010694|625326:694971|651582_651885_+|WP_012440339.1|tail|DBSCAN-SWA MSDKLTEKTVKLDTPIMRGKTEITEIVLRKPQSGALRGTRLQAIMDMDVGAMMTVIPRISTPTLTAQEMAELDPADLTAMAVEMVTFLLPKSVLADLPTT >NC_010694|625326:694971|632174_634550_+|WP_042959250.1|DBSCAN-SWA MKPGGTDDAAWAFPWNAPKKAINPYLDRPEVKPSALSDPIALFAAENEGAKQRRAALSDEAWNRYFYNESRDPVLKEMEQERLTGRARLIHEQHRFNPDLVIIDNVRAEPAFISKPLMQRIAYFQQLDRPKACSRYLRDTITPCLQRLERVRDSQASASFRFMASRDGLDGLLVLAEMNQHQVKRLATLVGAHMSLCLEEAGSALFTADEVKPQEIRRVWERVAAEAMRLDVIPPAFEALRRKKRRRKPVPYELIPGSLARMLCADWWYRKLWQTRCEWREEQLRAVCLVSKKASPYVSYEAVVHKREQRRKSLAFFRAHELVSENGDTLDMEEVVNASASNPAHRRNEMMACVKGLELIGEMRGDCAVFYTVTCPSRFHATLSNGRPNPTWSSATVRESSDYLVNTFAAFRKAMHRRGLRWYGVRVAEPHHDGTVHWHLLCFMRKKERRSISALLRKFAIREDRAELGNNTGPRFKSELINPRKGSPTGYIAKYISKNIDGRGLAGEISKETGKSLRDNAENVNAWASLHRVQQFRFFGIPGRQAYRELRLLAGQAGRAQGDKKAGAPVLDNARLDAVLAAADVGCFATYIMKQGGVLVPRKNHLIRTAYALNDEPGTYGDRGIRIYGIWSPLVAGRICTHALKWKKVRKAVDVQEATADQGGSAAPWTRGNNCPLVENLNKSGGELPDIKTMDEKELQEYLHNMGQKERRELTARLRLVKPKRKKAYKQTISDHQRLQLEAELSSRGFDGSESEIDLLLRGGSIPSGAGLRIFYRNHCLQEDGKWRQWY >NC_010694|625326:694971|649106_649703_+|WP_012440336.1|tail|DBSCAN-SWA MQSAVIENGFAVIAGEIDVFNYDGLTRVYLTQTTEFIPVGVSIPANACTDKPLAAKKDYVVCRNSQLTGWEYLADHRGETVWNIRTGAEQQITVPGDYPADTTIYSPSTPYDKWNGERWVTDEAAKAEADIAEAAAAKAVLIKSAAAKIEPLQDAVQLDMATDEEKSRYDAWRKYRVLLTRVDISQAPDINWPEPPKD >NC_010694|625326:694971|673948_675190_+|WP_012440362.1|DBSCAN-SWA MKQEKHSDAAQAGIGAANGHNPLVAAANPLLNAIPQIRHSVSHPDPAGLRQRLVDEIRQFEMNCQRAGLPWEVIIGARYCLCTALDEAAALTPWGGRGVWPGNGLLVTFHNETWGGEKFFQLLARLSQTPREQIALLELINFCLQLGFEGRYRVMDNGRSQLETIRQRLLQMIRSVRGGYAPPLSVHPEDHPVTRKLWRPVVPLWACTAVVGFLACLLYILLNWRLGDYTSPVLASVYQTSLPEVSIHNPAPPPPAAVNLKTFLKPEIEQGLVAVRDEADRSVVTLKGDGLFTSASTEVRGRYAEVLDRVAAAMNNVSGRILVVGYSDNVPIRSARFASNYELSLARAQSVSGQLRQHLSQPQRVKAEGRGESSPLAPNNSAENRARNRRVEITLLVSPGNTRVELNDMAQGN >NC_010694|625326:694971|642592_642796_+|WP_012440325.1|tail|DBSCAN-SWA MKVRAHQYDTVDALCWRYYGRTQGVTEQVLQANPGLAEYGPFLPHGLQVELPDITASTTAQTVQLWD >NC_010694|625326:694971|693912_694971_+|WP_012440381.1|plate|DBSCAN-SWA MTQQPAAQAEKIHRLHRLPPDFWAALCATPWRYDLFQLLRRLDAQGGQRYPLGRAPLPRHEPLRIGQRPSLAFAPAAIASVTPRDGSALHDVSIYSFGLFGPNGPLPVHLTEYARERSDHHQDNSLSAFTDLFHHRLTLLFYRAWADAQPTVALDRAEKRQFERWLASLIGMGQPGQLRQGSLSPHSRLALAGHLTRQARDAEGLQKILSHYFQVPVRLVENVPHWQPVDRRDRASLGAGRHKPRLGVSAFLGVAVRDVQHKFRLELGPLSMDSYRRFLPGEPWAQQLRDWVRQYLGIEFLWEVRLILDAAEVQGVTLGGASRLGYSSWLGRPAIPQHRSDLTFSPEPQESV >NC_010694|625326:694971|658543_659938_+|WP_012440347.1|DBSCAN-SWA MSTPQNVAQDSAIQADSAADERLTTREGRKDFWRATFSCWLGTAMEYADFALYGLAAGIIFGDVFFPASTPAMALLSTFATFSVGFVARPIGALFFGWLGDRKGRKVVMVSTIILMGASTTLIGLIPSYASIGLWAPACLVLLRFTQGFGAGAELSGGTVTLGEYAPKQRRGLVSSIIALGSNSGTLLASLVWLAVIQMDQQSLLEWGWRIPFLCSSLIALVALWIRRNLKETPVFERKKAEMEAQRARIRVTQPPVQDTRGFWRRSRAFLTMVGLRIGENGPSYIAQGFIIGYVVKILAVDKSVATSAVMIASLLGFLIVPLAGWLSDRFGRRITYRWFCLLLILYAFPAFMLLDSREPAIIIATIVTGMGLASLGIFGVQAAWGVEMFGVHHRYTKMATAKEVGSILSGGTAPLVAAALLSWTGHWWPIATYFAVMAAIGFLTTFVAPETRGRDLNAAEDAI >NC_010694|625326:694971|634697_634886_+|WP_012440314.1|DBSCAN-SWA MQDYFLESLKLQRIDFFIKLVAASECSEEEKRLAIQWVSELTDELMAKIRSHEYCRSMDVIS |
80 | Salmonella_phage(73.68%) | plate,terminase,tail,lysis,capsid,head,portal,integrase | attL 626788:626805|attR 657181:657198 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
785208 : 794976
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_010694|785208:794976|DBSCAN-SWA GATGGGTAAGATTATTGGTATTGACCTGGGTACAACCAACTCATGTGTTGCCATCATGGATGGCGGCAAGGCGCGCGTGCTGGAAAATGCAGAGGGCGATCGTACCACGCCTTCGATTATTGCTTACACCCAGGATGGTGAGACCCTTGTCGGTCAGCCTGCTAAACGTCAGGCCGTGACCAACCCGCAGAATACCCTGTTTGCGATCAAGCGTCTGATTGGCCGTCGTTTCCAGGATGAAGAAGTTCAGCGTGATATCAAAATCATGCCATTCAAAATCGTTGGCGCTGACAATGGCGATGCCTGGTTAGACGTGAAAGGCCAGCGTGTGGCTCCGCCGCAGATCTCAGCTGAAGTGCTGAAAAAAATGAAGAAAACCGCCGAAGATTATCTCGGCGAAGCGGTGACTGAAGCGGTTATCACCGTGCCCGCTTACTTTAACGATGCGCAGCGCCAGGCGACTAAAGACGCCGGACGTATCGCGGGTCTGGACGTAAAACGTATCATCAACGAACCTACCGCAGCCGCTCTGGCTTACGGTTTGGATAAAGGCCAGGGTAACCGCACCATCGCGGTATATGACCTGGGCGGTGGTACTTTCGATATCTCCATTATCGAAATCGATGAGGTTGATGGCGAAAAAACCTTCGAAGTTCTGGCAACCAACGGTGATACCCATCTGGGTGGTGAAGACTTCGACAGCCGTATGATCAACTACCTCGTGGCCGAATTTAAGAAAGACCAGGGTATCGATCTGCACAACGATCCGCTGGCCATGCAGCGTCTGAAAGAAGCCGCAGAGAAAGCTAAAATTGAGCTGTCTTCCGCGCAGCAGACCGATGTTAACCTGCCGTACATCACGGCGGATGCGACCGGTCCTAAGCACCTGAACATCAAAGTTACCCGTGCCAAACTGGAATCACTGGTTGAAGACTTGGTTACGCGTTCTATCGATCCGCTGAAAGTGGCTTTGCAGGATGCGGGTCTGTCGGTATCTGATATCAACGACGTTATCCTCGTCGGTGGCCAGACGCGTATGCCAATGGTGCAGGCGAAAGTGGCAGAGTTCTTCGGTAAAGAACCACGTAAAGACGTGAACCCGGACGAAGCGGTTGCCGTTGGTGCTGCGGTTCAGGGCGGCGTGCTGGCTGGTGAAGTGAAAGACGTTCTGCTGCTGGACGTGACCCCGCTGTCGCTGGGTATCGAAACCATGGGCGGTGTGATGACCTCACTGATTACCAAAAACACTACCATCCCAACCAAGCACAGCCAGGTGTTCTCTACTGCGGAAGATAACCAGTCTGCGGTGACCATCCATGTGGTGCAGGGCGAGCGTAAACGTGCTGCGGATAACAAATCTTTGGGGCAGTTTAACCTTGACGGCATCCAGAACGCACCGCGCGGCATGCCGCAGATTGAAGTAACCTTCGACATCGATGCTGATGGCATCCTGCACGTTTCCGCTAAAGATAAAAATAGCGGTAAAGAGCAGAAAATCACCATCAAGGCTTCTTCCGGTCTGAATGACGAAGAGATCGAAAAAATGGTGCGCGATGCCGAAGCTAACGCCGAGTCTGACCGCAAGTTCGAAGAGCTGGTGCAGACGCGTAACCAGGGCGATCAGGCTGCTCACAGCACGCGTAAGCAGCTCGACGAAGCGGGCGATAAACTGCCAGCAGAAGACAAAGCGCCAATTGAAGCGGCCCTGACCGAGCTGAACACCGCTCTGAAAGGTGAAGACAAAGCGGAAATCGAAGCCAAGATTCAGGCTCTGATGGAAGTCTCTACCAAGCTGATGGAGTTCGCTCAGCAGCAGCAGGCCGCTGGCGGTGCCGCTGATGCTGCCGAAGGCGCGAAAAAAGATGACGATGTTGTCGACGCTGAATTTGAAGAAGTGAAAGACAGCAAAAAATAAGCGCCCTTGAGCGGGCACGGTAGCCCACCGCGGGCTACCTGAATTAACCAGCACGGGCGTAGGATTTTATCCACGCCCGTGCACGCATGTTAAGGGCAGGATCAATAACAATGGCGAAGAGAGACTATTACGAGATTTTAGGCGTTGCCAAGTCGGCGGACGAACGTGAAATCAAAAAGGCCTACAAACGTCTGGCCATGAAATTCCATCCTGACCGCAACCAGGGCGATAAAGAATCTGAAGGCAAATTTAAAGAGATTAAAGAAGCCTACGAAATCCTGACCGATGGTCAGAAAAGAGCGGCCTACGATCAGTACGGTCATGCGGCCTTCGAACAGGGTGGCATGGGCGGCGGCGGCCACGGTGGATTTGGTGGCGGCGGTGCTGACTTCAGCGATATCTTTGGCGACGTATTCGGCGATATTTTCGGCGGAGGCCGCCGTCAGCAGCGCGCCGCTCGCGGTGCTGATTTGCGCTACAACATGGAGCTGACGCTGGAAGAAGCGGTACGCGGCGTGTCCAAAGAGATCCGTATCCCCACGCTTGAAGAGTGTGGCGTTTGCCACGGCAGCGGAGCAAAAGCGGGAACGAAACCGCAAACCTGTTCAACCTGTCATGGCGCAGGCCAGGTGCAGATGCGTCAAGGCTTCTTTACCGTCCAGCAGGCGTGTCCGACCTGTCACGGACGTGGCTCTGTGATCAAAGACCCGTGCAACGCCTGTCATGGTCATGGCCGGGTCGAGAAATCGAAAACCCTGTCGGTGAAAATTCCGGCTGGCGTTGATACCGGCGATCGCATTCGTCTTAGTGGTGAAGGGGAAGCCGGTGAGCAGGGCGCTCCCGCTGGCGATCTGTATGTGCAGGTTCAGGTGCGTAAACACCATATCTTCGAGCGTGAAGAGAACAACCTCTATTGCGAAGTGCCCATCAACTTTGTGATGGCCGCGCTGGGTGGAGAGATTGAAGTCCCAACGCTGGATGGCCGCGTCAACCTGAAGGTGCCGGCAGAAACGCAGACCGGCAAGCTGTTCCGGATGCGTGGCAAAGGCGTGAAGTCGGTACGTGGCGGTGCGCAGGGCGATCTGCTGTGCCGCGTGGTGGTCGAAACGCCCGTCAGCCTGAATGAGAAACAAAAGACGCTGCTGCGTGAACTGGACGAGAGCTTTGGCGGCCCGTCCGGGGAAAAAAACAGCCCGCGCTCTAAAACGTTCTTTGATGGCGTGAAGAAGTTTTTTGACGACCTGACGCGCTAATCACATTGGCCCGGCTCCCTGCTTAGGCAGTGAGCCGGGCCAGCCTATTTTCCGTTATCCCTTTCTGTGTTTACGCGCCCATCCCCCTTGGCTTACCGCCCCATTTTGTCAATCAACGCATAAATAGTTGCATCGCTTAGTGTTGCGTTTAGATTCCATCTGTTAGTGTGTCCGACTTAAATTTATCTAACGGAATTGGATTCTATGAAAAAGATTTTGGTCGCAACGACTTTAGCCGTTCTTTTATCCGGCTGTGCACAGCAGACTTTTCAGATGAAACATAATCAGGTGGCCGCGCCTAAGCAGGTTACCACTCATCACTTCTTCGTTTCGGGCATCGGCCAGCAGAAAACGGTAGACGCAGCGGCAATTTGTGGCGGTGCTGCCAAGGTGGAACGCGTTGAAGTTCAAGAGACTTTTGTCAATGTCCTGCTCAGGGTAGTGACTTTAGGTATCTACACCCCGCGCGAAGCGCGCGTGTACTGCGAACTGTAAGCTCGCTGAATTGATTGAGTAAAGCCGCCTCTGGCGGCTTTTTTATTTATCCTGCGTTCCACTCGATTTTATCTCTCCCCCGCATCCTGCTTTCCCGTTTCACCCGCGGAGAAAGGCGCGGCATAAACTGTTTTTTTGCATGGACTAAACACGTATGCTGTCCATCTTCTGGACAATATCACCTGGGTAATCAGCGGAGTCTGTAATGAACCTGTTTCTAAAAAAACTGTTAAAAAATGACGCTACCGGCGGCGTGGTACTGATAGTCGCCGCAGCATTTGCCATGTTTCTTGCCAATAACGACAGCACCCGGCACGCCTATCAGGCGATGCTTACGCTGCCGGTTCAGTTTCGTTTCGGTGCACTGGATATCAATAAAGATCTGCTCTTATGGATTAACGATGCGCTGATGGCGCTCTTTTTCCTGATGATCGGCCTGGAGGTGAAGCGTGAACTGATGATGGGCTCGCTCAAGGGAAGAGAGCGAGCAATGTTTCCGCTGATTGCCGCGCTGGGCGGCATGCTGGCACCGGGATTAATTTATGCCGCATTCAATCACCAGGACGCGCAGGCGATCCACGGTTGGGCCATTCCTACCGCCACCGATATTGCGTTTGCTCTGGGCATTCTTGCCCTGTTGGGGAGCCGCGTACCCGCCGCGCTAAAAATGTTCCTGATGGCGCTGGCGGTGATAGACGACCTGGGGGCGATCGTTATCATTGCCCTGTTCTATACCAGCGAGCTTTCGCTGATTTCTTTGACCGTGGCGGCTGCCTCCATTGCGGTTTTGGCCGTGCTGAATGGGTGTGGTGTACGTAAAACGTCAGTTTATCTGGCCGTGGGGATGGTGCTGTGGGTGGCGGTACTGAAATCCGGCGTACATGCCACGCTGGCCGGGGTAATTGTTGGCCTGTTCATACCGCTCAAGAAGCAAGAAGGGCACTCCCCGGCGATTGAGCTGGCGCATGGCCTTCACCCCTGGGTCAGCTGGCTGATCCTGCCGCTGTTTGCTTTCGCTAATGCCGGAATTTCTCTCAGCGGAGTCTCTTTAAATGGCCTGTTCTCTGCCGTGCCGTTGGGGATCACGCTGGGGCTATTCATCGGTAAACCGCTGGGGATCACCCTGATCTGCTGGCTGGCGGTGAAGCTGAAAATTGCCGCGCTGCCCGAAAATACCCGCCTCATTGATATTGCTGCGGTCGGCGTGCTCTGCGGTATTGGATTTACCATGTCGATATTTATCGCTTCGCTGGCGTTTGACGGCGCGCATGAAGAACTGGTGACGCTGGCTAAGCTTGGCATTCTGAGCGGATCGGTGATCTCCGCGCTGGTTGGCTATACGTTGCTGCGAGTGAAGCTAAGGTAGGGCACGGGGCGGGCTGCACGGTTGGACGGGGTAGCCCGCTCTGCGAGTTTTGCCTGGTGTAACAGGTTTTTCCATATTGTATGGAGTGCAGCGAGCAAAGGTGAGGCAAGGCAAAAATTCACGAAAAAGCGCAGCTTGCCGGAGGCGGTGAGTATTTGAAGTGCTCTTTTTACACCGAACAGAGTGCAGCGAGCAAAGGTGAGGCAAGGCAAAAATTCACGAAAAAGCGCAGCTTACTGAAGTCAGTGAGCATTTTGAGGCGCTTTTTAACGCCGCATCACCGCCGCGCAGCAAACGGTCAGGAAGAATGCCAATAAAAAAACCGCCGTATAGGCGGTTTCTTTATCAAAGCTCGACTAACAAGCAGGCTTAAGCCATTTTGCTAATCTGTGCAGTCAAGTTTGCTTTATGACGTGCAGCTTTGTTTTTGTGGATCAGGCCTTTAGCAGCCTGACGGTCCACAAGTGGTTGCATTTCATTAAATGCATTCTGCGCAGCCGCTTTATCGCCAGTAGCGATAGCCGCGTATACTTTCTTGATAAAAGTACGCATCATTGAACGACGGCTAGCGTTATGCTTACGACGCTTCTCAGACGTTACGGCGCGTTTCTTAGCTGATTTGATATTAGCCAAGGTCCAACTCCCAAATAGTTCTATGAGGACATTCAAAGGCCGAGGAATATGCCCTTTTCGCCTTCGGTTGTCAATGGATTTGTGCAAATAAGCGCCGGTGAAGCAAAGGCACTTGGTTACGTTTCGATGGCGCAAGATTCTATCAGCTTCGTCCCGGCTAATACAGCTTAATCACAAGAAAAATATTATCCCGCGCTGGGAGAGCACCATACAGAATGAAATAGCAGGAAGAAAGCACCAAAGCCGGTATTCGTGGGTTAACCTTCAGGGCTGTACAAGGTATAATCCGCCGATTTCCACAAATTTGAGCCAGTTATGAAGTTGATACGCGGCATACATAACCTTAGAGCGCAGCATCGCGGCTGCGTGCTGACCATCGGCAACTTCGATGGCGTACATCGCGGGCACCTGGCGCTACTTGCGCAGCTGTGCGCAGAGGGGCGCGAACGCAATTTGCCGGTGATGGTGATGCTGTTTGAACCCCAGCCGTTGGAACTGTTCGCCGCTGAAAAGGCGCCCGCCCGGCTGACGAGGCTGCGCGAAAAGCTGCGCTATCTGGAACAGGCCGGTGTGGATGCGGTACTCTGTGTGAGCTTTGACCGCCATTTTGCCGCTTACAGTGCGCAACGTTTTATCACCGATCTGCTGGTCAACAGGCTCGGCGTACAGCTGCTGGCGGTGGGCGATGATTTTCGCTTTGGCGCTGGTCGCCAGGGTGATTTCCTGTTATTACAGAAAGCTGGCGTTGAATATGGCTTTGATGTCATCAGCACCCAGACGTTTTGCGACAACGGAAAACGTATCAGCAGCACGGCGGTGCGCCAGGCGCTGGCGGAAGATAATCTGCCGCTGGCACGGTCACTGCTTGGTCGCCCATTCAGCATTTCTGGTCGTGTGGTGCATGGCGATGCGCTGGGCCGCACCATCGGTTTTCCTACTGCTAATCTGCCGCTGCGCCGTACCGTTTCCCCGGTCAAAGGGGTTTATGCCGTTGAGGTGCTGGGCCTGGGGCCGCGAGCGCTGCCCGGCGTAGCGAATATCGGGACACGGCCTACCGTTGCCGGTCTACGTCAACAGCTGGAAGTGCATCTGCTGGACGTTACCATTGACCTTTACGAACGGCATATTGAAGTGGTACTGCTGGATAAAATACGCGATGAGCAGCGTTTTAACTCGCTCGATGCACTGAAAGAACAAATTGCAAACGATGTTGTGACTGCCAGACGCTTTTTTGGCCAGTCAACATCGGTTTAAGTTTAAACAGAACCGACATACGGAACCGAGAATCTGATGAGTGACTATAAATCTACCCTGAATTTGCCGGAAACGGGGTTCCCAATGCGTGGCGACCTGGCCAAACGCGAACCGGGCATGCTGCAACGCTGGTATGATGACAAGCTGTACAGCATCATCCGCGAAGCCAAAAAAGGGAAAAAAACCTTTATTCTGCACGATGGCCCTCCCTACGCTAACGGCAGCATTCATATTGGTCACTCGGTTAACAAGATCCTGAAAGATATTATCGTTAAGTCGAAAGGCATGGCGGGCTATGACTCACCTTATGTCCCAGGCTGGGACTGCCACGGCCTGCCGATTGAGCATAAAGTGGAGCAGACCATCGGCAAGCCGGGCGAGAAGGTCAGCGCGGCGGAATTCCGTGCGGCCTGCCGTCAATACGCCGCCGAGCAGGTGGAAGGCCAGAAGGCGGACTTTATCCGTCTGGGCGTGCTGGGCGACTGGGATCGTCCTTACCTGACGATGGACTTCAAAACCGAAGCCAACATCATCCGCGCGCTGGGCAAAATCATCGGCAACGGCCATCTGCACAAAGGTGCCAAACCGGTACACTGGTGCCTGGACTGCCGCTCCGCGTTGGCGGAAGCGGAAGTGGAGTATTACGATAAAACCTCTCCCTCTATTGACGTGATGTTTGACGCCGTGGATAAAGACGCGGTGCAGGCCAAATTTGGCGCAGCCCACGTCAATGGCCCGATTTCACTGGTTATCTGGACCACCACGCCGTGGACCATGCCGGCTAACCGTGCCATCTCCCTGCATCCTGAATTCGACTATCAGCTGGTTCAGGTCGAAGGTCGGGCGCTGATCCTTGCCAAAGATATGGTCGACAGCGTGATGAAGCGCGTTGGCGTAACGCAATGGACCGTGCTGGGCGACGTGCAGGGCGCGGCGCTGGAGCTGATGGGCTTCCAGCATCCGTTCTTAGCTCACGTTTCTCCGGTGGTGCTGGGCGAGCATGTGACGCTGGAAGCCGGTACGGGTGCGGTGCACACCGCCCCGGGCCACGGCCCGGACGACTATGTTATCGGGCAGAAGTACGGCATTGAAACCGCCAATCCGGTTGGGCCAGACGGCAGCTTCTTACCGGGCACTTACCCCACGCTCGACGGCCTGAACGTGTTTAAAGCCAATGACACGATCGTTGAACTGCTGCGTGAAAAGGGCGCACTGCTGCACCTTGAAAAACTGCATCACAGCTACCCGCACTGCTGGCGTCACAAAACCCCGATCATTTTCCGCGCCACGCCGCAGTGGTTTATCAGCATGGATCAGAAGGGACTGCGTGCGCAGTCGCTGAAGGAGATCAAAGGCGTGCAGTGGATCCCGGACTGGGGCCAGGCACGTATTGAATCGATGGTCGCCAACCGTCCCGACTGGTGCATCTCGCGCCAGCGTACCTGGGGCGTACCGATGGCGCTGTTCGTGCATAAAGACACCGAGCAGCTGCACCCGGATTCGCTGGAGCTGATGGAAAAAGTGGCCCTGCGCGTAGAGCAGGACGGCATTCAGGCGTGGTGGGATCTCGATGCCCGCGAACTGATGGGCGCGGACGCTGACAACTACGTCAAGGTGCCGGATACGCTGGACGTGTGGTTCGACTCGGGTTCAACCAGCTATTCAGTGGTTGATGCGCGCCCTGAATTTGGCGGCAGCGCGCCGGATCTGTATCTGGAAGGTTCCGACCAGCATCGCGGCTGGTTTATGTCGTCATTAATGATTTCCACGGCGATGAAAGGCAAAGCGCCGTATCGCCAGGTACTGACTCACGGGTTCACCGTCGATGGTCAGGGCCGCAAGATGTCAAAATCGCTGGGTAACACCGTCAGCCCGCAGGATGTGATGAACAAGCTGGGTGCTGATATTCTGCGCCTGTGGGTGGCATCCACCGATTACTCCGGTGAAATTGCCGTTTCCGACGAAATCCTTAAACGCTCCGCCGACAGCTATCGCCGTATCCGCAACACCGCGCGTTTCCTGCTGGCTAACCTCGCCGGGTTCAATCCTGAAACCGATAAGGTTAAACCGGAAGAGATGGTGGTGGTTGACCGCTGGGCGGTAGGGCGTGCGCTGGCGGCACAGAATGATATCGTCGCTTCATACGAAGCTTACGACTTCCATGAAGTGGTGCAGCGCCTGATGCAGTTCTGCTCGGTTGAGATGGGCTCCTTCTATCTGGATATCATCAAAGACCGCCAGTACACCGCCAAAGCCGATGGCCTGGCGCGCCGCAGCTGCCAGACCGCGCTGTGGTACATCGTTGAAGCGCTGGTGCGTTGGATGGCACCGATCATGTCCTTCACCGCGGATGAAATCTGGGGCTATCTGCCGGGCAAACGTGCGCAGTACGTGTTTACCGAAGAGTGGTTTGACGGCCTGTTCAGCCTGGAGGATAACCAGCCGATGAACGATGCCTACTGGGCTGAGCTGTTGAAAGTACGTGGTGAAGTCAACAAGGTTATTGAGCAGGCGCGTGCCGACAAGCGCGTGGGCGGTTCGCTGGAAGCCAGCGTGACGCTGTATGCCGATGCGCAGCTGGCGGAAAAACTCACCAGCCTCGGTGAAGAGCTGCGCTTTGTGCTGCTGACATCAGGGGCTGAAGTGGCGGATTATGCAGGGGCCCCCGACGACGCTCAACAGAGTGAAACGGTGAAAGGCCTGAAAATTGCCCTGCGTAAAGCGGAAGGTGAGAAGTGTCCGCGCTGCTGGCACTACACCAGCGATATCGGTCAAAATGCCGAACACGCTGATATGTGCGGTCGCTGTGTGACTAACGTCGCCGGCAGTGGTGAAGAGCGTAAGTTTGCATGA
Protein sequences of DBSCAN-SWA_2 >NC_010694|785208:794976|787232_788375_+|WP_012440454.1|DBSCAN-SWA MAKRDYYEILGVAKSADEREIKKAYKRLAMKFHPDRNQGDKESEGKFKEIKEAYEILTDGQKRAAYDQYGHAAFEQGGMGGGGHGGFGGGGADFSDIFGDVFGDIFGGGRRQQRAARGADLRYNMELTLEEAVRGVSKEIRIPTLEECGVCHGSGAKAGTKPQTCSTCHGAGQVQMRQGFFTVQQACPTCHGRGSVIKDPCNACHGHGRVEKSKTLSVKIPAGVDTGDRIRLSGEGEAGEQGAPAGDLYVQVQVRKHHIFEREENNLYCEVPINFVMAALGGEIEVPTLDGRVNLKVPAETQTGKLFRMRGKGVKSVRGGAQGDLLCRVVVETPVSLNEKQKTLLRELDESFGGPSGEKNSPRSKTFFDGVKKFFDDLTR >NC_010694|785208:794976|788579_788870_+|WP_012440455.1|DBSCAN-SWA MKKILVATTLAVLLSGCAQQTFQMKHNQVAAPKQVTTHHFFVSGIGQQKTVDAAAICGGAAKVERVEVQETFVNVLLRVVTLGIYTPREARVYCEL >NC_010694|785208:794976|789075_790236_+|WP_012440456.1|DBSCAN-SWA MNLFLKKLLKNDATGGVVLIVAAAFAMFLANNDSTRHAYQAMLTLPVQFRFGALDINKDLLLWINDALMALFFLMIGLEVKRELMMGSLKGRERAMFPLIAALGGMLAPGLIYAAFNHQDAQAIHGWAIPTATDIAFALGILALLGSRVPAALKMFLMALAVIDDLGAIVIIALFYTSELSLISLTVAAASIAVLAVLNGCGVRKTSVYLAVGMVLWVAVLKSGVHATLAGVIVGLFIPLKKQEGHSPAIELAHGLHPWVSWLILPLFAFANAGISLSGVSLNGLFSAVPLGITLGLFIGKPLGITLICWLAVKLKIAALPENTRLIDIAAVGVLCGIGFTMSIFIASLAFDGAHEELVTLAKLGILSGSVISALVGYTLLRVKLR >NC_010694|785208:794976|790605_790869_-|WP_012440458.1|DBSCAN-SWA MANIKSAKKRAVTSEKRRKHNASRRSMMRTFIKKVYAAIATGDKAAAQNAFNEMQPLVDRQAAKGLIHKNKAARHKANLTAQISKMA >NC_010694|785208:794976|791184_792123_+|WP_012440459.1|DBSCAN-SWA MKLIRGIHNLRAQHRGCVLTIGNFDGVHRGHLALLAQLCAEGRERNLPVMVMLFEPQPLELFAAEKAPARLTRLREKLRYLEQAGVDAVLCVSFDRHFAAYSAQRFITDLLVNRLGVQLLAVGDDFRFGAGRQGDFLLLQKAGVEYGFDVISTQTFCDNGKRISSTAVRQALAEDNLPLARSLLGRPFSISGRVVHGDALGRTIGFPTANLPLRRTVSPVKGVYAVEVLGLGPRALPGVANIGTRPTVAGLRQQLEVHLLDVTIDLYERHIEVVLLDKIRDEQRFNSLDALKEQIANDVVTARRFFGQSTSV >NC_010694|785208:794976|785208_787122_+|WP_012440453.1|DBSCAN-SWA MGKIIGIDLGTTNSCVAIMDGGKARVLENAEGDRTTPSIIAYTQDGETLVGQPAKRQAVTNPQNTLFAIKRLIGRRFQDEEVQRDIKIMPFKIVGADNGDAWLDVKGQRVAPPQISAEVLKKMKKTAEDYLGEAVTEAVITVPAYFNDAQRQATKDAGRIAGLDVKRIINEPTAAALAYGLDKGQGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATNGDTHLGGEDFDSRMINYLVAEFKKDQGIDLHNDPLAMQRLKEAAEKAKIELSSAQQTDVNLPYITADATGPKHLNIKVTRAKLESLVEDLVTRSIDPLKVALQDAGLSVSDINDVILVGGQTRMPMVQAKVAEFFGKEPRKDVNPDEAVAVGAAVQGGVLAGEVKDVLLLDVTPLSLGIETMGGVMTSLITKNTTIPTKHSQVFSTAEDNQSAVTIHVVQGERKRAADNKSLGQFNLDGIQNAPRGMPQIEVTFDIDADGILHVSAKDKNSGKEQKITIKASSGLNDEEIEKMVRDAEANAESDRKFEELVQTRNQGDQAAHSTRKQLDEAGDKLPAEDKAPIEAALTELNTALKGEDKAEIEAKIQALMEVSTKLMEFAQQQQAAGGAADAAEGAKKDDDVVDAEFEEVKDSKK >NC_010694|785208:794976|792159_794976_+|WP_012440460.1|tRNA|DBSCAN-SWA MSDYKSTLNLPETGFPMRGDLAKREPGMLQRWYDDKLYSIIREAKKGKKTFILHDGPPYANGSIHIGHSVNKILKDIIVKSKGMAGYDSPYVPGWDCHGLPIEHKVEQTIGKPGEKVSAAEFRAACRQYAAEQVEGQKADFIRLGVLGDWDRPYLTMDFKTEANIIRALGKIIGNGHLHKGAKPVHWCLDCRSALAEAEVEYYDKTSPSIDVMFDAVDKDAVQAKFGAAHVNGPISLVIWTTTPWTMPANRAISLHPEFDYQLVQVEGRALILAKDMVDSVMKRVGVTQWTVLGDVQGAALELMGFQHPFLAHVSPVVLGEHVTLEAGTGAVHTAPGHGPDDYVIGQKYGIETANPVGPDGSFLPGTYPTLDGLNVFKANDTIVELLREKGALLHLEKLHHSYPHCWRHKTPIIFRATPQWFISMDQKGLRAQSLKEIKGVQWIPDWGQARIESMVANRPDWCISRQRTWGVPMALFVHKDTEQLHPDSLELMEKVALRVEQDGIQAWWDLDARELMGADADNYVKVPDTLDVWFDSGSTSYSVVDARPEFGGSAPDLYLEGSDQHRGWFMSSLMISTAMKGKAPYRQVLTHGFTVDGQGRKMSKSLGNTVSPQDVMNKLGADILRLWVASTDYSGEIAVSDEILKRSADSYRRIRNTARFLLANLAGFNPETDKVKPEEMVVVDRWAVGRALAAQNDIVASYEAYDFHEVVQRLMQFCSVEMGSFYLDIIKDRQYTAKADGLARRSCQTALWYIVEALVRWMAPIMSFTADEIWGYLPGKRAQYVFTEEWFDGLFSLEDNQPMNDAYWAELLKVRGEVNKVIEQARADKRVGGSLEASVTLYADAQLAEKLTSLGEELRFVLLTSGAEVADYAGAPDDAQQSETVKGLKIALRKAEGEKCPRCWHYTSDIGQNAEHADMCGRCVTNVAGSGEERKFA |
7 | Chrysochromulina_ericina_virus(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1508344 : 1515280
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_010694|1508344:1515280|DBSCAN-SWA TATGACCAAGCTTAAAGCAGTTATCCCCGTTGCGGGGCTCGGAATGCACATGCTTCCTGCAACAAAAGCTATTCCTAAAGAGATGCTGCCCGTAGTCGATAAGCCTATGATCCAGTACATCATCGACGAGTGCGTAGCGGCGGGTATAAAGGAAATTGTCCTGGTCACCCATGCGTCCAAGAATGCGGTTGAAAACCATTTTGATACTTCTTACGAACTTGAAGCCCTGCTTGAGGCTCGCGTTAAACGCTCACTGTTGAGTGAAGTCAAATCTATCTGCCCGCCGGGAGTGACGATTATGAATGTGCGCCAGCCGCAGCCGCAGGGATTGGCTAATGCTCTGCTGTGCGCCCGCCCGATGCTGCACGATGAAGCGTTTGTGGTGGTACTACCGGACGTGCTGCTGGATAATGCCAGCGCCGATCCCCTGCGTTACAACCTGGCGGCAATGGTGGCTCGTTTCGAAGAAACCGGGCGGAGCCAGGTGTTAGCTCATCATATGCCGGACGCCGATCTGTCAGAGTATTCGGTGATTACCACTGAAGAAGCGCTGGATTTTCCGGGTAAAGTCAGCCCTATTTTAGACTTCGTTGAGAAACCAGATAGCCCGCAGTTGTTAAATTCCAATCTCGCCGCGGTAGGGCGCTATGTGCTTTCTGCCGATATTTGGCCGGAGTTGGAAAAACTTGAGCCAGGCGCGTGGGGCCGTTATCAGCTAACGGACGCCATTGCCAATTTGAATAAGAAAACGCCCGTGGATGCCCATTTGCTCAGCGGCGACAGCTTTGACTGCGGTCGCAAACTGGGATATATGAAAGCATTCGTCACTTGGGGGCTGCGTAACCACACCCAGGGACGTGGGTTTCGTGAAGAGATCCAAAAAATTCTGGCAAAGTGAACGCTATTTGATGAAGGTCGCGCCGCCTGCGTGACCCGAAGGAGTTAACATGTCTATTTTAGTTACCGGTGGCGCTGGTTATATTGGTTCTCATACCGTTCTGGCTCTACTGGAGCGCGGTGATGAGGTTGTGGTTCTGGATAACCTGAGCAACGCGTCACGTGAGTCGATCGCCCGCGTGGAAAAACTCGCTGGCAAGACGGCCGCTTTTTATGAGGGAGATATCCTCGATCGCGCCTGTTTACGTAACATCTTCAAAGCTCATGACATCAGCGCCGTGATCCATTTTGCGGGGCTTAAAGCGGTAGGGGAGTCCAGCCGCAAGCCGCTGGAATACTACCAGAACAACGTTTCCGGCACGTTGGTGCTGTTGGAAGAGATGCGCAACGCCGGGGTTAAACAATTTATTTTCAGTTCATCGGCAACGGTTTACGGTGCGGATGCGCCTGTCCCTTACGTAGAAACTACGCCAATTGGCGGCACAACCAGCCCTTACGGCACGTCCAAACTGATGGTTGAACAAATTTTACGTGATTACGCTAAAGCAACCCCCGAATTCAAAACCATTGCTTTGCGCTACTTCAATCCGGTAGGTGCGCATGAATCAGGTCAGATTGGAGAGGATCCTAACGGAATCCCTAACAACCTGCTGCCGTATATTGCTCAGGTTGCGATCGGTCGTCTGGAAAAGCTGGGTATCTTCGGTGATGACTACCCGACTAAAGACGGTACTGGCGTGCGTGATTATATCCACGTGATGGATTTGGCTGAAGGTCATCTGAAAGCGCTGGATCATCTGTCTGCGATTGAGGGATACAAAGCCTATAATCTCGGGGCGGGTGAAGGTTATTCGGTGATGGAAATGGTTAAGGCCTTCGAGAAGGCTTCCGGCCGTAAGGTGGCTTATCAGATCTCACCGCGGCGCGATGGCGACCTTGCCGCGTTCTGGGCCGACGCATCCCTTGCCGATAAAGAGTTGAACTGGCGTGTCAGGCGTGGAATTGATGAAATGATGCGCGATACGTGGAACTGGCAGAGTCAGAATCCGCACGGCTACAGCTGATTGATAGCTGAAACTAATGCAAATCCGGCCAGTGCCGGTTTTTTTGTTAAATGAAATTGTTAAATTTATTGTGTTAAGCATCATAAGCTTATATCTTTCGAGCACCGGGTTAAGTATCTTATCGACTGCCGAAGCAGTAGAAAATTCGTGCATTCACGGTGCGAATTCACTCGACGCTGCTGGAAATTTTCCAGTTTAGTTAGTTTTCAGAAAAAGTGGATGATATAATTTATACCTTGTTAACCGGTGGGTGGCTTTTTAGAACAGATGATAGTTAGCTGTATCAGCATGCTGAGAGCTGTAACTCAGGGGCGGTAGCGTGCTTATTTCCCGATAGTCAACTCTTTTCTCTGCTTCAGGTCTAAATTTCAATTAAAGAAAACCACAGTGTTTTGAGAAATACTATGAAAATTTTGGTAACTGGTGGTGCTGGCTTCATTGGTTCAGCTGTTATACGCCATATCATTAAAAACACTAATGATACAGTATTGAATGTCGATAAACTCACCTATGCTGGTAATTTAGAGTCCTTGCAGGATATCAGTGATAGTCCTCGTTATCATTTCTCTAAAACAGACATCTGCGATCCAGAGTCCCTTAATCGTATATTTAATGATTTCGAGCCGGATGTCGTCATGCATTTGGCTGCCGAAAGTCATGTTGACCGTTCAATCGATGGCCCGGCAGCTTTCATCGAAACAAATATAATCGGAACATATGTTTTACTTGAAGCTGCCAGAATTTATTGGTTTGGGCTGCCTTTTGAAAGAAAAGAAGCATTTCGATTCCATCATATATCTACAGATGAAGTGTTTGGTGATTTGCACGGTACAGACGATTTGTTCACCGAGCAAACCGCTTATGCACCGAGTAGTCCTTACTCTGCTTCTAAAGCCTCCAGCGACCATTTGGTCAGAGCCTGGCTGCGGACTTACGGTTTACCCGTAATAGTCACAAACTGTTCCAATAATTATGGGCCATACCATTTTCCCGAGAAGTTGATTCCTCTGACGATACTAAATGCGCTTGCAGGTAAGCCACTGCCCGTTTACGGTAACGGTAAACAGGTTCGAGACTGGCTTTATGTGGAAGATCACGCCAGAGCATTATATCGGGTAGCCACTGCAGGTAGAGTTGGCGAAACTTATAACATTGGCGGTCATAATGAACGGCAGAATATCGATGTTGTGAATACTATTTGCAAGATCCTTAATCGACTTGTCGTTGATAAGCCCTATGGTGTTGACGACTATTCTGCACTTATTACTTTTGTCCGAGATCGTCCAGGACATGATGTTCGCTATGCGATTGATGCTACAAAAATCGGGAAAGAGCTGGGTTGGTTACCCGAGGAGACCTTTGAGACGGGTGTTGAGAAGACAGTCCGCTGGTATTTAGAAAATACAGAATGGTGGCAACGGGTTCAGGATGGGTCGTACGCTGGTGAAAGATTAGGATTAGAGGCAAAGGATTTTTGATATGAAAGGTATTATTTTAGCTGGCGGATCTGGCACACGGCTCCATCCTATCACTCGAGGTTTATCAAAGCAGCTCTTGCCGGTTTACGACAAGCCAATGATCTACTATCCCCTTTCAGTGCTGATGCTGGCAGGCATTAAAGATATTCTTATCATTACTACCCCTCAGGATCTGAGCTCATTCCAAAGATTACTGGGTGATGGTGGTGAATTTGGAATTAATCTGCAATTTGCAATACAGCCCAACCCAGATGGATTGGCTCAAGCGTTTATCATTGGCGAGAAATTTATTGACGGTGAAGAATGCGCATTGGTTTTAGGCGATAACATATTCTTTGGGCAGGGGTTTGCACCTGTTCTTGAAAATATTGCTGCTAAAAAGTCAGGAGCAACCGTATTTGGTTATCAGGTTAAAGATCCTGGTCGTTTTGGTGTTGTAGATTTCGATAAGAATTTTAAAGCCCTTTCAATTGAAGAAAAACCCGAAAAGCCGAAATCTAATTGGGCAGTAACAGGACTATATTTTTATGATAAAAATGTAGTTGAGATGGCAAAAAAAGTAAAACCCTCTCACCGTGGTGAATTAGAGATTACCGAGCTAAACGAAATGTACCTTAAAGAGGGGATGCTTGAGGTTGAGTTACTTGGGCGCGGGTTTGCATGGCTTGATACCGGGACTCATGATAGCCTCATCGAGGCATCACAATTTATACACACCATTGAAAAAAGGCAGGGTCTAAAAGTAGCCTGCCTTGAAGAAATCGCTTTCAGAAAAGGTTGGATAACCAAAGCGCAACTAGCTGAATTGGCAAAGTCACTTGAGAAAACAGATTACGGAAAATATCTTCAAAGTGTCGTTTCTAATTAGATATCGGGTAGTTTCAACTCACCCTTTGGGTGCCTTTTTAATTGAGAGTGGAATGCAGTAGGATGCTGCATTCAATATACATTTATTCACGAGTAAGTGAATTGAAGGAGTTGCCATGCCAACGGTTTTGATATTAGGAGGTTCAGGATTCATAGGGACCAATTTGATTGAATTTTACTGCAATAAAAACTATAAAGTTGTTACTTTTGGACGTTCAATGCCGATAATTGAGCATCCTAACATTGAAAAAATTATCGGCGATATCAGAAACCTGGCAGATTTAGAGCTCGTATTTAAAAATCATAAAATAGACCTGGTTTTTCACTCCTTGACGAGTATATCAGCAACGGATTCATTTGCTAGCTGTCAGAATTTAGTATCAGTAAACCTGTCTTGCCTAATTGATATTATTTCTTTGATGAAAAAGTACAGCGTATATAACATGGTGTATTTTTCATCTGGTGGGTCTATTTATGGCATAGCGGATACGCCGATTAATGAGGAGCACGAATTGTCTCCGGTTAGTTTCTATGGATGGATAAAAGAAGTTTCTGAACGCTATCTGGCATATGAAAATAGAATTAATTCTAAATTCAACTATTTAATATTACGTCCTGCCAATGTATATGGTCAATATCAAAAGTTAAACAGGATAATCGGTGTAGCCCTAAAAAATGCAATAAAAAAAGAAGACATGCATATATATGGTGATGTGAATATCCGAAAAGATTATATACACATAGATGATGTCTGTGAAATGACTTACGCATTAGTTAATAGTGCCAACTCGTGGAATGATATCTACAATATTGGTTCAGGGATGGGGACAAGTCTGAAAGAAATATTGCATTACGCTGAGGTTATTAGCGGCAATAAAATGAATTTAGTCATGCACAATAAAAAGGTCGGTGATATTAGCTACAGTATTCTCGATAACTCAAAAGTACTGACTAAAATCGGTAAAAGAAGCTTCATACCTGTTTACGAAGGGATGAGAAGCATGCATATGTACGTACATCAGCAGCTCAGAGCTAATTCTGTTACCAGCTGATCAATTTCAACTCCAGTTGATGATGAATCTGTATAACGTTTCCCGTGAAGTAACTAAATGATGAGAAATTTATGAAGCTTAGTATAGGATACCTTTATGATCTGGTAACAGTCATAACTCAAAAAGAACTCAAAGTCAGGTACAAGAGCAGCTTCTTTGGGTATCTTTGGTCTATAGCTAACCCCTTATTATTTGCCATGATATATTTCTTTATTTTCAAGCTTATAATGAGAGTTCAAATACCTAACTATACCGTTTTTATTATCACTGGATTATTTCCATGGCAATGGTTTGCCAGTTCAACCGGGAATGCGCTCTTTTCATTTTTGTCAAATGCCCAAATCATTAAAAAAACAGTATTTCCACGTTCAGTGATACCGCTCAGCAACGTATTAATGGAGTGTTTACACTTTCTTTTTACGATACCAGTCATTCTTGTTTTTCTATATATCTATGACATGAGCCCATCTCTCGACTGGCTTTGGGGCGTCCCCTTAATTGGTATGGCGCAAATAATATTAATGCTTGGCGTAGCCCTAATGCTTTCGACACTAAATCTGTTTTTTCGTGATTTAGAGAGGTTTATAAGTCTTGGCATAATGTTGCTATTCTATTGCACGCCTATTTTATATTCTGCCGAGATGATCCCACAAGAGTATAGTTGGTTGGTGGAATATAATCCCTTCGCATCAATGATAACGAGCTGGCGTGAATTATTTATGCATAACACAATAAACTACCCTTTAGTGGCCGAACTCTATGCTTATGCTGCGATTTCTTTACTTATCGGTAGCTCAATCTTCAACAAGCTTAAACATAGATTTGCAGAGATTTTATAATGAGTGTTGTGATCGAATTTGCCAATGTAACTAAAACATACCCGCTCTATCATCATATTGGATCTGGCATAAAAGAGCTGCTATTTAATCCACGAAGAGCACTGAGTTTGTTAAGTGGGCGCAGCTATCTTGCTATAGAAGATATAAATTTCAAAATTGAGAAGGGTGAATCTGTCGCTTTAATCGGCCGAAACGGTGCGGGTAAAAGTACAACTCTCGGTTTGGTTGCTGGTGTTATGAAGCCAACAACAGGCAAGGTCAACGTTGTTGGGCGTGTGGCATCGATGTTGGAACTCGGTGGTGGCTTTCACCCAGAGTTGACTGGAAGAGAAAATATCCGTCTCAATGCTACACTTTTGGGGTTAAGACGCAAAGAGCTTAAATCTCGTATAAATGAGATTATAGAATTCTCGGAATTAGGCGAGTTCATTGATGAACCAATACGCGTATATTCCAGTGGAATGTTGGCTAAACTAGGTTTCTCTGTTATCTCACAGGTAGATCCAGATATTTTAATTATTGACGAGGTTTTGGCAGTCGGAGATATTGCGTTCCAGAAAAAATGTATCAACACTATAAATGCTTTTAAAAGCAAAGGCGTCACAATTCTTTTTGTCAGTCATAATTTATCAGATGTAGAAAAAATTTGTGATCGTGTCATCTGGATAGAAAATCATCGTCTAAAAATGACTGGCAATTCAAGCGAAGTTATCAATGCTTATAAAATCGCAATGGCATAA
Protein sequences of DBSCAN-SWA_3 >NC_010694|1508344:1515280|1513772_1514540_+|WP_012441078.1|DBSCAN-SWA MKLSIGYLYDLVTVITQKELKVRYKSSFFGYLWSIANPLLFAMIYFFIFKLIMRVQIPNYTVFIITGLFPWQWFASSTGNALFSFLSNAQIIKKTVFPRSVIPLSNVLMECLHFLFTIPVILVFLYIYDMSPSLDWLWGVPLIGMAQIILMLGVALMLSTLNLFFRDLERFISLGIMLLFYCTPILYSAEMIPQEYSWLVEYNPFASMITSWRELFMHNTINYPLVAELYAYAAISLLIGSSIFNKLKHRFAEIL >NC_010694|1508344:1515280|1512765_1513701_+|WP_012441077.1|DBSCAN-SWA MPTVLILGGSGFIGTNLIEFYCNKNYKVVTFGRSMPIIEHPNIEKIIGDIRNLADLELVFKNHKIDLVFHSLTSISATDSFASCQNLVSVNLSCLIDIISLMKKYSVYNMVYFSSGGSIYGIADTPINEEHELSPVSFYGWIKEVSERYLAYENRINSKFNYLILRPANVYGQYQKLNRIIGVALKNAIKKEDMHIYGDVNIRKDYIHIDDVCEMTYALVNSANSWNDIYNIGSGMGTSLKEILHYAEVISGNKMNLVMHNKKVGDISYSILDNSKVLTKIGKRSFIPVYEGMRSMHMYVHQQLRANSVTS >NC_010694|1508344:1515280|1510708_1511782_+|WP_012441075.1|DBSCAN-SWA MKILVTGGAGFIGSAVIRHIIKNTNDTVLNVDKLTYAGNLESLQDISDSPRYHFSKTDICDPESLNRIFNDFEPDVVMHLAAESHVDRSIDGPAAFIETNIIGTYVLLEAARIYWFGLPFERKEAFRFHHISTDEVFGDLHGTDDLFTEQTAYAPSSPYSASKASSDHLVRAWLRTYGLPVIVTNCSNNYGPYHFPEKLIPLTILNALAGKPLPVYGNGKQVRDWLYVEDHARALYRVATAGRVGETYNIGGHNERQNIDVVNTICKILNRLVVDKPYGVDDYSALITFVRDRPGHDVRYAIDATKIGKELGWLPEETFETGVEKTVRWYLENTEWWQRVQDGSYAGERLGLEAKDF >NC_010694|1508344:1515280|1509290_1510304_+|WP_012441074.1|DBSCAN-SWA MSILVTGGAGYIGSHTVLALLERGDEVVVLDNLSNASRESIARVEKLAGKTAAFYEGDILDRACLRNIFKAHDISAVIHFAGLKAVGESSRKPLEYYQNNVSGTLVLLEEMRNAGVKQFIFSSSATVYGADAPVPYVETTPIGGTTSPYGTSKLMVEQILRDYAKATPEFKTIALRYFNPVGAHESGQIGEDPNGIPNNLLPYIAQVAIGRLEKLGIFGDDYPTKDGTGVRDYIHVMDLAEGHLKALDHLSAIEGYKAYNLGAGEGYSVMEMVKAFEKASGRKVAYQISPRRDGDLAAFWADASLADKELNWRVRRGIDEMMRDTWNWQSQNPHGYS >NC_010694|1508344:1515280|1511783_1512650_+|WP_012441076.1|DBSCAN-SWA MKGIILAGGSGTRLHPITRGLSKQLLPVYDKPMIYYPLSVLMLAGIKDILIITTPQDLSSFQRLLGDGGEFGINLQFAIQPNPDGLAQAFIIGEKFIDGEECALVLGDNIFFGQGFAPVLENIAAKKSGATVFGYQVKDPGRFGVVDFDKNFKALSIEEKPEKPKSNWAVTGLYFYDKNVVEMAKKVKPSHRGELEITELNEMYLKEGMLEVELLGRGFAWLDTGTHDSLIEASQFIHTIEKRQGLKVACLEEIAFRKGWITKAQLAELAKSLEKTDYGKYLQSVVSN >NC_010694|1508344:1515280|1508344_1509241_+|WP_012441073.1|DBSCAN-SWA MTKLKAVIPVAGLGMHMLPATKAIPKEMLPVVDKPMIQYIIDECVAAGIKEIVLVTHASKNAVENHFDTSYELEALLEARVKRSLLSEVKSICPPGVTIMNVRQPQPQGLANALLCARPMLHDEAFVVVLPDVLLDNASADPLRYNLAAMVARFEETGRSQVLAHHMPDADLSEYSVITTEEALDFPGKVSPILDFVEKPDSPQLLNSNLAAVGRYVLSADIWPELEKLEPGAWGRYQLTDAIANLNKKTPVDAHLLSGDSFDCGRKLGYMKAFVTWGLRNHTQGRGFREEIQKILAK >NC_010694|1508344:1515280|1514539_1515280_+|WP_012441079.1|DBSCAN-SWA MSVVIEFANVTKTYPLYHHIGSGIKELLFNPRRALSLLSGRSYLAIEDINFKIEKGESVALIGRNGAGKSTTLGLVAGVMKPTTGKVNVVGRVASMLELGGGFHPELTGRENIRLNATLLGLRRKELKSRINEIIEFSELGEFIDEPIRVYSSGMLAKLGFSVISQVDPDILIIDEVLAVGDIAFQKKCINTINAFKSKGVTILFVSHNLSDVEKICDRVIWIENHRLKMTGNSSEVINAYKIAMA |
7 | Tupanvirus(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2079963 : 2091228
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_010694|2079963:2091228|DBSCAN-SWA GTTAGCTTGAGCAGCTGACTTCCAGCTTTTTGCCCCAGTCGGGCGGGCGTTGCGCCAGGTCCGCATATTCTGGCGCATCGTCAAACGGCCGCGAGAGGGCCTGATGCAGTCGTGCTAACACACTAATATCATCCTGTTCAGCCCGTTCTATTGCCTGCTGCGCCAGATAGTTACGCAACACCAGCGCCGGATTTGCCAGCTTCATCGTCTGCTGACGTTCCTCATCGCTGCGCTCCTCCTGCAGCACCCGTTGGCGCCAGACGTTATACCAGCTGTCGAAGGCGTCACGGTCGATAAACTCATCGCGCATTGGCGAGCGTAACTGGAGCTGCTCACTCTGGCTAAGCTGACGGAACGTACGCGTGTAGTCACTGCCCTCTTTGGTCATTAGCGACAGCAGCCCGGTGAGAATATTATTGTCGTCCTTCGCGGGGGTCAGCAGCCCCAGTTTTGCGCGCATCTTTTCCCCCCAGCAGCGCATAAGCTCCGGCTCATAGCCTGCCAGGGCCTGTTTAAGCTGCTGTGGGCTCATCAGGCCTGAAAGCGCATGAGCCAGACGATTCAAATTCCACAGGCCGATCATGGGTTGATTTTCGAAACTGTAGCGGCCCTGGTGGTCAGAATGATTGCAGATCAAATCCGGCCGGTAGTCGTCGAGAAAACCGTAGGGGCCATAATCAAACGTCAGGCCGAGGATCGACATATTGTCCGTGTTCATGACGCCGTGCGCAAAACCGACGCTTTGCCAGCCGGCAATCATCCGCGCGGTACGCTGAACCACATCGCTAAACCACAGTAAATACCTGTCTTTCTCCTGCACTAATTCAGGCCAGTGGTGCCGGATCACATAGTCAGCCAGTTGCGTCACTTTCTCCGTTTGGCCCTGATAATAGTAATGCTCAAAGTGACCAAAGCGAACGTGGCTTTCCGCCACGCGCAGCAGCATTGCGCCAGCCTCTGTGGTTTCACGATAGACCGGTTCATCACTGGTCACCACGGTCAGCGCCCTTGAGGTTGGGATGCCGAGATGATACATCGCCTCACCGGCAAGAAATTCGCGCAGGGTCGAACGCAACACCGCCCGCCCATCCCCCATGCGTGAATACGGCGTAAGGCCTGCCCCTTTCAGATGCCAGTCAAATTTTCGCCCGTCGGGCAGCTGCTGCTCCCCGAGCAGCAGCCCACGCCCGTCGCCCAACTGCCCTGCCCAGACGCCGAACTGATGCCCGCTGTAGACCTGCGCCAGTGGCTGCATACCTTCGGCTAAACGCTCGCCGCTCCACAGCCCCACGTTTTGTGCATCAAACAGGCGCTCATCCAGCCCCAGGTCCTGCGCCAGTTCGGCGCTGTAATACAGCAGGCGCGCATTTTTAAGCGGCATGGGTCGCAGTGTGGTGTGAAAGCCATTCAGCTCATTCAACCAGGTATTATTAAATTGCATGACGCCTCTCCCAGCGCTATCGGGAACCGCCACGGCGGCGATCTATTCTGCTCAGTGTAAAGCGCCAGACGGCGGCAAAACACCGACTAACATAAGGGGTATCATATAGCGGGTTCTAACAGGTGATAAGGCGCTTCCAGCAGGCGCGTCAACGTTTCTTCAGGTTCGTTGATAAATATTGCCCCCTGAATACCGTCGAAAGCAAACGGACGAACGGTTTGCAGGTCGGCGAAGTTATCTATTCCCTGCACCACGACTTCCTGACAATGAGGTTTAATGTGCTCAAGAATAGCGCCAATAAATGGCTGAAAGGACAAGCGCCTGATGCCACGATGAATAAAACTCTTATCCAGTTTAATACGATAAAAGAGATCGTCGAAAATGGCTTTAGACGTTGCCTGACCCGCTCCGTAATTATTCAGGCTTAAACTAAAACCTTCACGCAGGCCCAGCAGATAATGATTAGCATCTCCGGCCTTCAAACCAGGAAAATTCTCACTAATTCCCAGTTCAACACAGTTGAGCAGGCGCATCTTCTTAAGTAATAAATCGCTTTCCAGGATGGTTTTGGCCAGCGCGATGTCAATATTAATGGCAACGTTGACGTTGTGTTGAATAAAGAAGTCATGATGTCTGGCGATGATATTAATTTTATCCTGCATCAAACGTAGGCGCTGCTGGTCATTAAGCTGAGGAGTTAAAATTTCCTGCGGGATCGACACGTTGGCCGCCGCATGAGTAAAGTGAGTTAACATTTCGACAGACATTAGTTGCCCGTCTGGTGCATAAATCGGACAGAAACCGTTATGAGTTTTATAATCCGCCTGCAGATGGATTTTCATTTCATTCGCCATTTTAAGCGGTGATCTCCGCAACCTGTATTCAACAACAGGACAAGCGCTTCGTCATCCCGTCGTCATTCAACTATTTTTGCTCATACTTACATTCCGGATCGCTAAACTATCACAAAAATACCACAAACAGTCCTTACCCGTGTCATCTTAATGAAAAGGGAGAATCTGTTATAGGCATTTAGTTCATCACTTCTTAACTAAAAAGGAGACTGATGCTTCATAAAATAACACTTAAGGGGAATTAACCTCATTTCGCAATGACCGTCCCCGCTTGCGCTAACATGGTTGATATAAATCAAATGGCGCTAAAATCGGCATTCAGACAGGTAGATAAAAGATTAAATTCGTCGTGCCTGCCAGAAAACTTTACGCCAGTAAACATTATCCAATGATGAGCGTGTGACTCCGCGACTGGTTGAGGCATGAATGAAATAACTATCCGTGTCATAAATGCCAACATGCAGTCCATTTTCCCCGCTGCCGGTTTTAAAGAAAACCAGATCGCCCGGCAACAGCTGTGACTTATCAATACGTGTCCCGATCCCGGTCTGCGCTGACGTGGTGCGCGGTAATTGCAAATTGAAACGCTCGCGGAAAGTCAGATAAACAAATCCAGAACAATCAACGCCGCCACGATCCATACCACCATAGTGGTATGGCGCACCACGCCATTGCCCAAGCTGGTCGTTTAACTGCGCAATAACGGTAATAGAATCTGCAAGCCTTGAATTCGGTGCGGGTGCACGGCTGCTACAGCCGGCCAGAACCATGACGATCAACAGTAGGAAAAAACGCATTATGCACGGTCCTTTATTGCTCATGGTTCTAATTTAGCGGCCAACCCGTAGGCTGGCAACTTTTGCCGTTGATCAAACCAGGCATGACCTGCCCATACTGTCGATAGTTGGGCTGATTTCGGCCAGGATGTGACCGACGCCCTTCAGAACGCGGATAAGGCCTGAACGCCCCTGCTGTAGCCCGTGCTAAAATGGGCGGCGAGGGATAGCGCCACTCGTCATATCAACCTGGGTTAGTGGCCCAGCGCCTTGCCGATACACTCCAACAGACGCGGGTCATTAGGCTCCATGTCGGGAGAAAAGCGCCCGACAACCTGACCATCACGATCGATGATAAACTTCTCAAAATTCCATAAAACGTCACCATACTCTTTCGGAGCGCGGCCTTTGCTGGCCATTTTTGCAGCAAACCCACTATTTTCAGGGGCAACAGTGTCCGGACGTGCCGCAATAAGCTTTGCATATAGCGAATGCCGCTGCGGGCCGTTCACCTCAATTTTACTGAACATCGGGAAGCTCACACCGTAGGTAGTGCTGCAAAACGTTTTGATCTCGGCTTCGCTACCCGGCTCCTGCTCGAGAAAAGCATTGCAGGGAAATCCCAGGATTGTCAGCCCCAGTGAATGGAGCTTTTTCTGTAAATTTTCCAGCTGTTCATATTGCGGCGTCAGGCCACATTTTGATGCCACATTGACCACCAGCAGCACTTCTCCCTGCCACTGGGCCAGCGTTGTTTTTTCGCCGTCCAGTGTCACCAGCTCGGTTTCAAATAAGCTCATCTTTTTTCCTGTTAGCTGTGGAGATAACAACCGCTTTCCTCATTTGAGGCCCCCTGACCGCCAGTTTTAAAACATCTTTACCGCTTCAGGCCCTATATCGAACGCAATGAAAAAGGCCGCTTGCGCGGCCTTTTGAACAACTGTCTTTAATCTTCTTTCGGCGTGGCGTTCTCAACCCGGCTTTTTAATTTCTGACCAGGTCTGAATGTCACAACCCGTCGCGCCGTAATAGGAATATCTTCACCCGTTTTAGGATTTCTTCCCGGACGTTGATTTTTGTCGCGAAGGTCGAAGTTGCCAAATCCTGACAGTTTGACCTGTTCACCGTTTTCCAAAGCACGACGCACCTCTTCAAAAAACAGCTCTACCAGCTCTTTGGCATCACGTTTGCTAAGCCCGAGCTTTTCAAACAGGTATTCAGACATTTCAGCTTTTGTAAGCGCCATAGGTTCAATCCCTCAAGGATGCTTGGAATCGCTCTTTCAGTGCCTCTACACATTTTGCAACGGTAGCGGCAATCTCCTCTTCTTCGAGTGTCCGGCTGGTATCTTGTAAAATCAGGCTGATAGCGAGGCTCTTAAAACCTTCATTTACGCCCTTACCACGGTACACGTCAAACAAGTTTACGCCAACTATCTGATTTGCGCCAACTTTCTTACACTCTGTGATGATATCTGCCGCGGGTACGTTTTCAGCAACCACGACAGCGATATCACGACGGTTTGCCGGGAAGCGAGAAACCGGGTTGGCATCAGGCAGAATGCGGTCTGCAACCTTGTTCCAGAGCAGTTCGAAAACCACCGTGCGCCCGTTAAGATCCAGCTTACGTTCCAGTTCAGGATGAACGACTCCGATAAATCCAATATGTTCATCATGCAAATACACGCCAGCGCTCTGACCAGGATGAAGCGCCGGGTTGGCCGTTGCCACAAACCGGATGTCGTCAAGCTTACCGGTTAGCTCAAGGACAGATTCTAAATCACCTTTTAAGTCATAGAAATCTACCGCATCACGTGCCAGCTCCCAGTGTTCCTCATTGCGGTGACCGGCAATGACGCCTGACAGCATCACTTCCTGGCGGATACCGAGGTTCGCCTGCGTATCCGGAACAAAACGCAGACCGCTTTCGAACAGGCGCACGCGTGATTGCTGACGATTCTGATTATAAGCCACCGCGCCCAGCAGGCCGCTCCACAGCGATAGCCTCATCGCCGACATCTCTACCGAGATCGGACTTGGCAATATCAGGTTCTCTTCACCGGGATGCAGCAGCGCCTGAATTTTAGGATCAACAAAGCTATAGGTAATGGCTTCCTGGTAACCTTTGTCCACCAGTAACGCTTTAACACGTTTCAGCGACAGGTCAGCCTCGCGGTGTTTGGTCATAACCAGACCGGCCTGCACCGGTACGTCAGGAATATTGTCGTAGCCATAAATGCGTGCGACCTCTTCCACCAAATCTTCTTCAATAGAGATATCGAAACGCCAGCTCGGGGCTACCGCGTGCCATTTCCCCTCTTCTTCTGTGACTTCACAGCCCAGACGGCACAAAATATCGCTCACATCGGCATCAGCAATCACATGACCGATAAGACGATCCAGCTTTTCACGACGCAGAGTAATGGTGCTGCGCTGAGGCAACGCGGCTTGATCCGTAACATCTATCACCGGGCCGGCTGCGCCACCACAGATATCTAGCAACAGACGAGTCGCGCGTTCGATGGCTTTGTATTGCAGTGCCGAATCGACTCCACGTTCATAGCGATGAGAGGCATCGGTATGCAGTCCATGACGGCGCGCACGCCCGGTAATGGACAGCGGGCTGAAGAAAGCACACTCTAACAGTACGTTCTGGGTTTGCGGGTTTACGCCAGAATCTTCGCCACCAAAAATACCGGCCATTGCCAGCGCTTTCTTGCTATCCGCTATCACCAGCGTGTCGCTGCTCATTCTGGCTTCGTTACCATCAAGCAGCTTCAGCGTTTCCCCCTCTTCAGCCTGACGTACCACGATACCACCATCAATGCGGTCGAGGTCAAAAGCGTGCATCGGCTGACCCAGTTCAAGTAAAACGTAGTTTGTTACGTCAACGACCGGATCGATCGAGCGTATTCCGCAGCGGCGCAATTTTTCCTGCATCCATAACGGCGTGTCAGCGTTCACGTTGACACCTTTTACCACCCGGCCCAAATAACGTGGGCAGGCTTGAGGTGCATCAACACGGATCGGGAAAGTGTCAGGCAACGTTGCGGTAACCGGCTGAATATCCACCTCGTTCAACGGCAGCCTGTTCAGTACCGCGACATCACGAGCAATGCCGATAATGCCCAGGCAGTCCGCCCGGTTTGGCGTGACGCTGATTTCAATCGTGTTGTCATCCAGCTGAAGATACTCACGAATGTCGGTTCCTACCGAAGCATCCTGCGGCAGCTCAATGATACCGTCCTGTTCGACGGAGATACCCAGCTCGGAAAACGAGCACAGCATGCCTTCAGATGGCTCACCGCGTAGTTTAGCGGCTTTAATTTTGAAGTCACCGGGCAGAAGCGCGCCAACGGTTGCCACCGCGACTTTCAGCCCCTGACGGCAGTTGGGCGCACCACAGACGATGTCCAGCAGGCGATCGCCGCCCACATTGATTTTAGTCACACGCAGCTTGTCAGCATTAGGGTGCTGGCCGCATTCGACCACTTCACCCACGACCACGCCGTGAAATGCGCCGGCCACGGCATCTACGCCGTCCACTTCAAGGCCGGCCATGGTGATTTGATCGGAAAGAGCGGTGCTATCAATGGCCGGGTTGACCCATTCACGTAACCAGAGTTCATTGAATTTCATTGGTTTTACCCGCCCTTATTTAAACTGTTTGAGGAAACGTAAATCGTTTTCGAAGAAGGCACGCAGATCGGTGACGCCATAGCGCAGCATGGTCAGACGCTCCATACCCATACCGAACGCAAAGCCAGAATAGACTTCCGGGTCAATGCCCGCGTTACTAAGTACGTTTGGATGAACCATCCCGCAGCCGAGCACTTCAAGCCATTTACCGTTTTTACCCATCACGTCCACTTCGGCAGAGGGTTCGGTAAACGGGAAATAGGAAGGGCGAAAGCGAACCTGCATATCATCTTCAAAGAAGTTGTTCAGGAAATCGTGCAGAGTGCCCTTGAGATTGGTAAAGCTGATGTTCTTATCCACAATCAGCCCTTCCATCTGATGGAACATTGGCGTGTGGGTTTGATCGTAATCATTACGGTAAACGCGGCCGGGTGCGATGACTCGAATCGGCGGCTGCTGATTTTTCATTGTGCGGATCTGTACGCCAGAAGTCTGGGTACGCAGCAGGCGAGTGGCATCAAACCAGAAGGTATCATGATCGGCGCGAGCCGGGTGATGCGCCGGAATATTGAGCGCATCGAAATTATGATAGTCATCTTCAATTTCCGGGCCGGTTTCCACCGAAAATCCAAGCTCACCGAAGAAGGTTTCGATACGATCGATGGTGCGGGTAACCGGATGCAGGCCACCATTTTCAATACGTCGTCCCGGCAGCGAAACGTCGATGGTTTCCTGCGCCAGACGCGCGTTCATTACCGCAGACTCCAGCGCATTTTTCTGCGCATTGAGCGCATCCTGCACCTGCTGTTTAGCCTCGTTGATCACCGCACCTGCCGCAGGACGTTCTTCCGCGGGCAGTTCACGTAGTGTGGTCATCTGGAGCGTAAGATGCCCTTTTTTACCCAGATATTCGACGCGAACGTTATCTAACGCGGCGACATCCTGAGCATCGTTGATGGCGGCTAGCGCGCGCTCCACCAGGTCTGCGAGATGTGACATGCTTTCCTCTTCTTCCAGCCATACGGCCGATCTGTTTTATGTCAGGACGTATAGTCGCCCGGTAATCACTGTGTCTCATTGGCGCTAAAACGAAAAAAGCCTCCACGAGGGAGGCTTTCAGCGCGATTTTTCGTTTCTTTTCTATGCGCAAAAGCCCCCGAAATCAGGTGCTAAAGTAAAAAAAGAAACGAAAAAGAGCAGCGTTCATGCTTACATTACCTGGTTGCAGTCTTATTATCGGCACGGCCTATTGGAAAGCCAATGGATAAAAATGTCAACCTTTTGCTGAAGTTACATAAATAAAAGCGAAGGAGATAAAAAAGAGGGAGCCAAGCCCCCTCTTCAATCTGACTTACGCCAGTGCTGTTTTCGCTTTTTCTACCAGAGCAGCAAATACTGCTTTGTCGAATACCGCGATGTCAGCCAGAATCTTACGGTCGATTTCGATCGAAGCCTTTTTCAGACCATTGATGAAACGGCTGTAAGAGATACCGTTAGTGCGGGCGGCGGCGTTGATACGCGCAATCCACAGCTGGCGGAACTGACGCTTACGTTGACGACGGTCACGATAAGCATACTGACCAGCTTTGATAACAGCCTGGAAGGCAACGCGGTAAACGCGTGAACGCGCACCGTAGTAACCTTTAGCTTGTTTTAAGATTTTTTTGTGACGTGCGCGAGCAACTACACCACGTTTTACACGAGCCATTTAGCTCTCCTGTTTAATATTCTGTTTGCCCTTCGCCATTCATAACGAACGTTGTCGGGACTAAAAAAGGGTTACTTATGCGTACGGCAGGCAGGCAATGACCAGACCCAGATCGCCTTTAGACACCATACCTTTAGGACGCAGGTGGCGTTTACGCTTAGTAGATTTTTTAGTCAGAATATGACGCAGGTTAGCGTGCTTACGCTTGAAGCCGCCAGAAGCGGTCTTCTTGAAGCGCTTGGCCGCGCCACGTACAGTTTTAATCTTTGGCATTTAAAAAATATCCACTTCGCATTGTTAATAAAATGAACCAGACAGGCGCACTGCGCCGCGCACCGAAGTACGCGGAGCGATGACTTTAGGGCCTACTGCCTCTTCTTGGGAGCGAGCACCATAATCATCTGACGGCCTTCAATCCTCGTAGGGAAGGATTCGACAACTGCCAGATCGATATCTTCACACAAATCTTTACGGACGCGGTTAAGCACTTCCATACCGATCTGCTGGTGCGCCATCTCACGACCACGGAAACGCAGCGTGATTTTGGCTTTATCGCCATCTTCCAGAAAGCGAATCAGGTTGCGTAGTTTTACCTGATAGTCGCCATCATCGGTACCAGGACGGAATTTGATTTCCTTAACCTGGATGACTTTTTGCTTTTTCTTCTGTTCCTTAGAAGATTTGCTTTTTTCATAAAGGAACTTGCCGTAATCCATAATACGGCAGACGGGCGGTTCGGCGTTAGGGCTGATCTCGACAAGATCAACACCTGCTTCTTCAGCTTTTTCCAAAGCTTCATTCAGACTGACAATACCAATCTGTTCGCCATCGACGCCAGTCAGACGAACCTCAGTAGCGCGAATTTCTCTATTTATGCGATTAGGACGCGCCGGTTGAACTCGTTTTCCGCCTTTAATACTTTATTCCTCCAATTGATGAAGATTGCGACTGCGGATTTCATCCTGCAGCTTCGCGATCACTTCATTGACGTCAATGCTACCCAGGTCTTTACCGCGGCGGGTGCGAACGGCTACTTTGCCAGCTTCCACCTCTTTATCACCACAGACCAACATATAAGGCACACGACGCAATGTATGTTCACGGATTTTAAAGCCAATCTTCTCGTTTCTCAAGTCCGCTTTTGCGCGAATGCCCGCATTTTGCAATTTTCGGGTCAATTCTGCAACATATTCGGACTGACCATCGGTGATATTCATCACCACAACTTGTACCGGAGCCAGCCACGTTGGGAAGAAACCGGCATACTCTTCGGTTAATATTCCGATAAAGCGTTCCATGGACCCCAGAATGGCTCGGTGGATCATCACCGGCACCTGACGCTCATTACTTTCACCCACATAGGACGCACTCAGACGGCCCGGCAGCGAGAAGTCCAGCTGTACGGTACCACACTGCCAGGCACGATCGAGACAATCGTGTAGGGTAAATTCAATTTTCGGGCCGTAAAACGCGCCCTCGCCCGGCTGATATTCGAATGGAATGCCATTCTCTTTTAGCGCAGCGGCAAGGTCGGCTTCTGAACGGTCCCATAGATCGTCACTGCCAATGCGCTTTTCAGGGCGCGTTGAGAGTTTCACCACGATCTTCTCAAAGCCAAAGGTGCTGTACATGTCGTACACCATCTTAATGCAGCTGTTAACCTCGTCGCGCACCTGCTCTTCAGTACAGAAGATATGCGCATCATCCTGAGTAAAGCCGCGCACGCGCATCAGACCATGTAGCGCGCCTGACGGCTCGTTACGATGACAGCTACCAAACTCTGCCATACGTAACGGCAGGTCACGGTAAGATTTTAGACCCTGATTGAAAATCTGCACGTGCCCAGGGCAGTTCATCGGCTTAATACAGTATTCACGGTTCTCTGAAGAGGTCGTAAACATCGCCTCTTTGTAGTTTTCCCAGTGCCCGGTTTTTTCCCACAGCACGCGGTCCATCATAAATGGCCCTTTCACTTCCTGGTACTCATACTCTTTCAGCTTGGTGCGAACAAACACTTCCAGTTCACGGAAAATAGTCCAGCCGTCATTGTGCCAGAACACCATGCCTGGAGCCTCTTCCTGCATATGATACAGGTCAAGCTGCTTGCCAATTTTGCGGTGATCGCGTTTCGCCGCCTCTTCTAAACGCAGCAGATAGGCAGCCAGCTGCTTTTTGTCGGCCCATGCGGTGCCGTAGATACGTTGCAACATCTTGTTATTGCTGTCGCCACGCCAGTATGCGCCAGAAATCTTTTGCAGTTTGAAGTGGTGACAGAAACGCATATTCGGTACGTGCGGACCACGGCACATATCGACATATTCCTGATGATGGTACAGGCCCGGACGATCGTCATGACTGATGTTTTCATCAAGAATGGTCGTTTTGTAGTTTTCACCACGCGCGGCAAAAGCATCGCGGGCTTCCTGCCAGCTAACCTTTTTCTTGATCACATCATAATTTGTCTCTGCCAGCTGATGCATCCGCTTTTCCAGCAGTTCGATGTCTTCCTGGGTCAGCGTGCGGTCGAGATCGACGTCATAGTAGAAGCCGTTGTCAATAACCGGACCGATGGCCATTTTGGTATCTGGCCACAGCTGCTTGATAGCGTGCCCCAGCAGGTGAGCACAGGAATGGCGGATAATCTCCAGCCCGGCTTCATCTTTAGCGGTAATGATTGCCACACTGGCATCGTCGACGATCGGATCGACCGCATCAACCAGTTCGCCATTGACGCGTCCGGCGATACAGGCTTTTGCCAGTCCAGGACCGATGTCCATAGCGATATCCATCACACTAACGGCATGGTCAAAAGAGCGCTGGCTTCCATCAGGAAGAGTAATTACAGGCAT
Protein sequences of DBSCAN-SWA_4 >NC_010694|2079963:2091228|2084283_2086671_-|WP_012441567.1|tRNA|DBSCAN-SWA MKFNELWLREWVNPAIDSTALSDQITMAGLEVDGVDAVAGAFHGVVVGEVVECGQHPNADKLRVTKINVGGDRLLDIVCGAPNCRQGLKVAVATVGALLPGDFKIKAAKLRGEPSEGMLCSFSELGISVEQDGIIELPQDASVGTDIREYLQLDDNTIEISVTPNRADCLGIIGIARDVAVLNRLPLNEVDIQPVTATLPDTFPIRVDAPQACPRYLGRVVKGVNVNADTPLWMQEKLRRCGIRSIDPVVDVTNYVLLELGQPMHAFDLDRIDGGIVVRQAEEGETLKLLDGNEARMSSDTLVIADSKKALAMAGIFGGEDSGVNPQTQNVLLECAFFSPLSITGRARRHGLHTDASHRYERGVDSALQYKAIERATRLLLDICGGAAGPVIDVTDQAALPQRSTITLRREKLDRLIGHVIADADVSDILCRLGCEVTEEEGKWHAVAPSWRFDISIEEDLVEEVARIYGYDNIPDVPVQAGLVMTKHREADLSLKRVKALLVDKGYQEAITYSFVDPKIQALLHPGEENLILPSPISVEMSAMRLSLWSGLLGAVAYNQNRQQSRVRLFESGLRFVPDTQANLGIRQEVMLSGVIAGHRNEEHWELARDAVDFYDLKGDLESVLELTGKLDDIRFVATANPALHPGQSAGVYLHDEHIGFIGVVHPELERKLDLNGRTVVFELLWNKVADRILPDANPVSRFPANRRDIAVVVAENVPAADIITECKKVGANQIVGVNLFDVYRGKGVNEGFKSLAISLILQDTSRTLEEEEIAATVAKCVEALKERFQASLRD >NC_010694|2079963:2091228|2088454_2088652_-|WP_004157374.1|DBSCAN-SWA MPKIKTVRGAAKRFKKTASGGFKRKHANLRHILTKKSTKRKRHLRPKGMVSKGDLGLVIACLPYA >NC_010694|2079963:2091228|2086686_2087670_-|WP_012441568.1|tRNA|DBSCAN-SWA MSHLADLVERALAAINDAQDVAALDNVRVEYLGKKGHLTLQMTTLRELPAEERPAAGAVINEAKQQVQDALNAQKNALESAVMNARLAQETIDVSLPGRRIENGGLHPVTRTIDRIETFFGELGFSVETGPEIEDDYHNFDALNIPAHHPARADHDTFWFDATRLLRTQTSGVQIRTMKNQQPPIRVIAPGRVYRNDYDQTHTPMFHQMEGLIVDKNISFTNLKGTLHDFLNNFFEDDMQVRFRPSYFPFTEPSAEVDVMGKNGKWLEVLGCGMVHPNVLSNAGIDPEVYSGFAFGMGMERLTMLRYGVTDLRAFFENDLRFLKQFK >NC_010694|2079963:2091228|2082595_2083054_-|WP_012441565.1|DBSCAN-SWA MRFFLLLIVMVLAGCSSRAPAPNSRLADSITVIAQLNDQLGQWRGAPYHYGGMDRGGVDCSGFVYLTFRERFNLQLPRTTSAQTGIGTRIDKSQLLPGDLVFFKTGSGENGLHVGIYDTDSYFIHASTSRGVTRSSLDNVYWRKVFWQARRI >NC_010694|2079963:2091228|2088022_2088379_-|WP_012441569.1|DBSCAN-SWA MARVKRGVVARARHKKILKQAKGYYGARSRVYRVAFQAVIKAGQYAYRDRRQRKRQFRQLWIARINAAARTNGISYSRFINGLKKASIEIDRKILADIAVFDKAVFAALVEKAKTALA >NC_010694|2079963:2091228|2089299_2091228_-|WP_012441571.1|tRNA|DBSCAN-SWA MPVITLPDGSQRSFDHAVSVMDIAMDIGPGLAKACIAGRVNGELVDAVDPIVDDASVAIITAKDEAGLEIIRHSCAHLLGHAIKQLWPDTKMAIGPVIDNGFYYDVDLDRTLTQEDIELLEKRMHQLAETNYDVIKKKVSWQEARDAFAARGENYKTTILDENISHDDRPGLYHHQEYVDMCRGPHVPNMRFCHHFKLQKISGAYWRGDSNNKMLQRIYGTAWADKKQLAAYLLRLEEAAKRDHRKIGKQLDLYHMQEEAPGMVFWHNDGWTIFRELEVFVRTKLKEYEYQEVKGPFMMDRVLWEKTGHWENYKEAMFTTSSENREYCIKPMNCPGHVQIFNQGLKSYRDLPLRMAEFGSCHRNEPSGALHGLMRVRGFTQDDAHIFCTEEQVRDEVNSCIKMVYDMYSTFGFEKIVVKLSTRPEKRIGSDDLWDRSEADLAAALKENGIPFEYQPGEGAFYGPKIEFTLHDCLDRAWQCGTVQLDFSLPGRLSASYVGESNERQVPVMIHRAILGSMERFIGILTEEYAGFFPTWLAPVQVVVMNITDGQSEYVAELTRKLQNAGIRAKADLRNEKIGFKIREHTLRRVPYMLVCGDKEVEAGKVAVRTRRGKDLGSIDVNEVIAKLQDEIRSRNLHQLEE >NC_010694|2079963:2091228|2083979_2084279_-|WP_004157378.1|DBSCAN-SWA MALTKAEMSEYLFEKLGLSKRDAKELVELFFEEVRRALENGEQVKLSGFGNFDLRDKNQRPGRNPKTGEDIPITARRVVTFRPGQKLKSRVENATPKED >NC_010694|2079963:2091228|2088744_2089296_-|WP_071819175.1|DBSCAN-SWA MKGGKRVQPARPNRINREIRATEVRLTGVDGEQIGIVSLNEALEKAEEAGVDLVEISPNAEPPVCRIMDYGKFLYEKSKSSKEQKKKQKVIQVKEIKFRPGTDDGDYQVKLRNLIRFLEDGDKAKITLRFRGREMAHQQIGMEVLNRVRKDLCEDIDLAVVESFPTRIEGRQMIMVLAPKKRQ >NC_010694|2079963:2091228|2079963_2081403_-|WP_012441563.1|DBSCAN-SWA MQFNNTWLNELNGFHTTLRPMPLKNARLLYYSAELAQDLGLDERLFDAQNVGLWSGERLAEGMQPLAQVYSGHQFGVWAGQLGDGRGLLLGEQQLPDGRKFDWHLKGAGLTPYSRMGDGRAVLRSTLREFLAGEAMYHLGIPTSRALTVVTSDEPVYRETTEAGAMLLRVAESHVRFGHFEHYYYQGQTEKVTQLADYVIRHHWPELVQEKDRYLLWFSDVVQRTARMIAGWQSVGFAHGVMNTDNMSILGLTFDYGPYGFLDDYRPDLICNHSDHQGRYSFENQPMIGLWNLNRLAHALSGLMSPQQLKQALAGYEPELMRCWGEKMRAKLGLLTPAKDDNNILTGLLSLMTKEGSDYTRTFRQLSQSEQLQLRSPMRDEFIDRDAFDSWYNVWRQRVLQEERSDEERQQTMKLANPALVLRNYLAQQAIERAEQDDISVLARLHQALSRPFDDAPEYADLAQRPPDWGKKLEVSCSS >NC_010694|2079963:2091228|2081504_2082245_-|WP_012441564.1|DBSCAN-SWA MKIHLQADYKTHNGFCPIYAPDGQLMSVEMLTHFTHAAANVSIPQEILTPQLNDQQRLRLMQDKINIIARHHDFFIQHNVNVAINIDIALAKTILESDLLLKKMRLLNCVELGISENFPGLKAGDANHYLLGLREGFSLSLNNYGAGQATSKAIFDDLFYRIKLDKSFIHRGIRRLSFQPFIGAILEHIKPHCQEVVVQGIDNFADLQTVRPFAFDGIQGAIFINEPEETLTRLLEAPYHLLEPAI >NC_010694|2079963:2091228|2087833_2087878_-|WP_152525475.1|DBSCAN-SWA MNAALFRFFFYFST >NC_010694|2079963:2091228|2083287_2083833_-|WP_012441566.1|DBSCAN-SWA MSLFETELVTLDGEKTTLAQWQGEVLLVVNVASKCGLTPQYEQLENLQKKLHSLGLTILGFPCNAFLEQEPGSEAEIKTFCSTTYGVSFPMFSKIEVNGPQRHSLYAKLIAARPDTVAPENSGFAAKMASKGRAPKEYGDVLWNFEKFIIDRDGQVVGRFSPDMEPNDPRLLECIGKALGH |
12 | Microcystis_phage(14.29%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2726972 : 2745594
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_010694|2726972:2745594|DBSCAN-SWA TTTAACTCGCGTTTTCACAACCAGACTGGCTACGGTCAATGATCTCGGTGCCTTCAATAATGAAGCCAAACTTTCCGAATAGGAAAGAGTGGTTGAATTGAGTCACTACAACATCGGAGAGTGCGACTGAACAGCGATTCTTCTCGATTGCTCTGTCTATAGCTGTTTTAACGTTGGGAATGCCAAGAGGGAAGAGCACTACTGGTGCAGAGTCCTCTGCTTTAACACGCGCCCCTTTAACGAAATTATTAGAGTTCAGATTATAGTTTTTGGTGCTTGCTACGGTCAAATCTGCCACACGTGAACTACATCCTGCTAGCAACATGACTACCGCGGCCAATGCCACTGACTTATTCATATTTTTTCCTTCGATTACAATCGGAAACATCCTATCAATAATTTTTAACATTGCAATTATTGTATTTCGTGCTGTTTGGCGGCTTTGATTTCGGTGGCGGTGAGCTTCGTTTGGGAGGTCTCGCTTTCATTGAACCTGCTAATGCCCCCAAGATGGGTGTAGCTTTCAATAGAAATTAGTAAACGTTACGATAATCCTGAGTCCAGTGAGCATGAGAAAATGAGTAGCGATTTGAGTGATTTACTACCCGAATCGTTATCTAAGTGCCTTTGAAACTCACCCAGATCATTTACTGAAAGATGAGGAAATGTTTTTGTTTTGGTACTTTCAGAACACCTACCAGATCTGATACTGGATTACGCTCCGCCAGAAATGACTGCATAACTGAAGATTTGACGACATGCTTGGCGAGTTTTCTTCAGCTTATCGAGAACACCACGATCTTCCAACTTAGGAAGAACATCGGGCATCTCTTTTAAATTTATTGCAGAAATAGGTCTTTTACCTATGTGTGGGCAAACATCCTTTTTAAGGTATTCCAGCATGTCATCTGCATAACCTTTTGACCGGGAGGTTTCTTATATTCATGGCAATCAGTTCAAAACTGTTACTGACAGCCATAATCTTCGCGGCCTTTATATCCTGTTTCTGCTGGCTTGGGTCACCACCTTCTGACAAGAGTTTCTTAGCTTCAGATCATTTAATCCTTGCTTCAGCGAAGGTAAACAATGGATAGGTGCCAAAGACTACCTGTTTTTCTCTACCTGCAATCCGACACTTAATCCACCAACCTATTGTTTCATCTGGAAAAATCTCAAGACATATCCCACCACCATCGGCAAGCTTGTATGATTTTTCCTTAGGCTTAGCAGAGTCAATTTGTCGAGCTGTCAACTTCATATGGGGGCATCCATTTCATTGAACCTAGCAATGCCCCCTATCCTCCCCCCCCCCAATCGGAACAAGAACGCAATAGGTGACCTTGGACCAGGCTGGACCACAGAGAAGTGTTATATTCTGATTTGAAAGGGAAAAGTAGACTAGAAAGGATGTTACTAGAAGAAAAATTGGTACCCTACAGGATTCAATAATAACAATTAAACCACTGATTTATATACAATAAAAAATTTACATATTTATCACATACCCCCATATGTACCCCCAAATGTTTTCGTGTCACGCTCTTTAGTACGTTTTTTGTTCTTATTATCATAGTTCCACGATAAGGCAGATCCCTACCCTGACCTAACCCTGATTCTACCCGCCTGTTTCTTATTGTATGATAGATGCCTCATACATTAAGGAATGCCGCCAGCCATGAGAATGACAGCGCGTAAAAAAGAGATATTGAGCTACTACCAGCCGGATAACCTCGAATGGGTGACGGGTGAGATTGGGCCGCCGCCGTTTGACGTGTCCGGGCTGGCCTATCTGGTTTGCGGGCTGGACTCCTTCGATACCCGGCACCATCTGGAATCTACCCGGCGCACGCTTGAAACGATGGTGAAAGATGGCCTGTTAGAGAAGGTCACCAGCTTTGAGCGCAGACAGGACACGACACAAAGCGGTGACGGTAAAGGTATCTGGTGTAACTGCTCCCGGTATGGCCTGCCCGGTTCCTGTCTGGTAGTACGCGATGAAGGCGGCAAGGTTGAGGCTATAGACGGGGAAGCGGTGCGGATTGATAACTAACCGCCTGTCTTACGTTGTCTTACGATGGTAAAATAGCCGTTTAACCCTTACCGCTGGTGGTTGTAGTGAATAAATCCCTGTCTGTTACCTTTCGTGTTCCTGCTGAACTGGCTAATGCCTTCACCTCTGCGGTCAGTGAGTCCGGTTCTGATAAAACCGCCTGGCTGATTGACGCCGTTCGCCACAAGCTGAACCAGCCGGACAGTAACGCCGATAGCCGTTTGCTGGCACTGGTGGAACGCATGGAAATAGCAGCGGCTGCGCTGGCAGTGGGTAAGCAGGGCATCCCGCCGCGCCCGTACAATGAGGCTGCGGCGATCCAAGTTGTTGCCGCCACTATCCGGGAAGGCTTCGACAACGGGCGTATCATTGCTGAACGGCTTAACGAGGCTGGATATCAGACTAAGGCGGGTAAGGCGTGGGATAAGGACATTTACAGCGCGTGGAAGCGTCAGGGCGGTAATTTATCCCGGATATCAGCGGCGCTAATTGCCTGAATACTGACCGCGCCCCGTACTTGTCGCAAAACTATAAAACCGGGGATCCCCGGTATTAACATGCGGGATGTTTCCGTATCACCTGCGCGGCGCTGGCTTCTTCCACTGGTAGGCCGGGGCGCTCATTCTCTGGTGATGTCGCTCACTCGCCGCCAGTGCGTTTCTGATACCGTTCCTCACGGTGTAAAGGGTGTACCTGTCCAGCGGTTCCCCGCTCTGACGTGCCATCTGTAATACGGCTAACTCAAGCGTCTGGCGGTCTGTCATGGGGTTACCCTTCGCTGTAGGTTATTTGCAGTAATCTGTTGATATGTAGTGCGCGATATAACCTTTCCACTCCCAAAAAAGGTTTCTAACCTGCGGCTGTGCGAAAAAAGGTAAGGCGGCGGTACTGACGGCGGGAAGCCCTGAACGTTTAACCTGCCCCCGCCGCGCCTGTCCTGCGCTCAACCGCCCCCATTATCTTATCCACTACCGGACTGAATGCCCTGTAGTACGCCTGTTCGCTCCGTTGCAGGTGTAGAACCTTCCGGCTGAACGTTTCCCACCTCATGCCAGCAGGCTTGGGAAAAAGGGCGCAGTCGTTAAACAGGCTGTACGCGGGCGCATAGTCGCCCCATATCGCCGCCCGCTGCTTAAAAAGGCTGCGCCGCATTCTGGCTAACCGGTCTTCGCTCTGGCTGGCGTAGTGAAGCCCCCAGCACTTTCTACAGGCTAAATCACGCTTACCGATAAACAGTACTGCCGCACGGTTATGGCAGTGAGGGCATAAGTACCAGTGCCGCACGCCATAGCCCGCCCGTGTGCTGGTGGTTCGGAGCGGTAATACCCTGCCGCCGATGGTTGCCGTGTAGCTACTGCCGCCCAGGTGAAAGCGCATTTCTCCGGCACGGGTTGCCAGCTGTAGCGCCTTCGCCCCCTTACTGAACCTCTGCCGGATAGCCTTCAGGTGTGCCAGCGTCAGGCGGGGTTGTGCCGTGGTATAGCTTCGGGTTTGTTCGCGCATGGCTTCCACTCCGTCATTTTGAATACGAAATAATATGCGTTTATCGCTGAACGGCTACCCGTGCGCAGTGCGATCCGTGCCTGCGTGGGTAGGGTCTGCGAATGGTTTCGCACATCAATGGCGGTAAAATGTTCACATCACACTTTATCCTCAACATAGATACCCGCTGAAATTATTGCGCCGCCAATGCGGGGCTTACCATATGCAAGCGCTACCGGATAACCCTGCGCTGCCGTGTTGGTAACCCCACCGAACGCGTAAGAAGCCTTGTTATCCGCATCCTGCTTGCTGGCTAGTCCGGCAGTCTGTGGTGATAGCATCTGTATTACACCGCCCGCTGTAAGGGATGCGCCAGCAGCAAACATTAAGTTGCTGGTCGCTATACTAATGCCGGGTGCCCAAATTGACGCAGCGATTAACGCCGCACCCAATATCGTTTGAAGCATTCCGGCTCTTTTGCTACCCATTATGACAGGTACAATTCTGATAACTTCGCCATTAACCGGATAACCTAAATCATCCAGGCCTATGTTTTTCTTTCCTTTGAAAACGGCATAAGTAAGGCCGCGCCTTTTACTGCTTAGCATGTATTTTTCAAAGCCGTCAATAGTTACTGACAGCGCCCGCGCTGCTTCATGAGTGGTGCGTATAAGGCGGTGGTGCGTTTTACCAAATAGCTTACCGAGTGCGCCGCTTAACTCTATTCTGGTCATTACTTCATCGGTGTGTAACTGTGCTTCTGGCTTCGGGTTGGCATGGTTTGGCGTGTTCGGGGTCATTACTTATTATCCTTACTTTTCGTGGTTTTCGTTGACTAATTAAACGATTATCAGTTTTGCTAATGTCGATTTCTTAATGTATGAGGGATGTTTTTTCTCTCTATATAAGCAAAAGGGTGTTGGTTCAGTTGGTTCAGTTGGTTCAGTTGGTTCAGTTTGTAAAGATGTCTGTTTTTAAAGGACTTTTAACCCGATTACTGAACCAACACGGCGCTTTTTTTGTTGGTTCAGCGGGGCGTTTTGTTGGTTCACTGCCAGCCATGAGAAAATTCCTCTTTGTCCGTTGTTGGTTCAAAACAGGCTTTTGTTGGTTCAGTGTTGGTTCAGATTCAGAAATAAAAACCTTATAAAACAACAATCTTTACAAATTGAACCAACTGAACCAACTGAACCAACACCTTAACTACGTATGTGAGAAAAAATTATTCCTCTGGCTGGCTCTCCTCGTCCGGTAAATAGTTGAGCACGTAAACCCGGTACTGCCGTCCGTCGATGCGCGGTGACTTCCTCTGATAACCTCTGTCACTGCTGGGCGGGGTGAGCATCCCGGCGCTTTTCAGCACCTCCGCAAACTGTTTCGCGTTAAAGCCTCTCGCTATCTCCCCCTCAAAGGCGGCGGGGAAGGTATAGAAGGTTATCGGGTCGTGCTCATGCCCGCCTTTTGTCCGGTAGCCTGCAAGGTCGCGGATTGGCAGCGCTGCCGGGTCATACGGGAACGGGGCGTAACGGCTCAGGCCATGGGCGTTAAGAAATGCTTCAGCCTGGGCGATGATCTGCTGGTGCTCCTTGTTCCCGGTTCCGAACTCCCTTACCCATGCGTTATAGCTGTGCTGTACCGCATCCCGGCACGTTTGCGCGTCCCATCCGGTGACCACGCCGCCCAGTATCAGCGCGGCTTCCAGAATGGCGAACCGTGCGGCTACCCTGTGTACCTGCTCCCCGTAGTCAGCCGGGATAAGCGAGCGCCAGCGGCTTTCGGCCTCCCTCACGGCGTCAACGGCCTGCTGCTGGTGGTCTGCCAGCCACTTAACCCACGCCCGGCCTGCCGCGCCGTGGTGACGCATAAAGGCATCTTTCAGCGCGTCGGCGTGCTGCTTGCCGTTCCGGTGTTCGTGGAACTCTGCCGCCTTGCTTAGCGGGATATTCAGCAGGCGCACCAGCTGCCCGGCCTTTATCTTCTGTCCGGTGCTGGCGATAAAGGTTTCCAAGTCCATTTCCCCGGTGCTGATTGCCACGGTGCGCCAGCGCTTTAAATCCCGGTTGCCGCCTTCCTTCGCGCCCTGCAATTTCCCCACGCCGTTAAACAGCGCATAAGCGGACTGCGCCACGCTGCGCGGATCGGCTCCCTGTCCGACTTCATCCAGCGGCATCAGCGCGTCATTGTGGGCGGCGGCCTCGTTTGCCAGCCCCAGCGCCGTGCCGTACCACGTCAGGCGCAGCAGGTCAGGATTACCGTACAGGCTGGCGGCGGCGTTTGCCGTGGTGGTTTTGCCCGCGCTCGACTGCTCGTAAAAGTGAAGGCCGAACCCGTCAGCGCCCGCCAGTCCGATTAGCGGGGCCGCCAGCGCCGCGCCGATCCCCGTCATCATTGACCAGTTGCCGCCCGCCAGCCTTGCCACGCTGTCCCGCCATTCTGCCGCCGTTCCCTTGCAGGTGTAGCCCGCTGCGGCTGAACTCCTGCCGTTAAACAGTACGGGCATATCGGGGGAACCGATAATCTCACCGTCCGGCATAATGTACGCGCCGCACTGCCAGCCCGTAGCCTGTGCCACGCGCCACAGCTCACGGGAACCGCTGCGCTGTAGCCAGTCGGCAAGGGTGGCCCGCATTGCGGGCTTGGTGGTGATGTTCACCCCTCCGGCCTTCAGGACGCGCCAGCCTTCGCGCTCTCCGATATCGGCAAGGGGGATGGCCTGCGTGGTACTGGCGCTGGCTCCCGCCGGGAACCATTTGATGATTAAAAACTGGTCTTTGTCGTCGCGTCCGATGCCGATAACCTCCAGCGGGGAACACAGCCACGACTCATTGTTGATAATCTTGCCGCTCTCATTGTCCACCTTCGGGGTTATCCAGAAAACGCCGTCACTGCGGCTTTCAGTGCGCGGCTTGAGCGGGTCTTTATCGCTATCCACCCCTCCCGGTTTCCTTAGCTTAACCTTAGTGTGGCTGGCTGGATCCCGTGCCGTGTTATTGCCGTCCTCTGCGCCGCTCTGCTCGTCCGTCATGCTTTCCCCCTGCGGCTGGTACAGCGAATCGTTAAAAGCGGCTGTAGCGGCCTCCAGCCCGTTTTGCTGGTGGTGGTCGTTCCAGTCTGCCTTATGGTCTGTCGGGGGGATTGATACCCATCCCGCCACGGACAGGGCGGCTTTCTCCGCTGCCTCTTTCCCGGTGTTGGAGACTCCAGTTTCGGAGCCTCCTTGCTGGTGGTCGTTATCGGCGGCAATAATGATTTTCGCCTGCGGGTGCTGCTGGCGCATTGCCTCCGCTACGGGCAGCAGGTTGCCCGCATCCACCGCCGCCACCGCCAGCGCGTCAGGGCGTATCAGGTGCGCCGTTAATGCGGTTGCCAGCCCTTCGGCAATCACTATGCTCTGCGGTGCTTCTGGTGCGTTGACGGCGTGATATGCGCCACGCTTTTCCGAACCCGCCACCAGCTTTTTTGCGCCAGCGGCGGTAATGGTCTGCGCGGCGGTGACCGTGCCGGACAGGTTCGCCAGCGCCAGCAACAGCCGCCCGTCAGAAAGCATCGGGAAGGTGAAGCCATTCAGCCCTTTTGTTATCAGGTATTCGCTTTCGCCCGGCTGCGCCTGCTCTGCCAGCGTCTGATAGTGACGGGCAAAGTTTGCCCGCCGTGCCGCCTCGTCTGCCGCGGCCTGCCGCTGGCGCTCCTGCTCCCGCTGCTGGCGTTCTGTTGCCGCCTGCGCCCGCCTCTGGCTGGCTGCTTCGCTGTCGGTTTCTGCCGCCCGGTAATCAATACCCAGCGCATCAGCCACCAGCCGCGCCGCTTCGGTGGTGTCGCAGTCGTTCACCTTTGCCACAAGGTCAAGCCCGTCACCCGCCCCGCACTGATTGCAGATATGGGAACCGCGCCCGCCATCGTCAAAGCGAAAGCGATCCGTACCGCCGCACGCCGGGCATTCGGAATGCTGCCGGGGTGAGCGGGGAACGGTGATACCCGTCATAGCCAGCACGCCGGGCCAGCGCCCTGCGGCGGCGGCGGTCACTTCGCGGATTAAGTCAATATTTCGCATCATTCCCCCTCAGTGCGCCACGGGTGCGGGCGGTAAGCCTTCATCCCGCAACATGCGGATAAAGCCATCATGCAAATCAGCCAGCAGCTCCCGCCCCATATCGGACAGCCCGCCGCCCTCAAGCATGGCCTGATACATCAGCACGGCATTTCGCGTGCCCTGTTCCCTGCCGTAGCGTTCTATCAATGCGCTCTCAATGTTGTTTGCCATAGCGAAACGCTCAGGGGCGGGGTAGACCGCCATACAGCCGTATTGCCCGCAGTAGATAACGCCGCAGTCCGTGCCTCCCCGTTCGTTTTCCACCTCCACCGCGCCATTTTTCCGCAGCTGCTCGTTGATGAAGGCCAGCGCCACACAGTGCCGCCAGACAATAAGCTCCTGCTCTGCCGTGGGCGTGTGGTAGCCCAGCGCCGCGCCGTCAGCCATTGCGCGGAGTATGTCCGGCATATCGCGCACGCCGTCAAAGTCTCCGGCCTCCAGCTTGCGCAGCATGTCGGGGTAGTCGATAAGCGCCATATGCTCATACTGTCCGGCCTCGTTGCGCCGGGTGATACTGACCCCGTGCGGGGTGGCTTCAGTGCGGTAAACGCCCGTTGCGGTCAGTGATGCGGCTTTGTTCATGCTTTGCCCTCCCGGTTCATGTAATCGACCAACTCGCTATTGGTAGCGTTCATTGCCTCCGGCACGCCTTCCAACAGGGTAATCAGTGCGCCAACCATGCGCGGTGATAGTGTGCGATTAAATATCCCCATCTCCATCCAGACGGATAACACTCCCTGCGCCTGTTCTACCCGGCAAATAGCGTCACACAACGGCATAGTTTTCATCGGCTGGCCTCCAGTACGCGAATTGCCAAATCATTAATAACGCCGATAAGTTCGCTTTGGGCATCCCTTCCGGTGTCGTTGTGTGTCAGAAAAAGCGCGGCCTCTGTCAGGATTTTGATTTTTTCCAGTGCGTCTATAGCGTCATTTTTGGCATGGGTATCAGTGCGGATCATTGGGGTATCTCCAGTCCGGGAATGGTTAGCTGTATCAGGGCGGCAATACGGGCAAGGGCGGGGCGTAGCGTCTTTAACTCACGTTTCCGCTGGCTCATTTCTGCGCTACCGTGTTTGCCGTGCCGCCAGTGTGATTTATGGTCTGCTGCTGCAAGAATGTTCTGCGCCCTCTGCGGTGGCATGGCCTCCAGTTCGTCCAGTGAGGGAATGAGTACCGGGCGAATATCAGAGAAGTGGGGGTTGTGGCGGAAATAGCCATCAACCAACTGGCGCTGTACCTGCCATGCTAAATCATCGGTAAATGGCTTTGTCAGTAGCAGATAGCCAAACTCTGTAAAAACGGTGACTTTCCCACGAACAGGGCTGGTTATTTTTTTGCCAGGGCGTTTAATGCCCCTGGCTAAACCGTCAACACTTCCCGCGCTCACTACAAAGTAATCTTTCCCCTCAATGAAGTGGCGGCGGTTACGCTGAAATGCGTTTTTGGCTGTGTCTTTCGGGCGCTGGTGAACCGTGTCAATCATGGCGAACGTCACCACGCGCTGGCCCTGATACTCCACCAGCGGCATTTTTGCATCATGCAGGGTTATGGCTTTCATGCTGCCACCTCCTGAGCACGGGTGATGCGGATATGAGACAATCCGGCGTGTTGCGCCTGTAGCATGGCGGCTGCTTTGGCGCTCTCTGCGCTGCCTGATCGGGTTTTGTGGTTAATGCCAATGGTCAGTCCGCGCTTGTTGGTTCCGTAGCCGTACACGGTGAAATAATTAAGCATGACGCACCTCCTTCACAGGCAGCCGTCCGGCAAAAAAGCAGACGTTATCTTTTGCCAGAGTGCGGCGGGCTTCGCGCTCGTTTTCGGCGGCGATATGGTGGATGACAGGTGACAATGCCGGGTTATCACGGCGAACGGCGGCGATAATCCAGACAAAATGGCGGTGCTGGTGGCCTGAATTCAGGCCGAACGGCTCCCGCCCTTGTGGGTGTGTGGTAATATCCATTACAGCGACCTCGTTAGGTGTATTAACGGTGGTGGTTAGATGCCCTGTTAGTGTTGACGCACTGCGGGGCATTGTTTTTTGTGGCATGTGATAGTAATGTGTCATCACACATATATACATTACTATTGGTGTAGCTAACGTGTCAACACACAAGAATGAACGTAGAGGTAATCCCCCCTTCCAATTCAGGCTTGGTCCTGAATTACGGGAATCGATGGAAGCGGCACAACAAAATGACGGTGACGAGTCTTTGGCTGCATGGATTAAGCGGATCATCCGTAAAGAGCTACAATCTCGCGGCATAGAGCCTAAAGCCTGATTATTTCCGGCATCGTGGCAAAGGGTGGCATTCGCAGTGCTGCCCTTTTTTATGGACTCGCAAAACTCAAGGCTTCCCTTGTGCGCAGTGGTCGCAAAATGTCGTAACGTGTCCTTAAATTCGCCTCGCGTACTGGTCACGAAACTCTCTAACGTGCCCGAAAACTCCCCTTGCACACCAGCTAAGCAATTAATCCCCCGCTTCATTTCAATGGTAGAAAAACTTTCCACTACGTGAGGTTTTGTGAGGTTTTCACTTCCGGTGATCGGGCGTGAAATTACGGTTTCTTTCAGGCTCTGAACCATAACATTTACTAACAGGCCAGCCAGTGGGAAGGCTTCTTTCTGGACGGCGGCTTGAGTCACTCGCATGGCGTCACCTCGATAAACTCCGCTTTAAACTTCTCTAACGGCTGTTGGCACGGGTGCGGGTAGCCGGGGCGCATAAAAACCACGTTCCCGCCGCTCATGCCGCGCAGTGTGACGGTGCGCCCGTGTGAGTCCTTAAACTGGCGCTGTTGGGTGCGGGTGGTCATTAGTGAACCTCCAGACGTTTAGCTAGCCAGCGCTGTGACAGGCGCACCAGCTCCGCTTTGCGCTGGTCATAAGACATACCCATATCTATAAGTGTGATATCGGTGCTTTCCAGATAGGTGAGGTGTTCCAGCTGCCCGGCGCTCATGCTGTCGCGTGGCTCCGTGGTAATGCCGCTGGACTGCGCCCACTGCTTAGCCGTCAGGCCACCCAGCACAATGCGGGCGATCATGTTGCTTTCGTTGGTGTAGTGGCGCTGTTGGGTCTGCTTCCCCAGCTCGGCGCGGGCAGCATCCAGCGCGGCGCACATTGGCTTAAACATGCTGGCGGCACTGATACGTGCCTTTAGCTGGCGGCGGTACTGTACGGCTATTTCCGGCGCACTTATCTGTAAGGCTTCTTCACACTGGGTGAAGTAACGACGAACGGCGCGGCCTTGCTCGTTGCGCTCTACCATCGCCAGCTCTTTTGCCATATTCAAACACAGGCCATAATCTTTACTGCGCCGATCCCCCCCGCGCCTTGAAGTCCATTTGGTAACGGGGTGGTTAAAGTCTGTACTTTGATTCCTGAAATTTGAGGGATCAAAAACGACATAATCCGCACCCAATGAAAACCCGTACTCCTCAATGCGGCCTTTAACCCATGTAGAAAAGTCGTTACCTACTCCCAGCGCTGCGTGTAAAGCCTTCGCGCTCACAATGCTGGTTTGACGCCCGCCAATTTGACCGGGAATAACCGGAACAATTTCAGCGAAGCTATTTGCGTTGATATCGCTGCGGCGGTGCTCAAAGTGAGGGGCGGCCCCAGAAATTAATCTGCTATTTTCGATTTTCATTTTTCAGGCTCCGATTAGGCGGCGGTGAATAAGTCCGGGTATAAGGTCAGAATGTCGTTAATTTCCTGCTGTGACAGCCCACGATAGCCACCAGCGGCGGCGTTGTGGTTTACCAGCTGAATAACGCGCATGACGTCAGCGCGGCATGTGAAGCGATAGCGGAAATGTGATCCGATGCCGTCAGGGTTTTTCTCGTCGATACGCTCAAGGTGAATATCAAGGCTGCGTTCTAATTCGCTGGCATAATTTCTGCCAGACGATAAGCGGCAATAGCGTAATATTTCATTTTCGGTAAAGCCGTTAACCCCGGTGCGCATCATGTATATGCGGGCGCGGTGCTTTTTCGGTGCTGGCTTGATGGTGGCGGTAGGGGTATCATCGTTGGCGCTAACCTGTTCGACAATGCCCGCCACGCTGCGGGTTTTCTGCATTATGCCGCCTCCCCGTCACGCTCTGCGATACGCTGATTAATCCATGCGTCTATTTCACTTTCAACGAAAGCGATAGAGCGTGATCCAATTTTTACTGGTTTTGGGAATATCCCTTGAGAAATAAGTTTATATACCCAAGCCTTACCAAGACCTATGCGCCGGAGAACTTCCGGCATGCGGATGAGATTTTGTGACATATATACCTCTATTTACCAATTCATTCGTAAATGGACATCTGTAGATAGAGGGATTATATGAATGACTTGCAAATTATATAATGACTAAAGCACTAACTTACTAATACAAGATTGTCATGAAAGAGAAAGGCCGTTAAACAACATTTTTTGTCATGACAAAGAAAGCAAGATTTCGACACAACTTTGTCACTTGCCTTGAAGTCCAACTTTTTGAATAAATGCATAGCTAAAAGATTTTAGAGATTCGATACGCTCAGTAACTGAGTCTGAATAAACATATTTATATGATGTTGTACTTTTTGCCGACCTCCCAGCCCCATAATATTTATCCAGCTTAACCTGACTTTGGTTAAATGACTTCCACAATCTAAAAATAGTTCTGTTATCGACAAATTCTTTGATAAACCCGCACCCCATCAACATACTCACAATCGAAGGGATTTGTCTTGCATTAAATCGATGTAGAGTATCTATGAAATCTATTGCCAATTGTCTTTCTAAAATCGTTGAATCTTTTCTTTTGGTAATAATCCTTGCTCTTAAATTATCTATTCTCAACTGGATCGCATCAAGGCTAAATGCTCCCTGCCCTACAATAACTCGTCCTTCACGGGAATTCGATATAAATTCCCTGAAAGCTGAATTTGCATGATAAGCATATAATTCCTCAAGTTTCTTCTTGTATTCTTCCAGTGTGTTTATCTCATCTTTTATTGTAGCACTTGTGTCCTCACCACTAAGATACCTACATATACATGCTAATATTTGCACGGGGATTTTGAATCTTGAATAAGAACAGGGAGGATTAAACCCTTCAAACATATCATACAACTCTTTCCATTCTTTTTTATCCAACCCATCAATAGAATGTGATTTGTAAATGAAAATCTTATGCGCAATAATCGCTTGATAGGGTATATCAAAGCCAGTAACAACACCTGATTTTTTTAAGGCACGGCTAATTCCATTATCATCCATGAATTCATTTACTTTATCAAAAACACAGGACATCTCTAGATCGTTTTCAATATCCTGAAGGGCTGCGCTTGCTTTTTTAGTTATATCAATAGCCTCACTAATAGTAGTTATTTTTGATGAATTAATCTCTAACAAACAATAAGTCATGCAACTATCAAAAAAATCCAACTTTCTGTACAGATCATAAATATATTCATAGGAACTGTTTTTTTTCACAACGCCACCTCGCGCCCTCTAATTTAGGAACCGTGCCAGCTCGTAGAGGTGTACGAATTTTCGGGGATCAGCCTAGACACGGTTTATTCTATTGTCGTCTATTATAGTCTACTGCTGTTTATATATCCAGCTATGCGAGCTTTCCGAACGCACCATGTACCACATTATCCCCGCTTTCCAGCGCGTCCATATAATCGGCATACCATTGGAGCATTTCCCGTCTTCCGTCCAGGTACTGCGCGTGGTTATACGTTCCTCTAATACTGTTCTTATCGACGTGAGCTAACTGGGTTTCAACCCATGCGGAGTTATAACCCTGCTCATGTAGGATTGTGCTCATCGTGTGACGGAACCCATGCCCGGTGGCCTTGCCGTCATAGCCAATACGTTTAATGACCTGATTGATACTGGCCTCACTCATTGGCTTACCCGCATCATTACGACCGGGGAAAACGTACTTACCCCGTCCGGTTAGCTGGTGGATTGCTTCAAGTAACGATTTAGCCTGGCTGGAAATGGGTACTATGTGGGGGCGGCGCATCTTCATACGTTCGGCGGGGATCTGCCAGATACCCTTATCAAAATCGATATCTGACCATTCAGAGGCGCGGAGTTCGATAGTCCTTAACCCGGTAATCATCAGTAGCTGGGTAGCGTAGCGGGTGATCATACTGCCACTGTAATTGCTCAGGTCATTAAGGAATGGGGGTATCTGGTCAACATTCAGGTGTGGGAAATGTTTTTGCTTGGGTGCTTTCAGTGCGCCAGCCAGATCGGTGACCGGGTTATTCTCTGCCCTGCCCGTTATCACCGCATGGGTAAAAATCTGCCTGCAAGCCTGCCGGGTCTTTTTCAGCTTATCCAGCACGCCGCGCTGCTCCATTTTACGCAACACTGCCAGCATTTCGGCGGGTTTAATCTCATTGATAGCCCGTTTACCGACAAACGGAAAAATATCCTTACTAAGATATTCCAGTATGTCGGTAGCATAGCCTTCTGACCAGTTTGTTTTTTTGTGCTCGTGCCACTCAAGCGCGATAGCTTCAAAGCTGTTTTGTACGGCTAATATTTTGGACTGTTTCTCTGCCTGCTTTTCCTGCCCCGGATCGCCACCCGCCGCCAGTATCTTTTTTGCCTCATCGCGCCGCCTTCTGGCATCGGACAGTGACACATCAGGATATACCCCCACCGCTAACAGCTTTTCTTTTCCTGCGGCACGGTACTTTAAGCGCCAGTATCTTGCACCATTGGGATTAACCAACAGGTACAGGCCGCCGCCGTCTGCCAGCTTATAGGGCTTGTCCTTGCCTCTGGCGGTGTCCACCTGGCGGGCGTTTAGCTTCATATTGGGGGTATCTCGTTTCATTGAACAGGAATCTACCCCCAAATGTACCCCCACATGATAGTGGATTACAATAGCCAGTAGTAGACCTTGAGATAACGCGATTAGGGGTTAAAGCCTGATTTGACGGGGTTTTGTAGACTTTGAGAGACGGTAAAAGAGGATAAAATGGTACGCCCTACAGGATTCGAACCTGTGACCTACGGCTTAGAAGGCCGGTGCTCTATCCAACTGAGCTAAGGGCGCATAATGAATGCGTCGGATTATACGGCCAGCTTCGGCTGAGTCAACGATTTTACTGCCGCCCGGCATCTGATGCTGAAGTAATGAACAATAACCTGCCAATCTCTGCCTGACAGCGTCGCCAGCTTCTGACAAAATAGGGGCAATTCCCGATTTAACTTAACGCAGATGGATACCCTCTCTGATGGTAGCAAAAATTATTGATGGTAAAACGATTGCGCAGCAGGTGCGCAATGAGGTTGCCGGGTTAGTTCAGCAGCGTCTGGCCGCCGGTAAACGTGCGCCTGGCCTGGCGGTGGTTTTAGTGGGTGAAAATCCGGCTTCACAGATATATGTCAGCAGTAAACGCCGTGCCTGTGAAGAAGTTGGGTTTCTTTCACGCTCGTACGATTTGCCTGCCTCTACCAGCGAAGCGCAGCTGCTGGAGCTGATTACTCAGCTGAATGATGACGCAGAAATAGACGGTATCCTGGTGCAGCTGCCGCTTCCTGCCGGCATCGATAATACCAAAGTGCTGGAGCATATCTCCCCGGCAAAAGATGTCGACGGCTTCCATCCTTATAACGTTGGCCGCCTGTGCCAGCGCGCCCCCACCCTGCGCCCGTGCACGCCACGCGGCATCGTGACGCTGCTGGAGCGTTACCAGATTGATACCTTCGGGCTGAATGCCGTGGTGGTAGGTGCTTCCAACATAGTTGGCCGCCCGATGAGCATGGAGCTGCTGCTGGCCGGGTGCACCACTACCGTGACTCACCGCTTTACCAAAAACCTGCGTCAGCATATTGAAAATGCCGATCTGCTGGTCGTCGCCGTTGGCAAGCCCGGCTTTATTCCCGGCGACTGGATAAAGCCGGGCGCGATCGTGATTGATGTCGGGATTAACCGTCTTGAAAGCGGCAAAGTGGTCGGCGACGTCGATTTTGACCTGGCCTCAACGCGAGCGTCCTGGATCACCCCGGTCCCGGGCGGCGTGGGTCCAATGACCGTTGCCACCCTGATCCAGAACACCCTGCAAGCCTGCGAAAATAATGACAATGGAGTATCAGCATAATGGCCACTTTCTCTCTGGGCGCTCATCCCCATGTCGATCTCTGTGACCTGATGAAGCTGGAAGGCTGGGTAGAGAGCGGCGCGATGGCAAAATCGTTTATCGCTGACGGTCTCGTTACGGTTAACGGCGCGGTGGAAACCCGCAAGCGCTGCAAGATTGTCGCCGGTCAGAAGGTGGATTTTGACGGCAACAGTGTGACCGTGACTGCCTGATTTAATCTGTTCCGCGCACAAGGCTGTCCTGTTGGGCAGCTTTTTTTATGCCTGCAAGCCTACCCAGCGTTTATCTCTTCATCATCTGACGCCGCATCGCCGTCTATACTGTGCTGGCTTGATGTATATCAATATATGGCTGACCGCTGGCGTTAAGCTGATGCTATACACCTTATAAACTGAAGAGGTTCACGATGAAAGCTGCTGTCGTCACTCACGATCATCAGGTTGATGTGGTAGAGAAAACCCTCCGCGCGTTGCAACATGGCGAAGCCCTGCTCACTATGGAGTGTTGCGGGGTCTGTCACACCGACCTGCACGTCAAAAACGGGGATTTTGGCGATAAGACCGGCGTTATCCTCGGTCACGAAGGGATTGGCGTTGTCAAATCTGTGGGTCCAGGCGTCACCTCCCTTAAGCCAGGCGACCGTGCCAGCGTGGCGTGGTTTTTCAAAGGCTGCGGCCATTGTGAATACTGCAACAGCGGCAATGAAACGCTGTGCCGCGAGGTTGTCAACGCCGGTTATACGGCCGATGGCGGCATGGCCGAAGAGTGTATCGTGGCGGCGGATTATGCCGTGAAGGTGCCTGACGGTCTCGATTCCGCGGTGGCCAGCAGCATCACCTGCGCCGGGGTCACCACCTATAAGGCGGTCAAAATCTCCGGCATCACCCCCGGGAAATGGCTGGTTGTTTATGGCCTTGGTGGCTTAGGTAATCTTGCGCTGCAATACGCAAAAAACGTGTTTAATGCCAGGGTTATCGCCGTTGACGTCAGCGATGCCCAGCTGAAATTTGCCGAAGAGATGGGGGCCGATATGGTGATCAACTCACGCAATGAAGACGCGGCAAAGCTCATTCAGCAACGCACGGGGGGCGCACATGCGGCGGTAGTCACGGCGGTTGCCAAAGCGGCGTTTAACTCTGCGGTGGATGCGATGCGCGCCGGCGGCCGTATTGTGGCAGTCGGACTGCCGTCTGAATCAATGAGCCTGAATATCCCGCGCCTGGTGCTGGACGGCATCCAGGTCGTGGGCTCGCTGGTCGGCACGCGTGAAGATTTAGCTGAAGCCTTCCAGTTCGCGGCTGAAGGCAAGGTGGTACCCAAAGTTACCCGCCGTCCGCTACAGGATATCAACGCGATCTTCCATGAAATGGAGAGCGGCAAAATCGTTGGCCGTATGGTGATTGACTTTAACCGTACTCATTAAACCCGCGCCGTACTGATTAAAAGCCATAGCCCACAAAAAAGCCCTGAAAGTTCAGGGCTTTTCATATACTCACTTCAACTTATTTGCGGCGCCAGCGGGTGCCACCGGCACCGTCTTCCAGCACGATGCCCATCTCGTTCAGACGGTCACGCGCCACGTCGGCCTGCGCCCAGTTTTTCTCGTTGCGTGCATCGATACGCATCTGGATCAGCGCTTCGATCTGCGCCACTTCTTCATCGTTGCTGTTGGCACCGCTTTGCAAGAACTGCTCCGGGTCCTGCTCCAGCAGGCCCAGTACGCCGCCCAGCACGCGCAGGCGCGCCGCCAGGCCATCTGCCGCCGCCGGGGCTTCCGCCTTCATGCGGTTAACTTCACGCGCTAAATCGAACAGCGCCGAATAGGCTTCAGGCGTATTAAAGTCATCATCCATCGCCTCACGGAAGCGCGTCTCAAACTCTTCACCGCCGGCCGGGGTAGCACCGGTATCCGTGTTACGCAGCGCGATATACAGGCGCTCCAGCGCTGAACGCGCCTGGTTGAGGTTATCTTCGCCGTAGTTCAGCTGGCTGCGATAGTGGCCGGACATCAGGAAGTAGCGCACCGTCTCAGCATCGTAATGCTCCAGCACATCGCGCACGGTAAAGAAGTTGTTCAGCGATTTCGACATTTTTTCACGGTCAATCATCACCATGCCGGAATGCATCCAGTAGTTCACATACGGGCCATCGTGAGCGCAGGTGGACTGGGCAATCTCGTTCTCATGGTGCGGGAACATAAGGTCAGAGCCGCCGCCGTGAATATCGAAGTGCTCACCCAGCTGCTTGCAGTTCATCGCCGAGCACTCAATGTGCCAGCCCGGCCGGCCGTTGCCCCACGGAGACGCCCAGCCGGGTTCGTCGGCTTTAGACATTTTCCACAGCACGAAGTCCATCGGGTTGCGCTTCACGTCGGCAGCCACCTCAACGCGCGCGCCGGCCTGTAGCTGATCCAGATCCTGACGCGACAACGAACCGTACTGCGGGTCACTGTCGACGGAGAACATCACATCGCCATTGCTGGCAACATAGGCATGCTCACGGGCAATCAGCTTCTCAACCAGCTCGACGATCTCGGTAATATGGCGCGTGGCGCGTGGCTCAAGGTCCGGCGGCAAAATATTCAGCGCGGCGAAGTCCTTGTGCATTTCGCCAATCATACGGTTGGTCAGCTGCTCAATGGATTCGCCGTTCTCCTGCGCACGTTTGATAATTTTGTCATCGATATCGGTGATGTTACGCACATATTTCAGCTGATAGCCGCTGTAGCGCAGGTAGCGCGCCACCACATCAAAGGCGGCAAAGGTTCTGCCGTGGCCGATATGGCACAGGTCGTAAACGGTAATACCACACACGTACATGCCGATTTTACCGGCATGTATGGGTTTAAATTCCTCTTTTTGACGACTCAGGGTGTTGTAAATCTTTAGCAT
Protein sequences of DBSCAN-SWA_5 >NC_010694|2726972:2745594|2728025_2728229_-|WP_049778759.1|DBSCAN-SWA MKLTARQIDSAKPKEKSYKLADGGGICLEIFPDETIGWWIKCRIAGREKQVVFGTYPLFTFAEARIK >NC_010694|2726972:2745594|2742702_2742915_+|WP_012442148.1|DBSCAN-SWA MATFSLGAHPHVDLCDLMKLEGWVESGAMAKSFIADGLVTVNGAVETRKRCKIVAGQKVDFDGNSVTVTA >NC_010694|2726972:2745594|2739059_2740067_-|WP_012442145.1|DBSCAN-SWA MKKNSSYEYIYDLYRKLDFFDSCMTYCLLEINSSKITTISEAIDITKKASAALQDIENDLEMSCVFDKVNEFMDDNGISRALKKSGVVTGFDIPYQAIIAHKIFIYKSHSIDGLDKKEWKELYDMFEGFNPPCSYSRFKIPVQILACICRYLSGEDTSATIKDEINTLEEYKKKLEELYAYHANSAFREFISNSREGRVIVGQGAFSLDAIQLRIDNLRARIITKRKDSTILERQLAIDFIDTLHRFNARQIPSIVSMLMGCGFIKEFVDNRTIFRLWKSFNQSQVKLDKYYGAGRSAKSTTSYKYVYSDSVTERIESLKSFSYAFIQKVGLQGK >NC_010694|2726972:2745594|2744205_2745594_-|WP_012442151.1|tRNA|DBSCAN-SWA MLKIYNTLSRQKEEFKPIHAGKIGMYVCGITVYDLCHIGHGRTFAAFDVVARYLRYSGYQLKYVRNITDIDDKIIKRAQENGESIEQLTNRMIGEMHKDFAALNILPPDLEPRATRHITEIVELVEKLIAREHAYVASNGDVMFSVDSDPQYGSLSRQDLDQLQAGARVEVAADVKRNPMDFVLWKMSKADEPGWASPWGNGRPGWHIECSAMNCKQLGEHFDIHGGGSDLMFPHHENEIAQSTCAHDGPYVNYWMHSGMVMIDREKMSKSLNNFFTVRDVLEHYDAETVRYFLMSGHYRSQLNYGEDNLNQARSALERLYIALRNTDTGATPAGGEEFETRFREAMDDDFNTPEAYSALFDLAREVNRMKAEAPAAADGLAARLRVLGGVLGLLEQDPEQFLQSGANSNDEEVAQIEALIQMRIDARNEKNWAQADVARDRLNEMGIVLEDGAGGTRWRRK >NC_010694|2726972:2745594|2730661_2731237_-|WP_012442134.1|tail|DBSCAN-SWA MTRIELSGALGKLFGKTHHRLIRTTHEAARALSVTIDGFEKYMLSSKRRGLTYAVFKGKKNIGLDDLGYPVNGEVIRIVPVIMGSKRAGMLQTILGAALIAASIWAPGISIATSNLMFAAGASLTAGGVIQMLSPQTAGLASKQDADNKASYAFGGVTNTAAQGYPVALAYGKPRIGGAIISAGIYVEDKV >NC_010694|2726972:2745594|2731725_2734554_-|WP_012442135.1|DBSCAN-SWA MRNIDLIREVTAAAAGRWPGVLAMTGITVPRSPRQHSECPACGGTDRFRFDDGGRGSHICNQCGAGDGLDLVAKVNDCDTTEAARLVADALGIDYRAAETDSEAASQRRAQAATERQQREQERQRQAAADEAARRANFARHYQTLAEQAQPGESEYLITKGLNGFTFPMLSDGRLLLALANLSGTVTAAQTITAAGAKKLVAGSEKRGAYHAVNAPEAPQSIVIAEGLATALTAHLIRPDALAVAAVDAGNLLPVAEAMRQQHPQAKIIIAADNDHQQGGSETGVSNTGKEAAEKAALSVAGWVSIPPTDHKADWNDHHQQNGLEAATAAFNDSLYQPQGESMTDEQSGAEDGNNTARDPASHTKVKLRKPGGVDSDKDPLKPRTESRSDGVFWITPKVDNESGKIINNESWLCSPLEVIGIGRDDKDQFLIIKWFPAGASASTTQAIPLADIGEREGWRVLKAGGVNITTKPAMRATLADWLQRSGSRELWRVAQATGWQCGAYIMPDGEIIGSPDMPVLFNGRSSAAAGYTCKGTAAEWRDSVARLAGGNWSMMTGIGAALAAPLIGLAGADGFGLHFYEQSSAGKTTTANAAASLYGNPDLLRLTWYGTALGLANEAAAHNDALMPLDEVGQGADPRSVAQSAYALFNGVGKLQGAKEGGNRDLKRWRTVAISTGEMDLETFIASTGQKIKAGQLVRLLNIPLSKAAEFHEHRNGKQHADALKDAFMRHHGAAGRAWVKWLADHQQQAVDAVREAESRWRSLIPADYGEQVHRVAARFAILEAALILGGVVTGWDAQTCRDAVQHSYNAWVREFGTGNKEHQQIIAQAEAFLNAHGLSRYAPFPYDPAALPIRDLAGYRTKGGHEHDPITFYTFPAAFEGEIARGFNAKQFAEVLKSAGMLTPPSSDRGYQRKSPRIDGRQYRVYVLNYLPDEESQPEE >NC_010694|2726972:2745594|2729596_2729785_-|WP_042959053.1|DBSCAN-SWA MTDRQTLELAVLQMARQSGEPLDRYTLYTVRNGIRNALAASERHHQRMSAPAYQWKKPAPRR >NC_010694|2726972:2745594|2736321_2737245_-|WP_012442140.1|DBSCAN-SWA MRVTQAAVQKEAFPLAGLLVNVMVQSLKETVISRPITGSENLTKPHVVESFSTIEMKRGINCLAGVQGEFSGTLESFVTSTRGEFKDTLRHFATTAHKGSLEFCESIKKGSTANATLCHDAGNNQALGSMPRDCSSLRMIRLIHAAKDSSPSFCCAASIDSRNSGPSLNWKGGLPLRSFLCVDTLATPIVMYICVMTHYYHMPQKTMPRSASTLTGHLTTTVNTPNEVAVMDITTHPQGREPFGLNSGHQHRHFVWIIAAVRRDNPALSPVIHHIAAENEREARRTLAKDNVCFFAGRLPVKEVRHA >NC_010694|2726972:2745594|2738675_2738873_-|WP_012442144.1|DBSCAN-SWA MSQNLIRMPEVLRRIGLGKAWVYKLISQGIFPKPVKIGSRSIAFVESEIDAWINQRIAERDGEAA >NC_010694|2726972:2745594|2735550_2736153_-|WP_012442138.1|DBSCAN-SWA MKAITLHDAKMPLVEYQGQRVVTFAMIDTVHQRPKDTAKNAFQRNRRHFIEGKDYFVVSAGSVDGLARGIKRPGKKITSPVRGKVTVFTEFGYLLLTKPFTDDLAWQVQRQLVDGYFRHNPHFSDIRPVLIPSLDELEAMPPQRAQNILAAADHKSHWRHGKHGSAEMSQRKRELKTLRPALARIAALIQLTIPGLEIPQ >NC_010694|2726972:2745594|2729933_2730524_-|WP_012442133.1|DBSCAN-SWA MREQTRSYTTAQPRLTLAHLKAIRQRFSKGAKALQLATRAGEMRFHLGGSSYTATIGGRVLPLRTTSTRAGYGVRHWYLCPHCHNRAAVLFIGKRDLACRKCWGLHYASQSEDRLARMRRSLFKQRAAIWGDYAPAYSLFNDCALFPKPAGMRWETFSRKVLHLQRSEQAYYRAFSPVVDKIMGAVERRTGAAGAG >NC_010694|2726972:2745594|2726972_2727329_-|WP_012442130.1|DBSCAN-SWA MNKSVALAAVVMLLAGCSSRVADLTVASTKNYNLNSNNFVKGARVKAEDSAPVVLFPLGIPNVKTAIDRAIEKNRCSVALSDVVVTQFNHSFLFGKFGFIIEGTEIIDRSQSGCENAS >NC_010694|2726972:2745594|2734563_2735175_-|WP_012442136.1|DBSCAN-SWA MNKAASLTATGVYRTEATPHGVSITRRNEAGQYEHMALIDYPDMLRKLEAGDFDGVRDMPDILRAMADGAALGYHTPTAEQELIVWRHCVALAFINEQLRKNGAVEVENERGGTDCGVIYCGQYGCMAVYPAPERFAMANNIESALIERYGREQGTRNAVLMYQAMLEGGGLSDMGRELLADLHDGFIRMLRDEGLPPAPVAH >NC_010694|2726972:2745594|2740197_2741412_-|WP_012442146.1|integrase|DBSCAN-SWA MKLNARQVDTARGKDKPYKLADGGGLYLLVNPNGARYWRLKYRAAGKEKLLAVGVYPDVSLSDARRRRDEAKKILAAGGDPGQEKQAEKQSKILAVQNSFEAIALEWHEHKKTNWSEGYATDILEYLSKDIFPFVGKRAINEIKPAEMLAVLRKMEQRGVLDKLKKTRQACRQIFTHAVITGRAENNPVTDLAGALKAPKQKHFPHLNVDQIPPFLNDLSNYSGSMITRYATQLLMITGLRTIELRASEWSDIDFDKGIWQIPAERMKMRRPHIVPISSQAKSLLEAIHQLTGRGKYVFPGRNDAGKPMSEASINQVIKRIGYDGKATGHGFRHTMSTILHEQGYNSAWVETQLAHVDKNSIRGTYNHAQYLDGRREMLQWYADYMDALESGDNVVHGAFGKLA >NC_010694|2726972:2745594|2735171_2735381_-|WP_012442137.1|DBSCAN-SWA MKTMPLCDAICRVEQAQGVLSVWMEMGIFNRTLSPRMVGALITLLEGVPEAMNATNSELVDYMNREGKA >NC_010694|2726972:2745594|2736149_2736329_-|WP_012442139.1|DBSCAN-SWA MLNYFTVYGYGTNKRGLTIGINHKTRSGSAESAKAAAMLQAQHAGLSHIRITRAQEVAA >NC_010694|2726972:2745594|2737235_2737409_-|WP_012442141.1|DBSCAN-SWA MTTRTQQRQFKDSHGRTVTLRGMSGGNVVFMRPGYPHPCQQPLEKFKAEFIEVTPCE >NC_010694|2726972:2745594|2743109_2744126_+|WP_012442150.1|DBSCAN-SWA MKAAVVTHDHQVDVVEKTLRALQHGEALLTMECCGVCHTDLHVKNGDFGDKTGVILGHEGIGVVKSVGPGVTSLKPGDRASVAWFFKGCGHCEYCNSGNETLCREVVNAGYTADGGMAEECIVAADYAVKVPDGLDSAVASSITCAGVTTYKAVKISGITPGKWLVVYGLGGLGNLALQYAKNVFNARVIAVDVSDAQLKFAEEMGADMVINSRNEDAAKLIQQRTGGAHAAVVTAVAKAAFNSAVDAMRAGGRIVAVGLPSESMSLNIPRLVLDGIQVVGSLVGTREDLAEAFQFAAEGKVVPKVTRRPLQDINAIFHEMESGKIVGRMVIDFNRTH >NC_010694|2726972:2745594|2735377_2735554_-|WP_157861808.1|DBSCAN-SWA MIRTDTHAKNDAIDALEKIKILTEAALFLTHNDTGRDAQSELIGVINDLAIRVLEASR >NC_010694|2726972:2745594|2738259_2738676_-|WP_012442143.1|DBSCAN-SWA MQKTRSVAGIVEQVSANDDTPTATIKPAPKKHRARIYMMRTGVNGFTENEILRYCRLSSGRNYASELERSLDIHLERIDEKNPDGIGSHFRYRFTCRADVMRVIQLVNHNAAAGGYRGLSQQEINDILTLYPDLFTAA >NC_010694|2726972:2745594|2728646_2729021_+|WP_012442131.1|DBSCAN-SWA MRMTARKKEILSYYQPDNLEWVTGEIGPPPFDVSGLAYLVCGLDSFDTRHHLESTRRTLETMVKDGLLEKVTSFERRQDTTQSGDGKGIWCNCSRYGLPGSCLVVRDEGGKVEAIDGEAVRIDN >NC_010694|2726972:2745594|2727685_2727874_-|WP_049778758.1|DBSCAN-SWA MLEYLKKDVCPHIGKRPISAINLKEMPDVLPKLEDRGVLDKLKKTRQACRQIFSYAVISGGA >NC_010694|2726972:2745594|2737408_2738245_-|WP_012442142.1|DBSCAN-SWA MKIENSRLISGAAPHFEHRRSDINANSFAEIVPVIPGQIGGRQTSIVSAKALHAALGVGNDFSTWVKGRIEEYGFSLGADYVVFDPSNFRNQSTDFNHPVTKWTSRRGGDRRSKDYGLCLNMAKELAMVERNEQGRAVRRYFTQCEEALQISAPEIAVQYRRQLKARISAASMFKPMCAALDAARAELGKQTQQRHYTNESNMIARIVLGGLTAKQWAQSSGITTEPRDSMSAGQLEHLTYLESTDITLIDMGMSYDQRKAELVRLSQRWLAKRLEVH >NC_010694|2726972:2745594|2741836_2742703_+|WP_012442147.1|DBSCAN-SWA MVAKIIDGKTIAQQVRNEVAGLVQQRLAAGKRAPGLAVVLVGENPASQIYVSSKRRACEEVGFLSRSYDLPASTSEAQLLELITQLNDDAEIDGILVQLPLPAGIDNTKVLEHISPAKDVDGFHPYNVGRLCQRAPTLRPCTPRGIVTLLERYQIDTFGLNAVVVGASNIVGRPMSMELLLAGCTTTVTHRFTKNLRQHIENADLLVVAVGKPGFIPGDWIKPGAIVIDVGINRLESGKVVGDVDFDLASTRASWITPVPGGVGPMTVATLIQNTLQACENNDNGVSA >NC_010694|2726972:2745594|2729086_2729518_+|WP_042959052.1|DBSCAN-SWA MNKSLSVTFRVPAELANAFTSAVSESGSDKTAWLIDAVRHKLNQPDSNADSRLLALVERMEIAAAALAVGKQGIPPRPYNEAAAIQVVAATIREGFDNGRIIAERLNEAGYQTKAGKAWDKDIYSAWKRQGGNLSRISAALIA |
25 | Enterobacteria_phage(28.57%) | tail,tRNA,integrase | attL 2738592:2738605|attR 2743026:2743039 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2807102 : 2814381
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NC_010694|2807102:2814381|DBSCAN-SWA CTTAATTTACAGCGTCTTTCAAAGTTTTACCGGCACGGAAGCCCGGAACTTTTCCCGCTGCAATAGTAATTTCTTTACCCGTTTGCGGGTTGCGGCCAACACGTTCAGCCCGTTCACGCACCGAGAAGGTGCCAAATCCCACCAGTGCAACCTCATCACCCGCCTGCAGAGATTCCGTTACAGAACCGATAAACGCATCCAGTACACGCCCTGCAGCAGCCTTAGAAATATCAGCATCTGTGGCAATTTTGTCGATCAACTCTGACTTATTCACTCACTCATCCCCTATTTTGTTGTTTGCACCCGGTACAGCCGTTCGCGATACCTGCACATACAGCTGTTTTTTTATGGCCACCGTGCTTTGGCAATCCGTGACCGCTGCCACCCATAGCCTGACCGCATCCTAACGTTAGCCGTAGCCCAAAAGGCTGGCAAGCACCAATTCACTTACCAGCCTTAGTTTAGGGTGAGTTTGCCTTAAATCACTATTTGGCAGTCACAACCTGCATACCATTAGGCGCATTTTGCAGCGCCAGGGCTAACACTTCTTCGATACGTTTCACCGGGTGGATTTCCAGGTCGGCAATGACGTTTTCCGGGATTTCCTCAAGATCGCGCTTGTTTTCATCCGGGATCAGCACCGTTTTAATGCCACCGCGGTGGGCTGCCAGCAGTTTCTCTTTCAATCCACCAATAGGCAGAACCTGACCACGTAGCGTGATCTCACCAGTCATGGCAACATCAGCACGCACCGGATTACCGGTAAGGCAGGATACCAGCGCGGTACACATGGCGATACCTGCGCTTGGACCATCCTTAGGCGTTGCCCCTTCAGGCACGTGTACGTGAATGTCACGTTTTTCGTAGAAATCACCGTTGATACCCAGTTTCTCTGCACGAGCACGCACGACCGTCAGCGCCGCCTGAATCGATTCTTGCATCACTTCACCCAACGAACCGGTGTAGGTCAGTTTGCCTTTGCCAGGAACGCAGGCGGTTTCGATGGTTAACAGATCGCCACCGACTTCAGTCCACGCCAGACCGGTAACCTGCCCTACACGGTTTTCGCTGTCGGCACGGCCGTAGTCATAACGCTGCACGCCGAGATAATCTTTCAGGTTATCCGCGTTAATTTCAATATGCTTAACAGACTTGTCCATCAGCAGCGTTTTTACCGCTTTGCGGCACAGCTTGGACAGTTCACGCTCCAGGCTACGCACACCCGCCTCACGCGTGTAGTAACGAATAATGCCAACGATAGCGCTATCGTCAACGGTCAGCTCGTGTTCTTTCAACGCGTTGCGTTCGATTTGCTTCGACAGCAGGTGCTGTTTGGCGATATTGAGCTTTTCGTCTTCGGTGTAGCCGGAAAGACGGATCACTTCCATACGGTCCAGTAGCGGAGCCGGAATATTCATCGAGTTAGACGTCGCAACAAACATCACGTCAGAAAGATCGTAATCGACTTCCAGATAGTGATCGTTGAAGGCGATGTTTTGTTCCGGATCGAGCACTTCCAGCAGTGCGGAGGCGGGGTCGCCACGCATATCCGACGACATTTTGTCAATTTCATCCAGCAGGAACAGCGGGTTTTTTACCCCTACTTTCGCCATTTTCTGGATTAGCTTGCCCGGCATCGAACCAATATAGGTACGACGATGCCCGCGGATCTCGGCTTCATCACGCACACCACCCAGCGCCATACGCACGTATTTGCGCCCGGTCGCTTTGGCTATGGACTGTCCGAGGGATGTTTTACCCACGCCAGGAGGTCCAACGAGGCACAGGATAGGGCCTTTAATTTTGCTGACGCGACTCTGCACAGCCAAATATTCGAGGATGCGATCTTTGACACGTTCCAGGCCGAAGTGGTCAGTATCCAGCGTTTCCTGCGCTTTCTGCAGGTCTTTTTTCACCTTGCTGCGCGCATTCCACGGTACCTGCACCATCCAGTCAATGTAACCACGAACCACGGTCGCTTCTGCCGACATCGGCGACATCATTTTTAGCTTTTGCAGTTCGGCTTCGGTTTTTTCACGCGCTTCTTTCGGCATTTTGGCCGCATCGATTTTGCGTTTCAGCGCTTCATTTTCGTCCGGCGCATCGTCCATCTCGCCAAGCTCTTTCTGAATCGCCTTCATTTGCTCATTCAGATAGTATTCACGCTGACTTTTTTCCATCTGCTTTTTAACGCGATTGCGGATACGTTTCTCAACCTGCAGCAGGTCGATTTCCGATTCCATCATCGCCATCAGATATTCCAGACGCTCATTGACGTCTGACATTTCCAGCACCGACTGTTTATCCGCCAGCTTCAAAGGCATATGGGCGGCAACGGTATCAGCAAGGCGCACCGCGTCTTCAATGCTGTTAAGTGAAGTCAGGACTTCCGGCGGGATCTTTTTGTTCAGTTTGATATAGCCTTCAAACTGATTAATCGCCGTACGCACCAGCACTTCCTGCTCGCGTTCTTCAATTTCCGGTGACGTCAGATACTCCGCCTGGGCGCTAAAATGATCGCCGTTATCGGATAAGGTCGTGATGCGCGCGCGCTGTAATCCTTCCACCAGCACTTTAACCGTGCCATCCGGCAGCTTTAGCATCTGCAAAATGGAGGCAACGGTTCCGACTGAGAAGAGGTCGTTAATACCAGGCTCATCCGTTGAAGCTTCTTTCTGAGCGACCAGCATGATCTTTTTGTCATGATCCATGGCGGCTTCAAGGCAGCGAATTGATTTTTCACGACCGACAAACAACGGAATGACCATGTGCGGATAAACCACTACGTCACGCAACGGCAACACAGGGATTTCAATGCGTTCAGAACGCTCAGGATTCATAGAGCTCTCTCTTAGTTTAATTTCCGCCAGGGTAAGGGGACCGCATCATCCAGGATAAGGTGCGGTACAACCCACAGGTACCTGAGTATATGGGGATGAATGGAATAGATTCAATGTCGAGGTTATGAGAAAAGTAAAAGGGGGGAATATTTCCCCCCTGAGCCTTAACTCACTGGAATGATTTGGTTATTTATTCACCAGATACATGCTGAACGTCCGGTTTGCCGTAAATCAGCAACGGCTCGCTTTCTCCGGCAATAACCGATTCGTCAATGACCACTTTTTCCACGTCATCCACCGACGGCAAATCGTACATGGTGTTCAGCAGCGCGGCCTCAACGATTGAACGCAGCCCGCGCGCGCCGGTTTTGCGCAGCATCGCCTTGTTGGCGATGGCCTTCAGCGCCTCTTCCCGGAACTCCAGCTCCACGCCTTCCAGATTAAACAGCGCCTGGTACTGCTTGGTCAGCGCATTTTTCGGCTCGCGCAGGATCTGGATCAGTGCTTCTTCGCTCAGCTCGGTCAGCGTCGCCACCACCGGAAGACGTCCGATAAATTCCGGGATCAGGCCGAATTTAATCAGGTCTTCAGGCTCCACCTGCGCCAGCAGCTCGCCTTCCGTTGCCTTCTCTGACTTGCCCTTCACCGATGCACCAAAGCCGATGCCGCTGCCGCTGTCCACGCGCTGCGAAACCACCTTGTCCAGTCCGGCAAACGCGCCGCCGCAGATAAACAGGATTTTCGAGGTGTCCACCTGCAGAAACTCCTGCTGCGGATGCTTACGCCCGCCCTGCGGGGGAACCGCGGCCACCGTTCCTTCAATCAGCTTCAGCAGCGCCTGCTGAACGCCCTCGCCCGACACGTCGCGGGTAATCGACGGGTTGTCCGACTTGCGCGAAATCTTGTCGATTTCATCGATGTAGACGATGCCACGCTGCGCCTTCTGCACGTCATAATCGCACTTCTGCAGCAGCTTCTGAATGATGTTTTCCACGTCTTCACCCACGTAGCCCGCTTCGGTCAGCGTGGTGGCATCCGCCATGGTGAACGGCACGTCAAGCAGCCGTGCCATCGTCTCGGCGAGCAGGGTTTTACCGCTTCCCGTTGGGCCTATCAGCAGAATGTTGCTTTTACCCAGCTCGATACCGTTGCTGGTGTCGCCGTTACGCAGACGCTTATAGTGGTTGTAGACCGCGACCGACAGCACCTTCTTGGCGCGCTCCTGACCGATAACGTAATCGTCAAGATGGTGGCGGATCTCATGCGGCGTGGGCAGCGAACTGCGCTCGCGGTGCGGTGCAATCTCTTTGATCTCTTCGCGAATGATGTCGTTGCATAAGTCAACGCACTCATCGCAGATGTATACTGACGGGCCGGCAATCAGCTTACGCACTTCATGCTGGCTTTTGCCGCAAAAAGAGCAGTACAGCAGCTTCCCTGAACCGTCCTTGCGTTTGTCTGTCATCGGTTAGCCTCATACCTGGTTCACAACGGAACTGAAATGTTCTGTCGCAATTAACTCCTGCTGAATCTTAATATAGTTCATTTTCCCGTTAAACTGGGACAACAGCACAATATTCCACCATGTGCAGTGCTCAGAAAGGCCATTTTGCATGTTATTTGGCGACATTGCGCTAAAGCAGAAAAATTAGAGGTGAACTTCGCTTAACATGCTGTTTTAAATAATAAATAAGCGGAAACATTCGACTTGATTTAGCTGCTAACTGGATTATGCTAAACTCTGGTCACTTTAAATGAATCCGACGGAGTCTAAGCATGTCTGAAGAGCCAGAGAAAAAAGAAGAAAACGGCCAGGAAGAGAAGAGCTCTGCGCTGGCAGTGAATAAACTGTTGCAGTCACGCTCTATTATCATTTCAGGTGAGATTAACCAGGCGCTCGCCGAGAAAGTGACGGCTCAACTGCTGATCCTCCAGGAAATGGGCGATGAACCCATCAAGCTGTTTATCAACAGCCAGGGGGGCCATGTTGAAGCCGGCGACACCATCCATGACATGATTAGGTTTGTCAGACCTGAAGTGCTGGTTATCGGTACGGGTTGGGTGGCCAGCGCCGGTATCACTATTTTCCTCGCGGCCAACAAAGAAAACCGTTATACGCTGCCAAATACCCGCTTTATGATCCACCAACCGCTGGGCGGCGTACGCGGTAAGGTTTCAGATATTGAGATTGAGGCGAAAGAGCTGCTGCGCGCGCGCGCGCGTATCAATCAGCTTATCAGTAATGCGACAGGCCAACCGCTGGAGAAGGTGGAAAAGGATACGGACAGTAACTACTGGATGAGTCCGGAACAGGCGATTGATTACGGGATCGCCACCCACGTCATCACTTCGTGGAACGAGCTGAAGGCGTAATGCGCTTTAGCCTGATATAAAAACGGCCCGCTTCTGACTGGCGGGCCGTTTTTATTTGCATAACAAATGGATTATTCTCCAGATGCCTGCTGCGTGTCTGCCTTACGATAAATCAGCTGCGGATCGCTTTCTCCGGCAATAACCGATTCGTCAATGACCACTTTTTCCACGTCATCCACCGACGGCAAATCGTACATGGTGTTCAGCAGCGCGGCCTCGACGATCGAACGCAGCCCGCGCGCGCCGGTTTTGCGCAACATCGCCTTGTTGGCGATGGCCTTCAGCGCCTCTTCCCGGAACTCCAGCTCCACGCCTTCCAGATTAAACAGCGCCTGGTACTGCTTGGTCAGCGCATTTTTCGGCTCGCGCAGGATCTGGATCAGCGCTTCTTCGCTCAGCTCGGTCAGCGTCGCCACCACCGGAAGACGTCCGATAAATTCCGGGATCAGGCCGAATTTAATCAGGTCTTCAGGCTCCACCTGCGCCAGCAGCTCGCCTTCCGTTGCCTTCTCGGACTTGCCCTTCACCGATGCACCAAAGCCGATGCCGCTGCCGCTGTCCACGCGCTGCGAAACCACCTTGTCCAGTCCGGCAAACGCGCCGCCGCAGATAAACAGGATTTTCGAGGTGTCCACCTGCAGAAACTCCTGCTGCGGATGCTTACGCCCGCCCTGCGGGGGAACCGCGGCCACCGTTCCTTCAATCAGCTTCAGCAGCGCCTGCTGAACGCCCTCGCCCGACACGTCGCGGGTAATCGACGGGTTGTCCGACTTGCGCGAAATCTTGTCGATTTCATCGATGTAGACGATGCCACGCTGCGCCTTCTGCACGTCATAGTCGCACTTCTGCAGCAGCTTCTGAATGATGTTTTCCACGTCTTCACCCACGTAGCCCGCTTCGGTCAGCGTGGTGGCATCCGCCATGGTGAACGGCACGTCAAGCAGCCGTGCCATCGTCTCGGCGAGCAGGGTTTTACCGCTTCCCGTTGGGCCTATCAGCAGAATGTTGCTTTTACCCAGCTCGATACCGTTGCTGGTGTCGCCGTTACGCAGACGCTTATAGTGGTTGTAGACCGCGACCGACAGCACCTTCTTGGCGCGCTCCTGACCGATAACGTAATCGTCAAGATGGTGGCGGATCTCATGCGGCGTGGGCAGCGAACTGCGCTCGCGGTGCGGTGCAATCTCTTTGATCTCTTCGCGAATGATGTCGTTGCATAAGTCAACGCACTCATCGCAGACATATACTGACGGGCCGGCAATCAGCTTACGCACTTCATGCTGGCTTTTGCCGCAAAAAGAGCAGTACAGCAGCTTCCCTGAACCGTCTTTGCGTTTATCTGTCATTTACTTCCCTCTCATTGTCGCGCTTATCTTCCCACCGGGGGAGTACAAGCAGACCGCGGCCGGCGAAGGCAACGGTTATCTGTCAGCACAGACTTAATCAATTACGCTGGGCTAAGACTGAATCCACTAAACCATATTCCACGGACTCACTGGCCGACAGGAAGCGATCGCGATTGGTATCGCGCTCAATGGTTTCAATCGTCTGGCCGGTATGTTTTGCCATCAGCTCATTCATCATCTGTTTGGTTTTGATGATTTCGCGCGCATGAATATCAATATCTGAAGCCTGGCCCTGGAAGCCGCCCAAAGGCTGGTGGATCATGACGCGTGAATTAGGCAGGCAGTAACGCTTACCTTTTGCGCCAGCAGTCAGTAGGAAGGCTCCCATAGAACAGGCCTGCCCCATACAGATAGTGCTGACGTCAGGTTTAATGAACTGCATAGTATCGTAAATTGACATACCTGCGGTAATCACGCCACCGGGAGAATTGATATAAAGATGGATATCTTTTTCTGGATTCTCCGCTTCCAGGAACAGCATCTGCGCGACGATCAGGTTAGCCATATGGTCTTCAACCTGACCGGTCAGGAAAATAATGCGCTCTTTAAGTAGGCGCGAGTAGATGTCGTATGAACGCTCTCCGCGAGAGGTTTGTTCGACCACCATTGGCACCAAGGCCATATGAGGTGCTGTTAGTTCACGTTCGCCACTGTATGACAT
Protein sequences of DBSCAN-SWA_6 >NC_010694|2807102:2814381|2807102_2807375_-|WP_012442216.1|DBSCAN-SWA MNKSELIDKIATDADISKAAAGRVLDAFIGSVTESLQAGDEVALVGFGTFSVRERAERVGRNPQTGKEITIAAGKVPGFRAGKTLKDAVN >NC_010694|2807102:2814381|2811717_2812314_+|WP_012442219.1|protease|DBSCAN-SWA MSEEPEKKEENGQEEKSSALAVNKLLQSRSIIISGEINQALAEKVTAQLLILQEMGDEPIKLFINSQGGHVEAGDTIHDMIRFVRPEVLVIGTGWVASAGITIFLAANKENRYTLPNTRFMIHQPLGGVRGKVSDIEIEAKELLRARARINQLISNATGQPLEKVEKDTDSNYWMSPEQAIDYGIATHVITSWNELKA >NC_010694|2807102:2814381|2813757_2814381_-|WP_012442221.1|DBSCAN-SWA MSYSGERELTAPHMALVPMVVEQTSRGERSYDIYSRLLKERIIFLTGQVEDHMANLIVAQMLFLEAENPEKDIHLYINSPGGVITAGMSIYDTMQFIKPDVSTICMGQACSMGAFLLTAGAKGKRYCLPNSRVMIHQPLGGFQGQASDIDIHAREIIKTKQMMNELMAKHTGQTIETIERDTNRDRFLSASESVEYGLVDSVLAQRN >NC_010694|2807102:2814381|2807586_2809941_-|WP_012442217.1|DBSCAN-SWA MNPERSERIEIPVLPLRDVVVYPHMVIPLFVGREKSIRCLEAAMDHDKKIMLVAQKEASTDEPGINDLFSVGTVASILQMLKLPDGTVKVLVEGLQRARITTLSDNGDHFSAQAEYLTSPEIEEREQEVLVRTAINQFEGYIKLNKKIPPEVLTSLNSIEDAVRLADTVAAHMPLKLADKQSVLEMSDVNERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPDENEALKRKIDAAKMPKEAREKTEAELQKLKMMSPMSAEATVVRGYIDWMVQVPWNARSKVKKDLQKAQETLDTDHFGLERVKDRILEYLAVQSRVSKIKGPILCLVGPPGVGKTSLGQSIAKATGRKYVRMALGGVRDEAEIRGHRRTYIGSMPGKLIQKMAKVGVKNPLFLLDEIDKMSSDMRGDPASALLEVLDPEQNIAFNDHYLEVDYDLSDVMFVATSNSMNIPAPLLDRMEVIRLSGYTEDEKLNIAKQHLLSKQIERNALKEHELTVDDSAIVGIIRYYTREAGVRSLERELSKLCRKAVKTLLMDKSVKHIEINADNLKDYLGVQRYDYGRADSENRVGQVTGLAWTEVGGDLLTIETACVPGKGKLTYTGSLGEVMQESIQAALTVVRARAEKLGINGDFYEKRDIHVHVPEGATPKDGPSAGIAMCTALVSCLTGNPVRADVAMTGEITLRGQVLPIGGLKEKLLAAHRGGIKTVLIPDENKRDLEEIPENVIADLEIHPVKRIEEVLALALQNAPNGMQVVTAK >NC_010694|2807102:2814381|2812385_2813660_-|WP_012442220.1|protease|DBSCAN-SWA MTDKRKDGSGKLLYCSFCGKSQHEVRKLIAGPSVYVCDECVDLCNDIIREEIKEIAPHRERSSLPTPHEIRHHLDDYVIGQERAKKVLSVAVYNHYKRLRNGDTSNGIELGKSNILLIGPTGSGKTLLAETMARLLDVPFTMADATTLTEAGYVGEDVENIIQKLLQKCDYDVQKAQRGIVYIDEIDKISRKSDNPSITRDVSGEGVQQALLKLIEGTVAAVPPQGGRKHPQQEFLQVDTSKILFICGGAFAGLDKVVSQRVDSGSGIGFGASVKGKSEKATEGELLAQVEPEDLIKFGLIPEFIGRLPVVATLTELSEEALIQILREPKNALTKQYQALFNLEGVELEFREEALKAIANKAMLRKTGARGLRSIVEAALLNTMYDLPSVDDVEKVVIDESVIAGESDPQLIYRKADTQQASGE >NC_010694|2807102:2814381|2810131_2811406_-|WP_012442218.1|protease|DBSCAN-SWA MTDKRKDGSGKLLYCSFCGKSQHEVRKLIAGPSVYICDECVDLCNDIIREEIKEIAPHRERSSLPTPHEIRHHLDDYVIGQERAKKVLSVAVYNHYKRLRNGDTSNGIELGKSNILLIGPTGSGKTLLAETMARLLDVPFTMADATTLTEAGYVGEDVENIIQKLLQKCDYDVQKAQRGIVYIDEIDKISRKSDNPSITRDVSGEGVQQALLKLIEGTVAAVPPQGGRKHPQQEFLQVDTSKILFICGGAFAGLDKVVSQRVDSGSGIGFGASVKGKSEKATEGELLAQVEPEDLIKFGLIPEFIGRLPVVATLTELSEEALIQILREPKNALTKQYQALFNLEGVELEFREEALKAIANKAMLRKTGARGLRSIVEAALLNTMYDLPSVDDVEKVVIDESVIAGESEPLLIYGKPDVQHVSGE |
6 | Bacillus_virus(33.33%) | protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|