Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP043488 | Labrys neptuniae strain KNU-23 chromosome 2, complete sequence | 0 crisprs | csa3 | 0 | 0 | 0 | 0 |
NZ_CP043489 | Labrys neptuniae strain KNU-23 chromosome 1, complete sequence | 2 crisprs | WYL,csa3,cas3,RT,DEDDh | 0 | 0 | 3 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP043489_1 | 1837189-1837275 | Orphan |
NA
Consensus repeat of NZ_CP043489_1
|
1 spacers
spacers of NZ_CP043489_1
>1.1|1837212|41|NZ_CP043489|CRISPRCasFinder TTTTATTGCGTATCGCAGCGCAACATGATAGTCAACGGAAA |
CRISPR arrays and Neighbor proteins around NZ_CP043489_1
The CRISPR arrays of NZ_CP043489_1 >merge|NZ_CP043489|1|1837189-1837275|CRISPRCasFinder TTGACTGAACCGTTCGGTTCGATTTTTATTGCGTATCGCAGCGCAACATGATAGTCAACGGAAACTGACTGAACCGTTCGGTTCGAT >NZ_CP043489|1|1|1837189-1837275|CRISPRCasFinder TTGACTGAACCGTTCGGTTCGAT TTTTATTGCGTATCGCAGCGCAACATGATAGTCAACGGAAA CTGACTGAACCGTTCGGTTCGAT
>NZ_CP043489.1|WP_149252317.1|1836510_1837167_-|TetR-family-transcriptional-regulator MLAESPAPAKEPATDPPAADNAKCRQIVDGARRVFLAHGFEGASMNDIAKEAGVSKGTLYVYFENKERLFAAIVDEERSSHVERIFEFDYNSPDVEGTLLELSAAITAFICQPRIISAMRAVMGITERMPDIGAHFYNAGPGHSRKQLAKYLDLRVAAGQLAIEDTELAAAQFLEMSHGPLLKPMFFMANNTPPTKERIREVAQSAVRVFMAAYGVKK >NZ_CP043489.1|WP_149252316.1|1835601_1836384_+|hypothetical-protein MRTLLIRPLPVLVTLVVIAGWMAPKQGILDRASGSIKGDLPGGVQTGTGEFDVGTVTAVRVTGPAGLVRLSASRGGPYRAELRSRPEGWFGFWRSNWSAGGCANAGSIRLVGTQLQVDTGNRAWFGASDCRLELDASLPEGVAVSIEQDATSSQLSGNFASLDVDSRAGDIALDGHARTVSIEGNAIRARLSYARVDQDESITLGGNAIDAELRFAGAEAVNYAVSGHASLVDSTLPNKPGVRPAIAIKGNFLRVRIGGE >NZ_CP043489.1|WP_149252315.1|1834322_1835462_-|helix-turn-helix-domain-containing-protein MIFVPLPFVSALLFAILLVQMARQGDRPWRDNAFFLLLLGFTVLAVLLGLRWGYGIRTFIPLQAMLAALTPALAWLAFRGLTVEGPALRWAKVWPHLLPAGLVGLLFTISSVSIDLVVILAFLGYGLALVRLVLAGPDLLIAPRLDGVLRSYRALQITAVACLLSGLMDIVISFDMRWMGGAISEAMVSGGNVVVLLVLGAAAASAGSAAPAEEVEPPTVEPPVRPGSDEDKAVAQALDALMAERQLYKDVDLNLGRLARRLNLPARRVSSAVNRIHGMSVSHYVNKYRIEEACRLLASTDTSIIQVMMEAGFLSKSNFNREFLRLTGVSPIAWRRGQRAAPVGNAETDKQKAPLWRGPQTQLNMTQRDLTQLERNATQ >NZ_CP043489.1|WP_149252314.1|1834026_1834329_+|hypothetical-protein MLKKILTGTLVAATLAGTAIATTGTAEARYGRGGAFAAGAGIGLLGGLLAGSAYNNSYYGGGYYDEYRPVYYRQYRRCTIQKRWVEDYYGGHWARVRVCY >NZ_CP043489.1|WP_149252313.1|1831403_1833833_-|hypothetical-protein MRSGWRVGVWLVLGLMFVAPPVVAAPCSGEPAACAAVGKTALGLPYQAALTILQDMGEKDRQVRIERLSKALEANPVDDLARLELIMLGGDDRKQTAIATSLADLLAKAGKAGDPTVAALARLLSIPTIDAEGEEAEARITALVEAALADIQTMRHATMRGGQARLTVDDADLLAGLIARRAVFDLDEAIEDYDRDVGKAHAALLQAAIEAFSARLERAPRDPARFQLLVEALRDNYYDKGRYDRLVEICRRWTSVAPANPKARRSLVSALAARADERNLADKTASALADAEALAVLAKPTAGSARLDAIESEALVASHRVRAVAQGKTEPVRAAATLAEGLEQLRAGTQGDPAQSSGDDLILAAAMLAGERIDKQDAASADAIETKLLSVLAEPERRAQLRDMIAARYQTKGEVEAAMQRYRRSMAELAAAPLDGERSWRFFAAARNLLDLGRRDARSFDADAHARLLTDYAAEARLRALDGLDGKERARLLLQIADALHVTAKRLEEAGIEAPRIGLLEREIALREPLVRDPAERANRLDDLARAYRDLSDAFDTANREKEALDLARKRVEIRRELKEKSPSGFDELSDYVWALRDFGDEQRHVGDDKGAQASYQEGAEVGAMLLERFPDRGGSYEAMSAIQVAMGHAARSTMMKLIHYKRAEATNLAHLGKLGEKKFDQDFLAVSSINIGDTYLEAGKYQAALTHLGKALEASDKSLADDKDNNSLLRRRVRIFDKMARAEQGLGRTEPAIATRRRQIELLEKLARLRGAAVENQAEAYEALISLLSESGDHADEIAEIRRKVAAL >NZ_CP043489.1|WP_149252312.1|1830149_1831235_+|redoxin-family-protein MVKLLLQHLTAELTPITSRVEQMQMPGFVHRPRGTGAMPAGEGNGDEALIFEGDAACLVAAPAAFSGQLRREGDEIAWLVELIGFEPSPKPGESRLPLGFRGLQGRSGFTGIRQEAPQQLWGTSARLYHRTRAWLSPSSKAFRPACSVQGSSKPVRNHLRPVAKAGMMASSRIVGAHRSKEPGKAPHGRTGMIGRRAFLASIAASLATSARAEDATAPDSNGTVLGSFQVEALPGLVDVPAPTLEALQDQITVLNFWASWCEACQEEHRYLVNLQRKGVRIAGVAVQDRGEAVLRYLEKAGNPYGFVGIDNKRELITMLSLRSIPQTFLIGRRCEVVWQTDEGLDNALVAELLGKIEAISG >NZ_CP043489.1|WP_149255370.1|1829033_1829813_-|TIGR04222-domain-containing-membrane-protein MLSRALQDSVNGPVMPGDAGVLLGPYHFAYLAGGANRVLEAALTQLYLDGTIAMQSNEAVLIRRVPRRAPAVERLIGDKLAEGPLRIGPTTIEIAVEPIRRDLLVAGLVPGPDELARTRQIPFLLIGPLLLLALIRFFFGIANERPIALLAFCLVATPFLIVIAAMRQPPHTRAGGELLRQAERSIQARGKPAANSPGLTEWVALHGHVGLAGLGLTAFSFFLANQPALAVKAGGGGSCGGGGGDGGGGGCGGGGGCGG >NZ_CP043489.1|WP_149252311.1|1827302_1828865_+|trimethylamine-methyltransferase-family-protein MTDAADNATHTEAPSASRRGRDARRAARVQRGGVSVPYITRNIPLTEVLSEEAMQIIEHNAETLLEEVGIEFREYPRALELLKAAGCDIKGERVRFPRGLARKLIQTAPSQYTQHARNSERNVVIGGNNTVFAPNYGSPFVHDLDKGRRYGTIEDFRNFVKLAYANPYVHHSGGTVCEPVDLPVNKRHLEMLYAHMRLSDKPFMGSVTAPERAQDTVDMAKILFGEDFIRENTVCTSLINANSPMVWDNTMLGAADVYAQNNQACIITPFILSGAMSPVTVAGTLTQVLAEVLAGVSFLQLVRPGAPAIFGTFVSTLSMQSGAPTFGTPEAALAIYGAGQLARRMKLPFRSGGSLCASKVPDAQAAYESANTLLPAMFGGVNFMLHSAGWLEGGLSASYEKFVMDFDQLGAMHVLAKGVDMSENGQAMDAFHQVEPGGHFLGCAHTQANFETAFYRSTISDNNSVEQWEAEGKQDAAQRANKIWKKTLADYEAPAIDPGIDEALRDFIERKKAAVPDANY >NZ_CP043489.1|WP_149252310.1|1826503_1827043_-|HdeD-family-acid-resistance-protein MEVLQRSWPWFVILGVVAVIGGILALIHPGFASLVVVVWAAWAFIVLGVGQLVHATVIRAWSGFLMTALMGILALLLGASLLLNPLAGVVSLTALLGAMFLVYGLAKVIIAFNIRASANWTWLLLSGLISILLAVLIFSDFQQSASSLLGILLGVELLFYGFASLMTGMALRSRVDGSR >NZ_CP043489.1|WP_149252309.1|1825862_1826489_-|prolyl-oligopeptidase-family-serine-peptidase MTDRLPDSLVILLHGVAAFGYDLDPLAGMLRRSLPRTAVVAPDAPFAYEQGPGRQWYSLEGVTPENRLARIVAARPAFDALIRSLVAAQGLEKRLERVALVGFSQGATLAFDAVARGRWPVGALALLSGRFVAPAPFTPARMTPVLLVHGSADGAVPSEETRRARALLQEADMTVESHILRGVGHTISPTGVKLTRRFLRERLGEAGV >NZ_CP043489.1|WP_149252318.1|1837499_1838828_+|HlyD-family-efflux-transporter-periplasmic-adaptor-subunit MDAARDHNAVPGGQGATGPADAIDNVVTLERQRTDTPEAPEIKKTEPQKAEAPAVPAQTKPAAAAGKKKSKARTVMPILLIVALAAGGWYGYDWWTNGRFMVETDDAYVQADVSTLGVKVSGYVDSVPVQNGDSVKAGDVIVKLDDTDYRTALDSAKAKRVTQNATIARIDQQVTAQQAAIETANAGVASAKAGIESAQAGIDSAKAEIVRANAAFERADTLAAQNFGSKATLDQAIADRDKANAGLASAKATLTNAQASLNSAQAGVIAAKANLAVTQAQKAEAEQGAKELDVAITKAQNDLDATVVRAPSDGVVGNRAAQPGQYVSPGSRLIALVPLKSIYVAANFKETQLGPLVPGQKVEVSVDSMDGNAFEGVVGKFSPASGSVFSLLPPENATGNFTKITQRVPVRIEVPADVALSGKLRPGLSVVVTVDSRTGPKG >NZ_CP043489.1|WP_149252319.1|1838998_1840579_+|DHA2-family-efflux-MFS-transporter-permease-subunit MATATATAIPAPAEEAIDKRKLIAFLAMVFGMFMAILDIQVVSASLPQIQAGLGASGDEIPWVQTAYLVAEVVMIPLSGFLSRAFSTRWTFAVSCAGFTVMSFMCGTATNINEMIIYRALQGFIGGGMIPTVFAAAFTIFPRSKQAIVSPMIGLVATLAPTIGPTVGGILTDAISWHWLFFINVVPGVIVTLMTFSMVDFDEPDLSLLSNFDWTGLISMAVFLGGMEYALEEGPGHDWFAETPVLVMSVLAAIGALVFFARVLLARQPIVDLYAFKDGNFATGSLLSFVLGVGLYGLTYLFPVYLSGVRGYDSRMIGETMFVTGLCMFFTAPIAGNLTRFVDPRLMIAGGFIGFAAGTWIMTGITHDWDFYEILLPQILRGVSLMICMVPISNIALGTLPPARIKNASGLFNLMRNLGGAVGLAIINTSLNKRQDLHLSRLGEAVNWSRDNVLQTYDNMKAGFAAFGAAADQMTVARLVSLMRREALVMAFSDVFLLLTLLFGLLSLSVFMLKKPQMAGGGGGGGH >NZ_CP043489.1|WP_149255371.1|1840715_1841957_+|glutamate-5-semialdehyde-dehydrogenase MRKVGQAARSAARTLALAPAPIKNAALEAMAKAILANEAVILAANALDVADALARGQIASYVDRLTLDEKRVAGIAAAIREVAAQPDPVGRVLASWTRPNGLEFERVSTPLGVVGVIFESRPNVLADAGALCLKAGNASILRGGSESFRTCSEIAKALRAGLQAAGLPEAAIQMVPTPDRSAVGAMLAGLDGNLDVLVPRGGKNLVSRVQAEARVPVFAHLEGVNHTYVHAGASLDMAVAVVLNAKMRRTGVCGATETLLVDQAVAPVFLKPLVKALLEAGCEVRGDSRTLSVDPHVKPADDTDWATEYLDAIISVKVVSGLDAALAHIERYGSHHTDAIVTDDEAAAARFLAEVDSAIVLHNASTQFADGGEFGFGGEIGIATGRMHARGPVGAEQLCSFKYRVRGQGTVRP >NZ_CP043489.1|WP_149252320.1|1842091_1842613_+|transcriptional-repressor MAHRHAHDHEPAPVFAEPGHDHSHCSSSVLARAESLSAERGVRLTQIRRQVLEALAATHQPIGAYELIERLEDGEGKRPAPITVYRALDFLLEQGFAHRIESRNAFIACAHDHKDGSVVMFLICESCGTVGEAESDTVGKALATAAGAIGFTPRGQVIELAGICRHCREKAQA >NZ_CP043489.1|WP_149252321.1|1842765_1843689_+|MerR-family-transcriptional-regulator MSEKTYTIGELSKLSGIAVRRIRFYSDKGLLPPAARAESGYRVYSEADRARLDLILALRDAGVKLGDIARLIARRLGLADVLALRLDAIEAEISAKRRIAAALRATLRLADPTPQDLRRLWTVTALSKTQFRTAIEAFYAEAGSDARMDPAWRDKMIAAATPDLPDDPTTAQLDAWTELMGMLTDKSYRDEMQVSMRELWHDGFDPAAYRQASDQTFAQVRAAMAKGIAPDSDTGRAIAEAWLESSARAMKKEPDAAFLDWQLEQYRKHHARSRRYLELMAILRGDPPGQLAASEWGWIVEALSSRL >NZ_CP043489.1|WP_149252322.1|1843702_1844608_-|LysR-family-transcriptional-regulator MKDIHQLKSGDLFALTVFLSVAAHRSFRAAGIELNVTPSAVSHSVKSLEQRLDVRLFNRTTRSVSLTDAGEQLAAKLRPAVSSIAEALQVVDDYRETPSGTVRINSSEGAIRLVLLPVLARFARDYPQVHLDIVSDGRLSDVVADGFDAGIRLAEAVPQDMIAVRLTETARFAAVGSPGYFAARGRPAVPQDLHRHACIRFRFDSGAIYRWEFERHGMTETINVTGPLTLTDQPLMVEAAIQGIGIAFVPDHLVVGALADGRLERVLDDWCPAFPGLCLYYPGHRHVSAGLRALIAAIRAG >NZ_CP043489.1|WP_149252323.1|1844697_1845474_+|SDR-family-oxidoreductase MSKKILITGASSGFGRGAAIELARQGHQVVATAESWPLVRSLRADAAAAGVKLEAIKLNLLDDIDIAHAYSYDPDILVLNAGVMESGSVIDIPMQRVRESFEINVFGHIRLVQGIVPKMVARKAGKVVWTSSMGGILVIPFVGVYCATKHAIEAIAGSMRAELAPYGVKVATVNPGVFGTGFNDTGAESHTQWYDAGSAVVPMPDFAGSLADQNDPQEMIDAMVEIIPAEEHLYRTMRPLDTIKAARQWQETEWSQNA >NZ_CP043489.1|WP_149252324.1|1845466_1845874_+|heme-binding-protein MPEITLEDAHGVVARARAAAEKAGMKAVFAVLDKGANLVTFSRMDGAWLASNELAIAKARTSVMFQAPTVALSAPLKIGEPLLHFDHIHHGGLLLVGGGEPLFDVEGALIGGLGVSGGSPEQDAAIARSAVQQQS >NZ_CP043489.1|WP_149252325.1|1845986_1847087_-|FUSC-family-protein MMIGAGATARLFKWFAAKRMELALAVRVTVAAGLTFVAVKLVDLSQSSWAVITSIIVMQASLGGSVKAAMDRMAGTLLGALWGAVVSVVLPHHEGNIALGLAVLTAVAPMAVASALRPSFRVAPITALIVLIPAGGTLLPPYAYAAERVAEIALGIIVGVGVALFVLPARAQGALAAAAARVADLNAELLLALTGSLLDGKGRPELAGINKRIRAGLRQIDAAVEETVRERSTHLSHAIDPEPMARTLYRVRHDLVIIARVCVRALPERVAPTLTEPLEAMRDATVGLLRGIAEALRRGYLGPDAAGFDTSLSAYVAAMDKLRGAGVLRELQSEEVGRLYALRFGFEQLGQDIKDLVERSSDLARE >NZ_CP043489.1|WP_149252326.1|1847233_1848487_+|flavodoxin-dependent-(E)-4-hydroxy-3-methylbut-2-enyl-diphosphate-synthase MIPSYARFGLMSRRPTVPVDVGGVLVGGGAPIVVQSMTNTDTADIDGTVRQVAALARAGSEMVRITVDRDEAAAAVPRIKERLMRIGVTTPIIGDFHYIGHKLLADHPACAKALDKYRINPGNVGFKDKRDRQFGAIIEMAMRHDKPVRIGANWGSLDQELLTHLMDENARSEAPVDARTVTWEALVQSALLSADRAVEMGLPKSRIIISAKVSAVQDLIAVYNEMASRSDYALHLGLTEAGMGSKGIVASAAALSPLLQAGIGDTIRVSLTPEPGGDRTQEVKVSQEILQVMGIRTFVPLVAACPGCGRTTSTVFQELAQSIQGFIIDSMPEWKTRYPGVEALKVAVMGCIVNGPGESKHADIGISLPGTGETPTAPVFIDGQKAATLRGPGIADEFKQMVIDYIENRYGVRPAAE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP043489_2 | 2815237-2815313 | Orphan |
NA
Consensus repeat of NZ_CP043489_2
|
1 spacers
spacers of NZ_CP043489_2
>2.1|2815260|31|NZ_CP043489|CRISPRCasFinder ATCTTCCCTGTCCTCACATTCGGCGTTTCCG |
CRISPR arrays and Neighbor proteins around NZ_CP043489_2
The CRISPR arrays of NZ_CP043489_2 >merge|NZ_CP043489|2|2815237-2815313|CRISPRCasFinder GAGGGTCGATCTGGAGATCGACCATCTTCCCTGTCCTCACATTCGGCGTTTCCGAGAGGTCGATCTGGAGATCGACC >NZ_CP043489|2|2|2815237-2815313|CRISPRCasFinder GAGGGTCGATCTGGAGATCGACC ATCTTCCCTGTCCTCACATTCGGCGTTTCCG AGAGGTCGATCTGGAGATCGACC
>NZ_CP043489.1|WP_149253110.1|2814352_2815162_-|ammonia-dependent-NAD(+)-synthetase MPQPLTPDIIVPEAFDPAAEAERRIAFLADYLKASGGRAYVLGISGGVDSLTAGLLAQAAVERLRTEGYQAQFLAMRLPYGVQADEKDAQHSLATIKADRVVTVNIKPAADAMMAELTREAGDLIEGARADFHHGNIKARQRMIAQFGLAGAVRGIVIGTDHAAEAVMGFYTKFGDGAADILPLAGLNKRRVRALAAHLGAPRELVFKVPTADLETNVPLKPDEDAYGVTYDEIDDFLEGKAIAEASHQRILSTYRASAHKRALPVAAG >NZ_CP043489.1|WP_149253109.1|2813530_2814199_-|FCD-domain-containing-protein MNAPKDDTIAIRIAKVLADRIISGAIEPGARLRQDHVAEEFATSHVPVREAFRRLEAQGLAVSEPRRGVRVAAFDLAEVKEVAQMRAALEVLALRQAAPHLTATILDEAEEATKAGDRSPDVRSWEEANRRFHRLILAPCRMPRLLAAIDDLHAASARFLFAAWRSEWETPTDQDHRAILSALRSGKVDDAAGTLARHVQRVGLKPVRSASGATREAFAIIG >NZ_CP043489.1|WP_149253108.1|2812776_2813367_-|biotin-transporter-BioY MSHSATPSLTPAFSPLDISRRSLGWQAAAVVAGTAVLALASHIQVPMFPVPMTMQTLAVTLIGALYGWRLGAITVLAWLAEAWMGLPVTATGSIGTLLFVGPTAGYLISFPLVAALCGLLAERGWNGNRPVLAFVNMTLGNALCLAIGGAWLGAQIGLEKAFLLGVAPFLLGGLLKSVIGAVTLMALVRGKAGAAQ >NZ_CP043489.1|WP_149253107.1|2812354_2812780_-|DUF1284-domain-containing-protein MTVRLRAHHLLCMLTYVGKGYSPGFVANYDAIAARLSQGEDIMIVAGPDDICAPLLQQTDSHCHEPRIDQRDELAIRDVGALMRLPIRTGTRIALTPTLLARFRGAFAANLTRSACVGCEWSGLCSAVARDRYQESRVRRD >NZ_CP043489.1|WP_149255428.1|2811139_2812300_-|mandelate-racemase MRILDIRERSLPISRYTGPAGAGGLTTSVVALTTDRMKAGRPIVGYGYASIGRYAQGGLIRERFAPRLLQAPAETLIGNDGILDPFKAWQAMMTGEKPGGHGERCVAIGALDMALWDIAAKAAELPLHAYLADRLGGRAGQTDRVRVYASGGYHHPRNDLAWLADEMRRFADLGFINAKMKIGSASLDQDLHRIEVAAAALGEPGRLAVDAMNAYDRPAGLAAAATLSPLGLWWFEDICDPLDFETQSALASAYAPPIGAGEALFSAAEAGLLARHGDLRPDRDVLLFDPVHCYGLPGYLQIIEVMTAKGWARQAFWPHGGHLFALHLAAALGLGGAEVTPIAFQPFCGLADDSVIADGFATLPQAEGIGFETNTALKRLFAELAG >NZ_CP043489.1|WP_149253106.1|2809844_2811005_-|coproporphyrinogen-III-oxidase MIPNSPEPGFGVYVHWPFCLAKCPYCDFNSHVRTGGVDQAGYVEAYLKEIAHMAAIAPGRQVQSIFFGGGTPSLMEPKTVGAILDAIGGAWTIAPDAEISLEANPTSVDATRFAGYRAAGVNRVSLGVQAMNDADLKRLGRMHSVAEAMAAVEIAAKSFERYSFDLIYARPDQRPGDWQTELDEAIDRAAEHLSLYQLTIEPDTMYERLVAAGKLIPMPDEDARVLFDITRETCERRGLPAYEISNHARPGAECRHNLVYWRYGEYAGIGPGAHGRLIDAAGVRRALSTERSPEKWLAGVRTNGHGLVDDQALTADQQGDEMLLMGLRLAEGIDLARLARLRGRPMAEATLAGLQQNGMIERRDDRVRVTRAGFPLLDAVVAELAA >NZ_CP043489.1|WP_149253105.1|2809129_2809900_+|hypothetical-protein MPWRSGWRVLAMGLASGLLFGSAQIAAAQTSAADDGFADLPLEISAAPAGHGALVVFLSGDTGWGGLERSLVRRLARAGVGVIGLDARRYFFTKRSPAELARDIERVLAVYRRRWHAGRIVLAGYSFGADALPFAWPLLSTKTRQDTRLIALIGLLPEANFRISLLEMLDLPASDDTPVAPMLRHLPVGKVVCLYGREEHSACTLPELAGAERIARPGGHDRDGDAGAVVQAILRHLALRPPAPPPPHRAGESPRG >NZ_CP043489.1|WP_149253104.1|2808717_2809125_+|hypothetical-protein MSAARKIHTLTFHLLGIAGEDAMDDIIDEIEEFADGVDWPIEAPDAFKHKVSETAPDMGCAVELPVRASLAGETIAGEKADFAGAMALIEHLRQLSARHRFDVEIAFDKEIVGTIDKGAYSEALRQGLVEPWQAG >NZ_CP043489.1|WP_149253103.1|2808255_2808624_-|GFA-family-protein MALKGSCHCGGTTFELSEAPQEVTRCTCSFCSKRGSLWAYYRPEQFKLTSPPEQVATYRWGSKTIQHHFCATCGCGTYTQTPDWSTGEPDFDNPKISINARLFDDFDLEAVPVTVIDGRNLW >NZ_CP043489.1|WP_149253102.1|2807939_2808248_+|antibiotic-biosynthesis-monooxygenase MSVTYLIGFVVNPGQRERFLGMLNTLLDTMRHEATFVNATLHADPNDPCRFLLHETWVDHQDVLDVQLSRPYRQAWHEALPELLAQPREVSIWQPLRADRKD >NZ_CP043489.1|WP_149253111.1|2815392_2818122_+|DNA-mismatch-repair-protein-MutS MTSTPRKLDTADASAPVTPMMAQYLEIKAGYPGALLFYRMGDFYELFFEDAEIASKTLGIVLTKRGKHQGEDIAMCGVPVVRADEYLQRLIAHGHRVAVCEQMEDPAEAKKRGSKSVVKRGVVRLVTPGTITEETLLEPTRTNLLLALARLRISDDEARYGLAFADISTGEFGLSECDEAGLPAELARLDPSEVVMAEAVHEDAELASLWRECRAAVTPVGRDVFDGSQAERRLAAFYDVGTIDGFGTFSRAELVAASGLVTYILRTQVGQKPALAPPRRDGASAHMAVDAATRANLELTRTLGGERAGSLFDAVDRSVTAAGGRLLAQWLASPLLQPEAIALRQDAVAFFAEQALLRGQVRARLKSAPDIARSMARLALDRAGPRDLAALREGLVGIAGIETLLRQREAELPSMLAGILSALARPDQGLAARLAAALQDDLPLLKRDGGFVRAGYDQALDETRALRDESRRVIAGLQARYAEESEVRQLKIKHNNMIGYFVEVPQQAGEAFLQPGLRETYVHRQTMAGAMRFSTAELSTLESRIASAAERSLASELQIFSDLAAALLGDSVAIRAATEALALLDVIVALAVLADEENYVRPVVDTSLAFAIEGGRHPVVERALKREGKPFVANDCELSGEGQGKQAAGRIWLLTGPNMAGKSTFLRQNALIAVLAQIGSFVPAKSAHIGAVDALFSRVGAADDLARGRSTFMVEMVETAAILNQAGPRALVILDEIGRGTATFDGLSIAWAAIENLSAVNHCRALFATHYHELTQLTKKLPRLANATMRVTEWHGDVVFLHEVVPGAADRSYGIQVAKLAGMPAAVVERARAVLAQLEAGDRQAPAARLVDDLPLFAAAPRASAQVAVTARDEVAEALDGLDPNDMTPRQALDALFSLKAKRDSAKKG >NZ_CP043489.1|WP_149253112.1|2818333_2821105_+|[protein-PII]-uridylyltransferase MLDKPDRELHLVLDRMALEAEIDALALEHEGHADALRQAVVALLKATLRDGRETIRTWFSEDRLGTACAQRLSWLEDEIIRASYAYVTRYVYTTHNPTAGERMAVIAVGGYGRGTLAPGSDIDLLFLLPSKQTAWGESVTEAILYVLWDLGQKVGHATRTIDECLRLARGDMTIRTALLEARPILGDMGLASELAARFDRELVQLTAAEFVAAKLAERDERLIKAGNSRYRVEPNVKEGKGGLRDLNTLYWIAKYVYRVRDAADLVEAGLFTRREYRQFTLAEDFLWATRCALHFLTGRAEERLTFDIQREIAAFLGYSDRGGLRGVERFMKHYFLVAKDVGDLTAIVCAALEARQEKPKAMLDRFIAPFRRSQRQALIGTKDFVIETGRLNVANDQVFARDPVNLIRLFHLADLHSLALHPDAMRLVTRSLKLVNASLRENGEANRLFLEILTSRNAPETVLRRMNEAGVLGRFIPDFGKVVAMMQFNMYHRYTVDEHLLRSVGELADIDRGEGGDEHPLVNEIMPTIQNRTALYVATFLHDIAKGRPEDHSLAGAKIAKRLGPRLGLTPGQTDTVSWLVEQHLVMSMTAQSRDISDRKTIETFAGTVQTLERLKLLLILTVSDIRAVGPGVWNGWKGQLLRSLFWETELVLAGGHSNVDRRASVQLAQDELRAGLSDWSAEEIDAYTARLYAPYWLKVDLPRRLRHARFVRAVRERGETLGTEVATDAFRGVTELTILAPDHPRLLSIITGACAASGANIVDAQINTTTDGLALDTIFVSREFPEDEDELRRAGRIAQAMEQALTGTIRLPEAVAKRSAIKPRQKAFQVAPEVVVDNEWSNRHTVVEVWGLDRPGLLYDLTTAISRLNLNIASAHIATFGEKAVDVFYVTDLTGAKITSVQRQDSIRSNLLAVFRGEGKSG >NZ_CP043489.1|WP_149253113.1|2821135_2822467_-|hypothetical-protein MSGKTSHHPPISFANITDISSQHLDLLRSSSQRNDDSPEAIAAFRDRIADFRTGVAAAGARLEKEGDRETAQGIIDYWSTQLLAWSTNGARPDLDLPLASYLDLPGAALPAGATAARRPTGEAVAQDGRAQVRIGSLAYQWRRSNRAPGYLLTGNALVEAAGYRGKDPEIEAFVSASEAAERRTRHIRAGFIVTLVLSIAFAVLALISFFRENEARKEADQLSADNDKRAQRFLVADVRLQTERFQHAAELKELNDQLLAMKSALEEAQAKLKVALAQTPSPTLNPTQHMYLRDSNAVLTQAIDQSRAHKPSLPPLTAGEQLLANVNLIDGPDGDVRRATTENLVRAVRDGTVSPDDQRMLVGALVNMLARPAVQSLTLTGRYNVLYILSIITSAQWTLPAWSALRDRARIVTADLVGPNAQNDLPMGADSQKFYQQLVQRLQ >NZ_CP043489.1|WP_149253114.1|2822463_2823807_-|hypothetical-protein MGLVTLPPQSVVSISNAIGDLCTFRELGDLMQICYGTSVAITGVSFAEQPRRKVARDCVDWAQKCGILTNFVAIVLHAKNDNTAFRELVTQLIPDALTAPPSVASQVGTVVSGLDSLAAYLAKDEVRKKAGISKQRLVEIGQRINLLAAYKGLHDSLHHIQINHTRILLQAVATMDNPISYETVQVYISQVRAAVIAIRTEIAKPGANEILGGLDLSWVEDLASSSQRCQEGLDADKPGPVLIAIRQIASISDGQSVQLNKGIFDAATKLPLPELAGALVEIGAADAGLLAALDEAIVALGALQRTLMSRIATHNRLQKVEQNLSILASCLDDPGVLIVDEIAALWPETRVMLLNLAALGLERADMTVDGEDCGKIDACLQDVEIAQRDTAFNLKSAPAIRALQRAFSAFQKNASAQFFALDSRLKADFDSVFQISDRLRNILGAVQ >NZ_CP043489.1|WP_149253115.1|2823823_2824942_-|trypsin-like-peptidase-domain-containing-protein MDKLTGAQVSDLAKTLSTTVNLDDLSNFVYVATGDQLEVYWTDVRQPLVSVLRELVIRLEQEGQTGNFLKTVYVNRPLRDDVRQLIARLAPEAAAEILSNPYDLVMLDKDNRGTQPSDATLGPGLQRNIKPHLRMLDPALWIAGMTQTLRRVCRIEIAGSPAGTGFLVGPQAVLTNWHVVEAAAGQNDLPTVCCRFDYARKADGGFNEGEAVALSGTALLHHRPYAPAEMTEAPDEPPPVATELDFALLQLAETAGTERGWFALPERDGALSQGSPLIIVQHPHGGPVKLAIDTEAILPTPAPPGRPRLRYATNTDAGSSGSPCLNLEWQLLALHHFGDPAWGEPKFNQGVPAGLIRADIEAAGFGAAIPAA >NZ_CP043489.1|WP_149253116.1|2825249_2826275_-|ribose-ABC-transporter-permease MTSTNPSTADTARRKFALSGTLRGLGMLPALVLIAILFQLLSGYVESGGLSWASGRFMSWNNLSIVAQQASINTVLAAGMTFVILTGGIDLSVGSVLAASAMIALIVSLIPGWGMMGLVAALVTGGLLGLINGALIAFMRLPPFIVTLGSMTAVRGLARLFGEDKTIFNPSLPFAFIGNGTLFGVPWLMVIALATVVVSWLILRRTVLGLRIYAIGGNAEASRLSGIKVWSILLIVYGISGLLAGLGGAMSAAKLYAANGLQLGQSYELDAIAAVILGGTSFVGGVGSIWGTLIGALIIAVLSNGLILIGVSDIWQFIIKGLVIIGAVALDRLRSSSSART >NZ_CP043489.1|WP_149253117.1|2826357_2827896_-|ATP-binding-cassette-domain-containing-protein MNPGQTHPFLEMRNVSKTFGRVQALKNVSLDVKLGEIHALMGENGAGKSTLMKILSGAYTPDEGSEILIDGQKVAISGPMAAKQLGIAIIYQELALAPNLTVAENIYLGREPSRAGLIDRGAMIAGVESVLQRLGATFTARDKVAELSIAERQLVEIARAVHARSRVLIMDEPTTTLSERETERLFALVRQLKQEGLAIVYISHRMKEVYELSDRVSVLRDGTYVGTLDREAITPAAVVRMMVGRDLSSFYKKEHDAHQSRGRIIFSVRDIADGRRIQPCSFDLHEGEVLGIAGLVGAGRTELARLVYGADARTSGTVAVDGKEVSIRSPQDAIEAGIAYLTEDRKLLGLMLDMSVAENINLGVIARDALAGGFLNLAKGRKRTAEAIQATGIRTASPDAPVGGLSGGNQQKVLLSRLLETKPRVLILDEPTRGVDIGAKSEIYRLIDRLAREGVGVAVISSELPEIVGICDRVIVMREGHIAGELGGGPDAEAVSQENIMAIATSAAKDAA >NZ_CP043489.1|WP_149253118.1|2827917_2829507_-|mannitol-dehydrogenase-family-protein MSATTRDGMPSTTTDVAVWSKMASSCPRRPFQLLKVQAMLPLNTSTLASFGPTVTRPTYDRSRLRAGIVHFGVGNFHRVHQAIAIEACLHHPGQEEWAICGVGLTDGPAARAKAEAYRRQDNLYTVTQLTSPAPRDTQIVGAMIDYLHAPADPEAVLARLADPATRIVSLTITEGGYNIDETTGAFRLDTPDIRHDLEGGPPRTVFGYIVAALARRRQAGLPPFTVMSCDNLPRNGDTSRLAVLGFARALDPGLADWIEANGAFPNSMVDRIAPQVPEDERRRITAGIGVEDLVAATCEPYTSWVVEDRFCAGRPELERAGVVFSSEVPAYVAVKGRLSNAAHMLMCYPSLLMGARLVDEGMRHPDIPRLLHAFWERDARRLVEPPAGYSTRAFTDTVIERFANPAIKDQLLRVAGDGASKIVVFHGKTIGQLIAGGSDLAREAFLLACFARYLGGVDDRSIAFDIFEPRIGEADWQRLQSGDPLAVLDIEAFAGLGLRQSPAFVAAYQIQSKSLASQGTAATLAQLLK >NZ_CP043489.1|WP_149253119.1|2829463_2830423_-|substrate-binding-domain-containing-protein MKALKCLAGIAMAGAALVTGLAAPALAKDVKTVGISVGSLGNPGFVIIANTATRIIKKAYPQAQVTTVGYDYDLGKQVNQIDNFIAAGADFILLNPGDPKAITPAIKKAQAAGIPVIAFDTGADGADAIVMTDNIMAGSVSCQYIADKLKGAGNVVIQNGPQVSSVIDRVVGCKQVLAKYPDIKILSDDQDGKGSRDGGMAVAQGYLTRFPKIDAIFTINDPQAIGTALAAKQAGRSEFFITSVDGSPDIEAALKDPALDMIKASASQDFYAIPKVSAQTAMDLVNGKKPEKPVILIPSALVTRENVGDYKGWNAKHDD >NZ_CP043489.1|WP_149253120.1|2830521_2831616_-|substrate-binding-domain-containing-protein MTQNKKTNEESPRSGSSRPARLLDVARLAKVSRATAARALGGYGLVTEETRERVAAAARTLNYRLNEAARAMRAGRTQVIGVVLADISNSFFASAARAIIDTCASLGYQTLIVNTDDDLKTEIEAVQTLMEKRVAGMIVVPSSPDHNEHLQKAGAGEGRMVLLDRRIADIPVSAVTTDDRGGAREAVELFIARGHRRIGLVVLTAAAASQRQSEPRGAVSSARDRVLGAREALEAADLDLPKAWLRYTPNNPQTIIDAASTILRSHPRPTAILATCEEIAIGVLAACRDLNLVVGRDVALISFDESPWSGALTPAISVVQRPIHEMGRAAVNLLVRQIQGGEARRDIEMPTILIDRESVFDLTP |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
80105 : 90055
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP043489|80105:90055|DBSCAN-SWA ATCAGGTGCCGGAAAGATAGCGGGTCGGATCGACCGGTGCGGCACCCTTGCGTACTTCGAACAACACTTGCGGCGAGGTCACGTTGCCGGTCTGGCCTGCAGTCGCAATGGTCTGGCCTCGCCTGACGGTTTCGCCTTTCTTGACGTCGAGCTCCTTGGCATGGCCGTAAGCCGTGACATAACCGTTGGCGTGGCGGATCAGGATCAGGTTGCCATAGCCCTTCAATTCGCTGCCGGCATAGGCGACGACACCGTCTTCGGCGGCCTTCACTTCGGTTCCTTCCGGCACCGAGATGTTGATACCGTCGCTGTTGCGGCCGAAATTCTGGATGATGCGCCCCTTCACCGGCCAGCGGAATTCCGGGCTGCCCGGGGCGGCGACGTTCGCATCCTTCGGCAATTCAGCAGCGGGAGCAGCCGTTTGCGGCGCGCTCGCAGCAGGCGAAATCGTTCCGGTTGCGACCGGTGCGGGCTGACTCGTCTCGACAGGCTTGATCGGCTTGGCAATCGCCACCTGGGCGGGCTTGTTGGCCGGCACCTTCTGCTTGGCGGCTGCAAGCTTTTCCGCCTTGAGCTTGTTTGCCTTGGCCGCATCGACCGGCTTGGCCGCCGCAACCTGGGAGGGCTTGGCCTTGGTGTCGACCGGGACGACCTTGGCAGTCTGAACGGGCTTGACCGTTACCGGCTTGACGGTCGCGGTCCTGACAGCCTGCCCATGGGGAGCGGCAACCTGCATCGCCTCGGCAGCAATCGACTTGGCCTGGGTCTTGGCGGTCTTCACCGGAGCCGGGTTGGCTGCCCGCGCGGCCTGCACCGGCTTGGGAGCCGCCACGGGCCTGGGCTGGACGGCTGCCGCCACCGGCACGTCACGCGGCGGTCGGTTATCGACGACAGCCTCGGCGGCTGGAGCCGCCGCAGCAGTGGCATGAGCCGTTCCGCGCACCACCAGCTTCTGGCCGATCTTGACGCTCATCCATTCGTCGATGCCATTCTCCTTGGCCAGACGCGGACGCGTCGTGCCATAGGCCTTGGCGATCGACGTCAGCGTCTCGCCCGGCTTGACGACATGAACACCGGAGGCGACGGCGACAGGCTTCGGCTTGGCTCCAGCAGCTGGCGTGGTGGCGACGGGGCGAACCTTGACTTCACCGACGCTGCCCCGTGACGTCTCGGGCACATCGCCCGGATAAGGCTGTTCTCCCGTCGCGGCCACCGTGGTCACCGCGGCATTGCCGTTCTGGGCCGCGTTGTAGACAGGAATGATCACCTGCTGGCCGGGCGTCAGCGAACGCGAGGTGAGGCCGTTGACCTTCATGATCGCATCGGAGGGCACACCGTAGCGATTGGAGAGCGTATTGATGGTCTCTCCCGCGGCAGCGGTAACCGGGGTTCCGCCCGTGGCAGTCCAGCCACCGGCCTTGAAGCCGCCGATCTGCGGCGCAGGCGTCGAGATCGCACCGGTCACGATCGGTTGCTCGCCGCGAGCCAAGCGGTATTGCGTGGAAGGCGCGCTGCGTTGCGCCACCGCGCCGACCGGCGGCAGCGGCTGCGAGGCAACCACCCCAGCGGCCGGCACCGTGTTGACGGCGTAGTTGGAATTGTCATAGCGCGGCCTGGCTTTGGCAGGCTGCGCCTGGGCATAGGCCAACTGGTTGTCACCCGAGTTGAACGGGTTGGAGAAAGGGTTCTGGGCGAACCTCGATGTATCGCTGCAGCTTGCTGCAAGGCCAGCCACCAGGGCGATCACGGCAACGCGCGACAAGTGGCGCGAACGGATCAATTCGATCGGAAAACGCATAGGACGCTCACACCCACGCTTGAGAAGAACACGGGACAATTAAGCCTATATTGGGGTAAACAACGCCTTAAGAACCGGACTCAATTTGGACAAAATCCGGAAACGGCACTGACGGCGGCAGGCAGGGGAGCTGAAGCCCACTCACCGTCATCCCTTTGAATTCACTGTGTTTTCCAGTCGATCGCCCTAGCTTCCAACCGGTTCTCGGCTGGTCGGGGACGATTAAGGGCCGGATCGAACGCGCCCCATCCGTTAAAGTCGCGCCGCGCCGGTATTCGCGGGGAATCAGCACAACCGGCGCGACTCGCAAACCGCGTCAGCGTGTCCATTCCAGGCGCAGATCTTCGGCGCGGCGAATGCCGCCATGGCCCTCGACCTTCAACTTGCCGACAAGCCTGGCCTGGGGAATGCGGCTGACGAGCGCCGCCAGTCCTTCTTCCAGTTCGGCCTTTGCCAGGGCCTCGCCGAGGCAACGATGGGCGCCGCCACCAAAAACATAGTGCCAGCGCGTGTGATCCGTCCGCCTGATGTCGAATCGGTCCGGCTCGCCGAACACATCGGGGTCGCGCAGGGCAGACATGGTGGAAAGCATCAGCAACGATCCCGGTGCCAATTTATGGCCGCCCAGGTCGATCTCCTCCAGCACCAGCCGGGGCAGCGATGCTACCGAGGGTTCGAAGCGCAACGCCTCGGAGACGGCACCGGGAATGAGGCTGGGGTCGGCGCATACCGCCTGCCATTGTTCGGGATGCTCAAGCAGAAGCGCGGTCTGGATCGCCATGGAGGCCCGCGTGGTATCGCTGCCGCCCAGCACCACGATGATGATCTGCATGATCGCCTCAAGCGGCGTCATCTCGGGATTCTCGTCGATGGCAGTGATATAATCGGACAGAAAATCGCCACGTGGGTTCTGGCGGCGATCCGCCACCAGCTCATACATGTAATCCGTCATCTGACGGCAGGCATCCTCGAGTTCGGGAATGTCCCTTTCCGTCCAGGACGTCCCCAGGCTGCGGGCGATGACATAGATCCAATGCGCGAATTTGGGAATGTCGGCCTCGGGCATCCCCAAAATCGCGGCAACGGTATGGGCGGGAATGATCGCCGCATAATCCTCGAACAGATCCAGCCCGCCAGCCTCATAATGCCGGTCGATCAGTTTCTCGGCCATGGCCCGGACCTTGGGGCGAACCTCGGCGACCATCTTGAACGCAAAGGTCCGCGCCATCGGGCCGCGCCGCCTCCGATGGGCGGCCCCGTTCGAAAGCAGCATGCATTTGTCGTAGAAGTCGAATAGCGGCCCTGAAGTGACGCCCCTCAGCTGCAGGAGCTCCGTTTCGATCTGGCGGGTACGCTCGTCGGTCGTGAGACTGACGACGTCCTTGGCACGCAGGATGAAAAGTGGTCCGTCGTCCCGCTGGATGAAGGGTGTATCCGGTCGAAATTTTCGGAACAGGGCGTGGGTATCCTGGTCCAGTTGGGCAATGCTGAATGTAGGTATGTCTGAATTAAATTGTGCTGCCTGCTCGGTCATCTCAAATCCCCCAACCGACGTGCTCCTCTGACTGAACGGCCGGAGACGATCCTGAAATCGGACCTGAGCCGCCAGTTCCGCACCATCGGAAACCGCCGCATACTGCACGGGATGGGGGTGTCCCGCCATCCCTCCGCGATCGGAATTTGATGCGATCCACCAACGCGAATACTTTACCGATGCAAGTGCAACGAGGCGATCGACACTTGGCTGGGGCGGCAAGACATGATTGTCTTGCGATACACGCAATAACGCCCCTGAAAACCGTATCCGAACCGCGTTCGACCGCGGTTCAAAGCCGATCGGCCACGCCCAAGATCATCGACGCAGCACGGATGTCCATCAGCGGCGTATGATGAAAGCCGTCGGCATCGCGCGCGATCATGGTCAATTGCTGTTCTTCGCCCGGCGGGCCGATCGGCGCGATCATCCGACCGCCCAGCGCCAGTTGATCCAGCAAGTCCTCGGGCACCTCGCTCATCGCCGCCGTCGTGACGATACGATCGAATGGCGCCTGCTGGCGCCAGCCGGAGCTGCCATCGGCCACCATCATGGTGACGTTGAGGATTGTGACGGCCCGCAGCCTTTCCTCGGCCTCGGTGACCAGGGTGCGCCACCGATCGAGCGTGAGGACACGCTTGGCCAGGCGGCCGAGCACTGCCGTCAGATAGCCCGAGCCCGTACCGATCTCCAGCACGCGGTGCAGCGGCTCGACATCAAGGCGCTCGGCAATGGTGGCAGCCAGCACCGGCGGCGTGATGGTCTGGCCGCACTGAATGGGAAGGCTGAGATCGCGCCAGGCGAACTCGCCGAATTCCGGTAGCACGAAAACCGGGCGCGGCAAGGTGGAAACGGCGTTCAGCACCTTTTGATCCGTCAGCCCATGACGGCGCAGGCGCAGGATGAAGGCCATCATCTCCCGCGCATGCTCATCGTCAGGATTCACAAAAGACTGATAGTCCGTCATATTGTCCGTCATGATGTCGGTCTTGGCACCGTTTCAAAGGCATCCGAAAGGCGTTGCAAGGCAGGCACGTCGGTCAGATTGAGATGAAGGGGCGTCACGGAAATACGCTTGTTGAAGAGGGCCGCAAGATCCGTCCCCTCGAGCCGGGGCGGCGTTTCACGGCGCGGGGCCAGCCAATAATAGGGACCACCGCGGAAATCCTGCCGCTCGACCACATCGACCAGAGCAGCATCGCGCTGCCCCTGGGTGGTGACCGCAACCCCGGCGACCTCTTCGGGTTCGCAATCGGGAAAATTGACGTTGAACAGGGTTCCCTTGGCGAAACCATGAGCCAGCAACTTGCGGATCACGCCGGCCGCATGCTGTTCGGCGCAATGCCAGCGCATGCCGTCCCTGCCCCCGATGCCATAGGCCTGGCTGAGCGCGATGGATGGAATCCCAAGAACGGTGCCTTCCATCGCGGCCGCCACCGTACCCGAATAAGTAACGTCTTCGGCAATGTTCTGGCCTCGATTGACGCCGGACAGCACCAGATCCGGCTGCTGATCCTTGAGCAGATGCTGCACCCCCATCAGCACGCAGTCTGTCGGCGTTCCGCTGACGGCGTAGCGCTTTTCTTCGATCCGTCGCAGGCGCAGCGGCGTGCTGAGCGACAGCGAATGGGAAACGCCGGACTGATCCGTCTCGGGCGCCACGACGATCACGTCGTCGCTCAGGCTGCGGGCAATCTTTTCGAGGGAAACCAGGCCGGGCGCATGGATGCCGTCATCGTTGGTAATCAGAACTCGCATCCGCCGTTCGTCCTTGTTCGCTGAGTCCATACCAGTGTGCCACAGGAAAAGGAGCTCCGGCGAGAGTTTGTCACACGCAATCCCCTGCCCTTGCAAAAATCCCTGCAGAAACAAACAATTAAATCACCTCCCCTGATTCTGGAATCAAGGGAAGGTGTGACAAGATGTGCTGGGCTGGAGCCTATTGCTTCTCGATCCGACGCAGACCGCCCATATAGGGGACGAGAACGTCCGGAACGGCGATGGAACCGTCCTCGTTCTGATAATTCTCCATCACCGCCACGAGCGCGCGGCCGACCGCGACGCCCGAACCGTTCAGCGTATGCACGAAACGCGGAGCGCCTCCGGATTTCGGACGCGTCCGGGCATTCATGCGGCGCGCCTGGAAGTCGCCACAGACCGAACAGGAGGAGATTTCACGGAACTGGCCCTGCCCTGGCAGCCAGACCTCGATGTCATAGGTCTTCTGCGAGGCAAAGCCCATGTCGCCGGTGCACAGCGTCATCACCCGGTAATGCAGGTCCAGGCCCTTCAACACAGCCTCGGCGCAGGCGAGCATGCGCTCGTGCTCGTCCCGGCTCTGTTCGGGCGCCGTGATCGAGACGAGTTCGACCTTCTGGAACTGGTGCTGGCGGATCATGCCGCGGGTATCGCGCCCGGCAGCGCCGGCTTCCGCCCGGAAGCAATGGGTCAGCGCGGTATAGCGCAACGGCAGTTCTTCCTCGGCGGTAATGGCGTCACGCACGAGATTGGTCAGCGAGACCTCGGCCGTCGGAATGAGCCAGAAATCACCGGGCTGCACGCGGAACTGGTCCTCCGCGAATTTTGGAAGCTGGGCGGTGCCGATCATGGCGTCGTCGCGCACCAGCAGGGGCGCGGCAACCTCGGTATAGCCGTGCTCGCCGGTGTGCTTGTCGATGAAATACTGGCCGATCGCCCGCTCGAGCCTGGCCAAGCCTTTCTGCAGCACCACGAAGCGGGAGCCGGAGAGCTTGGCGGCGATCTCGAAATCCATCAGGCCGAGAGCCTCGCCCAATTCGAAATGCTCTTTGGGGGCAAAAGTATAGGCGCGCGCCACGCCATGGCGGTGATGTTCGACATTGCCGTGTTCGTCGGCGCCGAAGGGGACCTCGTCGAGCGGCAGGTTCGGGATCGAAATAAGCTGGTCGTAGAGAGCCTTGTCGGCGACGCGCAGCGCCTCTTCGTCAGCCGGCATCGAGGCCTTGAGCGTATTGACCTCGGCCATCAGGGCCTGCGCCCTGGCTTCATCCTTGGCCTGCTTGGCTGCACCGATTTCCTTGGAGGCGGCGTTGCGGCGCTCCTGGGACGCCTGCAGCCGTGCAACGAGTGCCCGGCGGCTCTCGTCGAGCTCGAGTATGGTGGCGGCCAGCGGCTCGAGCCCCCTGCGAACCCGGCCCTGGTCGAAGAGACCCGAGTTCTCGCGGATCCACTTCACGTCATACATGAGATCAACTCCAGAACGGACGGCAAAACACCGACGGGCTATTCTGAAGTCTGCGTATCAGTTTCCGCGCGCTTCTTCTCAACCCACCGGACCGAATAGATCGAGAGTTCGTAAAGGAGCACGCCCGGCACGGCAAGCGCGATCATGCTCCACGGGTCGGGCGGGGCCAGCACGGCAGAAATCGCCGTGACGATGACGATGGCGTAGCGGCGCTTTTCCTTCAGGAAGGTCGAATCCACAACACCGATCCGCCCCAACAGGGTCAGCACCACCGGCAACTGGAAGACGACGCCGAAGGCAAAAATCAGGTTCATCAGCAGCGATAGATAGCCTTCCACCGTCGCCAGCAATTCGATCGTGGCAATGCCGCTGCCACCGAGTTGCTGCATGCCCAGCGCAAAATGCATCACTGCCGGCATTGCGATGAAGTAGACGAGCAACCCGCCCAGGAAAAAGAACAGAGGCGTGGCGATCAGATAGGGCACGAAAATGTTTCGCTCATGCCGGTAGAGACCGGGGGCGATGAAGCGATAGATCTGGATGGCAAAGAAGGGGAAGCCGATAAAGGCGGCTGTGAAAAAAGCCACCCGGATCTGCGTAAAAAGAAAGTCGAGAGGGCCGTTGTAGATCAGCTTGACCGGATTGCCGGCTGCCGCCCACTCGTAGGGCCAGAGCAAGAAGAGATAGATCGATTTGGAAATTGCAAAGGCACCGATGAACAGGACGATGAAGGCAATGCCCGATTTCATCAGGCGCGTGCGCAATTCGATCAAGTGATCGAGCAGCGGCGCCTTGGAGGCGTCGATTTCGTCTTCGGTCATCTTAGGATTTGAACGCTGCGAGAGGGGCGAAGGCGGAAGGGATTTCGACGCCGACGAGTTTGGCGGCCAAAGGCGAAGGACCCGGGCCGGCGATGGAGGGCTCGCTCGCCGTATCCGTCCCGGTCACGCTCTCCAGAACAGGCGCGATCGCAGCCGGGCTGAGCGGCGGCTCCGAATCGAACACGGAAGACTTCAGGGCATTGCGGGCGAGGCCGGCGGGACTGCTGATGCTGCTCAAGGTCGCAGCCGATTGGCGCAGCGTATCGAACTCTTTTTTGACGTCCTCGAGATTGGCTTCCTTGAGGGCTTCGTTGAACTGGCCCTGAAACTCGCCAGCCATGCGGCGCAGCTTGTTCACGCCCCGGCCGAGGGAGCGCATCGTGCTGGGCAGCTCCCTGGGGCCAATGACGACGACTGCCACCGCTCCCACGATCAGGATATGGCTCCAGCTGATATCAAACATGCAACGCTCGGTCTGGCGAAACGAAACGCTCCGCGTATACACGCGGAGCCTAAAAAGACCCAGTCTTCAGGTTAAGGCCACCGGCGCAAACCGCGCCGGCTCAGCCTGCCTTGGTCTGCTCGGCCGGGTTGACGGTTGTGCCAGCCTGCGTGTCGATCGTCTTCACAGGCTGCTGGGCAGCTGCCGGTTCGTCATCGGCCATGCCCTTCTTGAAGGATTTGATGCCCTTGGCAACGTCACCCATCAAGTCGGAAATCTTGCCGCGTCCACCAAACAGGACGAGTACGACGACAGCTAGAATGATCCAGTGTGTAATACCCCATGTACCCATGATACCCTCCAACGCGGCAAAGCCGATTGCAACAACACAACAACGGCGCCGAACGCGACCCGATGATCCCATTGGAATCTAGGCGGGCTTGACGGCAAAAACAAGAACGTTCGCCGTTCTTACTCAACCTTCATGTTCGGGTTCCGGTTCCGGAAATGCGAAGGAAAGATCGGTGGTCTCCAGGGGATCCTCATCCGGCGTCAGGGCGCTGTCATCGTTGGGCATGGGCACGCTGAAGCCGGGGGGCAGGCGGCCATCGAGCAGGCCCGCCCCCTTCAGTTCGTCGAGCCCTGGCAGGTCGCCGACGCCGTCCAGGCCGAAATGCTCCAGAAATGCGGGCGTGGTGCCATAAGTCACCGGTCGTCCCGGCGCGCGCCGCCGCCCGCGCAGGCGAACCCAGCCGGTTTCGAGAAGCACATCCAGCGTGCCCTTGGAGATCGAGACCGAGCGGATGTCCTCGATCTCGGCCCGGGTCACCGGCTGGTGATAAGCGATGATTGCCAGGGTCTCCATCGCCGCCCGCGACAATTTGCGCGTCTCATAGGCATCGCGTGACAGGATCCAGGCCAGGTCGTCGGCCGTGCGGAAAGCCCAGCGTCCGCCCAGCCGGACCAGATTGACGCCGCGGGTGGCATAGTCCGCCTGCAGGGCGCCGATGGCGGCGCCGATATCGGCCCCTTCCGGCAGTCGCGTCGCCAGCGTCGCCTCGTCCAGTGGTTCCGTCGCCGCGAACAGCAAGGCTTCGATCAAGCGCATATGCTGCTGCAGATCGGTCTGCGCCGTGCTCTCCGGCTGGAACGATACGGTTTCGCCCAAGGCCTAATCCTTCCCTATGCCGGCCGCACCACAAAAGGGAACCCGGACCTGCATGGCATGCCAATAACCAGTCCGCTCGCGATGGAGCGGTCGTACGCTCGCCCTGCCCTCAGTCCGGCAAGCGGTTGAAATCCCGCGGATTCCTCCTGATCGAGCCATGGGCTCGATCAGGAGGAATCCGCTTGGCCGGCATCCCCGCGCGATCGCAGATAGATCGGCGAGAAGGGCGTGGCCTGGCGTAAATCCAGCTTGCCTTCCTTCACGAGTTCGAGGCTCGCCGAAAAGCTAGAGGCCCGAACCGTGGCCCGTGTCGAGGGCTCGGCCAGATAGTCCACCAGAAATTCGTCGAGACTGAGCCAGTCCGCCGCCGGCCCGATCAGCGCTTCCAGCAATTCGCGCGCTTCGGCCAGCGTCCAGACATTGCGCTCGGGCAATTTGACATAGCCGACCATGGCCCGCTGGCGCTGCTGGGCATAGGCGGAGAGCAGATCGTGCAGCGTCGCCTCGAATCGCCCGTGATTGTTGAGCCTGATGCCTTCGGGCATGCCCCGCTTGAACACGTCCCGGCCGAGGACGGCGCGCTCCTGCAACTGGGCGGCGGCATTGCGCATGGCTTCCAGGCGCTGCAGGCGAAAGGCAAGGGCGGCCGCCAATTCGGCCGCAGGAGGTTCCTCCCCCCTTGGCGATTCCGGCAGCAGCAGGCGTGACTTGAGATAGGCCAGCCAGGCCGCCATCACCAGATAGTCCGCCGCCAGTTCGAGCCTGAGCCGGCGGGCCTCTTCGACAAAAACGAGATATTGCTCGGCAAGGGCGAGGATGGAGATACGCGTGAGGTCGACCTTCTGGCGCCGCGCCAGTTCGAGCAGCAGATCGAGCGGTCCTTCGAAGCCGTCGACGTCCACGACGAGCGTGGGATCGCTGACCGCCTTATCCGTCCTCACCGCCTCCTCGAATTCATCGCTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP043489|80105:90055|82218_83436_-|WP_149250902.1|DBSCAN-SWA MTEQAAQFNSDIPTFSIAQLDQDTHALFRKFRPDTPFIQRDDGPLFILRAKDVVSLTTDERTRQIETELLQLRGVTSGPLFDFYDKCMLLSNGAAHRRRRGPMARTFAFKMVAEVRPKVRAMAEKLIDRHYEAGGLDLFEDYAAIIPAHTVAAILGMPEADIPKFAHWIYVIARSLGTSWTERDIPELEDACRQMTDYMYELVADRRQNPRGDFLSDYITAIDENPEMTPLEAIMQIIIVVLGGSDTTRASMAIQTALLLEHPEQWQAVCADPSLIPGAVSEALRFEPSVASLPRLVLEEIDLGGHKLAPGSLLMLSTMSALRDPDVFGEPDRFDIRRTDHTRWHYVFGGGAHRCLGEALAKAELEEGLAALVSRIPQARLVGKLKVEGHGGIRRAEDLRLEWTR >NZ_CP043489|80105:90055|88397_89030_-|WP_149255233.1|DBSCAN-SWA MRLIEALLFAATEPLDEATLATRLPEGADIGAAIGALQADYATRGVNLVRLGGRWAFRTADDLAWILSRDAYETRKLSRAAMETLAIIAYHQPVTRAEIEDIRSVSISKGTLDVLLETGWVRLRGRRRAPGRPVTYGTTPAFLEHFGLDGVGDLPGLDELKGAGLLDGRLPPGFSVPMPNDDSALTPDEDPLETTDLSFAFPEPEPEHEG >NZ_CP043489|80105:90055|80105_81902_-|WP_149250901.1|DBSCAN-SWA MRFPIELIRSRHLSRVAVIALVAGLAASCSDTSRFAQNPFSNPFNSGDNQLAYAQAQPAKARPRYDNSNYAVNTVPAAGVVASQPLPPVGAVAQRSAPSTQYRLARGEQPIVTGAISTPAPQIGGFKAGGWTATGGTPVTAAAGETINTLSNRYGVPSDAIMKVNGLTSRSLTPGQQVIIPVYNAAQNGNAAVTTVAATGEQPYPGDVPETSRGSVGEVKVRPVATTPAAGAKPKPVAVASGVHVVKPGETLTSIAKAYGTTRPRLAKENGIDEWMSVKIGQKLVVRGTAHATAAAAPAAEAVVDNRPPRDVPVAAAVQPRPVAAPKPVQAARAANPAPVKTAKTQAKSIAAEAMQVAAPHGQAVRTATVKPVTVKPVQTAKVVPVDTKAKPSQVAAAKPVDAAKANKLKAEKLAAAKQKVPANKPAQVAIAKPIKPVETSQPAPVATGTISPAASAPQTAAPAAELPKDANVAAPGSPEFRWPVKGRIIQNFGRNSDGINISVPEGTEVKAAEDGVVAYAGSELKGYGNLILIRHANGYVTAYGHAKELDVKKGETVRRGQTIATAGQTGNVTSPQVLFEVRKGAAPVDPTRYLSGT >NZ_CP043489|80105:90055|88043_88274_-|WP_149250907.1|DBSCAN-SWA MGTWGITHWIILAVVVLVLFGGRGKISDLMGDVAKGIKSFKKGMADDEPAAAQQPVKTIDTQAGTTVNPAEQTKAG >NZ_CP043489|80105:90055|84411_85194_-|WP_149255232.1|DBSCAN-SWA MRVLITNDDGIHAPGLVSLEKIARSLSDDVIVVAPETDQSGVSHSLSLSTPLRLRRIEEKRYAVSGTPTDCVLMGVQHLLKDQQPDLVLSGVNRGQNIAEDVTYSGTVAAAMEGTVLGIPSIALSQAYGIGGRDGMRWHCAEQHAAGVIRKLLAHGFAKGTLFNVNFPDCEPEEVAGVAVTTQGQRDAALVDVVERQDFRGGPYYWLAPRRETPPRLEGTDLAALFNKRISVTPLHLNLTDVPALQRLSDAFETVPRPTS >NZ_CP043489|80105:90055|85375_86659_-|WP_149250904.1|tRNA|DBSCAN-SWA MYDVKWIRENSGLFDQGRVRRGLEPLAATILELDESRRALVARLQASQERRNAASKEIGAAKQAKDEARAQALMAEVNTLKASMPADEEALRVADKALYDQLISIPNLPLDEVPFGADEHGNVEHHRHGVARAYTFAPKEHFELGEALGLMDFEIAAKLSGSRFVVLQKGLARLERAIGQYFIDKHTGEHGYTEVAAPLLVRDDAMIGTAQLPKFAEDQFRVQPGDFWLIPTAEVSLTNLVRDAITAEEELPLRYTALTHCFRAEAGAAGRDTRGMIRQHQFQKVELVSITAPEQSRDEHERMLACAEAVLKGLDLHYRVMTLCTGDMGFASQKTYDIEVWLPGQGQFREISSCSVCGDFQARRMNARTRPKSGGAPRFVHTLNGSGVAVGRALVAVMENYQNEDGSIAVPDVLVPYMGGLRRIEKQ >NZ_CP043489|80105:90055|89257_90055_-|WP_149250908.1|DBSCAN-SWA MSDEFEEAVRTDKAVSDPTLVVDVDGFEGPLDLLLELARRQKVDLTRISILALAEQYLVFVEEARRLRLELAADYLVMAAWLAYLKSRLLLPESPRGEEPPAAELAAALAFRLQRLEAMRNAAAQLQERAVLGRDVFKRGMPEGIRLNNHGRFEATLHDLLSAYAQQRQRAMVGYVKLPERNVWTLAEARELLEALIGPAADWLSLDEFLVDYLAEPSTRATVRASSFSASLELVKEGKLDLRQATPFSPIYLRSRGDAGQADSS >NZ_CP043489|80105:90055|83728_84415_-|WP_149250903.1|DBSCAN-SWA MTDNMTDYQSFVNPDDEHAREMMAFILRLRRHGLTDQKVLNAVSTLPRPVFVLPEFGEFAWRDLSLPIQCGQTITPPVLAATIAERLDVEPLHRVLEIGTGSGYLTAVLGRLAKRVLTLDRWRTLVTEAEERLRAVTILNVTMMVADGSSGWRQQAPFDRIVTTAAMSEVPEDLLDQLALGGRMIAPIGPPGEEQQLTMIARDADGFHHTPLMDIRAASMILGVADRL >NZ_CP043489|80105:90055|86697_87480_-|WP_149250905.1|DBSCAN-SWA MTEDEIDASKAPLLDHLIELRTRLMKSGIAFIVLFIGAFAISKSIYLFLLWPYEWAAAGNPVKLIYNGPLDFLFTQIRVAFFTAAFIGFPFFAIQIYRFIAPGLYRHERNIFVPYLIATPLFFFLGGLLVYFIAMPAVMHFALGMQQLGGSGIATIELLATVEGYLSLLMNLIFAFGVVFQLPVVLTLLGRIGVVDSTFLKEKRRYAIVIVTAISAVLAPPDPWSMIALAVPGVLLYELSIYSVRWVEKKRAETDTQTSE >NZ_CP043489|80105:90055|87481_87943_-|WP_149250906.1|DBSCAN-SWA MFDISWSHILIVGAVAVVVIGPRELPSTMRSLGRGVNKLRRMAGEFQGQFNEALKEANLEDVKKEFDTLRQSAATLSSISSPAGLARNALKSSVFDSEPPLSPAAIAPVLESVTGTDTASEPSIAGPGPSPLAAKLVGVEIPSAFAPLAAFKS |
10 | uncultured_Mediterranean_phage(75.0%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
336741 : 369695
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP043489|336741:369695|DBSCAN-SWA GTCAACCCACGAGCACGAGCGCGTCCGAAGGCGTCAGCAATTGGCGTTCGCCGGTGCGGCGGTTCTTCACCTCGACCTTGCCCTCCGCCAGGCTCTTGGGGCCAACGATGATCTGCCAGGGCAGGCCGATCAGATCCATCTTGGCAAACTTGCCGCCGGGCCTCTCATCGGTGTCGTCGAGCAGCACGTCGACCCCCTTGGCGCCAAGCTGGAGATAGAGATCGCCGCAGGCGGCATCCACCGCGCTGTCGCCCGGCTTGAGATTGATCAGGCCGACCTTGAACGGCGCGATCGCCTCCGGCCAGATGATACCGGCCTCGTCATGGCTCGCCTCAATGGCCGCGCCGGCAACACGGGTCGGACCGATGCCGTAGGAACCGCCATGCACGGGAATGAGGGAACCGTCGGGACCGGCAACGACGGCCTTCATCGGCGCGGAATATTTCGTACCGAAATAGAAGATATGCCCGACCTCGATACCGCGCGCGGAAACCTGCTTGTCGGCCGGCAAGGCCTCGAAGGCAGGCTTGTCGTGCTTCTCCTCGGTTGCCGCATAGAGCGAAGTCCATTGATCGACGATCGCCTGCAGTCCCGCCCTGTCGTCGAAGTCGGTGCCGGCCGAGGGTGGTGCCATCGACAGATAGTCGCGGTGGCAGAACACTTCACTCTCGCCGGTCGACGCCAGGATGATGAATTCATGGCTGAGATTGCCGCCGATCGGACCCGTATCGGCCACCATCGGAATGGCCGTCAGGCCTAGCCGCGCGAACGTGCGCAGATAGGCCACGAACATCTTGTTGTAGGAATGCTGCGCGCCGGCCTGGTCGAGATCGAAGGAATAGGCGTCCTTCATCAGGAATTCGCGCCCGCGCATCAGGCCGAAGCGGGGGCGGATCTCGTCCCGGAACTTCCACTGTATATGATAGAGATTGAGCGGCAGGTCCTTATAGCTGCGCACATAGGCGCGGAAGATCTCGGTGATCATCTCCTCATTGGTCGGCCCATAGAGCATCTCGCGCTCCTGGCGATCCCGAATGCGCAGCATTTCCTTGCCGTAATCGTCATAGCGCCCGCTTTCGCGCCACAGATCGGCCGACTGGATGGTCGGCATCAGCAATTCGATGGCGCCGGCCCGATTCTGCTCTTCGCGGATAATCGCGTTGATCTTGTTCACCACGCGCAGGCCGAGTGGCAGCAATGCGTAAATTCCGGCCGCTTCCTGACGGATCATGCCCGTGCGCAGCATCAGACGATGGGAAACAATTTCTGCTTCCTTCGGCGTCTCGCGCAGAATAGGCAGGAAATAACGGCTCAAACGCATAGGACAGACCTTCTCCAAGCTGCCGGCGACCTCTGCTGCGCCGGGTAAGACATGTGGCAAAATCGCGATTCAAGATGAAAAGACGCGCCGATCGAGGCAATGACGCACCTGTCCGGCAAATCGCATCCGATCGCCAGCTTCTGGCAGCTTGACCCGATCCCTCGTAAAAAGACAAGGGTACGACCAGAAAAACCGCCATTATTTGGCAATTTGCACGCCCTGTATGATGAGTGCGATGCTACCTGCATGTTCACCAACCTGCCCACGAACAGCGCAGCCTAGCGAAATGACGCGCACAAATTTTAAGCAAAGATATTAATTTTGTGCCGATCGCTGCACAAAGACGTGACAAGCAAAAAAACTTCGTCTAGCGTCCGTCCCGTCAACGACAGGGCATCCGAACTCAGCGATGAGTAGAAGGAAGTCAGGGTCCAAGTCTTGGGAGGAAGCCCGCAGGCGCCGCCGGTGAACAAGCCGTTGTCGAGCAGCCGTCCTGACACTTCAAACAGCACAAAGATGCGTGAGGGAAGTCTCAGGAAGATGGGTTAGGACGAGGTCTAGAATAAGGCGAGGCTCGCAAGAGCCTCGCCTTTTTTACTGGTGCTGACGTCACGGCATGTTGCATCATGCGGGCCCGTCATGACAACCACATGACAATGCCAGATCGGCAGAACCTCCGTCGCAATGGCGACAGCATCGCCTGGCGGAATCGGGTTTTTGCCGAAGCAGATTCCGCCTGACCGAACGCTGCGCTCGGTCAGGCGGAATCCGCGGGATATCAAACGCTTGCAAAGTTTCCCGATCTTGCCCACGAATGGGCATATCGGGAAACTTCGCTTAGCTTCGCATTGTCTGCCCGACCGATTTACTTTAGCCTTTGCCGGCTGTGTGACGTGCGGCAGGACTGAGGCGAAGGCCGGCAGTTATCAGCGCCTCCCCCAGACTGTATAGAGCACGTTGAGCCTGTAGATCGATGAGCCGTAGCCCTGTCTCGCAACCTGCGCGCATCGCGCCGCTGGCGGTATTGCCGGTATTCTATCGCCTGACCGGCAAACCCGTCCTGGTGATCGGCGCGAGCCAATCCGCCTTGTGGAAGGCCGAGCTTCTGGCGTCGACCGGCGCCGAAGTCCGCATCGCCGCGCCCGAGGCAAGCGACGCCCTGGCGCAATTATGCGCCCGCGAGGACATGAGTGTCCGCCACCTCGCCCGCTCCTGGCATGAGGCCGATCTCGACGGCATCGCTCTGGTCGTTGCCGATGTCGAAGATGACGAGCAGGCTGCCCATCTCGCCGCCGCCGCGCGCCGGGCCGGAACGCCCGTGAACATCGTCGACCGGCCCGCCTTCTGTGATTTTCAGTTCGGCGCGGTGGTCAATCGTTCCCCGCTGGTGGTCGGCATCTCGACCGATGGGGCCGTCCCCGTCTTTGCGCAGGCGGTCCGCAGCCGAATCGAGATGCTGCTGCCGACCGGTTTCCAGAAATGGGCGGAGGCCGCCCGCAGCTGGCGCCCCCTGGTGCAGGCACGAAACCTGTCCCTGCAGGCCCGCAGAGCGGTCTGGGAGCGTTTCGTCAATGCCGCTTTCCGCCATCCGGACCGCACGCCAGGCGAGGCCGACCGGGCCGACCTCCTCGCCGCGGTCGAGCAGCAGGAAAGAAGACCGGCACATGGCAGCGTCGCCTTGGTGGGCGCGGGCCCGGGGGATCCGGAACTGCTGACCTTGAAGGCGGTTCGCGCCCTGCAATCGGCGGATGTCATCCTGCATGACGATCTCGTCTCCCAGGAAGTGCTCGATTTCGCCCGCCGCGAGGCCGAGCGCATCATCACCGGCAAGCGCGGCCATCGCCCATCCTGCAAGCAGGGCGACATCAACGCGCTCATGGTTTCTCTGGCCAGGGACGGCAAGCGTGTGGTGCGGTTGAAAGGCGGCGATCCCTTGGTGTTCGGACGCGCCGGCGAGGAAATCGCCGCCTGCCGTGCGGCCGGCATCGCCGTCGAAGTCATTCCCGGCATCACTGCCGCGTCCGGCGCCGCCGCGAGCCTGGAGGTTTCCCTCACCCACCGCGACCACGCCAGGCGCCTGCAATACATCACCGCCCATGCCAAGAACGGCAAGCTGCCGGACGATCTCGATTGGAAAGCGCTTGCCGATCCTGCCGCCACCACTGCCATCTATATGGGCAAGCTGGTTGTCGCCGAAGTCGCCGCGCGCCTGCTCGAGGCTGGCATGGCAGCGGACACGCCGGCCATCATCGTGGAATATGCCACCCATGAGCGCGAGCGCCGCTTCCACACCGTGATCGGCGAGATGGCCGGGGTTTTTGCCGCCAACCAGCTCGACGGCCCCTGTATCATCCTGTTCGGCCAGGCCATGTCCGAGGCAAGGCCCGACAGCGCAAGCTGAACGGGAACCATCTGGGCCGTACAGCTTCCGTTAAGGAAACATTTGCCACCATCCTTCCATGGTGCAGCGCAGATTCGCGCCCAGCGGGCGTTTGAACCACAGGAGTTCTTCATGAAAAAACTGATTCTCGCGGCTGCTCTGGCCACGATGGGCTTTGCCGGCCTTTCGACCGGCGCCTCGGCCATGCCCGCCGCGCCCGCCATCGGCGCGGACACCGGTTTCGTCGAGACCATCGCCTATGGCTGCGGGCGCGGCTATGTGCCCAATCGCTGGGGCCGTTGCGTGCCGAACCAGCGTCCCGCCCCCGCCTACCGTCCTGGCCGTTTCTTCGGGGAGGCTCCCGGTTATCGCCGCGGGCCGGTACCGGCCAATCGCAACGGCTATGGCCGGGCCTGCCCGCCCGGCACGCGTCTGGGCCCGCGCGGCGGCTATTGCCACCCGGTCTACTGAGATCGCCTGGGGGAGTAAGGCCGAGAGCAAAGCGCGTTCATGCCTGAACGCTGGGTTGTCCGAGTGCCTTGGCCTCACGCGTTGTTGTGTTTCTCGGTGAGTCCGTCAAACCGCAAAACACTTCGGCATGATCCAAAGATCTTCCGGCCGAGGCCTGCCTTCGGCCGGAATATCTATGGCGGACCGAGAACCGGCCTGTCCGAATGACGAAACTGCACCCTCGTCAGGCCCGGCCGGTCCCGGATCCGGCTGCGAACCGGCTAAGCAGCTTTCTCTTGATCGAGCCACGTCGTTATCAAGAGGAAGCTGCGAAATATCAAACATTTGCAGGGTATTCGGGCGAGACCTATCCGTGGGTCTCACTAGGACACTCCGCTTAGGGCACGCCCTGTCGCCGCTACCAGTTCGGAACACAACGCCCCCACCGCCCCGTATGACGGCCCGGTGGACACCCCCAGCGCGGCACAGCCCAATAGCGCGGACCGTCATAATAGCCCGGCCCATCGAAATATCCCGGCCTGTCATAGTCATACGGGCGCGCATAATAGCGCTGCCCTTGATATTCATCACCGTAACGCGCGCCCCGATAGCCTCTGGCATTGGGCACACAGCGGCCATAGTGGCTGTAATAATTGCGTGGGCAACCGTCGGCGATGCTCACCAGCAGATCATTATCGACGACCGCGTTCGGCGATGCCGGCATGGCCGAGGCGGAAGTGGCCGCAAAGCCGACGGAGGCAACCAAGGCCGCGAAAACCCAGTCTTTCATCATGCTTTCCCCGGTTGAATGTCCTGCGGCGTGAGGACGATACCCGCTATTCCTTCCACATCGACCGCTCTCGCCGCAACATTTGCCGCGTCGACGCAAGAGCAGGCTCGGCGGCCACCCGACCCGAATCGCCTGGTCCAACAGCACTACTGCCGGATCAGCCCGCCCCGCGGTTATGCAATCCGGGGGCCCTCATACCTTGGAAACCAATGCACCGACGGGAACGCCGCTCAATCGCGAATGTTCCCAGAAACTGAAATCGGGGCTTGCGCATCAGAGCGAAAAACCCTATTCAGCCCACGCTCAATGAGCGTCGGAGCGTAGCGCAGCCCGGTTAGCGCACTAGTCTGGGGGACTAGGGGTCGGAGGTTCAAATCCTCTCGCTCCGACCATTTTCCTAAGAGATTTCAATCTTTTAGGCCGTTGACTGCAAAATATCGATATGGGGTCGACGGGCAAATAGCAGCCAGAAATGGCGCTGTTGAGAATGAACAGTCCCAACAAAGTCCCAACGCGTGTATGCAATACGTTCTCGGTCCGACCGAGCTCTCCCGCCGAGGGAGAATCCCCTATTCCTCCCCCTTCGCCGGCCCTGTAGCCTTTAGCTCCGCCTCATAAAGAGCAAGCGACTCAACCAGGTCAGGCGGGTTAATGAACGCTATCTCAAATGGTATTTTGAGCTTATCTTTAAAAGTTTTATAGACTCCCCGAAGAGCTTTCTCAGAGTCTCGATAATTCATGAATGTAGAATTGAATATGTAAAGCGCAATCGTGCTAACATTAGCTATTTGCGGCTGGGCAGATATAAACTTTGGATCGCTCATCATTGAACTCATCACATCCCCTTTAGCCTCATCGATGGGGAGTTTTTCAGCCAGAATGCCTCTTTCTTTTTCATATAGTTTCCAAAGATGCAGCATGTCGGCGTGCTGGATGCCTAGGTTCATCAATTCGTTCGTGGATGGGGCATCGCCGAGCCCAATGAGCATCCCCATTTCCTCGGGAGTGAACTCAATGCTCTTGCTGAGATTTCCCCGAGGCAGCAAATAAGCCCACAGTCTACGCATATCGTCGTCTTTATTTTCTTTTTTCCTCTTTAATGTAGATCCCATGTATCTAACAGCAACGGCGATATCTGAATGTATTTGCATAACTTTAAATAGAATGGAATTGGCGACACCTTGTTTTGTTAGAAACCTTTCCGCCTCCCTTTCGATTTTTGAAGCCTTCGCGTTCGATTTTTGTAGATAATATGATATGAGACCTCCCACGACGGCACCGATTACAGCACCGCCGACACCAGATACAAACTCCTTCGAAGTAAGAATGTCGGCGACCAATTGAGGATTGTCCATGTTGGCTTCACTACACTGGAGAAAATGGAGAATTCCTTATAGGTCGTGTATCGCGGGGCCGTTACAAAAAACTCTGCCCAGCCAGCCCTGCTAACCGCCAATTTCACGGGTGCGTCACAACGCTTTGCCCAAGCCACCGCTTACAAGTCGCTTGTCTCTTGAGTAGCTTCAATGCCAGTTCCGACAAACCAATGTGGAAGGCCCTCTTGAAACTGCTATGGTTTTCGCCCGTGTCGGAACACTAAACTTCATGTTTAGGAGGGCGGAATGGCCGGCAAGTTGCATCCCTTGATCCAAGGGCCGAGGGTCGATCTCGATATCTCAGATCGCCTATTGGAATGCGAAGAGGCGCTGGAGCGCAGCTTTCAAGATCTGGTAGAGCGCGCGGAGCTGGCAGGGTGGAAGACGATCGAGATCACCTGCGCGCTGCAGAGCCTGGCCGATCACCACATGCTCGCTAAGGCAGCCAACGAAGAGACCGATCTCCAGATTGCCGACGCTCTCAACCGTAGTCATGGACCTTAGGACGAGCCCTTAGCTCAGCTAGCAATGCCTTCCGCGCCTTCGCGCTGGGCTCGATCGAGAAGCCACCGAATCGATCCAGAAACTCCGCGGCATACCGCTCCTCCTTGAAACAGAAGATGGTATGGTGGATCGCCCATTCGCCAACACTGTGATGCCTCGGGGCGCCGCCGAAGGTGCGAACAAACTCGTCGACTTCGTGGAAGTTGGGGACACCTTTTGGCGTGGGTAAGGCCACGTGGAAAAGATACTCACGATCCAGCCGCGATGGTTGGAACTCACCTTTGCGATAGACCATCGCTCACTCCCTCGACGGCCGGAGCGTCATCCCTTCGTAGAAATCGACGCGTTTGGCGTAGAATCCCTTTTCGCCATTGATCTCTCTTGCGATGGCAAGTGATCCCGTGATTGGGCGCAAGCCACACCTGCTGCATTTGCACCGACTGAAGACATCCCAAAGTATCATCCTTCGGAAGCCCAAAACGACGTTGAATTGCGCAAGCGGGACGTGGCCGACGTGCTGGCAGCGCTTGCGCTTGCACGTGAACTCCAGATGGGTATAGCCCCAGTCCAGCGCATCCTCGACGGTCTGCGGCGGCTTATCGAGAGCCATGTCAGTCGTCCGGTTTGTAGGGGATCGGCACCTCGGCACGGGTCAGAAGGTCCTCCCAGCGCCCGCGGGAATAGCCGCGCGAGATATCCTGTGCGAGTTGCGCGACCTCGTCGTTTCGTTGTTCAAGCTCGGCCTGGCAATGATGCAGCGCTACCAGCAGGACGCGTACGGCCGCTCTCGCGTCGCCCTCACAGGCTGCGATCACATCGTCGGTACTGGCTTCGAGATCATCGGGGACCGGCTCAGGGGCCGGTCGGGTCATTGAGGCGCCCGGTACGGATCGCGCCACTCTTCGGGCGGCAGGCCCTGCAGTTCCCGAATTTCCGCGCGGTGAAGCCGGCCCAGGCGCTCTATGCGGTCAAGAGATTCTTCGTTCTCCTGATGCGAGATCAGGGCCACGCGATACAGCCATAGCAGTGCCGTGTTTGCGGCCCCGCACTCCGCGATGATCTTCTGCAGCGCATCCTCTTCGGCCTGCGTTGCCGAGGTTCGGTTAGGTCTTTCAAGATAGTCCCAGACCTCCATCGGAAGCCTCCGTCTGTCGTCTCGATTGAAGGAAGGCTAGTCCCGTTTTTTCGACGAGTCGAGAATTTTGTTCTCATTTCGTTCACGTCATGCTATCCCATCACCCGCATCCATTGATGGCAATCAGCAGGAGCCTCCAATGAACCTGAACGCGCAATTCCAGGGGAATTTGCAGCCTGGCGACACCAAAAAGCTCGCCATTTATGTGCCTGGCGCCACCAGGGAGGAATTGGAGAAGGGCCACCAGGCGGCCGCTGCATTCTTCGAGGAGAACATGTGCCTTCCCGCCCAGGCTGCCGCCGCATTCTTCAAGCTCGAAAGCATCGAATTCGACCCCAGCGTGGAGATGACCGATCGCGAGGCTCGCATAGCCGACGTCTGGCAGGAGGCACAGGAGATCGCCGCCAAGGCCATTTGCGAGGGCTGGCGCGAGCCTGCCAAATTTGCGAGCTTTGCGTTGGGCATCAGCACCAAGAAGCTCGCAGCAGACTGCGAGCGGAACCGTGAAATGCGAGGAGACACCACGCTCCCGCATCGTTAAGGGGGTCGTCGTGTGCAATCTCTTCAACCAGCAGATGACCCAAGAGGAGCTTCGCCAGCTCGGTCCGGTCATTCGAGACACCGTCAACTGGCCGGATGCCGTCGACGTCTATCCCGACTATCCGTCTCCGATCATCCGGGATGGCGCCGACAGCGTCCGTGAACTCGTACTGGCGCGGTGGGGAATGCCGACGCCTCCGAAGTTCCTTGAGGGGAAGAAGAGCGATCCTGGCGTCACCAACATCCGCAACGTTGCATCGCCTCACTGGCGCCGCTGGTTGGGGCCAGAAAGCCGATGCCTGGTGCCGTTCACCGCCTTCTCCGAATACTCGGACAGTGAGAAGAACGAGAAAGGCGGCAAGGCGCTGAAGTGGTTCGCCCTCAACGACAACAAACCCCTCGCCGTCTTCGCCGGCATCTGGACCAACTGGACGTCGGTGCGGAAAGTGAAGGAGGGCGAGGTCACTGCCGATGTCTTCGGCTTCCTGACGACGGAGCCGAACGCAGTCGTAAAGCCACATCACGCCAAGGCGATGCCCGTAATCCTAACGACCGCCGAAGAGCGTGATGCCTGGATGCGAGCACCATGGAACGAGGCAAAGGCACTCCAGCGCCCGTTGCCCGATGATCAGTTGGTCGTCGTCCCCCGGCCGGCGTCTGGCAAGTAAACCGAATCCACACCGTTATGGCGCCCAATCCGTCCGAGGTAAAGAGAATGGCACACCCCGCCGAGCTGCCGTTCGATCTGCCGTTTGACGGAGAAATGCCGATGCCCGCCGCTCTCGCTCTAGCCCCTTCCTACGACGGCCCATTCGCCGCCGAGTTCGCCAAGCTGACGAGGGAGAACCCTACGATCGACCCTCGCCGCTGGGAACAAGCCAAGAAGGACGCTGCAGAATTCCTCGCCGATTGGGGCGAGCAGGCTGCCGAACTCGGCTGGACCGCCAAAGAGCTGTTTGGGCTGCACCCCGTGGCGCCGCTAACCCGCTATGACCAGATGGGCCTGGTCTGGCTCCTCAACGGCAAAGAGGTCGACGAGATCACAGACCGAACCGCCAAGATCGGCGCGACTACGTTTTACCGGACCACGGCTTAGCGAGGTCCGGACGCGCGGCTCGGCTGCTTGTCATCGATCAGGCGATCGAGCTTGGCCTCGATCCGTCGCGTCGTCTCGACAAAGGCGAGCATGGTCGTCTCGGCCTTCGCGAACCGCTCGCGGATCTCCCCGCCATTATCCTTCAGCTTGCCGATCTCGATCTCGAGCTTGATGTTACGCTCCTCGATCATGTTGACGCGGCTGTTGAGCTGGCTCGACCACCAGACGACGCCCCCGGTCTGGACAAGCAAGGTGATCACGAGGGCAACCGGAATTTTCTTGTCCACGACCCAATGCTCTTTCTCGCTCGCAGATTGGTCATTCATGGCCTTGCCTCCTTCTCACAGGCGGGATCGGCCGGCGCGGCGTAGCAGTGCCGAAGCTGCTCATACCAGCGTCGGGAACGTGTCAGGCGGTCGTTTGCGGCTGCCGTCGAGCCAAGCGCCTGCGCTGCCAGCAGGCGGGCATCCTGGCCTTCTTGGACAGCCGGCACCGGGACTGGCGCCATCAGCGACGGCGCCGGCGGGAGCCCTCGCTCAAGGGGCGGGGTTGAGCGCGCGCAGGCGGCGCACATCGTCAGCACCGAGAGCGCAACGAGCGTTCGGCCTGGCGGCCAAGGCCTTCTGATAATCATCGATCCTGTCCCGGTTTTCGGCGACGGCCCGCTCGGCCGCGCGGGTTTCGTCCTCAGCGATGCCGGCCAGGGCGGAGGCGATGGCCAGATCCGTCTTGGTGGCAGCAAGCTCGGCCTTCAAGGCGGCGACATTGTCGAGGCGAGCCTGCTCGTGCCGGCCGCTGCTGTAGCCGTAGAAATAGGCGGCGATCACGATGCCGATCGCCGCCACAGCCCGGCCGAGGCCGGTCGAGAGGAAGGCCAGGGCCAGGCTCATGGCTGCAGCCCCGACACGCAGAGTTCGGCCTCGCCGATTCGGGAGGCGTCGCCCATCTCGCGCCGGTTCACAAGGCCGATCACCATGCGCCCGCCCGCCTTGTTGTAGGCGGTCGCCGCGAGGCAGCTGGCCTGATACTGCCCTGCCCGGCCGAGCCTTGCGGCCGTCGAGCTGCAGGCCGTGCCGACGCCGGCATTCCAGCTGAGCGAGATCATCATCGCCTGCCAGGCCAGCGGCCGCTTCTCGAAACCGTCGATACAGGCCTTGAGGCCGGGATAGAACTCGTCGCCGAGCCGGCGCTTCAGGCGCCGGTCGCAGCCGGCCGGCGTCTCAACCATGCCGGCCCTGACGTTCTGCGTGTCGCCGTCGCAGATCGTCCAGCGCGGCGGCCGCGCGATGCGATCGAAATAGGCGCGCAACTCGCGCCCCTCCCAGGGCTTGATCAGCGTATCGGCCGCCAACGCTACGGCGGCGGGCACGGCATCGCCGCCCGGCTGCTTGACGTACCAGATGCCGCCGATCGATGTCGCCACGGCCAGTACGGCGGCGATGGCGGCGGAGGCGCGCTTAGTCGGCCTGATCTTTTGGACCGGCATCGGAAACTCCCTTTTGTGCGAGGAAACGAGCAAACAGGGCCGCGCCGACCGTGAGGCCGGTCAGCGCGGCGAAGAGGCGATCGGGGATCGGCAGATAGGGTGCGAGCAGCGGCAGGGCCGCCTCGAGGCCGGAGAGCACGGCGGCGAGCAGCAGCAGGCGCACGCTCCAGGCATGGCGCAGCACTGCGCGCCAATTGGGGACGAGCTTCATGTCGGGAGATCCGTGAAGAGCCGCTCAGGTGCGGCGGGAGAGGCGCGAGTGGGCGATCAGCTCCACGATGCCCCGATCATGGCGCCGCCATTGATGGCCGTCGTCGAGCTGAGCGTGCAGGTCAGCGCCTTGTTGCCTGCAGCCGCCTCGCTGTTGCTGGCGCTGCTGCACTTCGACGTCGTGGCGACGGTGGCGTTGACATCGGTCGTATAAGGCGCCGTCCAGGTATGCGTCAGCACGCTCGATGTGTAAGTGTGGCAGTAGCCGATGGCGAAACTGTTCGCCAAAACGCCGTTGAGCGTGATGCTCTTCGCACCACCCGAACTCTGCCCCTTGAGGGCAAGCTTCGCCGTCGTGTCCGGCGTGACCGATTTGGCATTGTCGATCGTGTAGACCGAGGCATGGGAGCGATTCATGCTCGCCCCGAAGCTGACGACGACCGTTGCCGTCGCCCCTGTCGGAACGAAGGCGGCAAAGATGCAGGTGTTGTCGGATGTGCCCGATAGCGCGTTCTGTTGGTTGGCGACGAGCGTTGCCGATATGCCGCCGATCGTGCAGGTCGGATTGGCGACGAGCGCGGTATTGAGGCCCATCGCCACCACGACGACGAGCCTGCTTGCCGACGCCGTGCCGATGTCGACATTGGAGAGGGTGAGAGACGCCCCCGCAGCCTGGCCGGCCGCCGTTCGATAGGCGCCGACCACGGGCAGAGCCGCAGACTTGGCGGCGACGAAGCCGAGCCCGAACCTCATGCTGCAAGATCCCCGATCAGCACCCATTCGTCGGAGCCGCGCTTGTAGAGCGTCGCCCCGGAATACTGCCCCGTGAGCTTCAGCTTGCCACCCGCCGAACGGATGGTGACGCCGGCATCGGCCGCGATCGTCGTCTGGCCGGCGCCGAACTGTGACAGGTCGATCCGGCTGTTGACCGGAAAGGCGACGGTCGCATTTGCCGGGATGGTCAGCGTATTGGCCCCCGCATTGTTCATCTCGACGATCTTGCCCTTGTCGGCGAGCGCCAGGGTATAGGCCGTGCCGGTCTGCGTGTTGATGCCGACCTGCTCGCCCGCCTTCTCATAGGCCGTCAGCGCGGCCGGCTGCACGGCCGTGTCGGCCTTGGCGCCCTGCGCGGCCGTAGCGAAATCGGTCGAAGCCGACAGGGCGGCCGAGCCCAGCGTCGCCCATTCCATCGCCGTTGCGCCGGCATTGGTGCGCAGCACCTGCAGGGCCGTTCCAAGCGCGGTCAGGCCGGTGCCGCCATTCGCGACGGCGAGGGTGCCGGTGATATGCGTGGTCAGCCCGACCTTGCCCCAGGACGGCGCCACACCGACCCCGCCGGAGCGCAGCACGTTGCCACCCGCCACGTCGGCGAGCTTGGCGAGCGTGGTGGTGCTGTCGGCATAGAGCAGGTCGCCGATGGCAAAGCCGGCGCTCAGCCCGGCAATCGCCTTTGGAGGGAGCGGCGTGGCGTTGGAGGTCAGCCAGCTTCCGCCGCAAACGTCCCATTCGGTGCCGCCGCGATTGATCACCACGACATTTTCGCCCGGGCGCAAAGTCAGCGATGTGTTGGTGTTGCCGAGACCGGCAGGCGGGCAGTAGATAAAGTCGCTGCCCTGGCGGACGACGGTCACATCGGCCGTACCATGGTTGAAAATGGTGATGGCAGGCCCCGCCGCGCCGGCACCCCAGACGCCGCCGACCGGCAGGGTGATGTTGAACGGCGTGGCGGAGGATGCCTGCACCGCCTTTCCGTGGTCGTTTACCCCCAGCGTGACATTGACATTGGCAAAGACGAAGCCGGCGAAGTTGCCGATATGGCGCCGCACGAAACTCGTGCAGGCCAGCGACGGAGAGCTGTCGCCGGGCGGCCGTTCAGCGATATAGGCGTCACCGTTGAAGTAGGCGGCGAAACCTGCGGCGAAGGTGGCGGCCGCAAAGCGCTGCGGCTGGGTGTAAGTCTGGTCGCGATTGAGGCGGCTCATCAGCGAGAACTGCACGCCCCCGGACTTGATCAGTGTACCCGTGGTGCCGTCGAAGATGGCGACTTCGGTATCGACCACGGCAGCGGCCGGCCCCGTGACGTTACCGGTCCCGGTGCCGTCCGCCCCCTTCTGCGCGAACAGCTCCCAATAGGTCGTGTTGGTGACCGCCTGGTTCGTATGCGCGACTTTGCAGCGATAGGAGGAACCATTGCTGGTGACGAGGTCATTCACGGCGTAGGCCGTCGCGCTCGACCAGGCTCCCTTCGGCGTCAGGCCGGCCGGTCCGGTGTTGCCAGTGTCGCCCTTTGGCAAGGTCAGGTTGAGCGTCTGGTTCGGCGCCGTCCCGGTAATGGTGGCGCCGGCACTGGCGCCGCTCGCCACCGTGCCGATCGCCAGCGTGTTGGCGGGGCCGGTATCGCCCTTGGGGATCGTCAGGTTGAGCGTCTGGTTCGGCGAGGCGCCCGTGATGGTGGCCGCTGCCGGCCCGGTCGCCACCGTGCCGATCGCCAGCGTGTTCGCCGGCCCGGCATTGCCCTGCGGCCCCTTGATATTGCCCTGCGGCGCGCCCCACGCGCCGGCCGCCTTCTTGTAGAGGTCGCCATTGCTGGTGTTGATATAGGCGGCGCCGTCGACGCCTAGGCCGACGGCCGGCGCCCCCGCCCCCATCAGGATCTGATCGCCGGAGCGCGCGACGAGCTGCCACTGCGCGGCCGCCGTCGTCGGCGTGTTGCCGGTGTTGGCGTCGACCCGGGACAGCCAGGACGAACCGTTATAGGTGACCAGATCCTGCTTGGCATAGGTGGTACCGGCCGCCCAATCGCCGCGCGCATTCCACGGCAGCACGGTCGCGGCCGGCGAGATGCCCCAGGCGCCATTGTCGCGCCGATACTCGATCGGCGGACTTTCGTCATAGTCACGATAGAGATCGCCATCGACGCCGATATCGTTCGACGGCGGCCCGTTGCCGGTGAACCAGACCGAGGCCGGCGTGAAGCGGGCACGGATCGCCTGCACCGCCTGCATGGTGGCGATGGCGATGCTATCCTTGTCGGGATCGACCGCCACCCAGCCGCTCGCCCCGGTCACGGCTGATGGCAGCGGATGCAGCAGCGTCAGGGCATTGTCGGCGGTCGCATCGGCCGCGACCGGCACCTGCGCGAAGATGCCGCCGCCCATATTGACGATCAGCCAGGCGCCTTCCGTCACTATCGGCGCCAGACCGTCGCCCTTCATCGGATTGCCGGCGATCGCCGCGGCCGTGGAGCCCGCCGCGCAGGTGACGGTAACGGTGAGAATGAGATTGCTCATCGCGAAATTCCCAGGTCGGACAGGACGGTTTCGAGGGCGGCGGGTGCCGCGGCCGCGCGGACCAGGCTCTTGGCGGCAAGGCGCTGTTGCTCCACCTCTGCGAGCGCCTCGGCGTTTGCGGCGGACTTGGCCAGGATCGTCTCGGCGATGAAGTCGACGAAGTCACCTCGCGCATCGGCCTCGGCGCAGATCAGCGGACCAGGCCCCTTGCCCGCGATCACGTCTCGCGCCTCATCGACCTTGCGGCGATAAAGATCGGAGAACTGGCAGAGCCGGCCGATCTCCGGTCCGAAATGCCGGTCGACGAGGCCCTCGGCTTCGGCTCGCCTGGCGGCCATGTCCGGTAAAAGTACGATCTTCATGCCAACCTCAATAAAAAGCTGACCCGAGGGCCAGCTGCTGTATCTGGATCGACAGATCCATAGTAAGTTTTCGCGGGGTTACCTGGGCAACGGCTGTCGTGAAACCGCTGGAGGTACAGTTATGCTGAAAAGCAAGTCGCATCTTGTTTCGGCAGTCGCTGCAGGTTTTATATTTTTTGGAATGAATGCGGCATACGCCGCCTCTTCAGATGATTATTCCGGAAAGGTAACCGTTCTGAACGGCCATGAGGTTTCATTATTCGGCCATAGCGACCTAAATCCGGATTGCACCAAATTTGGATACTCGACAATTACATCAGTCGTTCGCCCGGCCCATGGCACGATACGTATGGTGCATCAGAAGATATTTGCTGTCTTTGGAGCAACAAATCCCAGATACCAGTGCAACACCAAGGGCGGGCCGGGAATTCGCGTCTACTACCGGGCAAACAAGGGCTATCACGGGTCGGATCACGCAGTATTCTCGGTCTATTCCGCCTATGGCCGGAAAGGCACGGCAACGATCGATATCAGCGTCGATTAAGAACCTGGCGCGACCGCAACGACTTGGCCGCGCCATGGCTGGAACGGCCACGGTCCAACAGCCTCGATCGCATAGGCGTGCGGCTCGCTGGACACGATCTCGATGTCGGTTCCGTCCGCAATGCCGAAGGTGCTGCCGTCCATGCGGATTTCCCAGCCGGCCGGAACGCCCTGCAACATCGCCGTGTCGACGCCATCGGCGAGGATCTGGCCGGCGGTCAGCGCGATCTGTGCGACCGGCCGGTCCTTGACCTGGCCATCGTCGACATAGCGCTTGTCCGGATCGGCCGAGGTCTCGTCGATCTCGATGAAGGCGGCGCCATTGGCGCGCATATGGTCGACCAGTTCTGGCCGATAGACCATGATGACGTGCGTGATCATGCCCGGTGCAAAGCCGAGCGGCCCCATATATTCAATGATCATCGATTTTACCCTGCTGGAATGCCGAACACCGCATAGGTGAGCTGGCCGGAAAAGGAGAAGCAGCTGCGAGCCTGCAGGCCCGTGCGCGTGACGGTCGCGTCGACCGAGTTGATGTCGTTGTAGGTCGCCATATAGGTGCAGGCCATGATGGCCTCGCCTGCCCAGGTCGGCCCATAGGGACCGATCGGCGGATCGGACTGGTAGAAGCGATAACCGTTGGCACTATAGGCCGCGATCAGGGTGAAGGGGATATAGCCCATGTCCGGCCAGGTCAGCAGAGTCTGGAAGCCCCGCCCCTGCGCGCCGCTATAGGCGACCGACCCGCTATAGGCGAGCCGCGTCACCGCATAGTCGGCGTCGAACACGATCTGCGCCCAGGTCGCCGCATCGATGTCGACGCCCGGCAACACGATGCGGAATTGCGGCTTGCCGTCGCTGGGCCGAAAGCCGAAACGCGCCCTTTGCACCATGGATCCTCAGCTCGGCATGTTCACGGCAACGACGACATAGGTCAGCCAGACCGTCGTGCCGGTATTCTTGAAAGTGAGATTGGTGGTGGATGGCTTGCAGGTCACCAGTGCCTGCGCGTAGCCCCATTTGCGGAAAGGCACGAGGCTGTAGCCGGCGCCGTCCGAACTCTGCATGTCGACCGCCGGCACCGCCTCGAGGGTGACGCCGAACGGCACCACGATCGTGCCGCTAAGCTGCAGGCGGCCAGCCATGATCACCTTGGCGGTGACATTGTCGGTATCGATCAGCATGTCGTCGGTGGCGCCGGTACCGGCATCGAAGCCGGGTTTCGACACCCGAAACCGCGCAGGATTGGTCGACATCAGGACGCGCAGCATCAGGCCGGCACCGCGACGGGGATCTTCCAGATGAAATAGACCGCGCTCGCCGGCACGGCCGGTGATGCAGCCGATTGACTGGAGCCAACCGGGAAGCCCACGCGAAAGAAATCGAGACCGACATGAACGCGAAGCGGCGTGAAATAGGCCTGCTCCACATAGTTTCCCTTGCACCAGGGATCATCCTGCATCGTGCCGATGGCGCTCTCGTAGCGCACGGAAACGAAGGGGATGTAGGGTAAGGCCGGAAAGTCGACGCGCTTCTGCAGGCTTGAGGATGGGCCGAGACCTGTCACGCGGCCGACCTGGTGCAAGTTCTCGATGCGCGACCAGTCCGAGCGGAACGCCCATTTGAGATTGTCGGCGGGATCGGCCGTGCGGGCATCGTGGCCCGGCAGCGTGGCGAACACGCCATAGGCGCCGTTGCCGTCGGCGCCCAGGATCATGCGGCAGACGTCAGTCATAGATCTCGATCCGGGGATTGGAGCCGGGACGGAAGAAGATCTTGTTCCCGATGTTCATGTCGCCGGAGACGTTGAGCGCCCCGGTGTTGATGCTGAGATCGGCGAGCGCGTTGATGATGATGTGATCGGCCGTGAGCTGGAACGACGAGACCGTGCCGTCATTATAGGCCTTCAGGCCGGCGATCTTGCCGTCGACATCGAGCGTCACCATCCATGTCGCCGAGAGCTTGCCATCGACACTTGCGACGGCCTGCGACACGGTCTGGATCGCCGCCGTGTTGCCGTTCGCCGTGGCGCTCACCGTATCGATGCGCTGCGACAGGGCGCCGTCGGCATTGGCGCGGGCCGTCGCTTCCTGCGCGATCTGCGCCTGGATATCCTCGCCGACCTGCGCCTCGAGGATATCGACCTTGGTCGCCAGGGCTTCGGTCGCACTCACCTGCAACCGGATCTGGCTGGTGAAGGAGGCGCGGGCATTGGTATCCTGCACGTGGTTCGAGGTTTCCTGCGTGAAGGATTGCGTCGCCTCTTCGATCGTCGCCAGCGCCAGATAGCGGATCGGCACGATGGGCTTGAAGGCATCCTTGGCCGACTGGGCCAGGTCGTCGAAATTGATCGAGCCGGGCGGCAGCTGCGACTGACCGACCTGCACGCCCACCCATGTCGTCCACGTGGTCGGCCGCACCGGCGTGGTGGCGATCGTGGCGCGGAACTCGTATTGATCCCCGGCCCCGACCGTGTCGAAGACATAGGCGCCGTCTTCCGGGCTGTCGTCGCGCAGGCGCGTCGCCGCCACGGCATTGATACGGCGATATTCGATGATCACCGCGTCGACGGTCGGATCGGCGACCGGCGTCCATGTCAGCTTGACGGCCGGCCGCTGCGCACCCGATTCCGCGTCGATCATGATCGCCTCGGCCGAGAAGCTCGACACGGTCGACATATAGCCGGGATCGCCGGGCGCGCCGCCCGGGATCTCGATCGGCTTCTCATCCGAGGAAGACCAGGCATAGATCTGGTAGTTCACTTCGCGCAGGGCGACCGTGATGCTGTCGTCAAGATTGAGCTGGCGCGTCATCACCCGGAACAGCTTCGTCCAGCCATAGCGGGCCGACTGCCAGGTGATCCAATCGCCGGGATCGAGGAACAGCAGGTGATTGCCGAGGGTGAAGGTGCCGTTCGCCTGGGCCCGGGTCTCGCGCAAGCGGGCCGTCGCGATGCGCTGCGCCTGCGTGCCCGACGGCACGGCGGCCAGATCGAGCGACACGCGCAGAGGCTCGCCATCCTCGGCCCGGGCCGAGGCCGAGATGATCGGCGGATAGGAATTGGCCTGGTAACCGGCAGCCGGATCGACGAACTGGCCGTAGACCTCGTTGGTCCGCTCGGTACGCGACAAGCCCAGGCTGAAACGCTGTTCCGGCCCGACCCGGAAATCCGCATCCGTGATGGTGACGACGCTGACCTGTGCCGCGCCGGCCAGAAGTCCGAACGCACCCTGATACTCGATCAGGTACCCGGCCATGGTCTGGATCACCGGTTCGATCAGCGAGCGGAAATCGGCATCCTCGGCCACCAGCACATAGGCGCAGCGATAGCGCGGCTCGGTACCGCCTGCCGTCAGGGCTACGGTCTCATCGCAGACATTGGCGGCCGCGATCCATGTCGAGGCGATCAGATCATAGGCCGGGATGCCGGGGCCGAGAACGACCTGCCCCTCGGCCACGATGCCGGAGAGGAAGTTGTAGCCATGGATGGCGCCGTTCTCCGTCCATTGCCAGGTGGCGGGATCGTTGAGCCGATGCAGGCCGATCCCGCCTGCCGTGCTATCCTTGCGCACGTCGTAACAGCGATAGCCGCGCATCTGCCAGGTGAACTCCGGCGGGCCGGAGGCGAACTGGTCCGGGTCATAGATCAGCGTCACCGCCGCATAGCACATGCCGGCGAAGCGATCGTTCATCGTCAGCCGGCTCGCCGAATTGGCGACCGTGATCGCATCCGCCGCCTGGCCTGGCCGCCCGTCATGGAAGCGGACGCGGGCGAAGCCGCCATAGCCCTCGATCGAGTAGGTGGTCATGCCGTTGCCGTCGGTCGACAGCTGCGTCAGCGTCTTCTTCTTGCCGCCGATCCAGACTGCCTCCAGCCCGTCGCACCAGCCTTCCGACAGGATGTAGACCAGCTCCAGCCGCTTGTTTTCCTCCCCCGAGGTCTGGAAGAAGGCCAGATGCCCGCGCGTGGCGATGCGGCCGAACACGACCTGCCGGCCGACATCGGAGCCCAGCTGCAGATTGAGCTGCACGCCCGACGGCGTCGCCTTCGGCTTCGGCGCCAGCAGCTTCTGCAGGGCCGACAGGCCGAGCGAGAGGCCGATGCCGAGCAAAGCGCGGCCGATCGAGAAACCGCCGACACTGAGCGAACCGATAAAGCCGATGACGGCCGAGACGGCGGAAGCGATGAACCCCATCAGCCGACCCGCCAGGCGTGCAGCGCCGTCCAGAGCGGATTGATCTGGATGCCGGCCGGCGCCTTGCAGAGGATGCCGTCGCCGACCACCACGCCGAGGGCGATACCGTCATCGGCCATGAGGCTGACGATGTCGCCGCGCCGCGCCAGGGACGGGATCACCCGCTCGCAGATCGCATCCATGGCGCCCTCGAGGTCGCGAAAGCCCTTGCCGCGGATGCGCTTCATGGCCCCGGCCGCCGTCTTGTAGCGGCCGCGATGTTCGCCATAGGGATCTTCCCCGGTGACGGCGCGCATCGCGTCCATCGCCATGAGGAAGCAATCCGATGCGCCCCAAGCAAAGGGCGCGGCCATATGCGGGGCGAGCGCCTCGAACAGGGCGCTTTCCCAGCCTTCCAGACGTGTCATGAAGGAATTCCCTTGAAGGGCGTCAGCGTGCCCAGGCGCGAACTATCGCGCCCGGCGAGGCGTGACGCGGCCCCACCAGATTTCTGCGGTGGCGGTCGATTGCAGGTAGCGCAGCGAACCGTCGTTCGGATCGAGCGTGCGCTGATCGGCATCGGAGCGACGCCGCCAGCCGGAGCGGCCCATGTCGAGCGCGCGGCTTTCGACATGGGCCTCGAGATAGGCCTCGCCGGGGCTGGCGTCGGCGATCGCGCCCTCGCCCTTGCTCTCGACGTGGTCGATCGTATCGATATAGCCGCGATACACGGTCTCCACCGACAGCAGCGCGTAGCTGTCCGGATCGAGATAGGCCCGGGCCATGGTGACGGGACGGCCGCGATAAATCTCATTCTCGATCGAGGCGAGCATGTTCATCGGCAGCTCGGCCGAAGCATTGAGCCGGATATTCATCGGCACGGCCGTGCCGTCGGAAACGCCGCCGATCGCGTCGACCTGAAACAGCGAGCCGGCGCCCTTATAGGTGACGCCGTTATAGGTGAACGTCCCCTCGCCGGAGAAGAAGCCATAGAGGCCCGAGGGGAAATCGAACAGCAGGAGATCGGCCCTGGCGACGCGCCCAGCCTTGAGCGCATCGAGCAGGGCCTGCGAAATGACGCGGGCCATCAGTACAGCCTCTGAATGCCGGTGAAGGTTACAGGCTTTCGCGACAGCTCGGCCGAGGCATCGGGCGCCGTGGCGGGATCGATGATCATTTCGCAGACCGGCCTGGCGAAGTTCACCGTGGCGCCGATCGAAAACAGGCTCGTGTTCACGGCCGGCGACAAAGTGACCGACGCCGTGCCGTTGCCGGCCGCGACGACAGGCTCCATGATCCGGTAGAGCCCATATTTGCCGTTCTGCACCAGGCCGAGCAGATCGCCCGCCCCGATCACAAAGCCGGCCGGCAGCCCGATGATCGAGATCCCGGTCGCCGACACGGCCGACGCCTGCGCGGTCCCGTCGAAGGCGCCGCCCGAAGCCCGGTTCATGCCCGCGAAGCCGCCCGGATAGGCGCCCGGCCAGATCTGCAGCGGACTATGGCCGAGGAAGGTGCGCAGCGCACCGTTGAGGCTGTCCAGCCAGGCTCGCCACGGCCGCCGCTCGGCCATGTTGAGCGGCTTCGTCGTGTAGTCGCAGCGCCACCGCGGCGGCGCCAGCTGGATCGCCTGCGTAGCGCCGGAGGAGAGCATCGAGGCCGCCTCCGTCCGGATCAGGCGCATGGTGTCCGTGGCCTGGCAGCCGGCCAGCATCGCGCGGGGATAGGTGATTGCCATCAGAGAGATCCCCGCTTGCGGGCATTGAGGACCAGGCTCGGAACCTCCTGCCGGATCTGCTGGCGCATCACCTGCATTTCGCGTCGCAGGGCCGGGATGGCATCGGCGGAGGCCCCGCGAGCGTCGAAGCTCATCGGCATATTCAAGACGAGACCGGGCGCTGACGTAGCCCCGACCGCGGACACCTTGAAGCTCGTCGGCAGGCCGGCGCCCACAAAGCCGCCATTGGCGAAGCCGGGCAGGCCACGCCGCAAACGCTCAAGGTTCGCAACGCCGATCCGCTTGGTTGCCTGCTGATCGAAAACATACTCATCCCGATGAACAATGCCTGCAGGCTGGTACTTGCCACCCGACCCGGTGTAGCCACCGCCAGAGAAGCCCCCCAGAAGCCCGCCAAACAAACCGCCGAGCAGCCCGCCGCTTTTGCTACTTCCGCCAAACAGCCCAGCGAGCGGCCCCTCGCCCAACAGAGCAGCTTGGAGCACAAGGCGGATGAGGCTTTTCAACACATCCTCCAGGGCTCCCTTCAGATCCTTGGAGCCATCGATCAGGCCAGTGAGACCATCCACTGCCAGATTTGCGAACTCCCGCTGGGCGTCGATCATTTCATCGAATTTCTTCTGAGCCGCCTCTCGCGCAACAGCAGATTCAGCATATTTTCGAGCTTCAGCCTCCAGCGACGCCACCACCTGCGGCGTCAGGGCAATTCCGGATTGCGTTGCTTCGTTGAGCAACTCCTGGCGCTTGGCGAGATACTCAGTCTGGTAGGCTGTCTGAGTAAGTGCCGCCCCTTTTAGTTTCAGGCCATCGATATCCTGCTGGTTCTTCTTGATGATCTCATCCAGCGTCTGCTTACGCCGTTCGTCCGCCGTCAGGTCGCCCGCCGCGACACGAGTGTCATTGGCGCGGCGCCGAGCGTACGCGATCGCATCGTCGACCGTGCGGCCGCCCCCAAGGATCGAAGGGTTGGCATTGATCGATTTCTGACTGATCAGTCCCTGCAAGGGCGTGCCGGGCGCCGCCTTCAGCACCTTCGCAGCGTCGCCGGCTCCGAGGAAATGCGACAGCTGCAAAGCCGCCTCATTGACCGATATGCCAGCCTTCTGGAGAACAGCGGCATTTTCAGCTGCATACTTCTCGATAAGGTCTCGCGAGATGTTGCCATCCTTGCGGAGATCGAGAATGGCGTCGCGCCCCATGCTTTCCGCTTGCTGCGGGTAGTATTTGCGGAACAGGGCGATCCAGGTGCTTTCGATGAACTGCCCGACGCCGGTTGCCGTGCTATTTGGGTTCTTCGCACGCGGATCGCCGCCGCTCTCGGCTTTCACCACCCGATCGACGTAGGATTTAACGATCGCCTCGGTGGCCTGTACGCCAGCGCGAAGCGAGTATTCACGCTCAGCCTGCGCCCGGGCAGCATCCTGAAGAGTGAAGCCCGTCAGCTTCGTCCCCGCCTTCTTCATCGCCTCGACGATGTCGTCGGTCGCTTTGGCGATCTCCGCTTCCTTGCTGGACAGGCCGGCATCGCGGGAGCGCTTGTCGAGCATGTCCTGCACCGGATCGGCATGCCCGCTGGTGCCGCGAGTGCGCGCCGCCTCGCGCAACAGTTCCTGCTGGAGATCGGTGATCTTCTGGACGGTGGCCAGCTCTTCCTTCTTGACGCCGGCGACCTTCTCGGCCTCCTTCAGACGATCCTTCTCTGACTGTAGAGCAAGCCGGAGGTTGTCGAGATAGTAATCGCTAGCACCGGCAGCCTGTTCCGCCTGGATTTGCTTCTCGATCTCAGCGATATTATCGCGGATGAGCTGCGGCGCCTTCAGGGTCGCCTCGTCGTCGAAAGCCCGGCGCCTCTTCTGCGCATCCTCATAGGCCTGAGAGACGCCGTCGATTTTCTTGGCGAGATCCGCCATCGCAGCGGCAACGCTGTTGGCGGCCTCCTCGCCACCCACCATGGTCGCCACCCATTTGGTGGTGGCCGTGTCGAGATCGGTCCAAGCCTGCGACAAGGTCGGCACCGTGCCGTCGGCAAGAGCTTCGATCTCGGGCTTGGCAGCGACGATAGCCTTGAACACGCGATCGGACGTCAGCTGTCCTGCCGCTGCCAAGCCGCGCAGTTCGTTGGCGGAAACGCCGAACTCTCGTGCAATCGACCGGATCAGCGGGCTCTGTGTACCGAGCGATTCCATGATGGAGTTGAATTCGTCGCCACGCAGCACGCCGGAGCCGAGGGCCTGGCCGAGCTGCGTGATTGTCCCCTTTGCCGCGTCCGCGCTGACGCCGGCAAGAGACAGTGCCTGCGCGACAGCCTGCGTCGCATCGGCCGCAGAGCTGTTCGCCTGCCCCAGGGACTGGGAAACGCCAAGCATTTTCACGTAGAGATCGGAAGTCGCCTCAAGCTCGGAATGCGCCTTGAGTGCAACCGACGAGACCCTGCTCTGCTCGTCGCCAATGTTGCGAACCGCAATGCCGGCCGTCTTCAACCGGTTGCCGATGGCGTTCCAGCGATCGCCGTAGTTCACCACGCCGGCGCCGGCAGCCGCGACGGCGGCCGTGATGATACCGACCAGCCCTGTCGTGCTACGAAGGCTCTGCGCCAGCCCGCTGAACGATCGCGACATGCCCGACGCATCGAAGCCGCGCGCCATGGATTTGTTGCCACGCTCCATGCTGCTGTCGATCTGCTGCGACAACTTCTTGAAGCGCTGATCGATGGCGGACGCCTGCCGGTCGGTCTGAGACTTGGCCTTGGCGAGCGCATTTTCATATTTCTTCGTGCTCGCCTCAAGTGTAACGATGAGGCGTTCAACATCTGTAGGCATCGCTGGGGGACCATGACTAGGAGAATTGCGCTAATTGCGTTGATGCTCGCGCTTGCGGCATGCACTAAGGAAGTCAAACCCGAGGAAGCTATTTCGTCGGTTGGGCCAATGCCCCTTGATTACAAGGCCCAGATAATTGCGAACGCGAAGAGCAATTATTTCGACCCTTATTCGATACGAAGTGCCGAGATCAGCAAGCCTGTTCCGGCGAAGAACGAATTATATGGGAAGTACGCTTGGGTGGTTTGCGTCAAAGCAAATGCCAAGAACCGCTTCGGCGCCTATGTCGGCCAGCAACTGGATGGCTACGTCTTTCAAAACGGGAAGATCACGCAGAAATCCGGCCACCCTGAAACCTATTGCGATGGCAAACCCTTCGAGGCATTCCCCGAACTTGAGAGCATCAAATAATACTCGACACTCCGGCGACCCCTCGATCAGGGGCCGCCCTCGGTGCAACTGCATCTAGATCATTGAGGCAGCGGCCTCAAATTCAGCATCACTCATATCCGCGACACCCTCTTCCCCGGTTTTCGACGCGATGAAGGCCGACCAGCAGCAATTGAACTGCCATAGGCTCATCGCGCCGACTTCCACCGGGGAAAAGCCCATCACGGCTCCGGCGCCGTAGAATTCGGAGAAGCGGAGCTTTCCGTTTCGGCGAGGCCGGGCTCCGCCATCCCGTCCGCCTGGTCTTTTCCCGGCTGGTCATCCTCGACGCCCCATAGGGCCGCCGCCAGAATGGACTGGGCCGGCAGCAGGCCTTCCATCAGAGGGCGAGCCGGATAGACATAGCGCTCCACCATGCGAAGCGCCGCAGTCGGCTCAAGGCCGCCGCCGATCAGTCCGAGACGGATGGTTTGACGAAGATCGGCCTCGCGCCAGGTGCCGTTCCGCATCCTCTCTAACAAGGCCAGCGGGCCGCAGTCCGCCTTTTCCTGCAGCTCCTCGAGCTGGCCGAGCGGAAGGGCAAAGCTGTATTGCCCATCCGCCCAGGTGAAAGTCGTCTTGCAATCCATGGATCACCAGCGATCAGGGCGCGATCAGGATATTGGTCGAAGTCAGCTCGCCATCACCTTGGGCGTCGAGAGTGACCTGATATTTCTCGCCGCGCTGCGCCTGGAGCTGGATCTTGATATGCGCCTTGCCTTGGTAACGCTTGTCAGGCGTGCCGGTACCGGTGCCGGCGCCCTTCAGTTCATGGCGAACGCTCACCGACTTGGAAGACAGCGCCGCAGCTTCGTAGAGTGCATAGCTCTCCTTTGCCATGACGCCCTGAAGCTGCATCGACCACGACATCGTCGTGACCCCGGTCTCCGACCAGACCGGAGCGTCCGGGTCGTCGCAATCCGGCACAGCCGTCTCATTGGTCGATTTGTCGATCGTCACATTTGCCTGCGTAAAACCGCAGGGCGCGACGAAAACCTCCGGATCGGCACCATCCCCGATCAGGAAACGGCCGGCGCCATAGGGAATAGTCGTGGGACGAGCCATCATGGCCTCCTTTGAAAAGCCGGCTCGGCCGGCGTTGCGATAGGGATGGGATCAGCGCGCGTCGATCAGGGCGCGCAAGGTGACCACGGCGTGCGCGGTGACGCCGTCGGGATCGTTCATCACATTGGTGCTGCGATGCTCGATCAGGTGGAAGTGCCAGGCCGGGCCGAGATCGAGTTCAGCCTCATGCAGGCTGGAGCGGATCTCGCTCGCCACCGTCTTCACCTCGACCTGGCCGCCGACCCGCGACCAGGCATGCAAAGTGAGGAAGATCTCGACACCGTCGATGCAATCGGCGCTGTCGTCGACCGTCTGGCTTTCGCCGAGGCCGATATAGGGGAAGGCGACGGCCTGCGGCGGCCGGTCATAGACCCTGCCGGCCACCGAAGGGCAATCCGCCTTCAGGCGATTGTAGATAACGCCCTGCAGGGCGAGGGATGGATCAGCCATTCGCCCCCTCCTTTATGGCCTTGGACGCGGCGCGCGAGACGCGGGATTTCGCCCGCTTCCGCAGGGCTCGATAGGCGGGATAGAAGAAGGGCTGCGCCGGCGTGCCCGGGTTTTGCGCTCCCTCGAACATTCCGCCCGCTTCATGTGGGGAAGTGCCGAACTCGACGAACGCCGCGTAATACGCATCCTTGTCGCCCGCATGGACCACGACGGTAAGATCGGGATCGCCGCCCTGCCCGCTGCCGATGCCGCGCACATTGGCATTGTCGGGCGTATGGCCGCCGAAGCTGTAGCCGATGCTATTGGCGAGCGCCCCCGTCCGCTTCGGCGCCAGGCGCTTCTGCAGCGCGACGATCTCCTCGGCGCTTTGCTGCAGCGCCTTTCGCATCGCGGCACGAGGCTTGCCCTGGATCGCCTGCAGCTTCTTCAGCAACCTGTCCCGGTTTTGCATGGCCATCGGCGAGAACTCCGGCCGCACGGGCGGCGTCAGCGTGGGCGCGAGGGATGAGGAGCGAGTGGCCGGCCTTGTAGGCGACGCAGACGCGAGGGTTCGGCCGAAAGTCGAAATCAGCGATGAAGACGACGCGAGGCATCAGATCGCCACTCCCATTTCGAGATCGAACTCGATCTCGGCATTGTCGGTGCGCGGCCGTGCCGCGCGAATCTGCGCGACCTGGTCCTTGTAGGGGCCGGCGACGAACACGGCCCGATCGGCTTCGGTGACCGTTGTCGTCACGCTGTCGCGCAGCACGGTGATGACGGCCCGCTGGCTGGCCTGCGGCCGCCCGGCCTCGACCGGCTCCGCCCCGAACCGCGGCGGCAGCAACAGCCCGCACCAGCGCGAGCCGAGGTCGAGCGGATCTTCCCAGCCCTGCGTGACGTTGCCATACCCGTCGTCATGCGCCGTCTGGCGCTCGAAGCGGATGCGGTGATTTCTCTTGCCGGCGCCGAGCATCAGACCGTCCAGACGCGATAGGGCGAGAGGAGATTCGAGACGGCGAACGGCAGCTCGGCCTGGTCGCTCTTTCCGACGGCCTCGCGCGCGACATACCACTGCCCGACGAGAAGCAGCACGGCCTGCCGGATCGGCGCCGGGATCTCCGAATAACCCGCCGTGAAGCGGACGCGCACGGTTTCCGGGCCGTCAAGCGTGACCGGCCAGGACTTGCCAGGCGCCGGGCGCAGCTCGGTATCGGCGAAACGGTAATCGGCCGGCGCGAGCGTCTGTTCGTTGCCGCTGCCGTCGAGATAGAGCACCGCCGTGACGGCCGAGACCGGGCGCAACGGCAGGGACAGCCATGGCACCCAGTAGCCGCCGAGACGCTGGCAGAAGCCGGCGAAGCGGGCTTCGAAGCCCTGCTCGCGCACGGCCCGGCCGAGCCAGCCGTCGGGCGCATCGATCCAGACGAAAGCCGCCTCGATCAGCCCCGCGATATAGGCGTCATCCTCGGTATGCTCGACGCGAAGATGCTTCTTCGCGTCGTCAACCGTGATGCCCGGCTCCACTGCCGGCGTCACGACGGAAAGCCGCATGATCAGGCTCCGGCCTTCTTGTTCTTCGGAGCGGCGGCCTCGGCCTTGTTGTCGGGATCAATTTCGGCCTTGTTCAGGCCTTCACCTTCCTGCTTGCCTCCGCCGTCGCCCGCCTCGCCGGCCTCTGCCTCAGTCTCGGCGACCTCCTCGAGGCAATTGAGCCGGACCAGATGGGCGGCATCCGCCGTCGACAGCTCACGGATTGCGCCAATGGCGAAGCTTTCCGAACCGGCATCCAAAGGCCGGAGAACTTTGTATTTCGGCATTTGTCGTCTCCTATGAAAACGGCGGGGCACCATTGCCCGCCGCTCAAGTTCAAGCCGGCGGAGCCGGCGTTAGGCGACGCGCCCGAAATCGCCGTAAATGAAGGCTTCCGGGCGATACACGGCGAGCGCCAGGCGCTCTTCAGCCAGGATGGTGACGAGGTTCTTCACGAAGTCATCTTCGTTCTCGGTCGCGACTTCGACACGTCCCGCCCAGCGGTCGAACACCTGGGCACCAAGCCGGAACGCACCGGTCAGGAACTTGTCGACCGCCATCGCTTGCGTGGTCACGACGGGCAGACCCCAGAGGGTCGGCGAGATCGTGCCCTGCGGATTGCCGATGATGTAACGGCCCTCCCCGTCCTTCAGAGTCTCGATCCACGTCCAGTCGATCGGGTTCATGACGTGTCCGGTCGCGGGGAATTCGGCAAGAGCGGCCTGCAGCATGGCCAGGCGCATCAGGTCGATGCTGGTCGGATCCGTCAGCGTGATCGGAGCGGCGAAGGCCGTTGCCTGCGGGATGATGCCCATCAGGTTCTGGCCGGTGCCGTCACCATTGAGCAGCTGAGCCTCCTCCTTGAACGCAAGGCCATAGAGGAGGCGCTGATCGATGATTGACTGCAGCTGCGGGATATCGTCCAGCACCTGGCGCGAGGCCTTCATGGTGTGGGCGATGACCTTGGCCGAGGTGGTCACCAGATCCATCTTGATGTCGGACTGCGGCTTGAGCGCGCCTTCCGCGACCGGAGCGGCATTGTTGGTGAAGCCGGTTTCCCGCACATATTCCAGCGCGTTGCCGCTCATGCGACCAGGCGAGATGAGGTTACGGACGAAGAGCTGGCGCTGCGGCAACTCCTGAATGCCGGGCAGGCGCGTGGTCTGGATGGCGGCGCCGACCGAACCGGGAGCATTCGTCGTGGCAGAGGTCAGAACGGCCTTCACCTCAAGGGAGGCACGGCCGCGCTGGCCACTCTTGCCGACGAGTGTCTTATAGCTCTCGCCTTCGACGAACTGCTCGCCGATGGTCTTGACCTCGTCTTCCTCGCTCGTGTTTCGCGCGAGCTTCTGCTCGATCTGAGCAATCTGCTCGCCAAGGCCGTTCATCTTGATCAAGGCTTCGTCGGCCTTGTCCTTGGTCTCGCTGGCGAGCGCGCCCTTTTCCTTCGCCTCGGCGAGGGCCTTCTCGGCGATATCCTTGACCGCATCGAGCGACTTCTGGAACGAAGCCTTGACTTCGACAGCCAGCTGCTCGGCCGTCTTGGTATCGATGCCACCGCCGCCATCCGGAGCGAAGCAGATACGGGGGCCAAATGCGGCGGACGCCAGGAACGCAATGGTGCCCGCCGTGAAAACACGGTTATACTTAGCCATGACGGCTCTCCTGTGATGGAAATGGTGGTGAGTTAGGGCCGCCCTACCTGAGCAGGGCCGCCAGGAAGGCGCTTCGCGCATCCGCCTCGCTTCCAGCATCACGCTGGTCGATAAGGCGCTTCAGGCCATGGGCCGCAATGGCTTTGGCCTGGCTTCGCGAGAACCCGCCTGCATCGCGCAGGAGATCCTCGAATTCTGCCAGAGTTGGGAGCTTGCCAGCTTCAACCGTGCTCTTCACGCCGGTCACGCGCGCCTCGATATTCATCGGCATCGTGACCAGTGAAATCTCGCGGAGGTCGATCGTCTTCAGCCTGGTGACGCCTCGACGCTTCTCATCGGGCTCGGCGCCTCCGGCCGGGATGCGATAGCCGATCGACATGCCGCCGAGCGCCTTTGCCTTCAGCAGGCCGTGCGCCCGCTTCGCGAGAGGATCGGCGTCGATCAGCAAGTTGCCCTTGACGTACAGCCCTTTGGTATCCTCGGCCATGTCAACCCAGACGCCGATCGGTTCGCGCTGATCATGCTGCCAAAGCATCGGGATGGTGCGGCCATCCTTCTTGGCCTTGACGATTCCTTCGATGAAAGCGCCGGGCTCGACGACATCGCCGCCTTGGTCGACATTGCCGAACGTCGAGGCGTACCCCTCGAATTCCCCGGCATCGCCGACGGCCTTGGCGTCTAGAGCAAAGTCATAGATCTTCATGGCTTACCCTCCAGCGGCGGCCCGCCGTTGTGGCCGATGCCGGCCTGCGTGATCGGAACGTTCTGCATCTGCATGCGCGGGACATCGCCCCCCTCGACCGGAGGCAGGTTTTCCAAGCGACGCACTTCGTTGATGGTCATGACCCCGGCATTCAGCATTGCCGTGTAGAAGGCCGATCGGGCCGCACTGTCACCGCGCAGCAACCCCTCAAGATTGAATTCGACGGTGACGCCGTTGGCGCGATCGAACGGCGAGAGGATCTGTTTTCCGACCGCCATTTCGATGCGCTTCAGGCGACGGCGCAGCGTAAATTTCACAAAGGCCAGTACCTGCTGTTCGAGCCCCGTACCCCAGCTGGTGGTCTTTTCCGTGTGGCCGATCATGAACGGAGGCACGCCGAACCAGCGGCAGACCTCTTCGACGCTGAAGCCGCGAGACTCAAGCATCTGCGCGTCTTCCGGATTGATGGTCAATTGCTCCCATTTCGAGCCGCCCTCGAGCACGAAGGGTCGACCAGCATTGAGCGCGCCGGCAAACTTCTCGATCAGCTTTTGCTCCACCACCGACCGTTGCGTTTCAGTCAGGAACTTTTCGAAGGTGAGCGCCCCGGACGGCCGTAGGCCATTGGCGAACGTCGCCCCAGCGGCGCGGTCGATGGCCTGCGCCAGGCCAAACGAATTCCGAGCAAAGGCCAACGTGGACAATCCGCCGAGAGCAGAGCCTCCAAAGCCGCGGATGTGCAGAACGCCATCCTGATTGGTTACATAGCTCCTGCCATCCTCGGACCATCGATATTCCAGATCGCCGGAAGAGAGGCGGCGGACCTGCATAAAGTCCGGGCTGATCGGCGTCATTGCAATGACACGCTGACCGCTGCGCTCGACCCTGGCAAAGCCATTCCCGCGCAGATCTACGCTCGCACTAAGGAATTCCCAAAAGTCGAGGGCCGTCTGCTCCGCATTCGGGCTATCGTGAATAATCCGAAACAGCGGGTGGTCCGAAGCGAGCACCCGCTCGCCCTTGCCATTGTTTCGATAGACATTGAACGGTAGAGAACCGATCGAACCGGCCAGCAGGTTGATACATGCCCAAACGGTCGATAGGCCGAGAGCGGATGCTGTAGTAACCCCTTCGCCAGAGGCCGCGAGACGCAATCCAAAATCTTCGGCGTGCGTTACTCGAACATCTCGCCCGAGATCGGCCTTCCGACGAAACCAGCTAAATAGGTTCACGATGCTGCCCCCAGGCTGGCGATGAAACTGTCGAGATTCCCGCTGGAGGCCTCCGGGTTGCGGGCCATCAGCGTGAAGGCATTGAAGGTGGCGATCAGCGGGTCGATTTTGGCTTTGCCAGCCGTCTCTTTGGTGATCAGGACGGCGTTACCCTTCTGCTCGACCTTGGCGTTGCCAACGCACCAGGTCATGAGATCCTGGCCGCAATGCCAGAATGTGCCGTCCTTGAGCTTGCGCTCCATGCCCCAGACCGCCGACGAAAGCCTGAAGCCCTGGCTGACAGCAACAACCGGCCCGCCATGCTCGGCTGTATGAACGCCGCGCGCAGCCAACTCATCGACCATCGCCGCGACGCCGAACGGGTCGAAGCCGACCCCGTTCGCTTCCGGTAGGAGCCCCGACTGATGCAGGCGCTCAATCACGTCAGCGACCTCGAGGATATCCTGTGTAGCGTGGTCGCAGAGGGTGAAGGTTCCCTCTTTTTCGAGATCCCGCAGCTTCGGGACGATTTCCTCACGCAGCTCGTAGACCTCGCTATGTCCCCAAGCGTGGTTCCAAAGCAGCCAATCGCGCGTTTCACGATCGCGGCCGATGACAGCAAGGCCCAGAAGATCGTCGAGCCCGCCGCCGTCGATGCCCGCGACCACCACCTCGCTGCGGGCGAGTAGCTCATCGAGTGTCAGGCCCTCGATCGCAGCGCCAAGCCAGTAATCGACACCACGCCAACGATCGCCGTGCTGGCCGATGCCGATCTCGACGTTGAAATGCTGCGAGGCCAGCAGAGCGAGCTTTTCGGCCCCTCCAGTCTTGGCCTTGGTCAGTTCGTCGCGGAGGAAGGCCTCATCAACCGACCGGTTGAGATTCGGATTGACCATCCCCCAGGTTTTCGGGTCCTGCCAGCCGCCATCCTTGGTGAGCCCCACCGGCAGCTCATAGAGCACCGGCAGCAACGGGAGATCGAGTTCACCGGCCCTGACCTGTCGGGCGATCGACAGCTCATCCTTGAACACGCCAGAAGGCGGGTCCTTCGACTGCGTCGTGATCTGCAGCATGAAGCCGTCAGGACGCGCCGCCAAGGAACCACGGATCTCGACGAAGATGTCGGCCGCCTTCGACTTCTTGGCGAACACATGCGTTTCGTCGATCAGGATATAGGTCGCCTTCGAACCCGTGATGACGTCGGCATCGGCGGCCTTGATCGCGATCATCGCCAGCGAGATCCGATGCGTGATCGTGCGCTGATGGGTTTGAGGGTGGAAGATCTTCGACAGCTCCGGATCGAGCCGGATGATGCCGGAGGCCTGCTTGAAGGCGATTTCGGCGATCTTCTTCGTCGGCGCGATCAGCAGGAGTTCGGCTTCAGGCCTCCGATTCATGATCGCGGCCGTCACCATGAGCGCGGCCGCGATGCTGGATTTCCCGTTCTTCTTGGGGATCATCAAGAAGAATTCGCGGATCATCCGCCGCTGCGCGGCCTGGTCGTAGGAGCCGAACAGCGCCCGGACGAAATCAAAGACCCATTTGCCGCAGGCCTCGCCATAGGTCGGCGTGCCGATGATGTCGGGAACGCGCAGGCGCTTGAAGATGCGCAGCGCCTTCGCCGCCTGGTCTTCGAAGAGAGGAAGATCCGGGACGAGAGGCAGGCCGGCGACGATCCGATCTTCCCAGTCCGGGCAGGCCGTGCGCCATGCCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP043489|336741:369695|345951_346332_+|WP_149251107.1|DBSCAN-SWA MAHPAELPFDLPFDGEMPMPAALALAPSYDGPFAAEFAKLTRENPTIDPRRWEQAKKDAAEFLADWGEQAAELGWTAKELFGLHPVAPLTRYDQMGLVWLLNGKEVDEITDRTAKIGATTFYRTTA >NZ_CP043489|336741:369695|361987_362452_-|WP_149251127.1|DBSCAN-SWA MMARPTTIPYGAGRFLIGDGADPEVFVAPCGFTQANVTIDKSTNETAVPDCDDPDAPVWSETGVTTMSWSMQLQGVMAKESYALYEAAALSSKSVSVRHELKGAGTGTGTPDKRYQGKAHIKIQLQAQRGEKYQVTLDAQGDGELTSTNILIAP >NZ_CP043489|336741:369695|344429_344696_-|WP_149251104.1|DBSCAN-SWA MEVWDYLERPNRTSATQAEEDALQKIIAECGAANTALLWLYRVALISHQENEESLDRIERLGRLHRAEIRELQGLPPEEWRDPYRAPQ >NZ_CP043489|336741:369695|340573_340912_+|WP_149251097.1|DBSCAN-SWA MKKLILAAALATMGFAGLSTGASAMPAAPAIGADTGFVETIAYGCGRGYVPNRWGRCVPNQRPAPAYRPGRFFGEAPGYRRGPVPANRNGYGRACPPGTRLGPRGGYCHPVY >NZ_CP043489|336741:369695|352220_352646_-|WP_149251116.1|DBSCAN-SWA MIIEYMGPLGFAPGMITHVIMVYRPELVDHMRANGAAFIEIDETSADPDKRYVDDGQVKDRPVAQIALTAGQILADGVDTAMLQGVPAGWEIRMDGSTFGIADGTDIEIVSSEPHAYAIEAVGPWPFQPWRGQVVAVAPGS >NZ_CP043489|336741:369695|344172_344433_-|WP_149251103.1|DBSCAN-SWA MTRPAPEPVPDDLEASTDDVIAACEGDARAAVRVLLVALHHCQAELEQRNDEVAQLAQDISRGYSRGRWEDLLTRAEVPIPYKPDD >NZ_CP043489|336741:369695|363491_363854_-|WP_149251130.1|head,tail|DBSCAN-SWA MLGAGKRNHRIRFERQTAHDDGYGNVTQGWEDPLDLGSRWCGLLLPPRFGAEPVEAGRPQASQRAVITVLRDSVTTTVTEADRAVFVAGPYKDQVAQIRAARPRTDNAEIEFDLEMGVAI >NZ_CP043489|336741:369695|364434_364698_-|WP_149251132.1|DBSCAN-SWA MPKYKVLRPLDAGSESFAIGAIRELSTADAAHLVRLNCLEEVAETEAEAGEAGDGGGKQEGEGLNKAEIDPDNKAEAAAPKNKKAGA >NZ_CP043489|336741:369695|344835_345237_+|WP_149251105.1|DBSCAN-SWA MNLNAQFQGNLQPGDTKKLAIYVPGATREELEKGHQAAAAFFEENMCLPAQAAAAFFKLESIEFDPSVEMTDREARIADVWQEAQEIAAKAICEGWREPAKFASFALGISTKKLAADCERNREMRGDTTLPHR >NZ_CP043489|336741:369695|353950_356419_-|WP_149251120.1|DBSCAN-SWA MGFIASAVSAVIGFIGSLSVGGFSIGRALLGIGLSLGLSALQKLLAPKPKATPSGVQLNLQLGSDVGRQVVFGRIATRGHLAFFQTSGEENKRLELVYILSEGWCDGLEAVWIGGKKKTLTQLSTDGNGMTTYSIEGYGGFARVRFHDGRPGQAADAITVANSASRLTMNDRFAGMCYAAVTLIYDPDQFASGPPEFTWQMRGYRCYDVRKDSTAGGIGLHRLNDPATWQWTENGAIHGYNFLSGIVAEGQVVLGPGIPAYDLIASTWIAAANVCDETVALTAGGTEPRYRCAYVLVAEDADFRSLIEPVIQTMAGYLIEYQGAFGLLAGAAQVSVVTITDADFRVGPEQRFSLGLSRTERTNEVYGQFVDPAAGYQANSYPPIISASARAEDGEPLRVSLDLAAVPSGTQAQRIATARLRETRAQANGTFTLGNHLLFLDPGDWITWQSARYGWTKLFRVMTRQLNLDDSITVALREVNYQIYAWSSSDEKPIEIPGGAPGDPGYMSTVSSFSAEAIMIDAESGAQRPAVKLTWTPVADPTVDAVIIEYRRINAVAATRLRDDSPEDGAYVFDTVGAGDQYEFRATIATTPVRPTTWTTWVGVQVGQSQLPPGSINFDDLAQSAKDAFKPIVPIRYLALATIEEATQSFTQETSNHVQDTNARASFTSQIRLQVSATEALATKVDILEAQVGEDIQAQIAQEATARANADGALSQRIDTVSATANGNTAAIQTVSQAVASVDGKLSATWMVTLDVDGKIAGLKAYNDGTVSSFQLTADHIIINALADLSINTGALNVSGDMNIGNKIFFRPGSNPRIEIYD >NZ_CP043489|336741:369695|351314_351680_-|WP_149251114.1|DBSCAN-SWA MKIVLLPDMAARRAEAEGLVDRHFGPEIGRLCQFSDLYRRKVDEARDVIAGKGPGPLICAEADARGDFVDFIAETILAKSAANAEALAEVEQQRLAAKSLVRAAAAPAALETVLSDLGISR >NZ_CP043489|336741:369695|341309_341681_-|WP_149251098.1|DBSCAN-SWA MKDWVFAALVASVGFAATSASAMPASPNAVVDNDLLVSIADGCPRNYYSHYGRCVPNARGYRGARYGDEYQGQRYYARPYDYDRPGYFDGPGYYDGPRYWAVPRWGCPPGRHTGRWGRCVPNW >NZ_CP043489|336741:369695|356868_357486_-|WP_149251122.1|DBSCAN-SWA MARVISQALLDALKAGRVARADLLLFDFPSGLYGFFSGEGTFTYNGVTYKGAGSLFQVDAIGGVSDGTAVPMNIRLNASAELPMNMLASIENEIYRGRPVTMARAYLDPDSYALLSVETVYRGYIDTIDHVESKGEGAIADASPGEAYLEAHVESRALDMGRSGWRRRSDADQRTLDPNDGSLRYLQSTATAEIWWGRVTPRRAR >NZ_CP043489|336741:369695|339034_340462_+|WP_149251096.1|DBSCAN-SWA MSRSPVSQPARIAPLAVLPVFYRLTGKPVLVIGASQSALWKAELLASTGAEVRIAAPEASDALAQLCAREDMSVRHLARSWHEADLDGIALVVADVEDDEQAAHLAAAARRAGTPVNIVDRPAFCDFQFGAVVNRSPLVVGISTDGAVPVFAQAVRSRIEMLLPTGFQKWAEAARSWRPLVQARNLSLQARRAVWERFVNAAFRHPDRTPGEADRADLLAAVEQQERRPAHGSVALVGAGPGDPELLTLKAVRALQSADVILHDDLVSQEVLDFARREAERIITGKRGHRPSCKQGDINALMVSLARDGKRVVRLKGGDPLVFGRAGEEIAACRAAGIAVEVIPGITAASGAAASLEVSLTHRDHARRLQYITAHAKNGKLPDDLDWKALADPAATTAIYMGKLVVAEVAARLLEAGMAADTPAIIVEYATHERERRFHTVIGEMAGVFAANQLDGPCIILFGQAMSEARPDSAS >NZ_CP043489|336741:369695|336741_338061_-|WP_149251095.1|tRNA|DBSCAN-SWA MRLSRYFLPILRETPKEAEIVSHRLMLRTGMIRQEAAGIYALLPLGLRVVNKINAIIREEQNRAGAIELLMPTIQSADLWRESGRYDDYGKEMLRIRDRQEREMLYGPTNEEMITEIFRAYVRSYKDLPLNLYHIQWKFRDEIRPRFGLMRGREFLMKDAYSFDLDQAGAQHSYNKMFVAYLRTFARLGLTAIPMVADTGPIGGNLSHEFIILASTGESEVFCHRDYLSMAPPSAGTDFDDRAGLQAIVDQWTSLYAATEEKHDKPAFEALPADKQVSARGIEVGHIFYFGTKYSAPMKAVVAGPDGSLIPVHGGSYGIGPTRVAGAAIEASHDEAGIIWPEAIAPFKVGLINLKPGDSAVDAACGDLYLQLGAKGVDVLLDDTDERPGGKFAKMDLIGLPWQIIVGPKSLAEGKVEVKNRRTGERQLLTPSDALVLVG >NZ_CP043489|336741:369695|356418_356826_-|WP_149251121.1|DBSCAN-SWA MTRLEGWESALFEALAPHMAAPFAWGASDCFLMAMDAMRAVTGEDPYGEHRGRYKTAAGAMKRIRGKGFRDLEGAMDAICERVIPSLARRGDIVSLMADDGIALGVVVGDGILCKAPAGIQINPLWTALHAWRVG >NZ_CP043489|336741:369695|347789_348029_-|WP_149251111.1|DBSCAN-SWA MKLVPNWRAVLRHAWSVRLLLLAAVLSGLEAALPLLAPYLPIPDRLFAALTGLTVGAALFARFLAQKGVSDAGPKDQAD >NZ_CP043489|336741:369695|364767_366066_-|WP_149251133.1|capsid|DBSCAN-SWA MAKYNRVFTAGTIAFLASAAFGPRICFAPDGGGGIDTKTAEQLAVEVKASFQKSLDAVKDIAEKALAEAKEKGALASETKDKADEALIKMNGLGEQIAQIEQKLARNTSEEDEVKTIGEQFVEGESYKTLVGKSGQRGRASLEVKAVLTSATTNAPGSVGAAIQTTRLPGIQELPQRQLFVRNLISPGRMSGNALEYVRETGFTNNAAPVAEGALKPQSDIKMDLVTTSAKVIAHTMKASRQVLDDIPQLQSIIDQRLLYGLAFKEEAQLLNGDGTGQNLMGIIPQATAFAAPITLTDPTSIDLMRLAMLQAALAEFPATGHVMNPIDWTWIETLKDGEGRYIIGNPQGTISPTLWGLPVVTTQAMAVDKFLTGAFRLGAQVFDRWAGRVEVATENEDDFVKNLVTILAEERLALAVYRPEAFIYGDFGRVA >NZ_CP043489|336741:369695|351801_352224_+|WP_149251115.1|DBSCAN-SWA MLKSKSHLVSAVAAGFIFFGMNAAYAASSDDYSGKVTVLNGHEVSLFGHSDLNPDCTKFGYSTITSVVRPAHGTIRMVHQKIFAVFGATNPRYQCNTKGGPGIRVYYRANKGYHGSDHAVFSVYSAYGRKGTATIDISVD >NZ_CP043489|336741:369695|363853_364432_-|WP_149251131.1|head,tail|DBSCAN-SWA MRLSVVTPAVEPGITVDDAKKHLRVEHTEDDAYIAGLIEAAFVWIDAPDGWLGRAVREQGFEARFAGFCQRLGGYWVPWLSLPLRPVSAVTAVLYLDGSGNEQTLAPADYRFADTELRPAPGKSWPVTLDGPETVRVRFTAGYSEIPAPIRQAVLLLVGQWYVAREAVGKSDQAELPFAVSNLLSPYRVWTV >NZ_CP043489|336741:369695|366109_366769_-|WP_149251134.1|head,protease|DBSCAN-SWA MKIYDFALDAKAVGDAGEFEGYASTFGNVDQGGDVVEPGAFIEGIVKAKKDGRTIPMLWQHDQREPIGVWVDMAEDTKGLYVKGNLLIDADPLAKRAHGLLKAKALGGMSIGYRIPAGGAEPDEKRRGVTRLKTIDLREISLVTMPMNIEARVTGVKSTVEAGKLPTLAEFEDLLRDAGGFSRSQAKAIAAHGLKRLIDQRDAGSEADARSAFLAALLR >NZ_CP043489|336741:369695|352651_353116_-|WP_149251117.1|DBSCAN-SWA MVQRARFGFRPSDGKPQFRIVLPGVDIDAATWAQIVFDADYAVTRLAYSGSVAYSGAQGRGFQTLLTWPDMGYIPFTLIAAYSANGYRFYQSDPPIGPYGPTWAGEAIMACTYMATYNDINSVDATVTRTGLQARSCFSFSGQLTYAVFGIPAG >NZ_CP043489|336741:369695|348777_351318_-|WP_149251113.1|DBSCAN-SWA MSNLILTVTVTCAAGSTAAAIAGNPMKGDGLAPIVTEGAWLIVNMGGGIFAQVPVAADATADNALTLLHPLPSAVTGASGWVAVDPDKDSIAIATMQAVQAIRARFTPASVWFTGNGPPSNDIGVDGDLYRDYDESPPIEYRRDNGAWGISPAATVLPWNARGDWAAGTTYAKQDLVTYNGSSWLSRVDANTGNTPTTAAAQWQLVARSGDQILMGAGAPAVGLGVDGAAYINTSNGDLYKKAAGAWGAPQGNIKGPQGNAGPANTLAIGTVATGPAAATITGASPNQTLNLTIPKGDTGPANTLAIGTVASGASAGATITGTAPNQTLNLTLPKGDTGNTGPAGLTPKGAWSSATAYAVNDLVTSNGSSYRCKVAHTNQAVTNTTYWELFAQKGADGTGTGNVTGPAAAVVDTEVAIFDGTTGTLIKSGGVQFSLMSRLNRDQTYTQPQRFAAATFAAGFAAYFNGDAYIAERPPGDSSPSLACTSFVRRHIGNFAGFVFANVNVTLGVNDHGKAVQASSATPFNITLPVGGVWGAGAAGPAITIFNHGTADVTVVRQGSDFIYCPPAGLGNTNTSLTLRPGENVVVINRGGTEWDVCGGSWLTSNATPLPPKAIAGLSAGFAIGDLLYADSTTTLAKLADVAGGNVLRSGGVGVAPSWGKVGLTTHITGTLAVANGGTGLTALGTALQVLRTNAGATAMEWATLGSAALSASTDFATAAQGAKADTAVQPAALTAYEKAGEQVGINTQTGTAYTLALADKGKIVEMNNAGANTLTIPANATVAFPVNSRIDLSQFGAGQTTIAADAGVTIRSAGGKLKLTGQYSGATLYKRGSDEWVLIGDLAA >NZ_CP043489|336741:369695|357485_358136_-|WP_149251123.1|DBSCAN-SWA MAITYPRAMLAGCQATDTMRLIRTEAASMLSSGATQAIQLAPPRWRCDYTTKPLNMAERRPWRAWLDSLNGALRTFLGHSPLQIWPGAYPGGFAGMNRASGGAFDGTAQASAVSATGISIIGLPAGFVIGAGDLLGLVQNGKYGLYRIMEPVVAAGNGTASVTLSPAVNTSLFSIGATVNFARPVCEMIIDPATAPDASAELSRKPVTFTGIQRLY >NZ_CP043489|336741:369695|361566_361974_-|WP_149251126.1|DBSCAN-SWA MDCKTTFTWADGQYSFALPLGQLEELQEKADCGPLALLERMRNGTWREADLRQTIRLGLIGGGLEPTAALRMVERYVYPARPLMEGLLPAQSILAAALWGVEDDQPGKDQADGMAEPGLAETESSASPNSTAPEP >NZ_CP043489|336741:369695|346328_346658_-|WP_149251108.1|DBSCAN-SWA MNDQSASEKEHWVVDKKIPVALVITLLVQTGGVVWWSSQLNSRVNMIEERNIKLEIEIGKLKDNGGEIRERFAKAETTMLAFVETTRRIEAKLDRLIDDKQPSRASGPR >NZ_CP043489|336741:369695|348085_348781_-|WP_149251112.1|DBSCAN-SWA MRFGLGFVAAKSAALPVVGAYRTAAGQAAGASLTLSNVDIGTASASRLVVVVAMGLNTALVANPTCTIGGISATLVANQQNALSGTSDNTCIFAAFVPTGATATVVVSFGASMNRSHASVYTIDNAKSVTPDTTAKLALKGQSSGGAKSITLNGVLANSFAIGYCHTYTSSVLTHTWTAPYTTDVNATVATTSKCSSASNSEAAAGNKALTCTLSSTTAINGGAMIGASWS >NZ_CP043489|336741:369695|343303_343561_+|WP_149251100.1|DBSCAN-SWA MAGKLHPLIQGPRVDLDISDRLLECEEALERSFQDLVERAELAGWKTIEITCALQSLADHHMLAKAANEETDLQIADALNRSHGP >NZ_CP043489|336741:369695|358135_360955_-|WP_149251124.1|DBSCAN-SWA MPTDVERLIVTLEASTKKYENALAKAKSQTDRQASAIDQRFKKLSQQIDSSMERGNKSMARGFDASGMSRSFSGLAQSLRSTTGLVGIITAAVAAAGAGVVNYGDRWNAIGNRLKTAGIAVRNIGDEQSRVSSVALKAHSELEATSDLYVKMLGVSQSLGQANSSAADATQAVAQALSLAGVSADAAKGTITQLGQALGSGVLRGDEFNSIMESLGTQSPLIRSIAREFGVSANELRGLAAAGQLTSDRVFKAIVAAKPEIEALADGTVPTLSQAWTDLDTATTKWVATMVGGEEAANSVAAAMADLAKKIDGVSQAYEDAQKRRRAFDDEATLKAPQLIRDNIAEIEKQIQAEQAAGASDYYLDNLRLALQSEKDRLKEAEKVAGVKKEELATVQKITDLQQELLREAARTRGTSGHADPVQDMLDKRSRDAGLSSKEAEIAKATDDIVEAMKKAGTKLTGFTLQDAARAQAEREYSLRAGVQATEAIVKSYVDRVVKAESGGDPRAKNPNSTATGVGQFIESTWIALFRKYYPQQAESMGRDAILDLRKDGNISRDLIEKYAAENAAVLQKAGISVNEAALQLSHFLGAGDAAKVLKAAPGTPLQGLISQKSINANPSILGGGRTVDDAIAYARRRANDTRVAAGDLTADERRKQTLDEIIKKNQQDIDGLKLKGAALTQTAYQTEYLAKRQELLNEATQSGIALTPQVVASLEAEARKYAESAVAREAAQKKFDEMIDAQREFANLAVDGLTGLIDGSKDLKGALEDVLKSLIRLVLQAALLGEGPLAGLFGGSSKSGGLLGGLFGGLLGGFSGGGYTGSGGKYQPAGIVHRDEYVFDQQATKRIGVANLERLRRGLPGFANGGFVGAGLPTSFKVSAVGATSAPGLVLNMPMSFDARGASADAIPALRREMQVMRQQIRQEVPSLVLNARKRGSL >NZ_CP043489|336741:369695|362500_362899_-|WP_149251128.1|DBSCAN-SWA MADPSLALQGVIYNRLKADCPSVAGRVYDRPPQAVAFPYIGLGESQTVDDSADCIDGVEIFLTLHAWSRVGGQVEVKTVASEIRSSLHEAELDLGPAWHFHLIEHRSTNVMNDPDGVTAHAVVTLRALIDAR >NZ_CP043489|336741:369695|366765_368001_-|WP_149251135.1|portal|DBSCAN-SWA MNLFSWFRRKADLGRDVRVTHAEDFGLRLAASGEGVTTASALGLSTVWACINLLAGSIGSLPFNVYRNNGKGERVLASDHPLFRIIHDSPNAEQTALDFWEFLSASVDLRGNGFARVERSGQRVIAMTPISPDFMQVRRLSSGDLEYRWSEDGRSYVTNQDGVLHIRGFGGSALGGLSTLAFARNSFGLAQAIDRAAGATFANGLRPSGALTFEKFLTETQRSVVEQKLIEKFAGALNAGRPFVLEGGSKWEQLTINPEDAQMLESRGFSVEEVCRWFGVPPFMIGHTEKTTSWGTGLEQQVLAFVKFTLRRRLKRIEMAVGKQILSPFDRANGVTVEFNLEGLLRGDSAARSAFYTAMLNAGVMTINEVRRLENLPPVEGGDVPRMQMQNVPITQAGIGHNGGPPLEGKP >NZ_CP043489|336741:369695|362891_363356_-|WP_149251129.1|DBSCAN-SWA MAMQNRDRLLKKLQAIQGKPRAAMRKALQQSAEEIVALQKRLAPKRTGALANSIGYSFGGHTPDNANVRGIGSGQGGDPDLTVVVHAGDKDAYYAAFVEFGTSPHEAGGMFEGAQNPGTPAQPFFYPAYRALRKRAKSRVSRAASKAIKEGANG >NZ_CP043489|336741:369695|345247_345904_+|WP_149251106.1|DBSCAN-SWA MCNLFNQQMTQEELRQLGPVIRDTVNWPDAVDVYPDYPSPIIRDGADSVRELVLARWGMPTPPKFLEGKKSDPGVTNIRNVASPHWRRWLGPESRCLVPFTAFSEYSDSEKNEKGGKALKWFALNDNKPLAVFAGIWTNWTSVRKVKEGEVTADVFGFLTTEPNAVVKPHHAKAMPVILTTAEERDAWMRAPWNEAKALQRPLPDDQLVVVPRPASGK >NZ_CP043489|336741:369695|353493_353958_-|WP_149251119.1|DBSCAN-SWA MTDVCRMILGADGNGAYGVFATLPGHDARTADPADNLKWAFRSDWSRIENLHQVGRVTGLGPSSSLQKRVDFPALPYIPFVSVRYESAIGTMQDDPWCKGNYVEQAYFTPLRVHVGLDFFRVGFPVGSSQSAASPAVPASAVYFIWKIPVAVPA >NZ_CP043489|336741:369695|343859_344171_-|WP_149251102.1|DBSCAN-SWA MALDKPPQTVEDALDWGYTHLEFTCKRKRCQHVGHVPLAQFNVVLGFRRMILWDVFSRCKCSRCGLRPITGSLAIAREINGEKGFYAKRVDFYEGMTLRPSRE >NZ_CP043489|336741:369695|343538_343856_-|WP_149251101.1|DBSCAN-SWA MVYRKGEFQPSRLDREYLFHVALPTPKGVPNFHEVDEFVRTFGGAPRHHSVGEWAIHHTIFCFKEERYAAEFLDRFGGFSIEPSAKARKALLAELRARPKVHDYG >NZ_CP043489|336741:369695|367997_369695_-|WP_149251136.1|terminase|DBSCAN-SWA MAWRTACPDWEDRIVAGLPLVPDLPLFEDQAAKALRIFKRLRVPDIIGTPTYGEACGKWVFDFVRALFGSYDQAAQRRMIREFFLMIPKKNGKSSIAAALMVTAAIMNRRPEAELLLIAPTKKIAEIAFKQASGIIRLDPELSKIFHPQTHQRTITHRISLAMIAIKAADADVITGSKATYILIDETHVFAKKSKAADIFVEIRGSLAARPDGFMLQITTQSKDPPSGVFKDELSIARQVRAGELDLPLLPVLYELPVGLTKDGGWQDPKTWGMVNPNLNRSVDEAFLRDELTKAKTGGAEKLALLASQHFNVEIGIGQHGDRWRGVDYWLGAAIEGLTLDELLARSEVVVAGIDGGGLDDLLGLAVIGRDRETRDWLLWNHAWGHSEVYELREEIVPKLRDLEKEGTFTLCDHATQDILEVADVIERLHQSGLLPEANGVGFDPFGVAAMVDELAARGVHTAEHGGPVVAVSQGFRLSSAVWGMERKLKDGTFWHCGQDLMTWCVGNAKVEQKGNAVLITKETAGKAKIDPLIATFNAFTLMARNPEASSGNLDSFIASLGAAS >NZ_CP043489|336741:369695|342250_343036_-|WP_149251099.1|DBSCAN-SWA MDNPQLVADILTSKEFVSGVGGAVIGAVVGGLISYYLQKSNAKASKIEREAERFLTKQGVANSILFKVMQIHSDIAVAVRYMGSTLKRKKENKDDDMRRLWAYLLPRGNLSKSIEFTPEEMGMLIGLGDAPSTNELMNLGIQHADMLHLWKLYEKERGILAEKLPIDEAKGDVMSSMMSDPKFISAQPQIANVSTIALYIFNSTFMNYRDSEKALRGVYKTFKDKLKIPFEIAFINPPDLVESLALYEAELKATGPAKGEE >NZ_CP043489|336741:369695|360967_361366_+|WP_149251125.1|DBSCAN-SWA MTRRIALIALMLALAACTKEVKPEEAISSVGPMPLDYKAQIIANAKSNYFDPYSIRSAEISKPVPAKNELYGKYAWVVCVKANAKNRFGAYVGQQLDGYVFQNGKITQKSGHPETYCDGKPFEAFPELESIK >NZ_CP043489|336741:369695|347218_347818_-|WP_149251110.1|DBSCAN-SWA MPVQKIRPTKRASAAIAAVLAVATSIGGIWYVKQPGGDAVPAAVALAADTLIKPWEGRELRAYFDRIARPPRWTICDGDTQNVRAGMVETPAGCDRRLKRRLGDEFYPGLKACIDGFEKRPLAWQAMMISLSWNAGVGTACSSTAARLGRAGQYQASCLAATAYNKAGGRMVIGLVNRREMGDASRIGEAELCVSGLQP >NZ_CP043489|336741:369695|346868_347222_-|WP_149251109.1|DBSCAN-SWA MSLALAFLSTGLGRAVAAIGIVIAAYFYGYSSGRHEQARLDNVAALKAELAATKTDLAIASALAGIAEDETRAAERAVAENRDRIDDYQKALAARPNARCALGADDVRRLRALNPAP >NZ_CP043489|336741:369695|353122_353494_-|WP_149251118.1|DBSCAN-SWA MLRVLMSTNPARFRVSKPGFDAGTGATDDMLIDTDNVTAKVIMAGRLQLSGTIVVPFGVTLEAVPAVDMQSSDGAGYSLVPFRKWGYAQALVTCKPSTTNLTFKNTGTTVWLTYVVVAVNMPS |
42 | Sinorhizobium_phage(21.43%) | terminase,portal,tRNA,capsid,tail,head,protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
376975 : 386857
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP043489|376975:386857|DBSCAN-SWA ATCACGCCGCCTCCTGCCCGGCCGATCCCCGGGCCTCGGCGTCGAGCACCGAGAACAGGTCCGGCATGGCCATTTCGCGCGCGGCCGCTTCCACATATTTGCAGCCGTCGAGGAAATAGAGCGGGTTCAGCTCGCTGGCCGAGCCCTGGCGGCCGAGCTTGATCGCGCGATAGGGCACCGTCATCAGCCCGCCGAACGGGTCGAACACCATTTCGCCGGGCTCGGAGAATTGGGCGATGGCGCGATCGACGATGTCGAACTGCAGCGGGCACAGATGCCCTTCCTTGCCCTTGCCGGCCTGCAGCGTGTTCATGGTCAGCATGCGGGCGACATCGGTCCAGACGTCGTCATGCCAGGAATGCGGCGGCATCAGCATGAAGCCGGAAGGCAGCTGCCCCTTGATCTCCAGCGTCTCGGCGATGCGCACATGGTGCTCGAAATCGTAGATCGCCGAGAGGTTCTCGCCGCGCCAGGTCTTGTAGACCATGTCGGCGGCCAGCGGCACGAACTCCTCCGGCGCCATCTGGCGATTGCCGCTCGATCGGTTGAAGGCATGGGCATCGAGCTGCCAGCGGGCGCGGGAGTAGCCTTCCGGGTTCACCCAATGGCGGGCGAGGCCCTCGCCCTGCCATTCCTTTTTCGGCTTGATCACCGGCTTGTCGGCATAGCCGTTGGAAAGGTCCGACGGCGGCTTGCGGAACAGCAGCAGATATTCCGGCACGCCCGGCCCCATGCGCGAGCCATCCTTGCATTGCTCGCTCCAGCCCAGGCGATAGGTCTGGTTGTTCTCCCTGACCACGTCGGTGGTGATGATCTTGCGGCCGAGGAAGGCGAAGCCGTGCCGGCGGAAATGGGCGACGCATTCGTCGGAGAACGGCTCCAGAGTCTGGAAGCCGAGGCCGTTCACCCCGCCCGGCGTGATGCGATCCTTGACATGGATCGCCGCCACCCGCCCAGGCTGCAGGATGCGCAGGAGCTGCGGCGTGAGATAGTCCATCTGGCGCCAGAAATGGCCGTTGTCGTCGGTATGGCCGAAATCGGCATAGTTCGGCGTGTACTCGTACTGGTTGGAGAACGGGATCGAGGTCACGATCAGCGCGACGCTGTCACCAGGCATCCGGCTCGTCTCGTCGACGCAGTCATTGTTGGCGAGGCGGTAGCCCGGCCCCGCAACCTCGATCCTTTCCTTCACACCGAGCGCGCGCGTCATCTCGGCCGCGAAGGCGTCCCGGTTGAGGCCGTACTCGCGAATGATGGCTGTCATCTGTGCGACAAGCTCCTTGTGCTGGGCCCATTTGCGTTCCAGCGAGCGGCGCACCTCGCGCTCGGCCTCGGTAAAGATCAGGTCGACGCGCACGCGGCGGGTCTGGCCAAATCTCTGAACGCGATGAATGGCTTGGATGAAATCGTTGAATTTAAAGCCGATGCCGAGGAACACGGCCCACGAGCAATGCCGCTGGAAATTGCAGCCCGATCCCATGATCGAGGCCTTGCCGGCCAGCTCGGCGATCTTGCCGTCGGAAAAGGCGATGATCGCCCGCTCGCGCTCGTCGAGGTCCTGGCTGCCATAGACCGAGACGGCCGAGGGAATGGCCGCCTCGATCGCCCGCCGCTCGTCTTCCAGATCGTGCCAGAGCAGGCGGTGCGCTTGCGGATCCTCGGCGCGGATCTCCATCAGTTTGGCGATGCGCGCCGGCAGGCTGTCCCGCTTCTCGCGGGAAGCATCCACGACGCCGATCGCCGCATTGCGGAACATCCGGCCCTGCCCGTCCTTCTCGACGCCGGCAGAGCCATGGTCGGAAGGGATCTCATGCCAGCGCAGATCCAGCTCCGGCAGGTCGTAGCCTTCGTCGGAAAAGCCGAGATTGCTCGGCTTCTGGACGAACATTCCCCAGGTGGCGCACCACAGCCAGAATTCCCGCTCCTTGTGCGGGTGGATGGTGAGCGTGTCGGCCTTCTCCGAATTGCGCTTGAAGAAGCGCGTCTTCACCTGGCCGACATCCATCACCTCGAGGAACGCGGCATAGGCCAGCAGCTCGATGAATTCGTTCGGGCTCGGCGTCGCCGTCGCCACGAATTTGTAGGCGACGCGCCCCTCGAACAGCCGCATGAACTCGCGGAAGGTCTTGCTGCCGCCGAAGCCGCGCAGCACCGAGGCCTCGTCGAGCGAGACGGCGCCGAACAGCGCGGGATCGAGCTTGCCGTCGCGGATGGTCTCGTAATTGGTGAGGTGGATGGTGCCGGGCTCGATCTCTCTGGCACGGCGGATGAACTTCACCTTCACCGCATACTCGCCCTGGAAGTAGAGCGCGGCATCGCGGATGAACTCCTGGCGCACACCGAGCGGCAGCACGATCAGGGTCGGCTTGTCGATCCGCTTGCCGATCAGGCGCATCACCTCGATCTGCGTCACGGTCTTGTGCAGGCCGAAGGCCATGAACATGGCCCGCCGCCCGCCCGCCACGGCCCATTGCACGATGGCGCGCGTATGCGGCTTCAGGGCCGGGTTGATGTCTTCGAGCGGGATATCGAAGCCGAGCGGCGGCGCGACGATCGCCTTCGCCTCGAGGAAGGAGCGATAGGTGGCGGGCGCGTTCATCGGCGCCCCCGCCAGGCTTCCCAGCAGCCGTAGAGAATGGCGGCGAAGGGCGCGGCGATGATCAGCGCCAGTATCACGGCGGCGAATACGAGGATGCCGCAAGCGACAAGCCAGAGCAGGAACAGCAGCGTTGCGATCATGAGCGCCTCCGCTTGCGCAGGGCGCGATTGACAGCGCCCTCTGCGGCCCAGACGACGGCGCAGGCCAGGGTGAAGGCGATCGGCAGCGCGAAGGGCAGCACGATGAGGAAGATCTGATCGCCTTCCCTCATGCCGCACCGCCTGCCACCACGGAGAGGCGGCGCTTGGCGACGGTCAGTTCGCGTTCGAGCGAGGCGATATTGGCAAGGCCGCCCTTCGCTTCCCCGGGCGTCACCTTACCGTCGGCGAGCGCATCGCTCATGCCGGCCGTGACCTTGCTGGCCGCACCGGCCACGCTGACGAGATCGCCGACGAAGTCGCGCGAGCCCGGCGTGTCGCCGCCCTCCTCGGCGATCGGCGTCAGCCGGTGGCCGGTCAGGCCGGCCATCATCCGGGTGAAGGCCGGCTGCTGGCAGTTGAACTCCAGCACGCCGACGGCCCAGGCCGGGATCACGTCGGGATAGGCGTCGCCCTGCCAGCGGCTGATCTGGCCATCGGAGGCGCCGGTAAGGTTTTTCACCGTGGCCGCCCCGCCGCAGGCCAGCACGAGGGCCCGGGTGGCGGCCTTGATCATGGCCATCAGGCCATCAGGCAAAGCGATAGTGGTCACGAAAAACACTCCTGTGTTTTTTCGGTGCGAAAACAGGTGGCTGATGAGAAAAGCTGAGGCGTCAGATCAACGGAGGCGCCCGGACATGGCGAAGGCGAAACAGACGCGGATCGAACGGAATTTCTTCCGCATGTTCGATGCCGCCATGGCGAGGATCGAGGAGAGCCGGCATCAGGCGGCCCTCTCGGAAACGGAAAGAGCCGCCGAGGCGGAGGCGGACGCCTCGGCGGCAGGTGCCTCCTGCGCGTCAACGACAAGGAGGGCAGAGGAACCTGATTGCTCCCGCAACATATCGAGCTGAATGGCCAGGCAGTCGCGGGCACTCACCTTGCCGCCGGTACCATCCTCGATGTCGATTGCGAGATCTGTGCTCGCGGAACGTTCACCGCGAAGGACGCGGGAAATAGTGCTCGGCGAGCGACCAATCCGAAGCGCCAGCCTCGACGGCGTGTCTCCGGTCTCAAGAAAATATCTTTGGAGCACATTCATAGGCCATCTTGTGCCAAATAGTCACAAACATTGCAAGCCAAAATGTGACTATTTGGCTATGGCGTGAATTTGCCTATTTGGCAAAGATCCCCAAATGAACATCCCAAACCGCATACGCGAGCACCGCTTGGCGCGGCGCATGACACAAGAGAAGCTGGCCGAAGAGACCGGCCTTTCAGTCTCCTATGTGACGCGCCTAGAGAGCGGCGGACGCAACCTCGCGCAGAAGCACATGATTGCCTTTTCGAAGGCGCTTGGCGTGAAACCGGCAGACCTCTTGCCGGAGGATGCCGAGCCGGCTCACAATGTCGTCAGAGTGATGGGGCGGATTGGCGCGGGGGCGGAAATCCTGTCTGAGTTTGAACAGATTCCGGAAGGCGACGGCCTTTACGAGATCGAAGTACATTTTCCTGTGCCAGAAGACGCAATCGCGCTCGAAGTGGTCGGCGAGAGCATGTGGCCGCGATACGACCCCGGCGACGTCATCATCGTTTGGCGCTTCGGCAATCACCCAGACGAAATCATTGGGTGGGAAGCCGCCGTTGAGACAGAGGACGGCCGGCGCTTCCTTAAGCGCGTCCTCAAGGGGTCGCAGCCAGGAACCTATGATCTGGAAAGTCACAACGCGGCAACCATTCGAAACACCAAGCTTGTTTGGGTTGCCGAGGTGCTCTCCGTCGTGAGATCCGGCCAATGGCGAAAGCTCGACGCGAACGCGCGCAGGAGGATATTGCGCAAGGCCACTAAGTCATAGCGATAACATGACAATTTGACACAGCGAGCGGCGACACCGCCGCCGAGGTAAAAACGTATTTGCCAGATAGTCACATTTTCTCTTGACGGATGTGACTATCTGGCACTATCCCTATTCTCCATCGCAACCGGCGATACCGCCCTTCCATGCGATGGAGAACCCCAATGCCTGCAGTTGCTACCGGCGCCGCCGTGCGCCCTGCCTCCCGCACCCTCGACGCCAAGCGCGCCACCCATGACGGCTATATCTGGCCACGCGCCCGCCATCACATGACCGACGGCGTCGCCATCCGCGACCGGATGGCCGAGGATGCCCGCGCCTTCGAGGAGCTGCGCGGCGATTTCGAAGGCATCAGCGAGCGCTACTTCTGGTCGAAGGGCTGGACCCCCGAGCAGGTCGAGCAGCACGGCGTCGCCGTGATCGAGGAAGTCACCTCCACCCCCATCAGCCGTACCAGCCCGCTCTTTCCCGGCGAGCGCTCCCGCGCGGCCTGAGCCCAGGCCGATGCACCAGACGGCCTATGAGATCGCGCTCTATGTCGTGCTCAAAGGCAGCGACGACATGGAGCGCCTGCGCAGCACCCTGCGCCGCGCCACTTTCACCAGCGACGAAATCGAGCGCCACCTCCCGGCCGCCCTCGCCGCTGCCCCTTCCCTCAAGGCGAGGATGAACCGGCCATGACCAAGCTCCTCCTTCCCTTGCCTGCCCCCCCTAAGCAGACGCCGGCGCCGATCTACGACGGCATGGGTCATTTCATCAGGCCCGACTTCCACGAACTCGAAGAGGCCTCTGTGCGCCTTTGGCGCGGCGATCTCTTCGAAGCCCTCATTCACATCGAACGCGCGATCCCCGCCTTGAGCGGGCTGCACGATGCCGTTCGCAAGCTCCAAAGGTAACCGACCATGCCCGGCATCGTTTCCGTCGCCGCCGACCAGTTGAAATCCATCATCGAGCGCATCGAGCGCCTCGAGGAGGACAAGAAGGCCATCGCCGACGACATCAAGGAAGTCTATGGCGAGGCCAAGGCCACGGGTTTCGACCCCGGCATCCTTCGCAAGATCGTCAGCCTGCGCAAGAAGCCCGCCGCGGAGCGCTCCGAGGAAGACGCTATTCTCGAACTCTACCTCCAAGCCCTCGGCATGGAGTGATCGATGACGCGCCTCGTCGAGTTTTTCAGGACCGAAGACGGCGAGCCCTGGGGCTTGTTCGTCTACGGCCATGTCGACCCGGCCTCGGTTGCCAACGAACTGCAGCAGACTTTCGAGCGCCATCGAGACCTCGGCGAGGTCGACGAGGAATGGGACGGCTGGGCCGTCGATCCCGGCGAGATCCGCCAATACTGGACGTACCAGAGGGAAGACGCCCCCGAAGACCTCTCTTTCTATTGGTGCGAAGCCGGCAGGGCCGGCGCCATCTCCGTCACAGGAATTCGCTTCTAATGCCAGCCACCATCACCATGGAGCGGGGAAGCCTTTGGACTGCCCTGCAGGCGGTCTCACGCATCATCGACCGCAAGAACACCATCCCGATCCTCGGCAACGTGCTGATCGTCGCCGACACCGGCTCCGTCACGATCACCTCGACCAATCTCGACATGCGGATCGAGGCGACCGTGCCGGCCGATGGCGACGCCTTCGCGCTGACGGTGCCGGCCGGCGCGATCAGCGACATCGTGCGCAAGTTCGACGACGGCGCGCAGGTCAGCATGACGGCCGAGGATACCCGCGTCATCATCCGCTCCGGCCGCAGCCGCTTCACCCTGCTCACCTTGCCGGCGCTCGACTACCCCTCCATGCAGCCGCAGGAATGGCGCCGCGAGTTCTCCATGCCGGCAGCCGCGCTGCAGCAGCTGCTGACGACAGTCGATTTCGCGATCTCGACCGAGGAAACGCGCTACTACCTCAACGGCATCTATCTCCACGCCGTGGACGATGCGGGCACCGCCGTGCTGCGCGCCGTTGCGACCGACGGGCATCGCCTGGCGCGCGATCAGGTCGAGGCCCCAGCCGATGCCGTCGGTATGCCGGGCATCATCGTTCCCACCGGCTTTGTCGCAGAGGCCTCCCGATTCCTGTCGTCGCTCAAGACGGCTGGCGAGGTGAGGCTCGAAATCGCCGAGACGATGATCCGGCTTTCGGCCGCGAACACCGTGTTGCAGTCCAAGTTGGTCGACGGCTCGTTTCCCGATTATGCCCGCGTCATCCCGCGCGACAACGGCCTGACGCTCGCTGCGGCGCGCGCCGAATTGGCGGCCGCCGTCGATCGGGTCTCCGCCATCGGCTCCGCGGCGAAGAGCCGCGAGATCGAACTGGCGCTGGAGGCTGACAAGGTGAGCCTCTCGTGCCGCTCGATCGACGTCGGCGACGGCGGCGACGAGCTGGATGCGACCTTCGACGGCCCCGAGCCGATGAAGATCTACGTCAATTCCCGCTACCTGCTCGACATCCTCGCCCATTGCCCCGGCGACCAGGTGCTGATGCGCGTCAGCGATCCGATGTCCCCCTTCCTGTTCGAGCCCTCGGCCGGCGCCGCCGCGCAGTTCGTGCTCATGCCGATGAGGAAGACCAATGGCTAGCACCTATATCGTCCAGGCGGTCGGCTCGACCGTCATGCTCACTGTCACGGCGCCCGGGCCGGCCTGGCCGCCACGGTATGAGCAGTTTTACCTCTCCGTCGCCCAGGCACGCGACATCGCGGCCGACATGATCGAGGCCGCCGATAAGGCAGACGCCAACCCCGATCGCGCCGCCACCCTGCGCAACGAGATTACCCGCCTCGTCGACGAACTGAACGGCCTCGAAGGCGCCGGCAGCAAGACATGCGTCGTGGGCATCGATTTCGGCAAGGAGCCCTGACATGTCTGACATCACCAACACCGCGCGGGCCCTTGTCGAGATCCTCGAAGCCGCCGATCGCGAGAACGGCGAGCCGATGGCTAGCACCCTGCCGCTCGCACCCGGCTTGGAACCGGACTGGAAACTTCCCGTTCCCATCCGCCTCCTCCGCGAGCTGCAGGTCGCCCTGCGCAATGGCGGCCACGGGCCGCGCTGGCGCCACCTCAAGCGCCAGTCGACCTATGTCGAGATCGGTAACGCCATCGTTCAGGCGGCACGCCCGCTGATCGAGGGCGATCGCGTCAAGATCTACCGCGGCGAGGCCGACGGCCGGATTTGGGTTCGCCACTTCCTCGAATTCACTGACGGCCGTTTCGAGCGGCTGCCGGCCGATGCTTCGCAGCCGGAGGCGGCGCCATGACCTCATTGCATCACATACATGCCGGCCTCGTCGCAGGCCTCGATAAACGCCTTTCGAGCCAGGTCAGCCGGCACCGCCCCAGCCAACGCATCCAGGCAGGCTTTTCGGGCGAGCAGATGCTTGTCGCCGCCCTCGATCGGCCAATGGTTCAGCAGGACGTTGGCGGCCTCCTCAGGCGAGGCGGCGACCCGAAAGGTTCCAACGCCGGTCTCGAATTTGACAGGGTGAGTCCAGGTTATGTCGTTCATGGCGCACCTCCGCCACATAAACCGCGCCAGGGCTTCGCAGTTCCAGCGATCGTCGGGAGGGTGACATGACCTCCCGCCGCGACCGCATCCGCGAGAAGATCCTCGCCCGCACGGTCGAGGCACCGCCACCGCCAGGGCTCGGCCTCACCACGTCCTGCAGACTCTGGACGGGGCCGACTTCCGGCGACAGCGGTCGCGGCGCCGGCTATGGCCGCATGAGCCTAGACGGCGGAACCGTCGCGCCGCATATCGCCTACTTCGTCGTGGAACACGGGCCGATCCCGCCGCGCAAGCACCTCGACCATCTCTGCCGGCGACGCCTCTGCGTCGTCCATACCGAGATGGTGACACACAAGCTCAACCAGAAACGCCGCGACGCCGCTAGGCGCGCGATCACCTGCGAGACTGTGGAGGCCGCCTGATGTCGCTCCCGTTCGCCCCCGTATTCGCTGGCCTCCTCACGATCGACCAGGCTGCCGCCCATCTCTGCATCGGCAAAAGACTCTTGCGCGAGCACGTGCGACGGGGCGAGATTTCTTACGTCCTGACCGGCAAGGGAGAAAAGCGCAAGAAGATCGCGTTCGCCCTCTCCGATCTGGAGGCCTTCATCACCCGCCATCGCCGCGTGGAGATCGTTTCGTGTCCGTCTACAAGCAGCAAGGCCGCCCGTACTACATGTACGACTTCCAATGTGGTGGCCGTCGATTTCATGGCAATACCCAAACCGCCAACAAGGCCGAAGCCCGGGCCATCGAGCAGCGCGAGCGCGAGGCAGCCAAACAGCAGATCGCGGCCGAGCACGCCGCGGCATCCGCGTTCCGGGGCGAAGCCCCGCTCACGCTGAGCCTTGCGATTGCCCGCTATTGGGAGGAAGCCGGCCGGCACCATGCCGGTGCCGCAACCACGTGGGTAGACCTGCAACGAGCCCTGTCGCATTTCGGCGGCGCCAAGCGCCTGGATGAGATCACCGATTCTGATGTCGCCGCCTATGTCGCCAAACGGCGCGGCGAGCGTCGCAAGGGCAAGGAGAAGGCCGCCCTGGTCTCGCCTGCAACCGTCAATCGCACAACGATCGACCTGCTTCGCAAGCTCATGACTAGGGCTCGCAAGGCCTGGAAGATCCCCCTGCCCGACGAGCCCGACTGGAAGGTGCATCGGATCAAGGAGCGCGGCGAGCTGGTGCGCGAACTTCGGCCCTCCGATGAAGATCGCCTCGTGGCCAGCCTGGCGGACGGCTATCGCGACATCTGGCGATTCGCCCTCGCCTCTGGACTTCGCCTGGGCGAGTGCTTCCTTACCTGGGAGCAGGTCGACTTCGAAGCCGGTGTAATCAACGTCATCCAGAAGGGCGGCCGGCCGCACACGATCCCGATATCGCGCTCGATTGCGGCCATCCTGTCGACCTGCAGGGGCCACCATGATGTTTATGTCTTCACCTACACGGCCCGCCGCACCATCAAGTGGCGCAAGCCCGCCTCTGCGAATCGCGTCAAAGGCCAGCGATACCCGGTGACCTATGAAGGGCTGAAGTCCGAATGGCAGCGCACCCGTGACGAGCTGGGCCTCGACCTGCGTTTCCATGACATGCGGCACACCAGGGCGACCCGCCTGCTACGCTCTAGCGGAAACCTCAAGGCCGCCCAGAAACTCCTCGGCCACGCCGACATCAGCACCACCGCTAAGTTCTATGCGCATGTGGACATGAACGACCTTCGGAGCCTGCTGGACGCCGAGGGTAAGTCCGCACCAACAAAAAATCGCTTGCGCGGCAAAGCCAAAACCGGATAG
Protein sequences of DBSCAN-SWA_3 >NZ_CP043489|376975:386857|380494_380812_-|WP_149251147.1|DBSCAN-SWA MNVLQRYFLETGDTPSRLALRIGRSPSTISRVLRGERSASTDLAIDIEDGTGGKVSARDCLAIQLDMLREQSGSSALLVVDAQEAPAAEASASASAALSVSERAA >NZ_CP043489|376975:386857|383022_384141_+|WP_149255248.1|DBSCAN-SWA MERGSLWTALQAVSRIIDRKNTIPILGNVLIVADTGSVTITSTNLDMRIEATVPADGDAFALTVPAGAISDIVRKFDDGAQVSMTAEDTRVIIRSGRSRFTLLTLPALDYPSMQPQEWRREFSMPAAALQQLLTTVDFAISTEETRYYLNGIYLHAVDDAGTAVLRAVATDGHRLARDQVEAPADAVGMPGIIVPTGFVAEASRFLSSLKTAGEVRLEIAETMIRLSAANTVLQSKLVDGSFPDYARVIPRDNGLTLAAARAELAAAVDRVSAIGSAAKSREIELALEADKVSLSCRSIDVGDGGDELDATFDGPEPMKIYVNSRYLLDILAHCPGDQVLMRVSDPMSPFLFEPSAGAAAQFVLMPMRKTNG >NZ_CP043489|376975:386857|385134_385491_+|WP_149251156.1|DBSCAN-SWA MTSRRDRIREKILARTVEAPPPPGLGLTTSCRLWTGPTSGDSGRGAGYGRMSLDGGTVAPHIAYFVVEHGPIPPRKHLDHLCRRRLCVVHTEMVTHKLNQKRRDAARRAITCETVEAA >NZ_CP043489|376975:386857|385708_386857_+|WP_149251158.1|integrase|DBSCAN-SWA MSVYKQQGRPYYMYDFQCGGRRFHGNTQTANKAEARAIEQREREAAKQQIAAEHAAASAFRGEAPLTLSLAIARYWEEAGRHHAGAATTWVDLQRALSHFGGAKRLDEITDSDVAAYVAKRRGERRKGKEKAALVSPATVNRTTIDLLRKLMTRARKAWKIPLPDEPDWKVHRIKERGELVRELRPSDEDRLVASLADGYRDIWRFALASGLRLGECFLTWEQVDFEAGVINVIQKGGRPHTIPISRSIAAILSTCRGHHDVYVFTYTARRTIKWRKPASANRVKGQRYPVTYEGLKSEWQRTRDELGLDLRFHDMRHTRATRLLRSSGNLKAAQKLLGHADISTTAKFYAHVDMNDLRSLLDAEGKSAPTKNRLRGKAKTG >NZ_CP043489|376975:386857|384133_384421_+|WP_149251153.1|DBSCAN-SWA MASTYIVQAVGSTVMLTVTAPGPAWPPRYEQFYLSVAQARDIAADMIEAADKADANPDRAATLRNEITRLVDELNGLEGAGSKTCVVGIDFGKEP >NZ_CP043489|376975:386857|380906_381566_+|WP_149251148.1|DBSCAN-SWA MNIPNRIREHRLARRMTQEKLAEETGLSVSYVTRLESGGRNLAQKHMIAFSKALGVKPADLLPEDAEPAHNVVRVMGRIGAGAEILSEFEQIPEGDGLYEIEVHFPVPEDAIALEVVGESMWPRYDPGDVIIVWRFGNHPDEIIGWEAAVETEDGRRFLKRVLKGSQPGTYDLESHNAATIRNTKLVWVAEVLSVVRSGQWRKLDANARRRILRKATKS >NZ_CP043489|376975:386857|382468_382714_+|WP_149251151.1|DBSCAN-SWA MPGIVSVAADQLKSIIERIERLEEDKKAIADDIKEVYGEAKATGFDPGILRKIVSLRKKPAAERSEEDAILELYLQALGME >NZ_CP043489|376975:386857|381730_382060_+|WP_149251149.1|DBSCAN-SWA MPAVATGAAVRPASRTLDAKRATHDGYIWPRARHHMTDGVAIRDRMAEDARAFEELRGDFEGISERYFWSKGWTPEQVEQHGVAVIEEVTSTPISRTSPLFPGERSRAA >NZ_CP043489|376975:386857|385490_385913_+|WP_149251157.1|DBSCAN-SWA MSLPFAPVFAGLLTIDQAAAHLCIGKRLLREHVRRGEISYVLTGKGEKRKKIAFALSDLEAFITRHRRVEIVSCPSTSSKAARTTCTTSNVVAVDFMAIPKPPTRPKPGPSSSASARQPNSRSRPSTPRHPRSGAKPRSR >NZ_CP043489|376975:386857|384422_384821_+|WP_149251154.1|DBSCAN-SWA MSDITNTARALVEILEAADRENGEPMASTLPLAPGLEPDWKLPVPIRLLRELQVALRNGGHGPRWRHLKRQSTYVEIGNAIVQAARPLIEGDRVKIYRGEADGRIWVRHFLEFTDGRFERLPADASQPEAAP >NZ_CP043489|376975:386857|384823_385087_-|WP_149251155.1|DBSCAN-SWA MWRRCAMNDITWTHPVKFETGVGTFRVAASPEEAANVLLNHWPIEGGDKHLLARKACLDALAGAVPADLARKAFIEACDEAGMYVMQ >NZ_CP043489|376975:386857|379840_380323_-|WP_149251146.1|DBSCAN-SWA MTTIALPDGLMAMIKAATRALVLACGGAATVKNLTGASDGQISRWQGDAYPDVIPAWAVGVLEFNCQQPAFTRMMAGLTGHRLTPIAEEGGDTPGSRDFVGDLVSVAGAASKVTAGMSDALADGKVTPGEAKGGLANIASLERELTVAKRRLSVVAGGAA >NZ_CP043489|376975:386857|376975_379573_-|WP_149251145.1|DBSCAN-SWA MNAPATYRSFLEAKAIVAPPLGFDIPLEDINPALKPHTRAIVQWAVAGGRRAMFMAFGLHKTVTQIEVMRLIGKRIDKPTLIVLPLGVRQEFIRDAALYFQGEYAVKVKFIRRAREIEPGTIHLTNYETIRDGKLDPALFGAVSLDEASVLRGFGGSKTFREFMRLFEGRVAYKFVATATPSPNEFIELLAYAAFLEVMDVGQVKTRFFKRNSEKADTLTIHPHKEREFWLWCATWGMFVQKPSNLGFSDEGYDLPELDLRWHEIPSDHGSAGVEKDGQGRMFRNAAIGVVDASREKRDSLPARIAKLMEIRAEDPQAHRLLWHDLEDERRAIEAAIPSAVSVYGSQDLDERERAIIAFSDGKIAELAGKASIMGSGCNFQRHCSWAVFLGIGFKFNDFIQAIHRVQRFGQTRRVRVDLIFTEAEREVRRSLERKWAQHKELVAQMTAIIREYGLNRDAFAAEMTRALGVKERIEVAGPGYRLANNDCVDETSRMPGDSVALIVTSIPFSNQYEYTPNYADFGHTDDNGHFWRQMDYLTPQLLRILQPGRVAAIHVKDRITPGGVNGLGFQTLEPFSDECVAHFRRHGFAFLGRKIITTDVVRENNQTYRLGWSEQCKDGSRMGPGVPEYLLLFRKPPSDLSNGYADKPVIKPKKEWQGEGLARHWVNPEGYSRARWQLDAHAFNRSSGNRQMAPEEFVPLAADMVYKTWRGENLSAIYDFEHHVRIAETLEIKGQLPSGFMLMPPHSWHDDVWTDVARMLTMNTLQAGKGKEGHLCPLQFDIVDRAIAQFSEPGEMVFDPFGGLMTVPYRAIKLGRQGSASELNPLYFLDGCKYVEAAAREMAMPDLFSVLDAEARGSAGQEAA >NZ_CP043489|376975:386857|382717_383005_+|WP_149251152.1|DBSCAN-SWA MTRLVEFFRTEDGEPWGLFVYGHVDPASVANELQQTFERHRDLGEVDEEWDGWAVDPGEIRQYWTYQREDAPEDLSFYWCEAGRAGAISVTGIRF >NZ_CP043489|376975:386857|382243_382462_+|WP_149251150.1|DBSCAN-SWA MTKLLLPLPAPPKQTPAPIYDGMGHFIRPDFHELEEASVRLWRGDLFEALIHIERAIPALSGLHDAVRKLQR |
15 | Sinorhizobium_phage(50.0%) | integrase | attL 372557:372572|attR 388354:388369 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|