Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP018811 | Paraburkholderia sp. SOS3 chromosome 1, complete sequence | 1 crisprs | cas3,csa3,DinG,DEDDh | 0 | 0 | 1 | 0 |
NZ_CP018813 | Paraburkholderia sp. SOS3 plasmid unnamed1, complete sequence | 0 crisprs | NA | 0 | 0 | 3 | 0 |
NZ_CP018812 | Paraburkholderia sp. SOS3 chromosome 2, complete sequence | 1 crisprs | csa3,WYL,DinG,cas3,RT,DEDDh | 0 | 0 | 2 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP018811_1 | 1983464-1983565 | Orphan |
NA
Consensus repeat of NZ_CP018811_1
|
1 spacers
spacers of NZ_CP018811_1
>1.1|1983488|54|NZ_CP018811|CRISPRCasFinder GCCTCCCGCAGCAGGACCGCCGCCTCCGCCCGCATGTGGCGGCGGTGGGTTATT |
CRISPR arrays and Neighbor proteins around NZ_CP018811_1
The CRISPR arrays of NZ_CP018811_1 >merge|NZ_CP018811|1|1983464-1983565|CRISPRCasFinder CTGCGGCGGCGGATGTCCGCCGCCGCCTCCCGCAGCAGGACCGCCGCCTCCGCCCGCATGTGGCGGCGGTGGGTTATTCGGCGGCGGACGTCCGCCGCCGCC >NZ_CP018811|1|1|1983464-1983565|CRISPRCasFinder CTGCGGCGGCGGATGTCCGCCGCC GCCTCCCGCAGCAGGACCGCCGCCTCCGCCCGCATGTGGCGGCGGTGGGTTATT CGGCGGCGGACGTCCGCCGCCGCC
>NZ_CP018811.1|WP_075156753.1|1981941_1983354_+|iron-sulfur-cluster-binding-protein MQVQSMHFKARAGQKLADQRLQQNLTKLSTRIVAARAQVMTEIDFQATRTSLKERRNRALDNLDVWLDTFEREATRRGVTVLYAETTQDAAKLVADIARKHDVKKVIKSKSMVSEEMRLNEVLGKMGVQSVETDLGEYILQINNNEPPSHIIMPVIHKDKEEIADLFAKTHNKPRLTEIPQMTREARETLRPHFMSADMGVTGANFVIAETGSIAVVTNEGNEGMCTVMPRVHVAVTGIEKVLPTLEDLATALRLLPRSATAQSMTNYVSLLTGPRRPGDQDGPEHMYVVLVDGGRTGLIGGDFQEMLRCIRCGACMNHCPVYQKVGGHAYGWVYPGPMGSVLTPSYVGIDKALDLPQAATLCGECNSVCPVGIPLSDLLRKLRERQVERHLRPWRERFGLALWGYFALHPAAYALLTKLAVRVLEKVGGEQRSIGRLPFGGGWTNTRDMPAPVGRTFRELYAAQRSHRG >NZ_CP018811.1|WP_075156752.1|1981122_1981845_+|(Fe-S)-binding-protein MRVGFFVTCLIDLMRPEIGFSAIKLIERAGFEVVVPPAQTCCGQPAYNSGERGIARDLAEKMLREFEQFDYVVVPSGSCGGMIRAHYGDLFGNDPELMGRFARLRAKVYELTDFLVTVAKVELGPGEFTGPVTYHDSCAGLRELGVKAQPRALLAQAGVQVAEMKGCEHCCGFGGTFAVKYGDISTAIVDEKCDNIHASGTNAVVLGDLGCMLNIEGRLRRTGDTTTRVLHIAQVLAGDA >NZ_CP018811.1|WP_075156751.1|1980136_1980943_-|IclR-family-transcriptional-regulator MSDTNPDPKTSIQVIERMMRLLDALAAHSDPVSLKELAQRTELHPSTAHRILNDMVTCRLVDRSDPGTYRLGMRLLELGNLVKARLSVRDAALMPMRELHRLTGQTVNLSVRQGDEIVYIERAYSERSGMQVVRAIGGRAPLHLTSVGKLFLAADEAARVRAYATRTGLSGHTQNSITDLAKLERELTHVRQQSCARDNEELELGVRCIAAGIYDDTGRLVAGLSLSAPADRLQDSWLNQVSKTALVISESLGYQPEAAQPDAAARSA >NZ_CP018811.1|WP_075156750.1|1978756_1979947_+|D-alanyl-D-alanine-endopeptidase MKTDMFSSLKVMHGAALTTALSVAASMVIAAASALPVDAFAASTTASAKSEKSAKATKKSKKPAQPAAKVSSRESAKGAKVAAAKRGGEGDAPRAVSKKKRVSYTMNGRHHSVVRRVAFEPRSPTAGQAFGLHESPEGVALRSTVAYVVDQNSSEPLFDKNSRAVVPIASITKLMTAMVVLDSKEPMTEQIEVTDEDRDYEKNTGSRLSVGSVLSREDMLHIALMASENRAAAALSRYYPGGRPAFMAAMNAKAKQLGMVDTHFENPTGLTSQNVSTARDLVKMVNAAYQYPLIRRFSTDRSYEVYTGKRTLAYNSTNALIRNPSWDIGLQKTGFINEAGECLVMQATIDNRPMIMVLLDSAGKYSRFADATRLRTWIDNGGEQRVTSADANGAGT >NZ_CP018811.1|WP_075156749.1|1977371_1977959_-|phasin-family-protein MTLLTPEQFAAAQKANLETLFGLTNKAFEGVEKLVELNLQVVKSTLAENQENAQRALSVKDAQELLALQASLTQPVAEKVLSYGRHLYEIASATQAEFARVAEAQYEEQNRKVQALVDNVAKNAPAGSETAVAVIKSAITAANTTYETVHKATKQAVEIAESNFNAAATAASKAASQAAAQVSRQASATKKAAAA >NZ_CP018811.1|WP_075156748.1|1975092_1976913_+|dihydrolipoyl-dehydrogenase MSLVEVKVPDIGDFKDVDVIEVNIKPGDVIEKEQALMTLESDKASIEVPSDTAGTVKEVRVKAGDKVSEGTVIATVETSGEAAPAKEEAKPAKESAKEPAKAAAPAAAAQPAAQPAAQSAPKQPAAAPQAGSFSGTADIECDMVVLGSGPGGYSAAFRAADLGMKTVLVERYATLGGVCLNVGCIPSKALLHTALVIDEAAALGAHGITFAKPQVDLDKLREFKSGVVKKLTGGLGGMAKARKVEVVTGVGAFVDPHHMEVQTEGGKKVVKFKNAIIAAGSQAVKLPFIPGDPRVIDSTGALELRQIPQRMLVIGGGIIGLEMATVYSTLGAQIDVVEMLDGLMMGADRDLVKVWEKFNSKRFANVMLKTKTTAAEAKQDGIYVTFEGEKAPAEPQRYDLVLVAVGRSPNGKKIGADKAGVAVTDRGFIEVDKQMRTNVPHIYAIGDIVGQPMLAHKAVHEGHVAAEAAHGEKAYFDALQIPSVAYTDPEVAWAGKTEDQCKAEGIKYGKAVFPWAASGRAIANGRDEGFTKLIFDEQTHRVIGGGIVGLNAGDLISEVCLAVEMGADATDIGKTIHPHPTLGESIGMAAELYEGVCTDLPPQRKK >NZ_CP018811.1|WP_075156747.1|1973205_1974897_+|dihydrolipoyllysine-residue-acetyltransferase MSQAIEVKVPDIGDYKDVPVIEVLVKAGDTVEKEQSLVTLESDKATMDVPSPVSGVVKEIKVKVGDNVSEGTLIVVVDGEGAGAAPSKAPAQAQANGAAAPAAQAAAAAAAPAAAPAAAPAAASGGGVQEVKVPDIGDYKDVPVIEIAVKVGDRVEKEQSLITLESDKATMDVPSPAAGVVKEVKVKVGDNVSEGTVIVLIEGAGGAAPAPAAAAPQKPVEAPSDAPQKPAAAPAAAQPSALAQAPVIPAGEGGAYRPSHASPSVRKFARELGVDVARVQGTGPKGRVTQDDVTAFVKGVMTGERAAPAAAAGAPAGGGELNLLPWPKIDFTKFGPIDPKPLSRIKKISGANLHRNWVMIPHVTNNDEADITELEALRVQLNKENEKAGVKFTMLAFVIKAVVAALKKFPTFNTSLDGDNLVYKQYFHIGFAADTPNGLVVPVIRDADKKGLVDIAKEMTELSKLARDGKLKPEQMQGGSFSISSLGGIGGTHFTPIINAPEVAILGLSRSFMKPVWDGKQFVPRLTLPLSLSYDHRVIDGAEAARFNAYLSSILGDFRRVIL >NZ_CP018811.1|WP_075156746.1|1970411_1973123_+|pyruvate-dehydrogenase-(acetyl-transferring),-homodimeric-type MSAVPDEVLKYVAASQDDNDPQETAEWLEALDGVISAVGPNRAHYLIEKQIEFARVHGEHLPFSANTPYINTIPVANQAKIPGDQDIEHRIRSYTRWNAMAMVLRAGKDTNVGGHIASFASAATLYDVGFNHFWHAPSADHGGDLVFVQGHSSPGVYARAFLLGRLSEKQVDNFRQEVGGEGISSYPHPWLMPDFWQFPTVSMGLGPIMAIYQARFMKYLQARGIAKTEGRKVWAFLGDGETDEPESLGAIGMAGRERLDNLVFVINCNLQRLDGPVRGNGKIIQELESEFRGAGWNVIKVVWGSRWDALFARDKSGALMRRMMEVVDGEYQTYKSESGAFVREHFFNTPELKALVADWSDDDIWNLNRGGHDPHKIFAAYKAATEAKNQPTVILAKTIKGYGMGEAGQAMNITHQQKKMQVEALKQFRDQFRLPIPDEEIAHIPYLTFEEGSKELEYMRARRQELGGYLPARRQKAESLPVPELSVFEPLLKGTGEGREISTTMAFVRILNILLKDKTLGKRVVPIVPDESRTFGMEGLFRQIGIWNQDGQKYVPEDSDQLMFYKESETGQILQEGINEAGGMSDWIAAATSYSTHNEIMIPFYIYYSMFGFQRIGDLAWAAGDMRSRGFLLGGTAGRTTLNGEGLQHEDGHSLLWASSIPNCISYDPTFGYELAVIIQDGLRRMVQEQEDVFYYLTVMNENYEHPAIPQGDAVASDIIKGMYAFRKADGDAASGKNAPRVQLMGAGTIFNEVIAAADLLKNDWGVAADLWSVPSFTELAREGHEVQRWNLLHPTEEKKLSHVEKLLKDAKGPVIASTDYVRALSEQIRAFVPQRYVVLGTDGFGRSDTREQLRHFFEVDRYWVTVAALNALADEGTIERKVVADALKKYNLDPSKPNPMTV >NZ_CP018811.1|WP_075156745.1|1967618_1970135_-|PAS-domain-S-box-protein MLTDRLFARSARPSGSPADTSPSRWHHGPWWSNSYLLTPLLSILVFLVVMSLILWSLNRREQQQQEDTLYRNVAWAQQQIRLSMTGAQEQIQALARDLVTGRADPHSFQVSTADIMQGHPEILYMNWYTSESQPRWPNTPLPVLGQRLAKPNEQQMDEAVKAAFNEARTTRRQVYSPLLYDDLGNGYITLQTPVYRDREFLGSIAAVFSVEGMLKRDIPPELSAKYKISIIDVNNRELATTSTRPRLPRDAYYDLPLDPPGQGISVRVYAYPQMTNFTNNTLVWLVAGLSCFVLWSLWSLWKHTRQRFEAQQALYAEAFFRRAMENSVLIGMRVLDMHGRITHVNPAFCRMTGWDEIDLVGKTAPFPYWPRDSYPEMQRQLDMTLRGKAPSSGFELRVRRKDGSLFHARLYVSPLIDSSGRQTGWMSSMTDITEPKRAREELAAAHERFTTVLESLDAAVSVLAADEAELLFANRYYRHLFGIRPDGHLELAGGGGFDSSQASSDSIDMVDTYAGLPAAALTESTADAQEVYVQGIQKWFEVRRQYIQWVDGHLAQMQIATDITTRKQAQELARQQDEKLQFTSRLMTMGEMASSLAHELNQPLAAINNYCSGAVALVKSGRTTPDNLLPVLEKTAQQAVRAGMIIKRIREFVKRSEPKRQATRVADIVADAVGLAELEARKRRIRIVTDIRSRLPVIYVDPVLIEQVLVNLLKNGAEAMHDARPDAVDPVIRVIARLEAGNVCISVVDQGPGVDEATAEHLFEPFYSTKSDGMGMGLNICRSIIESHRGRLWVVNNVEADGHITGATFHCSLPIGEPDGPSNGGREAPTPQTVTGEL >NZ_CP018811.1|WP_075156744.1|1966977_1967622_-|response-regulator-transcription-factor MNTPVTTQETVFVVDDDEAVRDSLRWLLEANGYRVQCFSSAEQFIEAWQPHQHPGQIACLILDVRMSGMSGLELQERLIADNTLLPIIFVTGHGDVPMAVSTMKKGAMDFIEKPFDEAELRKLVERMLDKARSESTSVQQQRAAAERLGKLTAREHQVLERIIAGRLNKQIADDLGISIKTVEAHRANIMEKLNVNTVADLLRLALSNKPQQAQ >NZ_CP018811.1|WP_075156755.1|1984132_1984369_+|hypothetical-protein MKKTISMYWPLAIVLPLAAAAFLHLCSSAVKHASTAPLAASEVTAELARTVSYGFVGDDEAASAQPMRANVTLQSQPL >NZ_CP018811.1|WP_075156756.1|1984391_1984877_+|low-molecular-weight-phosphotyrosine-protein-phosphatase MKTISVCFVCLGNICRSPTAEGVMRYELAEAKLSDRVTVDSAGTGNWHIGEAPDERAQLAARGRGYDLSQLRGRQIAAADFERFDLLIAMDDANVAALNRICPPAYRDKIRLLMEFAPQGDAREVADPYFGGANGFETVLDQCESACRGLVAALRVQLQVV >NZ_CP018811.1|WP_075156757.1|1985067_1985601_+|Fe-S-cluster-assembly-transcriptional-regulator-IscR MRLTTKGRFAVTAMIDLALRQEQGPVTLAGISQRQHISLSYLEQLFGKLRRHEIVESVRGPGGGYNLARRAEDVTVADIIIAVDEPLDATQCGGKGTCEGTKQRDGHCMTHELWSTLNQKMVEYLDSVSLKDLVDQQRAREATPATVLRDRRSEAAAVEPTRVLPKGPNSVFNMAGS >NZ_CP018811.1|WP_075156758.1|1985703_1986927_+|IscS-subfamily-cysteine-desulfurase MKNDIPHLPIYMDYSATTPVDPRVVDKMIPYLREQFGNPASRSHAYGWDAEHAVEEARENVAALVNADPREIIWTSGATESDNLAIKGAAHFCKGKGKHIITVKTEHKAVLDTCRELEREGYEVTYLDVKDDGLIDIEVFKAALRPDTILVSVMHVNNEIGVIQDIATIGEICREKGIIFHVDAAQATGKVEIDLQKLKVDLMSFSAHKTYGPKGIGALYVRRKPRVRIEAQMHGGGHERGMRSGTLATHQIVGMGEAFRIAREEMATENERIRMLRDRLLRGLSQIEEVYVNGDMESRVPHNLNISFNFVEGESLIMAVKDVAVSSGSACTSASLEPSYVLRALGRNDELAHSSIRFTVGRFTTEQDVDYVINLLNTKIAKLRDLSPLWEMHKDGVDLSTIQWAAH >NZ_CP018811.1|WP_075156759.1|1987035_1987464_+|Fe-S-cluster-assembly-scaffold-IscU MAYSDKVLDHYENPRNVGSFSKDDDAVGTGMVGAPACGDVMKLQIRVGADGVIEDAKFKTYGCGSAIASSSLVTEWVKGKTLDQALEIKNTQIAEELALPPVKIHCSILAEDAIKAAVADYKHRHGGEAGESAAPAGDKQAA >NZ_CP018811.1|WP_075156760.1|1987581_1987905_+|iron-sulfur-cluster-assembly-protein-IscA MAITLTEKAAQHVQKYLSRRGKGVGLRVGVRTTGCSGLAYKLEYVDELAPEDEVFECNGVKVVVDPKSLAYIDGTELDFAREGLNEGFKFNNPNAKDECGCGESFRV >NZ_CP018811.1|WP_075156761.1|1987972_1988500_+|Fe-S-protein-assembly-co-chaperone-HscB MASLNDSHFDLFHLPAQFALDTQALDDAYRTVQAQVHPDRFAAAGDAQKRIAMQWATRANEAYRTLRDPLKRATYLLSLRGIDVGAENNTAMEPAFLMQQMEWRERIEDAAAAKNVDELDALLAELRDEERVRFTKLAALLDSGSDQAASEAVRQLMFIERVASEIDSQIGKLED >NZ_CP018811.1|WP_075156762.1|1988660_1990532_+|Fe-S-protein-assembly-chaperone-HscA MALLQISEPGMAPAPHQRRLAVGIDLGTTNSLVAAVRSGVPDVLPDEEGHALLPSVVRYLQNGGRRIGRAAKAEAATDPRNTIVSVKRFMGRGKSEVEGAENAPYDFVDAPGMVQIRTIDGVKSPVEVSAEILATLRQRAEDTLGDELVGAVITVPAYFDEAQRQATKDAARLAGLNVLRLLNEPTAAAIAYGLDNGAEGLYAVFDLGGGTFDLSILKLTKGVFEVLAAGGDSALGGDDFDHALFRHVLAQAGVARDTLTPEDVRLLLDTVRSTKEALSGAAQAKVDVALASGVRIDQTIEESTFAAITESLVQRTLGPTRKALRDAKVTPADIKGVVLVGGATRMPVIRRAVESFFGQPPLINLDPDQVVALGAAIQADLLAGNRGADGDEWLLLDVIPLSLGVETMGGLTEKIIPRNSTIPVARAQDFTTFKDGQTAMAIHVVQGERELVSDCRSLARFELRGIPPMAAGAARIRVTYQVDADGLLSVFAREQHSGVEASVVVKPSYGLADDDIARMLEDSFSAAEVDMRARALREAQVEAQRLVEATDAALAADADLLDADERGALDTLLAALRNVAPGNDADAIEAATKALAAGTDEFAARRMDKGIRRALAGRKLDEI >NZ_CP018811.1|WP_075156763.1|1990666_1991008_+|ISC-system-2Fe-2S-type-ferredoxin MPQIVVLPHVELCPEGAVIDAVPGKSICDTLLENGIEIEHACEKSCACTTCHVIVREGFDALEPSEEAEDDLLDKAWGLERESRLSCQAIVPENDDLVVEIPRYSINHAKENH >NZ_CP018811.1|WP_075156764.1|1991023_1991221_+|Fe-S-cluster-assembly-protein-IscX MKWTDTQDIAMALTDKHPDIDPQQVRFTDLHRWVMELDGFDDDPNRSNEKILEAIQAAWIEDADY |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
3914849 : 3924190
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP018811|3914849:3924190|DBSCAN-SWA CGTGATCAAAAAACTCATTCGCAAGCTGTTCGGACAAGACCGGGCAGACGCCGACAATCTCGCCGACGAAGACCCGTCCGCCCCTCTGCACGAAGAAGACACTGCGGCGGCGAAGCGCGTCGAAAATCCGCCGCCGCGCGGCGCGCCCGCTGCCGCTTCTGCCGCCGCTTCTGCCGCTTCGACCTCGTCTTCGACCCGTTCGCGCCGCAAACCCGCACGCGCGTCCAGCGCCGGCGCCGCCGTCAAGGACGACAGCGCGCGCGACCCGAACCTGCCCGTCATCATTGCGTCGGACGTCCACGGCATCGACCCCGCGCTGATTTCCCGAAATGCCATCCGCGTCACCGAAGGCCTGCAGCAGGCGGGCTTTCGCGCGTTTATCGTCGGGGGCGCGGTGCGCGACCTGCTGCTCGGCATCGCACCGAAAGACTTCGACGTCGCGACCGACGCGACGCCCGAACAGGTGCAGCGGCTGTTTCGCCGCGCCCGCATCATCGGGCGGCGTTTCCAGATCGTGCATGTGCAGTTCGGCCAGGAAATCATCGAGACATCGACGTTCCGTGCATTGGTCGATCCGCCGGCCGCCGCCGCGGCGCCTGAGTCTGCCGATGCGGCACAGTCCGCTGCAGGCGCGCCCGCCACGCCGCCGCGACGCCTGAGGCGCGACGAGCTCGATCGCCGCACGCATGCGGTCGATGCGAGCGGACGCGTGCTGCGCGACAATGTGTGGGGTGAGCAGCACGAAGACGCGACGCGCCGCGATTTCACGATCAACGCAATGTATTACGACCCCGCGACGCAGACGGTGCTCGATTACCACAACGGCATGGCCGACGTGCGCGCGCGCCTGCTGCGGATGATCGGCGACCCGGCGACGCGCTATCGCGAAGACCCGGTGCGAATGCTGCGCGTCGTGCGCTTCGCGGCCAAGCTCGGCTTCGAGATCGACGATGCGACGCGCGCGCCGATCACCTCGCTCGCGGATCTGGTCAACAATGTACCTGCGGCGCGTCTCTTCGACGAAATGCTGAAGCTGCTGCTGTCGGGACATGCGCTCGCATGCCTGCAGCGGCTGCGCAAGGAAGGGTTGCATCATGGTCTGCTGCCGCTGCTCGATGTCGTGCTCGAACAGCCGCACGGCGAGCGCTTCATCACGCTCGCGCTGAACAAGACCGATGCGCGCGTGATGGCGGGCAAGCCGGTGTCGCCGGGCTTCCTGTTCGCCACGCTGTTGTGGCACGACGTGCAGCAGCGCTGGCAGCAGTACGAAGCGAACGGCGAATTTCCCGTGCCGGCGCTGCATCGCGCGATGGACGACGTGCTCGACATGCAGACCGAAAAGCTTGCCATTCACAAGCGCTTCTCGGCCGATATGCGCGAGATCTGGGGCCTGCAGATGCGTCTCGAGCGGCGCTCCGGACGCAGTGCGCTGAAGCTGATCGAACACCAAAGATTTAGAGCGGGGTATGATTTCCTCCTGCTGCGCAGCGAATCCGGTGAACTGGACGAATCGGTTGCTGCGTGGTGGACTGAATTCATCAATGGAGACGCTGCCGCGCGCGAAGCATTGCTCACGCAAGGCGGGAAGGACCGGACGCCCAGAAAACGACGGCGGCGCGGAGGTGGCGCCAGAAACCGCAAGGCGGGCGACGGTATGGAGGGCGGCACGCCCGCAGGCAACGGCGAAGTCAAGGATGTGAGCCCGAACGGCACACACGAAGACTGACGCGGTTCCGGCGGAAGCCGTCGAACGAAAGTCGTAGGAAGTGATGCCATGACGGTTGCCTATCTCGGCCTTGGCGCAAACCTCGGGGATGCGCGCCAGACCCTGAAAGACGCGGTGGTGTGCCTCGCACAGCAGCACGCGATCACCGTGCTCGCGAAGTCCAGCCTGTATCGCACTGCCCCCATCGACGCGGGCGGCGATGACTATTTCAACCTCGCCGTGAAGCTCGAGACGGCGTTGCCGGTACGGCATCTGCTCGCGCTTTGTCACAAGATCGAAACCCACTTCGGCCGCGAACGGCCGTATCGCAACGCGCCGCGCACGCTCGATCTCGACATCCTGCTGTATGGCGAGCATTGCATCGACGAACCGGATCTGACCGTGCCGCATCCGCGGCTCACCGAGCGCGCGTTCGCGCTCGTGCCGCTCGTCGAAATCGAGCCGTCGCTGATCATTCCGCAACGCGGCCGCGCGCACGATTTCCTCGAAGCGGTCGCAGCGCAACGTATCGAGAAGGTGAAGGCCGCGTGTTTGTGCCCCTCGCTGGCCGCGCTGAGCGGTAACAAAGCGGCCGGCAAGGATGCGGGCGAACCGGGCAAGGGCGGCTGCTCATGACCGCATCGCCGCTCGCGCTCGCGCCGCTCACGGTGACCGCGCCGGCCATGCGGGCGCCCTATGCATACCTCGCGATCGAAGGGCCGATCGGTGTCGGCAAGACATCGCTCGCGACGCGCCTCGCCGAACGATGGTCGATGCAGACGCTGTTCGAACGGCCGCAGGACAACCCGTTTCTCGAACGCTTCTACCGCGATACGGCGCGCTATGCGCTGCCCGCGCAGTTGAGCTTTGCGCTGCAGCGCACGCAGCAGATCCAGGAAATTGCCGCGCTGCAGGCGGCCGGCACGCCGCTCATCACCGACTTCATCACGCAGAAGAACGGCATCTTTGCGCGCCTCACGCTGAAGGACGACGAATGGCAGCTGTATCGCGCGCTGGCCGCGCGGCTCGAAGCCGACGGGCCCGCACCCGATCTCGTCGTGTATTTGCAGGCGAGTCCCGAGGTGCTGTTTTCGCGCATCCAGAAACGCGCGGTGCCGATGGAACTGCAGATCTCGGACGCCTATCTGCGCGCACTGTGCGACGCGTACAACGAGTTCTTCTATCACTACGACCGCACGCCGGTGCTGACGGTCAACGCCGAACATCTGAACCCGATCGAATCGAATGCCGACCTTGCGCTGCTGGCCGAACGCATCGAGACGATGCGCGGCCGCAAGGAATTCTTCGTCAAGGGCACGTCGCTATAACGCGCCGGCCCCGGTTTGCCGCGTCCCCTCTTCAACGAACGGATTGCCCCATGAGCTATTTGCAGGAAACGAGCCGCAACGCGATCACGGTGCCGAAACTTCAGGCAATGCGCGAGGCCGGCGAAAAGATCGCGATGCTGACCTGCTATGACGCGAGCTTCGCATCGCTGCTCGATCATGCGGGTGTCGATTCGCTGCTGATCGGCGATTCGCTCGGCAACGTGCTGCAAGGCCAGACCACCACGCTACCCGTCGCGCTCGAAGACGTCGCTTACCACACGGCCTGCGTCGCGCGCGCGCGACCGTCCGCGCTGATCGTCGCGGACCTGCCGTTCGGCACTTACGGCACCGCCGCCGACGCGTTTGCGAACGCGGTCACGCTGATGCGCGCGGGCGCGCAGATGGTGAAGCTCGAAGGCGGCGAATGGGCGGCGGACATCGTGAGGTTTCTTGTCGATCGCGCGGTGCCCGTGTGTGCGCACATCGGGTTGACGCCGCAATCGGTGCACGCGTTCGGCGGCTTCAAGGTGCAAGGCAAAACCGAAGCAGCGGCAAGCCAGATGTTGCGCGATGCGCGCGCGCTGCAGGACGCCGGCGCGCAACTGATGGTGATCGAAGCGGTTCCCGCGCGGCTCGCCGCCGAAGTGACGAAGCAGCTGAAAATTCCGACGATCGGCATCGGCGCGGGCCTCGACTGTTCGGGTCAGGTGCTCGTGCTGCACGACATGCTCGGTATCTACCCCGGCAAGCGGCCGCGCTTCGTCAAGGATTTCATGCAAGGCAGTTCGAGCATTCTCGCTGCGATCGAAGCGTACGTGCGCGAAGTGAAGGATGGCTTGTTCCCGGCTGCGGAGCACACGTTCTGAACTAGCGCCCGCGGTTAGCGCGCGGGCTTTGTTGCGGGTCGATGTGCGGGTCGACGTGCATTCGCGGTCACGACCAGTCGTCTGCGCCGACGATCCGCGCCGGCAGTGCGCCCCGCAGCGCGTTGCAGATCAGAAGCGCCTCGGCGTTGATCACATCGTCGCGCGTCAGCACCCGCTCGGCGGCGGCGATGTCCGCGTCATCGAGCAACACCGCGCGCATCACGCCCGGCAGCACGCCGCAATCGAGCGGCGGCGTCCACCACTTCCCTTCGAGCTTCACGAATACGTTCGACCTTCCGCCTTCGGTCAATTCGCCGCGTTCGTTGAAGAAGAGCGTATCGAACGCGTCCCTCACTTCCGCCTCGCGCCAGCCGCGATCGTATTCGGCGCGGCGCGTGGTCTTGTGACGCAGCAGCGGATCGTCCGACTGCACATTCGCAAAACCATGCGCGGGGCCGAGCAGCACGCCAACGCTTTCGTCCGCGAGCGGCATCAGCGGCGCCACCGTCAGTTCGACCGCGCCGGCCTTCTCGAGCACGAGCCGCATCCGGTGCGGCTGGCCGGCCTGCAGCGCATCGCAGTGCGTGGCGATGCGCGCGCGGACAGCCGCTTCGTCGAACGCGAAGCCGAGCCACGCGGCGCTTGCGCCGATACGGGCGAGATGACGGTCGATATGGCGCACGCCCGCCTCGCGCGTTGCGTACATCGTTTCGAACAGCTGGAAGCCGGGATCGGCTTCAGTCAGGAAACGCGCTTTCAATCGACACTCCGCATATTCGTCGGCCGCGACGCTATCGAGCACGATACCCGCGCCGATGCCGAGCGTACCGTGGTAGGTGCTGACGCCGCGCGGCACCGGGTCGAGCGTCAGCGTGCGAATCGCGACCGACAGGCAAAAATCGCCGCAGCGCTGTGTGGCCGGCGCCATCGGCTGCGCGGGCGCATCGAGCCAGCCGATCGCGCCGGTGTACAGGCCGCGCGGCGTACTTTCGAGCACGTCGATCAGCTGCATGGTCTGGTGTTTCGGCGCACCGGTGATCGAGCCGCACGGAAAAAGTGCGCGCAGGATATCGGCGAACGACGTGCGAGGCCGCAGCAGCGCGGTCACCGTCGACGTCATCTGCCAGACCGATTCGTAGGGCTCGACCGAGAACAGCGCCGGCACCTTGACCGAGCCGGTCTCCGCGACGCGCGACAGATCGTTGCGCAGCAGGTCGACGATCATGACGTTCTCGGCGCGGTTCTTCGGATCGTGCGCGAGAAAATCGGCGGCCGCACGGTCCGCGTCCGGGTCGCGCGAGCGCGGCGCGGTGCCTTTCATCGGACGCGCACGTAACGCGCCGCTCTTCTTTTCGATGAAGAGCTCCGGCGAACATGAGAGCACCCAGCGGCGCTCCGGCAACGCGATGAGCGCGCCGTACTGCACCGTCTGCCGCGCGCGCAGTCGCCGGAACAGCGCGGCAGGCGGTCCGAACACGTCGAACGCGAGACGGTATGTGTAGTTGACCTGATAGGAATCGCCGGAGCGCAACGCGTCGTGAATTGCGCCGATCGCGCGCTCGAACTCGGGATAATCGACGCTCGCGCGCACGCCGCCCGTGCCCGCCACCGACGGCTCGACCGCGCCGCTGTCGCGCTGCACGAGCCACGCGTCGACTTCGTCGCGCGAGAGCTTCACGCACTGATCGAACAACAGGAAACGCAGCGCGCCGGTTGTCTTCGAATGCGGACCCGCGCGGGCCCCGACGCGCTGCGTCGTTTTGGTAAGGACGAGGTTGCGGCCGAATTCGTAGTCCGCGATCACGACCGCATGCAGACCGCGGCGGGCATCGTCGGCGGCCGCCTCGCAGACGGCGTCGAGTTGCGCGCTGTCGGTACATACGCGCTCATGGCGAAAGCCCGTGTACAGACGACTCGATCGGCGGACCGCAGTCGAGTTGCAATCATCGAGCAGCGCATATACGGCATGCGGGTCATCAACCGCCATGGGCACCGCCGACATCAAGTTGCACTAATTCGAAATTTAATCGAAAAAGCTCTTGACCCGATCAAACCAGCTCTTGCTTTGCGGACTATGTCGCGCTCCGCCTTCGACAAGCGACTTCTCGAACTGCTGCAACAGATCGCGCTGCGTGTCGGTAAGCTTGACCGGCGTCTCGACTTGTACGTGCACATACAGGTCGCCTGCGATGCTCGAGCGCAAGCCCTTGATACCCTTGCCGCGCAGACGGAAGGTTTTACCTGACTGCGTGCCCTCCGAAACCGTGAAGCTCGCACGGCCGGCCAGCGTCGGCACTTCGATCTCGCCGCCGAGCGCCGCCGTCGTGAACGGGATCGGCATCTGGCAATGCAGATCGTCGCCGTCGCGCTCGAAAACCGAGTGCGGCTTGATGTGGATCTCGACATACAGATCGCCCGACGGACCGCCATTGATGCCCGGCTCGCCATTGCCGGCCGAACGGATGCGCATGCCGTCGTCGATGCCGGCAGGGATCTTCACTTCGAGCGTCTTGGTTTCCTTCGTCTTGCCCGCGCCGTGGCAGTGCGTGCAAGGCTCGGGAATGTACGTGCCGGTGCCGTGGCACTTCGGACACGTCTGCTGGATGCTGAAGAAGCCTTGCGACATCCGCACGGTGCCCGAGCCGCTGCAGGTTGGGCACGTTTCCGGCTTCGTGCCGGGCTTCGCGCCCGATCCGTGGCAGACCTCGCACGATACCCAGCTCGGCACGCGAATTTGCGTGTCGTAGCCGTGCGCAGCCTGCTCGAGCGTGATTTCCATGCTGTAGCGCAGGTCCGCGCCGCGATACACCTGCGGACCGGCACGTCCGCCGCGGCCACCGCCGCCGGCCGCCTGGCCGAAGATGTCGCCGAAGATATCGCCGAACGCGTCGGCAAAACCGCCGAAGCCTTGTGCGCCCGCCGCACCCATGTTCGGATCGACGCCTGCGTGACCGTATTGGTCGTACGCACCGCGTTTCTGCGCGTCCGACAGCATTTCATAGGCCTCTTTGGCTTCCTTGAAATGCTCTTCCGCATCCTTGTTGCCCGGATTGCGGTCGGGGTGATACTTCATCGCGAGTTTGCGATAAGCCTTCTTGATTTCGTCGTCGCTTGCGTTCTTCGCGACGCCCAGAACCTCGTAGTAATCCCGTTTCGCCATATCGGTTCAACGCCCTCCGCGCACTGCAACGCGGCGGCTCCTCATGAATGCTGGAGTCTAGCGACTCGTCCGGCGCTGGCTTCGCGGCTGTCTACCGATGCGAAGCCATGCCTGCCATAAAACAAATGTGCCCGGAGAGCCAAAAAGGCTCGCCAGGCGCGTTTTCCGTGCGGCTTCCGTTCCAGGTTTTGACCCGAGGTTCTAATCCAGGGCCTGCGCCAAGGTTTAACCGGGCGTCTGACCCAAGGTTTCGATCCACAGGGTTGACCGCAGGTCTTCCAGCCGGGCCGCGCGTGGTCGAATCACGCGCGGTTCGGGTCGGACGGCACCCCCGGCTTAGTCCTTCTTCACTTCCTTGAAGTCCGCATCGACGACATCGTCGGCCTTCTCTGCGCCCGGCGCACTGTCGGAAGCCGCAGCTCCCGCTGCCGCGCCGGCCGCGCCCTGCGCCGCCTGCATGTCGGCATACATCTTTTCGCCGAGCTTCTGCGATGCGGTCGCAACGGTCTCGATCTTCGCGTCGATCTCCGCCTTGTCAGCCGAGCCGCTCTTCAGCGTGTCTTCGAGGTCCTTCAACGCCGCTTCGATCTTTTCCTTCTCACCGGCGTCGAGCTTGTCGCCGTACTCGGTGAGCGCCTTCTTCGTGCTGTGGACCAGCGCGTCGCCCTGGTTGCGGGCATCGGCCAGCTCACGCAGCTTGTGATCTTCCTCGGCGTTCGCTTCCGCGTCCTTCACCATCTTCTCGATCTCGGCTTCGGAGAGACCCGAGTTCGCCTTGATCGTGATGCGGTTTTCCTTGCCGGTCGCCTTGTCCTTCGCGCCGACGTGCAGAATGCCGTTCGCGTCGATATCGAAGCTCACCTCGATCTGCGGCACGCCGCGCGGTGCCGGCGGAATACCTTCCAGGTTGAACTCGCCGAGCAGCTTGTTGCCCGCCGCCATTTCGCGCTCGCCCTGGAACACCTTGATCGTCACCGCCGACTGGTTGTCGTCGGCCGTCGAATACACTTGCGCGTGCTTCGTCGGGATCGTGGTGTTCTTGTTGATCATCTTCGTCATCACGCCGCCGAGCGTTTCGATACCAAGCGACAGCGGGGTCACGTCGAGCAGCAGCACGTCCTTGCGGTCGCCCGACAGCACCTGGCCCTGAATCGCAGCGCCGACTGCCACTGCTTCGTCCGGGTTCACGTCGCGGCGCGGATCCTTGCCGAAGAACTCCTTCACCTTTTCCTGCACCTTCGGCATGCGCGTCATACCGCCGACGAGGATCACGTCGTCGATCTCGCCGACCTTCACGCCCGCATCCTTGATTGCCGTGCGGCACGGTTCGATCGTGCGATCGATCAGCTCTTCGACGAGCGCTTCGAGCTTTGCGCGCGTGAACTTGAGATTCAGGTGCTTCGGACCCGACGCGTCCGCCGTAATGTACGGCAGATTGATGTCGGTCTGCTGCGTCGACGACAGTTCGATCTTCGCTTTTTCAGCAGCTTCCTTCAGGCGCTGCAGCGCGAGCACGTCCTTCGACAGGTCGACGCCTTGCTCCTTCTTGAACTCGCCGATGATGTAATCGATGATGCGCTGGTCGAAGTCTTCACCGCCAAGGAACGTGTCGCCGTTCGTCGACAGCACTTCGAACTGCATTTCGCCGTCCACATCCGCAATTTCGATGATCGAAATATCGAACGTGCCGCCACCGAGGTCGAACACCGCGATCTTGCGGTCGCCCTTTTCGGCCTTGTCGAGACCAAACGCGAGCGCAGCAGCGGTCGGCTCGTTGATGATCCGCTTCACTTCGAGGCCGGCAATGCGGCCCGCGTCCTTGGTCGCCTGACGCTGGCTGTCGTTGAAGTAGGCCGGAACGGTGATCACCGCCTCGGTAACCGGTTCGCCGAGATAGTCTTCGGCGGTCTTCTTCATCTTGCGCAGCACTTCCGCCGAGATTTGCGGCGGCGCGAGCTTCTGGCCGTGCGCCTCGACCCATGCATCGCCGTTTTCAGCCTTGACGATCTTGTACGGCATCAGGCCGATGTCCTTCTGGACTTCCTTCTCTTCGAAGCGGCGACCGATCAGACGCTTGACCGCGAACAGCGTGTTCTTCGGGTTGGTAACCGACTGGCGTTTCGCAGGCGCGCCGACGAGCACTTCGTTGTCGTCCATGTACGCGATGATCGACGGCGTCGTACGAGCGCCTTCCGAGTTCTCGATCACCTTGACCTGATTGCCTTCCATCAGCGCCACGCACGAGTTCGTGGTGCCGAGGTCGATACCGATAATCTTGCCCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP018811|3914849:3924190|3920770_3921907_-|WP_075158078.1|DBSCAN-SWA MAKRDYYEVLGVAKNASDDEIKKAYRKLAMKYHPDRNPGNKDAEEHFKEAKEAYEMLSDAQKRGAYDQYGHAGVDPNMGAAGAQGFGGFADAFGDIFGDIFGQAAGGGGRGGRAGPQVYRGADLRYSMEITLEQAAHGYDTQIRVPSWVSCEVCHGSGAKPGTKPETCPTCSGSGTVRMSQGFFSIQQTCPKCHGTGTYIPEPCTHCHGAGKTKETKTLEVKIPAGIDDGMRIRSAGNGEPGINGGPSGDLYVEIHIKPHSVFERDGDDLHCQMPIPFTTAALGGEIEVPTLAGRASFTVSEGTQSGKTFRLRGKGIKGLRSSIAGDLYVHVQVETPVKLTDTQRDLLQQFEKSLVEGGARHSPQSKSWFDRVKSFFD >NZ_CP018811|3914849:3924190|3922243_3924190_-|WP_075158079.1|DBSCAN-SWA MGKIIGIDLGTTNSCVALMEGNQVKVIENSEGARTTPSIIAYMDDNEVLVGAPAKRQSVTNPKNTLFAVKRLIGRRFEEKEVQKDIGLMPYKIVKAENGDAWVEAHGQKLAPPQISAEVLRKMKKTAEDYLGEPVTEAVITVPAYFNDSQRQATKDAGRIAGLEVKRIINEPTAAALAFGLDKAEKGDRKIAVFDLGGGTFDISIIEIADVDGEMQFEVLSTNGDTFLGGEDFDQRIIDYIIGEFKKEQGVDLSKDVLALQRLKEAAEKAKIELSSTQQTDINLPYITADASGPKHLNLKFTRAKLEALVEELIDRTIEPCRTAIKDAGVKVGEIDDVILVGGMTRMPKVQEKVKEFFGKDPRRDVNPDEAVAVGAAIQGQVLSGDRKDVLLLDVTPLSLGIETLGGVMTKMINKNTTIPTKHAQVYSTADDNQSAVTIKVFQGEREMAAGNKLLGEFNLEGIPPAPRGVPQIEVSFDIDANGILHVGAKDKATGKENRITIKANSGLSEAEIEKMVKDAEANAEEDHKLRELADARNQGDALVHSTKKALTEYGDKLDAGEKEKIEAALKDLEDTLKSGSADKAEIDAKIETVATASQKLGEKMYADMQAAQGAAGAAAGAAASDSAPGAEKADDVVDADFKEVKKD >NZ_CP018811|3914849:3924190|3914849_3916574_+|WP_075158074.1|DBSCAN-SWA MIKKLIRKLFGQDRADADNLADEDPSAPLHEEDTAAAKRVENPPPRGAPAAASAAASAASTSSSTRSRRKPARASSAGAAVKDDSARDPNLPVIIASDVHGIDPALISRNAIRVTEGLQQAGFRAFIVGGAVRDLLLGIAPKDFDVATDATPEQVQRLFRRARIIGRRFQIVHVQFGQEIIETSTFRALVDPPAAAAAPESADAAQSAAGAPATPPRRLRRDELDRRTHAVDASGRVLRDNVWGEQHEDATRRDFTINAMYYDPATQTVLDYHNGMADVRARLLRMIGDPATRYREDPVRMLRVVRFAAKLGFEIDDATRAPITSLADLVNNVPAARLFDEMLKLLLSGHALACLQRLRKEGLHHGLLPLLDVVLEQPHGERFITLALNKTDARVMAGKPVSPGFLFATLLWHDVQQRWQQYEANGEFPVPALHRAMDDVLDMQTEKLAIHKRFSADMREIWGLQMRLERRSGRSALKLIEHQRFRAGYDFLLLRSESGELDESVAAWWTEFINGDAAAREALLTQGGKDRTPRKRRRRGGGARNRKAGDGMEGGTPAGNGEVKDVSPNGTHED >NZ_CP018811|3914849:3924190|3917185_3917881_+|WP_075158076.1|DBSCAN-SWA MTASPLALAPLTVTAPAMRAPYAYLAIEGPIGVGKTSLATRLAERWSMQTLFERPQDNPFLERFYRDTARYALPAQLSFALQRTQQIQEIAALQAAGTPLITDFITQKNGIFARLTLKDDEWQLYRALAARLEADGPAPDLVVYLQASPEVLFSRIQKRAVPMELQISDAYLRALCDAYNEFFYHYDRTPVLTVNAEHLNPIESNADLALLAERIETMRGRKEFFVKGTSL >NZ_CP018811|3914849:3924190|3918814_3920734_-|WP_075158974.1|DBSCAN-SWA MAVDDPHAVYALLDDCNSTAVRRSSRLYTGFRHERVCTDSAQLDAVCEAAADDARRGLHAVVIADYEFGRNLVLTKTTQRVGARAGPHSKTTGALRFLLFDQCVKLSRDEVDAWLVQRDSGAVEPSVAGTGGVRASVDYPEFERAIGAIHDALRSGDSYQVNYTYRLAFDVFGPPAALFRRLRARQTVQYGALIALPERRWVLSCSPELFIEKKSGALRARPMKGTAPRSRDPDADRAAADFLAHDPKNRAENVMIVDLLRNDLSRVAETGSVKVPALFSVEPYESVWQMTSTVTALLRPRTSFADILRALFPCGSITGAPKHQTMQLIDVLESTPRGLYTGAIGWLDAPAQPMAPATQRCGDFCLSVAIRTLTLDPVPRGVSTYHGTLGIGAGIVLDSVAADEYAECRLKARFLTEADPGFQLFETMYATREAGVRHIDRHLARIGASAAWLGFAFDEAAVRARIATHCDALQAGQPHRMRLVLEKAGAVELTVAPLMPLADESVGVLLGPAHGFANVQSDDPLLRHKTTRRAEYDRGWREAEVRDAFDTLFFNERGELTEGGRSNVFVKLEGKWWTPPLDCGVLPGVMRAVLLDDADIAAAERVLTRDDVINAEALLICNALRGALPARIVGADDWS >NZ_CP018811|3914849:3924190|3917931_3918747_+|WP_075158077.1|DBSCAN-SWA MSYLQETSRNAITVPKLQAMREAGEKIAMLTCYDASFASLLDHAGVDSLLIGDSLGNVLQGQTTTLPVALEDVAYHTACVARARPSALIVADLPFGTYGTAADAFANAVTLMRAGAQMVKLEGGEWAADIVRFLVDRAVPVCAHIGLTPQSVHAFGGFKVQGKTEAAASQMLRDARALQDAGAQLMVIEAVPARLAAEVTKQLKIPTIGIGAGLDCSGQVLVLHDMLGIYPGKRPRFVKDFMQGSSSILAAIEAYVREVKDGLFPAAEHTF >NZ_CP018811|3914849:3924190|3916622_3917189_+|WP_075158075.1|DBSCAN-SWA MTVAYLGLGANLGDARQTLKDAVVCLAQQHAITVLAKSSLYRTAPIDAGGDDYFNLAVKLETALPVRHLLALCHKIETHFGRERPYRNAPRTLDLDILLYGEHCIDEPDLTVPHPRLTERAFALVPLVEIEPSLIIPQRGRAHDFLEAVAAQRIEKVKAACLCPSLAALSGNKAAGKDAGEPGKGGCS |
7 | unidentified_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
0 : 8597
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP018813|0:8597|DBSCAN-SWA
Protein sequences of DBSCAN-SWA_1 >NZ_CP018813|0:8597|8147_8597_-|WP_075159051.1|DBSCAN-SWA MNITLLEAELRRDEGTKYSPYIDSTGNQTIGVGRNMKANPIPKDWTFPLTDAQVNQLLAEDIQTACASLDRFLPWWRQLDEVRQRVMANMCFNMGIGTLLTFRNTLTNIKQGFYTAAATGMRASLWARQVHGRAQRLAQAMETGVMPTT >NZ_CP018813|0:8597|5905_6640_-|WP_075159056.1|DBSCAN-SWA MDLAALYAAVLRAQAAYVMDPAKAQAAFEALGCQWIGQYKDGDSQAVLSRDAAGEVCLSISGTRFSDHQLGDLFDDVDLWPVDVGGGAKVTRGAYDGCHEIWQWALSQVPAGTVFNVEGHSLGGWRTSYTPLFLPAAQIGKLHCFEPPKGANAAYYARFEKELAGMVIVANGRDIWFGYPRLDDLLPFLVDWIHRPGDVMWLHDVGYTTIQADAWPGGFDEDDHSIDLVASRITAIYAPQPKAA >NZ_CP018813|0:8597|1844_2129_+|WP_075161151.1|DBSCAN-SWA MGPHAAAAFFFRGRWNLSDTFLTPEEVEELSGIRVGRGGKTREQLQIEWLRTSGIPFWTNARGRPIIARSAIEGGARADEQPRPKWQPKVLSLR >NZ_CP018813|0:8597|4235_5315_-|WP_083615279.1|DBSCAN-SWA MTTYTANNPVPMPALPTISGVKQEFLVASTNDGVSTYAPDGLAPAPIFGLGGQQLQGNEIVAGGIATLVSFVEPLLNDGELCWILVGCTAGAVQVAPAIASEQAAQLSQVTTVAGATRNLAAIQSAAASTLTFTADELVLKTGLGGLPIILSSFNETLNLAGPVGIGGMDAGSPPANGYVGIYAAWNPTAGTRGIFATNATSSIVGETYGGQNLPTGFTYTELISVWPTDSAGKLKVGFQKERSIGIAPVTVMNSGVLTSTFKAFSIASAVPMNAKSAELNGNVGVGGQTGISADFIVASTSTGAGVGMVAGFNPPDVFSGNGSSRSMITIPQTLFYVLTTTATTGVINAELGLNSYSF >NZ_CP018813|0:8597|5526_5850_+|WP_075159057.1|DBSCAN-SWA MRPKLPEFPSPTTEQLRAIYARSDQDVRNVVLEVIRLRQLVADADACRETIDKCWKAEGLGQLVALYEFRILMQTERSRMGFISEYKGKADEQPRDAGDDGGAATPA >NZ_CP018813|0:8597|7501_8008_-|WP_075159052.1|DBSCAN-SWA MSVQQVHEQKETIEVDVLLPGHDPRTTTSLFRRSRAQLLERDGARCFISNKTAEELGAPLEAHHHPIERCYAEIIDWPKFAADCKRGLWGPHAAAFDWLSFLAAKPFDPYRFVDDMTVNGVLLGKQFHTAKDAGIHMLPYPIWIAQKYAREGYQFSPTEVIHHDPEHL >NZ_CP018813|0:8597|1285_1768_+|WP_075161152.1|DBSCAN-SWA MKALSIRQPWAWLIVRPDLTGTARAAAVSSGELKDIENRSWPTRFRGRVLIHAAKGMTRAEYEDAEDPLYWCGGPTIELPPFEQLQRGGIVGAATIDACIQSMHRSSKWHADGCFGFHLVDTRPVPFVACKGALGFFDVPKEVASHLRQMHDLGAIACNT >NZ_CP018813|0:8597|974_1253_-|WP_156884052.1|DBSCAN-SWA MNSSGLIKHAAYAFKRGDRDFWPAIAIFSVDEDGKEEQLFYGEPGYGKGFRGPGFETEEEALAAAREISAPHFSNGVLMVAYRGQIYPVARL >NZ_CP018813|0:8597|7301_7505_-|WP_075159053.1|DBSCAN-SWA MNQTSPLNTATAVGAGAVVAPVVSYVAGLFHVTLPLDVQSALVVLIVAGAHKLTQMRAAKQPAAPAQ >NZ_CP018813|0:8597|6639_7092_-|WP_075159055.1|DBSCAN-SWA MKKLMLLAAGLVASIAILAGCTSAQQQNLATLAAQAQTNVVKACAVVQPTLLDLSASIPGDPNLALLAQDNGKLCAAVATLDPSNVQSLVNTVIPQAIGLLSLLPIDAGTRATIRLALGAASIALSNWLAVYGTPTAAPPAPASTAAASA >NZ_CP018813|0:8597|2132_3197_+|WP_075161150.1|integrase|DBSCAN-SWA MGRKPTVNLNLPPRMRARPRGKVTYYYYDWGGKPRREESLGTDFVLAVRRWSELEQTQAPAAAAPTFKDAADIYIRDVLPTKAPRTQSDNLKELIFLREFFGDAPLDEIEPVHIKQYLRWRHQKAVAWFEAKKQPVPKDAGHVRANREIALFSHIFNNAREIGLTKAPNPCLGVKKNSEDGRDVYVEDDLYARVYAHADEPTRDAMDLAYLAGQRPQDTLNYDERDIRDGYLFIGQGKTGKKLRMEVVGELKAVIDRIKARKAGYKVVSTALVVTETGQRMTLRTLQYRFRAARAAAGIPAKDFQFRDLRAKAGTDKTDATDIRQAQQQLGHGSITMTEHYVRKRRGDKVGPTR >NZ_CP018813|0:8597|7116_7305_-|WP_075159054.1|DBSCAN-SWA MIGAALLALVTHVQVLPVLDKQFDVQPGAIVYAEIPPSTRLSLSVVKGAEAPTVKASIRWKF >NZ_CP018813|0:8597|3520_4210_-|WP_156883926.1|DBSCAN-SWA MTTYTATNSVPLEALPTTSGVVQKFRAAATNDGPSTYAPDGPTAAPIFGLGGQQLQGDEIVEGGIATLVSFVGSLLNDGDLCWVLLSCDAGAQQVAPATESAHAVQLGQVENIASPLPLATSASTSPNQAVNQSQVLGVAQTIIDVTASRTLGTTYTNTTGKPIIVYAAGTCGVGGGSIAITIDGLVAQIGNDNTTGHAIATNLIIPAGSAYSVFITGSVTLNSWNELR |
13 | Sinorhizobium_phage(20.0%) | integrase | attL 102:115|attR 11858:11871 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
14104 : 16129
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP018813|14104:16129|DBSCAN-SWA GTCATGCTGTTCGCGACCGACGCCACGCCGGGGACCGCCCACATGACGAGAAGAGCTGCGTCCAGAGCAGCGAGCCGCCAAGCGGGATCGACGCGATGAACGATTCGATCGCATTCTTCACGGCAGCCTCGACATCGGGAAGGGTGTACCCTGGCGCCGCTGTGACCGTCGCCGTGACATTCGCGGTCAGGTCCGACGCGGCGAACACGTCGAAGCGGATACTCAGCCCGCGAATCGCATCGATAGCCGAATAGACAGCTTCCTGCAGCGTGTCCGTGAACGGCGAGATGACGACATAGAAATAGCCGTAGAAGGGTGTGCCGTCGATCGCCTCGTTCTCGACGATTTCGTACTGGATGCCCTGTTCGAGGCCGTCGATGGCGGATTCCACCGCGATTTGATAGCTGCGCGCAGGCCCTGAATGTAGAGCTGGAAGCGCGCCATCACTGCTTCGTCGCTCTCCTGATTCACGCCGTTCGCAAAGGCGCTCGGGTTCGTGACAGTGTCGACGTACTGGATAGCGGTCGAGATCGTCGTGATGCTGCCTGCTGCGACGTTGCCCTGCGTGCCAGCGTTCTGCGCCTGCACGGTGACCTGCGCGCTCGTCACACCGGCCGGGATGATGTACGCATTCGCCGCTGCGTTGTAGTAGGTCTGGGACGAATCGGGAATCACCTGAAACGGCTGCGTACGTCCGCGGTGAGCACCATTGCCCCGCCGGAATACGTGCCGCCGATGCCGTCCGGATTGGGCGTGAAAACGCCCGCAGGGATCGTCGCTTGCGCAGTCGGCGTGAAGCGCGGAGAAGACGACCTGCCCGGTCGATGCGACAGCCTGCTCGCGCGGCGGGCAGCCGAAGTCCGAGATGAACGTGTCGACGTCGTTGCCTCGGACGTCGAAAGACGCGCGGTCGCGAGCACCTGCTGCACGAGCGATTGCAGCCACATCGTCACGCCGGCGACGGCCTCGACGCGCGCGAGCTCGAGCGAACCGATAACGAACGACAGGAAGACCGCGACGCCCGTCGCCGCAGAGGCGAACCGCCGACTGGATCGCCGCGACCTGCTGCTGGACGATCGTCGTGAAGCTTTGCGTGTTAAGAGCCATATGCGGGCACGTTGAAGTTCAGCGTCTGCGGCACGCCGTGGGCGCGGTAGATGTACTGGATCGTCACGCTCAGCAACCCCTGTCGCGTCGTGCTGATATGTGATCGTCGGGCGGCGGGCTGCTTCTGCACATCCGGCTCGACCATCACGACCGAATTGATCAGCGACTGGATCTCGGCAAGCTCTTCGGGCGAGAGCGCGTGGCCGACGAAAGCGGCCGAGACCGGCTCCATACGTCGGATGCCAGATGTACGTGCGGGAGGCGTAAGCAGCCCGCGCACGATGCGCTGGTTGAGTTCCGTTACGCCTGTCGCAAGCAGATCGTCCCCAGAGGCCGAGAACTGATGTCCTGGCCCCACCAGTGGAAGGAATCGGGCATTGCCTTGCTCTCAGGTCGGTGGGCTGGTCTGACTTCCAGCACCATGTTCAAGGTGGGTGTGGTTGTGCACGCTCTTGCCTTGCGCGGTGGAGGTCGTTGTTGACGGTCACGGGGCCGTTCAACGTGGCTTCGCCGCCCTGCGGACCGGTCCCCTGCGTGACCTGACGTCCAGTTCGATCTGCGGCGAGGAGACGTCGCCTCGGTGCCAGCGGTAAGACCAATGGTCTGCGTCGCCTGCAGCACGATCGACTTCGATGCCATGCTGATCTGCTCCGGCGCGCCCGAACGTCATCGTGCCGTCGTTGTTCAGCCTCACGAAGGATCCGGTGCTGTCGACGATCGCGGCCTGACCTGACTGAACAACCGGCGGCCGCGCGGAGTTATTGAAGAATCGCGAGCCGACGATCGCTACCTCAACGCGCCCGTCGACGAAATCGAGCCGCACTGCGTCGCCGATCGCGGGCCCGAATACCGCGCCGAACCCGTTGCCAACCCATTGCGCGCCTAGCGGGATGAAGCCCGTTTCCTTCAGCGTCGGCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP018813|14104:16129|15199_15442_-|WP_075161635.1|DBSCAN-SWA MEPVSAAFVGHALSPEELAEIQSLINSVVMVEPDVQKQPAARRSHISTTRQGLLSVTIQYIYRAHGVPQTLNFNVPAYGS >NZ_CP018813|14104:16129|15706_16129_-|WP_156884106.1|DBSCAN-SWA MPTLKETGFIPLGAQWVGNGFGAVFGPAIGDAVRLDFVDGRVEVAIVGSRFFNNSARPPVVQSGQAAIVDSTGSFVRLNNDGTMTFGRAGADQHGIEVDRAAGDADHWSYRWHRGDVSSPQIELDVRSRRGPVRRAAKPR >NZ_CP018813|14104:16129|14104_14500_-|WP_156884105.1|DBSCAN-SWA MAVESAIDGLEQGIQYEIVENEAIDGTPFYGYFYVVISPFTDTLQEAVYSAIDAIRGLSIRFDVFAASDLTANVTATVTAAPGYTLPDVEAAVKNAIESFIASIPLGGSLLWTQLFSSCGRSPAWRRSRTA >NZ_CP018813|14104:16129|14325_15210_-|WP_083615572.1|DBSCAN-SWA MALNTQSFTTIVQQQVAAIQSAVRLCGDGRRGLPVVRYRFARARARRGRRRRDDVAAIARAAGARDRASFDVRGNDVDTFISDFGCPPREQAVASTGQVVFSALHADCASDDPCGRFHAQSGRHRRHVFRRGNGAHRGRTQPFQVIPDSSQTYYNAAANAYIIPAGVTSAQVTVQAQNAGTQGNVAAGSITTISTAIQYVDTVTNPSAFANGVNQESDEAVMARFQLYIQGLRAAIKSRWNPPSTASNRASSTKSSRTRRSTAHPSTAISMSSSRRSRTRCRKLSIRLSMRFAG |
4 | uncultured_virus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
21876 : 23871
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP018813|21876:23871|DBSCAN-SWA CTTAGGGGTTCTGGACAGGCGTTACGGTCACGGTCTGACCGCCCTCGATGTTCACGAGGAAGTAGCGGACGATCGACAGGTAGGTCACCTTCACGAGCGCCTGCATGTAGCCGAGCGCGACCTGACTGAAGGGGTTGTTCTTCGCGTCGATTTCCACCGACCAGCCGGGTTGTGTCGGCGCGTTGACGTTGCCGATCATGTTGTTGGTCTGCAGGTTCGCGAAGAACGCGTCCATCGCGCCCTTCACGTTGCGGCGCAGGTTGATCGTCTGCACCTTGCCCGGCACATACCCGAACGCGCTCGCGATCGTGAACGCGATGTAGTTCGTCATGCGCGTGTAGTTGTCGCCATTCGTTGCGCTATTGCTGCTCGCGTTCTGGCCGGTGCGGCACGCGAAGATGCTGCCCGCCGGCGCGCCAAGCGTCACGACATCGAGACGCGACGTCGAGCACTGCGCGATCTCCGCGTCGCTATACGTCAGGTTCTGCAGGCTACTTTGCGTCGCGATGATGCCGTTGATTGGAGCGTTCAGGATCGACTGCTCGGGGCTCGTCGCGGCCTGAAGCGCCGACGTGTACGTCGCCGGCGAGACCATGCGCTGGACACCGTTGACCGTATCGTTGAAGAACGTCCAGTCGCCTACCAGGCACATGAAGCCATACCCGTCCACGCCAGAATTCGCGAGATTCGTCGCGCTGTCCGTGATGCTGGTGCCCGCCGGGTTCGCGCCGTGGAAATAGATGCCTTCCTGCAGGCCGAAGGCGAGCTGCGCGCTCCATGTGTTCTGGCTCGTACAGTCGATCAGGTTGCCGACCTGTGCGCCCGATTTGCGCAGGGCATACATGCCCGAGCGCACAAGGCCATCGGCGCCGAGCAGCGTGTTGTCCGACACGCCTGCAGCGCCATCGGTGCCGCCCGACAGCTGGTAGGTCGCCGTGAGGTTCGGCACATTCGGCGACGTGCCGAGCGTCGCGATGACGTTCTGCGACGGACCGCGCAACCCGGATTGCCCGTTGTTCACGGCGTTCACGAGGTTCAGCCACACACTGACCGTCAGGGCGATCGAGCCCGGCGCCGTGGCGCCGCCGCCCGTGAGCGTCGCGGTGGCGCTCGTGTAGCCGTTGCCGGCATTGCTAACCGTGAATGCACCGAGGCCCCACGTCAGCGTGACCGTCGCGCCAGTGCCGGCGCCCGAGGTCGCGGACGGCGACACCGGATTTGCCGGTACCGAGCCGCTCGTGAGCGCGCCAGCGTTCGTGACGGCAAGCGCGGTAATCGCGCCGGCATTGGCGGTCACGGTCAGCAGGACGCCATTCGGCAGCGTGAGCGTGTCACCCGTCTGAAAACCCGTACCCGCAGCAGCGACGGCCGCCGACAGGACCTTGAGGCTGATCGAACCAACAGCCTGCACACCGTTGGCCGCCTGCGGCGCGGACAGGCTCAGCGTCGGCACCGACGTGTAGCCAGTGCCCGGCGTGACCGTGCCACCGCTCACGCCCATGCCCACGTTGTCGAACACCTCCGGCGTGAAGCCGGAACGCGTGACAGAAAGCTTGTACGTGTTCGCAGCGGTGCCCGCCGTGATAGCCGCCGTGATGCCATTGCCGACGACGCCCGTGTACATCCCGGTGAGCGTCATGCCGGTGACCGGCGAGCCGGCGGTGTCTTTCAGCTGTGCGCTGGCGGCCGTATCGGTGCCGTCCGTCACACGCACCAGAATGTAGTTCTGGACGTTGTTCAGGTCGCCGATCGCAACGGCCGTCGCGATGTCGTGCAGGCGGTTCGTGACCGGGCCGATCAGCTGCTGCGCCTGCGCGCCATTGCCGACACCCATCATCGGCGCATTGACGGGACCCCACGAACCGACGCCGACCAGACCGAGGCCATCGGTCGCCACGCCGTTAATGAACGCGACGCTGGGCGGCTGGATGATGACGTACAGGTCCGGCGCCTGCAGTGCGGTCTGATTGATCTGACCAGCTTGAAAGACGGGCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP018813|21876:23871|21876_23871_-|WP_075161201.1|DBSCAN-SWA MPVFQAGQINQTALQAPDLYVIIQPPSVAFINGVATDGLGLVGVGSWGPVNAPMMGVGNGAQAQQLIGPVTNRLHDIATAVAIGDLNNVQNYILVRVTDGTDTAASAQLKDTAGSPVTGMTLTGMYTGVVGNGITAAITAGTAANTYKLSVTRSGFTPEVFDNVGMGVSGGTVTPGTGYTSVPTLSLSAPQAANGVQAVGSISLKVLSAAVAAAGTGFQTGDTLTLPNGVLLTVTANAGAITALAVTNAGALTSGSVPANPVSPSATSGAGTGATVTLTWGLGAFTVSNAGNGYTSATATLTGGGATAPGSIALTVSVWLNLVNAVNNGQSGLRGPSQNVIATLGTSPNVPNLTATYQLSGGTDGAAGVSDNTLLGADGLVRSGMYALRKSGAQVGNLIDCTSQNTWSAQLAFGLQEGIYFHGANPAGTSITDSATNLANSGVDGYGFMCLVGDWTFFNDTVNGVQRMVSPATYTSALQAATSPEQSILNAPINGIIATQSSLQNLTYSDAEIAQCSTSRLDVVTLGAPAGSIFACRTGQNASSNSATNGDNYTRMTNYIAFTIASAFGYVPGKVQTINLRRNVKGAMDAFFANLQTNNMIGNVNAPTQPGWSVEIDAKNNPFSQVALGYMQALVKVTYLSIVRYFLVNIEGGQTVTVTPVQNP |
1 | uncultured_virus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP018812_1 | 2318091-2318178 | Orphan |
NA
Consensus repeat of NZ_CP018812_1
|
1 spacers
spacers of NZ_CP018812_1
>1.1|2318116|38|NZ_CP018812|CRISPRCasFinder CTGCCGGTGTTGTTCGAGCCGGTGCCCGGATCGGTTGC |
CRISPR arrays and Neighbor proteins around NZ_CP018812_1
The CRISPR arrays of NZ_CP018812_1 >merge|NZ_CP018812|1|2318091-2318178|CRISPRCasFinder GCTGCCGGTATTGCCGCCCGTGCTGCTGCCGGTGTTGTTCGAGCCGGTGCCCGGATCGGTTGCGCCGCCGGTGCTGCTGCCGGTGTTG >NZ_CP018812|1|1|2318091-2318178|CRISPRCasFinder GCTGCCGGTATTGCCGCCCGTGCTG CTGCCGGTGTTGTTCGAGCCGGTGCCCGGATCGGTTGC GCCGCCGGTGCTGCTGCCGGTGTTG
>NZ_CP018812.1|WP_083615551.1|2315027_2315525_-|type-II-secretion-system-major-pseudopilin-GspG MRSPQAGQTRCVAQRHRQGHRQEQVQRREQARRQGGFTLLELLVVLVIIGMLAALVGPRYFAQLGKSQVTVARAQIDVFTKAVDNFRLDVGRYPTTQEGLAALVVKPANADKWAGPYLKKEVPLDPWGHPYIYRIPGTKSDYAIISNGPDSQSGTNGESAQISSE >NZ_CP018812.1|WP_075160727.1|2313754_2314942_-|type-II-secretion-system-F-family-protein MQYEVRALAPDNQIVALVIDAHDENDARRQVEARGLHATRLAPRRSLRRPAASRGALSLVLFSEELLALLTAGLSIVEGLEALLEREGSGRLRGVLERLLSGLREGKRFSSLLAEQPDVFPPLYIGIVRAAEGTSDLPRSLQRYIDYQSRIDLVRNKLVSAAIYPCILLCVGGGVAAFLIAYVVPRFATIYEGTGRSLPWMSQLLLDWGKFAAAHGLPLAGATLGIVFAGGTVLRAAIAKAGLATLLARIPVIGPHLRIYQLSRLYLTLGMLLEGGIPIVPAMETAGGTISPALRERLMQARIAVQAGTNLSSAFDNEGLTTPISLRMLRVGERSGELGDMLTKSAAFYDGEISRWIDRFTRLFEPLLMSAIGLVVGTIVILLYMPIFDLAESLS >NZ_CP018812.1|WP_083615456.1|2311901_2313752_-|type-II/IV-secretion-system-protein MPANVVPLAAGVPGVPDASDVSDAPAGSLSAQPVSLARPVLKLDAELLARARAAASATQRHLIVELEAMTGSEPRELLQTLAQQLAMQVIETPAMLAMQPAFDRVPLSRAMQRRCALLRDVDNTVKHAAQNLPEDANDRVASLCGTLIGVVTSPFDLDLQTWLAAQAGGPIEIRVTLPSDLQAYHTRMEESARAVDSLVTDGGDTRADGRTAQVLSFETVSEAASPAVKLVNSTLYDALKAGASDIHMESTATGLALKYRVDGVLDAAATIHGVETAEQVISRLKVLAELDIAERRVPQDGSFRVAAGGRDIDLRVSIMPSIHGEDAVIRILDKRAMIEAYGSLTLEALGYDGESLEALRRLAEEPYGMLLVTGPTGSGKTTTLYAALTEIHNGRDKIITIEDPVEYQLPGILQIPVNEKKGLTFARGLRSILRHDPDKIMVGEIRDRETAEIAVQSALTGHLVLTTVHANNVFDVFGRFSHMGIDPYAFVSALNGIWAQRLLRVNCSHCATPYTPTDAELARVGLTRAEVADFDFRQGAGCGDCRGTGYRGRRAIAEILILDDEIRDMVVEKQPIRLIKEVARRNGTRHLREVALAQVKRGETTLAEVKRVTLNA >NZ_CP018812.1|WP_075160726.1|2310772_2311909_-|hypothetical-protein MRDVLKRWSTTPVSIPGLPRVSLKWRRGVCRIGLSRDGVAILRTNGGRGDGVNALGARHGGTGLLADARANSSVGPAASSVTNAVALRQAAAPPLLAERSLRATSPALTPDELAAAIGDALDESGCGGLPIHATFGDDLVRYFIVTPPPNGTRVQDLRTAAEVRFQVLYGESAAAWHLVADWHAARPFLACAVPLRMHMALRLAARARRSCLVSAAPNFIGAWNRWRRRVTGDAWLATLHDGALTLGVIDGAGRRARLAAIRTLKLVEEAPPLAWLHEQLVRTALLDNVHAPKLLHVYGPLPPAWQQTPRAAMSGNSSTNASSSLNPSLDPSPNPIDSFIEVRACTAVQAAFASPEGAIATWSSAAQLVFGSQGGASR >NZ_CP018812.1|WP_075160725.1|2310224_2310776_-|PilN-domain-containing-protein MKRLHIDLAPLSARRQLYRSHPVMRVLALAALIACVIAGVRAHKLLGRLDVLERQQERIASHHAQAARARASVASRPVDAKQGAAVNAAVARLNLPWDDILDAVEAATPPKVALLSITPDASRALLRIEAEGAGSDAMIGYLQALEQQPLFGRVDLVRHEMSKDHMDGVIRFQIEARWRRSAS >NZ_CP018812.1|WP_075161543.1|2309610_2310228_-|hypothetical-protein MKMPDLMRALLWTRLALRRTGIDALAAGGLAIGAATMWFVLLPGLTVRIADEARAVARARSAPPPAPVISPQALAAQRLDAFYAALGDAGHTEQVVMHLFDAASETGVVLDKTEYKPARDAAGRFETYSIVLPVKGDYAQLRRFCEKVLLTVPYAALDDMRFKRSSANDQTVEASLRFTVFLRPVQQGEGAANTAALVAAGEAQR >NZ_CP018812.1|WP_075160724.1|2309056_2309614_-|hypothetical-protein MKPLHAVLAVAFVVCGGLLIFGRHDAPDQVVEASPRAAAAARTAAPAGRGTDATSAANPDNGALVAVGALRSRKELFGSAGGGHHALFGSQTWAPALPSVAPAGAQVPPLPSAPIAPTLPFTYIGKQASDGAWEVYLAHGDETVIVHDKSVIDATYRVDSIKPPVLTLTYLPLKLVQTIDIGSAD >NZ_CP018812.1|WP_075160723.1|2306424_2308953_-|secretion-protein MPRRQRRARRASLQWLAPLVLAVALPVLLAGCAAQKAYRAGEDLVAQGKVDEGLAKMQEAMTQEPHDAQYRATWLAVRASTLTHDNEQGDRLAASGARDAARKSYQHALTLDPENERALAGLAALDQAARLDALVAQAEAMAAKQNVAGARRLVEQVLTEAPEQPRALALQRRLTLDSGATRIEAALAAAYRRPIHIDFKDASLKQVFDVISRSANMNFLFDKDVRTDQHTSIFLRNSTIEAAVRYVLATNQLAQQVLDENTVLIYPNTPAKLKDYQELAVRTFFLSNAEAKTVANTLKTILKAHDIVVDEKLNLVIVRDTPDVIHMAEKLVALEDVAEPEVMLEVEVLEVQRNSTQSLGIQWPDSLTVTPLPVGSVISNTGSGNNTGNGFGNPFNSGSFGIPSSGGSNPPALSLNDLFHQTSKTLGVSQIQATAQANLVDSNAKLLTNPRIRVRNHEKAKILIGERVPNITSTATSTGFVSQSVNYIDVGLTLNVEPSIYLDNSVGIKVALEVSSLLNVVSGGNSGTTAYEIGTRTASTVLQLKNGETDVLAGLIDSEERTSGNKIPGFGQLPVLGRLFGATTDDDKNTEIVLAITPHLIRNIQRPDAAAAYFSAGTETNSQGLMQSNGISSASFSPGSSSSSTSATPGKAPSSGFGNNSAQGTNGLGSNYGGAGSGTVGGGAGGFAGGEGDPVVAGGAPGTGTAQMTVQGPQQVKVGDSVTVTVTMQADQPLLIVPANVTYDSSKLQFTGVTEGDFLKQGGAQTNFSSRVGQNGQLSLSDSASGGTGATASATYAVLSFRAIAPSSQTSIQVQPGTLLGLTGIPVTALPPTPYDLTVNPK >NZ_CP018812.1|WP_075160722.1|2305945_2306428_-|type-II-secretion-system-protein MTQARARGFTLIELLVTLAILGVLACMTVPVAQVMRQREREQDLRAALHEIRDAIDAYHTASKDGRIQKEAGATDYPANLELLVQGVPDQQDPKAHKMYFLRRIPRDPMNDNDPQLADSATWGKRAYASEADDPQEGDDVYDVYSLSNGIGLNGIPYRKW >NZ_CP018812.1|WP_156884030.1|2305547_2305940_-|prepilin-type-N-terminal-cleavage/methylation-domain-containing-protein MQGARHAKTAAGFTLIELLIVLAIIALMLTLALPEYFHSIDSSKEKVLVQNLHVTRVAIDQFYGDQGRYPDSLQELVDKHYLRSLPFDPVADSATTWQIVAPDEQFPGNVYDIKSGAEGTDAEGKAYGAM >NZ_CP018812.1|WP_156884031.1|2324624_2326250_+|hypothetical-protein MNTTITGSSPWPTSSSFADAPPHTKITQDRSSRAAQTNHALRQPPRARRDDPSIPTRTSDAFADYVLRAGSQAPMTEEQMFSARIGILRSISASRPRLADRQTADMIGNTLLDAMNRSPTFRSVVSYSFARNGGRFDNLTFRNEYIHNATYLGAMTPIGSMTISYLQSHGTNPLPVTAESDANRAWRNIGPYVNLGVAPNPDTPASQAWQAVLMHEITHHLTGADDPPASSQGSHLGPTEIIARRIAGEMGWALPTFRGYGDPARVAYHLQANWTGLLEAAQRNGAHEHSFFERLENISDAHDASADFHELDPPGAAGSATHNASVLADMEPDDLERVRFHDGEVSFFPFADSAPAGKPDGYQSAYAASSASWNTQGRFFRYGEPVGGNPHVRQFDFPDGSKAVITAHQPTLAASDLTGFETAMAVGGAALGGAVIGFVGSGGNPAGAYLGGAAGAAAGSAIAAKFPYDRIWQGYTLEYFNQGETAPFYTQSMYAWDSGWSRVGLLSRQRDSNLWPDYADTNPDKNWNWWTWRTGNAPKRT >NZ_CP018812.1|WP_075161546.1|2326745_2327504_+|2OG-Fe(II)-oxygenase MHGARHATDAGASVLPDRPVDERVASLDWPRIEDELDSFGCATAAGLLDPQECDRLAGLYMQDAIFRSRVVMERHGFGRGEYKYFAYPLPDIVGTLRGALYPHLVAVANRWNEALRIGVRYPSLHRDFIERCHRAGQTRPTPLLLRYKAGDYNCLHQDLYGEHVFPIQAAILLSEPGKDFTGGEFVITEQRPRMQSRAEVVTLNKGDAVIFTVNSRPVQGTRGIYRVNLRHGVSRLRSGYRHTVGVIFHDAL >NZ_CP018812.1|WP_083615457.1|2327695_2328937_+|MFS-transporter MSNAPLSPGNSMAQVTETPAFDRRVWLLALAFFAIGTDFNVVSGILPSLAQSLDVSVPAAGQVVSSYALFYAVCAPVLAGLTAHIPRKPLVVISLTAFAAANAASALSPTYAFLIGTRIVAAFAASVFSPAGYGLAATLARADRRGVALAAALSGITISLVVGVPIGAYIGNRFGWPSSFWFVTALTLIGAAGLAMRLPQIDAPAAAQAPGLAARFAPLANREVLLGMTPSLVWYVAMFSLYTYLGAAMTERGMTKEELSAIYGALGVGCIAGNHIGGKLSDRIGSQRVVAVALVIQIANLVWLGLAGSSMVANACSIAIFGMNMWFLFPAQQSRLLSISPQHGPLVLALNNSTMYLGGAIGSAASALLIRHGIATSNLPWVSCVFFFAALVLFALCSWALKRSAASGQQGVA >NZ_CP018812.1|WP_083615458.1|2329144_2329759_+|hypothetical-protein MGDLSRRTPMPAQFLTLSDDMVLDFLNTSSAEDTHLHEYFASDQHVIDWMQAHRLIGTRKLPLFADGALVNAAHALGEAIRKALFERKAGEKVQVGSLNACLAQGRYRLTLVRQSNGQLQLMHEYEHSTPEQFLAQIASAAAELLAAGDFALIRKCESDTCSLWFYDRTKAHRRRWCSMARCGNRRKVAAYRARRKGRAGAGIA >NZ_CP018812.1|WP_075160732.1|2329796_2330384_-|hypothetical-protein MASPLASVRAQDAVLEFLNTVVPRNGQLLDLFQSDSDVLAWLEGAGLLDMIEMRGDGLYCDLALEARQLREDVRQLVLQKKLGQCVDAELLNRVLSAGSYQIELAEDGEGNLNALYRFPAQSAMQVLVPVAIAAAQLLARGDFLLIRQCESPDCPLWFYDRTKSHRRRWCNMTICGNRQKAARFRTRLLCGGLVE >NZ_CP018812.1|WP_075160733.1|2330925_2331696_-|SDR-family-oxidoreductase MEKKIAIVTGASRGIGRATAIRLAQDFGGVAIAARDAELLKATARDVVAAGAESLDICIDLREPIAAEAVVAKTLDRFGRIDAVVNIAGAVPQIDVLEMTDEQWDDGFALKLHGARRLAIHAWDALKATRGAVIFTSGNSAEAPKSGFAAVATVNAAIVAMAKAFADRGIGDGIRVNSVLPGPVMTDRRRSYMEKYAASHGMTVEEALDKFPGLAGITRYGKPDDIAELMAFMLSPAGEWLIGAALRMDGGEVKGL >NZ_CP018812.1|WP_075160734.1|2331910_2332810_-|LysR-family-transcriptional-regulator MKSNVDYQARLFIEIAAHKSLSAAAEAMSLTQSGLSRHLASLEEFIGQPLFIRHGRGVELTEAGRKLLEVVSAAYQLVDNTMLQLRNEHGVTDGSIHIATIHTLSYYFMAEVAARFMAQRPSASLALLGRSSPGVVELVESGRAEIGFAYDSAVASDQLEITPLFEDTMCLVVHERSRFATKASVDLRSESVPLVAFPAGYALRKMLHTKEFDATVTAEVETVDAMLKLVSMTNGQCILPDLIPEKLLQEYQLVSVKIEQPMMRRWVVAVTRRGRPLSAMTALMLEIARESASKVMSEA >NZ_CP018812.1|WP_156884032.1|2333008_2333896_-|alpha/beta-fold-hydrolase MSKGHNVPAVSVKRIEADGVGVFYREAGPATAPVVLLLHGFPSSSHMYRRLMPRLADTYRVIAPDLPGFGFTDVPDARGYKYSFDALAQTMAAFVDKLGLKRYAVYVFDYGAPVGLRLAVKFPERVSALISQNGNAYEEGLGDAWDPIRKYWAEPTPAHRKTVHDAILNFEGTRFQYLHGVSSPDSVPPESYTLDAALLERPGQKDIQLDLFLDYRSNLKLYPVFQQFFREKQPPTLAIWGKNDPFFIPPGAEAYRRDNPGAVVRMLDTGHFALETHVDEIAAAMRDFLAEHLKP >NZ_CP018812.1|WP_075161548.1|2334147_2334822_-|haloacid-dehalogenase-type-II MSDGIRAFVFDAYGTLLDVNAAVKRNAASMGEKADAFAAMWRSRQLEYCWTRTMMNRYEDFWTVTEEALVYTLRTFGLDHDEDLKARLLSAYFVLDAHPEARQALARLKDSQVKISVLSNGTNQMIKAALEAGGLLEFIDAALSVDDLKVYKPDPRVYQYACDTLNVKPSQVVFVSSNAWDLAGASSFGFKAARINRSQLPPEYEFAGLHSEHRSLVELATLIR >NZ_CP018812.1|WP_075160735.1|2335245_2335812_-|hypothetical-protein MEPEFRFNSGRVSLDLAATIRRRASKPLDIMASAGASGRWLKAAGLFSDVPALSRSEEARLVELREAIWQMVTGAMHGRLPERAVSIVNRAAKYPLGMPQLDSTTGTVSVVSDDPLATALSIIARDAIDLVTGALKLRVKTCDQPDCRMLFVDTSPSGQRRWCSMQRCGSRAKVHAFKRKHARAAHMR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1399987 : 1431945
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP018812|1399987:1431945|DBSCAN-SWA GTTATCCGCGCACGCCCCACAATTCCAGCTTCAATCCGGGGAAATCGCCCGCCACGTGCAATGCAATCCCGCCGTGCTGTGCGACCGCTTCCCATAGCGCCCCGGTTTGCGAAAGCTCGTAGTACACATATCCGGCGTTGAACGGAATCTGTCGCGGCGGCACCGGCAACGCCTGAAGCGCAATGCCGGGCAGATGCGAGCGGACCAGGTCGGGCAACCGTTCCGACGGTCCCATCTTCGCTTGCGTGGCGAACTGCTGAACGAGAGCATCGGCGGGCATCTGGGCGCAGACAGCCAGAACCAGTGAATTGAAGCTGCGAATCTCGGCGGGATCGATTACCGCATTGCGCATGCCGTACCCGGCGTCGCCGAGAGGAATGCTTTGTGCGCTCCGTATCAGCACTGCGTTGAGCAGCCATTGCGCGTCGTCGATAACGGGTTTCAGGCAGATGTGCGGTTCGATGTGCTGATACGCCGGATGGCTATCGAGCGGCCGGCGCGTTTGGGGACGCACATAAGTCGAGAGCTCGCCGGTCATGCTGAGCAGCAGCGCGTACAAATCAGCGGGCGATGTCGTTGGAACCCGGCGCAGATGGTGCAGCAGGGGTTCGTAGCGGTTGAGTGTCTGCAATAGCAGATAGTCGGACACTTCGGCGACCGTGCCGGCTTGTCCGTCGCTGCCAGTGAGGCGCTTCGCCAGCGCGTCGGCGCGCAGCCGCGTCAGTTCATGAACGTTTGCCAGCCAGCTCTGCAGCAGCGCACTGGCGCCGTAGCCCGACACGGGAGGCACGAGCGCGTCGTCGAGTTCAATGCTGCCGTCCGCGCACAACGTCTTGACCCTGGTTAAAGGCAGCCCGATCCATCCCTCGGCCATCTCCTTTTTCGGAAGCAGTCGCAGGCGCAAATTGGACAGTTGAACCGATCTCGCTCCTTGACCGACGGAGTTCGTGTCGCGCACCTCGGTATCGAAGACGGCATAACGCGCGAGCGAATCGGGCGCGTTTTCGAATGTCGTTTCCTCGGCGTTCGGTGTGCGAATCGGTACAGCGAGATAGACGATCTGTTCGAGATGTTCGGGCCGGATTGTGAGCGGGGCGGGCGGCGGCGTGTCGCCTGGGCTATCGAAAGGCGTTCCATCGGCAAACACGCCGCTGGCGGATTTGACGACTATTTTGCCGAGTGCAAGCGCTTCATTGTCGATCGCAAAGGCGGAGAATCCGAAAAAGAATGGCGATAGCGGGACAGCGCGCTTATGCGCGAAATGCTCGAGGTAGCGTTCCTGCTGCTGAAAAAGTTGCGGGCGCAAGAAGAGCCCCTCGCTCCAGGTGACTTTGTTATGCCAGCTCATGCGTGTGGTTCACTTGTTCTGTGGTCGTGCGTCGGTGATGCGGATCGCACTGGTGTCGAGGTCGATCGTCAGTTGCACTTTGGGCGTGCGGCGATACCATGCGGCCGACGGCGTGGCCGGCAGGGTCCACGTGGCGCGCCATATGGAGTTCGGCAGATCGCGGTAGGCCGCGAGCACGCCAAGCGTGGTGGTGGCCTCTGCTGCCTTGCGCCGGATTGTCTTGCTCTCGCCGGGCCGCAACTGGAACTGGTCACGCATGACGAGATCGTCGGTGAGCACCGTTTTGTCTTTATCCTGCAGCGAGAAAAAATCGGCTTCCTCGAAGGTGTCGGCGTTTTTGAGTTCATAGATGCGTACGACGATCGGTGCCGCGCGCTTCTGGTCGTCCGGGTTGACGGTGGATGCGGCCTTTACCGACAGTTCGAGCTTGACCGGCTCATGGGCGGCTTCGGAACCGTTTCCAGCACATGAAGCGAGTGTCAGAGCGCAGACGAGTGCGATGACGGGTTTTAACGGACGCATCGGTAGTGAAGAGACAAGGGAGAAGCGCGTAAATTAAGGCATCTTAAAGATCCGGCACGTGCGCCGGATCTTCGAGGGCAGGCGTGGTCAGGCTTCCTTGTTGAGCTTGATGTCGTAACCGGCTGTCACGGCGCCGCCGCTGCCGCCCTGGGCGTTCTGCACCACGTATTCCTGTTTGACCTTTGCGAACGAAAGGCGTACGTGCTCGCGAATGCCGTTATCGGCGTTATTGCCGGATGGATGAACCTGCGTGACGATTACATCGTTCATCGTGATCTTGAGGTACTCGAGCGGATTGCCCCCGGCTTTGCGCACGGTCAGCACGGCCTGATCGATGTGCTTGCCGGTCAGGCAATATTTCATGAGGTTCGGACTTGCGCGGTCGACCAGGTGTTCGAATGCCAGATCTTCGACGGTCGCCTTGCCGGCGCCGCCTCCGGAGCCCGCATGCATGCTCGATTGCTGCAGGATTTGCCATCCCCAGCTCGACACCTCGATTTCGTTCTTGTGGGCCGAGTCCTGCGATTCGCCATCGATGCCGTTGATCTTGATGAAAATGTCTTGTGCCATTTGAAACTCCGTTTTCGAATGAAACCTCTGATTCACGTACTGCTTTCTCAAGCTTCTGATTTCCGCATTCAACGCCTCGCGGAGGGTTGTCGAGCCGCTACTCGCACGGATGTCATCGATGCGCCATTGCTTGCTTGTCCAGACAAGGTGACGTGCCCCGCAAGGTCGCCGCCCCGATACGGTCGCGGTCCATTGTCAGGACGATGGCGCTGCTGTCAGTTCGGCGCTGCGTTGCAGCGCATGCAGACATGCTGAGCGACGCGGATAACGGTCTGATTCGGCCACCCGGGTAACGGCTGGGATTGCCCTATCCAGATATTCCTTGAGTTCGCGCGCTGTGGGTTCAATGAACGGTGCGCTGCATACGGCGTTCGAACCGATCACAAGGCCGTCATCGAGAATGGTCAACACGGCCTTGGGGTTCTTCGGCAACGCCCGCGCCAGGGCGATGGTCAGCCCCGTGGAATTGCCTGCGTCGGTGCCTTTGGCAAGGTCGGGTGAGAGGCGAATCCATGCGGCATTTCCAGAGGCGATGCTGTTCAGTACGCTGTCGAGACGACCGGTCTGGTCCAGTGACCGCACCGTGGCACGGGCGCCGTGCGCCCGGATGTCTTGCGCGACCTGTCCGGGTGTCAATTGCCGTGCGCTGGCTGGAATGGCAAGAGCGGCGATAGCGAGTGCGCAGAGTAAATGGCGGCTGTTCATTGCACCACCAGTTCGCGGTCAGTACTTCGATTGATGTCCTCGCGCACGTCGCGCCGGAGAATGATGCAACCATGGGACGCGGTATGGCTGGAGTTATCGCCATGAATCAGGAATGCGCTGCGTCCAAAAGTGCTGGTACCGACATTCGGAGACAGGTCCATGGAAACCGGACCGGTGCGTACCGAGTGCCGTGCGCCCCGAATCTGGTAACTGCCGCGAGGAATCGGGCCGACGTTTGGTTCCCGCTCCATGGCGGGATTATTGCGGCCTTGCCCCGCACCTGAATAACCGTCTGATGCGACGACACGTCCTTCGGGGTTCGTCAGTGTTCCCGTTCTCTGCTCGTATGTCCAAGGCATGGTATTTCCTTTATTAAGAAGACTTGCGACGCATTCGACAGCATGCGAATGCGTCGCCGACTCCTGCAACCGGCTTTCTTCAGGCGGTTGTCGCTCAGACCGCTTCCTTCACCGACGGCAGTCGCGCAACCAGCCGCAGCGACACCGTCAGACCTTCCAGCTGGAAGTGCGGTCGTAGAAAGAACTTCGCCTGGTAATAGCCTGGATTGCCGTCAACATCCTCGACCACAACCTCTGCTGCCGCGAGTGGACGCCGCGCCTTCGTTTCCTGCGATGAATTCGCGGGGTCCGCGTCGACGTAGTTCATGATCCATTCGTTGAGCCAACGCTGCATGTCCTCGCGCTCGCGGAACGTGCCGATCTTGTCGCGCACGATGCACTTCAGGTAGTGCGCAAAACGCGAGCATGCGAACAGATAGGGCAGACGAGACGACAGATTCGCATTGGCAGTCGCATCGGGGTCGTGATATTCCGCGGGCTTTTGCAGCGACTGCGCGCCGATGAAGGTCGCGTGATCGGTGTTCTTGCGGTGGATGAGCGGAATAAAGCCGTTCTTCGACAGTTCGGCTTCGCGACGATCGGAAATGGCGATCTCCGTCGGGCACTTCATGTCGATGCCGCCGTCGTCGGTCGGAAAAGTGTGGCAAGGCAGATTTTCGACCGTGCCGCCGGATTCGACGCCGCGGATCAGCGAGCACCAGCCATACAGCTTGAACGAGCGATTGATGTTGACACCCATCGCATAAGCCGCATTAGCCCACGCATAATTGCGATGATCCGAACCCTGCGTGTCTTCCTCGAAATCGAATTCATCGACGGGGTTTGTCTGCGCGCCGTATGGCAGCCGTGACAGGAAGCGCGGCATCGCAAGCCCGATGTAGCGTGCGTCTTCGGCATGACGCAGCGAATTCCACGGCGCGTACTCGAGGTTCTGCGTGAAGATTTTGGTCAGGTCGCGCGGGTTCGCGAGTTCCTGCCACGATTCCATTTGCAACACCGATGGCGACGCGCCCGAGACGAATGGCGTGTGCGAGGCGGCAGCGATTTTCGCGATCGAACCGAGCAGGTCGACATCCGGCGGCGTGTGATCGAAGTAGTAGTCGGCGACGAGGCATCCATACGGTTCGCCGCCGAGTTGCCCGTATTCTTCCTCGTAAATCTGCTTGAAGAGCGGGCTCTGATCCCACGCGAGCCCTTTGTAACGCTTCATCGAGCGGCGCAACTCTTCCTTCGAGATGTCCATGAAGCGGATTTTCAACCGTTCGTCCGTCTCGGTGTTCGAGACAAGATGATGCAGGCCGCGCCACGCCGACTCGAGCTTCTGGAAGTCGTGATGATGCAGGATCAGATTGATTTGCTCCGATAGCTTGTGATCGATCTGCGCGATGATCGCTTCGATGCTTTTGTACGCGTCGTCGCTGATCGTGACCGATTGCTGGAGGGCCTGCTCGGCAAGCGTCTGCACCGCATTTTCCACGGCCTCCCGAGCAGCCTCGGTTTTCGGACGGAACTCCTGCGTGAGCAGTTGGGAAAAATCGGTCTGCGTATGGACTGCCTTGTTCGACGCCGTGTTCGGTGCCTGTACTTGCTGTGTTGACATGAGGTTCTCTAAGAGTCTTTGTTTGCGTTCGAGGGCGCGTGGCCGTATGCCCGGCCGTACGCTCGGCAAAGCGCGATGCCGATCAGTTAAGCGAAGCGGGGCGGTCGCTTTCCGTCGTCGTTACCTCTGCTTGCGGTTTCGGCGCCGCCGCAAGCGATTTCAGCAGCGCGGGATCGGCCAGCAGATTGTTGACGAGCGCTTCGGCGCCGGACTTGCCGTCCATGTACGTCTGCAAGTTCGCCAGTTGCGTGCGGGCGTCGAGCAACTGCCGCAGCGAATAGACCTTGCTCGCGACCGCAGCCGGCGAAAAGTCCTCGATGCTCTCGAAAGTCATGTCGACCATCAATTGCCCCTCGCCGGTGAGCGTGTTCGGCACGGCAAACGCGACGCGCGGTTTGATCGCTTTCATGCGCTCGTCGAAATTGTCGATGTCGATATCGAGGAAACGGCGGTCGGTGACGGCCGGCAACGATTCGAGCGGATTGCCGGACAGATCCGCTAGAACCCCCATCACGAACGGTAATTCCACTTTCTTTTCGGATCCGTAGATCTCGACGTCGTATTCGATCTGGACGCGCGGGGCACGGTTGCGCGCGATGAACTTCTGCGAACTATTTGAAGCGGACATACTTGACTCCGTGAGCTAGGAAAAGCGTCGGCACTTTATTGCGGCTCGCCGACGGAAACGGTGGATGCAGTCACGAGCGGTTTGAACGCCAGCGCTGCGGCAGGCTGTGCGGTGCTATCGACGATTCCGACACCGATGCCTTGCAGCGTGCGTTTCAACGAGAATGCATCGAGCCATAACGCCGACAGCGCGGGAAGGATGTGCTGTTCGATAAAGCCGATCAGGACGCGCGCACCTGTCTCCTGCACGGCGCAACGCTCGACGACGTAGTCGACTACGTGTCCGGTATAGCTGAGCGCGACGCCGTGGTTATCGGCCATACGCCGTGCTACGCGCTCGAGGTGCAGACGCACGATGCTCGCAAGCGACGCGTGAGCCAGCGGCCGGTATGGCACCACGGTGACGCGGCCGAGAAAGGCGGCGGGGAACGTTTTCAGCAACTGCGGTGTGAGTGCTTCCCGTAGTCGATCGTGATCGGGCACAAGTGCGACGTCCGAGCACAGGTTCGCGATCAATTCGGCACCTGCATTGCTCGTCAGTAAAATTGTCGTATTGCGAAAATCGATGTACCGTCCATCACCGTCTTCCATGTAGCCTTTGTCGAACACCTGGAAGAACATCTCATGCACGTCGTGATGTGCCTTCTCGATTTCGTCGAGCAGCACGACCGAATAGGGGCGCCGCCGGACGGCTTCGGTCAATACACCGCCTTCTCCGTAACCAACGTACCCGGGTGGCGCGCCTTTCAGACCGGATACCGTGTGCGCCTCCTGATACTCGCTCATGTTGACGGTGATCAGATTCTGCTCGCCTCCATAGAGCGCTTCGGCAAGGGCGAGCGCGGTTTCTGTCTTGCCTACGCCAGAAGGGCCCGCCAGCAGGAACACGCCAAGCGGTTTCCTGGGATCGGCCAGACCGGCACGCGCGGTCTGGACGCGTTCGCTGATTCGACGCAGCGCATCAGTCTGTCCGATCACGCGAGCGGCCAGTGTCTCGGGAAGTGTTCGCACGGCCGCGATTTCATCCGTCACCATGCGGCCCGCCGGGATGCCTGTCCAGTCGGAGACGATTGCCGCCACGATTGCTTCGTTCACTTCCGGGAAGACGAACGGCGCCTCGCCTTGCGCGCTATGAAGGTCTCGTTCGAGTTTGCGCAGTTCGTCTCGCGATGAACGAGAATTGCCAGAAGCGTCCGATTCGCTCGTACACGCCGCCTCACGCGCGCTGGCTAGCGCCTTTGCCGCTGCAGTCTGCCTTTCCCAACAGGCCGTGATTGCGGCCTCTTCCGCCGACAGGGCGGTGATTTTCTCCTGTGCGGCGGCGATGGCCTTATCGGCGTCGAGGCCGATGCGCGCTTCTTTACCGAGCAGATCGCGTTCGACGTGCGCGGCCTGAAGACGTCGACGTACGGTCTGCAGTTCGCGAGGCGGAGCGTGCTGGGATAGCGCGACACGTGCACAGGCCGTATCGAGCAGACTGATCGCCTTGTCAGGCAATTGCCGCGTGGGGATGTAGCGGTGCGACAACATAACCGAGGCCCGAATCGCTTCGTCCCGGACCACGACGCCGTGGTGAATCGATAGCGCGTCAGCCAGCCCCCGCACCATATCGATGGCCTCCCGTTCCTGTGGCTCGGGAATCTGCAATACCTGGAACCGCCGGGTAAGCGCAGGGTCTTTCTCGACGTGCCGTTTGTATTCTGTCCATGTCGTCGCGCCGATCATGCGAATCGCACCGCGCGCAAGCGCCGGCTTCAGGAGGTTGGCGGCGTCGCCTGTGCCTGCCTGGCCGCCGGCGCCGATCAGCATGTGAATTTCGTCGACGAACAGAACGATAGGCGTTGTACTCTTGAGCACCGCCTCGAGCACGCCTTTGAGCCGCGACTCGAACTCGCCCTTCATGCTGGCGCCCGCCAGCAATGCACCGACATCGAGCGTCATCAATCGCACGGCCGAAAGCGCGGGCGGCACGTCGTTTTTTGCAATCGCAAGCGCCAGTCCTTCTACGACGGCAGTCTTGCCGACGCCGGCTTCCCCGGTAAGCAACGGGTTGTTTTGCCGCCTGCGCATCAGGATGTCGACGATTGCGCGGGTCTCCCGCTCGCGTCCCGTCACGGCGTCGATCTCTCCAGCGCCTGCACGAGCTGTCAGATCAACGCAGTACTGATCGAGCGGCGATCGTTGCGCGGTTGCGAGCAGTGCTTGCGACGCCTCACCGGGTATGGCGGGAGAAAAACTGGTGCTGTCGTAAGGCGCATCGGATGATTCCGGAGAGCCTTCGATCCAAGCGATCAACTGATCGCCTGGTGTGTCCGCATTGATTCCTGCGAATGTCGGGGAAATCGAAAGCAGTACACGCCGTAACTCCGGCGTCCCGACGAGCGCGGCAATCAGCCACGCGCCACGCACGCGCCGATCGTCGAATGCGAGTGTCGCGAGCACCCAGGCGCGCTCGATCGCCGTTTCGACGTGGTGGGAGAAGTCGCTGATGGAGCTTGCGCCGGCTGGAAGCGCATTCAATGCACGAATGATGTCGCGCTCCAGTGTTTCCCGTTCGATACCGGCGTGCCTGATGATGCGATGCAGGTCGCTGTCGGGCAGTTGCATCAATTGATGCAACCAGTGAACCAGTTCGACATAGGGATTGCCGCGCAGTTTGCAGAACGCAGTTGCCGATTCGATGCTGCGGAAGAGCGTGGCCCCGAGTTTGCCGAATAGTGACTGGCGGGAAATCGTCATGAAGTGGTCTAAAAATATTGTAATGACTAACTGCTGATTACTAATTTGCAGGTTGATGTCGGATTAATCCGACAGGTGTAGAGAAAATTAAATGATGTTGTAGTCCATGCCTGGCTGACGTGTGCGAAATCTGCTTTGGTCAGAGTTTGCTGGATTCGAATTTGCCGGTTCAAAAATCGCTCTGTTCGCAGATTATTAAATCATGAGGCGTCATCGGTCAACACCGATAATCAACTTGTGGCATGTTGAATTTATAGCTGACATAAGCAAGTACTTTCGAAGGAGCGTTGCTATATACAAACAAAGCTATCACGTAACTAGTTGATTTTATTATTGTTATGAATATTGGATATTTTGTGGCATGTAAGGCGATTCTTGTCGCCTGTCGTGTTTTTCAGTTTGAGTGTGAGAACGTTATAGGATTTATCCTAGTCGAATTATTTTGATATTTCGCTTATCTCGAATTAACCTTCTAATTTATATTTGAGATGAAAATTATCGATGGACAAAATTCATAAAGGTGTGCGTTAATCCGGCTCGTTTCTCTGACATATCGAAATTGATCAGCCAGATTTACGTTATCCGATGACTTACTCCCGATTTTTTCGCGCCTTCCGCGATGCAACCAGTCAACTCGCAATCCCAGGCCGCGAGCGCGCTGACGCGAGAGCGTTTGCAATGCCGCCGCTTACGGAGGGCGATCGCGCATCGGCGGCTATCGATAACGGGTCGTGCGGCTCGGACAGCCGGTCATCGTGTGGCAGCGGTGGATTCCTTGATTTGTTCAATGAATTTTCGGACGTAGCACATGGCAGGTCGCCTTCAGAACGTGACGAAGCGTCGGCAGCAACTGCACATGATGACCTCATGGCCACGCTCGGCGACCGGTATTACCGCGCGTTGGAATCAGCCGACAGTTTGTTCGGTGAATGGGTAGACGGCGGTGCGTCTTCGGGCGAAATGTCGGCGGATATACCTGCCGCGAATCGTGATTTCAGTGATGCACCCTCAAGTCCGATCATCGGTCTGTTGTCCGACCTCGAGAGATTGGAGGAAGCGTTTGGACCGCTTCGCAACGCCGGGGTCGGCGTTCCTTTCGAGGTAGAGAAAGTACCCGAAATCCTGCGGCTTTTCGCTCCCCCGGAATTTCAGGCGTCGGCGGCCAGGCGTTTGGATACCGTGTTGCCATCTATCGCGCGCCGCGATCATCACACGCTGGCGATCGACAGTCCGCTCGTCGCTCCGAATGGAATGACCAGCACCGACAGCGTCGTGGGGCACGCCGAATGACTCACTACGCTGCCAGGACCTCCACCTCAGTTACACGACGGCATGTATCGCCTGCTGGCGAGATTGAAACCATCGAGACAAGCATACGCCGGGAGCCCGCCGTGGCTTTGCATCGCTGGGCGCTGTTTCAGTGGTTATGCGTTACCGGGCAATGGGAGCGTGCGGTCCAGCAGATCCAGATATTTGCGCAGCTCGATCCACCATGGGCTCGTGTTGCGCAGGCGGGCCGCGAGCTTGTGCGTGCCGAACTGCTGCGTACAAAAGTGATGGCAGGCCTCGCGAAACCGGGATTTGTTTTCGACGATGTCCCGGCCTGGATGCAGAGCTTGCTGGACGCGCTCGAACTGGCTGCCCAAGGGCGGCTCGATGCATCCGACGATACTCGCGAGCGGGCGCTTGATCTCGCGCCGCTCGTCGCGGGTCGCGGCGGCGGACACGCGTTCGCGTGGATCGGCGACAGTGATTCGAGGCTGGGACCCGTGTGCGAGTTTGTCACCGCGGGCCGTTACCGATGGCTGCCGCTTGCGGATATCGCCGGTTGGCGGATCGAGTGCTCGGGTTCGCCGATCGACCTCGTCTGGGTGTCCTGCGTGCTGACGTTGATCGACGGAGCTGTGTTGCGCGGTTTCATGCCCGCGCGCTATCCGGTTTCGGGTTCTGAAGCGGCGCACGATCGAGAGGCGCTGCAGCTTGGTGATACCACCGTCTGGCAGGACAGGGGGCGCACGGGCGTCTTTGCGTCTGGCCGCAAGACCTGGGCTACGAGCGCAGGCGACTTTGGCCTGTTCGAACTGACCCGCTGCACGTTAGGTTCCGCGATTGCCGATAGCTGTGGTGTCGATCAGTGCACGGCGAAGGGAGTAAGCGAGTGAGCCGCAAACCTCGAGGCAGTGGCCAGGCTTCTACAGCGCCGCGCCGCGCAAGTGCGCACCTTCTCCCCACGTTGATCGACCGATTGCGCGATGACGCGCCTCATCGGCAGGTTGAAACAGCTCATGAATATGCGGTCACGCCTACCCGGATGCGCGACATCATTCAACGCGATCTTACGTTTTTGCTCAATGCGACGAGCATTGAAGATCTGATCGATCGCAAGCGCTATCCACATGCGGCAGCGTCTACGGTTAATTTTGGCGTGCGGCCTCTCGCGGGCGCGTTTACCGCGTCGCGTCGGTGGACCGAGATCGAGAAAAGCATTCTCCACGCTATTGGCGATTTCGAGCCGCGTCTTGTTCCTGGATCGGTGCGCATCGTGCCGTTGACGGAGGCCGACGGCAACGCTCACTACAACGAATTGGCGTTTGAGATTCGCGGCACTATCCGCATGGATCCTTATCCGCTCGAATTTCTCGTGCAGAGTTCGCTCGACCTCGAATCGAGCCGTTTATATACGAACGCCCGCTAATTTGAGGATGCTCGAATGGACCCGCGATTGCTCGATTACTACAATCAGGAACTCCTGTACATGCGCGAACTCGCGAGCGAGTTCGCGCAGATGCATCCGAAGATCGCGAGGCGCCTCGGCATGCAGGCGGGGGAGGTCGCCGACCCCTACGTCGAACGGCTCATCGAGTCGTTCAGCTTTATGGCTGCACGAATGCAATTGAAGCTCGATGCGGAATTTCCGCGATTTACAGGGCGATTGCTCGAGGTTGTCTACCCGAACTATGTCGCGCCGACGCCGTCGATGGCGGTGGTCCGCTTTCATCCTAGCCAGACACAAGGCAATCTGATCGAGGGCTTTCACGTTCCGCGTGCGACCACGTTGACGGGCGCGGCGCCGGTGGGCGAACAGACAACATGTGAGTTCCGCACCAGTCAACCTGTCACGCTGTATCCGCTCGAGATCGTCGAGGCGCGACTGACAGGCATTCCACCCGATATTCCGATGCTTGACCGCTATGTGCCACCCGATACGCAGATACGCGGCGCATTGCGCTTGCGGTTGCGCACGACAGGTGAGGTGAGAGTCGCGGCTCTTCGTGGGCTCGACCGTCTGCCTGTCTATCTTGCGGGTGACGAACAGGTCGCATCGCAATTGTTCGAACTGCTTCATGCAGCGGGTGTGGCGTCGATCACCGGCGAACCTGGCGCGTTCTCCGATCCGGACCGGCCATTCTCGGCTGTCATGCAGGATGCCGTCATGCATGAGGGGTTAGGTATCGATCAGGGGCTTTTGCCGCTTGTCTGGTCTAAATTCCATGGTCACAACCTGCTGCATGAATATTTCGCTTGCCCGAGCCGCTTTTACTTCTTTACGTTGACAGGTCTGCAAAAGGGGCTGCAAAAGGCGAAAGGGAACGGGCTTGAAATTGTCGTGCTGCTCGATCAGCTTCCCCAAAAGCTGACCGGACTGGTCGATGAATCGCGCTTTGCGTTGTTCTGTACGCCTGTCATCAACCTGTTTCCCCGCGGAATTGAACGTATCGAACTGAACAAGGCAAGCACCGACTTTCATCTGGTGCCCAAACGGCTCGCTCCGCTCGACTATGAGGTCTATGCCGTCAACTCCCTGACGGCGCAGGCCGGCAAGGAAACCGCAGCACTCGAATTCCGGCCGTTGTACCAGACATTGAACAACGACGAAGGCAATCACGGCCGATATTTCTCGCTGCGGCGCGAGCGTCGGCTGATGTCCGATTCGGCACGCCGCTATGGCACACGCACACCCTACGTCGGAACCGAAGTGTTCGTGTCGCTGGTTGACCAGCACGACGCGCCGTATCACGAGGGAATGCGCTATCTGTCGGTCGACGCATGGCTGACCAATCGCGACTTGCCGAACCTGTTGCCGCGCAACGGTATCGACGATCTCAAAATCGCTGCATCGTTCCCGGTCACGGGAGTCGGACTGATCCGCGCTCCGAGTACGCCGCGCGCGCCGTTTGCGGAGCATGAAACAGCATGGAGGCTGATCCGGCAACTGAACTTCAACTATCTGCCGCTCGAAGATATGGATCATCGACCCGGGGGGCAAGGTCTGCGAGACATGTTGCGGCTGTTCCTGAGTACTGACGACACCGGGCTTCAGCACCAGGTGCAAAGTCTTGTCGGCGTGAAAACGAGGCCGGTCGCAAGGAAGTTACCCGGTAACGGGCCGCTCGTTTTCGGCCGCGGCATCGAGTGCCAACTGACGGTTGACGAAGGCGGCTTTTCCGGTACGAGTCCTTACCTGTTCGGCTTGATTCTTGAACACTATCTGGCAAGGCACGTCTCGATCAATTCTTTCACCCAGACCGAACTCCATTCGATGCAGCGTGGCCGCGTCATGCGCTGGCCGGTACGCATCGGTGCGCGTGGGGTGGCGTGATGAAGCGAACGCCATTCGCATCGCGGCTGCGAGAACCGACGCTATCGGCTGAGACGGTTGAATGCCTGTATGAGCAGCCCTGGCGCTACGGATTTCTCCCGCTCATGCGTCGCATCGGCGCCGACGACCGCATCGATCCGATCGGCACAGCGCGTCGGCCGGGCGCAGAACCGTTCCGGCTCGGACAGAAGCCAAGTCTGGCATTTGCGCCGCGCGAACTCGCGAGCGTCGCAGAGGTGGGCGGACGTCTCAACGTTCGTCTTTTCGGGCTTGGCATGTTGGGGCCGAACGGGCCGCTGCCGATCCACGTAACAGAGATTGCAAGGGAACGTGAAGAAAATCGGCACGACCACACGCTGGTCGATTTTCTCGACATTTTTCACCATCGCTATTTCACATTGCTGTATCGCGCGTGGGCTGACGCGCAGGCGACGGTCGGGCTCGACAGAGCGGGCGCTGGCGGCGAGCGGTTCTCGTTCTACATCGCGAGCTTGAGCGGGGACGACAGTGCAGAGATCGCTCGACGTGTGCTACCCGCGCACGCGCGATTAGCGGCGTCGGCGCATCTCGTGCGCGAGGCGCGTGATCCCGACGGCTTGCGCTCCACGCTTGAACGCTTCTTCGGCGTGCCGGTAGCCATCGACGAATACGTATTTCACTGGATCGATATCGCGCATGCGGATCAATGCCGTCTGGGCCGGCGCGGCGATGTCGCGACGATGGGCAGGGGCGCGATGCTCGGCGAGCAGGTGCCTGATCGCCAGCACCGCTTCCGGGTCGTCATTGGGCCGCTCGACCTCGACGAATACCTCCGTTTCACGCCACGCGGCGTAGATCTGCCGAAGCTCGTGGACTGGGTGCGCGTTTTTGTCGGTCGCGAATTCGAATGGGAAGTCGAGTTGCGGATTCGCGCGCAAAGCGCACCGCCTGCGCAGATTGGCGGCCCGCAGCAATTGGGTTGGACCGGATGGCTCGGTTGCTCATCTTCTGGCGAGTCGATCACGGGCATGAGGTTCGAGCCCGAGCGGTATGCCGATCAGTTTGTGAATACATGTGGTGAGCGGCGTGTACAGCCAGATGTGCAGGGGCAATATGAACAGCGCTAGGAGGCAGGCCTTGCCGCAAGCAGAAACAGCACGATTCGAAGCGCCATTGCCGTACGACTGGATGACACCCGTCGACGTGGATTCGCCATGCGGGCCGGACCTCGAATACGACCCGGAGTTCGTCGTGCTGTCGTCACAGCTTGCCGCGAGGATGGATGCGCAGTACGGGGACTTCGTCGGTACCCCGGAACCCGTGGACTGGGGCGAGGCGGAGCGAGACTGCCGGCGTCTGCTTCTGCGTAGCAAGGACATGCGCCTTGCCGTGCTCTTTACGCGTTGCCGCACGAGGCTCGCGGCGTCGAGCGGACTGGCGGAGGGGCTGACGCTGCTGGCTGCCTGGCTCGGTGCGTTTCCCGAGACTGTTCACCCGCAACCCGGCGTCGATGACGACCTGGACGTCGCATCGGAAATTCGCATGAACGCGTTGCAAACGCTGACCGATACCGATGGCCTGATGTCGGATGTCCGGGAAATCGCGCTGACGCGTTCGTCGGCCGCACGTTTGCAGGTCCGCGATGTCGAGCGCGCGTTTGCCTATCCGCGGCCAGTCGATTCGCTTGCGCCGGAGTCTGTCACGCGACAGATCGACGATCTGCGCAGCCGGCAGCCGGACGCGCTGGCAGGTTTCGAGGCGGCGCTCACCAGTCTGGAATGTATTGAAAGCTGGTGCAAAGGGCATCTCGGCGTCTATCAGCCGGACTTTTCCACTTTGAGCCGGCTTTTGCGGCGCGTGATTGGCGACGCAGTGACGACTGTCTCCGTTCCTCAAGAAGCGCCTTGCGCCGAGGCGATCGTTCCTGAAACGTCGGGCGGGGGCGTATTGGCGGGCTCTCGTCACGCCGGGTTGCCGCGAATGGTCGCTTCGGACACGGTAGCGCTCGATGCCGGCACGTTGTCTTTACGGGATCGTGACGAAGCGCTGGATCTTATCCGCGAAGCGCGGCACTGGTTCGAGCGACACGAGCCGAGCAGCCCGATTCCCGTGCTGCTGCGCAGGGCCGAGCAGTTTGTCGGCAAGCGATACGCGGACGTTGTGAAGGCAATCCCGGCGGAATTGCTTGAACAATGGGAAAGGCAGGAGTGAGCCGCGATGGCGAACGAGGCGAAACAGATGAGTGCGGCAGGTCTGGCGGCATTGAGAGTGCGCGAAGGCACTGTGTTTCACTACTACAACGACATGGCGAACAACTGCACGTATGGCGTGGGCACGCTGGCGCACCATGGGCCTTGCACGGACGACGAAGTGCGTCGCCCGGTGACGACCGCGGACGTCAATGCGCAACTGTCGCTAGCGGTGCGCAGCGCCGAGGCCGCGGTGCGGAGACAGGTATCGGCGCGCGAATTGAGCCAGGATCAGTTCGATGCGCTTGTCAGTTATACGTACAACGCCGGCGCGACGGGTGCACGTCCGGCTCTCGAGGCGGCTAACGCGGGGCGGGACGCTCAGGTCGTCACTCACATGAACCGGAACGTGAATCTCTACCCCCGCGATGCTCATGGACACCGTCTGCATCCGGTGCGATCGGCAGGCCTCGTCAACCGGCGCCGTGAAGAGTCCGCGCCGTTCAGGTTGCCGGAGGCACCCGCACGATGAAGCGCCTGGCTTTCTTTTTGATCGGTCTCGTCTGCACATCGCTTCATGCCGCGGCTACGGATCCTGTCGACGAAATTGCCAACCGCAGCGGATTGCCCGCGAGCGAGGTGAGCGCGCTGATCGCGAATTGCGACGCGAGCCAGACAAGCATGAATTTTTGTGCGTGGCGGGACCAGCTCGTCGCCGAACAGAATCTGCATCTCGTAATGGCCGACAGGGAAGCGCAATCGCCGACATGCAAGGCGCGCCTCGAAAAGCAAATATCGCGCTGGATCACGCAGCGCGATCGCGCCTGCCGGAGCGAAGCGCAACAGGCCTGGGGAACCGGCTCGATGCGCCAGGCGGCGCAGGCAACGTGTGCGGCAAAACAGACGGAAACGCTCATCGGGAAGGTGAAAGCCTTCGGTTGCCGGTAGTTCAGTGACTTCGCCATATGGCTTGACGTCTACGGAGTGTCTTCAGAAATGCTTGCGTTCTCCACCAATCGAACGATAACAATGAGCGGGCCTGCCTTGCCGGATTCGTCGCTGGGTATGCCTGCGCTCCAACTCGAAAGCATAGCCGGCGAAGAGGGATTGTCGGAGATCTTCGTGTACACGTTGAGGTGCCGCACGCCGATCGAACTACCCGACGAGGAGGCGGACAATCTCGACCTGAAATCGATGATCGGCAAAGAACTGACGGTCACGATCGAGCTCGACGGGATGGGCACGCCGATGGCCGGAATGCAGGGAATGGCGGGCGCAGCCAATATCGGAGCGGGTGAGCGCGAGATCAGCGGGATCGTGACGGAAGCGCGCTACGTAGACCGGTCCGACCGGCAGAGCAGCTATGTGCTCGTCATGAAACCGTGGATCTGGCTTGCCGACCAGCGATCGGATTTCCGCATATTCCAGCGCAAGACGGTCATCGACATCATCGAGGCCGTATTCGACAACTACCTGTACTCGTATGACTTGCGGCTGAGCGGTTCGTATCCGGTGCTGGACTACCAGGTTCAGTACGGCGAAACGGATTTTTATTTTGTCCAGCGCCTGATGGCCGAACACGGTATCTACTGGTTTTTCGAACACTCGAACACCTTCCATCGGATGGTGCTGGTGGACCACCTCGGCGCGCACAAACCGGTGGAGAGCGTCGCCTATCAGACTCTTTGGTATTTCCCGCCGGGGCACAGGATCGATCAGGAACATATCACCGAGTTCGACATGGGCGGCGAGCTTCAGTCCGGCCGTTGGACGACAAATGACTACGATTTCAAGAATCCGAATGCGCACCTCGTCAAACAGAATGAATTGCCGCAGGAGACGGCGCACAACGATTTCGAGCGATACGAATGGCCGGGCGACTATACGGATCCGTCGCACGGAGAGCATTTCGCGCGCGTCCGTATGGAGGAGGTTCGCGCGCGCGGCGAGCGGGCGTCCGGCAGCGGCAACGTGCGCAATGTGGTGTGCGGGACCACGTTCGAGCTGGAGGGCTATCCGCATCAGGCCGCCAACCAGGAATACCTGGTGATCAACACGTGGTTGTCGGCAACGGAAACCGGTGAGGCGAGCGGTTCGGGAGACTACTCGATTAGCAGTTCGTTTGTCGTGCAACCCGCGACGACGGTTTTTCGTCCGTCGCGCTCGCGGTATCAGAAGCCGCGTACAAGCGGACCGCAAACGGCGATCGTCACCGGGCCGGCGGGGCAGGAGATATGGACCGACCAGTATGGTCGCGTCAAGTTGAGTTTTCACTGGGACCGCTCGGGCGTGAAGGACCAGAACTCTTCGTGCTGGGTGCGCGTGTCATATCCATGGGCGGGCGGCGGATTCGGCGGAGTCAATATTCCGCGCGTCGGCACCGAGGTGATCGTGGACTTCGAGAACGGCGACCCGGACAGGCCAATCGTGGTCGGTCGGCTTTATAACGCGATGACGATGCCGCCCTGGACATTGCCGGGGAATGCGACGCAAAGCGGCCTCATTTCACGGTCGATGAAGGGAGGCTCAAATAACGCGAACGCAATCCGCTTTGAGGATAAGCAGGGTGCGGAAGAGTTGTGGCTGCAGGCAGAGAGGAACATGCGGACCGAGGTCAAACATGATGAAACGCACAGCGTCACTAACGACAGGAAGCAACGCGTGGGGCGCGACGAGGTGGTGAACATCGGTCACGATCATGTACACATCTCGGGTAACAACAAAACGGTGGGCGTAGGCGCTGCATTTTCGACTGTCGTGGGTGAGATGCCTGACCCGCGTCAGGCGCCTTTGCCTCCTGGTACCTACGTGCTGGACGTAAAGGAATCGATACTGATTCGCTGTGGCGATTCAAGCATCTTCATGTCGAAGGAAGGCATGATCGAAGTGAAGGGCAAGCAGATCAGCGAAGAGGCTGCGGACTATTTTTTGATGAAGGGCGGGAAGATCGACCTGAATCCATGATTTCAGTTTTCGACGTAATTCACAGCCCATGCTTCTGATCGAACGGAACCAATGAAATATTTCACCCACGAGTGCAGTTTCGATATCCCCAATCTTGCGGAAGACCGCACGGTCAACGCGCTCACTTTGCATTGCCCGATTACAGGTTCGCCTTTCCAGGTCGTGGTTAGCCGGGACGAACTGATCGGCGGAGAAGATCTGGTGGTTTGCATCAAGCGCCAGGTACGCATGATGACGCGTCATGTATCCAACTTTCGCGAACTTGCGCGACGCGAGATATTCCGGAATTCAAATGGACTGCAGGCGCTCGAGATTGAAAGTGCATTTCGGCAAGCCAATTCGAACTATTACCAGGTACAGGCGGTGATCATTACGCGGCCGCCATCACTTCTGGTGCTGACGCTGTCAAATCAAACGCCACTGCGCGAACCGCATCGCGAAACATGGGCAACGATCCTTGACACGTATCGGCTGAGGGAAGTTATTCAACAGGGCTGACGATACTGGTCATTGCCGAAGTATTCCATTGGGTTCCTAACGGATAAAGAAAAGGATGGGACTTCTTGCTGCTGCTCGTATCCACGACACGTTCACACACACTTCTCTGTTCGCTCAGATACTGAAAGTGGCCGCGAGCGTCGGCGCAGGGTTGCTCGTCGGCCTCGCTGTCGGGGCGGCGGCGGCGCTCATTGTCGGGACTGGAGGGTTGGGTGCGATTGTTCTGGGAGCCGTTGTCGGTGCGGTGATCGGCGTCATGGCGGACGCCGCGACTTCGGCCTTTACCGGGAAGGGTTCTCTCGAGCAATATTTGTCAGATATGGCAAACGAGGTGATCGACAGTTTCATACCGGGTAAAGTCGAGGGAGCGATCGCAACAGGGTCTCAGGATGTTCTTATCAACGGTCAGAGGGCGGCGCGCGCAGCCGGTGTCATGGCTCCACCGCTTTCTCCAGGCGTTGAGCCGCAATCTGTCGCCGACATATTCACGGCCACCGACCTTGATTTCGTGGGTTGCTCGAACCATCCCTCACCGCACGGTGAACATATGGCAGAGGGCTCGTCGAGCGTTTTTATCAACGGCCACCCCGCCTCGCGCATCAAGGACAAGACAACATGTGATGGTTCTGTGAACACTGGGTCGCCGAACGTCAGCATCGGCGGTGAAACCGTGGCGGTTCGCGAGATCAGGTCCGAGATGCCGCCGTGGCTGGCCACGGTGGCAAAGTATGCAGGCCTCGCGATCGCATTGTGTCAGGCCGTACGAGGTAAGGGGCCGCTGGCCAGTAAGGTCGCCTGTTTCGCGATGAACTTCGCAGTTAACGCGGTTGCCGGAGAGGTAGTGAACGCGGCAGCGAGGCGAATGACTTCCGGTCGTGTCGGGCATCCGGTGCATTTGCCAACGGGAGCGAAGTTAATCGACGGAGACGAAGACCTCGATTTCACGCTGGATGCACCCATTCCGGTGATATGGCAACGTTTCTACAGCAGCCTGGACCCACGCAGCGATGGGCCGCTCGGGAAAGGCTGGAGTTTGCCCTACAGCGTGGAATTGAACCTGTGGGTAAGCGACGGCACGCATCCACACGAATGGATCGACGCGCAAGGCAGGCGTACGCGTTTGCCGCATTTGCGTCGAGGCGAAAAGACATATAGCCGAATCGAGGGGATGACCTTTGCCTGCACGTCGGGTGGCCACTGGATGGTTGAGCACGACGATGGTCTTTGCATGGATTTTGGTCAGGCCACGGCTGAGGAGACGAAGCAGTTTCTTCGGCCTTGGGTTATTGAAGACCGCAATTCGAATCGACTTTATTTGCGTCACGATGATGAGCGCCGGCTTGTTGGCCTCGCGACGATGTCGGGACACACGATCGTTCTCGCATACGATGCGATACACCGGCGGCGCGTTTCAGAGGTGACGCTGCATATGGAAGGCGAGGCGCCCATATGCCTCGCGCGCTACGCTTATTCGGAAGCGGGCCAGCTTGCCGCAGTTCGCAATGCAGCGGAGCAGATTACGCGCGAATACGGCTACGACGCCGACGGACGAATGATCATGCATCGCTTGCCCGGAGGACTTGCGGCGTTCTATCGGTGGGAGCAGTTCGAGAGAACCGGGTTGGCACCGGAAAGCGGGCCAGAGGTCGGCGAGGCGCGAATTGTCGAGCACTGGAGCGATGCAGGAGATCGATACCTGATCGCTTACGATTTCGAGGCTCAAACAACCTTCGTGCGTGATCATTTGGGACGGGAGACCCGTTGCGTCTGGAACGAGGCATACCTGGTGACAGCACACACCGATGCCTTGGGTCATACGTGGCGATTCGGGTGGAGTGGTTCGCGGGAACTGTTGTCGATGACGGATCCGATGGGCGCGGCGACGCGAATCGAGTATGACGATGAACGGGGTCTTCCCACTCGCGTGATCAACGCCATTGGGGAGGTGACTGAGATCGCATGGCATCCGCGATGGGCTGAGCCCGTGTGCCTTACCTCGGCGGACGGCAGCCGATGGCAATACGAATACGATCGGTTAGGCAATCGGGTGGCGCAGATCGATCCGCTGAAGCAGCGGACGGAGTGGTGCCTGAACCGTGCGGGACAGCCGGTAGCCCGGATCGATGCGCGTGGCGGAATGAGCCACTTCGGTTGGGATCGTTTACATCGCCTGATGGCAAGCACGGATTGTTCAGGGAATACGACGCGGTTGACGCACGATGGCAGGGGACGGATGGTGGAAATCCGCGACGCGTTGGGCCACGGGTCGTATTACGGTTACGATGCGTTGGACCGTATTGTGCGAGTAAGGCATCCAGACGGTAGCGAAGAATGCTATGCGTGGCAGCTTGCCGGCCTTGTCTCGCATAGGGACGGTGGCGGGCGGGAAACCCGATTTGCTTACGACAAGCTAGGGCGTGTCGTAAGCCGGCGCGACGGCAATGGTCATGAGGTGAACCTGCAGTATGACGCAGCGGGTAACCTGAGCCGGCTTATCAACGCCAGTGGCAAGCAATATCGTTTCGGCTGGGATGCCGCAGACCGGCTGATGGAACAGGCAGGTGTCGACGACGTCTTGACGACATATCAGCGCGATGCACTTGGCTACCCAAGCGCGGTCTGTCAGGGGGCAGGGACAGCAGAGCAGGTGACTGTGCGGTTTGAGCGTGATCGGCTAGGGCGCCTGACGCTCAAGACCACACCTCTGACGGAGACGTCCTATCGCTATGATGCCGGCGGCTACCTGATAGAGATCGCTCGCCGCGATGCACAGGACAAGCCAATAGATCGTATCGAGTTGAAGCATGACGCGCTGGGCCGCCTGCTTAGTGAGTGTACCGAGTGGTCAGGGGAACCGGCGCGGTGCTCGAAGTTGGAACATGGCTACGATGAACTCGGCAACCGTACCTCGCTGCAGCTTGCTGACGGGCATCGGCTTGAGTGGATGTATTACGGGAGTGGGCACTTGCATCGTGCGACCCATGATGGCGAGGTTGTCAGCGACTTTGAGCGCGACGCATTGCATCGGGAGGTGGCACGCACGCAGGGGGCACTAACGCTTCGGACGGCGTGGGACAGTCGCGGGCGACGCGTGGGTCGCTGGACTGGCGGCCGAAGGCAAGCGGGTGAGTGGGGGCGTTTCGCCAGGGGAGAGGACTCGCTTGCAAAGGCATTTGCGTATGACGCAAGCGGGGAATTGATCAAGCGACTTGATCCTATAGCGGGAGAGTTGCGCTTCGCCTACGACCGAAGCGGGCAACTGTTGCGATGCGAGGGCGTGCCGCAGGTTGGCGTATTGAATGAGCAGTTTGTGTACGATGCGGCCGCCAATTTGTTGGCGGCCGGGCAGGTGGGGCGTATCGAGGGTAACCGGCTGGTCATGTCTGGGGATAGCCGGTTCCGATATGACGGACACGGCAGGCTGGTCGAGAAACTAAAGGGTTCGCATACACGGCAAAAGCTCTTCTGGAATGCAGAGCATCAGCTCGAAGCGGTGGTGACTGTTCGACATGGTGTGGAGCAGCGGGTGCGCTACTGCTACGACGCATTGGGCCGGCGCGTATCGAAAGAAGATGCGTTTGGGGAAACGCGTTTTCTGTGGGACGGGTTGAGGTTGTTGCAGGAGGCGCGTGGGTACCGTGAAGCGACGTATCTGTATGAAGGCGGGAGCTATGAGCCATTGGCGCGCATCGATTCGGTACGTGAGGATGGTCAGTGGTTGTGGCGTGCGTATGAGCAAGGGGAGGGAAGGCAGACGGAGGCCACGGCGCGCGCGCCAGCGCAGATCTATCATTTCCACACGAATTCGAGTGGGGCGCCGGAGGAACTGACCGACAGGCATGGGCGGGTGGTGTGGCGAACGCGGTATCGCGCCTGGGGCAATGTGGTGCTGCAGGAGTATGGGCGGGAGTTTGAGCCGGGGCGACGAGGCGAGGTCGAAAAGCCGCTGCCGCAGTCGTTGCGGTTGCAAGGGCAGTACGAAGACGCTGAGACGGGACTGCACTACAATCTTTTCAGATATTACGATCCGGATGTCGGGCGGTTTGTTAGTCAGGATCCGATTGGGCTTCTGGGCGGATTTAATCTGTATCAATACGCGCCCAACCCGGTGCAGTGGATTGATCCGCTGGGGTTGTCTAGTGTCTGCTGCTGCAAAGGCTCGTACGGCGGAGCAAAGCAAGCTTCGCAATACCTCAAGGACGCGGGTGTTCCGAGAGCGCGGAGAAAGGAGATCATGGAGTCGTTCGATGTCGGAACGATCGATATGCGTAACGCTGGAGCCAGCGAGTACGGCCTTCGCTACTACGATGAAAGCAAAGCGTACGCAAAAGGCCGGTATTTGTTCGAGACATTCCCAGCAAGCCGTGAATCGTTGGCGGTCAAGCCAGAGTGGAACGATATGACCAAGATTCAACAATGGAATGTCCGGGAAGGTGAACCTATGATCGAAGGCCGGGCTTCAGCACAGGGTCCGTGCCTACCAGGTGGACAGGTTCAGAAATTCATCCTCAACTTGGACGCGCTTAGTAAACCATGACCAACAAAAAGGATGCGCTCTTGGCAGAGCTAAAGGCTGTTATTGCCGAGGTGTCGCAACTCACCGATAGAGACGGCGTATCGGCAGAGCAACGGGGATACGTGCTCACTGTGCTTGGCGGCTATGTGGACTGCCTCGAACATGACCCGCCGACTGCGACCGATTGCAATCGAGTTGCGCGCATGGTGGTCGAGTACTGGCCTATGAATTTAACGGCTGGTGAACGTGTAGTCAGCGTCGAACAGAAGTTGCGCGGGTTCTACGGCCCAAAGAAGTGAATCTCTGGCGGCTCCGTTGCGGGGCCTTTGTTTATTCAAACGGTCGTGATCGCCAGCGGCGAGAAAATCGCATCCAGGGAAAAATTCCGTGCAGTGAAGTGCGAGCAGGCTGGAAGCCTTGATGGACAGGGGATTGCGGCGTATACGCGGAGTGGGTGCGTCGGGATAAAAAGTTTATCAATACGCGCCTAACCCGATCAGCTGGATTGATCCGCTTGGGTTGATGTGTGGAGTAACCGCAAAGCAGGCGGGCAAGGAACTGGATGCCGGTGCAAAGAAAGTTACTGTATCGTCGCGCTCGGAAGCTGAGAAATTATTTATGGAGCGCTATCTCGGTCACGATTATAAGAACATGACTGGGGAGAGCGGACCATCGACGAAAAACGTAATGGACTACCTAACTGGTAAATCTACGAAGAATGGTACATACCACTGGGATGATGTTATGGACCCGCTGAATCCAGGGCGAGTGATGGGGCATGGCCCAGAGAATGTGGATGGAGCTTTACCACATTTGCAAATTCATCAATTAGATGGTGATGTTATTCACGTCTTTTTTCCTTGGGGAAGCTAATGAATATCGACTCTAAAAAGTTTAAAAAGCTTTTGCAAGGGCATATTCTTGTTAAGGAACTAGTGTCGCAAAGTTCTTTAGAAAGGACATGGGTAGAAGTCAAATTGAGCGAAAATGATCGACTGGCTTATCCGTTGAGGACGCCACCGAATGCCATGCTTGGCGGTTCGGTATATGAGCAAGCTTGTTCGCCTGAAAAGGCATCGTTCAAATTGCGATATGCAACGTTCTCAGCGAGTGACATCGCTAATGGATTCGATCCTAATTACGATAAAGTGGGGCCGTATATTGATATTCCGTCTACCTCCGCATTAGAAGATTATTTTGTAAGGGAAAATTTGGATTTGGAGGGGTTTGTTGATTCTGGATTGACAGATTATCCGATGTAGTAGACAAACAAAGGATAATACCCCCGTCCGGTAAGCGGGATCAGAAGTAGAATTTCCAGCAGAGGGAGTTCTACGCAATGAAGAAGTCGAAGTTCACGGATAGTCAGATCATGGATGCGCTCAAGCGCGCAGAAGCTGGCGTGGCGGTGCCGGAGATTTGCCGGGAGTTGATCGACACGATTGCGCTGCTGCATCAGTGTCAGCGGGAAGTGAAGACGGTGCATCACGCGGGCGAGACGATGGAATACATCGAGGTGACGCTGGCCGATATTGAAGCGGCGAACCGGATCGTGCATGAAGTGCTCGGACGCTCGCTCGATGAGTTGCCGCCGGTGACGCGCAAAGTATTGGAAGCGATTGTTGAGGCGGTGAAGGCGAAGCCATTGCCGCGCGAATCGGTGCGTTTCAGTCGTCGGGAAGTGCGCGCGTGGACGGGACAGAGTGACACGCAGGCGCGCGCGCACCTTGATCGACTGGTGGAGCTTGAATATCTGCTCACGCATCGCGGCAAGCGCGGCCAGAGCTTCGAATACGAGTTGATCTATGACGGAGACGGCAGCGATTCCACGCACCTTGCCGGGCTGATCGACACGGCTACGATTCAAAGTTCGCGGGGTAAAAATGGTCAGTTCGCGGGGGCAACGCGGCCCCAAAACGCCCCCATATCGGCCCCATCGCGGGCTGCGCCAATTCCCGCTGATGCCGATGAAATAAGCGCGATGGCGGAATCGGATGAAGACGCGCGAGAAATGCACTTCTGCCCGGATAAAAATCCGGGCTTCGTCGTACCGTACCCGCAGGAGCACGTCTGATGGCTTACACGCCGGTGCGTCCGAACAAGCGGGTGAAGGGCCGCAAGACGGCGGCCTATAGTCGCCGCAAATCGCGGTTGACAGTAGAGGCAGACGTGCTGCCGGATTGGCAGGCGTATCTGCACGCGCACGCGCAATGGCTGCGGATGATGGCGTATGCAGAAAGCACCGTCACGACGGGTCATCGTGCGTTGGTGGACTTCGTGCGATGGTGTGGGGTGCGTGCGCTCGATGGCCCGCAGCAGTTGACAGTAAAAATCGTGGAGCAGTATCAGCGTTCGCTGTATCTGTATCGCAAGGCCAACGGCGAGCCGTTGTCGGTGAAGGGTCAGGTGGTGCGTTTGCAGACGCTCAGGCGCTTCGGGCGGTGGCTGGTGCGCGAAGGTCATCTGCCGTTCAATCCGCATCCGAGCTGGTGATGCCGCGTATCGGGCAACGCATCCCGCGCTCGATCCTGTCGGTGACGGAGATCGAGGCGATGATGGCGCAGGCGGATGTGAGCACGCCGATTGGATTGCGTGATCGCACGATCATCGAGGTGTTCTACTCGACGGGTTTAAGGCGCTCGGAGATGGCGCACTTGCAGACATGGGACATCGAGCATGCGCGACGTCTGGTGCTGGTTCGCGAGGGCAAAGGCCGGCGTGACCGGGTCGTGCCGATTGGCGTGCGCGCGTTGCAGTGGCTTGATCGATATCTGCTTGAGGCGCGCGAAGCGCTCACCACGGGCGGGCATCATGCGATGTGGGTGACGGACTTCGGCGAGCCGATGGAGGCTTCGTATCTGGCGCGTATCGTGAAGCGGATGATGGAGGCGACGGGAATCCGCAAGGCGGGATCGTGTCACCTGTTGCGACACGCGATGGCCACGCACATGCTGGACAACGGGGCGGATACGCGCTTCATCCAGGCGATGCTTGGACACGCGTCGCTGGCAACGACGCAGATCTACACGATGGTGTCGGTGGAGAAGTTGAAGCAGATTTATGCGGTGACGCATCCGGCGCGGGGCGATGGGGTGATGCGCGAGCCGGATTACGTGGATGACGCAAAGGCACCCATGCGCAAGCTGCGGCCGCTTCTCTCTTAGCCGCTATTGAGGCGGAAGCGGACAACGAAGGCGAGCCAGAAAGTGAATGACGAAACACCGGCGCTCTTGCCCGCGTGATCGAGCGTTGAGAGGTAGAAGAAAGTGCGGCCCGTTGGGCCGGAACGCACGAAGGGCCGGGGGTAAATGCGCTGCGCGCATTTAACATAACGCAAAAGCCAATTAGCGCATTCCGGCGTGTGGGCCAAAGCCTCGAGGAACGGCTCGCCGCTGGCCCACACGCCGGAATGCGCTAATCGGCGTTATGTCCGGGGCCCCCGGCCCGAGGGTGTGCTTCGCAAGAAATAGAACAGTGGCCGCGAAGCGGCCATGAACTGGAAGAACGAATAAAAGCAAATCCAGTGCGCTAATGCGCAGTCGTCCAGCGCCGCGTTCGCGGCGGGATCAATGAAAAGTCAAAGTCAAAAGCGAGCGGCTAAGAGGTGGCAGCGATGCGCCTGCGCGGGTGTGGAGCGTCGGTGCGCGTGTGCCCTGTTCCGCCGCGTCTGCCGTCTGTCGTCGGCCTGCGGCCGCCGACAGCCGTCAGTCACGGCGGGATGATGGAGAGCAAGGGCTATACTCGCGCGCATGAAACGATTTGGACTACGATTGGCCGCGTCTGGTGCGGGCGAACTGCTTCGCTGGCTTTGTGCGCGCTCGCGTTGTTATGTGTGTTCGCTGGCAACGGACTGTTGCCGATGGTGGCGCAGGATGCGCGTCTGGGGAATTTTTTTAGTTCGGCGGCAGGCGTCGGGCTGGAAGGCTTGCCAGATAAGGAGTTGCGGCTCGCGCGCGGGGCAGATGCGTCGAAGGAAAAAGTTTATCAATATGCGCCGAATCCGATACAGTGGATTGACCCGTTAGGGTTGCTTGTTCGTCAGCAGCAGGAGCTCTTCAGAGATTGAAGGGTATGAGCGTCGCTCAGATCGAAAAGGCCATAGCGAAGGACGGGCTCACGAAGACCAAGGACAATGGGAAAAACGCGACGTGGAATCACGCGGATGGCTCGGAGGTCAGAATCCACAAATACGGCAATCAAAACCCGAGTGGATACAAGTCAGGGAATAATGCTCACGTTCATAAGCAAGACCCATGTAAAAACCGGTTGAGCGACCGAGGAATAACGACGATGGACGGTGACGAAATGCACATTGGAATTCGAAACCCCGCCGATCTTCCTGCTGTTCGCGGACGCCCTCATGGTGACGGAAGCTGATTATGAGTTACGAATATTCAATCAGATTGGATAGTGGTAAGCACAGCGCCGACAACGTTATGAGTGAGTTGAGGCGAGGCCCGATCATCGTGTATGAAAAGGACAATTGCATCGCGCTCAAAGATCCGAAATCTCTCAATACATGGGCGTACGATCTGAGAGTGTTCAAGGTAGGCGAGAATGAGTTGCTTCTGGAAATCACGAATGCAACCAATGACCTATATGAGTCCATTCGATCCGCATTACCTTCAGGCTACTCAATCACTGAGGTAGAGGACGAGGACGAACTCTCTCTGGCACAGATATTCCGGTTGATCTGAGACGAAGGGCCGCTGACGCGGCCGTTCGTGTTTCTACTCGGCGGTAATGATCAGCCGCCCCTGTTCAACGTTGATCCTTACGCGCTGCCCTGCTTCAAATCCTGCCTGCTCGATCCATCGTCCGGTAAGCTTGAGCCACGGAAAACACGGCGGTTCGGATCGCTGCCAGTATGGTTTGTTCGAGTGGCTCTCGCGCTGCGACTGCTGAACGGTTACGAAGCGTTCCGTAACGTGTGGGCGTGCTTCAAGATTGGCATCAGCCATGATCAACCCCCTTGATGAGTTGGTCAGTGGTCGTGGCGTGCGTATGAGCAAGGGGAAGGAAGGCAGACGGAGGCCACGGCGCGCGCGCCAGCGCAGATCTATCACTTCCACACGAATTCGAGTGGGGCACCGGAGGAACTGACCGACAGGCATGGGCGGGTGGTGTGGCGAACGCGGTATCGCGCCTGGGGCAATGTGGTACTGCAGGAGTATGCGGGGGAGTTTGAGCCGGGGCGGCGAGGCGAGGCCGAAAAGCCGCTGCCGCAGTCGTTGCGGTTGCAAGGGCAGTACGAAGATGCCGAGACGGGACTGCACTACAATCTTTTCAGATACTACGATCCGGATGTCGGGCGGTTTGTTAGTCAGGATCCGATTGGGCTTCTGGGCGGATTTAATCTGTATGCGTATGCGCCGAATCCGGTCAGCTGGATTGATCCGTGGGGGTTGAGTTGTGGATCGACCGGTAAATTCGAATCAAGAAATGCTGCATTCAGAGCAGCCAGGAGGGATGCTCAGATACCAGTGAGCCAGAAGCCAGAGATGGTTTTCAATCAAAAAAATCAAAGAATGCAGCAGTACGATCAAGTTATGATGACGGATAGAAATGGTAATCCTGTGTTGAATCCATCAACAAAGCAGCCTGTTTGGACAAGGGAGTACCATTACACAAATTCTGATGGTTCCAAGATTGTCATTCAGGACCATTCTGTCGGGCACGAATTCGGGCAAGGTGGAGTTGGCGATCAGGGGCCTCATCTAAATGTAAGGCCATACGATAATACTAGAACAGGTAGTGTCCCGGGAACCTTGGATCATTACGGTTTTTAATGATGGGTGGCATACGATGTATTGGACTGATTTGCCCTGCAATGAGTTGATGCGGCGTGTTTTCTCTACGCCGCCAGAGGTTGGTTATATTGATCTCTTTGACATAGAAATGAAAAGGGATGGCCCAACCATAACCATTAATTTTGATATAATTGACACTTTGCCTGACAAGCCTCCGGAAAAATGGGGAAAGGACTTTAATCGTTGCCGGATTGGACTTTATTGTGCTGGCGTGACGGGTGTCTCTATTTCAGGGATATTGACAAGTATGTCTGCAAAGATAGAATTTTATCTGAGAGAAGGTAATAATAGGGTAATAATCAATGGCAATGACTTTAGTATAGATTTTATGTGCCAGCACATCCATCTTACCGGACCATCGGTATATCTTAGTAAGTAAGGAGTATCTGGTTTGTAGTTGTATCAAGGGCCGCGTGTGCGGCCCTTGATACGTTCGGTCAGCCAAGGTCCGGCATGATGATCAGTTGTTATAGCGGTAATCGATGCTGAAGCTGACCCGCTCACCGGGTTTGAAGCCGGCGGTGGCGAGCCAACGGTCGGCGATTTCCATCCATGGAAAATACGGTGTGTCGTCTGGCTGCGTAGGACCGGGATCAGGGGTGCGTCGCAGGATGCCTCTGCTGTCACCTTCCGGCATAAAGGAGCCGGGTATTTCCGGCATATAGGCGCCAGCAGAAGTGGGTAGAAGCGAGCATCTTGGTGGGTTCAAGAATGGACGGTTTGGCTGCCGTTTGCAAGGTTCAAGTGGTGCTCCGGAATATGTCGTGGGCTTCGATAGGAATGACCGTCGAGCACCAGGCAGTAAGCGTTGTGGCGCAGGCGGTCGAGCGTGGCCGAGGCGAGCAAACGATTGGCCGGGAACGCTTGATCCCATTCGGTCAGATCGAGGTTGCTCGTGACGATGGTGGACGCCGCCTCGTAACGTTCGGCGATCAGTTCATGCAGGTCCTCGTCGGCGGGCGGACTTCGGGTACCTGCAGATCGGCTCCAACGGCGGCAACCTGTTCTTCCAGCTCGTCAATGCCTGCTACGAACGCTGCGCGATCATCCTGACGTCCAACCGCAGCTTCGGCGAGTGGGGTGATGTGTTCGGCGATAGCGTCGTCGCCGCGGCGTTGCTCGACCGGCTGCTGCACCACGCGATCGTCGTGCAGATCGAGGGCTCCTCATATCGCCTGCGCGAACACGCCGATCTGTTGCCTGAACATCTTCGCAACCGAACCTCTTCCCTGAATCCGGCACCTGCCGAACCGCCACGCCGGCGTCCCGGCCGCCCCCGAAGGAGCTCACCTGATCACGTCGACGGCTGATCACCACAAACACACAGGGTGGGGAATTTTACTTCGACACTTCGGGGGTGCGGCGGAATTCTTGGACTATTTGGGAGCTAGTCCATGTACTCGTACGAAGACCGCATCCGAGCGGTCAAGCTCTACCTGAAGCTTGGAAAGCGGCTTACAGCGACCGTTCGTCAATTAGGCTATCCGACAACGAAGTCTCTCGAACGTTGGTATCACACGTACGAGCGATGTCTGGATTTGCCGAAGGAACGTATTTGCCTGAGGCCGCGCTATTCGGAAGAACAAAAGAAGGCGGCCACAGACCACTATATGAGCCATGGCCAGTGTCTGGCTGCAACGACGAAAGCGTTGGGGTATCCGGGACGCGGGACGCTTGTCGCCTGGCTTGACGAGTTGCATCCCGAGCGAACGAAACGCGTTCTCGGGAAGGCTTCGGGTATTCGGCATACACCAGAGCGCAGGCGGACAGCGGTCATCGATCTGTGCACTCGACGCGAAAGCGCAGAGGAAGTTGCTAAAAAGATTGGCGTGAGCCGCCCAACGCTGTACAACTGGAGAAATCGGTTACTCGGTCAGGAGGTTGCTTCAATCATGATACGCCGCAACGATCCGCCGGAGAGCTCCGAGAAAGCGACGCTGGAGCAGCAGGTCGAATCACTCCGACGAGACATCCGGAAGCTCCAGATCGAACACGACATCCTGAAGAAGGCCAATGAAATAATAAAAAAAGGCCTGGGCGTCGACCCGCAACTCCTGACGAATCGGGAGAAGACAATGCTGGTTGATGCCCTGAAACAGACCTACACGCTGCCGGAGCTTTTTGCGGAATTGGGGCTCGCCCGTAGCTCCTACTTCTATCACCGGGCACGGATACAGACTGCCGACAAGTACGCTCGTGTACGCCTTGCCATCGCAGATATCTTCGAGCTCAATCACCGCTGCTATGGATATCGCCGGGTGCGCGCGGCACTTGGCAGGCAGAAAGTCTTCATCTCGGAGAAGGTCGTTCGGCGCCTGATGAAGCAGGAGCAGCTAAGCGCGGCCACAACGAGGCGACGACGATACGGCTCCTACGCTGGAGAAATAGGTCCGGCTCCCGAGAACCTTATCAACCGCGATTTCCGGGCAGCAGCACCGAATGAAAAGTGGCTCACAGACATCACGGAGTTCCAGATTCCTGCCGGTAAAGTCTATCTGGCGCCGGTTATTGATTGCTTCGATGGGCTTGTGATCAGTTGGTCGATCAGTACGCGACCAGACGCCGAGCTTGTGAACACGATGCTCGATGCCGCCATCGACGCAATCGGTAGTAGCAGTAGCCGACCTGTTGTCCATTCAGATCGCGGTGCGCATTATCGCTGGCCAGGCTGGCTTACCCGGATACGTGAGGCCAATCTGATTCGCTCAATGTCGCGCAAGGGCTGCTCGCCCGATAACGCGGCGTGCGAAGGCTTCTTTGGACGACTGAAAACGGAACTGTTTTACTCCCGGGACTGGCAGACCGTCAGCACCGACCAGTTCATTGAGGTCGTCGACTCGTACATTCGCTGGTACAACGAGAAACGGATCAAGATCTCGCTTGGCGCACTCAGTCCGATCGAATACCGGGAGAGCCTCGGGCTCACGACATAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP018812|1399987:1431945|1401943_1402426_-|WP_075160055.1|DBSCAN-SWA MAQDIFIKINGIDGESQDSAHKNEIEVSSWGWQILQQSSMHAGSGGGAGKATVEDLAFEHLVDRASPNLMKYCLTGKHIDQAVLTVRKAGGNPLEYLKITMNDVIVTQVHPSGNNADNGIREHVRLSFAKVKQEYVVQNAQGGSGGAVTAGYDIKLNKEA >NZ_CP018812|1399987:1431945|1418239_1418686_+|WP_075160069.1|DBSCAN-SWA MKYFTHECSFDIPNLAEDRTVNALTLHCPITGSPFQVVVSRDELIGGEDLVVCIKRQVRMMTRHVSNFRELARREIFRNSNGLQALEIESAFRQANSNYYQVQAVIITRPPSLLVLTLSNQTPLREPHRETWATILDTYRLREVIQQG >NZ_CP018812|1399987:1431945|1423219_1423501_+|WP_075160071.1|DBSCAN-SWA MTNKKDALLAELKAVIAEVSQLTDRDGVSAEQRGYVLTVLGGYVDCLEHDPPTATDCNRVARMVVEYWPMNLTAGERVVSVEQKLRGFYGPKK >NZ_CP018812|1399987:1431945|1403127_1403490_-|WP_075160057.1|DBSCAN-SWA MPWTYEQRTGTLTNPEGRVVASDGYSGAGQGRNNPAMEREPNVGPIPRGSYQIRGARHSVRTGPVSMDLSPNVGTSTFGRSAFLIHGDNSSHTASHGCIILRRDVREDINRSTDRELVVQ >NZ_CP018812|1399987:1431945|1429007_1429391_+|WP_075160079.1|DBSCAN-SWA MYWTDLPCNELMRRVFSTPPEVGYIDLFDIEMKRDGPTITINFDIIDTLPDKPPEKWGKDFNRCRIGLYCAGVTGVSISGILTSMSAKIEFYLREGNNRVIINGNDFSIDFMCQHIHLTGPSVYLSK >NZ_CP018812|1399987:1431945|1402621_1403131_-|WP_075160056.1|DBSCAN-SWA MNSRHLLCALAIAALAIPASARQLTPGQVAQDIRAHGARATVRSLDQTGRLDSVLNSIASGNAAWIRLSPDLAKGTDAGNSTGLTIALARALPKNPKAVLTILDDGLVIGSNAVCSAPFIEPTARELKEYLDRAIPAVTRVAESDRYPRRSACLHALQRSAELTAAPSS >NZ_CP018812|1399987:1431945|1426806_1427268_+|WP_156883988.1|DBSCAN-SWA MAAMRLRGCGASVRVCPVPPRLPSVVGLRPPTAVSHGGMMESKGYTRAHETIWTTIGRVWCGRTASLALCALALLCVFAGNGLLPMVAQDARLGNFFSSAAGVGLEGLPDKELRLARGADASKEKVYQYAPNPIQWIDPLGLLVRQQQELFRD >NZ_CP018812|1399987:1431945|1430406_1431945_+|WP_075157113.1|transposase|DBSCAN-SWA MYSYEDRIRAVKLYLKLGKRLTATVRQLGYPTTKSLERWYHTYERCLDLPKERICLRPRYSEEQKKAATDHYMSHGQCLAATTKALGYPGRGTLVAWLDELHPERTKRVLGKASGIRHTPERRRTAVIDLCTRRESAEEVAKKIGVSRPTLYNWRNRLLGQEVASIMIRRNDPPESSEKATLEQQVESLRRDIRKLQIEHDILKKANEIIKKGLGVDPQLLTNREKTMLVDALKQTYTLPELFAELGLARSSYFYHRARIQTADKYARVRLAIADIFELNHRCYGYRRVRAALGRQKVFISEKVVRRLMKQEQLSAATTRRRRYGSYAGEIGPAPENLINRDFRAAAPNEKWLTDITEFQIPAGKVYLAPVIDCFDGLVISWSISTRPDAELVNTMLDAAIDAIGSSSSRPVVHSDRGAHYRWPGWLTRIREANLIRSMSRKGCSPDNAACEGFFGRLKTELFYSRDWQTVSTDQFIEVVDSYIRWYNEKRIKISLGALSPIEYRESLGLTT >NZ_CP018812|1399987:1431945|1428259_1428991_+|WP_075160078.1|DBSCAN-SWA MYHFHTNSSGAPEELTDRHGRVVWRTRYRAWGNVVLQEYAGEFEPGRRGEAEKPLPQSLRLQGQYEDAETGLHYNLFRYYDPDVGRFVSQDPIGLLGGFNLYAYAPNPVSWIDPWGLSCGSTGKFESRNAAFRAARRDAQIPVSQKPEMVFNQKNQRMQQYDQVMMTDRNGNPVLNPSTKQPVWTREYHYTNSDGSKIVIQDHSVGHEFGQGGVGDQGPHLNVRPYDNTRTGSVPGTLDHYGF >NZ_CP018812|1399987:1431945|1401343_1401856_-|WP_075160054.1|DBSCAN-SWA MRPLKPVIALVCALTLASCAGNGSEAAHEPVKLELSVKAASTVNPDDQKRAAPIVVRIYELKNADTFEEADFFSLQDKDKTVLTDDLVMRDQFQLRPGESKTIRRKAAEATTTLGVLAAYRDLPNSIWRATWTLPATPSAAWYRRTPKVQLTIDLDTSAIRITDARPQNK >NZ_CP018812|1399987:1431945|1415252_1415756_+|WP_075160067.1|DBSCAN-SWA MANEAKQMSAAGLAALRVREGTVFHYYNDMANNCTYGVGTLAHHGPCTDDEVRRPVTTADVNAQLSLAVRSAEAAVRRQVSARELSQDQFDALVSYTYNAGATGARPALEAANAGRDAQVVTHMNRNVNLYPRDAHGHRLHPVRSAGLVNRRREESAPFRLPEAPAR >NZ_CP018812|1399987:1431945|1424074_1424464_+|WP_156883987.1|DBSCAN-SWA MNIDSKKFKKLLQGHILVKELVSQSSLERTWVEVKLSENDRLAYPLRTPPNAMLGGSVYEQACSPEKASFKLRYATFSASDIANGFDPNYDKVGPYIDIPSTSALEDYFVRENLDLEGFVDSGLTDYPM >NZ_CP018812|1399987:1431945|1425412_1426366_+|WP_083615387.1|integrase|DBSCAN-SWA MAADDGVCRKHRHDGSSCVGGLRAMVWGACARWPAAVDSKNRGAVSAFAVSVSQGQRRAVVGEGSGGAFADAQALRAVAGARRSSAVQSASELVMPRIGQRIPRSILSVTEIEAMMAQADVSTPIGLRDRTIIEVFYSTGLRRSEMAHLQTWDIEHARRLVLVREGKGRRDRVVPIGVRALQWLDRYLLEAREALTTGGHHAMWVTDFGEPMEASYLARIVKRMMEATGIRKAGSCHLLRHAMATHMLDNGADTRFIQAMLGHASLATTQIYTMVSVEKLKQIYAVTHPARGDGVMREPDYVDDAKAPMRKLRPLLS >NZ_CP018812|1399987:1431945|1405172_1405718_-|WP_075160059.1|DBSCAN-SWA MSASNSSQKFIARNRAPRVQIEYDVEIYGSEKKVELPFVMGVLADLSGNPLESLPAVTDRRFLDIDIDNFDERMKAIKPRVAFAVPNTLTGEGQLMVDMTFESIEDFSPAAVASKVYSLRQLLDARTQLANLQTYMDGKSGAEALVNNLLADPALLKSLAAAPKPQAEVTTTESDRPASLN >NZ_CP018812|1399987:1431945|1418741_1423223_+|WP_075160070.1|DBSCAN-SWA MGLLAAARIHDTFTHTSLFAQILKVAASVGAGLLVGLAVGAAAALIVGTGGLGAIVLGAVVGAVIGVMADAATSAFTGKGSLEQYLSDMANEVIDSFIPGKVEGAIATGSQDVLINGQRAARAAGVMAPPLSPGVEPQSVADIFTATDLDFVGCSNHPSPHGEHMAEGSSSVFINGHPASRIKDKTTCDGSVNTGSPNVSIGGETVAVREIRSEMPPWLATVAKYAGLAIALCQAVRGKGPLASKVACFAMNFAVNAVAGEVVNAAARRMTSGRVGHPVHLPTGAKLIDGDEDLDFTLDAPIPVIWQRFYSSLDPRSDGPLGKGWSLPYSVELNLWVSDGTHPHEWIDAQGRRTRLPHLRRGEKTYSRIEGMTFACTSGGHWMVEHDDGLCMDFGQATAEETKQFLRPWVIEDRNSNRLYLRHDDERRLVGLATMSGHTIVLAYDAIHRRRVSEVTLHMEGEAPICLARYAYSEAGQLAAVRNAAEQITREYGYDADGRMIMHRLPGGLAAFYRWEQFERTGLAPESGPEVGEARIVEHWSDAGDRYLIAYDFEAQTTFVRDHLGRETRCVWNEAYLVTAHTDALGHTWRFGWSGSRELLSMTDPMGAATRIEYDDERGLPTRVINAIGEVTEIAWHPRWAEPVCLTSADGSRWQYEYDRLGNRVAQIDPLKQRTEWCLNRAGQPVARIDARGGMSHFGWDRLHRLMASTDCSGNTTRLTHDGRGRMVEIRDALGHGSYYGYDALDRIVRVRHPDGSEECYAWQLAGLVSHRDGGGRETRFAYDKLGRVVSRRDGNGHEVNLQYDAAGNLSRLINASGKQYRFGWDAADRLMEQAGVDDVLTTYQRDALGYPSAVCQGAGTAEQVTVRFERDRLGRLTLKTTPLTETSYRYDAGGYLIEIARRDAQDKPIDRIELKHDALGRLLSECTEWSGEPARCSKLEHGYDELGNRTSLQLADGHRLEWMYYGSGHLHRATHDGEVVSDFERDALHREVARTQGALTLRTAWDSRGRRVGRWTGGRRQAGEWGRFARGEDSLAKAFAYDASGELIKRLDPIAGELRFAYDRSGQLLRCEGVPQVGVLNEQFVYDAAANLLAAGQVGRIEGNRLVMSGDSRFRYDGHGRLVEKLKGSHTRQKLFWNAEHQLEAVVTVRHGVEQRVRYCYDALGRRVSKEDAFGETRFLWDGLRLLQEARGYREATYLYEGGSYEPLARIDSVREDGQWLWRAYEQGEGRQTEATARAPAQIYHFHTNSSGAPEELTDRHGRVVWRTRYRAWGNVVLQEYGREFEPGRRGEVEKPLPQSLRLQGQYEDAETGLHYNLFRYYDPDVGRFVSQDPIGLLGGFNLYQYAPNPVQWIDPLGLSSVCCCKGSYGGAKQASQYLKDAGVPRARRKEIMESFDVGTIDMRNAGASEYGLRYYDESKAYAKGRYLFETFPASRESLAVKPEWNDMTKIQQWNVREGEPMIEGRASAQGPCLPGGQVQKFILNLDALSKP >NZ_CP018812|1399987:1431945|1414148_1415246_+|WP_075160066.1|DBSCAN-SWA MNSARRQALPQAETARFEAPLPYDWMTPVDVDSPCGPDLEYDPEFVVLSSQLAARMDAQYGDFVGTPEPVDWGEAERDCRRLLLRSKDMRLAVLFTRCRTRLAASSGLAEGLTLLAAWLGAFPETVHPQPGVDDDLDVASEIRMNALQTLTDTDGLMSDVREIALTRSSAARLQVRDVERAFAYPRPVDSLAPESVTRQIDDLRSRQPDALAGFEAALTSLECIESWCKGHLGVYQPDFSTLSRLLRRVIGDAVTTVSVPQEAPCAEAIVPETSGGGVLAGSRHAGLPRMVASDTVALDAGTLSLRDRDEALDLIREARHWFERHEPSSPIPVLLRRAEQFVGKRYADVVKAIPAELLEQWERQE >NZ_CP018812|1399987:1431945|1427932_1428163_-|WP_075160077.1|DBSCAN-SWA MADANLEARPHVTERFVTVQQSQRESHSNKPYWQRSEPPCFPWLKLTGRWIEQAGFEAGQRVRINVEQGRLIITAE >NZ_CP018812|1399987:1431945|1429472_1429673_-|WP_156883989.1|DBSCAN-SWA MPEIPGSFMPEGDSRGILRRTPDPGPTQPDDTPYFPWMEIADRWLATAGFKPGERVSFSIDYRYNN >NZ_CP018812|1399987:1431945|1409041_1409746_+|WP_156883986.1|DBSCAN-SWA MTYSRFFRAFRDATSQLAIPGRERADARAFAMPPLTEGDRASAAIDNGSCGSDSRSSCGSGGFLDLFNEFSDVAHGRSPSERDEASAATAHDDLMATLGDRYYRALESADSLFGEWVDGGASSGEMSADIPAANRDFSDAPSSPIIGLLSDLERLEEAFGPLRNAGVGVPFEVEKVPEILRLFAPPEFQASAARRLDTVLPSIARRDHHTLAIDSPLVAPNGMTSTDSVVGHAE >NZ_CP018812|1399987:1431945|1416220_1418188_+|WP_083615386.1|DBSCAN-SWA MLAFSTNRTITMSGPALPDSSLGMPALQLESIAGEEGLSEIFVYTLRCRTPIELPDEEADNLDLKSMIGKELTVTIELDGMGTPMAGMQGMAGAANIGAGEREISGIVTEARYVDRSDRQSSYVLVMKPWIWLADQRSDFRIFQRKTVIDIIEAVFDNYLYSYDLRLSGSYPVLDYQVQYGETDFYFVQRLMAEHGIYWFFEHSNTFHRMVLVDHLGAHKPVESVAYQTLWYFPPGHRIDQEHITEFDMGGELQSGRWTTNDYDFKNPNAHLVKQNELPQETAHNDFERYEWPGDYTDPSHGEHFARVRMEEVRARGERASGSGNVRNVVCGTTFELEGYPHQAANQEYLVINTWLSATETGEASGSGDYSISSSFVVQPATTVFRPSRSRYQKPRTSGPQTAIVTGPAGQEIWTDQYGRVKLSFHWDRSGVKDQNSSCWVRVSYPWAGGGFGGVNIPRVGTEVIVDFENGDPDRPIVVGRLYNAMTMPPWTLPGNATQSGLISRSMKGGSNNANAIRFEDKQGAEELWLQAERNMRTEVKHDETHSVTNDRKQRVGRDEVVNIGHDHVHISGNNKTVGVGAAFSTVVGEMPDPRQAPLPPGTYVLDVKESILIRCGDSSIFMSKEGMIEVKGKQISEEAADYFLMKGGKIDLNP >NZ_CP018812|1399987:1431945|1410614_1411151_+|WP_083615385.1|plate|DBSCAN-SWA MSRKPRGSGQASTAPRRASAHLLPTLIDRLRDDAPHRQVETAHEYAVTPTRMRDIIQRDLTFLLNATSIEDLIDRKRYPHAAASTVNFGVRPLAGAFTASRRWTEIEKSILHAIGDFEPRLVPGSVRIVPLTEADGNAHYNELAFEIRGTIRMDPYPLEFLVQSSLDLESSRLYTNAR >NZ_CP018812|1399987:1431945|1415752_1416172_+|WP_075160068.1|DBSCAN-SWA MKRLAFFLIGLVCTSLHAAATDPVDEIANRSGLPASEVSALIANCDASQTSMNFCAWRDQLVAEQNLHLVMADREAQSPTCKARLEKQISRWITQRDRACRSEAQQAWGTGSMRQAAQATCAAKQTETLIGKVKAFGCR >NZ_CP018812|1399987:1431945|1403584_1405090_-|WP_075160058.1|DBSCAN-SWA MSTQQVQAPNTASNKAVHTQTDFSQLLTQEFRPKTEAAREAVENAVQTLAEQALQQSVTISDDAYKSIEAIIAQIDHKLSEQINLILHHHDFQKLESAWRGLHHLVSNTETDERLKIRFMDISKEELRRSMKRYKGLAWDQSPLFKQIYEEEYGQLGGEPYGCLVADYYFDHTPPDVDLLGSIAKIAAASHTPFVSGASPSVLQMESWQELANPRDLTKIFTQNLEYAPWNSLRHAEDARYIGLAMPRFLSRLPYGAQTNPVDEFDFEEDTQGSDHRNYAWANAAYAMGVNINRSFKLYGWCSLIRGVESGGTVENLPCHTFPTDDGGIDMKCPTEIAISDRREAELSKNGFIPLIHRKNTDHATFIGAQSLQKPAEYHDPDATANANLSSRLPYLFACSRFAHYLKCIVRDKIGTFREREDMQRWLNEWIMNYVDADPANSSQETKARRPLAAAEVVVEDVDGNPGYYQAKFFLRPHFQLEGLTVSLRLVARLPSVKEAV >NZ_CP018812|1399987:1431945|1405753_1408456_-|WP_075160060.1|DBSCAN-SWA MTISRQSLFGKLGATLFRSIESATAFCKLRGNPYVELVHWLHQLMQLPDSDLHRIIRHAGIERETLERDIIRALNALPAGASSISDFSHHVETAIERAWVLATLAFDDRRVRGAWLIAALVGTPELRRVLLSISPTFAGINADTPGDQLIAWIEGSPESSDAPYDSTSFSPAIPGEASQALLATAQRSPLDQYCVDLTARAGAGEIDAVTGRERETRAIVDILMRRRQNNPLLTGEAGVGKTAVVEGLALAIAKNDVPPALSAVRLMTLDVGALLAGASMKGEFESRLKGVLEAVLKSTTPIVLFVDEIHMLIGAGGQAGTGDAANLLKPALARGAIRMIGATTWTEYKRHVEKDPALTRRFQVLQIPEPQEREAIDMVRGLADALSIHHGVVVRDEAIRASVMLSHRYIPTRQLPDKAISLLDTACARVALSQHAPPRELQTVRRRLQAAHVERDLLGKEARIGLDADKAIAAAQEKITALSAEEAAITACWERQTAAAKALASAREAACTSESDASGNSRSSRDELRKLERDLHSAQGEAPFVFPEVNEAIVAAIVSDWTGIPAGRMVTDEIAAVRTLPETLAARVIGQTDALRRISERVQTARAGLADPRKPLGVFLLAGPSGVGKTETALALAEALYGGEQNLITVNMSEYQEAHTVSGLKGAPPGYVGYGEGGVLTEAVRRRPYSVVLLDEIEKAHHDVHEMFFQVFDKGYMEDGDGRYIDFRNTTILLTSNAGAELIANLCSDVALVPDHDRLREALTPQLLKTFPAAFLGRVTVVPYRPLAHASLASIVRLHLERVARRMADNHGVALSYTGHVVDYVVERCAVQETGARVLIGFIEQHILPALSALWLDAFSLKRTLQGIGVGIVDSTAQPAAALAFKPLVTASTVSVGEPQ >NZ_CP018812|1399987:1431945|1413055_1414162_+|WP_075160065.1|plate|DBSCAN-SWA MKRTPFASRLREPTLSAETVECLYEQPWRYGFLPLMRRIGADDRIDPIGTARRPGAEPFRLGQKPSLAFAPRELASVAEVGGRLNVRLFGLGMLGPNGPLPIHVTEIAREREENRHDHTLVDFLDIFHHRYFTLLYRAWADAQATVGLDRAGAGGERFSFYIASLSGDDSAEIARRVLPAHARLAASAHLVREARDPDGLRSTLERFFGVPVAIDEYVFHWIDIAHADQCRLGRRGDVATMGRGAMLGEQVPDRQHRFRVVIGPLDLDEYLRFTPRGVDLPKLVDWVRVFVGREFEWEVELRIRAQSAPPAQIGGPQQLGWTGWLGCSSSGESITGMRFEPERYADQFVNTCGERRVQPDVQGQYEQR >NZ_CP018812|1399987:1431945|1399987_1401334_-|WP_075160053.1|plate|DBSCAN-SWA MSWHNKVTWSEGLFLRPQLFQQQERYLEHFAHKRAVPLSPFFFGFSAFAIDNEALALGKIVVKSASGVFADGTPFDSPGDTPPPAPLTIRPEHLEQIVYLAVPIRTPNAEETTFENAPDSLARYAVFDTEVRDTNSVGQGARSVQLSNLRLRLLPKKEMAEGWIGLPLTRVKTLCADGSIELDDALVPPVSGYGASALLQSWLANVHELTRLRADALAKRLTGSDGQAGTVAEVSDYLLLQTLNRYEPLLHHLRRVPTTSPADLYALLLSMTGELSTYVRPQTRRPLDSHPAYQHIEPHICLKPVIDDAQWLLNAVLIRSAQSIPLGDAGYGMRNAVIDPAEIRSFNSLVLAVCAQMPADALVQQFATQAKMGPSERLPDLVRSHLPGIALQALPVPPRQIPFNAGYVYYELSQTGALWEAVAQHGGIALHVAGDFPGLKLELWGVRG >NZ_CP018812|1399987:1431945|1411166_1413056_+|WP_075160064.1|plate|DBSCAN-SWA MDPRLLDYYNQELLYMRELASEFAQMHPKIARRLGMQAGEVADPYVERLIESFSFMAARMQLKLDAEFPRFTGRLLEVVYPNYVAPTPSMAVVRFHPSQTQGNLIEGFHVPRATTLTGAAPVGEQTTCEFRTSQPVTLYPLEIVEARLTGIPPDIPMLDRYVPPDTQIRGALRLRLRTTGEVRVAALRGLDRLPVYLAGDEQVASQLFELLHAAGVASITGEPGAFSDPDRPFSAVMQDAVMHEGLGIDQGLLPLVWSKFHGHNLLHEYFACPSRFYFFTLTGLQKGLQKAKGNGLEIVVLLDQLPQKLTGLVDESRFALFCTPVINLFPRGIERIELNKASTDFHLVPKRLAPLDYEVYAVNSLTAQAGKETAALEFRPLYQTLNNDEGNHGRYFSLRRERRLMSDSARRYGTRTPYVGTEVFVSLVDQHDAPYHEGMRYLSVDAWLTNRDLPNLLPRNGIDDLKIAASFPVTGVGLIRAPSTPRAPFAEHETAWRLIRQLNFNYLPLEDMDHRPGGQGLRDMLRLFLSTDDTGLQHQVQSLVGVKTRPVARKLPGNGPLVFGRGIECQLTVDEGGFSGTSPYLFGLILEHYLARHVSINSFTQTELHSMQRGRVMRWPVRIGARGVA >NZ_CP018812|1399987:1431945|1409742_1410618_+|WP_075160062.1|DBSCAN-SWA MTHYAARTSTSVTRRHVSPAGEIETIETSIRREPAVALHRWALFQWLCVTGQWERAVQQIQIFAQLDPPWARVAQAGRELVRAELLRTKVMAGLAKPGFVFDDVPAWMQSLLDALELAAQGRLDASDDTRERALDLAPLVAGRGGGHAFAWIGDSDSRLGPVCEFVTAGRYRWLPLADIAGWRIECSGSPIDLVWVSCVLTLIDGAVLRGFMPARYPVSGSEAAHDREALQLGDTTVWQDRGRTGVFASGRKTWATSAGDFGLFELTRCTLGSAIADSCGVDQCTAKGVSE |
28 | uncultured_Caudovirales_phage(42.86%) | integrase,plate,transposase | attL 1389395:1389411|attR 1434967:1434983 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
2931714 : 2951870
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP018812|2931714:2951870|DBSCAN-SWA GTCATGATCTCGTGCCTCGGTAGGTCTTCGTGAGACCGTGGTTCACCGCATGCCCGCGGCCCCGCACCACATTCGCGAGCCGCGCGCGATCGTGATGACTGGCCGGTGCTTGCCCGTACAGACCGAAGTAGCTATTCGCCATAACGTGCACGTCAACGGCCGGCGCGTTCGCGACGCGCCGCACGCCTTCATTCACCGTGCGCCGCCGCGTCGACCGGCGCCACGGCTTGATGACGTGGCCGACAAAATCGACGCCGCGCTCGATCGGCTGCAGGATCGTCTTGTGTTCGTTCAGACGCACGCAGAGGCGTGCCAGCAGAAAGGCTGCAATGTCGGCGTGCGCGGCATTGAGCCAGTCGGGCGACTCGTGCAGGAACACGAAGTCGTCGACGTAGCGAACGTAGTGCCGCGCGCGCACGATGTGTTTCGCGCGCTGGTCGAGCGCGTCGAGGTAGACGTTCGCGAAGAACTGCGACGACAGATTGCCGATCGGCAGCCCAAGGTGCGCCGGCTGGTTCATGAGCCGCTTGTGCGCCGGCACGAGGTCGATCGATGCCGGGTCGCCGCGGAATTCGAAGTCGGTGCGTGGATCGTGCATCAGCACGGCGCCGGTCAGCCATTGCCAGAACGGATCGCTGATCTTCGCGAGCAGCAGCTCGGACAGAACGGTTTTGTCGATGCTCACGAAGAAGTTCGCGAGATCGCACTTCAGGTAATGAGCCGGGCGCGTCCAGTTCTGCGTGATCGAGCGGATCTTCGCCTCGAGTCGCTGCGCTGCGTACAGCGTGCCGCGCTCCTTGATGCATGCGCACGAGTCGGCAACGAAGCTGCGCTCGAACCGCGGGCCTATGCGGTTGTACAGCAGGTGATGCACGACGCGATCGCGAAAATCGGCTGCCCATACCTCGCGCGCTTTCGGTCGCGTGATGACAAAGCAGATCGACCGGCCGGGACGATACGAGCCGTCGAGCAGTTCGTCGTACAGGTCGCGCAGGTTGTGTTCCAGATTCGCTTCGAACGCAAGCGCGCTGGCCTTGTTCCGTTTGGTGCGTCGACAGTCGACATACGCTTCGACCAGTTCGACGAAATGAAACGCCTGATCTCCATTTTTCCAATCTGCGGACGGCCACGGCGCGAAGCTTGTTGTTCTTGTCGTTGTAGTTCTGGTTGCCGTTGTTGAAGTTCTGATACCAGGCGTAGCCGGATATATCGCACTATCTACGTCGCCCGATCGATTGCTCAATTGGGAAACTGCGCTGGACCGGTCCGCACGCCGGCGGACGGTATCCTCAATGCGCATGTCGGTGGCCTCGTGAGCCAGCGGTGCGACCAGATTGAAAGATCGCTCAGCCATGGGAGCCTTGACCTCCATGGAGCGGGCGATTGCTTGCAGCCTTCTTCCATCCGGTGGCCTGCTTGCCGATGCTCGTGGTCAGTTCGACCGCGCGAGCGTAGGAACCCTTGTCGACCTTGCGCTTGTTGAAACCGAGCCGAAGCAGCAGGTTGATCACCTGCAGGCGCTCGATGAGCTCGGTCAGATGCGGCGATTTGTCCGTCGCAACATTCGCGCGATATACGAGCACGATGATCTCGATGCACTCTAGATTAATCTTCTCGCCGATCGACCGCTTGAAGTCCCGCTGCATGTTCGTGACGACATCTGTCACGACATCCAGCAGGTCTTCTGCGGCGCGGTAGATCGGCAGTTGGGTGTGCAGGGCCACGATGGATTTAAATCGTTAAATTACTGAAGGAATAAATCTGCGGACGGCCACGGCGCGAAGCTCGCAGTCCTTGCCGTCGTAGCCCTGGCCGCCGCTGGTGAAGTACTGAAACCAGGCGGAGCCGGAATTGGATTCGTACTGCTCACCGGACCAGTACCAGGCCGATGCGAAAGCACTCTTGACGTTTGCGAACAGGAGCGACTGCTCGCGCCGCGTCGGCAGGTCGCCGCCCTGCTCGATGGCCCACTCGCGTGCCGCTTCCCACTGAACCGACTCTTTCTCGCCGGGTAGCAGGATCAGGTAGTGCGCGAGGCTGCCGTCGTCGTTCAGCACGGCGCCGGCGAAGCGCTCACCCGACGCGAGCGGAATCGCGGCCGCGGGGACGCGGTATTCGGTCGTCGCCTGCTTTTCGAAGGTGGCGATCATTTCGGCTACCTTCGCGTGCGCCGATTTGATCGCTTCAAGCGTGATCGTCATGCGATGCTCCGTTGAAAATGGATGAAGGGTTAAATCGACAATCTGCGGACGGCCACGGCGCGAAGCTCGAGGTCCTTGCCGCCGCAGGTCTGGCCGCCGGTGCCGAAGCCCTGACACCAGGCGTAGCCGGAATCCCACTCGACGTCATCGTCGGTCCAGTACCACTCCGGCTTCACGAGCGCGCGATGCTCGTTGTAGATGACGAGAGCGTCGAAGCGGTTGAACAGCACGCCGCCGACACTCTTTGCCCATTCGCGCTGTAGCTCGCGCGGCAGGCGCTCATCGTTCACGGCTACCAGCACCGTGTGCTCGACGTCACCGGTCTTGCTGACGCGGCCGTGCAGATAGACTTCGCCCTCGGCAAGCGGCGGAATTTGAATCTGTTGCATGTGTTCTCCTGAAAAAGCAGGGCGCTGAATAAAGCCGCCCCATCAAAACCGGCGCTCAGAAAACGCCGGCGGGAGTGATTCGAAAAAGCGGGCGTCATGTGGACCGCCCTTCCACAAGCGCCGCGGCTACGGGGAGCGCCGCGACGGACCACACAACCCTGCTAACGGGGCCGCCATTCGCTGCCGCGAATTACCCGACCCACCGGCTCAAGCACCAGCACCTCAGATTCCTTTTCCGAGCGGATCAGCGCACTCGCGCGCTTCTGCGCCTTGTCGAGCGAGCCGTGACGCTTCGGCTTCGCGTAACGTCCGACCGTCACGAACAGCGGCGACTTCGCGCCGATCGGACCCAGCGTCAGTTCGTCGATGCGCAGTTCGAGCGCCCTGGTGTTCTGGGTCAGCTTCGCGACATCGCTCTTCAGCGAGTCGTTCTCGACCTTCAGTTCGTCTCGATCGGTTTCGGCCTGCAGTCGCGCGCTCGACATTTCGTGGTACCGAGTGGTCAGGTCAGTAAGCTGATTCGTCAATGCGGCGACCTGGCGCTCGAGGGCGGCGCGCGCGTCCGGTTGCGCAGCAGCGGGCGGCGACGCGGGGGGCAGCGCGGGCGCAGCTGGCGCAGCAGTCGTCACGGCAGCGGTCGCCGGCGTGCGCTCGGCACGCGAGAGCCAGTAGTGGTACTCGTTGCCGGCGCCCTGCCTCTTCTCGCGCTCGACCACACCATCGTTAAGCATGTGATTGAGTTCGCGCGCGACGTCGAGCTGCGGCAGGCCGAGCTTGTGCGAGATCGTCTTCGCGGTGGACTGCTGCACAGTGCTGAGGTACTTCTCGATATCTGCTCTCACGTTCGTATGCTCCTGCGCGAGGTTGGGTTACTGCGCGTCGTCGCCTTGCTCTTCGTGCCAGCGCTTCCAGCCCTTCACCCACTCGATGCACAGCGCGCCGGCCATGACGGGGCAATCGCTCTGCGGCTTGCCATCGGCTGCCGCGTCGAAGCCCGCTTGTTCGGCCGCTTCGAGCTCCTCTTGCAACGGCTGGTGTTCGATCTGCCGCACATCGCCCGGGCCCGCGTCTACGACATCGCCGGTCGGCCCATTCATGCCGTCACCGTCCTCGGACGTGTATTCCTGGCCGAGGTCGAGGCCGCGTTGATCGCCTTCGCCCCTCACTTCATCCATGCCGGCGGTGTGGTCAGCGGAGCTCGCTACAACCACCAGCACCGCCTTGCTTTGCGCGTCATACAGCTCGTGCAACGAAGGTGCGTTCGCACCGAACTTGATGACCGCCTTAACGCCATCCTTGATGGTGATCTGATCGAGGTCACCCGCCACCACCGTGCGACCCGCGCTGGCCAGCGTGTGAACGGCCATCGCGATGTTCGATTGCACGCGTGCACGCAGGCGATCGATGACGTCGTTCTGCTTGGCCTCGGACAGCTTCGCCCAGACGTCGGGCAACAGCTTCAGTTCAAGCACGAGCGCCTGCAGAAGATCTCGACCGATCGTGTCGGCGGTCATGGCGCGGAAGTCTTGCGGTGCATTCATCAGGGTTCATCAGGTGGTGGCGGGGTCAGGGATTCTTCAGGCAGCTTTACCGGCGGTCGCCTGCTTGATGCCGTAGTACACGGCCTTGCAAGCCTCGAGATCGACCGCGGCGTTGTGCGCACCCTCGAGCTTCTTGCCGGTGAAGTACTCGTACGCTTCGCCGAGGTTCGGCGACTTCGCATGCTTGCGGCCCGCGGCGAGCATCTTCGCGGTGGGCGGCAGATTGAGGATCTTCGTGCAACGCCCTTGCGTGCAGAAGGCTTGCCCCCCCTTCCACTGGTCGTGGAACGGATCGTCGTGATCGAGTTGCCGGAAGTACTCGATGCGCAGCATGCGGGCATCGAAAGACTCGTTGTGCGCGACGCGCATGCCGGCGCGGCGCCACATCTCCGTGAACTGCCGCAGCGCATCCGAAATCGGCACACCTTCGAGCGTCGCCCGCTCGGTCGTGATGCCAGTAAGCGCCGCGAGATCGTCGGGGATCGTCCAGCCATCCGGCGAGATCAGCACGTCCATCGCGGCTATCGTGTTGCCGCTGCGCTCGTCGCACAGGTGCGCGGCGAGTTGCGTGACGTGCGGCTGCGACGGATCTTCGGACGGCAGATTCCACTGCGGCAGACCGTTCGTTTCCGTGTCGTAGAAGAGGATGAGTTCCATGAATTGTTCCGTTCGTGGTTGGTGGGACGCTCAGGCGGCCTGGGCGAAGCGAGCGGCGTGCAACTCGCCCTTCTCGATCCAGTGCGCGCGTGTGGTGGACGGCAGGCCTTCGGGTGCCTTCTTCAGCGTGCCGAATACAAGCGCCGTCGCGATCTCGTTCTGCGAGGCCATGTCGTCGAGCAGTTCGAGCAGATCGCCGCGGCCCTTCAGGTCCAGCACGTCGAAGCGGTCGAAGAAAACGATGCGCGACTCCGACAGCACCGCGATCGTCAGTGCGATCAGCGCGTCGACGCGGTACCGCTCGGACTCGCTCAGCAACGAATAGAGGCGGCCGCCGGCGCGGATTGTCATATCGGTGTCGAGCATGGGCACGGCCCACTGCGCGAACGCAGCGAGTTCGGCGAGACGGTTATTGATCGGCGCAAGCGCCTGCGCGAGCATGTCACCCGGGATGCCATCCGGCGACAGCGCAGCGGCGATGTCGAGCCATTGCGTGATGTCGGCGTGGTGCTTCGCGGCAGTCTGCGTTTTGCTGTCGGCAGCGTCGGCCGCGAGCTTCGCGTTGTTCAGCCGATCCAGCTCGGTGCGCAGAGAATCGCGTTGCGCTGTCGATGCGGACACGCTCGTACGCGCTGCGGTCAGTTGCTCATCGGTCACTGCTTCGACGTCGGACTTCAACTTCAGCGCTTCCGACGCAGCGAGAGCGGCGGCGAGATCGCGCTCGTCGTTCTCGACCGAGCGCTTCATCAGGTCGCGCGCCTTGACGAGTTCGGGCAGCTGGGCGCGCGCGTCGGCATCGCCGCCGGTGCCCGGCTCTCCATGCTGATCAACGTAAGCGGCGTAGGCTCGCTCGACGGTCGACAGGTCGTAATCGCTCCACGGCCTGATTGCACCGGTCACACGCTGCACGAGGCTCTCCGCGTCAGCAATGATCACGGTGAACTCGCGTACCGCAGCAGCCAGATCGTGAACGAGGCCTTCGCGCGGCGCGTCGCCGGCACGCTGCTGCGCATCGAGCAGGCGCGCCTCCGCGTCGGCGAGGTTCTGCTTGTCAGTCGCGAGCTTTGCTTCGATGCGCGTCACGCCTTTCGCCAGTTCGGCGCTGCGCTCGGCGGATTCGCGCGACGCGGCGTAGGCGCGGTGTTTTTCCTGCAGCGCGCCCAGGTCCTTATTTGCCTTCGCGATACGGCTCTCGACGGCGGTCATCTGCTGAGTTACCTCAGCGTGCCGCGCTGCATCGAACAGAGGCACCTCGGCCTCCCAATCGGCGGCCTTGTCCTTACCCCACTGTTCATTGGTCACGGCCTTCCATGCGCCCTTCGACTCGCGCGCTTCGTCCTCGGCGAACTTCACGGCGGCCGGGAAGCCTGACCGCAGCATCGGCAGCACCTTCTCGACCTTCGCCGCGTCGCACTTGCGCTCAGCGAGCAGACCCTTCACCTTCTCGGGTGTCGCGCGCAGGCCCGTCAGTTCGAACAGGAACGTGCGCCGATCGTCGGCCTTCATTTGCGCGAACCGCTCCGGGTTGAGCAGGTACGGCAGCGCGCCGGTGTACACGGTGAGGCCCACATGCTTGGCGTCGGGCAACGTGAGGTTCGCCGGACCGACATCCAGATCCAGCGACACGATCGACTTCTTCGCACCGTCGGTCACCATCGACGGGTATTCCTTCTTCAGCGCGACGCGCGAAATGTCACCGGTCAGCGCTGCGCAGATGGCCTCGCGCAGGCTGCTCTTCCCCGCGCCGTTCGGGCCCGCGAAGATGGTGACCGGCGTGTCCACTTCGATGTCCACAGCGCGCGCGCCGAGGAAACTCTTTACGGTGATGTGCTGGATCTGCATGTCAGTCTCTATGGATTGGGGCGGGACTCACCGGGCGGCGGCGTAGATCAACGCGCGGCGGCTACATACGCCGCGGCTGCCCGGGAGCCCCATTGACGGTTACTCGACGCTCGGCGCGGATCCGCGCGATGCGCGGCGGCCGGCGCGCTCGAACTTCTGGCGCAGCGTTGATGCGAGCTCGGTGAGCCGCTGTTCGGCGTCCGGATCGGGTACGTCAGCGATCAGTGCGATGGCGGCGTTCAGCTCGTCGCTGCTGTTCGCTGCGTTCAGGCGCGCGGAGATCTCCGATTCGGGGAGCGGACCTGACTTCGAAGCGGTCTGCGCGGGCGCCGCGCTCTGACCATCCGGGCCGGTCGTTTCGCCCTGCTCGGTGCCCTGGTGATGGTGTTGAGCCTGCTGATCCATCTGCTCGACATTCGCTGCTGGCGCATCAGCTTGTCGCGATGTCTCGGCTTGCGCGGCCCGGTCGGCTGCCGGCGACACGCTTGCGCGGCGCAGCGTGTCAATGTCGGCGGCGTAGGTGCCGTCCGGCTGCTTCGTGACATCGATAATGTCGAGCTCTTCTTCCGACGTGCGTCCCATGCCCATCACGATGTCCGGCGCGTGGATGTTGCCGAAGAAGCTACCGGCGCGGTACTGGAACATCAGGGCGCGCAATTCCGTTTGCCACTTGGACCCAGGCTTGCCGTACCACCCTTCCTCGACAGCCATCTTCATCGTGACGGGCGCAGATTCGACGACGGGCATGCCGAGGGAGCGGTACAGGTCCAGCAGGCGGTTCGGGTACTGACGCAGCATGTCCGGCGTCAAATGCGGCTCGGGAACGCCCTTCGGCAACGCCCACGCGATGCACTCGATGTCGTCGACCTCGACCTCGCGCTCAGCGAAGATCGGTCGCCGCGCTTCTTTATCCCAGCCGGTCTTTTCCTTGTATGTCGCCTTGATGCGGCCGCGGTTGATCGCCTGGAAGCGCAACGGTGTAAAGCGGCCTGACGCATTGATGGCGGCGATCACGAACTTGCCCGACCAGCGCAGCTTGCCTTCGATCGGGTCGGCGTTTTGCATGACCGCGGTGATCGACATGCGGACTGCCCGCGCGACTTCGATAGCAACCAGGCAATTACCGACCGCCGCCGGGTTCTCGACCCAATGCTCTTCGTTACCGACCCTCTTCAGGTTGTGCGAGCGAAACTGCGCCGGCACGGCGTCGCTGCTCGCATACGCCTTGGCAATCCGGTTTGCGAGCACGAAGCCGCGCTCGGTGAACATGTCGACCGCGACGTCAGCAGCGACAACCGCCTGCGGTCCTTGCGCGACGTCAGACAACGCGCGCGCCTGTGTAGGTGCGTTCATTTGTTACCCCTTTGTTTGTTTCGGTAATCCGCTTCGACCTGGCTCCAGTACGGATGCCGGCGCATGCGGGCATGCAATTCCATGTGGTACGTATGCGTACAGATCACGAGATTCGTGTTGCCGTTGTCGCCACCGTCAAAATTCACGTGATGTACGTCTTCGCTACGCTTGAGTGGGCGACCGAGCGCCTTCTCGGCGATAACGCGATGCTCCAACTGCTTCGCGCGGTTCCCGCGTTGAACCAGATAGCCATCCTTCCGCTGCCTCGTGCCGCCCTTGAACTGGTGATTAAGCTGGCCAGCGCCAGCGCCCCGCGGGCGGCGCGGAACACCCATCCGGATCAGGGCGCGCCGCACGTCTTCGTCGTCCAGACCCATCTCGGACGCGATTTGATTCAGGGAGCTACCCGCGAGGTAGGCTTCGCGCACCATGTCGTAGTCGAATTCTTTGCGTTTCATGGTGGTCACTCGTGATAGATGCAGGTGGACCATCTAGAGCAATACGACTTGCTACAAAGCGGACTACTTGGGTTGGGCGGGAAAAGACCGGCCTTGAACATGTCGGCGGCCAACTGGATGAGGCCCGGTGTCTCTTCGTTACCGATCAGCACGCGGCGCGCGTCGAAGATCGGGCTCACGGCGACCGGCGTCTTGGACGTCGTGCCGAGGCCGATGATCTGCGCGCCCGCCGTGCGCTCGCCGGTCGTCTGCTCGTACATGAGCTGATAGGTACCGGTCTGTGCCGAGCGCGACTTGATGTTCGCCTCGCCGTTCGTCACGACGCGGGCACCGGTCTTCACGTCAGGAATGATCGTGCCGTCGACGCCCTTCGCGACGCGCGCGCGGTCCATCGTGCCGGTGAGCGTGATCGTCGTGCCGCCGCCGCAATCGATCTGCAGGGGCTCGAGCGGCATTTCGACGGCCGTGTATGTGAACTGCGGCGCGACCTCGGTGCAGTACTTCGTGGTCAGCGTCAGGCCGATGCGCTCCGCCTCCGCGAACTTGATGTTGTCGTTCGAGAAATCGACGTCGTATTCCGGGTGGTGCAGCGTGTCGACGAACGCGCCTGCAGCATCGTCCGGCGTGATCTCGCTGCCGTTCAGGCGCGCCGAGTCGAACGCGGCGGTACCGGCGTGCACGCTCGTGCCGAGCAGCGCGCGCAGGCCTGCAGGCTTGCGCTTGCCGAGCAGATGTTCAGCCTCCCAGGCATAGGCGCAGTCGAAGAATTTGCCGAACGACGAGGCGCGGATGCGAAGATCGCTCACGTTGCGCTCCTCGATGCCATCAGCTGCAGGCGCAGCGCCGTGTCGTTGTCGCTATCCGCGCCGGCCGCCGCGTAGAGGCATGCCGCGCAAGCAGCAATCATCAGCGCGGCCGCGATGCGCGGATGCTTCGCGTAGAAGCGGTCGAATGCACGAATCACCGCACACCCCCGACGTTCGTGTACGTGAACACGCGGGAGATCTCTTCCGGCACGTAGCCGGTCGAGATCGGCGTGCGAGGCGCAACTCCCTGACGGCTCAGTGCTGCTTTTGCGTCGCGCTGCCGACGCGCCGTGCGGGCGCAAAGCGCTTCGTGCTTGAGCTCGAGTACGCGATCGCGAGTAATGCCGGTCGGACGTCGACGAAAAATGGTCAGCATGCTGATCCCCTGTAGAGCAACAAAAACACGATGCAGACAAGCGCGCCGAACAACGAACCGGTGAGCAATCCGCACTTATGCTCGAGCGCGCGCCACTGGTCTTCGGTAAGCGGTTCTTTCCGAATCGGCTCGCGCTCGATTCCACGCACACGGCGCAACGCGAAGCCAACGAGCTTGCGACACGTCCACTTGAGCGACGGTGTGCGCGTGATGTTCGCGAGGAAGACCGGGCGTCGCACGTCAAGCTTCCTCGCGCTTTTCAACGCACGGGACGCCCAGCTCAGCGCCATCGATGGGGCAGCGATTCACCCAGTCCTCGTGCACCACCGTGCGTCCAAAAGCGTCATCGCGACCGATGCCACCGTCGACGAAACTGCCGGCCACGTTGCATTCGCACGTTGGGCAGAAGCCGATACGACCGCGACGGATATATCGCGGACGTCTCACTTCATTCGGCATTTCGTCGTAGAGCACAACCACAATGGACTCCTAAATTAGGACCGGCCTTCCACCGGTACGGCTTTCGCTCGCAGAGGTCGGATTTCGACATGCATCACGGACTTTCGCCGCTCTGCCGGAAACTGCACGCCTATTCCGGTCACTCCACGGCGCGGGACACTGTTCCCGCCCGCTTCTCACGGGCTGCGGCTTATAGTTGCTTAAAGCGGCCGTCGCTTCGTTGAGGCCCTCAAGGGTGGATGCCCAGCGCTCGCATAAGGCAGCGTTGTGACGCCATTCGTGGGGGTCCATCATTAGTGCAGCCGCGCCCACCGACTGCCGCCAGACACCCACACTTCAGGGCGCCGGTGCTCTCCCGGCTGTCAAGCCTCAATTGCAGGTCGGCTTTCACCTTCTTCACTTCACACTTGGGGCCGGCGGCTGCGCTTATGGTGCAAACCTCGCGAACCGGGGAGTCCCGCAGCCCTCTCCCCGCCCAAACCCTCGCCGTACTGCGCCCCTCTTCGCACGGGGTATGACGTCGACTCCTTCGGAGTCGAATAACTGAATGACTAAATGAACAATCTGCGGACGGCCACGGCGCGAAGCTTGCTGAGCTCGTCGCCGTAGAGCTGGCCGCCGTTGCCGAAGAGCTGAAACCAGGCGTAGCCGGCCGTCTCCTGCGTGCTCGACCAGTACCAGTCCTTCTCGAACTTGTGCGGGATCGTCGCGTAGCAGAGGTTGAGCTCGCGCTGCGCCGGCAGATGGAAATCGGTGTGGCCGTCGGCCGAGTACTCGCTGGCCCACTTCGCCGCGGGATGCTCGGTCGTCGCAGCGAGCAACGCGCGCGTGTTCGCGGCGCCGTCGTAACGGCTGTCCGCACCCGGCACCTTGCTGCCGTAACCGCCGTACGCGACGTCTTCGACATCGGTCGCTGAAACGATCAGGTGATACGGACGCTGCCCGTCTTCGCCCGGCATGATGCCGGCGTAGATGCCGCCCTGACCGGGCCAGTACTCGCCAATGGCGGGCGCTGCGATCGCGACAGCGGCTTCGTTCACTGCATTCATGTCGTTCATCCTTTATTCGTTGCTGCGGTTTATCGGGTCCGGCGCCAGCTGGCACCGGTCAAGCTGCTTAATCGAATCGGCGTTCGCCCTTGCTTCCCGCCTTGCAGATCACAAGGGCAAACAGCAGCGCGATACCGAAAGCGGTGAATCCAACTGCCACGCAAGTTGCAAACATGATCAGTGCCCCAACGCTTCGTTGCGGACCAGCACGAGACACGGCTGGCAGCAGTACGCAGAAAACCTTTGCCCGAACGCACCATAGAACGAGCGCGTCTCGACGCCTTCGACCAACTTCTCGCAGCGCGGGCACAGTTCCATGACCGTCACTTTCATCGGCCGGTTGCGCCACGCTTCCAGTGACTTCGACTCCATGTCAGCACCCTCCCCGAACATCCGCGACGCCCGGCGCGATCAGGTTCAGCGCTAGCTCGCGCAGCAAGTGCTCGATCAACGCGCCGCGCGGCATCGCGCGCAGCTCCAGCAGATCCTTTGCGAAGATGGACATGACACCCTCACTGACGTGGTTGAATCAGTTCAGATCGCGACGTTGCGGTAGTGGGCGAACTGCTCGCCATCGACGTCAGCCATGCCGACGAACTCGGCGCCGGTGATCTGCTGAAGCTTCGCGCACACCGCGGCGTTGTCGATGACGCGCTTTGCCGTCGCCGCCGCCCGCGCTTCGCGTTGCGCCGTGATGCGCTTGGATTGTTCGCACATCGCGACTGCTTCTTCATCGCTGATCTTGTTCAGACCGTTGTGCGGAACGAACGCGTACAGCTGCGTGCCGTTCGTGAGGATGTAAGCGCCCGGCAGGCCGTCACCGGTTGCCTCGAGCGAGGCGACGACCGTGAGACTCAGGAAACCCACGCGGACCTGTTGACCGATTTCCCACGATTGCTTGCTGTGCTTGCCCATGTCCCGACTCCCGCGATGCGTTGTGTGCTGCCATGGACTTGATTAAACACCATGTTTATAGCGATGTCAAACACCGTGTTTAATTTCTACGTGGAAATTTGTAACAGGTGTTATGCGCATAGCATTCGCGTATGGGAAATGCCACAAAAAAGCCCGCTCTAGGCGGGCTTCAGTGGAGATATGCGGCGGGGTACGTCAGATCGGGCGCCACATGGACGCACGCGTGTAGGCGACGACGTAGTGCATTTTTTCGATTTCTTCCTGCGCGATCGCGACGGACGGGTGCGCTTCGTTGATCGAAACAAGGTGGATGCGGCCAGCTCGCTTGTAAAGGAATGTCTTGACCATCACGCGACCGTCGATCGACTTTACGAGCACATCGTCACCCGGCTCGGGTTCGTGGTTCGGCTCGATGATGACGAATTCGCCGTCACGAATGCGGGGGCGCATCGAATCGCCCACGCAGCGCAGCGCATAGGCGTCTGGGTCCCGCGAAGGGAAGTCAACATAGCCGTCCCCATGGCCGACCGGATACTCGAGATCCGACCAGTGCCCGTTGTCCCCTAGCTGAGCCATTCCTAGTACCGGAATCGGTTTCCAATTTGTGATCGGGATAGGCCGGTATTCATCGGCGTAGCGGACAGCTACGCCTGGGTCACCTTTGCCCTTTATTAGCCAAACCGCGTTCACGCCGTACGTTTTCTGAATGGCGACAGCCTGATCAAGCGTCAAAGTCTGGCCTTCGCCGGCAAGCCATTGAGAAGCGACCTCGAGGCCAACGCCAGAGACGCTTGCCAAAGTCTGCGCAGTGACGTTCTGCTCGGCGAGCACCGCTTTTAGACGTTGTGGGGAAGTAAGTTTGGCTGCATCAAGGGTAGTGCTCGCGGCTTTTTCCGATTCCGCAATAACCGGCATGTTTAATACAGTGATTGCCGGATCATCTTCGGTGGAAACGTGCTTCGCGCCGGCATCTACCATTTCGCCTTCGCCCGTGCGCAGCCACACGGCGCGGCACCCGATCACGCGCTCCGCTTCCAGCATTCCGTCGGAAGACACGCCGCGGCCCATCCAGTTAGTCACTTTTTGCGGCGAGACATGAAGCAGGCGTGCGACCGCCGAAGGGCCCTCGGCGCCCTTCAGGGTTTTTGCGGCCTCGAGCAGACGCCCGGCGGTTGGATGGATTTCACGGCTGCGAGCATTCATGCGCGGCATGGTTGCACAAGTAAACAAAATGTTGTTACACGCGGTGTTTGCTTTTCTTTTAAACATGGTGTTTAATCATGCTCACTATGAGCAAACACCGTGACCTCCACCCGGACGCAAAGATCATCGACGAGCTCGGTGGTCCGACGAAGCTTGCTGAGCGCCTCGGCTACGACAAGGCGTCGGGCGGCGTCCAGCGCATCCAAAACTGGAAGTGGCGCGGCATTCCCGCTCACGTGAAGGTTGAGCACCCCGAAATTTTCATGACTGACCTGATAGATCGCGTCAAGGCGTCCGATGACGCTCAACCGCCTGCGGGAGGATCTGTCGACGACGCAAAGATGGCGAAGATGGTCGTGTGATTGGAGTCGGTGCATTGAAGTCCTGATCTGATTTTCGCTTTTGGGCTGACTGCTTGTCGGCCTTTATTTTCGGCCCCACTCCATGTGGGTAAGCAAGTGGGTAATCAACCGGGTTTTCATCATTTTTCACTATCGCAAAGGCCATGAAACAACAAGAAATCCGCCTCTTCGCGCCGTATGTCGAGGGCGAGCGGCTACCTAAAGAGCAGATCGAGGCGATGACCTTCGAGGAGTGCGTTGCCAAGGCGCTCGAGATCGGTCTCAAGCGTTTCGATCGCAAGACGCTCGCAAAGCAATGCCGCATCCACTACCCGCACTTCTCGGAATACATGTCCGGTGCGCGCAAGCTCGACCACCACCGCCTCTTTCTCTTCTGCATGTTCTCCGGCTGCGAGTACCCGCGGCAGTGGCTCGAACTTGCAGAAGAAAAAGCGCGCGCCGAATACAAGCGCCAGAGTGCGCAGGTCGTCGGCGAGTACGTCCAACAAGCTTTCGCACAACGGGCGGCTGCATGAAATTGAAGCAGGGCGACAAGGGGACCGTGTTCGTCGCGCCTTCCCAGCGCCGCGTCGTGTTCGATCACGAGCACCGCGGCCGTTACGTTTTCATCTACGAGGACGATCCGGATGATGGCCTCGCGCTGTCGGCCGAGGGGCTGAAGATCCTGGAGCGCGCGCCGGCGCCCGCGGTAGGGGGTCAATCGTGATCCTGCACGGCCTTCTCGATGCGCTGCTGCGCGCGTTCGTCATCGCGCTGGTCGCCGCTGCATTTTTTCCGCACCGCTGGATCTTCTCGGTCAGCCTCGTCGTCGCGTTCATCGGCGCCGTGGCCACGATGTTGTGGGGACGGCTGTGAACGCGCGCCTGCCGACCGATTCGTATCGCTCGTTCCTTGAAGCGAAGGTGCGTTTGGCCCGCTTCGACGGGTTCGACGTCTCGCTCGACGAGATCAATCCGAAGCTGAAGCCACATACGCGCGACATCGTGCGATGGGCGCTGCAGGGTGGCCGTCGCGCGGTCTTCGCGTCGTTCGGGTTGCACAAGACGGCGACGCAGATCGAAATCATGCGACTGATCGGCGCACACCGGCCGTGCCTTCGGGCCATCGTCTTGCCGCTCGGCGTGAGGCACGAATTCACCGGGCAGGCGGAAGAGCATTTCGCCGGCGACTACGCGGCGCCGGTGCGCTTTATCCGGTCGGACAGCGAGATCGGCGACGAGCGCGAGATCTATCTGACGAACTACGAGTCGGTGCGCGAGGGAAAGGTGACGCCGAAGCTGTTCGGCGCGGCCAGCCTCGACGAGGCTAGTGTGCTGCGCAGCTTCGGGAGCAAGACATACCAGGAGTTCCTGCCGCTGTTCGACGGCGTCGAGTTCAAGTTCGTCAACACGGCGACGCCAAGCCCCAACCGGTTCAAGGAACTGATCCATTACGCGGCGTTCCTCGGCGTGATGGACAGCGGCCAGGCGCTCACGCGCTTCTTCCAGCGCGACAGCGAGAAAGCCGGAAACCTGACGCTGTATCCGCACAAGGAAGAGGAATTCTGGCTGTGGGTCGCGAGCTGGGCGGTCTTCATCCAGCGTCCGAGCGATCTCGGCTATAGCGACGAAGGCTACGACCTGCCCGAGCTCGACGTGCGCTATCACGAGGTCCCGACAGACTATGCGAAGGCCGGCGCGGATCGCGATGGGCAGACGTTGATGTTTCAGGATCCGGCTCTCGGCCTCGCCGCGGCTGCAACTGAGAAGCGCGACAGCATGCCGGCGCGGATCGCGAAAGTGCAGGACATCGTCAGCGCCGACCCGGCCGACCACTTCGTCATCTGGCACGACCTCGAAGCCGAGCGGCACGCGATTCAAACCGCGCTGCCGGAGGCGGTGAGCGTTTGGGGTACGCAAGATCTCGACGAGCGCGAGCAGCGCATCGTCGATTTCGGTAAAGGCGCGTATCGCCTGCTGTCGACGAAGCCCATCATCGCCGGCTCTGGCTGCAACTTCCAGCGGCACTGCCACCGCGAGATCTTCGCGGGCATCGGTTTCAAATTTAACGACTTCATCCAGGCGATTCACCGCGTCCAACGTTTCCAGCAGCCGCACCGCGTGCGCATCGACATCGTGTACAGCGAGGCCGAGCGCGAGGTGCTGCGCACGCTGCAGGCGAAGTGGGCGCAGCACGAAGAGATGGTGCAGAAGATGACCGACATCATTCGCAAGTACGGGCTCAACCAGCTGGCGATGCAGGAAACGCTCGCGCGCTCGATCGGGGTCAAACGAATCGAAGTATCGAGCGATCGCTTTTGCGTCGCGAATAACGATTGCGTCGACGAGGCGCGCCGGCTGCCTGAAAACCATGTCGACCTCATCGTGACCTCGATCCCGTTCGCGAACCACTACGAGTACTCGCCGAGCTATAACGACTTCGGGCATACGGACGATAACGCCCATTTCTGGCAGCAGATGGACTATCTGACGCCGCAGCTGCTGCGCATCCTGAAGCCGGGCCGCATCTACGCATGCCATGTGAAGGACCGGATCCTGTTCGGCAACGTGACTGGCGCCGGCCTGCCGACCGTCAGTCCGTTCCACGCGGAAGCACTCTTCCACGGCATCAAGCACGGCTTCGACTACTGCGGGATGATCACTGTCAACACGGACGTCGTGCGCGAAAACAATCAGACATACCGCCTCAGCTACACGGAAATGTGCAAGGACGGCTCGAAGATGGCCGTCGGCTCTCCCGAATACATTCTGCTGTTCCACAAGCCGCAGACCGATCGCACGAAGGGCTATGCCGACGTGCCGATCCGCAAGTCGAAAAGCGAATACAGCCTCGCGCGCTGGCAGATCGACGCGCACGCATTCTGGCGTTCGAGCGGCAACCGCCTGCTGACCGCGGAGGAGCTCGCCGCGCTGGGCCCGGAGAAACTCGCCAGCCTATTCGCGAAGTACTCGCTCGAGAACGTGTACGACTACGAGTTCCACGTCCGGATCGGCGAAGAGCTGCAGGAGCGCGGCGCGCTGCCGACCAAGTTCATGAGCCTCGCGCCCGGCGCGCACCATCCGGACACGTGGCACGACGTCACGCGCATGCTGACGCTCAATGGCGAGCAGGCGAAGCGCGCGGTCGAAAAGCACGTGTGCCCGCTGCAGTTCGACATCGTCGATCGTCTGATCGATCGTTACAGCAACCCCGATGAGCTCGTCTACGACCCGTTCTGCGGCCTCGGCACCGTCCCTTACCGCGCGATTCTGAAGGGGCGCCGCGGCGGCGGCTCCGAGCTTAATCCGACCTACTTCATGGATCAGGTGCACTACCTGCGCGCGGCCGAGCGCGAGTACTCCATGCCATCGCTGTTCGACATCGAGGAGGCTGCAGCGTGAACGCACCAGACCTTTTCCGCGATCACAAGCCGCCGCTGTGGCGCGAACAGGATGCTATCGCCGCGCGGGTAGTGCACGCGCGCAGCCGCGATGAGAAGCACCAGATCGTGCGCTCGCTTTGCCATTCGCTCGCGCGCATGACCGACAAGCGTCGCTATCGGCTCTGAGATCCGCATGGAAAGCCCTCGCCCAATCCACGCAGACCCTCTGCTCGGCGAAGTATTCGAGTTGCTGCACCGGATGGAACTGTGCAGGTCGCTCGAACCATTCCAATGGATGGCAAGCGACGCGCTGAAGAAGCTTCGCGCGTATGAAGTCAAGCAATTGAAGGAGGCGTGTAGTGGCTGGTGACTGGATCAAGATGCGCACGGATCTCTTCACACATCCGAAAGTTGTCCGAATTTCGTCCACATTGAAAGCGGACAGATTTCGGACGGTTGGCGGACTCATGTCCGTGTGGTGTCTCTTCGACGCGCACTCAATTGACGGACATCTCGACGGATACGATCTGTCGACGGTGGACGATCTGATTGGATGGCCCGGCTTTGCGGCGGCCATGAAAATGGTCGGCTGGCTCGACGATAACTCGGAGGGCCTTGTGCTGCCTGAGTTTGACACGCACAACGGTCAATCCGCGAAGCGTCGCGCACAGGATAGCGACCGCAAGAGAGCGTCCCGTTTGTCCGCTTCCGATGCGGACAAAAAGCGGACTAGAGAAGAGAAGAGTAAGAGTAGTAAACAACCCCCCATACCCCCCAGGGGGGGCGAAGCCGAAGAATCGAAAAAACGGCCAACCATCGCTCTGAAGACCTTCCTTGAGCAGTGCAAGGAAAAAGGCGAAAAGCCGATCCTGGAAAACGACACGGTTTTCGACTATGCGTCGAAGACGGCTATCCCGGACGACTTCCTGCGTCTGCACTGGCTCGAGTTCAAGGCGCGTTACTGCGAAGAGGGTTCGAAGCGGTACAAGGACTGGCGCTCGGTGTTCCGTAAATCCGTCCGGGGCAACTGGTTCAAGCTCTGGTGGATCGCTGCTGACGGTGCCTGTTCGCTGACGACTGTCGGCGAGCAGGCGAAGCGCGCGCACGGCAGGGACGCAGCATGAACGCGCGCGCGCCCCATGACGAAAGCAGCGCGTTCGTCCCGCCGCACAGCGTCGAAGCCGAGCAGTTCGTGCTCGGCGCGCTTCTCATGGACAACGATGCGATCGACCGCATCGGCGAACTGCGCGCGGAGCACTTCTATCGCCACGACCACCGGACCGTTTTCGACCTGATATCGCGGCTGATCGTCGCCGGCAAGCGCGCGGACGTCCTCACCGTGCTCGAGGCGGCACAGATCGGCGGCCGGGACGAAGCAATCGGCGGCCTGGCCTACCTGAACGCACTGTCGGGCAACTTTGTCGGCAGCGGCGGTATCGCGCGCTGGGCCGAAATGGTCATCGAGCGCTGGCGCCTTCGCGGACTGCTCGCGGCGGCGCGCGACGTTGAGGCGCTCGTGCATAACCGCGGCGCGCGTACGGCGTCCGAGCTGATCAGCGAGGCGCAGGCGAAGTTCGAACCGCTCGCTGACATCCGTTCGTTCGAGCCTCAAATGCCCGGCCCGGTGCTGACGGAGATCGTTGAGGAGATCGATCAGACCTACCACGGCGCCGAGTTGCCGGTCGTGCCGACCGGCTTCCGGGATCTCGACGCCAAACTCGGTGGCGGCATGCGCGGCGCCGAGCTGGTCATCATCGCCGGCAGACCGTCGATGGGGAAAACCGCGATCGCGATGGCCATCGGTGGCTACGTTGCCGAGTTGCAGGGCATGGTGCTGGTCTTCTCGCTCGAGATGTCGTCGAAGCAGCTGCATCAGCGGAACATCGCGCGCGTGGGCGGGATTCACCTGTCGCACGTGCTCGACGGCAAGAAGTTCGTCGATAGCGACTGGCCGAAGCTGACACACGCGGTGACCGTGCTGTCCGAAGCGCAGATGCTCGTCGACGACACGTCGGGCCTGTCGATGGACGAGATCGCGAGCCGCGCGCGAACGGTGAAGCGCCAGCACGGGCTGAAGCTGATCATCGTCGACTACCTCGGCCTGATGACCGGCGGCCCCGACGAGCGGCACGACCTGAAGATCGGCAGCTACTCCGCGGGCCTGAAGGGGCTTGCGAAGCAGCTCGACGTGCCGGTGATTGCTCTCTCGCAGCTCAACCGCGGCGTCGAGCAGCGGCCGAACAAGCGCCCGACCATGGGCGACCTGCGCGACTCCGGTGCCATCGAGCAGGACGCCGACATCATCCTGATGCTCTACCGCGACGAGGTCTACAACGCCGATAGCCCCGACAGGGGCACGGCCGAAATTATCGTCGGCAAGCAACGCAATGGTGAGACGGGACCCGTGCGCCTGGCGTTCGCCGGCGAGTACCAGCGCTTCTCGGATCTCGCGTACGACTACGTGCCGGCGACCCCCGAGCAACCCAAGAAAACACGTCGAGGTTTCGAATGACCTGGATCGCCAATCAAACCCCGGAGGAGCTTCGCGCGTTCCTGCGCAAGCGCGAGAAAGAGTGGATTGGAAGCATGACCGACCCGCTCGCGCAGCGCCTGTGGAAAGAGGTACAGGTGCTCGACGCGCGGATCGTCGAACTCGAAAAGCAACTTGCCGCGCGGACGCCGGCAAAGGCGGTGGCATGACCGCGATCCTCGCTATCGATCCGGGCACAGACGAATCCGGCTGGTGCCTTTATCACGACAGCGGCCGCTTGTTCGCCAGCGGCGTGAAGCCGAACGATTTGCTGTTGCAGGAGATCCGGGAGACGCCGGCCGACGTATTGGCGATCGAGATGATCGCCAGCTACGGCATGCCGGTTGGCCGCGAGGTCTTCGAAACATGCGTGTGGATCGGCCGTTTCACGCAGGTGTTCCGGCGCCCCGATGACGTCCTCTTCGTCTATCGCCGCGACGTGAAGTTACACCTGTGCGGCAGCCCTCAGGCGAAGGATCCGAACATCCGCCAGGCGCTGCTCGACAGGTTCCCGCGCACTGGCGGCGGCAAGGTCCCGCAGGTTGGGGTGAAGAAGCAGCCGGGCCCACTGTTCGGTGTCTCGACGCACGCATGGTCGGCTCTTGGTGTCGCTGTTACTGCCGCGCACCAGCTCGCGCAGAAGGAGGTCGCATGA
Protein sequences of DBSCAN-SWA_2 >NZ_CP018812|2931714:2951870|2948700_2948871_+|WP_156884056.1|DBSCAN-SWA MNAPDLFRDHKPPLWREQDAIAARVVHARSRDEKHQIVRSLCHSLARMTDKRRYRL >NZ_CP018812|2931714:2951870|2945467_2945839_+|WP_075161173.1|DBSCAN-SWA MKQQEIRLFAPYVEGERLPKEQIEAMTFEECVAKALEIGLKRFDRKTLAKQCRIHYPHFSEYMSGARKLDHHRLFLFCMFSGCEYPRQWLELAEEKARAEYKRQSAQVVGEYVQQAFAQRAAA >NZ_CP018812|2931714:2951870|2933452_2933914_-|WP_075161160.1|DBSCAN-SWA MTITLEAIKSAHAKVAEMIATFEKQATTEYRVPAAAIPLASGERFAGAVLNDDGSLAHYLILLPGEKESVQWEAAREWAIEQGGDLPTRREQSLLFANVKSAFASAWYWSGEQYESNSGSAWFQYFTSGGQGYDGKDCELRAVAVRRFIPSVI >NZ_CP018812|2931714:2951870|2942346_2942844_-|WP_075161625.1|DBSCAN-SWA MNAVNEAAVAIAAPAIGEYWPGQGGIYAGIMPGEDGQRPYHLIVSATDVEDVAYGGYGSKVPGADSRYDGAANTRALLAATTEHPAAKWASEYSADGHTDFHLPAQRELNLCYATIPHKFEKDWYWSSTQETAGYAWFQLFGNGGQLYGDELSKLRAVAVRRLFI >NZ_CP018812|2931714:2951870|2935880_2936501_-|WP_075161164.1|DBSCAN-SWA MELILFYDTETNGLPQWNLPSEDPSQPHVTQLAAHLCDERSGNTIAAMDVLISPDGWTIPDDLAALTGITTERATLEGVPISDALRQFTEMWRRAGMRVAHNESFDARMLRIEYFRQLDHDDPFHDQWKGGQAFCTQGRCTKILNLPPTAKMLAAGRKHAKSPNLGEAYEYFTGKKLEGAHNAAVDLEACKAVYYGIKQATAGKAA >NZ_CP018812|2931714:2951870|2939692_2940187_-|WP_083615500.1|DBSCAN-SWA MVHLHLSRVTTMKRKEFDYDMVREAYLAGSSLNQIASEMGLDDEDVRRALIRMGVPRRPRGAGAGQLNHQFKGGTRQRKDGYLVQRGNRAKQLEHRVIAEKALGRPLKRSEDVHHVNFDGGDNGNTNLVICTHTYHMELHARMRRHPYWSQVEADYRNKQRGNK >NZ_CP018812|2931714:2951870|2943954_2944674_-|WP_083615569.1|DBSCAN-SWA MPVIAESEKAASTTLDAAKLTSPQRLKAVLAEQNVTAQTLASVSGVGLEVASQWLAGEGQTLTLDQAVAIQKTYGVNAVWLIKGKGDPGVAVRYADEYRPIPITNWKPIPVLGMAQLGDNGHWSDLEYPVGHGDGYVDFPSRDPDAYALRCVGDSMRPRIRDGEFVIIEPNHEPEPGDDVLVKSIDGRVMVKTFLYKRAGRIHLVSINEAHPSVAIAQEEIEKMHYVVAYTRASMWRPI >NZ_CP018812|2931714:2951870|2946175_2948704_+|WP_075161175.1|DBSCAN-SWA MNARLPTDSYRSFLEAKVRLARFDGFDVSLDEINPKLKPHTRDIVRWALQGGRRAVFASFGLHKTATQIEIMRLIGAHRPCLRAIVLPLGVRHEFTGQAEEHFAGDYAAPVRFIRSDSEIGDEREIYLTNYESVREGKVTPKLFGAASLDEASVLRSFGSKTYQEFLPLFDGVEFKFVNTATPSPNRFKELIHYAAFLGVMDSGQALTRFFQRDSEKAGNLTLYPHKEEEFWLWVASWAVFIQRPSDLGYSDEGYDLPELDVRYHEVPTDYAKAGADRDGQTLMFQDPALGLAAAATEKRDSMPARIAKVQDIVSADPADHFVIWHDLEAERHAIQTALPEAVSVWGTQDLDEREQRIVDFGKGAYRLLSTKPIIAGSGCNFQRHCHREIFAGIGFKFNDFIQAIHRVQRFQQPHRVRIDIVYSEAEREVLRTLQAKWAQHEEMVQKMTDIIRKYGLNQLAMQETLARSIGVKRIEVSSDRFCVANNDCVDEARRLPENHVDLIVTSIPFANHYEYSPSYNDFGHTDDNAHFWQQMDYLTPQLLRILKPGRIYACHVKDRILFGNVTGAGLPTVSPFHAEALFHGIKHGFDYCGMITVNTDVVRENNQTYRLSYTEMCKDGSKMAVGSPEYILLFHKPQTDRTKGYADVPIRKSKSEYSLARWQIDAHAFWRSSGNRLLTAEELAALGPEKLASLFAKYSLENVYDYEFHVRIGEELQERGALPTKFMSLAPGAHHPDTWHDVTRMLTLNGEQAKRAVEKHVCPLQFDIVDRLIDRYSNPDELVYDPFCGLGTVPYRAILKGRRGGGSELNPTYFMDQVHYLRAAEREYSMPSLFDIEEAAA >NZ_CP018812|2931714:2951870|2934464_2935145_-|WP_075161162.1|DBSCAN-SWA MRADIEKYLSTVQQSTAKTISHKLGLPQLDVARELNHMLNDGVVEREKRQGAGNEYHYWLSRAERTPATAAVTTAAPAAPALPPASPPAAAQPDARAALERQVAALTNQLTDLTTRYHEMSSARLQAETDRDELKVENDSLKSDVAKLTQNTRALELRIDELTLGPIGAKSPLFVTVGRYAKPKRHGSLDKAQKRASALIRSEKESEVLVLEPVGRVIRGSEWRPR >NZ_CP018812|2931714:2951870|2949805_2951197_+|WP_075161176.1|DBSCAN-SWA MNARAPHDESSAFVPPHSVEAEQFVLGALLMDNDAIDRIGELRAEHFYRHDHRTVFDLISRLIVAGKRADVLTVLEAAQIGGRDEAIGGLAYLNALSGNFVGSGGIARWAEMVIERWRLRGLLAAARDVEALVHNRGARTASELISEAQAKFEPLADIRSFEPQMPGPVLTEIVEEIDQTYHGAELPVVPTGFRDLDAKLGGGMRGAELVIIAGRPSMGKTAIAMAIGGYVAELQGMVLVFSLEMSSKQLHQRNIARVGGIHLSHVLDGKKFVDSDWPKLTHAVTVLSEAQMLVDDTSGLSMDEIASRARTVKRQHGLKLIIVDYLGLMTGGPDERHDLKIGSYSAGLKGLAKQLDVPVIALSQLNRGVEQRPNKRPTMGDLRDSGAIEQDADIILMLYRDEVYNADSPDRGTAEIIVGKQRNGETGPVRLAFAGEYQRFSDLAYDYVPATPEQPKKTRRGFE >NZ_CP018812|2931714:2951870|2935172_2935844_-|WP_075161163.1|DBSCAN-SWA MNAPQDFRAMTADTIGRDLLQALVLELKLLPDVWAKLSEAKQNDVIDRLRARVQSNIAMAVHTLASAGRTVVAGDLDQITIKDGVKAVIKFGANAPSLHELYDAQSKAVLVVVASSADHTAGMDEVRGEGDQRGLDLGQEYTSEDGDGMNGPTGDVVDAGPGDVRQIEHQPLQEELEAAEQAGFDAAADGKPQSDCPVMAGALCIEWVKGWKRWHEEQGDDAQ >NZ_CP018812|2931714:2951870|2943378_2943759_-|WP_075161172.1|DBSCAN-SWA MGKHSKQSWEIGQQVRVGFLSLTVVASLEATGDGLPGAYILTNGTQLYAFVPHNGLNKISDEEAVAMCEQSKRITAQREARAAATAKRVIDNAAVCAKLQQITGAEFVGMADVDGEQFAHYRNVAI >NZ_CP018812|2931714:2951870|2949506_2949809_+|WP_075161628.1|DBSCAN-SWA MKTFLEQCKEKGEKPILENDTVFDYASKTAIPDDFLRLHWLEFKARYCEEGSKRYKDWRSVFRKSVRGNWFKLWWIAADGACSLTTVGEQAKRAHGRDAA >NZ_CP018812|2931714:2951870|2951193_2951385_+|WP_075161177.1|DBSCAN-SWA MTWIANQTPEELRAFLRKREKEWIGSMTDPLAQRLWKEVQVLDARIVELEKQLAARTPAKAVA >NZ_CP018812|2931714:2951870|2945835_2946030_+|WP_075161174.1|DBSCAN-SWA MKLKQGDKGTVFVAPSQRRVVFDHEHRGRYVFIYEDDPDDGLALSAEGLKILERAPAPAVGGQS >NZ_CP018812|2931714:2951870|2946026_2946179_+|WP_156884055.1|DBSCAN-SWA MILHGLLDALLRAFVIALVAAAFFPHRWIFSVSLVVAFIGAVATMLWGRL >NZ_CP018812|2931714:2951870|2943021_2943216_-|WP_075161171.1|DBSCAN-SWA MESKSLEAWRNRPMKVTVMELCPRCEKLVEGVETRSFYGAFGQRFSAYCCQPCLVLVRNEALGH >NZ_CP018812|2931714:2951870|2940956_2941118_-|WP_156884054.1|DBSCAN-SWA MIRAFDRFYAKHPRIAAALMIAACAACLYAAAGADSDNDTALRLQLMASRSAT >NZ_CP018812|2931714:2951870|2936531_2938343_-|WP_075161165.1|DBSCAN-SWA MQIQHITVKSFLGARAVDIEVDTPVTIFAGPNGAGKSSLREAICAALTGDISRVALKKEYPSMVTDGAKKSIVSLDLDVGPANLTLPDAKHVGLTVYTGALPYLLNPERFAQMKADDRRTFLFELTGLRATPEKVKGLLAERKCDAAKVEKVLPMLRSGFPAAVKFAEDEARESKGAWKAVTNEQWGKDKAADWEAEVPLFDAARHAEVTQQMTAVESRIAKANKDLGALQEKHRAYAASRESAERSAELAKGVTRIEAKLATDKQNLADAEARLLDAQQRAGDAPREGLVHDLAAAVREFTVIIADAESLVQRVTGAIRPWSDYDLSTVERAYAAYVDQHGEPGTGGDADARAQLPELVKARDLMKRSVENDERDLAAALAASEALKLKSDVEAVTDEQLTAARTSVSASTAQRDSLRTELDRLNNAKLAADAADSKTQTAAKHHADITQWLDIAAALSPDGIPGDMLAQALAPINNRLAELAAFAQWAVPMLDTDMTIRAGGRLYSLLSESERYRVDALIALTIAVLSESRIVFFDRFDVLDLKGRGDLLELLDDMASQNEIATALVFGTLKKAPEGLPSTTRAHWIEKGELHAARFAQAA >NZ_CP018812|2931714:2951870|2938442_2939696_-|WP_075161166.1|DBSCAN-SWA MNAPTQARALSDVAQGPQAVVAADVAVDMFTERGFVLANRIAKAYASSDAVPAQFRSHNLKRVGNEEHWVENPAAVGNCLVAIEVARAVRMSITAVMQNADPIEGKLRWSGKFVIAAINASGRFTPLRFQAINRGRIKATYKEKTGWDKEARRPIFAEREVEVDDIECIAWALPKGVPEPHLTPDMLRQYPNRLLDLYRSLGMPVVESAPVTMKMAVEEGWYGKPGSKWQTELRALMFQYRAGSFFGNIHAPDIVMGMGRTSEEELDIIDVTKQPDGTYAADIDTLRRASVSPAADRAAQAETSRQADAPAANVEQMDQQAQHHHQGTEQGETTGPDGQSAAPAQTASKSGPLPESEISARLNAANSSDELNAAIALIADVPDPDAEQRLTELASTLRQKFERAGRRASRGSAPSVE >NZ_CP018812|2931714:2951870|2933943_2934303_-|WP_075161161.1|DBSCAN-SWA MQQIQIPPLAEGEVYLHGRVSKTGDVEHTVLVAVNDERLPRELQREWAKSVGGVLFNRFDALVIYNEHRALVKPEWYWTDDDVEWDSGYAWCQGFGTGGQTCGGKDLELRAVAVRRLSI >NZ_CP018812|2931714:2951870|2933059_2933437_-|WP_075161159.1|DBSCAN-SWA MALHTQLPIYRAAEDLLDVVTDVVTNMQRDFKRSIGEKINLECIEIIVLVYRANVATDKSPHLTELIERLQVINLLLRLGFNKRKVDKGSYARAVELTTSIGKQATGWKKAASNRPLHGGQGSHG >NZ_CP018812|2931714:2951870|2940159_2940960_-|WP_075161168.1|DBSCAN-SWA MSDLRIRASSFGKFFDCAYAWEAEHLLGKRKPAGLRALLGTSVHAGTAAFDSARLNGSEITPDDAAGAFVDTLHHPEYDVDFSNDNIKFAEAERIGLTLTTKYCTEVAPQFTYTAVEMPLEPLQIDCGGGTTITLTGTMDRARVAKGVDGTIIPDVKTGARVVTNGEANIKSRSAQTGTYQLMYEQTTGERTAGAQIIGLGTTSKTPVAVSPIFDARRVLIGNEETPGLIQLAADMFKAGLFPPNPSSPLCSKSYCSRWSTCIYHE >NZ_CP018812|2931714:2951870|2941114_2941336_-|WP_075161169.1|DBSCAN-SWA MLTIFRRRPTGITRDRVLELKHEALCARTARRQRDAKAALSRQGVAPRTPISTGYVPEEISRVFTYTNVGGVR >NZ_CP018812|2931714:2951870|2948878_2949055_+|WP_156884057.1|DBSCAN-SWA MESPRPIHADPLLGEVFELLHRMELCRSLEPFQWMASDALKKLRAYEVKQLKEACSGW >NZ_CP018812|2931714:2951870|2931714_2933067_-|WP_083615499.1|DBSCAN-SWA MAERSFNLVAPLAHEATDMRIEDTVRRRADRSSAVSQLSNRSGDVDSAIYPATPGIRTSTTATRTTTTRTTSFAPWPSADWKNGDQAFHFVELVEAYVDCRRTKRNKASALAFEANLEHNLRDLYDELLDGSYRPGRSICFVITRPKAREVWAADFRDRVVHHLLYNRIGPRFERSFVADSCACIKERGTLYAAQRLEAKIRSITQNWTRPAHYLKCDLANFFVSIDKTVLSELLLAKISDPFWQWLTGAVLMHDPRTDFEFRGDPASIDLVPAHKRLMNQPAHLGLPIGNLSSQFFANVYLDALDQRAKHIVRARHYVRYVDDFVFLHESPDWLNAAHADIAAFLLARLCVRLNEHKTILQPIERGVDFVGHVIKPWRRSTRRRTVNEGVRRVANAPAVDVHVMANSYFGLYGQAPASHHDRARLANVVRGRGHAVNHGLTKTYRGTRS >NZ_CP018812|2931714:2951870|2945039_2945324_+|WP_083615502.1|DBSCAN-SWA MLTMSKHRDLHPDAKIIDELGGPTKLAERLGYDKASGGVQRIQNWKWRGIPAHVKVEHPEIFMTDLIDRVKASDDAQPPAGGSVDDAKMAKMVV >NZ_CP018812|2931714:2951870|2951468_2951870_+|WP_156884103.1|DBSCAN-SWA MKPNDLLLQEIRETPADVLAIEMIASYGMPVGREVFETCVWIGRFTQVFRRPDDVLFVYRRDVKLHLCGSPQAKDPNIRQALLDRFPRTGGGKVPQVGVKKQPGPLFGVSTHAWSALGVAVTAAHQLAQKEVA |
28 | Pseudomonas_phage(28.57%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|