Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_LT962939 | Brucella melitensis isolate 1 chromosome 2 | 0 crisprs | DEDDh,csa3,cas3 | 0 | 0 | 0 | 0 |
NZ_LT962938 | Brucella melitensis isolate 1 chromosome 1 | 1 crisprs | csa3,DEDDh | 0 | 1 | 3 | 0 |

CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_LT962938_1 | 1347030-1347112 | Orphan |
NA
Consensus repeat of NZ_LT962938_1
|
1 spacers
spacers of NZ_LT962938_1
>1.1|1347058|27|NZ_LT962938|CRISPRCasFinder ACAGATCTTTCCTTGCGCGCATCTTAT |
CRISPR arrays and Neighbor proteins around NZ_LT962938_1
The CRISPR arrays of NZ_LT962938_1 >merge|NZ_LT962938|1|1347030-1347112|CRISPRCasFinder CCGAACACCGCTTCGCACTTTTCGGGATACAGATCTTTCCTTGCGCGCATCTTATCCGAAAACCGCTTCGCACTTTTCGGGAT >NZ_LT962938|1|1|1347030-1347112|CRISPRCasFinder CCGAACACCGCTTCGCACTTTTCGGGAT ACAGATCTTTCCTTGCGCGCATCTTAT CCGAAAACCGCTTCGCACTTTTCGGGAT
>NZ_LT962938.1|WP_002963792.1|1346316_1346988_+|uracil-DNA-glycosylase MISEPSPNCSICPRLHAFLQEWRQKEPAWHNAPVPPFLPGSDEDVRLLIVGLAPGLRGANRTGRPFTGDYAGELLYSTLCEFGFATGKFEARPDDSLRLLDACIVNAVRCVPPENKPVGSEINNCRQFLTPMLTRFPNLDAVVTLGTIAHQSTVRALGVPVSRHPFGHGKATDIGPLRIFSSYHCSRYNTNTGVLTDAMFRDVFKNVRAYLSEKAAGSARLSV >NZ_LT962938.1|WP_004683386.1|1345710_1346286_+|NYN-domain-containing-protein MFDSREKIALFIDGANLYAASKTLGFDIDYRKLLKAFQKRGYLLRAYYYTALVEDQEYSSIRPLIDWLDYNGYKVVTKAAREFTDSTGRRKVKGNMDIELTVDAMQLTDTVDHFVIFSGYGDFRSLVEALQRKGRKVSVVSTLTTQPAMISDELRRQADHFIDLVSLKAEIGRDPSERAPRRQEEDFDESY >NZ_LT962938.1|WP_002963795.1|1344912_1345314_-|DNA-directed-RNA-polymerase-subunit-omega MARVTVEDCVDKVENRFELVLLAGHRARQISQGAPITVDRDNDKNPVVALREIADETLSPDDLKEDLIHSLQKHVEVDEPEAAPAQIANAAEEIAEGIAEAGEEDVVTFDRMSEEELLAGIEGLVAPEKNDGF >NZ_LT962938.1|WP_004685525.1|1342419_1344672_-|bifunctional-(p)ppGpp-synthetase/guanosine-3',5'-bis(diphosphate)-3'-pyrophosphohydrolase MMRQYELVERVQRYKPDVNEALLNKAYVYAMQKHGSQKRASGDPYFSHPLEVAAILTDMHLDEATIAIALLHDTIEDTTATRQEIDQLFGPEIGKLVEGLTKLKKLDLVSKKAVQAENLRKLLLAISEDVRVLLVKLADRLHNMRTLGVMREDKRLRIAEETMDIYAPLAGRMGMQDMREELEELAFRYINPDAWRAVTDRLAELLEKNRGLLQKIETDLSEIFEKNGIKASVKSRQKKPWSVFRKMESKGLSFEQLSDIFGFRVMVDTVQDCYRALGLIHTTWSMVPGRFKDYISTPKQNDYRSIHTTIIGPSRQRIELQIRTREMDEIAEFGVAAHSIYKDRGSANNPHKISTETNAYAWLRQTIEQLSEGDNPEEFLEHTKLELFQDQVFCFTPKGRLIALPRGATPIDFAYAVHTDIGDSCVGAKVNGRIMPLMTELKNGDEVDIIRSKAQVPPAAWESLVATGKARAAIRRATRSAVRKQYSGLGMRILERAFERAGKPFSKDILKPGLPRLARKDVEDVLAAVGRGELPSTDVVKAVYPDYQDTRVTTQNNPAKAGEKGWFNIQNAAGMIFKVPEGGEGAAAKVDPAATTPKPGKRALPIRGTNPDLPVRFAPEGAVPGDRIVGILQPGAGITIYPIQSPALTAYDDQPERWIDVRWDIDDQMSERFPARISVSAINSPGSLAEIAQIAAANDANIHNLSMVRTAPDFTEMIIDVEVWDLKHLNRIISQLKESASVSSAKRVNG >NZ_LT962938.1|WP_002963797.1|1341833_1342412_-|orotate-phosphoribosyltransferase MNTDDVLAVFREAGAILEGHFILTSGLRSPVFLQKARVFMHADKTEKLCKALAEKIRAADLGPIDYVVGPAIGGLIPSYETSRHLGVPSVWVERENGVFRLRRFDVPKGARVVIVEDIVTTGLSIRETIDCMKDLGIEVVAAACIVDRSAGKADVGTRLISLAEYEVPAYPADKLPPELAAIPAVKPGSRNI 
>NZ_LT962938.1|WP_004683392.1|1340918_1341674_-|helix-turn-helix-domain-containing-protein MVVVPFALDQNQKLAARTPCNVCEDCCVRSMAVCSALDDGDLAALEAIMTSKKLDTNEMLVEEGEPKLRVYSLTSGMLRIYTSLPDGRRQIAGFLFPGDFLGLADDEVYSLSAEAVVPSALCAFSAKEIERLMERFPKLKERLYQMTRLALRTARDNQLVLGRLAPVEKLASFLLVLSARAEKRGEKPNPVHLLMNRTDIADYLGLTIETVSRSFTKLKTQGLIQLRDANTVEILSRRSLAVVAGLDPDNL >NZ_LT962938.1|WP_002967518.1|1339202_1340537_-|oxygen-independent-coproporphyrinogen-III-oxidase MHDDAIRRYAALAVPRYTSYPTAADFVPVGMDTTRRWLRQIGPEESVSLYIHVPYCSQICHYCGCNAKMAIREDVIENFVNALLGEIRTVSASLTARPRVAHLHWGGGTPSILNAGQFQTVLAAIRTAFDFDLSMEHAIELDPRTVTPLLAKTLAEMGVNRASLGVQDVDSRVQMAIGRVQPIETVAEATRLLREVGINRINFDLIYGLPLQTVETLRETCERVVTLSPDRVACYGYAHLPQRRANQRLIDENTLPDADERFRQARIVTDSFIGFGYQPVGIDHFALPDDDLAIAAREGTLNRNFQGYTNDRCGTLIGFGPSSISQFPGGYAQNISDVGQYRKRVEAGELATVRGYTLRDTDRIRSAIISALMCNFCVDLNAVAPGMEFSDEFALLRPLVADGLVAVEGRTIRATENGKSLIRLVAAAFDEFRRDSVHGFSFAV >NZ_LT962938.1|WP_004683395.1|1337807_1339019_+|MFS-transporter MSVSAAQQPSNSSADNTVLTIILATSLGHFLNDMMQSLLPAIYPMLKENYSLSFWQIGLLTFTFQMTASILQPLVGIYTDRKPMPYSLTFGMGCTLIGLVFLATAHHYSFLLLGAACVGFGSSVFHPEAARVARLASGGRHGFAQSLFQVGGNFGSSIGPLLAAFIVLPFGQISVSWFSVAALIGMFVLWYVGNWYNRYRLANANKPKPDKTLPLPRNRVIASVAVLALLVFTKYIYMASLTSYYTFYTISHFGVSVQTSQLLLFLFLGAVAAGTIIGGPIGDKIGARKVIWASILGVLPFTLALPYANLEMTAVLTIIIGLILASAFPAIIVFAQELLPGRVGMLSGLFFGFAFGMAGIAAAVLGIVADQKGIEFVYRICSYLPFLGLLTIFLPKLERRRKA >NZ_LT962938.1|WP_002971501.1|1336716_1337556_-|helix-turn-helix-transcriptional-regulator MPENEDLQRFHEHRIAMLESMTGPAIALPTRYPDGYFVPRHSHSRAQLLCASQGVVLVTTDAGRWMIPSDHAMWIPAGVEHSVAILGDVFMRSIYISVDAISGVPNYLHVVGLTDLMRCLIIDATSHDSIPAPESRDGLVIELILRDLHTLPEYPLGLPFPSDPRLQKLCRDFVKKPSSRATIDEWADRMAMSRRSFTRHFQRETGVSLSVWRQQACLFAAVPRLSEGEPVTSVALDLGYDSVSAFTTMFRRMLGVPPKFYQPRLDVAPFERSDKAAFG >NZ_LT962938.1|WP_002966733.1|1336091_1336667_-|DUF2062-domain-containing-protein MLFQRRYPPTRWERLRLYLWPRRSFSRSFRYGGKRILRITASPHAVAAGLAVGVFSAFTPFFGFHLIIAIVLAYFLAGNVAAAALGTTLANPLTLPFIWGSTFELGRFIMSGSVDSVPPVHLGHALETMRFEEVWTPLLKPMLFGSTILGAAFAVLVYFVTRYAVSVFRRRRLERLAEKHRLHRQRELQKA 
>NZ_LT962938.1|WP_002963791.1|1347124_1347601_-|SsrA-binding-protein-SmpB MNKPKNSPARKMIAENRKARFNFEILDTLEAGLVLTGTEVKSLRANQANIAESYASFEDGEFWLINSYIPEYTQGNRFNHEPRRLRKLLVSRREMSRLFNSVSREGMTVVPLKLYFNDRGRAKLELALARGKKTHDKRETEKKRDWNREKARLLRDRG >NZ_LT962938.1|WP_002963790.1|1347691_1348573_-|4-hydroxy-tetrahydrodipicolinate-synthase MLKGSITALVTPFDREGAFDEKAFRAFVNWQIEEGTKGLVPVGTTGETPTLSHDEHKRVIEVCIEVAAGRVPVIAGAGSNNTVEAIELAQHAEKAGADAVLVVTPYYNKPNQRGLYEHFSRVARSISIPLVIYNIPGRSIIDMTPETMGALVRDCKNIVGVKDATGKIERVSEQRAICGKEFIQLSGEDATALGFNAHGGVGCISVTSNIAPRLCAEFQEACQAGNFAKALELQDRLMPLHKALFLEPNPSGPKYALSRLGRIENVLRSPMVTIEAATAEKIDHAMKHAVLIN >NZ_LT962938.1|WP_050559813.1|1349049_1351107_+|lytic-transglycosylase-domain-containing-protein MVSLAQSSLPPEIPTPLARPFAPTATHQSPISLVTPRPRPIAPDPMATSAVSRTDNPAVIGGTLKNGLDALSAKNVASAIASRNNLPRGSLDRQILTWAIATSGMDGVPSTEIAAAASELSGWPGMATLRRNSERALFKENPSSATIIATFGSTRPQTTEGMIALARAYVATGNSTKAHQLLSPWWTRKRLSSDDEQKILKEFSAILTRADHQRRLLHSLYNGHLQSARLLAGPAQAQSLYNAYAAVAQKSPNAASAIAAVDRSWQANPVYQFLKIRYLRRAERYNEAAELLLKAPRKASVLVDPDAWWVERRILSRELLDLGKPQLAYRLAAAHAAETPTMAAEAEFHAGWYALRALNQPKLAAPHFAKITQISARPISASRAYYWLGRAAEAGSGGDARAYYRRSAHFGTTFYGQLAAAKLNEKAPELAYPKPTEAERVRFASRPAVQAIKRLEQVGYGNKAAALYTQLSQELDSVGELALLAVMAERNDNHYMALRVGKTAAMRGLDVGALSHPLGAIPASANIKGSGKALAYAIARQESEFNVSAVSKAGARGLLQLMPATAKTVATRNGMSFSAQKLTADAAYNATLGAHFLGEQLDRFNGSYVLTFAGYNAGPRRASEWVEKYGDPRGKSVEQVVDWIERIPYSETRNYVQRVMENYEVYKTRLIGRADIKTDLVYGRR >NZ_LT962938.1|WP_005969508.1|1351169_1351826_-|endonuclease MPLLIVQLAILIAVAFVVGCFLGRILRRRKSAGSDHERTIVAAALSTPPLAEKPEPVPPAEPVRADSDPLKRKAGLEKARIEPVEGSADSDVAAAPVLKKVPAETPDDAGRPQWREAPRRGKADELTAIEGIGKAIEAVLHELGIFHYDQIAQWTREEAIWIERRIGFPGRVEREGWIAQAAKLAEPPAKSTTKRGAKTKKAANNVRSGTRRAKKQAG >NZ_LT962938.1|WP_002963787.1|1351904_1352087_-|hypothetical-protein MTVFPIRPFLCYDYSAKGLKCLAKYYAESLMLYLVETFWPVLVLAILIGAATGWLTAGER >NZ_LT962938.1|WP_002963786.1|1352142_1352784_-|hypothetical-protein 
MKKWFWPCLTWTACLTALALWFGADRVETDIAEQTAKALEPYVWAGFYVDGRDVVLKGMAPDPDMQRAAHAALEHVWAIRDITDLTTVLQLASPYRFKIGRNAQGLVLSGFIPNNESRDQVMTAAGEVAPDIFIDDEMAVARGNQPEFMERVLFAIDLAKKLPEAEIEIVDEKLSIRGTVSDEAIYDEIEALKAAQLPYGLKLAVLDVKKSVQ >NZ_LT962938.1|WP_005969513.1|1352927_1354016_-|porin MNIKSLLLGSAAALVAASGAQAADAIVAPEPEAVEYVRVCDAYGAGYFYIPGTETCLRVHGYVRYDVKGGDDVYSGTDRNGWDKGARFALRVSTGSETELGTLKTFTELRFNYAANNSGVDGKYGNETSSGTVMEFAYIQLGGLRVGIDESEFHTFTGYLGDVINDDVISAGSYRTGKISYTFTGGNGFSAVIALEQGGDNDGGYTGTTNYHIDGYMPDVVGGLKYAGGWGSIAGVVAYDSVIEEWAAKVRGDVNITDQFSVWLQGAYSSAATPDQNYGQWGGDWAVWGGLKYQATQKAAFNLQAAHDDWGKTAVTANVAYELVPGFTVTPEVSYTKFGGEWKNTVAEDNAWGGIVRFQRSF >NZ_LT962938.1|WP_002970988.1|1354855_1355959_+|porin MNIKSLLLGSAAALVAASGAQAADAIVAPEPEAVEYVRVCDAYGAGYFYIPGTETCLRVHGYVRYDVKGGDDVYSGTDRNGWDKGARFALMFNTNSETELGTLGTYTQLRFNYTSNNSRHDGQYGDFSDDRDVADGGVSTGTDLQFAYITLGGFKVGIDESEFHTFTGYLGDVINDDVVAAGSYRTGKIAYTFTGGNGFSAVIALEQGGEDVDNDYTIDGYMPHVVGGLKYAGGWGSIAGVVAYDSVIEEWATKVRGDVNITDRFSVWLQGAYSSAATPNQNYGQWGGDWAVWGGAKFIAPEKATFNLQAAHDDWGKTAVTANVAYQLVPGFTITPEVSYTKFGGEWKDTVAEDNAWGGIVRFQRSF >NZ_LT962938.1|WP_004683363.1|1356234_1357161_+|site-specific-integrase MIIVGETKIDTGDKYAPIIDYNLNYISGKNPKHRLVEHYSVAELTAKYINILWDDGPHKYNVRSFLGEIDEILKGARFSGFDQEMLDSIIGTLRERGNSNATINRKMAALSKLLRKAHKMGDIFNLPEFIRQKERVGRIRFLEQEEEKRLFAAIKSRCEDSYRLSVFLVDTGCRLGEAIGLTWNDIQEQRVTFWVTKSNRSRTVPLTRRARKASHIPRERLKGPFSMLNQVRFRQIWNEAKAEVGLGADDQIVPHILRHTCASRLVRGGIDIRRVQMWLGHQTLQMTMRYAHLATHDLDSCVKVLEIH >NZ_LT962938.1|WP_002963782.1|1357523_1358657_+|pyridoxal-phosphate-dependent-aminotransferase MAQPRLTPLVESLPSTVPFIGPETLELQRGKPFEARIGANESSFGPAPSVIEAMRNEATEVWKYGDPENYALRHAIAAHHGLKAEHIMPGAGVDALLGLIVRQYVQQGDKVINSLGGYPTFNYHVAGYGGQLVTVPYRDDKPDLDALIDAAAREKPALLYIANPDNPMGTWHGGADIQSFIERLPETTLLILDEAYCETAPASAFPPFETDRPNVLRMRTFSKAYGLAGIRCGYAVGNPVAIKTFDKVRDHFAVSRMAQAAAIAALKDQAYLHEVVGKICAGRDRIAAIAEANGLHAVASATNFVAIDCGRGKDFAQAVLNGLISRDIFVRKPGTPVLDRCIRVSVGVKEQLDQFEAAFPEALEEARKICAANAENT |
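
The array record for NZ_LT962938_1 lists one merged sequence and its repeat, spacer, repeat units. The sketch below checks that the units tile the reported location 1347030-1347112 and that the spacer starts at the coordinate given in its header (1347058). It assumes the coordinates are 1-based and inclusive, which is not stated in the record; the function name `layout` is illustrative and not part of any tool mentioned here.

```python
# Sequences copied from the CRISPRCasFinder record for NZ_LT962938_1.
ARRAY_START, ARRAY_END = 1347030, 1347112  # assumed 1-based, inclusive
MERGED = (
    "CCGAACACCGCTTCGCACTTTTCGGGAT"
    "ACAGATCTTTCCTTGCGCGCATCTTAT"
    "CCGAAAACCGCTTCGCACTTTTCGGGAT"
)
UNITS = [
    ("repeat", "CCGAACACCGCTTCGCACTTTTCGGGAT"),
    ("spacer", "ACAGATCTTTCCTTGCGCGCATCTTAT"),
    ("repeat", "CCGAAAACCGCTTCGCACTTTTCGGGAT"),
]


def layout(units, start):
    """Assign 1-based inclusive coordinates to consecutive array units."""
    rows, pos = [], start
    for kind, seq in units:
        rows.append((kind, pos, pos + len(seq) - 1, len(seq)))
        pos += len(seq)
    return rows


rows = layout(UNITS, ARRAY_START)
for kind, first, last, length in rows:
    print(f"{kind:6s} {first}-{last} ({length} nt)")

# The units should rebuild the merged sequence and end at the array end.
units_match_merged = "".join(seq for _, seq in UNITS) == MERGED
array_end_matches = rows[-1][2] == ARRAY_END
```

Under that coordinate assumption the spacer spans 1347058-1347084, which agrees with the Spacer_region reported for this array.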

CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|

CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_LT962938_1 | 1.1|1347058|27|NZ_LT962938|CRISPRCasFinder | 1347058-1347084 | 27 | NZ_CP013986 | Klebsiella variicola strain LMG 23571 plasmid unnamed, complete sequence | 45587-45613 | 5 | 0.815 |
1. spacer 1.1|1347058|27|NZ_LT962938|CRISPRCasFinder matches NZ_CP013986 (Klebsiella variicola strain LMG 23571 plasmid unnamed, complete sequence) position: 45587-45613, mismatch: 5, identity: 0.815
CRISPR spacer  acagatctttccttgcgcgcatcttat
Protospacer    tcagatctttccttgagagcatctgtt
Match           ************** * ******  *
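
The mismatch count and identity reported for this hit can be reproduced with a simple ungapped, position-by-position comparison. The record does not document how the database scores hits, so treating identity as matches divided by spacer length is an assumption; it does agree with the reported values (5 mismatches, 0.815). The function name `compare` is illustrative.

```python
def compare(spacer, protospacer):
    """Ungapped comparison of two equal-length sequences.

    Returns the match line ('*' for identical positions, ' ' otherwise),
    the number of mismatches, and the fraction of identical positions.
    """
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length")
    match_line = "".join(
        "*" if a == b else " " for a, b in zip(spacer.lower(), protospacer.lower())
    )
    mismatches = match_line.count(" ")
    identity = (len(spacer) - mismatches) / len(spacer)
    return match_line, mismatches, identity


SPACER = "acagatctttccttgcgcgcatcttat"       # spacer 1.1 of NZ_LT962938_1
PROTOSPACER = "tcagatctttccttgagagcatctgtt"  # hit on NZ_CP013986

match_line, mismatches, identity = compare(SPACER, PROTOSPACER)
print(SPACER)
print(PROTOSPACER)
print(match_line)
print(f"mismatch: {mismatches}, identity: {identity:.3f}")
```

The mismatches fall at spacer positions 1, 16, 18, 25 and 26, so 22 of the 27 positions are identical.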

Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1040559 : 1048896
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_LT962938|1040559:1048896|DBSCAN-SWA TTCACCAGTTGTGGGTAACTGCGTTGCCTATAATTACAGTCTGACAATTTGCATTCAAAGAGATACCTTGGGAAACAGTGTTAAGATTTCCTGGTGCCATATTCCGGCGGCCACCACTGCGGATCGTATTGATGTTTATGCGATCTACGTTGATAAAATTGAACCCAGTTCCGCCGTCGATGGATGAATTGAGATCGTTTCCATCTACGTATTGGACGTTTTGGCAAAACACGCCATCACCCTTGTTATTATCTGCAAACAGGCTATTGATGTGGAGGGCTGCTCCGTCTGCAAAGATGAACCCCCGCTGTTCAGCATCTGGGACGTATCCAAGGTCTGTATATCCATCCAATCCAGTGGCTTCGGTGCGGCGAAGAAATAACCCCTCAAAATTTCGAAACTCATATGCTGGTAGGCTTACACCACCATATTGCGGCCCTAACCCAATTTCAACATCTTCAAAGCGCAGAAACGCAAAGCCATCCCTCATGCGGATGCCACGCCCACGGCATCCGAAGGCATAAACGTTCCTAATTAAAATGTCCCAAACATAAGGTGTGGCTGCGGCCCCCTTTACATTGTTATCGTCAATCAACCCACCGGAATTAAAAGACCGAACATTCTCAATAAGCGTATGTTGAATGTATCTGGATGCATCATTTTCGATAGTGTCGAAATGGAACAACGCTGCTTGCCCCGTCATTTCAGACCCATCTTCTATGAAATCTCCAATATTTATATGATTTGCTCTAAGTAAAAAACGACCAATAGCACCGACGCGCTTTATTTTTGAGCCACCAGATGCGCCACGCAACTTGACAGGGTTATTGATTATAACGTCTCCATCAAACAGATAACTGTCGTCTGTCTTTGGAATTACGACTGTTTCACCTGAATTAACGGCAGACTGCAACGCAGAAATTGCACTGTTGCTTCCATTGTCAATCACTGTAGTCGCAAAGTCTTTCAGCGAGAAAAACCCGCGAGCCTGACCTTGTTGCAGAGTTCGGAGAAGGTTTACGACTAACCCATCGTTCTCTGGCGTACCCGGCGCTTTTGCTTGAGTTTCGGTGGCAAACTCTACAGCAAGGCCAACATCATAACCGGTCCCATCTTGATTAACGATAAGAGACTTACCTGCATCGGCGGCGCTGATTTTGGGTAGATTAACACCGGATGCCGCCGCCTCCGCCCGTGCGGCTGCGTCTTCAGCGGCGGCAAGGATGGCTTCTGCATCGCCATAGGACAGCAAACGCAATTCTGTGCCGGTATCAATGCATAGCAGCGCCATGCCAGGCATCAGGTATCCTGCCGGGACTGGCTGATTGATATTCGTAACCAGATCGCGATTAATTGTGCCAGATACCGTCACCGGCCCCGTGTTCTCCTGAGTGACATTCAGGATATAAAGCACCTGATACGCAGCCGTCGGGATCGCGACAGAAGATGTCACGACGATATTATTGGCTCTGTACCTTCGTCAGCGTTGTTCAGACGAATGACGCGGTTATCAGGGAACGCCGAAAGGCCCTCAAGCAACGCTTTCAACGTATCGCGAAGGTCAGGCTTGTAAGGATGAAACGGCCCTGATGCCGGAACACATCAATCACGAAATCGCGGAAAATCTCGTCAATCGTGCGAACGGTCATGCGAATGCTCCATAGCAAAACGCCCCGCCAAGGCAGGGTGTGGGTTTCTTAGTTTGTTGGGGTGTCAGGGCATCGTTTTGATCGTGCTGGCCGAGTAAGCGCCTACGCGTCCCTTCTGGGTACGAACGGCCAATTAGAACTCGTACTGTGTCAGCGCTGACAGCGTCGGTGTTTCAAAGCTCTCTGCGTCGTTTTCGAGCGGACCAGCCACACGCCATTCGGTATCAGCCGTCTTTTTCCAGCGGACCATATAGTTCAACAGGATGTTGCCGGT
AGGCGGGAAGCTCAGTTCTGCTGCTGGTCCAGCGATAATCAGAACATCTGGCGCGTCTGGAACTGGCAAATCATCGTCCGAAGTAGTTTCGTCCGATACCGGCGCTGTGCCTTCCTGCGATGTGTCCCACTGGTAAGCAGTGTCCGTCATGGACTGGACCTGAATGGTCGCCCCTTGCAGAATGCCTCCTTCACCAAGGATGAATTTGAAATCAAGGACTTCAAAAACACTGTTAATTCCGAACAGCGGGTATTGGATGCCGATCAACCGTTCGCCGAAAGCAGCAAGGCCCATCAGGTTCGTGTTGAACGTCCCTACCCAATTCGGGTTAGCTCGGAACCATTCGAGCTTCATCAAGCGTCTTGCCTGGCTGTGCGAAGGAGCCATATTGAATTGAACATCTTTGGCTTCTTCACCACGCTCCGATACATCGGCCTCATCAGCCCAAGGATCTGCGTCAGACGCCTGATAATCTTGATTGATGTCGAGGAATGTTGCCCGGATCGTGTTTGCCGTTGTCATCACGTCACGGCCAACCGGCGGGTAGTTTTCCAAAGACCGGACCTCATTGATAGTCATGGCCCCAATAGCGGTCATCTGCTGGTAGAACTTCGCCCTGGGGCGTGTCTGCATTCAACGTAACCAGATCATAGCGCATGCGAGATGGACGAAACCCACGAATGCGGTCAATGTTTTCTCGCATCGCAGCGCAATACGACGATAGCGTTTCAACTTGTTAAAAAAGCATTCAATCTAATGGCGTTCCTTGTACAGCCTCCAGTCGATTGTTGGGACACTGGAACGTGTTGGATTGACCTTGATCTGAGCCGTTGCCTTGAGATTGCTGGCAATGAAGGCCCTTAAGTGATCGGCATCATAGGCTGCATCAGCAATGACATGCCCCACACCCTTCAAGCCGGATAGAAGGCTTGAAGCTTGCGGACAGTCACCATAATGGCCGGGTGTTGGCTTTATTCGCAGCGGTAGGCCGATAGCATCGACAACAGCATGCAGCTTGGTCGTCAATCCCCCGCGCGAGCGACCGATGCAGGCAGCTTCAGCCCCCCTTTTGCGCCCGCCGCATCTGCGTGGACTTTCGATATGGTGCTGTCAATGAGGACATATTCAAAGTCCGGCGTATCAGCCAGGGCATGGAAAAGCCTTTCCCATACACCGGCGTGCGACCAGCGCCGAAAGCGGGCATGAACCGCTGTCCATTTGCCGAAGGTCGCAGGCAGATCGCGCCAGTGCGCTGCATTGGCAGCCATCCACAAGATGGCGTCGACAAATAATCGGTTATCAACGCCACTGCGGCCGGGCGTACCAACTCGCCCCGGAAGATATGCTTCGATCCGGTTCCATTGCTCATCTGTAAGGCTTCGTCTGCTCACGGCTGTTCTCCTTTAACAACCTTGAATCAGAATTTCGTACAAAAGGGAATCCTTGAATGCAGACAGGCCCTAGAGCGCAGGCATATCGCCTCTTGCCCAGCCTTCCGTGGTTTCAGCGGTAAAATCACTGAAATTTCCATCGGCAATCGCCGCGCGAATGCCCTTCATGAGATATTGATAATAGGCAAGATTGTTCCAGGTGAGCAGCATTGCACCGAGCGCCTCGCCGGACTTCACCAGATGATGCAGATAGGCGCGGCTATAATCGCGCGATGCCGGACAATCGGATTGCGGATCGAGCGGGCGATGATCTTCCGCATGACGCGCATTGCGCAAATTCACCTTGCCGAAGCGGGTGAAAGCAAGGCCATGGCGCCCCGCGCGCGTCGGCATCACGCAATCGAACATATCGATCCCGCGCGCCACGGATTTCAAAATATCATCTGGCGTGCCGACCCCCATCAGGTAGCGCGGCTTTTCCGTTGGCAGGATCGGGCAGACCACTTCCAGCATATCCAGCATGACTTCCTGCGGCTCGCCCACCGCAAGCCCACCAACCGAATAGCCCTTGAGGTCCATGGCCTTGAGCGCTTCTGCCGATCTTTCGCG
CAGACGCGCGATATCGCCCCCCTGCACGATGCCGAACATGGCCTTGCCCGGCTGGTCGCCAAATGCCACCTTGCAACGTTCGGCCCAGCGCAGCGAAAGCTACAGGAGCAAGGCCGAGGAAGGAAAGGAAGCGACGGCGGTTCATAATACCCTCACGTGTACGTACAGAAAATATTATTTGTATGTACGATTATTATTGCAAGCCCAGACGAATTATTGTACATACGTTTTATGAAGATCATCTGGGACGAACCGAAGCGACAGACCAACATTGCCAAGCACGGCTTGGACTTCGCTGACCTGCATTTCGAATTCTTCCTGTCGGCTAAGGTCTTCCCCACCAAGGCAGATCGCCTGATGGCAATCGGAGAATTCAACGGCCTGATTATCATCGCCGTCTTTTTCAAGCCGGTTGGTTCGGAAGCCCTCTCCGTGATCTCCATGCGTTCAGCAAGCCAGAAGGAAAGGAAGCTCTGATGGGTATCAAATTTTCCTCTAAGCGCCCTCTCACTCAGGAGGAAGAGGCAGAAATTCAGAAGATGATCGCAAGCGATCCCGATGCACCGGAAGCCACCGATAAACAGCTTGCCAAGGCCAAGCCTTTCAAGGAATCCTTTCCAGATATGGCCGCCAAGATGGAGAAGGCCATTCGAGGCCGGCCGCGCATCGACAATCCCAAGACGCCGGTTACTATCCGGCTCGATCAGGATGTAGTTCAACGCTTCAAGGCCACCGGCAAAGGCTGGCAGGGCCGTATGAACGATGCCCTGCGCAAAGCTGTCGGGCTTTGATCAGCAAGTGACCTGTGGCTTGAGAAGCGGCGACACTTCCGAGGTCCCTTTTCATCCCATGCTGGCTGAGACTAAGGTTGAATTTCTTGACCCGTTTACAACCAATCCGTTTTGGTTAAGCTGCCAACCGTGCGTTTCGGAAAAACGCAATCATGCGGGGGATAAATCCGACAGTTGCTGGATAACGAGCGAATACCTCGCCATCGTATCGAGCGAGTGCGAGGGACGATATGCGCTTTCTGGCTTCGTCGACAGAACGCGCCTGTATTTCAATGCTCCATTTAGCGCCATCATATCGATAGCTGAAAAGGAATTTACGAAACACATCATTCTTGGAGGAGACAGACATGCAATACGTCCTTTATCGAGATAACGCCGGGTACTGGCGCTGGCGTCTTCTGGCCAACAATCACCGAACTATTGCCGACAGTGGCGAGGGATACGTGAACAGAGCTGATGCGATAAATGGCATTAATCTCGTCAAGAGCTCTGCCGCTGCCCCTGTAGTCGAGAGATAATCACAACGACCAAGCGTTGCGCACCAGCCACTGGCTTCTTGGCTTCCGTTACTATCTGGTGCGCCGCCGTTCTAATTCCGTTGCCCTCATGTCTCGCGCTACCCTGATAAGTGGGGCGGTACTGCATGAGGGCGAATATCTGCCCTAGTCGGCGCTCTTTCGAGCCGTCTGGCTACGAGGACTTCCCCAGACCCATTCTCCAACTACGGCTATCTTCGCTGGCCGTCAGCTACATGCGAGGATCGGCGCTCGATGTGCGAAACGCCATCGCATTCCGTTGAGGCAGAGCCTCGAATAAAAAAGCCGCCCTGCGAACAAGGCGGCTCTATATCTGATCAAATATCAGATTAGAACTTGTAAGCTACGCCCAATCGGATGTCGTGTGTCTTAAACTTGTTGCGAGCCTCAACGCTGAGATCGCCATCAACTACACTGAAATCCTTATGGCCGTAGTCGGTATAACGATATTCAAGACGTACGATGACATTGTCAGTTGCCGCGTAATCAACACCAGCGCCAGCGGCCCAGCCAGTCATAGTCTTGCTCTGAGAAACCCCAATGCTCTCTATACCGTCCGTGAGTGAAACTGAATTCTTAACGTTTCCGAAGGCCACACCACCGGCAATGTATGGCATCCATCGATCCATTGCGACACCCATGCGAGCGCGAACAGCGCCTGACCAGCGGAGCT
TGCTGTCAACACCAGTGGTGAGATCAAGATCTGGGTCGGTGTCACTGTGGCTAGCATCTAGATCATTATAGGTAATGTCACCGTCAACACCGAGGACGAAGTTGTTCCCCATGTCAAAGTTATAACCGGCATAAAGACCGCCGAGGAAACCGTCTGGCTTTACTCGGATAGATGTGTCTTCATCGCTCAGCGTCGAACGGCCCCAACCATAACCGATCTGGCCGCCGAGATAGGCGCCATTCCAGGTGAAGGTCGGAGCAACAACAACCGGTGCAGGTTCCTGTTCAATGACGGCGTCAGCAGCCTTTGCGCCAGTCGCAGCAACGAGAACGACAGTTGATGCAAGAAGAAGCGATTTAATGTTCACGGTTCCCTCCAAACATTCCAATATTCATTGATCTCTTAATTATATCATCCGAGTGAAATGTCTGTAGTTTTTCAGCAACACTTGCAACCATAACGCTCATACTGAACGCCACACATTTCATAGTAACCAAACAGTAAAAAAGCGGCCCGAAAAGCCGCTGTAATCCTCATGTCGCAATTCTGCATCCTGAAACTATGCGCAGATTATTCTGCTTTGCTCGATATGGAAAGACAATCATGCGACCTTCTTGCGGTTATGTGTGAAAAAATGACGATAAAGTGCGTTACAGGCGAGACGAATATCTCCAAGCATATGCGGGAGGAATTGATCATTCTCAAGCATGAATTGAACGGCAGCATAGAGATTGCTATTTCGCGTTTCGGCCTGTGCTTCATCTATTGCCCCACGAGCGTCGGCATAAGCCTGCTTGGCGCGCTTTACCCACGCCTCATACTCATCGGGGTCGGAAGCACCAATCGAACCTATGTGGTCATAGTGCGCGCCAGCGGAACACTCCGCCTTACGCTTGTCGTTGAAAACTTCCCTGTAGCGCTCTGCGGCGTCATACTGGCTCGTGCTGATGCCCTCCGCCTTGTCTCGCTTCCATGCCATATGCAAGCGCCCCAGATTATCCCCCGCCAAGTCGCTTAATGCTTCCGATGCGCTCATGCCGAATATCCTCATCCTTGCCAGTTGCGCCACCTTTGCGGGCGGTTCCTTTGCCCTGCTGATCTGACCGGAAGGTGCGCGCAAAACTCCCTCCTTCTTTGGCCTTCCTCTCGACCGCTTGGCTTGAAGCTTCGCCGCTTTCGTCCTCGCCATGTCCGTTCCTCGCTGACTGGGGTTATGCTGCCTGTTTGTGCGCTGCGCTGAAAGGCTCAAGCCCCTGCGCAAGGCGAATTTCATCTGCCATTGCGATTAGCCCTGGCATGGACATACGTTTCCACTGGCCTTTGCCACGACGAACCTTGACGCCGCCTTTCGGTGCTCCGGCTACCTCGTAGCCATATTCAGCCATGAATGACGCTATCCCGTGAATGCCACTGGTGGCGCCCTGCTCGAACCGGCGTGCACCATGCTTGGCGAGGTACTCGTTGATCAATGCAGTCTCCGGCGTCATGCTAGCTCTCCATGTGGAACGGTGCGGACCTTCTTGCGCTTCGTGGAACTGATCGCTCTGTGAACACGCATGAGAGCATCTGCTTCGGTGATGCCCATGATTTCAGCTATTTCCAGCGTGTCTTTGCCCTGTCGGAAAAGGCGTAAGGGCTTGAAAATTTCCTGATCCTTCTTCGACCAGTAATACGACGCCGAACGGTCAGTTTTTGCGTGATAGGCGGCGATCATGCTGCAATCCTCCCCTGCTCCGTCCACGGGTCGCCGTTCGACTTTCGACCATCTGTCGGGAGGATGAGATGCGCAGGGACGCGGCAACCCAGCATATTCGGCATCGGCCCCCACACGTCAGAGTGCCAACACCGATTGTTGCGGGCGTAGTTCATGAATTTCTTCCAGCGGGCGTCGCGTTCCTCCTGGGAAAGCTCGCTGTCCGGCTTGTAGCTGCTGGCCTGTCGCTGGGCTTCCGGCTTCCACCCAGACTGCATGGCATTTTTCGCCCATTCCGCGTCGAAGC
CTTGCCATCCGCGCTCGATCATCGGGTCCGCCGCCTGATCGGCTGTGAGGCCGCAGGCATCCTTTGCGGCGGCCAGCCGCTTGGCGAGCAGTTCAGCACCGCGCTCGGTCAAAGGTTTTTTGATCTGGGCCCTGTGCTCGACCACTGCGACGGCTATCGGCTGGGAAAGGACAGCTTCGAGTACCGAACGGGGGGATTTCTTTTGGACCTCTTTAGGGGTATTTTCTTCGGAAGGGGTATGGGGAAACCCTTCTTTTTGAATTTCCTCTGTGGCGTCACTTTGCGTCACTTGTGACGCTTTGTTACGCCTGTAACGCTCTTCCCGGTAGGCGAAGCGCCGGGAAGCGAAGTTCAAAGGCGTGAAGCGTCCGTCTTCCAAGCTTGCCACCAAGGACAAGCAGCCGAAACGCCTCAC
Protein sequences of DBSCAN-SWA_1 >NZ_LT962938|1040559:1048896|1047692_1047968_-|WP_002964092.1|DBSCAN-SWA MTPETALINEYLAKHGARRFEQGATSGIHGIASFMAEYGYEVAGAPKGGVKVRRGKGQWKRMSMPGLIAMADEIRLAQGLEPFSAAHKQAA >NZ_LT962938|1040559:1048896|1042340_1043114_-|WP_002967634.1|DBSCAN-SWA MQTRPRAKFYQQMTAIGAMTINEVRSLENYPPVGRDVMTTANTIRATFLDINQDYQASDADPWADEADVSERGEEAKDVQFNMAPSHSQARRLMKLEWFRANPNWVGTFNTNLMGLAAFGERLIGIQYPLFGINSVFEVLDFKFILGEGGILQGATIQVQSMTDTAYQWDTSQEGTAPVSDETTSDDDLPVPDAPDVLIIAGPAAELSFPPTGNILLNYMVRWKKTADTEWRVAGPLENDAESFETPTLSALTQYEF >NZ_LT962938|1040559:1048896|1044712_1044958_+|WP_004683741.1|DBSCAN-SWA MKIIWDEPKRQTNIAKHGLDFADLHFEFFLSAKVFPTKADRLMAIGEFNGLIIIAVFFKPVGSEALSVISMRSASQKERKL >NZ_LT962938|1040559:1048896|1040559_1042011_-|WP_006144496.1|DBSCAN-SWA MTSSVAIPTAAYQVLYILNVTQENTGPVTVSGTINRDLVTNINQPVPAGYLMPGMALLCIDTGTELRLLSYGDAEAILAAAEDAAARAEAAASGVNLPKISAADAGKSLIVNQDGTGYDVGLAVEFATETQAKAPGTPENDGLVVNLLRTLQQGQARGFFSLKDFATTVIDNGSNSAISALQSAVNSGETVVIPKTDDSYLFDGDVIINNPVKLRGASGGSKIKRVGAIGRFLLRANHINIGDFIEDGSEMTGQAALFHFDTIENDASRYIQHTLIENVRSFNSGGLIDDNNVKGAAATPYVWDILIRNVYAFGCRGRGIRMRDGFAFLRFEDVEIGLGPQYGGVSLPAYEFRNFEGLFLRRTEATGLDGYTDLGYVPDAEQRGFIFADGAALHINSLFADNNKGDGVFCQNVQYVDGNDLNSSIDGGTGFNFINVDRININTIRSGGRRNMAPGNLNTVSQGISLNANCQTVIIGNAVTHNW >NZ_LT962938|1040559:1048896|1048191_1048896_-|WP_006137281.1|DBSCAN-SWA MRRFGCLSLVASLEDGRFTPLNFASRRFAYREERYRRNKASQVTQSDATEEIQKEGFPHTPSEENTPKEVQKKSPRSVLEAVLSQPIAVAVVEHRAQIKKPLTERGAELLAKRLAAAKDACGLTADQAADPMIERGWQGFDAEWAKNAMQSGWKPEAQRQASSYKPDSELSQEERDARWKKFMNYARNNRCWHSDVWGPMPNMLGCRVPAHLILPTDGRKSNGDPWTEQGRIAA >NZ_LT962938|1040559:1048896|1047082_1047670_-|WP_004683738.1|DBSCAN-SWA MARTKAAKLQAKRSRGRPKKEGVLRAPSGQISRAKEPPAKVAQLARMRIFGMSASEALSDLAGDNLGRLHMAWKRDKAEGISTSQYDAAERYREVFNDKRKAECSAGAHYDHIGSIGASDPDEYEAWVKRAKQAYADARGAIDEAQAETRNSNLYAAVQFMLENDQFLPHMLGDIRLACNALYRHFFTHNRKKVA >NZ_LT962938|1040559:1048896|1044957_1045272_+|WP_004683740.1|DBSCAN-SWA MGIKFSSKRPLTQEEEAEIQKMIASDPDAPEATDKQLAKAKPFKESFPDMAAKMEKAIRGRPRIDNPKTPVTIRLDQDVVQRFKATGKGWQGRMNDALRKAVGL 
>NZ_LT962938|1040559:1048896|1046137_1046848_-|WP_004683739.1|DBSCAN-SWA MNIKSLLLASTVVLVAATGAKAADAVIEQEPAPVVVAPTFTWNGAYLGGQIGYGWGRSTLSDEDTSIRVKPDGFLGGLYAGYNFDMGNNFVLGVDGDITYNDLDASHSDTDPDLDLTTGVDSKLRWSGAVRARMGVAMDRWMPYIAGGVAFGNVKNSVSLTDGIESIGVSQSKTMTGWAAGAGVDYAATDNVIVRLEYRYTDYGHKDFSVVDGDLSVEARNKFKTHDIRLGVAYKF >NZ_LT962938|1040559:1048896|1045619_1045790_+|WP_002964095.1|DBSCAN-SWA MQYVLYRDNAGYWRWRLLANNHRTIADSGEGYVNRADAINGINLVKSSAAAPVVER >NZ_LT962938|1040559:1048896|1047964_1048195_-|WP_002964091.1|DBSCAN-SWA MIAAYHAKTDRSASYYWSKKDQEIFKPLRLFRQGKDTLEIAEIMGITEADALMRVHRAISSTKRKKVRTVPHGELA |
10 | Brucella_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA').
|
DBSCAN-SWA_2 |
1119610 : 1131538
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_LT962938|1119610:1131538|DBSCAN-SWA TTCAGCTTTCCAGATTGAGCGATTGCGCAAGCTCCGCATCCACCGCGTCACCGGCATCGGTTTCGCGCGGCTTGAGACCGAACTGAACAAGCAGCGGTGCTGCGATGAAGATTGAAGAATAACTCGCCACAATGATACCGACACTGAGCGCCAGCGCAAACATGCGAATTTCCGAGCCGCCAAAAGCATAAAGCGGAACATGTGCCAGGAAGGTGACGAAAGAGGTCAACAGCGTTCGCGACAGGGTCTGGTTGATCGAGGCATCGATGATCGCGGGCAATGGCGCACTCTTGTAACGGCGAAGGTTCTCGCGCACCCGGTCATAGATCACCACCGTATCGTTCAGCGAATAGCCAATGATCGTGAGGATTGCCGCCACGCTCCACAAATTGAATTCCATGCGGAAAACGATGAACATGCCTGAAAGAATGACGACATCATGCAATGTGGAAAGCACCGCGCCCAGTGCGAGCTGCCAGCGGAAGCGGAACCAGACATAGATGAAGATGCCAATCAGCGACAGGATGACAGCGAGCACGCCTGCGCGCGACAATTGTTCCGATACGGTCGGCCCGACCACATCGACCCGCTGGAAGGAATAATCCTGCTCGAATTCACCGCGAAGCTTGACGGCAACGGTCTGCTCCGCATCGTCGCCCACTTCCTGACTGCCGATAATGACAAGGGCCGAGCGCGGCGATTTCGCAGGCAGAACGCGGGCGCTATCGATATTCAGCTCCGCGAGCCGCTCGTTGATATCTTCCAGATTGGCATCGCCATTGCGCGCCTGCAGCTCGACCATCGAACCGCCACGAAAGTCGATGCCGTAGTTGAAGCCGATATTGACGAAAAGCGCGACCACGATTGCGCAGGCCAGCACCGAAATGCCGAGCGTTACGAACTGGAGGCGCATGAAGGGAATATGAGTGACGGTTGGCACCAGTTTCAGGCGACGCTTGGGCACTTCCTTCGGCTTGGCGGTGCGCACCCATTGGGCGATCAGCAGGCGCGTGAAAGTGAGCGTGGTGAAAAGCGTCGTGCCGATGCCGATTGCGACCGTGAGCGCAAAACCATGCACCGTACCCGACCCCAGAAGGAACAGAACGAGTGCGGCGATAAGCGTGGTAAGATTGGCATCCACAATGGTGGAAAGGGCGCGATAAAAGCCCGATTCCATCGCCTGCACGACGGAATAGCCCTTGCGGCGATCCTCGCGTACACGCTCATAGATCAGAATATGCGCATCGACCGCAAGGCCGATGGTCAAGACGAGACCTGCAATGCTCGCAAGGCTTATCGAAGCGCCAATGAGCGACAGAACAGCCGTCAGGATGATGATATTGACCACAAGCGCAACCAGGGCGATGACACCCAGAATGCCATAGGAAAGCACCATGAAGAGGCCGACCACCAGTGCCGCCAGAAGGGCCGCAAGCACTGCCGCACTCGCATAATCCTCACCGAGCGCGGAGGCGATGGTGCGCTCTTCAAGCACAGTCACCGCTTGCGGCAAGGCGCCAGAGCGCAGAACCACGGCCATATTATTGGCGGCCTGTAAATCGAATGCGCCTTCGATCTGCAATTCACTGGTGTCGAGCGGGCCGGAAACCGTGGGCGCGGAAACCACCTGATTATCAACGACGATGGCGAAGGAATTCTCATTGCCCTGCGCCGTCAGATCGGCCAGACGCCGGCGTCCGTTGTCGTCCAGTGTCAGCGTGATGACCGGCTGGCCATCGTCGGCGGAAATACTCGCCTTGGCATCCGTGATGTCATGCCCGGTGAGAATGGGTGTTTTTTTCAGAAGATAACCAACCGGAGGATCATCAAAGGAATAGACGATTTCGCTATCGGCAAGCGGTGTGCCGCGAATGGCGTCATCCGGCGACATGGTGTCGTCCATGGCGCGGAAGGAAAGATTTC
CGCGAATGGTGAGAATATCTTTGAGAAGCTGCGCATCGTAAAGGCCCGGAACCTCCACGCGAATCTGATTGCGCCCCTCTCCCTCGACAACGGGATTGCCATAGCCAAGCTCTTCCAGACGCTGCCGCATGATATTGGCAGTCGTCTCCAGATCGGTCTTGCCTGCATTCTGCACCTGAAGAATAAGCCGCGAGCCGCCCGAAAGGTCGAGCCCCAGCGATACCTGCTTCTTGGGCAGAAAATCCGGCAGATTTGCCAGCGTTTCACGCGAGAAAAAATTTGGAGATGCGATGATGAGGCTGACAAGGACTGCCAGCCATATCAGTGCAGATTTCCAGCGTGAAAAATAGAGCATGGAATGCAAGTCCGTTTCAGCTTGGATGAAGCATTGCCGGAGTTCACCCAAAGGGAAAATCTTCCGGCTATTTGCACGCAAACGCGATTTTGCGCGCGTGCTGCATAATGAACGGGCTCAGGTTATTTAAACCTTACCGGAGCCCGCCTGCTCTTTCAAAAAAAAAGAGATTATTTGTTCTTGTTGTCGGCGACAGGTTCGCCCTTCACGCGCACATCCATCAAGGTGGCACGCACAACGCGAATACGAACGCCATCGGCAATTTCGAGTTCGAGTTCGTTGTCATCGACAACCTTGAGAACCTTGCCGACGATGCCGCCACCCGTCACGACCGTATCGCCACGACGCACGGAGTTCAGCATTTCCTGACGCTTCTTCATCTGGGTCCGCTGCGGGCGGATGATGAGAAAGTACATGATGACGAAGATCAGGATGAACGGCAGGATGCTCATGAGCATATCTGGTCCAACAACGCTGCCGGATGCTTGAGCGAAAGCCGGTGTTACGAACATTAGGAACTCCTCTGAAGTCACGAAGACAAATTATTAACGGCATTACAATGGTTGGTGCTTCAATTGCAACCGGCGATGAACCCGCCTGCGCCAAGGTCGCACACTTTTCGCTTCCGCGTTAGCATGTTAAGTCGTTACCCCTACCATCAGGGGCCTAAAATGACGACCGAAGATATACTCAGCCAGAAACTGGACCGTTTGATAGCCGCGCTGGAACGCATTGCACCGCCGGAAGCAAAAACACCCGATCTCGACGCCGCAGACTGTTTCGTCTGGGCCCCGGAGCGGCTTGTTCTCGATCCCGTCAGCCGTGTGAACCGCGTGGATATCGCGTTGATCCGTGGCGTAGATTTGGTACGTGACCAGCTTGTGGACAATACCCGCCGCTTCGCAAAAGGTTTTCCCGCTAATAATGTTCTGCTTTGGGGCGCGCGCGGCATGGGTAAATCATCGCTCGTAAAGGCCGCGCAGGCGAGCGTGAACGCCGAATTTCCGGATATAACACCGCTGAAGCTTGTCGAAATTCATCGTGAGGATATCGATAGCCTGCCGGTTTTGATGAATCTCATCAAGGAAACGCGCCATCGTTTCATCCTGTTCTGCGACGACCTATCCTTCGATCATGACGATACATCGTATAAATCGCTGAAAGCAGCACTTGAAGGCGGCGTGGAAGGACGCCCCGACAATGTGATTTTCTACGCAACATCGAATCGTCGCCACCTGATGCCGCGCGACATGATCGACAATGAACGCTCGACGGCGATCAACCCCTCAGAAGCCATCGAGGAAAAGGTTTCACTGTCGGACCGCTTCGGCCTGTGGCTCGGCTTTCATAAATGCAGCCAGGACGATTATCTTGCCATGATCGACGGCTATGCGGCGCATTTCAAACTCGACTATGCCCGCGACCAGATGCACAGGGAAGCCCTGGAATGGTCCACCACACGCGGCAACCGTTCGGGCCGTGTTGCGTGGCAATATATTCAGGATCTGGCAGGCAGGCTTGGAAAAACCATGCAATGAAAAAGGCGGGCCCCTGCCCGCCTTTTTCATTTTTTCCTGCTGAAACTGTTATGATTCCAGATATTTGGTCGGATTGACCGGCGCCGAGTTCTTGCGCACCTCGAAGT
GCAGCTTCGGCGACTTGGCGTTGCCGCTCATGCCCGACTTGGCGATTTCCTCGCCACGGCGAACCTTCTGGCCGCGCTGCACCATTATCTGGCTGTTATGGCCATAGACGGTCACAAGGCCATTGTCGTGGCGGATCAGAACGGTCTGGCCAAATTCCTTCAAACCATCGCCCGCATAGATCACAACACCGTTTTCGGCGGCTTTGACCGGCGTGCCTTCCGGCACCATGATATCGATACCGTCGCTGACCGAGGTGCCCTCACGCTGGCCGAAGCTTGCCAGAATGCGCCCACGAACCGGCCAACGCATCTGCGAGATGCCGGTTGAGGATGGCGCTGCGGCCTGATCCTTTTCCGCATCCTCGATTACCTTGTTGCTGGCCTGCGGCGGCGTATAAGGCTTTACCTCTGCACCTCCATTGGCCGGCGCACTGGCTGCTTTGGCCGGGTTTGCCGGCTGCGGCGTGATTGCGGCCACCTGTGTCGGCGCACCTGCGGCAGCAGACGGAATAACGAGCGACTGCCCGACGCGAATGGCGCCACTGGTCAGGCCGTTTGCCGCCTTCAACTGGTCGACAGGGACATTGTGCTTCTTTGCGATGGAGAACAGCGAATCCCCGCTCTTCACGACGTAGGCACCGCCCACTGATGGCGGGGTTGCGATAGCGCCGCCAGCCGATGCCATATTGGTCGGCGAGGATTTCTTGCCGTTGACAGCAGGTGCTTGCGGAACGCCAGCGATATTGTTGTCCATCGGGCGCGGCGCGCCCTGCCCCGTCGCCGGAAGCTGGCCAAGAACCTTTTCCTTCGCGCCATTCACCGTATTATGCGTAACGGTGGCGGCACTTGCGACCCTGGTTTCAGCGGCATTGACGGTATTATTGACGCGGGACGCCGCCATATCCTGGGCTTGATCGACCTGATTGCGCATCTGCGTCGATGCCGCAGCGACAGGCGCACCCAGAGGCGCTGACGATACCGGCGGCAAGGAATTGCGCTGGACCGTTCCGCTTGAAACCGGCGGGGCCGAAGCCACGGGGGCAGACGCATAAACATTGCCTGCGGGTTGCTGCGCGACGGGCTGACTGGACGTAGAACCAGTAAAAATGCCGTCTGTGAAGCGCATCGTATCAGCACTGCACCCGGCTCCAAAACCGGCAATCAGAACGATTGCGACATTCCGCAGGAGACGTTCGGACGTATGCTGCAAAATTGGTAAACGCATGTTCAACTCGTCCATACTCGAAACTCACTAAGACGATTAAAACGCGTTAATGTTACTCTCTGGTTAAGAATGCCCCGAAACTAAAAAAAGATTCACCAAATAAACACCATAAGCGCAAGAATGACGCGCAAACAGCCACTTCCGGCCCGCAAATGCCAAGCTGGCGAGGCTGCCATTGCTCAAAATCAATGCAACTGAAGCCGTTCCGACAAAAGCGCGAAGCGGTTTTTTGGAATCATCCTCAAACAAAATCTTGGAGCGGGATGATGGTTGGACTTAAATTCAACCCGTTTTAGAGCGCGTTTCGATCTGATTGAATCAGATCGGCGCTCTAATCCTTTGTTTTGACGCGCATCTTTTCCGAAAACCGTTTCACACTTTTCGGGATGCGCTCTAAAGAACGGAAGACGTGCCTTCGATGAACGGCTGATATCGAACCGGCATGAGGTCTTCCTGTTCAAAACGGCTTCCAACCTTTGAAATCCGCGTCATGATCTGGCGTCCATCGCCAGGGCCGATCGGGGCTATCAGGACGCCATGGGTGGCGAGCAGTTCAACGAAATGGCGCGGCACCTCATCGCATGCGAGCCAGATGACAATGCGGTCAAACGGCCCGCCCGGCATACCGTGGCGCCCGTCTGTATGTTTCACCATGATATTCTCGCGCTTCAGCGAAACGAACTGCTGGAGAGCGTGGTCGCAGAGTTTTCGATACCGTTCCACCGTCGTTACACGGCCGGACAGCAAGGACATAACGGCGGCGGTAAAGCCGGAG
CCGGTGCCGATTTCCAGAACCCGATGGCCGGGCTCAAGCTTCAGGGCGGAAATGACGCGCGCCTGATCGTCTATGCCTTCCATATATTCACCGCAATCAAGCGGCGCGGTTCGCGGGCTATAGGCAAGATGCGACCATGCCGCCGCCAGAAAGCTCTGGCGCGGCGTTGCTTCAATTGCCGCAAAAAGTTGCGGATCATCAATGCTGTGCCCACGCATCCGCAGAACAAAGGATGCAAATCCCTCCCGGTCCGAAAGCCGCGGGCGTTCAGACGTTGCCTGCCTCATGCTTCCACTCCAAGCGCCGCGCCCAGTTCTGCACGAACCTTATGAGCGGTCAGATCAAGGTGGAGTGGGGTCACTGAAATGCAACCCGAACGGATGGCAGCAATATCGCTGTCGTCGGCAACCGGAGCCTTGCCGCGACCGAAATGCAGCCAGAAATAAGGGAAACCACGTCCATCGCGGCGCTCGTCAAGGCGCGCATCATGGCTAAGCTTGCCTTGTGCCGTGACGCGCACGCCCTTCACTTCTTCCGGAGCGCAATTCGGGAAATTGAGGTTCAACAGCACGCCTTCCGGCCAGCCCGCCTCCATCAGCCTCCCGATAAGCTCAGGCGCATGAGCTTCCGCCGTTTCCCACGGCACGATCCGGCGATCGCCCGCATATTCATATTCCTGCGACAAAGCGATGGCTCGCACACCAAGCAATGTCCCCTCCATCGCACCGGCAACCGTGCCCGAATAGGTCACATCGTCGGCCATGTTCGCCCCGGAATTGACGCCGGAGAGGACGAGATCGGGCGCGCCCGGCAATACATGGCGCACCCCCATGATGACGCAATCGGTCGGAGTGCCGCGCAGGGCAAAATGACGGGCATCGATCTGGCGAAGGCGAAGCGGCTCCGACAGTGTCAGTGAGTGGGCAAGCCCGCTCTGGTCCGTTTCAGGGGCCACCACCCACACATCGTCGGAGAGCTTGCGTGCAATTCGCTCCAGAACAGCGAGGCCTTCAGCGTGGATACCGTCATCGTTCGTCAGCAGAATACGCAATTTGTCACTCCTTCGCCGAAATGGATAAGACACTTAAGACACTACAGCGGTTCCAGTTGAAATGGGATCGTTGAAACTGCTCTCTCTTTGTTCTTTCGCATGTCCCCAAAACCGGTTCCCACTTTTGGGGGCATGCTATAATTCCAGATCAAGCGGCTTTTTCGATCCGCGTGAGGCCGCCCATATATGGCTGTAATGCTTCAGGAATATGAATGCTGCCGTCTTCCTGCTGGTAATTTTCCATAACCGCAATCAGCGCGCGCCCGACAGCAGCGCCCGACCCGTTGAGGGTGTGCACGAAGCGCGTGGATTTCTCGCCTTCCGGGCGATAGCGGGCATTCATGCGGCGGCCCTGGAAATCACCGCAGGTCGAACAGCTTGAAATTTCGCGATAGGTGTTCTGCCCCGGCAACCAGACCTCGATATCATAGGTCCGCTGTGCGCCAAAGCCCATGTCGCCCGTGCAAAGCACAACGGTACGGAACGGCAGGCCCAGCCGCTTCAGCACTTCTTCCGCGCAAGCCGTCATGCGCTCATGCTCGGCAACGGAGCTTTCCGCATCGGTGATCGATACCATCTCCACTTTCAGGAACTGATGCTGGCGCAACATGCCGCGCGTATCGCGCCCGGCCGACCCCGCTTCCGAGCGAAAACATGGGGTCAGCGCCGTGAAGCGCAGCGGCAGCCCCTTCATATCGACAATTTCTTCGGCAACCAGATTGGTGAGCGGCACCTCCGCCGTCGGGATCAGCCAGCGGCCATCCGTCGTGCGGAAAAGATCTTCTGAAAACTTCGGCAATTGCCCCGTGCCATAGACCGCTTCGTCGCGCACCATCAGCGGCGGCATGACTTCGGTATAACCGTGTTCTGTCGTGTGAAGATCGAGCATGAACTGGCCAAGCGCGCGCTCAAGACGGGCGAGCGGGCCTTTCAGCACCGTAAAGCGCGC
ACCGGCAAGCTTGGCCGCGCGCTCGAAATCCATGTATCCAAGCGCCTCGCCAAGCTCAAAATGCTCTTTCGGCTGGAAGGAGAAATTGTGCGGGTTGCCAATGCGGCGCAGCTCAACATTGTCGCTTTCATCCTTGCCGAGCGGCACATCATCAAGCGGAATATTGGGAATGGTGGACAATGCGTCGCTCAGTTCCTTGCTGAGGCGGCGCTCGTCTTCTTCCGCATGGGCGAGAAAATCTTTCAGTTCGCCCACTTCGGCCTTCAGCTTTTCAGCCGTGCCCATGTCCTTTGCGGCCATGGCCTTGCCGATTTCCTTCGAGGCGGCATTGCGGCGCTCCTGCGCTGCCTGCACCTTGCCGACATGCTCGCGGCGCTTTTCATCCAGCGCAATCAGTTCGGACGAAAGCGGAGCAGCCCCACGCTTTGCGAGCGCCTTGTCGAGGGTTTCCGGGTTTTCGCGAATCCATTTGATGTCGAGCATGGAAAAAAGCCATTTCGTGAAATTGAACAGAAGCGAGGCTAAACGATCTTCAGCCCCAAAGATGCCTGACGTCAGATCAGGTGGAGGAAGCGTTGTTATCAGCGTCGGCAGATGCCTGCGCCTCATCCCGCTTCTTCTCGATCATGCGCGCCAGAAAGATCGAAATCTCGTAAAGAAGGATCGTCGGCAAGGCAAGACCGATCTGGCTCGCCGGGTCCGGCGGGGTCAGCACCGCAGCCGCGACGAAGGCAATGACGATCGCATATTTGCGCTTGTCCTTCAGCCCCGCCGAAGTCACCAGCCCCACACGCGCCATGAGGCTCGTCACCACCGGCAACTGGAAGACCAGGCCAAAAGCAAAGATGAGCGTCATGATGAGGCTCAGATATTCCGACACTTTCGGCAGAAGCGAAATCTGGACCTCGCCGCTGCCGCCGGTCTGCTGCATGGCGAGGAAGAACCACATCACCATGGGCGTGAAAAAGAAATAGACGAGCGCGCCGCCGATCAGGAACAGAATGGGCGACGCGATCAGGAACGGCAGAAATGCAGTGCGTTCGTGCTTGTAGAGACCGGGAGCCACGAATTTATAAATCTGTGCGGCGATGACCGGGAAGGCCAGCACAATGCCGCCGAACATGGCCACCTTCACCTGCGTGAAGAAGAATTCCTGAGGTGCGGTATAGATCAATTCCGCCTTGGAGCGGTCCATGCCGGCCCAGTCGATGGCCCATTGATACGGCACCACAAGCAGGTTGAAGAGCTGTTTTGCGAAAGCAAAGCAGAAAATGAATGCCACGAAAAAAGCCAGGATAGCCCAAATAAGGCGGCGGCGCAGTTCGATCAGGTGTTCAAGCAGAGGCGCTGCGCTCTGTTCGATTTCATCCTCGTCCCGGTTCACGCTTTGGTTCCTGTCTTCTTTGTGGTCTTTTTAACCGGCGTTGCGGTCTTGTCTGCCGTCGGCTTGGGGGTAGCTCCGGTTTTTTTGGCAGTCTTTGTCGTCGTCGGTTTCGGCCCGGCTTTTGCAGCCGGACGCGGTGATGTTTTCCTAGGCTTGGCGGGTTCTTCGGGCGCGGTGATCATTGGTACGGGAACTGGCGGCGCGGGAACTGGCGTTCCGCCCGGCTCAACCGGCGTCGTAACCTCACCCACCTTGTTCTCGGTGACTGGCGACATTGATGTTGCGGACTGGAGACCAGACCGCAAATCCTCGCCAGCACTGCGAATCGGGTCAAAAACCTGTGTCAGCCTTGTGCGCGGATCAAGGCTTCTGGCTCCATCGATGATGGTCTTGACGTCTTCAAGTTCCGCCTCTTTCAAGGCCTCGTTGAATTGATGGCGAAACTCGTTGGCGGTGGTGCGCATGCGTGCAGTCGCCTTGCCGAACGCGCGAAGCATTTTCGGCAAATCCTTGGGACCGACCACCACAATCATGACAATTGCGATAATCAGCAGTTCAGACCAAGCGATATCGAACATAATTTGATACCTTGCGCTCTGCGCGCACATCCTGTCTC
TTGGCGAAAAGCCGCACTGCCCACAAACCTGCCATGCGCGTTTTCAGCCCATGGCAGTTCATCCCGGAAGGATCAGGACTTGGTGGTCTTCTTGACGTCCTTGACGGGTTCTTCCGCTTTGGCGTCGATCGTACGCGGATCTTCCTTGGCGTCTTCGTCAGCCATGCCCTGCTTAAAATTCTTGATACCCTTGGCGACATCGCCCATCAGCTCGGGGATCTTGCCGCGGCCGAACAGAAGAAGCACAACCGCCAGAACGATCAGCCAGTGCCAGATGGAAAAGCTACCCATATTATTCCTCTCAGTGCCGCCCAAGGCGCGGCATATGCCTGCTATCTCCGATACGATTTAAGCGCTTTCAACAAATCTTTCAAACAGAAGTGTGATGATGAACGGCTTCAAACCGGATTAATTCGTCGCAGGCAGAAATTTTGTTCTATTCTCCCCTGGGTGCAAGCAAACCCAGCCCCTCCAGATCAATATCCTCCAGCGGGTCCTCCCCTTCGGTCAGCTCGTCCGGGTCGATATTGGGGATCGGTACGGCAAAACTGGAAGGAATGCGCGCCGAGAGAAGCCCTGCGCCGCGCAATTCCTCAAGACCGGGCAGATCGCGGATTTCCGGCAGGCCAAAATGGTCGAGGAAAGCGTCGGTGGTGCCATATGTTACCGGGCGCCCTGGCGTGCGCCTGCGCCCGCGCAGCTTGATCCAGCCGGTTTCCATCAAGACATCAAGCGTCCCCTTGGATGTTTCCACGCCGCGAATATCCTCAAGTTCGGCGCGTGTCACCGGCTGGTGATAGGCAATGATGGCAAGCACCTCCATGGCCGCGCGCGAAAGCTTGCGCTGCTGAACAGTCTCGCGGTTCATGATGAAGGCGAGATCTGGCGCGGTGCGAAACGCCCAGCCACTGCCCACCTTCACAAAATGCACGCCCCTGCCCTCGTAAACCTTCTGGAGATGGTTCAAAACCGGAGCAATATCCACATTGGCGGGAAGCCGCTCGGCAAGTGCGCGCTCGCAAACAGGCTGCGAAGACGCAAAAACAATCGCCTCCACAATGCGGGCAAGCTCGGCAAGCGTCACCGGCGAGGCAGGCCCCGCCTGCTCTTCTTCCCCAACGCCTTCCATATCCATCAAATCGCGGCGCTCTGCTTCAGGCATTTTCGTCCTCATCGAATTCATCGAGTTCGCGGGTCGCGCGCATATAGATCGGCTCGAACGGAGCGTTCTGGCGTACTTCAAGCTTGCCTTCGCGCACCAGCTCGAGGCATGCGGCGAAAGAACTGGCAAGCGCCGACGCCCTCTCCTGCGGAGAAAGTGCATAATCGATCAAAAAACGGTCCAGCGAAACCCAGTCGCCCACCGCGCCCATCAGGCGCACAAGCGCCGTGCGTGCCTCCTTGAGGGACCAGACGCTGCGTTTTTCTATCTGTACCTGGGAAACCGCCTGGCGCTGGCGCTGCGACGCATAGGCGCTAAGCAGATCGTAAAGCGTTGCGGAAAAACGGCTGGCGCGGTCCACCACCACCATTTCCGGCATGCCGCGCGGGAAAACATCGCGGCCGAGCCGATGACGATTGACGAGTGCCGCCGCCGCATCGCGCATGGCTTCAAGCCGTTTCAACCGGAATTGCAGGGAGGCAACGAGTTCCTCGCCCGTGGCGCCATCGTCGCCCTGCTGCTTCGGGATCAGCAGCTTGGATTTCAGATAGGCAAGCCATGCCGCCATAACGAGATAATCGGCGGCAAGCTCCAGACGCAGCGCGCGCGCCTGCTCCACGAAACCGAGATATTGTTCGGCAAGCGCCAGCACGGAAATGCGCGCAAGATCGACGCGCTGGTTACGCGCAAGATGCAGAAGAAGGTCGAGCGGACCTTCAAAGCCCTGCACATCGATCAGCAGTGACGGCTCGCCTGCGCCTCGCCCGGCCTCATTTTGCCACAGGGTATCCATCGGCACGCGTGTGCCGTCGTTTCCACCTGTATGTGCATCCGATGC
TGCCAA
Protein sequences of DBSCAN-SWA_2 >NZ_LT962938|1119610:1131538|1129968_1130694_-|WP_002964011.1|DBSCAN-SWA MPEAERRDLMDMEGVGEEEQAGPASPVTLAELARIVEAIVFASSQPVCERALAERLPANVDIAPVLNHLQKVYEGRGVHFVKVGSGWAFRTAPDLAFIMNRETVQQRKLSRAAMEVLAIIAYHQPVTRAELEDIRGVETSKGTLDVLMETGWIKLRGRRRTPGRPVTYGTTDAFLDHFGLPEIRDLPGLEELRGAGLLSARIPSSFAVPIPNIDPDELTEGEDPLEDIDLEGLGLLAPRGE >NZ_LT962938|1119610:1131538|1123473_1124772_-|WP_002964019.1|DBSCAN-SWA MDELNMRLPILQHTSERLLRNVAIVLIAGFGAGCSADTMRFTDGIFTGSTSSQPVAQQPAGNVYASAPVASAPPVSSGTVQRNSLPPVSSAPLGAPVAAASTQMRNQVDQAQDMAASRVNNTVNAAETRVASAATVTHNTVNGAKEKVLGQLPATGQGAPRPMDNNIAGVPQAPAVNGKKSSPTNMASAGGAIATPPSVGGAYVVKSGDSLFSIAKKHNVPVDQLKAANGLTSGAIRVGQSLVIPSAAAGAPTQVAAITPQPANPAKAASAPANGGAEVKPYTPPQASNKVIEDAEKDQAAAPSSTGISQMRWPVRGRILASFGQREGTSVSDGIDIMVPEGTPVKAAENGVVIYAGDGLKEFGQTVLIRHDNGLVTVYGHNSQIMVQRGQKVRRGEEIAKSGMSGNAKSPKLHFEVRKNSAPVNPTKYLES >NZ_LT962938|1119610:1131538|1128091_1128916_-|WP_002964014.1|DBSCAN-SWA MNRDEDEIEQSAAPLLEHLIELRRRLIWAILAFFVAFIFCFAFAKQLFNLLVVPYQWAIDWAGMDRSKAELIYTAPQEFFFTQVKVAMFGGIVLAFPVIAAQIYKFVAPGLYKHERTAFLPFLIASPILFLIGGALVYFFFTPMVMWFFLAMQQTGGSGEVQISLLPKVSEYLSLIMTLIFAFGLVFQLPVVTSLMARVGLVTSAGLKDKRKYAIVIAFVAAAVLTPPDPASQIGLALPTILLYEISIFLARMIEKKRDEAQASADADNNASST >NZ_LT962938|1119610:1131538|1122057_1122399_-|WP_002964021.1|DBSCAN-SWA MFVTPAFAQASGSVVGPDMLMSILPFILIFVIMYFLIIRPQRTQMKKRQEMLNSVRRGDTVVTGGGIVGKVLKVVDDNELELEIADGVRIRVVRATLMDVRVKGEPVADNKNK >NZ_LT962938|1119610:1131538|1125150_1125819_-|WP_004683704.1|DBSCAN-SWA MRQATSERPRLSDREGFASFVLRMRGHSIDDPQLFAAIEATPRQSFLAAAWSHLAYSPRTAPLDCGEYMEGIDDQARVISALKLEPGHRVLEIGTGSGFTAAVMSLLSGRVTTVERYRKLCDHALQQFVSLKRENIMVKHTDGRHGMPGGPFDRIVIWLACDEVPRHFVELLATHGVLIAPIGPGDGRQIMTRISKVGSRFEQEDLMPVRYQPFIEGTSSVL >NZ_LT962938|1119610:1131538|1130686_1131538_-|WP_004683698.1|DBSCAN-SWA 
MAASDAHTGGNDGTRVPMDTLWQNEAGRGAGEPSLLIDVQGFEGPLDLLLHLARNQRVDLARISVLALAEQYLGFVEQARALRLELAADYLVMAAWLAYLKSKLLIPKQQGDDGATGEELVASLQFRLKRLEAMRDAAAALVNRHRLGRDVFPRGMPEMVVVDRASRFSATLYDLLSAYASQRQRQAVSQVQIEKRSVWSLKEARTALVRLMGAVGDWVSLDRFLIDYALSPQERASALASSFAACLELVREGKLEVRQNAPFEPIYMRATRELDEFDEDENA >NZ_LT962938|1119610:1131538|1126731_1128015_-|WP_004683702.1|tRNA|DBSCAN-SWA MLDIKWIRENPETLDKALAKRGAAPLSSELIALDEKRREHVGKVQAAQERRNAASKEIGKAMAAKDMGTAEKLKAEVGELKDFLAHAEEDERRLSKELSDALSTIPNIPLDDVPLGKDESDNVELRRIGNPHNFSFQPKEHFELGEALGYMDFERAAKLAGARFTVLKGPLARLERALGQFMLDLHTTEHGYTEVMPPLMVRDEAVYGTGQLPKFSEDLFRTTDGRWLIPTAEVPLTNLVAEEIVDMKGLPLRFTALTPCFRSEAGSAGRDTRGMLRQHQFLKVEMVSITDAESSVAEHERMTACAEEVLKRLGLPFRTVVLCTGDMGFGAQRTYDIEVWLPGQNTYREISSCSTCGDFQGRRMNARYRPEGEKSTRFVHTLNGSGAAVGRALIAVMENYQQEDGSIHIPEALQPYMGGLTRIEKAA >NZ_LT962938|1119610:1131538|1124820_1125006_-|WP_002964018.1|DBSCAN-SWA MFEDDSKKPLRAFVGTASVALILSNGSLASLAFAGRKWLFARHSCAYGVYLVNLFLVSGHS >NZ_LT962938|1119610:1131538|1122558_1123425_+|WP_002964020.1|DBSCAN-SWA MTTEDILSQKLDRLIAALERIAPPEAKTPDLDAADCFVWAPERLVLDPVSRVNRVDIALIRGVDLVRDQLVDNTRRFAKGFPANNVLLWGARGMGKSSLVKAAQASVNAEFPDITPLKLVEIHREDIDSLPVLMNLIKETRHRFILFCDDLSFDHDDTSYKSLKAALEGGVEGRPDNVIFYATSNRRHLMPRDMIDNERSTAINPSEAIEEKVSLSDRFGLWLGFHKCSQDDYLAMIDGYAAHFKLDYARDQMHREALEWSTTRGNRSGRVAWQYIQDLAGRLGKTMQ >NZ_LT962938|1119610:1131538|1129604_1129823_-|WP_002964012.1|DBSCAN-SWA MGSFSIWHWLIVLAVVLLLFGRGKIPELMGDVAKGIKNFKQGMADEDAKEDPRTIDAKAEEPVKDVKKTTKS >NZ_LT962938|1119610:1131538|1125815_1126583_-|WP_004683703.1|DBSCAN-SWA MRILLTNDDGIHAEGLAVLERIARKLSDDVWVVAPETDQSGLAHSLTLSEPLRLRQIDARHFALRGTPTDCVIMGVRHVLPGAPDLVLSGVNSGANMADDVTYSGTVAGAMEGTLLGVRAIALSQEYEYAGDRRIVPWETAEAHAPELIGRLMEAGWPEGVLLNLNFPNCAPEEVKGVRVTAQGKLSHDARLDERRDGRGFPYFWLHFGRGKAPVADDSDIAAIRSGCISVTPLHLDLTAHKVRAELGAALGVEA >NZ_LT962938|1119610:1131538|1128912_1129494_-|WP_006137300.1|DBSCAN-SWA 
MFDIAWSELLIIAIVMIVVVGPKDLPKMLRAFGKATARMRTTANEFRHQFNEALKEAELEDVKTIIDGARSLDPRTRLTQVFDPIRSAGEDLRSGLQSATSMSPVTENKVGEVTTPVEPGGTPVPAPPVPVPMITAPEEPAKPRKTSPRPAAKAGPKPTTTKTAKKTGATPKPTADKTATPVKKTTKKTGTKA >NZ_LT962938|1119610:1131538|1119610_1121938_-|WP_014490079.1|DBSCAN-SWA MGELRQCFIQAETDLHSMLYFSRWKSALIWLAVLVSLIIASPNFFSRETLANLPDFLPKKQVSLGLDLSGGSRLILQVQNAGKTDLETTANIMRQRLEELGYGNPVVEGEGRNQIRVEVPGLYDAQLLKDILTIRGNLSFRAMDDTMSPDDAIRGTPLADSEIVYSFDDPPVGYLLKKTPILTGHDITDAKASISADDGQPVITLTLDDNGRRRLADLTAQGNENSFAIVVDNQVVSAPTVSGPLDTSELQIEGAFDLQAANNMAVVLRSGALPQAVTVLEERTIASALGEDYASAAVLAALLAALVVGLFMVLSYGILGVIALVALVVNIIILTAVLSLIGASISLASIAGLVLTIGLAVDAHILIYERVREDRRKGYSVVQAMESGFYRALSTIVDANLTTLIAALVLFLLGSGTVHGFALTVAIGIGTTLFTTLTFTRLLIAQWVRTAKPKEVPKRRLKLVPTVTHIPFMRLQFVTLGISVLACAIVVALFVNIGFNYGIDFRGGSMVELQARNGDANLEDINERLAELNIDSARVLPAKSPRSALVIIGSQEVGDDAEQTVAVKLRGEFEQDYSFQRVDVVGPTVSEQLSRAGVLAVILSLIGIFIYVWFRFRWQLALGAVLSTLHDVVILSGMFIVFRMEFNLWSVAAILTIIGYSLNDTVVIYDRVRENLRRYKSAPLPAIIDASINQTLSRTLLTSFVTFLAHVPLYAFGGSEIRMFALALSVGIIVASYSSIFIAAPLLVQFGLKPRETDAGDAVDAELAQSLNLES |
13 | uncultured_Mediterranean_phage(90.0%) | tRNA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
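The keyword note above can be illustrated with a minimal sketch that reads the protein headers listed for a prophage region and picks out the ones carrying a keyword tag. It assumes the header layout seen in these records (`>contig|region|start_end_strand|protein_id[|tag]|DBSCAN-SWA`) and case-insensitive matching against the quoted keyword list. The function names `parse_header` and `is_phage_related` are hypothetical and are not part of DBSCAN-SWA itself, whose actual matching logic is not shown on this page.

```python
# Keywords quoted in the report text; lower-cased for case-insensitive
# matching (an assumption, the tool's real matching rule is not shown).
PHAGE_KEYWORDS = {
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
}


def parse_header(header):
    """Parse '>contig|region|start_end_strand|protein_id[|tag...]|tool'."""
    fields = header.strip().lstrip(">").split("|")
    contig, region, location, protein_id = fields[:4]
    start, end, strand = location.rsplit("_", 2)
    region_start, region_end = region.split(":")
    return {
        "contig": contig,
        "region": (int(region_start), int(region_end)),
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "protein_id": protein_id,
        # Fields between the protein ID and the trailing tool name are
        # treated as keyword tags (assumed layout).
        "tags": fields[4:-1],
        "tool": fields[-1],
    }


def is_phage_related(record):
    """Return True if any tag matches one of the phage-related keywords."""
    return any(tag.lower() in PHAGE_KEYWORDS for tag in record["tags"])


# Two headers copied from the DBSCAN-SWA_2 protein list.
tagged = parse_header(
    ">NZ_LT962938|1119610:1131538|1126731_1128015_-|WP_004683702.1|tRNA|DBSCAN-SWA"
)
plain = parse_header(
    ">NZ_LT962938|1119610:1131538|1122558_1123425_+|WP_002964020.1|DBSCAN-SWA"
)
print(tagged["protein_id"], is_phage_related(tagged))
print(plain["protein_id"], is_phage_related(plain))
```

Only the explicit tag field is checked here; matching keywords inside free-text protein descriptions would need substring rules that this page does not specify.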
DBSCAN-SWA_3 |
1356234 : 1404640
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_LT962938|1356234:1404640|DBSCAN-SWA AGTGATCATCGTTGGGGAAACGAAGATCGATACGGGCGACAAATATGCGCCAATTATCGATTACAATCTGAATTATATATCCGGCAAAAATCCAAAGCACCGGCTCGTCGAGCACTATTCGGTGGCGGAGCTGACGGCAAAATATATCAATATCCTCTGGGATGATGGACCGCATAAATATAATGTAAGGTCGTTCCTCGGCGAGATTGACGAGATTCTGAAAGGCGCACGTTTTTCAGGTTTTGATCAGGAAATGCTCGATTCCATCATCGGCACGCTTCGCGAACGCGGCAACAGCAATGCAACCATCAATAGAAAGATGGCTGCGCTGAGCAAGCTGCTGCGAAAGGCGCACAAGATGGGGGATATCTTCAATCTTCCGGAGTTTATCCGGCAGAAAGAGCGCGTGGGGCGCATTCGATTCCTGGAACAGGAGGAAGAGAAGCGATTGTTCGCCGCAATAAAGTCGCGCTGCGAGGACAGCTATCGCCTATCGGTCTTTCTCGTGGATACGGGTTGCCGTCTTGGCGAAGCAATCGGCCTCACATGGAATGATATTCAGGAACAACGGGTTACGTTCTGGGTCACCAAATCCAATCGCAGCCGCACCGTTCCCCTCACCCGGCGTGCACGAAAAGCATCCCATATTCCGCGTGAGAGGCTAAAAGGCCCCTTCTCCATGCTCAATCAGGTTCGGTTTCGCCAAATCTGGAACGAAGCGAAGGCCGAAGTTGGCCTTGGCGCGGATGACCAAATCGTTCCGCACATTCTACGCCATACATGTGCGTCGCGACTGGTGCGTGGGGGCATCGACATCCGCCGGGTTCAGATGTGGCTTGGTCACCAGACCTTGCAGATGACAATGCGCTACGCGCATCTGGCAACACATGATCTCGATTCCTGCGTTAAGGTGCTCGAAATTCATTAGGCCACAACAAAAAGCGTCAAGCTGTGGCGATTTTGGAACGGACTTTGAGCCAACAGACCTTGCGGATAATGAGGGACTTCCCGCCTATAGGACGTGATCTGGAGTAGCCCCGTGGCATAGAAAAACGGCAAAGCAAATTTCCCTTGGGCTCGCGCAGAGTACCGAAGGATATTCTCATTATCCTTATAGAACCGGCTGTTGCAGGCTGATGTGCGTTAAAACGATTCCAGCAAGATTGAACCGTTGGAGCCGGTTTGCATTTTGGGTTCGAAAAGCCTTTAATGCCGCCGAATAGCTGGCCGGGTTGGCCGGGCTTTCAATGCGCCTTTACTGTCAGACTGAAACAGACCGGGATATCGCCCATGGCACAACCTCGCCTCACCCCCCTTGTCGAAAGCCTGCCCTCAACTGTCCCCTTTATCGGACCAGAAACATTGGAATTACAGCGCGGTAAGCCCTTCGAGGCGCGGATCGGCGCCAATGAAAGCAGCTTTGGTCCAGCGCCTTCTGTCATCGAGGCCATGCGGAATGAAGCAACCGAAGTGTGGAAATATGGCGATCCGGAAAATTATGCGTTGCGCCACGCCATTGCAGCCCATCACGGCCTGAAGGCTGAGCACATCATGCCGGGCGCTGGCGTTGATGCGTTGCTTGGTCTCATCGTCCGTCAATATGTGCAGCAGGGCGACAAGGTCATCAACTCGCTCGGCGGTTATCCGACTTTCAATTATCATGTCGCAGGCTATGGCGGGCAGCTCGTCACCGTTCCTTATCGCGATGACAAGCCAGACCTCGACGCCCTTATCGATGCGGCTGCAAGGGAAAAACCAGCGCTCCTCTATATCGCCAATCCCGACAACCCCATGGGAACATGGCACGGGGGGGCCGATATCCAGTCCTTCATTGAGCGTCTTCCGGAAACAACATTATTAATTCTCGACGAGGCCTATTGCGAAACTGCTCCGGCATCGGCATTTCCACCTTTCGAGACGGAT
CGTCCGAATGTTCTTCGGATGCGTACATTCTCCAAGGCCTATGGGCTTGCAGGCATCCGCTGCGGCTATGCGGTGGGAAATCCGGTGGCGATAAAAACCTTCGACAAAGTCCGCGATCATTTCGCAGTCAGCCGCATGGCGCAGGCTGCCGCCATCGCAGCTTTGAAAGATCAGGCTTATTTGCACGAGGTCGTGGGCAAGATTTGCGCCGGGCGTGATCGTATTGCGGCCATTGCCGAAGCCAATGGCCTCCACGCTGTTGCATCAGCGACCAATTTTGTTGCAATCGATTGCGGCAGGGGGAAGGATTTTGCGCAGGCAGTACTCAACGGATTGATTTCGCGTGATATTTTTGTGCGCAAGCCGGGCACGCCCGTGCTGGACCGCTGCATCCGTGTGAGTGTGGGCGTGAAAGAACAGCTTGATCAATTTGAAGCGGCATTTCCCGAAGCACTTGAAGAAGCGCGCAAGATTTGCGCCGCCAACGCAGAAAACACCTGATCGTGGACAACAGCAAAATGCCTTCAATCAACGGCCAGCCACGCAGCGTCCTGTTCGGCACACTCGCCGGACTGTGCGGTGCGTTGGGCATAGCATCCTACGCCGGAGCAGCCCATATGGGCGAAAGCCATCTTGGCACGATCGCCCCTCTCCTTCTGGCCCACGCGCCGGCTCTGCTTTTCCTGTCGCTCATCAGCCCCGTCAGCCGTGTGGTACGGATCGGCGGCGCGATTCTGGTCGTCGGGCTCGCCCTGTTTTGCGGTGATCTTTTCATGCGCGACATGACCGGAGATCGCCTGTTTCCCTTCGCCGCGCCCACGGGCGGAAGCCTCATGATCCTGGGTTGGCTTTGTCTTGGCTGTAGTGGCTGGTTTTCGGCCAATGCCAAATGAAAAAGGGGCCAGAGCGGCCCCTTTCTTATTCACATGAAGTGCTTTTATGAATTGCCCTTCAAACGTGCATTGCAAAGGCTGTAATAGCCGCCGCCCTTTTGAACCCACTTCAGGCCGCCCAAAGCATTGGCATCCTTGAGCGCATGATATTGGTCGAGGCAGGTGTGCATCCGTGCCTTGCCGGCAGATTCTTTTGAATATTTCGCGGAAACCGCCGTCGGGAACTTCACGCCCTTGGGGGCCGCGACGCTTGGAGCAGCAGGTTCAGAATCGCCATCCGTGCTCAACGCCACCGGATCCGCGCCGGGGCCGCATTCCGCTTTACGGAAATCGTTCCACTTCATGCCATTGTCGGTACTGGCGTCTTTCGCCGCCTGATATTTCGCGCTGCACTGCTTCATGGTGAGGCTCTTGGCGCCATCATTACTGGCAGGCGCGGCGGCCTTCGCGGCTTTCTTGGTCGCAGGCGCAGCAGCAGGTGCGGCCGCCGGAGCTGACGCAGCATCATCGCCACACTGAGCCTTGCGGAAATCGTTCCATTTCATGTTGCCAAGTGTGCCAGCATCTTTCGCAGCCTGATATTTCGTGCTGCATTCTTTCATCGTCAGCGCACTTGCCGGCGAGGAAAAGGCAATAGCGGCAGCACCCATGATGAAAGCGGTACCGGCCATAGCTGTGATGGTTCGAATAGACATCTCGTTACTCCCTGTCGAACGCGGCCTCTCGATCGCTCGGCATCTTCCCAACCCCGCGACCTTTCTAATATATAGAAACCGAAGATAACATGTTTGCTCTCTACCGGTCCGGTTATCTGCCAAGTTTGCGCCCGGCAAAGTTGACTGTCAATCATCTGCCCAATATTACCTTGGGGTCCAATACAAATTTTCTATCGGGCATTCTTTGCACGTTATGGCCGTGTGATTGCCATGAAGTAAAGATAGAGTAATTCTCTATCCGTGCACGATCTTTATTCGGTAAAAGTTAACATTCATGGTCGGCCCGGCTGCACCAGCGAATCATGACCATCGCCGGCAGCAGCGTCGGAAAACCGAACCTTGCAACAAAACCGGGCGAGATAACGCACTGCGCGTCATCTACCCCCCCCCT
TCATTCCGCAAGCGTTTTTCAGCCCACATGTCGCACGAAAGAATTTAACAGGAAGAACCGGAAAAATGGCGACAAACCACCAATGCGGGCATGAAAATAACAATAATGCTAGCCGTGCGCGCAGCACTTCTGCCTGTTATATTGCCAGTGTTTCCCAAGACTTGACGCTGAGCCTGATGATGTAAAGCGTTATGAACCAACATGAAAGCCGGGTTTTCCCGGCTTTTGTGTGTCCTGCCGGGTGATGCAAAGCAACTCCGGTGAAGACGGCATCCTTGGAACCGCTCTATTTCGTTGTTTTACGCATTATCCAAAGCAAAGCCGCTTCGCACTTTTGCTGGAAATGCTCTGGAGTTTGAAATTGGCCGATATCGCAGATAGCGCCCCTTCGCTTGCCGATGCATCCAGCCCGATCAGTGTCGATCGGCACCGCCTCTATGAAGATGCGATTGCGATGCTGATTGGCACATCCTTCATCGCCCTTGGCATAACGCTTTACAGCCATGCCATGCTGATGACCGGCAGCACAGCCGGTATCGCCCTGCCGATCCATTATGCGACCGGCACAGGATTCGGCCTTCTTTATTTCCTGATAAACCTGCCCTTTTATTATTTCGCGGTGCGCCGCATGGGCTGGGCCTTCACGATCAGGACTTTCGCAGCCGTGGCCCTATTGTCCGGGTTCACCCGCCTCATGCCGCTGAGCGTCGATTTTACAAGCATCAACCCGCTTTTCGCGGCCCTGATGGGCGGAACGCTGATGGGCATGGGCGTGCTGGCGCTTTTCCGCCATCGTTCCGGCGTCGGCGGCGTCAATATTCTGGCGCTCTATCTTCAGGACGCCTATGGAATTCGCGCGGGATGGTTCCAGCTTGGCCTCGATGTGCTCATCATGCTGGCTTCGCTGTTTTTCATTCCGTGGGAAAACATGGTTCTCTCACTGGTCGGTGCGGTCGCGATGAATGTCATCATCGCGATCAATCATAAACCGGGGCGCTATATTGGCATAAGCTAACAAAAAAGCCGCCATCGAAGGCGGCTTTTTTCATTCCGAACAGTTCGATACCAGACAGTTCAATGAGCCGTCATCCATTCGCGGGCGCAGCGCAAGGCAGCGGCCAGTTCGCTCTTGCTCATCGTTGCGGCAACCTGATTGCGCATCCGCTCCGCCTTCTGGTTGCCGCGGATGGCGGCAATGTTGAGCCACTTGTGTGCCTCGATCACATCGATCTCGCAATCGCGACCGATCGCATATTTCATGCCCATTTCCAGAAGGATACGATCCTGCGCCGCAGGGTTTTCGTTGTGGTTGAAGCTCTGAAACTGTGCCATTTTTCTTGTCCCTGTTAAATCCCGAGAACCGTCCGGTCTTTCTTTTGAAGCTGGCTCCTGCCGCTTCAAAACCGCCGGATCTGTCGTTCCGTTCATGATCCTGAAGATAGGGGAAAGGGCTGAAATGCACGTTAAACCAGATGCTTAACATCAACCCAACAAAATCCGAACAAGGAGTAAACTTGGGATTTTGTTAAACATCCAAGCCTCACGGACTATTGGGTTTTTCTGAAAATTTTGGATTTGGACGCAAGGCCGGCGGAAACCGGATGATAACCATAGGGCGCATTGGCATGGTGAGAATTGGCGCCAAGCGCGCACTCCAGACTGCCATGATGAAGCTGAAAATTGCCCCTTCCGCTTCCCCGGAAACTCGTTTACGTTTACGTTTGCGTAAGCTGATGAGGGGCGCTGAACGCATGCTCCTGCAAAGGCTGAAACGCATGAATAGCGGCTTCATCGCCGCCCCGACAATTCGGGATTCGCGGCATTTTGGGAGGAACGAATGATCCGCACACTAATTCTTGGAGTGACGCTCGCCGCTGGTTTTGCAGCACCTGCATTCGCCGACGAAGCCATAGTCGGCACGTGGAAACGCCCGAACGGAACGCTTATCAGCTATGCCGCCTGCGGCGCCAACAAGTTCTGCGGCACGGTGATGACGGGCG
AATACAAGGGCAAGTCCATCGGCACGATGTCCGGCAAGGATGGCAATTACAAGGGCGAAGTGAACAAGCTTGACGAAGGCAAGACCTATTCGGGCAAAGCCAGCGTCAAGGGCAACACGCTTTCGCTTTCGGGCTGCGTCATGGGCGGCCTGATCTGCAAGAGCGAAAGCCTTGCCCGGCAATAGAGCACTCGCAGAAACAGCAAAGGGCGGCCACAAGCCGCCCTTTTGATACCATATACGATCAACGCGCCTTCTGGCCGAAGAGAATTTTCTGCTCCTCTTCCTTATTGGCTGCGGCATCGGCGCGCTCACGCTCTTCCTTGCCAACCAGCAACCCGCGCTGCACGGCAGGCCGGGCGTTCATGGTTTCATGCCACCGCTTGAGATTTGCGAAATCATCGAGATTCTGTTCGTAATTCTTGTAGGCGTTGACCCAGCCGATGCAGGCCATATCCGCGATCGAATATTCGTCGGCGATGTAATCGCGGCCTTCAAGGCGGCGGTTCAGGACGCCATAGAGGCGGTTGACCTCATTCGTATAACGGTCGATGCCATACTGGATTTTCTCCGGCGCATAGATGCGGAAATGCCCAGCCTGGCCGGACATCGGGCCAAGCCCGCCCATCTGCCACATCAGCCATTCTTCCACCGTAACGCGCTTGCGCTTGTCAGTCGGATAAAACTTGCCGAATTTACGGCCCAGATATTGCAGGATCGCACCGGATTCAAACACCGAGATCGGTTCGCCGCCCGGCCCTTCCGAATCGACGATGGCGGGCATACGATTGTTAGGTGCAATTTTCAGGAAACCTGGTTCAAACTGATCGCCCTTGCCGATATTGATATATTTCACCGCGTAAGGGACGCCCAGTTCCTCCAGCATGATGCTGATTTTGAAGCCATTCGGCGTCGGCCAATAATAAAGTTCAATAGGCTTGGTCTGTTCAGCCATGGGTCTTCTCCCGCTTTCTTGGCTTTGTCCATGAATCCAATAAGCGTTAGCATAGGCCGGAAGAAAACAGGTACAAGTTAACCCGGCATCAATATTTTTGAACGCGGCATGAACAGCACCCTCAGGGAAAGTTCAGGTACGCGGCCGTAAGCTCCAATGCGAAAGCAAACAGGTTTCGAGGATTGGGGACAATCATGCTGCTTCAGCGGTTTACAATAGTCTGCCTTATGATCGCGGGTTTTGCCGCCTTCGCCAGCATTCTGACATACCGGACATACGATATTTATGGCGACCAGCCGGTTGTCGACGCTGTCGCCTTGGCCCATATGCACGATCTTGCCGATGGAAAGACACGTCAATGAGAGCCGGAGCACCTTCCATTGCCGCCATTGGCCTGCGCATATTCCTCTTCACCGCCATTTTTACAGGCCCTGCCGCGGTTTTCGCCTATACGCAGATCGGCAACCATGCAGAGATGAACGGGGCTGGCCTGGTTCTTTATATAAGCCTGCAAAGAACTGCATAATCAGATCATGCCACCGTCATGGAGTTTGCTATACTAGGTCATGACGCAACGAATCGAACATCCATTTTTGACTGATATTAAAACCCCTCCGCTGATGGAGCCGGAAGTTTTTAGCGACGCGCAGGCAGCCGTGGCCGCCTTGTGTAAACTTTACGAGCGTAACACCGCTTTCTTGCGCTCTGCTTTTGAAAAAGTTGCCCGCGGCGAAATTGCGCCGCAGCGTTATCGTGCATTCTATCCTGAAATATGCCTTTCCACGTCGAGCTTCGCCCATGTGGATTCACGTCTGGCATATGGCCATGTCTCGACCCCCGGCGACTATTCGGCAACCGTCACCCGGCTCGATCTTTTCGGGCATTATCTGCGCGAACAGATCCGCCTCCTGATGCGCAATCATGGTGTGACCGTCACGGTTCGCGAATCCTCGACGCCTATTCCCATCCATTTCGCCTTCAAGGAAGGCGCACATGTTGAAGCCTCCGTCGCCAGTGCCTTTACCCATCCGCTGCGCGACCTTTTC
GATGTGCCGGATCTGGCCGCAACAGACGATAAGATCGTCAATGCCGATTTCGAGCCTGCCCCCGGCGAGCCGATGCCGCTTGCCCCCTTCACGGCGCAACGGATCGACTATTCGCTGCATCGCCTTTCCCATTACACCGCAACCAGCGCCAGCCATTTCCAGAACTTCGTCCTTTTTACCAATTACCAGTTCTATATGGACGAATTCTGCGCCTATGCCCGCCAGTTGATGGCGGAGGGCGGCGGCGGTTATGACCAGTTTGTCGAACCCGGCAACATTGTCACGCGCGCTGGCGAAACCGCACCCAGCACCGGAAACCCGTTACAGCGCCTGCCGCAGATGCCTGCCTACCACTTGCAGAAGGCAGGCCATGGCGGCATCACCATGGTCAATATCGGCGTGGGGCCTTCCAACGCCAAAACAATAACCGACCACATCGCTGTGCTGCGCCCCCATGCGTGGCTGATGCTTGGCCATTGCGCGGGCCTGCGTAACAGCCAGCAGCTTGGCGATTATGTGCTGGCCCATGCCTATATGCGCGAGGACCATGTGCTGGACGATGACCTCCCGGTCTGGGTGCCGCTTCCAGCCCTTGCTGAAATTCAGGTGGCTCTGGAAGAAGCGGTAGAGGAAATTACCGGCCTTCAGGGCTATGATCTCAAGCAGATCATGCGCACCGGAACGGTTGCCACGATCGATAACCGCAATTGGGAGCTGCGCGACCAGCGCGGGCCGGTCCAGCGCCTTTCGCAGGCGCGGGCCGTGGCGCTCGACATGGAATCGGCCACGATTGCGGCCAATGGTTTCCGTTTCCGTGTTCCCTACGGCACCTTGCTCTGCGTCTCCGACAAGCCGCTGCATGGCGAATTGAAGCTGCCTGGCATGGCAACGGAATTCTACAAGCGACAGGTGGCACAGCATTTGCGCATCGGCATCCGCGCCATGGAAAAGATCGCTTCCATGCCGGATGAAAGGCTGCATTCCCGCAAGCTGCGCAGCTTCTACGAAACGGCCTTCCAGTGAGGTTAGGGGAGTAGGGGAATAAGGGAATAGGGCAGTAAGACAGTAAGGCAGTATGTTGGAGCCGTGCCGGAAACAGGGGGGCTCAGACAACCAGAAACCTATTCCCCTATTCCCCTACCCCATTCATTTTCTCGCCGTAATCCGGCACTACTGCCGCCTCATGTCAAAAAGCCGCCTGAACCTGCCCTTCCTGATACTTGTCGCCGCCACGCTGGTTTCGATCCCGCTTGTGCTGAGCTTCCTGAACAGCCTTCACCCGGCGTTCGACACCATCTCACATTTACGCATCCATCTGGCCGTGCTGATGGGGTTGCTGGCGCTGCCGCTGCTTTTCACGAAACTGCGGCGCGAAGGCGCGATGATCCTGTTGCTCGCCGTTTTTGCAATTGCCGTCACGCCCCATGTTTTCCCCGCAAGTGAAGATGCCCATGCGCGCGAAGCAGCACAGCCTCACTATCGCCTTTTGCAAATGAACCTTCGCTTCGACAATGGTTCACCGGAGCAGGCACTCTCCCTCATAGCCCATATCCGCCCCGATGTCGTCACCCTGGAAGAAGTATCAGCCATGTGGCGGGAGAAGTTCGGCCATATTGCATCCGCCTATCCCTACAGCATTTTCTGCCCTCATCCCGGCGCGGTTTTCGGGGTGGCAATCCTGTCGCGCCGGCCCTTCATCGCTGACAGCACCCCCGCCTGCGATCCGAAAGGCATGATGGCCGTCGCCTCGGTCGATTTCGGTGGCTGTCCCGTGGATGTGGCCGCACTGCATCTCCATTGGCCTTGGCCTTTTCAGCAGAGCGAGCAGATCGAGGCCCTTTCGGAGCAGTTTCGCGGACTGTCGGAAAACGCAATCCTGAGCGGCGATCTGAACGCAACACCATGGAGCGCGACTACAAAGCGCATCGCCGAACTCGCTGCGATGACACCTGCCCCGCCGACCGGCCCAACATGGCTTTATCGTCGCCTGCCCGCTTCGT
TGCGCTTCGCCGGATTGCCTATCGACCAGACCTTCGCGAAAGGCCGGGTCGCGATATCGAAGATTACCCGGCAACAGCCCATCGGCTCTGACCATCTGCCCGTCCTTGTGGAATTTTCCATCATTCCCCAGCCGGAGACAGTCGCGTCGTAACAAATAAGGCGGCATAGAAGCCGCCTTATTCAGTCAAATTACAATTCTCGTTCCGGTCCTTCGCCGACATCTTCCCAGCGGCGGTCGAGACGGACACGGTCTTAAAATGTGAACATTACATGTTCATATAGACCGGTCCTTCGCCGCCTTGTGGCGGAACCCAGTTGATATTCTGGTTTGGGTCCTTGATGTCGCAGGTCTTGCAGTGGACGCAGTTCTGCGCATTGATGACGAAACGCACATCCTTGACGCCCGGATCAGCAGCGGCATTGCCGTCCGCATCCACCCATTCATAAACACCAGCCGGGCAATAGCGCGTCGACGGTCCTGCGAAAACATCGTGTTCCGAAGTCTTCTGCAATTCCATGTCACGCACTTGCAAATGCACCGGCTCGTTTTCGTCGTGGTTCGTGTTCGACAGGAACACAGACGACAGACGGTCAAAAGTCAGCACACCATCCGGCTTCGGATAGTCGATCTTCTTGTAGTTTGCAGCCGGTTCAAGCGCCTGCGCATCGGTCTTGCCGTGCTTCATCGTGCCGAAGAACGAGAAGCCGAAAAGCTGGTTGGTCCACATATCCAGGCCGCCCAGAGCAATGCCGATAGCGGTGCCGAACTTCGACCAGAGCGGCTTGACGTTGCGCACGCGCTTCAAGTCCTTGCCAATGGCGCTCGCCCGCCAGCTATTCTCGATCTCGATCGGCTCGTCATTGGCACGGCCAGCGGCAATCGCCTCTGCAATCTTGTCGGCGGCCAGAATGCCGGACAGGATCGCATTGTGGCTGCCCTTGATGCGCGGCACATTGACAAAACCGGCCGAGCAGCCGATGAGCGCGCCACCGGGGAAGGAAAGCTTCGGCACGGATTGCCAGCCGCCCTCGGTAATTGCACGCGCGCCATAGGAAAGGCGCTTGCCGCCCTCGAACGTATCGCGGATGGCCGGATGCGTCTTGAAACGCTGGAATTCCTCGAATGGCGAGAGATAGGGGTTCTTGTAATTGAGGTGCAGCACAAAGCCCACCGCAACCATATTGTCCTCAAGATGATAGAGGAACGAGCCGCCGCCGGTTTTCATGTCCAGCGGCCAGCCGAACGAATGCTGCACGAGGCCCGGTTTGTGCTTCGACGGATCAACCTGCCAAAGCTCCTTCAGGCCGATGCCAAACTTGGCCGGTTCGCGGCCCTCGTCAAGCTTGAACTTCGCAATGAGCTGCTTGGCGAGTGAACCACGCGCGCCCTCGCCGATCAGCACATATTTACCGAGCAGCGCCATGCCGCGCGTATAGTTAGGGCCACGTGTGCCGTCGCGTTCGACGCCCATGTCGCCCGTCGCAACGCCGATCACCGCGCCCTCGTCATTATAGAGTACTTCGGTCGCAGCGAAGCCCGGATAGATTTCCACGCCCAGCTCCTCGGCCTTCGTGCCGAGCCAGCGGCAGACATTACCGAGCGAAACGATGTAATTGCCGTGGTTGTTCATCAGCGACGGCATGGCAAAATTGGGCAGGCGCACGGAACCGGCAGGCCCCAGCACCAGGAAATGGTCCGCCGTCACAGGCGTTTTGAAAGGATGGCCTTCCTCCTCGCGCCAGCCGGGCAGAAGCTGGTCGATACCGACCGGGTCCACCACCGCACCCGACAGGATATGCGCGCCGACTTCCCCACCCTTTTCCAGCACGACCACGGAAAGTTCGGGGTTTATCTGTTTGAAACGAATCGCCGCTGCAAGACCGGCAGGGCCTGCGCCGACAATCACGACGTCGAATTCCATGCTCTCGCGTTCGGGAAGCTCGTTCGCTTCAGACATTCATTCCTCCGTCCGGTATGCCATCATCTCCTCGCTATGCC
AGCGCCAATGATGCATCCAATTGTTTTTCCGCATGATCCTATCACAAACGCCAGTGAACGGGTCAAAGGCGTCTGGAAGATCAGTCTGTCCTGCATTCCGCCATACCCTCTCGCAGAGTTGGAGACTGCCGCATATGCGTCCAATTGTCCAAAAATGGCATCATACAGCTACATTTTTATTTTACCTTAACGTAAACGTCAAATGTGGTCTATTTAGATTGCTGCCATCACGATTTGGCGAAAGAATGTTCTTCTTATGCCAACAATCGACTGTATGAACCGGAAAATTGGCCCATGCCCTTGCATTCGGCGCCAAAATTCGCCACCTCAGACGGTGAGGTGCAAGAATGACCGATGGCCGGGAAACGCTGGATATGGAGGGCCTGCTTCGCTTTTATGCGGAAGCGGGTGTCGATGTGCCGCTTTGCGAAACGCCGATTGACCGTTTTGCAGCCGCCACACATCCTGCCCCCGCACAAAGCCGGATGCAGGCCGCAGCACAACAGCCGGAATCAAACCCGGCGCAGGCCCGCGAGGAACGGGCGCGTACCGTTGCCGGACCCTCCCCTTCCAGCAAGCCGGTGCAGGCTGCAATGGATCTGCCCGACAATGCGCAGATCGCGCTTGCCCGCGAAGCTGCTTCGCAGGCCGAAACACTGGAAGAACTGCGCGAAAAGCTTGCGGCCTTCGACGGCTGCAACCTGAAATTCACCGCCAAGAACCTCTGCTTTGCCGATGGCGACCCTTCGTCCGATATCATGTTCATCGGTGAAGCGCCGGGGCGGGACGAGGATATGGAAGGCCTGCCTTTTGTCGGCAAATCCGGCCAGCTTCTGAACCGCATGATCGAGGCTATCGGCCTCAAGCGTGAAGAGGTTTATATTGCCAATACGATTCCGTGGCGGCCACCGGGCAATCGCGCACCCACACCGCTTGAAACGGAGCTATGCCGCCCCTTCATCGAACGGCAGATAGAGCTGGCCGCGCCAAAAGTGCTGGTGGCGCTTGGCGGCCCGGCTGGCAAGGCGCTGACGGGCGCAGCGGAGGGCATATTGCGCCTGCGCGGCAACTGGAAAATCCACCGCACGCCGACAGGCATGGAAATCCCCGTAATGCCGACATTGCATCCGGCCTATCTGCTGCGCACCCCGGCACAGAAACGTTTCGCCTGGCGCGACTTTCTGGCGGTGAAACTCAAATTGGCCGAATTGCGTGGTTGATATTGCCCGCATAGAGCGATAGCACGCGCGCATCGTCACTATAGCCTGGTGCCGCAAACCCGGTTCAATTCGTTGTTTTCGTGCGTATCAATCCGCCACTTGTCAAATCACGCCAAAGCGACAAGTAGCTTTTGAATTCCTGACCCATTGTGGAGTCCGTCATGGCGGGACAGCTTCTGCCCATAGCCGCGCTGCTGGCAAGCACATTTCTCATGCTGCTCGCAGGCGGCCTTGCCGGTATTCTCCTTCCGCTTCGCGGTGGCATGGAAGGCTGGTCCACGACGACAATCGGCTGGATGGGGACAAGCTATTCGCTGGCCTTCACCATCGGCTGCATTTTCATTCCGCATCTGGTGCGCCGCGTCGGCCATGTGCGTGTCTTTTCGGCGCTTCTGACGCTGCTGTCCATGGCGCTGCTGTTCCATGCACTGGTGGTCAACCCGGCCGCGTGGATGATTTTTCGCGGCATTACGGGCTTCTCCCTCGCCGGTTCCTACATGATTATCGAAAGCTGGCTGAACGAGCGCGTGACCAATGAATCGCGCGGCATGATCTTCTCGATCTATATGATCATCACCATGGTCGGGCTGCTTCTGGGGCAGTATATCCTACCCTTCGGCAATGCCGCCACGCAGACACTTTTCATCATCTGCGCCATTATCTATGCCAGCGCGCTTCTGCCCACAGCACTTTCAAGCGCCCAGTCACCCAACCCGCTGACACAGGTTTCGCTGGACCTCAAAGGGCTTTACCGCCGCTCGCCCGCCGCAGTTGTCGGCT
CCTTCATCGCAGGCATCGTGGCAGGCACATGGAATTTCCTTGCCGCCATCTATGGGGAGATGAACGGGCTTTCCACTTTCGGCATCGCCACCATGCTTGCCTCGGCGATGATCGGCGGCGCAATCTTCCAGTACCCGCTTGGACGCGCTTCCGATTTCGTCGACCGCCGCTATATGATGATCCTTGCAGGCGCCATCGGCTTCACCCTGTCGTTCATCATGGTGCTGTTTCACCCGACCTCGCCCTATACGCTCTATGCGATGATGTTTCTGTTCGGCTCGGTGGTCTTTCCGATCTATAGCCTGAACGTCGCCCATGCGAACGACTATGCCGATGCCAGCGAATTCGTAAAGATTTCCGGCGGGCTGCTCATCGTCTATGGGGTTGGCAGCGTGCTTGGGCCAGCAATTTCAGGCCCGCTGATGGATGTCATCGGCGCGAACGGATTTTTCGTCACCATGGCGATTGCCTATTGCATCTATGGGCTGCACGCCTGGTGGCGCATCTACCGGCTGGAGCGCCCCGCAATCAGCGACCAGAAGACCGAGTTCAAGTTCCACACGCCCGACGGGCAATCGACACCGGAAACCATGCAGCTCGACCCGCGTGCGGAAGGCACATCGGGGCAGGCGCAATAAGACCCTATTGCGCCTGCGGGGCATTAACCTTGTGCGGCGCCAACGAAAGCTGGTGCTCGCGGTAGATGATAAAGCCGCCGGCAGCCATCACGATGCCGCTGCCGATCAACATTTCAAAGGTCGGTATATCGCCGAAGAGCATGAAGCCGATCACCAGCCCCAGCAGCATTGAAGTATATTCAAACGGCGCAATCGTAGACATGGGTGCATGGCGATAGCATTCCGTGAGCAGAATTTGCCCGATACCGCCCGCAAAGCCCGCCCCCACCAGCATGGCAAGCTGTGACCAGTTCGGCACAACCCAGCCGAACGGCAGGCTCACCAGCGCGATCACGCTTGCCGATATGGAAAAATAGATCACGATGGTCGGCGTACGTTCGGTCTGGACCAGACGGCGCACCAGCATCATGGCGACAGCCGACATGACCGCGGCCCCAAGCGCGGCCAGCGCACCCACCGCTTCGTCGCGCCCGACACTTCCGGCAGAAAACAGGGTCAGGCGTGGCCAGATGATAATCATGACGCCGAACAGCCCGATCAGCACCGCACTCCAGCGGTAGAAGCGCACCACTTCGTGCAGGATGGCCGCCCCCAGAATAACGGTAATCAGGGGTGAGGCATAATTGATGGCGATGGCCTCCGGCAAAGGCAACTTTGTTAGCGCGAAGAAGCTCATCGACATGGAACACACACCCACCAGCCCGCGCCAGAAATGGCTGAAGCCGTGGCGGGTAGAGAAAACCCCGCCAAGCTGCCCGCGCCAGCCCAGATAAAGCAGGATGGCGAAAATTGCGAAGAAGGAGCGGAAGAAGATCAATTGGCCGACCGGAACGCCTTCCGCCGCCTTGAGCAAGGTGGACATGGCGACGAAAACAGCAACGGAAGCAATTTTGAGGCCGATGCCCAGCATCGGATTCATTTCAAGCCTGGCCGCATCGGCGGCTGCGCGCGTGTGGGCGTTCACGACGCTATTCCTTAAAAGTAAGCGGTGCGAATCTCCGGCTGAAAATATTTCAGCCATCCCCCTATTTATTAGACGCGTTTTTGTGGCCCCACCAGTAGGGCAAAGCTTAACAGCGAAATTTATTTGATACAGTAAAGCTCGCCTGCGCCCCGTAGTTTTGCTGGAATTGCCCTGAAAAGCCGGTTACGACTCCAGCACCCAGGAGTTCAAAGAAAGAGTTTCGCAATGCGTACTGAAACCGGCCATACTTTCCGTCTCGAAGATTATCGCCAGACACCTTACGCCATACCCGAAACGAAACTCGACTTCACACTGGAGCCGGAAAAAACCATCGTGCGCGCAACGCTCACCATAGAGCGCCGCTCCGATACGCCCGCCGGTACGCCGCTCGTTCTCCA
CGGTGACGAATTGAAGCTCGTGAGCCTTGCCATCGACGGCAAGGCGCTTTCCGACAACAGCTTTTCGGCCACGCCCGACCAGTTGACCATCAGCGATCTTCCGAAAGATGTGCGCTTCACCTTGCAGATCGTGACCGAGGTGAACCCAACAGCCAATCGCCAGCTTTCCGGCCTTTACCGCTCCAGCGGCGTCTATTGCACCCAATGCGAGGCGGAAGGCTTTCGTCGCATCACCTATTTTTACGACCGCCCGGACGTGCTGTCGGTCTATACGGTGCGTGTCGATGCCGACCGCAAAGCCGCTCCCATCCTGCTTTCAAACGGCAACCCTGTCGAAAACGGCATGGTGGAGGGCCAGCCGGAACGGCATTTTGCCGTCTGGCACGACCCGCATCCAAAACCCTCCTATCTTTTCGCGCTCGTCGCCGGTTCGCTCGGCGTGGTGAAAGACCACTTTACAACCCGATCTGGACGGCCCGTCGATCTCGCCATCCATGTGGAACATGGCAAGGAGGGCCGCGCGCTTTATGCGATGGACGCGCTGAAACGCTCCATGAAATGGGACGAGGAAAAATTCGGCCGCGAATATGACCTTAACGTTTTCAATATCGTCGCCGTCTCCGATTTCAACATGGGCGCGATGGAGAACAAGGGCCTCAATATCTTCAACGACAAATATGTGCTGGCCGATCCTGAAACCGTGACCGATGCGGATTATGCCGGCATCGAAGCCGTTATCGCGCATGAATATTTCCACAACTGGACCGGCAACCGCATCACCTGCCGCGACTGGTTCCAGCTATGCCTCAAGGAAGGCCTGACGGTTTATCGCGATCACGAATTTTCCGCCGACCAGCGCTCGCGCCCTGTCAAGCGCATTGCGGAGGTGAAAATCCTGAAAGCGCAGCAATTCCCGGAGGATGCAGGCTCGCTTGCCCATCCGGTGCGCCCTCGCGAATATCGCGAGATCAATAATTTCTATACGGCAACCGTCTATGAAAAAGGTTCGGAAGTCGTTCGCATGATCCGCACCATCATCGGGCCGGAGCTGTTCCGCAAGGGCATGGACCTCTATTTCGAGCGCCATGACGGCGATGCGGCGACCATCGAGAATTTCATCCAGGTTTTTGCCGATGTTTCCGGGCAGGATTTCTCGCAATTCGCGCTCTGGTACGATCAGGCCGGTACACCGAAGGTGGAAGCCGGGTTCCATCATGACGCAGCCGCGAAGACATTCACGATCAAGCTGGAACAGTCACTTGCGCCGACACCCGGCCAGTCGATCAGGAAGCCCATGCATATACCCATTGCCTTCGGCCTGATCGGGCCGGACGGCAAGGACATGCAGCCCTCGTCGGTGGAAGGCGGCGAGGTGCGCGACGGCGTAATCCATTTGCGCCGCCCATCCGAAACCATCGTCTTCCATGGCATCGAGGCCCGGCCCGTGCCGTCGCTGCTGCGCGGCTTTTCGGCGCCAGTCAATCTCGCCGCGCCTCTCACGGCGGAAGACCGGGTTTTCCTTGCCCTGAACGATAGCGACCCCGTTGCGCGCTGGCAGGCGATGAACAGCATTTTCTCTGCGACCCTTCTGGATGGCGCCAAGCGTGTGCGCGGCGGGCATCAGCCGGAAACCGATCCGAAGATCGTCGCGCTGGCCGGAAAGGTCGCCTTCGATGAAATGCTGGACCCGGCTTTCCGGGCGCTTTGCCTGACGCTGCCGAGCGAAAGCGATATCGCGCGCGAAATGGGTAACAATGTCGATCCAGACGCAATCCTCGCCAGCCGCAACCATCTGATTGCAGCAATAGCTTCAGGCTATGCCGATGGATTTGCCGGGCTCTATGACACGCTGAAGCAGGAAGGGGCGTTTTCACCCGATGCGGCCCCGGCGGGAAAGCGTGCCTTGCGTAGCGCCCTTCTCGATTATCTCAGCGTTCAGGAAAAGAGCCCTGAACGTGCAGAAAGGCAATTTGTCGAAGCCGACAACATGACGGACCGCG
CCACGGCGCTGGCCGTTCTGGTCCATCGTTTTGGCGATAGCGGCGAAGCCCGTCAGGCGCTCGCAACCTTCGAGCAAACGTTCGGCCAGGATGCGCTCGTGATGGACAAATGGTTCATCGTGCAGGCGACACGCCCCGGCGAAACGGCCCTTGAAGCAGTGAGGGAACTGACCCGCCATCCGCTCTTTTCTCTCGACAATCCAAATCGCGTGCGCGCGCTCATCGGCGCATTTACGGCTTCCAACCCGACCGGGTTCAACCGCCAAGATGGTGCAGCCTATGGTTTCCTCGCCGATACGCTTCTGACCATTGATCCGAAAAACCCGCAGCTTTCCGCACGGCTTTTGACGGCAATGCGCTCATGGCGGTCGCTGGAAGAGGTGCGGCGCGAACATGCCCGCGCGGCACTGGCGCGCATTGCAGGCGCAGGCAAACTCTCCACCGATCTGCGCGACATCATCGACCGAACGCTCGCCTGATTCGCAAAATCCGAAGGTCGAGCGATGATTCGCCGATCCGGCCCGCACCAACGCGCCGGATCGCTGCACTTGGCGCGCCTGCCCGCACAACCTGTTCACTCTTCCGGCTTTCATGAGTCTTTTCATGGCGTTAAAGGCTGCACCTGCCAAACTGATTCCACTTAAACTCTTGAGATTCATGAGAAAACGGGCAACGTAACAAAGCGATTTTTACGCCTCGTTAACGAATTCACTGGACAGGGCGAATCACTTTTGATTCATTGAATGCATTCGGAAGTGGCGTAATCCGAATCATATTTTGAACCAGGATTTCAGTTCATACAGTTCTGCGTAGAAGAGTGTTTCATGCGCTTCGCAGGGCTGTTCCATAGATTCAAGAGGGGCCGAGAGATGGCGAGCACCGACGCGTATGGCGCGCCGGCGGGGACGCATTGCGAATCCGGCCGGAAAAAGAGGAAAGGCAGGCTTTCGGGCCATGTCAGCCTTCTCGCGGGGCCGGCTTATAGCAAATTCATCGTCATAGAACCCATCCTGCGCCGCCTCGTCCCTACACTCATCATCATTTTCCTGATTATTCTCGGCGTGGCCCGCGTTTTCTCGCTTTTGGCCTGGCGCGACGATATCGAATTGCAGCACAAGGCTGCTCTTTCCGGCGCGACGGCGCATCTGGCGCAGATGATCGAGCGTGTCGCCAACGGGATCGAAACAGGCGCACAGCTTTCCGCCAAGGATTTGCAGGACGCCATGACGGAGCTTCGCTCGCGCGGCCTTACCTCGTCCGGCATGACCATCGCCATCGTGGATGCGCAATCCATGATTAAAGCCGCATCTGGCCCAGCCGGGATCGCTGGCAGCCAGATCGACACCATTCTGGGGGATGCACAGCCGCTGTTCCTTTTTGCCGAGCGCGCCGGTGTGCTCAGGGTTGTATTGCAGGGCGAAGCCGCCTTCGGCGCGCTGGCCAAGCCAATGACCGCGCCCTATTCCATCATTGCGGTCGAACCGGAAAGCACCATCTTTGCCGAGTGGAAAAGGGCTGTATCGCTCAATGTGACGCTTTTTGCCGGCACGATCGGCGTGATGTTCGCAATTCTCTATGCCTATTTCAGCCAGGCCGCGCGGGCACGCGAGGCCGACGATCTCTCCGGGCAGATACAGCGCCGCATCGACATGGCGCTGGCGCGCGGGCGTTGCGGCCTTTGGGATTGGGACATGGCGCGCGGGCGCATCTACTGGTCACGCTCCATGTATGAAATGCTGGGCTATGAAGCGCAGGATGCCGTGCTGCCCTTTGGCGATGTGGCGGCGATCATCAATGAGGAAGACGGCGATCTCTACTCCATCGCCGAACAGGCGGCGGCTGGCGATATTTCACATGTGGACCGGGTCTTCCGTATGCGCCACGCGGACGGTTCATGGGTCTGGATGCGTGTTCGCGCCGAAATCGCCAGCGAGGGCGACCTTCATCTGGTCGGCATCGCCTTCGATGTCAGTGAGCAGCATCGCTTCGCGCAGCAGACC
GCCGAAGCCGACATGCGCATTCGTGAGGCAATCGAGAATATTTCGGAAGCCTTCGTTCTCTGGGACGCGAATAACCGCCTGGTGATGGCGAATTCCAAGTTCAGCGAATATGCGGGCCTGCCGGTCTGGACGCTGAAACCGGGCGTGCCACGCAACGAAGTGGACGCGCATACCCGCCCCTTCACCTTCGAGCGCCGCATGGCGAACGAACACAACCGCGCAGGCGGCCAGACTTTCGAGCGGCAGTTGAGCGACGGGCGCTGGCTACAGGTCAACGAACGGCGCACACAAGATGGCGGCATGGTCTCCATCGGCACGGACATTACCCAGCTCAAGCTGCATCAGGAGCGCCTTGTGGATAGCGAGCGCCGCCTGATGGCGACGGTTCACGATCTTTCCATCGCCCGAAAGGGTGAGCGCGACCGGGTGCGCGAGCTTTCCGAACTGGCGCGCAAATACAGCCTTGAAAAGGAGCGCGCGGAAGCGGCCAACCGGGCCAAATCGGAATTCCTCGCCAATATGTCCCACGAGTTGCGCACGCCGCTCAATGCGATCATCGGCTTCTCGGAAATGATCCAGGCAGGCACGTTCGGCCCGCTGGGTTCCGACCGCTATGAGGAATATATCAACGACATCCACACCAGCGGCAACTTCCTGCTCAACGTCATCAATGACATTCTGGATATGTCGAAGATCGAGGCCGGGCATTTCTCGCTCGATCGCGAGGAAATCGATCTCTGCCCGCTCATCAATGAAACGGTGCGGATAATCTCGCTCCAGGCCGAAGAGAAGAACATCGCGGTCGAAACGCGTATCGAAGACGCGATGGAGCTTTATGCAGACCGCCGCGCGATCAAGCAGGTGCTCATCAACCTTCTCTCCAATGCGGTGAAGTTCACCTCTTATGGTGGCCGGATCACAGTGCGCGCCCGCAAGACCGCCGCCGCCCTGTTCATGACCATTCAGGATACCGGCGTCGGCATTCCGAAATCCGCACTGCGCAAGATCGGCCAGCCCTTCGAGCAGGTGGAAAACCAGTTCACCAAAACCCATACCGGTTCGGGGCTTGGCCTTGCCATCTCCCGCTCACTGGCAGAGCTGCATGGCGGCTGGCTGCGCATTCGATCCACCGAGAGGGTCGGCACGGTGGTTTCAGTCTGCATCCCGGATCGCAATCCCGCGCCCAATGCAGGGCACGACGCCCGGACCCACGCTGCTTAGTACGTTTCGAGCCATAAGTGTGAAACGGTTTTGCGGGAAAAGCTTTAGCCGCGGCCGGTCTGTCACAAACGTAATATTAAATAATTACACAAGATATACATAGTGCAATCATATTTGATCGCATCACGCATAATGACACCGGCTAAAAAGGGTGGCATATCAGACAGAAATCCTTTGATGGTGTATGAGCGGACCATAAAGGTGTCATCTGATAAGTTGAAGGGCGGTCATCATGCCAAAAATCGCAAGTGCGCTGAGGCAAGCGAATTAATGATATTGCCCGTATTCGTCATCAATATGGCCTCGCAGCCTGCCGCCTATAAGACCGTCGCAGCCTCCATTGAAGCTTACGGGCAGGGTTTCCAGCTTCACAGGATCGATGCGGTTAATGGGCATACAGCGACACAGCGCATTGGCATTGACGATGCACGTTTTGATGCGATCAATGGCCGTGAAATGCTGCCCGGTGAATACGGGTGTTATCGCAGCCATTTGAAGGCATTGGAAAGCTTCTTATCCGACGGCTCCCCCTACGGCCTCATTCTGGAAGATGATGTGGTTTTTACTGAAACTACATCGGCGCGCATTCATGACATCATTAAAAGCCTGCCTGATTTCGACGTCGTGAAGCTCGTTAATCATCGCTCACCCTTATTCATGAGCCTGCTTGAAACAGATGCAGGTGACAGGATCGGCCGAGCCATTCATGGCCCCCAGGGATCTGCCGCCGCCTATCTCGTCAGCAGAGAAGGCGCCCGGAAGCTTTTATCCGCACTATC
GACCATGGAACTGCCGTGGGACGTTGCCATGGAGCGATTTTGGCATCACAAAGCCCGGCTGTTCAGCAGCGATGAAAACATCCTCGCTTTTTCTTCTCACAGCGAAATCTCAAATATTTCCGATCAGAATTCAGGTTATGATGAGGCCAAGCACCCTTGGTATATGCGCTTGAGAACGTCATTATTTCGCACTTTTGATTATTATGTGCGTGTTCACCATACATTATTGCAACCTCAAAATCCCGATGGCAGCAGCATGAAAAGCCAGTCCGGAGCCTATAAGCTGCCCGGAATTTCATTAACTGGCGAACTGATTGCCGCCATCAGCTTGCTGGTTTTCATGTCTACGGTATGGGTAGAGACGGACGCCTACAGATATATAGCCCTCGGTTTTGTGGTGGCTGCATTGATCCGTTATGCCCGCACCGATTTCTGGAAATACGAAAAACCAATGGTCGGCTGGGCCGGCTTACTTTGCGTGGCGTGGACATTCTATGTCCTGGCGAGGTTCGCATATATCTATCTATTCTACCCGGAAATGGGCACCGGCTCGGCAGAGGGCATATATCTTTTCCCGCTTTTCTACCCGACATTGGGGTTTGCGTTACTGCTTTTTATCCGACGGCCATTTCTCATTGCGGTCGCCTTCATGGCGATCAGCCTCGTAATTCTCATATTCGGCTTCCACTATGATCTATCGTGGAACGAACGAGCCGTTACGTTGCTCCAGCATAATCCGATCCATGCGGCTGTCAGCAGTGGCTTTATCGCCCTATGCGCAATGGCTTTTGGCATTCACACGTTGAATCGCAACACGCTCGATACCAGAGCGCGCGTCGTTTTGTGCCTGCTCGCGCTTGCTACTTTTATTGCGGCCCTGATTGCAATCTACAGCCTCTATTCAAAAGGGGTCTGGCTCGCAATGGCAATTGCATTTCCGACTTTCGTGGTCCTTGTTGCGCTGACAGATAAAAGCCAGACCTCACGCATGGCTGCACTGGTGTGCATTCTCATTGGCTTGTTGAGTGTGTTTGCAGGAGAACATATCCTGCAACGTGTCGGCGGCAATACTGCCAATACATCCTGGGAATTGTTATCGGACCTCAAGACGGGCGATAACATCATGCAGGATTTCGACAAAGCCATCAAAAACCCGGAAACAGGCCTGAGCGAGCGCGAACGCCTGATGATATGGGCCAACACGCTGCATATCTGGCATAAGAATCCGATATTTGGCGCAGGCGTTTCGTGGCTTCACTATTGGGAAAAGCGCCCTTATCAGCAAACCGACTTCACCCTGCTCCACAATGGATATCTGGAAATTGCCATTCGCTATGGATTTCTGGGCTTGCTGTTCTATGGCGTTTTGACGGTCTGGGCGGTTCGATGCACATGGCAAGCCACGCGAGCAGGTCTCATCGACAGTGCTGCCTTTCAATGCTACGTCGCAACACTGGTATTTTTTGCAGTGACGATCTTGTCAAACTCTAATGTTCGTCTGGCAATAGGAGAATCCTATATGGCACTGGCATTCGGCTTTGCCTTTTATTGCCAGTACCTTCTGCAACAACACAACAGACAATACCCGCGCACCTACTTCTAAGAGCGGTTCCACTTTTACACAGCCGGTGGAACCGTGTCTTTTCCTCAATCAGCGCCATGTCCGACGAGCTTGTCAAAAGCAGCCCTGACAAGCCTGTAATGTTCCAGCAACTCATATTCTATGCGTTCAATATCCGGCAATCCGGCAACGCGGCACAGGAGCTCACGCATACCGGGCGGAAATTGATCCAGCCCTGCGCTATCGTTCAGGCAAAGCCGTATCGCCTGGCTGAGATTGGTGTAGAAGCGATGCGCCTCCACAAGGCCATCCACCATTGCAGGGTCTCCAAAGAATGGATCAAGGTTTGCCAGCACTTCTTCGGTTGCAAATGGACGCGGCGTTTTCTTCACATACCCGGCAAGGGTTGCAAATTGGGCGATAAACTCTAGATCGATAA
TGCCGCCGGGCTTCAGCTTTAAGTCCCAATCATCCCGCGGCGGCTTTTCCTGCGCGATCAGTTCCCTCATCTCGCGCACATCCCCGGCAAGTTTCCGCACATCGCGCGGCATCGCGAGAACGTCCTCGATATCCACCTTGATGCGAGCGATAAAAGCCTCATCGCCATGGATGGGCCGCGCGCGGGTCAGCGCCATATGTTCCCAGGTCCACGCATCGTTGCGCTGATATTTGCCGAAAGCTTCGATATGCGTTGCGACGGGGCCTTTGTTGCCCGACGGGCGCAGCCGCATATCCACCTCGTAAAGCACACCTTCCGCCGTCGGGGCCGAAAGGGCCGCGATGAGGCGCTGTGTCAGGCGAATATAATATTGTGAAGGCGCAAGCGGCTTTTCACCATCGGATTCCTCGGCATCCTTGTCGTGATCGTAAAGCAGGATCAGGTCCACATCCGAGCCCGCCGTCAGTTCGCGACTGCCAAGCTTACCCATGGCGAGCAGCGCCACTTTCGCGCCCTTCACCTTGCCATGGCGGCGTTGCAATTCGGCTTCCACCGCCTCCAGCGCCCTGCCAACCATAAGTTCGGCAAGATCGGAAAAGGCCTGTCCGGCCCGCACGCCATTGATTGCTCCCGTCAGCAGGCGGATGCCGATGAGGAAGCGATGTTCGGCAGCAAAAATACGCAGCCTGTCCAGTACTTCCTCGAAATCCGTGGCGCTGCCCAGAAATGCCCGCAGGCGTTCTTCGAGATAGGCACGCGTTGGCACTTCCGAAAAAATGGCCGGATCGAGCAACCCATCAAAAACATGCGGGTTGCGTGTGATGATGTCCGCCAGCCGCGGGGCCGCGCTCATGATCATCACGAGGAGGTTCAAAAGCCGGGGATTGGATTGCAGCAGGCTGAAAAGCTGAATACCGGCCGGCAAGCCCTGCAAAAACCCGTCGAAGCGCAGAAGCGATTCATCCGCCCGCCTGGTTTCTGCAAAGGCTTTGAGAAGTGCGGGCGTCAGTTCGGTGAGGCGCTCCCGTGCTTCCGCCGATTGCGTGGCGCGATAACGCCCGAAATGCCAGGTGCGGATCACGCGGCAGATATCGCTTGAGCGCTCGTAGCCCATGGCAGAAAGCGTTTCCAGCGTGCCCGGATCATCCACATCGCCGGTAAAAACAAGGTTGCCGCTCGCCGCGCCCAGTTCCGGCGCCTGCTCGAACAGCGCCGCATACTGCTTTTCCACCACCTTGAGCGCGGCGAGGAATATTTCGGAAAATTCCGCCGGGTCGGCATAACCCATCATATGGGAAACGCGGGCAAACCCTTCATCATCTTCAGGCAGGATATGGGTCTGCTCGTCCGCAATCATCTGGATACGGTGTTCGACATCGCGGAGAAACCAATATTCCTGCGCCAGCGCATCGCGCGCCTGTTGCGTTATCCATCCCCGTTCGGCAAGCCGCGCCAGCATCGGCACAGTCTGGTTGCCGCGCAGTTCGGGAAAGCGCCCGCCCGCAATCAATTGCTGCGTCTGGACAAAAAATTCGATCTCCCGGATACCGCCCCGGCCAAGCTTCACATTATGCCCGCGCACGGCAATATCGCCGTGGCCCTTATGGGCGTGAATCTGGCGCTTGATCGAATGGACATCGGCAATTGCCGCATAGTCGAGATATTTGCGCCAGACATAGGGCGACAGTTCCGCCAAAATCTGTTTGCCGGACAAGCGATCTCCGGCAACGGGCCGCGCCTTTATCATGGCGGCGCGCTCCCAGTTCTGGCCACGCCCCTCATAATAATGCAGCGCAGCGCCAACCGGAATGGCAAGCGGCGTCGATCCCGGATCAGGCCGCAGACGCAGATCGACACGGAAGACGTAACCATCGCCGGTGCGGTCCTGCAAGATGCGCACCAGCCGCCGCGTCAGCCGCGAAAACGTATCGACACATTCATAGGGATCGCCGATAGCAGGCTTGGTTTCATCAATGAAAACAATCAGATCTATATCGGAA
GAATAGTTGAGCTCGCGCGCGCCGAACTTGCCCATGCCAAGAACGATCCAGCCACAATCCTTTTCCGGATTGCTGCGATCCGGCAGATTGATCCTGCCAGCCGCGTCGGCATCGAGCAATAGAAAGCGGACGGCTGCACCTGTGCAGGCTTCCGCAAGGTCGGTCAGCCGGTCGGTGGTTGTTTCCGTATTGAAAATGCGCGCCAGATCGCAAAGGGCAATCAGCACATGGGCCTCACGCTTTAACTGGCGAAGGCTTGTCATCAGTTCGCTTTCGCTGACACCCGCGACGGTCCCGCTGGCGGAAATTTCGTCCAGAATAGCCTCAAGTGCGCTTTCCGGCGTTGCGGAAACGATACGATCCAGAATGCGCGGCTGGCGCGTCAGCGCCTCGCGGATGAAAGGCGAAAGATCGAGGATGGCTGAAAGAAAATCCGCCGCCTTTTTCCGGCCAAGCAGCGCCACGACGCCGGCAAGCTCTTCCTCGCGGGCACGGGCTTCCAGATCAGCCAGAAAGGCAGATGCCCTTTCAGGATCAAGCGGTGTCAATGCACAAAGGTTTCTCTCGAAAAACAGTGCCTTTGCGTTTTCAACCGTCATGATCCCCTCAATATCTGCCGTTTAGAGCGCGTTTCGATCTGATTGAATCAGGTCGGCGCTCTAATCCTTTGTTTTGACGCGCATCTTTTGCGAAAACAGTTTCACACTTTTCGGGATGCGCTCTAGCCAACCTCGCGATGCGGCAGCGGAAATTCCAGAACGGCGCGAAGGCCGGGTCCGTTGTCCTCAAGACGAAGCGCGCCACCGTGCAACTTCATGACCGACTTGGCAAGACTTAAACCCAGGCCAGACCCCGGCTGTGTGCGGCTTTCTTCAAGGCGCACGAAACGCTCGGTCGCATGGTCACGTTTGTCGGCGGGGATGCCGGGGCCATTATCAGCCACGACGATACGGACCCATTGCGCATCCTTTTCCATCAAAAGCGTGACCGTCGCCGTGCGTCCTTCGCCGCCCGCATATTTGATCGCATTGTCGACCAGATTGGACACGGTCTGGCCAACCAGTTCGCGGTTGATGTGCAGGGCTACATCATCAAGCGCACCAAGCGTCAAGGTAACACCCGCATCCTCTGCCACCGGCTCATACATTTCCGCAACATCGCGCATGATCGGGGCAACCGGCATATCGTCGAGGTTTTCAGATGAATAACCGGCTTCAAGCCGCGAGATCATCAAAATGGCATTGAACGTGCGAATAAGCTGATCGGATTCGCCGATAATATCTTCCAGCGCAGCGCGATATTCCGGCTCTACCTTCTCACCGCCAAGCGCCTCCTCAGCGCGGTTTCGCAACCGCGTGAGCGGCGTTTTCAGGTCATGCGCAATATTGTCGGAGACCTGTTTCAGCCCCTCGTTCAATTCCAGAATGCGTGCCAGCATGACATTGAGATTGCCAGACAGCCGGTCGAATTCGTCGCCCGAGCCGTTGACGGGAAGCCTGCCGGTCAGATCGCCATCCATGATGCGTTGCGATGCGCGCGACACATCGTCGATGCGTTTGAGCGCGCGCCGCCCCACGAAGAGCCAGATCAAAAGCGCACCCACGCCCATGATGCCAAGCGCCAGCACCAGCGAGTTGCGTATCAGATCGCGAAATCGCTCAGGTTCGCCCAGATCCCGCCCGACGAGCAGCCGCATTCCATTCGGCAGGGCAATCACCACGGCAATGGCGCGATGCTCCATCTGCGGCGCCTGTTCGCCGTAACGGCGATAGGTAAAGGCACGTTCGATAATACCGTCCGTGTTGAGCACGCCCGGTTCAACGCTTTCCACATTACCGGCAAGAATACGGCCCGTAGGGTCGGCGACAAGATAGAGATAGGCACCCGGCTGGCGCGAGCGATAATCAATGGTTCGCACAAGTTGCGGAATACCGCCGCGCGCATAGCTTTTGCCGATGCTCGCGACTTCCTCGCCCAGCGCCTGTTGGGTCTGCCCGGCC
AGAATGGAAGCCGAAAGGTTGGTCATGTAAAAGACAAGTGCAACCGCGCCCACTGCAAAGAGCAGGAGATAAAGCGCGGAAAGCCGCGCCGCGGTGGTGCGCATGAGTGCGGAAAACCGGCTCATCATTCCGCCCTTGCGGCAGCGGATTGCTTGCCCCGGCCCGCCTTCAGCATATAGCCAGCCCCGCGCACCGTATGAAGCAGCGGCTCGTCAAAGCCCTTTTCAATCTTGGAGCGCAGCCGCGAAATGTGAACATCGATCACATTGGTCTGCGGGTCGAAGTGGTAATCCCAGACGTTTTCAAGCAGCATGGTGCGGGTGACGACCTGCCCGGCATGGCGCATGAGATATTCGAGAAGACGAAATTCTCGCGGTTGAAGCGTAATATCAACACTCTGGCGGCGCGCAGTATGCGTAAGGCGGTCAAGCTCCAGATCGCCGACGCGGTAGATCGTATCCGCCTCGCGCGGGCTTGAGCGGCGCTGCAAGACCTCGACACGCGCCAGAAGTTCGGAAAAAGCATAGGGCTTGGTGAGATAATCATCGCCCCCAGCGCGCAGACCGGTCACGCGGTCATCCACCTCGCCAAGCGCCGACAGGATCAGGACCGGCGTTTCCATGCCCTTGGCGCGCAGGCCCGCCACAACGGAAAGGCCGTCGCGTTTGGGCAGCATACGGTCCACCACCAGCACATCGTAATTGCCGTTTTCGGCAAGTGCGTAGCCGGTTTCGCCGTCGCCCGCGATATCGGCCGAATGCCCTGCCTCGGCAAAGGCCTTTTCCAGATAACGGGCCGCTTCGCGGTCATCTTCGATAACGAGAATTTTCATGGCGCTACTATAGGTTCCGTTCAGCGATAAGGCGATGCCAGGCAGTTACCCGCCTGGCATCGGAATTCGATCAGTTTCCCTGCGAGGCGCCGATCATTCCTGATTGATCGGCAGCGCCACGAAGCGGCTCTGATCATTGCTCTGCAATTGCAGCAGCACCGCCTTACGGCCTGACTTTTCGGCTGCCGTGATGGCCTTGTTGATATCGCCAGCGGTCTTTACCGTCTGGTTGTTGACGCTCACGATCACATCGCCGGAGCGGATGCCACGGTCAGCCGCATCGCTGTCCGGGTCCACATCGGTAACGACCACGCCCTTACCGTCTTCAGACGGAACGACGGTCAAGCCGTAGGAGTCGAGCGTTTCACCCTGTCCGCCGTCATTGTCGTTGGACTGGCTGCCGCTCTTCCCCTTGTCATTGGGCATGGCAGCAATCGTGACGTTGATTTCCTCGGCCTTGTTCTTGCGCCAGACGGTCAGGGCCGCCTTTTCACCAGGGGCGATATTGGCAACCTTGCGCGCCAGGTCACGCGGGTCCTGAACCGTTTCGCCATTGACAGCCGTAATCACATCGCCCGCCTTGATGCCAGCCTTGGCAGCCGGGCCATCATCCTGCGGCGAGGCCACGATCGCACCCTTTTCCTCGGCAAGACCGAGCGAAGCGGCGATATCCTTGGTCACAGGCTGTATCTGGACGCCGATCCAGCCGCGCTCGACGGAACCCTTCTTGATGAGCTGGTCCACGACCTGCTTGGCGGTGGAGGACGGAATTGCAAAAGCAATGCCCACGCTGCCGCCAGATGGCGAGAAGATGGCGGTGTTGATGCCGATGACTTCGCCGGAAAGGTCGAAAGCCGGACCACCGGAATTGCCCTTGTTCACGGCGGCATCAATCTGGATGAAATCGTCATAGGGGCCTGCGCCGATGTCTCGGCCACGGGCCGAAACGATACCGGAAGTCACCGTGCCACCAAGGCCGAACGGATTGCCAACTGCGACAACCCAATCACCGACGCGCACCTTATTATCGTCGCCAAAGGCGACATAGACGAACTTGCGCTTCGGAGCGTTGATTTTCAACACGGCCAGGTCCGTGCGCGGATCAGCACCAATCAGCTTGGCATCAAGTTCGGTGCCGTCGTCCAGCACGACGGTATAGGCATCGCCA
TCGGAAACGACATGGTTGTTAGTAACGACATAGCCATCTTCGGAAATGACGAAGCCCGATCCTTGTGCAACAGGGCGTTCATGGCCCGGGCGCGGCTTGTTGGCCTTGCCGCGACGGTTATCGGAGCGTGAATCGCCACGCGGCTCCATACCAAAATCACGGAAAAATCGCTTCAGCGGATGACCGTCCGGCAACTGGTCGAAGCCGGGAGGGCCGAAGAACTGCGGACCACGGTTGGAAGTCTCCTGCACGTCCTTCTTGACGCGGACGCTGACGACCGCAGGGCGAACCTTTTCCACCAGATCGGCAAAGCCAGCCTGCTGCGGCGGCGTCACATGCACCGCTTCGGCGCGGGCTTCGTTCAGCGCACCAAGCGGGCCGGTTACGACGAATGCGCCGGCAAGCGCTGCGGAAAGCGCGACGGCGGCAACGCCTTTGCGATAGTTGGAAATCCTGGCTCTGGACATCTGTTTCTCCTTGCGAGCAGTAATCCGGGCCAGGAATTTCTGGCCTCGTAATTTTCTATGAGAGCAAGATAGTTAGAGCTACCTTACCGTTCAATTTCCGCAAGATGAAAGTTTCGTAAGGTTTCCAGAGGTTTTGTTGGCCTATTTTCCGTCGAGCAGGCGGGAAAGTTCCGCTTTTTCGCTTTCGGAAAGGGGCTGTGGGGCTGTGCCAGCAGCGCGCCGACGCCGGAAAGCAACAAGAAGCGCGCCGCCGCCAATCAGGAGAATAATAACAGGAAAGCCCCAAAGCAGCGCCGTCTGCGCGTTGAAACGGGGTTTGAGAAGAACGAATTCGCCATAGCGATCAACGACGAAATCAATCACCTGCCCGTCCGTATCGCCTTTTGTCAGGCGTTCACGCACCAGAATGCGCAGATCGCGCGCAAGCTCCGCATTGGAATCATCGATAGATTCGTTCTGGCAGACCATACAGCGCAGCTCGGCGGAAATTTCACGGGCACGCTTTTCAAGCTTCGGATCGGAAAGCACTTCGTCGGGGTTGACGGCAAATGCCGCCGTGGCCTGCAAGGCGAATGTTGCGCTGATAAGCAGGGCCCGGAAAAGATTATTCTTTTTCATGCGACGGCTTCCGCTTCCTTCGAGGCGGACCTGCGCACCTTGGAAGGCGCACCAACCCTGAGGCGGCGGTCGGCCAGCGAGAACAAGCCACCCAGCATCATGACAAGCGCACCGTACCAGATGAGCGTGACCAGCGGCTTCCACCAGATGCGCACGACCACCGCGCCATTGCCCGGCTCGTCACCAAGGGCGACATAGACCTGGCTGAACCAGAGCGTCTTGATACCGGATTCAGTCGTGGGCATCTGGCGCGCAGGGAAGAACCGCTTGGACGGTTCGATCACGGCCAGATCGCGGCCACTGGAATCAAGCAGCGTGAAGGCGCCCCTGTTCTCGGTGAAGTTGGAACCCGTGATGGGGCGCAGTCCTTCAAAACGCAGCGTGTAATTCTGGACTTTGGCGGTCCCGCCGGGCTGCATGACGAGCACGTTTTCGGTACCGAATGTCGTGACGCTGACAATACCGAGAAGCGTGAGACCAAGTCCGATATGCGCAAGCGACGTTCCGAAGACGGAGCGCGGCAGGCCCTTGAAGCGCGCAAACGCCTTGCTGGCCGACACCTTGCCAATACCAGCCTTGAGAACAAGATCGGTCAGGCTGCCGAAGATGAGCCATGCCGCAAGACCGATGCCGAGAGCAGCTAGGACCGAATGCGCGGAGGTTCGCCAGAGCATGATGCCGACAACGGCCAGCGACAATGCGAAAGCCGTCATCAGGCGCTGCCCGACGCCATAAAGATCACCGCGTTTCCATGCGAGCAGCGGGCCGAAAGGCACGGCAAACAGGAGCGGCACCATCAGCGGGCCGAAGGTCATGTTGAAGAAGGGTGCGCCAACGGAAATCTTTTCGCCGGTCGTAACTTCAAGAAGCAACGGATAGAGTGTTCCGATGAGAACGGTTGCGGCGGCAGTCGTC
AGAAAGAGATTGTTGAAGACCAGCGCGCCTTCGCGGGAAATCGGATGGAAAATGCCGCCTGCGCTCAGGCTTTGGACGCGCAGCGCGAAGAGCGACAGCGAGCCGCCAATGAAGAGCGCGAGAATGCCCAGAATGAACAGGCCGCGCCCCGGATCGGTCGCGAAACTGTGCACGGAGGTCAGCACGCCGGAGCGCACAAGGAAAGTGCCGAGCAATGACAGCGAGAAGGTGAGAATGGCGAGAAGCACCGTCCAGATCTTGAGCGCCGAGCGCTTCTCCATGACGATGGCGGAATGAAGCAGCGCCGTCCCGACCAGCCATGGCATGAGCGATGCATTTTCAACCGGATCCCAGAACCACCAGCCACCCCAACCCAGTTCATAATAGGCCCAGTAAGAACCCATTGCGATACCGCCGGTGAGGAACATCCACGCCATGAGCGCCCATGGGCGCACCCAGCGCGCCCATGCCGCGTCGAGACGGCCTTCGAGAAGCGCCGCAACAGCAAAGGAGAAACAGACCGAAAAGCCGACATAGCCCAGATAAAGAAGCGGCGGATGGATGGCGAGGCCGATATCCTGAAGGACCGGGTTGAGATCGCCGCCTTCCATGGGAGCCGGGAAAATGCGGGTGAAGGGATTGGACGTGAAAATGATGAATGCGAGAAAGGCGGTGCCAATCCAGCCCTGCACGGCCAGAACATTGGCACGCAGCGTTTCAGGCAGATTGCCGGAAAAGGCTGCAACCAGCGCGCTGAAAAGGGTGAGGATGAACACCCAGAGCAGCATGGAGCCTTCATGATTGCCCCAGACACCGGTGATCTTGTAGAGAAGCGGCTTTTGCGAATGGGAGTTCTCGACCACGTTAAGGACCGAGAAATCGGAAACGACATAGGCGTGGATCAGAGCGGCGGATGCCAGAACGATCAGAGCGAAGACTGCAAGCGCCGTTGGCACGGCCACTGCCATCAGTTGCGCATCACGGCGATGCGCGCCGACCACCGGCACGATGGATTGAACAATCGACAGCGCAAGCGCCAGCACCAGAGCAAAATGGCCGATCTCGACACTCATATGCAGTCTCCCCGGCCCGTCATTTGCCCTCCCATACGCCCTTTTTCTTCAGGCTGTCGGCAAGATCTTTGGGAACATAGTTCTCGTCATGCTTGGCAAGCACATTATCAGCGCGGAACAGGCCATCGGAGCCGAAACGCCCTTCCGCCACCACGCCCTGCCCCTCACGGAACAGATCAGGCGGAATGCCTTCAAACACCACTTTCACGGTCTTGATCGTGTCGGTGACGGTAAAGCGCAGCTCGCTGCCCGTGCGGCTGACGGAACCTTCCTCGACCAGGCCGCCAAGGCGGAAACGCGCGCCGGACGTCATGTCCTGTTCAGTAAGATCAGCCGGGGTGCGGAAGAAACGGATATCCTGATTGAACGCCGTCAGCATCAGCCCGACCGCAACAGCCAGCACGGCCAGCGCACCGCCAATCAGGAAAAGGCGCTTGCGCTTTCTCTGGCTTACCGTGCGGGCAAAGCCGCCCTTGCCTTTGGGATTACGTGCATTTTGTTCGGCGGTCGCGCTCATTCTTGTGCAGTCCCCACGTCCAGTCCAAGTGTGGTGGCGAAGCTTTGAAGTTCGGTCCGGTTTTCACCCTGAAGAGCCTTCATGCCGCGAGCCAGCGCATCCTGCGCATCGTTGCGGCGGTTGAGGATCATATAGGAGCGGACCAGCCGCTTCCAGCCATCGATATCCCCGCCATTCTGGCGAAGTGTTTCATCGAGGCGTTGAACCATGCCTTCCACCATCGCCTGCCGATCTCCGGCGCTGAGCGTGGAAGCCGCTTCGACATCTTCGGCGCTCGGACCTTTCGCCTCCGCCTGTTTGGCGCTCGCGGGATCACGAAGGATGGCAATGGTTTTCTCAAGCTGGCCGCGCCAAGGTGCATCCGCCGGAGCTTTGTCCAGAAAGGCCTGAAGGCGGTCAGCGGCACGATC
CGGATGGCCATCCTGCATTTCGCCCTGCGCAAGATAATATTGCGGACGAATATCATCGGGGCTGAGTTCGGCTGCTTTCTTGAACAGCTTCTCGGCCTCGGCCGTAACCGTGCCACCGGAAGCAGCCGTCAAAGCCTCGCCCAGTCCAAGAATTCGTGCAAGGTTTTCGCCAGCGATCCGGATGGATGTGTGATAGGCGCTGACCGCGTCGGAAGCCCGCCCAAGCCGCAGATAGATCGGCGCCAGCACATCCCAACCCCGCACATCACCCGGATTCTGTGCCAGATGCGCTTCAGCGCGGGCAATGAGGTCGGTGACGGAACTGCGATCCGGTGCGGCTGCAAGCCGTGGCGCGAGCGGCATGGAGGGCATGTCGGGCGCACCAAACAGCGGATAGATGCCCCAGGCGACCAGCGGCACGGCGAGAACTGCAACAAACGCCAGCACACGGCCGGAACGGCCCTGCCCGCCATCAGCCGTTGCGGCCATGGCATCCTTTTCGGCATTGAGGATGCGGCGGGAGATTTCGATGCGCGCCTGTTGAGCACTTTGCAGATCGATCATCCCGCGCGCAACGTCGGCCTCAACTTCGCGAAGCTGGTCGCGATAGACCTCAAGGTCGTTTTTTTCGGCGGGCAAAGCGGCTTGCCTGCGCCGCGTCAGCGGCAACAGGACAGCCAGCGTGGCTGCGAAGGTTAAAAGTGCTGCTACAAGCCAGAATTCCATGGCTGCACACTTAGGCGTTGCCCGGAGAAAAACCAACCGGCTACCTGCACTAAGTTGGGGCCTAGGGCAATTCGCCGCAAACCACCTTTGGCACCCGTAATCAGGTCAGCGGTGTCCAGCTACCGTCGGGATTGCGGCAAGCCGTGCCACGCACCGTCTGCTGATCCCCGCCAATGGTGAAACTATGCGAATATTGGCGGCAGTTCTGCGAGCCGACCTGATAGGGTTGCGCCGCCGTCACGTCGCCAGCGTTTGATCCGGCTCCGCTCCACAAAACCGATTTCCCCGCTGGCGAATATTCAAGCGCGCGATATTCAGCTTCCAGCGCCTTCCTGCGATCAGCCGCACTCAACTGACTGGCCGAATTGCCAAGCAGGCCGTTGCCAAGCGAAGCGAGCAGGTTCGTTTCCGGCTTCTGTGATGAGCCGCCCAGCGATGGAAAACCGCTCCCCTTGCCGCCGCCCGTCGTGCCGCAGGCCGACAGTGCCAGCGACACGACAAACATCATCGATATAACTGGAACAGGGCGAGAAAACCTGGAAACTATCATCATAACAAACCGGCCTTCATAACAATCAGCACGCATACAGTCGCATCATGCGACATTTGTTTTACCTTCTAATCCGCTTTCGGCCAAAGACAACCGTCAATCTTGCGTCAACGGCAACACGACCCGGACAGCCAATCCGCCAAGCGCATCCTTGCCAAGATGCAGACTGCCGCCATATTCGCGCACCGTATCCTGAACAATGGCGAGCCCCAGACCCGTTCCGGGCTTTGTTTCATCGACGCGGCTGCCGCGTTTAAGGGCTGCTTCAATCTTGTCTGCCTCAAGCCCCGGCCCATCGTCCTCAATGACGATTTCAAATTGCCTTTGCTCGCCGGCGACTGCCGCAAGGCGGATAGTGATGCATTTGCGCCCCCATTTACCGGCATTTTCGAGAAGATTGCCGATGATTTCCTCCAGATCTTCCCGTTCGCCTGCAAAGACGGCACCGGGAAGATCGTTCCTGAAGGAAATATTGAAAGTTGGATGCAATTTTGCCGTCACCCGATGCAACCGCTCCAGAACGGGTGTGACAGGCGTTCGGAAAACCACACTATCGCGCTGTGCTGCGATGCGCGCGCGTTGCAGATAATGCTGGATTTGCACCTGCATGGCCTCGCTTTGTTCCTGCACGATGCGTCCGGGCGCACCGCCCATGGCACGGGCTTCGTTCACCAGAACGGAGAGCGGCGTTTTCAGCGAATGGGCGAGATTACCGACCTGGGTT
CGCGACCGCTCCATGATGCGGCGATTGTTTTCTATGAGCGCATTCATTTCCCGTGCCAGCGGCGCGATTTCCAGCGGCAAGGTTGCATCCAGCCTGGACGAGCGCCCTTCGCGGATATCGGCAAGCGCCTGACGAACCTTATCCAGCGGGCGCAGGCCGAAAAGAATGACAGCGGCATTGATGAGAATGCTGCCAATGCCGAACACGCCAAGATAGACAAGCAGCCGCGCCCGGAAATTTGCTATTTCGTTGAGTACTTCGCTAAGATTGCCCATGACGCGAAAGCGCGCGACACGGTTGGAATTATCCAGCACCACTTCGGTTTCGACGATGGAAAGCTCTTCATTGTCGAGGCCCGGCAAGGTGTAGCTGCGCATGAAGGAGCTGTCGAAGGGGGCTTGTGACACGGGCATTTCCGGCACGATGCGCCCCACCAGCGAAGGCGATTCAAGCTTGCCGGTCAGATTGGGCGTTACCGGATCGACGGACCAGTACCAGCCCGAAAGCGGGCTCGAATAGCGCAGGTCCCCCAGTTCGGGGCGCCCCTGTAGCGTGCCCTCGCCCGAGACGCTGACCGCCCCGACGAGGCTGAAGAGATGTGCGGTCAAAAGCCGTTCGAAATTGTTTCGCGCGGCCTCGCCATAGAGTGAGCTGATGAAAGTGGCGACCACGACCAGCGCCACAATGACCCATAATGTCGAAAGTGTGACGACGCGGACGGCGAGCGAGCGCCAGGCGGGAAAGACGCGAGGAAGCTTCAGCTGCCGCTCCCTTTTGCTTCGCCCTCGCCGCCCGAGCGCATACGATAGCCCATGCCGCGAACGGTCTCGATGAGATCAACGCCCATTTTCTTGCGCAGGCGCCCGACGAAAACTTCAATCGTATTGGAATCGCGGTCGAAATCCTGATCGTAAAGATGTTCCACAAGCTCGGTGCGCGAAACCACTTCGTCCATATGGTGCATCATATAGGACAGGAGGCGATATTCATGCGAGGTCAGCTTGAGCGCCACGCCATCGATACTCGCCTTCGACGTCTTCGTATCGAGATGCAGCGGCCCGCAGACGAATTCTGACGATGCATGACCGGCAGCACGGCGGATCAGGGCGCGCAGCCGGGCCAGAACCTCTTCAATATGGAAAGGCTTTGCCACATAGTCATCCGCACCCGCATCGATGCCAGCCACCTTGTCGCTCCAGCGATCGCGCGCGGTCAGCATGAGAACAGGTATGGTGCGCCCGCTGCGCCGCCAGCGCTCCACAACGCTGATACCGTCCATCTGCGGCAGGCCGATATCCAGCACCACCGCATCATAAGGTTCTGTATCGCCCAGATAGTGGCCTTCTTCGCCGTCATAGGCGCTGTCAACGACATAACCTGCGGCAATCATCGCTTCGGAAAGCTGGCGGTTCAGGTCCTTGTCGTCTTCAACAATCAGGATACGCAAGCGGACAGTCTCCCGTCAAAAGCTACAACGATAGCCCAATAGAGCGGCTACAGCCGATCCATCAATAAAGAGAACGCTTTATTGGGCTGGAACCGCCACTTCGACGCGGCGCGGACGTTCACCATTGCGACCGGGTATCAGCACGACGATAACGCACATGGCCCTGCCATTCTGCATGGTCGGGGTAGCCTTGGCGAGCTGACCGCCCTGCTGGGCGGCCACCTGTTCGCCAACCGCCGTGCAATCCCCGGCAGTGGCGATCAGAAGGTTGGGCTTCTGCGGCGCAGCCATCGGCAAGGCACCGGCATTGACCGGCAACAGGCCGACACTAACCGCCAGCAGCGCAAAAACTTTGAGAGCAGGGTTCTGTTTCATCATGCCGCCTCATATAGCACCCGAGAGCTGAACGATGCATGAACAACAGTTTCTTCTCCCGGGCCTAAAGGAAATTTTTCAAATTTGACGTTAGGCTGTAGTGACGAGGTTTATGCCGGACAAGTATCAAACGTCGAATTCAATCCACCAGGTTCATAAAATTATAAAATGCGGGATG
TGGCGCCAATGCGGCCGATTACTGTAACGACACCCGCAATGGCCGTGGCCAGTTGCAGCAGAACATCCGTCATGTTGCCCTGATCGATCATATCCGTCGCCACGCCGAACAGGCCCGCCACCGAAAGGAACAGTGCAACCAGCCCTGCCCAGACCGTGCGCGAAAGATACCATGATTTATTGGCTGTCATATTGTTTTCTCCGATTTTTTTAGAGCATGATGACGTCAGGCCGTCATCCACTCCAAGTCCTTGTTTAAGCGTGATCCTGTTGCGAAAGCCGCTCATACTTTTCAAGATCGCTCCCTGATTTGCCCGAAGGGTTCGGGTCGACATCCAGAACCGCAAAATCGCCGGGCCCTGTTTTTGCCCCGATCATCGCCACGCAGAACCGGAATGCATCGTCGCCCAGTTCCGCCCGGCGCTCCGCAAAACCGTAATTCCAGGCGGGCGCCGACACCTGTTCGCTCCGCACCAGAGTGTCGTCCCGCCAGATCTCCACGCGATACGCCTCCCGGTCCTCACCAAGCGGTATATCCTCGCCGAGCCAGCTATCGGCATCGATCCGCCCGCGCCTGATCCATGTAAAGCTCATGTCACCATCGAGGGATCGCATGACACCGAGATGCACCGGACTAAGCGGACGAAGCGCACGCAAGCCGCCGCTCTGGCGCACCGTTTCAAAGAATTCATCCGAAAAAGCCTTGCCCGCAGCGCCCACGCGCCAGCTAAGCTCCAATCCAAGCTCGGATGCTTGCAGGCCGGCACTCACAACCCCGCTATCCAGCAGGATGAATGGCGTTTCAACCGGCTTGTCATCCAAAGCCGCCGCCTCCGTTCCCAACTGGCCGCGCAGCAGCCTGCCAAGCCGCCAGCGGTTGAGCCCTACTTCCTCCGCATCGAGAAACTGGAAGATTTCCCATCGGCCATCCGGCGATCTGAGCAGGCCGGTATTGGCTCCATTCAGAACCTGCACCAATGGCCGCGACTGCAATTCGCCGGAATAGAGCACCACTTCCACGAATTGCCCCTCGATCAGCCGCCCGCTGGGCCCGCCTGCGAGCGGAGCCGTCAACTCGCCCATGATGGCCCGTTCGCCAATGAGTGCACGTTCCGCAAATCCGTCATTCGACGGCGAGGCATAGACTGCAACGCCGCGCCAGAGCTTGGCATGGCAGGCGATGCGGAACTGCGCCGCCGGGTCTTCAGCACCCGGCCACAGCGGCAGATCGACGAAATGAAAGACCGGCTCCATATCCAGTGCCGGGCCGCCCGGCGGGCGTGGCGGTGTTTCCCCCTTGTCAGCGAACACAAGGTTTGGCGCGAGCGCAGCAGCCGTGACGGTACGCATTGCACCGTCTTCCAGTGCCGTCACAACATAATCGCGCTGTCCGTCCAGCACACCGAGGCGAACCCTATCCCCCACATGCAACGCTGCATTGGACCATGGCAGTGAAAAGCTTGCCGTGCGCCGCTCGGCATGGCGGCGGGCCATCCATGCTTCAGCAAGCGCCGTCGCCTGACCGTTTTCCATCGAACCGGAAAGGCTGAGGCTTTCCGTGCCCTGCCCCACCTCGCGGCGCGCCGACGCCCCGACGATCTGAAAATCGCGCAGCGGGTCATTGCAGTATAATTCAGCGGTGGAGGGCAGATCACCCTGATCCTCGACCACCACCGTCAGCGCCTCGCCTTCGTCGGGTTGTGCAAACTCACCCAGTTCCAGCGCTGCTTCGGCACGGGTGATATTCCTGAAAACAAACTGCCCGGCCCGCTCATAGCCATGCACTCCAAACACATTCAGAAGCGGCTCCAAAACACCGCGCGCGCTTGATGGTTCTGACACTATGAAACTGGAAAGATGTCCTTCCACACCGATGCAATCTGCTTCAGGAAGGCCGAAATCCCTGAGGATCGCCGCGATCAACTCATCCAGAGCAATGCCGCTGATACGCCCGTTAAGCCAATGGCCAAGTCGCCAGTTGGCCGTGTCGCCCCATATATC
CTGCCCCAGCGGAAACTCCGGGAAAGGCCGGGCATCCCACGCCCAGAGATAAATGCGCTCCATATCGAGCATCGGCCCGCCATAGACCGGCGAGAGCGGGTTTTCACCCTGCCAATGTCGATAATGGGCACGCAGAAAGTGGTCCATTGCGGCATCGGACCGCAACCCGTTGGAAAAATAGGGGGCGGCATTTTCCGAGGATTTCGGATCGGGAAAGACATTGGGCTGGTTCGGCCCCTTGTCTACCGCCGGGCAGCCAAGCTCGGTAAACCAGAGGGGCTTGGATTGCGGCACCCATGCGGTCGGCGCAGCCACCTCCACACCATCGATGCGGTTATAATGCCTGTTGCTCCACCAGCCATGGAGATCCTTGTAGCGATAAACCCAGGGCTTGGCCGCAAGCCCATCGGTGATTGGTGTGCGCCTGCGGGCCAGCCGGTCCTCGGCACTGGCATAATACCAGTCGTAGCCCTCGCCGGAATTGACGCTGCGGCTTAGTCCTGCAAGATCGTAAGGCGTTTTGAACCCATCAGGATTGCCTTCCGAAAAATCGCTGTCGCGCCAGTCGGCCAGCGGCATGTAATTGTCGATACCGATGGCGTCGATGGCCGGATGCGCCCAAAGCGGATCGAGGTGGAAAAAGAGGTCGCCCGTTCCATCCGACGCCTGATAACCGAAATATTCCGACCAGTCCGCACCATAGGTAATCCGGCAGCCAACACCAAGCTTGCTGCGCATTTCAGCGGCGAGCGCGCAAAGATGTGAAACGAAGGGGAAGCTGCCGCGCCCGTCACGAATGCTGGTGAGCCCGCGTAACTCCGACCCGATCAGGAAAGCATCGACCCCGCCAGCCCGCACGGCCAGATCGGCGCAATGGTTGAGAAAACGGCGATAGCCCCATTCCCCATTCACGAAGGCTGCGGCCTGTTCACCGGCCGCAGGCGTCCCGTCTGGCGAGCCTTCCTGCCCGATTGCCGGATGGCAGGTGATGCGTCCGCGCCATGGATAGGCAGGCTGGCCAATACCGCCATAGGGCGAAGGAAGCTGATTTTCCTTTGGCACATCCATCATGATGAAGGGATAAAGCGTCACGCCAAGGCCGCGCGCCTTTGCATCGCGAATAGCGGCGATCACGCTCGCATCCGAAGGTGTGCCGCCATAAGCCGCACCCTCCCCGCTCATGGAGATCAGGTGTGCTGCACCGCGCGACACGTTCTCCACCTTCCATATCGTGCTCGGCTTGCGGACAGAGAGGCTCGTAACACCGGGGCGGATACGGCATTGCCCGGCCCGCAGGTCATCCCCGAACCAGGGCAGCACGATGGCCACATGGCGCAGGCCGGGGCAGAGCGCCTGCAACTCATCCAGCGCCGCCGCCCAGTCGCTGCGGGCGCGAATTGCGTTCCGGTTGATCCATCGTTTTTCGCCGGGTACGGGTTCGTCGCTGACCGTATCGGGCGAAAGGCCGAATTCGGTAGAACCGGGGATGAGCGCCACGGCGCGCATGTTCCGTGCCACCTCGCCCACCGGGCGCATGACCTCGAACTGGAACTGCGGCAGGCGGTTGCCGAACGTGTCGAGCGGGATGCGCTCGAAAACCACATAGGCCGTGCCGCGATAAGCGGGCGCATTGCCGGTTCCCTGCTTCGCCTCGATCATCGGATCGGGGCTCTGGGTCGCCGTACCGCAATAGACACGCATATCGATCTCGGTCAGGTCCAGCTCCTGCCCGTCCGCCCAGACGCGGCGAATGCCAGCAATCTCGCCTTCCGCCACAGCATAGGCCGCATTGCCGAAATAGCTGTAATTGGTGACTTTGGGGCCACCCTTGCCGCCCTGACGGGTGGTGGTTTTGCGCTCTTCAAAGCGTGTCGCCCAGATCAGTGTGCCGGAAACCCGCGCCGTGCCATAAATGAAGGGAAGTGCGCCACCTTCTTCCGCTGTCGCCACGCGGCCGCCATTCAGCCGCGCGCCTTCGATATGACGGGTGGAATTGAGAA
GCGCGTTATCAATGGCATAACCGCCCATCGCACCAAGCCCAGCGCCAATGGCGGCCCCCACCGGCCCGAATATGCCGCCAACAGCAGCGCCCACGGCCTGCAAAACAACTGTTGCCATGAATCAGACCTTTGGTTCAGGAAAAAGAAAAATGCCTGCCATGCGCCTGCGCCATTGCGGCACCAGCGCCGAGGCCAGCACGCCATGGCCCTGATAGGCATGGATGAAGCGGCCCTCGCGCGCCATGATCCCCATATGCTTGGCCGCAAAGCCCGGCTTCCAGCGAAACACCAGAAGATCACCCGGCTGCGGCGCGTGCTCCTCACGCCGCACCATATACCGCACCGCTGCCTCCAGCATGGGGTCGCCCTGCGAAACCTCCGCCCAGTCGGGCGCGTAGACGCCCGGATTTTCCGGCTCCACGCCGTAAAGCGCCCGCCAGATGCCACGCACCAGCCCCAGACAATCGCAACTCACGCCAAGCGTGGAAGCGCCGTGCCGATAAGGCGTTCCGATCCACCGGTGCGCCTCGGCAAGAACCCGTTCGGCAATCATCATGGAACGAGAACGCCCCCATCGTAATCATTCGTGCTGTTGACATAGGCATAGGCGGCATCATTGCCGGGCAGATGCGGAAAGCCGCGAAAATTGACGCCGTTGGCAAATTTCGCCTTGCAGGTGGCGAAGCTCTTGTCGCACCCGCAGACAAGCCGGAAAGCATCACCCGCAGCCACCGGCAAGATCATCGGCTCGCCAAGTTGCAGGCTTGCCCCCGCATGGCCGACAACACGGACCGCCCGTCCCCGATTGGCCCCGCTGGTCCACGCAAGACGGCCCTCGGAAAACCAACCCGCCGCAAAGCCATCAACGCCCGCCACATCAAGCCGTGTGCCTTCGGCGACAAGCACTGTTCCCTGCGCAAAAAAACGGGGATCACCCGTATCGATGCCGCAGCGTTTATCCCCCAGCATCGCATCGCAGTGGCGCAGGATACGCCGCCCGCGAACCGCATCGAAGGCTGCGGCAACACCTTTCAACTCCATAACGAAGCGGCTGCCCGAGCGGCTGATTTTGCCCGCCGCCCAGCGCCGCAACAGCATGTGCTGATCCGGCTCGTCCCAGTTGACGAGAAAAGCTTCGATCGTGGCGCCATCGTAGCGGCCCTGCTCGATATCCTCGTCGCTGATCTGCGTGGACGAAAGGACCCCCTCCACCTCGCCGCCAGCAATGCCGAGGCCAAGGGCGGTTGAGGCCTCGCTGCTGTTCAATCCTGTCAGCGGATCGCATATCACCTGATCGACGGTCAATGGCGCATCGTGGTCGGTGAACCCTAAAACAGCACCGTCGAGCCGTCTTATAAGCCAGGCAAAACAATGTGTTGTCACCTCGCCTTGCAAATGTGATTCAAGCGCGGGCGGGACCGGGATCATCTCTTCACCTCGACAATGGGAATGGATGGAATTTCGCCTGCCTGGAACGAGGCTATGCTGGCGGTGAGACGGTCCGTATCGAAGCGCGCAGGCACGTCGAACAGGAAGCCGGACGTGACAGGCACATCCCTTGCCGGCAGATAATCCGGCGTAAAGGTTACAATCCCGGTCAGTGGATCGACGGTAAATGCCTCGCCTTCCGGCACCTTCACACCATCGACACCGATAACCACCGAGCCCGGCACGGGAAGCGTTATTGGACGGTCATAGCTTTCATAATTCTTGCGAAGCTGAAAGTGCACCGCCACTCCATCGCCGGTTCCAAGCGGCTGATCGAATGCCGATAGGGACGCCTTGCCGGTCGCGGATGAAAAATCGAACGGATCGCGAAAACGAAAAGCGTGCAGGGAACCGCGCCGTGCCTCGAAAAAGGCGAGGACCATCCGCAGGTCGTCCAGCGAGCGCAGCCCCGTTCCGGCATCAAAATGCCGCCGGGAATGTGCCCAGCGGGCGTTGCGCTTTTCCAGGCCGGAGGTCAGCGTCACAATCTCGTTGCGCCATTCCGGCCCTCC
TGTCGCACCAAACGATACGCCAAGGGGAAAGCGCACATCATGAAAGGCTTCGACCATGTTCAAAGCCTCCGCGCGCCGCGCCGCACGGCGCCTGCCAGCATGGTGGCAAGTTGTGCTTCGGACTTACGGAAGGAGGATGCATCGGGCGAGGTCATGTTGAACACGACCTGCACCGGTTTGCTGCCGCCGCCGGTGGCAATGCCAAGACGCCCGTCGCTGCCGCGCGCAAGCGGCAAAATGGCCTCGGCGCCCGCTTCACCGGCAAGCCCCAGCGAGCCGTTGCCCATGCCGAAATAGGTAGGGCTTGAAACCACTCCCCCCTTGGCAAAGGGCATGATACCGCGAATGCCGCTGAGCAGGCCGCCCATCATGGAAGAGGTCAGGCCCTGAAGCGGCTGGAGGCCTGCCGAGAGAGCCGTGCCCGCAAGGCTCGATGCAAGCCCGCGCAGCACGTCTTCCAGCCCCTTGCCGGATGTGATCGCACCTTTCAGCGCCGAATTGAGGCTGTTGCCGAAGCTGGACGAGCGTTTTTCAAGGTCGCTCAAGGCGCGATCAAAGGCGCTCGTATCCGCGTTGACGGATACGGTTACAGTTTCATCTGTCATGAATTTACCTGTCAGGGAAAACAAGCATCAGCGCGTCGAGCGTCTGGCGCGAGGGGGCATCAAGCACAGGGGCCAGGGGGCCGATCGCCGCTGAAAGTTCGCGTGGCGTCATCGACCAGAACGCCTGCGGGTGAGCCGCAGCAAACCGAAACCCGCATAGATCGCCTCGTCCCATGGAAAAGGCCGTACAGGCGAAGGTTTCGATTCAACTGCGGCACTCAAGGGTTTGGCGCAGAATCCTTTTCAGACGTTCCGAAAGTAACCGTCAGCAGCGAAGACACGATATGCGCAAAGCCTGCCACGCCACCCTCTGCCCGCATGTCGGCCACATCCTCTTCGCTTACCGTATGTCCGCCACCGCGAAGCCCCGCGCAGATAATGCGCTGCATATCCCGCGCCGAAAGCCGGCCCGAGGAAAAGCGCGCCACAAGGGCGGAAAGATTGTCCGTCTCGAAAACCGATTCCAGTTCCGCCAGCGCGCCCAGCGTCAGGCAGAGCGTCCAGTCGCGGCCATCCAGTCTGGCGGCGACTTCACCGCGGTGGCGATTGGCCATCATCATATCGCCTCTCCGAAGGTAATCAGGCTTGCCGATTCCAGCGCGATTTCAAACGTCACCTCGGCATCGTGATTGCCGCCATATTCCAGTGCGGTGATCTGGAACGGCCCGCTGATGGTACCAAAATCCGGCAGAACGATCTGCCAGTCGCGAATTTCGCCATCGAAGAAAATGCGCCTTATCAGGGCATCCGAGGCCGCATCCTTGAAGATGCCGGAACCGCTGATCGAGGCACGCTGCACACCGCTGCCCGCCAGCAATTGGCGCCAGCGCCCGGCAGCATCGGCATCCGTCACATCGACGGTTTCGGCATTGAACGCGATGCGCTTGGTGCGCAGCCCCGCACAGGTTTCAAACGTGCCATCATCGCGCACCGTTTTAAGCAAGATATCCTTGCCTCGTTGAGCTGCCATATAAATCTCCTTATCCCATACCAGCAGCGCGCTTCATATCGAGCTTCAGATCAAATCTCGCCATGCATGTTCCTTTCGGCAGGCAAAACGCCCGGCAGGGGCGCCTGTTTCAGCCGCTCGGCAAAACGCCTGAAAACCCCAAGCGCCGACCAGGCGGCAAGGCTTGCGGCAGCAGACCCCATCAGCATCAGTTCCGCGCGGCCAAGCGCGTCACCAAGCGCCAAAGTTTCAGCTATCTTCACACCCGCCGCGCCACCGAAAACCATGCCGCAGATGATGCCGACCGCAAAGCGTATCGCTGCTTCCCGCTTGCCATGCGGCAGCATATAGGCCAGCGAAACCGCCGAACCTGCGACCGCGCCAGCAATCTTCGCGAACCACAGCCATGCGGTATCGGAAGCCAGAACCGCGTCGTTCCAATTGTT
CACGACACCCTCCTTCCCACGCGCGGCTGATAGCCAACCGCATCGCGCTTTTCATCGTCGGTGAGGAAAGAGGCTTCCGACACCCGCCGCCAGAGCGATTCCCGTTCCGCCGAAAGGCCCTCGATCCTGTCGATATCGTGTTCAAGCCGCAGGCCGCCGCCAAAGAGCGGCCCCAGCCAATTGCCAAAAGCCTTGGCGGTACGCCCGATCAATGGCAGCACGGTCAGCCGGTAGAAAGCACGGTTGGCCTCGGCATAATTGGCATAGGTATTGTCGCCGGGGATACCGAGCAGCATGGGCGGAACGCCGAAGGCGAGCGCAATATCGCGCGCCGCGCCGTTTTTCGCCTCGATGAAATCCATATCCTGCGGGCTGTAGCCCATCGCCTTCCAGTCAAGGCCGCCTTCCAGAAGCAGCGGGCGCCCCGCGCCGGAAGCGCCGGTATATCCCTCCTCCAGTTCCGCTTTCAGCCGCTCGAACTGTTCCTCCGTCAGATTGCCGCCTTCCTTCGGGGCATAGACCAGCGCACCGGAAGGACGGGCGGAATTATCCAGCAGGGCTTTGTTCCACGCGCCCGCCGCATTATGAATGTCGAGCGCCATCAAGGCCGCTTCGAGCGGCGGAAAACCATAATGGTCATCCAGCGGATGGAAGAGCTTCAGATGCAGGCCGGGAGCGTCCAAGCCTGCAACCGGGATAGTGCGGCTTGCCAAACCAGAGCGATAGACCAGCGCCTGCGGCCAGCCATCGGCATCGGTTTCCACCGTCACCCGCTCCGGTCGCAGGAGATGGAGTTCCATCCTGCCGCTTGGCAGATCCACCCGCTCCACATAGGCATTCCCGGCAATCAGCAGGTGCCCGTAAAGACGCTCGAAAAAGCTGCCGCCCTCCACCGCCCCCTGCGGATGGGTGAGAAGGTCGAGAAGAGGATGCGCCTCATGTTCCGTCGCCCCCTCATAAAGCAGCCACGGAATGGTGCTTGCCGCTTCGGCAATCAGGCGCACGCAACGATGCGCCACCGGATTACGCATGAAGCCCTCACGGGCAAGTGTCGAATAATCACGCGCGATCCAGGATGCATTCCGCTCCATATGCAGCGCCACGAAACCGTTCGCCATTTTGGTCTGGGATACTGCATTTGCCCGCGCGGGCAGGTTCGCGGCGCTTTTGCGCCACGGCCAATTCCAAGCCATAGGATGGCCTCTCCATGAATAATGATGATCTTCAGCCGAAACGGCGGATACGGGGTTTGTGGTCCGCACCGAGCATGAGTTCGCCCAAAGCCCAGACCAGCGCGTCAAGACGATCGGGCGAGCGCCCGCTGGAAAGCCCCTCAGGCGCGAAATCGCACATCTCGTCCTCCAGTGCCGGAAAGCGCCCGGCATGGCGCACCCGCCCCTGCTCGTAAAGGGCAGCCACAGGTTCGGCACGCAGCCACTTGCCGCGCGAGGCGCGCCGTTTCAAAACCGGCACGGAAGGATCTTCCGCCGCCAGAACCGCCGCCACCATTTCCCCGCCCTGATTGACCTCGGCCACAATCGCATCCGCTTCATGCGTATGGTAGAGCGCAATGGCGCGGCGCGCCCACTGGTGCGGCTTGGCCATGGTCATGCTCTCATCGGCAAGCACATGGCCGACCCCTTCCGCGTCAATTCCGGCCACAACGATGCCGCAGGCATCCGACGCTTTACCCGACGAGGCAGGCGGATCGACCGCCACGACGATACGCGCCAGCGGCGGCGGATTTTCCTCGAAACATTGCTCGATCCGGTCCCGCGACCAGAGTGCTCCCGGGCGCTCCTCCACCAGTTCCCCGTCCAGTTCCTGCCGTCCAAGGCGTGTTCCCGCATAGCGGCGCGCAATCGTCTGCATGAAACCTTCCGCAAGATTGCCGGCATTTTCCGCTGTGCGCATATGCGTCATGGAAACAGTCCTGTCCGTCAAAAGCGCCTTGAGAAGCGGCACCGCGCGCGGGGTCGTCGTCACCACCTGACGCGGAT
TGTCGCCAAGCCGCAGACCAAATTGCAACATGTCCCACGTCTCCTGCGGGTTTTTCCATTTGGCCAGTTCATCACACCAGGCCGCGTCGAACTGGGGGCCGCGCAGGCTGTCGGGGTCTTCCGAGGAAAAAAGCGTGGCCACCGCACCATTATCCCAGATGAGACGGCGGCGGGACGCCTCGTAGCGGGGCCGGACCAGTCGCGAAACCGAAAGAATGCCGGATGGCCCATCCACCATCACCTCACGGGCATCGTTGAAAGTCTCGCCCACCAGCGCAATATGTCCGCTCGGCTTGCCCGCGAAAGGCGGCAGCCCCAGCGCCATGCCTGAAACCCATTCCGCGCCCGCCCGCGTCTTGCCCGAGCCGCGCCCGCCCATGATAAGCCAGACGCGCCAGTCGCCATCGGGCGGCAATTGCGCATCACGCGCCCGGATCAGCCATTCGTCTTGCGCTGCCAATATCTGCCGAAACGCCAGGCCCAC
Protein sequences of DBSCAN-SWA_3 >NZ_LT962938|1356234:1404640|1385081_1386623_-|WP_004683321.1|protease|DBSCAN-SWA MSRARISNYRKGVAAVALSAALAGAFVVTGPLGALNEARAEAVHVTPPQQAGFADLVEKVRPAVVSVRVKKDVQETSNRGPQFFGPPGFDQLPDGHPLKRFFRDFGMEPRGDSRSDNRRGKANKPRPGHERPVAQGSGFVISEDGYVVTNNHVVSDGDAYTVVLDDGTELDAKLIGADPRTDLAVLKINAPKRKFVYVAFGDDNKVRVGDWVVAVGNPFGLGGTVTSGIVSARGRDIGAGPYDDFIQIDAAVNKGNSGGPAFDLSGEVIGINTAIFSPSGGSVGIAFAIPSSTAKQVVDQLIKKGSVERGWIGVQIQPVTKDIAASLGLAEEKGAIVASPQDDGPAAKAGIKAGDVITAVNGETVQDPRDLARKVANIAPGEKAALTVWRKNKAEEINVTIAAMPNDKGKSGSQSNDNDGGQGETLDSYGLTVVPSEDGKGVVVTDVDPDSDAADRGIRSGDVIVSVNNQTVKTAGDINKAITAAEKSGRKAVLLQLQSNDQSRFVALPINQE >NZ_LT962938|1356234:1404640|1393681_1393978_-|WP_002971526.1|DBSCAN-SWA MKQNPALKVFALLAVSVGLLPVNAGALPMAAPQKPNLLIATAGDCTAVGEQVAAQQGGQLAKATPTMQNGRAMCVIVVLIPGRNGERPRRVEVAVPAQ >NZ_LT962938|1356234:1404640|1365231_1366308_+|WP_006266661.1|DBSCAN-SWA MLEPCRKQGGSDNQKPIPLFPYPIHFLAVIRHYCRLMSKSRLNLPFLILVAATLVSIPLVLSFLNSLHPAFDTISHLRIHLAVLMGLLALPLLFTKLRREGAMILLLAVFAIAVTPHVFPASEDAHAREAAQPHYRLLQMNLRFDNGSPEQALSLIAHIRPDVVTLEEVSAMWREKFGHIASAYPYSIFCPHPGAVFGVAILSRRPFIADSTPACDPKGMMAVASVDFGGCPVDVAALHLHWPWPFQQSEQIEALSEQFRGLSENAILSGDLNATPWSATTKRIAELAAMTPAPPTGPTWLYRRLPASLRFAGLPIDQTFAKGRVAISKITRQQPIGSDHLPVLVEFSIIPQPETVAS >NZ_LT962938|1356234:1404640|1370798_1371758_-|WP_002963767.1|DBSCAN-SWA MNAHTRAAADAARLEMNPMLGIGLKIASVAVFVAMSTLLKAAEGVPVGQLIFFRSFFAIFAILLYLGWRGQLGGVFSTRHGFSHFWRGLVGVCSMSMSFFALTKLPLPEAIAINYASPLITVILGAAILHEVVRFYRWSAVLIGLFGVMIIIWPRLTLFSAGSVGRDEAVGALAALGAAVMSAVAMMLVRRLVQTERTPTIVIYFSISASVIALVSLPFGWVVPNWSQLAMLVGAGFAGGIGQILLTECYRHAPMSTIAPFEYTSMLLGLVIGFMLFGDIPTFEMLIGSGIVMAAGGFIIYREHQLSLAPHKVNAPQAQ >NZ_LT962938|1356234:1404640|1386764_1387241_-|WP_004683319.1|DBSCAN-SWA MKKNNLFRALLISATFALQATAAFAVNPDEVLSDPKLEKRAREISAELRCMVCQNESIDDSNAELARDLRILVRERLTKGDTDGQVIDFVVDRYGEFVLLKPRFNAQTALLWGFPVIILLIGGGALLVAFRRRRAAGTAPQPLSESEKAELSRLLDGK >NZ_LT962938|1356234:1404640|1361230_1361488_-|WP_004683353.1|DBSCAN-SWA 
MAQFQSFNHNENPAAQDRILLEMGMKYAIGRDCEIDVIEAHKWLNIAAIRGNQKAERMRNQVAATMSKSELAAALRCAREWMTAH >NZ_LT962938|1356234:1404640|1389248_1389746_-|WP_002963757.1|DBSCAN-SWA MSATAEQNARNPKGKGGFARTVSQRKRKRLFLIGGALAVLAVAVGLMLTAFNQDIRFFRTPADLTEQDMTSGARFRLGGLVEEGSVSRTGSELRFTVTDTIKTVKVVFEGIPPDLFREGQGVVAEGRFGSDGLFRADNVLAKHDENYVPKDLADSLKKKGVWEGK >NZ_LT962938|1356234:1404640|1371983_1374635_+|WP_004683334.1|DBSCAN-SWA MRTETGHTFRLEDYRQTPYAIPETKLDFTLEPEKTIVRATLTIERRSDTPAGTPLVLHGDELKLVSLAIDGKALSDNSFSATPDQLTISDLPKDVRFTLQIVTEVNPTANRQLSGLYRSSGVYCTQCEAEGFRRITYFYDRPDVLSVYTVRVDADRKAAPILLSNGNPVENGMVEGQPERHFAVWHDPHPKPSYLFALVAGSLGVVKDHFTTRSGRPVDLAIHVEHGKEGRALYAMDALKRSMKWDEEKFGREYDLNVFNIVAVSDFNMGAMENKGLNIFNDKYVLADPETVTDADYAGIEAVIAHEYFHNWTGNRITCRDWFQLCLKEGLTVYRDHEFSADQRSRPVKRIAEVKILKAQQFPEDAGSLAHPVRPREYREINNFYTATVYEKGSEVVRMIRTIIGPELFRKGMDLYFERHDGDAATIENFIQVFADVSGQDFSQFALWYDQAGTPKVEAGFHHDAAAKTFTIKLEQSLAPTPGQSIRKPMHIPIAFGLIGPDGKDMQPSSVEGGEVRDGVIHLRRPSETIVFHGIEARPVPSLLRGFSAPVNLAAPLTAEDRVFLALNDSDPVARWQAMNSIFSATLLDGAKRVRGGHQPETDPKIVALAGKVAFDEMLDPAFRALCLTLPSESDIAREMGNNVDPDAILASRNHLIAAIASGYADGFAGLYDTLKQEGAFSPDAAPAGKRALRSALLDYLSVQEKSPERAERQFVEADNMTDRATALAVLVHRFGDSGEARQALATFEQTFGQDALVMDKWFIVQATRPGETALEAVRELTRHPLFSLDNPNRVRALIGAFTASNPTGFNRQDGAAYGFLADTLLTIDPKNPQLSARLLTAMRSWRSLEEVRREHARAALARIAGAGKLSTDLRDIIDRTLA >NZ_LT962938|1356234:1404640|1400765_1400936_-|WP_087909548.1|tail|DBSCAN-SWA MGRGDLCGFRFAAAHPQAFWSMTPRELSAAIGPLAPVLDAPSRQTLDALMLVFPDR >NZ_LT962938|1356234:1404640|1369537_1370794_+|WP_005969556.1|DBSCAN-SWA MAGQLLPIAALLASTFLMLLAGGLAGILLPLRGGMEGWSTTTIGWMGTSYSLAFTIGCIFIPHLVRRVGHVRVFSALLTLLSMALLFHALVVNPAAWMIFRGITGFSLAGSYMIIESWLNERVTNESRGMIFSIYMIITMVGLLLGQYILPFGNAATQTLFIICAIIYASALLPTALSSAQSPNPLTQVSLDLKGLYRRSPAAVVGSFIAGIVAGTWNFLAAIYGEMNGLSTFGIATMLASAMIGGAIFQYPLGRASDFVDRRYMMILAGAIGFTLSFIMVLFHPTSPYTLYAMMFLFGSVVFPIYSLNVAHANDYADASEFVKISGGLLIVYGVGSVLGPAISGPLMDVIGANGFFVTMAIAYCIYGLHAWWRIYRLERPAISDQKTEFKFHTPDGQSTPETMQLDPRAEGTSGQAQ >NZ_LT962938|1356234:1404640|1377647_1379759_+|WP_002966727.1|DBSCAN-SWA 
MILPVFVINMASQPAAYKTVAASIEAYGQGFQLHRIDAVNGHTATQRIGIDDARFDAINGREMLPGEYGCYRSHLKALESFLSDGSPYGLILEDDVVFTETTSARIHDIIKSLPDFDVVKLVNHRSPLFMSLLETDAGDRIGRAIHGPQGSAAAYLVSREGARKLLSALSTMELPWDVAMERFWHHKARLFSSDENILAFSSHSEISNISDQNSGYDEAKHPWYMRLRTSLFRTFDYYVRVHHTLLQPQNPDGSSMKSQSGAYKLPGISLTGELIAAISLLVFMSTVWVETDAYRYIALGFVVAALIRYARTDFWKYEKPMVGWAGLLCVAWTFYVLARFAYIYLFYPEMGTGSAEGIYLFPLFYPTLGFALLLFIRRPFLIAVAFMAISLVILIFGFHYDLSWNERAVTLLQHNPIHAAVSSGFIALCAMAFGIHTLNRNTLDTRARVVLCLLALATFIAALIAIYSLYSKGVWLAMAIAFPTFVVLVALTDKSQTSRMAALVCILIGLLSVFAGEHILQRVGGNTANTSWELLSDLKTGDNIMQDFDKAIKNPETGLSERERLMIWANTLHIWHKNPIFGAGVSWLHYWEKRPYQQTDFTLLHNGYLEIAIRYGFLGLLFYGVLTVWAVRCTWQATRAGLIDSAAFQCYVATLVFFAVTILSNSNVRLAIGESYMALAFGFAFYCQYLLQQHNRQYPRTYF >NZ_LT962938|1356234:1404640|1363304_1363472_+|WP_002971518.1|DBSCAN-SWA MLLQRFTIVCLMIAGFAAFASILTYRTYDIYGDQPVVDAVALAHMHDLADGKTRQ >NZ_LT962938|1356234:1404640|1389742_1390882_-|WP_004686785.1|DBSCAN-SWA MEFWLVAALLTFAATLAVLLPLTRRRQAALPAEKNDLEVYRDQLREVEADVARGMIDLQSAQQARIEISRRILNAEKDAMAATADGGQGRSGRVLAFVAVLAVPLVAWGIYPLFGAPDMPSMPLAPRLAAAPDRSSVTDLIARAEAHLAQNPGDVRGWDVLAPIYLRLGRASDAVSAYHTSIRIAGENLARILGLGEALTAASGGTVTAEAEKLFKKAAELSPDDIRPQYYLAQGEMQDGHPDRAADRLQAFLDKAPADAPWRGQLEKTIAILRDPASAKQAEAKGPSAEDVEAASTLSAGDRQAMVEGMVQRLDETLRQNGGDIDGWKRLVRSYMILNRRNDAQDALARGMKALQGENRTELQSFATTLGLDVGTAQE >NZ_LT962938|1356234:1404640|1394410_1398274_-|WP_006144610.1|tail|DBSCAN-SWA 
MATVVLQAVGAAVGGIFGPVGAAIGAGLGAMGGYAIDNALLNSTRHIEGARLNGGRVATAEEGGALPFIYGTARVSGTLIWATRFEERKTTTRQGGKGGPKVTNYSYFGNAAYAVAEGEIAGIRRVWADGQELDLTEIDMRVYCGTATQSPDPMIEAKQGTGNAPAYRGTAYVVFERIPLDTFGNRLPQFQFEVMRPVGEVARNMRAVALIPGSTEFGLSPDTVSDEPVPGEKRWINRNAIRARSDWAAALDELQALCPGLRHVAIVLPWFGDDLRAGQCRIRPGVTSLSVRKPSTIWKVENVSRGAAHLISMSGEGAAYGGTPSDASVIAAIRDAKARGLGVTLYPFIMMDVPKENQLPSPYGGIGQPAYPWRGRITCHPAIGQEGSPDGTPAAGEQAAAFVNGEWGYRRFLNHCADLAVRAGGVDAFLIGSELRGLTSIRDGRGSFPFVSHLCALAAEMRSKLGVGCRITYGADWSEYFGYQASDGTGDLFFHLDPLWAHPAIDAIGIDNYMPLADWRDSDFSEGNPDGFKTPYDLAGLSRSVNSGEGYDWYYASAEDRLARRRTPITDGLAAKPWVYRYKDLHGWWSNRHYNRIDGVEVAAPTAWVPQSKPLWFTELGCPAVDKGPNQPNVFPDPKSSENAAPYFSNGLRSDAAMDHFLRAHYRHWQGENPLSPVYGGPMLDMERIYLWAWDARPFPEFPLGQDIWGDTANWRLGHWLNGRISGIALDELIAAILRDFGLPEADCIGVEGHLSSFIVSEPSSARGVLEPLLNVFGVHGYERAGQFVFRNITRAEAALELGEFAQPDEGEALTVVVEDQGDLPSTAELYCNDPLRDFQIVGASARREVGQGTESLSLSGSMENGQATALAEAWMARRHAERRTASFSLPWSNAALHVGDRVRLGVLDGQRDYVVTALEDGAMRTVTAAALAPNLVFADKGETPPRPPGGPALDMEPVFHFVDLPLWPGAEDPAAQFRIACHAKLWRGVAVYASPSNDGFAERALIGERAIMGELTAPLAGGPSGRLIEGQFVEVVLYSGELQSRPLVQVLNGANTGLLRSPDGRWEIFQFLDAEEVGLNRWRLGRLLRGQLGTEAAALDDKPVETPFILLDSGVVSAGLQASELGLELSWRVGAAGKAFSDEFFETVRQSGGLRALRPLSPVHLGVMRSLDGDMSFTWIRRGRIDADSWLGEDIPLGEDREAYRVEIWRDDTLVRSEQVSAPAWNYGFAERRAELGDDAFRFCVAMIGAKTGPGDFAVLDVDPNPSGKSGSDLEKYERLSQQDHA >NZ_LT962938|1356234:1404640|1360400_1361171_+|WP_006137429.1|DBSCAN-SWA MQSNSGEDGILGTALFRCFTHYPKQSRFALLLEMLWSLKLADIADSAPSLADASSPISVDRHRLYEDAIAMLIGTSFIALGITLYSHAMLMTGSTAGIALPIHYATGTGFGLLYFLINLPFYYFAVRRMGWAFTIRTFAAVALLSGFTRLMPLSVDFTSINPLFAALMGGTLMGMGVLALFRHRSGVGGVNILALYLQDAYGIRAGWFQLGLDVLIMLASLFFIPWENMVLSLVGAVAMNVIIAINHKPGRYIGIS >NZ_LT962938|1356234:1404640|1387237_1389229_-|WP_002963758.1|DBSCAN-SWA 
MSVEIGHFALVLALALSIVQSIVPVVGAHRRDAQLMAVAVPTALAVFALIVLASAALIHAYVVSDFSVLNVVENSHSQKPLLYKITGVWGNHEGSMLLWVFILTLFSALVAAFSGNLPETLRANVLAVQGWIGTAFLAFIIFTSNPFTRIFPAPMEGGDLNPVLQDIGLAIHPPLLYLGYVGFSVCFSFAVAALLEGRLDAAWARWVRPWALMAWMFLTGGIAMGSYWAYYELGWGGWWFWDPVENASLMPWLVGTALLHSAIVMEKRSALKIWTVLLAILTFSLSLLGTFLVRSGVLTSVHSFATDPGRGLFILGILALFIGGSLSLFALRVQSLSAGGIFHPISREGALVFNNLFLTTAAATVLIGTLYPLLLEVTTGEKISVGAPFFNMTFGPLMVPLLFAVPFGPLLAWKRGDLYGVGQRLMTAFALSLAVVGIMLWRTSAHSVLAALGIGLAAWLIFGSLTDLVLKAGIGKVSASKAFARFKGLPRSVFGTSLAHIGLGLTLLGIVSVTTFGTENVLVMQPGGTAKVQNYTLRFEGLRPITGSNFTENRGAFTLLDSSGRDLAVIEPSKRFFPARQMPTTESGIKTLWFSQVYVALGDEPGNGAVVVRIWWKPLVTLIWYGALVMMLGGLFSLADRRLRVGAPSKVRRSASKEAEAVA >NZ_LT962938|1356234:1404640|1391528_1392839_-|WP_002969444.1|DBSCAN-SWA MALVVVATFISSLYGEAARNNFERLLTAHLFSLVGAVSVSGEGTLQGRPELGDLRYSSPLSGWYWSVDPVTPNLTGKLESPSLVGRIVPEMPVSQAPFDSSFMRSYTLPGLDNEELSIVETEVVLDNSNRVARFRVMGNLSEVLNEIANFRARLLVYLGVFGIGSILINAAVILFGLRPLDKVRQALADIREGRSSRLDATLPLEIAPLAREMNALIENNRRIMERSRTQVGNLAHSLKTPLSVLVNEARAMGGAPGRIVQEQSEAMQVQIQHYLQRARIAAQRDSVVFRTPVTPVLERLHRVTAKLHPTFNISFRNDLPGAVFAGEREDLEEIIGNLLENAGKWGRKCITIRLAAVAGEQRQFEIVIEDDGPGLEADKIEAALKRGSRVDETKPGTGLGLAIVQDTVREYGGSLHLGKDALGGLAVRVVLPLTQD >NZ_LT962938|1356234:1404640|1379803_1382755_-|WP_006137444.1|DBSCAN-SWA 
MTVENAKALFFERNLCALTPLDPERASAFLADLEARAREEELAGVVALLGRKKAADFLSAILDLSPFIREALTRQPRILDRIVSATPESALEAILDEISASGTVAGVSESELMTSLRQLKREAHVLIALCDLARIFNTETTTDRLTDLAEACTGAAVRFLLLDADAAGRINLPDRSNPEKDCGWIVLGMGKFGARELNYSSDIDLIVFIDETKPAIGDPYECVDTFSRLTRRLVRILQDRTGDGYVFRVDLRLRPDPGSTPLAIPVGAALHYYEGRGQNWERAAMIKARPVAGDRLSGKQILAELSPYVWRKYLDYAAIADVHSIKRQIHAHKGHGDIAVRGHNVKLGRGGIREIEFFVQTQQLIAGGRFPELRGNQTVPMLARLAERGWITQQARDALAQEYWFLRDVEHRIQMIADEQTHILPEDDEGFARVSHMMGYADPAEFSEIFLAALKVVEKQYAALFEQAPELGAASGNLVFTGDVDDPGTLETLSAMGYERSSDICRVIRTWHFGRYRATQSAEARERLTELTPALLKAFAETRRADESLLRFDGFLQGLPAGIQLFSLLQSNPRLLNLLVMIMSAAPRLADIITRNPHVFDGLLDPAIFSEVPTRAYLEERLRAFLGSATDFEEVLDRLRIFAAEHRFLIGIRLLTGAINGVRAGQAFSDLAELMVGRALEAVEAELQRRHGKVKGAKVALLAMGKLGSRELTAGSDVDLILLYDHDKDAEESDGEKPLAPSQYYIRLTQRLIAALSAPTAEGVLYEVDMRLRPSGNKGPVATHIEAFGKYQRNDAWTWEHMALTRARPIHGDEAFIARIKVDIEDVLAMPRDVRKLAGDVREMRELIAQEKPPRDDWDLKLKPGGIIDLEFIAQFATLAGYVKKTPRPFATEEVLANLDPFFGDPAMVDGLVEAHRFYTNLSQAIRLCLNDSAGLDQFPPGMRELLCRVAGLPDIERIEYELLEHYRLVRAAFDKLVGHGAD >NZ_LT962938|1356234:1404640|1394139_1394346_-|WP_002969445.1|DBSCAN-SWA MTANKSWYLSRTVWAGLVALFLSVAGLFGVATDMIDQGNMTDVLLQLATAIAGVVTVIGRIGATSRIL >NZ_LT962938|1356234:1404640|1362399_1363110_-|WP_004683348.1|DBSCAN-SWA MAEQTKPIELYYWPTPNGFKISIMLEELGVPYAVKYINIGKGDQFEPGFLKIAPNNRMPAIVDSEGPGGEPISVFESGAILQYLGRKFGKFYPTDKRKRVTVEEWLMWQMGGLGPMSGQAGHFRIYAPEKIQYGIDRYTNEVNRLYGVLNRRLEGRDYIADEYSIADMACIGWVNAYKNYEQNLDDFANLKRWHETMNARPAVQRGLLVGKEERERADAAANKEEEQKILFGQKAR >NZ_LT962938|1356234:1404640|1366423_1368115_-|WP_004683340.1|DBSCAN-SWA 
MSEANELPERESMEFDVVIVGAGPAGLAAAIRFKQINPELSVVVLEKGGEVGAHILSGAVVDPVGIDQLLPGWREEEGHPFKTPVTADHFLVLGPAGSVRLPNFAMPSLMNNHGNYIVSLGNVCRWLGTKAEELGVEIYPGFAATEVLYNDEGAVIGVATGDMGVERDGTRGPNYTRGMALLGKYVLIGEGARGSLAKQLIAKFKLDEGREPAKFGIGLKELWQVDPSKHKPGLVQHSFGWPLDMKTGGGSFLYHLEDNMVAVGFVLHLNYKNPYLSPFEEFQRFKTHPAIRDTFEGGKRLSYGARAITEGGWQSVPKLSFPGGALIGCSAGFVNVPRIKGSHNAILSGILAADKIAEAIAAGRANDEPIEIENSWRASAIGKDLKRVRNVKPLWSKFGTAIGIALGGLDMWTNQLFGFSFFGTMKHGKTDAQALEPAANYKKIDYPKPDGVLTFDRLSSVFLSNTNHDENEPVHLQVRDMELQKTSEHDVFAGPSTRYCPAGVYEWVDADGNAAADPGVKDVRFVINAQNCVHCKTCDIKDPNQNINWVPPQGGEGPVYMNM >NZ_LT962938|1356234:1404640|1368503_1369376_+|WP_006144604.1|DBSCAN-SWA MTDGRETLDMEGLLRFYAEAGVDVPLCETPIDRFAAATHPAPAQSRMQAAAQQPESNPAQAREERARTVAGPSPSSKPVQAAMDLPDNAQIALAREAASQAETLEELREKLAAFDGCNLKFTAKNLCFADGDPSSDIMFIGEAPGRDEDMEGLPFVGKSGQLLNRMIEAIGLKREEVYIANTIPWRPPGNRAPTPLETELCRPFIERQIELAAPKVLVALGGPAGKALTGAAEGILRLRGNWKIHRTPTGMEIPVMPTLHPAYLLRTPAQKRFAWRDFLAVKLKLAELRG >NZ_LT962938|1356234:1404640|1390982_1391435_-|WP_002971524.1|DBSCAN-SWA MMIVSRFSRPVPVISMMFVVSLALSACGTTGGGKGSGFPSLGGSSQKPETNLLASLGNGLLGNSASQLSAADRRKALEAEYRALEYSPAGKSVLWSGAGSNAGDVTAAQPYQVGSQNCRQYSHSFTIGGDQQTVRGTACRNPDGSWTPLT >NZ_LT962938|1356234:1404640|1359093_1359744_-|WP_004683357.1|DBSCAN-SWA MSIRTITAMAGTAFIMGAAAIAFSSPASALTMKECSTKYQAAKDAGTLGNMKWNDFRKAQCGDDAASAPAAAPAAAPATKKAAKAAAPASNDGAKSLTMKQCSAKYQAAKDASTDNGMKWNDFRKAECGPGADPVALSTDGDSEPAAPSVAAPKGVKFPTAVSAKYSKESAGKARMHTCLDQYHALKDANALGGLKWVQKGGGYYSLCNARLKGNS >NZ_LT962938|1356234:1404640|1402155_1403349_-|WP_002963736.1|portal|DBSCAN-SWA MAWNWPWRKSAANLPARANAVSQTKMANGFVALHMERNASWIARDYSTLAREGFMRNPVAHRCVRLIAEAASTIPWLLYEGATEHEAHPLLDLLTHPQGAVEGGSFFERLYGHLLIAGNAYVERVDLPSGRMELHLLRPERVTVETDADGWPQALVYRSGLASRTIPVAGLDAPGLHLKLFHPLDDHYGFPPLEAALMALDIHNAAGAWNKALLDNSARPSGALVYAPKEGGNLTEEQFERLKAELEEGYTGASGAGRPLLLEGGLDWKAMGYSPQDMDFIEAKNGAARDIALAFGVPPMLLGIPGDNTYANYAEANRAFYRLTVLPLIGRTAKAFGNWLGPLFGGGLRLEHDIDRIEGLSAERESLWRRVSEASFLTDDEKRDAVGYQPRVGRRVS >NZ_LT962938|1356234:1404640|1400215_1400761_-|WP_002970984.1|tail|DBSCAN-SWA 
MTDETVTVSVNADTSAFDRALSDLEKRSSSFGNSLNSALKGAITSGKGLEDVLRGLASSLAGTALSAGLQPLQGLTSSMMGGLLSGIRGIMPFAKGGVVSSPTYFGMGNGSLGLAGEAGAEAILPLARGSDGRLGIATGGGSKPVQVVFNMTSPDASSFRKSEAQLATMLAGAVRRGARRL >NZ_LT962938|1356234:1404640|1400979_1401321_-|WP_004683300.1|DBSCAN-SWA MMMANRHRGEVAARLDGRDWTLCLTLGALAELESVFETDNLSALVARFSSGRLSARDMQRIICAGLRGGGHTVSEEDVADMRAEGGVAGFAHIVSSLLTVTFGTSEKDSAPNP >NZ_LT962938|1356234:1404640|1398277_1398712_-|WP_004683307.1|DBSCAN-SWA MMIAERVLAEAHRWIGTPYRHGASTLGVSCDCLGLVRGIWRALYGVEPENPGVYAPDWAEVSQGDPMLEAAVRYMVRREEHAPQPGDLLVFRWKPGFAAKHMGIMAREGRFIHAYQGHGVLASALVPQWRRRMAGIFLFPEPKV >NZ_LT962938|1356234:1404640|1392913_1393603_-|WP_002963753.1|DBSCAN-SWA MRILIVEDDKDLNRQLSEAMIAAGYVVDSAYDGEEGHYLGDTEPYDAVVLDIGLPQMDGISVVERWRRSGRTIPVLMLTARDRWSDKVAGIDAGADDYVAKPFHIEEVLARLRALIRRAAGHASSEFVCGPLHLDTKTSKASIDGVALKLTSHEYRLLSYMMHHMDEVVSRTELVEHLYDQDFDRDSNTIEVFVGRLRKKMGVDLIETVRGMGYRMRSGGEGEAKGSGS >NZ_LT962938|1356234:1404640|1361993_1362341_+|WP_002963776.1|DBSCAN-SWA MIRTLILGVTLAAGFAAPAFADEAIVGTWKRPNGTLISYAACGANKFCGTVMTGEYKGKSIGTMSGKDGNYKGEVNKLDEGKTYSGKASVKGNTLSLSGCVMGGLICKSESLARQ >NZ_LT962938|1356234:1404640|1401317_1401731_-|WP_002963743.1|tail|DBSCAN-SWA MAAQRGKDILLKTVRDDGTFETCAGLRTKRIAFNAETVDVTDADAAGRWRQLLAGSGVQRASISGSGIFKDAASDALIRRIFFDGEIRDWQIVLPDFGTISGPFQITALEYGGNHDAEVTFEIALESASLITFGEAI >NZ_LT962938|1356234:1404640|1363468_1363636_+|WP_004683346.1|DBSCAN-SWA MRAGAPSIAAIGLRIFLFTAIFTGPAAVFAYTQIGNHAEMNGAGLVLYISLQRTA >NZ_LT962938|1356234:1404640|1399580_1400213_-|WP_002963747.1|DBSCAN-SWA MVEAFHDVRFPLGVSFGATGGPEWRNEIVTLTSGLEKRNARWAHSRRHFDAGTGLRSLDDLRMVLAFFEARRGSLHAFRFRDPFDFSSATGKASLSAFDQPLGTGDGVAVHFQLRKNYESYDRPITLPVPGSVVIGVDGVKVPEGEAFTVDPLTGIVTFTPDYLPARDVPVTSGFLFDVPARFDTDRLTASIASFQAGEIPSIPIVEVKR >NZ_LT962938|1356234:1404640|1363676_1365179_+|WP_004683344.1|DBSCAN-SWA 
MTQRIEHPFLTDIKTPPLMEPEVFSDAQAAVAALCKLYERNTAFLRSAFEKVARGEIAPQRYRAFYPEICLSTSSFAHVDSRLAYGHVSTPGDYSATVTRLDLFGHYLREQIRLLMRNHGVTVTVRESSTPIPIHFAFKEGAHVEASVASAFTHPLRDLFDVPDLAATDDKIVNADFEPAPGEPMPLAPFTAQRIDYSLHRLSHYTATSASHFQNFVLFTNYQFYMDEFCAYARQLMAEGGGGYDQFVEPGNIVTRAGETAPSTGNPLQRLPQMPAYHLQKAGHGGITMVNIGVGPSNAKTITDHIAVLRPHAWLMLGHCAGLRNSQQLGDYVLAHAYMREDHVLDDDLPVWVPLPALAEIQVALEEAVEEITGLQGYDLKQIMRTGTVATIDNRNWELRDQRGPVQRLSQARAVALDMESATIAANGFRFRVPYGTLLCVSDKPLHGELKLPGMATEFYKRQVAQHLRIGIRAMEKIASMPDERLHSRKLRSFYETAFQ >NZ_LT962938|1356234:1404640|1382877_1384284_-|WP_005969562.1|DBSCAN-SWA MMSRFSALMRTTAARLSALYLLLFAVGAVALVFYMTNLSASILAGQTQQALGEEVASIGKSYARGGIPQLVRTIDYRSRQPGAYLYLVADPTGRILAGNVESVEPGVLNTDGIIERAFTYRRYGEQAPQMEHRAIAVVIALPNGMRLLVGRDLGEPERFRDLIRNSLVLALGIMGVGALLIWLFVGRRALKRIDDVSRASQRIMDGDLTGRLPVNGSGDEFDRLSGNLNVMLARILELNEGLKQVSDNIAHDLKTPLTRLRNRAEEALGGEKVEPEYRAALEDIIGESDQLIRTFNAILMISRLEAGYSSENLDDMPVAPIMRDVAEMYEPVAEDAGVTLTLGALDDVALHINRELVGQTVSNLVDNAIKYAGGEGRTATVTLLMEKDAQWVRIVVADNGPGIPADKRDHATERFVRLEESRTQPGSGLGLSLAKSVMKLHGGALRLEDNGPGLRAVLEFPLPHREVG >NZ_LT962938|1356234:1404640|1356234_1357161_+|WP_004683363.1|integrase|DBSCAN-SWA MIIVGETKIDTGDKYAPIIDYNLNYISGKNPKHRLVEHYSVAELTAKYINILWDDGPHKYNVRSFLGEIDEILKGARFSGFDQEMLDSIIGTLRERGNSNATINRKMAALSKLLRKAHKMGDIFNLPEFIRQKERVGRIRFLEQEEEKRLFAAIKSRCEDSYRLSVFLVDTGCRLGEAIGLTWNDIQEQRVTFWVTKSNRSRTVPLTRRARKASHIPRERLKGPFSMLNQVRFRQIWNEAKAEVGLGADDQIVPHILRHTCASRLVRGGIDIRRVQMWLGHQTLQMTMRYAHLATHDLDSCVKVLEIH >NZ_LT962938|1356234:1404640|1384280_1384988_-|WP_004683323.1|DBSCAN-SWA MKILVIEDDREAARYLEKAFAEAGHSADIAGDGETGYALAENGNYDVLVVDRMLPKRDGLSVVAGLRAKGMETPVLILSALGEVDDRVTGLRAGGDDYLTKPYAFSELLARVEVLQRRSSPREADTIYRVGDLELDRLTHTARRQSVDITLQPREFRLLEYLMRHAGQVVTRTMLLENVWDYHFDPQTNVIDVHISRLRSKIEKGFDEPLLHTVRGAGYMLKAGRGKQSAAARAE >NZ_LT962938|1356234:1404640|1403380_1404640_-|WP_005969588.1|DBSCAN-SWA 
MGLAFRQILAAQDEWLIRARDAQLPPDGDWRVWLIMGGRGSGKTRAGAEWVSGMALGLPPFAGKPSGHIALVGETFNDAREVMVDGPSGILSVSRLVRPRYEASRRRLIWDNGAVATLFSSEDPDSLRGPQFDAAWCDELAKWKNPQETWDMLQFGLRLGDNPRQVVTTTPRAVPLLKALLTDRTVSMTHMRTAENAGNLAEGFMQTIARRYAGTRLGRQELDGELVEERPGALWSRDRIEQCFEENPPPLARIVVAVDPPASSGKASDACGIVVAGIDAEGVGHVLADESMTMAKPHQWARRAIALYHTHEADAIVAEVNQGGEMVAAVLAAEDPSVPVLKRRASRGKWLRAEPVAALYEQGRVRHAGRFPALEDEMCDFAPEGLSSGRSPDRLDALVWALGELMLGADHKPRIRRFG >NZ_LT962938|1356234:1404640|1398708_1399584_-|WP_004683305.1|DBSCAN-SWA MIPVPPALESHLQGEVTTHCFAWLIRRLDGAVLGFTDHDAPLTVDQVICDPLTGLNSSEASTALGLGIAGGEVEGVLSSTQISDEDIEQGRYDGATIEAFLVNWDEPDQHMLLRRWAAGKISRSGSRFVMELKGVAAAFDAVRGRRILRHCDAMLGDKRCGIDTGDPRFFAQGTVLVAEGTRLDVAGVDGFAAGWFSEGRLAWTSGANRGRAVRVVGHAGASLQLGEPMILPVAAGDAFRLVCGCDKSFATCKAKFANGVNFRGFPHLPGNDAAYAYVNSTNDYDGGVLVP >NZ_LT962938|1356234:1404640|1358674_1359049_+|WP_002963781.1|DBSCAN-SWA MPSINGQPRSVLFGTLAGLCGALGIASYAGAAHMGESHLGTIAPLLLAHAPALLFLSLISPVSRVVRIGGAILVVGLALFCGDLFMRDMTGDRLFPFAAPTGGSLMILGWLCLGCSGWFSANAK >NZ_LT962938|1356234:1404640|1401781_1402159_-|WP_002967499.1|DBSCAN-SWA MNNWNDAVLASDTAWLWFAKIAGAVAGSAVSLAYMLPHGKREAAIRFAVGIICGMVFGGAAGVKIAETLALGDALGRAELMLMGSAAASLAAWSALGVFRRFAERLKQAPLPGVLPAERNMHGEI >NZ_LT962938|1356234:1404640|1375025_1377377_+|WP_006137441.1|DBSCAN-SWA 
MASTDAYGAPAGTHCESGRKKRKGRLSGHVSLLAGPAYSKFIVIEPILRRLVPTLIIIFLIILGVARVFSLLAWRDDIELQHKAALSGATAHLAQMIERVANGIETGAQLSAKDLQDAMTELRSRGLTSSGMTIAIVDAQSMIKAASGPAGIAGSQIDTILGDAQPLFLFAERAGVLRVVLQGEAAFGALAKPMTAPYSIIAVEPESTIFAEWKRAVSLNVTLFAGTIGVMFAILYAYFSQAARAREADDLSGQIQRRIDMALARGRCGLWDWDMARGRIYWSRSMYEMLGYEAQDAVLPFGDVAAIINEEDGDLYSIAEQAAAGDISHVDRVFRMRHADGSWVWMRVRAEIASEGDLHLVGIAFDVSEQHRFAQQTAEADMRIREAIENISEAFVLWDANNRLVMANSKFSEYAGLPVWTLKPGVPRNEVDAHTRPFTFERRMANEHNRAGGQTFERQLSDGRWLQVNERRTQDGGMVSIGTDITQLKLHQERLVDSERRLMATVHDLSIARKGERDRVRELSELARKYSLEKERAEAANRAKSEFLANMSHELRTPLNAIIGFSEMIQAGTFGPLGSDRYEEYINDIHTSGNFLLNVINDILDMSKIEAGHFSLDREEIDLCPLINETVRIISLQAEEKNIAVETRIEDAMELYADRRAIKQVLINLLSNAVKFTSYGGRITVRARKTAAALFMTIQDTGVGIPKSALRKIGQPFEQVENQFTKTHTGSGLGLAISRSLAELHGGWLRIRSTERVGTVVSVCIPDRNPAPNAGHDARTHAA >NZ_LT962938|1356234:1404640|1361757_1361997_+|WP_004686532.1|DBSCAN-SWA MITIGRIGMVRIGAKRALQTAMMKLKIAPSASPETRLRLRLRKLMRGAERMLLQRLKRMNSGFIAAPTIRDSRHFGRNE >NZ_LT962938|1356234:1404640|1357523_1358657_+|WP_002963782.1|DBSCAN-SWA MAQPRLTPLVESLPSTVPFIGPETLELQRGKPFEARIGANESSFGPAPSVIEAMRNEATEVWKYGDPENYALRHAIAAHHGLKAEHIMPGAGVDALLGLIVRQYVQQGDKVINSLGGYPTFNYHVAGYGGQLVTVPYRDDKPDLDALIDAAAREKPALLYIANPDNPMGTWHGGADIQSFIERLPETTLLILDEAYCETAPASAFPPFETDRPNVLRMRTFSKAYGLAGIRCGYAVGNPVAIKTFDKVRDHFAVSRMAQAAAIAALKDQAYLHEVVGKICAGRDRIAAIAEANGLHAVASATNFVAIDCGRGKDFAQAVLNGLISRDIFVRKPGTPVLDRCIRVSVGVKEQLDQFEAAFPEALEEARKICAANAENT |
44 | Rhodobacter_phage(20.0%) | protease,tail,integrase,portal | attL 1347037:1347051|attR 1360478:1360492 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
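The keyword labelling described above can be read back out of the protein record headers. The following is a minimal sketch, assuming the header layout seen in the records of this prophage region (`contig|region|start_end_strand|protein_id|[keyword|]tool`); the layout is inferred from those records rather than from a documented specification, and `parse_header` and `PHAGE_KEYWORDS` are hypothetical names introduced only for illustration.

```python
# Keyword list copied from the note on colored proteins.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)
_KEYWORDS_LOWER = {k.lower() for k in PHAGE_KEYWORDS}


def parse_header(header):
    """Split one prophage protein header into its fields.

    Assumed layout (inferred from the records, not documented):
    contig | region | start_end_strand | protein_id | [keyword] | tool
    """
    fields = header.lstrip(">").strip().split("|")
    contig, region, coords, protein_id = fields[:4]
    start, end, strand = coords.split("_")
    # Any fields between the protein ID and the final tool name are labels.
    labels = fields[4:-1]
    keywords = [lab for lab in labels if lab.lower() in _KEYWORDS_LOWER]
    return {
        "contig": contig,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "protein_id": protein_id,
        "keywords": keywords,
        "tool": fields[-1],
    }


if __name__ == "__main__":
    record = parse_header(
        ">NZ_LT962938|1356234:1404640|1402155_1403349_-|WP_002963736.1|portal|DBSCAN-SWA"
    )
    print(record["protein_id"], record["keywords"])
```

Headers without a keyword field, which make up most of the records in this region, yield an empty `keywords` list.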
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|