Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_018079 | Enterobacter cloacae subsp. dissolvens SDM, complete sequence | 1 CRISPR array | cas3,DEDDh,csa3,WYL,DinG | 0 | 2 | 7 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_018079_1 | 747546-747677 | Orphan |
NA
Consensus repeat of NC_018079_1
|
2 spacers
spacers of NC_018079_1
>1.1|747570|30|NC_018079|CRISPRCasFinder CACCACTGTCGCCGTTATCATTGCCGCCGC >1.2|747624|30|NC_018079|CRISPRCasFinder TGCCATTGTCACTGTTACCGCCATTATCAG |
CRISPR arrays and Neighbor proteins around NC_018079_1
The CRISPR arrays of NC_018079_1 >merge|NC_018079|1|747546-747677|CRISPRCasFinder TGCCGCCGCCGTTATCGGTATTATCACCACTGTCGCCGTTATCATTGCCGCCGCTGTTGCCGCCATCGTCGGTATTGCTGCCATTGTCACTGTTACCGCCATTATCAGTGTTGCCGCCATCGTCAGTATTGC >NC_018079|1|1|747546-747677|CRISPRCasFinder TGCCGCCGCCGTTATCGGTATTAT CACCACTGTCGCCGTTATCATTGCCGCCGC TGTTGCCGCCATCGTCGGTATTGC TGCCATTGTCACTGTTACCGCCATTATCAG TGTTGCCGCCATCGTCAGTATTGC
>NC_018079.1|WP_013095533.1|746316_746550_+|YgdI/YgdR-family-lipoprotein MQNKLLIASVLAATAMFTVAGCSSNQAVKTTDGRTIITDGKPQVDDDTGLVSYKNAETGQTEQINRDQVKSMGELDN >NC_018079.1|WP_014830610.1|742952_746177_+|carbamoyl-phosphate-synthase-large-subunit MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSNPATIMTDPEMADATYIEPIHWEVVRKIIEKERPDAVLPTMGGQTALNCALELERQGVLEEFGVTMIGATADAIDKAEDRRRFDVAMKKIGLDTARSGIAHNMEEALAVAAEVGYPCIIRPSFTMGGTGGGIAYNREEFEEICERGLDLSPTKELLIDESLIGWKEYEMEVVRDKNDNCIIVCSIENFDAMGIHTGDSITVAPAQTLTDKEYQIMRNASMAVLREIGVETGGSNVQFSVNPKTGRLIVIEMNPRVSRSSALASKATGFPIAKVAAKLAVGYTLDELMNDITGGRTPASFEPSIDYVVTKIPRFNFEKFAGANDRLTTQMKSVGEVMAIGRTQQESLQKALRGLEVGATGFDPKVSLDDPEALTKIRRELKDAGAERIWYIADAFRAGLSVDGVFNLTNIDRWFLVQIEELVRLEEKVADLGINGLDADFLRMLKRKGFADARLAKLAGVREAEIRKLRDQYDLHPVYKRVDTCAAEFATDTAYMYSTYEDECEANPSVDRDKIMVLGGGPNRIGQGIEFDYCCVHASLALREDGYETIMVNCNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVPVIGTSPDAIDRAEDRERFQQAVDRLKLKQPANATVTAIEQAVEKAKEIGYPLVVRPSYVLGGRAMEIVYDEADLRRYFQTAVSVSNDAPVLLDRFLDDAVEVDVDAICDGEMVLIGGIMEHIEQAGVHSGDSACSLPAYTLSQEIQDVMRQQVQKLAFELQVRGLMNVQFAVKDNEVYLIEVNPRAARTVPFVSKATGIPLAKVAARVMAGQTLAQQGVTKEIIPPYYSVKEVVLPFNKFPGVDPLLGPEMRSTGEVMGVGRTFAEAFAKAQLGSNSTMKKSGRALLSVREGDKERVVDLAAKLLKQGFELDATHGTAIVLGEAGINPRLVNKVHEGRPHIQDRIKNGEYTYIINTTAGRQAIEDSKLIRRSALQYKVHYDTTLNGGFATAMALNADATEKVISVQEMHAQINK >NC_018079.1|WP_014830609.1|741785_742934_+|glutamine-hydrolyzing-carbamoyl-phosphate-synthase-small-subunit MIKSALLVLEDGTQFIGRAIGATGSAVGEVVFNTSMTGYQEILTDPSYSRQIVTLTYPHIGNVGTNAADEESSQVHAQGLVIRDLPLIASNFRNTEDLSSYLKRHNIVAIADIDTRKLTRLLREKGAQNGCIIAGDNLDAALALEKAKAFPGLNGMDLAKEVTTTEAYSWTQGSWTLEGDLPEAKPESELPFHVVAYDFGAKRNILRMLVDRGCRLTVVPAQTSAEEVLKMNPDGIFLSNGPGDPAPCDYAINAITSFLETDIPVFGICLGHQLLALASGANTIKMKFGHHGGNHPVKDIDNNTVMITAQNHGFAVDEASMPANLRVTHKSLFDGTLQGIHRTDKAAFSFQGHPEASPGPHDAAPLFDHFIELIEQYRKTAK >NC_018079.1|WP_014830608.1|740509_741331_+|4-hydroxy-tetrahydrodipicolinate-reductase 
MHDAQIRVAIAGAGGRMGRQLIQAALQMDGVALGAALERDGSSLLGTDAGELAGAGKTGVTVQSSLEAVKDDFDVFIDFTRPEGTLNHLAFCRQHGKGMVIGTTGFDEAGKQAIQDASNEIAIVFAANFSVGVNVMLKLLEKAAKVMGDYTDIEIVEAHHRYKVDAPSGTALAMGEAIAHALDKDLKDCAVYTREGHTGERVPGTIGFATVRAGDIVGEHTAMFADIGERVEITHKASSRMTFANGAVRSALWLKGKDNGLFDMRDVLDLNNL >NC_018079.1|WP_014830607.1|739353_740304_+|4-hydroxy-3-methylbut-2-enyl-diphosphate-reductase MQILLANPRGFCAGVDRAISIVENALEIYGAPIYVRHEVVHNRYVVDSLRERGAIFIEQISEVPDGAILIFSAHGVSQAVRNEAKSRDLTVFDATCPLVTKVHMEVARASRRGEESILIGHAGHPEVEGTMGQYSNPEGGMYLVESPEDVLTLNVKNEARLSFMTQTTLSVDDTSDVIDALRQRFPKIVGPRKDDICYATTNRQEAVRALAEQADVVLVVGSKNSSNSNRLAELAQRMGKAAFLIDDATDIQEAWVKNANCVGVTAGASAPDILVQNVIARLQELGGGEAIPLEGREENIVFEVPKELRIDAREVE >NC_018079.1|WP_014830606.1|738902_739352_+|FKBP-type-peptidyl-prolyl-cis-trans-isomerase MSKSVQSNSAVLLHFTLKLDDGSTAESTRNNGKPALFRLGDTSLSEGLEQQLLGLKEGEKKAFSLEPDAAFGVPSPDLIQYFSRREFMDAGEPEIGAIMLFTAMDGSEMPGVIREINGDSITVDFNHPLAGRTVHFDIEVLEIDPALEA >NC_018079.1|WP_014830605.1|738284_738785_+|signal-peptidase-II MSKTLCSTGLRWLWLVVVVLIIDLGSKYLILQNFALGETVPLFPSLNLHYARNYGAAFSFLADSGGWQRWFFAGIAIGICVVLAVLMYRSKATQKLNNIAYALIIGGALGNLFDRLWHGFVVDMIDFYVGDWHFATFNLADSAICIGAALIVLEGFLPKPAAKEQA >NC_018079.1|WP_014830604.1|735468_738285_+|isoleucine--tRNA-ligase 
MSDYKSTLNLPETGFPMRGDLAKREPGMLARWTDDDLYGIIRAAKKGKKTFILHDGPPYANGSIHIGHSVNKILKDIIVKSKGLAGYDSPYVPGWDCHGLPIELKVEQEFGKPGEKFTAAEFRAKCREYAATQVDGQRADFIRLGVLGDWSHPYLTMDFKTEANIIRALGKIIGNGHLHKGAKPVHWCVDCRSALAEAEVEYYDKTSPSIDVAFHAVDQDAVKAKFGVTSVNGPISLVIWTTTPWTLPANRAISLSGEFEYALVQVDGQALILAKDLVESVLKRANITDYTVLGTVKGDALELMRFKHPFLDFDVPAILGDHVTLEAGTGAVHTAGGHGPDDYNISLKYGLEIANPVGPDGAYLPGTYPSLDGINVFKANDIIVEMLRERGALLHVEKMQHSYPCCWRHKTPIIFRATPQWFVSMDQKGLREQSLKEIKGVQWIPDWGQARIESMVANRPDWCISRQRTWGVPMSLFVHKETHELHPNTLELMEEVAKRVEVDGIQAWWDLDARDILGADADNYEKVPDTLDVWFDSGSTHASVVDVRPEFAGHAADMYLEGSDQHRGWFMSSLMISTAMKGKAPYRQVLTHGFTVDGQGRKMSKSIGNTVSPQDVMNKLGADILRLWVASTDYTGEMAVSDEILKRAADSYRRIRNTARFLLANLNGFDPAKDMVKPEEMVVLDRWAVGCAKAAQEDIVNAYESYDFHEVVQRLMRFCSIEMGSFYLDIIKDRQYTAKADSVARRSCQTALYHIAEALVRWMAPIMSFTADEIWGYLPGDREKYVFTGEWYEGLFDLSSTEAMNDAFWDELLKVRGEVNKVIEQARADKKVGGSLEAAVTLYAEPELAAKLTALGDELRFVLLTSGAKVADYADASADAQQSELLKGLKVALSKADGEKCPRCWHYTTDVGQVAEHADICGRCVSNVAGDGEKRKFA >NC_018079.1|WP_014830603.1|734496_735423_+|bifunctional-riboflavin-kinase/FAD-synthetase MKLIRGIHNLSKAPHGCVLTIGNFDGVHRGHQALLQGLRKEGEARGLPVVVMIFEPQPLELFAGDKSPARLTRLREKLRYLAESGVDYVLCVRFDRRFAALTAQNFVSDLLVKRLGVQFLAVGDDFRFGAGRQGDFLLLQKAGLEYGFDVTSAMTFCEGGVRVSSTAVRQALANDELETAENLLGHPFTISGRVVHGDALGRTIGFPTANIPLRRQVSPVKGVYAVEVAGLGDKPFYGVANIGTRPTVAGVRQQLEVHLLDVVMDLYGRHIDVILRKKIRNEQRFASLDELKAQIARDELTAREFFGL >NC_018079.1|WP_003856458.1|733905_734169_-|30S-ribosomal-protein-S20 MANIKSAKKRAVQSEKARKHNASRRSMMRTFIKKVYAAIEAGDKAAAQNAFNEMQPIVDRQAAKGLIHKNKAARHKANLTAQINKLA >NC_018079.1|WP_163281471.1|749032_749524_+|hypothetical-protein MMPVSVFDMLIAEESSLTKIPFSKPVMFPLLVKLPCIVTMFTAFFVAEEMVAPALLLISSGCALVFISSPFEVPEIFPLFVIELTLNPEAETAAKSPDEDTVISLLMTYSVSAEKSCGAEVDWSMVPAELIIGVKTSRAGNNLPSNWVVLFNSRRFTFIFTSI >NC_018079.1|WP_014830612.1|749877_750408_+|glutathione-regulated-potassium-efflux-system-oxidoreductase-KefF 
MILIIYAHPYPQHSHANKRMLEQVRTLDNVEIRSLYQLYPDFNIDITAEQEALTRADLIIWQHPMQWYSTPPLLKLWIDKVFSHGWAYGHNGHALKGKSLMWAVTTGGGESHFDIGSFPGFEVLAQPLQATALYCGLTWLPPFAMHCTFVCDDETLQAQARHYKQRLLEWQEAHNG >NC_018079.1|WP_014830613.1|750400_752266_+|glutathione-regulated-potassium-efflux-system-protein-KefC MDSHTLIQALIYLGAAALIVPVAVRLGLGSVLGYLIAGCVIGPWGFRLVTDAEAILHFAEIGVVLMLFVIGLELDPQRLWKLRASVFGGGALQMVACGLLLGGFCILLGMDWKVAALIGMTLALSSTAIAMQAMNERNLTVSQMGRSAFSVLLFQDIAAIPLVAMIPLLAASGASTTLGAFALSALKVVGALALVVLLGRYVTRPLLRFVARSGLREVFSAVALFLVFGFGLLLEEAGLSMAMGAFLAGVLLASSEYRHALESDIEPFKGLLLGLFFIGVGMSIDFGTLVTHPLRILILLVGFLVIKMMMLWLIARPLNVPGRQRRWFAVLLGQGSEFAFVVFGAAQMANVLDPEWAKALTLAVALSMAATPILLVLLTRLEKTGSEQEREADEIDEEQPRVIIAGFGRFGQITGRLLLSSGVKMVILDHDPDHVDTLRKFDMKVFYGDATRVDLLESAGAAKAEVLINAIDDPQTSMQLVELVKEHFPNLTIISRARDVDHYIQLRQAGVAAPERETFEGALKSGRMALESLGLGAYEARERADLFRRFNTDMVEEMVAMAGSTATERAAVFKRTSAMLTEIINEDRNHLSLVQRHGWQGTEEGKHTGDPADEPESKPSA >NC_018079.1|WP_014830614.1|752380_752860_+|type-3-dihydrofolate-reductase MISLIAALAVDRVIGMENAMPWNLPADLTWFKRTTLNKPVVMGRLTWESIGRPLPGRKNIVISSQPGTDDRVEWVKSVDEAIAACGNADEIMVIGGGRVYEQFLPKAQKLYLTHIDAEVEGDTHFPDYDPDEWESVFSEFHDADAQNSHSYCFEILERR >NC_018079.1|WP_014830615.1|752898_753747_-|bis(5'-nucleosyl)-tetraphosphatase-(symmetrical)-ApaH MSTYLIGDVHGCYDELIALLKQVDFTPGRDILWLTGDLVARGPGSLEVLRFVKSLGDSVRMVLGNHDLHLLAVFAGISRNKPKDRLTPLLDAPDADELINWLRRQPLLQVDEEKKLVMAHAGITPQWDLETAKTCARDVEAVLASDSYPFFLDAMYGDMPNHWSEDLSGLARLRFITNAFTRMRFCFPNGQLDMYSKETPESAPAPLKPWFAIPGPVTSEYSVVFGHWAALEGKGTPDGIYGLDTGCCWGGDLTCLRWEDKTYFVQPSNRQLDLGEGEAVAS >NC_018079.1|WP_014830616.1|753751_754129_-|Co2+/Mg2+-efflux-protein-ApaG MIDSPRVCVQVQSVYVESQSSPDEERFVFAYTVTIRNLGRMPVQLLGRYWLITNGNGREIEVQGEGVVGEQPHIDPGEEYQYTSGAVIETPLGTMQGHYEMVDVDGNVFRVAIPVFRLAVTTLIH >NC_018079.1|WP_014830617.1|754132_754954_-|16S-rRNA-(adenine(1518)-N(6)/adenine(1519)-N(6))--dimethyltransferase-RsmA 
MTNRVHQGHLARKRFGQNFLNDQFVIDSIVSAINPQKGQAMVEIGPGLAALTEPVGERLDELTVIELDRDLAARLQTHPFLGPKLTIYQQDAMTMNFGELAEKMGQPLRVFGNLPYNISTPLMFHLFSYTDAIADMHFMLQKEVVNRLVAGPNSKAYGRLSVMAQYFCNVIPVLEVPPSAFTPPPKVDSAVVRLVPHKTMPYPVKDLRVLSRITTEAFNQRRKTIRNSLGNLFTVDVLAELGIDPAMRAENISVEQYCKLANYISEKAPPKES >NC_018079.1|WP_014830618.1|754950_755937_-|4-hydroxythreonine-4-phosphate-dehydrogenase-PdxA MKQHRVVITPGEPAGIGPDLVVQLAQRSWPVELVVCADATLLQDRASLLGLPLTLTPYVEGQQPAPQQAGTLTLLPVPLRTPVIPGQLSTENGHYVVETLARACDGCLKGEFAALITGPVHKGVINEAGIPFTGHTEFFEERSHSPKVVMMLATEAMRVALVTTHLPIKAIPDAITPELLREIIGILHHDLQTKFGIKQPHVLVCGLNPHAGEGGHMGTEEIDTIIPVLDEMRAKGMNLSGPLPADTLFQPKYLDNADAVLAMYHDQGLPVLKYQGFGRGVNITLGLPFIRTSVDHGTALDLAGQGKADVGSFITALNLAIKMIVNTQ >NC_018079.1|WP_014830619.1|755936_757223_-|peptidylprolyl-isomerase-SurA MKNWKTLLLGVAMVANTSFAAPQVVDKVAAVVNNGVVLESDVDGLMKSVKLNSGEAGQQLPDDATLRHQILERLIMDQIILQMGQKMGVKVTDEQLDQAIANIAKQNNMSLDQMRSRLAYDGISYSTYRNQIRKEMLISEVRNNEVRRRVTILPQEVDALAKQVGNQNDASTELNLSHILIPLPENPTSDQAAEAESQARAIVEQARNGSDFGKLAITYSADQQALKGGQMGWGRIQELPSLFAQALSTAKKGDIVGPIRSGVGFHILKVNDLRGQSQNISVTEVHARHILLKPSPIMTDDQARAKLQQIAADIKSGKTSFANAAKEFSQDPGSANQGGDLGWAAADIYDPAFRDALMKLNKGQMSAPVHSSFGWHLIELLDTRNVDKTDAAQKDRAYRMLFNRKFSEEAATWMQEQRASAYVKVLSN >NC_018079.1|WP_014830620.1|757276_759628_-|LPS-assembly-protein-LptD 
MKKRIPTLLATMIGTALYSQQGLAADLASQCMLGVPSYNRPLVSGDTNSLPVTITADSSKGTYPDNATFTGDVDINQGNSRLQADEVQLHQKQPEGAAEPVRTVDALGNVHYDDNQVILKGPKAWSNLNTKDTNVWEGDYQMVGRQGRGKADLMKQRGENRYTILENGSFTSCLPGSNTWSVVGSEVIHDREEQVAEIWNARFKLGPVPVFYSPYLQLPVGDKRRSGFLIPNAKYSTSNYFEFYLPYYWNIAPNMDATITPHYIHKRGNVMWENEFRYLTQAGTGLMELDYLPSDKVFQDEHPTEGDKHRWMYYWRHAGVMDQVWRFNVDYTKVSDPYYFNDFDSKYGSSTDGYATQKFSVGYAVQNFNATVSTKQFQVFNTQTRSTYGAEPQLDVNWYQNDVGPFDTRVYAQAVHFVNTNSDMPEATRVHLEPTINLPISNNWSSLNTEAKLMATHYQQKNVDWYNNRYGTDLDESVNRVLPQFKMDGKLIFERDMGLLADGYTQTLEPRMQYLYVPYRDQSKIQNYDSSFLQSDYSGLFRDRTYGGLDRIASANQLTTGVTTRVYDDASVERFNVSVGQIYYFTESRTGDDDINWEKDNKTGSLVWAGDTYWRMTDRWGLRGGLQYDTRLDNIATSSAAIEYRRDEDRMMQLTYRYASPEYIQATLPNYATSEQYKDGISQVGGAASWPIADRWSIVGAYYFDTNANKPADQMVGLQYNSCCYALRVGYERKLNGWDTQNNQGKYDNVIGFNVELRGLSSNYGLGTQQMLRSNILPYRSSL |
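The spacer coordinates reported for NC_018079_1 follow from the array start and the lengths of the repeat and spacer segments listed above. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and contiguous segments; the `locate` helper and variable names are hypothetical and are not part of CRISPRCasFinder:

```python
# Segments copied from the array listing above, in order:
# repeat, spacer, repeat, spacer, repeat.
ARRAY_START = 747546  # first base of NC_018079_1 (assumed 1-based, inclusive)
ARRAY_END = 747677    # last base of NC_018079_1

segments = [
    ("repeat", "TGCCGCCGCCGTTATCGGTATTAT"),
    ("spacer", "CACCACTGTCGCCGTTATCATTGCCGCCGC"),
    ("repeat", "TGTTGCCGCCATCGTCGGTATTGC"),
    ("spacer", "TGCCATTGTCACTGTTACCGCCATTATCAG"),
    ("repeat", "TGTTGCCGCCATCGTCAGTATTGC"),
]


def locate(segments, start):
    """Assign inclusive (start, end) coordinates to consecutive segments."""
    pos = start
    located = []
    for kind, seq in segments:
        end = pos + len(seq) - 1
        located.append((kind, pos, end, len(seq)))
        pos = end + 1  # the next segment begins right after this one
    return located


coords = locate(segments, ARRAY_START)
spacers = [(s, e) for kind, s, e, _ in coords if kind == "spacer"]

for kind, s, e, n in coords:
    print(f"{kind}\t{s}-{e}\t{n} bp")
```

Under these assumptions the two spacers land on 747570-747599 and 747624-747653, which agrees with the Spacer_region values in the MGE hit table, and the last repeat ends at the reported array end.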
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052797 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N18S2039 plasmid pN18S2039, complete sequence | 63456-63485 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052804 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S973 plasmid pN17S0973, complete sequence | 12622-12651 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP038508 | Salmonella enterica subsp. enterica serovar Infantis strain FARPER-219 plasmid p-F219, complete sequence | 129996-130025 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052788 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0611 plasmid pN19S0611, complete sequence | 221022-221051 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052786 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0641 plasmid pN19S0641, complete sequence | 232950-232979 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052838 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S097 plasmid pN16S097, complete sequence | 231525-231554 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | NZ_CP028316 | Salmonella enterica subsp. enterica serovar Typhimurium var. 5- strain CFSAN067217 plasmid pSC-31-2, complete sequence | 128630-128659 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP051676 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1234 plasmid pN16S1234, complete sequence | 101317-101346 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | NZ_CP022063 | Salmonella enterica strain FDAARGOS_312 plasmid unnamed3, complete sequence | 84260-84289 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052781 | Salmonella enterica strain CVM N19S0949 plasmid pN19S0949, complete sequence | 187128-187157 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052834 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S041 plasmid pN17S0041, complete sequence | 24105-24134 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052793 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0388 plasmid pN19S0388, complete sequence | 43406-43435 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052832 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1040 plasmid pN17S1040, complete sequence | 178361-178390 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052830 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1105 plasmid pN17S1105, complete sequence | 211357-211386 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | NZ_CP022662 | Salmonella enterica subsp. enterica strain RM11065 plasmid pRM11065-2, complete sequence | 73314-73343 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052812 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S376 plasmid pN17S0376, complete sequence | 19319-19348 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052810 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S535 plasmid pN17S0535, complete sequence | 230399-230428 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052808 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S637 plasmid pN17S0637, complete sequence | 11111-11140 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052806 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S816 plasmid pN17S0816, complete sequence | 182227-182256 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052791 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0552 plasmid pN17S0637, complete sequence | 185722-185751 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052818 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1509 plasmid pN17S1509, complete sequence | 208172-208201 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | NZ_CP043441 | Cupriavidus campinensis strain MJ1 plasmid unnamed1, complete sequence | 1688442-1688471 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052799 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S990 plasmid pN17S0990-1, complete sequence | 24105-24134 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052795 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0125 plasmid pN19S0125, complete sequence | 264943-264972 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | NZ_CP047882 | Salmonella enterica subsp. enterica serovar Infantis strain 119944 plasmid pESI, complete sequence | 77271-77300 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052802 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S976 plasmid pN17S0976, complete sequence | 298048-298077 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052840 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S024 plasmid pN16S024, complete sequence | 110008-110037 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052783 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0679 plasmid pN19S0679-1, complete sequence | 176473-176502 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052836 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S103 plasmid pN16S103, complete sequence | 764-793 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052779 | Salmonella enterica strain 19TN07GT06K-S plasmid pN19S1233, complete sequence | 122769-122798 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | NZ_CP031362 | Salmonella enterica subsp. enterica serovar Heidelberg strain 5 plasmid p3, complete sequence | 123173-123202 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052828 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1126 plasmid pN17S1126, complete sequence | 109328-109357 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052826 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1245 plasmid pN17S0637, complete sequence | 93338-93367 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | NZ_CP016409 | Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502916 plasmid pFSIS1502916, complete sequence | 77270-77299 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052824 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1265 plasmid pN17S1265, complete sequence | 73851-73880 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052822 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1349 plasmid pN17S1349, complete sequence | 93338-93367 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | NZ_CP016407 | Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502169 plasmid pFSIS1502169, complete sequence | 77270-77299 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052820 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1442 plasmid pN17S1442, complete sequence | 77270-77299 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | NZ_CP016413 | Salmonella enterica subsp. enterica serovar Infantis strain CVM44454 plasmid pCVM44454, complete sequence | 77270-77299 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | NZ_CP016411 | Salmonella enterica subsp. enterica serovar Infantis strain N55391 plasmid pN55391, complete sequence | 77270-77299 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052816 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1598 plasmid pN17S1598 | 147671-147700 | 7 | 0.767 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | CP052814 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S349 plasmid pN17S0349, complete sequence | 81463-81492 | 7 | 0.767 |
NC_018079_1 | 1.1|747570|30|NC_018079|CRISPRCasFinder | 747570-747599 | 30 | NC_022049 | Paracoccus aminophilus JCM 7686 plasmid pAMI4, complete sequence | 341179-341208 | 8 | 0.733 |
NC_018079_1 | 1.1|747570|30|NC_018079|CRISPRCasFinder | 747570-747599 | 30 | NZ_CP032695 | Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525c, complete sequence | 1386373-1386402 | 8 | 0.733 |
NC_018079_1 | 1.2|747624|30|NC_018079|CRISPRCasFinder | 747624-747653 | 30 | KP881232 | Sinorhizobium phage phiM9, complete genome | 86166-86195 | 8 | 0.733 |
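The Mismatch and Identity columns can be reproduced from a spacer and its protospacer. The sketch below uses spacer 1.2 and the CP052797 protospacer as listed in the alignments that follow, and takes identity to be matching positions divided by spacer length, rounded to three decimals. The function names are hypothetical. The match-line convention (`*` for a match, `.` for a transition, a blank for a transversion) is inferred from the listed alignments and is an assumption, not documented behavior of the database.

```python
spacer = "tgccattgtcactgttaccgccattatcag"       # spacer 1.2 of NC_018079_1
protospacer = "cgccattgccaccgttaccgccattgcccc"  # hit on CP052797

PURINES = {"a", "g"}
PYRIMIDINES = {"c", "t"}


def mismatches(a, b):
    """Count positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def identity(a, b):
    """Fraction of matching positions, rounded to three decimals."""
    return round((len(a) - mismatches(a, b)) / len(a), 3)


def match_line(a, b):
    """Build a match line: '*' match, '.' transition, ' ' transversion (assumed)."""
    symbols = []
    for x, y in zip(a, b):
        if x == y:
            symbols.append("*")
        elif {x, y} <= PURINES or {x, y} <= PYRIMIDINES:
            symbols.append(".")
        else:
            symbols.append(" ")
    return "".join(symbols)


print(mismatches(spacer, protospacer), identity(spacer, protospacer))
print(spacer)
print(protospacer)
print(match_line(spacer, protospacer))
```

With 7 mismatches over 30 positions this gives 23/30, which rounds to the 0.767 shown in the table; the rows with 8 mismatches give 22/30, or 0.733.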
1. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052797 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N18S2039 plasmid pN18S2039, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
2. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052804 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S973 plasmid pN17S0973, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
3. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP038508 (Salmonella enterica subsp. enterica serovar Infantis strain FARPER-219 plasmid p-F219, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
4. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052788 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0611 plasmid pN19S0611, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
5. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052786 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0641 plasmid pN19S0641, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
6. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052838 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S097 plasmid pN16S097, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
7. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to NZ_CP028316 (Salmonella enterica subsp. enterica serovar Typhimurium var. 5- strain CFSAN067217 plasmid pSC-31-2, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
8. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP051676 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1234 plasmid pN16S1234, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
9. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to NZ_CP022063 (Salmonella enterica strain FDAARGOS_312 plasmid unnamed3, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
10. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052781 (Salmonella enterica strain CVM N19S0949 plasmid pN19S0949, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
11. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052834 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S041 plasmid pN17S0041, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
12. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052793 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0388 plasmid pN19S0388, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
13. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052832 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1040 plasmid pN17S1040, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
14. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052830 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1105 plasmid pN17S1105, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
15. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to NZ_CP022662 (Salmonella enterica subsp. enterica strain RM11065 plasmid pRM11065-2, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattccccc Protospacer .*******.***.************ .*
16. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052812 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S376 plasmid pN17S0376, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
17. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052810 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S535 plasmid pN17S0535, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
18. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052808 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S637 plasmid pN17S0637, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
19. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052806 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S816 plasmid pN17S0816, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
20. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052791 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0552 plasmid pN17S0637, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
21. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052818 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1509 plasmid pN17S1509, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
22. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to NZ_CP043441 (Cupriavidus campinensis strain MJ1 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccactggtaccgccattcgacg Protospacer .*******.***** ********** *
23. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052799 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S990 plasmid pN17S0990-1, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
24. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052795 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0125 plasmid pN19S0125, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
25. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to NZ_CP047882 (Salmonella enterica subsp. enterica serovar Infantis strain 119944 plasmid pESI, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
26. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052802 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S976 plasmid pN17S0976, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
27. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052840 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S024 plasmid pN16S024, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
28. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052783 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0679 plasmid pN19S0679-1, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
29. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052836 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S103 plasmid pN16S103, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
30. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052779 (Salmonella enterica strain 19TN07GT06K-S plasmid pN19S1233, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
31. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to NZ_CP031362 (Salmonella enterica subsp. enterica serovar Heidelberg strain 5 plasmid p3, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
32. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052828 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1126 plasmid pN17S1126, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
33. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052826 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1245 plasmid pN17S0637, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
34. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to NZ_CP016409 (Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502916 plasmid pFSIS1502916, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
35. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052824 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1265 plasmid pN17S1265, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
36. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052822 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1349 plasmid pN17S1349, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
37. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to NZ_CP016407 (Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502169 plasmid pFSIS1502169, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
38. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052820 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1442 plasmid pN17S1442, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
39. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to NZ_CP016413 (Salmonella enterica subsp. enterica serovar Infantis strain CVM44454 plasmid pCVM44454, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
40. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to NZ_CP016411 (Salmonella enterica subsp. enterica serovar Infantis strain N55391 plasmid pN55391, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
41. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052816 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1598 plasmid pN17S1598) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
42. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to CP052814 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S349 plasmid pN17S0349, complete sequence) position: , mismatch: 7, identity: 0.767
tgccattgtcactgttaccgccattatcag CRISPR spacer cgccattgccaccgttaccgccattgcccc Protospacer .*******.***.************..*
43. spacer 1.1|747570|30|NC_018079|CRISPRCasFinder matches to NC_022049 (Paracoccus aminophilus JCM 7686 plasmid pAMI4, complete sequence) position: , mismatch: 8, identity: 0.733
caccactgtcgccgttatcattgccgccgc CRISPR spacer ggtttcggtcgccgatatcatggccgccgc Protospacer ... * ******* ****** ********
44. spacer 1.1|747570|30|NC_018079|CRISPRCasFinder matches to NZ_CP032695 (Rhizobium jaguaris strain CCGE525 plasmid pRCCGE525c, complete sequence) position: , mismatch: 8, identity: 0.733
caccactgtcgccgttatcattgccgccgc CRISPR spacer tattgccaccgccgttaccattgccgccgc Protospacer .*...*...********.************
45. spacer 1.2|747624|30|NC_018079|CRISPRCasFinder matches to KP881232 (Sinorhizobium phage phiM9, complete genome) position: , mismatch: 8, identity: 0.733
tgccattgtcactgttaccgccattatcag CRISPR spacer tgccattgccaccgttaccgccagctccgc Protospacer ********.***.********** . .*.
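The mismatch count and identity reported for each hit above can be recomputed from the spacer and protospacer sequences. The following is a minimal sketch, assuming an ungapped, equal-length comparison; the function name `compare_spacer` is hypothetical and is not part of CRISPRCasFinder. The marker convention (`*` for a match, `.` for a transition, a blank for a transversion) is inferred from the alignment rows listed above, not taken from the tool's documentation.

```python
def compare_spacer(spacer: str, protospacer: str) -> tuple[int, float, str]:
    """Return (mismatches, identity, marker line) for two equal-length sequences.

    Assumption: ungapped position-by-position comparison, as the rows above suggest.
    """
    if len(spacer) != len(protospacer):
        raise ValueError("spacer and protospacer must have the same length")

    # Purine<->purine and pyrimidine<->pyrimidine substitutions are transitions.
    transitions = {frozenset("ag"), frozenset("ct")}

    markers = []
    mismatches = 0
    for s, p in zip(spacer.lower(), protospacer.lower()):
        if s == p:
            markers.append("*")
        else:
            mismatches += 1
            # Inferred convention: '.' marks a transition, ' ' a transversion.
            markers.append("." if frozenset((s, p)) in transitions else " ")

    identity = round((len(spacer) - mismatches) / len(spacer), 3)
    return mismatches, identity, "".join(markers)


# Hit 28: spacer 1.2 against the CP052783 protospacer.
mismatches, identity, marker_line = compare_spacer(
    "tgccattgtcactgttaccgccattatcag",
    "cgccattgccaccgttaccgccattgcccc",
)
print(mismatches, identity)  # 7 0.767
```

Trailing blanks in the marker line are lost in the listing above, so compare against `marker_line.rstrip()` when checking a row.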
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
105164 : 116245
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_018079|105164:116245|DBSCAN-SWA TTTATTCAATTTTATTATGAAGATTTTGAATCCATTTTATTTCTTGAAAGTATTTTCTGTTCAGCTTAGTGTAAAAATGAGGGTCTATATCTTTTGCATAAGCAATATAACCTGAAAGTTTGTTAAAATCTTCATCTTTTAATACTCCCTTTGATAAAAGAGAAAGATGTAATCTTATTTTATCTTTCATTGACCTATGTAATGTAACATGGAAATCATGACAAACTTTCAACCCGGTTACTACTATACTGCCACCCGAAGCACTACAGATCTTGAACTTTTTAATATTTATTTTAAAGTCGGGCCCAATTTCATTCATTGCCCTTTTAAAACAATTTAGAATTATTTTACTAGCCCCTTTCATATTTGTAGATACAATAATATCATCTGCATATCGTGTATAAGTGGCATTAAGTTTATCAATTGTGTTTAGTTTTCTCGTCAGTTTTTCATCAAGCTCTCTTGCGACAAAGTTTGCAATTATAGGTGACGTTGGAAATCCGATAGGGAGAGTACTATCTGATATGAAACAGATTGTTTTTATAAGTTGTAGTAACTCTTTGTCATACTCTGTTGTAAATTCAATGCGATCTCTATAACGAGTGAATGCATTTTCAAAATCACTAAATTTAATTGAAGGAAAAAAATCTTTGAGATCTATTTTTACATAATATTTATTTTTTGATTCAGCATGCAATAAGGCGTTGCTTTTTATTGAACGGTTTTTAACAAATGCATATGCAGCATTATGCATTGGCAGTTTAGAAAAAATATTATTCATTAACCAATATTGAATTAGTTTTACTTTAGATGATGGGTGGTATATTGTTCTCACACCGCCTTTTTTCTTAGCTATATCCCATCTTTTTGGAGGCTCAGGTGACCGCATTACTTCAGAAGCAAATCCCTTAGTCATTAATGTTTGAGAATCAATCAGGCTATATATTCTCATATACCTTCATCCTTTCTGGTTTGTTCTTGAGGAAAAAAACCTTAAACATTGATGAGATGTTGTCAATATCAAAATCATATTTAAAATAATATTCTTTATACAGAGAATAATAAACTCCATTTGTGCATGATATTATTCTAATAGCCATTAGAATACCAAGATGCTTTAGAAGGTTTTTTTTGTAATGACTTTCTGTACCAAAAATTTTGGTGATTATTTCGATAAGTTCATTAAGCTGTAAAGGGCCACATACAAAAATTATATCATGAATAAATCGTACTGAGTCTTTATTAAAGTTATTGGAAGGATCTAACTCTTCCTTTTTTAAAGTTCTTGAAATTGCTCTCTCATTCTTAGAAAGAATATCATATAGAGCATCGAATATTTCGCCAATGCCGTCAGAGCGCTCTATACTTTCAATACCTTCTGTCATTTTATAATGTAAGAAATGACCAGATTGTTGTGATTGTTGAGTAATAGCCTTTATTGGTCCCATATTTATAAATGATTTCTCATTTATAAATTTTGTATTGTTAACTATTATTAATTTCTTGCGTAATTGCTTGCTGTATGCGAATGCACCAAGTTCCGTGAAAGATGAATAACTTTCTAAAACAATGATAATATGGTCAGCTAATTTAGATAAGTCAGCTTCGATATCTAATAAATTATCAGATAATGATTCTTCATCGGTGCTTAATTCTTTGAAAACTAGTTCAGCAAGAAAAAAATGACAGTTATTCAAATACCTTTCAGAAAAATTTATTAATTCCAATCGTCTTGCTGATGGTTCTCCATTGTTTTTGTTAGCACCACACAGAAAGACGAATTTTTTTACATTACCAGGACATTTTACTTTGTTCTTTTCAACGAATGATGAAAATAACGCGGTTGCTCTGGATACCTCAGTACCATTTTTTTTTAGCTGAGTCAGCATCGAGAATATCCTTATGCACCTTGATGGTAGAGGACTAA
AAGAGCCAAACCTACTTGAGCACGTCGATCAGTTCGCTGATCGGTGGCCCCCAGCCGCCGCTCAGCGAACTGAACGACGGGCATAGCTAAAATACTAGGCGAAACCATGACTCATGGCCCGATGGGCTATTAACCCATAAAATGCTAGGTTTGGCTCTGTTACGCTCTCATAAACCTCAAGAGGATTCAAGAGGAAAGCTATTGATGTGAATTAATTTTCTTTTATCCAAGTGCTGAGAAAAAAGTTTTTTTGCATTTAACTGTTAACACTGTTCACCTCGGTTATTTCTCATTTTATATCATTAAGTTAAGTGATGATGAGTTGGTGAAGGGTGAACAGTCGACTCTTCACCTTTTTTCGCTTTACTCGTTCCGGGGGCGCCCGGTCAGGCAATACTCGGGGGGATAAAAAGTTTTTTCCGGTTTTACTGTTCACACTGTTCACCTTTGCTTTTTTATCAATAATTTCATACTGATACAGGGTGAATATACGGTGAAGGGTGAACAGTGGAGTGTTCACCTTATGGCAATTGCCGGAAAGAAAAAGACCGGCTGTAGCCGGTCTGGAGTGGGTTATGTCGCTGCGGGTTCGTCGCATTTCGGCAGCCAGTCGCCGTAGCTTTCCTCTTTCAGCGACAGGTTGGTCTGTATCCCCTGCTTTGTGTGCCGCTTCTCATAATTCAGGCCGTACTCTTTCAGCATCATGGGCAGCCCCAGTCCGAACATTTTCAGGCTGAGCACGTTCCTGTACCCGTTAGCCTCCATATAGGCCAGATATGCGTGATAAAGATATTTACGGTAATTACGCGGGATGATGCTGGCATTACCCATAAACATTCCGTTGGTCTGCGGCAGCATTTCCAGATAGCCGCAAAAATCAAACGTCGGGTCAGCATCACGCTTGATGCTGAGCGCCTCGTCGGAATTCTGCTGCGACTGGAGCAGTGCTCGCGCCGTCATCGGGTCGCTGAACCGCTGCATAAGCTGGCGCACAATAACGGCCAGCTCGCGCGCAATTTTATCCCTGAGCTGCGGGTCGCGTTCCTCCGGGGCAATCTGCTCCGGGAAATGAATAATCACCCGGCGACGTGACACACCGCCGCTGCGGTCGGTGAAGCGCATCGGATTGTTGTTGACGGCCAGAATGACCGCCGGAATATGTGTTGAATACGGGTTCTGGTATTTCGGGTCAACCGAAACTGCATCACCGCCGGTGATGGCCTTTAGTCCTGCGCCGTCACCGCTCCATTTTTCCTGGTCAGGCAGACGGATAAGCGAGAAGCCAATCAGGGAGGCACGCTTGCGCGGGTCTTCCAGTGTGTCGATGTCGGCCGACGTGGCGTTATCCTCTCCGGCGAGCAGGGTCGCGATTTCGGCCAGAATACTTTTCCCACTTCCGCCGGGACCGGTGACTTCGAGAAAGAGCTGCCAGTCGTAGCGGTTCGCCAGCACCATAAACAGCGCGGCCAGAATCACGTCGCGTTTTTCTGGTCTGCTACCGGCAGCACGGTCGAGCCAGCGCCAGAAATTCGGGGCATGAGTTTCCAGCGTTTCCCCCTCGACCGGCGGAGTAAAATCCACGTCGCACAGCGTGCGCAGCCAGTGTGATTTGCTGTGCGGGCTGAACAGGCCGCTCCGGGTATCGAGTACCCCGTTGCGAAAACCAATCAGACGGCGTGCCGGTGCGGCCTGCTGAGGAATAATCAGCTTCAGCGTCTCCACCACTGAGGTGATTCTCCCCGACGAGAACGGGGCGCGCAGGCGCTGGAAAAGCCCGGCTACGTCGCGTTCGAAACTGGACGGCTGAACGACTTTCCATATCCCTGCTTCATAACGGGACAGGAGCTGGCCGTTCGCATCCACCGCCAGCGCTTCGCCGTAATGCTCATGCACCCGCAGAGCCTTGTCGCTGGCGCTCATGGCGGTAAATTCCGCCTCGCTCATGGTGTCGAACGGACTTTGTACCGGTGGCCGGATGGCATCATAAATCGCTTTTCGCG
TGGCCTCCCCGCCATTCTGCATAAACGCATCATTCCAGTCACCAAACACCGGCGGCAGGGCGACAATGCCCTCACAGGCTTCTGCGGCCGCGGTGGCTTTGTTCTGGCCGTCGCCGTTAAGGTCACGGTCAGCAGCGAGGACAATCTGACAGGCCGGGTGTTTCTGACGGGCAAGGCTCGCCAGAGAAAGGAGGTTCACGGACGACAGTGCCACCATGACGGTTTCGCCGGTCAGGTGATGCACGGTGAGTGCCGTCGCATAGCCCTCCGCAATCCACAGGCGTTTTCCTGCCTGTTTTTTCCCTTCGATGGTGTGGCACGCCCCTTTTACCGTCCCGCCTTTCAGGGTGCGTTTGAGCCCCTCAGAATTGATGAGCTGAAGGTTAACCAGTGCGCCGGTCTCGTCATACAGCGGGATAACCACATCACCGGCGCGGAACGTCACGCCGCCGGTTTTATGCATGACCGTCATTACCGGACATTCCCGGTCGGGGAATCCCTTGCGGGTCAGATAGGTGTTGCCGCTGGCCGTGCGGGTTTTCTCCATGAGCCTGACGGCCAGTGTGGCCGCCGCTTTACGGTCAGCTTCGGTTTCAGCTTCTGCGGCCGCAATCACTTCGGGGGCAACCGGTGGCAGGTTGCCGGTCACGGCGTTCACCTTTTGGGCTGCCTCTGACGGTTTTACACCGAAAACCTTTTCAACCAGCTTAAGCCCGTCACCCGCGCCGCACTGGTTACAGAACCATGTGCCGCGCCCCTCTTTATCGTCAAAGCGGAAGCGGTCAGAGCCGCCGCACACCGGACAGGCCTGATGGCGGTTTTTAATGACCTTAACACCCAGCGCCGGGAGAATGCGCGGCCAGTGGCCGCACGCCTGTTTTACGGTTTCCGTTACGTTCATTTTCATCGTTATTTTCTCCCTCAGTGCATAACAGGCGATGTGATATGACGGGCGCAGAGTTCCTCCATCACGGCCAGCCCGAGAAAGGACAGCGACGGGGTGGCCTTGAGCGGTCCGGCTTCCATTAAATCTTCCAGCAGCGCACAGGCAATCTGGCGGCCTTTTTCCTCACCGTGCTGGCGCAGATAGAAGCCCTCCAGCTCGTCGGCGATGGCGCTTTCCAGCGCATCAAGGGTGAGGTGTTCGTAGCGGTGCTGACGCTCGCATACCGTCAGCCATGCACAGGCCACGGCGCGGCGATAGAGCGCGGCGCGTAATACGGGTGAAAGAGGCTTTTTCATACGTTATCCTCCCCGGTCAGCCAGCGCTTTTTGCAGCGCTCGACCACGCCGTCGAGCTGGGCGGTCATGAGGTAAATCACGGAGGTGAGCTGCAACTGCTGCGCCGGGTCGCGACAAACGGAGGTTGAGTCCTGCACCTTCATCAGCTCGTCAACGAGCTGGCCGACATTACGCATATGCTCAAGGCACTCAATATCATGAGCGGTGATAAGGGTCTGTTTCATGGTCGCACCTCCGTAACCGGCAGGCGGCCAGCGAACGACAGAATGTAATCACGAACGAGGGAAAGGCGTGCGGCGTGTTCATCACCGGCAACGGTGCGGAGCATACAGATACGGGGTTTACGGTCTGCCCGACGAACGGCGGCAAACACAAAGACAAACTGCGGATGTGACGGGGTGAGGGTCGTAGCCATAGCGGCAACCTCCATAGTTAGCTGTTTACAGCTACCACCGAAGTTCTCACGCAATGGTGGTAGCCCAGACGGGGGTGAGAAACCGGCAACTATGGAAACCGGCCAGCCCGAAAGCTGCCCCGCCTGAGCCACCATTATTTTGACGGCGCAACAGACAAAGAACCGTTGCCCGATAAATGGGTGCACAAAAGCATAGACACAAAAAAAGACGCATGGCGCGTCCGGTGTCGCCATAGTTAAACTCGGGTTCTCACGCCCGGCTGCCGATTTTGCGACAGCGGGAAAACTATACCTGGAAACGTCGACAGGAAGCAAGCCAGAAAAAGGGGCTGTTTGCTGAACG
GTCATCATCATGCGTCACAGCCCCGGTTGCGCTCGGCAATGCGATCCGCCATCCATGCGGTGATTTCAGACTGCGCCCACGCCACGTTTTTTCCGCCGAGGCTGATTTGTTTCGGGAAAGCCTCCCGGCTGATGAGGTCGTAAATGGTCGACCGGGACAGACCGCACAGATGCATCACTTCGGGCAGGCGGATAAAGCGTTCCTGAACGGCATCAGAGACCGGCATCATTGGGGCGGCAGGGGCAGAAGACGGGAAAGAAAAAGCGGTGTGCATCGGGCTACCTCATAAAATCCATACTGTGCCGGTCGTGTCCGTCCGGCTTCGGGTAGCTCCTTATTATGTCTATATTTTTCCTCAGGTCATGTGAGATTTTCGGGGAAACAAACATTGACTTTTCGCTATGGCAAACAAAGGAAAACGCTGGTAAACATATGCAAATCACTGCATTACAATGCAGCAATTTCTATTGCTTTTAGTTATACGTTTTTCTTTTTTAATCGAAATAAAATCTAAGTGGTATAGACAGAGCAAAACAGGAGGGTGAACAGTGGTGAACAGACGGTGAACAGTCAGACCTGCAACTGTTCACCCTTTAAATTACTGTATTACTTATATTTTTATTTATGGTGAACAGTGGTGAACAGTTATCAGTAAAAAAACAAACGATGAGTAAGGTTTTGCTGAGACCTTTCTCTGGCCAGCCGGGTTTTGAGTGCTGTTTGTGCCAGAACTGCCACAACTGCAATGAATCGAGATGTTGTGTGATGAAGGGCAGAATCATTTCAGGTTGAACACACGGAGAGCCTGAACATGAAACCCGAAACAGTCATTACCGCCCTGCAGAATGTAGCCGCTCAACAGTCAGCAGAGAGCAGCCAGCGCCTCACCGACAAGCTGAGCGCATTCACTGCGGCCAGAGACACGCACGCGGCCAGTATGCAGGAACTGAAAGAGATTGATACGGCCATTGAACGCTGTAAGCAGGAGCGGCAGACCGCCCTCAGTGAGAGTGCAGAGGCGGAGCAGGACTGGCGCAGTCGCTTCCGCACCCTGCGCGGCAATCTCACCCCTGAAATGAAAGCTGAGCACAGCAAGCGTATCGCCAGTCGCGAGCTGGCCGACGAGTTCACCGGCCTGATTGCGGAGCTGGAAACTGACCGGACACGCGCCATGCTGAATGCCTGCTCGACCGCCAGTAAATACCTGTCAGCGCATGATGATGCCTTTACCACTTATGCCGGTGCGGAATGGGCTCAGGCGGTCAACACGCTCCCCGCCGCGCTCATTCGTGCTTTCCTGCTGCGCATTCGCGCCCTCGAAATGCAGGGTGACAGTGCACCCCAGTCCGTGGCCATCGGCGAGCTGCGCGATGCGCTGAGCCGTCAGGGCAGCCTGTATCACTTCGATATGAAGCAGGAGCCGGTATTGTCCGTGACGGGCATGCACCGGCCGCAGATTAACGGTGTGGATATGGAGCTGTTACGCAGCCCGGCGAAGAGAATGATGCTCGCCAGAAAACTGGCTGATAATGGCAAGACAAAAGCGGAGGTATAAGCATGTTTCACTGCCCGTTCTGCAAAACCAGTGCTCACGCCCGCACCAGTCGCTATCTGTCTGAGAACGTCAAACAGCGCTATCACCAGTGCGTGAACGTCGAGTGCTCGGCGACCTTCCGTACGCTTGAATCCGTTGACGGGATTATCCGTTCACCGGTTACTGAGCCGGTTATCCCTGTACCCGCACCGGCGGCCACCGTTAACCGTGCCGGTGCGTGAGCACGGCCAGACATCAGGAGAGATATACGTGACCACACTGACGCTACAGAAAGCCTTTGAGGCCTGTCAGGCAAATAAATCCGCATGGCTGCAGCGCCGGGATGAACTGAAGCAGGCCGAACAGGTGTACCGTGAACAGCTTGCCGGTAACGGCCAAAGCGGCCGGAGCCTGAAAACCCTGCGCGAAATTATCGACGTGAAAAAGTGGGAAATTAACCAGGCTG
CCGGGCGTTATATCCGCTCGCATGAGGAGGTGCAGCGAATCAGCATCCGTAACCGTCTGAATGATTTTATGCAGGCACATGGCGCGGAGCTGGCCGCCGCCCTTGCCCCGGAGCTGATGAATTATTCCGGGCAACATTCCGCCATTCAGCGCTGCGCCATGCAGCACTCACTCGACTATCTGCGTGAGGCGCTGCAGGTCTGGCTGGCCGCCGGTGAAAAAATTAATTATTCGGCGCAGGACAATGACATTTTAACGGCCATCGGATTCAGGCCTGATGCGGCTTCGCGAGATGATAGTCGTGAAAAATTCACGCCAGCACAGAACCTGAATTACACCCGCCGCCGTGCAGAACTGGCCGTGCGGTAGTCCGCTTAAAAATCCCAGAACATCCCGCCATTTTTACGTATAAAAGCCATGCATGCATAAGGTGCATGGTTTTGCATGCGTTTTACCGACACAGGATCCCCCGCCAGCGCCAGCACTGGCGTGCCCTGAGGCCGGTCATGCACCTGCATTAAAAGCGCCCCCTTAAGCGGGCAGGCGGGGCGGGGAGAGCATTGCGCGCCGGGAAAGCATGTTCATGAATTTAAATCTACAGAAAGCAAATAAAGGGTAACATGGACAATGGATGTTTCTCAGTCAGATATTTATTTTTATAAAATAAAAGGGAGCCTTAGGCTCCCCTTACAAAATTATTTTTTATTAAATGGTGTTTTTTAATGTCTTTAAAGATATGAGCCTCTTTAGTCTGCACGCGATTATCTATCCAGTCATCCATCATTGCGTCCACTTGACTAAACAGTCCCTGAGCTTCATATTTTAAATAAACTTTTTCAAAGTCGTTATTAACGGAGTAACCATCGGCTTCGTTACCCTTTGGATTTGACAACCTTGCGTATCTGAGCTTTTCCATCATCAGCCCTCCTATCAAAATATCGAAAACAGCATTCGATACACTATAATATATAGCACAAAACTTGAGCGATTTCCACTTAAGTTTGATTTTAATTTGTTGGTGTATCTACATACATTTTGTAAATAACTTCTGAATATATTGCTATTAATGAACTAAGATTTAAAAGGAAGTCTTGGTCCCTCTCAAGCGAGAATTGGTCTGGCAGCGCGCTAGTAAGAACAAGAACCCCAAGACATTCTTCTTCATTGGAGTTTTGCTGGTAGGCAAGAATAGGAACGGAAAGAAATGAACAATAATTCTTTCTATCGGCAGGATTGGATATTTTTAACTCATTACTTTTAGTAATGTCTGGGCATATCTTCAATTCCTTATGTAAATATGTCAAGCCTACGTGTCCAAATCCAGGTTTCCAGGCTCTATTTTTCTTATTAATTCTGTCATCACAAATTCTAGATATGACTTCTAGATGATTTGTTTCTTTGTTATAAATGTAAAGAGCTATATTGAATTTTGACTCACTTTTATAACCAAATAGCGTTTCTCTGTTTCTGGATAAATAACCAAGAACAGTCGAAAAAAAACTTGCGGTATCAATAGGAAAGTCTGCGAGGGTCAGTTTTTTCTCTTTGTTGGCAATAAGTATTTTGCGAATATGGCTATTGACTTGCAATGTTGTTAAATATATAACAAGTTTTTGAGTGGTGGATAATTTATCCTCTTGCACCATTCTTTTCCGAATTATCTCAATCTCACTTACTCTCTCATTATATTCTTTTGTTAACTCACCAAGTTCGATTGTAGAGTCATCTTTTTTATCTAAATATTTTGCACCCCACGCAAGGATCAAATGTAGGATGAAAAACATAATACAAGCTATAAAAAAGAAATCACTTATTCCATTGTTAATATCTTTTTTTTGAAAAACATACGATACACCAAGGCCAATTATAATAGGAGCAAATAATGCGTTCCACAAACTTGTGAAACTTGCAAAATAACCTGCAAACCGTGCATTGATGTGTCTGTTTTTTCTTAAGATTCTTTCTATTTTTCTATTGTCCATGACTCACCTTAGAATATAA
ATAATCAGCATACCATTGTAACATTATTCTTCTGTTTTCTATATGAAGGGCATGATTATAGATCCCTCGTATCGAGTTCTTATCTATATGAGCTAATTGTAATTCAATCCACTGAGTATCAAAACCTTTTTCATGCAAAATTGTCGACATCGTATGTCTAAATCCGTGACCTGTCGCGCGTCCTTTGTAACCAAGTAACTCAATCACATGTGACACGCTTTCTTTAGAGATCGGTTTACTACGGTTATTCCTACCAATAAAAATGTAGGGATAATTTTGGGTTATTGGTTTGAGCTGCTTGAATAAATCGATCACCTGAGTAGATAGGGGTACAATGTGAGGTCTACGCATTTTCATGCGTTCAGCGGGGATCTCCCATATACCTTTCTCAAGGTCAACCTCTTCCCACGTAGCAAAGCGCATCTCCTGCGTTCTTACACCAGTCAACATGACTATCTTCGTAGCATTTTTGGTGATGATGCTACCGGTATACGCTTCGAGATCCTGAATAAAATGAGGCAATTCTTCGGCAGATAGAAATGGATGATGTTTTTGCTTAGGAACAGCCAGAGCGATGGCTAAATCAGGCGCAGGATTGTATTCAGCGCGGCCAGTTATAACTGCGTAGCGATAAACCTCACCGCATCTCTGACGCACCTTACGTGTTTTCTCAAGCGCTCCACGATTCTCTATTCGTCGCAATACTTCAAGTAGTTCTAACGGTTTGATTTCACTGATAGGGCGTTTACCAATGAACGGGAACACATCTTGTTCAAATGTCTTAATGATTTCTTCGCGATAGGCCACTGTCCAGCGGTTTGCTTTGTTGGCGTGCCATTCTCGACATATAGATTCGAATGAGTTTTCAGTGGAGAGTTGCTGAGCTAGTTTCTGGGCTTTGCGTTCCTGAACCGGGTCAATGCAATTGGCAACTTGCTTACGAGCGGTCTCACGCTTCTCACGTGCCTCAGCTAGGCTCACAAGGCCGTAGCTACCAAATGACATTAACCGCGCTTTTCCGGCAAAGCGGAAACGGAAACGCCAGCCCTTCGAGCCATCTGGATTGATAAGCAATGACAGGCCTTGCCCGTCGTTCAATGTGTAAGGCTTGTCTTGGGGCTTTGCTCGTTTGATTTGTATATCTGTAAGTGCCAT
Protein sequences of DBSCAN-SWA_1 >NC_018079|105164:116245|114096_115071_-|WP_014830135.1|DBSCAN-SWA MDNRKIERILRKNRHINARFAGYFASFTSLWNALFAPIIIGLGVSYVFQKKDINNGISDFFFIACIMFFILHLILAWGAKYLDKKDDSTIELGELTKEYNERVSEIEIIRKRMVQEDKLSTTQKLVIYLTTLQVNSHIRKILIANKEKKLTLADFPIDTASFFSTVLGYLSRNRETLFGYKSESKFNIALYIYNKETNHLEVISRICDDRINKKNRAWKPGFGHVGLTYLHKELKICPDITKSNELKISNPADRKNYCSFLSVPILAYQQNSNEEECLGVLVLTSALPDQFSLERDQDFLLNLSSLIAIYSEVIYKMYVDTPTN >NC_018079|105164:116245|112894_113458_+|WP_014830133.1|DBSCAN-SWA MTTLTLQKAFEACQANKSAWLQRRDELKQAEQVYREQLAGNGQSGRSLKTLREIIDVKKWEINQAAGRYIRSHEEVQRISIRNRLNDFMQAHGAELAAALAPELMNYSGQHSAIQRCAMQHSLDYLREALQVWLAAGEKINYSAQDNDILTAIGFRPDAASRDDSREKFTPAQNLNYTRRRAELAVR >NC_018079|105164:116245|111901_112645_+|WP_014830131.1|DBSCAN-SWA MKPETVITALQNVAAQQSAESSQRLTDKLSAFTAARDTHAASMQELKEIDTAIERCKQERQTALSESAEAEQDWRSRFRTLRGNLTPEMKAEHSKRIASRELADEFTGLIAELETDRTRAMLNACSTASKYLSAHDDAFTTYAGAEWAQAVNTLPAALIRAFLLRIRALEMQGDSAPQSVAIGELRDALSRQGSLYHFDMKQEPVLSVTGMHRPQINGVDMELLRSPAKRMMLARKLADNGKTKAEV >NC_018079|105164:116245|106101_107052_-|WP_014830125.1|DBSCAN-SWA MLTQLKKNGTEVSRATALFSSFVEKNKVKCPGNVKKFVFLCGANKNNGEPSARRLELINFSERYLNNCHFFLAELVFKELSTDEESLSDNLLDIEADLSKLADHIIIVLESYSSFTELGAFAYSKQLRKKLIIVNNTKFINEKSFINMGPIKAITQQSQQSGHFLHYKMTEGIESIERSDGIGEIFDALYDILSKNERAISRTLKKEELDPSNNFNKDSVRFIHDIIFVCGPLQLNELIEIITKIFGTESHYKKNLLKHLGILMAIRIISCTNGVYYSLYKEYYFKYDFDIDNISSMFKVFFLKNKPERMKVYENI >NC_018079|105164:116245|110011_110332_-|WP_014830127.1|DBSCAN-SWA MKKPLSPVLRAALYRRAVACAWLTVCERQHRYEHLTLDALESAIADELEGFYLRQHGEEKGRQIACALLEDLMEAGPLKATPSLSFLGLAVMEELCARHITSPVMH >NC_018079|105164:116245|110328_110556_-|WP_014830128.1|DBSCAN-SWA MKQTLITAHDIECLEHMRNVGQLVDELMKVQDSTSVCRDPAQQLQLTSVIYLMTAQLDGVVERCKKRWLTGEDNV >NC_018079|105164:116245|112647_112866_+|WP_014830132.1|DBSCAN-SWA MFHCPFCKTSAHARTSRYLSENVKQRYHQCVNVECSATFRTLESVDGIIRSPVTEPVIPVPAPAATVNRAGA >NC_018079|105164:116245|113765_114008_-|WP_043951541.1|DBSCAN-SWA MMEKLRYARLSNPKGNEADGYSVNNDFEKVYLKYEAQGLFSQVDAMMDDWIDNRVQTKEAHIFKDIKKHHLIKNNFVRGA 
>NC_018079|105164:116245|107663_109997_-|WP_014830126.1|DBSCAN-SWA MKMNVTETVKQACGHWPRILPALGVKVIKNRHQACPVCGGSDRFRFDDKEGRGTWFCNQCGAGDGLKLVEKVFGVKPSEAAQKVNAVTGNLPPVAPEVIAAAEAETEADRKAAATLAVRLMEKTRTASGNTYLTRKGFPDRECPVMTVMHKTGGVTFRAGDVVIPLYDETGALVNLQLINSEGLKRTLKGGTVKGACHTIEGKKQAGKRLWIAEGYATALTVHHLTGETVMVALSSVNLLSLASLARQKHPACQIVLAADRDLNGDGQNKATAAAEACEGIVALPPVFGDWNDAFMQNGGEATRKAIYDAIRPPVQSPFDTMSEAEFTAMSASDKALRVHEHYGEALAVDANGQLLSRYEAGIWKVVQPSSFERDVAGLFQRLRAPFSSGRITSVVETLKLIIPQQAAPARRLIGFRNGVLDTRSGLFSPHSKSHWLRTLCDVDFTPPVEGETLETHAPNFWRWLDRAAGSRPEKRDVILAALFMVLANRYDWQLFLEVTGPGGSGKSILAEIATLLAGEDNATSADIDTLEDPRKRASLIGFSLIRLPDQEKWSGDGAGLKAITGGDAVSVDPKYQNPYSTHIPAVILAVNNNPMRFTDRSGGVSRRRVIIHFPEQIAPEERDPQLRDKIARELAVIVRQLMQRFSDPMTARALLQSQQNSDEALSIKRDADPTFDFCGYLEMLPQTNGMFMGNASIIPRNYRKYLYHAYLAYMEANGYRNVLSLKMFGLGLPMMLKEYGLNYEKRHTKQGIQTNLSLKEESYGDWLPKCDEPAAT >NC_018079|105164:116245|105164_106115_-|WP_014830124.1|DBSCAN-SWA MRIYSLIDSQTLMTKGFASEVMRSPEPPKRWDIAKKKGGVRTIYHPSSKVKLIQYWLMNNIFSKLPMHNAAYAFVKNRSIKSNALLHAESKNKYYVKIDLKDFFPSIKFSDFENAFTRYRDRIEFTTEYDKELLQLIKTICFISDSTLPIGFPTSPIIANFVARELDEKLTRKLNTIDKLNATYTRYADDIIVSTNMKGASKIILNCFKRAMNEIGPDFKINIKKFKICSASGGSIVVTGLKVCHDFHVTLHRSMKDKIRLHLSLLSKGVLKDEDFNKLSGYIAYAKDIDPHFYTKLNRKYFQEIKWIQNLHNKIE >NC_018079|105164:116245|111097_111364_-|WP_014830130.1|DBSCAN-SWA MHTAFSFPSSAPAAPMMPVSDAVQERFIRLPEVMHLCGLSRSTIYDLISREAFPKQISLGGKNVAWAQSEITAWMADRIAERNRGCDA >NC_018079|105164:116245|115060_116245_-|WP_014830136.1|integrase|DBSCAN-SWA MALTDIQIKRAKPQDKPYTLNDGQGLSLLINPDGSKGWRFRFRFAGKARLMSFGSYGLVSLAEAREKRETARKQVANCIDPVQERKAQKLAQQLSTENSFESICREWHANKANRWTVAYREEIIKTFEQDVFPFIGKRPISEIKPLELLEVLRRIENRGALEKTRKVRQRCGEVYRYAVITGRAEYNPAPDLAIALAVPKQKHHPFLSAEELPHFIQDLEAYTGSIITKNATKIVMLTGVRTQEMRFATWEEVDLEKGIWEIPAERMKMRRPHIVPLSTQVIDLFKQLKPITQNYPYIFIGRNNRSKPISKESVSHVIELLGYKGRATGHGFRHTMSTILHEKGFDTQWIELQLAHIDKNSIRGIYNHALHIENRRIMLQWYADYLYSKVSHGQ >NC_018079|105164:116245|110552_111101_-|WP_071841820.1|DBSCAN-SWA 
MMMTVQQTAPFSGLLPVDVSRYSFPAVAKSAAGRENPSLTMATPDAPCVFFCVYAFVHPFIGQRFFVCCAVKIMVAQAGQLSGWPVSIVAGFSPPSGLPPLRENFGGSCKQLTMEVAAMATTLTPSHPQFVFVFAAVRRADRKPRICMLRTVAGDEHAARLSLVRDYILSFAGRLPVTEVRP |
13 | Enterobacteria_phage(100.0%) | integrase | attL 101415:101433|attR 119193:119211 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
1017098 : 1025762
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_018079|1017098:1025762|DBSCAN-SWA CATGCTGGAACAAATGGGTGCTGCCGCCAAGGCCGCCTCTTACAAACTGGCGCTCCTTTCCAGCCGCGAGAAAAACCGCGTGCTGGAAAAAATCGCTGATTATCTGGAATCGCAGTCGCAGGAAATCCTCCTCGCCAACGAACAGGATCTGCTGGAAGCGCGTCGTAATGGTCTGAGCGAAGCGATGCTTGACCGTCTGGCGCTGACCCCGGCACGCCTGAAAGGCATCGCCGACGATGTTCGTCAGGTTTGTAACCTGGCCGATCCGGTCGGGCAGGTGATTGACGGGGGACTGCTCGACAGCGGTTTACGTCTGGAGCGCCGCCGCGTGCCGCTCGGCGTTATCGGCGTGATTTATGAAGCCCGCCCGAATGTGACGGTGGATGTCGCTTCCCTGTGCCTGAAGACCGGTAACGCCGCCATTCTGCGCGGCGGGAAAGAGACCTGGCGCACCAACGCCGCGACGGTGAAAGTCATCCAGCAGGCGCTGCAGGAGTGCGGCTTACCGGCAGGTGCCGTGCAGGCCATCGAAAGTCCGGACCGTGCGCTGGTGAACGAGATGCTGCGCATGGACAAATACATCGACATGCTGATCCCACGCGGTGGCGCGGGCCTGCACAAGCTGTGCCGTGAGCAGTCGACGATCCCGGTGATCACCGGCGGTATCGGCGTGTGCCATATCGTGGTGGACGAGAGCGCCGAGATTGCACCGGCGCTGAAGATTATCGTGAATGCCAAAACCCAGCGTCCAAGCACCTGTAATACGGTGGAAACGCTGCTGGTGCATCAGGGTATCGCAAACACCTTCCTGCCAGCCCTGAGCAAGCAGATGGCCGAGAGTGGCGTCACGCTGCACGCGGACGAGAAGGCGCTTGCGCTGCTGAAAGACGGCCCGGCGACGGTGGTGCCGGTAAACGCGGAGCAGTACGACGACGAGTTCCTGTCTCTGGATCTGAACGTGAAGATCGTGGCCGATCTCGACGACGCTATCGCGCACATCCGGGAGCATGGTACTCAGCACTCTGATGCCATCCTGACGCGCACGCTGCGCAATGCCGACCGCTTTGTGAACGAGGTAGATTCTTCTGCCGTGTACGTGAACGCATCAACCCGCTTCACCGACGGTGGACAGTTTGGTCTGGGCGCGGAAGTGGCCGTCAGCACGCAGAAACTGCATGCGCGTGGTCCGATGGGCCTGGAAGCGTTGACCACCTACAAGTGGATCGGCATCGGCGACGATACGATTCGTGCGTAAATAATCACGGGTGATGCAAAAATAGCCGTTTGATTCAAAAGGGCATTGACGCATCACCCGGATAGATCTAACCTTTTGCCCCGTGGTTACGCTCGTAACCGGCCTCTCAGGGCCGATATAGCTCAGTTGGTAGAGCAGCGCATTCGTAATGCGAAGGTCGTAGGTTCGACTCCTATTATCGGCACCATTCTAACGTCTCCCCAAGTCTACTAAAGTTCACTGAAACCCCTTATACTCTGCGCTTTACAGCCCCTTTTAGTATTTCTACGTCTACTAAAGTTCCCTGAAATCTACGGTCGTTTGGGGGTACTTATGGGGGTATATGCTGTTCGGTCTAGAGGAGGTACCCCCAAGTGAAACTAAACGCCCGGCAGGTGGATGCCGCCAAACCTAAAGATAATCCTTACAAGCTGGCTGATGGTGGTGGTTTGTATCTCCTGATTAAACCTAATGGCGGCAAATACTGGCGACTCAAGTATCGTGTAGCCGGCAAAGAGAAGCTGTTAGCGCTTGGTGTGTATCCTGAAGTCACATTGGCCGATGCTCGGGCAAAACGTGAAGAAGCGAAAAGGGGTATCGCTGGGGGTATCGATCCTATGGAAGCGAAACGGGAGGAGAAGATTGCCCGTGAAATTCAGTTAAACAACACCTTCAAAGATATTGCCCTTGA
ATGGCACAGCAGCAAACTAAAAAAATGGTCTGCTGGTTATGCTTCAGACATCCTCGAAGCCTTCAATAAAGATGTGTTCCCATACATTGGCAAAAAACCAATAGCCGAAATCAAACCGCTTGAACTGTTGAATGTGCTACGGCGCATTGAAGGGCGCGGCGCTACCGAAAAGGCAAGAAAAGTTAGGCAGCGCTGTGGGGAAGTTTTCCGTTACGCAATAGTCACCGGTCGAGCTGAGTATAACCCCGCCCCGGATCTCACCAGCGCGATGCAAGGGCACGAGTCCAATCATTTTCCTTTCCTCACACCTAAAGAATTGCCTGATTTCTTCAATGCGTTGTCAGGATATTCAGGAAGCGAGTTAGTAGTTTTGGCTGCTCGTTTACTGATTATCACCGGATTGCGTCCCGGCGAACTCCGTGGGGCATTTTGGGATGAAATCAATATCAGTAAGGCGGTCTGGGAAATACCCGCCTCACGCATGAAAATGCGTCGCCCTCATGTGGTGCCATTGTCTAGGCAAGCTCTTACGCTTATTGGTCAGATCCAAGAGCTAACAGGCAATTACCCGCTTGTGTTTCCAGGCCGTAACGATCCGCGAAAAACAATGAGTGAAGCCAGCATAAACCAAGTCTTTAAGCGGATTGGCTATAACGGAAAAGTCACCGGGCACGGTTTCCGGCACACCATGAGTACCATCCTGCACGAACAGGGCTACAACACCGCGTGGATTGAAACGCAGCTGGCACACGTCGACAAAAACTCTATACGAGGAACGTACAACCACGCCCAGTATCTGGATGGCCGCCGCGAAATGCTCCAGTGGTATGCCGACTATATGGAGGCGTTGGAAAACGGCGAAAATGTAGTGCACGGAACGTTTGGGAAAAGCGCTTAACTGTATGTATAGACAGTGCTAATTGACAGTAGTAGACTTCAGTAGACGAACAATAATTAGGCGTTGTCTAGGCTGATCACCGAAATCCCGCTCACCTCTGCGGGCTGGCAATGCCGCTTAAATAAGAGGACGTGGGGTGGCGTGTGACTCGAGCTACAAAAAAAGATTTATCGTGGTTTAACCTCAGTAATTACGATTTTATTAATAATTTAACTCTCTCTGAATTCATCGTTGAGCTTGAGTGGCGAGATTTCCTTTATCGTAATGTAAATGAGGATGATTTATTTTTTGATGAAGAATACGAAATTAAATATCAGCGTATATTTGGAGGGGATCCTCATCTTGATATTCCAAATGAAGAAGAAAAAGAGATTGATGAGTTTGTCCGTAAAGTAAACAGCGAGACTCCGTCTTTGCTAAATATGTACGGTACTCTACCTCATCTACCATCAGCTCTAGGCGTAAGCCCCATTAGTTTTACTGAACTTTCTATGTATGGTTATAGTGCGATAGACCAAGGCTTTTTCAAAAGAGATGATGAGTATTGCTTTATCAAATCTGACGCTATGCTTGCGAGTGTTTCGGGCAACCTAGATGATTGTTTTACGAATTCGGTGCTGTTATCAGTAAATTTGGATGATGCAACTGATGAAGAAATTATAGCCAGCATGGCGAAACTTCTTCCTCTTTGGCGGGAACAGTTGATGTTGCCCGAGCATGAACACGTAGCCCGAAAAAGAATAGGTTTGAAAACTCTTCAAAAGCTCATTAATAACAGAGTGTTGCCAATTCTTGACTTACTAATCTGGGAAAGGCGATTCACCAAAGAGGTTAGTAACCCAATGATAGGTGCATTAGTTTTTGATGACGATCCTAAAGACACTCAAGCCATCAAAGAGACTATCAAGCCATTCGCATTGGAGGCTATGAGTGAGCAGTACACTCGTTTGCTTCGCTTACATATGAGCAAAGATGGAGAAGTTAACTCAGCGAAAATGTCTGATTTAATGAGTAGAATTTTGTAGTCTGTGAAATTCTTAAAAAACACAATACTTGTTTTTCGAGGATTACGCTTTGATAGATCTATGGTTAAAATCAAAGC
GTTTCTTTTTCGAAGAAAAACTTTTGTTTTATTATCTCTCCCGGCTAGAAAATAGATTGTTTACCCTTTCATCATCTACTCGCGTCTACTTAAGCGCATTTAGGAATAAAGGAGTAAACATGTCTAAAGCCCTCATTCGCTTACCTGAAGTTCAACGTCGTACTGGCTATAGTAAGGCTTGGATTTATCGCCTTCTTAAAGAGCGTAAATTCCCCCAATCTGTAAAAATAGGTTCTCGGTCGATCGCGTTTGTTGAAAGTGAGATTGATGCGTGGATAACCCAACGGATTGAAGAGCGAGATGCTTTACTCGTCAGAAGACCACAACTGTAACTTAGCCCGGGAAAACTAACATGACTATCAAAAATGCCCGTGCCGGGCAGGGTTTTGCTCACCCTGAAAACAGCAGCGATGATATTTCGGTCATTAAATTTGAGGATGTAAAAGTACGTATTGTTAAGATCTTGGGCGAGCCATGGTTTGTAGCGGCAGATGTATGCGCAGCTCTGGAAATAGCTGATCACAAGGTTGCTTTGCGACGTCTTGATGATGATGAAAAGGGGGAGTGTTTAATACCCACCCCTGGCGGAAAGCAGACTATGCGAACCGTATGCGAGTCAGGATTCTACAAACTGATCTCACGTAGTCGTAAGGCGACTACTCCCGGCACCTTTGCGCACCATTTCAGTAATTGGGTATTCCGTGAGGTCATTCCCTCAATTCGTAAAACTGGCTTCTATGGAGTGCCGTTCGTGTTCCTGAACGACTTCAGCCGGCGCATGGCTGCTTACCAGCAGGAGGCCAGCAAACGCGGGTATAAATTGCAGCAATGTAAAGGTGTAAAAGAGGCTCTTGAGCGGGAAGAGATTCAGTTGTGGCTTAAGTATCAGCCCGAGCTATTGAAGGAAAATGGCGATGAATAAAAAGGCGGAAAGACGCCGGGATTTTTACCCGGCAGAGAGCATGCTTAATCAGCCCTTTGGTTCGATACCACGCTGCTGGAGTTCCTTGCGAATAATTCTCTTAAGCCATGCGGCTAGAGACTCATCACCATCTTGCTGTTGCGCTCGTTCCATCATCTCTCGAAGCTCTGGATCAAGCCGAAATTGGAATGGAGGATTGCCTCGTCTCTCGTTTTTGTGTGTTGACACGTCAATTACACCCGATGTAATGTGTTTATGTGTAATGACACATTACACACAGGAAATGAAAAAGACAACGCCCCGAAGTGCGGGAACACTTTCAGGGCGTCTAACCAAAACGTTAGTTGAGGTAACATTATGGCTTGCACTAAGTCTACCCAAACACGCCCTGAATTTACATGGCGTTTTCTCACCTTGGGTGAATTCACAAATCAGATCGTCAATGTTACTGCTTCCACCGAGCGCGAAGCCCGCGAAAAAACGCCAGAAGGATGTGTCTGTATTCTGGCGTGTCGATTTCGTGTTGAGGAGGTGCAGCATGTTTAACCTCCAGACCCTTACAGCTAAAGCCCGCGAGCTGCGCGGCAACGTGGTAAAAGCCACTACCACGAAGGGCACCCGCACCATGACCCCTGTTTACGAACGGGAAGAGCAGCGCAAACTGCGCGAACGTATCCAGCAGACCCAGCCGGACTGGGTTTTACTCTGGTGGGATATTGCGACCGTTACCGGCTGGCGTACCAGTGACGTGTGCAACTTTCGTTACTCGTGCATCAACTGGGAAACCGGCATTGCAACAATCATCGTAGCGAAGCAGACCAAAGCAGCGGAAGCCAGAGCGACCCGGAAGGGGATCGAGATTGTACGCCAGCAGCGAAAGGACGCTGCCCGGCTTGCTGGCGATCACATTGGGTACATGCACTGGGATAGCGTGAGCTGCGACGAGCTGGCCGCCGGCATGACGGAAGAAGAACAGGCGATCGTGTTTGAGCTGGTGGCAAAGGCTGAAGTTAAGCACGACACCAAACAGCTGCCGCCGGGCATTATCAAGCGGCTGCGCGAACGCATGGAGCGCAAT
CTTATCGGTGACGACCTGGTATTTTCCCGCAGCCAGATCGAAAGTAACCGTTGCCAGTCTCTGGAAGGTAGCGTGAGCCGCCAGACAATCTGGAAGAAACTGCACAACGTAATGCTGTGGTTTACGCGCGTCGTAAATACGCGTCTGCGCCTGAGCGCCTATTCCAGCCGCAAAATTGCCGCCTTTAATCTCATGTCCGCCGGCGGCGAACAGGGCTTGCTGGTCGCCTCTGAAATGCTCGGGCACAGTAACCCGGCAATCACCCGAACTTACCTCCAGTTAGGCAGTAAGGCCTCCGCCATTCAATCCCGTCTGGCCATGGAGGTATCTGTATGAAAATGGTTATCCAATTTTGCCGTCTCGGCGATTTTGCCGATCACGTATCTAAGCAATTAAATAGTGCGCGATATTGTTTTGCCAGCCAGTCATTAAGGGAAGGGGAGGTGAAATTATGACTCCTGTTTACGATCTGGTTCGCCGGGCCGACGGCAAAAACGTGTTCAGTTTCCCGGCCGGCGGCCGCTATCTGGTGGACACGTCAAATGGTCTTCAGTCGATGCGCCCCCTTATGGACGACGAGATCATTTTTACGGTGGAGAGTGCCGCGCGCTTTCTGAAGAAAATTGGTTATCAGGTAATCCCGCCAGCGGCGTGAGGTAAAAAATATGACGATTAAAAATTCCGGCTTAGCTGCTGGTGCCCGCGCTCATCCTGAAATCAGGCCGGGCGATAAATGGAAGGACAGTCGGGGCAATATCGTAATTATCGAAAGTTACCGATTCGACAGAGTGACATATTGCCGCGAAGGGTACAGCTCACCGTGTTTTTGCACGCCAGAAAGACTGGCGCGGGAATTTGAATTTATTTCTTCTGCGCCGGGCTCCGGGGGAAGAGATATCGATCGGATTATGCGGGTGCAGGGCATCGAACGAATTCGGGTTATGCGGGAAATCATCAGGGAGCGAGGGAACAGAAAATGAAGAATGCACCAAACCTTAAAAAGCAGCCGGCGGATCTCATGGAGGAGTCAATCATCTTTGCCGGCGCCGATGCCTGGACTTTCGCCAAAGCATGGCAGGAAATGAACCCGATTGGCGATACGGTGCCGCCGGTTGTGCTGGATAAAAAGCAGCTGGCAGAGCTGGAGAATATCCGGATTGTGGATGATGGCCGGCTCTATGCCCGGGTTTGCCGTGGCGGGCATCTGACCGAACGGCAGATAACCATTCTCGCTACAAAGCTGGCGGTGGCCGGCGTGGAGCGCGCGCAATTCTACTCTGAAGGTTATCAGCTTCTGGAGGACTGGACGCCGCAGCTGCCGCGCCTCAAAGCCGATGCGGAAGCCGGCAAAAGCATGGTGATCGGCAAACCGCTGACGGATGTAAACCTCCGCGACCTGGCTGATAACGAAAAGGCGCTCATACTGGCCGCGCGTTACACCGGCATTGCAATCAACGAAAACAGCGAGGGCGTGTACGTCTACCGCGCCGGCATCTGGGAGAAAACGTCACTGCTTGAGCTGAGCCGTGAAATGGTGGCTATCTACAACGAGAACAAAACCAACTTCAGCAAGCGCGCGATCAACAACGTTATCGACGCCCTGAAAATCGTTATCCCTGTAATGGGAGAGCCGCGGCGCAGCCTGATCCCCTTTGCAAACGGCGTCTACGATATGGAAACCGGCGTTTTCTCTGAACACAGCCAGGATAACTGGCTGACCAACCACAACGGCGTGACCTACACGCCGGCGGTGCCGGGCGAAAACCTCCGCGACCACGCGCCGAACTTCCATAAGTGGCTAAGTTACGCATCAGATAGAGACGCAATTAAGATGCAGCGTATCGCTGCGGCGCTCTTTATGGTTCTGGCGAACCGGTACGACTGGCAGCTGTTCCTCGAGATAACCGGGGAGGGTGGCAGCGGGAAAAGTGTCTTTACCCATATCGCCACGATGCTGGCCGGCGCGCATAACACTGCGAGCGGGAACATGGCGGCGCTCG
ACAGCGCACGCGGACGGGCGCAGTTCGTCGGTAAGAGCATGATAACGCTTCCTGATCAGCCCAAATATTCAGGTGAGGGCACCGGGATAAAGGCAATCACCGGCGGGGATGCGGTGGAGATCGACCCGAAACACGAGCATCAGTACACCGCCGTTCTGCGGGCGGTGGTTGTGGCCACGAACAACACGCCGATGATTTTCACCGAACGTGCCGGCGGCGTTTCCCGGCGCCGCGTAATTTTCCAGTTTAACCGGCGCGTCAGCGAAGAGGATAAGGATCCCGACCTGGCAGAGAAGATATCCGCTGAAATTCCGGTGGTTGTTCGTCGGCTGCTGGCGAACTTTGCGGACCCGGAAAAAGCGCGGGCGCTGCTGCTGGAGCAACGGAACAGCGAAGAAGCACTGGAGGTGAAGCAGAAAACGGATCCGCTGTATGCCTTCTGCGCTCACCTTGAGCGACTGGCTGATTGTGCGGGAATGATGGTAGGAAACCGCAATCCGCCTCACTATCCGCGAATTTATCTCTATCACGCTTATCTGGCATTCCTGGAGGCCAACGGTTTCGACAAGCCGCTGACGCTGAATAAATTCGCAGAGGGGATGGAAAGCGCGATGCGGGAGTTTAATCACGAGTACCGTAAGGAACGTAGAGCCCGTGGCATGGTGACTAACGTCGAACTTTCGGAAAGTGCGGAAGACTGGTTACCTCAGACGCATCCTGTAGCCGGTCATAAAGAATGA
Protein sequences of DBSCAN-SWA_2 >NC_018079|1017098:1025762|1022482_1023358_+|WP_014830795.1|integrase|DBSCAN-SWA MFNLQTLTAKARELRGNVVKATTTKGTRTMTPVYEREEQRKLRERIQQTQPDWVLLWWDIATVTGWRTSDVCNFRYSCINWETGIATIIVAKQTKAAEARATRKGIEIVRQQRKDAARLAGDHIGYMHWDSVSCDELAAGMTEEEQAIVFELVAKAEVKHDTKQLPPGIIKRLRERMERNLIGDDLVFSRSQIESNRCQSLEGSVSRQTIWKKLHNVMLWFTRVVNTRLRLSAYSSRKIAAFNLMSAGGEQGLLVASEMLGHSNPAITRTYLQLGSKASAIQSRLAMEVSV >NC_018079|1017098:1025762|1022301_1022490_+|WP_071531908.1|DBSCAN-SWA MACTKSTQTRPEFTWRFLTLGEFTNQIVNVTASTEREAREKTPEGCVCILACRFRVEEVQHV >NC_018079|1017098:1025762|1023473_1023677_+|WP_014830796.1|DBSCAN-SWA MTPVYDLVRRADGKNVFSFPAGGRYLVDTSNGLQSMRPLMDDEIIFTVESAARFLKKIGYQVIPPAA >NC_018079|1017098:1025762|1017098_1018352_+|WP_014830790.1|DBSCAN-SWA MLEQMGAAAKAASYKLALLSSREKNRVLEKIADYLESQSQEILLANEQDLLEARRNGLSEAMLDRLALTPARLKGIADDVRQVCNLADPVGQVIDGGLLDSGLRLERRRVPLGVIGVIYEARPNVTVDVASLCLKTGNAAILRGGKETWRTNAATVKVIQQALQECGLPAGAVQAIESPDRALVNEMLRMDKYIDMLIPRGGAGLHKLCREQSTIPVITGGIGVCHIVVDESAEIAPALKIIVNAKTQRPSTCNTVETLLVHQGIANTFLPALSKQMAESGVTLHADEKALALLKDGPATVVPVNAEQYDDEFLSLDLNVKIVADLDDAIAHIREHGTQHSDAILTRTLRNADRFVNEVDSSAVYVNASTRFTDGGQFGLGAEVAVSTQKLHARGPMGLEALTTYKWIGIGDDTIRA >NC_018079|1017098:1025762|1023687_1024002_+|WP_014830797.1|DBSCAN-SWA MTIKNSGLAAGARAHPEIRPGDKWKDSRGNIVIIESYRFDRVTYCREGYSSPCFCTPERLAREFEFISSAPGSGGRDIDRIMRVQGIERIRVMREIIRERGNRK >NC_018079|1017098:1025762|1021141_1021354_+|WP_014830793.1|DBSCAN-SWA MSKALIRLPEVQRRTGYSKAWIYRLLKERKFPQSVKIGSRSIAFVESEIDAWITQRIEERDALLVRRPQL >NC_018079|1017098:1025762|1023998_1025762_+|WP_014830798.1|DBSCAN-SWA 
MKNAPNLKKQPADLMEESIIFAGADAWTFAKAWQEMNPIGDTVPPVVLDKKQLAELENIRIVDDGRLYARVCRGGHLTERQITILATKLAVAGVERAQFYSEGYQLLEDWTPQLPRLKADAEAGKSMVIGKPLTDVNLRDLADNEKALILAARYTGIAINENSEGVYVYRAGIWEKTSLLELSREMVAIYNENKTNFSKRAINNVIDALKIVIPVMGEPRRSLIPFANGVYDMETGVFSEHSQDNWLTNHNGVTYTPAVPGENLRDHAPNFHKWLSYASDRDAIKMQRIAAALFMVLANRYDWQLFLEITGEGGSGKSVFTHIATMLAGAHNTASGNMAALDSARGRAQFVGKSMITLPDQPKYSGEGTGIKAITGGDAVEIDPKHEHQYTAVLRAVVVATNNTPMIFTERAGGVSRRRVIFQFNRRVSEEDKDPDLAEKISAEIPVVVRRLLANFADPEKARALLLEQRNSEEALEVKQKTDPLYAFCAHLERLADCAGMMVGNRNPPHYPRIYLYHAYLAFLEANGFDKPLTLNKFAEGMESAMREFNHEYRKERRARGMVTNVELSESAEDWLPQTHPVAGHKE >NC_018079|1017098:1025762|1020063_1020945_+|WP_014830792.1|DBSCAN-SWA MTRATKKDLSWFNLSNYDFINNLTLSEFIVELEWRDFLYRNVNEDDLFFDEEYEIKYQRIFGGDPHLDIPNEEEKEIDEFVRKVNSETPSLLNMYGTLPHLPSALGVSPISFTELSMYGYSAIDQGFFKRDDEYCFIKSDAMLASVSGNLDDCFTNSVLLSVNLDDATDEEIIASMAKLLPLWREQLMLPEHEHVARKRIGLKTLQKLINNRVLPILDLLIWERRFTKEVSNPMIGALVFDDDPKDTQAIKETIKPFALEAMSEQYTRLLRLHMSKDGEVNSAKMSDLMSRIL >NC_018079|1017098:1025762|1018705_1019920_+|WP_014830791.1|integrase|DBSCAN-SWA MKLNARQVDAAKPKDNPYKLADGGGLYLLIKPNGGKYWRLKYRVAGKEKLLALGVYPEVTLADARAKREEAKRGIAGGIDPMEAKREEKIAREIQLNNTFKDIALEWHSSKLKKWSAGYASDILEAFNKDVFPYIGKKPIAEIKPLELLNVLRRIEGRGATEKARKVRQRCGEVFRYAIVTGRAEYNPAPDLTSAMQGHESNHFPFLTPKELPDFFNALSGYSGSELVVLAARLLIITGLRPGELRGAFWDEINISKAVWEIPASRMKMRRPHVVPLSRQALTLIGQIQELTGNYPLVFPGRNDPRKTMSEASINQVFKRIGYNGKVTGHGFRHTMSTILHEQGYNTAWIETQLAHVDKNSIRGTYNHAQYLDGRREMLQWYADYMEALENGENVVHGTFGKSA >NC_018079|1017098:1025762|1021374_1021944_+|WP_014830794.1|DBSCAN-SWA MTIKNARAGQGFAHPENSSDDISVIKFEDVKVRIVKILGEPWFVAADVCAALEIADHKVALRRLDDDEKGECLIPTPGGKQTMRTVCESGFYKLISRSRKATTPGTFAHHFSNWVFREVIPSIRKTGFYGVPFVFLNDFSRRMAAYQQEASKRGYKLQQCKGVKEALEREEIQLWLKYQPELLKENGDE |
10 | Enterobacteria_phage(66.67%) | integrase | attL 1011671:1011685|attR 1023622:1023636 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
2203155 : 2211710
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_018079|2203155:2211710|DBSCAN-SWA GTTAACGGTAGAGCGGTTTTTCCGCGACCGGGATCAGTAAGGTAGATTGCCAGTCCGCAAGCGTCAGCTGCGCGAGTTTCCCCAGCGCGATATAGAACGGGTGCCCGGCGTTTTCAACAAACACGGAGAGAAAGCGGGTGCTCCACGGCAGCAGGTGCCATGCCAGAAGCTGGTCGCGGTCCGTCTCGCGTCCGTTCTCAGTGAGCCACGCGGCCAGCAGCAGCAGCGTACCGAAATGATCTTCCGGTTCGTTCTGCTGCATCTCAAAGGCAATCTGGTTTTCCCGCATCCACTGGCGCAGTGCGAGGGTTGAATCCCCAAACAGCACGGATTCACGATCCAGCCATACGGAACCCCACGGCGGTGCGGGCAGCGCATAGGGGCCAATAAACAGCCGTTGCCAGGCGTCGGTGACGGGCTCATCGGCAGGTACCGTAAAGGTGTCGGCGATGGGCTGCAGCACCGCCTGCGGCAGCGGCCAGTCCGCAACCCACTCGCCAGCGGTCAGGGCGTTGACCAGCGGCGCAACATGCTCGCTGTCGGGTGAATAATAAAACAATGCGCCCAGTACCCGGGCGCTGAACGCGAACGCTTCACGCTGTGAGACATCTTTCATTACAACCTTCCTTGCTCGCGCGGGCGAACCCGCGCGCCTGTGTTAACCGGCAATTGCCATACCTACCGTCATGTGCAGACCATAGAACAGACCGCGGCCGATCAGTTCCCCCACGACAACCAGCACCAGCCCTGCAAAGAGCGCGAGGGCTTTCGGCTCCTGGCGCCGGATGAGCGGGCACAGCCAGCAGCCTAATCCGGCGGCGACCAGTATGATACGGCCAACTTGCCATGCGCCGTAGTCCGGCAGCAGGGCGCTGGCCTGTTGTACGGAGCTGTGTAGCGTCGCCAGTTCGTTACTCTGCAGCACAATCACCGCCACGCTCAGCAGCAGCGCCAGTACGCTCACCGATGCGGCAAACGTGCCTTTGATGGCGGTGCCAGCGACGCGAAGCAGCAGGGCGGCAAACAGCGGGCCGCCCAGCACCATCGTCAGGAAAAAGGCCAGCGTGGTGTAACCGGTGTACCAGGTCGGAACCGTATCGATCTGATACACGCGAGTCATGGCCCAGACAAATACAATCCCCAGGATCTGGCTCACCACCATCCATACTTTACCCAACGCGGGCGGCATTTTACCGATAACCGACACGAGCCACCAGAAGCCACCGACGGCAAAGAAGACCGAGCCTGCCGCGATCTCGTTGCTGAGCGCCGACGCGCCAACACGGTTCAGGGAGTTAAACGCCCTGAGCGGGGAGCCCAGGTGCATCACCGATGCGATAAAGCCAACGCCCATGACCAGCCATAAAAAGAACATGCTGCGCACGATGCGGGTTCGGACCGCCTCGTCACTCTCCCTCATCCATGCCAGACCGCTGACGAGCAACGCCCCTGCCACGCACTGCCCAAAAATGGTGAAGATCACCAGCGGCCATTCATGCCATCCACTTCCCATCTCACACCTCCTTCGGGTTTGCCAGGTAGCCGGTGGTATCTCCCGTCGGGCGGCTGTTGGCGTTAGGTTTTATTACAATGCTCGGCTTCGTGAAGTGCGCGGACGGCAGCGGCGCGACGGCCGCAAGCTGGCCGTGTTTTTTACGCAGCTCGTCAATCGGACCGAAGTCCAGCGCCCGCAGCGGACAGGACTCCACGCAAATCGGCTTTTTGCCGTCGGCGACGCGGGTGTGGCAGCCGTCGCACTTGGTCATGTGACCTTTGGCGGCGTTGTACTGTGGCGCACCGTACGGACAGGCCATATGGCAGTAGCGGCAGCCGATGCAGACATCCTCATCCACCACCACAAACCCGTCGTCACGCTTGTGCATTGCCCCGCTCGGGCACACTTTGGTGCAGGCCGGATCTTCACA
GTGGTTACAGGCAATCGACAGGTAATAGGCAAAGACGTTCTGATGCCAGACGCCGTTATCCTCCTGCCAGTCGCCGCCCGCGTATTCGTAGATCCGGCGGAAGCTGACGTCCGGGGTCAGGTCTTTGTAATCCTTGCAGGCCAGCTCGCAGGTTTTGCACCCGGTGCAACGGCTGGAATCAATAAAAAATCCATACTGGGTTGTCATCGGTTACTCCTTACGCCTTCTCGATCTGCACCAGATTGGTGTGCTGCGGGTTGCCTTTCGCCAGCGGTGACGGGCGATGGGTGGTCAACGTGTTGATGCATGAACCGTGGTCGATCCGATCGCCGTTCATGTTGGCGTCGTGCCAGGCCCCCTGGCCCATCGCGCTGACGCCGGGCATGATACGCGGGGTGACTTTCGCGGCAATGCGCACTTCGCCGCGGTCGTTAAACACGCGCACCATATCGCCGTTTTGAATACCGCGTTGTGCCGCATCAACCGGATTGAGCCACACCTCCTGACGACAGGCGGCTTTCAGCACATCCACGTTGCCGTAGCTGGAGTGGGTGCGCGCCTTGAAGTGGAAGCCAAAAAGCTGCAGCGGGTAGGTTTTACGTTCCGGGGCATCCCAGCCGTCAAACGTTGAAGCGTACACCGGCAGCGGGCTGATGGTTTCATCTTTTTCCAGCTCCCAGGTGGCGGCAATCTCTGCCAGCTTGCTTGAGTAAATTTCAATCTTGCCTGACGGGGTTTTGAGCGGATGGGCATCCGGATCATCGCGGAATTTTTTATACGCCACGAAGTGTCCATTGGGATCTTTGCGCTTATAAATGCCCATTTTTTTCAGTTCGTCATAGGACGGAAGCTGCGGATCTTTCTCCCGCATTTTGGCGTACAGATACTGCAGCCACTGCGCCTGTGTCCGGCCTTCGGTGAATTTCTGATAGATATCCGGCCCGAGGCGTTTCGCCACTTCGCTCATGATCCAGTAGATGGGCTTCCGCTCAAACTTCGGCGCGGTGACGGGCTGCAGGAAAATCAGATAGCCCATGTTGCCCGCATAATCGTTCGGAATGATGTCTTCCTGCTCGACGGTCATAAGGTCCGGCAGCAGGATGTCGGCGTATTTCGCCGAAGAGGTCATAAAGTTGTCGATCACCACGATGGTTTCGCACTGGCTTTCGTCCTGCAGAATATCGTGGGTTTTGTTGATGTCGGAGTGCTGATTGATGATGGTGTTACCGGCATAGTTCCAGATGAACTTGATTGGCACATCCAGCTTATCCTTGCCGCGCACGCCGTCGCGCAGGGCGGTCATTTCCGGGCCGCGGGCGATGGCATCCGTCCAGCTAAAACAGGAGATCTGCGTTTTTACCGGGTTTTCCGGCAGAGGCATCCGTTCGATGGTGATGGTATAGGTCGATTCGCGCGCGCCGCTGTTACCGCCGTTGATACCCACGTTGCCGGTTAAGATCGGCAGCATGGCGATGGCACGTGACGTCAGCTCACCGTTGGCCTGGCGCTGCGGTCCCCAGCCCTGGCAGATATAGGCGGGTTTTGTCGAGCCAATCTCGCGCGCCAGCTTGATGATGCGATCCGCCGGAATACCGGTGATGCGCGAGGCCCACTCCGGGGTTTTCGCCGTGTTGTCGTCACCCTGGCCGAGAATATACGCTTTGTAGTGCCCGTTAGCAGGCGCGCCTTCCGGCAACGTTTTTTCATCATAGCCCACGCAGTATTTATCAAGGAACGGCTGGTCGACCAGGTTCTCGTTGATTAATACCCATGCAATTCCGGCTACCAGCGCGGCATCGGTACCCGGACGAATCGGGATCCATTCATCTTCACGTCCGGCAGCGGTGTCGGTATAACGCGGGTCGATGACGATCATCCGCGCATTAGAGCGTTCGCGCGCCTGTTCGAGATAGTAGGTGATCCCGCCGCCGCTCATACGCGTTTCCGCCGGATTATTGCCGAACATCACCACCAGTTTGCTGTTTTCGATGTCGGAGGTG
CTGTTGCCATCGTTGCTGCCGTAGGTGTACGGCATCGCGCAGGAAATTTGCGCGGTACTGTAAGTGCCATAGTGGCTCAGGAAGCCGCCGTAACAGTTCATCAGGCGCGCCACCAGCGAGGCGTAAGGGGAAGAGCGGGTGATATTGCCGCCGACAATGCCGGAGGAGTAGTTAATGTACACCGCTTCGTTGCCGTATTTTTCGACCACGTTTTTCAGGCTGCGGGTCAGGGTATCCAGCGCTTCATCCCAGGTGATGCGCTCAAATTTACCTTCACCGCGTTTGCCCACGCGCTTCATCGGGTAGTTAAGGCGGTCAGGATGATTAATGCGACGGCGGATGGAGCGTCCGCGAAGACAGGCGCGGACCTGATGATTGCCGTAGATATCCTCCCCGGTATTATCCGTTTCCACCCAGTAAACTTCATCGTCACGGACATGCAGACGCAGCGCGCAGCGACTGCCGCAGTTCACCGAGCAGGCTCCCCAGACCACTTTATCTTCAGCAGGCTGTACGGCGTTTTGCACCGCAGCAGCGGCGCTTTTCAATCCAAACGGCAACGAGATGCCACCGGCGGCAAGCGCCAGAGAACCCATCGCGGCGGATTTAACGAGCGTTCGACGGCTTATACCGTCGTGATGTTCGACATCGGACATGACTCACCCCATCATTTTATTTGCGTATTATTTTCGCCCGTGATTACAACCGGGCTAATGATGGGGTGAGTGTTACTTATTTGGAGGTATTAAATATTAATCCTCATCAATTCAAGGGGGATTGGCGTCAGGAAAGGGGATAAAAATCACTGCGGTTCGCCTTGCGGACTCTCCTGGGTGCCAGACGCAGCCCCGCTGCCTGTGCGCGTGTACAAAATTTTGAAGGTGTCATTGGCACAATGCCCAACGACCTGCGCATCCGGCTGATCGGCCTGATCGTTGGGTACGATGTTCAGCGTAAAGCCAGATTCCGGTACGCCGTTGTTGATGATTTTCTGCTGAATGTCGCTTTTTACACGTTCGCAGGAATCCGGTGCCGCCAGCGCGGCGGTAGAGGCACTCATTAACAGCAGGGCGGTAATCCAGGGTAACCGTTTCATCTGTAGCTCCTTTTCTCTGTGTGTAGATTAATTTTAGCAGGGTTCGTGTAAACAACTGTATTTGCTAATATGATTGGGAATAATCTCCCTGAATCGGTAAATGATGTGAAGAAATACGCAGCGATAACGCTACTGGCCGCGACACTGGTGGGGTGCGACAACAATTCCGCGCCGCTGTCGTTCACGCCTGAGATGGCGAGTTTTTCGAATGAGTTTGATTTTGACCCGCTGCGCGGCCCGGTCAAAGACTTTACCCAGACGCTGTTTAACGACAAGGGCGAGGTCTCCAAACGGGTGACCGGGACGATGTCGACAGAAGGGTGCTTTGATACGCTTGAGCTACACGATCTTGAAGCGAATACCGGCGTCGCGCTGGTGCTGGATGCCAACTACTACATCGATGCAGAAACCCAGCAGCGAAAAGTGAAACTGCAGGGCAAATGCCAGCTGGCCGAGCTGCCGTCTGCCGGTGTGACGTGGGATACCGACGACAACGGGTTTGTTGTGGCGGCGCACGGTAAAGAGATGGAAGTGAAGTATCGCTATGACGCGGATGGCTACCCGCTCGGCAAAACCACCGTTTCCGGGGATCAGCACTTATCCGTACAGTCGACACCCTCGAAGGATCTGCGCAAAAGAATGGATTACTCGGCCGTGAGTTTACTGAATGACAAACCGCTGGGGAACGTGAAGCAGAGCTGCGATTACGACCGGCACAATAACCCGGTGAGCTGCGACCTGACGATCACCGACGACAGCGTGAAACCCGCGGTTGAGCGCAAGTACAGCATAAAAAACACCATTGAATACTACTGAGAGTAAAGCCGCGCAGGTTACTGCGCGGTAGGTTTCAGCAGGCTGGCGCCAGACGGTTTGTGTCCGGCCAGATGCTGGTGCTGGAAAA
TGCACATGCGAATGGTGTTGCGGTATTCCCCGTTGATAAAGAACTCGTGGATCAGCTCGCCTTCCACCATAAAGCCCAGCTTACGGTAGATATGGATCGCTTTTTCGTTCTCTTTATCGACGATGAGATAGAGCTTGTACAGATTCAGCACGTTGAATCCGTAGTCCATCGCCAGCTTCGCCGCCCGTGAGGCCAGGCCTTTACCCTGATGCTCCGGGGAGATAATGATCTGGAATTCAGCCCGGCGATGCACGTGGTTGATCTCCACCAGTTCAACCAGCCCGGCCTTTTCGCCTTCACACTCCACCACAAAACGCCGTTCGCTTTGATCGTGAATGTGTTTATCGTACAGATCGGACAGTTCGACAAACGCCTCGTACGGCTCTTCAAACCAGTAGCGCATCACGCTGGCGTTGTTGTCGAGCTGGTGGACGAAGCGCAAGTCTTCGCGCTCCAGCGGACGCAGTTTAACCTCAAGACCCGGCATTACGGTGCTACCGTACGACCCGTGCGACGATCCAGGCAACGCAGGGTGTTCGGCTCCCAGTAGGCATTAACGTTTGCGCTTTGCTGACACTTATCACGTGCGTCAAAGGCGACATCTTCTTTGTCCCACTCTTTCTCAGCACGCTTGTTCACCTTCTGACGAAGGCTGCGGGTGTCATTCCATTGTTCTTTGTCCATCGCGGCGTTCTGGCGGCTCTGCGCGCTATCGCCAGACTCAATAATGAGTTTGCTGGTTTCGGCGGTAGCCGTTGCAGCGAAGGCGAGCGTTGACAGCGCCAGTGCGGCGGTCAGACAAAGGCGTTTGCTTAATGTACTCATAGCGTTTCCTTTTTGAATCGGTGCAGATAATCACGTTACCCTGTTCAGATTCTACACCAAACAGAAATGACGGCATACCCGCGGCGCAGGTATGGAAAAGTAAACGGTATCATCAGGTATGATGGTTAAATAAATCCTCACCCGAAATGAAATATGATCAAAACAACGCTGCTATTTTTTGCCACCGCGCTCTGTGAAATCATCGGCTGCTTCCTTCCCTGGCTCTGGCTTAAGAAAGGGGCAACTGTACTCCTGCTGATCCCGGCGGGCGTTGCGCTGGCGCTGTTTGTCTGGCTGTTAACGCTGCATCCGGCGGCCAGCGGCCGCGTCTATGCCGCCTACGGTGGGGTTTATGTCTGTACCGCGTTGCTGTGGTTACGCGTGGTCGATGGGGTAAAACTCAGCGCCTACGACTGGGCCGGGGCCCTGATTGCCCTGTGCGGCATGCTGATTATCGTTGCTGGCTGGGGGCGCGCCTGAGCGCCCTGATTTTGTGATCGTCCGCGTATATTTTGATCGTTATACTTGTATGGTAGTAGCTCAGTTGCGTAAATTTCCCCCATCACAACATGCGATGGAAGGAAAGGAATTATGAAGATTGTCGGGGCTGAAGTATTTGTCACCTGCCCGGGGCGTAACTTTGTCACCCTTAAAATTACGACCGATGAAGGCATCGTCGGTCTTGGCGATGCCACGCTGAACGGACGCGAACTCTCCGTCGCCTCTTACCTGAAAGACCACCTCTGCCCGCAGCTGATTGGCCGCGATGCGCACCGTATCGAGGATATCTGGCAGTTCTTCTATAAAGGCGCCTACTGGCGTCGCGGTCCGGTCACCATGTCGGCAATCTCCGCCGTGGATATGGCGTTGTGGGATATCAAAGCCAAAGCCGCAAACATGCCGCTTTACCAGCTGCTGGGCGGTGCCTCCCGCGAAGGCGTGATGGTCTATTGCCACACCACCGGGCACACCATTGACGACGTGCTGGAAGATTACGCCCGTCATAAAGAGATGGGCTTCAAGGCGATCCGCGTCCAGTGCGGCGTGCCGGGAATGAAAACCACCTACGGCATGTCGAAGGGGAAAGGGCTGGCCTATGAGCCTGCTACCAAAGGGGACTGGCCGGAAGAGCAGCTGTGGTCCACCGAGAAATACCTCGACTTCACGCCGAAGCT
GTTCGACGCGGTGCGCAGTCAATTTGGCTTCAACGAACATCTGCTGCATGACATGCACCACCGCCTGACGCCGATTGAAGCGGCGCGTTTCGGTAAGAGCATCGAAGAATTCCGCATGTTCTGGATGGAAGATCCTACGCCTGCTGAAAACCAGGCGTGCTTCCGCCTGATCCGCCAGCACACCGTCACGCCGATTGCCGTGGGTGAAGTCTTTAACAGCATCTGGGATTGCAAGCAGCTGATTGAAGAGCAGCTTATCGACTACATTCGCGCCACCATTACCCATGCGGGCGGGATCACCGGCATGCGTCGTATTGCAGACTTTGCCTCGCTCTACCAGGTGCGGACCGGGTCACACGGCCCGTCCGATCTATCGCCGATTTGCCACGCTGCCGCGCTGCACTTCGACCTGTGGGTGCCAAATTTCGGCGTGCAGGAATATATGGGCTATTCCGAACAGATGCTTGAAGTCTTCCCGCACAACTGGCGCTTCGACAACGGCTATATGCACCCGGGCGACAAGCCGGGTCTGGGGATTGAGTTCGACGAAAAGCTGGCCGCGAAATACCCGTACGATCCGGCCTACCTGCCTGTGGCTCGTCTGGAAGACGGCACGCTGTGGAACTGGTAA
Protein sequences of DBSCAN-SWA_3 >NC_018079|2203155:2211710|2207881_2208175_-|WP_014831729.1|DBSCAN-SWA MKRLPWITALLLMSASTAALAAPDSCERVKSDIQQKIINNGVPESGFTLNIVPNDQADQPDAQVVGHCANDTFKILYTRTGSGAASGTQESPQGEPQ >NC_018079|2203155:2211710|2210495_2211710_+|WP_014831732.1|DBSCAN-SWA MKIVGAEVFVTCPGRNFVTLKITTDEGIVGLGDATLNGRELSVASYLKDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHTIDDVLEDYARHKEMGFKAIRVQCGVPGMKTTYGMSKGKGLAYEPATKGDWPEEQLWSTEKYLDFTPKLFDAVRSQFGFNEHLLHDMHHRLTPIEAARFGKSIEEFRMFWMEDPTPAENQACFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRATITHAGGITGMRRIADFASLYQVRTGSHGPSDLSPICHAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWRFDNGYMHPGDKPGLGIEFDEKLAAKYPYDPAYLPVARLEDGTLWNW >NC_018079|2203155:2211710|2203812_2204667_-|WP_014831727.1|DBSCAN-SWA MGSGWHEWPLVIFTIFGQCVAGALLVSGLAWMRESDEAVRTRIVRSMFFLWLVMGVGFIASVMHLGSPLRAFNSLNRVGASALSNEIAAGSVFFAVGGFWWLVSVIGKMPPALGKVWMVVSQILGIVFVWAMTRVYQIDTVPTWYTGYTTLAFFLTMVLGGPLFAALLLRVAGTAIKGTFAASVSVLALLLSVAVIVLQSNELATLHSSVQQASALLPDYGAWQVGRIILVAAGLGCWLCPLIRRQEPKALALFAGLVLVVVGELIGRGLFYGLHMTVGMAIAG >NC_018079|2203155:2211710|2204668_2205286_-|WP_014169373.1|DBSCAN-SWA MTTQYGFFIDSSRCTGCKTCELACKDYKDLTPDVSFRRIYEYAGGDWQEDNGVWHQNVFAYYLSIACNHCEDPACTKVCPSGAMHKRDDGFVVVDEDVCIGCRYCHMACPYGAPQYNAAKGHMTKCDGCHTRVADGKKPICVESCPLRALDFGPIDELRKKHGQLAAVAPLPSAHFTKPSIVIKPNANSRPTGDTTGYLANPKEV >NC_018079|2203155:2211710|2210057_2210384_+|WP_014831731.1|DBSCAN-SWA MIKTTLLFFATALCEIIGCFLPWLWLKKGATVLLLIPAGVALALFVWLLTLHPAASGRVYAAYGGVYVCTALLWLRVVDGVKLSAYDWAGALIALCGMLIIVAGWGRA >NC_018079|2203155:2211710|2209565_2209904_-|WP_013096793.1|DBSCAN-SWA MSTLSKRLCLTAALALSTLAFAATATAETSKLIIESGDSAQSRQNAAMDKEQWNDTRSLRQKVNKRAEKEWDKEDVAFDARDKCQQSANVNAYWEPNTLRCLDRRTGRTVAP >NC_018079|2203155:2211710|2205296_2207735_-|WP_014831728.1|DBSCAN-SWA 
MSDVEHHDGISRRTLVKSAAMGSLALAAGGISLPFGLKSAAAAVQNAVQPAEDKVVWGACSVNCGSRCALRLHVRDDEVYWVETDNTGEDIYGNHQVRACLRGRSIRRRINHPDRLNYPMKRVGKRGEGKFERITWDEALDTLTRSLKNVVEKYGNEAVYINYSSGIVGGNITRSSPYASLVARLMNCYGGFLSHYGTYSTAQISCAMPYTYGSNDGNSTSDIENSKLVVMFGNNPAETRMSGGGITYYLEQARERSNARMIVIDPRYTDTAAGREDEWIPIRPGTDAALVAGIAWVLINENLVDQPFLDKYCVGYDEKTLPEGAPANGHYKAYILGQGDDNTAKTPEWASRITGIPADRIIKLAREIGSTKPAYICQGWGPQRQANGELTSRAIAMLPILTGNVGINGGNSGARESTYTITIERMPLPENPVKTQISCFSWTDAIARGPEMTALRDGVRGKDKLDVPIKFIWNYAGNTIINQHSDINKTHDILQDESQCETIVVIDNFMTSSAKYADILLPDLMTVEQEDIIPNDYAGNMGYLIFLQPVTAPKFERKPIYWIMSEVAKRLGPDIYQKFTEGRTQAQWLQYLYAKMREKDPQLPSYDELKKMGIYKRKDPNGHFVAYKKFRDDPDAHPLKTPSGKIEIYSSKLAEIAATWELEKDETISPLPVYASTFDGWDAPERKTYPLQLFGFHFKARTHSSYGNVDVLKAACRQEVWLNPVDAAQRGIQNGDMVRVFNDRGEVRIAAKVTPRIMPGVSAMGQGAWHDANMNGDRIDHGSCINTLTTHRPSPLAKGNPQHTNLVQIEKA >NC_018079|2203155:2211710|2203155_2203770_-|WP_014831726.1|DBSCAN-SWA MKDVSQREAFAFSARVLGALFYYSPDSEHVAPLVNALTAGEWVADWPLPQAVLQPIADTFTVPADEPVTDAWQRLFIGPYALPAPPWGSVWLDRESVLFGDSTLALRQWMRENQIAFEMQQNEPEDHFGTLLLLAAWLTENGRETDRDQLLAWHLLPWSTRFLSVFVENAGHPFYIALGKLAQLTLADWQSTLLIPVAEKPLYR >NC_018079|2203155:2211710|2209008_2209566_-|WP_013096794.1|DBSCAN-SWA MPGLEVKLRPLEREDLRFVHQLDNNASVMRYWFEEPYEAFVELSDLYDKHIHDQSERRFVVECEGEKAGLVELVEINHVHRRAEFQIIISPEHQGKGLASRAAKLAMDYGFNVLNLYKLYLIVDKENEKAIHIYRKLGFMVEGELIHEFFINGEYRNTIRMCIFQHQHLAGHKPSGASLLKPTAQ >NC_018079|2203155:2211710|2208280_2208991_+|WP_014831730.1|DBSCAN-SWA MKKYAAITLLAATLVGCDNNSAPLSFTPEMASFSNEFDFDPLRGPVKDFTQTLFNDKGEVSKRVTGTMSTEGCFDTLELHDLEANTGVALVLDANYYIDAETQQRKVKLQGKCQLAELPSAGVTWDTDDNGFVVAAHGKEMEVKYRYDADGYPLGKTTVSGDQHLSVQSTPSKDLRKRMDYSAVSLLNDKPLGNVKQSCDYDRHNNPVSCDLTITDDSVKPAVERKYSIKNTIEYY |
10 | Escherichia_phage(66.67%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2693370 : 2717270
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_018079|2693370:2717270|DBSCAN-SWA TATGTGCGGACGTTTTGCACAAGCCCAAACCCGTGAAGAATATCTGGCTTACCTGGCCGATGAAGCCGATCGTGACATCGCATACGACCCGGAGCCGATTGGACGCTACAACGTTGCGCCCGGTACCAAAGTTCTGCTGCTAAGCGAACGCGACGAGCAGCTGCATCTCGATCCAGTGTTCTGGGGCTATGCGCCCGGGTGGTGGGATAAAGCACCATTGATAAACGCGCGCGTCGAGACGGCTGCCACCAGCAGGATGTTCAAACCTCTCTGGCAGCATGGCCGGGCAATCTGCTTTGCTGATGGATGGTTCGAGTGGAAGAATGAAGGCAACAAGAAGCAGCCATATTTCATTCACCGGGCAGACGGTCAGCCGATATTTATGGCCGCGATCGGCAGTACGCCATTTGAACGCGGCGATGAAGCGGAAGGATTTCTGATAGTGACGTCTGCAGCTGACAAAGGACTGGTCGACATTCACGACCGTCGGCCATTGGTTCTGTCGCCTGAAGCGGCCCGGGAGTGGATGCGTCAGGACGTAGGCGGAAAGGAAGCTGAGGAGATAATTGCCGACGGAACAGTGCCCGCCGACAAGTTTATCTGGCACGCCGTAACGCCCGCCGTGGGTAATGTCAAGAATCAAGGGCCGGAAATGATTGCAAAAACAATTGATAATTATGAATAATCTGCATAGCCTCGACGACCATATGAAGCAATGATAGAATGCCGCTTATCAAACATTATCATTTTTCTCGTTTATTTTTACCCTAAGGTTAGATGATGGATTTATTATTCTCGCTTTGCCTTGTTTTGACATGTTTAGTGCTTGCAATTGGGATGTTTAGCTTGCCATTTTTTAGTTTTATAGATGAAGAGACAGATGCAGCAAGAGCAAATAGTTTAGATGGCATGAGATTCATCCTCGCTTCTTTTGTTATATTTCATCACATTGACTGCGCCTATACCTACATAACAAAAGGCAAATGGATGCCTACTTCTGATTGGTTGTTATATCTGGGCAAATATGGTGTTGCTTTATTTTTTATGATCACGGCTTTTTTGTTTTGGGGAAAAGTAAGAACATCCAATCAAATTGATTGGGTTGAGCTATATAAAAAAAGATTTTACAGGATTGCCCCTTTATCATTTTTCTGTTCAGCTATTGCGTTGGCGAGTCTTTTTTTATTAACACAAAGAAAGGACTTTTCTCATAGCATCCTTGCTGCCTCACTATCATGGTTTGATGCTGGACTGTGGAACTCCAAGCCTGCGGTTACTGATTTTACACCTCCGTTTATGGCGTTAGCCGGAGTGACCTGGACACTTCGCTGGGAGTGGATCTTTTACTTTACGCTCCCACTGTTTTTCATGCTCAAGAAATGGTCATTTGAGTTAAGTGTTTCTGTGTTTGCTTTTTCCGTATATTTCCTTCCAGAATTTACGAAAGATGCATACTTATGGTCATATTTTTTTGCCGGAATGTTATGCAGCGTTCTAAAAGATAAAATTAACTTAACTTCAAAACATGCCAATATCATATTAGTACTTATGATATTAATCACTTTGCTAGTCCAACCAACACTGTACGCCCCTCCTGAAAAGATATTTCTTAGCGTCATTTTCTTCTCGGTAATATCAGGAGCAAATCTGTATGGCATATTAATTAGCAAAGCAGCAATTCGCCTTGGAGCAATAAGCTATAGTTTATATTTAACGCAAGGGCTAATTCTTTTCCCAATGGTTATACATTTTAAAAACCAAGGCGATTTAGAGTTAAATATAAATACATTTATTATATTCGCAGCTTCTTACATCTTAATTTGTATTCTTTCCTCATTAACCTTTCATTTTATTGAAAGACCATTCATGAAAAGATTTAAACCGAAGAGCATTTCAGAACAAGCATACAATAAATAATAATACCC
GGCGACCAGTCGCCGGTATTTTCTGTCTTCTGTTACCTAACAATCAGCAGATCTGAATATCTCGTCGTATATCGAGGAGAAAGCATTTCTCGCTTCATCTGCCATTGCTGCTGAATGCCCTGTCCGGCAAAGTAGAGCGTTCCTTTTCCATCTTTAGCGTTAAGGTGATCGAGCACTTCCATTAACTTGCCACTTCCAGCACGCGGCGCGTTCTCATCAAATAGGTTTAGCTGGGCCACACCCTGGCTGAAGAAGTCACCAAGCATGATTCCGGCTTTCTGGTACCGGTGACCGTCACTCCAGATTTTATCCAGGCACTTTACTGCGGCGTTGATGATGTCGCGGGAATCCTGTGTAGGCGTGAGAAGCTTCATTGACGCACTGTTGCCGTAATATGGCTCATTTAGGGCAAATGGTGATGTCTTCACAAACGTAGAGATAAAACGGCAATACTGGTGCTCACTACGCAGTTTTTCAGCACCACGGGCAGCATAACTGCAGATAGCCTGACGCATCTGCTCGTATTCTGTAACGCGTTCGCCAAAAGACCGACTGCAGACGATTTCCTGCTTTGCTGGTGCAAACTCCTCCAGCTCAAGACAAGGCTCGCCGCGCAGCTCTCTGACCGTACGCTCGAGTACCACGTTAAAGTGTTTACGGATAATCCCTGTGCTTTGTTCAGAGAGGTCCAGAGCCGTTTTGATACCCATAGCGTTCAGCTTCTTACTGATACGCCTGCCGACGCCCCATACGTCCTCCACCGGAACCAATGCAAGCAGTCGGCGCTGGCGATCGATATTGGACAAATCAACCACCCCGCCAGTCTGCCGCTGCCATTTCTTCGCAGCGTGGTTTGCGAGCTTGGCCAGTGTCTTTGTCTGGGCAATTCCGACTCCAACTGTCAGGTGAGTGCGTTTCAGGACCGTAGCGCGTATTTCTTTGCCAAAGTCAGTCAGGTCCCGGCAGCTCCTAACGCCCGTTAGGTCACAAAAAGCTTCATCAATACTGTAAATTTCAACGCGAGGGCTCATTTCCTCTAACGTGGACATCACACGGTTCGACATATCAGCATAAAGCTCATAGTTGCTGCTGAAGCAAACAACGCCAGCACGCCGGAATAGCTCTTTTTGTTTGAAGAAAGGCTCACCCATGGTAATTCCGGCTGCTTTGGCCTCGGCGCTGCGCGCGATTACGCAGCCATCATTATTCGACAGAACAACCACCGGCCGCCCCTTCAGGTCCGGCCTGAATACAGTCTCGCATGATGCGTAGAACGAATTCACATCACAGAGCGCGAACATGTTCAGCTCGCCGATTTAACAATAAAAGTCACGACGCCGAATACATCGAGCGAGTCTTCACTCCCGACAATAATCGGCGAGTATGCGCTGTTCATCGGTATGAGTTGGACGGTTGGGCGTAGCTGCAGACGTTTAACAGTGAACTCCCCTTCTACCGCGGCGATCACGATATCTCCATGCTCAGCTGTTAGCGAGCTATCAACCACCAGCAGATCGCCGTCACTGATACCGGCTTCGATCATAGAATCACCTGCGGCTTTGACGAAATATGTCGAACTGGGGTGAGCGACAAGTAACTCATTGAGATCGATGCGCTGTTCAACATAATCAGCTGCAGGGCTGGGGAAACCACACTGAACTAAATCACTGAAAAGCGGGAGAGCGATAATTTCTCGCAGTTCTGTAGACCTGATAAATTCCATATTGCACACCTCAAACACTGTTTTTATATACAGTAGTTTTATTTGTAAGTGTCCGCAAGATCCGGGCTCAACTGTCACTGCTTAAAGCTTCGCCGTTTCGTTTCTAAGTTTCTATGTCGCTTCGAATTATGAGTTTTGTAAATTTTATGCCCGTAACCCTTTGTGCGCAGATTTAAGCCGCTTTTGAAACCGGGAATTTTTTTATACAGGGTGCATACAACTACATCGTAGATGATTGCCACCTGCTTCCTGCCCACGCCGTTCGCAATCAGTCTGC
CCGCCTGCTCCCATTGTTCCTGGGTGAGTTTCGGACGCCTGCCGCCGATTCGCCCTTTACCAGAGCTGCAGCTAACCCAGCGCTAGTGCGCTCCACTATCAACTCCATCTCCATCTTGGCCAGGCAGACATGATGTGGAAAATGAAACGCCCTATTGGGCTGGAAGTGTCAACCTGACAATCCGCCTGTTTAAGCGCAAATACATGCTGAGTGATGATGGAGAGATCATCAAAACGAAAGGAGAACTGATGGATGTCCCGGCGAACAGCTGGATCGATGTTCGCCTGGATATGCCATCTGATTCTCTGTTTAACTAGCGGATGAATCAGGAATTTCAGCCAGAGCCTTGCGCTGATTCCAGATACTGTCGATACTGTCCTGCGGCATCTGAACACGTACAGAGACAAATTGGTCAGAAGGGATGTCTATCGGGTCACCATCGCTAATATCCTGCAGTTCATTCCTTGCGAACTCTGGTGCGGCAGGATGTGTGCGGTGATAGGTTTTCACTAACACCGAACCGTCAGCATGAACCTCATAATCCAGCCAGATCAGAGGCTGCTTGTTGCGGTCGGTGGGAATATCAAAACCGCCATCAATGCCACCCCACGTAGCTTCAGAGTTAAGACCCTCGCACCCTTCAATCAGATATTCGCCTGTAGAAAGACGGGTTACAGAGCAACCTTCTGATTCGTCGTTTGTTTCGAAAGAACCGTCAGAAAATAATTTCACCACAGGGGAGGCGGCTTTAAGGGTTCCATCAGATGCTACGGTGGTATTGGTTGATGTATAAAACTTTTGCCACGAACCAGCAGTACCTCCAGCCATTCCACGGAACCAAATAGTAGGACTCGTTGTGTCCAGTGCTGCTGCGATAACACAGGCATAGGAGCTGCCAGCATAAGCCAGAACCACTCCCCCCATACGTTGAGCAACTGGATTACTTCCATTCCCCTGCGTTGCACCGAAGAACTGAGTTCTATCCAAGGGGATGCCTGTTGAAAATTGGTTAGGAATAGACTGGCCGAGACCAAATGCGCCCTGCATCAAAGCGACGCCTGCTGTCGTATCTGTAATAGAGGACTGAAGATCGACTGTAGCCGCACTTCTCAAACCGAGGTTTGAGCGAGCGCTTTCTGCAGTCGTTGCCCCGGTACCGCCGTCAGCGATTGCAAGCGCACCATTGCTCCCTTTCTGCGCCAGTTTGCCGATGCCGGGGATCGTAACAGTGGTGCCGTTGATGGTTACGGTGATGCTCTGATTGGCTGAGGTGGTGGCGAACGTCTCCCACGCACCAATATTCTCGTCATACTCTTTGATAAGCTGCGACATGGCCTGTGCCAGGCCGTCGACCGAGATATTGTCCGACACAAGGATTCCATACTTCTGACCGCTCAGCGCCGGGGAAGCAGCTGGCGTAACCGTCATTGACGTGGCGCTGTTCACGGCTGAAATCTGGAACATCTGTACCGGGTTTGACATGACGATAATCGTCTGGCCAGCGCGGACCTGGCTGGCGGGTGCCGTCCAGTTCGTGCCGGTCCCGGTTGCGGTGTTTCCGTTAATGGCGATGGTGCCGGTGTTATAAAGCATGAACTACCTCACGATAATAACGATCGTTTGAAACGATCAATAATGTAAAATTGATCGTTTATAGCAATCTGACTATTTTTTTAACTTAAATAAAATGGATATTCCCGCTCTTACAGGAATGTAGAAATGAAACGATTATTTGCTGCAGCTCTTTTGCTGCTGGCTGGCTGTGCCGATAAACACACAGATTATACATTTAAAATGGATTATCCTGTGGAGGCTGCGCGTCTTTCTCTCGGTGGCGATATTCATGTAAGTATCGACTGTGCCACGAGGGAAATGAAGGTTATTTCAGACAGCAGCAATGGAATATTCAGCCGCCATGTTAATAAACGTCTGAGTAATATTTGCTATAAAAAAACCGATAAACTCGATGTCATTTATCGCTTCAATGCTGCGAAGGGAGT
CAGGCAAGATATGATTGCCACGCAATATCCACGTGTCCCGCCTGTTTCAAATACCGACAAACTGAGCGACAGGGATTCTTAATCCTCGCCCCTGCATCGTCTGGCTCCAGCTGCGCTGATTTTTAGAAATATACCTCCCCTGCAGCTGTGAGCCCGTCCACTTCAGCACTATCCCTGAATAACCGACCACCTCCCCGTCATCGCTGAGATTTCCAGGGCAGTTGTTTACCAGAATCCACGGGTTAAAACTGAGGCTCACCGATAGTGTGTTGTTCTGCAGGTCATAGTTCGTCGGCACATCAAAGAACCCGACAACGCGGGGCATTTTTGAGGCTGAAGCCGCGCTCCAGATGAGATATCCGGCACTGTCGAAGACATCCAGATACCCGCTTTGTATTCCAATGTTACGCGCAGTGCGGATCATGCTCCCCGCATTATCTTCAAGCAGGTCTGCACCAGGCATCCCGTATCTGTTTGCGCCAAGCTGCAGCCACCTTAATCTCCCGTCATTCCAGAATGACTGCGGAGTGAATCCCAGGGTACTGCCATTCCCAAATGGGCTGTCTATCTGGTAAAAACCTTTGTTGTTCACTGCGCCGAGCATACGCTGATCATAAAAAAGGGTGGACCTGTTTTGCGAGTCCACCAGTAATTTTCCATCGCTGTTGTAAACTTCGAATCCGCTCATTGAAAGTTATACACCTCCACATTGAGAGTGATCGCAGGACTACCGGTAGACGGCAAATAGTAAGCAGTAAAGCCGCCATTATATGCGCGGCAGTGATATTCATTGACCGTGACACCAGTCGATACAATTGAAATAAATGAACCATCCTGGGTTATCCCGGCGAAGGAAACATTTTTCGACGTTTCCCCCGCAGCGAATGTTACCGAGGTGCTTCCGATATATCGGATGGCATAATCACTTAAATCAACCGCTATCCGCCCTGCACTGTCCCAGCATTGCAACCCCTGTGGCATTACCATAACCCCATTCTGACACGCAGCACATTATTGCTGTCATAGATACGAATGAGAGTGCTGGTTATCAGCATCCTCCCGCCCCCGGCCACGCCGTTAATTTCGAACGTCCCTCCCTTATCAAGCTTCCAGCCTGCAGAACCAGCCACATAGTTATTCGACTGGATATAGTTGCCGATTTTGGCGTTCTCAATGGTGCCGTCCTGGATGAAGCTGGCCCGGATGAATGTCTGCCCGTTCTGGATCACGAACGGCAAAGCCACGCTATTTCCGGCTGCCGTGGTGACGGCGAAGCGGTCAGCCAGGAAGATAACCTGCGACTGCATGCCGGATGGCGTATTCTCCACGCCGATACCCATCCCCGCGGCGTAATACTGCCCGTTGCTGGAGACACCAACCTTGATGTTGTACATCGCGCTGAGGTCGCCATTAACGTTGGCTATCGCCTGAGCGTTAGTGGTGATGGCGGAGGTATGCCCGTTCACGGTCGCCGTAATGCCGTTTATCTGCGTGGCCGTAGCCTGCTGATAGTCGGAGAACGTCTGATTCAGGCTGTTGATGGATGCCTTGTTGCCGTTAACGTCCGTCTGCAGGCTCAGCAGCGAGCGCGCCGTTGCCTCCCTGTCGTTGACAATCACTTCATCAATGCGGTCCAGATTCGCGCTGTTGCCGGCGACCGATGCAGAAAGGGTTTTACGCGTGGCCACCCGAGCGAGGTTGGCCTGGATTATCGCAATTGCAGAGTTCTTCACCCCGCCCGTCATGCCGTCCATAGACACGCTGATGCTGTCGATTCGCTGGCCCAGCGCGGTATCAGCCGTCGCCACGGTCTGCTCAAGCTCTAAAAGAGAGGACGACACATCACCGACTGTGCTCGAAAGCTCATTAACGCTGGTCTGAACCTGCCCGATTTCCTGCGCGTTTTTGGCGATTTCCTGCGCCTGCAGCTCAAGTTCATCGTTGGCCTGTTTGATGTCGTCAGCCATACCAGCAATTTTTTCGTTGGTATCAACGGCATT
CTCGATCAGGTCCTTGAAGGTATCCGAGTCTTTAATTTCCTCCAGGATCACATCGGTGATGTCGGACACGTCGATGCTGGCCTGACCGCGCACCCATTCTGTGTAACCTGATTCGTTGCCGCTGCGGTCCACCAGTTGCGCCCGATACCAGAAAATCTGCCCAGCCTTAAGGCCCATCTGCTGATATTTGCGCTGTGGGTAAGGTACATCGGCCAGCAGCATCGCATCGTCTTCGGTACCGGTCAGGCTGTACTGAATTTCCGTCTTCAGCGTGTCGTCGGTATTCGCCGGGAATCCCCAGTTCAGCTCGATACCGAACACCACATTTTCAGAAGCGATGAAGCCAACCGGCTTCGGTGGATTGCCCACTTTACCGGTCAGCGTTTTCTCTTCTGAATAGCCCCATCCGGACGAGATTTCTGCGGCATTGATTGCGCGTACGCGCACCAGGTAGCGCCCGGCATAAATCCCGGGGACGTCGAATGACGTGGTGGAGCTGCGCGGCACGTTAACCCAGTTCCCGTCGTTTCGGCGCCATTGCGCTTCATAGGCGATAGCGTTCAGCGCCTGGTCCCAGCTCACGCGCATCGTTTCGACGCTGATATTTTGCTGCACCACAGAAAACGAGCTGATCAAGATGTTCGCAGGCGGCGACTGGTTGCCCGGCGGGATCACGCTCACCGGCCGCTGGTCAATGATTGCTCCGGTATCAATGCGATCGAATTTATCCGGATCGTGATTTGCACCGACGATTCTGAACGTGCCGTCGTTATTATCAGTTACCGTAATAACGCGATACTGCTGTGCGTAGAGCTCATCAGACTCAATGACCCATACGGCCTCAGCCACAGGCGTTTCACTGTAAGCGGTCGTAACGGTCACTTTATTGCCCGTTATCGACTGAATGGTGCGTGACTGTGAAACACCCGATGGAAGATTGACAATCATCCTGTCGGCTGCCGAAGCATCCGGCGCCCTGTCCAGCGTCAGCACGCGACCATTCACCGCAGAGATACGGCCGCCCAGGTCGCGCCCGGAGAGATTTCTGTCCGCTACAGCGATTACATAACCAGGCTGCGGAATGTTGCCATCTTCCCCTACATTGAAAGTAACAACGCGATCTTTGTTGTTGGTGAGGATCCCCCATCGCCCTTTGCGATTCGCTTCTGACTGACGGGTACAGCCGATCGCAGTTATCTCAAGTTGATTAAACCCATAACGCGCAACCAGCGCCTGCTCAAAAACAGGCTCCATCGCATCAGAATAAGCGTTATCAGGATCAGACCAGGACACCAGCGCATTGGTGTAACGGTTCTTTGTGGTGCTGCTGGAATAGGTAAAGCGCCCATCAATAACGTTCGCATGCGTGTATGTAAAATCTACATCTCTCGGCATGTCCGCCAGCGCCACAATCTGGTCGTCGCCCCAGTAGGTCATCCCACGGAATATGGCAGCAAAATCACGCAGGACCGTATAAGCGTCGTTGCGTTCCTGAATGTAGACGTTGCAGGTATAACGTGGTTCGGTACCACTTCCGCCTTTGCCATCCGGTACCATTTGATCGCAATACTGCGCAACCTGGTAGAGCGTCCATTTATCTATGTTGGCCGTTGTAAGACGATCCCCAAGTCCGAAACGGTCGCTAACCACCAGGTCGTAGAAAATCCATGCAGGGTTATCGGTCCAGGCCCATTTAAATGTCCCAGCCCACGTACCGCTATAAGTGCGGGTTTCGGGGTCGTAAGTATCCGGTACGCGGATAACGCGGCCGCGGGGCTCGCAGGCGATCTGCGGGATAGAGCCGTTAAACTGGCTGGAATCGAATTCGATATAAAGCAGCGCTGTGTTTGGATAGCGTAATTTGGCGTCAATTACCTCGGTGAAGCTCTGCAGCATCATCGTGTCGCCGATTTTCGCGCTGTTGGCATCAGACGTAATCTTACGGAGTCGGATTGTCCAGGTGCTGCCAGCCTGAGGTAAATCAATACGGTGGCTGCGCT
CGTAACCTGACGTCGTTTTGCCTGTCACGCTGGTATTGAGTACTGTCTGCCATGTGCCGCCGTCCGTCTGCAGGTCAATCGCATAATTAACCGAGTAACCGACCAGATCGCCGTCGTCCTCCTGCTTGAAAAGCGAAGGCCATTTCAGGCGCAGGCGAACAGCTGAAAGCTGCGTATTGGTAAACGTGCGCGTCCATGCTGTAGCGCTCGATACCTCAGTTCCCACATTGATTTCGTTTTCGGTACCGGGTATGCCCTGAATATATTTTTGCGCCTGAGTTCCCGCGCGAAATTCCCACGTAACGCCGCTGAAGTTTTGAGAGCCGTCAGCATTCTCCAGAGCCGTTCCGTCAAGGTAGATATCTTTCGCCGTCAGCTGCCCTGCAAATTCCCCCTCTCCTAGTGCAACGAGGATTTTTGCCTTTGCTACAGATTGCAGATCATCAGGCTGTTCGGTAGGGGTTCGGGAACTTGAGCTGCCGCCCTTGCGGCCTTTAATAGCGGTTGCTAAAGCCATATTGCGCCCATAAAAAAAGCCACCCAGAGGTGGCTTATTGTAATGTGTTTAATCTAGTAGAATTCAACTTTCGTATTCGCTGCCATTAATACTGGCTTCGGCACAGGATTCCCTGCGCTCTGCATTGTTACTTTACATATTTCAGGAACAAGCTTTTCGCTATAATGACAAGTATTCGTTGCTGTATAACCTTCCGAGTAAACTGATGTGTTTATAATTTTATAATCAAGAGGCTTCCTGTCTTTATCAACATAAGATATTTCGTTTTTAGATAGGATCTTTCCAGAACCATAAAACTCAGAACGTATCAAATTAGAATCATCATCATAAAAATGCTCAGATATTTTCTTACCCAAAAAATAGGTGTCTTTAATTAAACCATTCGAGTAAAGATCATATCGCAACTCGTCACCATTTTCATTTTTACTCAAAATATTACATTTTTCATCGATCTGTATAGAGAAAGGCTTACCATCTCTCTGACCAACAAGACTTCCGTTGCTATTTTTTAAATTGGTTTTATGACCAGAAGAAACGTTATCAAGATCTAAGCTTTCAACACACCCATTCTTATCCAGTCTGATGGCAATTTTATAAGTGACTTTCCCATTTTCTTCAATATCAGTATCTAAGGATTTGACAGCTCCTTTAACTGGATTGAAATCAAATATAGTAGATAAATTATAGAGGAGAGGTATGTAATGGTTCTCAGCCAAAGCCATGCCTGAAAATAAGGAAGTACAAAGGAAAAGAATAGATGTTTTTTTCATTTAGAATCTAAGCTCAAGTCACCCCATATTAAGTATGCCCATTACAGCATGCTTACTGCTGATCTTCAACATAGATAGCGGAGGATGATCGCCTCATCAGATGCTTCTGCGTAGTCAACATAAAGCCCCTCTCGCGCAACCTCGAGCACGTTCTCAGCCACCAGCTGTGACTGCTGATAGATGCTGGTACCGGCCCCGTCAGCATTGTCCAGCAGGTATTTCAGCTTTTCAGGACCATTAAACGTGGGGTCCTTGCGATACGCCATCCCCAGCATGCCGATCTTCGTATTACCGGCAATCGCATAGAACACCGCGCGGCTCAGATAGTCCTCATTGCGCTTGCGATTGCGTGTGGATTTATCGGTTGGGTCGAGATAAGGCAGATACTTATTACCCGCCGCTTTTACGGCCTCAGCTCCTTTGCAGAAGTCCCTGTATTTCCTCCAGGCAGCAGAAGCCGCCCGGTGTTCTGGTCGAACCCAGGTGATATCGTCGTTTGCCATATCAGAAAGTAGTGTCCATGGTGATTGAGTATGCCGGTTTCACGATCGGGTAATCCTTCACGATGAAGTACCCACCAGCATCATTGGGGTGATCGTTATCTGCTGATTTGTCCGGTTCGCCATTGGCCGCCCAGATTTGCTGTTCGAGGCTTTCGGTATAAACCGGGCAGTTCTGGACGTTCACCAGATAGCGGCGTTCGCCGTTGGCGT
TACAGAACATGGCGTTCATCGAGTTGATACGGTCCTTAACCGGCGGGTTGGCATCATCAACAATGACGCTGAATCCGGCTTCGTTGAGCTGAGCAATATCTGTCTTGCTGGCGTTCTGGGACTTGCGGGAGTCGCCAGAGGCATCCGGATAGATGTAAATCTCCCGGCTCTTCACGTAGCGACCATCCTCGTAGCGCCAGAACTCTTCCTGAATGCGCTTAATCATCGCCGGCGTGTCGTAGACCTTCACCAGTTCACGAACAGCACGCGGCAGGCCGTTACGCTTTACGTGAACAATCGCGGCCATTTTCCCCACGTTGAAGTCCATGCCGATAAACAGCTGATCCCCGTCCTGAATCTCGTCAGAACAGTTATTTAGCTTACGGTTGAACGTGTGGTAAATGGTCCCGCTGTTAAGGTTGGTGAACTTCCCCCGCAGATATGCCTGAATCAGTTCCTCAGGGTAAGAACTTAGCAGCGATGGTATGTAATCAGGCGGCAGATTCTTCGCATTGTCGAACGTGCTGGCCTGAATCAGACCGTATAGAGCCGCAAGCTCTGGCTTCTCACGTACCGCCTTCACGAACTGCTGGTAGACGAATTTGAACCCTTCCGGCGTGGTCGTGACGTCAATACCGTTTCGCAACCCATCAACCTTGTAACGCATACGAGCGATGATTTTTCGCCAGGCCTGCTGTGCTTTGGCAGCCGCCATGACGTCCAGCTCATCCACCATCGCGTTACCGATTTTGAAACCTACTATCGAGCCGGGCTTCTCCATCGAGCGGCATATTGTCGTCCCGCGGTACCGTCGCCCCTCGTAGAAGTGAACCTCTTTGTTCCCCTCGTTGATTTTGACGCTCAGTCCCCAGTCAAAGGCCACCTCCTCAATCGTTGGGTAGAATATGTCACGAATTTGCGGGTACGTTGGCGCGAAGTAGCCTTGGTTAATCTTCGGGTGTTCCCACATGCCCTTACAGATGCCGCCACAACCCACCCACGTTTTACCCGAACCGAACCCGGCAACATAGGCTTTGAATTTGTGCTGCATCGCGAGGAAGCGCGCCTGAGGAGTGTTAAGTGTCGGGCTTATCCCCATCGTCTGCCCTCGCATCCACTACGTTGATATTGATTGCAACTGGCGTTGGTTCATCGTCCTCACCCTCACCGGCCAGCTCTTTACGGAGTTTCTCAACCTCAAGCTGCCGTCGTTCGATTTCAATCTGCTGTAGGCGCTGCGCGAACTCGCTATCGGCCAGTCCGAGTCGTTTCATTACCGCTTCGTACATGCGCTCACGGCTGATGGCTGTGATTTCGACGCCGTTCTTGCCGACCTTCACTCCAGAGTATGCGAGCCGCGAGACTGGAGGGAGCTTGCGGGTGTCAGGGAAATAAGGCTGGCCAATGCCGTCACCATTGCAGCGTGGGCATTCAGGGCTTGGTTCTCGGTTGTGGTCGTAACCGTAACCACCGGAATCATCTGGTTCACGTCTGTCACGCTCAACAGCCTCGAGTCTTTTCTCTTCAAACTCAACTGCATCCCGCCACTGGTAGTGGTGACCGAAACCCCAGCAGTAACGACACGCGCCGCGGCGATACTGCGAAAGCTGGTTTGCATCGAAGGTGGCGAGCTGCCACATCTGCGCGAGGACTTCATCGGCACTGCCAAGCGTGCGCGCAATGGACTCTTTTTGCTGGTGCGCAATTGCCTGCGCAACGTTAGGATTCGCTATGAGCTGACGACCGTAGTTTGGGTCACTATAACCAGCACGTGCAGCAGCAGCGGTGGCGTTGTTGTCCTTCAGGTACTCCGCGACAAATAAGCGCTGCTGAGCAGTAAGTCCATCATCATCCACCAGCACTTCTGCGCTTTGTTCTTTCTGCGCAGTGCGCATTTTTTTCTGCGCAGGTTTTTGCGCAGTAGATTTTTTGATATATCGACGGGCGGTAGCGTAATTCAGTCCCTGCGCTTCACACCATTCCTTTGGTGATACGCCGGT
TGCGGCATGTTCGGACAGGAACCGTTGCTGAAGCCCGCCCCAGTCCGGTTTTGCCATTGCTTACTCCAATAAAAAAGCCACCAGCGGATGCCAGTGGCTCAAAGTATTATGCTTACCGATGCGGGCTAGGAGATAAAGTCGTCGCTCCCAAGCCTGTGGGAGCCGTAGCGAGCCTCATAACCATCACCCAACATCCCTGCTTTTGTCTCTGCAACATCTTTCGTCGCATAAACACCTGCAAGGTGCCATGGAGCATTACGAATTACCCCCCAGCCTTTAACCCAGCCGTCATTATCAATATCAGCTTTCAATCCTTCTGCAACAAACATATATGTCTCCTTTGGGTACCCAGAGGCATACTATTAAAAGCAGGCGATGAAATAAATGTTAGCCGTCAAGACTTGCAGTAATACTCAGATGGTGAAGGTCTGGTCGGTTGTGCTCAATAACAATATCACCAGCCCCAGCTTCAACATCATAGCTGCGGCAGTAAGGAGCAGTACACCTACCCGATACGTTGTCTTCAATCAGCACAACCCCTGATTGAATAGCCTTAAAAGTTACATCCGTCCCGTAAACCATCCTGTCTTTCCACTCTTCGAGACGCGTAAACGTAACAGTTAGCTTTGCCATAATCCCTCCTATGTTGAGCATTATCACAGGCACTGAGTGAATGCCTGCTGTAATGCCACCCACACTCATGCCCTTGAGTTGCTGTCGCTTCATCGCCGCTTATAACCAGTACGCGTATGGCCTTCGTGCTGCTTTACCGGAGCATGTTCCCTTATTTACCCTCACATCGGTATGCTATACCTGCTCGCTATTACGCGACTCGGGGCAGCATCATGACTGCTGCACTGCCTTTCGGCGGCGGCCCAACCGCTTTACTACTTCAAATTGACTTCCCCTTCTGGCAGTTCGCCTGCCACGTTTTTGGTCTATCTCAACGAGACGATTTAGGACGTATATCGCATTTATGCCAACCAGATCATTGCGTTGAGCTTTAAGTTCGGCAATTCTGGACTGGATGTCAGGTTTTGATAGATTTTCGGACGCAGTACGGTTGGCTGTTTTTGCGCTGTACCCCGCCCGAATAGCCGCTTGCGTGGCGTTTAAATCGATGAGGTACTCGCGACAGAACATTTCTTGCTTGTCGGTGAGTGCCATTTTTACCTCAAGGAGTTGTTAATGAGTAAATATAAAGTTGGAGATAAAGTTAAGCTTAGGTCTGGTGGCCCAGTCATGACCGTTCAACAAATTAGCGTTCCCCAACCAACAATGTATCGGGGCACTAATCGGTGCCAGTGGTTTGCAGGGAAGAAACTAGAAGAAGGTTATTTCCCAGACGACTCACTGGAGGAAGTTGGGGATGACGAGCAATAAGGAACAAGCCGTCTACTGGATGCTTGAACTTCTCAAAAGAGACGGATGTCTATATCAAGATGATGTAGTAGATCACTTGGTTAAAACTAAAAATGAAGACCTTTTAGTTGAAAATGCCGACGGTAATTTAGCCTTAAGTCGTCAGATCCTTACCTTGTTTATGAAACATACAGCTGAAGAAGTGGTTTGGGTGAGGCCTCATCGTTATTGGCGCTATCGTGTCGCTGAGGACGAACCTGGCCGTGAGGCTCGCGGTTAAAATAAAGGGCGATGATTCGCCCTTCTTTGCCATTGCGATGGTTCTGATCGCAGTGATGACAACTAAAACGCCCGGAGGGGGTTTTAGTTAGAAAAGGAAAGCTTATCGTTAAACGTAAGCTATATATTTTCTTCATCAACAATTGAGTACATACCCCTATCTCCAAGGCTATATACGTATTTGTCTTTGTATGAATTTCCTAGTTTACTTTGATAAGAAATGGTCATAACAATTGTTGACGGTAGCTCTTGTTCATTTACTTCCCAAGACTCTCCGATCACTACATCTCCCACTTTCGCGTGATGAGCAGAGTACTCACTATACGGGCTGAATGGTGGTTCAGTGTTTATCGACAACA
TTAAAGCATCTTCCCCACCAATGGACAGGCTACCTCTGTATTTATATTTTGTTCCACCTATAATGACTGCTGGTCTAATCGATGGTGCCTTTATAATTGGTTTATATTGCCTCTCTTTTTCCCGCTTAGATGTTTCAAGATTTTCGCGATCTGAAAGTAATTGCTCCCTCGCAACTGAAACCATTTCCCTGTACTGGTCTACAGAGTTTTTCAGCTCAAGTGCCTGAAGCTCAAGTGCTTTTGTATTTTGCTGCAACTCTTTTTGCTGTTGAAGATAGCCTAATACTAACCAGAGAAAAGCGACGGGTGAGAATGCCCCAGCTAAAAAGTCGCCAAACTCATTCCAAGACGTCATGACACCTAAATCCATAAAGTAAATTACGGCACCCAGCACGACGAAGTAAAGAACACTAACTATTAGTCCATACCAAAACAGTCTCATCACCATCTCCTTAATTGGATTTGGTATTTAATATCCGTTAAGAGTAATAAGTCAATTCAAGCACTGCCCTTTGATGTATTCCTGAAGGTAGCCAACCTGCTTCGTCACTGTGGCGATTCGCTCTCTGAGGGTGAAATAATCCCGTTCAGCGGAGTAAGTAAGTCGGGTGCGGGAAGCATCGCCCAGGCCGCCGGTGCTGGACGCTCCGTTCGCGGGAAAGTGTGCGTTGAGCTGCAGCCGCTTACGGCCAGCAATGACATCGCTATGCAAACGCTCAATGGTTTCTTTCGCATCAGCCAGTTCTCCTGTGTATTTGGCATCCAGCGCAGCCACATCTCGCTGTCGGGTTTGCATGTCTTTGATGCTAACGATCGCCAGGCTCAGTTGTTCGGTAGCGTTATCGCGCTGGCCTTTGTAGGTGATGGCGTTGTCGCGGTAGTGGTTAATCGCCCAGGCCATAGATACAAGCAGGCAGATAACGACAGCACAGATGATTGCTGTTAATCGGCTCATTTCTGGCCCCATATGCAGACTTCGCGCTCAATCTCACGGCGCGTTACTCACCCTTTCCACTGTTTGCCTCCAGCATGAGTCCAGCGCCGCAGCTGATCGCATACGCCTTTCAGGTTCCCATGTCCACCAGCTCATCTGCGCTTTTATCTTTCTGCGCAGTGCGTGCTATCTTTCGCGGGTAAATTTTTGTATTTTGCGTAGCAGTTTTCTTGATATAGCGACGAGCAGTTACATAATTAAGGTTATGTGTCTCGCACCATTCCTTAGGGGATATGCCAGTAGCAGCGTGATCAGACAGGAACCGCTTCTGCAGCTCGCCCCAGTCCGGCTTAGCCATTGTTACCTCTAAACTGAATGAACTTTAGACGTCACTCACAGCTTCAGTATTTGAAGCAATGAAGTATTTTTTCTCAAAGAAATCTTGAAATGAGGATTTAAGTTTATGAAATATGTATAACTACGATGACGTACAGAAAATCAAGGCCAATCTCGAGTGGATAGTGCATCAAGCCTCTGCCCGGTCTCATTTGCGCACTGAGCATGATCAATTAGTGATTTCCGATCTAATGGAACTGATTCAGACATATGAAACTCTTCTTGACCTTGTAAGCAAATTTGGTGCTTCCGTCTTAAACTCGGAAATCATAGCGGGTCTATCAATCACAGAGGAATTCATTGCTAAGGTTAAGCGGAATGAGGGTGCGATGTGAGCGACCAACACACTGTGAGACGATTGATTCGTGTATTGAAGCTTCAAACTGGTGGATTACAGTTTGAAGCATGGGTTATTTAATTGCCGTACAGTCGATTGAAAAGCGCATTTTTCATCGCATCAGAATCAATCGGATCCACGTTTAACCAGGTCAATGTCTCTTGGTTCTTTTCTGCATTGAAATCAGAAAACACACCATGGATATCACCGTTATCAGGTGAGTAGAGAACAGCAATATTCTGTTCTGGGCAACTGTGTGGTTTACACCCTGACAGAGCAATATACTTTTTGCCCGCAACTGTTACTTCGGTTGATGGTGTGCTTGTGCCACCA
CTTTTTACCCATGCAGGTAGTTTATTTTGACTTATCAGTTGGGAGTAGCTCTTAGACGTGCTTTTTGCACTGGCGAAGTCAGAAAGATACTGCCCTTCCTCAGCAACAGCGCTGAACGAAACCAAAGCCATAGCAGCGATAATCACTTTACCTTTCATGTTAATCCTCATTCCATAAAGACACCTCAACTCTATACCTTTACAGTCGCTATGTCAGCCCTATGGATAATCAGAGCATTTGATGTTACTGCCCGGCCCAAATGATTAGGTAAGGATTATCTTAATCACTAGCGCTTATGCTTGTTGATTACTGCCTGACTGCCAAACTGTTCAGGACTCTGATGCGGAGAATGCCAACTCCAGGGAATCATCGATAAAAAGAGCATGTGAAACTGAGACTCCTTTAGCTCTCCTTGCGAGGGCTTTTTTTTTGGATAGTGGCGCTTCGCTTGCTAAATATTGTGTGTTTTCTAAAGTTAAAAGGTGGCTTTGCTATGTCAGGTAAAGCCGTCGTTCAGAAATACCCGTGTGCTCAAGGACGAGCCATCCCTAATTCTTTCTTTCCAACTCGATCTGCCTTATGCCAGCGAAATTATTGTTTCCCTTTTCAATTACAGCCAATAGCGGCTTAATCCACAATACTGCCTGGCAGTACGTCATGGAGCTGGTGGCAACGGCACGATCATCGGCTGTGTCAGGTCCATTGGAATCGGTGTGCATGGCGCTGGCACGTAAACGGTGCGCGTATTTGAGCAGCCCACCAGCAATGTCAGCAGGAACAGGCAGATCACAGGTTTTTTCACGGCGGAGAATCTCCCGGTATTCGATTACGGTTTCTTCGGTGCTGGTGTCAATCAGGGAGTTGAGCCTATTGGCATGTTCTGCAACCTGATTGAATCGATTGAAGTTGAATGCCTGGGTAGCGATAACCTGCCCCTGCAGAGAATTGTCACTTCGCAGAACGTCGTTTTCGCTTTGAAGATTACTGGCCTCCAAGCAGCTCTTAACGAGGGCGACCGAAAGGCCAGCAATAATGACAACGCCGATAATACCCGGATTAATTTTCACTGGTCTATCCCCCAGCACGCCAGCGCGCTTTCCTGGTCTCTCCGTTCCACCTGACCGTAGCAGCCGTTCTTCTGACCTTTAGTCATGCGGCAATCGCGACCACCGTCTTTAATCCACCAGCGGATTGCCTCGCATGCCCCGCGGCGGTCACCGGCATTAATGCGCTTATAGAACGTGGACGGGAAGCACTTACTCGGCCCGATGTTGTACGGGCAGAAGGATGCGATCCCAGCTTTCTGCGGTTCGGTAAGCGGTACCGCAATATTGCGGTCAACCCACGCCAGAGCCTTATTGCGTTCGATGGCGTTCACTTGATTGCATTTGGCCTGAGTCAATTTCATGCCCTGCACAACCGGTTTACCATCAACCATCGTTGCGCCGCGGCAAATAGTCCAGATACCGCCAGCATCTTTGTAGGACGTGAGGCTGTTACCCTCTTTCTCATTCAGAAACTGGTCGAGGATTACGGATGCAGGAGCACCAGCTAGTATCAGCCCCAGAACTGCGGTACTCAACTTTGCTCTGGATCCCATCACTCACCTTCCTTTTGTAAGGCCTCAACGACCACGCTTGCAGCAGCAGGACGCTCGTGAAGGGGTTTATCACCAACACCTTTCAGATAGTCATTGACCATTTTTGTTCGCTTCTCATCCTCTCTACGCCTGCGGTTTGCATCTACCCGCCCGTTAATGTAGGAGGCAAGCGAGATAAGCAGACCAGCAGCGCCAAAGAACATGAACACCAGATCCTGAGTGGTAAATCCAATGGCTGACGCCAGAGCTGCTACCCACGCGAAGAACTGCGTGAAGATGTTCCCTGAATCATTCATTTTCATGGTCTCTCACCTCGCTAAGTGCGGGTGCTGTTGCTAGAAATAAAAAAGGCTGCCAAACGGCAACCTTATGATGATCTAAACCTGCTTGAGCGCCCTTC
TCATGAGGAGTGCAATAAATTAAATAATCCTTAAGAGAGCTATTTAACCCATTAAAATAATTAACTATTCAAATAGTTATTTATTGTTATCATTTGGGTTAAGTTAAAGAAAATCTTCTTAGGGATATAAGACATACAAGCGGGCATGCACTGGCTTTATTAATGCGGAGAGGGTTGATGTCGTTCTCCGCACTTTTTAGTGCTTACATCTGGCTACCAATGCCACTAGTCAAGACATTAGTGCCAGAAACGCATTGAGACATTCATCCACCTCTAATTATGCCCACCCCATTCAGAAGATTTGAGTGAATAAAAAAAGCCCGCTTTTGAAGGCGGGCTAATGAGTTGACTATTGGTAAGGTAGGTGCGAGTAGTACCTATGCTCAGTAGTGAAACTGTATCGGCTTATTCACGTTTGGTTCTGGAGAACCATCAGGCAGTTATCTTCGACCCACTTTTCAAGCGTAGCAGCAGTTTAAAATTTCATAAAAAAAGGCCTGCTTTTTTACGGCAGGCTCTCAAGGAAATTGAAACTGTATTGTTATTGTCATGGTGCCGGGTGCCTCCCGGTGACCCTACCCCAGTGAGCAAGGCCGCGTGCATACCTGCAGAGCGCAGTTGACTGGAACGCCCTTTCGCTTAGAAAGGATTCACCACACGCATAAATTACGCATGAAATATTCACTCGGTCAATACTGTTCATCATTGGCAAAAAAAAGCCTGCTCGGAAAAGCAGGCATAGATCGCTAAGTTGGAAATAATTGAGGGTGTGGTGCCGGGTGCCTCCCGGTGGAAATGATCACAGCACTCATTCCCGCGCGCTGGTTGGACACTCTGGAGAAATGTCCTGCTGAATCGCCCCTCCGCTTAGGGGGATCCACCACAAAAACGCTTTCAGAAACATCCATTCTGCAGGATGCATAAGAAGCTTATGTGCAGTATGAAGAATCTGCCACGTAATCAGATGAATATATTCATTTAAATGGTACAGGCAGAGGGCCTTCAATCACCTCGGCCTCTCCGTTATCGCAGATGTCGTCACCCTGTGTCAGGTGCCAAATACCATTAAAAGTAAGTCCCGTCTCAAGGTCTTCTGTAACACCATTGCTAAAGTAAGCAACCTGTACTCTGCCGTTGTGTTGAATCCAGTAATAGCCTTCTCTCATCATTCCCCCTCCTGTTCGATATAGAGATTATAAGAGGCAATGAATATTGATGATTTTAGTAATACTTAAATCGCTATTAAGCAAAAAGCCCCACGGGGTAAACCGCAGGGCTTTAAACGAAGGCAATAACCCATCGTTAGAGCAAAATTACCACAGATTCGGGAAAAGTAAATAGCTCACGATAAAATAATGCCCTACTTTGTTATCTGCTTCAGCTGCGCATCAGCCCAGGCTTCTTCGATATCAAACTTGGTGATGAGCTGGTCGTAGAATGGCTTAACAGACTTCTTCCAAGTATCGAGGCTGATTGCATCCGTTATCTGGCACACCGCGGCGTAAGCCTCAGTGGAAGGAATTCGTTCATGCCCCCGCCCGCTGCAGCGCTTACAGTCAGCCAGAACCGGCACACCCTGCTTTTCAGTGAGCTCCTGATTCACTGCTTTACCGCGCCCGTGGCAATCTCTACAGGCACAACTAACTACCTTCTTACCCTTACACTGAGGGCAGAGAACTCGCGCTATCTCCCTGACCTTCCTGCGCACTTCATACTCGGAAGGGTGAATATTTTCCTCACCCATGTGCAAAGACATCTTCACGAACTTCTTCTCTTTTACCGGCGTGTGAGACTTCATGCTGAAAACTTCCGCGTCTATAAACCCTTCCCCATTGCAGCCATCACACTGCTTCACGCTGGCGGCGCTGCGGGAATAGTCTTCGAACGCGAAGGTGGCCAGCTGATGCATCACCAGTGGTTTAATCTCATCATCAAGCTTCCTTAATGCCGCCACCCGATCGCACTTGGTCAGCGCATACTGAGCCAGCAACTCAATCGCG
CTCTCCCGGTCATTGTTGCTGATACCCATCTTCCCGAGAAAAGCGCTGTAACCCATGGCGGCCCGTTCCTGCGTCATGCCCATCGCAGCCATAATATCCGAACCAGTTAATGCATCTGATGCCGTAGCACGAGGAGAGTCGCTAATCATCGTCGATTTGGCGAAGTGATATTTGAGTGTATTTTCGAGATTCATGCGGTCTCCAGCTCGGTAATGGTGAGTTCTAATTTTCCGCCCTTAACGACAGGCATTTTCACAACGCGATAATCCACAACCTGGCTGTCATCCAGCCAGAATCCCGCCTTGGTTAAAGCGTCAAATGCAGCTTTCTGCAGGTTATCCAGATCGCGGCGCCGGCGATCTGGCATGTAACATTCAATGCGGATTTTAAGTGGTGCGGCCGTGCGGATATTAAGCCGGGCGCTTCGAATGACGCTGGCGACTACATATCGGTACGCAAGGCCATCAGCACTGATATGCGTTCTCCCGCGGTTGTGCCGGTAATACCGGTTGTTGCTCGGCGGCCAGGGCAAAGTGATTTGATATGTCTTCACGTTTACCCCCACATCCGGTTTCGCCAGCGGCTATCCGGGCGCGCTGGAGTATTTGAGGTCGGAAGGAATGCACTGACAGTCCAGGTCACATAATCCTGGTTTAGGCTGCGCTCTACTCGCACGCCGCGCGCTTTGTAACGCTTAACCAGTTCGTCGGCCTGTTCGGTGCTGCAATCGGTGTGGTGGAACCAGGAATACTTCATTCCATCACCCCGCAAAGCCAAGCAGCTGCGCGGCGACATTTTCTGCCTCATCACGACTACGGAATGAACGCGACAGGACCCAGCGCCAAAGAACATCGAGCGCAGCTTTATAGAGCTGCTGGAACTCGAACTCGTCCATGTTGGCGAATGAGATGCTACGAGGATGTTTTTTTAGTGTTCCGTCAGGTAGCTGAATGGCATCAAAGTGCCCTGCCTCAACGATCACCCATGAGCGGTAAGCATCGAAGGATTTGCACAGGCTAATACCATTCGTGACTCGCCGGTAAGCAACCTGCTCAAGATACTGCTCAGCAGCATCGATCAGAGCCCCCTCGTTTCCGCCATACGAGGCCAGGAACTTGGCGTAGCCGGTGATCAGCTTCCGCTCGTTACTCGAGATAGCCCCGCCGGTTGGTTCCCAGTATTCAAAACCGAGATTGAGAAGCGCGAAAAAGCGCCGGTGAAATGCCGGGTTTCGTACCCGCCTGAACTCGGCAACAAGAACATCGCCGAGCCGGGTTTTTGATTGCAGGATATCGCTGGTCTCGGGTGTGGCCGGGATCAGTATTCCTGAGTGGTGTTTGATAAGTTGTAATTCTAGCGCCATGGTTCTCTCCGTGGCGCATCAGGTATAGGTTGTTCAGGCCTATGAAAGAATAATATCAGACGGTGGTGTAACTCGGTACCCCAGTCGTTTTGCAAATTGCATAAACCCGTTGAGAGTGAATATTTCTTCCTCTTCGAGCAACGGTCGTAATGAAACTATTCCATTTACTCGATAAACCAGATATCTGCCCTCAGCCGGGAAGCTATAGATAACTGCTTTATCGGCCCTTCTGACCACGTCGTACCATTGATCATCTGCATTAAAGGCATCTGCACTACACACTATTTCCCCCAGAGCGACTTATTGACGCGGTAAACAGTAATCGGGAACAGCCAGGGGAACGCAAACAGCGATACTCTTTGAAACTGCTCCAGTGAAATTCACGCGATTAATAAAACCACTCGTCGGCGCTTTCCCAGGTCTCCTGCACGATTTGCTCAACCTCTTTCTTGTCGCCCCCGAAAACACTCAGACCATCATTGCTGGCACGCTTGATCGTTAGCTGACAATTATCAAACTGCTTGCTGAGCCTTTTGAGCAGTTCTGACTCTAGTGCAGGTATAGCTCCATCAGGAAGTTTCTTCATGCGATCAATGGCTAACTCGATTTTCAT
Protein sequences of DBSCAN-SWA_4 >NC_018079|2693370:2717270|2699978_2700281_-|WP_038419704.1|DBSCAN-SWA MVMPQGLQCWDSAGRIAVDLSDYAIRYIGSTSVTFAAGETSKNVSFAGITQDGSFISIVSTGVTVNEYHCRAYNGGFTAYYLPSTGSPAITLNVEVYNFQ >NC_018079|2693370:2717270|2694149_2695286_+|WP_014832148.1|DBSCAN-SWA MDLLFSLCLVLTCLVLAIGMFSLPFFSFIDEETDAARANSLDGMRFILASFVIFHHIDCAYTYITKGKWMPTSDWLLYLGKYGVALFFMITAFLFWGKVRTSNQIDWVELYKKRFYRIAPLSFFCSAIALASLFLLTQRKDFSHSILAASLSWFDAGLWNSKPAVTDFTPPFMALAGVTWTLRWEWIFYFTLPLFFMLKKWSFELSVSVFAFSVYFLPEFTKDAYLWSYFFAGMLCSVLKDKINLTSKHANIILVLMILITLLVQPTLYAPPEKIFLSVIFFSVISGANLYGILISKAAIRLGAISYSLYLTQGLILFPMVIHFKNQGDLELNINTFIIFAASYILICILSSLTFHFIERPFMKRFKPKSISEQAYNK >NC_018079|2693370:2717270|2707419_2707623_-|WP_014832159.1|DBSCAN-SWA MFVAEGLKADIDNDGWVKGWGVIRNAPWHLAGVYATKDVAETKAGMLGDGYEARYGSHRLGSDDFIS >NC_018079|2693370:2717270|2703864_2704581_-|WP_014832155.1|DBSCAN-SWA MKKTSILFLCTSLFSGMALAENHYIPLLYNLSTIFDFNPVKGAVKSLDTDIEENGKVTYKIAIRLDKNGCVESLDLDNVSSGHKTNLKNSNGSLVGQRDGKPFSIQIDEKCNILSKNENGDELRYDLYSNGLIKDTYFLGKKISEHFYDDDSNLIRSEFYGSGKILSKNEISYVDKDRKPLDYKIINTSVYSEGYTATNTCHYSEKLVPEICKVTMQSAGNPVPKPVLMAANTKVEFY >NC_018079|2693370:2717270|2699013_2699376_+|WP_043951644.1|DBSCAN-SWA MKRLFAAALLLLAGCADKHTDYTFKMDYPVEAARLSLGGDIHVSIDCATREMKVISDSSNGIFSRHVNKRLSNICYKKTDKLDVIYRFNAAKGVRQDMIATQYPRVPPVSNTDKLSDRDS >NC_018079|2693370:2717270|2712893_2713196_-|WP_014832171.1|DBSCAN-SWA MKMNDSGNIFTQFFAWVAALASAIGFTTQDLVFMFFGAAGLLISLASYINGRVDANRRRREDEKRTKMVNDYLKGVGDKPLHERPAAASVVVEALQKEGE >NC_018079|2693370:2717270|2712357_2712894_-|WP_014832170.1|DBSCAN-SWA MGSRAKLSTAVLGLILAGAPASVILDQFLNEKEGNSLTSYKDAGGIWTICRGATMVDGKPVVQGMKLTQAKCNQVNAIERNKALAWVDRNIAVPLTEPQKAGIASFCPYNIGPSKCFPSTFYKRINAGDRRGACEAIRWWIKDGGRDCRMTKGQKNGCYGQVERRDQESALACWGIDQ >NC_018079|2693370:2717270|2716054_2716657_-|WP_014832175.1|DBSCAN-SWA MALELQLIKHHSGILIPATPETSDILQSKTRLGDVLVAEFRRVRNPAFHRRFFALLNLGFEYWEPTGGAISSNERKLITGYAKFLASYGGNEGALIDAAEQYLEQVAYRRVTNGISLCKSFDAYRSWVIVEAGHFDAIQLPDGTLKKHPRSISFANMDEFEFQQLYKAALDVLWRWVLSRSFRSRDEAENVAAQLLGFAG 
>NC_018079|2693370:2717270|2709055_2709736_-|WP_014832164.1|DBSCAN-SWA MRLFWYGLIVSVLYFVVLGAVIYFMDLGVMTSWNEFGDFLAGAFSPVAFLWLVLGYLQQQKELQQNTKALELQALELKNSVDQYREMVSVAREQLLSDRENLETSKREKERQYKPIIKAPSIRPAVIIGGTKYKYRGSLSIGGEDALMLSINTEPPFSPYSEYSAHHAKVGDVVIGESWEVNEQELPSTIVMTISYQSKLGNSYKDKYVYSLGDRGMYSIVDEENI >NC_018079|2693370:2717270|2708663_2708936_+|WP_014832163.1|DBSCAN-SWA MTSNKEQAVYWMLELLKRDGCLYQDDVVDHLVKTKNEDLLVENADGNLALSRQILTLFMKHTAEEVVWVRPHRYWRYRVAEDEPGREARG >NC_018079|2693370:2717270|2697603_2698887_-|WP_043951643.1|DBSCAN-SWA MLYNTGTIAINGNTATGTGTNWTAPASQVRAGQTIIVMSNPVQMFQISAVNSATSMTVTPAASPALSGQKYGILVSDNISVDGLAQAMSQLIKEYDENIGAWETFATTSANQSITVTINGTTVTIPGIGKLAQKGSNGALAIADGGTGATTAESARSNLGLRSAATVDLQSSITDTTAGVALMQGAFGLGQSIPNQFSTGIPLDRTQFFGATQGNGSNPVAQRMGGVVLAYAGSSYACVIAAALDTTSPTIWFRGMAGGTAGSWQKFYTSTNTTVASDGTLKAASPVVKLFSDGSFETNDESEGCSVTRLSTGEYLIEGCEGLNSEATWGGIDGGFDIPTDRNKQPLIWLDYEVHADGSVLVKTYHRTHPAAPEFARNELQDISDGDPIDIPSDQFVSVRVQMPQDSIDSIWNQRKALAEIPDSSAS >NC_018079|2693370:2717270|2709787_2710249_-|WP_014832165.1|lysis|DBSCAN-SWA MSRLTAIICAVVICLLVSMAWAINHYRDNAITYKGQRDNATEQLSLAIVSIKDMQTRQRDVAALDAKYTGELADAKETIERLHSDVIAGRKRLQLNAHFPANGASSTGGLGDASRTRLTYSAERDYFTLRERIATVTKQVGYLQEYIKGQCLN >NC_018079|2693370:2717270|2714268_2714460_-|WP_014832172.1|DBSCAN-SWA MREGYYWIQHNGRVQVAYFSNGVTEDLETGLTFNGIWHLTQGDDICDNGEAEVIEGPLPVPFK >NC_018079|2693370:2717270|2714654_2715488_-|WP_014832173.1|DBSCAN-SWA MNLENTLKYHFAKSTMISDSPRATASDALTGSDIMAAMGMTQERAAMGYSAFLGKMGISNNDRESAIELLAQYALTKCDRVAALRKLDDEIKPLVMHQLATFAFEDYSRSAASVKQCDGCNGEGFIDAEVFSMKSHTPVKEKKFVKMSLHMGEENIHPSEYEVRRKVREIARVLCPQCKGKKVVSCACRDCHGRGKAVNQELTEKQGVPVLADCKRCSGRGHERIPSTEAYAAVCQITDAISLDTWKKSVKPFYDQLITKFDIEEAWADAQLKQITK >NC_018079|2693370:2717270|2696597_2697017_-|WP_014832150.1|DBSCAN-SWA MEFIRSTELREIIALPLFSDLVQCGFPSPAADYVEQRIDLNELLVAHPSSTYFVKAAGDSMIEAGISDGDLLVVDSSLTAEHGDIVIAAVEGEFTVKRLQLRPTVQLIPMNSAYSPIIVGSEDSLDVFGVVTFIVKSAS >NC_018079|2693370:2717270|2708482_2708677_+|WP_014832162.1|DBSCAN-SWA 
MSKYKVGDKVKLRSGGPVMTVQQISVPQPTMYRGTNRCQWFAGKKLEEGYFPDDSLEEVGDDEQ >NC_018079|2693370:2717270|2700274_2703811_-|WP_014832154.1|tail|DBSCAN-SWA MALATAIKGRKGGSSSSRTPTEQPDDLQSVAKAKILVALGEGEFAGQLTAKDIYLDGTALENADGSQNFSGVTWEFRAGTQAQKYIQGIPGTENEINVGTEVSSATAWTRTFTNTQLSAVRLRLKWPSLFKQEDDGDLVGYSVNYAIDLQTDGGTWQTVLNTSVTGKTTSGYERSHRIDLPQAGSTWTIRLRKITSDANSAKIGDTMMLQSFTEVIDAKLRYPNTALLYIEFDSSQFNGSIPQIACEPRGRVIRVPDTYDPETRTYSGTWAGTFKWAWTDNPAWIFYDLVVSDRFGLGDRLTTANIDKWTLYQVAQYCDQMVPDGKGGSGTEPRYTCNVYIQERNDAYTVLRDFAAIFRGMTYWGDDQIVALADMPRDVDFTYTHANVIDGRFTYSSSTTKNRYTNALVSWSDPDNAYSDAMEPVFEQALVARYGFNQLEITAIGCTRQSEANRKGRWGILTNNKDRVVTFNVGEDGNIPQPGYVIAVADRNLSGRDLGGRISAVNGRVLTLDRAPDASAADRMIVNLPSGVSQSRTIQSITGNKVTVTTAYSETPVAEAVWVIESDELYAQQYRVITVTDNNDGTFRIVGANHDPDKFDRIDTGAIIDQRPVSVIPPGNQSPPANILISSFSVVQQNISVETMRVSWDQALNAIAYEAQWRRNDGNWVNVPRSSTTSFDVPGIYAGRYLVRVRAINAAEISSGWGYSEEKTLTGKVGNPPKPVGFIASENVVFGIELNWGFPANTDDTLKTEIQYSLTGTEDDAMLLADVPYPQRKYQQMGLKAGQIFWYRAQLVDRSGNESGYTEWVRGQASIDVSDITDVILEEIKDSDTFKDLIENAVDTNEKIAGMADDIKQANDELELQAQEIAKNAQEIGQVQTSVNELSSTVGDVSSSLLELEQTVATADTALGQRIDSISVSMDGMTGGVKNSAIAIIQANLARVATRKTLSASVAGNSANLDRIDEVIVNDREATARSLLSLQTDVNGNKASINSLNQTFSDYQQATATQINGITATVNGHTSAITTNAQAIANVNGDLSAMYNIKVGVSSNGQYYAAGMGIGVENTPSGMQSQVIFLADRFAVTTAAGNSVALPFVIQNGQTFIRASFIQDGTIENAKIGNYIQSNNYVAGSAGWKLDKGGTFEINGVAGGGRMLITSTLIRIYDSNNVLRVRMGLW >NC_018079|2693370:2717270|2705085_2706393_-|WP_043951645.1|terminase|DBSCAN-SWA MGISPTLNTPQARFLAMQHKFKAYVAGFGSGKTWVGCGGICKGMWEHPKINQGYFAPTYPQIRDIFYPTIEEVAFDWGLSVKINEGNKEVHFYEGRRYRGTTICRSMEKPGSIVGFKIGNAMVDELDVMAAAKAQQAWRKIIARMRYKVDGLRNGIDVTTTPEGFKFVYQQFVKAVREKPELAALYGLIQASTFDNAKNLPPDYIPSLLSSYPEELIQAYLRGKFTNLNSGTIYHTFNRKLNNCSDEIQDGDQLFIGMDFNVGKMAAIVHVKRNGLPRAVRELVKVYDTPAMIKRIQEEFWRYEDGRYVKSREIYIYPDASGDSRKSQNASKTDIAQLNEAGFSVIVDDANPPVKDRINSMNAMFCNANGERRYLVNVQNCPVYTESLEQQIWAANGEPDKSADNDHPNDAGGYFIVKDYPIVKPAYSITMDTTF >NC_018079|2693370:2717270|2717045_2717270_-|WP_014832176.1|DBSCAN-SWA MKIELAIDRMKKLPDGAIPALESELLKRLSKQFDNCQLTIKRASNDGLSVFGGDKKEVEQIVQETWESADEWFY 
>NC_018079|2693370:2717270|2706370_2707351_-|WP_014832158.1|terminase|DBSCAN-SWA MAKPDWGGLQQRFLSEHAATGVSPKEWCEAQGLNYATARRYIKKSTAQKPAQKKMRTAQKEQSAEVLVDDDGLTAQQRLFVAEYLKDNNATAAAARAGYSDPNYGRQLIANPNVAQAIAHQQKESIARTLGSADEVLAQMWQLATFDANQLSQYRRGACRYCWGFGHHYQWRDAVEFEEKRLEAVERDRREPDDSGGYGYDHNREPSPECPRCNGDGIGQPYFPDTRKLPPVSRLAYSGVKVGKNGVEITAISRERMYEAVMKRLGLADSEFAQRLQQIEIERRQLEVEKLRKELAGEGEDDEPTPVAININVVDARADDGDKPDT >NC_018079|2693370:2717270|2715484_2715847_-|WP_014832174.1|DBSCAN-SWA MKTYQITLPWPPSNNRYYRHNRGRTHISADGLAYRYVVASVIRSARLNIRTAAPLKIRIECYMPDRRRRDLDNLQKAAFDALTKAGFWLDDSQVVDYRVVKMPVVKGGKLELTITELETA >NC_018079|2693370:2717270|2695326_2696595_-|WP_014832149.1|DBSCAN-SWA MFALCDVNSFYASCETVFRPDLKGRPVVVLSNNDGCVIARSAEAKAAGITMGEPFFKQKELFRRAGVVCFSSNYELYADMSNRVMSTLEEMSPRVEIYSIDEAFCDLTGVRSCRDLTDFGKEIRATVLKRTHLTVGVGIAQTKTLAKLANHAAKKWQRQTGGVVDLSNIDRQRRLLALVPVEDVWGVGRRISKKLNAMGIKTALDLSEQSTGIIRKHFNVVLERTVRELRGEPCLELEEFAPAKQEIVCSRSFGERVTEYEQMRQAICSYAARGAEKLRSEHQYCRFISTFVKTSPFALNEPYYGNSASMKLLTPTQDSRDIINAAVKCLDKIWSDGHRYQKAGIMLGDFFSQGVAQLNLFDENAPRAGSGKLMEVLDHLNAKDGKGTLYFAGQGIQQQWQMKREMLSPRYTTRYSDLLIVR >NC_018079|2693370:2717270|2699340_2699982_-|WP_014832152.1|DBSCAN-SWA MSGFEVYNSDGKLLVDSQNRSTLFYDQRMLGAVNNKGFYQIDSPFGNGSTLGFTPQSFWNDGRLRWLQLGANRYGMPGADLLEDNAGSMIRTARNIGIQSGYLDVFDSAGYLIWSAASASKMPRVVGFFDVPTNYDLQNNTLSVSLSFNPWILVNNCPGNLSDDGEVVGYSGIVLKWTGSQLQGRYISKNQRSWSQTMQGRGLRIPVAQFVGI >NC_018079|2693370:2717270|2707681_2708020_-|WP_014832160.1|DBSCAN-SWA MKRQQLKGMSVGGITAGIHSVPVIMLNIGGIMAKLTVTFTRLEEWKDRMVYGTDVTFKAIQSGVVLIEDNVSGRCTAPYCRSYDVEAGAGDIVIEHNRPDLHHLSITASLDG >NC_018079|2693370:2717270|2693370_2694054_+|WP_014832147.1|DBSCAN-SWA MCGRFAQAQTREEYLAYLADEADRDIAYDPEPIGRYNVAPGTKVLLLSERDEQLHLDPVFWGYAPGWWDKAPLINARVETAATSRMFKPLWQHGRAICFADGWFEWKNEGNKKQPYFIHRADGQPIFMAAIGSTPFERGDEAEGFLIVTSAADKGLVDIHDRRPLVLSPEAAREWMRQDVGGKEAEEIIADGTVPADKFIWHAVTPAVGNVKNQGPEMIAKTIDNYE >NC_018079|2693370:2717270|2711038_2711452_-|WP_014832168.1|DBSCAN-SWA 
MKGKVIIAAMALVSFSAVAEEGQYLSDFASAKSTSKSYSQLISQNKLPAWVKSGGTSTPSTEVTVAGKKYIALSGCKPHSCPEQNIAVLYSPDNGDIHGVFSDFNAEKNQETLTWLNVDPIDSDAMKNALFNRLYGN >NC_018079|2693370:2717270|2711842_2712361_-|WP_014832169.1|DBSCAN-SWA MKINPGIIGVVIIAGLSVALVKSCLEASNLQSENDVLRSDNSLQGQVIATQAFNFNRFNQVAEHANRLNSLIDTSTEETVIEYREILRREKTCDLPVPADIAGGLLKYAHRLRASAMHTDSNGPDTADDRAVATSSMTYCQAVLWIKPLLAVIEKGNNNFAGIRQIELERKN |
27 | Enterobacteria_phage(47.37%) | lysis,tail,terminase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
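The keyword labels mentioned in the legend also show up in some protein FASTA headers on this page (for example `lysis`, `tail`, `terminase`, `integrase`), as an extra field before the trailing `DBSCAN-SWA` tag. The sketch below splits such a header into its parts. The field layout is inferred only from the headers visible here, not from a documented DBSCAN-SWA format, and the names `ProteinHeader` and `parse_header` are hypothetical helpers introduced for illustration.

```python
from typing import NamedTuple, Optional


class ProteinHeader(NamedTuple):
    contig: str
    region_start: int       # prophage region start on the contig
    region_end: int         # prophage region end on the contig
    gene_start: int
    gene_end: int
    strand: str             # "+" or "-"
    accession: str
    keyword: Optional[str]  # e.g. "tail"; None when the header carries no label


def parse_header(header: str) -> ProteinHeader:
    """Parse a header such as
    '>NC_018079|2693370:2717270|2709787_2710249_-|WP_014832165.1|lysis|DBSCAN-SWA'.

    Assumed layout (inferred from this page): contig | region | gene | accession
    | optional keyword | source tag.
    """
    fields = header.lstrip(">").strip().split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected header layout: {header!r}")
    contig, region, gene, accession = fields[:4]
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    # A sixth field means a keyword label sits between accession and source tag.
    keyword = fields[4] if len(fields) == 6 else None
    return ProteinHeader(contig, region_start, region_end,
                         int(gene_start), int(gene_end), strand,
                         accession, keyword)


if __name__ == "__main__":
    labeled = parse_header(
        ">NC_018079|2693370:2717270|2709787_2710249_-|WP_014832165.1|lysis|DBSCAN-SWA"
    )
    print(labeled.accession, labeled.keyword)
```

Headers without a label, such as `>NC_018079|2693370:2717270|2699978_2700281_-|WP_038419704.1|DBSCAN-SWA`, come back with `keyword` set to `None`, so filtering for labeled proteins is a simple `is not None` check.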
DBSCAN-SWA_5 |
2723991 : 2734602
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_018079|2723991:2734602|DBSCAN-SWA GTTATCCCCTCCCACCAGCCTTCATGCGCTCATGTTTGGCTTTCAAAATCTCCACCGGTGTCGGCCCCTTCGTTGGTACCGGTGCGGCCAGCGCCCGCCGAGCAGGCGGAATCGGCTTCCCAGCCAGCACCCGCTTTTCCCACATATCCAGAATGTCGCTGGCTTCGCGCTCTAGTTCTTTGTGGCTCAACTGGCCATCGGTTCCGCGGCGCCGCAGTTCAAGGCAAATGTGGTAATAAACCGGCTTTGGCCACGGATATAGCTCGCTGCTCGGGTACCGGAAAACCAGCTTCCTCCACTTCCAGTATTCAGCCATGACGTCAGCAGTGGTGATCCCCAGTACGCAGCGCCCTTCCCTGCACCACTTGATGAACTGGCCTGGCGAAGGCAGAAATGGACGCTCCTGTCGGCGTACCATGCGCATACCAGCTTCAACTTGCTCAATGGTATTAATCCCGTTTTCTTTAAAGGCCAGCACCCATTGCCGGCGGATCTCGTTCACGTCTTCCTGGCTGCGATTAACCAGGCTTGCCGGGAACGCGGCTGCCAGCTGTACGAACAGCCCGTTGATAATCTGAGCCACCTGCTGCGTTTGTTCGCGTTCGGTGTACTGCTCAGGCATGTTGTGCGCCACGCGGCGAGCCTGTTCCCTGTCAAAATTGCGAATGCTCTCTGCGAGGTTTTTCATTCCAGCACCCCGTCAATCCAGTCGGTGTTATGCAGGTCAATGCCGCCCCGGGTAAGTTTTACCGTTCCGGTTGCGCGCAGCCGTTTGGTAGTGAGCTGATCCCACTGTTTGCGCAGACTTGACGGGCTCAGGATGTTGTCTCTCCAGAACTCGTCCCGGTTGGCCCACTGGAACAGGTCGCAGATTTCGTAGTGCGTGCGATTGTCCTGGACACGCATCAGCCTGATGGTGTTTGCCCATTCAGCCCAGTTGGGTTCGGATAGCGAGGCGTTGACCGTAAGAAGCCTGTCGTAAATCCAGCGAGCGGCCTTGAGGTCATCAGCGGATCCCCATGATTTACCTGCAGGGGTGTATATTCCGGCGGCAGCTTCTGGATGGCGTGAGAGAAACTTTTGAGTTTTCTGGTTTCGGGATTCGTCAGAATTCCGAGACGAGGATATTTTAATATTGTTCTTGTTATAGTCTTGGGTGTCTACCGTTTCCGGGAAGACTTTTCCCGTTTTCGGTAACACTTTTCCCGATTTCGGGAAGGTTTTTCCCGTTTTCGGTTTGTCTAAAATCCAGGCAGAAAGGTCAGTATTTATACCGACAGTTTTCATCACGCCCTGCTTTTGATTGAATATAATTTTACGTTCAGCGAGTGATTTGAGCGCATCAGAAACATGCGAATCACTCAGCCCTGTAAGCTCGGCGATCACCGTGTTCGTAACGCGGTCCTGTTTCTTGTTCCAGCCGTAGGTAAGCCAGATCACCGCCTCAAAACATTGCCACTCCCGGCCTGACATTCTCAGACGAGGCTTGAGCTGTTGGATCTCGTTAGCGACCTTGGTATACCCATTCGACAGGTCGGCCATACGACCTCCCGGTTGTTCGGTTCTGTTGGGGAAATTGATAATTTCAGCTGTGTTTGACATACTTAGCTCCGCAATTACACTCCGTTTTTGCACCTGAAAGTCGGTTCTGTTCGCGCAGACCGGCTTTCGCCTTTTCTAGAGTATTCACATTGCCCCCAGCATGGTTGTCACCATCGCCAGCAGCGGTGCTGTAAGATCCGGATCGACTCTGAACATTTCGAAAATCCCCTCGCCTAACTCCTTCAGCTTTTCCTTCTTTGGTGCATCGAGCATCAGAGCTTGCTTCGCCTCACTCACCTCTTTTTCCAACCTGGCCATCCGGTAGGCAAACGTGTCGTTTTTAACGACACGGTCGCGGTATCGAATCGGTAATACATACATGA
TGGCGGGCACCAGCTGTTCGATGTTCCTACGGTACGCAGCGGAGTCTTCTTTGTTGTCCAGCCAGCGAAACAGCTTCACGTTCCAGACATCGGCCTGGCCTGAAAAATCCACGCCAACAACTTGAAGTTCTTCCGCTGCTTCCTGGATCTGAAGCGCAACAGTTACGCGCCCTTCTACCGCGGCCCAAGCTCGTACCGCTGAGCAGATATCGCGATGATCAATATCCTTCACTGCCGATTCGCTTTGATGACACTGGAATATCAGTGAATTAGAGGAAGCTCTGCTACTCTGTTGAAATGAAACAGTTTGCATTGTTAAGGCTCCTGGTTAGGTAAACCGTCTGTGGGATTCGGGTAAAGATCAGGGCGCAACTCGTGTGGAGTCACGCCGGTAACCCCATAAATTGGCAGCACTCGCTCAGCCGGGATCCCCTTGCGGCGCCAAAGCGAAACAGCCATTTTTGAAACGCCGATCAGAGTACCAAGCGCACTGGCCGAGCCAGATCGAAAAATTGCAAGTTCAATTCCAGTCATAGGACCTCCTTAAGTGCAAAAGAGTAAAGCATCAATTTACCAATCAGTCAATAAACGCCTGCCTATCAAGTGGTAAAGCTATTGTTTACAATCCATATATGAATAGAAAAGAACCTAACCAGAGCCTAATTTCTAGGCTGACTGAATTGAATAACAAAGGCTTCTCAAAAACAGAGATGGCCAGAGTAGCTAATGTCAGCAAACAAGCGGTGACCGGATGGTTTCGAACCGGCAAGATGAGTAAAGAATCGGCACTTGCTTTAGCTGATGCAGCTGGGGTGTCGGTCCCTTGGCTCCTTGGTGAGGAAGTTGGCGAGAAAGACGGACTTAAGGCAGACGAACAGCGCCTGCTTGAGCTCTATCGCCAGCTGCCCGAGGAAGAGCAGAAGAACATGCTCCGCGTCTTCTCAATTCGCCTGAAGGAGCTAGATGAACTGTATGCGAAGTACATGAGCCGAAGGATCAAGGGCGATAACGGAGTAAATTAATCAATCCCTTAATTCCGATCTCTTCCATGAGAAGTAAGGAAGTAATACTATGGATGCAAAAATATCTTTTCTTTTCCCATATACGACTGTTGGGATAAATCAGCACAATTTTTCCCCTGTACTAACTTTTGAATGCGACGTTTTACCTGTCAAAGCGGTACTGCAAATTGCTTTTTACTTTATCTGCCTGAAAAACAAAGAAAACTACAGATTAAGATTTGACATACTTCGAGATGGAACTTCTGTAATCGACGATAGCTGGGATAGAGACAAAATATTTATGTCAGAAGATCCATCTTCTGAACCCGATAAAGTAGCAGTTGGATTGAATATTGATCTTCCATCAGTACCATTTGACATGGAAGGAATTTATCAGATTAGTGCTGAGCTTTTTTACCCTCAAGATAGTAAAACACCCATTCACCAAAATGATGCCTTTTTTAAGGTAACAAAGAAGGCTGAGTAAAAGTACATGCCGGAAAAAGTTACAATACTGAGACCTGGAAAAGAGACCATGCGGTTGCCTGCTGGACGCTATAACGAAAATAATGCATCATTCGAGCATGATGGCGGTAATGGCGGAGGTGGAAACATGCTTGAGGCTAGAGTTGCAAAACTCGAAGCAGATGTCGAAAACATCAAAGTGAATCTTTCTGAAGCACGAATGGATATTCGTGAGCTTACTAAGAGCTCAGCCTCTATTAAAACTGATATATCTACAGTATTACAAAAACTAAAAGATATAGACGAAAAGCTTTCAACTAAAGCAAGCAAAGATTTCGTTGATTCTAAAGCCGGCGATATCAAGGTTTGGATGTTAGGCTTACTTTTACTTTCTATAGCAATGCCAATAATAATGTTCTTGCTCAACCTTTATCTCAAAAAGCCGTAACCAATTACCCAGCCACCGCGCTGGGTTTTCTTTGCCTTCTTCCAACAGCCCAACCACCAAGCCCCTGCCCTGAACTCACAGATCCC
GACCTTAGCGTCGGGATTTTTTTTGCCCTCAATAGCTCATTTTCATACTCCACATACTTCAGGTAAAGTATTACTGTACTTTTGTGTATTCAAACACTTGACCTATGGGTAAAGTGGTGGTTTACTAGAATAACCAAGACGCACTACGAACCACCAAGGCAGGACGCCCACGAAGTAGCCGCCGACGGCATATGAAGAGTCGGATGAGGTGGAGAGATTAACGCGCATCAGGTGTAAACGTTCCGCTGGCCGGCGATAAGGCAAACGAGGGTGAGAATGATTGATTTCGCACGCAAACCAGGACGGCAGCAGGCCGTAAAGCTGAACTTCTTCGAGGTGATTCTTCGCCGCCTGTGCTACCTGCTGGCGCAAAAGGGGAATCCAGATGTGTAACTCAACGAAATGCGGGTACTGCGGCAAGCCGGTTGAACCGGCGAAAGTAGTCAAAAGTACCCTTCTCTATTGCAACGGCGCACAGCTGGCGCGCAAAGAAAAAGAATACTGCTCTGAACGTTGTGCTTCGTACGACCAGATGGCCCACGAGGCATAACGTAAAAGCCGCGCAAGGCGGCCCGTACGTCCGGTGCTCCCGACCAAAGTTACACCGGAAAACTACTTAAAAAACCAAAGTTCACCCAATGGGCGCTATCTCTGGCCCGGGGATCTTACATCCAAAAAAGAGGATCTCACATGGAATTTTTCTATGTAGTTAAGGCTACGCAGAAATCTGGCAAAGAAGACGCAGTGATTTGGTTCACTGCTAAATCAGAAGCCCGTGCCAACCTGCAGCTCGATGTCGAGCTGGAAGATGCAGGTATTGAAACCGGCCGCGGTAAGGATTACTCAAAACCGGTTCGCACCGATTTCCCTGTTTACAACGATCTGCCTGAAGAAAGCACCATGGATTACACCTGGTGCAAACGCTACGAACTGCAGGACGATGGACGCACCTGGCTGCCAAAGGCTGGTGCTGAGTCGACTGGTGACGTGGACAGCACTGCCGCACCGGAGACGGCCGTTAAAGTCGAAACTACCGTCGATAGTGTCCCACTTGAAAACCGCACTCCAGCGGTCCGTTTCGCCGTACACCTGACCAGCGACAAATACCAGTCACACATCACTAAAGAGCAGCAGCTGGCTGCCAGTGAAATGTCACTGGATGAAGGCAACACTTATCTCCAGAACCTGCTGCTGGCGAAGAACGACATCCCTGAAGTTGCCGAACTCAGCCTGAATGCTGAGTGGAAACTGGTTCAGGCGATAAAGCAGGTTTTCGCGCCAGATAAAACGCACGAAGCTGAAGTTATCGCTGCATTCATGGCTGACTGGGCGAGAGCAGATGCCGGCAACCGCAATCAGTTAGTTGAAGAGTGGAGAAGCGGAAAGCTTAATCTTCTCAAATCAGAAATCACCAGCGAGACCGGCATTACAACCGATCAGGCTCCAGAACCTGATAACGGTATCCAGATTGACGAGAATGATGACGAAACCACTCGTTATCCGGTCGTTAGGATGCCCTTCCGCAAGCAGCTACTCGCCCAGTTCACCGCGGACGAACTGCGTCACCACTTAACCCGCGAAGAATACGAAGGTATCAGCGCGCTGGAGATGGACACTGACAACAGCTATGTCCAGAACCTTCTGCTGGCGGCAGAAAACTGCGAAGAGGTTAAGGGTTACGATACCAAAGACCTTTGGCGGTATACCGACGCCATTCGCAAAGTGTTCAGCCAGGAGAAGCGTCACGAACTCGCTTTGGTTCTCCAATTCACCAGAGTCTGGGCTGCGACTGATTACATTGACCGCGGCACCCTGGTGCGCGAATGGACGGATGGTAATCGCATTTCTGAAGTAGGCTCTCCTGCACCTTTAGAACCAGCAAAGCCAGAAACTACAGAATCCTATAAACGAGCTGTTGCCCAGAACATGGCGAACCTGAGCATTGAGATCGCGATTGCTCTGCTGTACCCAGATGCAGTACCGGGACAAATCAACCGTAC
GCAACTCCTGGCCGCCAAAGAACTCGCTGACAAAAAAGATGAGTCGCACACCAAGGCTCTCAAGGTTCTTGGTAAAACTACCGACATCCTCGACTACGACGCCAACAGTATTTTTGGAGTTACCCGCGCTATTTCATGGTCTGGAGAAGAAAGCACAACCGAACTGCGTAGCCAGGTGCGTGAGTGGTTCACCGCGAACGGCATCTATGAAAGCGGTGAGCGCTCTAAAGGCTATCCAGAATGGAGCGAAGACTCCCGCGCGGTTCGTCATTCCACAGTGGAAGAACCAAGTACTCCAAGCCAGCCAAAGGTCGCAAACCTTGGCAGCGGCGTGTTCTCCATCGATGGTCTGATGGTTGGAAATACCGCCCCGGTCATCGATATCCCCTCAAATGAAGTCGAAAAAACGGAAAACACAGCGGAGACCACCAGCGATGTGCAGATGGAAGCGGCTAAGCCAGAGAAAGACGAAGATGTTGGTTCGGTACCACCGGGCGAAAGCACTGATGCAGCTAATTCGCAGACAGATTCCATAGCGCCAGAAGAGCAGCAGTCAGAGCCAGTAATCGAATACCCGGCTTACTTCGAGCCTGGCCGATACGAAGGCCTACCGAATGACGTTTATCACGCAGCAAACGGTATTAGCTCAACCCAGGTGAAAGATGCCCGTGTCAGCCTGATGTACTTCAACGCACGCCATGTGGCTAAAACCATCCCGCGCACAGCATCGAAAGTGCTGGATATGGGAAATCTGGTGCACGCCCTTGCACTGCAGCCGGAAAACCTCGAAGCAGAGTTTAGTGTAGAACCTGAGATCCCAGAGGGTGCTTTCACTACCACCGCAACCCTGCGCGAGTTCATTGACGCGTACAACGCCAGCCTGCCAACGCTGCTGAGCGCAGACGAGATTAAAGCGTTGCTTGAAGAACATAACGCGTCCCTTCCCGCTCCGGTGCCGCTTGGCGCCAGCCTGGAAGAAACGGCTCAAAGCTATATGGCTCTCCCTGCTGAGTACCAGCGTATTGAAGAAGGCCAGAAGCAGACAGCAACGGCAATGAAGGCATGTATTAAAGAGTACAACGCCACCCTGCCCGTGCCGGTTAAAACCAGCGGCAGCCGTGATGCGTTACTTGAGCAATTAGCAATCATCAATCCTGATTTGGTCGCACAGGAAGCGCAGAAACCGACACCGCTGAAAGTGTCTGGCAGCAAAGCAGACATGATCCAGGCAGTTAAGTCGGTTAAGCCCGATGCCATATTCGCCGACGAACTGCTGGATGCCTGGCGCGACAACCCTGGCGAAAAAATTCTGGTTACCCGCCAGCAGCTGGCAACGGCACGGGCAATTCAGTCTACGCTCCTGGCGCACCCGACCGCTGGCATGCTGCTGACACATCCAAGCCGTGCCGTTGAAGTGAGCTACTTCGGCTTTGACGAGGAAACGGGCCTAGAAGTTCGTGTACGCCCTGACCTCGAGATTGAGCTGGACGGCGTGCGCATCGGTGCCGATCTGAAAACCATCAGCATGTGGAATGTGAAGCAAGAAAGCCTTCGCGCCAGGCTGCACAGGGAAATCATAGACCGTGACTATCACCTCAGCGCGGCAATGTATTGCGAGACCGCGGCGCTGGACCAGTTCTTCTGGATTTTCGTCAACAAAGACGAGAACTACCACTGGATCGCCATCATTGAGGCATCCACCGAACTGCTGGAACTGGGCATGCTTGAGTACCGCAAAACAATGCGCGCCATCGCTACAGGATTCGACACGGGCGAATGGCCAGCACCGATCACTACCGATTACACCGATGAATTGAACGACTTCGACCTGCGCCGCCTCGAAGCGCTGCGCGCTCAGGCTTAAGGGGGATTTATGCATAACACAAACGTTACCGTTGCTGACCAGAACACCGTTATTAACTCCAACGTGGCTTTGTTCGATTCCCAGTATCTGAACGCCATCAGCACGTTCGCGCAGATCATGGCCCAAGGCACCGCTA
CTGTTCCTAAGCACCTGCAGGGCAATCAGGCCGACTGCATGGCTGTGGCGATGCAAGCGGCACAGTGGCAGATGAATCCCTTTGCCGTGGCGCAGAAGACGCACCTGATTAACGGTGTGCTCGGGTATGAAGCTCAGCTGGTTAATGCCGTCATTTCACGAAGCGGCGTGCTGGCCAGCCGCTTTGAATATGAATGGTACGGGCCATGGGAAAAGGTCGTTGGAAAATTCCACATCCGTAAAGGCGACAAAGGCGAGTACCGCGTCCCGGGCTGGACCCTGGCTGACGAGGCTGGGATCGGCATCATTATTCGCGCAACGCTTAAAGGTGAAGATCAGCCAAGGGAACTCGATTTGCTGTTGGCTCAGGCCCGAACCCGAAATTCTACCCTCTGGGCTGACGACCCTCGCCAGCAACTGGCATACCTGGCCGTCAAACGCTGGGCGAGACTGTTCTGCCCGGATGTGATTCTGGGCGTTTACACCCCGGATGAGCTCGATGATCGCCGTGAAGAACGAGAGGTAAACCCCGCACCGGCGCAGCACGTAAGCCTTGCAGACATTTCAGGTGACAACGTCACTACGACTCAAACGGCTCAGGAATCAGCTCAAAACATCGATGCACTTGCTGATGATTTCCGTGATCGCATCGAGGCGGCTCAGGATGTGGATAGTGCTAAAGCTCTGCGCGCAGATATTGAAACCGTCAAAGCAACGCTAGGTTCCGCCCTGTTTACCGAGCTTAAGAACAAAGCAGTGAAGCGCTACTACCTGGTTGATGCTCGCAATAAGGTGGAAGCGGCGATCAACTCCCTGCCATCTCCGGATGAACCCGACGCGGCAGAACGCTTTGGCGAAGCTGAACGCGTGCTGGCATCTTCAAAGCGTCACCTGGGCGATGAACTGCATGAGCAGTTCAGCATCACCCTGGCGGATATGAAACCGGAATACGTGAACTAACGAGATTGGGAGGGGAATCCCTCCCTCAAGAGAAGAAATGCGACTGATTAATCGCAGTAAGCAATCCCCTTTGGCTCGCCAGGCATGTGATGCCGCTCTCGCGAAGCACGTCGAAACTTACGGTGAATTCGCCAGACAGAAAACCAAGACCACATACACCGTAGTGGTTGATGGAATAAAGGTAACAGTGGAAGTCGTTAACCGCCGGGCTAGCTACGTTGCGACAGCCATGAATGGTGCCCGCAGGCTGCGCAATCTGCCGGGACAATGCAACTGAGAGGTGCAATATGAATGAAACAACTTATACGAATGTTGATATCTTGATCACCAGTGAAGTTTTATCAAGATACAAAATTTCGCGGAGCACGCTGTACTTTTGGAGCACCCCTTCCAGAATGCCGGCATATTTTTCACAACCATTTCCGAAGCCGAAGATAAATGGAAGCCCGAAGCGCTGGAGACTTTCAGACCTCCTGGAGTGGGAAGAAAAAGTGGGCATCAAACCAGAGGATGACCAACCAGTTTCTCCAAATGGTTCTGCCATACCGCAAGCCAATGGCGCTGATCATCAATGTAGTCATGCAGGTTATACCTCGCCATGACACCTGCCATATGATGGCCAAGCAGTTTCTCAACAACATGTGGCGGCGCACCTAATTCAGAAAGTCGCGTCGCCACGGTTCGTCGAAGGTCATGGAGGGACCACGGCTTCATTCCTGTCTTGGAAATTATTTGGGCAGAAAATAACGCAACGTTAGGCTGAAGAGGCGGCCTGTCATCCTCTGGCCCCTTATAGCGTGACAACGTAACAACATGCTTTGACATTGATGTTTCTTTATAAGATTTCATCAGCTTTACAACAGCATCCGGTAACGCTCTCCTGACGGATTTACCTGTTTTATAATCACTTGCTGGAATGGTCCATGTTTGGTCCATAAGATCGAACCATTCCCACTTTGCCGTTCTTATTTCCGTACTTCTACAACCGGTCAGAATGAGAAACTTCATGATCAGCTGTTGTCGTTCCCTCATTTCAGGCAA
AGCATTCCAGACAGTTTTGATCTCCTCGTCACTCAATCTTCGGTCTTTAACTGCCGCGGTGAGCCCCACATCAGATCGCCTGAGACTCTCAATTGGGTTTAGGTTGATTATTCCTCTGTTAGAGCAGAAACGAAAAACACGCTGCATCAACCCTAACATTTGCCCTGTAACCACCCTTCTTCCCATACCGTCGAACAGGCTAAGCCAATGCGCCTTTGTGGTCTGATCTACGATCATATTCCCCAGCACAGGATTGATATGGTTGTTGAAGTCCCGGCGGTTCACGTGGATTTTTACCAGGCCTTCAGGAATGCAGTAATGCTTTTCCCAATAATCAAAGGCTTCTTTTACGGTAATCGCATCAGTTCTTTTCTGTTTCTCCAGTACGGTCTGTCGTCTCGGATCGACCCCCTCTTTAAGCCAGATTCTGAACTTCTGTCTACGCTCTCGCGCCTGAGCTAAGGATATGGTCGGATAATCGCCGATCGTCAATTGAGCCGCTTTTCCGTTCCATCTGTAGCGGTAAAAGAATGTTATACTGCCGGATGTAGACAATCTGACATTCAAGCCGTGAGCGTCTGAAATGACCTCGATTTTGTCTCTCTTTTTGCCAAGAGCTTTTCTCAGTTTTGTATCGGTTAGCAATGTGTACACTCCGGGAAAAGATATACACAATAGTGTACACAT
Protein sequences of DBSCAN-SWA_5 >NC_018079|2723991:2734602|2732911_2733151_+|WP_014832196.1|DBSCAN-SWA MRLINRSKQSPLARQACDAALAKHVETYGEFARQKTKTTYTVVVDGIKVTVEVVNRRASYVATAMNGARRLRNLPGQCN >NC_018079|2723991:2734602|2723991_2724678_-|WP_014832185.1|DBSCAN-SWA MKNLAESIRNFDREQARRVAHNMPEQYTEREQTQQVAQIINGLFVQLAAAFPASLVNRSQEDVNEIRRQWVLAFKENGINTIEQVEAGMRMVRRQERPFLPSPGQFIKWCREGRCVLGITTADVMAEYWKWRKLVFRYPSSELYPWPKPVYYHICLELRRRGTDGQLSHKELEREASDILDMWEKRVLAGKPIPPARRALAAPVPTKGPTPVEILKAKHERMKAGGRG >NC_018079|2723991:2734602|2726547_2726937_+|WP_014832189.1|DBSCAN-SWA MNRKEPNQSLISRLTELNNKGFSKTEMARVANVSKQAVTGWFRTGKMSKESALALADAAGVSVPWLLGEEVGEKDGLKADEQRLLELYRQLPEEEQKNMLRVFSIRLKELDELYAKYMSRRIKGDNGVN >NC_018079|2723991:2734602|2733372_2734602_-|WP_043951646.1|integrase|DBSCAN-SWA MCTLLCISFPGVYTLLTDTKLRKALGKKRDKIEVISDAHGLNVRLSTSGSITFFYRYRWNGKAAQLTIGDYPTISLAQARERRQKFRIWLKEGVDPRRQTVLEKQKRTDAITVKEAFDYWEKHYCIPEGLVKIHVNRRDFNNHINPVLGNMIVDQTTKAHWLSLFDGMGRRVVTGQMLGLMQRVFRFCSNRGIINLNPIESLRRSDVGLTAAVKDRRLSDEEIKTVWNALPEMRERQQLIMKFLILTGCRSTEIRTAKWEWFDLMDQTWTIPASDYKTGKSVRRALPDAVVKLMKSYKETSMSKHVVTLSRYKGPEDDRPPLQPNVALFSAQIISKTGMKPWSLHDLRRTVATRLSELGAPPHVVEKLLGHHMAGVMARYNLHDYIDDQRHWLAVWQNHLEKLVGHPLV >NC_018079|2723991:2734602|2731788_2732874_+|WP_014832195.1|DBSCAN-SWA MHNTNVTVADQNTVINSNVALFDSQYLNAISTFAQIMAQGTATVPKHLQGNQADCMAVAMQAAQWQMNPFAVAQKTHLINGVLGYEAQLVNAVISRSGVLASRFEYEWYGPWEKVVGKFHIRKGDKGEYRVPGWTLADEAGIGIIIRATLKGEDQPRELDLLLAQARTRNSTLWADDPRQQLAYLAVKRWARLFCPDVILGVYTPDELDDRREEREVNPAPAQHVSLADISGDNVTTTQTAQESAQNIDALADDFRDRIEAAQDVDSAKALRADIETVKATLGSALFTELKNKAVKRYYLVDARNKVEAAINSLPSPDEPDAAERFGEAERVLASSKRHLGDELHEQFSITLADMKPEYVN >NC_018079|2723991:2734602|2726986_2727403_+|WP_014832190.1|DBSCAN-SWA MDAKISFLFPYTTVGINQHNFSPVLTFECDVLPVKAVLQIAFYFICLKNKENYRLRFDILRDGTSVIDDSWDRDKIFMSEDPSSEPDKVAVGLNIDLPSVPFDMEGIYQISAELFYPQDSKTPIHQNDAFFKVTKKAE >NC_018079|2723991:2734602|2725676_2726228_-|WP_014832187.1|DBSCAN-SWA 
MQTVSFQQSSRASSNSLIFQCHQSESAVKDIDHRDICSAVRAWAAVEGRVTVALQIQEAAEELQVVGVDFSGQADVWNVKLFRWLDNKEDSAAYRRNIEQLVPAIMYVLPIRYRDRVVKNDTFAYRMARLEKEVSEAKQALMLDAPKKEKLKELGEGIFEMFRVDPDLTAPLLAMVTTMLGAM >NC_018079|2723991:2734602|2728605_2731779_+|WP_014832194.1|DBSCAN-SWA MEFFYVVKATQKSGKEDAVIWFTAKSEARANLQLDVELEDAGIETGRGKDYSKPVRTDFPVYNDLPEESTMDYTWCKRYELQDDGRTWLPKAGAESTGDVDSTAAPETAVKVETTVDSVPLENRTPAVRFAVHLTSDKYQSHITKEQQLAASEMSLDEGNTYLQNLLLAKNDIPEVAELSLNAEWKLVQAIKQVFAPDKTHEAEVIAAFMADWARADAGNRNQLVEEWRSGKLNLLKSEITSETGITTDQAPEPDNGIQIDENDDETTRYPVVRMPFRKQLLAQFTADELRHHLTREEYEGISALEMDTDNSYVQNLLLAAENCEEVKGYDTKDLWRYTDAIRKVFSQEKRHELALVLQFTRVWAATDYIDRGTLVREWTDGNRISEVGSPAPLEPAKPETTESYKRAVAQNMANLSIEIAIALLYPDAVPGQINRTQLLAAKELADKKDESHTKALKVLGKTTDILDYDANSIFGVTRAISWSGEESTTELRSQVREWFTANGIYESGERSKGYPEWSEDSRAVRHSTVEEPSTPSQPKVANLGSGVFSIDGLMVGNTAPVIDIPSNEVEKTENTAETTSDVQMEAAKPEKDEDVGSVPPGESTDAANSQTDSIAPEEQQSEPVIEYPAYFEPGRYEGLPNDVYHAANGISSTQVKDARVSLMYFNARHVAKTIPRTASKVLDMGNLVHALALQPENLEAEFSVEPEIPEGAFTTTATLREFIDAYNASLPTLLSADEIKALLEEHNASLPAPVPLGASLEETAQSYMALPAEYQRIEEGQKQTATAMKACIKEYNATLPVPVKTSGSRDALLEQLAIINPDLVAQEAQKPTPLKVSGSKADMIQAVKSVKPDAIFADELLDAWRDNPGEKILVTRQQLATARAIQSTLLAHPTAGMLLTHPSRAVEVSYFGFDEETGLEVRVRPDLEIELDGVRIGADLKTISMWNVKQESLRARLHREIIDRDYHLSAAMYCETAALDQFFWIFVNKDENYHWIAIIEASTELLELGMLEYRKTMRAIATGFDTGEWPAPITTDYTDELNDFDLRRLEALRAQA >NC_018079|2723991:2734602|2727451_2727829_+|WP_014832191.1|DBSCAN-SWA MRLPAGRYNENNASFEHDGGNGGGGNMLEARVAKLEADVENIKVNLSEARMDIRELTKSSASIKTDISTVLQKLKDIDEKLSTKASKDFVDSKAGDIKVWMLGLLLLSIAMPIIMFLLNLYLKKP >NC_018079|2723991:2734602|2724674_2725592_-|WP_014832186.1|DBSCAN-SWA MSNTAEIINFPNRTEQPGGRMADLSNGYTKVANEIQQLKPRLRMSGREWQCFEAVIWLTYGWNKKQDRVTNTVIAELTGLSDSHVSDALKSLAERKIIFNQKQGVMKTVGINTDLSAWILDKPKTGKTFPKSGKVLPKTGKVFPETVDTQDYNKNNIKISSSRNSDESRNQKTQKFLSRHPEAAAGIYTPAGKSWGSADDLKAARWIYDRLLTVNASLSEPNWAEWANTIRLMRVQDNRTHYEICDLFQWANRDEFWRDNILSPSSLRKQWDQLTTKRLRATGTVKLTRGGIDLHNTDWIDGVLE >NC_018079|2723991:2734602|2728300_2728465_+|WP_014832193.1|DBSCAN-SWA 
MCNSTKCGYCGKPVEPAKVVKSTLLYCNGAQLARKEKEYCSERCASYDQMAHEA >NC_018079|2723991:2734602|2726230_2726449_-|WP_014832188.1|DBSCAN-SWA MTGIELAIFRSGSASALGTLIGVSKMAVSLWRRKGIPAERVLPIYGVTGVTPHELRPDLYPNPTDGLPNQEP |
12 | Enterobacteria_phage(81.82%) | integrase | attL 2726686:2726699|attR 2739597:2739610 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_6 |
3233675 : 3239949
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NC_018079|3233675:3239949|DBSCAN-SWA CTCAGAGGGTTTTTAACGCAGGTTGCTTTTCGTCTTTTGCAGAAAGCATCGGTTCTTTATCTAACGGCCATTCAATCTGAAGATCAGCATCGTTCCAGATCAGTCCTCTGTCGCTTTCAGGATGGTAGTAATTTGTCGTTTTATATACAAATTCTGCGGTTTCGCTTAACACCAAAAATCCATGAGCAAAACCTTCTGGGATCCACAACTGCCGTTTATTCTCAGCTGAAAGGTTCACCCCAACCCATTTCCCAAAAGTCGGTGAGGATTTGCGAATATCAACTGCAACATCAAATACTTCACCAGCAACACAGCGTACCAGTTTTCCCTGTGCATACGGTTCCAGTTGATAATGCAACCCGCGAAGCACGCCTTTGCAGGATTTGGAGTGATTATCCTGCACAAACTCTACTTTACGACCTACAGCCTCTTCAAATACTTTTTGACTGAAACTCTCAAAGAAAAAACCACGATCGTCACCAAAAACTTTTGGCTCAAAGATGAGTATATCTGGGATATCCGTTTTAATAACATTCATGTCTTAGTAACCTTTAATCATCTTAAGCAGATATTGTCCATAGGCATTTTTTTTCAATGGTTCAGCCAGTTTCTTTACCTGCTCTGCATTAATAAAACCTTTGCGATAAGCGATTTCTTCAGGACAAGATACTTTGAGTCCCTGACGCTCTTCAATTGTCGCAATAAAGTTACTTGCTTCTATAAGACTCTGATGAGTACCAGTATCGAGCCATGCGTATCCACGTCCCATCATTGCCACTGAAAGACGACCTTGTTCCAGATAAATACGATTAATATCAGTAATCTCCAACTCACCGCGCGCAGAAGGTTTAAGGTTCTTCGCCATTTCCACTACATCATTATCGTAGAAATAGAGCCCAGTTACAGCATAGTTGCTTTTAGGTTCCAGCGGCTTCTCTTCCAGGCTGATGGCCGTACCATTTTTGTCGAACTCCACCACGCCATAACGTTCAGGATCATTGACGTGATAAGCGAATACCGTTGCACCATTGCCTTTACTCACTGCGGCTTCCAGCTGTTTAGGCAGGTCATGGCCGTAGAAAATATTATCGCCCAGCACCAGCGCACAGCTGTCGTTACCAATGAACTCTTCGCCAATGATAAAGGCCTGAGCCAGTCCATCAGGGCTTGGCTGTACTTTATACTGAAGGTTTAACCCCCACTGACTGCCGTCGCCCAGAAGCTGTTCAAAGCGCGGAGTATCCTGTGGCGTACTGATGATCAGAATGTCTTTAATCCCTGCCAGCATCAGCGTTGACAACGGGTAGTAGATCATCGGTTTATCATAAATGGGCAGCAGTTGTTTACTCACCGCCATGGTCACTGGGTAGAGACGAGTGCCGGATCCACCAGCAAGAATAATACCTTTACGCGTAGCCATTATAATTTCTCTATAAACACAAAATGCCCGAAAACGGGCATTTATTTAACGATTAAAGTGATTGAGTTGTAAAAAGCTCAGCCAGCATGCGCTTAACACCAACATCCCATGTGGGCAATACCAGACCAAAGTTATGCTGGAATTTCTCTGTGCTTAACCGCGAATTCTGCGGCCGACGAGCAGGTGTCGGGTAAGCATTTGTCGGCACTGCGTTGAGCTTTTCAATCGCTAGTTCAATGCCTGCCTTTCTGGCTTCATCAAATACCAGCGCTGCATAGTCATACCAGGTTGTCGTATCGGATGCCACCAGGTGATAAAGACCAGCCACTTCTGGTTTTACCATCGCCACCCGAATAGCATGCGCGGTACAATCTGCCAGTAATTCTGCACCTGTAGGTGCACCAACCTGATCGTTAATAACAGAAAGTTCTTTACGTTCTTTCGCCAGACGCAGCATGGTTTTCGCGAAATTGTTACCTTTACCCGCGTAGACCCAGCTGGTACG
GAAAATGAGATGGTTAGGGCAATGCTCTTGCAGCGCTTTTTCGCCATCAAGTTTAGTCTGCCCATAAACATTCAAAGGCGCTGTTGTATCCGTTTCACGCCATGGTTTATCCCCATCGCCCGGAAAAACGTAATCTGTCGAATAATGAACAACCCAGGCCCCAATTTTTGCAGCTTCCTTCGCAATAGCTTCAACGCTGGTTGCGTTCAGTAACTGCGCAAACTCTGGCTCAGATTCTGCTTTATCAACAGCGGTGTGAGCAGCTGCGTTAACGATCACATCAGGTTTGATGGCACGAACCGTCTCAGCAATCCCTTCCGGATTGCTAAAATCCCCACAGTACTCTTTGGAATGCACATCAAGTGCAATCACGTTCCCTAACGGTGCGAGTGCGCGCTGGAGTTCCCAACCCACCTGCCCCGTCTTGCCGAACAGAAGAATGTTCATTATTGACGTTCTCCGTAATTCTGTTCGATCCAGGTCTTATAGGCACCGCTTTTAACGTTGTTAACCCAGTCCTGGTTGCTCAAATACCATTCCACGGTCTTGCGGATACCGCTTTCAAACGTCTCTTCCGGCTTCCATCCCAACTCGACGCCAATTTTATGCGCATCAATCGCGTAGCGGCGGTCATGCCCTGGGCGGTCAGCAACATAGGTAATCTGGTCGCGATACGAACCCTCTTTCGGCACAATCTCATCAAGCAGATCGCAAATAGTATGCACAACATCCAGATTCTGCTTTTCGTTATGACCACCGATGTTATAAGTTTCTCCCGGTTTACCCTGAGTTACGACGGTGTAAAGTGCACGCGCATGGTCTTCAACGTACAGCCAGTCACGGATCTGGTCGCCTTTGCCATAAATTGGCAGCGGTTTACCGTCCAGGGCATTCAAAATCACCAGCGGGATCAGTTTCTCAGGGAAATGATACGGGCCGTAGTTATTGGAGCAGTTAGTGACGATGGTCGGGAATCCATAGGTACGCAGCCATGCGCGAACCAGGTGATCGCTGGAGGCTTTAGACGCAGAATATGGGCTGCTTGGCGCATAAGCTGTGGTTTCGGTAAAGAGCGGCAGAGCGGTAGAAGCCGGATGCTCGTCCGGGTGCGGCAAATCGCCATACACTTCATCGGTAGAGATATGATGGAAACGAAACGCCTTCTTCGCCGCATCATCTAAAGTGGACCAGTAAGCACGAGCCGCTTCCAGCAATACATAGGTACCGACAATATTGGTTTCAATAAACGCGGCCGGGCCGGTAATGGAACGATCGACATGACTCTCAGCCGCCAGGTGCATAACGGCATCCGGCTTATGCTCAGCAAAGATACGATCCATGGCACTCTTGTCGCAAATGTCCGCATGCTCAAAGACATATCTGGCGCTATCACTCACTTCGGTAAGCGACTCAAGGTTGCCAGCATACGTCAGTTTATCAACATTGACGACTTCATCCTGGGTATTTTTAATAATATGTCGCACCACTGCTGAGCCGATAAAGCCGGCTCCGCCCGTCACAAGAATTTTCACGTTATCTATTCCGTCTAATGGATATGTCGAGATATTGGCGTTCTGAAGCTGGATAATTAATATCGGCTTCAGAAATTTTGCACGCTACCGCCCCTGGCTTAACAGCTACCAGTGCACTGAGCGAAGTTTGAATAAATGATTTTTCTGGTCTGGATTCAGACAAAAACTGGCGTTAATTGTCGTCCTAAATGACCTCAGAAACAAGCACAAATCGTCACAAATCTGACCAGATTTAAATCAGATAAATCCATGTTATCATTAGAATTAATAACGAAAAACGGCAGTCATCATTTTGGTTATGTTGCAACATACCCAATGATGACTGCCGCTTGCGTAAATCAAGTAAGATAAAATCAGTCGTTAGCTAACAACTTCTCAATCCGGCTTCTGAACTTCGCCCCTTCCTTCAGGTTACGCAGTCCATAGTTCACAAATGCCTGCATATAGCCCAGCTTTTTACCACAGTCG
TAGCTCTCACCGGTCATCAGCATCGCGTCAACAGACTGCTTTTTCGCCAGCTCGGCAATCGCATCGGTCAACTGGATACGGTCCCAGGCACCTGGTTCGGTTTTTTCCAGCTCGGCCCAGATATCCGCGTTCAGGACATAACGGCCAACCGCCATCAGGTCAGAATCCAGCGTCTGCGGCTGATCCGGTTTTTCGATGAACTCAACGATGCGGCTCACCTGCCCTTCTGTCTCCAGCGCTTCTTTAGTCTGGATTACAGAGTACTCGGACAGATCGCCTTTCATGCGTTTCGCCAGCACCTGGCTGCGGCCTGTTTCGTTGAAACGCGCCACCATCGCCGCCAGGTTGTAGCGCAGCGGATCCGCAGAAGCGGTATCGATGATGATATCCGGCAGGACAACGATGAACGGGTTGTCGCCCACGACCGGACGAGCACACAGGATGGAGTGACCCAGGCCCAGCGGTTGCGCCTGACGCACGTTCATGATGGTCACGCCAGGAGGACAAATAGACTGCACTTCGGCCAGCAGTTGACGTTTAACACGCTGCTCCAGCAGCGCTTCGAGTTCGTAAGAGGTGTCGAAGTGGTTTTCTACCGCATTCTTGGACGAGTGCGTAACCAGAACGATTTCTTTGATCCCTGCAGCAACAATCTCGTCGACGATGTATTGAATCATCGGCTTGTCGACGATCGGTAGCATCTCTTTTGGAATGGCTTTTGTGGCTGGCAGCATATGCATGCCCAGTCCGGCTACCGGAATGACTGCTTTCAAATTAATCATTATTTCTTCCACCTTAAAATGGTTGACGAATTATAGCTCTTAAACCTGTTTTCGCCAGCATGATTTGCCGTAATCCTGAAGGGATATTACGCCTTAGCATCACAAATTACAGTAGTTGTAATCTGAATAGAATGCGATTCCATGCCAGGAATAGTCGCAAAAAAGCCCGCTGACGATTATCTGTGCGGAAGTGCGAAGTTCAGCCGTTCCGTATTCACCACATGCTGATCCACCCGGTCAATATCCACCGAGCTCTGCCCCTTCTCATTCACTGCTTTGATGTTTGCCAGCGACAGCAACGTCTCGTCTTTCGCCATGAATTTACCCCGTACATCCTTACGCAGGTCGAAGTTCAGCTTCAGGGCCGGCCCGATTGCCGCTTCCTGCATCACATTGATATTGCGTAAGAAAAGATGCTGAGGTTTGTTATGCAGCTCTAACGTCGCACGCTGCATCTCGACGTTGGTGATGGCCACAAACGAGGTCGCATTACCGGATGAAATTTGTATGCCGCGCAACTTATAGGCAAGCTGGCTATTATTCAGACGAATATCGTTAAGCTTGAAATTCTGCGGAATCGAGAGGTAGTCGCCTTTAATCACACCGTAACCGATTAACATTCCGGCACTATTGACCATATCGACATTATCAATAACGAAATTATCACAGCCATAAATTGCTACGGTGGCATTGTCTATTCCCGCCTTTTTACTGAAATCAGGCGTAATATTTTTGGCCTTAATATTGCGAATAATAAAGTGCTTACCGTTCTCGACGTGGACCAACTGACGGCAGTTGCTGCCGGTGATATTGGCCACCACAAAGTTCTTCACGGTCTGCTTTTCCGGGTAGTCGTTGTCGTAGGTACTGCCCGCGAGGCCAATACCAATCCCCCAGTTAATCTTCCCGTTGGTACAGTTGATGTTGTCGATCACATGGTCGGAGATCAGGATATTTCGATCATTGATGGCCACGTTCCACTCGATCGCATCGCCCTGCAGATGGCTGAATGTGCTGTTGGTGATGCGGGCGCCGTCCACCTGGTTATGAAACCCCTGGCGCAGAATGGCGTAATTGGCCTGGCTCACGCGGATGTTATCAATAACCAGATTACGCATCACGCTGGGCTTTTTGCCGCCAATGTAAATTTGCGTTACCGGGCCAAATCCGCTCATCGCCAGCCCTTTGATCACACAGTCTGAGCCGCGAACGTCAAGG
GTGATATTTTCGGTACGACCTTCGCCCTCACCGATGACTTTGCTGCCCTCCTGCAGGACAAAGCGCCCCCGGCCATTGCCGGTTAACGCCCCACGAATCAGCAGCGTTTTGCCGTCAGGAATGAAAATGCCCGTGTTGATGTTTTTGCAGGTGAGCCCTGCAGGGACAACGACGGTGTCGCCTTCGCTAAACGCTTGCTTAAAGGCGGCGATCCAGTCGTTGTTGTTGTACTGATTAACAGAGACCGTCTTGCCGGTTGCCGCCCGCGCAACGCGGGACGACAGCAGCGGCATTGCCGCAAGGACGGACAGAGAAGAGACAAAGCTGCGTCGGGTAATCTTTTTCAGCAT
Protein sequences of DBSCAN-SWA_6 >NC_018079|3233675:3239949|3236045_3237131_-|WP_014832611.1|DBSCAN-SWA MKILVTGGAGFIGSAVVRHIIKNTQDEVVNVDKLTYAGNLESLTEVSDSARYVFEHADICDKSAMDRIFAEHKPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEAARAYWSTLDDAAKKAFRFHHISTDEVYGDLPHPDEHPASTALPLFTETTAYAPSSPYSASKASSDHLVRAWLRTYGFPTIVTNCSNNYGPYHFPEKLIPLVILNALDGKPLPIYGKGDQIRDWLYVEDHARALYTVVTQGKPGETYNIGGHNEKQNLDVVHTICDLLDEIVPKEGSYRDQITYVADRPGHDRRYAIDAHKIGVELGWKPEETFESGIRKTVEWYLSNQDWVNNVKSGAYKTWIEQNYGERQ >NC_018079|3233675:3239949|3238557_3239949_-|WP_014832612.1|DBSCAN-SWA MLKKITRRSFVSSLSVLAAMPLLSSRVARAATGKTVSVNQYNNNDWIAAFKQAFSEGDTVVVPAGLTCKNINTGIFIPDGKTLLIRGALTGNGRGRFVLQEGSKVIGEGEGRTENITLDVRGSDCVIKGLAMSGFGPVTQIYIGGKKPSVMRNLVIDNIRVSQANYAILRQGFHNQVDGARITNSTFSHLQGDAIEWNVAINDRNILISDHVIDNINCTNGKINWGIGIGLAGSTYDNDYPEKQTVKNFVVANITGSNCRQLVHVENGKHFIIRNIKAKNITPDFSKKAGIDNATVAIYGCDNFVIDNVDMVNSAGMLIGYGVIKGDYLSIPQNFKLNDIRLNNSQLAYKLRGIQISSGNATSFVAITNVEMQRATLELHNKPQHLFLRNINVMQEAAIGPALKLNFDLRKDVRGKFMAKDETLLSLANIKAVNEKGQSSVDIDRVDQHVVNTERLNFALPHR >NC_018079|3233675:3239949|3237484_3238381_-|WP_013097865.1|DBSCAN-SWA MINLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHSSKNAVENHFDTSYELEALLEQRVKRQLLAEVQSICPPGVTIMNVRQAQPLGLGHSILCARPVVGDNPFIVVLPDIIIDTASADPLRYNLAAMVARFNETGRSQVLAKRMKGDLSEYSVIQTKEALETEGQVSRIVEFIEKPDQPQTLDSDLMAVGRYVLNADIWAELEKTEPGAWDRIQLTDAIAELAKKQSVDAMLMTGESYDCGKKLGYMQAFVNYGLRNLKEGAKFRSRIEKLLAND >NC_018079|3233675:3239949|3235146_3236046_-|WP_014832610.1|DBSCAN-SWA MNILLFGKTGQVGWELQRALAPLGNVIALDVHSKEYCGDFSNPEGIAETVRAIKPDVIVNAAAHTAVDKAESEPEFAQLLNATSVEAIAKEAAKIGAWVVHYSTDYVFPGDGDKPWRETDTTAPLNVYGQTKLDGEKALQEHCPNHLIFRTSWVYAGKGNNFAKTMLRLAKERKELSVINDQVGAPTGAELLADCTAHAIRVAMVKPEVAGLYHLVASDTTTWYDYAALVFDEARKAGIELAIEKLNAVPTNAYPTPARRPQNSRLSTEKFQHNFGLVLPTWDVGVKRMLAELFTTQSL >NC_018079|3233675:3239949|3233675_3234212_-|WP_014832608.1|DBSCAN-SWA MNVIKTDIPDILIFEPKVFGDDRGFFFESFSQKVFEEAVGRKVEFVQDNHSKSCKGVLRGLHYQLEPYAQGKLVRCVAGEVFDVAVDIRKSSPTFGKWVGVNLSAENKRQLWIPEGFAHGFLVLSETAEFVYKTTNYYHPESDRGLIWNDADLQIEWPLDKEPMLSAKDEKQPALKTL 
>NC_018079|3233675:3239949|3234215_3235094_-|WP_014832609.1|DBSCAN-SWA MATRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIKDILIISTPQDTPRFEQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFIGNDSCALVLGDNIFYGHDLPKQLEAAVSKGNGATVFAYHVNDPERYGVVEFDKNGTAISLEEKPLEPKSNYAVTGLYFYDNDVVEMAKNLKPSARGELEITDINRIYLEQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAYRKGFINAEQVKKLAEPLKKNAYGQYLLKMIKGY |
6 | Enterobacteria_phage(66.67%) | NA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
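The keyword labelling described in the legend can also be read from the protein FASTA headers on this page. The sketch below is a minimal, hypothetical parser: the field layout (contig, prophage region, gene coordinates with strand, protein accession, optional keyword label, tool name) is inferred only from the headers shown here and is an assumption, not a documented format. The names `PHAGE_KEYWORDS` and `parse_header` are illustrative.

```python
# Keywords copied from the legend on this page.
PHAGE_KEYWORDS = {
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
}

def parse_header(header):
    """Split one protein FASTA header into its pipe-separated fields.

    Assumed layout (inferred from this page, not from a specification):
    >contig|region_start:region_end|gene_start_gene_end_strand|accession[|keyword]|tool
    """
    fields = header.lstrip(">").strip().split("|")
    contig, region, gene, accession = fields[:4]
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    # Anything between the accession and the trailing tool name is treated
    # as an optional keyword label such as 'tail' or 'plate'.
    labels = fields[4:-1]
    keyword = labels[0] if labels else None
    return {
        "contig": contig,
        "region": (region_start, region_end),
        "start": int(gene_start),
        "end": int(gene_end),
        "strand": strand,
        "accession": accession,
        "keyword": keyword,
        "phage_related": keyword in PHAGE_KEYWORDS,
        "tool": fields[-1],
    }

tagged = parse_header(
    ">NC_018079|4203579:4231457|4219383_4221429_-|WP_014833335.1|tail|DBSCAN-SWA"
)
untagged = parse_header(
    ">NC_018079|4203579:4231457|4207134_4207350_+|WP_001144069.1|DBSCAN-SWA"
)
print(tagged["keyword"], tagged["phage_related"])
print(untagged["keyword"], untagged["phage_related"])
```

Headers without a label between the accession and the tool name are reported with `keyword` set to `None`, which under this assumption corresponds to the uncolored proteins.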
DBSCAN-SWA_7 |
4203579 : 4231457
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NC_018079|4203579:4231457|DBSCAN-SWA GTTATTTACAGCTGTTATTGGAAGCGCCCTGTTCACGCCATGCGCCCCACTGCCCGCCGGTTCCCGGGGTTTCCTGGCCCGGGTTGACATACCACTGGTTGAACCAAATTTTTCCGTTGTAAGAGACTTTGGTGCATTTGGTCGGGTAAGCAATTTTGGCGTTATAGGCCGGTGCAGCAGGTTTTACCGGCGTATCGTCTTTTGCCGGGGTGTCGTCTTTCGCGGGTACCGCAGGCTTAGTGACCTTAATGGTCATGCTGTCCGTCGCGGTACGGCCGTCTGCGCCGGTTGTGGTCAGCGTGAAGATCGCTTCTCCCTCGCTGTTGGCAGGAATGACGGCATAGGCATGCGCGCTATCCACGCTCTTCACCAGCGTGCCGTTCAGTTTCGCTTCCAGTGAGAAGGTTTCAGCCCCTTTCGTCACACTCCAGTTATACGATTTGGCATTCAGGCTCTTGCTGCCGTCCAGCGGGTAGGTGCTCACGCCAGTAGCCGGAGACACCATGGTGTAATCTGCCCCTGCCGCAGCCGTTGGGGCCGGCGCGACAGGCGTGTTACCGTCTTTCAGCGCCATAATGCGGGCTACCGCGCGCTCCATGTCAGGCTGTGTACCCACTTTCGCCGACTCCCCGTTCGCCAGCGGCGTACCGGTCTCCGCCAGAATCTGACGCATCTGACGCGGCGTCACGGTGATACCGTTGGCTTTCGCAATGCCCGACAGGCTCGCGACCACGCCGGCAATAATCGGGTTCGCGGACGACGTGCCGGCGAACGTCGACGTATACTGTGCGTTTGGCGCATTATAAAGGTTGCCAAAGCCCGCGCTGACTACATCCCAGCATCCCCATGACGAACTGGTGACGCGCGATCCGTAGGTACTAAAGTAGGCTCTCTTACCCTCCTTCGCACAAAACGCACCGGCAATGATGGCCCCGGAATCGCGGACATTAACGTCAAATTTACCGTTAAACGCACTATGATCGAGGTTAACGTTGCCGTTCCCTGCCGCTTCAATGACATAGACGCCTTTATCGGTGAGAGCCTTGATGATGTCGTAGTAGCTCTGCACATTCTCCTGCGGCACGTAGCAGGTTGTGGTGCACCCTGTGATCTCACCGCCGCCGGTCTGCATGCCAATCTGTACGACATCGCCTGCTTTGAGCAACGGGATCATGTTATAGAGGTTGTTATAGTGCCAGTCGCTGAAGCCCACGCGGCTTTTCCAGCTAAGCCCACGGACGCCTGCGCCAATGTCTCTTGCCGCCATAATCCCCACTGAAGCCGTATCATGATCGTCAACATATTTATTGCTGCCCTGAATCAGAGCTGGCTTCGGCAGGTTGATATGGTTGACGTTCCAGATATCGTTCTCCATGGAGATAATCGTGACACCCTCCCCCTCGTTGCCAGGATATTTATTAACACTGTCCCGATTGACGCCGCCCATATAATAGCCCTGACGTTTGTCAGTGGGCGATTTAGTGTAATATTGCTGGCTACGAAAGTCTGGTACTGACGATGCCCCCACCGAGCTTTGCACCGTTGATTTTAATGGTGGCTTATTTTGATGTTCTTCACCACTGTATTTATCTAATGATACCGGGACGCTTTCCGGGTAGACCATTTCCACATTCTGGTTTTGTTCCAGTTCCGTAATGACGCGATTGATATAATTTTTATCGGTGCTTTTATTTTCCGGCAAATCAATCCGCACATAACGATCAAAACCGTAGCGTGCATTCAGCTGTGCAAACTCATCAGATTGGGTACTGCGCAGTTTATTTGGACGGAACATACTGGTCGGCACCAGAGCGGGGTCGCTCAGCGTCAGCGTATTTGCCGTGGTTGTGGTTGTACTGGTGGATTTGAGCAACGGTTTGCCCGGCTTCAGCTTCACCACAATCGCAATATAGGATTCGCCGTCC
TGGGCAAACTCTGCCTGTGGCGTGGTTTCAGCAAAGGCAGAACATGTCACTGCCGCACATATTAAGAGTGAGAGAAGACTTTTTTTCATTTCGCCAACGTTCCTTTATGACATTTTGATGAGGGAGTTCCTTTTTACAGGATGAATTAACTCCCCATCACAGCCAATAAAAACGCAGGGTAAAAATAGTCAAAATAGATTGGGAAATATGTCAACCATGATAAGGCAATAATAAGCAAGGAACGAATAAAATAAAAATATCCGTTGAATTATTGAGTCGGTGAAAATGCGGGGGTTATAAAATAATTTGCTCTAATAGAGTAATGAATCTTCACTGCCTGCTTATCACCGCCCGAATGTCTGACATTCGGGCGGTGAATTGTCACGCCTCTGGCAACTCCGCCAGCGGCCAGCGAGGACGCACGGAGACGCTCAGATCGGACGTTGCGCCAGCGTTCAGGCGTACCATTCCGGCGTAGGCGATCATCGCCCCGTTATCGGTACAGAATTCCGGACGCGCATAGAACACTTCCCCGCGACGTTTTTGCATCATCTCCGCCAGCTTCGTGCGCAGCGTGCGGTTAGCGCTAACGCCACCCGCCATGACCAGGCGCTTAAAACCGGTCTGATCCAGCGCGCGTTTGCACTTGATCATCAGCGTATCGACCACCGCATCTTCGAACGCACGGGCGATATCCGCACGGGTCTGCTCGCTGTCGTCATTATTGCGGATGGTATTGGCGGCAAAGGTTTTCAGGCCAGAGAAGCTGAAATCCAGCCCCGGACGGTCGGTCATCGGACGCGGGAAGACAAAACGCCCTTCCGTACCCTGCGACGCCATTTTCGAGAGCATCGGGCCACCCGGGTAATCCAGACCGAGCAATTTGGCGGTTTTGTCGAAGGCTTCTCCGGCGGCGTCATCAATCGACTCGCCCAACAGCTCATACTTGCCGATGCCGGTCACGCTAATCAGCTGGGTATGGCCGCCTGAGACCAGCAGCGCCACAAACGGGAATTCGGGTGGATTCTCTTCCAGCATCGGCGCCAGAAGATGCCCTTCCATATGGTGAACCGGAATGGCCGGAACATCCCACGCGAAGGCCAGGGAACGGCCCACCGTGGCACCGACCAGCAGCGCGCCGACCAGACCCGGGCCTGCCGTGTAGGCGACGGCATCAATATCAGTTGAGTTCAATCCGGCCTCTTTCAGCGCCGCCTGAATCAGGGGAACCGTTTTACGCACGTGGTCACGAGAGGCCAGTTCAGGCACGACGCCGCCGTAGTCAGCATGCAATTTCACCTGACTATACAGTTGGTTGGCAAGAAGCCCTTTTTCGTCGTCGTAAATGGCGATGCCGGTTTCATCGCAGGATGTTTCAATACCCAGTACACGCATGACTTGTTTTACCTCGTTTCAGTACCGCGCAGTGTAGAGCCTGGGCGGGTTGATGTAAAACTTTGTTCGCCCCAGGACATCCGCTCGTGTATACTCTTCCCCCTTATAAAAGTCCCTTTCAAAAACGCGTCGGTGCTTTACAAAGCAGCAGCATTTGCAGTAAAATTCCGCACCATTTTGAAATAAGCTGGCGTTGATGCCAGCGGCAAACCGAATTTATCAAAGGTGAGAGTTACATGCCGGTAATTAAAGTACGTGAAAACGAGCCGTTCGACGTAGCACTGCGTCGCTTCAAACGTTCATGCGAGAAAGCAGGTGTTCTGGCTGAAGTTCGTCGTCGTGAGTTCTATGAAAAACCAACGACTGAACGTAAGCGCGCTAAAGCTTCTGCAGTGAAACGTCACGCGAAGAAACTGGCTCGCGAAAACGCACGCCGTACTCGTCTGTACTAATCCGTTGAGGGCGATAGTCCTCAATTGACAGACAGAGTAATAGTCGTAAGGCCGTGCTTCCGGAAGGAATGCGCGGCTTGTTTTCGTTTATGAGTTGCTAAAAACTTTTGGGGCATATGGCCGGACGAATCCCACGCGTTTTCATCAATGACC
TGCTGGCAAGAACCGACATCGTCGATCTCATCGACGCGCGGGTAAAGCTGAAAAAGCAGGGCAAGAACTACCATGCGTGCTGTCCTTTCCATAACGAAAAAACCCCCTCCTTCACCGTCAACGGTGAAAAGCAGTTTTACCATTGCTTCGGCTGTGGCGCACACGGTAATGCCGTCGATTTTTTAATGAACTATGACAAGCTCGAGTTCGTTGAAACTGTCGAAGAGCTGGCCGCGATGCACAACCTTGAAGTGCCGTATGAAGCGGGCAGTGGACCAAGCCAGATAGAGCGCCATCAGCGTCAAACGCTGTATCAACTGATGGATGGCCTGAATTCGTTTTACCAACAGTCTCTTAAGCACTCTGCGGCTGAGCCTGCGCGTCAATATCTGAACAAGCGCGGACTGAGCGACGATGTTATTGCGCGTTTCGCTATTGGTTACGCCCCGCCCGGCTGGGACAACGTGTTAAAGCGTTTTGGCGGCAATAGCGAAGATCGTAAATCGCTGATCGACGCAGGCATGCTGGTCACCAACGACCAGGGACGAAGCTACGACCGCTTCCGCGAACGGGTGATGTTCCCGATCCGCGACAAGCGTGGCCGGGTGATTGGTTTTGGGGGACGCGTGCTGGGTGATGCCCTGCCCAAGTACCTCAACTCCCCGGAAACCGACATTTTCCATAAGGGCCGCCAGCTTTACGGCCTTTATGAAGCGCAGCAGGCGAATGCGGAACCTCCGCGCCTTCTGGTCGTCGAAGGGTATATGGATGTCGTTGCGCTGGCGCAGTACGACATCAATTACGCGGTGGCCTCGTTGGGTACCTCCACCACGGCCGATCATATTCAGCTGCTGTTTCGGGTGACCAACAACGTCATCTGCTGTTACGACGGTGACCGTGCAGGGCGCGATGCCGCCTGGCGTGCGCTGGAAACTGCGCTACCTTATATGACCGACGGGCGGCAGTTACGCTTTATGTTCCTGCCCGACGGTGAAGACCCGGATACGCTGGTGCGTAAAGAGGGCAAAGCGGCGTTTGAAGCGCGGATGGAGCAGGCTCAGCCGCTCTCCACGTTCTTGTTTAACAGCCTGATGCCGCAGGTTGATTTGAGTACTCCTGACGGGCGCGCGCAGCTCAGTACGCTGGCGCTGCCATTAATTAGCCAGGTGCCCGGCGAAACGCTGCGCATCTATCTGCGTCAGGAGTTAGGCAACAAGCTCGGCATTCTGGATGACAGCCAGCTTGAACGTTTAATGCCAAAACAGGCTGAAAACGGCACGGTTCGCCCCGCGCCTCAGCTAAAACGCACAACCATGCGTATACTGATAGGGTTACTGGTCCAAAACCCCGAACTTGCGCCGCAAGTGCCATCGCTGGCGGGTTTGAACCACGAAAAATTGCCCGGACTGGGCTTATTTTCAGAACTGGTCAACACATGTTTGTCTCAGCCAGGTCTGACCACCGGACAACTTTTAGAACATTATCGCGGCACAAAAGAGGCCGCTACCCTTGAAAAATTGTCGATGTGGGACGATATAGCAGATAAGGACATTGCAGAAAAAACGTTCACCGACTCACTCAACCATATGTTTGATTCGATGCTTGAGTTGCGCCAGGAAGAGTTGATTGCTCGCGAGCGCACACACGGCTTAAGCAGCGAAGAACGCCGGGAACTGTGGATGATTAACCAGGAACTGGCGAAGAAATAAAAGAATTTAACGGCTTAAGTGCCGAATATCGATCGGGAAGCCCCCGGCAGCCGCACTGAGAGGCAGCGGCAAAAATATAAGTACGCCCTCGCTTTAAAGGTTGGCAAACCATCGCCGACACCAATCAAACGAATTAAGTGTGGATACCGTCTTATGGAGCAAAACCCGCAGTCACAGCTGAAACTTCTTGTCCAACGTGGTAAGGAGCAAGGCTATCTGACCTATGCCGAGGTCAATGACCATCTGCCGGAAGATATCGTCGATTCAGATCAAATCGAAGACATCATCCAA
ATGATCAATGACATGGGCATTCAGGTGATGGAAGAAGCACCGGATGCCGATGATCTGTTGCTGGCTGAAACCTCCAACAACACTGACGAAGATGCGGAAGAAGCTGCTGCACAGGTACTGTCCAGCGTGGAATCTGAAATCGGGCGTACCACTGACCCGGTCCGCATGTACATGCGCGAAATGGGTACCGTTGAACTGTTGACCCGCGAAGGCGAAATTGACATCGCAAAACGCATCGAAGACGGGATCAACCAGGTTCAGTGCTCTGTTGCTGAGTACCCGGAAGCGATCACCTATCTGCTGGAGCAGTACGATCGCGTTGAAGCGGAAGAAGCGCGCCTGTCCGATCTGATCACCGGTTTTGTCGACCCGAACGCTGAAGAAGATATGGCGCCAACCGCCACTCACGTCGGTTCTGAACTGTCTCAGGAAGAGATGGATGACGACGAAGACGAAGATGAAGAAGAGAGCGACGACGACAGCGCCGATGATGACAACAGCATCGACCCGGAACTGGCACGTGAGAAGTTCGCCGAACTGCGTACCCAGTACGAAGTGACGCGTGACACCATCAAAGCCAAAGGTCGCAGTCACGCTGCTGCTCAGGAAGAGATCCTGAAACTGTCTGAAGTCTTCAAACAGTTCCGCCTGGTGCCAAAACAGTTCGACTACCTGGTTAACAGCATGCGCGTGATGATGGATCGCGTACGTACCCAGGAACGCATCATCATGAAACTGTGCGTTGAGCAGTGCAAAATGCCGAAGAAGAACTTCATCACCCTCTTCACCGGCAACGAAACCAGCGAAACCTGGTTCAACGCGGCTATCGCGATGAACAAGCCGTGGTCTGAAAAACTGCACGACGTGAAAGAAGACGTGCATCGCGGTCTGCAGAAGCTGCAGCAGATTGAAGAAGAGACCGGCCTGACCATTGAGCAGGTTAAAGACATCAACCGTCGTATGTCCATCGGTGAAGCGAAAGCCCGCCGTGCGAAGAAAGAGATGGTTGAAGCAAACTTACGTCTGGTTATCTCTATCGCCAAGAAATACACCAACCGCGGTCTGCAGTTCCTGGATCTGATTCAGGAAGGCAACATCGGTCTGATGAAAGCGGTAGATAAGTTTGAATACCGTCGTGGTTACAAGTTCTCCACCTATGCGACCTGGTGGATCCGTCAGGCGATTACCCGCTCTATCGCGGATCAGGCGCGCACCATCCGTATTCCGGTGCATATGATTGAGACCATCAACAAGCTCAACCGTATTTCCCGCCAGATGCTGCAGGAGATGGGCCGCGAGCCGACGCCGGAAGAGCTGGCTGAACGCATGCTGATGCCAGAAGACAAAATCCGCAAAGTGCTGAAGATCGCCAAAGAGCCAATCTCCATGGAAACACCAATCGGTGATGATGAAGATTCGCATCTGGGTGATTTCATCGAGGATACTACCCTCGAGCTGCCGCTGGACTCTGCCACCACCGAGAGCCTGCGTGCTGCCACGCACGACGTGCTGGCTGGTCTGACCGCCCGAGAAGCGAAAGTACTGCGTATGCGTTTCGGTATTGATATGAATACCGACCACACGCTGGAAGAAGTGGGTAAACAGTTCGATGTTACCCGCGAACGTATCCGTCAGATCGAAGCGAAGGCGCTGCGTAAACTGCGCCATCCGAGCCGCTCTGAAGTGCTGCGTAGCTTCCTGGACGATTAATCCAGGTAAACAGCAAAAAGCTCCCAATCGGGAGCTTTTTTTTGTTTTATTCCCTCTCCCTGTGGGAGAGGGCCAGGGTGAGGGAATCAGGCCGCAGAGGCCCCTAGAGCCCCCGCACCACCAGCGCCTCGTCGAGCTCCCTGTAAGCTTCCACCAGTTTATCCAGCGTTGCCCTGTTAAGTCCGCTCGGGTTTGGCAGCACCCATACCTGCGTCACGCCGATAGTGATGGCCTGTTTCCCCCATTTCACCCCGCGCTGGCTGAACGCCTGCTCATAGGCCTGTTTGCCC
AGAATCGCCAGCGCGGCAGGCTGGTAGTCTTCGATCTTCTTAATCAGTTCCCGCCCGCCAGTGCGCAGTTCATGCAGGTGGACTTCGCTCGCCTGCACGGTAGGTCGCTCCACCAGCATGGTGATCCCGCAGCGCGTATCCAGCAGATGCTGCTCCTCTTCAGGCTTGAGTAAGCGGTCGGTAAACCCGGCCTGGTAGATCACTTTCCAGAAGCGGTTCCCCGGATGAGCAAAGTGAAAACCGGTATGCGCCGAGGATTTACCGGGGTTGATCCCGCAGAAGACCACCCGTAGTCCCGGGGCCAGAATATCGTTGATCATGTTTACTCCCGATTGATACATCATCATGGAAGTATAAAGGATTGATTATGCGTTGTTTATAAAAACAGCAGGCGGGTGTGAATGGCTGGATTGCGGCGGGGAGTTACATTATAATCCACCGCCACGGCCCCTTAGCTCAGTGGTTAGAGCAGGCGACTCATAATCGCTTGGTCGCTGGTTCAAGTCCAGCAGGGGCCACCAAATTTTAGCTTTAGAATCATATAATTAAGCCACTCACATGAGTGGCTTTTTTGTTGGCTCAATTTGCAGTGGCGATGAAATGGCGGTATCGCTGAGTTGTTACAGGGATAAAAAAAAACCGCACGAGGCGGTTTTTTTGACTGATGCGGCTATCGCTAACTGCGGTTCACCGAGGGCGCCCGGGCACGCTCTGGCATGCCAGGGGCTGCAATCAGGCGCTCAACAGACTCCATGGTGACAAACGTACAGCTGCAATCAACGTTGGTACACTGGTGATAACGCTCTTTGGTATTTTCACTCAAATAGCGACTGGTTCGCGCATGTGCCGAGTGCTTACATTTAGGACAATGAAACATACCGCCTCCGTTTAATTCACTTTTTGTGAATTAATAATACCCCAAAGACGTAAAATTAAAAGCATACTACTCGCTTTCGATGGAAAATTTTTCGTCGGCAATGTCAGGTTCAAGATTCACCTCCGTCGTAAAGCCTCTTCCGTCAAGGGTGTGTACGACCCGGCTGACGATCCACGCTTGCTCGTCAATCGCCCGCTTAAAGCCTTTTACCTGAAGGGGGGTTTCGGGAAAAAGGTCAGCTTTCCCCAGCGCCAGCTGGATTGAAAATTTCACCACCCCGCGCTGTAAAGCGCGCCATTTGGCTTCAGCCGCTCTGAGTGCCTGCGCTTCGGAGGCAAAAATAGTGGTCAACTCAAAGACGTTATCGGCGGCACCAACCAATGTCTCCTGCGGCGTTTGCTCCGTGCTGCCGGATACGGCTGCGGCATCCGGATGAGGTAACGCACCTGTCGCTGCCGCCTGAGGCAGACGATTGATGCTGACCTTGGGGTGTTGTTGTTTAGGATCGCTCGTTTGCAGCCATTTGGCCGTCACGCCGGAATACGCTTCGCGATCGGCGATGGAAAAGCTGTGTTTATCGCCATCGGCGCGTTCAATCACCTTCAACGGAACCGGTTTGCCGCTGGCCGTCACGGCGCCGCCCGCTTTCATGAAAAGCACCTTCCCTTCCTTAACGGCGACAAACGCGCCATTGCGTTCAGCCAGGCGGGAGAGAAACGCGGCATCGGACTCCTGAGTCTGGTCGATATGCGCAATGGCTATGGATGAAAATTCCGCCGCCACGCTGGCGCTAAGCTGGTTGCGCTGCGCGATGGTATCAACAATTGCGCCGAGGGTGGTGTCATGCCATGACTGCTCGCGACGAACGTTGAGCTTGCCACGAAAATCGGCGCTGCGCCCACGAATGGTTAACCGGTCCGGCGCCCCCCGAAATTCAATATCATCGACCGTGAAATTCCCCTTCGGCTCCAGAGGGGTACCCTGCCAGCCCAGCCATAGAGTAATAAACGCGCCGCGAGCCGGCAGGTCTAACAGACCGTCGGAATCATCCAGCTCAATGTCCACCTGATCGGCTTCCAGGCCTCGGTTGTCAGTCATGGTCAAGCGGATAAGGCGATGGCTA
AAATTTTGCGTAATATCACGGCCATTAAGCTTGAGCATAAAATCGGGGGCGATTTTCCCCCCCGCCCGGATATTCATTTCCGTGATCATCCCAGCAGCCCTCCGATTTCGGTACGCGCGCTGGTCACCAGCTCTTCAGCCTGTGTTCTCAGGTCGCCAAACATCGCCATCAGCGATTCATCAACCCGCTTCAGAGAAAGCGTAAAATCAATTTTACGCGCCGTACCATCGCTGTAAAAATCCGTATGGGTGTGCGTGACCTTATCAATCACAAACATGCCATGGATAATGCCCGTCCCGTCAATCAGCGGCCAGGCCCGCCCCTCATTCGCCATCAGTTCGATGGCTTTGAGGGACAATCGCCCTCCGGTAATTTCAGGATAAAGCGTTCCGGAGAGCGTATGGGATGTATCGCCCTCACCCAGATACTGCCAGGCTTTGGGCTTCCCGATGCGATCGTTAGAGGCCCAGCGGTAATCCTTTGAAAACTGCATTGACTGATACGGCAAGGTTCGTCGCTCAAAGACAAACAGCCCCAGCACCATTAACATTTTCTCTCTCCTTACTCATACATAAAGCGCGATCGCTGGTTTGCCGCTCTCTGATTCTCTCTGCTGTCCAGGGCATCCTGAATTTGCCGGCTGAGGTCCGCGCCGGATGTCATGTCACCCTGCAGCGTGAAGTGATATTCGCTCCTGCTCTGATCGACATAAGAGCGTCCGCCAGTGGCCAGCGCTGGCTGATACTTCAGATAGCCACCATAGGTCGATGTCGGCGGAATATACGCACTGCCCGGCGCAGAGGAAGCCGCCCCTGCGTTAGCCGCAGCGGGCTCGATGTTGTCCGACTCTGTTTTGATAAGACCGAGTTTATCCAGCAGCCAGGTCGCTTTGCTGCTCAGACTGTTGAACAGATCAAGCGGTGCCATCAACGCCTCCCCGAGCATCTGACCGAACATCACCCCCGCGTTTTTGCAACTGTCGAGCGTTTCCTGAGTCGCTTTGATTGGCGTAATCAGGTCAGTAAACCACTGCCAGATCCCGCCCAGCTTCTCTGAGATGAAACCAAACGCCTGCAGAAGGGGAGAAAACAGCTCGCCCAGTGGAGCAAAGGCCACAGAAAGCCCTTCCATCACCCCGGTAAAGAAGGCGCTGACAGGTTCCCAGTATTTAAAAATCAGTAATGCACCCGCCGCAATCGCCGCACCAATGGCAATGATGGGCCAGCTGAGTGCACCCAGGATCGCCATGATGCTACCGCCGACCACGCTGAAGGCGGTGCCTAACATCCCGGCGGCGGTAATGACCAGGTTCAGGCCGGTCACCATCGGCGCAACGACCGTGCCCAGCCCGCCCACTATGCCGGCAAACGCCTGCGCGCCGACGATAACCCCGACCAGGCCTTGCGTCAGTGCGGGGTTGGCATTCACCCACAGAGAGAGCGTACTGATCCACCCGGTCGCGGTGGTGGTCAGGGTACGCAGGGCGCCATCCGCCTTATCAAATACATCCATCTTCAGGCCGTTCCACACCGCCTGAAGCTTGCCTATATCACCGGCAAGGTTATCGGACTGAACGGCAGCTACCCGCGCGGTACTGCCCTTTGCCCCCTGGAGCTGCTGACGTTTCTCAGACAGCGTTCCATTACCCGCGGCGGAGACCAGTACCGCAGCGCCTTGCGCGTTTTCCTGACCGAAAATGGTCTTCAGGTACGCCATCTGCTGCGCGGTACCGAGTTGGCTCTTTTCAAAGGATGCGGAGAGGGCTGTGAGGATTTGCTCCGCAGGCAGCAGATTACCTTGTCCATCATGGGTCTGTACGTTCAGTTCACCCAGCGCGGCAGAACCCTGACCGTTCTGCATCTGCAGATGGCTCAGCACAGCGCCGACCCCGGCGCCAGCCGTGCTCCCTTTCATCCCCTTTTCCATCAGAATGCCAAGCAGCGCGGTCGTCTCTTCAAGGCTCACGCCAGCGCCGCTGGCAATTGGCGCGACAGAGGCCA
CGGCAGCACCCAGCTCGGTAAGGCTGGTGTGAGAGGAGGTAAATCCGCGGGTCAGCACGTCTGCGACACGCCCAGCATCGGTATTGGCCAGGTTGAACGCGGTCTGTGCATTGCTGATGATGTCGGCGGCGTTTGCGGCATCGACATTTCCCGCAAGGCTGAGATTGACCGCAGGAAGCGTGGCTGCAAGCACACCCTCGGCGTCATATCCTGAACGAACCAGCTCGGTCTGCGCCCGGGTGACCGCCTCCGTTGAGGTTCCCGTACCGGCACTGACGTCCCGCGCCTGCTGGCGAATAGCCTCAAGCCTGGCATCTCCCTTCTCCAGACCAAGACTGGCCTGAATAGCAGACATCTGCTTTTCAGCACTGATACCCGGCGCCATAAACCGGGACATCTGATCAACGCCCGTCTTTGCCATGCTCACGCCTGCGCCAGCGATCTGATGCACGCGCGTGGCAACACGTTTGCCGGATTCGTAGCGACGCTGAACGGCATTGAGCCGCTCCTGCTGCTGATTAACCCGGGCGAGTGCCTCTCGTTGTCCGTTGAGCTGCTGCGTTTTTTCGCCGATGCTCTTTCTTAAACGACGTTCATCTGCTGCCAGGGTACGGGTGTTAATCCCCGCCTGCGCCAGTTCGCCGCGCTGGCGCTGGACGGACTGGCGCAGAGTGTTGTACTCAAGCTTCAGGGCGGCGGCAGACTGGCGGGCTGCCGTCAGCGCATCGTTCTGCTCCCGGGTGGGGTTTTGCGTGTTTTTCAGTTCGCGCGCCAGCGCCGCCGTCTGTTGTCTCGCCAGGGCAAGCGCCTGCCCCGTCATGGTGAGCTGACTGCTTGTTTTTCTGAATCCATCGACGCGGGCCGCCTGTTCATCGAGCGCGCGCAGCGCCGCCTGCGATTCACCGATCTCTCCCTCAAGAGAGAGACTGGCATTCTGAATAGCTTTAAGCGGTCGGCTTGCCCTGGCGACTGCATCCAGCAGCGCCTGAAGTCTGACATTATTACTCATGGTGGTTTCCGCTTCGCTGCAGCGCCTTTTCGCGCCATATGAGGAGTTCGGTCACGCTCAGGGGGTACAGCTCTGACGGCGGCCAGTGAAAAATCACCGCGATATCCGCCATCAGATCGTCGACCGACAGGTTTTCAGGAAATTTCAGCGAGCCGAAGTCGGTGACAAAAAACCGACCACCTTACCGGCAAAAGAGAGCAGGTCGGAGGCATCCAGACGTGCAATCTCATGTTCGGTCAACGCCGGAGAGGTCATTCGCGGCAGCACCTTAATCAGGGCATCCACGTCGGATTGCGCCAGCGACGCCAGCGATACGCCACGCAGGGTGCCCGCATTGGGTTTTGCGACGGTCACTTTTTCGATTTTTTGCTCGCCGCGCAGAACGGGGCTATCAAGCGTGACGGTGTGTGGGTTTTCGCTTTCGTTGATAACAGTCTCGTTGATATTTTCCATTTCGTTACTCTCAAAAAGGGTAAGTAACCGGCCAGCAACTGCGGCCGGTTAAGGGTTACAGGCCGATGGCCTTACGGTGTTCCGCCAGACGATCGACGCCGTCGACTTTCAGCACCATGTTGATGATGTCAATCTCGATAACCTCTTTGCCATCGATAGTAAGCTGGTAGTAAGCGCACTCGGTGGACATTTTGGTCGTGCCGCTTTCGCCCTGCTTGTTCTCGCCGCCGTCAAACTCTTTGTGACGGCCACGCATCACGATCTCCACCGCAGAGATTTCGCCGGTATCGTCACGCTGGAAGGAACCGGTAAAACGCAGCGGCACGCTGTCCGCGCCTGGCGAGGCATACTGCGCCCACAGAGCAACGTCCGGCAGGCCACCAATGGTCCACTCAAGCGCCAGGGCATCATCGTCCAGGCCCAGGTCGACAGAGACCGAGCCCGGCATACCGCCGCCGCGATACTTCTCCAGCTTGCGGGTGAGCTTCGGTAAGGTGACAGACTCAACAACGCCCATATAGCTCAGGCCATCGTTGAA
CATATTCAGATATTTAAGTTTGCGTGGTAACGCCATGCTGCAGCTCCTTAGCTGTTAACCGAATCTGACAGGTCTGCCAGATAGGTATCGGTGATGCGCTGGCGCAGGGTCAGATTCTCCAGCGGCGGAACAGGGGTGTAGTCGTAATCGATATACAGTTTCCCCGCTTTGAGGGTCGACACGTCATTCGATTCCGGGTCATACCAGCAGGAGCCGTCAACAATGTAGCCATTGGTTTTCAGCTCACGGAATTTGGCATTGATACCGGAAATGATGTCGCGAATAAGCGTTGGGGTAATGGGTTTATCCATCGCCCATGCGTGCGCTTCCGCCATGGTGTCGGCCAGTACCTGCGCGGTACGGGTGTAGTTTTCAAACAGGAATAACGGATCGTCAGAGCAGGTACGGTTGCCCCAGAATTTGAAACCGTCATTGCGAATCAGCGTAGTCACGCCCGCCTGGTTAAGCAGGTTGGCATCGGTGGCCTGTTCCTGCAGGTCCCAGGAGACAGAAGCGCTCACCCCGGTGACGCCGTTAACACCGACGTTGGACAGGGTTTTATGCCAGCCGGTAGTCTGATCAATCTTGGCGCGCAGACCAAGTGCGCGTGCGGTCGCCCAGGCCGTTTTGGTGGCGTTCGTCGTGGTATCCCACGCCAGGAAATCCGGGTGAATAACCATCAGCTCACGCTGGCTGAAGTTTTTGCGATAGTTGATCGCCTCAGGAATGGTTTTACAGCCCCATGCGCTGACATAGCCAAACGCACGCAGGCTCTGGCACGTCGCTGCCAGCGCGGTCGCCACTTCCTGCGAATCCAGCCCCGGTACGCCGAGAATGCGTGGTTTAACGCCGGTCACCGTTTTCGCGGTGAGAAGTGCCTTCAGGCCGGTGTATTTGCCGTTTTCGTCGGTGGTACCGATGATGTTAGAAATGGTTTCTTTCTGTGCGGCTTCCGGGTCGTCCGGGTTATCGACACCTTCGGCCACGCGAACAACAACGACAACCGGTTTACACTGATCGGCAATCGCCTGAAGGGAAGCGGACAGCGTGCCTGTTTTCCCGGCTTTCGCAATCGCGGATTGTACATTGGTAATGAGCACAGGCTCGTTAAGAGGGAATGTCTGTTCGTCAGCATCGCTGGCCGTACAGACCATACCGATGATTGCCGTCGAGACGGTGGAAATGGTGCGTGTGCCATCGTTGATTTCGATGACTTCCACGCCGTGGTGATAGTCGCCCATCCGTTTAACTCCGTGGTTTAGTGGTGAGGGTATTTTCTGTGGAGCGCGCGATGGATGCGACGTATTGGGGTTGGGGAAAGGATTACACAACAAACGAAAAGCCCTCCGGGAGGAGGGCTTGGGTTATTGCGGTTTTTCAGGCCATTCAATGTCTGGCGCTTTGCTGATATCAACACGGTTGAGGAGAACTCTGTATTTTTTCCATTCTACTAGCAATTCAGCCTCTTCTTTTGTGGCAATACCTAGCCCAGACGCATCCTCCAGCGGATCAATAGTTGCTGTTGCCGCACTCATAAGGATCTGTTTTTTTGATTCAGCCAGATAAACGTAATCGATGGGCGCAGGAATGATTTTTTCACCATCAAACATCCATTCTCCATTATCACTTAGACCATCAGGAACTAGATTTTTGCTGATATCAGCAACCGATCTTCCAATCGGCCAAAGCATAGACACATCATATGAAAAACGCGTAATAATGCCGTTTTCGTCAAATTCGAACTTTAATTTATCCTTTTCAAATAAAGATTGTGCCTCATACCAATCCTGGGCATCCTCTGAAATAAGATTAGCTATACTGTAACCCTGAATTTGTTCTGTGCCAGACAGACGGAATTTTTTAATAATCATAAGATACCTTTATGCAGTAATTGTTTTCCATGTACCGTTTACATTTATCTGTAAGGCGCGGGAATACCACACGGAATAACGTGCATCACCATTGTTACTGTATCGACCTGTTATGAAACTTCCAACTGG
TGCATCAATATCGGCTGAACCGCTTGAAGCCGACCCCATACTTCCTCTTGCACTGACGCGCACGCCAATAACTGTTGCTGATTTAAGTGTGTAACGTGCATCAGATTCAGCTTTTGTATATGCCTGTCCAGCTGGTGTGTAACTACCTTTTGCCTGGAATCTCGAATCACTTTCCGTTTTGGTGTAATAGCGCCCATCAAAATTGGAATAATTGGTTGGTATTATCTGCCCAGTAAAGGAAAATGTTGTGGTGTTCCAGTACCCCATAACTTTTGAATTTGCCCACAGGTCAACCTGACCATCTACTGAATTACGCAGACCTGAATCGTTATCACCAATATTTAAAACAGCAGATCCAACTGAACCAATTTGAAGGGTGTTTTTTACCGTGAGATTCCCATTTACCGCCCCACCTGCAAGTTGCAGATAGCGAGCATCAAAGTTTGCGTAATTTCCCGGTATCCATTGTCCGCTGGTTTGAGCCGTTCCGGCATCAGTGAACGTTAAACGATACTTGTAATCCTGCCCGTTTAGGGTGCCGTCAGTATGTATGTCAATATAGAAATTTCCTGCTGCCCCCTTTAAATGCCCAATTTCCAGAGGTGTTCCGTCAGTTCTGGTGAATATCTGATTGCAATATTTTAACGCAGTGACTTTATCCACACTGCCATCAGAATTAAGTGGAACGGCACCTACATCCCCCGCTGAAGGTTTATTTGCCGCATCATACTGTTTTACCCAGTCAGACCAGGTCGCACCATATAACGTGCGAATGTATGAGCGGGAATTATTATAAATACGGTAAATCTGTGTAATACCAGCGTGTTTATAAACTTCAAGCGATCCTGCATTGGCCTCGGGATAATTTTTACCAGTCTGTGCCTGTGCGTTAGCTGGCTGGTAATAAAGCCCCGGTGTCGTGTAGGTATTCAAGTCGGCAGCATTGCCAATACCAGAAGCCTGACCGTTAAAAATATCCTGCGCACTGATATTAAAATCCCCAGTCAACGCATGACCATTAATCTTACGTCCTGACGGCACGCGCCCGTTAGCATTATCATTCGCGACCTTAACAGCTTTCAGCGTCGCTGCGAGAGTCTCAGACGTACTATCGGTCGCACTACTCAGCTGGACAATACCCTTTTGCGCCGTAGTGGCATCCTGAGCTGTATATTTCCCATTAGCCAGATCGTACGCAGTCTTCACCGCCTTCGGCGTTGCCGCCAGCACCTCAGACACACTGTCAGTGGCGCTACTCAGTTGAACAATCCCCTTACGCGCCGTGGTTGCATCCGCTGCGGTATACTTCCCGTTCGCCAGGTCGTACGCCGCCTTCACCGCTTTCGGCGTTGCGGCAAGCACCTCAGACGTGCTGTCGGTGGCACTGCTTAACTGGGTAAAGCCCTTTGCGGTCAGCGTGGCGTCCGGATGGCGACGCGACTGCTCGTGCTCCGCGAGCTTCCCGTCGACGTATTCCTGCGTGGCCATTACCGTTGAGGTGTCAATCGTCAACTCGACGGAGGAAATATCGCTCACCATGATAACCATGCGAAGCGTCTGCGCGCGTCCTGACCCCTCCGCCAGCGCGGGCTTATAGCTTTCAGCCATATTCCCGACGGCAATCAGCGTTCCGGTATCGTCATAAAGGCCCATCTCACGCATCCAGAAACCGCCCACTTCTGGCGGGATCAGCAGCTCCGCCACAACGTAGTTTTTGTTCTTCTTGTCCTGGCTGATTTTGTTCAGCGCGTGGCGCCAGACTTCATTCACCAGCTTTGTCTGGCTGGCATCCGGAACGGGCAACGCACCGCCGCCGTCGCCGACCGCCATTGCGGTGAAGTTCACTTTCTTCCCGTTGGGGACGGTTGCTGCAGCCAGTTTTTCTGCACCGGCTTTGGTGATAACCGTTTTAAATTTCACTGTCATTGTGCTCTCACTTATCCGGGGTAAACCGTAATGATGTCGCCGTCATAGCTCAGGGCACCGGTAAAGAGATAACC
CGGGATGTCCTGGATGATATTCAGGCCGATAAGGTGGCGGCTGGCAGGCTTCGCATCAGCGATAAGCCTTTCCATCTCGTAATACATTTCCTCGGTGATACCGGTGTCTAATACGCCGATATCAAGGCGGAAGGTGCCGGGCGGATCGTTGGTTTGCCACCACTCGGTGACGTTAATCAGATAACCAAGCGGCTCCACGACGCGGCGCACGGCGCCGATGGTTCCCTTGTGGGCATGGATAAACCACGCCGCGCGGATCACGTCCCGTTTGGTGGCCTCCGTCCAGTTCTCGTCCCAGCGATCGACCGAAAATGCCCATGCCAGCCACGGCAGCAAATTCGCCGGGCAGCTATCCGCGCTCCAGAGATGGCGCAGCGGTACCGGCGTTTTTTCTATCTCCGCACAGGCGCGCGCGGCCGCCACCTCAAGAGACGACGATCCAACCGGTAAAAGACGGGTATCACTCATCGCTCCCCCCCACGGTTACGCTGTACTGGCTGCACCATGAGGCCTGAGTTTCATCGAGCACGATATCGGCTGCCGGAGCGGTCAGCTCCACGCGCTGCACTCCTTCAACGTGAAGAGCGGCGTAAATGGCAGACTTGCGAATATCGCGTCCCAGCCGATGCTGCGCCGTGATATAGGCCTGCAGCCGGGCTCTGGCCGCGTTGAGCACCGGTTCGCTTTCCGGGCCAGGGAAGAGGAACAGCGAGGCCGCAATGGTGTAGTCGACAATTTTGGCCGACTGGACGGTCACGCGATCGGCGACTGGCCTGACGTCCTCATCGTTCAGCGCATTGCGAACGAGGGTAAGCAGCTCTTCAGACGCCACGCCGTTATTCTCCCGGGAGAGCACGGAAACCGTGACGTTCGCGGGCTGAGGGCTTATCACGGAAATATCCGCCACCCGACCATCCGCGCTGCGGCCATGGAACTGATAAGCGCCAGTCGAACCGGCCACGCTCAGCCCTTCCGGTGCCTGCTGGATACGCAGACGAAAGTCGGTATCAGACTCCATCACAGCCGGCGTGGGCGGAAAGGTCGTGTCGTCGGCAGGCGTAATCACCAGACGCTGAAGGTTAGCGTTTGCCCCAATCTGGTCGAGATCGCTGCCAGCGGCGTAGGCCAGCATGACCGCTCGCGCGGCCTCGTTGACGCGCTGGCGCCAGATAACTTCCCGATACGCGTTTTCCTGCAGCAGCTTCACAATCGGCTCTGACTCCAGGGTCAGCGTGCGTGCGATAGCCTCTTGCTCCGCCTCAGGAAAGAGTGAGACAAAGGTGGCCTTTCGTTCTGCCAGCAGCGTTTCATAATCCACCTCCTCCACGACATCAGGCGCGGCGAGCTGGCTCAGATCAACAATAGCCATAGCGTTTAACTCAGTGAAATGGTGAGAGAAAAAGAGCGATCGGAAGCCGGTCGCGTGCCGGTGATATCGACATACAGCGTCCCGTCACTCTCCGACCGTTCGAAAGCAATCGCCGTCAGGCTGATCCGCGGTTCCCATTTCTGGATCGCGGAATAACATGCGGCCATAATCTGCAGGCGCAGCGCCGGACTCTGGGGCCTGTCGATCATCGCCGCCAGCAAGGAGCCGTACTCCCGGCGCATAACACGTGAGCCAACTGGCGTAACCAGAATGTCGCGCACGCTTTGCCTGATGTGTTCCGCCTCTGTAATGCTGAGCCCGGTCTGACTGTTCATTCCCGTGTAGCGCACCGTCATTGCGTCCCCTTAGTCCAGCTTCCGCCGCTTTGCACACTGCCGTGGGCATGGTTGTCCACCTGCACCCCATTGGAGGTGAATTTACCGCCGGCGTGTTCGATATTTCCGGCCATCACCCCGCCCTTCTGCACCTCCAGAGAGGCGGTAATCAACTTGTTGGTACACACCACTTCAGGCGTATCCAGCGTGATACGGCTCTCAGCTTTCACCTGTACCACCGGAACGGTCGCGGTCAGCGATTCCGATGCGGTGATGGTTGCCGTTTTGATGCCGGT
CGCCGTCAGCGCACCGTTGCCGGGCTCGTATTCGATAACCGCCCCATCAGGAAACGAGACGTGGAGCGCGTCAGGCGACTCAGACGGCGCCGGATGGTCGTCAGAGAAAATGCCTGGCAACACGAAGGCGGTATCCAGCTCACCGCCGATAGCCAGCAGCAAGACCTGTTCACCCTCGGACGGTGCCCACCAGACGCGCGAGCGGCCTGCGCGGCAGGTCAGCCAGTTCAGCCAGGTGGTTTTCATCCCCCCGGACTGGACGCGGCAGAGCCCTCGTTTGAGGTCGACATCGGTCACAACACCGAGACGAATAAGATTGCGGATCGCGCGAGCGATGCCGTGCATGGACGTTAGCGTATTCATAAGAAGAGAATGCCGCTCAGGGCGATCGGCAGCAACGAGGCGGGGTTTTATGCGGGATGAAACAACAAGCAGGGGTGCCGTCTGGCCAGCCGACAGGGCTGCGGACTGGCCATCAGCACGGTTACATCAGGCCTCCCACCGGCTGACCAGTTCACCGTTGACGTATAACTCCATCGGACGCGTGACGGGTTCTGGCAGCGGCGGCTCCGGAGAATAGGTGGCAAGCAAGGTCCCTTCATTCTCGGACACCAGAATCCGCTCGGTTAACTGCAGGCTGAAGCTGATATCTGAGGTTTCATCGTCATTTAAGACAACCTGAAAGAGATACCCCTTTTTTCGTCCCTCATCGAGGGTAAAAATATCCGGCTGGTTTTCCCGAAGCCAGGCCAGCACCGGGACGAAAATCCCCTCGCTGTCACCGGAAAAACCGCTGACGTTTGCCTTCAGCTCATATCGTTTTTCGAACGAGAGCGAGGCCGCCAGACGCGCATCGATGTTACCGCTCCAGACGGACATCTGTAAGCGCTCCGGGTTTGCGCGCAGTTGTGGTACAGCGTCAATTAATGCCTGACGCAGGCTTTTGAGTTTGTGCATCGAGTTTATCCTGACAGTCTTTAATGGTTTCAACCTGCAAGGCGCAGGCGATAAGAGCGTGCTCAAGCCTGCGAATATCGGCACTCAGATCGCCGTTTGTGACAGGCTCACTTGCCGGCATCGGGCAGGTGCTCACCTTTGGGCAAGCGTTGTAAACAATGTGCGGCGGAGGCGCAGGCGGTGCGGGTGTGCAACCTGCGGACAACATCAGGCAGCTGAGCGTGATACCAGCGGCGTAGTGCTTCATTTTCATTCATCAATCTCCCGATAGCCGCTTCGCGCCTTGCCATCTCCTCAGCGGTAACGGCAAGCGCATCGCGGAGCCTGACCTGCGCGTTTTCATTTTCGCTGGCCATCCGTTGCGCTACGGCCAGCTGATGGTTCAGCGCGTTAATCGTGTTTTTTTGTTCAGTGGCGATCTGGTTGGCTTTGGCAAAGGCGCGGGACAGGTTCTGATTTTCATGACGAAACCACAGGGTAATGGCCATCAGCACAGCCAGCATCAGCAGGAGCGTTCTCATCTTCCCCCCTGCATACACCAGGCCTTTTCACGGGCCCGACGGTTTTCCAGCCCGGCGTTCTTCTTCCCGTTGACATACACCCAGCGGGTCAGCTGATCGCAGGCCTGTGGCCACTGTTTGTGATGAATAAACGCCACCAGCGTCGAACGGCAGGCGGCACCGGTTCCCACGTTAAAGGCAAAACTGACCAGCGCGTCATAGACACGGGGTGGCATCTCCACCGGCGCGCATGCGGCAAGCCGTTTCTCAACATTCAGAACGTCGGCGATCAGGTTCGCCGCCGCGTCACGCTCGGTGATCGCCTGCGTCGGTACAACGCCTGCCGTATGGCCAATGCCTGACGTCCATACGCCCGCGCTACAGCGGTAGGGCGAAAGACGGCAGCCTTCGAGATCGGCAATTAACGCCAGCCCTTCAGGCGAGGTGTGCAGTAATCGAAAGTCAGGCATGAGCACCGCCAGGGCAAGGACGCTGGCGACGCTGCAACGCTTAATGATTGAGTTCACGAATACTCTTCTTATC
GAGGCCCAGAGAGTTGAGATAGCGCCAGGTTTTTCGCTTAAACCAGTAATTCGTCAGCGCGGTAAAGATGGCGCAGAGGCTTCCCACGTACAGCGCCACTTTTTCAGGAGACATCGCCCCGAACCAGGCCAGCGCCACGGCCAGCCAGTAGGCAATAAACGTGGTGACTTTCTCCAGACTCAGTCCCATAGGTTCACGGATTCTTTTCTGGGTGCGCTATCAACCTCTGGCATGTCTACTGCCGTGCCATGGGGTAAGACAACGCCCCACTCAGTAAGGCCAGGATTGGCTTTCAAAACGGGTTCGACTACGCCTGCCGTGCGCCCGTAATAACGCGCACAGATGGCATCAAGCGTATCTCCCTGCATTGCATAGATCTTCATCAGACACTCCAACATCCGGATTTCCAGGTACTGTAGAGTTTCCAGAGCCAGCGCCCTTTCCGCTACCGCAGGCGGATGGACAATCGCGGGCACAACAGGCTGTCCGCGTACGACAGAACGCGGCTCAAATCTGAATGAAGGTTGTTGCCGGGGAAAGGGCAGACGCTAGCGCCAGCTCTCGTCTTCCCAGACTTCCCGAAGAATGTTGTCTAACGCTTCGCGCTCGGCCTCGCCGTTGATGCCCAGCAGCTCGACGCCCGTCATGGAGCCTTCTTTCACCGTCACCCGCGTTGACGGGAACAGCGGCCTTATCCTGCGGGTGAGTTCGCACTGGAAGGCTTCCACCACCGACGGACCAATGTGCTGATCTTTATCGAGGGTGATATTTACCCGCACATGGCTCTCTTTTTTGATTCGTTCCGGAACAGGCGATGCCGAGAAAACAACGGTAAACGCGTTGTTTTTGATTAAATTCCCTCTCGCTATCTCAGCAATAAGATTCAGGGCAATTTCACGATCTCTCTCCTGACATGTTCCTTCTGTCGTCAGTCGCGCAATCATCTCGACTCGTTCAATCATGACTTGCTCATTCAACTCTCTGTCCACACAACCTCCACCACGAGATACTGTATAAACATACAGTAGCACGTATTCATAAAAAGAGTGAAGCAAAAAATCAGAACCGTATGCGGTATGTGCATGATATCGATGGAGATTAGCGCATCACCGCAGCCAGCTGACGCGTTAAATGCCCGGTACGTTCAAGGATTGAACGGGTCTTTTCCTGGAATGGCGGGGGCAATTCGCGGTGGCGGTGCAGTTGAGAAAAATCGCCCGGCGTACAGTTATTGACAGAACTCCAAGAGGGCGCGATGGCGCCCTTAAGGTCAACGGCCCGCTTCGGGACAATCTTCCACTGTTTGAGGCGGGTTAACACCGGTGACCCGGCCCCAACGGCCGCATCGTAGACCCCGCGGATACAGACGGTTTCCTCGCCGTACTGGTTATATTCTTCTTTCGATTCGTAGAGCGTACGTACCTGCAGATCGTCACGACGGACAAACGGCCCACCCTGGGCGTTCACATACCCCGCCCAGTCTCCGGCGTCCGCCGCGTCATGCACGGCGGCAAACTCCACGCTCAGGCCATACGCCGTCTCGCGCTCGGTGAGACGGCGCAACTCCCGGTAGACCGTCACCGGCGCACCGCCAATAAACTGAAACTGACGGATGTGCCAGCGGGCCGCCCAGGCGGAAACGGCGGGAGCGGTCTCTTTCAGCAGCTCGCCATTTTCATCATCCGTTTCGCCCTCAAGCGCATAGCCATCGATATTCTTCGCGATATATTTGGCGATATAGCCTGTAGCGCTTCCCTTCTCCGGGTCAATGGCTTCGGCGTGAAAGCGGGCTTTTTTAGCCCTGTCGCGTTGTAACTCGAAGGCATCCTCTTCGCAGGCATAGTCATCCATGATGCGTCGAACGCGCCCGGTATCCTCCGGTCGCATGAACAGCAGCATATGCCAGTGGGGCGTACCGTCATGATGAGGTTCAGCAACACGGATCCCGAAAATACGCATCCCCTCCCGGTGCAGCTTCGCCCGGATGCGCGCCCAGAGCC
GGTTAAGGTAGTGTTGCGTGTCTGCCGGGCTGGCGCCGTTCCACTTGCCGTTACGGTAACCGGCCTGCACGGTCGCGTGATATTTTGCGGGGGCGGTTAAGGTGTAGAACTCGCCTGCATAGCCCAGCTCATGGCAGATATTTTCAAACCCGCGAATACGGGTCATCAGTTCGCAGCGGCGTATTGCGGGGTTGGCCACCGAGCCATCGTATTTTTCAATCAGGCTGATACGGTTACCCTCTTCATCTTCAAGCTCCAGCCCCTTCAGGAACTCACGCGTACGGCGCTTCTGCTCGCGCCACTCGGCGATGCACCGTTTGCTCGCATAGGCAGATTTCTTTTTGCTGACGTGACCGATGGCGATCTGCAGATGCTCCCGCCAGGCGGCCGCCATCCGACGCAGACGACCGCGCCACCAGGCCTCTGAAAACATGCGCATCACCGCCGGGGCAACGTCATCTTTGCAAAAGACGGTTTTCGACACGCGCTCCCAGTGGGGAGGGACCACATTAAACTGACGGGTAATCACACCGGCGCGTTGATACCAGGCATGCAGCGTTTGATATTCACCCATGCCGGCATCGTCCATATTTGCCAGCTCGGCGCGAATAAAGCGGGCGATGTCAGCCCCCAGCAGATCGATATCCGCACGGGACATATCCGGAAGGCGGTTGAAGCGGGCGACCATATCCACCATCCGCGAGGCGAGGTATTGCAGTTGCGGCGTGTCAAAGTGGCCATCAAAAACCGCCGCGGAGACGCTCTCGTGCACTCCAACCCGGCTGTATTGCTCAGCAACCCGCTGCAGGCGCGGTAAGACCCCTTTACAGAAGCGGATCAGAAACGCATTGGCTCGCGGCGTGCCCTGCTGCTGTTCGATGGCGTCAACCGTACGCCACACTTCGGAACGGACGCATTCAGGCTGCAGAGAGAGGGCTTTTCTGGCTTGCAGCAGCGCCGCGAACAGACGATCGCGACGCTGCAGTTGAGAGTGTGTGAGATAGGGGCTGGCAATGGCCGTCCGTGGCGCATTCCACGGATAAGCACAGGAGATAGCCAATTCACCCTCCCCGATAGTGTTTGTTTTTCAGCTCCGTCATTTCCTGGCAGGTAACGCACATGGCCACGCCGGGGACGGCAATACGGCGCGCCTCGGGGATCGGGGCGTCGCACGCTTCGCAGAGAAAACGCGATGGCGACACAGGTCGACGGCGCGCGCTGTTAATGTGGCGCTCGCGTTCTGCCAGTTCACGCTCCTGCACCAGATCGATAAAATCCGCCATCAGTGCAGCTCCTGGGATTCACGCTCGTAACGGGCCGCTTCGTGGCACAGCAGTTCAGCCACGTCTTCACCGCTCATACCGGTTTTGTAAATGTGGCTGGCCAGCGCCTCAAGGCGCTGTGAAACCGCGAGCGCACGTGCTCTTCGCTCCTCGGTCTTTGCCTGCGCCAGCAGACGTACCAGTTTTTCTCCCTCGATCGGGGACGTGCGGTTTTCGCTATTTCGCATTACTCGTTCTCCTTACATTCAGGCAATAGAATGCCCGGCGGGTTTACGCCATTACGTTTGGGTGGGGTTACATCGGCAGGGTCAGCCGTTCGGGAAATAAGCTCACGACTGCACGAAAATGGTTCATGGCGGTAATCAGCGCCCTTTTCTCGTCAGTCGTCAGTTCACCGATCTCGCACGCATGACGGGTAGCGGGTAATTTCGCCAGGAAGAAGATGGCAGCCAGCGCGCGGGTGTTCTCTGCAAAACAGGGATCGCGCTTATCACGCAGCGCTGCAAAAAACCGCGTCAGCTCTTTCTCGCTATCGCCCCAGAATCTGGCGCGCAGTTCAGCCACGTGGTTAAGTCCGGACAGACGCGCCCCGACGCTGAGCGGAACGCTTGCCTGAGCAGCTTCTATCGCCATATCGCCCCTCGCTTCGTTTCCCGCAACGTGAATTATTCACCGATAACAAGCTGGAAGCGGGAATGGCCTAACGCCTTACGTAGC
TGCTCCTCTTTCCAGCGCGCGTAATAGATACGGATGGGGCCACCTGCTTTTTTGCAGCCCTTGCGGATAGTGCGCGGCTCAATGGGGACCCGGGGGTTATCGCCGGTCGTCCAGCGGTAGGCAGTGCGTTCAGAAACACCTTCCAGCGCGGCGAATTGCTGCAGCGTCACCACAGGGGCTGGCACTTTGATGATTGCGATTTCAGAAGCCATGTTGCATGATTCCCATTTTGATAATGACTGCAATCTTTAGCCTCTGTTTGCCAACGTCTGCCGCTGATTGCCCGAATTTGCAATGATACTAATACCCAATTGAGTATTAGTAAACACCCAAAGGAATATATTTTGATTTTAGATTCTCAAGTGAATAATGAAGAGTTGCTCGATAGAATCTGTCAGGTATATGGTTTTACGCAAAAAATCCAGCTTGCCCGGCACTTCAATATTGCCGCCAGCTCGCTGCAGAACCGCTACGCACGCGGCACTATCTCTTATGACTTCGCCGTGCAGTGCGCCCTGGATACGGGCGCCAGCCTGCGCTGGCTAATGACCGGGCAAGGTTCTCAGTTTGAAGGTCAGCCTGCGCCGGGCGATCCTGTATCTGTCGCGTCGTTCACTCTCAGTGATGGTCGACTGGAAGAAAATACTACTTTGAGTATCGACTCCCATTTCTTCAGTAAACAACTCACCCGCGGCATCGCCGTCCGGGCTGAGGGAAAACTTCACTTTGTTGAAAAAGACGCATCGTTAACCGACGGGCTGTGGCTGGTTGAGATTGAAGGCACCACCAGCATCCGCGAGTTGACGCTCCTGCCGGGTAAAAAGCTCCACGTCGCGGGCGGAAAAATCCCGTTTGAATGCGGTATCGAGGAGATAAAAACGATTGGCCGTGTGGTGGGTGTCTACAGCGAGGTTAACTGATGGCGGTTCGTAAAAATCCTGCCGGTGGCTGGCTTTGCGAGATCTACCCCAACGGGGCAAAGGGTAAGCGCATCAGAAAGAAATTCGCCACCAAAGGCGAAGCGCTGGCCTTTGAACAGTACACCGTTCAAAACCCATGGCAGGAGGAAAAAGAGGACCGCCGCACCTTAAAAGCGCTGATTGACGCCTGGTATAGCGCTCACGGCATTACGTTGAAAGACGGTCTTAAACGCCAGCTGGCGATGCATCATGCTTTTGAGTGTATGGGCGAACCGCTCGCGAGGGATTTCGATGCGCAGATGTTTTCCCGCTACCGGGAAAAGCGGCTAAAAGGCGAATACGCCCGTTCTGCCCGGGTAAAAGAAGTTTCTCCCCGCACGCTTAACCTTGAGCTGGCCTACTTCCGGGCGGTATTCAACGAGCTGAATCGTCTGGGGGAATGGAAAGGCGAAAATCCGCTGAAAAATATGCGCCCTTTCCGCACGGAAGAGATGGAAATGGCCTGGCTGACGCAGGAGCAAATCGCCCGGTTACTGGCCGAGTGTCGGCGTCATGACAACCCGGATCTGGAAAACGTGGTCAGGATTTGTCTCGCCACAGGCGCACGCTGGTCTGAAGCCGAAGGGCTCAAAAAAAGCCAGCTGTCGAAGTACAAAATCACCTATATCAATACCAAAGGGCGAAAAAACCGCACCGTTCCTGTCAGCAAAGCCCTGTATGACGCTCTGCCTGATAACAAAAATGGCCGTTTGTTCAGTGATTGCTATGGTGCGTTCCGCTCTGCGCTGCAGCGGACAGGTATCGAGTTACCCGCCGGGCAACTGACCCACGTTCTGCGCCATACTTTTGCCAGCCACTTTATGATGAATGGCGGGAATATCCTGGTTCTGCAGCGGGTGCTTGGGCATACGGATATTAAAATGACTATGCGATATGCGCACTTTGCACCTGATCATTTAGAGGATGCCGTTAAGTTCAACCCGCTGGCGGTGAGTGGCGATAAAGTGGCGGTAGAAATGGCGAATAATGGGTAA
Protein sequences of DBSCAN-SWA_7 >NC_018079|4203579:4231457|4219383_4221429_-|WP_014833335.1|tail|DBSCAN-SWA MTVKFKTVITKAGAEKLAAATVPNGKKVNFTAMAVGDGGGALPVPDASQTKLVNEVWRHALNKISQDKKNKNYVVAELLIPPEVGGFWMREMGLYDDTGTLIAVGNMAESYKPALAEGSGRAQTLRMVIMVSDISSVELTIDTSTVMATQEYVDGKLAEHEQSRRHPDATLTAKGFTQLSSATDSTSEVLAATPKAVKAAYDLANGKYTAADATTARKGIVQLSSATDSVSEVLAATPKAVKTAYDLANGKYTAQDATTAQKGIVQLSSATDSTSETLAATLKAVKVANDNANGRVPSGRKINGHALTGDFNISAQDIFNGQASGIGNAADLNTYTTPGLYYQPANAQAQTGKNYPEANAGSLEVYKHAGITQIYRIYNNSRSYIRTLYGATWSDWVKQYDAANKPSAGDVGAVPLNSDGSVDKVTALKYCNQIFTRTDGTPLEIGHLKGAAGNFYIDIHTDGTLNGQDYKYRLTFTDAGTAQTSGQWIPGNYANFDARYLQLAGGAVNGNLTVKNTLQIGSVGSAVLNIGDNDSGLRNSVDGQVDLWANSKVMGYWNTTTFSFTGQIIPTNYSNFDGRYYTKTESDSRFQAKGSYTPAGQAYTKAESDARYTLKSATVIGVRVSARGSMGSASSGSADIDAPVGSFITGRYSNNGDARYSVWYSRALQINVNGTWKTITA >NC_018079|4203579:4231457|4207134_4207350_+|WP_001144069.1|DBSCAN-SWA MPVIKVRENEPFDVALRRFKRSCEKAGVLAEVRRREFYEKPTTERKRAKASAVKRHAKKLARENARRTRLY >NC_018079|4203579:4231457|4229080_4229419_-|WP_014833349.1|DBSCAN-SWA MAIEAAQASVPLSVGARLSGLNHVAELRARFWGDSEKELTRFFAALRDKRDPCFAENTRALAAIFFLAKLPATRHACEIGELTTDEKRALITAMNHFRAVVSLFPERLTLPM >NC_018079|4203579:4231457|4226601_4228563_-|WP_014833346.1|DBSCAN-SWA MAISCAYPWNAPRTAIASPYLTHSQLQRRDRLFAALLQARKALSLQPECVRSEVWRTVDAIEQQQGTPRANAFLIRFCKGVLPRLQRVAEQYSRVGVHESVSAAVFDGHFDTPQLQYLASRMVDMVARFNRLPDMSRADIDLLGADIARFIRAELANMDDAGMGEYQTLHAWYQRAGVITRQFNVVPPHWERVSKTVFCKDDVAPAVMRMFSEAWWRGRLRRMAAAWREHLQIAIGHVSKKKSAYASKRCIAEWREQKRRTREFLKGLELEDEEGNRISLIEKYDGSVANPAIRRCELMTRIRGFENICHELGYAGEFYTLTAPAKYHATVQAGYRNGKWNGASPADTQHYLNRLWARIRAKLHREGMRIFGIRVAEPHHDGTPHWHMLLFMRPEDTGRVRRIMDDYACEEDAFELQRDRAKKARFHAEAIDPEKGSATGYIAKYIAKNIDGYALEGETDDENGELLKETAPAVSAWAARWHIRQFQFIGGAPVTVYRELRRLTERETAYGLSVEFAAVHDAADAGDWAGYVNAQGGPFVRRDDLQVRTLYESKEEYNQYGEETVCIRGVYDAAVGAGSPVLTRLKQWKIVPKRAVDLKGAIAPSWSSVNNCTPGDFSQLHRHRELPPPFQEKTRSILERTGHLTRQLAAVMR >NC_018079|4203579:4231457|4221440_4221971_-|WP_014833336.1|tail|DBSCAN-SWA 
MSDTRLLPVGSSSLEVAAARACAEIEKTPVPLRHLWSADSCPANLLPWLAWAFSVDRWDENWTEATKRDVIRAAWFIHAHKGTIGAVRRVVEPLGYLINVTEWWQTNDPPGTFRLDIGVLDTGITEEMYYEMERLIADAKPASRHLIGLNIIQDIPGYLFTGALSYDGDIITVYPG >NC_018079|4203579:4231457|4223992_4224460_-|WP_014833340.1|tail|DBSCAN-SWA MHKLKSLRQALIDAVPQLRANPERLQMSVWSGNIDARLAASLSFEKRYELKANVSGFSGDSEGIFVPVLAWLRENQPDIFTLDEGRKKGYLFQVVLNDDETSDISFSLQLTERILVSENEGTLLATYSPEPPLPEPVTRPMELYVNGELVSRWEA >NC_018079|4203579:4231457|4213588_4214053_-|WP_014833328.1|tail|DBSCAN-SWA MLMVLGLFVFERRTLPYQSMQFSKDYRWASNDRIGKPKAWQYLGEGDTSHTLSGTLYPEITGGRLSLKAIELMANEGRAWPLIDGTGIIHGMFVIDKVTHTHTDFYSDGTARKIDFTLSLKRVDESLMAMFGDLRTQAEELVTSARTEIGGLLG >NC_018079|4203579:4231457|4217550_4218744_-|WP_014833333.1|tail|DBSCAN-SWA MGDYHHGVEVIEINDGTRTISTVSTAIIGMVCTASDADEQTFPLNEPVLITNVQSAIAKAGKTGTLSASLQAIADQCKPVVVVVRVAEGVDNPDDPEAAQKETISNIIGTTDENGKYTGLKALLTAKTVTGVKPRILGVPGLDSQEVATALAATCQSLRAFGYVSAWGCKTIPEAINYRKNFSQRELMVIHPDFLAWDTTTNATKTAWATARALGLRAKIDQTTGWHKTLSNVGVNGVTGVSASVSWDLQEQATDANLLNQAGVTTLIRNDGFKFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHAWAMDKPITPTLIRDIISGINAKFRELKTNGYIVDGSCWYDPESNDVSTLKAGKLYIDYDYTPVPPLENLTLRQRITDTYLADLSDSVNS >NC_018079|4203579:4231457|4225682_4225886_-|WP_014833344.1|tail|DBSCAN-SWA MKIYAMQGDTLDAICARYYGRTAGVVEPVLKANPGLTEWGVVLPHGTAVDMPEVDSAPRKESVNLWD >NC_018079|4203579:4231457|4228785_4229013_-|WP_014833348.1|DBSCAN-SWA MRNSENRTSPIEGEKLVRLLAQAKTEERRARALAVSQRLEALASHIYKTGMSGEDVAELLCHEAARYERESQELH >NC_018079|4203579:4231457|4229451_4229715_-|WP_014833350.1|DBSCAN-SWA MASEIAIIKVPAPVVTLQQFAALEGVSERTAYRWTTGDNPRVPIEPRTIRKGCKKAGGPIRIYYARWKEEQLRKALGHSRFQLVIGE >NC_018079|4203579:4231457|4223224_4223866_-|WP_043951700.1|plate|DBSCAN-SWA MNTLTSMHGIARAIRNLIRLGVVTDVDLKRGLCRVQSGGMKTTWLNWLTCRAGRSRVWWAPSEGEQVLLLAIGGELDTAFVLPGIFSDDHPAPSESPDALHVSFPDGAVIEYEPGNGALTATGIKTATITASESLTATVPVVQVKAESRITLDTPEVVCTNKLITASLEVQKGGVMAGNIEHAGGKFTSNGVQVDNHAHGSVQSGGSWTKGTQ >NC_018079|4203579:4231457|4216504_4216624_-|WP_014833330.1|tail|DBSCAN-SWA MADIAVIFHWPPSELYPLSVTELLIWREKALQRSGNHHE 
>NC_018079|4203579:4231457|4218867_4219374_-|WP_014833334.1|tail|DBSCAN-SWA MIIKKFRLSGTEQIQGYSIANLISEDAQDWYEAQSLFEKDKLKFEFDENGIITRFSYDVSMLWPIGRSVADISKNLVPDGLSDNGEWMFDGEKIIPAPIDYVYLAESKKQILMSAATATIDPLEDASGLGIATKEEAELLVEWKKYRVLLNRVDISKAPDIEWPEKPQ >NC_018079|4203579:4231457|4229847_4230423_+|WP_014833351.1|DBSCAN-SWA MILDSQVNNEELLDRICQVYGFTQKIQLARHFNIAASSLQNRYARGTISYDFAVQCALDTGASLRWLMTGQGSQFEGQPAPGDPVSVASFTLSDGRLEENTTLSIDSHFFSKQLTRGIAVRAEGKLHFVEKDASLTDGLWLVEIEGTTSIRELTLLPGKKLHVAGGKIPFECGIEEIKTIGRVVGVYSEVN >NC_018079|4203579:4231457|4228564_4228786_-|WP_014833347.1|DBSCAN-SWA MADFIDLVQERELAERERHINSARRRPVSPSRFLCEACDAPIPEARRIAVPGVAMCVTCQEMTELKNKHYRGG >NC_018079|4203579:4231457|4217020_4217539_-|WP_014833332.1|tail|DBSCAN-SWA MALPRKLKYLNMFNDGLSYMGVVESVTLPKLTRKLEKYRGGGMPGSVSVDLGLDDDALALEWTIGGLPDVALWAQYASPGADSVPLRFTGSFQRDDTGEISAVEIVMRGRHKEFDGGENKQGESGTTKMSTECAYYQLTIDGKEVIEIDIINMVLKVDGVDRLAEHRKAIGL >NC_018079|4203579:4231457|4212179_4212380_-|WP_014833326.1|DBSCAN-SWA MFHCPKCKHSAHARTSRYLSENTKERYHQCTNVDCSCTFVTMESVERLIAAPGMPERARAPSVNRS >NC_018079|4203579:4231457|4212446_4213592_-|WP_014833327.1|DBSCAN-SWA MITEMNIRAGGKIAPDFMLKLNGRDITQNFSHRLIRLTMTDNRGLEADQVDIELDDSDGLLDLPARGAFITLWLGWQGTPLEPKGNFTVDDIEFRGAPDRLTIRGRSADFRGKLNVRREQSWHDTTLGAIVDTIAQRNQLSASVAAEFSSIAIAHIDQTQESDAAFLSRLAERNGAFVAVKEGKVLFMKAGGAVTASGKPVPLKVIERADGDKHSFSIADREAYSGVTAKWLQTSDPKQQHPKVSINRLPQAAATGALPHPDAAAVSGSTEQTPQETLVGAADNVFELTTIFASEAQALRAAEAKWRALQRGVVKFSIQLALGKADLFPETPLQVKGFKRAIDEQAWIVSRVVHTLDGRGFTTEVNLEPDIADEKFSIESE >NC_018079|4203579:4231457|4221963_4222872_-|WP_014833337.1|plate|DBSCAN-SWA MAIVDLSQLAAPDVVEEVDYETLLAERKATFVSLFPEAEQEAIARTLTLESEPIVKLLQENAYREVIWRQRVNEAARAVMLAYAAGSDLDQIGANANLQRLVITPADDTTFPPTPAVMESDTDFRLRIQQAPEGLSVAGSTGAYQFHGRSADGRVADISVISPQPANVTVSVLSRENNGVASEELLTLVRNALNDEDVRPVADRVTVQSAKIVDYTIAASLFLFPGPESEPVLNAARARLQAYITAQHRLGRDIRKSAIYAALHVEGVQRVELTAPAADIVLDETQASWCSQYSVTVGGSDE >NC_018079|4203579:4231457|4224567_4224981_-|WP_014833341.1|DBSCAN-SWA 
MRTLLLMLAVLMAITLWFRHENQNLSRAFAKANQIATEQKNTINALNHQLAVAQRMASENENAQVRLRDALAVTAEEMARREAAIGRLMNENEALRRWYHAQLPDVVRRLHTRTACASAAHCLQRLPKGEHLPDAGK >NC_018079|4203579:4231457|4205884_4206898_-|WP_014833323.1|tRNA|DBSCAN-SWA MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAALKEAGLNSTDIDAVAYTAGPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEENPPEFPFVALLVSGGHTQLISVTGIGKYELLGESIDDAAGEAFDKTAKLLGLDYPGGPMLSKMASQGTEGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRNNDDSEQTRADIARAFEDAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRTKLAEMMQKRRGEVFYARPEFCTDNGAMIAYAGMVRLNAGATSDLSVSVRPRWPLAELPEA >NC_018079|4203579:4231457|4211316_4211823_-|WP_014833325.1|DBSCAN-SWA MINDILAPGLRVVFCGINPGKSSAHTGFHFAHPGNRFWKVIYQAGFTDRLLKPEEEQHLLDTRCGITMLVERPTVQASEVHLHELRTGGRELIKKIEDYQPAALAILGKQAYEQAFSQRGVKWGKQAITIGVTQVWVLPNPSGLNRATLDKLVEAYRELDEALVVRGL >NC_018079|4203579:4231457|4230422_4231457_+|WP_014833352.1|integrase|DBSCAN-SWA MAVRKNPAGGWLCEIYPNGAKGKRIRKKFATKGEALAFEQYTVQNPWQEEKEDRRTLKALIDAWYSAHGITLKDGLKRQLAMHHAFECMGEPLARDFDAQMFSRYREKRLKGEYARSARVKEVSPRTLNLELAYFRAVFNELNRLGEWKGENPLKNMRPFRTEEMEMAWLTQEQIARLLAECRRHDNPDLENVVRICLATGARWSEAEGLKKSQLSKYKITYINTKGRKNRTVPVSKALYDALPDNKNGRLFSDCYGAFRSALQRTGIELPAGQLTHVLRHTFASHFMMNGGNILVLQRVLGHTDIKMTMRYAHFAPDHLEDAVKFNPLAVSGDKVAVEMANNG >NC_018079|4203579:4231457|4209365_4211213_+|WP_013098777.1|DBSCAN-SWA MEQNPQSQLKLLVQRGKEQGYLTYAEVNDHLPEDIVDSDQIEDIIQMINDMGIQVMEEAPDADDLLLAETSNNTDEDAEEAAAQVLSSVESEIGRTTDPVRMYMREMGTVELLTREGEIDIAKRIEDGINQVQCSVAEYPEAITYLLEQYDRVEAEEARLSDLITGFVDPNAEEDMAPTATHVGSELSQEEMDDDEDEDEEESDDDSADDDNSIDPELAREKFAELRTQYEVTRDTIKAKGRSHAAAQEEILKLSEVFKQFRLVPKQFDYLVNSMRVMMDRVRTQERIIMKLCVEQCKMPKKNFITLFTGNETSETWFNAAIAMNKPWSEKLHDVKEDVHRGLQKLQQIEEETGLTIEQVKDINRRMSIGEAKARRAKKEMVEANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKLNRISRQMLQEMGREPTPEELAERMLMPEDKIRKVLKIAKEPISMETPIGDDEDSHLGDFIEDTTLELPLDSATTESLRAATHDVLAGLTAREAKVLRMRFGIDMNTDHTLEEVGKQFDVTRERIRQIEAKALRKLRHPSRSEVLRSFLDD >NC_018079|4203579:4231457|4225470_4225692_-|WP_014833343.1|DBSCAN-SWA 
MGLSLEKVTTFIAYWLAVALAWFGAMSPEKVALYVGSLCAIFTALTNYWFKRKTWRYLNSLGLDKKSIRELNH >NC_018079|4203579:4231457|4224977_4225487_-|WP_014833342.1|DBSCAN-SWA MNSIIKRCSVASVLALAVLMPDFRLLHTSPEGLALIADLEGCRLSPYRCSAGVWTSGIGHTAGVVPTQAITERDAAANLIADVLNVEKRLAACAPVEMPPRVYDALVSFAFNVGTGAACRSTLVAFIHHKQWPQACDQLTRWVYVNGKKNAGLENRRAREKAWCMQGGR >NC_018079|4203579:4231457|4207466_4209212_+|WP_014833324.1|DBSCAN-SWA MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNYHACCPFHNEKTPSFTVNGEKQFYHCFGCGAHGNAVDFLMNYDKLEFVETVEELAAMHNLEVPYEAGSGPSQIERHQRQTLYQLMDGLNSFYQQSLKHSAAEPARQYLNKRGLSDDVIARFAIGYAPPGWDNVLKRFGGNSEDRKSLIDAGMLVTNDQGRSYDRFRERVMFPIRDKRGRVIGFGGRVLGDALPKYLNSPETDIFHKGRQLYGLYEAQQANAEPPRLLVVEGYMDVVALAQYDINYAVASLGTSTTADHIQLLFRVTNNVICCYDGDRAGRDAAWRALETALPYMTDGRQLRFMFLPDGEDPDTLVRKEGKAAFEARMEQAQPLSTFLFNSLMPQVDLSTPDGRAQLSTLALPLISQVPGETLRIYLRQELGNKLGILDDSQLERLMPKQAENGTVRPAPQLKRTTMRILIGLLVQNPELAPQVPSLAGLNHEKLPGLGLFSELVNTCLSQPGLTTGQLLEHYRGTKEAATLEKLSMWDDIADKDIAEKTFTDSLNHMFDSMLELRQEELIARERTHGLSSEERRELWMINQELAKK >NC_018079|4203579:4231457|4203579_4205592_-|WP_014833322.1|protease|DBSCAN-SWA MKKSLLSLLICAAVTCSAFAETTPQAEFAQDGESYIAIVVKLKPGKPLLKSTSTTTTTANTLTLSDPALVPTSMFRPNKLRSTQSDEFAQLNARYGFDRYVRIDLPENKSTDKNYINRVITELEQNQNVEMVYPESVPVSLDKYSGEEHQNKPPLKSTVQSSVGASSVPDFRSQQYYTKSPTDKRQGYYMGGVNRDSVNKYPGNEGEGVTIISMENDIWNVNHINLPKPALIQGSNKYVDDHDTASVGIMAARDIGAGVRGLSWKSRVGFSDWHYNNLYNMIPLLKAGDVVQIGMQTGGGEITGCTTTCYVPQENVQSYYDIIKALTDKGVYVIEAAGNGNVNLDHSAFNGKFDVNVRDSGAIIAGAFCAKEGKRAYFSTYGSRVTSSSWGCWDVVSAGFGNLYNAPNAQYTSTFAGTSSANPIIAGVVASLSGIAKANGITVTPRQMRQILAETGTPLANGESAKVGTQPDMERAVARIMALKDGNTPVAPAPTAAAGADYTMVSPATGVSTYPLDGSKSLNAKSYNWSVTKGAETFSLEAKLNGTLVKSVDSAHAYAVIPANSEGEAIFTLTTTGADGRTATDSMTIKVTKPAVPAKDDTPAKDDTPVKPAAPAYNAKIAYPTKCTKVSYNGKIWFNQWYVNPGQETPGTGGQWGAWREQGASNNSCK >NC_018079|4203579:4231457|4216656_4216965_-|WP_014833331.1|tail|DBSCAN-SWA MENINETVINESENPHTVTLDSPVLRGEQKIEKVTVAKPNAGTLRGVSLASLAQSDVDALIKVLPRMTSPALTEHEIARLDASDLLSFAGKVVGFLSPTSAR >NC_018079|4203579:4231457|4214064_4216512_-|WP_014833329.1|tail|DBSCAN-SWA 
MSNNVRLQALLDAVARASRPLKAIQNASLSLEGEIGESQAALRALDEQAARVDGFRKTSSQLTMTGQALALARQQTAALARELKNTQNPTREQNDALTAARQSAAALKLEYNTLRQSVQRQRGELAQAGINTRTLAADERRLRKSIGEKTQQLNGQREALARVNQQQERLNAVQRRYESGKRVATRVHQIAGAGVSMAKTGVDQMSRFMAPGISAEKQMSAIQASLGLEKGDARLEAIRQQARDVSAGTGTSTEAVTRAQTELVRSGYDAEGVLAATLPAVNLSLAGNVDAANAADIISNAQTAFNLANTDAGRVADVLTRGFTSSHTSLTELGAAVASVAPIASGAGVSLEETTALLGILMEKGMKGSTAGAGVGAVLSHLQMQNGQGSAALGELNVQTHDGQGNLLPAEQILTALSASFEKSQLGTAQQMAYLKTIFGQENAQGAAVLVSAAGNGTLSEKRQQLQGAKGSTARVAAVQSDNLAGDIGKLQAVWNGLKMDVFDKADGALRTLTTTATGWISTLSLWVNANPALTQGLVGVIVGAQAFAGIVGGLGTVVAPMVTGLNLVITAAGMLGTAFSVVGGSIMAILGALSWPIIAIGAAIAAGALLIFKYWEPVSAFFTGVMEGLSVAFAPLGELFSPLLQAFGFISEKLGGIWQWFTDLITPIKATQETLDSCKNAGVMFGQMLGEALMAPLDLFNSLSSKATWLLDKLGLIKTESDNIEPAAANAGAASSAPGSAYIPPTSTYGGYLKYQPALATGGRSYVDQSRSEYHFTLQGDMTSGADLSRQIQDALDSRENQRAANQRSRFMYE >NC_018079|4203579:4231457|4222877_4223228_-|WP_014833338.1|DBSCAN-SWA MTVRYTGMNSQTGLSITEAEHIRQSVRDILVTPVGSRVMRREYGSLLAAMIDRPQSPALRLQIMAACYSAIQKWEPRISLTAIAFERSESDGTLYVDITGTRPASDRSFSLTISLS >NC_018079|4203579:4231457|4226051_4226492_-|WP_014833345.1|DBSCAN-SWA MDRELNEQVMIERVEMIARLTTEGTCQERDREIALNLIAEIARGNLIKNNAFTVVFSASPVPERIKKESHVRVNITLDKDQHIGPSVVEAFQCELTRRIRPLFPSTRVTVKEGSMTGVELLGINGEAEREALDNILREVWEDESWR |
33 | Erwinia_phage(45.16%) | tRNA,tail,protease,integrase,plate | attL 4209371:4209385|attR 4236745:4236759 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
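The protein headers listed for this prophage region appear to share one pipe-delimited layout: contig, region start:end, gene start_end_strand, protein accession, an optional phage-related keyword, and the tool name. The sketch below splits such a header into its fields and reports the keyword when one is present. The field meanings are inferred from the headers shown on this page, not from DBSCAN-SWA documentation, and the names `ProphageProtein` and `parse_header` are hypothetical.

```python
from typing import NamedTuple, Optional


class ProphageProtein(NamedTuple):
    """One protein header from the prophage listing (layout is an assumption)."""
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    keyword: Optional[str]  # e.g. 'tail', 'plate', 'integrase'; None if absent
    tool: str


def parse_header(header: str) -> ProphageProtein:
    """Split a header such as
    '>NC_018079|4203579:4231457|4219383_4221429_-|WP_014833335.1|tail|DBSCAN-SWA'
    into named fields. Headers without a keyword have five fields, not six.
    """
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")

    contig, region, gene, accession = fields[:4]
    tool = fields[-1]
    keyword = fields[4] if len(fields) == 6 else None

    # Region is 'start:end'; gene is 'start_end_strand'.
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")

    return ProphageProtein(
        contig, region_start, region_end,
        int(gene_start), int(gene_end), strand,
        accession, keyword, tool,
    )


if __name__ == "__main__":
    tail = parse_header(
        ">NC_018079|4203579:4231457|4219383_4221429_-|WP_014833335.1|tail|DBSCAN-SWA"
    )
    print(tail.accession, tail.keyword, tail.strand)
```

Collecting the non-empty `keyword` values across all headers of a region would give the keyword list shown in the region summary row (here tRNA, tail, protease, integrase, plate), assuming that row is derived from the same annotations.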
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|