Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP022336 | Sphingorhabdus sp. SMR4y chromosome, complete genome | 1 crisprs | csa3,cas3,DEDDh,DinG | 0 | 0 | 2 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP022336_1 | 848776-848876 | Orphan |
NA
Consensus repeat of NZ_CP022336_1
|
1 spacers
spacers of NZ_CP022336_1
>1.1|848805|43|NZ_CP022336|CRISPRCasFinder CACCGCCATCTGGCCAAGCGGATTTTTGTTGAGCGCCCCCTCG |
CRISPR arrays and Neighbor proteins around NZ_CP022336_1
The CRISPR arrays of NZ_CP022336_1 >merge|NZ_CP022336|1|848776-848876|CRISPRCasFinder ACCGCTTCCTGAAATTGCTTCTGGTTCTTCACCGCCATCTGGCCAAGCGGATTTTTGTTGAGCGCCCCCTCGACCGCTTCCTGAAACTGCTTCTGGTTCTT >NZ_CP022336|1|1|848776-848876|CRISPRCasFinder ACCGCTTCCTGAAATTGCTTCTGGTTCTT CACCGCCATCTGGCCAAGCGGATTTTTGTTGAGCGCCCCCTCG ACCGCTTCCTGAAACTGCTTCTGGTTCTT
>NZ_CP022336.1|WP_089132405.1|847019_848555_-|proline--tRNA-ligase MKKNALSITREKDFAGWYQQVITEADMAEESGVRGSMIMRPWGYGIWERVQYLMDQRIKASGHENCYFPLFIPLSYFSKEAEHVDGFAKEMAVVTHHRLIKGEDGGLVVDPEARLEEPLVVRPTSETVIGTAFARWVQSWRDLPVMINQWANVVRWEMRTRMFLRTTEFLWQEGHTAHATKAEAMEETLNALEMYRDFSESVLAMPVVAGEKPENERFPGADATYSIEAMMQDGKALQAGTSHFLGQTFSKAQNIRYQDKEGEFQFAYTTSWGTSTRMIGGVIMTHGDDDGMRVPPLIAPYQVVIVPMLRDKPEDEEVLDYCKALAKDIGKLRAFDEPVRVKLDITGAKTQTKRWSWIKKGAPVILEIGPRDMADDKVAMIRRDALYQDSGKLDTKFLAKAEFIPEIPALLSEIQSALHAEARQRLEGNITRGVEHWSDVERFYKENKKYPGWLEVQWSKATGDELDAIEERLKKLKLTFRNVPVDAAPADGVCIFTGKPAVERIIIGKAY >NZ_CP022336.1|WP_089132404.1|844722_847020_-|ribonuclease-R MAGPRKTHKPKQVPTRKQILDFIERSDQQVGKREIAKAFGLRGSDKIALKALLKDMADEGLIDSTPGRAFHKMGGIPKVTVLKIVAIDGNEAIGVPERWEADNAPAPKLRIKEKGRRAALAVGDRILARTEERGQGHVAFMMKKLAQSSEMLMGVVEKDERDRYWLKPVDRKIRRSTPILDLGEAEPGHLVLAEPVKRGSEVKARVKDILGDPFAPKSFSLIAIHKFEIPLHFPDAVKEEAEQATKLPLSKDKREDLRHLPIVAIDPADARDHDDAIWAEPDDNPDNKGGFKALIAIADVSYYVRSGSKLDREAYKRGNSVYFPDRVVPMLPEALSTDVCSLKQGVERAALVCHVTINPKGQVIASRFTRAIVKLAGNIAYEDAQAAMDGKLDHELKANALAPLWACWKLLARARDEREPLNLDLPEKRVVLDDQGKIVEIAVRERLDAHRLVEDYMIVANVAAARMLESKKSPVIYRVHETPSREKLIALKDYLKSFDMSFAMGQVIKPAVFNNLIKQVDDELLLPLVMEQILRSQTQAYYGHTNMGHFGLSLGSYAHFTSPIRRYADLVVHRALVSACKLEQPAPENSSIPEFSGLGKEEAGRLERISETISKLERRAMKAEWETVDRYISAHLADRIGEVVKCRITGVQNFGFFATVEDLGGDGLVPVRTLGAERFDYDESKQQLTGMDSGTTYNIGQKIELRLVEANAATGSLLFELPEGANHMAARPAPYRARKGKPHRKGKPGRPANIRGNKGKARKKK >NZ_CP022336.1|WP_089132403.1|844121_844673_-|AhpC/TSA-family-protein MEKEFFPCRIFCRPKTVPDFTVPLIDGGKFHLSQRLGDNFTLLLFYRGVHCPICKMQLRELQRRLGAFSERGITVLAISMDSKERAEKSVDEWGVDELLLGYGLSEDLARDLDLYISSGRPGSEEPTIFSEPAMLLVKPDQTLYFASIQSMPFTRPPLDELLKGIDYAMEHDYPARGELAKSR >NZ_CP022336.1|WP_089132402.1|843669_844125_+|universal-stress-protein MRTYLVVIDETDEASLALRFASRRAMKTNSALHILALVESEGFVAWGGVQATLEEEAKSRAEALVSGAAGTLFEETGIRPSITVKQGKGPKVVRAMMDDVNGLAALVLGAAATGSPGPLVSHFAATEAGTLPCPVMVVPGSLTKEEIDRLS >NZ_CP022336.1|WP_089132401.1|842228_843533_-|pyruvate-dehydrogenase-complex-dihydrolipoamide-acetyltransferase MPINLQMPALSPTMEEGTLAKWLVKEGDSISSGDLLAEIETDKATMEFEAIDEGVIAKIVVAEGTENVKVGDVIAIIAEEGEDASEAKAAPAPKAEKSAEAAPKKEEPAPTPAPKASPAPAAKAAAGANDDRLKISPLARRLAEQKGVDLTAVSGSGPGGRIVKADIDAAEGGTAPVKTETPAPAASPAPSSAPKKGAETGSDFDIPHTVEKLSGMRKTIARRLTEAKQTIPHYRVSIDINIDKLLALRGELNASLESRGVKLSVNDLLIKAQAVALMQQPDCNVSFTEDNLIRYERADIAVAVSIKGGLITPIVTSADSKSLSAIATEMKDLAERAKAGKLKMHEYQGGTASLSNMGMFGVTTFDAVINPPQAMILAISAGIKKPVVVEDNVQIATIMNATGSFDHRAIDGATGAAFMKAFKEIVESPMGMLA >NZ_CP022336.1|WP_089132400.1|841848_842187_-|thioesterase-family-protein MPTEGNPYGVAFGGWLMAQMAQAGGALAAQHSRHQSVVVGANDVRFGHPVEIGDELSVYAEFTKIGRSSMTVEVEAYRRDRHADERTKAASGLYTFVAVDADGKPVQVPALD >NZ_CP022336.1|WP_089132399.1|840414_841827_-|dihydrolipoyl-dehydrogenase MANAYDLIVLGSGPGGYVAAIRASQLGLKTAIVERENLGGICLNWGCIPTKALLRSAEVFHNMKHAADYGLAAEKISADLDAIVKRSRGVAKQLNQGVTGLMKKNKIDVVMGDGTISAPGKLTVKTDKGEEELTAKNIIIATGARARDLPFAKADGKRIWTYRHAMTPSETPTKLLVIGSGAIGIEFASFYNDIGVDVTVVEMLDRIVPVEDEEISKFLEKSLKKQGMTIMTGAGVKSIETGDKGVKAEIEDSKGKKETHDFSHVIVAVGIQPNIETIGLDELGVQPDERYHIKTDEYCRTNIDGIYAIGDCTAGPWLAHKASHEGVIAAEAIAQSMGKDAHPHVMDVNNIPGCTYCHPQIASVGLTEAKAKEAGHDVKVGKFPFIGNGKAIALGETEGFIKTVFDAKTGELLGAHMIGAEVTELIQGYTIGKTLETTEAELANTVFPHPTLSEMMHESVLDAHGRVLHM >NZ_CP022336.1|WP_145955451.1|839637_840183_+|hypothetical-protein MQLSQSSSSVQNIWAGVLATSSVLGSLALACIFPFAAIATLLAASLPFRKAAAWMGAVWFANQLVGYLLLGYPQTANSFGHGLAMGATALAALFVAKTVLDIRSDRSLLSLGLAFAAAFATYQALLLVAATFLGGVQNFMPSIVWMVAQNDMLWFAGLGILYLVVDGTLIERVGKQASPQG >NZ_CP022336.1|WP_089132397.1|837715_839608_+|TonB-dependent-receptor MYKFRTIATLLASSAYSIAAPALAAEDREIIVTATGIEQDIEDTGVAISVIDEAEIRDQQIISVSDILQELPGVNVTQSGGLGSQSSVRIRGAESDQTLVLINGIRVNDPSSPDGAFDFGNLLAGNIERIEVLRGASGVTWGSQAVGGVVNITTKAPSDDLTLFAQGEYGAQDTVRLVSNASGKIGPVGLSIGGGYVRTDGISTYSGGSERDGYRQYSASGQLQVELTDAIRLEASGYYADSRVDNDSVFPPFSSDSDQFSLAEEIYGNAAIVAQALDGRLQNRLAFSISDINRDIINPSFTSLPRGRTERFEYRGDFAVIEQLRLVFGAETEDSRYRNSGVTDSTGIDSLYFQAVVKPLDGLVVTGGIRHDDHEDFGGNTSLSANLSYRTSDRGPTIRASYAEGFRAPSLIDLDDRPFGFGTPDLVPETAKSYEIGVDQSLIGNAVQLSVTVFQRDTKNQIAFAACPVAPDPAPEVCTNGSRPFGTTLNIESNRTKGVEAILSIAPVERLRFDANYTWLDTENRSTGANHGNELARRPASSLYFNTSYETAFGLNLGADMQMVGDSFDDLANNRRIDGYILAGARASIAVTDAFEVFGRIDNLFDVTYETATDFGSPGRSAYIGARARF >NZ_CP022336.1|WP_145955450.1|836872_837220_-|hypothetical-protein MIAFLSIALFSTVSAASIADAETARVAYSNCLVDFTVAQLDEKTGTGAFKKAAKTACVAEREAMISAIKKDELTYGSSDEEAASFATEEADGVLFAFTDSYAGYASSSTRPVRQD >NZ_CP022336.1|WP_089132407.1|849386_850442_-|alanine-racemase MSDEKPNIPPTARVKLDSDALLANWRALDLMSGDAKAGAAVKANAYGLGSREVVGRLLAAGCTDFFVANWQEAQEIEDLTTGNAEVSVLNGVRAADLPFALQSPAKPVLNSLEQVQRWKATGKPCDIMINSGMNRLGINVEDLKADLFAGMAIDMVMSHLASADEDGPQNEEQLTQYKAALAIVTGKRASLANSAGIALGSDYHFDVTRPGLSLYGGIQRPALDNVIAQVAIPQAEIIQVRSLQAGDKLGYNAQYVAAEPHKVGILAMGYADGYLRGFSNSGMFMHEGLLLPVLGRVSMDLIAIDLTAAPHLQEGDWVDCEYDLEVASAQSGLSQYELITGLGNRLGRVWT >NZ_CP022336.1|WP_089132408.1|850464_852099_-|MFS-transporter MTSEAAIAGQREPTQKEIRLVIAASSAGTIFEWYDFFIYGTLFYIIGPTFFPSGNPTLEILMVWATFAIGFGFRPVGAILFGFLGDKLGRKYTFLVTVTLMGIATAGVGFIPSAETIGLAAPLIVIFLRILQGLALGGEYGGAAIYVAEHAPTNKRGYYTSYIQASVAGGFVLSIGVILACRFLIPEQAFNDWGWRVPFLLSIMLLAISLWMRLKLNESPVFQAMKAAGETAGNPFVESFTYPGNKKRIFVALFGITGVLTTIWYTAFFSGMSFLRGPMNMEARTVDIILFVSGLIAMSFYLVVGKWSDRVGRKKPIIVGALLSLLLLFPAFWGLGQLANPGLTEAAEANPVRVEGSACSTDPFAELFSREQSDCGKILETLTSAGVSYTLVDTSELKLSAGSNPISIDPSWLDDGAARSSGIRDALAEYGYDFAKQQPDTIRILGIVAILLVLGALSALTYGSVAALLSEMFPAKIRYSSMSIPYHIGAGYLGGFLPLIAGYIVARSGDIYAGLWYTWVVVAFGVIVAWWGIPNDPEASLDEA >NZ_CP022336.1|WP_089134668.1|852131_854105_-|acetyl/propionyl/methylcrotonyl-CoA-carboxylase-subunit-alpha MVFKKILIANRGEIACRVIRTAQKMGIKTVAVYSDADARSPHVKMADEAVHIGPSPAAESYLIAEKIIQACKDTGAEAVHPGYGFLSERTSFAQELADNDIAFIGPPANAIAAMGDKIESKKLAEKAGVSVVPGHIGEIDDTEHAVRISNDIGYPVMMKASAGGGGKGMRLAWNEKDVREGFEATKREGLASFGDDRVFIEKFIEQPRHIEIQVLGDKHGNVIYLGERECSIQRRHQKVVEEAPSPFVDPEMRKKMGEQAVALSQAVGYYSAGTVELIVGADKSFYFLEMNTRLQVEHPVTEYITGLDLVEQMIRVAYGEKLPLTQDEVKLTGWAVENRVYAEDPYRGFLPSTGRLVKYRPPEEEEGIRVDDGVAEGGEVSIFYDPMIAKLITYGETRIEAIDRQIDALNRFELVGPGHNIDFLSALMQHERFREGTITTNFIAEEYPDGFQGAPATDELLVNLAAIGAFAATAHADRARRVDNQLGKRLEAPSEWQVKIGDKIMDVRISEEEIAVDGTAVDLSMEYTPGDSLILAEVAGKPLSVKIAKSTDGFALNSHGATHKARILPAHVAQHAVHMIEKIPPDLSKFLLCPMPGLLVALHVGEGDSVVEGQPLAVVEAMKMENILRAEKNGVVKSVEAAQGDSLAVDAVILELE >NZ_CP022336.1|WP_089132409.1|854222_855248_-|biotin-synthase-BioB MSDVTETEETEVRNDWTREEIAALFDLPFDDLVFQAATVHRANHKAGEVQLSTLLSIKTGGCPEDCGYCNQSAGAKSGLKAEKLLDVRTVLQNAAQAKDRGSSRFCMGAAWRNPKDRDMPAIIEMIKGVRQMGMETCMTLGMLTKKQSDMLSEAGLDYYNHNIDTSPEHYEKVITTRTFDDRLETLENVRNSGINVCSGGIVGMGETREDRVGFVHALATLPKHPESVPVNALVPVKGTVLGDMLADTPLAKIDDIEFVRTIAVARITMPASMVRLSAGRESMSESTQALCFLAGANSIFTGDKLLTTGNAGDDADETLFAKLGLVPMQAEQRDCALEAAE >NZ_CP022336.1|WP_089132410.1|855244_857380_-|methylmalonyl-CoA-mutase MTDKPTIKDWQELADKEVKGRDLTWETPEGIAVKPLYTAEDAGDPGLPGFGPFTRGVKASMYAGRPWTIRQYAGFSTAEESNAFYRRNLAAGQKGLSVAFDLATHRGYDSDHPRVVGDVGKAGVAIDSVEDMKILFDQIPLDEMSVSMTMNGAVIPCLAFYIIAAEEQGVSQEKLSGTIQNDILKEFMVRNTYIYPPAPSMRIISDIIGYTSEHMPKFNSISISGYHMHEAGATAVQELAFTIADGREYAKQAMATGLDIDAFAGRLSFFFGIGMNFFMEVAKLRAARTLWHRVMTDLGAQSERSKMLRTHCQTSGVSLTEQDPYNNVIRTTIEAMAATLGGTQSLHTNALDEAIALPTDFSARIARNTQIVLQEEAGITKVVDPLGGSYYVEALTEELVDKAWEIIERVGAEGGMAKAVADGWPKAMIEEAAAGRQAAVDKGDAVIVGVNKYRLPEEDPMETLDIDNAKVRQGQIARLEKMRADRDEAACQAALKALTDGARGGGNVLALAVDAARQRASLGEISDAMEAVFGRYETQPTPVKGIYGKAYENDARYAMVIDGVDAVSQRLGRKPKILVAKMGQDGHDRGANVVSSAFTDMGFDVISGPLFQTPGETRDLALAENVDAVGASSLAAGHKTLIPELINLLKESGRGDIKVFAGGVIPAKDYEFLRKSGVVGIYGPGSNIVECAADILRLLGHNMPPEEEAAE >NZ_CP022336.1|WP_089132411.1|857507_858290_-|enoyl-CoA-hydratase/isomerase-family-protein MEFENIKLDITDQVATITLNKPERLNACSLAMADDIFVALDKLDDARALVITGEGRAFCAGADLQAKNDSALSGGQGSYAALLQHYNPLMLKLAKLDIPTITAVNGPAAGVGCSIALASDFAIAGKSAYFLQAFVNIGLVPDGGASWMLTRLVGKARATEMMLLGEKIHGEKAADWGLIYKCVDDADLMDEAGALAKRLASGPTVALGVMRQNLAAALESDYASALVREAEGQRIAGDSKDAREGAIAFLQKRPTEFKGE >NZ_CP022336.1|WP_089132412.1|858378_858813_-|methylmalonyl-CoA-epimerase MKLGRMNHIGVATPDLDASIAFYRDVMGATDITEPFVLDSQKVRVCFVNTPGEGGTAGTQVELLQPTEADSAVGKWLEKNPLGGQHHICFEVPDIHAAKAEFEAMGKRVLGEPRIGAHGTLIFFVHPKDMGGMLTEIMETPKGH >NZ_CP022336.1|WP_089132413.1|858813_859758_-|DUF808-domain-containing-protein MPTGLVALLDDVAAIAKVAAASVDDIGAAAARAGTKSAGVVIDDAAVTPTYVTGFDPSRELPMIWKITKGSFRNKLVFLLPAAVLLGQFAPFLIPIILIFGGLFLCYEAAEKVLEIFHVDESTKHDQPAIVEGPGREKEMVSGAIRTDLILSGEIMAIALSELTDQVWWEQAIILAIIGIVITVAVYGSVALIVKMDDIGLHIAQNNEGALRSFGCGLVRLMPKLLAALSIIGTLAMAWVGGGLLVHNVAALGWHGPEHLIEWLSHPLLSLFPASAELMVGGIAFALLSGILGVLVGAGIAPLVHRFIPHGEGH >NZ_CP022336.1|WP_089132414.1|859872_861399_-|acyl-CoA-carboxylase-subunit-beta MSDIIAQLEAKRAEAMLGGGQKRIDSQHAKGKLTARERIEILLDEDSFEEIDMYVEHNCVDFGMEQTKIAGDGVVTGSGTINGRLVFVFSQDFTVFGGSLSERHAEKICKVLDNAMKVGAPVIGINDSGGARIQEGVASLGGYADVFQKNVLASGVIPQLSLIMGPCAGGAVYSPAMTDFIFMVKDSSYMFVTGPDVVKTVTNEIVTQEELGGAVTHTTKTSVADVAYENDIEALLAARDFIDFMPLSNREEAPERPTADPWDRLEESLDTLIPANANQPYDMHEVIRKMLDEGDFFEVQPAHASNIICGFGRMEGATVGVVANQPMVLAGCLDINASKKAARFVRYCDAFNIPIVTLVDVPGFLPGTSQEHNGIIKHGAKLLFAYAEATVPKITIITRKAYGGAYDVMASKHLRGDLNYAWPTAEIAVMGAKGAVEIIFRGRTEEEIAERTAEYEARFANPFVAAQKGFVDEVIMPHSTRRRVALGLRKLRNKQLENPWKKHDNIPL >NZ_CP022336.1|WP_089134669.1|861527_862877_-|glutathione-disulfide-reductase MAEYDYDLFVIGAGSGGVRAARVSASYGAKVAVAEEYRVGGTCVIRGCVPKKLLVYGSHFSEELEDGKNFGWTWENAKFDWGRLRDHVLKDVDRLNNAYTDTLKNHGVEIILERAELTGPHEIKLAGGKTVTAKYILIAVGAWPAVADFPGSELMSTSNEMFHLEKLPETIIIAGGGYIANEFAGIFNGLGSDVTVVNRSDIILRGYDESVRDRLIQISLAKGISYKFNCTFDRIEKRDDGALDVYMDGKKDPVRADIVLAATGRRPKVDNLGLENAGVATNDKNAIIVDDYSQTNVENIYAVGDVTDRVQLTPIAIREGQAFADTVFGDNPRTVDYDNIPSAVFSQPPIASVGLTESEAREQYGNIKIYSSDFRAMRNVFADRAERSLYKMIVEHPTEKILGLHMIGPDAPEILQAAAVAVKAGLTKQAFDDTVALHPSMSEELVLLK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
341814 : 362214
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP022336|341814:362214|DBSCAN-SWA ATTAAGCGGCAAGCTTGCGGTGTAGCAAGCGCCGATGTTCGATGGTTTGTTTTTTGATCCTTTCTCGTTTTTTGATGATGGCGAGTGCTCTGCCGAAGTAGGCATCGGCAGGCGTAACGTTGCCCAGGCTTTCGTGATAGCGCTGGTGATTATAATGTTCGACGAACGCCTCGATCTGCGTTTCCAAGTCCCCTGGCAGGAAGTAGTTTTCCAGCAGGATGCGGTTCTTCAAAGTCTGGTGCCAGCGTTCGATCTTGCCCTGTGTTTGCGGATGGAAGGGGGCGCCGCGTACATGGCTCATCTTCTGTGCTTCGATATATTCTACCAGTTCACCCGCAATGTAACTCGGGCCATTGTCAGAGAGCAGCCGTGGCTTGTGCATAACTGCTGCCTGGTCGCAGCCTGAAGCTTCCAGAGCCATGTCCAGCGTGTCGGTGACGTCCTCGGCCCGCATGGTGCTGCAAAGCTTCCAGGCGATGATGTAGCGCGAGTAATCGTCCAGCACCGTCGATAGATACATCCACCCCCAGCCAATGATCTTGAAGTAGGTAAAATCTGTCTGCCACATCTCGTTGGGCCGCACGGTCGGTGTGTGGAATCGATCAGCAGCTTTGATCACGACATAGGCGGGGCTGGTGATCAGGTCGTGGGCCTTCAGAAGCCTGTAAACCGTGGCTTCTGACACGAAGTAGCGCTGTTCATCGGTGAACGTCACCGCCAGTTCGCGTGGACTGAGCTCCGAACGTTCCAGCGCCAGATCGATGATCTGGTCATGCACTTCGGCAGGGATCCGGTTCCACACCCGGCTTGGAGCCGATGGCCGGTCAGCCAGCGCTTCAGGCCCCCCTTCCAGATACCGATCATACCAGCGGTAAAAGGTTCGGTGCGGGATCCCCAACTTATCCAGCGTCTTGCGCGCTGACAGATGTGATTGCTCGACGATCCTGATGATCTCAAGCTTCTCCGATGCCGGATATCTCATGCGTCGTCGTCCCCATCCGCGATCATGCTTTTTTTGAGCAGACGGTTCTCGAGGGTCAGGTCGGCCACGCATTCCTTCAGATCACGCGACTCCCGCCGCAGGTCTTTCACTTCATCGCTGGTTGCAGCACGGGCCGTGTCACCGGCCAGCCGACGTTTGCCGGCTTCCATGAACTCCTTCGACCAAGTGTAATACAGGCTCTGCGCTATGCCCTCCTTGCGGCACAGCTCGGCAATGCTGTCATCACCGCGCAGGCCGTCGATCACGATCCGTATCTTGTCTTCTGCTGAGAAATGCCGGCGGGTCGCACGGCGGATATCTTTAACCACCCGCTCTGCGGGCTTTTTTGCGGTGGAGTGTTTTGGTTTCATGCTTCGTTCCTTCGTCACTACGATGAAACCAAAACACTCCTTAATTCACAACATCAATTTGGTGACATAGGTGCTGACGGGGAACAGTCTATCGTCAACTGCGGGTCGACATCATTTCCTGCCGGCTAAAGCCTAACGAGAAGCTCCGCTTTGATGCTCTTCGCAAGAGATATGGTGCCGGTGTCGGTACGCTTCGCGAGGCCTTGTCTCACCTTGTATCCGATGGCCTTGTGCGCATTGATGTCGGGCGCGGGTTCAGGGTGGCTCCAGTTTCAGTCGAAGATCTTGAAGATGTCACATCTTGGAGAATTGAATTCGAAACGCGAGCACTCGACGCTTCGATTGGCAACGGCGATGATAACTGGGAAGCAGAAATCATTGCTTCCTATCACCTGTTGAGCAAGATTAAGGTGCCGCGCGTAGATGCTCCCGCGGAAGAAATGTACGACTACGGCGAGAAACATATGCGCTTTCACGACGCTCTGGTCGCTGCCTGCGGATCGCCATGGTTGATGTTTTTCCGCTCTGCGCTGCAGACTCAGGCCTTGCGCTATCAGGCGCTCGCCATGACAGACCGGGAGCACATGGTGAACCGGGCCGTGGACGAGCATCAAGCAATCAAGAATGCTGCGGTAGCACGCGATACAAAGCAGGCGATGGCACTGGTTACGCATCATATACAGCAGACATCAGACGATGTGAAAGTTCTGCTGCAGACTTCCGGCAAGCTGGACGAGTTTCTACCGGGACCAGTCGCCGAGGAAAAGCCGCGCCGCAGAGGACGTCCACGCAACACTGCCTGAATTCGGTTGAACTGCTATGCCTGCTAAGAAGCGCTGTGGTCCCAAAGGGCCCTCAGCGCTGGCAGCAGATGCTCGCGATGTTCAGGCGGGACACGCTTAAGGAAATTCTCCTCATATTTGAGCAGGATTGCTTTCGCACGGTTGCAAAGTTCACTCCCTTTCTGCGTGAGTTCGAGTCCAAATGATCGGCCATCACGCTTGATTCTGACGATTTCGCCGCGATCTTCCAATCGCACAACCATGGGAACCATATTCGCGCGCTGGATGCCCAGAATTCTGCCCAGTTCGCTTGGCGTGACCCGGGGGTGGTCGTCGATTGTAATCAGCATCGACGCATCGGACGGTCGGAGATCAAGCTCACTCAGTCCATCTGTCAACTCGCTCAGGATAACCGTTGATGCGCGGCGCAGATTGTAGCCCGGCAGGCGTTTCAGAAAATCATCCATAATGCCGTGTATCGCATATATTTTTCTGTAGCAATCGCAATTGTTATGCATCATAACATTATTTATTGCGAGGCACCGAAAGTGCATGACGTGTCGCCTGGCAGGGCCACATCCAGAGAACCTTCCATGCTGAGGATATTGTCCGGCCGTGTCAAAAAGAGAGGAATGATTTATGGATTTCGAACCCACCGAGCAGCAGGCCCAATGGCGCGATAGAGTGGCGCATTTCATGAACACGAAGGTCCGTCCCGCCATTCCGCGTTACCAGGCAGAGCAGGCCACTGGCGAACGATGGAAAATTCTCGAAGTTGTGGAAGAGCTCAAGGGGGAGGTGAAGGAGGCGGGGCTGTGGAACCTTTTCATGCCGCCGCGCAATGACAGCCACCATCATGTGGACGAAACTTTTGAATTCGATGGTCCGGGTCTGACCAACCTTGCTTATGCGCTTTGTGCCGAGGAAATGGGACGTGTTGAATGGGCGGCAGAATTGTTCAACTGCTCGGCGCCCGATACTGCCAATATGGAAGTCTTGCACCGCTATGGCACACGTGAACAAAAGGATCGGTGGCTGAAACCTCTGATGGACGGAGACATCCGGTCCGCGTTCCTGATGACAGAGCCGTTCACGGCGTCGTCCGATGCCACCAACATCGAAACAAGGATTGAACGGGACGGCGATCATTACGTGATAAACGGGCGCAAATGGTGGTCGTCGGGTTTCGGCGATCCGCGTTGCAAGGTCGCGATTGTGATGGGCAAGACCGATTTCAATGCAGCGCGTCATGCCCAGCAATCCATGATCATTGTCCCGAAAGATACGCCGGGCGTCACGCCGCTGCGATACCTGAATGTCTTCGGCTATGATGGTGCCCCGCACGGGCAGATGGAGGTGGAACTCAAAGACGTGCGGGTGCCGGCGGACAATATGCTGCTGGGCGAAGGGCGCGGATTCGAAATTGCGCAAGGGCGTCTTGGGCCAGGCCGGATTCATCACAGCATGCGCAGCATCGGGCTTGCCGAAGAAGGTTTGCACAAAATGTGCAAGCGTCTTCAGGAACGCGAAGTCTTCGGCAAACCGATCTACAAGCACTCTGTCTGGGAGGAGAGGGTGGCTCGGGCCCGGATCGATATCGATATGACCCGGCTGCTGTGCCTCAAGGCTGCTGATATGATGGACAAGGTCGGTAACAAGGGCGCGAGACAGGAAATCGCGATGATCAAGCTTCAGGCACCACAAGCTGCGCTCAGAATTTATGATGATGCCATCCAGGCATTCGGGGGTGCCGGGGTGGCCGACGACTATGGGCTCGCCGAGGCTTTTGCCGGTCTGCGTTCTTTCAGGCTCGCCGATGGGCCGGATGAGGTCCACGCACGGTCGATTGCACGATTGGAGTTTGTCAAGCATCCGCCGGAAGCCGGCCCGGCTGCAAACGCTCTTCGCGGCGATCAGGCGCGAAATGCTGGCGCTTGAATCCCTCGATTCCGATGGCCATCCGCTAAAGCAAGTCCACCAGCTTGGATCCTTATGATGAGAACGGGAGCGGTCTTGCCACCCGTCTCACCGCCAATTTGAATATTGCCCACTCGGACATTTATTGCTATGAGATATAACAATCGGGAGAGAATCATGAATTTCCAACGGGTCAATCCACTCACCGGCGATATCGCATCGACCGCAGTTGCCATGCAGGCTTCGGATATGCCGGCTATTGGAGAAAAGGCGCAGCAGGGCTTCGCAAAATGGTCGGTGATGGGGCCAAATGCCCGCCGCGCGGTTTTGATGAAGGCGGCTGAAGCGCTTGAATCCAGACAAGCTGATTTTGTTGCCGCTATGGCCGCTGAAATAGGAGCCACTGCAGGTTGGGCGATGTTCAACCATGGACTTGCTGTCGGTATGGTGCGCGAAGCAGCTGCGTTGACCACCCAGATTTCGGGCGAAGTAATCCCGTCAGACAAACCGGGCTGTCTCGCTCTTGCCTTGCGCGAGCCGGCTGGCGTGATCCTCGGTATTGCGCCGTGGAATGCCCCGATCATTCTGGGCGTGCGCGCCATTGCTGTGCCGCTTGCTTGCGGGAATGCTGTAATCCTCAAGGCAAGCGAGCAGTGTCCGCAAACGCACGCCTTGATTATTGATGCTTTTAACGAGGCGGGATTCCCTGAGGGCACCGTCAATCTGGTCAGCAACGCCCCTGACGATGCAGCCGATGTCGTGGGCGCGCTGATCGACCTGCCTGCGGTCAAGCGGATCAACTTCACCGGTTCGACCGCAGTGGGCAGGATCATCGCGAAGCGCGCAGCCGAGCATCTCAAACCATGCCTGCTCGAGCTTGGGGGCAAGGCACCGCTCGTCATTCTCGAAGATGCTGATCTGGATGAGGCGGTTAAGGCGGCAGCTTTTGGCGCCTATATGAACCAAGGCCAGATATGCATGTCGACCGAACGCATTATCGTGGTCGACGCTGTCGCGGATGCATTTGCGGAAAAATTTGCCGAAAAGGTCCGTGCTATGCCTGCTGGGGATCCGCGCGAAGGAAACACGCCGCTCGGCGCTGTCGTTGACACGAAAACGGTCGCGCATTGCCGATCGCTGATCGAGGATGCCGTAGCCAGGGAGCGAAGCTGATTGTCGAGGGCGAGACGACCATGAATGTCGTAATGTCCGCGCATCTGGTGGATCATGTGACGCCGGAAATGAAGCTGTTCCGCGATGAAAGCTTTGGTCCGGTTGTGGGGATCGTGCGCGCCCGTGACGAGGCGCATGCGATCGAACTGGCGAACGATACCGAATACGGCCTCTCTGCAGCAGTGTTCACCCGGGACAGTGCGCGCGGCCTCAGGGTGGCGCGCCAGATCCGGTCGGGCATCTGCCACGTTAACGGTCCGACGGTTCATGATGAGCCGCAAATGCCCTTCGGGGGTGTCGGTGCCTCAGGTTACGGCCGGTTTGGAGGCAAAGCCGGGATCGATGCTTTCACCGAATTGCGCTGGATCACCATGGAAACCGAACCGGGGCACTTCCCGATCTAAGTGCATTCATATCACCTTGTCGGTGTTCCGACAGTTTAGCCATCAGGCCCTTCAGGAGAAGTCAAATGACCAAAGTTATCTCGCCAAGCGGTGTTGCAGCATATGAAATCAAGGATGGAATCGCGTGGGTCTATTTCACCCGATCAGAAAAACGCAATTGCATGAGTCCGACGCTCAACAAGGAGATGGGCGATTTGCTCGCGGAGATCGAGTTCCGCGACGATTTTGGTGTTCTGGTGCTAACCGGTGAAGGCACATCCTGGTCTGCGGGAATGGATCTCAAGGAATATTTTCGCGAAGCGGAAGCCAAGGGGCTGGGGGCAATTCGCGAGGCGCAGGCACAGGCCTATTCCTGGTGGCGCAAGCTGCGCTGGTACCAGAAGCCGACAATCGCGATGGTCAATGGCTGGTGCTTTGGCGGTGCCTATGGACCACTGTTTGCGTGCGACCTCGCCTTTGCGGCAGAAGACGCCCAATTTGGTCTCTCTGAAATAAACTGGGGCATCCTGCCTGGTGGTGCAGCATCCAAGATCATCCGTGAGCTCGCGAATTTCCGCAACTCGATGTACCACGCGATGATGGGCGAAAACGTCGATGGCAAGACCGCCGCGCAGTGGGGACTGGTCAACGAGGCCGTTTCCGCAGACAGGCTTGAAGACCGGGTAACCGAGGTCGCCAAGGTGCTTCTTGAAAAGAACCCGGTTGCGCTCAAAGCGACCAAGGATGCGATCCGCCGTGTGGGTGAAATGACCTACGACAACGCAGAGGACTACCTCGTTCGCGCGCAGGAAGCTGCCAACTACTTCGACAATATGGGCCGCAAGGAAGGGATCAAGCAGTTCATCGACGACAAGACATTCAAGCCGGGGCTTGGCGCTTATGATAAAAGCAAACAGCCTGTATCCGAGTGACGAGGCCGGCTATGCGAACTCAGGTCTGTATTATCGGCGCAGGACCGGCCGGCCTTCTACTCGGGCATTTGCTCCGGGCGGAAGGGGTCGAATGTGTGGTAATTGAGCGGCAGACGCCGGACTATGTTTTGGGCCGCATTCGTGCGGGCGTGCTGGAGAACATCACAGTAAGCCTGATGGAGCGACTTGGCCTGGATGCTCGGATGAAAGCCGAAGGCCTGCCGCACGATGGCTTCAATCTCGCAGATGGCGAGCGATTGATCCGCATCGATATCGCCGAGCTCGTTGGCCAGGAAGTGATGGTATACGGCCAGACCGAACTGACACGCGACCTGATGGAAGCGTGCGAAGAACGCGGCCTCGAGGTCATCTATCAGGCTGCGGACGTGGCTTTGCATGACGTGGATGGTGACGCGCCTTTCGTGACTTACATCCATGAAGGGGCGCAGCAGCGCATCGATGCCCGATTCATCGTCGGCTGCGATGGTTTTCACGGACCAAGCCGCAAGGCTATCCCCAGCAGCGTTTCGCAGGAGTTCGAGAGGGTCTATCCGTTTGGCTGGCTCGGCATCCTCGCCGATGTCCCGCCCTGCAATCACGAACTGATCTACGCAAATCATGAGCGTGGCTTTGCGCTGGCCTCGATGCGTTCACCGACCCGCAGCCGCTATTACGTCGATGTGCCGGTGGATGAGGATATCAGGGAGTGGAGCGACGATCGCCTTTGGGATGAGCTTGCCATCCGGCTTGGCCCCGCAGCCGCTGCCAACATCACCCGCGGACCTGCGATCGAGAAAAGCATCGCACCGCTCCGCTCCTATGTATTCGCGCCGATGCGTCACGGATCGCTCATGCTTTGTGGTGATGCGGCGCATATCGTGCCGCCAACCGGAGCCAAGGGGCTCAATCTCGCGGCCAGCGATGTGCACTATGCCTCCGAGGCGCTGACGCGGTATTTCAAGACGAACGACCCGGCCGGGGTTTCGCGCTATTCTGAAACGGCGCTCGCTCGCGTCTGGCAGTCGGAACGCTTTAGCTGGTCTCTGACCAAATTGATGCATCGCTTCCCCGACGATGGCCCATTCGAAAGAGCGATGCAGGTCGCAGAGCTCGACTATATCGCAAACAGTCGCGCGATGCAGACCGCAATCGCCGAGAATTATGTCGGTCTTCCGGTCTAACCTGGAAGCGGTCATACTTCGAATAAAATTATAAGGTGGCTGCGCTGTCTGGTGGCATCGCATCAGCTGGCGAATGGCAAAAATAAAAATGCCCTCTACATCTGCTTGAAATGAGGATTTTTTAATAGGTAGTCAAATTTTCGAATTGGCATCTATCCTGATAAAAGAGCATAGCAGCGGAAAGGTCTGTATCGGAGGTTGTTTATGGGTGTCGCGGTTCAGAAAATCGAACGGGAACTTCCCCGAGACGGACATCTGGATACCATGACTCCGTGCACCGCGCGCACCATGGAGCGTCAGCGCAAGTTGCTCGTAGCGAGCCCCGAAGCACTGTTTCGACCAGTGCCGGATTAACGCGATGTCCCACAAAAACATCCTCGAACTGCGCCACGTAACATGTAACTCCAAAGATCGGGGCCGGAAAAATGGCTGACCAGAAAATCGGCGGCGGCGCCAGGAAAATCCTCTACACGCTGAACACCATCCGCAAGATGGGCGTGACCAAGTCCAGCAAGGCGCTGACCGCCAGAAATACGTGCAAGGCGTGCGCACTGGGCATGGGTGGTCAGCTCGGCGGGATGACCAACGAACTGGGTGAATTTCCTTCGGTCTGCAACAAGAGCGTTCAGGCGCAATCGTCGGATATCCAGCCTGCTATCCCTGAGGAAGTCTTCGCCCACGGGATTGATGATTTCGCCGCGCTTTCGGCGAGAGAGATTGAGGGGTTAGGGCGTTTGGGTTCGCCGATCTTCAAGGACCAGGGCGAGGACCGTTATCGACCGGTCAGCTGGGATCAGGCCATGGAAATCGCCGCCGCTCGTCTGGAGGCAGCCGAGCCGAACCGCACCTTTTTCTACAGTTCCGGGCGTTCATCGAACGAGGCGGGCTTTCTCTTCCAGCTTCTCGCCAGACTATATGGCACCAATAATGTCACGAACTGTTCCTACTATTGCCATCAGGCGACGAGCGAGGCCCTTGCCAGCACCATCGGGACAGGCACTTCGACCGTGGAGATTGCCGATCTTTCGCTTTGCGATCTGATCTTTGTCATCGGTGCCAATCCCGCATCCAACCATCCCCGTTTCATCCATCAGTTGGCCAGTTGCCGGGCTCGCGGTGGCCAGGTGATCGTGATTAACCCGGCCAAGGAGCCGGGACTGGTGAAATTCGCGCTGCCAAAATCCCCGAAGTCGCTCATCAAGGGCGGCGACGAGATTGCTTCCTTCTATCTCCAGCCACGGATCGGTGAAGATCTCGCGCTTTTCAAGGGAATTGCAAAAGCCATTGTGGCGGACGAGGCCATCGACCGGACCTTCATTGACCAGCATACGCAAGGAAGCGAGAGCTTTCTTTCGGACCTGGCTGCGACAAGCTGGGAAGAGATTGTCGAGCGGACGCAGATCGAAAAGGCAGTGATAGAGCAGGCGGCGGCCCTTTACGCGAAGTCGAAATCGGCTGTCTTCGCCTGGGGCATGGGCATGACCCACCACCACAATGGTGTCCAGAATATTGAAATGATCGCCAACCTGGCCCTGATGCGCGGAATGGTTGGACGGCCGGGTGCCGGCCTGTTGCCCCTGCGCGGTCATTCAAATGTTCAGGGTATCGGGACAATCGGTGTGAAACCCGTGCTCGCCGATGACGTGATGCATGCGATCGAGAACTATTTTGACATATCTTTGCCGGAAGAGAAGGGGCTCGATACCATGGCGTCGATGAAGGCGGCAGATGCCGGCCAGATCGACGCTGCGATGATGATGGGCGGCAATCTGTTTGCGGCCAATCCCAATGCCGAGTGGGCAAAATGTGCGCTAGAGAAAATCGGGTTCAAACTTTTCCTTACCACCACCCTGAACCACGGCCATTTCAAGGGCATCGGCGACGGAGCAAGCCTTGTCCTGCCCGTCACTGCGCGTGACGAAGAATGGGAGCCGACTACCCAGGAATCCATGTTCAACTATGTCAGATTGAGCGATGGCGGAATCAATCGCCTGCCTTCAGTCCGCCCCGAGAGCCACATCCTTGCCGATCTCGCGGAGAGGATTTTGCCCGATTGTCCCATCGATTTTGTCGAGTTCAAGCATCACAAGAATATCCGCAAGGCGATCGCAGCCACCGTTCCGGGCATGGAAGAGCTGGCCGGTCTGGATGTGGCCCGGCGGGAGTTTCACATTGCGCAGCGCCTGCTTCACGAGCCGCAGTTCGGAACGGCATCGGGCAAGGGAGAATTCGCCACTTGTAAATTGCCAGCCAGAGTGGCTGAGGATGACCAGTATAGCCTGAGCACTATCCGCAGCGAAGGTCAGTTCAACTCGATCATCTACGAAGAGGCTGATACATATCGCGGCACCCAGTCGCGCTGGACGGTGTTGATGAACCCGGAAGATATGGCGGCTATGGGCTTGGCGAAAGGGGACGCTGTTGATCTTTTATCGGACAATGGCAAGATGATCGCGGTCGCCGCGCATCCGTTCGATATCCCGCGTAGGAATATGATGGCCTATTATCCGGAGGCGAACGTGCTCACCGGAACCGACGTCGATCCGCGCAGTCATACGCCGTCTTTCAAGAACACACCGGTCCGCATTGCCATCCGCTAGCCCAACTGCGCTCTCGTTCTCGGGGGGGGGACGAATCGAGGAAAAGCTCGATAAAAGGGCGGGCAAGAACTGTGATGGCCTGATCGGCAAGGCCGTCAAGCATGCAGGCAATCCCGGACCGCGCCAATAGGACCCTATGCGCATGGAAACGCCGGGCCTCTGTATTGGGCTGCGAAAAGCATGAGGCCGCCATCCAGAGCGCCTGTGAACATCGATCACCGGAGGCGGTCGGGTGCAGGGGCGTTAGCTATATATTGCGAGCGAACACTCCTCATTTCGAGCCTGCAGGTCACAGAATAGCGCGAGCAACGGCGTCTGTGAAATTCGGTCGACGCTCCATCCGTGAGGGAGAGCAAGACGTCATCGCTTGCATCAACATGCCGTGTCCATACTGGAACCGCTTTGGAACAAGTTTTTGCTGCGATTGCAGATGATCCCGATCAGACAATGCGACCCGACCGCAGGCCAACAAATGATCTCAAGCGCTCAGGACTGCAATGACGATGTGAGACGCCGGTTGCCGGATATCGGTTCACCGTTTGAATTAATTAGAATGGTGAGCCCTGCTGGGTTCGAACCAGCGACCTACTGATTAAAAGTTTTTTTAAGCCGATATTTTGATATCCCACATATTCCCTAAACGTCTGAAATAATTAGATAATAGAAAAATAATGTTCTTGATATGTTCCAGCCAATCCCATAGTATCCTCTCGAAACGCTACCCGGGTAGCACGTTTTTAAGAGGCCAAAATGCCAAATCAAATAAAGCTTACTGATGCTAAATTAAAAGCGGTGAAAACACCATTGCGAGGACGAGTGGAGCTCGCAGATAGTGATTGTGTTGGGTTGAGATTTGTAAAAGGATCGACAGGTAAATGTCGTTGGATTGTTCGAAAACGGGTTGCTGGAAAGTTCAGAAAGATCACTCTGGGAGACTATCCCGGGATTGGTCTGGCTAAGGCTAGGCAGAAAGCGCTCCAGGCTTGTGCTGACATTGAGAGCGGCCGGAAGCTGGCGAAGCCTGATAAACGATCGAACAAAACGGTTTCCGCACTGTGGAGTACTTACGATCAGCAGCGCGTTGAGAACAAGCGATCTGCAAAAGAAATACGGCGCATATTCAAAAAGTACGTTTTACCGGAGCTTGGCGGTCGTCCGGTGGACGCCATAACGCGCACGGATGTTACTCGGTTAATTGATGGCATCGCTTATGGCGAGCGTTCTGCGCCGGTAATGGCCCGGCTTGTTGCTGCGCAAATGTCAGTGTTTTTCAATTGGGTTTTACCACGAGTAGAAACTCTAGCGTACAACCCAGTAACAGCAGCAAGCAGACCACCAGCACCGAAACCTCGTGATAGGGTTTTGTCGAATGAAGAATTGAAAAGCCTTTGGAAGGCTGTTGAAAATCAGCCGTTTCCTTGGCAGCATAGTTTGAAACTCATGCTGTTCACCGCTGCACGGCGAAGCGAAGTATTTGGAGCTGAACGTGCGGAGTTTGATTTAGAAGAAAAAGTCTGGAACTTACCTGCGTCGAGCGCAAAAAATAACAACGCTGTTATCATTCCACTTTCCGAAGAAGCTTGCCGTGTCGTTTCAGGTTGCCTGGTTACTGATGGTAGCAGCAAGCTGTTTCCTTCTGCGACAAATCCAAAAACTAGTGCATCAGGTATATCCAAATTGTTGCGACGGTTGAGGGCCGACGTTCAATCTGACTTGCAACGTCCCGTTCTGCATTGGACATTACATGACATCCGGCGGACTGTTGCTACAAACCTGCAGCGGCTGGGAGTAAGGTTGGAAGTGACTGAGGCAATTCTCAATCACGTTAGCGGTTCTCAGGCTGGTATCGTCAAAGTTTACCAACGATATAACTGGATGAATGAAAAACGCGAGGCGTTGCAACTTTGGTCTGATGATCTTGTTCGTATCTGTGACTGCAGCCGACAATAGGTTATTTTGTTTCACCAACCCAATCAACCGCAATGGAGTCAACTCCAGTTTTCACACCCTCAACCAATACCATGTGATTATCTTCGGGAACTTCATTGTCTTGAAAATCCAGTTTCCAAACGCTTTTATCATCACATTGCAGGACATATCGGCGGTCCCCTTTGATGAGTTTGCCAGTCAGCCGAAGGCGGTTTGAGTTTGAATGGTTCATGATCAAGTGTCGCGCGACAATCTAGGTGAAAAAGCGCCGTCAATCAGGATAAGAACTCACCTTTGAGGATGTTGCCTGCGGGCGTAGCCCGCAGGCAACATCCTCAAAGGTGACGGCGCGATCTTTAGGGACCGCGCCGTCGGTGATCATGGCCGATCGTGTAAGGAACGACTTCTCTCGTCCAAAAGGGAAGTTGTTCAATGCCAGGTCGACACATCAACGATCACCAAGTGAGACTCTATATGAAAAATCGACAGAATGACACATTGGCCACCTCTGCCGCGCGTGCGGGCTTCAGCACATCTACTGCGTACCGGATTGCGGGCGATCCGCGCTTGCCGTCACAGAAAAAGGTTCCTCGCGGAAGCCGGCGACCTGACCCCCTGGTAAATATCTTCGACAACGAGGTTGTCCCGATGCTCGAGGCGTCGCCGGGCCTGCGGGCTGTCGGTATTTTTGCGGAACTGCAGCGGCGTCACCCCGACCTCGCCCCCGGGGTGCGACGTACGCTGGAGCGCCGGATCCGGGGATGGCGTGCTCTTCATGGTGCGGAATGCGACGTGATATTCCGTCAGGTCCATGAACCGGGCCGCATGGGCCTTTCGGACTTTACGCACATGAACAAGCTGCGTGTCACCGTCGGCGGCGTGCGTCTCAAGCATATGCTCTATCATTTTCGCCTCGCCTGTTCGGGCTTTCAGCACGCCCATGTCATCCTGGGCGGGGAAAGCTTTGTGGCGCTCGCTGAAGGGCTGCAGAACGCTCTGTGGGCCTTGGGAGGAAGTCCTGCCAATCACCGCAGCGACAGCCTCTCGGCGGCCTTCCGTAATCTCGACAAGGACGCGAGTACGGACCTGACGCGGCGCTATGACGCGCTGTGCCAGGACTACGGCATGGAGCCGACGCGCAACAATCGAGGTGTCGCCCATGAGAACGGATCGATCGAGAGCCCCCACGGCCATCTCAAGGCACAGGTCGAGGACGCACTCCTGATGCGGGGGTCGCGCGACTTTGACGATCTCGCCGCCTACCGGTGCTTCATCGACGAACTCGTCGGTCGCAACAATACCCGCAATGCCAAGCGCATTGATGCCGAGCGCCCGGCTCTTCAGCCTCTTCCCCTGCGGCGCAGCAGCGACTTTACCGAGAAGCTTGTCCGGGTCACCACCTCAGGCGGCTTCACGCTTCTCAAGGTATTCTACACCGTCCCGTCCCGATTGATCGGCCATCGCCTGCGAGTCCGGCTGTACGATGCTCGCCTCGATGTGTTTGTCGGCGCGACACAGGTAATGACCGCCGCGCGCGGTCACGCCGAGGCCAATGGGAAGCACGGCCAAATGGTCAATTATCATCACGTCATCCATGCGCTCAGGCGCAAACCCATGGCGCTCATGCGCCTTGTCTACCGCGACAAACTCTTCCCGCGTGCGGCGTATCGCCAGAGCTTCTACTGGCTGCTTGAGCAACACGGCGAGAAGGCCGCATGCCATATGATGGTCGATCTGCTTGGGCTTGCTCATGACCGCGGCTGCGAAGCGGAACTCGCAGCTGTCCTCGACGAGGACCTGGCCGCGCAGCGCATTCCCGACATGGCCGTCTTGCGCAAACGATTTGCGCCCGACCCTGAAAGCCTGCCTGAGGTCTTTGTCCATCTCGCCTCGCTGAGCAGCTACGAAGCGCTGCTCAGCCAGCCCACGCGAGCGGCGGCATGACCCAGAGCATTGACAGCGCCCGGCTGACCCTGCTCCTCAACGAGCTCCGCCTTCCCACCATCAAGGTGAACTGGCCAGACCTTGCCAGGCAGGCCGACAAGGAAGGCTGGCCCGCCGCCCGCTTCCTGGCCACGCTCGCCGAGCATGAAGTCACCCAGCGTGATCTCAGGCGCATTGAGCGCAATCTCAATGAGGCCCGCCTGCTGCCCGGCAAGAGCATCGACAGCTTCGACTTCACGGCCGTCCCGATGGTCTCCAAGGCGCATGTCATGGCCTTGTGCGCTGGCGACGCCTGGCTCGACAAGGGCGCCAATCTGATCCTGATAGGGGGGCCTGGCGGCGGAAAATCGCACTTATCGTCAGCGATCGGGTTCGCCCTTGTCGAGAAAGGATACCGGGTCCTGTTCACCCGAACCTCCGACCTCGTCCAGAAGCTGCAGGTCGCCAGACGGGAGCTGGCACTCGAAGCGGCCATCGCCAAGCTCGACCGCGTGGATTTACTGGTGCTCGACGATTTTGCGTACATCAGCAAGGATCAGGCGGAAACCTCCGTTCTCTTCGAGCTCATCAGTGCCCGCTACGAGCGGCGATCGCTCCTGATTACCGCCAACCAGCCTTTTGGCGACTGGAATCGAATCTTCCCAGACCCCGCCATGACGCTGGCAGCCGTCGATCGCATTGTTCATCACTCGACAATCTTCGAAATGAACGTCGACAGTTACCGAAGGCGCGCCGCTCTCGATCGCAAGCAACACGGCGCTGGCCGACCACCAACACGCGCGACAATCAAGGGCACTCGCGTTGCCTCACCAACAAATGGTACCGAATCCTGAGGCGTTGCCTCATCAAAATGTTGCCGGGCACTCCATGCCCGGCAACGCTCAGTCCAGCAAAATCGGACGAGCTGTCGCGCAGCGACAACCAAACTCAAAATTAATCCTTGTCAGCGACAACCAGGACGAACATCATGGCGACGCCGTTATGACCGATTCTTATCTTGGTTGTCGCGTTGATCATCCTGTTTGTCGCGCTACAATGATCGGGGACAAAGGCTATGACAGCGACGAGTATCGCGCCGCCCTGCAGGCCAAAGGCATCACGCCGTGTATCCCGCCGCGAAAGGGGCGAATATTACCCGCTGACTTCGACAAAACCCTCTACCGGCAGCGTCACAAAATCGAGAATATGTTCGGGCGCCTCAAGGACTGGAGGCGTATCCACACCCGCTATGACCGATGCGCGCACACCTTCATGTCCGCCATCGCTATCGCAGCTACCGTCATCTTCTGGCTCAATTAATGAGTCCTGACCCTAGTCATCTGGAATTGAAGTAAGTATCATTGTATATGCTCTGTTTGAACAGACAGGAGCATTGGTAATGGTAGGCAGGCAGGCGGGTTTGGTCGTATTGAGCGGCGAGGACCGCTTTTTTCTTGAAGGGCAGGTTCGCAGGCACAAGGTGCCGCGCTCGTTGTCGGATCGATGCCGGATGATTCTGCTATGCGCGGAGGGATTGCAGAGCAAGGAAGTTGCGCAACGCCTCGGTGTCCACGAGCACACAGTTGGCAAATGGCGACGGCGGTTTGTACAGGATGGCATTGAAGGGCTGACCGACGAATATCGTTCGGGACGACCACGAACGGTATCAGATGCGCAGGTGGCTCAGGTGATCGAGCGCACGCTGAACAGCACGCCGAAGGATGCCACCCATTGGTCAATCCGCACGATGGCAGCCGAGACCGGGCTGTCGCACACTACCATCCGGCGGATCTGGTCTGCCTTTGGCCTGCAACCGCACCGCTCACAGACGTTCAAACTGTCTACCGACCCTTTGTTCGTCGATAAGGTGCAGGATATTGTCGGCCTCTATTTGTCGCCGCCAAACCGGGCGGTCGTGCTGTGCGTAGACGAAAAATCCCAGATCCAGGCGCTGGACCGTGAGCAACCGGTCTTGCCTATGGCACCCGGTGTGGCCGAACGCAGAACACATACCTACATCCGCAATGGTACGACGTCCTTGTTCGCTGCGCTCGACATCGCCACCGGCGCCGTGATCGGCAAATGCTACAAGCGGCACCGGGCCACCGAGTTCCTCGACTTCCTCAAGAGAATCGACGCCGAGATACCCGAAGGACCGGACATACACCTCGTGATGGACAACTACGCCACCCACAAGACGCCAAAGGTCAAGGCCTGGCTGGCGCGCCGCCCGCATTGGCATGTTCACTTCACGCCGACATCGGCATCCTGGATCAATCAGGTCGAACGCTGGTTTGCAGAGTTGACGCGCAAGCAGTTGCAACGCGGTGTCCATCGATCAACGGCAGAACTCGAAGCCGACATTGTCGCCTTCATCGCAGCACATAATGAAAATCCCAAACCCTACAAATGGGTCAAATCCGCCGACGAAATCCTCGCCGCCGTCAAACGCTTCTGTCAAAAAACAATGAGCCGAACTTCAGATTCAGGTGACTAGGCTATCTGCCGAGTATGCTTTTCCCAATATTTAGCCCTCACATTTTTGCGGAGAACCTTACCAACCGTAGTGGTTGGCAATTCATCCACAAAAGTGACCGTTTTGGGAGCCTTAACCGAACCGATCAGACTTTTTGCATGGTCGATCAGTTCTTGCTCGTGCACTTCCGTCACTCCATTTCGGATGACTTCTGCGTGGACAGCCTCCCCCCATTCTTCGTCAGGGATGCCGACCACGGCTGACATTGTTACACAGGGATGAGAATTAATTGCGGCTTCCACCTCGTTCGCATAGACGTTGAATCCACCTGTGATTATCAGGTCTTTTTTTCGGTCAACCAGATAGCAATATCCAGCTTCATCGGAATAGCCTAGATCGCCAGACTTCCAGTAACCTTGGAAAAATTCTTCCGCTGTTTTTTCTGGATTGCCTTCATAACCCTGGCATGTACCCCGGGACCGGAGATATAATTCGCCAACTTCACCCTGCAGCACGGGGTTGCCGTCATCATCGCAAATGGTCACCTCAACCCCCGCGTTCCGCCGTCCGGCCGATGCCAGCCTCTTTTCCGTCTGCGTATCTCCGACATGTTCTGCCTTGCCCAAAAACAGGGTCGCCACGAGATGTTCGGTGGAACCATATACTTGCATGAAAAGCGGACCAAATTTGGCAACTAATTTCTTGGCCTTGGCCGGACTCATTGGCGCGGCTCCATAGAAAATTGTTTTAAGCGTCGAGAGATCATATCTCTCAGCCTCCGGCATCTCCAGCAAGCGATAAGCGATTGTTGGCACAACAAACGCATGAGTGATCCGTTCTTTTTCAATGTGCTCACACCAGCGCTCCAGATCGGGCTGGTTCATAGTCACCGTACAACCGCCGCGAAACAAGGTTGGCTGCAACATCATACCGCTGGCATGACTCAACGGTGCGATATGGAGCATCCGGCTGTCTGGATCAAAATCATGACCTGGCAATGTTAGAAAGCTCTCTGAACAGGCCATAATATTGTCTGCAGTATATTTTGCGCATTTGCTATCGCCTGTCGTGCCGCCGGTAAAGCGCATCAAAACAATATGCGTTCTATCATCAATTTCTACGTCTGTATTATGTGTTGGAGCCTCTGCCAACAAGCTCGGCAGTTCCAATATACCATCCCTTTTTTCTTCCAGTCGATCAACCACCACCACTTGAATGCCGCGTTGCGTAAGTAGCTCAGCGTGGGTTTCGACTAAGGAATTTTCGATGAAAGCAACTTTAGGACCAGTTATCTCGACCTGTCTCATGTGGGTTTCTAAGCTGTCACGGAAATTACCATGGCAGCAGGTGGCCAAAGCTTTGGCTGCCGACCCCCAATGGAAAAGTGACAGATTGTCATTGTCGAGCAAACAGACGTATCGGTCGCCAGTGCCCAATCCCAACGGACCATGTAAGACGTGAGCAAATCGGTTGGTCAACTCGTGATATTCGCGAAAGGTTAATCGGCGCCCGCGTTCGATATTCACGATCGCTTCGCGATCACCGAATGTGTCGACCAGTCCTTGCATGATCCGGCTGTAGTTGAAAGACCCAGTAGAAAGCGTTTGTGTATTTGATGGAGTGAGTTTCGATTTCATTTCAAAGTTTCCGATGTTATCTGGCCCTCAGGCGCGTTGCAGCATCAAGCCGGATGGTCTCACCGTTTAAATAGGCATTTTCGACGATATACTGGCAACTATGGGCGAATTCACGCATATCGCCAAGCCGCTTGGGCGCCTCCACCATTTCGATGAGTGAAGAAACAATCTTTCCTCCCAATCCCATAATCATGGGCGTCCCGAACAAACCCGGGGCGATAGCATTGACTCGGATACCGTGGGCACCCAGTTCCCGCGCGGCGGGTAAATTGAGGCCAATGACACCGGCCTTGGATGCCGAATAGCCAGCTTGCCCGATCTGGCCTTCATAGGCCGCACCGGAAGAGACATTGATAACGACTCCACGCTCCCCGTCATTTTCTGTAGGGTTTTTGACCATCACAGCAACGCATTTGCTCATGACGTTGAACAAGCCGATAAGGTTGACATTTACCACTTTGGCAAATTTTGAAAGGTCGCAGGCATTGCCTTCGCGGTCGAGAATTTTTGTTGGTGCTGGTATCCCGGCCGCGTTAATGTTGATGTGAATAGTGCCAAATTTTGCAGCGGTCTTTTCGATCGCCTCCTTCGCCTGGGTATCATTAGTTACGTCAACATTATGAAATATGGCGTTTTCCGCCCCCAGTTCTGCACACAGACTCTGGCCTGCTTCCTCGTTCAGATCGAAAAGTGAAACTTTGGCACCATGTTCAACAAAGTATCGTGCAGTAGCGGCGCCGAGACCGGATGCCCCACCCGTAATGACTACTATCTTCCGTTTAAGTTGCATTAGTCATCACCCTCGCTAAATTATCGAAACGGTCCATTTTTATATTTCCACTTGTTTTAGATATAGCACCCGGGAAGTCGAATGCCGACTTTATGACCAATGCCGATTGACACGATCTCGCCGTAGAGCTGGATATTAATTTGCCTTCCCTACCCAGCAATTTAAAATTTTTACCGATTGACATTATCATTAGAGCTTTCGCTGAAATTGTTAGGCGCACTCCAGTTTGCCCGCCTCCAAACTCGATTAAATTTAGGAGCTTTGGCACAAAAAATCTTGGTGTAACATTGGTCAGACTGTGCCAATCTGTCGTCTTCGGGCACGCAACGCTATGCTCATAATTTCAGAACGATATACCCCGACTCCTTGAGTATCCGAAAAAATAGCGGTTGCGTTATAGTCGGGAAGCGATTCTGTATTGGGGCATGATCCGCAGTGGTTTTCTGAGCAAAGCAGAACGGCTTGAGCTACGAGCTCTGGCGCGGGATGGGCTGAGTGAAGCGCGCGCGGCGCGCCGTGCCAATGCGATCATTCTGCTGGACAAGGGATGGAGCTGCCAGGAGGTGGCGGAGGCTTTGTTGATCGACGATGACACGGCTCGCTCCTGGCACAGACTTTATGACGAGCATGGCCTGACCGGCCTTGTGGTCTTCGATGTTGGGGGCAGTCACAGCCGGCTTTCTGCGGAACAGGAAGATGCGCTGTTTACGTGGGTCAGCGCCAGCCTGCCACGCAACGCGCGCACGATCGGCGCGTGGATCGTCGACAGTTTTGGCATTGAGTACAGCCACGCGGGGCTGATCGCGCTGCTGCATCGGCTGAAGCTTTCGTACCGCAAGCCTGCGATGGTGTCGGGCAAGCTGGACGCGGCCAAGCAAGCGACGTTCATCGCAGACTATGACCGGCTGATGTGCGGCCTTAAGAACGACGAGGCAGTAGCGTTCGTCGACGCCGTTCATCCGACCCATCAGGTCCGTGCGGTGGGCTGCTGGGCACCCACGGGCGAGGCGGTAGCGATCACGCCCAGCAGCGGTCGTGACCGGGTGAATATTCATGGCGCCATCGACCTCGAGACCGGCAAGACCCAGATGCTCGATGTCCTCACCGTCGATGCGCAAAGCACGATCCTGCTGCTCATGGCGATCCTTGCCACCCATCCGTCGCGGCGGTTGATCCACGTCTTCCTCGACAACGCCCGCTACCATCATGCCCGGCTGGCGCAGGAATGGCTGGTCCGAAAAGGCCAGCGGATCAGGCTCCACTTCGTGCCGACCTACAGCCCCCACCTCAATCCGATCGAGCGACTCTGGGGAGTGATGCACTAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP022336|341814:362214|348364_349534_+|WP_089132010.1|DBSCAN-SWA MRTQVCIIGAGPAGLLLGHLLRAEGVECVVIERQTPDYVLGRIRAGVLENITVSLMERLGLDARMKAEGLPHDGFNLADGERLIRIDIAELVGQEVMVYGQTELTRDLMEACEERGLEVIYQAADVALHDVDGDAPFVTYIHEGAQQRIDARFIVGCDGFHGPSRKAIPSSVSQEFERVYPFGWLGILADVPPCNHELIYANHERGFALASMRSPTRSRYYVDVPVDEDIREWSDDRLWDELAIRLGPAAAANITRGPAIEKSIAPLRSYVFAPMRHGSLMLCGDAAHIVPPTGAKGLNLAASDVHYASEALTRYFKTNDPAGVSRYSETALARVWQSERFSWSLTKLMHRFPDDGPFERAMQVAELDYIANSRAMQTAIAENYVGLPV >NZ_CP022336|341814:362214|356003_356840_+|WP_089132016.1|DBSCAN-SWA MTQSIDSARLTLLLNELRLPTIKVNWPDLARQADKEGWPAARFLATLAEHEVTQRDLRRIERNLNEARLLPGKSIDSFDFTAVPMVSKAHVMALCAGDAWLDKGANLILIGGPGGGKSHLSSAIGFALVEKGYRVLFTRTSDLVQKLQVARRELALEAAIAKLDRVDLLVLDDFAYISKDQAETSVLFELISARYERRSLLITANQPFGDWNRIFPDPAMTLAAVDRIVHHSTIFEMNVDSYRRRAALDRKQHGAGRPPTRATIKGTRVASPTNGTES >NZ_CP022336|341814:362214|341814_343166_-|WP_089132006.1|transposase|DBSCAN-SWA MKPKHSTAKKPAERVVKDIRRATRRHFSAEDKIRIVIDGLRGDDSIAELCRKEGIAQSLYYTWSKEFMEAGKRRLAGDTARAATSDEVKDLRRESRDLKECVADLTLENRLLKKKHDRGWGRRRMRYPASEKLEIIRIVEQSHLSARKTLDKLGIPHRTFYRWYDRYLEGGPEALADRPSAPSRVWNRIPAEVHDQIIDLALERSELSPRELAVTFTDEQRYFVSEATVYRLLKAHDLITSPAYVVIKAADRFHTPTVRPNEMWQTDFTYFKIIGWGWMYLSTVLDDYSRYIIAWKLCSTMRAEDVTDTLDMALEASGCDQAAVMHKPRLLSDNGPSYIAGELVEYIEAQKMSHVRGAPFHPQTQGKIERWHQTLKNRILLENYFLPGDLETQIEAFVEHYNHQRYHESLGNVTPADAYFGRALAIIKKRERIKKQTIEHRRLLHRKLAA >NZ_CP022336|341814:362214|343396_343969_+|WP_186266017.1|DBSCAN-SWA MRIDVGRGFRVAPVSVEDLEDVTSWRIEFETRALDASIGNGDDNWEAEIIASYHLLSKIKVPRVDAPAEEMYDYGEKHMRFHDALVAACGSPWLMFFRSALQTQALRYQALAMTDREHMVNRAVDEHQAIKNAAVARDTKQAMALVTHHIQQTSDDVKVLLQTSGKLDEFLPGPVAEEKPRRRGRPRNTA >NZ_CP022336|341814:362214|357385_358483_+|WP_089131732.1|transposase|DBSCAN-SWA MVGRQAGLVVLSGEDRFFLEGQVRRHKVPRSLSDRCRMILLCAEGLQSKEVAQRLGVHEHTVGKWRRRFVQDGIEGLTDEYRSGRPRTVSDAQVAQVIERTLNSTPKDATHWSIRTMAAETGLSHTTIRRIWSAFGLQPHRSQTFKLSTDPLFVDKVQDIVGLYLSPPNRAVVLCVDEKSQIQALDREQPVLPMAPGVAERRTHTYIRNGTTSLFAALDIATGAVIGKCYKRHRATEFLDFLKRIDAEIPEGPDIHLVMDNYATHKTPKVKAWLARRPHWHVHFTPTSASWINQVERWFAELTRKQLQRGVHRSTAELEADIVAFIAAHNENPKPYKWVKSADEILAAVKRFCQKTMSRTSDSGD >NZ_CP022336|341814:362214|354495_356007_+|WP_089132015.1|transposase|DBSCAN-SWA MPGRHINDHQVRLYMKNRQNDTLATSAARAGFSTSTAYRIAGDPRLPSQKKVPRGSRRPDPLVNIFDNEVVPMLEASPGLRAVGIFAELQRRHPDLAPGVRRTLERRIRGWRALHGAECDVIFRQVHEPGRMGLSDFTHMNKLRVTVGGVRLKHMLYHFRLACSGFQHAHVILGGESFVALAEGLQNALWALGGSPANHRSDSLSAAFRNLDKDASTDLTRRYDALCQDYGMEPTRNNRGVAHENGSIESPHGHLKAQVEDALLMRGSRDFDDLAAYRCFIDELVGRNNTRNAKRIDAERPALQPLPLRRSSDFTEKLVRVTTSGGFTLLKVFYTVPSRLIGHRLRVRLYDARLDVFVGATQVMTAARGHAEANGKHGQMVNYHHVIHALRRKPMALMRLVYRDKLFPRAAYRQSFYWLLEQHGEKAACHMMVDLLGLAHDRGCEAELAAVLDEDLAAQRIPDMAVLRKRFAPDPESLPEVFVHLASLSSYEALLSQPTRAAA >NZ_CP022336|341814:362214|352884_354084_+|WP_089132013.1|integrase|DBSCAN-SWA MPNQIKLTDAKLKAVKTPLRGRVELADSDCVGLRFVKGSTGKCRWIVRKRVAGKFRKITLGDYPGIGLAKARQKALQACADIESGRKLAKPDKRSNKTVSALWSTYDQQRVENKRSAKEIRRIFKKYVLPELGGRPVDAITRTDVTRLIDGIAYGERSAPVMARLVAAQMSVFFNWVLPRVETLAYNPVTAASRPPAPKPRDRVLSNEELKSLWKAVENQPFPWQHSLKLMLFTAARRSEVFGAERAEFDLEEKVWNLPASSAKNNNAVIIPLSEEACRVVSGCLVTDGSSKLFPSATNPKTSASGISKLLRRLRADVQSDLQRPVLHWTLHDIRRTVATNLQRLGVRLEVTEAILNHVSGSQAGIVKVYQRYNWMNEKREALQLWSDDLVRICDCSRQ >NZ_CP022336|341814:362214|361314_362214_+|WP_089132020.1|transposase|DBSCAN-SWA MIRSGFLSKAERLELRALARDGLSEARAARRANAIILLDKGWSCQEVAEALLIDDDTARSWHRLYDEHGLTGLVVFDVGGSHSRLSAEQEDALFTWVSASLPRNARTIGAWIVDSFGIEYSHAGLIALLHRLKLSYRKPAMVSGKLDAAKQATFIADYDRLMCGLKNDEAVAFVDAVHPTHQVRAVGCWAPTGEAVAITPSSGRDRVNIHGAIDLETGKTQMLDVLTVDAQSTILLLMAILATHPSRRLIHVFLDNARYHHARLAQEWLVRKGQRIRLHFVPTYSPHLNPIERLWGVMH >NZ_CP022336|341814:362214|354085_354295_-|WP_145955434.1|DBSCAN-SWA MNHSNSNRLRLTGKLIKGDRRYVLQCDDKSVWKLDFQDNEVPEDNHMVLVEGVKTGVDSIAVDWVGETK >NZ_CP022336|341814:362214|344587_345886_+|WP_089132008.1|DBSCAN-SWA MDFEPTEQQAQWRDRVAHFMNTKVRPAIPRYQAEQATGERWKILEVVEELKGEVKEAGLWNLFMPPRNDSHHHVDETFEFDGPGLTNLAYALCAEEMGRVEWAAELFNCSAPDTANMEVLHRYGTREQKDRWLKPLMDGDIRSAFLMTEPFTASSDATNIETRIERDGDHYVINGRKWWSSGFGDPRCKVAIVMGKTDFNAARHAQQSMIIVPKDTPGVTPLRYLNVFGYDGAPHGQMEVELKDVRVPADNMLLGEGRGFEIAQGRLGPGRIHHSMRSIGLAEEGLHKMCKRLQEREVFGKPIYKHSVWEERVARARIDIDMTRLLCLKAADMMDKVGNKGARQEIAMIKLQAPQAALRIYDDAIQAFGGAGVADDYGLAEAFAGLRSFRLADGPDEVHARSIARLEFVKHPPEAGPAANALRGDQARNAGA >NZ_CP022336|341814:362214|360115_360889_-|WP_089132019.1|DBSCAN-SWA MQLKRKIVVITGGASGLGAATARYFVEHGAKVSLFDLNEEAGQSLCAELGAENAIFHNVDVTNDTQAKEAIEKTAAKFGTIHININAAGIPAPTKILDREGNACDLSKFAKVVNVNLIGLFNVMSKCVAVMVKNPTENDGERGVVINVSSGAAYEGQIGQAGYSASKAGVIGLNLPAARELGAHGIRVNAIAPGLFGTPMIMGLGGKIVSSLIEMVEAPKRLGDMREFAHSCQYIVENAYLNGETIRLDAATRLRAR >NZ_CP022336|341814:362214|356874_357306_+|WP_186266018.1|transposase|DBSCAN-SWA MPGNAQSSKIGRAVAQRQPNSKLILVSDNQDEHHGDAVMTDSYLGCRVDHPVCRATMIGDKGYDSDEYRAALQAKGITPCIPPRKGRILPADFDKTLYRQRHKIENMFGRLKDWRRIHTRYDRCAHTFMSAIAIAATVIFWLN >NZ_CP022336|341814:362214|358479_360030_-|WP_089132018.1|DBSCAN-SWA MQGLVDTFGDREAIVNIERGRRLTFREYHELTNRFAHVLHGPLGLGTGDRYVCLLDNDNLSLFHWGSAAKALATCCHGNFRDSLETHMRQVEITGPKVAFIENSLVETHAELLTQRGIQVVVVDRLEEKRDGILELPSLLAEAPTHNTDVEIDDRTHIVLMRFTGGTTGDSKCAKYTADNIMACSESFLTLPGHDFDPDSRMLHIAPLSHASGMMLQPTLFRGGCTVTMNQPDLERWCEHIEKERITHAFVVPTIAYRLLEMPEAERYDLSTLKTIFYGAAPMSPAKAKKLVAKFGPLFMQVYGSTEHLVATLFLGKAEHVGDTQTEKRLASAGRRNAGVEVTICDDDGNPVLQGEVGELYLRSRGTCQGYEGNPEKTAEEFFQGYWKSGDLGYSDEAGYCYLVDRKKDLIITGGFNVYANEVEAAINSHPCVTMSAVVGIPDEEWGEAVHAEVIRNGVTEVHEQELIDHAKSLIGSVKAPKTVTFVDELPTTTVGKVLRKNVRAKYWEKHTRQIA >NZ_CP022336|341814:362214|349959_352134_+|WP_089132012.1|DBSCAN-SWA MADQKIGGGARKILYTLNTIRKMGVTKSSKALTARNTCKACALGMGGQLGGMTNELGEFPSVCNKSVQAQSSDIQPAIPEEVFAHGIDDFAALSAREIEGLGRLGSPIFKDQGEDRYRPVSWDQAMEIAAARLEAAEPNRTFFYSSGRSSNEAGFLFQLLARLYGTNNVTNCSYYCHQATSEALASTIGTGTSTVEIADLSLCDLIFVIGANPASNHPRFIHQLASCRARGGQVIVINPAKEPGLVKFALPKSPKSLIKGGDEIASFYLQPRIGEDLALFKGIAKAIVADEAIDRTFIDQHTQGSESFLSDLAATSWEEIVERTQIEKAVIEQAAALYAKSKSAVFAWGMGMTHHHNGVQNIEMIANLALMRGMVGRPGAGLLPLRGHSNVQGIGTIGVKPVLADDVMHAIENYFDISLPEEKGLDTMASMKAADAGQIDAAMMMGGNLFAANPNAEWAKCALEKIGFKLFLTTTLNHGHFKGIGDGASLVLPVTARDEEWEPTTQESMFNYVRLSDGGINRLPSVRPESHILADLAERILPDCPIDFVEFKHHKNIRKAIAATVPGMEELAGLDVARREFHIAQRLLHEPQFGTASGKGEFATCKLPARVAEDDQYSLSTIRSEGQFNSIIYEEADTYRGTQSRWTVLMNPEDMAAMGLAKGDAVDLLSDNGKMIAVAAHPFDIPRRNMMAYYPEANVLTGTDVDPRSHTPSFKNTPVRIAIR >NZ_CP022336|341814:362214|343992_344415_-|WP_089132007.1|DBSCAN-SWA MDDFLKRLPGYNLRRASTVILSELTDGLSELDLRPSDASMLITIDDHPRVTPSELGRILGIQRANMVPMVVRLEDRGEIVRIKRDGRSFGLELTQKGSELCNRAKAILLKYEENFLKRVPPEHREHLLPALRALWDHSAS >NZ_CP022336|341814:362214|347507_348353_+|WP_089132009.1|DBSCAN-SWA MTKVISPSGVAAYEIKDGIAWVYFTRSEKRNCMSPTLNKEMGDLLAEIEFRDDFGVLVLTGEGTSWSAGMDLKEYFREAEAKGLGAIREAQAQAYSWWRKLRWYQKPTIAMVNGWCFGGAYGPLFACDLAFAAEDAQFGLSEINWGILPGGAASKIIRELANFRNSMYHAMMGENVDGKTAAQWGLVNEAVSADRLEDRVTEVAKVLLEKNPVALKATKDAIRRVGEMTYDNAEDYLVRAQEAANYFDNMGRKEGIKQFIDDKTFKPGLGAYDKSKQPVSE |
16 | Equine_infectious_anemia_virus(25.0%) | integrase,transposase | attL 343389:343402|attR 356277:356290 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
2885666 : 2894198
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP022336|2885666:2894198|DBSCAN-SWA GATGAATGATGGCAACGCAACGGAATTATTGGCCCGACTGACGCCGCGCGATCGCCAGATGTTTGTCTCGCGGCTGAGCACGATAGAGCAGCAGGAATTTCTGTATCGCTGGCCGTTCTGGGCGCGGCCCGAACAAATGCCGCCTGCCGGCGACTGGTTGATCTGGCTGATCATGGCAGGGCGTGGCTTTGGCAAGACCCGGGCCGGGGCCGAATGGGTGCGTCATATCGCCGAAGGCGATGGCAGCGCCCGTTTTGCGCTGGTTGGCGCGAACTATGCCGAAACCCGGACCGTGATGGTCGAAGGAGAAAGCGGCCTGCTGTCGATTGCGCCGCCCAAGCAGCGGCCGGTCTGGGAGCCTTCGCTGAAGCGGCTGACCTGGGAGAATGGCGCGCAGGCGCATCTCTATTCGGCTGCCGAACCGGAGGGCCTGCGCGGTCCCCAGCACAGCCATGGCTGGTGCGACGAGATCGCCAAGTGGATGAACAATGCCGGGCAGGCGGAAGCCGCCTGGGACAATCTGAAAATGGGTCTGCGGCTTGGCTCTCGGCCGCAACTGGTCGCCACCACAACGCCGCGACCGGTGCCGCTGGTGCGGGCGCTGGTTAGCGGGGAGGTGACAATCACGACCGGACGGACGCAGGACAATCATCTGCATCTGCCGGCCGCCTTTCTGACCGCCATGGAGGCCGACTATCGCGGCACGCGGCTGGGACGGCAGGAACTGGACGGCGAGCTGATAGAGGATGTCGAAGGGGCGCTCTGGACAAGGGCGATGATCGAGGGGTGCCGGGTTGGCCGTGACGCCGTTCGTGTCGAGCATAGTCGATACACAGGGGATAGCGCGAACCGGTCTCTCGCCTGCGCTCGAGACGAACGGGATATGGGCCTGGTCCGTATCGTCATCGGGGTTGATCCACCGGCATCGAAAAGCGGTGATGCCTGCGGCATCATTGTTGCGGGCCTGGGCGGGGATGGCAAAGCCTATGTGCTGGCCGATGCCAGCGTGGAAAAGGCCAGCCCCGAAACCTGGGCGCGCAAGGTGGCGGAGGCCGCTGATCGCTTTGAGGCCGACCGGATCATTGCCGAAGCCAACCAGGGCGGCGCGATGGTCAAGTCGGTGCTGCAGGCGGCGAAGATTTCGCTGCCGGTGAAGCTGGTCCACGCCGCGCGCGGCAAGGTTGCAAGAGCGGAGCCGGTGGCGGCGCTTTATGAAAATGAGCGGGTGCGTCATGTCGGCGCTTTTCCGCAGCTCGAGGACGAGATGTGCGGTTTGCTGGTTGGCGGCGGCTATGAGGGGCCGGGGCGCTCGCCCGATCGGGCGGATGCGCTGGTCTGGGCGTTGACGGAACTGATGCTGGGGAAAAGGCGGGAACCGCGGGTGCGGTGAATGGCCTGTCGAATGGCGCGTGCTCCGCACTTGATGCGGAGCTCTCTCGGCATTCAGTTGCTATCCGCCAGAGCTCCGCATCAAGTGCGAAGCACATCGCGTAATAGAGGAAATAAACATGACATTCTGGGAAAATATCACGCTCGCCTTCAAGGGCGGGGGTGCCTCCTTGCGGCCGCCTTTGGGGCGCAGCTATATCGGCACTTATGGTGGCGCGGCTCTTTCGGGGGACGCGCCCTTTTCTTATGAGGGCCGGGTGCGCGAGGCCTATGTCGAAAATGCGATTGCCCAGCGGGCGGTGCGCATTGTCGCCGAAGGGGTTGGCGGCGCGCCGCTGGTCCCGGTGGAAGGCAAGCTTGCGGCCCTGGTCGAAGGAAGCGGCGCGGGGCAGTCGCTGCTGGAGACTGTCGCGGCGCATCTGCTGCTGCATGGCAATGCCTATGTCCAGATCATCCATGGCGATGACGAGCCTGCCGAGCTTTACGCGCTCCGCCCCGACCGGATCACGGTCGAGCCCGATGCCAAGGGCTGGCCAGTGGCTTATGCCTATCGGGCCGGTGAACATCGCACCCGCTTTGCCGCGCAGGACGCGATGGGCCGCCCGGCGATCATCCATCTCAAGGGCTTCCACCCGACCGATGACCATTATGGGCTCGGCTGTCTGGGCGCGGCGGCAAAGGCGGTGGCGGTGCATAATGCGGCGGCGAAATGGAACAAGGCGATCCTCGACAATGCGGCGCGCCCATCCGGGGCGCTGGTCTATGATCCGGGCGCGGATGGATCGGCGTTGACTGGCGAGCAGTTCGACCGGCTGAAGGCCGAGATGGAGGCGAGCTTTGCCGGTTCCGGCAATGCCGGGCGGCCGATGCTGCTCGAGGGCGGTCTGAAATGGCAGGCGATGAGCCTGACTCCGGCGGACATGGATTTTGTCGCCCTGAAAGAGGCGGCGGCGCGGGAAATCGCGCTGGCCTTTGGCGTGCCGCCGATGCTGCTCGGGCTGCCCGGTGACAACAGCTACGCCAATTATCGCGAGGCCAACCGGGCCTTGTGGCGGCTGACGATATTGCCGCTGGCGGGGAAGATCCTGGATGGATTGTCTGGGGCGCTGGGCAGCTGGTGGCCGGATGTAAAGCTGGCGGTGGACCGGGACCAGATCCCGGCCTTGTCGGAGGATCGGGAGCGGTTGTGGAAGCAGGTGTGCGAGGCGGACTTCCTGTCGCCGGAAGAGAAGCGGGCGATGCTGGGGGTGTAGATTCATTTTGTCTCGCGGCAGTTCGCTGCGCGAACGCCTCCGACGGGGATTTTTTGTTTAACCTCAGGCGCCGGGCGGGCGCATCCGCGCCCTTGGCTTTCGCGCCATAAGGCGCGGCGGCCGGTCGGCCTTGCCTCGTTGCGCTCGGTTAGAAACCAGCCACGAGTGTGAGCGCCGAACCGACGCGAAGCGGAGGCAAGCGCGACCGCGCGTCCGCAGCGTAGCGAGGATAGCCAAGCGGGCGGATGCCCGCGCCCGGCGCTTGAGGTTAAAACGGAGTTTTATACAAAATGCAAAATAACGAAATGCTCGCCCGCCTGATGGCGCAGGCGGAAGGCGATGGCGCGGATCTGGTGACATTGCGCGCGATTGTCGAGGAGGCGACCGACAGCGGCGCGGTGCGGGTGCTGGACCGGCTTGGCCTGTCCGACCCGGGCGCCGAGGATGATATTGACGAGTTGCGCGAACTGCTCCGCGCCTGGCGCGACGCCAAGGCGAGCGCGTGGAAGGCGGCGATCCGCTGGATCGTGCGGGGGGTGCTGGCGCTGCTCTTGGTCGGCATCGCGATGCGGCTGGGTCTGGGGCATCTGGTGTCGTGAGAGAGGTGGAACGGGCTGTTTGTGGTGAGCTTTTCGAACCACGCCTCTCGGACGAGCAGTGCCCTTCGACAGGCTCAGGGCGAACGGATATCACCGCAAAAGGTCCTTCGACAGGCTCAGCACGAACGGATTTTCTTTCAGTGATACCGGACACCATCCGCTTCGCCGGCTATGCCGCGATCTTCGACCGGGTGGATCGGGGCGGCGATATCGTCAGGCCGGGCGCGTTCGGCGATCTGGCCGAAGGACAATCCCTGCCGCTGCTCTGGCAACATGATCCGATGCAGCAGATCGGCCGGGTTGATTTTGCCCGTGAGGACCGGCGCGGATTGCGGGTCATCGGGACTGTTTCTACCACGACCCGGGCCGGACGGGAGGCGGCCGCCTTGTTGTCCGGCGCGTCGGTGAAGGGGCTGAGCTTCGGCTACCGCGTCAAGCGGGCGACGGGCGAAAAGCCGCGCGAATTGCTCGATCTGGATGTCGCGGAAATTTCGCTGGTGACAGCTCCGATGCAAGCACTGGCGCGCGTTCATCTGGTCCGTAAGCCCTGATTCGACCGCGCGAAGACGCGAAGAGGAACGATCTGTTTTGTGACTGCGTGCGATTTCGACAATCGAATGAAAATTAACCCCAGGAAAGGAATATCATGGAACTCTCTCCCAATACCCCCCTCGAAACCAAGGCGGATCCGCTCGAGGCGTCGTTTGATGCGGTGCTGATGGCGGAAGAGACCGAAAGTCATGGCAAGGCGATTGCTGCCTTGCGCGGCGATGTGGACGGCCTGAAGGGGCAGGTCGAGGCGATCGGCAAGGCTTCGGCGCGACCCGCGCTGGCCGGAAATATCGATGGCGCGAAGGGGATGCCGTCTTCGGCTGCTGCGCAGGATTTTGTCGCGAAATATCTCCGGCGCGGGAACCATGAGGGCGTCGAGCTGAAAAGCTTTTCCGGGGCCTCTGGACCCGAGGGCGGTTTTGCCGTGCCACAGGAGATTGACGGGCTGATCGGGGCGACGCTCAAGGATATTTCGCCGATCCGCGCGATCGCGACCGTGGTGCAGACCGGCACGGCGGGTTATCGCAAGCTGGTCACCACCGGCGGCACGCCCTCCGGCTGGGTCAGCGAGACCGCGGGGCGTCCGGAAACCGATACGCCCGATTTCAACGAGATCGCGCCGCCGACCGGCGAGCTTTACGCCAATCCGGCGGCGTCTCAGGCGATGCTCGATGACGCGGCCTTTGACGTCGAATCCTGGCTGGCCGACGAGATTGCCCGCGAATTTGCGCAAGCCGAGGGGGCGGCCTTTGTCGGTGGTTCCGGGGTTAACCAGCCGCGCGGTTTTCTCAATGCGACGGTGACCGATGAGAGCGATGATGTGCGGGCCTTCGGCTCGCTGCAATATGTGCCGTCGGGCGCGAGCGGTGACTTTGACAGCGAAGATGTGCTGGTCGATCTGGTGCACACGCTGCGGCCCGCTTACCGGCAGGGCGCGAGTTTTGTGATGAACAGCTCGACGCTCGCCCATATTCGCAAGTTCAAGACGGCGGACGGTGCTTTTCTGTGGCAGCCGTCGCTGGCCAATGGACAGCCCGCGACGCTGCTCGGCTATCCGGTGGTGGAGGCCGAGGACATGCCCGATATTGCCGCCGACAGCCTGGCGATTGCCTTTGGCAACTTCCGCGCCGGTTACCTGATCGCCGAACGCAGCGCGACCCGCATCTTGCGCGATCCGTTCACCAACAAGCCGTTCGTCCATTTCTACGCGACCAAGCGGGTTGGCGGACAGGTGATGAATTCGGAAGCGATCAAGCTGATGCAGTTCAGCGCTTCCTGAACCTCCTTGCTGTGCTTCGGCGCAGCGCGCCCGTGCCGGTTGCTCCCCCCTCTCGATCCGGCACGGGCGCATCTTTTGAACCATGAATGAAAGGATGGCGCGACCGTGAGCTTTCCCATTGCAGACTGGCCGGATTTGCCGGCAGCGCTGATCGCAGAGGTCAGGGATTTTGTCCGGATTGATCATCAGGCCGATGATGACGCCATCGATGCGTTCCTGCGCAGCGCGGCGTCCCTGTGCGAGGATTTTACCGGCCAGATGCTGATCGCGCGGTCGGTGACCGACATGTTGCCGGCGCGTCAGGCCTGGCAGAAGCTGAAACGGCTGCCGGTGCAGGCGATTGCTTCGGTCGAGGCCGTGGGGGCGGACGGAGCGGCGTCGCTGCTGGCGGTCGCGGACTATGCCGTGGATATCGACAGCGACGGGATCGGCTGGATCAGGCTGCACCGGAGCGACGGCGGGTCGCGCATCCGGGTTCGCTATAGCGCCGGTCTGGCGGCGGAATGGAACGCGTTGCCCGCGGGTCTGCGCCAGGGGATCGTGCGGATGGCTGGCTATCTCTACGCCAATCGCGACAGCGTCGATGCCGGTGGTCCGCCGAGCGCGGTGACCGCCTTGTGGCGACCCTATCGGCGGATGAGGATGGCATGATGGGACAGGAATTTTCCGGTATCTTGCGCGAACGCATATCGATCGAGCGGCAGGCGGTCGGGCGCGATGCGCTCGGTTCTGCCGAGCCGCAATATATTACCGTTGGCGTCTTCTGGGCAGCGGCTGAAGCGCTGCACGGCGGAACGCCCAGTGAGGCGGAAAGCCGTTCGGCGATGCCGCGCTGGCGCTTCATTCTGCGCGAAACCCGGGCGATCAAGCCGGGCGACCGGCTGGTCTGGGGGGAGCGGATAATGACCATATCGAGCGTCCTATTGGAGCATCGGCTGATCCCGAAGACCATCCTGCAGGCGGAGGAGAAGAGATGATGGAAAAATTGCAGCAGCGTGGCGAGGCCCTTGCCGAACAGCGGTTGGCCGAGGCGAAATCCGAGATCAAGTCGGTGCTGGTCGAAGAGTTGCCCGACGACGTGCAGGTCGTCGAAACGGAACTGGGGATCCAGGTGCAAGCGCCGCGATTGCGCGCGCGGCTGATCGGGAACAGCAGCTTGCGCGATATCGCCTTTCTGATGCGGGCCGCGCGATGAGTAGCGCGCTCGAAGCGGTGCAGCAGCAGTTGGTGACGCAACTGGATGCCAAGCCGTCATTGACCGGCCTGATCAGCGGTATATTCGACGGCCCGCCGCCGCGTGCCGCTTTTCCCTATATCGCGCTGGCCACGGGGGCTTCGCTTGACTGGAGCCACAAGGGCGGTGTCGGCCGGGAATTGAGCCTGGCTCTGACCGTCCATGATGATGGTGAGACGGCGGCGCGATTGCATAGGGTGATGGCGTTGGTCGAAGAGGCCTTGGAGCCGGGGCTGGATGATCCCGATGGCTGGCAAATCGTCACCTTTGATTTTCGCCGGACGCGGATTGTGCGCAGCGCGGTTAGCCCGTGGAGCGGGCTGGTCGAGTATCGGGCGCGGGTTTTGAAAAGCTAGTGCCCCTGCGCAGGCAGGTTTTTATTTTAACCTCGGACGCCGGGTTTACTTTGTCTCACGGCAGTTCGCTACGCTCACGCCTCCGACGGGGATTTTTTATTTAACCTCAGGCGCCGGGCGGGCGCATCCGCGCCCTTGGCTTTCGCGCCATAAGGCGCGGCGGCCGGTCGGCCTTGCCTCGCTGCGCTCAGTTAGAAGCTCATCATAGTGGGAGAGCCGAACCGACGCGAAGCGGAGGCAAGCGCGACCGCGCGCCGGGCGAAAGCCCGCCCTGTCGGAGGCGTGAGCGTAGCGAACTGCCGCGAGACACCATGACCACACACCAGAAAGGAACCAGATTATGGCAGCAGAAAAAGGCAGCGCCTTTCTCCTGAAAATTGGCGACGGCGAGGAGCCCGTCGGCTATACGACCATCGCGGGTCTGCGGACGACGCAGATGTCGATCAATGGCGAGCCGGTGGCGATCACCAGCAAGGACAGCGGCGGCTGGCGGCAATTGCTGTCGGGCGCGGGGGTGCGATCGGTTTCGGTGTCCGGGGCGGGCGTATTCACCGGCTCCGACGCCGAGATGCGGATCAAGAATCACGCGCTGGGCGGAATCATTGACGCCTATGAACTCAGCTTCGAGGGCGGCGAGCGGATGCAGGGCGATTTTCTGGTCGCCCGGCTGGACTATAGCGGCGATTATAACGGCGAGCGCAGCTACACGCTGAGCCTGGAAAGCTCGGGCGCGGTGGCCAGTGTCTGACCGGTCCGCCAACGCCCTGCGCGGCGAGGCGCAGATTGTTATCCATGGCACGCGTCTGATCCTGCGGCCCAGTTTTGCGGCCTTGGTCGCGGCCGAAGAGGAGCTTGGCTCGCTGTTCGATCTGGTCGAGCGGGCAGCCGGCGGGCGGCTTTTGCTGTCGGAAATCGTCACCCTGTTCTGGCATCTGGCTGCTGATCGTCCGGATCATCTGACCCGCGACCAGCTCGGCGAGGGGATGATGGTGCTCGGACTGGCCGGGGTGACGCCGCCGCTGAAGATATTGCTCAGGCAGATATTGTCGGGCGGCGGTGCATGAGGACTGGTCCGCCTTCACAGGAGAACAGGACTTTCGCCGCCTCCGCCTCGCGCTTGTCCGGCACTGTGTCTGCGGTCCTGGGCTGGACACCCGACCAGTTCTGGCGGGCAACGCCTGCCGAACTCGCGACGATATTCTCGACCTTTGCCGACAATATGGCCGGCCTGTCCGGCGAGCTTCCGCTCGGCACAGCACAATTGGAAAAACTGAAAGAGGTCTTCCCCGATGGATGAAGAAATCGAACGGCTGGTGGTCAGCGTGCGCGCCGATACCGCCGGCTTTGCCAAGGATGTGGCCGATATGAAGGGCCAGCTTGATGGTCCTTTTGCATCGGGGCTGGAGCGGGCGGGATCGGCGCTGGAGTCCACGCTCGGCCGCGCGATCCGTCGCGGTTCACTGGGTTTCGAGGACCTGCGCCGGGTGGCGCTGTCGGTGATGAACGATATCGCCGATGCGGCCATCCGGAGCGGTTTGCACAATCTTTTTGGTGGCGGTGCGGGTGGCAGCGGTCTGCTCAATATCGGAACATCCTTGCTGGGCGCCTTTCTCGGCGCGCCGGGGCGGGCGACCGGTGGTCCGGTCAGCGGCGGCCGTGCCTATATGGTCGGAGAACGCGGGCCCGAGCTGTTCGTGCCGACCGCAGCGGGGCGGATCGAGCCGCCGGTACCGGTCAGCACTGCGCCGAATATCCGATTGACCATCAACATATCCGACAATGGCCAGGGCAGCGCGCCCGACCAGATGCGCCGGTCGAGCCGGCAGGTGGCGCGGGCGGTGCGCAATGCGCTGTCGGCGAAGGCGGGCTGA
Protein sequences of DBSCAN-SWA_2 >NZ_CP022336|2885666:2894198|2893616_2894198_+|WP_089134078.1|tail|DBSCAN-SWA MDEEIERLVVSVRADTAGFAKDVADMKGQLDGPFASGLERAGSALESTLGRAIRRGSLGFEDLRRVALSVMNDIADAAIRSGLHNLFGGGAGGSGLLNIGTSLLGAFLGAPGRATGGPVSGGRAYMVGERGPELFVPTAAGRIEPPVPVSTAPNIRLTINISDNGQGSAPDQMRRSSRQVARAVRNALSAKAG >NZ_CP022336|2885666:2894198|2887173_2888307_+|WP_089134070.1|portal|DBSCAN-SWA MTFWENITLAFKGGGASLRPPLGRSYIGTYGGAALSGDAPFSYEGRVREAYVENAIAQRAVRIVAEGVGGAPLVPVEGKLAALVEGSGAGQSLLETVAAHLLLHGNAYVQIIHGDDEPAELYALRPDRITVEPDAKGWPVAYAYRAGEHRTRFAAQDAMGRPAIIHLKGFHPTDDHYGLGCLGAAAKAVAVHNAAAKWNKAILDNAARPSGALVYDPGADGSALTGEQFDRLKAEMEASFAGSGNAGRPMLLEGGLKWQAMSLTPADMDFVALKEAAAREIALAFGVPPMLLGLPGDNSYANYREANRALWRLTILPLAGKILDGLSGALGSWWPDVKLAVDRDQIPALSEDRERLWKQVCEADFLSPEEKRAMLGV >NZ_CP022336|2885666:2894198|2891928_2892327_+|WP_089134074.1|DBSCAN-SWA MSSALEAVQQQLVTQLDAKPSLTGLISGIFDGPPPRAAFPYIALATGASLDWSHKGGVGRELSLALTVHDDGETAARLHRVMALVEEALEPGLDDPDGWQIVTFDFRRTRIVRSAVSPWSGLVEYRARVLKS >NZ_CP022336|2885666:2894198|2892667_2893075_+|WP_089134075.1|tail|DBSCAN-SWA MAAEKGSAFLLKIGDGEEPVGYTTIAGLRTTQMSINGEPVAITSKDSGGWRQLLSGAGVRSVSVSGAGVFTGSDAEMRIKNHALGGIIDAYELSFEGGERMQGDFLVARLDYSGDYNGERSYTLSLESSGAVASV >NZ_CP022336|2885666:2894198|2885666_2887055_+|WP_186266110.1|DBSCAN-SWA MNDGNATELLARLTPRDRQMFVSRLSTIEQQEFLYRWPFWARPEQMPPAGDWLIWLIMAGRGFGKTRAGAEWVRHIAEGDGSARFALVGANYAETRTVMVEGESGLLSIAPPKQRPVWEPSLKRLTWENGAQAHLYSAAEPEGLRGPQHSHGWCDEIAKWMNNAGQAEAAWDNLKMGLRLGSRPQLVATTTPRPVPLVRALVSGEVTITTGRTQDNHLHLPAAFLTAMEADYRGTRLGRQELDGELIEDVEGALWTRAMIEGCRVGRDAVRVEHSRYTGDSANRSLACARDERDMGLVRIVIGVDPPASKSGDACGIIVAGLGGDGKAYVLADASVEKASPETWARKVAEAADRFEADRIIAEANQGGAMVKSVLQAAKISLPVKLVHAARGKVARAEPVAALYENERVRHVGAFPQLEDEMCGLLVGGGYEGPGRSPDRADALVWALTELMLGKRREPRVR >NZ_CP022336|2885666:2894198|2891384_2891714_+|WP_089134072.1|head,tail|DBSCAN-SWA MMGQEFSGILRERISIERQAVGRDALGSAEPQYITVGVFWAAAEALHGGTPSEAESRSAMPRWRFILRETRAIKPGDRLVWGERIMTISSVLLEHRLIPKTILQAEEKR >NZ_CP022336|2885666:2894198|2893387_2893624_+|WP_089134077.1|tail|DBSCAN-SWA MRTGPPSQENRTFAASASRLSGTVSAVLGWTPDQFWRATPAELATIFSTFADNMAGLSGELPLGTAQLEKLKEVFPDG >NZ_CP022336|2885666:2894198|2889624_2890737_+|WP_089134909.1|capsid|DBSCAN-SWA MAEETESHGKAIAALRGDVDGLKGQVEAIGKASARPALAGNIDGAKGMPSSAAAQDFVAKYLRRGNHEGVELKSFSGASGPEGGFAVPQEIDGLIGATLKDISPIRAIATVVQTGTAGYRKLVTTGGTPSGWVSETAGRPETDTPDFNEIAPPTGELYANPAASQAMLDDAAFDVESWLADEIAREFAQAEGAAFVGGSGVNQPRGFLNATVTDESDDVRAFGSLQYVPSGASGDFDSEDVLVDLVHTLRPAYRQGASFVMNSSTLAHIRKFKTADGAFLWQPSLANGQPATLLGYPVVEAEDMPDIAADSLAIAFGNFRAGYLIAERSATRILRDPFTNKPFVHFYATKRVGGQVMNSEAIKLMQFSAS >NZ_CP022336|2885666:2894198|2893067_2893391_+|WP_089134076.1|DBSCAN-SWA MSDRSANALRGEAQIVIHGTRLILRPSFAALVAAEEELGSLFDLVERAAGGRLLLSEIVTLFWHLAADRPDHLTRDQLGEGMMVLGLAGVTPPLKILLRQILSGGGA >NZ_CP022336|2885666:2894198|2891710_2891932_+|WP_089134073.1|DBSCAN-SWA MMEKLQQRGEALAEQRLAEAKSEIKSVLVEELPDDVQVVETELGIQVQAPRLRARLIGNSSLRDIAFLMRAAR >NZ_CP022336|2885666:2894198|2889061_2889457_+|WP_089134908.1|head,protease|DBSCAN-SWA MRFAGYAAIFDRVDRGGDIVRPGAFGDLAEGQSLPLLWQHDPMQQIGRVDFAREDRRGLRVIGTVSTTTRAGREAAALLSGASVKGLSFGYRVKRATGEKPRELLDLDVAEISLVTAPMQALARVHLVRKP >NZ_CP022336|2885666:2894198|2890854_2891388_+|WP_089134910.1|DBSCAN-SWA MADWPDLPAALIAEVRDFVRIDHQADDDAIDAFLRSAASLCEDFTGQMLIARSVTDMLPARQAWQKLKRLPVQAIASVEAVGADGAASLLAVADYAVDIDSDGIGWIRLHRSDGGSRIRVRYSAGLAAEWNALPAGLRQGIVRMAGYLYANRDSVDAGGPPSAVTALWRPYRRMRMA >NZ_CP022336|2885666:2894198|2888597_2888906_+|WP_089134071.1|DBSCAN-SWA MQNNEMLARLMAQAEGDGADLVTLRAIVEEATDSGAVRVLDRLGLSDPGAEDDIDELRELLRAWRDAKASAWKAAIRWIVRGVLALLLVGIAMRLGLGHLVS |
13 | Dinoroseobacter_phage(14.29%) | capsid,portal,head,tail,protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|