Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP019942 | Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed5, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP019940 | Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed3, complete sequence | 1 crisprs | NA | 0 | 1 | 0 | 0 |
NZ_CP019938 | Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed1, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP019941 | Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed4, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NZ_CP019937 | Ketogulonicigenium robustum strain SPU_B003, complete genome | 0 crisprs | csa3,DEDDh,WYL | 0 | 0 | 4 | 0 |
NZ_CP019939 | Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed2, complete sequence | 1 crisprs | RT | 0 | 3 | 0 | 0 |

CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|

CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|

Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
414917 : 423306
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP019937|414917:423306|DBSCAN-SWA GCTAGTGCCCTGCCGAGCCCGAGGCGGGCAGCGCCGCCGACCAATTCATGGCACGCTGCGCATCGCGCAAACGCACCAGCGACGAGATGTGGCTGTCGGCATCGAGCGCGGTCATCACGCGGTGCAGGTGTTCAGCATCGCGCACGTCGACATCAACGCGAATGCGGTAATAATCCGTCTTGCGATCCAGAAAATCGACGTCGGATATGTTCGCATTCTGCTCGCCGATCAGGGTGCAGATCCGGCCCAAAACACCCGCGTCATTGGCCATCGACAATTCCAACGACACCGCGCTGACGGCCTTGTGCTGCCCTTCGTGCCAGCGCAGGTCGACCCAACGGTCAGGCTGGTCTTCGTAATCGGACAGGTTGGGGCAGTCGATTGCGTGGATAATTACGCCTTGCCCGCGGAAGGTGATGCCGACGATCCGCTCGCCCGGGACGGGCTGGCAGCAGGGCGCGCGGCGGAAGCTTTGGTCGGCGCTGAGGCCGACAATTGCTTTTTCCGCGTCGATCTCGTTCGCGTTTTGCAGCTTCAGATCAGGGTAAATCGCCCGCACAACTTCGCGCGCGGTGATTTCCGCGGCGCCGACGCGCAGCAGCAACTGTTCGGCATTTTCCAGCGCCAGCGCGCGCGCCGCCGTGGCCAAGGCCTTGTCAGTCGCTTTTTTACCGGCGTTTTCAAAGGCAACGCGTGTCAGCTCGGTTCCCAGCTTGATGAACCGGTCGCGGTCTTTTTCGCGCAGCCACCGACGGATGGCCGATTTCGCGCGGCCCGTAACGGCAATGTCGATCCACGTCGCCTGCGGGGTCTGGCCGTCGGCAATGATGATCTCGACCGACTGGCCATTCTTCAGCCGCGTCCACAGGGGCACGCGCAGCCCGTCAACCTTAGCGCCAACGCAGGCGTGGCCGATGCGGGTGTGGATCGCATAGGCAAAGTCGATCGGCGTCGCGCCTTGCGGCAGCTTGATCACCTCGCCCTTGGGCGTAAAGCAGAAGACCTGATCCTGATACATCTCGAGCTTGAAGGTCTCGAGAAATTCATCGTGGTCTTTGTCTTCCTCGAACCGCTCGGAGAGCTGCGATATCCAACGAACGGGATCGACGACAAAGCGGTTTTTGACCGGCTCGCCATCACGGTATGACCAGTGCGCGGCAACGCCGGCCTCGGCCACTTCATGCATCTCGCGGGTGCGGATTTGCACTTCGACCCGCTTGCCACCCAGCGCCGATACGGCTGTATGGATCGAGCGGTAACCGTTTGATTTCGGCTGGCTTATGTAGTCTTTGAAGCGCCCCGGCACCGAACGCCAGCGCTGGTGGATTGCCCCCAGCGCGCGGTAGCAATCGGCGTCGGTTTTCGTGATCACGCGGAAACCGTAGATGTCGGACAGCTGCGAAAAGCTTTGATCTTTCTGCTGCATCTTGCGCCAGATCGAATAGGGCTTTTTCGCGCGGCCATACACCTCGGCGGGGATGCCGGTTTTGCCCAGCTCCTCCAGCAAATCCTCGGTGATGCGTTGCACCAGATCGCCCGCTTCTTGTTGCAGCAGGCTGAAGCGCTGGATGATGGAATCGCGCGCCTCGGGGTTGAGGACGCGGAAGGCCATGTCCTCCAGCTCCTCGCGCATCCACTGCATCCCCATGCGACCGGCCAGCGGGGCATAGATGTCCATGGTCTCGCGGGCTTTTTTCGCCTGCTTGTCGGGGCGCATCGAGGCGATGGTGCGCATGTTGTGCAAGCGGTCGGCCAGCTTGACCAGCGTCACCCGCAAATCGCGCGAGGTCGCCATGATCAGCTTGCGGAAATTCTCGGCCTGCTTGGTCTCGGCCGAATGCAGCTGCAGGTTCGTCAGTTTGGTCACGCCATCGACCAGATCGGCGATGACGTGGCCGAACTGGGCCTCGACCTCGGCATAGGTG
GCGCGGGTGTCTTCCACGGTGTCGTGCAGCAGGGCGGTGACGATGGTGGCATCGTCCATCTGCTGCTGCGCCAAAATCATGGCGACGGCGACGGGGTGGGTGAAATAGGGCTCGCCCGAATGGCGGAACTGGCCTTCGTGCATCTGGCGACCAAAGTCCCAGGCATCGCGCAACAGTTTTTCATTGGTGGCGGGGTTGTAGGTGCGCACGCAGGTGATCAAATCATCCAGCGTTACAACGGGTTCGGGGACGACCTGCGCAACCCCGGCAGCAGCCAGTGGGGCGCCGGTGCGCCCCGCTGCAGGCAGGCCTTGATCAACTGCCACGGCCTTGCCTTTGTCAGTGCCTTGCATGGCGGCCTCAGCGGCCCTCTTGGGCTTGCAGCAATTCGCGCAGCAGGCGCTCTTCGCTCATGTCGTCTTGGGCCGGACGATCCATTTCGGCGCTCATCAGCAGAGCCATCGAATCGTCTTCGGGCTCGTCGACTTCGATCTCGTGCTGGTTGGCCTCGATCATGCGCTCGCGCAGGTCGTCAGCCAGCTGGGTTTCCTCGGCGATTTCGCGCAGCGAGACGACAGGGTTTTTGTCGTTGTCGCGCGGCACGGTCAGGGCGGACCCTGCGGCGATTTCGCGTGCGCGATGAGCGGCCAGCATGACCAATTCGAAACGGTTCGGAATCTTGTCCACGCAGTCTTCGACGGTAACACGTGCCATTCAACAACTCCCCTTTAAGACGGGTTTGACGGGTTCGGACAGCCCGGGCAAAGTCCACGATTCTGCATGGATCGCGCGCGACGGGCTAGGAAAGCAACCATTTAAAGCCGCGCGGGCCAGAACGCAAGGGCCAGTCGGATCGGGCCCGCGCAGTGCGGTTTCCCTTTGGAGAAATGGTTAACAATATATCGCAAATGGCAATCACAGTGGTCCTACACAGGATTGCTATTTATTCATGCGAAATAACGTATGTTATCGCAAACAAGTCGCGATTTGTCGCCATATTGCACATCTTTTCGTGAATTACTTGCAAGATGACATATCTTGGATAACTAATCCGATAGACCGCTGCTTGTTCATATTTGCAGGTTTATTTCCAAACGCAGCGGCCGAGCATGATTGGAAGGATAACACATGTTTTATCGGGACGAGCGAATTGCGCTCTTTATCGATGGCGCCAATCTTTATGCGGCATCGAAGGCCCTCGGGTTTGATATAGATTATAAATTGTTACGCAGCGAATTCATGCGGCGCGGGCGCTTGGTGCGGGCTTTTTACTACACCGCGCTGCTGGAGAATGACGAATATTCGCCCATCCGGCCACTGGTCGACTGGCTGCACTACAACGGCTTCGCCATGCGCACCAAAGCCGCGAAAGAGTTTATGGACGCCCAAGGTCGCCGCAAGATCAAGGGCAACATGGACATCGAACTGACGGTCGATGCGATGGAACTGGCCCCTCATGTCGATCATATCGTGTTGTTTTCAGGCGACGGCGATTTCCGCCCGCTGATCGAGTCGCTGCAGCGGCGCGGTGTGCGGGTCTCGGTCGTCTCAACCGTGCGCAGCCAGCCGCCGATGATTTCGGATGAACTGCGCCGCCAAGCCGACAACTTCATCGAGCTGGACGAGCTGCGCGATGTTTTGGGCCGCCCGCCGCGCGACCCGCGCCCCGAGGGGCGTACCCATCCCGCCCCGCGCGAGGATGCCGAAACCCACAGCTTGCTGGACTAACACCGACGGGCGCCCTTTACGGGGGCGCCCGCTGCGCATATCTCGGGTGGGAACGCCACAAGCACCGTCGGAGCCTGCGCCATGACATCTGCCCCCCTGACCGTTTATCTGGCAGCGCCGCGCGGGTTTTGCGCAGGGGTCGACCGCGCTATCCGCATTGTCGAAATGGCGCTGGATAAATGGGGCGCGCCCGTTTTTGTCCGCCACGAGATCGTGCACAATAAATACGTTGTCGATGCGCTGCGCGCCAAGGGGGCTGTGTTCGTCGAGGA
ACTGGACGAGTGCCCCGATGACCGACCCGTCATTTTTTCGGCCCACGGTGTCCCGAAGGCGATCCCGGCCGAGGCGCTGCGCCGCAATATGATTCACGTGGATGCAACCTGCCCGTTGGTCACCAAAGTCCATAACGAGGCCGCCCGTCACCACGCCAACGGCCTGCAGATGATCATGGTGGGCCACGCGGGTCACCCCGAAGTGATCGGCACCATGGGCCAGCTTCCGGCGGGCGAGGTGATGCTGGTGGAAACCGTCGCCGATGTTGCAACCGTGCAGGTGCGCGATGAGACCCGTTTGGCGATGATTACACAGACGACCCTGTCGGTCGATGACACCGCCGAAATCGCTGCCGCGCTGCGCGCGCGCTTTCCCGCGATCCTCGTGCCCGCCAAGGAAGATATTTGCTACGCCACCACCAACCGGCAAGAGGCTGTGAAGGTGATGGCGCCCAAATGCGACGCCATTCTGGTGGTGGGCGCGCCCAATTCATCGAATTCCAAACGCTTGGTCGAGGTGGGCACCCGCGCGGGCTGTGCCTATTCCCAGTTGGTGCAGCGCGCCGCCGATATTGATTGGCGCGCGCTGGAGGGCATCCGCACCATCGGCATCACCGCCGGTGCCTCGGCCCCCGAGGTGCTGATCGAGGAGGTGATCGACGCCTTCCGCGACCGTTACGACGTCACCGTCGAATTGGTCGTCACCGCCGAAGAGCGGGTCGAGTTCAAAGTTCCCAAAGTCCTACGCGAGCCTGCCTGATATGCCTGAATTCATTTGCTTTACCGACGGTGCCTGTTCGGGAAACCCAGGGCCCGGGGGGTGGGGCGTGTTGATGCAGGCGCGCGAGGGGGCCACTGTCGTCAAAGAACGCCCCCTGTCGGGCGGCGAGGCGATGACCACCAACAACCGGATGGAGTTGATGGCCGCTATATCGGCGTTGGAAAACTTCACCCGCGCGGCGCAGGTCACCATCGTGACCGACAGCGTGTATGTGAAAGACGGTATCACCAGCTGGCTTCACAATTGGAAGCGTAACGGCTGGCGCACATCGCAGGGCAAGCCGGTGAAGAATGACGACCTGTGGCGCCGGCTGGACGCCGAGGTCGCCCGCCACAGTGTGGTGTGGAAATGGGTCAAAGGCCACGCGGGGCACCCCGAAAACGAACGCGCGGACGAGCTAGCCCGCGCAGGCATGGCCCCCTTCAAGGTTGCCCGAAATGCGGGCAGCGGCGTTTAAACACTGGCAAGCGCCTGCGCGGCCTGTTAGCCAACTGCCGATGATTGCAGAAGGCTGACAATGTCCCGTTATATTCTGACTGTTACCTGCCCCATGACGCGCGGTATTGTGGCGGCCGTTTCCGGCTTTTTGGCCGAGGCCGGCTGCAATATTACCGACTCGGCCCAGTTTGACGATGTAGTCACCGGCCGCTTTTTCATGCGCGTCAGCGTCACTAGCCAAGAAGGCGCTTTGCTGGCGGATTTGCAGGCCGGATTTTCGCCCGTCGCCGCGCGCTTCGGCATGGATTTCGCGTTTTTTGATGCAGGCCAGCGCGTCAAGGCGGTGATCATGGTCAGCCGTTTCGGCCATTGTCTGAATGACTTGCTATACCGCTGGCGCATCGGCGCGCTGCCGATCGACATCGTGGGCGTCATCTCGAACCACCTCGATTACCAAAAGCTGGTAGTGAATCACGACATCCCGTTCCACCACATTCGCGTAACGCCCGAAAACAAGCCCGAGGCCGAGGATGCGCAGATGCGCGTCGTGCGCGAGACGGGGGCCGAGCTGGTCGTGCTGGCGCGGTATATGCAGATCTTGTCCGACCAGATGTGCCACGAAATGTCGGGCCGCATCATCAATATCCACCACTCGTTCTTGCCCAGCTTCAAGGGGGCGAACCCCTACAAGCAGGCCTATGAGCGTGGCGTGAAATTGATCGGCGCGACATCGCATTATGTGACCGCCGACCTTGATGAAGGGCCGATTATCGAACAAGAT
ACGGTACGTGTTACCCACGCGCAAAGCCCCGAAGACTACGTCAGCCTGGGCCGCGATGTGGAAAGCCAGGTTCTTGCGCGGGCCATCCACGCGCATATTCACCGCCGTGTGTTTATCAACGGCAATAAAACCGTCGTCTTTCCGGCCAGCCCGGGATCCTATGCATCGGAGCGGATGGGATGAAGAAATTCTTGTGTGTATTGGCCCTGTTGCCGCCGTTTTTGCCGCTGCCTGTGTTCGCGGAAGAAGGGCCCGAATGCCCCGACGCGACATCGACGGCCGACATTATCACCTGCCTGAACGACAAGCAGACCGAAGCGCAGCAGGAACTGGACAGCCAGTTGCAAATGCTGCGCGGCGCCCTAGAGGGTGAGCGTCTGGGCGTTTTGAACGCCGCGCAGGGGATCTGGGAAGATTACCGCAGCGTCGAGTGCCGCTCGCAAGCCCTGACGGTTGAGGGCGGTAGCCTCGAGGGCGTCTACGCGCTGGGGTGCCAGTTGGATTTGACGCGCGACCGCGTCAAAGTCCTGAACAGTTACGAGCTGCTTTAAAGCAGGCTTTCGGGCAGGGTCAGCCCCAAAAGCGCGTCTGCCACAGGCATGGCGCGCTTGCCCTCGCGCTGAATGTCCAGCACGCGCACGCTGCCGGTGCCACAGGCAATCTCGAACCCTTGCAATACGCGACCGGCGGGGCCGTTGCCGTCGCCTGCGGCGGCGCGCAGCAGTTTGACCCGCTCGTCCCCGATCATGCACCACGCCCCGGGGAAGGGCGACAAGCCGTTGATTTGGCGCGCAACGGCGAGGGCGGGTTGCGTCCAGTCGACTGCGGCTTCGGCCTTGTCGATCTTGGCTGCATAGGTCACCCCGTCTTCGGGTTGCACCTGCGGGATCAGCCCGTGCAGGCGGTCAAGCGTGTCGATGATCATCCGCGCGCCCATGTGCGACAGGCGCTGGTGCAGATCGCCGCTGGTGTCGGTCGGGCCGATGGGGGTAGCCGCGCGCAGTAGCACGGGGCCAGTATCAAGACCGGCTTCCATCTGCATAATGCATACGCCGGTTTCGGCATCACCCGCCATGATCGCGCGCTGAATGGGCGCGGCACCGCGCCAACGCGGCAGCAGGCTGGCGTGGATATTCAGGCAGCCGTGCGTGGGCGCATCCAGCACGGCTTGCGGCAAGATCAGGCCATAGGCGACAACAACGGCGATATCTGCGTTCAGCGCGGCAAAGTCGGCCTGCGCCTCGGGCGTGCGCAGAGTTTTGGGGTGGCGCACATCCAGTCCCAATTCCAGTGCGCGCGCATGGACGGGAGTGGGCCGGTCTTTTTTACCGCGGCCCGCAGGGCGGGGTGGTTGGCAGTAAACACAGGCAACCTCGTGCCCCGCCTGCACCAGCGCATCCAGAACCGAGACGGAAAAGTCGGGTGTACCCATGAAGACGACGCGCATCAGGCGAATTTCCTTGCCTTGCGCAGCAGCATGTCGCGCTTCAGTTTGCTCAGGCGGTCGAAATACATCTTCCCCGCCAAATGGTCGATTTGATGCTGCACCGATGTGGCCCACAGGCCAACCAGATCACGTTCCTCAACCTCGCCGGCCGCGTTCAGGAAGCGGACGGTGACAGCGCGGGGGCGGCTGACGACAGCACTGACGCCCGGCAGGTTCGGCGAGGCTTCCTCGTGCTCGCGCAGCTGCACGCTGGCGTGCAAAATCTCGGGGTTGGCCATGCGTATCGCTTGGCCGCGCGCGTCCGATGCATCCACAACCGCCAGCGCCAACGGCACCCCCAGCTGCACGGCGGCCAAGCCGACACCCGGCATCGCGTCCATCGCCTCGATCATCTCGTCCCACAGGGCGGTGATTTCGGGCGTGATGGCAGTCACGGGGGCGGCGGGCGTGCGCAGGGCAGGCGCGGGCCACTGAACGAAGGGGCGGGGCATCATTCGCCCCGCGCGCGTTCGCGCTTCAGTTTTTCCATTTTGCGGGTGATCAGCTGGCGCTT
CATCGGGCCCAGATAATCAATGAACAACTTGCCGTCCAAGTGGTCGATTTCGTGCTGCACGCAAGTCGCCCACAGCCCCTCCATCTCGCGCTCTTGTTCGGATCCGTTCAGGTCCAGCCAGCGGACTTTTACGGCGGCGGGGCGTTCGACTTCGGCATACTGGTCGGGGATCGACAGGCAGCCTTCCTCGTAGACAGACCGCTCGGGCGAGGTCCAGATGACCTGCGGATTTACCATGACCATCGGCTGCGGATCTTCGTCCTTGGCGCAATCCATCACGATGATGCGCTGCAACTGCCCGATCTGCGGGCCAGCAAGGCCGATGCCGGGGGCGTCATACATGGTTTCCAGCATGTCTTTCGCCAGCGCCCTGATCTCGTCCGAGATGTCGGGCAGCGGTTTGGCGATAGTGCGCAGGCGCGGGTCGGGGTGAATGATGATGGGACGGGTGCTCATGCCTCGCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP019937|414917:423306|421020_421392_+|WP_085785445.1|DBSCAN-SWA MKKFLCVLALLPPFLPLPVFAEEGPECPDATSTADIITCLNDKQTEAQQELDSQLQMLRGALEGERLGVLNAAQGIWEDYRSVECRSQALTVEGGSLEGVYALGCQLDLTRDRVKVLNSYELL >NZ_CP019937|414917:423306|417201_417555_-|WP_085785440.1|DBSCAN-SWA MARVTVEDCVDKIPNRFELVMLAAHRAREIAAGSALTVPRDNDKNPVVSLREIAEETQLADDLRERMIEANQHEIEVDEPEDDSMALLMSAEMDRPAQDDMSEERLLRELLQAQEGR >NZ_CP019937|414917:423306|421388_422288_-|WP_085785446.1|tRNA|DBSCAN-SWA MRVVFMGTPDFSVSVLDALVQAGHEVACVYCQPPRPAGRGKKDRPTPVHARALELGLDVRHPKTLRTPEAQADFAALNADIAVVVAYGLILPQAVLDAPTHGCLNIHASLLPRWRGAAPIQRAIMAGDAETGVCIMQMEAGLDTGPVLLRAATPIGPTDTSGDLHQRLSHMGARMIIDTLDRLHGLIPQVQPEDGVTYAAKIDKAEAAVDWTQPALAVARQINGLSPFPGAWCMIGDERVKLLRAAAGDGNGPAGRVLQGFEIACGTGSVRVLDIQREGKRAMPVADALLGLTLPESLL >NZ_CP019937|414917:423306|418650_419601_+|WP_085785442.1|DBSCAN-SWA MTSAPLTVYLAAPRGFCAGVDRAIRIVEMALDKWGAPVFVRHEIVHNKYVVDALRAKGAVFVEELDECPDDRPVIFSAHGVPKAIPAEALRRNMIHVDATCPLVTKVHNEAARHHANGLQMIMVGHAGHPEVIGTMGQLPAGEVMLVETVADVATVQVRDETRLAMITQTTLSVDDTAEIAAALRARFPAILVPAKEDICYATTNRQEAVKVMAPKCDAILVVGAPNSSNSKRLVEVGTRAGCAYSQLVQRAADIDWRALEGIRTIGITAGASAPEVLIEEVIDAFRDRYDVTVELVVTAEERVEFKVPKVLREPA >NZ_CP019937|414917:423306|414917_417194_-|WP_085785439.1|DBSCAN-SWA 
MQGTDKGKAVAVDQGLPAAGRTGAPLAAAGVAQVVPEPVVTLDDLITCVRTYNPATNEKLLRDAWDFGRQMHEGQFRHSGEPYFTHPVAVAMILAQQQMDDATIVTALLHDTVEDTRATYAEVEAQFGHVIADLVDGVTKLTNLQLHSAETKQAENFRKLIMATSRDLRVTLVKLADRLHNMRTIASMRPDKQAKKARETMDIYAPLAGRMGMQWMREELEDMAFRVLNPEARDSIIQRFSLLQQEAGDLVQRITEDLLEELGKTGIPAEVYGRAKKPYSIWRKMQQKDQSFSQLSDIYGFRVITKTDADCYRALGAIHQRWRSVPGRFKDYISQPKSNGYRSIHTAVSALGGKRVEVQIRTREMHEVAEAGVAAHWSYRDGEPVKNRFVVDPVRWISQLSERFEEDKDHDEFLETFKLEMYQDQVFCFTPKGEVIKLPQGATPIDFAYAIHTRIGHACVGAKVDGLRVPLWTRLKNGQSVEIIIADGQTPQATWIDIAVTGRAKSAIRRWLREKDRDRFIKLGTELTRVAFENAGKKATDKALATAARALALENAEQLLLRVGAAEITAREVVRAIYPDLKLQNANEIDAEKAIVGLSADQSFRRAPCCQPVPGERIVGITFRGQGVIIHAIDCPNLSDYEDQPDRWVDLRWHEGQHKAVSAVSLELSMANDAGVLGRICTLIGEQNANISDVDFLDRKTDYYRIRVDVDVRDAEHLHRVMTALDADSHISSLVRLRDAQRAMNWSAALPASGSAGH >NZ_CP019937|414917:423306|417969_418569_+|WP_085785441.1|DBSCAN-SWA MFYRDERIALFIDGANLYAASKALGFDIDYKLLRSEFMRRGRLVRAFYYTALLENDEYSPIRPLVDWLHYNGFAMRTKAAKEFMDAQGRRKIKGNMDIELTVDAMELAPHVDHIVLFSGDGDFRPLIESLQRRGVRVSVVSTVRSQPPMISDELRRQADNFIELDELRDVLGRPPRDPRPEGRTHPAPREDAETHSLLD >NZ_CP019937|414917:423306|422778_423306_-|WP_157115600.1|DBSCAN-SWA MRGMSTRPIIIHPDPRLRTIAKPLPDISDEIRALAKDMLETMYDAPGIGLAGPQIGQLQRIIVMDCAKDEDPQPMVMVNPQVIWTSPERSVYEEGCLSIPDQYAEVERPAAVKVRWLDLNGSEQEREMEGLWATCVQHEIDHLDGKLFIDYLGPMKRQLITRKMEKLKRERARGE >NZ_CP019937|414917:423306|422287_422779_-|WP_085787193.1|DBSCAN-SWA MPRPFVQWPAPALRTPAAPVTAITPEITALWDEMIEAMDAMPGVGLAAVQLGVPLALAVVDASDARGQAIRMANPEILHASVQLREHEEASPNLPGVSAVVSRPRAVTVRFLNAAGEVEERDLVGLWATSVQHQIDHLAGKMYFDRLSKLKRDMLLRKARKFA >NZ_CP019937|414917:423306|420139_421024_+|WP_085785444.1|DBSCAN-SWA MSRYILTVTCPMTRGIVAAVSGFLAEAGCNITDSAQFDDVVTGRFFMRVSVTSQEGALLADLQAGFSPVAARFGMDFAFFDAGQRVKAVIMVSRFGHCLNDLLYRWRIGALPIDIVGVISNHLDYQKLVVNHDIPFHHIRVTPENKPEAEDAQMRVVRETGAELVVLARYMQILSDQMCHEMSGRIINIHHSFLPSFKGANPYKQAYERGVKLIGATSHYVTADLDEGPIIEQDTVRVTHAQSPEDYVSLGRDVESQVLARAIHAHIHRRVFINGNKTVVFPASPGSYASERMG >NZ_CP019937|414917:423306|419602_420079_+|WP_085785443.1|DBSCAN-SWA 
MPEFICFTDGACSGNPGPGGWGVLMQAREGATVVKERPLSGGEAMTTNNRMELMAAISALENFTRAAQVTIVTDSVYVKDGITSWLHNWKRNGWRTSQGKPVKNDDLWRRLDAEVARHSVVWKWVKGHAGHPENERADELARAGMAPFKVARNAGSGV |
10 | Synechococcus_phage(33.33%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
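The keyword rule described above can be sketched in a few lines of Python. This is only an illustration: the keyword list is copied from the text, but the function name `phage_keywords_in` and the case-insensitive substring matching are assumptions, since the page does not state how the tool actually matches keywords.

```python
# Keywords quoted in the text above; lower-cased so matching is case-insensitive.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def phage_keywords_in(annotation: str) -> list[str]:
    """Return the phage-related keywords found in a protein annotation.

    Hypothetical helper: plain substring matching is an assumption, so it
    will also flag look-alike words (for example 'lysine' contains 'lysin').
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


if __name__ == "__main__":
    # 'tRNA' is the key-protein annotation reported for DBSCAN-SWA_1.
    for annotation in ("tRNA", "phage tail fiber protein", "hypothetical protein"):
        hits = phage_keywords_in(annotation)
        print(annotation, "->", hits if hits else "not colored")
```

A protein would be colored when the returned list is non-empty. A stricter variant could match whole words only, at the cost of missing compound names such as "endolysin".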
DBSCAN-SWA_2 |
877201 : 912034
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP019937|877201:912034|DBSCAN-SWA GTCAGGCGACTTCCGGTAAAGCGACAGCCTCAACACAGGCGCGCATTTGCGCTGCTGAAGAGCGGGCGATGTAAGACGCGCGGGTGACGTTCTTGCGGTCAGGAACGTGGCCGATCACATCCTTGATAACGGCCTCTTGTTGCCCTGCGCGGTCTGCTGCCGTGGCGAACCATCGGCGCGCAGAATGGAAATTCACTAGACTGCGGCGCTTCCCTTCTCGGACTTCATCGACCCCAAGCGCCTTGCGGTATCGTGCAAAGCGTTTGCTAAACGTGTCTCCAGCGTCACGCTCGGACTGCAACTCCTCAAATAGCCAAGCCTTGCCGTCTTTGCCCGCTGTCCTACGCCGCACAATGTCCAATAGGGTCGAATGCACGGGAACAGGGCGGGCGGCGGCTTCGGTCTTCCCTTGTTGGATGTCGAAGGTTAGCCCGTCTTTGTCCTCTCTGACCTCCTCGACCCACAGGGTCAGTATCTCTGCCATCCGCATACCGGACAACAGCGATATACGCAGGGCATCTTGAATGGCGGGTAGGTGGTCGGGGCTCATTCCTTTAGGCGGTTGGCTGTTCAGCAGCAGTGCAATCTCTTGGCCGGTGAATGCGCGTTCCTCGTCACGACCGCCGCGCTCAGCGCGCTTGCCACGGCGCAACACCTGCTGACCATCCCAAGGCCACCCAGACGTCATAAGCTCGCCCGTGGGCAGCTTTACATGCCCCCGCGCAGCTAACCAGCGCCAATACCCGCGTAGGGCCGAAAGGTGCTTTTTTTGGGTCGCTGGGTGCATCTCGTCGATGACCTTGGACACATACGTGCCCGCCTGTTTCCGTCCGATCCTGTCAATGGTCAGGGCAGGACGTTGATCTGCGGCCCAGCGAGCAAACTGGGCAAGGCTTCCACGCCGGTCGTTCCGTGTCTTTGCTGCGAAGTCTGCTGTCTTCAGGTAGGATTCTAGGTGCGCGTCAACGGTCTCGCGTCCGTGGAGAGCCCCCAAGAAGGCGTCGCGGTCATCTTGATCCTTGAATGCATCGACGGTTGCGTCCAGCGACCATAGGGCAGCGTCATAGGCGCTAATGGGGCTGTCTGGGTCGCCGTGATCGTCATCTTCAAGTTCAGCCAGTGTCTCGCGGTAGAGCAACCCTTGGGCGCGAGCGGTGAGCTGTGCGGTGTCTGCGGGCTTCCCTGCTAGGATGTCCTCAAAGAGCGCCGAGGTCGTTCTCTCCAGTTCGTTGCGACGGCGCTGGGCGGTCGTGAGGTCCGACGTTTCGAGGTTGATGGAGTAAGTTACTCCAAGCCCGAGAGAGGCCTGACAGCGCGCGGGAATCGCACGGCGGAACCACCAGCGATTATGCCGCAGAACGAGATAGCGCTGCCCTGCTTTTTTCGTTATTCCTGCCACTGTTGCACCATTCTAACCAGCCATTCTGACCAGTTTTATCTATGCAACTTACTGATACTAAGCAATAACATATAGTTAGATGCCGTTGGTGTAGATCCTCCTCGGCTCCACCAAGTAATATCACATAAAGCATTGAAATTAAACACATTTATGTGTCGTATGGTTTGATCTATACCACAATTTATCCCACATCTTGCAGCGGATGCCCTAAGCTCAGAACTCGTTCTTGATCATAGCCAGAACAGGACGGTTGCGGCGAGGATGATGGCGGAGAAGAAGGTCTCTGGACACCGGCCGTGTCGGGTCGCCACGCGCCGCCAGTCCTTGAGGTGGCCGAACATGATCTCGATGCGCTTGCGCCTCTTGTCATATTTGACGGCCTTTTTGCGGGACTTCCGGCTTGGGATGTAGGCCTTTATCCCCTTGTCTTTCAATGCATTTCTGAACCAATTGACATCTTAACCTCGATCGCCGAGCAGCCATCCTGCCTTCGTCCGGCTGCCGATCAGTGGCGCCGCGCCAGTAT
AGTCGCTGACCTGTCCAGCCGACATGAGGAACCCGATCGGGCGCCCCTTGGCATCAGCAACCGCGAACAGTTTTGTGTTCATGCCGCCTTTAGTGCGCCCGATCTGGTGCCCGCGCCCCCTTTTTCACCCCAAGGCTTGATGCCGTGCGGTGCGCCTTCAAATACGTCGTGTCGATCATGATCGTCTCGTGCTCGGCGCGCTCAGTGGCAAGCCCACCAGGATCCGGGCGAAGACCCCGTTGTCACTCCATCGCTTCCACCGGTTGTAAAGGGTCTTCGCCGGCCACATTCTTTCGGCGCATCGCACCAACGCAACCTACTGCAATTGATGAAGATTATGCCACTGAGGACGCGACGATCCTCCACGCGGGGCTTGCTATGGCTCTTGGGTAAGAACGGCGCAAGGCGCGCCATCTGAGTATCGCTCAGCCAGTAAAGATTGCTCATCGATCAGGTCTCCGCGCGGAGCCTGAATCAAGCCAAAATCCCGAAATCAATGGGTCCTGAGCCTAAAAATCGTAAACAGTTCTTCATTTATTTGAAATATGGAAGCAAAGACGCTGCAGTGTGGTGTTAACATGCGGCCTGTTTGCTGTGTTCGTGAGTCGTGGATGATTAGCGCAGCAGCGCCAGCTCGATCTGGATCAGATTGGCCCGATAGCTGACAGTCACCGTCACAAGCGTGACGAAGGCGATCAACGCGATACATCGGCCTATGGGGGCGGATTTCATCCAGATCGGCTGGCCGCGCCACAACCACAGGTGGCCCATTAGGAACGGCGGGATGGGCAGCAGGTTGAAGGCGACATACCAAAAAACAGTGCGGCGGAATGCGTCGATTAACGCCATGGTGACATAGGCGGGCATCGCGCCGGGGAACGTGTGCGCGATGAGGGGGCGCAGGTGCATCAGCGACTGCGCAAACAGCGCCAGCAGCGCCAAGCTGATGAGTAATAGCGGCAGCAGCGATACTGCGCCGCCCCGCAATGCGCGGGGGGTGATCTGCATGGGCCGTATCCAGCCGATCTGGGTAAAGGTGATGGAAAGCAGGCTGATCGGGCTGGCGGCGCGTGTTGGTGAAAGTGTCAGTCGGCCATCTTGGCGGGGTCCGCGATCGCCGGCCAACCATGTCAGCGCCGCAAGGATCCAGCTGTGCAGGGTCACAGTGACGACACCGACCAGCACCGCATATGCGATGGTCTGCGGGCTTTGGCTGTAGAAGGTAAAGGGGTTGATCATGTGGCGTCCTCGGGCAGGGGCGCGTAGCCGTAGCGCTCGGCGATTGCATCAATCTCGCCGCTGTCCTGCATGTCGCGAAGGGCGGCGCTGATCGCGCGTTTCAGGGTCAGATCGCCTTTCCAGAACCCCATCGCGATGGAATCGCGGCGCACATTGGGGACAAGCGAGACTGTCAGGTCGGGCTGATCGCCAAGGCCCGCGGCCAGAATCGCGTCGGTCACCACCATGTCCAGCTGGCCCTGTCGCAGCGCGGCGATAGCGCCTGCCAGTTGGGGCTGCCCGACGATGGTCGCCGCGCCATTTTCGCGCAGCGCGCGGCTGAGCTCTAGCCGGTCGCGCCCCAGCGAACCGACGAAGACGCCCACACGGGCACCTTCCACTGTTTGCGCGGGGTCGCGCACAAGGCTGACCCAGCCGCTTTGCAGATAGGGGGCGCTCATATCCAAAAAGGTGCGCATGGTCGTGCTGTCGACCAGCCCGCCGATGATGATGCTGCAACTGGCGCGGTTCAACTTCCACGCGGCGGGGTTCATGTCGCGGCCGATGGCGGCGTTGGTGTTGAACGCGGCGCGCACGCCCAACCGCGCGGCGACGGATTGCAACAATTCGATCTCGATCCCCGGCGCGGCGGCTGTGCCGGTCACGAAGGGTGGGTAGCTTTGCGGCGTGCAGACGCGCAGCACGCCCTGTTTCTGAACCTGCTGCAGCGCGGTGTCGGGCGGCAGCAGGTTGACCGCCGAAAACAGCCCCAGAACCAATGC
GATCATGCCCATGTCGCGTGCGTGGCGGCGAAGAAAGCCCATGCTACACCTTTTGAATGCGCAAAAAGCGCAGCAGCGCCAAGCCAAAGAATATCGCCGCACAAATCGCGATCATCACCGCGCTGCCTTTCAGGCTGAAGCTGTGAAAGCCGATGAAGATGCCGCGCAGCAGGTCGTTCACGTAGGTGATCGGATTCATCGACACCAGAATGACCATCCATTCGGGGTAGAGGGCCTTGGCCTGCGCCGCCCCCAGCGCTGGGTCGATGGGAAACATGGCGTTCGATGTGAAATACAGCGGTAGTAGCACGGCATTCGAAAACAGGCCGAAGCCTTCAAAGCTGCTCGTACGCGTGGCAAGAAAAATGGCAAAGCAGCTCATCGCGAAGGCGAAGGCCGCCATTGCGCCAAGGCCCGTCAGCACCTGCGTTGCCGTCAGCGATACATCGACAATCGGCGCCAGCAGCAGCACGATTGCGCCTTGCAATGTGGCGATTGTCGTGCTGCCCAATGCTTTGGCGAACATGATGGTGCCGCGCGCGCGCGGGCTGGTAGCCAGATGCTGGGCAAAGCCGTATTCGCGGTCGGCAATTACCGTCATCGCGGCTTGCATGGCGCCGAACATAATATTCAGCGAGATAACCGCGGGAAAGATGAACTGCAGGTAGGTATAAGGCACGGCAAAGCGGATCTGCCCAAAGCTTTCGGCGCGAAAGTAGGGCCCAAGGCCGATGCCCATGATGACGACCCACAGCAAGGGCCGGCTGATGCACCCCACCAGCAGCGCGCGGTCACGCAGGAATTTCTGGACGTCGCGCTTCCAGATGGCGTGGATGGCCTGCAAGGTGTCGATCATGCCGATACCCCCGGGTTTACGCCTTCGGTCGCGCTGATGATCGCCCGCTCCAGCGGGGCGCGGCCAGCGGCGACCTTGGCGCCATAGCGCGCGATTGCGGCAAGGTCGGCAGGGGCGGCGCTGGTCGTGGTTAGCAGCACGTCGCCTGTGGCGGCGTCCAACGTGGCATCGTGCAGGGCGACCAGAAGTGCGGCGCGCGCGGCCGAATCTGTCGGTGTGATGCGGTAGTGCTGCCCACCAAATTTGTCCAACAGCGCATCGGTCTGCCAAATGCCGTGCAACTTGCCCCCGTGCATGATGCAGGCGCGTTCGCTTTGGGCGACTTCCTCGATGAAGTGGGATGTCATAATGACGCTTACGTTCTGGGCATTACGCCGCGCCGCGATTTGATCCCAGATCTGTTGACGCGAATGTGCATCAATCCCTGCGGTCGCCTCGTCAAGGATCAGCACTGCCGGCTCGTGCAGCAGGGCGCGGGCGACTTCGACGCGCCGCTTTTGCCCTGTGGATAGGGTGCCGACGATGCGGTCGGCCAGCGGTTCAAACTGCGTCAGCGCCGCATCAATTGCGGCTTTGCGGCGCGGCCCGCGCAGCCCTTGCAGTGCTGCGGCGAACAGCAGGTTCTCCAAAACGCTCAGGCGGGCATCAAGGCTGCCCGATTGCGGTACGAATCCCAGATGCTGCCGCGCGCGGCCCACGGGGCGCATTTTGCCCTGAAAATGGAAGAACGCTTTGCCGAAGTCTGGCCGCAGTAAACTGGCCAGCACCTTGATCAATGTGCTTTTGCCCGCCCCGTTGCGCCCCAAAATGCTGAACCATTCGCCCTGCGCGATCTGCAAAGACACCCGATCAAGTGCGACGTTCCGGCCAAACTCTACCGTGACCGCATCCAGCGTGAACAATTGGCTGCGACCCGTCATAGCCGCGCACCGCGCAACCGGCAAAATGCATGGGCGGGCACCCGTCGGCGGGGTTTTGATACTTGGTATTTCTCAATGAATTTCATATGATGCGTCGGAAAGAAGCGGTGCTACTTTCCCTTACGTTCCTGTGGGCCGATCGGGTTAAACTGCGGGAACAAGCAAAAATTCCTGATTGCTTTATCTGGGCTTCGTGATAGCGACAAAAGGCGCCATTGACGAAT
CTTAATATCACGACGGGTACGTTCAATGCGCATGAGTCGCAACCTGTCGTCCTCCCCCGGCAGGTTGATTTGTTCCAGCAACACTCCGTCACGCTACGCGCGGCGGGGTATACGTTAGATAGAGGACGGAACGCCGTGAAATCTTCATTCCTTTTGCGTGCCAGCGCTGCCATCGTGCCGATGGTTGTTGCCGCCGGCGCTGCGCAAGCGGCAGATGGTTTCTTTACCGCCGAGCAGGTTGCAAACGGTAAAGAACTGTATGACGCACGCTGCAGCGTCTGCCACGCCCAAAGCATTCGTGACAGCTACGCGAACTCCAGCGCGACTGCTGCCCTTATTGTTGAATCGATCGTCTCGAACGGTATGCCGCTGGATAACCCCGGTGGTCTGCCCGCGCAGGACTATGTCGATATTGCCGGCTACATTCTGAACCACAACGGCATGGCCCTGGGCGATGAAGTTCTGGCCGGTTCGGCAGCGGTGCAAGTTGCCGCCATTGGTGCCGACCCCGCAGACATCGCCAATGTCGTTCTGGAAGAAGTCGTCGACGACACGCCCCAGCGCGAATACTCGCCCGTCACGTCGGAAATGATCCTGGATCCGAACCCGGCTGAATGGCTGCAATGGCGCCGCACGGTCGACAACCAAGGTCACAGCCCGCTGGACCTGATCAACCGTGACACGGTTGGCGATCTGGAACTGGCTTGGGCCTACCCGATGGGCGTCCCCGGCCTGCAGGAAGTGGCCCCGATCGTTCACGACGGCATCATGTTCCTTGCGACGAACCAGAACAACGTGATGGCTGTCGATGCCGTCACCGGTGACACCATTTGGATGTACACCCACCAGCGCCCCGAATTCGAAGGCGCCTACCACAGCCGCCAAGCCGAACGTCAAAAGAACTCGGTCGCCCTTTGGGACGACAGCGTCATTCTGACGACGGTTGACGGCAAGCTGATCTCGCTGAACGCCCTGACCGGTCAAAAGGAATGGGAATTCCAGGTCATGGATTGGGAAAAAGGCTACAGCTACACGGCTGGCCCTCTGATCGCTGACGGCAAGATCTTCACTGGCACCTCGGGCTGCTCGATCGCGGGCACCAACGGCGGCTGCTACATCACCGCCCACAACGCCGACACCGGCGAAGAAATCTGGCGCTTCAACACCATCGACGATCCGAACAACCCGCTGGTTGACGAATCGTGGGGTGGCGTTCCGGCTGAAAACCGTTGGGGTGCGACGCCGTGGGCGACCGCGTCCTACGATGCCGAGCTGAACATGGTCTACTACGGTACCGGCATGCCGATCCCGTATTCGGAACATACCCGTGGTACCGGCGAGGGTTCGGCCCTCTACACCAACTCGACCCTGGCGCTGGACGCCGACACGGGCGAGCTGAAGTGGTACTACCAGCACATGCCGCGCGACAACTACGATCTCGACTCGCCGTTTGAGCGGATCATCATCGAAGAAGAAATCGATGGCGAAATGCGCAAGCTGGTCGTCTCGACCCCCGGCAAGAACGGCATCACCTTCGCGCTGGACGCGGCAACGGGTGAATTCGTTTGGTCGAAGGAAACGATCTACCAGAACGTCATCGACAGCATCGACCAAGAAACCGGCGAGATCACGCTGAACCTCGACACCATTCCGAGCGAAATCGGTGAAGAAAAGCTGTTCTGCCCGACCTTCAACGGTGGCCGTCTGTGGCAGGCAACTGCTTACAGCCCCGACACCGGCATGTTCTACCTGCCGGCTGCGAACCTGTGCCAAACCATCACCCCGCTGGCCTTTGAAATGGCCGCGTCGGGCGAGACCATGGGCATGGCCAACACCGGCCCGCAACAGCTGGCCCCCGGCCACGACAACGTCGGTTCGCTGTTCGCCCTGAACGTGGTTGACGGTTCGGACGCCTTCGAAGTCGAGCAGCCCGCTCGCTTCTCGTCGTCGGTTCTGGCAACTGGTGGCGGCCTGATCTTCGTTGGCGATGCCAACCGTTG
GGTCTACGCGATGAACGACGAAACCGGTGAAGTCCTCTGGAGCCAGCGTCTGCATGCTCCGATCGGTGGCTACCCGATGACCTACGAAATCGACGGCGTGCAATACGTCGCCATTCCGGCAGGCCAATCGGCAACCACCCAAGTGGCCCTGACCCCGGGTATGAGCCTGCCGCCGATCACCGGCGCCAACATGCTCTACATTTTCCGTCTGCCGTCGTAAAAGAAACCGTCTGCACCGGCCATCTGGTCGGTGCAGACCCTTATTCGAAAGTGCCTGCCATGCGTGCCTTGCTTGCCGCCGCCCTGACTATCGCCCCTGCCATTTTTCCGGTCGCCGCATCTGCGCAGCTGATGGAACAGGTGGCAAATCCCCAGCTGATGAACCCCAACATTTTGCCGTCGGGCAGCCAGCTGCGTATTTGCCATCAGAACGGGCTCGTCACCGCCGACCTCGACATAGCCATCGCGCGCGAAATCGCTGCGCGGCTGTTTCTGGAAATTGAAGTGAACGTCCTGCCGTCGGGCTATGGCGTCGGAGGCGAATTTGCCGCTGCCGATCTGCTGGTAAACCTCAGCGCGCAGTGTGACGCGATCTTCGGCATGGGGATTGGGGCGAATATTTATCCGGCCGAATTCACAGTGACGCAGCCCTACGTGGCCCACAGCTTCACCTATCTTGCGACCAACCCCGATTATCAGCGCCTGACCGACATTCCCGCAGGGCTGCGTGTCGGGGTCGAGATGGCGTCCTATGGCTCGTTCGTCTTCCGCCAATTCAACAGCCTGCGCCCCGAATCCGAACGACTGGGCTTTCTGCCTTACGCCGACCACACGTTGATGCTGACGCGCCTGCAAGACGAGACGCTGGCCGCAGTCAGCATTTACGCGCCGTTCTGGCGGGCAGGGGCCGATACGCCCGTCTACCAAGGCGTGCACGAGCTGCAGCGCATGCCCGAACTTGCCGCGCGGGTGAATACCGGCGCGCTGCTCCTCAGCCAGAACACCTTCATGCGCGGTGAACTGGATGCTGCGATCGCCGATCTTCTGGCCGACGGGACTGTCGCCCGGCTGATCGAAGAACTGGGCTACGACGCCTACGGCACCACGGTTGCGCAGTAAGCGCGTAAATTCTGGGCGTTTGCCTTGCTGCTGTGCCTTGTCGCAGGCACAGCAGCGCGTCACGGCAATTGTGATCGGCTGCGGTGCGCGAAAATGGTGAAAACGTTAGGCCCGCCATCATCATCCCGCTACCCTTTGACCATACGTTAACCGCCATTGCAGGAGGTGACCGATGGCACAGGGGTATAAAACAGGCGCGTTGGCCGCGTTGAAATCCACCATCCAGACCGCGCGGGCATGGGTCGAGGCACGCAGCCTGCCCACCTTGCACGCAATCGACCCGATGATCGGGTGGGGCGGCTGCGTCGACACGCTGGCCTTGGCCCACACCCGTTACGTGGCGATCCCCGTCGTCGTCACCAAACGCTAGACCTGCAGGTCTAGCGTCGTCGCGGTGTTGGTGATCATCAATGTGCGTCCGATGGCTGTGGTCGCCGTGGGCACCAACAGGAACTGAACCTGCGCGGGTTCAATCGGCTCGCGCGGAGACAACCCCGTAGCGCAGGCCCAAAGGTTGGCGTTGAAGCGCACTTCGCGCCCCGTCGCGCGCACGGCCCACCCGCCGCCTGTGCGAGGCGTAAAGCTGACTTCGCCGCCATAGGGCAGGCTGCTATCGCAGCACAGCGCCAGCAAAGCCAAAACCTTTGCCTCGTTACGCGGCAGGGGCTTCGTGCCGGCGTGCGTGTATTTGCTGCGCCCGCCTGCTGCCAGCCCTGCCAGAATGCCCGTCCAATCGCGTACGGAAATCTCGCCACTTCCCGCAGGGCCGAATGCGATGCGCAGCATCTGCAGCCGCGCGGTTGCGCTTTTCACGCTATCGCTGATCAGGTCCAACTCGGGCGAGGGGGATACGAGGCCCAGCAACTCGACCCCGTTGCTGA
TGGCGCCCAAGGGGTTGAACAGGTCGTGGCTGATGCGCGATGTAATAAGCGCTGTCAGATCGGCTTGCATGGCAATGCACCTTTCGGGCTGGAACTGGGTTAGGGGCGCCGCGTGGCGCCGGAAAGGCGGACTATGGCCGATTTGAACGCAATGCTGGCACCCGGAATGATCGTGCGCCACCCCGACCACCCTGAATGGGGCGATGGCCAAGTGCAATCGAACATTGGCGGCAAAATCACCGTGAATTTCCGCGAAACGGGCAAACAGGTCATCGACGGCGCGCGGATCTTTCTGGTGCTGGTGTCATTCTGATACGGCCTGCACCGGCGGGAGCCGATCAAGCTTGCGGTGATTGTGGCATTTACTGCCTAACAAGTTCTTAACGGGGGGCAAGCGGGCGCACAAACACGATTGTCTGCTTGCCTTGAGCGGTGAAATTCTTTATCCGCCGCAGGGAATATGGCCAGCCGGGCTCTCTGGCACTGGCCGGCTATGGATCGAAAGCTGAAGGACGGAAAAATGCAGGTCACCGAGACCCTCAACGAAGGACTGAAGCGCGGCTACGCCATCACGGTGACCGCTGCAGAACTGGACGCGAAGGTCCGCGAGAAGCTGGAAGAAGCCGCCCCCGAGATCGAGATGAAGGGCTTCCGTAAGGGTAAGGTGCCCTTCGCGCTGCTGAAAAAGCAGTTCGGCCCGCGCCTGCTGGGCGAAGCCATGCAAGAGGCCGTCGATGGCGCGATCAACAGCCATCTGGAAGCCTCGGGCGACCGCCCCGCGATGCAACCCGAAGTGAAAATGACCAATGAAAATTGGGAAGAAGGTCAAGACGTCGAAGTGTCGATGACCTATGAAAAGCTGCCGCAAGTGCCGGAAGTCGACCTGTCGGGCGTTGAAATCACCCGTCTGACCGTCAAAGCCGACGAGGCGTCGATCGAAGAAGCGCTGACCAACCTGGCCGAGACGTCGAAGCAATTCGAAGACCGCCGCAAGGGGTCGAAGGCCAAAGACGGCGACCAAGTCGTGATCGACTTCGTCGGCAAGGTCGATGGCGAAGCTTTTGAAGGCGGTTCGGCTGAAGACTACCCGCTGGTTCTGGGCTCGAACTCGTTCATCCCCGGCTTCGAAGAAGGTCTGGTCGGCGTGAAGGTTGACGACGTCAAAGACGTCGAAGTGACCTTCCCCGAAAACTACGGCGCGGCCCATCTGGCTGGCAAAGCCGCTGTTTTCACCTGCACCGTCAAAGCCGTGAAAGAGCCGAAAGCCGCCGAGCTGAACGACGAGCTGGCCAAGCAATTCGGTGCCGAAGATCTGGCCGGTCTGAAGACGCAAATCGCGACTTCGCTGGAAGGTGAGTTCGCTGGTGCCGCCCGTGCAGTTGCCAAGCGCGCCCTGCTGGACAAGCTGGACACGCTGGTCTCGTTCGAGCTGCCCGAGTCGCTGGTCGAAGCTGAAGCTGGCCAAATCGCGCACCAGCTGTACCACGAAGAGCACCCCGACGATCACAACCACAGCCACGGCGCCATCGAGACCACCGACGAGCACCGCAAGCTGGCTGTGCGCCGCGTGAAGCTGGGCCTGCTGCTGGCCGAGCTGGGGCAGAAGAACGAGATCAAGGTCACTGATTCCGAGCTGACCCAAGCGATCCTGAACCAAGCCCGTCAATACCGCGGTCAAGAGCGTCAGTTCTTTGAATTCGTGCAACAGAACGCTGCTGCCCGTCAGCAGATCCAAGCACCGATCTTCGAAGACAAAGTCGTCGACTTCATCTTCAGCCAAGTCAAGGTTGACGAAAAAGACGCGACCAAGGACGAGCTGCAAGCCGCTGTCGAGGCACTGGACGTCGAATAATCGACCCCAACGCTTGAATTAAATTGGGCCACCCGGGAGGGTGGCCCTTTTTGTTACAGCGCGTCTTTTAAGCCGCGTCTTTTACAGTGCGATGCGGAACAGCGCGAAGCCGTTTTCACCGTCGCCGACATGTTCTATGGCAATCGTC
TGCACCTGATCGACATAGTCGACGGCTTTGGGGCCGGTCTCGAATGTCACGGTCGTATCGGGCAGGGGGGCGAAGCTCCAATTCCCGTCAGCGGTCGGGTTGATGGTGCCTTCCTCGACAATATAGCGCACCAGCGCATCGCGGTTGGTATCGGGCGCAATCAACACCCGCGTGCTGCCATCGGCACCGGGGAAGCTGCCGCCGCCGCCCGCGCGGTAGTTGTTGGTCGCGATGACGAACTTTTGGTCAGCCGTCACGGGGGCGCCGTTATATTGCAGGTTGATGATGCGGTTGGCATCGGGGTTTGCAACTGTCCCGTCGGTTTCGAACATCGCGGGCTGTGTCAGGTCGATCTGATAGGTTACCCCGTCGATCACGTCGAAGTTGTAGCTGGGGAAGGTGGGGTTCAGCAGCGGCGCGTCCTGCGCACCTGCTTCGATCTGGTTGAACATGCCTGCCGAACGTTCCAGCCAGTTTTTCAACTGTGTCCCATCCACCAGAACGGCCTGCACCGTGTTCGGGTAGAGGTAGAGGTCGGCGACGTTCTTGATCGCGATATCGCCTGCGGGCACGTCGGTATAATAGTCCGGCCCGCTGCGCCCGCCGGCCTTGAAGGGCGCGGCGGCGGACAAAAGCGGCAGCGACTCGTACTCGGTACCCTGCAGCAGTTGCGCGACATACCAAAGCTGCGCCTGTGATACGATCTGCACCGAGGGGTCATCCGCGACCAAGGCGAAGTAGCTGTAAAGCGGTGCGCTGGTCTGCCCGACGGCGCGGCGCACATAGGCGAGCGTGGCGTCGTGTTCAGTTTGCGCTGCGGCCTGGACGACGGCGACGTCGCCGACGGTCGGAACAACCTTATTGTCGGCATCGCGGATCCAGATCGGGCGGGCCTCGACGCTGGCACCGGCGATTTTCCATGCGGTGCCGTCGTGTTCCAGCATCAGATCGACCAGCCCCATATGGCTGCCCCAGAACCCCGGCATCACGGCGGGCTTGCCGCTGACAGTCCCTGCGACGGTATCAACGCCCTCGGTCCCCTCGTAATCGGTGCCGGGGAAAACGCGGTGGGCGTGGCCCATCAACACCACATCAATGCCCGGCGTGGCTGCCAGCGGGATTGCAGCGTTTTCTTGGTCTGCGACGGCCTCGGCCTCGCCAATGCCCGAATGGCACAGGGCGATGATGACATCCGCACCTTCGGCCTTCATTTGCGGCACATAGGCCGCGGCGGCTTCGACAATTTCGCGCACGACGACTTTGCCTTCCAGATGCGCACGGTCCCATGCGGCGATCTGCGGCGGGACAAAGCCGATGACACCCAGCTTGATCGGCTTCACCGACCCATCACCCAACGTAATCTCGCGATCGAGTATCACGTAGGGGGCGATCAGGGTCTCGTCTTGCAGCGGGGTTGCGCCCAGCTTGACGGCGACGTTCGCGCAGACCAGCGGGAAGGCCGCGCCTTCCAGCGCATTGGCCAAGAAGGGCAGGCCGTAGTTGAATTCATGGTTGCCCAGCGTTCCGCAGGCATAGTCCAACGTGTTCATAGCGTTGATGACGGGGTGGGTCTGGCCGGGCTCCATCCCGCGTTCATAGGCGATGTAGTCGCCCATCGGGTTGCCCTGCAGCAAATCGCCATTGTCGAACAGGATGCTGTTGGTTGCCTCGGAACGAATCCCTTCGATCAGGCTGGCCGTGCGCGACAGGCCCAGCGTGTCGACGGGGCGGTCGGCGTAATAGTCATAGGGGAAGACGTGGACATGCAGGTCGGTCGTCTCCATGATGCGCAAGTGGGCCTGATTACTGGCCGCCCGCAGCGAATAGGGGTGCAAAGTCGCCATAGCGGCGCCTGACGCGGCCAGAAAATGGCGACGGGAAAGATGAATGGGCATCGAAGCCTCCGAAGTGGTGGGATGTGTCCTAGCTTGGCGCCGGGTGTCACAATCTGATGACAGGATGCGCCGGATCGCAACTTTACCTTGCGGTAAAAATTCAGC
CGGCGGAATTTTTCCCATCAACTGCACAGACTTTGGCCGAAAGGCGCCGCTTTTATGCTGCTTTTATTAACAATTTGGCGGCGCACAGGAAGTTGAACTACCTTTTCCGTGTTCACTAGAGGAGTAGAAACAGTGTTCAACATTCGTACTGCTGTTGTTGCTGCCGTTGCCTGCATGGGCGCCACCGCAGCTTTCGCCCAAGCCCCCGCCATGCCGACGCAAGATGACATGATCGCCGCGTCGAAGAACCAGCTTGGCATTCTGGAATACTGCCAAGGCAAGGGCTTTGTCGAACAGGATGTGGTTGATATTCAAAACCGCCTGATGGCTGCGCTGCCCCCGTCGGAAACCCCCGACGTTGCCGAGGCTGCCTACCAACAAGGCCTGGAAGGCAAGGTTTCGGCCATGGGCACCGAGGTGAGCATCGCCGATGCTGCGACCGCGCAGGGCACCACCGAAGAGGCTTTCTGTGGTCAGATCAGCGATCTGGTCAAGCAACTGGGCGCTTCGCTGCCGGCCCAATAAGGCTGCATCCAGCTTTAAGAGATTTAAAAAAGCCGGGCCCCGCGAGGGGCCCGGCCTTTTTGTGCCGCAAGCGGCGATCTTATTCTTCGTCTTCGTCCACAGCGCCGCCGAGGTCGTCGAACAGCTCGGCGATCTCGAATTCGGCCGCAGCTTCGGCTTCAGCAGCCAATTCGCGGATCGACTTGCCCGAGGCTTGCAGTTCAGCTTCTTCAGCCGAACGTGCGACGTTCACCGAAACGGTGACTTCAACTTCGGGGTGCAGACGCACGGTCACATCGTGAACGCCCAAGTCCTTGATCGGTGCGATCAGGGCAACTTGGTGACGGTCAACGGTGAAGCCGGCAGCAGTTGCAGCTTCCGCGATGTCGCGGCCAGCGACCGAACCGTAAAGAACGCCACCGTCCGAAGCCGAACGGATGACGACAAAGCTTTGGCCGGCCAGCGTTTCGCCAACGGCTTCGGCTTCTTTGCGGGTCTCGAGGTTACGCGCCTCGAGTTGAGCTTTTTGCGATTCGAACTTGGCGATGTTCGCCTCGGTCGCGCGCAGTGCCTTGCCTTGCGGCAGCAGGTAGTTACGGCCGTAACCGTCCTTGACCTTTACCACTTGGCCCATCTGGCCAAGCTTGGCAACGCGTTCCAGCAGGATGACTTCCATCGGGAATGCTCCTTATTTCACTGCGTAGGGCAGCAGCGCCAGGAAGCGCGCGCGCTTGATGGCTTGGGCCAGCTTGCGCTGGTTTTTCGCCGAAACGGCGGTGATGCGGGCGGGAACGATCTTGCCGCGCTCGGAAATGTAGCGCTGCAGAAGACGGGTGTCTTTGTAATCGATAACCGGTGCGCCTTCACCCTCGAACGGGTCGGACTTGCGGCGGCGGAAGAAAGGCTTGGTTGCCATGTGCGTTATTCCTTATGACCTGAGATCAACGACGCTCGCGGCGTTCGCCGCGCTCGGGACGGTCGCCACGCTCGCCGCGGTCACGCTCGTCACGCTTTTGCATCTGAACCGAGGGGCCTTCCTCGTGGGCGTCGACCTTGATGGTCAGAACGCGCATGACGTCGTCATGCAGGCGGGCCAGACGTTCCATTTCCAGCACGGCAGCCGCGGGGGCGTTGGTGCGCAGGAAGGCGTAGTGGCCTTTGCGGTTCTTGTTGATCTTGTAGGCCATCGTCTTGACGCCCCAGTATTCCGACATGACCACGGCGCCGCCGTTGTCAGCCAGAACGGCCGAGAAGTGTTCGATGAGGCCTTCGGCCTGCGCGTTGGACAGATCCTGACGCGCGATCAATACATGCTCATACAGGGGCATGTTTTCTCCATGTGTCATGCGCATTTCAACGGGGCGGCTTACTTTCCGCACCAACCCCACGAGAGACTGCGCCGTTCATATCTATGGCGGAAAGATGGTGCCTTATACAGTTTGCGCCCCGTTAGGCAAGGGGCACAGGCTGTTATTTGGTCGCCAGCTTCATCAGAA
TCAGGCCCGACAGCAGCAGCAGGGCAGCAACAACGCGCAATGGCGTAAGCGCCTCGCCCAGAAAGGTCACGCCGATGACGAAAGCACCCAACGCGCCGATGCCCGTCCAGATCATGTAGGCGGTACCTAACGGCAGCGTCTTCATGGCGATCGACAGCAGCGCAAAGCTGCAAACCATCGTGACAAGGGTGATGCTGGTATAAAGCGGCTTGGTGAAGCCATCCGACAGCTTCATCGTGTAGGCCCAGATGATTTCGAACCCGCCGGCCAAGATCAGATAAACCCAAGGCATTTCAAATACTCTCGCTAAACCGGGTCGTCCTGGCGGGATCATTAAAATGCGGGTCGTCCCACATAGCTTATATTGGCACGCCACTTTGCGTTTGCAAGGCAGTAGTTGAGAAATCCCGCCGCTTTGACTAGGTAGACCCAAACATAACGAGGAGAGAGCCATGCGCGCATTCGTATTTCCCGGGCAGGGGTCGCAAGCCATAGGCATGGGCAAGGCCTTGGCCGACGCCTATCCCGCCGCGCGGGCCGTGTTCGAAGAAGTCGACGCGGCGCTGGGTGAAAACCTGTCGGGTCTGATCTGGGACGGCGACATCGACACGCTGACCCTGACGCGCAACGCGCAGCCTGCGCTGATGGCAACCTCGATCGCGGCGCTGCGCGCGCTAGAGGCCGAAGGCGTCACATTGCAGGCCGCCGGTTTTGTCGCAGGGCACAGCTTGGGTGAATATTCGGCGCTTTGTGCTGCTGGTGCGCTGGGGCTGGCCGACACCGCGCGCCTTCTGCGCACGCGCGGCGACGCGATGCAGGCCGCCGTGCCCGTGGGCGTAGGCGCCATGGCGGCCATTCTGGGGCTGGACTTTGCCACCGTCGCCGCCATCGCCGCCGAGGCGGGGCAGGGCGAAGTCGTGCAAGCGGCCAATGATAACGACCCGTCGCAGGTTGTGGTCTCGGGCCACAAAGCCGCGGTCGAGCGCGCCTGCGAGCTGGCTAAAGAAAAAGGCGCCAAGCGCGCGCTGATGCTGCCCGTCTCGGCGCCGTTCCACTCGGCCCTGATGCAACCCGCCGCCGATGTAATGCGCGACGCGCTGGCTGCCGTGACGATCAACACCCCCGTCGTGCCGGTGGTGGCAAACGTGATGGCCGAAGCCGTCAGCGACCCGGCGACGATTCGCGATCTGCTGGTGCAGCAGGTGACGGGCTCGGTCCGGTGGCGCGAATCCGTACAGTGGATGGCCGCGCAGGGCGTCACCGAGGCCTGGGAAATCGGCGCAGGCAAGGCGCTGTCCGGCATGATCCGCCGCATTGAAAAAACTATCGAGTGCCGCACCATCGGCACCCCTGAAGAAGCGGCCGCCGCTGCCGCAACGCTGTCCTGAGGAGAGAGACATGTTTGACCTAACCGGAAAGACCGCGCTGATCACCGGCGCATCGGGCGGCATCGGGGCTGCAATCGCGCGCACGCTGCACGGCGCGGGCGCAACCGTTGCCCTGTCCGGCACGCGCGAGGCGCCCCTGCAGGCGCTGGCCGAGGAGCTGGGCGAGCGGGCCTTCGTCGTCCCGTGCAACCTGTCCGACATGGAAGCTGTCGAAGCCCTGCCCAAAGCCGCCGCCGCCGCCATGGGCAGCGTCGATATTCTGGTGAACAACGCCGGCATCACGCGCGATAACCTGTTCATGCGCATGTCGGATTCCGAATGGGACGACGTGATCGCTGTCAACCTGACCTCGACCATGCGCCTCAGCCGCGGAGTGATCCGCGGGATGATGAAGGCGCGCTGGGGCCGCATCGTCAATATCTCGTCCATCGTGGGTGCGACGGGGAACCCTGGGCAGGCCAACTACGCCGCGTCCAAGGCCGGTATGGTCGGCATGTCCAAGGCGATCGCGCTGGAAGTCGCAAGCCGTGGCATCACCGTCAACTGCATCGCGCCCGGTTTTATCGCCACCGCGATGACCGACGCCTTGAACGAGGGGCAACAAACCGCCAT
CTTGGGGCAAGTTCCTGCTGGCCGCATGGGCAACCCCGATGAAATCGCCGCCGCGGTGCTGTATTTGGCCAGTAACGAGGCCGCCTATGTCACAGGCACCACGCTGCACGTGAACGGCGGAATGGCGATGATTTAAGGCCGAATTTGGCTTTTTGCGATGTGCTGGCCCATTGCGGAATTGTGAAAGCGTTTGCCTTCGGATAAATTCCTGCTATGAGCCAGCCGTGTTCTGGGCTGTTTGCGTTAGCAGCGGTTCCGGAAAGAAGGGCCCGGTGCCGGGCAGTTGAACCGCCCCTGTTCCGGGGGTATCGTCAAAGCGGGCACAAGCCCATAAAGACATGAGGACATATAGATGAGCGATGTCGCAGAGCGCGTGCGCAAGATCGTTGTAGAGCACTTGGGTGTGGAAGAAGACAAAGTAGTGGAAGCTGCTTCGTTCATCGACGATCTGGGCGCCGACAGCCTTGACACCGTTGAACTGGTGATGGCTTTCGAAGAAGAATTCGGCATCGAGATTCCCGATGACGCCGCTGAAACGATCCAAACGTTTGGCGATGCGGTGAAATTCATCTCCGAAGCTCAATAATCACGGATCATTCCGGATTTTGCGGCGCCTCCCGTTTACGGGTGGCGCCGTTTTCGTTTGCGGTTCCCCGCCGGTGGCGCGGGCCGCGCGCTTGACTTTCAGTCCCCCATCCAGAACGCTGCATCCCATGGAAACCGCACCGAGGAGGGCGCGCGCTGCCACAGATGGGGGAGTTTTCGTGCTAAATGGCGCCCCCATATCGTGAAAACCCGCCGGTATCACGGCACTAGGGTGTTGGCCTTGGCGCCATCGTGCAGTATCAGGGGCGCCAAATCTTCCGAGGAAGGGTAGAATATGCGTCGTGTCGTCGTAACCGGTCTTGGCCTTGTCACCCCGCTTGCCGATGGGGTCGAAGAAACGTGGTCGCGTTTGCTGGATGGCCAATCCGGGGCCGGAACCATCAAGCAGTTTGACGCCAGCCATCTGGCCACCACCTATGCCTGCGAGGTTCCGCTGGGCGACGGCACCGACGGCACCTTTAATGCCGACCGCTATATGGAGCCGAAAGACCAGCGCAAGGTCGATGATTTCATCATCTTCGGTGTTGCCGCCGCCCAACAGGCTGTCGAGGATTCGGGCTGGGTGCCCCAGACCGAGGAAGACCGTTTCCGCACCGGCGTGATGATCGGTTCTGGTATCGGCGGGTTGAAGTCGATTGCTGAAACCGCCGTCCTGATCAAAGAAAAAGGCCCGCGCCGCGTGTCGCCCTTCTTTATTCCGGGCGCGCTGATCAACCTGATCTCGGGTCAGGTCGCGATCAAACACGGATTTAAAGGCCCGAACCACGCCGTTGTCACCGCTTGCGCCACCGGCGCGCATGCCATCGGCGATGCGGCCCGCCTGATCAAATACGGCGATGCCGATGTGATGATCGCGGGCGGGGCCGAGGCCTCGATCTGCGAAATCGGCATTGCAGGCTTCAACGCGTGCAAGGCGCTGTCGACCAAAGCGGGCGATAATCCCAAGGCGGCATCGCGCCCCTATGACGCCGACCGCGACGGTTTCGTGATGGGCGAGGGCGCAGGCGTCGTCGTGCTGGAGGAGCTGGAACACGCCCTTGCGCGCGGCGCGAAAATCTATGCCGAAGTGCTGGGTTACGGCCTGTCGGGCGATGCCTACCACATCACCGCCCCGTCCGAAGACGGCGAAGGCGGCGCGCGCGCGATGGCCATGGCCATGCGCGATGCAGGCGTCACGGCTGCCGACATCGACTATGTGAACGCGCACGGCACATCGACCATGGCCGACACCATCGAACTGGGCGCGGTCGAGCGTCTGCTGGGCGATGCCGCGGCGAACGTGACGATGTCCTCGACCAAATCGGCCACCGGCCACCTGCTGGGCGCGGCTGGCGCGATCGAGGCGGTGTTCTCGATCCTCGCGATCCGCGACCAAGTGGCGCCGCCGACGATCA
ACCTCGACAACCCTGCCGTCGAGACCAAGATCGATCTGGCGCCGAACAAAAAGGTCGCGCGTAAAATCGACGTGGCGCTGTCGAACTCGTTCGGGTTCGGGGGCACGAACGCTAGCCTGATCGTGGGAAAATATAAGTAATGTGGAAGCATCTGGCGTCGAATTTCCTGACCTTTCTGGTCGTTGCGCTGTTTCTGGTGGCTGGTGTCATCACATGGGGCGTTCGGGAATATTCGGCGCCGGGCCCACTGTCCCAAGCGATCTGCCTGCGTGTCCCCGCCGGTGGCACGTTCGGGCGCACGGCGGATGATCTGCGGGCACAAGGCGCGATTTCCTCGCGCGAGGTGTTCTTGATCATGGCCGACTACCGGCAAAAGCGCACCCAACTGAAACAAGGCGCGTTCCTGATCGAGCCGGGCGCGACGATGGAAAGCATCACCGACACCATCACCCGCGGCGGCGCATCCACCTGCGGCGCACAGGTCGTCTATGTCGTCGGCGTCAACGATTTCAGTGCGCGCATCCGTCAGCTTGACCCCGACACCGGCCGCTATGGCGAAGTCGCACGTTTCGACCCCACCGCCGAGGGCGAGGCACCGCCCGAATACGCGACCGCGCTGGCCGAGTCGGACGTGCAGCTGGCCGTCCAAGTGGTCGAGGGTACGACCGTCTGGCAAGTCATCACCAGCCTGAACGCCATCGACACGCTGAACGGCGATGCCGCCATGCCGCCCGAAGGCATGCTGGCCCCCGATTCCTACGAATTCCGCCGTGGTACCGATACGCAGGCTTTGGTTCAGCAGATGCAGGACCGCCAGCAAAGCATCCTCGATGCCGCTTGGGCCGCGCGCGACGACAACCTTCCCGTCTCGACACCGGAACAGGCGCTGATCCTCGCCTCGATCATCGAGAAAGAAACCGGCGTCCCCGAAGAACGCCGCCAAGTCGCCAGCGTTTTCGTGAACCGCCTGCGGCAGGGGATGCGCCTGCAAACCGACCCGACCGTGATCTACGGCGTGACCGACGGGCGCGGCAATCTGGGGCGCGGCCTGCGCCGCAGCGAACTGGACGGGCCGACACCGTGGAACACTTACGTCATCACCGGCCTGCCGCCGACGCCGATCGCCAACCCCGGGCGCGCCAGCATCGAAGCCGCGCTGAACCCCGACGATACGCCCTACATCTTCTTCGTGGCGGACGGGTCGGGCGGGCATGCCTTTGCCGTCACGCTGGATGATCACAACCGCAACGTGGCGCGGTGGCGCGCGCTGGGGAACTGATCTGTAAATCATTGATAATTTGCTCTTAATGCGCCGCACGCGAACCCCGCTTGCGGCGCATTTCGTTGCCACCGCACGGTGCAACGGCTAGGTTTGGCACACTGGCAGAACTGTCGAAAGGCTCGTGGGGCGAAGGCCCCTGCGGGCCTTTTTGCGTTCTGCCCGCCCAACTGAACCCGAGGAGGAACACATGTCGCAGGTTTTAACCCAAACAGGCGATGACGAGGCCGCACGCCTGCAACGCGATCTGCGCGACGCGATGGTGCTGGTGCGGTCCGTGCGCACGTCGCTGGCCGACATGCTGGCCGAGCTCAGCGCCGGAAACCCGGGGCCATTGCGCGAAATCGCCCCCAAGCACAGCGAGTTGGAAAGCGCCCTGCGCCGCGCCTTTGAGACCGAGCAGAAGTTCAACGACTGGACCGCAAAATTCACCGGAGGCCAAGATGGCGAAACCCTCGATTACGACGCAATCCGTGATGAAATCAGCTGCCGGTTGGCTCGCCTCGGCCCCTGCTGCGACGCGGGCTGATTTCGTCGCAGCCTTATCCGAGGCCGAGGTCGCCGCGCTGCCGTGGTTGTTCGACTTTTGGGCGCTGCCGCACCAACTGCCGCCTGCGGGCGACTGGCGCACATGGGTGGTTATGGGCGGGCGCGGCGCGGGCAAGACGCGGGCAGGGGCCGAATGGGTGCGCGCCATGGTCGAGGGGGCGCGCCCCGCGTC
CCCCGGCCGCGCGCGGCGCGTGGCGCTGGTGGCGCAAACCATCGATCAGGCGCGCGAGGTGATGGTCTTTGGCGACAGCGGCATCATGGCCTGCTGCCCCCCCGACCGGCGCCCGTCGTGGCTGGCGGGGCGCGGCGTGTTGCGCTGGCCGAACGGCGCCGAGGCAACCATCTTTTCCGCCCACGACCCCGAGGCCCTGCGAGGCCCGCAATTCGACGCGATCTGGGCGGACGAGGTCGCCAAATGGCGGCTGGCGCAAGAGGCATGGGACATGCTGATGATGGGCCTGCGGCTGGGCGACAGCCCGCGCGCCTGCGTGACGACCACACCGCGCGGTGGGGCGTTCCTGCGCGGGTTGCTGGCACAGGATAGCACCGTGATGACCCACGCCCCCACACGCGCGAACCGCGCCAACCTCGCCCCCGGTTTCGTCGAGGCCGTCGAGGAGCGCTACGCCGGCACGCATCTGGGCCGGCAAGAGATCGAGGGCCTGCTGGTCGAGGAAGCCGAAGGCTCGCTTTGGCCAGACCGGTTGATCCAGCTGGCCCGCACGCAAATCGCCCCTGCGCTGGACCGGATCGTAGTAGCAGTTGACCCGCCAGTGACCGGCCACGCGGGGTCGGACGCCTGCGGGATTATCGTTGCGGGCGTGCAGCGCCGCGCCGACGGGCCGCCGCACTTTTGGGTCATCGAGGACGCAACCGTTCAAGGCGCGTCGCCGAACACATGGGCCAAAGCCGCAATAGCCGCCTTTCACCGCCACGGCGCCGACCGGCTGGTCGCCGAGGTAAACCAAGGCGGCGCGCTGGTGGAAAGCGTGCTGCGGCAGGTTGACCCGAACATCCCGTATCGCGCTGTGCGGGCCACCACGGGCAAGGCGGCCCGCGCCGAACCTGTCTCGGCGCTGTACGAACAGGGCCGCGCCAGCCACGTCGCGGGCCTTGATCTGCTGGAAGCGCAAATGGCGCTGATGACGCTGCAGGGCTTCAAGGGGCGCGGATCACCCGACCGCGTCGACGCGCTGGTCTGGGCCGCGCACGAGCTAATTTTGGGACCGGTGGCGCAGCCCAAAATCCGCTCCCTGTTCTGATGCAACAGGCAACGCGCGGTGCTGCTGCCGCGCGTTAATCATTTGTTGGGAAAACGGCCTCGTTCGCAAAGACAGCAGCGGGGAAGCAGATGTTCGGGTTTGGCGAAAAGAAACAGCCGGTGTCGGTGCCCGAGGTGAAAGCCTCGGCCGCGGGCAGGGTGATTGCGTTCGGGGCGGCGGGCCGCACGGCCTTTGCCCCGCGCGAAGGGTCTGGTCTGGTGCGGGCGGGCTTTGGGGCGAACCCCATTGGCTTTCGCGCCGTCCGCCTGATCGCCGAGGCCGCCGCCGCGCTGCCCTTGATCTTGCAAGATCAGACCCGCCGCTACGACACGCACCCCGTGCTGGATCTGCTGGCCCGCCCCAACAGCGCGCAGGGCCAGCTCGAGTTGCTCGAGGCTGCCTATGCCCAGATCCTGCTGACCGGAAACGCTTATTTCGAGGCTGTCGCCCCCGAGGGCCTGCCGGTTGAGCTGCACGTGCTGCGATCCGACCGCATGTCGGTGGTGCCTGGCACTGATGGCTGGCCGGTCGCCTACGACTATGCCGTGGGCGGGCGCAAGCACCGCTTTGCCGTGGGCGAGGGCGCATCGCCCATCTGCCACATCCGCAGCTTTCACCCGCACGATGACCACTACGGCCTGTCGCCCCTGTCGCCCGCCGCTGCCGCGATCGAGGTGCACAATTCCGCCTCGCGCTGGTCGCGGGGCTTGCTGGAAAACGCTGCGCGGCCCTCGGGCGCGATCGTCTATCGCGGGGCCGACGGCAATGCCACGCTGTCGCCCGACCAGTTCGACCGGCTGGTGGCTGAAATGGAAAGCCAGCATCAGGGCGCGCGCAACGCGGGCCGACCCATGTTGCTGGAAGGCGGCCTCGACTGGAAACCGATGGGTTTTTCGCCCTCCGACATGGA
ATTTCTGCAAACGAAAGAGGCCGCCGCCCGCGAAATTGCCATCGCCTTTGGCGTGCCGCCCATGCTGCTGGGTATTCCGGGCGACGCCACCTACGCCAATTACCAAGAGGCAAACCGTGCCTTCTACCGCCTGACCGTACTGCCGCTGGCCGCGCGTGTCACCGGTGCGCTGGCCAATTGGCTCGAGGATTTCACCGGCGACTGGCTGGACCTGCGCCCCGACCCCGACCAGATCGCGGCCCTTTCGGTCGAGCGCGACGCCCTGTGGGCGCGTGTTGGCGGGGCGTCGTTCCTATCGGATGCCGAAAAGCGCGTTCTGTTGGGCCTGCCTGCGCTGGATGGCAGCGATGGCACCGCCTGAGCGCCCGTTCCTATGCGCCCCCGGCCTACGGATCGAGGCGCAAGAGCGGCTAGTCGCGCTGCAATTTCAACAACTTCAGCAACAGCTGGAACGCTTGGAAGCCCTGATCGAGCGGCTGGAAAAGCGGCTGTGGCTGACCGTCTACGGCGTTCTGGGCGCGATTTTGGCGCAGGCGTTCCAATCGTTCCTGCAAGTCGCGCCATAAAAGGGGGAAAAGGTGGATCTTGAATTCAAATATGCCGCGCTGACACCGCACAACACCGGTGACGGGTTGAAGGTGTCGGGCTACGCCTCGCTGTTCGGCGTGCGCGATCAGGGCGGCGATGTGCTGCAGCCGGGGGCTTTCGCCGCCTCGCTGGCGGCGCTGAAGACCCAAGGTAACAAGGTGCGTATGCTGTGGCAGCACGACCCGAACACGCCCATCGGCGTCTGGGACGACGTGTCTGAGGATGCCACCGGCCTGCACGTTTCAGGCCGCCTGCTGCCCGACGTGGCCAAAGCGCGCGAGGTTGCCGCGCTGCTGGCCGCCGGTGCCATCGACGGGCTGTCGATTGGCTACCGCACGCTGCGCGCCACCAAGGCGGCGGACGGCAGCCGCCTGCTGCACGAGGTCGCCCTGTGGGAAGTCTCGTTGGTCACATTCCCCATGCTGCAGCAGGCCCGCGTGACGCAAAAGGCCGACGACAGTCTGCTGTCCGCCCTGCGCCAAGCCCGCGCCACCCTTGCCAATTTCAACTGAGAGGAGCCCCGGATGGATCCGGTCCAATTATCGCAGATCACCAGTGAAATGGAAAACTTCCTTGGTGAATTCAGTGGCTTTGCAGCCGAAGTGAAACAACGACTTGAACAACAGGAAACCCGCATGACCCGTCTTGACCGCAAAGCCGCCGCCCACCGTCCCGTGCTGTCCCGCGCCGCCGATCTGGACGCCCCGCACCAAAAGGCGTTCGACGCCTACCTGCGCTCGGGCGACGACGATGGCCTGCGCAATCTGGAACTGGAAGGCAAGGCGATGAACACCGCCGTCGCGGCGGATGGCGGCTACCTCGTCTCACCCGAAACGGCGCTGACCATCCAGAATGTGCTGGGCGCTACCGCCTCCATCCGCGCCATCTCCAGCGTGGTGAATGTCGATGCCGCCAGCTACGACGTGCTGGTCGACCGCACCGAACCGGGCGCGGGCTGGGCGTCTGAAACTGGCACGGTCGCTGAATCCGGCACCCCCGTCATCGAGCGCGTCTCGATCCCGCTGCACGAATTGGCCGCGTTGCCCAAGGTCTCGCAGCGCTTGCTGGATGATAGCGCATTCGATCTGGAAGACTGGCTGGCCAACCGCATCGCGCAGAAATTCGCGCGGGCCGAAGCCGCCGCCTTCATCAACGGCGATGGCGTCGACAAGCCCAAGGGCTTCCTGACCGGCACCAAAGTCGCGAACACCGCGTGGGCATGGGGCAACCTCGGCTACATTGCAACCGGCGCGACCGATACGCTGCCTGCGGATTCCATCGTCGATCTGGTCTATGCCTTGGGCGCCGAATACCGCGCGGGCGCCAGCTTCGTGATGAATTCGAAAACCACCGGCGTGCTGCGCAAACTGAAGGATGCGGACGGCCGCTTCCTGTGGTCGGACGGCCTTGCA
GCGGGTGAACCCGCGCGCCTGATGGGCTACCCCGTCCTGATCGCCGAAGACATGCCCGACATCGCCGCCAATGCCTATGCGGTCGCCTTTGGTAACTTCCAGTCCGGCTACACCATCGCCGAACGCGCCGACCTGCGCGTCTTGCGCGACCCGTTCTCGGCCAAGCCGCACGTGTTGTTCTACGCGACCAAACGCGTCGGCGGCGCGGTCACCGATTTCGCCGCGATCAAGCTGCTGAAATTCGCCGCGTCCTAAGGCGCGATGCGATCCGGCGGGGCGTGTCCCCCGCCACTTTCCCAGATTTTGACAAAAGGGCAGCGCCGCGATGATGCTGGTTGAGGAAACGACGGTGGCAGATGCCGCCCTGCCGGTAGCGGCGCTGGGCGACTTTCTGCGCCTGGGTACGGGTTTTGACACCGATAACATGCAAGAGGGCCTGCTGCGTGCCTTCCTGCGCGCGGCCCTGGCCGCAGTCGAAGGGCGCATCGGCAAAATCCTGATCGCCCGCACCTTCCGCGAGGATCTGGCCCCGCCTGCCGCGCTGTCGGCCCTGCCGCTGCGCGAGGTCATCGCGGTCACCGCCAATGGCGCACCCGTCGATTGGCAGGTGCAGCAGGGCCTGCGCCCGCGCGTCACACTGCGCGGCTGGCAGACGGACCAGCAGCTGAGCGTGCGCTATATCGCAGGCATGGCCGCCGATTGGGAAACCTTGCCCGCCGACATCCAACAAGCCGTGCTGATGCTGGCCGCGCATTACTACGAATACCGCGAAGACCCCGACCTCGACGGCGCGTGCATGCCCTTCGGCGTCTCGGCCCTGACCGAGCGCTACCGTACGGTGCGCCTTGGCTTTGGGGGCGCACAATGACCCGCTTGCGCCTGAACCGCGCCCTCATCCTGCAACGCCCCGAGCGCGACAGCGACGGGGCAGGCGGCTACACCGAAGGGTGGCAGACGCTGGGCACCTTGTGGGCCGCGATCACCCCCGCAACAGGGCGCGAGGCTGCCGCCTTCGGTGCCGCCTTGGCCCGCGTGCCCGTGCGCATCACCCTGCGCGCCGCCTCCATAGGCGACCCGCGCCGCCCGGTCGCAGGCCACCGCTTCCGCGAAGGGGCGCGCAGCTACCTGATCTTGGCGGTGCAAGACAGCCCCCAGCGCCGCCTGACCTGCATCGCCGAAGAGGAACTGGTCCGATGAGCTATACCCTCAGCCTTGCGCTGCAACAGGCCCTTTTCACCCGCCTGACCACCACCGACGGCCTGTCGCTGCCCGTGCACGACGCGCTGCCCAGCGGCACCGTGCCCCCCCTTTACATCGCCATCGGCCCCGAGGACGTCGAAAGCCTCGCCACCCCCGATGGGCCGATCACCCTGCATCAGGTCAAAATCAGCGTCATCGCCACCGGCGGCGGCTTCGGCACCGCCAAGGGCATCGCCGCGCAAATCATCACCGCCCTGACCGCGCCGTTGGAATTGCCCGCAGGCCACGCCAGCGCCCCGCTGTTTCAAGCCGCCACCGCCAAATCCACCACCGGTGCCGACCGCCGCATCGACCTGACCTTCCGCATCCGCCTCGAACCTTGAAAGGGACTGAAATGACCGCACAAAACGGCAAGGATCTGTTGATCAAAATCGACATGACGGGCGATGGCCTGTTTGAAACCGTGGCCGGCCTGCGCGCCTCGCGCATCAGCTTCAACGCCGAAACGGTCGATGTCACCAGCCTCGAAAGCCAAGGCGGCTGGCGCGAGCTGCTGGCTGGCGCGGGCGTCCGCTCGGCTGCCGTCTCGGGCTCGGGCGTGTTCCGCGACGCTGACACCGACGAACGCATGCGCGCGCTGTTCTTCGCGGGCGATGTGCCGACCTTCCGCATCATTATCCCGCACTTCGGTGCCATCAGCGGCCGCTTCCAGATCACCGCGCTGGAATACGCGGGCAGCTACAACGGCGAGGCCACCTACGAAGTCTCGCTCGCCTCGGCAGGGTCCCTCAGCTTCGAGG
CCGAAGTATGAACCCCGCAAACCCCCACGCGGGCGAGGTGATCATCCCCATCGACGGCGTGCCCCATGTGGGCCGCCTGACGCTGGGCGCGCTGGCCATGCTGGAAGCCGAGCTGAACACAGGCACCCTGACCGACCTTGTCGCGCGGTTCGAAGGCGGGGCCGTCAAATCCGCCGATGTCATGGCGCTGGTCGTCGCAGGCCTGCGCGGCGGCGGCTGGACGGGCACCGCCGCAACCTTGCTGGCCGCCGACATCGCGGGCGGCCCGCTAGGCGCGGCGCGTATCGCAGGCCAACTGCTGGCGCGCGCGTTTTCCACATCCGACGGGGCACAGTGATGGACTGGCCGGGCCTTCTGCGGCTGGGCTTGCAGCGCCTGCACCTGCGCCCGGCCGAATTCTGGGCGCTGACCCCGATCGAACTGATGCTGATGCTCGGCCTTGCGGGCGCAGCCGCACCCATGGCGCGCGCCCGCCTGGCCGAACTCGCCCGCGCCTATCCCGACACCCGCCCCCCACACGAGGCCCCCGATGAGTGACCTGACACCCACCACCGCCGAGCTGCAGCAACTGCACAGCATGACCCAGGCCCTGAACGCGGGCCTGCGCGACATGCGCGGCACCATGGCAAACACCAACCGCGAGGTCGCAGGCCTAGAACGCGGCCTGTCCAGCGGGCTGCGCAAGGCGTTCGACGGGCTGATCTTCGATGGTGACCGGCTGGGCTCGGTGCTGGGCACCATCAGCACCAGCATCCAAAACGCCGTCTACAGCGCAGCGGTCAAACCCGTGACCAACCACCTGACCGATTACCTGATCAACGGCCTGCATGGCGCAACCCCCTTCGCCAACGGTGGCGCTTTCACTCAAGGGCGCGTCATGCCGTTCGCCAAAGGCGGCGTCGTCACCGCGCCCACAACCTTCCCCATGCGCGGCGGCACCGGCCTGATGGGCGAGGCTGGCCCCGAAGCCATCATGCCCCTGACACGCGGTGCCGACGGCCGCCTTGGCGTCGCCACACAGGGCGGGGCAGGCGTCAACCTGACGATGAACATCCAAACCCCCGACGCAACCGCGTTCCAGCGCTCGCAAAGCCAGATCGGCGCGCAGATCTCGCGCCTCGTCGCGCGCGGCCAACGCAACCGATAGGACACTCCATGGCCTTCCACGACATCCGCTTCCCCGCCGCCATCAGCTTTGAATCGCTGGGCGGCCCCGTGCGCCGCACCGAAATCGTCACGCTCGCGAATGGGTACGAGGAACGCAACACCGCCTGGGCCCACTCGCGCCGCCGCTATGACGCGGGCGTCGGGCTGCGCTCGCTCGACGACGTCGCCGCCCTGATGGCCTTCTTCGAGGCGCGCGGCGGCCAACTGCACGCCTTCCGCTGGAAAGACTGGTCGGATTTCAAATCCTGCCTTCCCTCCGAAACAACCGGCCCGACCGATCAAACCCTTGGCTACGGCGATGGCACCACAACCACGTGGCCGCTGATCAAACGCTATGTCTCAGGCGATTTCGCATACGCGCGGCCCATCACCAAACCCGTGGCCCATACCGTCACGGTCGCGGTGGCCGCCCAACCGCTGGACGCAGGGCACGACTACACCCTGAACCCCGATACCGGCACCATCACCTTCACCGCCGCCCCCGCCATCGGGGCCGAAATCACCGCAGGCTACGAATTCGACGTGCCCGTCCGATTTGAGAGCGACGCGATCCAAATGTCGGTCTCGTCCTTCCGCGCGGGCCAAATCCCCTCCGTCCCGCTGATCGAGGTGCGCCTATGACTGACCTGTCCACCACCCGCTGCACCGCGTGGGCCATCACCCGCGCCGATGGAACCACGCTGGGCTTCACCGACCACGACGCCGACCTGACCTTCGCGGGGCTCACCTTCCGCGCCGCCAGCGGCATGACGGCCAGCACGCTGGCGCAGGGCAGCGGCCTCTCGGTCGATAACGCCGAAGGCTTCGGCACCCTGACCGCCGACGCCATGCGC
GAGGCCGACATTCGCGCAGGCCATTTCGACGGGGCCGACGTGAAAATCTGGCAGGTCAACTGGCAGGCCCCCGCCGTCCGCCAGCAAATCTTCCACGGCACCTTGGGCGAAATCACGCTGGAAGGCGGCGCATGGCGTGCTGAACTGCGCGGCGCGGCCGAGGCTCTCTCGCGGCCGCTCGGCCGCAGCTATCAACGCGGCTGCGCGGCGGTGCTGGGCGATGCGGCCTGCGGGTTTGACCTCTCAACCCCCGGCTTCACCGCCGATACAACCCTGCGCAGCGCCACCGAAACGCGCCTCACCCTCCCCGCGATCGACGCCGCCCCGCGCTGGTTCGAACGGGGGCAGGTGCAAATCCTCTCGGGCGCCGCCGCAGGGCTGACGGCCACCATCAAAACCGACGAGGCCGCAGGCGACACCCGCATCCTCACCCTCTGGTCGCCGCTGGCCATCACCCCCGAGCCGGACGCCCAAATCCGCCTGCTGCCGGGGTGCGACAAACGCATGGCCACCTGCCGCGCTAAATTCGGCAACCTGCCAAACTACCGCGGCTTCCCCCACATCCCCGGCGAGGATTGGCTGCGTGCCATCCCCAAATCCGGCGCATCCGGTGAAAGCCTGTTCCAATGACAAACCCGATCCCCGCTGCCCGCCGCTGGCTCGGCACCCCCTTCGTCCCGCGCGCCAGCTGCCGCGGGGCGGGGGCCGATTGCTTGGGGCTGATCCGCGGCCTCTGGCGCGACCTACACGGGGCTGAGCCATGGCCCATCCCCGCCTATGGCCCCGATTGGCCTCGCGCGCTGGGCGATAATGCGCTGCAAATAGCCCTGCAAAAACACCTGCCTAGCCTCGCCGCGCCGCGCACGGGGGCGGTGCTGCTGTTTCGGCTGCGGGCAGGGCAGACGCCTGCCCACCTCGGCCTGTGCACCGGTACGCATTTCATCCACGCCCACCACACCAGCGGCACTATCGAAAGCCCGCTGTCGACCCCGTGGCGGCACCGTATCGCCGGTGCCTTCGCCCTGATCCCAAAACCCCAGCAGGAGGCCTGACCATGGCAACTCTCGTTCTTTCGGCCGTCGGGGCCGCCGCAGGCAGCACAATCGGCGGCGGCGCGCTGGGCCTGTCGTCCATGGTCATCGGTCGCGCCGTCGGTGCCGTCGCAGGCCGCATGATCGATCAACGCCTGCTGGGCGGCAGCGCCGATCCCGTCGAAACCGGCCGCGTCGACCGCCTGCGCATTACCGGCGCATCCGAAGGCGCGGCCATGGCCCGCATCTACGGCCGCATGCGCGTCGCGGGGCAGGTGATCTGGGCCACCAATTTCATGGAAACCAGCCAAACCACCCGCGCAGGCAAAGGCCAACCGGGCACCACTGCCTACAGCTACACCATCAGCCTCGCCATCGCGCTGTGCGAAGGCCCGATCAACGGCATCGGCCGCATCTGGGCCGACGGGACCGAGATCGCCCCCTCCAGCATGACCCTGCGCCTCTACCACGGCGACGCGACCCAACAGCCCGACCCCCGCATCAGCGCGGTTGAGGGGCCAGACAACGCGCCCGCCTATCGTGGCACCGCCTATGTGTTGATCGAGGACCTCGACCTCGCCCCCTTCGGCAACCGCGTCCCCCAATTCAACTTCGAGGTGATCCGCAACGACACCACCCGCGACGATAGCTGGGCCGGCGTCGTCCAATCCGTCGCGCTTATCCCCGGCACGGGCGAATACGCGCTGGCCACCGAACCGGTCGCGCTGCACTATTCCTATGCACATCAGGAAACCGTCAACGAAAACAGCCCCAGCGGCAAAAGCGACCTGATGACGTCGCTCGACCAAATGAACACCGAACTGCCGCGCGTCAAATCCGTCTCGCTGGTGGTGTCGTGGTTCGGCGACGACCTGCGGGCAGGGCACTGCACCGTGCAGCCCAAAGTCGAACAAACCCCCTTTGACGCCCCGACCCAGCCGTGGCGGGCAGGCGGCATCACACGCGCGC
AGGCCGCCACGGTGCCGCGCCTGAACGACGCGCCGGTCTATGGCGGCACGCCGGGTGATACATCGGTGATCCAATCCATCCGCGCCATCCGCGCCCGCGGGCAAGAGGTGATGTTCTACCCCTTCATCCTGATGGACCAACAGGCCGACAACACCCTGCCGAACCCGTGGACGGGGACGGAAGGCCAGCCCGCCATGCCGTGGCGCGGGCGCATTACCACAGCGCTCGCCCCCGGCCTGCCCAACAGCCCCCACGGCACCGCCGCCGCCGATGCGCAGGTCGCAGCCTTTTTCGGAACCGCCGCCCCCAGCGACTTCCGCTGGGATGGCACGCGCCTGCACTATACCGGCCCCGCCGAATGGTCGCTGCGCCGCTTCATCCTGCACTACGCCCACCTGTGCACCGCTGCGGGCGGCGTCGATAGCTTCTGCATCGGATCGGAAATGGTCGCGCTGACCCAAATCCGCGGCGCCACCGGCTTTCCGGCGGTCGATGCCCTGTGCCAACTGGCCACCGACGTCCGCGCCATCCTCGGCCCCGACACCAAAATCAGCTATGCCGCGGATTGGAGCGAGTACCACGGCACCCAACCCGCAGGCACCAGCGACAAATACTTCCACCTCGACCCCCTCTGGGCGCACGAGGCGATCGACTTCATCGGCATCGACAACTACATGCCGCTGTCTGACTGGCGCGACGGCACGGCCCATGCCGACGCCGCCGCAGGCGCGATCTACAACCTCGACTACCTGCGCAGCAACGTCGCCGGCGGCGAGATGTACGACTGGTTCTATGCCAGTGATGAATCGCGCGATGCACAAATTCGCACGCCTATAACAGATGGTTACGGGCAGGAATGGATGTGGCGCATGAAGGATATCCTCGGCTGGTGGTCAAACGCCCACTTCAACCGCGTTGACGGCGATGTCGGCGCGGCCAGCCCGTGGCAGCCAAGATCAAAACCCATCCGCTTCACCGAAATCGGCTGCGCGGCTATCGACAAAGGCACCAACCAACCCAACAAATTCCTCGATCCGAAATCATCGGAATCCGCGCTGCCTTACTATTCCAACGGCCTGCGGGATGACTTCATCCAGCTGCAATACCTGCGCGCGCTGACCCAACACTATGCTGACCCCGCCAACAACCCACCATCCGACCTTTATGACGGTCCGATGATCGAGATGGACTACGCCCACGTCTGGGCCTGGGACGCGCGCCCGTTCCCGTGGTTCCCCGCACGCCAAAACCTGTGGTCTGACGGCACAAATTACGACCGTGGCCACTGGCTGAACGGCCGTGCGGGCGGGCGCGCGTTGCAGTCGCTGGTGGGCGAAATCTGCACAGGCGCGCAAATGGGCCCCGTCGACACCAGCGCCCTGTGGGGAACAGTCCACGGCTACGCGCTGGATCAGGTCACTACGGGCCGCGCCGCGCTGCAACCGCTCATGCTCTCGCACGGGTTCGATGCAGTGTCGAAAGACGGCACCTTCACCTTCCAAACCCGCCACGGCCGCCCCGTGCTGACCGTGTCGCCAGATGAGCTGGTGCAGACCGATCCCGACACTGCGGCCTTGATCCTGACCCGCCAACCCGAGGCCGAAATGGCGGGCCAAGTCCGCGTGTCCTTCATCGCCGCCGAGGGTGACTTCGCCACCGGCGCCGCCGACGCCGTCTTGCCCGACGCCCGCGCCGATACCGTGGCGCAAAGCGAGCTGCCCCTGCTGATGACCCGCGCCGACGCCAAAGCCGCGGCCGAGCGCTGGCTAGCGGAATCGCGCCTTGCCCGCGAAACGGCAACCCTCGCGCTGCCCCCGTCACGCGGCTGGCTGCAGGTCGGCGACGTGCTGCGCGTGGCCGATATGGATCTGCGCATCGACCAAATGGAACGCGGCCACCATCTGGCCGTCACCGCCACCCGCGTTTCGGACACGCTTTATCAGCGGCACGACGTGCTGGCCGACATCGAACAACCCGCCGCCTACGCGCCGCCCATGCCC
GTCGCAGCGACATTCCTCGACCTGCCATCCGAGGACGGCGTGGCCGCCCATATCGCGCTGACATCCGCGACCTGGCCCGGCGAGGTCGTCGTCCAAAGCGCCGCCGCTGGCGCAACCCCCGCCACGGTCGCCCGCGTCGGCAGCCCCAGCGTCGTTGGCGAAACGCTGACCCCGCTGCCCGCCGCCCGTGCAGGGGTGCTCGACCGTGGCCCTGCGCTGCGCGTCAAACTCATCTCGGGCGACCTTGGCGGGGCCAGCCTTGCCGCGCTGCTGGATGGCGCGAACCTTGCCGCGCTGGGCGATGGCGAAAGCGATGTGTGGGAAGTGTTCCAGTTCGCCGGGGCCGAGCTGGTCGGCCCGCAGGAATACGCCCTGACTCTGCGCCTGCGCGGCCAAGGCGGCACCGATGGCATCATGCCCCCCGTCTGGCCGGTAGGCACGCGGTTCATCTTGCTGGATCAACACGTTGCATCGCTGCCCGCGCCCCCGCGCGGTGTCGCGCGCGACTGGCTGTGGGGCCCCGCACAGCGCCCGACGACCGACCGCACATGGCGCAGCGCAACACGCGCCTTTGCGGGCATCGCCTTGCGCCCCTATGCGCCGTGCCACCTGCGCAGCACGCGCAGCGGCGGTGATGTGACGCTGGGCTGGCAGCGCCGTACGCGCAGCGGCGGCGACAGTTGGGACGGCCTCGACGTGCCGCTGGCCGAGGATGCCGAAGCCTACCGCCTGCGTCTGTTGCAAGGCGGCAGCGTGCTGCGCGATGTGGTCGTGGGCAGCCCGGCCTTCACCTATACCGCCGCGATGCGGACCGCCGATGCCGCATCCGGCCCCATTACGGTCGAGGTTGCGCAACTGTCGCAGGCCTATGGTGCCGGCCCCGCGCTGGTCGTGGATCTGGCGCTGTGA
Protein sequences of DBSCAN-SWA_2 >NZ_CP019937|877201:912034|880360_881167_-|WP_085785823.1|DBSCAN-SWA MGFLRRHARDMGMIALVLGLFSAVNLLPPDTALQQVQKQGVLRVCTPQSYPPFVTGTAAAPGIEIELLQSVAARLGVRAAFNTNAAIGRDMNPAAWKLNRASCSIIIGGLVDSTTMRTFLDMSAPYLQSGWVSLVRDPAQTVEGARVGVFVGSLGRDRLELSRALRENGAATIVGQPQLAGAIAALRQGQLDMVVTDAILAAGLGDQPDLTVSLVPNVRRDSIAMGFWKGDLTLKRAISAALRDMQDSGEIDAIAERYGYAPLPEDAT >NZ_CP019937|877201:912034|887645_888977_+|WP_085787249.1|DBSCAN-SWA MQVTETLNEGLKRGYAITVTAAELDAKVREKLEEAAPEIEMKGFRKGKVPFALLKKQFGPRLLGEAMQEAVDGAINSHLEASGDRPAMQPEVKMTNENWEEGQDVEVSMTYEKLPQVPEVDLSGVEITRLTVKADEASIEEALTNLAETSKQFEDRRKGSKAKDGDQVVIDFVGKVDGEAFEGGSAEDYPLVLGSNSFIPGFEEGLVGVKVDDVKDVEVTFPENYGAAHLAGKAAVFTCTVKAVKEPKAAELNDELAKQFGAEDLAGLKTQIATSLEGEFAGAARAVAKRALLDKLDTLVSFELPESLVEAEAGQIAHQLYHEEHPDDHNHSHGAIETTDEHRKLAVRRVKLGLLLAELGQKNEIKVTDSELTQAILNQARQYRGQERQFFEFVQQNAAARQQIQAPIFEDKVVDFIFSQVKVDEKDATKDELQAAVEALDVE >NZ_CP019937|877201:912034|906280_906913_+|WP_085785852.1|DBSCAN-SWA MAFHDIRFPAAISFESLGGPVRRTEIVTLANGYEERNTAWAHSRRRYDAGVGLRSLDDVAALMAFFEARGGQLHAFRWKDWSDFKSCLPSETTGPTDQTLGYGDGTTTTWPLIKRYVSGDFAYARPITKPVAHTVTVAVAAQPLDAGHDYTLNPDTGTITFTAAPAIGAEITAGYEFDVPVRFESDAIQMSVSSFRAGQIPSVPLIEVRL >NZ_CP019937|877201:912034|887258_887438_+|WP_085785830.1|DBSCAN-SWA MADLNAMLAPGMIVRHPDHPEWGDGQVQSNIGGKITVNFRETGKQVIDGARIFLVLVSF >NZ_CP019937|877201:912034|883260_885345_+|WP_085785826.1|DBSCAN-SWA 
MKSSFLLRASAAIVPMVVAAGAAQAADGFFTAEQVANGKELYDARCSVCHAQSIRDSYANSSATAALIVESIVSNGMPLDNPGGLPAQDYVDIAGYILNHNGMALGDEVLAGSAAVQVAAIGADPADIANVVLEEVVDDTPQREYSPVTSEMILDPNPAEWLQWRRTVDNQGHSPLDLINRDTVGDLELAWAYPMGVPGLQEVAPIVHDGIMFLATNQNNVMAVDAVTGDTIWMYTHQRPEFEGAYHSRQAERQKNSVALWDDSVILTTVDGKLISLNALTGQKEWEFQVMDWEKGYSYTAGPLIADGKIFTGTSGCSIAGTNGGCYITAHNADTGEEIWRFNTIDDPNNPLVDESWGGVPAENRWGATPWATASYDAELNMVYYGTGMPIPYSEHTRGTGEGSALYTNSTLALDADTGELKWYYQHMPRDNYDLDSPFERIIIEEEIDGEMRKLVVSTPGKNGITFALDAATGEFVWSKETIYQNVIDSIDQETGEITLNLDTIPSEIGEEKLFCPTFNGGRLWQATAYSPDTGMFYLPAANLCQTITPLAFEMAASGETMGMANTGPQQLAPGHDNVGSLFALNVVDGSDAFEVEQPARFSSSVLATGGGLIFVGDANRWVYAMNDETGEVLWSQRLHAPIGGYPMTYEIDGVQYVAIPAGQSATTQVALTPGMSLPPITGANMLYIFRLPS >NZ_CP019937|877201:912034|898880_900215_+|WP_085785841.1|DBSCAN-SWA MKSAAGWLASAPAATRADFVAALSEAEVAALPWLFDFWALPHQLPPAGDWRTWVVMGGRGAGKTRAGAEWVRAMVEGARPASPGRARRVALVAQTIDQAREVMVFGDSGIMACCPPDRRPSWLAGRGVLRWPNGAEATIFSAHDPEALRGPQFDAIWADEVAKWRLAQEAWDMLMMGLRLGDSPRACVTTTPRGGAFLRGLLAQDSTVMTHAPTRANRANLAPGFVEAVEERYAGTHLGRQEIEGLLVEEAEGSLWPDRLIQLARTQIAPALDRIVVAVDPPVTGHAGSDACGIIVAGVQRRADGPPHFWVIEDATVQGASPNTWAKAAIAAFHRHGADRLVAEVNQGGALVESVLRQVDPNIPYRAVRATTGKAARAEPVSALYEQGRASHVAGLDLLEAQMALMTLQGFKGRGSPDRVDALVWAAHELILGPVAQPKIRSLF >NZ_CP019937|877201:912034|896005_897265_+|WP_085785838.1|DBSCAN-SWA MRRVVVTGLGLVTPLADGVEETWSRLLDGQSGAGTIKQFDASHLATTYACEVPLGDGTDGTFNADRYMEPKDQRKVDDFIIFGVAAAQQAVEDSGWVPQTEEDRFRTGVMIGSGIGGLKSIAETAVLIKEKGPRRVSPFFIPGALINLISGQVAIKHGFKGPNHAVVTACATGAHAIGDAARLIKYGDADVMIAGGAEASICEIGIAGFNACKALSTKAGDNPKAASRPYDADRDGFVMGEGAGVVVLEELEHALARGAKIYAEVLGYGLSGDAYHITAPSEDGEGGARAMAMAMRDAGVTAADIDYVNAHGTSTMADTIELGAVERLLGDAAANVTMSSTKSATGHLLGAAGAIEAVFSILAIRDQVAPPTINLDNPAVETKIDLAPNKKVARKIDVALSNSFGFGGTNASLIVGKYK >NZ_CP019937|877201:912034|900304_901486_+|WP_085785842.1|portal|DBSCAN-SWA 
MFGFGEKKQPVSVPEVKASAAGRVIAFGAAGRTAFAPREGSGLVRAGFGANPIGFRAVRLIAEAAAALPLILQDQTRRYDTHPVLDLLARPNSAQGQLELLEAAYAQILLTGNAYFEAVAPEGLPVELHVLRSDRMSVVPGTDGWPVAYDYAVGGRKHRFAVGEGASPICHIRSFHPHDDHYGLSPLSPAAAAIEVHNSASRWSRGLLENAARPSGAIVYRGADGNATLSPDQFDRLVAEMESQHQGARNAGRPMLLEGGLDWKPMGFSPSDMEFLQTKEAAAREIAIAFGVPPMLLGIPGDATYANYQEANRAFYRLTVLPLAARVTGALANWLEDFTGDWLDLRPDPDQIAALSVERDALWARVGGASFLSDAEKRVLLGLPALDGSDGTA >NZ_CP019937|877201:912034|892570_892957_-|WP_085785834.1|DBSCAN-SWA MPLYEHVLIARQDLSNAQAEGLIEHFSAVLADNGGAVVMSEYWGVKTMAYKINKNRKGHYAFLRTNAPAAAVLEMERLARLHDDVMRVLTIKVDAHEEGPSVQMQKRDERDRGERGDRPERGERRERR >NZ_CP019937|877201:912034|904719_905136_+|WP_085785848.1|tail|DBSCAN-SWA MTAQNGKDLLIKIDMTGDGLFETVAGLRASRISFNAETVDVTSLESQGGWRELLAGAGVRSAAVSGSGVFRDADTDERMRALFFAGDVPTFRIIIPHFGAISGRFQITALEYAGSYNGEATYEVSLASAGSLSFEAEV >NZ_CP019937|877201:912034|894523_895261_+|WP_085785837.1|DBSCAN-SWA MFDLTGKTALITGASGGIGAAIARTLHGAGATVALSGTREAPLQALAEELGERAFVVPCNLSDMEAVEALPKAAAAAMGSVDILVNNAGITRDNLFMRMSDSEWDDVIAVNLTSTMRLSRGVIRGMMKARWGRIVNISSIVGATGNPGQANYAASKAGMVGMSKAIALEVASRGITVNCIAPGFIATAMTDALNEGQQTAILGQVPAGRMGNPDEIAAAVLYLASNEAAYVTGTTLHVNGGMAMI >NZ_CP019937|877201:912034|891299_891650_+|WP_085785832.1|DBSCAN-SWA MGATAAFAQAPAMPTQDDMIAASKNQLGILEYCQGKGFVEQDVVDIQNRLMAALPPSETPDVAEAAYQQGLEGKVSAMGTEVSIADAATAQGTTEEAFCGQISDLVKQLGASLPAQ >NZ_CP019937|877201:912034|901703_902225_+|WP_085785843.1|head,protease|DBSCAN-SWA MDLEFKYAALTPHNTGDGLKVSGYASLFGVRDQGGDVLQPGAFAASLAALKTQGNKVRMLWQHDPNTPIGVWDDVSEDATGLHVSGRLLPDVAKAREVAALLAAGAIDGLSIGYRTLRATKAADGSRLLHEVALWEVSLVTFPMLQQARVTQKADDSLLSALRQARATLANFN >NZ_CP019937|877201:912034|901472_901691_+|WP_085787250.1|DBSCAN-SWA MAPPERPFLCAPGLRIEAQERLVALQFQQLQQQLERLEALIERLEKRLWLTVYGVLGAILAQAFQSFLQVAP >NZ_CP019937|877201:912034|908179_912034_+|WP_085785855.1|DBSCAN-SWA 
MATLVLSAVGAAAGSTIGGGALGLSSMVIGRAVGAVAGRMIDQRLLGGSADPVETGRVDRLRITGASEGAAMARIYGRMRVAGQVIWATNFMETSQTTRAGKGQPGTTAYSYTISLAIALCEGPINGIGRIWADGTEIAPSSMTLRLYHGDATQQPDPRISAVEGPDNAPAYRGTAYVLIEDLDLAPFGNRVPQFNFEVIRNDTTRDDSWAGVVQSVALIPGTGEYALATEPVALHYSYAHQETVNENSPSGKSDLMTSLDQMNTELPRVKSVSLVVSWFGDDLRAGHCTVQPKVEQTPFDAPTQPWRAGGITRAQAATVPRLNDAPVYGGTPGDTSVIQSIRAIRARGQEVMFYPFILMDQQADNTLPNPWTGTEGQPAMPWRGRITTALAPGLPNSPHGTAAADAQVAAFFGTAAPSDFRWDGTRLHYTGPAEWSLRRFILHYAHLCTAAGGVDSFCIGSEMVALTQIRGATGFPAVDALCQLATDVRAILGPDTKISYAADWSEYHGTQPAGTSDKYFHLDPLWAHEAIDFIGIDNYMPLSDWRDGTAHADAAAGAIYNLDYLRSNVAGGEMYDWFYASDESRDAQIRTPITDGYGQEWMWRMKDILGWWSNAHFNRVDGDVGAASPWQPRSKPIRFTEIGCAAIDKGTNQPNKFLDPKSSESALPYYSNGLRDDFIQLQYLRALTQHYADPANNPPSDLYDGPMIEMDYAHVWAWDARPFPWFPARQNLWSDGTNYDRGHWLNGRAGGRALQSLVGEICTGAQMGPVDTSALWGTVHGYALDQVTTGRAALQPLMLSHGFDAVSKDGTFTFQTRHGRPVLTVSPDELVQTDPDTAALILTRQPEAEMAGQVRVSFIAAEGDFATGAADAVLPDARADTVAQSELPLLMTRADAKAAAERWLAESRLARETATLALPPSRGWLQVGDVLRVADMDLRIDQMERGHHLAVTATRVSDTLYQRHDVLADIEQPAAYAPPMPVAATFLDLPSEDGVAAHIALTSATWPGEVVVQSAAAGATPATVARVGSPSVVGETLTPLPAARAGVLDRGPALRVKLISGDLGGASLAALLDGANLAALGDGESDVWEVFQFAGAELVGPQEYALTLRLRGQGGTDGIMPPVWPVGTRFILLDQHVASLPAPPRGVARDWLWGPAQRPTTDRTWRSATRAFAGIALRPYAPCHLRSTRSGGDVTLGWQRRTRSGGDSWDGLDVPLAEDAEAYRLRLLQGGSVLRDVVVGSPAFTYTAAMRTADAASGPITVEVAQLSQAYGAGPALVVDLAL >NZ_CP019937|877201:912034|897264_898404_+|WP_085785839.1|DBSCAN-SWA MWKHLASNFLTFLVVALFLVAGVITWGVREYSAPGPLSQAICLRVPAGGTFGRTADDLRAQGAISSREVFLIMADYRQKRTQLKQGAFLIEPGATMESITDTITRGGASTCGAQVVYVVGVNDFSARIRQLDPDTGRYGEVARFDPTAEGEAPPEYATALAESDVQLAVQVVEGTTVWQVITSLNAIDTLNGDAAMPPEGMLAPDSYEFRRGTDTQALVQQMQDRQQSILDAAWAARDDNLPVSTPEQALILASIIEKETGVPEERRQVASVFVNRLRQGMRLQTDPTVIYGVTDGRGNLGRGLRRSELDGPTPWNTYVITGLPPTPIANPGRASIEAALNPDDTPYIFFVADGSGGHAFAVTLDDHNRNVARWRALGN >NZ_CP019937|877201:912034|886610_887195_-|WP_085785829.1|DBSCAN-SWA MQADLTALITSRISHDLFNPLGAISNGVELLGLVSPSPELDLISDSVKSATARLQMLRIAFGPAGSGEISVRDWTGILAGLAAGGRSKYTHAGTKPLPRNEAKVLALLALCCDSSLPYGGEVSFTPRTGGGWAVRATGREVRFNANLWACATGLSPREPIEPAQVQFLLVPTATTAIGRTLMITNTATTLDLQV 
>NZ_CP019937|877201:912034|907751_908177_+|WP_085785854.1|DBSCAN-SWA MTNPIPAARRWLGTPFVPRASCRGAGADCLGLIRGLWRDLHGAEPWPIPAYGPDWPRALGDNALQIALQKHLPSLAAPRTGAVLLFRLRAGQTPAHLGLCTGTHFIHAHHTSGTIESPLSTPWRHRIAGAFALIPKPQQEA >NZ_CP019937|877201:912034|877201_878599_-|WP_085785821.1|integrase|DBSCAN-SWA MAGITKKAGQRYLVLRHNRWWFRRAIPARCQASLGLGVTYSINLETSDLTTAQRRRNELERTTSALFEDILAGKPADTAQLTARAQGLLYRETLAELEDDDHGDPDSPISAYDAALWSLDATVDAFKDQDDRDAFLGALHGRETVDAHLESYLKTADFAAKTRNDRRGSLAQFARWAADQRPALTIDRIGRKQAGTYVSKVIDEMHPATQKKHLSALRGYWRWLAARGHVKLPTGELMTSGWPWDGQQVLRRGKRAERGGRDEERAFTGQEIALLLNSQPPKGMSPDHLPAIQDALRISLLSGMRMAEILTLWVEEVREDKDGLTFDIQQGKTEAAARPVPVHSTLLDIVRRRTAGKDGKAWLFEELQSERDAGDTFSKRFARYRKALGVDEVREGKRRSLVNFHSARRWFATAADRAGQQEAVIKDVIGHVPDRKNVTRASYIARSSAAQMRACVEAVALPEVA >NZ_CP019937|877201:912034|886416_886614_+|WP_085785828.1|DBSCAN-SWA MAQGYKTGALAALKSTIQTARAWVEARSLPTLHAIDPMIGWGGCVDTLALAHTRYVAIPVVVTKR >NZ_CP019937|877201:912034|881977_882901_-|WP_085785825.1|DBSCAN-SWA MTGRSQLFTLDAVTVEFGRNVALDRVSLQIAQGEWFSILGRNGAGKSTLIKVLASLLRPDFGKAFFHFQGKMRPVGRARQHLGFVPQSGSLDARLSVLENLLFAAALQGLRGPRRKAAIDAALTQFEPLADRIVGTLSTGQKRRVEVARALLHEPAVLILDEATAGIDAHSRQQIWDQIAARRNAQNVSVIMTSHFIEEVAQSERACIMHGGKLHGIWQTDALLDKFGGQHYRITPTDSAARAALLVALHDATLDAATGDVLLTTTSAAPADLAAIARYGAKVAAGRAPLERAIISATEGVNPGVSA >NZ_CP019937|877201:912034|903450_903993_+|WP_085785845.1|DBSCAN-SWA MMLVEETTVADAALPVAALGDFLRLGTGFDTDNMQEGLLRAFLRAALAAVEGRIGKILIARTFREDLAPPAALSALPLREVIAVTANGAPVDWQVQQGLRPRVTLRGWQTDQQLSVRYIAGMAADWETLPADIQQAVLMLAAHYYEYREDPDLDGACMPFGVSALTERYRTVRLGFGGAQ >NZ_CP019937|877201:912034|889058_891020_-|WP_085785831.1|DBSCAN-SWA 
MPIHLSRRHFLAASGAAMATLHPYSLRAASNQAHLRIMETTDLHVHVFPYDYYADRPVDTLGLSRTASLIEGIRSEATNSILFDNGDLLQGNPMGDYIAYERGMEPGQTHPVINAMNTLDYACGTLGNHEFNYGLPFLANALEGAAFPLVCANVAVKLGATPLQDETLIAPYVILDREITLGDGSVKPIKLGVIGFVPPQIAAWDRAHLEGKVVVREIVEAAAAYVPQMKAEGADVIIALCHSGIGEAEAVADQENAAIPLAATPGIDVVLMGHAHRVFPGTDYEGTEGVDTVAGTVSGKPAVMPGFWGSHMGLVDLMLEHDGTAWKIAGASVEARPIWIRDADNKVVPTVGDVAVVQAAAQTEHDATLAYVRRAVGQTSAPLYSYFALVADDPSVQIVSQAQLWYVAQLLQGTEYESLPLLSAAAPFKAGGRSGPDYYTDVPAGDIAIKNVADLYLYPNTVQAVLVDGTQLKNWLERSAGMFNQIEAGAQDAPLLNPTFPSYNFDVIDGVTYQIDLTQPAMFETDGTVANPDANRIINLQYNGAPVTADQKFVIATNNYRAGGGGSFPGADGSTRVLIAPDTNRDALVRYIVEEGTINPTADGNWSFAPLPDTTVTFETGPKAVDYVDQVQTIAIEHVGDGENGFALFRIAL >NZ_CP019937|877201:912034|903989_904322_+|WP_085785846.1|head,tail|DBSCAN-SWA MTRLRLNRALILQRPERDSDGAGGYTEGWQTLGTLWAAITPATGREAAAFGAALARVPVRITLRAASIGDPRRPVAGHRFREGARSYLILAVQDSPQRRLTCIAEEELVR >NZ_CP019937|877201:912034|893099_893417_-|WP_085785835.1|DBSCAN-SWA MPWVYLILAGGFEIIWAYTMKLSDGFTKPLYTSITLVTMVCSFALLSIAMKTLPLGTAYMIWTGIGALGAFVIGVTFLGEALTPLRVVAALLLLSGLILMKLATK >NZ_CP019937|877201:912034|891729_892305_-|WP_085785833.1|DBSCAN-SWA MEVILLERVAKLGQMGQVVKVKDGYGRNYLLPQGKALRATEANIAKFESQKAQLEARNLETRKEAEAVGETLAGQSFVVIRSASDGGVLYGSVAGRDIAEAATAAGFTVDRHQVALIAPIKDLGVHDVTVRLHPEVEVTVSVNVARSAEEAELQASGKSIRELAAEAEAAAEFEIAELFDDLGGAVDEDEE >NZ_CP019937|877201:912034|904318_904708_+|WP_085785847.1|DBSCAN-SWA MSYTLSLALQQALFTRLTTTDGLSLPVHDALPSGTVPPLYIAIGPEDVESLATPDGPITLHQVKISVIATGGGFGTAKGIAAQIITALTAPLELPAGHASAPLFQAATAKSTTGADRRIDLTFRIRLEP >NZ_CP019937|877201:912034|881168_881981_-|WP_085785824.1|DBSCAN-SWA MIDTLQAIHAIWKRDVQKFLRDRALLVGCISRPLLWVVIMGIGLGPYFRAESFGQIRFAVPYTYLQFIFPAVISLNIMFGAMQAAMTVIADREYGFAQHLATSPRARGTIMFAKALGSTTIATLQGAIVLLLAPIVDVSLTATQVLTGLGAMAAFAFAMSCFAIFLATRTSSFEGFGLFSNAVLLPLYFTSNAMFPIDPALGAAQAKALYPEWMVILVSMNPITYVNDLLRGIFIGFHSFSLKGSAVMIAICAAIFFGLALLRFLRIQKV >NZ_CP019937|877201:912034|893577_894513_+|WP_085785836.1|DBSCAN-SWA 
MRAFVFPGQGSQAIGMGKALADAYPAARAVFEEVDAALGENLSGLIWDGDIDTLTLTRNAQPALMATSIAALRALEAEGVTLQAAGFVAGHSLGEYSALCAAGALGLADTARLLRTRGDAMQAAVPVGVGAMAAILGLDFATVAAIAAEAGQGEVVQAANDNDPSQVVVSGHKAAVERACELAKEKGAKRALMLPVSAPFHSALMQPAADVMRDALAAVTINTPVVPVVANVMAEAVSDPATIRDLLVQQVTGSVRWRESVQWMAAQGVTEAWEIGAGKALSGMIRRIEKTIECRTIGTPEEAAAAAATLS >NZ_CP019937|877201:912034|895477_895711_+|WP_013383938.1|DBSCAN-SWA MSDVAERVRKIVVEHLGVEEDKVVEAASFIDDLGADSLDTVELVMAFEEEFGIEIPDDAAETIQTFGDAVKFISEAQ >NZ_CP019937|877201:912034|885404_886244_+|WP_085785827.1|DBSCAN-SWA MRALLAAALTIAPAIFPVAASAQLMEQVANPQLMNPNILPSGSQLRICHQNGLVTADLDIAIAREIAARLFLEIEVNVLPSGYGVGGEFAAADLLVNLSAQCDAIFGMGIGANIYPAEFTVTQPYVAHSFTYLATNPDYQRLTDIPAGLRVGVEMASYGSFVFRQFNSLRPESERLGFLPYADHTLMLTRLQDETLAAVSIYAPFWRAGADTPVYQGVHELQRMPELAARVNTGALLLSQNTFMRGELDAAIADLLADGTVARLIEELGYDAYGTTVAQ >NZ_CP019937|877201:912034|892317_892545_-|WP_014537607.1|DBSCAN-SWA MATKPFFRRRKSDPFEGEGAPVIDYKDTRLLQRYISERGKIVPARITAVSAKNQRKLAQAIKRARFLALLPYAVK >NZ_CP019937|877201:912034|898594_898933_+|WP_085785840.1|DBSCAN-SWA MSQVLTQTGDDEAARLQRDLRDAMVLVRSVRTSLADMLAELSAGNPGPLREIAPKHSELESALRRAFETEQKFNDWTAKFTGGQDGETLDYDAIRDEISCRLARLGPCCDAG >NZ_CP019937|877201:912034|905654_906272_+|WP_085785851.1|tail|DBSCAN-SWA MSDLTPTTAELQQLHSMTQALNAGLRDMRGTMANTNREVAGLERGLSSGLRKAFDGLIFDGDRLGSVLGTISTSIQNAVYSAAVKPVTNHLTDYLINGLHGATPFANGGAFTQGRVMPFAKGGVVTAPTTFPMRGGTGLMGEAGPEAIMPLTRGADGRLGVATQGGAGVNLTMNIQTPDATAFQRSQSQIGAQISRLVARGQRNR >NZ_CP019937|877201:912034|905132_905462_+|WP_085785849.1|DBSCAN-SWA MNPANPHAGEVIIPIDGVPHVGRLTLGALAMLEAELNTGTLTDLVARFEGGAVKSADVMALVVAGLRGGGWTGTAATLLAADIAGGPLGAARIAGQLLARAFSTSDGAQ >NZ_CP019937|877201:912034|906909_907755_+|WP_085785853.1|DBSCAN-SWA MTDLSTTRCTAWAITRADGTTLGFTDHDADLTFAGLTFRAASGMTASTLAQGSGLSVDNAEGFGTLTADAMREADIRAGHFDGADVKIWQVNWQAPAVRQQIFHGTLGEITLEGGAWRAELRGAAEALSRPLGRSYQRGCAAVLGDAACGFDLSTPGFTADTTLRSATETRLTLPAIDAAPRWFERGQVQILSGAAAGLTATIKTDEAAGDTRILTLWSPLAITPEPDAQIRLLPGCDKRMATCRAKFGNLPNYRGFPHIPGEDWLRAIPKSGASGESLFQ 
>NZ_CP019937|877201:912034|905461_905662_+|WP_085785850.1|tail|DBSCAN-SWA MDWPGLLRLGLQRLHLRPAEFWALTPIELMLMLGLAGAAAPMARARLAELARAYPDTRPPHEAPDE >NZ_CP019937|877201:912034|902237_903380_+|WP_085785844.1|capsid|DBSCAN-SWA MDPVQLSQITSEMENFLGEFSGFAAEVKQRLEQQETRMTRLDRKAAAHRPVLSRAADLDAPHQKAFDAYLRSGDDDGLRNLELEGKAMNTAVAADGGYLVSPETALTIQNVLGATASIRAISSVVNVDAASYDVLVDRTEPGAGWASETGTVAESGTPVIERVSIPLHELAALPKVSQRLLDDSAFDLEDWLANRIAQKFARAEAAAFINGDGVDKPKGFLTGTKVANTAWAWGNLGYIATGATDTLPADSIVDLVYALGAEYRAGASFVMNSKTTGVLRKLKDADGRFLWSDGLAAGEPARLMGYPVLIAEDMPDIAANAYAVAFGNFQSGYTIAERADLRVLRDPFSAKPHVLFYATKRVGGAVTDFAAIKLLKFAAS >NZ_CP019937|877201:912034|879740_880364_-|WP_085785822.1|DBSCAN-SWA MINPFTFYSQSPQTIAYAVLVGVVTVTLHSWILAALTWLAGDRGPRQDGRLTLSPTRAASPISLLSITFTQIGWIRPMQITPRALRGGAVSLLPLLLISLALLALFAQSLMHLRPLIAHTFPGAMPAYVTMALIDAFRRTVFWYVAFNLLPIPPFLMGHLWLWRGQPIWMKSAPIGRCIALIAFVTLVTVTVSYRANLIQIELALLR |
39 | Paracoccus_phage(33.33%) | tail,head,integrase,capsid,protease,portal | attL 869787:869802|attR 908006:908021 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_3 |
1032818 : 1040892
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP019937|1032818:1040892|DBSCAN-SWA GATGTATGATGTAGAAAAGGTGCGCGGGGATTTTCCCATCCTCGCGCGGCAAGTGAACGGTCGGCCATTGGTCTATCTGGACAACGGGGCATCGGCGCAAAAGCCGCAGGTTGTCATCGATGCGGTGACAAATGCCTACACGAACGAATATGCTAACGTTCACCGTGGCTTGCACACCCTTTCTAATATCGCGACTGAAAAGTACGAAGGCACGCGGGGTATCATCGCGCGCTTTTTAAATGCGCCGCGGGCCGATGATATCGTCTTCGCCTCGGGCTCGACCGAGGCGATCAATCTGGTCGCTTATGGCTGGGCGATGCCGCGGATGGTGGCGGGGGACGAGATCATTCTGTCCATCGCCGAACATCACGCCAATATCGTGCCGTGGCATTTTCTGCGCGAACGGCAGGGCGTCGCGATCAAATGGGTCGAAACTGACGCCGCCGGCGCGATTGACCCGCAAGCTGTGATCGACGCCATCGGCCCGCGCACCAAGCTGATCGCCATCACGCAGTGCTCGAACGTGTTGGGCACCATTGTCGATGTCAAAGCCATAGCCGCAGGCGCGCATGCCAAAGGCGTTCCGGTGTTGGTTGATGGCAGCCAAGGTGCCGTCCACATGCCGGTCGACGTGCAGGATTTGGGCGTCGATTTCTACGCCATCACGGGTCACAAACTGTATGGCCCCTCGGGGTCCGGCGCAATCTACATCCACCCCGATCGTCAGGCCGAAATGCGGCCTTTCTTGGGTGGCGGGGATATGATTGACACGGTCACGCGCGACACAGTCACCTACGCGGCCCCTCCGATGAGGTTCGAGGCCGGCACCCCCGGCATCGTGCAAACCATCGGCCTGGGCGTGGCGCTGGAGTATCTAATGGGCCTCGGGATGGCGAACGTCGTCGCGCACGAGGCCGATCTGGCCGAATACGCCGCCGAGCGGCTAGGGGCGTTGGATTTCATCCGCCTGCAAGGCAACGCGCCAGGCAAAGCCGCGATCTTTTCGCTGACGATGGATGGCGCGCATCCCCATGATGTATCGACGGTTTTGGACAAACGGGGTATCGCCGTGCGCGCAGGGACCCATTGCGCGCAGCCGCTGCTGAATAGCTATGGCCTGACCGCAACCTGCCGCGCGTCCTTCGCCCTCTATAATACCCGCGCCGAGGTTGATGCGCTGGTCGATGCCCTTCACTTATGCCGGGATTTGTTTGCGTAAGGGTAAGCGATTGATTGGTTGCAGCCCATGGCGGCTGCGACTATAGATGATGAGCATGCACCCGTAGCTCAGCTGGATAGAGTGTCGCCCTCCGAAGGCGAAGGTCACAGGTTCGAATCCTGTCGGGTGCGCCAAATCCTTAAAATATGTATGTTATACAGATGCTTAGGTGTCCGTTTTGGAAACTATTTGACTTCGGTTTCCAGCTTTCTCGTTTCGTTCCATCGACAAGATTTTCATCTGGTCGGCCTGTTTGGTGTAATGGCTGATCTCGGAAAGGCTTGCATGGCCCGTCCAAGCGCCGATCTGTGAAGATGTTGCACCAGCTTCCGCCAGTGCCGCAGCGCGGGCTTTTCGCAGTCCGTGGGCGGTGCAGTCGTCGGGCAGTCCTGCGGATTTTGCTGACTTGCTGACCCACTGTGAAAGGGCCTTGACGCTACGCGCTGCGCCATTGCCTTTGGCTATCCAGATCATTCCGGTCTGCGGCAGAGCAACCAGAAATTGGGCGTGATCGGTCGACATTGCTTCTGCCCAAGTTGGAAGGTGCCGAACAGGGCAGGTGGCGGGGCCATCTGTCTTTGCTTGAACGAAGCGCAGCCAACCGTCTTGACCAACCATCTGCCACCCGAGCCTCACGGCATCCACGCAGCGCGCGCCCGTCCAGTAGATCACCTCGAATGCGCGCCGCTCGGCTGTTGTCATCG
GCCACTGCGATCTAAACGTTTCGATCTCGGATTGCGTCCATTGGCGATGGGGTCTTACTTCGCCGATCTGCGCTTTCAGGCCGAATGACGGATCGCTAGCGATTTGGCCTTCCTCGACTGCAAACTTCAGGATGCTGCGCCACGCCTTGATCCTGTTCTGCTGCGCGCCGGGCGTGAACGCTCGAATGTCCTTGCGTAGATGGTCGATGCGAAGGTGCTTAACCAACGCCTGTCCGCGTTGTTCGCTGATACGATCAAGCGTTCTACGCCAAGTTGCCTGAGTGCTAGTTGCCATGCGCTTATATTCGGGGGAGCCAAGATAGGCGACGATCAGCGCTGCGATAGTTCCCTCCGGCTGCCGTCCTTTGGGCTTTGCGTTTCCGGCCTCGGCATAGGCTTTCAGGAATGTGGGGTGGTTCTCGGGAAGGTCAGGTAGTGGCACCAGCATTCCCGCAACCCGCCTGTATACGTAGCGCTTGCCATTCGGCTTTGTGACTACCTTGATGTCTGTTAGCCGAATACCACGTCGCATGTGTTCTCACTCGTGTCTTTGTCAGTTGGCAGTGTGTCGGCAAGTGCATCGAGGTCAGACCGATCGTATAGCCGCCGCGTGCCAATCATTCGGCGCGGCAGCCCCATGTCGCTCAGGGTGCTGATGCTTACTCCGAGATAACGCGCTGCCTCGCTTGCCCCGAGTAATCGCGGTGGATAGTGCAGTTCGGCCATCACTCACCCCCTTCCAGCGCCGACAGTATCCGCGCCTCGAAGTCGGCTTGGGCTGCGGCTTTGGCGGCTTCAAAGCTGTCCGCATTGATCCAATGATCCTTCATGCCTGAGAAATACCAGCCGTCATCGTCACAGTAGTATTTCCCAATAGACGTTCTGGCTGATTGGGAGCGGACGCCTGTTACAGATACACTGACATCAGTCCAAGCCAGCGGCTTGACCTTGCGGGCAATGTCGGCGCGGATGTATTCCACGGCTTCCACATACCCCATAGACATGGTGCGCTCCTGACACATCACATCGCGGGTGCCCGTGCCAATCTTTCGCGCTACGGTAATTCGTTCGGGTGCGTTAGTAGTCATCACGCTTCCCCCTTAGATGTCTGCCATGTCCAACTGTTATTGGCCGACACAAGAAAGACACTTTCGCACTCTCCGCATTCATACTCTCCGACGCCCTCGTCGTATAGGAACCCGTCGCTGTCTCTTGCTTGGTTCATGTATCCACAATGGGGGCATTCCGCCCCCACGTTTGAATATCGGTCAAATTCAGTCATCACGCTTCCCCCTTCGGCTTGAGCAGGGCGTGGATGGCGTCTGCTGATATGAACAGTTGTGACCCATCATTTGGGTTTTCAGAGCAATATGTCCCATCTGGCTCGCCCGTTCCCATGTCATAGCCGAACCACATTTGACCCTTGCTATTGAGGGTAGCCTCTACCCCCTCATTCCACGCCAGCCGCTTCTGTTCTTCCAGTGCTGCGGTCGCGTCGGCGGGGGTCAGGGCGAGGATGGCTTGACGCCACGACCACGGCACTCGAACTTTCTCTCCGGGAACCCGACCAGTTTCCACCTCAACTGCCGCCGCCTCAAGCAGCGCCTCGTTGCGGGCGTCACGCTTCACCTGTTCAAGGGCAGCGGTGGCGGTGACATGCGATAGGCCTGCCGTCATGTTGTCGATCTGGGTCAACACCCCAAGTAGGTCTTCAAACGGCTGCCAATCCGGTGCGTCTGTCTCATAGTGGGTTTTGTGCAGGCAGACGGCAAGAGATGTGGCGTATTCCTTGGCTCGACCCGTCTCTGCGCGGGCTGTGTCGCGTTGTGAAAGGGCTAGGCGGGCTGCATCATGGGCCGCATACTCTGCCTTCTTGTGAGCCTCGCACCACTTGAGGGCTGTGTCGCGTTCGGCAGTCAGGCGGGCGATTTCGGCTTGCAGCGACACAACGCTGACGATTTCTGTTATTGGATGGTTCTCCATCAGGCACCCACTTTCTC
AAAGTGAAAAACAACTGGGGCATCGTGGATGGTCACGATGCCAAACCGGACAGCAGCCCGAAATATGGATTTCTCACGCGACACGATAGCAGCCGACCGCGTTAACAAATCTCGCCATCCCTCAAGAGACGATCCGCCCTTTCTGATGTTGCAGGGGCCGCACGCGGGCATCATGTTCGACACCACATTGCGCTCGGGGTTCATCATCCTCTGCTCGCTGGCCGGTAGTGCCCGCCCGTAGATGTCGGTGGTGATGCGCATCACGGGCTTAATGTGATCGGCGTGCATCCTCTCCATAACTTCACCGCAATACCCGCACCGGCCATCAAACCGTGCGCGCAACTGTTCTTTTTGTTTTTTCGAGCCTGTCCAGACTTCGTCGGAGTTCATCACTTCACCCCTTCGCGCAGATTGGTGTGGGCAGCGAGGGCGCTATCGACCCTGACACCCACACCGCTATGACCACCAAAATAGGTGCCGCAATCGCTTAAAGCGGCTGCCATCGCGTCCGCGTTAGCCCGCTCCCCAGCCAGTTCAGCGCGAATGCGGGCGGCATCCGCCTCCAACTCGATCACGCGGCGGGCTAGGTCGGGGGCAAGGGCGATGAGGCGCATGTTCCCACGCCCACCGTCATTATTCGCCCTGAACACTTCTGCCACGATCACATCTAGGTTTCCGCTCCACACCTCCTGATAGTGTGTCGTCGTCCTCCCGTCATAGCTTTCGCAGTCGTGCAGGCGCCAAGGTCCCTCTGTCGCGCCATCCAGCAGCGATTGCAGGTGTTCGGGGGTTAGGTTGGGGATGGTCATTCCACAATCTCCAGCAGGCGATACATGAAGTCAGCAGCCGCCCTCGATTGCATCCACGTATCGAACACTACGAAGGCCGTTCCGAAAATTGCCAAGAATGTGGCGGCATTGCCGATGAACGCCCATGCTTTCGACTTCGGCTTTTCTGCGACATCTCCATCTTTCCGCGCCTCAATTTCACGGGCGTATGTTCGGATGATCCCCGCGCACAATTCCTCGGGGGTCATCCCGAGAATTTCGGCGGTGCCTTCAATGACGCGCAAATCTGACGCAGCAATCTTGATGAAGCGGGGGCCGTGATCTGGCTCGGCCTGCTCGATTTCGCGGGCGGTGTGGGGGTCTGTGTATCGGTTCATTCCGTTACCTTTCAAAACGATGCGGCGAGGCAGTCATCCTCAAGCGCGCGGAGTTGCTGTGCGACCCGATCAAGCAGGGCCTTGCTGGGGCCGTGGGCGGCGTAGAAGGCCTTCGGGCTGTAGTGGTAGGCGGTCGGGCCGAACTCGCGCCGGTGGTGGCGCGGGCACAGCGGCAGGACGTTCCAGTCGCTGCGGGGCTTGCCCTCGTGGTGCACCTCGACGCCGTAGCAGCCGCAGACGAGGCACGGCAGTTGAGCCACGGCGGCCATGTGCTTCTGTGCGGCCTTGCGCTCCTTACTGCGCATGTAGGCCTCTCGCTTTGGGCTGCGGGGCGGTAGGGCTTTGCGCGGGGCTTTCGGTGCGCGGCCTTGCTTCTGGCGGACGGGCTGGCCGGTCAGGTTCATCCCTTTCCTCCAAGGTGGTATCCGCCGCACACGCCACAGCGATACGTGTTCATCAGGCCCCGCGCGCGGCGCGGCAGTCGCATGAGGCTGCGCCATGCCTGCTGGGGTGTTTGGTAGCGGTGCTTGCCGTCGCACATCGCGCAGGCCGGTGCCGAGATGCGGGTGGCGGGGTTCCAGTGCATCATTCCGTCGCCTCATTGGTCCAGCGGACGCCGTGCCTGTCGCCGTATTCTTGCGCGAAGGTGATCAGATCCGCGAATTGCGATTTCGACAGCCGAGACGAGCGAAAGCCCGCGGGGAACGGCGGGGCGTTGTCGATGCCCGGCTGCCAGATGATTTCGTGGCCGAGGGCCGACATGATGGCGGCCTTCCAAACTTCGGGGGCGTGCACGCGGCCCTCAGGCTTTGCGCGGCTGATGTCCGACAGCATG
GCCCAAAGCTTGGCGTTCTGGTCTGTCGTCCGCTTAGCCTCGCGCACCGTCACGACGCAGCCAGCAGGGGCCTTGTCGATCAGGTCATGCGCGCTGCGGCGGGCGTAAGCCGATGTGAGCCAGACGGTCTGCATGGCTACACCGGCGCTGCGATGGTGAAGTGCTTGCGCCAGATCGCCCAGACCGAGGCGCGCACCGCAGCCGGATCGGCGGGGTTGAGCAGGGCGCAGGCGTGGCCGATCACCGCCTCGACGGTTTCGGCGTCAGCGTCGATGTCTGGGATGCCGAGCACCGACATGGCGCGATGCAGGGCGTCCTTGATGTTGAGCGGCGGGGTCACGCGGGGCATCACAGCACCTGCGCGGCAAAAGGTATTTCGTCGTCGCTGTAGCTGCCGCCAGCGTATCCGCCTTGGCTTTGCTGCGACTGCTCGCCCGCGCCGTTATCGCTGCCGTCTCGGCCGTCCAGCAGCGTGAGGGTGCTGTTGAACGGGCGCAGGGCGACCTCGGTGCTGTAACGGTCAGCGCCGGACTGGTCTTGCCACTTGCGGGTTTCCAGCTGGCCTTCCAGATAGACCTTGCTACCCTTGCGCAGGTATTTCTCCGCGATGACGCCCAGCGGTTCCGAGAAAATGGCAACGCTGTGCCATTCGGTGCGCTCTTTGCGTTCGCCGCTGGATTTGTCGCGCCATGTCTCGCTGGTCGCGATGCGCAGGTTCACGACCTTGCCGCCATTGGGGAAATTGCGGACCTCGGGGTCGCGGCCCAAGTTGCCAATCAGGATGACCTTGTTGACCGAGCCAGCCATTAGAAAGCTTCCTTTTCCTGCCAGACGCGGAGGCCGTCTGCATTTTGGGTTGTGGTGTGGTTGCGACGGGCCCAGTCGTCGATAAATGCGGTGATGGCATCGCGATCATTTCGCGCGATGAAGTTGAGCAAGGAGCGGTGATCCGTGACCTCGTAGCGCGTCACGGTGCGCAGCCCCTTCACCCTGTCTTTGCCTGCCTTTGCGGCTTGTGCTGCGGCGATCTCTGCCTCGTGCTGCGCAGCGGCGGCGGCCCGTTGTGCCTCAATGTCGGCTGCATTGGCGCGCGCAGCAGCTTCTTGGGCTTCACGCATTTTGCGGTCGGCCTCGGCGCGGGCTTGGCGCTCGGCTTCGGCTTTTTCCGCTGCCAGCTTTTGCTTGAAGCCGTTCACCACCGAGACAAGTCCAGCTTCGATCCGCTTGGCGTCCTCGATGGTTGGTTTCCAGCGCGCGCCCTCTGCCTTGTAGGCATCGTATAGCGGGGCGGTGGCGGATTTCTGACCCGCCTCCAGTTTCAAGCGCCATTCGCGCATCGACTGCCGCAATTCATCAACGGCATTCATTTGGCTTTCGGTTTCGACCGGCGATCCATCCAGCCAGTTTTCTGCCTCGATGCGCCAGCTTTCGTACTGGCCGCAGATTTCTTCGATGGGGTCGGGTGGGTTGTTATGCCCGACTGATGCAAGTGCGTTCATTCGTTCAGGCCTCAGTAGGGAATTTCATCATCGACCAAGCCGCTGCTCAGTCGCGCTTTCGCTGCGTCTTTGGCAGCAACCACACGCGGATCGCCCCGCACACTGATCGGCAGCCCACTGAACACCGCTTGCAACTGCTCCATGGTCGCGGCCTTGCCCAAGCTATCGCAGGCAACATCCATAGCGGTGGGGCCGGTGCTGGCCGGTGCCGCCTGACGCGATGCTTGCTTCGGGGCGGCGGATGCTGCTGCGTTCCCGTCGTCGTCCTCGGGTGCGATGCCTGCCATTGCCATCAGGCCGTAGCGGCGGGCGTAGGTGACGGCAGAGCCATATCCCTGCATGTCGTTTTTCGACACGATCAGTGGCACATCGCATTCGATGGTGGTGCCGCTTTCGCCGTGCGCCAGCACCGTGCGAATATAGCGGTCGTCGCCGATCGTCAGCGCGGGCTGAAAGACTGCGATGCCGTGGCTGTTCAACGCGGGCAGGCACGCATCCATG
ACCGCACCCAAATCAGCGTACTTGCTGCGAAAATGGGGATTGCTCGAAGACTTCAGGGCTTTGCCCATCTCGGCCTGCGCAGCTGCAAGGGCGGCATAGACGTTGGCATGAACGGCGGTCTTCTCGCCAGTGACGGGCATCATATCGTTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP019937|1032818:1040892|1035465_1035828_-|WP_085785957.1|DBSCAN-SWA MTTNAPERITVARKIGTGTRDVMCQERTMSMGYVEAVEYIRADIARKVKPLAWTDVSVSVTGVRSQSARTSIGKYYCDDDGWYFSGMKDHWINADSFEAAKAAAQADFEARILSALEGGE >NZ_CP019937|1032818:1040892|1037916_1038309_-|WP_085785962.1|DBSCAN-SWA MNLTGQPVRQKQGRAPKAPRKALPPRSPKREAYMRSKERKAAQKHMAAVAQLPCLVCGCYGVEVHHEGKPRSDWNVLPLCPRHHRREFGPTAYHYSPKAFYAAHGPSKALLDRVAQQLRALEDDCLAASF >NZ_CP019937|1032818:1040892|1032818_1034036_+|WP_085785954.1|DBSCAN-SWA MYDVEKVRGDFPILARQVNGRPLVYLDNGASAQKPQVVIDAVTNAYTNEYANVHRGLHTLSNIATEKYEGTRGIIARFLNAPRADDIVFASGSTEAINLVAYGWAMPRMVAGDEIILSIAEHHANIVPWHFLRERQGVAIKWVETDAAGAIDPQAVIDAIGPRTKLIAITQCSNVLGTIVDVKAIAAGAHAKGVPVLVDGSQGAVHMPVDVQDLGVDFYAITGHKLYGPSGSGAIYIHPDRQAEMRPFLGGGDMIDTVTRDTVTYAAPPMRFEAGTPGIVQTIGLGVALEYLMGLGMANVVAHEADLAEYAAERLGALDFIRLQGNAPGKAAIFSLTMDGAHPHDVSTVLDKRGIAVRAGTHCAQPLLNSYGLTATCRASFALYNTRAEVDALVDALHLCRDLFA >NZ_CP019937|1032818:1040892|1036021_1036723_-|WP_085785958.1|DBSCAN-SWA MENHPITEIVSVVSLQAEIARLTAERDTALKWCEAHKKAEYAAHDAARLALSQRDTARAETGRAKEYATSLAVCLHKTHYETDAPDWQPFEDLLGVLTQIDNMTAGLSHVTATAALEQVKRDARNEALLEAAAVEVETGRVPGEKVRVPWSWRQAILALTPADATAALEEQKRLAWNEGVEATLNSKGQMWFGYDMGTGEPDGTYCSENPNDGSQLFISADAIHALLKPKGEA >NZ_CP019937|1032818:1040892|1034201_1035272_-|WP_085785955.1|integrase|DBSCAN-SWA MRRGIRLTDIKVVTKPNGKRYVYRRVAGMLVPLPDLPENHPTFLKAYAEAGNAKPKGRQPEGTIAALIVAYLGSPEYKRMATSTQATWRRTLDRISEQRGQALVKHLRIDHLRKDIRAFTPGAQQNRIKAWRSILKFAVEEGQIASDPSFGLKAQIGEVRPHRQWTQSEIETFRSQWPMTTAERRAFEVIYWTGARCVDAVRLGWQMVGQDGWLRFVQAKTDGPATCPVRHLPTWAEAMSTDHAQFLVALPQTGMIWIAKGNGAARSVKALSQWVSKSAKSAGLPDDCTAHGLRKARAAALAEAGATSSQIGAWTGHASLSEISHYTKQADQMKILSMERNEKAGNRSQIVSKTDT >NZ_CP019937|1032818:1040892|1036722_1037130_-|WP_085785959.1|DBSCAN-SWA MNSDEVWTGSKKQKEQLRARFDGRCGYCGEVMERMHADHIKPVMRITTDIYGRALPASEQRMMNPERNVVSNMMPACGPCNIRKGGSSLEGWRDLLTRSAAIVSREKSIFRAAVRFGIVTIHDAPVVFHFEKVGA >NZ_CP019937|1032818:1040892|1037545_1037905_-|WP_085785961.1|DBSCAN-SWA 
MNRYTDPHTAREIEQAEPDHGPRFIKIAASDLRVIEGTAEILGMTPEELCAGIIRTYAREIEARKDGDVAEKPKSKAWAFIGNAATFLAIFGTAFVVFDTWMQSRAAADFMYRLLEIVE >NZ_CP019937|1032818:1040892|1037129_1037549_-|WP_085785960.1|DBSCAN-SWA MTIPNLTPEHLQSLLDGATEGPWRLHDCESYDGRTTTHYQEVWSGNLDVIVAEVFRANNDGGRGNMRLIALAPDLARRVIELEADAARIRAELAGERANADAMAAALSDCGTYFGGHSGVGVRVDSALAAHTNLREGVK >NZ_CP019937|1032818:1040892|1035250_1035466_-|WP_085785956.1|DBSCAN-SWA MAELHYPPRLLGASEAARYLGVSISTLSDMGLPRRMIGTRRLYDRSDLDALADTLPTDKDTSENTCDVVFG >NZ_CP019937|1032818:1040892|1038876_1039089_-|WP_085785964.1|DBSCAN-SWA MPRVTPPLNIKDALHRAMSVLGIPDIDADAETVEAVIGHACALLNPADPAAVRASVWAIWRKHFTIAAPV >NZ_CP019937|1032818:1040892|1039088_1039547_-|WP_085785965.1|DBSCAN-SWA MAGSVNKVILIGNLGRDPEVRNFPNGGKVVNLRIATSETWRDKSSGERKERTEWHSVAIFSEPLGVIAEKYLRKGSKVYLEGQLETRKWQDQSGADRYSTEVALRPFNSTLTLLDGRDGSDNGAGEQSQQSQGGYAGGSYSDDEIPFAAQVL >NZ_CP019937|1032818:1040892|1038490_1038874_-|WP_085785963.1|DBSCAN-SWA MQTVWLTSAYARRSAHDLIDKAPAGCVVTVREAKRTTDQNAKLWAMLSDISRAKPEGRVHAPEVWKAAIMSALGHEIIWQPGIDNAPPFPAGFRSSRLSKSQFADLITFAQEYGDRHGVRWTNEATE >NZ_CP019937|1032818:1040892|1040250_1040892_-|WP_085785967.1|DBSCAN-SWA MNDMMPVTGEKTAVHANVYAALAAAQAEMGKALKSSSNPHFRSKYADLGAVMDACLPALNSHGIAVFQPALTIGDDRYIRTVLAHGESGTTIECDVPLIVSKNDMQGYGSAVTYARRYGLMAMAGIAPEDDDGNAAASAAPKQASRQAAPASTGPTAMDVACDSLGKAATMEQLQAVFSGLPISVRGDPRVVAAKDAAKARLSSGLVDDEIPY >NZ_CP019937|1032818:1040892|1039546_1040239_-|WP_085785966.1|DBSCAN-SWA MNALASVGHNNPPDPIEEICGQYESWRIEAENWLDGSPVETESQMNAVDELRQSMREWRLKLEAGQKSATAPLYDAYKAEGARWKPTIEDAKRIEAGLVSVVNGFKQKLAAEKAEAERQARAEADRKMREAQEAAARANAADIEAQRAAAAAQHEAEIAAAQAAKAGKDRVKGLRTVTRYEVTDHRSLLNFIARNDRDAITAFIDDWARRNHTTTQNADGLRVWQEKEAF |
14 | environmental_halophage(12.5%) | integrase | attL 1031566:1031579|attR 1036227:1036240 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a specific phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1169695 : 1176803
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP019937|1169695:1176803|DBSCAN-SWA ATCACACGGGCGCGATTGCTTGCTGATGCGACCAAAGTTGCGCATAGCGCCCGCCGCGCGCCAGCAAATCGGCGTGGCTACCCTGCTCGATCACCTGTCCTTGATCCAGCACGATGATCTGGTCGGCATCGGCGATGGTCGACAGGCGGTGCGCGATCGAAATGGTGGTGCGGTTCGCGCCCGCGTGTTGCAGCACATCCTGAATTCCCAGCTCGGTGCGGGTATCGAGGGCGGATGTCGCCTCGTCCAACAGCAGCAGCGGGGCGTCTTTCAACAACGTGCGCGCAATGCCGACGCGTTGCTTTTCGCCGCCCGACAGCTTCAGCCCGCGCTCGCCCACTTGGGTCTGGTAACCGTCCGGCAGGCTGGCGATGAAGTCATGAATTTGCGCCGCGCGGGCCGCATCCTCGATCTGGGCCTGCGTGGCGTTGGCGCGGCCGTAACCGATGTTGTAGCCGATAGTGTCGTTGAACAGCACGGTATCTTGCGGCACCACCCCGATGGCACGGTGCACCGAGGCCTGCGTCACATCGCGGATGTCGTGGCCGTCAATGGTGATGCGCCCGCCGGTGACATCGTAGAACCGGAACAGCAGCCGGCCGATGGTGGACTTACCCGACCCCGTCGCGCCAACCAGCGCCAGCGTTTGACCGGCCGGCACAGTGAAGCTGACACCTTTGAGGATTTCGCGCTCGGGCGAGTAATGGAAGCGCACATCCTCGAATCGGATTTCGCCGGCGGTGACGATCAGATCGGGGGCGTTCGGCTTATCCTCGACCTCGGCGCTTTCGCCCATCAGGTCGAACAGCTGGCCCATGTCGACCAGCGCTTGCCGGATCTCGCGGTAGATGGTGCCTAGAAAGTTGAGCGGCTGGGTGACCTGAATCATGTAGGCATTGACCATGACGAAGTCGCCAACGGTCATCGTGCCATTTTGCACGCCCATCGCCGCCAGCACCATAACTACCGTCAGCCCCACCGTGATGATCAGCGACTGGCCGAAGTTCAGCGCCGCCAGCGAGACGCCGGTTTTTACTGCGGCGTCTTCGTATTTCTCCATCGCAGCGTCGTAACGGCCGACCTCGCGCCGTTCGGCATCGAAATACTTGACGGTCTCGAAATTGAGCAGGCTGTCGATGGCCTTCTGGTTGGCGTCATTGTCTTGTTCGTTCATGCGGCGGCGGATTTTCACGCGCCATTCGGTCGCCAGTAGGGTGTAGATCGCATAGGCGACGATGGTGGCCAGCACGACCAGCATGAACCAGATATCAAACAGGAACAGCAGTGTGCCGCACACCAGCACCAGCTCCAGCACTAACGGCCCCAGCGAGAAGACAAACATCCGCAGCAGGAAGTCGACACCCTTGATGCCGCGTTCCATGATGCGGCTCAGCCCGCCGGTTTTACGCGTGATGTGATAGCGCAGCGATAGCTGGTGCATATGGGCAAAGGCCTGCCCCGCGATCAGGCGCAGGCTGCGCTGACCAACGCGCGTGAACACCGCGTCACGCAGCTGTTGGAACGCGACGTTCATCAACCGTGCCATGCCATAGGCGACCGTCAGGCCAACGGCCCCAAGCGCCATCATGGCGGCGGGGCCGCTGACAACGCCCGACAGGCTATCGACAGCCCATTTGTAAAACAGCGGCGTTGCGACGGTGATAACCTTGGCCAGAACCAGAAACACCAGCGCCGCAATCACGCGGACCTTCACCCACGTCTGCCCTGCGGGCCAAAGGTAAGGCAGTACACGGGCCAAAATCCGGCGGCCGTTCACGGGTGTGGCCGAAGCATCGGTAATGGTGATCCGCGCCCTGTGCATGTCTGCCCTCGTCCATATTTCAGCGGCTTAGCCTAGACCGCTTTTGCTGCTAACGCCAGTGCGAAGGAAGGCGCGCTTACGGGATCGCAGGCAGATCGAAG
ACTTGGCCGGGGTAGATCAGGTCGGGGTTGCGAATACGATCGCGGTTCGCCTCGAACACCTGCACGTAGAGAATCCCGCGCCCATAACGTTCGCGCGCGATGGCCCACAGCGTGTTACCGGGCTGGACGGTGCGGCTGGCGATGCCGCTGACGCTGCCGGTCGCCTCGTCAGCAAGGGCGGCGGCCAGATCTTCGGCGCCTTCGCGCAGGAAGGGGGTCTCGACACGACCCATCACCGTGCCATCGGCGCCCAGCCAGTCGACCCGCAGGGTGTGGCGGCCCGCAGCGATGGTATCGTTTTCGAACGACCAGTCGCCGGTGTCGCCAACCGGCGCGTGGCCGACCAGCGTGTTGTCAATGTAGATAGCGACCTGCCCCACACCCGCCGCACGCCCCGACAACAGCACGCTGCCGCTGCTGCTGTACGAGATTGTCTGCAACAGCTGGGCGGGCGCATCGTCGGCGTTTTCGGGCGCGGGTGCGACCACGCGCACCCCGCCTTCGCCTGCGATCAGGACGGTTGCGGTATCGGGGGCCGCGTCGGGCGCAGCGTCGTCTGCGGCGGGTAGATCATCGGCTTGTGCGGGTGTCTGCGTGGGTGACTGCGCGGGGGCCAGAATGGCTTCGTTCTGCGACATGAGGGTGCCGGACTCATCGCTGCTCGACAGTTCGATCACGCGGGGCGCTTCGCTGGCCCCCAACGCAAGAACAGTGGCAAAGGTGCCGTCATCGCCTGCGACCGCTGTGGCGGTGTCGGCCCCGTCGACCAAAACACTCACCGTGCTGCCCGGCGCGGCCTGCCCTGCAACAACCGCAATTCCGTCGCCCGCAAGGCGGACGACATCAAGACGCGGTCCTGTGGAGGCGGCAATATCTTTGGGGGTGTCTTCATCTTTGGGGGTGTCTTCGGCGATTTCCTCGGGCGCGCTGGTGACCGCCTGATGATCGGCGGCAAGCGGTTGTGCGGCGGCAGTTGCGCCATCTGGCGTAGGCGCGACGCGTAAGCCCCAGATCACAAAGCCGGTCAATACGACCGCAGCCACCCCAAGCCCGCCGACGATTTTCGCTGTATTTGCCACGTGCCAACCCGAATCTGTCAAAAATGTGACCGCCATTTGACACAATCTTGTTGCGTTGACGAGTTCTGCGCAGCGTTGAGCTTGTCTTCGGGGCCAACTAAACCCTGAATGAGACTTACGGGAAACAGTTCAGAGGGGAAATCATGGTTGCGACGACAAAAGCGGTCTGCGTCTATTGCGGATCGCGCAATGGCAAGCTGGACCATTATGCCGAGCTGGCCACCGAAACCGGCGCAATGATCGCCCGCAATCATTGGGCGCTCATCTATGGCGCGGGCGATGTGGGGTTGATGGGGCTGGTCGCACGCGGCACGCAAGAGGCTGGCGGCGCGGCCGTGGGCGTCATCCCCACCCATTTGATGAAGCGCGAGGTCGGCCGCCGCGAGTTGGACCGCCTGATCATCACCGAGACGATGCACGAGCGCAAAAAGGTGATGTTCATGAACGCCGACGCGATCATCGTGCTGCCGGGCGGCGCAGGCACGCTGGACGAATATTTCGAGGTGCTGACATGGCGTCAGATCGGCCTGCACCAAAAGCCGATCTTCCTGCTGAATGTCGACGGCTTCTGGGACCCGCTGCTGCAGTTGCTGCGCAACATCGTCGATCAGGGCTTCGCCGAGCCCTCGCTGCTAGACTACACAACCGCGGTTAGCGATGTCGCCGCGATTGAAGCGGCGCTGCAGGTCAGCTTCGCGCCCTGATTGCGCCCCGCAGGCTGGCCCATGTGTAAATGGCAAGGGCCAGCCAGATCAGCGCAAAGGCGATGCCGTGCCAAATGGTCAACGATTCTCTGAAATATAGCGCAGCAACAACTAGTTGTATCGTGGGGTTCATGTAAGAGATAAGGCCCGTTGTCGCCATGCTGACCCGCTGCGCGCCATAGCCGAACAATATCAGCGGGATGGCGGTGATCGGCCC
GCTGAGCATCAGCAACCCCTGCTGTTGCCACCATTCTGGCGATAGCGGCACGTCCCAAAGGGTATGGCCGAAAACGACCAGATAAATCAGGGCCAGCGGCACCAGCGGCACCACCTCGGCAAAGGCGGAAACGAGGGGCGATACATCACGCAGCGCGCGTTTGGCCACGTTATAAAGAACGAATGTCGTTGCCATCCCCAGGCTGACCCACGGCGCGCGCGCCAGCCCCACCGTCAGAACCACGACACCCACAGCAGCAATACCGACAGCCAGAATTTGCCACGGGCTTAAACGCTCGCCAAAAAAGAAATAACCTGCGACAACGGCCATCAGCGGCAAGATATAATAGCCGATCGACGCCTCGGTCGCGTGGCCGTTGCTGACGGCATAGATAAACAGCAGCCAGTTGCCAGCAATCATCACGCCCGCAAAGCCGATGAGCCGTAGGCGGCGGGGGTTGCCTAGCACCATGCCCACCGTGCCAAGGCGGCCCTGCACATCAAGGACTAAGCCGTAAAACACGGCGGCCCAGAAAATCCGGTAACACAGGATTTCCAGTGGCGGGGTGCCGCTGACCAATCCGAAGTAGATCGAGATGAACCCCCAGATCAGGTAGGTGCCCACAAGGGCCGCCACGCCCGAAGATTGGTTCAGTTCACGCATCATCCACCCATGAAAAAACCCGGCGCGCAGACGCCGGGTTTGCCGTAAAACCTTGGCTTAGAGGGCCGAGGCGACCTTTTCCCAGTCGACAAGGTTGTCGAGGAAGTTGGTCAGGTAGGCGGGACGCTTGTTGCGGTAGTCGATGTAGTACGAGTGTTCCCACACGTCGACGCCCAGCAGCGCGGTTTGGCCGAAGCACAGCGGGTTCACGCCGTTTTCGGTCTTGGTGACCTTCAGGCTGCCGTCTTTGTCCTTGACCAGCCAAGCCCAGCCCGAACCGAACTGGCCGGCGCCTGCGGCCGAGAAGTCTTCCTTGAACTTGTCGACCGAGCCGAACGACTCGACCAGCGCTTTTTCAAGCTCGGTCGGGATGGCGACGGTGGCGGGGGTCATCCATTCCCAGAACTTGTTGTGGTTCCAAAGCTGGCTGATGTTGTTGAAGATGCCGCTTTGGGCAACAGCACTGGCGTTGTAGGTGCCGGTGATGATGTCTTCCAGCGACTTGCCTTCCCATTCGGTACCGGCGATCAGCTTGTTGCCGTTATCGACATAGGCCTTGTGGTGAATGTCGTGGTGGAATTCCAACGTCTCGCGCGACATGCCTTTGGCGGCAAGCGCATCGTGGGCGTAGGGAAGTTCCGGAAGGGTGAAAGCCATGGGACTAATCCTTTCAATGCCTTGGTTGGTACGGTATGCGGCCCTTCCCCCCTAAAGGTCAAGGGACCGCGATTGTTCATGGGCGCACAAACCTATGGACGCGCCTGAACAATAATCATTCCTGCGGCAGAAGAACACTAAAAGTGCTGCCCTTTTGTGGTGCGCTGGAAATCGCCAGTCGGCCGCGATGGCGATTGATGATGTGCTTGACGATCGCCAGCCCCAGCCCGCTGCCCCCCACCGCGCGCGAGCGGTGGGCGTCGACGCGGTAGAACCGTTCGGTCAGGCGCGGGATATGGTGGGATTCGATCCCCAGGCCTTGGTCCTGCACATCCAGCCGGACGGCGCAGCCGCGCAGCGCAGCGTCGAACCGCGCGGGGTGCAGCGCGATCGTCACCGGCTTGTCGGGGCCGCCGTACTTCAGCGCGTTCTGCACCAGATTGCGCAGCACCTGCAGCAACTGGCCGCTATCGCCGGGGACAACCCAGTTCCCTTCGGGCACGTCCAGCACTAGGTGGTTGTTTCCTTGGGCGGCCAGCGGCTCTAGCGCGGCGATGGTTTCTTCGGCCAGCGCCTTCAGATCGACGGGCGCGGTCGGACGGACGCGCTCGTCCACCTCGACACGGCTGAGCGAGAGGAGACCGTCGACCAGTTGTACCATGCGACTGGCCTCACGTTCC
ATTATATCCAGAAAGCGGGTGCGGACGGCAAGGTCGTCACGGGCGGGGCCGCGCAGGGTTTCGATGAATCCCATGATGGCGGTCAGCGGAGTGCGCAATTCGTGGCTGACGTTCGCCACGAAATCGCGCCGCACCTGATTGGCCTCTTCGACCGCAGTCAGGTCAGTGAAGGTGACCAGAATAGCCCGTGTCGCGCCGCCGATGGGGGCCGCGCGCATCTGCCAGACGGAATCGCGGCTGCTGCCGCGCTGGGTGTGGCGGGCCTTGGCCTCGCGCGCGCCGCCCACGACACGTTCGATCGCCGCGCTGACGGCGGGTTGGCGCAACACCGCCGCGTGCTGCAGCCCCACCATATCCAAGCCAATTAGATCGCGCGCCGCGATATTTTGGGCCGCAACGACCCCTTCCGCCCCGATCACCAAAGCGGGGAAATCCATCGCTTCGATCAAGGCCGCGCCATAGTCCGACATAAGCACCTCCGTGGTGCCCCGAGGCCTACGCATTGAATGCGAAAAATTTATGACAGCGCGCCACGCAATGTAAACTCCTGTTGAAGGGCGCGGACGTGGCCGGTCAGCCATGGTTGGCGGCTGCGGCGCAGACGTTCGGCCTGAATGATAGCGATCAGGCGGGATTCGGTGGCATCCAGATCATCATTGACCAGCACGTAATCGTATTCGGCCCAGTGGCTGATCTCGTCGGCGCTTTCGGCCATGCGGCGGGCGATAATCTCGTCGCTGTCTTGGGCGCGGGTGCGCAGGCGCTTTTCGAGCTCGGCGATCGAAGGCGGCAGGATGAAGATCGAGATGACGGCTGCGCCCAGCGGCGAATTGCGGATCTGTTGGCCGCCTTGCCAATCGATGTCGAACAAGGTGTCGCGCCCCTCGGCCATCGCCTGTTCGACCGGGCCGCGCGGGCTGCCGTAGTAGTTTCCGAACACCTCGGCATGTTCCAGCATCTGGTTTTCGGCGATCTGCTGGCCAAAGGTGTCATGCGTCATGAAGTGGTAGTGCTGGCCGTCAACCTCGCCTGCGCGGGGGGCGCGGGTGGTCGCCGACACCGAAAAGCGCAGTGTTTCATCCCACGCCATCAGGCGGCGCGACAGGGTCGATTTCCCCGCGCCTGATGGCGACGACAGGATGATCAGAAGGCCTTGGCGTTGAGGCATGGCGGCGGGCGATATCAT
Protein sequences of DBSCAN-SWA_4 >NZ_CP019937|1169695:1176803|1173383_1174280_-|WP_085787280.1|DBSCAN-SWA MRELNQSSGVAALVGTYLIWGFISIYFGLVSGTPPLEILCYRIFWAAVFYGLVLDVQGRLGTVGMVLGNPRRLRLIGFAGVMIAGNWLLFIYAVSNGHATEASIGYYILPLMAVVAGYFFFGERLSPWQILAVGIAAVGVVVLTVGLARAPWVSLGMATTFVLYNVAKRALRDVSPLVSAFAEVVPLVPLALIYLVVFGHTLWDVPLSPEWWQQQGLLMLSGPITAIPLILFGYGAQRVSMATTGLISYMNPTIQLVVAALYFRESLTIWHGIAFALIWLALAIYTWASLRGAIRARS >NZ_CP019937|1169695:1176803|1171592_1172696_-|WP_157115636.1|DBSCAN-SWA MANTAKIVGGLGVAAVVLTGFVIWGLRVAPTPDGATAAAQPLAADHQAVTSAPEEIAEDTPKDEDTPKDIAASTGPRLDVVRLAGDGIAVVAGQAAPGSTVSVLVDGADTATAVAGDDGTFATVLALGASEAPRVIELSSSDESGTLMSQNEAILAPAQSPTQTPAQADDLPAADDAAPDAAPDTATVLIAGEGGVRVVAPAPENADDAPAQLLQTISYSSSGSVLLSGRAAGVGQVAIYIDNTLVGHAPVGDTGDWSFENDTIAAGRHTLRVDWLGADGTVMGRVETPFLREGAEDLAAALADEATGSVSGIASRTVQPGNTLWAIARERYGRGILYVQVFEANRDRIRNPDLIYPGQVFDLPAIP >NZ_CP019937|1169695:1176803|1176134_1176803_-|WP_157115637.1|DBSCAN-SWA MISPAAMPQRQGLLIILSSPSGAGKSTLSRRLMAWDETLRFSVSATTRAPRAGEVDGQHYHFMTHDTFGQQIAENQMLEHAEVFGNYYGSPRGPVEQAMAEGRDTLFDIDWQGGQQIRNSPLGAAVISIFILPPSIAELEKRLRTRAQDSDEIIARRMAESADEISHWAEYDYVLVNDDLDATESRLIAIIQAERLRRSRQPWLTGHVRALQQEFTLRGALS >NZ_CP019937|1169695:1176803|1169695_1171516_-|WP_085786092.1|DBSCAN-SWA MHRARITITDASATPVNGRRILARVLPYLWPAGQTWVKVRVIAALVFLVLAKVITVATPLFYKWAVDSLSGVVSGPAAMMALGAVGLTVAYGMARLMNVAFQQLRDAVFTRVGQRSLRLIAGQAFAHMHQLSLRYHITRKTGGLSRIMERGIKGVDFLLRMFVFSLGPLVLELVLVCGTLLFLFDIWFMLVVLATIVAYAIYTLLATEWRVKIRRRMNEQDNDANQKAIDSLLNFETVKYFDAERREVGRYDAAMEKYEDAAVKTGVSLAALNFGQSLIITVGLTVVMVLAAMGVQNGTMTVGDFVMVNAYMIQVTQPLNFLGTIYREIRQALVDMGQLFDLMGESAEVEDKPNAPDLIVTAGEIRFEDVRFHYSPEREILKGVSFTVPAGQTLALVGATGSGKSTIGRLLFRFYDVTGGRITIDGHDIRDVTQASVHRAIGVVPQDTVLFNDTIGYNIGYGRANATQAQIEDAARAAQIHDFIASLPDGYQTQVGERGLKLSGGEKQRVGIARTLLKDAPLLLLDEATSALDTRTELGIQDVLQHAGANRTTISIAHRLSTIADADQIIVLDQGQVIEQGSHADLLARGGRYAQLWSHQQAIAPV >NZ_CP019937|1169695:1176803|1174337_1174937_-|WP_085786095.1|DBSCAN-SWA 
MAFTLPELPYAHDALAAKGMSRETLEFHHDIHHKAYVDNGNKLIAGTEWEGKSLEDIITGTYNASAVAQSGIFNNISQLWNHNKFWEWMTPATVAIPTELEKALVESFGSVDKFKEDFSAAGAGQFGSGWAWLVKDKDGSLKVTKTENGVNPLCFGQTALLGVDVWEHSYYIDYRNKRPAYLTNFLDNLVDWEKVASAL >NZ_CP019937|1169695:1176803|1175052_1176087_-|WP_085786096.1|DBSCAN-SWA MSDYGAALIEAMDFPALVIGAEGVVAAQNIAARDLIGLDMVGLQHAAVLRQPAVSAAIERVVGGAREAKARHTQRGSSRDSVWQMRAAPIGGATRAILVTFTDLTAVEEANQVRRDFVANVSHELRTPLTAIMGFIETLRGPARDDLAVRTRFLDIMEREASRMVQLVDGLLSLSRVEVDERVRPTAPVDLKALAEETIAALEPLAAQGNNHLVLDVPEGNWVVPGDSGQLLQVLRNLVQNALKYGGPDKPVTIALHPARFDAALRGCAVRLDVQDQGLGIESHHIPRLTERFYRVDAHRSRAVGGSGLGLAIVKHIINRHRGRLAISSAPQKGSTFSVLLPQE >NZ_CP019937|1169695:1176803|1172839_1173400_+|WP_085786094.1|DBSCAN-SWA MVATTKAVCVYCGSRNGKLDHYAELATETGAMIARNHWALIYGAGDVGLMGLVARGTQEAGGAAVGVIPTHLMKREVGRRELDRLIITETMHERKKVMFMNADAIIVLPGGAGTLDEYFEVLTWRQIGLHQKPIFLLNVDGFWDPLLQLLRNIVDQGFAEPSLLDYTTAVSDVAAIEAALQVSFAP |
7 | Bacillus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a specific phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP019939_1 | 19499-19669 | Orphan |
NA
Consensus repeat of NZ_CP019939_1
|
3 spacers
spacers of NZ_CP019939_1
>1.1|19522|21|NZ_CP019939|CRISPRCasFinder ATGGTGTGTGAAATGACGAAA >1.2|19566|24|NZ_CP019939|CRISPRCasFinder CGGCTCGAGTTATCCACAGCCTCC >1.3|19613|34|NZ_CP019939|CRISPRCasFinder TTGACGAGTGGTGTGTGAAATGACGATAGTCTTG |
RT |
CRISPR arrays and Neighbor proteins around NZ_CP019939_1
The CRISPR arrays of NZ_CP019939_1 >merge|NZ_CP019939|1|19499-19669|CRISPRCasFinder CCTGGTGTGTGAAATGACGAAAGATGGTGTGTGAAATGACGAAAGATGGTGTGTGAAATGACGATAGCGGCTCGAGTTATCCACAGCCTCCTTGGGTGTGTGAAATGACGAGCCTTGACGAGTGGTGTGTGAAATGACGATAGTCTTGTACGGTGTGTGAAATGACGAATT >NZ_CP019939|1|1|19499-19669|CRISPRCasFinder CCTGGTGTGTGAAATGACGAAAG ATGGTGTGTGAAATGACGAAA GATGGTGTGTGAAATGACGATAG CGGCTCGAGTTATCCACAGCCTCC TTGGGTGTGTGAAATGACGAGCC TTGACGAGTGGTGTGTGAAATGACGATAGTCTTG TACGGTGTGTGAAATGACGAATT
>NZ_CP019939.1|WP_085787608.1|18015_19254_+|hypothetical-protein MFYDDRIILSYGRYLRNAQFRILHPHVTFGSYEQLIKEQLKKLPFLSHKETLLVRQMLRGGHDYMLCLPASSTHHHRTAGGLFYHSLETAGLAVDFFNSEFSKAVPSEAGDLAAFIAGFVHDLGKVRTMFKVYPANVQLQDLGFGEIEQPETDLPSWDPMAQSLWEWSQSFESPVTHLSLRYNQRPVIGHDTARGNGYWRQVVSDCLLESVSKRDEAYAEQLEGYLEGRLHRSPFVSAVKKADRRSVERDVNPLHRMEPKRSDLHAVRRFMEFAALSSWNNQYSMFIMADIWLGDQDLAIRVPLFRSRWRHMQTFREYLYGEDRYGAAYVDNGTPDIFYGILENHGVLRRVLPGMPDFSPIPSTYKPAPAFNATVVFADGDPAGGGREDLCYFPGGINIACEGLPVVRVILE >NZ_CP019939.1|WP_157115760.1|17494_18004_+|hypothetical-protein MSEEHPQESCSHEVRFHDFDSDWEPELLRSIPAMGYDLEGWARYWQDRAVEALKAYRKMDAGAEALNLGIVVRIREDTLGPRIYWVKFQGKARYRMGGDKFTPTEQIRMSGKYKYSDRIFARFPDAIRRELLLCEANFAWIRYRTDRLAQLRNLCRAHTGTYQLRANID >NZ_CP019939.1|WP_085787606.1|16009_16654_-|transcriptional-regulator MMRTRKIGYHGSVRVARGTKTAKSKNADPIVAECLKLIREDSGYERDEFAELLGVQHKTYRNYEGCIYPLPLKVVKTIREKLGYDLADPDLTSDAIITKIAEQRHDVAAAPDLAATEQVAKGVSCPQRIRTCLQAFRQELIGVQSKRKHDIRDAVFVGAAALFAFCLVVLRTEPQNIRLESIYTLMLSVSFLVAASIVPFQAIHMIQAAYRSRR >NZ_CP019939.1|WP_085787605.1|13058_15920_+|hypothetical-protein 
MQTTDPKRAVLQINQRLSLRKPQAEALRRLDDIVDLVGPAKEADIDMARAAVREVYGDLADATFEEFERDFPSVCFALATGVGKTRLMGAFISYLYMIGKSRNFFVLAPNLTIYEKLLSDFQPSSPKYVFRGIEVFAHNAPLIVNAENYEEGRGVRGTDLFGQEGAIINIFNVSKINSEARGGNAPRIKRLQEYIGESYFSYLSELDDLVLLMDEAHRYRGSAGARAIAELKPILGLEVTATPKTVGARSQPFKNVVYRYDLPDAMEDGYVKEPAVGTRANFNPKTVDEDTLERIKLEDGIHYHEHVKVALETYARQNDVKVVRPFMLVVTQDTTHARQVNEFVQSDEFFGGRYKGRVAEIHSKLTGEESDENAQRLLNIEKAGDTDIVIHVNKLKEGWDVSNLFTIVPLRASASDILTEQTLGRGLRLPYGKRTGVEVVDTLTVIAHERFNELIEKAKEENGVTRKLKQVTIGEGGDVPPSKPVSVSAPSILDQMLAQAEAKTEPTVVVADDEKASDAPVQLVAKPTSAAPTQAPFSFSTPEELKVARTVLSVVIPQLSKEVSSIRDLNDPKVIERIAEAAIAAQKPEEGFLPSITKEKAVAVAQELCKNFVERTLAIPALTITPQQQVSFGFKRFDLDMKSWNFQPLSNELMIQALRTEKTSRISSEDKGDTANRLENYIVARLIDYPEIDYDAHAAILYDLAGQAVTQTRNRFADDDEQTRSVVRGHAKAMAESIFSQIKQNMWREHTNYRVSLTSAFGELRPQTFDTAGTSFIRDFKTPPDQKQEIRKYIFTGFVKGCYQYAKFDSNPERKLAIILEKDSSVRLWMKPGPNQFKIFDNHGAPYQPDFVVETDTGKLIIEVKRQSEMTATEVLRKADAASLWCHIATQAGAKSGEKPWHYLLVPESDVEENFTVSGLEAMHTRAPDTDLLSRYIFDEPSVSGEKKSALCV >NZ_CP019939.1|WP_085787604.1|11384_13055_+|site-specific-DNA-methyltransferase MAAKTKLELTWIGKNNRPRLEPRILIEEPEFSHHASTRREGDIFDNMLIHGDNLLALKALETDPAVRGKVKCIFIDPPYNTGSAFEHYDDGLEHSLWLTMMRDRLEILRNLLSEDGSIWMTIDDNEVHYLKVMCDEIFGRSNFFGSIIWQHSVQGKNDAKTVSLHHNYVLAYRKTEAFSRNLLPRKPEHNKNYNNPDKDPKGPWRAGDVRSPNYRENLMFDVTTPSGKIIPPPEKGWRWSKETFASKVETGEITFLNEDTRVLRKIYLCDQEGRVPESIWFGEEAGTTREATNDLRTLLGLGDATFATPKPEKLLERIIQIGTQPGDLVIDSFAGSGTTGAVAHKMGRRWIMVELGDHAKTHVAPRLQKVINGTDKGGVTEATNWRGGGGYRFFRLAPSLLQKDVWGNWVISKDYNAEMLAEAMCKHFNYVYAPSTEAYWMHGQASENAFIYVTTASLTIEQLRAISDEVGEDRSLLICCMAYEAQGESLSNLTLKKIPRVVLDRCEWGQDDYSLKINALPMAEDEPDDIEDTPAPKKSAKAKAADTPDLFGAEEN >NZ_CP019939.1|WP_085787609.1|8523_11385_+|hypothetical-protein 
MLFQFSEYQSKFFAHHLTSEGIQEEDALTQSLSAAKVDLNPHQVDAATFALRSPLSKGVLLADEVGLGKTIEAALVISQRWWERERNILLIVPASLRKQWATELREKFSLPSFILDAKRVKDLENEGTPHPVGRGEGIIIVSYEYAARIADTLRRTPWSLVVFDEAHKLRNVYKAAENSRASVLRKALAGRQKLLLTATPLQNNLMELYGLISIIDETYFGSEQAFRSEFGGREGLTSQALLAKRLEPICKRTLRRQVQRAGLINYTNRLPKTFDFTPGRLETDLYEKVSEYLQDPSTIALGQNGRHLVTLMLRKILGSSSFAVMQTLDKMIRRLEAKRVVGADTLDDLDGFSDEAEDWREADSGSEADAIEDDTEDAERVDPAQLEAEIKRLTQYRDLAASISDNAKGKALLDCLPNVLDEIVSKGGQRKAVIFTESVRTQTYLRDLLEQSGFEGQTVVLNGSNSDADSKALYKAWLDKHGDTNVVSGSKTADMKAAIVDAFRNDRTILIATESGAEGINLQFCSLLINYDLPWNPQRVEQRIGRCHRYGQKIDVTVINFLNRKNHAEARIHQLLEQKFKLFEGVFGSSDEVLGVIESGVDIERRILDIVQSCRTTDQIDAAFDRLQEEFSVEIDEAKKNVRDQLLAEMDDKVIERLLGRKDAVHSAIGDFKRALLGVARAELPEARFHDDHAQRFDYGGETWSSEWPEADERGWRFFRLGDEGLADQLVQKAKSRDLVPAMLRLDYSAYQGNMGDVAQLRGQSGWMRVARVRLKTPAKVYDELVFAAFSDGGTAIDPETASRMLYVPATTEGLGADTLPESDLTQTLNARQTAIIGMAQDRLSSFLNEEEERLDAWREDAKVSFDQQIKALNKEATEKKKLARATIGLEEKVTLQREAKALQRQVDDLQHQLYTRLREIDAERERMLDDIADQLNLTPEMTTLFTVRWTLA >NZ_CP019939.1|WP_085787603.1|7891_8467_+|hypothetical-protein MPVVRLNDPTFVDLKCISTWMGTETPSETIMLLVREKMAALDLERDIEISIDPTSEISSFMEFKSTPGLSFTKVLSAEVDHKKIDKPNWAAILLGAVSSLKAKGLSGDHLAEAIQIPTKTACYEEEGFRYYPDLGISIQGQSAQEAWKEVSRIADKHGIAVQVTFQWRDNEKAQYPGQKGVLRAGRSSRRP >NZ_CP019939.1|WP_085787601.1|5772_7101_+|hypothetical-protein MPGFYLPSENEFDPSTDDFVLPAISKERKYKHFDLPLVDRELSFDFSVEDKPHRFLPLLGFTDVNRRYVRNKDGAREVKVKERPIRFASHEDAAYLQAYAGHLNRMYERALWRDGTSDSVLAYRRGGGTNIHHAKALFDEIKSRGDCTVFAMDISGFFDCLDHTLLRDEIADLIGETRLEGHHASVWKNVTRYSWVETEDLDKLLGRKRNGHGRICSPSDFSDHVRGRKDGLIRKHDQTFGIPQGTPVSGLYANIYLRTFDREMIAWCSRAGGSYRRYSDDIAVTLPLGAKVHHVVAVVEKMLADFCLSMSIDKTDTADFKDGLLASATPIQYLGFTFDGQKTKIRPSSLDAYRGKMRRGIHAKMIAAKAKNVPSFEVFKRESLARYTHLGKRRNFLRYAYKAADIMGCPEIKDQVSRHVTWFNRAWEREAIRVFGGLVTTT >NZ_CP019939.1|WP_157115759.1|4312_4594_+|hypothetical-protein MFKMIGSFLNLNWWGTPDHTVQSFSTIPTSGAHTMDELSTLDISGGTFGGIHSINPASGLPTVAGDGTPDIAGNSWGTLDMDHSSGGGQWGQF >NZ_CP019939.1|WP_085787599.1|3707_3998_+|helix-turn-helix-domain-containing-protein 
MSAFDSIMQGLEEARAFSTGQKAGAAVHEISLPEIDVARIRAQTGLSQTEFARSIGVAKGTLLNWEQKRRMPQGPAQVLLALIERKPSLVQDLLSG |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP019939_1 | 1.1|19522|21|NZ_CP019939|CRISPRCasFinder | 19522-19542 | 21 | NZ_CP019939 | Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed2, complete sequence | 19522-19542 | 0 | 1.0 |
NZ_CP019939_1 | 1.1|19522|21|NZ_CP019939|CRISPRCasFinder | 19522-19542 | 21 | NZ_CP010686 | Phaeobacter piscinae strain P14 plasmid pP14_e, complete sequence | 23463-23483 | 0 | 1.0 |
NZ_CP019939_1 | 1.1|19522|21|NZ_CP019939|CRISPRCasFinder | 19522-19542 | 21 | NZ_CP010686 | Phaeobacter piscinae strain P14 plasmid pP14_e, complete sequence | 23485-23505 | 0 | 1.0 |
NZ_CP019939_1 | 1.2|19566|24|NZ_CP019939|CRISPRCasFinder | 19566-19589 | 24 | NZ_CP019939 | Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed2, complete sequence | 19566-19589 | 0 | 1.0 |
NZ_CP019939_1 | 1.3|19613|34|NZ_CP019939|CRISPRCasFinder | 19613-19646 | 34 | NZ_CP019939 | Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed2, complete sequence | 19613-19646 | 0 | 1.0 |
NZ_CP019939_1 | 1.1|19522|21|NZ_CP019939|CRISPRCasFinder | 19522-19542 | 21 | NZ_CP019939 | Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed2, complete sequence | 19500-19520 | 1 | 0.952 |
NZ_CP019939_1 | 1.1|19522|21|NZ_CP019939|CRISPRCasFinder | 19522-19542 | 21 | NZ_CP019939 | Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed2, complete sequence | 19544-19564 | 1 | 0.952 |
NZ_CP019939_1 | 1.1|19522|21|NZ_CP019939|CRISPRCasFinder | 19522-19542 | 21 | NZ_CP010686 | Phaeobacter piscinae strain P14 plasmid pP14_e, complete sequence | 23507-23527 | 1 | 0.952 |
NZ_CP019939_1 | 1.1|19522|21|NZ_CP019939|CRISPRCasFinder | 19522-19542 | 21 | NZ_CP010686 | Phaeobacter piscinae strain P14 plasmid pP14_e, complete sequence | 23611-23631 | 1 | 0.952 |
NZ_CP019939_1 | 1.1|19522|21|NZ_CP019939|CRISPRCasFinder | 19522-19542 | 21 | NZ_CP054605 | Sulfitobacter pseudonitzschiae strain H46 plasmid unnamed6, complete sequence | 9196-9216 | 1 | 0.952 |
NZ_CP019939_1 | 1.1|19522|21|NZ_CP019939|CRISPRCasFinder | 19522-19542 | 21 | NZ_CP045396 | Roseovarius sp. THAF27 plasmid pTHAF27_c, complete sequence | 14000-14020 | 1 | 0.952 |
NZ_CP019939_1 | 1.1|19522|21|NZ_CP019939|CRISPRCasFinder | 19522-19542 | 21 | NZ_CP021433 | Yoonia vestfoldensis strain SMR4r plasmid pSMR4r-2, complete sequence | 25408-25428 | 1 | 0.952 |
NZ_CP019939_1 | 1.1|19522|21|NZ_CP019939|CRISPRCasFinder | 19522-19542 | 21 | NZ_CP020476 | Roseovarius mucosus strain SMR3 plasmid pSMR3-2, complete sequence | 26178-26198 | 1 | 0.952 |
NZ_CP019939_1 | 1.1|19522|21|NZ_CP019939|CRISPRCasFinder | 19522-19542 | 21 | NZ_CP010686 | Phaeobacter piscinae strain P14 plasmid pP14_e, complete sequence | 23441-23461 | 2 | 0.905 |
NZ_CP019939_1 | 1.1|19522|21|NZ_CP019939|CRISPRCasFinder | 19522-19542 | 21 | NZ_CP054605 | Sulfitobacter pseudonitzschiae strain H46 plasmid unnamed6, complete sequence | 9321-9341 | 2 | 0.905 |
NZ_CP019939_1 | 1.3|19613|34|NZ_CP019939|CRISPRCasFinder | 19613-19646 | 34 | NZ_CP010686 | Phaeobacter piscinae strain P14 plasmid pP14_e, complete sequence | 23576-23609 | 4 | 0.882 |
NZ_CP019939_1 | 1.3|19613|34|NZ_CP019939|CRISPRCasFinder | 19613-19646 | 34 | NZ_CP054605 | Sulfitobacter pseudonitzschiae strain H46 plasmid unnamed6, complete sequence | 9218-9251 | 6 | 0.824 |
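
The Mismatch and Identity columns in the table above appear to be related by identity = (spacer length - mismatches) / spacer length, rounded to three decimals: 1 mismatch over 21 nt gives 0.952, and 4 mismatches over 34 nt gives 0.882. The Python sketch below reproduces those values under that assumption. The formula is inferred from the table values rather than taken from the tool's documentation, and the function names `count_mismatches` and `identity` are hypothetical helpers, not part of CRISPRCasFinder or this database.

```python
def count_mismatches(spacer: str, protospacer: str) -> int:
    """Count positions where two equal-length sequences differ (case-insensitive)."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length")
    return sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))


def identity(spacer_length: int, mismatches: int) -> float:
    """Fraction of matching positions, rounded to three decimals.

    Assumed formula (inferred from the table): (length - mismatches) / length.
    """
    return round((spacer_length - mismatches) / spacer_length, 3)


# Spacer 1.1 against the protospacer at 19544-19564 on NZ_CP019939.
spacer = "atggtgtgtgaaatgacgaaa"
protospacer = "atggtgtgtgaaatgacgata"
mm = count_mismatches(spacer, protospacer)
print(mm, identity(len(spacer), mm))
```

An ungapped, position-by-position comparison is used here because every protospacer in the table has the same length as its spacer; gapped alignments would need a different approach.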
1. spacer 1.1|19522|21|NZ_CP019939|CRISPRCasFinder matches to NZ_CP019939 (Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed2, complete sequence) position: , mismatch: 0, identity: 1.0
atggtgtgtgaaatgacgaaa CRISPR spacer atggtgtgtgaaatgacgaaa Protospacer *********************
2. spacer 1.1|19522|21|NZ_CP019939|CRISPRCasFinder matches to NZ_CP010686 (Phaeobacter piscinae strain P14 plasmid pP14_e, complete sequence) position: , mismatch: 0, identity: 1.0
atggtgtgtgaaatgacgaaa CRISPR spacer atggtgtgtgaaatgacgaaa Protospacer *********************
3. spacer 1.1|19522|21|NZ_CP019939|CRISPRCasFinder matches to NZ_CP010686 (Phaeobacter piscinae strain P14 plasmid pP14_e, complete sequence) position: , mismatch: 0, identity: 1.0
atggtgtgtgaaatgacgaaa CRISPR spacer atggtgtgtgaaatgacgaaa Protospacer *********************
4. spacer 1.2|19566|24|NZ_CP019939|CRISPRCasFinder matches to NZ_CP019939 (Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed2, complete sequence) position: , mismatch: 0, identity: 1.0
cggctcgagttatccacagcctcc CRISPR spacer cggctcgagttatccacagcctcc Protospacer ************************
5. spacer 1.3|19613|34|NZ_CP019939|CRISPRCasFinder matches to NZ_CP019939 (Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed2, complete sequence) position: , mismatch: 0, identity: 1.0
ttgacgagtggtgtgtgaaatgacgatagtcttg CRISPR spacer ttgacgagtggtgtgtgaaatgacgatagtcttg Protospacer **********************************
6. spacer 1.1|19522|21|NZ_CP019939|CRISPRCasFinder matches to NZ_CP019939 (Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed2, complete sequence) position: , mismatch: 1, identity: 0.952
atggtgtgtgaaatgacgaaa CRISPR spacer ctggtgtgtgaaatgacgaaa Protospacer ********************
7. spacer 1.1|19522|21|NZ_CP019939|CRISPRCasFinder matches to NZ_CP019939 (Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed2, complete sequence) position: , mismatch: 1, identity: 0.952
atggtgtgtgaaatgacgaaa CRISPR spacer atggtgtgtgaaatgacgata Protospacer ******************* *
8. spacer 1.1|19522|21|NZ_CP019939|CRISPRCasFinder matches to NZ_CP010686 (Phaeobacter piscinae strain P14 plasmid pP14_e, complete sequence) position: , mismatch: 1, identity: 0.952
atggtgtgtgaaatgacgaaa CRISPR spacer atggtgtgtgaaatgacgaat Protospacer ********************
9. spacer 1.1|19522|21|NZ_CP019939|CRISPRCasFinder matches to NZ_CP010686 (Phaeobacter piscinae strain P14 plasmid pP14_e, complete sequence) position: , mismatch: 1, identity: 0.952
atggtgtgtgaaatgacgaaa CRISPR spacer atggtgtgtgaaatgacgaat Protospacer ********************
10. spacer 1.1|19522|21|NZ_CP019939|CRISPRCasFinder matches to NZ_CP054605 (Sulfitobacter pseudonitzschiae strain H46 plasmid unnamed6, complete sequence) position: , mismatch: 1, identity: 0.952
atggtgtgtgaaatgacgaaa CRISPR spacer atggtgtgtgaaatgacgaat Protospacer ********************
11. spacer 1.1|19522|21|NZ_CP019939|CRISPRCasFinder matches to NZ_CP045396 (Roseovarius sp. THAF27 plasmid pTHAF27_c, complete sequence) position: , mismatch: 1, identity: 0.952
atggtgtgtgaaatgacgaaa CRISPR spacer atggtgtgtgaaatcacgaaa Protospacer ************** ******
12. spacer 1.1|19522|21|NZ_CP019939|CRISPRCasFinder matches to NZ_CP021433 (Yoonia vestfoldensis strain SMR4r plasmid pSMR4r-2, complete sequence) position: , mismatch: 1, identity: 0.952
atggtgtgtgaaatgacgaaa CRISPR spacer atggtgtgtgaaattacgaaa Protospacer ************** ******
13. spacer 1.1|19522|21|NZ_CP019939|CRISPRCasFinder matches to NZ_CP020476 (Roseovarius mucosus strain SMR3 plasmid pSMR3-2, complete sequence) position: , mismatch: 1, identity: 0.952
atggtgtgtgaaatgacgaaa CRISPR spacer atggtgtgtgaaattacgaaa Protospacer ************** ******
14. spacer 1.1|19522|21|NZ_CP019939|CRISPRCasFinder matches to NZ_CP010686 (Phaeobacter piscinae strain P14 plasmid pP14_e, complete sequence) position: , mismatch: 2, identity: 0.905
atggtgtgtgaaatgacgaaa CRISPR spacer tcggtgtgtgaaatgacgaaa Protospacer .*******************
15. spacer 1.1|19522|21|NZ_CP019939|CRISPRCasFinder matches to NZ_CP054605 (Sulfitobacter pseudonitzschiae strain H46 plasmid unnamed6, complete sequence) position: , mismatch: 2, identity: 0.905
atggtgtgtgaaatgacgaaa CRISPR spacer ttggtgtgtgaaatgacgaat Protospacer *******************
16. spacer 1.3|19613|34|NZ_CP019939|CRISPRCasFinder matches to NZ_CP010686 (Phaeobacter piscinae strain P14 plasmid pP14_e, complete sequence) position: , mismatch: 4, identity: 0.882
ttgacgagtggtgtgtgaaatgacgatagtcttg CRISPR spacer ttgacgagtggtgtgtgaaatgacgatagtagct Protospacer ****************************** .
17. spacer 1.3|19613|34|NZ_CP019939|CRISPRCasFinder matches to NZ_CP054605 (Sulfitobacter pseudonitzschiae strain H46 plasmid unnamed6, complete sequence) position: , mismatch: 6, identity: 0.824
ttgacgagtggtgtgtgaaatgacgatagtcttg CRISPR spacer ttgacgatcggtgtgtgaaatgacgatagtgagt Protospacer ******* .*********************
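The Mismatch and Identity values in the listing above appear to be related by identity = (length - mismatches) / length, rounded to three decimals. This rule is inferred from the listed numbers and is not stated by the source. A minimal Python sketch under that assumption, for ungapped spacer/protospacer pairs of equal length (the function names are hypothetical, not part of any tool named here):

```python
def count_mismatches(spacer: str, protospacer: str) -> int:
    """Count positions where two equal-length, ungapped sequences differ."""
    if not spacer or len(spacer) != len(protospacer):
        raise ValueError("sequences must be non-empty and of equal length")
    return sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))


def identity(spacer: str, protospacer: str) -> float:
    """Fraction of matching positions, rounded to three decimals (assumed rule)."""
    mismatches = count_mismatches(spacer, protospacer)
    return round((len(spacer) - mismatches) / len(spacer), 3)


# Entry 16 above: spacer 1.3 against its protospacer on NZ_CP010686.
spacer = "ttgacgagtggtgtgtgaaatgacgatagtcttg"
protospacer = "ttgacgagtggtgtgtgaaatgacgatagtagct"
print(count_mismatches(spacer, protospacer), identity(spacer, protospacer))
```

For entry 16 this yields 4 mismatches and an identity of 0.882, which agrees with the listed values.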
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP019940_1 | 6755-6845 | Orphan |
NA
Consensus repeat of NZ_CP019940_1
|
1 spacers
spacers of NZ_CP019940_1
>1.1|6778|45|NZ_CP019940|CRISPRCasFinder TGTGACAGCCAGCCGAGCGGAGGTAGAGGACTCTGGTGTCCTATG |
CRISPR arrays and Neighbor proteins around NZ_CP019940_1
The CRISPR arrays of NZ_CP019940_1 >merge|NZ_CP019940|1|6755-6845|CRISPRCasFinder CTTAGGCCATTTATGTCCAAGAATGTGACAGCCAGCCGAGCGGAGGTAGAGGACTCTGGTGTCCTATGCTTAGGCCATTTATGTCCAAAAA >NZ_CP019940|1|1|6755-6845|CRISPRCasFinder CTTAGGCCATTTATGTCCAAGAA TGTGACAGCCAGCCGAGCGGAGGTAGAGGACTCTGGTGTCCTATG CTTAGGCCATTTATGTCCAAAAA
>NZ_CP019940.1|WP_085787617.1|6066_6363_-|type-II-toxin-antitoxin-system-RelE/ParE-family-toxin MIELIRSGTFDTWLSGLRDRRAVARITARLDRLAAGNPGDVEPVGEGVSELRINYGPGYRVYFIQRGPVLVILLCGGDKSTQSKDIKQAKVLAAEWKG >NZ_CP019940.1|WP_085787616.1|5766_6063_-|putative-addiction-module-antidote-protein MPEEKFARYDSADYLKTEEDIAAYLEAVMEDGGDDPAYVARALGVVARARNMTALAREVGMSRVGLNKALSGDGNPTLSTVMKVAKALGLKVSIQPGA >NZ_CP019940.1|WP_085787615.1|5195_5762_+|hypothetical-protein MSGSTDPFLVLVDDIGALRRQIENLQRTSLDRDEAEHLNATIAQSLDNMAQTGKRLEQRLEGQLQLATAKTHRDAIEAAQGAARAAIRESHAEILQTARSLSQAAGEARREAWRWFGGFWVWLASIGAAGALVGALAVFWLQGRADAKAFGQYPSIYCTTAGGAFADQRDGSRYCIFMISPPTQPDGE >NZ_CP019940.1|WP_085787614.1|4932_5199_+|hypothetical-protein MARTIDQQIADAQAKLARLKTRQKASDTRRKIIVGAIVTTEALKDPKISKWLASTLRKNATRDVDQKEIAGLLADLDARAQSAGAGEA >NZ_CP019940.1|WP_085787613.1|3522_4779_-|hypothetical-protein MAIYHLRATMISRSSGRSATAAAAYRVGERIEDHRTGLTFDYRARGGVDHVETLAPANAPAWVQDREALWNAVEAAETRKNSQVAREIRVALPAELDHGQRVELVREFCQREFVARGMVADIALHAPGRTGDDRNHHAHILLTTREIGPEGFGAKNRDWNAVEMLEGWREAWARDSNRALERCGHEERIDHRTLEAQRIEAQERASAAHDRGDEAEALRQTVRAVELDRDPLPQLSAGAWQMKERGIEVGAVRVWREVKAQAAEVARVAEVLAGQVRDWIGRAVDRLGPLTQEGAAQGLAYAGAADRDGREGRQDLAARLREAWEARQSRTASDAIAAPGQTPEPESSRSLAERLREAAQGIDRGTLAEAAARLQESREAEERQRVQEAERLKEQERQQERLREREARDHDRDGGLTH >NZ_CP019940.1|WP_085787612.1|2654_3233_+|TetR/AcrR-family-transcriptional-regulator MQEVRSGPGRPKDPVVAEAIRKAALRLVRERGYRNVSIGAIAQAAGVARQTLYNRWHAKADLILDAVFEETGRRADDQLPLETGDASRDRLERLLIGVFNHLRADGDTLRALIAAAQEDSEFREAFRERFVAPRETIVTDILAEALRRGELSREADPDTLSTMIHGAFWYRLLNGRELDHELARSIARSVFP >NZ_CP019940.1|WP_085787611.1|1320_2538_-|multidrug-effflux-MFS-transporter 
MLDRPLIPARAGALIAVLALLSIFPPLATDMYLSAIGILAEDLNTSHAATELSLSLFFLGLCLGQLIVGPLTDGYGRKRPLLIGVFIFTVTSIALPLVDNIVVFNALRFLQAIGACVGMVVSRAIVADLYSGQKAAKVMTLLVMLMTIGPVIAPTLGSLLLEAFGWRSIFVTMVLVGLPALILSKLVVPETLSPERRVAQPFRSATRNGLRLVRQPAYLAPVLVAGLVQGGMFAFITGSSGVFQGFFGMSALNYGLIFALIAAALFVFAQINSRLLDRFTPRDMLNRGLPFYGLAALAAVLASATGSVWMFIIPLWIAIGMVGLLSANATSLAMAAATEAAGTGSALLGAVQFGMAFTISTCVALAGTDSPFPMAMGLFLPALAAVALWAALRSRANAGAESKMQ >NZ_CP019940.1|WP_157115761.1|452_851_+|hypothetical-protein MKKVDKLSIREAVKHFDVSRPTLQKALKSGKISGVQDGQGTWTIDPSEMARVYQPRQDEVVKDGGQEHENLSAKNTPLHGQVEVLKERLADAEKRVAIAEALAEERGKHIEDLRRMLPAPEAGQPRRRWWPW |
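The array record for NZ_CP019940_1 lists a repeat, one spacer, and a second repeat within the location 6755-6845. The sketch below checks that these three segments, laid end to end from the array start, reproduce the listed spacer start (6778) and array end (6845). It assumes 1-based inclusive coordinates, which is inferred from the numbers rather than stated by the source; `segment_coordinates` is a hypothetical helper.

```python
from typing import List, Tuple


def segment_coordinates(array_start: int, segments: List[str]) -> List[Tuple[int, int]]:
    """Return 1-based inclusive (start, end) for consecutive segments."""
    coords = []
    position = array_start
    for seq in segments:
        end = position + len(seq) - 1
        coords.append((position, end))
        position = end + 1
    return coords


# Segments copied from the NZ_CP019940_1 array record above.
repeat_1 = "CTTAGGCCATTTATGTCCAAGAA"
spacer = "TGTGACAGCCAGCCGAGCGGAGGTAGAGGACTCTGGTGTCCTATG"
repeat_2 = "CTTAGGCCATTTATGTCCAAAAA"

coords = segment_coordinates(6755, [repeat_1, spacer, repeat_2])
for name, (start, end) in zip(["repeat", "spacer", "repeat"], coords):
    print(f"{name}: {start}-{end}")
```

Under this assumption the spacer falls at 6778-6822 and the second repeat ends at 6845, consistent with the record.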
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP019940_1 | 1.1|6778|45|NZ_CP019940|CRISPRCasFinder | 6778-6822 | 45 | NZ_CP019940 | Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed3, complete sequence | 6778-6822 | 0 | 1.0 |
1. spacer 1.1|6778|45|NZ_CP019940|CRISPRCasFinder matches to NZ_CP019940 (Ketogulonicigenium robustum strain SPU_B003 plasmid unnamed3, complete sequence) position: , mismatch: 0, identity: 1.0
tgtgacagccagccgagcggaggtagaggactctggtgtcctatg CRISPR spacer tgtgacagccagccgagcggaggtagaggactctggtgtcctatg Protospacer *********************************************
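The Spacer_Info strings used throughout (for example `1.1|6778|45|NZ_CP019940|CRISPRCasFinder`) look like pipe-delimited records of spacer index, start position, length, contig accession, and detection tool. That field order is an assumption read off the tables, where start and length reproduce the Spacer_region column. A small Python sketch that parses such a string and derives the region (`SpacerInfo` and `parse_spacer_info` are hypothetical names):

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpacerInfo:
    spacer_id: str  # e.g. "1.1" (assumed: array index, then spacer index)
    start: int      # assumed 1-based start on the contig
    length: int     # spacer length in bp
    contig: str     # contig accession
    tool: str       # tool that reported the array

    @property
    def region(self) -> str:
        """Inclusive start-end range, as in the Spacer_region column."""
        return f"{self.start}-{self.start + self.length - 1}"


def parse_spacer_info(text: str) -> SpacerInfo:
    """Parse 'id|start|length|contig|tool', tolerating a leading FASTA '>'."""
    parts = text.strip().lstrip(">").split("|")
    if len(parts) != 5:
        raise ValueError(f"expected 5 pipe-separated fields, got {len(parts)}")
    spacer_id, start, length, contig, tool = parts
    return SpacerInfo(spacer_id, int(start), int(length), contig, tool)


info = parse_spacer_info("1.1|6778|45|NZ_CP019940|CRISPRCasFinder")
print(info.contig, info.region)
```

For the spacer above, the derived region is 6778-6822, matching the Spacer_region value in the table.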
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|