Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
LS483485 | Aggregatibacter aphrophilus strain NCTC11096 genome assembly, chromosome: 1 | 2 crisprs | cas3,DEDDh,DinG,WYL,cas2,cas1,cas4,cas7,cas8c,cas5 | 1 | 5 | 2 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LS483485_1 | 1769904-1770045 | Orphan |
NA
Consensus repeat of LS483485_1
|
1 spacers
spacers of LS483485_1
>1.1|1769952|46|LS483485|CRISPRCasFinder TTCTCAACATCAAGTTATGCGGGAATAGCTCAGTTGGTAGAGCACA |
DEDDh |
CRISPR arrays and Neighbor proteins around LS483485_1
The CRISPR arrays of LS483485_1 >merge|LS483485|1|1769904-1770045|CRISPRCasFinder ACCTTGCCAAGGTCGGGGTCGCGAGTTCGAGCCTCGTTTCCCGCTCCATTCTCAACATCAAGTTATGCGGGAATAGCTCAGTTGGTAGAGCACAACCTTGCCAAGGTTGGGGTCGCGAGTTCGAGCCTCGTTTCCCGCTCCA >LS483485|1|1|1769904-1770045|CRISPRCasFinder ACCTTGCCAAGGTCGGGGTCGCGAGTTCGAGCCTCGTTTCCCGCTCCA TTCTCAACATCAAGTTATGCGGGAATAGCTCAGTTGGTAGAGCACA ACCTTGCCAAGGTTGGGGTCGCGAGTTCGAGCCTCGTTTCCCGCTCCA
>LS483485.1|SQI99188.1|1769210_1769756_+|Oligoribonuclease MELDKQNLIWIDLEMTGLDPEKERIIEIATIVTDKNLNILAEGPVLAVHQPDELLNKMSEWCVKTHTANGLVERVKASKLNERAAELQTLDFLKKYVPKGTSPICGNSVAQDKRFLFKYMPELADYFHYRHLDVSTLKELAARWKPEILNGFTKQNTHLALDDIRESIKELAYYREHFLNI >LS483485.1|SQI99179.1|1768098_1769139_-|Putative-ribosome-biogenesis-GTPase-RsgA MSKPKLTQNQKRRIQSNNNKVLHRHQKKEIEWRDDMLGESQEGLVVTRYARHADVENAQGEIFRCNLRRTLSGVVVGDHVIWRQGNEQLQGVSGVIEGIHPRKNEISRPDYYDGIKVIAANIDRIIIISSVLPSLSLNIIDRYLVVCEEANIEPIIVLNKVDMLTEAQWLEADELLAIYRKIGYQTLMLSAQSGKNLEKLTALLSHGVSIFVGQSGVGKSSLINAVLPHVDAQVGEVSATSGLGQHTTTSSCLYHLPQGGSLIDSPGIREFGLWHLEEEQITKGYREFQTVLGTCKFRDCKHLSDPGCALCQAVEEGKISTVRYENYHRLLASRNEMKSQRHFSSE >LS483485.1|SQI99177.1|1767613_1767871_-|Phosphocarrier-protein-HPr MYSKDVEITAPNGLHTRPAAQFVKEAKAFASDITVSSAGKSASAKSLFKLQTLALTQGTVITISAEGEDAEKAVEHLVALIPTLE >LS483485.1|SQI99176.1|1765746_1767474_-|Phosphoenolpyruvate-protein-phosphotransferase MISGIPASPGIVFGKALVLKEEKIVLDTQKIKDSQIENEIARFYAGRDAAVEQLNSIKDRAYQSLGEEKAAIFEGHLMILEDEELEEEIIDYLRSNHVNAAVAANVVIDQQVAMLSEIDDEYLKERAGDIRDIGNRLIKNILGMHIVDLGEINEEAILVAYDLTPSETAQLNLDKVLGFVTDIGGRTSHTSIMARSLELPAIVGTNNVTEKVKTGDFLILDALNNAVYVNPSQQEIQRLKTLQDKLAEEKAELAKLKDLPALTLDGHQVDVVANIGTIRDVEGAERNGAEGVGLYRTEFLFMDRDQLPSEEEQFIAYKEVVEAMNGNLVVLRTMDIGGDKELPYLNLPKEMNPFLGWRAIRIALDRREILNAQLRAVLRASAYGRLAVMFPMIISVEEIRELKSVIEELKVELRNEGKAFDEDIQVGIMVETPSAAVNAKFLAKEVDFFSIGTNDLTQYTLAVDRGNELISHLYNPMSPSVLNLIKQVIDASHAEGKWTGMCGELAGDERATILLLGMGLDEFSMSAISVPRIKKLIRNVNYQDAKLLAEKALQQPTAAEIEQLISDFLAEKALN >LS483485.1|SQI99175.1|1765185_1765686_-|Glucose-specific-phosphotransferase-enzyme-IIA-component MGLFDKLFGSKDKKAVDVEIYAPLSGEIVNIEDVPDVVFSEKIVGDGVAIRPTGNKLVAPVDGVVGKIFETNHAFSMESKEGVELFVHFGIDTVELKGEGFTRVAQEGQSVKRGDTIIELDLPLLEAKAKSVLTPVVISNMDEISNIEKKSGEVVAGDSVVLVLKK >LS483485.1|SQI99174.1|1763007_1765047_+|Oligopeptidase-A MSNPLLTPTDLPAFSKIEPQYIEPAIKQLIEENRATVEHLLKQPHFTWENFILPLAEAGDRLSKVWSPISHLNSVKNSPELREAYQACLPLLAEYGTWVGQHQGLYEAYLQLKNSPEFANYSQAQKKAIENSLRDFKLSGISLPAEKQKRYGEIVARLSELTSQFSNNVLDATMGWEKIIEDESQLKGLPESALQAAKQSAESKGLSGYRFTLEFPSYIPVMTYCENRELREEMYHAFATRASDQGPNAGKWDNSALMQEILSLRVDLAKLLDFNTYTELSLATKMAETPQQVLDFLTNLAQRSKAQGKRELQELKDFCKTHYNLTALELWDLTFYSEKQKQHLYAINDEELRPYFPEERVLNGLFELIKRIFHIRAVERHGVETWHKDVRFFDLIDDTNEVRGSFYLDLYAREHKRGGAWMDDCIGRRKTIDGNLQKPVAYLTCNFNRPLGDQPALFTHDEVTTLFHEFGHGLHHMLTKIDVADVAGINGVPWDAVELPSQFMENWCWEEEALQFISGHYQTNEPLPKEKLTQLLKAKNFQAAMFVLRQLEFALFDFRLHHTFDANKSNQVLDTLHQVKAEVAVVPTVDWGRMPHSFSHIFAGGYAAGYYSYLWAEVLSADAYSRFEEEGIFNAQTGQSFLDEILTKGGSEEPMKLFKNFRGREPQLDALLRHKGIAN >LS483485.1|SQI99173.1|1762509_1762875_-|Inner-membrane-protein-ybaN MKYFYIGLGFLFLIIGLIGIVLPILPTTPFLLLTVFFFAKGSERVHNWFVGTKIYQNHLKDFHEQRALTKKTKMAILTFSTTMLLIGFYFTPSIIGKSLIIAVLLIKYWFFFFWIKTLEEE >LS483485.1|SQI99172.1|1760902_1762471_-|Periplasmic-oligopeptide-binding-protein-precursor MPVAHFPLSAFRLFPFKSVVLFCSVFALNACDKKPQEPVTPPPTIETVQLQLISSQGNRQLLVRGVYSDLVLNPSQAVNAEQFAFLRDLFEGLVIYDQRGNVIPAVAESWQTTDNKTWKFSLRQDAKWSNGEPVTAQQFVASWQALVTSNSPLRHYLAYINLANAESVLKGKLPADKLGISAENDRTLRLTLDKATPYLPQMLVHISLLPQYLAPHEGIVTNGAYQVAGQENHFIHLEKNPHYWAQDKVAFKHVDYQKIASQQDPIALDLVINPSKTEQAQYFPQLCTYFYAFNMKQPKLAQSSVRKALSMMAPSRNMNNEGKNFIYLSDNFLPISMQTVESHWEQTPMEQLLSQSKISEKAPLKLTLSYDQTELQSKIAQSLIRMWSQSDMIRIIGEGMPRQKLLENIAKGDFQIARSGWCADYNDPAAFLSLFYSHSPDNKSGYHNEEVDRLFEQSLQLMPSAERTTLYSRIEQILQQEKVVLPLYQTTVPIYINPTINGYYLSNPTEVIYSKDLFRKIQ >LS483485.1|SQI99171.1|1760066_1760864_+|Uncharacterised-protein MLKTLSKVISSSISMAFLVVLGWSVAYAYGWGQSYFYGFPWWYVDVGSGNVARSLGYVIWATIILLLTYLIGLFGLKKVKPYMSERCVNLLRTYILCTIFFIPIPVACILLVGKLNSIFAIVYIITTFIFTLLFKNYFRNHISTISIHVVIRFFHRNKSYVMLFMYCYFVIFGFIMGYVRPNFKIIFDSMEVEKQSYYVLAKYSDTFILSRSIRATNGDFYIYKMNPNSICHIKVVDIRKLGIDKMAPKEIELKEVKAEEANTEL >LS483485.1|SQI99170.1|1758296_1758950_+|GTP-cyclohydrolase-2 MAKIELVAQANLPTEFGLFKIVGFEFPDSKKEHVALVLGDISNGDEPVLARIHSECLTGDALHSLKCDCGFQLAAALRQINQEGRGVLIYHREEGRGIGLINKIRAYSLQDQGMDTIEANLALGFAADERNFSVCADIFDLLGVKKIRLLTNNPEKIETMKQAGINVVERVPLNVGENRYNTAYLDTKAKKMGHFIVHNGEQHLMECPYCQEEVPKK >LS483485.1|SQI99189.1|1770258_1771884_+|Putative-phosphoethanolamine-transferase-ybiP MFKKFLSYLNSRIFWIWLLFFSFITLIISPENSAYYGIFVIYIIYYLIFSFNQKIFWLFITFVVITLSLYQPIYSSYGNLNSGVVAAFFETNPAESFEFLGKLKIDQFILPFLFSLSAYILYRLREQANPQREITEKDIKYKKILNITLISVTIFSIIWIPTKFHFENSSKEQVDSHWTLANSPVNLISFYANIIDSITDYYNDKKDLETAKDVLPPWHIISTQPKYKNYVLIIGESARKDYMSTYGFKLPTTPFLDKTNGYINAGYVSAAPATYHSLLNTLHFKPKEKGKKDYSYNIISLAKVAGIKTFWLSNQGTIGKYDTLASRLGIGADFHYFTKKGGFITNNADDFKLLEELKIKFKEKAYENDTRLFVIHLMGSHRNFCQRITDKEKKLEFINESLSCYVNTILKTDKLIEEIVNVLKEQNEPYSLIYFSDHGLSHVNKENKKEVDLDFGEEHKQNFEVPFVKISSDDTSREVVNVKRSAFNFIYGFSQWLGIETKELNQEYNFFSNKNDENIKVFNFKENIPYSTLKNDDIPNL >LS483485.1|SQI99190.1|1772116_1772593_+|ADP-binding-protein MTERFTQYIPNENAMCVFGEKLIKAICQVSNNKSVALYLNGDLGAGKTTLSRGMIQGLGYSDNVKSPTYTLVEEYKIGGKIIYHFDLYRLADPEELEFMGIRDYFAENTICLIEWAEKGAGLLASADLLVNIAYAENARNIELLAESETGRQIIQQLN >LS483485.1|SQI99191.1|1772639_1774097_+|N-acetylmuramoyl-L-alanine-amidase-AmiB-precursor MHGLLFFAVLAFADNTWTIAIDPGHGGKDPGAIGRNLKIYEKNVTLSIAKELKALLDKDPHFRAVLTRNGDYYISVPQRSEIARKYKANYLVSIHADSSETPNLRGASVWVLSNRRANDEMGQWLEDHEKRSELLGGAGSVLASHNEKYLDQTVLDLQFGHSQRVGYELGNIVLRHFSQIASLSRPTPRHASLGVLRSPDIPSILVETGFLSNQEEELKLSTPAYRKRIAKAIYNGLAEYRRKNVKDEPKVAIADKNEKTSEKSTALEVKDSGIRHTVKSGEGLGKLAEKYHVSTADIIALNKLKRKALWGGETIKIPDNGKNIPTIEDKSVKTKENNIVEVKDSGVRHKVKRGETLGKLAEKYKVSVNDILTLNKLKRKELLIGENLKIPAIAKAETSNKTEKGKETETPKPVDKSPKTQGKTKPEVKEVVPKFHTVKKNETLYSIAREYKIAPNKLLKLNPQLKNGKVLSGQKIKLTEDQGKK >LS483485.1|SQI99192.1|1774096_1775944_+|DNA-mismatch-repair-protein-mutL MTIRVLSPQLANQIAAGEVVERPASVVKELVENSLDAGADKIQIDIENGGAGLIRIRDNGIGIAKEELALALARHATSKIADLADLEAILSLGFRGEALASISSVSRLTLTSRTAEQHEAWQVYAQGRDMETTIQPASHPIGTTVEVANLFFNTPARRKFLRSEKTEFSHIDEVIRRIALAKFNISFTLTHNGKVLRQYKSAMTNEQKLKRVATICGDDFIQNALQIDWKHDDLHLSGWVALPHFNRPQNDLNYCYVNGRMVRDKIITHAIRQAYAEYLSNDQYPAFVLFIDLNPNDVDVNVHPTKHEVRFHQSRLVHDFITQGISHALTSESLDFSATETERKIQEPMGLWEVSSKPNRSAAGPNMFTQPSTYSTGYRVEKQPSEDTYHSSQKHRQNPPHFNRDNITPSVLDAHKHLWMDSTAPSRSKITISEDSKPQSTCLHALALVGNHALLLQQEQHFYLLSLSRLQRLKLKLNLTLTATSQPLLIPVIFRLSETQWQAWQQQKAWFTQVGFDFLAEDAQRKITLQKVSAHLRRQNLQQLIIALLNEPVENLSEFLTALLAQLDFPPIQVLADAVTMLTEIEQLLNKQSHIQLSDLFLEINWQPYLTQLAD >LS483485.1|SQI99193.1|1775946_1776903_+|tRNA-dimethylallyltransferase MMQHSEHKPTAIFLMGPTASGKTDLAIQLRQQLPVEVISVDSALIYRGMDIGTAKPTAEELALAPHRLIDICDPAESYSAMNFCHDALREMQDITAQGKIPLLVGGTMLYYKALLEGLSPLPSADEKVRSEIETKAMQIGWSGLHQELAKIDPISAQRINPNDSQRINRALEVFYLTGKTLTELTAQKGEALPYDILQFAIAPEQREVLHLRIEQRFHKMIELGFQQEVEKLYQRSDLNENLPSIRSVGYRQMWEYLRGDYDHKEMIFRGICATRQLAKRQITWLRGWKSPIQWLDSLHPTQALEKVLVSVNSLSDKQ >LS483485.1|SQI99194.1|1777029_1777332_+|RNA-binding-protein-Hfq MAKGQSLQDPYLNALRRERIPVSIYLVNGIKLQGQIESFDQFVILLKNTVNQMVYKHAISTVVPARSVAHHNANQQQQHQQGQQQEAPSSVETNTDAQTE >LS483485.1|SQI99195.1|1777346_1778729_+|GTP-binding-protein-HflX MDNLLGNLTQSAVDSGNVSTAFSMPENSTQTSDHTINNAIIVHCFFEQSKNTDDLTEFQLLAKSANVHILNVITATRSTPQAKYFIGSGKAEEIADAVRQYNADLILVNHSLTPAQARNLEALCDCRVVDRNGLILDIFAQRARSHEGKLQVELAQLKHLSTRLVRRKTGLDQQKGAVGLRGPGETQLETDRRLIKVRINQLQSRLEKVDKQRNQNRQTRQKADIPTISLVGYTNAGKSTLFNLITDANVYAADQLFATLDPTLRRLTLQDVGTTILADTVGFLRDLPHDLISAFKSTLQETTEASLLLHVIDCADNRKLENIEAVNQVLEEIGAQEVPRLLVYNKIDQLENVVPHIEYDEKHLPSAVYISANSGSGLDLLLEAIRLRLTEHILNLQIRLPPSDGKLRHAFYQLNCVEKEEINEQGEFLLSIRLEKTEWLKLVKRFTQLTPFNPIQPDEN >LS483485.1|SQI99196.1|1778725_1779685_-|Transcriptional-regulatory-protein-tyrR MTTSKNTDPFAQIVSKNPRMQDVIEKAKKFALLDVPLLIQGETGTGKDVIAKACHDFSERRDHAFLAVNCAGIPGEDAETEMFGRRNKDGEFIGFFEYADGGTVLLDGVEELSLTLQAKLLRFLSDGTFRRVGEEEERYANVRVICTSQQPLQHYVEQGKMRSDLFHRLNVLSLNLPLLRERKEDLALLSHQLIQEISEKLGIFPPHFDENVLRYLQEYPWPGNIRELYNALYRACSLCQNNQLRIEDLGLASQIPHSQDIDQFITEGDTLDEMVGRFEAAVLNKFYAKYPSSRKLATRLGVSHTAIANKLRQYGIGKS >LS483485.1|SQI99197.1|1779781_1780876_-|Domain-of-uncharacterised-function-(DUF697) MEKKVFTQNESDMNEDLNAQSAFIAKQEFHEEQAVPDTEDENGIFEGELLEQQFEQSLQPKPRWWKKILGGVSLLFFVASVAQSVQWLVDAWQQNQWIYFAFSLVACAVILLGVSAIAGEWRRLAKLRHRAEIQTQSQQLLKSAVNFKDVFSSAEHQQAVSLCQEVTKLVHIDGQNPDFMQWQKQIHEAYSAREILHLFSQNVLQPVDKQAKKLITKASTESAALVAISPLELVDIFFIAWRNIRLINQIARIYGIELGYFSRIRLLRMVLVNMAFAGVTELIQDLGVNWLSQDLTAKLSARAAQGIGVGILTARLGIKAMEFGRPLAFQSNEKPRLSQLHKELLSHLSNTVFDQVKFKQKNKV >LS483485.1|SQI99198.1|1780886_1782293_-|Predicted-ATPase MLNHLQKELNELVNRGLDRTLRIAVTGLSRSGKTAFITSLMNQMLHINRVDNGHLPLFDAARQHRILAVQRVPQLDLSIPRFDYEGNLNALSQEPPIWPQSTRGVSETRLAIRYQRSNGLLRHLKEKGTLYLDIFDYPGEWLLDLPLLDLGFEQWSLELQRQLNGTHAELAQTWLNKVKKIKLTDSADEDILAHLAKDYTDYLWQCKKQGLHFIQPGRFVLPGELEGAPALQFFPLLHLNASQWQQLKREAKSESYFAVLTKRYDYYRQHIVKGFYENYFETFDRQVILADCLTPLNHSRQAFNDMQQALQQLFRNFHYGKRHLLNRLFSPKIDKLMFIATKADHITTDQLPNLIGLMRQLVQEGGRYVEFADIVTDYTAIASIRATQQVVVNQNGKQFKALQGIRSSDKQKVTLYPGSVPGRLPSADFWQNQKFEFDQFEPRRLEQGENIPHLRMDAVLQFLLGDKL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LS483485_2 | 2001339-2001979 | TypeI |
NA
Consensus repeat of LS483485_2
|
9 spacers
spacers of LS483485_2
>2.1|2001371|34|LS483485|CRISPRCasFinder,CRT CGCCTTTTTGGTAGGTTTAAAAATAAAAATGAGT >2.2|2001437|36|LS483485|CRISPRCasFinder,CRT,PILER-CR CCAGAGCAGTTACGAAAATTAAGCTGCGGAATCAAA >2.3|2001505|34|LS483485|CRISPRCasFinder,CRT,PILER-CR ACAACGAGCAGGCGTGGGCGGAGAACTATTTGAC >2.4|2001571|33|LS483485|CRISPRCasFinder,CRT,PILER-CR ATTTATGGTGCGATTACATTAATTGTTGTTCCT >2.5|2001636|34|LS483485|CRISPRCasFinder,CRT,PILER-CR TTTCAAGTTGTTGTGCGGTTAATACCGCACTATT >2.6|2001702|35|LS483485|CRISPRCasFinder,CRT,PILER-CR GTATCATTGTTTGTAGATAATTCTTGCTTACAAAT >2.7|2001769|39|LS483485|CRISPRCasFinder,CRT,PILER-CR GCAAAACTTTTAGAACAAAACGGCTTGCCGGTAACCGCG >2.8|2001840|37|LS483485|CRISPRCasFinder,CRT,PILER-CR GCAATCTGACCGCACTTTTGCAGAAAAACCTGTTGCG >2.9|2001909|39|LS483485|CRISPRCasFinder,CRT,PILER-CR GCGAACCATTTACGCATCAGCATCGGGTTGCCGGAAGCG |
cas2,cas1,cas4,cas7,cas8c,cas5 |
CRISPR arrays and Neighbor proteins around LS483485_2
The CRISPR arrays of LS483485_2 >merge|LS483485|2|2001339-2001979|CRISPRCasFinder,CRT,PILER-CR GACAAAATATATACTCACCCGAAGGCGGCTGCCGCCTTTTTGGTAGGTTTAAAAATAAAAATGAGTGTTTCAACACACAGCCACCCGAAGGTGGCTGCCCAGAGCAGTTACGAAAATTAAGCTGCGGAATCAAAGTTTCAACACACAGCCACCCGAAGGTGGCTGCACAACGAGCAGGCGTGGGCGGAGAACTATTTGACGTTTCAACACACAGCCACCCGAAGGTGGCTGCATTTATGGTGCGATTACATTAATTGTTGTTCCTGTTTCAACACACAGCCACCCGAAGGTGGCTGCTTTCAAGTTGTTGTGCGGTTAATACCGCACTATTGTTTCAACACACAGCCACCCGAAGGTGGCTGCGTATCATTGTTTGTAGATAATTCTTGCTTACAAATGTTTCAACACACAGCCACCCGAAGGTGGCTGCGCAAAACTTTTAGAACAAAACGGCTTGCCGGTAACCGCGGTTTCAACACACAGCCACCCGAAGGTGGCTGCGCAATCTGACCGCACTTTTGCAGAAAAACCTGTTGCGGTTTCAACACACAGCCACCCGAAGGTGGCTGCGCGAACCATTTACGCATCAGCATCGGGTTGCCGGAAGCGGTTTCAACACACAGCCACCCGAAGGTGGCTGC >LS483485|2|2|2001339-2001979|CRISPRCasFinder GACAAAATATATACTCACCCGAAGGCGGCTGC CGCCTTTTTGGTAGGTTTAAAAATAAAAATGAGT GTTTCAACACACAGCCACCCGAAGGTGGCTGC CCAGAGCAGTTACGAAAATTAAGCTGCGGAATCAAA GTTTCAACACACAGCCACCCGAAGGTGGCTGC ACAACGAGCAGGCGTGGGCGGAGAACTATTTGAC GTTTCAACACACAGCCACCCGAAGGTGGCTGC ATTTATGGTGCGATTACATTAATTGTTGTTCCT GTTTCAACACACAGCCACCCGAAGGTGGCTGC TTTCAAGTTGTTGTGCGGTTAATACCGCACTATT GTTTCAACACACAGCCACCCGAAGGTGGCTGC GTATCATTGTTTGTAGATAATTCTTGCTTACAAAT GTTTCAACACACAGCCACCCGAAGGTGGCTGC GCAAAACTTTTAGAACAAAACGGCTTGCCGGTAACCGCG GTTTCAACACACAGCCACCCGAAGGTGGCTGC GCAATCTGACCGCACTTTTGCAGAAAAACCTGTTGCG GTTTCAACACACAGCCACCCGAAGGTGGCTGC GCGAACCATTTACGCATCAGCATCGGGTTGCCGGAAGCG GTTTCAACACACAGCCACCCGAAGGTGGCTGC >LS483485|2|1|2001339-2001979|CRT GACAAAATATATACTCACCCGAAGGCGGCTGC CGCCTTTTTGGTAGGTTTAAAAATAAAAATGAGT GTTTCAACACACAGCCACCCGAAGGTGGCTGC CCAGAGCAGTTACGAAAATTAAGCTGCGGAATCAAA GTTTCAACACACAGCCACCCGAAGGTGGCTGC ACAACGAGCAGGCGTGGGCGGAGAACTATTTGAC GTTTCAACACACAGCCACCCGAAGGTGGCTGC ATTTATGGTGCGATTACATTAATTGTTGTTCCT GTTTCAACACACAGCCACCCGAAGGTGGCTGC TTTCAAGTTGTTGTGCGGTTAATACCGCACTATT GTTTCAACACACAGCCACCCGAAGGTGGCTGC GTATCATTGTTTGTAGATAATTCTTGCTTACAAAT GTTTCAACACACAGCCACCCGAAGGTGGCTGC GCAAAACTTTTAGAACAAAACGGCTTGCCGGTAACCGCG GTTTCAACACACAGCCACCCGAAGGTGGCTGC GCAATCTGACCGCACTTTTGCAGAAAAACCTGTTGCG GTTTCAACACACAGCCACCCGAAGGTGGCTGC GCGAACCATTTACGCATCAGCATCGGGTTGCCGGAAGCG GTTTCAACACACAGCCACCCGAAGGTGGCTGC >LS483485|2|1|2001405-2001979|PILER-CR GTTTCAACACACAGCCACCCGAAGGTGGCTGC CCAGAGCAGTTACGAAAATTAAGCTGCGGAATCAAA GTTTCAACACACAGCCACCCGAAGGTGGCTGC ACAACGAGCAGGCGTGGGCGGAGAACTATTTGAC GTTTCAACACACAGCCACCCGAAGGTGGCTGC ATTTATGGTGCGATTACATTAATTGTTGTTCCT GTTTCAACACACAGCCACCCGAAGGTGGCTGC TTTCAAGTTGTTGTGCGGTTAATACCGCACTATT GTTTCAACACACAGCCACCCGAAGGTGGCTGC GTATCATTGTTTGTAGATAATTCTTGCTTACAAAT GTTTCAACACACAGCCACCCGAAGGTGGCTGC GCAAAACTTTTAGAACAAAACGGCTTGCCGGTAACCGCG GTTTCAACACACAGCCACCCGAAGGTGGCTGC GCAATCTGACCGCACTTTTGCAGAAAAACCTGTTGCG GTTTCAACACACAGCCACCCGAAGGTGGCTGC GCGAACCATTTACGCATCAGCATCGGGTTGCCGGAAGCG GTTTCAACACACAGCCACCCGAAGGTGGCTGC
>LS483485.1|SQI99590.1|1999499_2001164_+|EIICBA-Glc MKKLLSFEFWQKFGKCLMVVIAVMPAAGLMVSIGNSLPLISDAEWLARVGNIIAQIGWGIIGNLHLLFALAIGGSWANERAGGAFAAGLAFILINLITGHFFGVKIEMLTDPNAHVSTILAGDIPVANYFVNILGQPALNMGVFVGIIAGFVGATTFNSYYNFRKLPEVLTFFNGKRFVPFVVIYRSVLVALILAVFWPVVQTGINHFGEWIANSQDSAPILAPFVYGTLERLLLPFGLHHMLTIPMNYTSLGGTYEFLTGMQQGKQVFGQDPLWLAWISDLINLKDAGNMTQYNELLSTVTPARFKVGQMIGSSGILMGITLAMYVNVDTDKKTIYKGIFLSSALAVFLTGVTEPIEYMFMFVALPLYLVYAAIQGCAFAMADIVNLRVHSFGNIEFLTRTPMAIKAGIGMDLINFIWVSGVFAVAAFLIANFMIKKLNLATAGRNGNYDAKGSDEAPTEEKKVANASAQVVQIINLLGGRNNIAEVDACMTRLRITVHNPELVGDAAAWKQAGAMGFIVKGTGIQAIYGPKADVLKSDIQDLLSSGVEIPKM >LS483485.1|SQI99589.1|1998697_1999489_+|Uncharacterized-protein-conserved-in-bacteria MKLLTLNVHAWLEDNQAEKIDIIADTIVEKGYDIVALQEVNQLMSAPAISQALKQDNYGVVLLNKINQRATQKYSLFWSNSHIGYDKYDEGIAFLTRLPVYEVDAFYCSQHQRLDSILSRKILGLTVEYQGQLVDCYSCHINLPNCAGENQLDNIRNIVERSQSQNLKILMGDFNTDAISDPDAYQKIKSLGLLDTFEMAEQKDSGITVEKAIDGWKGHSEEKRLDYIFLNQAKRVLSSQVVFNGKNKPVVSDHFGLEIELTL >LS483485.1|SQI99588.1|1994914_1998688_+|Maltodextrin-phosphorylase MLTRSSGVLMHITSLPNAFGIGSFGQSAYDFVDFLVETKQSYWQILPLTTTSYGDSPYQSFSAIAGNTHLIDFALLTQMGLLQETDYASVNFGDDPTKVDYERIFYARRPILEIAVKHFLADKKRQADFKNFEKNNRTWLEDYAEFMAIKEHFGNKALQEWEDKLVVARKPKTLAKYRTMLKEQIQYFKVTQYFFFQQWLALKNYANQRGIKIIGDMPIYVAEDSVEVWTMPELFQLDKECKPLFVAGVPADQFSATGQLWGNPLYDWPEHKKQGYAWWIHRIEESFKIYDVLRIDHFKGFSDYWQVDGKADIAKYGTWQPGPGYDLFKAVKAQLGDLPIIAENLGNIDEKAEKLLTDCGYPGMKILQFGFENVSGESLDSPHYCIPHCIAYTGTHDNDVINGWYADLSTKQQQYINAYTHRATDESVCQAMIRQLFATVSNTVIATMQDILDLPASSRMNLPSTIGGNWEWRMQESDLTKAKKDFLTQITMLYGRANKEQVMIKFSEFVQQTTNKKLEKLSDHAIYVQLLNYVKTLAANKGKNTAKRKIYYISAEFLIGKLLSNNLINLGVYQEIKDELAQAGKSLSHIEDIEPEPSLGNGGLGRLASCFIDSMSTLGLNAEGVGLNYHCGLFKQVFKNNEQHAEPNNWIEKESWLIPTDIHYEVPFKDFTLTSKLDRIDILGYKKDTQNYLNLFDIESINHKLIKKGITFDKTKIKENLTLFLYPDDSDKNGELLRIYQQYFMVSNAAQLLIDEAIERGSNLHDLADYAYVQINDTHPSMVIPELIRLLTEKHRIKFAEAVEIVRNMVGYTNHTILAEALEKWPLAYLEEVVPHLVKIIKKLNKLVHKEYPNPDVQIIDKQKRVHMAHMDIHFSNSVNGVAALHTEILKNSELKAFYEIYPEKFNNKTNGITFRRWLEFSNQELAAYIKQLIGDGYLHDATQLEKLLTFKDDKKVHQKLAEIKFRNKLALKTYLKENKGIELDENSIIDTQIKRFHEYKRQQMNALYVIHKYLEIKAGKLPKRKITVIFGGKAAPAYVIAQDIIHLILCLSELINNDPDVNHYLNVHLVENYNVSVAEKLIPATDISEQISLASKEASGTGNMKFMLNGALTLGTMDGANVEIAELAGAKNIYTFGKDSESIIKLYETAGYVSKDYYKKDKHIKRAVDFILDSTLVKLGNQKRLKRLHDELLNKDWFMTLIDFDAYVTAKEQILADYEAQDSWNEKVIHNIAKAGFFSSDRTIAQYNTDIWHCED >LS483485.1|SQI99587.1|1994002_1994773_-|Trehalose-operon-transcriptional-repressor MVFYYRFKEGNKSQKWSGKVSKYKAVYNDIKSKITDGILPPKQELPSESELMQEYGFSKDTIRKALSLLEMDGYIQKQQGRTSIVLEHNLSTPQQLSEIKTVGELNRPLTHQVKTTLTSLYIVQGEEELMRIFNVNDQIDFYRIGRVREIDGEAVEYEVSYFDRRIVPFINREIAEQSIYHYLESELGLKISYSQREIVFRYANEEEKSTMDLGEYNMVVNVTSTTYLADGRPFQYGSISYRPDKITFASTAKRHV >LS483485.1|SQI99586.1|1991592_1993866_+|5-methyltetrahydropteroyltriglutamate---homocysteine-methyltransferase MTIFHLAGFPRVGAKRELKFAQERYWRGEIAEADLLDIAKKLREINWQHQANANADFVAVADFTFYDHILDLQVATGAIPTRFGFDSQNLTLDQYFQLARGNKTQFAIEMTKWFDTNYHYLVPEFHKDTQFKANPAHYVQQIREAKALGHNVKPTIVGPLTFLWLGKEKGATFNRFDLLNKLVPVYVDILNALSSEGVEYIQIDEPALTLDLPAEWVAAYKEVYATFAAQVNAKLLLATYFGSVSEHADLLKALPIAGLHIDLVRAPEQLSAFADYDKILSVGVIDGRNIWRANLNQVLDVVEPLKAKLGERLWIAPSCSLLHTPYDLAVEIQLQANKPELYQWLAFTLQKIQELRVIKTALEQGREAVQAELDASQAAADARKNSREIHRTCVAERLANLPKNADQRKSPFAERIKLQNTWLNLPLLPTTNIGSFPQTTEIRHARAAFKKGALSLADYEAAMKKEIEFVVREQEKLDLDVLVHGEAERNDMVEYFGELLDGFAFTKFGWVQSYGSRCVKPPVIYGDVTRPEPMTVRWSQYSQSLTNKVMKGMLTGPVTILQWSFVRNDIPRSTVCKQIAVALSDEVLDLEKAGIKVIQIDEPAIREGLPLKRADWDAYLQWAGEAFRLSSMGCKDDTQIHTHMCYSEFNDILPAIAALDADVITIETSRSDMELLTAFGDFKYPNDIGPGVYDIHSPRVPTAEEIEHLLRKALQVVPKERLWVNPDCGLKTRGWPETIAALKVMVDVTKKLRAELA >LS483485.1|SQI99585.1|1990395_1991328_-|Cyn-operon-transcriptional-activator MKPIFLELRHLKTLLALKETGSVSLAAKRVYLTQSALSHQIKLLEDQYGLPLFERKTQPLHFTPAGERLIQLANDILPKVIEAERDLARVKQGEAGELRIAVECHTCFDWLMPAMDLFRQHWPLVELDIVSGFHTDAVGLLLSHRADWAVVQEVEETPGIVYKPLFSYEMVGLCAKDHPLAAKDVWQAEDFIDQTLITYPVPDDMLDLLRKVLHPKGVNPTRRTSELTIAIIQLVASKRGVAALPFWAAKPYLDRGYIVARKITEQGLHSNLYAATRELDSQIAFVDDFYETVKAQSFSTLPELSILEEI >LS483485.1|SQI99584.1|1989659_1990391_-|azaleucine-resistance-protein-AzlC MSDVKTNSHPIWAAAKAALPYSAPMLAGFLFLGVAYGIYMKALGFSFWYPVLMALLIYGGSVEFIIAGALSLAFAPLNALLITLMVSGRQLFYSISMLEKYGKSLGKKRPYLIATLVDESFSLNYMAKVPSHIDRGWYMFFVSFYLHMYWMIGAGLGNLFGNIIPFDLKGIEFAMTALFLVIFAENWAQEKSHESSLLGLAIAAISLIVFGREYFLLPTLIGIWTVLTFRRPKLSSRLERIEE >LS483485.1|SQI99583.1|1989330_1989663_-|Branched-chain-amino-acid-transport-protein-(AzlD) MTLTEQIITIGIAVLGVQFTRWLPFWVFSANRPIPEYIRYLGKVLPAAMFGMLVVYCYKNVDVFSGFHGVPEFLSGVIVVALHLWKRNMFLSIAAGTMLYMFLVQRVLVA >LS483485.1|SQI99582.1|1986471_1989264_+|protease3 MSNQKTMKKLTALFVLLCSFRLVIACQAGIDPDALAFDPNIKHGKLTNGLQYYILNNRDPKDRVYIRLVVNAGSMHEDDDQKGIAHLVEHMAFNGSKKYPENTIINALEKLGMKFARDINAFTDFENTVYTLNLDGNSPQKLSLAFDVINEWMNHLTILPKDLDGERGVVQEEWRRRLSPMLRLGDKKSAIEMAGSRYVLRDPIGDMNIIRHISRDRVADFYHKWYRPDNMSLIVVGDIDTHKITQLISQQLDKPSSHTQRPLDKIDFSIPLIHHWRVASIAEQGTNIPALELSFFEEDKQKETITDYKQDLIQQIVTRLVNLRLQKWEENQNNWLDSANFYRSHLGKQTLQSVFSLQLADTNYLKNITALFAFIAEIKQHGFTADELNSEIARLHNLNEKQQNIRPGSLKIANDLIAIAANHQIMLSAKERYNLNRRFLNEIKVTDLNVTFNQMLALNAKLLLITQLLPEKKLPFDATYIEQRWNQAMRSDQNQWENKKHIVKQPHFEFKDGSLVLEKHWDKGNIDEFRLSNGAKLIYHYSNKTPNQVHFRAVTSGGLRSVPNQDYHLLRTAITLVDDTGTGELTQADVSNLFGQSPLVLATVIDDDKQGFTGVAKPQDLSRLLTLFRLKLQSAPVSNNVLQKYHRETQDYFKQIDAETKFMQAISYLRRPNTATVYTQNQNEQLSFTAAQLSQIYQEKILGKTDFTYFIIGDISRSELEKLAKQYLATVEIKTQARAYQPGYIHTPKKAFIMRGLSEPRADVEIYLTAENQWHPEQKYALEILGEIVQEKLRLVLREKVSGIYSVNSWFSQDPHTPQIEGKIAFSCAPNRAEELIKLTHQILDEIIENGIDETLLRKKQAEQQQYIKRQFDSLVSVASMIEDSYWQQGNPQSVYLYQRLEQLADKPHLEALARKVLVKAARFEAILRQ >LS483485.1|SQI99581.1|1984045_1986412_+|Outer-membrane-cobalamin-receptor-protein MYKKTKIAFFICTALYAQHVLSEEKSTNKSNMLPEIIVYGDSNKSLSSTQAVTSNEMEKIPTTNNNITDYLRSNPHIRYEDSDQNGFQLGEIKPQNISINGADANQTAYFVDNVNVNNDLTVDNEIFDGAMQVVPGISNTQAYFFDASMLSKVEVHDSNISASLGGFMGGAVVAKTKQYNGKDGVSLKYRTTNSGWAKINADSSAKTLLDKIRPDAGGVAEFQPKYHKQTFSIMAEKGLTENLGMVIGYSKRHARIQQNRLIGYAPDVKLDKQNHKRDSDNLLLNFNLAASEKDRFELGFRYSNYKEQKYYATNIDSNVSDYHQALGSTLAWVHSFNSGILTNTLAYDHFKDKRKSSSANVEIVSVFDENFDPLYDYEKGGYGNSSLTQDNIHFSTEFAVDPFNLGFANHSISIGGIYQATHYKFNRPQDVHSKIIQKYPNLSPIETTNVTHQGNAQTRYQNFVFYTEDLIKWKKLELRPGVRIERDDYLQNNNIAPRFVARYKPWEETGFTLGLNRYYGRSFASLKLTNEILKINRDTSRKYQEFHSLKTPYADELSIGFDQEFNNLAFKLNYIHRKNKNRIVLKRDANKVNFYHNGSDFSVEVYTFQMNNIEPWQLGKSYWTSSLGFDWLKTKRADIGRDLDPNELVYLDGKLLTRREMLNKVNSSTEDWITRFGLDMAIPDYNITWSNKVYIKAPIRSYDVLEGDFNDGISRYRSYHYGRHTQWDSSIRWQPTITGNHSIYLQVDILNVLNKTRKSKTVKPISSNDEYGIYTPGREFWLEVGYKF >LS483485.1|SQI99591.1|2002172_2002466_-|CRISPR-associated-endoribonuclease-Cas2 MLMLITYDISLEDAEGQARLRRVAKLCLDYGVRVQYSVFECDITPDQWVVLKDKLLKTYNPETDSLRFYHLGSKWRRKVEHHGAKPAVDVFKDVLVI >LS483485.1|SQI99592.1|2002540_2003554_-|CRISPR-associated-endonuclease-Cas1,-subtype-I-C/DVULG MRKLQNTLYITTQGSYLHKERETLVVEQERKKVAQLPVHSIGHIFCFGNVLVSPFLLGFCGENNVNLAFFTENGRFLGRLQGRQSGNVLLRRAQYRVSEQNPVPIARNIIAAKIQASKRVLQRQIRNYGENAAIQSAVDALNISLRQLKGTAELDVVRGIEGDAAARYFGVFGQLLSEKSGFAFDGRNRRPPRDGVNALLSFVYSILGKDISGALQGMGLDPQVGFLHADRPGRDSLAQDILEEFRAWWADRLVLSLINRGQIKPQDFVTEASGAVSLKADARKLLFQALQAKKQEKIVHPFLGEEVEIGLLPYIQAMLLARHLRGDLAEYPPFLMR >LS483485.1|SQI99593.1|2004195_2004708_-|Uncharacterised-protein MVNKNMIKSKKIFRFEDGEINFEAEKFLFKLEVDNTISVANHIQFHPVRSYKNSLKQVLRPIEGIADLFISCIEEKFNNYDDEMNDALIELIENMLHNIISHVDTCKNIIKQLSGEMKSNGIQNEFLNNIELYRSLFAKQINLVKHNSRFIRVIRGRDEKRKEILLLGTI >LS483485.1|SQI99594.1|2004754_2005405_-|CRISPR-associated-protein-Cas4 MTALLTETQGKNQDTRLIPLSALQHYAFCPRQCALIHNEQAWAENYLTAQGKALHERVDSGEPETRRGVRFERTVHVSAEKLGISGVLDLVEVETKTGRLKPVEYKRGKPKPDPMDEIQLCAQGLCLEEMTGQIVSEGALWYMQTRHRVPVAFSDDLRAQTLATIAAVRELLNSGQTPPPNYSKRCKACSLVEICQPELLGKRDRSVGYVAGLFGE >LS483485.1|SQI99595.1|2005420_2006287_-|Uncharacterized-protein-predicted-to-be-involved-in-DNA-repair MSAIQNRYEFVYFFDVTNGNPNGDPDAGNMPRLDPESSKGLVTDVCLKRKIRNFVEISSENEAGYEIYVKEKSVLNLQNKRAYEALGIESEAKKLPKDEAKARDITAWMCKNFFDIRTFGAVMTTEVNSGQVRGPVQLAFAQSIDPIVPLEISITRMAVTNEKDLEKERTMGRKYIVPYALYRVHGFISANLAAKTGFSDDDLAKLWQALTLMFEHDRSAARGEMAARKLIVFKHDSALGSQPAHKLFDAVKVERVNGESGTPASGFGDYKISVVSDGLNGVSVEELL >LS483485.1|SQI99596.1|2006298_2008089_-|CRISPR-associated-protein-Cas8c/Csd1,-subtype-I-C/DVULG MILASLARYYRCLAAETDEMGNPKVPPYGFSEEKIGWILVLDKEGRLKTVVPNLTADKKPQPKLMSVPRPEKRTSGIKPNFLWDKTAYALGVEANKNKAEAKEKPFTLSEKTFDAFKQYHLDLLQNSDDEGLQALCRFLQNWQPAHFAAENLPAEMLDANIAFSLEKPTALIHKREAAQTLWAGSLKSDEALEGLCLISGDTAPIARLHPAIKGVFGGQSSGGSIISFNKEAFASFGKEQGANAPVSEQSAFAYTTALNYLLRQRNQEANNHCLTIGDASTVFWAEADDNATAQAAEGFFAQVFMPPNDEQESVKIFNVLEQIGKGCPLQEIAPELSPNTRFYILGLAPNAARISVRFWLDTTFGQLAENLAQHWQDLALEPCAWKTPPSIWRLLLQTAVLGKSENISPVLAGEMTRAVICGTPYPLSLLSQLITRIRADGDVNGLRVAIMKAVLERRFRKGFIEEGVPMSLNNESPNRAYLLGRLFAVLERIQYQALGELNAGIADRYYGSASAVPFSVFPRLLSGAKHHLSRLRKDKAGMAVNLDKDLGEIIAKLPETFPRHLSIDEQGRFAIGYYHQKQSYFAKKETAETIEN >LS483485.1|SQI99598.1|2008085_2008763_-|CRISPR-associated-protein-Cas5,-subtype-I-C/DVULG MANQIRLHIWGDYACFTRPEMKVERVSYDVITPSAARGILAAVHWKPAIRWVIDRIYVLKPIRFDSVRRNELGGKISAGKVSGAMKRKSVADLYTLIEDDRQQRAATVLKDVAYVIEAHAVLTAKAGADETVTKHIEMFKRRAKKGQCFQQPCLGVREFPADFALIDEDEPLPPSVLSENEANRDLGWMLHDIDFDHGNTPYFFRAQMKDGVIDVPPFYAEEVKA >LS483485.1|SQI99601.1|2008886_2010224_-|NurA-domain MSYSSVGKKPFEKASKSSHHHIINDEVVQSALSNFYIPDVLPEVSISSLTVPHNSCEHSLKHVVAIDGGYTEIPLKIGYPSASLHFFQFGALYFKTEDLKNMKQQKHIAPEDMQKLRNIARIKLPLCTKGVKRKDCSSLTSSVRRSLFEFLKSENMAENSSLLDTLAWFVFHRYKHNRGVEEKHWNLSSHPCNSDTRNVLLEENEMQNYTFSSNDGDIYLSDIFRLHEIIDDDLGASGISGYVTGLVEHLMLLHIIRSLLDKNRQTLNETIFILDRPTGWFGVTAGMHRLMLDLNNWLFENHNLFLIGLEKSGAFVEHASQIQSKMENGSILILNDKYIYSYISPGHEDANRPYASTSYYGHKIIFKTKFGQMYVASLPVKDLKKNPDENDIPNLHEILSVIESLHCDMYENALLPIALTNKLVSLSAHPSTQILTNFAKATITK >LS483485.1|SQI99603.1|2010236_2012282_-|Domain-of-uncharacterised-function-DUF87 MLSISSSIANINLFERKFERENDPKSNWSWNTGIFVGRPFKISYTSSSILMADAWKEQANGVPQGCFLLAYYDCDPGKDNLQEALLLRVIEPAELPTDKDIVSSMVDYYKDHIRTGNTKQSQLDEYSRYEFGFSGLRCSILGSFYLDAKKNLRFGADVENFYAAHNYSVIKPSNEILGLIANYRENSVPGGNGDIRIGSIRYSSSQRFNNDIGDIPVYIQAKDFAGKRTALFGMTRTGKSNSIKKIIQANEQMSELAQYQLDKQNESPEEILKQFVGDAPKYPIGQIIFDINGEYANANLQDEGTAIFDIYQAKTDRYSIVEKDGFKVMKVNFYNEIEAGFELIKSYPLIADDTSKYMVNFKSVMLEKPYNYDSDRSAKTRYDRRLSVYKCILKAAGFKCSENEKVTFSVNKELLEEMDIPDVTSQSLKAGISLDLATEWWTALWDSYDSSVACNDYKQKKRKEWADDELKALLVILTRKSSSGGSIDRTGYLNLRPIVAQHTDKSQTPFEDDILQALRQGKIVIADLSSGDEELQRMYSERITRRIFRDSMNRFTNAQPNNFIQFYFEEAHNLFPKKDDKNLSQIYNRLAKEGAKLNLGLVYATQEVSSISSNILKATQNWFISHLNNEDEIKELRKYYDFSDFSDSLIRFSQNTDKGFVRVKTYSNSFVVPVQIDKFGK >LS483485.1|SQI99604.1|2012285_2013278_-|Modification-methylase-PvuII MKKNKFPRNISKVFERKSGALYLGDSLDLLQSKSFSNLNGKVNLIITSPPYPLNQKKSYGNKTGEEYLNWIKELTPLLAEKLADDGSLVVELGNSWEPGRPVQSLLALKALMSIAESAETNLRLIQEFVCYNPSRLPSPAQWVTVNPLRTVDSYTHVWWFSKTDYPKADNRKVLRPYSEAMKSLLKRGSYNAGKRPSEHSIGEKSFLNDRGGAISHNLFEIESIDENRKPRLPNAFSFSNSASNDFFHRECKRQNITVHPARMPIGLVKFFIQYLTDEGDLILDPFAGSNTTGFAAALLNRNWISIELQESYIEQAKIRFEDPILSYKEV |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
LS483485_2 | 2.3|2001505|34|LS483485|CRISPRCasFinder,CRT,PILER-CR | 2001505-2001538 | 34 | LS483485.1 | 2005263-2005296 | 0 | 1.0 |
1. spacer 2.3|2001505|34|LS483485|CRISPRCasFinder,CRT,PILER-CR matches to position: 2005263-2005296, mismatch: 0, identity: 1.0
acaacgagcaggcgtgggcggagaactatttgac CRISPR spacer acaacgagcaggcgtgggcggagaactatttgac Protospacer **********************************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
LS483485_2 | 2.4|2001571|33|LS483485|CRISPRCasFinder,CRT,PILER-CR | 2001571-2001603 | 33 | NZ_KX753679 | Pasteurella multocida strain RCAD0259 plasmid pRCADGH-2, complete sequence | 4504-4536 | 0 | 1.0 |
LS483485_2 | 2.4|2001571|33|LS483485|CRISPRCasFinder,CRT,PILER-CR | 2001571-2001603 | 33 | GQ866235 | Aggregatibacter actinomycetemcomitans strain D11S-1 plasmid S57, complete sequence | 610-642 | 0 | 1.0 |
LS483485_2 | 2.5|2001636|34|LS483485|CRISPRCasFinder,CRT,PILER-CR | 2001636-2001669 | 34 | NZ_KX753679 | Pasteurella multocida strain RCAD0259 plasmid pRCADGH-2, complete sequence | 3746-3779 | 0 | 1.0 |
LS483485_2 | 2.5|2001636|34|LS483485|CRISPRCasFinder,CRT,PILER-CR | 2001636-2001669 | 34 | NC_021724 | Aggregatibacter actinomycetemcomitans plasmid pS23A, complete sequence | 2286-2319 | 0 | 1.0 |
LS483485_2 | 2.5|2001636|34|LS483485|CRISPRCasFinder,CRT,PILER-CR | 2001636-2001669 | 34 | NC_002579 | Aggregatibacter actinomycetemcomitans plasmid pVT745, complete sequence | 25226-25259 | 0 | 1.0 |
LS483485_2 | 2.5|2001636|34|LS483485|CRISPRCasFinder,CRT,PILER-CR | 2001636-2001669 | 34 | GQ866235 | Aggregatibacter actinomycetemcomitans strain D11S-1 plasmid S57, complete sequence | 1364-1397 | 0 | 1.0 |
LS483485_2 | 2.4|2001571|33|LS483485|CRISPRCasFinder,CRT,PILER-CR | 2001571-2001603 | 33 | NC_021724 | Aggregatibacter actinomycetemcomitans plasmid pS23A, complete sequence | 1531-1563 | 1 | 0.97 |
LS483485_2 | 2.4|2001571|33|LS483485|CRISPRCasFinder,CRT,PILER-CR | 2001571-2001603 | 33 | NC_002579 | Aggregatibacter actinomycetemcomitans plasmid pVT745, complete sequence | 24469-24501 | 2 | 0.939 |
LS483485_2 | 2.1|2001371|34|LS483485|CRISPRCasFinder,CRT | 2001371-2001404 | 34 | MT475811 | Uncultured crAssphage clone CRB_ENV3 polymerase gene, partial cds | 283-316 | 7 | 0.794 |
LS483485_2 | 2.3|2001505|34|LS483485|CRISPRCasFinder,CRT,PILER-CR | 2001505-2001538 | 34 | MH825706 | Streptomyces phage Microdon, complete genome | 6503-6536 | 10 | 0.706 |
LS483485_1 | 1.1|1769952|46|LS483485|CRISPRCasFinder | 1769952-1769997 | 46 | NC_014633 | Ilyobacter polytropus DSM 2926 plasmid pILYOP01, complete sequence | 245215-245260 | 14 | 0.696 |
1. spacer 2.4|2001571|33|LS483485|CRISPRCasFinder,CRT,PILER-CR matches to NZ_KX753679 (Pasteurella multocida strain RCAD0259 plasmid pRCADGH-2, complete sequence) position: , mismatch: 0, identity: 1.0
atttatggtgcgattacattaattgttgttcct CRISPR spacer atttatggtgcgattacattaattgttgttcct Protospacer *********************************
2. spacer 2.4|2001571|33|LS483485|CRISPRCasFinder,CRT,PILER-CR matches to GQ866235 (Aggregatibacter actinomycetemcomitans strain D11S-1 plasmid S57, complete sequence) position: , mismatch: 0, identity: 1.0
atttatggtgcgattacattaattgttgttcct CRISPR spacer atttatggtgcgattacattaattgttgttcct Protospacer *********************************
3. spacer 2.5|2001636|34|LS483485|CRISPRCasFinder,CRT,PILER-CR matches to NZ_KX753679 (Pasteurella multocida strain RCAD0259 plasmid pRCADGH-2, complete sequence) position: , mismatch: 0, identity: 1.0
tttcaagttgttgtgcggttaataccgcactatt CRISPR spacer tttcaagttgttgtgcggttaataccgcactatt Protospacer **********************************
4. spacer 2.5|2001636|34|LS483485|CRISPRCasFinder,CRT,PILER-CR matches to NC_021724 (Aggregatibacter actinomycetemcomitans plasmid pS23A, complete sequence) position: , mismatch: 0, identity: 1.0
tttcaagttgttgtgcggttaataccgcactatt CRISPR spacer tttcaagttgttgtgcggttaataccgcactatt Protospacer **********************************
5. spacer 2.5|2001636|34|LS483485|CRISPRCasFinder,CRT,PILER-CR matches to NC_002579 (Aggregatibacter actinomycetemcomitans plasmid pVT745, complete sequence) position: , mismatch: 0, identity: 1.0
tttcaagttgttgtgcggttaataccgcactatt CRISPR spacer tttcaagttgttgtgcggttaataccgcactatt Protospacer **********************************
6. spacer 2.5|2001636|34|LS483485|CRISPRCasFinder,CRT,PILER-CR matches to GQ866235 (Aggregatibacter actinomycetemcomitans strain D11S-1 plasmid S57, complete sequence) position: , mismatch: 0, identity: 1.0
tttcaagttgttgtgcggttaataccgcactatt CRISPR spacer tttcaagttgttgtgcggttaataccgcactatt Protospacer **********************************
7. spacer 2.4|2001571|33|LS483485|CRISPRCasFinder,CRT,PILER-CR matches to NC_021724 (Aggregatibacter actinomycetemcomitans plasmid pS23A, complete sequence) position: , mismatch: 1, identity: 0.97
atttatggtgcgattacattaattgttgttcct CRISPR spacer atttatggtgcaattacattaattgttgttcct Protospacer ***********.*********************
8. spacer 2.4|2001571|33|LS483485|CRISPRCasFinder,CRT,PILER-CR matches to NC_002579 (Aggregatibacter actinomycetemcomitans plasmid pVT745, complete sequence) position: , mismatch: 2, identity: 0.939
atttatggtgcgattacattaattgttgttcct CRISPR spacer atttatggtgcaattacattaattgttgttccc Protospacer ***********.********************.
9. spacer 2.1|2001371|34|LS483485|CRISPRCasFinder,CRT matches to MT475811 (Uncultured crAssphage clone CRB_ENV3 polymerase gene, partial cds) position: , mismatch: 7, identity: 0.794
cgcctttttggtaggtttaaaaataaaaatgagt-- CRISPR spacer aatcttattggtgggtttaaaaataaaa--gagttg Protospacer ..*** *****.*************** ****
10. spacer 2.3|2001505|34|LS483485|CRISPRCasFinder,CRT,PILER-CR matches to MH825706 (Streptomyces phage Microdon, complete genome) position: , mismatch: 10, identity: 0.706
acaacgagcaggcgtgggcggagaactatttgac------ CRISPR spacer ccaacgcgcaggcgtgggcgaagaa------ggcgcagca Protospacer ***** *************.**** *.*
11. spacer 1.1|1769952|46|LS483485|CRISPRCasFinder matches to NC_014633 (Ilyobacter polytropus DSM 2926 plasmid pILYOP01, complete sequence) position: , mismatch: 14, identity: 0.696
ttctcaacatcaagttatgcgggaatagctcagttggtagagcaca CRISPR spacer ctaaggggcacaatgtatgcgggaatagctcagttggtagagcgtc Protospacer .* .. *** ****************************..
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
96115 : 106155
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >LS483485|96115:106155|DBSCAN-SWA AATGACAACTTCAATTCAACAACAAATTGACGACCTCAGAAAAACCCTGCGTTATCACGAATATCAATATCATGTGTTAGACGAGCCACAGATTCCCGACAGTGAATACGATCGCTTGTTTCATCAGCTCAAAGCCTTAGAACAACAACATCCCGCGCTGATCACCACCGATTCGCCGACCCAACGCGTGGGTGCGCGGCCATTATCGGAGTTTGCGCAAATTAAACATGAACTTCCGATGCTCTCGTTAGATAACGCCTTTTCTGATGAGGAGTTTCTGGCCTTCGTGAAACGTATTCAAGATCGTTTAGGACTTGTGCCAGAACCATTGACATTTTGTTGCGAACCTAAATTGGACGGATTGGCAGTCAGTATTTTGTATGTGAATGGTGTGCTGACCCAAGCGGCAACGCGCGGCGATGGCACAACGGGCGAAGACATTACGCAAAACATTCGTACCATTCGCAACATTCCCTTGCAACTACTAACAGATAATCCGCCGGCACGCTTGGAAGTGCGTGGTGAAGTGTTTATGCCGCACGACGGTTTCGAGCGCTTAAACGAACGAGCCTTGGAACAGGGCGAAAAAACCTTTGCTAATCCGCGTAATGCGGCGGCAGGTTCCTTACGCCAATTGGATCCGAAAATCACCAGTCAGCGCCCGCTCATGTTCAATGCTTACAGCATCGGCGTCGCCCAAAGGATAGACTTACCGCCTACTCATTTTGAACGCCTACAATGGTTGAAATCCATCGGCGTGCCGGTAAATAGCGAGATCCGTTTATGCGATGGCATTGAAAACGTGCTGAATTTCTACCGCACTATGATGGAAAAACGCAGTTCCTTGGGTTATGACATTGACGGTACCGTGTTAAAAGTCAATGACATTGAATTGCAACAAAGACTGGGCTTTATTTCCAAAGCACCGCGTTGGGCAATCGCTTACAAATTCCCAGCACAAGAAGAACTGACGGTATTAAATGACGTGGAATTCCAAGTCGGTCGAACCGGTGCCATCACGCCTGTAGCCAAGTTACAACCGGTGTTCGTTGCCGGCGTAACGGTAAGTAACGCAACCTTACATAATGGCGACGAGATCGAGCGTCTCAACATTGCTATCGGCGACACGGTGATTATTCGTCGTGCCGGCGATGTGATTCCACAAATTATTGGTGTCGTACATGATCGCCGCCCGGCCAACGCGCGCCAAATTGTCTTTCCAACGCACTGTCCGGTATGCGATTCCTTAATCGTACGCATTGAAGGCGAAGCGGTGGCACGTTGCACCGGCGGGTTATTCTGTGCGGCACAACGCAAAGAAGCGCTAAAACATTTCGTCTCACGCAAAGCCATGGACATTGACGGTGTAGGTGCCAAACTGATTGAGCAACTGGTGGATCGCGAATTGGTTCATACCCCGGCAGATTTGTTCAAGTTGGATTTAGCCACACTCACCCGCTTGGAACGCATGGGGTCGAAATCGGCTGAAAATGCGTTAGTCAGCTTAGAAAAAGCAAAACACACGACCCTCGCCCGTTTTATCTTTGCGTTGGGTATTCGTGATGTGGGTGAAGCCACGGCATTAAATCTGGCAAACCATTTCAAAACCTTAGAAGCGTTACAAAATGCCGATTTAGAACAGCTACAACAAGTGTCTGATGTGGGCGAAGTGGTTGCGAATCGCATTTTTGTGTTCTGGCGCGAAGAGCATAATGTTGCGGTGGTGAATGATCTCATCGCCCAAGGCGTGCATTGGGAAACCGTCGAAGTACAGGACGTGAAAGAAAATCCATTCAAAGACAAAACTGTGTTGCTCACCGGTACACTTTCCCAAATGGGACGCAACGATGCCAAAGCGCTGTTACAACAACTCGGTGCCAAAGTCAGCGGCAGTGTTTCGGCAAAAACCGATTATGTTATCGCCGGTGAAGCAGCAGGTTCCAAACTCAGTAAAGCGGCAGAATTAGGCGTGCAGGTGTTGAGTGAAGAGGAGTTTTTAGCGTGGGTTAATGGATAAAGTGTAAAGACAGAAATCCATTTATCCCAAAGCAAAAAATAAGGACAGGCATAAAGCCTGTCCCTACAAAAACACTTGAAAAATTACTCCCAATCGAATCAAGTTCGGTTGTTCGCCCCGAAGGGGCCGTTGGCAAAGCTAACGTTCAAAACCCGTTGGATTTTGTGACCGCACTTTCAGTGACCCTATTCGGCAATCCGTTCTTTATAATGCAATCGGCAGACCGATAAATAACTGTCGTTGCCACCAATTTGAATTTGATTCCCTTGTTTGACGACCTCGCCTTGTTCATTTAAACGCAATACGAAATTGGCTTTACGTCCGCAATAGCAAATGGTTTTAAGTTCTTCCAACTGATCCGCCCATGCCAACAAATAGCGACTCCCTTCAAAAAGTTCTGCTTGGAAGTCAGTGCGTAAGCCATAACATAAAACAGGAATTTTCAGTTTATCCACCACATCGCTCAGTTGATACACTTGTGCTTTGGTTAAAAATTGTGCTTCATCCACCAAAATACAATGCAACGGCTCTTTCGTTAAGTGTTGTTGAATTTCTGCAAATAAATCCGTGTCACGTGCGAAGGTGTTCGCCTGTTCACTGATACCAATACGGGAAGTAACGCGTCCGGCGCCAAAACGATCATCAATGGCAGCCGTATAAACGAGTGTGTTCATATTGCGTTCACGGTAGTTATAGGAAGATTGCAGCAAAGTTGTCGATTTTCCCGCATTCATGGTGGAATAATAAAAATACAGCTTGGCCATAATTATTTAAGCGCCCATTTCCAAAAGTAATAGCAGATAAAACCGGTTAACGGCGGTAACGAGGCAAGAATAAAAACACCAATAGTGGTTCGCCCACCAGCCATTTGCCCGACTTCCGTTAAGAAATCAATAAGTGCGTCTGTCGGCGTAGTAAACACAAAAAAACCGATAATCGCTAACATAACGAAGCAGCCGATTCCGGCGGCACGCATTGCTCTGAATTTAATGTTGTCCATAATGTTTCCTTGAATGAAGTGCGGTCAAAAAAACGTTTAAATTTTTGACCGCACTTTGATTTAGCATCGGGTTAAACGGCGGTTAATTCAGTCATTGCCCAACGCGGACGCACTTCGATAGCAAGATCTTGCTGCAGTCCTTGTTTTAAACGTAGAAAACCGGCATAGGCAATCATGGCGCCGTTATCAGTACAAAATTGCGGTTGCGGGTAAAACACTTCACCACCTAATTGTTGCATCAGCTCCGCCAAGGTTTGGCGCAACTGTTTATTGGCGCTCACGCCGCCGGCAATTACCAAACGCTTCAAGCCCGTTTGTTTTAACGCACGCTTGCATTTGATTGCTAGCGTATCCACCACAGCCTCTTGGAACGCATAAGCAATATCCGCCTTGCTTTGCTCGGTTAATTCCCCTTCTTCTTGCATAACTTGATGAAGCGTATTGGCGGCAAAGGTTTTTAAACCGGAAAAACTAAAATCCAAACCAGGGCGATCGGTCATTGGGCGAGGGAACGCAAAACGATTCGGCGTACCATTTAACGCTAAACGGGCTAATGCCGCTCCACCAGGATAATCCAAACCGAGTAATTTTGCCGTTTTGTCAAACGCCTCCCCTGCGGCATCATCAATAGATTCGCCTAATAATTCGTAGCGTCCGACGCCGCCCACACGCACCAATTGAGTATGCCCGCCGGACACCAACAATGCCACAAAAGGAAAGTGCGGTGGATTTTCTTCCAACATTGGCGCCAGTAAATGCCCTTCCATGTGATGTATGCCGATCGCCGGTACATTCCACGCATAAGCCAAGGATCGCGCCACCGTGGAGCCAACTAACAACGCGCCGACTAAACCCGGACCGCAGGTATAAGCTACGCCGTCAATGTCTTTGGCGGTAAGATTGGCTTCTTGTAAGGCGGCTTGTAATAATGGCGCTAATTTCCGGATATGATCGCGTGAGGCTAGCTCCGGCACAACGCCGCCATAATCGGCATGCAACGCAATTTGCGTGTGTAACTGATTGGCAATCAAGCCTTTTTCTTCATCATAAATGGCAACGCCCGTTTCATCACAGGACGTTTCAATGCCTAAAATTCGCATTTTGAATCTCTTTTTACTCTGTTCAATGAGGCTGAATTTTACCTTTTTTACAAGGATTTAACCAGTTTTCAAACGGGGATTTGCGGAAAAGGCGAATCTTTCCTTTACTTTCATGGTGACTTTGGATTAAAATTGCAACCTTTATTGAATCTGCCACGACTTGTGGCAACAAATAAATTTTAAATTGCAATTGAATTAATTAAACTCATTGAGGTGATTGGCTTATGCCTGTAATTAAAGTTCGTGAAAATGAATCCTTTGACGTAGCATTACGTCGTTTCAAACGCTCTTGCGAAAAAGCTGGTATCTTAGCAGAAGTTCGTGCTCGTGAATTCTACGAAAAACCAACAACGATTCGTAAACGTGAAAATGCAACCCGCGCAAAACGTCACGCTAAACGCGTAGCTCGTGAAAATGCACGCAACACACGTTTATACTAATTAACAGTATTTTTTAACTCGAGTTATAAAAACCGTGAATCTTCCGAGGCTCACGGTTTTATTTTCTCTCAATCTCATCAGTTAGGCTCATATTTCGTTCACAATAAGAGGCAGAATACTCGATGAAAGGCACCATTCCACGTACATTTATAGACGATATCTTAACTAAAGTTAATATCGTTGATCTGATCAATTCCAGAGTCAAACTGAAAAAAGCCGGCCGTGATTATCAGGCGTGCTGTCCGTTTCACCATGAAAAGACTCCTTCTTTTACCGTCAGTGATAAAAAGCAGTTTTATCATTGCTTCGGTTGTGGTGCGCACGGCAACGCCATTTCCTTTTTGATGGAATATGACAAGCTGGAATTTGTGGAAGCGGTGGAAGAACTGGCCGGTTTTCTTGGGTTGGAAATTCCCTACGAAAAACGACCGCACTTTAACGAGAGCGGCAAACAAGTCGGCTATCAAACCAAGCGTAATCTGTATGAGTTAATGCAGGAAATCGCCAAATTTTATCAACAACAATTGCCGTTAAATATTCCTGCGCAAAGTTATCTGCAACAACGTGGTTTATCAGCGGAAATTATTGAGCGTTTTCAAATCGGTTATGTGCCGAATGCCATGGATACCGTTTATCGCCAATTCGGTAAAACTCGTGAAGAGCAACAAAAACTGTTCGATTTAGGCATACTATCACGCAACGATCGCGGCAATGTGTACGACAAATTTCGCAATCGGATTATGTTCCCGATTCGCGATCGTCGAGGTCGCACCGTGGCTTTCGGCGGACGCGTGTTAACCGATGAGAAACCGAAATATTTGAACTCGCCGGAAACCGTGACCTATCACAAAGGCAGTGAATTGTACGGTTTATTTGAAGCCCTACAAGCCGACGATTCACCACAAAAATTACTGGTTGTTGAAGGTTATATGGATGTGGTGGCGTTGGCCCAATTCGGTGTAGATTATGCCGTGGCCTCTCTTGGCACATCAACAACCTCGGAGCAAATTCAATTACTCTTTCGCTCAACAGAACAAGTGATTTGCTGTTATGACGGCGATCGCGCGGGGCGTGATGCGGCATGGCGAGCCTTAGAAAATGCACTGCCTTATTTGGAAGACGGCCGTCAACTCAAATTTATCTTTTTACCTGACGGCGAGGATCCCGATACCTTTATTCGCCAATTTGGCAAAGAGGGATTCGAGGAATATCTCAATAACGCACAATCTTTAAGTGAATTTTTATTTGCTCATTTGACGCCACAAGTGGATTTCTCCAGCAAAGAAGGGAAAAACAAACTGGCGGCATTAGCAGTACCGTTAATTAAACAAATTCCGGGCGATATGTTACGTTTGGATTTGCGTAACACGTTGGCAAAAAAACTGGGGATTCTCGATCCGACGCAACTGGAAAGCCTTATTCCAAATCAACAGAAAACAGAAAACACACCGACAGCCCAACCGATACAATTTAAGCGAACCCCAATGCGTGTGCTGATCGCATTGCTGTTACAAAATCCGGAATTGGTGAAATTTGTACCCGATTTGGAATCTTTTCGTTCGTTAAATGAGCCGGGCTACGATTTGTTTGCAGAAATGACCGCACTTTGCCGTGAAAAAGTGGGTATTAGTTCCGGACAACTGTTAGAACACTGGCGCGATACACCTCAACAAAATACGCTTGAAAAACTGGCCACATGGAACCATTTGGTTGAAGAAGACAAGATTGAAGATACCTTCCGCGAAACATTACGTTATTTTTATCTACAGATCATTGATAAACGAATAAATTGGCTAATTGCTAAGGATCGTAGCGAAGGATTAAATCTTGATGAGAAAAAAGAACTTTCAACATTGTTGTTGGTAAAAAAACGCGAAAAAGAACACGAAAGAAATAGTTAAACCGAAGGAAGAATGCTAAAATCTTGGCGTTTTATCTTCACTAAATTAAGTAAGCAAGGCGGATATCAAATATGGATCACAATCCACAATCTCAATTGAAACTACTCATCGCCCAAGGGAAAGAGCAAGGCTATTTAACGTATGCCGAAGTCAATGACAGCCTGCCCGAAGAACTCGTCGATGCCGATCAAATTGAAGATATCATTCAAATGATCAACGACATGGGGATTCAGGTGTTGGAGACTGCACCGGATGCCGATGATCTGATGCTCAATGAAACGATTACTGATGAAGATGTCGTTGAAGAAGCCACACAGGTGTTATCCAGCGTTGAGGCCGAGTTAGGCCGTACAACCGACCCTGTGCGCATGTATATGCGTGAGATGGGCAGTGTGGAATTGCTTACCCGCGAGGGCGAAATTGATATTGCCAAACGTATTGAAGAAGGTATCAATGAAGTACAAAGTGCTGTTGCCGCTTATCCTGAAGCGATCACTTATTTAATTGAACAATACGAATCAGTAGAAAATGGCGGTGTTCGCTTGGCTGATTTAATTACCGGTTTTGTCGATCCAAACGTATTGAGCGAATCTGATAACACCCACTTAGATGAAAATTTTGATTCCGATGAAGAAAATGAAGAAGATGTCGGCGATAATGGGTTAGATGATGAAAGCGAAGATGAAGAAGATAGAGAGGAAAACAGTAGCGACGATGGTGATAGCGATAACAGCATCGATCCCGAAGTTGCACGCGAAAAATTCACCGCACTTAAAGAACAACATCAAAAAACCTTGGCAAGCATTGAAAAACATGGTCGCACATCGAAAAAGACCAAAGATGAAATTCAAGCCTTGTCAGATATTTTCACTCAATTCCGTTTAGTGCCAAAACAGTTCGATATTCTTGTGCTATCCATGCGTGACATAATGAAACGCATGCGTGCGCAAGAACGCTTTATTCAGCGAATCGTGGTCGATAATGCCAAAATGCCTAAATCCAGTTTCCAAAAGAGTTTCATCGGACATGAAACTACCGATACTTGGTTGATTAAAGCCTTGGGCGCTGGCAAAGCATGGTCTGAAAAACTAGTACAATATGAAAATGATTTGCGTCAAGCCATCGCAAATTTAGTACAAATTGAGCAAGACACTCATCTCACTATTCAGCAAATTAGAGAAATCTGCGAACGCATTGCACAAGGTGAGTTAAAAGCACGTCGTGCAAAGAAAGAAATGGTGGAAGCCAACTTGCGTTTGGTGATTTCCATTGCAAAAAAATATACCAATCGTGGATTGCAATTCCTTGATTTAATTCAAGAAGGTAATATCGGCTTAATGAAAGCAGTAGATAAATTTGAATACCGTCGTGGTTACAAATTCTCCACTTATGCCACTTGGTGGATTCGTCAGGCGATTACCCGTTCTATTGCGGATCAAGCACGGACAATCCGTATCCCGGTACACATGATTGAAACGATTAACAAGCTAAATCGTATTTCCCGCCAAATGTTACAAGAAATGGGACGTGAAGCCTCACCAGAAGAATTGGCGGAGCGTATGGGTATGCCTGAAGATAAAATCCGTAAAGTACTGAAAATTGCGAAAGAACCAATCTCTATGGAAACCCCTATCGGAGATGACGATGATTCCCATTTAGGTGATTTCATTGAAGACTCCACCTTAGAGCTTCCGTTAGATTCCGCCACCGCACAAAGCTTAAAAGCGGCCACACATGAAGTGCTGGAAGGTTTAACGTCACGTGAAGCGAAAGTTCTTCGTATGCGTTTCGGTATCGACATGAACACCGACCACACGTTAGAAGAGGTTGGCAAACAATTTGACGTTACCCGTGAACGTATTCGTCAGATTGAAGCCAAAGCATTACGCAAATTGCGTCATCCAAGTCGCTCAGAAACGTTGCGTAGCTTCTTAGATGAGTAGTGAAACAAACCCTAATAGTACAAAAAGGATAAGTCATAATGACTTATCCTTTTTGTTTACACATAATTATCATAATAGCTAGACTTTAGCGTAATATTTCCCTATAATCTGCACCTCAAATCAATGCTCACCGCCCCCATAGCTCAGTCGGTCAGAGCAGTCGACTCATAATCGATTGGTCACAGGTTCAAGTCCTGTTGGGGGCACCAACTTAATAATTCATGCTGCTCCATTCTGATTCATTTCAATTCAAAAACACTTAAAAATCAACAATTTTAATTGATTCAATAGCGCATTATGCTCCATTCTGATTCATTTTGATTTTGCTTTTTCGTACCCGATATAGTACCCTGGCACAAATTTACTAGTTTTTTGGGTACTAAAAACAAGGCAAAATCATGGCAAGAATAGTGAAGGGTTTAACCAATACACAGGTTGAACGAGCAAAATACACACCTAATGGAACAAATGAATTAAACGATGGCAAAGGGCTATTTCTTCAAATGTATCCAACTGGGGCGAAGAAATGGCGCTTCCGCTATGAAAGACCAATCACCAAAGCGCGCACCAAATTTAATATAGGCGATCACCCCTCTATAACACTAGCCCAAGCACGCGCCAAAAGGGACGAATACAATGCTTTATTAGCGCAAGGGATAGACCCGCAAGAACACGCCAAACAGAAACACCAAGCCTTACAGTTTCAGTTAGAAAATACCTTTCTCAAATGGGCGGAACGTTGGAAAGAAAATAAAGAAAAGAAAGTCAAAGCGGATACCTTAAGGAAAGATTGGCGACGGATTGAAATGTACCTGTTAGAATCACTGGGGCAAATCCCTATTGATAAAGTGCTTCCTCCCCTACTGATTCAGGCCCTTCCCCCCTTGGAAGCCATGAAAGACCGCAGAACAGGCACGGCAGATAGCGACACCTTGAAGCGAGTTATCCGCTTAGCCAATGAAATTCTAACCTATGCAATGAACGCCGGGGCAATTCCTTTTAATCCGTGCTTAAGCGCAAAAGATATTTACAGTTTCGCCCCGGCGGAAAGTCATCCGCACATTGAGCCGGCAGAATTGCCCCTATTGCTAACGGATATAAGCGAATCAAAGGCACAACCAAGAACAAAGGATTTAATCTTGTTTCAACTTTTAACCATGGTGCGCCCATCAGAAGCGAGTAACGCAGAATGGGCCGAGTTTGATTTAGAAAACAAGGTTTGGACGATACCCGCCGAAAAAATGAAAATGAAGCACCCGCACAAAGTCCCGCTTTCTAGTCAAACAATTAGACTGCTTCAACACTTACAATCGCAAACAGGGCACAAGCGTTTTGTATTTGCCAGTAGAAATAAAATAAATGAGCCGATGAATTCCCAAAGCGTCAATAAAGCCCTTGTTGATATGAGCTATAAGGGCAAACAAGACGCGCACGGATTGCGCTCCATAGGAAGAACCTACATTGGCGAGAAACAAATGGACGGTTACGAAGTGCTAGAAATGTGTATTGCCCATAAAGTCGGCACAAGCACCGGCAAAATCTACGATAAAGCAGACTTCTTTGAACAACGAATCCCAATTATGCAAGCATGGGGGGATTTTGTAGAACAGTGTGCGGAACGCACTTCTAGTAAAAGCTAG
Protein sequences of DBSCAN-SWA_1 >LS483485|96115:106155|102643_104515_+|SQI92257.1|DBSCAN-SWA MDHNPQSQLKLLIAQGKEQGYLTYAEVNDSLPEELVDADQIEDIIQMINDMGIQVLETAPDADDLMLNETITDEDVVEEATQVLSSVEAELGRTTDPVRMYMREMGSVELLTREGEIDIAKRIEEGINEVQSAVAAYPEAITYLIEQYESVENGGVRLADLITGFVDPNVLSESDNTHLDENFDSDEENEEDVGDNGLDDESEDEEDREENSSDDGDSDNSIDPEVAREKFTALKEQHQKTLASIEKHGRTSKKTKDEIQALSDIFTQFRLVPKQFDILVLSMRDIMKRMRAQERFIQRIVVDNAKMPKSSFQKSFIGHETTDTWLIKALGAGKAWSEKLVQYENDLRQAIANLVQIEQDTHLTIQQIREICERIAQGELKARRAKKEMVEANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKLNRISRQMLQEMGREASPEELAERMGMPEDKIRKVLKIAKEPISMETPIGDDDDSHLGDFIEDSTLELPLDSATAQSLKAATHEVLEGLTSREAKVLRMRFGIDMNTDHTLEEVGKQFDVTRERIRQIEAKALRKLRHPSRSETLRSFLDE >LS483485|96115:106155|104913_106155_+|SQI92263.1|integrase|DBSCAN-SWA MARIVKGLTNTQVERAKYTPNGTNELNDGKGLFLQMYPTGAKKWRFRYERPITKARTKFNIGDHPSITLAQARAKRDEYNALLAQGIDPQEHAKQKHQALQFQLENTFLKWAERWKENKEKKVKADTLRKDWRRIEMYLLESLGQIPIDKVLPPLLIQALPPLEAMKDRRTGTADSDTLKRVIRLANEILTYAMNAGAIPFNPCLSAKDIYSFAPAESHPHIEPAELPLLLTDISESKAQPRTKDLILFQLLTMVRPSEASNAEWAEFDLENKVWTIPAEKMKMKHPHKVPLSSQTIRLLQHLQSQTGHKRFVFASRNKINEPMNSQSVNKALVDMSYKGKQDAHGLRSIGRTYIGEKQMDGYEVLEMCIAHKVGTSTGKIYDKADFFEQRIPIMQAWGDFVEQCAERTSSKS >LS483485|96115:106155|100455_100671_+|SQI92239.1|DBSCAN-SWA MPVIKVRENESFDVALRRFKRSCEKAGILAEVRAREFYEKPTTIRKRENATRAKRHAKRVARENARNTRLY >LS483485|96115:106155|98316_98895_-|SQI92236.1|DBSCAN-SWA MAKLYFYYSTMNAGKSTTLLQSSYNYRERNMNTLVYTAAIDDRFGAGRVTSRIGISEQANTFARDTDLFAEIQQHLTKEPLHCILVDEAQFLTKAQVYQLSDVVDKLKIPVLCYGLRTDFQAELFEGSRYLLAWADQLEELKTICYCGRKANFVLRLNEQGEVVKQGNQIQIGGNDSYLSVCRLHYKERIAE >LS483485|96115:106155|98897_99131_-|SQI92237.1|DBSCAN-SWA MDNIKFRAMRAAGIGCFVMLAIIGFFVFTTPTDALIDFLTEVGQMAGGRTTIGVFILASLPPLTGFICYYFWKWALK >LS483485|96115:106155|99202_100231_-|SQI92238.1|DBSCAN-SWA MRILGIETSCDETGVAIYDEEKGLIANQLHTQIALHADYGGVVPELASRDHIRKLAPLLQAALQEANLTAKDIDGVAYTCGPGLVGALLVGSTVARSLAYAWNVPAIGIHHMEGHLLAPMLEENPPHFPFVALLVSGGHTQLVRVGGVGRYELLGESIDDAAGEAFDKTAKLLGLDYPGGAALARLALNGTPNRFAFPRPMTDRPGLDFSFSGLKTFAANTLHQVMQEEGELTEQSKADIAYAFQEAVVDTLAIKCKRALKQTGLKRLVIAGGVSANKQLRQTLAELMQQLGGEVFYPQPQFCTDNGAMIAYAGFLRLKQGLQQDLAIEVRPRWAMTELTAV >LS483485|96115:106155|100793_102572_+|SQI92254.1|DBSCAN-SWA MKGTIPRTFIDDILTKVNIVDLINSRVKLKKAGRDYQACCPFHHEKTPSFTVSDKKQFYHCFGCGAHGNAISFLMEYDKLEFVEAVEELAGFLGLEIPYEKRPHFNESGKQVGYQTKRNLYELMQEIAKFYQQQLPLNIPAQSYLQQRGLSAEIIERFQIGYVPNAMDTVYRQFGKTREEQQKLFDLGILSRNDRGNVYDKFRNRIMFPIRDRRGRTVAFGGRVLTDEKPKYLNSPETVTYHKGSELYGLFEALQADDSPQKLLVVEGYMDVVALAQFGVDYAVASLGTSTTSEQIQLLFRSTEQVICCYDGDRAGRDAAWRALENALPYLEDGRQLKFIFLPDGEDPDTFIRQFGKEGFEEYLNNAQSLSEFLFAHLTPQVDFSSKEGKNKLAALAVPLIKQIPGDMLRLDLRNTLAKKLGILDPTQLESLIPNQQKTENTPTAQPIQFKRTPMRVLIALLLQNPELVKFVPDLESFRSLNEPGYDLFAEMTALCREKVGISSGQLLEHWRDTPQQNTLEKLATWNHLVEEDKIEDTFRETLRYFYLQIIDKRINWLIAKDRSEGLNLDEKKELSTLLLVKKREKEHERNS >LS483485|96115:106155|96115_98131_+|SQI92235.1|DBSCAN-SWA MTTSIQQQIDDLRKTLRYHEYQYHVLDEPQIPDSEYDRLFHQLKALEQQHPALITTDSPTQRVGARPLSEFAQIKHELPMLSLDNAFSDEEFLAFVKRIQDRLGLVPEPLTFCCEPKLDGLAVSILYVNGVLTQAATRGDGTTGEDITQNIRTIRNIPLQLLTDNPPARLEVRGEVFMPHDGFERLNERALEQGEKTFANPRNAAAGSLRQLDPKITSQRPLMFNAYSIGVAQRIDLPPTHFERLQWLKSIGVPVNSEIRLCDGIENVLNFYRTMMEKRSSLGYDIDGTVLKVNDIELQQRLGFISKAPRWAIAYKFPAQEELTVLNDVEFQVGRTGAITPVAKLQPVFVAGVTVSNATLHNGDEIERLNIAIGDTVIIRRAGDVIPQIIGVVHDRRPANARQIVFPTHCPVCDSLIVRIEGEAVARCTGGLFCAAQRKEALKHFVSRKAMDIDGVGAKLIEQLVDRELVHTPADLFKLDLATLTRLERMGSKSAENALVSLEKAKHTTLARFIFALGIRDVGEATALNLANHFKTLEALQNADLEQLQQVSDVGEVVANRIFVFWREEHNVAVVNDLIAQGVHWETVEVQDVKENPFKDKTVLLTGTLSQMGRNDAKALLQQLGAKVSGSVSAKTDYVIAGEAAGSKLSKAAELGVQVLSEEEFLAWVNG |
8 | Serratia_phage(16.67%) | integrase | attL 96041:96057|attR 108738:108754 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1466805 : 1492402
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >LS483485|1466805:1492402|DBSCAN-SWA CATGGTAAAAAATTTAATTCTTTGGGTGGTTGTTGCAGTTGTGATGATGACTGCATATCAAAGTTTTAATGCCGCTTCTAGCGGGGGAATAACGGACTATACAACGTTTATTTCAGATGTTGAAAATAATCAGGTACGTCAGGCTAAATTTGAAGACAATGAAATTTTAGTCACCAAAGCCGACGGCGCTAAATATACAACAGTTATTCCATTGGAAGATAAAGATCTTTTGAATGACTTATTGAAGAAAAAAGTAAAAGTTGAAGGCACCCCGCCAGAAAGAAGAGGATTGTTATCTCAAATTCTGATTTCTTGGTTCCCGATGTTGTTATTAATCGGAGTTTGGGTTTTCTTCATGCGTCAAATGCAAGGCGGTGGTGGTAAAACTATGAGCTTTGGTAAAAGTCGTGCTCGCATGATGACACAAGAACAGATAAAAACCACTTTTGCAGATGTAGCCGGTTGTGATGAAGCAAAAGAAGAAGTCGCTGAAATTGTTGATTTCTTACGTGAGCCAAAAAAATTCCAAAATCTTGGCGGTAAAATTCCAAAAGGGATTTTAATGGTTGGCCCTCCGGGTACGGGTAAAACCTTATTGGCGAAAGCCATTGCCGGGGAAGCAAAAGTGCCTTTCTTTACTATTTCCGGTTCTGATTTCGTAGAAATGTTTGTTGGGGTAGGGGCATCCCGTGTGCGTGATATGTTTGAACAAGCTAAGAAAAATGCGCCTTGTTTGATTTTTATCGATGAAATCGATGCCGTCGGTCGCCAACGTGGTGCTGGGCTTGGTGGTGGTCATGATGAACGCGAGCAAACCTTAAACCAAATGTTGGTTGAAATGGATGGATTTGAAGGTAATGAAGGCGTTATCGTGATTGCTGCAACTAACCGTCCGGATGTTTTAGACCCTGCGTTAACCCGCCCGGGGCGTTTCGACCGTCAAGTTGTGGTTGGATTGCCTGATGTGAAAGGTCGTGAGCAAATTCTTAAAGTTCACATGCGAAAAGTACCTGTTGGGCCGGATGTTGATGCAATGACGCTTGCGCGTGGTACGCCAGGCTATTCAGGTGCAGATTTGGCAAACTTGGTGAATGAGGCGGCGTTATTTGCTGCACGTACCAATAAACGTATCGTTACGATGGTTGAGTTTGAAAAAGCCAAAGACAAAATCAACATGGGGCCTGAACGTCGCACCATGATCATGACGGATAAGCAAAAAGAGTCCACTGCGTATCACGAAGCGGGTCATGCTATCGTTGGTTATTTAGTACCGGAACATGATCCTGTGCATAAAGTGACTATCATTCCACGCGGACGTGCCCTAGGGGTAACCTTCTTCTTGCCGGAAGGAGATCAAGTGAGCATCAGCCAAAAACAATTGGAAAGTAAACTTTCTACATTGTATGCAGGTCGTTTAGCAGAAGATTTAATTTATGGCGAAGAAAATATTTCGACCGGAGCATCTAACGATATTAAAGTGGCAACTAATATTGCGCGCAATATGGTCACCCAATGGGGTTTCTCCGATAAACTTGGCCCAATTCTTTATACAGAAGATGATGGTGAAGTTTTCCTCGGTCGTTCAATGGCGAAAGCAAAACATATGTCCGATGAAACGGCTCACGTGATTGATGAAGAGGTTCGCGCAATTGTTAACCGTAACTATGAGCGTGCAAGACAAATTCTGATCGATAATATGGATATTTTACATGCCATGAAAGATGCTTTAGTAAAATATGAAACTATCGAAGAAGAGCAAATTAAGCAACTTATGAATCGTGAACCGGTGACTCCGCCATCAGGTTGGGAAGAGCCGAGAGACAATGACAACAAGGCGCAGCCGCAACAACCAAAAGCCGAAACGCCAAAAACAGAAGATCGGGAATCTACAAAAGATACACAAAGTGCGGTTGAAAAAGATACCGATTCCGAATCTTTATAATGCTAATTAAAAAGCCTTTCTCCTTTGTTAGAAAGGCTTTATTATTACAACCCACTTTTTGAAGTGGGTTTTATTTTTTGGAGACAAAATGGAACACAAAATCGAAGATTTAATTGCAATTTTCAATCAATGTTTTGAACAGGAATACAACACTAAATTGGTCAAAGGCGGAGATGAACCGCTATATGTGCCGGCAAATGAGGATTGCCCTTACAACGCCATCTATTTTGCTCGCGGATTTTACAGTAGTGGATTACACGAGATTGCGCACTGGCTGGTTGCCGGCAAGGAGCGGCGTAAATTAGAAGATTTCGGTTATTGGTATGAACCCGATGGGCGAACAGAAGAGCAACAGCGCTTATTTGAAAAAGTAGAAGTCAAACCGCAAGCGCTTGAATGGATCCTCGCCACAGCGGCCAATTTCCGCTATTTCGCCAGCTCCGATAACCTTAACGGCCAACCGGGCGATACCCAACCTTTTAAGCTGGCGGTGTATGAACAAGTCAAAACTTATGCGACTAAAGGGCTGCCGAAACGTGCTGAAACCCTCCGTCAAGCCTTAGCTAAGTTCTATGGCACGGAAAATAAAATTGATTTAACCAAGTTTGATGTAACTTGATGTAACAAGAATTTAGGACAAGAAATAAAAGTGCGGTTAAAAAACGAAGTGAATTTTAACCGCACTTTTGCTTTTATTATTGACGAGCCAATCTTAAATTTGGTGGTAATGGCTCAAAATTTGAACGATAAGGATTGATATCTAGGCCACCGCGGCGGGTATAGCGTGCGTAAACAGTGAGTTTTTCCGGTGCGGCGAAGTGCATTAAATCGCAAAAAATACGCTCGACACATTGCTCATGGAATTCGTTATGTTGACGGAAGGAAATGATATAACGCAATAGCTGTTCACGGTTGATTTGTTTGCCTACATAATGAATTTGCAGACTGCCCCAATCCGGTTGTTGGGTAATGAGGCAGTTAGACTTGAGTAAGTGGCTAACCAACGTTTCTTCTATAATATTGTCATTTGTACAATTTTTCAGCAATTCCGCATTAAATGCATAATCCTCAATTTCAATATTCAGCCCATCAATACAGTCGCCGTCTAATGCAACGATGGGTTGTTGAGTATAATCCGTTAAGGAATTTAACCGCACTTTCACCTCACCTTTAGCGCAATCTTGCAAATCCTGCTGCAAAATTTGCTGAACCTCATCGAAATTGGCAAATTGGGTTTGATTAAAGCTATTTAAATAAAGTTTAAAGCTTTTAGATTCGATCAAATTTTCACTACGAAAATCAATGCTGACATCGGCAATTGCCACTTGTGGAACTCCTTTCGGGTTAAGCCACGAAATTTCGTATGCCGTCCAAATATCCGCACCTATGGTGAAAGGTTGAGTTGTTGTAATGTTCAGCATGTCGCGGTTTAAGTGACGTGGAACGGGTTGTAATAAAGTGCGGTCATATTTTTCCGCATATTTTGTTTGTTGACCGAGTTTAAGGGCGTTTAGGCTTTTATCTTGATAATGCATGGGATTCCTTATTGCCAGTGCCATGAGAGATTGGAAACGAGACGGCAATGATCACATTTAAAATATAAGAAGTCAAAAAGTGGTTGCTCCAGTGTCCACGCACGTTTGCATAATGGACAATAGGAATGTAATTCGGTTTCTCGGCTTTCACCGCCGATTCGATACAAATAATAGTATGTGGGGATGCCGCTGTAGCGAGTAATTTCTTGAGCTAAGTGATAACCATGTTTAAACAAATGAGTTTCCGTACTGCTAATTTGTTCCAATGCTTCTTTTTCTAACACGGAACCATTCATTTGCAACTGATCGCACGCCTGCCAATTTTCTTGCCATTTGATGGTGTCCATTGCCAAGTGCGGTATATTTTTCAGTTGTTTATACAACGGAAGCGGTTGCAAAGTATCACCGCTATGTAATGGTGAGCAAGATTGTAAATAGGTCGTGTAGAGCAGTTGCCAAGAAGGTGTTTCCTGCGCGGTAATGTCGGAATTGAGATCTTCCGCTACGATTTGAAAACTCTCGAAAAACACACCGCACTTTTTTGCTTCTAGAAGGGCTTGGGTGACCGGCTGATTGTTAAATTCAGCCAATAATGAGGTTTGTTCGGGACAAGTCACACGCACGGCCAAGCCTTGTTGATTTTCCTGCTCAGCAAGAAATTGCGGGATTTCACGTCCAATAATCTGTCCGTTATAACGCCATTGCTCAATCAGTTGATTGACGCAATGGGCTTGTTTTTCCAGTGAAAAATCACTTGTTTGAAGAGAAAAAAAGGCTTCGATTAAATACATAATAGATCACGTTAATATGAAAAGCCGAATTTTACACTAAATTCGTTCAACTGGCGCAAAAGCATGAAATCTTCTACAATGGCGCGTTTATTTTTATGAGGTGTTTTATGCATCAACAAACCCGATTACATTTGCAACATTTACAGCATACTATGGAACGCTTGGCGCTGTGGCAAAGTGTTCCGCCACAAGAAGCGGCTTTTTTAAGCGAACAACCTTTTGCGTTGGATACGATGAATCCGACGGAGTGGTTGCAATGGATTTTTATTCCGCGGATGCATGCTTTGGTTGAAAGCCAAGCGCCATTGCCGCGCCAAATCGCTATTAGCCCTTATTTGGAAGAGGCTTTGAAAGAAGAGGATTATTTAGCGGAATTATTAATCCCGATCATGGAAATTGAAAAGCTGTTGCAACAACAATGTTAGAGATTTTATATCAGGATGATGTATTGGTCGCCGTAAACAAACCGGCGGGAATGTTGGTACATCGCAGTTGGCTTGATCGCCATGAAACACAATTTGTTATGCAAACCTTACGCGATCAAATCGGGCAGCTGGTTTATCCTATTCATCGATTAGATCGTCCGACATCCGGCGTTCTGCTTTTTGCCTTAAACAGTGAAATAGCTAATTTGTTATGCCAACAGTTTGAGCAAAAACAAACGGGAAAACAGTATTTAGCTGTAGTTAGAGGATATGTGACAGGGCATGGCGAAATTGATTATCCGCTAAAGATTCAGTTAGATAAAATTGCCGATAAGTTTTCTCAACAGGATAAAGCACCACAAAGTGCGGTCACTTTTTATGAAGGATTGCAGACTGTGGAAATGCCTTACGGTGTAGGGCGTTATGCCACTTCCCGTTATTCTTTGGTGCGTTTGGTACCGAAAACGGGGCGTAAGCATCAGTTGCGTCGTCACATGAAGCATATTTTTCACCCGATTTTAGGGGATACACAATATGGTGATTTACATCAAAATCGTGCTTTAACGGAGCACACCGGTTGCTCCCGTTTGATGTTACACGCGGAAAAATTAACCTTTGTTCATCCGTTGACTCAAGTTCCCATTACGATTCAGGCTGGATTGGATGTGCAATGGCAGAATTTAATGCAAACTTTTCAATGGTAAAAAATATATAAAAAAACGACCGCACTTTTTATGAAAATAAGAAAGGAAAGAAAATGTTAGATAATGATTTATTAAACCTTACTCATGAACAGCAACAACGTGCTGTTGAGAAAATTCAAGAATTAATGGCTCAAGGCATTGGTAGCGGAGAGGCGATAGCCTTAGTGGCAAAGCAATTACGCGAACAGAATAAAAATACGCAAAATCATAAATAATCGGCTGAAAAACCGGAAAAGCTGAAGGGTTTTTGCACGTTATTTTTATTAATTTTTGTGATCTCTGTTTGATTTTCAAACTTTTTTTATTGCATGAGCGATAAATTTATTTAAAGATGGGCACCAGCAAATTAAGGTGATAGGGCAGTATGTCTTATTGAAGTAAAGGTAGTTCTAAATGGACTAAAAAACCATAAGAGGTTTAAATTATGGAAGTTGGTGTTGTTAAATGGTTTAATAATGCGAAAGGATTTGGGTTTATTTCTGTAGAAGGCAGTAATACCGACATTTTTGCACATTATTCTGTCATTGAAATGGAAGGCTATCGTTCCTTAAAAGCGGGGCAAAAAGTACAGTTTGAAGTCATTCATGGCGACAAAGGTTCTCATGCCACAAAAATTATTCCGGTTGTAGAATAACTATAAAAATTTCCTTATTTAGTCATTGAAATTTAAAGACAAGCACCTTTCGGTCTTGTCTTTTTTTATTTCAGTAAATCTTTCAACCCGGCTTTCATGGCTGACATTTGGCTTTCGTATAAAGCTTTCTTTTCACGTTCATCAATCATGTAAGAAATAGTTTCCGACAATGTCATTTTCATTTTGCGGGAATATTTGGATAAACGTAGCCAAACGGCATATTCCAAATCAATGGATTTTTTCTTGGTGGATTGTTTTTCACCGTTAAAGAAACGCTTTCTTCTGGCGCGAATTGCCTGATCGAGCTTAATGACCAATTCCGGTGCCAAATGGGTTTTTATCCATTCTTCGATTTGTTGCGGATAGTTTTGGCAGCCGATGAGTTGCTTTACGATGTCTTCTTGTAAACTTTTTTCGGCGTAACGGGTGATATTTTCGCCTTCACGGGCTTTTTTGGTTAAATAAACCCATTTCCAATGTGCTTCTTGATTTTCTAGCTTTTGATATTTCATTTTTATACATGATTTTAGTGACGTGGTAACTGTGTCACTATACGCTTTTTGTGACGGATTTTCTAGCAAATTTTTACTCAATACGCATAAATATGCGATAATGCCGCCAAACAACCTTTAACTTAGAGAGTAGTCGTGACTTCATCTTATAACACTTTGCCTTGGCAGGAACTTCGTCCGGAATTGAGCGTGACGGAAATATCTTCACAGCCTCAAGATTTTTTCGCATTACAACCCCGCGCAGAAAAAGCGATTCGTCATTTCATTAAAAATTCCCACCGCACTTTATTGGTGCTTAAAGCTGATGATCAGGCAGAATATGCGCCGTTATTGGAACAATTTATTCAATCTCAAAAGCCAATGCCGGATTTGTGTGGCGTGCAATACATTATTGAACAAGGAGATTCTTTTTCTTTTCCTCGTATTTCTGCAGAATTGGCGCAGTCTCATGATGACAATTTTGCTACGCAAAAAAGCGTAGGAACGGCGTTATATTTCGATCAGTTTCAGTTATTTGGCTCAGTGAAAATTCACGCCACATCACACGACATTCAGCTTAACCCCGGTTTGGTGCATCAATTAAATAGCGGTGTTTTAATTGTGACTGCCGGCGCATTATTGGCACAATTTGATTTATGGCAACGACTAAAACAAATTCTCACCACCGGCGTGTTTAACTGGTATTCCGCGCATCCGTTTAAAACCTTGCCTTGTGACATTCCAAGTTATCCGTTGCAGTTAAAAGTCATTGTCTTAGGCAATCGTACAGAATTGGCGACATTGGAAGAGTTGGAAGAAGATCTTTATCATTTGGCGGATTATGCGGAAATCGAGAGTTATTGCCGTGTTACCAATGCGGAGCAACAGCAACAATGGATGGGTTATGTTCAGTCCGTAGCGCAACAACATCAATTGCCGATGTTGGATTTGAGCGGCTTTAATAAACTTTATCAATTATTGGTACGTGACAGTGAAAACCGCGAATTGATTAATATATCGCCGTTGACGTTAAAAAATATATTGACGGAGACGGCACTTTTAACGCAGTCCACCTCCTTAAGTGCGGTGGATTTTGAGCGTTATTTTTTACATAAAGCAGAGCAATTTGGCTTCTTGCGTGAACAAACTTACGACTGCATTTTGCAGGAACAAATTTATGTGGCGACCGAAGGTGAAATGGTTGGACAGATTAATGGTTTGTCAGTCATTGAATATCCGGGGACGCCGTTAGTTTTTGGCGAACCTTCCCGTATTAGTTGTATTGTGCAATATGGTGATGGTGAAGTCGTAGACGTGGAGCGCAAAACCGAATTGGCGGGGAATATTCACAGCAAAGGCATTTTAATTGCAGAAGCCTGTTTGGCGAATATTTTAGAATTGCCTTCTCAGTTGCCGTTTTCTGCTTCCTTGGTTTTTGAACAATCTTACGGCGAAATTGATGGTGATAGCGCTTCTTTAGCCGGCTTTTGTGTGCTAGTCAGTGCCTTGTCCGATTTACCATTGCCACAATCTATCGCCATTACCGGGGCGATCGATCAGTTTGGTTTGGTGCATTCTGTCGGAGGCATAAATGAAAAAATCGAAGGATTTTTCACAATTTGTCAGCGTCGCGGACTTACAGGTAAACAGGGCGTAATTATTCCGAGCGCAGTGGTGAATCAACTGAGTTTATCTGAGACGGTTATAAGTGCGGTTAAAAATCAGGAATTTTTTATTTATCCTGTAGAAACCGTGGATCAAGCCTGTGAAATTTTGTTACAACGTGACTTAGTGGAACAGGAAAATAAAACCTACACGATGGATACCATGCCCTTGTCACGCTTAATTAATCAACGTATTAACCAATATGCAGATCGGCAGTCACATCGATATGGTTTTTGGGATTTTCTCTTTTCCCGCAAAAGCCATTAATTCAAAATTGCCATTAAAAATTAACTGATCGGACAAGTTAGGCGTACAAGTGTTAGCTATTCTAAATGCAGGAAGGCTTTGTTAGAATCCTTTTCGCGTTTGTATTTCAACAAAACATTTACTTTTTATAGGATTTTTATAATGAACACTTGTACACCAAACATTAAAGACAGTTATTCTTACGAAGATTTATTAGCTTCCGGACGCGGAGAACTGTTTGGCAAAGAAGGCCCGCAGCTTCCTGCGCCGACCATGCTAATGATGGATCGTATTGTGAAAATGACTGAAGATGGTGGTACCTTTGGTAAAGGTTACATTGAAGCCGAACTTGACATCCATCCTGATTTACCGTTTTTCGCTTGTCACTTTATCGGTGATCCCGTTATGCCGGGTTGTTTAGGTTTAGATGCTATGTGGCAATTAATCGGCTTTTTCCTTGGTTGGGTTGGTGGCAAAGGCAAAGGTCGAGCTCTTGGCGTAGGCGAAGTGAAATTTACCGGGCAAATCCTGCCGACCGCCAAGAAAGTGATTTATCGCATTAACATGAAACGCGTCATCAACCGCAAATTAGTGATGGGCATGGCGGACGGCGAAGTGGAAGTGGACGGTCGTGTCATTTATACTGCAACGGATTTGAAAGTCGGTTTATTCCAAGATACGTCAAGTTTCTAATTCACGTTTCCAAAAACACTGAAATTTTTGACCGCCCTTTTCTACCAATGAAAGTGCGGTTTGTTTTTTCGGTGTTTTTATCTTCATAATGAACTGGTTATTTTTTTTGCAATCATCAACACAATCAAATAAAAAACTTGTTTTAGCCTGAAAAGCTAGCTATTATTAGCGCACTTTTCACTGCGACCGTAGCTCAGTTGGTTAGAGCACCACCTTGACATGGTGGGGGTCACTGGTTCGAGTCCAGCCGGTCGCACCATTTTCTTCCCTCGCACCCCCTCATAATTTATCAAAAGCCCTTGTTAAATCTAGTAAATATACCGTTTTTAAAGTATTCTACCTATCGTAACTAATCGCAGTTATTCGTCTTTAATCGCGCTTTTTAGTAACCATAATAGTAACTACGGCGATAGTTACCATTCTCATAGTTACTATAAACAGGATAAATATAGATATGGCACGCTCTGTTAAACAGCTTAATAATAAATCCGTAGATAATGCCAAGCCGAAAGATAAACCTTATACGCTGACCGACGGTAATGGTTTATTTCTGTTGGTTATGCCTAGCGGCTCTAAAACATGGCAATTCAACTATTACCGACCAATCACTAAAAAAAGGGCTAAATTTAGTCTAGGTGCTTATCCTAGTGTGACGATTGCGCAAGCCCGTTCTATTCGTGAAGAATATCGTTCTTTATTGGCTCAAGGTATCGATCCTCAAAATTACATTAAAGAACAGGAACAAGAACAATCTACCACCTTTTTAAACGTGGCTAATCGTTGGAAAGAGAAACGGGCAAAGGAAGTAGAACAACTTACTATGAAAAAGAATTGGGATAGGCTTGAAAAATATCTGTTTCCAATCATCGGGCATTATTCAGTAGATAAAATTACCTCACCGTTGCTGATTGATACGTTTAAACCGCTTAACGATAAAGGCTATAACGATACGCTTCATCGAATGTTTAACCTATCTAATCAGATTCTAAACTACGCCGTAACGATTGGATTAATCCCAGTAAATGTTTGCCAGAAAGCGGCTGATGCGTATCACAAAGAAGCGCAAACGCATCACCCGGCAATAAAGCCGCAACAACTGCCTAAACTGTTATTTGATTTCAAAAACTCTAACCGTTCTTTCTTAACTAAGGTTTTGTTCCGTTGGCAACTTCTTTCAATGGTGCGACCGGCTGAGGCTGTTTCGGTTGAATGGTCTGAAATTGATTTTACTAAGAAACTTTGGACGATTCCGGCGATAAAAATGAAGAAAACAAAGCAAGGGCAATTTCCGCATATTGTCCCGCTTTCCTCGCAAATGTTGGCAATCTTGGAAGAATTAAAGCCGATAACCGGTGAAAAGAAATTTGTGTTTCCTCATCATCACTTACCGAATAAATCCATGAGTAAGGAAATAATTGCCAATTCACTACGGAAAATTGGTTACAAAGATATTCAAGATAGCCATGGTTTACGATCTATCGCAAGAACATATTTAGAAGAAAAGGCGGTTGATTTTAGATTATCTGAAAGCTGTCTTGCGCATAAAATAGGTGATTCTACCAGCCAAGCCTATAACCGTTACGATTATATAGAATTGCGCCGCCCGGTGATGCAACTTTGGGGCGATTATGTAGAACAATGTGAACGTTAAAAGTGATGTGCGTGGAAAATATTTAAAAACTGCCTTCACTCGTTCACCGATTGCTTTTATTTTATTTTTCAAATGGTTACAAGGTGAAGGGAAAAGAGAAGTTATTCACTGGGGCATTCACCTTTTTAGATAAAAAAAGCGCGGTTTTTATGCCGCGCCTTTCACTATCTATTTGATTTTATGTACTCATTATAAAACTCGTCAAAGTCTTTAAAGTGTACGTTAGTGATATATCTTCCTTTTTCCGCACCGGAAGTAATTCGCCTGCGGGTGTATGGATATTTATTTTTGTGTTGTGCCAATCCTTGTCTTAATGATTCGGTAAAGTTATTCAGTGTTAAAGCATTTTGAATATTATTGGCTTCAGTAAATACCAAGTAAGCAGGGTAAAGGTGCGTTCTTGGAAATCCGGTTTTCGTATTGCCAATTCCTAGCCCGTTACTTTCTTGGGAGGTTAGGAAATAACTACAAAACACCGTTAAATGATCTGAGTTCATTTTTATTTCTAACGCTTCGGCACTTTCCTGTTGTTGAATTAACGCTTTTTTCGCGTCCAACGGATTTTTAAAAGCCTGAATCAGTTTATAAATAATTCCTCCGGCTTCGGCTTCTATCTTATCCATTAAGTGCGGATCGCGTTTACTTTCCGGCACCACCTTATCAAAGTGAAAAATCACTCGGCGGCGTTCAATACCTCCGTTTCTTTCTGTGAAGCGTGTCGGCTCATTGTTGACGATTAGCACAATCGCAGGAATGACCGCTTTAAACTTGCTTTTGTGCTTCGGATCGATATTGACTAAATCCCCCGCACTAATGCTTTTTAGTCCGCCACCATCACCACCATAGCGGGATTGTTCCGGGCATAGAATGAGCGTTTTATTCACAAAGTTTTCACGCCCGCGCGGTTCGTCTAAATCAACTAATCGACCGCTTTCCGTGTTTTGTTCCCCGGCTAACATTGTGGCAATTTGCGCAAATACCGATTTCCCGCTACCACCATCGCCGGTTACTTCAAAGAATAATTGCCAGTTATGGCGATTCGTTAAAATCGCGTAAAGTGCGCCCAAGATTGCTTGCTTTTTATCTTCCTTACCATCAGCAACAAAATTCAACCAATTATCAAAGTGCGGTGTATTTTCTTCCTGATTCGTGTAATCGTGCGGAATAAAGGAAGTGAGCCAGTTTTCCCGACAATGTGGGCTAAATTCCAAAGTAGAACGATTTAATACGCCGTTTTTAAAGGCTAATAACTCCTTGGATTGTTCCCCCATTTTGGCCGCTTGAATCTTCGCTGTATCAATCATTGAATCAATAGAACGGGCGCTATAAGTAAATTCCTGTTCATCGTAAAACTGAACGGCTTTAATCTCTAACTCCGATCTCGGTAGAATCTCCCAATTTGCGCCGGTGTAATGATAAAGCTCCTTATCTAAACTGTGTTGAGCAATATCTAGATTCAACCATTTTACGAATGCCCGGGCTTTTACGTTAGTTCCGTCCTTTTCTTTCAATTTCGGCGGCGGAGCAATATTATTCGCAATTTCTACCGCACTTTTATCCGTGCGTAGGCGTTGGATATAACCACTTAGATTTTCCTTCATCTGCGCGGCGCCATCATATAAAACAACATTTTCCGCCTTGGTATGCTTTGCTAGGTTATGGCAAATACCGGTGATTTCTGTTGGTTCTAATTCGCCATATTGGAAAAGCATGATTTCGCGTTGTTCTGAATCAGCTATTTTTATTCCTGAAACATCTTCAAGCTGCTGTTCGGCAAGAATCACCGGCTTTTGTCTAGCGTCCATACCTTCCACCAACGAACACAACAATAACCATTCTTCGCCCTTTCCTTTGTTCCATGCTTGCCAAGCCTTACGCCCGGCAAGAATGATTAATGCGGAAAAAGGTTCTTTCGGCTGATCCGCTAAGTGTGGTGCATTAATTAAGCGGGCCATGATTTACCCCCTCAACCGTACCGCTCTCAATTATATGAATGCGCATGGCGACTGTTTTTTGAAAATAGGAAAACGCACTTACCAACGAATTCACGACACAATGTTTTAATAAACCATCTATCAGTTCATCATTGAATAATTCCGCCGCTACTTCTTCCGGATCGGGTGAAGGTGGCTTCGGTGCCATTGCTAATAAATGCTTATTGATAGCCAGTAGTTCATCATGAATGATTTTTAGCCCGTCGAGATATTCACCGGGATAATCAATAAACTGTTCGGCTAATGCCAAAATTGCTTCGGTGGTATAAGGGAAAGACGAATTAAAGGCTTCATCTTTAGGTTTTCCCATGTTCTGATGACTGATTGAAATCGCATTCAATTCAACGGCGTTTAGTTTGGAATAATCCATTTTTTGATTGAGATCCATTATTGTGCTCCTCCTAGGTCCACTACCGGCAAAGTTAAATAATCCTTTTCGGTTTTGGCTTTATGGTGAATTGCTAACGCTTCGATTTGGTGTTTTAAATCTTCAACATCCATCAAGGCCAAAGTTGCTTGGTCTAAAATCAGATCTAAAATCTCGTCTTTACTTATTGCGATTCTTTCGCCATGTTCATTCGCTCGGTAAATTGCGCCGTTTAATCCTGCATCAGCTATTTCTTGGATTTCGGTAATAATACTTTCAATGCTCCATAGTGTCGCTGTGCCGTGCTTTTGGGTTGGTGTATTTGTATCGTTCATTATTTCACTCCCGCTAAATGTTCTAAGCCTTCATTCATCAAATTCACTACGCCTTTTAACGCACAAATTACGGTCGGCATATCACCTTCATTTAAGTTTGCTGATGCTAATAAATCAGCCATTGAAAAGGCTTGCCAAAAAAGCGCGCTTGCCGCTTCAAGGTTGTTTAGAGCGCCTTCATCTAAAACAAATTTATTTTCTGTGCTCATTTTTCACCCCAATTTGATTTGTCATTTACGGCATTTTCAATATTTTCAACGTGTGAATTTATCGCATACGCAATATTGGCATAGCTTAGATCGTCTAGCCTTTGCCCTTTATCTTCAAAATGTGCGGCAAACATATTTAGCATCGCTTTAAGATAGGCAGTTTCAATATTGATTTTGTCGATTAAGTCTAAGTTACGCATTGTTTCCCCCTGTGATTTTCAATCCTGCTGTAGTTGCTCGTTTAATTGATTTCAGCGCGTCTGTTGCGCCTTTTAAATGCCCGTTGTGAATGTAGTCTTTGGCAAAGCTAAGAAAAAATTCAGTCTGTTTAATGGCTTTTACCAACTGTTCCCAATCCGGCAAGCGTTCTTCCTCAAATAGACTAGGGCGCTTCTTGCTTACTTTGGTTTTTAACTTACGCATAAATCACCTCCAAAGTGCGGTTGGTTTTTGTGGTGTTTTGTGGATTGATACGACCGGCAAACACTAATACAAATTCTTTAGCCAGTTTTGCCCGGGCTTCTAATTCGGTATTAGCAGTTATTTTCAGTCGTTGGGGTTTGCTGTTTACATCAGCACGGCGGACTGCTACAAAAATAAATGTAGGTTTTGTATAATTACAGTTAGTCATTGTCTTACTCCGTTTTAGTAAGGTGGTGATTAGAACGCTCAAGAATGTTCCCGCATACTTGGGCGTTTGCTTTTTTAAATACCTGTTTATGGGTGTTTATGCACCAATAATAATTTAGGTGCATAAACACGTCAAGCTTTTTTTTATCCCTTTTTTCTGTTAAAGTGTATAAACAGTTTAAAAAAGGGTTTTCTATGGCTACTGGACATAAAAATAATAAATCTGCTACAAAGGGAATCAGGTTCCCGCACGAACTTATTAAAGAAATTGATGCTGCTGTTGAAACAGAAAACAGTAATGGAAATAACGCCAATTTTTCTTCTTGGGTTATTAATGCTTGTAAGGAAAAATTAGATAAGCCTAAAGGGTGATTTATATATTGTGAAATATTTTCATATTGTTTATAATTTACGTGATTTAAATTCATATTTATTCCTTTTATGGATTTAGATCGGGAAACGTGGCATGTGCTTTGTGTTTGGCTCATGGTCGCTAAAATCATTCATTAGCGTTCAAAGTGTGTTCAGCAGTTTCCGCCCTCGTGTAATTTAGGTTATCTACCTTCTGAGCGATTCTTAAGGTAGGGAAATGCGCCGCTTGGGTTGAGCCTTGCGGCGTTTTTCTTTTAACGGAATCGAAAGTAGGCTTGCCGTTCTTCATGCGCAGTAAGCTCACCTCCCTTAGCCTTGTAAATGGCAATCACCTGCTTTAGCTGTTCGGCATCTGCGATTTCATAGCGGTAATATTGCCCCATTCCATCTGCAGTCTTTTCCGTTGTACGTTTCACTTTGCCGGTTAAATGATTGCGTTCAAGTTCACTGATATAGTTACGCGCTGACGTCATGCCCATTGAATAACCATCAATACCGCTAATGCTAGAAAGAATTAAGCGGTATAACACTTTTAAGAATTGTGTTGGTTTTCTTGCTTCGTTCATCTTCCACCACCTTAAGCGCGTGCCGCTTTTTGTTCTTCAATCCATTGATTCACTTCTTCTAAGTCCCAACGGACAAAGTTTTGTGAAAAGCGGATCGGTTGTGGGAATTGTTTAGCTCTTACAAGCTCATTGAGTTTGGTGCGGCCAAAGCTTACGCGTCGGCAAACATCGGCACCGGTGATGAGTTGTGTAGATTGGGTTTGAGATTGGCTCATAAAAAATACCTCTCGTTAGTTTAACCATGTGGAATAGCGTTCTATTCCGTTGAGTTGTTCGAACGAGAGGGATATTATGAATAAAACACTTTTAATTCAAAGACTTAACTTCCTAATAAGAAATGATCGCTTAAATCGAAAGAAAAAGCCCCTTAAATCGAAAGACTAAGAGGCTTTTATAGGAAACTATGATTTTTTAGGTGGGAAAATTGCTTTTATAATATCGTTTGATTCATCAATTAACTTGTAAAAAGTGTCTGCTGCGATATTGGTATTTCTAATTCCTGATTCCTTTAGATCTGCGTTGATAACATCAAATAACTTATTTCTGCTATCAAGTTTTGGATAACATTTTTTAACCAATAAAGCGAACAGTTGCTTTTGTGGAGAACTTATTCGGCCGCTTTGGTTTGTTAAACTAAGAGAGTCTATTATCTTTTGCTTTTCCTCAAGCTCTGATTTTAATTTTTTGATTTCTTGCTCTAAATCATTACATTTTAAAGATTTGTCGGATGGTATAAGATTAATTAAATCATCATAGCTAATTTTTATATCTTTTAAATTAATTCTAAATGTACGTTCCTTAATTAATTGATTGTATTCATTATAGTCTGGATATTTCATCGTAAAATAGAGAGAGTTATAAATATCCGAATTAGTAATAGAATAAATGGTAAAGCCCTCTAGCTCTATATAGTTTGTATTGCTTAGACTATGATCAGGCATTGAGCTACTTACAAAGAGATCTAATAGCTCTGAATATAGAACTACGTAACCAGAAAAATTTCTTATCTTGTTTTTATCATAAGAAATATATAGATCGCCGAATTCATCTGGTTCTTTCTCAAAGCCATCTATTTTTTCAATCTCTAAAGTAGAAAAGCTATCTTCTAGACGCAAAATACCATCAGTTTCATTTTTTAAACTTTTCTCTTTAAAAAACAATTCTGAATCTTTATCAATAAAGAAGCCTTCTACATCATTTCTACCTATTTTAATCAGTTCATTATTTTTAATTTTTATCTTTATTAGAAACTTTATTTTTTCCTCAATAGCATAAGAATAGAGTAAATTTTCCTTAATCATCGAACCTGTTTTTTGATTAATAAAATCTACCGCTTGACTTAAAGAATAAAAATTAAGTGGAAGTAAATCCATAAACGCCCCTTTCGCATTTTCCCTTATGATAGGAACGCACCAACAAGATAAGGTTTCTTGCTTTCGGGGATCAGCCTAGGTGCGCTTTATTCGGTTACTTGTTTTTTATATCTACACTAATATCTATTAAAGTGGTTTCTTTGCCGGTGCTGCCGTCTATCCAGTTTATCGTGCCGGTAATAAGTGGCTTTTGCCTTTCCGGTAATGTTTTCTTGATAACTGCGTTATTTATTTCATATTCTACCGGGCGGCTTTTTCTTTCTTCCCACTCGGCGGCCAGCCGTTTATTTTCTTTCCGGCGCTTATATTCATCGTAAATAAGCCAACCGATAGTAAATGAAAGCCCGCCGGCTAAACCAAAGGCCACAATAAACCACCAGTCTTGAAAGGTAACTATTGCTAATACGACAAAAAAAGCAGCAACCATACACTTTATTGCAACTAGTATCAGCCCCATAGTTCCCCCTTGTTTTTACTGGTTATTATAATAAATAACATAGTAACAAAGGAGAAAAGATTCAATAGTGTAAGGGAAATATTTTATTATTCATCACTTACCGGCATTGATTCTTTTTTGCTGTTTTAGCATGGCATCAAGTTGTTTCTTGGCAATAAAGTAAATCGTTTCTAACGCGTCCATATTCGGGCTATTTGGTCGGCTGTCTATTTCTCGCTTGGCCGCATTACATTTACATTTTAATGCGTGAATAACTTCACTTATTGGATAGGGTTCTTCATCATCGTAAAGGCTGACAAAGGTAAAAACTGCGGTCGATTTTTTATAGTGATTCACCGCTGAAAGAAGTAAGTTTTGCTTTGCCTGTTTACATCTCATAAATCAATCCCGTTAAATTGTTCCAATGCCTGTTTGTGTTCTTCCGATAACTCAAAAATCAGATCGCCATATTCAAGTTGATAAGTGCCGAATGACATCAAGAAAGCTACTGCCGGATCGATTTTGTTTGCGGCCTTCTTCTTGTTTGGTTTTATGTTGGCGTTGGCATCAGTTTCCATGACGACATTGGATAATGCCCAAGAAAGCACCGGATCGCCGTGGTGTTCTATCACTTGGCGATTTATCAACACTTCCGCACTTTTCGCCACCGGGCTAAATCGTTGATAGGTTTGCGGGAATGGTTCTACTTCCAAGCCTGCTGCCTGTAATTGCGTTCTTAAATGCGTGGCGTTCCAAACATCAAAGCCTATCATTTTGATATTGAAGTTTTCCGCATCTTTGAGAATATCATCGCGGATTTTGTCATAGTCGATACAGTCACCCTCAGTTGCAATAAGCCGACCACTGCGCACCCAGTTTCGATAGATTGCCCGGTTCTTGTTGGCCACGTTGTTAAGCTGAAATTCAGGAATATAGTGCCGGGTAATCAACCGCACTTTTTTCCCTTGTGGGAAGGTGTAACAAAGGCTTGTTAAGTCGTTGGTGCTGGATAAATCCAAGCCTAAATAGCAATCTTGGTGAAGTAAGTCGCTTTCGGTGTAATCTCGTACGCACTGCGCCCAATTGCCTTCGCCTAGCCATGGTGTCGTGCCTTGGCACCAAACATTAAAACGCTTGGTGAGCATTTCCACCCACTCGGAAGGAATCCCTCGGGCTTTCTTGATCGTGTTTTCAAAATCAAGGTAAGGAATGGATTTACCGATATTCGGATTGGCTTTTATCCAGTTTTCCGGATTATCAATTTCGCTTTCTTCGTCCAATTCAAAAATCAGCACAAATAGGCTGTCGTTTTGTTCGTTGCCTTCCAGTATTTGTGCGCAATAATCATAGTGCTGCTTACAAGCGGAAATTACGTTACTTCCTGCGGTAGTAATGGCAAACAGTAAACCTTCCGGGCGTGCGCCTTGTCCTAGCTCTAACGCGCTGTAAACGCTGTTATCAGTATGTAGGTGATATTCGTCCACAATGGCGAGACTTGGGTTAGTTCCCTCAATGGTTGAGGATTTAGCCGCTAACGGGCGCATTAAGCTATTTGATTTTGGATTAATCAGTTTATGCTGCTGAATATTGAGCCGTTTGCGCAAAAGGGGAGAGAGTAGGCACATTTGACGCGCATCATCAAACACAATGCGGGCTTGGTCTCGGCTTACTGCTGCAGTGTAAATATCTTGTTGGCCCGCTTCCATCAGTAGAAACCAATTAGCCAACACGGCAGCCACGGTGGACTTGGCATTTTTTCGCGCCACTTGGATATAAGCGGAACGATATTTTCTCAAGCCGGTATCGGTGCGCTTAAAGCCTAACAGATTGGCGAATAGAAACGTCTGCCAGTCTGAAAGCTCGATTGGTTGCCCGCGTAAATGCCCCTTAACGTGCGGGCATAGGCGAGAGAAAGCCAAGAATTTATTTACCGCACTTTCATCAAAGAAATAAGCGGGGTTCGCTAAATCATCAAAATAACGCGCTACGGCTTGTTTTATCTTACGACAAGCCACTATTTCACCTGTTTGAACTTTCTTCGCGTATTCGTGCCAGATTTCCATTTTCGCCTACATTGTGAGGATTTCATCCAACATATCAGTAACGTCCGTTTCTACTGGATTTTTACGGCGACTCACCGGATCGAAGCCTAAGAGGGAAGACATCTTGATCATGACTTTTTCGGCATCTGCTTTCGCGGACAATGCCGGGTTTCTTGATTGCGTACCTTGGCTATTTACGATAATGAAGCCATTTTTCGATAAATCTGACACGGAATGACGCCAAATTGCGTAGTTTTCGCAATAAATTTCAAGGTTTGTTAAATCTTCCAACTTAATATCGCCACGCTCTGAAAGTTGTTTAATACGCGCTTTCCATTGGCTTTTAGCAATATCATCCAAGAAATCAGGTGTCTTATAACTTTTTCGTTTGCTCATTCACTTTCCTTATTTTCTAAAAAATCACTTTGCGTAAAAATTTGAGTGGGCGGGCGGTTCCGAAGGGTTGCCACTTTCTTTTTTAAATTGCCCCCACCTGGTCTAATCAATTTTCTTCGCACCAAATCCGCGTTGGTCTATCACTCGTGTTTTATAGCTGTGACAATCACGACATAAAGGCTGATGATTGCTTGCTACCCAAAACAACGGATCGGATTGTCCGTTCTCTACCGGCTTAATATGGTCTATCACTGTTGCCGGTGTATATTTGCCTTGCTCTAAGCACATCACACAAAGGGGATGATGCTTTAAGTATTGCTCGCGGTATTTGCTCCACTTGTGGTCGTAACCGCGTGCGCTACTGTTTGGGCGGTTGTCTTTTGGCTTGTGCTCCTCGCATCTACCGGACTTTACTTTGTTTTTACATCCGGGATAGCTACAACGTCTTAACGGTTGGTATGGCATCGGTTACTAAATCCTTAGTAAGCGCACGGTTCTCTATACACTTCCCACAATGCGGAAATCGTCATAGGTGCCGGTTTAAGGTTGGCTAAGTCTGTAACTGCTTCTCGGTTCGTGTAGAGATAGGCGATATACATTAAGCAACCAATCTTAATCGCCGGGGTAAAAGGTATGGTCTTTTCCGTTTCTTCTTCCCCAAAGGTTTTGCCAATATGTTTTTGGCATACTTCCAATGTGGCTACCTTATAGGCTTCCAGTAACTCATCATCTAAATCATGATCGAGATTTAAGTGCGCTTTGATTTCATCAATTGTTAAATTAATTTCCACCATTGCCGGTCAACTCCTTACAGATAAGCTGTAGTTCTTTGTGCGCTTCTTTACTATCAATAATGTTCATTATCTCTAGCGAACGGTTCCCATATTTCACGCGCATAGTGTTATCAACATTAGTTCCGTAACGTATGCGAATGCGCACCGTATTTTCATTTAATGGCACCGCACCGGAGAAGAACTCTCTACCTTGTAATGGTTCAACCGCCGCCCGGATATTGGCAACGGTTTTCCATTTACTCACAATACCGCCGTAGTCGTTCTGTTCGTTCACTTGCTTTTGTAGGCTAATCACCTTGTTATACTTTCCGGCCTTAATCATGATTGCCATTGCTTACCCCCGGTTCTTGTTCATCACCGCGTTTTACTTCTACAGTTTGTTTCCATGCTTGGCTAAATTCTTCTCCACCATCATAAGGCGGTAAACCTTCACGGCGGCGAACTTCATTCGGGCACATTACACCGGCTTTAATTGCCACATCGTAACTCTTGAAGCGCTCACTTTGACTTGTGCGCAATAAGTCGCTTGTATCAAATTCAATTAAGTAACGTTTATTGCTGTTGCTACCTAAATCAATCATCAAGGCATCTTTTAGCTGCTGTTCAAAATTAGTTAGCCAAGGACGCAAGGTTTGCGATAAAAAGGCTCGACTGGCTTCACTAAAGTTTGAATAACTGCTATTGGAATAGTCTTGAAGGAAAATCGGGCTAATGTTGTAGATTCTGGCAATATCGGAAATTGTGAACGTACGGCTTGCTAACCATTCCGCATCTTGGTTTGTCATGCCTAACTGTTTATATTCCATTGAGCCTTCAAGGATGGGTGTTTTCCCTGCGTTCTTCGCCCCCTTGTAACGTTCAAGGGCTTTTACGGCTTTTTGTGCTTTGGCTTCGTCCAACCACTCGGCGGTAGTAATTAATCCGCTCGCCATTAATCCGTTTTTCATCACTGCCGATCCGTGTTTCTGTTGAGCAATGCCTAATCCCACGGTTTCACGGCAAATCGTAATTGGCGAACGTCCCATAAAGCCATCAAGGGATGAATGGCGTAAATGTAGGATTTCATCTTGAAGATAGTTTTTGGTATTGCCGTCTAAGTCGGTAATTTGATAGATATACTCGCCGCCAACTTTGCGATAGATATTGACCGCACTTGGTTCGTACGGGGTAAGGCTGATTGGTTCGCCTTTGCTGTTCCATTCAATCACCGCATAAGCGTTACCGTTTAATAGGCAGTGACGCATCATGGTGTATTTGAATTGATACGGTGTTTGGCTACGGTTTGGCATCTCGTTTAAGAGATAATCCACCGGGTGATGATAAACGCGCTCGCGGCCATCATCTTTAAGCTGATATAAATAACAAGGCATACTTGCCACCGCTTCAGAAATAACGGTAACGGCACTCATCACTGCCGGTAAACTTTCTGCCGTGTTCGGGCTGACAAATTCCCCTGCGCCGGTGTTTGATACTCCAAGATAAGAAAGCAGTTCATTAATTGCCATCGGTGAGCTGCGTTGTTCTTTTCGTCTGAATGGGTTCCACATACTACGCCTCCGCCACATCAAGCCACTGTTTCAAAAGTACGGTGGATTTTCCTTGCGTTTTTCCTTTCGCGGTTGCCATTGATCGTTTGGCAATCTCAACGCTACTTTCAGGATAGGCAGGAATGCTGGTAACGGTGATTTCAAATAATTCCGCTTTGGCCACTGTGCGTTGACAAGGCTCTACATCAAAATTCCATGTTTCTTCTTTAGCCCAAAAGCCGAAAGACATCCCGCTAATATCGCCGCGTTCAACACTTACCAACAAATCACGCCCTAAGGTGGTATCAGGTGGCATTAATTCAAAACGTAAGCCTATTGCGTCTTCTTCCAGTTTTAAGGTTCCCGCACGGGTGCGCCCTAATAGTTTGGTATGATCGTGTTCAAATAATGCCCGTACATCGGCACCGCTACTTAAACTTTCACTAAACGCATTCGCACTGAATTGTTCTACAAAATCGCAATAAAGCACTTCAGAAGGGCTGTTCCACTTCACCACATAGCCAACCAGTTTTTTATTCTCGCTGTCTGCGGTGATTTCGGATGAACGGATTTCAAATTCTTTATTCATACTTTCCCTTTTAACAAAAAGGGGGCTTAATTGCCCCCGTTGGAATTTGACGATTAAGCCGTAACTTCAATGAACTTGATTGCGTTACTATCTACCACGCCACCACCAAGATATTTATCGGTATGGACTTTATAGAAGCCCGGTTCGGTAATGTTATCAGGGCGGGTTCTTACGCCGGTTTCGTGATCTACAATGAAGTAACCGCGTTTGAAGTCACCAAAGGCAACCACCGGTTTATTGGCACCACTTGCCGGCATGGTCTCAAGGAAATAAACCGGACGACCTAAAAGGGTAGAAGGCGCATCTACGGTTAAACCATCACGCCAAATAAAATCGCCGTTTTTGTTTTTGAGTTTTTGTAATGCCGCCGCAATTGTGGAAGACATCACCCAAACGGCATTTTTACGGTATTTGCTGTGTAAGGTGTAGAACAAATCAATGAGCGTATCGGCGGTGATTTTGTCGGCACCGGCAACGTCTAATTTTTGTAACTTACCAAAGGCGCGTACTTTGTCCGCTTCGGTAGTACGTTCATAGGATAAGAAGCCTTTTGATTTCTTCGTGCCGTCACCGCCGGTTAAGTCAGTTTCTTCGGTTTCGGTGAAGCTTTCGGTAATTTCATCAGTCAACCAACCTAAAACATCAATGCTGGAGAAGTCCAAAATGTCTTGAGTGGTTTTTGGATAGGCATAGATAGGGTTTAACGCAATGGTGACTTCATGGAGTTTCGGGGTTGCCGTACCGTTACGCGCTACGCCTTCTTCACCATGCGCTACCACCGCACCACCGGCGGAAACCAGTTTTTTGTATTCTTTCGCACCAACCGGCAAGCGGACCACGTTACAAATTTGACGCATCACGCTATCATCGGTTAAGCGTTTCATTACGTCTTTATCCAATTGTGGGATCACGGTATAACCGCCATCTTCTTGACCGGTGGTGGAAAGATTTCGTAATTCACCCGTTTTAATGTAGTGGCGTAGTTCGTCATTGCTGAAGGTTTTACCGCGTGTTTCTACCGGTTTTCCTTTGTCGGCAATGTTACGTTCTTCATCTGCCACCGTTTCATAACGGGCGATTTCATCGCTCAATTGTTTAACCAAGTCTTTCAACTTTTCAAAGTCAACGTTTTCAGTTTCGGTCAATGAGCGGTTTTCTTGTTCCGCTTTGTCTAACATAGCGCGCATTGCTGCGACTTTTTCCGCTTTTTGTTGGCGTAGTTCTAACAGTTTTTTAAACATAGTTAATCCTTTTAAAATCATCTTAATTAAGACGGCTTATAAAAAGCCCATAGAACAATATATACACAAAAAACAGGAAGTAAATTGCTTAAAAATTAATAGTTTAGCTACGTTAGGATACGTTGAGCGGGTGAAATTTGACTGCTTGTTTTTTATACAGAAATAGATGTGAAAATATAATTTAGATAGGATTTTTTTATCTATTTTAAAGAGTGAACACTAGTGAATACCTAGTGAACACCTTATTCACTATATAACTATATGATAAATAAAGAAATATAATATATAGTGAACAGGTGAACACCTTTTTATATAATTTTAAACGTTTATTAAAATTTAGGTATATAATGGTTCGAAAGTAGAAGAAAGATTTATAGTGAATTTAATTTATAGTAACTATGATGGTAACTATGAAATATGCCAATCTTATATTTAATTGATTTTAAATATTAAATACTTAGATTCGAGTCCAGCCGGTCCATCTTTTCTTTCCTTATAATTTCTTCATATTTAGTTATATTTTTTATTTCTCCAAACCCGTCGCAGTTTTTCTGATCTTTTGCTAGAATCAGCGGGAAAAACACCTTTCTTATAACCTTGTTTTTCATCGTCTTTATTTTGTCTCTGATTGGAAATAACAATGAAAAATCAATCTTTTCTACAACAATTCTTTAAATTAAAAGAAAAAGGAACCTCCTCCAAAACGGAAATTATTGCCGGAATCACCACTTTTTTTACTATGGTGTATATCGTTTTCGTGAACCCATCCGTGTTGGGTGATGCAGGCATGGATAAACAAGTTGTGTTCGTGACTACGTGTTTAATTGCAGGCTTCGGTACGATTGCCATGGGGTTATTTAGTAACTTGCCTATCGCATTGGCACCGGCAATGGGCTTGAATGCGTTTTTTGCGTATGTGGTTGTCGGTAAACTTGGTTATTCTTGGCAAGTGGGCATGGGTACCATTTTTTGGAGCTCTGTCGGCTTGTTGCTATTGACTATTTTCCAAATCCGTTATTGGCTGATGGCGTCTATTCCGCTCAGTTTACGGGTAGGTATCGGCGCGGGGATTGGTTTCTTTATCGCCTTAATTGGTTTCAAAAATATGGGTTTGGTTGTTGCAAATCCTGCAACCTTAGTTGCTTTAGGCGATCTACACAGTCCGCAAGTGTTATTAGGTATCCTTGGCTTCTTTATTATCGTGGTGTTGGCCGCACGCAATATTTATTCCGGCGTGTTAATTTCTATTGCAGCAGTCACCGCACTTGCATCGTATTTTGACGAAAGCGTGATGTTCCATGGCATCGTTTCCATGCCACCGGCATTAACTCAAGTGGTTGGCCAAGTGGATATCGCCGGTGCCTTGGATACGGCACTTATTGGTATCATCTTCTCTTTCTTATTAGTGAACTTATTTGACTCTTCAGGAACTTTATTGGGCGTGACGGACAAAGCCGGTTTCAGCGATGAAAAAGGGCGTTTTCCGAAAATGAAACAAGCCTTATATGTAGATAGTGCCAGCGCCGTAGTTGGTTCTTATATCGGTACCTCTGCAATCAGTACTTATATTGAAAGTGGTGCCGGTGTTTCTGTCGGCGGACGTACGGGAATGACTGCGGTTGTTGTCGGATTATTGTTTTTACTAACCATTTTCTTCTCGCCATTGGCCGGTATGGTGCCGGCTTATGCGACAGCAGGTGCGTTAGTTTATGTGGGTATTTTAATGGCATCCAGTTTGATTAAAGTGACGTGGGAGGACTTAACCGAAGCTACCCCGGCCTTTATTACTTCAGCTATGATGCCATTTACATACTCCATCACGGAAGGGATCGCCTTCGGTTTCATCAGCTATTGTGTGATGAAAGTGGGAACAGGACGTTGGCATGAAGTTAACGCGCCGGTTTGGGTGGTTTCCGTATTGTTCTTGATTAAATTTATATGGATAGGTTAA
Protein sequences of DBSCAN-SWA_2 >LS483485|1466805:1492402|1491091_1492402_+|SQI98471.1|DBSCAN-SWA MKNQSFLQQFFKLKEKGTSSKTEIIAGITTFFTMVYIVFVNPSVLGDAGMDKQVVFVTTCLIAGFGTIAMGLFSNLPIALAPAMGLNAFFAYVVVGKLGYSWQVGMGTIFWSSVGLLLLTIFQIRYWLMASIPLSLRVGIGAGIGFFIALIGFKNMGLVVANPATLVALGDLHSPQVLLGILGFFIIVVLAARNIYSGVLISIAAVTALASYFDESVMFHGIVSMPPALTQVVGQVDIAGALDTALIGIIFSFLLVNLFDSSGTLLGVTDKAGFSDEKGRFPKMKQALYVDSASAVVGSYIGTSAISTYIESGAGVSVGGRTGMTAVVVGLLFLLTIFFSPLAGMVPAYATAGALVYVGILMASSLIKVTWEDLTEATPAFITSAMMPFTYSITEGIAFGFISYCVMKVGTGRWHEVNAPVWVVSVLFLIKFIWIG >LS483485|1466805:1492402|1477745_1479536_-|SQI98450.1|DBSCAN-SWA MARLINAPHLADQPKEPFSALIILAGRKAWQAWNKGKGEEWLLLCSLVEGMDARQKPVILAEQQLEDVSGIKIADSEQREIMLFQYGELEPTEITGICHNLAKHTKAENVVLYDGAAQMKENLSGYIQRLRTDKSAVEIANNIAPPPKLKEKDGTNVKARAFVKWLNLDIAQHSLDKELYHYTGANWEILPRSELEIKAVQFYDEQEFTYSARSIDSMIDTAKIQAAKMGEQSKELLAFKNGVLNRSTLEFSPHCRENWLTSFIPHDYTNQEENTPHFDNWLNFVADGKEDKKQAILGALYAILTNRHNWQLFFEVTGDGGSGKSVFAQIATMLAGEQNTESGRLVDLDEPRGRENFVNKTLILCPEQSRYGGDGGGLKSISAGDLVNIDPKHKSKFKAVIPAIVLIVNNEPTRFTERNGGIERRRVIFHFDKVVPESKRDPHLMDKIEAEAGGIIYKLIQAFKNPLDAKKALIQQQESAEALEIKMNSDHLTVFCSYFLTSQESNGLGIGNTKTGFPRTHLYPAYLVFTEANNIQNALTLNNFTESLRQGLAQHKNKYPYTRRRITSGAEKGRYITNVHFKDFDEFYNEYIKSNR >LS483485|1466805:1492402|1483521_1483884_-|SQI98461.1|DBSCAN-SWA MGLILVAIKCMVAAFFVVLAIVTFQDWWFIVAFGLAGGLSFTIGWLIYDEYKRRKENKRLAAEWEERKSRPVEYEINNAVIKKTLPERQKPLITGTINWIDGSTGKETTLIDISVDIKNK >LS483485|1466805:1492402|1480276_1480486_-|SQI98453.1|DBSCAN-SWA MSTENKFVLDEGALNNLEAASALFWQAFSMADLLASANLNEGDMPTVICALKGVVNLMNEGLEHLAGVK >LS483485|1466805:1492402|1482455_1483427_-|SQI98460.1|DBSCAN-SWA MDLLPLNFYSLSQAVDFINQKTGSMIKENLLYSYAIEEKIKFLIKIKIKNNELIKIGRNDVEGFFIDKDSELFFKEKSLKNETDGILRLEDSFSTLEIEKIDGFEKEPDEFGDLYISYDKNKIRNFSGYVVLYSELLDLFVSSSMPDHSLSNTNYIELEGFTIYSITNSDIYNSLYFTMKYPDYNEYNQLIKERTFRINLKDIKISYDDLINLIPSDKSLKCNDLEQEIKKLKSELEEKQKIIDSLSLTNQSGRISSPQKQLFALLVKKCYPKLDSRNKLFDVINADLKESGIRNTNIAADTFYKLIDESNDIIKAIFPPKKS >LS483485|1466805:1492402|1472254_1472416_+|SQI98444.1|DBSCAN-SWA MLDNDLLNLTHEQQQRAVEKIQELMAQGIGSGEAIALVAKQLREQNKNTQNHK >LS483485|1466805:1492402|1489263_1490451_-|SQI98470.1|capsid|DBSCAN-SWA MFKKLLELRQQKAEKVAAMRAMLDKAEQENRSLTETENVDFEKLKDLVKQLSDEIARYETVADEERNIADKGKPVETRGKTFSNDELRHYIKTGELRNLSTTGQEDGGYTVIPQLDKDVMKRLTDDSVMRQICNVVRLPVGAKEYKKLVSAGGAVVAHGEEGVARNGTATPKLHEVTIALNPIYAYPKTTQDILDFSSIDVLGWLTDEITESFTETEETDLTGGDGTKKSKGFLSYERTTEADKVRAFGKLQKLDVAGADKITADTLIDLFYTLHSKYRKNAVWVMSSTIAAALQKLKNKNGDFIWRDGLTVDAPSTLLGRPVYFLETMPASGANKPVVAFGDFKRGYFIVDHETGVRTRPDNITEPGFYKVHTDKYLGGGVVDSNAIKFIEVTA >LS483485|1466805:1492402|1476384_1477581_+|SQI98449.1|integrase|DBSCAN-SWA MARSVKQLNNKSVDNAKPKDKPYTLTDGNGLFLLVMPSGSKTWQFNYYRPITKKRAKFSLGAYPSVTIAQARSIREEYRSLLAQGIDPQNYIKEQEQEQSTTFLNVANRWKEKRAKEVEQLTMKKNWDRLEKYLFPIIGHYSVDKITSPLLIDTFKPLNDKGYNDTLHRMFNLSNQILNYAVTIGLIPVNVCQKAADAYHKEAQTHHPAIKPQQLPKLLFDFKNSNRSFLTKVLFRWQLLSMVRPAEAVSVEWSEIDFTKKLWTIPAIKMKKTKQGQFPHIVPLSSQMLAILEELKPITGEKKFVFPHHHLPNKSMSKEIIANSLRKIGYKDIQDSHGLRSIARTYLEEKAVDFRLSESCLAHKIGDSTSQAYNRYDYIELRRPVMQLWGDYVEQCER >LS483485|1466805:1492402|1485932_1486301_-|SQI98464.1|terminase|DBSCAN-SWA MSKRKSYKTPDFLDDIAKSQWKARIKQLSERGDIKLEDLTNLEIYCENYAIWRHSVSDLSKNGFIIVNSQGTQSRNPALSAKADAEKVMIKMSSLLGFDPVSRRKNPVETDVTDMLDEILTM >LS483485|1466805:1492402|1472900_1473347_-|SQI98446.1|DBSCAN-SWA MKYQKLENQEAHWKWVYLTKKAREGENITRYAEKSLQEDIVKQLIGCQNYPQQIEEWIKTHLAPELVIKLDQAIRARRKRFFNGEKQSTKKKSIDLEYAVWLRLSKYSRKMKMTLSETISYMIDEREKKALYESQMSAMKAGLKDLLK >LS483485|1466805:1492402|1479519_1479963_-|SQI98451.1|DBSCAN-SWA MDLNQKMDYSKLNAVELNAISISHQNMGKPKDEAFNSSFPYTTEAILALAEQFIDYPGEYLDGLKIIHDELLAINKHLLAMAPKPPSPDPEEVAAELFNDELIDGLLKHCVVNSLVSAFSYFQKTVAMRIHIIESGTVEGVNHGPLN >LS483485|1466805:1492402|1471179_1471497_+|SQI98442.1|DBSCAN-SWA MHQQTRLHLQHLQHTMERLALWQSVPPQEAAFLSEQPFALDTMNPTEWLQWIFIPRMHALVESQAPLPRQIAISPYLEEALKEEDYLAELLIPIMEIEKLLQQQC >LS483485|1466805:1492402|1486780_1487095_-|SQI98466.1|DBSCAN-SWA MVEINLTIDEIKAHLNLDHDLDDELLEAYKVATLEVCQKHIGKTFGEEETEKTIPFTPAIKIGCLMYIAYLYTNREAVTDLANLKPAPMTISALWEVYREPCAY >LS483485|1466805:1492402|1487081_1487426_-|SQI98467.1|head,tail|DBSCAN-SWA MAIMIKAGKYNKVISLQKQVNEQNDYGGIVSKWKTVANIRAAVEPLQGREFFSGAVPLNENTVRIRIRYGTNVDNTMRVKYGNRSLEIMNIIDSKEAHKELQLICKELTGNGGN >LS483485|1466805:1492402|1475399_1475930_+|SQI98448.1|DBSCAN-SWA MNTCTPNIKDSYSYEDLLASGRGELFGKEGPQLPAPTMLMMDRIVKMTEDGGTFGKGYIEAELDIHPDLPFFACHFIGDPVMPGCLGLDAMWQLIGFFLGWVGGKGKGRALGVGEVKFTGQILPTAKKVIYRINMKRVINRKLVMGMADGEVEVDGRVIYTATDLKVGLFQDTSSF >LS483485|1466805:1492402|1483977_1484262_-|SQI98462.1|DBSCAN-SWA MRCKQAKQNLLLSAVNHYKKSTAVFTFVSLYDDEEPYPISEVIHALKCKCNAAKREIDSRPNSPNMDALETIYFIAKKQLDAMLKQQKRINAGK >LS483485|1466805:1492402|1470289_1471072_-|SQI98441.1|DBSCAN-SWA MYLIEAFFSLQTSDFSLEKQAHCVNQLIEQWRYNGQIIGREIPQFLAEQENQQGLAVRVTCPEQTSLLAEFNNQPVTQALLEAKKCGVFFESFQIVAEDLNSDITAQETPSWQLLYTTYLQSCSPLHSGDTLQPLPLYKQLKNIPHLAMDTIKWQENWQACDQLQMNGSVLEKEALEQISSTETHLFKHGYHLAQEITRYSGIPTYYYLYRIGGESRETELHSYCPLCKRAWTLEQPLFDFLYFKCDHCRLVSNLSWHWQ >LS483485|1466805:1492402|1472625_1472835_+|SQI98445.1|DBSCAN-SWA MEVGVVKWFNNAKGFGFISVEGSNTDIFAHYSVIEMEGYRSLKAGQKVQFEVIHGDKGSHATKIIPVVE >LS483485|1466805:1492402|1468834_1469365_+|SQI98439.1|DBSCAN-SWA MEHKIEDLIAIFNQCFEQEYNTKLVKGGDEPLYVPANEDCPYNAIYFARGFYSSGLHEIAHWLVAGKERRKLEDFGYWYEPDGRTEEQQRLFEKVEVKPQALEWILATAANFRYFASSDNLNGQPGDTQPFKLAVYEQVKTYATKGLPKRAETLRQALAKFYGTENKIDLTKFDVT >LS483485|1466805:1492402|1481121_1481544_-|SQI98457.1|DBSCAN-SWA MNLNHVNYKQYENISQYINHPLGLSNFSLQALITQEEKLALFPLLFSVSTAASISLISSCGNLIPFVADLLFLCPVAIENPFLNCLYTLTEKRDKKKLDVFMHLNYYWCINTHKQVFKKANAQVCGNILERSNHHLTKTE >LS483485|1466805:1492402|1480678_1480909_-|SQI98455.1|DBSCAN-SWA MRKLKTKVSKKRPSLFEEERLPDWEQLVKAIKQTEFFLSFAKDYIHNGHLKGATDALKSIKRATTAGLKITGGNNA >LS483485|1466805:1492402|1487409_1488642_-|SQI98468.1|portal|DBSCAN-SWA MWNPFRRKEQRSSPMAINELLSYLGVSNTGAGEFVSPNTAESLPAVMSAVTVISEAVASMPCYLYQLKDDGRERVYHHPVDYLLNEMPNRSQTPYQFKYTMMRHCLLNGNAYAVIEWNSKGEPISLTPYEPSAVNIYRKVGGEYIYQITDLDGNTKNYLQDEILHLRHSSLDGFMGRSPITICRETVGLGIAQQKHGSAVMKNGLMASGLITTAEWLDEAKAQKAVKALERYKGAKNAGKTPILEGSMEYKQLGMTNQDAEWLASRTFTISDIARIYNISPIFLQDYSNSSYSNFSEASRAFLSQTLRPWLTNFEQQLKDALMIDLGSNSNKRYLIEFDTSDLLRTSQSERFKSYDVAIKAGVMCPNEVRRREGLPPYDGGEEFSQAWKQTVEVKRGDEQEPGVSNGNHD >LS483485|1466805:1492402|1488643_1489210_-|SQI98469.1|head,protease|DBSCAN-SWA MNKEFEIRSSEITADSENKKLVGYVVKWNSPSEVLYCDFVEQFSANAFSESLSSGADVRALFEHDHTKLLGRTRAGTLKLEEDAIGLRFELMPPDTTLGRDLLVSVERGDISGMSFGFWAKEETWNFDVEPCQRTVAKAELFEITVTSIPAYPESSVEIAKRSMATAKGKTQGKSTVLLKQWLDVAEA >LS483485|1466805:1492402|1471490_1472201_+|SQI98443.1|tRNA|DBSCAN-SWA MLEILYQDDVLVAVNKPAGMLVHRSWLDRHETQFVMQTLRDQIGQLVYPIHRLDRPTSGVLLFALNSEIANLLCQQFEQKQTGKQYLAVVRGYVTGHGEIDYPLKIQLDKIADKFSQQDKAPQSAVTFYEGLQTVEMPYGVGRYATSRYSLVRLVPKTGRKHQLRRHMKHIFHPILGDTQYGDLHQNRALTEHTGCSRLMLHAEKLTFVHPLTQVPITIQAGLDVQWQNLMQTFQW >LS483485|1466805:1492402|1469441_1470281_-|SQI98440.1|DBSCAN-SWA MHYQDKSLNALKLGQQTKYAEKYDRTLLQPVPRHLNRDMLNITTTQPFTIGADIWTAYEISWLNPKGVPQVAIADVSIDFRSENLIESKSFKLYLNSFNQTQFANFDEVQQILQQDLQDCAKGEVKVRLNSLTDYTQQPIVALDGDCIDGLNIEIEDYAFNAELLKNCTNDNIIEETLVSHLLKSNCLITQQPDWGSLQIHYVGKQINREQLLRYIISFRQHNEFHEQCVERIFCDLMHFAAPEKLTVYARYTRRGGLDINPYRSNFEPLPPNLRLARQ >LS483485|1466805:1492402|1481742_1482054_-|SQI98458.1|DBSCAN-SWA MNEARKPTQFLKVLYRLILSSISGIDGYSMGMTSARNYISELERNHLTGKVKRTTEKTADGMGQYYRYEIADAEQLKQVIAIYKAKGGELTAHEERQAYFRFR >LS483485|1466805:1492402|1473482_1475258_+|SQI98447.1|protease|DBSCAN-SWA MTSSYNTLPWQELRPELSVTEISSQPQDFFALQPRAEKAIRHFIKNSHRTLLVLKADDQAEYAPLLEQFIQSQKPMPDLCGVQYIIEQGDSFSFPRISAELAQSHDDNFATQKSVGTALYFDQFQLFGSVKIHATSHDIQLNPGLVHQLNSGVLIVTAGALLAQFDLWQRLKQILTTGVFNWYSAHPFKTLPCDIPSYPLQLKVIVLGNRTELATLEELEEDLYHLADYAEIESYCRVTNAEQQQQWMGYVQSVAQQHQLPMLDLSGFNKLYQLLVRDSENRELINISPLTLKNILTETALLTQSTSLSAVDFERYFLHKAEQFGFLREQTYDCILQEQIYVATEGEMVGQINGLSVIEYPGTPLVFGEPSRISCIVQYGDGEVVDVERKTELAGNIHSKGILIAEACLANILELPSQLPFSASLVFEQSYGEIDGDSASLAGFCVLVSALSDLPLPQSIAITGAIDQFGLVHSVGGINEKIEGFFTICQRRGLTGKQGVIIPSAVVNQLSLSETVISAVKNQEFFIYPVETVDQACEILLQRDLVEQENKTYTMDTMPLSRLINQRINQYADRQSHRYGFWDFLFSRKSH >LS483485|1466805:1492402|1482065_1482269_-|SQI98459.1|DBSCAN-SWA MSQSQTQSTQLITGADVCRRVSFGRTKLNELVRAKQFPQPIRFSQNFVRWDLEEVNQWIEEQKAARA >LS483485|1466805:1492402|1479962_1480277_-|SQI98452.1|DBSCAN-SWA MNDTNTPTQKHGTATLWSIESIITEIQEIADAGLNGAIYRANEHGERIAISKDEILDLILDQATLALMDVEDLKHQIEALAIHHKAKTEKDYLTLPVVDLGGAQ >LS483485|1466805:1492402|1484258_1485926_-|SQI98463.1|terminase|DBSCAN-SWA MEIWHEYAKKVQTGEIVACRKIKQAVARYFDDLANPAYFFDESAVNKFLAFSRLCPHVKGHLRGQPIELSDWQTFLFANLLGFKRTDTGLRKYRSAYIQVARKNAKSTVAAVLANWFLLMEAGQQDIYTAAVSRDQARIVFDDARQMCLLSPLLRKRLNIQQHKLINPKSNSLMRPLAAKSSTIEGTNPSLAIVDEYHLHTDNSVYSALELGQGARPEGLLFAITTAGSNVISACKQHYDYCAQILEGNEQNDSLFVLIFELDEESEIDNPENWIKANPNIGKSIPYLDFENTIKKARGIPSEWVEMLTKRFNVWCQGTTPWLGEGNWAQCVRDYTESDLLHQDCYLGLDLSSTNDLTSLCYTFPQGKKVRLITRHYIPEFQLNNVANKNRAIYRNWVRSGRLIATEGDCIDYDKIRDDILKDAENFNIKMIGFDVWNATHLRTQLQAAGLEVEPFPQTYQRFSPVAKSAEVLINRQVIEHHGDPVLSWALSNVVMETDANANIKPNKKKAANKIDPAVAFLMSFGTYQLEYGDLIFELSEEHKQALEQFNGIDL >LS483485|1466805:1492402|1480482_1480686_-|SQI98454.1|DBSCAN-SWA MRNLDLIDKINIETAYLKAMLNMFAAHFEDKGQRLDDLSYANIAYAINSHVENIENAVNDKSNWGEK >LS483485|1466805:1492402|1480901_1481117_-|SQI98456.1|DBSCAN-SWA MTNCNYTKPTFIFVAVRRADVNSKPQRLKITANTELEARAKLAKEFVLVFAGRINPQNTTKTNRTLEVIYA >LS483485|1466805:1492402|1466805_1468746_+|SQI98438.1|protease|DBSCAN-SWA MVKNLILWVVVAVVMMTAYQSFNAASSGGITDYTTFISDVENNQVRQAKFEDNEILVTKADGAKYTTVIPLEDKDLLNDLLKKKVKVEGTPPERRGLLSQILISWFPMLLLIGVWVFFMRQMQGGGGKTMSFGKSRARMMTQEQIKTTFADVAGCDEAKEEVAEIVDFLREPKKFQNLGGKIPKGILMVGPPGTGKTLLAKAIAGEAKVPFFTISGSDFVEMFVGVGASRVRDMFEQAKKNAPCLIFIDEIDAVGRQRGAGLGGGHDEREQTLNQMLVEMDGFEGNEGVIVIAATNRPDVLDPALTRPGRFDRQVVVGLPDVKGREQILKVHMRKVPVGPDVDAMTLARGTPGYSGADLANLVNEAALFAARTNKRIVTMVEFEKAKDKINMGPERRTMIMTDKQKESTAYHEAGHAIVGYLVPEHDPVHKVTIIPRGRALGVTFFLPEGDQVSISQKQLESKLSTLYAGRLAEDLIYGEENISTGASNDIKVATNIARNMVTQWGFSDKLGPILYTEDDGEVFLGRSMAKAKHMSDETAHVIDEEVRAIVNRNYERARQILIDNMDILHAMKDALVKYETIEEEQIKQLMNREPVTPPSGWEEPRDNDNKAQPQQPKAETPKTEDRESTKDTQSAVEKDTDSESL >LS483485|1466805:1492402|1486403_1486766_-|SQI98465.1|DBSCAN-SWA MPYQPLRRCSYPGCKNKVKSGRCEEHKPKDNRPNSSARGYDHKWSKYREQYLKHHPLCVMCLEQGKYTPATVIDHIKPVENGQSDPLFWVASNHQPLCRDCHSYKTRVIDQRGFGAKKID |
34 | uncultured_Caudovirales_phage(53.85%) | portal,capsid,head,tRNA,integrase,protease,tail,terminase | attL 1472211:1472232|attR 1504542:1504563 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|