Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
LS483492 | Serratia rubidaea strain NCTC10848 genome assembly, chromosome: 1 | 3 crisprs | cas3,DEDDh,csa3,DinG | 0 | 5 | 2 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LS483492_1 | 2895941-2896327 | Orphan |
I-F
Consensus repeat of LS483492_1
|
6 spacers
spacers of LS483492_1
>1.1|2895969|32|LS483492|PILER-CR,CRISPRCasFinder,CRT TACCAGGGGAGTTCCCTTGACAATCTTAATCA >1.2|2896029|31|LS483492|PILER-CR,CRISPRCasFinder,CRT CTTTGCCTTCTGATTCCCTGCTACTTTTGAT >1.3|2896088|32|LS483492|PILER-CR,CRISPRCasFinder,CRT TTCTTCCACGCTGCACGGGACTCTCGAGCACT >1.4|2896148|32|LS483492|PILER-CR,CRISPRCasFinder,CRT CAGAAGTCAACGATGTTCCGACTGTTCCGCCA >1.5|2896208|32|LS483492|PILER-CR,CRISPRCasFinder,CRT ACGTGTGGACGGACGGCATCGGTAACACGCAC >1.6|2896268|32|LS483492|PILER-CR,CRISPRCasFinder,CRT ATCAGGAGCAAGCGGTTGCCGCTGCCGGTATG |
CRISPR arrays and Neighbor proteins around LS483492_1
The CRISPR arrays of LS483492_1 >merge|LS483492|1|2895941-2896327|PILER-CR,CRISPRCasFinder,CRT TTTCTAAGCTGCCTGTACGGCAGTGAACTACCAGGGGAGTTCCCTTGACAATCTTAATCATTTCTAAGCCGCCTGTACGGCAGTGACTCTTTGCCTTCTGATTCCCTGCTACTTTTGATTTTCTAAGCTGCCTGTACGGCAGTGAACTTCTTCCACGCTGCACGGGACTCTCGAGCACTTTTCTAAGCTGCCTGTACGGCAGTGAACCAGAAGTCAACGATGTTCCGACTGTTCCGCCATTTCTAAGCTGCCTGTACGGCAGTGAACACGTGTGGACGGACGGCATCGGTAACACGCACTTTCTAAGCTGCCTGTACGGCAGTGAACATCAGGAGCAAGCGGTTGCCGCTGCCGGTATGTTTCTAAGCTGCCTGTACGGCAGTGAAC >LS483492|1|1|2895941-2896327|PILER-CR TTTCTAAGCTGCCTGTACGGCAGTGAAC TACCAGGGGAGTTCCCTTGACAATCTTAATCA TTTCTAAGCCGCCTGTACGGCAGTGACT CTTTGCCTTCTGATTCCCTGCTACTTTTGAT TTTCTAAGCTGCCTGTACGGCAGTGAAC TTCTTCCACGCTGCACGGGACTCTCGAGCACT TTTCTAAGCTGCCTGTACGGCAGTGAAC CAGAAGTCAACGATGTTCCGACTGTTCCGCCA TTTCTAAGCTGCCTGTACGGCAGTGAAC ACGTGTGGACGGACGGCATCGGTAACACGCAC TTTCTAAGCTGCCTGTACGGCAGTGAAC ATCAGGAGCAAGCGGTTGCCGCTGCCGGTATG TTTCTAAGCTGCCTGTACGGCAGTGAAC >LS483492|1|1|2895941-2896327|CRISPRCasFinder TTTCTAAGCTGCCTGTACGGCAGTGAAC TACCAGGGGAGTTCCCTTGACAATCTTAATCA TTTCTAAGCCGCCTGTACGGCAGTGACT CTTTGCCTTCTGATTCCCTGCTACTTTTGAT TTTCTAAGCTGCCTGTACGGCAGTGAAC TTCTTCCACGCTGCACGGGACTCTCGAGCACT TTTCTAAGCTGCCTGTACGGCAGTGAAC CAGAAGTCAACGATGTTCCGACTGTTCCGCCA TTTCTAAGCTGCCTGTACGGCAGTGAAC ACGTGTGGACGGACGGCATCGGTAACACGCAC TTTCTAAGCTGCCTGTACGGCAGTGAAC ATCAGGAGCAAGCGGTTGCCGCTGCCGGTATG TTTCTAAGCTGCCTGTACGGCAGTGAAC >LS483492|1|1|2895941-2896327|CRT TTTCTAAGCTGCCTGTACGGCAGTGAAC TACCAGGGGAGTTCCCTTGACAATCTTAATCA TTTCTAAGCCGCCTGTACGGCAGTGACT CTTTGCCTTCTGATTCCCTGCTACTTTTGAT TTTCTAAGCTGCCTGTACGGCAGTGAAC TTCTTCCACGCTGCACGGGACTCTCGAGCACT TTTCTAAGCTGCCTGTACGGCAGTGAAC CAGAAGTCAACGATGTTCCGACTGTTCCGCCA TTTCTAAGCTGCCTGTACGGCAGTGAAC ACGTGTGGACGGACGGCATCGGTAACACGCAC TTTCTAAGCTGCCTGTACGGCAGTGAAC ATCAGGAGCAAGCGGTTGCCGCTGCCGGTATG TTTCTAAGCTGCCTGTACGGCAGTGAAC
>LS483492.1|SQJ20932.1|2895624_2895831_-|Microcin-H47-immunity-protein-mchI MEVLNRTSAVLSLPLTILCVSFSSLTIMGEHNSLVDVFLSVIVFFGFLNLFRIAGKLIAWFNGNANQA >LS483492.1|SQJ20930.1|2894260_2895526_+|Probable-cysteine-desulfurase MTFTPERARSQFGALSQHYQDKPAIFFDGPGGSQVSHSVLEKMTGYLGSYNANLGGHYFSSQMTEQVMHNARASVRALLNAPSPDNIVFGMNMTSLTFHLSRIISRDWQPGDEIIVTALDHYANVSSWQQAAEDKQAIVHHIPLRPEDCTLDTERLCAQINDKTRLVAVTHASNVTGSVVDIRAITAAAQRVGAQVYVDAVHYAPHNLIDVQALSCDFLVCSAYKFFGPHIGMAYIAPQWLQRLRPYKVEPATDIGPGRFETGTQSFEGLAGVTAAIDYLAQWGTPGAPLRQRLQESFAAYHQHEEDLCRYFLQRLRQLEGARLYGYPEPDSTRRTPTFALTFKNFSPEQIAMLLGRHNICAGSGHFYAQGLIRQLNLQEHGGVLRIGMMHYNTRQEIDRLFEVLEDALAEADSPAAVDGR >LS483492.1|SQJ20928.1|2893619_2894264_+|Uncharacterized-protein-conserved-in-bacteria MLPGHENVVSMTSLSRQATQQLMPSFSDLPHTQHADGKYRLRRYSVITFRNNQVEDIGQRNFVQSDEFNHFQGNVVRHFEPIDSAVLQSEGMAEMCALFADVNGLPDGAKIEIHQMRVVAIYNETPVAPEGIHQDGFEHIAMASINRHNIVGGEVMLYDDNRSMPFFRRVLADSETVLLADNKLWHYFAPITAVAPGEEGHLDLLILTARREHA >LS483492.1|SQJ20927.1|2893241_2893361_+|Uncharacterised-protein MESLDYLAYLPVAIALNLVAIVAVAGISLWKIWRRKLRA >LS483492.1|SQJ20925.1|2892938_2893187_+|Bacterial-protein-of-uncharacterised-function-(DUF883) MFGKAEDKAKEVAGAAQEAFGEATGSPEHEIKGAARKYASQASYAVRDGADVVRHRVESNPLAGVAVAAAVGVVFGYLLGRK >LS483492.1|SQJ20923.1|2891908_2892532_+|transcriptional-regulator MSQVIYRIEGSIIFSPEDTGLYRHNQADSIVAISSAAARLFTLLIKNKGHIVERETILEKVWDEYGLQASNNNLNQCLSILRRIIKNMGIDKNIIETIPKVGLRISHVIQIEEINLHPPMPPVPGDPPTPDKRSRKKIKKKLFYSALLVGLLLMFSLFFNIFHLYNSYQSRPASLSLLNSTRGQAALCAVPPAETPRLSTVSLRTPH >LS483492.1|SQJ20921.1|2891156_2891582_-|Universal-stress-protein-A MAYQHIVVATDLGENATQLLQKGAELASALRAKLSLIYIDIHHTGYYAELGIGAYNYTDQKFSERAASILESFRNQTPHPIAEVIIGRGELSEELNNAAQAKGFDLIIFGHHHDLWSRLFSATRQAINTLQIDVLVIPIDN >LS483492.1|SQJ20919.1|2890122_2890878_-|3-oxoacyl-[acyl-carrier-protein]-reductase-FabG 
MRFDNKVVIITGAGNGMGEAAARRFAAEGATVVLADWAIDAVDAVAASLPAGKAHAVHIDVSDAEAVEAMMNEIAARFGHIDVLLNNAGVHVAGSVLETSVADWRRIAGVDIDGVVFCSKFALPYLLKSKGCIVNTASVSGLGGDWGAAYYCAAKGAVVNLTRAMALDHGGEGVRVNAVCPSLVKTNMTNGWPQEIRDKFNERIALGRAAEPEEVAAVMAFLASEDASFINGANIPVDGGATASDGQPKIV >LS483492.1|SQJ20917.1|2888804_2890085_+|Gamma-glutamylputrescine-oxidoreductase MTEHVKSYYAASANPHQPYPQLNESIECDVCIVGGGYTGLSSALFLVEAGYDVVVLESARIGFGASGRNGGQLVNSYSRDIDVIEQRYGADSARLLGSMMFEGAEIIRSRIKRYAIECDYRPGGIFAALNNKQYHMLIEQKGHWERYGNTQLELLDADRIRQEVASDRYVGALLDHSGGHIHPLNLALGEAEAIRLQGGRIFEQSAVTNIRHGDPALVSTAHGQVSARYVIVAGNAYLGDKLEPRLAKRSMPCGTQVVTTEPLAPEVAQSLIPQNYCVEDCNYLLDYYRITADHRLLYGGGVVYGARDPDDIDNLIRPKLLKTFPQLKGVRIDYRWTGNFLLTLSRMPQFGRLENNVYYMQGYSGHGVTCTHLAGKLIAELLRGDAERFDAFAKLPHLPFFGGRNLQIPFTAIGAAYYTLRDRIGV >LS483492.1|SQJ20914.1|2887302_2888793_+|Aldehyde-dehydrogenase-PuuC MDFHHLQYWQQRAQALTIENRLFINGRYQPAAEGGTFTVEDPAGQRELTQAARAGSADVGLAVSAARAAFERGDWSRAAPSRRKAVLLALAEQIEQHAETLALLETLDTGKPIRHSLRDDIPGAARCLRWYAEAIDKVYGEIAPTGADALALIEREPVGVVGAITPWNFPLLLACWKLGPALATGNSVVLKPSEKSPLSAIYLGQLAQQAGLPDGVLNIISGFGHDAGKALALHPDVDALTFTGSTLVAKQLMVYAGESNMKRVWLEAGGKSANIIFADCPDIDKAARAAAAGIFYNQGQVCIAGTRLLVEESIQQPFLQALHKHAAAFTPGNPLDPDTQMGTLIDSGHSAKVADFIKQGLDQGATLFLDGRGSGPHDAYLGPTVLTQVDNAMQVAREEIFGPVLAVTPFNGEQQAIELANDSDYGLGAALWTRDLSRAHRLARQLRAGSVFINNYNDGDMTVPFGGYKQSGNGRDKSLHALDKFTELKTTWISLE >LS483492.1|SQJ20935.1|2896675_2896966_+|Bor-protein MKKLLAAVVLAMTVTGCAQQTFTMKHETVAVPKQTTTHHFFVSGIGQKKAIDAAAVCGGADKVVRTEVQQTFPNILLGIVTFGIYTPREARIYCAQ >LS483492.1|SQJ20936.1|2897036_2897672_-|Major-phosphate-irrepressible-acid-phosphatase-precursor MGFLTPETAPDSLQILPPPPAENTTAFLNDKAAYETGKTLKDAQRQALAASDADYKNISAAFSAAFGEEISQEKTPALYALLQGVLQDSHDYAMRAAKDHYMRVRPFVVYKDSTCTPEKDKSMAKTGSYPSGHASFGWATALVLAEINPARETEILKRGYDFGQSRVVCGAHWQSDVDNGRLMGAAVVAALHSNQGFTDALAKAKAEFGKR >LS483492.1|SQJ20938.1|2898202_2898472_-|Uncharacterised-protein MKKLAFYKFIIGWGTANLAYAFAFLWLFIVWVIASHTVFRHEDYFHFKKILFNIITYGFNTWLDFAALYSTIFALLWALISVYRSKSVS 
>LS483492.1|SQJ20940.1|2898983_2900102_-|DNA-polymerase-V-subunit-UmuC MGVPYFKVKHPFERAGGIAFSSNYELYADMSMRVMSVLEEVAPRVEIYSIDESFMDLTGVRHCVDLATFGRQVRAKVLRDTGLTVGIGIAQTKTLAKLANHAAKKWPGTGGVVDLSSVGRQRKLMGLVPVAEVWGIGRRIAKKLQVMGIDTALQLADASTAMIRKNFSVVIEKTVRELRGEPCLELEEFAPTKQQILCSRSFGERITEYDDMRQAICMYATRAAEKLRGEHQYCMHISAWLKSSPFAINEQYYGNSASIKLSTPTQDTRDIIAAAMRCFDAIWQPGHRYQKAGVMLQDFFSHGVAQLGLFDEYQPQHNSERLMSVLDRINNSGRAKLWFAGQGMYQQWSMKREMLSPAYTTRVSDLPQARVC >LS483492.1|SQJ20942.1|2900252_2900675_-|DNA-polymerase-V-subunit-UmuD MTLFFPTPNPVKLELPLFSDKVPAGFPSPVADYVSSRIDLNEYCISHPNATYFLYATGDSMLEAGITEGSMLVVDRSIAPAHGDIVIACIAGEFTVKRLCLHPRAQLEPMNAKYDPIVLHDDGDSLDVVGVVVSSITRLK >LS483492.1|SQJ20944.1|2900900_2902154_-|Isocitrate-dehydrogenase-[NADP] MESKVVVPAEGKKITVDAQGKLVVPHNPIIPFIEGDGIGVDVTPAMINVVNAAVDKAYHGERKISWMEIYTGEKSTHVYGKDVWLPDETLDLIRDYRVAIKGPLTTPVGGGIRSLNVALRQQLDLYVCLRPVRYYQGTPSPVKHPELTDMVIFRENAEDIYAGIEWKAGSAEADKVIKFLRDEMGVQKIRFPEQCGIGVKPCSEEGTKRLVRAAIEYAITNDRDSVTLVHKGNIMKFTEGAFKDWGYELAREEFGGELIDGGPWLKIKNPNTGKEIVVKDVIADAFLQQILLRPAEYDVIACMNLNGDYISDALAAQVGGIGIAPGANIGDECALFEATHGTAPKYAGQDKVNPGSVILSAEMMLRHLGWFEAADLIVKGMEGAIAAKTVTYDFERLMEGAKLLKCSEFGDAIVEHM >LS483492.1|SQJ20946.1|2902256_2902913_+|Ribosomal-large-subunit-pseudouridine-synthase-E MRLSSASTIDMNKFPVKKHQVKRFSRPNSAKRPVPPGPRRVVLFNKPFDVLPQFTDEAGRSTLKEFIPFNDVYAAGRLDRDSEGLLVLTNDGKLQAQLTQPGKRTGKVYYVQVEGSPQESDLAQLRSGVTLKDGPTLPAGVERVAEPAWLWPRTPPIRERKSIPTSWLKITLYEGRNRQVRRMTAHIGFPTLRLIRYSMGNLSLGDLQPGEWKTTDSP >LS483492.1|SQJ20949.1|2902932_2903268_+|Uncharacterized-protein-conserved-in-bacteria,-putative-lipoprotein MKTLGTLLLTTLLCAAGAQAASFDCRKAASKDEQAICASRSLSDKDVEMATKYQFLRGLFAMGFRGEMQDRQNSWLAKRKQCGSDVSCLSNSYRQRIGELDAIYNKIDRPL >LS483492.1|SQJ20951.1|2903277_2903724_+|Phosphatase-nudJ MFKPHVTVANVVHAAGKFLIVEETVNHKALWNQPAGHLEADETLLQAAERELWEETGIRAAPQSFLKMHQWIAPDGTPFLRFCFVVELPAIVPTAPQDSDIDRCLWLSADEILHAPNLRSPLVAESLRCYRQPERYPLSLIGSFNWPY >LS483492.1|SQJ20953.1|2903820_2904933_+|tRNA-specific-2-thiouridylase-mnmA 
MSDNSQKKVIVGMSGGVDSSVTAYLLQQQGYQVAGLFMKNWEEDDDEEYCSAATDLADAQAVCDKLGIELHTVNFAAEYWDNVFELFLEEYKAGRTPNPDILCNKEIKFKAFLEFAAEDLGADYIATGHYVRRQDVDGKSRLLRGVDGNKDQSYFLYTLSHEQVAQSLFPVGELEKPEVRRIAEQLELVTAKKKDSTGICFIGERKFRDFLGRYLPAQPGPIMTVDGQTIGQHQGLMYHTLGQRKGLGIGGMKDSSEDPWYVVDKDVANNVLIVAQGHDHPRLMSLGLIAQQLHWVDRQPLTAPLRCTVKTRYRQQDLPCTITPIDDQRIEVRFDEPVAAVTPGQSAVFYQGEICLGGGIIEQRLPLEEG |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LS483492_2 | 4444718-4444866 | Orphan |
I-F
Consensus repeat of LS483492_2
|
2 spacers
spacers of LS483492_2
>2.1|4444747|31|LS483492|PILER-CR,CRISPRCasFinder ATCGCTGGTATGAACCGCCGGTTTCATTCCC >2.2|4444807|31|LS483492|PILER-CR,CRISPRCasFinder GCTGCCCCTGGGTAGATGCGGTATCGCCATA |
CRISPR arrays and Neighbor proteins around LS483492_2
The CRISPR arrays of LS483492_2 >merge|LS483492|2|4444718-4444866|PILER-CR,CRISPRCasFinder TTCTCTAAGCTGCCTGTACGGCAGTGAACATCGCTGGTATGAACCGCCGGTTTCATTCCCTTTTCTAAGCTGCCTGTACGGCAGTGAACGCTGCCCCTGGGTAGATGCGGTATCGCCATATTTTCTAAGCTGCCTGTACGGCAGTGAAC >LS483492|2|2|4444718-4444866|PILER-CR TTCTCTAAGCTGCCTGTACGGCAGTGAAC ATCGCTGGTATGAACCGCCGGTTTCATTCCC TTTTCTAAGCTGCCTGTACGGCAGTGAAC GCTGCCCCTGGGTAGATGCGGTATCGCCATA TTTTCTAAGCTGCCTGTACGGCAGTGAAC >LS483492|2|2|4444718-4444866|CRISPRCasFinder TTCTCTAAGCTGCCTGTACGGCAGTGAAC ATCGCTGGTATGAACCGCCGGTTTCATTCCC TTTTCTAAGCTGCCTGTACGGCAGTGAAC GCTGCCCCTGGGTAGATGCGGTATCGCCATA TTTTCTAAGCTGCCTGTACGGCAGTGAAC
>LS483492.1|SQJ30288.1|4443844_4444396_+|Protein-of-uncharacterised-function-(DUF2778) MARLYQSRMGNMALYGKFIVNDADYSPLTIYGIGTFMAFSGDGNYRNAPGCGMIPNEGPIPSGKYWIVDRPTGGIREPIATWFSDVLNRNLRNIPTGKNEWFGLFRDDGKIDDHTWINGVKRGNFRLHPGTISYGCITLKHHSDFAAIRSALLRTKTINIRNMRLRACGIIDVISNGECKVSG >LS483492.1|SQJ30286.1|4443165_4443600_-|Uncharacterized-protein-conserved-in-bacteria MEQSEQQQHIIRFLSKQHVLTLCAGNGVEMWCANCFYVFDAERMALWLMTEPHTRHGQLMQQHAEVVGTIAPKPKSIALIKGVQYRAEARILSGDEEQQARARYCKRFPVAKAMPATLWQLSLQEVKMTDNVLGFGTKLFWARS >LS483492.1|SQJ30283.1|4442446_4443088_-|Uncharacterised-protein MARVLITGATGLVGGELLRLLLADRRVTAITAPSRRPLPPHDKLTNPVGDDLFALLTELDQPVDAVFCCLGTTRQQAGSDGNFHYVDYTLVVESALTGRRLGAQHCLAVSAMGASPHSTFLYNRIKGEMEQALREQGWQRLTLVRPSMLLGKRAAPRLLERVAQPLFRLLPGRWKSVAARDVAQTLLEQAFSPGDGVRVLESDRLQHGERLAH >LS483492.1|SQJ30281.1|4441701_4442499_+|carbon-phosphorus-lyase-complex-accessory-protein MQLTFLGTGGAQQVPAFGCECAICQRARREPARRRRACSAMLDYQGETTLIDAGLTSLERRFSAGQIQRFLLTHYHMDHVQGLFHLRWGCGSAIPVYGPPDEQGCDDLFKHPGILDFQPPLAPFVSVTLGGLRITPLPLIHSKITHGYLIQSADRTLAYLTDTVGLPPDTLRFLQGVRLDLLVLDCSLPPQAQAPRNHNDLTRALEIQQRLLPQRTLLTHISHHLDAWLLTHPLPPGLELAYDGLCIDVSAPDAPRAAADRTPAP >LS483492.1|SQJ30279.1|4441150_4441711_+|Ribose-1,5-bisphosphate-phosphokinase-PhnN MAQLIYLMGASGSGKDSLLAALRNAADNAPLVAHRYITRPAQAGCENHVALSEREFLRRRANGLFALDWQAHHTRYALGIEVDLWLLQGIDVVINGSRAYLPYARQRYGAQLLPLCLQVEPAVLRRRLERRGREDSGQIEQRLARAAAYSVPSDCLRLDNNGELSDTLAALLQLLTALRREETPCN >LS483492.1|SQJ30277.1|4440014_4441151_+|phosphonate-metabolism-protein-PhnM MIINNVKLVLDDQVVNGSLEMGDGVIRSFADGYSRLPQALDGEGGWLLPGLIELHTDNLDKFFMPRPNVDWPAHSAMSSHDALMVACGITTVLDAVAIGDVRDGGHRLENLQKMIDAVIHSQRAGVNRAEHRLHLRCELPNAGTLPLFEQLMDKPGVSLVSLMDHSPGQRQFASRAKYREYYQGKYNLSDAAMDEYEQQQVALSARWATPNRQAIAERCRERGIALASHDDATAAHVAESCALGSAIAEFPTTEEAARASHRQGLQVLMGAPNIVRGGSHSGNVAAHQLAAQGVLDILSSDYYPASLLDAAFLLAADVRNGYDLPQSVAMVTRNPARALGLADRGTIAEGLRADLVLARAHGEHIYVQHVWRQGRQVF >LS483492.1|SQJ30275.1|4439310_4440015_+|Lipoprotein-releasing-system-ATP-binding-protein-LolD 
MTIKIRVEHLSKTFVLHQQHGTRLPVLHDASLTVSAGECVVLHGHSGSGKSTLLRSLYANYLPDSGHIWLDHQGEWIDMAQAEARQILAVRRRTLGWVSQFLRVIPRISALEVVMQPLLELGVAHDECRERASALLSRLNVPQRLWGLAPSTFSGGEQQRVNIARGFIVDYPILLLDEPTASLDARNSAAVVALVEQARRRGAAIVGIFHDDQVRAQVADRLHIMQAPQALETP >LS483492.1|SQJ30273.1|4438506_4439295_+|Glutathione-import-ATP-binding-protein-GsiA MSTTPLHDAPLLSVERLTHLYAPGKGFSDVSFDIYSGEVLGIVGESGSGKTTLLKSISARLAPQQGRILYRPQAGQEQDLYAMAESARRRLLRTDWGVVHQHPLDGLRPQVSAGGNIGERLMAVGQRHYGDIRRQAGQWLEDVEIPLSRLDDLPTTFSGGMQQRLQIARNLVTHPKLVFMDEPTGGLDVSVQARLLDLLRTLVVEMRLAAVIVTHDLGVARLLAHRLLVMKQGEVVESGLTDRVLDDPHHPYTQLLVSSVLS >LS483492.1|SQJ30271.1|4437649_4438510_+|Phosphonate-metabolism-protein-PhnJ MSNVLSGYNPGYLDEQTKRMIRRAILKGVAIPGYQVPFGGREMPMPYGWGTGGIQLTASVVGDADVLKVIDQGADDTTNAVSIRRFFQRVAGVATTERTVEATLIQTRHRIPETPLREDQILIYQVPIPEPLRFIEPRETETRKMHALEEYGVMQVKLYEDIARYGHIATTYAYPVKVNDRYVMDPSPIPKFDNPKMHMMPALQLFGAGREKRIYALPPYTKVVSLDFDDHPFSVQQWEQPCALCGSRSSYLDEVVLDDQGNRMFVCSDTDYCRQQLASSTETPRP >LS483492.1|SQJ30268.1|4436565_4437657_+|Bacterial-phosphonate-metabolism-protein-(PhnI) MYVAVKGGEKAIEAAHRLQEQLRRGDPAQPALTAAQVEQQLGLAVDRVMTEGGVYDRELAALAIKQASGDLVEAIFLLRAYRTTLPRLAVSEPLLTEQMRLERRISAIYKDLPGGQLLGPTYDYTHRLLDFALLADGGAPDATPQLADDAPATGCPHVFSLLSRQQLAQAEQDDGSTPADITRTPPVYPCPRSARLQQLVRGDEGFLLALGYSTQRGYGRNHPFAGEIRSGYVTVEIVPEELGFAIDIGEILLTECEMVNGFVAPEQQPPHFTRGYGLVFGRAERKAMAMALVDRALQKDEYAEDGDSPAQDEEFVLSHADNVEAAGFVSHLKLPHYVDFQAELELLQRLRQRGEPTQETPHE >LS483492.1|SQJ30290.1|4445509_4446214_+|Uncharacterised-protein MSKRSDIIDGAEASSAPYGLVYTEVIGLIDLGHAQGTDIRNLMRTIDSGKSSSEEYYRVSYSQSMRDPTKTIKMGKFITWRIKHGRSYYERQSIALAMMMSLSLKFEGLQASFPVSLVTDSGFSGEDLISNLLGFYRAVSLQNPFGMLRPVSKAEALKRWDHYGKIGSWKNETFKPILFPDPKRFPHATPRRGELPSFMKTVRPWSDFRSGIVKIATANGSYMDSARNGVLPYA >LS483492.1|SQJ30292.1|4446934_4447561_-|Leucine-efflux-protein MLGTAQIISYAAALGLAAAVPGPGMTALVARSVSGGAVTGFTMLTGLILGDLIYLSTAVFGLAVIAHTYTSLFTLINWAAACYLFFLAWQFWRYQPQAVNIDQKATRRELTSAWLSGLTITLGNPKTIAFYLAILPLVISLDRVSLQVWGTMLVPLTMSVLLAVGAIFILAAIKIRRFLTSASSQRILFRTTGVIMVLAAFGMVMKSL 
>LS483492.1|SQJ30294.1|4448065_4448554_+|Leucine-responsive-regulatory-protein MPLDMTEMKILRLLQDDARVTNQALAEKIGMSASPCWRKVRKLEEEEVIQGYRAVLDRKKIGLGVMVFIRVAIDSHSETEAKKFEEEVTALEDVVACYSIGGDADFLLQVVAPDLDSYADFAMSVVRRLPGIKEMQSMFVLKEIKPLVSYPIKKTAQTKSGQ >LS483492.1|SQJ30296.1|4448672_4448894_-|Uncharacterised-protein MKRVIAIIVGTAIGIVLIWLAYPYISDWLVGPVHGEDQMSANFVLLLVGLGVGCVVGSLVGGLAYSRLKKRMR >LS483492.1|SQJ30299.1|4449725_4450019_+|GIY-YIG-nuclease-superfamily-protein MAETTTWHLYMLRLPSGMLYTGITTDVARRLAQHQAGKGAKALRGKGELQLAFHCAVGDRSLALRLEYRIKQLSKVQKERLVSHPPLSLDYLLVKNG >LS483492.1|SQJ30301.1|4450005_4450509_-|Predicted-acetyltransferase MLIRVEIPVDAAGIDALLRSAFGRDDEADLVRQLREDGLLTLGVVATDDEGGVVGYAAFSPVDVEGEDRQWVGLAPLAVEERLRRQGLGEKLVYEGLDALNEFSYAAVVVLGEPAYYSRFGFKPAAEFGLSCRWPGCEQAFQVYPLAQDALDGVNGAVEYAAPFNRF >LS483492.1|SQJ30303.1|4450502_4451042_-|Putative-lipid-carrier-protein MKEWMTVLEKLRSRIVRQGPSLLRVPLKLTPFALQRQLLEQVLGWQFRQALADGDLEFLESRWLKIEVRDLALQWFMTVENDRLVVRQHAQADVSFSGDANDLILIAARKQDPDTLFFQRRLQIEGDTELGLYVKNLMDAIELEAMPAPLRIGLLQLAEFVEAGLQEGAEPSSRVVAPC >LS483492.1|SQJ30305.1|4451276_4452272_+|Uncharacterized-protease-yhbU-precursor MELLCPAGNLPALKAAVDNGADAVYIGLKDDTNARHFAGLNFSEKKLQEAVEYVHRHGRKLHIAINTFAHPDGYARWQRAVDMAAQLGADALILADLAMLEYAAERYPHIERHVSVQASATNEEAIRFYQRHFAVHRVVLPRVLSMHQVKQLARTSPVPLEVFAFGSLCIMAEGRCYLSSYLTGESPNTVGACSPARFVRWQQTPQGMESRLNDVLIDRYQEGENAGYPTLCKGRYLVDDVRYHALEEPTSLNTLALLPELLAANIASVKIEGRQRSPAYVSQVARVWRQAIDRCIADPAGYQPDAGWMAALGAMSEGTQTTLGAYHRKWQ >LS483492.1|SQJ30306.1|4452283_4453162_+|putative-protease MKYALGPVLYYWPKTETESFYQQAMHSSADIIYLGESVCTKRREMKVGDWLALAKTVAQSGKQVVLSTLALLQAPSELNELKRYVENGEFLIEANDLGTVNMAAERGLPFVAGHALNCYNAYTLRLLHKQGMTRWCMPVELSRDWLGKVLEQCGELGFRRGFEVEVLGYGHLPLAYSARCFTARSENRGKDECETCCIKYPQGRAMRSQEDQQVFVLNGIQTMSGYCYNLGNELSGMQELVDIVRLSPQGVETLTMLEQFRANENAQQPLALAGSSDCNGYWRRVAGLELVQ >LS483492.1|SQJ30310.1|4453598_4454738_-|daunorubicin-resistance-ABC-transporter-membrane-protein 
MKSYWQTFSRVLIGMLERPMWLMLILSLCVMSLVYANRSVWDLPVGVIDQDHSSASRELIRQLDATSKIAIKTYDSLDQAQRDLGWRELFAVIIMPVDLEKKILHGENIVVPVYGDATNRLANGQIQQDVVTAYQQLLNQYNTSLLLRSGFSERQAQIILTPIQGQTLDVFNPGISFAAIVFPGLLVMLLQHSLLIASVRVSIALKSAPSGKPSIPVYLGGLSALLPIWLFLSIVLFVLWPWVLGYRQTANIAEVLLLTFPFLLAVLGLGKLVTECLRSVEMIYLTLAFITTPIFYLSGTIWPLQAMPGWVRAISYCIPSTWATKAIAGVNQMGMSLNEVGNDVLMLLLLGAVYTVIGIGVGLVHNSVALRSLFRKRRA |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LS483492_3 | 4453190-4453402 | Orphan |
I-E
Consensus repeat of LS483492_3
|
3 spacers
spacers of LS483492_3
>3.1|4453220|31|LS483492|CRISPRCasFinder GATGATTTCGTCATGATTGGTGACTCCCTCG >3.2|4453281|31|LS483492|CRISPRCasFinder GTCATGTTGACGAATCCGCACTCGATCGCCC >3.3|4453342|31|LS483492|CRISPRCasFinder CGTAATAACAGCCAGTAATAACAGATATGAG >3.4|4453281|32|LS483492|PILER-CR GTCATGTTGACGAATCCGCACTCGATCGCCCG >3.5|4453342|32|LS483492|PILER-CR CGTAATAACAGCCAGTAATAACAGATATGAGG |
CRISPR arrays and Neighbor proteins around LS483492_3
The CRISPR arrays of LS483492_3 >merge|LS483492|3|4453190-4453402|CRISPRCasFinder,PILER-CR TAGATTGATCCCCGCTCACGCGGGGAACACGATGATTTCGTCATGATTGGTGACTCCCTCGCCGGTTTACCCCCGCTCACGCGGGGAACACGTCATGTTGACGAATCCGCACTCGATCGCCCGCGGTTTATCCCCGCTCGCGCGGGGAACACCGTAATAACAGCCAGTAATAACAGATATGAGGCGGTTTATCCCCGCTCGCGCGGGGAACAC >LS483492|3|3|4453190-4453402|CRISPRCasFinder TAGATTGATCCCCGCTCACGCGGGGAACAC GATGATTTCGTCATGATTGGTGACTCCCTCG CCGGTTTACCCCCGCTCACGCGGGGAACAC GTCATGTTGACGAATCCGCACTCGATCGCCC GCGGTTTATCCCCGCTCGCGCGGGGAACAC CGTAATAACAGCCAGTAATAACAGATATGAG GCGGTTTATCCCCGCTCGCGCGGGGAACAC >LS483492|3|3|4453252-4453402|PILER-CR CGGTTTACCCCCGCTCACGCGGGGAACAC GTCATGTTGACGAATCCGCACTCGATCGCCCG CGGTTTATCCCCGCTCGCGCGGGGAACAC CGTAATAACAGCCAGTAATAACAGATATGAGG CGGTTTATCCCCGCTCGCGCGGGGAACAC
>LS483492.1|SQJ30306.1|4452283_4453162_+|putative-protease MKYALGPVLYYWPKTETESFYQQAMHSSADIIYLGESVCTKRREMKVGDWLALAKTVAQSGKQVVLSTLALLQAPSELNELKRYVENGEFLIEANDLGTVNMAAERGLPFVAGHALNCYNAYTLRLLHKQGMTRWCMPVELSRDWLGKVLEQCGELGFRRGFEVEVLGYGHLPLAYSARCFTARSENRGKDECETCCIKYPQGRAMRSQEDQQVFVLNGIQTMSGYCYNLGNELSGMQELVDIVRLSPQGVETLTMLEQFRANENAQQPLALAGSSDCNGYWRRVAGLELVQ >LS483492.1|SQJ30305.1|4451276_4452272_+|Uncharacterized-protease-yhbU-precursor MELLCPAGNLPALKAAVDNGADAVYIGLKDDTNARHFAGLNFSEKKLQEAVEYVHRHGRKLHIAINTFAHPDGYARWQRAVDMAAQLGADALILADLAMLEYAAERYPHIERHVSVQASATNEEAIRFYQRHFAVHRVVLPRVLSMHQVKQLARTSPVPLEVFAFGSLCIMAEGRCYLSSYLTGESPNTVGACSPARFVRWQQTPQGMESRLNDVLIDRYQEGENAGYPTLCKGRYLVDDVRYHALEEPTSLNTLALLPELLAANIASVKIEGRQRSPAYVSQVARVWRQAIDRCIADPAGYQPDAGWMAALGAMSEGTQTTLGAYHRKWQ >LS483492.1|SQJ30303.1|4450502_4451042_-|Putative-lipid-carrier-protein MKEWMTVLEKLRSRIVRQGPSLLRVPLKLTPFALQRQLLEQVLGWQFRQALADGDLEFLESRWLKIEVRDLALQWFMTVENDRLVVRQHAQADVSFSGDANDLILIAARKQDPDTLFFQRRLQIEGDTELGLYVKNLMDAIELEAMPAPLRIGLLQLAEFVEAGLQEGAEPSSRVVAPC >LS483492.1|SQJ30301.1|4450005_4450509_-|Predicted-acetyltransferase MLIRVEIPVDAAGIDALLRSAFGRDDEADLVRQLREDGLLTLGVVATDDEGGVVGYAAFSPVDVEGEDRQWVGLAPLAVEERLRRQGLGEKLVYEGLDALNEFSYAAVVVLGEPAYYSRFGFKPAAEFGLSCRWPGCEQAFQVYPLAQDALDGVNGAVEYAAPFNRF >LS483492.1|SQJ30299.1|4449725_4450019_+|GIY-YIG-nuclease-superfamily-protein MAETTTWHLYMLRLPSGMLYTGITTDVARRLAQHQAGKGAKALRGKGELQLAFHCAVGDRSLALRLEYRIKQLSKVQKERLVSHPPLSLDYLLVKNG >LS483492.1|SQJ30296.1|4448672_4448894_-|Uncharacterised-protein MKRVIAIIVGTAIGIVLIWLAYPYISDWLVGPVHGEDQMSANFVLLLVGLGVGCVVGSLVGGLAYSRLKKRMR >LS483492.1|SQJ30294.1|4448065_4448554_+|Leucine-responsive-regulatory-protein MPLDMTEMKILRLLQDDARVTNQALAEKIGMSASPCWRKVRKLEEEEVIQGYRAVLDRKKIGLGVMVFIRVAIDSHSETEAKKFEEEVTALEDVVACYSIGGDADFLLQVVAPDLDSYADFAMSVVRRLPGIKEMQSMFVLKEIKPLVSYPIKKTAQTKSGQ >LS483492.1|SQJ30292.1|4446934_4447561_-|Leucine-efflux-protein 
MLGTAQIISYAAALGLAAAVPGPGMTALVARSVSGGAVTGFTMLTGLILGDLIYLSTAVFGLAVIAHTYTSLFTLINWAAACYLFFLAWQFWRYQPQAVNIDQKATRRELTSAWLSGLTITLGNPKTIAFYLAILPLVISLDRVSLQVWGTMLVPLTMSVLLAVGAIFILAAIKIRRFLTSASSQRILFRTTGVIMVLAAFGMVMKSL >LS483492.1|SQJ30290.1|4445509_4446214_+|Uncharacterised-protein MSKRSDIIDGAEASSAPYGLVYTEVIGLIDLGHAQGTDIRNLMRTIDSGKSSSEEYYRVSYSQSMRDPTKTIKMGKFITWRIKHGRSYYERQSIALAMMMSLSLKFEGLQASFPVSLVTDSGFSGEDLISNLLGFYRAVSLQNPFGMLRPVSKAEALKRWDHYGKIGSWKNETFKPILFPDPKRFPHATPRRGELPSFMKTVRPWSDFRSGIVKIATANGSYMDSARNGVLPYA >LS483492.1|SQJ30288.1|4443844_4444396_+|Protein-of-uncharacterised-function-(DUF2778) MARLYQSRMGNMALYGKFIVNDADYSPLTIYGIGTFMAFSGDGNYRNAPGCGMIPNEGPIPSGKYWIVDRPTGGIREPIATWFSDVLNRNLRNIPTGKNEWFGLFRDDGKIDDHTWINGVKRGNFRLHPGTISYGCITLKHHSDFAAIRSALLRTKTINIRNMRLRACGIIDVISNGECKVSG >LS483492.1|SQJ30310.1|4453598_4454738_-|daunorubicin-resistance-ABC-transporter-membrane-protein MKSYWQTFSRVLIGMLERPMWLMLILSLCVMSLVYANRSVWDLPVGVIDQDHSSASRELIRQLDATSKIAIKTYDSLDQAQRDLGWRELFAVIIMPVDLEKKILHGENIVVPVYGDATNRLANGQIQQDVVTAYQQLLNQYNTSLLLRSGFSERQAQIILTPIQGQTLDVFNPGISFAAIVFPGLLVMLLQHSLLIASVRVSIALKSAPSGKPSIPVYLGGLSALLPIWLFLSIVLFVLWPWVLGYRQTANIAEVLLLTFPFLLAVLGLGKLVTECLRSVEMIYLTLAFITTPIFYLSGTIWPLQAMPGWVRAISYCIPSTWATKAIAGVNQMGMSLNEVGNDVLMLLLLGAVYTVIGIGVGLVHNSVALRSLFRKRRA >LS483492.1|SQJ30312.1|4454734_4455889_-|ABC-2-family-transporter-protein MAMERVRAGWRCFSHAFSRECRVAFRSPVIHWLGWLFPLILFGLISSNFSEGTLLNLPVSVVDNDHSPLSKSLVRKLDAGSHAHVEAHDGGLSESLRRLRSAQDYALLYVPPDLEANALAGKQPSIVLYYNALFYGAGLYSTQDFSGLITETNGSYRSIIAASMGKTLPPLADVTLNYGSLFNASGSYIYYQQFAATIHLLQLFVVTCMIYVMARSKSLLSARPFSLALLGKLAPYTISFTTLLMIEIAALVGIFDARVSGNPLFMLLIGFFYVAAAQSIGLLLFTFTGSAITAYSLIGILVSIAMTYSGMAVPELSMPLPARIISNIEPLTHALYAMFDVFLRQVPARAIFGVCALLLVYPLVTAFLVRNRLPKRLAKDEGAL >LS483492.1|SQJ30314.1|4455870_4456842_-|putative-efflux-pump-membrane-fusion-protein 
MNKKTLFTLLLVVIAIAMAVLFRSHNQDLLLQGEVDAPEVIVASKAKGRVVERLIERGDDVKAGQVMITLDSPELLAQLRSAQATRDEAKAQLDQSLHGTREENIRSLRASLAQAQAELRNAQHDYDRNRSVAAKGYISKSELDSSRRSRDTAYQQVQDAKAKLDEGINGDRIELRQQYAAALRAAEENLLEVQAQADDLQVKAPVDGEVGPIPAEVGELLNASSPLVTLIRLPDAYFVFNLREDILANIRKGDKVKLRVPALKDKVIDAEVRYIAPLGDYATKRATRATGDFDLKTFEVRLYPAQPIEGLRPGMSSLWQWKE >LS483492.1|SQJ30319.1|4457052_4458066_+|Limonene-1,2-monooxygenase MTQNASVPLSVLDLSPIPQGAKARDAFHCSLDLAQHAEKWGFQRYWLAEHHNMTGIGSAATSVLLGYLAAGTDTIRLGSGGVMLPNHAPLVIAEQFGTLESLYPGRIDLGLGRAPGTDQRTMMALRRHLSGEVDNFPRDVQELQHYFAEVQPGQPVQAVPGQGLHVPIWLLGSSLYSAQLSAKLGLPFAFASHFAPDMLFQALRLYRENYQPSERWPQPHAMVCVNVIAADSERDARFLFSSMQQQFINLRRGEPGPLPPPVDNIHALWSAGEQYGVEQALRMSIVGDENNVREGLQALLRETEADEIMVNGQIFDHQARLRSFEIVAQVKDALTRG >LS483492.1|SQJ30342.1|4458108_4459548_-|High-copy-suppressor-of-rspA MSTPSSAAVLGRAGLIILLAGQLLPMIDFSIVNVGLDAMAQSLHATPTQLELIVAVYGVAFAVCLAMGGRMGDNLGRRRLFLWGVRLFALASLMCGLANSVWLLLVARTLQGGGAAMIVPQIMATIHVCLRGREHARAMALYGGIGGLAFIIGQVLGGLLISLDIAGYGWRSVFLINIPLCVLVLVCAHRWVPDTRAQRRVNIDWPGTLLLALTIVCLLFPLALGPVWHWPWPCLALLALSGPLLLQLWRVERRQERRNNFPLLPPALLRLASVRFGLTIAILFFSCWSGFMFVMALTLQAGAKLNAFQSGNAFIALGVAYFISSLLSSTVIEHFGKVRTLLFGCLIQIGGLLALMLTMLLVWPQPGIVNLIPATLLIGAGQALIVSGFFRIGLADVPVEHAGSGSALLATVQQTSLGLGPILLGTVLVQVLHASPGDYLHALLAALVLELLLMLALLLRTLLVLRADRRLAAGQPSRT >LS483492.1|SQJ30343.1|4459669_4460536_+|Uncharacterised-protein MSEITSPTDAGQADNRKQLGAFLRARRESLDPNRLGLPRMRQRRTPGLRREEVAQLADVGITWYTWLEQGRAIQASTKTLSAIAHALQCNEAETHHLFRLAGQSLPASAQHGGCEKLSPYGQRLLDQLDPLPAIISNARFDILGFNRAYCLLMGVDLATLPPEDRNCIYLAFTHEQWRASQLDWDDIMPKMVALFRAQMAEHLGDPVWERQLARYLSVSEDFRALWERHEVRSIDNNVKRFLHPQVGEMSLRQNNWWSAPRNGDRMLVYIPADEQSERRLQQLTGGAG >LS483492.1|SQJ30344.1|4460558_4460798_-|Predicted-transporter-component MTQQTIVPDYRLDMLGEPCPYPAVATLEAMPQLKAGEILEVISDCPQSINNIPLDARNHGYKVLDIQQDGPTIRYLIQR >LS483492.1|SQJ30345.1|4460797_4462009_-|putative-inner-membrane-protein 
MNWQQFKTDYLVRFWAPLPAVIAAGLLATYYFGLTGTFWAVTGEFTRWGGHVLQWLGLHPEQWAYFKVIGLEGTPLTRIDGRMIIGMFAGCIAAALWANNIKLRRPQHRIRIVQALLGGIIAGFGARLAMGCNLAAFFTGIPQFSLHAWLFALATAAGSYFGAKFTLLPLFRVPIKLQKVQAAAPLTQLPQQARRRFRLGMLVFLLMSAWSLWTLFDAPKLGIAMLFGIAFGLLIERAQICFTSAFRDLWITGRTHMARAIILGMAVSAIGIFSYVQLGVAPKIMWAGPNALIGGLLFGFGIVLAGGCETGWMYRAVEGQVHYWWVGLGNIIGATLLAYYWDALAPSLATDYDKINLLEVFGPTGGLLVTYLLLALALVAMLWWEKRFFRAKARAERVDARSM >LS483492.1|SQJ30346.1|4462139_4463996_-|Cold-shock-DEAD-box-protein-A MTTELETSFADLGLSAPIISALNDLGYVKPSPIQAECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLHNIDASLKAPQILVLAPTRELAVQVAEAMTDFSKHMNGVNVVALYGGQRYDVQLRALRQGPQIVVGTPGRLLDHLKRGTLNLSNLSGLVLDEADEMLRMGFIEDVETIMAEIPAEHQTALFSATMPEAIRRITRRFMKEPQEVRIQSSITTRPDISQSYWTAYGMRKNEALVRFLEAEDFDAAIIFVRTKNATLEVAEALERNGYSSAALNGDMNQALREQTLERLKDGRLDILIATDVAARGLDVERISLVVNYDIPMDAESYVHRIGRTGRAGRAGRALLFVENRERRLLRNIERTMKLTIPEVELPNAELLGKRRLEKFAAKVQQQLESSDLDMYRALLAKLQPEEELEMETLAAALLKMAQGERPLILPPDPVFKPRKPREFNDRDDRRGDRRDGDRPRRERRDVGEMQLYRIEVGRDDGVEVRHIVGAIANEGDISSRYIGNIKLFGSHSTIELPKGMPGEILNHFTRTRILNKPMNMQLLGDAQPHDRRERRDGGNRRPFNGERREGGPRRSFGERREGGNAAGNGERRGGNANRERAPRRRFGDA >LS483492.1|SQJ30347.1|4464185_4465070_-|Lipoprotein-NlpI-precursor MKLFLRWCYVATALMLAGCSNNDWRKDEVLAIPLQPTLQQEVILARMEQILASRALTDDERAQLLYERGVLYDSLGLRALARNDFSQALAIRPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYARLNRGIALYYGGRFPLAQDDLQAFYQDDPNDPFRSLWLYLVEREIDPKMAVIALQQRYDKADRGQWGWNIVEFYLGDISENTLMERLKADATDNTSLAEHLSETDFYLGKHYLSLGDKNSASALFKLTVANNVHNFVEHRYALLELALLGQEQDDLSESDQQ |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
LS483492_1 | 1.5|2896208|32|LS483492|PILER-CR,CRISPRCasFinder,CRT | 2896208-2896239 | 32 | MK433577 | Klebsiella phage ST512-KPC3phi13.6, complete genome | 18074-18105 | 5 | 0.844 |
LS483492_1 | 1.5|2896208|32|LS483492|PILER-CR,CRISPRCasFinder,CRT | 2896208-2896239 | 32 | MK433581 | Klebsiella phage ST258-KPC3phi16.1, complete genome | 18074-18105 | 5 | 0.844 |
LS483492_1 | 1.5|2896208|32|LS483492|PILER-CR,CRISPRCasFinder,CRT | 2896208-2896239 | 32 | MK416011 | Klebsiella phage ST437-OXA245phi4.1, complete genome | 18074-18105 | 5 | 0.844 |
LS483492_1 | 1.6|2896268|32|LS483492|PILER-CR,CRISPRCasFinder,CRT | 2896268-2896299 | 32 | NZ_CP009619 | Vibrio coralliilyticus strain RE98 plasmid p380, complete sequence | 309213-309244 | 6 | 0.812 |
LS483492_2 | 2.1|4444747|31|LS483492|PILER-CR,CRISPRCasFinder | 4444747-4444777 | 31 | NC_049342 | Escherichia phage 500465-1, complete genome | 7539-7569 | 7 | 0.774 |
LS483492_2 | 2.1|4444747|31|LS483492|PILER-CR,CRISPRCasFinder | 4444747-4444777 | 31 | CP025900 | Escherichia phage sp., complete genome | 7539-7569 | 7 | 0.774 |
LS483492_1 | 1.5|2896208|32|LS483492|PILER-CR,CRISPRCasFinder,CRT | 2896208-2896239 | 32 | NZ_AP021845 | Azospira sp. I09 plasmid pAZI09, complete sequence | 172917-172948 | 8 | 0.75 |
LS483492_1 | 1.6|2896268|32|LS483492|PILER-CR,CRISPRCasFinder,CRT | 2896268-2896299 | 32 | NZ_CP039912 | Agrobacterium tumefaciens strain CFBP6625 plasmid pAtCFBP6625a, complete sequence | 373822-373853 | 8 | 0.75 |
LS483492_1 | 1.4|2896148|32|LS483492|PILER-CR,CRISPRCasFinder,CRT | 2896148-2896179 | 32 | NC_024794 | Escherichia phage vB_EcoM_PhAPEC2, complete genome | 91327-91358 | 9 | 0.719 |
LS483492_1 | 1.6|2896268|32|LS483492|PILER-CR,CRISPRCasFinder,CRT | 2896268-2896299 | 32 | NC_002682 | Mesorhizobium japonicum MAFF 303099 plasmid pMLb, complete sequence | 167215-167246 | 9 | 0.719 |
LS483492_1 | 1.6|2896268|32|LS483492|PILER-CR,CRISPRCasFinder,CRT | 2896268-2896299 | 32 | CP021184 | Sphingomonas wittichii DC-6 plasmid pDC03, complete sequence | 891-922 | 9 | 0.719 |
LS483492_1 | 1.6|2896268|32|LS483492|PILER-CR,CRISPRCasFinder,CRT | 2896268-2896299 | 32 | NZ_CP029356 | Azospirillum sp. CFH 70021 plasmid unnamed1 | 843252-843283 | 9 | 0.719 |
LS483492_1 | 1.6|2896268|32|LS483492|PILER-CR,CRISPRCasFinder,CRT | 2896268-2896299 | 32 | NZ_CP011523 | Streptomyces sp. CFMR 7 strain CFMR-7 plasmid unnamed, complete sequence | 34923-34954 | 9 | 0.719 |
LS483492_3 | 3.1|4453220|31|LS483492|CRISPRCasFinder | 4453220-4453250 | 31 | NZ_CP022523 | Pseudoalteromonas sp. NC201 plasmid pNC201, complete sequence | 488095-488125 | 10 | 0.677 |
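
The Identity values in the table above are consistent with (spacer length - mismatches) / spacer length, rounded to three decimals. The source does not state this formula, so it is an assumption inferred from the rows shown, and the function name `spacer_identity` is hypothetical. A minimal sketch that recomputes the column from the Spacer_length and Mismatch values:

```python
def spacer_identity(spacer_length: int, mismatches: int) -> float:
    """Return the fraction of spacer positions that match the protospacer.

    Assumption (not stated in the source): identity is computed over the
    spacer length and rounded to three decimal places.
    """
    if spacer_length <= 0:
        raise ValueError("spacer_length must be positive")
    if not 0 <= mismatches <= spacer_length:
        raise ValueError("mismatches must be between 0 and spacer_length")
    return round((spacer_length - mismatches) / spacer_length, 3)


# (Spacer_length, Mismatch, Identity as reported) for each distinct
# combination that appears in the table.
reported_rows = [
    (32, 5, 0.844),
    (32, 6, 0.812),
    (31, 7, 0.774),
    (32, 8, 0.75),
    (32, 9, 0.719),
    (31, 10, 0.677),
]

for length, mismatch, reported in reported_rows:
    computed = spacer_identity(length, mismatch)
    print(f"length={length} mismatch={mismatch} "
          f"computed={computed} reported={reported}")
```

If the assumption holds, each computed value should agree with the reported one to within rounding. Hits whose alignments contain gaps (for example the Vibrio coralliilyticus plasmid hit) still appear to use the spacer length as the denominator.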
1. spacer 1.5|2896208|32|LS483492|PILER-CR,CRISPRCasFinder,CRT matches to MK433577 (Klebsiella phage ST512-KPC3phi13.6, complete genome) position: , mismatch: 5, identity: 0.844
acgtgtggacggacggcatcggtaacacgcac CRISPR spacer gcgtgtggaccgacggcatcggtaacacgtcg Protospacer .********* ******************.
2. spacer 1.5|2896208|32|LS483492|PILER-CR,CRISPRCasFinder,CRT matches to MK433581 (Klebsiella phage ST258-KPC3phi16.1, complete genome) position: , mismatch: 5, identity: 0.844
acgtgtggacggacggcatcggtaacacgcac CRISPR spacer gcgtgtggaccgacggcatcggtaacacgtcg Protospacer .********* ******************.
3. spacer 1.5|2896208|32|LS483492|PILER-CR,CRISPRCasFinder,CRT matches to MK416011 (Klebsiella phage ST437-OXA245phi4.1, complete genome) position: , mismatch: 5, identity: 0.844
acgtgtggacggacggcatcggtaacacgcac CRISPR spacer gcgtgtggaccgacggcatcggtaacacgtcg Protospacer .********* ******************.
4. spacer 1.6|2896268|32|LS483492|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP009619 (Vibrio coralliilyticus strain RE98 plasmid p380, complete sequence) position: , mismatch: 6, identity: 0.812
atcaggagcaagcggttgccgctgc-cggtatg CRISPR spacer gtcacgagcaagtggttgccgctgcgccatat- Protospacer .*** *******.************ * .***
5. spacer 2.1|4444747|31|LS483492|PILER-CR,CRISPRCasFinder matches to NC_049342 (Escherichia phage 500465-1, complete genome) position: , mismatch: 7, identity: 0.774
atcgctggtatgaaccgccggtttcattccc CRISPR spacer atcgctggtatgaaccgccggtgagctttga Protospacer ********************** **.
6. spacer 2.1|4444747|31|LS483492|PILER-CR,CRISPRCasFinder matches to CP025900 (Escherichia phage sp., complete genome) position: , mismatch: 7, identity: 0.774
atcgctggtatgaaccgccggtttcattccc CRISPR spacer atcgctggtatgaaccgccggtgagctttga Protospacer ********************** **.
7. spacer 1.5|2896208|32|LS483492|PILER-CR,CRISPRCasFinder,CRT matches to NZ_AP021845 (Azospira sp. I09 plasmid pAZI09, complete sequence) position: , mismatch: 8, identity: 0.75
acgtgtggacggacggcatcggtaacacgcac CRISPR spacer ccgtcatcacggacgccatcgggaacacgcaa Protospacer *** ******* ****** ********
8. spacer 1.6|2896268|32|LS483492|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP039912 (Agrobacterium tumefaciens strain CFBP6625 plasmid pAtCFBP6625a, complete sequence) position: , mismatch: 8, identity: 0.75
atcaggagcaagcggttgccgctgccggtatg- CRISPR spacer gctacgaggaagcggttgccgctgtcgg-acgt Protospacer ...* *** ***************.*** *.*
9. spacer 1.4|2896148|32|LS483492|PILER-CR,CRISPRCasFinder,CRT matches to NC_024794 (Escherichia phage vB_EcoM_PhAPEC2, complete genome) position: , mismatch: 9, identity: 0.719
cagaagtcaacgatgttccgactgttccgcca CRISPR spacer gtgtgttagacgttgttccgactgttccacca Protospacer * . * .*** ***************.***
10. spacer 1.6|2896268|32|LS483492|PILER-CR,CRISPRCasFinder,CRT matches to NC_002682 (Mesorhizobium japonicum MAFF 303099 plasmid pMLb, complete sequence) position: , mismatch: 9, identity: 0.719
atcaggagcaagcggttgccgctgccggtatg CRISPR spacer agtgggagcaagccgttcccgctgccggcggc Protospacer * ..********* *** **********..
11. spacer 1.6|2896268|32|LS483492|PILER-CR,CRISPRCasFinder,CRT matches to CP021184 (Sphingomonas wittichii DC-6 plasmid pDC03, complete sequence) position: , mismatch: 9, identity: 0.719
atcaggagcaagcggttgccgctgccggtatg CRISPR spacer gcgagtagcaagcggtttccgctgcccttccg Protospacer .. ** *********** ******** * .*
12. spacer 1.6|2896268|32|LS483492|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP029356 (Azospirillum sp. CFH 70021 plasmid unnamed1) position: , mismatch: 9, identity: 0.719
---atcaggagcaagcggttgccgctgccggtatg CRISPR spacer tcggccagg---aagcggtagccgctgccggtggc Protospacer ..**** ******* ************.
13. spacer 1.6|2896268|32|LS483492|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP011523 (Streptomyces sp. CFMR 7 strain CFMR-7 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
atcaggagcaagcggttgccgctgccggtatg CRISPR spacer tcgagcgtcaagcgattgccgccgccggtatc Protospacer . ** . ******.*******.********
14. spacer 3.1|4453220|31|LS483492|CRISPRCasFinder matches to NZ_CP022523 (Pseudoalteromonas sp. NC201 plasmid pNC201, complete sequence) position: , mismatch: 10, identity: 0.677
gatgatttcgtcatgattggtgactccctcg CRISPR spacer tctgattgcgtcatgattggagactgatctt Protospacer ***** ************ **** ...
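The identity values reported for the hits above are consistent with (spacer length - mismatches) / spacer length, rounded to three decimals. This relationship is inferred from the listed numbers and is not a documented formula of the tool that produced them. A minimal Python sketch under that assumption, where the function name `spacer_identity` is hypothetical:

```python
def spacer_identity(spacer_length: int, mismatches: int) -> float:
    """Fraction of spacer positions matching the protospacer, rounded to 3 places.

    Assumption: identity = (length - mismatches) / length, inferred from the
    hit list above rather than taken from the tool's documentation.
    """
    if spacer_length <= 0:
        raise ValueError("spacer_length must be positive")
    if not 0 <= mismatches <= spacer_length:
        raise ValueError("mismatches must lie between 0 and spacer_length")
    return round((spacer_length - mismatches) / spacer_length, 3)


# (spacer length, mismatch count) pairs taken from the hits listed above
hits = [(32, 5), (32, 6), (31, 7), (32, 8), (32, 9), (31, 10)]
for length, mismatches in hits:
    print(length, mismatches, spacer_identity(length, mismatches))
```

The gapped alignments (for example hit 4) also fit this pattern when the denominator is the 32-nt spacer length rather than the number of alignment columns.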
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 | 2997318 : 3038806 |
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >LS483492|2997318:3038806|DBSCAN-SWA GTTACTGGACGTTGCCCGGATAAGTGGCGTCGTCATATTGATAAATCAGGGGGGTGTATTGCAGGGCTTTGATCTGAGAGGTTCCGTCTGATCCTGGCTCTATTGAGTCGAAAATACCGTCGTAGCCGACGCGACTTGATGAGCAGAAGATAAGCCGCGGAGGCTCTATCGCTGGATCATCCATAATCCACTCATCAAGGCGTACGTCATCCGTGTTATCCACGGCCAGCGTATAATCATCAACGCGTGATGCCACCAGCAGCGATGAGGCAGAGCCATCCTGGAAACGTATGATGCAACGGGGGTTATTAAACGCCCAATCAAGCTTCTCACTAACGTGAATATGCACCGTATTTTCGTCATGCTGCTCTTCGACGATCAGACAGCTGATTGTCTGGCTTCCGGGAATGTCATCAGTCAGCACAATCCGATCGCCATAGTTGTAGCACAATGCATCCATTTCCGTGCTCGTCGAATGGTTAAGTCGCTGGTACCGGTATTTCATTAAGCGACGCATGCCGATGCGGTATGCGTTGTCAGGATCACTGACGCCATCCAGCTTATAGCTCTCTACTTTTAACGCTGTAGGGTTATCTGCAAACCGGCATTGGATAGTTTCTTCCGCCCAGGTCGTCCCATTAACGTAAGTGACATCAACCCCGTCGAAATCATCTTGGGAAAGTGCTTTAAAACTGGTCTGCAGCTCCTCTGTTTGTTCCTGTGGGCTAATTACCCCTGACCAGTTTTTTACACCTTCACGACCCACGGATGCCAAACCATCACTGAGAAGAAAATAGCCCATCCCTGCAGTTGCTATCTTCTGCAAAACCTCCAGAGCGCTGGTTGAGTCAGCGCCGGCCGCCCAGTCAAACGTTTCTCCACGGGGTGTCCAGTATGTGCTTTCCAGCGTATTGATTGTCTGGTAATCGATCTGGTCATCGGTATATCCCAAACTTTTCAATACGTGGAATATGGCGCCGCTGATACTCCTCGATGTATGGCCATCATAGAGGCGAGTAGCCACCATATTCACTCGCCGATCAGATTGGGAAGCCAGTCGGTTACCGGTGCGGATCGTTAACGCGATAGTAGTAACATCTCGGTAGCTTGTTGGGCGTTCCGATAACTTCGCTCTCATCGACTGCCATTGCACAGACTCTCGGGTCGTCCCTCCCCAGACTGGCGTATCACGTTTCATGCGCACTTCATAATTACCGGCGGTCGGGAACGTGATTGTTTCGGTATAACCGACTTCATTAACAGTATTGTTGCCGTGCTTTATCTTGACGCTTGTCCATTCCTCAGAGCTGGATAATCGGTACTGAACCGTCATTTCAATGTCATGCCAGTGGATAGCGCCGTCCTTACTTCCTACGTCACAAAGTCCCTGCGGGTAGACGAAGTTCAGTTCAATTTCACTGGTTTTCTCGCTGTCAGGGCAGCATAAGAAAGGCCCCATCCAGTCATACTTATCATTAAGGCCGGTAACCGTGGAATCGAGTAACGTTCGGCTGGTAAATCCTGGCCATCCACTATCAACGGTTTTCGAACCATCTGAATTTTCTATCAGACGTTCAACGCTTAGCGTCAGACCATCAATCTCCGTAATACGGAAATGGTACCCTTTTAAACCAATGGCAATCCGCTGTAGCCCCGTAGGGATACCAACAAATGGATCGCCACCGGTACTTCCCCATGCTAGCGCGATATGCTCGCTGACTGCCGGCGTTCCTCCCGTTGATGCGGTGCCGGCAACAACTGCTGGATCACTGCCGAAGAGCACCTCCGGCAAAACGGTATATCCAATGCTGTTACCGCTGAATGGGCTTTCTTTCTCTCTGATAACAATCTTGGTATCTACGGCAACGACGTCTAGACCAGAACCAGAAAGCTGATCTGCAATTTCATCAGTCAAGCCTGACATGG
TCACATAGTTTGCCGACAACGAAATTACGTAACTCGCCCCTCCCCAAGTCAACGTAAAGGACAATGGCGTAGTGCTGAAGTCATAAGTCGTTGGCGCCGCAGATGCGGTAATACTGGCGGCATTCCCCCCTTCACCCGGTATTGCAGGCTTGCCAGGCTCATGGGCAGAAATGAACAAATCCAGCCTGGTGGCATTCCACGTAACAGAGACGGACTGGCCAACAGAAGGTTTTAGCTCCGCAAAATCACCATAAATAACGTTGCTGCCGCCCTCTATTGCCACGGTATAGGTATCTGGAACCTGAATAGTAAGATCAGTACCAACAACCCAGGAGTCGGGGATCACCGCATCGCCATTATCATCAGTCGTTCCGCCGACTACTGTCAGGCTGTTGCCGGATACGCTGATAGCGTCGGCGGTGATGCTTACCGAATCAGGCCCGGTTGATGCCAGATCAAGGCCGGCAGTTCCTGACGTTGTTCCGCCAACTTCTGTTGATGAATACCAATTTTCAGTACGACGATCAGCGCTAACATCCGCTCCCGGCGGGTATATCGTAAAACTGACGTCATCGCCAAAGCTTCCTACCGGGGTATTGCCGATTTTTATCGATGACTGGTTGATAACGAAATTGCCGACGCCAATACACAGGAACATTTCTGTCCTGTAGATCTGCGGGTTTTCCTTGTCAAAACGGCTGACTGGCTGCACAAGGTAATCAGGAAAGATTCGAGTCCGCCCTAACACTTCTCGTATCGGATCCCCCAACTTTGCCGAGTTCGCCTTTGCCGGCGTCAGATCAAGTGAATCCCCGTTTGCCGTCGAAGACATGCCGTCTTTACTTAACTGCGACATCATGATCAATGAATAAGCTGCCGACGCCACAGCGACTGCTACAGCTGCCCACACAGCGATTTCCAATCCAGTACCATACGGAACTGGATAGATACGGACATCGTCGTCTGGCTTAACGTAGCAAAGCGGCCATTCTGCTGCAGGGATGGCTTTATGATTCACTTCAACAACGATCGGATGACGCATATCCGCCTCATACCCCGACACGTTGCCCACCAGCCATTGGTGCAACGTCATGGCCTTATGCTCATGAGTCTCTAGCGGATCACCAGGAAGTCTGGAAGGGTAAACGCGTATCGTCACTGGTAATACTCCACTTTTACAAAACGGCGCATGAAACGTGTTAACGGCAAAAACGTTACGTTTGAGTTCGGATTACATTCCGCAGCATGCAACACACCATCGATATTGACGACGATCGCCACATGCGTTACCACACTCCCTGAGTAACAGGCGATGCCGGCTCCCTCGCAAGGTTCACATCTATTAAGATCACCCATAAAAGATCGCGCGGCTCGGTCGAGGCCATTACCATCCTTAGTGATACCGGCAAAATCAGGCCAAACAGTGAGCCCCAGGTCACGCCTGATTTCATTAACGATGCCAAAGCAGTCGAGATTAGGAAAAGTACGACCGCCCTTCAGCCAGGTAACTGAAAGGTATTTATCAGGATCAAACATAGGGTAGTCCTTACGAAATGTACCGAAGTCCCGGATAGTCAGGGAGCGTATAGCGGTTCCGCGGCCACGCAGAATCAAGCACGTTCATATAACCTGCGGTGATCTGAACAACTATCGGCGTCCAGTAGCCTGATTTTATCGTTAGGGTAAAAGGGGGTGACTCAGGGGCCGACAAATCGGTCGAAAGATAATGCCGATATGTGAGCGTTCCAATCTCCTGACCATCAAGAGCAGCCCGGATCGTGTTTGAAACCTTCCCGTCGATGTTACAGATGGCAAACTTTAGATCCTGGGTTCCATCCGCATTTCTTGCCGGAAGCGATAACTCAATGCCACACGCCTCGAACGTAACCTGCACACCATTTTCTAATGTTGCCGTAATATCATCCCAGCCGTTGGTTAACCAGTACGTGGAATCCCCTACCTTAACCTCCAGTGTCTTTATCAGCACCTCAGAACCAGAC
GACGCATAGAACCGATTTAATATCGTCATGCTTTTGGCCACTCCCGATTCAATGCCATGTCGATGATACTTTGCCCCGCTAAAAATTCAGGGAAGTTGCCCCATCCCGCATCAGGAATAGGCCGCTCCCAGAGTTCAAGCGTGGCGCTGTACTGCCAGTATTTACCGCCGGTCAATACCGGGCCATCATAAATATCGGTGAACCGGCACAGATACGGTTTAACCCCGATCGGGGTACGGAGCACCATATTGAACCATGCAGAACCATCCGATAATGCATCTCTAAACCATGCTTCAAACGTCATTCCCTGTACGTCATTCATCAACCAGGAAACCGTCGTTTGTGTCGGCGTTGATGTATATCGCCGGCGCTGTCGCGCGCGTCCGGTTACCATATCTGTACGGAGCAACGGACTGATTGGCTTTAACGCATACCCTTCCTGTAAAGCCATGGGTAAATAGTCATGCGGGTAATTGATATTGGATGTTAAAGCCATTAGCTGGCCCTCCGTTTGGTGTTCCATCCCGCCATAAGAGCTTTTGACATATCACCTCGCCCTGTTGCTACATCCCCGGTAACCCGGCCATAAACCCGTTTTTCTCCCCGGGCAACAGCTTCTTCTATCGCGCGAATAGTGCTTTTATCAGGATCACCATTCACTACGATATTATTAACGGGCATTACACCAGATACGGATTGTTTGCCGACTCGATCAAGCGTGGCATCCAGCTTTGCGCTGGTCTTGGCTGTGGTAACACGCTCACCTTTTTGCAACAGCCAGGTACCTGTCTCTGGAACACTATCCAATCCATCGTGAGCCATGCCAATCGCAGATATGTTTGAAACTATACCTGCCGTTGCCGCAGCAACAGAGGCCATAGCAGCCAGGTTAAGAGGGAATGGGTTAGCAGCAGCCATTGCAATACCTTGTTGAATAGCAATAGCCGATTGAGCGATAGCCGCAGCCTTTTGGGCGACAAATGCCGCCTTATAGATAGCCGACTGTTCACCAAAAGCAGATTTAGTGACATCGACTACTGAGCCAAGCCCATCACTGACGCTACTTAGCATCAGCTGGTTACGCTGGTCATTGAGAGATTGTAGGCTATCTGCATGCTGCTTACGCAGTTCGAGTTCTTTTGCGTCCCACTGTTCATTGAGTTCAGACTGTGACTGGCGATACTCATCGAGCATATTGAGTTGCGTTGAGTACCACTCCTCAAGTTCTCCCTGAGCTTTATCGACCTTTCTCAGCTCACCGAGCTGGCCACCGAACATCGGATCAATGCCGCCAAACTCCGGCGCCTTTTCAAAAGCGTTATTGGCAATGGCTTTCGAAGCCTTGGCATAGTCATCATCATTCGTCAGTCCAGCATCTTTGGCATCCTTGAGAAGCTTTACCCTCTCACGCGTTGACTCGAGAAGTTTCTCTTCCGGGGTTAATAGCTCTTCCTGAAGACCCCGGTATTTTTCCAGAAGCTTAAATCTATCAATTTCAGCGGCAAGACCTTCTAGCCTGGCCTGCTGCTGCTTATTAATACCAACAAGCTTACCAGATGAAATATCCAACCGTAATTTTTCAAGTTCTGTCGCTTCTTTGGTTTTTCCATTTAATTGGTCAGTTAATGCAATTTGCCTAATATAGCTTTGCTCAGTAGATTTGAATGCGCTCTCGATAGCCTTAGACGAACCAGATGATTTGGGTATTTTAGGTGCACTTGGTTTACCATTCGATTCTCCACTTCCTAGAGAATACTCTCCATTACCAATGGTAGCCTGCCCAAGAGGCAATTTCATTGCACCTCTGGCGCTATTGTTTACCTTTTCGATTTGTCCCTGTAACTTTGCTATCTCCGTCTGATAATGCTCCAAGCTATCATCATTACCAAGGATATTACCAAGCCATGTCCCGCCGGGTTTAGCGTTATCAGCCTGCATCTGCAAAAATTTTATACGTTCTTGCATCGCAGGGACATTATTATCCGAAAATGTTCCAAAGA
CGCCCTGATGCCTTGCCGACACTTTAACAGCCAAATCACCAGCAGATGCAGCAGCTTTCACTAACCACCCGGCAAGCTCCGCAACTTCACTGACGAGATTAACAATGCCCTGCAAAACCTGAGGGTCAGTCAAAACATCTTGAAGTTTATCAAGAGAGCCCTGCAATGGGGATAAATCGACTTTAGCCAGACCTGAAGCAATCTCTATTTTTAACCCTTTAACTTGAGACTCCATATTCTCAAAAAGGGTATTTACTTTCACCAAATCATCGATTGAATCAGGGTCAGGGGCAACACCATAATCTTTCGATAACTGGATAAATTGCTTTAATTTCTCGTTATTATTATCAAACAGCGGCAACATCTTTGAAAGATCGTTACCCATACTTTCAAGTATGGTTGTTTTCTCTGCGTTGGTATTTATCTTCCCTAATGCTTCACCTATTGCCAACAGCTGCTTATCTGGTGATACCTTAGACAATTTATCAGCAGATAGGCCTAATGCATTCAATGCATCTACAGCTTCTCCTGATTTATTTAATACTGCATCACCAATTTTATCGCCTAAATCTTTAAATATATCCGCCATGTTATCGGCAGATAAACCAGCCTTCTCAGCTGCAAATTGCCATGCAAGCAGTTCCTGAGTTGACATATTCAATGATTTTGCCCAGCGATCAGCTTCAGCCATCTGGTTAGATGTAGACTTCAACAACTGATACCCAGCTATACCAACGGCGGTACCAGCAGCGGCGGCAGCAGCAGCCATCCCAGCTAATGCTTTGCTCGCTCCCTCGACGTTTTTTTGTACTTGCTTGCGCCACTTATCTGACGCTCTTTCAGCCTTATCCATGCCAGAAACAAACCCACCGACCTTGGCGACAAGATCAATAGTCAAAGTACCAAGAGATTTGCTGGCCATACCGACTCCGACAATAAAAAACCCCGCCGGGGCGGGGTTCGTCACAAAGGTGATAAATCAGCTTGCGCTTTCGCAACCAGGCTGACTTCTATCAATGATCAAGTTGCCCTCGACACGGAGGCCAATCTTACCAAACAGGAAAGCATGATTGAGCTGAGTAACCACTACATCAGTCAACCCAACGGCACACTTGTCTTTCTCAATAGCTCGGTCTGCCGCTGTTTTTACGTTAGGGATCCCCAGAGGGAAGATCACCACGGGATAACTGTCTTCGGCAGTAACCCTTTTGCCAGTAACAAATTTCCCACCATTAAGGTTGTAGTTTTTCGTACTTGCTACCGTCATATCAGCAACACGCACAGTACACCCAGAAAGTAGAAGAGCCCCCAAGGCTAACGCCAACACCTTTTTCATTTTAAAGTTTCCTTTGATTGCAATCGGAAACATCCTAACGCCTTGCGTGGAAATGTTCAAATTAGGTTATCCCCATATTAAATATGAGTAAATTACCTCCACTCTTCCATGGCATCATCAAGCGTTAGCGGCTTCTCATTAATGTGAGGAGCAAAATCAGTGACTTTGAACGCCGGCGTGTCCTTACCCTTGTTCACATTGGCAATCGTGCTTGAAACCAGTGCAGCAGCCCACTCAGTACGCATGATCGGGTTAAGACCACCATACTTATCGCGATACTTCACCCATAGCCAGAACTCGTTAATACTCATCCTCTCTTGGGCTTCAGCGATAGTTCTCCCCCCGATCCCGTTAAGCACTAATTCGCACCAGATTTCTTCTTCTCCGGTGAGCTCGCTGTCTTTCCCATTTCAGAGACTTCCTGAATGGCCACCAGCAATGCAACAGTGAGTGAACCATCAAGCGCGCCACGTTCTGGATCAGCTTCGCCGGTAATATCTTCAGGGGTGAAAACTGGTTTACCGCTTTCATCACAGATTGATGCAGCAATGCGGCCAGCGATGGCATCGTTCTTACCACCATGGGAGAGGATGTCAGATTTTGCCGAATGGTACCCTGCTGGCCGAACATAAACCGTTGCAGTAAACTCCTCGTCACCCTGTTT
CCACGATATTTCTTTTTCGATAGGGCGACCAGTAAAGGCTCCCTTTTCTTTCAAAGCATCCAACGTAAGTATCATTTCATCACCGCTCAATATGAATTACCGGGGCGAAACACCCCGGTTTGATTAACTGGTTTGTTCAGCCTTGGGGATCCACACCGCGGGCCAGAACGTTGGATAGTGGCAGTGGTAGTGACCACCGCGTTGGCTTGAAAATCAAACGGGAAGTCGCTGACATATCCCTGAAAAACAAACCACGTCCGGTCAGACGGAAGCAGTAGACCATCCACCGCGTCAGGATCGCCAGGTTCAGCAATAGATGGTGAAGACTCACCATCAGACCATCCCAGAGCAAAGGTTAACTTGGATTCATCATCAGACTCAGCCAACTGATGCAACATTAAGTGGCTGGCATTTTTAGGATCAGCATTCAGCGTTACTGATGCTTGGCCAGGAGTCTGCAAACCTTTTAAGTAAGTTCGGCTTTTACGCTCACTTAAACAGGTATCTTCAATCTGATCCGCGGGGTTTCCACCCGGCGTGAAAGATGTAATACATTCAACCTCGCTGACAACGCCATTCGCGAGAACGTAGAACTGCGTGCCTTGTGTCAGAACAGACATAAGTTTTCTCCGGTCATAAAAAAACCGGCGCAAGGCCGGCATGTTAAAAATAAGGAGGATCAGCGTCTGACTATCCAATCAACATCGAATGAATAGCGATACCGCTTTGTGGTTGCATCACGCGTCTGGCCACCCCAGCGGGTAATGTATGCATGAGGTTCAATCGCATCACGTAATGCCCGGGCAACGGCTACACACCCCGCGGCAGTATCCGCGTAAATATCCACCTGCAGGGAGTAGCGATCAGCGTCTGGACGCTGGTTAAGATAGTTCTCTGGATCACCGGAAATGTTTTGCCACACGGCATAGGGATAGATGACTGCATCGTCATGCAACCCGAACGGATATAAACGGACGGGTTCACTACCCAGCAATGCCCTTACTTCATCACTACCGGAACATATAGCGAAGATCGGTGCCATCATGGTTTTACACCTCGCTTCTTCGCCCGAGCAATGCCTCGATCAATGGATTTCTCATACTCGATAGCAAACGTATTTATCGCCGGTACGGCTCCCTCTTCGGCCGCCGGTCGCATAATAGGTTGCGCCCGCATATTCTCCGTACCAAACTCAAGGAGCCGCCAGTGAGGGGTCGGTGCATTCCTGGCTTTGTCCGGGTGATTTTTCAAGACGGCACCGTGGAGAACACCGATTCGAAAAGCCAAGTTACCGGTTCGCTTGAATTCCCGACCGTTCCAGCGCAATGCTATGTTGTCAGCGATACGCCGGCCGGTTTCAGGATCATCAACACGCGTCGCATTATCTTTCGCCTTATTGACGATCACATTTGCAGCCTTGCGCAAAGCAGCACGGCCGCCTTTTCGTTTCAAGTCATAGCTAACAGAATCAAGCTTTCCAAGAAATTCGTCGATACCAGTAAGGCGATAATTAATGCGGTCAACCATCATTCACTCCATCCGAACATGGTAACGTTAAGTATTCCAGCCCACTACCTGGATCTGGGAGAACGCCCTCAACGTTATAAATTTTCCCGCGGTGGAGGATCCGACACTGCGGGAAAATATCATCACGGAACCGGATCTTTATCCTTGTTGTTACGGCATTCTGGAATGCCTGGGCGGCAACAAAATCTCTCGCCGATGCTGCAACGACCTCCCCATATATACCGCCGTCGGAAGTGGAATTGACGAGGTTAACCCACTGCGTAACAACTTCGCCGGTAACAGGATCCTGGGATGAAACCCTCTTTTGAGGAATAATGCGGTGCCTCAGTTTTCCTGCATGCATAATTCTTACCCTATCGGTTTCCCATTGAGATAAGTTGGACGCAGATCGCTAACAAGTTCTGACGGCTCCCCTTCTTCCATCAACGTCTGCAAAATTAGATTGCAGAGTGCATTATTAGATTCAGCCAGGA
GGTTTAACGCTGCTGTCTGCTCCTTTTGCGCCTGCGTTTGTTCCCGCAGCGCTACTATCAATTCGCTTACCTGTTGCTCGTTCATAGGCAATCCTCGTCCACTTTTTTAACCATTCACGGCGCCGGCGACATCCTTCACAAGCCATAAGCCCTCACATAATTGTTGGGCAGTGCAAATCGTAAATAAGCATCGAGACAGAGAACGGCAGTTCTCCCTGTTTCAGCTTATCTTCTTCTTCACCACCACGGTTACGGTCAAGATACCCGAGCAATACCAACAGCGCCGTCTTAACCCGGTACAAAGGCTCACCATCAATCAGCTTGCCATCATCCGAAACAACCAATTTTCTGTTGTCTTTCAGATATGCGAGCAGCGCAGCACTACCGGCCTGAATTTTCAACGTCAGATCGGCATCGGAATAATCCTCATCAATCCGCAGATGCATTTTCGCTTCTTCAAGGGCTACCAGTTCAATCATGGCTTATCTCCCGCCAGCGGACGCATTACGGCCTCGCTTCACTGATAATGTCCACCCTTTGGAGCCAGTTTCACCGGGTTTATCAGATGTAGTTTCATCGCAATGCCACAATGAACCGCCCCACGTCACAGTATCTCCTGGGCGATAATCAGCACCGGACTTAAACACCCCGCGGTAGATCATAACGGGAACATCGAAAGACTTACTGGTACTGGCGCCTGAGGATCTTTCAACTGTTACAGTAAAGTGGCGATGATCATCCTGCTCAATATCAACTGTAGTAATGCCATCCACCAGACATTCCCACCCGCGCATGCCTGCGGTCTTTTCAAATGACCGCCAAAGCCCGCCGCTATGAGTGGCATAGGTTCCGCGCGGATAGCTTTTTTCCGCGTCAATGTCTGGCAATATTTCAATCTGCATGGCATCGCGACCATCTTCACCATCGCGCGCCGGCGGAGGAACCGCCGCGATAACAGCTTCATCCACCATCGACTTTATGTCCGGCAACTCCGGTGCCGCCGGTACCGGAATTTCTGCAACCGCAGTGCTCACCATCTCTTCCAGCATCGGCCGTACATCGTCAACGGTGACGCTCTGTCCATCCTGTGCCGGCGGAGGCATCGCCGCGGTAACAGCCTCATCCACCATTGATTTAATGTCCGGCAGCTCCGGCGCCACCGGTACCGGAATGTCAGCAACCGCAGTGCTCACCAACTCTTCCAGCATCGGCCGTACATCGTCAACGGTGACACTCTGTCCATCCTGTGCCGGCGGAGGCATCGCCGCGGTAACAGCCTCATCCACCATTGATTTAATGTCCGGAAGCTCCGGTGCCGCGGGAATTTCCAGCCGGCCAACGACCTCACCGATCCGCGTCTCAATATCGGGGGCCGCCTCTTTCAGTTCTGCGATAGATAGGGTGAGTTGGGTAATTTTCTCTTCAAACGCCAGCTGTTGAGCGGATAATTCACTCGTAAACTCCTGACGCATAGACTTAATTACCGCCCCAAATTCCTCGCTGATGGCCTTTATCAGAGATAATTCACGTTCATTCATATTTCATAATTCCCCTGATCATCGCTTTTGCCGCCACCATCTCAGAAGGAGACAAGGATTTACCGTCATCAGCCGGCGGCTCGCTGCTAGATTGCTTTGAGGGACTATCGAATGGATTTTCTGAAGCATCACGCCGCGCCAGCGCTTCCAGACTGAAGTTTTGCTGCTGTAGGTAAAGAGAATCACCGCCTGGAACAGGGGGTAAATTCTCTTTTTTGCGCGCTTCATTGGGCGACATAATGGTATTTTTCACCGCCTCGCCTAATGTCTTGATGCGACGCTCAGTGTCCATACGGAGCAGCGCGTTCACATCAAACTCAGTTTCTTCATCCCCAGATAGCTCAAATGCCTCGTCCAACAGAACCTCGATAGATTCAATAAGTATCTGCAGACATTGTGAGTAATATTGCTGCTCCAGTGCCTCAATATTGTTGTATGACGGAGGTTCCCCCACCCCGACCT
TGTACGCCGGAACATGGAACGTTGAACAGATAATTTCTGCTGCCATTTTCAACTGCTCAACCATCTGAGCGTCAGCTGCCGAAATAGCGATGGGATTATATTTGGCGCCATTGCTCAATATTCCCGTTTTACCGGCATTCTCCCCGGTATAACCAGTATCCCAATTTTGTTTTATTTTCTTCGCATTATCGTCAGTCAACGTTCCTGGCACTTCAATCACACCGCTCGGCTTACCACCATTTTTGAAAAAGAATGTCGAATTCTCCAGAATATGGTGCCCCTGCATCGCCGCCAAACCAGCAGCGTAGATCGGCGACAAACCAACGAGGGGGATGAAATAAGCAATTAAAGCGGTCGTGAATAATTTCCCGGGCCGGCACAGTGACCTGTTCGGATAGGCCACTAATATTTTCAGGGCTGATCTGATAAAAAACCGAACCGTCGTCCGCAACCAACGGGGTAACCTTGTGCGGATCTAAAATCCTGAGCTCTTCAATGGAGCCAGCGGAGTTACGGATCTTCATGACGTAGGTATTGCCATGGCACAATTTTGAGTTCACCCAGCTTTCGAAAAACTGGATACTGTTCTGGAATGCATTCGGCTTTTTAAGCAGGGCTGACACACGGCCTGTTCGTTGTTCTTGCCACACCCCTTTAGAATCCCGGCATTGGTGGCCTGCCGGCATTTTTGCAATATCACTGGCTATCAATGAAATACAGGAGAAGACCGCATAGAACGAAGTGACGGTTTTGTCATTTATTTCAAGATTCTTCTGCCAAGCACCAGAAAACGGCTCATGAATAAACGAAAAAATCCTCTGCCAATTTCCCTGCCCGGATGGCTGCTGCAGCGATTTTTCTTTGCGACGAAAAGGATTCCACATCAGCCATTCCCCACACGCTTATTTTTCTTCGGACGACCACCAGCACGTTTCGTACTGATGTACTCAGCTTTGCCCAGCAGCACCAGCACTCTGGCGCACTGATCATCCACGCTCTTTTCATCCCCAGGAATTGAATCGTGTGTGCGCTGTATGTACCTGATTTTTGCCATGCAAAACGGCGGGGTTACCCCCGCCCTCCTTTAGCTGATTAACTGCTTTGGCTGGATCCGTAATTGACGCCACTGATAACCGCCACGGCAGCACTGCGGCGACGTTTCCAGTTGATCCAGCGTTCTGCCCGGATGGCCACGCTATTTGTCTGCCACATGGAAACCAGCTCGACACCAGTAGGAGTCACGCTATCACTTGACGGCGATGACTCCATTTCCAATGACGCTTCGCTGGACATATCCACAGCAACACCGCCTTCGTCAGCCAGGTAAACATCTGGTGCATTCATCAGAATCAACTGATTACCGACATACTGCGAGACGACCGCAGGAAGCCCCTCAAACACACCACCAAACATGGTCATTTCTGGGTATTCCTTCTGTCCAAGCGCATTTTTACGTTTGGACAGGGCCAATGCCGTAGAACTCGACATTAACCACACTGCGCCTGTTGGCTGCAGGTTCGCGTCAATGAACACCTGGAAAGCCGCGCTGCTGTCAGTATCAGGATCACCAGAACTTGGAATAGCAATGGCGCCATTTGTGATTGACGCCGGCGATACCCCAACAACTTCGTCTTTTGCCGGATCAACAAAGTCAGTATCCAAACGAGCAATGACTGATTCCGCCAGAGAGTTACGCACCAAAACGTCCGCTTTCGGGTTGGAGAACCGCAGTAACTCTTCAGTAATGACAGAGATCGCCGCCACCTTAGAGAAACCGAATGTAATATTCGAGAAATCGAACTTGGTTAATGGCTTTGCTTTACCTTGGCCAACCCATTGAGCAGCCCCACCTGAGGTCTGAACCGGGATACGAACGTTGAACGGGACATCACGGAGTGACGGGATATTACCGGTGCCAAATTTCCCGATAATCGTTTGAGGACGCAGGAATTCAACAAAATCGTTGGCAATGTCCTGGTACTCCACCAGCGCACCGGCCC
AGGTCGGATCGGTAGTGGTGCCCGCATTTACAGCAGCCTTCAAAACATGGTGAAGCTTGGTGTCGTCTGGATACTTGCTTTTTGCAATGCTGAAAGCTTCAGAGCGAACGCCTTTTGCTGCCGCCAGTGACTTGGTGAAACGCGCAAACGCAATGCCTTTCTCCAGCTTTTGTTCAACCCGGATAATACCCGGCGCATTAGTGATCACCGTATTAACGGTACCACTGGCAGCAGCTTTAGTATTCACCGGTTTAGCTGTTGATGCCTGAGACGCCTCCATATCGTTCAGGCGCTTCAGATGGGCATCAACAGATTTAACTTCCGCTGACGTATTGTCGTACTGCTCTTCCTCTTCTGCGTCCAAAGTACGGCCCTCTTCTGCAGCCTTGGACATGATATCTTCCAATGATGCCGCCAGCGCCGCACGCTTATTTTCAAAGCTCTTAATTTGCTCTGCAATATTCATAGTTTTTCCTTTAATTTCAGTTGATTTTTTTGCTGTAGCGCCAGCGGTGGTATTAGCCTTGACTACCGGTTTCTCTTTGCCGGACGCGGCGAGTAACTGGCGGTCATAGGACTTAACGGTTTGGATAGAACATTCGGCATTTGCCGGAATGGTGACTGCTGATACCTCCAGCAGATCCCAGGAAAGAAAACGGATACCGCCGTCATCAATGAAGGCGTATTCAATAGGCCTGAATCCGATAGAAAGGCCTTTAACCAGACCTGACTTGATCGACGCCCAAGCCTCATCGAGACGGGCAATAAGCAGCGAAGGCATATCCGGTGTCGGTTTCACCAGCTTCGCGGTGATTTCCAACCCCTCTTTCACCATTCGTGGTGTGCAGCTGCCGATCGGCTGTGACCGGTCATGCTGCCAAAGAAATGGGGTATCGCTGCGGAATTTGGCGCCGTCTGGCTCCATAATGTCACCGTCTCGATCAGGTGAAGGCGTAGACGCGATGCCGGTAATAACCCGTTCATCCTCGTTGATCGCTTTCACTTTCATGAGGGTGCATGCGCGATTAAGCGTCATTTACTAACCTCCAGAAACGAGAAAACCCGCCGGAGCGGGTCTGTTGCTGACGTTGTTGTCATATGAAGAATACTTGGTAGTCTCTTTTTTTCGCTTCTGGATTGAGAGCCATCAGCGAGACTGCATTGAACATGGCCATCAGCGGGTCTATCTTGCCCTTCCCGCTGGCCTGTTTTGTTATGAGGATGGCGTTGCCTTTTGGCTCCACCCGTGCATTACTCACACACCATGTCATTAATGGCTGGCCACCATGTATAAGGACGCCTTCGGCAAGCTTCCTCTCGGTGGTTTTGATAGCGCCCCCCAACCGCCAACCCTGGCTGATGCCAACTACATCGTCTGCAGGGATTTCCTCAACGGTCAACGCGTCAAGTATTGAGCCAACGCCAGATGGGTCAACACCGACTTTGTCGAGCAAACCAGCCTCGTGAATACGAGCGACGATAGATGCCAGTTCGTCGGTATCCTGGCCAATAAACTCAACTATCGTGAGGTCACCGGCTTTTTTCAGGTCATGGAGCCTTGAGGCTTCGCTTTTACGTCGCTCGAAAACTATCTTATGCGCCCATGCATGGCACCAGCACAGCCATTCTCGCGTTTCTTTATCACGGCCAATGACTGCAAAGCCCAGCAAATCATCAAGGCCACCTCCATCGATCCCCACAGTTACAACCTCAGAACGCGTGATAATGTCATCCAACGTAACAAGCCGGGCCTGTTGCTCCCAGACATCAACACCAGCCCATCGGTCGGTGCGCAGGTTCAATCCAATTTCGATATTGAGGTGTTTAGCCAGGAATTGTTGTAACGTTCCGTCTGTCTTTGTCTGATTTTTCCGCAGGTTATCTTCAATCCACTCAGCACTGACAGAACGACCGATATTCGGGTTCGTGATATAGAAGTTTTCCGGCAGCAGATAGTCTTTGTTTTCGACCATTCGCTCCGGGAATTCGTACAAAATACC
GAGCGTTTTTGGATCCTTAATTTTCCCATCGCGCACATCACGCCAATAGTCAAGTCGCTCTTTGAAAACTCCCGCCGGCGGCTCGTCACTCTGGGTTGTCAGGAAAATTACCCATCCCTCATTACGAGAAACCTGCCCGCCCAAAGCCTCCATGAACATCGCCTCTGCGTTCTGTCGCTTGCCAAATACCCAGAGTTCATCAACCAAGATCCTCCCGGACTTCTTGCCAGAAACGGTGTCAGTATCAGCCGCCACAACCTTCAGCGTGTTACGTGAAACGCGGTGCGTAATCGTCCTGATATGATCCTGAATCTGAAACATGTCAGAGAGCTCTTCATCAGCACGAATCATCCCCGCGGCAGGTTTAAAGCTGTTATCAGCAACCTCTTTTGTCGGCGCCAGGATCAAATGCTCCTCATCCTCACGCCAGCACAAGATCAGCGCGGTGAGCATAATCCCTGCTGCGATGGTCGATTTTGTATTTTTCTTGGAAATAAGCAGCCCATACTCACGGATTAGCTGCCGGCCGGTTTCAGCCTCATATCCACCAAATATTGCTTTCACAAAGTCGAAAACCCATTCTTCAGAGCACTCACCAAACGTTGGTTTACCGGGGAAATCCGAAACTCGGAGCTCTTTGAAAATACTCAGTGCATGCTCGGCCTGGTCAGCAAAAATCGGCGGAGGAATTATCGACGTTCTGTCCACCAGCCTCTTTTCCCAATCCAAGCAGGCTGTTGTCCATTCCGCCATAATTACCCCTTGTTATTAACAACCAGCTTTGGCGGTGCCATCGCGCCAAATTTACTGGCTGTTGCCGCCACTTTTGCCGCAGCATTGCGGGCATCTTTCTTGCCGCCCTCGCCTTTTTTCGGATGGATATAAGGCAACATCGCCTTTGCGGCATCCTTGCGGGTGTCGATGTCTTTTGCCGGGTCGTTCATTACTGACTTGAGAAAATCGAGCGGATCGTCGTAGGTGACGTTTTCTGTGGGTCGCTTTTTAGCAGCGGCACTATCAGAAACCGGCGACGCGTGAGCGCTTTTTCCTCGCTGCTCTTCCATAAACGCGATGACGCCCGGGTCTTTAGCCAGCTGCGAGCCCTTCGACCGAGCTGATTTCTCAGAATACCCAGCCTCAATGGCCGCGGCGGTCTGAGTGGAGCCGGACATCAGCGCCAAGGCAAACTTGCGCTTCTGTCCTGTTAACATGTTTACACCTCTGAAGGGGGATTTTTTCTGCGCGTGAGAGGGGGCGCGGTGTCCGTGGGGATCGAGGTTTACTTTTTCCCCCTCCCCCCCGGCCTATTTGATAATGATTATCATTTCAAATGAAATGATTTCACCACGCAACCATCACCAACCTAAACCGACAGGTTCACTCAAAACGCAATGAAGCGGCTCGACTCGTCACCCGATGGAACCTGATGCTTCAATGCCTCTTCATCAGGCTGTCCTGCTGACGCCTCACGCGCTGATTTTCCGGCATGGCATTCGGTGCATAATGTCCAGAGGTTCGATGCATCATTATTGCCCCCAAATTGTAAAGCGATGCGGTGATCCAGTTCGCTATCGTGCAGGTCAACAACCCGGTCGCACATGCAACAGTGACCACCGTCGCGGACGTAGATCTGACGCTTCATGTTATATCGGGCGCTACCGCTTACCCGGCGCTTCTCACCGTATACAGGCTTTATGCGCCGGGTATCCATCGCTTTCAATCGTGGCTGTAACGTTGTCAATTTAGCCATATAACCTCCAGGCGCGACGGCGCTCATTGCGGGGCAAGCCATCCTTAGCACGCTCAACGGGAATGCCGTCTGCATGATCAACAAGCGACTGGCACGGGTAGATAACCTGTCCGCCATACGCATCGCCGACAGCATAATCCGCAGCTTTTTTGCTATCCCATTTTGCAAGTACTTTGGGTATATGCTGGCGCGGCATGCTGTAACAGACACCGTGGATCAGGCGATGCATTTCGATGTAGTCTTGCCTTGC
TCTGTCGGCAGCAATCAACTTAACGGCGATCTCCATCTGATACTGTGGCGGCCGGCCGGTGCCAAGATAGAAACTGATCAGGAGCTCGGGAAACATAGCCAGCCATTGACCAACCATGCTGCGGAAGCCAGCAACCGGAACAGCGTCATCCTCAAGAACAACGACACGGTGACTCTGCTTGGCTGCATATTCCATCGCGCGCCGGTGATTCCAGTTGGCGCCGTGGTGTTCCTCATCAATCAGCAGATGAGCCCCCAAAGAGCCAGCTAATGCTGCGGCCTGCTCACGTCGCTCATGATGGCCAACGACCACAAATGCTATTTGTGTTTCCACCAGGCGTACTCCTTCCCCAATCCGTTCATTTTAAAGACGGTATGGACGCGAGGCCCGGTGACGACGCGGTCGCCAAAAGACTTGGCAACCATGCCAAAGGCCATCATATCTCCGACCGCGCTAGCACCCTGCTCCATCTTCCAGAAGCGATAACTTTCGATGCGGTAATACAGGCGAATAATACCGTGAGCAAATGCCATAACGTCTTCGCGGCAGCCACCCAGCAGACCAGCATTAAACATCACGCTGCCGCGGTTATTTTCGATAAAGTCCTGGTAGAGACGCTCGGGATGCTTTGCCTTTGCCCACGTATCACCATAGGTTTTTGGTTCGGAGCCGACATATACATAGCCAGGCTCCATGTCATCCCATGGTTCGCGTAGCATTTCAACGTCAGTTCCATCCGTGCACCACACAAACCGGTACTCCGGGTGATCGCGCAGGTATTGATAGATGTGGAGCCAGCGCCGGTAATATACGTTCATGGCTACATCAGGCACAGAAACCAGCATGGCGCCGGCAGGCGGCGTTGTCAGCTGATCGGCCAGCACCACCGCTGCAGCGCCGGAAATTGACTTAGCCCAAGTCGATAGCACCAGCGGGTCTGGTGTCATACGCGTTCCGCGCTGTGAATCAGGCTCACTGGTAAGTAACGACGTGATCACAACATCTCGCTTTTGCCGGTATTCTGCATAACCGGTATAGCCAGCATCACGCCGACCGTTGTGGATCTTTACGTTACGCTCCACCAGCGACTGTCTGTCAGTTTTGGGAACTGAACGAACAACGGCATCATGCTCGTCGAGCGAATGGATCAGCTTTTCAGACCCCGTAACATCGGCATACGCCCACGTCGTCATCCCGGCATTATGGATGCGTAAAGCAAGATCAGAATGCTCATACATACCGCGGCCATAAATTGGGTCAAATCCACCAACCGCTTCAATAGCGCTCCGGTGGTAATACAGCATTACACCTCGCTGACCGGTATAAGACACATGCTGATCATCACGATAAAGCACGGCAATATCTTTCAGCTTGCGAGGGCCGGATAGGTCGAGGAACTGATACGCCAGGTGGGGTTCAGGAGACTCGATATATGGAACATGCCAGCCATCAGAAATAGGCCATGCGTCATCGTCAAACAAAAATATGTGCTCACAGCCGGCGTTCATTAACGCTTCGAGGCTCGCGTTCTTTGATGCAACAATCCCCAGCGACTTTTCATGCCTGATAATTTTAATATCCGCTGGAGCCACTGCTGCTGGCATCGAGCCATCATCAACAATAACCACCAGCGAACCGGCAGGAAGATGCTTAATATGCTGGTCAATGGCCTGCTTCAGAACATCAGGCCGCTGATGGGTCGTGATAGCAATGCCAATCCGTGATGATCCCTGCTGCTGTGCAGGCTCATATTGAATACCGTCGATAATCACCTGCATAGCGTTCCCTTAATGATCTGCAGGGTTGGTGTACGTGGCCACCGTCCGGTTGTTGCCGTTTACCACATAAGCGGTATCACCGGGTAGCAATGCTATCGTCAGCGATTCACCGGTATGTTCATCGGCATAAATGGTCTTCTCGGTAGCTTTCCATTCGAGGCTTTTAGCCGCGTGAATGATTTCGTTCTGGTTAGCTGTGACAATCTTTACTGTATACATGGC
GTGTTCCTATTGGTGGTGGTAGTGGTTTACAAATTAAAAAAGCCACCAACCGGCCAGTGCGGTGGGTGCCGGTGATGACTTTGTCTATCGCTCATTACACAGCACCCGGTTAGGTGCTCTGTGATGCACAGTAAAAAAAGGCCGCCTTAGCGACCTATTCTTCTATTATCCCTCTAGCCTAAAAGCAGATTTTTTCTGTTAGTATAAAGTAACAATTAATGTTGCGACGGGAACGTGATGGTCACTTTATACACAGTTGATAGACTGGGCTTTTACCAAAACAATAAACACATATCATTAATCAAACAGTCTTCTAACCGACCAGAGTTAGATGAATATTTAAACAGGCTCTTTCCAGATGGGCTATCACACCATGGAATGGTGTATTTCTGGAGTAGCCACTCGAAACCAATAGATATTGAGCCTACATTAGAACTTCATATCGAGATGCACCGAAGAGCATTTCACCCACATAAACCGTCTAGGTTTACAAGTCTCTTCTGCTGCCAATCCATTGACGAAGCAATGGCATTCCGTGATAGAAAAGGAAGTGCAAGCTTTCCTATTTATGAAGTTATTTGCGATGAAAAAATAATTCATTCGGCAGACATGAACATAATTAACCCTGCCGCCACTACGTTGGTATTCTCACAACATATGGATATGTACTGGTCAGGCCAAAGCCTGAAGGAATTTTGCCCAGAGCATCCAGAATTTAATGAAGTTTTAGTTCCGCTTCCAGCTACCATTGGCAATCGTGTCGCCTGAGTTTATAGCTTAATGGCATTACAGAGGCACCCAGCAGAACCACCTGTAATACCAATGAATTCAGACACCTAGAGCAACAACGTAAACGTCTCTTTACATTGCATTTCAAATACCCTTTTATACTTTGTGGGAATATTGAGCACGAGGGATTGAAACTAACGTCTCATGAAAATATAGTTGCGCCCAGGTAACTACTTATCTGTCGTATGTTACCTTTCCCTGGTGTTGTGTATTGCTTTGCCCCTCTCCTTGAGGGGCTTTTTTTTTGCGCATTACGCAGCGTCCGTGTGGGCGCTCTGTGATGGGCAATAAAAAACCGCCCGGAGGCGGTTAGTAGGATGATTTCAGCAGCGGACTTGATGTTCTGATTTCTAAAATTTTCGCAACTGCTTCATCACGCGTTATTCTATGGTCACAGTATTGGGTGTGGACTTTGGATAACTGGCTGAAAAGTAGAGCCGTTCTTCCCCTTACTCGAGTCAAATGAATCATTAAATGTGACTCATACACCGATGCCAAAGCTTTATAGGTTTGCATAAACAACTCATAATCTGGGCCAACGAAACCTTCTCCTTCTTGGCTCTCCGGGGCTCTGATCATCGTATTATGAAAGTTGTAGACGGCTAAAACGAAATTTCTTACCTCTTGTGCTCGCGCTTGACGTTTCCAGCTATTAATAGTCATAAGAGCAATTAACGCAGCAATGAGGGTTGCACCTGCTGAACAGAATGCCGCAACCATGCTCCAATACGCCCAGTTTGCCGAATCGCGCGCAATCAGCATCGACTCATAAGAAATTAAATCCGTGTCCATTTTCACCTCATGATTTTAACGAGGCGATTTTAGTTACTGCTTCTCCAGTTGTCACGCCAGGCGTTGAGCGTATCCACCTGCCCGGCGCAGATTGATAACGCTGTTTGAAGCGCCAGGGCGTGGGCGCCGATATCTCCCCATGTGCTGCCAGCAAACTTCGGCTGTTCGCAGGGTTTGAACACAGATTCAGGGGGCAGTAGAACGAGCGGTGCCACTGGCTCCGGCATCCGTTCCGCGCAAGATGTCAATGACAGGGCCAGGAGCAGCGCGGCGGGCGCAGTCGTCATTTTTAATGGCATCCTGATATTTCCTCTGGTAGTTTTCGCCCTGCTGGCGCAGCTGCTGCTCTCTCTGTTGCTGCTTTGCCATCAATGCACGATTACGGGCGTCATCCGCGCGCATTGTCGCGAGCAGTTCGG
CCTGCTGCGCCAGCGTCTTTTTCTGCTGTTCAGCCTGCTGTTCCGCCATCCGCAACTCTTTGCGCAGCCCCCAGTTGCTGGCCGCGAAATAAATCAGTGACGTAATAGCGAAAGCCACCAGCGCGGCCTTGATGTGGAATAACAAGTTCATCGCTGCCCCCAGGAGCACACTGCGTGCTCAATCTCACGGCGATTCATCAGCCCGCGCCATGACTTACCATTGGCATAAATCCATTGCCGCAAGCCATCACAGGCGCCGGAGTAATCCCCGGCATTCAGTTTTCGTAGCAGTGAGGAATTCTCAAATGCGTGGGTGCCGATGTTGTAACTGAAGCTGATGAGCGCCGCCCGCTGATATTGGCTGACAGGCACCTTAACTGAACGCTCAACAGAGCGCGCGAAGGGCACCAGGTCTTTATCCAGCATTGCCTTGCATTCTGCCTCTGTGTACTTTTTGCCGAGAACGATGTCGGCTCCGGTATGGCCGTAGCAAACCGTCAGAACGCCAGCGACGTCATAATATGGCTCGTACCGTATCCCCTCTAAATCTGGAATCAATACCGCGGCAATAGCCACTGCGCCGCCACTAGCCGCAGCCGCCAGCTTCTTTTTCAATGCTGAGTTCATGATTAGTCCTTCTGCGGCGGCGCCGTCACATACCCACGCTTTAACGCAGCTTCATATGCCTTTGTCTGCCGATATTTGAAATAGACGTTCACGGCAAACGTTGCCAGGCCGATAACAAAGCCGCCGACGACGGCGATCAGGTTCCAGTCGAGGTCATGCAACCAACGCGCAATACCCCCCCAGCAAATGAGGCCGCCCGATGTGCAATACCCGACCGCTGTGGCGATTTTGTCAGGCATGATTATTTTCATCCTTTCCCCCTTACCGGGGCTTGCCCCGGTCTCCGGGTGATAGAAACGAAAAAGGCCGCGCGTTAGCGCAGCCCTGAAATAGAAAAGCCCCGGCAGATGCCAGGGCTAAGAAAAAGGGTTGTGGTGGCCGGTGCTGATCTTCGGCTTGTCTCAAAGGACTGCAATTCACCACAACGGGAAAAGCACTTTCAAACCGACTCTTTCAATGCGTTTACCGTTATGTTGAAGCAGAAGTACATAGTTATATCGAGAGGCACACTCATTCAATCCTGGGGGGATCTGATAACGAATATCAAAATGAATGTGCCTTTCTGATAAAACCCAAGTCACAAAAACGCCCGCATCAACGCACTTTTATCGTCCGAAAAAAGAAACTCATCTTATTGAAAAACCCGTATATTTTACTAAAACAGAATATTCTTTCTTTTTAGTGTGGTTGACTTGTTCATCGTAACCCATTCCTATAGTGTCGTCAATTGACGACATTAACAGCCTCGAACGTCAAACAGAACGGATTCTCATGCAATCATTTCCCCCCGCTTAATGCGGGGATTTTTTTGCATATTGTGCAATCAAACGCTGCGATTTTTCTTAAACCCATTCCTTCTCCCACTGGTCACCTTGCCCTTCACTCGGAATAATCCTAAGTTAAAAGTGTCTTCAGTGAAGACGACCCCGTTGCTACTCATGTACAAAACACCACTTTTTGGCAGGAGTGCTGATTAATCACAGAGAAAAACATTAGCCAGATAATTAATTACCCACTAATTATAAGACGCTTTGGGAGAGATAATGAATCTGGATGACCTTACTGATGAACAGAGACAGCAGCTTGATAGCATTAAGAGCATTGTTGGCCATTGTGTTGTTATGTTGCTTGCTCAAAGAAAGACTATTAACACAGCAAACCTATTATCACAGATAGCTGGGGAAATGGAAAAAATTCCAGAACGTTCAGAATTTAACAAATATAGAGCTACGTTAGAGTTTGTAGGAACCGTTTCGCAATAACACCGGGGAGCCCATCATGAAAAATGAACTGGTAATAAGTGTCGCAATAGGTGCAGCTCTGTTGTTTGCTCTTCTTGTTGGCTCACCACTCTCAATCATTACCACACTGCC
GTAACGGACTGTATGGATGACCGATGCTGCAACATCAGAACGCGCTACTTGTTAGTTGCTCTGTCTAGCATTCGTGTCATCCACCTTTCCCTCCCTCATACATACTGTTGGCGCACACATTAATTTGCATGGTTGACACATATTCGTCTAACTTCCAGCCACCAAATGTTGCAATAATGCTTTCTATGTGTAACATTTTGTTTGATGTGGCAGAATAGCTACACACAGCATAGCAACACCGCGGATTAGGCCGGGAATTCTCGGCCATTTTCCTACAAATATTGCTATTTGCTGTCCTCACCATGGCGCGGGCATAAAAAACCCCGGCGGTTAGCTAGGGCGTGTGGTTTCGGGATGTATCTCCCTTTCAGTCTGTTTGAACCGCTCTTCTTCAAGCTCTACGCCCAGGCCTATACGCCCCAGCTTTATCGCAGCCTTTATGGTTGCGCCAGAACCCATGAAGAAATCGGCCACCACGTCACCGGGGCGGCTACTGGCGCTAATGATGTGCTCCATCATTTCGGCGGGCTTTTCGCATGGGTGCTTGCCAGGGTAGAAAGCCACCGGCGGATAGTGCCAAACATCAGTGTAGGGAACCGCTGCCGTTACCGCGAAGGGGCGGCGAAGTGATCGGTATTCCTGGCACAATGCCAAATATTCACGGTTCAGTGTCCGGTATTCCCGCACCAGCTCATGATGGGGCCGATTCAATCCACCAGCCTGATGCCGCTCTTTTGCGATACGGTCAAATAAAGCCTGGAGCGCCTGATACTGCTTTTCGCCCGGTAATTGCCATTGGCTCTCTGAGAACCAGTGGCTGCACATCTGAGTTTTCGTCGCTGCGTTGATTTCCTTCGCTGACACACCGAGGGATTGCCGGGCCGTTCTGAAATAATCAATCAGCGGCTTAAAGACATTGTGCTTCAGCTCTCCACATTTCGCAGCGAAGCCGTCTACCTTCGGTTGTAGTGGGCCAGCGTAGTGACCGGCGAAAATGATCCGCTCAGTCGATGGGAAGTATGACCGCAGGCTTTCTTTGTTCATCCGCTTCCACGGACCGGCAGGCTTCGCCCAAATGATGTGGCTCAATACGTCGAAACGCTGGCGCACCAGCAGCTCAGTATCGGGCGCCAAGCGGCTACCGCAGAACATATAAAGACTGCCGTTCGGCTTTAGTACCCGGTAGAACTCCACCAGCAACGAATCCAGCCAGGCGATATATTCGGCCTCGCTCTTCCACTGATTATCCCAGCCGCAGGATTTAACCCGGTAATACGGCGGGTCAGTCGCGATCAGGTCGATGGAGTTATCCGGCAAAGTTTTGATAAACGCTGTTGTGTCAGCGTTGATAAGACGTGTGTTTGAAATCATAAGCGCCCTTAGTTGATACGCTCGTCCTGCTGTTAGCAGCACGGGCAAAGGTTGCTTGTGACCAAAGACATGAGCAACTGGCGGATGGGGTGTTACCTCACTCTTTCCGCCGCTCACTTCACAAAACAGAATGAAAAAAGCCCGCATGATTTGCGGGCTTTTCACATGAAATCTGGTTAAAAAATCAATCTTTTTGTGGATAGATTTTTCCATACATCCCTTTGATTTTACCCATACATTTATGGGCGGTTATCAAAGCAGATTTAGCATCAGCCTCTTTTACATCGGCATCTATAGTATAGTCAGCCCACTTCCTCTGACCATGAAGTTGCTTAAGTATAGATCCAACAGCAATGAGATCATTTTTTTCAAAAGGCTCATTACCTTTCATCCATGACTGCGAAATAAGATAACCACTTACACCGGCATGAGTGGTAGGCGGACAATTTTCTAAAACACCACATATTGAGTGGTAAACACCGTAGTACGCTCGTCCAATAGCATTTCGAAAGCCAATCTCATTCCCATCATTGAGACAACGATCAGCAAAATTAATAAAATCAGAACCTTCGACACTCATTGATATTCTCCATCCTTTCGAGAAACACTCCGAAACCAAGAGGTAAAGG
CTTTATCGATATACTTTTCATCTGCTAATACGCAGGCGAGCTCAAAGTTCAACTCTGAAAGTATTTCTGGGTCCTCAGTTTCTGCATCGATGATGTAAGCACAATCACCGTGAGTATCAATAAAGTAACTAACTCCAATGCAGTTTACATTTTGAGCATTAGCCAACCCCTCGGCTGTATCGCAAAGGTCTTCGACATCTTTTTGGGTAAGAGAAGTTAGCTCTTTGAAATCTTGAATCCTTTGCATCATAGCCTCTCCCTCACTCATCATTCGTTGCTTTTGCTCACCATCAAGCAGCGCCGCCATCTTGAGAGTGTACTGGCGAACACGCCTAGAATCACCTATGCCATATGACGCATTTCTAGCAATACGACGCATCGTGCTGTTGTGGTAGAGCTCTTCAAGCCTGTAGATTTCAAGCCTATGTGTATAGTTATGAGCGGAAGTTCCAAGATAAGCAAGGTAGTTTAGAGCTACTCGTACATCATCATATCGTAATGATTGTTCGAAAAACTGCATAGCTCCACTATGATCACGTTTTGCTACATGAACCATAGCTCTTATGTAAGAGCGTTCTGGCTCAAGTAAATATGACGCATCTCTCAAGATCTCTTCAATAACCCCTACATCTGGAATTTCACCGGAATCAATGTAGCCCACGAGTTTGTGGTGAAGATCTGAGGTTTTAGCTTTTGGCTGTGATGACATGGAAGCCCTTACAAAATCGACTATTACATCTTTTTGTTTTTATTCGTTTTATCGACAACTGTCAGCGTCGAGTAGTGCTAATTCTAGCACACGCCCGAGTATTGTCACTCCGATTCTTCTCAGAATGCGTCCTACCATGGCTCATACATGACCTCGCCGAAGCGAGGTTTTGAAAGCTGATAAGCTACGTGACTGCGTAACCACTCTTATCACAATAGCCCGTGAAATTCGTAACGAAAAGCAGAATCTTACGCCACCTTCGAAATTTCTGATTTATGCGTCCACGCATCCATCTCCAAACTGGCGCCAGTCATCGCCAGGCACCCCTCGATAAAGCTCTCTGCCATCATCAGCTTCTGCCGGACGTTACCCTCAGATATTTTCCAGCGTCGCGCTATCTCAGACTTTGACACCCCGTAGCGATAGTGCAGCATGATTACACCCAGTTCCCGTTCATCCCTGACCTTCTTCAGCCGCCCCACCGCTCCGTCAATGATCAGCCCATCACTATCACAGCAGGAATCTTTGCTTTTCCCTGTTGCCGGCAGCAGCCCTTTAAAGCCGGCAGCGATCGGCGAGTACCCCACCCCGCTGTTATCTTTTGCCCACTGTCCCCAACGTTCCAACACCAACTGAATATCTCTCATGCTTTTTTCTCCCGGCGGCCGGCCGCATACCGGCGCGCCATATCTCACACGTTATTTGATTGAACCGACCGACAATGACCAGTCCATGAATTTGAACCACAGCTCTATCTGACTTCCGTACTCATCTTCAAATCGTGACATGTCACGATGTAGCTCATCGTGGTGTTTTCGGCACAGCGGGATAGTGAACAGGTCATGCGCTTTTGTCGCCATGCCACCCTGCCCGTGGCCTATGATGTGGTGCGGATCGTCAGCTGTCCCACCACAGGCCGCGCAACGCTGTGATTTAACCCATTGCAGGTATGATTTGCTTTCCCACCGCTCGCGCTTCGGCAGTTTGAATAGCGCCTTCGGCGGCTCCGGGTCGATAACCAACGTCTTTGCCGCTTTAGTCACCTTCTCAGCCACCACCTGGCGCGCCGCTGGCGTGTGCGTTATCTCTGATTCCTTACGCCCTCCCGTGTGGATCTTCTCTGGTGGTAACCGCAAAGAGGCAGAGCAAACGTCATCAGGCAGCAGATCGGCGACGGCGCGCATCACCGCCCACCAGCACAGCTCCGGCAACGTCAACTGGTGGCTTTCGTCAAACAGGAAGTACGAGCGCACAGTTTCGAGAATGTATGCCGCTCTGTTTGCATCAGCCA
CGACCTGCAACTTCTCGATAGACTGCCCGCGCAGCCGGTTGTCGCACCCCTCGCATAGCCTGATTGCGCTGTTGTCGTAGCGTAGCGTCGTCATGTGTCGGGTGTGGTCACCATCGACCTGACAGCCGCGCTTAAGCGAAACCCAGTGCTCCAGCGAACTGATCCCCCCTGCGGCAATTAGAACGCGCTCATGCTGATAGAACGGCGTAAACCGTGGGTCATTCGCCAATTCTTGCCCTGGTGCCGGCAGCGCGCCAGATGGCAACTTTGCAAACTCGCTCGGCTCACGGCAGACCAACAGACGACCCGTCAGATGAGGCAGCAACTCCTTGCCAGGTCGCAGGATGATTTGCCCCAGCTCCGGTACGGTGATCGATTTGAGCAATGCCCTCATACGCCCACCTCACTGATCCGCAACTCCACCTTCCCCCCCTTCGTCACTGGTCCCCACTCCGCAGACAGCTTTTTGATCTGGCTGTCATCGAGCCAAATACCCGCTTTCGTCATCGCGTCAAACAGCGCCTTGAAGTAGTTATCCAGATCCCGCCTTGCGCGGCTCGGCGGGTAAAACACCACTGAAACGGCCAAATCACAAGTTAACGCCCGCGGCTTACGGTTTAACTGCTCAAGAATGCTCGCCAGCGCCTCACTGTTGAATTTTCTCCCCCTGTCGCTGATAAGGTGTCTCCCACGAAGAGATCCCTTGTTAGGCGCTCTCCAGTAGCCATTTACGGACGGTGGGAATGGCAGCGTCAAAATCATGCGGGCACCTCCGCCTTTCCCGGCACCAGCACAACGGACTGATCGCACTCGTTACCCCAGCTATCCCAACCATCAGCCTGCTGCCGGGCGAACAGCTCAATGCGTGGTACGTCGCCCACCAGCTGCACCAGCTTTTCCCGGAATACATCAGGCTTGGCGCTATGCTCCATGCGTGTTGCTGTCACATGCTGACTGATCGAGGCGTCATGGCGCTGCAGCAGCTTCCCGCGCACTGCGAACATGCAGTCCTCACTATTGGCGCGAGTCATCCAACCCATGCCCAGCGCGTTATTGCCTTTGCGCAGGTTCGTCTTGTGCCATGTGAACCCCTTCATGGTCATCAGTCGGAACCCCCAGGCGCGCATCACCTGCAGCGCTTCCTCCGGCTGCGTTGGCACCCACCACATCGCCAGCAGGCAGTTTTCAGCAGCCAGATCCCACACCGGCAACCGGCAGATATCGGCAACGCTCATTGTCGGGTACTTGAACCCGGCGCCACGCTTCCCTGATGCGCATTTGTCTTTGTAGCTCCATGGCGGATCGGCATAAATCAGCGAATACTTCATGCCACCAGCTCCTCTGCAGCCACCAGCCGGTAGAAATATACGAGTTTGCCCGTATCTGGATCCTTGACTGTCCGGCGTTCTTTCACCAAACCATGGACCTTTGGGTGAACTTCACGAAGCCGCGCGCTGATGGCTGCCTGGGTATCCGCCACAAAGAACATCATGAATACGGTCCTCTCCAGCTCGCGCAGCGTCATCCATTTAGCACCAGCTGCAGCCTGAATAACTCGCCCCATCTGATTCTCTGGGTTGTCTTTCAGTAGGCCGGCCAGAACTAATTTGCGGATCCCGCTGTTGATGCGTTCGCTCTCAAACACGTCAACGTCGATGGTTAATTTTTTCATAGCCAGCCCTCTCTCTTGCGTCGTTCATACTCTGCCTGCAGCAGCTGCGCCGGCGTTGGCCCTGCCGGGGTTTTCGGCGCGGCAATCTGCACCGCTGGTGTTGGGATTTGCTCACCGTGCGCCAGGCGCTTAGCCCAGCCGCTCAGCTGGTTCTGAATTGCTTTGCGGGTTTCAGTCTCGGTGTGGTTGTAGCGATGCATCGCCCTGCGGACGTCAATCACGATCCAGTACATCGCCGGGTGCGACCACGGAAACGCCTCGGCAGACGGGTACTGCGAGCGCGTGGCGGCATACTTGTCGAACTCGGTCACCGCCTCATCAAGCGACG
GCAAACCAGCAGACTCAGCAGCACCAGCTTTGCACCAGCCGATGAATTTGCCGCAGCTCGGCCAGAAGTCGCTCTCCTGCTGACGAGCCATGCGCATACCGGCGCGTAGCTGGTCGGCGGTGGTGATCCCGTTCTCAGCAAAAGCCAAGATCCATTGGCGCTTTGCAGCTGCGATATCCGCAGGAGTGCTCAACGCGGTGTTCTTCGCCGCTGGGAATATCTGCATCAGATTTTTGAACAGCAGATCCACAAGCTTCTCGGCGTTGTCGTTCACCACGCGCTGTGCCGGTTCACTCGGCATCATGCGCGCCAATGTGCCGGCGTCGCGGCTCTGAACTGCTTGCATGAGTTTGTTCATAGCGTTTCACTCCAAGCTTCTGCGGTGTTCCACTCTTCCGACGCTGCAGCAACGGCTGGCGCCAACTTGATGACCAGGTCATCCCAGCGCTCACGAAGTTTTGACGGGCTGAGCACGTTGCGGCACCAGAACTTGTCGCGGTTGGCCCTGCCAAACAGCTGGCAGATTTGTTTATGGCTCCTGCCGTCTTGAGCGCACATCAGGCGAACCTCGTTTGACCACGCCACCCAGTTAGGCTCCTTGGGTCGGCTAATCTCACCGTCGTATTCAGCTGCCTGCTCGTACATGCCGATAATCCGCCCCCAAATCCATTCAGCGCATTTCAAATCGTCCTCGCCACCCCACTGCCGCTTTTTGGCATCCCACACAACAGCCTCTGGATGAGCAGCGAGAAACTTCTCCTCCGGCGACAAAGTCTTTTCGTCAGGTTGCGAAGCGTCCTGACAAGAAGGTTTTTTGTCTTTAGGTTCTAATGACTGGTTCTGGTGCCATCTGGTGGCACAGGGGGTGCCAGCAGGTGACACAGGGGCTGTGCTTTCTGACGACACACCTGTGCTTTTTGGCGGCATAGGGGGTGTGCTTTTTGACGGCACAGGGGCTATGCTTTTTGGCGACACAGGGGTGCCTAGGGTCATCCGGTAGATATTGGACGTGTTACCCTTTCCGTTATTGGCTCCAAGTCGGTTCTCTTTGGAGATCAGCCCCATTTTTATCAGTGCAGTGATATGCGCCCTCACAGCGCTTTTGCTGCACTCGCAATGGTCAGCAATATGCTGATATGACGGCCAGCATTCGCCTTCATCATTGGCGTTATCTGCCATCTTAATCAGCACCAACTTACGCAGGGGATTGCCAACCTTAATGCTCATAGCTTGAGCCATCAGGTTCATACTCATTCCGGCACCTTCCTGAACTTCGTACCAAAATCCCGGCGCGGGACTGCGCAGTCATGCGGGTAACCAGGACGGCGAAAGATAACCCGATGGTTAGCCTGATCTACTCCGGTTACGTGCACCAGGCGACCATGGTCGTCATGACAGTCCATCTCAAACGGCACGATCTGTTCGTTTTCCATCGCGCCCCCCTTGAGCCAGAAATCCCAGCGTTTCCCACAAATTACGCGCTACCGTGTGGTTGATACCCTGCCAGCGCCCCTGTACATTCAGCTCATACGAAAACGCCGCAGCACTACGACCACCGGACACTGGCCGGCAGCGCAGTTGCGGAATACTCGAAAAATTGCGTACACTGTTCATGCGTTAATTTCTCTCACACGTTAAATTGATGCGCCGACGCCCGGGACCGCATATCCTGGGCGTCACCCTTTCCGAACATCATCACTGTCACGGCGTAGATCTCCGCCACCAGCGTCTGAATCCGATAACCTTTAGCTTTCAACTTCTTGCTTTCGTCGCCGTCCAACACCCCGTCGGCAGTAAAGTCGTTGTGCGCCTTGGCAAACACACCCAGCGCCGCCATCAGTTCGTTGAATTTGAAAAGCAGTTCTTCGTTATCAACAGCGCCGACTTCTGGCAGCTTCACAAACACACCACCAGCGCGCTTACACATCGCCTCTGTGATATCTGTGCGCTCGCTGATCTGCTCCATCTGAATGGCCATACCCATTGGCACCATCTGGCCA
CCGACCTGGCGAACGCGGTTGCGCAGCGCGTTATGGGTGCCGTCGTGTGCCAGCTCCTGCGCCATAGCTTCATATCCACCAGCAAACGACGTGATCAGCCGGTGCATTACATCGGTGATATCGCTTTGGGTTGGAAAGTCTTTGTTATCCACAATGTTTCTCCGATTGTTGTGGTTTTTGTTAAGCCACTGGGGCTGTAGACTTTTTGTAAAGGGATAAATCGACAAGTAATTCACCACTGGTCAGCACCTGAATTTCAAAAGCCCGGCCTTTGGGAATGATTTCACCCCACCCCGACACGGAGGCATGCGACAGACCCAATGCCTTGGCTGTCTTGCCAACCCCACCGAAGTAGGAAATAACATCATCTTTTTTCATAACGCTCTCTATGGTAGGGGAATAGAACATAACGATAGTAGGATATCTTACATTAAAGAGTCAAGGAATCCTACATTACTAAATGGTAGGATTGCCTACATGGAAATGAATGATCGAATTAAAGCTCGACGCAAAGAGTTGAAGCTGACTCAAGACTCCTTAGCCAAGCGAGTCGGGGTAAACCGTGTCACAGTCACAGGTTGGGAATCTGGTGACTATAAACCTGGTGGTGAAAACCTTCAGGCGCTTGCTGCTGCTCTAAACTGCAATCCTGGTTGGCTGCTGGATGGCGGAGATATCAACGAACAAATCACATATGTGGGCAAGGTTCGACCAGGGCTAGTCCCTGTGGTTGGTGATGCCGTTTTAGGAGTTGATGGCATGATCGACATGGTCGAATACCGTGGTGGATGGCTCAAAATTTATAGCGATGACCCTAACGCTTATGGGCTACGTGTCAGAGGCGACAGCATGTGGCCTCGCATTCAGTCAGGAGAGTTCGTGTTGATTGAGCCGGGCAGCACAGTCCACCCTGGCGATGAAGTGTTTGTACGCACCAGTGATGGCCACAACATGATCAAGGTGCTCAACTACACCCGCGAGGGGGAGTACCAGTTTAACAGCATCAACCAAGATCATCGCCCTATCACCATGGCTCGTGCTGAAGTTCAGAAGGTTCAGTATGTTGCTGGCATTCTGAAGGCATCTAGGCATGTAGACACAGATGCAGTAACTGATTCAATTCCTTAGTTATATAATTTCAGCTTAGGAACAATAGGTAACCGCACACACCGATTTATAACAAATAGTTTGCTTGGACATAGTAAAGCAAGAAATCGCAATGTAAATTAAATAGGATAAAGGATACATATGCAGGATAATCAAGAAGTTATTATCAACTCTGAAGAAGAAGCTTTCGCTTTTCTTGAGAAATATGTATCTGGATATTCCCTTCCCGAAAACGTATCATTTGGCGAATGGCCTAACCTTAAAATCAAGCTAACTGGTAAAAAATTCAACAAGAGCTTAACACCTTCAGTGATGAAGGGATTTGTTGATATGCAGCACCAAATCAACAAATCATATGCGCTTGTTAAGTATGGAATTGCAGACCCAAGAAAACTGTCTAAAGAAGAAAAAGATGCCCTTGAAATAGAAGTTACCGTTGAGCAAGGTTCATCGCTAATCGAAGTTAACATAGACGGTTTTCTAACCAAAATAACGCATGAACTGGTGGGGAAAATGAACCCTCAAGACGTAGTAGTGACGGTTTTGGGTGTCGCCTTGATTTGGGGAGGCGTCACCCTCTTCAAACGATTCCTTGATAACCGCAAGGAAGTTAGACTTGCAGAAACAAAAAAAGAAAGTGACCGAGAGCATTTGCACACCATGAAATTCATGAGTGAGCAAGAGACAAAACGCCTTGAAACTATTGGAAAAATAATTGCTGAAAAACCACAGTTAGATAATATGGAGCGTTTGTCTTACGACGCAAAAACTGAAATGGTAAAATCCTTTGCCACTGCAACAACGGCACAGATAGACAACATTGAACTCGATAGTGCAACATCAAAAGAACTGGTCACTAACGCTCGCCGAAAATCGGTCGAACTGCGCATGGA
TGGCATGTATAGAATTGAAGAGGTAAACTCTACAGATCCTGAGTGCTTTAAAGTCAAGGTTAGAAATATAAATAACGATCTTCGGATAAATTGTGTAGTTCAAGACGTTTTCCTTGATGCATCCGAAAACAAAAAGGCATTACAACAAGCTGAGTGGGATCGTAAGCCTGTCCATCTTAGCATCAATGCCAAACATATTGATGGTGACATTAAATCTGCAGTTATCCTCTATGTAAAAGAGGCTGAACCACCTAAAGAATCCTGACCTCTTCATCATACCCGGCCACTGCGCCGGGTTTTTTGTGCCCTCTCCCCTCCTCCATAACCTTCCTTACCCCAACGACCAAAAGCTAAAAACGCAAATGTAAGATTACCTACAAAACACCATTGACATAAAATGTCAGATATCCTACATTAAATCCATCAACAGCGAACAGGCAGGACGCCCACGAAGTAGCCGCCCGAGGCACACGAAGATCGGGATGATTCGCTAACCAGATACACAGCAGAGGGTTACCGATGGAAATGCAGATGCTAAAGCCAGGATTTACCTACACCAGGGTGGATAGCAAATGCCCGCATTGTGGGAAAGTTTTTTCACAGGATCTTGATTCCTCGGTAAATGAGATATCCACCGATGATGTAAAGGATTATGCCAGTGATATCAAAGCATCTCCTGAAGCCAGTACCATGCCAAACCTTTCCGTACGCTGGTACAAGATGATTAAAGCGCTTGTGGCTCTTCCATTCACCACAAAAGAAGAATACGCCACCGAGCGAAATCAGAAGCGTAGCGTTCGTTGGAAGTTTTGGAAGTAGCCCAGCACCAGTGGTGAGAAAGATTACCGTACAGACGACGATCATTACTTTGTACCAGGTATCTAGTTCCAATTTAGATAAAGGATTTTCCATATTTTCAAATTCTTGGATGTGTGAGAGCTACCAAGAATACCACCGAGCCTGAAGTGGTTAAAAGACAGGCATCAAAATTATGCGGCCTGCCGGGGCCACCGTAGCGAAAGCGAGCGCGGATATCCGGCAACTAATGCAAGCTATGTGTAGTCTTTGGCGGCTGTTCCATGGGTTGGTGTCAGCCGCACTTTTTTCACAATTGACGTGGTTCGCTGTGCCGGTTTCCCCTTAAACCTTTACACAGTATAAGCCCCGGCGCGGCGCACCACTTGAGTTGTGAGAGAAACGCGGGATCCGGCCGCTATGCGGGCTGTCGGACTCCCCTTCTAAAACCCGATTTTCTATCTGCGAAAAATTGCCAGGTCTGGCAGGGCTTCGCTTTGCCGAAAATCAGCGAGAAGGAATAACGTATGTCGTGGATTACCACATTTACCGGCCAACACTTTAACTTTGCGGCGCCGGACGTTGAAAGCATCTGCATTGAAGATATCGCCCAGGCACTTTCGCATGAATGTCGCTTTGCCGGCCATCTGCCGAACTTCTACAGCGTCGCGCAGCATTCCGTTCTGTGCAGCCAAATTGTTGAGCCAGAGTTCGCGCTGGAAGCGCTGCTGCATGATGCCACTGAAGCATACTGCAAAGACATCCCCGCCCCACTGAAACGCATGCTGCCTGATTACCAGGCGGTTGAATGCCATATCGACATGGTGATCCGTGAGCGCTTTGGCCTGCCATTAGTGATGGATATTTCCGTGAAGTATGCCGACCTGGTGATGCTGGCTACCAAACGCCGCGATCTGGATATCGACGACGGCAAAGAATGGCCGATGCTCGCCGGCATTGAGCCGGCCGAAATGCTGATCACACCAGTGATGCCAGTGCAGGCACGCGCAATGTTCATTGAGCGCTACAACGAGCTGACCGAATGGGGAGTTCTGTGATGATCAAGCGATACACACCAGACTGCTCTATTCACATGAACCATGAAATGGCGTTCATGCGCGAACTGGCCGGCGGCGAGTATGTGAAGTATTCAGACCACCAGCAGATTCTGGCTGCGGTGGTTGCTGAGAATGCGGCGCTTAAAG
ACGTCCGATGCTGGAACTTTAAAACCGGAGCTGGGGCGTTTGAGCAGGCACGAACCGCAGGTTGTGATCTGGATGACTGCATTCACGACGTAGTCCAAGTGATGCTTTGTAAGTTTGAAACCCCGTCCACTGACGCAGCACTTGCAGCTATCCAGGCGCAATCCGTTGAAGATGCAGTGAAGCAAGTTCTCAGCGTGGACACGATCGCATCTACTGCTGTGATTTCTCATCTGCTGCGTGTCTATGCCACCGAGCTGCGGGAGGGCCAGCAAAATGCAATTAAGTGAAATGATTATCAAATTGAAAATTGCAGCAGCGAGCGCGACCCAGGGTGAGTGGCGTTATTCACGATCTGGATTCAACTCAATAGTACAGGCCTCTGTTTCATTGCCACGAGGGGGTAGTTCCAGCGTTGTTCTGTGTAAACTTTTCCGCTCCGAATGGCGTGGGGAGCTGAAAACTGCACATGATGCTGCATTCATTTCGTTAGCTAACCCGGGAAATATCCTCGCGATAATAGCCGCACTAGAGGCCGCTGAGAATCTTATCTCCGAGCTGCGGGAGGGAAAATGAAAGAGCGCCCGATAATTTTCAATAGCGAAATGGTTCGCGCAATTCTCGACGGACGCAAGACGCAGACACGGCGCGTTGTTACCAAATCATCGGCCAGTATTTTAAGCCTGTTGGAACATTACCCGCATAAAAATTACAGCCTGCATTGCTCGCTAGGCCAGCCTGGCGATCGGCTGTGGGTGCGGGAGACGTGGATGCCTGATGCACCTCGTGATGGAACGTGGGGTGATGTGGAGTTCTACGGCTGCAAAGGATCGCCGCTCAGCATGATACCTCTGCGCTACCGAACTCCTGAGCATTGTATTTACCGCGCAGGCTGGGATGGGCACGAAATGGTTGGCTGGACGCCATCAATCCACATGCCGCGCTGGGCGAGTCGTATCACGCTGGAAATCACAGCAGTGCGCGTTGAGCGGCTGAACGATATCAGCGATGTGGACGCAACTGCCGAAGGTTGCAGCACTACAGACATGAAGAGCGGAGATTGTCTGGCCGATGTGTTCGCGCGCCTGTGGTCATCCATCTACGGCACTGACAGCTGGGGCGCCAATCCGTGGGTGTGGGTTATCGAATTTAAGCGTGTGGAGGCCAGCAATGACTGAACTGAAGCCGTGCCCGTTCTGCGGCGCCACAGATCCGGAAATTACTGCTGATTTTTATGACTGTTTTTTCGTCGTTTGCGGTTGTGGCTCTCGCGGCCCGGATAGCAAGGAGCCGGGAGACAATTTATCGGCAATGAATAACGCCGCGACAGCATGGAATGAGCGTGTTGAGCACAGCCAGGGGGTAGAGCGTGGGTAAGTTGACCAAAGCGCAGCGCGCGGTATTGCGTGAAAAGTTCGGCGGACGCTGCGCTTACTGCGGCTGCCCACTACCAGAAAAAGGCTGGCATGCAGACCATATTGAAGCTGTGTATCGCAAGCTGGAAATTGATGAAAAGGCCAGAACCAAGGGAAAGTGGAAGCTGAAGCAAACTGGTGAAGTTTTTCGGCCTGAGAATGACGCTTTCGAAAACATGCACCCTGCTTGCGCACCATGCAACCTGTTTAAGGCGACATTCAGTCTTGAGATGTTTCGCGAGCAAATAGCCGCACAGGCAGAGCGAGCGCGGCAGTACAGCGTCAATTTCCGTACTGCTGAGCGCTTCGGGCTGGTTGAAGTAAAAGAAGCACCAGTTGTGTTTTGGTTTGAGCGTTATCAGGAGGTAGAGCGTGGGTAAGCGTAAGAGCAACAGATCGGCCAGAGCTTTGCTTTGTGTGTCGGCTCCAGGCCCGGAAGACTGTTTCCGCATCAGCAACCGGCGCTGGCAGGTGTACAGCGCCAACTACCCGCACGCATGGCAGCACGTTAAACCATCCCGCCAGCAACGCCGGGCAGCACAGGAGAAAGCACAGTGAGCAACGAAACGAAAAGTCTGGAAGAACTGG
CGCAGCCGGTGGCGTGGCCGAACAACTGCAATCGTTCCGCTATTGCAGCGCTGCGCTATCTCGCAGAGAAACCGCGCCCCGCTTACGGGAATAGTGCTTACAACACAGAGCATCTTTACATGATAGCCGGCGAACTGACGCGCATGTCTAGCCAGCCGCTCTACTCGCAAGAGTGCGTCGATGCATTACAGCAGCGCTCTGAACGTCTCGACGCAATGCTAACTGAGTCAGTCGAGGCGCTGAAGGCAGCAGAGCAGCGCGTTGCAGAAGCAGAGATCAGGGGGGTAGAGAAACTGGCGGCCGCTTTCAAGTCATGGTCTTGTGATGACTCTTTCCCGGATCACGAAGCACAACGGCATTGGGCCACTGCGAGCAAAGAAGCATCATCATTTGCTGATGACCTGCGAAATGGTGACTGTGATGATCGATAAATATCGACTCGATAACCTGCGGCTGCTGAGTGATGAGAAAGGCGAGCTGGCACGATGGGTTATCCAGCTCCAGGCAGATTTAGACACGGAGCGGCGCAAAAAGGCATCACTGGATAGTGTCATCGCGAAGCTGGAGCAGACTCTGGAACGTGAGCGTGAAAAATCCCGGCGGGTTATGTCTGAGAATCAGCAGCAGGCGCAGCGCATCGCAGAGCTGGAGGCCGCAGAAAAGGTTTGGGAATCTGCAGCTCAAAAGCATATTGCGCGCGCCGAGGCGGCAGAGCAGCGCATCGCGGAGCTGGAGGAAAAATACAGTGAGGTATCAGGCTGGTACGTAAAAATGAAAACTCGCATGTTATCCGCAGAAAAGCGGTTGGCTACGCCGGTGCGGTTGCCTCAGTTATGTGGTGACGGCGATAACGATACCGAATACACGGAAGGGCTTAACGACGGGATAACACAAAGCGCCATTTATCTGCGGCGCCAGGGTTTCAAGGTCGAGGGGGATGTGTGAGCAAATTAACGGCAGCAGAGCAGAAATGGCTGAATGACGTGCAGCGGGTTCTCGATAAGTGCCCGTCTAAGCGGCTGGGGTTCGCAACGATGGGTGACCGCGACGTCACTGTTTACGATACATCGTTCGCGCGGGACATTGAAAAGCACATTTTTAGTGGTGGAACGGGCGACTTCATCCCTACGGCCCAAAGTATGGGCGCAGTTCTAGGCGAAATAACGTTCCCGGCAAATGTCGAGTCAACGGCGGGATAAGGGGGAAGCATGACAATCACAACTGAGCAGTTAAAAGAGCGCGCTAAATTCTGGCGTGAAAAAGCTGCAGCGGCAAATACTTCGTCGCGTCAATCAGTAGCGATGCTGTATGCAGAAGTGTTTGAAGAGCTGTTGCATTACCGTGATGCCGCGCGGGAGGCGGTGCCGGATATATCTGAGCTTGAGGCAACACTCGCTTGGATTCTCGAACTGCCGGTGCCGACCTATCGGGCCGCACATTTTGCAAAACGCTTGAGCGCAGTGATAGACACCTGCCGCGCCGCAATGCTGGCAGCAGCGCCAGCCGCACCTGCAGCACCTGCAGCACCGGACGATTTAATCATGCAAGTGCGCCGGCTCGTCCACGCGCTGAAAAAATCCAATCCTGACCATGTGTTAGTGAAGCAGGTTCCGGACTACATGCGCCGTGCTGGGTATTGGAAGTTAACCGACTGCCTGCGTGGTGCGGCAGCAGCGCCGGACCATTCCGATGATGTGCTCGATATGGTGAGCGATGATTCATCACGGGAGGTGAAGCCGTGAAAACGCTCATTATCAGCGCCATAGCCACCAGCCCGGTCATGCTGTTATATCCCGTGCTCGGTTGGCCAGCGGTGGCTATTTTGATCCTAAACACACTGATTTTAGCGGGTGTCGCGCGGTATATCTCAAACAGAAAGCGTGATGATTCAGATAGTTGGGACGGTTATTAGAGATCCTCACCATGGCAAAGACAGCAGCAGAAAGAAAGGCCGAACAGCGGGAAAGGCAAAAAAATGATGGCGTGGCAAAGTTCGAATT
GAAGCTCGACCGCCAGGAGCTGGAAATGCTACGCGAGAACTGCGCAGCGCGCAGGCCACAGCGGGAGCCGTACGACATGGACGAGTACATCACTATGCTGATCAGGAAGGATAACGCTGAACTAAAAAAGCAAATGGCGGCGCTCAGTGAGCGCTGCTGCGGGAAGTGTAAAGACAAGCTGCCCGGCGATCCTGCCGGGTGTTACTTCCTAGGTGATTCCGAATGCTGGCAGACGTATGGCTGGCATGATTTAAAAATAACAGTGTGACATGTCACGATAGATAATCCGTATGCGGCGGGTAGTGTGAGGGAATAGATGAAAACACATGATGAAAACGAGATCATTTCAGAAGCAGATATTGAGAGGATTACAGGGTATAAAATTGCATCAAAACAGTGTGAGGCTTTAAAAACCTCAGGGATATTTTTCATCACCAGGAAAGATGGGAAGCCGAGTACCACGTGGGCACATTTCAATAACCCTCTTTCTCATCGCCATGTATTAAATCATGTTGATGACGGTCCACAACCTAACTTTGGAGCACTGGACTAATGGCACGTGCGCGCAAGAACAAAGAAGACGCCTGGATGCCTCCGAGGGTTTACCTCGGGCGTTCGGCGTATGAATATCACCCAAAAGGAGGCGGAAATATCCGCCTATGTGATAGAACCTGCACTCAAGCTCAAGTTTGGGCTGCATGGGAGGCATTAATTACCGAAAAACCAAATGGCTCTACCCTTGCTGGGCTAATTGATAAATTTTTTAAATCCGGTGATTTTTTTGAATTAGCAGCAGAAACGCAAAAGGATTACCGGAAATACTCGAAAAAGGTAACAGACGTTTTTGGTGCTATGCCCCCAGATAACATTAAGCCGGAGCACGTGAGGAAGTATCTAGATAAACGAGGCACTCAAAGCAGGACACAAGCAAATCGCGAAAAGGCATTCATGTCCAGAGTTTATCGATGGGCATATGAACGCGGATATGTGAAGGGGAACCCAACGAAAGGAGTTAAACAGTTCAAAGAAACTGGTCGTGATCGCTATATAACGAATGAAGAGTATGACGCCCTATATGGCGTTGCTCCTGATGTCGTTCGTGTGGCCATGGAGTTAGCCTACCTATGCTGCGCGCGACAAAATGATGTATTGGAAATGAAAAAGAGCCAGCTTATTGCAGAAGGCATATTGATTAAGCAAAGCAAAACAGCTGTAGCACAGATAAAGGCATGGTCTGATCGTCTAACTGCTGCTATAGATTTGGCTAAAGCCCTCCCACTAAACGCTGGAATGAGCAGCCTCTATGTACTGCACCAGGCGTCGGGACATAAATACACAAGAGACGGTTTCAACAGTCGATGGAGAAAGGCCAAAGAAGAAGCTCGTTTGAAATACCCTCACCTCTGCTTCGACTTCACTTTCCATGATCTGAAGGCTAAAGGAATCTCAGATTTGCAAGGGAACATTTATGAGAAGCAGGCGATATCAGGGCATAAAAATGTTGAACAGACTGCGCGCTACAACCGGAAAATAGCTATCGTCCCGGTTGTCGGCGGACAGTGA
Protein sequences of DBSCAN-SWA_1 >LS483492|2997318:3038806|3000398_3000779_-|SQJ21736.1|DBSCAN-SWA MFDPDKYLSVTWLKGGRTFPNLDCFGIVNEIRRDLGLTVWPDFAGITKDGNGLDRAARSFMGDLNRCEPCEGAGIACYSGSVVTHVAIVVNIDGVLHAAECNPNSNVTFLPLTRFMRRFVKVEYYQ >LS483492|2997318:3038806|3017480_3018011_+|SQJ21889.1|DBSCAN-SWA MVTLYTVDRLGFYQNNKHISLIKQSSNRPELDEYLNRLFPDGLSHHGMVYFWSSHSKPIDIEPTLELHIEMHRRAFHPHKPSRFTSLFCCQSIDEAMAFRDRKGSASFPIYEVICDEKIIHSADMNIINPAATTLVFSQHMDMYWSGQSLKEFCPEHPEFNEVLVPLPATIGNRVA >LS483492|2997318:3038806|3024650_3025637_-|SQJ22192.1|DBSCAN-SWA MRALLKSITVPELGQIILRPGKELLPHLTGRLLVCREPSEFAKLPSGALPAPGQELANDPRFTPFYQHERVLIAAGGISSLEHWVSLKRGCQVDGDHTRHMTTLRYDNSAIRLCEGCDNRLRGQSIEKLQVVADANRAAYILETVRSYFLFDESHQLTLPELCWWAVMRAVADLLPDDVCSASLRLPPEKIHTGGRKESEITHTPAARQVVAEKVTKAAKTLVIDPEPPKALFKLPKRERWESKSYLQWVKSQRCAACGGTADDPHHIIGHGQGGMATKAHDLFTIPLCRKHHDELHRDMSRFEDEYGSQIELWFKFMDWSLSVGSIK >LS483492|2997318:3038806|3025633_3026005_-|SQJ22202.1|DBSCAN-SWA MILTLPFPPSVNGYWRAPNKGSLRGRHLISDRGRKFNSEALASILEQLNRKPRALTCDLAVSVVFYPPSRARRDLDNYFKALFDAMTKAGIWLDDSQIKKLSAEWGPVTKGGKVELRISEVGV >LS483492|2997318:3038806|3031762_3032062_+|SQJ22222.1|DBSCAN-SWA MEMQMLKPGFTYTRVDSKCPHCGKVFSQDLDSSVNEISTDDVKDYASDIKASPEASTMPNLSVRWYKMIKALVALPFTTKEEYATERNQKRSVRWKFWK >LS483492|2997318:3038806|3026911_3027601_-|SQJ22209.1|DBSCAN-SWA MNKLMQAVQSRDAGTLARMMPSEPAQRVVNDNAEKLVDLLFKNLMQIFPAAKNTALSTPADIAAAKRQWILAFAENGITTADQLRAGMRMARQQESDFWPSCGKFIGWCKAGAAESAGLPSLDEAVTEFDKYAATRSQYPSAEAFPWSHPAMYWIVIDVRRAMHRYNHTETETRKAIQNQLSGWAKRLAHGEQIPTPAVQIAAPKTPAGPTPAQLLQAEYERRKREGWL >LS483492|2997318:3038806|3005329_3005905_-|SQJ21750.1|DBSCAN-SWA MPALRRFFYDRRKLMSVLTQGTQFYVLANGVVSEVECITSFTPGGNPADQIEDTCLSERKSRTYLKGLQTPGQASVTLNADPKNASHLMLHQLAESDDESKLTFALGWSDGESSPSIAEPGDPDAVDGLLLPSDRTWFVFQGYVSDFPFDFQANAVVTTTATIQRSGPRCGSPRLNKPVNQTGVFRPGNSY >LS483492|2997318:3038806|3007725_3008781_-|SQJ21759.1|DBSCAN-SWA 
MNERELSLIKAISEEFGAVIKSMRQEFTSELSAQQLAFEEKITQLTLSIAELKEAAPDIETRIGEVVGRLEIPAAPELPDIKSMVDEAVTAAMPPPAQDGQSVTVDDVRPMLEELVSTAVADIPVPVAPELPDIKSMVDEAVTAAMPPPAQDGQSVTVDDVRPMLEEMVSTAVAEIPVPAAPELPDIKSMVDEAVIAAVPPPARDGEDGRDAMQIEILPDIDAEKSYPRGTYATHSGGLWRSFEKTAGMRGWECLVDGITTVDIEQDDHRHFTVTVERSSGASTSKSFDVPVMIYRGVFKSGADYRPGDTVTWGGSLWHCDETTSDKPGETGSKGWTLSVKRGRNASAGGR >LS483492|2997318:3038806|3028493_3028676_-|SQJ22213.1|DBSCAN-SWA MENEQIVPFEMDCHDDHGRLVHVTGVDQANHRVIFRRPGYPHDCAVPRRDFGTKFRKVPE >LS483492|2997318:3038806|2997318_3000402_-|SQJ21735.1|DBSCAN-SWA MTIRVYPSRLPGDPLETHEHKAMTLHQWLVGNVSGYEADMRHPIVVEVNHKAIPAAEWPLCYVKPDDDVRIYPVPYGTGLEIAVWAAVAVAVASAAYSLIMMSQLSKDGMSSTANGDSLDLTPAKANSAKLGDPIREVLGRTRIFPDYLVQPVSRFDKENPQIYRTEMFLCIGVGNFVINQSSIKIGNTPVGSFGDDVSFTIYPPGADVSADRRTENWYSSTEVGGTTSGTAGLDLASTGPDSVSITADAISVSGNSLTVVGGTTDDNGDAVIPDSWVVGTDLTIQVPDTYTVAIEGGSNVIYGDFAELKPSVGQSVSVTWNATRLDLFISAHEPGKPAIPGEGGNAASITASAAPTTYDFSTTPLSFTLTWGGASYVISLSANYVTMSGLTDEIADQLSGSGLDVVAVDTKIVIREKESPFSGNSIGYTVLPEVLFGSDPAVVAGTASTGGTPAVSEHIALAWGSTGGDPFVGIPTGLQRIAIGLKGYHFRITEIDGLTLSVERLIENSDGSKTVDSGWPGFTSRTLLDSTVTGLNDKYDWMGPFLCCPDSEKTSEIELNFVYPQGLCDVGSKDGAIHWHDIEMTVQYRLSSSEEWTSVKIKHGNNTVNEVGYTETITFPTAGNYEVRMKRDTPVWGGTTRESVQWQSMRAKLSERPTSYRDVTTIALTIRTGNRLASQSDRRVNMVATRLYDGHTSRSISGAIFHVLKSLGYTDDQIDYQTINTLESTYWTPRGETFDWAAGADSTSALEVLQKIATAGMGYFLLSDGLASVGREGVKNWSGVISPQEQTEELQTSFKALSQDDFDGVDVTYVNGTTWAEETIQCRFADNPTALKVESYKLDGVSDPDNAYRIGMRRLMKYRYQRLNHSTSTEMDALCYNYGDRIVLTDDIPGSQTISCLIVEEQHDENTVHIHVSEKLDWAFNNPRCIIRFQDGSASSLLVASRVDDYTLAVDNTDDVRLDEWIMDDPAIEPPRLIFCSSSRVGYDGIFDSIEPGSDGTSQIKALQYTPLIYQYDDATYPGNVQ >LS483492|2997318:3038806|3033095_3033530_+|SQJ22226.1|DBSCAN-SWA MIKRYTPDCSIHMNHEMAFMRELAGGEYVKYSDHQQILAAVVAENAALKDVRCWNFKTGAGAFEQARTAGCDLDDCIHDVVQVMLCKFETPSTDAALAAIQAQSVEDAVKQVLSVDTIASTAVISHLLRVYATELREGQQNAIK >LS483492|2997318:3038806|3009473_3010127_-|SQJ21763.1|portal|DBSCAN-SWA 
MWNPFRRKEKSLQQPSGQGNWQRIFSFIHEPFSGAWQKNLEINDKTVTSFYAVFSCISLIASDIAKMPAGHQCRDSKGVWQEQRTGRVSALLKKPNAFQNSIQFFESWVNSKLCHGNTYVMKIRNSAGSIEELRILDPHKVTPLVADDGSVFYQISPENISGLSEQVTVPAREIIHDRFNCLFHPPRWFVADLRCWFGGDAGAPYSGEFDILFQKWW >LS483492|2997318:3038806|3029423_3029621_-|SQJ22217.1|DBSCAN-SWA MKKDDVISYFGGVGKTAKALGLSHASVSGWGEIIPKGRAFEIQVLTSGELLVDLSLYKKSTAPVA >LS483492|2997318:3038806|3013998_3014454_-|SQJ21768.1|DBSCAN-SWA MLTGQKRKFALALMSGSTQTAAAIEAGYSEKSARSKGSQLAKDPGVIAFMEEQRGKSAHASPVSDSAAAKKRPTENVTYDDPLDFLKSVMNDPAKDIDTRKDAAKAMLPYIHPKKGEGGKKDARNAAAKVAATASKFGAMAPPKLVVNNKG >LS483492|2997318:3038806|3006284_3006770_-|SQJ21754.1|DBSCAN-SWA MVDRINYRLTGIDEFLGKLDSVSYDLKRKGGRAALRKAANVIVNKAKDNATRVDDPETGRRIADNIALRWNGREFKRTGNLAFRIGVLHGAVLKNHPDKARNAPTPHWRLLEFGTENMRAQPIMRPAAEEGAVPAINTFAIEYEKSIDRGIARAKKRGVKP >LS483492|2997318:3038806|3035209_3035674_+|SQJ22232.1|DBSCAN-SWA MSNETKSLEELAQPVAWPNNCNRSAIAALRYLAEKPRPAYGNSAYNTEHLYMIAGELTRMSSQPLYSQECVDALQQRSERLDAMLTESVEALKAAEQRVAEAEIRGVEKLAAAFKSWSCDDSFPDHEAQRHWATASKEASSFADDLRNGDCDDR >LS483492|2997318:3038806|3026001_3026571_-|SQJ22205.1|DBSCAN-SWA MKYSLIYADPPWSYKDKCASGKRGAGFKYPTMSVADICRLPVWDLAAENCLLAMWWVPTQPEEALQVMRAWGFRLMTMKGFTWHKTNLRKGNNALGMGWMTRANSEDCMFAVRGKLLQRHDASISQHVTATRMEHSAKPDVFREKLVQLVGDVPRIELFARQQADGWDSWGNECDQSVVLVPGKAEVPA >LS483492|2997318:3038806|3030491_3031508_+|SQJ22220.1|DBSCAN-SWA MQDNQEVIINSEEEAFAFLEKYVSGYSLPENVSFGEWPNLKIKLTGKKFNKSLTPSVMKGFVDMQHQINKSYALVKYGIADPRKLSKEEKDALEIEVTVEQGSSLIEVNIDGFLTKITHELVGKMNPQDVVVTVLGVALIWGGVTLFKRFLDNRKEVRLAETKKESDREHLHTMKFMSEQETKRLETIGKIIAEKPQLDNMERLSYDAKTEMVKSFATATTAQIDNIELDSATSKELVTNARRKSVELRMDGMYRIEEVNSTDPECFKVKVRNINNDLRINCVVQDVFLDASENKKALQQAEWDRKPVHLSINAKHIDGDIKSAVILYVKEAEPPKES >LS483492|2997318:3038806|3023190_3023952_-|SQJ22129.1|DBSCAN-SWA MSSQPKAKTSDLHHKLVGYIDSGEIPDVGVIEEILRDASYLLEPERSYIRAMVHVAKRDHSGAMQFFEQSLRYDDVRVALNYLAYLGTSAHNYTHRLEIYRLEELYHNSTMRRIARNASYGIGDSRRVRQYTLKMAALLDGEQKQRMMSEGEAMMQRIQDFKELTSLTQKDVEDLCDTAEGLANAQNVNCIGVSYFIDTHGDCAYIIDAETEDPEILSELNFELACVLADEKYIDKAFTSWFRSVSRKDGEYQ 
>LS483492|2997318:3038806|3005922_3006288_-|SQJ21752.1|DBSCAN-SWA MMAPIFAICSGSDEVRALLGSEPVRLYPFGLHDDAVIYPYAVWQNISGDPENYLNQRPDADRYSLQVDIYADTAAGCVAVARALRDAIEPHAYITRWGGQTRDATTKRYRYSFDVDWIVRR >LS483492|2997318:3038806|3001268_3001739_-|SQJ21737.1|DBSCAN-SWA MALTSNINYPHDYLPMALQEGYALKPISPLLRTDMVTGRARQRRRYTSTPTQTTVSWLMNDVQGMTFEAWFRDALSDGSAWFNMVLRTPIGVKPYLCRFTDIYDGPVLTGGKYWQYSATLELWERPIPDAGWGNFPEFLAGQSIIDMALNREWPKA >LS483492|2997318:3038806|3001738_3004165_-|SQJ21748.1|DBSCAN-SWA MASKSLGTLTIDLVAKVGGFVSGMDKAERASDKWRKQVQKNVEGASKALAGMAAAAAAAGTAVGIAGYQLLKSTSNQMAEADRWAKSLNMSTQELLAWQFAAEKAGLSADNMADIFKDLGDKIGDAVLNKSGEAVDALNALGLSADKLSKVSPDKQLLAIGEALGKINTNAEKTTILESMGNDLSKMLPLFDNNNEKLKQFIQLSKDYGVAPDPDSIDDLVKVNTLFENMESQVKGLKIEIASGLAKVDLSPLQGSLDKLQDVLTDPQVLQGIVNLVSEVAELAGWLVKAAASAGDLAVKVSARHQGVFGTFSDNNVPAMQERIKFLQMQADNAKPGGTWLGNILGNDDSLEHYQTEIAKLQGQIEKVNNSARGAMKLPLGQATIGNGEYSLGSGESNGKPSAPKIPKSSGSSKAIESAFKSTEQSYIRQIALTDQLNGKTKEATELEKLRLDISSGKLVGINKQQQARLEGLAAEIDRFKLLEKYRGLQEELLTPEEKLLESTRERVKLLKDAKDAGLTNDDDYAKASKAIANNAFEKAPEFGGIDPMFGGQLGELRKVDKAQGELEEWYSTQLNMLDEYRQSQSELNEQWDAKELELRKQHADSLQSLNDQRNQLMLSSVSDGLGSVVDVTKSAFGEQSAIYKAAFVAQKAAAIAQSAIAIQQGIAMAAANPFPLNLAAMASVAAATAGIVSNISAIGMAHDGLDSVPETGTWLLQKGERVTTAKTSAKLDATLDRVGKQSVSGVMPVNNIVVNGDPDKSTIRAIEEAVARGEKRVYGRVTGDVATGRGDMSKALMAGWNTKRRAS >LS483492|2997318:3038806|3024200_3024599_-|SQJ22145.1|DBSCAN-SWA MRDIQLVLERWGQWAKDNSGVGYSPIAAGFKGLLPATGKSKDSCCDSDGLIIDGAVGRLKKVRDERELGVIMLHYRYGVSKSEIARRWKISEGNVRQKLMMAESFIEGCLAMTGASLEMDAWTHKSEISKVA >LS483492|2997318:3038806|3033812_3034418_+|SQJ22228.1|DBSCAN-SWA MKERPIIFNSEMVRAILDGRKTQTRRVVTKSSASILSLLEHYPHKNYSLHCSLGQPGDRLWVRETWMPDAPRDGTWGDVEFYGCKGSPLSMIPLRYRTPEHCIYRAGWDGHEMVGWTPSIHMPRWASRITLEITAVRVERLNDISDVDATAEGCSTTDMKSGDCLADVFARLWSSIYGTDSWGANPWVWVIEFKRVEASND >LS483492|2997318:3038806|3007118_3007328_-|SQJ21755.1|DBSCAN-SWA MNEQQVSELIVALREQTQAQKEQTAALNLLAESNNALCNLILQTLMEEGEPSELVSDLRPTYLNGKPIG >LS483492|2997318:3038806|3019010_3019394_-|SQJ21890.1|DBSCAN-SWA 
MNLLFHIKAALVAFAITSLIYFAASNWGLRKELRMAEQQAEQQKKTLAQQAELLATMRADDARNRALMAKQQQREQQLRQQGENYQRKYQDAIKNDDCARRAAPGPVIDILRGTDAGASGTARSTAP >LS483492|2997318:3038806|3019390_3019870_-|SQJ21908.1|DBSCAN-SWA MNSALKKKLAAAASGGAVAIAAVLIPDLEGIRYEPYYDVAGVLTVCYGHTGADIVLGKKYTEAECKAMLDKDLVPFARSVERSVKVPVSQYQRAALISFSYNIGTHAFENSSLLRKLNAGDYSGACDGLRQWIYANGKSWRGLMNRREIEHAVCSWGQR >LS483492|2997318:3038806|3015560_3017021_-|SQJ21771.1|DBSCAN-SWA MQVIIDGIQYEPAQQQGSSRIGIAITTHQRPDVLKQAIDQHIKHLPAGSLVVIVDDGSMPAAVAPADIKIIRHEKSLGIVASKNASLEALMNAGCEHIFLFDDDAWPISDGWHVPYIESPEPHLAYQFLDLSGPRKLKDIAVLYRDDQHVSYTGQRGVMLYYHRSAIEAVGGFDPIYGRGMYEHSDLALRIHNAGMTTWAYADVTGSEKLIHSLDEHDAVVRSVPKTDRQSLVERNVKIHNGRRDAGYTGYAEYRQKRDVVITSLLTSEPDSQRGTRMTPDPLVLSTWAKSISGAAAVVLADQLTTPPAGAMLVSVPDVAMNVYYRRWLHIYQYLRDHPEYRFVWCTDGTDVEMLREPWDDMEPGYVYVGSEPKTYGDTWAKAKHPERLYQDFIENNRGSVMFNAGLLGGCREDVMAFAHGIIRLYYRIESYRFWKMEQGASAVGDMMAFGMVAKSFGDRVVTGPRVHTVFKMNGLGKEYAWWKHK >LS483492|2997318:3038806|3014985_3015576_-|SQJ21769.1|DBSCAN-SWA METQIAFVVVGHHERREQAAALAGSLGAHLLIDEEHHGANWNHRRAMEYAAKQSHRVVVLEDDAVPVAGFRSMVGQWLAMFPELLISFYLGTGRPPQYQMEIAVKLIAADRARQDYIEMHRLIHGVCYSMPRQHIPKVLAKWDSKKAADYAVGDAYGGQVIYPCQSLVDHADGIPVERAKDGLPRNERRRAWRLYG >LS483492|2997318:3038806|3007395_3007722_-|SQJ21757.1|head,tail|DBSCAN-SWA MIELVALEEAKMHLRIDEDYSDADLTLKIQAGSAALLAYLKDNRKLVVSDDGKLIDGEPLYRVKTALLVLLGYLDRNRGGEEEDKLKQGELPFSVSMLIYDLHCPTIM >LS483492|2997318:3038806|3037166_3037511_+|SQJ22236.1|DBSCAN-SWA MAKTAAERKAEQRERQKNDGVAKFELKLDRQELEMLRENCAARRPQREPYDMDEYITMLIRKDNAELKKQMAALSERCCGKCKDKLPGDPAGCYFLGDSECWQTYGWHDLKITV >LS483492|2997318:3038806|3027597_3028497_-|SQJ22211.1|DBSCAN-SWA MSMNLMAQAMSIKVGNPLRKLVLIKMADNANDEGECWPSYQHIADHCECSKSAVRAHITALIKMGLISKENRLGANNGKGNTSNIYRMTLGTPVSPKSIAPVPSKSTPPMPPKSTGVSSESTAPVSPAGTPCATRWHQNQSLEPKDKKPSCQDASQPDEKTLSPEEKFLAAHPEAVVWDAKKRQWGGEDDLKCAEWIWGRIIGMYEQAAEYDGEISRPKEPNWVAWSNEVRLMCAQDGRSHKQICQLFGRANRDKFWCRNVLSPSKLRERWDDLVIKLAPAVAAASEEWNTAEAWSETL >LS483492|2997318:3038806|3020496_3020607_-|SQJ21913.1|DBSCAN-SWA 
MSSIDDTIGMGYDEQVNHTKKKEYSVLVKYTGFSIR >LS483492|2997318:3038806|3012337_3013996_-|SQJ21766.1|terminase|DBSCAN-SWA MAEWTTACLDWEKRLVDRTSIIPPPIFADQAEHALSIFKELRVSDFPGKPTFGECSEEWVFDFVKAIFGGYEAETGRQLIREYGLLISKKNTKSTIAAGIMLTALILCWREDEEHLILAPTKEVADNSFKPAAGMIRADEELSDMFQIQDHIRTITHRVSRNTLKVVAADTDTVSGKKSGRILVDELWVFGKRQNAEAMFMEALGGQVSRNEGWVIFLTTQSDEPPAGVFKERLDYWRDVRDGKIKDPKTLGILYEFPERMVENKDYLLPENFYITNPNIGRSVSAEWIEDNLRKNQTKTDGTLQQFLAKHLNIEIGLNLRTDRWAGVDVWEQQARLVTLDDIITRSEVVTVGIDGGGLDDLLGFAVIGRDKETREWLCWCHAWAHKIVFERRKSEASRLHDLKKAGDLTIVEFIGQDTDELASIVARIHEAGLLDKVGVDPSGVGSILDALTVEEIPADDVVGISQGWRLGGAIKTTERKLAEGVLIHGGQPLMTWCVSNARVEPKGNAILITKQASGKGKIDPLMAMFNAVSLMALNPEAKKRDYQVFFI >LS483492|2997318:3038806|3008773_3009529_-|SQJ21761.1|portal|DBSCAN-SWA MQGHHILENSTFFFKNGGKPSGVIEVPGTLTDDNAKKIKQNWDTGYTGENAGKTGILSNGAKYNPIAISAADAQMVEQLKMAAEIICSTFHVPAYKVGVGEPPSYNNIEALEQQYYSQCLQILIESIEVLLDEAFELSGDEETEFDVNALLRMDTERRIKTLGEAVKNTIMSPNEARKKENLPPVPGGDSLYLQQQNFSLEALARRDASENPFDSPSKQSSSEPPADDGKSLSPSEMVAAKAMIRGIMKYE >LS483492|2997318:3038806|3036451_3036985_+|SQJ22235.1|DBSCAN-SWA MTITTEQLKERAKFWREKAAAANTSSRQSVAMLYAEVFEELLHYRDAAREAVPDISELEATLAWILELPVPTYRAAHFAKRLSAVIDTCRAAMLAAAPAAPAAPAAPDDLIMQVRRLVHALKKSNPDHVLVKQVPDYMRRAGYWKLTDCLRGAAAAPDHSDDVLDMVSDDSSREVKP >LS483492|2997318:3038806|3034410_3034617_+|SQJ22230.1|DBSCAN-SWA MTELKPCPFCGATDPEITADFYDCFFVVCGCGSRGPDSKEPGDNLSAMNNAATAWNERVEHSQGVERG >LS483492|2997318:3038806|3028870_3029395_-|SQJ22215.1|DBSCAN-SWA MDNKDFPTQSDITDVMHRLITSFAGGYEAMAQELAHDGTHNALRNRVRQVGGQMVPMGMAIQMEQISERTDITEAMCKRAGGVFVKLPEVGAVDNEELLFKFNELMAALGVFAKAHNDFTADGVLDGDESKKLKAKGYRIQTLVAEIYAVTVMMFGKGDAQDMRSRASAHQFNV >LS483492|2997318:3038806|3010335_3012279_-|SQJ21764.1|capsid|DBSCAN-SWA 
MTLNRACTLMKVKAINEDERVITGIASTPSPDRDGDIMEPDGAKFRSDTPFLWQHDRSQPIGSCTPRMVKEGLEITAKLVKPTPDMPSLLIARLDEAWASIKSGLVKGLSIGFRPIEYAFIDDGGIRFLSWDLLEVSAVTIPANAECSIQTVKSYDRQLLAASGKEKPVVKANTTAGATAKKSTEIKGKTMNIAEQIKSFENKRAALAASLEDIMSKAAEEGRTLDAEEEEQYDNTSAEVKSVDAHLKRLNDMEASQASTAKPVNTKAAASGTVNTVITNAPGIIRVEQKLEKGIAFARFTKSLAAAKGVRSEAFSIAKSKYPDDTKLHHVLKAAVNAGTTTDPTWAGALVEYQDIANDFVEFLRPQTIIGKFGTGNIPSLRDVPFNVRIPVQTSGGAAQWVGQGKAKPLTKFDFSNITFGFSKVAAISVITEELLRFSNPKADVLVRNSLAESVIARLDTDFVDPAKDEVVGVSPASITNGAIAIPSSGDPDTDSSAAFQVFIDANLQPTGAVWLMSSSTALALSKRKNALGQKEYPEMTMFGGVFEGLPAVVSQYVGNQLILMNAPDVYLADEGGVAVDMSSEASLEMESSPSSDSVTPTGVELVSMWQTNSVAIRAERWINWKRRRSAAVAVISGVNYGSSQSS >LS483492|2997318:3038806|3017030_3017240_-|SQJ21796.1|DBSCAN-SWA MYTVKIVTANQNEIIHAAKSLEWKATEKTIYADEHTGESLTIALLPGDTAYVVNGNNRTVATYTNPADH >LS483492|2997318:3038806|3032565_3033096_+|SQJ22224.1|DBSCAN-SWA MSWITTFTGQHFNFAAPDVESICIEDIAQALSHECRFAGHLPNFYSVAQHSVLCSQIVEPEFALEALLHDATEAYCKDIPAPLKRMLPDYQAVECHIDMVIRERFGLPLVMDISVKYADLVMLATKRRDLDIDDGKEWPMLAGIEPAEMLITPVMPVQARAMFIERYNELTEWGVL >LS483492|2997318:3038806|3030002_3030371_+|SQJ22218.1|DBSCAN-SWA MIDMVEYRGGWLKIYSDDPNAYGLRVRGDSMWPRIQSGEFVLIEPGSTVHPGDEVFVRTSDGHNMIKVLNYTREGEYQFNSINQDHRPITMARAEVQKVQYVAGILKASRHVDTDAVTDSIP >LS483492|2997318:3038806|3021573_3022827_-|SQJ22049.1|DBSCAN-SWA MEKSIHKKIDFLTRFHVKSPQIMRAFFILFCEVSGGKSEVTPHPPVAHVFGHKQPLPVLLTAGRAYQLRALMISNTRLINADTTAFIKTLPDNSIDLIATDPPYYRVKSCGWDNQWKSEAEYIAWLDSLLVEFYRVLKPNGSLYMFCGSRLAPDTELLVRQRFDVLSHIIWAKPAGPWKRMNKESLRSYFPSTERIIFAGHYAGPLQPKVDGFAAKCGELKHNVFKPLIDYFRTARQSLGVSAKEINAATKTQMCSHWFSESQWQLPGEKQYQALQALFDRIAKERHQAGGLNRPHHELVREYRTLNREYLALCQEYRSLRRPFAVTAAVPYTDVWHYPPVAFYPGKHPCEKPAEMMEHIISASSRPGDVVADFFMGSGATIKAAIKLGRIGLGVELEEERFKQTEREIHPETTRPS >LS483492|2997318:3038806|3020913_3021132_+|SQJ22047.1|DBSCAN-SWA MNLDDLTDEQRQQLDSIKSIVGHCVVMLLAQRKTINTANLLSQIAGEMEKIPERSEFNKYRATLEFVGTVSQ >LS483492|2997318:3038806|3037795_3038806_+|SQJ22238.1|DBSCAN-SWA 
MARARKNKEDAWMPPRVYLGRSAYEYHPKGGGNIRLCDRTCTQAQVWAAWEALITEKPNGSTLAGLIDKFFKSGDFFELAAETQKDYRKYSKKVTDVFGAMPPDNIKPEHVRKYLDKRGTQSRTQANREKAFMSRVYRWAYERGYVKGNPTKGVKQFKETGRDRYITNEEYDALYGVAPDVVRVAMELAYLCCARQNDVLEMKKSQLIAEGILIKQSKTAVAQIKAWSDRLTAAIDLAKALPLNAGMSSLYVLHQASGHKYTRDGFNSRWRKAKEEARLKYPHLCFDFTFHDLKAKGISDLQGNIYEKQAISGHKNVEQTARYNRKIAIVPVVGGQ >LS483492|2997318:3038806|3026567_3026915_-|SQJ22207.1|DBSCAN-SWA MKKLTIDVDVFESERINSGIRKLVLAGLLKDNPENQMGRVIQAAAGAKWMTLRELERTVFMMFFVADTQAAISARLREVHPKVHGLVKERRTVKDPDTGKLVYFYRLVAAEELVA >LS483492|2997318:3038806|3035639_3036188_+|SQJ22233.1|DBSCAN-SWA MTCEMVTVMIDKYRLDNLRLLSDEKGELARWVIQLQADLDTERRKKASLDSVIAKLEQTLEREREKSRRVMSENQQQAQRIAELEAAEKVWESAAQKHIARAEAAEQRIAELEEKYSEVSGWYVKMKTRMLSAEKRLATPVRLPQLCGDGDNDTEYTEGLNDGITQSAIYLRRQGFKVEGDV >LS483492|2997318:3038806|3019872_3020109_-|SQJ21912.1|DBSCAN-SWA MPDKIATAVGYCTSGGLICWGGIARWLHDLDWNLIAVVGGFVIGLATFAVNVYFKYRQTKAYEAALKRGYVTAPPQKD |
48 | Salmonella_phage(25.71%) | terminase,tail,capsid,head,portal | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', and 'tRNA').
|
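The keyword note above can be illustrated with a minimal sketch that splits one of the protein FASTA headers listed for DBSCAN-SWA_1 and flags phage-related entries. The field layout (contig, prophage region, gene coordinates with strand, protein ID, optional keyword field, tool name) is inferred from the headers shown on this page and is an assumption, not a documented format. The names `ProteinHeader`, `parse_header`, and `is_phage_related` are hypothetical helpers introduced only for illustration.

```python
from dataclasses import dataclass
from typing import List

# Keywords copied from the note on this page; matching them against the
# optional header field is an assumption about how the coloring is derived.
PHAGE_KEYWORDS = {
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
}


@dataclass
class ProteinHeader:
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    protein_id: str
    keywords: List[str]
    tool: str


def parse_header(header: str) -> ProteinHeader:
    """Split a header such as
    '>LS483492|2997318:3038806|3009473_3010127_-|SQJ21763.1|portal|DBSCAN-SWA'
    into its pipe-separated fields (assumed layout)."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) < 5:
        raise ValueError(f"unexpected header layout: {header!r}")
    contig, region, gene, protein_id = fields[:4]
    tool = fields[-1]
    # Anything between the protein ID and the tool name is treated as a
    # comma-separated keyword annotation; it is absent for most proteins.
    keywords = [k for part in fields[4:-1] for k in part.split(",") if k]
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProteinHeader(contig, region_start, region_end,
                         int(gene_start), int(gene_end), strand,
                         protein_id, keywords, tool)


def is_phage_related(record: ProteinHeader) -> bool:
    """True when at least one annotation keyword is in the phage keyword set."""
    return any(k in PHAGE_KEYWORDS for k in record.keywords)
```

For example, `parse_header(">LS483492|2997318:3038806|3007395_3007722_-|SQJ21757.1|head,tail|DBSCAN-SWA")` yields a record whose `keywords` are `["head", "tail"]`, so `is_phage_related` returns `True`. A header without a keyword field, such as the one for SQJ21736.1, gives an empty keyword list.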
DBSCAN-SWA_2 |
4016010 : 4033091
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >LS483492|4016010:4033091|DBSCAN-SWA GCTACTCTCCAGTGCGGGCCATCAGCGCGGCGCCCAGTTGCTGATGCATATCACGCAGCAGCGTTTCCGTGCTTTCCCAGTCAATGCAGGCATCGGTCACGGACACGCCGTAGCGCATCTCCGCGCGCGGTTGTTCGGAGGACTGATTGCCTTCATGCAGGTTGCTCTCCAGCATAATGCCGGTAATTGAGCGGTTACCCGCGTTAATCTGCTCAACCACGGAAGCGGCAACCGCAGGCTGACGACGGTAATCTTTATTTGAGTTGCCGTGACTGCAATCTATCATCAGGGACGGGCGGAGTCCCGCATCGCGCATCTGTTTTTCACAGGCGGCGACGTCCTCCGGGCCGTAGTTCGGCGTTTTACCGCCGCGCAAAATCACATGGCCGTCCGGATTACCCTGCGTCTGCAGCAGGCACACCTGTCCGGCCTGGTTGATGCCGACAAAGCGGTGCGGCATTGCCGCAGCGCGCATGGCGTTAATGGCCGTACTCAGGCTGCCGTCGGTGCCGTTCTTAAAGCCGACCGGCATCGACAGGCCAGAGGCCATTTCACGGTGGGTCTGAGATTCGGTGGTGCGTGCGCCAATCGCCGACCAGCTGAACAGATCGCCCAGGTATTGCGGGCTATTCGGATCCAGCGCTTCGGTCGCCAGCGGTAATCCCATTTCCACCAGATCCAACAGCAGACGACGTGCAATGTGCAAACCGGCTTCAACGTCAAACGAACCGTTCATATGGGGATCGTTGATCAGGCCTTTCCAGCCGACGGTGGTTCTCGGTTTCTCAAAATAGACGCGCATAACGATATACAGCCGATCGCTCAAGTCGGCCGCCAGTGTTTGCAAACGACGGGCGTAGTCCAGCGCCGCATCAGGATCATGGATAGAACACGGCCCGCAGACCACCAGCAAACGGGGGTCGCGCCCCTGCAGAATATCGGCGATGGTTTTGCGCGCCGTGGCGATTTGGGCCTCATCCGCCACGCTAAGCGGGAATTGATTCTTCAGTTCTTCCGGCGTAATCAAAACCTGTTCTGCACTGATGTGCACGTTGTTGAGCGCGTCTTTTTGCATGGTGATATGTCCTGTCGTTTGCCGTTATGCATAAATGAAGTAAAACATTGAGCGTTTCGAGGACTACCTTACCACGCATTGTAAAGTTTTCAATCCATTGCGTGTAAATAAAAAATTACCACCCATATTGGCGCGAGGTAATCAATAAAAAATACATAAATAACAACAAAATAAAAAATAAAAACAAAACAACAAATGACATAGGACAAGAATAAAAACACCATAATGTAAATAAAAAATTACACCTTTTCTAAGACCGAGCGCTCATACGGCACTTAAATGCCATAGCCCCGCCACGCCATTTCAGACAATGTAAAAACACTGGCGCCACAAACAACAACCCGTTGATTTTCAATAATTACACATTATTACACTGATAAAAATCGCGCTATTCCCTTATAAATATTGAGCTATCTCAGGGTTCTGTCTGCGCAATTCATGAATGATGGCAGAAACGGCCAATACCTCTTTCCGGTTAGCTTGCGGTCAACGCTAACGTCGGCGGCAAAGGTAATAAGCCGGGCCTCTCTTTTTCAGCGAAGGAACCCGCATGATAACTCTGGAATTCGGCATCATCATTGCTTGTTTATTGGTCGGCACGCGCTATGGCGGTATGGGGCTGGGATTAATCAGCGGTGTCGGTATCTTTGTGCTGTGCTTTGTCTTTGGGCTGCAGCCGGGCAAGCCGCCTATTGATGTGATGCTGACCATTCTGGCGGTAATCGGCTGTGCCTCGGTACTGCAAACCGCCGGAGGACTGAACGTATTGATGCAATATGCCGAACGTCTTCTGCGCCGGCACCCTCAGCATATTACGCTACTCGCGCCGTTCACCACCTGGATGCTGACAT
TTCTGTGCGGTACCGGCCACGTGGTATACACCATGTTCCCGATCATCGGCGATATCGCGCTGAAAAAAGGCATCCGCCCGGAACGCCCGATGGCCGTGGCCTCGGTCGCCTCACAAATGGCCATCACCGCCTCGCCGGTTTCCGTCGCCGTCGTGTCTCTGGTATCGATTATCGCCGCCGGGCACGGTATCGGCCGCGCCTACTCCCTGGTGGAAATCCTGGCGGTTTCCATTCCCTCGTCTTTGGTTGGCGTGCTGATCGCGGCGCTCTGGAGCCTGCGCCGCGGTAAAGATCTGGAGCACGACGCCGATTTTCAGGCGCGTATCAGCGATCCTGAGCAGCGCGCCTATATTTACGGCGCCGGCGAAACCCTGCTGGATCAGCGTTTTGCCAAATCTGCCTATGCGTCAACCCTGATCTTTTTTGCCGCAATCGCCGTCGTGGTCTTGCTCGGCGCCTTCTCTGAACTGCGTCCGGCCTTCGCCGTCAAAGGCGTGATGAAGCCGCTGTCGATGAACCTGGTGATTCAAATGATGATGCTGATTGCCGGCGCCGTCATGCTGATCGGCTGTAAGGTAAAGCCGGGTGATATCGCCAACGGCGCGGTATTCAAAGCGGGTATGGTCGCCATTTTCTCGGTATTCGGCGTCGCCTGGATGAGCGACACCTTCTTCCAGGCGCATATGGGAGATTTAAAACTGCTGCTTGAAGACGTGGTTAAAAGCCAGCCCTGGACTTACGCCATTGTGCTGTTTCTGGTATCGAAATTGGTTAACAGCCAGGCCGCCGCGCTGACCGCCGTTGCACCGATGGCGCTGCAGTTGGGCGTCGAGCCGAAAATGCTGCTGGCGTTTTTCCCGGCGGCCTATGGTTACTTCGTATTGCCGACCTACCCCAGCGATCTGGCCTGTATCGGCTTCGACCGCTCAGGGACGACGCGCATCGGGAGATTTATCATCAACCACAGTTTTATCCTGCCGGGGCTGATTGGCGTTTCATCCGCCTGCGTAACCGGCTACCTGCTGGTTACGACCCTGCTGTAAGAGGATGCTGCGGCCCGACACGGGCCGCTTACGCCTGCGGCGTCAAGGTCAGCCACACCATACCCAGCGCTTTTACGGCATCGGTACTGCACTGAAAATCTGTCCCGTTGCCGCTGACCTGCAGCTGATTGCCCGGCAAACGGGCAATATCATATACGTCCAGATTGCCGTCGATATCGAGCAGCCAGCGTCCGTTGGCAATGTTCGTTACCGACAAATCCACCAGCCAGGCTACACGCCCACCGTCCACGTATGCGGGCTGCTCGGCCGTTGATGGAATCAACGTTCGATCGGCCAGCCATTCACCGCCATCTTTCAGCTGGCCGGCAATCAGTCGATACAGCGGTACAGCACATGCCGGCGGTACCATATCCTGCGGATCGGACGCGCGCATTTCCCCCTGCCCGGTAGCCAGCCACTGCAGAGAAACGCCGGTATCCAGCGCGCAGGTCACCACAATATCCCCCGGGAAGTAATTACGGCGCACCCAGGTGCTCATGGTGCCGGATGAAATACCCAGTAAATCCCCCAGCTCCTTCTGCAAGCTGAAACCATAGGCATCCAGAATCCTACGCAGCACGCCCTTGCCGCCGGACGCCAGTACGCGCTCATACAACGCCTTACCATGCGCCGCAGCCCGCCCATTCACAGTATCTTCAGCCATATTTTCCCCGGTCATCATTAAAACTTGCAGAAGCAAGCCGTACTTTTCGACTTAACGCACATAGGATTCGTTATTAACGCCAGGCTCGCCTGCTATTTAACCTATGGTTAAATGATGCAATATAGTCGCCGAAAATACAAATGCCGATACCTGCCACAAACATTGTTCAGCACTGAAAAATCACGTAATTCATCTACTTAACAATGATAATTAAGGGAAAAATACAAAACTTGAATTTGCGAGAAAAATAAAAAACTTATGACATTTCCCCCTTCCCAGACGCACAAGGCATA
GCAGCAAACAACGCCGCCCTCTACGGGCCGAGCCGACGTTATCCTGCCAGGCTTGTCCGTTTGTGCCGCTCGGCCATACCAATAAATGGCGTCGAGAACCATTGACCCATTTAGCGTTTTGCTTAATACTGTATAAATACACAGTATATTTAAGCGAGGGCAAACACGGTGGAAATTACGGATAAACAACAACTAACACTGTCACGCATCCAGTTCATCGCCGATGTATCACAGGCAGCCCAGTGCAGTCCGTCCGAATACCTGATCGCACTGTCACTGATTTCCGATCTGGCCGCTCAGGAACTCCCCGAGCACGACTACCAAGCCGTCTACTATCCGGCAGACCAGCAACACTCCCGCTAGCACCGCCCCCCGCTTCTTAATACCGATTTTTACCCGCCATCCGGCGGGTATTTTTTTATCCGCCTTCCGGGGATTTATCCCGATCCGTTGTGCCACAACGACCACAACCCTATTGCATTGTGGCGGCAGGCCGTCATCAGGAAACTACTGCTAACCGGGCGCACTCCCCGCGTCCTCAACCAACAACCGCATCGCAGGATGGTAATGATGAAAATTTATGCACAACAAGGCGACACCATTGACGCGATGTGCTGGCGCTATTACGGCAGCACAGAAAACGTGGTGGAACAGGTTTATCGCGCCAATCGCGGGCTGGCGGCATACGGAGCGCTGCTGCCGCACGGCTGCCCGGTAGAAATGCCCGAACTCAACGCCGCCGCCAGGCGGGAAACGGTGAAACTGTGGGACTAAACGATGGAAAAATTTACCTCAACACTTTCCTACCTTATCGCCGCCTGCCTGGCCTGGTTCGGCCGTCACTCGACGGAAGACATCGCCATGATGGTCGGCGCTGCGGTAGGCGTAGGCACCTTTGCGGTGAACTGGTACTACCGCCATAAGAGCTATCAGTTACTGAAATCGCTCAGAAAGAAAATTCTCAAACGGGGGAAATACGATGAACCCACTCCTTAAACGCTGCAGCGCCGTCGCCATTTTGGCGCTGGCCGCTACTCTGCCGCAGTTTGGCCAACTGCATACCTCCGAACGGGGACTGCGCCTGATCGCCGACTTCGAAGGCTGCCAACTGTCCCCCTACCAGTGCAGCGCCAACGTCTGGACCAACGGCATCGGCCATACCGCGGGCGTGAAACCCAATACGGCGATCACCGAGCGTCAGGCGGCAGCCAACCTGCTGGACGACGTACGCAATGTGGAAAAAGGCATCGCGCGCTGTATGAGTGTGGATATGCCGCAGCCGGTTTACGACGCGGTCAGCGCTTTTGCCTTTAACGTCGGCGTCGGCGCCGCCTGCCGCTCTACGTTGGCCGGTTTCATTAAACAACGTCGCTGGCAACAAGCCTGCGATCAACTGCCGCGCTGGGTATACGTCAACGGCGTAAAGAGCAAAGGGCTGGAGCGTCGTCGTCAGGCCGAACGTGCGCTGTGTTTGCAGGGGGTGTCGCCATGAATCGCTGGCTGCTCAGCCTTGGCGCGCTGCTATTGACCCTGCTCGCGGCGTCGCAGTGGCAGAACCAGCGTCTGCAACAAACGCTGGCGCATCAACAGCAACGCCTTGGCGAGGCGCAGGCCACGCTAACGACACGCGACGCCCAGATCAACCAACTGAAACAACAGCGCCAACGGCGTGAACGCGCTGAACTGGCGCTGCGTCAGGCGCTGAGCAGCGCGCAGGATTTAACGCTGCAGCGAGAAAAACGATTACAGGAGTTACTCAATGAAAACAAGAAACTGCGCGACTGGTATCGCACTGGGTTGCCTGATGATGTTATCCGGCTGCAGCAGCGCCCCGCCTTCGCCAGGCCCGGAGATTACCTACGCTGGCTGTCCGAAAGTCAGCAGTTGCCCCATTCCCGGTAATCACCTCGCCACCAACGGCGATCTCAGCGCCGACATCCGCCAGCTTGAGCATGCGCTCATACAGTGCGCATTACAGATAGAAGCGATTAAACAGTGTC
AGGAGGAACACGATGTTAAAACCCGACCAGCTGCGTGACGCCTTATTCAAGGCGTTACCGGACCTACAGGCCCAGCCGGAAAAACTGATGATGAAACTGACGGACGGCCGCGTCGTCGCCACGCCAGGCCCATCGTTGTCATTTGAATACCGCTACTCGTTGAGGTTAACGCTGGCCGATCGCGCGGCCGACTGCGAGCTGGTGATGGTCACCCTATTGGCATGGTTGCGCAGCCACCAGCCGGATCTGCTGGCTAACGTGGAAAAACGCAACGGTGATTTTGCTTTTGCCCTGCACGACGAAGCGCACGGCCAGCTCGACATTCAGTTGCAGCTCACCGAACGGATCTTGGTGGAACAGCGTGAGGAGACGTTGCATATCACGCCGCTGCCGGAACCGCCGGAGCCGCAAGACGTTTTGCGTTTTACCGACGTCTACCTGCATGGGGAGTTGATCAGCCACTGGCAACGCTCTTAACCCCGGGCAGGCGGCGGGTTTTGTTGTGCTGTCAGCCGTCAAACCCCGCCGCGTTGCCGCCCTTTTCTCTTACCGGCATGGTAACCACATGAACACATATAACCCTGATATTCAGCGCCTGGTGCGCAACCTAATCCGCATTGGCACCGTCAGCGACATCGATCTCGAACGCGGACGCTGCCGCGTGAGGACCGGCGGCAATCATACCGACTGGCTGTGTTGGATGACCAGCCGTGCCGGCAGCGCCCGCAGCTGGTGGGCGCCGAGCCTTGGCGAACAGGTGCTGGTGCTGTCGCTGGGCGGTGAGTTGGATACCGCTTTCGTGCTGCCCGGCATTTTTTCCAACGCCTTCCCGCCGCCGTCGCATTCGGCCGAGGCGCTGCATATTACCTTTGCCGACGGCGCAGTCATTGAGTACGAGCCAGCGCAAGGCGCTTTGAAAGCCAGCGGCATGAAAAGCGCCACGCTGGAGGCGACGGAGACGGCGACGGTCAGCGCCGCCAATATCGTCTGCCGCGCCAGCAGCAAAATCACCCTGGATGCCCCGGAAGTGGAGTGCACCCAGCATCTGGTTACCGGTTCACTCGCCGTCCGTCAGGGCGGTTCAATGACCGGTGATGTCACCCACTCCGGCGGCAGCATCACGTCTAATGGCATTGTGGTGCATACCCACACCCACGGCGGCGTACAAAACGGCGGAGGCCAAACGGATAAACCCTTATGAACAACGCAAAATATATTGGCATGGGCCGCAACTCCGGCCGCGCCATTAACGATATCGAACATATTCGCCAGTCGGTGAGCGACATTCTGGTCACCCCGATAGGTTCACGCGTGATGCGTCGCCAGTACGGATCGCTGCTTTCCGCACTGCTGGATCAACCGCAAAACGATGCGCTGCGCCTGCAAATCATGGCCGCCTGTTACAGCGCCCTGCTGCAGTGGGAACCAAGGATCCGTCTGACCGAGATCGCGGTCAACACCACCTTCGACGGAAAAATGGTGGTCGACCTGACCGGCAGTCGTACCGACACCCCGGATAATTTTTCCCTTTCAGTTTCAGTGAGCTAACGCTATGCCTACGATTGATTTGAGTTTATTGCCCGCCCCGACGGTGGTGGAACCGTTAAATTACGAAAGTTTACTGGCCGAACGTAAGGCCAGGCTGATCGCCCTTTACCCCGAAGATCAACGCGACGCCATTGCCCGCACCCTGGCGCTGGAATCGGAACCGCTGGTCAAACTGCTGCAGGAAAACGCCTACCGCGAGATGATTTTGCGCCAACGCGTAAATGAAGCGGCACAGGCGGTAATGCTCGGCTACGCCACCGGCAGCGATCTGGATCAGATCGGCGCCAATTTCCAGGTTGAGCGCTTAGTGGTGCAAAAACCCGATAACAGCGTCGTGCCGCCGGTGCCGGCGATTATGGAGTCCGACAGCGATTTCCGCGTTCGCATCCAGCAGGCATTTGAAGGATTGAGCGTGGCGGGATCCAGCGGATCCTATGAATATCACGGCCGCT
CGGCGGACGGGCGCATCGCCGACGTTTCCGCCACCAGCCCCAGCCCGGCAACCGTCGTTATCGCCGTGTTGTCCCGTGAAGGCGACGGCACCGCCAGCGACGATCTGCTGGCCGTTGTCGAACGCGCGCTGAACGACGAAGACGTGCGACCGGTCGCCGACCGCGTGACGGTGTGCTCCGCCGATATTGTCAGTTACGCCATCGACGCGACGCTATACCTGTATCCCGGCCCGGAAGCCGAGCCGATTCGTCAGGCGGCCGAAGCCAAGCTCAAAAAATACATCAGCACCCAACACCGTCTGGGACGCGATATCCGCCTGTCGGCCATTTACGCCGCGCTGCATGCCGAAGGGGTACAGCGCGTCGAGTTGGCCAGCCCGAAAAACGACATCGTGCTGGACAAAACTCAGGCGTCTTACTGCAACAGCTACGCGCTGAAAATCGGGGGGTCGGATGAATAATTACTTTTTCCACCTCTATATCACCTACTGACACGCCCCTTTGGGGCGTTTTTTTATCTTATTTTTCCTGATGTTGTGTCAGATTTAGTACATACCCAATGACGTGCACGCCCGCCTTGAGAAGCGCATCCTACTCCTACCAACCAACAAACGGAGTAATACTATGGGTGACTATCACCACGGCGTACGTGTTCTTGAAATCAATGAAGGCACTCGCGTAATTTCCACCGTCTCAACGGCGGTCATCGGCATGGTTTGTACCGCAGAAGATGCCGATACCAGCGTTTTTCCACTTAACCAACCGGTGCTGATCACCGACATACTGGCCGCCAGCGGCAAAGCCGGTAAAAAAGGCACGCTGTCCGCCGCGCTGCTGGCCATCGCCGAACAGGCCAAACCGGTCACCGTGGTCGTACGCGTCGCCGAAGGTCAGGACGAAGCGGAAACCACCACCAATATCATCGGCGGTGCCGATGAGAGCGGCAAATACACCGGCATGAAAGCCCTGTTGGCAGCACAGGCCGAACTGGGCGTAAAACCACGCATTCTCGGCGTACCCGGCCTCGATAACCTGCAGGTAGCAACGGCGCTGGCAACCATTTGCCAGCAGCTGCGCGCCTTCGGCTACATCAGCGCTTACGGCTGCAAAACCGTTACGGAAGCCATTGCCTACCGTAAAAACTTCAGCCAGCGTGAACTGATGCTGATTTGGCCTGACTTCGTCAGTTGGAACACCGACACCAACAGCAGCGATATTGCCTACGCGACCGCCCGTGCGCTCGGCCTGCGCGCCAAAATCGACCAGGAGACCGGCTGGCATAAGTCGCTGTCCAACGTCGGCGTTAACGGCGTCAGCGGTATTTCCGCCAGCGTATTCTGGGATCTGCAAACTGCCGGCACCGATGCCGACCTGTTGAATGAAGCCTGCGTGACCACGCTGATCCGCAAAGATGGCTTCAAATTCTGGGGTTCGCGCACCTGTGCCGACGATCCGCTCTTCCAGTTTGAAAACTACACCCGCACCGCGCAGGTTCTGGCGGACACCATGGCCGAAGCGCATCTGTGGGCGGTGGATCGCCCCCTGACGCCAACGCTGATCCGCGACATGATCGACGGCATCAAAGCCAAATTCCGCGAACTGAAATCCGCCGGTCTGATTATCGACGGCGACTGCTGGTACGACGAAAGCGCCAACGATCAGGAAACCCTGAAAGCCGGCAAGCTGTTTATCGATTATGACTACACCCCGGTGCCGCCGCTGGAAGACCTCACCCTGCGTCAACGCATTACCGACCGCTACCTGGCGAACTTCGCCGCGTCCGTAAACGGCTAAGGAGAACAACAAAATGGCATTACCAAAGAAATTGAAATACCTGAACCTGTTTAACGACGGCTTCAACTATATGGGCGTGGTTTCCTCGCTGACCCTGCCAAAACTGACCCGCAAGCTGGAGAAATACCGCGGCGGCGGCATGAACGGCGCGGCATCGGTCGACTTTGGCCTGGACGACGATGCGCTGGTGGTCGAATGGACTATGG
GCGGTATCGATGAACTGGTGCTGAAGCAATGGGGTCAGGTCGATGCAGTTCCGCTGCGTTTCACCGGTTCTTTCCAACGCGATGACAGCGGCGACGTTTCCGCGCTGGAAGTGGTGATGCGCGGCCGCCACAAAGAGATCGACAGCGGCGATTTCAAGCAAGGTGAAGACACTGAAACCAAGGTTTCGACCGACTGCACCTACTTCAAGCTGAGCATCGACGGCAAAGAACTGATTGAGATCGATACCGTCAACATGATCGAGAAAGTGGACGGCGTCGACCTGCTGGCCGCACACCGCAAGGCCATCGGCCTGTAATCGTTATACCCACGGCCAGCCGTACGCTGGCCGCTTTACCCCACAGCAAACACAGTTAAGGAAAAAACAATGGAAATGATCACCTCTCAGGATACCAGCGTCGTACTCGATACCCCAATCAAACGCGGCGACAGCGAAATTAGCGAAGTCATGGTCACCAAGCCCAACGCCGGCAGCCTGCGCGGCATCGGCCTGGCTGCGCTGGCCAATGCCGACGTGGATGCGCTGATCACCATTTTACCGCGCGTCACCTATCCGAACCTGACCAAAGAAGAGTGCGCCCGTCTGGAACTGCCGGATCTGATCGCGCTGGCAGGCATGGTGATCGGTTTTTTAGCGCCGAAGTCGGCGGAGTAAGCATCGACCCGGCGCTGACCGTGGACGATCTGATGGCGGATATCGCAGTGATTTTTCACTGGCCGCCATCAGAGATGAACGGTATGGGGCTGGGGGAGTTGATCGACTGGCGTCGCCGAGCCCTTCAACGTAGTGGAGCAGAGACCAATGAGTAATCAACAGGCAGTCAACGATAGCACCATCAAGACCGCCAATCGTAATGTGATCCTGTCGACCAAACAGGTCCACTATGCGCAACGACAGCTCGCACAGCTGAAAAAGTTGGCGGCACTGAAGCGCCAGACTGACGATATTCAGCAACAGTTAAATAACAACCGGGCGGTGCGTACGTCGCTGCCCGCCGCGATTTCCTCTATCGAGGATATTCAAACCGTCCAACAATTAAACTTCGACGACGTGCTCCAACGGGCACACCGTAAACTGCTCGCCTTCCAAAAGGGGCAGTTAAGTAAAAAGCTCGAAAAAAAAGGTTTCAACACCGCACGGCTGGACGAGGCGCAGCAACAGCAAAGCCAGCATCTGAATAATGCGCAGGCATCTCTGGGCGCCAACAAAGCCGAACAAAGCCGCCTGCGCCAGCAACATCGTGCGCAAATTTTTGCCAATCACGACCGTCGTCAGGGCCGTATTGATGCGCTGGGCAAATTTACCGAACCTGCCGCCGCAATCAGCAATCAGCTGTTTGCCGGCGGTAAGCAGTTGTTCACGCCCGGCGCGCAATTCGAGCAGCAGCTTTCCGCTCTGCAGGCACGGTTGGGATTAAAGCAAAACGATCCTCACTTGCTGGCACTACAGCAGCAGGCCAGCCAACGACAAAACGGCGCGAATACGCCAGCCGATATACAGCAGGCGCAGGCGGCGCTGGCAAACAGCGGTTATGACGCCGAAGCCGTGCTGGCCGCCACGCCGGCCGCGTTGAATTTGGCCAGGGCGAGCGGCAGCAGCATTAAAGATGCGGTAAAAGCACTCACCGGCGTGCAACAGGCCTTTAAGCTGCCCGTCGATCAGGCCAACAACATCGCCGACGTCATGGCCAAAGCCAGTAACAGTTATGCACTTCCCCTGACGGATCTGAACAAACAGCTGTTATCGGCGGCGCCCGACGCGCTCAGCCGAGGGACTGGCCTGGAGCAAACGGCGGCGCAGCTGGCTCCGGCCGACAAACAGCTGGGCAACGTCGCGGGCGCCGCTCAGGCCATCGTGACGGTACGTGGCGACAACCTGGACGGCGATATCCAAAAGCTGTTCGCCAGCTGGGACCGCATTCGTATCAACCTTTTTGACGGACAGAACACCGCGCTGCGGGCACTGACGCAAACGGCGACC
AAGTGGCTGGATACGCTACAGCAGTGGGTTACCAATAATCCGGATCTGACCGCAACGCTGATAACCGTTGCCGGCGCCGTATCCGTGCTGCTAGGCGGCCTGGGTTCCATCGGCACCTTTATTGTTCCGGCCCTCGGCGCCATCAATATGCTGATGAGCGGCGCCGCACTGCTCGGCGGTATTTTCACCGGTGTCGGCAGCGCCATCGCCGCCGCATTCGCCGCGCTTGGCCTCCCAATTATCGCTGTCATCGCCGCCGTCGTCGCGGGCGCAGCGGTAATTTATAAATATTGGCAGCCGATTTCTGCATTTATCGGCGGCATTATCGACGGCGTTCTTGAGGCGCTGGCGCCGTTTAAAACCCTCTTCGAGCCTATCGGTCAGGTGTTCAGCTTCGTGGTTGGCAAAATTAAGGAGTTCCTTACTCCGGTAAAAATGACGCAAGAAGGTTTAGATAAAGTTTCAGCGGCGGGAAAAAAGGTAGGAGAAGTCCTGACCATAACGCTAAGGAGCTCAATTTGGGTCTTGCAACAAGTTAGCGGCGCCGTTTCCAAGCTACTGCAAAGTATCGGGATCATTAAAAAAACGCCTGAACAGGAAGCAGCCATCGCGCCACCAACGGCATCAACATCAGAGCCAGAAAATACCAATAGCCAACTCAGTGTATCGAAAAGCTCCCCGTTGACGGCTTATCAACCGGCCCTAACGCCCGGCGCCGCCGTCACGGCCAATCATCGAACGGTGACCAATAACGTGACGATCCATACCACGCCGGGCATGGACCGCGAGGAAATACAGAATATCTATCATCAACTGAATCAACAGCAGCAGCGGGAACAGGAGCGTGTAGCGCAATCCCAATTCGCCAATATTTAAGGAACCAACACTATGATGCTAACCCTTGGGATCTACGTATTTATGCTGCGTACTCTCCCCTATCAAACCATCAAACGTGACGTCGCCTATCAGTGGCCGGAAAACAAACGCGTGGGCCAACGCGCGACTTCGCAGTTTCTCGGCCCCGACGTCGAAACGATCACCCTGGCGGGACAGTTAATGCCGGAACTGACGGGAGGCCGCGTCTCGCTCGCCGCTTTGCAGGCAATGGCGGATCAGGGGCGCGCCTGGCCGCTGATTGAAGGGAGCGGAACGATTTACGGCATGTTTGTGATTCAACGATTAAGTCAGACCGGTACGCAGCTGTTCCCCGATGGCCAGCCAAGACAAATCAATTTTGATATCACGCTGAAACGCGTGGATGAGTCTCTGCACGCCATGTTTGGCGACCTGCGTACGCAGATGGAAGGCGTGTTGCACAAAGCCGGAGCGTTGCTGGATAAGGCGCAGCAGGCAGCGGGGATTAATCCATGATAACCGACACCTCTCTGCCAATGGGGGGCAAGATTGCCCCTGATTTTATTATCACGCTGGACGACAAGGATATTACCGCGTCAATCGCACCCCGGCTGATTTCGCTGACGTTGAAAGACGTCAGCGGCTTTGCCGCGGACTCTCTGAGCATCACTTTGGACGATAGCGACGGCCAGCTCATCATGCCCAGACGCGGCGCGCTGTTAACACTTTATATCGGCTGGCAAGGTTCGGCGTTATTTGGACATGGCTCATTCGTCATTGACACCATTACCCACACCGGCGCACCGGACCAGCTGACAATATCGGCTAACAGCGCTGATTTTAGGCGCTCACTTAACCAATTACGCAGCCAGTCTTATTCAGATATCGAGCTGGGAGACATCATTAAACAAATATCGGAGCGCAACCAACTGCGCGCCCCTAATGTAGATGCCGAGATAAGCACAATAACCATTCCGCATATCGATCAGACCAATGAGTCAGACCTGCAGTTCCTGGTGCGGCTGGCGACGCTCAACGGCGCCAAGGTGAGTATTCGCTTCCAGAAAATTTCATTTAGCAAACCGGGAGCGGGCGGCGTCAGCAATGGTCAGCCGCTGCCCGTAAGCGTTCTTACCCGCAATGAT
GGCGATCGGCATACCTTCAATATTGCGGATAAAAACGCCTATACCGGCGTCAGCGCCGTCTGGCTCAACACCGATGAACCTAACGACGCGAATAAAAAAATTCGTCTCGACCGCCAGTTGCCGGGGGCCCCCAATAATACCTCCGGTTCACCACCAGCAAAACCTGGCGCTCAAACATCGCCAAACAACTATCTACTCGGTGCCGGAGACAACATATTTAATATCCCCAAGGTCTTTCGGGATAAAAATGCTGCGATGCGTGGCGCCGAGGCCGTATTCAAACGAATACAGCAAGGATTGGCCAAGTTCACCATCAACCTGGCTATCGGGCGGCCAGACCTGGTGCCAGACACGCCAATCATCGTCCGCGGCTTCAAATCACAGATTGATAGTCAGCGATGGCTCATCAACGCAATCACTCACAACCTGAGCAGCAGCGGATACACCTGCACCTTGGACTTACAGGTGATGCTTGATGACGTCACGTACCACACAACAGAAACACGATCTCAAATTTGAGTTACTAACTTGCATTTACAAGATTGAGGTGTAATATGAATATTACTGTTTGAATAAACACTAGAGGAATTTAATTATGATGCACTGTCCAGTATGCGGTAACGTCGCTCACACCCGTTCCAGCCGTTACCTGAGCGAATCAACCAAAGAACGCTATCACCAATGTCAAAACATCAACTGCAGCTGTACGTTTGCCACGCATGAGTCGGTTGCGCGCGTCATTGTTAAGCGCGGTGATAACCCACAAGTGCCGGCGGCGGTAGCCACGGGCAAGGCACGCCCAGCGGCGCAGACTGCGGCCAAGAGGTAAACCATGCTGCCGTTTGACTCCGATTTACTGCCGACTCCGGCAACCAACCGGCTGCTGCCGGTTGGTTCAGCACCGCTCGAAGTCGCCGCCGCACAGGCCTGCGCCGAACTGGAACACATATCGGTATCGCTGCGCGATTTATGGAACCCGGCAACCTGCCCAGCCAATTTATTGCCTTATTTGGCCTGGGCCTTTTCCGTCGATCATTGGGATGAAGCATGGCCGGAGGAGACAAAACGCAGCGTCATCGCATCATCCTTTTTTGTCCACCGGCATAAAGGAACGATCGGCGCGATACGCCGCGTGGTTGAACCGCTGGGTTACCTGATTAACGTTGACGAATGGTGGCACGGCGGCGATGACCCACCCGGTACCTTCCGGCTCAATATCGGCGTATTGGAAAGTGGCATCAGCGAAGAGATGTACCTTGAGATGGAGCGCATGATCGCCGATGCCAAACCCGTAAGCCGGCATTTGATTGGCCTGACGATTTCACAAGCCGTGACAGGATATGCCTATACAGGCGCGGCGGCTATTGACGGTGATGTCACCACCGTTTACCCCGGATAGAGATAACCCGTATGACAAAATATAAAGCAATAGTGACCACTGCCGGCGCGGCAAAAATTGCCGCAGCAACGGCCGGCGGGACACAGTTAAAGATTACGCACATGGCCGTCGGCGACGGCAATGGCACACTGCCCGAACCAACGCCAGAACAGACACAGCTGATTAACGAGCAATACCGCGCTGCGTTAAACGCATTGGATATCGCCAATACGTCACAAAACCATATTGTCGCTGAACTGATCATCCCGGCAAACGTCGGCGGTTTCTGGCTGCGCGAGATGGGGCTGTTCGATGAGTCGGGCGAACTGATCGCCGTCGGCAATATGGCCGAAAGCTACAAGCCCAAGCTGGAAGAAGGCAGCGGACGCACGCAAACGCTGCGCATGGTCATGGTTGTCAGCAGCACCGATGCCATCCAGGTTATCGCTGGTGGCGATACCGTGCTGGCAAGCAGGGACTTTGTCGAAAAAGCCATATTGGAGCATGAAGGCTCACGCAATCATCCTGACGCCACCACCACGGTGAAAGGCTTCGTGATGTTGTCAAACGCCATCAACAGCTCTGATGAAAACAAAGCGATAACCCCGGCGGCGCTGAAACA
GGTCAATGATGCCGGGTTAAAAAAAACGTCAAACCTCAGCGATGTGCCAAATAAAGAAACTGCCCGGAGTAATCTACAACTGGGTGACGCAGCCACCAAAAACGTCGGTACCGAAATCGGGCAGTTGATGGCCGTAGGTGCTTTCGGCCTGGGTGGAGCAGCAATTAACATCAGCGATGCTAATGCAGCCAGCAAAATCGGCTTTTACAGATTAGCGCCTCGTGCAGCAAACACGCCAAACCCCGCAGCGCCCTGGGAAATGCTGTGCATTAAATACGATTCAGGATCTACCAAACAGCTTTTTTGGCAAGCCGGCGCAAATACTACCCGTACATACATACGTTACCGCACTTCTCCCGCAGGAGTATGGGGACCATTTCAACATTACTACAGCACGGAAAACAAACCGACGTCTGCAGATGTGGGAGCCTGGAGCAAAAATGAAGCAGATGGTCGTTATCTGATGAAGAGCGGCGGTCAGTTAACCGGGACACTAAAAACCAGCGCTGAAATCCAGTCAACAGTGCCTAATAATTACCGCTCAATCGGCGGTGACTATGGGACATTCTGGCGTAACGACGGTAATAGTCTGTATTTGATGCTGACAAATGCCAAAGACCAATATGGTGGATTCAACGATATGCGTCCACTGTACGTTAATATAAAAAACGGTGCGGTGACGATGGGGCACGATGTTTCCATCAATGGCAGGCTGAATGTTGGCCCCGCAATTCATTCCAGCGATGGCAATATTATGGGCAGCCGCTGGGGTAATAAGTGGCTATGGGATGCCATCATTGAACAAGTCAACGGGCGCGTAGACTGGGGCTCTTTTTCTAATCGTACGCATGAAGGCTCCGGATGGTGGCGTGATGAAAACACCGGGGTTTTATTCCAATATGGCCGCACCGTGACTAAAGGCATTAATGAAAATACAAATATTTGGCAAAATTTCAACATTTCATTCTCTCAGTGGGTTGGTCCAATACTATTAACGCCATACTGGCCAGGCAGAGACTCTTGGGGGGATGTCTTCGATTACGCCCCTATACTAGGCAGTAATAATAATCAGGGTTTCAACTGGAACAAGCAAAGTTTTGGCTCCGTTCATTCAGACTGGGACTCAGGTATGACATGGTTTGCAATAGGAAAATAA
Protein sequences of DBSCAN-SWA_2 >LS483492|4016010:4033091|4028823_4029306_+|SQJ28753.1|DBSCAN-SWA MMLTLGIYVFMLRTLPYQTIKRDVAYQWPENKRVGQRATSQFLGPDVETITLAGQLMPELTGGRVSLAALQAMADQGRAWPLIEGSGTIYGMFVIQRLSQTGTQLFPDGQPRQINFDITLKRVDESLHAMFGDLRTQMEGVLHKAGALLDKAQQAAGINP >LS483492|4016010:4033091|4026321_4026609_+|SQJ28751.1|tail|DBSCAN-SWA MEMITSQDTSVVLDTPIKRGDSEISEVMVTKPNAGSLRGIGLAALANADVDALITILPRVTYPNLTKEECARLELPDLIALAGMVIGFLAPKSAE >LS483492|4016010:4033091|4020103_4020298_+|SQJ28723.1|DBSCAN-SWA MEITDKQQLTLSRIQFIADVSQAAQCSPSEYLIALSLISDLAAQELPEHDYQAVYYPADQQHSR >LS483492|4016010:4033091|4023487_4024396_+|SQJ28748.1|DBSCAN-SWA MPTIDLSLLPAPTVVEPLNYESLLAERKARLIALYPEDQRDAIARTLALESEPLVKLLQENAYREMILRQRVNEAAQAVMLGYATGSDLDQIGANFQVERLVVQKPDNSVVPPVPAIMESDSDFRVRIQQAFEGLSVAGSSGSYEYHGRSADGRIADVSATSPSPATVVIAVLSREGDGTASDDLLAVVERALNDEDVRPVADRVTVCSADIVSYAIDATLYLYPGPEAEPIRQAAEAKLKKYISTQHRLGRDIRLSAIYAALHAEGVQRVELASPKNDIVLDKTQASYCNSYALKIGGSDE >LS483492|4016010:4033091|4017634_4018978_+|SQJ28569.1|DBSCAN-SWA MITLEFGIIIACLLVGTRYGGMGLGLISGVGIFVLCFVFGLQPGKPPIDVMLTILAVIGCASVLQTAGGLNVLMQYAERLLRRHPQHITLLAPFTTWMLTFLCGTGHVVYTMFPIIGDIALKKGIRPERPMAVASVASQMAITASPVSVAVVSLVSIIAAGHGIGRAYSLVEILAVSIPSSLVGVLIAALWSLRRGKDLEHDADFQARISDPEQRAYIYGAGETLLDQRFAKSAYASTLIFFAAIAVVVLLGAFSELRPAFAVKGVMKPLSMNLVIQMMMLIAGAVMLIGCKVKPGDIANGAVFKAGMVAIFSVFGVAWMSDTFFQAHMGDLKLLLEDVVKSQPWTYAIVLFLVSKLVNSQAAALTAVAPMALQLGVEPKMLLAFFPAAYGYFVLPTYPSDLACIGFDRSGTTRIGRFIINHSFILPGLIGVSSACVTGYLLVTTLL >LS483492|4016010:4033091|4024559_4025729_+|SQJ28749.1|tail|DBSCAN-SWA MGDYHHGVRVLEINEGTRVISTVSTAVIGMVCTAEDADTSVFPLNQPVLITDILAASGKAGKKGTLSAALLAIAEQAKPVTVVVRVAEGQDEAETTTNIIGGADESGKYTGMKALLAAQAELGVKPRILGVPGLDNLQVATALATICQQLRAFGYISAYGCKTVTEAIAYRKNFSQRELMLIWPDFVSWNTDTNSSDIAYATARALGLRAKIDQETGWHKSLSNVGVNGVSGISASVFWDLQTAGTDADLLNEACVTTLIRKDGFKFWGSRTCADDPLFQFENYTRTAQVLADTMAEAHLWAVDRPLTPTLIRDMIDGIKAKFRELKSAGLIIDGDCWYDESANDQETLKAGKLFIDYDYTPVPPLEDLTLRQRITDRYLANFAASVNG >LS483492|4016010:4033091|4016010_4017084_-|SQJ28565.1|DBSCAN-SWA 
MQKDALNNVHISAEQVLITPEELKNQFPLSVADEAQIATARKTIADILQGRDPRLLVVCGPCSIHDPDAALDYARRLQTLAADLSDRLYIVMRVYFEKPRTTVGWKGLINDPHMNGSFDVEAGLHIARRLLLDLVEMGLPLATEALDPNSPQYLGDLFSWSAIGARTTESQTHREMASGLSMPVGFKNGTDGSLSTAINAMRAAAMPHRFVGINQAGQVCLLQTQGNPDGHVILRGGKTPNYGPEDVAACEKQMRDAGLRPSLMIDCSHGNSNKDYRRQPAVAASVVEQINAGNRSITGIMLESNLHEGNQSSEQPRAEMRYGVSVTDACIDWESTETLLRDMHQQLGAALMARTGE >LS483492|4016010:4033091|4020712_4020931_+|SQJ28742.1|DBSCAN-SWA MEKFTSTLSYLIAACLAWFGRHSTEDIAMMVGAAVGVGTFAVNWYYRHKSYQLLKSLRKKILKRGKYDEPTP >LS483492|4016010:4033091|4020914_4021427_+|SQJ28743.1|DBSCAN-SWA MNPLLKRCSAVAILALAATLPQFGQLHTSERGLRLIADFEGCQLSPYQCSANVWTNGIGHTAGVKPNTAITERQAAANLLDDVRNVEKGIARCMSVDMPQPVYDAVSAFAFNVGVGAACRSTLAGFIKQRRWQQACDQLPRWVYVNGVKSKGLERRRQAERALCLQGVSP >LS483492|4016010:4033091|4022500_4023136_+|SQJ28746.1|plate|DBSCAN-SWA MNTYNPDIQRLVRNLIRIGTVSDIDLERGRCRVRTGGNHTDWLCWMTSRAGSARSWWAPSLGEQVLVLSLGGELDTAFVLPGIFSNAFPPPSHSAEALHITFADGAVIEYEPAQGALKASGMKSATLEATETATVSAANIVCRASSKITLDAPEVECTQHLVTGSLAVRQGGSMTGDVTHSGGSITSNGIVVHTHTHGGVQNGGGQTDKPL >LS483492|4016010:4033091|4019006_4019642_-|SQJ28572.1|DBSCAN-SWA MAEDTVNGRAAAHGKALYERVLASGGKGVLRRILDAYGFSLQKELGDLLGISSGTMSTWVRRNYFPGDIVVTCALDTGVSLQWLATGQGEMRASDPQDMVPPACAVPLYRLIAGQLKDGGEWLADRTLIPSTAEQPAYVDGGRVAWLVDLSVTNIANGRWLLDIDGNLDVYDIARLPGNQLQVSGNGTDFQCSTDAVKALGMVWLTLTPQA >LS483492|4016010:4033091|4020502_4020709_+|SQJ28726.1|DBSCAN-SWA MMKIYAQQGDTIDAMCWRYYGSTENVVEQVYRANRGLAAYGALLPHGCPVEMPELNAAARRETVKLWD >LS483492|4016010:4033091|4021423_4021837_+|SQJ28744.1|lysis|DBSCAN-SWA MNRWLLSLGALLLTLLAASQWQNQRLQQTLAHQQQRLGEAQATLTTRDAQINQLKQQRQRRERAELALRQALSSAQDLTLQREKRLQELLNENKKLRDWYRTGLPDDVIRLQQRPAFARPGDYLRWLSESQQLPHSR >LS483492|4016010:4033091|4030767_4031334_+|SQJ28756.1|tail|DBSCAN-SWA MLPFDSDLLPTPATNRLLPVGSAPLEVAAAQACAELEHISVSLRDLWNPATCPANLLPYLAWAFSVDHWDEAWPEETKRSVIASSFFVHRHKGTIGAIRRVVEPLGYLINVDEWWHGGDDPPGTFRLNIGVLESGISEEMYLEMERMIADAKPVSRHLIGLTISQAVTGYAYTGAAAIDGDVTTVYPG >LS483492|4016010:4033091|4025742_4026252_+|SQJ28750.1|tail|DBSCAN-SWA 
MALPKKLKYLNLFNDGFNYMGVVSSLTLPKLTRKLEKYRGGGMNGAASVDFGLDDDALVVEWTMGGIDELVLKQWGQVDAVPLRFTGSFQRDDSGDVSALEVVMRGRHKEIDSGDFKQGEDTETKVSTDCTYFKLSIDGKELIEIDTVNMIEKVDGVDLLAAHRKAIGL >LS483492|4016010:4033091|4023132_4023483_+|SQJ28747.1|plate|DBSCAN-SWA MNNAKYIGMGRNSGRAINDIEHIRQSVSDILVTPIGSRVMRRQYGSLLSALLDQPQNDALRLQIMAACYSALLQWEPRIRLTEIAVNTTFDGKMVVDLTGSRTDTPDNFSLSVSVS >LS483492|4016010:4033091|4030530_4030764_+|SQJ28755.1|DBSCAN-SWA MMHCPVCGNVAHTRSSRYLSESTKERYHQCQNINCSCTFATHESVARVIVKRGDNPQVPAAVATGKARPAAQTAAKR >LS483492|4016010:4033091|4031345_4033091_+|SQJ28757.1|tail|DBSCAN-SWA MTKYKAIVTTAGAAKIAAATAGGTQLKITHMAVGDGNGTLPEPTPEQTQLINEQYRAALNALDIANTSQNHIVAELIIPANVGGFWLREMGLFDESGELIAVGNMAESYKPKLEEGSGRTQTLRMVMVVSSTDAIQVIAGGDTVLASRDFVEKAILEHEGSRNHPDATTTVKGFVMLSNAINSSDENKAITPAALKQVNDAGLKKTSNLSDVPNKETARSNLQLGDAATKNVGTEIGQLMAVGAFGLGGAAINISDANAASKIGFYRLAPRAANTPNPAAPWEMLCIKYDSGSTKQLFWQAGANTTRTYIRYRTSPAGVWGPFQHYYSTENKPTSADVGAWSKNEADGRYLMKSGGQLTGTLKTSAEIQSTVPNNYRSIGGDYGTFWRNDGNSLYLMLTNAKDQYGGFNDMRPLYVNIKNGAVTMGHDVSINGRLNVGPAIHSSDGNIMGSRWGNKWLWDAIIEQVNGRVDWGSFSNRTHEGSGWWRDENTGVLFQYGRTVTKGINENTNIWQNFNISFSQWVGPILLTPYWPGRDSWGDVFDYAPILGSNNNQGFNWNKQSFGSVHSDWDSGMTWFAIGK >LS483492|4016010:4033091|4026756_4028811_+|SQJ28752.1|tail|DBSCAN-SWA MSNQQAVNDSTIKTANRNVILSTKQVHYAQRQLAQLKKLAALKRQTDDIQQQLNNNRAVRTSLPAAISSIEDIQTVQQLNFDDVLQRAHRKLLAFQKGQLSKKLEKKGFNTARLDEAQQQQSQHLNNAQASLGANKAEQSRLRQQHRAQIFANHDRRQGRIDALGKFTEPAAAISNQLFAGGKQLFTPGAQFEQQLSALQARLGLKQNDPHLLALQQQASQRQNGANTPADIQQAQAALANSGYDAEAVLAATPAALNLARASGSSIKDAVKALTGVQQAFKLPVDQANNIADVMAKASNSYALPLTDLNKQLLSAAPDALSRGTGLEQTAAQLAPADKQLGNVAGAAQAIVTVRGDNLDGDIQKLFASWDRIRINLFDGQNTALRALTQTATKWLDTLQQWVTNNPDLTATLITVAGAVSVLLGGLGSIGTFIVPALGAINMLMSGAALLGGIFTGVGSAIAAAFAALGLPIIAVIAAVVAGAAVIYKYWQPISAFIGGIIDGVLEALAPFKTLFEPIGQVFSFVVGKIKEFLTPVKMTQEGLDKVSAAGKKVGEVLTITLRSSIWVLQQVSGAVSKLLQSIGIIKKTPEQEAAIAPPTASTSEPENTNSQLSVSKSSPLTAYQPALTPGAAVTANHRTVTNNVTIHTTPGMDREEIQNIYHQLNQQQQREQERVAQSQFANI >LS483492|4016010:4033091|4029302_4030454_+|SQJ28754.1|DBSCAN-SWA 
MITDTSLPMGGKIAPDFIITLDDKDITASIAPRLISLTLKDVSGFAADSLSITLDDSDGQLIMPRRGALLTLYIGWQGSALFGHGSFVIDTITHTGAPDQLTISANSADFRRSLNQLRSQSYSDIELGDIIKQISERNQLRAPNVDAEISTITIPHIDQTNESDLQFLVRLATLNGAKVSIRFQKISFSKPGAGGVSNGQPLPVSVLTRNDGDRHTFNIADKNAYTGVSAVWLNTDEPNDANKKIRLDRQLPGAPNNTSGSPPAKPGAQTSPNNYLLGAGDNIFNIPKVFRDKNAAMRGAEAVFKRIQQGLAKFTINLAIGRPDLVPDTPIIVRGFKSQIDSQRWLINAITHNLSSSGYTCTLDLQVMLDDVTYHTTETRSQI >LS483492|4016010:4033091|4021947_4022412_+|SQJ28745.1|tail|DBSCAN-SWA MLKPDQLRDALFKALPDLQAQPEKLMMKLTDGRVVATPGPSLSFEYRYSLRLTLADRAADCELVMVTLLAWLRSHQPDLLANVEKRNGDFAFALHDEAHGQLDIQLQLTERILVEQREETLHITPLPEPPEPQDVLRFTDVYLHGELISHWQRS |
21 | Erwinia_phage(38.89%) | lysis,plate,tail | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
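The protein records listed above for this prophage region appear to use a pipe-delimited FASTA header of the form `contig|region_start:region_end|gene_start_gene_end_strand|protein_id|[keyword|]tool`. That layout is inferred only from the records shown here, not from any documented specification, so the following is a minimal sketch under that assumption. The names `ProphageProtein` and `parse_header` are hypothetical helpers introduced for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ProphageProtein:
    contig: str
    region_start: int   # start of the predicted prophage region
    region_end: int     # end of the predicted prophage region
    gene_start: int
    gene_end: int
    strand: str         # "+" or "-"
    protein_id: str
    keyword: Optional[str]  # phage-related tag such as "tail", if present
    tool: str               # prediction tool named in the last field


def parse_header(header: str) -> ProphageProtein:
    """Parse one header line; the field layout is an assumption (see lead-in)."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")

    region_start, region_end = (int(x) for x in fields[1].split(":"))
    gene_start, gene_end, strand = fields[2].split("_")
    if strand not in ("+", "-"):
        raise ValueError(f"unexpected strand: {strand!r}")

    # The keyword field is optional: five fields means the protein is untagged.
    keyword = fields[4] if len(fields) == 6 else None

    return ProphageProtein(
        contig=fields[0],
        region_start=region_start,
        region_end=region_end,
        gene_start=int(gene_start),
        gene_end=int(gene_end),
        strand=strand,
        protein_id=fields[3],
        keyword=keyword,
        tool=fields[-1],
    )


if __name__ == "__main__":
    tagged = parse_header(
        ">LS483492|4016010:4033091|4026321_4026609_+|SQJ28751.1|tail|DBSCAN-SWA"
    )
    print(tagged.protein_id, tagged.keyword, tagged.gene_end - tagged.gene_start)
```

Keeping the keyword as an optional field, rather than matching it against the legend's keyword list, avoids rejecting tags that appear in the records but not in the legend (the summary row lists `lysis`, while the legend names `lysin`).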
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|