Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP036299 | Planctopirus ephydatiae strain spb1 chromosome, complete genome | 3 crisprs | cas3,WYL,csa3,DinG,Cas9_archaeal,cas4,RT | 1 | 2 | 1 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP036299_1 | 2103013-2103227 | Orphan |
NA
Consensus repeat of NZ_CP036299_1
|
4 spacers
spacers of NZ_CP036299_1
>1.1|2103036|25|NZ_CP036299|CRISPRCasFinder GTGTTACACCGTGTGCAAGCCTGTT >1.2|2103084|25|NZ_CP036299|CRISPRCasFinder GACCTGCACGATCAAGAAGCCGGTT >1.3|2103132|25|NZ_CP036299|CRISPRCasFinder GACCTGCACAATCAAGAAGCCGGTC >1.4|2103180|25|NZ_CP036299|CRISPRCasFinder CTGCTACACCAAGTGCCGTCCTGTG |
CRISPR arrays and Neighbor proteins around NZ_CP036299_1
The CRISPR arrays of NZ_CP036299_1 >merge|NZ_CP036299|1|2103013-2103227|CRISPRCasFinder CGCGAGACCTGCTATCGTGAAGAGTGTTACACCGTGTGCAAGCCTGTTTATCAGACTTGCTACCGGGAAGTGACCTGCACGATCAAGAAGCCGGTTTACCAGACCTGCTACCGCGACGTGACCTGCACAATCAAGAAGCCGGTCTACCAGACCTGCTATCGCGATGTCTGCTACACCAAGTGCCGTCCTGTGTTCCAGACCTGCTATCGCGATGT >NZ_CP036299|1|1|2103013-2103227|CRISPRCasFinder CGCGAGACCTGCTATCGTGAAGA GTGTTACACCGTGTGCAAGCCTGTT TATCAGACTTGCTACCGGGAAGT GACCTGCACGATCAAGAAGCCGGTT TACCAGACCTGCTACCGCGACGT GACCTGCACAATCAAGAAGCCGGTC TACCAGACCTGCTATCGCGATGT CTGCTACACCAAGTGCCGTCCTGTG TTCCAGACCTGCTATCGCGATGT
>NZ_CP036299.1|WP_145298154.1|2100783_2102265_-|sulfatase-like-hydrolase/transferase MTEASPIRTIRKMLMSLGSHPAIALWLALVAFCSQTLSAAEDVNQTSQPGRPNILVIMADDLGYADLGVQGGCEIPTPHLDQLAASGIRCTNAYVSAPYCSPSRAGFLTGKYQTRFGHEFNPHVGEEAKLGLPLEEVTIASLLGKEGYRTALIGKWHQGFSKDHHPQSRGFDEFFGFLVGGHNYLLHKDVKPRFGSAHSHDMIYRGREVEPQEGYATDLFTNEALRWMSGPPDKPWFLYLSYNAVHTPLEIAPHLQKRIPESVKLPARRSYLSLLAGLDDSIGRIMQHLSQNGLRENTLIIFLSDNGGSGRAPILAYNSGLNHPLRGDKGQTLEGGIRVPFFVSWPRQLPAGTIYEQPIISLDILPTVCQLAAKDPAKPQPLPQGIDGVNLLPYWLAQRSGSPHESLFWRFGPQKAVRAGNWKLVDWRDFPASNSSGWELYDLSTDISEKNNLAEAHPETVARLKTSWEKWNQSNIEPLWRGSKMEDPTPMTK >NZ_CP036299.1|WP_145298151.1|2098404_2100444_-|ATP-binding-cassette-domain-containing-protein MESLNQLAPYLWRVRGYLIGSSIAALLVAIFWSAGLMLSYPLVKVLLQGQSLQAYLTEETQKAETETAQHQAEAARVQHQLDQLDPNVSSSEKISLLKSLDRAQRQLASSGWRLMFCTWLTKQLSTWLPEQAFPTLVLLLALLALVTLAKGLTVYLQENLVGIAVESTMVMLRQRMFQHVLRLDHVTLSLLGTPQLMARFTYDLQQLGTGLTLFGGRLVVEPLKILSVLICCFLVNWRLTTLSLIVVPLGALLFGRYGKKIKKASRKQMESMSRIFTQLEETLRSFRTILAFGLERQRRQSFHREQKVAFEKTMKILRMDALSSPTTEILATLGACMALLPGAYMVLRQKSEIFGIQLSEGPLDIAELALLYTLMGGLLDPARKLAAVYPKFKKSSAASDRVFGLLKTHTRLQQDRQWVPLPRHTDQIEFENITYTHPVAEDRNQARPPALDRVSLKIAFGETLAIVGGNGSGKSTLVNLLPRLYDPDFGILRIDGIDVRHTQLALLRKQIAVVPQETQLFNRTISENLRYGSWDAPADQVTRLAHDFAVDSFASRFPDGLETSVGERGQRLSGGQRQRIALVRALLRDPAILILDEATSAIDRESETQIYEALAHHKQNRTVLIVTHSLTDTLLKIVDRVVVMDHGQIIAHGEHTQLLKTCPIYRNLFAAQREQAHAA >NZ_CP036299.1|WP_145298148.1|2097234_2098314_-|alcohol-dehydrogenase-catalytic-domain-containing-protein MLAACLSSYWASMFSQPKARCFLSAKRMIRHPHERAPEQVMTKIVLSRGNIVIDSSVDEALLSPSSTTDETIIELRLAGICETDLQLISGYMGFEGVLGHEFVGTALNGPFAGKRVVGEINCPCGQCVYCQRGQNSHCPHRTVLGIYRRHGAFASRFSLPARNLHVVPESLSDDLAVLTEPLAAALRIPEQLDLAGKSVLVVGDGRLGSLCAAALFPLAGELAVVGKHPEKLQFANELGLTTYLKDKLPAQRKWEVVIETTGRKEGLEFALPLVEPEGVVVLKTTVAAPHQLSLTPVVIDEIQVIGSRCGPFAKALDWLIQQGSILQPLIEKTYYLHEGREALAHAQKPGTRKILLKPS >NZ_CP036299.1|WP_145298145.1|2095116_2097081_+|cadmium-translocating-P-type-ATPase 
MTAPRQKKSTAEPTTENSNRWLTEGQLQTSIAGLAIVAMLVWGIIAYALPSFTIYGSIQVADLSLLVALLLGGGYLVAGLLGNLFRGEFGSDLLAGISIVTSVLLGEYLAGTLVVLMLSGGEALEAYAVRRASSALDALAKRMPTVAHRLIDGHLTDVPLQEVQVGELLVVLPHEFCPTDGTVTAGQGTMDESFLTGEPYLLPKAVGASVLSGAINGEAALTIRVDQPASDSRHAKIMEVMRASEQQRPRLRRLGDQLGAWYTPLAVSIGIAAWVISGEAVRFLAVLVVATPCPLLIAIPVAIIGAVSLSARKGIVIKDPAVLETVTQCRVGIFDKTGTLTYGRPAMVELTPLSGFDRNNVLQLAATLEQYSRHPLAMPFLEAIQKTGLPLLEVSSVAELPGQGLTGLVQGHQIVVTGRAKLQKNQPALAEQLPPTQGGLECIILIDGQLAAVTSFRDRPRDDGKRFIDHLRERHHFQKVMIVSGDRESEVKYLADSVGISEIYFSQTPEQKVAIVRAETRKAKTVYVGDGINDAPALLEATVGLAFGRGSDVTAEAAGAVILDGNLKRVDEFLHISRRLRTIALQSAVGGIVLSLTGMSFAALGYLPPVFGAILQEVIDVIAVLNALRVGFPAGSLVDYDSSDSGGSSGNVQS >NZ_CP036299.1|WP_145304423.1|2094103_2095120_+|SUMF1/EgtB/PvdO-family-nonheme-iron-enzyme MLANPWIMANKVRGEKSGPPEGVMAPAGMVWIPGGEFRMGSNDPLAWDDERPTHPVSVEGFWLDKHEVTNREFAEFVKATGYVTTAEKAPTAEEILANSPPGTPAPDPAVLVPGSLVFTPPDHPVPLNDFSQWWVWTPGANWHHPEGPDSNIDERMDHPVVHVSWDDASAYAQWAGKRLPTEAEWERAARGGKEGLPFVWGTAPPSETSPQANLWNGKFPYENTAKDGFTRTAPVKSFPANDYGLYDMSGNVWEWCQDWYDRELYSRRPQLTVTKNPAGPEKSHDPTQPFQQLRSQRGGSFLCNDSYCSRYRPSARHGCSPDTGMSHVGFRCAKSVAP >NZ_CP036299.1|WP_145298142.1|2091212_2093819_-|DUF1592-domain-containing-protein 
MLKTYQFSITLCGLKGRNPFWLTLCSLFLFSMQPTVWAAEHGTTEFVPDQQGYQKLVAPFFKSYCSECHSGDKPEGEFSLDAAQLKNQLTDPVAKARWREVVNVLNSHEMPPEESRQPQAKEVAAVVDWVTAEAVRAEKFSREQTNVLRRLTRDEYRRTIRDLVGIDFDVSAFPEDPAAGGFDNNGSALTISPMHMELYLASARQILDRALVEGEQPPTIRWKFDPKVGPADRVRLRLDPKNNPLVNGGNNRQQGEWVAVHHASWDTNVGARDFRVPIAGTYRIRAQLAGTKPTRQEVIASATKILAKRRDEQIAKHPERTRQHQEQYQRDLQHFETSRIYDYGPVRAKLNVQLGSQPRTIAEFDIEGTRELPEIKTFLTRMTTETAGISFEYAYSIPRELENFWLQSSLEFARPELLVDWFEIEGPVVESWPPPSHQLIIGKENPSADNEVSAVTSILRNMMLRAYRRPPDEAEVQQRLAQFIAARKSRTFIEALKLPLISILTSPNFLYLVENPVQNPAQPLDDYQLATRLSFWLWSSMPDNELLQLAAEKRLSQPEVLKSQVARMLKDARSRAFVENFAGQWLGLRQVGANPPAKDLYPEYDRHLETSMIEESLAYFQEFLQHDLDVRQMIQSDFVVINERLARFYEIDGVRGDQFRRVQVSSDIPRGGIVTQASILCLTSNGTRTSPVKRGTWILKTLLGTDPGLPVANAGEIAPKVPGIDKATVRQRLEIHRELPQCARCHNKIDPLGFALENFNAAGDWRDREGFGYQGRIQANDPLIDASSRMPDGTNIVGVRGLQQALYQRHDLFVKALAERLLTYALGRELGLSDQPAIDRIVQAARDDNYRLSAMLESIATSETFCLK >NZ_CP036299.1|WP_145298139.1|2089206_2091126_-|hypothetical-protein MIATASQPNLTLSAIRHKIPVYSWLALFLAMGLLGYASFQIQATSSITFDETFYLNAGVRTVASGTLDPAICDCGVAPLPIIICYLPPLLLSRNEYRHEVWVGQNDDPQLIIWPRRLNTLLVGMPLLIVIWLWLGRKCGPAGASLGVLMTATSPTIQAHAALATTDLSFALFGLLGILALAHYLERPTWVRLGILAIAMAACLSAKYSGVFLFPVAGLMFLGRALRDQNVTKKKLTPTNETSTEKSVESASVSQLSQGWWPVVRQTFVTYTLLLLLVIPLWWAFHGFSFTGPLKNVPLEVTPPDSPWVEMLGRGPWADWIMDQAHRRWKRPAPIAGVLFQYLHNKSGHTAFLMGETSLTGWRTYFPCTIGFKSTPSELILIALLLFGTISLRLAGCISWSKLDFSRRCLWLSLAVLAALLLFARINIGHRYVLILYPLFIMLGIDTIFMIYRNYIDRFPNTNGTSKAHWPRIIFTLSTSLLLAMQFWSSWSIQPHALSYFNGLCGGPEQGGRLLLDSNIDWGQDLPTLQQLLSSEDSSKVALQYFGTALPTAYGIEADPTKALRQPLEDYELLAISVTHLGGLYVFGNDPYQKFRTWQPDARAGYSINLYRLDTPERKAAFAQAIQEHEKARTQSLLKQ >NZ_CP036299.1|WP_145298136.1|2088335_2088974_+|histidine-phosphatase-family-protein MRQAPPANVCRMLLCRHGATAANEMRPYILQGCEMNGPLTAIGEAQARSLAAALSGFLVAGVYASPLQRAHQTAEFIAKSHQLKVETDANLRECSVGRWEGLDWETIRTKDTEAHDHFFADPATNPHPGGESYSDVLNRAEPTLKQLAERHQGENIVVVAHNMLNRVILTPLLGLSLRMARQVKQANCCINVIEWTTDRAELITLNSVLHLD >NZ_CP036299.1|WP_145298133.1|2086322_2087729_+|hypothetical-protein 
MPATRPGAGGAVNRPGTGQGISNRPAQLPARLPGLGGGDVAQRLPNQGARVQDRMGDRPSVQDRQNNLSDRMANRGDWQENRQGLQDDRQDWRDQNREDWQNWGDNKLENYGDWYHDSWHPGSGWGYMWDNYPVAAAIGLTAWGINRIGYGWGYWNYANPYASSGGSYGYDYSQPLVSAESYAVASDPGASAEATQAASQQPTDEGLAAFEAAREAFRSGDYKGALAKLDITLKTMPNDTVVHEFRSLVLFALQKYPESAAAIYAVLAAGPGWDWTTMISLYPSTETYTEQLRVLEAFVKANPMSADGHFLLGYHYQTMKHDSAAAKQFSLASNLLPNDQLIKQLLGMTTPPEEATKSTVPSVPPVVPAEKVLQADQFIGNWKASQQGATFELDMTKSGVYTWTYAQGKRKQSVKGVFAVDQNNLALETDDGGETMLAEVQFISPTEFQFKMIGDSEKSPGLKFTKTQ >NZ_CP036299.1|WP_145298130.1|2086050_2086428_+|hypothetical-protein MAATLAREVAQQFNLVLDPTLVLAAVRSVHALHSCHQPGQRLALALGQESDRAQVLALAPVLALAPEVEPSEVEPSEGLDLVSCRVRGPLNCRRPGQVLVVLSTAPGQVRESAIVPLNCQPVCRG >NZ_CP036299.1|WP_145298162.1|2104519_2106262_+|hypothetical-protein MLQIQLGKWSTKCFSALALGASLLAATMVTRADEPRITFVQPGADRLREDLKYLVNLAPTPALKKEWENLDATLESFLAGVDGKTPIRVDLILGDKELIYQLNFPTTKLEGRDGFLDNVASFGYTVKKNAAGHYDLTEQGASKRKYFLRANGKYAMLSQRLQDIPANAPDPGKLAQNLLSGSQDIVADLKNDPATITQRQADFTKLRTELEAAVKFKRDEAKEVFELRKLGTNHQLSELERFIVQSEHLQATWTTDIEAGKGSGTLLLTALPGTELEKSIALLGTKPSAFSAIKLHDQPVISGKTCFELDDMRQKQLIEFYPVMNAAIGVNIENRPNLTAEGKAAAKEFSKTLFALLTDGVSLGVVDSYIDLHAVEGNVHNMTCGILAKETSTVPALLALMPKIRDDWKYEADVAEHNGVKIHKLTFPARRLEGFQSIFGKDQSDLYIGVGKEIVWGAAGKGGLDELKAAIDQQASGEKTVSPVVLSGTVKFGPTVQLLEIFRGNTPVSKTGKSKAELDRIKKIENLRKMAVDAFASGDDLLTFSLERKDREVVGQLNANQGILRYVGSVIANFSKEALR >NZ_CP036299.1|WP_145298165.1|2106724_2108995_-|phospholipid-carrier-dependent-glycosyltransferase 
MDRTRYDESASNAIHTCDAPGRPRRGTTGDRCIPLSLSLAGRREADNPRTGVRPGTRPVPAETALSVYKSTPGHPYHLGAAPRKRSGITLRKLFAHACASRRRHGKELKLRPGQRNASIASVAPQPLTRSHSIQMADWIYGSIAAIATGALLFALVPHQMYFHDEGQYAQGAERILLGELPHRDFHEMYTGGLAYLNALVFQVAGHRLDMLRYALVLLAIPVSVVSYHLLRGLKLSPWAACVLTVALFSTSLGTMHAPWANSYLMLFTLLAIAALLQDAKSPHLGWVLAAGICCGLAIAVKVTGLYTLAAIGLSLTYRGDPASPQVYPDLDFRPLSGAVWIERLWRGALALLMIALAVVLIRQHLSIHSFMYFVLPIAFAALTLVSSGGRLSWSMFGRYAVLGFGTAFPIALLLTPWLLEGAIQDFWHGVFELPRLRIGTTYYPPPPATGLLIAIPTVCCFLPIRHPWAPGVRITGLLILTSLMLYGLGDKSPSGFHTDGLLTAMSAWRWVLPIISAITCVVLTRSPQAPLSHRVGLYCLTICAACFSLNQYPFAQPIYFFFCVPLVILAAVALSLFDPSLPETTAAPGPRAMVHTGTGAVIVLAAFCLLRFGIDGTYGEFFESVGQRRQVYLSGIAVPAVYASGYNRLRELADEVLSEDQTVLAGPDCPHIYFLTGRRNPTPYMYDIFVTDPEYFSQLIELAHSDQVGMVIISRAPEFSPVWPQELVLKIAEAFSYAEEFERFIVLTKRAPASRP >NZ_CP036299.1|WP_145298168.1|2108664_2109411_-|methyltransferase-domain-containing-protein MDAGYVQVYRQLYNQHWWWRTRETAVLGAIREFHEPSPDDCVLDIGCGDGLFFDRLKDFGSVEGIEVDPLAVSETGPYRRQIHVGPFDDSFVTTRRYRLITMLDVLEHLDDPTAAAKLVRTLLTPDGLFVLTVPAFQTLWTGHDTMNQHRTRFTRATLRAVLDAAQLEIVASRYLFHWLAAAKLIIRGLECVRGPDQSPPKLPSPFINRLLGTLTTWELRRGKGLGLPFGSSLLMLARPAGAMGRNSN >NZ_CP036299.1|WP_145298172.1|2109414_2110482_-|glycosyltransferase MLANDVFQRTAALTSDEPHTAIGRTPPRVSVAIPVYNEEALIQELMRRVLAVLDGLPGGPHQLVLVDDGSRDRTFACICAAAQHDRRILPVALSRNFGHQIALTAALDYADGDLVVMMDGDLQDRPEVIPDMLTLWRDGADVVYAVRTRRKETWLLKTCYQAFYRVIERLAHLKLPRDSGDFCLVSREVADVMRTTREQHRYLRGLRAWAGFRQEPLLVERDARSAGDSKYGFRQLFQLAFDGIFSFSTVPIRVATWLGLTTVALTLFLGLFWVVAWSLNYAPQGFTALATSIAFFGGVQLVFLGLIGEYVGRIYEEVKNRPLYVVRKSSETGASAWPDDTPPAIQGLSAFTKRI >NZ_CP036299.1|WP_145298175.1|2110442_2111039_-|methyltransferase-domain-containing-protein MDLRSLLDHPMLYSAWQFPFVAQKVAPFLAVHDPMRTGRVLEIGCGPGTNASLFRQCDYVGVDLNPRYITTARKKHRGTFIQGDATSFVLEDSQPFDLVFCNSLFHHLDDAAVERTLARAASLLKPSGELHLLDLVLPDQPSVARFLARADRGDYPRSLSAWQALLSQHFVQDAFTPYPVKFLSITCWQMMYFRGRPL >NZ_CP036299.1|WP_145298178.1|2111350_2111572_-|hypothetical-protein MHDEITQNQGQSRIKAGSLAEEVVIGGRSRSQESTTDPWVNLAGYGNWTETNTSGWSESVSNYEFSLLTAGFE 
>NZ_CP036299.1|WP_145298181.1|2111769_2112147_-|hypothetical-protein MSSTMKMLVVFDPTKPDSQTTDFLIPWSRDGQRVFLGLKSGKESALGMMVFIGRSITENDLFAKLVDSGAVIPDVDETLALLRSYVERLQSLKIGNVARIRSIDQVNGSDVELELVANTPSALNA >NZ_CP036299.1|WP_145298184.1|2113309_2114485_-|lipid-A-disaccharide-synthase MHIFFSVGEPSGDQHAAHLIRALQHRDPGLKVSGLGGPAMEVAGCEVIYPLTNLAVMGIFRVLPLLTTFYKVFRQARAHLQKHRPDAVVLIDFPGFNWHIAKAAKSLGIPVYYFMPPQMWAWGGWRIHKLKRTVDHVISGLQFETEWYAQRNVPVTNVGHPFFDEIVHHPLDQSFVREWKPQAGRVVALLPGSRGHEVTHNWPRMLEAARMLHERFDDLTFYVANYKEKQRQWCSEEFVRTGGGLRMNFFVGRTPEIIDIADCALVVSGSVALELLARRTPYATFYSCSKLTHWIGRQIIHIPHFSLPNLMANRRIFPELLFVGEAPEAGRQMADAIAPWLADPQQMTMKLEELDALRRDVVQVGALTRTADLILRLTTDQAQTEQLSSAA >NZ_CP036299.1|WP_145298187.1|2114681_2114918_-|hypothetical-protein MEQTSAPMFVLSTDGELHGVDTEGNRELVRRIYACVQACEGISTDELEMGVIAEMKRLLGQVVPLLQNRSAAAEREAA >NZ_CP036299.1|WP_145298190.1|2115329_2116241_+|hypothetical-protein MALWIAIVTQENMHWLQEGQGRLPVCLWSFTTDAPLADLKLARETGEVVASDHSGGIYLIDIEGRVRALTRLGQVAREVAFADNGATIAAVSGPSSLSLMNRKLAFQWTRQLPDSILATALAPYGEHVAVSLTDGMNAIYTLGNKKQATFQTPRPLKHLAFLGTEAALIGAAEYGLISKIRLDGQILWSESLWSTVGGLAVTGNGESILLAGYMHGLQMFGNDGSAAGTLMLDGTASHVGQSFHQHRIYSATLEQHLACLSKAGDLVYLLGLPEPLHRLQVTAMGDAVIVGFPSGRIMKLAML |
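The spacer records listed for NZ_CP036299_1 follow a FASTA-like layout with a pipe-delimited header. The sketch below shows one way such records might be parsed. The field meanings (spacer ID, start, length, contig, tool) are an assumption inferred from the values on this page, and the names `Spacer` and `parse_spacers` are hypothetical helpers, not part of CRISPRCasFinder.

```python
from typing import List, NamedTuple

# The four spacer records of NZ_CP036299_1, copied from the table above.
RAW = (
    ">1.1|2103036|25|NZ_CP036299|CRISPRCasFinder GTGTTACACCGTGTGCAAGCCTGTT "
    ">1.2|2103084|25|NZ_CP036299|CRISPRCasFinder GACCTGCACGATCAAGAAGCCGGTT "
    ">1.3|2103132|25|NZ_CP036299|CRISPRCasFinder GACCTGCACAATCAAGAAGCCGGTC "
    ">1.4|2103180|25|NZ_CP036299|CRISPRCasFinder CTGCTACACCAAGTGCCGTCCTGTG"
)


class Spacer(NamedTuple):
    spacer_id: str
    start: int      # assumed: 1-based start coordinate on the contig
    length: int
    contig: str
    tool: str
    sequence: str

    @property
    def end(self) -> int:
        # Inclusive end coordinate, assuming start is the first spacer base.
        return self.start + self.length - 1


def parse_spacers(text: str) -> List[Spacer]:
    """Split flattened '>header SEQUENCE' records into Spacer tuples."""
    records = []
    for chunk in text.split(">"):
        chunk = chunk.strip()
        if not chunk:
            continue
        header, sequence = chunk.split()
        spacer_id, start, length, contig, tool = header.split("|")
        records.append(
            Spacer(spacer_id, int(start), int(length), contig, tool, sequence)
        )
    return records


spacers = parse_spacers(RAW)
for s in spacers:
    print(s.spacer_id, f"{s.start}-{s.end}", s.sequence)
```

Under this reading, spacer 1.3 spans 2103132-2103156, which agrees with the Spacer_region value reported for it in the self-targeting table.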
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP036299_2 | 2281065-2281135 | TypeII |
NA
Consensus repeat of NZ_CP036299_2
|
1 spacer
spacers of NZ_CP036299_2
>2.1|2281088|25|NZ_CP036299|CRISPRCasFinder AATCTTGCGGACACATTCTTCGCTC |
CRISPR arrays and Neighbor proteins around NZ_CP036299_2
The CRISPR arrays of NZ_CP036299_2 >merge|NZ_CP036299|2|2281065-2281135|CRISPRCasFinder ACCATCTTGCAGGTGCGAACGGGAATCTTGCGGACACATTCTTCGCTCACCATCTTGCAGGTGCGAACTGG >NZ_CP036299|2|2|2281065-2281135|CRISPRCasFinder ACCATCTTGCAGGTGCGAACGGG AATCTTGCGGACACATTCTTCGCTC ACCATCTTGCAGGTGCGAACTGG
>NZ_CP036299.1|WP_145298479.1|2279685_2280633_+|site-specific-tyrosine-recombinase-XerD MPPRFKPVQNSVASALPTALPSTHVEGFVAYLQGECGLARNTVISYERDIKRFSDWFHESAVARVTDITLQDLSGYLRYLHELKLATSTIARHLVSLKMFFRYLMLEGILKESVADLLNSPKLWEHLPKVLSEDAVNRLLDAPMHQDLNPYRDRALLAVLYATGCRASEVCSLKMRDLQLDEGFCRCVGKGNKERLVSLNPVAVAALKTYLARERPARAGTSLDSPVFTSRSGRSIDRIQVWKLVKRYASRIGLSDAVTPHTLRHSFATHMLANGAEIRALQELLGHASIRTTQIYTHVEHSRLKAIHKQCHPRG >NZ_CP036299.1|WP_145298476.1|2278722_2279307_-|DUF2892-domain-containing-protein MTIPTILPIELKTHIDAAEPVTIIDVRTPLEFQEVHAHGAVNIPLDELEAEKLSSFVNPESKTVYLICRSGGRGKKACEKLLAKGPWQPINIEGGTQGWEAAGLPVHRGKKIMSLERQVRIAAGSLVVLGCALGAFVTPYAFGLAAFVGAGLVFAGVTDTCAMGLLIARMPWNSTPRSNSSCSLNLTSPPATSH >NZ_CP036299.1|WP_145298473.1|2276515_2278636_+|DUF255-domain-containing-protein MNLVNRLAAETSLYLNQHAQNPVAWQPWDDEAWRLARELDRPVFLSIGYSACHWCHVMEHESFENPRIAELLNQWFVSIKVDREERPDLDQIYMAAVIAMTQQGGWPMSVFLTPQGHPFYGGTYFPPTSRYGRPGFAEVLAAIHDAWENRREVVTEQASQLTMSVHDQLSERQEPTTLQESLLEKAGRTLVRVCDRVNGGFGHAPKFPHAMDLRLALRLAHRFDTTETAEVAELGLTAMAKGGIHDHLGGGFARYSTDEVWLVPHFEKMLYDNALLLQAYLDGWQFYKTDFYRRTAQSIVHYVLREMQVPGAELPGGFCAAQDADSEGEEGRFFVWSQSEIRTVLSGSELGNDDSRLFERAYGVTSGGNWEGHNILNLPKTIAALGRELGMAETALEQKLSLLRAKLFEHRKSRIAPGRDEKLIVAWNGLMISALARSGLVLDDQEALQAAQRAARVILDMAESLPYGLPHSIQKGQPKHGAYLDDYGCFLEALIELFLADGDPSWLSRALPLIDRLVNEFHDEQGGFYFTSNKAEKLISRSRDFQDNVTPSGNAAVANALLKFGRITGDARSQELAHEVLQAASGLMQQSTMATAHSLAALDWWLGANYECVYVPAETTSTTDAEPLKQNAVERVAHELYLPNVLFLTGRAQWEGTLAAGLVNGRLAPASEPVLYVCQKGACQLPVVGEAAIIARLKGLAQSTDE >NZ_CP036299.1|WP_145298470.1|2275923_2276412_+|hypothetical-protein MPRFALLEHDWPENHGDFLLEVPGLKSRDGLGLCWTWRFPGDRKVWERLFLQGGSIEAERNTDHRAHYLDYQGAVTPAADGTPRGFVMIVFQGEFQWADPWELSPLEGPPVLPDFLQMVFQVHPGKQMMAGQMQLLESPEGWKFTILPETNSKALKTDKVHS >NZ_CP036299.1|WP_145298467.1|2273193_2275827_+|hypothetical-protein 
MAPLWCQVTTLKDSAMDFAVCPSCKQSVLDDDAVDCPFCGAPMKGGPRPGGAPAKPAASKAAQSPKSVSASAANTVSATNTVNRGAANKNAGKSSSADQSSGTDDLPFETETKAATTVAVATATKQRSFRIVCPMCETVGYISPKSAGTRVKCANPKCLVPVFDAPQLPPPEPEKPPEPPRKPNYILLGGGTAAICIIGGIVALFVASQPAASPVAVNRMTPEEAARLLAETAPAANNTSNQTKNSGTPDGSPQTTTETKPNPGDTKTAASSTVDLAKELPILIEKTVLIRGSQNRSLPFSRQMAAEAYALLGNIEKAQEQAVAFDRVGQNVPFYKISILTELGWAQLRAGTSDAAKASALAAYNLVNKLPGNGRSRLIFASRLGSLLIATGEADKATQLANDYATTWKPGLIDSNQVAERLERQDGQVAALLQAGLDDFAGSIAQVTATRPGVVWFEPQAAAMARSAAAQGFSEAAISWAISQPEVRQLETLGEIAALEGAIAAPDVTAEALKAKIEGWGANASVMKLPASKAYLFARAAAAVASVSKPAALGLIKEAQALLEPVELPAPMTWPNTQGLVNPQLPAIAPLERLFVAWAEVSIACQLAGDADGCDSALSKALATLQGVGPALNYVDQFLEQLNGPDVASIRDRLKDELGIRTEDAARQAFSSYRRGTLNYREHAAIRYDFELQALDRLAQAGGQAIVWKKLQEMATETNAVQKQLPSEISMKLVSSLATAFQLANEPTQAQAVTDWYSGITRIGAITPSAMLLSASRFGAKDGAGAVTVLKEADAPTRALMTSLLAGQLKDAGMVEPALVLVGQTSDLSVREEASIFAAHAAHHLGEAARIVTHLQTLPQATERLSWIRGLSMPIKK >NZ_CP036299.1|WP_145298464.1|2272222_2272858_+|universal-stress-protein MEASRGNIFKVIEMIRRRGLSHRLEKALMECRPWARVTFISNLQPEDDTVKLLLCVDASGSSDRAISRVTELGLGHSKSNSVTLFTVCESLPEHVFEISEKVGMHAKELASAWSLKSRSAGEQALAQAKAKLLESGLPESAIHIKLCVADATPESRQVAAAASLIEEMTSGNYDLVVIGRRGAKHLSENLIGSVADKVIRAASGRSVLLVD >NZ_CP036299.1|WP_013109441.1|2271255_2271813_-|Hsp20/alpha-crystallin-family-protein MAIFRWGNAWQAMENFEQEVDSLLAGVLQGLRVVRHYPAVNLYELPDQLLITAELPGIRLEELDISVNEGSLTLKGRHLGPEGVPDEAFRRQERPRGKWQRTLKLPDGIREEAVSAQFSNGVLLIRLPKVPPKPARQITVTAVAENPPPVESVPEAIPTPARRIATEPGQVSGLIEPKERESHES >NZ_CP036299.1|WP_145298461.1|2270846_2271266_-|Hsp20-family-protein MRAEQPEIPPGRASIRNPGDAAIEQPVEQQVVFTPPIDIFERPDGLILLADLPGVTLETLELQVQDNRLTLYGRVQTQWLEGVQTLHQEYPIGHFLRSFILNDEVDHEKITARLAHGVLEVFLPRATKAIPRKIEVKTD >NZ_CP036299.1|WP_145298458.1|2270353_2270728_+|(2Fe-2S)-binding-protein MELDDTICFCFHITKRKVMNFIRVHRPRVASQVSECGGAGTGCGWCRKYLKRYFDESQGRMPATSGGLDAHTEEAPEITLEEYATSRAAYIRAGHGKPAAGAIPLPGVTTEPTLQQPSDPESPE >NZ_CP036299.1|WP_145298455.1|2268514_2270275_+|hypothetical-protein 
MPFPACPKCLAPIRIRDRKFVGREIPCPSCAAHIVPLVTGHDEWQVILPEEYARQSAATSNSAKSNLEKQTTGVHASTGPAIVKPLGANTLRAGGRQLKAPSPFVMAWIGSGVITFCIVMVIVQSFSRRDHQARELVTQQPNESSQQQPVQPIENSDPGQTTGEKELVPQPLPASHEKLAGLGQWLSGELASQGAFPAATTGKSENPEEAWGWLNRYVAQTQPTIVPPPFEGSWRGPTHDRFVRRRMNSLLINEQEGLIGGDGYPATQLVGVAGVGIDGPHLPASSEYAGIFAYGRTTRQRDILDGLSQTAMVASVKEDLASWASGGKGGIRSLTAAPVIDGPDGFGIVGQSGQLLLMADGSVKELTNAADPVIMRRMFTMAEGIDLAEAASPDAPLFPARAASRQTMPMADPPGGSESVADSHSSAPTGISPEKDDPAAEKPNGKVEKSSTGEVVAPPAAQAPLPPAMVVQKQDLKINLGLSLQRFRTDEPVARRRLVKSAADLLGAPVITPEGPLPAELVEILDQKITIDLEKVSIENLLHEIFRDSQARWSLQDGQLLIQLSSEVVPGPPPATGGEPREVAVP >NZ_CP036299.1|WP_145298485.1|2282869_2283682_+|5'/3'-nucleotidase-SurE MHILLVNDDGIHAPGLNSLHAELVKFAQVTVVAPAVEQSGVGHSITYLHPLLAHREYRQDEFFGWKVEGSPADCVKLGIMELAQPRPDLVVSGINHGANVGINILYSGTVAAAIEGAFFGVTSFALSQWLGNSAPRFPQAARLGAQLIQQIMAQNPPAGSLWNINFPTWSTNDQSSAESWPRGVRLTSMGVSRQRETLEKRIDPRGRTYYWTGLEPIHGHELVEGSDVAALVDGYVTVTPLHFDLTNRSQLTQAETNWSPLVIPPQHDQE >NZ_CP036299.1|WP_145298488.1|2283686_2284655_+|cation-diffusion-facilitator-family-transporter MEQLPEVHLEASKVTSNNLRQLRYRDALFGAWLGLWINLALGITKLIAGIIGQSFALLADAFNSLSDCVTSSAVIFALNYSQRPANSNHPYGFSKAEALAGSHVALAILISCFLLGWEAIERLTVQHGLPPWWTLAVAAANAVIKESLYWYKLKIAKSTGSTALLAHAWDHRNDALCSVAVLIGLSVVRIGGEAWRSADEVASLVVVCVVGFSAASIFRQSLREMLDLQIDETEVDAMRATVQRVPGVQGIEKFRVRKTGIEQIAELHLRLPAEITVFAGHALAHQVKAELMQQHPMLRDVIIHVEPASEPASHSSPEESSR >NZ_CP036299.1|WP_145298491.1|2284692_2285988_-|L-fuconate-dehydratase MPKFSGIQAIDLRFPTSRLLDGSDAMNPDPDYSAAYVILKTDDAALPAGHGMTFTIGRGNELCVAAIESLGKLLIGRDLSEFTEAPGQFWRELTGDSQLRWLGPEKGVIHLATAALVNALWDLWAKIEQKPLWKLVTDFTPAQLVNCIDFRYLTDALTPAEAIERLERKVAGKSARVEQLLATGYPAYTTSAGWLGYSDEKLRQLCRSNLAQGWEVFKVKVGRNLDDDLRRLTIIREEIGPERRLMIDANQVWDVSTAIDWVRELAKFHPWFIEEPTSPDDILGHAAIAKAVHPIAVATGEHCANRVMFKQFLQAGAIGICQIDSCRLGGVNEVLAVMLLAEKFGIPVCPHAGGVGLCEYVQHLSMIDFIAISGSWENRFCEHADHLHEHFVHPISMKNGRYQAPLAPGYSIEMLPETIAAYRFPEGAIWA >NZ_CP036299.1|WP_145298494.1|2286040_2287411_-|PTS-transporter-subunit-EIIA 
MIIVDLADLVSGIRIVELTAKTRQGAIRALVQAANWDDDGINPENVLEAIEEREAAAQTLVANDFALPHAFIDWDGDFRIVLGRSKSRVDYGGPAGVNVQLIVLLVIGRRLQQTHVEVLAALAELLKSPDFRQNLIDAKDIKAIDLLLMTQAGIQPENRPVRGPSIPRLTVNMVKTAIQLTESLAAQALLLAVERVENVPWEALANYKGRLLLVTSQHSEEFDHKREDLHIFDVAHASLSRADRANLGLLLAASSGLLTEKNSVVCVTGPDGRRLDSINVTKPEVHFRAMLSEKNRRGADVVRPAVMLRVLTLAIEIAAEGREAHPIGALFVIGDSRQVLRHSQQLVLNPFHGYARQLRNVLDPSLAETMKEFALIDGAFIIQGDGTVLSAGTYLTPKSASGSVAHGLGARHQTAAAISAHTQAMAITVSQSTGTVTVFRNGSSVLSLERSGRTKW >NZ_CP036299.1|WP_145304441.1|2288566_2289112_+|acetolactate-synthase-small-subunit MKHVLSAMVMNQPGVLAHISGMLASRAFNIESLAVGETERPDFSRITFVVAGDNKVVDQVRKQLEKIVTVVKVMDYRDQDILERDLMLLKVSTVESGLSKVKELAEIFRAKIVDVGQSHVMIELSGPERKIDAFIDLMRPFGILELVRTGRIALAREVSLSVEDALKPPTPYDTSEVEIEV >NZ_CP036299.1|WP_145298497.1|2289407_2290412_+|ketol-acid-reductoisomerase MAAKVYYDDDADLSLLKGKTIAILGYGSQGHAQAQNLRDSGLNVVVGQRPGSKNYDLAVSHGFKPLSIAEAVKQADLVNILLPDEVQGDTYKSEILPNLKPNALLLCSHGFNLHFGQIVPPAGVDAALVAPKGPGHLVRSEYEKGGGVPSLIALWPGASDNSRKLALAYAKGIGGTRGGVIETTFAEETETDLFGEQVVLCGGVSALIKAAFEVLVEAGYQPEMAYFECMHEMKLIVDLFYQGGLNYMRYSVSNTAEYGDYTRGPRIITAETKQEMKKILGEIQSGQFARDWLLENRVNQASFKAVRRREREHQIEKVGRDLRKMMKWINAKEF >NZ_CP036299.1|WP_145298500.1|2290532_2291024_-|cell-division-protein-FtsH MDSAPLPQTELTATAYHEAGHAVVALFLGRPVHEVTIEPNSLRLGQCRLNKGNFKPSKDVLEGEILILLAGVAAEARYRGDYRWEGAVSDLRQVRHFSSMRATNIRQVERLEQRLLNKVEHILSQVGPWLAVEEIARELISRTTISGRSARHHFELAMERFAE >NZ_CP036299.1|WP_145298503.1|2291053_2292541_-|UDP-N-acetylglucosamine-1-carboxyvinyltransferase MDMLIIRGGIPLAGNVRLAGAKNASLPIMAASIAVSGLSQLRRVPQLADVSTLTQVLESLGAVVTRDLSTGELAITPPQATTGIADYDLVRRMRASVCVLGPLLARWGKAVVSLPGGCNIGHRPIDLHLKGLVALGARIRIERGYVHADATRLHGAEIYLGGPFGSTVTGTANIMTAASLARGMTRISAAACEPEIVDLGNFLNAAGARITGHGTPVIEIEGVDELHGVEHAVIPDRIEAGTMMIAAAATCGDVLIEDAQPRHLSAVIDILRQIGVSIEPSSTSDGRTGLHVSGAAHLNPADCTALPYPGVPTDLQAQLTALLCLVPGISIITDKVFPDRFMHVPELLRMGAQVRREGASAIIAGPAHLSGTNVMASDLRASAALVIAALAAEGESVIRRVYHLDRGYEKHDEKLRQLGAQIQRTTDHPEALPDSLRMTGESTIEQHKIQEIPGRFTQPFQNQPTKPHFLSDGYSIANPMVSQSDNFTAPAVDLP 
>NZ_CP036299.1|WP_145298506.1|2292695_2293490_-|trypsin-like-peptidase-domain-containing-protein MLVPELFLRVAKVETYGQGQLLTNASGFFFEWQSELYLITNRHVCVDEMAGHRPDTLKLQLHTVKDDLTKSGPLEIQLYKQGVPLWKAYPRPVNDIDVVAVPLPGKLLKQNYVISSFGASDILGPEETLPPGQQVLITGFPLGFHDTLNNLPLVRQAVVASDFSRPFKGNPYFVTDARTHRGTSGSPVVTKLLRPTPVAGQFEERWCLLGIHAATLDVSNRDPMYDDRLGLNVTWFASLIPQIIEGILPANPHPAPSSDSVCFN >NZ_CP036299.1|WP_145298509.1|2293853_2294327_-|DUF386-family-protein MIIDELANQHFYSHCHPAITRALQFLASPAAHELPTGKHRLDSDALIAIVEEYQTKQPTEAVWESHRKYVDVQYIVRGEEAFGLARTSDNLTIRTPYSDERDVVFYHPGTQRFVASAGMFLIFFPQDIHAPGLAVNEPSPVRKIVMKVRGDWVFNQD |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP036299_3 | 4233030-4233112 | Orphan |
NA
Consensus repeat of NZ_CP036299_3
|
1 spacer
spacers of NZ_CP036299_3
>3.1|4233053|37|NZ_CP036299|CRISPRCasFinder AGAGGCTTCCTTTTTGGCTCCATGGCCGCTGGAAGAA |
CRISPR arrays and Neighbor proteins around NZ_CP036299_3
The CRISPR arrays of NZ_CP036299_3 >merge|NZ_CP036299|3|4233030-4233112|CRISPRCasFinder CCATGCCCCGAAGAACCGTGCCCAGAGGCTTCCTTTTTGGCTCCATGGCCGCTGGAAGAACCGTGGCCTGAAGAGCCATGCCC >NZ_CP036299|3|3|4233030-4233112|CRISPRCasFinder CCATGCCCCGAAGAACCGTGCCC AGAGGCTTCCTTTTTGGCTCCATGGCCGCTGGAAGAA CCGTGGCCTGAAGAGCCATGCCC
>NZ_CP036299.1|WP_145302259.1|4231669_4232635_+|endonuclease/exonuclease/phosphatase-family-protein MNGQWTRRVLKRSLLVAGFSWLLLSSTVTQAESPVVRVMSYNLWHGGDAGKQPLEQTARVIREARADLVGLQETRGYAASGQPRPDRAKEIAQILGWNYFDQGGGTAILSRYPIGKASPGKWGVEIEHPSGTKLYLFNAHLMYIPYQPYQLLNIPYGDYPFIKTEQEAIDFAKKARGEQVERLIADSKAAVPPDATVFVTGDFNEPSHWDWTAAAQKAGRCPIAVQWPSTSMMAETGFIDTYREAHPDPVTAPGITWTPITKITDPKDRHDRIDFVLMRQGGGKVLSSHVVGESKEAADVVIVPYPSDHRSVVSAVELTGK >NZ_CP036299.1|WP_145302256.1|4229722_4231384_+|hypothetical-protein MLSVRWVNLTSTLLLATCLMCAFSSTCQAHFLWVVDANQSADGKVHVYFSEEAAPDNPALLEKILGAKLWEANLHDDAVKVSPLKFEKGENSLEATFSPKSTNAGVFLDYGVMSKGGAEPYLLRYCGRSQTIANRKGGKVIGDAAVLPLEVVAERSGDGFLLRTFWQGQPAEGIEVTVDGGDLSARKEGVSNAQGELALGTLANGLYSIRVKKVLEEPGKLGDKEYKSIRIYSTTSLLIKPEAEVTQPAVSKALPELPRGITSFGGAVVGDVVYVYGGHTGGAHHYSKEDQSRELLALNVKQPSEWKVVGEGPERTGLSMVAVGTDLYRMGGFIADNGKEEKEVLKSTSDFARWDAASKQWIELPALPEPRSSLDAAVIGKTIYVVGGWNLNGGGKEAKWHETAWKCDVSANPPVWQPIANPPFLRRALSLAEHQGKLYVIGGMKNQGGPTTEVAIYDPATDSWSSGPALPGEGMEGFGNSAFAVNQNLFVSVMSGKVYKLSADGSKWDQVHQLEKPRFFHRMLPTSQHQLVFVGGASMEAGKAKTTEVLEVK >NZ_CP036299.1|WP_145302253.1|4228874_4229678_+|hypothetical-protein MKLCITIILLVIINLPTKFAQADPPLRFLRTEQGGVRLSLNTYTRVVMPDFHPPINTDSDSYSSGFVETAQESVTSASTILTNNAISHLQGDIVAEGDGDTTISAPAIVLGTHNISLDGSDPSDANLSASVGLDWDVEFVADNESELRENFGFYTVSVSVSYEAQALTSGIAHFNHGASGYIQVTVGEQIVTVYALDGVWVSDSNGIVTVTEPSSGIFSVSASSELFIEHDDVVDISGALWYNLTSATVDGGAVLGVSDMVIQVRVE >NZ_CP036299.1|WP_145302250.1|4228350_4228809_+|hypothetical-protein MFQKMTMLLLLFFGVFSFSGCGQNYIGRQPISGSILFDGTPMNEGTVRFIPDGQTKGPAAYGIIQDGFYEISHHEGVVPGTYRVEIEKKIELPFEIDDEAAYAKHLAEQKGRQFPRQPVPPKYNRNSELRVTIVPGRNASKLDFDLTTSENK >NZ_CP036299.1|WP_145302247.1|4227281_4228157_+|DUF1559-domain-containing-protein MRHLLPRRGFTLIELLVVIAIIAILIALLLPAVQQAREAARRTQCTNNLKQIGLAFHNYHDVHGAFCTTDGGTAATGSSAFAAILPQIDQGNLFLKYNFNLTNSDPANTAVTSQVIPAYLCPSAVLRRPVPMPGCTNTSTGRNIDDNRAPGTYAVNTGTTAGNGAIIPSSTGVTRLRDFTDGTSNTLLAGESAWNLPDYLFSSSASCAGQVRWGFTYWASPYNLSTAFVVQPPFNPKSGGSAVLSRFRSDHTGVVNMVFADGHVRFLSENIDSGLLSALATRSGGEVVGDF 
>NZ_CP036299.1|WP_145302244.1|4226123_4227041_+|methyltransferase-domain-containing-protein MTGSDESAQNPPSPRSPELSENRVEPLQDLSSQQPAFTEGTPKREKPEKSRRIEREFGIPIPGEILPPEQWVKTAIKALPSSGLLNWHEVFGQTLEQKQGIVLDLGCGNGRSTLWHAIVEPQYCYLGVDILPVVIRYATRRANQRGLGHLRFAVIGGKELLAQLVPAGSVSKICIYHPQPYYEEHLIGKRLVTPEFLAMAHQALVPGGELILQTDHPAYWQYMEQVVPAFFELRAIPGPWPDAPRGRTRREILATKRGLPVFRGVATRKEGLDRASAFEKASALPVPTFNADRRLLALDREEQSL >NZ_CP036299.1|WP_145302241.1|4225134_4226232_-|hypothetical-protein MQAADWRDPAKARLDFPKALVIWEMADSVLIRRILSSRPHPEEAFALLCKGIRNKFSSASLSLPPSTLILRWTSEDCSPARSTGTHAMKDLLYTLCPNRQSVSGCFWMPGRTCWLMLLGASAVLWHAVVFDQVEVIAAPPTVIDNPATTTAAPPQTAAVSQAERWNKELAAFDKENTMHPQRTGGLVFCGSSTIRLWKIKDSFPEYSPINRGFGGSRYDDLVRHAERILKPLQPAVLVIYSGDNDIAAKQSSEEVVQNFEKFHTWVRTELPETKIIVLGIKPSVKRWGMIESIRETNSQLAKICGKDPQATFIDTETVMLGADGQPIPELFVADGLHMSEEGYRRWAQLLRPYLEILHPVNHPTP >NZ_CP036299.1|WP_145302238.1|4224312_4224900_-|hypothetical-protein MASLHAIGETSLHSGLSQTGLFRQCFWRSWIFACGLLGIFGCDQINPPVPEVKPVAGFKRDDEDPPPPPQVTPPPAVVETVPEVKEPMPPPAPKLFTARLSSWRITPTQEKGVNGLLFQVEFEALENLNDVALLEISLLDNRSQEFRLLPMRDGNYIRVSKFVGNATAAGAPYSTVLWWKGVDGGNWKEAVRSRL >NZ_CP036299.1|WP_145302234.1|4223448_4224159_-|carbonic-anhydrase MRTLFAGLHQFHKEVYEKQRNLFEKLSSGQKPVALFIGCSDSRVVPDLIMMTNPGELFILRNAGNIVPPFGASTGGEAATIEFAVSALNVSDIIVCGHSQCGAIKALLNPASTEKLPMVRQWLLHAETTRRIMEENYPALAPAERYEVAIQEHVLVQLENLQTHPAVAVKLQRNQIALHGWVYQLETGQVHAFSTNTGIFEPLMGDKSMDSAALAQPVTSNVVNNANGSTPQSPPA >NZ_CP036299.1|WP_145302231.1|4223006_4223438_-|hypothetical-protein MNSAPQTAEISVTETNRLTPEKLQIRWKGFAPTVIIASGIALGGLYALGTRGGTQGLWMMLILGVTLGLTMTAAGMSWAGTIALAENAKKGLLFVLFPPYTLWRVIVRCDIFWQSMLVFVLGLIISFGCVWLASEGLKSQFAS >NZ_CP036299.1|WP_145302264.1|4233886_4234750_-|flagellar-motor-stator-protein-MotA MFVIIGFIVVTGCVLGGYTWAGGHVEALIHPSEVLTIGGASLGAMLVMGSPKILKDLIKSIIGALKGSPFNKKAYGDLFKLSYDLLKTARKDGLLALEPHLNDPHESKIFSKYPRLQKDHHFTTFLCDGLSSVVEASMSQEQLHDLLQKQIAVMDEEHHQAVGILAKSADAMPGFGIVAAVLGIVITMGAIDGPVEEIGHKVGAALVGTFLGILASYGFMAPLAARMDLLGKSEMDFYRTTAAIIEAAIGGAAPKQIIEQARRVVCSEARPERVELEKMLKEADAAA 
>NZ_CP036299.1|WP_145302267.1|4235062_4236001_-|inositol-monophosphatase MQSEFLINALSVHLPPILRWAGVIARRLRSHNISVATKTTGSALTDALTLADLTIQELLVAALRDVDPIFRHCRIEAEESTGDLDAFAQESEFVLALDPIDGTKQYRDHTGNGYAVMLHLRTASEMLYSLVYIPEKGVQGWWVEVTPERVVSGPDDWLRPATDVVRSLVVAEADRPTTSNKIYVIGFQHRDVMAASEITKCGYEGVAPDEMPGSIYELMASGEFAGSLIHSPNVYDFPVSMQIARYFGGNACWTDTGEPVHFRETWLDEKADMIRLPRVVACANDPQILAGLINLAKSWHPARYRPGDSLVE >NZ_CP036299.1|WP_145302271.1|4236077_4237052_-|RimK-family-alpha-L-glutamate-ligase MTAVLKMGVLSQPGNWSLEQIELAARERGHMVVPLEFSELAASCGSVTRSATWPIEITGQAFLKRSTEVMPRFELEELDALIVRSMPGGSLEQVVTRMNLLALAEQKGLPIINSPKSLECAIDKFLTTARLAHAGLPVPATFVCETAQAALEAFEKLGRNVVLKPLFGSEGRGIFRIEEPELLWRVAQTLVRTGAVLYLQEYIDHGGRDLRVLVLNGVPLGAIERMSQEDFRTNLSLGGQSMRTELDDEAASLAVAAAKTVGTVFAGVDLVRHPDGQWLLLEVNGVPGWKGFQAATGIPVAIRLIEYVEQLVELSQAGRSEPGG >NZ_CP036299.1|WP_145302274.1|4237048_4238008_-|methenyltetrahydromethanopterin-cyclohydrolase MNLELNELSLSIIEEVVESPDHFRIIAHESAGTGLLLDFGVEAEGSLEAGQVLSTVCMAGLATVNLQPQLLGKWQWPHVVVETDYPVAACLYSQYAGWQLSSGKFHAMGSGPMRAAAGVEPLFDKLWYKEEADHVVGILETAMLPPESIFEEISAKTGVEPADITLLIAPTSSVAGNYQVAARSVETAMHKLLELGFDVHRIRSGFGSAPLPPVAANDLQGIGRTNDAILYGSTVTLLMTGDDESIAEILPLVPSRSSSVFGKTFLEIFEAAGRDFYKIDKLLFSPAQVIVQNVDTGRVHVAGEPHVPLVLKSFGLVTE >NZ_CP036299.1|WP_145302277.1|4238231_4238981_-|Bax-inhibitor-1/YccA-family-protein MRTGNPVLNEGAFDRWAQEGGLSSSSVMTVSGAVNKSMVLIGLTASLGLLSFSYLQNNPQLLMPSLIGSAIGGLIASLCVCFWLRSAPVAAPIYALLEGVFLGGITTIFEARFPGIGVQAIAATVGVAAAMLLAYKVGLIRATATFRAGVIAATGGIGLVYLLAFVLSLFNIQAMSFVYDASLLGIGFTAFCVIIAALNLILDFDFIATAASQNLPKNYEWLAAHGLNVTLVWLYLEMLRLISKLQSRD >NZ_CP036299.1|WP_145304621.1|4239299_4240271_+|RluA-family-pseudouridine-synthase MAILSKTVTVEKYLAGVRVDSFLVKHFRSYTSWRILRMVAAGLVTIDGQVAATTQRVHPGQQVTIQLAEPPDLYIPPEPTPLEILFEDDSLLVINKPAGLIAHPTGEIPDGTLINAVQYYLDKQSTHPGLERPGIVHRLDRDTSGAMAICKEHLSHRLLSIQFQLGRISKSYLALVEGVLTEDKLVIDLPIGRAPGCSSALMSCRADALEAQASKTSLRVVERFPQHTLVEAKPRTGRMHQIRIHLATIGFPIVGDEFYLKEGEIRPLVWPESEREQVSPLINRQALHAACLSFAHPLTNEWQDFQAPLPDDLRQAIERARHG 
>NZ_CP036299.1|WP_145302280.1|4240880_4241783_+|alpha/beta-hydrolase-fold-domain-containing-protein MMHDDQRKPSRLIINQSLKLTRILGFLLIPLMAPVSLSAQFSYPPEFKEARVETYRKTGSTELKLWIFGESDPKTPKPAIVFFFGGGWNSGSPAQFENQARHFAKRGMIAIVADYRVKSRHNVQVVECVKDAKAAIAWVRENAKRLGVDPDKIAASGGSAGGHLAASTGTISGFGSDERPNAMILFNPACTLAPIAGWQPPGARAKLSTERFGVEATAISPAHHVGPQTPPTLILHGTKDTTVPYASVVAFEAEMKKAGRPCKLVGYEGAEHGFFNRGENYDKTLAEADRFLVELGWIKK >NZ_CP036299.1|WP_145302283.1|4241810_4242449_+|chemotaxis-protein-CheX MRFIGYPCTNTPLSRRTAFRLFESPSASMHFVIPNVSGFVVTEPYASPSAWIIPFCDAATEVFLCMLQATCQIQSIRPPDSAPPLESITIAIDLTGTAPGRVILSVAESVADQFVERLTGFGAEGDLGLLRDIVGEVANMVAGSGKGNLPQLGFLIGTPRQLTAEEFINLSANWPLRQTATLETSFGLCLLDVSWNFQFATPSADTPLPETV >NZ_CP036299.1|WP_145302286.1|4242312_4243392_-|FAD-dependent-oxidoreductase MYDCTIAGGTALAVEVATTAAQLGLRTALIEPASAWEISSESVGAQWSPEFLEMITQNEFAAAMSSQTLARRLLPYGRREQERQQRLLREARVDCFAGAAKVVGADGASIAVMAMSAANPSVKQPVVDGPLLMTHLLIVATGSVVAPVHHGVLPHTPHESMNSKSSLPSSLGELLLTRSVPQSMVVLGSNRWAQAVARIYSRLGSDVLLLGRRLPERQKSTEWQEFRAGIRGLRQRGDYCEVEICNGERVLVQTVVSCQTEVGVTVELDLGKVGIESDECGKLWCDAMGRTWHPQILAMGSVVGFPEELQQPRNIRQFLEEVYLLKESRTESSSSHPAGISRMKSPKWLSASKASLPTN >NZ_CP036299.1|WP_145302289.1|4243833_4244985_+|hypothetical-protein MTDIVEIRRSDDPRDVIHRVCQSLAQGELVGLPTETTYTVAASALSEVGIQRLKSLATELAVNHQAPQFELLLKAADEALDYLVEQSRYGRKLIRRCWPGPVTLRFPKSHCGWFFEQLPETSRHLLTAPENTVSFRVPAHTLPADILNLLPGPLIALSEAQPGQPPLRHARDADQRFGNNLALIVDDGPSRYGEPGTVVQIGNNDWNIAFPGVVSERTMGRLASEVILFACTGNTCRSPMAEVLFRKLLAEKLNCPEEELVDHGYVILSAGLAAAIGAPANPEAIALLADEGLDLRNHESQPLTERLLQQVDMIYTMTRGHRDAILAERPDLASRVRTLSPAGKDIADPIGGGRDVYRSCKQIIETHLQQIIQDLLSLRSHPS |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NZ_CP036299_1 | 1.3|2103132|25|NZ_CP036299|CRISPRCasFinder | 2103132-2103156 | 25 | NZ_CP036299.1 | 2103228-2103252 | 2 | 0.92 |
1. spacer 1.3|2103132|25|NZ_CP036299|CRISPRCasFinder matches to position: 2103228-2103252, mismatch: 2, identity: 0.92
gacctgcacaatcaagaagccggtc CRISPR spacer
gacctgcacagtcaagaagcctgtc Protospacer
**********.********** ***
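The Mismatch and Identity columns appear to be related by identity = (length - mismatches) / length, computed over an ungapped, equal-length comparison. The following is a minimal sketch under that assumption; `mismatch_count` and `identity` are illustrative names, not functions of the tool that produced this page.

```python
def mismatch_count(spacer: str, protospacer: str) -> int:
    """Count positions where two equal-length sequences differ."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have equal length")
    return sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))


def identity(spacer: str, protospacer: str) -> float:
    """Fraction of matching positions (assumed definition of 'identity')."""
    matches = len(spacer) - mismatch_count(spacer, protospacer)
    return matches / len(spacer)


# Spacer 1.3 and its self-targeting protospacer, copied from the alignment above.
spacer_1_3 = "gacctgcacaatcaagaagccggtc"
protospacer = "gacctgcacagtcaagaagcctgtc"

print(mismatch_count(spacer_1_3, protospacer))
print(round(identity(spacer_1_3, protospacer), 2))
```

For this pair the sketch gives 2 mismatches over 25 positions, that is 23/25 = 0.92, matching the row reported for spacer 1.3.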
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP036299_1 | 1.2|2103084|25|NZ_CP036299|CRISPRCasFinder | 2103084-2103108 | 25 | NC_007974 | Cupriavidus metallidurans CH34 megaplasmid, complete sequence | 1734391-1734415 | 4 | 0.84 |
NZ_CP036299_1 | 1.2|2103084|25|NZ_CP036299|CRISPRCasFinder | 2103084-2103108 | 25 | NZ_CP046333 | Cupriavidus metallidurans strain FDAARGOS_675 plasmid unnamed3 | 1892919-1892943 | 4 | 0.84 |
NZ_CP036299_1 | 1.4|2103180|25|NZ_CP036299|CRISPRCasFinder | 2103180-2103204 | 25 | NC_010811 | Ralstonia phage RSL1, complete genome | 144567-144591 | 5 | 0.8 |
NZ_CP036299_1 | 1.4|2103180|25|NZ_CP036299|CRISPRCasFinder | 2103180-2103204 | 25 | AB366653 | Ralstonia phage RSL1 DNA, complete genome | 144567-144591 | 5 | 0.8 |
NZ_CP036299_1 | 1.2|2103084|25|NZ_CP036299|CRISPRCasFinder | 2103084-2103108 | 25 | NC_008739 | Marinobacter hydrocarbonoclasticus VT8 plasmid pMAQU02, complete sequence | 71803-71827 | 6 | 0.76 |
1. spacer 1.2|2103084|25|NZ_CP036299|CRISPRCasFinder matches to NC_007974 (Cupriavidus metallidurans CH34 megaplasmid, complete sequence) position: 1734391-1734415, mismatch: 4, identity: 0.84
gacctgcacgatcaagaagccggtt CRISPR spacer
gacctgcgcgatcatgaagccggag Protospacer
*******.****** ********
2. spacer 1.2|2103084|25|NZ_CP036299|CRISPRCasFinder matches to NZ_CP046333 (Cupriavidus metallidurans strain FDAARGOS_675 plasmid unnamed3) position: 1892919-1892943, mismatch: 4, identity: 0.84
gacctgcacgatcaagaagccggtt CRISPR spacer
gacctgcgcgatcatgaagccggag Protospacer
*******.****** ********
3. spacer 1.4|2103180|25|NZ_CP036299|CRISPRCasFinder matches to NC_010811 (Ralstonia phage RSL1, complete genome) position: 144567-144591, mismatch: 5, identity: 0.8
ctgctacaccaagtgccgtcctgtg CRISPR spacer
tcgctacaccaagcgccgtcctgac Protospacer
..***********.*********
4. spacer 1.4|2103180|25|NZ_CP036299|CRISPRCasFinder matches to AB366653 (Ralstonia phage RSL1 DNA, complete genome) position: 144567-144591, mismatch: 5, identity: 0.8
ctgctacaccaagtgccgtcctgtg CRISPR spacer
tcgctacaccaagcgccgtcctgac Protospacer
..***********.*********
5. spacer 1.2|2103084|25|NZ_CP036299|CRISPRCasFinder matches to NC_008739 (Marinobacter hydrocarbonoclasticus VT8 plasmid pMAQU02, complete sequence) position: 71803-71827, mismatch: 6, identity: 0.76
gacctgcacgatcaagaagccggtt CRISPR spacer
atcctgcacgatcaagaagcccacg Protospacer
. ******************* ..
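The Spacer_Info strings in the tables above appear to follow a fixed pipe-delimited layout. The sketch below assumes the five fields are spacer ID, start coordinate, length, contig accession, and detection tool. This reading is inferred from the rows on this page, not taken from CRISPRCasFinder documentation, and the names `SpacerHeader` and `parse_spacer_header` are hypothetical. Under that assumption, the Spacer_region column is the start coordinate through start + length - 1.

```python
from typing import NamedTuple


class SpacerHeader(NamedTuple):
    """Fields of a Spacer_Info string (field meanings are assumed, see lead-in)."""
    spacer_id: str
    start: int
    length: int
    contig: str
    tool: str

    @property
    def region(self) -> tuple[int, int]:
        """Inclusive start and end coordinates of the spacer on the contig."""
        return (self.start, self.start + self.length - 1)


def parse_spacer_header(header: str) -> SpacerHeader:
    """Split a header such as '1.2|2103084|25|NZ_CP036299|CRISPRCasFinder'."""
    # A leading '>' (FASTA style) is tolerated and ignored.
    spacer_id, start, length, contig, tool = header.strip().lstrip(">").split("|")
    return SpacerHeader(spacer_id, int(start), int(length), contig, tool)


if __name__ == "__main__":
    hdr = parse_spacer_header("1.2|2103084|25|NZ_CP036299|CRISPRCasFinder")
    print(hdr.contig, hdr.region)
```

For spacer 1.2 this yields the region 2103084 to 2103108, which agrees with the Spacer_region value listed in the table.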
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
4792899 : 4837753
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP036299|4792899:4837753|DBSCAN-SWA CATCGAGCAATTTTGCCCCGACAAGTCCAGCGATCCCGGACGCACAGCGGCCGATCATAAGCTCTTCATCAATGCGGTGATTTTCACGCTGAAAGCTGGGATTACCAGGCAAGATCCGCCCGAGAGATTTGGGTGATACAAATCAATTTGGAGGCGATCTGGCCGCTGGTGCAACAGAGGTGCTTGGCAGACGATTGTTCGGGCGCTGGAGGATCCGGAACTGGAATAAGTTCTGATCGTCTCGACGACGGTGAAGGCCCATCACATGACGGCGACGGGCCGCCACCAGGCCGGGGAAAAAGAGGCGCGGCGGACGATCGCGGTTGCCAGGGTTCCGGTCGTAGTGGATTGACGATGAAAATGCCTGTGGTAGTGGATGGCCGAGGATGACTGTTGAAGTTGATCCTCACACCGGGACAGATCGGCGACCCCCTCGTGAGTGAGAAGTTGCTGGAGGGTCTTTGTGCCATCCATGTGTGGCAGACGCGAACTCCGACAGTGACGCGATCCGCCGGCGTGTGAAGAAGATGTGGGCCAAGGCCTGCATCAAAGCCGAAGGCGAACCAAAAATGAGGAAGCCCGAAAGGGTGCGGCCTCCTAGGGAATCCGGTTGGAGGTGGTGAGGCACACCAAAGCCAAGAAAGGATTCTCGCTCCTGCCCCGGCGCTGGGTGGTTGAGCGGACCTTCGCCTGGCTGGGCCGCTCGAGACGACTGGCCCGAGACTACGAACGGTTGGCCAAGGTTCTGGCCGGATGGCACTGGGTCGCGATGGTGGCGCTACTCGTCAAATGCATCCCAGAGATCATCCGGAAAAATCCATAACAGGCTCTCGGGCCTATGGACCATTCAGAGCGGCTTTGTCTTCAGGCCCTAGCACAAAACCCTGCAGATCATAAACCTCCGAAGGAGACCCATGCGGCTGGAAAGGTAGAAGCGTGCAACACGCATACTACCAACGGAACACTGCCGTACCCCACGTCAAGCCAGCGCCGAAGCCACTGAACACAAGGGGATCGCCGCGGCGGATCAATCCTTTCTGAAAGGCTTCATCAAGAGCTATCGGAATGGAACCAGCAGAAGTATTTCCGTACTCCTGAAGATTGTTATACATCTGACTGGGCTTGACCCCTAGATGCTGTGTGGCGTGATTGATGATTCGCATGTTCGCCTGATGCAGTATGAACAAGGCCACATCTTCAACACTTAGCCCGCTCTTTCCGAGAACCAGTTCGACAGTTTGCGTCAGAGCATTGACGGCCCATTTGAAGACGCTTCTGCCATCCATCTGGAGAAAGTGCTCGCCCTCTGCGACAGCATCTGGGGTGAACGGAAATCGTGAGCCACCACACTTGCGGTCCAAAAGATCGCTGCCACTTCCGTCGGAACCAATCTGGTAACAGAGAAACCCCTGCTCTGCATCACCCTTTGTCAGCAGTACGGCCCCTGCCCCGTCTCCAAACAGGGGAGCAACTTTCTGGTCGTAAGGATTAACGATCCGGCTATTACAGTCACCACCAATTACCAGTGCCATCTTGCTGTTGCCGGTGGCGATGTACTGAGCCGCAGTGGTTAGTGCATAAACAAAACCTGAACACGCAGCCTGAACATCCATAGCAGCAGCATCAAGCCCAAGTGCATGCTGCACCAGATTCGCGGTTGTTGGACAAAGATAGTCGGGGGAGAATGTCCCCACTACGACAAGATCAATCGACTGAGGGTCGATATTGCCAGAAAAAATCGCCCGCTTGGCGGCCTCAATACACAAATCGCTGGTAGCCTGATCGGATGCGGCATAACGCCGACGCAGAATCCCCGTACGCTGCTCGATCCACCCGGGATCACAGCCATAGAGATTCTGCAAGTCGCTATTTGTGACGATCTGATCAGGAACATACGAGCCCGTCGAAACGATCTGCACTCCG
AGCAGCGAACTGGTACGTTGACTCCAGGCAGTCGTTGCTCGCCCTTCGCTCATCGGCCTGCCGGATAGCGCCTGCGACAAGACGTTTGATGTGGACTCAGTCTGGGTCCGAGAAACTCCACCAGTCACCTTCGAAAAGGCAGTGGCCGAGGAGGCTTCAAGTGTACTGGGAGCGGCCTGCAACGGTTCCATAATGCCTGCAATGGTGCGAGGGTGTCTGATCCCAGATGCGTTTGGCACAAAGCTTCAATCGACGCTGGCATTTGCAACAACATGGTATTTACACCATGCAGGTAACTATACCAAGGTATGGTACCAAACCAGTATTGTCGCAATCCATCAACGGTATGGGGTACAATGCGCTGCAGTGAGAAGTATAAAGACTAGCAACGGGTTCGAGAGAGGGCAACAAATAAATTCGAACAGCAGATCATGTCGTGCGGATCACGCGCGAGGATCTTGAACTTCAGGCCAATGGTGGGAATCTGATGGCGCTTCTGCTGCCATCACCATCGTTGCCGCATCATGAACATGACTCCTTCCTGAAAAGTCGGCACTGTGTGGTCAGTTTTTAGGCAACCCCTCAATGACTTTGACAAAGGAACATTCATACACATGCCACTCAGTGCAACAACAGTAACAATTAGGGCCGACAACAAATATAACGCACTCTTTTGCGGTGCTTTTGCCACCACCGGCGGAAATGATTAGCCGCCACCTCTTTATTACTCTGAGGGTGGCGAAACCATCCAAGTGCGGCCCGAATCATGGTGCGAGATTTGCACTAAACAGTGGATGGGTTAATTCCAGCATCCTGAGAATCGATCATGGTTGGAGATCCGGCGGATTTCCGACTTTGACTGACCGCCGGCACGGGAATCCCGAGCAATTCTCAACAACATCCGTTGGGGCCCCCGATGTCCTATTTTTCACGACCGCTTACCGAAGGCCACTTGGTTCGTTCTGGCCCTTTTTCATTCTGTCAAACAGCTCAGACTGACGGAACAACCTTTGTGACATTGATCATTCCGTCGCGAAGACCCGGCAGCGAGTGGTTTATTTGGAATAATCGAATGCTTCGCAAGATTATTGGCAGGAGTTGCCAGCGAGACTCCAGAAGAGGTGGAATTTGATTCGACAAGTGGTAATCGTCAATTCTACACTTCTCCTTTCCCTCCCTGGTGGTGCTGATACCAACTCCAACCCCTTAAACCGCGGGCCATTTGGGACAGATCGCACATGGATCATGAATGGCCCCTTGCCGTGGTCTCGACCTAGCGCTCACCAGCAACGATCCTTGTCCGCTTCTGGGAATGAATTTCTGTCACAATGCAGTGGTATTCCTGAAAGGCTGATGGTGCAGGCGCGACACCTGAAGAAGGAGGCTGGCCGAATCCGCAAAGCAACTTTCATTAAGTCTAGATCCTCTCTTTTCATCCCAGAGGATGCAAAAAAGGGCAGACTGGGCGAGTTCCTTGGATTGTGGAGCGGACACTCCAGTTAACTAGCGGCCTTCATTGATTCAAAACCCATACGATCCCGACAAGGTCATCTACACCTCTTTGCAACCAAGCAGTCTTTTTAACGACCTAAAGAAGCTCTCTCACGAGACTAGCGACATGGAAATTAAATATTAAAGAGTTTTTAACAGGCATTCAATGTGGATTTTCGATGACATTACGGTTTGTGAATCGGCGGAATGTTTTGAAAACATGGTATCGTTGCGTCGGCGGCTTGGTTACGATTACGTTTGAAATCTGCCGAAGATCCGCTTTCGGTGTAACCCACAATTGGCACAGCAGCAATCTGTTGTAACACGCAACTCTTCGGATTAGGTGTTTCTACTTATCCGCTCGGCTTCAGCAATTGATAATGCAGCTTGTAGATCGATTAGTCCAGTTGATTCTCGATAGCATTTGCATACTTTCTCGTGACTTTTTTTGCATTCCATTTGCTTTTAGTCACTTCGGCTTTCGGGGCTTTGCGATTCAATGTTGACAC
CATCACTTATCACAACGAACGATCGACTTTGCCTGGCTGAGGTGCTGGCTTGGCGGGCCGAGCACTTTGGTGATCGAAGGGCGTTTTCGTTCTTCCGACGGCTGGAGGATGGCTGCATTTCCATCACATATGCCGAGCTTTATGCGAGAGCTCGGGCAGTCGCCTCTTTGCTCTCGAGCCACGGCATCAGCGGCGAGCGAGCCGTGTTGTTGTTCGACTCGGGCTTTGAGTTTATCGAAGCACTTTTGGGATGCTGGCTCGCAGGCGTGATCGCGGTTCCCGCACCCTTGCCCGCACCGAACAAACGGAGTTCCCGACTTGAGGGCATTCTTCTCGACTGTGATTCGAAGACCATCCTCTGCACGGCAGCAACACGGGAAGAACTTCGATTACCTGCGATGGATGAACGCTTCGAAGTGATAGCGATTGGTGGCTCAGAAGGGTTTTCGACTGATCGACCAGACATTGCTCGCTTTGAGCGGCCGCTCGACAGCCATCTAGTGACCACAAGTCCATACGCCACCAATCCCGAAAGCGTTGCGCTGTTGCAATACACTTCAGGCTCAACCGGCACACCGAAAGGTGTGCGGGTGACCCACGGGAATATCTTGGCGAATGTCGCAGCGATGAGCGACCGCCTGGGGTGTACACCCGCCGACACGGGGTTGAGCTGGCTGCCAATGTTTCACGACATGGGACTCATCGGCGGCGTTCTACTTCCTCTATACGCCGGCTTTGAGACCGTATTGTTTCCTCCGGTATCGTTCATTGAACGCCCACTCGCTTGGCTACACGCAATCTCCTGCCTGGATGTGACGATTACAGGTGGCCCGAACTTCGCTTACAACATTTGTTCACGCCGGGCCGCGGCACAAGGGACAGCAGGGCTCGACCTAAGCTCATGGCGTGTCGCCTTTTGTGGCTCGGAACATGTTTGCGTGGAAACACTCCAGCAGTTTGCCCAAGATTTTTCCAAGGTCGGTTTCCACAAAGAGGCCCTGTTACCGTGTTACGGACTGGCCGAAGCAACGCTGTTTGTCAGCGGCCGCGGTCGCGGGGAACTGTTTGGCACCATTCAGGTTGATCGTGAGCATTTGAGTCATGGCCGGATCGTCATCAAGGAAGCCGATCAAACAACTCTTGGTACAGAGCCACACGGGGATCTCGCGCCCAAAAACTTCAAAACGCTCATGTCGTGCGGATCGGCGGCAAGCGGGCACGACATCGTAATCTGCGATCCCAAAGAAGACCGCTTGGCCCCCGCTCGCTCCGAAGGAGAGATCTGCGTTGCAGGACCAAGTGTCACACCCGGATATTGGAGTTCGGAGCATAAATTTGTCACATTGCCAGTGCAGGATGGCCAAGCCCAAGAGGGCCAGATTTCAAACCGACCAGGGAATTCTACACGTCAATACCTGCGGACAGGTGACCTCGGTTTCATGCACGGCGGAGAATTATTTGTCACTGGCCGGTTGAAAGATCTGATCATCGTGAGGGGCGTGAACATCTGCCCTCACGATATCGAGGAAGCTGTGGCTGGGGCACACGCTGCAGTGCGTGCTGGCGGCATCGTGGCGTTCTCGGTTGGTGAACATGGCCACGAGCAGGGAGCCGTAGCATTTGAACTTGAGCGATCATTTATCAATAGGGTTGATCTGGATACTCTATTCAATGCCATTCATGCAACTGTCGCTGAAGAAACGGGAGTCGTACTTGATACGATCGTGGCCTTAAAGCCGCATCGAATTCCACGAACCTCCAGTGGTAAACTTCAGCGCAGCCTGTGTCGAAGGCTATATACTGAGAGCCGCCTTGAAGGGATTTTGGGGCAATCGCGGCGACGGAGTGCTTCGACTTCAGAGAAGCTAAGCCCAAAACGGACGCACATCAGCGGACATATTGCGACAAAGTCACAGATTGACGCCGATCCGATTGCGGTTGTGGGGATGGCATGCCGCTTTCCTCAAGCTGATTCGATCGACGATTTCGACAGGTTGCTG
AGCGAGTCGCGGGACGCGATTACAAGTGTCGGCTCGAATCGCTGGGATCCGGCGGACTACCAGGCCGCCGATGGGAAGGAACTGCCGATTCGGTTTGGTGGCTTCTGCGAAGGTATCGATCAGTTTGATGCAAGCTATTTTGGCATAACTCCCCGTGAAGCCGAGAAAATGGATCCTCAGCAGCGAATGCTGCTAGAAGTGAGCTGGAACGCCCTGCTGGATGCAGGATTGGGAGGTCCGCAGCTGAATGGCACCAACATGGGTGTGTTTGTCGGTGTCGGGAATAGTGACTACTCCAAGCTAAATGTACTTGCATCGCCCAATTATGGGGCCGTCGATGCCTATTCAGGGACAGGCAACGCGCACAGCCTTGCGGCCAATCGATTGTCTTACACCTACGGCTTAAGGGGGCCCAGTCTGTCGATTGATACGGCTTGCTCGTCGTCGCTCGTGGCGCTGCATTATGCCTGCCAGAGCTTGCGGACCCATGAATGCGAATCTGCTCTGGTGGGCGGTGTCAATGCACTCCTTTCCCCAGCCGTTTCAATTGCGTTCTCCCATGCGAGAATGCTGTCTCCTGATGGACAGTGCTTTGCATTCGATACCCGAGCAAATGGCTACGTTCGTGGCGAAGGCTGTGGCGTAGTTGTATTGAAACGCCTTTCCGATGCGGAACGAAATGGCGATCGAGTCTTGGGGATCATTCGTGGGACAGCCGTCAATCATGTCGGACGCAGTCGAAGTATCACGGTTCCGGATGCTGAGGCACAACGCCAGGTGATATTGACCGCGCTCGCAGCTTCGGGATTGCAACCCGATGATTTGGACTACATCGAAGCTCACGGGACTGGTACTCCCCTGGGCGATCCAGTGGAAATGGACGCGATTAAGAGTGTCTTTGGACATCGCGAGGAGAGCCGTCCCTGCTATTTCGGCTCGGTGAAATCAAACATCGGTCACCTCGAGACTGCGGCGGGAATCGCCAGCCTCATCAAAGTACTGATGATGTTCGGACGTGAGCAGCTGTATCCACAGCGTAATCTGAAGTCGCTCAATCCTGAATGCGACCTGTACGGAACCTGTCTGCAACCAGCTTTGGAGGTATGCACGTGGCCACTGCCGGCACGCACGCGGCGCGCTGGAATCAACTCATTCGGGTTTGGCGGCACGAATGCACATGCGATACTGGAAGGCCCGGACTCATCTGCGAAGCCAGATATTTTTGCAAAGCCTGAGATCGCAAATCCGTTGACCGTTGGGCCGGTTCTTGAGCCGCAGACTTCCAATCGGCCAGTTCATCTGCTGCGTGTATCCGCATATTCAGAAGTGGCCGCTCGCAAAGCAGCAGCGGATAACGCGGCATGGCTTGCACGCCACGTCACCGATGAGACCTTGGCTGATTTCTGCTATACAATCAATACGGCAAGCGATATGGAGGAGTATCGCGAAGTGGCGATTGCCAGCGACGTGGCATCGATGCAGGCCAACCTTTCGCTGCTGTCGAGCCAGGAAGGGTCTTTTGCCGTCTCGAAACGATTGACTGGGGACGCTCCCAAGATCGCTTTTCTGTGTACCGGCCAGGGATCACAGTATCTCGGTATGGGTCGCGAACTTTACAGAAGTCAACCGGTGTTTCGGGACCGTATCAACTGCTGTGACAGAATTCTTCGAGGGCATCTTGACCGTTCGCTGATTTCGATTCTCTTTGGCGATAAAGACGGTCAATCTGAGATCGGTCTGACGCAGTACGCGCAACCGGCGCTGTTTTCCATCGAGTGGGCCTTGGGAGAACTATGGCAGAACTGGGGCATTCGTCCAGCAGCGATGATGGGACACAGTGTCGGGGAGTATGCCGCCGCGTGTCTCGCGGGCGTTCTCTCTGTCGAAGACGCACTGCTGTTGGTTGCTGAGCGCGGCCGACTGATGGCCACCCTGCCTACGCGAGGCGGAATGATCAGCGTAATGGCCTCTGCGGAGGTCGTTCGAGAATTATTGGCGAA
CTCTGGCTTGGAAATTTCGATCGCCGCTGAGAACGGTCCGCGGATGTGTGTGATTTCCGGCACTCAGGCCGATCTGCATGAGATCACAGCGCAGCTTCGCGGCAAGCGCATCGCGACCCAGGTTCTCTCTTGTTCCGAAGGGTTTCACTCGCATCTGGTAGATCCGATCCTGCCTCAGCTTGGCTTGGCAGCTTCACGCGTGCAGCATGCAGCAGCGCGAGTCCCTCTGGTGTCGAATCTGACGGGAGCACTGCTGGCAGAAGGTCAAGTGCTGGACGCTGGGTATTGGCAAGAGCACGCGCGTGAGGCGGTTTTGTTCGAGAAGGGCATGCGGACGCTGGTCGATATGGGCATCGATTGCTTCCTGGAAATTGGCCCTTCGCCGACCCTGCTGGCACTGGGTCGCACCTGCGTTTCCTCCACCCGCCTGCGGTGGCTGGCATCGTTACGTACTGGCCAACCCGATTGGCAGGTCATTGCCAAGACGCTCGAAGGCCTGGTGTCGCTCGGTGCTCCGATTGACTGGCGTGCGTTCGACAAACCCTATGCCCGGCGGCGTGTGCGCGTTCCCAGTTATCCGTTTGAGCGGCAGAGGTACTGGTTACCGGAGACGACCCCCTCGGCTGCCACTTCTGGATCAGCCTCGGTCAACCATCCATCGCTGGGAAAACGAGTCTCGCTGACTGATTCGAGTGTCGTGTTTGAAGCACGACTGAACTCCACAATGGGTGATCAGGCCAACGAACATCGACTGTTGGGAACGCCGATCTTGCCGGCATCGGCGGTCATTAGTATGGGATTCGCTGCGTGTCGAGCCGCTGGACAGTTAGGACCACTGGCAATATCCGATTTGTCGATTGAAAAGCCAGTCGGCTGGGACAATTCAGCGGATCGCGTGCTGCGTACTGAAATCGCGGCCAAGGATAGCAGCGTGTTGTCGATTGTCATCAGTAGCCACGAACCGAATGGAATCGCACCTCTGGTTCACGCCACCGCGAACAGGTCTTGCGCACTGCCGCCGCACTCACCGGAGGGACCGCTGGAATTGGCGGCAATCACCTCACGCTGTGAAGACGCAGTCGAAGCTGATCAGTTTTACACAAGCCTACGGCAGTTCGGAGTGGACTACCGACTCGAAGCACGACATTTGACGGGAATTCGAGCTGGTCAGATCGAGCGACTGGCCTGCTATCAGTTCGATTCGGCATTCGGTACTGCCAAGACGCTCTCTCCGGTGATGATCGACACCGGATTGCAGTTGTTGCTGAGCACGGCAGCTTCCAACGACGATGCGACCGTCCATTCGCCCTATGTTGCAACTCGCATCGGTCAAGCTTTCCTCGAGTGCCGTTCGAGCAATCAAGTTTGGATTCATGCCACCGTCGATCACGCGACAGCCACGACGGGAATGGTGTCGTTGCTGGAACAGCCACTGCGGGGTCATGTTTCCTTCGTCGACGAAAACGGTCATGCATTGTTGCGGCTCGTCGATGTGGAACTTCAGCCCATCGCTCAGAAATTTCCCACTGAGCCGCAGTCGGTTTTGAATCCTTCTCTAAGTCGCGAGATCCGGCAACCGGTCAATTCGCTTCTCCAGAGTGAGATCGATCGGCCACCTGCCAGGTGGATCCTTGTTTGCGATCGAGGAGGGTTGGCAGAGTCCTTGGCTTATCAACTCCGCGAACTGGACAACGAGGTTAGCCTGATCGAACTTGCGGGCCATTACGGCCACAATTCAAAAGCTGCGGAGTGTGCCCTGACCGCATGTGGAAGTCCTCTGCCAAAGGCTCCTATTGGTCTCAAAGAGCACTTGGCAGCGTTGGTGGCCTCCCTCCCGAAAGCATCCGTCAGAACTCACCTGATCGACTTTCGATCGCTGGATTTGCCTGCCAATGAAATCGCTCCATCGTCCCAAAGATCAAATCCCGATTGGCCTGACGACCAGACAATGAACGAACTCGCGATCCTTCGAGAGAAATTGGATGGCGAGCATCAATTGG
CCGGAATGCTGGCTGTCGTTACACGCTGCACGCTCAACTCCAAGGTGGCTTCACCCACGAACACCAGATGTTCTCACGATGGGTTGCCGAGTATCGAATCACATTACCCGCAGATTTTGTCGCAGGCTGATCTGCGACTGATCCAGATTGGTGCGGACGGAAATGCCGATCTCCCCTCGCTGCTGGCAGAACTTGTATTCGGCGAGCCACGGCGAGTTGTCTGTCTATGTGGATCGGGCCGCCGGAAGTCGTCGAGACCACTCGCCGAAAATGGTGTGGGAGTCGTTGTTCCCATCCAATTGGAAGTTGGGGTATTCGAGACCACCTCGAATGGATCTCCAATACGAGAGGCATCCGGGCGATCACTCTCGGAGAGTGCCCTCGGCGGTTCGCAGACCCGACACGGTCTATCCCACGGTTCGCGGCTGGCGATCTCCCCATCGACGGTGGAGGTGAACGGCGATGCAGGATGGCTCGACCGTCTGCTCAGTAACGAAGTGCCGGAGAAGCGACACGCGTTGTTGCAAACGCACTTCCTCGACATGCTTACCCGGGTAACGGGCCAGGATCGGCGTCTGATCAAGCCTGCTGACAACATTCGTCGTCTCGGCCTCGATTCGTTGATGATGATGGAGTTGAAGAACTCAATTGAGATGAGCCTCGGCATTACTCTGAATCTGACGCTTCTGTTCCAAGACCCGTCGATCGAACGGCTGGTCAGTTTTTCGATCGATTTGTGGAATCAATCCAAAGGCAATGACCGAAAATCGTCCCATCCACGACCGTCGAAGAAACCTGTTGCACAGGGGGCATCAGAGTGACACCAGCCCGACGAGTCGTGGTAACTGGCCAAGGCGCGGTGACTCCCTTCGGGATGAATGCCGATTCGCTTTGGGCGGGAGTCAGTCAAGGCCGAAGTTCGATCCGGACCATTACCGGTTTCGAGACATCGGGCATGCCGGTAACCTTTGCAGGAGAGATGGACGATTTCGATCCTTGCGAAGTACTGGGACGTACTTTACCCGCGACTGGCGACAAGCCACTGAAAATGGTGTTTGTCGCCGCCGACGAGGCACTCCGGCAGGCCCAATTGCTCAGCTCAAACAAATGCCTCGAAGACCTACTCGTCCATACGGTTCTGGGTACCGCGCTGGGGGCTTGCTTTGAGCTGGAATACTCATATGGCTGTTTTCACAAGCTGGGCTGGAAGGGCATTCGTCCGACGGCCGTCCCCAAAAGCATGTTCAACACCTATGCCAGCCAGTTGTCAATCGAATTTGGCTTGAAGGGCATGAATCAAACAATTGCGTGCGCGTGCGCATCGGGAGCTGCAGCGATTGGCCATGCTTACCAACTCATCAAGCATGGCATGGCAGATATCATCCTGACAGGAGGAGTTGACTCGCCATTGTGCCCCAGCATGTTTGGCGCGTGGACCAACATGCGGGTATTAGCCCGTCACGAAGAACCGGAAAGTGCCAGCCGTCCGTTTGACCGCGACCGGGGCGGACTGGTACTTAGCGAAGGTGCCGGAATGCTCGTCCTGGAAGCTGAGGACCACGCTCTGAAGCGAGGCGTGGCTCCGCTCGCCGAGATTCGCGGCTACGGTGCGACGAGTGATTCGGCTCATCTGACAGCACCGACCATCGAAGGCCCTGTTCGTGCAATGCGAGCTGCGATCCTCGATGCCGGATTGCGGCCGGAAGATATCCAATACGTCAATGCCCACGGGACGGCCACCCAGGCTAATGACGAAAACGAGGCGAGAGCCCTTCATGATGTGTTTGGTAATCGCGGATGGACTTTGCCGATCAGTTCCACCAAATCGATGCTCGGGCATAGCATGGGTGCCAGTAGTGCGTTGGAGGCCTTGATATGTATCAACACGCTGCGGTATCAATGGGTTCCCCCAACATTAAATTGTGAACACCCTGAGTGGCCCGAGTTCGAGTTCGATTTTGTCCCCGGCCAAGGGCGCGAGCATCGCGTGCAGAACACGC
TTTCAAATTCCTTTGGCTTCGGCGGCACCAACTGTGTTCTTGTGATGAGTAGACCGGAATGAGCATCCCCACTCGAATCGCTATCATTGAACAAATGCATCAATCGCTTCAACGCCTGCGCGGGCGCCGTGTCGCGGTGGATCAATTTACGGAGGAGGCGAGGCTTAGTGAGGAACTGAGTCTTGATTCACTCGATCTATTGGAGATCCGCTTCGACATCGAGGAAAAGTGGAAGGTGGAACTGGAAGACAAAGAAGCCGCGGCTCTGGTGACGGTGAAAGATGTCATCGACCTCATCCAATCCAAGATGTCGACAAATCTGGATGAGAAGCCGTGACAATTGCCTATCCGTCTCCGAATCCCCCGCTGCAGAGCAAGGTCTGCCATTCACCGCCTCGGCATGTGCTGGCGATTTGCTACTCTCAGAGTGGAGATGCCGCCCGCTGTGCGAAGGCCTTTCTGACACCACTGAAAGAGGCGGGAGCTGTCGTTGATGAGGAGTGGATTAAGCCGATTCCCGCCTATCCATTTCCCTGGAAGAGCGTGCTGCGGTTTTTCGATGTGATGCCCGATTGCATTCTTGGTAAGGCACCCAGCATCGAAGAACCCCAGTTTGATCCCGATTTGCCCTACGATCTGGTGATTCTCTTCTATCAGGTGTGGTTTCTCGCTCCGTCACTTCCCCTGGTCGGATTCTTCAATCATCCAAAGTCCCGAGTGCTCAATGGTCGGGCAACGATTACGGTCGTGGTCTGCCGCAACATGTGGATGGTGGCCACGGCCGAAGTCAATCGCTGGCTGGCGAAACTCAAGGCGACTCACCTCGATAATCTAGTGGTGACCCATCAAGGTCCAATTTGGGCGACATTTATCACGACGCCCCGCTACTTACTGTTCGGCCGCCGCGATCGACTCTGGAATCTCTTTCCAGAACCGGGAGTTGGTGAGTCTGAACTCGGACGCGTTCGGGAGTTTGGAAATACTCTTGCCAAACAGCTTGACCAACTGGTACCGGGACGGACTCAGCCGTTCTTCACGGGACTCGATGCGACGCGGATCGAAGACAAATACATTCTACCCGAACTCATTGGCAGCCGTTTGTTTCGCTTTTGGGCGAGCGTGATTCTGGCGATGGGGCGTCTTGGCGGACGCTATTTGCGATCTGTTGCGGTCGGATTGTTCGTTTTAAACCTTGTCTGCGGGATCGTGCTCGGAATTCCGATCCTCTTCGTCTTTCGAATACTGGCATACCCGTTGCTTCGTCCGGTGCTGCAAAGCTATAAGATGGAGATGCGGGCCCCTTCCGAACCGCGGCCAGTGGATGCTGCAAAACATCCTGAGACGATCGATGTTATCGGCTAAGCAATTCAGGCAGCCCGCCCCTACATTTGAAACCGGTCGTCATCGAAGAGACCAACCCCGAAATCCCACGCGTGCAATGGCCGAGTCCTTCAGGTCTTTCTTAGAGTGCGAGTGAACCGATGTCCGACGTATACATCAGCGGGGTCAGCTCTTTTCTTCCCAACGATCCCGTAGATAACGAGAATATTGAAAACGTCCTTGGCCGAATCAACGGGCGTTCTTCGAAGGTAAAGGACTGGGTGCTTGACTATAATGGAATACGGACGCGGCATTACGCGCTCAATCCGCAGACCATGCAACCGACACATACCAATGCAGAAATGACGGCACTGGCGGTCCGCCAGCTGCTAACGAACCATAATCTGTCACTCAGTGACATCGAGTGTCTTGCCTGTGGGACCTCCAGTGCTGACCAGATCATTCCCAATCATGCGGCCATGGTGCATGGTGAACTGGGATGTCCTCCCTGCGAGATCGCCTCGACAACGGGAGTGTGCTGTTCGGGCATGACCGCGATGAAGTATGCCATCATGAATGTTAAGAGCGGACTGGCAAAAATGTCCGTCGCAACCGGTTCGGAATTGGCGTCGCTTTCGTTTCGGGCTTCGAGATTCACACCACAGATTGATCGGCAGATC
ACTGAATTCAATCAGGAGCCAATGCTGGCTTTCGAGAACGACTTTCTGCGGTGGATGTTGTCTGATGGTGCCGGTGCCGTGCTGGTGACCCCCCAACCGAGTGCTGAAGGTTGCTCGTTGCGGGTTGACTGGATCGAATTTCGAAGCTTTGCAAACGACTTCGAAACCTGCATGTATTTCGGAGGGATCAAGACCCCCGACGGCAACCTTGAAGGATTTCGCATCGTTGATGACCCTCTCGAGCTGGTAGGCCGCGGGTATCTTAGCCTCGCACAGGATGTACGGGTGCTGCGTTCCAATCTGCCCAAGACAGTACGAGATACATTCGTGCACTGCCAATCCAAATATGGATTGGAAGAGGACGAAGTCGATTGGATCCTGCCGCATTATTCCTCGGAAGGATTTCGCGAGCCGCTGCAAAAGGGGTTGAAAGAAGTCGGCTTCAATCTGCCTGAGGATCGCTGGTTCACCAATTTGCATACCAAAGGGAACACAGGTTCGGCGTCGATTTATATCATTTTGGACGAATTTATATCTTCCGGGCGTGCACAGCCCGGCGATCGAGTGCTCTGTTTCATCCCGGAGAGTGCCCGCTTCACCATGTGCTTCATGCATCTAACCGTTGTGTAAGTGTCAGGCGGACTCACTCATCACTTGTCACGGCTGCTGTTGACGCCGGTCACCCGTCGTAGGAACTTACCATGCGCCGCGTCGTAGTCACCGGATGCGGAATCGTCTCTTCCATCGGGATTGGAAACAAGACCGTTGCTACATCACTTCGCGAAGGTCGCTCGGGGCTCCAGTTCGTCCCGGAAATGAAAGCTTATGGCTTACGCTGCAATGTCGCCGCGCCGATTGTTGGATTCGATGAAGCGATCTTTCAAGAAGACGGTAAACCACGGTTGAGTAGAGCGGCACAGTACGGCTTAACCGCAGTATTTGAGGCAATTGCCGAGGCTGGGCTGATGCCGCATTCGGAGCCGCTCAACAGTGCTGCGGTGATCTTGGGAAGCGGTGGAGGCGGCCAGAGCTGCTTGCCGGAAACCGCCTTGCGAAGCGACGCCAATCCGATCGACGAACCAGGAATCTATGAACTGCGGCGACAGATGAACGAGACCGCTGCAGTCGCTGTCGCTCAGAGACTGGGTGCTGAAGGACGGGTCTCATCAAACTCCGCTGCGTGCGCGACAGGGCTATATAATATTGGGTTTGGATACGAATTGGTTGCTGCCGGCCTGCACGACTGCTGCATTTGCGGAGGTGTCGAAGAGCAAAGCTGGCAGCGTGTAGGGGTCAGCGCCGACAATTCCCTGGGGATGCCGACGACCTTCAATGAAACACCCAAGGCCGCTTGCCGCCCGTTCGACCGTGACCGCCAAGGGTTTATCATTTCTGAGGGCAGCGCGGCCGTGGTGCTCGAGTCTTATGACGCCGCAATGGCTCGGGGCGCGCCGATCTATGCAGAGATCGTCGGCTACGCGGCCGCAAATGACGGACACGATTTGTATGTGGCCAATGGCGATGCGATGCGTCGCGTTGTGCTGAGCGCGCTCTCGGATGCCGCGAAGCTTGGAGTCGCGAATATCGATTATATCAACGCCCATGCAACCGGCACGCCGATCGGTGATGCAATCGAAGCTGGTGTGATCCGTGAGATTTTTGGAACTGGTCCAGCCGTCAGTTCCTGTAAAGGTGTGACCGGCCATTCGCAAGGAGCCGTTGGCGCTCAGGAAGTCGTTTACACCGTACTGATGCTGAGAAACGAATTTCTTGCGGCGACAGCGAATCTCGAGCATCCGTCGGACGACTGTGGTGGGATTGATCATGTCCGTACCCGGCGCGATGTCAGGATCGACACTGCTCTTACCATGAATAATGGACTCGGTGGAACCAACGCCGCCATGATTCTTCGAGCCCTCTAAGAGCGAAGGAACCGAAAATGTTTCGACGAATTGCCATTACTGGCATGGGTATAGTCAACAGCTTGGGCAATAGCGTAC
CAGAAGTTCTGGGATCGCTTCGAGCAGGCAAGTCGGGCGTAAGCATTGTCGATGAGTATCGTGAACTTGGATTTCGAAGCGCACTGGCTGGAACGCTGAAAGACTTTCAGCCGCCTGAGATCGACCGCATCTATCTCCGGCAAATGGGCGACAACGGAGTCCTCACCACGGCGGCCACTTTCGAGGCGATCCGCGATGCCAATCTGAGCGACGAACTCATTCAGCATTCGAGAACCGGTGTCATCGTCGGGAATTCGGGAACGTACAAAGAAACCTATCAGCTCTGCCATCAACGGCGCGATTTGGGAAAGAAAATCACAGGCTTGGCTCTGCCTCGAGCCATGGCATCAACAGTGTCAGCCAATCTTTCGGTGTTGCTCAAGACCAAAGGACATTGCTTCACGGTTAATGGCGCCTGTGCCGGGGCCGCTGTTGCAATTGTGCAGGCCGCCTGGGCAATTCGCCTGGGTCTTCAGGATCGCATGATCACCGGGGGCGTTCACTGGGGATCATGGGAATTTGACTGCTTGTTCGACGCCCTGCGGGTCTTTTCACGACGTGAAGACAATCCTACGGCGGCTTCGCGTCCATTCGACGCCGATCGGGATGGGCTGGTTCCTTCCACCGCAGCGGGTATGGTCGTCTTAGAGGACTGGGAACATGCCATCGACCGCGGTGCGAAGATTCATGGCGAATTGATCGGCATCGCCGTGAACTCCGATGGCCAAGAGATGACCACTCCCTCCGGCGAAGGAAGTGGCAGATGCATCAGGCTGGCACTTGACGACGCGGGCATCGGTCCTGGCGATATCCAATACATTAATGCCCATGCAACCGGAACAAAGCTGGGTGATGAAATCGAAGCCCGCACGATTGGCGAGATCTTTGGCCAGACGCCGTATGTGAGTTCGACCAAGTCTGGAACAGGCCACGAAGTCAGCGCTGGCGGTGCCACCGAGCTCATCTACACCCTGCTGATGATGAAGCACGGCTTCATTGCGCCCACGCTGAATTTGGAGCGGATCGATGACCAATGTGCCATGATTCGACATGTCCCCTGCCATCCGATTGAAGCCGATATCCAGCTGGCGATGAGCAATTCGTTTGGATTTGGGGGAGTGAACGCTGTCTTGCTTGTACGGAGAGTCGAAACATGACGCAGCTCCCTCACTCAGCCGCGGTATCGCCCACAGGTCGGAGAGTTGTTCTGACCGGGTTGGGGATTCTCAGTCCTATCGGAGTCACTTGCGACGCAGTCTGTGCTCATTTGCAAGCGCCTTGCCACCCGGAAGAAGCGTCCGCCAGTGCCATTCCAAACGACGGATTTCTGGGCCACCTCAAGGGTTGGGAGCCAGAATCCGATGAGAAACTGGGGCGGCTCTCGGTCGGTGCACAGTATGCGGTGCAGGCAGCGGTCGACGCTTGCGGGCAAGCGCGGCTTGATTCGTCATCCGTCGACACTGCGCGCGTTGCGGTGCTGATGGGTACGATGTTTAGCAGCATGGCTGAAGTCGTCAGGATCAAACAACTCCTTGAAAGTGGGAAAGTCCGCCGCGCGGGTGTGAGGGGTACCACCAAACTGATGAACTCGGGTCCGGCCGTCAATATCGCGGCCAGTCTTGGCCTGCGTGGTCCGGTTATGTCGAACTCGACCGGGTTTGCGTCCGGAATGGATAACATTGGCTGTGCTTACGAATGGGTCCGTGACGGTCACATCGATATTGCCGTCTGTGGTGCGTCGGAAGAAAACTGCGCACCGTTTCTGGGCAAGCAGTTCACCCTGTGGGAACGCTCGCCTCGGGGCGTCGATCTGAACCGCGAAGTCACCATCCGCCCCTTCGATGCGCGTCGAGAAGGTTCTATTCTTTCGGCTGGGAGTGGAGTTGTTGTACTAGAGTCGGCCAAGCATGCTCTCCGCCGCGGTGCGCGCCCCCTTGCGGAAATCTGCGGATATGCAAGTTGTTTCGACGTGGAACCCGGCCCGGATTCATTTTCGACCG
CCTTGACTCGCGCGATTGAGAGAGTCTGTCGACATGCCGAGCACAGCGGCGTCAACCGTCTGAGGCAGGTGGTTTCCGGTGCCGCTGGATTCCATTTAGCCGATGCCGATCATGTGCTCGCCATCCGAAACAGTTTGGGAATGTCGGCTCGGGTCACCTCCACAGCCGGTTGGTTGGGGCATGGCTTGGCGGTGAGCAGCGCGTGGAATGTAGTCCTTGCCTCATTGATGCTCTCGAACGGCTTTCTTATCCGTTGTCGGGGCTTGACTGAACCGGGCGCCGACTGTCGCGGGGTCGAATACGTAATGAATGGAGAAATAGCATCGGATCCGGGCCTTCTGATTACCGCAGGAGGACTTGGATTTGCCAGTTGCCTCGCTCTTCAGCGTTTGTCGCCCATGTCGGACTAAGAATGCCAACATGGTCTGGATCGGCTGTCTCTATCAGATCGGTTCCACAACGGACTCAATCGGCTCGCCCTCGACAGGAGGACGATCCAGAGAAAGTCGAGCACGCTCCTCCGCGATCCTATCCTTTAGCATTTGCACTTCAAACATTCGGTCGGCAATTTTGCGTTCGAGCAGTGCGATATGACGACGCCGATTCCAGATCACACGATGAATCGGCTCGTAGGCCAATATCCCAAGAATCACTCTCCGAAAGAGGATCACCACAAGCGCTGAAAACAATGGTGACAGCAGGATCATCGCTGTCGTGCGGTGTTGCGGAAACATCTTTCGTCCAATTATGGAGTACAACACGCAACATTCGACCACCGCAGCGATCCAGATCAGCTTCATCGTTAATTGAAAGACGGGCAACTTCGGAGCTGCCAGCAACGATTCGCCACTGTTCGCAATGGTTAGGATGATGCCCAGGAACACCAGCAGCGCCACCGTCAAGCCTACAGTAACAGTTGCCGAAACGAGTTTGGCCAGTCGTGCATCAGGCAGAGTCCACCCCATTCTCGGAAGCCAGATGAACGATCCGGTGCCCAGGTCACGGCAAGTCAGTCGAAAAGGTGTAAGAATTACGGTCCAAGCGAATTCCCGGGAAATCGTTGGGCAAATGATACGATAAGTTAGATACTCCATGAGACAAACGACCTGAACCGCAATGCAGACCCAGATCGACCACCGATCCAAGCGAACTGCATCACGTGACACAATGGTGACCACCAGATCATGAAATGCCGCCAGATACAACACGATTGCGAAAAACATGGGGGCAGCCAAATGCCGATTGAGCAGAACCAGCCATGACGGTAGCTCGCGATCCGGGCACCTGTTGCTTTGGCTTGTCTTGACTTCCGCATCGGTACGGTTAACGGCATCAATCCCCATGCCGCGACACTCCCCTCGAAATCCAAAGGGAACTCCCGCCTGCTCCCATTGCAAAGCATGACATGCAGGACAGTATTGTACCTTTGGTATTGAACAGACGCGAGACTAATGGTGATTTCCGGATTTTGAATGCGGCGAACTTATCATGCAGGATCCCTCCCAAGTAACGATCTTTCAGCAGCGACTTGTGGCCATGAATCGTTTCATTGCACCGCTGTTTTTTGTGCTGTCACTGGGCTTCCTCGCGTTGACTTACCATGGCTTGCGGAGCGTTGGCCAGTGGGAGGGACCACTTCCGGAAACAACTGCCTTGTGGGGACTGATCTGGCTTTATCCCCCCTTCATTGTCGAAGCCTGCTTCTATGCGTGGTCTGGATTACCGGGAGGCCGACAGTACCGCTGGTCTGCGCTTTTGCCTCCATTACGATTGACCCGCCGCGACTTGGAAATGCAGCACTTGATCTGGTGGCCCCTCAGGAACTGGAAGCCGGCCGATCGGCAACTTGCCAGGCATGTGATAGTCTGCGAGGCAGGGATCTTATGGTTCGGCGGAATCGCGCTGATCGCTACGGCAGCAATAAGTTCGGCCAGCATCACTTCCGTGGAGTCATTTACGAACCACTTCGTCATCGACGCGATTTTCGGCGCGATGTGGGT
CGTGGTGATGGCGGACACTTGTGTGCTGCTGATAATACTGCCGCGCGCTAGGCGTAAGAGCCTGGTCATTACCTGCCTGCCCATTCTGGCATCGCCGGTAGTACCACTACTGGTTGTGGCGTACTTGCCAAGGCTGGTGCGCCGAACCATGGCTTGGAATGAGTTGCACCACTACGGGATTCGCGACCCCGCCGAGCGACTCAGGCTTTTGGATGCTCAGTTGCAGCGGCGAACGGCCGAAGTAGAAGAGTTGCAAGATCAGCTTCGCCGCGAACAACAACTCAATTAAGCCAATTCTAGCGACAATCCATGACGGGTGTCGTTTGGGACTGGATTCCGTTCAAGCCATTGCTGAGTCTGCAACTATTCAGCGGTGGGTTCGCCTTTGAGACTCTCTTGGGTATAAACATTGATTGCCCGGATGGCTTCTTCAGCGTCCGTGGTATCGATGAGTACATCGCCATCTCGAAGGACAATCTGCCGGCGTGTGGCCCGCGCGACATTGTGATCGTGCGTTACCAAAATCACCGTAATCCCATTCTCACGATTCAGCCGATGGAATAGTTTGAGAACATCACTCCCAGTTCGTGAATCGAGATTTCCGGTCGGTTCATCAGCGAGCAGAATGGGCGGATCGTTTGCCAATGCCCTAGCAATCGCAACTCGCTGTTGCTGACCACCAGACAACTGACTGGGATGATGATCGTGACGGTCTCCCAAGTCGACCATTTGGAGCAGTTCAGCCGCACGCTGCCGGCGCTGACGGGTCGTCAACTTTCCGGAATAGAGCATCGGCAGCTCTACATTTTCCAGAGCGCTGGTCCGGGAAAGGAGGTTGAAGTTTTGAAACACGAAGCCAATTTTCTGATTGCGAATTCGAGCGAGGGCGTCGGGATTCAGCTTTTCAACCTCGACGTCGTCCAGCCGATATGTTCCCGATGTTGGACGATCCAGACAGCCAACCAGATTCATCATGGTGGACTTTCCCGAACCCGATGGCCCGACCAGTGCCAGGATCTCCCCCTTTTGAATCTGGAGGGAGACACTCCTTAGAGCGTGCACTTCCACATCCTCGATCCGGTACAGCTTGCCTATACCATCCAGATCAATAAGCGCCATAAAAGCATAACCTTCCAACGGCGGGTATAATGAGTGCTGATTCTGCTCGGGTCGCCCCGGAACCGTTTAGGTTAGCCGCCGTTGACATGCATTCTAGCTGACCTTCGACTTGACGAATTCGACCATCTCGCCATAGGTCCGCACGTTCTGAGCATCTTCATCCGAGATATCGATGTCGAACTCGCTCTCAAGAGTTGCCACTATCTCGACCATGTCGATCGAGGTTGCTCCCAAGTCGCTACGCAGCTTTGCCTCTGGAACGAGTTCGCTTTCCGGAACTTCCACATTGCTCAGGATGATCTTCTTAATGCGGGATTCGATTTCCATTGACTCGACCTTTTGATTGACAAATTCGGCTTGGCGGCACACTTGATCGATACGTTTTCGCGTTTGTCAGAACGACGCCCGACACCCACAACACGATTCCAACTGGAGATTTAAAGCCCCCTGTGTCATAAGCAATTGAAAGCAACGGTCAAGCCTGAATCAACTCTATTCCCGCGTGAATCTCCGTTCCCTCCAGAGTTTCTTTCCCGAAGCCGTCAAATCGCGCCAATGTCCATGCCATGCAGTTATTCGTAACGCAACGCATCGATCGGATTGAGCTGGCTGGCCCTCAGAGCCGGATAGTACCCAAAAAAAACACCAATCATCGCCGAAAACAAAAGTGCCACAACTGCCGCTTCCACCGAAATGATGTACGGCCACGACGTATTGGGTGTCAGCCAATTGACAAGCATTGTCAGCCCGGCAGATCCGGCGAGCCCCAACGCAATGCCTATAAAACCTCCGAGACATGAGAGCACGACTGCCTCAATCAGAAACTGACGAAGAATATCGGACGATCTCGCCCCAATCGCCATGCGGATTCCGATTTCGCG
TGTTCGCTCAGCCACCGAAACGAGCATAATGTTCATGATCCCCACCCCACCCACCACGAGCGAGATCGCCGCGATGGCCGTCAACATCAGGGTAATGGCACCCATGATCACCTGCAGCATCAGTGCAATTTCCGTTGTACTTTGAATTCGAAAATCTGCATCGCGGCCTGGATTGATTCCATGTCGCTGGCGGAGAAGTGCCTCAGCTTCACGCGCGGCCACGGTCGCAAGCGATGGACTGGAGGTAGACACAAGAATGGCATCGACATTCGAACCGCCCCCCGGGCGAAGTCGCTTGGCTGCAGTGGTATAGGGAATCAACACAATGTCATCGTGATCTTGACCGAACATGCTGACGCCCTTTGGCTCCAGCACACCGACCACCTGAAACGGCGCATTTTTGACCCGAATCGTTTTGCCGATCGGATCTTCGGCCTGAAACAATTCGCGTACGACCGTTTGCCCAAGGACGCACACTTTCGCTGACGAGGCGATGTCCCGCTCAGACAGGAACGATCCCAGTGCCAAGTTCCAGTTCCGTACGATCTGATAGTCCTTCCCGACTCCAAAGGTTGTCTGCGGGTTCCAGTTGGCGCTCCGATAAACAACCTGTCCGCTGACCCCCACCAGCGGAGAGACCGCATTGACCGACGGGCAGCTCTCTCCTAGGGCGACCACGTCTGCAATACTCAAAGATCCGACGTCGACCGTTCGAATGCCTCCCTTCTGCATCGATCCGGGCAAGACGGCCAGCATGTTTGTGCCCAGCGTTTCCAATTGTGCCTGGACAAGCTGACTGGCACTTAGACCAATGGAAACCATCGCCGTTACGGCGGCAATGCCGATGACGATCCCCAAAATGGTCAGGCCGGCGCGAAGTCTATTCTTCGACAGCGCCCGAATCGCAATTCGCATCGACAGCCAAAAACCCATTCGACACCACACACCTAAGAGGAAACTACGCGATCTAGGCGATCATTCATCCGTCTTATGTTCCGGAGATTTCATAAACGACACGATCACCTTCTTTGAGCGGACCGTCGAGAATCTCGGTGGACTGGTCATTATTCTCACCAATGCTGACTTCAACATATCGTAGCATTTTCCCAGCAGGCTGCCATAGCAATCGCTTTTCGGCAGCTTTCTTCCCGTCGGCCGTCAACGCTCCCCCAATGTGGTCCTTGGCGCGATTTTTCAGGTCCAATAAGGTTCGATAGGCTGCTCGTTGCTCGGGAACAACCTTTTCGGGGTCGGGTCGTATGCGGAGTGCCGCGTTGGGAACCAACAGCACCTTCTCTCGAGCGGAAATCTCGAAATTCACACTTGCGGTCATCCCTGGCAGTAATCGTGCATCGGGATTCGTTACAGAGATCACCACTGGGTAAGTTACGACGGCCTGTCGTGCATCAGCACTCGCTCGAATCTGTTTGATTGTTCCTTTAAACTCTTCGTCGGGATAGGCATCCACCGAAAAACTCACAGGTCTCTTCATCTGATTCGCAGCAATGATGGCTCCAATGTCGGCTTCATCAACAGAAACGACAATTCGGATTTCCCGATCCATGTCCTGGGCAATGACAAACAGCTCCGGAATCTGGAACTGCGACGCCAACGTTTGCCCGGGCTCGATGCGGCGGTCAATTACGATGCCGGAAACCGGAGCCCGAATCTCGGTGTACTCTAGATTTGCTCGGGAAGTTCTCATCGTGGCCTCAGCCTGCTTTATTGCGGCATTGGCTATTTCCATCTGGGCTTCGACCGAGCGTCGGTTGAATTTCGCTGCATCCAATTCGGCGACAGAGATAAAAGCGGTGTTCTCGTTTTTGACAGCAAGTGTGCGAGCCTCTTCGGCCTTCGCCTGTTCAAGAAGAACGTCCAACCGCTTCAGATCGGCCTTTGACGTGGCAAGAATGGCCTCATCGCGGGCCACCGCCGCCTCGTAGGTTTTAGAATCGATCCGCGCCATGAGCGCGCCTTGTTCGATTTTTTCGTTGAAGTCA
ACCAAAACTTCCCTAACTGGCCCCGACACAAACGAACCAACGTTGACCGTCACGATGGGCTTGATCGAGCCACCTGAGCGGATAACCGACACCAATGACCCACGCGTCAGTTCGACCTCACGAAAGGTCAAATCGGATTGGAGAAACTGAATGTCCATGGCGTAAATGTACGCAGCAAGTGAAGCTCCGCCGACCAACGCCAGCACGACAAGGACCTTTATCGCGGTATACATCTAAGTGTTGCCTCGCCTATCGATTGCCAGTTCTGGTGGCTTGTCACACGATTCTGATGCGGCTCCGCCTCAAATCACTGCGAATTCTACCGGCTGACACGCCCCAGCGGGTCGCTCCTGGCCACAGTAGCTCGCTCTCCAAACCATTAGGAGACACGCTACATCATTCGTTTGCAATCCCCCATGCTCGAAAAGAATACGATCGGTCGGAATTGCTGTCGGCACTGTCGGCGTTTGGAACTATTCACTGTGGGCGTTTGTAACGATGAGGTGGTCCATGGACGGACATGCAACCTCCCAAATGCCGCAAAAGCACGTGCAGCACCATCAACTTCATCCCCTGCACGGAGCCCCACGCAAGGCCTTTGACGACTGCTGAAATCACTTCGAATGCAAGTATACATTTGTTATTTATTCGGCGCAGAACAATGCACTTCGAACGAGACTCATGTTCCGTGGCATCAAATTTTTCAGAGGCGAGATTCCCGACTCTCAGCGATTGCTTCAACAACGATTTGCTCGGCGCTGAAAACAAATTCTGCCAGCCAACTGGGAGGACAATCCCGACAAACTTGCCCTAGGCGGCACAGCTTAACTGTATTCGAGTTGAAAAGCCGTAGAACACGAACAGAACGTTAGCACCACAAAACCAACAATAACAGCACTTTACGACACAAAAAGCCCACCCAATCGGAGTGAAACTCAACCGAAGAGGGTCGGTAGTCGTAGCGAAATCCCAAGCGTTGAGCTTTCCAGATACGACCGACGCAACTGTCAGAAATGCGCGAACGAATCTGCACGAAGCAGTCGATAAGAAAGCAAGGTTCAGCAAGGAGAGCCGCGTAGACCCGCAATGGCCGTAGTCAGGATGACCAATGAGCCGCACAGGTTTGCTCGACCAGCATGATTTCAAACCGGTTGGCCGATTGGAGAGTACAGTGATGGATCACACACGTATCGGCATCTTGGCCTGGTTTGGTCGCTGGCTACCCGCTATTCGTCCGAGCGGTCGTCTTTTGTGCCGCATTCAAAGGCGCTTCAGCCTTTTCGTGAGGCAGTCTGGGGGCAGATGCCTGGCGACAGCGAGCTTGCTCGGAATGTTGACCCCCACACCCATCGCCGCTCAGGAAGAGCGGCCCTGGAAACTGATTCTGCCCGAGCAACGCAGCATTCAACTTCAGACGCCGCCGCAGATTCCGGTCGATGTCCCTATGGGTGGCCCTCCCACAACAGTCACGTCGCCTGGTGAAAGGGTGGAGTGGCTGCTGTCGCTCGACGAAGCCATTCACATTGCAATCCAGAATTCCGAAGCCATCCGCGTTGCTGGCGGGGCCAGTGTTCGTGTCACCGGGCAAACCATCTACGATGTGGCGATTGCCAATACAAAGATTGACGCCGAACGAGCCCAATTCGATCCGACACTGGGGATCGACAACACCTTCGTGCGACAGGACATCCCCACTGCTGGCCTGATTGTTCCGAATCCTCGTCAGCGGAGAATCAGCGGATTCACAAGAGACGGCTACAATCACGCCACCGGAATAGAAAAACTCAATCCGCTAGGGGGCCTAACGCGATTTGACATTGAAACGAACGTCCAGAGAACCAAACCAACAATCGCGCAGCTCAATCCACTAACCACGACACGTGCGGGTGTCAGCTATGTGCAGCCGCTGCTGCTGGGGGCGGGACAGGACGTCACGATGGCTCCCATTCGCATCGCACAATTGGAAACCGATCGGACGTTCTTTCAGTTCAAAAACAGCG
TCCAAGACATGGTGCATGCGGTGATCCAAGCCTATTGGAGATTGGCAGCCGCGCGAATCACTGTCTGGGCGACAGAAAAGCAGGTCGAACAAAGTCGCTTCGCCTCCGAACGGGCCGAGGCACGAATGCGAGCCGGGCTCGCCGATTCAGCCGAGGTCTCACAAACTCGGTTGGCCTACTACAACTTTCGTTCAGCTCAGATCGTCGCGAAGAATGCTCAGCTCGACACGGAGAATCTGCTTCGAAACCTCCTCTTCCTTCCTCCCGAGTCGGAGACGGAGATTATCCCCAGCACGGCCCTGCATACCGAAACGATGAATTTCGAATGGAACGAATTGCTGGCCATCGTTGAAGAATGTCGACCCGATCTGCAGGAGTTACGAACGACCGTTCTCGCGGATAATCAACGCATCATCCTCGCCAAGTCACAGGCTTTGCCCCAATTGAACCTGGTTAGTGCCTATGGTTGGACGAACACGCAGGGGGAGCAGCGAAACCGCAATGGCGTCGTTTCCAACTTCAACGCCAGCGGCAGCCGCTACACCGACTGGACATTGGGCGTTAACTTTGACATGCCATTGGGTCTGCGCGAGGGTCGCGCTAACGTCCGTGGTCAAGAGTTGAACTATGCCCGCGATCTTGCCAATCTTCGAACACGGTCTCATACCGCTGTTCACGACGTTGCCTCAGCAGTGCGAGCAGTCGACCGGAACTACGCCGAATACCAGTCTTATCGTATGGCTCGAGAGGCCGCGGCGCTCAATCTGGAGCAGCAGCGCATTGAGTATGACAACGGGCGTACAATTTTTCTGAATCTACTCTTGGCGATTTCGGATTGGGGAACTTCCGTCAGCCAGGAAATCGATGCCCTTTCGCGTTTCAACACAGAACTCGCCAACCTCGAACGGCAAACCGGTACCATTCTTGAAACGCATGCAATCTATTTCGAGCAGGAAAAGATGCGAACAACGGGGCCTCTCGGGCCTCATCATCCGGTGGACTATCCACGTGACTTGCGACCAACTCCCAATGCGCCTCGCTATGGTGCGGAGCCCACACCACCTGAGAAAAAATATTTTGACGAAGATAGTATACCAACGACTGAACCGACTTTGGACCCCACCGAAACTGGCAAAGCTTCGGTCGATCCGCCCACCAAGTCAGGGTGGGCGGCATCACTGACTGATACCATTACCGGACGGCGCTTCCGACAGAAAAACTCCGCCGATCGCTCCCGTGCCACCAGCGAGGAAATTCCCTAACAGTACGAATCAAGACTCCCGACACACAAAGATGTCGAGTGGCTACAGCGCAATTTCATCGACGACCCGCCCATCAGGCCTCGTTTCAACAGTGAACTCGACCGCTGATTCTCGTAATACACAGGGTTACCACCCTAGCTGTATGATGCCGAATTGGATGAACGGGCGTTGCGTAAGGGGTTACTAGCGGCTGCCCAAGCTCCAGAGACTGCTTCCTGCGGAGTTCCTGCAGGATTCATTCACCCAATCACGGCCAGACTCCCATTTCCTGCCAATATTTCAACCGTTGGCGGGCCGTCTCGTTGGAGCAGCGCCTATCAATTCAGGGAAGCTCGGTCAGCAACATTCGGTGCCCAGCACGAATCAGATCACCCAAAGCATGACCTCATGAGTAGTGGACGCTCTGACGCCCCAAGGACCCGGCCAAGACCGTTCAGGAAGCATCCTCGCGACAGTCCGGAGGACGGTATCCGGCCCATGAGAAGCCATGATTGCCTCAGAGAAAGGTGAAGGTCTGCTTTCACCTTTCTGAACCCCATTCCCATGGCATCTCTCGGATGTTTTGATGTCCTGTCTCCGCAGGCTCTAGCTCAGGGCAGGCAATTCCTTCCGCACGATGTTGATTAGCGGTGTATGGGCTGTTCGCAGGAAGAGGTGCTGGCCATCGATCACGTGAAAGTTGAAGCTGCCACTCGTCTGGCGGCGCCATCCGATCATCTGATTCCACGGCGCAATTTT
GTCGCCGCGTCCGACAAACGCCGTGATGGGACACTCCAATGGCGACTCGGTCGTGTAACGATAGCCTTCAATCGCTTCGAGGTCGGCCCGAAGCGTCGGCAAAAGCAGGTCTTGCCATTCAAGATTGGCGAGCGGCTTGTCCATTTCTCCGCCCGTTTCTCGCAGCCAGTGAAGTAATTCTTCGTTGCCAAGTTGTGAGACGCTCTGGCGAAATAGTGAGATTTCCGGGGAACAGAAGGCCGCGACAATAAGCTTGCTGGGCAATGGACTGCCTCGCCGTCGCAACTGACGAGTCACCTCAAAGGCCAGTAAGGCCCCAAAACTGTGACCGAAGAAAACAAACGGACGGTCAAGCCAAGGCAACAACTGTGTCGTGACTTGCCCTGCAAGATCAACAAGGCCCCTGCAGGCAGCCTCCGCCAAACGATTTTCTCGACCGGGAAGCTGAATAGGACAAACTTGCCAAGCAGATGGCAGGCTGGACTGCCATCGATTGAAAAGGCTTGCACCGCCCCCGGCGAACGGCCAGCAGAATAGTCGCCCCAGCGCATCACCGTGCGGACTGGGAGAAAACCAAGGCTCAGCGGAGTCCGATACACCTGGTGAAGAACTCATATCAAGGCCCGCCCAATCGATTAGTCACCTTCGGATCGTTGGCATCCGCAAACTATCCGCGAATACACGGCATTCACACGACAGCAGAGGCGTCCAATCGTTTTCGCGGAACTGCGAACTTGGCGCGCGTAGTTAACACCGCTTCTTGACGCTGATTGACCATCTTGATCCGCACAGTCACTAATCCGCTCTGGGCATTCACGGGTTTCATTTCGATTGTTTCCGCATGCATTCTGAGGGTGTCACCAATCCTCGTAGGATTCACAAACCGAACCTGCTCAAGACCGACAAACACAATATTCTTGAGCGGTAGCAATTCATACCCGACCACGCGGAAAAGCAGTCCACTGCTGATCGAAAGTATCAGCAGCCCGTGTGCGATCCGCTCCCCGAAAGGATGCTCGCTCATAAACTCGGCGTTACAGTGGATTGGGTTCCAATCCCCCGTGAGCGCAGCATACGCGACAATGTCGGCCTCAGTGATTGTTCGACCATCCGAAATGGCCTTTTCGCCGACGCGACAATCCTCATACATCTGTCTTTGCATGGTTAACCCATGGCTCCATACCAGACGACACATAACCACCTAGCGAACACTTCTGCCAGAGGAACTCCCCTCGTTCTCTGAAAAGCCATTATGTGATGTACTTATGATATCGCATGCTCTTCTCGAAACTCCTCAGCAGTCGTGACTCCCACTCCTCAACTTCAGGAAGCAACCGCGTCGGTCGTGGCTGAATCCACTCACCATAGTCGACACCACTCGACGATGTCGCCTGAACAAACGAGAGAATAGACTGCATGACCTGCTCGGGATGGTCACAAAGATCCTCGTATCTCACCGACACATAGCTGTCGGTGGGCAACTCGGCAATCCTATCAACAAACTCACATCTCGCCTTCTCACCGGAGGAAGCCAAGTGCCGCATCCCAAGCCTCCAACGATTGTTCGGTGATGTGGCCCACTTGACAAGCCTTTGCACGAACTTGCTTTTATGGAGAGTCTGCAGAAATCCGGGAGCGTCGACCAAGAACTTCACATAGGCATTTCCCGACTGTTGATTCGTCTGCATCGCCTTCAACTGCGAGTTCAGGATATGAATCGGATGACGGTGAATAAACACGAATCGAGCATTGGGCAGAACAGACTTGATAAATCGAAAGTTGGAGAAATCCCAAGGATTCTTTAGCAGCAGTGGTTTTGATGGATCCTCGACCTCCTGAATGGTTCTGCAGAAATCGAGGAACACGCCGAGGTTTGACCGTTTGAGTTTGGGGGACGCCGTCTTCCGGAACAGATGCATGCCATACTCTTCCGGCATTCCCGGTTCCAACGGCATGTTGTCGATAACGCGGGTTGAAATATCCCACGATGCGAAAATATC
ACTCAATTCCTGCTGAGCACCGGCCATTGCCCCTTCCGCGTGCAGAGCACGCAGTCGATCGCTATAGAGGACATGGAACGCCGTGGTGATATTGAAACACCCCGTCAGACCGAGCAACTTATAAAGAATCGTTGTGCCCGAACGGTGTTCGCCAAGAATAAAAATCGGTCGCAGTTCCGCACTCGGTGATGCGGCCTCAAGTATTCCGAGCGATTTACTGCTCACAGGGGAGTTACCATTCATATCCTGACCCTTAACGACAAACTCAAATCAGTCAGCGAGACCTGACTTTCAAGACGCGATTTAGGTGTAACACAATCAATCGAGGCCCTGAATCCCCCACAAACTCCCGAATCACTCCGAATCACTTCCCACCAGCAATTCGAGACCGATTTGCATGATGAAGAATCTCGTTCATAATCACTTCGGCGGCACGTGGTGGACCACCGGCAAGTTGGTAGACCTGCTTCATAGTCAAGGCTGCTTTTTTGAGACGATCATCGCTGAGTGCAGCTAACGTTCGACGGCGGAGCGAATCGGGCTTGAGATCACTGTCCTTCAAAATCGCTCCGGCCCCTGTCCGCACAACACAACCGGCGTTGTAAGCATGCTCGGGATTCTGAGTCCACATTAACAGAGGCACACCAAAATGGAGGCTCTCGTTCACGCTTCCCAGCCCTCCATGACTGATGAACAAGTCAGCACACTTGAGAATCTCGATCTGAGGCATAAAATTGCGAACCACAAAATTGGCCGGAATCGGCCCCAGATCGGATATCGCCGTTCGCTTACCAATCGACATGTACACCGTCACATTGCAATCCTTAAAGGATTGAATGCACGCACGGTAGAAGCCAGGTGCGTCGTCAACAACCGTCCCGAGAGACACATACACCAACGGCCGATCACCTGAAGTAATCGACAGATCCGATACTGTCGCTGGCTCGACGACGGATGGGCCGACAAATATATAGGAGGCATCGGGCTGATTCGGATGCCGCAGGAACTCTCGACAGGTGAAGACGATGTTCTTCTCAGCCGTATTTGTCAGCAGATCCAGAAAGCCAAGCTTCTTGAGTTGAAAGCGCTTTTTCAACTGACCCCGGGCCTTGAGGAACTGGAAATACATCCGCACATCGAGTAATTCCATCAGCAGATACCGGAGATGAACTGACGGCTTTCCGCCGCCCACCATCCCGAGCACTGTGCAAAACAGGGAGACTGCGGGAATCCCCAGGATGTCGGCCACGAAGCGGCCCCATGGACAAACACGATCAAACAGAATCAGATCGTAGTTCTCATCGCGGAGAGGCTCGATCGCATTGGGGAGCCACGCCGAGCTCTGCTGCAGAACCAGCCCGGCAAGACCGATGAGCGATTTGGGAAGCTTATCAACAGGATGATCGACCAGCCTCGGTACGACGCGGACATCAGCCCCAGTCGCGTCCCTGACGCGCTCTGCGTACTCTTCACTGATATAGTAATGAACGGCCTCACCCTGTTCCACAAGCTGCTTGACAACCGGAAGCGTCGGGCTGATGTGTCCCCAGGCGGGAATCGTAAAGACTGCGATTTTCGCCATGACACCACTCTCAAAAAGGCAACTCAGCCACGTTCGGCCAACTGTTACGGCAAATTCCCACACAATGCAAGCAGTGTATCATAGCACCACTGCAAATTATTCAAACCCGTACGAAATTAAACCTCTTCGTCAAATCCAACCAGATCCCTCGTCTTCTGAAACAACCGCAGCAGATATTCCTGACGCTTCTGAATGTCCGGCAACAACTCAAATTTTCGAGGTTGAGTATAAGGATTGAGGTTAACATCCGCAGGACAAGTCTCGCCAACATGAGCCAGAATTCGCTGCACGTGCGACTGAGCATCCACCATCAATTCCTCATACCCGACAAATATATAATCTTTCCGCTCAACAAATGGAAGATCCTTAAAGAGTTGTCCCTCGGCCACTTTCATCCGATGAATGAGCCAAGAATGCC
CCGGATTAATTCCGAAGCAATCTGAAAAAGCACCTTTCATCAATCGCTTCACCACGGGTGATTCGGTGATTGAGTAACGCTTAGACATCATTAGCATCAGCTTATTCGCCTCGCCCTTCTGAAAAATCCTCAATTGAGAATCAAGAAGTCGCAATGGATGCCGAAAGAGGAAGACAAACTTCGCGTTTGGGATCTTGCTTTTAATGAATAGAAAATTGCGATAATCAGGACAGCTTTTCAGGACCAGCGGAACCGATGGATCACTAACGTATTGAACCTTTTGACAAAACCTTACAAACGATTCCATATATCTCGAACGAAGCACTCCGTCCCCGTGGACAAATGCCAAATACAACACATAATCAATAGAAGTGTCAGCTCCAGCTGCAATATTGTCGATACCACGAGTCAGAACACCTCACCCTGAAAAGTCGCTATTGATCCTGCCTCTCTCCGATATTTCCAACCCCATTACACGATCATGCAAGAGGCGGTTTCTATTCAGAACATGCTGTAACTTTACATAGTTTATACGCCTAGTCCGAGCGAGCAACTCGGAGAAAAGTGTTGTACCAGAACGATAATCCGAGACAATGAAGAGTGGTGCAATATTTGTATCCCGCGAAAGCTTCTGGTAACCCTGATCGAGATCCGGGCCCCCGCTGACACGAACAAACGGGGACAAGAGTGATGCCACGAAGGCCGCTCGTGATTCAACAATCATGGGCTTCGCATTTGCTCCATTGCTCATAACTCCACGATCCATTTTTCTCGCTTTGTTCGTCTTTTGATTAATTTTCGACAAGTTCAAACTACTGGCCAGGAACTTCTGTCGAGGCCGATGGCATAGGCCGCGACGACTCTTCCAGAGAGCATAACTCCTGAGATATCCCGTCGAGAATGAGTTCGCACACTTCAGGCTTTGCGAGGCTCTCCGTGGCACCCAGGGAAATCGTCACCGCACCGTTGAACTGGCTCATAAAGAAACCCATCCCGGCGGCGCGGTATATCGAGACAGTCGCTAACGCATCGACGACCTTAACCCCATCGAATTCGGGAAACAACGGCGCCATCTCACCGGCATTGGCGAGTCCGCAGGCACGAACTAAGCCACGCAGCTGTGACCTTGTTCCCCACTTGAGCAATCGATGTACCCAGCCGAAAGGAACTGGCAGCAGGAGCCATCGGATCACTGGCATGGCCAATATCACACCCGGATAGTTAAGACCCAGAAGACGCTGCTTATGACTCCTCTTAAACTGCTCATTGATCACCTCAACAACTTGCTCAAGGCTGGCATCGGCAGGTAGCGGCACAAACATCCGACTCGGGCCAGACAGATTCGCGACCGGAGACAACTCTCGCTCCTTCGCCGGGAGATAGTTACGATGATCGACTGTCGTTCCTACACCCAGTCCTTCCTCTCCTGTTACTCCAATATGGCGACGAAGTGTCCGATACAACACAGCCAGCATGACGACCGTCATGGTCGTTCGCTTTCGCTTAGCCAGTTTTGCGGCAGCGACACCCACATCTGGTTCGAGGCGACGAATGTAGTAATGGCCGCCATCTCGCCTTTCACCGACCTGATCAGCTTTCAGTAACCAGAGGTTGCGGGTCTTGCGCTGATTGGCCAGAAAGCTGCCGATCACTCTGAGACGATGACTGATGGGAAAATGGCGAATAACTTCACTGTGGTCTCGCGACCGAAGATTTACAGTTGGTGCGACATCGGGGCCAGTGCGGCCCTGCAGATAATTTTTGCTGACCATTGCAATGATTTTCATCAGCGAGGGAGCGTCGCCGGCAACGTGATTCATTTTATAGCAGAGCGTATCTCGGTCGCCGCGAATGACCCGCACGTGAATCATCGGATCAACGCGGGGATCCATCTCGGCTTCCAGAAACTCTGCCAGTTCGCGCTCGGGATAGACAGACTCGACAACGGTGCATAAATCCATACTCTCATGATCGTCACGGCGTTGCCAATATGGTCGATA
CCAATGCTCCACGAAACGGCAACCGAGAATCGGCTCGGCGACGACTACCATTCGCAGCGCCCTCTGCAGATGTGTAGCGTCGATCCTTCCGTCATAACGGACAATCACATGCACCTGTTGCGTATTCGCGTGCCCAACAAGGAACTGGCACATGTCAGCAAAGTTTGCCGGGAATCTCTGGGGACAAACTTGACGACCAAAACTCCACCTGGCCGAGGGTGAAAACTCGGCAACGGAAGACTCTGAGTTGGACAAAGTCGGCATCGGTTCCATAGTAAAATAAGCCCTTGACACATAGCAGCATACGCTCGCGTCAATTCATACAGTGCCACTGCGTTTTCGTCAATTAGGAGCAACGCCCACCTTGCAATAAACCACATCAGAAATCGTATTACCTGGCATAACACCTACATATAAACACCACCATTGATTCGGATCACCTGCCCCGTAATAAAGGCCGCTCTTTCCGACGCCAGAAACGATACAAGAAACCCAACCTCTTCGGGACGACCCGCACGCGCCAGCGGAGTGAGTTCCAGAAGTCTTGCTTTGTTGACTCCATGCAGACTAATAGACTCTGTCTCAAACAATCCCGGTGAGATGCAGTTCACCGTGATGTCTTTTCGACCCAACTCACAAGCCAAGGATTTCGTCAGGCCGATGACGCCAGCTTTCGTCGAGCAGTAAACTACTGAACCGGCAGCACCAGCCTCAGCCGCCGAACTGGTAATATTGATAATGCGATTTCCAGAACACCCCCCACTCCACGTCTTCACAACTTCGCAAGTGCAATGAAGAACACCCGCCAAGTTAACCTGTAATGATTGCTCTATATCGCTCGGCTTGACTCTTGCCAATGGCTTTAAAACGCCACTCGCTGCATTGTTAACAAGCACGTTAATCCTTCCAATATCACGATCAATGTTCCGTACAGCGGTCCTCACCGCGCTCGCATCTGAGACGTCAAACTGAGCAATTTGACATCGTCCGCCCAATGCATTTATTGCTTCCTGTGTCTGAGCAGCACCCTCAGCATCACTTCGAAAATTTACGATCACAAGCAATCCGTCGCGTGCCAGCTCGAGAGCAATTGCTCTACCCAAACCACGTGACGCACCTGTAACCATTGCAATCCTCGCAGCCACTTCACAGACCCTCCAAAGGCATCATGGATCTCACCACGCAAATGGCACACACTACAAATTCACCGGAACTCACAACCCAAGACACCTCACAACCTACCGGCACAAACTGTGGCGACCGCGACAACACGGAGAATCACAATACGTCTATTAGAAAGAGAATGTAAAAACCAACTACTACGAGACGAGTACACTCATCTCTCATCACTAAAAGCCATGTGAAAACAGCTTGTCGACACTCAACTCGAAACAACAACACAACATATCGCCGACATATCATCTATAAAAACAACAAACACAATACACTCCCGCGAGCGAACCAACCCAAAGCATCCACAACTTTCGGGACTGCTCCCTCTTCAACATCTGAGCACTCAGAACCTTAAGCAACAAGCACAATCATGACCACAGAAAGCCTTACGCTTGAACACTCTTCTGAGGCTCGCAATTGCGGACACCTTGCATAGGGGACATTTTACTTCTCCATGGACTTTCGCATAATCATGCTATAACCATAGCCATTCCATAACGTTTTGATTTCTGACTCAGAAACACTGGGAGGAATCGCCTCAAAAGGAGTTGGCGATGGAATGGTGATAGAGGGAAAGGTTCTTGCGAGCTCCTTCCCAATCGCTTGTTCTGCATTCTCGTGCGTAATCCATTCGTCAGTTACAACCACCCTTGACCCCGGCTTCGCTACACGAAGCAGCTCCGAAAGCGCCAACCGCTGATCTCGAAACTCGTTAATAGCACCTGTATGAAATACAACATCAAAACTGTTATCTTGGAACGGCAAGTTCTCCACAAGCCCGACCATTAGGATGTAATCAAGGCCGGCGGACTTCAGTTTCTTTCTCG
CAACTCGTAACATTTCTACCGAGATGTCTACACCACAAACACTCCAGCTTGAAACCATTTCCGGATTCTTCTCGCAGAGATATCTGATCTCGCTTGCCGTCCCGATTCCAACATCTAAGATCAAAGACCCTTGATCAAGCTCTAGCGCATCGAGGTACTCGCTGCGGATCCTGTCAGGCTCTAGGCCAACAATCTCAGCAAACTCCTTTTCTAGCATGTCGTAAAGACGCGCACCGAGACGCGTCTGCTGCTCCATCTCGGGATAATCCTCAATGATCTGATCTTTTCCAACAAGTTCGACGATTCCATCAACTCCGTCACAATGGTACCCGCAGCCTTCACAGCAAAATGAGACTACTTTGTCCAGTCCGTATACTAGTCGCACCTCGGAATGAGCGTCTCCGCAAGTCGGACACCTCAAGATCTCAGACTTCATCATTACGCTCCACAAGAAAACTTTTCACTTCCCCTCATCTCTGAGGATTTGTTACGACCACCGGAAGTTTCGACAAGAAACATATCTACTCTAGTCTACACTTTTTCAAAGCCCCCATTGCCCCGAGCATCCATTGCCGATAACTCTTCGCAATTAAGTGAAAGGAGTTCCTCTTCAATCCCATCTAGAATCATCTCACACACGTCCGCTCTGACAAGACTTTCCGTCGCCCCGATGGCGATTGTAGTTGTCCCCTTGAACTGGGTAACAACAAACCCAATTCCTGCAGTCCTGAAAATTGCGGCGGTCGCAAAGGCATCGATAGGCTTCACATCACCGAAGACATCAAGGACCGCTGCAAGATTTCCTACGTTAGCAATTCCTACCGCACGGGTTCTCCCAAAAAGCCGAGGACCCGCTACACGCTTGAGGATGGATTGTAGTACTCCAAACGGAACCATTAAGAGCAAGGGTCGCACAAATGGCATCGACAGAAAGACCGCCGGGTAATTTAGGCCAAGTAGGCGACTCTTAAGGGCTTCTTTGAATTGCTCCTTAATCACCTCCACAACTTCACCTAGGCTGGCATCTGCGGGTAGATCAACGAACATTCGACTCGGGCCGGATAGATTCGCAACCGGCGAAACCTCCCGCTCCCTTGCTGGAAGATAGTTTCGGTGATCGACAGTCGTTCCTACCCCAAGTCCATCGTCTCCGGTTGCTGGAAACATGCGCCTCATTGTCTTGTAAACCGCCGCCAGCAGGATGACCGTCACCGTAGCCCGATGCTGCTTTGAGAACCTCCATATTGAGTCCGATATCGACCGATCTAGCTTTCTAATGAAATATCTTCCCTTTACCCGGTCGACTCCCACATCATTCACGTCAAGCAGCCATAGATTCCTGGTCAGCCTCTGATTGGCAAGAAAGCTTCGGGCAACTCGAATCTTTTCGCTTAACGAGAAGTGCTTTAGGACTTCACTGTGGTCTCGGGACCGCAGATTTGGCTCAGGCAGAAATGAGGGTTCAACTGATAGCTTCTGATAGTATTCACCTATGAGCAGAACAATCTTAAGCAGAGAGGGTGCATCGCCTACTAAATGGTTCATTTTGTAGCATAACGTGTCTTGGCACTTGCTACGAAAGACACTCACCTGAATGAGAGGGTCTTTTGAAGGATCAATTTCGGCCTCCATGAATTCTGTCAGTGCCGACTCCGGGTTCTCAACATTCATGACAATGCATAAAGCCATGCAGTCCACATCGTCTCGGCGTTGCCAATAAGGCCGATACCAGTGTTCAACGAAACGGCACCCAAGAATCGGCTCAGCGTAGATCACCAGCCGCATCGCCTGCTCCAAGCGGCTAGAATCAATGTGCCGGTCGAATCGGACGATAAGATTACCATGCTGCGTATTCGCCTGGCGGACCATAAACTGGCTCATGTCAGCGAATGTTGCGGGAAAGCGCTTTGGGATTCGTTTTAGCAAGTCAAGCTGCCGAACCTTAAGCATTTCGGAGTCAGACAATTTGCCAGGGGCTGTCGTCGGCAAGGCACTAGATT
TCATAGACAACTTCTCGTTTCCCATGCTACTCTTGAACCAACCGTCAGATCAGTCGTGCATACTTATCGATGATAGAAGAACACTCCAGCCTATCGATAGCGCTATTTAGAGGATAAAATATTCTATACATCGTAAAAAGAATGGTAGTCATTCATAATCCACCCATTCCAGCTCAACCGATATTGAGTTAGAGAGGCTTTTAGGAGCCGCTGACTTGGCATTTTTGATCTAGTAACTTTGACTGACCTCCGCAGCTCTAGCAATCAAAATAATTGCCTCACGATTTGGAGATTCACCCTTTAAGGCACCAGGACAATCTTCCCGGCCACGCCGCCGCTGGCCAGCAGTTCGTGCCCTCGGACAATCTCCTTGAGCGGCAAAAGTTCACCGATGACAGCCTTGACCTGACCGGACTCCACAAACCGCAGCGAATCTCGGGACTCCGCCGTGGTTGCAAAATGCGAACCCAGATAGTTCTGCTGCTTTGTCCACAGAAAGCGGATATCAAGCGGGGCTGTGAAGCCGGTGGTCGCACCACAAGTCACGATGGTCCCCCCCCACCTCAGGCAATGCGTACTCTCCTCCCACGTTGCAGCGCCAGAATGTTCAAAGACGATGTCGACGCCCCGCTTGCGGGTTATCTTCATGACTTCCCTTGTGATACGCTGCGACTTGCGATTCAGTACATAGTCTGCGCCGAGGTCGGAAACGAGCTTTGCTTTATCTTCACTACCAACAACTGCGATCGACTTAGCGCCAAACGCGCGAATCAGCTGGATGGCCGCTGTTCCGATTCCTCCAGCAGCTCCCCAGACGAGAATGCAGTCTCCAGCTCCCATGCGAGCTCGCGTGACGAGCATTCGCCAAACAGTCGCCAGCACGTTGGAAATAGCTGCAACTTCCTCAAACGAAGTGTTGCCCGGCTTGGGAATCACATTCCTCGCAGGAACGGTGGCATACTCGGCTTCCGAACCTTGATTCGGCCCTGTCTCAAACCCCCAGATGCGAAATTTGGGGCAGAACATGGGCCAGCCACGAACGCACTCGACACAGTCACCGCATGAGAAGGCGCCATTGACCACTACTTCGTCACCGACAGCCACAGTGGTAACGTTAGGCCCAAGTGCTTCGACAATTCCTGCGCCATCGGTTCCCGAAATATGCGGCAGAGGAAAGTCCATTCCCGGCACGCCCTGCCGCGCCCAGATATCGTTATAGTTCATCGCCGCGGCTTTGATCCGTAGCAGCACCTCGCCATAACCCGGTTCAGGAATGGGAAGATCATCTCGATACTGGAGCACTTCTGTGCCGCCATGTCGATCGAATGTTACACCCTTCATTTCCGCTTTCAATCTTTCGGTTTAGGGACAGAATGGTCGCTGTAAGGAGGACATAGAAGATCAATGAAGATATTCTGTTCCCGATCCAGGAATCGGGCGATATTCTGGAGGCCGTGTGCACCGAAAACTGCAGAAATTGCTCGCCGAAAAAAATGGTAACGCCGTTCGTAACCCTGTAACAAGAGGCGAAGTTTGATTTGCACGCGAAAAGTTCTTCTCAAAGCAGACTCAGGCTGGGTCGATAGTCCAGTTGTTCTGTGCGACGGGACTTTCGGGGTCTTGAGGAGACTGTCGAGCTTCCGGCAACGTTAAGCACGGCTACCGATATTTCGAGTTCGCGTACTACCCCAGGCTCTGCGGTCCTGCGGCAACTGGTACATTCACCTACATCCAGAATTCTCAAGACAACCTCAGAGAGTACATTGGACGATCGCAATGCTGAGAGACTCCGTAGTCGGTTCCATAGGCCCTGCCGAAAAGCTCGAAATCCTTCGTGAGATGGATGTTATGTAGAGTCGCGTTTTTAGCGACTTTATTCAAGTGGTAGTTTTTCTATGTTCTTTTCAAGGGGAAATGAGGGACTACCACTTTTATTGTGGTAGTAAATTTCGACTCCATCTTGTGTCCATGGTTCTTCTTCCATCAGGCGGCCAAGGAAC
TGAGTCCCGATTGCATTGTCGGCACTGGTGGACGGTGAGACATGAATCTTTCGACGGATCATGGAGAACCCGAAGGACGAAACGAATTTCTTTCTTGAGAAAGTAGTCTGCGGCACTGTCGACGAATTCCCGCCCCGTTCATCACAGCCGACATGAATTGACGGCCTCATTCTGTGAATGCTTGATGGACGGATGTCATGGGCAGACAGGTTAAGGCCATCGGAACCACTCGAGCGCGGCATGAGATCTCGGAAGAAGTACTCCGACCGCCAGCGCACCTTCGTCAGATAGGGTAGAGACCCTTCTTTCGGAGCCTTGGCCGTATTTCATGCCATCACCTGCCTGTTCAACCCCTGCCAGTATCAGCGGATCTGGCGGAATTACGACCTTTTCCGCGAGCGTCTTGACGCTCCCTTAACCACGGTCGAGCTCTCTTACGACGGCACGTTTCATGTGCCGGACGCCATTCAATTGCTGGGCGATCCCGCGAAGCACACCCTCTGGCAGAAGGAGCGGCTGCTCAATCTGGCGATCGAAGCCGTCTCCGACGAATACGACGACATCGCCTGGATCGACGCGGATGTTCTCTTCACCGACCCGCGTTGGCGGGAACGGACGGAGGAGGCGCTCTCCCGCTCGCCGGTCGTCCAGGTCTTCGATCACGTCGCGCTGCTCGATGCTCATGGGGAAGTCCACGAGATCCGCCCCGGTGTGGTGGCGAAGTTGAGCAAGACCCATGATCCCTCGGTGCGTTTCGCCCATCCGGGATTTGCCCGGGCCGCCCGGCGGGAGGCGCTCCCCGCTGGACTCTTCGACCAGAACTTTGTGGGTGGTGGCGACACCACCATGCTCCACGGTTAGCTGGGAACGACGGCTCCCTATTTGGACAAGTTGCGCAGCACCGGTTGGCGGCGTGGACTCGCCGCTGCTGGAGGAAGTGAAGCAGAAGAAAGCTCACGGCAATGCGGTTGCTGCGACGGAAGCGACACTGCGCACATATTGGCAGGACCACGACCACAAAGGCGGGTCTATTTTCGAGTTAGGGACTCTCGACAAGCCGCCTGAGGGGAGCAGTGGGATACAGATCCGCTGGCAAACCGGCTAATTGCTTGATGGATTTCACCTTAGCCAAATCGTCCGGTTCCGCTGTCACGACTGGCCGAACATCAAACTGCCACCGGAAGAACGACTGCGTTGACTCACGAATCTGTACACGTCTCAATGACGTTATAGGGTAAGATCTTAACTCCGATTCGCTGGAGAAAAACCCATGGCTCACTTGCTTCCTCGATTTTGATCTTCGTGGAGGCATCTGCCACCGGGCGGCCGGTCCCGTGCGTCAGCAGGTAAGGCGACCCCATTTCGTTGATGAACTGTGTATCAAGAGACCAGCCGCCCAGGTCATCGAAGCTCTCCGCTTCCACCAGCAGAGTTTTCCCTGCGGCAGATAGCGAGGCGTTTCCCAACACGAGGCCAGCGAGAAATAGAAGCGTCAGGCAGCGAATCATCGGACACTCGGCAAATCAACAAACATCGATGTGATGCGTTATCATGCGGTTCTGGAGGAAAATTGCAAGAAGTTCACGCGAAGTGAAGACTCTGCGATTGGGAAAAGACTCCGGCAACCGCCAAGACTGCAGACCTGAGGTCAGCAATAGCTGATGATGTGACAAGAAGTCAGTTGCGCTCTGAGAGCCACGCTTTGCGGCAGTAGGTTTTGACGAGATCCTGTTTCAGGACGTGTTGGCGCCAGATTTTCGTCGCGGTCTCGAACAACTTATCGTGATACAGATAGATTCGGTTCGACCAGTAGTGGCTGCGCAGGTAGCGCCACAAGTTTTCAATGCCGTTCAGTTCCGGAAGTACGGGAGAGATCCGGCGAACACAGCGGCTACCCGATCAGTGAAAATTCGAGCTACAACCAGTCCTTCGATCGGCAAAGGCTCTCACCGTCCCAAAACAGCTTGGAATCTCAGCATCCATTGAAAGGGAATGCGATT
TTCTGCAAATCAGCGAGCTCTTTGGCCGTTCACGCAGTTGCCTCTTTTGATCACCGCAACTCCCCCGTCCTCAATGGCATTTCCCAATTGCGACCCCATTTATAGTGAGATCCTGGCCTTCAAGGTCACCAGCCATCTCAAGCTTCGCACAGTCACTGTCGGGACTATTTCGACCCGCTTTTCCGAACGAGTTCATGAAGTATTTCCAGTTGCGCGGGGCTGAGAGACATTTCCGCTTGGCGGCGAGATTCGCGAGACCGCGACTTAAGTGTTGGAAGCTCCCAGAGACTTGCCTTTTTGAGTTGCTCGAAATCCTGCCATACAATCGCGGCAATGGAATCAATTTGTGAGTCGTTCAGATTGAGCTTCTGAATCACATGCGGCGCCAACAGCACTAACGGGCCCTGGACTCCACTCAGAACCTCCCGCCGCAATTCTTTGACTTGCTCAGTCGTCAGCGTCTGATCGATCGTCTTTTCGATCTCCTGCCCCGCCTGCTCATACTCGCGAAGCATCATCGGTAGCTCCGCAACAGGGCGTTTTCTGAGGTCGTTCACTGCGTTCTTCACGCTGTCTTCCAAGCCAGCCAGTCGCCGGCATTGGTCTTCGGAAAGACCGAGCTGCTCCCGCACCGACGCTTTACCCAAAAACTTGGCGCAGTTGCCCTGGAAGAGGATTTCCGCAAGCGCTCGTGCCTGATGCGAAACCCCTTCACCTTTCGACTCCGACGACCTCCTGGAAGGGGAAGAACCAGAAGAATTCAAGTCGGGACTCCGTGCGGTCTGGTCATCGGGATTCTTCGACGAAACCCCAACGATATCAACGTAGGCCGATGCATCACGTACAACTCCCTCGACCTTTGTATCTCGCGACGGCTGCGAAACGTCAGGTTGATAGCCACAGCCAATGCCACTCACAAAGACGAGCCCGCACATTAGCGCACTTCGAATCGATGTCGAATCAACCGTGCAATCAGTGTTTTGAATCATCAACCCCTCCCAAATGGTCGGCGTATTGCCGCAACCGAGCAACACCATAACCATCGTGACCCATAATCTTGCACTCCTTTGCCGGCATCGCAACAACGAGGTTCTCAATTGCGTACCGGAATCGGCCCAAAAGCCGTTGCTCTAGGCCGAATATCTCGCAAAGTTTGAGGGTGGCTGGATCCAATGACGGCGGCAAGCGGGGAACTTGTCAGCGGGAAGCCGGCGAATGGGCCTGTCTTTAAATGAGCGTTTGGCAGACTTGCCTCCCGCCCGATGCGAACTCTGTGCCAAAGTCACGGACACGCACTCCACTTTCCACAATTTTCATCCTAGCCAGGAGATCCATCACAAGCAACCTTGTTCCACGCAGCATTTCTTGACGGCCCTCACATTGCGATAGAGAACTGCGCGTTCAACCCGACACCGGATTTGCGCAGCGTTATTTCTCGAATTCTTTCCGTCATTCTGGCAGGAGGCTGGAGATGAAGGTCGAGGGGCATCACACCGCGGACGAACTTCAAGATCTGCTCCCGCGAGAACGCCGCGCGAGTGTCGCCATCAGGATCCGGATCGTCCGACAGGCCGCCTTGGGAGGCACTGCCCCACAAATCGCACTGGAGTGCGGCCTGTCCCGACGCTCCGTCCAAGAGTGGGTCGAACGCTACAACACCCAAGGACTTCCCGGACTGTAGGATCACCCCGGACGCAGTCGTCCGGAGATCCTCGACGCCGAACAGCAAGCGCTGCTCGCCCAGCGGATCGAGGACGGTCCCCGGGCGAACGATCCGTGCTCGCTGCGGGGGATCAATTTCCGGAACTTCATCGAATCCCGCTTCGGCAAACGTCTGGCGATTTCCACCTTCTACAACCTGCTGCATAAACTCGGATACGAGCCGCTTGTGCCGTGTCCCCGCCATCGCGGCCACGACCACGCCGCAGCAGCCGAGTTCCAAAAAAAATCCCTGAGGACATCGCCCGCATCCAAGTCGTCCATCCTGGAAAACGGGTGAT
CGCCTTCTTCGACGACGAATGCCGGTCCGGCCAGCAGGGCACGTTGACGCGTGTCTGAGCCCAGCGTGGATCCCGGCCGCACCGCCGCCGACAACCAGCTGTTAGTCGAAGGTGTGATCTTCGTACTGAAAACTGGCATCCCCTGGTCCGATCTGACCCAGCGGTTCGGATCGCGCAACTCGGTTTGGAGGCGTTTTAACCGCTGGTGCAAGAAGGGGGTCTGGCAGAAGATCGCCCGCACATTGGAGGACCCCGAGTTGGAGCAGGTTCAGCGGGATTCAACCACGATCAAGGCTCATCCCGGCCCGGCGACGGGACGCCGCCGGGCCGGGGAAAAGAAGTCGACGCCGACGAGCGCCGCTACCTGGGCCGCAGCAGCGGCGGACTGACCACGAAAGTGCATGCGGCGGTGGATTGCCGTGGACGGTAGTTGAATTTGATCCTCACACCGGGACAGGCAGGCGACGCCCCTTGGGGAGAGCAGTTGCTGAAGGGCGTTAGTACGAACCATGTGCTGGCCGATGCCGTCTACGACAGCGACGCCATCCGTCGGCGTGTGAAGCGGATGTGGGCCAAGGCTTGCATCAAGCCGAAGGCGAACCGATAAGTGAGGAAGCGCTGCGTCCAGGAGCGCTACCAGCACCGCAACATCATTGAGCGATTTCTCGGAGCCTTGATACGATTCCGGCGCATTGCCACCCGTTATGAAAAGAAGGCTATGAACTTCGCGGGTTTTATATCGCTCGCCGCCCTACTAACCAAGCCATTCTGAATGTCCGTAGGCCCTAGGTGGAGATGGCGAGAAGATCGTCATATCCGCAGACAACTGCACAAAGTGCGATAAATACAACACTTATTGAAGAGTGTGTGGCTGCACGTCGGCGAGGATCGGTAAGTTCGGCGAACCGATCCGGCAACCGAGTCGTGGACAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP036299|4792899:4837753|4825633_4827094_-|WP_145303477.1|DBSCAN-SWA MEPMPTLSNSESSVAEFSPSARWSFGRQVCPQRFPANFADMCQFLVGHANTQQVHVIVRYDGRIDATHLQRALRMVVVAEPILGCRFVEHWYRPYWQRRDDHESMDLCTVVESVYPERELAEFLEAEMDPRVDPMIHVRVIRGDRDTLCYKMNHVAGDAPSLMKIIAMVSKNYLQGRTGPDVAPTVNLRSRDHSEVIRHFPISHRLRVIGSFLANQRKTRNLWLLKADQVGERRDGGHYYIRRLEPDVGVAAAKLAKRKRTTMTVVMLAVLYRTLRRHIGVTGEEGLGVGTTVDHRNYLPAKERELSPVANLSGPSRMFVPLPADASLEQVVEVINEQFKRSHKQRLLGLNYPGVILAMPVIRWLLLPVPFGWVHRLLKWGTRSQLRGLVRACGLANAGEMAPLFPEFDGVKVVDALATVSIYRAAGMGFFMSQFNGAVTISLGATESLAKPEVCELILDGISQELCSLEESSRPMPSASTEVPGQ >NZ_CP036299|4792899:4837753|4811264_4812164_-|WP_145303437.1|DBSCAN-SWA MGIDAVNRTDAEVKTSQSNRCPDRELPSWLVLLNRHLAAPMFFAIVLYLAAFHDLVVTIVSRDAVRLDRWSIWVCIAVQVVCLMEYLTYRIICPTISREFAWTVILTPFRLTCRDLGTGSFIWLPRMGWTLPDARLAKLVSATVTVGLTVALLVFLGIILTIANSGESLLAAPKLPVFQLTMKLIWIAAVVECCVLYSIIGRKMFPQHRTTAMILLSPLFSALVVILFRRVILGILAYEPIHRVIWNRRRHIALLERKIADRMFEVQMLKDRIAEERARLSLDRPPVEGEPIESVVEPI >NZ_CP036299|4792899:4837753|4834020_4834329_-|WP_145303502.1|DBSCAN-SWA MIRCLTLLFLAGLVLGNASLSAAGKTLLVEAESFDDLGGWSLDTQFINEMGSPYLLTHGTGRPVADASTKIKIEEASEPWVFLQRIGVKILPYNVIETCTDS >NZ_CP036299|4792899:4837753|4837251_4837428_+|WP_145303514.1|transposase|DBSCAN-SWA MNLILTPGQAGDAPWGEQLLKGVSTNHVLADAVYDSDAIRRRVKRMWAKACIKPKANR >NZ_CP036299|4792899:4837753|4833136_4833682_+|WP_145303499.1|DBSCAN-SWA MAVFHAITCLFNPCQYQRIWRNYDLFRERLDAPLTTVELSYDGTFHVPDAIQLLGDPAKHTLWQKERLLNLAIEAVSDEYDDIAWIDADVLFTDPRWRERTEEALSRSPVVQVFDHVALLDAHGEVHEIRPGVVAKLSKTHDPSVRFAHPGFARAARREALPAGLFDQNFVGGGDTTMLHG >NZ_CP036299|4792899:4837753|4804915_4805158_+|WP_145303415.1|DBSCAN-SWA MHQSLQRLRGRRVAVDQFTEEARLSEELSLDSLDLLEIRFDIEEKWKVELEDKEAAALVTVKDVIDLIQSKMSTNLDEKP >NZ_CP036299|4792899:4837753|4823185_4824403_-|WP_145303470.1|DBSCAN-SWA 
MAKIAVFTIPAWGHISPTLPVVKQLVEQGEAVHYYISEEYAERVRDATGADVRVVPRLVDHPVDKLPKSLIGLAGLVLQQSSAWLPNAIEPLRDENYDLILFDRVCPWGRFVADILGIPAVSLFCTVLGMVGGGKPSVHLRYLLMELLDVRMYFQFLKARGQLKKRFQLKKLGFLDLLTNTAEKNIVFTCREFLRHPNQPDASYIFVGPSVVEPATVSDLSITSGDRPLVYVSLGTVVDDAPGFYRACIQSFKDCNVTVYMSIGKRTAISDLGPIPANFVVRNFMPQIEILKCADLFISHGGLGSVNESLHFGVPLLMWTQNPEHAYNAGCVVRTGAGAILKDSDLKPDSLRRRTLAALSDDRLKKAALTMKQVYQLAGGPPRAAEVIMNEILHHANRSRIAGGK >NZ_CP036299|4792899:4837753|4796811_4803642_+|WP_145303408.1|DBSCAN-SWA MLTPSLITTNDRLCLAEVLAWRAEHFGDRRAFSFFRRLEDGCISITYAELYARARAVASLLSSHGISGERAVLLFDSGFEFIEALLGCWLAGVIAVPAPLPAPNKRSSRLEGILLDCDSKTILCTAATREELRLPAMDERFEVIAIGGSEGFSTDRPDIARFERPLDSHLVTTSPYATNPESVALLQYTSGSTGTPKGVRVTHGNILANVAAMSDRLGCTPADTGLSWLPMFHDMGLIGGVLLPLYAGFETVLFPPVSFIERPLAWLHAISCLDVTITGGPNFAYNICSRRAAAQGTAGLDLSSWRVAFCGSEHVCVETLQQFAQDFSKVGFHKEALLPCYGLAEATLFVSGRGRGELFGTIQVDREHLSHGRIVIKEADQTTLGTEPHGDLAPKNFKTLMSCGSAASGHDIVICDPKEDRLAPARSEGEICVAGPSVTPGYWSSEHKFVTLPVQDGQAQEGQISNRPGNSTRQYLRTGDLGFMHGGELFVTGRLKDLIIVRGVNICPHDIEEAVAGAHAAVRAGGIVAFSVGEHGHEQGAVAFELERSFINRVDLDTLFNAIHATVAEETGVVLDTIVALKPHRIPRTSSGKLQRSLCRRLYTESRLEGILGQSRRRSASTSEKLSPKRTHISGHIATKSQIDADPIAVVGMACRFPQADSIDDFDRLLSESRDAITSVGSNRWDPADYQAADGKELPIRFGGFCEGIDQFDASYFGITPREAEKMDPQQRMLLEVSWNALLDAGLGGPQLNGTNMGVFVGVGNSDYSKLNVLASPNYGAVDAYSGTGNAHSLAANRLSYTYGLRGPSLSIDTACSSSLVALHYACQSLRTHECESALVGGVNALLSPAVSIAFSHARMLSPDGQCFAFDTRANGYVRGEGCGVVVLKRLSDAERNGDRVLGIIRGTAVNHVGRSRSITVPDAEAQRQVILTALAASGLQPDDLDYIEAHGTGTPLGDPVEMDAIKSVFGHREESRPCYFGSVKSNIGHLETAAGIASLIKVLMMFGREQLYPQRNLKSLNPECDLYGTCLQPALEVCTWPLPARTRRAGINSFGFGGTNAHAILEGPDSSAKPDIFAKPEIANPLTVGPVLEPQTSNRPVHLLRVSAYSEVAARKAAADNAAWLARHVTDETLADFCYTINTASDMEEYREVAIASDVASMQANLSLLSSQEGSFAVSKRLTGDAPKIAFLCTGQGSQYLGMGRELYRSQPVFRDRINCCDRILRGHLDRSLISILFGDKDGQSEIGLTQYAQPALFSIEWALGELWQNWGIRPAAMMGHSVGEYAAACLAGVLSVEDALLLVAERGRLMATLPTRGGMISVMASAEVVRELLANSGLEISIAAENGPRMCVISGTQADLHEITAQLRGKRIATQVLSCSEGFHSHLVDPILPQLGLAASRVQHAAARVPLVSNLTGALLAEGQVLDAGYWQEHAREAVLFEKGMRTLVDMGIDCFLEIGPSPTLLALGRTCVSSTRLRWLASLRTGQPDWQVIAKTLEGLVSLGAPIDWRAFDKPYAR
RRVRVPSYPFERQRYWLPETTPSAATSGSASVNHPSLGKRVSLTDSSVVFEARLNSTMGDQANEHRLLGTPILPASAVISMGFAACRAAGQLGPLAISDLSIEKPVGWDNSADRVLRTEIAAKDSSVLSIVISSHEPNGIAPLVHATANRSCALPPHSPEGPLELAAITSRCEDAVEADQFYTSLRQFGVDYRLEARHLTGIRAGQIERLACYQFDSAFGTAKTLSPVMIDTGLQLLLSTAASNDDATVHSPYVATRIGQAFLECRSSNQVWIHATVDHATATTGMVSLLEQPLRGHVSFVDENGHALLRLVDVELQPIAQKFPTEPQSVLNPSLSREIRQPVNSLLQSEIDRPPARWILVCDRGGLAESLAYQLRELDNEVSLIELAGHYGHNSKAAECALTACGSPLPKAPIGLKEHLAALVASLPKASVRTHLIDFRSLDLPANEIAPSSQRSNPDWPDDQTMNELAILREKLDGEHQLAGMLAVVTRCTLNSKVASPTNTRCSHDGLPSIESHYPQILSQADLRLIQIGADGNADLPSLLAELVFGEPRRVVCLCGSGRRKSSRPLAENGVGVVVPIQLEVGVFETTSNGSPIREASGRSLSESALGGSQTRHGLSHGSRLAISPSTVEVNGDAGWLDRLLSNEVPEKRHALLQTHFLDMLTRVTGQDRRLIKPADNIRRLGLDSLMMMELKNSIEMSLGITLNLTLLFQDPSIERLVSFSIDLWNQSKGNDRKSSHPRPSKKPVAQGASE >NZ_CP036299|4792899:4837753|4814033_4814267_-|WP_145303446.1|DBSCAN-SWA MEIESRIKKIILSNVEVPESELVPEAKLRSDLGATSIDMVEIVATLESEFDIDISDEDAQNVRTYGEMVEFVKSKVS >NZ_CP036299|4792899:4837753|4829352_4830846_-|WP_145303489.1|DBSCAN-SWA MGNEKLSMKSSALPTTAPGKLSDSEMLKVRQLDLLKRIPKRFPATFADMSQFMVRQANTQHGNLIVRFDRHIDSSRLEQAMRLVIYAEPILGCRFVEHWYRPYWQRRDDVDCMALCIVMNVENPESALTEFMEAEIDPSKDPLIQVSVFRSKCQDTLCYKMNHLVGDAPSLLKIVLLIGEYYQKLSVEPSFLPEPNLRSRDHSEVLKHFSLSEKIRVARSFLANQRLTRNLWLLDVNDVGVDRVKGRYFIRKLDRSISDSIWRFSKQHRATVTVILLAAVYKTMRRMFPATGDDGLGVGTTVDHRNYLPAREREVSPVANLSGPSRMFVDLPADASLGEVVEVIKEQFKEALKSRLLGLNYPAVFLSMPFVRPLLLMVPFGVLQSILKRVAGPRLFGRTRAVGIANVGNLAAVLDVFGDVKPIDAFATAAIFRTAGIGFVVTQFKGTTTIAIGATESLVRADVCEMILDGIEEELLSLNCEELSAMDARGNGGFEKV >NZ_CP036299|4792899:4837753|4836529_4836820_+|WP_145304685.1|DBSCAN-SWA MLDAEQQALLAQRIEDGPRANDPCSLRGINFRNFIESRFGKRLAISTFYNLLHKLGYEPLVPCPRHRGHDHAAAAEFQKKSLRTSPASKSSILENG >NZ_CP036299|4792899:4837753|4793502_4793721_+|WP_145303384.1|transposase|DBSCAN-SWA MRLEVVRHTKAKKGFSLLPRRWVVERTFAWLGRSRRLARDYERLAKVLAGWHWVAMVALLVKCIPEIIRKNP >NZ_CP036299|4792899:4837753|4814512_4815754_-|WP_145303451.1|DBSCAN-SWA 
MGFWLSMRIAIRALSKNRLRAGLTILGIVIGIAAVTAMVSIGLSASQLVQAQLETLGTNMLAVLPGSMQKGGIRTVDVGSLSIADVVALGESCPSVNAVSPLVGVSGQVVYRSANWNPQTTFGVGKDYQIVRNWNLALGSFLSERDIASSAKVCVLGQTVVRELFQAEDPIGKTIRVKNAPFQVVGVLEPKGVSMFGQDHDDIVLIPYTTAAKRLRPGGGSNVDAILVSTSSPSLATVAAREAEALLRQRHGINPGRDADFRIQSTTEIALMLQVIMGAITLMLTAIAAISLVVGGVGIMNIMLVSVAERTREIGIRMAIGARSSDILRQFLIEAVVLSCLGGFIGIALGLAGSAGLTMLVNWLTPNTSWPYIISVEAAVVALLFSAMIGVFFGYYPALRASQLNPIDALRYE >NZ_CP036299|4792899:4837753|4808760_4809978_+|WP_145303429.1|DBSCAN-SWA MFRRIAITGMGIVNSLGNSVPEVLGSLRAGKSGVSIVDEYRELGFRSALAGTLKDFQPPEIDRIYLRQMGDNGVLTTAATFEAIRDANLSDELIQHSRTGVIVGNSGTYKETYQLCHQRRDLGKKITGLALPRAMASTVSANLSVLLKTKGHCFTVNGACAGAAVAIVQAAWAIRLGLQDRMITGGVHWGSWEFDCLFDALRVFSRREDNPTAASRPFDADRDGLVPSTAAGMVVLEDWEHAIDRGAKIHGELIGIAVNSDGQEMTTPSGEGSGRCIRLALDDAGIGPGDIQYINAHATGTKLGDEIEARTIGEIFGQTPYVSSTKSGTGHEVSAGGATELIYTLLMMKHGFIAPTLNLERIDDQCAMIRHVPCHPIEADIQLAMSNSFGFGGVNAVLLVRRVET >NZ_CP036299|4792899:4837753|4827228_4827948_-|WP_145303481.1|DBSCAN-SWA MVTGASRGLGRAIALELARDGLLVIVNFRSDAEGAAQTQEAINALGGRCQIAQFDVSDASAVRTAVRNIDRDIGRINVLVNNAASGVLKPLARVKPSDIEQSLQVNLAGVLHCTCEVVKTWSGGCSGNRIINITSSAAEAGAAGSVVYCSTKAGVIGLTKSLACELGRKDITVNCISPGLFETESISLHGVNKARLLELTPLARAGRPEEVGFLVSFLASERAAFITGQVIRINGGVYM >NZ_CP036299|4792899:4837753|4812309_4813110_+|WP_145303440.1|DBSCAN-SWA MQDPSQVTIFQQRLVAMNRFIAPLFFVLSLGFLALTYHGLRSVGQWEGPLPETTALWGLIWLYPPFIVEACFYAWSGLPGGRQYRWSALLPPLRLTRRDLEMQHLIWWPLRNWKPADRQLARHVIVCEAGILWFGGIALIATAAISSASITSVESFTNHFVIDAIFGAMWVVVMADTCVLLIILPRARRKSLVITCLPILASPVVPLLVVAYLPRLVRRTMAWNELHHYGIRDPAERLRLLDAQLQRRTAEVEELQDQLRREQQLN >NZ_CP036299|4792899:4837753|4836871_4837210_+|WP_145303511.1|transposase|DBSCAN-SWA MSEPSVDPGRTAADNQLLVEGVIFVLKTGIPWSDLTQRFGSRNSVWRRFNRWCKKGVWQKIARTLEDPELEQVQRDSTTIKAHPGPATGRRRAGEKKSTPTSAATWAAAAAD >NZ_CP036299|4792899:4837753|4821509_4822022_-|WP_145303464.1|DBSCAN-SWA MVMCRLVWSHGLTMQRQMYEDCRVGEKAISDGRTITEADIVAYAALTGDWNPIHCNAEFMSEHPFGERIAHGLLILSISSGLLFRVVGYELLPLKNIVFVGLEQVRFVNPTRIGDTLRMHAETIEMKPVNAQSGLVTVRIKMVNQRQEAVLTTRAKFAVPRKRLDASAVV 
>NZ_CP036299|4792899:4837753|4807522_4808743_+|WP_145303426.1|DBSCAN-SWA MRRVVVTGCGIVSSIGIGNKTVATSLREGRSGLQFVPEMKAYGLRCNVAAPIVGFDEAIFQEDGKPRLSRAAQYGLTAVFEAIAEAGLMPHSEPLNSAAVILGSGGGGQSCLPETALRSDANPIDEPGIYELRRQMNETAAVAVAQRLGAEGRVSSNSAACATGLYNIGFGYELVAAGLHDCCICGGVEEQSWQRVGVSADNSLGMPTTFNETPKAACRPFDRDRQGFIISEGSAAVVLESYDAAMARGAPIYAEIVGYAAANDGHDLYVANGDAMRRVVLSALSDAAKLGVANIDYINAHATGTPIGDAIEAGVIREIFGTGPAVSSCKGVTGHSQGAVGAQEVVYTVLMLRNEFLAATANLEHPSDDCGGIDHVRTRRDVRIDTALTMNNGLGGTNAAMILRAL >NZ_CP036299|4792899:4837753|4822071_4823064_-|WP_145303467.1|DBSCAN-SWA MNGNSPVSSKSLGILEAASPSAELRPIFILGEHRSGTTILYKLLGLTGCFNITTAFHVLYSDRLRALHAEGAMAGAQQELSDIFASWDISTRVIDNMPLEPGMPEEYGMHLFRKTASPKLKRSNLGVFLDFCRTIQEVEDPSKPLLLKNPWDFSNFRFIKSVLPNARFVFIHRHPIHILNSQLKAMQTNQQSGNAYVKFLVDAPGFLQTLHKSKFVQRLVKWATSPNNRWRLGMRHLASSGEKARCEFVDRIAELPTDSYVSVRYEDLCDHPEQVMQSILSFVQATSSSGVDYGEWIQPRPTRLLPEVEEWESRLLRSFEKSMRYHKYIT >NZ_CP036299|4792899:4837753|4837428_4837593_+|WP_145303518.1|transposase|DBSCAN-SWA MRKRCVQERYQHRNIIERFLGALIRFRRIATRYEKKAMNFAGFISLAALLTKPF >NZ_CP036299|4792899:4837753|4809974_4811231_+|WP_145303433.1|DBSCAN-SWA MTQLPHSAAVSPTGRRVVLTGLGILSPIGVTCDAVCAHLQAPCHPEEASASAIPNDGFLGHLKGWEPESDEKLGRLSVGAQYAVQAAVDACGQARLDSSSVDTARVAVLMGTMFSSMAEVVRIKQLLESGKVRRAGVRGTTKLMNSGPAVNIAASLGLRGPVMSNSTGFASGMDNIGCAYEWVRDGHIDIAVCGASEENCAPFLGKQFTLWERSPRGVDLNREVTIRPFDARREGSILSAGSGVVVLESAKHALRRGARPLAEICGYASCFDVEPGPDSFSTALTRAIERVCRHAEHSGVNRLRQVVSGAAGFHLADADHVLAIRNSLGMSARVTSTAGWLGHGLAVSSAWNVVLASLMLSNGFLIRCRGLTEPGADCRGVEYVMNGEIASDPGLLITAGGLGFASCLALQRLSPMSD >NZ_CP036299|4792899:4837753|4837606_4837753_-|WP_145303521.1|transposase|DBSCAN-SWA MSTTRLPDRFAELTDPRRRAATHSSISVVFIALCAVVCGYDDLLAIST >NZ_CP036299|4792899:4837753|4813184_4813940_-|WP_145303443.1|DBSCAN-SWA MALIDLDGIGKLYRIEDVEVHALRSVSLQIQKGEILALVGPSGSGKSTMMNLVGCLDRPTSGTYRLDDVEVEKLNPDALARIRNQKIGFVFQNFNLLSRTSALENVELPMLYSGKLTTRQRRQRAAELLQMVDLGDRHDHHPSQLSGGQQQRVAIARALANDPPILLADEPTGNLDSRTGSDVLKLFHRLNRENGITVILVTHDHNVARATRRQIVLRDGDVLIDTTDAEEAIRAINVYTQESLKGEPTAE 
>NZ_CP036299|4792899:4837753|4803587_4804883_+|WP_145303412.1|DBSCAN-SWA MVPSTTVEETCCTGGIRVTPARRVVVTGQGAVTPFGMNADSLWAGVSQGRSSIRTITGFETSGMPVTFAGEMDDFDPCEVLGRTLPATGDKPLKMVFVAADEALRQAQLLSSNKCLEDLLVHTVLGTALGACFELEYSYGCFHKLGWKGIRPTAVPKSMFNTYASQLSIEFGLKGMNQTIACACASGAAAIGHAYQLIKHGMADIILTGGVDSPLCPSMFGAWTNMRVLARHEEPESASRPFDRDRGGLVLSEGAGMLVLEAEDHALKRGVAPLAEIRGYGATSDSAHLTAPTIEGPVRAMRAAILDAGLRPEDIQYVNAHGTATQANDENEARALHDVFGNRGWTLPISSTKSMLGHSMGASSALEALICINTLRYQWVPPTLNCEHPEWPEFEFDFVPGQGREHRVQNTLSNSFGFGGTNCVLVMSRPE >NZ_CP036299|4792899:4837753|4793848_4794874_-|WP_145303403.1|DBSCAN-SWA MSEGRATTAWSQRTSSLLGVQIVSTGSYVPDQIVTNSDLQNLYGCDPGWIEQRTGILRRRYAASDQATSDLCIEAAKRAIFSGNIDPQSIDLVVVGTFSPDYLCPTTANLVQHALGLDAAAMDVQAACSGFVYALTTAAQYIATGNSKMALVIGGDCNSRIVNPYDQKVAPLFGDGAGAVLLTKGDAEQGFLCYQIGSDGSGSDLLDRKCGGSRFPFTPDAVAEGEHFLQMDGRSVFKWAVNALTQTVELVLGKSGLSVEDVALFILHQANMRIINHATQHLGVKPSQMYNNLQEYGNTSAGSIPIALDEAFQKGLIRRGDPLVFSGFGAGLTWGTAVFRW >NZ_CP036299|4792899:4837753|4806302_4807451_+|WP_145303422.1|DBSCAN-SWA MSDVYISGVSSFLPNDPVDNENIENVLGRINGRSSKVKDWVLDYNGIRTRHYALNPQTMQPTHTNAEMTALAVRQLLTNHNLSLSDIECLACGTSSADQIIPNHAAMVHGELGCPPCEIASTTGVCCSGMTAMKYAIMNVKSGLAKMSVATGSELASLSFRASRFTPQIDRQITEFNQEPMLAFENDFLRWMLSDGAGAVLVTPQPSAEGCSLRVDWIEFRSFANDFETCMYFGGIKTPDGNLEGFRIVDDPLELVGRGYLSLAQDVRVLRSNLPKTVRDTFVHCQSKYGLEEDEVDWILPHYSSEGFREPLQKGLKEVGFNLPEDRWFTNLHTKGNTGSASIYIILDEFISSGRAQPGDRVLCFIPESARFTMCFMHLTVV >NZ_CP036299|4792899:4837753|4817929_4820083_+|WP_145303457.1|DBSCAN-SWA 
MSRTGLLDQHDFKPVGRLESTVMDHTRIGILAWFGRWLPAIRPSGRLLCRIQRRFSLFVRQSGGRCLATASLLGMLTPTPIAAQEERPWKLILPEQRSIQLQTPPQIPVDVPMGGPPTTVTSPGERVEWLLSLDEAIHIAIQNSEAIRVAGGASVRVTGQTIYDVAIANTKIDAERAQFDPTLGIDNTFVRQDIPTAGLIVPNPRQRRISGFTRDGYNHATGIEKLNPLGGLTRFDIETNVQRTKPTIAQLNPLTTTRAGVSYVQPLLLGAGQDVTMAPIRIAQLETDRTFFQFKNSVQDMVHAVIQAYWRLAAARITVWATEKQVEQSRFASERAEARMRAGLADSAEVSQTRLAYYNFRSAQIVAKNAQLDTENLLRNLLFLPPESETEIIPSTALHTETMNFEWNELLAIVEECRPDLQELRTTVLADNQRIILAKSQALPQLNLVSAYGWTNTQGEQRNRNGVVSNFNASGSRYTDWTLGVNFDMPLGLREGRANVRGQELNYARDLANLRTRSHTAVHDVASAVRAVDRNYAEYQSYRMAREAAALNLEQQRIEYDNGRTIFLNLLLAISDWGTSVSQEIDALSRFNTELANLERQTGTILETHAIYFEQEKMRTTGPLGPHHPVDYPRDLRPTPNAPRYGAEPTPPEKKYFDEDSIPTTEPTLDPTETGKASVDPPTKSGWAASLTDTITGRRFRQKNSADRSRATSEEIP >NZ_CP036299|4792899:4837753|4805154_4806183_+|WP_145303419.1|DBSCAN-SWA MTIAYPSPNPPLQSKVCHSPPRHVLAICYSQSGDAARCAKAFLTPLKEAGAVVDEEWIKPIPAYPFPWKSVLRFFDVMPDCILGKAPSIEEPQFDPDLPYDLVILFYQVWFLAPSLPLVGFFNHPKSRVLNGRATITVVVCRNMWMVATAEVNRWLAKLKATHLDNLVVTHQGPIWATFITTPRYLLFGRRDRLWNLFPEPGVGESELGRVREFGNTLAKQLDQLVPGRTQPFFTGLDATRIEDKYILPELIGSRLFRFWASVILAMGRLGGRYLRSVAVGLFVLNLVCGIVLGIPILFVFRILAYPLLRPVLQSYKMEMRAPSEPRPVDAAKHPETIDVIG >NZ_CP036299|4792899:4837753|4828438_4829260_-|WP_145303485.1|DBSCAN-SWA MMKSEILRCPTCGDAHSEVRLVYGLDKVVSFCCEGCGYHCDGVDGIVELVGKDQIIEDYPEMEQQTRLGARLYDMLEKEFAEIVGLEPDRIRSEYLDALELDQGSLILDVGIGTASEIRYLCEKNPEMVSSWSVCGVDISVEMLRVARKKLKSAGLDYILMVGLVENLPFQDNSFDVVFHTGAINEFRDQRLALSELLRVAKPGSRVVVTDEWITHENAEQAIGKELARTFPSITIPSPTPFEAIPPSVSESEIKTLWNGYGYSMIMRKSMEK >NZ_CP036299|4792899:4837753|4815809_4817054_-|WP_145303454.1|DBSCAN-SWA MYTAIKVLVVLALVGGASLAAYIYAMDIQFLQSDLTFREVELTRGSLVSVIRSGGSIKPIVTVNVGSFVSGPVREVLVDFNEKIEQGALMARIDSKTYEAAVARDEAILATSKADLKRLDVLLEQAKAEEARTLAVKNENTAFISVAELDAAKFNRRSVEAQMEIANAAIKQAEATMRTSRANLEYTEIRAPVSGIVIDRRIEPGQTLASQFQIPELFVIAQDMDREIRIVVSVDEADIGAIIAANQMKRPVSFSVDAYPDEEFKGTIKQIRASADARQAVVTYPVVISVTNPDARLLPGMTASVNFEISAREKVLLVPNAALRIRPDPEKVVPEQRAAYRTLLDLKNRAKDHIGGALTADGKKAAEKRLLWQPAGKMLRYVEVSIGENNDQSTEILDGPLKEGDRVVYEISGT 
>NZ_CP036299|4792899:4837753|4831121_4832162_-|WP_145303494.1|DBSCAN-SWA MKGVTFDRHGGTEVLQYRDDLPIPEPGYGEVLLRIKAAAMNYNDIWARQGVPGMDFPLPHISGTDGAGIVEALGPNVTTVAVGDEVVVNGAFSCGDCVECVRGWPMFCPKFRIWGFETGPNQGSEAEYATVPARNVIPKPGNTSFEEVAAISNVLATVWRMLVTRARMGAGDCILVWGAAGGIGTAAIQLIRAFGAKSIAVVGSEDKAKLVSDLGADYVLNRKSQRITREVMKITRKRGVDIVFEHSGAATWEESTHCLRWGGTIVTCGATTGFTAPLDIRFLWTKQQNYLGSHFATTAESRDSLRFVESGQVKAVIGELLPLKEIVRGHELLASGGVAGKIVLVP >NZ_CP036299|4792899:4837753|4834985_4835810_-|WP_145303505.1|DBSCAN-SWA MIQNTDCTVDSTSIRSALMCGLVFVSGIGCGYQPDVSQPSRDTKVEGVVRDASAYVDIVGVSSKNPDDQTARSPDLNSSGSSPSRRSSESKGEGVSHQARALAEILFQGNCAKFLGKASVREQLGLSEDQCRRLAGLEDSVKNAVNDLRKRPVAELPMMLREYEQAGQEIEKTIDQTLTTEQVKELRREVLSGVQGPLVLLAPHVIQKLNLNDSQIDSIAAIVWQDFEQLKKASLWELPTLKSRSRESRRQAEMSLSPAQLEILHELVRKSGSK >NZ_CP036299|4792899:4837753|4792899_4793034_+|WP_145304682.1|transposase|DBSCAN-SWA MEQFCPDKSSDPGRTAADHKLFINAVIFTLKAGITRQDPPERFG >NZ_CP036299|4792899:4837753|4820668_4821436_-|WP_145303461.1|DBSCAN-SWA MSSSPGVSDSAEPWFSPSPHGDALGRLFCWPFAGGGASLFNRWQSSLPSAWQVCPIQLPGRENRLAEAACRGLVDLAGQVTTQLLPWLDRPFVFFGHSFGALLAFEVTRQLRRRGSPLPSKLIVAAFCSPEISLFRQSVSQLGNEELLHWLRETGGEMDKPLANLEWQDLLLPTLRADLEAIEGYRYTTESPLECPITAFVGRGDKIAPWNQMIGWRRQTSGSFNFHVIDGQHLFLRTAHTPLINIVRKELPALS >NZ_CP036299|4792899:4837753|4824519_4825128_-|WP_145303473.1|DBSCAN-SWA MESFVRFCQKVQYVSDPSVPLVLKSCPDYRNFLFIKSKIPNAKFVFLFRHPLRLLDSQLRIFQKGEANKLMLMMSKRYSITESPVVKRLMKGAFSDCFGINPGHSWLIHRMKVAEGQLFKDLPFVERKDYIFVGYEELMVDAQSHVQRILAHVGETCPADVNLNPYTQPRKFELLPDIQKRQEYLLRLFQKTRDLVGFDEEV >NZ_CP036299|4792899:4837753|4836292_4836502_+|WP_145303508.1|DBSCAN-SWA MKVEGHHTADELQDLLPRERRASVAIRIRIVRQAALGGTAPQIALECGLSRRSVQEWVERYNTQGLPGL |
37 | Paenibacillus_phage(50.0%) | transposase | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
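The keyword-based flagging described for the prophage region can be sketched roughly as follows. This is a minimal illustration, not the database's actual implementation: the function names are hypothetical, the assumption that an optional fifth `|`-separated header field carries the annotation is inferred from the headers listed above, and plain case-insensitive substring matching is a simplification.

```python
# Keywords taken from the caption above (lower-cased for matching).
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def parse_header(header):
    """Split a '>contig|region|location|protein_id[|annotation]|tool' header.

    Assumption: the annotation field is present only when the header
    has six '|'-separated fields; otherwise it is treated as empty.
    """
    fields = header.lstrip(">").strip().split("|")
    annotation = fields[4] if len(fields) == 6 else ""
    return {
        "contig": fields[0],
        "region": fields[1],
        "location": fields[2],
        "protein_id": fields[3],
        "annotation": annotation,
    }


def matched_keywords(annotation):
    """Return the phage-related keywords found in an annotation string."""
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


if __name__ == "__main__":
    headers = [
        ">NZ_CP036299|4792899:4837753|4837251_4837428_+|WP_145303514.1|transposase|DBSCAN-SWA",
        ">NZ_CP036299|4792899:4837753|4825633_4827094_-|WP_145303477.1|DBSCAN-SWA",
    ]
    for h in headers:
        record = parse_header(h)
        hits = matched_keywords(record["annotation"])
        print(record["protein_id"], "flagged" if hits else "not flagged", hits)
```

Substring matching can over-match short keywords such as 'head' or 'tail', so a stricter word-boundary match may be preferable in practice.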
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|