Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP023398 | Pseudoalteromonas spongiae strain SAO4-4 chromosome 1, complete sequence | 1 crisprs | DEDDh,cas3,DinG,cas6f,cas7f,cas5f,csa3,WYL | 0 | 0 | 2 | 0 |
NZ_CP023400 | Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence | 1 crisprs | Cas14u_CAS-V | 0 | 7 | 0 | 0 |
NZ_CP023399 | Pseudoalteromonas spongiae strain SAO4-4 chromosome 2, complete sequence | 2 crisprs | WYL,DEDDh,cas3,DinG,RT | 1 | 2 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP023398_1 | 2090872-2090954 | Unclear |
NA
Consensus repeat of NZ_CP023398_1
|
1 spacers
spacers of NZ_CP023398_1
>1.1|2090895|37|NZ_CP023398|CRISPRCasFinder ATAATCATTGTGTAGAATTAATAAAAAAGAAAATTTA |
cas6f,cas7f,cas5f |
CRISPR arrays and Neighbor proteins around NZ_CP023398_1
The CRISPR arrays of NZ_CP023398_1 >merge|NZ_CP023398|1|2090872-2090954|CRISPRCasFinder CAGCTACCTATGCGGCAGGTCACATAATCATTGTGTAGAATTAATAAAAAAGAAAATTTACAGCTGCCTATGCGGCAGGTCAC >NZ_CP023398|1|1|2090872-2090954|CRISPRCasFinder CAGCTACCTATGCGGCAGGTCAC ATAATCATTGTGTAGAATTAATAAAAAAGAAAATTTA CAGCTGCCTATGCGGCAGGTCAC
>NZ_CP023398.1|WP_157813494.1|2089769_2090510_+|hypothetical-protein MKKTSFSILALGILATGCTSDTDSDLLKTKAIQAEVLVSSNGERTKVNVELNSGNSFGSNVRLSNGDKIYAEVHNKRIQLEEDTDILDIDYEGRFSGSEESAEFKVELIRSSAQNASSTVELPLNFKILAPNSNSEIRYNQSVNVLLDGIDVGSKNEFTLNYICDNNNGGTTSGSASRHFENKSSLTLNLHTLKLLENVSLNSVKNCELDVIINRYKVGNISSEFADSSQIRAEQTRKIENITFKI >NZ_CP023398.1|WP_100913717.1|2087125_2089204_-|EAL-domain-containing-protein MDANCRIFIIDDEEINTLQLEHVLQECGEVLSCNDSSKALSLIDTFRPDVIVLDIEMPKVNGFDILKELRQKHYFNDLRIVVITSHSDIEIEEKALSLGAIDFISKPLNLTLCRMRIENHVNFKHQENMLQIAQSSLQSEKEHLRITLDSIADGVISTDDNARINYMNPVAQRLTGWSLIAAKGKHIEEVMSLNDATTGSRSLNPLVMALQENRPVAMAINTQLTSRQGITHRVEDSAAPILNSSDEQCGAVMVFQDVSETMAMAVQMTYLSHHDQLTALPNRVLLHDRLTQAITRAAFSENKLSLMLLDLDKFKYINDALGHHVGDEIIAHVGHSLDKFACNDITVARVGGDEFALVVPHTSTLSNIEPLIEQVLNVIGTPFRIADEQHVLSASVGISVYPDDAKSVEEMLRHADSAMYKAKHEASNSYCYFNDELQLAMNERLRIGNLLRKALDENTLEVYFQPKKDLATNEIVGYESLVRLIDEEGKCVSPLQFIDYAEETGLIYRLGAQVLEKSCIVAKKWLDAGCPKQVAVNISAKQFSDSSLVELVASTLEKTKLPSRYLELEITESALIDCYEQTVTQLKSISKMGISIALDDFGTGYSSLSYLRLFPLNVLKIDQSFVRDMLTDSQALDIVSTIVELAKTLNLKIVAEGIETTEQKQKLQSLGCKIGQGYLLGRPTPFEKNHKQ >NZ_CP023398.1|WP_100913716.1|2086193_2087114_-|diguanylate-cyclase MLLNIKPFSECKVLIVDDEELIRLSLSTLLETEFQVDSLGSGRAALAYCEDELPDLVLLDVNLPDMSGLEVCEKLKQQPAFRYIPVVFITSSTDESLQDKCWEVGASDFIFKPIVASTLVHRTKNHLTNKLHLEKLFEYSLKEPLTGLYNRYYLMEEVKNVLAQSKREKRSFSLLMIDIDEFKSYNDHFGHVKGDECLVIIANLIKEELKRPHDVAVRFGGDEFLVVLPYTDYHGIMKVCENLKEKLSDCLLPHPLSPHKTVTLSIGGVLFEKFCFLTLESMMSLADESLYNAKKAGKNCIEVAKI >NZ_CP023398.1|WP_100913715.1|2084831_2086100_-|extracellular-solute-binding-protein MESSFNRRKFLKTVGVGAALTSIGFKAPYVLAKKQVTLRVMGTHVTLQEELRQRAMRELGINLEFTPMGSAAVLQKAAADPSSFDLYEQWSDSINILWQADAIKPIDTDRLIYWDEINALTKEGKIVPDAKIGAGDSPNKLLFVQPDGTLGKQSSTKISFLPYVHNVDSFGYNTKFIDKGAPYKTESWAWLLDESNRGKVALVNAPTIGIFDVALAAEGAGLMKFNDIGNMSINEIDSLFSILLAKKREGHFAGFWTSVPQSVDFMKSNRVHIQSMFSPGVSACKGNGIPVIYASPKEGYRAWHGVMCMSKNVTGYVEEAAYKYMNWWLSGYPGAFIARQGYYISNPERSQPLMSIPEWDYWYNGKEATIELKGTDGKVSVHPGEIRDGGSYIKRFSNIAVWNTVMNNYDYSLDKWYELLNS >NZ_CP023398.1|WP_100913714.1|2082563_2084822_-|response-regulator MFNKIARSIASKISLALVVVFVILAISYFIMSQRLNSIEQSVESTAEISNYALEILRISKDIVELQRDINVYGVSGSASIFDKIKENFLSVETRLNSIATVKVHPSEQIYVDSMLELIVRYKSNLTTFHKRHEIRINLLEQALPDTYFNAVDLLNGLNAVAMTPTEKHYSLELMNKWHTLHHDALLYLNKKDYSKRKSVENIFKELNNLNQPFNNNSFNATLAEVKNILDQYAKIFRQSIQANRNYLTLVNVVMSGDSIEFSSLADKLRESSLQRLSKIKKQSEKSINETELILSFLSIGVVCYLVLLAVFIQSHVSFALKRLTNSFSSFLEGDFSAPISDLERQDEIGKLALAADQFRHLSNALNDAKLEAESVSKVKSEFLANMSHEIRTPMNGILGMARQLEHTPLTEQQSDMLNIIQSSGDSLLVIINDILDLSKIEASKVELENTPVDLTRLLNELEQLFVYQASDKGIQLFVSKSVDEEVLFYADKTRLKQILMNLLGNAIKFTEIGSVSLNVYVQNLNNELVLSFSVSDTGIGISDDNIKDLFDAFSQADTSITRRFGGTGLGLTISSKLLKLMDAPLQVKSELGKGTQFYFDFKTQLAKHTDRLESVTDDIVAESDLSHLSMLVVEDNEINQIVIEALLKEFNITDITIASDGAQAIEECKFNSFDIVLMDMQMPVMGGEEATKHIRKMTNYVDTPIIALTANVLKEDRQRCFDSGMDDFVSKPVSFENLKVVLSKWSNKANTN >NZ_CP023398.1|WP_100913713.1|2079106_2082547_-|response-regulator MKNKPLEQVVLSQKTLVFVVVLVVCFLSLAIYVFEAQITSNTLRELNSSVSENSQLLKTTLQSSVRRSKANLRFLHATPPISGLPRAHFNEGIDPFDGTTFEQWKNRLETIFVGFMQNNPNITQLRVVEVSASGKELIRVDKNGSQIAPVKHYQLQNKSSEPYFLPSSQLEVNQIYTSLISLNKEYNEVTFPYQPTMRFSIPIYSENGKRYAFLIMNIDASQLLNDLKDNTLNYADLVITGPDGDVLFHPNESYLFSKDLGTDTNWNTLYKQPVKYDSLQLAESNLLNIQPAYNYTQKVQIRNGQERGFLNLSIIVDKSYINTQINERRYVTYGVLVAVVIFSTILIMFFHRNSLRNKILAETRREAEAIVDGAIDAIIGFDLDGNVSSLNLAAESLCKTTKGMSIGRHYSEISSLKDLPIAKYISDVSDKTKILRDDYSNDANDGRNYFAITVSPLFSENSKVIGISLIIRDISNEKLAEEKIKRLNYDLEAKVRERTKELATAKDFAVKNNKFKNAFISNITHQLITPLNGIIGPLNILKREPFSNDSMKLIEMIESSASNLNLYINDILDLSKIEAGKLELNYKAVPLKTLIENLIESLSVKVFSKGLEIYLDTTELQCGVVKTDPIRLTQIIDNLISNAVKFTETGGIEIKASTVASDDKNYKLIIEISDTGIGIDYKKKKDIFNRFNNGQSFSDANEEGAGLGLSICKQLCQRMGGDINCVSTRNVGSTFSFFISVENQDETKTPYIACFESENILIISDIEKTKESLRHLLNFEGACVQDSSGHDIHSNINKNYDYVFFDKESFDLIESEQILEEVSKSVKSDNIIILNRPGSPLSKNLVDLYSVVHKPITQSKLKVLVDGEVYRESVIQPSSSIEVTETVSQTILDKLFGSKVLIVDDNDINIEVVKDALSTLPIEIYIANNGVGAIAQLKQSGLNDEPICCVLMDCQMPELNGYETCEKVRAGEAGKEFTDIPIIAMTANAMKGEKEKCLLVGMNDYISKPLSSVQVLQKTIEWSLSNFNNEAKVNTTQVSELWDRDSALKRLLNKEDLLFKICKMFSVKALGRFRELCDAIMQKDQELSRQLAHSLKGMCGEISANGLRELFSEIEIKASQGNFECTNELEEIEESLPKLVHTLQEL >NZ_CP023398.1|WP_100913712.1|2078063_2079026_-|diguanylate-cyclase MFSEVFNRNGVFVPINLKPLADSKILVVDDDLLMRKVLSTILKDICYVDVLSDSTQVINYCTSGVPPDLILLDVHMPNINGLEVCKQLRTLEAMNDVPIIFVTASVEVKSQNECWEAGGSDFISKPIAPSTLVHRVKAHLLNKARLEQLEKLTYKDSLTGLYNRHYLNEHSLAIVKQVQRQNESIALLMLDLDFFKRFNDLYGHQAGDDCLKDAAEAIVESLKRPQDVAVRYGGEEFLIILPYTDLTGLEHVAKNILSNIKKKNIKHAESPFEYVSFSIGGALSHTKSCEYSLEAMIKIADENLYEAKRLGKNQFVMLNN >NZ_CP023398.1|WP_100913711.1|2076992_2077874_-|helix-turn-helix-transcriptional-regulator MKTIFEHTQSNVLVIPQALYAMKDVTVLSHHRNSAIFYKNLEQDLANIEFYTNIPCFIYIENGREVITNSNKGKVELNAGSSIFLPQGLNLLSDFVKETESLKAYLVFFDDDVVNDYLSKVKKPLDGNVGEESFFLIDGGDEFTAFFDSIRLDIKAPTYLNIKLQELLHLIAWKDNQHILNSLLAKKIRVAPKRNLARILNTLDVFHLTVSDMAYLSGRSLSSFNRDFKESYKETPKKWLREKKLIKAKELLVSEELSVTEVAMTMGYENVSAFIKAFKLKYGVTPKQIKLRK >NZ_CP023398.1|WP_100913710.1|2076609_2076900_-|antibiotic-biosynthesis-monooxygenase MSVKVTLNCQVKTEQFQALLPFLESNLPNVRGFKGNIHVSVLFDEENNEMLLDEEWLTIEDHREYLSFIHENGVLKELSSFLSEPPIIKYFNKVEI >NZ_CP023398.1|WP_100913709.1|2076112_2076496_-|hypothetical-protein MNIWIFSAGIISLFTSFVHIFAGQVDPVRPFLKSDFPDLPKATLLGCWHMVSSILVLAGVTLVYIGWFGLDSFQNLVLGISICFIIFSVVFIMVGWHFFKLHTFSKLPQWMLLLPIGVLGLLGIIYI >NZ_CP023398.1|WP_100913719.1|2091075_2091669_-|type-I-F-CRISPR-associated-endoribonuclease-Cas6/Csy4 MSRAYFTITYLPTNGDVTLLAGRCIGILHGFMNKRSCNHIGVSFPKWTDKHLGNQIAFVSEDKAALKNLRHQNYFEMMTHDKLFEISAIKPVPADATEVRIIRDQSLGKLFMGEKRRRMERANRRAEARGEEYIPQYVPNDSEISAFHKIPIVSKSNRNDFVLHLRLELANSIQNTFNSYGFATNEEYKGSVPLLAF >NZ_CP023398.1|WP_100913720.1|2091665_2092688_-|type-I-F-CRISPR-associated-protein-Csy3 MKLPRQLTYIRSLSPGKAVFFYKTPESDFEPLQIERQKIRGQKSGFAEAYKNEVTPKELAPQDLAFGNPHTIELCYVPPTVQQLYCRFSLRVEANSLEPNVCGEPKVSYWLTRFMNTYKQHDGVGELAKRYSKNILMCEWLWRNKTSPNVDLEIIGEGFDPISISKANRLRWDGKWREAEDALHTLTEVIRTGLEDPYSFCFLEVTAKLDTYFGQEIYPSQSFAENDDVARTYAYTQVAGKDAACFHSQKVGAAIQMIDDWYNEDANKRLRIHEYGADYKNVIARRAPSNRLDFYSLLKKVALYVKEMEQNGLKNQEQTRHIHYIAAVLIKGGLFQRTKE >NZ_CP023398.1|WP_100913721.1|2092687_2094739_-|hypothetical-protein MELHTLLQNTPSKELIQTLRTAFAPYSEHVDITGSEPQALVILINLTYKRKDIKDLTSMSAAKAALRDEQHLKTCIDEVNWYHTHNLKYPDIRVSHQRLLAQVVDDGFRVVSSTSCPMQFGWSHNSAKINHAKLFLTGFLWRGEYMCLAQLLAKQEGFWVAQFRELGLLKKDVIAVCELIAAQLPNEELPEQISNYTPQLLIPIEHDYVSVSPVVSHAVLAALQMATRSGLLKRGIIEHNRPANVGELPSALGGRVNVLKYFAKTYSTTTVNDYYIKDEQRLFNVTALNNKTVSEALLVLSKSKPFNTLRQKRLARIAATKGLRTTFYGWLEKLIKLAQNEAINDSVPEPVKLLAMQEYKQVELVLSQELNRQMSGLKYLKRYAYHPDLMPILKAQLSYVLSHQNNQEGELEAEHIKYVHCQNLRVYNAEAMSNPYLMGMPSITALEGLGHQFQRKLQKLVGSGVHVIGVAVYIRSYQLHNNKQLPEPNKLKKDGQKREPQRSAIMQIPKCDICFDLVFRVQTNNQAVANDLSENLLKAAFPSRYAGGTLHPPSLYDQVDWCQVYAKPQALFERISSLPKGGSWLYPTSVEVSSFEELAEQLTLRPTMRPAALGYLALEEPKPRQGSITQLHCYVEPVIGLLECVDAVTVRLSGARHFLNHGFWAMHCKKASMLMKKAKFEYD >NZ_CP023398.1|WP_100913722.1|2094742_2095918_-|TniQ-family-protein MQFLSQTTPYPDESIESYLLRLSQENGYLGFADMADILWDWLVTQDHELEGAFPNELQSVDVYKVSQSSSFRVRALRLVAQLAGAEASELLNLCWLRSNTQFGANTAVSRGGLLVPRQLLRKSGIPVCAECLKQDAHIPYLWHLRVYKACHKHNQQLTRICGQCDAEIDYRASEAFLECECGAAIKPAAQANQADLKLANVHAGNAVAKLGGLLAWFSRWRAISYDDDEFSELFVSYFSNWPTGFEDELSNTAALAKVKQLRPFNHTPFNDVFGSLLKDSNLACAGSQGYNMVQLAIIAFLTKLVEQNPKKKHPNLGDLLLSMLDAAVILGTSTEQVFRLYEEGFLAAAQPPKKNTFLKPSNSVFYLRQVIELQQSFATNMSNNQQFVPPW >NZ_CP023398.1|WP_100913723.1|2095957_2096263_-|helix-turn-helix-transcriptional-regulator MESPTPLRLKQARKAAGYTQQRIGIALGMEPNTASARMNQYEKGKHSPDYTTMKRIADELGVPVAFFYCEEEVLAELILELGKLNKSELEQKLKDLKSSEK >NZ_CP023398.1|WP_100913724.1|2096843_2097170_-|hypothetical-protein MSNKILTPDGWVILKITTAKSTFYKVFASWQENDRWRLSSGSEAPPEISKCGKYWIWPQASGSCYHLLVNEENGYTYYTGNKLSEILACSNNKGGLIERVKLSSILNE >NZ_CP023398.1|WP_100913725.1|2098679_2099723_-|IS110-family-transposase MKFYTTFHKYYCGIDLHARILYVCILDERGNKLVHQKIKADRHELFKLISPYLEDIVVGVECMHCWYWVSDLCAEHDIDFVLGHALYMKAIHGGKAKNDKIDSYKIASLLRGGNFPLAYNYPAQMRSTRDLLRRRTRLVQHGAQLKAHIVNTNSQYNYPPLELHMKNKCVRQEFQTRFDDQAVQRNVNFDIAILDAYSEELKKLEYYLESNAKKHQPNYYAQLRTIPGVGLILAMTILYEIGDINRFESVQTFASYCRLVKCKAESAGKTYGTSGNKIGNGHLKWAFSEAAVLYLRGNDKARNYLNKLQKRMSKAKALSVLAHKLGRCVYFMLKNKTVFDDERFLKS >NZ_CP023398.1|WP_100913726.1|2099929_2101417_+|ATP-binding-protein MSLLTFKKLFGREAAEDEIPERLAEYFVESREFEDFDDVEESVLFVRGKKGTGKSAVLCFYAHKLEKENEFVLTGKGTELWHHFKPQENNPHSYVNEWQRVICTYVSMQLGKKIGIALTDDEMALVELSELEGIKRKNIVRSLLDRFKLKLGGVESSDFLPNSNHYTLKRYIEEGEQNIWLFIDDIDQTFLNDSDSRYRLAAFFTACRYLASSIPSLKIRAAVRDDVWWSIHREDEALDKVPQFIRTIKWDSNDTVRILAKRIEVFVSRFPKAAAFTTSKEMKKPFYAVFPYNYSWRKGQVPSRFIIHMFALGRPRWAIQLGRQCLETASRFSTVPDKIGIQLLLQTMPSYSGIKLDDLISEYDHECPQLREIIFSFYGGGSRWTTNALKAVIGSKVISSLKISIDDNSAPKPKQVVQFLYRIGFISAFVPALKAEDEEYFRFEEKPHLLSLELNEDDGFHWAIHQTFHSALNLSRDYKKNKEKPTKRPMSELPR >NZ_CP023398.1|WP_100913725.1|2101903_2102947_-|IS110-family-transposase MKFYTTFHKYYCGIDLHARILYVCILDERGNKLVHQKIKADRHELFKLISPYLEDIVVGVECMHCWYWVSDLCAEHDIDFVLGHALYMKAIHGGKAKNDKIDSYKIASLLRGGNFPLAYNYPAQMRSTRDLLRRRTRLVQHGAQLKAHIVNTNSQYNYPPLELHMKNKCVRQEFQTRFDDQAVQRNVNFDIAILDAYSEELKKLEYYLESNAKKHQPNYYAQLRTIPGVGLILAMTILYEIGDINRFESVQTFASYCRLVKCKAESAGKTYGTSGNKIGNGHLKWAFSEAAVLYLRGNDKARNYLNKLQKRMSKAKALSVLAHKLGRCVYFMLKNKTVFDDERFLKS >NZ_CP023398.1|WP_100913728.1|2104105_2105338_-|hypothetical-protein MKYKFTLLATLLLSTSTLIHAVELKNKENLIKTPPEVLNQDSIWGFNSENRELIDIYTSDGDKITVVNVGGLAHLGDMILGKVEDLQTYGLKIHKEEDDNNLKSDWIGTQAHTVYPSTGKLWRNGVVPYVFDSSLGPNAKRAVISAIQQWNTKTNLKFVPRTNEYDYISVSGGRSCSSWVGRVGGKQKVMLSELCGLHAALHELGHAVGFFHEHTRPDRDSYVQINYQNIVNGMQYNFQQKNISDSTGRGEYDYYSIMHYPANAFTKNGYNTITPLVPGVDTSYMGFGNYLSENDINAANILYSNYIPPTNVENYSGQLSGNGDSDIQINGNYFEFEGGNLRATLKGPSSANFNLYLYKWNSLIGNWALTSSSTNTSSNETIYGNVDQGYYYFLVYSLSGNGRYELDIEK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1809850 : 1833891
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP023398|1809850:1833891|DBSCAN-SWA TATGGATGGTAGACCTTTAGTAAATGACGAGTTTTGGGGTGATTTAATTGATAAATCATCGGGTAGAAATAGCCAGTTTAAAGAACGTGATCAGCTGCTCTTAGGCTTGGCTTGTTTAGCTGGTTTTCGTGAGATTGAATTAACACTATGCACCATTGATTTATTCATAGCGCCCAATGGTGAGCTAAATGAATTAATCGTTATGCCAGAGTCGATTGCTTATGATAATTGCGAACGGCCAATACCATTAGCACACCCTGACTTACAAAACCTATTTCAAACTTATATAAAGTGGTTGCTAAATAATGGACTTAATACGCATCCAGGCGAAAGTTATTTAGGCTTAAATCCTAATGCACAATTATTTGTTGATGACAGTTATAAGCCATACACAGTGCAAAAGCGCGGTAACGATACACTTAGCCCTAATTCACTTAACAAACACCTTGATAGCCTAATTAAACGCGCATCATTGTGGGATTGCGGGATTAGACGTAAATCGTTTGTACGTACTTTTATTATTAATGGTTATCGTGCGGGTATGAGTACCAGTGATTTAATTGTAATTAGTGGGCTGGGTCACGATACGATAGAGAAAACATTAACGATGGACTATGAACAATATAGCCCAATCGCTGAATGGTTTGTTAAACGCCGAGAACAGAAGGTTAAACACCTAGAGTCGATGAAAAAACGGCGTAGGTTTATGCTTTAGTTTTCAGTTGTTCTTTTCTTGATTGGCGCTAGGTCCACACCTAGCGCATCCATGATCCGCATTACAAATGATGCTTTGCACTGCCCACCATTCACTTTCTTTCTAAGCGTTGATACGGCAACAGTATGCCCTTGCGCTTCGAGCCTTTTAGCCAGCTCAGTAAAATCAACATTCACTAAGCCCATAGCGCCCTTAATCAGCGCGGTAGCTTCATCACCATAAATTTCATCAAGCTCTATTTCACGCTCAGAACGGCCCGACCTTACAGGCGCTTTTTGCTCTGCCACTTGGCATTCTCCATAGATGGATCACATTTAATCAGTGAATTGATTGTAAACGCTCCATTTAATGATTGCAAACACTCCATTAATGGAGTATGTTTATCCATATAGTGATTAATAAAGGATTTCGTTATGGCTAAGTATTTATTAACAAGCTCATTCGGCAATGATTCAGTAGCTTTAATACAACTGGCTTTAAACAAAGGCTTAGATTTTGAAGTTGTCTATAACGACACTGGTTGGGCGCGTAAAGATTGGCCTAAGCGTGTCGCGTTATTTTCATCTTGGTTGATGGAAAAAGGCATCACTTTGCATATAACGAAATCAATTGGCATGGCTGAACTTGTTAAGAAAAAGAAAGGCTGGCCCATGCCAGCATCTAAGATGCAGTTTTGCACCCAAGAGTTAAAAGAAAAGCCAACCGAAGAACTGTTAAATAAAATCGACCCTGATTGCGAACTAATCATAGTAACCGGTCGCCGCAGAGAAGAATCTCAAAACCGCGCAGACCTACCACTTTGGCAACATGAAAGCCCTAAACATGGTGGTCGTGATGTATGGAACCCGCTTATTAATCACGATGAAAAGCAGCGTGATGAACTTATCAAACAAACTGGATTTGAAGTGCTGCCGCATAGCTCAATGGAGTGCTACCCCTGCGTGTGTGCAAACAAGGATGACTTAGCACAGTTATTAGAAACACCAGACCGTATTGATGAAATCGAAAGACTCGAAATCGAAATGGGCTTTACTCGTAACCAAAAGCCAAGAGTGATGTTCCGACCGTATCGTGTTGGTAACGGTGTTGGCATTCGCCAAGCGGTGCTTTGGGGTGCTGGTGCAAGGGGTTATAAGTCAGGCTTTATCCCCAATGAATACAAAATAGCAGGTGAGCAATGTTTAATGTTTGAAGGCATTAGTGACATTGCTTATGAGATTAACACCCGAGAGGGACGCGAATTTGCCAGACAATGTGATGGCGGATTCTGTGGTAACTAACCAGATAGGAAATTTTGAACATGGAATACTTTGATAAAGAAGCGGTGTATGACGAAAAAATAGCACCACTCATGAAAGAGATTATTGCGGTTTGCAAAGAACATCAAATACCCGCTCTAGCCAGTTTTACATTCAGAAACGATGAAGAAGACGGAGTTGGTACTTGTGACACATTACTCTCTCACTCAGATGATCGTAATAACCCTAAATATCCCGCTGCTCTCAACGAAATACGTAAATCAGATGGCTTTATAACAGCGGTAACTATTGCAAAACGCGTATAGGAGATTTTGATATGTCAGAAGAAACTAATAAAGCTGTGTCTTTTGAAAGAGTGCGAAGCTCGGTACCAAAAAACTCAGTAGGCATACTGGCTTATTCAACAGTTCTATTTCACAGCCGTTTAGGGGAAATCGCTATATCAGGTTCTGATAGAGATACTGTTTGCACAGCTTTTAACAGACTAAAAACAAAGTCTACTACTTCATGTGACCCTGATAGATTGCAAGAAGTGTGTTTTTTCAAACAAGATGACTTAGAAGATAACAATGAGAATTTTGAGCAGTTGAATAGTACACCTTGGCAACCTATGGAAACAGCTCCTAAAAATGGTGATGAAGTTATTTTGTATGTAGAAAAAAGAGCTGGCATACCAGGCGGTTTTTTAGTTGGTCATTATATGGGGGGTGGTCATTGCATAGAGGATCATCCACCTATAGATGCAGGTTGGTATTTTTGGAACGGTTGTCAATTTGACCTTGCATCTAAACCGTTGGCGTGGATGCCTCTACCAAAGTTACCTACAGGCTTTAAATACTAACACTTTTTAAATGGGAAATTTTTGAAATGAATCAACAGGTACTTGACCCTTGCTGTGGCAGCAAAATGTTCTGGTTTGACAATGATCACCCTGATGTTTTGTTTGGTGATATTCGCGATGAAAGCCACATTTTATGTGATGGTCGCAAATTAGAGATTAAACCTGACCAAGTATTAGATTTCCGCAATCTGCCTTTTGATGATGAAAGCTTTAAATTGGTTGTTTTTGACCCTCCGCATTTAGTTCGAGCTGGCGAAAATGGATGGCAAAAAAAGAAATACGGAAAGCTTGGCCAAGAGTGGAAAGAAGATATTAAAAACGGTTTTAAAGAATGTTTCCGTGTGTTGGCCACAGGTGGTGTTTTGATTTTCAAATGGAATGAAACACAGATTAAAACGAGTGAAATTTTAAAGCTCACAGACGTAAAACCACTCTTTGGCCACGTGTCAGGAAAACGAGCAAATACACACTGGGTTTGCTTTATGAAAACTAACAATTCTTAAATAGGAAATTTTATGGACGTGTTAATCGCAAAAACCGCGAGAAGCTTTGAAGAAATGACACCGGTTTTTGAAGCTGAAAAAGTAAATGAAGTATTTTCTCAGCTGCAAAAGCAAATCGCAGATCTTGAAAAGGCGGTTGTTGAAGAAATTGAAAACCGAGATAAATGGGAAGAAAAGGCAACCTATTTAGCAGAATGTGTTGGTGTTTATTTTGATGAAAGTGTTGGTGAGCATTCAAGCGCAAATTGCCCAATAGCAAACGCTCACGACCTACTAAACCAGATATAGGAATTTTTGAACATGTCAGAGAAGCAAGCATTAGAGAGCATGAAAGGAAAAGTTATTGAATCAATCCAGCATCATGAAGTAATCGGTTCGATTCAATTTAATTTTACAGATGGAAGCACCGTGCAGGTTTTTGCTCAAGAAAAAGCAAGCCAGCAAATAATGATTATTGAACCTAACACTAAATAAAAGGTGAATCATGGAAAACCAACACAAAAAAATTAAAGGCTATCGCGACCTGTCACAAGCCGAGATTGATGCAATGAATGAAGCTAAAGCGCTTGCTGAAAATGTAGGCTCTTTAGTTGAAAAACTTCAAGGCCAAGACGGGTTAGATCAGCGTTGGATTGCAACTGCTAAAACAGATTTGCAGAAAGGTTTTATGTCTCTAATTCGTGGCATAGCACAACCAACAACTTTTTAAGTTTAGTTAACCGCTGCGCTTGTTAATCGAGCGCATTATTAAACAAATTTTTAGATTGGGAATTTTTATGTCTATTTTTCAATGCGAAAACTGCGGTTGTGCTGAAAATACAGCTTGCGCCAATCAGGGGTTTAAAGGAATAAAGCACCTTTTTGATTGGTCATACAACCCAAGTTTAGAGGGAAAGTTACTTTGTCGAGCTTGTGGCCCTACCAAATTTAGCGATGGTTCAAAAATGAAATCAGACCATTGGGGTGAATATGGTATTTGGCACAATAGATTTGAAAGAAGATATTTGCCGCATGGTGAGTTCAAGACCAATAACCAAGGAAATTTGGAGCATATTGAATCAGGTTTAATTGGTAATGAAGCATACAAGAAATTTGCCCGTTCTGAACCTTACCCACTTAAAGAAAACGTATAGGAAATTTTATGGCTAACCGATACGGCTTAGATGCCAGCTATTTCACTAGCAAACTTGAGCAATTGATAAGAGATATTGATAACTACACGCCTGATGAATTTGCGCGTGTTATGGCTCGAATGTCTAAAACTGCAGATAAGAATGTAGTTTTAGAGCCTGAGTTTCAAAGCAACGAATCTTGGGTGCATGTAGGTGAAAAGTTGCCAACACCTGCAGAGGGAATTGAATATTTGTGCGTTGATTATGACGGTGTTCGTTGGTTATGTGAATACGATTACAACCAAGATGATGAAGCTGTTTTTACTGTTTTGGGTTCGGCGGCACAACCCAAAATAGGCAGCATTGTTTACTGGTCTGAAATACAAGAACACCCACCAATTAACACTAATTCTTAAATAGGAAATTTTGAGCATGTCAGATTACGAGTATCAACACGCAATACATGGAGAAAGCAAGGTCAAACAGAAAGCTTTTGTAGATATGTTTAAGCCCGTTATCGCGTGGGCCATGAATAACAACAATCAAATTTTTGAGCAAATAGAGACTAGTTTTATCTTTGGCAAAGTGCATCGAAAAAATGACCCAAGCTTTGACGTTACTGTATTTTTTCGGGGCGGTGATAATGAGAGCTTTACTTTTTATGCATGGCGCGATGAGAATAAATTGAAAGCACAAAAAGAAATTCTTAAGAAGATTTTGAAAGCGAAAGATCTTAATGAATATAAATCACTATCAGATCGGTTTAACAACCGTGATTTGCTATAGGAAATTTTCGATGCCGAGCTTTTTGGACAATGCACCTGAGTTTATGAAGAAGATCTTTGAGAAAGACTTTCGCTTTTGGTGCTGTTCAAACAAAGAACATCGAGAAGTGGAATGGATTGAGTCAAAAACGCAAGTGCAGGCTCAATGCAAAGAATGCGGTGAGAAATCACCGATATTTAACAAGAACTCTTAAATAGGAAATTTTATGTCACAACAACAAGCTGTAGAAATAACGGCAAAACTTTATCAGTGTCGTGAACAACAAATATTCTTAGGTGGTGAAGATGGCTTTATTCAAATGTTTGATAAGTGGAAGCCAGTTGTTGAGGCTACGATGAATAAACATCAATGCTCTGAACTGCCAGCGTTAATTGAGCTTTTAAAATTGGCTGAATCGAAACCTGACGGTGGAATGATGATGCATGTACTTAATGCTGTTGTTTGCGAAATGTTAGAGCCTACAGTTACGGCTCATTAATTAAGTGGGAGTGGTTGAGGTAATTATGAACTACAGGCTTTTAGAACATGATGAAGTCATTCAGTCAGGCGATGAATTTTTAGAAGATGATGCGAAGACTTGGACAGAGATTACAGATAAAAGTCCTTGTTCTTGGGCTATAGGGATGAAGTGGAAAGGGCGTGGTTTAAAACCAATGCGACGTAAAGTACAAAATAGCAAGTAGCGATATAGGAAATCATTAAGAAGGAACTTAACCAATGAAAACAACACAACAAAAAACATGGCAAAGGCGCAAAGTTATTATCACGCCAAGTTTTGCCAAGGTAAACTTTACTGATTATTTAACCGCATGTTTTGAGCTGGAACAGCAAGTGTTTAAAGCCATGCGTACTAAATGATTATTTTTCTAAATAATAATTTCTGAAAACGGTAAATCTGGCGCATTACCAATAATGCGATCGCCTTCTAGATAGGCTTTGCCAAGTGCGAGGTTTTGCCCCACTGCCACACTGGTTGAACCATCGTTATGTGTTACAATGGCCGAACCATCACTACGCACCTTTGTTATCGTCACAATGGTGCGGCTGTAGCCGATTAATGTTTTTAAGCGCTGTAATTGATCAGACATTTTTCACCAATTTAACCGTTTGAGTAACATCAATGGCGCCATTACTATCCACTGACGCATTAATGGTTAAACTATCGCACGTGGCTTTGTATAATTCGCCTGGGTATTGAATACCAACCAATGATCCGAGCTTAATCGGTGGCAAGTCGGCTTTAATATCCGTTACCACTGTACTTTCTTCTTTATTGCCTGCGTTGCCAAGTTCGCATTCGCCACGTTGCCGTGCTGCTTGGTTATCGGTTATAAGCGAATCAACCACATCACTTGCAGTCAGATCCCCTGCCGTATTATCACGCCTAATTTTACACGCCACACCATTTTGTTGGCCTGATACAAACACCGCATTAGCCGCAAGCTTATTTACAAACTGGCTGTTGTGGCTATCTATGTGCGATTCATTCAAAATTATGTCGCAGGTAGCCGCATCAACACGCCAAGGTTGCACCGGCCAAAACGGCACAACATTAATCGTTTTGGTACTTTCGCCCACATCAAGCATTGCACCAATAGCTTTTGCAACATCATTTAGCGCACTGGCAGATGCTTTATCTACATAGCTAAAACTTTCCGCTGGCACCGCATAATCCACCATAGAATATTGCAATAACCAACCCGTGTTTTGCAATATATCACTGCAAATACCAGCGTGCGTTCTGTCAACCGTGTTTGTATAGTTGGTTGGGCGCACATAGGGCTGTGAAAGCTCTGCAATCCTTGAGCGGCCACTACAACGATATTCGCGGTTATTAAAGCTGCGTGAAATACCCGGTGTTTCACAGTAAGCATAAAACACATAACCGTTCACCGTAATCTCAAGTAATTGATTCAGAGCATAATCAAAATCAAGTTTTGTTTGAAACGTAATATCTGTCGTTGCAGCAAACTGGCCGCGCGATTTTGACCACGCAAAATTGCTAATATAAATTTCGCGGTTATCGCTTACGCGCTTGCATGTAATCGTTGGTTGCATAATGTAGGTATTCTGAATTTGTGGTTCGATAGGTACTTTGCGATCAATTGTCGGTATATCGTCAAACGCTCTGATTAAACCAACATCTGACCAATAACACACTTTAGCAATGTCATTTAATGTAATAGATAACGGGCTTTGTTTATTACTTGCAGGCCTATCAAGTGACAAATAAATAACACCCGGTGCAGGGTGTTTTGTTGGCACACAAATATACTCAGCATCGCGTGGGCCAAAATAAAACGTAAACTCATTTTGCGTTTGCGTTGATTGGCTCGCCCAATCTATTTTGTAATCTGTTTCTACTTTAGATAACTGTAAATAAATTAGCTCACGCTCTGTTTGTGTGGCTGCAAGTAAACTGTATTTAGTCAACCACTCACGCTTTACAGCGTTTTGCTGCTGCCATTTAAGCCCGCTTTCAACTACTAGGTTATCTTGATTACGGTGTGCTAATGATACTTCGTGATTTAACGATTGCTGTATTTGACTGCTAAACACGAATTGGTGCGTTATAGATTGCTGTAGCGAGTAAGTGCACACATTTTCAATCATTAACCAGTCTTTGTTGTTTAACTCATCAGTCGGATCAGTTGGGTTGCCGTCATTATCACCGCCACCACTGCTACTGCCAAAATTAATTAGAAGCGGTGATATTTTGGCAGTAAAATTACGGCTAAAATTAATTACTAATTGCGTCATACAGGTGGCGTGTAGTTTTTCATATCAATCACAGCGGCTTTTACATTATCAACAAGCTCACCGTCTTTTTCAGCGTCATCGTCTAGCAAAATGCACATCAAATTGTTATTTACGGTGTATTCAAATGGCACAATGATCGCTAATTTGTTATTTGCTGGCACTAAGCGCTGCGTAATGACTTTGCCGCGTCTATCAATTACAAGCAAGCGTTCACCTAGTGCGTCAAGCGTGGTTTCAACCACACCAATATTAAGCGGTGAATTGGTCAATAACACAGGGTTATAACTACTAATTTCAAAGTGAATCATTCCCAAGTCTCCAACCCAATCCACACACAACTCGTGTCCGAATACCCATCAGGCATTTGAAAGTAAGTTTGAGTATGAATTGGCTTAATTACTGGAAACGCCTCATTGCGCCAGCCCGAAACGTCACTGACTTTTAAACCTGGTACTCTGCCACGCAATCTAGGTCTAGTTGGGTCATTATATGAGCTAACACTTCCCTGTAGCGAATTTCCAAAAATTATAAACATATCCGCCAACATAGTTACTGTAGGTGAAACATTGGCGTGCGCTGATGATACTGCAAAACCACCATTACCCATCGGTGATATCAACGAGCTTGAAAGTTTGTCATTTTCGTTATTAATTGGGTATGTAACTAGGATGTCAATCGGTGTTCCTTGAGTTAGCGTGTATGGCATTTCCACTGAGTAGCTTGTATTTGAGCCATTCCCTCTACGCCCTATTTGAATAAACGTATTTGGATCATTAGGTATGAATGAATACATATCACCGCAGAAAAAAATCCAAGAATATTCATCGGATCGCCCCCAGTAATTAGTAGATATGTTATAGGGCTTTATAAACCAGAACGCTTTAGCTGTACCAATAATCACCCAAGGTGCAGATGACCCACCTCTTATTCGCACGTCATAATAATAACTCGATTTAGCAAAAACTGTTTTTGATGTGTATTCTTGATGTGATTGCACTCGTGAGTTCGTGCCTGCACTATCATTAACTGCTGAAAACGAGGTTACACCACCGCTTCCACCAAGCGCCATGTTATTTCTAACCGCGAGGTGTGGCGCAGCAGTATTAATTTCATCTTCCTCAACACTCCAACCGAGTGGTGACTTTGTGCCATAACCATCCACTAAGCAGGCTTTTACTACTTTCATAATATCGCTGGGTTTTCGGCTAGCTAGGGATGGTGCACCTACATCATCCCAACGATAAACGGTTACTGCGTCTGTCATTATGCATATCCTTTAAAACTTAACGTGGTTGAATCAAACTCAATATTTGAATGACCCGGAGAAATACAACGCGATACCATTATCGGGTTGCCTGCAGCAGTGGTTTCGAACAATAAACACTCACCTGGTTGCAAGCCAGAACCTAACGCTTCTTTGCGAATTATCATAAATGGCTGTTGGGTGATGGGGTTAATTGGTGCAAAATCATTCAGCACATCACCGCTGCCAATCTCACCAACGGATTCACCAAACAATACAAATGTGTTATTTGTACCCCACACTAGTGCCCACCGCTGATTAATTGCGCCTAAATTATTCATCTCGATTGGATATTGCTGCAGATTGAGTGAGCTTGATCCCGCAGCACTTTTTTGGCCAAAGTTATTAGACCATGCACTTTGGGTGCGTTCATCAAGTGCTAGCGCTTGTAAATCGCCAAGCATAGTAACGCTTGATACTACCGCGCCTTGAGGATAATTACGCAGTACAGGCGATAACAAACGAAGCTTGTTAGGCTCAACTTGGGTAACTAAACGTAGTTCAAATTGCACTGCTGTAATAATAAATGGCGCAGTAAAACCGCTAATGCCTGTATTAATCGTTATAACACCGCTTGGCTTGTCGTAGCTGTAATTCGCATCATTACTGTCGTATAGACTTGCACCTGTGCTATCGGTAATATCCACATAATCCGCATCAACAATACTGTTAATTGTCTGCCCGTTTGTGAGACTTGTTTTCTCTGTGCGATTGCGGTTTTGAATACAAACCAATCGGTTTTCATTAAATATATAAACATTGCCACCATTGGGTAGTGCTGATGCATTAATACCAAGCCAAGGCGCATCAATGGCGGTTTTTTCAATCACTGTTAAGTCATAGGCAATGTTAGCTACATCAACACTTGCAGTTAACGTTACGTAGCCATTTTCGAGAGTACCATTTATTTCTGCGTGACTAATAACGCCGCTCGCATTTGCACCAACACTAAACGTTGTACCATTGGCCCGCGTAATCACCAAACGCAATGAATCAACGCGGTACTCATCTGATTCAACACTAAACGAAATATTTGTTGGGTATGCTTGGCTTTCGGTTGTGGCAACAAGCACAATACAACCTAAATCATCATTAACTGTGCCATTTAAATCGACAGCGTTACTTGGTGTTATTTCACCTGTATCGTGATTTACTTGAGCAAACACATTGCCTGTATTGTCGATAAATATGCCATTCTTATCGGTATATACAGCACCATCCTCACCCGCTTTTTTTACACGAAATGTGTTCGGTTGTATTTTTTCACCTGGCGCAAGCGTTAGCGTTGGTGTTGTGGCAAAGGAGTGCTTACGATAGTCTTTATTTGAGTGATAAGTTACTTCAAGCGTTGTGCCCGCTCGCATTGGGTAGACAGCGGAGAGCGTAAACACAAACTGATTATTTTGATATTTTACTTCATCAAAGTGCGCGTTATTTTCATACGGTTTTAGAGTCCGCCAACCATTCACAGATAAATATTTAACTGTAAAGTCTGGTAAGTACTCAAAATAGTCATCAACATTAATATTAAAAATCGCAGCATTTGACGAACTTGGCGCAACAAATGTTTTTACACGGGTTTTATTGTATGTGGTGCTACTGTTGGGCGAAAATGCATATGCTTGAACGTTTAGTTTATCGTTAGCAGTTGTAACTTGCGGCAGTACTTGTGTTTGAGTCGTTTCGACTATTAAGTCTAATGCCCCAGATTCTACGGCATTTGTTAACACACTCATACCATGATATTTAGCCGTGCTTGAGGTAATTAGATTACGCATGGTAGCCATTTTAGTGTCATCCGCAAGCGAGGGCGATTCAATTAGTAATGTATTAACAAGTGGATCTTGTGGCTTTTCGCTTAGATACACGTGGCTATCAGCTAGTCGTGTTGCATCGTGTGTTGCAACACTAGGAAAAAGCTTTACTACCTCAAACGCTGACTGTGCATAATCCACTTCGGTAATTGAGCGAAACACATCGTTTAGTTTACCGTTCACCACTTCGTTATTGGTGCGGTGGCCGCCAGCATCAACGTGCTGTTCTAAGCTTTCGCTTTTAAATATTTTTAAATGATTACGTTGCATTAAACGGTTACCAAAAAGAGTTGAATGTTTTTGTAATAGTCAGGCTTTACATCAGAATACGCGCGCTGTGGCTCTGCTATTGCAGCCCCTTTTAAGTAATTCCAGCGGCATGTGTAAACCACGCTGTGTATCGTCACATCAAAGGTGTCTAAGTTGGTTTTTGCGTGGGCTTCTATCGCTTCAAAAATGGCGCTTGGCTCAAGCTCTGCAAAGAGCACTATCTCAAAGTCACTTAATGGCGTTAACAGTACTTTGCCAACACCGTTAAGTGCGCGGCGTTCTTTGGCTGCAATGTTGATAGTGGCGCCTTGGTTTCGCCATACAAAGTTATCAAGCGAAATGGTATCAAGTTGGATCATTGTGTTTGGCTTAACCTTTCAAGTTCATCAATTAAGTTAGTGTCGGTGGTAAACGCACTGCCCTTACGCCCTGTTGGCATCACTAGTTCAAGCCTCACTGCTTTACTGCTTGCAAGGGGTGACTTTTCGAGAACATTAATTAATTTATTAAGTTGTACAGATTGCTGAGTGGTGGCAGGTGTTTGCTTAGTCTTACCGCCGCCATTATTGCCACCACCGTCATTACCACCTGAACTTGAACCACCATTGCGTGATGACGGTCCTGCACGATATGGCGAGCTTGTCTCACTTTGCGATTTCATTGCCTTGATCATGTCGCTCAATTCTTTTTCTTGCGCTTTACTTAAAAAGTTCAAGTAGTTGTTTATTTCATTGAGTAATTGATTAAGTGCTGACGTATTGCCCGAGGCCGCATTAATACGGCTAACGTATTTTTCAAATTGCGCTTGCGCCGCTGCCGCTGTTTCTTCGTCACGTTTAGCACGTTCGGCAACTTTCGGATCAAAGGTGGTAAAGGTGCGATTACCCGCATCATTCACACCCAAACCGCTTTGCTTGTTATAATTAAACATCGATGAGGCAGCACGGCCAGCATTATTCGCCACTTGATTTAAGCTAGACGCTTTTTTACTCATGGCTTTTGCTGATTCTAAATCAGCTTTTGTTGCTTCTTTTGTAGCTTGTGCACCTGCGCGTGTTGCTTGGGTGGCTGCGCGCTGCTTGTTAGCAAAATCACCCAACAAATCGTTTACTACGCTCATTACACTGCTAAGCTCTCGCTTACGCTCTTTGTATTCATACGAGCTAATAATGCCCGATTCATACTGCCGATTAAGCTCATCCATTTTGCGCTCATACGCACTTTGTATGTTTATGAGCTCATAATAGTTAGCCTTTTCCGCACGCTTTACGTTGTTTAAACTTTCAGTTTGTGTTTTAAGTAAACGCTGTGCATCCGTTAACCTTGCATAAGCCGTGTTTTTATCATTAAGCGATGCGTTTGCATCATTCATGATGGCTTCAAACTCAGCCACTTTAGCCTTGGTTTCATCTACCTGCTGCTGAAAGCGTTTTACTGCATCTGAGTTTTCATCCAGTGCTGGATTAAGCGTTTGTGATTTTTTAGTTAATGACGCAAGTGCATCACTTAAACCCAGCGCTGCGGCTTCGGCTTTAATGGCAGGATCAAGCCCTTCACGTGTGGCATCGGCCACTTTAACGGCCGCTTGCGCCCACTTTGTATAGGCTTGCTCTACTTCATAAATACCAACTTCGCCTTTTTTGTATTGCTCAAGCAAGGTGTCAAACAGTAATTTTGCTTGCTCTGCCGCATCACGTAAACTTTGCGTGGTGACAATACCGGCTTTACTCATCGCAAGGCCAAGTTCGTCCGCCGCTTTTTTACCTGCTTCGGCGGACTCGCCAAAGGCTGCTTTTTGTTCGGCTTGTGTTTCTTTAACTTTGGCTGTTACATCAGTGTGGGTTTGTTTTACTTTTTCACCTGTTTGCTGCGTGAGTTGGTCCCACGCATTTGACAGGTCTTGACCATCTTGCAAAATGCCTTGATAAAAACCATCTGACACCGCACTCAGCGCATTCGCAACATTTTGTGCTTTTTGGCTCCACGTATCTGCACCAAACAAATCAAGTGTTTGCGCAAAAACTGTCATAACGCCTTGTAAATATTGCGTCACCACAAAGCCAATGGTTTTTACACCAGCGGTAAAGCCGTTAATCAGTATGCGAAAACCACCAACAATGGTTTCAACACCCGTAAAAAACGCATTTACATTGTCAATAAACGCTTTTACCGCTGCGCCACCATCGCGCACAATCGCGGTAAAAAATTCACTGATTTTTTTAGCCGCTGATTCAAGTTGGCCGTTGTTGGCCATTTTTTCAAACTGCGCATTGAGCTCTTTTAAAAAATCAACCGCAACTTGATAAACACCAGAGTCGGCAATGGTAATTTTAAATTCTTGCCACTTGTTTGATAGCACCGCCATCTGGCCTGATAATCGCTCTAGTGATTTTGACGCTTGACCACTGGCGGCATTACCAAGCTCATTAAGCAAGGCTTTAATAGTGTCACGGCCAAGCTCACCCGCAGATGACATTGCTTGTAATTCAACCGTGTTTTTGCCGGTTACTTTGGCAAGTAAATCCCACACCGGCACACCGCGTTCAACCAGCTGCAAAATTTCTTCACCTTGCAGCTTTTGTTTCGCCCATGCTTGGCCTACTGCCAAAATAATGCCGTTTAGCTTTTCTTGCGAACCACCCAATTTTGCATTGTAATCGACCAAGGCTTGCATTGAGCCATTCATCGGATCAATACCAAAGGTTTTTAACGTACTAAACGATTCGCGCACCGCGTTAATTTGCGTACCCGTGTCATCAGCAAAAGTGCGCATCCACTCGGTTGCTTGCTGGCCTGCTTCAAAGCTGCCCATTAGCGCGGTCATTTGCGCTTCAAAGGCTTTGGCTTTATCGCCTGCGTTTAATACACCAAATAAACTTTCTTTTAGCGTATCAATACCAACATACGCACCCGCCATAGCAATCAATGATTTAGTTGCAGCACCAATGCTGCCGCCAAACTCTTTTGCTTCTTTGCCTGCTTCGTTTAATAGCTTTTTATGTTTTGCGAGCTTGTTATTATTTTGCGCAATCGCTTGTTGCGCGGCCACTTGGCGTTGTGATAGCTCTTTTGATGATTTACTTAGATTATTAAGATCAACACCTGCACTTTCTAGGCTTTTCTCTAAACTCGCTATTTCCGCTTTGTTTTTATTAAATGCGGTACCAAGTTGATTAACTTCAGTGCGTGTGCCTTTTACTTTTAAGTTGTAGTCGGCATTGGTGGCAACCGCTTTGTTATATTCAACTTCAAGCTGATTGAGCGTTTGGCGCTTTTCGATTAACGCGCGCGACTCTTCTTTTGAAACCGTTGTTGCTGTTTTTTGTGCAGCTTCAAGCGCTGACACTTCACGCTTAATGGCTTGATAACTGGTTTCTGCGTCTTTTAGCGCGGCTTTAGATTGCTTTTGTTCAGCAACTAACGCAACCAACGATTTACTGGCCGCTAGGTAAGACTTTTCAACAGCATCAATGCTTGTTGTAAGTTTGTCATATTGCTCGGTTTGGCTTTTTACACCTTCAATGCTCTTTAGTTCGTCATTTAGCGCTTTACTGGCGTTTTCTAAAGCTTCCGCTTGTTTTGCTGCTTTATCGGCTGCCGGAGAAAATAAATCGCGTCCTTTGATGATAAATTCAACAACGGTATTTTTAAATGACATACTAAAATCCATTAAAAAAGCCCACTTTTAGTGGGCTTTGGTTGTTACAGTTTGGAGCAATTACGCGGCGTTGACGATATCCATGCAATAGTCTTTATCTTTACCTGTGGCTACCGTGCACACACCTTCAAATTCAGGTGCCAGGTATTCGTTTTGCAGCAAGTTAATATCAGAGCTTGCAGCAAGTTGTGTCAATGGAATTTCTAAGTGAACCGGTGCGCCCGTATCGATGTTTTCACCATCCAAAATAATGCGCCAGTTGACTTCTTGCGTTTCACCACCAGTTAAGCGCTTACCATCAAGCGCAAGTGCGTTACCAGATACGAGCACTTCACCACCGGCATCCATTTCACCACCGGGTAAGGCGCGGATAAGGCCCGCCGCATATTTAATTTCGTAATCCACACCTTCGGTTAATGTGGCCGCAACCTTTTTAACAACAACCCCGGTTTGCAGTTGTTTTTTACCAATTTCAGACCAACCACCGGTTGCCGCCGCAAATAAGGTTACCGCTTTATCAGTAAGCGTTGCTGCCCCTTCTGAAAAATCACCGATTTCAGCACCAAGCGCTTCGGCAAGTACTTCAGACGGCACTTCATCAAGTTTTAACTTGATTGATGTTGGCTTAGGATCTTTTACACCACCAAGCGACTTACCAAATGTTGATTTGTTTTTACTGGTACGTGTAATTTGTTCAGACTCAGCTTTAAGCTCAAGCATCTCAACGTTCATTGGACCAAGTAACGGTGAGTTTACCGCGGCAACACCTTTGGTGTTTACACGCTGTATCATCACTTCACCGCGCAGAATAAAACCTTCAGCCATGATAGGTTCCTTTTAGTATTTAATGTTTAGAGTAAAAAGCGCCCTTGCCACACGTTCGTTTGACTCAGGCGCGATAATTTTGGCGGGCTCAGTTTCTTGAAACTGAATCAGCTTGCCACCCAATGCAGCATTCACATTTTTGCCTGCGTTGGTAATGGCTTGTTGTAGGCTTTCGCTTGTTGCTAACAAGTCATCTAGCGCATTAGTTGCTGTTGCATCTTTAGCTAACACCAGCAACAAGTTCACGCTTTTTGACTTTACCGTTTGCAAATCCACCGGTTGAATAAAAATATCACCGTCTTTTTGTGTGCCTGCTAACTGATAAAAACCCAGCGTGACATTGCCATAATTTGCAAAAAATGCTTTTACGGCATCCATTGCTTCAGTAATGTTTTGTGCTTTTTTCATAGTACTTTTTGGTACATTTTTAAAAGGTGTGTCATGACTGGATCAACTTGTTTATCTTTCACCACACCAAGCGCACCACCAACTGAGACACCATAACTGACTTCTGGTATTTCGCGGCCTTTCACACCTTGATTACGGTAACCCATCAGTAAATTGCCATTTAAGCCAGTAAATAAAAACGCACCGCGCCAATGCTGTGTTTTGCCACGCAGATAAGCACCCGTGTAACCAGCAATGACTTGCTTGTTTGGCGTGCGCTTGTTTTTACTTTTGCGATAAACTGGCCCCTCCATAAAATTGACCGAGCTTGATTTTCGCATACGTGATGCCACACGAAATTCAAGTTCTTTACTGTCAATTTTTAGGTTAATGCGCTCAGCAATGTAGTCGCGAAAGCGAAAGTTATAGCGATCATAAATCGCATCCACTAATTTATTGTGTGCAAACTCGGCCGTTTGCGTTACCGCCACTTCAAGCGCGTGCGCATTTTGCTCTTTGATTCTCACCAAATCACCTGCCAGTTGCTCGAACACCTTCATGCATCACCTATACGCTAACCACAATATATTTCACTGTTACTTCGTTTTGCTCATACAACTCTGTAAGCATGTAACGCTTAGCGCCTAAAGTGAACACGTGCCGCGGCTCAAAATCACCTTCATCAAGTAAAAACTCAGCAATGGTGATTGCTTCGCGGTCTTTGGTGCCGTCAAACTCGTTGTTAACCGTGCGATACGAGGTGTACACATCAACGTGTTTGCTTTCACTTGGCAAATGTGCAAACTCGCAAGGCGTTGCCACTTGTAAAAACACAGCGCGCTCCGCCTGCAAGCTGCTTGCAAGTTGCTTACTCAGCATCATCAACAATGGTAACGTGTTTGTTGAGCGATAGTGCAAGCGCACGTGGGATCACATTGCGGCCAGTTAAAATTTCAACGGTTTCACCTTCAAACGAGGTAATTAATTTGCCAGCCACATGTGAAATAACCGCAACGGTTTCTGATTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAACAGGGTCAGGCTCTAGTGCGGCCACTTCTGCTTCAAGTGCTTCTACTTGTTTTTTCAGCTTGGCTTGCGTGGTTTCTGGTTCAACTTCGCGGCCTAATTGCATCGCGAGTTCAGCCGCTTTTACTTGTAAGTCTTCAATCGACATAACGTGCTCCAACAAAAAAGCCTGCTAAACAGCAGGCTTTTGCATTACGAATAAAAAGAACGGAATTAAATAAGCTTAATTACCACAAAACCATCAGGATCCGTGTTTGCCATCAGCGGTGCTGATTTAGTCACCGTATAGCTCTTAGCTACATCACCTGTGTCTTTATAGGTTTTTTGGAAGCGATCCGTTGCATACATACCAGCTTCAATGGCGTCATCGTCAAGAATTGCGCCATACATACGCGCACATTCAATACCTGTATGACCAAGCACATACGTGTAATCATCAAGGTACTTTTCTTTAACACCATCAACTAAACGGTAACCGGTATACACCACCACCATCACATCGCCAAGGTAACCTTTATAAGAAACGTCTTTACCAAGGTCTTTTAGCGCGGTTTCAAGCTGTGAATTAGAACCGCGGCGAGTTTCTAACTTCTCACGCACCTTGTCAAACTCAAGCAAAAGCGACCAACCTTTTTGGTCTACGATCATGATATCTATGCCATGCTCTGATGCAGATGCATATTGCTCGATATCATCCAGTGGATTATAAGTAGCTCTATCTTTACTAACCCAAGAAGCCGCACCAGCAAGGTTAATTTGATTTTCCGCGCTGCGATTAAAGTTAACGTTAATCGGCTCGGGTAAACCTTCACCTTCACAGGTGTAATTTGCATCAACAATAGAGTGCACGGCCATCCATTCTTCTGTTTGGCCAATTGCAACATCTTCATCGAGTAAATTTTGAACCATAATGGCTTGACGGCGTTGCTGTAAAGTTAACTCACCATTTAAACCTTCACCAGCACGGCGTTTAAGCGTCTTTTGGGGATTGATATCATGTTTTGGCTTAAGTGATGCAGGTTTGATTGTCGAGGTGGTATGCCCACGTGTTTTAATTACCTTGCCGTTTACTTCTGGCGCGATAAACACTGCAGAGTTCACAGCATTGTCAACTTTATCGAGTGCCACCTCTTCCGTTTCAAACGTGTACATGTGAGGGAAAAACAATGTTAAGAACAAGTTAGAAGAGCGAGCTCTTTTCTGTTGCACTACATTATATAGCGCGCGTGGAGAATTTAAGTCAGACATTATAATTCCTTATTCTACTGAGCCAGTTGCGATTGGCGTACCATCAAAAGCCGCCGCTTTTTGTGCTGCGGTTGCAGCGACAGGCCACGCAATAAGCGCGTTATTAAAGTGGCCACCATCATAAAATGGCGCTGTACGTGCACCTGATGTGGTATCAACTGCGCCAACAGTCAAAGCAATGGCTTTTTCTGTGCCGTCATTGGCTGCAGGATCCCATGCTTTTAACTCACCCGATGCGCTAACGCGGCCAACTGGCGCTAATTTTTCTAATGTTTGGCTCGCTGCAAACGTGCCGGTGTTTTGCGTCATGACGCGTTCGCCTGCCGCAATTTGCTCATGCTCAAATACTTGTTCCATGACAGTTCCTTATTTACCAAATACCGCTTCGTGAGCACCAAGCAAAGCTTGCTCACACTTTTCATCTACTGACAGTTCTTCAGCATCGCCGCTTGTTTCCAAGTTTGGTTGCTCTGTGTTTGCCATTGCAGCATCAAGCGCTGTTTGCATGTTTGCACCTTTTGGCTCACTTGCTTCAAGCTTTGTTTCTGCCGGTGATACTGCTAGCAGTGCATTTGCTTCATCAGCACTCATGCTAGTTTGCAGTGCTAAATGGCTTGCCATTTTTTGACGGCCGTCTGCATGTTCGCTTGACAGGATCGCACCAATGCGTTCACGTTCTGCAGTTGCACCGTCAAGGCGCGCTTGATTAACATCGGCTTCGGTTAGCGTTGCACCTGCTACAGTGCCGCCATTAGCGTGCGCGTTTGGCGCTTGGTTTTCTTCTAGTGACATAGCCACTCCGTTAGTTGTTGAGTTAGTTCGAAGAATATCGAGCAAAAGCGGCACGGCTTCTTGCCCGTTTACAAGACGATCAGCAAAACCAACATCAATGGCTTGCTGACCTGTGAATACTTTTGCTTCGGTATCGAGTACCGCTTGCTTATCCATGCCAATGCCTTTGGCAACCAATGCGGCAAATTTATCGCGCGATTCGTTCAAATTGGCTTGGATTGAAGCAAGCACATCGTCTGACAGTTTTTCATATGGATTGCCGTCTACTTTGTGCTTGCCGGAATGAATTAGCGTGATATCAATACCGCTTTCTTTCAGTTGGTCTTGATAGCTGGCATGTGCCACCACCACACCAACCGAACCAGCACGACCAGATTGCGTGATCCAACGCTCGGTACACGCTGAGCCAATGGCCATGGCTGCGCTGCACATGGTGTCGTAACACAAGGCATAAATTGGCTTGATTTCACGTAGTTTTAAAATAGTTTCAGCACAATCAAAACACCCTGCTGCTTCACCGCCTGGGGAGTCGATATCAAGCATAATGGCTTTAATACTTGGATCATTTGCTGCGTCATTGAGCCTTGCAATAATGCCGTCATAACCTGTTGAACCCGAATACGGTTTAAGGTAACCATATTTATGCAGTAACGTGCCTGAGATTGGCAAAATCGCTAAACCGTCTTTTACCTGATACGGGCGGTCACGCTCGCGAGTTGGACCAAAGCTTGATGCACGCTCTTTCATATCGGCTTGTTCAAGCATTTGGCCTTCAGGGTCAATCAAACTACTGAATTGCCCTTTTACACCAAGCACTGAGAAAAACGTGGTTGCATAGGCTGGCTCTAAAAAATGCGGTTGATTACACGCACGGCTAAGAATATGCGTCATTGACACATTGTTAGTTGTTTGCGGCATTGTCACTACCTTGATTAGTTTCGGGATCAAAGCCAAGCGCGGTTATCCAGCTTGGCGGTGGCAACCCTTTGGCTTTGCGCTCATTAATTTCACGTGCTTGCTGCTCAAATACTTCTTGATAGTCTTCGCCCATCAAAGCTAGCTCTTTTTCATAGGTTGAAAGGCCAGATTCAATACGCAAAATGGCTTCTTTCACTTCTTTTAAGCCATCGATGCTTAAGCGGCCTGCACCTATCCATTCGCAGCGCGTCCACGCTTCTTTACGTTCGTAAAAGTCAAAACGTGATTTTGGCGCTTTCAAAACGCCACGATGTAGCGCTTCCTCAAGCCAGTTTGCAAAAACTAATGATGCAAACCGTGCCGCAATGGTTTTACGTTTACCCATGGTGTAACGGAATGATTCGTTAATACTGGCCCGTGCACTTGAGTAGTTTACTTTTGAGTAATCACGGGCGATTTGCTCGTAACTCATACCAAAGCCGCTGGCAATAAAGCGCAGCATTGAGCTTTCTAAATCGGTAAAACCGTTGTTTGCATTGGTTGACTGAGTAAATTTAAGTTTTTCACCGGGTACTAAATGCGGAATACGCGCACCGTTCATGCGTATATTGGCGGCATTGTGGTATTCGCCCATGTAGGCCATCCACTTTTGCATTTGATCCGCGCCGCCCTCACCCGAAATAAGCTGAAACGCCTGTTCTGAGTCGAGTTCGGACTCAATCACGGCCGCATACATGGCGTTGATAATGGCATTTTGCAGTTTGGTGTTTTGCAGCTTAGATAAACTTTGCATTTGCTCCATCATGGATAACAGCAAATTAGCACCGCGCGTTTGGCCTGCTTCAAGCGGTTCAAACACATGTAAAAATTGTGTTCTGCCCCACTGGTTTTCGCGCTTAACAAAACGCCACTTACCTGAGTAATTATCAAGTTCGCCAAATTCGTTGTAAGCATCTTGCTTTACGTAATAACCCGTTGCTGCGCCGTGGCGATCAAGTGTGACGCCACCGCGCAAACGATTATTACTTAATAAACCGTTTGGATTGCTCACACGTTTAGGTGATACCAACTTAATTGCGGTTTTGAATAATGCGCCTGGGCGATTAATCCATTCACTGACCGCCATACCTTCACCGTGATGACAGTGACCAGCGGCAATGGCGCGCACTAACATGGTAAACGTGCGCTTTCGCTCAGCATCGATATAGCAGTAGGATGATTCTGCGTATTCAATAAAGGCTTGCTCGGCTTCTTTCATAAAGCTGCGCGCATCATCTTCATTCATGCCGAGCACTTTATGATTTAAACGGTAACTTGGACGAAAAAGCGAACCAACCACGTTATCAATGTGCATTTGCACGCCACCTTGCGCAATGGCGTTATTTCGAACCAGATCATCAGCGCGTGCATTGGCAGTATCAATCACTGGCAATAATGCGGCATCTGCACTTTTGTTGGCTGGATACCAACTTGCCATTTGACCACCAAAACCATTGCCGCCGCCACTGTAACTGGCGCAATGCTCACGTAATGGTGTTTCACCATCCATTGCTACTAATTGAGTCATAGTGAAACCCTTGCTGGACCTTTGCGGCGTGATGAACTGCCGTTAATGAGCGTTTCTAATTCGTTAATGTAGGCTTTTAATTCGTTTTTATTGGCCTGTGTAAAACGTGTTTCGCGGCCATTGCGGCTAAATGACACCACGGCTTGACCTGTTAGCAGGTTGTGATATGCCTGCTGTGCTTCTGAAAGCTGTGTTTTTAACTGTTCGATACTCATTTAGTCTGAACCTAATTGTTTGCCGAGAGCGGCAAAGCTATTTGAAGTGGATTTTTGTGTTTCTTGCTGACTTACAAACGATTCAAGCTTTAGTGAAAAACGTTCAATTGAGATATAAAGCGCAGCAAGCGCATAGTTGAAACAATCGAGCGCTTCGTTTCTTCGCCCTTCATTGTCGTAAACCATGATCACGCCTTGCTTTGTTTTTTTCGGTTTACGAATTTCTGACACCAGCTGTTTACACACATCTTCACCACACATTTGATCGTCAAGCGGTAGATGAATGGCTCTAGGTTCGCCCAGTGGCAATTCAATGTCGGTGTAAAAAATATCTTTGGCGGTATCGGTACCAATTTGCGCGATGAACGTGCCGCTTTGCTTGTTAATTTTGGGCGCCATGTTTTGCACTGGCTGACCATACGAGGTGGCACCTTTTACAGGTATAAACTTCATTAAGCCAATGCGTTTTGAAAGTTTATAAACAACTTCGGTGCGGTGGCCTGCTAAATCCCAACAAGTGCGCGAAATCGTTAAATGCGCGCCATCTTCGCGTTTATAAGTGCGCTTGGTAAATTCAACCACCGCATCTTGCACAGACTCATCACGTGGATCGCCTACATGATAAAAACGGTCAATTAAATATTTACGGTTGTCTGCAGTAAAGCCCCAAACAAAGGCTTCCATGCGGTTATCTTGCGTATCACCGCCTGCTGTTAAATAAACAACGTCTTTTGGTACTTGCGCTTTGTATTGCTCGCGCTTTTCAAGCAATATTTCGTGATCTAAGCGTTTGGTGTTGGCTGGCTCAAAGTGCAAGCCAAGTGTTAAGTTGATAAACGCTTGCAGCTTATCGGGGTCACCTTTGATATCGAGCCATTCACGAACAATCTCTATCCAACCTTCTGTAAGGTTGAGTGAGTAAAGCGCGGAGCACTTAATACCAATACGTTTTGGCGCCTTGATTTTGTTGCCAATCTTGTTGTAAAAACTTAAGCCGTCCTCGGTCCATATTCCGGTTTTTTCACAAATCCAGCGGCCAGCATCTTCCATTTTGGAAAGTTGGCGGTAATAAATACGACCTTTTGCTTCGCTTTCATGGCACTTATGATTTTTGCAGCTGTAATAGGCGCTTTGGGATTTAGCTTCATCACCTTGAAGTGTGTTATCCCACTTCATACCGTGCAGAGAGTCTTTATCACCCCACTCTAGTTTTTGCCGAGTACCGCAATGTGGGCAAGGCAAATGAAAACTGAGTATGCAATCCGCTTCATTCATTAAGCGTTCGATTACATCGCCCGTAAATACAACCGTTGAACCAAAGATTGCTTTACCAAATGATGCGCCTTGAATACGCTTCATCAGTACTTTGATGTTATCACCTTCACCGCCGTTGACGCTAAACGCACCAAGTTCGTCAGCAATAACAACTTGTTTTGTAATGGCACGAAAGTTATTTGGACTTTCCGCACCTTTCACATCGATTGAGAAACCAGTACACACCTTTTTCTTAATCGTGTTGTTTTCGTTATTCAATGACCAATCAGGAAAGGCCGATTGAATAGCTGGCACCACAGGCAGAAGTGGATTGACCTCTGACACCATAAAATCTTTTGCAAGCTGATCGTTTGGTTGATACACAACCGCACTGCGCTTTTTGTGCACACCCAGATACCAAAGCGCACCGCACAGCATTTTGGTATAACCAAATCGCGTTGGCTTTTGCACTGTAATGACTTGCACCGCATCATTACCCATCATGTTTAAAATGGCTTTTTGCACGGGTTGCGTATGCCAGCGACCAGGTATTTGCGAACTACCCTCTGGCAAGAAAAAGTGGTCATCACACCATTCAACACACGTTTGTGGCACTTGCGTTTTAAGTGGACTCAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP023398|1809850:1833891|1809850_1810564_+|WP_100913521.1|integrase|DBSCAN-SWA MDGRPLVNDEFWGDLIDKSSGRNSQFKERDQLLLGLACLAGFREIELTLCTIDLFIAPNGELNELIVMPESIAYDNCERPIPLAHPDLQNLFQTYIKWLLNNGLNTHPGESYLGLNPNAQLFVDDSYKPYTVQKRGNDTLSPNSLNKHLDSLIKRASLWDCGIRRKSFVRTFIINGYRAGMSTSDLIVISGLGHDTIEKTLTMDYEQYSPIAEWFVKRREQKVKHLESMKKRRRFML >NZ_CP023398|1809850:1833891|1816145_1817825_-|WP_100913534.1|DBSCAN-SWA MTQLVINFSRNFTAKISPLLINFGSSSGGGDNDGNPTDPTDELNNKDWLMIENVCTYSLQQSITHQFVFSSQIQQSLNHEVSLAHRNQDNLVVESGLKWQQQNAVKREWLTKYSLLAATQTERELIYLQLSKVETDYKIDWASQSTQTQNEFTFYFGPRDAEYICVPTKHPAPGVIYLSLDRPASNKQSPLSITLNDIAKVCYWSDVGLIRAFDDIPTIDRKVPIEPQIQNTYIMQPTITCKRVSDNREIYISNFAWSKSRGQFAATTDITFQTKLDFDYALNQLLEITVNGYVFYAYCETPGISRSFNNREYRCSGRSRIAELSQPYVRPTNYTNTVDRTHAGICSDILQNTGWLLQYSMVDYAVPAESFSYVDKASASALNDVAKAIGAMLDVGESTKTINVVPFWPVQPWRVDAATCDIILNESHIDSHNSQFVNKLAANAVFVSGQQNGVACKIRRDNTAGDLTASDVVDSLITDNQAARQRGECELGNAGNKEESTVVTDIKADLPPIKLGSLVGIQYPGELYKATCDSLTINASVDSNGAIDVTQTVKLVKNV >NZ_CP023398|1809850:1833891|1819088_1821212_-|WP_100913537.1|DBSCAN-SWA MQRNHLKIFKSESLEQHVDAGGHRTNNEVVNGKLNDVFRSITEVDYAQSAFEVVKLFPSVATHDATRLADSHVYLSEKPQDPLVNTLLIESPSLADDTKMATMRNLITSSTAKYHGMSVLTNAVESGALDLIVETTQTQVLPQVTTANDKLNVQAYAFSPNSSTTYNKTRVKTFVAPSSSNAAIFNINVDDYFEYLPDFTVKYLSVNGWRTLKPYENNAHFDEVKYQNNQFVFTLSAVYPMRAGTTLEVTYHSNKDYRKHSFATTPTLTLAPGEKIQPNTFRVKKAGEDGAVYTDKNGIFIDNTGNVFAQVNHDTGEITPSNAVDLNGTVNDDLGCIVLVATTESQAYPTNISFSVESDEYRVDSLRLVITRANGTTFSVGANASGVISHAEINGTLENGYVTLTASVDVANIAYDLTVIEKTAIDAPWLGINASALPNGGNVYIFNENRLVCIQNRNRTEKTSLTNGQTINSIVDADYVDITDSTGASLYDSNDANYSYDKPSGVITINTGISGFTAPFIITAVQFELRLVTQVEPNKLRLLSPVLRNYPQGAVVSSVTMLGDLQALALDERTQSAWSNNFGQKSAAGSSSLNLQQYPIEMNNLGAINQRWALVWGTNNTFVLFGESVGEIGSGDVLNDFAPINPITQQPFMIIRKEALGSGLQPGECLLFETTAAGNPIMVSRCISPGHSNIEFDSTTLSFKGYA >NZ_CP023398|1809850:1833891|1814711_1815065_+|WP_100913529.1|DBSCAN-SWA MSDYEYQHAIHGESKVKQKAFVDMFKPVIAWAMNNNNQIFEQIETSFIFGKVHRKNDPSFDVTVFFRGGDNESFTFYAWRDENKLKAQKEILKKILKAKDLNEYKSLSDRFNNRDLL >NZ_CP023398|1809850:1833891|1812139_1812664_+|WP_100913525.1|DBSCAN-SWA MSEETNKAVSFERVRSSVPKNSVGILAYSTVLFHSRLGEIAISGSDRDTVCTAFNRLKTKSTTSCDPDRLQEVCFFKQDDLEDNNENFEQLNSTPWQPMETAPKNGDEVILYVEKRAGIPGGFLVGHYMGGGHCIEDHPPIDAGWYFWNGCQFDLASKPLAWMPLPKLPTGFKY >NZ_CP023398|1809850:1833891|1828875_1830219_-|WP_100913546.1|DBSCAN-SWA MPQTTNNVSMTHILSRACNQPHFLEPAYATTFFSVLGVKGQFSSLIDPEGQMLEQADMKERASSFGPTRERDRPYQVKDGLAILPISGTLLHKYGYLKPYSGSTGYDGIIARLNDAANDPSIKAIMLDIDSPGGEAAGCFDCAETILKLREIKPIYALCYDTMCSAAMAIGSACTERWITQSGRAGSVGVVVAHASYQDQLKESGIDITLIHSGKHKVDGNPYEKLSDDVLASIQANLNESRDKFAALVAKGIGMDKQAVLDTEAKVFTGQQAIDVGFADRLVNGQEAVPLLLDILRTNSTTNGVAMSLEENQAPNAHANGGTVAGATLTEADVNQARLDGATAERERIGAILSSEHADGRQKMASHLALQTSMSADEANALLAVSPAETKLEASEPKGANMQTALDAAMANTEQPNLETSGDAEELSVDEKCEQALLGAHEAVFGK >NZ_CP023398|1809850:1833891|1810977_1811844_+|WP_100913523.1|DBSCAN-SWA MAKYLLTSSFGNDSVALIQLALNKGLDFEVVYNDTGWARKDWPKRVALFSSWLMEKGITLHITKSIGMAELVKKKKGWPMPASKMQFCTQELKEKPTEELLNKIDPDCELIIVTGRRREESQNRADLPLWQHESPKHGGRDVWNPLINHDEKQRDELIKQTGFEVLPHSSMECYPCVCANKDDLAQLLETPDRIDEIERLEIEMGFTRNQKPRVMFRPYRVGNGVGIRQAVLWGAGARGYKSGFIPNEYKIAGEQCLMFEGISDIAYEINTREGREFARQCDGGFCGN >NZ_CP023398|1809850:1833891|1812690_1813167_+|WP_100913526.1|DBSCAN-SWA MNQQVLDPCCGSKMFWFDNDHPDVLFGDIRDESHILCDGRKLEIKPDQVLDFRNLPFDDESFKLVVFDPPHLVRAGENGWQKKKYGKLGQEWKEDIKNGFKECFRVLATGGVLIFKWNETQIKTSEILKLTDVKPLFGHVSGKRANTHWVCFMKTNNS >NZ_CP023398|1809850:1833891|1815270_1815543_+|WP_100913531.1|DBSCAN-SWA MSQQQAVEITAKLYQCREQQIFLGGEDGFIQMFDKWKPVVEATMNKHQCSELPALIELLKLAESKPDGGMMMHVLNAVVCEMLEPTVTAH >NZ_CP023398|1809850:1833891|1815931_1816153_-|WP_100913533.1|DBSCAN-SWA MSDQLQRLKTLIGYSRTIVTITKVRSDGSAIVTHNDGSTSVAVGQNLALGKAYLEGDRIIGNAPDLPFSEIII >NZ_CP023398|1809850:1833891|1813179_1813455_+|WP_100913527.1|DBSCAN-SWA MDVLIAKTARSFEEMTPVFEAEKVNEVFSQLQKQIADLEKAVVEEIENRDKWEEKATYLAECVGVYFDESVGEHSSANCPIANAHDLLNQI >NZ_CP023398|1809850:1833891|1826649_1826964_-|WP_100913543.1|DBSCAN-SWA MMLSKQLASSLQAERAVFLQVATPCEFAHLPSESKHVDVYTSYRTVNNEFDGTKDREAITIAEFLLDEGDFEPRHVFTLGAKRYMLTELYEQNEVTVKYIVVSV >NZ_CP023398|1809850:1833891|1832001_1833891_-|WP_157813487.1|terminase|DBSCAN-SWA MSPLKTQVPQTCVEWCDDHFFLPEGSSQIPGRWHTQPVQKAILNMMGNDAVQVITVQKPTRFGYTKMLCGALWYLGVHKKRSAVVYQPNDQLAKDFMVSEVNPLLPVVPAIQSAFPDWSLNNENNTIKKKVCTGFSIDVKGAESPNNFRAITKQVVIADELGAFSVNGGEGDNIKVLMKRIQGASFGKAIFGSTVVFTGDVIERLMNEADCILSFHLPCPHCGTRQKLEWGDKDSLHGMKWDNTLQGDEAKSQSAYYSCKNHKCHESEAKGRIYYRQLSKMEDAGRWICEKTGIWTEDGLSFYNKIGNKIKAPKRIGIKCSALYSLNLTEGWIEIVREWLDIKGDPDKLQAFINLTLGLHFEPANTKRLDHEILLEKREQYKAQVPKDVVYLTAGGDTQDNRMEAFVWGFTADNRKYLIDRFYHVGDPRDESVQDAVVEFTKRTYKREDGAHLTISRTCWDLAGHRTEVVYKLSKRIGLMKFIPVKGATSYGQPVQNMAPKINKQSGTFIAQIGTDTAKDIFYTDIELPLGEPRAIHLPLDDQMCGEDVCKQLVSEIRKPKKTKQGVIMVYDNEGRRNEALDCFNYALAALYISIERFSLKLESFVSQQETQKSTSNSFAALGKQLGSD >NZ_CP023398|1809850:1833891|1821211_1821571_-|WP_100913538.1|DBSCAN-SWA MIQLDTISLDNFVWRNQGATINIAAKERRALNGVGKVLLTPLSDFEIVLFAELEPSAIFEAIEAHAKTNLDTFDVTIHSVVYTCRWNYLKGAAIAEPQRAYSDVKPDYYKNIQLFLVTV >NZ_CP023398|1809850:1833891|1825707_1826103_-|WP_100913541.1|DBSCAN-SWA MKKAQNITEAMDAVKAFFANYGNVTLGFYQLAGTQKDGDIFIQPVDLQTVKSKSVNLLLVLAKDATATNALDDLLATSESLQQAITNAGKNVNAALGGKLIQFQETEPAKIIAPESNERVARALFTLNIKY >NZ_CP023398|1809850:1833891|1811864_1812128_+|WP_100913524.1|DBSCAN-SWA MEYFDKEAVYDEKIAPLMKEIIAVCKEHQIPALASFTFRNDEEDGVGTCDTLLSHSDDRNNPKYPAALNEIRKSDGFITAVTIAKRV >NZ_CP023398|1809850:1833891|1815075_1815258_+|WP_100913530.1|DBSCAN-SWA MPSFLDNAPEFMKKIFEKDFRFWCCSNKEHREVEWIESKTQVQAQCKECGEKSPIFNKNS >NZ_CP023398|1809850:1833891|1824930_1825695_-|WP_100913540.1|DBSCAN-SWA MAEGFILRGEVMIQRVNTKGVAAVNSPLLGPMNVEMLELKAESEQITRTSKNKSTFGKSLGGVKDPKPTSIKLKLDEVPSEVLAEALGAEIGDFSEGAATLTDKAVTLFAAATGGWSEIGKKQLQTGVVVKKVAATLTEGVDYEIKYAAGLIRALPGGEMDAGGEVLVSGNALALDGKRLTGGETQEVNWRIILDGENIDTGAPVHLEIPLTQLAASSDINLLQNEYLAPEFEGVCTVATGKDKDYCMDIVNAA >NZ_CP023398|1809850:1833891|1818129_1819089_-|WP_100913536.1|DBSCAN-SWA MTDAVTVYRWDDVGAPSLASRKPSDIMKVVKACLVDGYGTKSPLGWSVEEDEINTAAPHLAVRNNMALGGSGGVTSFSAVNDSAGTNSRVQSHQEYTSKTVFAKSSYYYDVRIRGGSSAPWVIIGTAKAFWFIKPYNISTNYWGRSDEYSWIFFCGDMYSFIPNDPNTFIQIGRRGNGSNTSYSVEMPYTLTQGTPIDILVTYPINNENDKLSSSLISPMGNGGFAVSSAHANVSPTVTMLADMFIIFGNSLQGSVSSYNDPTRPRLRGRVPGLKVSDVSGWRNEAFPVIKPIHTQTYFQMPDGYSDTSCVWIGLETWE >NZ_CP023398|1809850:1833891|1815568_1815748_+|WP_100913532.1|DBSCAN-SWA MNYRLLEHDEVIQSGDEFLEDDAKTWTEITDKSPCSWAIGMKWKGRGLKPMRRKVQNSK >NZ_CP023398|1809850:1833891|1810560_1810851_-|WP_100913522.1|DBSCAN-SWA MAEQKAPVRSGRSEREIELDEIYGDEATALIKGAMGLVNVDFTELAKRLEAQGHTVAVSTLRKKVNGGQCKASFVMRIMDALGVDLAPIKKRTTEN >NZ_CP023398|1809850:1833891|1813651_1813876_+|WP_038640888.1|DBSCAN-SWA MENQHKKIKGYRDLSQAEIDAMNEAKALAENVGSLVEKLQGQDGLDQRWIATAKTDLQKGFMSLIRGIAQPTTF >NZ_CP023398|1809850:1833891|1814308_1814695_+|WP_100913528.1|DBSCAN-SWA MANRYGLDASYFTSKLEQLIRDIDNYTPDEFARVMARMSKTADKNVVLEPEFQSNESWVHVGEKLPTPAEGIEYLCVDYDGVRWLCEYDYNQDDEAVFTVLGSAAQPKIGSIVYWSEIQEHPPINTNS >NZ_CP023398|1809850:1833891|1813467_1813641_+|WP_157813484.1|DBSCAN-SWA MSEKQALESMKGKVIESIQHHEVIGSIQFNFTDGSTVQVFAQEKASQQIMIIEPNTK >NZ_CP023398|1809850:1833891|1831782_1832001_-|WP_100913548.1|tail|DBSCAN-SWA MSIEQLKTQLSEAQQAYHNLLTGQAVVSFSRNGRETRFTQANKNELKAYINELETLINGSSSRRKGPARVSL >NZ_CP023398|1809850:1833891|1830202_1831786_-|WP_100913547.1|portal|DBSCAN-SWA MTQLVAMDGETPLREHCASYSGGGNGFGGQMASWYPANKSADAALLPVIDTANARADDLVRNNAIAQGGVQMHIDNVVGSLFRPSYRLNHKVLGMNEDDARSFMKEAEQAFIEYAESSYCYIDAERKRTFTMLVRAIAAGHCHHGEGMAVSEWINRPGALFKTAIKLVSPKRVSNPNGLLSNNRLRGGVTLDRHGAATGYYVKQDAYNEFGELDNYSGKWRFVKRENQWGRTQFLHVFEPLEAGQTRGANLLLSMMEQMQSLSKLQNTKLQNAIINAMYAAVIESELDSEQAFQLISGEGGADQMQKWMAYMGEYHNAANIRMNGARIPHLVPGEKLKFTQSTNANNGFTDLESSMLRFIASGFGMSYEQIARDYSKVNYSSARASINESFRYTMGKRKTIAARFASLVFANWLEEALHRGVLKAPKSRFDFYERKEAWTRCEWIGAGRLSIDGLKEVKEAILRIESGLSTYEKELALMGEDYQEVFEQQAREINERKAKGLPPPSWITALGFDPETNQGSDNAANN >NZ_CP023398|1809850:1833891|1813943_1814300_+|WP_052140949.1|DBSCAN-SWA MSIFQCENCGCAENTACANQGFKGIKHLFDWSYNPSLEGKLLCRACGPTKFSDGSKMKSDHWGEYGIWHNRFERRYLPHGEFKTNNQGNLEHIESGLIGNEAYKKFARSEPYPLKENV >NZ_CP023398|1809850:1833891|1827474_1828509_-|WP_100913544.1|capsid|DBSCAN-SWA MSDLNSPRALYNVVQQKRARSSNLFLTLFFPHMYTFETEEVALDKVDNAVNSAVFIAPEVNGKVIKTRGHTTSTIKPASLKPKHDINPQKTLKRRAGEGLNGELTLQQRRQAIMVQNLLDEDVAIGQTEEWMAVHSIVDANYTCEGEGLPEPINVNFNRSAENQINLAGAASWVSKDRATYNPLDDIEQYASASEHGIDIMIVDQKGWSLLLEFDKVREKLETRRGSNSQLETALKDLGKDVSYKGYLGDVMVVVYTGYRLVDGVKEKYLDDYTYVLGHTGIECARMYGAILDDDAIEAGMYATDRFQKTYKDTGDVAKSYTVTKSAPLMANTDPDGFVVIKLI >NZ_CP023398|1809850:1833891|1828518_1828866_-|WP_100913545.1|head|DBSCAN-SWA MEQVFEHEQIAAGERVMTQNTGTFAASQTLEKLAPVGRVSASGELKAWDPAANDGTEKAIALTVGAVDTTSGARTAPFYDGGHFNNALIAWPVAATAAQKAAAFDGTPIATGSVE >NZ_CP023398|1809850:1833891|1826099_1826642_-|WP_100913542.1|DBSCAN-SWA MKVFEQLAGDLVRIKEQNAHALEVAVTQTAEFAHNKLVDAIYDRYNFRFRDYIAERINLKIDSKELEFRVASRMRKSSSVNFMEGPVYRKSKNKRTPNKQVIAGYTGAYLRGKTQHWRGAFLFTGLNGNLLMGYRNQGVKGREIPEVSYGVSVGGALGVVKDKQVDPVMTHLLKMYQKVL >NZ_CP023398|1809850:1833891|1821567_1824870_-|WP_157813486.1|DBSCAN-SWA MSFKNTVVEFIIKGRDLFSPAADKAAKQAEALENASKALNDELKSIEGVKSQTEQYDKLTTSIDAVEKSYLAASKSLVALVAEQKQSKAALKDAETSYQAIKREVSALEAAQKTATTVSKEESRALIEKRQTLNQLEVEYNKAVATNADYNLKVKGTRTEVNQLGTAFNKNKAEIASLEKSLESAGVDLNNLSKSSKELSQRQVAAQQAIAQNNNKLAKHKKLLNEAGKEAKEFGGSIGAATKSLIAMAGAYVGIDTLKESLFGVLNAGDKAKAFEAQMTALMGSFEAGQQATEWMRTFADDTGTQINAVRESFSTLKTFGIDPMNGSMQALVDYNAKLGGSQEKLNGIILAVGQAWAKQKLQGEEILQLVERGVPVWDLLAKVTGKNTVELQAMSSAGELGRDTIKALLNELGNAASGQASKSLERLSGQMAVLSNKWQEFKITIADSGVYQVAVDFLKELNAQFEKMANNGQLESAAKKISEFFTAIVRDGGAAVKAFIDNVNAFFTGVETIVGGFRILINGFTAGVKTIGFVVTQYLQGVMTVFAQTLDLFGADTWSQKAQNVANALSAVSDGFYQGILQDGQDLSNAWDQLTQQTGEKVKQTHTDVTAKVKETQAEQKAAFGESAEAGKKAADELGLAMSKAGIVTTQSLRDAAEQAKLLFDTLLEQYKKGEVGIYEVEQAYTKWAQAAVKVADATREGLDPAIKAEAAALGLSDALASLTKKSQTLNPALDENSDAVKRFQQQVDETKAKVAEFEAIMNDANASLNDKNTAYARLTDAQRLLKTQTESLNNVKRAEKANYYELINIQSAYERKMDELNRQYESGIISSYEYKERKRELSSVMSVVNDLLGDFANKQRAATQATRAGAQATKEATKADLESAKAMSKKASSLNQVANNAGRAASSMFNYNKQSGLGVNDAGNRTFTTFDPKVAERAKRDEETAAAAQAQFEKYVSRINAASGNTSALNQLLNEINNYLNFLSKAQEKELSDMIKAMKSQSETSSPYRAGPSSRNGGSSSGGNDGGGNNGGGKTKQTPATTQQSVQLNKLINVLEKSPLASSKAVRLELVMPTGRKGSAFTTDTNLIDELERLSQTQ >NZ_CP023398|1809850:1833891|1817821_1818133_-|WP_100913535.1|DBSCAN-SWA MIHFEISSYNPVLLTNSPLNIGVVETTLDALGERLLVIDRRGKVITQRLVPANNKLAIIVPFEYTVNNNLMCILLDDDAEKDGELVDNVKAAVIDMKNYTPPV >NZ_CP023398|1809850:1833891|1815782_1815923_+|WP_157813485.1|DBSCAN-SWA MKTTQQKTWQRRKVIITPSFAKVNFTDYLTACFELEQQVFKAMRTK |
33 | Vibrio_phage(25.0%) | tail,integrase,portal,head,terminase,capsid | attL 1804484:1804498|attR 1825028:1825042 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
2342137 : 2351658
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP023398|2342137:2351658|DBSCAN-SWA CTTAAATTTGTTCTAACAGGTTAACCATTTCTAGCGCACCTAAAGCAGCTTCGCCACCTTTGTTGCCCATTTTGGTGCCCGCACGCTCAATTGCTTGCTCGATGTTTTCAGTTGTTAAAACACCAAACGCTACAGGAATATCGTAGTCTAGCGATACACTTGCTAGGCCTTTGTTTGACTCATTAGCAACTAAGTCAAAGTGCGGCGTACCACCACGAATAATCACACCAAGTGAAATGATTGCATCAAATTCTTTCTTTGCTGCGATGCGCTTAGCTGCTAGAGGAAGCTCTACAGCACCCGGCACATAAACAACGGTAATATCGTCGTCATTTACATCACCAATACGCTTTAAGGTATCAACTGCACCATCAAGTAAGCTTTTACCAATGAAATCGTTAAAGCGAGAAATAACGATAGCAAACTTTTTCCCTTGAGCGTTTAATTTACCTTCAATCACTTTCATTTTATGCCCTTCTCTTTGGCTAATTCATTTATCGCGCAATCGCAAAGGCGCGGATTTTATCATACTAAATTAAATTTTTTTAGGCTTATTTTTAGGCTTGTGGTTGAGCCTATTACGGCTCAACGTATTCAACTACTTCTAAGCCAAAACCTGAAATCGCATGGTACTTCTTCGGCATGCTCATTAAGCGCATTTTATGAATGCCTAAATCAGCCAAAATTTGTGAGCCTACGCCTACGGTGCGCGATGTACCTTGGAACTTGCGCACGCTCGGGCTTTCGCCTTTGTCTTGTGCTTCAAACGCTTTAAACAGGTGTTCAAGTTCTTCTGGGTTTTCGTGTTTCGCTAAAATAACTAATACACCGTTATTTTCAGCGATGTACTTCATCGCACCTGACAGTGTCCAGCTGCGATCGGATGCGCGATCTGACAATAAAATATCGTTAAACGTACTTTGTAAGTGAACACGTACTAAGGTTGCGTTATCTTCTTTTACATCACCTTTTTTCATTGCGTAATGAAGTTGACCGTCAATTGTGTCTTTATACGTCACTAGGTCAAACTCACCGTGTTCAGTTGGCAATTTACATTCTGCTACACGTTCAATAGTCGCTTCATTTAAGTTGCGGTATTCAATTAAATCCGCAATGGTACCCATTTTTAAACCGTGCTCTTTAGCAAAAATTTCTAAGTCTGGGCGGCGCGCCATGGTGCCATCTGGATTGAGTATTTCAACGATCACTGAAGATGGCTCGCAGCCTGCTAAGCGCGCTAAATCACAACCGGCTTCTGTGTGTCCTGCACGAGTCAATACACCACCCGGTTGCGCCATAATTGGAAAAATATGACCTGGCTGTACGATATCTTCAGGTACCGCACCTTTTGCCACTGCAGCTTGTACTGTGCGTGCACGGTCTGCTGCTGAAATACCCGTTGTTACACCCTTTGCCGCTTCAATCGACATGGTAAAATTGGTAGAAAACTGTGCGCCGTTATTGCTTACCATTAGTGGTAAGTTAAGTTGCTGGCAACGCTCTTGTGTTAGCGTTAAACAAATCAAACCACGGCCATGTGTCGCCATAAAGTTAATTGCATCTGCGCTGATGTGCTCAGATGCCATAATTAAGTCGCCTTCATTTTCACGGTCTTCGTCATCCATCAGAATGACCATTTTGCCTAGACGGATATCTTCAATAATTTCTGCTGCAGTGTGTAAGCCCATTTTGTTACCTTTGCTTGCGACTGGTTATCGATTCATAAAGCCAGCTTGCGCTAGAAGCTCATGAGTGAGAGAACTGGTTGGTTTTGCATCTTGCTGTGGCATCATTAAACGCTCTAAATAGCGTGCGAGCTGATCCACTTCTAAATTCACTTTGCTACCTACTTTAAAGCCACTAATCGTAGTTTCGCCTGCTGTGTGCGGCACAATAGTTAGCTTAAATTCGTTGTCGTTTATTTCGTTTACCGTAAGGCTAATGCCATCAATGCAAACCGAGCCTTTATACGGGATATACTTGAGTAATTGCGCTGGTGCACAAAGCCAGTAATCGGTTGCACGGGCATTGGTTTCAATACGGGTTACGGTAGCAATACCGTCTATATGCCCAGAAACTAAATGACCGCCTAAACGCGATTGTGGCATTAACGCTTTTTCAAGGTTAACGGTTTGCCCAACTTGGTAGTGTTCAAAGTTGGTTAACGATAGGGTTTCTTTTGATACATCGGCTACATAGCCGTGGCCTGTTAAAGCAATAACCGTTAGACATACGCCATTGGTTGCGATGCTGTCACCTAATTTAACATCAGCCAAATCAAGTTTGCCTGAGTTAATAGTAACGCGCATGTCGCCAGCAATCATTTCAATTGCCGCAACGTTTCCGGTTGCTTCTATAATGCCTGTAAACATATTAATTTCTTTTTACTAATGCATGTAGACAGATATCCTGTCCAACTTGCTTGAGTGTGTTTATTTCAAGTTCAGGGACTTGAGACATGGTTGTAAATGTTGGCAAGTTAATCAAGCTCTTGCTACTGTGCCCCATCAATTTAGGCGCCATAAACAAAACAAGTTCATCGACCAGCTGTTGTTCAAACATTTTACCTGCCAAGGTTGCGCCCGCTTCCAACAATACTAAGTTAACGCCGCGCTCGGCTAATAATGACAATAAAGCGCTTAAATCTACTTTGCCATTGTTTTGCTCAACCACTACTTGTTCAACAAAATGCGGCCAAGTGTGCGATTTATCAAGTGTTGTACGTGCAATAATTATTTTAGATGGTTGCTGAAATAACGCGAGATCCGGCGTTAGTCGGTTTTGCGAATCAATAATCACGCGGATTGGCCGGCGAGCACTGAGTTTTGCATCAGTATCTGTCAATTCATCAGGGCGGACATTTAATTTCGCATTATCGATAATAACAGTATCTGCCCCTGACAGTACTGCACAACTTTCTGCCCGTAAATCTTGCACAGCGCGGCGCGATTCACTGGAGGTAATCCATTTACTTTCACCATTTTCAAGTGCCGTTTTACCATCAAGGCTTGCCGCTAATTTACAGCGTACATAAGGCAAGTTGGTGCGCATGCGTTTTAAAAAGCCCAAATTGAGCGCTTCGGCTTCATCGTTTAATAAGCCAAAATCGGTTTCGATTCCAGCATCTTCGAGCATTTTTAAACCGCGACCTGCGACTTCTGGGTTTGGATCAACCATAGCAGCGACAACGCGGCTAACGCCTGCTTTAATTAGGCCTTCAGCACATGGCGGCGTGCGGCCATAATGCGAACACGGTTCTAGCGTGACGTACGCGGTTGCCCCTTTAGCGTTATCGCCAGCCATTTTCAGTGCATGCACTTCGGCATGAGGGCCACCTGCTTGAATATGAAAGCCTTCACCGATGATTTGATTATCTTTAACTAAAACGCAACCTACGTTGGGGTTAGGCGTGGTGGTAAATCGACCGCGCTTTGCAAGCTCGATGGCGCGCGCCATATATTTGTGATCACATTCAGAAAAGCGATTGTTGTGCATCAGATTACTCACCTAAGCGTGCGATTTCTTCGCCAAATTCTTTAATGTCTTCAAATGAGCGGTAAACAGATGCAAAACGTACGTAAGCAACTTTATCGAGTTTTTTCAGCGCTTCCATAATAAATTCGCCGACTGTTTTACTGTCAACTTCGCGCTCACCTGTTGCACGCAGTTGCGATTTAATTTGGTGTACTACTTCTTCTACTTGTTCTGTACTGACAGGACGTTTTTCTAGCGCACGATGCAAACCATTGCGAAGTTTGTCTTCATTAAACGGTTCTCGGCTGCCATCTTGCTTGATTACACGTGGCATAACAAGCTCAGCACCTTCAAAGGTAGTAAAACGTTCGTGACATTCATTACATTCGCGACGACGGCGAACCTGATGACCACCACCTACTAAGCGTGAATCGATAACTTTGGTGTCATTGGCTGTACAAAAAGGACAATGCATAAACAACTCTCGAAAGAAAAAAGGCCGCAAAAAAGCGGCCTTATCATATCATGTTAACAATGAGAAAACATTATGCGTAAACAGGGTGTTTCGCACAAATTGCTTTTACTTTCTCACGTACTTCATCTTGTACTTTTGTGTCTTCGATGTTGTCTAGCACATCACAGATCCAGCCAGTTAACTCTTTTGCTTCTGCTTCTTTAAAACCACGACGCGTGATAGCAGGTGAACCAATACGTAGACCAGACGTTACAAACGGAGAACGTGGGTCGTTTGGTACTGAGTTTTTGTTAACTGTAATGTAAGCATTACCAAGTGCTGCGTCCGCATCTTTACCTGTGATATCTTTGTTGATTAAATCAAGTAAGAATAGGTGGTTGTCTGTTTTACCAGAAACAACGTCGTAACCACGTGCTTTGAATACTTCAACCATCGCTTGCGCATTCTTAACAACTTGTGTTTGGTATGCTTTGAACTCTGGCTCTAGTGCTTCTTTAAATGCAACCGCTTTAGCAGCGATTACGTGACATAAAGGACCACCTTGACCACCTGGGAATACCGCAGAGTTTAACTTCTTGTAAATCTCTTCGTCGCCACAGTTAGACACGATTAAACCACCACGAGGACCCGCTAATGTTTTGTGTGTTGTTGTCGTAACCACGTGTGCAAATGGTACAGGGCTTGGGTAAACGCCTGCAGCAACAAGGCCGGCAACGTGTGCCATATCAACTAGAAGGTAAGCACCTACTTTGTCTGCGATTTCACGGAATTTAGCCCAATCAACAATACCTGAGTATGCTGAGAAACCACCGATAATCATTTTAGGCTTGTGCTCTAGTGCTAGCGCTTCAACTTGTGCGTAGTCGATTTCGCCAGTTTCTTCGTTTAAGCCGTACTGAACTGCGTTGTACGTTTTACCTGAGAAGTTTACGTGTGAACCGTGAGTTAGGTGACCGCCGTGTGCAAGGCTCATACCTAAAACTGTGTCGCCCGGCTGAAGTAACGCTTGGAATACAGCTGCGTTAGCTTGCGAACCCGCGTGCGGTTGTACGTTTGCGTAATCTGTGCCGAATAGTTCGTTAGCGCGGTCAATTGCTAATTGCTCAACAACATCAACGTGCTCACAACCACCATAGTAACGCTTGTAAGGGTAACCTTCCGCGTATTTGTTAGTTAACTGAGAACCTTGTGCTTCAAGTACACGCGGGCTACAGTAGTTTTCAGACGCGATAAGTTCGATGTGCTCTTCTTGACGCTGCTTTTCTTTTTCCATTGCGCCAAATAGTTCTGGATCAAAATCCGAGATATTCATGCTACGTTCTAACATGGTTACTCCTAAATAGAAGATGGTTGTGGGATAAAGGGCGTATTGTACGCTTAAAAAGCGCCCTTGCCTACTTTGTATATGTGAAGTTATTTAAACAGGGATTGAAAATTTTTCACAGTGTATTTTTAATGATATTTTTAAATCGAAATATAAATATTTAAAATACAACACAAATCATTCATTAAATTTAATTATACAACTACCAGTGTTTGCCTAATGATACAAAAAAGCTGGATTCGCCATCGTCGGTTGCGCCAAAACCAAATGCAGCTGGGCCAAAGCGTGTATCGGTGCCAACAAACAAGCTGCTGGCAAAAATAAGCTCAGATAAATCGACGCTTTGTTGAATATCCCACACATTGCCGACTTCCGCACTCACACCAAGGTAAAGTGGCATGCTGGTCATATTGAGCACATCGCGGCCTAAGTCGTATTGATAAATCACTGCGCCAAACGCTTTTTTAGGACCGTTTAACACGTTTTTACCGTAGCCCGATAAGTTTAAAAAACCACCCAAGCGCTCAGTATTAACGGTAAATGTACCGTCTGATTCATAACTGGTATACGATGCTGTGCCAACTAAGCCGTGGTGTCGAATAGAAAAAGCGCCGCGCCAATCAATTTTGTATTTACTGGCATAGCGACTGTCGATTGAGTCCAGGCGATCATCGTACGCTTCTTTAAAACGCCCATAATAAACACGTAGTCGATTACCTTGGGTTGGAAAACTAATGCTATCTAAGGTATCATACATCAAACTGCTGTAAAAACCCGTTTGATCAAAGGGAAATTTATCATTAATCAATGATTTTTGTTCAAGTTCGCCAACATCACGTATTGTGCCAAACTCAAATCGCCAAAAATTATTAATGTTGTAACCGAACGCCAAATTACCACGTAGGTGATTCTTTTCTAAATCAAAGTACGCTTGCGTTACCGTTTGTAGCTTGTATCTGTCTTTGATAAACGCGAGGCGAGCACTCGCATAATAGTCATCTTCAGCGTTTAGTGGTTGATAAAATTCGGTCGCGATATACTCTTCAAAACCAAGCGAGATCTCATTGCGCCAAAGCCCTCCATAGCGATTAATATTGCCGCGCGTAAACGCCATATCAATTGAGAGCACCGAGTTTACTGTCACGTCACTTTGATAATTAGCGCCAAAATCTAAATAGTTTGGCCCCCACGATTTACCAATTGTTTTTAATACCAACACTCGCCCTTGCTCGGTATCTTTAAACTCGGTTTCAACACGCTCAAATTCATTGAGCGCATAAACGCGATTTACCGCCTGCTGCACATCATGCTCTGACGGAGTGTCTCCTGGGTTAATCGAAAAGTGTTTGAGTATCAATTTATTATTAACGTCTGACAAATTATCCAACGAGATCGCAAGCACTGGGTTATAAATTGGCTGAAACCACACTTGGCTGCGCTCACGTTTAGCAACTTGATACGCCTCAAATGCACTCGGTGAAATAGCTAAGGTTTGCAGTTTTGGTTCCGCGATTTTTGCAGCGGCAAGGCCGCGCACTAACGACTCTTCTAAATCAGCCCAGTCGGTGGTGCTTAAGTCATCAATATTGGGGCGTATATATATATCTTTTTCACTTAGCAAACGTTTTTGCCATTGTGTTGAGGCATTGGTCAAAATGGTAGAAAGTTGGCCAAGTACTTCCATGGTGGATGTGAACTCATTCGATTGAGCAAGTGATGCACCTATATCTACAGCTATAACGATATCTGCGCCCATCTCTTTTACTACGTCAACCGGCAAATTATTGGCGATGCCACCATCAACCAGCAAGCGACCATCAATTTCGGTCGGCTGCACAGCCCCGGGTACCGAGGCCGAAGCTTGCATTGCTTCAGCAATACTACCAGATGAAATTACCACGGCTTCACTGGTCGATAAATCGGTCGCAATTGCGCGATATGGGATTGCCAGCTGGTCAAAATCATCAAACAAATGCACTGAACGCGTGGAGTTTCGCAATAAACGACTCATACTTTGCCCGACAAGCAAACCTTTAGATGAGCGCAATGCACCGTCACGAAGACCAAAATTCAGCTGTAGATTATATTTATCGCGTGCCTCTTTATCGCGATAGCGTAATACTTCTCTTGGAATATCATCTGAATAGCCTTGGTTCCAATCTTCATTAAGCATAAATTTTTCAATTTCACTTGCTGAATATCCCAAGGCATACATGCCCGCAACATAAGCGCCAATGCTAGTACCGGCGATATAATCGATTTTTACATTATTCGCTTCAAGTACCTTTAACACACCGATGTGCGCGCCGCCTTTGGCACCGCCGCCACCGAGCACTAAACCTATTTTTGGCTGATTCGATGCACTCAGTTCAAAACTCATCAATATTGAACAAAATAAAAGATAAAGAGAGGTTTTATTAAGCGCCGGCATAGGTAAGCAATTACTATTATTTTTATGTGACTTTACCTTAACTTAACAGAATGACATTTTAAATCTACTACGACATTGAATGCCCCCATTTATCACACTACAATATGCAGCAATTACATTATTAAAATCACTATAGGATTATTCATGCCTTTACCGGATAAGTTTATCTACACGATGCATCGTGTAAGTAAAGTAGTGCCGCCAAAGCGTACTATTTTAAAAGATATCTCGCTTTCATTTTTCCCTGGCGCAAAAATTGGTGTGCTTGGTTTAAACGGTGCAGGTAAATCTACCTTGCTTCGTATTATGGCGGGTGTTGATACTGAGTTTGAAGGTGAAGCGCGCGCGTTCCCTGATACAAAAATTGGTTACCTACCACAGGAGCCTGTACTAGACGAAAACAAAACTGTTCGTGAAACAGTTGAAGAAGCCCTTTCTGAAGTAACAAATGCCCTTGCACGTTTAGACGAAGTTTACGCTGAATACGCAATGGACGGTGCAGATTTTGACGCATTGGCAAAAGAGCAAGGTGAACTTGAAGCAATTATTCAGGCGCACGATGGTCACAACCTAGATAATGCATTAGAGCGTGCTGCCGATGCACTTCGTTTACCAGAGTGGGACACGCCAATTAAAGTACTTTCAGGTGGTGAGCGTCGTCGTGTAGCGATTTGTCGTTTATTACTTGAAAAGCCAGACATGCTATTACTCGACGAACCAACTAACCACTTAGATGCTGAATCAGTAGCTTGGTTAGAGCGTTTCTTACACGATTACGAAGGCACTGTTGTGGCGATTACCCACGACCGTTATTTCCTAGATAACGTTGCAGGTTGGATTTTAGAGCTTGACCGTGGTGAAGGTATTCCGTGGGAAGGTAACTACTCGTCATGGCTTGAGCAAAAAGATGCGCGTTTGAAGCAAGAAGAAAAAGCGGAGAAAGCACGTCAGAAATCAATTGCACAAGAACTTGAGTGGGTACGTTCAAACCCGAAAGGCCGCCAGGCTAAATCGAAAGCACGTATGGCCCAATTCTCAGAGCTACAACATTCTGATTACCAAAAGCGTAACGAAACCAACGAGCTATTTATTCCACCAGGTCCTCGCCTAGGTGATAAAGTACTAGAAGTAAATAACATCAAAAAAGGCTACGGTGACCGTGTACTAATTGATGATTTAAGCTTCTCTGTTCCAAAAGGTGCAATCGTCGGTATTATCGGTGCTAACGGTGCAGGTAAATCAACGTTATTCCGCATGTTAAGTGGGCAAGAAACACCCGATGCAGGTAACTTTGAACTTGGTGAAACGGTTCAGCTCGCAACGGTTGATCAGTTCCGTGATGACTTAGATGGTTCAAAAACAGTATTCCAAGAGCTTTCTGAAGGCCAAGATATTCTGAAAATTGGTAACTTTGAAATTCCAAGCCGTGCATACGTATCGCGCTTTAACTTTAAAGGCAATGACCAACAAAAATTCGTAAAAGACTTATCGGGTGGTGAGCGTAACCGTCTTCACTTAGCCAAGCTGTTAAAAGCTGGCGGTAACATGATCTTACTCGATGAGCCTACCAATGACCTCGACGTTGAAACACTACGTGCCCTTGAAAATGCGATTTTAGAATTCCCAGGCTGTGTAATGTGTATCTCGCACGACCGTTGGTTCCTTGACCGTATTGCTACGCATATTCTTGATTACCGTGATGAAGGTCAAGTTAACTTCTTCGAAGGTAACTACACCGAATACGAAGCATGGCTTAAGAAAACGCTTGGTGCAGAAGCTGCGCAGCCAAAACGTATTAAGTACAAGAAAATCGGTTAA
Protein sequences of DBSCAN-SWA_2 >NZ_CP023398|2342137:2351658|2342137_2342602_-|WP_010561687.1|DBSCAN-SWA MKVIEGKLNAQGKKFAIVISRFNDFIGKSLLDGAVDTLKRIGDVNDDDITVVYVPGAVELPLAAKRIAAKKEFDAIISLGVIIRGGTPHFDLVANESNKGLASVSLDYDIPVAFGVLTTENIEQAIERAGTKMGNKGGEAALGALEMVNLLEQI >NZ_CP023398|2342137:2351658|2349984_2351658_+|WP_100913860.1|DBSCAN-SWA MPLPDKFIYTMHRVSKVVPPKRTILKDISLSFFPGAKIGVLGLNGAGKSTLLRIMAGVDTEFEGEARAFPDTKIGYLPQEPVLDENKTVRETVEEALSEVTNALARLDEVYAEYAMDGADFDALAKEQGELEAIIQAHDGHNLDNALERAADALRLPEWDTPIKVLSGGERRRVAICRLLLEKPDMLLLDEPTNHLDAESVAWLERFLHDYEGTVVAITHDRYFLDNVAGWILELDRGEGIPWEGNYSSWLEQKDARLKQEEKAEKARQKSIAQELEWVRSNPKGRQAKSKARMAQFSELQHSDYQKRNETNELFIPPGPRLGDKVLEVNNIKKGYGDRVLIDDLSFSVPKGAIVGIIGANGAGKSTLFRMLSGQETPDAGNFELGETVQLATVDQFRDDLDGSKTVFQELSEGQDILKIGNFEIPSRAYVSRFNFKGNDQQKFVKDLSGGERNRLHLAKLLKAGGNMILLDEPTNDLDVETLRALENAILEFPGCVMCISHDRWFLDRIATHILDYRDEGQVNFFEGNYTEYEAWLKKTLGAEAAQPKRIKYKKIG >NZ_CP023398|2342137:2351658|2342714_2343824_-|WP_100913856.1|DBSCAN-SWA MGLHTAAEIIEDIRLGKMVILMDDEDRENEGDLIMASEHISADAINFMATHGRGLICLTLTQERCQQLNLPLMVSNNGAQFSTNFTMSIEAAKGVTTGISAADRARTVQAAVAKGAVPEDIVQPGHIFPIMAQPGGVLTRAGHTEAGCDLARLAGCEPSSVIVEILNPDGTMARRPDLEIFAKEHGLKMGTIADLIEYRNLNEATIERVAECKLPTEHGEFDLVTYKDTIDGQLHYAMKKGDVKEDNATLVRVHLQSTFNDILLSDRASDRSWTLSGAMKYIAENNGVLVILAKHENPEELEHLFKAFEAQDKGESPSVRKFQGTSRTVGVGSQILADLGIHKMRLMSMPKKYHAISGFGLEVVEYVEP >NZ_CP023398|2342137:2351658|2345635_2346085_-|WP_010561691.1|DBSCAN-SWA MHCPFCTANDTKVIDSRLVGGGHQVRRRRECNECHERFTTFEGAELVMPRVIKQDGSREPFNEDKLRNGLHRALEKRPVSTEQVEEVVHQIKSQLRATGEREVDSKTVGEFIMEALKKLDKVAYVRFASVYRSFEDIKEFGEEIARLGE >NZ_CP023398|2342137:2351658|2347611_2349840_-|WP_100913859.1|DBSCAN-SWA MPALNKTSLYLLFCSILMSFELSASNQPKIGLVLGGGGAKGGAHIGVLKVLEANNVKIDYIAGTSIGAYVAGMYALGYSASEIEKFMLNEDWNQGYSDDIPREVLRYRDKEARDKYNLQLNFGLRDGALRSSKGLLVGQSMSRLLRNSTRSVHLFDDFDQLAIPYRAIATDLSTSEAVVISSGSIAEAMQASASVPGAVQPTEIDGRLLVDGGIANNLPVDVVKEMGADIVIAVDIGASLAQSNEFTSTMEVLGQLSTILTNASTQWQKRLLSEKDIYIRPNIDDLSTTDWADLEESLVRGLAAAKIAEPKLQTLAISPSAFEAYQVAKRERSQVWFQPIYNPVLAISLDNLSDVNNKLILKHFSINPGDTPSEHDVQQAVNRVYALNEFERVETEFKDTEQGRVLVLKTIGKSWGPNYLDFGANYQSDVTVNSVLSIDMAFTRGNINRYGGLWRNEISLGFEEYIATEFYQPLNAEDDYYASARLAFIKDRYKLQTVTQAYFDLEKNHLRGNLAFGYNINNFWRFEFGTIRDVGELEQKSLINDKFPFDQTGFYSSLMYDTLDSISFPTQGNRLRVYYGRFKEAYDDRLDSIDSRYASKYKIDWRGAFSIRHHGLVGTASYTSYESDGTFTVNTERLGGFLNLSGYGKNVLNGPKKAFGAVIYQYDLGRDVLNMTSMPLYLGVSAEVGNVWDIQQSVDLSELIFASSLFVGTDTRFGPAAFGFGATDDGESSFFVSLGKHW >NZ_CP023398|2342137:2351658|2344509_2345631_-|WP_100913858.1|DBSCAN-SWA MHNNRFSECDHKYMARAIELAKRGRFTTTPNPNVGCVLVKDNQIIGEGFHIQAGGPHAEVHALKMAGDNAKGATAYVTLEPCSHYGRTPPCAEGLIKAGVSRVVAAMVDPNPEVAGRGLKMLEDAGIETDFGLLNDEAEALNLGFLKRMRTNLPYVRCKLAASLDGKTALENGESKWITSSESRRAVQDLRAESCAVLSGADTVIIDNAKLNVRPDELTDTDAKLSARRPIRVIIDSQNRLTPDLALFQQPSKIIIARTTLDKSHTWPHFVEQVVVEQNNGKVDLSALLSLLAERGVNLVLLEAGATLAGKMFEQQLVDELVLFMAPKLMGHSSKSLINLPTFTTMSQVPELEINTLKQVGQDICLHALVKRN >NZ_CP023398|2342137:2351658|2346155_2347412_-|WP_010561692.1|DBSCAN-SWA MLERSMNISDFDPELFGAMEKEKQRQEEHIELIASENYCSPRVLEAQGSQLTNKYAEGYPYKRYYGGCEHVDVVEQLAIDRANELFGTDYANVQPHAGSQANAAVFQALLQPGDTVLGMSLAHGGHLTHGSHVNFSGKTYNAVQYGLNEETGEIDYAQVEALALEHKPKMIIGGFSAYSGIVDWAKFREIADKVGAYLLVDMAHVAGLVAAGVYPSPVPFAHVVTTTTHKTLAGPRGGLIVSNCGDEEIYKKLNSAVFPGGQGGPLCHVIAAKAVAFKEALEPEFKAYQTQVVKNAQAMVEVFKARGYDVVSGKTDNHLFLLDLINKDITGKDADAALGNAYITVNKNSVPNDPRSPFVTSGLRIGSPAITRRGFKEAEAKELTGWICDVLDNIEDTKVQDEVREKVKAICAKHPVYA >NZ_CP023398|2342137:2351658|2343848_2344508_-|WP_100913857.1|DBSCAN-SWA MFTGIIEATGNVAAIEMIAGDMRVTINSGKLDLADVKLGDSIATNGVCLTVIALTGHGYVADVSKETLSLTNFEHYQVGQTVNLEKALMPQSRLGGHLVSGHIDGIATVTRIETNARATDYWLCAPAQLLKYIPYKGSVCIDGISLTVNEINDNEFKLTIVPHTAGETTISGFKVGSKVNLEVDQLARYLERLMMPQQDAKPTSSLTHELLAQAGFMNR |
8 | Staphylococcus_phage(14.29%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP023400_1 | 22777-23224 | Unclear |
NA
Consensus repeat of NZ_CP023400_1
|
7 spacers
spacers of NZ_CP023400_1
>1.1|22805|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT TGCATGGTCGTTATGTGTTAGTCGTTGTGCAT >1.2|22865|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT AAATAAAAGGCTGGATTGACCCAGCCGGTAAA >1.3|22925|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT AGATACCACTGTTATTCGCAGCCACTTTAAAT >1.4|22985|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT AAGCAAACGAACTTCCAAAAGTACAAACCTTG >1.5|23045|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT CTGCGGGGCTTCTTACCTCTGGCTATTTGTGA >1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT ACTCCGTGTTGAAGTGGTTATGCTATCACACA >1.7|23165|32|NZ_CP023400|CRISPRCasFinder,CRT AGTGTCAGGATGCGTATGATATGCGACGTATT |
Cas14u_CAS-V |
CRISPR arrays and Neighbor proteins around NZ_CP023400_1
The CRISPR arrays of NZ_CP023400_1 >merge|NZ_CP023400|1|22777-23224|PILER-CR,CRISPRCasFinder,CRT GTTCACAGACGAGTGTGTGGCCAGTAAGTGCATGGTCGTTATGTGTTAGTCGTTGTGCATGTTCACAGACGAGTGTGTGGCCAGTAAGAAATAAAAGGCTGGATTGACCCAGCCGGTAAAGTTCACAGACGAGTGTGTGGCCAGTAAGAGATACCACTGTTATTCGCAGCCACTTTAAATGTTCACAGACGAGTGTGTGGCCAGTAAGAAGCAAACGAACTTCCAAAAGTACAAACCTTGGTTCACAGACGAGTGTGTGGCCAGTAAGCTGCGGGGCTTCTTACCTCTGGCTATTTGTGAGTTCACAGACGAGTGTGTGGCCAGTAAGACTCCGTGTTGAAGTGGTTATGCTATCACACAGTTCACAGACGAGTGTGTGGCCAGTAAGAGTGTCAGGATGCGTATGATATGCGACGTATTGTTCACAGACGAGTGTGTGAACAGTCTG >NZ_CP023400|1|1|22777-23164|PILER-CR GTTCACAGACGAGTGTGTGGCCAGTAAG TGCATGGTCGTTATGTGTTAGTCGTTGTGCAT GTTCACAGACGAGTGTGTGGCCAGTAAG AAATAAAAGGCTGGATTGACCCAGCCGGTAAA GTTCACAGACGAGTGTGTGGCCAGTAAG AGATACCACTGTTATTCGCAGCCACTTTAAAT GTTCACAGACGAGTGTGTGGCCAGTAAG AAGCAAACGAACTTCCAAAAGTACAAACCTTG GTTCACAGACGAGTGTGTGGCCAGTAAG CTGCGGGGCTTCTTACCTCTGGCTATTTGTGA GTTCACAGACGAGTGTGTGGCCAGTAAG ACTCCGTGTTGAAGTGGTTATGCTATCACACA GTTCACAGACGAGTGTGTGGCCAGTAAG >NZ_CP023400|1|1|22777-23224|CRISPRCasFinder GTTCACAGACGAGTGTGTGGCCAGTAAG TGCATGGTCGTTATGTGTTAGTCGTTGTGCAT GTTCACAGACGAGTGTGTGGCCAGTAAG AAATAAAAGGCTGGATTGACCCAGCCGGTAAA GTTCACAGACGAGTGTGTGGCCAGTAAG AGATACCACTGTTATTCGCAGCCACTTTAAAT GTTCACAGACGAGTGTGTGGCCAGTAAG AAGCAAACGAACTTCCAAAAGTACAAACCTTG GTTCACAGACGAGTGTGTGGCCAGTAAG CTGCGGGGCTTCTTACCTCTGGCTATTTGTGA GTTCACAGACGAGTGTGTGGCCAGTAAG ACTCCGTGTTGAAGTGGTTATGCTATCACACA GTTCACAGACGAGTGTGTGGCCAGTAAG AGTGTCAGGATGCGTATGATATGCGACGTATT GTTCACAGACGAGTGTGTGAACAGTCTG >NZ_CP023400|1|1|22777-23224|CRT GTTCACAGACGAGTGTGTGGCCAGTAAG TGCATGGTCGTTATGTGTTAGTCGTTGTGCAT GTTCACAGACGAGTGTGTGGCCAGTAAG AAATAAAAGGCTGGATTGACCCAGCCGGTAAA GTTCACAGACGAGTGTGTGGCCAGTAAG AGATACCACTGTTATTCGCAGCCACTTTAAAT GTTCACAGACGAGTGTGTGGCCAGTAAG AAGCAAACGAACTTCCAAAAGTACAAACCTTG GTTCACAGACGAGTGTGTGGCCAGTAAG CTGCGGGGCTTCTTACCTCTGGCTATTTGTGA GTTCACAGACGAGTGTGTGGCCAGTAAG ACTCCGTGTTGAAGTGGTTATGCTATCACACA GTTCACAGACGAGTGTGTGGCCAGTAAG AGTGTCAGGATGCGTATGATATGCGACGTATT GTTCACAGACGAGTGTGTGAACAGTCTG
>NZ_CP023400.1|WP_100915930.1|20664_22308_+|DUF4942-domain-containing-protein MTNSKQLILDIAGSEHDFEFYPTTYEIISTIKSDIENESIKSKNPSVLDVGAGDGRVLNGLTLGKRYAIEKSSPLLNALSKDIFVVGTDFHQQTLIDKKVDVVFSNPPYKEYESFAAKIIKEANAKVVYLVIPTRWNNSNVINEALGVRRAKTEVIDSFDFLEADRAARAKVDVLRIMVGESSWGVISPKVDPFKLWFEENFAIDVNKEKVASSTNNLSKREQVSNELKNELVCGKDVVSTLESLYQRDLDKLITTYKSLELVDEEILKELNVNLEGLCEALAQRISGLKDLYWKELFSNLSKVTDRLASSSRKKMLDTLFEHTQIDFTKANAYAVLTWVCKNANEYYDSQLIELMEDMTAQANVLLYKSNERVFRDEDWRYTRRPNGLNKYSLEYRVVISNMGGIDTGYWGDKNEGIKERAKDFLNDICTIARNIGFDTTDCERVDSFNWVSGQKKLFTYRDHVSGLNLTLFEAKAFKNGNLHIKFNETFMAKLNVEFGRLKGWLKSADEATNEMNIDTELAISSFNTNLQLPMDGSSLLTFKTAA >NZ_CP023400.1|WP_100915929.1|19939_20557_-|hypothetical-protein MKKITLWFIGILFVACALGGISFYFYNAHIQQQNAAAEKKRAEDIKRYLEELRTYSGIHEECNSVFSTNKVANFAASKFNTDKIEISYPTTADRLKGDGFKTCIALVTEVVNDPTTSPQRQYAIYILNDQTFKVALVDEISKPLNVNNREIRIDRGKSVFDTAFESANDSIFESDSLNYEDSFYIIRNSTKYVSVGIEYEESPKN >NZ_CP023400.1|WP_100915928.1|18687_19938_+|hypothetical-protein MSVQLILARNNLGTLVHIDDVPSGLACECYCPDCGARLEARKGPVNRHHFSHDQRDAKFQNCSYGPETELHLALKRLLQQKMKAIIPTAYSSNKTVVINLDSSQLETCYPSSAYRCDVETVFRGEPIYFEIKVTHATGKDKLDFIKVNNFNVVEVSTPSDANINNETLEEIFQRSEFKWLSLDIFSAIGQSVFANEKSLQQSFIDENKRLEKIIEDQKWQIAGNDRRLSQQAPSIESVEKNEHDLTVKERGLRRQVARLMSQKREIQNQIQQKNDELDFLLQRGFEITEAEKQKTESNYLKSLAAKHSQTIEKNEKVMALLCEEIDALRSKKAELVVTVKSKQLEVSALDNKEEELIAYESKLQQLEEEIRRKTVAVNKAIRILKDLETNLKPTLSNIGTPWPISNSVYEEIVIEV >NZ_CP023400.1|WP_100915927.1|18334_18625_+|hypothetical-protein MKNNVVLITTSNSDLALYLNQDLICYFDSNFDGIAIKQVVEATAENLSNSLNVDLVKKPLHITKCDEDWVNGEDVNSLLSRIEKTTDSKCLIKFAS >NZ_CP023400.1|WP_100915926.1|17953_18175_-|DUF2375-domain-containing-protein MANSSKEYTVLFTRDCSIVVEHEVYFLEVKGSGRPIIPSSIKKNLSILAVIEGAPNIVNKLGDRELEIFRAVS >NZ_CP023400.1|WP_100915924.1|17178_17598_-|translesion-error-prone-DNA-polymerase-V-autoproteolytic-subunit MLSPILPSTTKIELCEFKPAGGFPNPCEDYLSKPISLDELLINRPSSTFMFKVAGNSMFPTIPEDSMIVVDRSVEIQSGKIVLATVDGQFVVKELQRTNGKIIFHSHNSDYEDIEINSTEENLQDGVVWGVVTSVIKKL >NZ_CP023400.1|WP_100915923.1|15901_17179_-|Y-family-DNA-polymerase MLWVLCDINAAYVSFCQLFNPQFNLEEVPLGVLSSNQGNIVARNQPMKDLGVKMGEAAFKIKNDVYKNQGHLWGSNFEFFGDMSNRFHTELEYFLIEPERYSVDEAFGRIDTSYTKDVKAYAQKIQSTLKRNLGLGCGIGISHTRTLAKAASHCAKIKQWKHKTNGVVFLDTEEQINYALERMQTTDVWGVGRKIGAKLSESGIHNALQLKQACPIEMKSKFNVVLARTIQELNKIPSIDLKDPMMAREQICVSRSMGKPVTQFNELKESIATHMAIASKKARQQLMFVSQIKVFISTNPFSDTKQYSKSIGIDLPYPTSNTLDLTNYALYALKHLYKAGYEYKKSGVILSKLVLSNQIQTNLFEENSDDQVKLKQTEVMDSINDKFGKGAIRIASAGYSHNWKPKDDLAPPSYTTRLDELVVVN >NZ_CP023400.1|WP_100915921.1|14442_15168_-|hypothetical-protein MKKLSILAASLSVLLSSQIYAGQQLSQSEIKQSLSQKGDIEGIQELPVKELMLVKTKAGETYFISKNGRFVFQGTLIDTWFRKTIDDLSEARTAHRVPLEKMGVNKEDLAILTFGNQDIPKQASIFVDPNCGVCTALFKMVSQSPEKYHFDFILAPIFGERSKTAINKLHCAADENKAVMDLLHKTSLVTKMKPQCDTSKINNSLIVMNLLEGKGTPFFVRADGLTLNGIPNDLDKWLEVE >NZ_CP023400.1|WP_157813620.1|13749_14439_-|TraV-family-lipoprotein MKKQLSILALLSMTSGCSVFSVGEEEFQCTGKPKNSLCKGPMEVYELTNNQDHLEHLMNDSQDESESTTQKPVQFIKDPQNNITYIYEGRTLRRQTSEDYDEAKIVAIASDNDYSQSQHNDELEDLRTFNYVPQDIAPEPLAVLEEAQAMRIYVSAWEDKEGDLNIPGYVYVELKPRRWVAGHQADMRPSRVVPFQVIQKSKTTQKRKSQMSKGVDPLNINRTQIPKQQ >NZ_CP023400.1|WP_100915920.1|11124_13737_-|TraC-family-protein MNISKFMSNLFNGSNESHNETSGPLWNDLLNKQKGLIDGRRINELNPVLEYIEEEKLFIMDGGYIGFVIACQPTNGVNNQIRTAYQGLLSEKYPDDTLIQLSLAALPRIDNFLRGYMVKRNNRMPGSDSELSDAMSQKVVEYYSKARKESLDQRLGIKPRDFEHWVTFKFPIKEQLPNQKELDSFREIKRKTLSTFEDLGMAPMLLNDSEWLSRMQTVFNHSENSSWRDKVKVDRKGILGRQILEPGSMVVREEKGISFSSGFKDSNGFAAAISLKQTPENLGYGDLINLLGDWQYGKRTASWNPFIMTLNVQVPNQAKAKEEYATRFNWLKKQAKGKMLEVVHTLKDQYHDYKEVEQEIRNAELCNVSLSMIVLGENKEQTLDATDKLVQSFKRQQYKFNIESVLAPSMILNSLPLGLDTEFIKLSDRFEEWSSTGAAFLIPHSASWKGNTAFPVLEFVTRFGQIFGLDLFQTDGGFNALICAATGKGKSFVTNKIISSYLGSGVANGYANSPDGAQIFVIDKGKSYENLTHTYSNSQFIEFNEKMDFSLDPFAQINDFYGKDGQAAMVHALLKAMASESGDLSDYQSTEMLTILTELWDEKGQDASVTDFAKRCENHPEKDIHRIGRQLRPWCEGQVYGDFFGKRFKPVNFNARLIVCEMDELSSNPHLAQVVLMSIINAAQHAMFLSGPDRRKLFILDEAWEYLKDRAGSKVNYLAEFLETGWRRFRKTRAAGICITQSLLDAYQSQAGIAIVNNSPWKLMLAQEPETIDKLKEMKAYDGTEQDYTLMKSVHTRKGYYSELFVRYGEQKEIVKFFVDQKTQLVYSTDPDDRALIAKYKDQGYDINEAIDKAYEEKQKLKILAEGKVA >NZ_CP023400.1|WP_100915931.1|23282_24749_+|IS200/IS605-family-element-transposase-accessory-protein-TnpB MTEKTQLTSVFKTYQFEIHNLSQSKRKKLLQTFKQSDMLYYKALAKCEEDAKQLLALETKKERKSALGIIQKKLQDIVKPLPFGSALKASIIETVKAQISSFVELTLSGQDATYPTKENQEVDYHYWLNVLLQSTDKATEDIARDELSKRDRGLYRPLTFEKYRTTDGFMVLSDDKGRLFAFLNLWSAKDKRASALALEMHDTRTGEAFKTKTSTGMLLPLSFSEFQRNALANGEAKTAKLVMRGERLFLMVSIKFEVPKRNPVYVMGIDRGIAELATYTVRDPETGKLIDSGTFSGSTLKRHQQEHEAKQKADQKIGRAFIRGWSNYTTNLMHHVANEVVKVADKYSCQVVIEDLSNIKNNPKMKRPKFARKNNFRRMLSRQQYGRLESMLSYKLQSVGLPEPALVWASYTSQTCPECGHSCKENRITRDAFQCQSCGFEQHADIVGALNIAGSYICFEKIKHKLKKGKPRTEEFRYQNWLVDNLEI >NZ_CP023400.1|WP_157813621.1|24755_25310_-|hypothetical-protein MARTVLNLSALLITTLLVHAIFQGDATPEYTAILILLSPIPLYVALQYIKLVIYTHMAKRSLIVELSKNNIHIRKIGHDIDVNVHSKFSNRCNLIVSHTDLLAGFKQAFHQINEITSEHWFHMRPIVIVRLPEKQLSRIELKYINDTAFEAGAFDVLQAKSAESESSIRRKLKNHINQMCVYGK >NZ_CP023400.1|WP_100915933.1|25315_25672_-|hypothetical-protein MRNIVLIFAGVAFLLWLSNFDNNAITENLQVYSTGLGRAVTLLVSLFISLSFYNSLCKYHDTKSTEEKPQNLLHHPVLRITLIHIALYVALMSITPTFTSWISDSGDSVANFIIKAKG >NZ_CP023400.1|WP_100915934.1|25796_26114_-|hypothetical-protein MKADNFVLVTAMQTSANNPKFNDFLGGIELLDTVFQQRINLALPVEEHTDDLQEALTYLRQHGNQVAALASKYQKLALSNQTPPMLQKAYVEIATKLFDLTKVNP >NZ_CP023400.1|WP_100915935.1|26188_26599_-|hypothetical-protein MKILREITKIVIAASIGFIVNMYYTDAQIERYSSDTVSYLSSIPKIEVVDIEAITKTMLEEETPPAKVAEYVELLVRVKEAKGVLVIDTKSVITTPTLSVAKVLSYDALKDEAAKHGIKPELDYESLLEDFNKQFE >NZ_CP023400.1|WP_100915936.1|26600_28004_-|hypothetical-protein MSFNLKEKWADPNFRKFVVVCLIGLFAIPLYHTMSNRKEFKPLEERIAERKAEDRVDDKGRVKGLFDTNEISNLDQDEFNEIYKESADALQVRERRIYQEKEKVLAMVEQLQKDLDAQSLELKNLKRQQEINAKAKAPRTPANRQQSNGQRKNTKRNSQPDNMTAYNNAIETVEQVDQRVFDRTPATFVTAKPQIEGKVIRTITQRSVRSIKKTGEIEEKSHDEFLITQKEQTPKEAVKQPKNTKDDGGLVDANEGTVYLPAGSFFSGVLLTGLDAPTQLAAKKSPMPVIIRVKKEAILPNHFTLDIRECHVIGQAIGDLSSERAHIRAQSITCVREDGRSVESAIQANAVSDYDGKLGIAGRLVSKNGNLLAGSMAAGFMSGISQAVAPRRSLSVNTEPGDQDLWQTVDYGSVAAAGVFQGASNAMDRLAEYYIELAEQIHPVIEISPGRSISFAVISGAKLKLGD >NZ_CP023400.1|WP_157813622.1|28013_29144_-|hypothetical-protein MKTKLIVLSLCCGLSTLATAEESSESPAKDFKLPTQLINPEPQPVSKQDVFDSAVFNSQFSANAAAMKTLPTDKEVRHIAKTTETGVLDESDFPGYVDNSPTVKSNTPYSFLEIVKSKYKPTQEFKDLEPGSNIAIPVAVGLTNPIITNFKMIAVRTHDEESVMELEDGRLYITINSLKPVALMLFEEGVPESMVNVTLVPMEAMPVIVNLDINLSKEMKYKGSKHRIDLKEQEAIAKAEQSEIQPNFKTDTARISRIKQILKPVGRGEIPRGFTLSHDVSLAMKTPCGMTIEQRVMQRIVGTKEVVDVVRVKNNTTRPYSMREQMCESRNVMAVAIYNKSLLQPGEATEVYILRDKSIGSSRTIKNNSRPRVVNM >NZ_CP023400.1|WP_100915938.1|29133_29865_-|hypothetical-protein MMFKDSQVGKSWSAAVASGNLQLVGNFCLSIGVVVLIMMLFFKEAEIIAVPEQQIFGELSVQGKKANRNYQQSWALSTAQLIGNITPENVDFVKSSLMNVLSPLLKFQIEPMLEKQSEVIRMRGIKQVFIAEDLIHEPSTDLITIWGQKLTFIDGTPKSDSKWSYEFKIEVRHGRPRITHIDQYPGTPRKRKQAMEARNQASPENLSSQPYYDSELEIAVLEAAQDLKEKEAKATNEGETNEN >NZ_CP023400.1|WP_100915939.1|29878_30175_-|hypothetical-protein MSNPNRYRAKRHINDPLLILFVFTKEQVIPLFLAVAIGMVIKKTFLLIVLASIYIYITQKLKSRFPKSYLRHKAWSKGLIITNPSKSIVDPIQKTYFR >NZ_CP023400.1|WP_100915940.1|30187_30541_-|hypothetical-protein MTTLAKKSAGHTQLSNKQKQAATFALMFISLGAIAAAPTAGDFLYGIYAEVIGWLTGAPGIIISLFTFGVAAFQGIMKQNYINAGGAFIIAMMFANAQDAIEYFLTAGLPVASGLPI |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP023400_1 | 1.1|22805|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 22805-22836 | 32 | NZ_CP023400 | Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence | 22805-22836 | 0 | 1.0 |
NZ_CP023400_1 | 1.2|22865|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 22865-22896 | 32 | NZ_CP023400 | Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence | 22865-22896 | 0 | 1.0 |
NZ_CP023400_1 | 1.3|22925|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 22925-22956 | 32 | NZ_CP023400 | Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence | 22925-22956 | 0 | 1.0 |
NZ_CP023400_1 | 1.4|22985|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 22985-23016 | 32 | NZ_CP023400 | Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence | 22985-23016 | 0 | 1.0 |
NZ_CP023400_1 | 1.5|23045|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 23045-23076 | 32 | NZ_CP023400 | Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence | 23045-23076 | 0 | 1.0 |
NZ_CP023400_1 | 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 23105-23136 | 32 | NZ_CP023400 | Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence | 23105-23136 | 0 | 1.0 |
NZ_CP023400_1 | 1.7|23165|32|NZ_CP023400|CRISPRCasFinder,CRT | 23165-23196 | 32 | NZ_CP023400 | Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence | 23165-23196 | 0 | 1.0 |
NZ_CP023400_1 | 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 23105-23136 | 32 | KC821610 | Cellulophaga phage phi3ST:2, complete genome | 8025-8056 | 6 | 0.812 |
NZ_CP023400_1 | 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 23105-23136 | 32 | KC821634 | Cellulophaga phage phi47:1, complete genome | 8025-8056 | 6 | 0.812 |
NZ_CP023400_1 | 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 23105-23136 | 32 | KC821629 | Cellulophaga phage phi38:2, complete genome | 8025-8056 | 6 | 0.812 |
NZ_CP023400_1 | 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 23105-23136 | 32 | NC_020860 | Cellulophaga phage phiSM genomic sequence | 8080-8111 | 6 | 0.812 |
NZ_CP023400_1 | 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 23105-23136 | 32 | KC821630 | Cellulophaga phage phi3:1, complete genome | 8025-8056 | 6 | 0.812 |
NZ_CP023400_1 | 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 23105-23136 | 32 | KC821616 | Cellulophaga phage phiSM, complete genome | 8025-8056 | 6 | 0.812 |
NZ_CP023400_1 | 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 23105-23136 | 32 | HQ670749 | Cellulophaga phage phi47:1, *** SEQUENCING IN PROGRESS ***, 6 unordered pieces | 39998-40029 | 6 | 0.812 |
NZ_CP023400_1 | 1.2|22865|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 22865-22896 | 32 | NZ_CP024012 | Acinetobacter sp. LoGeW2-3 plasmid unnamed1, complete sequence | 33118-33149 | 8 | 0.75 |
NZ_CP023400_1 | 1.2|22865|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 22865-22896 | 32 | NZ_HG938356 | Neorhizobium galegae bv. officinalis bv. officinalis str. HAMBI 1141 plasmid pHAMBI1141a, complete sequence | 286251-286282 | 9 | 0.719 |
NZ_CP023400_1 | 1.2|22865|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT | 22865-22896 | 32 | NC_017957 | Tistrella mobilis KA081020-065 plasmid pTM1, complete sequence | 35565-35596 | 11 | 0.656 |
1. spacer 1.1|22805|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP023400 (Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence) position: , mismatch: 0, identity: 1.0
tgcatggtcgttatgtgttagtcgttgtgcat CRISPR spacer tgcatggtcgttatgtgttagtcgttgtgcat Protospacer ********************************
2. spacer 1.2|22865|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP023400 (Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence) position: , mismatch: 0, identity: 1.0
aaataaaaggctggattgacccagccggtaaa CRISPR spacer aaataaaaggctggattgacccagccggtaaa Protospacer ********************************
3. spacer 1.3|22925|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP023400 (Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence) position: , mismatch: 0, identity: 1.0
agataccactgttattcgcagccactttaaat CRISPR spacer agataccactgttattcgcagccactttaaat Protospacer ********************************
4. spacer 1.4|22985|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP023400 (Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence) position: , mismatch: 0, identity: 1.0
aagcaaacgaacttccaaaagtacaaaccttg CRISPR spacer aagcaaacgaacttccaaaagtacaaaccttg Protospacer ********************************
5. spacer 1.5|23045|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP023400 (Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence) position: , mismatch: 0, identity: 1.0
ctgcggggcttcttacctctggctatttgtga CRISPR spacer ctgcggggcttcttacctctggctatttgtga Protospacer ********************************
6. spacer 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP023400 (Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence) position: , mismatch: 0, identity: 1.0
actccgtgttgaagtggttatgctatcacaca CRISPR spacer actccgtgttgaagtggttatgctatcacaca Protospacer ********************************
7. spacer 1.7|23165|32|NZ_CP023400|CRISPRCasFinder,CRT matches to NZ_CP023400 (Pseudoalteromonas spongiae strain SAO4-4 plasmid pl, complete sequence) position: , mismatch: 0, identity: 1.0
agtgtcaggatgcgtatgatatgcgacgtatt CRISPR spacer agtgtcaggatgcgtatgatatgcgacgtatt Protospacer ********************************
8. spacer 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to KC821610 (Cellulophaga phage phi3ST:2, complete genome) position: , mismatch: 6, identity: 0.812
-actccgtgttgaagtggttatgctatcacaca CRISPR spacer tgctcc-tgttgaagctgttatgctatcaccaa Protospacer .**** ********. ************* *
9. spacer 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to KC821634 (Cellulophaga phage phi47:1, complete genome) position: , mismatch: 6, identity: 0.812
-actccgtgttgaagtggttatgctatcacaca CRISPR spacer tgctcc-tgttgaagctgttatgctatcaccaa Protospacer .**** ********. ************* *
10. spacer 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to KC821629 (Cellulophaga phage phi38:2, complete genome) position: , mismatch: 6, identity: 0.812
-actccgtgttgaagtggttatgctatcacaca CRISPR spacer tgctcc-tgttgaagctgttatgctatcaccaa Protospacer .**** ********. ************* *
11. spacer 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to NC_020860 (Cellulophaga phage phiSM genomic sequence) position: , mismatch: 6, identity: 0.812
-actccgtgttgaagtggttatgctatcacaca CRISPR spacer tgctcc-tgttgaagctgttatgctatcaccaa Protospacer .**** ********. ************* *
12. spacer 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to KC821630 (Cellulophaga phage phi3:1, complete genome) position: , mismatch: 6, identity: 0.812
-actccgtgttgaagtggttatgctatcacaca CRISPR spacer tgctcc-tgttgaagctgttatgctatcaccaa Protospacer .**** ********. ************* *
13. spacer 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to KC821616 (Cellulophaga phage phiSM, complete genome) position: , mismatch: 6, identity: 0.812
-actccgtgttgaagtggttatgctatcacaca CRISPR spacer tgctcc-tgttgaagctgttatgctatcaccaa Protospacer .**** ********. ************* *
14. spacer 1.6|23105|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to HQ670749 (Cellulophaga phage phi47:1, *** SEQUENCING IN PROGRESS ***, 6 unordered pieces) position: , mismatch: 6, identity: 0.812
-actccgtgttgaagtggttatgctatcacaca CRISPR spacer tgctcc-tgttgaagctgttatgctatcaccaa Protospacer .**** ********. ************* *
15. spacer 1.2|22865|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP024012 (Acinetobacter sp. LoGeW2-3 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
aaataaaaggctggattgacccagccggtaaa CRISPR spacer ataataaaggcgggatagacccagccggaatt Protospacer * * ****** **** *********** *
16. spacer 1.2|22865|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to NZ_HG938356 (Neorhizobium galegae bv. officinalis bv. officinalis str. HAMBI 1141 plasmid pHAMBI1141a, complete sequence) position: , mismatch: 9, identity: 0.719
aaataaaaggctggattgacccagccggtaaa CRISPR spacer gagaaaaaggccggattgtcccagccgagatg Protospacer .*. *******.****** ********. * .
17. spacer 1.2|22865|32|NZ_CP023400|PILER-CR,CRISPRCasFinder,CRT matches to NC_017957 (Tistrella mobilis KA081020-065 plasmid pTM1, complete sequence) position: , mismatch: 11, identity: 0.656
aaataaaaggctggattgacccagccggtaaa CRISPR spacer tcgtaaaaggccggattgccccagccctcgtc Protospacer .********.****** ******* ..
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP023399_1 | 47255-47498 | Orphan |
NA
Consensus repeat of NZ_CP023399_1
|
4 spacers
spacers of NZ_CP023399_1
>1.1|47279|27|NZ_CP023399|CRISPRCasFinder AAAGCCGACTTTTAACTAAAAATAGCG >1.2|47330|29|NZ_CP023399|CRISPRCasFinder TAAAAACACAAAAATGTCGTTTTATTGAT >1.3|47383|27|NZ_CP023399|CRISPRCasFinder TTAAGGAATTTGATCATTAATTTACTA >1.4|47434|40|NZ_CP023399|CRISPRCasFinder AAATGCGCCGCAACTTAGTAAAAATATGATCTCTAAAGTC |
DEDDh |
CRISPR arrays and Neighbor proteins around NZ_CP023399_1
The CRISPR arrays of NZ_CP023399_1 >merge|NZ_CP023399|1|47255-47498|CRISPRCasFinder TTTAGTTAGTAAAATTTTGATCTCAAAGCCGACTTTTAACTAAAAATAGCGTTTAGTTAGTAAAATTATGATCTCTAAAAACACAAAAATGTCGTTTTATTGATTTTAGTTAGTAAAAAACTGATCTATTAAGGAATTTGATCATTAATTTACTAATCAGCTAGTAAAATAATGATCCCAAATGCGCCGCAACTTAGTAAAAATATGATCTCTAAAGTCAACAGTTAGTAAAATTTTGATCTCT >NZ_CP023399|1|1|47255-47498|CRISPRCasFinder TTTAGTTAGTAAAATTTTGATCTC AAAGCCGACTTTTAACTAAAAATAGCG TTTAGTTAGTAAAATTATGATCTC TAAAAACACAAAAATGTCGTTTTATTGAT TTTAGTTAGTAAAAAACTGATCTA TTAAGGAATTTGATCATTAATTTACTA ATCAGCTAGTAAAATAATGATCCC AAATGCGCCGCAACTTAGTAAAAATATGATCTCTAAAGTC AACAGTTAGTAAAATTTTGATCTCT
>NZ_CP023399.1|WP_157813528.1|46396_47041_+|hypothetical-protein MYNLLMDKASLSQVFLAFAPMEALKVKHNIDYALEALSLPSAELLAEMQVLGIDLPDKPSASKKVDLTTINSTVLSSHLTELFLDKHKLLKTFSDFSLSEQVELRAFLQSLMPMLHLPPQALVEKMQREGVLDVLMGESDKAKAVKDKIAALSQNKRDHKVDEVKERLAKLSNKPQQCADELKIARVKANLNKASVSDNAAIIADAKQKIVQLG >NZ_CP023399.1|WP_100914609.1|44599_46279_+|OmpA-family-protein MEQEKQQLIEQIISTAKREFDIDLAKEKLISEEEYRALVFTELLSLENAELSRLIEQFNASPSHQAHASQAEPISQPKQRNNGVIIAASVAVLALCAVGAWFGLSGDTTAQSGQVAAVPNKVISQKSAQMSKPTPVEKEPEAVYQSLFAIHGSNTIGEKLAPALLKAYFEEQGAKDIDWQQGRIATERTMQFEMDGEKLEIGLAAHGSSTAFKALNANMAQIGMSSRKIKTSEVEALKGQSGDLSKLGNEHIIALDGLAIIVNQNNPLKTITTETLSRIFSGEIDNWAEIGGDQAPIKVFARDDNSGTFDTFKSLVLKKYGRKLTKQAERFESSSELSEHVSRDDYAIGFIGLNYIRYSKALAIADSSDTKAIYPTRFTIATEDYPLARRLYLYTPTNSATQIKDFANFAISDRGQEIVQELGLISQSIRVEDVVASELAPDKYNQYAKLAKRLSLNFRFNYATRDLDNKGKRDLQRLVSFVEENPSRKLVLMGFSDSIGARDKNTMLSLSRAKSVEQELNARGINVYAVEGFGEDMPLANNDNEVGRERNRRVEVWVL >NZ_CP023399.1|WP_100914608.1|44111_44450_+|DUF3718-domain-containing-protein MKALATTAIAASLLMLPSAAIAFDKSTIESLLVQACKDVKSNKVLKLKTNLKTNHLTLPLISEKLMCNGESVYDFAMTHNADKTAKLLRTGSVSIHDVAYNSSDKIWVWLDD >NZ_CP023399.1|WP_010558407.1|43194_43896_+|purine-nucleoside-phosphorylase MSTPHINAQLNDFAETVLMPGDPLRAKHIAETFLEDAREVTSVRNILGYTGTYNGKPVSIMASGMGIPSMSIYARELIVSYGVKNLIRIGTCGGIGSDIKIRDVIFAQGASTDSNVNRARVQGYDFAAIANFDLLVNGVEAAKRLGIAAKVGNVFTTDTFYQASNDLYQKLDKLGVLAVDMETAGLYGVAAEYGANAMALFTVSDHVITGEATPSEERQNSFNEMVKIALESI >NZ_CP023399.1|WP_100914607.1|41965_43177_+|phosphopentomutase MARALILMADSLGIGAAPDAERFGDKGANTLAHLLAAYHGETGQKLALPNLSKLGLIDACEQASNTNCQVAERNAPIAAYGYAKEISSGKDTPSGHWEMAGVPVLFDWGYFHDKQNSFPDAFIAEFIKRANLPGILGNCHSSGTVILEALGEEHMQTGKPICYTSTDSVFQIACHEDTFGLERLYEICEIARELLDEYNIGRVIARPFLGDAADNFARTGNRRDYSVLPPAPTLLDKLAEDNGAVISIGKISDIYAHQGITEKHKAPGLENLMAKTAEVFKQAKDHSLTFVNLVDFDEKFGHRRDAIGYAKALKQLDDYLPEFLALLGKDDVLMITADHGCDPTWQGTDHTREYVPVLAYTPGMTPVNLGERETFADIGQTLATWFSLAPLEYGNGFKSQLGS >NZ_CP023399.1|WP_040642196.1|41029_41656_-|uridine-kinase MTRTIIAIAGASASGKSLFSQTIYNELLNELAPGAIAIIEEDAYYRDQSHLPFEHRTQTNYDHPDAFEHELLLEHLNQLKEGKPVDIPVYDYAKHTRSDQTRRIQPAKILIVEGILLLSDPKLCDEFNIKVFIDTPLDICLMRRMQRDLEERGRSLQSVIEQYQATVRPMFYQFIDPSKHNADVVVTRGGRNRVAIDILKSKIKQLLQ >NZ_CP023399.1|WP_100914606.1|39778_41008_-|NupC/NupG-family-nucleoside-CNT-transporter MTSLMSLVGIASLLALAFLCSTNRKKINLRTVGGAFALQVLIGAFVLYVPAGREVLNGISVGVANVISYANDGIRFLFGGLAGDEIDGIGFVFAIKVLPVIVFFSALVAVLYHLRVMDFVIKILGGGLQKLLGTSKPESLSATANIFVGQTEAPLVVKPFIATMTRSELFAVMVGGLATVAGSILAGYVSLGVELKYLIAASFMAAPGGFLMAKMMVPETETPKQDLNDLDHGEEKPINVIDAAAGGALNGMQLAMNVGAMLLAFVALIALSNGFVGWIGGWFDNPDLTIQQILGYAFAPIAWLLGIPWSEAQLAGSFIGQKLVVNEFVAYLDFMNYQAQLSEHTKAVVTFALCGFANLSSIAILLGGIGGMAPSRRKDIAQLGLRAVLAGSMANLMSAAIAGFFLSIA >NZ_CP023399.1|WP_100914605.1|38509_39232_+|ABC-transporter-substrate-binding-protein MLKKLLTASIITLSFSASAGAIRFGYNQQYSAPHVINNGKGTNVSGIVIDVSNAISQEAGFMAKQLPLPRKRIEQYLIDGKIDAQCHANPIWYNAPSIIWSEVLYSDADIIVSDQAIASLQALSSHKQFKLGTVLGYKYPNLTEYFEAGNLKRFNSTSSKDSLTRFTKGELDGFVASYSEANYLTRLRRFNVLEVNSYDLHCSFSPKLNAEKRQRLIDAAHKLRDNGEFKRVFAKYIKPE >NZ_CP023399.1|WP_100914604.1|36963_38445_+|VCBS-repeat-containing-protein MKIITALVVTLGFASTSSLANSKKNLFNEVAIETDLTLSHPVYPIDLLPNPGKELMLIGTHSGQQYIDIFAASEAQLYNRIKRVKVPDTMLGIDFTKWQDATQQVYLFSSNGIYQLALEGTELVVPVVDTQTYLKKSGAQHLSKMEFVYNINDDQKADFFTTDLNHSYVHISDETGYQMAKVDLPARLEIEGHSAEIEPPRFAFFDTNEDNISELIGFENGLLNLLPSVGSNPGQAIKLRDDIMVDDWWHTKDADGDSLDQSNLVYRKLERVVDVNHDGIFDLVVRYTRSSGALDRQNDYEIYLGAIEQGKFIFQNQTNSLIQDEGTLTDLHLLDIDNDKQLEALVSGFDIGVSQIIGALLSGSVSQDVHLFYQNEKGEFSPKNKITREVELSFSLSKGRSGSPVVLFADVNGDGKKDWVLSDEQKAFNVYLANNKHSFSRRASKYKTTLPMQGRRVVTSDLNLDGKDDLVMSYGRLDDKNMRKTIKILFATS >NZ_CP023399.1|WP_100914603.1|34650_36873_-|hypothetical-protein MIDKSNVATQNISYANVCRSDVMKELPEKYYLTHFIEFAEFIKSTSSHLLGDDETAFFAEFAKLPEDAQCILVRIVNRKSPFVRKDTLVYSEVADSGAAINLLRKVGFVSRISKHCFHDFIGQLNKQDLIDLSISLTIDKPAKSANKNKWLNFVSNALSNALSYDQVKKSPVLTSHIYFAKQDTFHYLLFLYFGHLSGRLNQFSMRDLGVMQTQSVQQQQANFSDKDEAKSAFYYANLFKAIKEANKSDIPLLQTLAEQLIASDKPIGFLATQKYHHSLYKLAIVLIEQESTLGETLLSQSQHPAAREKYIRLLYKSDRKAQCEALLLAILDDPDSETLLLFAEDFYAQKFNQKRTSVLTDMLKKSGEPIGIDEAYLGQVETGVKDRYIKQHRLCYFTENKLWRALFALTFWHELFQHPASQRANEFSYYPKVLKQDNFYDLLHTEIDHKLSEFTNSQQFCDYISATIDQYSGEPNGLFYWHNELKDVLTTFIQHAPFESIKHHLLAMSKTFKALCDGYPDLMYIEQGEVFFEEIKAPGDSLRRNQLVSIKQLINAGFNVSIQRTEWRFNSEQNYVVVDIETTGGKKDTHRITEIGMVRVEQGKITDSWQTLVNPERHIPKMITELTGISNEMVKDAPKFEQIAEKLDTFSRNAIFVAHNVNFDYGFIRQEFARLNKKYTRAKICTVQQARKYLPGFKSYSLGKLCCDLNIELKNHHRALDDAKAAAEILLRINHVRAEQ >NZ_CP023399.1|WP_100914611.1|47960_49568_-|isocitrate-lyase MAKQLTYQQEIDRIATLKEVGEKKWQNINPESAARMRLQNRFQNGLEIARYTADIMRKDMEAYDQDPSQYTQSLGCWHGFIGQQKMISIKKHFQNNTDRRYLYLSGWMVAALRSEFGPLPDQSMHEKTSVAALIEELYTFLRQADARELGGYFRELDAARKAGNSAEEETILAKINDHKTHVVPIIADIDAGFGNEEATYLLAKKMIEAGACCIQIENQVSDEKQCGHQDGKVTVPHEDFLAKINAVRYAFLELGVDNGVIVARTDSLGAGLTKQIAVSNEPGDLGDQYNAFLDVEEIESTDIKNGDVILNRNGKLVRPKRLPSNLFQFRSGTGEERCVLDCITSLQNGADLLWIETEKPHVGQIGAMVNEIRKVVPNAKLVYNNSPSFNWTLNFRQQVFDIMVEEGKDVSQYNREQLMSVEYDNTELAALADEKIRTFQADAAREAGIFHHLITLPTYHTAALSTDNLAKEYFGEQGMLGYVKNVQRQEIRQGIACVKHQNMAGSDMGDDHKEYFSGDAALKAAGEDNTMNQFS >NZ_CP023399.1|WP_100914612.1|49821_51990_+|malate-synthase-G MTTRIHHAGLAVDPSLYAFINHQVIPGTGIDVEHFWQGFSKIVSEMTPINRALLVKRDDLQVQIDQYHQAHSNFEFDHYKQFLQQIGYLKPAPSDVTVVTESVEPEIATMAGPQLVVPVSNARFALNAANARWGSLYDALYGTDAISYDNGREAGRDYNPVRGNAVIDFGREFLDQALPLTYGSHKDASDYAIVDGKLIVTLNNVSQSALQSPEQLVGYNGTKQQPTELILKNNGLHIVIQFDENHQIGKQDSAHIKDIVLESALTTIMDCEDSVAAVDAEDKVGVYSNWLGLNKGDLSVEFEKGGKTLTRALNPDLTYTDLSGDEQQLKGRSLMLVRNVGHLMTNPAILDKNGDEVFEGIMDAVITSLIALHDLKGKSTRKNSRANSINIVKPKMHGPEEVAYANTLFCKVESLLNLPENTLKMGIMDEERRTSVNLAACIAEAKERVIFINTGFLDRTGDEIHTSMLAGPMLPKDAIKAQKWIGAYENNNVDVGLACGLSGKAQIGKGMWPMPDEMAQMLVQKTAHVQSGANTAWVPSPTAATLHAMHYHQQDVFSLQQEIKQRVAANVDDILCPPLHPEPKSMTSDAIQNELNNNAQGILGYMVRWVEQGVGCSKVPDINNVGLMEDRATLRISSQHMTNWLCHGVCSEKQVIDTLEAMAKVVDKQNEGDANYRNMSDNYAQSVAFQAALELVLSGTKQPNGYTEPVLHRRRLEFKAAS >NZ_CP023399.1|WP_010558401.1|52283_53006_-|YebC/PmpR-family-DNA-binding-transcriptional-regulator MGRAFEVRKNAMAKTAAAKTKVNSKYGKEIYVAAKNGGAELDTNLSLKRLIEKAKKDQVPAHVIDKAIEKAKGGAGEDYQPARYEGFGPGNCMVIIDCLTDNPNRTIKDVRLAFTKTNSKIGGPGTAAHSFDHQAVFAFAGDNDEEVLEALMMADVDVTDVEVEDGIVSVFAPHTEFYNVKTALAEAFPDVKLDVEEITFVPQTSVDIEGDDIPLFQKFMDMLNDSDDVQNIYHNAIVQD >NZ_CP023399.1|WP_100914613.1|53277_54102_-|autotransporter-domain-containing-protein MLGITTGANAVEIKVGQSHSDLPLDQSGTLTSVTPSGTSFSLAHDFSESVRVSLDYINWEDTHFSRKANSLKIDSQSYAATLTYFIENFAISGNYTYWQSDYRESFLELPSSQHKTYAPSYGLSVGYGIFFDEWVIEPSLLVQYNEWRYRDIMIEQNEMALALRDALEDETLVVSGLLSASKIIEIRPDTYLLAGGVIRWNELFQGDGTKQTDKFVSFAGRKSYPHNTDDDYAELSVFVTYDITPEWMVELDSSIAFLPHDNVSSVSWRIGYRF >NZ_CP023399.1|WP_010558399.1|54566_54800_+|hypothetical-protein MDFYILKKGDELKAIPADKAICKSYEKSGYSYIDKVAACTEQAAINKLKGKTFETLKGPLLRLSVVIFALILLWWVS >NZ_CP023399.1|WP_100914614.1|54823_55423_-|lytic-transglycosylase-domain-containing-protein MLFFAFSAAAKPIYQYVKSDGSIAFTDRKPIKHSYSIYRTGCYACQLRSSLNWHKIPIYPDKYHLEIATASKQHNVDPALVRAMIHAESAFKANARSKKGAIGLMQLMPATAKYMGIKRPTQPSQNILGGVKYLSYLLEKFNGNIRLATAAYNAGPNAVKKYRGIPPFAETKAYVERVGILHNRYRVAINKKKRKISPS >NZ_CP023399.1|WP_100914615.1|55787_57686_-|beta-ketoacyl-synthase MTALPIVVGMGGINAAGRTSAHQGFRRIVIDKLTQEARQETFLGLATLMNLVRQENGELVDHENQAICASEIEAKFGEQIIAGTLIRKIENNHFDVDATHGHQKMTVMPSEGDSIVFETSARHLPSPQPRDWQIENLEGGKVKVTINGELSLKHDTYRDNPIKAAGQFPTGFDPSTLYNSRYQPRGLQATIFAATDAIKSTGLAWQDVLAVVDPDQIGTYSASVAGQMQDEAFGGMMKNRLRGDRVSTKNLALGLNTMSTDFINAYVTGNVGTTFTTSGACATFLYNLRAAVHDIKAGRTRVAIVGSVDCAITPEIVEGFGNMSALANIEGLKKLDNSDTPDLRRTSRPFGENCGFTIGEGAQFAVLMDDALALELGADVMGSVPDVFVNADGIKKSITAPGPGNYITMAKSVALATSILGEEAVQKRSYILAHGSSTPQNRVTESIIYDRVAKAFNIDNWKLAAPKAFVGHTIGPASGDQMAMALGIFSHNIMPGITTIDKVADDVYAERLDIRTEHYECEAQDIAFINSKGFGGNNATAPMFSANITLEMLTKRHGESAMASYRDKLTQTKQNQADYSAAANLGDYQLIYRFGDGLIDENDIELTQETLSLPGFTHSVKLTASNPFDDMV >NZ_CP023399.1|WP_100914616.1|57814_58801_+|helix-turn-helix-domain-containing-protein MTTKVRLLLYPQILATSVTLPVEMLKAGEASFHDRKNRTLDIKFVADKQELIQCRAGCAFLPELTVGCDDNADFIIVPGIWRNPRPVIKQNQAAIGYLKAQWQQGATIVGVGTGNCLLAEAGLLDHHAATTHWHYAKQFRKDYPKVDLKPEFFITQSERIFCAASLNSLADIMVYLIAQLFGRDAAQNVERNFSHEIRKPYEEQRYLEGAVDRHADELIAQIQFWLKNNASSELSMQHVAEQFGISQRTLSRRFKAATGTTANHYLQKMRLEMAQELLASTNLTVQDVAVAVGYVDQGYLTKVFKRELKQTPSDYRLLVRKKLFKTDS >NZ_CP023399.1|WP_100914617.1|59341_61159_+|S8-family-serine-peptidase MKIKIKPSLLAASVILACTPAIAKITDSTFPLQNGDPLVAEQWHLQNIGQTGFSQSAGTSGNDLDIDFTHLMGIKGRGITVSVIDSGVEIDHPDLRANVVAGSLNLGDGSDYPQDNNGHGTSVAGLIAATEANGLGGRGVAPHANIVGFNYLPNQTVGNWLVSHGLSEDFRQLDRFTDPRVFNQSYGSTPATPVVHNYVLNPWQELQDDVYRYITEQSHWGRGSVYVKSAGNSFGTYNAYYQGVPILVLPYENNQFFNNQGLPFHNANIAPDNNNHWNLVVSALDAHGKLTSYSSVGSNVFVAAPGGEFGEAAPAMVTIDLQGCEAGSNVAGEHGNALHGGSELDPNCDYTGTMNGTSSAAPNTSGAIANIMSANHALDARAIRHILAQTARKTDENHPGVDLTFENADGELVTYNAIPSWQRNAAGYNFHQFYGFGAVDVDDAVYKALFTPVTPLPEQVITPWQTQIADAEIPDASLTGATSTLAFEQDVTVEAVQVKLNIDHTRLRDLAIELISPSGTRSVLMSARTGWLGSTEGGYTDTLMLSNHFYGEAAQGEWQLKVIDTDKGSSFTVGYNASIGLVGFNRNNNVENGVLKDWSLRVIGH >NZ_CP023399.1|WP_100914618.1|61169_61610_+|hypothetical-protein MKKLFIIAALTLSTQAAIANSAEQFKGLTITPVEQLTLAQSEVIVNDKTFKTNGTKTVVPGLDVFDVSGLNAYTVTGEIIVKLSGDVDFEQFNKDNDLIIKQAFKSYYILKSQSQRDLLPLVDALKKLEGVSSVTLELVDKGVIEK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP023399_2 | 47608-47674 | Orphan |
NA
Consensus repeat of NZ_CP023399_2
|
1 spacers
spacers of NZ_CP023399_2
>2.1|47632|19|NZ_CP023399|CRISPRCasFinder TTTTTTTAGCGGTAATCTT |
DEDDh |
CRISPR arrays and Neighbor proteins around NZ_CP023399_2
The CRISPR arrays of NZ_CP023399_2 >merge|NZ_CP023399|2|47608-47674|CRISPRCasFinder AATCGCTAGTAAAATCTCGATCTGTTTTTTTAGCGGTAATCTTGATACTTAGTAAAAATATGATCTC >NZ_CP023399|2|2|47608-47674|CRISPRCasFinder AATCGCTAGTAAAATCTCGATCTG TTTTTTTAGCGGTAATCTT GATACTTAGTAAAAATATGATCTC
>NZ_CP023399.1|WP_157813528.1|46396_47041_+|hypothetical-protein MYNLLMDKASLSQVFLAFAPMEALKVKHNIDYALEALSLPSAELLAEMQVLGIDLPDKPSASKKVDLTTINSTVLSSHLTELFLDKHKLLKTFSDFSLSEQVELRAFLQSLMPMLHLPPQALVEKMQREGVLDVLMGESDKAKAVKDKIAALSQNKRDHKVDEVKERLAKLSNKPQQCADELKIARVKANLNKASVSDNAAIIADAKQKIVQLG >NZ_CP023399.1|WP_100914609.1|44599_46279_+|OmpA-family-protein MEQEKQQLIEQIISTAKREFDIDLAKEKLISEEEYRALVFTELLSLENAELSRLIEQFNASPSHQAHASQAEPISQPKQRNNGVIIAASVAVLALCAVGAWFGLSGDTTAQSGQVAAVPNKVISQKSAQMSKPTPVEKEPEAVYQSLFAIHGSNTIGEKLAPALLKAYFEEQGAKDIDWQQGRIATERTMQFEMDGEKLEIGLAAHGSSTAFKALNANMAQIGMSSRKIKTSEVEALKGQSGDLSKLGNEHIIALDGLAIIVNQNNPLKTITTETLSRIFSGEIDNWAEIGGDQAPIKVFARDDNSGTFDTFKSLVLKKYGRKLTKQAERFESSSELSEHVSRDDYAIGFIGLNYIRYSKALAIADSSDTKAIYPTRFTIATEDYPLARRLYLYTPTNSATQIKDFANFAISDRGQEIVQELGLISQSIRVEDVVASELAPDKYNQYAKLAKRLSLNFRFNYATRDLDNKGKRDLQRLVSFVEENPSRKLVLMGFSDSIGARDKNTMLSLSRAKSVEQELNARGINVYAVEGFGEDMPLANNDNEVGRERNRRVEVWVL >NZ_CP023399.1|WP_100914608.1|44111_44450_+|DUF3718-domain-containing-protein MKALATTAIAASLLMLPSAAIAFDKSTIESLLVQACKDVKSNKVLKLKTNLKTNHLTLPLISEKLMCNGESVYDFAMTHNADKTAKLLRTGSVSIHDVAYNSSDKIWVWLDD >NZ_CP023399.1|WP_010558407.1|43194_43896_+|purine-nucleoside-phosphorylase MSTPHINAQLNDFAETVLMPGDPLRAKHIAETFLEDAREVTSVRNILGYTGTYNGKPVSIMASGMGIPSMSIYARELIVSYGVKNLIRIGTCGGIGSDIKIRDVIFAQGASTDSNVNRARVQGYDFAAIANFDLLVNGVEAAKRLGIAAKVGNVFTTDTFYQASNDLYQKLDKLGVLAVDMETAGLYGVAAEYGANAMALFTVSDHVITGEATPSEERQNSFNEMVKIALESI >NZ_CP023399.1|WP_100914607.1|41965_43177_+|phosphopentomutase MARALILMADSLGIGAAPDAERFGDKGANTLAHLLAAYHGETGQKLALPNLSKLGLIDACEQASNTNCQVAERNAPIAAYGYAKEISSGKDTPSGHWEMAGVPVLFDWGYFHDKQNSFPDAFIAEFIKRANLPGILGNCHSSGTVILEALGEEHMQTGKPICYTSTDSVFQIACHEDTFGLERLYEICEIARELLDEYNIGRVIARPFLGDAADNFARTGNRRDYSVLPPAPTLLDKLAEDNGAVISIGKISDIYAHQGITEKHKAPGLENLMAKTAEVFKQAKDHSLTFVNLVDFDEKFGHRRDAIGYAKALKQLDDYLPEFLALLGKDDVLMITADHGCDPTWQGTDHTREYVPVLAYTPGMTPVNLGERETFADIGQTLATWFSLAPLEYGNGFKSQLGS >NZ_CP023399.1|WP_040642196.1|41029_41656_-|uridine-kinase MTRTIIAIAGASASGKSLFSQTIYNELLNELAPGAIAIIEEDAYYRDQSHLPFEHRTQTNYDHPDAFEHELLLEHLNQLKEGKPVDIPVYDYAKHTRSDQTRRIQPAKILIVEGILLLSDPKLCDEFNIKVFIDTPLDICLMRRMQRDLEERGRSLQSVIEQYQATVRPMFYQFIDPSKHNADVVVTRGGRNRVAIDILKSKIKQLLQ >NZ_CP023399.1|WP_100914606.1|39778_41008_-|NupC/NupG-family-nucleoside-CNT-transporter MTSLMSLVGIASLLALAFLCSTNRKKINLRTVGGAFALQVLIGAFVLYVPAGREVLNGISVGVANVISYANDGIRFLFGGLAGDEIDGIGFVFAIKVLPVIVFFSALVAVLYHLRVMDFVIKILGGGLQKLLGTSKPESLSATANIFVGQTEAPLVVKPFIATMTRSELFAVMVGGLATVAGSILAGYVSLGVELKYLIAASFMAAPGGFLMAKMMVPETETPKQDLNDLDHGEEKPINVIDAAAGGALNGMQLAMNVGAMLLAFVALIALSNGFVGWIGGWFDNPDLTIQQILGYAFAPIAWLLGIPWSEAQLAGSFIGQKLVVNEFVAYLDFMNYQAQLSEHTKAVVTFALCGFANLSSIAILLGGIGGMAPSRRKDIAQLGLRAVLAGSMANLMSAAIAGFFLSIA >NZ_CP023399.1|WP_100914605.1|38509_39232_+|ABC-transporter-substrate-binding-protein MLKKLLTASIITLSFSASAGAIRFGYNQQYSAPHVINNGKGTNVSGIVIDVSNAISQEAGFMAKQLPLPRKRIEQYLIDGKIDAQCHANPIWYNAPSIIWSEVLYSDADIIVSDQAIASLQALSSHKQFKLGTVLGYKYPNLTEYFEAGNLKRFNSTSSKDSLTRFTKGELDGFVASYSEANYLTRLRRFNVLEVNSYDLHCSFSPKLNAEKRQRLIDAAHKLRDNGEFKRVFAKYIKPE >NZ_CP023399.1|WP_100914604.1|36963_38445_+|VCBS-repeat-containing-protein MKIITALVVTLGFASTSSLANSKKNLFNEVAIETDLTLSHPVYPIDLLPNPGKELMLIGTHSGQQYIDIFAASEAQLYNRIKRVKVPDTMLGIDFTKWQDATQQVYLFSSNGIYQLALEGTELVVPVVDTQTYLKKSGAQHLSKMEFVYNINDDQKADFFTTDLNHSYVHISDETGYQMAKVDLPARLEIEGHSAEIEPPRFAFFDTNEDNISELIGFENGLLNLLPSVGSNPGQAIKLRDDIMVDDWWHTKDADGDSLDQSNLVYRKLERVVDVNHDGIFDLVVRYTRSSGALDRQNDYEIYLGAIEQGKFIFQNQTNSLIQDEGTLTDLHLLDIDNDKQLEALVSGFDIGVSQIIGALLSGSVSQDVHLFYQNEKGEFSPKNKITREVELSFSLSKGRSGSPVVLFADVNGDGKKDWVLSDEQKAFNVYLANNKHSFSRRASKYKTTLPMQGRRVVTSDLNLDGKDDLVMSYGRLDDKNMRKTIKILFATS >NZ_CP023399.1|WP_100914603.1|34650_36873_-|hypothetical-protein MIDKSNVATQNISYANVCRSDVMKELPEKYYLTHFIEFAEFIKSTSSHLLGDDETAFFAEFAKLPEDAQCILVRIVNRKSPFVRKDTLVYSEVADSGAAINLLRKVGFVSRISKHCFHDFIGQLNKQDLIDLSISLTIDKPAKSANKNKWLNFVSNALSNALSYDQVKKSPVLTSHIYFAKQDTFHYLLFLYFGHLSGRLNQFSMRDLGVMQTQSVQQQQANFSDKDEAKSAFYYANLFKAIKEANKSDIPLLQTLAEQLIASDKPIGFLATQKYHHSLYKLAIVLIEQESTLGETLLSQSQHPAAREKYIRLLYKSDRKAQCEALLLAILDDPDSETLLLFAEDFYAQKFNQKRTSVLTDMLKKSGEPIGIDEAYLGQVETGVKDRYIKQHRLCYFTENKLWRALFALTFWHELFQHPASQRANEFSYYPKVLKQDNFYDLLHTEIDHKLSEFTNSQQFCDYISATIDQYSGEPNGLFYWHNELKDVLTTFIQHAPFESIKHHLLAMSKTFKALCDGYPDLMYIEQGEVFFEEIKAPGDSLRRNQLVSIKQLINAGFNVSIQRTEWRFNSEQNYVVVDIETTGGKKDTHRITEIGMVRVEQGKITDSWQTLVNPERHIPKMITELTGISNEMVKDAPKFEQIAEKLDTFSRNAIFVAHNVNFDYGFIRQEFARLNKKYTRAKICTVQQARKYLPGFKSYSLGKLCCDLNIELKNHHRALDDAKAAAEILLRINHVRAEQ >NZ_CP023399.1|WP_100914611.1|47960_49568_-|isocitrate-lyase MAKQLTYQQEIDRIATLKEVGEKKWQNINPESAARMRLQNRFQNGLEIARYTADIMRKDMEAYDQDPSQYTQSLGCWHGFIGQQKMISIKKHFQNNTDRRYLYLSGWMVAALRSEFGPLPDQSMHEKTSVAALIEELYTFLRQADARELGGYFRELDAARKAGNSAEEETILAKINDHKTHVVPIIADIDAGFGNEEATYLLAKKMIEAGACCIQIENQVSDEKQCGHQDGKVTVPHEDFLAKINAVRYAFLELGVDNGVIVARTDSLGAGLTKQIAVSNEPGDLGDQYNAFLDVEEIESTDIKNGDVILNRNGKLVRPKRLPSNLFQFRSGTGEERCVLDCITSLQNGADLLWIETEKPHVGQIGAMVNEIRKVVPNAKLVYNNSPSFNWTLNFRQQVFDIMVEEGKDVSQYNREQLMSVEYDNTELAALADEKIRTFQADAAREAGIFHHLITLPTYHTAALSTDNLAKEYFGEQGMLGYVKNVQRQEIRQGIACVKHQNMAGSDMGDDHKEYFSGDAALKAAGEDNTMNQFS >NZ_CP023399.1|WP_100914612.1|49821_51990_+|malate-synthase-G MTTRIHHAGLAVDPSLYAFINHQVIPGTGIDVEHFWQGFSKIVSEMTPINRALLVKRDDLQVQIDQYHQAHSNFEFDHYKQFLQQIGYLKPAPSDVTVVTESVEPEIATMAGPQLVVPVSNARFALNAANARWGSLYDALYGTDAISYDNGREAGRDYNPVRGNAVIDFGREFLDQALPLTYGSHKDASDYAIVDGKLIVTLNNVSQSALQSPEQLVGYNGTKQQPTELILKNNGLHIVIQFDENHQIGKQDSAHIKDIVLESALTTIMDCEDSVAAVDAEDKVGVYSNWLGLNKGDLSVEFEKGGKTLTRALNPDLTYTDLSGDEQQLKGRSLMLVRNVGHLMTNPAILDKNGDEVFEGIMDAVITSLIALHDLKGKSTRKNSRANSINIVKPKMHGPEEVAYANTLFCKVESLLNLPENTLKMGIMDEERRTSVNLAACIAEAKERVIFINTGFLDRTGDEIHTSMLAGPMLPKDAIKAQKWIGAYENNNVDVGLACGLSGKAQIGKGMWPMPDEMAQMLVQKTAHVQSGANTAWVPSPTAATLHAMHYHQQDVFSLQQEIKQRVAANVDDILCPPLHPEPKSMTSDAIQNELNNNAQGILGYMVRWVEQGVGCSKVPDINNVGLMEDRATLRISSQHMTNWLCHGVCSEKQVIDTLEAMAKVVDKQNEGDANYRNMSDNYAQSVAFQAALELVLSGTKQPNGYTEPVLHRRRLEFKAAS >NZ_CP023399.1|WP_010558401.1|52283_53006_-|YebC/PmpR-family-DNA-binding-transcriptional-regulator MGRAFEVRKNAMAKTAAAKTKVNSKYGKEIYVAAKNGGAELDTNLSLKRLIEKAKKDQVPAHVIDKAIEKAKGGAGEDYQPARYEGFGPGNCMVIIDCLTDNPNRTIKDVRLAFTKTNSKIGGPGTAAHSFDHQAVFAFAGDNDEEVLEALMMADVDVTDVEVEDGIVSVFAPHTEFYNVKTALAEAFPDVKLDVEEITFVPQTSVDIEGDDIPLFQKFMDMLNDSDDVQNIYHNAIVQD >NZ_CP023399.1|WP_100914613.1|53277_54102_-|autotransporter-domain-containing-protein MLGITTGANAVEIKVGQSHSDLPLDQSGTLTSVTPSGTSFSLAHDFSESVRVSLDYINWEDTHFSRKANSLKIDSQSYAATLTYFIENFAISGNYTYWQSDYRESFLELPSSQHKTYAPSYGLSVGYGIFFDEWVIEPSLLVQYNEWRYRDIMIEQNEMALALRDALEDETLVVSGLLSASKIIEIRPDTYLLAGGVIRWNELFQGDGTKQTDKFVSFAGRKSYPHNTDDDYAELSVFVTYDITPEWMVELDSSIAFLPHDNVSSVSWRIGYRF >NZ_CP023399.1|WP_010558399.1|54566_54800_+|hypothetical-protein MDFYILKKGDELKAIPADKAICKSYEKSGYSYIDKVAACTEQAAINKLKGKTFETLKGPLLRLSVVIFALILLWWVS >NZ_CP023399.1|WP_100914614.1|54823_55423_-|lytic-transglycosylase-domain-containing-protein MLFFAFSAAAKPIYQYVKSDGSIAFTDRKPIKHSYSIYRTGCYACQLRSSLNWHKIPIYPDKYHLEIATASKQHNVDPALVRAMIHAESAFKANARSKKGAIGLMQLMPATAKYMGIKRPTQPSQNILGGVKYLSYLLEKFNGNIRLATAAYNAGPNAVKKYRGIPPFAETKAYVERVGILHNRYRVAINKKKRKISPS >NZ_CP023399.1|WP_100914615.1|55787_57686_-|beta-ketoacyl-synthase MTALPIVVGMGGINAAGRTSAHQGFRRIVIDKLTQEARQETFLGLATLMNLVRQENGELVDHENQAICASEIEAKFGEQIIAGTLIRKIENNHFDVDATHGHQKMTVMPSEGDSIVFETSARHLPSPQPRDWQIENLEGGKVKVTINGELSLKHDTYRDNPIKAAGQFPTGFDPSTLYNSRYQPRGLQATIFAATDAIKSTGLAWQDVLAVVDPDQIGTYSASVAGQMQDEAFGGMMKNRLRGDRVSTKNLALGLNTMSTDFINAYVTGNVGTTFTTSGACATFLYNLRAAVHDIKAGRTRVAIVGSVDCAITPEIVEGFGNMSALANIEGLKKLDNSDTPDLRRTSRPFGENCGFTIGEGAQFAVLMDDALALELGADVMGSVPDVFVNADGIKKSITAPGPGNYITMAKSVALATSILGEEAVQKRSYILAHGSSTPQNRVTESIIYDRVAKAFNIDNWKLAAPKAFVGHTIGPASGDQMAMALGIFSHNIMPGITTIDKVADDVYAERLDIRTEHYECEAQDIAFINSKGFGGNNATAPMFSANITLEMLTKRHGESAMASYRDKLTQTKQNQADYSAAANLGDYQLIYRFGDGLIDENDIELTQETLSLPGFTHSVKLTASNPFDDMV >NZ_CP023399.1|WP_100914616.1|57814_58801_+|helix-turn-helix-domain-containing-protein MTTKVRLLLYPQILATSVTLPVEMLKAGEASFHDRKNRTLDIKFVADKQELIQCRAGCAFLPELTVGCDDNADFIIVPGIWRNPRPVIKQNQAAIGYLKAQWQQGATIVGVGTGNCLLAEAGLLDHHAATTHWHYAKQFRKDYPKVDLKPEFFITQSERIFCAASLNSLADIMVYLIAQLFGRDAAQNVERNFSHEIRKPYEEQRYLEGAVDRHADELIAQIQFWLKNNASSELSMQHVAEQFGISQRTLSRRFKAATGTTANHYLQKMRLEMAQELLASTNLTVQDVAVAVGYVDQGYLTKVFKRELKQTPSDYRLLVRKKLFKTDS >NZ_CP023399.1|WP_100914617.1|59341_61159_+|S8-family-serine-peptidase MKIKIKPSLLAASVILACTPAIAKITDSTFPLQNGDPLVAEQWHLQNIGQTGFSQSAGTSGNDLDIDFTHLMGIKGRGITVSVIDSGVEIDHPDLRANVVAGSLNLGDGSDYPQDNNGHGTSVAGLIAATEANGLGGRGVAPHANIVGFNYLPNQTVGNWLVSHGLSEDFRQLDRFTDPRVFNQSYGSTPATPVVHNYVLNPWQELQDDVYRYITEQSHWGRGSVYVKSAGNSFGTYNAYYQGVPILVLPYENNQFFNNQGLPFHNANIAPDNNNHWNLVVSALDAHGKLTSYSSVGSNVFVAAPGGEFGEAAPAMVTIDLQGCEAGSNVAGEHGNALHGGSELDPNCDYTGTMNGTSSAAPNTSGAIANIMSANHALDARAIRHILAQTARKTDENHPGVDLTFENADGELVTYNAIPSWQRNAAGYNFHQFYGFGAVDVDDAVYKALFTPVTPLPEQVITPWQTQIADAEIPDASLTGATSTLAFEQDVTVEAVQVKLNIDHTRLRDLAIELISPSGTRSVLMSARTGWLGSTEGGYTDTLMLSNHFYGEAAQGEWQLKVIDTDKGSSFTVGYNASIGLVGFNRNNNVENGVLKDWSLRVIGH >NZ_CP023399.1|WP_100914618.1|61169_61610_+|hypothetical-protein MKKLFIIAALTLSTQAAIANSAEQFKGLTITPVEQLTLAQSEVIVNDKTFKTNGTKTVVPGLDVFDVSGLNAYTVTGEIIVKLSGDVDFEQFNKDNDLIIKQAFKSYYILKSQSQRDLLPLVDALKKLEGVSSVTLELVDKGVIEK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NZ_CP023399_2 | 2.1|47632|19|NZ_CP023399|CRISPRCasFinder | 47632-47650 | 19 | NZ_CP023398.1 | 2407484-2407502 | 0 | 1.0 |
1. spacer 2.1|47632|19|NZ_CP023399|CRISPRCasFinder matches to position: 2407484-2407502, mismatch: 0, identity: 1.0
tttttttagcggtaatctt CRISPR spacer tttttttagcggtaatctt Protospacer *******************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP023399_1 | 1.2|47330|29|NZ_CP023399|CRISPRCasFinder | 47330-47358 | 29 | NZ_LR214986 | Mycoplasma cynos strain NCTC10142 plasmid 13 | 721315-721343 | 4 | 0.862 |
NZ_CP023399_1 | 1.2|47330|29|NZ_CP023399|CRISPRCasFinder | 47330-47358 | 29 | HG796835 | Uncultured bacterium plasmid pRGF00158 | 2281-2309 | 5 | 0.828 |
NZ_CP023399_1 | 1.3|47383|27|NZ_CP023399|CRISPRCasFinder | 47383-47409 | 27 | FJ870916 | Sulfolobus spindle-shaped virus 7, complete genome | 101-127 | 5 | 0.815 |
NZ_CP023399_1 | 1.2|47330|29|NZ_CP023399|CRISPRCasFinder | 47330-47358 | 29 | NZ_CP045608 | Bacillus cereus strain SB1 plasmid p2, complete sequence | 150267-150295 | 7 | 0.759 |
NZ_CP023399_1 | 1.2|47330|29|NZ_CP023399|CRISPRCasFinder | 47330-47358 | 29 | NZ_CP042405 | Leuconostoc mesenteroides strain CBA3628 plasmid unnamed1, complete sequence | 22287-22315 | 8 | 0.724 |
NZ_CP023399_1 | 1.2|47330|29|NZ_CP023399|CRISPRCasFinder | 47330-47358 | 29 | NZ_CP046063 | Leuconostoc mesenteroides subsp. mesenteroides strain CBA3607 plasmid unnamed1, complete sequence | 26470-26498 | 8 | 0.724 |
1. spacer 1.2|47330|29|NZ_CP023399|CRISPRCasFinder matches to NZ_LR214986 (Mycoplasma cynos strain NCTC10142 plasmid 13) position: , mismatch: 4, identity: 0.862
taaaaacacaaaaatgtcgttttattgat CRISPR spacer ttcaaacaaaaaaatgtcgttttaatgat Protospacer * ***** *************** ****
2. spacer 1.2|47330|29|NZ_CP023399|CRISPRCasFinder matches to HG796835 (Uncultured bacterium plasmid pRGF00158) position: , mismatch: 5, identity: 0.828
taaaaacacaaaaatgtcgttttattgat CRISPR spacer caaacacacaaaaatgtcgttttgtttct Protospacer .*** ******************.** *
3. spacer 1.3|47383|27|NZ_CP023399|CRISPRCasFinder matches to FJ870916 (Sulfolobus spindle-shaped virus 7, complete genome) position: , mismatch: 5, identity: 0.815
ttaaggaatttgatcattaatttacta CRISPR spacer ataaggaatttcatgattaatttaatg Protospacer ********** ** ********* *.
4. spacer 1.2|47330|29|NZ_CP023399|CRISPRCasFinder matches to NZ_CP045608 (Bacillus cereus strain SB1 plasmid p2, complete sequence) position: , mismatch: 7, identity: 0.759
taaaaacacaaaaatgtcgttttattgat CRISPR spacer taaaaacacaaaaatttcatttttagggg Protospacer *************** **.**** *.
5. spacer 1.2|47330|29|NZ_CP023399|CRISPRCasFinder matches to NZ_CP042405 (Leuconostoc mesenteroides strain CBA3628 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.724
taaaaacacaaaaatgtcgttttattgat CRISPR spacer caaaaacaaaaaaatgtggttttaaacca Protospacer .******* ******** ******
6. spacer 1.2|47330|29|NZ_CP023399|CRISPRCasFinder matches to NZ_CP046063 (Leuconostoc mesenteroides subsp. mesenteroides strain CBA3607 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.724
taaaaacacaaaaatgtcgttttattgat CRISPR spacer caaaaacaaaaaaatgtggttttaaacca Protospacer .******* ******** ******
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|