CRISPRimmunity

Please click to download your results

Overview of predicted results

Overview of the results

Contig_ID	Contig_def	CRISPR array number	Contig Signature genes	Self targeting spacer number	Target MGE spacer number	Prophage number	Anti-CRISPR protein number
NZ_CP047045	Caulobacteraceae bacterium 0127_4 chromosome, complete genome	1 crisprs	csa3,WYL,cas3,DEDDh	0	2	3	0

Results visualization

Click the left colored region to show detailed information

CRISPR-Cas detection and classification

Crispr_ID: NZ_CP047045_1

CRISPR_ID

CRISPR_location

CRISPR_type

Repeat_type

Spacer_info

Cas_protein_info

CRISPR-Cas_info

NZ_CP047045_1

3277770-3278108

Orphan

Consensus_repeat	Method
GTTGAGAACGAAGAGTCCTTCAACAC	CRISPRCasFinder

5 spacers

The CRISPR arrays of NZ_CP047045_1

>merge|NZ_CP047045|1|3277770-3278108|CRISPRCasFinder
TTCGAAGCCGAAGACAGCTTCAACACGGAAAACAGACCGAGATCGACACCACGAGATCGAAACGGAAGGATCTTTCAACACCGAAGAATCCTTCAACGACGATCACTCGACCAGCAATCAAGTCGATAGCGAAGTGAAGACGGACGTTGACGTCGAAGACTCTTTCAACACCGACAACTCGATCAACACGGAGAACGAGTCCGAGATCGAAGAATCCTACAACACGGACGTCGAACTCGAAGAATCTTTCAACACCGAAGTTGAGAACGAAGAGTCCTTCAACACGGACAACTCGATCGACACCGACAACTCCGTTGAGAACGAAGAGTCGTTCAACAC

>NZ_CP047045|1|1|3277770-3278108|CRISPRCasFinder
TTCGAAGCCGAAGACAGCTTCAACAC	GGAAAACAGACCGAGATCGACACCACGAG
ATCGAAACGGAAGGATCTTTCAACAC	CGAAGAATCCTTCAACGACGATCACTCGACCAGCAATCAAGTCGATAGCGAAGTGAAGACGGAC
GTTGACGTCGAAGACTCTTTCAACAC	CGACAACTCGATCAACACGGAGAACGAG
TCCGAGATCGAAGAATCCTACAACAC	GGACGTCGAACTCGAAGAATCTTTCAACACCGAA
GTTGAGAACGAAGAGTCCTTCAACAC	GGACAACTCGATCGACACCGACAACTCC
GTTGAGAACGAAGAGTCGTTCAACAC

Protein	Signature genes	Signature genes Name	Protein_function
NZ_CP047045.1\|WP_158767333.1\|3285830_3287387_+\|hypothetical-protein	unknown	unknown	gnl\|CDD\|370724
NZ_CP047045.1\|WP_158767330.1\|3282713_3284048_+\|FAD-binding-protein	unknown	unknown	gnl\|CDD\|223354
NZ_CP047045.1\|WP_158767334.1\|3287412_3289611_+\|hypothetical-protein	unknown	unknown	unknown
NZ_CP047045.1\|WP_158767329.1\|3281280_3282717_+\|UbiA-family-prenyltransferase	unknown	unknown	gnl\|CDD\|236195
NZ_CP047045.1\|WP_158767316.1\|3269821_3270733_-\|glycosyltransferase	unknown	unknown	gnl\|CDD\|223539
NZ_CP047045.1\|WP_158767318.1\|3272103_3272319_-\|hypothetical-protein	unknown	unknown	unknown
NZ_CP047045.1\|WP_158767324.1\|3277606_3277744_+\|hypothetical-protein	unknown	unknown	unknown
NZ_CP047045.1\|WP_158767328.1\|3280859_3281276_+\|GtrA-family-protein	unknown	unknown	gnl\|CDD\|377229
NZ_CP047045.1\|WP_158767331.1\|3284053_3284815_+\|SDR-family-NAD(P)-dependent-oxidoreductase	unknown	unknown	gnl\|CDD\|180838
NZ_CP047045.1\|WP_158767320.1\|3272802_3273816_-\|phage-recombination-protein-Bet	unknown	unknown	gnl\|CDD\|273871
NZ_CP047045.1\|WP_158767319.1\|3272302_3272806_-\|hypothetical-protein	unknown	unknown	gnl\|CDD\|238038
NZ_CP047045.1\|WP_158767321.1\|3273817_3274297_-\|hypothetical-protein	unknown	unknown	gnl\|CDD\|368503
NZ_CP047045.1\|WP_158767315.1\|3268761_3269694_-\|glycosyltransferase-family-2-protein	unknown	unknown	gnl\|CDD\|132997
NZ_CP047045.1\|WP_158767322.1\|3275128_3275986_-\|hypothetical-protein	unknown	unknown	unknown
NZ_CP047045.1\|WP_158767335.1\|3289837_3290419_+\|hypothetical-protein	unknown	unknown	unknown
NZ_CP047045.1\|WP_158767326.1\|3278649_3279573_+\|NAD-dependent-epimerase/dehydratase-family-protein	unknown	unknown	gnl\|CDD\|187579
NZ_CP047045.1\|WP_158767317.1\|3270864_3271560_+\|cephalosporin-hydroxylase	unknown	unknown	gnl\|CDD\|226041
NZ_CP047045.1\|WP_158767327.1\|3279576_3280863_+\|NAD(P)-binding-protein	unknown	unknown	gnl\|CDD\|235977
NZ_CP047045.1\|WP_158767323.1\|3276118_3277192_-\|SDR-family-NAD(P)-dependent-oxidoreductase	unknown	unknown	gnl\|CDD\|224011
NZ_CP047045.1\|WP_158767332.1\|3284835_3285630_-\|methyltransferase-domain-containing-protein	unknown	unknown	gnl\|CDD\|369778

Protein	Function_ID	Function_description	E-value
NZ_CP047045.1\|WP_158767333.1\|3285830_3287387_+\|hypothetical-protein	gnl\|CDD\|370724	pfam09852, DUF2079, Predicted membrane protein (DUF2079). This domain, found in various hypothetical prokaryotic proteins, has no known function.	0.000565815
NZ_CP047045.1\|WP_158767330.1\|3282713_3284048_+\|FAD-binding-protein	gnl\|CDD\|223354	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion].	1.17954e-35
NZ_CP047045.1\|WP_158767329.1\|3281280_3282717_+\|UbiA-family-prenyltransferase	gnl\|CDD\|236195	PRK08238, PRK08238, UbiA family prenyltransferase.	0
NZ_CP047045.1\|WP_158767316.1\|3269821_3270733_-\|glycosyltransferase	gnl\|CDD\|223539	COG0463, WcaA, Glycosyltransferases involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane].	2.63336e-08
NZ_CP047045.1\|WP_158767331.1\|3284053_3284815_+\|SDR-family-NAD(P)-dependent-oxidoreductase	gnl\|CDD\|180838	PRK07102, PRK07102, SDR family oxidoreductase.	1.1287e-83
NZ_CP047045.1\|WP_158767328.1\|3280859_3281276_+\|GtrA-family-protein	gnl\|CDD\|377229	pfam04138, GtrA, GtrA-like protein. Members of this family are predicted to be integral membrane proteins with three or four transmembrane spans. They are involved in the synthesis of cell surface polysaccharides. The GtrA family are a subset of this family. GtrA is predicted to be an integral membrane protein with 4 transmembrane spans. It is involved is in O antigen modification by Shigella flexneri bacteriophage X (SfX), but does not determine the specificity of glucosylation. Its function remains unknown, but it may play a role in translocation of undecaprenyl phosphate linked glucose (UndP-Glc) across the cytoplasmic membrane. Another member of this family is a DTDP-glucose-4-keto-6-deoxy-D-glucose reductase, which catalyzes the conversion of dTDP-4-keto-6-deoxy-D-glucose to dTDP-D-fucose, which is involved in the biosynthesis of the serotype-specific polysaccharide antigen of Actinobacillus actinomycetemcomitans Y4 (serotype b). This family also includes the teichoic acid glycosylation protein, GtcA, which is a serotype-specific protein in some Listeria innocua and monocytogenes strains. Its exact function is not known, but it is essential for decoration of cell wall teichoic acids with glucose and galactose.	1.21064e-12
NZ_CP047045.1\|WP_158767320.1\|3272802_3273816_-\|phage-recombination-protein-Bet	gnl\|CDD\|273871	TIGR01913, Uncharacterized_protein_UU154, phage recombination protein Bet. This model represents the phage recombination protein Bet from a number of phage, including phage lambda. All members of this family are found in phage genomes or in putative prophage regions of bacterial genomes. [Mobile and extrachromosomal element functions, Prophage functions].	1.81156e-21
NZ_CP047045.1\|WP_158767319.1\|3272302_3272806_-\|hypothetical-protein	gnl\|CDD\|238038	cd00085, HNHc, HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins. The alignment includes members of the large group of homing endonucleases, yeast intron 1 protein, MutS, as well as bacterial colicins, pyocins, and anaredoxins.	5.71703e-07
NZ_CP047045.1\|WP_158767321.1\|3273817_3274297_-\|hypothetical-protein	gnl\|CDD\|368503	pfam05565, Sipho_Gp157, Siphovirus Gp157. This family contains both viral and bacterial proteins which are related to the Gp157 protein of the Streptococcus thermophilus SFi bacteriophages. It is thought that bacteria possessing the gene coding for this protein have an increased resistance to the bacteriophage.	1.9493e-14
NZ_CP047045.1\|WP_158767315.1\|3268761_3269694_-\|glycosyltransferase-family-2-protein	gnl\|CDD\|132997	cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold. Glycosyltransferases (GTs) are enzymes that synthesize oligosaccharides, polysaccharides, and glycoconjugates by transferring the sugar moiety from an activated nucleotide-sugar donor to an acceptor molecule, which may be a growing oligosaccharide, a lipid, or a protein. Based on the stereochemistry of the donor and acceptor molecules, GTs are classified as either retaining or inverting enzymes. To date, all GT structures adopt one of two possible folds, termed GT-A fold and GT-B fold. This hierarchy includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. The majority of the proteins in this superfamily are Glycosyltransferase family 2 (GT-2) proteins. But it also includes families GT-43, GT-6, GT-8, GT13 and GT-7; which are evolutionarily related to GT-2 and share structure similarities.	2.2096e-05
NZ_CP047045.1\|WP_158767332.1\|3284835_3285630_-\|methyltransferase-domain-containing-protein	gnl\|CDD\|369778	pfam08242, Methyltransf_12, Methyltransferase domain. Members of this family are SAM dependent methyltransferases.	1.66406e-09
NZ_CP047045.1\|WP_158767326.1\|3278649_3279573_+\|NAD-dependent-epimerase/dehydratase-family-protein	gnl\|CDD\|187579	cd05271, NDUFA9_like_SDR_a, NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, subunit 9, 39 kDa, (NDUFA9) -like, atypical (a) SDRs. This subgroup of extended SDR-like proteins are atypical SDRs. They have a glycine-rich NAD(P)-binding motif similar to the typical SDRs, GXXGXXG, and have the YXXXK active site motif (though not the other residues of the SDR tetrad). Members identified include NDUFA9 (mitochondrial) and putative nucleoside-diphosphate-sugar epimerase. Atypical SDRs generally lack the catalytic residues characteristic of the SDRs, and their glycine-rich NAD(P)-binding motif is often different from the forms normally seen in classical or extended SDRs. Atypical SDRs include biliverdin IX beta reductase (BVR-B,aka flavin reductase), NMRa (a negative transcriptional regulator of various fungi), progesterone 5-beta-reductase like proteins, phenylcoumaran benzylic ether and pinoresinol-lariciresinol reductases, phenylpropene synthases, eugenol synthase, triphenylmethane reductase, isoflavone reductases, and others. SDRs are a functionally diverse family of oxidoreductases that have a single domain with a structurally conserved Rossmann fold, an NAD(P)(H)-binding region, and a structurally diverse C-terminal region. Sequence identity between different SDR enzymes is typically in the 15-30% range; they catalyze a wide range of activities including the metabolism of steroids, cofactors, carbohydrates, lipids, aromatic compounds, and amino acids, and act in redox sensing. Classical SDRs have an TGXXX[AG]XG cofactor binding motif and a YXXXK active site motif, with the Tyr residue of the active site motif serving as a critical catalytic residue (Tyr-151, human 15-hydroxyprostaglandin dehydrogenase numbering). In addition to the Tyr and Lys, there is often an upstream Ser and/or an Asn, contributing to the active site; while substrate binding is in the C-terminal region, which determines specificity. The standard reaction mechanism is a 4-pro-S hydride transfer and proton relay involving the conserved Tyr and Lys, a water molecule stabilized by Asn, and nicotinamide. In addition to the Rossmann fold core region typical of all SDRs, extended SDRs have a less conserved C-terminal extension of approximately 100 amino acids, and typically have a TGXXGXXG cofactor binding motif. Complex (multidomain) SDRs such as ketoreductase domains of fatty acid synthase have a GGXGXXG NAD(P)-binding motif and an altered active site motif (YXXXN). Fungal type ketoacyl reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.	8.99467e-25
NZ_CP047045.1\|WP_158767317.1\|3270864_3271560_+\|cephalosporin-hydroxylase	gnl\|CDD\|226041	COG3510, CmcI, Cephalosporin hydroxylase [Defense mechanisms].	5.57859e-48
NZ_CP047045.1\|WP_158767327.1\|3279576_3280863_+\|NAD(P)-binding-protein	gnl\|CDD\|235977	PRK07233, PRK07233, hypothetical protein; Provisional.	0
NZ_CP047045.1\|WP_158767323.1\|3276118_3277192_-\|SDR-family-NAD(P)-dependent-oxidoreductase	gnl\|CDD\|224011	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism].	5.6679e-106

>NZ_CP047045.1|WP_158767324.1|3277606_3277744_+|hypothetical-protein
MQLRLFSSVCAVALGLAFALMAWADDGSSTDEGSASAVENSVAAR
>NZ_CP047045.1|WP_158767323.1|3276118_3277192_-|SDR-family-NAD(P)-dependent-oxidoreductase
MTGYCLGRPEVRFDLAALKAFYADKRVLITGAAGSVGSALSLELARLGCAHLAMLDQFDHGLINIVESVRRIAPKLQITEALCDVRDSGRLDAWVRRIEPDVVIHSAALKHVHLGERHPVECVLTNLLGVRNALSAAVNAGAGHFMLISSDKAAAPSCVMGATKRLAELHLTGFQMERPTATRLKAVRFGNVLGSQGSVLPRFEAQIAAGGPLEVTHEDMERFFMSVQEAVGLILSVTAYGDEGAGTYFMEMGAPISIIELGRDMIRASGKEIAVEITGLRPGEKLKEQLADECEAITPTTLPGVFRVTPIAEDAYVTAADVAHFEALARTMENAVVRQRVFACLDQRLQRPARVAG
>NZ_CP047045.1|WP_158767322.1|3275128_3275986_-|hypothetical-protein
MRLYIHIGIGKTGTSSIQHMLANSAQALADCGFYYPQQGRNGTAAHHSLAAFDVDDLGVGIEAYFKALLEELDAQSAPNAILSSEGFCFCRPRVVRRIGELLSSYHVRVIFYARRPVELIASSYLEKLKAGQLTNATIEQFYKVCLAERSFFMSDRLDSWAIEFGRQALSVRLYDRRFLKGDSVSDFLDVIGAGEMPADMGEVQENPTLSSAFVPCLEAFDRAAPSSPMRPHIVAALVNASEDVVGSDIFSAATLKQIARDHAVANAIFAQTYLSREEARAFIAP
>NZ_CP047045.1|WP_158767321.1|3273817_3274297_-|hypothetical-protein
MRILTAIRANLEVHHETLEDPDLMLMLAEGETSLLETLDFMLEADLFDEGLLHGLKTQKDTLAVRLHRIEERRQSRRAILEQALLLMERKSLERPVATLSLSERPPNLIVEEEAQIPSRFFDLKPTLNRRLTKEALTSGEEVPGARLSNGSISLTVRRR
>NZ_CP047045.1|WP_158767320.1|3272802_3273816_-|phage-recombination-protein-Bet
MSGWSAHKDATKREFTSAQLKLIRRTIARQCTEIEFDQFIAVSVQAGLDPLRRQMAPLILNASDPERRRMVPWATIDGLRVIAARQGDYRPMETAPLIERDESRLDADLNPLGITRAEVCAWKSSDGVWHPVAGEAWWDEYAPTREEWAADATGQHHPTGKRQLDPVWLRMGRVMIAKCAEAQALRRGWPDILSGLYGEEELHGLRLAEQTASEVLREGDEAAKRRLLKTRTLWFVFGSDGGFNPVLAHEAFDRLRGFYAEASVEEIERFDQVNSTSLHTLWEWAPSDAFALKQISEARRLFGKANTEVSAEAPPAGSHQGDPQSPKQAPPTASVGS
>NZ_CP047045.1|WP_158767319.1|3272302_3272806_-|hypothetical-protein
MKTRAGKPNEIHCRQRPSRRLKQCIWDAQGGRCLACDQRLVASEFDHVVPLGLGGSNAPDNWAALCVSCHRTKTVEDLRRIAKAKRQRRYHETGRSRAAKTWSPFNSAAKQGFSKTLRRHLNGFVTRRCPCAVCSGENDFSQSSPDGADDGGDAAPKCTGSSDDRSS
>NZ_CP047045.1|WP_158767318.1|3272103_3272319_-|hypothetical-protein
MIDQAEARAASDALYSVIMGLLEEGLDRGASAPRGDDPCIARGEIFRALAADLATLAEAAALLGRFADRSP
>NZ_CP047045.1|WP_158767317.1|3270864_3271560_+|cephalosporin-hydroxylase
MPNDQHRARPQSPDQVRGRSFRTALSAETLSSIQTGALQTLYRDRGFLKSPFDVALYLQLIGRLRPASIIEIGTKHGGSALWFADQMTAHGLRARVVSVDLSPPADLSDARIQFVAGDAHDLSAALTHDLLASLPHPWLVSEDSAHTFEACTAVLRFFDNHLVVGDYIVIEDGVLSDMAERHYETYEHGPNRAVERFLSEHVDTYEIDGALCDFFGQNVTWNPNAWLRRAR
>NZ_CP047045.1|WP_158767316.1|3269821_3270733_-|glycosyltransferase
MTVASERPKLSVIIIGYNMARELPRTIRSMSPAMQRGLHDSDYELILLDNGSTQPFDANLLLGLAPNLSIHRVQNPSASPVGAIRLGLELARGDLVGVCIDGARMASPGLFSTALAASKLHAKPMIGTLAFHLGPEVQMQSVLKGYNQTVEDQLLDSSSWETDGYRLFGVSAFAGSSNDGWFVTPAETNALFLTARHWRELGGYDQRFQTPGGGLANLDIWLRVCEDHSGALIMLLGEATFHQVHGGIATNNPVSPWEQFHEEYMRLRGKAFAKSKRGPLFYGALNRETYPSLRRSINALTPV
>NZ_CP047045.1|WP_158767315.1|3268761_3269694_-|glycosyltransferase-family-2-protein
MQRTPFLSVIVVLYDMVRESERSLFSLSTAYQSQVGAEDYEVIVVENGSRQPVSRERAESFGPNFRYLDISRDVALPSPCTAINAGVAMARAPFVGIMIDGARIASPGVLMLAIQALKGFNRAVVATIGLHLGPAMQTRAAETGYNQQVEDALLASVPWRENGYKLFEVSVLTGVNTAAWFGPMAESNLIFLSRAMYEEAGGFDPRFDIPGGGIANLDFYNCVGGLPGATLISLFGEATFHQIHGGVMSSRPAETIADEVQRYMAQYHGIHGKAFQFSQQIPLLLGQFRPEVAACIKAAEPRWNAESSPP
>NZ_CP047045.1|WP_158767326.1|3278649_3279573_+|NAD-dependent-epimerase/dehydratase-family-protein
MQVVVVTGAAGLVGQNLIPRLKSQFRIVAVDKHAHNVGVLRKLHPDIEVIEADMAEAGNWEDRVTQADAVIALQAQIGGLDPEPFHRNNVLSTERLIAAAKRGERPYLVQVSSSVVRSAACDLYTESKKAQETLALKSGLEHVILRPTLMFGWFDRKHLGWLRRFMERTPVFPIPGSGDYLRQPLYAGDFAAIIASSLERRTQGIYNISGLERVTYVQMIRMIKQTVGAKTAIVHIPYAAFWTLLKVYSWFDKNPPFTTLQLQALVTPDVFEEIDWPALFDVQRTPLERALEETFLDPTYSHVALEF
>NZ_CP047045.1|WP_158767327.1|3279576_3280863_+|NAD(P)-binding-protein
MAKVIVIGAGPMGLAAAYEALKRGHEVDLLEASDRPGGMAAHFDFDGLSIERFYHFCCLSDRDTIALLDELGLNGALQWVSTKMGYFVDGKLYRWGDPFALLTFPKLGLVDKIRYGVQVFLSTKRSDWQRLDKISAKKWFTEWLGEDLYNKLWRPLLELKFYELTDKISAAWVWQRIKRLGNSRKSLLEERLGYIEGGSETLVKALVGAIENKGGRIRLKTPAKTFLIENGAVRGVETASGETIAADFVVSTAPMPLVPGMLSQAPELRPAYERMDNVGVVCVLHKLKRSVSDNFWINISDPDFEIPGLVEFSNLRPLANTVVYVPYYMPATQPKWGWSDEQFVAESWGYLKRINPALADDDRLASHVGRLRYAQPVCEVGFAELIPPAQTPIKGLQIADTCFYYPEDRGVSESIRYARNMIAEMGAA
>NZ_CP047045.1|WP_158767328.1|3280859_3281276_+|GtrA-family-protein
MKLSGEILRFVGVGAFAALVNWVSRIALSVVLPLSAAIIVAYLIGMITAYALSRKYVFQPTERGVGSELTRFALVNVVALVQVWAVTIVMAEYVLPALHVDWRPLEVAHAVGVASPIVTSYLGHRYFSFAQARKSGRG
>NZ_CP047045.1|WP_158767329.1|3281280_3282717_+|UbiA-family-prenyltransferase
MNTEAAFDCPLVVDMDGALLRTDTTFEGLARALFAKPVTTMLACASILRGRAAFKRAIAEIVQIDVESLPLREDFVAHLKQERARGRHLHLVSGSDHQVVERVAARLGLFESAQGSSSGHNLKGANKARFLVERFGRFAYAGDSPADLKVWPHAQSAVLAGASPETARRARKLAVPIEREFLDPPRTVKHWMRTLRLHQWAKNILLFVPLLLSGHFTDADLVLRCGLGFLILGLTASGTYIVNDLADLAADRRHRSKKERPFAAGVLKVYQGLMVAPALIGGGLVAAFLLSPAFAAALLSYLVCTLAYSLRLKAIPFLDVMLLAWLYTLRLLMGVALAQSTSSVWLLTFSMMFFFSMSLAKRHVEVAAASPDQDEIAGRGYLPMDAPVTLAFGISSSVASLLIMTLYLMEEAFPSNVYGQPALLWLVLPIVGLWTMRIWLLAHRGELDDDPVAFAVKDKVSIVLGSAMALAFAIAVFG
>NZ_CP047045.1|WP_158767330.1|3282713_3284048_+|FAD-binding-protein
MSTAYVNDDTRLSWGRVVRSHHLIAKPRFVDEIAPALADASVMGLRALPVGLGRSYGDSNLNPGGALIDLSKLDRIVAFDTQNGVLRADSGISLSDILRFSVPRGWFLPTTPGTRFVTLGGSIANDVHGKNHHAAGSVGCSIRRVGLVRSDRGALELASDIEPELFAATIGGLGLTGVIAWAEIQMVPIVSAYIEQEVLPFDDLDSFFDIAEASQNTFEHTVAWIDCTASGRHLGRGLFTRGNWAPEGGLDAHSDKLKLTMPVDGTPLAFNALSLRVLNTMIRTAQSFKAAESRVHYEPHLYPLDAIGAWNRLYGRAGFYQYQCIVPPDGRAAIAELLCAIADEGAGSVLGVLKSFGPKRSPGLLSFPMEGFTLAMDFCNAGARTHALFARLDAIVRAANGRLYAAKDGRMPASMFQTSYPEWARFAKQIDPLLTSAFWDRVSQ
>NZ_CP047045.1|WP_158767331.1|3284053_3284815_+|SDR-family-NAD(P)-dependent-oxidoreductase
MSGSNRRVIVLGALSAMAEATCRMLAEEGAQLALLGRDAERLDTVARDLKTRGAAGVHVFARDLLDTSDTPSALQAAADSMGGANAVLIFYGVLGDQNRAETDLEEARRIIAVNFTSAAAWSLASADLLERSGGDGAVLVGVSSVAGDRGRRSNYVYGAAKGGLSILLQGIAHRFAAKPGGARAVTVKAGFVDTPMTAHLKKGPLWATPQQIAQVVRRAMDRGGPILYAPWIWRWIMLAIRLIPDAVFKRVNI
>NZ_CP047045.1|WP_158767332.1|3284835_3285630_-|methyltransferase-domain-containing-protein
MSTDGRPQQLARTAHFLELYRAYLVADVDMTRSSVESMENQWYVPVGHSAAQVIYSACVGSWLSEVRTVLDMPCGHGRILRHLTKLFPDAAIHACDIDEAGLQFCASQFGAHPILSKEIPEEVAFPVQYDCIWVGSLFTHLSKTMSERWLAHLARQLSPTGILITTWHGRWSAANGAEIQYIEPDKWRAILAEYESTGYGYASYLRGHQHQQYIEGDYGISLSTPVALMEMALAVPDVRVFSFTERGWAGHQDVLVLGKPQIMA
>NZ_CP047045.1|WP_158767333.1|3285830_3287387_+|hypothetical-protein
MSKALTTKTVVWDELRALPGHRLLIGFIIVAVCLRLIFWLYTGRTWEDAIISLTPARNLWDGFGLTHHASEPRVHSFTSGLGEIVLIIGEAVRAGLTTMRVVSIFAAAFALYYAFRVGVILSFHWSAQLLVLAYLAADHLQIFFGMGGMETQLATALVLANVYYYLNSNWTKLGIVGGLAVICRPELGLWGLILGAAIVLWHRQAFVKVAVPAILIAGVWFGFAALYYGSPIPHTITVKSGATMINNDIGQIATYLSSFWSHIAPFLQFWQVGEAPVPEILLQAVVALLLLLGSAGAVHAARFQPRMLAVLALLLAFLLYRAWGNVNPYFMWYMPPFVALLFLFAAAGISWLAQKYTSAAIGIACVLALAYSAPVFLAMPLDRAQQQVIEDNVRTRVGARLNELMSADDAVVLEPAGYVGWEIRPKTMYDFPGLTSPRAFEAWKKHHHMTGLIIELNPRFVVQRPPERLEFEEREPELAARYEAVETFRAEPGFHLSNAGLLYWPIDTEFTIYRRRDE
>NZ_CP047045.1|WP_158767334.1|3287412_3289611_+|hypothetical-protein
MRYIRYGSAVLVLLFLLCGLAIALWPEPAPRALVSSEGWLTTDQIGRVDVGSLPETIRQTHRSLQSTAFRTWTPESGARRGEVVSPGFQISPVMAVTIAGSTATADGGASVSIVCDSHTQSLPVFRGNVNTHFTEAIFEVDEGWCPGEARLQLRSREPGVNVGVGTVAEVSRITLWKRSAIGLFPFLVLAFVVLGALSIVGVLVARAARLTIPAAFAGLTTIGVAGLATFLAYTLGPSWDYGSALAILLVLLLGGIAVALPRALRQAVIDLSPAGAVWLLSATAFFFLSTLAYNGLGHWEPNYRFSPAHWSSDNELPWMFAETLRHNWNTEGVLGPWSFSDRPPLMTGALLLTADLFDLLQTGNDGNWLRGPAHNTSSILINTLWAPVFFTAAKHLFKLDTRVAALATLITAIIPFFVFNSIYGWPKLFGAAFAGVAIWSAIDRRSSIPISDRAVAFGLASALSILSHASNAIFLLPLALYFLPSLLRAPKALIGGVLAGLVMLAPWIAYQHFILPSNDPLLKYALTGDFGFAEPARTTLESARAFYAQLSLESWLQTKAAMAAQLFWPASTPLSQPPIHTLFGMHGVDALRQWDFYFLSAGNALLLAAAIVSAFKGRRDDNPIGALLCVTGSSYLLILLVFFHPLILHHAPYGALIALALAGFGGLAAYAPGWLRGIGLLAGIYGGVVWGLSPLRSALSIDLIAALGLAFCLGAALCSTLSDSRSTSSVEG
>NZ_CP047045.1|WP_158767335.1|3289837_3290419_+|hypothetical-protein
MTRQFAAFVMLAALSACGQPAEQAAPGETSAPVFRTDADLVAVPLDSIIGTADGLGTQYIEQISPGGGGLRSEPGPSGPTPTIVAPTPTTFTVTIPANSQEFVVMYGMSPESYTNGGTTKGACFAVAAVEIGGPRELAQRCLTPVETSADQGFQEFAVQVPPGVTQFQLQTTPAAPSGELTWGWSFWANPRAK

You can click texts colored in the table to view more detailed information

Click the colored protein region to show detailed information

Self-targeting detection

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_ID	Protospacer_location	Mismatch	Identity

MGE targeting detection<

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_phage_ID	Hit_phage_def	Protospacer_location	Mismatch	Identity
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	NZ_CP047900	Pseudarthrobacter sp. YJ56 plasmid unnamed2, complete sequence	14627-14654	5	0.821
NZ_CP047045_1	1.1\|3277796\|29\|NZ_CP047045\|CRISPRCasFinder	3277796-3277824	29	NZ_CP043499	Rhizobium grahamii strain BG7 plasmid unnamed, complete sequence	1091779-1091807	6	0.793
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MK864266	Gordonia phage Arri, complete genome	7789-7816	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MN284907	Gordonia phage Fireball, complete genome	7721-7748	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MK864264	Gordonia phage VanDeWege, complete genome	7909-7936	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MK937603	Gordonia phage Bakery, complete genome	7490-7517	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MH479910	Gordonia phage Danyall, complete genome	7617-7644	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MT639651	Gordonia phage Portcullis, complete genome	7677-7704	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MH479917	Gordonia phage KimmyK, complete genome	7745-7772	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MH669015	Gordonia phage TillyBobJoe, complete genome	7432-7459	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MK814761	Gordonia phage SmokingBunny, complete genome	7596-7623	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	KX557286	Gordonia phage Twister6, complete genome	7528-7555	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MK864267	Gordonia phage Valary, complete genome	7981-8008	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MK967381	Gordonia phage RogerDodger, complete genome	7981-8008	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MT310872	Gordonia phage Evamon, complete genome	7598-7625	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MT521998	Gordonia phage Jambalaya, complete genome	7429-7456	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MK864265	Gordonia phage Barb, complete genome	7745-7772	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	NC_030913	Gordonia phage Wizard, complete genome	7721-7748	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MK305889	Gordonia phage Mutzi, complete genome	7617-7644	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MN010760	Gordonia phage Nubi, complete genome	7618-7645	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	MT723933	Streptomyces phage Keanu, complete genome	10862-10889	6	0.786
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	NZ_CP049749	Rhodococcus fascians A21d2 plasmid pA21d2, complete sequence	25445-25472	7	0.75
NZ_CP047045_1	1.5\|3278055\|28\|NZ_CP047045\|CRISPRCasFinder	3278055-3278082	28	NZ_CP015236	Rhodococcus fascians D188 plasmid pFiD188, complete sequence	73079-73106	7	0.75

1. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to NZ_CP047900 (Pseudarthrobacter sp. YJ56 plasmid unnamed2, complete sequence) position: , mismatch: 5, identity: 0.821

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ctacgactcgatcgacaccgactactac	Protospacer
  **.***************** *** *

2. spacer 1.1|3277796|29|NZ_CP047045|CRISPRCasFinder matches to NZ_CP043499 (Rhizobium grahamii strain BG7 plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.793

ggaaaacagaccgagatcgacaccacgag	CRISPR spacer
agaacgttgaccgagatcgacgccacgag	Protospacer
.*** .. *************.*******

3. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK864266 (Gordonia phage Arri, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

4. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MN284907 (Gordonia phage Fireball, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

5. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK864264 (Gordonia phage VanDeWege, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

6. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK937603 (Gordonia phage Bakery, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

7. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MH479910 (Gordonia phage Danyall, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

8. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MT639651 (Gordonia phage Portcullis, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

9. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MH479917 (Gordonia phage KimmyK, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

10. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MH669015 (Gordonia phage TillyBobJoe, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

11. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK814761 (Gordonia phage SmokingBunny, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

12. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to KX557286 (Gordonia phage Twister6, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

13. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK864267 (Gordonia phage Valary, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

14. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK967381 (Gordonia phage RogerDodger, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

15. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MT310872 (Gordonia phage Evamon, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

16. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MT521998 (Gordonia phage Jambalaya, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

17. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK864265 (Gordonia phage Barb, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

18. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to NC_030913 (Gordonia phage Wizard, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

19. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MK305889 (Gordonia phage Mutzi, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

20. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MN010760 (Gordonia phage Nubi, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ggagaactcgatcgacaccgacgggatc	Protospacer
*** ******************..  .*

21. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to MT723933 (Streptomyces phage Keanu, complete genome) position: , mismatch: 6, identity: 0.786

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
cgacaacacgatcgacaccgccaagtgg	Protospacer
 ****** ************ *** *

22. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to NZ_CP049749 (Rhodococcus fascians A21d2 plasmid pA21d2, complete sequence) position: , mismatch: 7, identity: 0.75

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ctcggtctcgatcgacaccgacacctcc	Protospacer
    . ***************** ****

23. spacer 1.5|3278055|28|NZ_CP047045|CRISPRCasFinder matches to NZ_CP015236 (Rhodococcus fascians D188 plasmid pFiD188, complete sequence) position: , mismatch: 7, identity: 0.75

ggacaactcgatcgacaccgacaactcc	CRISPR spacer
ctcggtctcgatcgacaccgacacctcc	Protospacer
    . ***************** ****

Prophage detection

Region

Region Position

Protein_number

Hit_taxonomy

Key_proteins

Att_site

Prophage annotation

DBSCAN-SWA_1

2067539 : 2072961

Staphylococcus_phage(33.33%)

The bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_158768066.1\|2067539_2067959_-	6,7-dimethyl-8-ribityllumazine synthase	A0A2H4PQS3	Staphylococcus_phage	1.0e-12	44.7
WP_158766163.1\|2068012_2069140_-	3,4-dihydroxy-2-butanone-4-phosphate synthase	A0A2H4PQS2	Staphylococcus_phage	2.1e-49	36.1
WP_158766164.1\|2069136_2069748_-	riboflavin synthase	A0A2I2L4R9	Orpheovirus	3.1e-18	39.2
WP_158766165.1\|2069757_2070423_-	bifunctional diaminohydroxyphosphoribosylaminopyrimidine deaminase/5-amino-6-(5-phosphoribosylamino)uracil reductase RibD	A0A1V0SE20	Indivirus	9.4e-13	31.5
WP_158766166.1\|2070496_2070961_-	transcriptional repressor NrdR	NA	NA	NA	NA
WP_158768067.1\|2071004_2072303_-	aminotransferase class I/II-fold pyridoxal phosphate-dependent enzyme	A0A240F3L3	Aeromonas_phage	1.9e-102	53.4
WP_158768068.1\|2072580_2072961_+	transcriptional regulator	A0A1P8VVG0	Erythrobacter_phage	3.1e-21	46.4

DBSCAN-SWA_2

2105361 : 2118402

uncultured_Mediterranean_phage(57.14%)

tRNA

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_158766200.1\|2105361_2107794_-	ATP-dependent helicase HrpB	A0A2H4UU36	Bodo_saltans_virus	5.8e-36	28.0
WP_158766201.1\|2107892_2108432_+	hypothetical protein	NA	NA	NA	NA
WP_158766202.1\|2108428_2109553_-\|tRNA	tRNA guanosine(34) transglycosylase Tgt	A0A1B1IVQ4	uncultured_Mediterranean_phage	4.0e-96	50.3
WP_158766203.1\|2109549_2110608_-\|tRNA	tRNA preQ1(34) S-adenosylmethionine ribosyltransferase-isomerase QueA	NA	NA	NA	NA
WP_158768071.1\|2110680_2111145_-	peptidylprolyl isomerase	A0A1B1IVS0	uncultured_Mediterranean_phage	2.6e-33	50.3
WP_158766204.1\|2111114_2112005_-	peptidylprolyl isomerase	A0A1B1IVS0	uncultured_Mediterranean_phage	1.5e-18	33.0
WP_158766205.1\|2112004_2112508_-	pantetheine-phosphate adenylyltransferase	A0A1B1IVQ3	uncultured_Mediterranean_phage	1.2e-33	47.2
WP_158766206.1\|2112504_2115327_-	DNA gyrase subunit A	G3M9Z5	Bacillus_virus	4.6e-85	33.4
WP_158766207.1\|2115374_2115959_+	superoxide dismutase family protein	NA	NA	NA	NA
WP_158766208.1\|2115942_2116836_-	EamA family transporter	NA	NA	NA	NA
WP_158766209.1\|2116882_2117872_-	MsnO8 family LLM class oxidoreductase	NA	NA	NA	NA
WP_158766210.1\|2117922_2118402_-	single-stranded DNA-binding protein	A0A0K1LLZ9	Caulobacter_phage	5.0e-48	63.0

DBSCAN-SWA_3

2354477 : 2366947

Chrysochromulina_ericina_virus(14.29%)

protease,head,capsid,portal

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_158766430.1\|2354477_2355389_-	DnaJ domain-containing protein	A0A0N9QPY2	Chrysochromulina_ericina_virus	3.8e-12	23.8
WP_158766431.1\|2355480_2358765_-	hypothetical protein	A0A0K1Y6G6	Rhodobacter_phage	3.3e-191	40.1
WP_158766432.1\|2358824_2359592_-\|capsid	phage major capsid protein	A0A141GEW2	Brucella_phage	1.0e-31	36.3
WP_158768085.1\|2359482_2360013_-\|head,protease	HK97 family phage prohead protease	Q6JIM8	Burkholderia_virus	1.8e-14	38.5
WP_158766433.1\|2360229_2361381_-\|portal	phage portal protein	A0A0K1LLE7	Bacillus_phage	4.5e-63	37.8
WP_158766434.1\|2361395_2362709_-	ATP-binding protein	A0A1V0DY66	Dinoroseobacter_phage	3.9e-71	40.8
WP_158766435.1\|2362590_2362887_-	hypothetical protein	NA	NA	NA	NA
WP_158766436.1\|2363098_2365111_+	PBP1A family penicillin-binding protein	NA	NA	NA	NA
WP_158766437.1\|2365120_2366947_+	ATP-binding cassette domain-containing protein	A0A2K9L0W2	Tupanvirus	7.0e-26	25.6

Anti-CRISPR protein detection

Acr ID	Acr position	Acr size	Homology with known anti	Neighbor HTH/AcRanker	Neighbor Aca	In prophage	Protospacer in prophage